-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy path10_arg.Rmd
1122 lines (945 loc) · 87.6 KB
/
10_arg.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
# Introduction: Policy Ideology and Congressional Primaries {#ch:arg}
```{r stopper, eval = FALSE, cache = FALSE, include = FALSE}
knitr::knit_exit()
```
```{r knitr-01-arg, include = FALSE, cache = FALSE}
source(here::here("assets-bookdown", "knitr-helpers.R"))
```
```{r r-01-arg, cache = FALSE}
library("knitr")
library("here")
library("magrittr")
library("tidyverse")
library("scales")
# library("latex2exp")
library("patchwork")
# library("ggdag")
```
Elections are the foremost venue for citizens to influence government actors and public policy.
Classic theories of voting suggest that citizens weigh the policy positions of alternative candidates and vote for the candidate whose platform most closely aligns with their own preferences [@downs:1957:economic-theory].
Political parties simplify the voter's calculations by providing a powerful heuristic in the form of the party label, enabling voters to infer candidates' values and issue positions without expending the effort to thoroughly appraise each campaign [@campbell-et-al:1960:american-voter; @green-et-al:2002:partisan-hearts; @rahn:1993:stereotypes].
The rise of partisan polarization, however, has complicated the role of parties in U.S.\ politics.
Although citizens, journalists, pundits, and even elected leaders frequently bemoan the bitter rhetoric and policy gridlock that has accompanied the widening partisan divide, political scientists have noted several positive consequences to polarization.
Compared to the parties of the early- and mid-1900s that political scientists believed were too similar to provide voters with meaningful choices [@apsa:1950:party-system], the Democratic and Republican Parties of recent decades have taken divergent and oppositional stances across a greater number of policy issues.
As a result, voters can more easily differentiate the policy platforms of the two parties in order to vote consistently with their political values.
Voters in turn are more thoroughly sorted into partisan groups that represent distinct ideological viewpoints in American politics, hold beliefs across multiple issues that are more ideologically consistent, think more abstractly about the ideological underpinnings of issue stances, and participate more in politics than they did in the past [@abramowitz:1998:realignment; @layman-carsey:2002:conflict-extension; @fiorina:2005:culture; @levendusky:2009:partisan].
Even as polarization has strengthened many aspects of political representation between the two parties, it may have troubling effects on representation within the parties.
The typical voter is a partisan who intends to cast their ballot for their preferred party, whoever that candidate may be [@bartels:2000:partisanship; @petrocik:2009:leaners]. As party-line voting increases, voters are more thoroughly captured by their loyalties.
A partisan voter's choices are locked in long before the general election.
A candidate from the voter's preferred party has already been selected through a nomination process, and the voter may be more likely to abstain from voting when faced with an undesirable candidate than they are to vote for the other party [@hall-thompson:2018:who-punishes].
Recent research supports this notion of capture amid polarization---when voters must choose between polarized candidates, they become less responsive to candidates' actual platforms and instead are more influenced by motivated reasoning and partisan teamsmanship [@Rogowski:2016:polarized-choices].
Voters relax their substantive scrutiny of candidates to cast low-cost votes for their own party, weakening the influence of _policy_ as a separate consideration from partisanship.
<!------- TO DO ---------
- More forgiving of bad traits?
(we don't care about traits, we care about policy)
------------------------->
This presents an important problem for the study of electoral representation.
Elections are intended to be a voter's choice over which political values will be expressed in government, but if the choice of candidates does not present the average partisan voter with realistic alternatives, how should we think about the "representation" of these voters' actual policy preferences?
If general elections provide an ever-coarsening choice over policy priorities, does the U.S.\ electoral system incorporate voters policy preferences in other ways?
When the choice before voters in the general election does not present realistic alternatives, political scientists naturally shift their focus to the nomination of partisan candidates.
A classic example is Key's -@Key:1949:southern-politics study of Democratic Party dominance in the American South.
Although scholars are right to examine within-party competition, focusing on contexts of single-party dominance is a serious limitation.
Even in races between viable candidates from both major parties, within-party competition plays a crucial role simply due to the fact that partisan voters almost certainly cast a vote for their own party.
The votes of rank-and-file partisan constituents are all guaranteed.
If they are to express their policy preferences through the act of voting, their voices may register as relatively weak because they present little electoral risk to their party in the general election.
The nomination stage---the primary election in particular---remains an important venue for the representation of partisans' policy views, whether the general election is closely contested or not.
## Policy Preferences and the Strategic Positioning Dilemma
This dissertation is chiefly concerned with the policy preferences of partisan voters and their role in electoral representation through congressional primary elections.
The study of American electoral politics has not ignored the representational function of primary elections [@geer:1988:primary-electorates; @norrander:1989:primary-voters; @cohen-et-al:2009:party-decides; @aldrich:2011:why-parties; @sides-et-al:2018:primary-representativeness], but as I discuss below, the quantifiable impact of primary voters' policy preferences in government is a startlingly open question.
<!------- TO DO ---------
- punch up or just preview?
------------------------->
Several existing studies have examined other aspects of representation through House primaries, such as the introduction of the direct primary [@ansolabehere-et-al:2010:direct-primary], how candidates position themselves in response to the presence or threat of primary challenges
[@burden:2004:candidate-positioning; @brady-han-pope:2007:out-of-step; @hirano-et-al:2010:primary-polarization], and how primary nomination rules affect elite polarization [@hirano-et-al:2010:primary-polarization; @mcghee-et-al:2014:nomination-systems; @rogowski-langella:2015:primary-systems].
Though these studies address interesting aspects of electoral representation and party competition, they cannot speak directly to the influence of voter's policy preferences on (1) the positioning of House primary candidates and (2) the outcomes of House primary elections.
The absence of voter preferences from the empirical study of primaries is troubling because they play a crucial role in the dominant theory that relates representation to primary politics.
Although the Downsian model of candidate positioning explains the incentives for candidates to stake out moderate policy positions to cater to the ideological "median voter" [@downs:1957:economic-theory], candidates behave differently in the real world.
Instead, candidates engage in highly partisan behavior and take divergent issue stances even on salient local issues and in closely competitive districts [@ansolabehere-et-al:2001:candidate-positioning; @fowler-hall:2016:convergence].
But why?
Scholars and political observers have argued that because competing in the general election requires each candidate to clinch their party's nomination contest, these candidates face a combination of convergence-promoting and divergence-promoting incentives.
Primary elections tend to be dominated by partisan voters who are more attentive to politics, hold non-centrist issue preferences, and "weight" candidates' issue positions more heavily than the average voter in the general election.^[
Primary elections are not *entirely* partisan affairs.
States vary in the degree to which their primaries are "closed" to partisan voters only.
Recent research finds that these regulations do little to affect the policy preferences of House primary candidates [@rogowski-langella:2015:primary-systems], state legislators [@mcghee-et-al:2014:nomination-systems], or validated primary voters [@hill:2015:nominating-institution].
]
As a result, a candidate's risk of being defeated in the primary for being too moderate can easily outweigh their risk of losing the general election for being too partisan.
The conflicting incentives imposed by partisan constituency and the general election constituency create what @brady-han-pope:2007:out-of-step call a "strategic-positioning dilemma" that leads candidates to take divergent issue stances rather than targeting a district median voter [see also @aldrich:1983:downsian-parties; @burden:2001:polarizing-primaries].
<!------- TO DO ---------
- considered cites in comment
------------------------->
<!-- \citep{aldrich1983downsian,burden2004candidate,jacobson2015politics,hill2015institution}. -->
The strategic positioning dilemma (SPD) is the central theoretical focus of this project, and tests of the SPD are key empirical contributions in the following chapters.
The sections that follow introduce key terms for understanding my critique of the existing research and my contribution to it in this project.
### Key concept: policy ideology
If we had an ideal test of the SPD's implications, the policy preferences of partisan primary voters would be an essential ingredient.
Primary voters are one of the key constituencies that a candidate must please in the SPD view of primary elections.
When partisan voters in a district are more conservative, the SPD claims that the candidate experiences a pressure to stake out a more conservative campaign position.
This section briefly discusses this project's terminology around voter ideology, the groups in the electorate for whom these concepts are at play, and how relate to other political science research.
When this project discusses voter "preferences" or voter "ideology," it specifically refers to a notion of _policy ideology_.
An individual's policy ideology is a summary of their policy views in a left–right ideological space.
Policy views are naturally complex and multidimensional, and it is possible for individuals to hold beliefs across policy areas that would strike many political scientists as being "ideologically inconsistent" [e.g. @campbell-et-al:1960:american-voter].
Policy ideology distills this complexity into average tendencies; voters who hold a greater number of progressive preferences about policy are more ideologically progressive, and vice versa for voters with more conservative policy preferences.
Voters who hold a mixture of progressive and conservative beliefs are ideologically moderate or "mixed" [@broockman:2016:policy-representation].
<!------- TO DO ---------
- not lofty,
- not symbolic
------------------------->
Policy ideology is different from policy _mood_.
Policy mood captures a voter's preference for the government to be more or less progressive compared to a shifting status-quo, while ideology is meant to be directly comparable using only issue information [@stimson:1991:moods; @enns-koch:2013:state-mood; @mcgann:2014:irt-mood].
Policy ideology is similar to other concepts that represent latent ideological preferences.
These latent constructs include ideal points for members of Congress, Supreme Court justices, and even individual citizens [@poole-rosenthal:1997:roll-call-history;@martin-quinn:2002:ideal;@clinton-jackman-rivers:2004:ideal;@treier-hillygus:2009:ideology; @tausanovitch-warshaw:2013:constituent-prefs].
Other researchers have called this concept "policy liberalism" [@caughey-warshaw:2015:DGIRT], which orients the concept so that "greater" values represent "more liberalism."
For this project, I prefer to orient the construct as policy _conservatism_, which orients a scale so that greater conservatism corresponds to "rightward" movements on a number line.
I try to be conscious of the difference between _consistent_ issue beliefs and _extreme_ issue beliefs throughout this project.
Consistently conservative issue beliefs do not necessarily imply that an actor is "extremely" conservative [@fiorina-et-al:2005:culture-war], and an actor may appear "moderate" even if they hold a mixture of non-moderate progressive and conservative issue beliefs [@broockman:2016:policy-representation].
This project views policy ideology in a measurement modeling context, which I return to in Chapter \@ref(ch:model).
Policy ideology affects voters' issue beliefs, and while issue beliefs can be measured using a survey, policy ideology itself is not observable.
Instead, policy ideology exists in a latent space, and survey items on specific issues reveal only limited information about voters' locations in the latent space.
This is distinct from an approach where researchers measure ideology by adding or averaging responses to policy items, an approach that implicitly assumes that all survey items about all issues are equally informative about ideology.
Modern measurement approaches relax this assumption, instead viewing survey items as sources of correlated measurement error across respondents.
This approach results in more rigorous approaches for estimating latent ideological signals from noisy survey data [@ansolabehere-et-al:2008:voter-ideology].
Following this modeling tradition, I refer to an individual's location in ideological space as their "ideal point," the point at which the utility of a policy proposal is maximized with respect to the individual's ideological preferences.
### Key concept: district-party groups
I argue that another key construct at work in the SPD is the notion of _groups_ in the electorate.
For a given district, the general election is a contest among all voters, so we consider this constituency as a group.
We sometimes refer to this group as the "general election constituency," since it contains anybody who is eligible to vote in the general election.
It does not specifically refer to voters only, but contains any citizen who could potentially be a voter.
This ambiguity of who among the general election constituency actually votes is important to understanding a candidate's incentives during the campaign, since the candidate is uncertain whether certain campaign tactics will galvanize some constituents while alienating others.
Another important grouping for this project is the partisan constituency within a district.
Each congressional district contains constituents who are aligned with the Democratic Party or the Republican Party.
I call these two groups of constituents _district-party groups_.
All $435$ congressional districts contain voters from the two major parties, totaling `r 435 * 2` district-party groups.
For brevity, I sometimes refer to district-party groups as "party groups" or "partisan groups."
A district-party group contains any voting-eligible citizen who resides in a given district and identifies with a given party.
As with the general election constituency, membership in a party group is no guarantee that the constituent votes in either the primary or the general election.
The important fact is that they belong to only one party's base of supporters.
As I discuss below, decomposing a district's voters into separate party groups is the key empirical innovation in this project.
Research on primary representation routinely theorizes about the role of district-party groups, but the empirics in these studies typically do not operationalize party groups as distinct from one another, even though the operational distinction is crucial for testing the SPD theory.
One important point about district-party groups is that they are made of constituents, not organizations.
For this reason, it is sometimes helpful to refer to district-party groups as district party "publics," which emphasizes that the groups are composed of ordinary citizens [@caughey-warshaw:2018:dynamic-responsiveness].
There is no formal registration requirement to be a member of a party group, only a partisan identification.
This construction of district-party publics aligns most closely with Key's "party in the electorate" rather than "party as organization" [@key:1955:politics-parties-groups].
This distinguishes party publics from interest groups, policy groups, "intense policy demanders," or the "extended party network," which are concepts that describe organizations or maneuvers by political elites rather than rank-and-file constituents [@cohen-et-al:2009:party-decides; @koger-et-al:2009-partisan-webs; @masket:2009:no-middle-ground].
Although recent research has underscored the importance of elite actors in shaping party nominations, this project focuses specifically on testing the SPD, which is a voter-centric view of primary representation.
I bring in important concepts from elite-driven stories of primaries as they apply to particular claims being tested in later chapters.
<!------- TO DO ---------
- dan schlozman
- Klar networks?
- ruth bloch rubin?
------------------------->
### Key concept: district-party ideology
It is important to define both "policy ideology" and "district-party publics" because they combine to form a key concept that anchors the substantive contributions of this project.
This concept is _district-party ideology_: policy ideology aggregated to the level of the district-party group.
Just as any individual might have an ideological ideal point, and any individual might affiliate with a party, district-party ideology averages the ideological variation within a district-party group into one group-level ideal point.
This concept is similar to the median Republican or median Democrat within a congressional district.^[
My method for measuring district-party ideology in Chapter \@ref(ch:model) assumes a Normal distribution of individual ideal points within a district-party group, so the mean and median are equivalent.
]
By aggregating policy ideology within groups in this way, this project measures how policy ideology differs between Democrats and Republicans in the same district, and it shows how party ideologies vary across congressional districts.
This enables us to consider how candidates are responsive to partisan sub-constituencies that together compose the general election constituency [see also @clinton:2006:constituent-representation].
### Key concept: candidate campaign positioning
As with individual voters, we can imagine that candidates for Congress have locations in ideological space.
These locations are captured by campaign promises made or issue positions taken during the campaign.
The study of United States politics most commonly places elite political actors in ideological space using the voting records of members of Congress, Supreme Court justices, federal judges, and state legislators [@poole-rosenthal:1997:roll-call-history; @clinton-jackman-rivers:2004:ideal; @martin-quinn:2002:ideal; @epstein-et-al:2007:judicial-common-space; @shor-mccarty:2011:map-state-leg].
Researchers have extended the modeling intuitions to estimate ideal points from unconventional sources of data, including surveys of congressional candidates, campaign finance transactions, interest group ratings, text from political advertisements, and even Twitter activity [@ansolabehere-et-al:2001:candidate-positioning; @burden:2004:candidate-positioning; @burden-et-al:2000:senator-ideals; @bonica:2013:ideology-interests; @henderson:2016:text-ideology; @barbera:2015:twitter-scores].
<!------- TO DO ---------
- henderson tbd
------------------------->
This project is interested in the ideological locations of candidates for office as measured through their campaigns.
The positioning of campaigns is more directly related to the strategic positioning dilemma than any other concepts that we might scale in ideological space: candidates compete against one another by positioning themselves to appeal to a partisan base of voters, and partisan constituents consult use these campaign positions to nominate the candidate of their liking.
To be sure, campaign positions are influenced by other activities that researchers have used to scale candidates for office.
Incumbent legislators cast votes to form a defensible record in office, for instance, which both bolsters and constrains their campaign messages [@mayhew:1974:congress; @canes-wrone-et-al:2002:out-of-step].
Not every primary candidate has a roll-call voting record to compare, however, so this project requires an ideal point measure that places incumbents, challengers of incumbents, and candidates running for open seats in a comparable ideological space.
This project measures primary candidates' campaign positioning using CF scores from Bonica's -@bonica:2019:dime _Database on Ideology, Money in Politics, and Elections_ (DIME).
CF scores use campaign contributions to scale the political ideologies of contributors and recipients of campaign contributions.
Because a wide variety of political actors contribute or receive campaign funds, the DIME contains CF score estimates for political candidates, party organizations, PACs, and individual donors.
Unlike interest group ratings, another source of ideology scores for non-incumbent political candidates, CF scores are not constructed with a political agenda that implicitly "weights" issues according to the interest group's priorities [@fowler:1982:interest-group-scores; @snyder:1992:interest-group-ratings].
CF scores assume that a donor makes financial contributions to political actors to maximize their utility over all potential contribution amounts to all potential recipients, where the donor utility decreases as the ideological distance between the donor and the recipient increases.
The estimated ideal points from this method are called CF scores [@bonica:2013:ideology-interests; @bonica:2014:mapping-ideology].
CF scores have been used in other studies of primary candidate ideology by @thomsen:2014:moderate-candidates, @thomsen:2020:ideology-gender, @rogowski-langella:2015:primary-systems, @ahler-citrin-lenz:2016:CA-open-primaries, and @porter-treul:2020:primary-experience, and similar donation-based ideal point measures by @hall-snyder:2015:ideology have been used by @hall:2015:extremists and @hall-thompson:2018:who-punishes.
As I discuss in future chapters, CF score are not without controversy as indicators of elite ideology, especially when comparing members of the same party [@tausanovitch-warshaw:2017:polarized-congress; @hill-huber:2017:donorate], but other research shows that donors differentiate between moderate and ideological candidates when making financial contributions [@barber-et-al:2016:ideological-donors].
Validation studies of CF scores show that the ideological component of CF scores outperform a party-only model of giving [@bonica:2014:mapping-ideology], and CF scores predict future votes by members of Congress to a similar degree of accuracy as roll-call based scores do [@bonica:2019:cf-validity].
### The strategic positioning dilemma, implications, and research questions
Now that we have defined some key terms, I discuss how they relate to previous research on the strategic positioning dilemma.
The theory states that candidates balance two competing constituencies during their campaign for office.
Candidates face incentives to cater to the median voter in the general election, but they do not advance to the general election without first catering to partisan voters in the primary election.
As a result, their campaign position is tailored to split the difference between the two constituencies, perhaps leaning more to the partisan base in safe districts and to the median voter in competitive districts.
This section unpacks this intuition in detail and argues that existing research does not test the key claims.
<!------- TO DO ---------
- cite this: MPR, Jackman(?), LJP,
------------------------->
First, how does district-party ideology affect the way candidates position themselves in a campaign?
The logic of the SPD suggests that, at minimum, district-party conservatism should be positively correlated to the conservatism of a candidate's campaign position.
At maximum, district-party conservatism has a positive causal effect on the conservatism of a candidate's campaign position.
This prediction implies that candidates can perceive the conservatism of their partisan constituents.
Even if these perceptions are not unbiased [@broockman-skovron:2018:bias], the SPD implies that candidates can recognize when some constituencies are at least _relatively_ more conservative than others.
Second, if candidates anticipate partisan voters' policy views and position themselves accordingly, this suggests that candidates believe partisan voters are capable of voting in accordance with their policy views.
If this is true, we should expect that more conservative candidates are more likely to win primary nominations when the district-party is more conservative.
These two predictions are the core empirical implications of the "strategic positioning dilemma" theory of representation in primaries.
Crucially, testing each prediction requires a researcher to observe the policy ideologies of the partisan constituency within a district, separate from the location of the median voter or general election constituency.
This project argues that district-party policy preferences are either absent from existing research or thoroughly misconstrued—an important theoretical and methodological point that I unpack in Section \@ref(sec:ideal-votes).
As a result, U.S. elections research has been unable to empirically evaluate a widely held theory of representation in primaries.
<!------- TO DO ---------
- move this below the evidence of the above claim?
------------------------->
Stated differently, this dissertation asks if primaries "work" the way the SPD claims they do.
It is widely believed that primaries are effective means for voters to inject their sincere preferences into the selection of candidates and, in turn, the priorities of elected officials.
Is this *actually* true?
The two empirical research questions underlying this project are:
1. Do candidates position themselves to win the favor of primary voters?
2. Do primary voters select the candidate who best represents their issue beliefs?
<!------- TO DO ---------
borrow framing from: Lee, Moretti, and Butler (2004 QJPS) article, "Do voter affect or elect policies?"
- And further, do institutional factors that purportedly alter the composition of primary electorates, such as primary "openness" rules, affect how candidates are positioned or nominated for office?
------------------------->
## Does the Strategic Positioning Dilemma Describe Primary Representation?
### Theoretical concerns
The strategic positioning dilemma view of U.S. primaries has reasonable intuitions, but there are reasons to doubt some of its theoretical premises.
First, the SPD is put forth as a theory to explain divergent candidate platforms across parties, but there are numerous theories that explain candidate divergence that do not rely on bottom-up pressures from primary voters.
And second, the SPD requires voters and candidates to be highly sophisticated actors.
Candidates must perceive the relative extremity of their constituents, and voters must learn about candidate platforms, differentiate between candidates, and act on sincerely-held preferences over candidate platforms.
The notion of the SPD emerges from a clash between idealized candidate positioning in formal models and the candidate positioning we observe in the real world.
Classic formal models highlight a strategic logic for candidates to position themselves by "converging" to the location of the median voter: if constituents vote primarily with policy-based or ideological considerations, then candidates maximize the probability of electoral victory by positioning themselves as closely to the median constituent as possible [@downs:1957:economic-theory; @black:1948:group-decisions].^[
Some empirical studies of candidate positioning [e.g. @ansolabehere-et-al:2001:candidate-positioning; @brady-han-pope:2007:out-of-step] claim that these formal models "predict" candidate convergence at the median voter.
In my opinion, this misrepresents the formal work.
@downs:1957:economic-theory in particular explains the logic of candidate convergence, but he also explores many circumstances that would prevent the convergent equilibrium from appearing in the real world.
This is important to clarify because, although it is common to describe candidate convergence as a "Downsian result" or a "Downsian prediction," we should recognize that the convergent equilibrium is an oversimplification.
Understanding the theoretical incentives that promote candidate moderation is more important than the whether we observe perfect candidate convergence empirically.
]
Empirical work finds evidence in partial support of both convergent and divergent candidate incentives.
Candidates who run in electorally competitive districts are more moderate than co-partisans who run in electorally "safe" districts [@ansolabehere-et-al:2001:candidate-positioning; @burden:2004:candidate-positioning], and even candidates who run in safe districts are marginally rewarded for taking more moderate issue positions than a typical party member would [@canes-wrone-et-al:2002:out-of-step].
Extremist candidates, meanwhile, earn fewer votes and are less likely to win in Congressional elections, and this tendency is stronger in competitive districts than in safe districts [@hall:2015:extremists].
Despite these incentives to take moderate campaign positions, candidates nonetheless take divergent rather than convergent stances in general.
Republican and Democratic members of Congress vote very differently from one another, and this partisan divergence has increased in recent years [@poole-rosenthal:1997:roll-call-history; @McCarty-Poole-Rosenthal:2006:book].
Republicans and Democrats who represent similar districts (or the same state, in the case of U.S. Senators) vote differently from one another, so the difference in legislative voting behavior is not simply a function of district characteristics [@mccarty-poole-rosenthal:2009:gerrymandering; @brunell:2006:rethinking-redistricting; @brunell-grofman-merrill:2016:polarization-components].
And although qualitative evidence from decades past suggests that candidates take careful positions on issues of local concern [@fenno:1978:home-style], more recent systematic tests find mixed evidence of localized, particularistic position-taking. [@canes-wrone-et-al:2011:issue-accountability; @fowler-hall:2016:convergence].
In total, even though there is some evidence that candidates benefit by positioning themselves as marginally more moderate or more in line with local public opinion, the dominant finding is that candidates take divergent positions that are more closely aligned with a national party platform than with a set of local issue priorities.
The Downsian logic is a strong "centripetal" force that promotes moderation among candidates, but what "centrifugal" forces explain the non-moderate stances [@cox:1990:centripetal-centrifugal]?
Political scientists have explored several theories whose underlying mechanisms are distinct from the SPD mechanism of competing local constituencies.
Parties are interested in cultivating long-term reputations for pursuing certain policy priorities [@downs:1957:economic-theory; @stokes:1963:spatial].
It benefits both major parties for these reputations to be distinct from one another because parties have office-seeking motivations to mutually divide districts into geographic bases that consistently support one party over time [@snyder:1994:platform-differentiation].
Party leaders maintain these reputations by setting brand-consistent legislative agendas and pressuring legislators to support reputation-boosting legislation [@cox-mccubbins:2005:agenda-cartel; @butler-powell:2014:valence-brand; @lebo-et-al:2007:strategic-party-government].
In turn, non-median party platforms are more appealing to constituents with ideologically consistent issue beliefs.
Candidates benefit by rewarding these constituents in particular because they are more likely to influence election outcomes in favor of the candidate [@hirano-ting:2015:direct-indirect-representation].
These voters are more likely to turn out in general elections than moderate voters are, so it is more efficient for candidates to cater to these constituents especially in districts with greater ideological heterogeneity [@ensley:2012:heterogeneity].
Partisan constituents are also more likely to engage in pro-party activism, such as staffing campaigns, contributing financially to campaigns, and attending party conventions [@aldrich:1983:downsian-parties; @barber:2016:contributions-polarization; @laraja-schaffner:2015:campaign-finance-polarization; @mcclosky-et-al:1960:conventions; @layman-et-al:2010:activists-conflict-extension].
These incentives for candidates to diverge from median positions are possible without considering primary elections whatsoever.
Even if we introduce primary elections into the theoretical story, many plausible explanations for divergence do not rely on outward pressures from partisan primary voters either.
Many scholars of political parties maintain that parties retained their gatekeeping roles over party nominations even as the direct primary ostensibly removed their formal powers over candidate selection.
<!------- TO DO ---------
- hirano/snyder lit?
------------------------->
Although primary campaigns take place, these scholars argue that an informal network of party actors wields substantial influence behind the scenes, controlling which candidates obtain access to the party's resources, donor lists, and partisan campaign labor [@cohen-et-al:2009:party-decides; @masket:2009:no-middle-ground].
Through these mechanisms, candidates can live or die by the nomination process long before primary _voters_ ever enter the picture.
<!------- TO DO ---------
Primaries without voters:
- parties gate-keep the ballot (direct primary?)
- must please issue groups in private, campaign on valence in public
- party activists (campaign labor)
------------------------->
One reason to doubt the SPD on theoretical grounds is that it has high demands of voter sophistication in primary elections.
It is well understood that learning about the characteristics and issue positions of political candidates is costly for voters, particularly in non-presidential elections.
<!------- TO DO ---------
- cite information costs
- cite comparison to non-presidential elections
------------------------->
Party labels on the ballot are valuable heuristics for voters to differentiate between the issue positions of Republican and Democratic candidates [@hill:2015:nominating-institution].
Primary elections, however, occur most of the time between candidates in the same party,^[
There are a few exceptions to this institutional configuration of intra-party nominations.
Some states hold blanket primaries, top-two primaries, or "jungle" primaries, where candidates from all parties compete on one ballot to be included in a runoff general election.
]
which denies voters' the informational shortcut of a candidate's party affiliation [@norrander:1989:primary-voters].
<!------- TO DO ---------
- cite party label?
Kenney and Rice 1992;
Monardi 1994; Wattier 1983
- cost of learning?
------------------------->
Primary elections often occur during months when voters are paying less attention to politics, and the press cover primary campaigns less closely than general election campaigns.
<!------- TO DO ---------
- primary vs. general coverage
------------------------->
Primary voters have a reputation for being more attentive and sophisticated consumers of political information than the typical general election voter, but in these lower-information environments, they may cast their ballots for non-policy reasons by prioritizing "Washington outsiders" or identity-based candidate features such as gender or race [@porter-treul:2020:primary-experience; @thomsen:2020:ideology-gender].
They may also vote for the familiar candidate instead of the ideologically proximate one, in which case asymmetric campaign expenditures or news coverage may advantage one candidate over the other.
For example, @bonica:2020:lawyers-in-congress attributes lawyers' numerical prominence in Congress to their ability to raise early money from their wealthy social networks.
<!------- TO DO ---------
- name recognition
------------------------->
Furthermore, despite the disproportionate news coverage received by primary candidates who challenge incumbents on ideological grounds, the absolute number of explicitly ideological primary challenges in a given election cycle is low [@boatright:2013:getting-primaried], so primary voters are unlikely to experience a deluge of policy-focused campaign messages even if they are attentive enough to receive these messages and sophisticated enough to process them.
In short, the claim that voters' policy preferences affect their choices in primary campaigns sounds straightforward, but the information environment of primary campaigns makes it difficult for constituents to vote foremost with their policy ideologies.
The SPD also requires candidates to perceive the policy ideologies of their partisan constituencies accurately in order to position their campaigns optimally.
@broockman-skovron:2018:bias lend contradictory evidence to this notion by measuring the degree to which politicians "misperceive" their constituency's policy views.
The authors find that elected politicians believe that their constituents are much more conservative on many issues than they actually are, which could affect how accurately candidates position themselves in relation to constituent views.
This is especially problematic if voters' policy positions are actually more progressive than their "symbolic" conservatism would suggest [@ellis-stimson:2012:symbolic-ideology].
Concerns about biased misperceptions of voters' policy views may be ameliorated if candidates successfully perceive _relative_ variation in voter ideology.
### Empirical ambiguity
<!------- TO DO ---------
- this should be structured around EXPLICIT TESTS
- review the theoretical proposition, introduce a few areas people have looked.
- they don't find clear evidence?
- [x] presence of primary challengers
- [x] threat of primary challengers
- (???) voters vs. organizations (The parties are different maybe?)
- primary rules
- extremity/heterogeneity of the constituency...
- redistricting?
- they use ONE measure for partisan preferences (which would be bad even if we had to infer one midpoint)
------------------------->
Empirical support for the strategic positioning dilemma is as unclear as the theoretical underpinning.
When researchers conduct empirical tests of the SPD or the narrower premises of primary representation and competition on which it rests, the results are ambiguous and often contradictory of the SPD story.
This section reviews existing research in this area to highlight the outstanding questions and preview the substantive innovations in this project.
Much of the interest in primary elections and representation comes from a focus on candidate divergence and partisan polarization.
Why do candidates who stand for general election take divergent stances from one another, and do the competitive dynamics of primary elections increase this divergence?
Prominent studies of candidate positioning in general elections initially found conflicting evidence about the influence of stiff primary competition on candidate extremity.
Using survey data from congressional candidates during the 2000 campaign, @burden:2004:candidate-positioning finds that general election candidates take more extreme policy positions in their campaigns if they also faced stronger primary competition.
This makes sense especially if primary voters care more about the candidate's ideological positioning than general election voters do.
@ansolabehere-et-al:2001:candidate-positioning find the reverse pattern using 1996 survey data.
The gap between major party candidates was actually smaller when one of the candidates faced stiffer primary competition.
This counter-intuitive finding makes sense if the presence of a primary challenger is itself a consequence of candidate positioning.
If an incumbent maintains a partisan reputation, this may fend off credible primary challengers who have less room to wage an ideological campaign against the incumbent.
Moderate incumbents, in turn, invite primary challenges from more extreme candidates.
The _threat_ of a primary challenge therefore exerts an outward pull on candidate's ideological positioning, even if a primary challenger never actually enters the race [@hacker-pierson:2005:off-center].
@hirano-et-al:2010:primary-polarization study this threat-based hypothesis by measuring potential primary threat as the average presence of primary competitors in down-ballot races.
In district with higher levels of latent primary threat, we might expect the incumbent to take more extreme stances in Congress.
Although the idea that incumbents vote as party faithfuls to preempt opportunistic challengers is intuitive and supported by other research [e.g. @mann:1978:unsafe], this measure was not meaningfully related to the extremity of an incumbent's voting record in Congress [@hirano-et-al:2010:primary-polarization].
In short, the evidence of the polarizing effects of primary challenges is mixed and unclear.
Researchers interested in the polarizing effects of primaries on candidates and legislators has also examined primary "rules."
Political parties are private organizations, and nominees are intended to represent the parties' priorities and governing values, but participation in primary elections is not always restricted to party members only.
Primary "openness" rules that govern who can vote in a partisan primary are managed by state election law, with some allowances for parties to set rules within those limits.
States with "closed" primaries restrict participation in primaries only to individuals who are registered as Republicans or Democrats.
States that allow third-party or non-partisan voters to participate in partisan primaries are "partially" open, and states where any voter can participate in any primary are regarded as "open" primaries.
I discuss finer details of primary rules in later chapters.
Researchers seeking to exploit state-level variation in primary rules hypothesize that restrictive participation criteria produce primary electorates that are more extreme, leading to the nomination of more extreme nominees [@jacobson:2012:polarization-origins].
However, recent studies finds little evidence supporting the hypothesis that primary rules affect polarization in Congress or candidate positioning more broadly.
There is little consensus in public opinion research that partisans who participate in primaries are much different from partisans who do not participate in primaries, either demographically or ideologically [@geer:1988:primary-electorates; @norrander:1989:primary-voters; @jacobson:2012:polarization-origins; @hill:2015:nominating-institution; @sides-et-al:2018:primary-representativeness], though these studies cover many years, and the dynamics of primary voting might have changed.
And even recent studies that find that primary voters hold more ideologically consistent views find no evidence that closed primaries select more extreme candidates [@hill:2015:nominating-institution].
This finding appears holds for the House, Senate, and state legislatures through the past several decades [@hirano-et-al:2010:primary-polarization; @mcghee-et-al:2014:nomination-systems; @rogowski-langella:2015:primary-systems].
Even reforms that drastically change primary rules, such as California's recent shift to a blanket primary where candidates from all parties compete for the same limited number of positions on the general election ballot, do not nominate legislators whose voting records are much more moderate than before [@bullock-clinton:2011:CA-blanket-primary; @ahler-citrin-lenz:2016:CA-open-primaries].
These studies are incomplete in important ways that bear on the key substantive questions underlying this project.
Most of these studies evaluate primaries' effects on representation by examining roll-call votes, which are observable only for incumbents, so they provide no insights about the positioning effects for non-incumbent candidates.
Some notable studies examine non-incumbent candidates for general election using candidate surveys [@ansolabehere-et-al:2001:candidate-positioning; @burden:2004:candidate-positioning], but these studies are also limited because they do not observe the positions of candidates who lose the primary nomination.
Without observing primary losers, we have no way of knowing if the general election candidate was moderate or ideological _relative to other primary candidates_.
It is much rarer for a study to measure primary candidate positioning as the key outcome variable using a method that covers incumbents, challengers, and open-seat candidates [e.g. @rogowski-langella:2015:primary-systems].
### Vote shares do not identify policy ideology {#sec:ideal-votes}
Another important drawback of the existing research on primaries and ideological representation is the way these studies handle voters' policy preferences.
The strategic positioning dilemma pits two constituencies in a district against one another: the nominating constituency (district-party public) and the general election constituency.
The former is theorized to prefer ideologically faithful candidates who adhere to a partisan policy platform, while the latter prefers marginally moderate candidates in the general election.
Studies routinely acknowledge this distinction in theory, but they often abandon the distinction between the two groups in applied studies, instead operationalizing the preferences of all three constituencies---the general constituency and two partisan primary constituencies---using the same measure: the district-level presidential vote.
<!------- TO DO ---------
- strategic voting anywhere?
------------------------->
This project argues that this use of presidential vote is not suitable to test the strategic positioning dilemma for the simple reason that votes are not equivalent to policy preferences or policy ideology.
Votes are choices that voters make under constraints, namely, the distance between the voter and the presidential candidates.
Even in simple models where ideology is the only factor influencing vote choice, observing a voter's choice of candidate contains very little information about their ideological location.
In the aggregate, Republican voters in a district may be ideological moderates or ideological conservatives, and the fact that they vote Republican does not inform us about the distribution of Republican voters.
Similarly, a district's vote outcome captures how all of its constituents vote _on average_, but because partisans tend to vote foremost for their preferred party even in the face of strong policy disagreements with the candidate [e.g. @barber-pope:2019:party-trump-ideology], aggregate vote shares for a district could easily be more affected by the _number_ of Republicans and Democrats in a district rather than the exact location of their ideological preferences.
Using the terminology by @tomz-van-houweling:2008:candidate-positioning, studying vote shares rarely presents a "critical test" of theories of voting because the same observable vote outcome can arise from many underlying voter preference configurations.
<!------- TO DO ---------
- here's two normal setup
- two party distributions
- increase Republican vote by:
- move Republicans right
- reduce dispersion of Republicans
- more Republicans
------------------------->
```{r hypothetical-parties}
sd_same <- 1
sd_shift <- 1
sd_narrow <- 0.5
median_same <- -1
median_shift <- -2
median_narrow <- -1
two_party_data <-
tibble(
x = seq(-5, 5, .01),
rep = dnorm(x, mean = -1 * median_same, sd = sd_same),
dem_shift = dnorm(x, mean = median_shift, sd = sd_shift),
dem_same = dnorm(x, mean = median_same, sd = sd_same),
dem_narrow = dnorm(x, mean = median_narrow, sd = sd_narrow)
) %>%
pivot_longer(
cols = starts_with("dem_"),
names_to = "modification",
values_to = "dem"
) %>%
mutate(
mixture_equal = 0.5*(dem + rep),
mixture_biased = (2/3 * dem) + (1/3 * rep)
) %>%
pivot_longer(
cols = starts_with("mixture_"),
names_to = "population",
values_to = "mix_density"
) %>%
pivot_longer(
cols = c(rep, dem, mix_density),
names_to = "aggregation",
values_to = "density"
) %>%
mutate(
density = case_when(
population == "mixture_equal" & aggregation != "mix_density" ~
1/2 * density,
population == "mixture_biased" & aggregation == "dem" ~
2/3 * density,
population == "mixture_biased" & aggregation == "rep" ~
1/3 * density,
TRUE ~ density
),
# mixture = case_when(
# aggregation == "mix_density" ~ "Combined Population Distribution",
# TRUE ~ "Partisan Base Distributions"
# ),
vote_dem = as.numeric(aggregation == "mix_density" & x < 0)
) %>%
filter(
population == "mixture_equal" |
modification == "dem_same"
) %>%
print()
two_party_data %>% count(modification, population, aggregation)
# make data frames for two halves of graphic
balanced_data <- two_party_data %>%
filter(modification == "dem_same" & population == "mixture_equal") %>%
print()
unbalanced_data <- two_party_data %>%
filter(modification != "dem_same" | population == "mixture_biased") %>%
print()
# parameters of underlying distribution
mod_stat_labels <- unbalanced_data %>%
filter(aggregation == c("mix_density")) %>%
group_by(modification) %>%
summarize(
dem_share = weighted.mean(vote_dem, density),
dem_mean = weighted.mean(x, density)
) %>%
mutate(
dem_sd = case_when(
modification == "dem_narrow" ~ sd_narrow,
modification == "dem_shift" ~ sd_shift,
modification == "dem_same" ~ sd_same
),
dem_median = case_when(
modification == "dem_narrow" ~ median_narrow,
modification == "dem_shift" ~ median_shift,
modification == "dem_same" ~ median_same
),
aggregation = "dem"
) %>%
print()
plot_balanced <- ggplot(balanced_data) +
aes(x = x, y = density) +
facet_wrap(
~ aggregation == "mix_density", nrow = 2,
labeller = as_labeller(
c(
"FALSE" = "District contains two partisan bases\nwith separate preference distributions",
"TRUE" = "Combined preference distribution\ndetermines election outcome"
)
)
) +
geom_ribbon(
aes(ymax = density, ymin = 0,
alpha = as.factor(vote_dem), fill = aggregation)
) +
scale_fill_manual(
values = c("rep" = rred, "dem" = dblue, "mix_density" = "gray80")
) +
scale_alpha_manual(values = c("1" = 1, "0" = 0.5)) +
geom_segment(
data = filter(balanced_data, aggregation == "mix_density" & x == 0),
aes(x = x, xend = 0, y = 0, yend = 1.3 * density),
linetype = "dashed"
) +
# geom_vline(xintercept = 0, linetype = "dashed") +
geom_text(
size = 3.5,
data = filter(mod_stat_labels, modification == "dem_same"),
aes(
x = -2.75, y = .25,
label = str_glue("Democrats:\nMedian = {dem_median}\nStd. dev = {dem_sd}")
),
hjust = 1
) +
geom_text(
size = 3.5,
data = filter(mod_stat_labels, modification == "dem_same"),
aes(
x = 2.75, y = .25,
label = str_glue("Republicans:\nMedian = {-1 * dem_median}\nStd. dev = {dem_sd}")
),
hjust = 0
) +
geom_text(
size = 3.5,
data = filter(balanced_data, aggregation == "mix_density" & x == 0),
aes(x = 0.2, y = 1.35*density, label = "Midpoint between\ncandidates"),
hjust = 0,
vjust = 2
) +
geom_text(
size = 3.5,
data = filter(balanced_data, aggregation == "mix_density" & x %in% c(-1.5, 1.5)),
aes(x = x, y = 0,
label = c("Vote Dem", "Vote Rep")),
vjust = -2
) +
labs(
y = NULL, x = "Voters' Policy Conservatism",
title = "District vote shaped by underlying\npartisan policy preferences"
) +
theme(
legend.position = "none",
axis.text.y = element_blank(),
plot.background = element_rect(color = "gray80")
)
plot_balanced
plot_unbalanced <- ggplot(unbalanced_data) +
aes(x = x, y = density) +
facet_wrap(
~ fct_relevel(modification, "dem_shift", "dem_narrow", "dem_same"),
labeller = as_labeller(
c(
"dem_shift" = "More progressive Democratic base\nincreases Democratic vote share",
"dem_narrow" = "Lower ideological variance increases Dem vote\nwithout shifting median Democrat",
"dem_same" = "Larger Democratic population increases Dem vote\nwithout changing ideal points"
)
),
ncol = 1
) +
geom_ribbon(
data = filter(unbalanced_data, aggregation == "mix_density"),
aes(ymax = density, ymin = 0,
alpha = as.factor(vote_dem), fill = aggregation),
fill = "gray"
) +
geom_line(
data = filter(unbalanced_data, aggregation != "mix_density"),
alpha = 1,
aes(color = aggregation)
) +
annotate(
geom = "segment",
x = 0, xend = 0, y = 0, yend = .3,
linetype = "dashed"
) +
geom_text(
size = 3.5,
data = mod_stat_labels,
aes(
x = 0.15, y = .35,
label = str_glue("Dem vote share: {percent(dem_share, accuracy = 1)}")
),
hjust = 0
) +
geom_text(
size = 3.5,
data = mod_stat_labels,
aes(
x = -5, y = .3,
label = str_glue("Dem median: {round(dem_median)}\nDem std. dev: {dem_sd}")
),
hjust = 0
) +
scale_color_manual(values = c("rep" = rred, "dem" = dblue)) +
scale_alpha_manual(values = c("1" = 1, "0" = 0.5)) +
labs(
y = NULL, x = "Voters' Policy Conservatism",
title = "District vote does not uniquely\nidentify preference distributions"
) +
theme(
legend.position = "none",
axis.text.y = element_blank(),
plot.background = element_rect(color = "gray80")
) +
NULL
plot_unbalanced
```
```{r plot-non-id}
plot_balanced + plot_unbalanced
```
Stated differently, the observed vote share in a district does not uniquely identify any important features of the underlying preferences of voters.
Figure \@ref(fig:plot-non-id) demonstrates the problem using a simple theoretical model of ideological voting for president.
The two left-side panels demonstrate the basic mechanics of the scenario.
We consider one congressional district that contains many constituents.
Every constituent has a policy ideal point represented on the real number line, with larger values indicating greater policy conservatism.
Every constituent also identifies with either the Republican Party or the Democratic Party.
The top-left panel breaks voters into Democratic and Republican Party affiliations and shows the probability distribution of ideal points within each partisan base.
In this example, both distributions are Normal with a scale of $1$.
Republican-identifying constituents hold policy preferences that are more conservative than Democratic constituents on average: the median Republican and Democrat are respectively located at $+`r -1 * median_same`$ and $`r median_same`$.^[
Because these are Normal distributions, the median and the mean are equivalent.
I refer to the median instead of the mean because medians are more directly relevant to spatial models of voting.
]
There is enough within-party variation that some Democratic constituents are more conservative than some Republican constituents, despite their party affiliation.
The bottom-left panel combines the two partisan distributions into one distribution for the entire constituency.
We assume at first that both partisan constituencies are equally sized, so the composite distribution is a simple finite mixture of the two distributions.^[
Analytically, if $f_{p}\left(x\right)$ is the probability density of ideal points $x$ in party $p$, then the composite density $f_{m}\left(x\right)$ is a weighted sum of the component densities: $f_{m}\left(x\right) = \sum\limits_{p}w_{p}f_{p}\left(x\right)$, where $w_{p}$ is a mixture weight representing the proportion of the total distribution contributed by party $p$, with weights that sum to $1$.
In this first example, both partisan constituencies are equally populous, so both parties have weight $w_{p} = \frac{1}{2}$.
If parties had different population sizes within the same district, $w_{p}$ would take values in proportion to those population sizes.
]
The midpoint between two presidential candidates is shown at policy location $0$.
Assuming all constituents vote according to single-peaked and symmetric utility functions over policy space, a constituent is indifferent between candidates if their ideal point is $0$, votes for the Democratic candidate if their ideal point is less than $0$ (shown in darker gray), and vote for the Republican candidate if their ideal point is greater than $0$ (shown in lighter gray).
The aggregate election result, therefore, is equal to the cumulative distribution function of the combined distribution evaluated at the candidate midpoint.
In the bottom-left panel, the vote share for the Democrat is $50$%, with some Democrats voting for the Republican candidate, and some Republicans voting for the Democratic candidate.
```{r plot-non-id, include = TRUE, fig.width = 10, fig.height = 10, out.width = "100%", fig.scap = "Non-identifiability of partisan group preferences from district vote shares.", fig.cap = "Demonstrating how district vote shares from a single election are insufficient to identify underlying policy-ideological features of the district. The left side shows how the policy preference distributions for two parties in a district (top panel) combine to form an aggregate preference distribution for the district as a whole (bottom panel). The right side shows how the Democratic vote share is affected by changes to either the locations, the scales, or the population sizes of the underlying partisan distributions."}
```
The panels on the right side of Figure \@ref(fig:plot-non-id) show how changes to one party's preference distribution affects the aggregate distribution of preferences in the combined constituency and, as a result, the presidential vote outcome in the district.
The composite distribution is again shown in gray, with dark and light shades indicating vote choice as in the bottom-left panel.
The underlying partisan distributions are outlined only with red and blue lines to reduce visual clutter.
The modifications to the underlying partisan preferences are simple, but even these simple changes reveal the fundamental problem with using district voting as a proxy for district-party public ideology.
In each panel, I intervene on only one feature of the Democratic public ideal point distribution, leaving the Republican distribution untouched (median of `r -1 * median_same`, standard deviation of `r sd_same`).
Intervening on just one component of one party's distribution is meant to keep the demonstration simple, bearing in mind that the problem is much more complex in the real world, where we can imagine multiple simultaneous changes to both parties at once.
The interventions highlight two classes of problems.
First, there are multiple modifications to the underlying partisan distribution that result in the same aggregate vote share.
This proves that the district vote does not uniquely identify the characteristics of the underlying voter distributions.
And second, we can alter the district vote outcome by changing party _sizes_ without any change to the ideal point distributions in either party.
This proves that vote shares may vary across districts even if partisan ideal points distributions are exactly identical.
In the top-right panel, I shift the location of the Democratic ideal point distribution to the left, from a median of `r median_same` to `r median_shift`.
This location shift results in a greater number of Democratic constituents with ideal points left of the candidate midpoint, increasing the Democratic vote share in the district from 50% to `r mod_stat_labels %$% dem_share[modification == "dem_shift"] %>% percent(accuracy = 1)`.
In the middle-right panel, I shrink the scale of the Democratic ideal point distribution from a standard deviation of `r sd_same` to a standard deviation of `r sd_narrow`.
Lower ideal point variance within the Democratic base has the exact same effect on the vote as shifting the location: more Democratic voters left of the midpoint, which increases the Democratic vote share to `r mod_stat_labels %$% dem_share[modification == "dem_narrow"] %>% percent(accuracy = 1)`.
This means that compared to a district with a 50% presidential vote split, we would not be able to attribute the increased Democratic vote to a constituency that is _more progressive on average_ (location) or simply _less heterogeneous_ in its policy preferences (scale).
The bottom-right panel in the figure shows how we obtain a different district vote without changing the underlying ideological distribution in either party whatsoever, instead changing only the relative population size of each partisan base.
The Democratic base in the final panel is unchanged compared to the original distribution laid out in the top-left: median of `r median_same` and standard deviation of `r sd_same`.
The only difference is that the district contains an unequal balance of partisan voters, two Democratic constituents to every one Republican constituent.
This results in an increased Democratic vote from 50% to `r mod_stat_labels %$% dem_share[modification == "dem_same"] %>% percent(accuracy = 1)`---ironically, the largest impact on the overall district vote despite not changing the ideological distribution of either party.
To review the lessons of Figure \@ref(fig:plot-non-id), observing a Democratic vote share greater than 50% reveals very little about the underlying distribution of voters.
In every panel, we observe an increase in the Democratic vote compared to our baseline scenario, but the median voter in either party does not need to change in order for vote shares to be affected.
Since the Republican distribution is identical in every panel, inferring that Republicans are less conservative in districts with greater Democratic voting would be incorrect in every case.
For the Democratic constituents, inferring a more progressive Democratic median voter from greater Democratic voting would be incorrect in two of the three cases.
It is worth repeating that the scenario laid out in Figure \@ref(fig:plot-non-id) is a vast oversimplification of the real electorate.
This is intentional, as it shows how intractable the problem becomes even in an artificial setting where we can take many variables as given.
This scenario contains no complicating elements such as non-partisan or third-party identifiers, non-policy voting, random sources of utility or utility function heterogeneity across different voters, differential turnout between partisan bases, and so on, that we might incorporate directly into a formal model.
It also does not take into account the inconveniences of real election data, where short-term forces impose additional shocks to vote shares that are unrelated to underlying voter preferences.
```{r read-cces}
cces_raw <- read_rds(here("data", "cces", "cumulative_2006_2018.Rds"))
```
```{r read-dime}
# Bonica scores and other candidate features
dime_cong_raw <-
read_csv(
here("data", "dime-v3", "cong", "dime_v3_cong_elections.csv")
) %>%
print()
```
```{r}
cces <- cces_raw %>%
mutate_all(labelled::remove_labels) %>%
mutate(
ideo5_ch = as.character(ideo5),
ideo5 = as.numeric(ideo5),
party = case_when(
pid3 == 1 ~ "Democrat",
pid3 == 2 ~ "Republican",
pid3 == 3 ~ "Independent",
TRUE ~ "Other/DK/NA"
)
) %>%
select(
cycle = year, state_abb = st, st_cd = cd, district_num = dist,
weight, party, ideo5, ideo5_ch
) %>%
filter(ideo5 %in% 1:5) %>%
na.omit() %>%
print()
```
```{r clean-dime}
dime <- dime_cong_raw %>%
transmute(
cycle, state_abb = state, district, seat,
district_num = parse_number(district),
district_pres_vs = 1 - dem_pres_vs
) %>%
filter(
seat == "federal:house"
) %>%
na.omit() %>%
distinct() %>%
print()
# dime <- dime_all_raw %>%
# mutate(
# cycle = parse_number(cycle),
# fecyear = parse_number(fecyear),
# district_num = parse_number(district),
# party =
# case_when(
# party == 100 ~ 1,
# party == 200 ~ 2
# ),
# district.pres.vs = 1 - district.pres.vs
# ) %>%
# rename_all(str_replace_all, "[.]", "_") %>%
# filter(
# seat == "federal:house",
# state %in% state.abb,
# fecyear == cycle,
# fecyear %in% cces$cycle,
# ) %>%
# select(
# state_abb = state,
# district_num,
# cycle,
# # party,
# district_pres_vs
# ) %>%
# na.omit() %>%
# distinct() %>%
# print()
```
```{r}
dime %>%
group_by(cycle) %>%
count()
```
```{r mean-ideology}
ideo_all <- cces %>%
group_by(cycle, state_abb, st_cd, district_num) %>%
summarize(
mean_ideo = weighted.mean(x = ideo5, w = weight)
) %>%
mutate(party = "All") %>%
ungroup() %>%
print()
ideo_party <- cces %>%
group_by(cycle, state_abb, st_cd, district_num, party) %>%
summarize(
mean_ideo = weighted.mean(x = ideo5, w = weight)
) %>%
ungroup() %>%
print()
ideo_merge <-
bind_rows(ideo_all, ideo_party) %>%
left_join(dime) %>%
mutate(
party_group = case_when(
party == "All" ~ "District Means",
party %in% c("Democrat", "Republican") ~ "District-Party Means"
)
) %>%
filter(cycle %% 2 == 0) %>%
print()
# write_rds(
# ideo_merge,
# here("present", "apw-2020", "easy-data", "ideo-vote.rds")
# )
```
```{r plot-ideo-mean}
ideo_merge %>%
filter(is.na(party_group) == FALSE) %>%
ggplot() +
aes(x = district_pres_vs, y = mean_ideo, color = party) +
geom_point(alpha = 0.4, shape = 16) +
geom_smooth(
aes(group = party),
color = "black"
) +
facet_grid(party_group ~ cycle) +
scale_color_manual(
values = c("All" = "gray", "Democrat" = dblue, "Republican" = rred)
) +
scale_y_continuous(
breaks = 1:5,
labels = c("Very Liberal", "Liberal", "Moderate",
"Conservative", "Very Conservative"),
limits = c(1, 5)
)
```
```{r}
selected_year <- 2018
plot_one_year <- ideo_merge %>%
filter(cycle == selected_year) %>%
filter(is.na(party_group) == FALSE) %>%
ggplot() +
aes(x = district_pres_vs, y = mean_ideo,
color = party, shape = party) +
geom_point(aes(size = party), alpha = 0.6) +
geom_smooth(
aes(group = party),
color = "black",
# method = "lm",
se = FALSE,
) +
facet_wrap(~ str_glue("{party_group} ({selected_year})")) +
scale_color_manual(
values = c("All" = "gray", "Democrat" = dblue, "Republican" = rred)
) +
scale_shape_manual(
values = c("All" = "•", "Democrat" = "D", "Republican" = "R")
) +
scale_size_manual(
values = c("All" = 10, "Democrat" = 3, "Republican" = 3)
) +
scale_y_continuous(
breaks = 1:5,
limits = c(1, 5),
labels = c(
"Very Liberal", "Liberal", "Moderate",
"Conservative", "Very Conservative"
)
) +
scale_x_continuous(
limits = c(0, 1),
labels = scales::percent_format(accuracy = 1)
) +
coord_cartesian(xlim = c(0, 1), ylim = c(1, 5)) +
theme(legend.position = "none") +
labs(
y = NULL,
x = NULL
)
plot_one_year
```
```{r}
plot_all_lines <- ideo_merge %>%
filter(is.na(party_group) == FALSE) %>%
ggplot() +
aes(x = district_pres_vs, y = mean_ideo, color = party) +
geom_smooth(
aes(group = paste(party, cycle)),
# color = "black",
# method = "lm",
se = FALSE,
) +
scale_color_manual(
values = c("All" = "gray", "Democrat" = dblue, "Republican" = rred)
) +
scale_y_continuous(
breaks = 1:5,
limits = c(1, 5),
labels = c(
"Very Liberal", "Liberal", "Moderate",
"Conservative", "Very Conservative"
)
) +
facet_wrap(
~ str_glue("Loess Fits: ({min(ideo_merge$cycle)}–{max(ideo_merge$cycle)})")
) +
scale_x_continuous(
limits = c(0, 1),
labels = scales::percent_format(accuracy = 1)
) +
coord_cartesian(xlim = c(0, 1), ylim = c(1, 5)) +
theme(
legend.position = "none",
axis.text.y = element_blank()
) +
labs(
y = NULL,
x = "District two-party vote share\n for Republican candidate in\nprior presidential election"
)
plot_all_lines
```
```{r plot-compare-measures}
plot_one_year + plot_all_lines +
plot_layout(widths = c(2, 1)) +
plot_annotation(
title = "Weak Relationship Between District Voting and Ideology Within Parties",
subtitle = "Average ideological self-placement in each congressional district",
caption = "Data: Cooperative Congressional Election Studies"
)
```
The conceptual difference between district vote shares and aggregate ideology appears in real data as well, as I show in Figure \@ref(fig:plot-compare-measures).
The figure visualizes ideological self-placement responses to the Cooperative Congressional Election Study (CCES) as an approximate measure of policy ideology [although see @ellis-stimson:2012:symbolic-ideology].
I code the responses as numeric values and calculate the average self-placement for all respondents in each congressional district, as well as the average self-placement of Republican and Democratic identifiers as separate subgroups within each district.
I then plot these average self-placements against the past presidential vote in the district.
The first two panels use `r selected_year` data to show that the district vote predicts variation in ideological self-placement reasonably well when examining congressional districts as a whole, but it does a poorer job capturing variation in self-placement within each party.
The first panel shows that districts that voted more strongly for Democratic presidential candidate in `r selected_year` were more liberal on average, and districts that voted more strongly for the Republican candidate were more conservative, indicated by positively sloped loess fit lines.
The middle panel shows that this pattern does not hold as strongly within parties.
Among Republican identifiers within each district, a weaker but still positive relationship holds overall, with more conservative Republicans in districts that voted more Republican.
Among Democratic identifiers, however, ideological self-placement is not as strongly related to aggregate voting, with a loess fit that is flatter and even negative at several points.
The final panel of loess fits is included to show that this pattern appears in all CCES years and is not particular to the responses in the `r selected_year` wave: a strong relationship between vote shares and self-placement _on average_ and weak or non-relationships within each party.
```{r plot-compare-measures, include = TRUE, fig.width = 10.5, fig.height = 6, out.width = "100%", fig.scap = "The relationship between average ideological self-placement and district vote share in congressional districts.", fig.cap = "Average ideological self placement (vertical axis) and Republican vote share (horizontal axis) in all 435 congressional districts. Mean self-placement is calculated by numerically coding CCES ideological self-placement responses before averaging. The first panel plots average self-placement among all CCES respondents in each congressional districts. The middle panel breaks respondents in each congressional district into Republican and Democratic subgroups before averaging. The final panel plots loess fits for the same relationship measured over all CCES years."}
```
The substantive takeaway from Figure \@ref(fig:plot-compare-measures) is further evidence that we should doubt the use of aggregate voting in a district as a proxy for ideological variation among partisan groups.
Because the presidential candidates are the same in each district in each year, we know that this mismatch isn't due to different candidates with different campaign positions in each district.
Instead, the observed pattern suggests that any aggregate relationship between ideological self-placement and district voting is driven at least in part by the partisan _composition_ of a district—more Republicans or more Democrats—rather than cross-district ideological variation within either party.
As a result, studies that use the presidential vote to proxy within-party ideology may simply be measuring the _size_ of a partisan group in a district instead of its ideological makeup.
Some researchers have recognized the identifiability problems with district presidential vote shares as a measure of district preferences.