Automatic Redistricting Using GFlowNets to Solve Gerrymandering

Gerrymandering is the practice of drawing electoral district boundaries to unfairly favor one political party or group.

We replicate the results of the Markov chain Monte Carlo (MCMC) algorithms presented in the paper Automated Redistricting Simulation Using Markov Chain Monte Carlo. The MCMC algorithm incorporates contiguity and equal population constraints to produce valid districting plans.

To extend these results, we implement a novel GFlowNet approach to redistricting. GFlowNets allow for efficient exploration of diverse redistricting solutions, complementing the MCMC framework by improving the sampling process.

Sections

Currently under construction

Introduction
Problem
Data
Methodology
Results
Conclusion
References
Contributors
License

Introduction

Gerrymandering is the practice of manipulating the boundaries of an electoral constituency to favor one party or class. This project aims to reduce the impact of gerrymandering by creating a automatic simulation tool that can be used to draw fair and unbiased electoral districts.

Problem

Current Markov Chain Monte Carlo implementations (Fifield 2020) are too expensive to run on large states like Pennsylvania. Our automatic redistricting tool will use Generative Flow Networks (GFNs) to generate a set of possible district boundaries that are both contiguous and compact and then "select" the most fair and unbiased redistricting plan based on specific constraints.

Markov Chain Monte Carlo (MCMC) Sampling is inefficient, especially with the parallel tempering for Markov Chains on the temperature of the Gibbs distribution approximation.

Generative Flow Networks (GNFs) should solve this issue by providing a more efficient and scalable approach to generating and evaluating potential districting plans. Unlike MCMC methods, GFNs leverage directed acyclic graphs (DAGs) to model the sequential generation process of district boundaries, allowing for faster convergence and better handling of complex constraints such as contiguity and compactness.

Data

We use the following website to find the data for the congressional districts of any state in the US: https://alarm-redist.org/fifty-states/PA_cd_2020/

We use the data or Pennsylvania like in the reference paper.

The data has been extracted and saved in json format using jsonlite in R.

Methodology

MCMC Sampling

Core algorithm:

Initialization: Start from a valid partition of the graph into contiguous districts.
Turn On Edges: Randomly activate edges between nodes (precincts) with a small probability ( q ) and gather connected components.
Boundary Identification: Identify all connected components along the boundaries of districts using BFS.
Select Components for Swapping: Randomly choose a subset of non-adjacent connected components along the boundaries using the Zero-truncated Poisson distribution.
Propose Swaps: Reassign the chosen components to adjacent districts, ensuring districts remain contiguous.
Acceptance Check: Evaluate the proposed swap using an acceptance probability based on the Metropolis-Hastings criterion.

The variations of the MCMC algorithm modify the Acceptance Check to handle the population constraint:

Hard Constraint: Plans violating the allowed population deviation ( \delta ) are immediately rejected, strictly enforcing the constraint but limiting mixing efficiency due to frequent rejections.
Soft Constraint: Acceptance is adjusted using a Gibbs distribution to favor near-valid plans. Invalid plans assist transitions and are reweighted later with Sampling-Importance Resampling, improving mixing efficiency.
Target Distribution: Plans can be sampled either uniformly from all contiguous districts or using a Gibbs distribution that emphasizes plans with near-equal populations.

Generative Flow Networks (GFNs)

Torchgfn:

Results

In the paper, they demonstrate that small changes in the district boundaries can nearly eliminate partisan bias in the electoral outcome. They also show that the proposed method can be used to generate a large number of redistricting plans that are both contiguous and compact.

Name		Name	Last commit message	Last commit date
Latest commit History 74 Commits
R_scripts		R_scripts
common		common
data		data
examples		examples
notebooks		notebooks
output		output
papers		papers
partitions/IA		partitions/IA
poster		poster
src		src
utils		utils
.env		.env
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
map_with_graph.png		map_with_graph.png
map_with_node_graph.pdf		map_with_node_graph.pdf
test_graph_borders.ipynb		test_graph_borders.ipynb
zoomed_in_map.pdf		zoomed_in_map.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Automatic Redistricting Using GFlowNets to Solve Gerrymandering

Sections

Introduction

Problem

Data

Methodology

MCMC Sampling

Generative Flow Networks (GFNs)

Results

Conclusion

References

Contributors

License

About

Releases

Packages

Contributors 4

Languages

arnaudbergeron/GFN_Gerrymandering

Folders and files

Latest commit

History

Repository files navigation

Automatic Redistricting Using GFlowNets to Solve Gerrymandering

Sections

Introduction

Problem

Data

Methodology

MCMC Sampling

Generative Flow Networks (GFNs)

Results

Conclusion

References

Contributors

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages