Skip navigation
Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01st74ct60x
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorImai, Kosuke
dc.contributor.authorTarr, Alexander
dc.contributor.otherElectrical Engineering Department
dc.date.accessioned2021-10-04T13:49:09Z-
dc.date.available2021-10-04T13:49:09Z-
dc.date.created2021-01-01
dc.date.issued2021
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/dsp01st74ct60x-
dc.description.abstractContributing to the rising popularity of computational social science, this dissertation presents new methods grounded in machine learning for solving several important problems in political science. In Chapter 2, adapted from coauthored work in Fifield et al. (2020), we present a new algorithm for sampling redistricting plans from arbitrary distributions. We formulate redistricting as a graph-cut problem and adapt an image segmentation algorithm from the computer vision literature to construct a Metropolis-Hastings style algorithm for sampling graph partitions. We then validate our algorithm using a small-scale map for which all possible redistricting plans can be enumerated, finding that our method samples from the true distribution. Lastly, we apply our algorithm to a more realistic redistricting problem using data from New Hampshire. In Chapter 3, adapted from coauthored work with June Hwang and Kosuke Imai, we develop a fully-automated video processing system for encoding information in political campaign advertisement videos. Our approach applies state-of-the-art algorithms to replicate a subset of variables in the human-labeled Wesleyan Media Project (WMP) data, performing tasks including video summarization, facial recognition, text recognition, speech recognition, audio classification, and text classification. We validate our method using the WMP data from the 2012 and 2014 election cycles, finding that machine coding is competitive with human coding for most of the variables considered in our study. In Chapter 4, adapted from coauthored work in Tarr and Imai (2021), we adapt the support vector machine (SVM) algorithm to address the balancing problem in causal inference. We first establish SVM as a kernel balancing method by showingthat the soft-margin SVM dual problem computes weights which balance functions in a reproducing kernel Hilbert space. We then show that the SVM cost parameter controls a trade-off between balance and sample size, allowing us to use path algorithms to give exact characterizations of how balance and causal effect estimates change over the path. We validate our method using simulation data, showing that our algorithm is competitive with leading balancing methods. Finally, we conduct an empirical study using the right heart catheterization data from Connors et al. (1996).
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.publisherPrinceton, NJ : Princeton University
dc.relation.isformatofThe Mudd Manuscript Library retains one bound copy of each dissertation. Search for these copies in the library's main catalog: <a href=http://catalog.princeton.edu>catalog.princeton.edu</a>
dc.subjectcausal inference
dc.subjectgerrymandering
dc.subjectmachine learning
dc.subjectMCMC
dc.subjectSVM
dc.subjectvideo data
dc.subject.classificationPolitical science
dc.subject.classificationArtificial intelligence
dc.subject.classificationStatistics
dc.titleMachine Learning Methods for Computational Social Science
dc.typeAcademic dissertations (Ph.D.)
pu.date.classyear2021
pu.departmentElectrical Engineering
Appears in Collections:Electrical Engineering

Files in This Item:
File Description SizeFormat 
Tarr_princeton_0181D_13868.pdf14.41 MBAdobe PDFView/Download


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.