Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01kp78gj709
Full metadata record
DC Field | Value | Language
dc.contributor.advisor | Imai, Kosuke | -
dc.contributor.author | Abdurehman, Rahji | -
dc.date.accessioned | 2015-07-01T18:28:15Z | -
dc.date.available | 2015-07-01T18:28:15Z | -
dc.date.created | 2015-04-30 | -
dc.date.issued | 2015-07-01 | -
dc.identifier.uri | http://arks.princeton.edu/ark:/88435/dsp01kp78gj709 | -
dc.description.abstract | This paper introduces an alternative to the popular machine learning algorithm known as Latent Dirichlet Allocation (LDA). We derive the theory behind this alternative, which we call "keyword-assisted LDA," and demonstrate a specific use case for it with sample results. The algorithm works by taking a set of constraints, specified from prior knowledge of the underlying topic structure of a corpus, and ensuring that they are maintained during inference. Depending on one's underlying implementation of LDA, enforcing these constraints takes a variety of forms; this paper delves into the details for implementations using Gibbs sampling or Expectation-Maximization. | en_US
dc.format.extent | 36 pages | en_US
dc.language.iso | en_US | en_US
dc.title | Keyword-assisted LDA: Exploring New Methods for Supervised Topic Modeling | en_US
dc.type | Princeton University Senior Theses | -
pu.date.classyear | 2015 | en_US
pu.department | Computer Science | en_US
pu.pdf.coverpage | SeniorThesisCoverPage | -
dc.rights.accessRights | Walk-in Access. This thesis can only be viewed on computer terminals at the Mudd Manuscript Library (http://mudd.princeton.edu). | *
Appears in Collections: Computer Science, 1987-2023
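The abstract describes keyword-assisted LDA as enforcing prior-knowledge constraints during inference, with one variant based on Gibbs sampling. The thesis itself derives the actual method; as a rough illustration only, here is a minimal collapsed Gibbs sampler in which tokens of seeded keywords are clamped to their prior-knowledge topic. All function names, the clamping scheme, and the hyperparameters below are my own assumptions for the sketch, not details taken from the thesis.

```python
import numpy as np

def keyword_lda_gibbs(docs, n_topics, vocab_size, seed_topics,
                      n_iter=100, alpha=0.1, beta=0.01, seed=0):
    """Collapsed Gibbs sampling for LDA with a simple keyword constraint:
    tokens of words listed in `seed_topics` (word_id -> topic_id) are
    assigned their seed topic and never resampled, so the constraint is
    maintained throughout inference. Illustrative sketch only."""
    rng = np.random.default_rng(seed)
    n_dk = np.zeros((len(docs), n_topics))   # doc-topic counts
    n_kw = np.zeros((n_topics, vocab_size))  # topic-word counts
    n_k = np.zeros(n_topics)                 # tokens per topic
    assignments = []
    # Initialization: seeded words get their seed topic, others are random.
    for d, doc in enumerate(docs):
        z_d = []
        for w in doc:
            z = seed_topics.get(w, rng.integers(n_topics))
            z_d.append(z)
            n_dk[d, z] += 1; n_kw[z, w] += 1; n_k[z] += 1
        assignments.append(z_d)
    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                if w in seed_topics:
                    continue  # constraint: seeded tokens keep their topic
                z = assignments[d][i]
                n_dk[d, z] -= 1; n_kw[z, w] -= 1; n_k[z] -= 1
                # Standard collapsed Gibbs conditional for unconstrained tokens.
                p = (n_dk[d] + alpha) * (n_kw[:, w] + beta) / (n_k + vocab_size * beta)
                z = rng.choice(n_topics, p=p / p.sum())
                assignments[d][i] = z
                n_dk[d, z] += 1; n_kw[z, w] += 1; n_k[z] += 1
    # Posterior mean estimates of the topic-word and doc-topic distributions.
    phi = (n_kw + beta) / (n_kw.sum(axis=1, keepdims=True) + vocab_size * beta)
    theta = (n_dk + alpha) / (n_dk.sum(axis=1, keepdims=True) + n_topics * alpha)
    return theta, phi
```

Because seeded tokens are never reassigned, each seeded word accumulates counts only in its seed topic, so its estimated probability is highest there by construction. The EM variant mentioned in the abstract would instead enforce the constraint on the responsibilities in the E-step.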

Files in This Item:
File | Size | Format
PUTheses2015-Abdurehman_Rahji.pdf | 609.88 kB | Adobe PDF


Items in DataSpace are protected by copyright, with all rights reserved, unless otherwise indicated.