Tissue- and Disease-specific Analysis of Genome-scale Data

Wong, Aaron

Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp012z10ws52f

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	Troyanskaya, Olga G	en_US
dc.contributor.author	Wong, Aaron	en_US
dc.contributor.other	Computer Science Department	en_US
dc.date.accessioned	2015-06-18T19:03:24Z	-
dc.date.available	2015-06-19T05:28:45Z	-
dc.date.issued	2015	en_US
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/dsp012z10ws52f	-
dc.description.abstract	Biologists using modern experimental methods are generating a massive number of genome-scale datasets. In particular, the rate of large-scale data creation in most organisms is quickly outpacing biologists' ability to perform detailed follow-up experiments. Thus a substantial gap exists between the massive data being generated and the comparatively small number of experimental validations being performed (i.e. biological knowledge). In this manuscript, we present four solutions that broadly address this growing disparity, focusing on disease- and tissue-specific genomic analysis. These solutions are unified by their approaches to this problem: by combining and integrating available public genome-wide measurements to enable biological discoveries that would otherwise be impossible. First, we demonstrate a method to systematically transfer experimental knowledge between organisms inferred from high-throughput experimental data. By leveraging functional genomic data, we can improve the coverage and accuracy of function predictions across diverse organisms and machine learning methods. Second, we present an interactive web server that addresses the needs of biologists to visualize their experimental results in the context of multi-species functional predictions and relationships. Third, we describe a method that, for the first time, leverages large data compendia to build genome-scale tissue-specific functional maps in human by integrating thousands of genome-scale datasets. Our method can extract both functional and tissue/cell-type signals even when genomic data are not resolved for the tissue and very little are known about the expression of genes in the tissue. Finally, we detail a method for biologists to analyze their genome-scale datasets in the context of the massive public data compendium. Biologists are generating and trying to make sense of massive high-throughput datasets, and their biological questions can be more precisely addressed within the biological context of their experiment. By incorporating their experimental results in the search and integration of gene expression compendia, we demonstrate improved predictive performance in identifying additional functionally related genes.	en_US
dc.language.iso	en	en_US
dc.publisher	Princeton, NJ : Princeton University	en_US
dc.relation.isformatof	The Mudd Manuscript Library retains one bound copy of each dissertation. Search for these copies in the <a href=http://catalog.princeton.edu> library's main catalog </a>	en_US
dc.subject	function prediction	en_US
dc.subject	homology	en_US
dc.subject	tissue specificity	en_US
dc.subject.classification	Computer science	en_US
dc.subject.classification	Biology	en_US
dc.title	Tissue- and Disease-specific Analysis of Genome-scale Data	en_US
dc.type	Academic dissertations (Ph.D.)	en_US
pu.projectgrantnumber	690-2143	en_US
pu.embargo.terms	2015-06-19	en_US
Appears in Collections:	Computer Science

Files in This Item:

File	Description	Size	Format
Wong_princeton_0181D_11241.pdf		14.65 MB	Adobe PDF	View/Download

Show simple item record

Search

Browse