Advances in Computational Protein Structure Prediction

Subramani, Ashwin

Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01br86b359k

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	Floudas, Christodoulos A	en_US
dc.contributor.author	Subramani, Ashwin	en_US
dc.contributor.other	Chemical and Biological Engineering Department	en_US
dc.date.accessioned	2011-11-18T14:45:06Z	-
dc.date.available	2011-11-18T14:45:06Z	-
dc.date.issued	2011	en_US
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/dsp01br86b359k	-
dc.description.abstract	The thesis is premised on the application of optimization theory, and algorithms based on it, to the protein structure prediction problem. The protein structure prediction problem can be expressed simply as, "Given the amino acid sequence of the protein, what is its three dimensional structure?". A number of theories suggest alternate pathways for a protein to undertake the folding process. One such theory is the hierarchical theory of protein folding, which proposes that local secondary structure of proteins is formed prior to their three dimensional arrangement. Hence, the thesis aims to address a number of the intermediate problems to the tertiary structure prediction problem. Given an amino acid sequence of a protein, we first aim to predict the location of the secondary structure elements. To address this issue, a novel mixed-integer linear optimization model has been developed. The model divides a given protein sequence into overlapping nonapeptides, and evaluates the likelihood of the central amino acid to be in an alpha helix. This likelihood is expressed as a weighted linear sum of the pairwise probabilities of neighboring amino acid pairs to form hydrogen bonds. In addition, chemical shift based data from a large database is used to reduce the superstructure of possible helical locations in the protein. Having gathered information on the location of alpha helices and beta strands through different means, a novel mixed-integer optimization model to predict beta sheet topology has been developed. Accurate prediction of the topology of a protein would provide important distance constraints between amino acids separated in the primary sequence. The model aims to maximize the pseudo-contact energy between beta-strands in the protein. In addition, a model based on torsion angle dynamics and clustering aims to re-rank the shortlisted set of topologies in order to identify the native topology of the protein. In addition to constraints derived out of location and mutual contacts of secondary structures, it is important to impose distance and angle constraints on the disordered loop regions of the protein. Unlike the previous methods, the flexible nature of loops precludes successful prediction using only database-based methods. Hence, a novel loop structure prediction framework has been developed, which incorporates non-linear non-convex optimization, along with dihedral angle sampling and discrete side-chain optimization. An iterative approach is introduced to sequentially reduce the predicted bounds on the dihedral angles. All of these preceding steps are used to generate constraints, which are incorporated into the three dimensional structure prediction. The tertiary structure prediction algorithm combines deterministic global optimization, stochastic conformational space annealing and torsion angle dynamics to generate structural conformers which satisfy the constraints. For a blind case study, it is difficult to determine the native structure from an ensemble. Hence, a new, traveling-salesman problem (TSP) based clustering approach has been introduced. The method iteratively eliminates low quality structures from the ensemble, and eventually helps select five conformers which are closest to the native structure from the generated ensemble	en_US
dc.language.iso	en	en_US
dc.publisher	Princeton, NJ : Princeton University	en_US
dc.relation.isformatof	The Mudd Manuscript Library retains one bound copy of each dissertation. Search for these copies in the <a href=http://catalog.princeton.edu> library's main catalog </a>	en_US
dc.subject	Astrofold	en_US
dc.subject	Hierarchical Protein Folding	en_US
dc.subject	Protein Structure Prediction	en_US
dc.subject.classification	Chemical engineering	en_US
dc.title	Advances in Computational Protein Structure Prediction	en_US
dc.type	Academic dissertations (Ph.D.)	en_US
pu.projectgrantnumber	690-2143	en_US
Appears in Collections:	Chemical and Biological Engineering

Files in This Item:

File	Description	Size	Format
Subramani_princeton_0181D_10082.pdf		8.37 MB	Adobe PDF	View/Download

Show simple item record

Search

Browse