Title: On Design of Optimal Nonlinear Kernel Potential Function for Protein Folding and Protein Design

Abstract: Potential functions are critical for computational studies of protein structure prediction, folding, and sequence design. A widely used class of potentials for coarse-grained models of proteins is contact potentials in the form of a weighted linear sum of pairwise contacts. However, these potentials have been shown to be unsuitable choices because they cannot stabilize native proteins against a large number of decoys generated by gapless threading. We develop an alternative framework for designing protein potentials. We describe how finding an optimal protein potential can be understood from two geometric viewpoints, and we derive nonlinear potentials using a mixture of Gaussian kernel functions for folding and design. The optimization criterion for obtaining the parameters of the potential is to minimize bounds on the generalization error of discriminating protein structures from decoys not used in training. In our experiment we use a training set of 440 protein structures representing a major portion of all known protein structures, and about 14 million structure decoys and sequence decoys obtained by gapless threading. We succeeded in obtaining a nonlinear potential with perfect discrimination of the 440 native structures and native sequences. For the more challenging task of sequence design when decoys are obtained by gapless threading, we show that there is no linear potential with perfect discrimination of all 440 native sequences. Results on an independent test set of 194 proteins also showed that the nonlinear kernel potential performs well.
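As a rough illustration of the idea of a potential built from a mixture of Gaussian kernels, the sketch below scores a contact-count vector by its kernel similarity to stored decoy and native vectors. The exact functional form, the sign convention (lower score for natives), and the names `gaussian_kernel`, `kernel_potential`, `sigma`, and the toy vectors are assumptions made for illustration only; they are not taken from the paper, and the weights here are set by hand rather than obtained by the optimization the abstract describes.

```python
import math


def gaussian_kernel(x, y, sigma=1.0):
    """Gaussian kernel exp(-||x - y||^2 / (2 * sigma^2)) between two vectors."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * sigma ** 2))


def kernel_potential(c, natives, decoys, w_native, w_decoy, sigma=1.0):
    """Hypothetical nonlinear potential for a contact vector c.

    Assumed form: similarity to decoy vectors raises the score and
    similarity to native vectors lowers it, so natives should score lower.
    """
    decoy_term = sum(w * gaussian_kernel(c, d, sigma)
                     for w, d in zip(w_decoy, decoys))
    native_term = sum(w * gaussian_kernel(c, n, sigma)
                      for w, n in zip(w_native, natives))
    return decoy_term - native_term


# Toy two-dimensional "contact vectors" (made-up numbers, not real data).
natives = [[1.0, 0.0]]
decoys = [[0.0, 1.0]]
w_native = [1.0]  # hand-picked weights; a real potential would learn these
w_decoy = [1.0]

score_native = kernel_potential(natives[0], natives, decoys, w_native, w_decoy)
score_decoy = kernel_potential(decoys[0], natives, decoys, w_native, w_decoy)

# Perfect discrimination on this toy set means the native scores lower.
print(score_native < score_decoy)
```

In this toy setting, discrimination simply means that each native vector receives a lower score than every decoy; the abstract's claim concerns the same kind of inequality over 440 natives and about 14 million decoys.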
Comments: 22 pages, 7 figures, and 5 tables
Subjects: Soft Condensed Matter (cond-mat.soft); Quantitative Biology (q-bio)
Cite as: arXiv:cond-mat/0302002 [cond-mat.soft]
  (or arXiv:cond-mat/0302002v1 [cond-mat.soft] for this version)

Submission history

From: Jie Liang
[v1] Fri, 31 Jan 2003 23:11:24 GMT (163kb)