Download citation
Download citation
link to html
A protein sequence can be classified into one of four structural classes, namely α, β, α + β and α/β, based on its amino-acid composition. The present study aims at understanding why a particular sequence with a given amino-acid composition should fold into a specific structural class. In order to answer this question, each amino acid in the protein sequence was classified to a particular neighbor density based on the number of spatial residues surrounding it within a distance of 6.5 Å. Each of the four structural classes showed a unique preference of amino acids in each of the neighbor densities. Residues which show a high compositional bias in a structural class are also found to occur in high neighbor densities. This high compositional bias towards specific residues in the four different structural classes of proteins appears to be caused by structural and functional requirements. The distribution of amino acids in different neighbor densities is graphically presented in a novel logo form which incorporates several features such as composition, the frequency of occurrence and color code for amino acids. The spatial neighbors of the residues in different neighbor densities and their secondary structural location are also represented in the form of logos. This representation helped in the identification of specific details of the whole data which may otherwise have gone unnoticed. It is suggested that the data presented in this study may be useful in knowledge-based structure modelling and de novo protein design.

Follow Acta Cryst. D
Sign up for e-alerts
Follow Acta Cryst. on Twitter
Follow us on facebook
Sign up for RSS feeds