Comparing and Combining Predictors of Mostly Disordered Proteins
Diseases and Disorders, Entropy, Genomics, Order, Protein Structure
Digital Object Identifier (DOI)
Intrinsically disordered proteins and regions carry out varied and vital cellular functions. Proteins with disordered regions are especially common in eukaryotic cells, with a subset of these proteins being mostly disordered, e.g., with more disordered than ordered residues. Two distinct methods have been previously described for using amino acid sequences to predict which proteins are likely to be mostly disordered. These methods are based on the net charge−hydropathy distribution and disorder prediction score distribution. Each of these methods is reexamined, and the prediction results are compared herein. A new prediction method based on consensus is described. Application of the consensus method to whole genomes reveals that approximately 4.5% of Yersinia pestis, 5% of Escherichia coli K12, 6% of Archaeoglobus fulgidus, 8% of Methanobacterium thermoautotrophicum, 23% of Arabidopsis thaliana, and 28% of Mus musculus proteins are mostly disordered. The unexpectedly high frequency of mostly disordered proteins in eukaryotes has important implications both for large-scale, high-throughput projects and also for focused experiments aimed at determination of protein structure and function.
Was this content written or created while at USF?
Citation / Publisher Attribution
Biochemistry, v. 44, issue 6, p. 1989-2000
Scholar Commons Citation
Oldfield, Christopher J.; Cheng, Yugong; Cortese, Marc S.; Brown, Celeste J.; Uversky, Vladimir N.; and Dunker, A. Keith, "Comparing and Combining Predictors of Mostly Disordered Proteins" (2005). Molecular Medicine Faculty Publications. 737.