What percentage of the human proteome has known structure?
Drug companies solve a large number of structures,
but most are not deposited in the Protein Data Bank.
- ~40,000 genes in the human genome.
- ~25,000 entries in the Protein Data Bank:
- ~1,100 of these are sequence-distinct entries of good quality.
- These entries are mostly single domains or fragments of proteins.
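
As a rough check on the figures in the list above, the ratios can be worked out directly. This is only a back-of-envelope sketch: the variable names are illustrative, and it assumes, purely for illustration, that each sequence-distinct entry maps to a different human gene. Since most entries are single domains or fragments, the naive ratio says nothing about whole proteins, and it does not include homology modeling.

```python
# Approximate figures quoted in the list above.
HUMAN_GENES = 40_000            # ~40,000 genes in the human genome
PDB_ENTRIES = 25_000            # ~25,000 entries in the Protein Data Bank
DISTINCT_GOOD_ENTRIES = 1_100   # ~1,100 sequence-distinct, good-quality entries


def percent(part, whole):
    """Return part as a percentage of whole."""
    return 100.0 * part / whole


# Share of PDB entries that are sequence-distinct and of good quality.
distinct_share_of_pdb = percent(DISTINCT_GOOD_ENTRIES, PDB_ENTRIES)

# Naive ratio of distinct entries to genes.
# ASSUMPTION (illustration only): one entry per gene, no homology modeling.
naive_entries_per_gene = percent(DISTINCT_GOOD_ENTRIES, HUMAN_GENES)

print(f"Distinct, good-quality share of PDB: {distinct_share_of_pdb:.2f}%")
print(f"Naive entries-to-genes ratio: {naive_entries_per_gene:.2f}%")
```

Under these assumptions the directly solved fraction is only a few percent, which is why the estimate that follows leans on homology modeling.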
Answer (homology modeling): ~20% of domains, so
~10% of whole proteins?
Solution: Structural Genomics?
Eric Martz, University of Massachusetts, July 2003 (revised February 2004)