Size and Redundancy of 43,000 Entries
in the Protein Data Bank (PDB)
April 2007 by Eric Martz for Protein Explorer.


Method               Molecular Weight*    Non-Redundant Sequences
                                          (<30% Identity)

X-Ray                Median: 45,000       22% Unique 25% Unique
(85% of PDB)         90% < 145,000

NMR                  Median: 9,300        55% Unique
(15% of PDB)         90% < 19,000

*Insulin 5,800;   Lysozyme 15,000
Albumin 70,000;   Taq DNA Polymerase 94,000
Antibody 150,000;   Chaperonin GroEL/GroES 875,000
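
One way to read the size figures is to check which of the reference proteins in the footnote fall under the "90% <" limit for each method. The sketch below does only that, using the numbers given here; the names P90_LIMIT, REFERENCE_MW and below_p90 are hypothetical and chosen for illustration.

```python
# Molecular weight below which 90% of entries fall, copied from the table.
P90_LIMIT = {"X-Ray": 145_000, "NMR": 19_000}

# Reference molecular weights, copied from the footnote.
REFERENCE_MW = {
    "Insulin": 5_800,
    "Lysozyme": 15_000,
    "Albumin": 70_000,
    "Taq DNA Polymerase": 94_000,
    "Antibody": 150_000,
    "Chaperonin GROEL/S": 875_000,
}


def below_p90(method):
    """Return the reference proteins lighter than the 90% limit for a method."""
    limit = P90_LIMIT[method]
    return [name for name, mw in REFERENCE_MW.items() if mw < limit]


if __name__ == "__main__":
    for method in P90_LIMIT:
        print(method, below_p90(method))
```

By this comparison, only insulin and lysozyme are lighter than the NMR limit, while an antibody and the chaperonin complex exceed even the X-ray limit.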