Size and Redundancy of 81,277 Entries
in the Protein Data Bank (PDB)
May 2012 by Eric Martz.


Method               Molecular Weight*                Non-Redundant Sequences (<30% Identity)
X-Ray (88% of PDB)   Median: 48,000; 90% < 159,000    21% Unique 25% Unique
NMR (11% of PDB)     Median: 9,800; 90% < 18,500      57% Unique

*Insulin 5,800; Lysozyme 15,000
Albumin 70,000; Taq DNA Polymerase 94,000
Antibody 150,000; Chaperonin GROEL/S 875,000