Bioinformatics October 22, 2004

TO BRING:
  All physical models
  Toobers for HomolMod!
  Notebook computer: SLIDES ALL OFF-LINE

[Frontispiece: Project HIV protease binding inhibitor]

START IN NETSCAPE 4.8 FOR HOT KEY CONTROL OF FONT SIZE

----------------------------------------------------
I   structure data   10:10 - 10:22
II  whence                 - 10:34
III knowledge              - 10:46
IV  derived                - 10:58
----------------------------------------------------

INTRO

Cup largely empty or slightly full (comp sci/math vs. biol).
---------------------------------------------------
WHENCE DATA?
Function-driven structure vs. genome-driven.
NMR median chain length about 75AA=8kD (20% of NMR entries >150AA=17kD)
    (270AA=30kD; 1% of PDB NMR entries are >225AA=25kD)
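
Arithmetic check on the AA=kD pairs above, as a minimal sketch.
Assumes the common rule of thumb of ~110 Da per residue (an assumption;
these notes do not state the factor). Function names are illustrative only.

```python
# Rule-of-thumb conversion between chain length and mass.
# ASSUMPTION: ~110 Da average residue mass (not stated in the notes).
AVG_RESIDUE_DA = 110.0


def residues_to_kd(n_residues: int) -> float:
    """Approximate mass in kD for a chain of n_residues amino acids."""
    return n_residues * AVG_RESIDUE_DA / 1000.0


def kd_to_residues(mass_kd: float) -> int:
    """Approximate chain length (rounded) for a given mass in kD."""
    return round(mass_kd * 1000.0 / AVG_RESIDUE_DA)


if __name__ == "__main__":
    # Chain lengths quoted in the notes above.
    for n in (75, 150, 225, 270):
        print(f"{n} AA ~ {residues_to_kd(n):.2f} kD")
```

With this factor the quoted pairs agree to within rounding.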

Baker & Sali:
    Median protein chain length in PDB ~200 (230 XRD, 75 NMR).
    60% of PDB chains are longer than 150AA (2/3 of X-ray; 1/5 of NMR)
    Median intronless protein lengths in eukaryotes ~300.
    http://sege.ntu.edu.sg/wester/intronless/length.htm

---------------------------------------------------
KNOWLEDGE

1°, 2°, 3° structure in ab initio: USE TOOBERS

STRUCT GENOMICS
    Oct 2004 TargetDB: 76,000 targets, 1,307 (1.7%) in PDB
        vs. 1 year ago: 33,680 targets (3.9% in PDB)
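
    Quick check of the Oct 2004 TargetDB fraction quoted above. A sketch
    only; the helper name is hypothetical and the counts are the ones in
    these notes.

```python
def percent_in_pdb(n_in_pdb: int, n_targets: int) -> float:
    """Percentage of structural-genomics targets with a PDB entry."""
    return 100.0 * n_in_pdb / n_targets


if __name__ == "__main__":
    # Oct 2004 counts as quoted in the notes.
    print(f"{percent_in_pdb(1307, 76000):.1f}%")  # prints 1.7%
```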
---------------------------------------------------

DERIVED

PQS
    asymmetric unit may be part of specific oligomer