WARNING:
This document is under construction and may contain errors!
Amino Acid Sequence of Human CD11a

Domains are colored as given by Swiss-Prot (ITAL_HUMAN, P20701; also at NCBI Entrez as accession 1170591).
Also indicated is the "I domain" crystallized by Qu & Leahy.
Total length (without signal sequence) is 1145 amino acids.
Signal sequence length 25   Extracellular length 1063
Possible N-linked glycosylation   I domain    MIDAS (see Lee, Bergelson)
Discrepancy R -> W found by Qu & Leahy.
        1 MKDSCITVMA MALLSGFFFF APASSYNLDV RGARSFSPPR AGRHFGYRVL QVGNGVIVGA
       61 PGEGNSTGSL YQCQSGTGHC LPVTLRGSNY TSKYLGMTLA TDPTDGSILA CDPGLSRTCD
      121 QNTYLSGLCY LFRQNLQGPM LQGRPGFQEC IKGNVDLVFL FDGSMSLQPD EFQKILDFMK
                                         ^ begin Qu/Leahy   
      181 DVMKKLSNTS YQFAAVQFST SYKTEFDFSD YVKRKDPDAL LKHVKHMLLL TNTFGAINYV
      241 ATEVFREELG ARPDATKVLI IITDGEATDS GNIDAAKDII RYIIGIGKHF QTKESQETLH
      301 KFASKPASEF VKILDTFEKL KDLFTELQKK IYVIEGTSKQ DLTSFNMELS SSGISADLSR
                                end Qu/Leahy    ^
      361 GHAVVGAVGA KDWAGGFLDL KADLQDDTFI GNEPLTPEVR AGYLGYTVTW LPSRQKTSLL
      421 ASGAPRYQHM GRVLLFQEPQ GGGHWSQVQT IHGTQIGSYF GGELCGVDVD QDGETELLLI
      481 GAPLFYGEQR GGRVFIYQRR QLGFEEVSEL QGDPGYPLGR FGEAITALTD INGDGLVDVA
      541 VGAPLEEQGA VYIFNGRHGG LSPQPSQRIE GTQVLSGIQW FGRSIHGVKD LEGDGLADVA
      601 VGAESQMIVL SSRPVVDMVT LMSFSPAEIP VHEVECSYST SNKMKEGVNI TICFQIKSLY
      661 PQFQGRLVAN LTYTLQLDGH RTRRRGLFPG GRHELRRNIA VTTSMSCTDF SFHFPVCVQD
      721 LISPINVSLN FSLWEEEGTP RDQRAQGKDI PPILRPSLHS ETWEIPFEKN CGEDKKCEAN
      781 LRVSFSPARS RALRLTAFAS LSVELSLSNL EEDAYWVQLD LHFPPGLSFR KVEMLKPHSQ
      841 IPVSCEELPE ESRLLSRALS CNVSSPIFKA GHSVALQMMF NTLVNSSWGD SVELHANVTC
      901 NNEDSDLLED NSATTIIPIL YPINILIQDQ EDSTLYVSFT PKGPKIHQVK HMYQVRIQPS
      961 IHDHNIPTLE AVVGVPQPPS EGPITHQWSV QMEPPVPCHY EDLERLPDAA EPCLPGALFR
     1021 CPVVFRQEIL VQVIGTLELV GEIEASSMFS LCSSLSISFN SSKHFHLYGS NASLAQVVMK
     1081 VDVVYEKQML YLYVLSGIGG LLLLLLIFIV LYKVGFFKRN LKEKMEAGRG VPNGIPAEDS

                                              ^^^^^^ heterodimerization
     1141 EQLASGQEAG DPGCLKPLHE KDSESGGGKD
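The "possible N-linked glycosylation" key presumably marks Asn residues in N-X-S/T sequons, where X is any residue except Pro. That is the usual consensus pattern, but the source does not say which rule Swiss-Prot applied, so treat the following as a minimal sketch under that assumption. It scans only the first 120 residues of the listing above; the names FRAGMENT, SEQUON and find_sequons are illustrative.

```python
import re

# Residues 1-120 of the listing above (signal sequence included), spaces stripped.
FRAGMENT = "".join("""
    MKDSCITVMA MALLSGFFFF APASSYNLDV RGARSFSPPR AGRHFGYRVL QVGNGVIVGA
    PGEGNSTGSL YQCQSGTGHC LPVTLRGSNY TSKYLGMTLA TDPTDGSILA CDPGLSRTCD
""".split())

# Asn, then any residue except Pro, then Ser or Thr.
# The lookahead keeps the match one residue wide so overlapping sequons are found.
SEQUON = re.compile(r"N(?=[^P][ST])")


def find_sequons(seq, first_position=1):
    """Return the positions of the Asn in each N-X-S/T sequon (X != Pro).

    Positions are numbered from first_position, matching the listing's
    numbering when the fragment starts at residue 1.
    """
    return [m.start() + first_position for m in SEQUON.finditer(seq)]


print(find_sequons(FRAGMENT))  # [65, 89]
```

A sequon is only a candidate site; whether it is actually glycosylated cannot be decided from the sequence alone.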
Transmembrane length 24 (29?)   Cytoplasmic length 58 (53?)
Swiss-Prot gives 24/58 (transmembrane/cytoplasmic). Larson's original sequence gives 29/53, a division recently agreed to by Nancy Hogg.
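Both transmembrane/cytoplasmic splits should account for the same 1145 mature residues, and adding the signal sequence should give the number of the last residue in the listing (1170). A short arithmetic check, using only the lengths quoted in this document (the variable and function names are illustrative):

```python
SIGNAL = 25           # signal sequence length
EXTRACELLULAR = 1063  # extracellular length
MATURE_TOTAL = 1145   # total length without signal sequence

# (transmembrane, cytoplasmic) lengths under the two assignments.
ASSIGNMENTS = {
    "Swiss-Prot": (24, 58),
    "Larson": (29, 53),
}


def mature_length(transmembrane, cytoplasmic, extracellular=EXTRACELLULAR):
    """Sum the three regions of the mature (signal-free) chain."""
    return extracellular + transmembrane + cytoplasmic


for source, (tm, cyto) in ASSIGNMENTS.items():
    total = mature_length(tm, cyto)
    print(f"{source}: {EXTRACELLULAR} + {tm} + {cyto} = {total}")

print("precursor length:", MATURE_TOTAL + SIGNAL)
```

The two assignments differ only in where the membrane/cytoplasm boundary falls: five residues move from the cytoplasmic tail to the transmembrane segment, so the total is unchanged.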
Heterodimerization & GFFKR motif: see Pardi, Peter, O'Toole. GFFKR is conserved in all integrin alpha chains (Song).
Discrepancy: position 189 was Trp (W) in the sequence reported by Larson et al. but Arg (R) in that reported by Qu & Leahy. The former cDNA clone was obtained from U-937, a monocytic cell line derived from the lymphoma of a 37-year-old male; the latter came from THP-1, a human monocytic cell line derived from a 1-year-old male patient with acute monocytic leukemia.
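Position 189 does not line up with the numbering of the listing above, where residue 189 is Thr. One plausible reading, assumed here and not stated in the source, is that 189 counts from the first residue after the 25-residue signal sequence, which would place it at 214 in the listing. The sketch below checks that reading against the 181-240 row; the function names are illustrative.

```python
SIGNAL_LENGTH = 25  # signal sequence length stated in this document

# Residues 181-240 of the listing above (numbering includes the signal sequence).
ROW_START = 181
ROW = "".join(
    "DVMKKLSNTS YQFAAVQFST SYKTEFDFSD YVKRKDPDAL LKHVKHMLLL TNTFGAINYV".split()
)


def mature_to_precursor(position, signal_length=SIGNAL_LENGTH):
    """Convert signal-free (mature) numbering to the listing's numbering."""
    return position + signal_length


def residue_at(precursor_position, row=ROW, row_start=ROW_START):
    """Return the one-letter residue at a listing position within this row."""
    return row[precursor_position - row_start]


precursor_pos = mature_to_precursor(189)
print(precursor_pos, residue_at(precursor_pos))  # 214 R
print(189, residue_at(189))                      # 189 T
```

Under this assumption the listing carries Arg at the disputed position, which matches the Qu & Leahy residue.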

Crystallographically Determined Secondary Structure of the I domain

1LFA: Qu & Leahy's 1LFA Alpha helix   Beta strand as given in 1LFA.PDB. RasMol's determination (said to use Kabsch & Sander's DSSP algorithm) agrees quite well, though not perfectly, with that in the PDB file. However, the helix and strand ranges given in Qu & Leahy's paper, said to come from PROCHECK's implementation of the Kabsch & Sander algorithm, agree with neither the PDB file nor the NCBI record.
NCBI: Qu & Leahy's 1LFA Alpha helix   Beta strand as given by NCBI Entrez, which quotes the PDB as the source.

1LFA  121                                C IKGNVDLVFL FDGSMSLQPD EFQKILDFMK
NCBI  121                                C IKGNVDLVFL FDGSMSLQPD EFQKILDFMK

1LFA  181 DVMKKLSNTS YQFAAVQFST SYKTEFDFSD YVKRKDPDAL LKHVKHMLLL TNTFGAINYV
NCBI  181 DVMKKLSNTS YQFAAVQFST SYKTEFDFSD YVKRKDPDAL LKHVKHMLLL TNTFGAINYV

1LFA  241 ATEVFREELG ARPDATKVLI IITDGEATDS GNIDAAKDII RYIIGIGKHF QTKESQETLH
NCBI  241 ATEVFREELG ARPDATKVLI IITDGEATDS GNIDAAKDII RYIIGIGKHF QTKESQETLH

1LFA  301 KFASKPASEF VKILDTFEKL KDLFTELQKK IYVIEG
NCBI  301 KFASKPASEF VKILDTFEKL KDLFTELQKK IYVIEG
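One way to put a number on how well two secondary-structure assignments agree is to write each as a per-residue string (H for helix, E for strand, - for neither) and compare them position by position. The sketch below does that. The two strings are toy data invented for illustration, not the actual 1LFA assignments, and the function name is hypothetical.

```python
def agreement(a, b):
    """Compare two per-residue secondary-structure strings of equal length.

    Returns (fraction of matching positions, list of 0-based mismatch indices).
    """
    if len(a) != len(b):
        raise ValueError("assignments must cover the same residues")
    if not a:
        raise ValueError("assignments must not be empty")
    mismatches = [i for i, (x, y) in enumerate(zip(a, b)) if x != y]
    return (len(a) - len(mismatches)) / len(a), mismatches


# Toy strings only (NOT taken from 1LFA.PDB, RasMol, or the NCBI record).
pdb_file = "--EEEEE---HHHHHHHH--"
rasmol   = "--EEEEEE--HHHHHHH---"

fraction, differing = agreement(pdb_file, rasmol)
print(f"{fraction:.0%} agreement; mismatches at {differing}")
```

Disagreements between assignment methods typically fall at the ends of helices and strands, as in the toy example, so listing the mismatch positions is often more informative than the overall fraction.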


Feedback to Eric Martz.