Gene T3F12.11
Putative Identification microfibrillar-associated protein
Position 101963 to 103491, from the initial methionine to the termination codon
Strand -
EST match F14281, 3' sequence of clone YAY871
Database match microfibril-associated proteins

 

CDS:  The table below lists both exons of T3F12.11, their coordinates within the BAC, and which exon-finding program(s) predicted the 3' or 5' terminus of that exon (EST = terminus defined by an EST, GS = GenScan, Gr = Grail, M = MZEF).

Exon Range 3' 5'
1 102296 to 103491 EST,GS,Gr GS,Gr
2 101963 to 102074 GS,Gr EST,GS,Gr

CDS sequence from the initial methionine to the termination codon

ATGTCGGTCACAGCGGGAGTGAGTGAATCTGCAATTGCTGTAAGGGAGAAACTAAAAGGT
GGTATTGGACAGACTAAAGTAAGAAGGTATTGGCCAGGAAAAGCTCCGGAGTGGGCTGAG
GAAGCTGAAGAAGATGATGATGTTAGGATGCAGAAGTTTTCTGTTTTGGATAGAGCTTTT
CCAAAGAATGATGATTTGGGTGTTGCTAGGAAGGATGATCCAAGGCTGCGGCGTTTAGCT
CAGACTAAAGTTGAAAACCGTGACGAAGTTAGAGCTGATCATCGGCGTATTAGACAGGCT
GAGATTATATCTACGGAAGAAGAAGAATCGAGGAATCAAGAGAATAGAGACGAGGATGAT
GATGAAGATGCCTTGGAAGAAAGAAGAAGAAGAATTAAGGAGAAGAATCTTAGACGAGCA
CAAGAGGAGGCTGCTTTACTTCCTTTAGAAGAAGAAGATGAGATACAAGAGGAAGAAGAG
GAGGAGGAGGAGTCTGAGTACGAGACTGATTCGGAAGATGATATGCCTGGTATTGCCTTG
ATTAAGCCTGTTTTTGTACCGAAAGCTGAGAGAGATACAATTGCAGAGCGAGAGAGGCTT
GAGGCTGAAGAAGAAGCTCTTGAGGAATTAGCAAAGAGAAAATTGGAGCAAAGAAAAATA
GAGACAAAGCAAATTGTGGTTGAGGAAGTTAGGAAAGATGAAGAGATACGGAAGAACATA
CTATTGGAGGAAGCTAATATTGGAGATGTGGAAACTGATGACGAACTCAATGAAGCTGAG
GAGTATGAAGTTTGGAAGACAAGAGAGATTGGTAGGATCAAGAGAGAAAGAGATGCAAGG
GAAGCTATGCTGAGAGAGAGGGAAGAAATAGAGAAGTTGAGGAATATGACAGAGCAGGAG
AGGAGAGATTGGGAGAGGAAGAATCCGAAACCTTCTTCAGCTCAACCGAAAAAGAAATGG
AACTTTATGCAGAAATATTACCATAAGGGTGCCTTCTTCCAGGCAGATCCTGATGATGAG
GCAGGTTCTGCTGGAACCGATGGTATATTTCAGCGCGACTTCTCTGCTCCAACCGGAGAA
GATAGGTTGGACAAATCGATTCTCCCCAAAGTTATGCAAGTCAAGCACTTTGGTCGTAGT
GGAAGAACTAAATGGACTCACCTTGTCAATGAAGACACAACAGATTGGAGTAACCCGTGG
ACTTCCAATGATCCTCTACGTGAAAAATACAACAAGAAAATGGCAGGCATGGATGCTCCA
ATCGCAAAACCAAAAGGGAGCAAGAAGATGAAAGATTGGGAGACTTAA

 

Protein sequence:


MSVTAGVSESAIAVREKLKGGIGQTKVRRYWPGKAPEWAEEAEEDDDVRMQKFSVLDRAF
PKNDDLGVARKDDPRLRRLAQTKVENRDEVRADHRRIRQAEIISTEEEESRNQENRDEDD
DEDALEERRRRIKEKNLRRAQEEAALLPLEEEDEIQEEEEEEEESEYETDSEDDMPGIAL
IKPVFVPKAERDTIAERERLEAEEEALEELAKRKLEQRKIETKQIVVEEVRKDEEIRKNI
LLEEANIGDVETDDELNEAEEYEVWKTREIGRIKRERDAREAMLREREEIEKLRNMTEQE
RRDWERKNPKPSSAQPKKKWNFMQKYYHKGAFFQADPDDEAGSAGTDGIFQRDFSAPTGE
DRLDKSILPKVMQVKHFGRSGRTKWTHLVNEDTTDWSNPWTSNDPLREKYNKKMAGMDAP
IAKPKGSKKMKDWET*

Alignment of microfibrillar-associated protein 1 from chicken and human, with C. elegans protein F43G9.10, T3F12.11, and a consensus sequence. In the consensus sequence, amino acids are placed only if that residue is encoded by T3F12.11 and at least two of the other sequences.


            1                                                         60
   HumMFA1  ~~~~~~~~~MSVPSALMKQPPIQSTAGAVPVRNEKGEISMEKVKVKRYVSGK.RP..... 
 ChickMFA1  ~~~~~~~~~MSAPSALVKQPPIQSTAGACPSRNEKGRAVYGEGEGETVCVGKAAA..... 
CeF43G9.10  MGDYVPGFEQRESDNRSFGHSRLPTLGAIPIKNEKGQTVMQKVKVSRYVAGKAPEYARNY 
  T3F12.11  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~MSVTAGVSESAIAV 
 consensus  ------------------------------------------------V-----------

            61                                                       120
   HumMFA1  DYAPMESSDEEDEEFQFIKKAKEQEAEPEEQEEDSSSDP..RLRR.LQNRISEDVEERLA 
 ChickMFA1  DYVPMESSEEEDEEFQFIKKAKEQEVEPEEQEEEVANDP..RLRRLLQNRITEDVEERLA 
CeF43G9.10  DSDSSESDRETDRDDDRRRRRRRESSDEEDRRRHRRHEDYGRRRQVEKPEVLGKVEDESS 
  T3F12.11  REKLKGGIGQTKVRRYWPGKAPEWAEEAEEDDDVRMQKFSVLDRAFPKNDDLGVARKDDP 
 consensus  -------------------KA-E---E-EE-------------R----N-----------

            121                                                      180
   HumMFA1  RHRKIVEPEVV.GESDSEVEGDAWRMEREDSSEEEEE.....EIDDEE...IERRRGMMR 
 ChickMFA1  RHRKIVEPEVVSGESDSEVEGEAWRVEREDTSEEEEE.....EIDDEE...IERWRGMMR 
CeF43G9.10  ENEQESEEDEEKQEERRERARMRRLELHENNREKDEEQEDSAESDEED...FERRRQMLR 
  T3F12.11  RLRRLAQTKVENRDEVRADHRRIRQAEIISTEEEESRNQENRDEDDDEDALEERRRRIKE 
 consensus  R-R------V----------------E-----EEE---------DD-E----ERRR----

            181                                                      240
   HumMFA1  QRAQERKNE.EMEVMEVEDEGRSGEESESESEYEEYTDSEDEMEPRLKPVFIRKKDRVTV 
 ChickMFA1  QRAQERKTE.ELEVMELEDEGRSGEESELESEYEEYTDSEDEMEPRLKPVFIRKKDRITV 
CeF43G9.10  DRAIKREEEIKREIKEELEEDDVEEEEEEESSEEEDSDEDDDPVPRLKPIFTRKKDRITL 
  T3F12.11  KNLRRAQEEAALLPLEEEDEIQEEEEEEEESEYETDSEDDMPGIALIKPVFVPKAERDTI 
 consensus  --------E------E-EDE----EE-E-ESEYE-------------KPVF--K--R-T-

            241                                                      300
   HumMFA1  QEREAEALKQKELEQEAKRMAEERRQYTLQIVGEETPKELEENKRSL...AALDALNTDD 
 ChickMFA1  QEREAEALKQKELEQEAKRLAEERRKYTLKIVEEEAKKELEENKRSL...AALDALDTDD 
CeF43G9.10  QEAEKEKEKEILKKIEDEKRAEERKRESAKLVEKVLQEEEAAEKRKTEDRVDLSSVLTDD 
  T3F12.11  AERERLEAEEEALEELAKRKLEQRKIETKQIVVEEVRKDEEIRKNILLEEANIGDVETDD 
 consensus  -ERE--------LE--AKR--E-R---T--IV-EE--K--E--K--L---A------TDD

            301                                                      360
   HumMFA1  E.NDEEEYEAWKVRELKRIKRDREDREALEKEKAEIERMRNLTEEERRAELRANGKVITN 
 ChickMFA1  E.NDEEEYEAWKVRELKRIKRDREEREAMEKEKAEIERMRNLTEEERRAELRANGKVVTN 
CeF43G9.10  E.TENMAYEAWKLREMKRLKRNRDEREEAAREKAELDKIHAMSEEERLKYLRLNPKVITN 
  T3F12.11  ELNEAEEYEVWKTREIGRIKRERDAREAMLREREEIEKLRNMTEQERRDWERKNPKP.SS 
 consensus  E-N--EEYE-WK-RE--RIKR-R--REA---E--EIE--RN-TE-ERR---R-N-K----

            361                                                      420
   HumMFA1  KAVKGKYKFLQKYYHRGAFFM.DEDEE........VYKRDFSAPTLEDHFNKTILPKVMQ 
 ChickMFA1  KAVKGKYKFLQKYYHRGAFFM.DEDEE........VYKRDFSAPTLEDHFNKTILPKVMQ 
CeF43G9.10  KQDKGKYKFLQKYFHRGAFFL.DEEDE........VLKRNFAEATNDDQFDKTILPKVMQ 
  T3F12.11  AQPKKKWNFMQKYYHKGAFFQADPDDEAGSAGTDGIFQRDFSAPTGEDRLDKSILPKVMQ 
 consensus  ---K-K--F-QKYYH-GAFF--D-DDE-----------RDFSAPT-ED---K-ILPKVMQ

            421                                                         483
   HumMFA1  VKNFGRSGRTKYTHLVDQDTTSFDSAWGQESAQNTKFFKQKAAGVRDVFERPSAKKRKTT*~~
 ChickMFA1  VKNFGRSGRTKYTHLVDQDTTSFDSAWGQESAQNTKFFKQKAAGVRDVFERPSAKKRKTT*~~
CeF43G9.10  VKNFGKASRTKYTHLTEEDTTDHQGVWASTNQLNSQFSTKRAGGSRPVFERPATKKRKN*~~~
  T3F12.11  VKHFGRSGRTKWTHLVNEDTTDWSNPWTSNDPLREKYNKKMAGMDAPIAKPKGSKKMKDWET*
 consensus  VK-FGRSGRTK-THLV--DTT-----W--------K--K--A------------KK-K-----

 

_________________________________________________________________________

created 7 Oct 97
Larry Parnell