| Gene | T3F12.11 |
| Putative Identification | microfibrillar-associated protein |
| Position | 101963 to 103491, from the initial methionine to the termination codon |
| Strand | - |
| EST match | F14281, 3' sequence of clone YAY871 |
| Database match | microfibril-associated proteins |
CDS: The table below lists both exons of T3F12.11, their coordinates within the BAC, and which exon-finding program(s) predicted the 3' or 5' terminus of that exon (EST = terminus defined by an EST, GS = GenScan, Gr = Grail, M = MZEF).
| Exon | Range | 3' | 5' |
|---|---|---|---|
| 1 | 102296 to 103491 | EST,GS,Gr | GS,Gr |
| 2 | 101963 to 102074 | GS,Gr | EST,GS,Gr |
CDS sequence from the initial methionine to the termination codon
ATGTCGGTCACAGCGGGAGTGAGTGAATCTGCAATTGCTGTAAGGGAGAAACTAAAAGGT GGTATTGGACAGACTAAAGTAAGAAGGTATTGGCCAGGAAAAGCTCCGGAGTGGGCTGAG GAAGCTGAAGAAGATGATGATGTTAGGATGCAGAAGTTTTCTGTTTTGGATAGAGCTTTT CCAAAGAATGATGATTTGGGTGTTGCTAGGAAGGATGATCCAAGGCTGCGGCGTTTAGCT CAGACTAAAGTTGAAAACCGTGACGAAGTTAGAGCTGATCATCGGCGTATTAGACAGGCT GAGATTATATCTACGGAAGAAGAAGAATCGAGGAATCAAGAGAATAGAGACGAGGATGAT GATGAAGATGCCTTGGAAGAAAGAAGAAGAAGAATTAAGGAGAAGAATCTTAGACGAGCA CAAGAGGAGGCTGCTTTACTTCCTTTAGAAGAAGAAGATGAGATACAAGAGGAAGAAGAG GAGGAGGAGGAGTCTGAGTACGAGACTGATTCGGAAGATGATATGCCTGGTATTGCCTTG ATTAAGCCTGTTTTTGTACCGAAAGCTGAGAGAGATACAATTGCAGAGCGAGAGAGGCTT GAGGCTGAAGAAGAAGCTCTTGAGGAATTAGCAAAGAGAAAATTGGAGCAAAGAAAAATA GAGACAAAGCAAATTGTGGTTGAGGAAGTTAGGAAAGATGAAGAGATACGGAAGAACATA CTATTGGAGGAAGCTAATATTGGAGATGTGGAAACTGATGACGAACTCAATGAAGCTGAG GAGTATGAAGTTTGGAAGACAAGAGAGATTGGTAGGATCAAGAGAGAAAGAGATGCAAGG GAAGCTATGCTGAGAGAGAGGGAAGAAATAGAGAAGTTGAGGAATATGACAGAGCAGGAG AGGAGAGATTGGGAGAGGAAGAATCCGAAACCTTCTTCAGCTCAACCGAAAAAGAAATGG AACTTTATGCAGAAATATTACCATAAGGGTGCCTTCTTCCAGGCAGATCCTGATGATGAG GCAGGTTCTGCTGGAACCGATGGTATATTTCAGCGCGACTTCTCTGCTCCAACCGGAGAA GATAGGTTGGACAAATCGATTCTCCCCAAAGTTATGCAAGTCAAGCACTTTGGTCGTAGT GGAAGAACTAAATGGACTCACCTTGTCAATGAAGACACAACAGATTGGAGTAACCCGTGG ACTTCCAATGATCCTCTACGTGAAAAATACAACAAGAAAATGGCAGGCATGGATGCTCCA ATCGCAAAACCAAAAGGGAGCAAGAAGATGAAAGATTGGGAGACTTAA
Protein sequence:
MSVTAGVSESAIAVREKLKGGIGQTKVRRYWPGKAPEWAEEAEEDDDVRMQKFSVLDRAF PKNDDLGVARKDDPRLRRLAQTKVENRDEVRADHRRIRQAEIISTEEEESRNQENRDEDD DEDALEERRRRIKEKNLRRAQEEAALLPLEEEDEIQEEEEEEEESEYETDSEDDMPGIAL IKPVFVPKAERDTIAERERLEAEEEALEELAKRKLEQRKIETKQIVVEEVRKDEEIRKNI LLEEANIGDVETDDELNEAEEYEVWKTREIGRIKRERDAREAMLREREEIEKLRNMTEQE RRDWERKNPKPSSAQPKKKWNFMQKYYHKGAFFQADPDDEAGSAGTDGIFQRDFSAPTGE DRLDKSILPKVMQVKHFGRSGRTKWTHLVNEDTTDWSNPWTSNDPLREKYNKKMAGMDAP IAKPKGSKKMKDWET*
Alignment of microfibrillar-associated protein 1 from chicken and human, with C. elegans protein F43G9.10, T3F12.11, and a consensus sequence. In the consensus sequence, amino acids are placed only if that residue is encoded by T3F12.11 and at least two of the other sequences.
1 60
HumMFA1 ~~~~~~~~~MSVPSALMKQPPIQSTAGAVPVRNEKGEISMEKVKVKRYVSGK.RP.....
ChickMFA1 ~~~~~~~~~MSAPSALVKQPPIQSTAGACPSRNEKGRAVYGEGEGETVCVGKAAA.....
CeF43G9.10 MGDYVPGFEQRESDNRSFGHSRLPTLGAIPIKNEKGQTVMQKVKVSRYVAGKAPEYARNY
T3F12.11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~MSVTAGVSESAIAV
consensus ------------------------------------------------V-----------
61 120
HumMFA1 DYAPMESSDEEDEEFQFIKKAKEQEAEPEEQEEDSSSDP..RLRR.LQNRISEDVEERLA
ChickMFA1 DYVPMESSEEEDEEFQFIKKAKEQEVEPEEQEEEVANDP..RLRRLLQNRITEDVEERLA
CeF43G9.10 DSDSSESDRETDRDDDRRRRRRRESSDEEDRRRHRRHEDYGRRRQVEKPEVLGKVEDESS
T3F12.11 REKLKGGIGQTKVRRYWPGKAPEWAEEAEEDDDVRMQKFSVLDRAFPKNDDLGVARKDDP
consensus -------------------KA-E---E-EE-------------R----N-----------
121 180
HumMFA1 RHRKIVEPEVV.GESDSEVEGDAWRMEREDSSEEEEE.....EIDDEE...IERRRGMMR
ChickMFA1 RHRKIVEPEVVSGESDSEVEGEAWRVEREDTSEEEEE.....EIDDEE...IERWRGMMR
CeF43G9.10 ENEQESEEDEEKQEERRERARMRRLELHENNREKDEEQEDSAESDEED...FERRRQMLR
T3F12.11 RLRRLAQTKVENRDEVRADHRRIRQAEIISTEEEESRNQENRDEDDDEDALEERRRRIKE
consensus R-R------V----------------E-----EEE---------DD-E----ERRR----
181 240
HumMFA1 QRAQERKNE.EMEVMEVEDEGRSGEESESESEYEEYTDSEDEMEPRLKPVFIRKKDRVTV
ChickMFA1 QRAQERKTE.ELEVMELEDEGRSGEESELESEYEEYTDSEDEMEPRLKPVFIRKKDRITV
CeF43G9.10 DRAIKREEEIKREIKEELEEDDVEEEEEEESSEEEDSDEDDDPVPRLKPIFTRKKDRITL
T3F12.11 KNLRRAQEEAALLPLEEEDEIQEEEEEEEESEYETDSEDDMPGIALIKPVFVPKAERDTI
consensus --------E------E-EDE----EE-E-ESEYE-------------KPVF--K--R-T-
241 300
HumMFA1 QEREAEALKQKELEQEAKRMAEERRQYTLQIVGEETPKELEENKRSL...AALDALNTDD
ChickMFA1 QEREAEALKQKELEQEAKRLAEERRKYTLKIVEEEAKKELEENKRSL...AALDALDTDD
CeF43G9.10 QEAEKEKEKEILKKIEDEKRAEERKRESAKLVEKVLQEEEAAEKRKTEDRVDLSSVLTDD
T3F12.11 AERERLEAEEEALEELAKRKLEQRKIETKQIVVEEVRKDEEIRKNILLEEANIGDVETDD
consensus -ERE--------LE--AKR--E-R---T--IV-EE--K--E--K--L---A------TDD
301 360
HumMFA1 E.NDEEEYEAWKVRELKRIKRDREDREALEKEKAEIERMRNLTEEERRAELRANGKVITN
ChickMFA1 E.NDEEEYEAWKVRELKRIKRDREEREAMEKEKAEIERMRNLTEEERRAELRANGKVVTN
CeF43G9.10 E.TENMAYEAWKLREMKRLKRNRDEREEAAREKAELDKIHAMSEEERLKYLRLNPKVITN
T3F12.11 ELNEAEEYEVWKTREIGRIKRERDAREAMLREREEIEKLRNMTEQERRDWERKNPKP.SS
consensus E-N--EEYE-WK-RE--RIKR-R--REA---E--EIE--RN-TE-ERR---R-N-K----
361 420
HumMFA1 KAVKGKYKFLQKYYHRGAFFM.DEDEE........VYKRDFSAPTLEDHFNKTILPKVMQ
ChickMFA1 KAVKGKYKFLQKYYHRGAFFM.DEDEE........VYKRDFSAPTLEDHFNKTILPKVMQ
CeF43G9.10 KQDKGKYKFLQKYFHRGAFFL.DEEDE........VLKRNFAEATNDDQFDKTILPKVMQ
T3F12.11 AQPKKKWNFMQKYYHKGAFFQADPDDEAGSAGTDGIFQRDFSAPTGEDRLDKSILPKVMQ
consensus ---K-K--F-QKYYH-GAFF--D-DDE-----------RDFSAPT-ED---K-ILPKVMQ
421 483
HumMFA1 VKNFGRSGRTKYTHLVDQDTTSFDSAWGQESAQNTKFFKQKAAGVRDVFERPSAKKRKTT*~~
ChickMFA1 VKNFGRSGRTKYTHLVDQDTTSFDSAWGQESAQNTKFFKQKAAGVRDVFERPSAKKRKTT*~~
CeF43G9.10 VKNFGKASRTKYTHLTEEDTTDHQGVWASTNQLNSQFSTKRAGGSRPVFERPATKKRKN*~~~
T3F12.11 VKHFGRSGRTKWTHLVNEDTTDWSNPWTSNDPLREKYNKKMAGMDAPIAKPKGSKKMKDWET*
consensus VK-FGRSGRTK-THLV--DTT-----W--------K--K--A------------KK-K-----
_________________________________________________________________________
created 7 Oct 97
Larry Parnell