| Gene | T10M13.4 |
| Putative Identification | predicted protein of unknown function |
| Position | 18527 to 23440, from the initial methionine to the termination codon |
| Strand | + |
| EST match | N65123 |
| Database match | C. elegans protein B0414.8 |
CDS: The table below lists the coordinates of the exons of T10M13.4 and which gene modeling algorithm selected the 5' and 3' termini for each exon (GS = GenScan, Gr = GRAIL, M = MZEF, NPG = NetPlantGene - selects splice sites only, not exons). Splice sites of the CDS determined by comparison to EST N65123 are designated by EST and those determined by similarity to EST AA660863 are designated by est. Alignment with Medicago truncatula EST AA660863 is not at perfect identity to T10M13.4, but can aid in suggesting splice sites. Interestingly, the first ATG codon for which the alignment to the EST is perfect is at position 18569-18571 and this may represent an alternate choice for the initial methionine. This codon and its methionine are highlighted below.
| Exon | Range | 5' | 3' |
|---|---|---|---|
| 1 | 18527 - 18697 | Gr | est, GS, Gr, NPG |
| 2 | 19163 - 19302 | est, GS, Gr, M, NPG | est, GS, Gr, M, NPG |
| 3 | 19428 - 19482 | est, GS, Gr, NPG | est, GS, Gr, NPG |
| 4 | 19571 - 19678 | est, GS, Gr, M, NPG | est, GS, Gr, M, NPG |
| 5 | 19786 - 19884 | est, Gr, NPG | Gr, NPG |
| 6 | 20015 - 20092 | GS, Gr, M, NPG | GS, Gr, M, NPG |
| 7 | 20247 - 20324 | Gr, M, NPG | Gr, NPG |
| 8 | 20463 - 20615 | GS, Gr, M, NPG | GS, M, NPG |
| 9 | 20856 - 20956 | Gr, M, NPG | Gr, M, NPG |
| 10 | 21395 - 21461 | Gr, NPG | Gr |
| 11 | 21869 - 22123 | GS, M, NPG | EST, GS, M, NPG |
| 12 | 22204 - 22320 | EST, GS, NPG | EST, GS, Gr, NPG |
| 13 | 22397 - 22468 | EST, M, NPG | EST, M, NPG |
| 14 | 22549 - 22599 | EST, Gr, M, NPG | EST, Gr, NPG |
| 15 | 22685 - 22921 | EST, GS, Gr, M, NPG | GS, Gr, M |
| 16 | 23016 - 23192 | GS, Gr, NPG | GS, NPG |
| 17 | 23315 - 23440 | GS, Gr, NPG | GS, Gr |
Alternate exons not used in building the gene model: GenScan concatenates T10M13.3 and T10M13.4 and so does not predict an initial exon for T10M13.4. Alternate GenScan exons are from 18350 to 18481, from 18489 to 18697 and from 18947 to 19078. GRAIL predicts exons from 20463 to 20690, 21237 to 21334, from 21656 to 21787, from 22274 to 22320 and from 23016 to 21395. MZEF predicts exons from 20247 to 20311 and from 22549 to 22566. NetPlantGene does not predict any potential splice sites above a confidence score of 0.90 that were not used in building the gene model.
Complete CDS of T10M13.4
ATGGCGACGGAGGCAGCTCCGATGGACGAAAAGGCGAAGAGAATGAGAGATCTATTGTCG AGCTTCTACGCACCGGATCCTTCAATCTCGACGAGTGGTTCCTCCATCAACGCCTCTTTC GATAACATCAACAGCACTTCCTTTGATGCTGATCAGTACATGGATCTCATGATCAAAAAG TCGAATTTGGAGGTGCTTCTGCAAAGACATGTTCAAATGGCTGCTGAGATTAAAAATCTC GACACAGACTTGCAAATGCTAGTCTATGAAAATTACAACAAGTTTATCAGTGCAACAGAT ACAATCAAAAGGATGAAGAGTAATATTTTCGGGATGGAAGGCAATATGGACCAGCTTCTT CAGAAGATAATGTCAGTACAATCAAAGAGTGATGGGGTCAACACTTCTCTTTTCGAAAAG AGAGAACATATAGAGAAACTGCACCGGACTCGTAATCTTCTTCGTAAAGTTCAGTTCATC TATGATTTACCTGCAAGACTGCAAAAATGTATCAAGTCAGAAGCCTATGGCGATGCTGTC AGGTTCTATACTGGAGCAATGCCAATTCTCAAGGTATATGGCGATACATCATTCCAAGAC TGCAGGCGAGCTTCCGAAGAAGCTATAGAAATTATCATAAAGAACTTGCAGACGAAGCTA TTTTCAGATTCAGAATCCATACAGGCGAGAGCTGAAGCTGCAGTGCTTCTTAAGCAGTTA GATGTCCCTGTTGATAGCTTAAAGGCTAAACTGTTGGAAAAACTGGAACAGTCTCTTGAT GGTCTTCAAATAAAGCCTGAGGAGGCAAGTACACTTGTAGAGGACGATGATTCATCTAAC GATACAGAAAGCAATGACCAACATCCTGCTAAAATTCATGAGGATGCCGTACGTGGATTT TCTGAGGCCATACGTGCTTATCGAGAAATATTCCCCGACTCAGAAGAAAGACTTTTCAAA CTCGCGAGAGCCTTAACAGCAATGCTGCCCAGGTTACTCTTAAGCAATTTGTTGCGAGAA TGTTTTCTCATCTTCAACAGGATATATCAGGACTTCCGTCAGCTTCTTGATGAAAAAACT GGAATATTTATAAAGATGAAGGATTTAATCAGTGGTTGGATTCAGAAAGGATCTCAAGAC TTCTTCAGGTCACTAGAAGCTCAATTTCTAGTGCTTTCTGGAAAAACTAGCTCATCAAAC GATATAGAGGGAAAATCGAGTGACAAAATTCATGCTGGTCTCATTCTTGTATTGGCGCAG CTCTCTGTCTTCATCGAACAAAAGGTCATCCCGAGAGTTACTGAGGAAATAGCTGCTTCC TTCTCTGGTGGAAATTCCCAAGCCTTTGAGAATGGACCTGCTTTTATTCCTGGAGAACTT TGCCGGGTCTTCCACGCAGCCAGCGAGAAACTTCTCCAGCATTACATAGACACAAGAACA CAGAAGGTATCAGTTTTACTGAGAAAAAGGTTCAAGACACCTAACTGGGTTAAGCACAAG GAGCCACGAGAGGTTCACATGTATGTCGATATGTTTCTTCACGAGTTAGAAGAAGTTGGT AAAGAAGTGAAACAAGTTTTACCTCAAGGGACTTTCCGTAAGCACAAAAGAACAGACAGC AACGGAAGTAACACTACAACCTCATCACGAAGCAATACCCTCCATAATGATAAGATGGCA CGGTCAAATTCACAAAGAGCTCGTAGTCAGCTTTTCGAGACACATCTTGCAAAACTGTTC AAGCAAAAAGTAGAAATATTCACCAAGGTTGAATTTACCCAGGAATCCGTTGTTACAACA ACAGTAAAACTTTGTCTGAAGAGTTTGCAAGAATATGTCCGTCTCCAAACGTTTAACCGG AGCGGGTTCCAGCAGATTCAGCTAGACATTCAGTTCTTAAAAGCTCCATTGAAGGAAGCA GTTGAGGACGAAGCTGCAATAGACTTCTTGCTCGACGAGGTGATCGTTGCGGCTTCGGAG AGATGTCTTGATGTGATTCCATTGGAGCCACCGATCTTGGACAAACTCATACAAGCTAAA CTCGCTAAATCAAAGGAGCACAACAACAACACAGTTTCTTCTTAA
Protein translation of T10M13.4
MATEAAPMDEKAKRMRDLLSSFYAPDPSISTSGSSINASFDNINSTSFDADQYMDLMIKK SNLEVLLQRHVQMAAEIKNLDTDLQMLVYENYNKFISATDTIKRMKSNIFGMEGNMDQLL QKIMSVQSKSDGVNTSLFEKREHIEKLHRTRNLLRKVQFIYDLPARLQKCIKSEAYGDAV RFYTGAMPILKVYGDTSFQDCRRASEEAIEIIIKNLQTKLFSDSESIQARAEAAVLLKQL DVPVDSLKAKLLEKLEQSLDGLQIKPEEASTLVEDDDSSNDTESNDQHPAKIHEDAVRGF SEAIRAYREIFPDSEERLFKLARALTAMLPRLLLSNLLRECFLIFNRIYQDFRQLLDEKT GIFIKMKDLISGWIQKGSQDFFRSLEAQFLVLSGKTSSSNDIEGKSSDKIHAGLILVLAQ LSVFIEQKVIPRVTEEIAASFSGGNSQAFENGPAFIPGELCRVFHAASEKLLQHYIDTRT QKVSVLLRKRFKTPNWVKHKEPREVHMYVDMFLHELEEVGKEVKQVLPQGTFRKHKRTDS NGSNTTTSSRSNTLHNDKMARSNSQRARSQLFETHLAKLFKQKVEIFTKVEFTQESVVTT TVKLCLKSLQEYVRLQTFNRSGFQQIQLDIQFLKAPLKEAVEDEAAIDFLLDEVIVAASE RCLDVIPLEPPILDKLIQAKLAKSKEHNNNTVSS*
Alignment of T10M13.4 and C. elegans protein B0414.8
1 60
T10M13-4 MATEAAPMDEKAKRMRDLLSSFYAPDPSISTSGSSINASFDNINSTSFDADQYMDLMIKK
CeB0414.8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~MSSVLDVTKPDFDVEAFVVKLLRE
61 120
T10M13-4 SNLEVLLQRHVQMAAEIKNLDTDLQMLVYENYNKFISATDTIKRMKSNIFGMEGNMDQLL
CeB0414.8 KSLDGLVKEEEEMVSAVRRLDSDVHQIVYENYNKFLTATNTVRKIQDEFTQLDSEMKSLS
121 180
T10M13-4 QKIMSVQSKSDGVNTSLFEKREHIEKLHRTRNLLRKVQFIYDLPARLQKCIKSEAYGDAV
CeB0414.8 RSMSTISTLIGNLDGVLGEKRDDILQLGSSYKVVNSLKHIFDLPHVLRSEFDERNYGEVL
181 240
T10M13-4 RFYTGAMPILKVYGDT.SFQDCRRASEEAIEIIIKNLQTKLFSDSESIQARAEAAVLLKQ
CeB0414.8 RMFKLAEESLSQYKDVPTVQLVLQKSKKIYDMTENQLMDQLRNPASGAELVSEAVDLLLT
241 300
T10M13-4 LDVPVDSLKAKLLEKLEQSL..DGLQIKPEEASTLVEDDDSSNDTESNDQHPAKIHEDAV
CeB0414.8 IGRDEDEVQKVLLTCSEQSLRVDLKELSANHSDVLDLVDKASESFIPNLTLIATTHDRLF
301 360
T10M13-4 RGFSEAIRAYREIFPDSEERLFKLARALTAMLPRLLLSNLLRECFLIFNRIYQ.......
CeB0414.8 EDKREDLITVLKTEMNSLHAL..VSKVFLSSSDAKDCSIVVRALDRYFRKISTCRYVIPG
361 420
T10M13-4 .DFRQLLDEKTGIFIKMK.DL....ISGWIQKGSQDFFRSLEAQFLVLSGKTSSSNDIEG
CeB0414.8 LDFLPLTIELINAVSKHEIDLSLTRIKEELKNGLNEVRKALINEEKDLSALASKIEQVFV
421 480
T10M13-4 KSSDKIHAGLILVLAQLSVF.......IEQKVIPRVTEEIAASFSGGNSQ...AFENGPA
CeB0414.8 HQVKTALANLLLFTASDVTFANLPPDEFRQSFSFNAHERLLVQAFHRFSELADEYESGAG
481 540
T10M13-4 FIPGELCRVFHAASEKLLQHYIDTRTQKVSVLLRKRFK............TPNWVKHKEP
CeB0414.8 EIRFVDPRV.HLVFAVALQHLSNKSAVYLLNLCREQFSLSPDDGLTDITVVMSEVKTRAQ
541 600
T10M13-4 REVHMYVDMFLHELEE..VGKEVKQVLPQGT......FRKHKRTDSNGSNTTTSSRSNTL
CeB0414.8 KLVRCYAEKTGLSMGETLIKGCAMLVQPAATPSAKFDFRASVRRLVEEMNTCDSELTLLL
601 660
T10M13-4 HNDKMARSN..SQRARSQLFETHLAKLFKQKVEIFTKVEFTQESVVTTTVKLCLKSLQEY
CeB0414.8 GGDSKPKDSRVSRRPITTALDAARDSLWCERIDFHLQIHFNRASIITVIVKVVLKIFIES
661 720
T10M13-4 VRLQTFNRSGFQQIQLDIQFLKAPLKEAVEDEAAIDFLLDEVIVAASERCLDVIPLEPPI
CeB0414.8 IRLQTYSKFGVEQVQVDCYYLQRCLAALVSDEVVVNSMVDQALSSALKRCQDPVLVHPSR
721 743
T10M13-4 LDKLIQAKLAKSKEHNNNTVSS*
CeB0414.8 LAQLCEQPPANRPSSQASSLGY*
written 29 Jul 97
updated 31 Jul 98
Larry
Parnell