| Gene | T2H3.7 |
| Putative Identification | hypothetical protein |
| Position | 899 to 1948, from the initial methionine to the termination codon |
| Strand | + |
| EST hits | none |
| Database match | A. thaliana hypothetical proteins |
CDS: The table below lists the coordinates of the single T2H3.7 exon and which exon prediction algorithms selected the 5' and 3' termini (GS = GenScan, Gr = GRAIL, M = MZEF, NPG = NetPlantGene - selects splice sites only, not exons).
| Exon | Range | 5' | 3' |
|---|---|---|---|
| 1 | 899 - 1948 | GS | GS |
Alternate exons not used in building the gene model: GRAIL predicts a single-exon gene from 848 to 1948, but when allowed to select exons beyond the termination codon at 1948, predicts a two-exon gene. These exons are from 848 to 1735 and from 2472 to 2525. The second exon shows no matches to entries in the database. NetPlantGene predicts splice acceptors at 883 (confidence score = 0.86), at 1495 (0.56) and at 2472 (1.00) and splice donors at 990 (0.87), at 2587 (0.97) and at 2969 (0.90).
Complete CDS of T2H3.7
ATGTCTACGACGGCCAAGGAGGCCCCGTCGTCGTTATTTTCGTTACTGCCAAACGATATT GTTTTAAACATCTTAGCCCGCGTGCCAAGATGGTATCATCCAATTCTCTCTTGCGTGTCG AAGAACTTACGATTTCTCGTAAGTTCCTCTGAGCTTAAGATTACGCGATCTCTTTTGGAG AAAGATCGCTTCTACGTCTGCTTTCAAGAACATAGTAATTCTCCTTCCTTAACCACGTAT CATTGGTTCAGTTTCACTGAGAATCGTCGTTGTTTGGTCTCGATCCCGTTCACTTCTCCT GTTGAGCCTTACTTTGCAACTCTCACGCTGGGCCCCGAAATCTACTTTGTTGGCAAGTCT AGGAGCATGTGGATCCTCGACTCTCGGTCTGGAAAGTTGCGTCAGGGTCCAAGGCCGCTT GTGGCCTGTGATCAGGCAGCTGTGGGACTAGTTAACGACAAGATTTACGTTTTTGGAGGA ATTGATGACATGAATAAAAGATACTACGAAGGTATCCATGCGCAAGTGTTTGACCTAAAG ACACAAACTTGGCATGTCGGGCCAAATCTCAGTGTGAAATTGGCATGCCTAAATAGGTCT GTGGTAACCCCGTCACTAGGCAGAAAAATTTATGTGAGGGGTACTGATCGAGACGTCACA ATCTATGACATTAAAGATGGTAAATGTGACAAGATAATTCCAGCTGATGATTTCAGCAGT GGAGATATGTGTGTGGTAGACAACGTGATCTACATGTATTACCATAATGTTGGGCTAATG TGGTATGAATCTAAGGAGAAACAATGGAGTGTGGTTCATGGTTTGGAGTTCAACGGGGTT TTTAATAGCATCGCAATTGCTGAATACAATGGAAAGTTGGCTTTTTTGTGGCATGATAGG AACAAACGTGAGATTTGGTGTGCAATGATCAACTTGTATGGGAGTAGTAAAGTAGCAATT CGAGGGCGGGTCGAGTGGTCTCATCGTCTACTTTCTGATCTCCCTTCTAACTACAACTTT AAACATTTTACTATTTGTACAGATTATTAA
Protein translation of T2H3.7
MSTTAKEAPSSLFSLLPNDIVLNILARVPRWYHPILSCVSKNLRFLVSSSELKITRSLLE KDRFYVCFQEHSNSPSLTTYHWFSFTENRRCLVSIPFTSPVEPYFATLTLGPEIYFVGKS RSMWILDSRSGKLRQGPRPLVACDQAAVGLVNDKIYVFGGIDDMNKRYYEGIHAQVFDLK TQTWHVGPNLSVKLACLNRSVVTPSLGRKIYVRGTDRDVTIYDIKDGKCDKIIPADDFSS GDMCVVDNVIYMYYHNVGLMWYESKEKQWSVVHGLEFNGVFNSIAIAEYNGKLAFLWHDR NKREIWCAMINLYGSSKVAIRGRVEWSHRLLSDLPSNYNFKHFTICTDY*
Multiple sequence analysis of selected A. thaliana hypothetical proteins
1 60
F16B22.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F16B22.19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T2H3.7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T5K18.110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T19P19.140 MSSEVEPPQKKKQPWLPDYIVENCLAHISRSYYPKLSLVSSPFALSSYPKSSTKRDIAST
61 120
F16B22.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F16B22.19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T2H3.7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T5K18.110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T19P19.140 TEEYFFHVCLQLPKSPLPTWYTLWIKPDQIEKKKKINTFTGNTRLVQIPSSYHYPFDQVF
121 180
F16B22.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F16B22.19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T2H3.7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T5K18.110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T19P19.140 IFNLYGAMRSSKAMVRPPSYGLHVLNRTRRRCLIEIGNYGGKLLILWNKFVHPGQFGDKH
181 240
F16B22.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F16B22.19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T2H3.7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T5K18.110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T19P19.140 IWCEVIALERRNETAASAATTSKGEPPSKKRKTNPSPPPSLLSLPDVLILNCLSRIPKSY
241 300
F16B22.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F16B22.19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T2H3.7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T5K18.110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T19P19.140 YPKLSIVSKTFRDLIISIDLNHARFHHKTQEHFFHVCLKLPDRPLPSWYTLWIKPQGFDD
301 360
F16B22.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F16B22.19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T2H3.7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T5K18.110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T19P19.140 KEEEKKKKKKSTLVQVPSSYASQTPLLVVGIDSDVYAFKQCYPPSRVMFVRNKECVIWRN
361 420
F16B22.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F16B22.19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T2H3.7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T5K18.110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~MERETSSSST
T19P19.140 APDMTVARANPVAYVFDRKIYVMGGCAETESANWGEVFDPKTQTWEPLPVPSPELRFSSM
421 480
F16B22.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F16B22.19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T2H3.7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T5K18.110 PPEDLVTSMIGKF.VAVMSNNDIRYEGVISLLNLQDSKLGLQNGNNSIPNPLQDLLFMIF
T19P19.140 IRK..IEMIQGKFYVRSNDSKDSVYDPIREKWNVA.AKPQLNDSRCSVGNVWYSCRPNSF
481 540
F16B22.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F16B22.19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T2H3.7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T5K18.110 FFSIVRVYGREVENDNEQRVFQVLKEVHSHMVFRGSDIKSVEVLSLPPPARHNSAIGHVG
T19P19.140 LW.....FDNEIKNWRLIKGLSSLNHSCRSGLIETVCYDGNLLLLWDKPTKPRRRV....
541 600
F16B22.12 ~~~~~~~~~~~~~~~~~~~~MSNADEPPQKTNQPPSSSLT...PPSLF.SLPVDIVLNIL
F16B22.19 ~~~~~~~~~~~~~~~~~~~~MSSSNEPPRKTDQPSSSSASASASPSLFLSLPLEIISMIL
T2H3.7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~MSTTAKEAPSSLFSLLPNDIVLNIL
T5K18.110 SLITTEDVRI.EGVISHVKFHDSMIFMKNCMCYGTEGRTKRRRSIVACNQLADDIVLNIL
T19P19.140 ....CEDKYICCALISFNKRKNGQVWAMDSEAEPPQEK.KKPNSCPSFLSLPEEILVNCL
601 660
F16B22.12 ALVPKRYYPILCCVSKSLRSLIRSPEIHKTRSLHG..KDSLYLCFS......TRTTYPNR
F16B22.19 ARVPKRYYPILCSVSKNMRSLVRSPEIHKARSLLG..KDYLYIGFI......DENYRPVY
T2H3.7 ARVPRWYHPILSCVSKNLRFLVSSSELKITRSLLE..KDRFYVCFQ......EHSNSP..
T5K18.110 ARISTSYYQTLSLVSKTFRLLILSKELDMERSYLGTRKPCVYVCLQSPTHPFDRRWFGLW
T19P19.140 ARIPKSYYPKLSLVCKSFCSLILSMELYVERLYLGTHEDVLHVCLQLPDRRLP.SWFSLW
661 720
F16B22.12 NRTTFHWFTLRRNDNKMNTTENVFVSIDVPYRPGHASYPLSNIAIDTEIYCIPGYNFPSS
F16B22.19 D....YWYTLRRIEN..STTENLFESIEFPY.PSEPN.RFSMNAVGPKIFFISESCTPSS
T2H3.7 SLTTYHWFSF..TENRRCLVSIPFTSPVEPY........FATLTLGPEIYFVG.....KS
T5K18.110 IKPYDHQPLTHWTIDIKCTGHWLL.PMPSPYSR...CLQIVHETVGSETYEIGGQNMTPS
T19P19.140 TKP.DQTLTNDIGKKKKSTRNTLLVPIPSSYSP...RVPMFIGEIGSELYAISKHN.TPS
721 780
F16B22.12 SIVWIFD.TQSGQWRQGPSMQVERLSATVGLVGGKIYVIGGNRG......EEILAEVFDL
F16B22.19 RLS.IFD.TRFGELRQGPCLLVKRGYNCVGLVGGKVYVIGGYQD......DEIAAESFDL
T2H3.7 RSMWILD.SRSGKLRQGPRPLVACDQAAVGLVNDKIYVFGGIDDMNKRYYEGIHAQVFDL
T5K18.110 TDVWVYDKL.IGKQRKAPSMMVARKNAFTCVLDGKLYVMGGCEADESTHW....AEVFDP
T19P19.140 SVMWVRDKTSIYAWRKAPSMTVARANVFAYVINGKIYVMGGCAADESKYW....AEVFDP
781 840
F16B22.12 KTQTWEAAPIPKAKDRNEWFTHASVSLDRKVYALNSREYMNSYDTRDGSYQRYTIPEDNW
F16B22.19 NTQTWEAAPIPDEKESHRWICKANVSFDRKVCALRSREGMTCYDTRDGSCQRSEMPNDQW
T2H3.7 KTQTWHVGPNLSVKLACLNRSVVTPSLGRKIYVRGTDRDVTIYDIKDGKCDKI.IPADDF
T5K18.110 KTQTWEALPDPGVELRYSSVKNIQTKQG.KVYVR.SNKKNFVYLIKE...CMWE.VAEEN
T19P19.140 KTQTWKPLTDPGAELRVSSIIGMAVSEG.KIYVKNSYVKDYVYDPEE...DKWDVVASSF
841 900
F16B22.12 WKTGKCVIDNVLFVYFLRFGLMWYDSELMLWRVVYGLDLDKA.R...CIGIGEYYGKLAF
F16B22.19 SRVGLCVIDNVLFVYFSRFGLMWYDSKLMLWRVVYGFDLDNA.R...SVGIGEYYGKLAF
T2H3.7 SSGDMCVVDNVIYMYYHNVGLMWYESKEKQWSVVHGLEFNGVFN...SIAIAEYNGKLAF
T5K18.110 LGESTCEIENVCYCY.SNKRYWWYDAKCEEWRLVKG................NYGGKLVV
T19P19.140 MIERKCEIENVLYRF.SRQSCSWYDTKHKEWRDIKGLATLNRRRRSSILEVAKYGDKVLI
901 960
F16B22.12 IWGKPS.NVSESKEIWCRMIGLL.RSEVGIHGTEEPS.QLLRIVPNNYSLRHCLSLSG*~
F16B22.19 IWEKPSLNVSESKEIWCRMIGLL.RSEVGIHGAAEPS.QLVEIVPNGYRMCHCLSLSG*~
T2H3.7 LW.....HDRNKREIWCAMINLYGSSKVAIRGRVEWSHRLLSDLPSNYNFKHFTICTDY*
T5K18.110 FWDRAVSRLTATKEIWCAMISLEKGHDGEIWGHIEWLDAVL.IAPRS...LKLIIQLVLI
T19P19.140 LWEIFAKPFYQNKSIWCAVIALEKRKIDEIWGKVKWASIVL.TVPRSYVFLRCEVKPV*~
961 986
F16B22.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~
F16B22.19 ~~~~~~~~~~~~~~~~~~~~~~~~~~
T2H3.7 ~~~~~~~~~~~~~~~~~~~~~~~~~~
T5K18.110 RTIYIIDIYSFLFSVMFFILYFIYI*
T19P19.140 ~~~~~~~~~~~~~~~~~~~~~~~~~~
written 18 Aug 98
Larry
Parnell