Gene T2H3.7
Putative Identification hypothetical protein
Position 899 to 1948, from the initial methionine to the termination codon
Strand +
EST hits none
Database match A. thaliana hypothetical proteins

 

CDS:  The table below lists the coordinates of the single T2H3.7 exon and which exon prediction algorithms selected the 5' and 3' termini (GS = GenScan, Gr = GRAIL, M = MZEF, NPG = NetPlantGene - selects splice sites only, not exons).

Exon Range 5' 3'
1 899 - 1948 GS GS

Alternate exons not used in building the gene model:   GRAIL predicts a single-exon gene from 848 to 1948, but when allowed to select exons beyond the termination codon at 1948, predicts a two-exon gene. These exons are from 848 to 1735 and from 2472 to 2525. The second exon shows no matches to entries in the database. NetPlantGene predicts splice acceptors at 883 (confidence score = 0.86), at 1495 (0.56) and at 2472 (1.00) and splice donors at 990 (0.87), at 2587 (0.97) and at 2969 (0.90).

Complete CDS of T2H3.7

ATGTCTACGACGGCCAAGGAGGCCCCGTCGTCGTTATTTTCGTTACTGCCAAACGATATT
GTTTTAAACATCTTAGCCCGCGTGCCAAGATGGTATCATCCAATTCTCTCTTGCGTGTCG
AAGAACTTACGATTTCTCGTAAGTTCCTCTGAGCTTAAGATTACGCGATCTCTTTTGGAG
AAAGATCGCTTCTACGTCTGCTTTCAAGAACATAGTAATTCTCCTTCCTTAACCACGTAT
CATTGGTTCAGTTTCACTGAGAATCGTCGTTGTTTGGTCTCGATCCCGTTCACTTCTCCT
GTTGAGCCTTACTTTGCAACTCTCACGCTGGGCCCCGAAATCTACTTTGTTGGCAAGTCT
AGGAGCATGTGGATCCTCGACTCTCGGTCTGGAAAGTTGCGTCAGGGTCCAAGGCCGCTT
GTGGCCTGTGATCAGGCAGCTGTGGGACTAGTTAACGACAAGATTTACGTTTTTGGAGGA
ATTGATGACATGAATAAAAGATACTACGAAGGTATCCATGCGCAAGTGTTTGACCTAAAG
ACACAAACTTGGCATGTCGGGCCAAATCTCAGTGTGAAATTGGCATGCCTAAATAGGTCT
GTGGTAACCCCGTCACTAGGCAGAAAAATTTATGTGAGGGGTACTGATCGAGACGTCACA
ATCTATGACATTAAAGATGGTAAATGTGACAAGATAATTCCAGCTGATGATTTCAGCAGT
GGAGATATGTGTGTGGTAGACAACGTGATCTACATGTATTACCATAATGTTGGGCTAATG
TGGTATGAATCTAAGGAGAAACAATGGAGTGTGGTTCATGGTTTGGAGTTCAACGGGGTT
TTTAATAGCATCGCAATTGCTGAATACAATGGAAAGTTGGCTTTTTTGTGGCATGATAGG
AACAAACGTGAGATTTGGTGTGCAATGATCAACTTGTATGGGAGTAGTAAAGTAGCAATT
CGAGGGCGGGTCGAGTGGTCTCATCGTCTACTTTCTGATCTCCCTTCTAACTACAACTTT
AAACATTTTACTATTTGTACAGATTATTAA

 

Protein translation of T2H3.7

MSTTAKEAPSSLFSLLPNDIVLNILARVPRWYHPILSCVSKNLRFLVSSSELKITRSLLE
KDRFYVCFQEHSNSPSLTTYHWFSFTENRRCLVSIPFTSPVEPYFATLTLGPEIYFVGKS
RSMWILDSRSGKLRQGPRPLVACDQAAVGLVNDKIYVFGGIDDMNKRYYEGIHAQVFDLK
TQTWHVGPNLSVKLACLNRSVVTPSLGRKIYVRGTDRDVTIYDIKDGKCDKIIPADDFSS
GDMCVVDNVIYMYYHNVGLMWYESKEKQWSVVHGLEFNGVFNSIAIAEYNGKLAFLWHDR
NKREIWCAMINLYGSSKVAIRGRVEWSHRLLSDLPSNYNFKHFTICTDY*

 

Multiple sequence analysis of selected A. thaliana hypothetical proteins

            1                                                         60
 F16B22.12  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 F16B22.19  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
    T2H3.7  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 T5K18.110  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
T19P19.140  MSSEVEPPQKKKQPWLPDYIVENCLAHISRSYYPKLSLVSSPFALSSYPKSSTKRDIAST 

            61                                                       120
 F16B22.12  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 F16B22.19  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
    T2H3.7  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 T5K18.110  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
T19P19.140  TEEYFFHVCLQLPKSPLPTWYTLWIKPDQIEKKKKINTFTGNTRLVQIPSSYHYPFDQVF 

            121                                                      180
 F16B22.12  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 F16B22.19  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
    T2H3.7  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 T5K18.110  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
T19P19.140  IFNLYGAMRSSKAMVRPPSYGLHVLNRTRRRCLIEIGNYGGKLLILWNKFVHPGQFGDKH 

            181                                                      240
 F16B22.12  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 F16B22.19  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
    T2H3.7  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 T5K18.110  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
T19P19.140  IWCEVIALERRNETAASAATTSKGEPPSKKRKTNPSPPPSLLSLPDVLILNCLSRIPKSY 

            241                                                      300
 F16B22.12  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 F16B22.19  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
    T2H3.7  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 T5K18.110  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
T19P19.140  YPKLSIVSKTFRDLIISIDLNHARFHHKTQEHFFHVCLKLPDRPLPSWYTLWIKPQGFDD 

            301                                                      360
 F16B22.12  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 F16B22.19  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
    T2H3.7  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 T5K18.110  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
T19P19.140  KEEEKKKKKKSTLVQVPSSYASQTPLLVVGIDSDVYAFKQCYPPSRVMFVRNKECVIWRN 

            361                                                      420
 F16B22.12  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 F16B22.19  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
    T2H3.7  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 T5K18.110  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~MERETSSSST 
T19P19.140  APDMTVARANPVAYVFDRKIYVMGGCAETESANWGEVFDPKTQTWEPLPVPSPELRFSSM 

            421                                                      480
 F16B22.12  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 F16B22.19  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
    T2H3.7  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 T5K18.110  PPEDLVTSMIGKF.VAVMSNNDIRYEGVISLLNLQDSKLGLQNGNNSIPNPLQDLLFMIF 
T19P19.140  IRK..IEMIQGKFYVRSNDSKDSVYDPIREKWNVA.AKPQLNDSRCSVGNVWYSCRPNSF 

            481                                                      540
 F16B22.12  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 F16B22.19  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
    T2H3.7  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 T5K18.110  FFSIVRVYGREVENDNEQRVFQVLKEVHSHMVFRGSDIKSVEVLSLPPPARHNSAIGHVG 
T19P19.140  LW.....FDNEIKNWRLIKGLSSLNHSCRSGLIETVCYDGNLLLLWDKPTKPRRRV.... 

            541                                                      600
 F16B22.12  ~~~~~~~~~~~~~~~~~~~~MSNADEPPQKTNQPPSSSLT...PPSLF.SLPVDIVLNIL 
 F16B22.19  ~~~~~~~~~~~~~~~~~~~~MSSSNEPPRKTDQPSSSSASASASPSLFLSLPLEIISMIL 
    T2H3.7  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~MSTTAKEAPSSLFSLLPNDIVLNIL 
 T5K18.110  SLITTEDVRI.EGVISHVKFHDSMIFMKNCMCYGTEGRTKRRRSIVACNQLADDIVLNIL 
T19P19.140  ....CEDKYICCALISFNKRKNGQVWAMDSEAEPPQEK.KKPNSCPSFLSLPEEILVNCL 

            601                                                      660
 F16B22.12  ALVPKRYYPILCCVSKSLRSLIRSPEIHKTRSLHG..KDSLYLCFS......TRTTYPNR 
 F16B22.19  ARVPKRYYPILCSVSKNMRSLVRSPEIHKARSLLG..KDYLYIGFI......DENYRPVY 
    T2H3.7  ARVPRWYHPILSCVSKNLRFLVSSSELKITRSLLE..KDRFYVCFQ......EHSNSP.. 
 T5K18.110  ARISTSYYQTLSLVSKTFRLLILSKELDMERSYLGTRKPCVYVCLQSPTHPFDRRWFGLW 
T19P19.140  ARIPKSYYPKLSLVCKSFCSLILSMELYVERLYLGTHEDVLHVCLQLPDRRLP.SWFSLW 

            661                                                      720
 F16B22.12  NRTTFHWFTLRRNDNKMNTTENVFVSIDVPYRPGHASYPLSNIAIDTEIYCIPGYNFPSS 
 F16B22.19  D....YWYTLRRIEN..STTENLFESIEFPY.PSEPN.RFSMNAVGPKIFFISESCTPSS 
    T2H3.7  SLTTYHWFSF..TENRRCLVSIPFTSPVEPY........FATLTLGPEIYFVG.....KS 
 T5K18.110  IKPYDHQPLTHWTIDIKCTGHWLL.PMPSPYSR...CLQIVHETVGSETYEIGGQNMTPS 
T19P19.140  TKP.DQTLTNDIGKKKKSTRNTLLVPIPSSYSP...RVPMFIGEIGSELYAISKHN.TPS 

            721                                                      780
 F16B22.12  SIVWIFD.TQSGQWRQGPSMQVERLSATVGLVGGKIYVIGGNRG......EEILAEVFDL 
 F16B22.19  RLS.IFD.TRFGELRQGPCLLVKRGYNCVGLVGGKVYVIGGYQD......DEIAAESFDL 
    T2H3.7  RSMWILD.SRSGKLRQGPRPLVACDQAAVGLVNDKIYVFGGIDDMNKRYYEGIHAQVFDL 
 T5K18.110  TDVWVYDKL.IGKQRKAPSMMVARKNAFTCVLDGKLYVMGGCEADESTHW....AEVFDP 
T19P19.140  SVMWVRDKTSIYAWRKAPSMTVARANVFAYVINGKIYVMGGCAADESKYW....AEVFDP 

            781                                                      840
 F16B22.12  KTQTWEAAPIPKAKDRNEWFTHASVSLDRKVYALNSREYMNSYDTRDGSYQRYTIPEDNW 
 F16B22.19  NTQTWEAAPIPDEKESHRWICKANVSFDRKVCALRSREGMTCYDTRDGSCQRSEMPNDQW 
    T2H3.7  KTQTWHVGPNLSVKLACLNRSVVTPSLGRKIYVRGTDRDVTIYDIKDGKCDKI.IPADDF 
 T5K18.110  KTQTWEALPDPGVELRYSSVKNIQTKQG.KVYVR.SNKKNFVYLIKE...CMWE.VAEEN 
T19P19.140  KTQTWKPLTDPGAELRVSSIIGMAVSEG.KIYVKNSYVKDYVYDPEE...DKWDVVASSF 

            841                                                      900
 F16B22.12  WKTGKCVIDNVLFVYFLRFGLMWYDSELMLWRVVYGLDLDKA.R...CIGIGEYYGKLAF 
 F16B22.19  SRVGLCVIDNVLFVYFSRFGLMWYDSKLMLWRVVYGFDLDNA.R...SVGIGEYYGKLAF 
    T2H3.7  SSGDMCVVDNVIYMYYHNVGLMWYESKEKQWSVVHGLEFNGVFN...SIAIAEYNGKLAF 
 T5K18.110  LGESTCEIENVCYCY.SNKRYWWYDAKCEEWRLVKG................NYGGKLVV 
T19P19.140  MIERKCEIENVLYRF.SRQSCSWYDTKHKEWRDIKGLATLNRRRRSSILEVAKYGDKVLI 

            901                                                      960
 F16B22.12  IWGKPS.NVSESKEIWCRMIGLL.RSEVGIHGTEEPS.QLLRIVPNNYSLRHCLSLSG*~ 
 F16B22.19  IWEKPSLNVSESKEIWCRMIGLL.RSEVGIHGAAEPS.QLVEIVPNGYRMCHCLSLSG*~ 
    T2H3.7  LW.....HDRNKREIWCAMINLYGSSKVAIRGRVEWSHRLLSDLPSNYNFKHFTICTDY* 
 T5K18.110  FWDRAVSRLTATKEIWCAMISLEKGHDGEIWGHIEWLDAVL.IAPRS...LKLIIQLVLI 
T19P19.140  LWEIFAKPFYQNKSIWCAVIALEKRKIDEIWGKVKWASIVL.TVPRSYVFLRCEVKPV*~ 

            961                    986
 F16B22.12  ~~~~~~~~~~~~~~~~~~~~~~~~~~
 F16B22.19  ~~~~~~~~~~~~~~~~~~~~~~~~~~
    T2H3.7  ~~~~~~~~~~~~~~~~~~~~~~~~~~
 T5K18.110  RTIYIIDIYSFLFSVMFFILYFIYI*
T19P19.140  ~~~~~~~~~~~~~~~~~~~~~~~~~~

 


written 18 Aug 98
Larry Parnell