Gene T2H3.8
Putative Identification sugar synthetase
Position 17909 to 21462, from the initial methionine to the termination codon
Strand -
EST hits F13912 and F13913
Database match several plant sugar synthetases, including
P. sativum second sugar synthase AJ001071 and
beet sucrose synthase (EC 2.4.1.13) S71494

 

mRNA:  Using the ESTs, 5'-UTR can be defined from 21463 to 21640 and the 3'-UTR can be defined from 17765 to 17908.

CDS:  The table below lists the coordinates of the T2H3.8 exons and which exon prediction algorithms selected the 3' and 5' termini (GS = GenScan, Gr = GRAIL, M = MZEF, NPG = NetPlantGene - selects splice sites only, not exons). EST signifies a splice site determined by identity to an EST, while est signifies that determined by similarity to a related EST.

Exon Range 3' 5'
1 21365 - 21462 EST, GS, NPG GS, Gr
2 21144 - 21276 est, GS, Gr, M, NPG EST, GS, M, NPG
3 20729 - 21073 GS, Gr, M, NPG est, GS, Gr, M, NPG
4 20307 - 20642 est, GS, Gr, M, NPG GS, Gr, M, NPG
5 20136 - 20231 est, M, NPG est, Gr, M, NPG
6 19748 - 20038 est, GS (split exon), Gr, M, NPG est, GS (split exon), NPG
7 19488 - 19654 est, GS, M, NPG est, GS, Gr, M, NPG
8 19199 - 19423 GS, Gr, M, NPG est, GS, Gr, M, NPG
9 18733 - 19054 GS, Gr, M, NPG GS, Gr, M, NPG
10 18248 - 18492 EST, GS, Gr, M, NPG GS, Gr, M, NPG
11 18027 - 18165 EST, GS, Gr, NPG EST, GS, Gr, NPG
12 17909 - 17941 GS, Gr EST, GS, Gr, NPG

Alternate exons not used in building the gene model:  GenScan predicts a split in exon 6: 19748 to 19878 and 19918 to 20038, yielding an intron of 29 nt. GRAIL predicts exon 1 from 21358 to 21462, exon 2 from 21144 to 21263, exon 5 from 20164 to 20231, exon 6 from 19748 to 20069, exon 7 from 19458 to 19654. MZEF predicts exon 6 from 19748 to 20069. NetPlantGene predicts a splice acceptor of confidence score 0.94 at position 21874.

Complete CDS of T2H3.8

ATGGCAAACCCTAAGCTCACTAGGGTTCTAAGCACAAGGGATCGCGTGCAAGACACGCTT
TCCGCTCACCGCAACGAACTCGTTGCTCTTCTCTCCAGGTATGTGGATCAGGGGAAAGGG
ATTCTTCAACCACATAACTTAATTGACGAACTCGAATCTGTTATCGGAGACGATGAAACA
AAGAAGAGTCTCTCTGATGGTCCTTTTGGAGAGATCCTTAAATCAGCAATGGAAGCTATA
GTTGTACCACCTTTTGTTGCGTTAGCCGTTAGACCAAGACCTGGTGTTTGGGAATATGTT
CGTGTTAATGTCTTCGAGCTAAGTGTTGAACAATTAACAGTCTCTGAGTATCTTCGTTTC
AAAGAAGAACTCGTTGATGGACCTAATAGTGACCCTTTTTGTCTTGAGCTTGATTTCGAG
CCCTTTAACGCAAACGTGCCACGTCCTTCTCGTTCGTCTTCGATTGGTAATGGAGTTCAG
TTTCTGAATCGTCACTTGTCTTCTGTTATGTTCCGTAACAAAGATTGCTTGGAGCCTCTG
CTTGATTTCCTTAGAGTTCATAAGTACAAAGGTCATCCGTTGATGTTGAATGATCGGATT
CAAAGCATATCTAGGCTTCAAATCCAGCTTAGTAAAGCAGAAGATCATATCTCTAAGCTT
TCACAAGAAACTCCGTTCTCGGAATTCGAATACGCGTTGCAAGGAATGGGTTTTGAGAAA
GGATGGGGAGATACCGCAGGGAGAGTTCTTGAAATGATGCATCTTCTCTCTGATATTCTT
CAAGCTCCTGATCCTTCGTCCTTGGAGAAGTTTCTTGGGATGGTACCAATGGTTTTCAAC
GTTGTGATCTTATCTCCACATGGATATTTCGGGCAAGCCAATGTTTTAGGCTTACCTGAC
ACTGGTGGACAAGTTGTCTATATTCTTGACCAAGTCCGTGCCCTTGAGACTGAAATGCTG
TTGAGAATAAAGAGACAGGGGTTGGATATATCACCTAGTATTCTTATTGTAACTAGGTTG
ATACCGGATGCTAAAGGAACTACGTGTAACCAGCGGTTAGAGAGAGTCAGCGGAACAGAG
CATACTCATATTCTCCGGGTTCCTTTTAGGTCTGAGAAAGGAATCCTCCGTAAGTGGATT
TCAAGATTCGACGTATGGCCTTATCTAGAGAACTATGCTCAGGATGCAGCAAGCGAGATT
GTCGGTGAATTGCAAGGCGTACCGGACTTTATCATCGGTAACTATAGTGACGGAAACCTT
GTTGCATCGTTAATGGCACATAGAATGGGTGTTACACAATGTACTATTGCACATGCTTTG
GAGAAAACCAAGTATCCAGATTCAGACATTTACTGGAAAGACTTCGACAACAAGTATCAT
TTCTCTTGTCAATTCACAGCTGATCTTATCGCAATGAACAACGCAGATTTCATCATCACA
AGCACTTACCAAGAAATCGCAGGAACGAAGAACACCGTCGGTCAATATGAAAGCCACGGG
GCTTTTACGCTCCCGGGACTATATAGAGTAGTACACGGCATCGATGTGTTTGATCCGAAG
TTCAACATAGTCTCGCCCGGTGCAGACATGACCATATATTTCCCGTATTCCGAAGAAACT
AGGAGACTTACAGCTTTACATGGTTCAATAGAGGAAATGCTCTATAGCCCTGACCAGACT
GATGAGCATGTCGGTACACTGAGTGATCGATCAAAGCCAATACTCTTCTCTATGGCGAGG
CTCGACAAAGTGAAGAACATCTCCGGTTTAGTTGAGATGTATAGTAAGAACACAAAGTTG
AGGGAGCTGGTTAATCTGGTTGTAATAGCTGGTAACATTGATGTGAACAAGTCCAAAGAT
AGAGAAGAAATCGTAGAGATTGAGAAAATGCATAACCTTATGAAGAATTACAAGCTTGAT
GGACAGTTTCGTTGGATAACTGCTCAGACTAACCGAGCTCGAAATGGTGAGCTTTACCGC
TACATCGCGGATACAAGAGGTGCTTTTGCTCAGCCTGCGTTCTACGAGGCTTTTGGACTT
ACGGTAGTGGAAGCGATGACTTGCGGGCTCCCGACTTTTGCCACTTGTCACGGTGGTCCA
GCAGAGATCATCGAGCACGGGCTCTCGGGTTTCCACATCGATCCATACCATCCTGAGCAA
GCGGGTAACATAATGGCTGATTTCTTTGAACGTTGTAAGGAAGATCCAAACCATTGGAAG
AAAGTATCAGACGCTGGTCTCCAAAGGATATACGAAAGGTACACATGGAAGATATACTCG
GAGAGATTGATGACACTAGCTGGTGTGTATGGTTTCTGGAAATACGTATCGAAATTGGAG
CGTCGTGAGACTCGGCGATATCTTGAAATGTTCTACATTCTCAAATTCCGCGACTTGGTG
AAAACTGTTCCTTCAACCGCCGATGACTGA

 

Protein translation of T2H3.8

MANPKLTRVLSTRDRVQDTLSAHRNELVALLSRYVDQGKGILQPHNLIDELESVIGDDET
KKSLSDGPFGEILKSAMEAIVVPPFVALAVRPRPGVWEYVRVNVFELSVEQLTVSEYLRF
KEELVDGPNSDPFCLELDFEPFNANVPRPSRSSSIGNGVQFLNRHLSSVMFRNKDCLEPL
LDFLRVHKYKGHPLMLNDRIQSISRLQIQLSKAEDHISKLSQETPFSEFEYALQGMGFEK
GWGDTAGRVLEMMHLLSDILQAPDPSSLEKFLGMVPMVFNVVILSPHGYFGQANVLGLPD
TGGQVVYILDQVRALETEMLLRIKRQGLDISPSILIVTRLIPDAKGTTCNQRLERVSGTE
HTHILRVPFRSEKGILRKWISRFDVWPYLENYAQDAASEIVGELQGVPDFIIGNYSDGNL
VASLMAHRMGVTQCTIAHALEKTKYPDSDIYWKDFDNKYHFSCQFTADLIAMNNADFIIT
STYQEIAGTKNTVGQYESHGAFTLPGLYRVVHGIDVFDPKFNIVSPGADMTIYFPYSEET
RRLTALHGSIEEMLYSPDQTDEHVGTLSDRSKPILFSMARLDKVKNISGLVEMYSKNTKL
RELVNLVVIAGNIDVNKSKDREEIVEIEKMHNLMKNYKLDGQFRWITAQTNRARNGELYR
YIADTRGAFAQPAFYEAFGLTVVEAMTCGLPTFATCHGGPAEIIEHGLSGFHIDPYHPEQ
AGNIMADFFERCKEDPNHWKKVSDAGLQRIYERYTWKIYSERLMTLAGVYGFWKYVSKLE
RRETRRYLEMFYILKFRDLVKTVPSTADD*

 


written 28 Aug 98
Larry Parnell