| Gene | T2H3.8 |
| Putative Identification | sugar synthetase |
| Position | 17909 to 21462, from the initial methionine to the termination codon |
| Strand | - |
| EST hits | F13912 and F13913 |
| Database match | several plant sugar synthetases, including P. sativum second sugar synthase AJ001071 and beet sucrose synthase (EC 2.4.1.13) S71494 |
mRNA: Using the ESTs, the 5'-UTR can be defined from 21463 to 21640 and the 3'-UTR from 17765 to 17908.
CDS: The table below lists the coordinates of the T2H3.8 exons and the exon prediction algorithms that selected each 3' and 5' terminus (GS = GenScan, Gr = GRAIL, M = MZEF, NPG = NetPlantGene, which selects splice sites only, not exons). EST signifies a splice site determined by identity to an EST, while est signifies one determined by similarity to a related EST.
| Exon | Range | 3' | 5' |
|---|---|---|---|
| 1 | 21365 - 21462 | EST, GS, NPG | GS, Gr |
| 2 | 21144 - 21276 | est, GS, Gr, M, NPG | EST, GS, M, NPG |
| 3 | 20729 - 21073 | GS, Gr, M, NPG | est, GS, Gr, M, NPG |
| 4 | 20307 - 20642 | est, GS, Gr, M, NPG | GS, Gr, M, NPG |
| 5 | 20136 - 20231 | est, M, NPG | est, Gr, M, NPG |
| 6 | 19748 - 20038 | est, GS (split exon), Gr, M, NPG | est, GS (split exon), NPG |
| 7 | 19488 - 19654 | est, GS, M, NPG | est, GS, Gr, M, NPG |
| 8 | 19199 - 19423 | GS, Gr, M, NPG | est, GS, Gr, M, NPG |
| 9 | 18733 - 19054 | GS, Gr, M, NPG | GS, Gr, M, NPG |
| 10 | 18248 - 18492 | EST, GS, Gr, M, NPG | GS, Gr, M, NPG |
| 11 | 18027 - 18165 | EST, GS, Gr, NPG | EST, GS, Gr, NPG |
| 12 | 17909 - 17941 | GS, Gr | EST, GS, Gr, NPG |
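As a consistency check on the exon table, the sketch below recomputes exon and intron lengths from the listed coordinates. It assumes the coordinates are 1-based and inclusive, and that the exons are listed in descending genomic order because the gene lies on the minus strand. The names `EXONS`, `exon_lengths`, and `intron_lengths` are illustrative and not part of the original annotation.

```python
# Exon coordinates (low, high) copied from the table; assumed 1-based, inclusive.
EXONS = [
    (21365, 21462), (21144, 21276), (20729, 21073), (20307, 20642),
    (20136, 20231), (19748, 20038), (19488, 19654), (19199, 19423),
    (18733, 19054), (18248, 18492), (18027, 18165), (17909, 17941),
]

def exon_lengths(exons):
    """Length of each exon in nucleotides (inclusive coordinates)."""
    return [high - low + 1 for low, high in exons]

def intron_lengths(exons):
    """Length of the gap between each pair of consecutive exons.

    Exons are in descending genomic order (minus strand), so the intron
    lies between the high end of the next exon and the low end of the
    current one.
    """
    return [
        cur_low - next_high - 1
        for (cur_low, _), (_, next_high) in zip(exons, exons[1:])
    ]

cds_length = sum(exon_lengths(EXONS))
print("CDS length:", cds_length, "nt")
print("Whole number of codons:", cds_length % 3 == 0)
print("Intron lengths:", intron_lengths(EXONS))
```

If the table is internal consistent, the exon and intron lengths together should add up to the full span given in the Position row (17909 to 21462).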
Alternate exons not used in building the gene model: GenScan predicts a split in exon 6 (19748 to 19878 and 19918 to 20038), yielding an intron of 39 nt. GRAIL predicts exon 1 from 21358 to 21462, exon 2 from 21144 to 21263, exon 5 from 20164 to 20231, exon 6 from 19748 to 20069, and exon 7 from 19458 to 19654. MZEF predicts exon 6 from 19748 to 20069. NetPlantGene predicts a splice acceptor with a confidence score of 0.94 at position 21874.
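The alternate predictions can be compared with the gene model by differencing coordinates. The following is a minimal sketch under the same assumptions as the table (1-based inclusive coordinates, minus strand, so the lower coordinate of an exon is its 3' terminus). The dictionaries and function names are hypothetical helpers introduced only for illustration.

```python
# Model exons and GRAIL alternates, keyed by exon number (low, high).
MODEL = {1: (21365, 21462), 2: (21144, 21276), 5: (20136, 20231),
         6: (19748, 20038), 7: (19488, 19654)}
GRAIL = {1: (21358, 21462), 2: (21144, 21263), 5: (20164, 20231),
         6: (19748, 20069), 7: (19458, 19654)}

def terminus_differences(model, alternate):
    """Return {exon: (delta_3prime, delta_5prime)} in nucleotides.

    On the minus strand the low coordinate is the 3' terminus and the
    high coordinate is the 5' terminus. A delta of 0 means the
    alternate prediction agrees with the model at that terminus.
    """
    diffs = {}
    for exon, (alt_low, alt_high) in alternate.items():
        low, high = model[exon]
        diffs[exon] = (alt_low - low, alt_high - high)
    return diffs

def gap_between(lower_exon, upper_exon):
    """Nucleotides strictly between two non-overlapping exons."""
    return upper_exon[0] - lower_exon[1] - 1

for exon, (d3, d5) in sorted(terminus_differences(MODEL, GRAIL).items()):
    print(f"exon {exon}: 3' offset {d3:+d}, 5' offset {d5:+d}")

# Gap implied by the two GenScan sub-exons of exon 6.
print("GenScan exon 6 gap:", gap_between((19748, 19878), (19918, 20038)), "nt")
```

A terminus with a nonzero offset here should correspond to a table cell in which that algorithm is not listed.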
Complete CDS of T2H3.8
ATGGCAAACCCTAAGCTCACTAGGGTTCTAAGCACAAGGGATCGCGTGCAAGACACGCTT TCCGCTCACCGCAACGAACTCGTTGCTCTTCTCTCCAGGTATGTGGATCAGGGGAAAGGG ATTCTTCAACCACATAACTTAATTGACGAACTCGAATCTGTTATCGGAGACGATGAAACA AAGAAGAGTCTCTCTGATGGTCCTTTTGGAGAGATCCTTAAATCAGCAATGGAAGCTATA GTTGTACCACCTTTTGTTGCGTTAGCCGTTAGACCAAGACCTGGTGTTTGGGAATATGTT CGTGTTAATGTCTTCGAGCTAAGTGTTGAACAATTAACAGTCTCTGAGTATCTTCGTTTC AAAGAAGAACTCGTTGATGGACCTAATAGTGACCCTTTTTGTCTTGAGCTTGATTTCGAG CCCTTTAACGCAAACGTGCCACGTCCTTCTCGTTCGTCTTCGATTGGTAATGGAGTTCAG TTTCTGAATCGTCACTTGTCTTCTGTTATGTTCCGTAACAAAGATTGCTTGGAGCCTCTG CTTGATTTCCTTAGAGTTCATAAGTACAAAGGTCATCCGTTGATGTTGAATGATCGGATT CAAAGCATATCTAGGCTTCAAATCCAGCTTAGTAAAGCAGAAGATCATATCTCTAAGCTT TCACAAGAAACTCCGTTCTCGGAATTCGAATACGCGTTGCAAGGAATGGGTTTTGAGAAA GGATGGGGAGATACCGCAGGGAGAGTTCTTGAAATGATGCATCTTCTCTCTGATATTCTT CAAGCTCCTGATCCTTCGTCCTTGGAGAAGTTTCTTGGGATGGTACCAATGGTTTTCAAC GTTGTGATCTTATCTCCACATGGATATTTCGGGCAAGCCAATGTTTTAGGCTTACCTGAC ACTGGTGGACAAGTTGTCTATATTCTTGACCAAGTCCGTGCCCTTGAGACTGAAATGCTG TTGAGAATAAAGAGACAGGGGTTGGATATATCACCTAGTATTCTTATTGTAACTAGGTTG ATACCGGATGCTAAAGGAACTACGTGTAACCAGCGGTTAGAGAGAGTCAGCGGAACAGAG CATACTCATATTCTCCGGGTTCCTTTTAGGTCTGAGAAAGGAATCCTCCGTAAGTGGATT TCAAGATTCGACGTATGGCCTTATCTAGAGAACTATGCTCAGGATGCAGCAAGCGAGATT GTCGGTGAATTGCAAGGCGTACCGGACTTTATCATCGGTAACTATAGTGACGGAAACCTT GTTGCATCGTTAATGGCACATAGAATGGGTGTTACACAATGTACTATTGCACATGCTTTG GAGAAAACCAAGTATCCAGATTCAGACATTTACTGGAAAGACTTCGACAACAAGTATCAT TTCTCTTGTCAATTCACAGCTGATCTTATCGCAATGAACAACGCAGATTTCATCATCACA AGCACTTACCAAGAAATCGCAGGAACGAAGAACACCGTCGGTCAATATGAAAGCCACGGG GCTTTTACGCTCCCGGGACTATATAGAGTAGTACACGGCATCGATGTGTTTGATCCGAAG TTCAACATAGTCTCGCCCGGTGCAGACATGACCATATATTTCCCGTATTCCGAAGAAACT AGGAGACTTACAGCTTTACATGGTTCAATAGAGGAAATGCTCTATAGCCCTGACCAGACT GATGAGCATGTCGGTACACTGAGTGATCGATCAAAGCCAATACTCTTCTCTATGGCGAGG CTCGACAAAGTGAAGAACATCTCCGGTTTAGTTGAGATGTATAGTAAGAACACAAAGTTG AGGGAGCTGGTTAATCTGGTTGTAATAGCTGGTAACATTGATGTGAACAAGTCCAAAGAT AGAGAAGAAATCGTAGAGATTGAGAAAATGCATAACCTTATGAAGAATTACAAGCTTGAT 
GGACAGTTTCGTTGGATAACTGCTCAGACTAACCGAGCTCGAAATGGTGAGCTTTACCGC TACATCGCGGATACAAGAGGTGCTTTTGCTCAGCCTGCGTTCTACGAGGCTTTTGGACTT ACGGTAGTGGAAGCGATGACTTGCGGGCTCCCGACTTTTGCCACTTGTCACGGTGGTCCA GCAGAGATCATCGAGCACGGGCTCTCGGGTTTCCACATCGATCCATACCATCCTGAGCAA GCGGGTAACATAATGGCTGATTTCTTTGAACGTTGTAAGGAAGATCCAAACCATTGGAAG AAAGTATCAGACGCTGGTCTCCAAAGGATATACGAAAGGTACACATGGAAGATATACTCG GAGAGATTGATGACACTAGCTGGTGTGTATGGTTTCTGGAAATACGTATCGAAATTGGAG CGTCGTGAGACTCGGCGATATCTTGAAATGTTCTACATTCTCAAATTCCGCGACTTGGTG AAAACTGTTCCTTCAACCGCCGATGACTGA
Protein translation of T2H3.8
MANPKLTRVLSTRDRVQDTLSAHRNELVALLSRYVDQGKGILQPHNLIDELESVIGDDET KKSLSDGPFGEILKSAMEAIVVPPFVALAVRPRPGVWEYVRVNVFELSVEQLTVSEYLRF KEELVDGPNSDPFCLELDFEPFNANVPRPSRSSSIGNGVQFLNRHLSSVMFRNKDCLEPL LDFLRVHKYKGHPLMLNDRIQSISRLQIQLSKAEDHISKLSQETPFSEFEYALQGMGFEK GWGDTAGRVLEMMHLLSDILQAPDPSSLEKFLGMVPMVFNVVILSPHGYFGQANVLGLPD TGGQVVYILDQVRALETEMLLRIKRQGLDISPSILIVTRLIPDAKGTTCNQRLERVSGTE HTHILRVPFRSEKGILRKWISRFDVWPYLENYAQDAASEIVGELQGVPDFIIGNYSDGNL VASLMAHRMGVTQCTIAHALEKTKYPDSDIYWKDFDNKYHFSCQFTADLIAMNNADFIIT STYQEIAGTKNTVGQYESHGAFTLPGLYRVVHGIDVFDPKFNIVSPGADMTIYFPYSEET RRLTALHGSIEEMLYSPDQTDEHVGTLSDRSKPILFSMARLDKVKNISGLVEMYSKNTKL RELVNLVVIAGNIDVNKSKDREEIVEIEKMHNLMKNYKLDGQFRWITAQTNRARNGELYR YIADTRGAFAQPAFYEAFGLTVVEAMTCGLPTFATCHGGPAEIIEHGLSGFHIDPYHPEQ AGNIMADFFERCKEDPNHWKKVSDAGLQRIYERYTWKIYSERLMTLAGVYGFWKYVSKLE RRETRRYLEMFYILKFRDLVKTVPSTADD*
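The protein translation can be reproduced from the CDS with the standard genetic code. The sketch below assumes the standard nuclear code (NCBI translation table 1) and builds the codon table from its usual compact TCAG-ordered string. The function name `translate` is an illustrative choice, not a tool used in the original annotation.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a coding sequence; whitespace between blocks is ignored."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))

# First six and last four codons of the T2H3.8 CDS shown above.
print(translate("ATGGCAAACCCTAAGCTC"))  # MANPKL
print(translate("ACCGCCGATGACTGA"))     # TADD*
```

Passing the complete CDS text, including the spaces between 60-nt blocks, should return a string that can be compared directly with the listed protein once its spaces are removed.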
written 28 Aug 98
Larry Parnell