Gene T2H3.13
indentical to T10M13.20, within the overlap region of T10M13 and T2H3
Putative Identification drought-induced-19-like 1
Position 42425 to 43852, from the initial methionine to the termination codon
Strand -
EST match T42823 and AA395188
Database match Di19 (drought-induced 19, X78584)

 

CDS:  The table below lists the coordinates for each exon of T2H3.R and which exon predicting programs selected the 5' and 3' termini (GS = GenScan, Gr = GRAIL, M = MZEF, NPG = NetPlantGene - selects splice sites only, not exons). Exon termini determined by ESTs AA395188 or T42823 are so designated.

Exon Range 3' 5'
1 43771 - 43852 EST, GS, Gr, M, NPG GS, Gr
2 43431 - 43564 EST, Gr, M, NPG EST, GS, Gr, M, NPG
3 43021 - 43098 EST, Gr, M, NPG EST, Gr, M, NPG
4 42638 - 42870 EST, NPG EST, Gr, NPG
5 42425 - 42542   ESTM, NPG

Alternate exons not used in building the gene model. GenScan predicts a terminal exon from 43371 to 43564. GRAIL does not select any model exons in this region. The EST (T42823) match to a GRAIL "intergenic region" necessitated assembly of shadow exons. Exon 4 extends from 42766 to 42870, exon 5 from 42469 to 42542, and a sixth exon from 42162 to 42260. MZEF selects internal exons from 43771 to 43859 and from 42469 to 42542. NetPlantGene predicts many putative splice sites in this region, only the splice donor at 42469 has a high confidence score (0.96).

Complete CDS of T2H3.13

ATGGAAGAAGATTTGTTGGGCATTTGTGGGTTTGATTCGTCGAAGAAATATCGATTAGAA
GAACTTGCCAAGTATCAGTCGGGTTCATGTATTGAGTTTGAAGATGATGATGAAATGGCA
GTGGATTATCCATGCCCGTTTTGCTCAGATGATTATGATTTAGTTGAATTGTGTCACCAT
ATCGATGAGGAGCATCAACTAGACGCCAACAATGGGATATGTCCGGTTTGTAGCAGACGA
GTGAAGATGCATATGGTTGATCACATTACCACTCAGCATAGAGATGTCTTCAAGAGACTT
TACAAGGATGAGTCATATTCAGCATTTTCTCCAGGGACTAGGAAATACTTACAGTCTCTA
ATCGATGAGCCGTTGTCTACTAATCATACATCTAAAAGTGTTCTGGACCCATTATTATCA
TTTATATACAATCCGCCGTCACCAAAGAAGTCCAAGCTTGTACAACCTGATTCATCTAGT
GAAGCAAGCATGGAAGACAATAGCTTAATAAGGGATTCAACAGAAAAAGACTGGGAATCG
CCGTCTCCCTTGTCAGATACAGAGCTACTAGAGAAGGCAAAGAAGAGAGAGTTTGTACAA
GGTTTAATSTCATCAGCCATATTTGATCACATTTACAACTTCTAA

 

Protein sequence

MEEDLLGICGFDSSKKYRLEELAKYQSGSCIEFEDDDEMAVDYPCPFCSDDYDLVELCHH
IDEEHQLDANNGICPVCSRRVKMHMVDHITTQHRDVFKRLYKDESYSAFSPGTRKYLQSL
IDEPLSTNHTSKSVLDPLLSFIYNPPSPKKSKLVQPDSSSEASMEDNSLIRDSTEKDWES
PSPLSDTELLEKAKKREFVQGLISSAIFDHIYNF*

 

For further information on T2H3.13, please see the page for T10M13.20, the identical clone found in the overlap region of these two BACs.


written 13 Aug 98
Larry Parnell