| Gene | T2H3.13 indentical to T10M13.20, within the overlap region of T10M13 and T2H3 |
| Putative Identification | drought-induced-19-like 1 |
| Position | 42425 to 43852, from the initial methionine to the termination codon |
| Strand | - |
| EST match | T42823 and AA395188 |
| Database match | Di19 (drought-induced 19, X78584) |
CDS: The table below lists the coordinates for each exon of T2H3.R and which exon predicting programs selected the 5' and 3' termini (GS = GenScan, Gr = GRAIL, M = MZEF, NPG = NetPlantGene - selects splice sites only, not exons). Exon termini determined by ESTs AA395188 or T42823 are so designated.
| Exon | Range | 3' | 5' |
|---|---|---|---|
| 1 | 43771 - 43852 | EST, GS, Gr, M, NPG | GS, Gr |
| 2 | 43431 - 43564 | EST, Gr, M, NPG | EST, GS, Gr, M, NPG |
| 3 | 43021 - 43098 | EST, Gr, M, NPG | EST, Gr, M, NPG |
| 4 | 42638 - 42870 | EST, NPG | EST, Gr, NPG |
| 5 | 42425 - 42542 | ESTM, NPG |
Alternate exons not used in building the gene model. GenScan predicts a terminal exon from 43371 to 43564. GRAIL does not select any model exons in this region. The EST (T42823) match to a GRAIL "intergenic region" necessitated assembly of shadow exons. Exon 4 extends from 42766 to 42870, exon 5 from 42469 to 42542, and a sixth exon from 42162 to 42260. MZEF selects internal exons from 43771 to 43859 and from 42469 to 42542. NetPlantGene predicts many putative splice sites in this region, only the splice donor at 42469 has a high confidence score (0.96).
Complete CDS of T2H3.13
ATGGAAGAAGATTTGTTGGGCATTTGTGGGTTTGATTCGTCGAAGAAATATCGATTAGAA GAACTTGCCAAGTATCAGTCGGGTTCATGTATTGAGTTTGAAGATGATGATGAAATGGCA GTGGATTATCCATGCCCGTTTTGCTCAGATGATTATGATTTAGTTGAATTGTGTCACCAT ATCGATGAGGAGCATCAACTAGACGCCAACAATGGGATATGTCCGGTTTGTAGCAGACGA GTGAAGATGCATATGGTTGATCACATTACCACTCAGCATAGAGATGTCTTCAAGAGACTT TACAAGGATGAGTCATATTCAGCATTTTCTCCAGGGACTAGGAAATACTTACAGTCTCTA ATCGATGAGCCGTTGTCTACTAATCATACATCTAAAAGTGTTCTGGACCCATTATTATCA TTTATATACAATCCGCCGTCACCAAAGAAGTCCAAGCTTGTACAACCTGATTCATCTAGT GAAGCAAGCATGGAAGACAATAGCTTAATAAGGGATTCAACAGAAAAAGACTGGGAATCG CCGTCTCCCTTGTCAGATACAGAGCTACTAGAGAAGGCAAAGAAGAGAGAGTTTGTACAA GGTTTAATSTCATCAGCCATATTTGATCACATTTACAACTTCTAA
Protein sequence
MEEDLLGICGFDSSKKYRLEELAKYQSGSCIEFEDDDEMAVDYPCPFCSDDYDLVELCHH IDEEHQLDANNGICPVCSRRVKMHMVDHITTQHRDVFKRLYKDESYSAFSPGTRKYLQSL IDEPLSTNHTSKSVLDPLLSFIYNPPSPKKSKLVQPDSSSEASMEDNSLIRDSTEKDWES PSPLSDTELLEKAKKREFVQGLISSAIFDHIYNF*
For further information on T2H3.13, please see the page for T10M13.20, the identical clone found in the overlap region of these two BACs.
written 13 Aug 98
Larry
Parnell