| Gene | T10M13.20/T2H3.13 |
| Putative Identification | drought-induced-19-like 1 |
| Position | 99041 to 100468, from the initial methionine to the termination codon |
| Strand | + |
| EST match | T42823 and AA395188 |
| Database match | Di19 (drought-induced 19, X78584) |
CDS: The table below lists the coordinates for each exon of T10M13.20 and which exon predicting programs selected the 5' and 3' termini (GS = GenScan, Gr = GRAIL, M = MZEF, NPG = NetPlantGene - selects splice sites only, not exons). Exon termini determined by ESTs AA395188 or T42823 are so designated.
| Exon | Range | 5' | 3' |
|---|---|---|---|
| 1 | 99041 - 99122 | GS, Gr | EST, GS, Gr, M, NPG |
| 2 | 99329 - 99462 | EST, GS, Gr, M, NPG | EST, Gr, M, NPG |
| 3 | 99795 - 99872 | EST, Gr, M, NPG | EST, Gr, M, NPG |
| 4 | 100023 - 100255 | EST, Gr, NPG | EST, NPG |
| 5 | 100351 - 100468 | EST, M, NPG |
Alternate exons not used in building the gene model. GenScan predicts a terminal exon from 99329 to 99522. GRAIL does not select any model exons in this region. The EST (T42823) match to a GRAIL "intergenic region" necessitated assembly of shadow exons. A terminal exon is selected from 100023 to 100258. MZEF selects internal exons from 99034 to 99122 and from 100351 to 100424. NetPlantGene predicts many putative splice sites in this region, only the splice donor at 100424 has a high confidence score (0.96).
Complete CDS of T10M13.20
ATGGAAGAAGATTTGTTGGGCATTTGTGGGTTTGATTCGTCGAAGAAATATCGATTAGAA GAACTTGCCAAGTATCAGTCGGGTTCATGTATTGAGTTTGAAGATGATGATGAAATGGCA GTGGATTATCCATGCCCGTTTTGCTCAGATGATTATGATTTAGTTGAATTGTGTCACCAT ATCGATGAGGAGCATCAACTAGACGCCAACAATGGGATATGTCCGGTTTGTAGCAGACGA GTGAAGATGCATATGGTTGATCACATTACCACTCAGCATAGAGATGTCTTCAAGAGACTT TACAAGGATGAGTCATATTCAGCATTTTCTCCAGGGACTAGGAAATACTTACAGTCTCTA ATCGATGAGCCGTTGTCTACTAATCATACATCTAAAAGTGTTCTGGACCCATTATTATCA TTTATATACAATCCGCCGTCACCAAAGAAGTCCAAGCTTGTACAACCTGATTCATCTAGT GAAGCAAGCATGGAAGACAATAGCTTAATAAGGGATTCAACAGAAAAAGACTGGGAATCG CCGTCTCCCTTGTCAGATACAGAGCTACTAGAGAAGGCAAAGAAGAGAGAGTTTGTACAA GGTTTAATSTCATCAGCCATATTTGATCACATTTACAACTTCTAA
Protein sequence
MEEDLLGICGFDSSKKYRLEELAKYQSGSCIEFEDDDEMAVDYPCPFCSDDYDLVELCHH IDEEHQLDANNGICPVCSRRVKMHMVDHITTQHRDVFKRLYKDESYSAFSPGTRKYLQSL IDEPLSTNHTSKSVLDPLLSFIYNPPSPKKSKLVQPDSSSEASMEDNSLIRDSTEKDWES PSPLSDTELLEKAKKREFVQGLISSAIFDHIYNF*
Multiple sequence analysis of putative drought induced gene products
Di19 - drought induced 19 (X78584), F2P16.10 - gene on BAC F2P16 from chromosome V (PID:g2191179), T10M13.20 - gene on BAC T10M13 from chromosome IV
1 60
Di19 ~~~~~MDADSKRFLATLRSRS..EMLMG.FEEIDGDDDFQEEFACPFCAESYDIIGLCCH
F2P16.10 ~~~~~MFLLKPICIGTSRMERVFNNFLG.FEEIEGEDDFREEYACPFCSDYFDIVSLCCH
T10M13.20 MEEDLLGICGFDSSKKYRLEELAKYQSGSCIEFEDDDEMAVDYPCPFCSDDYDLVELCHH
61 120
Di19 IDDEHTLESKNAVCPVCSLKVGVDIVAHITLHHGSLFK.LQ................RKR
F2P16.10 IDEDHPMDAKNGVCPICAVKVSSDMIAHITLQHANMFK.ISFLLSLPLHSLTKYYVTRKR
T10M13.20 IDEEHQLDANNGICPVCSRRVKMHMVDHITTQHRDVFKRLYKDESYSAFSPGTRKYLQSL
121 180
Di19 KSRKSGTNSTLSLLRKGLREGDLQRLLGFTSRN.GSVASSVTPDPLLSSFISPTR.....
F2P16.10 KSRRGGAQSMLSILKREFPDGNFQSLFEGTSRAVSSSSASIAADPLLSSFISPMADDFFI
T10M13.20 IDEPLSTNHTSKSVLDPLLSFIYNPPSPKKSKLVQPDSSS.EASMEDNSLIRDSTEKDWE
181 238
Di19 SQSSPAPRQTKNVSEDKQIERKRQVFISPVSLKDREERRHKSEFVQRLLSSAIFDEV~
F2P16.10 SESSLCADTSSAKKTLNQSLPERNVEKQSLSAEDHREKLKQSEFVQGILSSMILEDGL
T10M13.20 SPSPLSDTELLEKAKKREFVQGLISSAIFDHIYNF*~~~~~~~~~~~~~~~~~~~~~~
written 31 Jul 97
updated 4 Aug 98
Larry
Parnell