| Gene | T2H3.4 |
| Putative Identification | hypothetical protein similar to extensin-like protein |
| Position | 23632 - 24245, from the initial methionine to the termination codon |
| Strand | + |
| EST hits | AA395170 |
| Database match | A. thaliana hypothetical proteins T30B22.16
and T30B22.17 and N. tabacum pistil-specific extensin-like protein (PELP) |
CDS: The table below lists the coordinates of the T2H3.4 exons and which exon prediction algorithms selected the 5' and 3' termini (GS = GenScan, Gr = GRAIL, M = MZEF, NPG = NetPlantGene - selects splice sites only, not exons).
| Exon | Range | 5' | 3' |
|---|---|---|---|
| 1 | 23632 - 23806 | GS, Gr | GS, M, NPG |
| 2 | 23923 - 24245 | GS, Gr, NPG | GS, Gr |
Alternate exons not used in building the gene model: GRAIL predicts exon 1 from 23632 to 23875. MZEF predicts an exon from 23699 to 23806. NetPlantGene predicts 3 splice acceptors and 4 splice donors in the region of T2H3.4. The acceptor predicted at 23699 has a confidence score of (0.87).
Complete CDS of T2H3.4
ATGGCTTTCTCACGCCTCTCATTTGCAGCTTCTCTCATCGTCTTCTCATCTCTAATCATA TCTTCAGTTGCTTATTACGGCAATGAAGCTGACCCGGAGACCGGAAAATTGATTCCCATC GCCGTTGAAGGGATCATCAAGTGCAAGTCCGGTGGCAAAACTTACCCAATTCAAGGTGCA ACGGCAAGAATTGCGTGTGTGAAGGTCGATGCATATGGGAATGAGTTAGTTCCAATATCG ATATTGAGCAGCAAAACTGATGCTAAAGGTTACTTCATCGCCACGATATTCCCTTCGCAG CTTCGTGCAGGAAGGACAGTGACCAAATGCAAAACTTACCTTTACAAATCTCCACTCGCT GATTGCGATTTTCCGACCGATGTGAACAAAGGTGTTAGAGGACAGCCATTGAGCACGTAC CGTATTCTTCAAGACAAGAGCTTCAAGCTTTACTGGGCTGGTCCTTTCTTCTACACGTCT GAACCTACTTACTACTAA
Protein translation of T2H3.4
MAFSRLSFAASLIVFSSLIISSVAYYGNEADPETGKLIPIAVEGIIKCKSGGKTYPIQGA TARIACVKVDAYGNELVPISILSSKTDAKGYFIATIFPSQLRAGRTVTKCKTYLYKSPLA DCDFPTDVNKGVRGQPLSTYRILQDKSFKLYWAGPFFYTSEPTYY*
Multiple Sequence Analysis
T2H3.4 is aligned to A. thaliana proteins T30B22.16 and T30B22.17, described in GenBank as hypothetical, and a pistil-specific extensin-like protein (PELP) from N. tabacum. While the poly-proline repeats of PELP are not conserved in the three Arabidopsis proteins, other residues in the carboxyl terminal portion of PELP are conserved.
1 60
T2H3.4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T30B22.16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T30B22.17 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
NtaPELP MAVIISSKVLLIQLFVLVLGSFSKLSHGELWLELPLPFDWPPAEIPLPDIPSPFDGPTFV
61 120
T2H3.4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T30B22.16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T30B22.17 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
NtaPELP LPPPSPLPSPPPPSPSPPPPSPSPPPPSTIPLIPPFTGGFLPPLPGSKLPDFAGLLPLIP
121 180
T2H3.4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T30B22.16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T30B22.17 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
NtaPELP NLPDVPPIGGGPPVNQPKPSSPSPLVKPPPPPPSPCKPSPPDQSAKQPPQPPPAKQPSPP
181 240
T2H3.4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T30B22.16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T30B22.17 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~MASTGAATN
NtaPELP PPPPPVKAPSPSPAKQPPPPPPPVKAPSPSPATQPPTKQPPPPPRAKKSPLLPPPPPVAY
241 300
T2H3.4 ~~~~~MAFSRLSFAASLIVFSSLIISSVAYYGNEADPETGKLIP....IAVEGIIKCKS.
T30B22.16 ~~~~~~~~~~MAQTCSILYIPYMLLLSSLFAAGIATITEGELLS..SMIGVQGLIYCKR.
T30B22.17 LLLLAMVVVVATADYYAQPQPYVPKPTTTYTSPVKTPYLPKSNP..D.IAIEGFILCKS.
NtaPELP PPVMTPSPSPAAEPPIIAPFPSPPANPPLIPRRPAPPVVKPLPPLGKPPIVSGLVYCKSC
301 360
T2H3.4 GGKTYP..IQGATARIACVKVDAYGNELVPISILSSKTDAKGYFIATIFPSQLRAGRTVT
T30B22.16 GSKLTP..IQGAVARVTCERTDEYGYEAEDVTVLSQATDAKGYFLATLSSSEVKDYKKVI
T30B22.17 GYKTYP..IQGGKVKVVCPVVDSYGKLVAKVTISSYPTDLKGYFYFITYGLS....HKVN
NtaPELP NSYGVPTLLNASLLQGAVVKLICYGK...KTMVQWATTDNKGEF..RIMPKSL....TTA
361 420
T2H3.4 K...CKTYLYKSPLADCDFPTDVNKGVRGQPL.......S..TYRILQDK.SFK.LYWAG
T30B22.16 KIKECRAFLELSPSDTCSFPTEINRGISGAIL.......Q..NYRLLENKLKMK.LFTVG
T30B22.17 NISSCKVKLESSPVFTCKTPTNVNKGVTGAPL.......SPDNSKFLSHD.NLT.LYTLE
NtaPELP DVGKCKVYLVKSPNPNCNVPTNFNGGKSGGLLKPLLPPKQPITPAVVPVQPPMSDLYGVG
421 439
T2H3.4 PFFYTSEPTYY*~~~~~~~
T30B22.16 PFVFSSEETQDKSIPNGY*
T30B22.17 PFYFSSPVAPKPVY*~~~~
NtaPELP PFIFEASSKMPCDKN*~~~
written 10 Sep 98
Larry
Parnell