Gene: T2H3.4
Putative identification: hypothetical protein similar to extensin-like protein
Position: 23632 - 24245, from the initial methionine to the termination codon
Strand: +
EST hits: AA395170
Database match: A. thaliana hypothetical proteins T30B22.16 and T30B22.17, and N. tabacum pistil-specific extensin-like protein (PELP)

 

CDS:  The table below lists the coordinates of the T2H3.4 exons and the exon prediction programs that selected each 5' and 3' terminus (GS = GenScan, Gr = GRAIL, M = MZEF, NPG = NetPlantGene, which predicts splice sites only, not exons).

Exon  Range          5' terminus   3' terminus
1     23632 - 23806  GS, Gr        GS, M, NPG
2     23923 - 24245  GS, Gr, NPG   GS, Gr
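The exon coordinates can be checked for internal consistency with a few lines of arithmetic. The sketch below is only an illustration, assuming the coordinates are 1-based and inclusive (as the Position field implies); the names EXONS and span_length are hypothetical and not part of the original annotation.

```python
# Exon coordinates copied from the table (assumed 1-based, inclusive).
EXONS = [(23632, 23806), (23923, 24245)]


def span_length(start, end):
    """Length in bp of a 1-based inclusive interval."""
    return end - start + 1


exon_lengths = [span_length(s, e) for s, e in EXONS]
cds_length = sum(exon_lengths)

# The single intron lies between the end of exon 1 and the start of exon 2.
intron = (EXONS[0][1] + 1, EXONS[1][0] - 1)
intron_length = span_length(*intron)

# The genomic span runs from the first base of exon 1 to the last of exon 2.
gene_span = span_length(EXONS[0][0], EXONS[-1][1])

print("exon lengths:", exon_lengths)         # [175, 323]
print("spliced CDS length:", cds_length)     # 498
print("intron:", intron, intron_length)      # (23807, 23922) 116
print("genomic span:", gene_span)            # 614
print("whole codons:", cds_length % 3 == 0)  # True
```

The spliced length of 498 bp corresponds to 166 codons, that is, 165 residues plus the termination codon.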

Alternate exons not used in building the gene model:   GRAIL predicts exon 1 from 23632 to 23875. MZEF predicts an exon from 23699 to 23806. NetPlantGene predicts three splice acceptors and four splice donors in the region of T2H3.4. The acceptor predicted at 23699 has a confidence score of 0.87.
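How the alternate predictions differ from the exon 1 used in the model can be made explicit by comparing termini. This is a rough sketch using only the coordinates quoted above; the names MODEL_EXON_1, ALTERNATES, NPG_ACCEPTOR and compare_to_model are hypothetical.

```python
# Exon 1 as used in the gene model, and the alternate predictions
# quoted in the text (assumed 1-based, inclusive coordinates).
MODEL_EXON_1 = (23632, 23806)
ALTERNATES = {"GRAIL": (23632, 23875), "MZEF": (23699, 23806)}
NPG_ACCEPTOR = 23699  # NetPlantGene acceptor with the quoted score of 0.87


def compare_to_model(model, alt):
    """Report terminus offsets (alternate minus model), in bp."""
    return {
        "offset_5prime": alt[0] - model[0],
        "offset_3prime": alt[1] - model[1],
    }


comparison = {
    name: compare_to_model(MODEL_EXON_1, coords)
    for name, coords in ALTERNATES.items()
}

for name, offsets in comparison.items():
    print(name, offsets)
# GRAIL {'offset_5prime': 0, 'offset_3prime': 69}
# MZEF {'offset_5prime': 67, 'offset_3prime': 0}

# The MZEF exon begins at the NetPlantGene acceptor position.
print(ALTERNATES["MZEF"][0] == NPG_ACCEPTOR)  # True
```

GRAIL shares the 5' terminus of exon 1 but extends 69 bp further 3', while MZEF shares the 3' terminus and starts 67 bp downstream, at the position of the predicted acceptor.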

Complete CDS of T2H3.4

ATGGCTTTCTCACGCCTCTCATTTGCAGCTTCTCTCATCGTCTTCTCATCTCTAATCATA
TCTTCAGTTGCTTATTACGGCAATGAAGCTGACCCGGAGACCGGAAAATTGATTCCCATC
GCCGTTGAAGGGATCATCAAGTGCAAGTCCGGTGGCAAAACTTACCCAATTCAAGGTGCA
ACGGCAAGAATTGCGTGTGTGAAGGTCGATGCATATGGGAATGAGTTAGTTCCAATATCG
ATATTGAGCAGCAAAACTGATGCTAAAGGTTACTTCATCGCCACGATATTCCCTTCGCAG
CTTCGTGCAGGAAGGACAGTGACCAAATGCAAAACTTACCTTTACAAATCTCCACTCGCT
GATTGCGATTTTCCGACCGATGTGAACAAAGGTGTTAGAGGACAGCCATTGAGCACGTAC
CGTATTCTTCAAGACAAGAGCTTCAAGCTTTACTGGGCTGGTCCTTTCTTCTACACGTCT
GAACCTACTTACTACTAA

 

Protein translation of T2H3.4

MAFSRLSFAASLIVFSSLIISSVAYYGNEADPETGKLIPIAVEGIIKCKSGGKTYPIQGA
TARIACVKVDAYGNELVPISILSSKTDAKGYFIATIFPSQLRAGRTVTKCKTYLYKSPLA
DCDFPTDVNKGVRGQPLSTYRILQDKSFKLYWAGPFFYTSEPTYY*
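The protein translation can be reproduced from the CDS with the standard genetic code. The sketch below is an illustration only, assuming the standard (nuclear) code applies; CODON_TABLE and translate are hypothetical helper names, and the sequences are copied from the listings above.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest), '*' marking termination codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

CDS = (
    "ATGGCTTTCTCACGCCTCTCATTTGCAGCTTCTCTCATCGTCTTCTCATCTCTAATCATA"
    "TCTTCAGTTGCTTATTACGGCAATGAAGCTGACCCGGAGACCGGAAAATTGATTCCCATC"
    "GCCGTTGAAGGGATCATCAAGTGCAAGTCCGGTGGCAAAACTTACCCAATTCAAGGTGCA"
    "ACGGCAAGAATTGCGTGTGTGAAGGTCGATGCATATGGGAATGAGTTAGTTCCAATATCG"
    "ATATTGAGCAGCAAAACTGATGCTAAAGGTTACTTCATCGCCACGATATTCCCTTCGCAG"
    "CTTCGTGCAGGAAGGACAGTGACCAAATGCAAAACTTACCTTTACAAATCTCCACTCGCT"
    "GATTGCGATTTTCCGACCGATGTGAACAAAGGTGTTAGAGGACAGCCATTGAGCACGTAC"
    "CGTATTCTTCAAGACAAGAGCTTCAAGCTTTACTGGGCTGGTCCTTTCTTCTACACGTCT"
    "GAACCTACTTACTACTAA"
)

PROTEIN = (
    "MAFSRLSFAASLIVFSSLIISSVAYYGNEADPETGKLIPIAVEGIIKCKSGGKTYPIQGA"
    "TARIACVKVDAYGNELVPISILSSKTDAKGYFIATIFPSQLRAGRTVTKCKTYLYKSPLA"
    "DCDFPTDVNKGVRGQPLSTYRILQDKSFKLYWAGPFFYTSEPTYY*"
)


def translate(dna):
    """Translate a CDS codon by codon; the length must be a multiple of 3."""
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


translated = translate(CDS)
print(len(CDS), len(translated))  # 498 166
print(translated == PROTEIN)      # True
```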

 

Multiple Sequence Analysis

T2H3.4 is aligned with A. thaliana proteins T30B22.16 and T30B22.17, described in GenBank as hypothetical, and with a pistil-specific extensin-like protein (PELP) from N. tabacum. While the poly-proline repeats of PELP are not conserved in the three Arabidopsis proteins, other residues in the carboxyl-terminal portion of PELP are conserved.

           1                                                         60
   T2H3.4  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
T30B22.16  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
T30B22.17  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
  NtaPELP  MAVIISSKVLLIQLFVLVLGSFSKLSHGELWLELPLPFDWPPAEIPLPDIPSPFDGPTFV 

           61                                                       120
   T2H3.4  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
T30B22.16  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
T30B22.17  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
  NtaPELP  LPPPSPLPSPPPPSPSPPPPSPSPPPPSTIPLIPPFTGGFLPPLPGSKLPDFAGLLPLIP 

           121                                                      180
   T2H3.4  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
T30B22.16  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
T30B22.17  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
  NtaPELP  NLPDVPPIGGGPPVNQPKPSSPSPLVKPPPPPPSPCKPSPPDQSAKQPPQPPPAKQPSPP 

           181                                                      240
   T2H3.4  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
T30B22.16  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
T30B22.17  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~MASTGAATN 
  NtaPELP  PPPPPVKAPSPSPAKQPPPPPPPVKAPSPSPATQPPTKQPPPPPRAKKSPLLPPPPPVAY 

           241                                                      300
   T2H3.4  ~~~~~MAFSRLSFAASLIVFSSLIISSVAYYGNEADPETGKLIP....IAVEGIIKCKS. 
T30B22.16  ~~~~~~~~~~MAQTCSILYIPYMLLLSSLFAAGIATITEGELLS..SMIGVQGLIYCKR. 
T30B22.17  LLLLAMVVVVATADYYAQPQPYVPKPTTTYTSPVKTPYLPKSNP..D.IAIEGFILCKS. 
  NtaPELP  PPVMTPSPSPAAEPPIIAPFPSPPANPPLIPRRPAPPVVKPLPPLGKPPIVSGLVYCKSC 

           301                                                      360
   T2H3.4  GGKTYP..IQGATARIACVKVDAYGNELVPISILSSKTDAKGYFIATIFPSQLRAGRTVT 
T30B22.16  GSKLTP..IQGAVARVTCERTDEYGYEAEDVTVLSQATDAKGYFLATLSSSEVKDYKKVI 
T30B22.17  GYKTYP..IQGGKVKVVCPVVDSYGKLVAKVTISSYPTDLKGYFYFITYGLS....HKVN 
  NtaPELP  NSYGVPTLLNASLLQGAVVKLICYGK...KTMVQWATTDNKGEF..RIMPKSL....TTA 

           361                                                      420
   T2H3.4  K...CKTYLYKSPLADCDFPTDVNKGVRGQPL.......S..TYRILQDK.SFK.LYWAG 
T30B22.16  KIKECRAFLELSPSDTCSFPTEINRGISGAIL.......Q..NYRLLENKLKMK.LFTVG 
T30B22.17  NISSCKVKLESSPVFTCKTPTNVNKGVTGAPL.......SPDNSKFLSHD.NLT.LYTLE 
  NtaPELP  DVGKCKVYLVKSPNPNCNVPTNFNGGKSGGLLKPLLPPKQPITPAVVPVQPPMSDLYGVG 

           421             439
   T2H3.4  PFFYTSEPTYY*~~~~~~~
T30B22.16  PFVFSSEETQDKSIPNGY*
T30B22.17  PFYFSSPVAPKPVY*~~~~
  NtaPELP  PFIFEASSKMPCDKN*~~~
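Columns conserved across all four sequences can be picked out of an alignment block programmatically. The sketch below is a minimal illustration applied to the final block (positions 421 to 439) only; it assumes that '~' and '.' denote gaps or padding and '*' the stop, and the names ROWS, NON_RESIDUE and conserved_columns are hypothetical.

```python
# Final alignment block, copied from the rows above (positions 421-439).
BLOCK_START = 421
ROWS = {
    "T2H3.4":    "PFFYTSEPTYY*~~~~~~~",
    "T30B22.16": "PFVFSSEETQDKSIPNGY*",
    "T30B22.17": "PFYFSSPVAPKPVY*~~~~",
    "NtaPELP":   "PFIFEASSKMPCDKN*~~~",
}

# Characters assumed not to represent amino acid residues.
NON_RESIDUE = set("~.*")


def conserved_columns(rows, start):
    """Return (position, residue) for columns identical in every row."""
    hits = []
    for offset, column in enumerate(zip(*rows.values())):
        symbols = set(column)
        if len(symbols) == 1 and not (symbols & NON_RESIDUE):
            hits.append((start + offset, column[0]))
    return hits


conserved = conserved_columns(ROWS, BLOCK_START)
print(conserved)  # [(421, 'P'), (422, 'F')]
```

The same function could be run over the earlier blocks by supplying their rows and starting positions.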

written 10 Sep 98
Larry Parnell