Gene T10P11.9
Putative Identification bZIP-like DNA-binding protein
Position 55009 to 57086, from initial methionine to the 3'-UTR
Strand -
EST match H76688, T04707, H36104, and T45488
Database match several plant nucleic acid-binding proteins, including parsley CPRF-2 (bZIP, X58577), soybean G/HBF-1 (Y10685), rice bZIP protein (D78609), and maize opaque2 (L06478).

 

mRNA: Based on a comparison of T10P11 and the EST sequences, the 3'-UTR sequence extends from 57064 to 57086.

CDS: The table below lists the coordinates of each T10P11.9 exon and which exon prediction algorithms selected each exon terminus (Gr = Grail, M = MZEF). Splice junctions determined by comparison to EST sequences are designated EST.

Exon  Range           3' terminus  5' terminus
1     56653 to 57063  Gr           EST,Gr
2     56483 to 56564  EST,M,Gr     EST,M,Gr
3     55950 to 56118  EST,M,Gr     EST,Gr
4     55793 to 55868  EST,M,Gr     EST,M,Gr
5     55591 to 55716  EST,M        EST,M,Gr
6     55152 to 55501  EST,M        EST,M
7     55009 to 55030  EST,M        EST
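
As a consistency check on the exon table, the following minimal sketch sums the exon lengths and confirms that the total is a whole number of codons. The coordinates are copied from the table; the function names are illustrative only and are not part of any annotation tool.

```python
# Exon coordinates copied from the table above (inclusive genomic positions,
# listed from exon 1 to exon 7, i.e. in descending order on the minus strand).
EXONS = [
    (56653, 57063),  # exon 1
    (56483, 56564),  # exon 2
    (55950, 56118),  # exon 3
    (55793, 55868),  # exon 4
    (55591, 55716),  # exon 5
    (55152, 55501),  # exon 6
    (55009, 55030),  # exon 7
]


def exon_lengths(exons):
    """Length in nucleotides of each inclusive (start, end) range."""
    return [end - start + 1 for start, end in exons]


def intron_lengths(exons):
    """Gap sizes between consecutive exons listed in descending order."""
    return [
        prev_start - next_end - 1
        for (prev_start, _), (_, next_end) in zip(exons, exons[1:])
    ]


lengths = exon_lengths(EXONS)
total = sum(lengths)
print("exon lengths:  ", lengths)
print("CDS length:    ", total, "nt; remainder mod 3 =", total % 3)
print("intron lengths:", intron_lengths(EXONS))
```

The exons sum to 1236 nt, which matches the length of the complete CDS listed in this record and corresponds to 411 codons plus a stop codon.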

Complete CDS of T10P11.9

ATGAACAGTATCTTCTCCATTGACGATTTCTCCGATCCTTTCTGGGAAACTCCTCCGATT
CCTCTCAATCCCGACTCTTCTAAGCCTGTTACGGCGGATGAAGTTAGCCAGAGTCAACCG
GAATGGACTTTCGAGATGTTTCTCGAAGAGATTTCTTCGTCGGCGGTGAGCTCTGAGCCA
CTTGGTAACAACAACAACGCGATCGTCGGTGTTTCTTCGGCGCAATCTCTTCCTTCTGTT
TCCGGACAGAATGATTTCGAGGATGATAGTCGATTTCGTGATCGCGATTCGGGAAATTTG
GATTGTGCTGCTCCCATGACGACGAAGACGGTGATTGTTGATTCCGATGATTATCGTCGT
GTTCTTAAGAACAAGCTTGAGACTGAGTGCGCTACTGTTGTTTCTCTTCGGGTTGGGTCT
GTGAAGCCTGAAGATTCGACTAGTTCTCCAGAAACTCAACTTCAACCAGTTCAATCCAGT
CCTCTTACTCAAGGAGAACTTGGTGTTACTTCTTCCTTACCAGCTGAGGTGAAAAAAACT
GGTGTATCAATGAAGCAGGTTACTAGTGGATCGTCGAGAGAATATTCTGATGACGAGGAC
CTTGATGAAGAGAATGAAACCACCGGTTCCTTGAAGCCAGAGGACGTTAAAAAATCTAGA
AGGATGCTGTCAAATCGTGAGTCAGCTAGGCGATCTAGAAGGAGAAAGCAGGAGCAAACA
AGTGACCTCGAAACACAGGTTAATGATCTAAAAGGTGAGCATTCATCACTTCTTAAACAA
CTGAGCAACATGAATCACAAGTATGACGAGGCTGCTGTTGGCAATAGAATACTAAAGGCT
GACATTGAGACATTAAGAGCTAAGGTGAAAATGGCGGAAGAAACCGTGAAGAGAGTAACA
GGAATGAATCCGATGCTTCTCGGAAGATCAAGTGGACATAACAACAACAACAGAATGCCA
ATAACTGGTAACAACAGGATGGATTCTTCTAGCATTATTCCAGCTTATCAACCACACTCA
AACCTAAACCATATGTCAAACCAAAACATCGGGATCCCAACCATTCTACCTCCAAGACTC
GGAAACAATTTCGCTGCTCCTCCATCCCAAACCAGCTCTCCCTTGCAGAGAATTAGAAAT
GGGCAAAATCACCATGTTACTCCAAGCGCCAACCCGTATGGCTGGAATACCGAACCTCAG
AACGATTCAGCATGGCCGAAAAAATGCGTGGACTGA

 

Protein Translation:

MNSIFSIDDFSDPFWETPPIPLNPDSSKPVTADEVSQSQPEWTFEMFLEEISSSAVSSEP
LGNNNNAIVGVSSAQSLPSVSGQNDFEDDSRFRDRDSGNLDCAAPMTTKTVIVDSDDYRR
VLKNKLETECATVVSLRVGSVKPEDSTSSPETQLQPVQSSPLTQGELGVTSSLPAEVKKT
GVSMKQVTSGSSREYSDDEDLDEENETTGSLKPEDVKKSRRMLSNRESARRSRRRKQEQT
SDLETQVNDLKGEHSSLLKQLSNMNHKYDEAAVGNRILKADIETLRAKVKMAEETVKRVT
GMNPMLLGRSSGHNNNNRMPITGNNRMDSSSIIPAYQPHSNLNHMSNQNIGIPTILPPRL
GNNFAAPPSQTSSPLQRIRNGQNHHVTPSANPYGWNTEPQNDSAWPKKCVD*
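
The relationship between the CDS and the protein translation can be spot-checked with a short sketch. It assumes the standard genetic code and translates only the first and last lines of the CDS listed above; the helper name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a DNA string in frame 1, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the T10P11.9 CDS, copied from this record.
CDS_FIRST_LINE = "ATGAACAGTATCTTCTCCATTGACGATTTCTCCGATCCTTTCTGGGAAACTCCTCCGATT"
CDS_LAST_LINE = "AACGATTCAGCATGGCCGAAAAAATGCGTGGACTGA"

print(translate(CDS_FIRST_LINE))
print(translate(CDS_LAST_LINE))
```

Both CDS lines are a multiple of three in length, and the lines between them are 60 nt each, so each can be translated independently in frame 1. The results correspond to the first 20 residues and the final 11 residues (plus stop) of the protein translation.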

Protein motifs:

A bZIP signature is found from residues 220 to 235 [RRMLSNRESARRSRRR].
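
The residue range quoted for the bZIP signature can be checked directly against the protein translation. The sketch below uses only the first 240 residues (the first four lines of the translation above) and reports 1-based coordinates; the variable names are illustrative.

```python
# First 240 residues of the T10P11.9 protein, copied from this record.
PROTEIN_1_240 = (
    "MNSIFSIDDFSDPFWETPPIPLNPDSSKPVTADEVSQSQPEWTFEMFLEEISSSAVSSEP"
    "LGNNNNAIVGVSSAQSLPSVSGQNDFEDDSRFRDRDSGNLDCAAPMTTKTVIVDSDDYRR"
    "VLKNKLETECATVVSLRVGSVKPEDSTSSPETQLQPVQSSPLTQGELGVTSSLPAEVKKT"
    "GVSMKQVTSGSSREYSDDEDLDEENETTGSLKPEDVKKSRRMLSNRESARRSRRRKQEQT"
)
MOTIF = "RRMLSNRESARRSRRR"

# str.find is 0-based; convert to the 1-based, inclusive residue numbering
# used in the annotation.
index = PROTEIN_1_240.find(MOTIF)
start = index + 1
end = index + len(MOTIF)

print("motif found at residues", start, "to", end)
print("occurrences in this window:", PROTEIN_1_240.count(MOTIF))
```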

Alignment of selected plant bZIP proteins from rice (D78609), maize (L06478), soybean (Y10685), parsley (X58577), and Arabidopsis (T10P11.9). The bZIP motif (highlighted in red in the original page) occupies alignment columns 245 to 260.

           1                                                         60
RicBZIPPA  MERVFSVEEISDPFWVPPPPPQSAAAAQQQGGGGVASGGGGGVAGGGGGGNAMNRCPSEW 
ZmOpaque2  MERVFSMEEIPNPYWAPPHP.......QPAAGGAVAA..PGGVGGAGDEAGAMNRCPSEW 
  GmGHBF1  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
  PcCPRF2  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~MDRVFSVEDISDQFWSPPARE 
 T10P11.9  MNSIFSIDDFSDPFWETPPIPLNPDSSKPVTADEVSQSQPEWTFEMFLEEISSSAVSSEP 

           61                                                       120
RicBZIPPA  YFQKFLEEAVLDSPVPNPSPRAEAGGIRGAGGVVPVDVKQPQLSAAAAAAATTSAVVDPV 
ZmOpaque2  YFEKFLEEAVLDSPGP.......VAGVGRSSGQAGVEAAESKPLGAAAPASVSSSVVDPV 
  GmGHBF1  ~~~~~~MTASSSSSSHQNDVVEIKDENLSIPNLNPSTALNSKPASSFGLAPPPNIAVDSE 
  PcCPRF2  DSSKLVMNRSDSEWAFQSFLQQASALESSQP..LPSDPVPVAGDVKNPVEIPANVPVDSE 
 T10P11.9  LGNNNNAIVGVSSAQSLPSV....SGQNDFEDDSRFRDRDSGNLDCAAPMTTKTVIVDSD 

           121                                                      180
RicBZIPPA  EYNAMLKQKLEKDLAAVAMWRASGTVPPERPGA.....GSSLLNADVSHIGAPISIGGNA 
ZmOpaque2  EYNAMLKQKLEKDLAAIAMWRASGAAPPDLSAT.....AASLPSVGVPHAAPLKPVGGTE 
  GmGHBF1  EYQAFLKSQLHLACAAVALTRGKSLNPQDSGSTAHDKGSETASAAQSGSHVSTLGSGQEV 
  PcCPRF2  DYQAYLKSRLDLACAAVALTRASSLKPQDSAALL.DNGSQASNTSQLVSQVPPKGSGHDL 
 T10P11.9  DYRRVLKNKLETECATVVSLRVGSVKPEDSTSSPETQLQPVQSSP............... 

           181                                                      240
RicBZIPPA  TPVQNMLSG.PSG.GSGSQLVQNVDVLVKQATSSSSREQSDDDD.MEGEAETTGTARPAD 
ZmOpaque2  SLVQNMLAGAPVG.GSGPHIVQIADIPVKQTTSSSSREQSDDDD.MEGDAETNGNGNPVQ 
  GmGHBF1  AKIQDKDAGGPVGIPSLPPVQKKPVVQVRSTTSGSSREQS.DDDEAEGEAETTQGMDPAD 
  PcCPRF2  SKEEDKEALAATATPLLPALQKKSAIQVKSTTSGSSRDHSDDDDELEGETETTRNGDPSD 
 T10P11.9  .....LTQGELGVTSSLPAEVKKTGVSMKQVTSGSSREYSDDED.LDEENETTGSLKPED 

           241                                                      300
RicBZIPPA  QRLQRRKQSNRESARRSRSRKAAHLNELEAQVSQLRVENSSLLRRLADVNQKYNDAAVDN 
ZmOpaque2  QRQQRRKQSNRESARRSRSRKAAHLNELEAQVAQLRVENSSLLRRLADVNQKFNEAAVDN 
  GmGHBF1  AKRVRRMLSNRESARRSRRRKQAHLTELETQVSQLRVENSSLLKRLTDISQKYNEAAVDN 
  PcCPRF2  AKRVRRMLSNRESARRSRRRKQAHMTELETQVSQLRVENSSLLKRLTDISQRYNDAAVDN 
 T10P11.9  VKKSRRMLSNRESARRSRRRKQEQTSDLETQVNDLKGEHSSLLKQLSNMNHKYDEAAVGN 

           301                                                      360
RicBZIPPA  RVLKADVETLRAKVKMAEDSVKRVTGMNALFPAA.SDMSSLSMP.FNSSPSEATSDAAVP 
ZmOpaque2  RVLKADVETLRAKVKMAEDSVKRVTGMNALYPAV.SDMSSLSMP.FNGSPSDSASDSTVP 
  GmGHBF1  RVLKADVETLRTKVKMAEETVKRVTGLNPLFQAM.SEISSMVMPSYSGSPSDTSADAAVP 
  PcCPRF2  RVLKADIETMRAKVKMAEETVKRVTGLNPMFQSMSSEISTIGMQSFSGSPSDTSADT... 
 T10P11.9  RILKADIETLRAKVKMAEETVKRVTGMNPMLLGRSSGHNNNNRMPITGNNRMDSSSIIPA 

           361                                                      420
RicBZIPPA  IQDDPNNYFAT.......NNDIGGNNNYMPDIPSSAQEDEDFVNGALAAGKIGRTASLQR 
ZmOpaque2  VQDDLNSYFAN.......PSEIGGNNGYMPDIASSVQQDDNFVNGYQAAGKMGRTDSLQR 
  GmGHBF1  VQDDPKHHYYQQPPNNLMPTHDPRIQNGMVDVPPIENVEQNPATAAVGGNKMGRTTSMQR 
  PcCPRF2  TQDGSKQHFYQPAPTSHMPAQDQKIQNGLLQVPPVDNLQQHSASGPVEGNKMERTSSMQR 
 T10P11.9  YQPHSNLNHMSNQNIGIPTILPPRLGNNFAAPPSQTSSPLQRIRNGQNHHVTPSANPYGW 

           421                     447
RicBZIPPA  VASLEHLQKRMCGGPASSGSTS*~~~~
ZmOpaque2  VASLEHLQKRMCGGPASSGSTS*~~~~
  GmGHBF1  VASLEHLQKRIRGEVSSCGTQGRGEQ*
  PcCPRF2  VASLEHLQKRIRGGVSSCEAQVSGKQ*
 T10P11.9  NTEPQNDSAWPKKCVD*~~~~~~~~~~
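
To make the conservation of the bZIP signature across the aligned proteins concrete, the sketch below counts identical residues in the motif window. The rows are copied from the alignment block covering columns 241 to 300; simple per-column identity is an assumption chosen for illustration, not the scoring used to build the alignment.

```python
# Alignment rows for columns 241-300, copied from the alignment above.
ROWS = {
    "RicBZIPPA": "QRLQRRKQSNRESARRSRSRKAAHLNELEAQVSQLRVENSSLLRRLADVNQKYNDAAVDN",
    "ZmOpaque2": "QRQQRRKQSNRESARRSRSRKAAHLNELEAQVAQLRVENSSLLRRLADVNQKFNEAAVDN",
    "GmGHBF1": "AKRVRRMLSNRESARRSRRRKQAHLTELETQVSQLRVENSSLLKRLTDISQKYNEAAVDN",
    "PcCPRF2": "AKRVRRMLSNRESARRSRRRKQAHMTELETQVSQLRVENSSLLKRLTDISQRYNDAAVDN",
    "T10P11.9": "VKKSRRMLSNRESARRSRRRKQEQTSDLETQVNDLKGEHSSLLKQLSNMNHKYDEAAVGN",
}
MOTIF = "RRMLSNRESARRSRRR"

# Locate the motif in the T10P11.9 row, then compare the same columns
# in every other row.
offset = ROWS["T10P11.9"].find(MOTIF)


def identity(a, b):
    """Number of positions at which two equal-length strings agree."""
    return sum(x == y for x, y in zip(a, b))


scores = {}
for name, row in ROWS.items():
    window = row[offset:offset + len(MOTIF)]
    scores[name] = identity(window, MOTIF)
    print(f"{name:>9}  {window}  {scores[name]}/{len(MOTIF)}")
```

Under this simple measure, the soybean and parsley proteins are identical to T10P11.9 across all 16 motif positions, while the rice and maize proteins differ at three positions.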

 



written 11 Aug 97
Larry Parnell