| Gene | T10P11.9 |
| Putative Identification | bZIP-like DNA-binding protein |
| Position | 55009 to 57086, from initial methionine to the 3'-UTR |
| Strand | - |
| EST match | H76688, T04707, H36104, and T45488 |
| Database match | several plant nucleic acid-binding proteins, including Parsley CPRF-2 (bZIP, X58577), Soybean G/HBF-1 (Y10685), Rice bZIP protein (D78609), and maize opaque2 (L06478). |
mRNA: Based on a comparison of T10P11 and the EST sequences, the 3'-UTR sequence extends from 57064 to 57086.
CDS: The table below lists the coordinates of each T10P11.9 exon and whcih exon prediction algorithms selected the exon terminus (Gr = Grail, M = MZEF). Splice junctins determined by comparison to EST sequences are designated as EST.
| Exon | Range | 3' | 5' |
|---|---|---|---|
| 1 | 56653 to 57063 | Gr | EST,Gr |
| 2 | 56483 to 56564 | EST,M,Gr | EST,M,Gr |
| 3 | 55950 to 56118 | EST,M,Gr | EST,Gr |
| 4 | 55793 to 55868 | EST,M,Gr | EST,M,Gr |
| 5 | 55591 to 55716 | EST,M | EST,M,Gr |
| 6 | 55152 to 55501 | EST,M | EST,M |
| 7 | 55009 to 55030 | EST,M | EST |
Complete CDS of T10P11.9
ATGAACAGTATCTTCTCCATTGACGATTTCTCCGATCCTTTCTGGGAAACTCCTCCGATT CCTCTCAATCCCGACTCTTCTAAGCCTGTTACGGCGGATGAAGTTAGCCAGAGTCAACCG GAATGGACTTTCGAGATGTTTCTCGAAGAGATTTCTTCGTCGGCGGTGAGCTCTGAGCCA CTTGGTAACAACAACAACGCGATCGTCGGTGTTTCTTCGGCGCAATCTCTTCCTTCTGTT TCCGGACAGAATGATTTCGAGGATGATAGTCGATTTCGTGATCGCGATTCGGGAAATTTG GATTGTGCTGCTCCCATGACGACGAAGACGGTGATTGTTGATTCCGATGATTATCGTCGT GTTCTTAAGAACAAGCTTGAGACTGAGTGCGCTACTGTTGTTTCTCTTCGGGTTGGGTCT GTGAAGCCTGAAGATTCGACTAGTTCTCCAGAAACTCAACTTCAACCAGTTCAATCCAGT CCTCTTACTCAAGGAGAACTTGGTGTTACTTCTTCCTTACCAGCTGAGGTGAAAAAAACT GGTGTATCAATGAAGCAGGTTACTAGTGGATCGTCGAGAGAATATTCTGATGACGAGGAC CTTGATGAAGAGAATGAAACCACCGGTTCCTTGAAGCCAGAGGACGTTAAAAAATCTAGA AGGATGCTGTCAAATCGTGAGTCAGCTAGGCGATCTAGAAGGAGAAAGCAGGAGCAAACA AGTGACCTCGAAACACAGGTTAATGATCTAAAAGGTGAGCATTCATCACTTCTTAAACAA CTGAGCAACATGAATCACAAGTATGACGAGGCTGCTGTTGGCAATAGAATACTAAAGGCT GACATTGAGACATTAAGAGCTAAGGTGAAAATGGCGGAAGAAACCGTGAAGAGAGTAACA GGAATGAATCCGATGCTTCTCGGAAGATCAAGTGGACATAACAACAACAACAGAATGCCA ATAACTGGTAACAACAGGATGGATTCTTCTAGCATTATTCCAGCTTATCAACCACACTCA AACCTAAACCATATGTCAAACCAAAACATCGGGATCCCAACCATTCTACCTCCAAGACTC GGAAACAATTTCGCTGCTCCTCCATCCCAAACCAGCTCTCCCTTGCAGAGAATTAGAAAT GGGCAAAATCACCATGTTACTCCAAGCGCCAACCCGTATGGCTGGAATACCGAACCTCAG AACGATTCAGCATGGCCGAAAAAATGCGTGGACTGA
Protein Translation:
MNSIFSIDDFSDPFWETPPIPLNPDSSKPVTADEVSQSQPEWTFEMFLEEISSSAVSSEP LGNNNNAIVGVSSAQSLPSVSGQNDFEDDSRFRDRDSGNLDCAAPMTTKTVIVDSDDYRR VLKNKLETECATVVSLRVGSVKPEDSTSSPETQLQPVQSSPLTQGELGVTSSLPAEVKKT GVSMKQVTSGSSREYSDDEDLDEENETTGSLKPEDVKKSRRMLSNRESARRSRRRKQEQT SDLETQVNDLKGEHSSLLKQLSNMNHKYDEAAVGNRILKADIETLRAKVKMAEETVKRVT GMNPMLLGRSSGHNNNNRMPITGNNRMDSSSIIPAYQPHSNLNHMSNQNIGIPTILPPRL GNNFAAPPSQTSSPLQRIRNGQNHHVTPSANPYGWNTEPQNDSAWPKKCVD*
Protein motifs:
A bZIP signature is found from residues 220 to 235 [RRMLSNRESARRSRRR].
Alignment of select plant bZIP proteins from rice (D78609), maize (L06478), soybean (Y10685), parsley (X58577), and Arabidopsis (T10P11.9). The bZIP motif is highlighted in red.
1 60
RicBZIPPA MERVFSVEEISDPFWVPPPPPQSAAAAQQQGGGGVASGGGGGVAGGGGGGNAMNRCPSEW
ZmOpaque2 MERVFSMEEIPNPYWAPPHP.......QPAAGGAVAA..PGGVGGAGDEAGAMNRCPSEW
GmGHBF1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
PcCPRF2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~MDRVFSVEDISDQFWSPPARE
T10P11.9 MNSIFSIDDFSDPFWETPPIPLNPDSSKPVTADEVSQSQPEWTFEMFLEEISSSAVSSEP
61 120
RicBZIPPA YFQKFLEEAVLDSPVPNPSPRAEAGGIRGAGGVVPVDVKQPQLSAAAAAAATTSAVVDPV
ZmOpaque2 YFEKFLEEAVLDSPGP.......VAGVGRSSGQAGVEAAESKPLGAAAPASVSSSVVDPV
GmGHBF1 ~~~~~~MTASSSSSSHQNDVVEIKDENLSIPNLNPSTALNSKPASSFGLAPPPNIAVDSE
PcCPRF2 DSSKLVMNRSDSEWAFQSFLQQASALESSQP..LPSDPVPVAGDVKNPVEIPANVPVDSE
T10P11.9 LGNNNNAIVGVSSAQSLPSV....SGQNDFEDDSRFRDRDSGNLDCAAPMTTKTVIVDSD
121 180
RicBZIPPA EYNAMLKQKLEKDLAAVAMWRASGTVPPERPGA.....GSSLLNADVSHIGAPISIGGNA
ZmOpaque2 EYNAMLKQKLEKDLAAIAMWRASGAAPPDLSAT.....AASLPSVGVPHAAPLKPVGGTE
GmGHBF1 EYQAFLKSQLHLACAAVALTRGKSLNPQDSGSTAHDKGSETASAAQSGSHVSTLGSGQEV
PcCPRF2 DYQAYLKSRLDLACAAVALTRASSLKPQDSAALL.DNGSQASNTSQLVSQVPPKGSGHDL
T10P11.9 DYRRVLKNKLETECATVVSLRVGSVKPEDSTSSPETQLQPVQSSP...............
181 240
RicBZIPPA TPVQNMLSG.PSG.GSGSQLVQNVDVLVKQATSSSSREQSDDDD.MEGEAETTGTARPAD
ZmOpaque2 SLVQNMLAGAPVG.GSGPHIVQIADIPVKQTTSSSSREQSDDDD.MEGDAETNGNGNPVQ
GmGHBF1 AKIQDKDAGGPVGIPSLPPVQKKPVVQVRSTTSGSSREQS.DDDEAEGEAETTQGMDPAD
PcCPRF2 SKEEDKEALAATATPLLPALQKKSAIQVKSTTSGSSRDHSDDDDELEGETETTRNGDPSD
T10P11.9 .....LTQGELGVTSSLPAEVKKTGVSMKQVTSGSSREYSDDED.LDEENETTGSLKPED
241 300
RicBZIPPA QRLQRRKQSNRESARRSRSRKAAHLNELEAQVSQLRVENSSLLRRLADVNQKYNDAAVDN
ZmOpaque2 QRQQRRKQSNRESARRSRSRKAAHLNELEAQVAQLRVENSSLLRRLADVNQKFNEAAVDN
GmGHBF1 AKRVRRMLSNRESARRSRRRKQAHLTELETQVSQLRVENSSLLKRLTDISQKYNEAAVDN
PcCPRF2 AKRVRRMLSNRESARRSRRRKQAHMTELETQVSQLRVENSSLLKRLTDISQRYNDAAVDN
T10P11.9 VKKSRRMLSNRESARRSRRRKQEQTSDLETQVNDLKGEHSSLLKQLSNMNHKYDEAAVGN
301 360
RicBZIPPA RVLKADVETLRAKVKMAEDSVKRVTGMNALFPAA.SDMSSLSMP.FNSSPSEATSDAAVP
ZmOpaque2 RVLKADVETLRAKVKMAEDSVKRVTGMNALYPAV.SDMSSLSMP.FNGSPSDSASDSTVP
GmGHBF1 RVLKADVETLRTKVKMAEETVKRVTGLNPLFQAM.SEISSMVMPSYSGSPSDTSADAAVP
PcCPRF2 RVLKADIETMRAKVKMAEETVKRVTGLNPMFQSMSSEISTIGMQSFSGSPSDTSADT...
T10P11.9 RILKADIETLRAKVKMAEETVKRVTGMNPMLLGRSSGHNNNNRMPITGNNRMDSSSIIPA
361 420
RicBZIPPA IQDDPNNYFAT.......NNDIGGNNNYMPDIPSSAQEDEDFVNGALAAGKIGRTASLQR
ZmOpaque2 VQDDLNSYFAN.......PSEIGGNNGYMPDIASSVQQDDNFVNGYQAAGKMGRTDSLQR
GmGHBF1 VQDDPKHHYYQQPPNNLMPTHDPRIQNGMVDVPPIENVEQNPATAAVGGNKMGRTTSMQR
PcCPRF2 TQDGSKQHFYQPAPTSHMPAQDQKIQNGLLQVPPVDNLQQHSASGPVEGNKMERTSSMQR
T10P11.9 YQPHSNLNHMSNQNIGIPTILPPRLGNNFAAPPSQTSSPLQRIRNGQNHHVTPSANPYGW
421 447
RicBZIPPA VASLEHLQKRMCGGPASSGSTS*~~~~
ZmOpaque2 VASLEHLQKRMCGGPASSGSTS*~~~~
GmGHBF1 VASLEHLQKRIRGEVSSCGTQGRGEQ*
PcCPRF2 VASLEHLQKRIRGGVSSCEAQVSGKQ*
T10P11.9 NTEPQNDSAWPKKCVD*~~~~~~~~~~
written 11 Aug 97
Larry Parnell