| Gene | T10M13.2 |
| Putative Identification | serine/threonine protein kinase |
| Position | 8320 to 11487, from the initial methionine to the termination codon |
| Strand | + |
| EST match | T44402 |
| Database match | A. thaliana ATHPRKINA (L07248) protein kinase |
CDS: The table below lists the coordinates of the T10M13.2 exons and which exon prediction algorithms selected the 5' and 3' termini (GS = GenScan, Gr = GRAIL, M = MZEF, NPG = NetPlantGene - selects splice sites only, not exons). Splice sites determined by identity to EST T44402 are designated by EST.
| Exon | Range | 5' | 3' |
|---|---|---|---|
| 1 | 8320 - 8713 | GS | GS, Gr, M, NPG |
| 2 | 8855 - 9226 | GS, M, NPG | GS, Gr, M, NPG |
| 3 | 9463 - 9675 | GS, M, NPG | EST, GS, Gr, M, NPG |
| 4 | 10037 - 10408 | EST, GS, Gr, M, NPG | GS, Gr, M, NPG |
| 5 | 10530 - 10773 | GS, Gr, M, NPG | GS, Gr, M, NPG |
| 6 | 10856 - 11003 | GS, Gr, M, NPG | GS, Gr, M, NPG |
| 7 | 11107 - 11487 | GS, Gr, M, NPG | GS, Gr |
Alternate exons not used in building the gene model: GRAIL fails to select an initial exon. It begins the gene model with an internal exon from 7686 to 7752. Other exons are from 8078 to 8112, 8297 to 8713, from 8870 to 9226 and from 9445 to 9675. In addition, GRAIL splits exon 6 resulting in exons from 10856 to 10303 and from 10338 to 11003; the intron is 34 nt. MZEF predicts exons from 7686 to 7752, from 8370 to 8713, and from 11107 to 11376. NPG predicts many splice donor and acceptor sites in this region. Notable potential false splice sites are splice acceptors at 7686 (cofidence score = 0.86) and at 9851 (0.90) and splice donors at 7752 (1.00) and at 9524 (0.92).
Complete CDS of T10M13.2
ATGGCGTTGAAGGCGGTGGTTATTGTATATTGTGTGGTTTCTCTCGTCAGTGTTCAATTA GCTGATGCTCAACATGAAGGACTGCCAGTTTCACCAACGTTATCACCTTCAACTTCACCA GTTATCACTGATCTGCCCCTACCAGCTGAATTTCCGCGGTTTCACAGAAAGTATTTCGCA CCACAACAAGCAGAAGCACCTCAGCATTCTCCCCCTTATAGTCGTTTGGTCGCTTCTGAT CATCCCCCTACCAGCTCACATTTCTCCAAACCTTCCATGAAAAGGAATGCTCAGTCTCCT GGAGCCGGCTTGGCTGATATTGCTCCAGCACAATCTAGCAATGGTGTTCTTCCTGATGCC TTAACTCAGCCACCTTTGTCGCCCTCCATTTCAAATTGTTGCAAATCAGATATGGTGCTT AAACGAAGAAGTATTGGTTGCCACTGCGTGTATCCTATAAAACTGGACATCCTTCTCTTG AATGTTTCAGAAACTCCTAGTTGGAACATGTTCTTGAACGAATTTGCTACCCAGCTTGGT CTCCTACCTCACCAAATCGAGCTGATTAACTTCTATGTGCTAAGCTTATCAAGGATGAAC ATATCGATGGATATCACCCCTCATTCTGGAATTAGTTTCTCAGCTAGTCAGGCATCCGCA ATAAACTCTTCCCTTATCAGCCACAAGATTCAATTTAGCCCTACTTTGGTGGGAGATTAC AAACTTCTAAACCTTACTTGGTTTGAGGCCCCTGCACCTTCGCAAGCACCTCTAGTGGCT TCTTCACCTCATAAAGCACCATCACAAGGATCCTCAGCAACTACGTCAGTAAGATCTCCA GGGAAAAAGAGGCATCCCAATCTTATTCTTATCTTTTCTATAGCCGCTGGTGTGCTTATA CTTGCCATAATCACTGTACTTGTTATTTGTTCCCGCGCACTCCGAGAAGAGAAAGCTCCA GATCCTCACAAAGAAGCTGTAAAACCAAGGAACCTGGACGCTGGTTCATTTGGGGGATCT CTTCCTCACCCAGCAAGTACACGGTTTCTGTCATATGAAGAACTCAAAGAGGCAACTAGC AATTTTGAATCTGCTAGCATTCTAGGAGAAGGTGGGTTTGGCAAGGTTTACAGAGGCATC TTAGCCGATGGTACTGCTGTAGCGATTAAGAAGCTCACAAGTGGTGGGCCACAAGGTGAT AAAGAATTCCAGGTGGAGATTGATATGCTTAGCCGTCTTCATCATCGTAATCTTGTGAAA CTTGTGGGTTACTATAGTAGTCGAGATTCTTCTCAGCACCTACTTTGTTATGAGCTTGTT CCAAATGGCAGCCTCGAGGCTTGGCTCCATGGGCCTCTCGGGTTGAACTGTCCTCTTGAT TGGGACACCAGAATGAAGATTGCACTTGATGCTGCAAGAGGACTTGCATACCTTCATGAA GACTCGCAACCCTCCGTTATACACAGAGATTTTAAAGCCTCTAATATACTCCTTGAAAAC AACTTCAACGCCAAAGTTGCAGATTTTGGCCTAGCCAAACAAGCTCCTGAAGGCAGGGGT AATCACTTATCTACTCGTGTTATGGGCACATTTGGATATGTTGCGCCTGAATATGCAATG ACGGGACACCTACTCGTCAAGAGTGATGTTTATAGTTACGGTGTGGTCCTTCTCGAATTG TTAACTGGTAGAAAACCTGTGGATATGTCACAACCTTCAGGCCAAGAAAATCTCGTCACT TGGACAAGGCCAGTTTTAAGAGACAAAGACCGGTTAGAAGAACTAGTCGATTCAAGACTT GAAGGAAAATACCCGAAAGAAGATTTCATAAGAGTATGCACAATCGCTGCAGCTTGTGTT GCACCTGAAGCTAGCCAGAGACCAACGATGGGCGAAGTGGTTCAGTCACTTAAAATGGTT CAACGGGTGGTTGAGTATCAAGACCCGGTTTTAAACACTTCAAATAAAGCTCGTCCTAAC CGGAGACAATCATCAGCTACGTTCGAGTCAGAAGTAACCTCTTCTATGTTCTCTTCTGGT CCTTATTCTGGTCTAAGCGCTTTTGATCATGAAAATATTACACGAACAACTGTTTTCTCA GAAGATCTTCACGAAGGCCGATGA
Protein translation:
MALKAVVIVYCVVSLVSVQLADAQHEGLPVSPTLSPSTSPVITDLPLPAEFPRFHRKYFA PQQAEAPQHSPPYSRLVASDHPPTSSHFSKPSMKRNAQSPGAGLADIAPAQSSNGVLPDA LTQPPLSPSISNCCKSDMVLKRRSIGCHCVYPIKLDILLLNVSETPSWNMFLNEFATQLG LLPHQIELINFYVLSLSRMNISMDITPHSGISFSASQASAINSSLISHKIQFSPTLVGDY KLLNLTWFEAPAPSQAPLVASSPHKAPSQGSSATTSVRSPGKKRHPNLILIFSIAAGVLI LAIITVLVICSRALREEKAPDPHKEAVKPRNLDAGSFGGSLPHPASTRFLSYEELKEATS NFESASILGEGGFGKVYRGILADGTAVAIKKLTSGGPQGDKEFQVEIDMLSRLHHRNLVK LVGYYSSRDSSQHLLCYELVPNGSLEAWLHGPLGLNCPLDWDTRMKIALDAARGLAYLHE DSQPSVIHRDFKASNILLENNFNAKVADFGLAKQAPEGRGNHLSTRVMGTFGYVAPEYAM TGHLLVKSDVYSYGVVLLELLTGRKPVDMSQPSGQENLVTWTRPVLRDKDRLEELVDSRL EGKYPKEDFIRVCTIAAACVAPEASQRPTMGEVVQSLKMVQRVVEYQDPVLNTSNKARPN RRQSSATFESEVTSSMFSSGPYSGLSAFDHENITRTTVFSEDLHEGR*
Protein kinase signature:
| There are eleven regions that comprise the general kinase motif (see Hanks SK, Quinn AM, Hunter T (1988) Science 241:42-52). While all eleven regions are found in protein T10M13.2, many do not form good signature sequences for pattern searching. The three regions that do present good, searchable motifs are presented below. | |
| General kinase motif regions I and II present from residues 368 to 390. Region I is known to bind ATP; region II is involved in phosphate transfer. | |
| sequence | 368 LGEGGFGKVYRGILADGTAVAIK 390 |
| General kinase motif region VI present from residues 486 to 498. This region is known to make important contacts with the protein substrate. | |
| sequence | 486 VIHRDFKASNILL 498 |
Alignment of T10M13.2 (bottom) x AthPRKINA (L07248)
With only 43% identity and 52% similarity between these two protein kinase sequences, it is likely that they phosphorylate different protein substrates and are regulated by different biochemical pathways.
. . . . .
1 ............................................MGGCFS 6
|
251 PAPSQAPLVASSPHKAPSQGSSATTSVRSPGKKRHPNLILIFSIAAGVLI 300
. . . . .
7 NRIKTDIASSTWLSSKFLSRDGSKGSSTASFSYMPRTEGEILQNANLKNF 56
| | : . . . | | . | : |. :
301 LAIITVLVICSRALREEKAPDPHKEAVKPRNLDAGSFGGSLPHPASTRFL 350
. . . . .
57 SLSELKSATRNFRPDSVVGEGGFGCVFKGWIDESSLAPSKPGTGIVIAVK 106
| ||| || || |:.|||||| |::| : : | :|:|
351 SYEELKEATSNFESASILGEGGFGKVYRGILAD..........GTAVAIK 390
. . . . .
107 RLNQEGFQGHREWLAEINYLGQLDHPNLVKLIGY..CLEEEHRLLVYEFM 154
:| | || :|. ||. | .| | |||||:|| : || || .
391 KLTSGGPQGDKEFQVEIDMLSRLHHRNLVKLVGYYSSRDSSQHLLCYELV 440
. . . . .
155 TRGSLENHLFRRGTFYQPLSWNTRVRMALGAARGLAFLH.NAQPQVIYRD 203
|||| | || |.||.:.|| ||||||:|| ..|| ||:||
441 PNGSLEAWLHGPLGLNCPLDWDTRMKIALDAARGLAYLHEDSQPSVIHRD 490
. . . . .
204 FKASNILLDSNYNAKLSDFGLARDGPMGDNSHVSTRVMGTQGYAAPEYLA 253
||||||||:.|:|||..|||||: | | .|.||||||| || ||||
491 FKASNILLENNFNAKVADFGLAKQAPEGRGNHLSTRVMGTFGYVAPEYAM 540
. . . . .
254 TGHLSVKSDVYSFGVVLLELLSGRRAIDKNQPVVEHNLVDWARPYLTNKR 303
|||| |||||||:||||||||.||: :| .|| : ||| | || | .|
541 TGHLLVKSDVYSYGVVLLELLTGRKPVDMSQPSGQENLVTWTRPVLRDKD 590
. . . . .
304 RLLRVMDPRLQGQYSLTRALKIAVLALDCISIDAKSRPTMNEIVKTMEEL 353
|| ..| ||:|.| ::: :| |:. :| |||| |:|..:. .
591 RLEELVDSRLEGKYPKEDFIRVCTIAAACVAPEASQRPTMGEVVQSLKMV 640
. . . . .
354 HIQKEASKEQQNPQISIDNIINKSPQAVNYPRPSIM*............. 390
: | |.| :. | . . . | .
641 ....QRVVEYQDPVLNTSNKARPNRRQSSATFESEVTSSMFSSGPYSGLS 686
written 29 Jul 97
updated 2 Aug 98
Larry Parnell