Gene T10M13.2
Putative Identification serine/threonine protein kinase
Position 8320 to 11487, from the initial methionine to the termination codon
Strand +
EST match T44402
Database match A. thaliana ATHPRKINA (L07248) protein kinase

 

CDS:  The table below lists the coordinates of the T10M13.2 exons and which exon prediction algorithms selected the 5' and 3' termini (GS = GenScan, Gr = GRAIL, M = MZEF, NPG = NetPlantGene - selects splice sites only, not exons). Splice sites determined by identity to EST T44402 are designated by EST.

Exon Range 5' 3'
1 8320 - 8713 GS GS, Gr, M, NPG
2 8855 - 9226 GS, M, NPG GS, Gr, M, NPG
3 9463 - 9675 GS, M, NPG EST, GS, Gr, M, NPG
4 10037 - 10408 EST, GS, Gr, M, NPG GS, Gr, M, NPG
5 10530 - 10773 GS, Gr, M, NPG GS, Gr, M, NPG
6 10856 - 11003 GS, Gr, M, NPG GS, Gr, M, NPG
7 11107 - 11487 GS, Gr, M, NPG GS, Gr

Alternate exons not used in building the gene model:   GRAIL fails to select an initial exon. It begins the gene model with an internal exon from 7686 to 7752. Other exons are from 8078 to 8112, 8297 to 8713, from 8870 to 9226 and from 9445 to 9675. In addition, GRAIL splits exon 6 resulting in exons from 10856 to 10303 and from 10338 to 11003; the intron is 34 nt. MZEF predicts exons from 7686 to 7752, from 8370 to 8713, and from 11107 to 11376. NPG predicts many splice donor and acceptor sites in this region. Notable potential false splice sites are splice acceptors at 7686 (cofidence score = 0.86) and at 9851 (0.90) and splice donors at 7752 (1.00) and at 9524 (0.92).

Complete CDS of T10M13.2

ATGGCGTTGAAGGCGGTGGTTATTGTATATTGTGTGGTTTCTCTCGTCAGTGTTCAATTA
GCTGATGCTCAACATGAAGGACTGCCAGTTTCACCAACGTTATCACCTTCAACTTCACCA
GTTATCACTGATCTGCCCCTACCAGCTGAATTTCCGCGGTTTCACAGAAAGTATTTCGCA
CCACAACAAGCAGAAGCACCTCAGCATTCTCCCCCTTATAGTCGTTTGGTCGCTTCTGAT
CATCCCCCTACCAGCTCACATTTCTCCAAACCTTCCATGAAAAGGAATGCTCAGTCTCCT
GGAGCCGGCTTGGCTGATATTGCTCCAGCACAATCTAGCAATGGTGTTCTTCCTGATGCC
TTAACTCAGCCACCTTTGTCGCCCTCCATTTCAAATTGTTGCAAATCAGATATGGTGCTT
AAACGAAGAAGTATTGGTTGCCACTGCGTGTATCCTATAAAACTGGACATCCTTCTCTTG
AATGTTTCAGAAACTCCTAGTTGGAACATGTTCTTGAACGAATTTGCTACCCAGCTTGGT
CTCCTACCTCACCAAATCGAGCTGATTAACTTCTATGTGCTAAGCTTATCAAGGATGAAC
ATATCGATGGATATCACCCCTCATTCTGGAATTAGTTTCTCAGCTAGTCAGGCATCCGCA
ATAAACTCTTCCCTTATCAGCCACAAGATTCAATTTAGCCCTACTTTGGTGGGAGATTAC
AAACTTCTAAACCTTACTTGGTTTGAGGCCCCTGCACCTTCGCAAGCACCTCTAGTGGCT
TCTTCACCTCATAAAGCACCATCACAAGGATCCTCAGCAACTACGTCAGTAAGATCTCCA
GGGAAAAAGAGGCATCCCAATCTTATTCTTATCTTTTCTATAGCCGCTGGTGTGCTTATA
CTTGCCATAATCACTGTACTTGTTATTTGTTCCCGCGCACTCCGAGAAGAGAAAGCTCCA
GATCCTCACAAAGAAGCTGTAAAACCAAGGAACCTGGACGCTGGTTCATTTGGGGGATCT
CTTCCTCACCCAGCAAGTACACGGTTTCTGTCATATGAAGAACTCAAAGAGGCAACTAGC
AATTTTGAATCTGCTAGCATTCTAGGAGAAGGTGGGTTTGGCAAGGTTTACAGAGGCATC
TTAGCCGATGGTACTGCTGTAGCGATTAAGAAGCTCACAAGTGGTGGGCCACAAGGTGAT
AAAGAATTCCAGGTGGAGATTGATATGCTTAGCCGTCTTCATCATCGTAATCTTGTGAAA
CTTGTGGGTTACTATAGTAGTCGAGATTCTTCTCAGCACCTACTTTGTTATGAGCTTGTT
CCAAATGGCAGCCTCGAGGCTTGGCTCCATGGGCCTCTCGGGTTGAACTGTCCTCTTGAT
TGGGACACCAGAATGAAGATTGCACTTGATGCTGCAAGAGGACTTGCATACCTTCATGAA
GACTCGCAACCCTCCGTTATACACAGAGATTTTAAAGCCTCTAATATACTCCTTGAAAAC
AACTTCAACGCCAAAGTTGCAGATTTTGGCCTAGCCAAACAAGCTCCTGAAGGCAGGGGT
AATCACTTATCTACTCGTGTTATGGGCACATTTGGATATGTTGCGCCTGAATATGCAATG
ACGGGACACCTACTCGTCAAGAGTGATGTTTATAGTTACGGTGTGGTCCTTCTCGAATTG
TTAACTGGTAGAAAACCTGTGGATATGTCACAACCTTCAGGCCAAGAAAATCTCGTCACT
TGGACAAGGCCAGTTTTAAGAGACAAAGACCGGTTAGAAGAACTAGTCGATTCAAGACTT
GAAGGAAAATACCCGAAAGAAGATTTCATAAGAGTATGCACAATCGCTGCAGCTTGTGTT
GCACCTGAAGCTAGCCAGAGACCAACGATGGGCGAAGTGGTTCAGTCACTTAAAATGGTT
CAACGGGTGGTTGAGTATCAAGACCCGGTTTTAAACACTTCAAATAAAGCTCGTCCTAAC
CGGAGACAATCATCAGCTACGTTCGAGTCAGAAGTAACCTCTTCTATGTTCTCTTCTGGT
CCTTATTCTGGTCTAAGCGCTTTTGATCATGAAAATATTACACGAACAACTGTTTTCTCA
GAAGATCTTCACGAAGGCCGATGA

 

Protein translation:

MALKAVVIVYCVVSLVSVQLADAQHEGLPVSPTLSPSTSPVITDLPLPAEFPRFHRKYFA
PQQAEAPQHSPPYSRLVASDHPPTSSHFSKPSMKRNAQSPGAGLADIAPAQSSNGVLPDA
LTQPPLSPSISNCCKSDMVLKRRSIGCHCVYPIKLDILLLNVSETPSWNMFLNEFATQLG
LLPHQIELINFYVLSLSRMNISMDITPHSGISFSASQASAINSSLISHKIQFSPTLVGDY
KLLNLTWFEAPAPSQAPLVASSPHKAPSQGSSATTSVRSPGKKRHPNLILIFSIAAGVLI
LAIITVLVICSRALREEKAPDPHKEAVKPRNLDAGSFGGSLPHPASTRFLSYEELKEATS
NFESASILGEGGFGKVYRGILADGTAVAIKKLTSGGPQGDKEFQVEIDMLSRLHHRNLVK
LVGYYSSRDSSQHLLCYELVPNGSLEAWLHGPLGLNCPLDWDTRMKIALDAARGLAYLHE
DSQPSVIHRDFKASNILLENNFNAKVADFGLAKQAPEGRGNHLSTRVMGTFGYVAPEYAM
TGHLLVKSDVYSYGVVLLELLTGRKPVDMSQPSGQENLVTWTRPVLRDKDRLEELVDSRL
EGKYPKEDFIRVCTIAAACVAPEASQRPTMGEVVQSLKMVQRVVEYQDPVLNTSNKARPN
RRQSSATFESEVTSSMFSSGPYSGLSAFDHENITRTTVFSEDLHEGR*

 

Protein kinase signature:

  There are eleven regions that comprise the general kinase motif (see Hanks SK, Quinn AM, Hunter T (1988) Science 241:42-52). While all eleven regions are found in protein T10M13.2, many do not form good signature sequences for pattern searching. The three regions that do present good, searchable motifs are presented below.
  General kinase motif regions I and II present from residues 368 to 390. Region I is known to bind ATP; region II is involved in phosphate transfer.
sequence
   368  LGEGGFGKVYRGILADGTAVAIK  390
  General kinase motif region VI present from residues 486 to 498. This region is known to make important contacts with the protein substrate.
sequence
   486  VIHRDFKASNILL  498

 

Alignment of T10M13.2 (bottom) x AthPRKINA (L07248)

With only 43% identity and 52% similarity between these two protein kinase sequences, it is likely that they phosphorylate different protein substrates and are regulated by different biochemical pathways.

                  .         .         .         .         .
       1 ............................................MGGCFS 6
                                                       |   
     251 PAPSQAPLVASSPHKAPSQGSSATTSVRSPGKKRHPNLILIFSIAAGVLI 300
                  .         .         .         .         .
       7 NRIKTDIASSTWLSSKFLSRDGSKGSSTASFSYMPRTEGEILQNANLKNF 56
           | | :   .    .  . |  | .            | :   |. :  
     301 LAIITVLVICSRALREEKAPDPHKEAVKPRNLDAGSFGGSLPHPASTRFL 350
                  .         .         .         .         .
      57 SLSELKSATRNFRPDSVVGEGGFGCVFKGWIDESSLAPSKPGTGIVIAVK 106
         |  ||| || ||   |:.|||||| |::| : :          |  :|:|
     351 SYEELKEATSNFESASILGEGGFGKVYRGILAD..........GTAVAIK 390
                  .         .         .         .         .
     107 RLNQEGFQGHREWLAEINYLGQLDHPNLVKLIGY..CLEEEHRLLVYEFM 154
         :|   | || :|.  ||. | .| | |||||:||    :    || || .
     391 KLTSGGPQGDKEFQVEIDMLSRLHHRNLVKLVGYYSSRDSSQHLLCYELV 440
                  .         .         .         .         .
     155 TRGSLENHLFRRGTFYQPLSWNTRVRMALGAARGLAFLH.NAQPQVIYRD 203
           ||||  |        || |.||.:.|| ||||||:|| ..|| ||:||
     441 PNGSLEAWLHGPLGLNCPLDWDTRMKIALDAARGLAYLHEDSQPSVIHRD 490
                  .         .         .         .         .
     204 FKASNILLDSNYNAKLSDFGLARDGPMGDNSHVSTRVMGTQGYAAPEYLA 253
         ||||||||:.|:|||..|||||:  | |  .|.||||||| || ||||  
     491 FKASNILLENNFNAKVADFGLAKQAPEGRGNHLSTRVMGTFGYVAPEYAM 540
                  .         .         .         .         .
     254 TGHLSVKSDVYSFGVVLLELLSGRRAIDKNQPVVEHNLVDWARPYLTNKR 303
         |||| |||||||:||||||||.||: :| .||  : ||| | || | .| 
     541 TGHLLVKSDVYSYGVVLLELLTGRKPVDMSQPSGQENLVTWTRPVLRDKD 590
                  .         .         .         .         .
     304 RLLRVMDPRLQGQYSLTRALKIAVLALDCISIDAKSRPTMNEIVKTMEEL 353
         ||  ..| ||:|.|     :::  :|  |:. :|  |||| |:|..:. .
     591 RLEELVDSRLEGKYPKEDFIRVCTIAAACVAPEASQRPTMGEVVQSLKMV 640
                  .         .         .         .         .
     354 HIQKEASKEQQNPQISIDNIINKSPQAVNYPRPSIM*............. 390
             :   | |.| :.  |    . .  .    | .              
     641 ....QRVVEYQDPVLNTSNKARPNRRQSSATFESEVTSSMFSSGPYSGLS 686

 


written 29 Jul 97
updated 2 Aug 98
Larry Parnell