| Gene | T17A5.2 |
| Putative Identification | protein kinase |
| Position | 56927 to 61991, from the initial methionine to the termination codon |
| Strand | + |
| EST match | T43472 and T20930 |
| Database match | soybean PK6 |
CDS: The table below lists the coordinates of the T17A5.2 exons and which exon prediction algorithm selected the 5' or 3' terminus (GF = Genefinder, GS = GenScan, Gr = Grail, M = MZEF, NPG = NetPlantGene - selects splice sites, not exons). Splice sites determined by identity to an EST are designated as EST.
| Exon | Range | 5' | 3' |
|---|---|---|---|
| 1 | 56927 to 57129 | GS,Gr | GS,NPG |
| 2 | 57441 to 57555 | GS,Gr,M,NPG | GS,Gr,M,NPG |
| 3 | 57634 to 57713 | GS,Gr,M,NPG | GS,Gr,M,NPG |
| 4 | 57811 to 57921 | GS,M,NPG | GS,M,NPG |
| 5 | 58036 to 58090 | GS,Gr,M,NPG | EST,GF,GS,Gr,NPG |
| 6 | 58237 to 58347 | EST,GF,GS,Gr,M,NPG | EST,GF,GS,Gr,M,NPG |
| 7 | 58450 to 58500 | EST,GF,Gr,M,NPG | EST,GF,Gr,M,NPG |
| 8 | 58652 to 58824 | EST,GF,GS,Gr,M,NPG | EST,GF,GS,Gr,M,NPG |
| 9 | 58913 to 59020 | EST,GF,Gr,M,NPG | EST,M |
| 10 | 59251 to 59324 | EST,GS,NPG | GS,NPG |
| 11 | 59406 to 59581 | NPG | GF,Gr,NPG |
| 12 | 59700 to 59830 | GS,NPG | GS,Gr,M,NPG |
| 13 | 59955 to 60035 | GS,Gr | GF,GS,Gr,NPG |
| 14 | 60128 to 60181 | GS,Gr,M,NPG | GS,Gr,M,NPG |
| 15 | 60275 to 60409 | GF,GS,Gr,M,NPG | GS,Gr,M |
| 16 | 61154 to 61254 | GS,Gr,M,NPG | GS,Gr,M,NPG |
| 17 | 61359 to 61535 | GS,Gr,M,NPG | GS,Gr,M,NPG |
| 18 | 61610 to 61991 | GS,Gr,M,NPG | GS |
Complete CDS of T17A5.2
ATGACGATCAAAGATGAGTCGGAGAGTTGCGGTAGCAGAGCCGTCGTTGCTTCGCCGTCA CAAGAAAACCCTAGACATTACCGGATGAAACTTGATGTCTATAGTGAGGTTTTACAGCGA CTTCAAGAATCTAATTACGAAGAAGCCACTCTTCCTGATTTCGAGGATCAACTTTGGCTC CATTTCAATCGTCTTCCTGCTCGATATGCTCTTGATGTTAAAGTCGAGAGAGCGGAAGAT GTTCTCACACATCAGAGATTGCTAAAATTGGCTGCAGATCCTGCTACTAGGCCTGTCTTT GAAGTTCGAAGTGTACAGGTTTCTCCCAGAATCTCTGCTGACTCTGACCCTGCGGTGGAG GAAGATGCTCAAAGCTCTCACCAACCAAGCGGACCGGGGGTTCTTGCTCCTCCAACTTTT GGTTCTTCTCCAAATTTTGAGGCTATTACTCAGGGAAGTAAAATTGTTGAAGATGTTGAT AGTGTTGTGAATGCAACATTGTCTACACGACCGATGCACGAGATCACTTTTTCAACCATT GATAAACCGAAACTCCTTAGCCAGCTAACTTCCCTGCTTGGTGAGCTTGGACTGAATATA CAAGAGGCTCATGCTTTTTCCACTGTAGATGGTTTCTCTTTAGATGTCTTTGTAGTTGAC GGTTGGTCTCAGGAGGAAACTGATGGCCTAAGAGATGCATTGAGCAAAGAAATACTGAAG CTTAAGGATCAACCTGGTTCAAAACAGAAATCTATTTCTTTCTTTGAGCATGACAAATCA AGCAATGAGCTTATACCCGCCTGCATTGAAATACCCACGGATGGAACTGATGAGTGGGAA ATCGACGTGACTCAGCTCAAAATTGAAAAGAAAGTGGCATCTGGTTCATATGGGGATCTG CATAGAGGCACTTATTGCAGTCAGGAAGTAGCTATCAAATTTCTCAAGCCTGATCGTGTA AACAATGAGATGCTGAGAGAATTTTCTCAAGAAGTTTTTATAATGAGGAAAGTTCGACAC AAAAACGTCGTTCAATTTTTGGGTGCATGCACAAGATCTCCAACCCTCTGTATAGTGACT GAGTTTATGGCTCGAGGGAGCATATATGATTTTCTTCACAAACAGAAATGCGCTTTCAAA CTTCAAACTTTACTCAAAGTTGCACTTGATGTCGCAAAAGGAATGAGCTATTTGCATCAA AACAACATTATTCACAGGGACCTTAAGACTGCGAATCTTCTTATGGATGAACATGGACTT GTCAAGGTTGCTGATTTCGGAGTTGCCAGAGTGCAGATTGAATCAGGGGTCATGACTGCT GAAACTGGGACATACCGGTGGATGGCTCCAGAGGTCATTGAGCACAAACCTTACAATCAC AAGGCAGATGTGTTCAGTTATGCGATAGTGCTATGGGAACTTCTGACTGGTGACATCCCA TATGCTTTCTTGACTCCACTACAAGCAGCTGTTGGCGTTGTCCAAAAGGGGCTTCGACCC AAAATCCCAAAGAAAACACACCCAAAAGTGAAGGGGCTTCTAGAGAGATGCTGGCATCAA GACCCAGAACAGAGACCACTGTTTGAGGAAATCATAGAAATGCTACAACAGATAATGAAA GAGCCGGTGACTGTTTTTGGGTCTGCTTCAATTGCAGTGGAAGAGATGGTGTTCTTGAGT TGGGGCCGTCCATCTTCCGAGCAACAGCAACAAGTCATCAACAAAACAGGAACCTTCAAT TATGACAACAAGTACAGAGGAGTTTCGTCTAGGTCTATTGCTAAACTTAAGGAAGATTCA GAGATTGATAAAGATGGATTCTTGATCAATCATGCTCGTGTTTTAGTCGGTTCTGGTAGA GAGAGTTATGAGAAGGGCAAAAAAGCTCTCCAGAATTGGAAGCATTTTGGTATGGATTGG GCATTTGTTGATCCTGCAACACCTGTTGAAACCGGTAAGAAGTTTTGCATTTGCGTGAAA GAAGTTCTTCCATGGGTGATGCTTCCTCTACAAGTGGTTTATGTTGACGAAAGCCGGAAA TCAAGAAAAGGCCCTGCGCATTTCGGTTACGGAAGCGGTACTCTTCAGGGACATTTACTC GCTGGAGAAGAAAAGTTTTCGATAGAGCTTGATGGAAATGGTGAGGTATGGTATGAGATA ACGTCTTTCTCAAAGCCTGCTCATTTCTTGTCGTTCCTCGGGTATCCTTATGTGAAGCTA AGGCAGAAGCACTTTGCTCGTCATTCTTCTGAAGCTGTGCTGAAACATGTCAATGCTTCG TGA
Protein sequence:
MTIKDESESCGSRAVVASPSQENPRHYRMKLDVYSEVLQRLQESNYEEATLPDFEDQLWL HFNRLPARYALDVKVERAEDVLTHQRLLKLAADPATRPVFEVRSVQVSPRISADSDPAVE EDAQSSHQPSGPGVLAPPTFGSSPNFEAITQGSKIVEDVDSVVNATLSTRPMHEITFSTI DKPKLLSQLTSLLGELGLNIQEAHAFSTVDGFSLDVFVVDGWSQEETDGLRDALSKEILK LKDQPGSKQKSISFFEHDKSSNELIPACIEIPTDGTDEWEIDVTQLKIEKKVASGSYGDL HRGTYCSQEVAIKFLKPDRVNNEMLREFSQEVFIMRKVRHKNVVQFLGACTRSPTLCIVT EFMARGSIYDFLHKQKCAFKLQTLLKVALDVAKGMSYLHQNNIIHRDLKTANLLMDEHGL VKVADFGVARVQIESGVMTAETGTYRWMAPEVIEHKPYNHKADVFSYAIVLWELLTGDIP YAFLTPLQAAVGVVQKGLRPKIPKKTHPKVKGLLERCWHQDPEQRPLFEEIIEMLQQIMK EPVTVFGSASIAVEEMVFLSWGRPSSEQQQQVINKTGTFNYDNKYRGVSSRSIAKLKEDS EIDKDGFLINHARVLVGSGRESYEKGKKALQNWKHFGMDWAFVDPATPVETGKKFCICVK EVLPWVMLPLQVVYVDESRKSRKGPAHFGYGSGTLQGHLLAGEEKFSIELDGNGEVWYEI TSFSKPAHFLSFLGYPYVKLRQKHFARHSSEAVLKHVNAS*
Alignment of T17A5.2 and SoyPK6. There is significant sequence conservation within elements of the general kinase motif (noted by Roman numerals, after Hanks et al., 1988), but notable differences between the two sequences indicate that these two kinases likely have distinct biochemical properties.
. . . . .
T17A5.2 101 EVRSVQVSPRISADSDPAVEEDAQSSHQPSGPGVLAPPTFGSS..PNFEA 148
| : | . :
SoyPK6 1 ..........................MGEDGNSWIRRTNFSHTVCHRLDP 24
. . . . .
T17A5.2 149 ITQGSKIVEDVDSVVNATLSTRPMHEITFSTIDKPKLLSQLTSLLGELGL 198
|| : . | | .|: | | |
SoyPK6 25 ARLGSIPISVQSEQKSRPSSKAQRHPMTYKQRSLSPLPE..TYLSEAFRE 72
. . . . .
T17A5.2 199 NIQEAHAFSTVDGFSLDVFVVDGWSQEETDGLRDALSKEILKLKDQPGSK 248
| ||| . . ..: : . | ||
SoyPK6 73 ARLEQKRFSTPNPRREKRIMGKLLNKDSRETKESSSKSPSRSPNRQVKSK 122
. . . . .
T17A5.2 249 QKSISFFEHDKSSNELIPACIEIPTDGTDEWEIDVTQLKIEKKVASGSYG 298
: | . . :| :|| :|..|| | | |.:
SoyPK6 123 NRKDSAWTKLLDNGGGKITAVET....AEEWNVDMSQLFFGLKFAHGAHS 168
. II . . III .
T17A5.2 299 DLHRGTYCSQEVAIKFLK.PDRVNNEML.....REFSQEVFIMRKVRHKN 342
|: | | : ||:| : |: | | ::| .|| :: :. |.|
SoyPK6 169 RLYHGVYKDEAVAVKIIMVPEDDGNGALASRLEKQFIREVTLLSRLHHQN 218
. . IV . . . V
T17A5.2 343 VVQFLGACTRSPTLCIVTEFMARGSIYDFLHK.QKCAFKLQTLLKVALDV 391
|:.| || : | ||:||::| ||: :||| : || |: |||:
SoyPK6 219 VIKFSAACRKPPVYCIITEYLAEGSLRAYLHKLEHQTISLQKLIAFALDI 268
. . VI . .VII .
T17A5.2 392 AKGMSYLHQNNIIHRDLKTANLLMDEHGLVKVADFGVARVQIESGVMTAE 441
|:|| |:| :|||||| |:|..| .|:||||:| : .: :
SoyPK6 269 ARGMEYIHSQGVIHRDLKPENILINEDNHLKIADFGIACEEASCDLLADD 318
. VIII . IX . .
T17A5.2 442 TGTYRWMAPEVIEHKPYNHKADVFSYAIVLWELLTGDIPYAFLTPLQAAV 491
|||||||||.|. | | | ||:|: ::|||:||| ||| : |:|||
SoyPK6 319 PGTYRWMAPEMIKRKSYGKKVDVYSFGLILWEMLTGTIPYEDMNPIQAAF 368
. . . .XI .
T17A5.2 492 GVVQKGLRPKIPKKTHPKVKGLLERCWHQDPEQRPLFEEIIEMLQQIMKE 541
|| | || || | .: |:|.|| |:.|| | :::..|:| .
SoyPK6 369 AVVNKNSRPIIPSNCPPAMRALIEQCWSLQPDKRPEFWQVVKILEQ.FES 417
. . . . .
T17A5.2 542 PVTVFGSASIAVEEMVFLSWGRPSSEQQQQVINKTGTFNYDNKYRGVSSR 591
. |. |: | |. :.. |:
SoyPK6 418 SLASDGTLSLVPNPCWDHKKGLLHWIQKLGPLHQNSGPVPKPKFT*.... 463
created 31 Oct 97
Larry Parnell