Gene: F6P23.9
Putative identification: leucine-rich repeat disease resistance protein
Position: 42999 to 47465, from the initial methionine to the termination codon
Strand: +
EST match: none
Database match: tobacco TMV-resistance protein N and A. thaliana downy mildew resistance protein RPP5

 

CDS: The table below lists the coordinates of each F6P23.9 exon and the gene-prediction algorithms that selected its 5' and 3' termini (GS = GenScan, Gr = GRAIL, M = MZEF).

Exon  Range           5' terminus  3' terminus
1     42999 to 43483  GS           GS,Gr,M
2     44031 to 45264  GS,Gr,M      GS,Gr,M
3     45352 to 45666  GS,Gr,M      GS,Gr
4     45747 to 46712  GS,Gr,M      GS,Gr
5     46804 to 47101  GS,Gr        Gr
6     47140 to 47465  Gr           Gr
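
As a quick consistency check on the exon table, the exon lengths can be summed and tested for divisibility by three. The following is a minimal Python sketch, not part of the original annotation; the names EXONS and exon_lengths are illustrative, and the coordinates are assumed to be 1-based and inclusive, which is consistent with the stated gene position (42999 to 47465).

```python
# Exon coordinates copied from the table (assumed 1-based, inclusive).
EXONS = [
    (42999, 43483),
    (44031, 45264),
    (45352, 45666),
    (45747, 46712),
    (46804, 47101),
    (47140, 47465),
]


def exon_lengths(exons):
    """Return the length in nucleotides of each (start, end) exon."""
    return [end - start + 1 for start, end in exons]


lengths = exon_lengths(EXONS)
total = sum(lengths)

print(lengths)           # [485, 1234, 315, 966, 298, 326]
print(total, total % 3)  # 3624 0
```

The total of 3624 nt is a multiple of three and corresponds to 1208 codons, that is, 1207 residues plus the termination codon, which agrees with the lengths of the CDS and protein translation listed on this page.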

Complete CDS of F6P23.9

ATGAAAGAAAGTCCCACTTTGAGAAGATATTCCATGGAGGCCTCACGGCAACCTCAAGTG
TTCATCAACTTCCGGGGAAGTGAACTACGGTATACCTTTGTCTACTATCTCAGGACGGCC
TTGGTAAAAAACGGGATCAACGTTTTTACAGACAACATGGAGCCGAAAGGCCGTAACCAG
AAAATTTTATTCAAGAGAATCGAAGAGTCGAAGATCGCACTGGCGATTTTCTCCTCAAGG
TATACGGAGTCAAGTTGGTGTCTGGAAGAGCTGGTAAAGATGAAAGAATGCATGGATGCG
GAGAAACTCGTGATCATTCCCATCTTCTACATCGTGACTCCATACACCATTAAAAAGCAG
ATGGGAGACTTTGGTGACAAGTTTAGAGTGCTGGTGGATTATGTTGATGATGTGACGGAG
AAAAAATGGACTGATGCTTTGAAGTCTGTTCCCTTAATTTTAGGCATCACTTACGACGGG
CAGAGCGAAGAACAACTTTTAATCAATCAAATCGTTGGGGAGGTTCAGAGAGTAATAAAG
ATAATTTCACAAGGAGAAGGAGATGAAAAAAATAAAATGGTGTGCACAAATACCTCTACA
GGTTCAAGCTTTATCCCACAAAATAGAAACATGGTAGATCCTGAAAATCAAATTGAGCTC
GTTGGACTCAGCCAACGCCTCAAGGAACTAAAAGAAAAGTTGGACCTTAGCCGCAAGGAA
ACTCGCATTGTTGGGGTTTTAGGGATGCCCGGAATCGGCAAGACCACACTGGTGAAGAGA
TTGTATGATGAGTGGAAACACAATTTCCAGCGTCACTTGCATATGGTGAATATCCGTCAG
AAGTCGAAGGAATACGGGACGCATTCCCTAGAGAGAATGATTCTGAAAGAGTTGCTTAGC
GACACTTACAATGATATAACTGAAGAGATGACGTACGCTTCTGTAAAGGATGAGCTGCTG
AAGAAGAAAGTTCTTCTTGTTCTCGATGACGTGAGTAGCAAGAAACAAATACAAGGTCTT
CTCGGGAATCTTAACTGGATTAGGAAAGGAAGCAGGATTGTTATTACTACGCGTGACAAG
ATATCCATAAGCCAGTTCGAGTATACTTATGTTGTCCCAAGATTGAATATCACAGATGGC
TTAAAGCAATTCAGCTTTTATGCCTTTGAAGATCACAACTGTCCATACCCGGGGAATCTC
ATGGATCTTTCCACAAAGTTCGTAGATTATGCCAGAGGCAATCCCTTAGCTCTCAAGATA
TTGGGTCGGGAGCTTCTTTCGATAGACAAGGATCAGTGGCCAAAGAGACTCGATACTCTG
GCACAACTTCCCATCCCGTATATACAGGATCTATTGAGAGCAAGTTATGATGATCTGAGC
AATCAACAGAAAGAAGTTTTTCTTGTCGTAGCCTGGTTCTTTGGATCAGGGGATGAATAT
TACATTAGGAGTTTAGTGGATACAGAAGATCCTGATTCTGCCGATGATGCTGCTAGCGAA
GTAAGAGATTTCGCAGGCAACTTACTAATTAGCATCTCAAGTGGTCGATTGGAGATGCAT
GATTTAATGGCTACGTTCGCCAAAAAACTTTGTTCATCCTTATCTAATGAAAATAACTAT
GGATACCAAATGATTTGGAATCATGAAAGCTTTAATGCTGCGGCTAAAAATAAAAGGATG
AGATATGTCAATCAACCAAGGAAAAAAGTCACAGAATCGGAAATGGACAATGTCATGGGT
ATATTGCTGGACGTGTCTGAAATGGATAATAACATGACCTTAGATTCTAAATTCTTCAGC
GAGATGTGCAATCTACGGTACCTCAAAGTCTACAATTCACAATGCTCTCGAGATTGTGAT
GTTGGTTGCAAACTGACCTTCCCAGATGGACTTAAATGCTCAATGGAAAATGTCCGATAT
CTCTATTGGCTACAATTCCCATTGAAGAAGCTTTCAAAAGCATTTAACCCTAAGAATCTT
ATCGAGCTCAACCTCCCTTACAGCAAAATTACACGACTTTGGAAGGAAAGTAAGGAAATA
TCCAAACTAAAATGGGTCGATCTCAGCCACTCGAGTGAGCTATGCGATATATCAGGGTTA
ATAGGAGCTCATAATATTAGAAGATTGAATCTTGAAGGCTGCATAGAATTGAAAACATTA
CCACAAGAGATGCAAGAAATGGAAAGTCTGATTTATCTAAACCTGGGAGGATGCACGCGT
CTTGTGTCTCTTCCAGAGTTTAAGTTGAAATCCCTAAAGACTCTCATCTTAAGTCACTGC
AAGAACTTTGAACAATTTCCGGTTATTTCAGAATGTTTAGAAGCTCTTTACTTGCAAGGC
ACGGCAATAAAGTGTATTCCTACCTCCATTGAGAACCTTCAGAAACTAATCCTATTGGAT
CTAAAAGACTGCGAAGTATTGGTAAGTCTTCCTGATTGTCTTGGAAATCTAAGATCTCTT
CAAGAGCTAATACTCTCTGGCTGCTCAAAGTTAAAGTTTTTTCCAGAGTTGAAGGAGACC
ATGAAATCAATAAAGATTTTGCTGCTTGATGGAACAGCCATTAAGCAGATGCCAATATTA
TTGCAGTGTATTCAGTCACAAGGCCACTCTGTTGCAAATAAGACGCTTCCAAACAGTTTA
AGTGATTACTACCTACCTTCCTCATTGTTGTCTTTATGCTTAAGTGGAAATGATATCGAG
AGCTTGCATGCTAACATCAGCCAGCTTTATCATCTGAAATGGCTTGACTTGAAGAATTGC
AAGAAGCTTAAATCTGTGTCGGTGCTTCCACCAAACCTCAAGTGCTTAGATGCACATGGT
TGTGACTCACTGGAAGAAGTTGGAAGCCCTCTAGCGGTTCTCATGGTGACAGGAAAAATC
CATTGCACATACATTTTCACAAACTGCAACAAATTGGATCAAGTTGCAGAAAGTAACATC
ATATCTTTCACTTGGAGGAAAAGCCAGATGATGTCAGATGCTCTGAATCGCTACAATGGG
GGATTTGTTTTGGAATCTTTGGTCAGCACTTGCTTTCCTGGATGTGAAGTACCTGCATCA
TTCGATCACCAAGCCTATGGAGCATTGTTACAGACGAAATTACCTCGGCACTGGTGTGAT
AGTAGGCTTACTGGGATAGCTTTATGCGCTGTTATATTGTTTCCGGACTACCAACATCAA
AGCAATCGTTTCTTGGTGAAATGCACTTGTGAGTTCGGAACTGAAGATGGGCCATGTATC
AGCTTTAGTTCCATTGTTGGAGGTTGGAGCGAACCAGGCTATGAGCCACGACTATTAAAC
ATTAACAAACGTCACGTGGAAAAGCATGGGAATGGATGTATTCCTTCTAAGGCTTCACTC
AGATTTCAAGTCACAGATGGTGCGAGTGAGGTAGGAAATTGCCATGTCCTGAAATGTGGC
TTTACTTTAGTGTATACACCAAACGATAGTGATGATATCTCTCCGGCGAGAGTAGTTGAT
ATCACTACACGGGATAAGGAGGATGGCCTAGAAAATGCAACTTCTAACAAACTTTCAAGA
AACGATTATGAATTCTCGCATCAGTCAAACTGTGGAGTGACTTCGAGAAGGGATGAATGT
TTCCTTTCTGAAGAGCTTCCATAA

 

Protein translation:

MKESPTLRRYSMEASRQPQVFINFRGSELRYTFVYYLRTALVKNGINVFTDNMEPKGRNQ
KILFKRIEESKIALAIFSSRYTESSWCLEELVKMKECMDAEKLVIIPIFYIVTPYTIKKQ
MGDFGDKFRVLVDYVDDVTEKKWTDALKSVPLILGITYDGQSEEQLLINQIVGEVQRVIK
IISQGEGDEKNKMVCTNTSTGSSFIPQNRNMVDPENQIELVGLSQRLKELKEKLDLSRKE
TRIVGVLGMPGIGKTTLVKRLYDEWKHNFQRHLHMVNIRQKSKEYGTHSLERMILKELLS
DTYNDITEEMTYASVKDELLKKKVLLVLDDVSSKKQIQGLLGNLNWIRKGSRIVITTRDK
ISISQFEYTYVVPRLNITDGLKQFSFYAFEDHNCPYPGNLMDLSTKFVDYARGNPLALKI
LGRELLSIDKDQWPKRLDTLAQLPIPYIQDLLRASYDDLSNQQKEVFLVVAWFFGSGDEY
YIRSLVDTEDPDSADDAASEVRDFAGNLLISISSGRLEMHDLMATFAKKLCSSLSNENNY
GYQMIWNHESFNAAAKNKRMRYVNQPRKKVTESEMDNVMGILLDVSEMDNNMTLDSKFFS
EMCNLRYLKVYNSQCSRDCDVGCKLTFPDGLKCSMENVRYLYWLQFPLKKLSKAFNPKNL
IELNLPYSKITRLWKESKEISKLKWVDLSHSSELCDISGLIGAHNIRRLNLEGCIELKTL
PQEMQEMESLIYLNLGGCTRLVSLPEFKLKSLKTLILSHCKNFEQFPVISECLEALYLQG
TAIKCIPTSIENLQKLILLDLKDCEVLVSLPDCLGNLRSLQELILSGCSKLKFFPELKET
MKSIKILLLDGTAIKQMPILLQCIQSQGHSVANKTLPNSLSDYYLPSSLLSLCLSGNDIE
SLHANISQLYHLKWLDLKNCKKLKSVSVLPPNLKCLDAHGCDSLEEVGSPLAVLMVTGKI
HCTYIFTNCNKLDQVAESNIISFTWRKSQMMSDALNRYNGGFVLESLVSTCFPGCEVPAS
FDHQAYGALLQTKLPRHWCDSRLTGIALCAVILFPDYQHQSNRFLVKCTCEFGTEDGPCI
SFSSIVGGWSEPGYEPRLLNINKRHVEKHGNGCIPSKASLRFQVTDGASEVGNCHVLKCG
FTLVYTPNDSDDISPARVVDITTRDKEDGLENATSNKLSRNDYEFSHQSNCGVTSRRDEC
FLSEELP*
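
The translation above can be spot-checked against the CDS with the standard genetic code. The sketch below is illustrative only: the names CODON_TABLE and translate are hypothetical, and the use of the standard nuclear code is an assumption (the page does not state which table was used). It translates the first and last lines of the CDS as listed.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a DNA string in frame 1; trailing partial codons are ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the CDS listing (both begin on a codon boundary).
first_line = "ATGAAAGAAAGTCCCACTTTGAGAAGATATTCCATGGAGGCCTCACGGCAACCTCAAGTG"
last_line = "TTCCTTTCTGAAGAGCTTCCATAA"

print(translate(first_line))  # MKESPTLRRYSMEASRQPQV
print(translate(last_line))   # FLSEELP*
```

Both results match the start and end of the protein translation given above.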

Protein motifs: Several disease resistance proteins contain a signature sequence for an ATP/GTP-binding site (P-loop). In F6P23.9 this motif spans residues 248 to 255 [GMPGIGKT].
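
The motif position can be reproduced with a simple pattern search. The sketch below assumes the PROSITE consensus for the ATP/GTP-binding site motif A, [AG]-x(4)-G-K-[ST]; the page does not say which pattern or tool was originally used, so this is an assumption, and the function name find_p_loops is hypothetical. The input is the fifth line of the protein translation, which covers residues 241 to 300.

```python
import re

# PROSITE-style P-loop consensus [AG]-x(4)-G-K-[ST], written as a regex.
P_LOOP = re.compile(r"[AG].{4}GK[ST]")


def find_p_loops(protein, offset=1):
    """Return (start, end, match) tuples with 1-based inclusive residue numbers.

    `offset` is the residue number of the first character of `protein`.
    Only non-overlapping matches are reported.
    """
    return [
        (m.start() + offset, m.end() + offset - 1, m.group())
        for m in P_LOOP.finditer(protein)
    ]


# Residues 241 to 300 of the F6P23.9 translation.
fragment = "TRIVGVLGMPGIGKTTLVKRLYDEWKHNFQRHLHMVNIRQKSKEYGTHSLERMILKELLS"

print(find_p_loops(fragment, offset=241))  # [(248, 255, 'GMPGIGKT')]
```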

 


written 18 Oct 97
Larry Parnell