Gene: F6P23.8
Putative identification: leucine-rich repeat disease resistance protein
Position: 29039 to 36198, from the initial methionine to the termination codon
Strand: -
EST match: none
Database match: resistance gene homolog ATFCA7 from Z97342

 

CDS:  The table below lists the coordinates of each of the F6P23.8 exons and indicates which gene prediction program selected the 3' and 5' termini (GS = GenScan, Gr = GRAIL, M = MZEF).

Exon  Range           3'       5'
1     35131 to 36198  GS,Gr,M  GS
2     34739 to 35038  GS,Gr,M  GS,Gr,M
3     33231 to 34652  GS       GS,Gr,M
4     32947 to 33140  GS       GS
5     32577 to 32745  GS       GS
6     31893 to 32440  GS       GS
7     31530 to 31681  GS       GS
8     29039 to 29205  GS       GS
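
As a quick consistency check on the exon table, the exon lengths can be summed and tested for divisibility by three. The sketch below is illustrative only: the names EXONS and exon_lengths are hypothetical, and it assumes the listed coordinates are inclusive at both ends.

```python
# Exon coordinates copied from the table above as (start, end).
# Assumption: both ends are inclusive positions on the clone sequence.
EXONS = [
    (35131, 36198),
    (34739, 35038),
    (33231, 34652),
    (32947, 33140),
    (32577, 32745),
    (31893, 32440),
    (31530, 31681),
    (29039, 29205),
]


def exon_lengths(exons):
    """Return the length in bp of each exon, counting both endpoints."""
    return [end - start + 1 for start, end in exons]


lengths = exon_lengths(EXONS)
cds_length = sum(lengths)

print(lengths)
print(cds_length, cds_length % 3)  # total CDS length and remainder mod 3
```

Under that assumption the eight exons sum to 4020 bp, a multiple of three, as expected for a CDS that runs from the initial methionine through the termination codon.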

Complete CDS of F6P23.8

ATGGATTATGGTGCTTGTGAACTAGTGGAAGACATTGCTAGGGATATGTATGAAAAGATT
TTCCCTACCAAACGAATTGGGATCTACAGGAAGATGCTGAAGTTAGAAAAAATTGTTTAC
AAGCAACTATGGGGTATCCGCAGCATAGGCATATGGGGTATGCCAGGCATAGGAAAAACA
ACACTTGCCGAAGCAGCATTTGACCAATTTTCTGGTGATTATGAAGCTTCTTGCATCATC
AAAGACTTCGACAAAGAATTTCTTGCGAAAGGACTTTATCATTTGTGGAATGAATACTTA
GGGGAAAATATAAACAATTCTTTTATCAAGTCCGGACAAAAAAGACTTCTTATCGTCCTT
GACAATGTTCTTAAACCTCTGGATGCAGACGCTTTTCTTAATGGGTTTGATTGGTTTGGT
CCAGGGAGCCTGATAATCATAACCTCTAGAGATAAACAAGTACTAGTGCAGTGTGGCGTC
AACCAAATATATGAGGTTGAAGGTTTAAACAAAGATGAGGCTAAGCAACTACTTCACGGA
TGCGCATTTGGAATAGACTGGAGAAAACAGAGTGGGTTGGAAACTCTTGCACCTTACTAT
ATTTCTGTCAAATATTTTAGTGGAAACCCTCTAGCTCTTAGCCTTTATGAAGAAATGTTG
TCACACATGAAGTCAGATAAAATGGAGGTTAAGCTCCTCAAGCTCAACCACCCTCCACCT
CAGATTATGGAAGTATTCAAGAGCAATTACAATGCACTTAATGAGAACGAGAAGAGCATG
TTTCTAGACATTGCTTGTTTCTTCAGGGGAGAAAAGGCGGACTACGTGATGCAACTGTTT
GAAGGATGTGGATTCTTTCCACATGTTGGAATATATGTTCTAGTGGATAAGTGTCTGGTG
ACTATTGTAAAAAGAAAGATGGAAATGCATAATCTGATCCAGATTGTTGGGAAGGCGATT
TCAAATGAAGGAACTGTAGAACTTGATAGGCATGTCAGACTATGGGACACTTCAATCATT
CAACCTCTGCTAGAAGATGAAGAAACCAAATTAAAGGGTGAATCTAAGGGTACTACTGAA
GATATTGAAGTCATATTTCTAGACATGTCTAACTTGAAATTCTTTGTCAAACCTGATGCT
TTCAAGAGTATGCATAATCTTAGATTTCTGAAGATTTATAGTTCTAATCCTGGAAAGCAT
CAGAGAATTCGCTTTCGAGAAGCTCTTCAGTCTCTTCCCAATGAGCTACGGCTACTCCAC
TGGGAGGATTATCCTCTACAATCCCTGCCACAACATTTTGATCCTACGCACCTTGTTGAA
CTCAACATGCCTTACAGTAAACTTCAAAAACTTTGGGGTGGAACCAAGAACCTTGAGATG
TTGAAGATGGTCAGGCTTAGTCACTCGCAAGATCTAGTTGAAATTGAAGAACTGATAAAA
TCTAAAAATATTGAAGTAATCGATCTCCAAGGTTGTACAAAAATACAGAGTTTTCCAGCT
ACAAGACATCTACAACATCTCAGAGTTATTAATCTTTCTGGTTGCGTTGAGATCAAAAGC
ACACAGCTCGAAGAATTTCAGGGCTTTCCAAGGAACCTGAAAGAATTATATCTTTCTGGT
ACTGGGATAAGAGAAGTGACATCATCAATCCACCTCTCCTCACTTGAAGTTTTGGATTTG
TCTAACTGCAAAAGACTTCAAAACTTGCCCATGGGAAAGGGTAATTTGGCTTCTCTTATT
AAACTCATGTTATCAGGGTGTTCAAAGCTCCAGAATATTCAAGATCTCCCAACAAACCTG
AAAGAGCTATATCTTGCTGGGACCTCTATAAGAGAAGTTCCATCATCAATCTGTCATCTC
ACTCAACTTGTTGTTTTCGATGCCGAGAACTGCAAGAAGCTTCAAGACTTGCCTATGGGA
ATGGGTAATTTGATCTCTCTGACTATGTTAATTTTATCTGGCTGCTCAGAGCTCAGGAGT
ATCCCCGATCTTCCACGGAACCTTAGACATTTAAATCTAGCCGAGACCCCCATAAAGAAA
CTGCCATCATCATTTGAGGATCTCACTAAACTGGTTTCACTAGATTTGAATCACTGCGAA
AGGCTTCAACATCTCCAAATGGAATCTTTCGAATCAGTAGTTAGGGTGGATCTATCTGGC
TGCTTAGAGCTGAAGTATATCCTAGGTTTCTCACTTCAAGACATTACACAACTTCATGAA
GATGGGACCGACAAAGTGATGTTACATGGAACTCCTCCATGTAACGTCACCTTAATTTTA
GAGACGTGGAGAACAAGACATGTCACTCCAATGGAAAAGAGTGGGTCTAAATTTTATCTT
AAGCTTATGCCATTTGTAACCACGCCATATCGTTCAAAGTTGCAATCTTCCCTAGTGTTC
CGTATGTATGCCATGGTATCTTTGTTTCTCAGTAAAGCATACCTCCTTGACATACATATA
CCTCAAGAGATATGTAATCTCCTTTCACTCAAGACATTGGATCTCAGTGGAAACAATTTT
GGTAAACTTCCTGAAAGCATCAAGCAGTTCCGCAATTTGGAGAGCCTTATATTATGTCAT
TGCAAAAACCTCGAATCACTTCCAGAGCTTCCTCAAAGTCTAGAGTTTTTGAATGCACAT
GGTTGTGTGTGTCTAAAAAATATTCATAGGAGCTTCCAACAGTTTCCTAGACATTGCACA
TTCAGCAATTGCTTTGAGATTTCTCCAGACATCGTTAGGGAGATTTTAGAAGCAAGAGTT
GCGCAAATGGTCATAGATCATACTCTGCAGAAGCTCATCGAAGCACCAGCATTCAGTTTC
TCTGTTCCAGCATTCCGTGATCCAAATTACATCTTTCACTTAAACCGAGGTTCATCTGTG
ATGATACGTTTAACTCCTAGCATAGAAACGCTTTTGGGGTTCCAAATATCAGTTGCAGTT
GCATTTTGGAATGATTCCTACAGTAATGCCGGTTTTGGAATCAGGGAGAACAAGATTCTT
GATGATTGTTGCACTGTTACAGAGTGTGGAGTATATGCAATTACTGAAAACGTAGACCAA
ACAAATCTTGACTTTAGAGGACCGTCTTTTGCCTTGTTACCACCGTATAAGAAGCGAAAA
CGAAGTTTCTCGGGATCAGAAGACATTGAAATGGAAAACCAGAGGTTGAACATCTCAAAA
ACCAAACAAGGGGATGTCCCAGAAAAGATAAGCCAAGCCGATACGGCTTATCTCACATCA
CCTTCTTTGCTTCAACGTCGGAGCCATCAGGTTTTTTTGAGCTTCAGTGAGGATGTCCCA
AGATATTTTGTCAGCTATTTAATCAAGAAATTAAAATGGATTGGCATTACTGTAGTATAT
AGTGGATTCATGGGAGGCAAGTCTATGAGTCGTCCCGAGGTAACACAAGCCATAGAAGAA
TCAAGTATCTCAGTCGTCATTCTATCAAAAGACTATGTTTCTTCAAGTAAGTGCTTGGAT
GAATTGGTGGAGATCATAAGGTGGAGGGAAGAAAACTTGGGAAACAGAGTGATGCCAATT
TACTATGAAATGGGTACATCTGATGTAATGAAGCAAGCCAAAACTATTGGGAATAGATTG
GTGGAAACCTATTTGGGGAAAGTAGTAGAGAAACCAGAGCTGAGATGGATGAGAGCTTTG
GCATACATTGTCAATATAGTCGGTGAATCTTCCCAATACTGGGTTGACAAAGCGAAGATG
ATTGAGAAGACTGTTGTGGATGTCTCAAATCAAATGAATATCTTGGAATCAAATGAAGCT
GGTTTATTGTTCATTTACCAGGAAGAGGAGAACATGGAAAACTTTAAGAGAAACGTTTAC
GATGAGATGAACGGGTATATGTCCTCTGTTTCTCGAAAACAGAGAATTAGGCCGTTAGCT
TCACTCACCGAGAGAACCGTCCAAGATCAACCACATTTGTTTACGGATGGAAACAAGACA
AGGACCTTCACGAGTGAGTCACCCACATTGAGCAAATTACGCAGAAGAAGAAGTGGATGA

 

Protein translation:

MDYGACELVEDIARDMYEKIFPTKRIGIYRKMLKLEKIVYKQLWGIRSIGIWGMPGIGKT
TLAEAAFDQFSGDYEASCIIKDFDKEFLAKGLYHLWNEYLGENINNSFIKSGQKRLLIVL
DNVLKPLDADAFLNGFDWFGPGSLIIITSRDKQVLVQCGVNQIYEVEGLNKDEAKQLLHG
CAFGIDWRKQSGLETLAPYYISVKYFSGNPLALSLYEEMLSHMKSDKMEVKLLKLNHPPP
QIMEVFKSNYNALNENEKSMFLDIACFFRGEKADYVMQLFEGCGFFPHVGIYVLVDKCLV
TIVKRKMEMHNLIQIVGKAISNEGTVELDRHVRLWDTSIIQPLLEDEETKLKGESKGTTE
DIEVIFLDMSNLKFFVKPDAFKSMHNLRFLKIYSSNPGKHQRIRFREALQSLPNELRLLH
WEDYPLQSLPQHFDPTHLVELNMPYSKLQKLWGGTKNLEMLKMVRLSHSQDLVEIEELIK
SKNIEVIDLQGCTKIQSFPATRHLQHLRVINLSGCVEIKSTQLEEFQGFPRNLKELYLSG
TGIREVTSSIHLSSLEVLDLSNCKRLQNLPMGKGNLASLIKLMLSGCSKLQNIQDLPTNL
KELYLAGTSIREVPSSICHLTQLVVFDAENCKKLQDLPMGMGNLISLTMLILSGCSELRS
IPDLPRNLRHLNLAETPIKKLPSSFEDLTKLVSLDLNHCERLQHLQMESFESVVRVDLSG
CLELKYILGFSLQDITQLHEDGTDKVMLHGTPPCNVTLILETWRTRHVTPMEKSGSKFYL
KLMPFVTTPYRSKLQSSLVFRMYAMVSLFLSKAYLLDIHIPQEICNLLSLKTLDLSGNNF
GKLPESIKQFRNLESLILCHCKNLESLPELPQSLEFLNAHGCVCLKNIHRSFQQFPRHCT
FSNCFEISPDIVREILEARVAQMVIDHTLQKLIEAPAFSFSVPAFRDPNYIFHLNRGSSV
MIRLTPSIETLLGFQISVAVAFWNDSYSNAGFGIRENKILDDCCTVTECGVYAITENVDQ
TNLDFRGPSFALLPPYKKRKRSFSGSEDIEMENQRLNISKTKQGDVPEKISQADTAYLTS
PSLLQRRSHQVFLSFSEDVPRYFVSYLIKKLKWIGITVVYSGFMGGKSMSRPEVTQAIEE
SSISVVILSKDYVSSSKCLDELVEIIRWREENLGNRVMPIYYEMGTSDVMKQAKTIGNRL
VETYLGKVVEKPELRWMRALAYIVNIVGESSQYWVDKAKMIEKTVVDVSNQMNILESNEA
GLLFIYQEEENMENFKRNVYDEMNGYMSSVSRKQRIRPLASLTERTVQDQPHLFTDGNKT
RTFTSESPTLSKLRRRRSG*
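
The translation above can be spot-checked against the CDS with the standard genetic code. The following is a minimal sketch, not the tool used to produce this record: translate and CODON_TABLE are hypothetical names, and only the first and last 60-nt lines of the CDS are used. Each 60-nt line holds exactly 20 codons, so individual lines stay in frame.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA string in frame 1; a trailing partial codon is ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the CDS listed in this record.
first_line = "ATGGATTATGGTGCTTGTGAACTAGTGGAAGACATTGCTAGGGATATGTATGAAAAGATT"
last_line = "AGGACCTTCACGAGTGAGTCACCCACATTGAGCAAATTACGCAGAAGAAGAAGTGGATGA"

print(translate(first_line))  # MDYGACELVEDIARDMYEKI
print(translate(last_line))   # RTFTSESPTLSKLRRRRSG*
```

Both results agree with the start and end of the protein translation, including the terminal stop (*).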

 

Protein motifs:  Several disease resistance proteins contain the signature sequence of an ATP/GTP-binding site (P-loop). In F6P23.8 this P-loop motif spans residues 53 to 60 [GMPGIGKT].
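
The P-loop position can be reproduced with a simple pattern search. The sketch below assumes the commonly cited consensus [AG]-x(4)-G-K-[ST] for the ATP/GTP-binding site motif A; this record does not state which pattern was actually used, and find_p_loops is a hypothetical helper. Only the first 60 residues of the translation are searched.

```python
import re

# Assumed consensus for the P-loop: [AG]-x(4)-G-K-[ST].
P_LOOP = re.compile(r"[AG].{4}GK[ST]")

# First 60 residues of the F6P23.8 protein translation.
n_terminus = "MDYGACELVEDIARDMYEKIFPTKRIGIYRKMLKLEKIVYKQLWGIRSIGIWGMPGIGKT"


def find_p_loops(protein):
    """Return (start, end, motif) tuples using 1-based, inclusive residue numbers."""
    return [(m.start() + 1, m.end(), m.group()) for m in P_LOOP.finditer(protein)]


hits = find_p_loops(n_terminus)
print(hits)  # [(53, 60, 'GMPGIGKT')]
```

Run on the full translation, the same function would report any additional matches to the pattern elsewhere in the protein.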

 


written 18 Oct 97
Larry Parnell