| Gene | F6P23.8 |
| Putative Identification | leucine-rich repeat disease resistance protein |
| Position | 29039 to 36198, from the initial methionine to the termination codon |
| Strand | - |
| EST match | none |
| Database match | resistance gene homolog ATFCA7 from Z97342 |
CDS: The table below lists the coordinates of each of the F6P23.8 exons and indicates which gene prediction program selected the 3' and 5' termini (GS = GenScan, Gr = GRAIL, M = MZEF).
| Exon | Range | 3' | 5' |
| 1 | 35131 to 36198 | GS,Gr,M | GS |
| 2 | 34739 to 35038 | GS,Gr,M | GS,Gr,M |
| 3 | 33231 to 34652 | GS | GS,Gr,M |
| 4 | 32947 to 33140 | GS | GS |
| 5 | 32577 to 32745 | GS | GS |
| 6 | 31893 to 32440 | GS | GS |
| 7 | 31530 to 31681 | GS | GS |
| 8 | 29039 to 29205 | GS | GS |
Complete CDS of F6P23.8
ATGGATTATGGTGCTTGTGAACTAGTGGAAGACATTGCTAGGGATATGTATGAAAAGATT TTCCCTACCAAACGAATTGGGATCTACAGGAAGATGCTGAAGTTAGAAAAAATTGTTTAC AAGCAACTATGGGGTATCCGCAGCATAGGCATATGGGGTATGCCAGGCATAGGAAAAACA ACACTTGCCGAAGCAGCATTTGACCAATTTTCTGGTGATTATGAAGCTTCTTGCATCATC AAAGACTTCGACAAAGAATTTCTTGCGAAAGGACTTTATCATTTGTGGAATGAATACTTA GGGGAAAATATAAACAATTCTTTTATCAAGTCCGGACAAAAAAGACTTCTTATCGTCCTT GACAATGTTCTTAAACCTCTGGATGCAGACGCTTTTCTTAATGGGTTTGATTGGTTTGGT CCAGGGAGCCTGATAATCATAACCTCTAGAGATAAACAAGTACTAGTGCAGTGTGGCGTC AACCAAATATATGAGGTTGAAGGTTTAAACAAAGATGAGGCTAAGCAACTACTTCACGGA TGCGCATTTGGAATAGACTGGAGAAAACAGAGTGGGTTGGAAACTCTTGCACCTTACTAT ATTTCTGTCAAATATTTTAGTGGAAACCCTCTAGCTCTTAGCCTTTATGAAGAAATGTTG TCACACATGAAGTCAGATAAAATGGAGGTTAAGCTCCTCAAGCTCAACCACCCTCCACCT CAGATTATGGAAGTATTCAAGAGCAATTACAATGCACTTAATGAGAACGAGAAGAGCATG TTTCTAGACATTGCTTGTTTCTTCAGGGGAGAAAAGGCGGACTACGTGATGCAACTGTTT GAAGGATGTGGATTCTTTCCACATGTTGGAATATATGTTCTAGTGGATAAGTGTCTGGTG ACTATTGTAAAAAGAAAGATGGAAATGCATAATCTGATCCAGATTGTTGGGAAGGCGATT TCAAATGAAGGAACTGTAGAACTTGATAGGCATGTCAGACTATGGGACACTTCAATCATT CAACCTCTGCTAGAAGATGAAGAAACCAAATTAAAGGGTGAATCTAAGGGTACTACTGAA GATATTGAAGTCATATTTCTAGACATGTCTAACTTGAAATTCTTTGTCAAACCTGATGCT TTCAAGAGTATGCATAATCTTAGATTTCTGAAGATTTATAGTTCTAATCCTGGAAAGCAT CAGAGAATTCGCTTTCGAGAAGCTCTTCAGTCTCTTCCCAATGAGCTACGGCTACTCCAC TGGGAGGATTATCCTCTACAATCCCTGCCACAACATTTTGATCCTACGCACCTTGTTGAA CTCAACATGCCTTACAGTAAACTTCAAAAACTTTGGGGTGGAACCAAGAACCTTGAGATG TTGAAGATGGTCAGGCTTAGTCACTCGCAAGATCTAGTTGAAATTGAAGAACTGATAAAA TCTAAAAATATTGAAGTAATCGATCTCCAAGGTTGTACAAAAATACAGAGTTTTCCAGCT ACAAGACATCTACAACATCTCAGAGTTATTAATCTTTCTGGTTGCGTTGAGATCAAAAGC ACACAGCTCGAAGAATTTCAGGGCTTTCCAAGGAACCTGAAAGAATTATATCTTTCTGGT ACTGGGATAAGAGAAGTGACATCATCAATCCACCTCTCCTCACTTGAAGTTTTGGATTTG TCTAACTGCAAAAGACTTCAAAACTTGCCCATGGGAAAGGGTAATTTGGCTTCTCTTATT AAACTCATGTTATCAGGGTGTTCAAAGCTCCAGAATATTCAAGATCTCCCAACAAACCTG AAAGAGCTATATCTTGCTGGGACCTCTATAAGAGAAGTTCCATCATCAATCTGTCATCTC ACTCAACTTGTTGTTTTCGATGCCGAGAACTGCAAGAAGCTTCAAGACTTGCCTATGGGA ATGGGTAATTTGATCTCTCTGACTATGTTAATTTTATCTGGCTGCTCAGAGCTCAGGAGT ATCCCCGATCTTCCACGGAACCTTAGACATTTAAATCTAGCCGAGACCCCCATAAAGAAA CTGCCATCATCATTTGAGGATCTCACTAAACTGGTTTCACTAGATTTGAATCACTGCGAA AGGCTTCAACATCTCCAAATGGAATCTTTCGAATCAGTAGTTAGGGTGGATCTATCTGGC TGCTTAGAGCTGAAGTATATCCTAGGTTTCTCACTTCAAGACATTACACAACTTCATGAA GATGGGACCGACAAAGTGATGTTACATGGAACTCCTCCATGTAACGTCACCTTAATTTTA GAGACGTGGAGAACAAGACATGTCACTCCAATGGAAAAGAGTGGGTCTAAATTTTATCTT AAGCTTATGCCATTTGTAACCACGCCATATCGTTCAAAGTTGCAATCTTCCCTAGTGTTC CGTATGTATGCCATGGTATCTTTGTTTCTCAGTAAAGCATACCTCCTTGACATACATATA CCTCAAGAGATATGTAATCTCCTTTCACTCAAGACATTGGATCTCAGTGGAAACAATTTT GGTAAACTTCCTGAAAGCATCAAGCAGTTCCGCAATTTGGAGAGCCTTATATTATGTCAT TGCAAAAACCTCGAATCACTTCCAGAGCTTCCTCAAAGTCTAGAGTTTTTGAATGCACAT GGTTGTGTGTGTCTAAAAAATATTCATAGGAGCTTCCAACAGTTTCCTAGACATTGCACA TTCAGCAATTGCTTTGAGATTTCTCCAGACATCGTTAGGGAGATTTTAGAAGCAAGAGTT GCGCAAATGGTCATAGATCATACTCTGCAGAAGCTCATCGAAGCACCAGCATTCAGTTTC TCTGTTCCAGCATTCCGTGATCCAAATTACATCTTTCACTTAAACCGAGGTTCATCTGTG ATGATACGTTTAACTCCTAGCATAGAAACGCTTTTGGGGTTCCAAATATCAGTTGCAGTT GCATTTTGGAATGATTCCTACAGTAATGCCGGTTTTGGAATCAGGGAGAACAAGATTCTT GATGATTGTTGCACTGTTACAGAGTGTGGAGTATATGCAATTACTGAAAACGTAGACCAA ACAAATCTTGACTTTAGAGGACCGTCTTTTGCCTTGTTACCACCGTATAAGAAGCGAAAA CGAAGTTTCTCGGGATCAGAAGACATTGAAATGGAAAACCAGAGGTTGAACATCTCAAAA ACCAAACAAGGGGATGTCCCAGAAAAGATAAGCCAAGCCGATACGGCTTATCTCACATCA CCTTCTTTGCTTCAACGTCGGAGCCATCAGGTTTTTTTGAGCTTCAGTGAGGATGTCCCA AGATATTTTGTCAGCTATTTAATCAAGAAATTAAAATGGATTGGCATTACTGTAGTATAT AGTGGATTCATGGGAGGCAAGTCTATGAGTCGTCCCGAGGTAACACAAGCCATAGAAGAA TCAAGTATCTCAGTCGTCATTCTATCAAAAGACTATGTTTCTTCAAGTAAGTGCTTGGAT GAATTGGTGGAGATCATAAGGTGGAGGGAAGAAAACTTGGGAAACAGAGTGATGCCAATT TACTATGAAATGGGTACATCTGATGTAATGAAGCAAGCCAAAACTATTGGGAATAGATTG GTGGAAACCTATTTGGGGAAAGTAGTAGAGAAACCAGAGCTGAGATGGATGAGAGCTTTG GCATACATTGTCAATATAGTCGGTGAATCTTCCCAATACTGGGTTGACAAAGCGAAGATG ATTGAGAAGACTGTTGTGGATGTCTCAAATCAAATGAATATCTTGGAATCAAATGAAGCT GGTTTATTGTTCATTTACCAGGAAGAGGAGAACATGGAAAACTTTAAGAGAAACGTTTAC GATGAGATGAACGGGTATATGTCCTCTGTTTCTCGAAAACAGAGAATTAGGCCGTTAGCT TCACTCACCGAGAGAACCGTCCAAGATCAACCACATTTGTTTACGGATGGAAACAAGACA AGGACCTTCACGAGTGAGTCACCCACATTGAGCAAATTACGCAGAAGAAGAAGTGGATGA
Protein translation:
MDYGACELVEDIARDMYEKIFPTKRIGIYRKMLKLEKIVYKQLWGIRSIGIWGMPGIGKT TLAEAAFDQFSGDYEASCIIKDFDKEFLAKGLYHLWNEYLGENINNSFIKSGQKRLLIVL DNVLKPLDADAFLNGFDWFGPGSLIIITSRDKQVLVQCGVNQIYEVEGLNKDEAKQLLHG CAFGIDWRKQSGLETLAPYYISVKYFSGNPLALSLYEEMLSHMKSDKMEVKLLKLNHPPP QIMEVFKSNYNALNENEKSMFLDIACFFRGEKADYVMQLFEGCGFFPHVGIYVLVDKCLV TIVKRKMEMHNLIQIVGKAISNEGTVELDRHVRLWDTSIIQPLLEDEETKLKGESKGTTE DIEVIFLDMSNLKFFVKPDAFKSMHNLRFLKIYSSNPGKHQRIRFREALQSLPNELRLLH WEDYPLQSLPQHFDPTHLVELNMPYSKLQKLWGGTKNLEMLKMVRLSHSQDLVEIEELIK SKNIEVIDLQGCTKIQSFPATRHLQHLRVINLSGCVEIKSTQLEEFQGFPRNLKELYLSG TGIREVTSSIHLSSLEVLDLSNCKRLQNLPMGKGNLASLIKLMLSGCSKLQNIQDLPTNL KELYLAGTSIREVPSSICHLTQLVVFDAENCKKLQDLPMGMGNLISLTMLILSGCSELRS IPDLPRNLRHLNLAETPIKKLPSSFEDLTKLVSLDLNHCERLQHLQMESFESVVRVDLSG CLELKYILGFSLQDITQLHEDGTDKVMLHGTPPCNVTLILETWRTRHVTPMEKSGSKFYL KLMPFVTTPYRSKLQSSLVFRMYAMVSLFLSKAYLLDIHIPQEICNLLSLKTLDLSGNNF GKLPESIKQFRNLESLILCHCKNLESLPELPQSLEFLNAHGCVCLKNIHRSFQQFPRHCT FSNCFEISPDIVREILEARVAQMVIDHTLQKLIEAPAFSFSVPAFRDPNYIFHLNRGSSV MIRLTPSIETLLGFQISVAVAFWNDSYSNAGFGIRENKILDDCCTVTECGVYAITENVDQ TNLDFRGPSFALLPPYKKRKRSFSGSEDIEMENQRLNISKTKQGDVPEKISQADTAYLTS PSLLQRRSHQVFLSFSEDVPRYFVSYLIKKLKWIGITVVYSGFMGGKSMSRPEVTQAIEE SSISVVILSKDYVSSSKCLDELVEIIRWREENLGNRVMPIYYEMGTSDVMKQAKTIGNRL VETYLGKVVEKPELRWMRALAYIVNIVGESSQYWVDKAKMIEKTVVDVSNQMNILESNEA GLLFIYQEEENMENFKRNVYDEMNGYMSSVSRKQRIRPLASLTERTVQDQPHLFTDGNKT RTFTSESPTLSKLRRRRSG*
Protein motifs: Several disease resistance proteins contain a signature sequence for an ATP/GTP-binding site. This P-loop domain is found in F6P23.8 from residue 53 to 60 [GMPGIGKT].
written 18 Oct 97
Larry Parnell