Gene F5I10.20
Putative Identification membrane-associated protein
Position 56387 - 60410, from the initial methionine to the termination codon
Strand -
EST match H76978
Database match Arabidopsis hypothetical protein and putative membrane-associated proteins of S. cerevisiae and S. pombe

 

CDS:  The table below lists the coordinates of the exons for F5I10.20

Exon Range
1 59982 - 60410
2 59272 - 59897
3 58711 - 59062
4 58534 - 58639
5 56387 - 57114
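
The exon coordinates above can be checked for internal consistency. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the exons are listed in transcript order on the minus strand; the names EXONS, exon_lengths and intron_lengths are illustrative and not part of the original annotation.

```python
# Exon coordinates for F5I10.20, copied from the table above.
# Assumption: 1-based, inclusive coordinates, listed in transcript order
# (descending, because the gene is on the minus strand).
EXONS = [
    (59982, 60410),
    (59272, 59897),
    (58711, 59062),
    (58534, 58639),
    (56387, 57114),
]


def exon_lengths(exons):
    """Length in bases of each exon (inclusive coordinates)."""
    return [end - start + 1 for start, end in exons]


def intron_lengths(exons):
    """Length of each intron lying between consecutive exons.

    For a minus-strand gene the next exon sits at lower coordinates,
    so the intron runs from the next exon's end + 1 to this exon's start - 1.
    """
    return [
        this_start - next_end - 1
        for (this_start, _), (_, next_end) in zip(exons, exons[1:])
    ]


cds_length = sum(exon_lengths(EXONS))
codons, remainder = divmod(cds_length, 3)

print("exon lengths:  ", exon_lengths(EXONS))
print("intron lengths:", intron_lengths(EXONS))
print("CDS length:    ", cds_length, "bases")
print("codons:        ", codons, "(including the termination codon)")
print("in frame:      ", remainder == 0)
```

Because the Position field runs from the initial methionine to the termination codon, the codon count includes the stop, so the expected protein length is one less than the number of codons. Exon and intron lengths together should also add up to the full span given in the Position field.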

 

Protein sequence of F5I10.20

MEIPVREERRSSSSSAGPLQQTISLAADDAIDSGPSSPLVVKVSVFETEHETTKLIHAPS
TLLGETTGDADFPPIQSFRDAKLVCVVETSKLWEIAAPIAFNILCNYGVNSFTSIFVGHI
GDLELSAVAIALSVVSNFSFGFLLGMASALETLCGQAFGAGQMDMLGVYMQRSWLILLGT
SVCLLPLYIYATPLLILLGQEPEIAEISGKFTTQIIPQMFALAINFPTQKFLQSQSKVGI
MAWIGFFALTLHIFILYLFINVFKWGLNGAAAAFDVSAWGIAIAQVVYVVGWCKDGWKGL
SWLAFQDVWPFLKLSFASAVMLCLEIWYFMTIIVLTGHLEDPVIAVGSLSIWVSNELGSG
HPRAAKYSVIVTVIESLVIGVVCAIVILITRDDFAVIFTESEEMRKAVADLAYLLGITMI
LNSLQPVISGVAVGGGWQAPVAYINLFCYYAFGLPLGFLLGYKTSLGVQGIWIGMICGTS
LQTLILLYMIYITNWNKEVPPLISAYAAAPGPKPVVDSKDDDHKEVTPCYEAETSHPLYM
SRHFVFPPTGQLENTSDLTEASLTGSHCKEGSDLSLKGLDLSDDFGGLEFSEDKGKKEEN
IYTTAMSSLDDERAIGGSHVYEPVEEPTEPVSPSDVTLDLNPIKDDEVANSPPSEEAWWK
RSVASLIAQAKETNTVWSICIAAAVMGIVILGQHWQQERWQILQQKWESSIGNEVFSLGS
LFFEPPWYDVSLNYVTTLFGCRKLED

 

Multiple sequence alignment of F5I10.20 and similar proteins. Every sequence in the alignment is a hypothetical protein; the S. pombe and S. cerevisiae proteins are described as putative membrane-associated proteins. The carboxyl-terminal 240 amino acids of F5I10.20 show similarity to a hypothetical protein from the plant S. stapfianus, for which only a partial sequence is available.

            1                                                         60
YscYDR338c  MAGILSKTLSEVHPSLRTNGMGIGNTHRRISLGFLPPNKKNPLVRKFRARTRNIDQRSFR 

            61                                                       120
YscYDR338c  SLTDDFGSNVHEPNPYLGNIDEEPDLYYHDEEDGELSRTISLPSRVSETPELSPQDVDWI 

            121                                                      180
  At81kb.4  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
  F5I10.20  ~~~~~~~~~~~~~~~~~~~~MEIPVREERRSSSSSAGPLQQTISLAADDAIDSGPSSPLV 
YspC11D3.6  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
YscYDR338c  LHEHERRYSSVCNSDNEEASQSNTPDRIQEYSGRELEYDEFMNRLQAQKQKLTRSAVTDA 

            181                                                      240
  At81kb.4  ~~~MAKDKDITETLLTAAEERSDLPFLSVDDIPPITTVGGFVREFNVETKKLWYLAGPAI 
  F5I10.20  VKVSVFETEHETTKLIHAPSTLLGETTGDADFPPIQSFRDAKLVCVVETSKLWEIAAPIA 
YspC11D3.6  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~MGRPLTEVKYLLINSAPVI 
YscYDR338c  KGTSHHRRPSFVSVTSRGSVPTIYQEIDENDSEALAELAHSHVTFKSEARVLASYSFPLI 

            241                                                      300
  At81kb.4  FTSVNQYSLGAITQVFAGHISTIALAAVSVENSVVAGFSFGIMLGMGSALETLCGQAFGA 
  F5I10.20  FNILCNYGVNSFTSIFVGHIGDLELSAVAIALSVVSNFSFGFLLGMASALETLCGQAFGA 
YspC11D3.6  LGYALQNSLQTSSVIVTGRLGPSELSVAAFAYMFAMSTGWLIALGGTTAFDTLGSNLWGA 
YscYDR338c  FTFLLEQIFPMVCSLTVGHLGKNELAAVSLASMTSNIT.LAIFEGIATSLDTLCPQAYGS 

            301                                                      360
  At81kb.4  GKLSMLGVYLQRSWVILNVTALILSLLYIFAAPILASIGQTAAISSAAGIFSIYMIPQIF 
  F5I10.20  GQMDMLGVYMQRSWLILLGTSVCLLPLYIYATPLLILLGQEPEIAEISGKFTTQIIPQMF 
YspC11D3.6  GKKQELGILLQTGFIVLSILYLPICLVWWYSKPILIFLHQTPELAEASQKFLRYLIPGGL 
YscYDR338c  GRFYSVGVHLQRCIAFSLVIYIPFAVMWWYSEPLLSYIIPEKELINLTSRFLRVLILGAP 

            361                                                      420
  At81kb.4  AYAINFPTAKFLQSQSKIMVMAVISAVALVIHVPLTWFVIVKLQWGMP..GLAVVLNASW 
  F5I10.20  ALAINFPTQKFLQSQSKVGIMAWIGFFALTLHIFILYLFINVFKWGLN..GAAAAFDVSA 
YspC11D3.6  GYVCFELLKKFLQTQEITRAGSYILLVTSPLNVALNFLLV..HYYGLGLKGAPLATGLSY 
YscYDR338c  AYIFFENLKRFLQAQGIFDAGIYVLTICAPLNVLVSYTLVWNKYIGVGFIGAAIAVVLNF 

            421                                                      480
  At81kb.4  CFIDMAQLVYIFSGTCGEAWSGFSWEAFHNLWSFVRLSLASAVMLCLEVWYFMAIILFAG 
  F5I10.20  WGIAIAQVVYVV.GWCKDGWKGLSWLAFQDVWPFLKLSFASAVMLCLEIWYFMTIIVLTG 
YspC11D3.6  WLSFILLTQYAKYVKGAEAWNGWNKRCLENFGPFVKLSLLGIVMVGTEWWAFEIVALVAG 
YscYDR338c  WLMFFLLLFYALYIDGRKCWGGFSRKAFTHWNDLGHLAFSGIIMLEAEELSYELLTLFSA 

            481                                                      540
  At81kb.4  YLKNAEISVAALSICMNILGWTAMIAIGMNTAVSVRVSNELGANHPRTAKFSLLVAVITS 
  F5I10.20  HLEDPVIAVGSLSI.....................WVSNELGSGHPRAAKYSVIVTVIES 
YspC11D3.6  KL..GAVPLAAQSVIMTTDQLLNTIPFGLGIITSNRVAYYLGAGLPDNASLTAKVAAIVG 
YscYDR338c  YY..GVSYLAAQSAVSTMAALLYMIPFAIGISTSTRIANFIGAKRTDFAHISSQVGLSFS 

            541                                                      600
  At81kb.4  TLIGFIVSMILLIFRDQYPSLFVKDEKVIILVKELTPILALSIVINNVQPVLSGVAVGAG 
  F5I10.20  LVIGVVCAIVILITRDDFAVIFTESEEMRKAVADLAYLLGITMILNSLQPVISGVAVGGG 
YspC11D3.6  VAVGSVIMITMIAVRNIYGRIFTNDPDVIQLVALVMPLVAAFQISDSLNGTMGGALRGTG 
YscYDR338c  FIAGFINCCILVFGRNLIANIYSKDPEVIKLIAQVLPLVGIVQNFDSLNAVAGSCLRGQG 

            601                                                      660
  At81kb.4  WQAVVAYVNIACYYVFGIPFGLLLGYKLNYGVMGIWCGMLTGTVVQTIVLTWMICKTNWD 
  F5I10.20  WQAPVAYINLFCYYAFGLPLGFLLGYKTSLGVQGIWIGMICGTSLQTLILLYMIYITNWN 
YspC11D3.6  RQKVGAIVNITAYYLFALPLGIYLA.FHGKGLVGLWIGQVIALSIVGILELKIVMATDWI 
YscYDR338c  MQSLGSIVNLMAYYLFGIPLALILSWFFDMKLYGLWIGIGSAMLLIGLVEAYYVLFPDWD 
  Sstap-hp  ~~~~~~~~~~~~SLLRASHRDLLPPGEPLRPPPSSGVRFKDMADSEKEVVEGTTTPRGAD

            661                                                      720
  At81kb.4  TEASMAEDRIREWGGEVSEIKQLIN*~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
  F5I10.20  KEVPPLISAYAAAPGPKPVVDSKDDDHKEVTPCYEAETSHPLYMSRHFVFPPTGQ....L 
YspC11D3.6  S.QSRKAISRFGDSSELTALLN*~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
YscYDR338c  KIMTYAEILKETEDDEVDSDEYLTDSDDPDENTALLGA*~~~~~~~~~~~~~~~~~~~~~ 
  Sstap-hp  WEVVTLTSAYAAAPGPEGRPTASNPSD.............PLLMSDHFVFPPSEHENLPI

            721                                                      780
  F5I10.20  ENTSDLTEASL.TGSHCKEGSDLSLKGLDLSDDFGGLEFSEDKGKKEENIYTTAMSSLDD 
  Sstap-hp  QTNFDETEKDVQDASTSVEDYSFKNMGAKNDTGPERIEF.YDEGR...NLSVDDIEMRED

            781                                                      840
  F5I10.20  ERAIG......GSHVYEPVEEPTEPVSPSDVTLDLNPIKDD.EVANSPPSEEAWWKRSVA 
  Sstap-hp  APEYGSVPAEDGGHGFVAHDDGIDAGGESDEKLDQPPKSADCKSGGAGASCKCWLKKHMT

            841                                                      900
  F5I10.20  SLIAQAKETNTVWSICIAAAVMGIVILGQHWQQERWQILQQKWESSIGNEVFSLGSLFFE 
  Sstap-hp  CLYHQAKETNAIWSVVVVAALVGIVILG.HWHKDKLHINPLKWRSGSAVRG~~~~~~~~~

            901                 923
  F5I10.20  PPWYDVSLNYVTTLFGCRKLED*
  Sstap-hp  ~~~~~~~~~~~~~~~~~~~~~~~
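
The similarity between two rows of the alignment can be quantified by counting identical columns. The sketch below is one simple way to do this, not the method used to produce the alignment. It assumes that "." marks an internal gap, "~" marks padding beyond the end of a sequence, and "*" marks the stop; the function name identity is illustrative. The two rows are the F5I10.20 and Sstap-hp lines from the 841-900 block.

```python
# Characters that are not aligned residues (assumed meaning, see lead-in):
# '.' internal gap, '~' padding past the end of a sequence, '*' stop.
NON_RESIDUE = set(".~*")


def identity(row_a, row_b):
    """Return (identical, compared) for two aligned rows.

    Only columns in which both rows carry a residue are compared.
    The rows are assumed to have equal width; zip() stops at the shorter one.
    """
    identical = 0
    compared = 0
    for a, b in zip(row_a, row_b):
        if a in NON_RESIDUE or b in NON_RESIDUE:
            continue
        compared += 1
        if a == b:
            identical += 1
    return identical, compared


# Rows copied from the 841-900 block of the alignment.
f5i10_20 = "SLIAQAKETNTVWSICIAAAVMGIVILGQHWQQERWQILQQKWESSIGNEVFSLGSLFFE"
sstap_hp = "CLYHQAKETNAIWSVVVVAALVGIVILG.HWHKDKLHINPLKWRSGSAVRG~~~~~~~~~"

identical, compared = identity(f5i10_20, sstap_hp)
print(f"{identical} of {compared} compared columns are identical")
```

Counting exact matches only is a deliberately crude measure; it ignores conservative substitutions, which a scoring matrix would credit.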

 


written 8 Jan 98
Larry Parnell