Gene T17A5.7
Putative Identification vacuolar sorting protein
Position 6861 to 12520, from the initial methionine to the termination codon
Strand -
EST match F14169
Database match Mouse maternal-embryonic 3 and
S. cerevisiae VPS35, vacuolar protein sorting

 

Note:  T17A5 from position 12521 to 17000 has no matches of any kind to any sequence in the database. Within this region exons are predicted which would extend the amino-terminal portion of the protein (see Protein Sequence below.)

 

CDS:  The table below lists the coordinates of the T17A5.7 exons and which exon prediction algorithm selcted the 3' and 5' termini (GF = Genefinder, GS = GenScan, Gr = Grail, M = MZEF). Exon termini desginated by EST were selected based on an exact match to EST F14169, and those marked by est were confirmed by comparison to EST T04572, which is very similar but not exact to T17A5.

Exon Range 3' 5'
1 12428 to 12520 GF,Gr GF,Gr
2 12170 to 12266 GF,GS,Gr GF,GS,Gr,M
3 11941 to 12064 GF,GS,Gr GF,GS,Gr,M
4 11519 to 11704 GF,GS,Gr GF,GS,Gr
5 11243 to 11330 GF,GS,M GF,GS,Gr,M
6 10990 to 11046 GF,Gr GF
7 10806 to 10895 GF,GS,Gr,M GS,Gr,M
8 10593 to 10715 GF,GS,Gr,M GF,GS,Gr,M
9 10314 to 10448 GF,GS,Gr,M GF,GS,Gr,M
10 9842 to 9958 GF,GS,M GF,GS,Gr,M
11 9179 to 9418 GF,GS,Gr GF,GS,Gr,M
12 9028 to 9093 GF,Gr,M GF,Gr,M
13 8822 to 8911 GF,GS,Gr GF,GS,Gr
14 8480 to 8569 GF,Gr,M GF,Gr,M
15 8201 to 8290 GF,GS,Gr,M GF,GS,Gr,M
16 7887 to 7955 est,GF,GS,Gr,M GF,GS,Gr,M
17 7732 to 7812 est,GF,GS,Gr est,GF,GS,Gr,M
18 7521 to 7628 GF,GS est,GF,GS
19 7328 to 7461 GF,GS,Gr,M GF,GS
20 7142 to 7255 EST,GF,GS,Gr,M GF,GS,Gr,M
21 6861 to 7062 GF,Gr EST,GF,GS,Gr

Complete CDS of T17A5.7

ATGATCGCAGACGGATCAGAAGATGAAGAGAAATGGCTCGCCGCCGGTGCTGCTGCTTTC
AAGCAAAACGCATTTTACATGCAACGCGCTATTGACTCGAATAATCTGAAAGATGCTCTC
AAGTATTCGGCTCAGATGCTAAGCGAGTTACGGACTTCGAAGCTATCACCTCACAAGTAC
TATGATCTATATATGAGAGCTTTTGATGAATTGAGGAAACTTGAGATTTTCTTTATGGAA
GAAACTCGTCGTGGCTGCTCAGTCATTGAACTCTATGAGCTGGTTCAGCATGCTGGTAAC
ATATTACCGCGTTTGTATCTCCTATGTACAGCAGGATCCGTGTATATCAAAACCAAGGAA
GCTCCTGCCAAGGAAATTCTTAAAGATCTTGTTGAGATGTGCCGTGGGATTCAGCATCCT
CTACGTGGTCTCTTCTTAAGAAGTTACCTTGCGCAGATTAGTCGAGATAAATTACCTGAC
ATTGGTTCTGAGTATGAAGGAGATGCTGATACAGTCATAGATGCTGTGGAGTTTGTACTA
CTGAACTTTACTGAGATGAATAAACTCTGGGTCAGAATGCAACATCAGGGACCTGCTCGA
GAAAAGGAGAGACGGGAGAAAGAGAGGGGCGAGCTTCGTGACCTTGTTGGAAAGAACCTT
CACGTGCTGAGTCAGTTAGAAGGTGTGGACCTTGATATGTACAGAGATACAGTTCTTCCT
AGAGTCTTAGAGCAGATTGTGAACTGCAGAGATGAGATTGCCCAATATTACCTAATAGAC
TGTATAATTCAAGTTTTTCCTGACGAGTATCACTTGCAGACTCTAGATGTACTTCTTGGG
GCGTGTCCTCAACTTCAGGCATCAGTTGACATCATGACAGTGCTTTCCCGTTTAATGGAG
AGGCTGTCAAATTATGCTGCCTTAAATGCGGAAGTATTACCTTATTTCCTGCAAGTGGAA
GCTTTCTCAAAGTTGAATAATGCAATTGGAAAGGTGATAGAAGCACAAGAAGACATGCCT
ATTCTGAGTGCAGTAACCCTATATTCCTCCCTTCTCAAGTTTACTCTTCACGTTCACCCT
GATCGGCTTGATTATGCGGACCAAGTGTTGGGATCATGTGTTAAGCAACTGTCCGGAAAA
GGAAAGATTGATGACACTCGTGCAACAAAGGAGCTTGTCTCGCTTTTAAGTGCTCCCTTA
GAGAAGTATAATGATGTTGTCACCGCCCTTAAACTAACTAACTATCCCCTCGTGGTGGAG
TACCTTGATACCGAAACAAAGAGAATAATGGCTACTGTTATAGTTCGAAGCATTATGAAA
AACAATACTCTTATTACTACAGCGGAGAAGGTTGAAGCATTGTTTGAACTGATTAAAGGA
ATTATCAACGATTTGGATGAGCCGCAAGGTCTTGAGGTTGATGAAGATGATTTTCAGGAG
GAGCAGAATTCTGTTGCGCTTCTCATTCATATGTTATATAATGATGACCCAGAAGAGATG
TTTAAGATAGTCAATGTCCTGAAGAAGCATTTCCTGACAGGAGGGCCAAAGCGCTTAAAA
TTCACCATTCCTCCCCTTGTTGTTTCTACTCTAAAGCTAATCAGGCGATTGCCAGTGGAA
GGAGACAATCCTTTTGGAAAAGAGGCTTCCGTTACTGCTACTAAAATATTCCAATTTCTA
AATCAGATTATCGAAGCGCTACCTAATGTTCCATCACCTGACCTGGCATTCCGGTTGTAC
TTGCAATGTGCTGAGGCTGCGGATAAGTGTGATGAAGAACCAATTGCATACGAATTTTTC
ACCCAGGCATACATCTTATACGAAGAAGAAATTTCGGACTCAAAGGCCCAGGTGACTGCG
TTACAACTTATAATTGGAACTCTGCAGAGGATGCAAGTATTTGGTGTTGAGAATAGAGAT
ACATTAACGCACAAGGCTACGGGGGCTGACAAGGGAAAACTTATACTCTTGCAGTATGCA
GCGAAACTTCTAAAGAAACCTGATCAATGTCGAGCTGTTTATGCCTGTTCTCATTTGTTC
TGGCTGGAAGATCGTGAGACCATACAAGATGGAGAAAGGGTTCTACTTTGTCTGAAACGA
GCGCTTAAAATTGCAAATTCGGCTCAACAAGTGGCCAACACAGCTCGGGGTAGTACAGGG
TCTGTTACCCTCTTCATCGAGATACTAAACAAGTACCTCTATTTCTATGAGAAAGGGGTT
CCACAGATAACAGTTGAATCAGTAGAAAGCCTGATAAAACTGATCAAGAACGAAGAATCG
ATGCCCTCTGATCCATCTGCTGAATCATTCTTTGCAACTACGCTTGAGTTCATGGAGTTC
CAAAAGCAGAAAGAGGGTGCCATTGGTGAGAGATACCAGGCGATCAAAGTATAG

 

Protein sequence:

MIADGSEDEEKWLAAGAAAFKQNAFYMQRAIDSNNLKDALKYSAQMLSELRTSKLSPHKY
YDLYMRAFDELRKLEIFFMEETRRGCSVIELYELVQHAGNILPRLYLLCTAGSVYIKTKE
APAKEILKDLVEMCRGIQHPLRGLFLRSYLAQISRDKLPDIGSEYEGDADTVIDAVEFVL
LNFTEMNKLWVRMQHQGPAREKERREKERGELRDLVGKNLHVLSQLEGVDLDMYRDTVLP
RVLEQIVNCRDEIAQYYLIDCIIQVFPDEYHLQTLDVLLGACPQLQASVDIMTVLSRLME
RLSNYAALNAEVLPYFLQVEAFSKLNNAIGKVIEAQEDMPILSAVTLYSSLLKFTLHVHP
DRLDYADQVLGSCVKQLSGKGKIDDTRATKELVSLLSAPLEKYNDVVTALKLTNYPLVVE
YLDTETKRIMATVIVRSIMKNNTLITTAEKVEALFELIKGIINDLDEPQGLEVDEDDFQE
EQNSVALLIHMLYNDDPEEMFKIVNVLKKHFLTGGPKRLKFTIPPLVVSTLKLIRRLPVE
GDNPFGKEASVTATKIFQFLNQIIEALPNVPSPDLAFRLYLQCAEAADKCDEEPIAYEFF
TQAYILYEEEISDSKAQVTALQLIIGTLQRMQVFGVENRDTLTHKATGADKGKLILLQYA
AKLLKKPDQCRAVYACSHLFWLEDRETIQDGERVLLCLKRALKIANSAQQVANTARGSTG
SVTLFIEILNKYLYFYEKGVPQITVESVESLIKLIKNEESMPSDPSAESFFATTLEFMEF
QKQKEGAIGERYQAIKV*

If other exons predicted by the programs in the range of 13400 to 16600 are
selected for the gene model, the first 31 amino acids [MIADS..MQRAI] of
T17A5.7 can be replaced with

MYLSSKTVIKNRVPVSMSLRHMPTAPMPGVIQQCLKLASCNECSFNPSQEESQVYWIRKL
WTLIVVSFIGLEIQGAKEEVKKDRKHKRNEKDRKDRDNEAGRSRKHRHKRRRKDEGAIAS
GKLVSSEVELLEKSCQTVELELQTSSQNSCDSTLHSNERPKQIQSQPLDETSIRTRLPDK
GQEDPEDGVMMTSKDQKQRFSREMLDASQAATAPNESVGHSRVCQEKRIDPTFGSSREIT
TKLNKEKKSVPSKDNRKVSKEKKMPSLSSCNPLEQEKPTSSHQETPGPSKLLCRKCPPSM
AGQLLNLIENWAPDRVESKLTDSEDQEWWLFIKFGAKSPQVSNQKTNQGSSSM

 

Alignment of T17A5.7 to S. cerevisiae VPS35, Mouse MEM3, and C. elegans protein F59G1.3 The consensus sequence shows residue conservations in all four of the aligned sequences.

           1                                                                        75
CeF59G1.3  MLPNVKREQIDEMKFPEAESWWNRTDTRQKWDEPPRKAEVNVDRCENLPSVQTRTERIDESRKEDEYCEVVVSTK 

           76                                                                      150
CeF59G1.3  KTYYSRRNGMKPGTLKNLRCYYQSAGIPLQLYVNCSPGNESEVIETRITETKQIGDGQYEVRIESRYKFLKTAKI 

           151                                                                     225
CeF59G1.3  LKARNRRQLSSIILSDAHETKFSTIAHSSLFPLLHLQLKMYENSGNTTDQEKFLDQSIRVVKAESFEMKRCLDKG 
  T17A5.7  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~MIADGSEDEEKWLAAGAAAFKQNAFYMQRAIDSN 
 YscVPS35  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~MAYADSPENAIAVMNRCLSQH 

           226                                                                     300
  MusMEM3  ~~~~~~~MPPICLESSGPLCCHQRVTMNFIWLFLMNCTTWKVYLTDEFA.KGERLADLYELVQYSGNIIPRLYLL 
CeF59G1.3  KTMDALKHALQMLNEMRTAELSPKFYYRLYMDSMHELQCLEVNLVQEYAQEPAKLGNLYECVQYASAIIPRLYLL 
  T17A5.7  NLKDALKYSAQMLSELRTSKLSPHKYYDLYMRAFDELRKLEIFFMEE.TRRGCSVIELYELVQHAGNILPRLYLL 
 YscVPS35  KLMESLQHTSIMLTELRNPNLSPKKYYELYVIIFDSLTNLSTYLIENHPQN.HHLADLYELVQYTGNVVPRLYLM 
consensus         -----L--------------------------------------------LYE-VQ------PRLYL-

           301                                                                     375
  MusMEM3  ITVGVVYVKSFPQSRKDILKDLVEMCRGVQHPLRGLFLRNYLLQCTRNILPD................EGEPTDE 
CeF59G1.3  VTIGGVFIKCGLGSRKEILKDLVEMCRGVQHPLRGLFLRNYLMQCTRSVLPDFPETEEMLVAHNDNLSKGTPKLK 
  T17A5.7  CTAGSVYIKTKEAPAKEILKDLVEMCRGIQHPLRGLFLRSYLAQISRDKLPDIGSEYEGDA.............. 
 YscVPS35  ITVGTSYLTFNEAPKKEILKDMIEMCRGVQNPIRGLFLRYYLSQRTKELLPEDDPSFNS................ 
consensus  -T-G-----------K-ILKD--EMCRG-Q-P-RGLFLR-YL-Q-----LP------------------------

           376                                                                     450
  MusMEM3  ETTGDISDSMDFVLLNFAEMNKLWVRMQHQGHSRDREKRERERQELRILVGTNLVALTLVSWRCKCGTLQQIVLT 
CeF59G1.3  PRDGTVDDTIDFVLINFAEMNKLWVRMQHQGPSKEKEKREKDRMELRILVGTNLVRLAQLEALTEEMYVKD.VLP 
  T17A5.7  ...DTVIDAVEFVLLNFTEMNKLWVRMQHQGPAREKERREKERGELRDLVGKNLHVLSQLEGVDLDMY.RDTVLP 
 YscVPS35  ..........QFIMNNFIEMNKLWVRLQHQGPLRERETRTRERKELQILVGSQLVRLSQIIDDNFQMYKQD.ILP 
consensus  -----------F---NF-EMNKLWVR-QHQG-----E-R---R-EL--LVG--L--L----------------L-

           451                                                                     525
  MusMEM3  GILEQVVNCRDALAQEISMECIIQVFPDEFHLQTLNPFLRACAELHQNVNVKNIIIALIDRLALFAHREME.PG. 
CeF59G1.3  SILEQIVSCRDPISQEYLMECVIQVFADDFHLATLTEFLNACGQLQQDVNIKILLIALVDRLALYTTSYNEGQP. 
  T17A5.7  RVLEQIVNCRDEIAQYYLIDCIIQVFPDEYHLQTLDVLLGACPQLQASVDIMTVLSRLMERLSNYAALNAEVLP. 
 YscVPS35  TILEQVIQCRDLVSQEYLLDVICQVFADEFHLKTLDTLLQTTLHLNPDVSINKIVLTLVDRLNDYVTRQLEDDPN 
consensus  --LEQ---CRD---Q--------QVF-D--HL-TL---L-----L---V--------L--RL--------E----

           526                                                                     600
  MusMEM3  .....IPAELKLFDIFSQQVATVIQSRRDMPSEDVVSLQVSLINLAMKCYPDRVDYVDKVLETTVEIFNKLNLEH 
CeF59G1.3  .....APTKMQLFEIFSEQATTLIKNRPDMPLDDIVALHVSLVSLAVKCYPDRQDYANMTFQGLRQVIEEKGVTD 
  T17A5.7  .....YFLQVEAFSKLNNAIGKVIEAQEDMPILSAVTLYSSLLKFTLHVHPDRLDYADQVLGSCVKQLSGKGKID 
 YscVPS35  ATSTNAYLDMDVFGTFWDYLTVLNHERPDLSLQQFIPLVESVIVLSLKWYPNNFDNLNKLFELVLQKTKDYGQKN 
consensus  ------------F---------------D--------L--S---------P---D--------------------

           601                                                                     675
  MusMEM3  IATSSA...........VSKELTRLLKIP..................VDTYNNILTVLKLKHFHPLFEYF..... 
CeF59G1.3  IEAFGK...........VGRELTKLLNIP..................IDEYKNVLRLSQLPEYIKVMNYF..... 
  T17A5.7  ...DTR...........ATKELVSLLSAP..................LEKYNDVVTALKLTNYPLVVEYL..... 
 YscVPS35  ISLESEHLFLVLLSFQNSKLQLTSSTTAPPNSPVTSKKHFIFQLISQCQAYKNILALQSISLQKKVVNEIIDILM 
consensus  ---------------------L------P---------------------Y------------------------

           676                                                                     750
  MusMEM3  ..DYESSPGKSMSCYVLSNVLDYNTEIVSQDQVDSIMNLVSTLIQDQPDQPVEDPDPEDFADE............ 
CeF59G1.3  ..DYRGQ..CNIASYMIQNMLEEETVFRNQDDVDSAFSLISSLLKDQEKQSSDSHETEEFADE............ 
  T17A5.7  ..DTETK..RIMATVIVRSIMKNNTLITTAEKVEALFELIKGIINDLDEPQGLEVDEDDFQEE............ 
 YscVPS35  DREVEEMADNDSESKLHPPGHSAYLVIEDKLQVQRLLSICEPLIISRSGPPANVASSDTNVDEVFFNRHDEEESW 
consensus  --------------------------------V-----------------------------E------------

           751                                                                     825
  MusMEM3  .....QSLVGRFIHLLRSD.............DPDQQYLILNTARKHFGAGGNQRIRFTLPPLVFAAYQLAFRYK 
CeF59G1.3  .....QNLVARLLHLIRAD.............DVDSQFLLLNSARKTLGEGGRHRLRYTLPPIIFELYRLVLQFS 
  T17A5.7  .....QNSVALLIHMLYND.............DPEEMFKIVNVLKKHFLTGGPKRLKFTIPPLVVSTLKLIRRLP 
 YscVPS35  ILDPIQEKLAHLIHWIMNTTSRKQTMKNKIQFSLEAQLEILLLIKSSFIKGGIN.VKYTFPAIITNFWKLMRKCR 
consensus  -----Q-------H------------------------------------GG------T-P--------L-----

           826                                                                     900
  MusMEM3  ENSKWMTSGKR...........NARRYFHLPHQTISALIK.A..ELAELPLRLFLQGALAAGEIGFENHETVAYE 
CeF59G1.3  DMKDEDDKWDA...........KIRKMFVCAMGTIGALVSTA..ELAELPMKLYLNGAITADRVPFEDNHTVVYE 
  T17A5.7  VEGDNPFGKEA.........SVTATKIFQFLNQIIEALPNVP..S.PDLAFRLYLQCAEAADKC...DEEPIAYE 
 YscVPS35  MIQEYLLKKRPDNKTLLSHYSNLLKQMFKFVSRCINDIFNSCNNSCTDLILKLNLQCAILAEQLQLNE...ISYD 
consensus  ---------------------------F------I-------------L---L-L--A--A------------Y-

           901                                                                     975
  MusMEM3  FMSQAFSLYEDEISDSKAQLAAITLIIGTFERMKCFSEENHEPLRTECALA..........ASKLLKKPDQAERE 
CeF59G1.3  FVSKALSILEDDVVDSRDRVRCLHLTVGTLLKTTHLPEENWQPLANQTVLA..........AAKMFKKPDQVRSL 
  T17A5.7  FFTQAYILYEEEISDSKAQVTALQLIIGTLQRMQVFGVENRDTLTHKATGADKGKLILLQYAAKLLKKPDQCRAV 
 YscVPS35  FFSQAFTIFEESLSDSKTQLQALIYIAQSLQKTRSLYKE.........AYYDSLIVRCTLHGSKLLKKQDQCRAV 
consensus  F---A----E----DS----------------------E------------------------K--KK-DQ----

           976                                                                    1050
  MusMEM3  HMCTSL.W.SGRNTDKNGE....E.LHGGKRVMECLKKALKIANQCMDPSLQVQLFIEILNRYIYFYEKENDAVT 
CeF59G1.3  VTVAALYW.HGQTLETNGE....K.MKNGKKVVDILRKAAKIARECLEPLVQQQLFIQLLSAYTYYYEDNCSEVN 
  T17A5.7  YACSHLFWLEDRETIQDGE....RVLLCLKRALKIANSAQQVANTARGSTGSVTLFIEILNKYLYFYEKGVPQIT 
 YscVPS35  YLCSHLWWATEISNIGEEEGITDNFYRDGKRVLECLQRSLRVADSIMDNEQSCELMVEILNRCLYYFIHGDESET 
consensus  -----L-W----------E----------K------------A-----------L----L----Y----------

           1051                                                                   1125
  MusMEM3  IQVLNQLIQKIREDLPNLESSEETEQINKHFHNTLEHLRTRRESPESEGPIYEGLIL*~~~~~~~~~~~~~~~~~ 
CeF59G1.3  VDHIEELIARTQDNAVQLDVSAEADSLEKQLGEAIRRLQLAKLDVAAAQATTIRSEPELPQPPS*~~~~~~~~~~ 
  T17A5.7  VESVESLIKLIKNE....ESMPSDPSAESFFATTLEFMEFQKQKEGAIGERYQAIKV*~~~~~~~~~~~~~~~~~ 
 YscVPS35  HISIKYINGLIELIKTNLKSLKLEDNSASMITNSISDLHITGENNVKASSNADDGSVITDKESNVAIGSDGTYIQ 
consensus  ---------------------------------------------------------

           1126                                                  1183
 YscVPS35  LNTLNGSSTLIRGVVATASGSKLLHQLKYIPIHHFRRTCEYIESQREVDDRFKVIYV*

 


created 29 Oct 97
Larry Parnell