| Gene | T17A5.7 |
| Putative Identification | vacuolar sorting protein |
| Position | 6861 to 12520, from the initial methionine to the termination codon |
| Strand | - |
| EST match | F14169 |
| Database match | Mouse
maternal-embryonic 3 and S. cerevisiae VPS35, vacuolar protein sorting |
Note: T17A5 from position 12521 to 17000 has no matches of any kind to any sequence in the database. Within this region exons are predicted which would extend the amino-terminal portion of the protein (see Protein Sequence below.)
CDS: The table below lists the coordinates of the T17A5.7 exons and which exon prediction algorithm selcted the 3' and 5' termini (GF = Genefinder, GS = GenScan, Gr = Grail, M = MZEF). Exon termini desginated by EST were selected based on an exact match to EST F14169, and those marked by est were confirmed by comparison to EST T04572, which is very similar but not exact to T17A5.
| Exon | Range | 3' | 5' |
|---|---|---|---|
| 1 | 12428 to 12520 | GF,Gr | GF,Gr |
| 2 | 12170 to 12266 | GF,GS,Gr | GF,GS,Gr,M |
| 3 | 11941 to 12064 | GF,GS,Gr | GF,GS,Gr,M |
| 4 | 11519 to 11704 | GF,GS,Gr | GF,GS,Gr |
| 5 | 11243 to 11330 | GF,GS,M | GF,GS,Gr,M |
| 6 | 10990 to 11046 | GF,Gr | GF |
| 7 | 10806 to 10895 | GF,GS,Gr,M | GS,Gr,M |
| 8 | 10593 to 10715 | GF,GS,Gr,M | GF,GS,Gr,M |
| 9 | 10314 to 10448 | GF,GS,Gr,M | GF,GS,Gr,M |
| 10 | 9842 to 9958 | GF,GS,M | GF,GS,Gr,M |
| 11 | 9179 to 9418 | GF,GS,Gr | GF,GS,Gr,M |
| 12 | 9028 to 9093 | GF,Gr,M | GF,Gr,M |
| 13 | 8822 to 8911 | GF,GS,Gr | GF,GS,Gr |
| 14 | 8480 to 8569 | GF,Gr,M | GF,Gr,M |
| 15 | 8201 to 8290 | GF,GS,Gr,M | GF,GS,Gr,M |
| 16 | 7887 to 7955 | est,GF,GS,Gr,M | GF,GS,Gr,M |
| 17 | 7732 to 7812 | est,GF,GS,Gr | est,GF,GS,Gr,M |
| 18 | 7521 to 7628 | GF,GS | est,GF,GS |
| 19 | 7328 to 7461 | GF,GS,Gr,M | GF,GS |
| 20 | 7142 to 7255 | EST,GF,GS,Gr,M | GF,GS,Gr,M |
| 21 | 6861 to 7062 | GF,Gr | EST,GF,GS,Gr |
Complete CDS of T17A5.7
ATGATCGCAGACGGATCAGAAGATGAAGAGAAATGGCTCGCCGCCGGTGCTGCTGCTTTC AAGCAAAACGCATTTTACATGCAACGCGCTATTGACTCGAATAATCTGAAAGATGCTCTC AAGTATTCGGCTCAGATGCTAAGCGAGTTACGGACTTCGAAGCTATCACCTCACAAGTAC TATGATCTATATATGAGAGCTTTTGATGAATTGAGGAAACTTGAGATTTTCTTTATGGAA GAAACTCGTCGTGGCTGCTCAGTCATTGAACTCTATGAGCTGGTTCAGCATGCTGGTAAC ATATTACCGCGTTTGTATCTCCTATGTACAGCAGGATCCGTGTATATCAAAACCAAGGAA GCTCCTGCCAAGGAAATTCTTAAAGATCTTGTTGAGATGTGCCGTGGGATTCAGCATCCT CTACGTGGTCTCTTCTTAAGAAGTTACCTTGCGCAGATTAGTCGAGATAAATTACCTGAC ATTGGTTCTGAGTATGAAGGAGATGCTGATACAGTCATAGATGCTGTGGAGTTTGTACTA CTGAACTTTACTGAGATGAATAAACTCTGGGTCAGAATGCAACATCAGGGACCTGCTCGA GAAAAGGAGAGACGGGAGAAAGAGAGGGGCGAGCTTCGTGACCTTGTTGGAAAGAACCTT CACGTGCTGAGTCAGTTAGAAGGTGTGGACCTTGATATGTACAGAGATACAGTTCTTCCT AGAGTCTTAGAGCAGATTGTGAACTGCAGAGATGAGATTGCCCAATATTACCTAATAGAC TGTATAATTCAAGTTTTTCCTGACGAGTATCACTTGCAGACTCTAGATGTACTTCTTGGG GCGTGTCCTCAACTTCAGGCATCAGTTGACATCATGACAGTGCTTTCCCGTTTAATGGAG AGGCTGTCAAATTATGCTGCCTTAAATGCGGAAGTATTACCTTATTTCCTGCAAGTGGAA GCTTTCTCAAAGTTGAATAATGCAATTGGAAAGGTGATAGAAGCACAAGAAGACATGCCT ATTCTGAGTGCAGTAACCCTATATTCCTCCCTTCTCAAGTTTACTCTTCACGTTCACCCT GATCGGCTTGATTATGCGGACCAAGTGTTGGGATCATGTGTTAAGCAACTGTCCGGAAAA GGAAAGATTGATGACACTCGTGCAACAAAGGAGCTTGTCTCGCTTTTAAGTGCTCCCTTA GAGAAGTATAATGATGTTGTCACCGCCCTTAAACTAACTAACTATCCCCTCGTGGTGGAG TACCTTGATACCGAAACAAAGAGAATAATGGCTACTGTTATAGTTCGAAGCATTATGAAA AACAATACTCTTATTACTACAGCGGAGAAGGTTGAAGCATTGTTTGAACTGATTAAAGGA ATTATCAACGATTTGGATGAGCCGCAAGGTCTTGAGGTTGATGAAGATGATTTTCAGGAG GAGCAGAATTCTGTTGCGCTTCTCATTCATATGTTATATAATGATGACCCAGAAGAGATG TTTAAGATAGTCAATGTCCTGAAGAAGCATTTCCTGACAGGAGGGCCAAAGCGCTTAAAA TTCACCATTCCTCCCCTTGTTGTTTCTACTCTAAAGCTAATCAGGCGATTGCCAGTGGAA GGAGACAATCCTTTTGGAAAAGAGGCTTCCGTTACTGCTACTAAAATATTCCAATTTCTA AATCAGATTATCGAAGCGCTACCTAATGTTCCATCACCTGACCTGGCATTCCGGTTGTAC TTGCAATGTGCTGAGGCTGCGGATAAGTGTGATGAAGAACCAATTGCATACGAATTTTTC ACCCAGGCATACATCTTATACGAAGAAGAAATTTCGGACTCAAAGGCCCAGGTGACTGCG TTACAACTTATAATTGGAACTCTGCAGAGGATGCAAGTATTTGGTGTTGAGAATAGAGAT ACATTAACGCACAAGGCTACGGGGGCTGACAAGGGAAAACTTATACTCTTGCAGTATGCA GCGAAACTTCTAAAGAAACCTGATCAATGTCGAGCTGTTTATGCCTGTTCTCATTTGTTC TGGCTGGAAGATCGTGAGACCATACAAGATGGAGAAAGGGTTCTACTTTGTCTGAAACGA GCGCTTAAAATTGCAAATTCGGCTCAACAAGTGGCCAACACAGCTCGGGGTAGTACAGGG TCTGTTACCCTCTTCATCGAGATACTAAACAAGTACCTCTATTTCTATGAGAAAGGGGTT CCACAGATAACAGTTGAATCAGTAGAAAGCCTGATAAAACTGATCAAGAACGAAGAATCG ATGCCCTCTGATCCATCTGCTGAATCATTCTTTGCAACTACGCTTGAGTTCATGGAGTTC CAAAAGCAGAAAGAGGGTGCCATTGGTGAGAGATACCAGGCGATCAAAGTATAG
Protein sequence:
MIADGSEDEEKWLAAGAAAFKQNAFYMQRAIDSNNLKDALKYSAQMLSELRTSKLSPHKY YDLYMRAFDELRKLEIFFMEETRRGCSVIELYELVQHAGNILPRLYLLCTAGSVYIKTKE APAKEILKDLVEMCRGIQHPLRGLFLRSYLAQISRDKLPDIGSEYEGDADTVIDAVEFVL LNFTEMNKLWVRMQHQGPAREKERREKERGELRDLVGKNLHVLSQLEGVDLDMYRDTVLP RVLEQIVNCRDEIAQYYLIDCIIQVFPDEYHLQTLDVLLGACPQLQASVDIMTVLSRLME RLSNYAALNAEVLPYFLQVEAFSKLNNAIGKVIEAQEDMPILSAVTLYSSLLKFTLHVHP DRLDYADQVLGSCVKQLSGKGKIDDTRATKELVSLLSAPLEKYNDVVTALKLTNYPLVVE YLDTETKRIMATVIVRSIMKNNTLITTAEKVEALFELIKGIINDLDEPQGLEVDEDDFQE EQNSVALLIHMLYNDDPEEMFKIVNVLKKHFLTGGPKRLKFTIPPLVVSTLKLIRRLPVE GDNPFGKEASVTATKIFQFLNQIIEALPNVPSPDLAFRLYLQCAEAADKCDEEPIAYEFF TQAYILYEEEISDSKAQVTALQLIIGTLQRMQVFGVENRDTLTHKATGADKGKLILLQYA AKLLKKPDQCRAVYACSHLFWLEDRETIQDGERVLLCLKRALKIANSAQQVANTARGSTG SVTLFIEILNKYLYFYEKGVPQITVESVESLIKLIKNEESMPSDPSAESFFATTLEFMEF QKQKEGAIGERYQAIKV*
If other exons predicted by the programs in the range of 13400
to 16600 are
selected for the gene model, the first 31 amino acids
[MIADS..MQRAI] of
T17A5.7 can be replaced with
MYLSSKTVIKNRVPVSMSLRHMPTAPMPGVIQQCLKLASCNECSFNPSQEESQVYWIRKL WTLIVVSFIGLEIQGAKEEVKKDRKHKRNEKDRKDRDNEAGRSRKHRHKRRRKDEGAIAS GKLVSSEVELLEKSCQTVELELQTSSQNSCDSTLHSNERPKQIQSQPLDETSIRTRLPDK GQEDPEDGVMMTSKDQKQRFSREMLDASQAATAPNESVGHSRVCQEKRIDPTFGSSREIT TKLNKEKKSVPSKDNRKVSKEKKMPSLSSCNPLEQEKPTSSHQETPGPSKLLCRKCPPSM AGQLLNLIENWAPDRVESKLTDSEDQEWWLFIKFGAKSPQVSNQKTNQGSSSM
Alignment of T17A5.7 to S. cerevisiae VPS35, Mouse MEM3, and C. elegans protein F59G1.3 The consensus sequence shows residue conservations in all four of the aligned sequences.
1 75
CeF59G1.3 MLPNVKREQIDEMKFPEAESWWNRTDTRQKWDEPPRKAEVNVDRCENLPSVQTRTERIDESRKEDEYCEVVVSTK
76 150
CeF59G1.3 KTYYSRRNGMKPGTLKNLRCYYQSAGIPLQLYVNCSPGNESEVIETRITETKQIGDGQYEVRIESRYKFLKTAKI
151 225
CeF59G1.3 LKARNRRQLSSIILSDAHETKFSTIAHSSLFPLLHLQLKMYENSGNTTDQEKFLDQSIRVVKAESFEMKRCLDKG
T17A5.7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~MIADGSEDEEKWLAAGAAAFKQNAFYMQRAIDSN
YscVPS35 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~MAYADSPENAIAVMNRCLSQH
226 300
MusMEM3 ~~~~~~~MPPICLESSGPLCCHQRVTMNFIWLFLMNCTTWKVYLTDEFA.KGERLADLYELVQYSGNIIPRLYLL
CeF59G1.3 KTMDALKHALQMLNEMRTAELSPKFYYRLYMDSMHELQCLEVNLVQEYAQEPAKLGNLYECVQYASAIIPRLYLL
T17A5.7 NLKDALKYSAQMLSELRTSKLSPHKYYDLYMRAFDELRKLEIFFMEE.TRRGCSVIELYELVQHAGNILPRLYLL
YscVPS35 KLMESLQHTSIMLTELRNPNLSPKKYYELYVIIFDSLTNLSTYLIENHPQN.HHLADLYELVQYTGNVVPRLYLM
consensus -----L--------------------------------------------LYE-VQ------PRLYL-
301 375
MusMEM3 ITVGVVYVKSFPQSRKDILKDLVEMCRGVQHPLRGLFLRNYLLQCTRNILPD................EGEPTDE
CeF59G1.3 VTIGGVFIKCGLGSRKEILKDLVEMCRGVQHPLRGLFLRNYLMQCTRSVLPDFPETEEMLVAHNDNLSKGTPKLK
T17A5.7 CTAGSVYIKTKEAPAKEILKDLVEMCRGIQHPLRGLFLRSYLAQISRDKLPDIGSEYEGDA..............
YscVPS35 ITVGTSYLTFNEAPKKEILKDMIEMCRGVQNPIRGLFLRYYLSQRTKELLPEDDPSFNS................
consensus -T-G-----------K-ILKD--EMCRG-Q-P-RGLFLR-YL-Q-----LP------------------------
376 450
MusMEM3 ETTGDISDSMDFVLLNFAEMNKLWVRMQHQGHSRDREKRERERQELRILVGTNLVALTLVSWRCKCGTLQQIVLT
CeF59G1.3 PRDGTVDDTIDFVLINFAEMNKLWVRMQHQGPSKEKEKREKDRMELRILVGTNLVRLAQLEALTEEMYVKD.VLP
T17A5.7 ...DTVIDAVEFVLLNFTEMNKLWVRMQHQGPAREKERREKERGELRDLVGKNLHVLSQLEGVDLDMY.RDTVLP
YscVPS35 ..........QFIMNNFIEMNKLWVRLQHQGPLRERETRTRERKELQILVGSQLVRLSQIIDDNFQMYKQD.ILP
consensus -----------F---NF-EMNKLWVR-QHQG-----E-R---R-EL--LVG--L--L----------------L-
451 525
MusMEM3 GILEQVVNCRDALAQEISMECIIQVFPDEFHLQTLNPFLRACAELHQNVNVKNIIIALIDRLALFAHREME.PG.
CeF59G1.3 SILEQIVSCRDPISQEYLMECVIQVFADDFHLATLTEFLNACGQLQQDVNIKILLIALVDRLALYTTSYNEGQP.
T17A5.7 RVLEQIVNCRDEIAQYYLIDCIIQVFPDEYHLQTLDVLLGACPQLQASVDIMTVLSRLMERLSNYAALNAEVLP.
YscVPS35 TILEQVIQCRDLVSQEYLLDVICQVFADEFHLKTLDTLLQTTLHLNPDVSINKIVLTLVDRLNDYVTRQLEDDPN
consensus --LEQ---CRD---Q--------QVF-D--HL-TL---L-----L---V--------L--RL--------E----
526 600
MusMEM3 .....IPAELKLFDIFSQQVATVIQSRRDMPSEDVVSLQVSLINLAMKCYPDRVDYVDKVLETTVEIFNKLNLEH
CeF59G1.3 .....APTKMQLFEIFSEQATTLIKNRPDMPLDDIVALHVSLVSLAVKCYPDRQDYANMTFQGLRQVIEEKGVTD
T17A5.7 .....YFLQVEAFSKLNNAIGKVIEAQEDMPILSAVTLYSSLLKFTLHVHPDRLDYADQVLGSCVKQLSGKGKID
YscVPS35 ATSTNAYLDMDVFGTFWDYLTVLNHERPDLSLQQFIPLVESVIVLSLKWYPNNFDNLNKLFELVLQKTKDYGQKN
consensus ------------F---------------D--------L--S---------P---D--------------------
601 675
MusMEM3 IATSSA...........VSKELTRLLKIP..................VDTYNNILTVLKLKHFHPLFEYF.....
CeF59G1.3 IEAFGK...........VGRELTKLLNIP..................IDEYKNVLRLSQLPEYIKVMNYF.....
T17A5.7 ...DTR...........ATKELVSLLSAP..................LEKYNDVVTALKLTNYPLVVEYL.....
YscVPS35 ISLESEHLFLVLLSFQNSKLQLTSSTTAPPNSPVTSKKHFIFQLISQCQAYKNILALQSISLQKKVVNEIIDILM
consensus ---------------------L------P---------------------Y------------------------
676 750
MusMEM3 ..DYESSPGKSMSCYVLSNVLDYNTEIVSQDQVDSIMNLVSTLIQDQPDQPVEDPDPEDFADE............
CeF59G1.3 ..DYRGQ..CNIASYMIQNMLEEETVFRNQDDVDSAFSLISSLLKDQEKQSSDSHETEEFADE............
T17A5.7 ..DTETK..RIMATVIVRSIMKNNTLITTAEKVEALFELIKGIINDLDEPQGLEVDEDDFQEE............
YscVPS35 DREVEEMADNDSESKLHPPGHSAYLVIEDKLQVQRLLSICEPLIISRSGPPANVASSDTNVDEVFFNRHDEEESW
consensus --------------------------------V-----------------------------E------------
751 825
MusMEM3 .....QSLVGRFIHLLRSD.............DPDQQYLILNTARKHFGAGGNQRIRFTLPPLVFAAYQLAFRYK
CeF59G1.3 .....QNLVARLLHLIRAD.............DVDSQFLLLNSARKTLGEGGRHRLRYTLPPIIFELYRLVLQFS
T17A5.7 .....QNSVALLIHMLYND.............DPEEMFKIVNVLKKHFLTGGPKRLKFTIPPLVVSTLKLIRRLP
YscVPS35 ILDPIQEKLAHLIHWIMNTTSRKQTMKNKIQFSLEAQLEILLLIKSSFIKGGIN.VKYTFPAIITNFWKLMRKCR
consensus -----Q-------H------------------------------------GG------T-P--------L-----
826 900
MusMEM3 ENSKWMTSGKR...........NARRYFHLPHQTISALIK.A..ELAELPLRLFLQGALAAGEIGFENHETVAYE
CeF59G1.3 DMKDEDDKWDA...........KIRKMFVCAMGTIGALVSTA..ELAELPMKLYLNGAITADRVPFEDNHTVVYE
T17A5.7 VEGDNPFGKEA.........SVTATKIFQFLNQIIEALPNVP..S.PDLAFRLYLQCAEAADKC...DEEPIAYE
YscVPS35 MIQEYLLKKRPDNKTLLSHYSNLLKQMFKFVSRCINDIFNSCNNSCTDLILKLNLQCAILAEQLQLNE...ISYD
consensus ---------------------------F------I-------------L---L-L--A--A------------Y-
901 975
MusMEM3 FMSQAFSLYEDEISDSKAQLAAITLIIGTFERMKCFSEENHEPLRTECALA..........ASKLLKKPDQAERE
CeF59G1.3 FVSKALSILEDDVVDSRDRVRCLHLTVGTLLKTTHLPEENWQPLANQTVLA..........AAKMFKKPDQVRSL
T17A5.7 FFTQAYILYEEEISDSKAQVTALQLIIGTLQRMQVFGVENRDTLTHKATGADKGKLILLQYAAKLLKKPDQCRAV
YscVPS35 FFSQAFTIFEESLSDSKTQLQALIYIAQSLQKTRSLYKE.........AYYDSLIVRCTLHGSKLLKKQDQCRAV
consensus F---A----E----DS----------------------E------------------------K--KK-DQ----
976 1050
MusMEM3 HMCTSL.W.SGRNTDKNGE....E.LHGGKRVMECLKKALKIANQCMDPSLQVQLFIEILNRYIYFYEKENDAVT
CeF59G1.3 VTVAALYW.HGQTLETNGE....K.MKNGKKVVDILRKAAKIARECLEPLVQQQLFIQLLSAYTYYYEDNCSEVN
T17A5.7 YACSHLFWLEDRETIQDGE....RVLLCLKRALKIANSAQQVANTARGSTGSVTLFIEILNKYLYFYEKGVPQIT
YscVPS35 YLCSHLWWATEISNIGEEEGITDNFYRDGKRVLECLQRSLRVADSIMDNEQSCELMVEILNRCLYYFIHGDESET
consensus -----L-W----------E----------K------------A-----------L----L----Y----------
1051 1125
MusMEM3 IQVLNQLIQKIREDLPNLESSEETEQINKHFHNTLEHLRTRRESPESEGPIYEGLIL*~~~~~~~~~~~~~~~~~
CeF59G1.3 VDHIEELIARTQDNAVQLDVSAEADSLEKQLGEAIRRLQLAKLDVAAAQATTIRSEPELPQPPS*~~~~~~~~~~
T17A5.7 VESVESLIKLIKNE....ESMPSDPSAESFFATTLEFMEFQKQKEGAIGERYQAIKV*~~~~~~~~~~~~~~~~~
YscVPS35 HISIKYINGLIELIKTNLKSLKLEDNSASMITNSISDLHITGENNVKASSNADDGSVITDKESNVAIGSDGTYIQ
consensus ---------------------------------------------------------
1126 1183
YscVPS35 LNTLNGSSTLIRGVVATASGSKLLHQLKYIPIHHFRRTCEYIESQREVDDRFKVIYV*
created 29 Oct 97
Larry Parnell