| This page contains the annotation of genes T10M13.17.1 and T10M13.18 |
|---|
| Gene | T10M13.17.1 |
| Putative Identification | hypothetical protein |
| Position | 86229 to 89782, from the initial methionine to the termination codon |
| Strand | + |
| EST match | none |
| Database match | several Arabidopsis hypothetical proteins |
CDS: The table below lists the coordinates of the T10M13.17.1 exons and which algorithms selected the given termini (GS = GenScan, Gr = GRAIL, M = MZEF, NPG = NetPlantGene - selects splice sites only, not exons).
| Exon | Range | 5' | 3' |
|---|---|---|---|
| 1 | 86229 - 86321 | GS, Gr | GS, Gr, NPG |
| 2 | 86906 - 89782 | GS, Gr, M, NPG | GS, Gr |
Alternate exons not used in building the gene model: GRAIL does not select exon 1 as an initial exon. Rather, a splice site is predicted that coincides with the start methionine predicted by GenScan and this joins T10M13.17 and T10M13.17.1. Other GRAIL exons extend from 86906 to 87358, from 87788 to 88327, from 88805 to 88915, from 89145 to 89349 and from 89598 to 89782. MZEF predicts internal exons from 86906 to 87433, from 89188 to 89349 and from 90227 to 90297. NetPlantGene predicts several putative splice sites in this region. Notable sites include splice donors at 87433 (confidence score = 0.86), at 88391 (0.93) and at 90297 (0.96).
Complete CDS of T10M13.17.1
ATGGGTCTTATCAACCAATGGTTTCCACTGGACACTCTTGCAACTCCAGAGCGTATCAGG ATGGCTTGTTTCGACGAGTTTACTTCTCCTTTTTATATGACCATGGATTCTGAGGGAGTG TTAATGCCGTTGGTCCATGAACATCTTATGATGCCTTGGAATGATCTAAGGAAAGGTGAT TGTTGTGGACTCTCGAAAGCTGTCAGTGTTGGTTATTATTGCAAAAGCTGCGATTTTTTC GCCCACAAGAAATGTGGCGAGTCCTCCGAGTTTATCCAACATCCATCTCACCCCAATCAC ACTCTTCAGCTTCGAAGTAGCGAAGGCTGTAATAGCTGCAATTTATGTGGAAGGACCATT AGTGACCTATTCTATCGTTGTGATCTCTGTGATTTCGACGTGGATCTATATTGTGCCAAG TATCCACCACCAGAGGTTATTGACATTCCTGAGACGCATCACCACAAGCTCAACCTTCAC AAGGAGTTGATCGAGTTCAACTGTGATGCTAAATGTGGGAAGATTACTGGTGCTGAGTTT CCTTACGTATGTTATGAATGTCAATTACCTTTCCATGTGGATTGCGTATGGCGGCCATCG GAGTTAAAACACCCATCAGAGGTAAACCATTCTTACCACTCCTTACACCCTCTTAAGCTC CTCTCAGGTCAATTACCGGATAATTGTGATGGAAAATGTCGTCTTTGCGCAAGAAAGATT GATGATAGATTGTTCTATCATTGTTCTCCGTGCAACTTCACCTTGGATCTACGTTGTGTT TTAAACCCACCACAACAATCTCTTCTGAATTTGAAAGCTCATGATCATCAACTCACCCTT CTCCCAAGACTCTTGTCATTTACATGTAATGCTTGCGGGTTGAATGGAGATCGAAGCCCT TACACTTGTGTTCAATGTGATTTCATGATTCATCAAGATTGTCTTGACTTACCACGCGTC ATAAACATTAATCAGCATGATCACCGTGTTTCTCGAACTTCTGTTCCTGGTGTTGGAAAT TCTAAGGGAGTGTTACTACCGTGGTTTCACAGGAAGCATATAATGATGCCTTGGAATGAT ATGAGGAAAGGAGATTGTTGTGGAAGTTTGGAATCTGTCACTGACGGCTTCTATTGCAAA AGCTGCGATATTTTCATCCACAAGAAATGTGGCGAGTCCTCCGGAATTATCGACCACTCA TCTCACCCCGATCACACTCTTGAGCTTAACCGTTATCCTAATAAAAGTTGCAATTTATGT GGAAGGTCCAAAGGTGTGAATGTATGCTATCGTTGTGATCACTGTTACTACCAATTGGAT CTATATTGTGCCAAGTACCCACCACCAGAGTTTATTGACATTCCCGAAACGCATCACCAC AAGCTCACCCTTCTCAAGGAGCGGATTGAGTTTGATTGTGATGCTAAATGTGGGGAGACT GGAGATGGGTTTGCTTACAAATGTCCTGAATGTGATTTATTCTTCCATGTGGATTGCGTA TGGCATCCGCCAGAAGTAAATCACCCTTTAGAGGTAAACCACTCTTACCACCCCTGGCAC CCTCTTAAGCTCCACACAGGTCAACCACCACACTATTCTGATGGAACATGTCGTCTTTGT GCAAGAAATATTGATGATAGATTGTTTTATCATTGCTCTCCGTGCAACTTCACATTGGAT CTGCGTTGTGTTTTAAATCCACCGCCACAATCTCTTTTGAATTTGAAAGCTCATGACCAC CAACTCATCCTTCTCCCAAGACTCCGTTCTTTTACATGTAACGCTTGCGGGTTGAGTGGT GATCGAAGCCCTTACATATGTATTCAATGTGACTTTATGATCCATCAAGATTGTCTTGAC CTTCCACGCCTCATAAACGTTAATCGTCATGATCACCGAGTTTCACGAACTTCTGTTCTT GGTGTTGTAAATTCTGTATGTGGAGTTTGTCATCAGAAGGTGGATTGGACTTGTGGAGGT TTTTCTTGTCAGAGGTGTTCTAGCTATGTTGTTCATTCGAAATGTGCTACCAGAAGAGAT GTATGGAACGGGAAAGAACTGGAAGGAGTACCTGAAGAAATAGAAGATATAGAGCCATAT GTGGTAATAGATGACAATACAATACAACATTTCAGCCACAAAGAGCATTACCTAAGACTC CACGTTAATGATGTTCTATGTAACGACAACAAGCGGTGCAAGGCATGCACCCATCCCATC TTTCTTCAATCCTTCTACAGTTGTATGGATTGTGACTTCCTTCTCCACCAAAATTGCGCT GGATTTCCTAGAATGAAAAGGCATGTGCTACACAATGAGCGTCTTACTTTAGTTACTAAC GAGACTAAACTCTTTCAGTGTGCTCCTTGTGATAGATGGTCCAATGGTTTCAGGTACCAA CATGGATATAAGTCACTCGATCTTCAGTGTGGTTCAATATCCGAGCCATTTGTCCATCCA AGCCATCCCGATCATCCCTTATATCACACTTTACTGGATGGAAGGAATGAGATTTGTGAT GGCTGCAAAAAGAGTTGGTATTATGTGCTAAGTTGCATTGAAGATGATTGTAGATTTGTC TTATGTTTCAAATGCCTCACTTTACCACAAGTGGTAAAGCATAGAGTTGACGATGATCCT CTCTTGTTATGCTATGGCGAAAAGGCTTGTGGTAAATACTGGTGTGATATTTGTGAAAAG GAAACCAATCCAGAGACATGGTTCTACACGTGTAAAGATCATCAATCTAGTATGCATACA ATGTGTGTGCTTGGTGACTCTGTAGGGATCATGCCAAAAAGCACAATAATGGGTTGGTAC AAATATTATGAGGTGGTGCTCAATAATAGTATATCTCGTCCAGTTTGCGAGATGTGCAAG TCGCGTTGCATTTTTCCTACGATCATGAAGATTCTTGAAGCCTCAAATGAATACGTTTGC TCTTTAGCATGCGTCCTAGAGAGGGGGTGA
Protein sequence
MGLINQWFPLDTLATPERIRMACFDEFTSPFYMTMDSEGVLMPLVHEHLMMPWNDLRKGD CCGLSKAVSVGYYCKSCDFFAHKKCGESSEFIQHPSHPNHTLQLRSSEGCNSCNLCGRTI SDLFYRCDLCDFDVDLYCAKYPPPEVIDIPETHHHKLNLHKELIEFNCDAKCGKITGAEF PYVCYECQLPFHVDCVWRPSELKHPSEVNHSYHSLHPLKLLSGQLPDNCDGKCRLCARKI DDRLFYHCSPCNFTLDLRCVLNPPQQSLLNLKAHDHQLTLLPRLLSFTCNACGLNGDRSP YTCVQCDFMIHQDCLDLPRVININQHDHRVSRTSVPGVGNSKGVLLPWFHRKHIMMPWND MRKGDCCGSLESVTDGFYCKSCDIFIHKKCGESSGIIDHSSHPDHTLELNRYPNKSCNLC GRSKGVNVCYRCDHCYYQLDLYCAKYPPPEFIDIPETHHHKLTLLKERIEFDCDAKCGET GDGFAYKCPECDLFFHVDCVWHPPEVNHPLEVNHSYHPWHPLKLHTGQPPHYSDGTCRLC ARNIDDRLFYHCSPCNFTLDLRCVLNPPPQSLLNLKAHDHQLILLPRLRSFTCNACGLSG DRSPYICIQCDFMIHQDCLDLPRLINVNRHDHRVSRTSVLGVVNSVCGVCHQKVDWTCGG FSCQRCSSYVVHSKCATRRDVWNGKELEGVPEEIEDIEPYVVIDDNTIQHFSHKEHYLRL HVNDVLCNDNKRCKACTHPIFLQSFYSCMDCDFLLHQNCAGFPRMKRHVLHNERLTLVTN ETKLFQCAPCDRWSNGFRYQHGYKSLDLQCGSISEPFVHPSHPDHPLYHTLLDGRNEICD GCKKSWYYVLSCIEDDCRFVLCFKCLTLPQVVKHRVDDDPLLLCYGEKACGKYWCDICEK ETNPETWFYTCKDHQSSMHTMCVLGDSVGIMPKSTIMGWYKYYEVVLNNSISRPVCEMCK SRCIFPTIMKILEASNEYVCSLACVLERG*
| Gene | T10M13.18 |
| Putative Identification | hypothetical protein |
| Position | 93637 to 95616, from the initial methionine to the termination codon |
| Strand | + |
| EST match | none |
| Database match | several Arabidopsis hypothetical proteins |
CDS: The table below lists the coordinates of the T10M13.18 exons and which algorithms selected the particular termini (GS = GenScan, Gr = GRAIL, M = MZEF, NPG = NetPlantGene).
| Exon | Range | 5' | 3' |
|---|---|---|---|
| 1 | 93637 - 95616 | GS, Gr | GS |
Alternate exons not used in building the gene model: The initial exon predicted by GRAIL extends from 93637 to 94468. Other exons extend from 94842 to 95004 and from 95034 to 95308. MZEF, not designed to predict terminal exons, locates no internal exons in this region. NetPlantGene predicts putative splice sites throughout this region. Notable site include splice acceptors at and splice donors at 93832 (0.88), at 93853 (0.92) and at 95512 (0.99).
Complete CDS of T10M13.18
ATGGATTCTGAGGGAGTGTTATTGCGGTTGATCCACGAGCATTCTATGATCCCTTGTAAT CATCTGAGGAAAGGTGATTGTTGTGGACGCTTTGAAGCTATAAGCGATGGCTACTACTGC AAAACATGCGATTTTTTCGTCCACAAGAAATGTGTCGATGAGGCCTCTGAATCTATCGAG CATCCATCTCATCCCGGTCACACTCTTCAGCTTCTAAGTAAACAAAAATATCGTTATTGT AATTTATGTGGAAGGGACATTAAGGATCTATGCTATCATTTTGGTAACTTCGACGTGGAT CTATATTGTGCAAAGTACCCACCACCAGAGGTTATTGAAAGTTCTGAGACACCATGCCAC AAGCGCACCCTTCTCAAGGAGAAGATCGAGTTTCATTGTGATGCTAAATGTGAGGAGATT GGTTATGGCTTTCCTTACAAATGTCTTGAATGTGGTTTGACCTTTCATATCGACTGTGTT TTGCATCCATCCGGGGTAAAACACCTGCCAGAGGTAATCCACCCGCTAGACGTAAAGAAT CACTCTTGCCACCCCTTGCACCCTCTTAAGCTTGTTCTCACAGGTCAACTACCAGACTAT TCTGATGGAAAATGTCGTCTTTGCGAAAAAAAGATCGATAGCCCTTTGTTCTATCATTGT TCTCCATGTAACTTCACCTTGGATATGCGTTGTGCTTTAAACCCGCCATCAATATCTTTC GAGGATTCGAAAACTCATGATCACCAACTCACTCTTCTCCCAAGACTCGACTCTTTTACA TGTAATGCTTGTGGGCTGAAGGGAGATCGAAGCCCTTACATATGTGTTCAATGTAATTTC ATCATCCACCAAGAGTGTCTTACCCTTCCACGCCTTATAAACATAAATCGGCATGATCAC CGTGTTGCTCGCACTAATCTTCTGGGTCTTGTATATTCTGTGTGTGGAGTTTGTCGCCAA AAAGTGGATTGGACTTGGGGAGGTTATTCTTGTCAAAGGTGTTCTAACTATGTCGTTCAT TCCAAATGTGCTACAAGAAAAGATGTGTGGAACGGGAAAGAACTCGAAGGAGTTATTGAA GAAACAGAAGATATAGAGCCGTATGTGGTAATAGATGACAACACAATACAACATTTCAGC CACGAAGAGCATTACCTAAAACTCGACGACAATGGTGTTTTATGTGATGAAAACAAGCGG TGCAGCGCATGCACCCATTCCGTTTGCCTCGAATCCTTCTATGGTTGTATGGATTGTGAC TTCATTCTTCACCAAAACTGTGCTAAATTTCCTAAAAGGAAACGACATGTGCTACACAAT GAACGGCTTACTTTATTTACACGCGAGGCTGGTCACTTTTGGTGTAATGTTTGTGGTAGA ATCTCCAATGGTTTCAGTTACCAATATGGGGATATGAAATTAGATGTCATATGTTGTTCA GTGTTGGAGCCATTTGTCCATCCAAGTCATCCCGATCATCCCTTATATTACATTTCACCA GAGATGGAAGAAGTCTGCAATGGTTGTAACATGTCGGGAACTAGAATGCTCAGGTGCATT GAAGATGGTTGTGGATTTGTCTTATGCTTCAAATGCGCCACTTTACCGCGAGTGTTAAAG CATAGAGTTGACGATTATCCTCTCTTACTATGCTATGGCGAAAAGGCAAATGGTATATAT TGGTGTGAGATTTGTGAGAAAAAAATGAATCCAGAGAAATGGTTCTACACATGTAAAGAT CAATGGGCTAGTTTGCACACAGAGTGTGTAGTCGGAGACTTTTCAGGGCTCATGCCAGGA AGCGTAGTAAAGGCTGAGACGGGATCATATGAGGTTGTGCTCAACAAGAATGTATCTCGC CCGTTTTGCAGGCAGTGTAAGTCGCATTGCATGTACCCTATCATCTATAAGATACCTGAA ACCTCAGTTTCTTATCTTTGCTCTGATATTTGCATAAAGCGTTTTACAAAGAGAGACTGA
Protein sequence
MDSEGVLLRLIHEHSMIPCNHLRKGDCCGRFEAISDGYYCKTCDFFVHKKCVDEASESIE HPSHPGHTLQLLSKQKYRYCNLCGRDIKDLCYHFGNFDVDLYCAKYPPPEVIESSETPCH KRTLLKEKIEFHCDAKCEEIGYGFPYKCLECGLTFHIDCVLHPSGVKHLPEVIHPLDVKN HSCHPLHPLKLVLTGQLPDYSDGKCRLCEKKIDSPLFYHCSPCNFTLDMRCALNPPSISF EDSKTHDHQLTLLPRLDSFTCNACGLKGDRSPYICVQCNFIIHQECLTLPRLININRHDH RVARTNLLGLVYSVCGVCRQKVDWTWGGYSCQRCSNYVVHSKCATRKDVWNGKELEGVIE ETEDIEPYVVIDDNTIQHFSHEEHYLKLDDNGVLCDENKRCSACTHSVCLESFYGCMDCD FILHQNCAKFPKRKRHVLHNERLTLFTREAGHFWCNVCGRISNGFSYQYGDMKLDVICCS VLEPFVHPSHPDHPLYYISPEMEEVCNGCNMSGTRMLRCIEDGCGFVLCFKCATLPRVLK HRVDDYPLLLCYGEKANGIYWCEICEKKMNPEKWFYTCKDQWASLHTECVVGDFSGLMPG SVVKAETGSYEVVLNKNVSRPFCRQCKSHCMYPIIYKIPETSVSYLCSDICIKRFTKRD*
Comparison of T10M13.17.1 and T10M13.18. There are numerous CxxC peptide elements repeated in these proteins. Protein T10M13.17.1 from residues 48 to 344 is 75% identical to residues 353 to 649. The two proteins from T10M13 are aligned to three other hypothetical proteins.
1 60
T10M13-17.1 MGLINQWFPLDTLATPERIRMACFDEFTSPFYMTMDSEGVLMPLVHEHLMMPWNDLRKGD
T10M13-18 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F2N1.27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T19K24.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F21P24.16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
61 120
T10M13-17.1 CCGLSKAVSVGYYCKSCDFFAHKKCGESSEFIQHPSHPNHTLQLRSSEGCNSCNLCGRTI
T10M13-18 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F2N1.27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T19K24.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F21P24.16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
121 180
T10M13-17.1 SDLFYRCDLCDFDVDLYCAKYPPPEVIDIPETHHHKLNLHKELIEFNCDAKCGKITGAEF
T10M13-18 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F2N1.27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T19K24.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F21P24.16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
181 240
T10M13-17.1 PYVCYECQLPFHVDCVWRPSELKHPSEVNHSYHSLHPLKLLSGQLPDNCDGKCRLCARKI
T10M13-18 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F2N1.27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T19K24.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F21P24.16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
241 300
T10M13-17.1 DDRLFYHCSPCNFTLDLRCVLNPPQQSLLNLKAHDHQLTLLPRLLSFTCNACGLNGDRSP
T10M13-18 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F2N1.27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T19K24.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F21P24.16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
301 360
T10M13-17.1 YTCVQCDFMIHQDCLDLPRVININQHDHRVSRTSVPGVGNSKGVLLPWFHRKHIMMPWND
T10M13-18 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~MDSEGVLLRLIH.EHSMIPCNH
F2N1.27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~MATTAVNLP.IH.EHPLYPSAR
T19K24.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F21P24.16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~MASLRKKLPLNR
361 420
T10M13-17.1 MRKGDC..CGSLESVTDGFYCKS..CDIFIHKKC.GESSGIIDHSSHPDHTLE.LNRYPN
T10M13-18 LRKGDC..CGRFEAISDGYYCKT..CDFFVHKKCVDEASESIEHPSHPGHTLQLLSKQKY
F2N1.27 CIIGECDGCHVNGYMYNGYFCNEPFCFIWFHKECA.EAPAEISHSSHHQHPLLLTNDSID
T19K24.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F21P24.16 FRRRLDKLRSRHNIRSVRQRFGNGGNTQQQRPPTKHLCPFSDPVEPHKLQRRRLLSQSQL
421 480
T10M13-17.1 KSCNLCGRSKGVNVCYRCDHCYYQLDLYCAKYPPPEFIDIPETHHHKLTLLKERIEFDCD
T10M13-18 RYCNLCGRDI.KDLCYHFGN..FDVDLYCAKYPPPEVIESSETPCHKRTLLKEKIEFHCD
F2N1.27 GPCDLCGQKL.LPPCYSCSTCEFKVDLTCAMKPSPPAIEHPLCHHHSLVFLKKREEKVSC
T19K24.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~MSKLMLPFHEHPMTISNLYNHAC
F21P24.16 SECFGCKGKHLEGKRYYYFCPLCNLEFHRGCHVLPQQMKHPLHLSHPLTLISLDPNFDT.
481 540
T10M13-17.1 AKC.GETGDGFAYKCPECDLFFHVDCVWHP......PEVNHPLEV.NHSYHPWHPLKL.H
T10M13-18 AKC.EEIGYGFPYKCLECGLTFHIDCVLHPSGVKHLPEVIHPLDVKNHSCHPLHPLKLVL
F2N1.27 ELC.KDSIAGPSYSCLECDVYFHVNCI......QLSKEVNHP.......CHSAHPLKLIP
T19K24.12 DFCPGRPKPGTIFFCEKCDLDLHKGCA..GKFLNNSSRCNHSLKIFQGR...........
F21P24.16 SKIPQEWGESSS.SDSEGSFDSDDEFGGRRRLLRRRARFFSPFFDVEDDDDDDVDLDDDD
541 600
T10M13-17.1 TGQPPHYSD......GTCRLCARNIDDRLFYHCSPCNFTLDLRCVLNPPPQSLLNLKAHD
T10M13-18 TGQLPDYSD......GKCRLCEKKIDSPLFYHCSPCNFTLDMRCALNPPSISFEDSKTHD
F2N1.27 FESLTDDAE......TTCLLCGKKPAENMLYHCSVCNFTSCLGCTKNPPLLVIEHIKTHK
T19K24.12 FGQFGEDDD......YQCHYCQQNVGFH.FARCTICNISIDEKCLRNPPPLTIFQPKHHK
F21P24.16 DDDDDDDNDLSDIYYGKCKCCQKRFED.IYYHCSVCNFSLNFTCTIKPPPLTITHLKSHS
601 660
T10M13-17.1 HQLILLPRLRSFTCNACGLSGDRS...PYICIQCDFMIHQDCLDLPRLINVNRHDHRVSR
T10M13-18 HQLTLLPRLDSFTCNACGLKGDRS...PYICVQCNFIIHQECLTLPRLININRHDHRVAR
F2N1.27 HPLILFPRRIPCICDICGKKCEFA...VYVCPQCDFVTARGCIDLARVININRHDHRIYL
T19K24.12 HSLFLLGRLVAFTCNACGVEGDRN...PYICIECNLMLHKECTDLPRVISINRHEHRISH
F21P24.16 HILTLFPRRIPLPCDVCGLSLNDAHDPVYACLPCNYMVHRACINLPRVIKITRHQHRLSL
661 720
T10M13-17.1 TSVLGVVNSV..CGVCHQKVDWTCGGFSCQRC.SSYVVHSKCATRRDVWNGKELEGVPEE
T10M13-18 TNLLGLVYSV..CGVCRQKVDWTWGGYSCQRC.SNYVVHSKCATRKDVWNGKELEGVIEE
F2N1.27 THHLGTGYSK..CGVCHENVNQYKGAYSCSVC.PHYAVHSTCAVRTDVWDGVELEGTTEI
T19K24.12 TFHLGKGEGVWECGVCRKTIDCVYGAFICSRCPSSYAVHSRCATRKEVWDGIELEDVPEE
F21P24.16 ISSL..ISGDFSCGVCRKTIDISYGHYSCIK.GCCYGAHSKCATNKNVWDGKDLEDVPEE
721 780
T10M13-17.1 IEDIE.PYVVIDD.NTIQHFSHKEHYLRLHVNDV.LCNDNKRCKACTHPIFL.QSFYSCM
T10M13-18 TEDIE.PYVVIDD.NTIQHFSHEEHYLKLDDNGV.LCDENKRCSACTHSVCL.ESFYGCM
F2N1.27 TEDNS.PFKVVGD.NLICHFSHEEHPLRFQKDDVFIHDEHKRCAACIHPLGF.GSIYCCE
T19K24.12 DEELEDPFKVINEKGDIVHFSHEEHVLRLDVNYVH.DNDTMCCGGCILPIND.DPCYKCV
F21P24.16 PEEVIEPFVRIDEE.TIHHFTHSCH.LKLQGKGSHR.GEKKFCKACCFPISIYDTFYTCL
781 840
T10M13-17.1 DCDFLLHQNCAGFPRMKRHVLHNERLTLVTNETKLF............QCAPCDRWSNGF
T10M13-18 DCDFILHQNCAKFPKRKRHVLHNERLTLFTREAGHF............WCNVCGRISNGF
F2N1.27 TCPFILHEKCANLPMKKRLVFGTRPYTLMKETTQLT............DCTLCGIFSDGF
T19K24.12 ECDFCLHKGCASLPRKKAHFLHNHKISLRAHKDDVY..........VDKCGACKTYSNGF
F21P24.16 DCDFVLHEACACLPRIQHHPLHKHPLILNKSDPSLQLYVDGYSVEGMFACSACRRHSTGF
841 900
T10M13-17.1 RYQ.......HGYKSLDLQCGSISEPFVHPSHPDHPLYHTLLDGRNEICDGCKKS.WYYV
T10M13-18 SYQ.......YGDMKLDVICCSVLEPFVHPSHPDHPLYY.ISPEMEEVCNGCNMS.GTRM
F2N1.27 AYS.......SRWLDVDVHCGSLNEPLVHDGH.IHPLNFDKMEG..HFCDGCKKNIDDYM
T19K24.12 KYECHDKSSCRGNIKYDIRCSSISEPFHHDDLHPHPLYWTLEETKQ..CKACGTEI.LKP
F21P24.16 VYSCNVEFC...DFQVDVKCAFLPDTSVHESHPNHPIYINLTKG...ACMGCSNACSRKY
901 960
T10M13-17.1 LSCIEDDCRFVLCFKCLTLPQVVKHRVDDDPLLLCYGEK..ACGKYWCDICEKETNPETW
T10M13-18 LRCIEDGCGFVLCFKCATLPRVLKHRVDDYPLLLCYGEK..ANGIYWCEICEKKMNPEKW
F2N1.27 LRC..KACDFDLCLYCATLPKKIWHRNDGHPLTLCCEEKEEASDKYWCDICEKELDP...
T19K24.12 LSCLV..CEYALCIRCATLPKKVKHRCDDHFLSLCH.DAGNSSGDLWCDICETKTDPSVC
F21P24.16 LECLI..CDSFLGVECATLPSVAHYKHDSHPLTLCYGEKGTTSGQYWCEICESKLDASEW
961 1020
T10M13-17.1 FYTCKDHQSSMHTMCVLGDSVGIMPKSTIMGWYKYYEVVLNNSISRPVCEMCKSRCIFPT
T10M13-18 FYTCKDQWASLHTECVVGDFSGLMPGSVVKAETGSYEVVLNKNVSRPFCRQCKSHCMYPI
F2N1.27 MCIRRFLTSRARTKLLVWGKSQVWGGS*~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T19K24.12 YYTCDKCGVSLHINCVLGDLYYTKVGLVYPG....LEVLPNNGATRPFCNSCKVRCKFSF
F21P24.16 FYTCDSCRVTLHLKCLLGNDMYIKPQHFV.GKEGEIEIALNDGNTRPFCSVCHVRCTDIS
1021 1080
T10M13-17.1 IMK.ILEASNEYVCSLACVLERG*~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T10M13-18 IYK.IPETSVSYLCSDICIKRFTKRD*~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F2N1.27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T19K24.12 LMKQTLGDLVRYFCSIRCIPRAR*~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F21P24.16 VFKY....QDNVYCDLGCVRKVKIYESRGSELFSSQTAQLKSLKLIRLLDRARKLKDSQR
1081 1140
T10M13-17.1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T10M13-18 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F2N1.27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T19K24.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F21P24.16 EFKFPLILSLHTKEENLRQDEIIWQREEVSVSGISFSEIPTTRRSVNDETDSLDHVTSQI
1141 1200
T10M13-17.1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T10M13-18 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F2N1.27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T19K24.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F21P24.16 YNNIKRTLITLDSSDSQGLVHVLMWNDQGLSWNPRVFYLPNFATKQQCEAVIDMAKPKLK
1201 1260
T10M13-17.1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T10M13-18 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F2N1.27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T19K24.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F21P24.16 PSTLALRKETKHFQMQYRSLHQHTDEDESGVLAAIEEKIALATRFPKDYYESFNILRYQL
1261 1320
T10M13-17.1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T10M13-18 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F2N1.27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T19K24.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F21P24.16 GQKYDSHYDAFHSAEYGPLISQRVIDSKLPSVDPPLMAQHMNAGRDFPIVLIECRRRWRD
1321 1380
T10M13-17.1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T10M13-18 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F2N1.27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T19K24.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
F21P24.16 DVPIRGNRDIDKNVETKKKMKKLELGSQIKVSITTPIDPKFTGKSAGERSKRTNKKTQLR
1381 1407
T10M13-17.1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~
T10M13-18 ~~~~~~~~~~~~~~~~~~~~~~~~~~~
F2N1.27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~
T19K24.12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~
F21P24.16 RREFEILTDMFEFTTTQLIAKDVYQI*
Amino acid composition of T10M13.17.1 and T10M13.18
| amino acid | T10M13.17.1 # % |
T10M13.18 # % |
| alanine | 23 2.3% | 15 2.3% |
| arginine | 49 5.0% | 31 4.7% |
| asparagine | 46 4.7% | 26 3.9% |
| aspartic acid | 71 7.2% | 41 6.2% |
| cysteine | 86 8.7% | 59 9.0% |
| glutamine | 27 2.7% | 13 2.0% |
| glutamic acid | 47 4.8% | 43 6.5% |
| glycine | 49 5.0% | 37 5.6% |
| histidine | 63 6.4% | 38 5.8% |
| isoleucine | 41 4.1% | 33 5.0% |
| leucine | 100 10.1% | 60 9.1% |
| lysine | 51 5.2% | 44 6.7% |
| methionine | 20 2.0% | 11 1.7% |
| phenylalanine | 41 4.1% | 27 4.1% |
| proline | 62 6.3% | 35 5.3% |
| serine | 66 6.7% | 42 6.4% |
| threonine | 38 3.8% | 26 3.9% |
| tryptophan | 14 1.4% | 7 1.1% |
| tyrosine | 39 3.9% | 30 4.6% |
| valine |
56 5.7% |
41 6.2% |
| calculated pI | 6.89 | 6.82 |
written 30 Jul 97
updated 5 Aug 98
Larry
Parnell