| Gene | T10M13.13 |
| Putative Identification | CTP synthase, UTP ammonia ligase |
| Position | 67138 to 70236, from the initial methionine to the termination codon |
| Strand | + |
| EST hits | none |
| Database match: | CTP synthase sequences from several species, including H. sapiens (P17812), S. cerevisiae (P38627), and Haemophilus influenzae (P44341) |
CTP synthase catalyzes the final step in pyrimidine biosynthesis. The reaction, requiring ATP, can be described as the addition of an amino group (from glutamine) to UTP thus forming CTP.
CDS: The table below lists the coordinates of the T10M13.13 exons and which algorithms selected the particular exon terminus (GS = GenScan, Gr = GRAIL, M = MZEF, NPG = NetPlantGene - selects splice sites only, not exons). Splice sites determined by similarity to M. truncatula EST AA660762 are designated by est.
| Exon | Range | 5' | 3' |
|---|---|---|---|
| 1 | 67138 - 67255 | Gr | est, Gr, NPG |
| 2 | 67332 - 67411 | est, GS, Gr, M, NPG | est, GS, Gr, M, NPG |
| 3 | 67492 - 67578 | est, M, NPG | est, GS, Gr, M, NPG |
| 4 | 67667 - 67835 | est, GS, Gr, M, NPG | GS, Gr, M, NPG |
| 5 | 67910 - 67966 | Gr, M, NPG | Gr, M |
| 6 | 68064 - 68128 | GS, Gr, M | GS, Gr, M, NPG |
| 7 | 68194 - 68277 | GS, Gr, M, NPG | GS, Gr, M, NPG |
| 8 | 68347 - 68397 | M, NPG | M, NPG |
| 9 | 68468 - 68530 | Gr, NPG | Gr |
| 10 | 68704 - 68808 | GS, Gr, NPG | GS, Gr, NPG |
| 11 | 68875 - 68937 | GS, Gr, M, NPG | GS, Gr, M, NPG |
| 12 | 69003 - 69092 | GS, Gr, M, NPG | GS, Gr, M, NPG |
| 13 | 69180 - 69218 | GS, Gr, M, NPG | GS, Gr, M, NPG |
| 14 | 69280 - 69429 | GS, Gr, M, NPG | GS, Gr, NPG |
| 15 | 69514 - 69591 | GS, Gr, M, NPG | GS, Gr, M, NPG |
| 16 | 69676 - 69761 | GS, Gr, NPG | GS, Gr, NPG |
| 17 | 69844 - 69892 | GS, NPG | GS, NPG |
| 18 | 69972 - 70052 | GS, Gr, M, NPG | GS, M, NPG |
| 19 | 70132 - 70236 | GS, NPG | GS |
Alternate exons not used in building the gene model: GenScan concatenates T10M13.12 and T10M13.13 and thus does not predict an initial exon. An internal exon is predicted from 67090 to 67167. GRAIL predicts a terminal exon from 69972 to 70139. MZEF predicts exon 14 from 69280 to 69408. NetPlantGene predicts many splice sites throughout this region. Noteble sites include a splice acceptor at 69166 (confidence score = 1.00) and splice donors at 68035 (0.90), 68651 (0.90) and at 70225.
Complete CDS of T10M13.13
ATGAAGTACGTATTGGTGACTGGAGGAGTGGTGAGTGGGCTTGGCAAAGGCGTTACGGCG AGTAGTATCGGCGTGGTGCTTAAAGCTTGTGGCCTTGGAGTTACCTCCATTAAAATTGAT CCATATCTGAACACTGATGCTGGTACAATGTCCCCTTTTGAGCATGGAGAGGTTTTTGTA CTCGATGATGGTGGAGAGGTTGATCTTGATTTGGGAAACTACGAAAGATTTCTCGATGTG ACACTAACCAAAGACAACAACATAACCACTGGGAAGATATATCAGTCTGTTCTTGACAAG GAACGGAAAGGCGACTACCTTGGGAAGACTGTTCAGGTTGTTCCACACATTACTGATGCA ATCAAGGATTGGATTGAGTCAGTCTCTCTGATTCCTGTGGATGGAAAGGAAGGCCAAGCT GATGTTTGTGTTATTGAGTTGGGAGGGACTGTTGGTGACATCGAGTCAATGCCTTTTATC GAGGCCTTGAGACAATTGTCATTCTCTGTTGGACCGGATAATTTCTGCCTGATTCACGTG AGCTTGATTCCCGTTCTTGGTGTTGTTGGGGAGCAGAAAACAAAGCCAACCCAACACACT GTCCGAGAACTAAGAGCTTTGGGATTGACTCCTCACTTTTTGGCATGTCGCTCTGCTCAG CCACTGCTGGAAAGTACCAAGGCTAAATTGTCTCAGTTCTGTCATGTGGCGGCAGCTAAT ATTCTCAATATCCATGATGTTCCAAACATTTGGCATGTTCCTCTCCTACTCCGTTGTAAC ACCAAGTTCAATAAACTCATCTCTTTCAGCGTTGCGACAGCACCTGATCTGGATAGCTGG AATAAGATGGCTGAAACTTTTGATAACTTGACAAATCATGTCCAAATTGCTATGGTTGGA AAGTACATTGGTCTAACGGATTCTTATCTCTCGGTTGTGAAGGCCCTTCTTCATGCCTGC ATTGCATGTTCATTGAAGCCACATATCGAATGGATCGCAGCTTCAGATCTTGAAGACGAA AGTGAGAAATCGACTCCGGAGGCGCATGCTGCTGCTTGGAAGATCTTGAAGAGCGCAGAA TGCATTCTGGTTCCTGGTGGTTTTGGAGATCGTGGTGTGAGTGGAATGGTTTTAGCAGCC AAATACGCTAGAGAGAACAAAATTCCTTATCTTGGGATCTGCTTGGGGATGCAAATTGCA GTAATTGAGTTTGCAAGATCTGTTTTGGGTTTGGAAAGAGCAAATAGCACTGAGTTCGAT GCTCAGACATCTGACCCTGTTGTTATATTTATGCCAGAAGGCTCAAGAACACATATGGGA AGCACAATGAGACTTGGATCCCGAAGAACACATTTACACAACCGAGACTCTCTCACTTCA AAACTATACGGCCAAGTTTCTTATGTAGACGAAAGGCATAGGCACCGTTACGAGGTGAAT CCAGAAGTAGCTCAAGCACTTGAAGAAGCTGGTTTAAGATTTGTGGGTAAAGATGATACA GGGAAACGTGTTGAGGTGATTGAGTTTCAAGATCATCCATTCTATGTTGGAGTTCAGTTT CATCCAGAATTTAAGTCAAGACCCACTAGACCTTCTCCTCTGTTTTTAGGCAAGTCTTGA
Protein translation:
MKYVLVTGGVVSGLGKGVTASSIGVVLKACGLGVTSIKIDPYLNTDAGTMSPFEHGEVFV LDDGGEVDLDLGNYERFLDVTLTKDNNITTGKIYQSVLDKERKGDYLGKTVQVVPHITDA IKDWIESVSLIPVDGKEGQADVCVIELGGTVGDIESMPFIEALRQLSFSVGPDNFCLIHV SLIPVLGVVGEQKTKPTQHTVRELRALGLTPHFLACRSAQPLLESTKAKLSQFCHVAAAN ILNIHDVPNIWHVPLLLRCNTKFNKLISFSVATAPDLDSWNKMAETFDNLTNHVQIAMVG KYIGLTDSYLSVVKALLHACIACSLKPHIEWIAASDLEDESEKSTPEAHAAAWKILKSAE CILVPGGFGDRGVSGMVLAAKYARENKIPYLGICLGMQIAVIEFARSVLGLERANSTEFD AQTSDPVVIFMPEGSRTHMGSTMRLGSRRTHLHNRDSLTSKLYGQVSYVDERHRHRYEVN PEVAQALEEAGLRFVGKDDTGKRVEVIEFQDHPFYVGVQFHPEFKSRPTRPSPLFLGKS*
Protein motifs:
The signature for the active site of class I glutamine amidotransferases is found in T10M13.13.
sequence: 389 PYLGICLGMQIA 400
Alignment of CTP synthase sequences: Sequences from human, Arabidopsis, S. cerevisiae, and H. influenzae are aligned. The active site signature is highlighted. residues in bold are conserved in all four sequences.
1 60
Human MKYILVTGGVISGIGKGIIASSVGTILKSCGLHVTSIKIDPYINIDAGTFSPYEHGEV
A. thaliana MKYVLVTGGVVSGLGKGVTASSIGVVLKACGLGVTSIKIDPYLNTDAGTMSPFEHGEV
S. cerevis. MKYVVVSGGVISGIGKGVLASSTGMLLKTLGLKVTSIKIDPYMNIDAGTMSPLEHGEC
Hae. influ. MATNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPTQHGEV
61 120
Human FVLDDGGEVDLDLGNYERFLDIRLTKDNNLTTGKIYQYVINKERKGDYLGKTVQVVPHIT
A. thaliana FVLDDGGEVDLDLGNYERFLDVTLTKDNNITTGKIYQSVLDKERKGDYLGKTVQVVPHIT
S. cerevis. FVLDDGGETDLDLGNYERYLGITLSRDHNITTGKIYSQLISRERRGDYLGKTVQIVPHLT
Hae. influ. FVTQDGAETDLDLGHYERFIRTKMTKRNNFTTGKIYSEVLRKERRGDYLGATIQVIPHIT
121 180
Human DAIQEWVMRQALIPVDEDGLEPQVCVIELGGTVGDIESMPFIEAFRQFQFKVKRENFCNI
A. thaliana DAIKDWIESVSLIPVDGKEGQADVCVIELGGTVGDIESMPFIEALRQLSFSVGPDNFCLI
S. cerevis. NAIQDWIQRVSKIPVDDTGLEPDVCIIELGGTVGDIESAPFVEALRQFQFEVGRENFALI
Hae. influ. NEIKDR........VIAGAQGHDVVIVEVGGTVGDIESLPFLEALRQLAVQVGREHTLFM
181 240
Human HVSLVPQPSSTGEQKTKPTQNSVRELRGLGLSPDLVVCRCSNPLDTSVKEKISMFCHVEP
A. thaliana HVSLIPVLGVVGEQKTKPTQHTVRELRALGLTPHFLACRSAQPLLESTKAKLSQFCHVAA
S. cerevis. HVSLVPV..IHGEQKTKPTQAAIKDLRLLGLIPDMIACRCSEELNRSTIDKIAMFCHVGP
Hae. influ. HLTLVPYIPTAGEVKTKPTQHSVKELLSIGIQPDVLICRSDRMIPPNERAKIALFCNVAE
241 300
Human EQVICVHDVSSIYRVPLLLEEQGVVDYFLRRLDLPIER.......QPRKMLMKWKEMADR
A. thaliana ANILNIHDVPNIWHVPLLLRCNTK...FNKLISFSVAT.......APD..LDSWNKMAET
S. cerevis. EQVVNVHDVNSTYHVPLLLLKQHMIDYLHSRLKLGEVPLTLEDKERGSQLLTNWENMTKN
Hae. influ. RAVISLKDVNSIYQIPALLKSQGLDDFVCERFRLTC..........PEADLTEWEQVLYK
301 360
Human YDRLLETCSIALVAKYTEFSDSYASVIKALEHSALAINHKLEIKYIDSADLEPITSQEEP
A. thaliana FDNLTNHVQIAMVGKYIGLTDSYLSVVKALLHACIACSLKPHIEWIAASDLEDESEKSTP
S. cerevis. LDDSDDVVKIALVGKYTNLKDSYLSVTKSLEHASMKCRRQLEILWVEASNLEPETQEVDK
Hae. influ. QANPVGEVTIGMVGKYTELPDAYKSVNEALKHAGLTNRLSVNIKYIDSQDVETKGVEV..
351 420
Human VRYHEAWQKLCSAHGVLVPGGFGVRGTEGKIQAIAWARNQKKPFLGVCLGMQLAVVEFSR
A. thaliana EAHAAAWKILKSAECILVPGGFGDRGVSGMVLAAKYARENKIPYLGICLGMQIAVIEFAR
S. cerevis. NKFHDSWNKLSSADGILVPGGFGTRGIEGMILAAKWARESGVPFLGVCLGLQVAAIEFAR
Hae. influ. .........LKGIDGILVPGGFGYRGVEGKIRTAQYARENKIPYLGICLGMQIALIEYAR
421 450
Human NVLGWQDANSTEFDPTT....SHPVVVDMPE...........HNPGQMGGTMRLGKRRTL
A. thaliana SVLGLERANSTEFDAQT....SDPVVIFMPE...........GSRTHMGSTMRLGSRRTH
S. cerevis. NVIGRPNSSSTEFLDETLLAPEDPSSHIHAE...........IDKEHMGGTMRLGLRPTI
Hae. influ. NVAGLTKANSSEFDKDC....EQPVVALITEWQDAEGNTEVRTDESDLGGTMRLGAQQCH
451 540
Human FQTKNS.......VMRKL..YGDADYLEERHRHRFEVNPVWKKCLEEQGLKFVGQDVEGE
A. thaliana LHNRHGFKDSKLFFLRAFTRYGQVSYVDERHRHRYEVNPEVAQALEEAGLRFVGKDDTGK
S. cerevis. FQPNSEWSN.....IRKL..YGEVNEVHERHRHRYEINPKIVNDMESRGFIFVGKDETGQ
Hae. influ. LVSGS..........RARELYGK.ETIEERHRHRYEVNNTLLPQIEKAGLKVTGLSADKK
541 601
Human RMEIVELEDHPFFVGVQYHPEFLSRPIKPSPPYFGLLLASVGRLSHYLQKGCRLSPRDTYS
A. thaliana RVEVIEFQDHPFYVGVQFHPEFKSRPTRPSPLFLGKS
S. cerevis. RCEIFELKGHPYYVGTQYHPEYTSKVLEPSRPFWGLV..AQLRHTCEVIKDINLSEGNENE
Hae. inlfu. LVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAAYENHKKSVK
602 624
Human SDRSGSSSPDSEITELKFPSINHD
written 29 Jul 97
updated 3 Aug 98
Larry Parnell