Gene T10M13.13
Putative Identification CTP synthase, UTP ammonia ligase
Position 67138 to 70236, from the initial methionine to the termination codon
Strand +
EST hits none
Database match: CTP synthase sequences from several species, including H. sapiens (P17812), S. cerevisiae (P38627), and Haemophilus influenzae (P44341)

 

CTP synthase catalyzes the final step in pyrimidine biosynthesis. The reaction, requiring ATP, can be described as the addition of an amino group (from glutamine) to UTP thus forming CTP.

 

CDS:  The table below lists the coordinates of the T10M13.13 exons and which algorithms selected the particular exon terminus (GS = GenScan, Gr = GRAIL, M = MZEF, NPG = NetPlantGene - selects splice sites only, not exons). Splice sites determined by similarity to M. truncatula EST AA660762 are designated by est.

Exon Range 5' 3'
1 67138 - 67255 Gr est, Gr, NPG
2 67332 - 67411 est, GS, Gr, M, NPG est, GS, Gr, M, NPG
3 67492 - 67578 est, M, NPG est, GS, Gr, M, NPG
4 67667 - 67835 est, GS, Gr, M, NPG GS, Gr, M, NPG
5 67910 - 67966 Gr, M, NPG Gr, M
6 68064 - 68128 GS, Gr, M GS, Gr, M, NPG
7 68194 - 68277 GS, Gr, M, NPG GS, Gr, M, NPG
8 68347 - 68397 M, NPG M, NPG
9 68468 - 68530 Gr, NPG Gr
10 68704 - 68808 GS, Gr, NPG GS, Gr, NPG
11 68875 - 68937 GS, Gr, M, NPG GS, Gr, M, NPG
12 69003 - 69092 GS, Gr, M, NPG GS, Gr, M, NPG
13 69180 - 69218 GS, Gr, M, NPG GS, Gr, M, NPG
14 69280 - 69429 GS, Gr, M, NPG GS, Gr, NPG
15 69514 - 69591 GS, Gr, M, NPG GS, Gr, M, NPG
16 69676 - 69761 GS, Gr, NPG GS, Gr, NPG
17 69844 - 69892 GS, NPG GS, NPG
18 69972 - 70052 GS, Gr, M, NPG GS, M, NPG
19 70132 - 70236 GS, NPG GS

Alternate exons not used in building the gene model:   GenScan concatenates T10M13.12 and T10M13.13 and thus does not predict an initial exon. An internal exon is predicted from 67090 to 67167. GRAIL predicts a terminal exon from 69972 to 70139. MZEF predicts exon 14 from 69280 to 69408. NetPlantGene predicts many splice sites throughout this region. Noteble sites include a splice acceptor at 69166 (confidence score = 1.00) and splice donors at 68035 (0.90), 68651 (0.90) and at 70225.

Complete CDS of T10M13.13

ATGAAGTACGTATTGGTGACTGGAGGAGTGGTGAGTGGGCTTGGCAAAGGCGTTACGGCG
AGTAGTATCGGCGTGGTGCTTAAAGCTTGTGGCCTTGGAGTTACCTCCATTAAAATTGAT
CCATATCTGAACACTGATGCTGGTACAATGTCCCCTTTTGAGCATGGAGAGGTTTTTGTA
CTCGATGATGGTGGAGAGGTTGATCTTGATTTGGGAAACTACGAAAGATTTCTCGATGTG
ACACTAACCAAAGACAACAACATAACCACTGGGAAGATATATCAGTCTGTTCTTGACAAG
GAACGGAAAGGCGACTACCTTGGGAAGACTGTTCAGGTTGTTCCACACATTACTGATGCA
ATCAAGGATTGGATTGAGTCAGTCTCTCTGATTCCTGTGGATGGAAAGGAAGGCCAAGCT
GATGTTTGTGTTATTGAGTTGGGAGGGACTGTTGGTGACATCGAGTCAATGCCTTTTATC
GAGGCCTTGAGACAATTGTCATTCTCTGTTGGACCGGATAATTTCTGCCTGATTCACGTG
AGCTTGATTCCCGTTCTTGGTGTTGTTGGGGAGCAGAAAACAAAGCCAACCCAACACACT
GTCCGAGAACTAAGAGCTTTGGGATTGACTCCTCACTTTTTGGCATGTCGCTCTGCTCAG
CCACTGCTGGAAAGTACCAAGGCTAAATTGTCTCAGTTCTGTCATGTGGCGGCAGCTAAT
ATTCTCAATATCCATGATGTTCCAAACATTTGGCATGTTCCTCTCCTACTCCGTTGTAAC
ACCAAGTTCAATAAACTCATCTCTTTCAGCGTTGCGACAGCACCTGATCTGGATAGCTGG
AATAAGATGGCTGAAACTTTTGATAACTTGACAAATCATGTCCAAATTGCTATGGTTGGA
AAGTACATTGGTCTAACGGATTCTTATCTCTCGGTTGTGAAGGCCCTTCTTCATGCCTGC
ATTGCATGTTCATTGAAGCCACATATCGAATGGATCGCAGCTTCAGATCTTGAAGACGAA
AGTGAGAAATCGACTCCGGAGGCGCATGCTGCTGCTTGGAAGATCTTGAAGAGCGCAGAA
TGCATTCTGGTTCCTGGTGGTTTTGGAGATCGTGGTGTGAGTGGAATGGTTTTAGCAGCC
AAATACGCTAGAGAGAACAAAATTCCTTATCTTGGGATCTGCTTGGGGATGCAAATTGCA
GTAATTGAGTTTGCAAGATCTGTTTTGGGTTTGGAAAGAGCAAATAGCACTGAGTTCGAT
GCTCAGACATCTGACCCTGTTGTTATATTTATGCCAGAAGGCTCAAGAACACATATGGGA
AGCACAATGAGACTTGGATCCCGAAGAACACATTTACACAACCGAGACTCTCTCACTTCA
AAACTATACGGCCAAGTTTCTTATGTAGACGAAAGGCATAGGCACCGTTACGAGGTGAAT
CCAGAAGTAGCTCAAGCACTTGAAGAAGCTGGTTTAAGATTTGTGGGTAAAGATGATACA
GGGAAACGTGTTGAGGTGATTGAGTTTCAAGATCATCCATTCTATGTTGGAGTTCAGTTT
CATCCAGAATTTAAGTCAAGACCCACTAGACCTTCTCCTCTGTTTTTAGGCAAGTCTTGA

 

Protein translation:

MKYVLVTGGVVSGLGKGVTASSIGVVLKACGLGVTSIKIDPYLNTDAGTMSPFEHGEVFV
LDDGGEVDLDLGNYERFLDVTLTKDNNITTGKIYQSVLDKERKGDYLGKTVQVVPHITDA
IKDWIESVSLIPVDGKEGQADVCVIELGGTVGDIESMPFIEALRQLSFSVGPDNFCLIHV
SLIPVLGVVGEQKTKPTQHTVRELRALGLTPHFLACRSAQPLLESTKAKLSQFCHVAAAN
ILNIHDVPNIWHVPLLLRCNTKFNKLISFSVATAPDLDSWNKMAETFDNLTNHVQIAMVG
KYIGLTDSYLSVVKALLHACIACSLKPHIEWIAASDLEDESEKSTPEAHAAAWKILKSAE
CILVPGGFGDRGVSGMVLAAKYARENKIPYLGICLGMQIAVIEFARSVLGLERANSTEFD
AQTSDPVVIFMPEGSRTHMGSTMRLGSRRTHLHNRDSLTSKLYGQVSYVDERHRHRYEVN
PEVAQALEEAGLRFVGKDDTGKRVEVIEFQDHPFYVGVQFHPEFKSRPTRPSPLFLGKS*

 

Protein motifs:

The signature for the active site of class I glutamine amidotransferases is found in T10M13.13.

sequence:   389 PYLGICLGMQIA 400

Alignment of CTP synthase sequences:  Sequences from human, Arabidopsis, S. cerevisiae, and H. influenzae are aligned. The active site signature is highlighted. residues in bold are conserved in all four sequences.

               1                                                         60
        Human    MKYILVTGGVISGIGKGIIASSVGTILKSCGLHVTSIKIDPYINIDAGTFSPYEHGEV 
  A. thaliana    MKYVLVTGGVVSGLGKGVTASSIGVVLKACGLGVTSIKIDPYLNTDAGTMSPFEHGEV 
  S. cerevis.    MKYVVVSGGVISGIGKGVLASSTGMLLKTLGLKVTSIKIDPYMNIDAGTMSPLEHGEC 
  Hae. influ.  MATNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPTQHGEV 

               61                                                       120
        Human  FVLDDGGEVDLDLGNYERFLDIRLTKDNNLTTGKIYQYVINKERKGDYLGKTVQVVPHIT 
  A. thaliana  FVLDDGGEVDLDLGNYERFLDVTLTKDNNITTGKIYQSVLDKERKGDYLGKTVQVVPHIT 
  S. cerevis.  FVLDDGGETDLDLGNYERYLGITLSRDHNITTGKIYSQLISRERRGDYLGKTVQIVPHLT 
  Hae. influ.  FVTQDGAETDLDLGHYERFIRTKMTKRNNFTTGKIYSEVLRKERRGDYLGATIQVIPHIT 

               121                                                      180
        Human  DAIQEWVMRQALIPVDEDGLEPQVCVIELGGTVGDIESMPFIEAFRQFQFKVKRENFCNI 
  A. thaliana  DAIKDWIESVSLIPVDGKEGQADVCVIELGGTVGDIESMPFIEALRQLSFSVGPDNFCLI 
  S. cerevis.  NAIQDWIQRVSKIPVDDTGLEPDVCIIELGGTVGDIESAPFVEALRQFQFEVGRENFALI 
  Hae. influ.  NEIKDR........VIAGAQGHDVVIVEVGGTVGDIESLPFLEALRQLAVQVGREHTLFM 

               181                                                      240
        Human  HVSLVPQPSSTGEQKTKPTQNSVRELRGLGLSPDLVVCRCSNPLDTSVKEKISMFCHVEP 
  A. thaliana  HVSLIPVLGVVGEQKTKPTQHTVRELRALGLTPHFLACRSAQPLLESTKAKLSQFCHVAA 
  S. cerevis.  HVSLVPV..IHGEQKTKPTQAAIKDLRLLGLIPDMIACRCSEELNRSTIDKIAMFCHVGP 
  Hae. influ.  HLTLVPYIPTAGEVKTKPTQHSVKELLSIGIQPDVLICRSDRMIPPNERAKIALFCNVAE 

               241                                                      300
        Human  EQVICVHDVSSIYRVPLLLEEQGVVDYFLRRLDLPIER.......QPRKMLMKWKEMADR 
  A. thaliana  ANILNIHDVPNIWHVPLLLRCNTK...FNKLISFSVAT.......APD..LDSWNKMAET 
  S. cerevis.  EQVVNVHDVNSTYHVPLLLLKQHMIDYLHSRLKLGEVPLTLEDKERGSQLLTNWENMTKN 
  Hae. influ.  RAVISLKDVNSIYQIPALLKSQGLDDFVCERFRLTC..........PEADLTEWEQVLYK 

               301                                                      360
        Human  YDRLLETCSIALVAKYTEFSDSYASVIKALEHSALAINHKLEIKYIDSADLEPITSQEEP 
  A. thaliana  FDNLTNHVQIAMVGKYIGLTDSYLSVVKALLHACIACSLKPHIEWIAASDLEDESEKSTP 
  S. cerevis.  LDDSDDVVKIALVGKYTNLKDSYLSVTKSLEHASMKCRRQLEILWVEASNLEPETQEVDK 
  Hae. influ.  QANPVGEVTIGMVGKYTELPDAYKSVNEALKHAGLTNRLSVNIKYIDSQDVETKGVEV.. 

               351                                                      420
        Human  VRYHEAWQKLCSAHGVLVPGGFGVRGTEGKIQAIAWARNQKKPFLGVCLGMQLAVVEFSR 
  A. thaliana  EAHAAAWKILKSAECILVPGGFGDRGVSGMVLAAKYARENKIPYLGICLGMQIAVIEFAR 
  S. cerevis.  NKFHDSWNKLSSADGILVPGGFGTRGIEGMILAAKWARESGVPFLGVCLGLQVAAIEFAR 
  Hae. influ.  .........LKGIDGILVPGGFGYRGVEGKIRTAQYARENKIPYLGICLGMQIALIEYAR 

               421                                                      450
        Human  NVLGWQDANSTEFDPTT....SHPVVVDMPE...........HNPGQMGGTMRLGKRRTL 
  A. thaliana  SVLGLERANSTEFDAQT....SDPVVIFMPE...........GSRTHMGSTMRLGSRRTH 
  S. cerevis.  NVIGRPNSSSTEFLDETLLAPEDPSSHIHAE...........IDKEHMGGTMRLGLRPTI 
  Hae. influ.  NVAGLTKANSSEFDKDC....EQPVVALITEWQDAEGNTEVRTDESDLGGTMRLGAQQCH 

               451                                                      540
        Human  FQTKNS.......VMRKL..YGDADYLEERHRHRFEVNPVWKKCLEEQGLKFVGQDVEGE 
  A. thaliana  LHNRHGFKDSKLFFLRAFTRYGQVSYVDERHRHRYEVNPEVAQALEEAGLRFVGKDDTGK 
  S. cerevis.  FQPNSEWSN.....IRKL..YGEVNEVHERHRHRYEINPKIVNDMESRGFIFVGKDETGQ 
  Hae. influ.  LVSGS..........RARELYGK.ETIEERHRHRYEVNNTLLPQIEKAGLKVTGLSADKK 

               541                                                       601
        Human  RMEIVELEDHPFFVGVQYHPEFLSRPIKPSPPYFGLLLASVGRLSHYLQKGCRLSPRDTYS 
  A. thaliana  RVEVIEFQDHPFYVGVQFHPEFKSRPTRPSPLFLGKS 
  S. cerevis.  RCEIFELKGHPYYVGTQYHPEYTSKVLEPSRPFWGLV..AQLRHTCEVIKDINLSEGNENE 
  Hae. inlfu.  LVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAAYENHKKSVK 

               602                  624
        Human  SDRSGSSSPDSEITELKFPSINHD

 


written 29 Jul 97
updated 3 Aug 98
Larry Parnell