Gene T2H3.12/T10M13.22
Putative Identification zinc-finger protein
Position 37726 - 39984, from the initial methionine to the termination codon
Strand -
EST match none
Database match RP8 sequences of rat, mouse, and human

 

Note: This region was annotated using sequence from the overlapping clone T2H3. Although the entire gene model is encoded by T10M13, the sequence of T2H3, which is 100.0% identical to T10M13, is interior to that clone and thus avoids gene modeling problems that can occur at the very end of a sequence.

 

CDS:  The table below lists the coordinates of the T2H3.12 exons and which algorithm selected the particular termini (GF = GeneFinder, GS = GenScan, Gr = GRAIL, M = MZEF, NPG = NetPlantGene - selects splice sites only, not exons). Note: GRAIL, MZEF and NetPlantGene all suggest an exon in the region of exon 1 (see table below), but no initial methionine is described for T2H3.12.

Exon Range 3' 5'
1 39664 - 39984 GF, GS, Gr, M, NPG GF
2 39482 - 39565 GF, GS, M, NPG GF, M
3 39265 - 39398 GF, Gr, M, NPG GF, Gr, M
4 39068 - 39165 GF, GS, M, NPG GF, GS, M
5 38893 - 38981 GF, GS, Gr, NPG GF, GS, Gr
6 38685 - 38799 GF, GS, M, NPG GF, GS, Gr, M
7 38454 - 38608 GF, GS, Gr, M, NPG GF, GS, M
8 38304 - 38374 GF, M, NPG GF, GS, M
9 38103 - 38217 GF, M, NPG GF, M
10 37726 - 37884 GF, GS, Gr GF, GS, Gr

Alternate exons not used in building the gene model. GenScan predicts an initial exon from 40387 to 40361; and internal exons from 40343 to 40195, from 39600 to 39482, from 39664 to 39934 and from 38374 to 38201. GRAIL predicts internal exons from 40529 to 40467, from 39664 to 39934, from 38799 to 38710 and from 38634 to 38454. MZEF predicts an internal exons from 40467 to 40537, from 40246 to 40190 and from 39664 to 39934. NetPlantGene predicts several putative splice sites in this region. Those not used include splice donors at 40501 (confidence score = 1.00), at 40467 and at 40190 (0.93) and a splice acceptor at 39934.

CDS of T2H3.12

ATGAAACAGAGTAAAACCCTAGAATTCAATTCCTTTGCTCTGTTATGCAGCAGTCTTCAT
TTCATCAAATTCAACACAGAAGATTCGATGAGTAGCTTCAACGGAGACTCCATGGATGAT
TTCAAAGGCCTCCGAATAACTCAGCTTGATGATGATGACGATGATGAAACTGCGGTGGAA
CCTATAAATATGGATGAATTTGATGATGATGATGAGGAAGATGATGAAGATTATGAACCT
GTGATGTTAGGTTTCGTTGAGAGTCCTAAATTCGCATGGTCAAATCTTCGTCAACTGTTC
CCTAATTTAGCTGGTGGTGTTCCGGCATGGTTGGATCCAGTTAATTTACCATCAGGGAAG
TCAATTCTATGTGATCTATGTGAAGAACCTATGCAATTTGTACTTCAACTTTATGCTCCT
TTAACAGACAAGGAATCAGCTTTTCATCGGACATTGTTTCTCTTCATGTGTCCATCTATG
TCTTGTCTCCTTCGTGATCAACATGAACAATGGAAACGTGCCCCAGAGAAGGCTATGCGG
AGTACGAAGGTTTTCCGTTGCCAATTGCCTCGAGCTAATCCTTTTTACTCGAGTGAGGCT
CCAAAGCACGATGGAACAGACAAGCCATTGGGTCATGGAGCTCCGCTTTGCACTTGGTGT
GGAACATGGAAAGGAGATAAACTATGTAGCGGCTGCAAAAATGCTCGATACTGTTCACCG
AAACACCAGGCCCTGCATTGGCGCTTGGGTCATAAAACCGAATGCCAACAGCTTCGAACG
GTTAGCGAAACATCTGACTCAGGCCCTGTCAATAATGGGGTTGCTCCAACTGAAAAGCAG
AAAGTTGCGAGTAAGAGTTTATGGAAAGAATTTGTATTGATCAATGAGGATGAAAGTGAG
TATGATACTGAGATGTCAGGAGATGATGAAGTAGCTAAACCTTTGGTCTCAAAGAGAGAA
GTTGACGACCAAATGAAATCGCTAATGAACGATTTTGAGGGAGATGCTGATAAAAAGAAT
TGGGTTAATTTCCAGCAACGCGTAGACAAAGCCCCTGAACAAGTTCTGAGATATTCCCGG
AGCTCTGGTGCTAAACCCCTTTGGCCAATAGCAAGCGGACGAGTCTCTAAATCTGAGCTT
CCCAGTTGCAAATCTTGTGGTGGTCCTCGTTGTTTTGAATTTCAGGTGATGCCGCAGCTA
CTGTTCTTCTTCGGTGGGAAGAACGAGAGGGAATCTCTCGATTGGGCAACAATCGTGGTG
TACACTTGTGAGAACTCGTGTGACTCGAGCTTAAGTTACAAGGAAGAGTTTGTCTGGGTT
CAACTATACAGTCAGACAACTTAG

 

Protein sequence

MKQSKTLEFNSFALLCSLHFIKFNTEDSMSSFNGDSMDDFKGLRITQLDDDDDDETAVEP
INMDEFDDDDEEDDEDYEPVMLGFVESPKFAWSNLRQLFPNLAGGVPAWLDPVNLPSGKS
ILCDLCEEPMQFVLQLYAPLTDKESAFHRTLFLFMCPSMSCLLRDQHEQWKRAPEKAMRS
TKVFRCQLPRANPFYSSEAPKHDGTDKPLGHGAPLCTWCGTWKGDKLCSGCKNARYCSPK
HQALHWRLGHKTECQQLRTVSETSDSGPVNNGVAPTEKQKVASKSLWKEFVLINEDESEY
DTEMSGDDEVAKPLVSKREVDDQMKSLMNDFEGDADKKNWVNFQQRVDKAPEQVLRYSRS
SGAKPLWPIASGRVSKSELPSCKSCGGPRCFEFQVMPQLLFFFGGKNERESLDWATIVVY
TCENSCDSSLSYKEEFVWVQLYSQTT*

 

For more information on T2H3.12/T10M13.22, please go to the annotation page T10M13.22.

 


written 11 Sep 98
updated 16 Sep 98
Larry Parnell