| Gene | T10M13.22/T2H3.12 |
| Putative Identification | zinc-finger protein |
| Position | 102356 to 105167, from the initial methionine to the termination codon |
| Strand | + |
| EST match | none |
| Database match | RP8 sequences of rat, mouse, and human |
Note: This region was annotated using sequence from the overlapping clone T2H3. Although the entire gene model is encoded by T10M13, the sequence of T2H3, which is 100.0% identical to T10M13, is interior to that clone and thus avoids gene modeling problems that can occur at the very end of a sequence.
CDS: The table below lists the coordinates of the T10M13.22 exons and which algorithm selected the particular termini (GF = GeneFinder, GS = GenScan, Gr = GRAIL, M = MZEF, NPG = NetPlantGene - selects splice sites only, not exons). Note: GRAIL, MZEF and NetPlantGene all suggest an exon in the region of exon 1 (see table below), but no initial methionine is described for T10M13.22.
| Exon | Range | 5' | 3' |
|---|---|---|---|
| 1 | 102909 - 103229 | GF | GF, GS, Gr, M, NPG |
| 2 | 103328 - 103411 | GF, M, NPG | GF, GS, M, NPG |
| 3 | 103495 - 103628 | GF, Gr, M, NPG | GF, Gr, M, NPG |
| 4 | 103728 - 103825 | GF, GS, M, NPG | GF, GS, M, NPG |
| 5 | 103912 - 104000 | GF, GS, Gr, NPG | GF, GS, Gr, NPG |
| 6 | 104094 - 104208 | GF, GS, Gr, M, NPG | GF, GS, M, NPG |
| 7 | 104285 - 104439 | GF, GS, M, NPG | GF, GS, Gr, M, NPG |
| 8 | 104519 - 104589 | GF, GS, M, NPG | GF, M, NPG |
| 9 | 104676 - 104790 | GF, M, NPG | GF, M, NPG |
| 10 | 105009 - 105167 | GF, GS, Gr, NPG | GF, GS, Gr |
Alternate exons not used in building the gene model. GenScan predicts an initial exon from 102506 to 102532; and internal exons from 102550 to 102698, from 102959 to 103229, from 103293 to 103411, and from 104519 to 104692. GRAIL predicts internal exons from 102364 to 102426, from 102959 to 103229, from 104094 to 104183 and from 104259 to 104439. MZEF predicts internal exons from 102356 to 102426, from 102647 to 102703, and from 102959 to 103229. NetPlantGene predicts several putative splice sites in this region. Those not used include splice donors at 102392 (confidence score = 1.00), at 102426 and at 102703 (0.93) and splice acceptors at 102959 and at 105471 (0.93).
CDS of T10M13.22
ATGAAACAGAGTAAAACCCTAGAATTCAATTCCTTTGCTCTGTTATGCAGCAGTCTTCAT TTCATCAAATTCAACACAGAAGATTCGATGAGTAGCTTCAACGGAGACTCCATGGATGAT TTCAAAGGCCTCCGAATAACTCAGCTTGATGATGATGACGATGATGAAACTGCGGTGGAA CCTATAAATATGGATGAATTTGATGATGATGATGAGGAAGATGATGAAGATTATGAACCT GTGATGTTAGGTTTCGTTGAGAGTCCTAAATTCGCATGGTCAAATCTTCGTCAACTGTTC CCTAATTTAGCTGGTGGTGTTCCGGCATGGTTGGATCCAGTTAATTTACCATCAGGGAAG TCAATTCTATGTGATCTATGTGAAGAACCTATGCAATTTGTACTTCAACTTTATGCTCCT TTAACAGACAAGGAATCAGCTTTTCATCGGACATTGTTTCTCTTCATGTGTCCATCTATG TCTTGTCTCCTTCGTGATCAACATGAACAATGGAAACGTGCCCCAGAGAAGGCTATGCGG AGTACGAAGGTTTTCCGTTGCCAATTGCCTCGAGCTAATCCTTTTTACTCGAGTGAGGCT CCAAAGCACGATGGAACAGACAAGCCATTGGGTCATGGAGCTCCGCTTTGCACTTGGTGT GGAACATGGAAAGGAGATAAACTATGTAGCGGCTGCAAAAATGCTCGATACTGTTCACCG AAACACCAGGCCCTGCATTGGCGCTTGGGTCATAAAACCGAATGCCAACAGCTTCGAACG GTTAGCGAAACATCTGACTCAGGCCCTGTCAATAATGGGGTTGCTCCAACTGAAAAGCAG AAAGTTGCGAGTAAGAGTTTATGGAAAGAATTTGTATTGATCAATGAGGATGAAAGTGAG TATGATACTGAGATGTCAGGAGATGATGAAGTAGCTAAACCTTTGGTCTCAAAGAGAGAA GTTGACGACCAAATGAAATCGCTAATGAACGATTTTGAGGGAGATGCTGATAAAAAGAAT TGGGTTAATTTCCAGCAACGCGTAGACAAAGCCCCTGAACAAGTTCTGAGATATTCCCGG AGCTCTGGTGCTAAACCCCTTTGGCCAATAGCAAGCGGACGAGTCTCTAAATCTGAGCTT CCCAGTTGCAAATCTTGTGGTGGTCCTCGTTGTTTTGAATTTCAGGTGATGCCGCAGCTA CTGTTCTTCTTCGGTGGGAAGAACGAGAGGGAATCTCTCGATTGGGCAACAATCGTGGTG TACACTTGTGAGAACTCGTGTGACTCGAGCTTAAGTTACAAGGAAGAGTTTGTCTGGGTT CAACTATACAGTCAGACAACTTAG
Protein sequence
MKQSKTLEFNSFALLCSLHFIKFNTEDSMSSFNGDSMDDFKGLRITQLDDDDDDETAVEP INMDEFDDDDEEDDEDYEPVMLGFVESPKFAWSNLRQLFPNLAGGVPAWLDPVNLPSGKS ILCDLCEEPMQFVLQLYAPLTDKESAFHRTLFLFMCPSMSCLLRDQHEQWKRAPEKAMRS TKVFRCQLPRANPFYSSEAPKHDGTDKPLGHGAPLCTWCGTWKGDKLCSGCKNARYCSPK HQALHWRLGHKTECQQLRTVSETSDSGPVNNGVAPTEKQKVASKSLWKEFVLINEDESEY DTEMSGDDEVAKPLVSKREVDDQMKSLMNDFEGDADKKNWVNFQQRVDKAPEQVLRYSRS SGAKPLWPIASGRVSKSELPSCKSCGGPRCFEFQVMPQLLFFFGGKNERESLDWATIVVY TCENSCDSSLSYKEEFVWVQLYSQTT*
Alignment of RP8 zinc-finger proteins:
The RP8 sequences from rat (M80601) and mouse (U10903) are zinc-finger proteins associated with programmed cell death; the human RP8 homolog (also called Programmed Cell Death 2, S78085) was cloned from fetal lung; and T10M13.22 was discovered by genomic sequencing of A. thaliana. Owens GR, et al. (1991, Mol. Cell. Biol. 11:4177-4188) report that mRNA levels of the Rat-RP8 gene accumulate after gamma radiation.
1 60
Rat-RP8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Mouse-RP8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
HumanPDCD2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
T10M13.22 MKQSKTLEFNSFALLCSLHFIKFNTEDSMSSFNGDSMDDFKGLRITQLDDDDDDETAVEP
61 120
Rat-RP8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Mouse-RP8 ~~~~~~~~~~~MAAAAPGPVELGFAEEAP.AWRLRSEQFPSKVGGRPAWLGLAELPGPGA
HumanPDCD2 ~~~~~~~~~~~MAAAGARPVELGFAESAP.AWRLRSEQFPSKVGGRPAWLGAAGLPGPQA
T10M13.22 INMDEFDDDDEEDDEDYEPVMLGFVESPKFAWSNLRQLFPNLAGGVPAWLDPVNLPSGKS
121 180
Rat-RP8 ~~~~~~~~PLAFLLQVYAPLPGRDDAFHRSLFLFCCREPLC................CAG
Mouse-RP8 LACARCGRPLAFLLQVYAPLPGRDDAFHRSLFLFCCREPLC................CAG
HumanPDCD2 LACELCGRPLSFLLQVYAPLPGRPDAFHRCIFLFCCREQPC................CAG
T10M13.22 ILCDLCEEPMQFVLQLYAPLTDKESAFHRTLFLFMCPSMSCLLRDQHEQWKRAPEKAMRS
181 240
Rat-RP8 LRVFRNQLPRKNAFYSYEPPSETGASDT.ECVCLQLKSGAHLCRVCGC.LAPMTCSRCKQ
Mouse-RP8 LRVFRNQLPRNNAFYSYEPPSETEALGT.ECVCLQLKSGAHLCRVCGC.LAPMTCSRCKQ
HumanPDCD2 LRVFRNQLPRKNDFYSYEPPSENPPPETGESVCLQLKSGAHLCRVCGC.LGPKTCSRCHK
T10M13.22 TKVFRCQLPRANPFYSSEAPKHDGTDK.......PLGHGAPLCTWCGTWKGDKLCSGCKN
241 300
Rat-RP8 AHYCSKEHQTLDWQLGHKQACTQSDHLDHMV...PDHNLLFP.............EFEIV
Mouse-RP8 AHYCSKEHQTLDWRLGHKQACTQSDKIDHMV...PDHNFLFP.............EFEIV
HumanPDCD2 AYYCSKEHQTLDWRLGHKQACAQPDHLDHII...PDHNFLFP.............EFEIV
T10M13.22 ARYCSPKHQALHWRLGHKTECQQLRTVSETSDSGPVNNGVAPTEKQKVASKSLWKEFVLI
301 360
Rat-RP8 TETEDEIGPEVVEMEDYSEVIGSMEGVPEEELDSMAKHESKED.HIFQKFKSKIALEPEQ
Mouse-RP8 TETEDEILPEVVEMEDYSEVTGSMGGIPEEELDSMAKHESKED.HIFQKFKSKIPLEPEQ
HumanPDCD2 IETEDEIMPEVVEKEDYSEIIGSMGEALEEGLDSMAKHESRED.KIFQKFKTQIALEPEQ
T10M13.22 NEDESEYDTEMSGDDEVAKPLVSKREVDDQMKSLMNDFEGDADKKNWVNFQQRVDKAPEQ
361 420
Rat-RP8 ILRYGR..GIKPIWISGENIPQEKDIPDC.SCGVKRIFEFQVMPQLLNHLKADRLGTSVD
Mouse-RP8 ILRYGR..GIKPIWISGENIPQEKDIPDC.PCGAKRIFEFQVMPQLLNHLKADRLGRSID
HumanPDCD2 ILRYGR..GIAPIWISGENIPQEKDIPDC.PCGAKRILEFQVMPQLLNYLKADRLGKSID
T10M13.22 VLRYSRSSGAKPLWPIASGRVSKSELPSCKSCGGPRCFEFQVMPQLLFFFGGKNERESLD
421 453
Rat-RP8 WGILAVFTCAESCSLGIGFTEEFVWKQDVTETP*
Mouse-RP8 WGVLAVFTCAESCSLGSGYTEEFVWKQDVTDTP*
HumanPDCD2 WGILAAFTCAESCSLGTGYTEEFVWKQDVTDTP*
T10M13.22 WATIVVYTCENSCDSSLSYKEEFVWVQLYSQTT*
written 31 Jul 97
updated 4 Aug 98
updated 16 Sep 98
Larry
Parnell