| Gene | T3F12.10 |
| Putative Identification | zinc-finger protein |
| Position | 99046 to 101596, from the initial methionine to the termination codon |
| Strand | + |
| EST match | none exact |
| Database match | weakly similar to RING Zn-finger proteins |
CDS: The table below lists the range of each exon and which gene prediction programs selected the termini of that particular exon (GS = GenScan, Gr = Grail, M = MZEF). Although there are no exact EST matches for T3F12.10, ESTs T21332 and T21607 have strong similarity to this region of T3F12. These two ESTs were used to confirm splice junctions selected for some exons (these splice sites are designated by est in the table).
Exon |
Range |
5' |
3' |
| 1 | 99046 - 99408 | GS | GS |
| 2 | 100006 - 100529 | GS,Gr,M | GS,Gr,M (Grail predicts this as two separate exons) |
| 3 | 100948 - 101047 | Gr,M | Gr,M |
| 4 | 101127 - 101191 | Gr,M | est,Gr,M |
| 5 | 101257 - 101359 | est,GS,Gr,M | est,GS,Gr,M |
| 6 | 101453 - 101596 | est,GS,Gr | GS |
CDS sequence from the initial methionine to the termination codon
ATGACGCGTGTCAACCAGCTTCCATGCGACTGTGTATCTACGGCGGAGGAATCTCTCACT TCCGGCACGTGCATTACCCCTACGCACGTGACTTCCCTTTCGTCTCCTCTTGATCGCTCC GGCGATGTTGATCCTCTTCCTGTTTCCGACGAATCTGGTGGCTCGAAGGCCGATGAATCT ATGACTGACGCAGATGAGACTAAAAAGAGGAAACGGATACTCAGTGGTGATTGTGAAGCC GATGAGAATAATAAGAGTGACGGAGAAATCGCGAGTCTCAATGACGGTGTTGATGCGTTT ACTGCGATTTGTGAAGACTTGAATTGTTCTCTGTGTAATCAATTGCCTGATAGGCCCGTC ACGATATTATGTGGACATAACTTTTGCTTGAAATGTTTTGACAAGTGGATTGATCAAGGG AATCAAATTTGTGCTACATGTCGTAGCACAATTCCTGACAAAATGGCTGCCAATCCTCGT GTTAACTCGTCTCTTGTGTCTGTTATCCGTTATGTAAAAGTTGCTAAAACTGCTGGTGTT GGTACTGCAAATTTTTTTCCTTTTACAAGCAACCAAGACGGCCCAGAGAATGCCTTTAGA ACCAAGCGCGCTAAAATCGGGGAGGAGAATGCTGCAAGGATATATGTTACCGTACCATTT GATCACTTTGGTCCTATACCAGCTGAACATGATCCTGTCAGGAACCAAGGTGTTTTAGTT GGAGAATCCTGGGAGAATCGAGTAGAGTGTCGGCAGTGGGGGGTTCACTTGCCACATGTT TCTTGCATTGCTGGACAAGAAGACTATGGAGCTCAGTCTGTAGTAATATCTGGAGGTTAT AAGGATGACGAGGATCATGGAGAATGGTTTCTATACACAGGAAGGAGGTCTTACAAGGAT AGGTATTCTGCATATGCCCCTAAGGAAGGAGTGAGATATGATGGTGTATACAGGATCGAG AAATGCTGGCGAAAAGCTAGATTCCCGGATTCATTTAAGGTCTGTCGTTACCTGTTTGTA AGATGTGACAATGAGCCAGCTCCATGGAACAGTGATGAGAGTGGAGATCGTCCAAGACCT TTGCCTAATATTCCAGAGCTTGAAACGGCCTCAGACCTGTTTGAGAGAAAGGAAAGTCCA TCATGGGATTTTGATGAAGCCGAGGGCCGTTGGAGATGGATGAAGCCTCCACCTGCAAAT CATGAGCAGAGGGAGAGAATGAAGATGGCTATGACATGTCTTCTCCTTTTTGTCCTTATC ATTCTCGTTGGTTCATCATCTATCTTATATCAGTATTAG
Protein sequence:
MTRVNQLPCDCVSTAEESLTSGTCITPTHVTSLSSPLDRSGDVDPLPVSDESGGSKADES MTDADETKKRKRILSGDCEADENNKSDGEIASLNDGVDAFTAICEDLNCSLCNQLPDRPV TILCGHNFCLKCFDKWIDQGNQICATCRSTIPDKMAANPRVNSSLVSVIRYVKVAKTAGV GTANFFPFTSNQDGPENAFRTKRAKIGEENAARIYVTVPFDHFGPIPAEHDPVRNQGVLV GESWENRVECRQWGVHLPHVSCIAGQEDYGAQSVVISGGYKDDEDHGEWFLYTGRRSYKD RYSAYAPKEGVRYDGVYRIEKCWRKARFPDSFKVCRYLFVRCDNEPAPWNSDESGDRPRP LPNIPELETASDLFERKESPSWDFDEAEGRWRWMKPPPANHEQRERMKMAMTCLLLFVLI ILVGSSSILYQY*
Alignment: BLAST analysis indicated that T3F12.10 shows weak similarity to vertebrate zinc-finger proteins. T3F12.10 is aligned with X. laevis XNF7 and P. waltlii PWa33. It is clear from this alignment that T3P12.10 consists of three sub-domains: An amino-terminal peptide of weak similarity to the vertebrate zinc-finger proteins, a central peptide of marked similarity containing many conserved cysteine residues, followed by a carboxyl-terminus of very weak similarity. The central peptide fragment is underscored and conserved cysteines (note CxxC pairs) are highlighted.
1 60
PWa33 MANSTVAEPEKMEQTALVKRMRKKKTWRETKTSVMTTKRRMWNQQRPSPSAVHTSVAVAN
XenoXNF7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~MEEEEGADDGEQGEEEVLVVNVGSTYPCKRSDG
T3F12.10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~MTRVNQLPCDCVST
61 120
PWa33 TLHAAEIIKTRKTKE.NAEEFYVHYVGLNRRQNEWVDKSRVLQAKQIKTEELNNTEDETN
XenoXNF7 SQHDAEIVKVRYNKQAGREEYYAHYVGLNRRQNEWVDKSRLVLTKPPKEGETNGTDQEVT
T3F12.10 AEE.........SLTSGTCITPTHVTSLSSPLDRSGDVDPLPVS...DESGGSKADESMT
121 180
PWa33 GVSDQSEGKAARSNKRKIEDGDGDQKKRKVDDEE...........DDFTEDLTCPLCRSL
XenoXNF7 DTAEQPDSKTPQ..KRKIEEPEPEPKKAKVEEKDASKNASSLGAAGDFAEELTCPLCVEL
T3F12.10 DADETKK.......RKRILSGDCEADENNKSDGEIASLNDGVDAFTAICEDLNCSLCNQL
181 240
PWa33 FKEPVILECGHNFCKHCIDKSWESASAFSCPECKEVLTERKYTTNRVLANLVKKAA...V
XenoXNF7 FKDPVMVACGHNFCRSCIDKAWEGQSSFACPECRESITDRKYTINRVLANLAKKAACTPV
T3F12.10 PDRPVTILCGHNFCLKCFDK.WIDQGNQICATCRSTIPDKMAANPRVNSSLVSVIRYVKV
241 300
PWa33 GVKDKDVKPKEKCDEHDERLKLFCKDDGTLACVICRDSLKHSNHNFLPIQDAVGVYRDQL
XenoXNF7 TPVEKKTRPLEKCSEHDERLKLYCKDDGTLSCVICRDSLKHASHNFLPILDAVGVYREEL
T3F12.10 AKTAGV...................................GTANFFPFTSNQDGPENAF
301 360
PWa33 IALVSPLETTMKENQKLKCDQSQKISLHRENIVDCKKHIECEFEKLHQFLREKEAKMVED
XenoXNF7 SAIVAPLEASLKVTEQLSSEQSDKIEQHNKNMSQYKEHITSEFEKLHKFLREREEKLLEQ
T3F12.10 RTKRAKIGEENAARIYVTVPFDHFG...........................PIPAEHDP
361 420
PWa33 LNAEREGLLKDMEANLVKMTDNCEFIEEAISTTQSRLNESDPIAFLTDIKSFIEKCCEEH
XenoXNF7 LKEQGENLLTEMENNLVKMQESQDAIKKTISLAKERMEDTDSISFLMDIKAFIDKCQEQQ
T3F12.10 VRNQGV.LVGESWENRVECRQWGVHLPHVSCIAGQ..EDYGAQSVV..ISGGYKDDEDHG
421 480
PWa33 RKGVPAESVLVNKELSQGRFNPGLQYIIWKELKSVVQPGLAPLTLDPNTAHPNLVLSEGL
XenoXNF7 RAVISTGNTLLSKELCQGTFKGPIQYIMWKELKSVVIPSLTPMLLDPTSAHPNLHLSDGL
T3F12.10 EWFLYTGRRSYKDRYSAYAPKEGVRY...............................DGV
481 540
PWa33 TSVKYTDTKQQLPDNPKRFSQCILVLGAEGFDSGKHYWEVEVGNKTAWDVGMASESSNRK
XenoXNF7 TSVRYGENKLSLPDNPKRFSQCILVLGSQGFDSGRHYWEVEVGDKTAWDVGMASESSNRK
T3F12.10 YRIEKCWRKARFPDS...FKVC............RYLFVRCDNEPAPWN...SDESGDRP
541 600
PWa33 GKIKLNPKNGYWAIWLRNGNAFKALESPSKTLNLTSKPSKIGVYLDYEGGQVSFYNADDM
XenoXNF7 GKIKLNPKNGYWAIWLRNGNAYKALESPSKSLSLSSHPRKIGVYVDYEGGQISFYNADDM
T3F12.10 RPLPNIPE......LETASDLFERKESPSWDFDEAE.....GRWRWMKPPPANHEQRERM
601 660
PWa33 SPIYTFNGSFTEKLYPYLSPFLQDSGKNAEPLKLVHTKL*~~~~~~~~~~~~~~~~~~~~
XenoXNF7 TIIYTFNATFTEKLYPYLSPFLHDSGKNVDPLRFVHNK*~~~~~~~~~~~~~~~~~~~~~
T3F12.10 KMAMTCLLLFVLIILVGSSSILYQY*
___________________________________________________________________
created 15 Oct 97
Larry Parnell