Gene T3F12.10
Putative Identification zinc-finger protein
Position 99046 to 101596, from the initial methionine to the termination codon
Strand +
EST match none exact
Database match weakly similar to RING Zn-finger proteins

 

CDS:  The table below lists the range of each exon and which gene prediction programs selected the termini of that particular exon (GS = GenScan, Gr = Grail, M = MZEF). Although there are no exact EST matches for T3F12.10, ESTs T21332 and T21607 have strong similarity to this region of T3F12. These two ESTs were used to confirm splice junctions selected for some exons (these splice sites are designated by est in the table).

Exon

Range

5'

3'

1 99046 - 99408 GS GS
2 100006 - 100529 GS,Gr,M GS,Gr,M (Grail predicts this as two separate exons)
3 100948 - 101047 Gr,M Gr,M
4 101127 - 101191 Gr,M est,Gr,M
5 101257 - 101359 est,GS,Gr,M est,GS,Gr,M
6 101453 - 101596 est,GS,Gr GS

CDS sequence from the initial methionine to the termination codon

ATGACGCGTGTCAACCAGCTTCCATGCGACTGTGTATCTACGGCGGAGGAATCTCTCACT
TCCGGCACGTGCATTACCCCTACGCACGTGACTTCCCTTTCGTCTCCTCTTGATCGCTCC
GGCGATGTTGATCCTCTTCCTGTTTCCGACGAATCTGGTGGCTCGAAGGCCGATGAATCT
ATGACTGACGCAGATGAGACTAAAAAGAGGAAACGGATACTCAGTGGTGATTGTGAAGCC
GATGAGAATAATAAGAGTGACGGAGAAATCGCGAGTCTCAATGACGGTGTTGATGCGTTT
ACTGCGATTTGTGAAGACTTGAATTGTTCTCTGTGTAATCAATTGCCTGATAGGCCCGTC
ACGATATTATGTGGACATAACTTTTGCTTGAAATGTTTTGACAAGTGGATTGATCAAGGG
AATCAAATTTGTGCTACATGTCGTAGCACAATTCCTGACAAAATGGCTGCCAATCCTCGT
GTTAACTCGTCTCTTGTGTCTGTTATCCGTTATGTAAAAGTTGCTAAAACTGCTGGTGTT
GGTACTGCAAATTTTTTTCCTTTTACAAGCAACCAAGACGGCCCAGAGAATGCCTTTAGA
ACCAAGCGCGCTAAAATCGGGGAGGAGAATGCTGCAAGGATATATGTTACCGTACCATTT
GATCACTTTGGTCCTATACCAGCTGAACATGATCCTGTCAGGAACCAAGGTGTTTTAGTT
GGAGAATCCTGGGAGAATCGAGTAGAGTGTCGGCAGTGGGGGGTTCACTTGCCACATGTT
TCTTGCATTGCTGGACAAGAAGACTATGGAGCTCAGTCTGTAGTAATATCTGGAGGTTAT
AAGGATGACGAGGATCATGGAGAATGGTTTCTATACACAGGAAGGAGGTCTTACAAGGAT
AGGTATTCTGCATATGCCCCTAAGGAAGGAGTGAGATATGATGGTGTATACAGGATCGAG
AAATGCTGGCGAAAAGCTAGATTCCCGGATTCATTTAAGGTCTGTCGTTACCTGTTTGTA
AGATGTGACAATGAGCCAGCTCCATGGAACAGTGATGAGAGTGGAGATCGTCCAAGACCT
TTGCCTAATATTCCAGAGCTTGAAACGGCCTCAGACCTGTTTGAGAGAAAGGAAAGTCCA
TCATGGGATTTTGATGAAGCCGAGGGCCGTTGGAGATGGATGAAGCCTCCACCTGCAAAT
CATGAGCAGAGGGAGAGAATGAAGATGGCTATGACATGTCTTCTCCTTTTTGTCCTTATC
ATTCTCGTTGGTTCATCATCTATCTTATATCAGTATTAG

 

Protein sequence:

MTRVNQLPCDCVSTAEESLTSGTCITPTHVTSLSSPLDRSGDVDPLPVSDESGGSKADES
MTDADETKKRKRILSGDCEADENNKSDGEIASLNDGVDAFTAICEDLNCSLCNQLPDRPV
TILCGHNFCLKCFDKWIDQGNQICATCRSTIPDKMAANPRVNSSLVSVIRYVKVAKTAGV
GTANFFPFTSNQDGPENAFRTKRAKIGEENAARIYVTVPFDHFGPIPAEHDPVRNQGVLV
GESWENRVECRQWGVHLPHVSCIAGQEDYGAQSVVISGGYKDDEDHGEWFLYTGRRSYKD
RYSAYAPKEGVRYDGVYRIEKCWRKARFPDSFKVCRYLFVRCDNEPAPWNSDESGDRPRP
LPNIPELETASDLFERKESPSWDFDEAEGRWRWMKPPPANHEQRERMKMAMTCLLLFVLI
ILVGSSSILYQY*

 

Alignment:  BLAST analysis indicated that T3F12.10 shows weak similarity to vertebrate zinc-finger proteins. T3F12.10 is aligned with X. laevis XNF7 and P. waltlii PWa33. It is clear from this alignment that T3P12.10 consists of three sub-domains: An amino-terminal peptide of weak similarity to the vertebrate zinc-finger proteins, a central peptide of marked similarity containing many conserved cysteine residues, followed by a carboxyl-terminus of very weak similarity. The central peptide fragment is underscored and conserved cysteines (note CxxC pairs) are highlighted.

           1                                                         60
    PWa33  MANSTVAEPEKMEQTALVKRMRKKKTWRETKTSVMTTKRRMWNQQRPSPSAVHTSVAVAN 
 XenoXNF7  ~~~~~~~~~~~~~~~~~~~~~~~~~~~MEEEEGADDGEQGEEEVLVVNVGSTYPCKRSDG 
 T3F12.10  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~MTRVNQLPCDCVST 

           61                                                       120
    PWa33  TLHAAEIIKTRKTKE.NAEEFYVHYVGLNRRQNEWVDKSRVLQAKQIKTEELNNTEDETN 
 XenoXNF7  SQHDAEIVKVRYNKQAGREEYYAHYVGLNRRQNEWVDKSRLVLTKPPKEGETNGTDQEVT 
 T3F12.10  AEE.........SLTSGTCITPTHVTSLSSPLDRSGDVDPLPVS...DESGGSKADESMT

           121                                                      180
    PWa33  GVSDQSEGKAARSNKRKIEDGDGDQKKRKVDDEE...........DDFTEDLTCPLCRSL 
 XenoXNF7  DTAEQPDSKTPQ..KRKIEEPEPEPKKAKVEEKDASKNASSLGAAGDFAEELTCPLCVEL 
 T3F12.10  DADETKK.......RKRILSGDCEADENNKSDGEIASLNDGVDAFTAICEDLNCSLCNQL

           181                                                      240
    PWa33  FKEPVILECGHNFCKHCIDKSWESASAFSCPECKEVLTERKYTTNRVLANLVKKAA...V 
 XenoXNF7  FKDPVMVACGHNFCRSCIDKAWEGQSSFACPECRESITDRKYTINRVLANLAKKAACTPV 
 T3F12.10  PDRPVTILCGHNFCLKCFDK.WIDQGNQICATCRSTIPDKMAANPRVNSSLVSVIRYVKV

           241                                                      300
    PWa33  GVKDKDVKPKEKCDEHDERLKLFCKDDGTLACVICRDSLKHSNHNFLPIQDAVGVYRDQL 
 XenoXNF7  TPVEKKTRPLEKCSEHDERLKLYCKDDGTLSCVICRDSLKHASHNFLPILDAVGVYREEL 
 T3F12.10  AKTAGV...................................GTANFFPFTSNQDGPENAF

           301                                                      360
    PWa33  IALVSPLETTMKENQKLKCDQSQKISLHRENIVDCKKHIECEFEKLHQFLREKEAKMVED 
 XenoXNF7  SAIVAPLEASLKVTEQLSSEQSDKIEQHNKNMSQYKEHITSEFEKLHKFLREREEKLLEQ 
 T3F12.10  RTKRAKIGEENAARIYVTVPFDHFG...........................PIPAEHDP

           361                                                      420
    PWa33  LNAEREGLLKDMEANLVKMTDNCEFIEEAISTTQSRLNESDPIAFLTDIKSFIEKCCEEH 
 XenoXNF7  LKEQGENLLTEMENNLVKMQESQDAIKKTISLAKERMEDTDSISFLMDIKAFIDKCQEQQ 
 T3F12.10  VRNQGV.LVGESWENRVECRQWGVHLPHVSCIAGQ..EDYGAQSVV..ISGGYKDDEDHG

           421                                                      480
    PWa33  RKGVPAESVLVNKELSQGRFNPGLQYIIWKELKSVVQPGLAPLTLDPNTAHPNLVLSEGL 
 XenoXNF7  RAVISTGNTLLSKELCQGTFKGPIQYIMWKELKSVVIPSLTPMLLDPTSAHPNLHLSDGL 
 T3F12.10  EWFLYTGRRSYKDRYSAYAPKEGVRY...............................DGV

           481                                                      540
    PWa33  TSVKYTDTKQQLPDNPKRFSQCILVLGAEGFDSGKHYWEVEVGNKTAWDVGMASESSNRK 
 XenoXNF7  TSVRYGENKLSLPDNPKRFSQCILVLGSQGFDSGRHYWEVEVGDKTAWDVGMASESSNRK 
 T3F12.10  YRIEKCWRKARFPDS...FKVC............RYLFVRCDNEPAPWN...SDESGDRP

           541                                                      600
    PWa33  GKIKLNPKNGYWAIWLRNGNAFKALESPSKTLNLTSKPSKIGVYLDYEGGQVSFYNADDM 
 XenoXNF7  GKIKLNPKNGYWAIWLRNGNAYKALESPSKSLSLSSHPRKIGVYVDYEGGQISFYNADDM 
 T3F12.10  RPLPNIPE......LETASDLFERKESPSWDFDEAE.....GRWRWMKPPPANHEQRERM

           601                                                      660
    PWa33  SPIYTFNGSFTEKLYPYLSPFLQDSGKNAEPLKLVHTKL*~~~~~~~~~~~~~~~~~~~~ 
 XenoXNF7  TIIYTFNATFTEKLYPYLSPFLHDSGKNVDPLRFVHNK*~~~~~~~~~~~~~~~~~~~~~ 
 T3F12.10  KMAMTCLLLFVLIILVGSSSILYQY*

 

___________________________________________________________________

created 15 Oct 97
Larry Parnell