| Gene | T13L16.8 |
| Putative Identification | zinc-finger protein |
| Position | 47666 - 50921, from the initial methionine to the termination codon |
| Strand | + |
| EST match | none |
| Database match | murine Bop, alternately spliced product encoding zinc finger-like motifs |
CDS: The table below lists the coordinates of the exons for T13L16.8 and which exon predicting algorithms selected the given termini (GS = GenScan, Gr = Grail, M = MZEF, NPG = NetPlantGene - selects splice sites only, not exons). Rice EST C72823 suggests three splice sites and these are designated by est. There are two possible choices for exon 12. The gene model selecting exon 12a aligns better to murine Bop than does that utilizing exon 12b; both exons cannot be incorporated into a single open reading frame.
Exon |
Range |
5' |
3' |
| 1 | 47666 - 47765 | GS | GS |
| 2 | 47844- 47989 | est, GS, M | est, GS, M |
| 3 | 48078 - 48212 | est, GS, Gr, M, NPG | GS, Gr, M, NPG |
| 4 | 48550 - 48624 | Gr, NPG | GS, NPG |
| 5 | 48721 - 48809 | GS, Gr, M, NPG | GS, Gr, M, NPG |
| 6 | 48874 - 48955 | GS, M, NPG | GS, M, NPG |
| 7 | 49045 - 49149 | GS, NPG | GS, NPG |
| 8 | 49241 - 49331 | GS, M, NPG | GS, M, NPG |
| 9 | 49413 - 49532 | GS, Gr, M, NPG | GS, Gr, M, NPG |
| 10 | 49651 - 49757 | GS, Gr, M, NPG | GS, Gr, M |
| 11 | 49852 - 49921 | GS, NPG | GS, Gr, NPG |
| 12a | 50253 - 50316 | GS, M, NPG | GS, M, NPG |
| 12b | 50532 - 50643 | Gr, M, NPG | Gr, M, NPG |
| 13 | 50762 - 50921 | GS, Gr, NPG | Gr |
Complete CDS of T13L16.8
ATGGCGGATTTGCAGAGATTTCTGCAAGATCGCTGCTTAGGCGTTTCGAATCTCCCACAG AAAGGCCGCTCTCTATTCACTGCCAGAGATTTTCGTCCAGGAGAAGTGATTCTAAGCCAA AAGCCATATATATGTGTTCCGAATAATACATCCTCGGAATCAAGATGTGACGGATGTTTC AAGACCAATAACCTTAAGAAATGTTCTGCTTGTCAAGTGGTTTGGTACTGTGGGAGCTCT TGCCAGAAGTCAGAGTGGAAGTTACATCGCGATGAATGCAAAGCTCTCACTAGACTTGAG AAGGAGAAACGGAAGTTTGTTACTCCTACAATACGTCTGATGGTTAGACTTTACATCAAG AGGAATTTGCAAAATGAAAAGATGGCTAATCTTGTGAACTTGATTCTTCAATTTCCTAGT GTTGACCTAAGAGAGATCGCCGAGAACTTTTCAAAGTTCTCATGCAATGCTCATAGCATT TGTGATAGCGAATTGAGACCTCAAGGGATTGGATTGTTTCCATTGGTTTCCATCATTAAT CACAGCTGCTCTCCCAATGCGGTTTTAGTGTTTGAGGAGCAGATGGCTGTTGTTCGGGCG ATGGATAACATATCAAAGGATTCAGAGATAACTATCAGTTATATTGAAACCGCTGGAAGC ACTCTGACTCGGCAGAAGTCTCTGAAAGAACAATACCTCTTCCACTGTCAGTGTGCCCGT TGCAGTAACTTTGGAAAACCTCATGATATTGAAGAAAGTGCAATATTGGAAGGCTATCGG TGTGCCAATGAGAAATGCACCGGTTTCTTACTCCGTGATCCTGAAGAAAAAGGCTTCGTT TGCCAGAAATGCTTGCTTCTTAGGAGCAAGGAAGAGGTTAAAAAGTTAGCAAGTGATCTC AAAACAGTTTCAGAGAAGGCTCCTACATCTCCTTCCGCAGAAGATAAACAAGCCGCTATT GAACTATATAAGACAATTGAGAAACTACAAGTTAAGCTTTACCATTCTTTCTCCATCCCT TTAATGAGAACCCGTGAAAAACTTCTCAAGATGCTAATGGACGTAGAAATCTGGAGAGAA GCTTTGAATTACTGCAGACTAATAGTCCCTGTTTACCAAAGAGTATATCCGGCAACCCAT CCTTTGATTGGACTGCAGTTCTATACCCAAGGAAAACTCGAATGGTTGCTGGGGGAAACC AAAGAGGCGGTGAGTTCATTGATTAAGGCATTTGACATTCTGCGGATCAGCCATGGAATA AGCACACCTTTCATGAAAGAGCTCTCAGCAAAGTTGGAGGAAGCTCGTGCAGAAGCTTCT TATAAGCAGCTTGCATTGCATTGA
Protein sequence of T13L16.8
MADLQRFLQDRCLGVSNLPQKGRSLFTARDFRPGEVILSQKPYICVPNNTSSESRCDGCF KTNNLKKCSACQVVWYCGSSCQKSEWKLHRDECKALTRLEKEKRKFVTPTIRLMVRLYIK RNLQNEKMANLVNLILQFPSVDLREIAENFSKFSCNAHSICDSELRPQGIGLFPLVSIIN HSCSPNAVLVFEEQMAVVRAMDNISKDSEITISYIETAGSTLTRQKSLKEQYLFHCQCAR CSNFGKPHDIEESAILEGYRCANEKCTGFLLRDPEEKGFVCQKCLLLRSKEEVKKLASDL KTVSEKAPTSPSAEDKQAAIELYKTIEKLQVKLYHSFSIPLMRTREKLLKMLMDVEIWRE ALNYCRLIVPVYQRVYPATHPLIGLQFYTQGKLEWLLGETKEAVSSLIKAFDILRISHGI STPFMKELSAKLEEARAEASYKQLALH*
BLAST output showing match to murine Bop. A putative zinc-finger motif identified with ProDom is highlighted.
gb|U76374|MMU76374 Mus musculus skm-BOP2 (Bop) mRNA, complete cds
Length = 3458
Score = 155 bits (388), Expect = 1e-36
Query: 21 KGRSLFTARDFRPGEVILSQKPYICVPNNTSSESRCDGCFKTNN-LKKCSACQVVWYCGS 79
KGR L ++F +VI +++ Y V ++ C CFK L +C C+ YC
Sbjct: 1500 KGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFKRQEKLHRCGQCKFAHYCDR 1679
Query: 80 SCQKSEWKLHRDECKALTRLEKEKRKFVTPTIRLMVR------------LYIKRNLQN-- 125
+CQK W H++EC A+ + K + + R+M R + +LQN
Sbjct: 1680 TCQKDAWLNHKNECAAIKKYGKVPNENIRLAARIMWRVEREGTGLTEGCMVSVDDLQNHV 1859
Query: 126 -----EKMANL---VNLILQF-----PSVDLREIAENFSKFSCNAHSICDSE-LRPQGIG 171
E+ L V+ LQ+ ++ I+ F +CN ++ D L+ G+G
Sbjct: 1860 EHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVG 2039
Query: 172 LFPLVSIINHSCSPNAVLVFEEQMAVVRAMDNISKDSEITISYIETAGSTLTRQKSLKEQ 231
+FP + ++NH C PN ++F +RA+ IS+ E+T+SYI+ + R++ LK+Q
Sbjct: 2040 IFPNLGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLHLSEERRRQLKKQ 2219
Query: 232 YLFHCQCARCSNFGKPHDIEESAILEGYRCANEKCTGFLLRDPEEKGFVCQKCLLLRSKE 291
Y F C C C G D+ + A E DP+ S+E
Sbjct: 2220 YYFDCSCEHCQK-GLKDDL--------FLAAKE--------DPKP------------SQE 2312
Query: 292 EVKKLASDLKTVSEKAPTSPSAEDKQAAIELYKTIEKLQVKLYHSFSIPLMRTREKLLKM 351
VK++ K EK + S ++L + + Q ++ ++ ++R ++
Sbjct: 2313 VVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYVLRLLSIASEV 2492
Query: 352 LMDVEIWREALNYCRLIVPVYQRVYPATHPLIGLQFYTQGKLEWLLGETKEAVSSLIKAF 411
L ++ + EA +Y R +V Y ++Y + +G+ G W G + + KA+
Sbjct: 2493 LSYLQAYEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHIEVGHGMICKAY 2672
Query: 412 DILRISHGISTPFMKELSAKLEEARAE 438
IL ++HG S P K+L A + E
Sbjct: 2673 AILLVTHGPSHPITKDLEAMRMQTEME 2753
Sequence of T13L16.8 protein from incorporation of
alternate exon 12b
The peptide encoded by this alternate exon is
shown in bold.
MADLQRFLQDRCLGVSNLPQKGRSLFTARDFRPGEVILSQKPYICVPNNTSSESRCDGCF KTNNLKKCSACQVVWYCGSSCQKSEWKLHRDECKALTRLEKEKRKFVTPTIRLMVRLYIK RNLQNEKMANLVNLILQFPSVDLREIAENFSKFSCNAHSICDSELRPQGIGLFPLVSIIN HSCSPNAVLVFEEQMAVVRAMDNISKDSEITISYIETAGSTLTRQKSLKEQYLFHCQCAR CSNFGKPHDIEESAILEGYRCANEKCTGFLLRDPEEKGFVCQKCLLLRSKEEVKKLASDL KTVSEKAPTSPSAEDKQAAIELYKTIEKLQVKLYHSFSIPLMRTREKLLKMLMDVEIWRE ALNYCRLIVPVYQIREVGTRRDWLVLGGTWLISLQDIPKLESLWKETFVYLLLGETKEAV SSLIKAFDILRISHGISTPFMKELSAKLEEARAEASYKQLALH*
written 18 Dec 97
Larry Parnell