| Gene | T13L16.7 |
| Putative Identification | Tal-1-like reverse transcriptase |
| Position | 41812 - 45909, from within the CDS to the termination codon |
| Strand | - |
| EST match | none |
| Database match | reverse transcriptase encoded by Tal-1 retrotransposon |
The putative zinc-finger protein encoded by T13L16.6 and the putative reverse transcriptase encoded by T13L16.7 may together constitute a non-LTR retrotransposon similar to Tal-1.
CDS: The table below lists the coordinates of the single T13L16.7 exon. This exon is one long open reading frame (4098 bp, including the termination codon) and was not predicted by any of the algorithms tested. The 5' region of T13L16.7 is not defined, so the CDS is partial.
| Exon | Range | 5' | 3' |
| 1 | 41812 - 45909 | | |
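As a quick consistency check on these coordinates, the sketch below computes the exon length and shows one way a minus-strand feature could be pulled out of a clone sequence. The helper names and the toy sequence are illustrative assumptions, not part of the original annotation; coordinates are assumed to be 1-based and inclusive.

```python
# Illustrative sketch; helper names are hypothetical, not from the annotation.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def feature_length(start, end):
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1

def extract_minus_strand(clone_seq, start, end):
    """Slice start..end (1-based, inclusive) and reverse-complement the slice."""
    region = clone_seq[start - 1:end]
    return region.translate(COMPLEMENT)[::-1]

# Coordinates of the single T13L16.7 exon, taken from the table above.
length = feature_length(41812, 45909)
codons, remainder = divmod(length, 3)
print(length, codons, remainder)  # 4098 1366 0

# Toy sequence (an assumption, for demonstration only).
print(extract_minus_strand("AAACCGTTT", 4, 7))  # ACGG
```

Under these assumptions the exon spans 4098 nt, or 1366 codons including the termination codon, which agrees with a 1365-residue protein followed by a stop.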
Partial CDS of T13L16.7
GGCGTCATTAGCTGGAATTGTCAAGGTCTGAGGAATCCTTGGACAATTCGATACTTGAAG GAAATGAAAAAGGATCATTTTCCAGACATTCTATTCCTCATGGAGACAAAAAATTCTCAA GATTTTGTTTATAAAGTTTTTTGTTGGCTTGGTTATGATTTTATACATACAGTAGAACCT GAAGGAAGAAGTGGTGGCTTAGCAATTTTCTGGAAAAGCCATCTGGAGATTGAGTTTCTT TATGCTGATAAAAATCTTATGGATCTTCAGGTTTCTTCTAGAAATAAAGTTTGGTTTATT TCTTGTGTTTATGGACTTCCGGTTACCCATATGCGACCTAAACTATGGGAACACCTGAAT TCTATTGGTCTCAAGAGAGCAGAAGCATGGTGTTTAATAGGAGACTTCAATGATATTAGG TCGAATGATGAAAAATTAGGAGGACCAAGACGATCACCTTCTTCTTTTCAATGCTTTGAA CACATGTTACTCAACTGTTCTATGCATGAATTAGGAAGTACAGGTAATAGTTTCACTTGG GGAGGAAACAGAAATGATCAGTGGGTTCAGTGTAAATTGGATCGATGCTTTGGAAATCCA GCTTGGTTTTCTATTTTCCCTAATGCTCATCAGTGGTTTTTAGAGAAGTTTGGATCTGAT CATCGCCCGGTATTGGTCAAATTCACTAATGACAATGAGCTTTTTCGTGGACAATTTCGT TATGATAAAAGACTTGATGACGATCCCTACTGTATTGAGGTTATACATCGTTCTTGGAAT AGTGCAATGTCTCAAGGTACTCACTCTTCTTTTTTCAGTCTTATTGAATGTCGTAGAGCT ATCAGTGTATGGAAACATTCGTCTGATACTAATGCTCAAAGTAGAATTAAGAGATTGAGA AAAGATTTAGATGCAGAGAAGAGTATTCAAATTCCATGTTGGCCTAGAATTGAGTATATC AAGGATCAACTGAGTTTAGCATATGGTGATGAAGAATTGTTTTGGAGACAAAAAAGCAGA CAAAAATGGCTTGCAGGGGGTGACAAGAATACTGGATTCTTTCATGCTACTGTACACTCT GAGAGATTAAAGAATGAGTTAAGCTTTCTACTTGATGAGAATGATCAGGAATTTACAAGA AACAGTGATAAAGGAAAAATTGCATCTTCCTTTTTTGAGAATCTGTTCACCTCTACGTAT ATATTGACTCATAACAATCACCTGGAAGGTCTCCAAGCAAAGGTAACATCAGAAATGAAT CACAATTTAATTCAAGAGGTCACTGAACTGGAAGTGTATAATGCAGTATTCTCTATTAAC AAGGAAAGTGCTCCAGGACCTGATGGTTTTACTGCTTTGTTCTTTCAACAACATTGGGAT TTAGTGAAACATCAGATCTTAACTGAGATTTTTGGTTTCTTTGAGACTGGAGTTCTACCC CAGGATTGGAATCACACTCATATTTGTCTCATTCCTAAGATCACTAGCCCACAGAGAATG TCAGATCTTCGACCTATAAGTTTGTGTTCTGTGCTATACAAGATAATATCGAAGATTTTG ACTCAAAGATTGAAAAAACATCTTCCAGCTATTGTTTCTACAACGCAATCTGCATTTGTT CCTCAACGCTTGATATCTGATAATATATTGGTTGCTCATGAAATGATTCATAGTCTCAGA ACCAATGATCGAATATCTAAGGAACATATGGCTTTCAAGACTGACATGTCTAAGGCTTAT GACAGGGTTGAGTGGCCTTTTTTGGAAACTATGATGACTGCTCTTGGTTTTAACAACAAA TGGATTTCCTGGATCATGAATTGTGTAACTTCAGTCTCTTACTCAGTCTTGATCAATGGA CAGCCTTATGGCCATATCATTCCAACTCGTGGAATTCGACAAGGAGATCCACTTTCACCG 
GCCTTGTTTGTACTTTGCACTGAAGCTTTGATTCATATTCTCAACAAGGCAGAACAAGCA GGAAAAATCACGGGTATTCAGTTTCAAGACAAGAAAGTTTCAGTAAATCATTTACTATTT GCTGATGACACTCTTCTAATGTGTAAAGCTACAAAACAAGAGTGTGAAGAGCTAATGCAA TGCTTATCTCAGTATGGTCAATTATCTGGTCAAATGATCAACCTGAATAAATCTGCAATC ACTTTTGGGAAAAATGTTGATATTCAAATTAAAGATTGGATAAAGTCTAGATCAGGTATT TCTTTGGAAGGTGGAACAGGAAAATACCTTGGTTTACCTGAATGTTTAAGTGGTTCAAAA AGGGATTTATTTGGGTTTATTAAGGAAAAATTGCAATCAAGACTTACAGGTTGGTATGCC AAAACATTATCTCAAGGAGGCAAGGAAGTTTTACTCAAATCCATTGCTTTAGCTCTTCCT GTTTATGTCATGTCCTGCTTCAAATTACCAAAGAACTTATGTCAAAAGCTAACCACTGTG ATGATGGATTTCTGGTGGAATAGTATGCAACAGAAAAGGAAAATTCATTGGCTTAGTTGG CAAAGGTTAACACTTCCAAAGGATCAGGGTGGATTTGGTTTCAAAGATTTACAATGTTTC AATCAAGCTCTATTGGCCAAACAAGCATGGAGAGTACTACAGGAGAAAGGAAGTCTTTTT TCCAGGGTTTTTCAAAGTAGGTACTTCTCTAATTCTGATTTTCTTTCTGCTACCAGAGGA TCTAGACCTTCATATGCTTGGAGAAGCATATTATTTGGAAGAGAGCTACTTATGCAAGGT TTAAGAACAGTGATTGGGAACGGGCAAAAGACCTTTGTATGGACTGACAAGTGGTTACAT GATGGTTCTAATAGACGACCTCTGAATAGGAGACGCTTTATTAATGTTGATTTGAAAGTC AGTCAATTGATTGATCCGACATCTAGGAACTGGAATCTTAATATGCTTCGGGATTTGTTT CCCTGGAAAGATGTTGAGATCATCCTCAAGCAAAGACCACTGTTTTTTAAAGAAGACTCA TTTTGTTGGTTGCATTCTCACAATGGATTATATTCTGTTAAAACTGGATATGAATTCCTA AGTAAACAAGTTCATCATCGTTTGTATCAAGAAGCTAAAGTTAAGCCTTCTGTCAATTCT CTTTTTGACAAAATATGGAATCTGCATACGGCTCCAAAGATCAGAATTTTTCTATGGAAA GCTTTACATGGTGCAATCCCTGTTGAAGACAGACTTCGAACAAGAGGTATTAGAAGTGAT GATGGCTGTTTGATGTGTGATACAGAAAATGAAACCATCAATCATATTTTGTTTGAATGT CCTCTAGCTAGACAAGTCTGGGCAATTACTCATTTATCATCTGCAGGGTCTGAGTTTTCA AATTCTGTTTATACTAATATGAGTAGATTGATTGACTTAACTCAGCAAAATGATCTTCCT CATCATTTGCGGTTTGTTAGTCCATGGATTCTTTGGTTTTTATGGAAAAACAGGAATGCA TTGCTATTCGAAGGAAAAGGCTCAATAACAACTACTCTAGTTGACAAAGCTTATGAAGCA TATCATGAATGGTTTTCAGCTCAAACACACATGCAAAATGATGAAAAACATTTGAAGATC ACGAAATGGTGTCCACCGTTGCCTGGTGAATTGAAGTGTAATATTGGTTTTGCCTGGTCA AAACAACATCACTTTTCGGGTGCATCTTGGGTGGTACGTGATTCACAAGGAAAAGTCTTA TTGCATAGTCGCAGATCTTTTAATGAGGTACATTCTCCTTACTCTGCTAAGATAAGAAGC TGGGAATGGGCATTAGAATCTATGACTCATCATCACTTTGATAGAGTCATTTTTGCTTCC 
TCAACACATGAGATTATTCAAGCCTTACACAAACCACATGAATGGCCTCTCCTATTGGGT GATATTTCTGAGCTTTTAAGCTTCACTAAAGACAAACCACACTGGTTTCTCTGTATGGAA CCCTTTTGTTGTAACAGAGGTGCGAATGCTATTGCCACGAGTGTCATCACAGGATGCAGA TTTCAATCATACGTGGCTAGAGGTTATCCATCGTGGATGACAAATGTTTTTACTGCTGAG AGAGGTAACTTGATTTGA
Protein sequence of T13L16.7
GVISWNCQGLRNPWTIRYLKEMKKDHFPDILFLMETKNSQDFVYKVFCWLGYDFIHTVEP EGRSGGLAIFWKSHLEIEFLYADKNLMDLQVSSRNKVWFISCVYGLPVTHMRPKLWEHLN SIGLKRAEAWCLIGDFNDIRSNDEKLGGPRRSPSSFQCFEHMLLNCSMHELGSTGNSFTW GGNRNDQWVQCKLDRCFGNPAWFSIFPNAHQWFLEKFGSDHRPVLVKFTNDNELFRGQFR YDKRLDDDPYCIEVIHRSWNSAMSQGTHSSFFSLIECRRAISVWKHSSDTNAQSRIKRLR KDLDAEKSIQIPCWPRIEYIKDQLSLAYGDEELFWRQKSRQKWLAGGDKNTGFFHATVHS ERLKNELSFLLDENDQEFTRNSDKGKIASSFFENLFTSTYILTHNNHLEGLQAKVTSEMN HNLIQEVTELEVYNAVFSINKESAPGPDGFTALFFQQHWDLVKHQILTEIFGFFETGVLP QDWNHTHICLIPKITSPQRMSDLRPISLCSVLYKIISKILTQRLKKHLPAIVSTTQSAFV PQRLISDNILVAHEMIHSLRTNDRISKEHMAFKTDMSKAYDRVEWPFLETMMTALGFNNK WISWIMNCVTSVSYSVLINGQPYGHIIPTRGIRQGDPLSPALFVLCTEALIHILNKAEQA GKITGIQFQDKKVSVNHLLFADDTLLMCKATKQECEELMQCLSQYGQLSGQMINLNKSAI TFGKNVDIQIKDWIKSRSGISLEGGTGKYLGLPECLSGSKRDLFGFIKEKLQSRLTGWYA KTLSQGGKEVLLKSIALALPVYVMSCFKLPKNLCQKLTTVMMDFWWNSMQQKRKIHWLSW QRLTLPKDQGGFGFKDLQCFNQALLAKQAWRVLQEKGSLFSRVFQSRYFSNSDFLSATRG SRPSYAWRSILFGRELLMQGLRTVIGNGQKTFVWTDKWLHDGSNRRPLNRRRFINVDLKV SQLIDPTSRNWNLNMLRDLFPWKDVEIILKQRPLFFKEDSFCWLHSHNGLYSVKTGYEFL SKQVHHRLYQEAKVKPSVNSLFDKIWNLHTAPKIRIFLWKALHGAIPVEDRLRTRGIRSD DGCLMCDTENETINHILFECPLARQVWAITHLSSAGSEFSNSVYTNMSRLIDLTQQNDLP HHLRFVSPWILWFLWKNRNALLFEGKGSITTTLVDKAYEAYHEWFSAQTHMQNDEKHLKI TKWCPPLPGELKCNIGFAWSKQHHFSGASWVVRDSQGKVLLHSRRSFNEVHSPYSAKIRS WEWALESMTHHHFDRVIFASSTHEIIQALHKPHEWPLLLGDISELLSFTKDKPHWFLCME PFCCNRGANAIATSVITGCRFQSYVARGYPSWMTNVFTAERGNLI*
BLAST output demonstrating the similarity between T13L16.7 and the reverse transcriptase of Tal-1
gb|L47193|ATHRETR Arabidopsis thaliana (clone
DW15) zinc-finger protein gene and
reverse transcriptase gene,
complete cds.
Length = 7808
Score = 1251 bits (3201), Expect(2) = 0.0
Query: 2 VISWNCQGL--RNPWTIRYLKEMKKDHFPDILFLMETKNSQDFVYKVFCWLGYDFIHTVE 59
++SWNCQGL TI L EM+ HFP++LFLMETKN + V + WLGY+ + TV
Sbjct: 2909 LVSWNCQGLGWSQDLTIPRLMEMRLSHFPEVLFLMETKNCSNVVVDLQEWLGYERVFTVN 3088
Query: 60 PEGRSGGLAIFWKSHLEIEFLYADKNLMDLQVSSRNKVWFISCVYGLPVTHMRPKLWEHL 119
P G SGGLA+FWK ++I YADKNL+D Q+ + +++SCVYG P + +WE +
Sbjct: 3089 PIGLSGGLALFWKKGVDIVIKYADKNLIDFQIQFGSHEFYVSCVYGNPAFSDKHLVWEKI 3268
Query: 120 NSIGLKRAEAWCLIGDFNDIRSNDEKLGGPRRSPSSFQCFEHMLLNCSMHELGSTGNSFT 179
IG+ R E WC++GDFN I N EK GGPRR SSF F ML +C M EL S GN FT
Sbjct: 3269 TRIGINRKEPWCMLGDFNPILHNGEKRGGPRRGDSSFLPFTDMLDSCDMLELPSIGNPFT 3448
Query: 180 WGGNRNDQWVQCKLDRCFGNPAWFSIFPNAHQWFLEKFGSDHRPVLVKFTNDNELFRGQF 239
WGG N+ W+Q +LDRCFGN WF FP ++Q FL+K GSDHRPVLV+ T E +RG F
Sbjct: 3449 WGGKTNEMWIQSRLDRCFGNKNWFRFFPISNQEFLDKRGSDHRPVLVRLTKTKEEYRGNF 3628
Query: 240 RYDKRLDDDPYCIEVIHRSWNSAMSQGTHSSFFSLIECRRAISVWKHSSDTNAQSRIKRL 299
R+DKRL + P E I ++WN + L CR A+S WK ++ N+ +RI +
Sbjct: 3629 RFDKRLFNQPNVKETIVQAWNGSQRNENLLVLDKLKHCRSALSRWKKENNINSSTRITQA 3808
Query: 300 RKDLDAEKSIQIPCWPRIEYIKDQLSLAYGDEELFWRQKSRQKWLAGGDKNTGFFHATVH 359
R L+ E+S P + +K+ L A DEE+FW QKSR KW+ GDKNT FFHA+V
Sbjct: 3809 RAALELEQSSGFPRADLVFSLKNDLCKANHDEEVFWSQKSRAKWMHSGDKNTSFFHASVK 3988
Query: 360 SERLKNELSFLLDENDQEFTRNSDKGKIASSFFENLFTSTYILTHNNHLEGLQAKVTSEM 419
R K + L D N +KG IA ++F +LF ST + + E Q +VT M
Sbjct: 3989 DNRGKQHIDQLCDVNGLFHKDEMNKGAIAEAYFSDLFKSTDPSSFVDLFEDYQPRVTESM 4168
Query: 420 NHNLIQEVTELEVYNAVFSINKESAPGPDGFTALFFQQHWDLVKHQILTEIFGFFETGVL 479
N+ LI V++ E+ AVF+I SAPG DGFT FFQ++W ++ Q+ EI FF G
Sbjct: 4169 NNTLIAAVSKNEIREAVFAIRSSSAPGVDGFTGFFFQKYWSIICLQVTKEIQNFFLLGYF 4348
Query: 480 PQDWNHTHICLIPKITSPQRMSDLRPISLCSVLYKIISKILTQRLKKHLPAIVSTTQSAF 539
P+ WN TH+CL+PK P +M+DLRPISLCSVLYKIISKI+ +RL+ LP +VS QSAF
Sbjct: 4349 PKSWNFTHLCLLPKKKKPDKMTDLRPISLCSVLYKIISKIMVRRLQPFLPDLVSPNQSAF 4528
Query: 540 VPQRLISDNILVAHEMIHSLRTNDRISKEHMAFKTDMSKAYDRVEWPFLETMMTALGFNN 599
V +RLI DNIL+AHE++H LRT+ +SK +A K++MSKA+DRVEW ++ ++ ALGF+
Sbjct: 4529 VAERLIFDNILIAHEVVHGLRTHKSVSKGFIAIKSNMSKAFDRVEWNYVRALLDALGFHQ 4708
Query: 600 KWISWIMNCVTSVSYSVLINGQPYGHIIPTRGIRQGDPLSPALFVLCTEALIHILNKAEQ 659
KW+ WIM ++SVSYSVLIN + +G+I+P+RG+RQGDPLSP LFVLC+E L H++N+AE+
Sbjct: 4709 KWVGWIMFMISSVSYSVLINDKAFGNIVPSRGLRQGDPLSPFLFVLCSEGLTHLMNRAER 4888
Query: 660 AGKITGIQFQDKKVSVNHLLFADDTLLMCKATKQECEELMQCLSQYGQLSGQMINLNKSA 719
G ++GI+F + +++HLLFADD+L MCKA K+E + YG ++GQ IN +KS+
Sbjct: 4889 QGLLSGIRFSENGPAIHHLLFADDSLFMCKAVKEEVTVIKSIFKVYGDVTGQRINYDKSS 5068
Query: 720 ITFGKNVDIQIKDWIKSRSGISLEGGTGKYLGLPECLSGSKRDLFGFIKEKLQSRLTGWY 779
IT G VD K WI++ GI+ EGG YLGLPEC SGSK L +IK++L++RL+GW+
Sbjct: 5069 ITLGALVDEDCKVWIQAELGITNEGGASTYLGLPECFSGSKVQLLDYIKDRLKTRLSGWF 5248
Query: 780 AKTLSQGGKEVLLKSIALALPVYVMSCFKLPKNLCQKLTTVMMDFWWNSMQQKRKIHWLS 839
A+TLS GGKE LLK+ ALAL Y MSCFKL K C +T+ M DFWWN+++ KRK HW+S
Sbjct: 5249 ARTLSMGGKETLLKAFALALLFYAMSCFKLTKTTCVNMTSAMSDFWWNALEHKRKTHWVS 5428
Query: 840 WQRLTLPKDQGGFGFKDLQCFNQALLAKQAWRVLQEKGSLFSRVFQSRYFSNSDFLSATR 899
+++ L K+ GG GF+D++ FNQALLAKQAWR+LQ SLF+R F+SRY+ DFL A
Sbjct: 5429 CEKMCLSKENGGLGFRDIESFNQALLAKQAWRLLQFPNSLFARFFKSRYYDEEDFLDAEL 5608
Query: 900 GSRPSYAWRSILFGRELLMQGLRTVIGNGQKTFVWTDKWLHDGSNRRPLNRRRFINVDLK 959
+ PSYAWRSIL GR+LL++G R +GNG T VW D W++D R PL + +N+DL+
Sbjct: 5609 KATPSYAWRSILHGRDLLIKGFRKKVGNGSSTSVWMDPWIYDNDPRLPLQKHFSVNLDLR 5788
Query: 960 VSQLIDPTSRNWNLNMLRDLFPWKDVEIILKQRPLFFKEDSFCWLHSHNGLYSVKTGYEF 1019
V LI+ R + L +LF D+EII+K+ P+ +D + WLHS +G YSVK+GY
Sbjct: 5789 VHDLINVEDRCRRRDRLEELFYPADIEIIVKRNPVVSMDDFWVWLHSKSGEYSVKSGYWL 5968
Query: 1020 LSKQVHHRLYQEAKVKPSVNSLFDKIWNLHTAPKIRIFLWKALHGAIPVEDRLRTRGIRS 1079
+ L +EA+V+PS N L +KIW+ T+PKI++FLW+ L A+PV ++ RG+
Sbjct: 5969 AFQTNKPELIREARVQPSTNGLKEKIWSTLTSPKIKLFLWRILSSALPVAYQIIRRGMPI 6148
Query: 1080 DDGCLMCDTENETINHILFECPLARQVWAITHLSSAGSEFSN-SVYTNMSRLIDLTQQND 1138
D C +C E E+INH+LF C LARQVWA++ + ++ F N S++ N+ L++L +
Sbjct: 6149 DPRCQVCGEEGESINHVLFTCSLARQVWALSGVPTSQFGFQNSSIFANIQYLLELKGKGL 6328
Query: 1139 LPHHLRFVSPWILWFLWKNRNALLFEGKGSITTTLVDKAYEAYHEWFSAQTHMQNDEKHL 1198
+P ++ PW+LW LWKNR+ L FEG ++K + EWF AQ + + +
Sbjct: 6329 IPEQIKKSWPWVLWRLWKNRDKLFFEGTIFSPLKSIEKIRDDVQEWFLAQALVASVDAGE 6508
Query: 1199 KI------TKWCPPLPGELKCNIGFAWSKQHHFSGASWVVRDSQGKVLLHSRRSFNEVHS 1252
+ + W PP G +KCNI WS + G +WV+RD GKVLLHSRR+F+ +
Sbjct: 6509 TVCSAPCPSSWEPPPLGWVKCNISGVWSGKKRVCGGAWVLRDDHGKVLLHSRRAFSNLSV 6688
Query: 1253 PYSAKIRSWEWAL 1265
A +WAL
Sbjct: 6689 KKDALFCCVKWAL 6727
Score = 61.7 bits (147), Expect(2) = 0.0
Query: 1284 EIIQALHKPHEWPLLLGDISELLSFTKDKPHWFLCMEPFCCNRGANAIATSVITGCRFQS 1343
+++ A +P WP +SEL F + W + E NRGA+ IA S++ G RFQS
Sbjct: 6780 DLLSAFARPKAWPSFAFHVSELTHFLEKIGDWKVSEEKVDSNRGASLIAQSIVKGDRFQS 6959
Query: 1344 YVARGYPSWMTNVFTAER 1361
YVA G+ W+ +F ER
Sbjct: 6960 YVAVGHLRWLHQLFEGER 7013
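In the alignment above, the query is numbered in amino acids while the subject (L47193, length 7808) is numbered in nucleotides, so each 60-column block advances the subject by 180 positions. The sketch below checks that bookkeeping for a few alignment lines. The function name and the 1-based frame convention `(start - 1) % 3 + 1` are assumptions made for illustration, not values reported by BLAST.

```python
def hsp_line_check(q_start, q_end, s_start, s_end, query_gaps=0):
    """Compare one alignment line's query columns with its subject codons.

    Returns (query_columns, subject_codons, leftover_nt, subject_frame), where
    query_columns counts query residues plus gap characters in the query row.
    """
    query_columns = (q_end - q_start + 1) + query_gaps
    subject_nt = s_end - s_start + 1
    subject_codons, leftover_nt = divmod(subject_nt, 3)
    subject_frame = (s_start - 1) % 3 + 1  # assumed 1-based frame convention
    return query_columns, subject_codons, leftover_nt, subject_frame

# First line of the first segment (two gap characters in the query row).
print(hsp_line_check(2, 59, 2909, 3088, query_gaps=2))   # (60, 60, 0, 2)
# Last line of the first segment.
print(hsp_line_check(1253, 1265, 6689, 6727))            # (13, 13, 0, 2)
# First line of the second segment.
print(hsp_line_check(1284, 1343, 6780, 6959))            # (60, 60, 0, 3)
```

Under this convention the two scored segments fall in different subject reading frames (2 and 3), which is consistent with their being reported as separate segments sharing one combined Expect(2) value.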
written 29 Dec 97
Larry Parnell