Gene: T13L16.7
Putative identification: Tal-1-like reverse transcriptase
Position: 41812 - 45909, from within the CDS to the termination codon
Strand: -
EST match: none
Database match: reverse transcriptase encoded by the Tal-1 retrotransposon

The zinc-finger protein predicted for T13L16.6 and the reverse transcriptase predicted for T13L16.7 may together constitute a non-LTR retrotransposon similar to Tal-1.

 

CDS: The table below lists the coordinates of the single T13L16.7 exon. The exon is one long open reading frame of 4098 nt, yet none of the algorithms tested predicted it. The 5' end of the T13L16.7 coding region is not defined, so the sequence shown is a partial CDS.

Exon    Range            5'    3'
1       41812 - 45909
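
The exon coordinates can be sanity-checked with a little arithmetic. The sketch below is illustrative only; it assumes the range is 1-based and inclusive (the usual convention for annotation coordinates), and the names EXON_START, EXON_END and span_length are hypothetical.

```python
# Coordinates as listed for the single T13L16.7 exon.
# Assumption: 1-based, inclusive positions.
EXON_START = 41812
EXON_END = 45909


def span_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    return end - start + 1


length_nt = span_length(EXON_START, EXON_END)
codons, remainder = divmod(length_nt, 3)

print(length_nt)          # 4098
print(codons, remainder)  # 1366 0
```

A remainder of zero means the range holds a whole number of codons: 1366, which is 1365 residues plus the termination codon.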

Partial CDS of T13L16.7

GGCGTCATTAGCTGGAATTGTCAAGGTCTGAGGAATCCTTGGACAATTCGATACTTGAAG
GAAATGAAAAAGGATCATTTTCCAGACATTCTATTCCTCATGGAGACAAAAAATTCTCAA
GATTTTGTTTATAAAGTTTTTTGTTGGCTTGGTTATGATTTTATACATACAGTAGAACCT
GAAGGAAGAAGTGGTGGCTTAGCAATTTTCTGGAAAAGCCATCTGGAGATTGAGTTTCTT
TATGCTGATAAAAATCTTATGGATCTTCAGGTTTCTTCTAGAAATAAAGTTTGGTTTATT
TCTTGTGTTTATGGACTTCCGGTTACCCATATGCGACCTAAACTATGGGAACACCTGAAT
TCTATTGGTCTCAAGAGAGCAGAAGCATGGTGTTTAATAGGAGACTTCAATGATATTAGG
TCGAATGATGAAAAATTAGGAGGACCAAGACGATCACCTTCTTCTTTTCAATGCTTTGAA
CACATGTTACTCAACTGTTCTATGCATGAATTAGGAAGTACAGGTAATAGTTTCACTTGG
GGAGGAAACAGAAATGATCAGTGGGTTCAGTGTAAATTGGATCGATGCTTTGGAAATCCA
GCTTGGTTTTCTATTTTCCCTAATGCTCATCAGTGGTTTTTAGAGAAGTTTGGATCTGAT
CATCGCCCGGTATTGGTCAAATTCACTAATGACAATGAGCTTTTTCGTGGACAATTTCGT
TATGATAAAAGACTTGATGACGATCCCTACTGTATTGAGGTTATACATCGTTCTTGGAAT
AGTGCAATGTCTCAAGGTACTCACTCTTCTTTTTTCAGTCTTATTGAATGTCGTAGAGCT
ATCAGTGTATGGAAACATTCGTCTGATACTAATGCTCAAAGTAGAATTAAGAGATTGAGA
AAAGATTTAGATGCAGAGAAGAGTATTCAAATTCCATGTTGGCCTAGAATTGAGTATATC
AAGGATCAACTGAGTTTAGCATATGGTGATGAAGAATTGTTTTGGAGACAAAAAAGCAGA
CAAAAATGGCTTGCAGGGGGTGACAAGAATACTGGATTCTTTCATGCTACTGTACACTCT
GAGAGATTAAAGAATGAGTTAAGCTTTCTACTTGATGAGAATGATCAGGAATTTACAAGA
AACAGTGATAAAGGAAAAATTGCATCTTCCTTTTTTGAGAATCTGTTCACCTCTACGTAT
ATATTGACTCATAACAATCACCTGGAAGGTCTCCAAGCAAAGGTAACATCAGAAATGAAT
CACAATTTAATTCAAGAGGTCACTGAACTGGAAGTGTATAATGCAGTATTCTCTATTAAC
AAGGAAAGTGCTCCAGGACCTGATGGTTTTACTGCTTTGTTCTTTCAACAACATTGGGAT
TTAGTGAAACATCAGATCTTAACTGAGATTTTTGGTTTCTTTGAGACTGGAGTTCTACCC
CAGGATTGGAATCACACTCATATTTGTCTCATTCCTAAGATCACTAGCCCACAGAGAATG
TCAGATCTTCGACCTATAAGTTTGTGTTCTGTGCTATACAAGATAATATCGAAGATTTTG
ACTCAAAGATTGAAAAAACATCTTCCAGCTATTGTTTCTACAACGCAATCTGCATTTGTT
CCTCAACGCTTGATATCTGATAATATATTGGTTGCTCATGAAATGATTCATAGTCTCAGA
ACCAATGATCGAATATCTAAGGAACATATGGCTTTCAAGACTGACATGTCTAAGGCTTAT
GACAGGGTTGAGTGGCCTTTTTTGGAAACTATGATGACTGCTCTTGGTTTTAACAACAAA
TGGATTTCCTGGATCATGAATTGTGTAACTTCAGTCTCTTACTCAGTCTTGATCAATGGA
CAGCCTTATGGCCATATCATTCCAACTCGTGGAATTCGACAAGGAGATCCACTTTCACCG
GCCTTGTTTGTACTTTGCACTGAAGCTTTGATTCATATTCTCAACAAGGCAGAACAAGCA
GGAAAAATCACGGGTATTCAGTTTCAAGACAAGAAAGTTTCAGTAAATCATTTACTATTT
GCTGATGACACTCTTCTAATGTGTAAAGCTACAAAACAAGAGTGTGAAGAGCTAATGCAA
TGCTTATCTCAGTATGGTCAATTATCTGGTCAAATGATCAACCTGAATAAATCTGCAATC
ACTTTTGGGAAAAATGTTGATATTCAAATTAAAGATTGGATAAAGTCTAGATCAGGTATT
TCTTTGGAAGGTGGAACAGGAAAATACCTTGGTTTACCTGAATGTTTAAGTGGTTCAAAA
AGGGATTTATTTGGGTTTATTAAGGAAAAATTGCAATCAAGACTTACAGGTTGGTATGCC
AAAACATTATCTCAAGGAGGCAAGGAAGTTTTACTCAAATCCATTGCTTTAGCTCTTCCT
GTTTATGTCATGTCCTGCTTCAAATTACCAAAGAACTTATGTCAAAAGCTAACCACTGTG
ATGATGGATTTCTGGTGGAATAGTATGCAACAGAAAAGGAAAATTCATTGGCTTAGTTGG
CAAAGGTTAACACTTCCAAAGGATCAGGGTGGATTTGGTTTCAAAGATTTACAATGTTTC
AATCAAGCTCTATTGGCCAAACAAGCATGGAGAGTACTACAGGAGAAAGGAAGTCTTTTT
TCCAGGGTTTTTCAAAGTAGGTACTTCTCTAATTCTGATTTTCTTTCTGCTACCAGAGGA
TCTAGACCTTCATATGCTTGGAGAAGCATATTATTTGGAAGAGAGCTACTTATGCAAGGT
TTAAGAACAGTGATTGGGAACGGGCAAAAGACCTTTGTATGGACTGACAAGTGGTTACAT
GATGGTTCTAATAGACGACCTCTGAATAGGAGACGCTTTATTAATGTTGATTTGAAAGTC
AGTCAATTGATTGATCCGACATCTAGGAACTGGAATCTTAATATGCTTCGGGATTTGTTT
CCCTGGAAAGATGTTGAGATCATCCTCAAGCAAAGACCACTGTTTTTTAAAGAAGACTCA
TTTTGTTGGTTGCATTCTCACAATGGATTATATTCTGTTAAAACTGGATATGAATTCCTA
AGTAAACAAGTTCATCATCGTTTGTATCAAGAAGCTAAAGTTAAGCCTTCTGTCAATTCT
CTTTTTGACAAAATATGGAATCTGCATACGGCTCCAAAGATCAGAATTTTTCTATGGAAA
GCTTTACATGGTGCAATCCCTGTTGAAGACAGACTTCGAACAAGAGGTATTAGAAGTGAT
GATGGCTGTTTGATGTGTGATACAGAAAATGAAACCATCAATCATATTTTGTTTGAATGT
CCTCTAGCTAGACAAGTCTGGGCAATTACTCATTTATCATCTGCAGGGTCTGAGTTTTCA
AATTCTGTTTATACTAATATGAGTAGATTGATTGACTTAACTCAGCAAAATGATCTTCCT
CATCATTTGCGGTTTGTTAGTCCATGGATTCTTTGGTTTTTATGGAAAAACAGGAATGCA
TTGCTATTCGAAGGAAAAGGCTCAATAACAACTACTCTAGTTGACAAAGCTTATGAAGCA
TATCATGAATGGTTTTCAGCTCAAACACACATGCAAAATGATGAAAAACATTTGAAGATC
ACGAAATGGTGTCCACCGTTGCCTGGTGAATTGAAGTGTAATATTGGTTTTGCCTGGTCA
AAACAACATCACTTTTCGGGTGCATCTTGGGTGGTACGTGATTCACAAGGAAAAGTCTTA
TTGCATAGTCGCAGATCTTTTAATGAGGTACATTCTCCTTACTCTGCTAAGATAAGAAGC
TGGGAATGGGCATTAGAATCTATGACTCATCATCACTTTGATAGAGTCATTTTTGCTTCC
TCAACACATGAGATTATTCAAGCCTTACACAAACCACATGAATGGCCTCTCCTATTGGGT
GATATTTCTGAGCTTTTAAGCTTCACTAAAGACAAACCACACTGGTTTCTCTGTATGGAA
CCCTTTTGTTGTAACAGAGGTGCGAATGCTATTGCCACGAGTGTCATCACAGGATGCAGA
TTTCAATCATACGTGGCTAGAGGTTATCCATCGTGGATGACAAATGTTTTTACTGCTGAG
AGAGGTAACTTGATTTGA

 

Protein sequence of T13L16.7

GVISWNCQGLRNPWTIRYLKEMKKDHFPDILFLMETKNSQDFVYKVFCWLGYDFIHTVEP
EGRSGGLAIFWKSHLEIEFLYADKNLMDLQVSSRNKVWFISCVYGLPVTHMRPKLWEHLN
SIGLKRAEAWCLIGDFNDIRSNDEKLGGPRRSPSSFQCFEHMLLNCSMHELGSTGNSFTW
GGNRNDQWVQCKLDRCFGNPAWFSIFPNAHQWFLEKFGSDHRPVLVKFTNDNELFRGQFR
YDKRLDDDPYCIEVIHRSWNSAMSQGTHSSFFSLIECRRAISVWKHSSDTNAQSRIKRLR
KDLDAEKSIQIPCWPRIEYIKDQLSLAYGDEELFWRQKSRQKWLAGGDKNTGFFHATVHS
ERLKNELSFLLDENDQEFTRNSDKGKIASSFFENLFTSTYILTHNNHLEGLQAKVTSEMN
HNLIQEVTELEVYNAVFSINKESAPGPDGFTALFFQQHWDLVKHQILTEIFGFFETGVLP
QDWNHTHICLIPKITSPQRMSDLRPISLCSVLYKIISKILTQRLKKHLPAIVSTTQSAFV
PQRLISDNILVAHEMIHSLRTNDRISKEHMAFKTDMSKAYDRVEWPFLETMMTALGFNNK
WISWIMNCVTSVSYSVLINGQPYGHIIPTRGIRQGDPLSPALFVLCTEALIHILNKAEQA
GKITGIQFQDKKVSVNHLLFADDTLLMCKATKQECEELMQCLSQYGQLSGQMINLNKSAI
TFGKNVDIQIKDWIKSRSGISLEGGTGKYLGLPECLSGSKRDLFGFIKEKLQSRLTGWYA
KTLSQGGKEVLLKSIALALPVYVMSCFKLPKNLCQKLTTVMMDFWWNSMQQKRKIHWLSW
QRLTLPKDQGGFGFKDLQCFNQALLAKQAWRVLQEKGSLFSRVFQSRYFSNSDFLSATRG
SRPSYAWRSILFGRELLMQGLRTVIGNGQKTFVWTDKWLHDGSNRRPLNRRRFINVDLKV
SQLIDPTSRNWNLNMLRDLFPWKDVEIILKQRPLFFKEDSFCWLHSHNGLYSVKTGYEFL
SKQVHHRLYQEAKVKPSVNSLFDKIWNLHTAPKIRIFLWKALHGAIPVEDRLRTRGIRSD
DGCLMCDTENETINHILFECPLARQVWAITHLSSAGSEFSNSVYTNMSRLIDLTQQNDLP
HHLRFVSPWILWFLWKNRNALLFEGKGSITTTLVDKAYEAYHEWFSAQTHMQNDEKHLKI
TKWCPPLPGELKCNIGFAWSKQHHFSGASWVVRDSQGKVLLHSRRSFNEVHSPYSAKIRS
WEWALESMTHHHFDRVIFASSTHEIIQALHKPHEWPLLLGDISELLSFTKDKPHWFLCME
PFCCNRGANAIATSVITGCRFQSYVARGYPSWMTNVFTAERGNLI*
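
The protein listing can be cross-checked against the CDS by translating it with the standard genetic code. The following is a minimal sketch, not the tool used to produce this annotation; CODON_TABLE and translate are hypothetical names, and only the first 27 and last 18 nucleotides of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding-strand DNA sequence in frame 1; '*' marks a stop."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 27 nt and last 18 nt of the partial CDS listed above.
print(translate("GGCGTCATTAGCTGGAATTGTCAAGGT"))  # GVISWNCQG
print(translate("AGAGGTAACTTGATTTGA"))           # RGNLI*
```

Both fragments reproduce the corresponding ends of the protein sequence, including the terminal stop.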

 

BLAST output demonstrating the similarity between T13L16.7 and the reverse transcriptase of Tal-1

  gb|L47193|ATHRETR Arabidopsis thaliana (clone DW15) zinc-finger protein gene and
     reverse transcriptase gene, complete cds.
     Length = 7808

 Score = 1251 bits (3201), Expect(2) = 0.0

Query: 2    VISWNCQGL--RNPWTIRYLKEMKKDHFPDILFLMETKNSQDFVYKVFCWLGYDFIHTVE 59
            ++SWNCQGL      TI  L EM+  HFP++LFLMETKN  + V  +  WLGY+ + TV 
Sbjct: 2909 LVSWNCQGLGWSQDLTIPRLMEMRLSHFPEVLFLMETKNCSNVVVDLQEWLGYERVFTVN 3088

Query: 60   PEGRSGGLAIFWKSHLEIEFLYADKNLMDLQVSSRNKVWFISCVYGLPVTHMRPKLWEHL 119
            P G SGGLA+FWK  ++I   YADKNL+D Q+   +  +++SCVYG P    +  +WE +
Sbjct: 3089 PIGLSGGLALFWKKGVDIVIKYADKNLIDFQIQFGSHEFYVSCVYGNPAFSDKHLVWEKI 3268

Query: 120  NSIGLKRAEAWCLIGDFNDIRSNDEKLGGPRRSPSSFQCFEHMLLNCSMHELGSTGNSFT 179
              IG+ R E WC++GDFN I  N EK GGPRR  SSF  F  ML +C M EL S GN FT
Sbjct: 3269 TRIGINRKEPWCMLGDFNPILHNGEKRGGPRRGDSSFLPFTDMLDSCDMLELPSIGNPFT 3448

Query: 180  WGGNRNDQWVQCKLDRCFGNPAWFSIFPNAHQWFLEKFGSDHRPVLVKFTNDNELFRGQF 239
            WGG  N+ W+Q +LDRCFGN  WF  FP ++Q FL+K GSDHRPVLV+ T   E +RG F
Sbjct: 3449 WGGKTNEMWIQSRLDRCFGNKNWFRFFPISNQEFLDKRGSDHRPVLVRLTKTKEEYRGNF 3628

Query: 240  RYDKRLDDDPYCIEVIHRSWNSAMSQGTHSSFFSLIECRRAISVWKHSSDTNAQSRIKRL 299
            R+DKRL + P   E I ++WN +           L  CR A+S WK  ++ N+ +RI + 
Sbjct: 3629 RFDKRLFNQPNVKETIVQAWNGSQRNENLLVLDKLKHCRSALSRWKKENNINSSTRITQA 3808

Query: 300  RKDLDAEKSIQIPCWPRIEYIKDQLSLAYGDEELFWRQKSRQKWLAGGDKNTGFFHATVH 359
            R  L+ E+S   P    +  +K+ L  A  DEE+FW QKSR KW+  GDKNT FFHA+V 
Sbjct: 3809 RAALELEQSSGFPRADLVFSLKNDLCKANHDEEVFWSQKSRAKWMHSGDKNTSFFHASVK 3988

Query: 360  SERLKNELSFLLDENDQEFTRNSDKGKIASSFFENLFTSTYILTHNNHLEGLQAKVTSEM 419
              R K  +  L D N        +KG IA ++F +LF ST   +  +  E  Q +VT  M
Sbjct: 3989 DNRGKQHIDQLCDVNGLFHKDEMNKGAIAEAYFSDLFKSTDPSSFVDLFEDYQPRVTESM 4168

Query: 420  NHNLIQEVTELEVYNAVFSINKESAPGPDGFTALFFQQHWDLVKHQILTEIFGFFETGVL 479
            N+ LI  V++ E+  AVF+I   SAPG DGFT  FFQ++W ++  Q+  EI  FF  G  
Sbjct: 4169 NNTLIAAVSKNEIREAVFAIRSSSAPGVDGFTGFFFQKYWSIICLQVTKEIQNFFLLGYF 4348

Query: 480  PQDWNHTHICLIPKITSPQRMSDLRPISLCSVLYKIISKILTQRLKKHLPAIVSTTQSAF 539
            P+ WN TH+CL+PK   P +M+DLRPISLCSVLYKIISKI+ +RL+  LP +VS  QSAF
Sbjct: 4349 PKSWNFTHLCLLPKKKKPDKMTDLRPISLCSVLYKIISKIMVRRLQPFLPDLVSPNQSAF 4528

Query: 540  VPQRLISDNILVAHEMIHSLRTNDRISKEHMAFKTDMSKAYDRVEWPFLETMMTALGFNN 599
            V +RLI DNIL+AHE++H LRT+  +SK  +A K++MSKA+DRVEW ++  ++ ALGF+ 
Sbjct: 4529 VAERLIFDNILIAHEVVHGLRTHKSVSKGFIAIKSNMSKAFDRVEWNYVRALLDALGFHQ 4708

Query: 600  KWISWIMNCVTSVSYSVLINGQPYGHIIPTRGIRQGDPLSPALFVLCTEALIHILNKAEQ 659
            KW+ WIM  ++SVSYSVLIN + +G+I+P+RG+RQGDPLSP LFVLC+E L H++N+AE+
Sbjct: 4709 KWVGWIMFMISSVSYSVLINDKAFGNIVPSRGLRQGDPLSPFLFVLCSEGLTHLMNRAER 4888

Query: 660  AGKITGIQFQDKKVSVNHLLFADDTLLMCKATKQECEELMQCLSQYGQLSGQMINLNKSA 719
             G ++GI+F +   +++HLLFADD+L MCKA K+E   +      YG ++GQ IN +KS+
Sbjct: 4889 QGLLSGIRFSENGPAIHHLLFADDSLFMCKAVKEEVTVIKSIFKVYGDVTGQRINYDKSS 5068

Query: 720  ITFGKNVDIQIKDWIKSRSGISLEGGTGKYLGLPECLSGSKRDLFGFIKEKLQSRLTGWY 779
            IT G  VD   K WI++  GI+ EGG   YLGLPEC SGSK  L  +IK++L++RL+GW+
Sbjct: 5069 ITLGALVDEDCKVWIQAELGITNEGGASTYLGLPECFSGSKVQLLDYIKDRLKTRLSGWF 5248

Query: 780  AKTLSQGGKEVLLKSIALALPVYVMSCFKLPKNLCQKLTTVMMDFWWNSMQQKRKIHWLS 839
            A+TLS GGKE LLK+ ALAL  Y MSCFKL K  C  +T+ M DFWWN+++ KRK HW+S
Sbjct: 5249 ARTLSMGGKETLLKAFALALLFYAMSCFKLTKTTCVNMTSAMSDFWWNALEHKRKTHWVS 5428

Query: 840  WQRLTLPKDQGGFGFKDLQCFNQALLAKQAWRVLQEKGSLFSRVFQSRYFSNSDFLSATR 899
             +++ L K+ GG GF+D++ FNQALLAKQAWR+LQ   SLF+R F+SRY+   DFL A  
Sbjct: 5429 CEKMCLSKENGGLGFRDIESFNQALLAKQAWRLLQFPNSLFARFFKSRYYDEEDFLDAEL 5608

Query: 900  GSRPSYAWRSILFGRELLMQGLRTVIGNGQKTFVWTDKWLHDGSNRRPLNRRRFINVDLK 959
             + PSYAWRSIL GR+LL++G R  +GNG  T VW D W++D   R PL +   +N+DL+
Sbjct: 5609 KATPSYAWRSILHGRDLLIKGFRKKVGNGSSTSVWMDPWIYDNDPRLPLQKHFSVNLDLR 5788

Query: 960  VSQLIDPTSRNWNLNMLRDLFPWKDVEIILKQRPLFFKEDSFCWLHSHNGLYSVKTGYEF 1019
            V  LI+   R    + L +LF   D+EII+K+ P+   +D + WLHS +G YSVK+GY  
Sbjct: 5789 VHDLINVEDRCRRRDRLEELFYPADIEIIVKRNPVVSMDDFWVWLHSKSGEYSVKSGYWL 5968

Query: 1020 LSKQVHHRLYQEAKVKPSVNSLFDKIWNLHTAPKIRIFLWKALHGAIPVEDRLRTRGIRS 1079
              +     L +EA+V+PS N L +KIW+  T+PKI++FLW+ L  A+PV  ++  RG+  
Sbjct: 5969 AFQTNKPELIREARVQPSTNGLKEKIWSTLTSPKIKLFLWRILSSALPVAYQIIRRGMPI 6148

Query: 1080 DDGCLMCDTENETINHILFECPLARQVWAITHLSSAGSEFSN-SVYTNMSRLIDLTQQND 1138
            D  C +C  E E+INH+LF C LARQVWA++ + ++   F N S++ N+  L++L  +  
Sbjct: 6149 DPRCQVCGEEGESINHVLFTCSLARQVWALSGVPTSQFGFQNSSIFANIQYLLELKGKGL 6328

Query: 1139 LPHHLRFVSPWILWFLWKNRNALLFEGKGSITTTLVDKAYEAYHEWFSAQTHMQNDEKHL 1198
            +P  ++   PW+LW LWKNR+ L FEG        ++K  +   EWF AQ  + + +   
Sbjct: 6329 IPEQIKKSWPWVLWRLWKNRDKLFFEGTIFSPLKSIEKIRDDVQEWFLAQALVASVDAGE 6508

Query: 1199 KI------TKWCPPLPGELKCNIGFAWSKQHHFSGASWVVRDSQGKVLLHSRRSFNEVHS 1252
             +      + W PP  G +KCNI   WS +    G +WV+RD  GKVLLHSRR+F+ +  
Sbjct: 6509 TVCSAPCPSSWEPPPLGWVKCNISGVWSGKKRVCGGAWVLRDDHGKVLLHSRRAFSNLSV 6688

Query: 1253 PYSAKIRSWEWAL 1265
               A     +WAL
Sbjct: 6689 KKDALFCCVKWAL 6727


 Score = 61.7 bits (147), Expect(2) = 0.0

Query: 1284 EIIQALHKPHEWPLLLGDISELLSFTKDKPHWFLCMEPFCCNRGANAIATSVITGCRFQS 1343
            +++ A  +P  WP     +SEL  F +    W +  E    NRGA+ IA S++ G RFQS
Sbjct: 6780 DLLSAFARPKAWPSFAFHVSELTHFLEKIGDWKVSEEKVDSNRGASLIAQSIVKGDRFQS 6959

Query: 1344 YVARGYPSWMTNVFTAER 1361
            YVA G+  W+  +F  ER
Sbjct: 6960 YVAVGHLRWLHQLFEGER 7013
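
Two things in the alignment blocks above can be verified by hand or with a short script: the number of identical residues in a block, and the relation between the coordinate systems. The query positions count amino acids, while the subject positions appear to count nucleotides, as expected for a search against a translated nucleotide entry. The sketch below uses the final block (query 1344 - 1361); count_identities is a hypothetical helper, not part of BLAST.

```python
def count_identities(query: str, subject: str) -> tuple[int, int]:
    """Return (identical columns, total columns) for two aligned strings.

    Both strings must come from the same alignment block, gaps included,
    so that they have equal length. Gap characters never count as matches.
    """
    if len(query) != len(subject):
        raise ValueError("aligned sequences must have the same length")
    identical = sum(q == s and q != "-" for q, s in zip(query, subject))
    return identical, len(query)


# Last alignment block of the BLAST output above.
query = "YVARGYPSWMTNVFTAER"    # Query: 1344 .. 1361 (amino acids)
subject = "YVAVGHLRWLHQLFEGER"  # Sbjct: 6960 .. 7013 (nucleotides)

identical, columns = count_identities(query, subject)
print(identical, columns)  # 8 18

# The subject span in nucleotides should equal three times the residue count.
subject_nt = 7013 - 6960 + 1
print(subject_nt, subject_nt // 3)  # 54 18
```

The same check applies to any other block, provided the gap characters are kept in both strings.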

 


written 29 Dec 97
Larry Parnell