| Gene | T2H3.3 |
| Putative Identification | ribosomal protein L19 |
| Position | 35988 - 37237, from the initial methionine to the termination codon |
| Strand | + |
| EST hits | T14056, T44067, T04719, H36046, F13950, Z17981 |
| Database match | L19 proteins from several species |
CDS: The table below lists the coordinates of the T2H3.3 exons and which exon prediction algorithms selected the 5' and 3' termini (GS = GenScan, Gr = GRAIL, M = MZEF, NPG = NetPlantGene - selects splice sites only, not exons).
| Exon | Range | 5' | 3' |
|---|---|---|---|
| 1 | 35988 - 36099 | GS, Gr | EST, GS, NPG |
| 2 | 36559 - 36681 | EST, GS, Gr, M, NPG | EST, GS, Gr, M, NPG |
| 3 | 36762 - 37125 | EST, GS, NPG | EST, GS, NPG |
| 4 | 37210 - 37237 | EST, GS, Gr, M, NPG | GS |
Alternate exons not used in building the gene model: GRAIL does not predict any model exons in this region. Gene modeling was performed with shadow exons. GRAIL predicts exon 1 from 35988 to 36125. GRAIL splits intron 1 to give an exon from 36258 to 36282. NetPlantGene predicts several splice sites in this region but none have significant confidence scores.
Complete CDS of T2H3.3
ATGGTTTCGTTGAAGCTCCAGAAGCGTCTCGCATCCTCAGTACTCAAGTGCGGGAAGAGA AAAGTATGGCTCGACCCGAACGAAGGAAGTGATATCTCCATGGCCAATTCGCGTCAAAAC ATCAGGAAGCTTGTGAAGGATGGATTCATCATCAGGAAGCCAACGAAGATCCACTCACGA TCCAGGGCACGTCAATTGAATATAGCCAAGAGAAAGGGACGTCACTCTGGTTATGGTAAG AGGAAGGGTACCCGAGAGGCAAGGTTGCCAACAAAAGTTTTGTGGATGAGGAGGATGAGA GTGCTGAGGCGTCTGTTGAAGAAGTATAGGGAGACCAAGAAGATTGACAGACATATGTAC CATGACATGTACATGAAAGTCAAGGGAAACGTCTTCAAGAACAAGCGTGTGTTGATGGAA AGTATTCACAAATCCAAGGCTGAGAAGGCTAGAGAGAAGACATTGTCTGACCAGTTTGAG GCTAAGAGGGCTAAGAACAAGGCTAGCCGAGAGAGGAAACATGCTCGACGAGAGGAGCGT CTTGCCAAGGGTCCTGGAGGAGACATACCTGCCGCTGCACCACCTGCACAAACCGCTGAG GTACCTGCGAAAAAGTCTAAGAAGTGA
Protein translation of T2H3.3
MVSLKLQKRLASSVLKCGKRKVWLDPNEGSDISMANSRQNIRKLVKDGFIIRKPTKIHSR SRARQLNIAKRKGRHSGYGKRKGTREARLPTKVLWMRRMRVLRRLLKKYRETKKIDRHMY HDMYMKVKGNVFKNKRVLMESIHKSKAEKAREKTLSDQFEAKRAKNKASRERKHARREER LAKGPGGDIPAAAPPAQTAEVPAKKSKK*
Multiple sequence analysis of L19 proteins
L19 sequences from several species are aligned here. Some of these sequences are partial. From top to bottom, the source organisms for the aligned sequences are rat, human, mouse, D. melanogaster, A. thaliana T2H3.3, previously described partial sequence of A. thaliana, Z. mays, D. discoideum, C. elegans, S. pombe, S. cerevisiae, Methanococcus vannielii, Mathanococcus jannaschii, Methanobacterium thermoautotrophicum, Archaeoglobus fulgidus, Pyrococcus horikoshii, Haloarcula marismortui, Sulfolobus acidocaldarius and A. thaliana L19-like.
1 60
RatL19 ~~MSMLRLQKRLASSVLRCGKKKVWLDPNETNEIANANSRQQIRKLIKDGLIIRKPVTVH
HumanL19 ~~MSMLRLQKRLASSVLRCGKKKVWLDPNETNEIANANSRQQIRKLIKDGLIIRKPVTVH
MouseL19 ~~MSMLRLQKRLASSVLRCGKKKVWLDPNETNEIANANSRQQIRKLIKDGLIIRKPVTVH
DmelanoL19 ~~MSSLKLQKRLAASVLRCGKKKVWLDPNEINEIANTNSRQNIRKLIKDGLIIKKPVVVH
T2H3.3 ~~MVSLKLQKRLASSVLKCGKRKVWLDPNEGSDISMANSRQNIRKLVKDGFIIRKPTKIH
AthalL19 ~~~~~LKLQKRLRSSVLKCGKRKVWLDPNEGSDISMANSRQNIRKLVKDGFIIRKPTKIH
ZmaysL19 ~~~~~LKLQKRLAASYLKCGKGKVWLDPNEVSEISMANSRQNIRKLVKDGFIIKKPHKIH
DdiscoidL19 ~~MVSLKLQKRLAASILKCGKGRVWIDPNEIADVAMANSRDNVRRLIATGFIMRKPVVVH
CelegansL19 ~~MSNLRLQKRLASAVLKCGKHRVWLDPNEVSEISGANSRQSIRRLVNDGLIIRKPVTVH
SpombeL19 ~~MANLRTQKRLAASVLKCGKRKVWMDPNEISEISNANSRQNVRKLIKDGLVIRKPNLMH
ScerevisL19 ~~MANLRTQKRLAASVVGVGKRKVWLDPNETSEIAQANSRNAIRKLVKNGTIVKKAVTVH
MvanniL19 ~~~MDVSTQRRIAAAVLDCGIDRVWVDPENLEKVKMAITKDDIRLLINDGIIVKKQEKGI
MjannaL19 MIIMDVSVQRRMAAEILKCGIERVWIDPTQLDRVKMAMSKDDIRALIKEGVIKKKQKKGI
MthermoL19 ~~~MNLTTQKRLAADILKVGVNRIWIDPERIDEVSRAITRDGVKQLIKDGAIKAKPKKGI
AfulgiL19 MMVMDLSLQRRLAASVLKCGENRVWFDPAALEDIATAATKQDIRELIEQGVIKRKPVNGV
PhorikoL19 ~~MNTLKMQRRIAAELLKCGENRVWIDPEKVDEVASAITREDIRRLIKEGVIRKKPIEGQ
HmarisL19 ~~MTDLSAQKRLAADVLDVGKNRVWFNPERQGDIADAITREDVRELVDEGAIQAKDKKGN
SacidoL19 ~~MPEFQLQRRLAADIAGVGLNNIKFNPERLEEVEEALTREDIKKLIKERAVIVNPKRGI
AthalL19a ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
61 120
RatL19 SRARCRKNTLARR.KGRHMGIGKRKGTANARMPEKVTWMRRMRILRRLLRRYRESKKIDR
HumanL19 SRARCRKNTLARR.KGRHMGIGKRKGTANARMPEKVTWMRRMRILRRLLRRYRESKKIDR
MouseL19 SRARCRKNTLARR.KGRHMGIGKRKGTANARMPEKVTWMRRMRILRRLLRRYRESKKIDR
DmelanoL19 SRYRVRKNTEARR.KDRHCGFGKRKGTANARMPTKLLWMQRQPFCRRLLKKYRDSKKIDR
T2H3.3 SRSRARQLNIAKR.KGRHSGYGKRKGTREARLPTKVLWMRRMRVLRRLLKKYRETKKIDR
AthalL19 SRSRARQLNIAKR.KGRHSGYGKRNGTREARLPTKVLWMRRMRVLRRLLKKYRETKKIDK
ZmaysL19 SRSRCKK~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
DdiscoidL19 SRSRAREHNAAKR.LGRHRGAGNRLGTREARLPSKILWIRRIRVLRRLLKKYREAKKIDK
CelegansL19 SRFRAREYEEARR.KGRHTGYGKRRGTANARMPEKTLWIRRMRVLRNLLRRYRDAKKLDK
SpombeL19 SRFRIRKTHAAKR.LGRHTGYGKRKGTAEARMPSAVVWMRRQRVLRRLLRKYRESGKIDK
ScerevisL19 SKSRTRAHAQSKR.EGRHSGYGKRKGTREARLPSQVVWIRRLRVLRRLLAKYRDAGKIDK
MvanniL19 SSARKKEVQEQKR.KGKRKGPGSRRGAKGARTPKKEKWMNTIRPLRTLLKELRENEKIER
MjannaL19 SSARVKKLKEQRK.KGRRRGPGSRRGAAGARTPPKERWMATIRALRKTLKQLRDSGKIDR
MthermoL19 SSYRSKKIAQQKK.KGRRRGPGSIKGAKGARRPKKDEWMTTIRALRKDLKEMRDNREINK
AfulgiL19 SRARINKRKLQKR.KGRRRGHGSRKGAKGARMPRKRMWILRIRALRKALRQMKAEGVVDR
PhorikoL19 SRYRARIRHEQKK.KGRHRGPGSRKGKKTARMGKKELWIKTIRALRRELRKLKEQKKIDR
HmarisL19 SRGRARERQKKRA.YGHQKGAGSRKGKAGARQNSKEDWESRIRAQRTKLRELRDEGTLSS
SacidoL19 SSGRLKERKHKRRSKGEGRKHGSRKGKSGARTGDKEIWINKIRKIRRYIRWLRDNNVIDK
AthalL19a ~~~~~~~~~~~~~~~~~~~~~~~~~~~MEALRRIMKRLTRRLKMLKRLLKKFCWNKKIDK
121 180
RatL19 HM.YHSLYLKVKGNVFKNKRILMEHIHKLKADKARKKLLADQAEARRSKTKEARKRREER
HumanL19 HM.YHSLYLKVKGNVFKNKRILMEHIHKLKADKARKKLLADQAEARRSKTKEARKRREER
MouseL19 HM.YHSLYLKVKGNVFKNKRILMEHIHKLKADKARKKLLADQAEARRSKTKEARKRREER
DmelanoL19 HL.YHDLYMKCKGNVFKNKRVLMEYIHKKKAEKQRSKMLADQAEARRQKVREARKRREER
T2H3.3 HM.YHDMYMKVKGNVFKNKRVLMESIHKSKAEKAREKTLSDQFEAKRAKNKASRERKHAR
AthalL19 HM.SMTMYMS~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
ZmaysL19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
DdiscoidL19 HS.YRELYLKAKGNVFKNKRTLIEYIVKSKTEKLREKLIQDQAQARRNKNKSIKDRRAAK
CelegansL19 HL.YHELYLRAKGNNFKNKKNLIEYIFKKKTENKRAKQLADQAQARRDKNKESRKRREER
SpombeL19 HL.YHTLYLEAKGNTFKHKRALIEHIQRAKAEANRTKLIQEQQDARRARAKAARQRRAKA
ScerevisL19 HL.YHVLYKESKGNAFKHKRALVEHIIQAKADAQREKALNEEAEARRLKNRAARDRRAQR
MvanniL19 SS.YRKLYRMAKGGAFRSRNHMKLYMK.EHGILAE~~~~~~~~~~~~~~~~~~~~~~~~~
MjannaL19 KV.YRKLYRMAKGGAFRSRSHLFLYMR.EHELLK~~~~~~~~~~~~~~~~~~~~~~~~~~
MthermoL19 ST.YRKLYKMAKGGAFKSKSYMKTYAR.DHDMLR~~~~~~~~~~~~~~~~~~~~~~~~~~
AfulgiL19 RT.YRILYRKAKGGEFRSVAHLKIYVE.QMKR*~~~~~~~~~~~~~~~~~~~~~~~~~~~
PhorikoL19 KT.YRMLYIRAKGGQFKNKHQLYLFLE.EHGLLKK*~~~~~~~~~~~~~~~~~~~~~~~~
HmarisL19 SQ.YRDLYDKAGGGEFDSVADLERYIDANHGDA~~~~~~~~~~~~~~~~~~~~~~~~~~~
SacidoL19 HT.YRLLYKRAKGNYFKNLSDVKSYLRQMGHKV~~~~~~~~~~~~~~~~~~~~~~~~~~~
AthalL19a LVYYHDMFMKVKGKVYKNKCVLMESMHKSSRERKFSGSEMRLALVTIKSCFIKICSEPEG
181 213
RatL19 LQAKKEEIIKTLSKEEETKK*~~~~~~~~~~~~
HumanL19 LQAKKEEIIKTLSKEEETKK*~~~~~~~~~~~~
MouseL19 LQSKKEEIIKTLSKEEETKK*~~~~~~~~~~~~
DmelanoL19 IATKKQELIALHAKEDEIAAKAATAGH*~~~~~
T2H3.3 REERLAKGPGGDIPAAAPPAQTAEVPAKKSKK*
AthalL19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
ZmaysL19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
DdiscoidL19 SAAKEIVSSN*~~~~~~~~~~~~~~~~~~~~~~
CelegansL19 QVVKRAELLRKISQSEKVIAGK*~~~~~~~~~~
SpombeL19 VEEKREQLYTAAEKIEE*~~~~~~~~~~~~~~~
ScerevisL19 VAEKRDALLKEDA~~~~~~~~~~~~~~~~~~~~
MvanniL19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
MjannaL19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
MthermoL19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
AfulgiL19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
PhorikoL19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
HmarisL19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
SacidoL19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
AthalL19a EKTASAPQ~~~~~~~~~~~~~~~~~~~~~~~~~
written 11 Sep 98
Larry
Parnell