Selenoprotein evolution: introduction
Revision as of 18:14, 20 April 2008
Introduction to selenoprotein evolution
(other selenoproteins shortly)
Selenoprotein SELU: 3 paralogs, variable timing losses
SELU: This family consists of three deeply diverged paralogs with distinct exon patterns. The encoding gene has five average-sized exons with anomalously short introns, like many selenoproteins. In the SELU1 group, selenocysteine occurs in a UxxC motif already in the earliest deuterostomes but drops out in mammals after monotremes, being replaced by CxxC in marsupials and placentals. Amphibians lost selenocysteine separately.
The second paralog, SELU2, has selenocysteine in bilaterians only up to the node of sea urchin, suggesting it was lost early in the deuterostome ancestor. It is the closer paralog of SELU1 (36% vs 27% identity). No vestigial SECIS element persists in living species that encode cysteine. (The decayed SECIS elements still identifiable in the 3' UTR of cysteine-containing GPX6 genes in rodents and of human GPX5 represent a much more recent loss of selenocysteine.)
The third paralog, SELU3, has cysteine in all species for which a sequence is available. It might be called a virtual selenoprotein, supposing that selenocysteine-containing orthologs could be located in early-diverging eukaryotes. This would suggest a scenario in which selenocysteine was present in an ancestral gene prior to the gene duplications, followed by conversion to cysteine in different phylogenetic patterns within each gene subfamily.
This family exhibits the "selenocysteine ratchet": if selenocysteine happens to be replaced by ordinary cysteine (despite its catalytic inferiority) in some stem lineage, the now-unselected 3' UTR SECIS element deteriorates over a few million years from accrued mutations, for the same reason (lack of purifying selection) that a cave crayfish loses its imaging opsins. Consequently the whole descendant clade will contain cysteine. A reversion to TGA at the cysteine codon might occur, but it would simultaneously require a multi-step reversion or de novo evolution of a SECIS element; i.e., all SECIS elements are ancient and selenocysteines cannot wink back on paraphyletically. (However, the overall selenoproteome can still increase over time because of gene duplications elsewhere.)
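The ratchet amounts to an irreversible-loss rule, so the minimum number of independent losses can be counted from tip states alone. The sketch below is only an illustration: the function name `count_losses`, the simplified vertebrate topology, and the clade-level summary of the SELU1 states (taken from the alignment on this page, ignoring the anomalous stickleback duplicate) are assumptions made for the example.

```python
def count_losses(tree, has_sec):
    """Return (clade_is_all_cysteine, losses) for a nested-tuple tree.

    Assumes the ratchet: U -> C may happen, C -> U never does.
    """
    if isinstance(tree, str):                      # a tip
        return (not has_sec[tree], 0)
    results = [count_losses(child, has_sec) for child in tree]
    if all(all_cys for all_cys, _ in results):
        # Whole clade is cysteine: the loss is charged to a branch above.
        return (True, 0)
    # Selenocysteine survives somewhere below, so this node still had it;
    # each all-cysteine child clade is one loss on the branch leading to it.
    losses = sum(n for _, n in results)
    losses += sum(1 for all_cys, _ in results if all_cys)
    return (False, losses)


# Simplified topology, assumed for illustration only.
TREE = (((((("placentals", "marsupials"), "monotremes"), "sauropsids"),
          "amphibians"), "teleosts"), "cartilaginous fish")

# SELU1 state per clade, summarised from the alignment on this page.
HAS_SEC = {
    "placentals": False, "marsupials": False, "monotremes": True,
    "sauropsids": True, "amphibians": False, "teleosts": True,
    "cartilaginous fish": True,
}

all_cys, losses = count_losses(TREE, HAS_SEC)
print(losses)  # 2: one loss in the therian stem, one in amphibians
```

Under these assumptions the count agrees with the text: one loss after the monotreme split and a separate loss in amphibians.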
A phylogenetic overview of the occurrence of selenocysteine in SELU1 in 38 vertebrates:
U/C Species                  Source       .........................*.....
C   Homo sapiens             genome       EPRTFKAKELWEKNGAVIMAVRRPGcFLCRE
C   Pan troglodytes          AACZ02115591 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C   Pongo abelii             ABGA01228099 EPRTLKAKELWEKNGAVIMAVRRPGCFLCRE
C   Macaca mulatta           AANU01282766 EPRTLKAKELWEKNGAVIMAVRRPGCFLCRE
C   Microcebus murinus       ABDC01489848 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C   Otolemur garnettii       AAQR01538573 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C   Tupaia belangeri         AAPY01309022 EPRTFKAKELWGERGAVIMAVRRPGCFLCRE
C   Mus musculus             AAHY01113156 EPRTFKAKELWEKNGAVIMAVRRPGCFLCR.
C   Rattus norvegicus        AAHX01086750 EPRTFKAKELWEKNGAVIMAVRRPGCFLCR.
C   Spermophilus tridec      AAQQ01288000 EPRTFKAKELWEKSGAVIMAVRRPGCFLCRE
C   Cavia porcellus          AAKN02044618 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C   Oryctolagus cuniculus    AAGW01591660 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C   Canis familiaris         AAEX02011808 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C   Bos taurus               AAFC03065652 ...TFKAKALWEKNGAVIMAVRRPGCFLCRE
C   Equus caballus           AAWR02000382 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C   Myotis lucifugus         AAPE01631988 EPRTFKAKELWEEKGAVIMAVRRPGCFLCRE
C   Sorex araneus            AALT01607337 zPKTFKAKELWSKSGAVIMAVRRPGCFLCRE
C   Boreoeuthere ancestralis ancestral    EPRTFKAKELWEKNGAVIMAVRRPGcFLCRE
C   Echinops telfairi        AAIY01623759 ...TFQSKGALGKNGAVIMAVRRPGCFLCRE
C   Dasypus novemcinctus     AAGV01392885 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C   Monodelphis domestica    AAFR03024314 SPKTFKARELWEHRGAVIMAVRRPGCFLCRE
C   Trichosurus vulpecula    transcript   SPKTFKARELWEHRGAVIMAVRRPGCFLCRE
C   Macropus eugenii         genome       ..KTFKARELWEHRGAVIMAVRRPGCFLCRE
U   Ornithorhynchus anatin   AAPN01249400 EPRTFKARELWQRNGAVIMAVRRPGUFLCRE
U   Tachyglossus aculeatus   genome       EPRTFKARELWQRNGAVIMAVRRPGUFLCRE
U   Anolis carolinensis      AAW.01013574 ..RTFKAEELWKKNGAVIMAVRRPGUFLCRE
U   Gallus gallus            AADN02035315 EPRTFKASELWKKNGAVIMAVRRPGUFLCRE
U   Taeniopygia guttata      genome       EKRTFKAGELWKQNGAVIMAVRRPGUFLCRE
C   Xenopus tropicalis       genome       EPKSFKAKDLWEKNGAVVMAVRRPGCFLCRE
C   Xenopus laevis           transcript   EPRLFKAKDLWERDGAVIMAVRRPGCFLCRE
U   Danio rerio              CAAK04015812 DDRVFKARELWESSGAVIMAVRRPGUFMCRE
U   Tetraodon nigroviridis   CAAE01014976 ETKTFKAKTLWEKCGAVVMAVRRPGUFLCRE
U   Fugu rubripes            CAAB01000016 ETKTFKAKSLWENSGAVVMAVRRPGUFLCRE
U   Gasterosteus aculeatus   AANH01005113 ...VIKGRSLWDKNGAVVMAVRRPGUFLCRE
U   Oryzias latipes          BAAE01190338 DTKIIKAKSLWDKNGAVVMAVRRPGUFLCRE
U   Fundulus heteroclitus    transcript   .....KAKSLWEKNGAVVMAVRRPGUFLCRE
U   Oncorhynchus mykiss      CR369769     .....KAKALWEKTGAVVMAVRRPGUFLCRE
U   Callorhinchus milii      AAVX01258517 ENRTFRASELWAGRGAVIMAVRRPGUFLCRE
C   Gasterosteus aculeatus   AANH01005113 ......AKTLWDKTGAVVMVVRRPGCLLCRE (anomalous gene duplication with cysteine)
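The U/C label on each alignment row can be checked mechanically against the residue under the `*` column. A minimal sketch follows, using four rows copied from the alignment above; the names `MASK`, `ROWS` and `residue_at_star` are hypothetical, and the parser assumes whitespace-separated rows whose last field is the sequence, so a trailing comment such as the one on the duplicated stickleback gene would have to be stripped first.

```python
from collections import Counter

# 31-column mask: '*' marks the selenocysteine/cysteine position.
MASK = "." * 25 + "*" + "." * 5

# A few rows copied from the alignment (label, species, source, sequence).
ROWS = [
    "C Homo sapiens genome EPRTFKAKELWEKNGAVIMAVRRPGcFLCRE",
    "U Ornithorhynchus anatin AAPN01249400 EPRTFKARELWQRNGAVIMAVRRPGUFLCRE",
    "C Xenopus tropicalis genome EPKSFKAKDLWEKNGAVVMAVRRPGCFLCRE",
    "U Danio rerio CAAK04015812 DDRVFKARELWESSGAVIMAVRRPGUFMCRE",
]


def residue_at_star(row, mask=MASK):
    """Return (label, residue under '*') for one alignment row."""
    fields = row.split()
    label, seq = fields[0], fields[-1]
    return label, seq[mask.index("*")].upper()


tally = Counter()
for row in ROWS:
    label, residue = residue_at_star(row)
    if label != residue:
        raise ValueError(f"label/residue mismatch in row: {row}")
    tally[residue] += 1

print(dict(tally))  # {'C': 2, 'U': 2}
```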
Selenoprotein SEPW1: small protein with an odd paralog
Selenoprotein SEPW1 is one of the shortest known mammalian proteins at 87 aa. With its CxxU motif, it is likely limited to simple redox reactions. Curiously, this small protein still has 5 coding exons. (more shortly)
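The length and the position of the CxxU motif can be verified directly from the human SEPW1 exons listed in the reference set on this page. This is a small sketch; `find_motifs` is a hypothetical helper, and the regular expression `C..U` simply encodes "cysteine, any two residues, selenocysteine" with U as the selenocysteine symbol.

```python
import re

# Human SEPW1, exon by exon, as given in the reference set (u = selenocysteine).
SEPW1_HUMAN = (
    "MALAVRVVYC"
    "GAuGYKSK"
    "YLQLKKKLEDEFPGRLDI"
    "CGEGTPQATGFFEVMVAGKLIHSKK"
    "KGDGYVDTESKFLKLVAAIKAALAQG"
)


def find_motifs(protein, pattern=r"C..U"):
    """Return (1-based start, matched text) for every motif occurrence."""
    seq = protein.upper()
    return [(m.start() + 1, m.group()) for m in re.finditer(pattern, seq)]


print(len(SEPW1_HUMAN))          # 87
print(find_motifs(SEPW1_HUMAN))  # [(10, 'CGAU')]
```

The single hit spans the first exon junction: the cysteine closes exon 1 and the selenocysteine sits in exon 2.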
Reference sets of vertebrate selenoproteins
SELU1: 13 vertebrate proteins
>SELU1_homSap Homo sapiens (human) processed pseudogenes chr8 and chr12
0 MSFLQDPSFFTMGMWSIGAGALGAAALALLLANTDVFLSKPQKAALEYLEDIDLKTLEK 1
2 EPRTFKAKELWEKNGAVIMAVRRPGcFLCREE 0
0 AADLSSLKSMLDQLGVPLYAVVKEHIRTEVKDFQPYFKGEIFLDEK 0
0 KKFYGPQRRKMMFMGFIRLGVWYNFFRAWNGGFSGNLEGEGFILGGVFVVGSGKQ 0
0 GILLEHRENEFGDKVNLLSVLEAAKMIKPQTLASEKK* 0
>SELU2_homSap Homo sapiens (human) 7 exons chr1p36.32 36% id NM_152371
0 MSTVDLARVGACILKHAVTGE 0
0 AVELRSLWREHACVVAGLRRFGCVVCRWIAQDLSSLAGLLDQHGVRLVGVGPEALGLQEFLDGDYFAG 1
2 ELYLDESKQLYKELGFKR 2
1 YNSLSILPAALGKPVRDVAAK 0
0 AKAVGIQGNLSGDLLQSGGLLVVSK 1
2 GGDKVLLHFVQKSPGDYVPKEHILQVLGISAEVCASDPPQ 0
0 CDREV* 0
>SELU3_homSap Homo sapiens (human) 6 exons chr9q22.32 25% id processed pseudogene chrX
0 MAAPAPVTRQVSGAAALVPAPSGPDSGQPLAAAVAELPVLDARGQRVPFGALFRERRAVVVFVR 0
0 HFLCYICKEYVEDLAKIPRSFLQ 0
0 EANVTLIVIGQSSYHHIE 0
0 PFCKLTGYSHEIYVDPEREIYKRLGMKRGEEIASS 1
2 GQSPHIKSNLLSGSLQSLWRAVTGPLFDFQGDPAQQGGTLILGP 1
2 GNNIHFIHRDRNRLDHKPINSVLQLVGVQHVNFTNRPSVIHV* 0
>SELU1_borAnc Boreoeuthere ancestralis (northern beast) 5 exons no selenocysteine
0 MSFLQDPSFFTMGMWSIGAGALGAAALALLLANTDVFLSKPQKAALEYLEDIDLKTLEK 1
2 EPRTFKAKELWEKNGAVIMAVRRPGcFLCREV 0
0 AADLSSLKPKLDELGVPLYAVVKEHIRTEVKDFQPYFKGEIFLDEK 0
0 KKFYGPQRRKMMFMGFVRLGVWYNFFRAWNGGFSGNLEGEGFILGGVFVVGPGKQ 0
0 GILLEHREKEFGDKVNPVSVLEAARKIKPQTSASEKK* 0
>SELU1_triVul Trichosurus vulpecula (brushtail opossum) EC360881
0 MSFLDLSFFSMGMWSLGAGALGAAVLSLILANTNLFLTKSVTATLEFLEEIELKTLDN 1
2 ESPKTFKARELWEHRGAVIMAVRRPGCFLCREE 0
0 AAELSALKPQLDQLGIPLYAVVKEKIGSEVENFQPYFKGKIFLDER 0
0 KKFYGPQKRKMMFMGFVRLGVWQNFFRARSKGFSGNLEGEGFILGGVYVIGPGKQ 0
0 GILLEHREKEFGDKVDPASVLEAA * 0
>SELU1_macEug Macropus eugenii (tammar wallaby) EX196548 full
0 MSFLDLSFLSMGMWSLGAGALGAAVLSLILANTDVFLTKSVTATLEFLEDIELKTLDN 1
2 KTFKARELWEHRGAVIMAVRRPGCFLCREE 0
0 AADLSALKPQLDQLGIPLYAVVKEKIGSEVEDFQPYFKGKIFLDER 0
0 KKFYGPQKRKMMFMGFVRLGVWQNFFRARSKGFSGNLEGEGFILGGVYVIGPRKQ 0
0 GILLDHREKELGDKVNPASVLEACKKIKLHA* 0
>SELU1_monDom Monodelphis domestica (opossum) tgt-cys
0 MSFLDLNFFSMSMWSLGAGALGAAALSLILANTDLFLTKSVDATLEFLEEIQLKTLDN 1
2 ESPKTFKARELWEHRGAVIMAVRRPGCFLCREV 0
0 AADLSALKPQLDLLGVPLYAVVKEKIGSEVENFQPYFKGKIFLDER 0
0 KKFYGPQKRKMMFMGFVRLGVWQNFFRARSKGFSGNLEGEGFVLGGVYVIGPGKQ 0
0 GILLEHREKEFGDKVNPASVLEAAKKIKPHTSTSEGK* 0
>SELU1_oan data Ornithorhynchus anatinus (platypus) taa early stop full
0 MPLPPDLGLFNLGMWSVGVGALGAAAVGLLLANTDLLLTKPEKATLEYLEDTELKTLGK 1
2 EPRTFKARELWQRNGAVIMAVRRPGuFLCREE 0
0 AAELSSLKPQLDRLGVPLYAVVKEKIGTEVEDFQPYFKGEIFLDER 0
0 KKFYGPHKRKMLFLGFIRLGVWQNFLRARNRGFSGNLEGEGLILGGVYVLGAGKQ 0
0 GILLEHREREFGDKVSPASVLEAAQRIKPQPL* 0
>SELU1_tacAcu Tachyglossus aculeatus (echidna) 454:EUEMSW405C31QQ (74%) tSASEKK terminus? frag
0 1
2 EPRTFKARELWQRNGAVIMAVRRPGuFLCREE 0
0 AAELSSLKPQLDQLGVPLYAVVKENIGTEVEDFQPYFKGEIFLDER 0
0 KRFYGPHKRKMLFLGLIRLGVWQNFIRARNKGFPPVTWEGEG 0
0 GVLLEHREREFGDKVSPASVLEAAQKIKPQ* 0
>SELU1_gga Gallus gallus (chicken)
0 MSFLPDFGIFTMGMWSVGLGAVGAAITGIVLANTDLFLSKPEKATLEFLEAIELKTLGS 1
2 EPRTFKASELWKKNGAVIMAVRRPGuFLCREE 0
0 ASELSSLKPQLSKLGVPLYAVVKEKIGTEVEDFQHYFQGEIFLDEK 0
0 RSFYGPRKRKMMLSGFFRXGVWQNFFRAWKNGYSGNLEGEGFTLGGVYVIGAGRQ 0
0 GVLLEHREKEFGDKVSLPSVLEAAEKIKPQAS* 0
>SELU1_tgu Taeniopygia guttata (finch)
0 msflpdfgiFTMGMWSVGLGAIGAAVTGIVLANTDLFLSKPEKATLEFLEEIELKTLGS 1
2 EKRTFKAGELWKQNGAVIMAVRRPGuFLCREE 0
0 ASELSSLKPQLSKLGVPLYAVVKENIGTEVEDFQHYFKGEIFLDEK 0
0 KGFYGPRRRKMMLSGFFRLGVWQNFVRAWRSGYSGNLEGEGFTLGGVYVIGAGRQ 0
0 GVLLEHREKEFGDKVSLPSVLEAAEKIKPQAS* 0
>SELU1_anoCar Anolis carolinensis (lizard)
0 MWTIGLGAIGAAVTGIILANTDLFLSKAEQASLDFLEAIDLKTLGE 1
2 NQRTFKAEELWKKNGAVIMAVRRPGuFLCREV 0
0 AAELSSLKPQLDKLGVPLYAVVKENLGTEVMDFQPYFKGEIFLDEK 0
0 KQFYGPQKRKMLFMGFIRCSVWRNFFRAWKSGYTGNIDGEGFVLGGVFVVGPGKQ 0
0 GVLLEHREKEFGDKVSLDAVLEAVKNIQPQPSEKDK* 0
>SelU1_fugRer Fugu rubripes (fugu)
0 MGLLAKLLAAVGGFVTAVMNSVTDAFLTPPLRATLEHLEETDLKTLSG 1
2 ALVIRLIPTRTETKTFKAKSLWENSGAVVMAVRRPGuFLCRE 0
0 EAAELSSLKPRLDQLGVPLYAVVKEDVGTEIQNFRPYFQGEIFLDEK 0
0 RRFYGPRERKMGLLGFLRVGVWMNGLRAFRSGFMGNVLGEGFVLGGVFVIGREQQ 0
0 GILLEHREREFGDKVNIEDVIQAVDRIAQELMPVTQN* 0
>SELU1_gasAcu Gasterosteus aculeatus (stickleback) chrVI.790.1 length=214
MGMWSLGLGAVGAALAGIFLANTDLCLPKAASASLENLEDADLRS
KGRSLWDKNGAVVMAVRRPGuFLCREV
ASGLSSLKPQLEELGVPLVAVVKEDVGTEIRDFRPHFAGDIFIDEK
SFYGPLQRKMGGLGFIRLGVWQNFMRAWRSGYQGNMNGEGFILGGVFVFGAGNQ
GILLEHREKEFGDKVQIADVLEAVKKIVPAK*
>SELU1_calMil Callorhinchus milii (elephantfish) frag
2 ENRTFRASELWAGRGAVIMAVRRPGuFLCRE 0
0 AAALSSLRPSLAQLGVPL
0 GHLLEHREKEFGDAVNLTAVMEAAGKISPRQSAE* 0
>SELU1_squAca Squalus acanthias (spiny dogfish) also selenocysteine
0 MVVVVEDFHMGLWTLGLGALGAAITGVILANTDLLLPKAETASLAYLSGAELRTLDR 1
2 EERTLKAGDLWSRSGAVIMVVRRPGuFLCREE 0
0 AAEISSLRPQLDELGVPLYGVIKENINNELKNFQPFFKGEIFLDVE 0
0 MRFYGPKPRTMGLMGFMRLGVWKNFVRAWQKGFSGNTDGEGFILgGVFVIGAGQQ 0
0 GVLLEHREKEFGDVVNISSVLEARRKIETQRTEP* 0
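Each exon in these reference records is flanked by two digits. Reading them as the codon phase at the start and end of the exon is an assumption, but it is consistent with the junction pairs seen throughout the set (0/0, 1/2 and 2/1). Under that reading, a complete record can be parsed and sanity-checked as sketched below; `parse_exons` and `junctions_consistent` are hypothetical helper names, and fragmentary records with missing fields would need extra handling.

```python
# Human SELU1 record body, copied from the reference set above.
RECORD = (
    "0 MSFLQDPSFFTMGMWSIGAGALGAAALALLLANTDVFLSKPQKAALEYLEDIDLKTLEK 1 "
    "2 EPRTFKAKELWEKNGAVIMAVRRPGcFLCREE 0 "
    "0 AADLSSLKSMLDQLGVPLYAVVKEHIRTEVKDFQPYFKGEIFLDEK 0 "
    "0 KKFYGPQRRKMMFMGFIRLGVWYNFFRAWNGGFSGNLEGEGFILGGVFVVGSGKQ 0 "
    "0 GILLEHRENEFGDKVNLLSVLEAAKMIKPQTLASEKK* 0"
)


def parse_exons(text):
    """Split 'phase SEQ phase' triples into (start, seq, end) tuples."""
    tokens = text.split()
    if len(tokens) % 3 != 0:
        raise ValueError("record is not a whole number of phase/seq/phase triples")
    return [
        (int(tokens[i]), tokens[i + 1], int(tokens[i + 2]))
        for i in range(0, len(tokens), 3)
    ]


def junctions_consistent(exons):
    """True if each exon's end digit and the next start digit sum to 0 mod 3."""
    return all((a[2] + b[0]) % 3 == 0 for a, b in zip(exons, exons[1:]))


exons = parse_exons(RECORD)
protein = "".join(seq for _, seq, _ in exons).rstrip("*")

print(len(exons))                   # 5
print(junctions_consistent(exons))  # True
```

Joining the exon strings and dropping the terminal `*` gives the full protein, in which the lowercase `c` still marks the cysteine that replaced selenocysteine.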
SEPW1: 26 vertebrate proteins
>SEPW1_homSap Homo sapiens (human) Selenoprotein W chr19 87 aa uc002phn.1 has retroprocessed pseudogene
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 CGEGTPQATGFFEVMVAGKLIHSKK 0
0 KGDGYVDTESKFLKLVAAIKAALAQG* 0
>SEPW1_panTro Pan troglodytes (chimp)
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 RGEGTPQATGFFEVMVAGKLIHSKK 0
0 KGDGYVDTESKFLKLVAAIKAALAQG* 0
>SEPW1_ponPyg Pongo pygmaeus (orang_sumatran) CR926472
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 CGEGTPQATGFFEVMVAGKLIHSKK 0
0 KGDGYVDTESKFLKLVAAIKAALAQG* 0
>SEPW1_macMul Macaca mulatta (rhesus)
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 CGEGTPQATGFFEVMVAGKLIHSKK 0
0 KGDGYVDTESKFLKLVAAIKAALAQG* 0
>SEPW1_macFas Macaca fascicularis (cynomolgus_monkey) AB169486
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 CGEGTPQATGFFEVMVAGKLIHSKK 0
0 KGDGYVDTESKFLKLVAAIKAALAQG* 0
>SEPW1_papAnu Papio anubis (baboon) EY285690
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 CGEGTPQATGFFEVMVAGKLIHSKK 0
0 KGDGYVDTESKFLKLVAAIKAALAQG* 0
>SEPW1_calJac Callithrix jacchus (marmoset)
0 MALTVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 SGEGTPQATGFFEVTVAGKLIHSKK 0
0 KGDGYVDTESKFLKLVAAIKAALAQG* 0
>SEPW1_micMur Microcebus murinus (mouse_lemur)
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGCLDI 0
0 CGEGTPQATGFFEVMVAGKLVHSKK 0
0 GDGYVDTESKFLKLV
>SEPW1_musMus Mus musculus (mouse)
0 MALAVRVVYC 2
1 GAuGYKPK 0
0 YLQLKEKLEHEFPGCLDI 0
0 CGEGTPQVTGFFEVTVAGKLVHSKK 0
0 RGDGYVDTESKFRKLVTAIKAALAQCQ* 0
>SEPW1_ratNor Rattus norvegicus (rat) BC087625
0 MALAVRVVYC 2
1 GAuGYKPK 0
0 YLQLKEKLEHEFPGCLDI 0
0 CGEGTPQVTGFFEVTVAGKLVHSKK 0
0 RGDGYVDTESKFRKLVTAIKAALAQCQ* 0
>SEPW1_cavPor Cavia porcellus (guinea_pig)
0 MALAVRVVYC 2
1 GAuGYKPK 0
0 YLQLKEKLEDEFPGCLDI 0
0 CGEGTPQTTGFFEVTVAGKLVHSKK 0
0 GGDGFVDTEGKFRKLVAAIKAALAQG* 0
>SEPW1_oryCun Oryctolagus cuniculus (rabbit)
0 MALAVRVVYC 2
1 GAuGYKPK 0
0 YLQLKKKLEDEFPGCLDI 0
0 CGEGTPQVTGFFEVTVAGKLVHSKK 0
0 RGDGYVDTESKFLKLVAAIKAALAQG* 0
>SEPW1_ochPri Ochotona princeps (pika)
0 MALSVRVVYW 2
1 GAuGYKPK 0
0 YLQLKKRLEDEFPGCLDI 0
0 GEGTPQVTGFFEVMVAGKLVHSKK 0
0 SGDGYVDTESKFLKLVAAIKAALAQG* 0
>SEPW1_canFam Canis familiaris (dog)
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGCLDI 0
0 RGEGTPQATGFFEVTVAGKLVHSKK 0
0 RGDGYVDTESKFLRLVAAIKTALAQG* 0
>SEPW1_felCat Felis catus (cat)
0 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGCLDI 0
0 RGEGTPQATGFFEVMVGGKLVHSKK 0
0 RGDGYVDTESKFLKLVAAIKAALAQG* 0
>SEPW1_bosTau Bos taurus (cow)
0 MAVVVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPSRLDI 0
0 RGEGTPQVTGFFEVFVAGKLVHSKK 0
0 GGDGYVDTESKFLKLVAAIKAALAQA* 0
>SEPW1_oviAri Ovis aries (sheep)
0 MAVVVRVVYC 2
1 GAuGYKPK 0
0 YLQLKKKLEDEFPSRLDI 0
0 CGEGTPQVTGFFEVFVAGKLVHSKK 0
0 GGDGYVDTESKFLKLVAAIKAALAQA* 0
>SEPW1_susScr Sus scrofa (pig) AF380118
0 MGVAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 CGEGTPQVTGFFEVLVAGKLVHSKK 0
0 GGDGYVDTESKFLKLVAAIKAALAQG* 0
>SEPW1_eriEur Erinaceus europaeus (hedgehog)
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGCLDI 0
0 RGEGTPQGTGFFEVLVAGKLVHSKK 0
0 KGDGYVDTETKFLKLVTAIKAALAQG* 0
>SEPW1_sorAra Sorex araneus (shrew)
0 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGCVDV 0
0 CGEGTPQVTGFFEVMVAGKLVHSKK 0
0 RGDGYVDSESKYVRLVTAIKTALAQA* 0
>SEPW1_Choloepus hoffmanni (sloth)
0 MALAVRVVYW 2
1 GAuGYKPK 0
0 YVQLKKKLEDEFPGCLDI 0
0 SGEGTPQTTGFFEVMVAGKLVHSKK 0
0 QKGDGFVDTESKFLRLVAAIKAALAQG* 0
>SEPW1_monDom Monodelphis domestica (opossum) diverged
0 MAIQVRVVYW 2
1 GAuGYKPK 0
0 YLLLKKKLEDEYPGLLRH 0
0 NGEGTPEVTGFFEVTVAGKLVHSKK 0
0 AGHGFVDTADKYLQIVAEIKAALA* 0
>SEPW1_ornAna Ornithorhynchus anatinus (platypus)
0 MASLEAFPRGVVPVHVVYC 2
1 GAuGYKPK 0
0 FLQLKKKLENEFPGQVEI 0
0 SGEGTPQVTGWFEVTVAGKLVHSKK 0
0 EGDGFVDSESKFAKIRMAIKAALVPGY* 0
>SELW_galGal Gallus gallus (chicken) tga confirmed
0 MPLRVTVLYC 2
1 GAuGYKPK 0
0 YERLRAELEKRFPGALEM 0
0 RGQGTQEVTGWFEVTVGSRLVHSKK 0
0 NGDGFVDTNAKLQRIVAAIQAALP* 0
>SELW_anoCar Anolis carolinensis (lizard)
1 GAuGYSPK 0
0 YQQLKRGLEKEFPGKLEI 0
0 TGEGTPQVTGWFEVTVAGKLVHSKK 0
0 NGDGFVDNDTKLHKILMAIKAALA* 0
>SELW_xenTro Xenopus tropicalis (frog) tga confirmed
0 MPDTMVKVNVVYC 2
1 GAuGYLSK 0
0 FRRLKKELEQRFPGKLSI 0
0 DGEGTERMTGWFEVSINGKLVHSKK 0
0 NGDGYVDNDAKLQKIILAIEAALKQ* 0
>SELW_danRer Danio rerio (zebrafish) tga confirmed
0 MTVKVHVVYC 2
1 GGuGYRPK 0
0 FIKLKTLLEDEFPNELEI 0
0 TGEGTPSTTGWLEVEVNGKLVHSKK 0
0 NGDGFVDSDSKMQKIVTAIEQAMGK* 0
>SEPW1_takRub
0 MGVTIRVEYC 2
1 GGuGYGPR 0
0 YEELARVVRAEFPDADVSGFVGRM 1
2 GSFEIQINEQLIFSKLETGGFPYEDD 0
>SEPW2_calMil Callorhinchus milii (elephantfish)
1 GAuGYEPRYQKLAIVIKDEFPDADVSGKVGRT 1
2 GSFEIEINGQLIFSKLETGGFPYEND 0
0 ISEAVQKANNGEELQKIENSRPPCVIL* 0
0 VMHAIQCVSDGKPVEKITKSRPPCVIM* 0
Reference sets of vertebrate SECIS elements
Mammalian SECIS sequences for SEPW1:
>selW_hsa
agggaccttgacccagcccctctcagcagacgcttcatgataggaaggactgaaaagtcttgtggacacctggtctttccctgatgttctcgtggctgctgttgggggcagagattgacgcccccggtctttgcct
>selW_chimp
AGGGACCTTGACCCAGCCCCTCTCAGCAGACGCTTCATGATAGGAAGGACTGAAAAGTCTTGTGGACACCTGGTCTTTCCCTGATGTTCTtGTGGCTGCTGTTGGGGGCAGAGATTGACGCCCCCGGTCTTTGCCT
>selW_pongo
AGTCCAGGGACCTTGACCCAGCCCCTCTCAGCAGACGCTTCATGATAGGAAGGACTGAAAAATCTTGTGGACACCTGGTCTTTCCCTGATGTTCTCGTGGCTGCTGTTGGGGGCAGAGATTGACGCCCCTGGTCTTTGCCT
>selW_rhesus
AGcGACCTTGACCCAGCCCCTCTCAGCAGACGCTTCATGATAGGAAGGACTGAAAAGTCTTGTGGACgCCTGGTCTTTCCCTGATGTTCTCGTGGCTGCTGTTGGGGGCAGAGATTGACGCCgCtGGTCTTTGCCT
>selW_mouse
ACTGAAATGTCTTAGACTTGGCCCAGCCCCTCGTGGCAGACGCTTCATGATGGGAAGAACTGAAATGTCTCGTGGACGCCTGGTCTTTCCCTGATGTCCCTGCGACTGCCACGTAGGGGCAGAGACTGATGCCCCTGTGGGTGCCT
>selW_rat
CCTGGCCGGCCTTTCTTGGCAGCCGCTTCATGACAGGAAGGACTGAAATGTCTCAAAGACCTGTGGTCTTTCTTCGATGTTCCTGCGGCCACCAAGTCAGGCCAGAGATGGATTCTGTGTGTGGGTGCCT
>selW_rabbit
AGTAACCTTGACCCAGCCCCTTTCATGCCTCAGCCTCGTCTCCATAGGCTAAGACTGGAGAAATGAGTCCCCTGAAGAACTGAAACTGGGGGTAGAGGGTTGGTGTTTTAAGATGTGGATGAGCTGGTCTTTAC
>selW_dog
CCAGtGACCTTGgCCCAGCCCCTCgtgGCAGACGCTTCATGATgGGAAGaACTGAAAtGTCTcGTGGACgCCTGGTCTTTCCCTGATGTccctgcgactgccacgtaGGGGCAGAGAcTGAtGCCCCtGGTCTTTGCCT
>selW_pig
AGTAACCTTGACCCAGCCCCTTTCATGCCTCAGCCTCGTCTCCATAGGCTAAGACTGGAGAAGTCTTGTGGACGCCTGGTCTTTCCCTGATGTTCTCGTGGCTGCTGTTGGGGGCAGAGATGGATGAGCTGGTCTTTAC
>selW_ban_Boreo
AGTCCAGCAACCTTGGCCCAGCCCCTCTCAGCAGATGCTTCATGACAGGAAGGACTGAAATGTCTTGTGGACGCCTGGTCTTTCCCTGATGTTCTTGTGGCTGCTGGTTGGGGCAGAGATTGACACCCCTGGTCTTTGCCT
>selW_armadillo
CCAGCAACCTCAGCCCAGCTGCCCTTGGCAGACGCTTCATGAGGGGAAGGACCTAAATGCGTCGTGGATGCCTGGTCTTTCCCTGATGCTCCTTCACCTGCCAGATGGGGCAGAGGTCATTGCCCCTGGTCTTGGCCT
>selW_elephant
GGGACCTTGGCCCAGCCCCTTTCAGCAGACACTTCATGACAGGAGGACTGAAATGTCTCCCAGACGCCTGGCTCTTTCCCTGAATCTGTCGGCTGCAGGACAGGGCAGCGGTTGACTCTCTCGTTTTTTGCAT
>selW_tenrec
AGGCCAGAGACCTTGGCCCAGTCCCTCCATGACAGGCAGAACTGAAATGTCCTCTGGACAAGTGGTCTTTTCCAGAAACCCCAGGGCTGCTGGGCCGGAGCCGAGGCTGACAACCCTGGTCTTTGCCT
>selW_consensus
agtacCtTGaCCcaGCCccTttcagCAGcCCtTCatgAtaGGaAGaactGAaAagTctaGaCccTGGtCTttccctGatgttcgGgctgctgttagggcAGagatgGAtgcgctggTcttTgCt
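A quick way to orient within these sequences is to look for the conserved ATGA of the SECIS core (AUGA in the RNA). The sketch below does only that case-insensitive scan; it makes no attempt at the stem-loop structure that a real SECIS search needs. The helper `find_core` is hypothetical, and the two strings are only the 5' portions of the human and rat entries above, truncated for brevity.

```python
# 5' portions of two entries from the SEPW1 SECIS set (truncated for brevity).
SECIS_PREFIXES = {
    "selW_hsa": "agggaccttgacccagcccctctcagcagacgcttcatgataggaaggactgaaaagtct",
    "selW_rat": "CCTGGCCGGCCTTTCTTGGCAGCCGCTTCATGACAGGAAGGACTGAAATGTCTC",
}


def find_core(seq, core="ATGA"):
    """Return the 0-based index of the first core match, or -1 if absent."""
    return seq.upper().find(core)


for name, seq in SECIS_PREFIXES.items():
    pos = find_core(seq)
    # Show the match with a few flanking bases for context.
    context = seq[max(pos - 3, 0):pos + 7] if pos >= 0 else ""
    print(name, pos, context)
```

In both entries the first hit falls in the shared `TTCATGA` stretch, which is also preserved in the consensus line.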