Selenoprotein evolution: introduction: Difference between revisions

From genomewiki

Revision as of 00:51, 21 April 2008

Introduction to selenoprotein evolution

(other selenoproteins shortly)

Selenoprotein SELU: 3 paralogs, variably timed losses

SELU: This family consists of three deeply diverged paralogs with distinct exon patterns. The encoding genes have about 5 exons on average and, like many selenoprotein genes, anomalously short introns. In the SELU1 group, selenocysteine occurs in a UxxC motif already in the earliest deuterostome but drops out in mammals after monotremes, being replaced by CxxC in marsupials and placentals. Amphibia lost selenocysteine independently.

The second paralog, SELU2, has selenocysteine in bilaterians only up to the sea urchin node, suggesting it was lost early in the deuterostome ancestor. It is the closer paralog of SELU1, sharing 36% identity with it versus 27% for the third paralog. No vestigial SECIS element persists in living species that encode cysteine. (The decayed SECIS elements still identifiable in the 3' UTR of cysteine-containing GPX6 genes in rodents and of human GPX5 represent a much more recent loss of selenocysteine.)
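As a rough illustration of how a percent identity figure like those above is computed, the following sketch scores two pre-aligned sequences column by column. It is not the method used to obtain the 36% and 27% values, which are whole-protein figures that depend on the alignment and on which columns are counted. The function name `percent_identity` and the convention of skipping gap or missing-data columns are assumptions made here; the two example strings are the human and zebrafish SELU1 rows from the alignment on this page.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent of comparable aligned columns that hold identical residues.

    Assumptions of this sketch (not taken from the page):
    - the inputs are already aligned, so they have equal length;
    - columns where either sequence has '.' or '-' are skipped;
    - comparison ignores case, so 'c' equals 'C', while selenocysteine
      'U' is counted as different from cysteine 'C'.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for x, y in zip(a.upper(), b.upper()):
        if x in ".-" or y in ".-":
            continue  # gap or missing data: not a comparable column
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


# Human and zebrafish SELU1 rows copied from the alignment on this page.
human = "EPRTFKAKELWEKNGAVIMAVRRPGcFLCRE"
zebrafish = "DDRVFKARELWESSGAVIMAVRRPGUFMCRE"
print(f"{percent_identity(human, zebrafish):.1f}% identity over this window")
```

Identity over a short conserved window like this one is much higher than over full-length paralogs, so the number printed here is not comparable to the paralog values quoted above.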

The third paralog, SELU3, has cysteine in all species for which a sequence is available. It might be called a virtual selenoprotein if orthologs containing selenocysteine could be located in early-diverging eukaryotes. This would suggest a scenario in which selenocysteine was present in an ancestral gene prior to the gene duplications and was later converted to cysteine in a different phylogenetic pattern within each gene subfamily.

This family exhibits the "selenocysteine ratchet": if selenocysteine happens to be replaced by ordinary cysteine (despite catalytic inferiority) in some stem lineage, the now unselected 3' UTR SECIS element deteriorates over a few million years from accrued mutations, for the same reason (lack of purifying selection) that the cave crayfish loses its imaging opsins. Consequently the whole descendant clade will contain cysteine. A reversion to TGA at the cysteine codon might occur, but it would simultaneously require a multi-step reversion or de novo evolution of a SECIS element; i.e., all SECIS elements are ancient and selenocysteines cannot wink back on paraphyletically. (However, the overall selenoproteome can still increase over time because of gene duplications elsewhere.)

A phylogenetic overview of the occurrence of selenocysteine in SELU1 in 38 vertebrates:

                                        .........................*.....
C  Homo sapiens                  genome EPRTFKAKELWEKNGAVIMAVRRPGcFLCRE
C  Pan troglodytes         AACZ02115591 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C  Pongo abelii            ABGA01228099 EPRTLKAKELWEKNGAVIMAVRRPGCFLCRE
C  Macaca mulatta          AANU01282766 EPRTLKAKELWEKNGAVIMAVRRPGCFLCRE
C  Microcebus murinus      ABDC01489848 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C  Otolemur garnettii      AAQR01538573 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C  Tupaia belangeri        AAPY01309022 EPRTFKAKELWGERGAVIMAVRRPGCFLCRE
C  Mus musculus            AAHY01113156 EPRTFKAKELWEKNGAVIMAVRRPGCFLCR.
C  Rattus norvegicus       AAHX01086750 EPRTFKAKELWEKNGAVIMAVRRPGCFLCR.
C  Spermophilus tridec     AAQQ01288000 EPRTFKAKELWEKSGAVIMAVRRPGCFLCRE
C  Cavia porcellus         AAKN02044618 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C  Oryctolagus cuniculus   AAGW01591660 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C  Canis familiaris        AAEX02011808 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C  Bos taurus              AAFC03065652 ...TFKAKALWEKNGAVIMAVRRPGCFLCRE
C  Equus caballus          AAWR02000382 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C  Myotis lucifugus        AAPE01631988 EPRTFKAKELWEEKGAVIMAVRRPGCFLCRE
C  Sorex araneus           AALT01607337 zPKTFKAKELWSKSGAVIMAVRRPGCFLCRE
C  Boreoeuthere ancestralis   ancestral EPRTFKAKELWEKNGAVIMAVRRPGcFLCRE
C  Echinops telfairi       AAIY01623759 ...TFQSKGALGKNGAVIMAVRRPGCFLCRE
C  Dasypus novemcinctus    AAGV01392885 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE
C  Monodelphis domestica   AAFR03024314 SPKTFKARELWEHRGAVIMAVRRPGCFLCRE
C  Trichosurus vulpecula     transcript SPKTFKARELWEHRGAVIMAVRRPGCFLCRE
C  Macropus eugenii              genome ..KTFKARELWEHRGAVIMAVRRPGCFLCRE
U  Ornithorhynchus anatin  AAPN01249400 EPRTFKARELWQRNGAVIMAVRRPGUFLCRE
U  Tachyglossus aculeatus        genome EPRTFKARELWQRNGAVIMAVRRPGUFLCRE
U  Anolis carolinensis     AAW.01013574 ..RTFKAEELWKKNGAVIMAVRRPGUFLCRE
U  Gallus gallus           AADN02035315 EPRTFKASELWKKNGAVIMAVRRPGUFLCRE
U  Taeniopygia guttata           genome EKRTFKAGELWKQNGAVIMAVRRPGUFLCRE
C  Xenopus tropicalis            genome EPKSFKAKDLWEKNGAVVMAVRRPGCFLCRE
C  Xenopus laevis            transcript EPRLFKAKDLWERDGAVIMAVRRPGCFLCRE
U  Danio rerio             CAAK04015812 DDRVFKARELWESSGAVIMAVRRPGUFMCRE
U  Tetraodon nigroviridis  CAAE01014976 ETKTFKAKTLWEKCGAVVMAVRRPGUFLCRE
U  Fugu rubripes           CAAB01000016 ETKTFKAKSLWENSGAVVMAVRRPGUFLCRE
U  Gasterosteus aculeatus  AANH01005113 ...VIKGRSLWDKNGAVVMAVRRPGUFLCRE
U  Oryzias latipes         BAAE01190338 DTKIIKAKSLWDKNGAVVMAVRRPGUFLCRE
U  Fundulus heteroclitus     transcript .....KAKSLWEKNGAVVMAVRRPGUFLCRE
U  Oncorhynchus mykiss         CR369769 .....KAKALWEKTGAVVMAVRRPGUFLCRE
U  Callorhinchus milii     AAVX01258517 ENRTFRASELWAGRGAVIMAVRRPGUFLCRE

C  Gasterosteus  aculeatus AANH01005113 ......AKTLWDKTGAVVMVVRRPGCLLCRE (anomalous gene duplication with cysteine)
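The U/C flag in the first column of the table can be checked mechanically against the residue under the asterisk. The sketch below does this for six rows copied from the table; it assumes, as the layout suggests, that the last whitespace-separated field of each row is the aligned peptide and that the asterisk in the marker line indexes into that peptide. The names `residue_at_marker` and `tally` are hypothetical helpers introduced only for this example.

```python
from collections import Counter

# Marker line copied from the table: '*' sits over the U/C column.
MARKER = ".........................*....."

# A subset of rows copied from the table above.
ROWS = """\
C  Homo sapiens                  genome EPRTFKAKELWEKNGAVIMAVRRPGcFLCRE
C  Monodelphis domestica   AAFR03024314 SPKTFKARELWEHRGAVIMAVRRPGCFLCRE
U  Ornithorhynchus anatin  AAPN01249400 EPRTFKARELWQRNGAVIMAVRRPGUFLCRE
U  Gallus gallus           AADN02035315 EPRTFKASELWKKNGAVIMAVRRPGUFLCRE
C  Xenopus tropicalis            genome EPKSFKAKDLWEKNGAVVMAVRRPGCFLCRE
U  Danio rerio             CAAK04015812 DDRVFKARELWESSGAVIMAVRRPGUFMCRE
"""


def residue_at_marker(row: str, marker: str = MARKER) -> str:
    """Return the upper-cased residue of a table row at the '*' column."""
    peptide = row.split()[-1]  # assumption: peptide is the last field
    return peptide[marker.index("*")].upper()


def tally(rows: str) -> Counter:
    """Count U versus C at the marked column, checking each row's flag."""
    counts = Counter()
    for row in rows.splitlines():
        if not row.strip():
            continue
        flag = row.split()[0]
        residue = residue_at_marker(row)
        if residue != flag:
            raise ValueError(f"flag {flag!r} disagrees with residue {residue!r}: {row}")
        counts[residue] += 1
    return counts


print(tally(ROWS))
```

Rows carrying a trailing comment, such as the anomalous stickleback duplicate, would need the peptide field selected differently, since the last field is then part of the comment.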

Selenoprotein SEPW1: small protein with an odd paralog

Selenoprotein SEPW1 is one of the shortest known mammalian proteins at 87 aa. With its CxxU motif, it is likely limited to simple redox reactions. Curiously, despite its small size the protein still has 5 coding exons. One of these is of relatively recent origin because chondrichthyans and teleost fish have the second and third exons fused (which, given the tree and the extreme rarity of intron gain/loss, must be the ancestral condition). (more shortly)
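The length, exon count, and CxxU motif stated above can be read directly off the exon-by-exon record format used in the reference sets on this page. The sketch below parses the human SEPW1 record; it assumes that each data line is "number, peptide, number", that the flanking numbers are intron phase annotations (they are carried along but not interpreted), that a trailing asterisk marks the stop codon, and that lower-case u denotes selenocysteine. The function names are hypothetical.

```python
# Human SEPW1 record copied from the reference set on this page
# (header shortened).
RECORD = """\
>SEPW1_homSap Homo sapiens (human) Selenoprotein W chr19 87 aa
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 CGEGTPQATGFFEVMVAGKLIHSKK 0
0 KGDGYVDTESKFLKLVAAIKAALAQG* 0
"""


def parse_exons(record: str) -> list[tuple[int, str, int]]:
    """Split a record into (leading number, peptide, trailing number) per exon.

    Assumption: the flanking numbers are intron phases; this sketch only
    stores them. Lines that do not have exactly three fields are skipped.
    """
    exons = []
    for line in record.splitlines():
        if not line.strip() or line.startswith(">"):
            continue
        fields = line.split()
        if len(fields) != 3:
            continue  # fragmentary or unphased line
        lead, peptide, trail = fields
        exons.append((int(lead), peptide.rstrip("*"), int(trail)))
    return exons


def assemble(exons: list[tuple[int, str, int]]) -> str:
    """Concatenate exon peptides into the full protein."""
    return "".join(peptide for _, peptide, _ in exons)


def selenocysteine_positions(protein: str) -> list[int]:
    """1-based positions of selenocysteine, written as u or U."""
    return [i + 1 for i, aa in enumerate(protein) if aa in "uU"]


exons = parse_exons(RECORD)
protein = assemble(exons)
print(len(exons), "exons,", len(protein), "aa, Sec at", selenocysteine_positions(protein))
```

Records on this page that are fragments or that lack the flanking numbers (for example the stickleback SELU1 entry) will not parse fully under these assumptions.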

Reference sets of vertebrate selenoproteins

SELU1: 13 vertebrate sequences

>SELU1_homSap Homo sapiens (human) processed pseudogenes chr8 and chr12
0 MSFLQDPSFFTMGMWSIGAGALGAAALALLLANTDVFLSKPQKAALEYLEDIDLKTLEK 1
2 EPRTFKAKELWEKNGAVIMAVRRPGcFLCREE 0
0 AADLSSLKSMLDQLGVPLYAVVKEHIRTEVKDFQPYFKGEIFLDEK 0
0 KKFYGPQRRKMMFMGFIRLGVWYNFFRAWNGGFSGNLEGEGFILGGVFVVGSGKQ 0
0 GILLEHRENEFGDKVNLLSVLEAAKMIKPQTLASEKK* 0

>SELU2_homSap Homo sapiens (human) 7 exons chr1p36.32 36% id NM_152371 
0 MSTVDLARVGACILKHAVTGE 0
0 AVELRSLWREHACVVAGLRRFGCVVCRWIAQDLSSLAGLLDQHGVRLVGVGPEALGLQEFLDGDYFAG 1
2 ELYLDESKQLYKELGFKR 2 
1 YNSLSILPAALGKPVRDVAAK 0
0 AKAVGIQGNLSGDLLQSGGLLVVSK 1
2 GGDKVLLHFVQKSPGDYVPKEHILQVLGISAEVCASDPPQ 0
0 CDREV* 0

>SELU3_homSap Homo sapiens (human) 6 exons chr9q22.32 25% id processed pseudogene chrX
0 MAAPAPVTRQVSGAAALVPAPSGPDSGQPLAAAVAELPVLDARGQRVPFGALFRERRAVVVFVR 0
0 HFLCYICKEYVEDLAKIPRSFLQ 0
0 EANVTLIVIGQSSYHHIE 0
0 PFCKLTGYSHEIYVDPEREIYKRLGMKRGEEIASS 1
2 GQSPHIKSNLLSGSLQSLWRAVTGPLFDFQGDPAQQGGTLILGP 1
2 GNNIHFIHRDRNRLDHKPINSVLQLVGVQHVNFTNRPSVIHV* 0

>SELU1_borAnc Boreoeuthere ancestralis (northern beast) 5 exons no selenocysteine
0 MSFLQDPSFFTMGMWSIGAGALGAAALALLLANTDVFLSKPQKAALEYLEDIDLKTLEK 1
2 EPRTFKAKELWEKNGAVIMAVRRPGcFLCREV 0
0 AADLSSLKPKLDELGVPLYAVVKEHIRTEVKDFQPYFKGEIFLDEK 0
0 KKFYGPQRRKMMFMGFVRLGVWYNFFRAWNGGFSGNLEGEGFILGGVFVVGPGKQ 0
0 GILLEHREKEFGDKVNPVSVLEAARKIKPQTSASEKK* 0

>SELU1_triVul Trichosurus vulpecula (brushtail opossum) EC360881
0 MSFLDLSFFSMGMWSLGAGALGAAVLSLILANTNLFLTKSVTATLEFLEEIELKTLDN 1
2 ESPKTFKARELWEHRGAVIMAVRRPGCFLCREE 0
0 AAELSALKPQLDQLGIPLYAVVKEKIGSEVENFQPYFKGKIFLDER 0
0 KKFYGPQKRKMMFMGFVRLGVWQNFFRARSKGFSGNLEGEGFILGGVYVIGPGKQ 0
0 GILLEHREKEFGDKVDPASVLEAA   * 0

>SELU1_macEug Macropus eugenii (tammar wallaby) EX196548 full
0 MSFLDLSFLSMGMWSLGAGALGAAVLSLILANTDVFLTKSVTATLEFLEDIELKTLDN 1
2 KTFKARELWEHRGAVIMAVRRPGCFLCREE 0
0 AADLSALKPQLDQLGIPLYAVVKEKIGSEVEDFQPYFKGKIFLDER 0
0 KKFYGPQKRKMMFMGFVRLGVWQNFFRARSKGFSGNLEGEGFILGGVYVIGPRKQ 0
0 GILLDHREKELGDKVNPASVLEACKKIKLHA* 0

>SELU1_monDom Monodelphis domestica (opossum) tgt-cys
0 MSFLDLNFFSMSMWSLGAGALGAAALSLILANTDLFLTKSVDATLEFLEEIQLKTLDN 1
2 ESPKTFKARELWEHRGAVIMAVRRPGCFLCREV 0
0 AADLSALKPQLDLLGVPLYAVVKEKIGSEVENFQPYFKGKIFLDER 0
0 KKFYGPQKRKMMFMGFVRLGVWQNFFRARSKGFSGNLEGEGFVLGGVYVIGPGKQ 0
0 GILLEHREKEFGDKVNPASVLEAAKKIKPHTSTSEGK* 0

>SELU1_oan data Ornithorhynchus anatinus (platypus) taa early stop full
0 MPLPPDLGLFNLGMWSVGVGALGAAAVGLLLANTDLLLTKPEKATLEYLEDTELKTLGK 1
2 EPRTFKARELWQRNGAVIMAVRRPGuFLCREE 0
0 AAELSSLKPQLDRLGVPLYAVVKEKIGTEVEDFQPYFKGEIFLDER 0
0 KKFYGPHKRKMLFLGFIRLGVWQNFLRARNRGFSGNLEGEGLILGGVYVLGAGKQ 0
0 GILLEHREREFGDKVSPASVLEAAQRIKPQPL* 0

>SELU1_tacAcu Tachyglossus aculeatus (echidna) 454:EUEMSW405C31QQ (74%) tSASEKK terminus? frag
0 1
2 EPRTFKARELWQRNGAVIMAVRRPGuFLCREE 0
0 AAELSSLKPQLDQLGVPLYAVVKENIGTEVEDFQPYFKGEIFLDER 0
0 KRFYGPHKRKMLFLGLIRLGVWQNFIRARNKGFPPVTWEGEG     0
0 GVLLEHREREFGDKVSPASVLEAAQKIKPQ* 0

>SELU1_gga Gallus gallus (chicken)
0 MSFLPDFGIFTMGMWSVGLGAVGAAITGIVLANTDLFLSKPEKATLEFLEAIELKTLGS 1
2 EPRTFKASELWKKNGAVIMAVRRPGuFLCREE 0
0 ASELSSLKPQLSKLGVPLYAVVKEKIGTEVEDFQHYFQGEIFLDEK 0
0 RSFYGPRKRKMMLSGFFRXGVWQNFFRAWKNGYSGNLEGEGFTLGGVYVIGAGRQ 0
0 GVLLEHREKEFGDKVSLPSVLEAAEKIKPQAS* 0 

>SELU1_tgu Taeniopygia guttata (finch)
0 msflpdfgiFTMGMWSVGLGAIGAAVTGIVLANTDLFLSKPEKATLEFLEEIELKTLGS 1
2 EKRTFKAGELWKQNGAVIMAVRRPGuFLCREE 0
0 ASELSSLKPQLSKLGVPLYAVVKENIGTEVEDFQHYFKGEIFLDEK 0
0 KGFYGPRRRKMMLSGFFRLGVWQNFVRAWRSGYSGNLEGEGFTLGGVYVIGAGRQ 0
0 GVLLEHREKEFGDKVSLPSVLEAAEKIKPQAS* 0 

>SELU1_anoCar Anolis carolinensis (lizard)
0 MWTIGLGAIGAAVTGIILANTDLFLSKAEQASLDFLEAIDLKTLGE 1
2 NQRTFKAEELWKKNGAVIMAVRRPGuFLCREV 0
0 AAELSSLKPQLDKLGVPLYAVVKENLGTEVMDFQPYFKGEIFLDEK 0
0 KQFYGPQKRKMLFMGFIRCSVWRNFFRAWKSGYTGNIDGEGFVLGGVFVVGPGKQ 0
0 GVLLEHREKEFGDKVSLDAVLEAVKNIQPQPSEKDK* 0

>SELU1_fugRer Fugu rubripes (fugu)
0 MGLLAKLLAAVGGFVTAVMNSVTDAFLTPPLRATLEHLEETDLKTLSG 1
2 ALVIRLIPTRTETKTFKAKSLWENSGAVVMAVRRPGuFLCRE 0
0 EAAELSSLKPRLDQLGVPLYAVVKEDVGTEIQNFRPYFQGEIFLDEK 0
0 RRFYGPRERKMGLLGFLRVGVWMNGLRAFRSGFMGNVLGEGFVLGGVFVIGREQQ 0
0 GILLEHREREFGDKVNIEDVIQAVDRIAQELMPVTQN* 0

>SELU1_gasAcu Gasterosteus aculeatus (stickleback) chrVI.790.1 length=214
MGMWSLGLGAVGAALAGIFLANTDLCLPKAASASLENLEDADLRS
KGRSLWDKNGAVVMAVRRPGuFLCREV
ASGLSSLKPQLEELGVPLVAVVKEDVGTEIRDFRPHFAGDIFIDEK
SFYGPLQRKMGGLGFIRLGVWQNFMRAWRSGYQGNMNGEGFILGGVFVFGAGNQ
GILLEHREKEFGDKVQIADVLEAVKKIVPAK*

>SELU1_calMil Callorhinchus milii (elephantfish) frag
2 ENRTFRASELWAGRGAVIMAVRRPGuFLCRE 0
0 AAALSSLRPSLAQLGVPL
0 GHLLEHREKEFGDAVNLTAVMEAAGKISPRQSAE* 0	
	
>SELU1_squAca Squalus acanthias (spiny dogfish) also selenocysteine
0 MVVVVEDFHMGLWTLGLGALGAAITGVILANTDLLLPKAETASLAYLSGAELRTLDR 1
2 EERTLKAGDLWSRSGAVIMVVRRPGuFLCREE 0
0 AAEISSLRPQLDELGVPLYGVIKENINNELKNFQPFFKGEIFLDVE 0
0 MRFYGPKPRTMGLMGFMRLGVWKNFVRAWQKGFSGNTDGEGFILgGVFVIGAGQQ 0
0 GVLLEHREKEFGDVVNISSVLEARRKIETQRTEP* 0 


SEPW1: 26 vertebrate sequences

>SEPW1_homSap  Homo sapiens (human) Selenoprotein W  chr19 87 aa uc002phn.1 has retroprocessed pseudogene
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 CGEGTPQATGFFEVMVAGKLIHSKK 0
0 KGDGYVDTESKFLKLVAAIKAALAQG* 0

>SEPW1_panTro Pan troglodytes (chimp)
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 RGEGTPQATGFFEVMVAGKLIHSKK 0
0 KGDGYVDTESKFLKLVAAIKAALAQG* 0

>SEPW1_ponPyg Pongo pygmaeus (orang_sumatran) CR926472 
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 CGEGTPQATGFFEVMVAGKLIHSKK 0
0 KGDGYVDTESKFLKLVAAIKAALAQG* 0

>SEPW1_macMul Macaca mulatta (rhesus)
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 CGEGTPQATGFFEVMVAGKLIHSKK 0
0 KGDGYVDTESKFLKLVAAIKAALAQG* 0

>SEPW1_macFas Macaca fascicularis (cynomolgus_monkey) AB169486 
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 CGEGTPQATGFFEVMVAGKLIHSKK 0
0 KGDGYVDTESKFLKLVAAIKAALAQG* 0

>SEPW1_papAnu Papio anubis (baboon) EY285690 
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 CGEGTPQATGFFEVMVAGKLIHSKK 0
0 KGDGYVDTESKFLKLVAAIKAALAQG* 0

>SEPW1_calJac Callithrix jacchus (marmoset)
0 MALTVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 SGEGTPQATGFFEVTVAGKLIHSKK 0
0 KGDGYVDTESKFLKLVAAIKAALAQG* 0

>SEPW1_micMur Microcebus murinus (mouse_lemur)
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGCLDI 0
0 CGEGTPQATGFFEVMVAGKLVHSKK 0
0 GDGYVDTESKFLKLV

>SEPW1_musMus Mus musculus (mouse) 
0 MALAVRVVYC 2
1 GAuGYKPK 0
0 YLQLKEKLEHEFPGCLDI 0
0 CGEGTPQVTGFFEVTVAGKLVHSKK 0
0 RGDGYVDTESKFRKLVTAIKAALAQCQ* 0

>SEPW1_ratNor Rattus norvegicus (rat) BC087625 
0 MALAVRVVYC 2
1 GAuGYKPK 0
0 YLQLKEKLEHEFPGCLDI 0
0 CGEGTPQVTGFFEVTVAGKLVHSKK 0
0 RGDGYVDTESKFRKLVTAIKAALAQCQ* 0

>SEPW1_cavPor Cavia porcellus (guinea_pig)
0 MALAVRVVYC 2
1 GAuGYKPK 0
0 YLQLKEKLEDEFPGCLDI 0
0 CGEGTPQTTGFFEVTVAGKLVHSKK 0
0 GGDGFVDTEGKFRKLVAAIKAALAQG* 0

>SEPW1_oryCun Oryctolagus cuniculus (rabbit)
0 MALAVRVVYC 2
1 GAuGYKPK 0
0 YLQLKKKLEDEFPGCLDI 0
0 CGEGTPQVTGFFEVTVAGKLVHSKK 0
0 RGDGYVDTESKFLKLVAAIKAALAQG* 0

>SEPW1_ochPri Ochotona princeps (pika)
0 MALSVRVVYW 2
1 GAuGYKPK 0
0 YLQLKKRLEDEFPGCLDI 0
0 GEGTPQVTGFFEVMVAGKLVHSKK 0
0 SGDGYVDTESKFLKLVAAIKAALAQG* 0

>SEPW1_canFam Canis familiaris (dog)
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGCLDI 0
0 RGEGTPQATGFFEVTVAGKLVHSKK 0
0 RGDGYVDTESKFLRLVAAIKTALAQG* 0

>SEPW1_felCat Felis catus (cat)
0 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGCLDI 0
0 RGEGTPQATGFFEVMVGGKLVHSKK 0
0 RGDGYVDTESKFLKLVAAIKAALAQG* 0 

>SEPW1_bosTau Bos taurus (cow)
0 MAVVVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPSRLDI 0
0 RGEGTPQVTGFFEVFVAGKLVHSKK 0
0 GGDGYVDTESKFLKLVAAIKAALAQA* 0

>SEPW1_oviAri Ovis aries (sheep)
0 MAVVVRVVYC 2
1 GAuGYKPK 0
0 YLQLKKKLEDEFPSRLDI 0
0 CGEGTPQVTGFFEVFVAGKLVHSKK 0
0 GGDGYVDTESKFLKLVAAIKAALAQA* 0 

>SEPW1_susScr Sus scrofa (pig) AF380118 
0 MGVAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGRLDI 0
0 CGEGTPQVTGFFEVLVAGKLVHSKK 0
0 GGDGYVDTESKFLKLVAAIKAALAQG* 0

>SEPW1_eriEur Erinaceus europaeus (hedgehog) 
0 MALAVRVVYC 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGCLDI 0
0 RGEGTPQGTGFFEVLVAGKLVHSKK 0
0 KGDGYVDTETKFLKLVTAIKAALAQG* 0

>SEPW1_sorAra Sorex araneus (shrew)
0 2
1 GAuGYKSK 0
0 YLQLKKKLEDEFPGCVDV 0
0 CGEGTPQVTGFFEVMVAGKLVHSKK 0
0 RGDGYVDSESKYVRLVTAIKTALAQA* 0

>SEPW1_Choloepus hoffmanni (sloth)
0 MALAVRVVYW 2
1 GAuGYKPK 0
0 YVQLKKKLEDEFPGCLDI 0
0 SGEGTPQTTGFFEVMVAGKLVHSKK 0
0 QKGDGFVDTESKFLRLVAAIKAALAQG* 0

>SEPW1_monDom Monodelphis domestica (opossum) diverged
0 MAIQVRVVYW 2
1 GAuGYKPK 0
0 YLLLKKKLEDEYPGLLRH 0
0 NGEGTPEVTGFFEVTVAGKLVHSKK 0
0 AGHGFVDTADKYLQIVAEIKAALA* 0

>SEPW1_ornAna Ornithorhynchus anatinus (platypus) 
0 MASLEAFPRGVVPVHVVYC 2
1 GAuGYKPK 0
0 FLQLKKKLENEFPGQVEI 0
0 SGEGTPQVTGWFEVTVAGKLVHSKK 0
0 EGDGFVDSESKFAKIRMAIKAALVPGY* 0

>SELW_galGal Gallus gallus (chicken) tga confirmed
0 MPLRVTVLYC 2
1 GAuGYKPK 0
0 YERLRAELEKRFPGALEM 0
0 RGQGTQEVTGWFEVTVGSRLVHSKK 0
0 NGDGFVDTNAKLQRIVAAIQAALP* 0

>SELW_anoCar Anolis carolinensis (lizard) 
1 GAuGYSPK 0
0 YQQLKRGLEKEFPGKLEI 0
0 TGEGTPQVTGWFEVTVAGKLVHSKK 0
0 NGDGFVDNDTKLHKILMAIKAALA* 0

>SELW_xenTro Xenopus tropicalis (frog) tga confirmed 
0 MPDTMVKVNVVYC 2
1 GAuGYLSK 0
0 FRRLKKELEQRFPGKLSI 0
0 DGEGTERMTGWFEVSINGKLVHSKK 0
0 NGDGYVDNDAKLQKIILAIEAALKQ* 0

>SELW_danRer Danio rerio (zebrafish) tga confirmed
0 MTVKVHVVYC 2
1 GGuGYRPK 0
0 FIKLKTLLEDEFPNELEI 0
0 TGEGTPSTTGWLEVEVNGKLVHSKK 0
0 NGDGFVDSDSKMQKIVTAIEQAMGK* 0

>SEPW1_takRub
0 MGVTIRVEYC 2
1 GGuGYGPR 0
0 YEELARVVRAEFPDADVSGFVGRM 1
2 GSFEIQINEQLIFSKLETGGFPYEDD 0

>SEPW2_calMil Callorhinchus milii (elephantfish)
1 GAuGYEPRYQKLAIVIKDEFPDADVSGKVGRT 1
2 GSFEIEINGQLIFSKLETGGFPYEND 0
0 ISEAVQKANNGEELQKIENSRPPCVIL* 0
0 VMHAIQCVSDGKPVEKITKSRPPCVIM* 0

DIO1, DIO2, and DIO3: 24 vertebrate deiodinases

>DIO1_homSap Homo sapiens (human) iodothyronine deiodinase  type I; 4 exons on chr1p32.3
0 MGLPQPGLWLKRLWVLLEVAVHVVVGKVLLILFPDRVKRNILAMGEKTGMTRNPHFSHDNWIPTFFSTQYFWFVLKVRWQRLEDTTELGGLAPNCPVVRLSGQRCNIWEFMQ 1
2 GNRPLVLNFGSCTuPSFMFKFDQFKRLIEDFSSIADFLVIYIEEAHAS 1
2 DGWAFKNNMDIRNHQNLQDRLQAAHLLLARSPQCPVVVDTMQNQSSQLYAALPERLYIIQEGRILYK 0
0 GKSGPWNYNPEEVRAVLEKLHS 0*

>DIOI_ratNor Rattus norvegicus (rat)  7-258 thioredoxin-like
MGLSQLWLWLKRLVIFLQVALEVATGKVLMTLFPERVKQNILAMGQKTGMTRNPRFAPDNWVPTFFSIQY
FWFVLKVRWQRLEDRAEYGGLAPNCTVVRLSGQKCNVWDFIQGSRPLVLNFGSCTuPSFLLKFDQFKRLV
DDFASTADFLIIYIEEAHATDGWAFKNNVDIRQHRSLQDRLRAAHLLLARSPQCPVVVDTMQNQSSQLYA
ALPERLYVIQEGRICYKGKPGPWNYNPEEVRAVLEKLCIPPGHMPQF

>DIO1_susScr Sus scrofa (pig)
MELPLPGLWLKRLWVLFQVALHVAMGKVLMTLFPGRVKQDILAMSQKTGMAKNPHFSHENWIPTFFSAQY
FWFVLKVRWQRLEDKTEEGGLAPNCPVVSLSGQRCHIWDFMQGNRPLVLNFGSCTUPSFIFKFDQFKRLI
EDFSSIADFLIIYIEEAHASDGWAFKNNVDIKNHQNLQDRLRAAHLLLDRSPQCPVVVDTMKNQSSRLYA
ALPERLYVLQAGRILYKGKPGPWNYHPEEVRAVLEKLHS

>DIO1_sunMur Suncus murinus (shrew)
MGLPGLGLLLKRFGVLVRVALKVAVGKVLLTLWPSAIRPHLLAMSEKTGMAKNPRFTYEDWAPTFFSTQY
FWFVLKVNWQQLEDRTKQGDIAPDSPVVHLSGQRARLWDFMQGNRPLVLNFGSCSuPSFLFKFDQFKRLV
EDFSSVADFLTVYIEEAHASDGWAFKNNVDIRRHRDLQERLQAARLLLDRNPGCPVVVDTMENRSSQLYA
ALPERLYVLQEGRILYKGGPGPWNYHPEEVHAVLEQLCRSSAQSPRL

>DIO1_galGal Gallus gallus (chicken) Type I iodothyronine deiodinase 4 exons chr8
MLSIRVLLHKLLILLQVTLSVVVGKTMMILFPDTTKRYILKLGEKSRMNQNPKFSYENWGPTFFSFQYLLF
VLKVKWRRLEDEAHEGRPAPNTPVVALNGEMQHLFSFMRDNRPLILNFGSCTuPSFMLKFDEFNKLVKDF
SSIADFLIIYIEEAHAVDGWAFRNNVVIKNHRSLEDRKTAAQFLQQKNPLCPVVLDTMENLSSSKYAALP
ERLYILQAGNVIYKGGVGPWNYHPQEIRAVLEKLK

>DIO1_xenTro Xenopus laevis (frog) tga confirmed
MLRYIQKALILFFLFLYVVVGKVLMFLFPQTMASVLKSRFEISGVH-DPKFQYEDWGPTFFTYKFLRSVLEIMWMRLE
DEAFVGHSAPNTPVVDLSGELHHIWDYLQGTRPLVLSFGSCT*PPFLFRLG
EFNKLVNEFNSIADFLIIYIDEAHAADEWALKNNLHIKKHRSLQDRLAAAKRLME ESPSCPVVLDTMSNLCSAKYAA
LPERLYILQEGKIIYKGKMGPWGYKPEEVCSVLEKKK*

>DIO1_danRer Danio rerio (zebrafish)
MGSAVGFALRKLFVYISAVLMVCAAILQMSMLKLLSFISPGRMRKIHMKMGERSTMTQNPKFRYEDWGPA
FFSLAFIKTLFFVNWCSLGLEAFEGHAAPDSALITLDRQKTSVHRFLKGNRPLVLSFGSCTuPPFLYKLD
EFKQLVKDFSNVADFLIVYLAEAHATDAWAFKNNVDISVHKNLEERLAAARTLLKEDPPCPVVVDEMNNI
TASKYGALPERLYVIQSGKVIYQASDLGGQA

>DIO1_tru fugu  4 exons genome glitches
0 MLLQKLAMYLSTAGLFCFMITLNVVLWILNIVAPALAKKIALKMGEKATMTQDPLFKYEDWGLTFASTALVKTASRHMWLSLGQEAFAGLEAPDSPVVTMERKRSSIGEFMK 1
2 TNRPLVLNFGSCTuPPFMFKLEEFKQLVRDFSDVADFLVVYIAEAHST 1
2 DGWAFKNNFDINQHRNLEDRLSAAQILVQKDPLCPVVVDDMNNSCAIKYGALPERLYVLQAGKVLYK 0
0 GAVGPWGYDPREVRSYLEKMK *0

>DIO2_homSap Homo sapiens (human) iodothyronine deiodinase type II; 2 exons size 8,781 bp on chr14q31.1
0 MGILSVDLLITLQILPVFFSNCLFLALYDSVILLKHVVLLLSRSKSTRGEWRRMLTSEGLRCVWKSFLLDAYKQ 0
0 VKLGEDAPNSSVVHVSSAEGGDNSGNGTQEKIAEGATCHLLDFASPERPLVVNFGSATuPPFTSQLPAFRKLVEEFSSVADFLLVYIDEAHPSDGWAI
PGDSSLSFEVKKHQNQEDRCAAAQQLLERFSLPPQCRVVADRMDNNANIAYGVAFERVCIVQRQKIAYLGGKGPFSYNLQEVRHWLEKNFSKRuKKTRLAG 0*

>DIO2_ratNor Rattus norvegicus (rat) 85-253  thioredoxin-like
MGLLSVDLLITLQILPVFFSNCLFLALYDSVILLKHVALLLSRSKSTRGEWRRMLTSEGLRCVWNSFLLD
AYKQVKLGEDAPNSSVVHVSNPEAGNNCASEKTADGAECHLLDFACAERPLVVNFGSATuPPFTRQLPAF
RQLVEEFSSVADFLLVYIDEAHPSDGWAVPGDSSMSFEVKKHRNQEDRCAAAHQLLERFSLPPQCQVVAD
RMDNNANVAYGVAFERVCIVQRRKIAYLGGKGPFSYNLQEVRSWLEKNFSKRXILD

>DIO2_susScr Sus scrofa (pig)
MGILSVDLLITLQILPVFFSNCLFLALYDSVILLKHVVLLLSRSKSTRGEWRRMLTSEGMRCIWKSFLLD
AYKQVKLGEDAPNSSVVHVSNPEGSNNHGHGTQEKTVDGAECHLLDFANPERPLVVNFGSATUPPFTSQL
PAFSKLVEEFSSVADFLLVYIDEAHPSDGWAVPGDSSLSFEVKKHQNQEDRCAAAHQLLERFSLPPQCRV
VADRMDNNANVAYGVAFERVCIVQRQKIAYLGGKGPFYYNLQEVRRWLEKNFSKR

>DIO2_galGal Gallus gallus (chicken) deiodinase, iodothyronine, type II 2 exons chr5
MGLLSADLLITLQILPVFFSNCLFLALYDSVILLKHMVLFLSRSKSARGEWRRMLTSEGLRCVWNSFLLD
AYKQVKLGGEAPNSSVIHIAKGNDGSNSSWKSVGGKCGTKCHLLDFANSERPLVVNFGSATuPPFTSQLS
AFSKLVEEFSGVADFLLVYIDEAHPSDGWAAPGISPSSFEVKKHRNQEDRCAAAHQLLERFSLPPQCQVV
ADCMDNNANVAYGVSFERVCIVQRQKIAYLGGKGPFFYNLQEVRLWLEQNFSKRUNPLSTEDLSTDVSL

>DIO2_xenTro Xenopus laevis (frog)
GTRRERERLSVDLLITLQILPGFFSNCLFLALYDSVVLVKHVLLQLNRSKSSQSQWRRMLTPEGLRCVWN
SFLLDAYKQVKLGQDAPNSNVIQVSNNRTSKSVQRKFAGKCHLLDFASSERPLVVNFGSATuPPFISQLP
AFSKLVEEFSSVADFVLVYIDEAHPSDGWAAPGTASYEVKKHRSQEERCAAASKLLQHFSIPPQCQVVAD
CMDNNANVAYGVSFERVCIVQRQKIVYLGGKGPFFYNIQEIRRWLELSFGKR

>DIO2_ranCat Rana catesbiana (bullfrog)
MGLLSVDLLITLQILPGFFSNCLFLALYDSVVLVKHVLLQLNRSKSSHGQWRRMLTPEGLRCVWNSFLLD
AYKQVKLGGDAPNSNVIHVTDKNSSSGKPGTPCHLLDFASSERPLVVNFGSATuPPFISQLPAFSKMVEE
FSAVADFLLVYIDEAHPSDGWAAPGISSYEVKKHRNQEDRCAAANKLLEQYSLPPQCQVVADCMDNNTNA
AYGVSFERVCIVQRQKIVYLGGKGPFFYNLQEVRQWLELTFGKKAESGQTGTEK

>DIO2_nfo lungfish Neoceratodus forsteri exons unknown
MGLLSVDLLITLQILPWFFSNCLFLALYDSVVLLKHVILLLSCSKSSRGEWRRMLTSEGLRTVWNSFLLD
AYKQVKLGGDAPNSKVVRVTSGCCRRRSFSGKGESECHLLDFASSNRPLVVNFGSATUPPFISQLPTFRK
LVEEFSDVADFLLVYIDEAHPADGWAAPGVATKSFEVKKHRSQEERCVAAHKLLEHFSLPPQCQVVADCM
DNNTNVAYGVSFERVCIVQRQKIAYLGGKGPFFYNLKEVRHWLEQTYRKRUVPTCELIM

>DIO2_danRer Danio rerio (zebrafish)
MGLLSVDLLVTLQILPGFFSNCLFFVLYDSIVLVKRVVSLLSCSGSTGEWQRMLTTAGVRSIWNSFLLDA
YKQVKLGEAAPNSKVVKVTGINRCWSISGKTHNQCHLLDFESPDRPLVVNFGSATuPPFISQLPVFRRMV
EEFSDVADFLLVYIDEAHPSNGWVGPP MENFSFEVRKHRNLEERM
FAARTLLEHFSLPPQCQLVADCM  DNNANIAYGVSYERVCIVQKNKIAYLGGKGPFFYNLKDVRRWLEKC

>DIO3_homSap Homo sapiens (human) size iodothyronine deiodinase type III; 1 exon 2,502 bp chr14q32.31
0 MLRSLLLHSLRLCAQTASCLVLFPRFLGTAFMLWLLDFLCIRKHFLGRRRRGQPEPEVEL
NSEGEEVPPDDPPICVSDDNRLCTLASLKAVWHGQKLDFFKQAHEGGPAPNSEVVLPDGF
QSQHILDYAQGNRPLVLNFGSCTuPPFMARMSAFQRLVTKYQRDVDFLIIYIEEAHPSDG
WVTTDSPYIIPQHRSLEDRVSAARVLQQGAPGCALVLDTMANSSSSAYGAYFERLYVIQS
GTIMYQGGRGPDGYQVSELRTWLERYDEQLHGARPRRV 0*

>DIO3_ratNor Rattus norvegicus (rat) 103-266  thioredoxin-like
MLRSLLLHSLRLCAQTASCLVLFPRFLGTAFMLWLLDFLCIRKHFLRRRHPDHPEPEVELNSEGEEMPPD
DPPICVSDDNRLCTLASLKAVWHGQKLDFFKQAHEGGPAPNSEVVRPDGFQSQRILDYAQGTRPLVLNFG
SCTuPPFMARMSAFQRLVTKYQRDVDFLIIYIEEAHPSDGWVTTDSPYVIPQHRSLEDRVSAARVLQQGA
PGCALVLDTMANSSSSAYGAYFERLYVIQSGTIMYQGGRGPDGYQVSELRTWLERYDEQLHGTRPRRL

>DIO3_susScr Sus scrofa (pig) 
MLHSLLLHSLRLCAQTASCLVLFPRFLGTACMLWLLDFLCIRKHLLGRRRRGEPETEVELNSDGDEVPPD
DPPICVSDDNRLCTLASLRAVWHGQKLDFFKQAHEGGPAPNSEVVLPDGFQNQHILDYARGNRPLVLNFG
SCTUPPFMARMSAFQRLVTKYQRDVDFLIIYIEEAHPSDGWVTTDSPYSIPQHRSLEDRVSAARVLQQGA
PECSLVLDTMANSSSSAYGAYFERLYVIQSGTIMYQGGRGPDGYQVSELRTWLERYDQQLHGPQPRRV

>DIO3_galGal Gallus gallus (chicken) type III iodothyronine deiodinase 1 exon chr5 11mbp separation
AACILLFPRFLLTAVMLWLLDFLCIRKKMLTMPTAEEAAGAGEGPPPDDPPVCVSDSNRMFTLESLKAVW
HGQKLDFFKSAHVGSPAPNPEVIQLDGQKRLRILDFARGKRPLILNFGSCToPPFMARLRSFRRLAADFV
DIADFLLVYIEEAHPSDGWVSSDAAYSIPKHQCLQDRLRAAQLMREGAPDCPLAVDTMDNASSAAYGAYF
ERLYVIQEEKVMYQGGRGPEGYKISELRTWLDQYKTRLQSPGAVVIQV

>DIO3_xenTro Xenopus laevis (frog)
MLHCAGPHTGKLVKQVAACCLLLPRFLLTGLMLWLLDFQCIRRRVLLTAREESTAEHEDPPLCVSDSNRM
CTVESLRAVWHGQKLDYFKSAHLGCSAPNTEVVMLEGRRLCKILDFSQGKRPLVVNFGSCTuPPFMARLQ
AYRRLAAQHVGIADFLLVYIEEAHPSDGWLSTDASYQIPQHQCLQDRLAAAQLMLQGAPGCRVVVDTMDN
SSNAAYGAYFERLYIVLEGKVVYQGGRGPEGYKISELRMWLEQYQQGLMGTKGSGQVVIQV

>DIO3_ranCat Rana catesbiana (bullfrog)
MLPAPHTCCRLLQQLLACCLLLPRFLLTVLLLWLLDFPCVRRRVIRGAKEEDPGAPEREDPPLCVSDTNR
MCTLESLKAVWYGQKLDFFKSAHLGGGAPNTEVVTLEGQRLCRILDFSKGHRPLVLNFGSCTuPPFMARL
QAYQRLAAQRLDFADFLLVYIEEAHPCDGWLSTDAAYQIPTHQCLQDRLRAAQLMLQGAPGCRVVADTMT
NASNAAYGAYFERLYVILDGKVVYQGGRGPEGYKIGELRNWLDQYQTRATGNGALVIQV

>DIO3_nfo lungfish Neoceratodus forsteri 
0 MYQSSGVHTMNEVLKQAFACFILLPRFLVTALMLWLLDFLCVRRRVLLHMSRRQEASDLPDEPELCVSDS
NRMFTLKSLRAVWHDQKLDFFKAAHIGLVAPNTEVIKLEGQRKAKILEFGGGKRPLILNFGSCTuPPFMARLKAFRGVATQYKDVADFLLIYIEEAHPSDGWVSTDAPYQIPKHQCLEDRLKAAQLMNLEIPGCLVVVDTMDNASNAAYGAFFERLYIVQQERVVYQGGRGPEGYKISELKNWLDQYKSQLQNSSAVVIQV 0*

>DIO3a_danRer Danio rerio (zebrafish)
SALKNAAVCVLLLPRFLLAALMLCLLDFLCIRRKLLLKMQEGAFSSPDDPPLRVSDSN
KMFTLESLRAVWYGQKLDfFkSARLGGAAPNTEVFPLDGDARAAERILDYARGRRPLILN
FGSCSuPPFMTRLSAFQRVARQYADIADSLLVYIEEAHPSDGWVSSDAPVQIPRHRCLED
RLRAAQMLHRDAAGNAGVVDSMQNS

>DIO3b_dre Danio rerio
GALKNALVCLLILTRFLVAAFMLWCLDFLCVRKRVLVHLQERAYAEQEEEPL
CISDSSRMFSWESLKAVFHGHKLDYMKSARLGHAAPDSEVFPLAEPRRGRVLEFARGHRP
LVLSFGSCSuPPFMRRLKAFRRLVLRYADVADALLIYIEEAHPSDGWRSSDAPHQIRRHR
SLEERLSAARLMEREAPGCAVVADGMENAANSAYGAYFDRLYIVQDGRVVYQ

MSRB1: 17 vertebrate sequences


>MSRB1_homSap Homo sapiens (human) SEPX1 (uc002cng.1) SELX SELR human chr16
0 MSFCSFFGGEVFQNHFEP 1
2 GVYVCAKCGYELFSSRSKYAHSSPWPAFTETIHADSVAKRPEHNRSEALK 0
0 VSCGKCGNGLGHEFLNDGPKPGQSRFuIFSSSLKFVPK 1
2 GKETSASQGH* 0

>MSRB1_macMul Macaca mulatta (rhesus)
0 MSFCSFFGGEVFQNHFEP 1
2 GVYACAKCGYELFSSRSKYAHSSPWPAFTETIHADSVAKRPEHNRPGALK 0
0 VSCGKCGNGLGHEFLNDGPKPGQSRFuIFSSSLKFVPK 1
2 GKETSTSQGH* 0

>MSRB1_musMus Mus musculus (mouse) exon break just at stop codon NP_038787
0 MSFCSFFGGEVFQNHFEP 1
2 GVYVCAKCSYELFSSHSKYAHSSPWPAFTETIHPDSVTKCPEKNRPEALK 0
0 VSCGKCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFVPK 1
2 GKEAAASQGH* 0

>MSRB1_ratNor Rattus norvegicus (rat)
0 MSFCSFFGGEVFQNHFEP 1
2 GVYVCAKCGYELFSSRSKYAHSSPWPAFTETIHEDSVAKCPEKNRPEALK 0
0 VSCGKCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFIPK 1
2 GKEAPASQGD* 0

>MSRB1_cavPor Cavia porcellus (guinea_pig)
0 MSFCSFFGGEVFQNHFES 1
2 GIYVCAKCGYELFSSRSKYAHSSPWPAFTDTIHADSVAKCPEHNRPGALK 0
0 VSCGKCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFIPK 1
2 DKETSASQGH* 0

>MSRB1_canFam Canis familiaris (dog) frag
0 MSFCSFFGGEVFQNHFEP 1
2 GVYVCAKCGYELFSSRSKYAHSSPWPAFTETIHADSVAKRPERNSPEALK 0
0 VSCGKCGNGLGHEFLNDGPKPGKSRFuIFSSSLKFVPK 1
2 GKGTSGSQEA* 0

>MSRB1_bosTau Bos taurus (cow)
0 MSFCSFFGGEIFQNHFEP 1
2 GIYVCAKCGYELFSSRSKYAHSSPWPAFTETIHADSVAKRPEHNRPGAIK 0
0 VSCGRCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFIPK 1
2 AEETSASQGQ* 0

>MSRB1_equCab Equus caballus (horse)
0 MSFCSFFGGEIFQNHFEP 1
2 GIYVCAKCGYELFSSRSKYAHSSPWPAFTETIHADSVAKRPEHNRPEALK 0
0 VSCGKCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFVPK 1
2 GKESSASQGQ* 0

>MSRB1_loxAfr Loxodonta africana (elephant)
0 MSFCSFFRSEVFQNHFEP 1
2 GVYVCAKCGYELFSSRSKYAHSSPWPAFTETIHADSVGKHPEHNRPEALK 0
0 VSCGKCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFIPK 1
2 GKETSASQGK* 0

>MSRB1_monDom Monodelphis domestica (opossum) frag
2 GVYVCAKCGYELFSSRSKYHHSSPWPAFTETIHADSVSKRPESGRSEALK 0
0 VSCGKCGNGLGHEFINDGPKKGQSRFuIFSSSLKFVPKG 1

>MSRB1_ornAna Ornithorhynchus anatinus (platypus)
0 1
2 GTYVCARCGYELFSSRSKYEHSSPWPAFTETIHPDSVAKREEPGRPNAFK 0
0 VSCGKCGNGLGHEFLNDGPRRGQSRFuIFSSLKFIPK 1
2 GKDSQAAQDK* 0

>MSRB1_galGal Gallus gallus (chicken)
0 MSFCSFFGGEVFKDHFEP 1
2 GVYVCARCGYELFSSRAKYEHSSPWPAFTETIHEDSVAKRKERPGALK 0
0 VSCGKCGNGLGHEFLNDGPKRGQSRFuIFSSSLKFIPK 1
0 GKSPQEN* 0

>MSRB1_anoCar Anolis carolinensis (lizard)
0 MSFCAFSGGEIYQGHFEA 1
2 GMYVCSKCGFELFSSKSKYAHSSPWPAFTETIHDDSITKYLERPNAFK 0
0 VLCGKCGNGLGHEFINDGPKKGQSRFZIFSSSLKFVPK 1

>MSRB1_xenTro Xenopus tropicalis (frog)
0 MSFCSFFGGEVYKDHFKS 1
2 GIYVCSECNYELFSSRSKYQHSSPWPAFTETVHKDSISKYLERPNAYK 0
0 VSCGKCGNGLGHEFINDGPKKGQSRFuIFSSSLKFIPK 0
0 DKVDGEVQRE* 0

>MSRB1_danRer Danio rerio (zebrafish) tga confirmed
0 MSFCSFSGGEIYKDHFES 1
2 GMYVCAQCGYELFSSRSKYEHSSPWPAFTETIHEDSVSKQEERWGAYK 0
0 VRCGKCGNGLGHEFVNDGPKHGLSRFuIFSSSLKFI
0 PKVKNEQQ* 0

>MSRB1_ictFur Ictalurus furcatus (fish)
0 MAFCSFKGGEIFKDHYEP 1
2 GIYVCVKCGYELFSSTSKYKLSSPWPAFTTTIHEDSVSKQEERPGALK 0
0 IRCGKCNNGLGHEFLNDGPKHGLSRFuIFSSSLKFV 0
0 PKDKGGQ* 0

>MSRB1_oncMyk Oncorhynchus mykiss (trout)
0 MSFCSFFGGEVFKDHFKT 1
2 GLYMCAQCGHQLFSSRSKYEHSSPWPAFTETILQDSVSKHEERPGAFK 0
0 VRCGKCGNGLGHEFVGDGPKKGLSRFuIFSSSLKFV 0
0 PKDKVDGQ* 0

>MSRB1_salSal Salmo salar (salmon)
0 MSFCSFFGGEVFKDHFKT 1
2 GLYVCAQCGHQLFSSRSKYEHSSPWPAFTETVLQDSVSKHEERPGAFK 0
0 VRCGKCGNGLGHEFVGDGPKKGLSRFuIFSSSLKFV 0
0 PKDKVDGQ* 0

Reference sets of vertebrate SECIS elements

SEPW1: 13 SECIS sequences

>selW_SECIS_homSap Homo sapiens (human)
agggaccttgacccagcccctctcagcagacgcttcatgataggaaggactgaaaagtcttgtggacacctggtctttccctgatgttctcgtggctgctgttgggggcagagattgacgcccccggtctttgcct

>selW_SECIS_panTro Pan troglodytes (chimp) 
AGGGACCTTGACCCAGCCCCTCTCAGCAGACGCTTCATGATAGGAAGGACTGAAAAGTCTTGTGGACACCTGGTCTTTCCCTGATGTTCTtGTGGCTGCTGTTGGGGGCAGAGATTGACGCCCCCGGTCTTTGCCT

>selW_SECIS_ponPyg Pongo pygmaeus (orang_sumatran)
AGTCCAGGGACCTTGACCCAGCCCCTCTCAGCAGACGCTTCATGATAGGAAGGACTGAAAAATCTTGTGGACACCTGGTCTTTCCCTGATGTTCTCGTGGCTGCTGTTGGGGGCAGAGATTGACGCCCCTGGTCTTTGCCT 

>selW_SECIS_macMul Macaca mulatta (rhesus)
AGcGACCTTGACCCAGCCCCTCTCAGCAGACGCTTCATGATAGGAAGGACTGAAAAGTCTTGTGGACgCCTGGTCTTTCCCTGATGTTCTCGTGGCTGCTGTTGGGGGCAGAGATTGACGCCgCtGGTCTTTGCCT

>selW_SECIS_musMus Mus musculus (mouse)
ACTGAAATGTCTTAGACTTGGCCCAGCCCCTCGTGGCAGACGCTTCATGATGGGAAGAACTGAAATGTCTCGTGGACGCCTGGTCTTTCCCTGATGTCCCTGCGACTGCCACGTAGGGGCAGAGACTGATGCCCCTGTGGGTGCCT

>selW_SECIS_ratNor Rattus norvegicus (rat)
CCTGGCCGGCCTTTCTTGGCAGCCGCTTCATGACAGGAAGGACTGAAATGTCTCAAAGACCTGTGGTCTTTCTTCGATGTTCCTGCGGCCACCAAGTCAGGCCAGAGATGGATTCTGTGTGTGGGTGCCT

>selW_SECIS_oryCun Oryctolagus cuniculus (rabbit)
AGTAACCTTGACCCAGCCCCTTTCATGCCTCAGCCTCGTCTCCATAGGCTAAGACTGGAGAAATGAGTCCCCTGAAGAACTGAAACTGGGGGTAGAGGGTTGGTGTTTTAAGATGTGGATGAGCTGGTCTTTAC

>selW_SECIS_canFam Canis familiaris (dog)
CCAGtGACCTTGgCCCAGCCCCTCgtgGCAGACGCTTCATGATgGGAAGaACTGAAAtGTCTcGTGGACgCCTGGTCTTTCCCTGATGTccctgcgactgccacgtaGGGGCAGAGAcTGAtGCCCCtGGTCTTTGCCT

>selW_SECIS_susScr Sus scrofa (pig)
AGTAACCTTGACCCAGCCCCTTTCATGCCTCAGCCTCGTCTCCATAGGCTAAGACTGGAGAAGTCTTGTGGACGCCTGGTCTTTCCCTGATGTTCTCGTGGCTGCTGTTGGGGGCAGAGATGGATGAGCTGGTCTTTAC

>selW_SECIS_borAnc Boreoeuthere ancestralis (ancestral)
AGTCCAGCAACCTTGGCCCAGCCCCTCTCAGCAGATGCTTCATGACAGGAAGGACTGAAATGTCTTGTGGACGCCTGGTCTTTCCCTGATGTTCTTGTGGCTGCTGGTTGGGGCAGAGATTGACACCCCTGGTCTTTGCCT

>selW_SECIS_dasNov Dasypus novemcinctus (armadillo)
CCAGCAACCTCAGCCCAGCTGCCCTTGGCAGACGCTTCATGAGGGGAAGGACCTAAATGCGTCGTGGATGCCTGGTCTTTCCCTGATGCTCCTTCACCTGCCAGATGGGGCAGAGGTCATTGCCCCTGGTCTTGGCCT

>selW_SECIS_loxAfr Loxodonta africana (elephant)
GGGACCTTGGCCCAGCCCCTTTCAGCAGACACTTCATGACAGGAGGACTGAAATGTCTCCCAGACGCCTGGCTCTTTCCCTGAATCTGTCGGCTGCAGGACAGGGCAGCGGTTGACTCTCTCGTTTTTTGCAT

>selW_SECIS_echTel Echinops telfairi (tenrec)
AGGCCAGAGACCTTGGCCCAGTCCCTCCATGACAGGCAGAACTGAAATGTCCTCTGGACAAGTGGTCTTTTCCAGAAACCCCAGGGCTGCTGGGCCGGAGCCGAGGCTGACAACCCTGGTCTTTGCCT

DIO1, DIO2, and DIO3: 29 SECIS sequences

>DIO1_SECIS_homSap Homo sapiens (human) COVE score: 29
ttttaactctgtgtctttacatatttgtttatgatggccacagcctaaagtacacacggctgtgacttgattcaaaagaaaatgttataag

>DIO1_SECIS_ponPyg Pongo pygmaeus (orang_sumatran) COVE score: 29
ttttaactctgtgtctttacatatttgtttatgatggccacagcctaaagtacacacggctgtgacttgattcaaaagaaaatgttataag

>DIO1_SECIS_macMul Macaca mulatta (rhesus) COVE score: 29
tttcaactctgtgtctttacatatttgtttatgatggccacagcctaaagtacacacggctgtgacttgattcaaaagaaaatgttataag

>DIO1_SECIS_cavPor Cavia porcellus (guinea_pig) COVE score: 24
tgttaactctgcttcttttcatatttgttcatgacggtcacagtctaaagtacacacagctgtgacctgatttgaaagaaaatgttttaag

>DIO1_SECIS_canFam Canis familiaris (dog) COVE score: -
ttttaactctgcttcttttcatgtttgtctatgacggccacagcctaaagcacacacagctgtgacttgatttgaaagaaaatgttttaag

>DIO1_SECIS_felCat Felis catus (cat) COVE score:  -
ttttaactctgcttcttttcacgtttgtctatgacggccacagtctaaagtgcacacagctgtgacttgacttgaacgaaaatgttttaag

>DIO1_SECIS_bosTau Bos taurus (cow) COVE score: 28
ttttaactctgcctcttttcatatttgttcatgacggccacagcctaaagtacacacggctgtgacttgatttgaaagaaaatgttttaag

>DIO1_SECIS_sorAra Sorex araneus (shrew) COVE score: 24
cggaaactcagcttctcttcatatttgtttatgacagccccagctgaaagtacacacagctgtggcttgattggaaagaaaatgttttaag

>DIO1_SECIS_eriEur Erinaceus europaeus (hedgehog) COVE score: 26
tttaactctgctttcttctcatatttgcttatgatggtcacagcttaaagtatacacagctgtgacttgattggaaagaaaatattttaag

>DIO1_SECIS_borAnc Boreoeuthere ancestralis (ancestral) COVE score: 29
ttttaactctgcttcttttcatatttgttcatgatggccacagcctaaagtacacacggctgtgacttgatttgaaagaaaatgttttaag

>DIO1_SECIS_dasNov Dasypus novemcinctus (armadillo) COVE score: -
ttttaactctgcttcttttcatatttgtttatgatggccacagtttaaagtacatacagctgtgacttgatatgaaaaagaaatattttaag

>DIO1_SECIS_loxAfr Loxodonta africana (elephant) COVE score: - 
ttttaactctgcttcttttttcatgtatttatgatgggccacagcctaaagtgcacaacagctgtgacttgatttgaaaaacatctttaag

>DIO1_SECIS_monDom Monodelphis domestica (opossum) COVE score: -
tttccatcctgcttctacaaatatttatttatgacaatcacagcctaaagctcagggcagctgggattcgacgggagaaaaagtttgtaag

>DIO1_SECIS_ornAna Ornithorhynchus anatinus (platypus) COVE score: 24
ccccggatccggttccgtgaatattggtttatgagggtcacagtgtaaagcgcatgcagctgtgacttgatctgagaaaatatttctgcggc
 
 
>DIO2_SECIS_homSap Homo sapiens (human) 5,630 bp utr COVE score: 30
cagagatgtgcagagttgaccagtgtgcggatgataactactgacgaaagagtcatcgactcagttagtggttggatgtagtcacattagtttgcctctc

>DIO2_SECIS_macMul Macaca mulatta (rhesus) COVE score: 30
cagagatgtgcagagttgaccagtgtgcggatgataactactgacgaaagagtcatcgactcagttagtggttggatgtagtcacattagtttgcctctc

>DIO2_SECIS_musMus Mus musculus (mouse) COVE score: 29
cggagatgttcagagctcactggtgtgcgaatgataactactgacgaaagagctgtctgctcagtctgtggttggatgtagtcacacgagtctgcctttctgca

>DIO2_SECIS_ratNor Rattus norvegicus (rat) COVE score: 28
ccgagatgttcggagctcactggtgtgcgaatgataactactgacgaaagagtcatctgctcagtctgtggttggatgtagtcacacgagtctgcctctccatc

>DIO2_SECIS_canFam Canis familiaris (dog) COVE score: 27
ctgggatgtgcagaggtgaccagtgtgcgaatgataactactgatgaaagagtcactgactcagttagtggttggatacagtcacattagttttcctct 

>DIO2_SECIS_borAnc Boreoeuthere ancestralis (ancestral) COVE score: 30
ctgggatgtgcagaggtgaccagtgtgcaaatgataactactgatgaaagagtcattgactcagttagtggttggatgtagtcacattagtttgcctctc

>DIO2_SECIS_dasNov Dasypus novemcinctus (armadillo) iodothyronine deiodinase type II
ctgggaagttcagaggctaccagtgtgccaatgataactactgacgaaagaggcatcgactcagttagtggttggatgtagccacattagtttgcctctc
 
 
>DIO3_SECIS_homSap Homo sapiens (human) COVE score: 31
ttgggtgcacaggagccccactgctgatgacgaactatctctaactggtcttgaccacgagctagttctgaattgcaggggcctcaaagcagca

>DIO3_SECIS_macMul Macaca mulatta (rhesus) COVE score: 30 
ttgggtgcacaggagccccactgctgatgacgaactgtctctaactggtcttgaccacgagctagttctgaattgcaggggcctcaaaacagca

>DIO3_SECIS_musMus Mus musculus (mouse) COVE score: 26  
ttgggtgcgctggagccctggctgctgatgacgaaccgcctctaactgggcttgaccacgggtcggctctgaattgcagagaggctcgaaacagc

>DIO3_SECIS_ratNor Rattus norvegicus (rat) COVE score: 26 
ttgggtgcgctggagccctggctgctgatgacgaaccgcctctaactgggcttgaccacgggtcggctctgaattgcagagaggctcgaaacagc

>DIO3_SECIS_canFam Canis familiaris (dog) COVE score: 26
ttgggtgctggcgagccccactgctgatgacgagccgcctctaactggtcttgaccacgagctggttctgagttgcaggggggcttgcagcggc

>DIO3_SECIS_bosTau Bos taurus (cow) COVE score: 30
ttgggtgctcacgagccccactgctgatgaagagctgtctctaactggcctcgaccacgagctggttctgatttgcaggaggctcgcagcagc

>DIO3_SECIS_borAnc Boreoeuthere ancestralis (ancestral) COVE score: 27
ttgggtgctcaggagccccactgctgatgacgaactgtctctaactggtcttgaccacgagctggttctgaattgcagggggctcgcagcagca

>DIO3_SECIS_loxAfr Loxodonta africana (elephant) COVE score: 22
ttcggtgcgctagagccccactgctgatgacgaactgtctctaactggtcttgaccacgagctgattccgaattgcagggaactcgcagcagc

>DIO3_SECIS_echTel Echinops telfairi (tenrec) COVE score: -
ttcggtgctctgcagccccactgctgatgacgaactgcctctcactggtcttgaccacgagctgcttctgaaatgcaggggactcgcagccgca