Opsin evolution: Peropsin phyloSNPs: Difference between revisions
Tomemerald (talk | contribs) No edit summary |
Tomemerald (talk | contribs) No edit summary |
||
Line 1: | Line 1: | ||
PhyloSNPs in vertebrate peropsins | == Introduction to Peropsin Evolution == | ||
PhyloSNPs are defined as single coding residue changes at a site (1) strictly conserved ancestrally over great branch lengths but (2) switching in an ancestral stem to a new residue that in turn is (3) strictly conserved in all descendent extant species. A certain amount of excursion from perfect conservation can be admitted (reduced alphabet, disproportionately attributable to sequencing error) and even low levels of homoplasy (where the ancestral value reoccurs in a later diverging a species or two. Peropsin (or RRH) has just a handful of them at its 337 positions, the number depending on quality cutoff. Most of these occured in the earliest mammals. | |||
PhyloSNPs are thus a special case of synapomorphy adapted to protein molecular evolution. Because they are fixed both before and after the stem event, they greatly enrich for defining characters of the stem clade (and its complement) -- they answer the question, what makes a mammal a mammal. | |||
Interpretation of phyloSNPs is not so easy from bioinformatics alone. In opsins, a great deal of structural and experimental information is availble. | |||
For example, a peropsin douple phyloSNP in adacent N.S columns at positions 211-213 might appear to be an ancestral glycosylation site lost in a coordinated way in the post- marsupial stem but before atlantogenata diverged, perhaps suggesting a shift in subcellular location in placental mammals. However, the site in fact is on the cytoplasmic side and consequently cannot have been a glycosylation site to begin with. | |||
=== PhyloSNPs in vertebrate peropsins === | |||
PhyloSNPs | |||
<pre> | |||
homSap..L.S.S.P.I.G.G.H.T.P.L.G.H | |||
panTro........................... | |||
gorGor........................... | |||
ponPyg........................... | |||
nomLeu........................... | |||
macMul........................... | |||
papHam........................... | |||
calJac........................... | |||
tarSyr......................R...Y | |||
otoGar....................A.R.... | |||
micMur......................R.... | |||
tupBel..............S.L.A.S.R.... | |||
musMus..............-...S...C.... | |||
ratNor..............-...S...C.... | |||
speTri........................... | |||
dipOrd......................H.... | |||
cavPor......................R.... | |||
oryCun......................H.... | |||
ochPri................Y.....Q...P | |||
canFam......................S.... | |||
felCat......................S...R | |||
bosTau......................H...T | |||
turTru......................C.... | |||
susScr......................R.... | |||
vicVic......................R.... | |||
equCab........S.............R.... | |||
myoLuc....................T...... | |||
pteVam........................... | |||
sorAra........R.........I...R...Q | |||
eriEur......................R.... | |||
loxAfr..............A.......H.... | |||
proCap......................H.... | |||
echTel....................R.H...Q | |||
dasNov............S.........R.... | |||
choHof......................H.... | |||
monDom........A.....P.N.S...Q.... | |||
macEug........A.....Y.N.S...Q.... | |||
ornAna....G...A.....R.N.S.T.R.... | |||
galGal....G...A.....H.N.S.T.R.A.R | |||
taeGut....G...A.....H.N.S.T.R.A.R | |||
anoCar....G...A.....H.N.S.T.K.A.R | |||
xenTro....G...A.....H.N.S.T.R...R | |||
danRer....G.A.A.M.L.K.N.S.T.R.A.R | |||
takRub..I.G.A.A.M.L.K.N.S.T.R.A.R | |||
tetNig..I.G.A.A.M.L.K.N.S.T.R.A.R | |||
gasAcu..I.G.A.A.M.L.K.N.S.T.R.A.R | |||
oryLat..I.G.A.A.M.L.K.N.S.T.R.A.R | |||
calMil................N.S.T.....R | |||
homSap..L.S.S.P.I.G.G.H.T.P.L.G.H | |||
........x.x.x.x.x.x.x.x.x.x.y.y.y | |||
......................2.2.3.1.1.3 | |||
........8.7.7.5.4.4.1.1.1.1.2.0.1 | |||
........7.6.5.9.9.8.7.1.3.8.9.3.4 | |||
........2.2.2.2.2.2.1.5.5.7.3.3.7 | |||
</pre> | |||
<pre> | <pre> | ||
..........................................................................................................1.........1.........1.........1.........1.........1.........1.........1.........1...1 | ..........................................................................................................1.........1.........1.........1.........1.........1.........1.........1.........1...1 | ||
Line 114: | Line 183: | ||
homSap SFVSYTMTVIAINFIVPLTVMFYCYYHVTLSIKHHTTSDCTESLNRDWSDQIDVTKMSVIMICMFLVAWSPYSIVCLWASFGDPKKIPPPMAIIAPLFAKSSTFYNPCIYVVANKKFRRAMLAMFKCQTHQTMPVTSILPMDVSQNPLASGRI* | homSap SFVSYTMTVIAINFIVPLTVMFYCYYHVTLSIKHHTTSDCTESLNRDWSDQIDVTKMSVIMICMFLVAWSPYSIVCLWASFGDPKKIPPPMAIIAPLFAKSSTFYNPCIYVVANKKFRRAMLAMFKCQTHQTMPVTSILPMDVSQNPLASGRI* | ||
.................................x.y....................................................................................................y...x.................... | .................................x.y....................................................................................................y...x.................... | ||
</pre> | |||
=== A Curated Set of 48 vertebrate Peropsin Opsins === | |||
The sequences below represent all that are available as of March 2008. A few sequences such as chondrichthyes are incomplete. Note the main reference sequence collection contains additional deuterostome and even some lophotrochozoan peropsins. | |||
<PRE> | <PRE> | ||
>homSap Homo sapiens (human) | >homSap Homo sapiens (human) |
Revision as of 14:14, 8 March 2008
Introduction to Peropsin Evolution
PhyloSNPs are defined as single coding residue changes at a site (1) strictly conserved ancestrally over great branch lengths but (2) switching in an ancestral stem to a new residue that in turn is (3) strictly conserved in all descendent extant species. A certain amount of excursion from perfect conservation can be admitted (reduced alphabet, disproportionately attributable to sequencing error) and even low levels of homoplasy (where the ancestral value reoccurs in a later diverging a species or two. Peropsin (or RRH) has just a handful of them at its 337 positions, the number depending on quality cutoff. Most of these occured in the earliest mammals.
PhyloSNPs are thus a special case of synapomorphy adapted to protein molecular evolution. Because they are fixed both before and after the stem event, they greatly enrich for defining characters of the stem clade (and its complement) -- they answer the question, what makes a mammal a mammal.
Interpretation of phyloSNPs is not so easy from bioinformatics alone. In opsins, a great deal of structural and experimental information is availble. For example, a peropsin douple phyloSNP in adacent N.S columns at positions 211-213 might appear to be an ancestral glycosylation site lost in a coordinated way in the post- marsupial stem but before atlantogenata diverged, perhaps suggesting a shift in subcellular location in placental mammals. However, the site in fact is on the cytoplasmic side and consequently cannot have been a glycosylation site to begin with.
PhyloSNPs in vertebrate peropsins
PhyloSNPs
homSap..L.S.S.P.I.G.G.H.T.P.L.G.H panTro........................... gorGor........................... ponPyg........................... nomLeu........................... macMul........................... papHam........................... calJac........................... tarSyr......................R...Y otoGar....................A.R.... micMur......................R.... tupBel..............S.L.A.S.R.... musMus..............-...S...C.... ratNor..............-...S...C.... speTri........................... dipOrd......................H.... cavPor......................R.... oryCun......................H.... ochPri................Y.....Q...P canFam......................S.... felCat......................S...R bosTau......................H...T turTru......................C.... susScr......................R.... vicVic......................R.... equCab........S.............R.... myoLuc....................T...... pteVam........................... sorAra........R.........I...R...Q eriEur......................R.... loxAfr..............A.......H.... proCap......................H.... echTel....................R.H...Q dasNov............S.........R.... choHof......................H.... monDom........A.....P.N.S...Q.... macEug........A.....Y.N.S...Q.... ornAna....G...A.....R.N.S.T.R.... galGal....G...A.....H.N.S.T.R.A.R taeGut....G...A.....H.N.S.T.R.A.R anoCar....G...A.....H.N.S.T.K.A.R xenTro....G...A.....H.N.S.T.R...R danRer....G.A.A.M.L.K.N.S.T.R.A.R takRub..I.G.A.A.M.L.K.N.S.T.R.A.R tetNig..I.G.A.A.M.L.K.N.S.T.R.A.R gasAcu..I.G.A.A.M.L.K.N.S.T.R.A.R oryLat..I.G.A.A.M.L.K.N.S.T.R.A.R calMil................N.S.T.....R homSap..L.S.S.P.I.G.G.H.T.P.L.G.H ........x.x.x.x.x.x.x.x.x.x.y.y.y ......................2.2.3.1.1.3 ........8.7.7.5.4.4.1.1.1.1.2.0.1 ........7.6.5.9.9.8.7.1.3.8.9.3.4 ........2.2.2.2.2.2.1.5.5.7.3.3.7
..........................................................................................................1.........1.........1.........1.........1.........1.........1.........1.........1...1 ................1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2.........3.........4.........5.........6.........7.........8...8 .......1234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234 .......EEEEEEEEEEEEEEEEEEEEEEEEEEMMMMMMMMMMMMMMMMMMMMMMMCCCCCCCCCCCCMMMMMMMMMMMMMMMMMMMMMMMMMMEEEEEEEEEEEEEEMMMMMMMMMMMMMMMMMMMMCCCCCCCCCCCCCCCCCCCMMMMMMMMMMMMMMMMMMMMMMMMEEEEEEEEEEEEEEEEEEEE ..............GGG.......................................................................................S.......................DRY..................................................S......... .......................y..............................xx.........x...............xx..........x...............y.........................y....................................................... homSap MLRNNLGNSSDSKNEDGSVFSQTEHNIVATYLIMAGMISIISNIIVLGIFIKYKELRTPTNAIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAGCQVYAGLNIFFGMASIGLLTVVAVDRYLTICLPDVGRRMTTNTYIGLILGAWINGLFWALMPIIGWASYAPDPTGATCTINWRKNDR panTro ........................................................................................................................................................................................ gorGor ........G................................................................................................................................................L.............................. ponPyg ....................................I..............................................................................................I.................................................... nomLeu ........................................................................................................................................................................................ macMul ........................................L..........................................................................................I...........M........................................ papHam ........................................L..........................................................................................I...........M........................................ calJac .............................S..........L...............................................................................A..........I......S...IM................V....................... tarSyr ..K.D...................................L.........V................S...............................I............................R..I.........V.M............S...A...T..................A otoGar ........N...............................L..............................................H...........I............................R..I.......S...M.....V..........T......................A micMur I....................M.......R..I......H..V.M.....V......V...T...G..................A tupBel ..GSDA....NLG...S.A..........A..LT..V...L...V.....VTH...............F..................H.R....H....I..........................L.R.A.....GSS..AAM.....L......A...A...G..................A musMus ..SEASD...G.RS.G-....R...SVI.A...V..IT..L..VV.................V.....F..................H......H....I..........V.........M......SC............LSM....................................N..T ratNor ...DA.D...G.GS.G-...TKS..S.I.A...V..I...L.....................V.....F..................H......H...............V.........L......SC........G...LSMV...............V......................T speTri dipOrd .....V....G.R....................T..V...L..L........................L....................R.........I....................I.......H..I..G...R..VTM.......................................T cavPor ...HS........................A...L..L...L.......................M...L..................H........................................R..I.....SHS.V.M.......................................V oryCun ......S....F.H...................L......L..L........................F..................H...........I....................M.......H.........R..L.......V..........A......................T ochPri ...H......EA.V.A.............A...L......L..L........................F..................H...........I....................M.......Q..I......H..F.M................V......................K canFam ......D..T...................A...T..I...F..L.......................................................I....................M.......S..T..........SM.....L.................................A felCat ......D......................A...T. I....................M.......S.NS..........SM.....L.................................A bosTau ...........C...N.............A...T..V...L...........F..................................H...........I............................H..A.....A....SM.....T.................................V turTru ..................T..........A...T.....VL..............................................H...........I............................C.GA..........SM.....F..........V......................V susScr ..LMD...N................S...A...T......L...V.......H...............A..................H...........I............................R.EA..........SM.......................................V vicVic ...............Y......A......A...T......L...V.......H..............................................I....................M.......R..A..........SM............SV......G...............Q..V equCab .............................A..........L.........V.......S..............................R.........I....................L.....T.R..A......S..TSM.....V..........V...N..................A myoLuc ..K...S....F............................L..V.......T...................................H........... ..K...H...SM.......................................V pteVam ..........N......................I......L.........F................................................ ..........IM.......................................A sorAra ..E.RSD...GPTP.G.A.L.R...RL..A...V..V...L..TA.............R.....M...L..................H...R..H......................L........L.R..A..S....S.V..........F.......L.........M.........G..V eriEur ..E.....G...E................A...V. I............................R.HT..S.SA.S..AM.....L..........S...G...............E..G loxAfr ....S.D.........A.......................L..............................................H.R.....T...I............................H.HI.....S...VSM.........L...L..T......................A proCap ...............E.............A..... I..........S.........I.......H.N. echTel ..K..E.T..G.HS.E......A..R...S...T......L..F..................M........................H.............................A..........H..R.....S...V.M........FL...L.V....................E..A dasNov .....T.......D......................I...F......S.............T.....................................I..........T.........M.......R..T.....I....SM................A.....................NI choHof .I...L...V................TV....................................I............................H........I....SM.............L.VT......................G monDom .FK..SVKTLAPEK.GP....PI..K...A...T..V...V..V......V...A...A..T.................................D...I.................A..I.......Q..L.G...SYN.TLM..T..V..F.......V...G..................V macEug .FQ.DS---LEPEK.SY....P.......A...T..V...P.........V.......A..T.....................................I.................A..I.......Q..L ornAna .R..DSA.LLE.EHH.R.A....D.....A...T..IM..V..V......V.FE....A................G...........H......H....I..........S.................R.AI..K..RSN.TAM..A..M..F...S..LL......S...............A galGal .HW.DSA...E.DA.AH...T........A...T..V...F...V.....V.......A.........F......G...........H.......T...I..A.........................R..I......RN.AA...A....AV...S..TV...G..S........A......T taeGut .HW.DSS...E.DD.AH.A.T........A...T..V...F...V.....V.......A.........F......G...........H.......T...I..A.........................R..I......RS.AT...A....AV..SS..TA...............V......V anoCar .FL.DSA...E.DD.PH.A...A......A...T..V..LL...V.....V.......A.........F......G...........H.......T...I..A.................I.......K.HI.S.L.ATN.TT...A....A....S..VV...............V......T xenTro .AGTGTV.I..ASS.VH.....S......A...T..V...L.........V.......A.........F......G...........H.......V...I....................I.......R..I...ISGRH.TAM..A....AV..SV..VV..S...................A danRer .ESGL.NV.AETVYGEK.A.T........A...T..V..LS...V..LM.V.FR....A.........F.....AG...........H.......M...I..A.................I.......R..I.QKL..RS.TL..VA..L.AV..SS...V...G...............N..V takRub ..KVFSLVNISNEV.GK...T.W......G...I..V..LT...V..LM.V.F.....A..F......F.....AG..........IH......HT...I..A.................I...I...R..I..K..VQS.NL...A..L.AV..SS..VV...A...............Q.NV tetNig ..KVFSLVN.SNDV.GK.A.T.W......G...T..I..LT...V..LM.V.F.....A..F......F.....AG..........IH......HT...I..A.................I.......R..I..K..VQS.NL..AA..L.AV..SS..VV...A...............Q.NA gasAcu .DPEVNVTDDVTLYGGK.A.T.L......G...T..V..LF...V..LM.W.F.....A..F......F.....AG..........IH...........I..A.................I.......R..I.QK..MQS.NL...A..L.AV..SS..VV...................Q..A oryLat .DSAVNVSDAAAPYGGK.A.T.L......G...T..V..LS..VL..LM.V.FR....A..I......F..V..AG..........IH.......T...I..A.................I.......R..L.QK..MQS.NL...A..L.AV..SS...V...G...............Q..V calMil homSap MLRNNLGNSSDSKNEDGSVFSQTEHNIVATYLIMAGMISIISNIIVLGIFIKYKELRTPTNAIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAGCQVYAGLNIFFGMASIGLLTVVAVDRYLTICLPDVGRRMTTNTYIGLILGAWINGLFWALMPIIGWASYAPDPTGATCTINWRKNDR .......................y..............................xx.........x...............xx..........x...............y.........................y....................................................... .......1....1.........2.........2.........2.........2.........2.........2.........2.........2.........2.........2.........3........3..........3.........3.......3 .......8....9.........0.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0........1..........2.........3.......3 .......5678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678 .......EEEEMMMMMMMMMMMMMMMMMMMMMMMMCCCCCCCCCCCCCCCCCCCCCCCCCCCCMMMMMMMMMMMMMMMMMMMMMMMMEEEEEEEEMMMMMMMMMMMMMMMMMMMMMMMMMCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC* .................................ggg......................................................................K...................................................... .................................x.y....................................................................................................y...x.................... homSap SFVSYTMTVIAINFIVPLTVMFYCYYHVTLSIKHHTTSDCTESLNRDWSDQIDVTKMSVIMICMFLVAWSPYSIVCLWASFGDPKKIPPPMAIIAPLFAKSSTFYNPCIYVVANKKFRRAMLAMFKCQTHQTMPVTSILPMDVSQNPLASGRI* panTro .........................................................................................................................................................* gorGor ......V................................................. .....................................* ponPyg .........................................................................................................................................................* nomLeu .................................Y....................................................................................K.......WPN.....G.............T..K.* macMul ...................................A..........E..........................................................................................................* papHam ...................................A..........E...............................................................M..........................................* calJac .......A......................F......................................................N...A..................................L............V....I..........* tarSyr T......A...................A.Q.....VA.N.V..........V.... .....F..L....Y.A..A..S...N......T..KN* otoGar .......................L.........S...............................F.........A.A........I.......R..* micMur ...........V..V...M..........Q...R..ATN............V....................A.........N.E....S.....................I.........F.........A......F..G......P...T* tupBel .......A...V..V......S...VL.ARA.AG.AA.S.S.HRC....E.V.......L..L.....................QR...A...V.................L......K..C.........A.S...V...AS.PR...PA.V* musMus .......M..VV................SR.LRLYAA....AH.H...A..A..........L...L............C..N......S.....................A.H....K.........P.LAV.EP.T....MP.SS..PV..* ratNor .......M..VV................SQ.MRLSAA.N..TH.....AH.A.......M..L...L......V.....C..N......SL....................A......K..F..L...P..A..EP.T.A.G.PHS...PA..* speTri ...........V.................R.....AA.N..AY........V..........L...................N......S.....................A...R.....F.........A.....V.......S.R.....* dipOrd ...V..L......................E......................................L......A.....................* cavPor A.................A......L.I.RA.RR.VAG.RPPN.SG.....V.......V..L.....................RR.S.S.....................I.........F...Q.....AV..A......A..S.......* oryCun ....F..A......V..............Q...Q.RA.....Y........V..........F..........................A.....................A...R.....F.........A.....V..........P..I.* ochPri ....F..A..MV..V...........Y..Q......A....K.........V..........L.....................Q....S.....................A...RS....F......IP.AK....LS.R....S..S...T* canFam F...F......V.................R...C....N...Y........V..........F..........................S............................K.IF.........A..G.................N* felCat ....F.........N..............QT..C.D..N..GY......N.V.... ..K..F.....ENR.P................T...K* bosTau .......M.V...................Q.....G.NN...Y........V..........L..........................S.....................I.................T.A.....V.....P....T..KV* turTru .......M.V........IM.........R.....A..N...Y........V..........L..........................SV..L.................I...................A..ME.......P....T..KV* susScr .........V.V......I..........Q...S.A..N..AY........V..........L................................................I...................A..LE.T.....P........V* vicVic .......M.V........I..........R...CRA..N...Y........V..........L...L......................S.....................I...................A..M........P....T...L* equCab .......A.V...................R.M.R.P..N..QY..T.....V..........F..........................S...............................F........RA...........P..Q......* myoLuc ....F........................R...CRA..N...Y.....A..V..........F..........................S...............................F...........TTM.F.....P....T....* pteVam ....F.............A..........R...C....N...Y........V..........F..........................S...............................F......D..S.....V.....P....T....* sorAra .........V.V..VM.....A......IR.L.C...HSSPGH.DG...S.V.......V..LA..A.........F............S..V............................S..LT.RAQGA..AA.T....AAHS.Q....N* eriEur ..............M...A..L.......R.M.C..ARS.A.YVHG.....V..........L..........V...............S...M.................L.........F.........A....NT....IP.K-.D.R.N* loxAfr ..........V...V...A..........R...R..A.N.A.Y........L..........L....................S.....S...............................F.........AE...C....N.......A...* proCap ......V...MV..V...A........I.....C.AARN.A.P.C......L..........L...M.................T....S.........................................AV...N....T....SS.....* echTel .........VG...V...A..........QT..RQ.A.NGA.N.H......L.... .....F.LLQ..PQEARR.......N.....M....L* dasNov .........V....L...A..L.......R......K.N.S.YF.....N.V..........S.....................E....S..............................IF..L......A...M.....N..E........* choHof .........VV...V..VA........I.R...RRNS.N.S.Y........V..........L...L......................S............................TI.F..L......AV........N..E........* monDom ..........T...AM..G.......N.SQKM.QYSP.N.PDHI.....N.VA......V..L...L..................E...A...V.................A........IS..IR.....S..ISNA...N* macEug ..............VM..V.......N.S.KM.QY.R.S.P.HI.....N.V..........L...L......V...........E...A.....................A........IS..MR.E...S...SNA..LNLT* ornAna ..I........V..A...I.......N.SKAMRQYPA.RVL.N..I...E.V.......V..L...M...........S........S.AV..M............................S.VQ....REITI.DV...NR.RS..TL galGal .......S...V..V...........N.SRTM.QY.S.N.L..I.M.....V.......V..V...............S........S.A.....................I........I...VR...R.EITISNA...T..LSA.T. taeGut ..I....S...V..V...........N.SRTM.QYAS.N.L..I.I.....V.......V..I...............S........S.A.....................I........I...VR...R.EITINNA...S...SA.T.QNS* anoCar .......S...V..VI..S.......N.SKTM.YYMRNS.L.NI.I.....V.......V..I...L...........S........S.A...V.................I...R....I...IR...R.EITINNV...S...STI. xenTro .......S.V.V..V...M.......N.SRTM.GYGSRSSLGGI.A.....T......MV..V...............S.....R....A.....................I........I.S.VQ.KSR.EVTLDNHF..N...ST.TT* danRer ..........TV...I..S.......N.SATV.RFKA.N.LD.I.M.....M..........V...A.................Q...A........L.............I........IIG.IR...R.RVTINNQ...MA.SV..NP* takRub ..I....A..GV..ML..F.......N.SVTVNRCK..N.LDDI.IE..E.M......LV..I......................A..A......................I........IIG.IR...R.Q.TINTEI..TT..QTATQ* tetNig ..I....A..SV...L..F.......N.SVTV.QYKANN.LDNI.IE..E.M......IV..I......................T.SA......................IT.....Q.IIG.IR...R.QITINTDI..TA..QT.TQ* gasAcu ..I....A...V..VL..SA......N.SATV.RYKA.N.LD.A.I.....M......IV..I......................T..A......................I........IIG.VR...R.RITIN.QV..TT..Q..TQ* oryLat .......A...V..V...........N.SATV.RYKA.N.LD.I.I.....M......IV..V..........M...........T..A......................I........IIG.IR...R.RITISTQV..TI..Q..TQ* calMil L..........V..V...S...F...N.SKTMSRFIS.PSP.NI.L.....L.......V..V...L...............N..L...A.....................I......K.IM..IC..NR.EITINHT...TI.RV..TE* homSap SFVSYTMTVIAINFIVPLTVMFYCYYHVTLSIKHHTTSDCTESLNRDWSDQIDVTKMSVIMICMFLVAWSPYSIVCLWASFGDPKKIPPPMAIIAPLFAKSSTFYNPCIYVVANKKFRRAMLAMFKCQTHQTMPVTSILPMDVSQNPLASGRI* .................................x.y....................................................................................................y...x....................
A Curated Set of 48 vertebrate Peropsin Opsins
The sequences below represent all that are available as of March 2008. A few sequences such as chondrichthyes are incomplete. Note the main reference sequence collection contains additional deuterostome and even some lophotrochozoan peropsins.
>homSap Homo sapiens (human) 0 MLRNNLGNSSDSKNEDGSVFSQTEHNIVATYLIMA 1 2 GMISIISNIIVLGIFIKYKELRTPTNAIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAGCQ 0 0 VYAGLNIFFGMASIGLLTVVAVDRYLTICLPDV 1 2 GRRMTTNTYIGLILGAWINGLFWALMPIIGWASYAPDPTGATCTINWRKNDR 2 1 SFVSYTMTVIAINFIVPLTVMFYCYYHVTLSIKHHTTSDCTESLNRDWSDQIDVTK 0 0 MSVIMICMFLVAWSPYSIVCLWASFGDPKKIPPPMAIIAPLFAKSSTFYNPCIYVVANKK 2 1 FRRAMLAMFKCQTHQTMPVTSILPMDVSQNPLASGRI* 0 >panTro Pan troglodytes (chimp) 0 MLRNNLGNSSDSKNEDGSVFSQTEHNIVATYLIMA 1 2 GMISIISNIIVLGIFIKYKELRTPTNAIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAGCQ 0 0 VYAGLNIFFGMASIGLLTVVAVDRYLTICLPDV 1 2 GRRMTTNTYIGLILGAWINGLFWALMPIIGWASYAPDPTGATCTINWRKNDR 2 1 SFVSYTMTVIAINFIVPLTVMFYCYYHVTLSIKHHTTSDCTESLNRDWSDQIDVTK 0 0 MSVIMICMFLVAWSPYSIVCLWASFGDPKKIPPPMAIIAPLFAKSSTFYNPCIYVVANKK 2 1 FRRAMLAMFKCQTHQTMPVTSILPMDVSQNPLASGRI* 0 >gorGor Gorilla gorilla (gorilla) 0 MLRNNLGNGSDSKNEDGSVFSQTEHNIVATYLIMA 1 2 GMISIISNIIVLGIFIKYKELRTPTNAIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAGCQ 0 0 VYAGLNIFFGMASIGLLTVVAVDRYLTICLPDV 1 2 GRRMTTNTYIGLILGAWINGLLWALMPIIGWASYAPDPTGATCTINWRKNDR 2 1 SFVSYTVTVIAINFIVPLTVMFYCYYHVTLSIKHHTTSDCTESLNRDWSDQIDVTK 0 0 2 1 FRRAMLAMFKCQTHQTMPVTSILPMDVSQNPLASGRI* 0 >ponPyg Pongo pygmaeus (orang_abelii) 0 MLRNNLGNSSDSKNEDGSVFSQTEHNIVATYLIMA 1 2 GIISIISNIIVLGIFIKYKELRTPTNAIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAGCQ 0 0 VYAGLNIFFGMASIGLLTVVAVDRYLTICLPDI 1 2 GRRMTTNTYIGLILGAWINGLFWALMPIIGWASYAPDPTGATCTINWRKNDR 2 1 SFVSYTMTVIAINFIVPLTVMFYCYYHVTLSIKHHTTSDCTESLNRDWSDQIDVTK 0 0 MSVIMICMFLVAWSPYSIVCLWASFGDPKKIPPPMAIIAPLFAKSSTFYNPCIYVVANKK 2 1 FRRAMLAMFKCQTHQTMPVTSILPMDVSQNPLASGRI* 0 >nomLeu Nomascus leucogenys (gibbon) 0 MLRNNLGNSSDSKNEDGSVFSQTEHNIVATYLIMA 1 2 GMISIISNIIVLGIFIKYKELRTPTNAIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAGCQ 0 0 VYAGLNIFFGMASIGLLTVVAVDRYLTICLPDV 1 2 GRRMTTNTYIGLILGAWINGLFWALMPIIGWASYAPDPTGATCTINWRKNDR 2 1 SFVSYTMTVIAINFIVPLTVMFYCYYHVTLSIKYHTTSDCTESLNRDWSDQIDVTK 0 0 MSVIMICMFLVAWSPYSIVCLWASFGDPKKIPPPMAIIAPLFAKSSTFYNPCIYVVANKK 2 1 FRKAMLAMFKWPNHQTMPGTSILPMDVSQNPLTSGKI* 0 >macMul Macaca mulatta (rhesus) 0 MLRNNLGNSSDSKNEDGSVFSQTEHNIVATYLIMA 1 2 GMISILSNIIVLGIFIKYKELRTPTNAIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAGCQ 0 0 VYAGLNIFFGMASIGLLTVVAVDRYLTICLPDI 1 2 GRRMTTNTYIGMILGAWINGLFWALMPIIGWASYAPDPTGATCTINWRKNDR 2 1 SFVSYTMTVIAINFIVPLTVMFYCYYHVTLSIKHHATSDCTESLNREWSDQIDVTK 0 0 MSVIMICMFLVAWSPYSIVCLWASFGDPKKIPPPMAIIAPLFAKSSTFYNPCIYVVANKK 2 1 FRRAMLAMFKCQTHQTMPVTSILPMDVSQNPLASGRI* 0 >papHam Papio hamadryas (baboon) 0 MLRNNLGNSSDSKNEDGSVFSQTEHNIVATYLIMA 1 2 GMISILSNIIVLGIFIKYKELRTPTNAIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAGCQ 0 0 VYAGLNIFFGMASIGLLTVVAVDRYLTICLPDI 1 2 GRRMTTNTYIGMILGAWINGLFWALMPIIGWASYAPDPTGATCTINWRKNDR 2 1 SFVSYTMTVIAINFIVPLTVMFYCYYHVTLSIKHHATSDCTESLNREWSDQIDVTK 0 0 MSVIMICMFLVAWSPYSIVCLWASFGDPKKIPPPMAIIAPLFAKSSTFYNPCIYMVANKK 2 1 FRRAMLAMFKCQTHQTMPVTSILPMDVSQNPLASGRI* 0 >calJac Callithrix jacchus (marmoset) 0 MLRNNLGNSSDSKNEDGSVFSQTEHNIVASYLIMA 1 2 GMISILSNIIVLGIFIKYKELRTPTNAIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAGCQ 0 0 VYAGLNIFFGMASIGLLTVVAADRYLTICLPDI 1 2 GRRMTTSTYIIMILGAWINGLFWALMPIVGWASYAPDPTGATCTINWRKNDR 2 1 SFVSYTMAVIAINFIVPLTVMFYCYYHVTLFIKHHTTSDCTESLNRDWSDQIDVTK 0 0 MSVIMICMFLVAWSPYSIVCLWASFGDPKNIPPAMAIIAPLFAKSSTFYNPCIYVVANKK 2 1 FRRAMLAMLKCQTHQTMPVTSVLPMDISQNPLASGRI* 0 >tarSyr Tarsius syrichta (tarsier) 0 MLKNDLGNSSDSKNEDGSVFSQTEHNIVATYLIMA 1 2 GMISILSNIIVLGIFVKYKELRTPTNAIIINLSVTDIGVSSIGYPMSAASDLYGSWKFGYAGCQ 0 0 IYAGLNIFFGMASIGLLTVVAVDRYLTICRPDI 1 2 GRRMTTNTYVGMILGAWINGLFWASMPIAGWATYAPDPTGATCTINWRKNDA 2 1 TFVSYTMAVIAINFIVPLTVMFYCYYHATQSIKHHVASNCVESLNRDWSDQVDVTK 0 0 2 1 FRRAMFAMLKCQTYQAMPATSSLPMNVSQNPLTSGKN* 0 >otoGar Otolemur garnettii (bushbaby) 0 MLRNNLGNNSDSKNEDGSVFSQTEHNIVATYLIMA 1 2 GMISILSNIIVLGIFIKYKELRTPTNAIIINLAVTDIGVSSIGYPMSAASDLHGSWKFGYAGCQ 0 0 IYAGLNIFFGMASIGLLTVVAVDRYLTICRPDI 1 2 GRRMTTNSYIGMILGAWVNGLFWALMPITGWASYAPDPTGATCTINWRKNDA 2 1 SFVSYTMTVIAVNFVVPLMVMFYCYYHVTQSIKRHTATNCTESLNRDWSDQVDVTK 0 0 MSVIMICMFLVAWSPYSIVCLWALFGDPKKIPPSMAIIAPLFAKSSTFYNPCIYVVANKK 2 1 FRRAMFAMFKCQTHQAMAVTSILPMDISQNPLASRRI* 0 >micMur Microcebus murinus (mouse_lemur) 0 1 2 0 0 IYAGLNIFFGMASIGLLTVVAMDRYLTICRPDI 1 2 GRRMTTHTYVGMILGAWVNGLFWAVMPITGWAGYAPDPTGATCTINWRKNDA 2 1 0 0 MSVIMICMFLVAWSPYAIVCLWASFGNPEKIPPSMAIIAPLFAKSSTFYNPCIYVIANKK 2 1 FRRAMFAMFKCQTHQAMPVTSIFPMGVSQNPLPSGRT* 0 >tupBel Tupaia belangeri (treeshrew) 0 MLGSDAGNSSNLGNEDSSAFSQTEHNIVAAYLLTA 1 2 GVISILSNIVVLGIFVTHKELRTPTNAIIINLAFTDIGVSSIGYPMSAASDLHGRWKFGHAGCQ 0 0 IYAGLNIFFGMASIGLLTVVAVDRYLTLCRPAV 1 2 GRRMGSSTYAAMILGAWLNGLFWAAMPIAGWAGYAPDPTGATCTINWRKNDA 2 1 SFVSYTMAVIAVNFVVPLTVMSYCYVLVARAIAGHAASSCSEHRCRDWSEQVDVTK 0 0 MSVLMILMFLVAWSPYSIVCLWASFGDPQRIPPAMAIVAPLFAKSSTFYNPCIYVLANKK 2 1 FRKAMCAMFKCQTHQAMSVTSVLPMASSPRPLAPARV* 0 >musMus Mus musculus (mouse) 0 MLSEASDNSSGSRSE-GSVFSRTEHSVIAAYLIVA 1 2 GITSILSNVVVLGIFIKYKELRTPTNAVIINLAFTDIGVSSIGYPMSAASDLHGSWKFGHAGCQ 0 0 IYAGLNIFFGMVSIGLLTVVAMDRYLTISCPDV 1 2 GRRMTTNTYLSMILGAWINGLFWALMPIIGWASYAPDPTGATCTINWRNNDT 2 1 SFVSYTMMVIVVNFIVPLTVMFYCYYHVSRSLRLYAASDCTAHLHRDWADQADVTK 0 0 MSVIMILMFLLAWSPYSIVCLWACFGNPKKIPPSMAIIAPLFAKSSTFYNPCIYVAAHKK 2 1 FRKAMLAMFKCQPHLAVPEPSTLPMDMPQSSLAPVRI* 0 >ratNor Rattus norvegicus (rat) 0 MLRDALDNSSGSGSE-GSVFTKSEHSIIAAYLIVA 1 2 GIISILSNIIVLGIFIKYKELRTPTNAVIINLAFTDIGVSSIGYPMSAASDLHGSWKFGHAGCQ 0 0 VYAGLNIFFGMVSIGLLTVVALDRYLTISCPDV 1 2 GRRMTGNTYLSMVLGAWINGLFWALMPIVGWASYAPDPTGATCTINWRKNDT 2 1 SFVSYTMMVIVVNFIVPLTVMFYCYYHVSQSMRLSAASNCTTHLNRDWAHQADVTK 0 0 MSVMMILMFLLAWSPYSVVCLWACFGNPKKIPPSLAIIAPLFAKSSTFYNPCIYVAANKK 2 1 FRKAMFAMLKCQPHQAMPEPSTLAMGVPHSPLAPARI* 0 >speTri Spermophilus tridecemlineatus (ground_squirrel) 0 1 2 0 0 1 2 2 1 SFVSYTMTVIAVNFIVPLTVMFYCYYHVTRSIKHHAASNCTAYLNRDWSDQVDVTK 0 0 MSVIMILMFLVAWSPYSIVCLWASFGNPKKIPPSMAIIAPLFAKSSTFYNPCIYVAANKR 2 1 FRRAMFAMFKCQTHQAMPVTSVLPMDVSQSPRASGRI* 0 >dipOrd Dipodomys ordii (kangaroo_rat) 0 MLRNNVGNSSGSRNEDGSVFSQTEHNIVATYLITA 1 2 GVISILSNLIVLGIFIKYKELRTPTNAIIINLALTDIGVSSIGYPMSAASDLYGRWKFGYAGCQ 0 0 IYAGLNIFFGMASIGLLTVVAIDRYLTICHPDI 1 2 GRGMTTRTYVTMILGAWINGLFWALMPIIGWASYAPDPTGATCTINWRKNDT 2 1 0 0 MSVVMILMFLVAWSPYSIVCLWASFGDPKEIPPPMAIIAPLFAKSSTFYNPCIYVVANKK 2 1 FRRAMLAMLKCQTHQAMPVTSILPMDVSQNPLASGRI* 0 >cavPor Cavia porcellus (guinea_pig) 0 MLRHSLGNSSDSKNEDGSVFSQTEHNIVAAYLILA 1 2 GLISILSNIIVLGIFIKYKELRTPTNAIIMNLALTDIGVSSIGYPMSAASDLHGSWKFGYAGCQ 0 0 VYAGLNIFFGMASIGLLTVVAVDRYLTICRPDI 1 2 GRRMTSHSYVGMILGAWINGLFWALMPIIGWASYAPDPTGATCTINWRKNDV 2 1 AFVSYTMTVIAINFIVPLAVMFYCYLHITRAIRRHVAGDRPPNLSGDWSDQVDVTK 0 0 MSVVMILMFLVAWSPYSIVCLWASFGDPRRISPSMAIIAPLFAKSSTFYNPCIYVIANKK 2 1 FRRAMFAMFQCQTHQAVPVASILPMDASQSPLASGRI* 0 >oryCun Oryctolagus cuniculus (rabbit) 0 MLRNNLSNSSDFKHEDGSVFSQTEHNIVATYLILA 1 2 GMISILSNLIVLGIFIKYKELRTPTNAIIINLAFTDIGVSSIGYPMSAASDLHGSWKFGYAGCQ 0 0 IYAGLNIFFGMASIGLLTVVAMDRYLTICHPDV 1 2 GRRMTTRTYLGLILGAWVNGLFWALMPIAGWASYAPDPTGATCTINWRKNDT 2 1 SFVSFTMAVIAINFVVPLTVMFYCYYHVTQSIKQHRASDCTEYLNRDWSDQVDVTK 0 0 MSVIMIFMFLVAWSPYSIVCLWASFGDPKKIPPAMAIIAPLFAKSSTFYNPCIYVAANKR 2 1 FRRAMFAMFKCQTHQAMPVTSVLPMDVSQNPLPSGII* 0 >ochPri Ochotona princeps (pika) 0 MLRHNLGNSSEAKVEAGSVFSQTEHNIVAAYLILA 1 2 GMISILSNLIVLGIFIKYKELRTPTNAIIINLAFTDIGVSSIGYPMSAASDLHGSWKFGYAGCQ 0 0 IYAGLNIFFGMASIGLLTVVAMDRYLTICQPDI 1 2 GRRMTTHTYFGMILGAWINGLFWALMPIVGWASYAPDPTGATCTINWRKNDK 2 1 SFVSFTMAVIMVNFVVPLTVMFYCYYYVTQSIKHHTASDCTKSLNRDWSDQVDVTK 0 0 MSVIMILMFLVAWSPYSIVCLWASFGDPQKIPPSMAIIAPLFAKSSTFYNPCIYVAANKR 2 1 SRRAMFAMFKCQIPQAKPVTSLSPRDVSQSPLSSGRT* 0 >canFam Canis familiaris (dog) 0 MLRNNLDNSTDSKNEDGSVFSQTEHNIVAAYLITA 1 2 GIISIFSNLIVLGIFIKYKELRTPTNAIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAGCQ 0 0 IYAGLNIFFGMASIGLLTVVAMDRYLTICSPDT 1 2 GRRMTTNTYISMILGAWLNGLFWALMPIIGWASYAPDPTGATCTINWRKNDA 2 1 FFVSFTMTVIAVNFIVPLTVMFYCYYHVTRSIKCHTTSNCTEYLNRDWSDQVDVTK 0 0 MSVIMIFMFLVAWSPYSIVCLWASFGDPKKIPPSMAIIAPLFAKSSTFYNPCIYVVANKK 2 1 FRKAIFAMFKCQTHQAMPGTSILPMDVSQNPLASGRN* 0 >felCat Felis catus (cat) 0 MLRNNLDNSSDSKNEDGSVFSQTEHNIVAAYLITA 1 2 0 0 IYAGLNIFFGMASIGLLTVVAMDRYLTICSPNS 1 2 GRRMTTNTYISMILGAWLNGLFWALMPIIGWASYAPDPTGATCTINWRKNDA 2 1 SFVSFTMTVIAINFNVPLTVMFYCYYHVTQTIKCHDTSNCTGYLNRDWSNQVDVTK 0 0 2 1 FRKAMFAMFKCENRQPMPVTSILPMDVSQNPLTSGRK* 0 >bosTau Bos taurus (cow) 0 MLRNNLGNSSDCKNENGSVFSQTEHNIVAAYLITA 1 2 GVISILSNIIVLGIFIKFKELRTPTNAIIINLAVTDIGVSSIGYPMSAASDLHGSWKFGYAGCQ 0 0 IYAGLNIFFGMASIGLLTVVAVDRYLTICHPDA 1 2 GRRMTANTYISMILGAWTNGLFWALMPIIGWASYAPDPTGATCTINWRKNDV 2 1 SFVSYTMMVVAINFIVPLTVMFYCYYHVTQSIKHHGTNNCTEYLNRDWSDQVDVTK 0 0 MSVIMILMFLVAWSPYSIVCLWASFGDPKKIPPSMAIIAPLFAKSSTFYNPCIYVIANKK 2 1 FRRAMLAMFKCQTTQAMPVTSVLPMDVPQNPLTSGKV* 0 >turTru Tursiops truncatus (dolphin) 0 MLRNNLGNSSDSKNEDGSTFSQTEHNIVAAYLITA 1 2 GMISVLSNIIVLGIFIKYKELRTPTNAIIINLAVTDIGVSSIGYPMSAASDLHGSWKFGYAGCQ 0 0 IYAGLNIFFGMASIGLLTVVAVDRYLTICCPGA 1 2 GRRMTTNTYISMILGAWFNGLFWALMPIVGWASYAPDPTGATCTINWRKNDV 2 1 SFVSYTMMVVAINFIVPLIMMFYCYYHVTRSIKHHATSNCTEYLNRDWSDQVDVTK 0 0 MSVIMILMFLVAWSPYSIVCLWASFGDPKKIPPSVAILAPLFAKSSTFYNPCIYVIANKK 2 1 FRRAMLAMFKCQTHQAMPMESILPMDVPQNPLTSGKV* 0 >susScr Sus scrofa (pig) 0 MLMDLGNNSDSKNEDGSVFSQTEHSIVAAYLITA 1 2 GMISILSNIVVLGIFIKHKELRTPTNAIIINLAATDIGVSSIGYPMSAASDLHGSWKFGYAGCQ 0 0 IYAGLNIFFGMASIGLLTVVAVDRYLTICRPEA 1 2 GRRMTTNTYISMILGAWINGLFWALMPIIGWASYAPDPTGATCTINWRKNDV 2 1 SFVSYTMTVVAVNFIVPLIVMFYCYYHVTQSIKSHATSNCTAYLNRDWSDQVDVTK 0 0 MSVIMILMFLVAWSPYSIVCLWASFGDPKKIPPPMAIIAPLFAKSSTFYNPCIYVIANKK 2 1 FRRAMLAMFKCQTHQAMPLESTLPMDVPQNPLASGRV* 0 >vicVic Vicugna vicugna (vicugna) 0 MLRNNLGNSSDSKNEYGSVFSQAEHNIVAAYLITA 1 2 LRTPTNAIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAGCQ 0 0 IYAGLNIFFGMASIGLLTVVAMDRYLTICRPDA 1 2 GRRMTTNTYISMILGAWINGLFWASVPIIGWAGYAPDPTGATCTINWRQNDV 2 1 SFVSYTMMVVAINFIVPLIVMFYCYYHVTRSIKCRATSNCTEYLNRDWSDQVDVTK 0 0 MSVIMILMFLLAWSPYSIVCLWASFGDPKKIPPSMAIIAPLFAKSSTFYNPCIYVIANKK 2 1 FRRAMLAMFKCQTHQAMPMTSILPMDVPQNPLTSGRL* 0 >equCab Equus caballus (horse) 0 MLRNNLGNSSDSKNEDGSVFSQTEHNIVAAYLIMA 1 2 GMISILSNIIVLGIFVKYKELRTSTNAIIINLAVTDIGVSSIGYPMSAASDLYGRWKFGYAGCQ 0 0 IYAGLNIFFGMASIGLLTVVALDRYLTTCRPDA 1 2 GRRMTTSTYTSMILGAWVNGLFWALMPIVGWANYAPDPTGATCTINWRKNDA 2 1 SFVSYTMAVVAINFIVPLTVMFYCYYHVTRSMKRHPTSNCTQYLNTDWSDQVDVTK 0 0 MSVIMIFMFLVAWSPYSIVCLWASFGDPKKIPPSMAIIAPLFAKSSTFYNPCIYVVANKK 2 1 FRRAMFAMFKCQTHRAMPVTSILPMDVPQNQLASGRI* 0 >myoLuc Myotis lucifugus (microbat) 0 MLKNNLSNSSDFKNEDGSVFSQTEHNIVATYLIMA 1 2 GMISILSNVIVLGIFITYKELRTPTNAIIINLAVTDIGVSSIGYPMSAASDLHGSWKFGYAGCQ 0 0 1 2 GRKMTTHTYISMILGAWINGLFWALMPIIGWASYAPDPTGATCTINWRKNDV 2 1 SFVSFTMTVIAINFIVPLTVMFYCYYHVTRSIKCRATSNCTEYLNRDWADQVDVTK 0 0 MSVIMIFMFLVAWSPYSIVCLWASFGDPKKIPPSMAIIAPLFAKSSTFYNPCIYVVANKK 2 1 FRRAMFAMFKCQTHQTMTTMSFLPMDVPQNPLTSGRI* 0 >pteVam Pteropus vampyrus (macrobat) 0 MLRNNLGNSSNSKNEDGSVFSQTEHNIVATYLIIA 1 2 GMISILSNIIVLGIFFKYKELRTPTNAIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAGCQ 0 0 1 2 GRRMTTNTYIIMILGAWINGLFWALMPIIGWASYAPDPTGATCTINWRKNDA 2 1 SFVSFTMTVIAINFIVPLAVMFYCYYHVTRSIKCHTTSNCTEYLNRDWSDQVDVTK 0 0 MSVIMIFMFLVAWSPYSIVCLWASFGDPKKIPPSMAIIAPLFAKSSTFYNPCIYVVANKK 2 1 FRRAMFAMFKCQDHQSMPVTSVLPMDVPQNPLTSGRI* 0 >sorAra Sorex araneus (shrew) 0 MLENRSDNSSGPTPEGGAVLSRTEHRLVAAYLIVA 1 2 GVISILSNTAVLGIFIKYKELRTRTNAIIMNLALTDIGVSSIGYPMSAASDLHGSWRFGHAGCQ 0 0 VYAGLNIFFGMASIGLLTLVAVDRYLTLCRPDA 1 2 GRSMTTNSYVGLILGAWINGFFWALMPILGWASYAPDPMGATCTINWRGNDV 2 1 SFVSYTMTVVAVNFVMPLTVMAYCYYHVIRSL-----HSSPGHLDGDWSSQVDVTK 0 0 MSVVMILAFLAAWSPYSIVCFWASFGDPKKIPPSMAVIAPLFAKSSTFYNPCIYVVANKK 2 1 FRRAMSAMLTCRAQGAMPAASTLPMDAAHSPQASGRN* 0 >eriEur Erinaceus europaeus (hedgehog) 0 MLENNLGNGSDSENEDGSVFSQTEHNIVAAYLIVA 1 2 0 0 IYAGLNIFFGMASIGLLTVVAVDRYLTICRPHT 1 2 GRSMSANSYIAMILGAWLNGLFWALMPISGWAGYAPDPTGATCTINWRENDG 2 1 SFVSYTMTVIAINFMVPLAVMLYCYYHVTRSMKCHTARSCAEYVHGDWSDQVDVTK 0 0 MSVIMILMFLVAWSPYSVVCLWASFGDPKKIPPSMAIMAPLFAKSSTFYNPCIYVLANKK 2 1 FRRAMFAMFKCQTHQAMPVTNTLPMDIPQK* 0LDSRRN* 0 >loxAfr Loxodonta africana (elephant) 0 MLRNSLDNSSDSKNEDASVFSQTEHNIVATYLIMA 1 2 GMISILSNIIVLGIFIKYKELRTPTNAIIINLAVTDIGVSSIGYPMSAASDLHGRWKFGYTGCQ 0 0 IYAGLNIFFGMASIGLLTVVAVDRYLTICHPHI 1 2 GRRMTSNTYVSMILGAWINGLLWALLPITGWASYAPDPTGATCTINWRKNDA 2 1 SFVSYTMTVIVINFVVPLAVMFYCYYHVTRSIKRHTASNCAEYLNRDWSDQLDVTK 0 0 MSVIMILMFLVAWSPYSIVCLWASFGDSKKIPPSMAIIAPLFAKSSTFYNPCIYVVANKK 2 1 FRRAMFAMFKCQTHQAEPVTCILPMNVSQNPLAAGRI* 0 >proCap Procavia capensis (hyrax) 0 MLRNNLGNSSDSKNEEGSVFSQTEHNIVAAYLIMA 1 2 0 0 IYAGLNIFFGMSSIGLLTVVAIDRYLTICHPNV 1 2 2 1 SFVSYTVTVIMVNFVVPLAVMFYCYYHITLSIKCHAARNCAEPLCRDWSDQLDVTK 0 0 MSVIMILMFLMAWSPYSIVCLWASFGDPTKIPPSMAIIAPLFAKSSTFYNPCIYVVANKK 2 1 FRRAMLAMFKCQTHQAVPVTNILPMTVSQNSSASGRI* 0 >echTel Echinops telfairi (tenrec) 0 MLKNNEGTSSGSHSEEGSVFSQAEHRIVASYLITA 1 2 GMISILSNFIVLGIFIKYKELRTPTNAMIINLAVTDIGVSSIGYPMSAASDLHGSWKFGYAGCQ 0 0 VYAGLNIFFGMASIGLLTAVAVDRYLTICHPDR 1 2 GRRMTSNTYVGMILGAWINGFLWALLPVIGWASYAPDPTGATCTINWRENDA 2 1 SFVSYTMTVVGINFVVPLAVMFYCYYHVTQTIKRQTASNGAENLHRDWSDQLDVTK 0 0 2 1 FRRAMFALLQCQPQEARRVTSILPMNVSQNPMASGRL* 0 >dasNov Dasypus novemcinctus (armadillo) 0 MLRNNTGNSSDSKDEDGSVFSQTEHNIVATYLIMA 1 2 GIISIFSNIIVLSIFIKYKELRTPTNTIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAGCQ 0 0 IYAGLNIFFGMTSIGLLTVVAMDRYLTICRPDT 1 2 GRRMTINTYISMILGAWINGLFWALMPIAGWASYAPDPTGATCTINWRKNNI 2 1 SFVSYTMTVVAINFLVPLAVMLYCYYHVTRSIKHHTKSNCSEYFNRDWSNQVDVTK 0 0 MSVIMISMFLVAWSPYSIVCLWASFGDPEKIPPSMAIIAPLFAKSSTFYNPCIYVVANKK 2 1 FRRAIFAMLKCQTHQAMPVMSILPMNVSENPLASGRI* 0 >choHof Choloepus hoffmanni (sloth) 0 1 2 GIISILSNIVVLGIFIKYKELRTPTNTVIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAGCQ 0 0 IYAGLNIFFGMASIGLLTVVAVDRYLTICHPDV 1 2 GRRMTINTYISMILGAWINGLFWALLPVTGWASYAPDPTGATCTINWRKNDG 2 1 SFVSYTMTVVVINFVVPVAVMFYCYYHITRSIKRRNSSNCSEYLNRDWSDQVDVTK 0 0 MSVIMILMFLLAWSPYSIVCLWASFGDPKKIPPSMAIIAPLFAKSSTFYNPCIYVVANKK 2 1 FRTIMFAMLKCQTHQAVPVTSILPMNVSENPLASGRI* 0 >macEug Macropus eugenii (wallaby) 0 MFQNDSLEPEKESYSVFSPTEHNIVAAYLITA 1 2 GVISIPSNIIVLGIFVKYKELRTATNTIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAGCQ 0 0 IYAGLNIFFGMASIGLLTAVAIDRYLTICQPDL 1 2 2 1 SFVSYTMTVIAINFVMPLVVMFYCYYNVSLKMKQYTRSSCPEHINRDWSNQVDVTK 0 0 MSVIMILMFLLAWSPYSVVCLWASFGDPKEIPPAMAIIAPLFAKSSTFYNPCIYVAANKK 2 1 FRRAISAMMRCETHQSMPVSNALPLNLT* 0 >monDom Monodelphis domestica (opossum) 0 MFKNNSVKTLAPEKEGPSVFSPIEHKIVAAYLITA 1 2 GVISIVSNVIVLGIFVKYKALRTATNTIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYDGCQ 0 0 IYAGLNIFFGMASIGLLTAVAIDRYLTICQPDL 1 2 GGRMTSYNYTLMILTAWVNGFFWALMPIVGWAGYAPDPTGATCTINWRKNDV 2 1 SFVSYTMTVITINFAMPLGVMFYCYYNVSQKMKQYSPSNCPDHINRDWSNQVAVTK 0 0 MSVVMILMFLLAWSPYSIVCLWASFGDPKEIPPAMAIVAPLFAKSSTFYNPCIYVAANKK 2 1 FRRAISAMIRCQTHQSMPISNALPMN* 0 >ornAna Ornithorhynchus anatinus (platypus) 0 MRRNDSANLLESEHHDRSAFSQTDHNIVAAYLITA 1 2 GIMSIVSNVIVLGIFVKFEELRTATNAIIINLAVTDIGVSGIGYPMSAASDLHGSWKFGHAGCQ 0 0 IYAGLNIFFGMSSIGLLTVVAVDRYLTICRPAI 1 2 GRKMTRSNYTAMILAAWMNGFFWASMPLLGWASYASDPTGATCTINWRKNDA 2 1 SFISYTMTVIAVNFAVPLIVMFYCYYNVSKAMRQYPASRVLENLNIDWSEQVDVTK 0 0 MSVVMILMFLMAWSPYSIVCLWSSFGDPKKISPAVAIMAPLFAKSSTFYNPCIYVVANKK 2 1 FRRAMLSMVQCQTHREITITDVLPMNRSRSPLTL* 0 >galGal Gallus gallus (chicken) 0 MHWNDSANSSESDAEAHSVFTQTEHNIVAAYLITA 1 2 GVISIFSNIVVLGIFVKYKELRTATNAIIINLAFTDIGVSGIGYPMSAASDLHGSWKFGYTGCQ 0 0 IYAALNIFFGMASIGLLTVVAVDRYLTICRPDI 1 2 GRRMTTRNYAALILAAWINAVFWASMPTVGWAGYASDPTGATCTANWRKNDV 2 1 SFVSYTMSVIAVNFVVPLTVMFYCYYNVSRTMKQYTSSNCLESINMDWSDQVDVTK 0 0 MSVVMIVMFLVAWSPYSIVCLWSSFGDPKKISPAMAIIAPLFAKSSTFYNPCIYVIANKK 2 1 FRRAILAMVRCQTRQEITISNALPMTVSLSALTS* 0 >taeGut Taeniopygia guttata (finch) 0 MHWNDSSNSSESDDEAHSAFTQTEHNIVAAYLITA 1 2 GVISIFSNIVVLGIFVKYKELRTATNAIIINLAFTDIGVSGIGYPMSAASDLHGSWKFGYTGCQ 0 0 IYAALNIFFGMASIGLLTVVAVDRYLTICRPDI 1 2 GRRMTTRSYATLILAAWINAVFWSSMPTAGWASYAPDPTGATCTVNWRKNDA 2 1 SFISYTMSVIAVNFVVPLTVMFYCYYNVSRTMKQYASSNCLESINIDWSDQVDVTK 0 0 MSVVMIIMFLVAWSPYSIVCLWSSFGDPKKISPAMAIIAPLFAKSSTFYNPCIYVIANKK 2 1 FRRAILAMVRCQTRQEITINNALPMSVSQSALTSQNSSHLPA* 0 >anoCar Anolis carolinensis (lizard) 0 MFLNDSANSSESDDEPHSAFSQAEHNIVAAYLITA 1 2 GVISLLSNIVVLGIFVKYKELRTATNAIIINLAFTDIGVSGIGYPMSAASDLHGSWKFGYTGCQ 0 0 IYAALNIFFGMASIGLLTVVAIDRYLTICKPHI 1 2 GSRLTATNYTTLILAAWINALFWASMPVVGWASYAPDPTGATCTVNWRKNDT 2 1 SFVSYTMSVIAVNFVIPLSVMFYCYYNVSKTMKYYMRNSCLENINIDWSDQVDVTK 0 0 MSVVMIIMFLLAWSPYSIVCLWSSFGDPKKISPAMAIVAPLFAKSSTFYNPCIYVIANKR 2 1 FRRAILAMIRCQTRQEITINNVLPMSVSQSTIA* 0 >xenTro Xenopus tropicalis (frog) 0 METLAEVSTLLPAGTGTVNISDASSEVHSVFSQSEHNIVAAYLITA 1 2 GVISILSNIIVLGIFVKYKELRTATNAIIINLAFTDIGVSGIGYPMSAASDLHGSWKFGYVGCQ 0 0 IYAGLNIFFGMASIGLLTVVAIDRYLTICRPDI 1 2 GRRISGRHYTAMILAAWINAVFWSVMPVVGWSSYAPDPTGATCTINWRKNDV 2 1 SFVSYTMSVVAVNFVVPLMVMFYCYYNVSRTMKGYGSRSSLGGINADWSDQTDVTK 0 0 MSMVMIVMFLVAWSPYSIVCLWSSFGDPRKIPPAMAIIAPLFAKSSTFYNPCIYVIANKK 2 1 FRRAILSMVQCKSRQEVTLDNHFPMNVSQSTLTT* 0 >danRer Danio rerio (zebrafish) 0 MESGLLNVSAETVYGEKSAFTQTEHNIVAAYLITA 1 2 GVISLSSNIVVLLMFVKFRELRTATNAIIINLAFTDIGVAGIGYPMSAASDLHGSWKFGYMGCQ 0 0 IYAALNIFFGMASIGLLTVVAIDRYLTICRPDI 1 2 GQKLTTRSYTLLIVAAWLNAVFWSSMPIVGWAGYAPDPTGATCTINWRNNDT 2 1 SFVSYTMTVITVNFIIPLSVMFYCYYNVSATVKRFKASNCLDSINMDWSDQMDVTK 0 0 MSVIMIVMFLAAWSPYSIVCLWASFGDPQKIPAPMAIIAPLLAKSSTFYNPCIYVIANKK 2 1 FRRAIIGMIRCQTRQRVTINNQLPMMASSVPLNP* 0 >takRub Takifugu rubripes (fugu) 0 MKVFSLVNISNEVEGKSVFTQWEHNIVAGYLIIA 1 2 GVISLTSNIVVLLMFVKFKELRTATNFIIINLAFTDIGVAGIGYPMSAASDIHGSWKFGHTGCQ 0 0 IYAALNIFFGMASIGLLTVVAIDRYITICRPDI 1 2 GRKMTVQSYNLLILAAWLNAVFWSSMPVVGWAAYAPDPTGATCTINWRQNNA 2 1 SFISYTMAVIGVNFMLPLFVMFYCYYNVSVTVNRCKTSNCLDDINIEWSEQMDVTK 0 0 MSLVMIIMFLVAWSPYSIVCLWASFGDPKAIPAPMAIIAPLFAKSSTFYNPCIYVIANKK 2 1 FRRAIIGMIRCQTRQQMTINTEIPMTTSQQTATQ* 0 >tetNig Tetraodon nigroviridis (pufferfish) 0 MKSVNSSNDVEGKSAFTQWEHNIVAGYLITA 1 2 GIISLTSNIVVLLMFVKFKELRTATNFIIINLAFTDIGVAGIGYPMSAASDIHGSWKFGHTGCQ 0 0 IYAALNIFFGMASIGLLTVVAIDRYLTICRPDI 1 2 GRKMTVQSYNLLIAAAWLNAVFWSSMPVVGWAAYAPDPTGATCTINWRQNNV 2 1 SFISYTMAVISVNFILPLFVMFYCYYNVSVTVKQYKANNCLDNINIEWSEQMDVTK 0 0 MSIVMIIMFLVAWSPYSIVCLWASFGDPKTISAPMAIIAPLFAKSSTFYNPCIYVITNKK 2 1 FRQAIIGMIRCQTRQQITINTDIPMTASQQTLTQ* 0 >gasAcu Gasterosteus aculeatus (stickleback) 0 MGIDPEVNVTDDVTLYGGKSAFTQLEHNIVAGYLITA 1 2 GVISLFSNIVVLLMFWKFKELRTATNFIIINLAFTDIGVAGIGYPMSAASDIHGSWKFGYAGCQ 0 0 IYAALNIFFGMASIGLLTVVAIDRYLTICRPDI 1 2 GQKMTMQSYNLLILAAWLNAVFWSSMPVVGWASYAPDPTGATCTINWRQNDV 2 1 SFISYTMAVIAVNFVLPLSAMFYCYYNVSATVKRYKASNCLDSANIDWSDQMDVTK 0 0 MSIVMIIMFLVAWSPYSIVCLWASFGDPKTIPAPMAIIAPLFAKSSTFYNPCIYVIANKK 2 1 FRRAIIGMVRCQTRQRITINSQVPMTTSQQPLTQ* 0 >oryLat Oryzias latipes (medaka) 0 DSAVNVSDAAAPYGGKSAFTQLEHNIVAGYLITA 1 2 GVISLSSNVLVLLMFVKFRELRTATNIIIINLAFTDVGVAGIGYPMSAASDIHGSWKFGYTGCQ 0 0 IYAALNIFFGMASIGLLTVVAIDRYLTICRPDL 1 2 GQKMTMQSYNLLILAAWLNAVFWSSMPIVGWAGYAPDPTGATCTINWRQNDA 2 1 SFVSYTMAVIAVNFVVPLTVMFYCYYNVSATVKRYKASNCLDSINIDWSDQMDVTK 0 0 MSIVMIVMFLVAWSPYSMVCLWASFGDPKTIPAPMAIIAPLFAKSSTFYNPCIYVIANKK 2 1 FRRAIIGMIRCQTRQRITISTQVPMTISQQPLTQ* 0 >calMil Callorhinchus milii (elephantfish) 0 1 2 0 0 1 2 2 1 LFVSYTMTVIAVNFVVPLSVMFFCYYNVSKTMSRFISSPSPENINLDWSDQLDVTK 0 0 MSVVMIVMFLLAWSPYSIVCLWASFGNPKLIPPAMAIIAPLFAKSSTFYNPCIYVIANKK 2 1 FRKAIMAMICCQNRQEITINHTLPMTISRVPLTE* 0