Opsin evolution: Neuropsin phyloSNPs: Difference between revisions
Tomemerald (talk | contribs) No edit summary |
Tomemerald (talk | contribs) No edit summary |
||
Line 135: | Line 135: | ||
59calMil .............RV..S.LI.V.T.........III.....I.........A..........QNH.S...N..........................................................................................................* | 59calMil .............RV..S.LI.V.T.........III.....I.........A..........QNH.S...N..........................................................................................................* | ||
10homSap PFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKS-LEGFRLHTVT-TVRKSSAVLEIHEEV* | 10homSap PFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKS-LEGFRLHTVT-TVRKSSAVLEIHEEV* | ||
phyloSNP ...........................................B............B......CA.......................................................B.................................B...B.BBA............A..*</pre> | phyloSNP ...........................................B............B......CA.......................................................B.................................B...B.BBA............A..* | ||
</pre> | |||
== Neuropsins compared to Newropsins == | |||
Newropsins are a new gene family closely related to neuropsins. Like so many opsin families, they persist from chondrichthyes to archeosaurs but vanish without a trace in platypus, marsupials, and placentals. (The syntenic order B4GALT6 NEWR KIAA1012 remains conserved in mammals but no NEWR debris remains.) Newropsins retain many key attributes of GPCR signaling proteins and indeed opsins such as the seven transmembrane arrangement, Schiff base lysine, counterion tyrosine, amino terminal glycosylation site, and disulfide but have a very odd replacement of the G-protein binding site DRY with (invariantly conserved) VCC. | |||
This motif must be an ancient derived feature that followed the gene duplication event with neuropsin since the much older DRY could not plausibly have re-evolved in neuropsin from VCC. Newropsins very likely link covalently with a retinal and interact with light according to some lambda max type action spectrum. The VCC motif has been conserved over billions of years of branch length so cannot reflect simple loss of DRY functionality; however its signaling capabilities if any are most unclear. | |||
[[Image:Opsins_NEWR.png]] | |||
<br clear="all" /> | |||
<pre> | |||
position ...................................................................................................1.........1.........1.........1.........1.........1.........1.........1........1 | |||
position .........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2.........3.........4.........5.........6.........7........7 | |||
position 12345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789 | |||
excMemCy eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMccccccccccccccccccccMMMMMMMMMMMMMMMMMMMMMeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMcccccccccccccccccccccMMMMMMMMMMMMMMMMMMMMMeeeeeeee | |||
keyResid ...GLC.................................................................................................diS..cIon.................DRY?.............................................. | |||
NEWR2_galG MDPSFANS-TFQSKITEAADIVVGTCYMVFGICSLCGNSILLYISYKKKHLLKPAEYFIINLAISDLAMTLTLYPLAVTSSLSHRWLYGKHICLFYAFCGLFFGICSLSTLTLLSVVCCLKICFPAYGNRFRRKHGQILIACAWTYAAIFACSPLAHWGEYGEEPY | |||
NEWR2_anoC MESYFANT-TFHSKITEAADVIVGVFYIVFGICSFCGNSILLYVSYKKKNLLKPAEYFMINLAISDLGMTLTLYPLAVTSSLAHRWLFGQQVCLFYAFCGVFFGVCSLTTLTLLSIVCCLKICFPVYGNRFRPGHGWILIACAWVYAAIFAFSPLAHWGEYGAEPY | |||
NEWR2_xenT MGNKSDAS-AFYSSISETDDIVLGVLYSVFGLLSLSGNSMLLLVAYRKRSILKPAEFFIVNLSISDLGMTGTLFPLAIPSLFAHRWLFDKVTCNYYAFCGMLFGLCSLTNLTVLSSVCCLKVCYPAYGNKFSTAHSRILLLGIWAYAGLFATAPLADWGKYGPEPY | |||
NEWR2_danR MGNVSKTA-LFMSTISRQHDILMGSLYSVFFVLSLLGNGMLLFVAYRKRSSLKPAEFFVVNLSVSDLGMTLSLFPLAIPSALAHRWLFGEITCLCYAVCGVLFGLCSLTNLTALSSVCCLKVCFPNYGNKFSSSHACVMVIGVWCYASVFAVGPLVHWGSFGPEPY | |||
NEWR2_pimP MGNVSETA-LFVSTISRQHDILMGSLYSVFCVLSLLGNGMLLFVAYRKRSSLKPAEFFVINLSVSDLGMTLSLFPLAIPSALAHRWLFGEVVCLCYAVCGVLFGLCSLTNLTALSSVCCLKVCCPNYGNKFSSNHACVMVIGVWCYASVFAVGPLIRWGSFAPEPY | |||
NEWR2_tetN MGNASDTSDAFNSKISKEHDFLIGSIYSVFCVLSLMGNCILLLVAHHKRSTLKPAEFFIVNLSISDLGMTLTLFPLAIPSSFSHRWLFGEIACQLYATCGVLFGLCSLTNLTVLSSVCCLKVCLPNLGSKFSSSHARLLVAGVWGYASVFAVGPLVQWGHYGPEPY | |||
NEWR2_takR MGNASEASDIFLSKISKEHDILIGSIYSVFGLLSLAGNCILLLVAYHKRSMLKPAEFFIINLSISDLGMTLTLFPLAIPSSFSHRWLFGEITCQLYAMCGVLFGLCSLTNLTALSLVCCLKVCFPNHGSRFSSSHARLLVVGVWCYASVFAVGPLVQWGHYGPEPY | |||
NEWR2_gasA MGNASDTSAVFASTISKERDILMGSLYSVFGVLSLVGNCILLLVAYHKRSTLKPAEFFIINLSISDLGMTLSLFPLAIPSAFKHRWLFGELTCQLYAMCGVLFGLCSLTNLTALSFVCCLKVCFPNHGNRFSSSHARLLVVAVWGYASVFAVGPLARWGRYSPEPY | |||
NEWR2_oryL MGNVSDTSSLFASSISREHDILMGSLYSVFGLLSLSGNSMLLLVAYRKRSILKPAEFFIVNLSISDLGMTGTLFPLAIPSLFAHRWLFGEITCQLYAMCGVLFGLSSLTNLTALSLVCCLKVCFPNHGNKFSFSHARLLVAGVWCYASVFAVGPLARWGRYSAEPY | |||
NEWR2_calM GILSLVGNSVLLFVAYRKRQILKPAEYFVANLAVSDISMTVTLLPLAISSNFSHRWLFVSKPCMYYGFCSMLFGICSLTNLTVLSTVCCMKVCFPAYMSVVMIV-MFLLAWSPYSIVCLWASFGNPKLIPPAMAII | |||
NEUR_homSa MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTIIGILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGISVVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSYGVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYVPEPF | |||
NEUR_canFa MALNHTARPQDERLPHYLREGDPFASKLSWEADLVAGFYLTIIGILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAICDLGISVVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSYGVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPF | |||
NEUR_musMu MALNHTALPQDERLPHYLRDEDPFASKLSWEADLVAGFYLTIIGILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGISVVGKPFTIISCFCHRWVFGWFGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSYGVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPF | |||
NEUR_loxAf MTLNHTAPPQDDRLPQYLQDGDPFTSKLSWEADLVAGFYLTIIGILSTFGNGYVLYMSCRRKKKLRPAEIMTINLAVCDLGISVVGKPFVIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSYGVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPF | |||
NEUR_monDo MALNHSVSPQDDYIPHYLRDGDPFASKLSWEADLVAGFYLTIIGVLSTLGNGYVIYMSSKRKKKLRPAEIMTVNLAVCDLGISVVGKPFTIISCFSHRWVFGWVGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLSYGTWLKRHHAFICLALIWAYATFWATVPFAGVGSYAPEPF | |||
NEUR_ornAn MTNYSAPQLGDYLPHYLREGDPFVSKLSWEADLVAGVYLVIIGVLSTLGNGYVIYMSSRRKKKLRPAEIMTVNLAVCDLGISVVGKPFTIVSCFCHRWVFGWMGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLSYGTWLKRHHAYICLAIIWAYASFWATMPLVGLGNYAPEPF | |||
NEUR_calMi MTAFDNSTALYSGYWLHDSLHGDPFVSKLSWEADIISACYLIVTGLLSTLGNGYVIYLSITQKRKLKPPEILITNLAISDFGMSVGGQPFLIISCFSHRWIFGWVGCRWHGWAGFFFGCGSLITMTVVSLDRYLKICHLQYGSWLQRRHVFMSLAFIWFYAAFWATMPLVGWGNYAPEPF | |||
NEUR_galGa MASDCNSSSQEEYLPHYMQQEDPFASKLSREADIIAGFYLTVIGILSTLGNGYVIFMSSKRKKKLRPAEIMTVNLAVCDLGISVVGKPFSIISFFSHRWIFGWMGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLAYGTWLKRHHAFICLALIWAYATFWATVPFAGVGSYAPEPF | |||
NEUR_xenTr MAGNSSYREESGYIPHYERDSDPFASKLSREADIFAGVYLMAIGILSTLGNGYVIYMACSRKKKLRPAEIMTINLAVCDLGISVTGKPFAIVSCFSHRWVFGWNACRWYGWAGFFFGCGSLITLTVVSLDRYLKICHLRYGTWLKRRHAFIALAVIWAYATLWATLPLVGVGNYAPEPF | |||
NEUR_danRe MENETSISSGYIPHYLLRGDPFASKLSKEADIVAAFYILVIGILSATGNGYVMYMTFKRKTKLKPPEIMTLNLAIFDFGISVSGKPFFIVSSFSHRWLFGWQGCRYYGWAGFFFGCGSLITMTIVSFDRYLKICHLRYGTWLKRHHAFLSVVFIWAYAAFWATMPVVGWGNYAPEPF | |||
NEUR_takRu MENDTSIPSGYVPHYLLRGDPFASKLSKEADIVAAFYILVIGVLSATGNGYVIYQTIKRKTKLKPPEFMTLNLAVFDFGISVTGKPFFIVSSFSHRWLFGWQGCRYYGWAGFFFGCGSLITMTIVSLDRYLKICHLRYGTWFKRHHAFLCLVFTWLYAAFWATMPVVGWGNYAPEPF | |||
NEUR_tetNi MENETWTHSSYVPHYLLRGDPFASRLSKEADIVAALYICIIGLMSATGNGYVLYMTFKRKTKLKPPELMTLNLAIFDFGISVTGKPFFIVSSLSHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRYGAWLKRHHAFLCLASVWAYAAFWATMPLVGWGSYAPEPF | |||
NEUR_gasAc MDNETRSHPSYVPHYLLRGDPFASRLSKEADIVAAFYIFIIGVMSATGNGYVLYMTFKRKTKLKPPELMTVNLAIFDFGISVTGKPFFIVSSLSHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRYGTWLKRHHAFVCLALVWAYAAFWATMPLVGWGSYAPEPF | |||
NEUR_oryLa MENETWTHPSYIPHYLLRGDPFASRLSKEADIIAAFYICIIGIMSATGNGYVIYMTIKRKSKLKPPELMTVNLAVFDFGISVTGKPFFVVSSFAHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRYGTWLKRQHAFLCLVFVWMYAAFWATMPLVGWGNYAPEPF | |||
NEUR_pimPr MENTSWPHSSYVPHYLLRGDPFASRLSKEADIVAAFYILIIGIMSATGNGYVIYMTIKRKSKLKPPELMTVNLAVFDFGISVTGKPFFVVSSFSHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRYGTWLKRQHIFLCLVFVWIYAAFWATMPLVGWGSYAPEPF | |||
NEUR_anoCa MEQGQNISSQDDNQQEEDPFASKLSVEADIVAGVYLLVIGILSTLGNGYVIYMSTQRKKKLKPAEIMTVNLAVCDLGISVVGKPFSIIAFFSHRWIFGWSGCRWYGWAGFFFGIGSLITMTAVSLDRYFKICHLSYGTWLKRHHVFICLGIIWSYAAFWATIPFAGFGNYAPEPF | |||
position 1.........1.........2.........2.........2.........2.........2.........2.........2.........2.........2.........2.........3........3..........3.........3.........3.........3.....3 | |||
position 8.........9.........0.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0........1..........2.........3.........4.........5.....6 | |||
position 012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456 | |||
excMemCy eeeeeeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMccccccccccccccccccccccccccccccccccMMMMMMMMMMMMMMMMMMMMMeeeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMcccccccccccccccccccccccccccccccccccccccccccccc* | |||
keyResid .diS................................................................................................................K............................................................ | |||
NEWR2_galG GTACCIDWQSTNVDVMSMSYTVVLFVLCFILPCGVIVTSYSLILVTVKESRKAVEQHVSGPTRINNVQTITAKLSIAVCIGFFAAWSPYAIIAMWAAFGSIDKIPPLAFAIPAVFAKSSTLYNPIIHLLLKPNFRSNIAKDFTVIQQLCVR---CCFCVKELQ--TYRSTFNTGLRTFKG | |||
NEWR2_anoC GTACCIDWRISNMKKTAMSYTTALFVFCYIIPCGIIITSYTLILITVKDSRKAVEQHALGPTRMSSVHTITAKLSIAVCIGFFVAWSPYAIIAMWAAFGSIDMIPPLAFAVPAVFAKSSTLYNPAMYLFLKPNFRSTIAKDLTVLHRLCLK---SCFCPRGMQNCSYRSALEAPLKSFKG | |||
NEWR2_xenT GTACCLDWEASYRERKALSYTISLFVFCYLIPSSLIFISYTLIFVTVKGARRAVQQHLSPQAKGSSIHSLIIKLSIAVCIGFLIAWTPYAIVAMMAAFGDPTKIPSLVFALAAAFAKSSTIYNPVVYLLLKPNFLNVVTKDLTLFQTMCAV---VCGWCR-----TPAVKTPCPHKDLKT | |||
NEWR2_danR GTACCINWYTPSHDALAMSYIISLFIFCYVVPCTIIILSYTFILVTVRGSQQAVQQHVSPQTKVTNAHALIVKLSVAVCIGFLTAWSPYAIVAMWAAFSANEQVPPTAFALAAIMAKSSTIYNPMVYLLFKPNFRKSLSQDTQMFRHRICLSHSKASPSPGMKDQERQSSQQCNNKDGSI | |||
NEWR2_pimP GTACCINWYIPSHDALAMSYIISLFIFCYVVPCTIIILSYTFILLRVRGSRQAVQKHVSPKTKETNAHTLIVKLSVAVCIGFVTAWSPYAVVAMWAAFSANEPVPPTAFALAAILAKSSTIYNPMVYLLFKPNFRKILSQDTQNIRHRMCVSHSKASPTPEIK---AQSSQQC--KDATI | |||
NEWR2_tetN GTACCINWQAPNHELSSLSYIVCLFLFCYVLPCAIIILSYTCILMTVRGSRQAIQQHVSPQTKTANAHALIVKLSVAVCIGFLGAWSPYAVVAMWASFGDATWVPPDAFAIAAILAKSSTIYNPLVYLLCKPNFRECLYKDTSTLRQRIY----RGSPLSGPRDRSGGVTQR--HKDLSV | |||
NEWR2_takR GTACCIDWRAPNHELSSLSYIVCLFFFCYVLPCATIILSYTCILMTVRGSRQAIQQHVSPQTKTANAHSLIVKLSVAVCIGFLGAWSPYAIVAMWAAFGDATWVPPDAFAIAAILAKSSTIYNPVVYLLCKPNFRECLYKDTSTLRQRIY----RGSPQSEPRERFGGTSQR--HKDLSI | |||
NEWR2_gasA GTACCIDWHAPNHELAALSYIVCLFVFCYALPCATIFLSYTFILLTVRGSRQAVQQHVSPQTKTTNTHALIVKLSVAVCIGFLGAWTPYAVVAIWAAFGDATLVPPDAFALAAMFAKSSTIYNPVVYLLCKPNFRACLYRDTTLLRQRIY----RGSPRSEPKAHFGSTSQR--NKDMSV | |||
NEWR2_calM APLFAKSSTFYNPCIYVISYTMTVIAVNFVVPLSVMFFCYYNV | |||
NEWR2_oryL GTACCIDWHAPNHELWALSYILCLFIFCYALPCTIIFLSYAFILLTVRGSRQAVQQHVSPQTKTTNAHTLIVKLSVAVCIGFLGAWTPYAVIAMWAAFGDATQVPPTAFALAAVFAKSSTIYNPMVYLLCKPNFRECLCRDTSLLRHMIY----RGSP--QPQERFGSDSRR--NKDITA | |||
NEUR_homSa GTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKA-TKKKSLEGFRLHTVT-TVRKSSAVLEIHEEV | |||
NEUR_calJa GTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKA-TKKKSLEDFRLHTVT-TVRKSSAVLEIHEEV | |||
NEUR_canFa GTSCTLDWWLAQASLGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGRLKA-TKKKSLEDFRLNTVT-TVRKSSAVLEIHQEV | |||
NEUR_musMu GTSCTLDWWLAQASGGGQVFILSILFFCLLLPTAVIVFSYAKIIAKVKSSSKEVAHFDSRIHSSHVLEVKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYRFACCQAGGLRG-TKKKSLEDFRLHTVT-TVRKSSAVLEIHQEV | |||
NEUR_loxAf GTSCTLDWWLAQASVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEIAHFDSRIHSSHMLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLRA-TKKKSLEGFRLHTVT-TVKKSSAVLEVHQEV | |||
NEUR_monDo GTSCTLDWWLAQASVAGQAFVLSILFFCLLFPTAVIVFSYVKIILKVKSSTKEVAHYDTRIQNSHILEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGQPDSIPVQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCQSGGQKA-AKKESLRTYRLHTVT-TVRRSSAVLEIHQEV | |||
NEUR_ornAn GTSCTLDWWLAQASVAGQAFILNILFFCLLLPTAVIVFSYVKIIAKVKSSTKEVAHFDSRIQNSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGQPDSIPIQFSVVPTLLAKSAAMYNPIIYQVIDCRISCCRLGGPKT-GKKESLKNSRSHSMS-TIRKPSAVSGPHQEV | |||
NEUR_calMi GTSCTLDWWLARVSVSGLIFVLTILFFCLLLPIIIIVFSYIKIIAKVKSSAKEVAHFDSRIQNHHSLEMNLTK | |||
NEUR_galGa GTSCTLDWWLAQASVAGQAFVLSILFFCLLFPTAVIVFSYVKIILKVKSSTKEVAHYDTRIQNSHILEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGQPDSVPIQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCRSGGPKTLQKKSSLKESRMYTIS-SHRDSAALSGTQLEV | |||
NEUR_xenTr GTTCTLDWWLAQASVKGQIFVLSMLFFCLLFPTMVIVFSYAKIIAKVKSSAKEVAHFDTRNQNNHTLEIKLTKVAMLICAGFLIAWFPYAVVSVWSAFGQPDSIPIELSVVPTMMAKSASMYNPIIYQVIDCKPACCKK------DKSLQNTTSRVYTIS-TFRKSTTSAR | |||
NEUR_danRe GTSCTLDWWLTQASVSGQSFVMCMLFFCLIFPTVIIVFSYVMIIFKVKSSAKEVSHFDTRNKNNHSLEMKLTKVAMLICAGFLIAWIPYAVVSVMSAFGEPDSVPIPVSVVPTLLAKSSAMYNPIIYQVIDCKKKCVKSCCFQAWRKKKPSKTSRFYTISGSIKQR-PGDEASIEI | |||
NEUR_takRu GTSCTLDWWLAQASVSGQSFVMCMLIFCLVLPTGVIVFSYVMIILQVKSSAQEVSHFDTQNKNKHHLEMKLTKVAMLICAGFLIAWIPYAVVSVVSAFGDPDSVPISISVVPTLLAKSSAMYNPIIYQVIDCKKNCAKLSCFQAWSKRKHYKTSRFYSISASMKKR-PANEVPTEI | |||
NEUR_tetNi GTSCTLDWWLAQASVSGQSFVMAILFFCLILPTGIIVFSYVMIIFKVKSSAKEISHFDARIRNSHDLEIKLTKVAMLICAGFLIAWIPYAVVSVISAFGEPDSVPIPVSVIPTLLAKSSAMYNPIIYQVVDVKTSCTNFSCCKALKERIHFRKSRLYTISGSLRDPLPPKEAHIEM | |||
NEUR_gasAc GTACTLDWWLAQASVSGQSFVMAILFFCLVLPTGIIVFSYIMIIFKVKSSAKEISHFDARIKNSHSLEIKLTKVAMLICAGFLIAWIPYAVVSVVSAFGEPDSVPIPVSVIPTLLAKSSAMYNPIIYQVADLKTSCTSSSCCKALKERVLFRKARLYTISGSLRDTLPPKEAHIEM | |||
NEUR_oryLa GTSCTLDWWLAQASVSGQSFVVAILFFCLVLPAGIIVFSYVMIIFKVKSSAKEISNFDARIKNSHNLEIKLTKVAMLICAGFLIAWIPYAVVSVVSAFGEPDSVPISVSVIPTLLAKSSAMYNPIIYQVLDLKNSCMKSSCFKGLKKPRHFRKSRFYTISGSVKDNTTAKEAQIEM | |||
NEUR_pimPr GTSCTLDWWLAQASVSGQSFVMSILFFCLVLPAGIIVFSYVMIICKVKSSSKEVSSFDARIKNSHTLEIKLTKVAMLICAGFLIAWIPYAVVSVVSAFGEPDSIPIPVSVIPTLLAKSSAMYNPIIYQLVDLKNSC-STCCAKVIRKRTHFRNSRFYTISGSLKDTAPAKEAHIEI | |||
NEUR_anoCa GTSCTLDWWLAQGSVAGQAFILNILFFCLVLPTAVIMFCYVKIIAKVQSSTKEVAHYDTRIQNQHVLEMKLTKVAMLICAGFMFAWIPYAVVSVWSAFGRPDSVPIKVSVIPTLLAKSAAMYNPVIYQVIDCKSACCRPGNLQPLQKKNSR | |||
</pre> | |||
== Curated Set of 51 deuterostome neuropsins == | == Curated Set of 51 deuterostome neuropsins == | ||
Line 558: | Line 635: | ||
</pre> | </pre> | ||
== Curated Set of 51 vertebrate newropsins == | |||
<pre> | <pre> | ||
> | >NEWR_galGal GenBank 3 garbage 5'UTR translated exons -B4GALT6 -NEWR2_galGal -KIAA1012 | ||
0 MDPSFANSTFQSKITEAADIVVGTCYMVF 1 | 0 MDPSFANSTFQSKITEAADIVVGTCYMVF 1 | ||
2 GICSLCGNSILLYISYKKKHLLKPAEYFIINLAISDLAMTLTLYPLAVTSSLSHR 2 | 2 GICSLCGNSILLYISYKKKHLLKPAEYFIINLAISDLAMTLTLYPLAVTSSLSHR 2 | ||
Line 568: | Line 645: | ||
CSYFPSEKGSHTFECFKSYPNCFQERLSTMGCHLQDCESLENDLQVEVTQGSRNSMKVVEQEEKSTELDNLEITLEAVPVSCTFTDL* 0 | CSYFPSEKGSHTFECFKSYPNCFQERLSTMGCHLQDCESLENDLQVEVTQGSRNSMKVVEQEEKSTELDNLEITLEAVPVSCTFTDL* 0 | ||
> | >NEWR_anoCar Anolis carolinensis | ||
0 MESYFANTTFHSKITEAADVIVGVFYIVF 1 | 0 MESYFANTTFHSKITEAADVIVGVFYIVF 1 | ||
2 GICSFCGNSILLYVSYKKKNLLKPAEYFMINLAISDLGMTLTLYPLAVTSSLAHR 2 | 2 GICSFCGNSILLYVSYKKKNLLKPAEYFMINLAISDLGMTLTLYPLAVTSSLAHR 2 | ||
Line 576: | Line 653: | ||
YFPCEKCHDPFECFKNYPKCCQGRLNVMDHTPRESISVENNMQSKTKHASEKYIKVVIRGEKNTDIDNLEITLEHIPTDIKFANL* 0 | YFPCEKCHDPFECFKNYPKCCQGRLNVMDHTPRESISVENNMQSKTKHASEKYIKVVIRGEKNTDIDNLEITLEHIPTDIKFANL* 0 | ||
> | >NEWR_xenTro abundant transcripts | ||
0 MGNKSDASAFYSSISETDDIVLGVLYSVF 1 | 0 MGNKSDASAFYSSISETDDIVLGVLYSVF 1 | ||
2 GLLSLSGNSMLLLVAYRKRSILKPAEFFIVNLSISDLGMTGTLFPLAIPSLFAHR 2 | 2 GLLSLSGNSMLLLVAYRKRSILKPAEFFIVNLSISDLGMTGTLFPLAIPSLFAHR 2 | ||
Line 584: | Line 661: | ||
LKTTSKPPSSFKKSQGVCRNCVDTFECFRNYPRCCSVGNVDAAQPMAASLVRIPPANGAPQQTVQLVVSSSRTRSGVETVEVSTEAPMSDFIKDFI* 0 | LKTTSKPPSSFKKSQGVCRNCVDTFECFRNYPRCCSVGNVDAAQPMAASLVRIPPANGAPQQTVQLVVSSSRTRSGVETVEVSTEAPMSDFIKDFI* 0 | ||
> | >NEWR_danRer Danio rerio acquired new intron | ||
0 MGNVSKTALFMSTISRQHDILMGSLYSVF 1 | 0 MGNVSKTALFMSTISRQHDILMGSLYSVF 1 | ||
2 FVLSLLGNGMLLFVAYRKRSSLKPAEFFVVNLSVSDLGMTLSLFPLAIPSALAHR 2 | 2 FVLSLLGNGMLLFVAYRKRSSLKPAEFFVVNLSVSDLGMTLSLFPLAIPSALAHR 2 | ||
Line 593: | Line 670: | ||
SQQCNNKDGSISTPFSSGQAESYGACHVYAEAGPHYQQISRQITARVLEGSVQSEIPVKQLTEKMQNDLL* 0 | SQQCNNKDGSISTPFSSGQAESYGACHVYAEAGPHYQQISRQITARVLEGSVQSEIPVKQLTEKMQNDLL* 0 | ||
> | >NEWR_tetNig Tetraodon nigroviridis gene mix | ||
0 MGNASDTSDAFNSKISKEHDFLIGSIYSVF 1 | 0 MGNASDTSDAFNSKISKEHDFLIGSIYSVF 1 | ||
2 CVLSLMGNCILLLVAHHKRSTLKPAEFFIVNLSISDLGMTLTLFPLAIPSSFSHR 2 | 2 CVLSLMGNCILLLVAHHKRSTLKPAEFFIVNLSISDLGMTLTLFPLAIPSSFSHR 2 | ||
1 WLFGEIACQLYATCGVLFGLCSLTNLTVLSSVCCLKVCLPNL 1 | 1 WLFGEIACQLYATCGVLFGLCSLTNLTVLSSVCCLKVCLPNL 1 | ||
2 | 2 GSKFSSSHARLLVAGVWGYASVFAVGPLVQWGHYGPEPYGTACCINWQAPNHELSSLSYIVCLFLFCYVLPCAIIILSYTCILMTVRGSRQAIQQHVSPQTKTANAHALIVK 0 | ||
0 | 0 LSVAVCIGFLGAWSPYAVVAMWASFGDATWVPPDAFAIAAILAKSSTIYNPLVYLLCKPNFRECLYKDTSTLRQRIYRGSPLSGPRDRSGGVTQRHKDLSVSTR | ||
LSNGQQDSYGTCLHCAEDAELGHVTGSRRTACILTGSTFTEVTLSQLSATPADLL* 0 | |||
> | >NEWR_takRub Fugu rubripes | ||
0 MGNASEASDIFLSKISKEHDILIGSIYSVF 1 | 0 MGNASEASDIFLSKISKEHDILIGSIYSVF 1 | ||
2 GLLSLAGNCILLLVAYHKRSMLKPAEFFIINLSISDLGMTLTLFPLAIPSSFSHR 2 | 2 GLLSLAGNCILLLVAYHKRSMLKPAEFFIINLSISDLGMTLTLFPLAIPSSFSHR 2 | ||
Line 609: | Line 686: | ||
LSNGQQDSYGTCLHCADDAERGHVTTSQRTACILTGSTFTEVTVGQLSAAPADLL* | LSNGQQDSYGTCLHCADDAERGHVTTSQRTACILTGSTFTEVTVGQLSAAPADLL* | ||
> | >NEWR_gasAcu Gasterosteus aculeatus | ||
0 MGNASDTSAVFASTISKERDILMGSLYSVF 1 | 0 MGNASDTSAVFASTISKERDILMGSLYSVF 1 | ||
2 GVLSLVGNCILLLVAYHKRSTLKPAEFFIINLSISDLGMTLSLFPLAIPSAFKHR 2 | 2 GVLSLVGNCILLLVAYHKRSTLKPAEFFIINLSISDLGMTLSLFPLAIPSAFKHR 2 | ||
Line 617: | Line 694: | ||
APCHVMTPQRTACILTESTNREVTVSRLADKPQADFL* | APCHVMTPQRTACILTESTNREVTVSRLADKPQADFL* | ||
> | >NEWR_oryLat Oryzias latipes | ||
0 MGNVSDTSSLFASSISREHDILMGSLYSVF 1 | 0 MGNVSDTSSLFASSISREHDILMGSLYSVF 1 | ||
2 GLLSLSGNSMLLLVAYRKRSILKPAEFFIVNLSISDLGMTGTLFPLAIPSLFAHR 2 | 2 GLLSLSGNSMLLLVAYRKRSILKPAEFFIVNLSISDLGMTGTLFPLAIPSLFAHR 2 | ||
Line 625: | Line 702: | ||
TGLCQLASPQNTACILTGSTYAEVTVQQLVDKQQPDFL* 0 | TGLCQLASPQNTACILTGSTYAEVTVQQLVDKQQPDFL* 0 | ||
> | >NEWR_pimPro Pimephales promelas | ||
0 MGNVSETALFVSTISRQHDILMGSLYSVF 1 | 0 MGNVSETALFVSTISRQHDILMGSLYSVF 1 | ||
2 CVLSLLGNGMLLFVAYRKRSSLKPAEFFVINLSVSDLGMTLSLFPLAIPSALAHR 2 | 2 CVLSLLGNGMLLFVAYRKRSSLKPAEFFVINLSVSDLGMTLSLFPLAIPSALAHR 2 | ||
Line 632: | Line 709: | ||
0 LSVAVCIGFVTAWSPYAVVAMWAAFSANEPVPPTAFALAAILAKSSTIYNPMVYLLFKPNFRKILSQDTQNIRHRMCVSHSKASPTPEIK-AQSSQQCKDATISTPFSSGQAESYGTCHIYAEAEPHFQQISPQRTVRILEGIIQSEISVRHMTDRMQNDLL* 0 | 0 LSVAVCIGFVTAWSPYAVVAMWAAFSANEPVPPTAFALAAILAKSSTIYNPMVYLLFKPNFRKILSQDTQNIRHRMCVSHSKASPTPEIK-AQSSQQCKDATISTPFSSGQAESYGTCHIYAEAEPHFQQISPQRTVRILEGIIQSEISVRHMTDRMQNDLL* 0 | ||
> | >NEWR_calMil frag | ||
0 | 0 1 | ||
2 GILSLVGNSVLLFVAYRKRQILKPAEYFVANLAVSDISMTVTLLPLAISSNFSHR 2 | 2 GILSLVGNSVLLFVAYRKRQILKPAEYFVANLAVSDISMTVTLLPLAISSNFSHR 2 | ||
1 WLFVSKpCMYYGFCSMLFGICSLTNLTVLSTVCCMKVCFPAY 1 | 1 WLFVSKpCMYYGFCSMLFGICSLTNLTVLSTVCCMKVCFPAY 1 |
Revision as of 14:18, 17 March 2008
Neuropsin backgrounder
Neuropsin (OPN5) is a deeply diverged member of the opsin family with a single publication on it and considerable confusion over the name (mostly used for an unrelated kalikrein serine protease KLK8, not an opsin). There are no known disease associations or described knockout phenotypes; it is expressed primarily in brain, spinal chord, and testes.
Neuropsin has all the classical attributes of a rhodopsin-class GPCR and indeed opsin photoreceptor: Schiff base lysine at expected position, standard tyrosine counterion and DRY motif, seven transmembrane configuration, disulfide at expected position, proximal glycosylation and distal palmitoylation and kinase sites. It is most closely related to peropsin and rgropsin in terms of blast clustering and intron positioning. Its G-protein signalling partner is not known.
The evolution is illuminated by the massive comparative genomics study described here, which extractes and compares over 50 full length sequences from various genomics projects. Neuropsin can be located outside chordates but not outside deuterostomes. However, like peropsin and rgropsin, it must have originated much earlier in pre-Bilaterans. Thus its absence in earlier diverging species must be due either to gene loss or unrecognizability.
Within placental mammals, neuropsin is extraordinarily conserved, with percent identity relative to human protein 96% averaged over 31 species (likely above the 95% percentile proteomewide). That conservation drops considerably at marsupials and monotremes (86%), is less striking at tetrapods (78%), and unremarkable at teleost fish (68%). This pattern suggests neuropsin acquired significant new adaptive functionality on the placental mammal stem, leading to marked resilience to fixation of any further variation.
The structure of the neuropsin gene is rather odd. In human, a weak splice signal appears to have developed that results in a seventh very short exon. However a stop codon is encountered if the splice is not made. This results in two slightly different alternative carboxy termini sequences QVE vs XXX. Very few transcripts exist in this region for any species but it appears that the recent ancestral form of the protein only utilizes the stop codon in exon 6.
Novel neuropsins in amphioxus and sea urchin
The genome of Branchiostoma (amphioxus, lancelet) contains two distinct neuropsins about 75% identical to each other and 42% to human. These cluster unambiguously with vertebrate opsins and share critical conserved residues. An extra intron distinguishes them from the vertebrate neuropsin pattern. Recall Branchiostoma species has three rather diverged (and well-studied) peropsins but no evident Rgr opsin. These raises the question whether neuropsin and peropsin developed substantial visual roles in this species as an alternative to the ciliary imaging opsin pathway seen by lamprey divergence. Sea urchins, but not acornworm Saccoglossus, contain a single neuropsin that is quite diverged.
These neuropsins are newly reported here, meaning they were not localized in recent in situ hybridization studies. That's especially unfortunate in view of the antecedent role the Branchiostoma ancestral node plays in the evolution of chordate eye and the complexities of photoreceptor tissues in the extant species.
PhyloSNPs in vertebrate neuropsins
Alignment analysis coming shortly. Neuropsin has rather few of them.
position ...................................................................................................1.........1.........1.........1.........1.........1.........1.........1......1.. position .........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2.........3.........4.........5.........6.........7......7.. position 123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567.. excMemCy eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMccccccccccccccccccccMMMMMMMMMMMMMMMMMMMMMeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMcccccccccccccccccccccMMMMMMMMMMMMMMMMMMMMMeeeeee.. keyResid ...GLC.................................................................................................diS..cIon.................DRY............................................... exonNumb 111111111111111111111111111111111111111111122222222222222222222222222222222222222223333333333333333333333333333333333333333333333333333333334444444444444444444444444444444444444.. 10homSap MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTIIGILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGISVVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSYGVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYVPE.. 11panTro ................................................................................................................................................................................... 12gorGor ............................................................................................................................................------------------..................... 13ponPyg ................................................................................................................................................................................... 14nomLeu ................................................................................................................................................................................... 15rheMac .................................I............................................................................................................................................A.... 16papHam .................................I............................................................................................................................................A.... 17calJac ......S.......................................................................................................................................................................A.... 18tarSyr ....................................................................................I.........................................................................................A.... 19otoGar ............LR..........................................................V...................................................................----------------------..M......E....... 20micMur ......V.................................I..........................................---------------------------------------------------------..............L....V..............A.... 21tupBel ............S.............................................................................................................................................V...................A.... 22musMus ....................E..................................................................................F..................................................V...................A.... 23ratNor ....................E..................................................................................F..................................................V...................A.... 24speTri ............H.......E................................................................................................................................F....V...................A.... 25dipOrd ..F....GT.GQG.....PEE...T........I........................................................................................................................V........................ 26cavPor .......P..N.H..R..Q.E...V.......................V...........................I..................R.............................V.................................V..............A.... 27oryCun ............H......E..............................................................................................................................R.......L...................A.... 28ochPri ....D.......H....F.........................................................................................L....D.....................------......R.......V...................A.... 29canFam .......R...........E........................................................I.............................................................................V...................A.... 30felCat .......P...........E.................................................--------------.......................................................................V...................A.... 31bosTau .......P.P...R.P........................................................V...I................................................................I............V.....A.............A.... 32turTru ......................K..........I.............................................................................V...................A.... 33susScr .......P.P...R.....E....................................................V...I.............................................................................V.....A.............A.... 34vicVic .......P.P...R.RH...............................L..................................................M........................................-------------------------------------.. 35equCab ............................................................................I.............................................................................V........................ 36myoLuc ............G.....Q.............................V.......................V...........................................................................T.....F........................ 37pteVam ......V.....H....V..............................V...........................I.............................................................................V........................ 38sorAra ............N..........................................................................................M.................................................VV...................A.... 39eriEur .S..Q.......G.........................................................................................................................................L...V...................A.... 40loxAfr .T.....P...D...Q..Q.....T.................................C..............................V................................................................V...................A.... 41proCap .T....V..E.D..S.........T................V......C.........Y..............................I...S..........................................H...-------------------------------------.. 42echTel .......P...NS...........V.........G.....I.................Y....................................S.......T..................................................V........................ 43dasNov ...........D...............................................K..............................................................................................V........................ 44choHof ......G....DS....F......................I....................R............................................................................................V.............L.......... 45monDom .....SVS...DYI..............................V...L.....I....K............V......................S.......V................................H....T....H..F....L.....T..A.V.FA.V.S.A.... 46macEug .V...L.....I....K............V..............................V................................H....T....H......VI.....T..A....A...N.A.... 47ornAna MT.YS.PQLGDY......E....V............V..V...V...L.....I.................V..................V...........M................................H....T....H.......I........A........N.A.... 48galGal ..SDCNSSS.E.Y....MQQE........R...II......V......L.....IF...K............V................S...F.S...I...M................................H.A..T....H..F....L.....T..A.V.FA.V.S.A.... 49taeGut ..SEYNNSS.E.YI....QEE........R...II.............L.....IF...K............V................S...F.S...M......C.............................H....T....H..F....I.....M..A.V.FA.V.S.A.... 50anoCar .EQGQNISS..DN----QQEE........V...I...V..LV......L.....I...TQ.....K......V................S..AF.S...I...S.............I..............F...H....T....H.VF...GI..S..A..A.I.FA.F.N.A.... 51xenTro ..G.SSYREESGYI...E..S........R...IF..V..MA......L.....I..ACS........................T....A.V...S.......NA..................L.V..........H.R..T....R..F.A..V.....TL.A.L....V.N.A.... 52danRer .E-NET-SISSGYI....LR.........K...I..A..ILV.....AT.....M..TFK..T..K.P....L...IF.F....S....F.V.S.S...L...Q...Y.................I..F.......H.R..T....H..FLSVVF.....A..A...V..W.N.A.... 53pimPro .E-NDT-SIPSGYV....LR.........K...I..A..ILV..V..AT.....I.QTIK..T..K.P.F..L....F.F....T....F.V.S.S...L...Q...Y.................I..........H.R..T.F..H..FL..VFT.L..A..A...V..W.N.A.... 54takRub .E-NET-WTHSSYV....LR......R..K...I..AL.IC...LM.AT........TFK..T..K.P.L..L...IF.F....T....F.V.SLS...L...E...F.................V..........H.R..A....H..FL...SV....A..A......W.S.A.... 54tetNig .D-NET-RSHPSYV....LR......R..K...I..A..IF...VM.AT........TFK..T..K.P.L..V...IF.F....T....F.V.SLS...L...E...F.................V..........H.R..T....H..FV...LV....A..A......W.S.A.... 56gasAcu .E-NET-WTHPSYI....LR......R..K...II.A..IC....M.AT.....I..TIK..S..K.P.L..V....F.F....T....FVV.S.A...L...E...F.................V..........H.R..T....Q..FL..VFV.M..A..A......W.N.A.... 57oryLat .E-N.S-W.HSSYV....LR......R..K...I..A..IL....M.AT.....I..TIK..S..K.P.L..V....F.F....T....FVV.S.S...L...E...F.................V..........H.R..T....Q.IFL..VFV.I..A..A......W.S.A.... 59calMil .TFDNSTALYSGYWL.DSLH....V........IISAC..IVT.L...L.....I.L.ITQ.R..K.P..LIT...IS.F.M..G.Q..L.....S...I...V....H................V..........H.Q..S..Q.R.VFMS..F..F..A..A......W.N.A.... 10homSap MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTIIGILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGISVVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSYGVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYVPE.. phyloSNP ..................AA......B......A..A..B.....B.BB.........B......C.A.........B.A....A..........B......................................A...B..A.......A.............A............... .. .. position 1.1.........1.........2.........2.........2.........2.........2.........2.........2.........2.........2.........2.........3........3..........3.........3.........3.........3.....3 position 7.8.........9.........0.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0........1..........2.........3.........4.........5.....6 position 89012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456 excMemCy eeeeeeeeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMccccccccccccccccccccccccccccccccccMMMMMMMMMMMMMMMMMMMMMeeeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMcccccccccccccccccccccccccccccccccccccccccccccc* keyResid ...diS................................................................................................................K........................................... exonNumb 44444444444444444444444444444444444444444444444444444444444444444444444444455555555555555555555555555555555555555555555555555555555555555555555555555555555556666666666666666666666 10homSap PFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKS-LEGFRLHTVT-TVRKSSAVLEIHEEV* 11panTro .................................................................................................................................................E.....-..........-...............* 12gorGor ...............................................................P.....................................K.................................................-....Q.....-...............* 13ponPyg .....................................................................................................S.................................................-..........-...............* 14nomLeu .......................................................................................................................................................-..........-S..............* 15rheMac .......................................................................................................................................................-..........-...............* 16papHam .......................................................................................................................................................-..........-...............* 17calJac .......................................................................................................................................................-..D.......-...............* 18tarSyr ......................................................G.........T......................................................................................-..D.......-...............* 19otoGar ..-...........-.L...............................................G................................................................................T.....-..D.......-............Q..* 20micMur .......................................................I.............................................................................----------------------H......-A...........Q..* 21tupBel ....................I..................................I..........................................................................................................-............Q..* 22musMus ................G.......S.................A...........................V...............................................................R.....A...RG.....-..D.......-............Q..* 23ratNor ................G.......S.................A...........................V................................N..............................R.........R......-..D.......-A...........Q..* 24speTri .......................-..............E..............E..............-------.......................................S....................................-..D.......-A.......V...Q..* 25dipOrd ................LA.................S...........................P.......................................................................................-.......................Q..* 26cavPor ................A...I...H...........M..................I.............................................................................SR.....NA.........-..D.......-.D...-......Q..* 27oryCun ........................................................................................................................................S..R.S.........-..D.......-............Q..* 28ochPri ................................................................G.......................................................................S..R.....Q.....-..D.......-............Q..* 29canFam ................L...I.........................................................................................................................R........-..D...N...-............Q..* 30felCat ....................I..................................................................................................................................-..D.......-............Q..* 31bosTau ....................I.................................................V................................................................................-..D.......-..........V.Q..* 32turTru ....................I..........................................P..........................................................V............................-..D.......-............Q..* 33susScr .....................................................................................V.................................................................-..D.......-...........RQ..* 34vicVic ---------------------------------------------------------------------------............................................................................-..D....A..-............Q..* 35equCab ....................I...........................................G.....V................................................................................-..D.......-............Q..* 36myoLuc ...............T....I.................................K.........-----------.............................................S..............L........R......-..N.......-............Q..* 37pteVam ....................I...........................................G..M.........................................................................S..R......-..D.....I.-...EA.......Q..* 38sorAra ....................I..................................................................................N..............................R.....S...R......-.DD.......-...E........Q..* 39eriEur ................L...I.................................K............M..............................................................................N....-.KDY...................q..* 40loxAfr ....................I..................................I...........M............................................................................R......-..........-..K.......V.Q..* 41proCap ---------------------------------------------------------------------------..............................V.................................R.R..R...E..-...V......-............Q..* 42echTel .......................................................I...........M............................................................................................I.-...........HQ..* 43dasNov ................................................................................................................................................R......-..D.......-...E......V.Q..* 44choHof ...................................................................M.......................................................................R....R......-F.........-............Q..* 45monDom .................A..A.V.S.......F.............L.....T.....Y.T..QN..I.................................Q.....V.F.......................C......S..Q..A..E.-.RTY......-...R........Q..* 46macEug .................T..T...............................T..........Q..............................................................................................RHTVSTIRKSSSVSETYQ..* 47ornAna .................A..A...............................T..........QN....................................Q.......F.......................CRIS..RL..P.TG..E.-.KNS.S.SMS-.I..P...SGP.Q..* 48galGal .................A..A.V.S.......F.............L.....T.....Y.T..QN..I.................................Q...V...F.......................C.....RS..P.TLQ...S.KES.MY.IS-SH.D.A.LSGTQL..* 49taeGut .................A....V.S.....................L.....T.....Y.T..QN..I.................L...................V...F......................ECRL...RP..* 50anoCar ..............G..A..A..........V......M.C........Q..T.....Y.T..QNQ..................MF...................V..KV..I.............V......C.S...RP.N.QPLQ..NSR* 51xenTro ....T............K..I.V.SM......F..M......A.........A.......T.NQNN.T..I.................F............Q......E......MM....S...........C.P...KKD--.SLQNTT----S.VY.IS-.F...TTSAR* 52danRer ............T....S..S.VMCM.....IF..VI......M..F.....A...S...T.NKNN.S............................M....E...V..PV..........S............C.KK.VKSCCFQ.WR..KPSKTS.FY.ISGSIKQRPGD-.ASI.I* 53pimPro .................S..S.VMCM.I...V...G.......M..LQ....AQ..S...TQNKNK.H............................V....D...V..SI..........S............C.KN.AKLSCFQ.WS.RKHYKTS.FYSISASMK.RP.N-.VPT.I* 54takRub .................S..S.VMA......I...GI......M..F.....A..IS...A..RN..D..I.........................I....E...V..PV..I.......S..........V.V.TS.TNFSCC..L.ERIHFRKS..Y.ISGSL.DPLPPK.A.I.M* 54tetNig ....A............S..S.VMA......V...GI.....IM..F.....A..IS...A..KN..S..I.........................V....E...V..PV..I.......S..........A.L.TS.TSSSCC..L.ERVLFRKA..Y.ISGSL.DTLPPK.A.I.M* 56gasAcu .................S..S.VVA......V..AGI......M..F.....A..ISN..A..KN..N..I.........................V....E...V..SV..I.......S..........L.L.NS.MKSSCF.GL..PRHFRKS.FY.ISGS.KDNTTAK.AQI.M* 57oryLat .................S..S.VMS......V..AGI......M..C.........SS..A..KN..T..I.........................V....E......PV..I.......S.........LV.L.NS.-S.CCA.VIR.RTHFRNS.FY.ISGSLKDTAPAK.A.I.I* 59calMil .............RV..S.LI.V.T.........III.....I.........A..........QNH.S...N..........................................................................................................* 10homSap PFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKS-LEGFRLHTVT-TVRKSSAVLEIHEEV* phyloSNP ...........................................B............B......CA.......................................................B.................................B...B.BBA............A..*
Neuropsins compared to Newropsins
Newropsins are a new gene family closely related to neuropsins. Like so many opsin families, they persist from chondrichthyes to archeosaurs but vanish without a trace in platypus, marsupials, and placentals. (The syntenic order B4GALT6 NEWR KIAA1012 remains conserved in mammals but no NEWR debris remains.) Newropsins retain many key attributes of GPCR signaling proteins and indeed opsins such as the seven transmembrane arrangement, Schiff base lysine, counterion tyrosine, amino terminal glycosylation site, and disulfide but have a very odd replacement of the G-protein binding site DRY with (invariantly conserved) VCC.
This motif must be an ancient derived feature that followed the gene duplication event with neuropsin since the much older DRY could not plausibly have re-evolved in neuropsin from VCC. Newropsins very likely link covalently with a retinal and interact with light according to some lambda max type action spectrum. The VCC motif has been conserved over billions of years of branch length so cannot reflect simple loss of DRY functionality; however its signaling capabilities if any are most unclear.
position ...................................................................................................1.........1.........1.........1.........1.........1.........1.........1........1 position .........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2.........3.........4.........5.........6.........7........7 position 12345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789 excMemCy eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMccccccccccccccccccccMMMMMMMMMMMMMMMMMMMMMeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMcccccccccccccccccccccMMMMMMMMMMMMMMMMMMMMMeeeeeeee keyResid ...GLC.................................................................................................diS..cIon.................DRY?.............................................. NEWR2_galG MDPSFANS-TFQSKITEAADIVVGTCYMVFGICSLCGNSILLYISYKKKHLLKPAEYFIINLAISDLAMTLTLYPLAVTSSLSHRWLYGKHICLFYAFCGLFFGICSLSTLTLLSVVCCLKICFPAYGNRFRRKHGQILIACAWTYAAIFACSPLAHWGEYGEEPY NEWR2_anoC MESYFANT-TFHSKITEAADVIVGVFYIVFGICSFCGNSILLYVSYKKKNLLKPAEYFMINLAISDLGMTLTLYPLAVTSSLAHRWLFGQQVCLFYAFCGVFFGVCSLTTLTLLSIVCCLKICFPVYGNRFRPGHGWILIACAWVYAAIFAFSPLAHWGEYGAEPY NEWR2_xenT MGNKSDAS-AFYSSISETDDIVLGVLYSVFGLLSLSGNSMLLLVAYRKRSILKPAEFFIVNLSISDLGMTGTLFPLAIPSLFAHRWLFDKVTCNYYAFCGMLFGLCSLTNLTVLSSVCCLKVCYPAYGNKFSTAHSRILLLGIWAYAGLFATAPLADWGKYGPEPY NEWR2_danR MGNVSKTA-LFMSTISRQHDILMGSLYSVFFVLSLLGNGMLLFVAYRKRSSLKPAEFFVVNLSVSDLGMTLSLFPLAIPSALAHRWLFGEITCLCYAVCGVLFGLCSLTNLTALSSVCCLKVCFPNYGNKFSSSHACVMVIGVWCYASVFAVGPLVHWGSFGPEPY NEWR2_pimP MGNVSETA-LFVSTISRQHDILMGSLYSVFCVLSLLGNGMLLFVAYRKRSSLKPAEFFVINLSVSDLGMTLSLFPLAIPSALAHRWLFGEVVCLCYAVCGVLFGLCSLTNLTALSSVCCLKVCCPNYGNKFSSNHACVMVIGVWCYASVFAVGPLIRWGSFAPEPY NEWR2_tetN MGNASDTSDAFNSKISKEHDFLIGSIYSVFCVLSLMGNCILLLVAHHKRSTLKPAEFFIVNLSISDLGMTLTLFPLAIPSSFSHRWLFGEIACQLYATCGVLFGLCSLTNLTVLSSVCCLKVCLPNLGSKFSSSHARLLVAGVWGYASVFAVGPLVQWGHYGPEPY NEWR2_takR MGNASEASDIFLSKISKEHDILIGSIYSVFGLLSLAGNCILLLVAYHKRSMLKPAEFFIINLSISDLGMTLTLFPLAIPSSFSHRWLFGEITCQLYAMCGVLFGLCSLTNLTALSLVCCLKVCFPNHGSRFSSSHARLLVVGVWCYASVFAVGPLVQWGHYGPEPY NEWR2_gasA MGNASDTSAVFASTISKERDILMGSLYSVFGVLSLVGNCILLLVAYHKRSTLKPAEFFIINLSISDLGMTLSLFPLAIPSAFKHRWLFGELTCQLYAMCGVLFGLCSLTNLTALSFVCCLKVCFPNHGNRFSSSHARLLVVAVWGYASVFAVGPLARWGRYSPEPY NEWR2_oryL MGNVSDTSSLFASSISREHDILMGSLYSVFGLLSLSGNSMLLLVAYRKRSILKPAEFFIVNLSISDLGMTGTLFPLAIPSLFAHRWLFGEITCQLYAMCGVLFGLSSLTNLTALSLVCCLKVCFPNHGNKFSFSHARLLVAGVWCYASVFAVGPLARWGRYSAEPY NEWR2_calM GILSLVGNSVLLFVAYRKRQILKPAEYFVANLAVSDISMTVTLLPLAISSNFSHRWLFVSKPCMYYGFCSMLFGICSLTNLTVLSTVCCMKVCFPAYMSVVMIV-MFLLAWSPYSIVCLWASFGNPKLIPPAMAII NEUR_homSa MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTIIGILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGISVVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSYGVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYVPEPF NEUR_canFa MALNHTARPQDERLPHYLREGDPFASKLSWEADLVAGFYLTIIGILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAICDLGISVVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSYGVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPF NEUR_musMu MALNHTALPQDERLPHYLRDEDPFASKLSWEADLVAGFYLTIIGILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGISVVGKPFTIISCFCHRWVFGWFGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSYGVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPF NEUR_loxAf MTLNHTAPPQDDRLPQYLQDGDPFTSKLSWEADLVAGFYLTIIGILSTFGNGYVLYMSCRRKKKLRPAEIMTINLAVCDLGISVVGKPFVIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSYGVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPF NEUR_monDo MALNHSVSPQDDYIPHYLRDGDPFASKLSWEADLVAGFYLTIIGVLSTLGNGYVIYMSSKRKKKLRPAEIMTVNLAVCDLGISVVGKPFTIISCFSHRWVFGWVGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLSYGTWLKRHHAFICLALIWAYATFWATVPFAGVGSYAPEPF NEUR_ornAn MTNYSAPQLGDYLPHYLREGDPFVSKLSWEADLVAGVYLVIIGVLSTLGNGYVIYMSSRRKKKLRPAEIMTVNLAVCDLGISVVGKPFTIVSCFCHRWVFGWMGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLSYGTWLKRHHAYICLAIIWAYASFWATMPLVGLGNYAPEPF NEUR_calMi MTAFDNSTALYSGYWLHDSLHGDPFVSKLSWEADIISACYLIVTGLLSTLGNGYVIYLSITQKRKLKPPEILITNLAISDFGMSVGGQPFLIISCFSHRWIFGWVGCRWHGWAGFFFGCGSLITMTVVSLDRYLKICHLQYGSWLQRRHVFMSLAFIWFYAAFWATMPLVGWGNYAPEPF NEUR_galGa MASDCNSSSQEEYLPHYMQQEDPFASKLSREADIIAGFYLTVIGILSTLGNGYVIFMSSKRKKKLRPAEIMTVNLAVCDLGISVVGKPFSIISFFSHRWIFGWMGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLAYGTWLKRHHAFICLALIWAYATFWATVPFAGVGSYAPEPF NEUR_xenTr MAGNSSYREESGYIPHYERDSDPFASKLSREADIFAGVYLMAIGILSTLGNGYVIYMACSRKKKLRPAEIMTINLAVCDLGISVTGKPFAIVSCFSHRWVFGWNACRWYGWAGFFFGCGSLITLTVVSLDRYLKICHLRYGTWLKRRHAFIALAVIWAYATLWATLPLVGVGNYAPEPF NEUR_danRe MENETSISSGYIPHYLLRGDPFASKLSKEADIVAAFYILVIGILSATGNGYVMYMTFKRKTKLKPPEIMTLNLAIFDFGISVSGKPFFIVSSFSHRWLFGWQGCRYYGWAGFFFGCGSLITMTIVSFDRYLKICHLRYGTWLKRHHAFLSVVFIWAYAAFWATMPVVGWGNYAPEPF NEUR_takRu MENDTSIPSGYVPHYLLRGDPFASKLSKEADIVAAFYILVIGVLSATGNGYVIYQTIKRKTKLKPPEFMTLNLAVFDFGISVTGKPFFIVSSFSHRWLFGWQGCRYYGWAGFFFGCGSLITMTIVSLDRYLKICHLRYGTWFKRHHAFLCLVFTWLYAAFWATMPVVGWGNYAPEPF NEUR_tetNi MENETWTHSSYVPHYLLRGDPFASRLSKEADIVAALYICIIGLMSATGNGYVLYMTFKRKTKLKPPELMTLNLAIFDFGISVTGKPFFIVSSLSHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRYGAWLKRHHAFLCLASVWAYAAFWATMPLVGWGSYAPEPF NEUR_gasAc MDNETRSHPSYVPHYLLRGDPFASRLSKEADIVAAFYIFIIGVMSATGNGYVLYMTFKRKTKLKPPELMTVNLAIFDFGISVTGKPFFIVSSLSHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRYGTWLKRHHAFVCLALVWAYAAFWATMPLVGWGSYAPEPF NEUR_oryLa MENETWTHPSYIPHYLLRGDPFASRLSKEADIIAAFYICIIGIMSATGNGYVIYMTIKRKSKLKPPELMTVNLAVFDFGISVTGKPFFVVSSFAHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRYGTWLKRQHAFLCLVFVWMYAAFWATMPLVGWGNYAPEPF NEUR_pimPr MENTSWPHSSYVPHYLLRGDPFASRLSKEADIVAAFYILIIGIMSATGNGYVIYMTIKRKSKLKPPELMTVNLAVFDFGISVTGKPFFVVSSFSHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRYGTWLKRQHIFLCLVFVWIYAAFWATMPLVGWGSYAPEPF NEUR_anoCa MEQGQNISSQDDNQQEEDPFASKLSVEADIVAGVYLLVIGILSTLGNGYVIYMSTQRKKKLKPAEIMTVNLAVCDLGISVVGKPFSIIAFFSHRWIFGWSGCRWYGWAGFFFGIGSLITMTAVSLDRYFKICHLSYGTWLKRHHVFICLGIIWSYAAFWATIPFAGFGNYAPEPF position 1.........1.........2.........2.........2.........2.........2.........2.........2.........2.........2.........2.........3........3..........3.........3.........3.........3.....3 position 8.........9.........0.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0........1..........2.........3.........4.........5.....6 position 012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456 excMemCy eeeeeeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMccccccccccccccccccccccccccccccccccMMMMMMMMMMMMMMMMMMMMMeeeeeeeeeeeeeeeMMMMMMMMMMMMMMMMMMMMMcccccccccccccccccccccccccccccccccccccccccccccc* keyResid .diS................................................................................................................K............................................................ NEWR2_galG GTACCIDWQSTNVDVMSMSYTVVLFVLCFILPCGVIVTSYSLILVTVKESRKAVEQHVSGPTRINNVQTITAKLSIAVCIGFFAAWSPYAIIAMWAAFGSIDKIPPLAFAIPAVFAKSSTLYNPIIHLLLKPNFRSNIAKDFTVIQQLCVR---CCFCVKELQ--TYRSTFNTGLRTFKG NEWR2_anoC GTACCIDWRISNMKKTAMSYTTALFVFCYIIPCGIIITSYTLILITVKDSRKAVEQHALGPTRMSSVHTITAKLSIAVCIGFFVAWSPYAIIAMWAAFGSIDMIPPLAFAVPAVFAKSSTLYNPAMYLFLKPNFRSTIAKDLTVLHRLCLK---SCFCPRGMQNCSYRSALEAPLKSFKG NEWR2_xenT GTACCLDWEASYRERKALSYTISLFVFCYLIPSSLIFISYTLIFVTVKGARRAVQQHLSPQAKGSSIHSLIIKLSIAVCIGFLIAWTPYAIVAMMAAFGDPTKIPSLVFALAAAFAKSSTIYNPVVYLLLKPNFLNVVTKDLTLFQTMCAV---VCGWCR-----TPAVKTPCPHKDLKT NEWR2_danR GTACCINWYTPSHDALAMSYIISLFIFCYVVPCTIIILSYTFILVTVRGSQQAVQQHVSPQTKVTNAHALIVKLSVAVCIGFLTAWSPYAIVAMWAAFSANEQVPPTAFALAAIMAKSSTIYNPMVYLLFKPNFRKSLSQDTQMFRHRICLSHSKASPSPGMKDQERQSSQQCNNKDGSI NEWR2_pimP GTACCINWYIPSHDALAMSYIISLFIFCYVVPCTIIILSYTFILLRVRGSRQAVQKHVSPKTKETNAHTLIVKLSVAVCIGFVTAWSPYAVVAMWAAFSANEPVPPTAFALAAILAKSSTIYNPMVYLLFKPNFRKILSQDTQNIRHRMCVSHSKASPTPEIK---AQSSQQC--KDATI NEWR2_tetN GTACCINWQAPNHELSSLSYIVCLFLFCYVLPCAIIILSYTCILMTVRGSRQAIQQHVSPQTKTANAHALIVKLSVAVCIGFLGAWSPYAVVAMWASFGDATWVPPDAFAIAAILAKSSTIYNPLVYLLCKPNFRECLYKDTSTLRQRIY----RGSPLSGPRDRSGGVTQR--HKDLSV NEWR2_takR GTACCIDWRAPNHELSSLSYIVCLFFFCYVLPCATIILSYTCILMTVRGSRQAIQQHVSPQTKTANAHSLIVKLSVAVCIGFLGAWSPYAIVAMWAAFGDATWVPPDAFAIAAILAKSSTIYNPVVYLLCKPNFRECLYKDTSTLRQRIY----RGSPQSEPRERFGGTSQR--HKDLSI NEWR2_gasA GTACCIDWHAPNHELAALSYIVCLFVFCYALPCATIFLSYTFILLTVRGSRQAVQQHVSPQTKTTNTHALIVKLSVAVCIGFLGAWTPYAVVAIWAAFGDATLVPPDAFALAAMFAKSSTIYNPVVYLLCKPNFRACLYRDTTLLRQRIY----RGSPRSEPKAHFGSTSQR--NKDMSV NEWR2_calM APLFAKSSTFYNPCIYVISYTMTVIAVNFVVPLSVMFFCYYNV NEWR2_oryL GTACCIDWHAPNHELWALSYILCLFIFCYALPCTIIFLSYAFILLTVRGSRQAVQQHVSPQTKTTNAHTLIVKLSVAVCIGFLGAWTPYAVIAMWAAFGDATQVPPTAFALAAVFAKSSTIYNPMVYLLCKPNFRECLCRDTSLLRHMIY----RGSP--QPQERFGSDSRR--NKDITA NEUR_homSa GTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKA-TKKKSLEGFRLHTVT-TVRKSSAVLEIHEEV NEUR_calJa GTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKA-TKKKSLEDFRLHTVT-TVRKSSAVLEIHEEV NEUR_canFa GTSCTLDWWLAQASLGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGRLKA-TKKKSLEDFRLNTVT-TVRKSSAVLEIHQEV NEUR_musMu GTSCTLDWWLAQASGGGQVFILSILFFCLLLPTAVIVFSYAKIIAKVKSSSKEVAHFDSRIHSSHVLEVKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYRFACCQAGGLRG-TKKKSLEDFRLHTVT-TVRKSSAVLEIHQEV NEUR_loxAf GTSCTLDWWLAQASVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEIAHFDSRIHSSHMLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLRA-TKKKSLEGFRLHTVT-TVKKSSAVLEVHQEV NEUR_monDo GTSCTLDWWLAQASVAGQAFVLSILFFCLLFPTAVIVFSYVKIILKVKSSTKEVAHYDTRIQNSHILEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGQPDSIPVQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCQSGGQKA-AKKESLRTYRLHTVT-TVRRSSAVLEIHQEV NEUR_ornAn GTSCTLDWWLAQASVAGQAFILNILFFCLLLPTAVIVFSYVKIIAKVKSSTKEVAHFDSRIQNSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGQPDSIPIQFSVVPTLLAKSAAMYNPIIYQVIDCRISCCRLGGPKT-GKKESLKNSRSHSMS-TIRKPSAVSGPHQEV NEUR_calMi GTSCTLDWWLARVSVSGLIFVLTILFFCLLLPIIIIVFSYIKIIAKVKSSAKEVAHFDSRIQNHHSLEMNLTK NEUR_galGa GTSCTLDWWLAQASVAGQAFVLSILFFCLLFPTAVIVFSYVKIILKVKSSTKEVAHYDTRIQNSHILEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGQPDSVPIQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCRSGGPKTLQKKSSLKESRMYTIS-SHRDSAALSGTQLEV NEUR_xenTr GTTCTLDWWLAQASVKGQIFVLSMLFFCLLFPTMVIVFSYAKIIAKVKSSAKEVAHFDTRNQNNHTLEIKLTKVAMLICAGFLIAWFPYAVVSVWSAFGQPDSIPIELSVVPTMMAKSASMYNPIIYQVIDCKPACCKK------DKSLQNTTSRVYTIS-TFRKSTTSAR NEUR_danRe GTSCTLDWWLTQASVSGQSFVMCMLFFCLIFPTVIIVFSYVMIIFKVKSSAKEVSHFDTRNKNNHSLEMKLTKVAMLICAGFLIAWIPYAVVSVMSAFGEPDSVPIPVSVVPTLLAKSSAMYNPIIYQVIDCKKKCVKSCCFQAWRKKKPSKTSRFYTISGSIKQR-PGDEASIEI NEUR_takRu GTSCTLDWWLAQASVSGQSFVMCMLIFCLVLPTGVIVFSYVMIILQVKSSAQEVSHFDTQNKNKHHLEMKLTKVAMLICAGFLIAWIPYAVVSVVSAFGDPDSVPISISVVPTLLAKSSAMYNPIIYQVIDCKKNCAKLSCFQAWSKRKHYKTSRFYSISASMKKR-PANEVPTEI NEUR_tetNi GTSCTLDWWLAQASVSGQSFVMAILFFCLILPTGIIVFSYVMIIFKVKSSAKEISHFDARIRNSHDLEIKLTKVAMLICAGFLIAWIPYAVVSVISAFGEPDSVPIPVSVIPTLLAKSSAMYNPIIYQVVDVKTSCTNFSCCKALKERIHFRKSRLYTISGSLRDPLPPKEAHIEM NEUR_gasAc GTACTLDWWLAQASVSGQSFVMAILFFCLVLPTGIIVFSYIMIIFKVKSSAKEISHFDARIKNSHSLEIKLTKVAMLICAGFLIAWIPYAVVSVVSAFGEPDSVPIPVSVIPTLLAKSSAMYNPIIYQVADLKTSCTSSSCCKALKERVLFRKARLYTISGSLRDTLPPKEAHIEM NEUR_oryLa GTSCTLDWWLAQASVSGQSFVVAILFFCLVLPAGIIVFSYVMIIFKVKSSAKEISNFDARIKNSHNLEIKLTKVAMLICAGFLIAWIPYAVVSVVSAFGEPDSVPISVSVIPTLLAKSSAMYNPIIYQVLDLKNSCMKSSCFKGLKKPRHFRKSRFYTISGSVKDNTTAKEAQIEM NEUR_pimPr GTSCTLDWWLAQASVSGQSFVMSILFFCLVLPAGIIVFSYVMIICKVKSSSKEVSSFDARIKNSHTLEIKLTKVAMLICAGFLIAWIPYAVVSVVSAFGEPDSIPIPVSVIPTLLAKSSAMYNPIIYQLVDLKNSC-STCCAKVIRKRTHFRNSRFYTISGSLKDTAPAKEAHIEI NEUR_anoCa GTSCTLDWWLAQGSVAGQAFILNILFFCLVLPTAVIMFCYVKIIAKVQSSTKEVAHYDTRIQNQHVLEMKLTKVAMLICAGFMFAWIPYAVVSVWSAFGRPDSVPIKVSVIPTLLAKSAAMYNPVIYQVIDCKSACCRPGNLQPLQKKNSR
Curated Set of 51 deuterostome neuropsins
>NEUR_homSap OPN5 1_6 43 0 1 2_6 40 2 1 3_6 57 2 1 4_6 112 2 0 5_6 81 0 2 6_6 19 1 0 0 MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEGFR 2 1 LHTVTTVRKSSAVLEIHEEV* 0 >NEUR_panTro 0 MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKETKKKSLEGFR 2 1 LHTVTTVRKSSAVLEIHEEV* 0 >NEUR_gorGor 0 MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 12 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 YASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIPSSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGKPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEGFQ 2 1 LHTVTTVRKSSAVLEIHEEv* 0 >NEUR_ponPyg 0 MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGSPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEGFR 2 1 LHTVTTVRKSSAVLEIHEEV* 0 >NEUR_nomLeu 0 MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEGFR 2 1 LHTVTSVRKSSAVLEIHEEv* 0 >NEUR_macMul 0 MALNHTALPQDERLPHYLRDGDPFASKLSWEADIVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEGFR 2 1 LHTVTTVRKSSAVLEIHEEV* 0 >NEUR_papHam 0 MALNHTALPQDERLPHYLRDGDPFASKLSWEADIVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEGFR 2 1 LHTVTTVRKSSAVLEIHEEv* 0 >NEUR_calJac 0 MALNHTSLPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEDFR 2 1 LHTVTTVRKSSAVLEIHEEV* 0 >NEUR_tarSyr 0 MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1 2 VIGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKGVAHFDSRIHTSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEDFR 2 1 LHTVTTVRKSSAVLEIHEEv* 0 >NEUR_otoGar 0 MALNHTALPQDELRPHYLRDGDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTVNLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 WTMMPLVGLEDYVPEPFTSCTLDWWLAQSLGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHGSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKTTKKKSLEDFR 2 1 LHTVTTVRKSSAVLEIHQEV* 0 >NEUR_micMur 0 MALNHTVLPQDERLPHYLRDGDPFASKLSWEADLVAGFYLIII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1 2 1 2 GVWLKRKHAYICLALIWAYVSFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEIAHFDSRIHSSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDHR 2 1 LHTVTAVRKSSAVLEIHQEv* 0 >NEUR_tupBel 0 MALNHTALPQDESLPHYLRDGDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEIAHFDSRIHSSHVLEMKLTK 0 0 2 1 LHTVTTVRKSSAVLEIHQEV* 0 >NEUR_musMus 0 MALNHTALPQDERLPHYLRDEDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWFGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASGGGQVFILSILFFCLLLPTAVIVFSYAKIIAKVKSSSKEVAHFDSRIHSSHVLEVKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYRFACCQAGGLRGTKKKSLEDFR 2 1 LHTVTTVRKSSAVLEIHQEV* 0 >NEUR_ratNor 0 MALNHTALPQDERLPHYLRDEDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWFGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 gVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASGGGQVFILSILFFCLLLPTAVIVFSYAKIIAKVKSSSKEVAHFDSRIHSSHVLEVKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPNSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYRFACCQTGGLRATKKKSLEDFR 2 1 LHTVTAVRKSSAVLEIHPEv* 0 >NEUR_speTri 0 MALNHTALPQDEHLPHYLRDEDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAFICLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQVFINILFFCLLLPTAVIEFSYVKIIAKVKSSSEEVAHFDSRIHSSHV 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPSLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEDFR 2 1 LHTVTAVRKSSAVVEIHQEv* 0 >NEUR_dipOrd 0 MAFNHTAGTQGQGLPHYLPEEDPFTSKLSWEADIVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASLAGQVFILNILFFCLLLPTSVIVFSYVKIIAKVKSSSKEVAHFDSRIPSSHVLEMKLTK 0 0 2 1 * 0 >NEUR_cavPor 0 MALNHTAPPQNEHLPRYLQDEDPFVSKLSWEADLVAGFYLTII 1 2 GILSTVGNGYVLYMSSRRKKKLRPAEIMTINLAICDLGIS 1 2 vVGKPFTIISCFRHRWVFGWIGCRWYGWAGFFFGCGSLITMTVVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAAIWAYVSFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASAGGQIFILHILFFCLLLPTAMIVFSYVKIIAKVKSSSKEIAHFDSRIHSSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDSRFACCQNAGLKATKKKSLEDFR 2 1 LHTVTTDRKSAVLEIHQEV* 0 >NEUR_oryCun 0 MALNHTALPQDEHLPHYLREGDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRRHAYICLALIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFSCCRTSGLKATKKKSLEDFR 2 1 LHTVTTVRKSSAVLEIHQEv* 0 >NEUR_ochPri 0 MALNDTALPQDEHLPHYFRDGDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRLYGWADFFFGCGSLITMTAVSLDRYLK 1 2 GVWLKRRHAYICLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHGSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFSCCRTGGLKQTKKKSLEDFR 2 1 LHTVTTVRKSSAVLEIHQEv* 0 >NEUR_canFam 0 MALNHTARPQDERLPHYLREGDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAICDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASLGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGRLKATKKKSLEDFR 2 1 LNTVTTVRKSSAVLEIhQEV* 0 >NEUR_felCat 0 MALNHTAPPQDERLPHYLREGDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAE 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEDFR 2 1 LHTVTTVRKSSAVLEIHQEv* 0 >NEUR_bosTau 0 MALNHTAPPPDERRPPYLRDGDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTVNLAICDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GIWLKRKHAYICLAVIWAYAAFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEVKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEDFR 2 1 LHTVTTVRKSSAVLEVHQEv* 0 >NEUR_turTru 0 1 2 GILSTFGNGYVLYMSSRRKKKLKPAEIMTINLAICDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIPSSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAVYNPIIYQVIDYKFACCQTGGLKATKKKSLEDFR 2 1 LHTVTTVRKSSAVLEIHQEv* 0 >NEUR_susScr 0 MALNHTAPPPDERRPHYLREGDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTVNLAICDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAVIWAYAAFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0 0 VAMLICAGFLVAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEDFR 2 1 LHTVTTVRKSSAVLEIRQEV* 0 >NEUR_vicVic 0 MALNHTAPPPDERRPRHLRDGdPFASKLSWEADLVAGFYLTII 1 2 GILSTLGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWMFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEDFR 2 1 LHAVTTVRKSSAVLEIHQEV* 0 >NEUR_equCab 0 MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAICDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHGSHVLEVKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEDFR 2 1 LHTVTTVRKSSAVLEIHQEV* 0 >NEUR_myoLuc 0 MALNHTALPQDEGLPHYLQDGDPFASKLSWEADLVAGFYLTII 1 2 GILSTVGNGYVLYMSSRRKKKLRPAEIMTVNLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHTYICLAFIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQATVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKKVAHFDSRIH 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSSAMYNPIIYQVIDYKLACCQTGGLRATKKKSLENFR 2 1 LHTVTTVRKSSAVLEIHQEv* 0 >NEUR_pteVam 0 MALNHTVLPQDEHLPHYVRDGDPFASKLSWEADLVAGFYLTII 1 2 GILSTVGNGYVLYMSSRRKKKLRPAEIMTINLAICDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHGSHMLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTSGLRATKKKSLEDFR 2 1 LHTITTVREASAVLEIHQEV* 0 >NEUR_sorAra 0 MALNHTALPQDENLPHYLRDGDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWMGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLVVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPNSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYRFACCQSGGLRATKKKSLDDFr 2 1 LHTVTTVRESSAVLEIHQEV* 0 >NEUR_eriEur 0 MSLNQTALPQDEGLPHYLRDGDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 gVWLKRKHAYLCLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASLGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKKVAHFDSRIHSSHMLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKANKKKSLKDYR 2 1 >NEUR_loxAfr 0 MTLNHTAPPQDDRLPQYLQDGDPFTSKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSCRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFVIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEIAHFDSRIHSSHMLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLRATKKKSLEGFR 2 1 LHTVTTVKKSSAVLEVHQEv* 0 >NEUR_proCap 0 MTLNHTVLPEDDRLSHYLRDGDPFTSKLSWEADLVAGFYLTVI 1 2 GILSTCGNGYVLYMSYRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFIIISSFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLSY 1 2 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSVPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCRTRGLRATKEKSLEGVR 2 1 LHTVTTVRKSSAVLEIHQEv* 0 >NEUR_echTel 0 MALNHTAPPQDNSLPHYLRDGDPFVSKLSWEADLGAGFYLIII 1 2 GILSTFGNGYVLYMSYRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFSHRWVFGWTGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEIAHFDSRIHSSHMLEMKLTK 0 0 2 1 LHTITTVRKSSAVLEIHQEV* 0 >NEUR_dasNov 0 MALNHTALPQDDRLPHYLRDGDPFASKLSWEADLVAGFYLTII 1 2 gILSTFGNGYVLYMSSKRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLRATKKKSLEDFR 2 1 LHTVTTVRESSAVLEVHQEV* 0 >NEUR_choHof 0 MALNHTGLPQDDSLPHYFRDGDPFASKLSWEADLVAGFYLIII 1 2 GILSTFGNGYVLYMSSRRRKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAVIWAYASFWTTMPLLGLGDYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHMLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCRTGGLRATKKKSFEGFR 2 1 LHTVTTVRKSSAVLEIHQEv* 0 >NEUR_monDom 0 MALNHSVSPQDDYIPHYLRDGDPFASKLSWEADLVAGFYLTII 1 2 GVLSTLGNGYVIYMSSKRKKKLRPAEIMTVNLAVCDLGIS 1 2 VVGKPFTIISCFSHRWVFGWVGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLSY 1 2 GTWLKRHHAFICLALIWAYATFWATVPFAGVGSYAPEPFGTSCTLDWWLAQASVAGQAFVLSILFFCLLFPTAVIVFSYVKIILKVKSSTKEVAHYDTRIQNSHILEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGQPDSIPVQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCQSGGQKAAKKESLRTYR 2 1 LHTVTTVRRSSAVLEIHQEv* 0 >NEUR_macEug 0 1 2 GVLSTLGNGYVIYMSSKRKKKLRPAEIMTVNLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWVGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLSy 1 2 GTWLKRHHAYICLVIIWAYATFWATMPLAGLGNYAPEPFGTSCTLDWWLAQASVTGQTFILNILFFCLLLPTAVIVFSYVKIIAKVKSSTKEVAHFDSRIQSSHVLEMKLTK 0 0 2 1 RHTVSTIRKSSSVSETYQEV* 0 >NEUR_ornAna 0 MTNYSAPQLGDYLPHYLREGDPFVSKLSWEADLVAGVYLVII 1 2 GVLSTLGNGYVIYMSSRRKKKLRPAEIMTVNLAVCDLGIS 1 2 VVGKPFTIVSCFCHRWVFGWMGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLSY 1 2 GTWLKRHHAYICLAIIWAYASFWATMPLVGLGNYAPEPFGTSCTLDWWLAQASVAGQAFILNILFFCLLLPTAVIVFSYVKIIAKVKSSTKEVAHFDSRIQNSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGQPDSIPIQFSVVPTLLAKSAAMYNPIIYQVIDCRISCCRLGGPKTGKKESLKNSR 2 1 SHSMSTIRKPSAVSGPHQEV* 0 >NEUR_galGal 0 MASDCNSSSQEEYLPHYMQQEDPFASKLSREADIIAGFYLTVI 1 2 GILSTLGNGYVIFMSSKRKKKLRPAEIMTVNLAVCDLGIS 1 2 VVGKPFSIISFFSHRWIFGWMGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLAY 1 2 GTWLKRHHAFICLALIWAYATFWATVPFAGVGSYAPEPFGTSCTLDWWLAQASVAGQAFVLSILFFCLLFPTAVIVFSYVKIILKVKSSTKEVAHYDTRIQNSHILEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGQPDSVPIQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCRSGGPKTLQKKSSLKESR 2 1 MYTISSHRDSAALSGTQLEV* 0 >NEUR_taeGut 0 MASEYNNSSQEEYIPHYLQEEDPFASKLSREADIIAGFYLTII 1 2 GILSTLGNGYVIFMSSKRKKKLRPAEIMTVNLAVCDLGIS 1 2 VVGKPFSIISFFSHRWMFGWIGCCWYGWAGFFFGCGSLITMTAVSLDRYLKICHLSY 1 2 GTWLKRHHAFICLAIIWAYAMFWATVPFAGVGSYAPEPFGTSCTLDWWLAQASVAGQVFVLSILFFCLLLPTAVIVFSYVKIILKVKSSTKEVAHYDTRIQNSHILEMKLTK 0 0 VAMLICAGFLLAWIPYAVVSVWSAFGRPDSVPIQFSVVPTLLAKSAAMYNPIIYQVIECRLACCRPGG 2 1 >NEUR_anoCar 0 MEQGQNISSQDDNQQEEDPFASKLSVEADIVAGVYLLVI 1 2 GILSTLGNGYVIYMSTQRKKKLKPAEIMTVNLAVCDLGIS 1 2 VVGKPFSIIAFFSHRWIFGWSGCRWYGWAGFFFGIGSLITMTAVSLDRYFKICHLSY 1 2 GTWLKRHHVFICLGIIWSYAAFWATIPFAGFGNYAPEPFGTSCTLDWWLAQGSVAGQAFILNILFFCLVLPTAVIMFCYVKIIAKVQSSTKEVAHYDTRIQNQHVLEMKLTK 0 0 VAMLICAGFMFAWIPYAVVSVWSAFGRPDSVPIKVSVIPTLLAKSAAMYNPVIYQVIDCKSACCRPGNLQPLQKKNSR 2 1 >NEUR_xenTro 0 MAGNSSYREESGYIPHYERDSDPFASKLSREADIFAGVYLMAI 1 2 GILSTLGNGYVIYMACSRKKKLRPAEIMTINLAVCDLGIS 1 2 VTGKPFAIVSCFSHRWVFGWNACRWYGWAGFFFGCGSLITLTVVSLDRYLKICHLRY 1 2 GTWLKRRHAFIALAVIWAYATLWATLPLVGVGNYAPEPFGTTCTLDWWLAQASVKGQIFVLSMLFFCLLFPTMVIVFSYAKIIAKVKSSAKEVAHFDTRNQNNHTLEIKLTK 0 0 VAMLICAGFLIAWFPYAVVSVWSAFGQPDSIPIELSVVPTMMAKSASMYNPIIYQVIDCKPACCKKDKSLQNTTSR 2 1 VYTISTFRKSTTSAR* 0 >NEUR_danRer 0 MENETSISSGYIPHYLLRGDPFASKLSKEADIVAAFYILVI 1 2 GILSATGNGYVMYMTFKRKTKLKPPEIMTLNLAIFDFGIS 1 2 VSGKPFFIVSSFSHRWLFGWQGCRYYGWAGFFFGCGSLITMTIVSFDRYLKICHLRY 1 2 gTWLKRHHAFLSVVFIWAYAAFWATMPVVGWGNYAPEPFGTSCTLDWWLTQASVSGQSFVMCMLFFCLIFPTVIIVFSYVMIIFKVKSSAKEVSHFDTRNKNNHSLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVMSAFGEPDSVPIPVSVVPTLLAKSSAMYNPIIYQVIDCKKKCVKSCCFQAWRKKKPSKTSR 2 1 FYTISGSIKQRPGDEASIEI* 0 >NEUR_takRub 0 MENDTSIPSGYVPHYLLRGDPFASKLSKEADIVAAFYILVI 1 2 GVLSATGNGYVIYQTIKRKTKLKPPEFMTLNLAVFDFGIS 1 2 VTGKPFFIVSSFSHRWLFGWQGCRYYGWAGFFFGCGSLITMTIVSLDRYLKICHLRY 1 2 GTWFKRHHAFLCLVFTWLYAAFWATMPVVGWGNYAPEPFGTSCTLDWWLAQASVSGQSFVMCMLIFCLVLPTGVIVFSYVMIiLQVKSSAQEVSHFDTQNKNKHHLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVVSAFGDPDSVPISISVVPTLLAKSSAMYNPIIYQVIDCKKNCAKLSCFQAWSKRKHYKTSR 2 1 FYSISASMKKRPANEVPTEI* 0 >NEUR_tetNig 0 MENETWTHSSYVPHYLLRGDPFASRLSKEADIVAALYICII 1 2 gLMSATGNGYVLYMTFKRKTKLKPPELMTLNLAIFDFGIS 1 2 VTGKPFFIVSSLSHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRY 1 2 GAWLKRHHAFLCLASVWAYAAFWATMPLVGWGSYAPEPFGTSCTLDWWLAQASVSGQSFVMAILFFCLILPTGIIVFSYVMIIFKVKSSAKEISHFDARIRNSHDLEIKLTK 0 0 VAMLICAGFLIAWIPYAVVSVISAFGEPDSVPIPVSVIPTLLAKSSAMYNPIIYQVVDVKTSCTNFSCCKALKERIHFRKSR 2 1 LYTISGSLRDPLPPKEAHIEM* 0 >NEUR_gasAcu 0 MDNETRSHPSYVPHYLLRGDPFASRLSKEADIVAAFYIFII 1 2 GVMSATGNGYVLYMTFKRKTKLKPPELMTVNLAIFDFGIS 1 2 VTGKPFFIVSSLSHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRY 1 2 GTWLKRHHAFVCLALVWAYAAFWATMPLVGWGSYAPEPFGTACTLDWWLAQASVSGQSFVMAILFFCLVLPTGIIVFSYIMIIFKVKSSAKEISHFDARIKNSHSLEIKLTK 0 0 VAMLICAGFLIAWIPYAVVSVVSAFGEPDSVPIPVSVIPTLLAKSSAMYNPIIYQVADLKTSCTSSSCCKALKERVLFRKAr 2 1 LYTISGSLRDTLPPKEAHIEM* 0 >NEUR_oryLat 0 MENETWTHPSYIPHYLLRGDPFASRLSKEADIIAAFYICII 1 2 gIMSATGNGYVIYMTIKRKSKLKPPELMTVNLAVFDFGIS 1 2 VTGKPFFVVSSFAHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRY 1 2 GTWLKRQHAFLCLVFVWMYAAFWATMPLVGWGNYAPEPFGTSCTLDWWLAQASVSGQSFVVAILFFCLVLPAGIIVFSYVMIIFKVKSSAKEISNFDARIKNSHNLEIKLTK 0 0 VAMLICAGFLIAWIPYAVVSVVSAFGEPDSVPISVSVIPTLLAKSSAMYNPIIYQVLDLKNSCMKSSCFKGLKKPRHFRKSR 2 1 FYTISGSVKDNTTAKEAQIEM* 0 >NEUR_pimPro 0 MENTSWPHSSYVPHYLLRGDPFASRLSKEADIVAAFYILII 1 2 GIMSATGNGYVIYMTIKRKSKLKPPELMTVNLAVFDFGIS 1 2 VTGKPFFVVSSFSHRWLFGWEGCRFYGWAGFFFGCGSLITMTVVSLDRYLKICHLRY 1 2 GTWLKRQHIFLCLVFVWIYAAFWATMPLVGWGSYAPEPFGTSCTLDWWLAQASVSGQSFVMSILFFCLVLPAGIIVFSYVMIICKVKSSSKEVSSFDARIKNSHTLEIKLTK 0 0 VAMLICAGFLIAWIPYAVVSVVSAFGEPDSIPIPVSVIPTLLAKSSAMYNPIIYQLVDLKNSCSTCCAKVIRKRTHFRNSr 2 1 FYTISGSLKDTAPAKEAHIEI* 0 >NEUR_calMil 0 MTAFDNSTALYSGYWLHDSLHGDPFVSKLSWEADIISACYLIVT 1 2 GLLSTLGNGYVIYLSITQKRKLKPPEILITNLAISDFGMS 1 2 VGGQPFLIISCFSHRWIFGWVGCRWHGWAGFFFGCGSLITMTVVSLDRYLKICHLQY 1 2 GSWLQRRHVFMSLAFIWFYAAFWATMPLVGWGNYAPEPFGTSCTLDWWLARVSVSGLIFVLTILFFCLLLPIIIIVFSYIKIIAKVKSSAKEVAHFDSRIQNHHSLEMNLTK 0 0 2 1 * 0 >NEUR1_braFlo from cDNA and genome chrUn ++ 176419943 176431046 11104 FE548698 0 MATTPADRLDGLTPAGRGATTAETHADDFASKLSREADIVIGVYLILI 1 2 GTGAILGNGRVLWLSYRCRARLRPVEMFVVSLAAADVGLSLVGHPFSAASSLMGRWSFGSAGCTW 1 2 YGFVVFFLGIASIATMALMSIMRFMIVHKRY 1 2 GQYPSRRASCVLVAAAWLYGLFWACAPLA 1 2 GWSQYHPEPYGLSCSVDWGGFSRGAGGSSFIICMLLFCTAVPVVVMVTSYAAIFALYRQAQKGVVLNLQVNATFGGKRQRTER 0 IALAVCGGFLLAWLPYAVVGLWASVAGVDAVPLALASAAPLFAKSNSLWNPIIYLGMNERFR 2 >NEUR2_braFlo from traces and genome chrUn ++ 187375671 187384042 8372 nearly identical chrUn ++ 32271780 32281075 9296 0 MATTPGLPLDGLAPTGRGVTAADTLDDDFASKLSREADIVIGVYLLLI 1 2 GTGSILGNGRVLWLSYRNWAKLRPVELFVVSLAVTDVGISVFGYPFAASSSLLGRWSFGSAGCTW 1 2 YGFTGFFFGLTSIANMALMSIMRFMIVYKGY 1 2 GPYPSRRATSGLIAAAWLYGLFWACAPLA 1 2 GWSQYHVEPFGLSCTVDWGSFSRDAGGMSFIICLLVFCVAIPVTAIMASYVAISAIYRQAKKSIAGHLQDNSAMCKKRNKLE 0 0 MALAVCGGFLLAWLPYAVVGLWSAVAGVDAVPLALASAAPLFAKSSSLWNPIIYLGMNDRFR 2 >NEUR_strPur Strongylocentrotus purpuratus XM_001197837 CX694910 CX690664 0 MDVNAKWWTNETLRTRDQFSDDHYTSVLSYEGDIWAGVYLMFI 1 2 SLIAFIGNISVIVISLRKREKLKPIDLLTINLAIADFLICVVSYPLPMISAFRHR 0 0 WSFGKFGCVWYGFTSFLFAVGSMATLMVIALLRYAKLCRENV 1 2 DQYQSRPFVIKVIVAIWGFAFFTTAPPLFGWS 2 1 SYVPEPYHLSCTIDFADTSPSGLSYTYFTTIVVFFMPLMIIVLCYVAIARKMIHHNRRINVGHNAGRMLLEIRLLK 0 0 TACMITMAYTISWTPYAVIAMWVTYIPVNQIPDAFRILPAFCAKTSSVYNPIIYCIFNKSFRQDLSSLICCCACQCYTITINLDINSHAQQQFRRIEERR DEVGTYKRRPLMICSNPFAWSRDFHETWRQRRIRGIHRNCRNNVRVENINVNFRRDTDMVELNAPTPAEIHRPELNTASTRSGARTKSMATHLPALEEVPSG APQCSALLHNTPIPRSLQGTPLPYQPQPSTSDLHDEFLNPSVVSRNMCVIVVKPNIEEELSTD* 0
Curated Set of 51 vertebrate newropsins
>NEWR_galGal GenBank 3 garbage 5'UTR translated exons -B4GALT6 -NEWR2_galGal -KIAA1012 0 MDPSFANSTFQSKITEAADIVVGTCYMVF 1 2 GICSLCGNSILLYISYKKKHLLKPAEYFIINLAISDLAMTLTLYPLAVTSSLSHR 2 1 WLYGKHICLFYAFCGLFFGICSLSTLTLLSVVCCLKICFPAY 1 2 GNRFRRKHGQILIACAWTYAAIFACSPLAHWGEYGEEPYGTACCIDWQSTNVDVMSMSYTVVLFVLCFILPCGVIVTSYSLILVTVKESRKAVEQHVSGPTRINNVQTITAK 0 0 LSIAVCIGFFAAWSPYAIIAMWAAFGSIDKIPPLAFAIPAVFAKSSTLYNPIIHLLLKPNFRSNIAKDFTVIQQLCVRCCFCVKELQTYRSTFNTGLRTFKGKNESSCNALPIMEG CSYFPSEKGSHTFECFKSYPNCFQERLSTMGCHLQDCESLENDLQVEVTQGSRNSMKVVEQEEKSTELDNLEITLEAVPVSCTFTDL* 0 >NEWR_anoCar Anolis carolinensis 0 MESYFANTTFHSKITEAADVIVGVFYIVF 1 2 GICSFCGNSILLYVSYKKKNLLKPAEYFMINLAISDLGMTLTLYPLAVTSSLAHR 2 1 WLFGQQVCLFYAFCGVFFGVCSLTTLTLLSIVCCLKICFPVY 1 1 GNRFRPGHGWILIACAWVYAAIFAFSPLAHWGEYGAEPYGTACCIDWRISNMKKTAMSYTTALFVFCYIIPCGIIITSYTLILITVKDSRKAVEQHALGPTRMSSVHTITAK 0 0 LSIAVCIGFFVAWSPYAIIAMWAAFGSIDMIPPLAFAVPAVFAKSSTLYNPAMYLFLKPNFRSTIAKDLTVLHRLCLKSCFCPRGMQNCSYRSALEAPLKSFKGRNESSSNSVQIVGGCS YFPCEKCHDPFECFKNYPKCCQGRLNVMDHTPRESISVENNMQSKTKHASEKYIKVVIRGEKNTDIDNLEITLEHIPTDIKFANL* 0 >NEWR_xenTro abundant transcripts 0 MGNKSDASAFYSSISETDDIVLGVLYSVF 1 2 GLLSLSGNSMLLLVAYRKRSILKPAEFFIVNLSISDLGMTGTLFPLAIPSLFAHR 2 1 WLFDKVTCNYYAFCGMLFGLCSLTNLTVLSSVCCLKVCYPAY 1 2 GNKFSTAHSRILLLGIWAYAGLFATAPLADWGKYGPEPYGTACCLDWEASYRERKALSYTISLFVFCYLIPSSLIFISYTLIFVTVKGARRAVQQHLSPQAKGSSIHSLIIK 0 0 LSIAVCIGFLIAWTPYAIVAMMAAFGDPTKIPSLVFALAAAFAKSSTIYNPVVYLLLKPNFLNVVTKDLTLFQTMCAVVCGWCRTPAVKTPCPHKD LKTTSKPPSSFKKSQGVCRNCVDTFECFRNYPRCCSVGNVDAAQPMAASLVRIPPANGAPQQTVQLVVSSSRTRSGVETVEVSTEAPMSDFIKDFI* 0 >NEWR_danRer Danio rerio acquired new intron 0 MGNVSKTALFMSTISRQHDILMGSLYSVF 1 2 FVLSLLGNGMLLFVAYRKRSSLKPAEFFVVNLSVSDLGMTLSLFPLAIPSALAHR 2 1 WLFGEITCLCYAVCGVLFGLCSLTNLTALSSVCCLKVCFPNY 1 2 GNKFSSSHACVMVIGVWCYASVFAVGPLVHWGSFGPEPYGTACCINW 2 1 YTPSHDALAMSYIISLFIFCYVVPCTIIILSYTFILVTVRGSQQAVQQHVSPQTKVTNAHALIVK 0 0 LSVAVCIGFLTAWSPYAIVAMWAAFSANEQVPPTAFALAAIMAKSSTIYNPMVYLLFKPNFRKSLSQDTQMFRHRICLSHSKASPSPGMKDQERQS SQQCNNKDGSISTPFSSGQAESYGACHVYAEAGPHYQQISRQITARVLEGSVQSEIPVKQLTEKMQNDLL* 0 >NEWR_tetNig Tetraodon nigroviridis gene mix 0 MGNASDTSDAFNSKISKEHDFLIGSIYSVF 1 2 CVLSLMGNCILLLVAHHKRSTLKPAEFFIVNLSISDLGMTLTLFPLAIPSSFSHR 2 1 WLFGEIACQLYATCGVLFGLCSLTNLTVLSSVCCLKVCLPNL 1 2 GSKFSSSHARLLVAGVWGYASVFAVGPLVQWGHYGPEPYGTACCINWQAPNHELSSLSYIVCLFLFCYVLPCAIIILSYTCILMTVRGSRQAIQQHVSPQTKTANAHALIVK 0 0 LSVAVCIGFLGAWSPYAVVAMWASFGDATWVPPDAFAIAAILAKSSTIYNPLVYLLCKPNFRECLYKDTSTLRQRIYRGSPLSGPRDRSGGVTQRHKDLSVSTR LSNGQQDSYGTCLHCAEDAELGHVTGSRRTACILTGSTFTEVTLSQLSATPADLL* 0 >NEWR_takRub Fugu rubripes 0 MGNASEASDIFLSKISKEHDILIGSIYSVF 1 2 GLLSLAGNCILLLVAYHKRSMLKPAEFFIINLSISDLGMTLTLFPLAIPSSFSHR 2 1 WLFGEITCQLYAMCGVLFGLCSLTNLTALSLVCCLKVCFPNH 1 2 GSRFSSSHARLLVVGVWCYASVFAVGPLVQWGHYGPEPYGTACCIDWRAPNHELSSLSYIVCLFFFCYVLPCATIILSYTCILMTVRGSRQAIQQHVSPQTKTANAHSLIVK 0 0 LSVAVCIGFLGAWSPYAIVAMWAAFGDATWVPPDAFAIAAILAKSSTIYNPVVYLLCKPNFRECLYKDTSTLRQRIYRGSPQSEPRERFGGTSQRHKDLSISTR LSNGQQDSYGTCLHCADDAERGHVTTSQRTACILTGSTFTEVTVGQLSAAPADLL* >NEWR_gasAcu Gasterosteus aculeatus 0 MGNASDTSAVFASTISKERDILMGSLYSVF 1 2 GVLSLVGNCILLLVAYHKRSTLKPAEFFIINLSISDLGMTLSLFPLAIPSAFKHR 2 1 WLFGELTCQLYAMCGVLFGLCSLTNLTALSFVCCLKVCFPNH 1 2 GNRFSSSHARLLVVAVWGYASVFAVGPLARWGRYSPEPYGTACCIDWHAPNHELAALSYIVCLFVFCYALPCATIFLSYTFILLTVRGSRQAVQQHVSPQTKTTNTHALIVK 0 0 LSVAVCIGFLGAWTPYAVVAIWAAFGDATLVPPDAFALAAMFAKSSTIYNPVVYLLCKPNFRACLYRDTTLLRQRIYRGSPRSEPKAHFGSTSQRNKDMSVSVRSSNGQQDSYGACTENA APCHVMTPQRTACILTESTNREVTVSRLADKPQADFL* >NEWR_oryLat Oryzias latipes 0 MGNVSDTSSLFASSISREHDILMGSLYSVF 1 2 GLLSLSGNSMLLLVAYRKRSILKPAEFFIVNLSISDLGMTGTLFPLAIPSLFAHR 2 1 WLFGEITCQLYAMCGVLFGLSSLTNLTALSLVCCLKVCFPNH 1 2 GNKFSFSHARLLVAGVWCYASVFAVGPLARWGRYSAEPYGTACCIDWHAPNHELWALSYILCLFIFCYALPCTIIFLSYAFILLTVRGSRQAVQQHVSPQTKTTNAHTLIVK 0 0 LSVAVCIGFLGAWTPYAVIAMWAAFGDATQVPPTAFALAAVFAKSSTIYNPMVYLLCKPNFRECLCRDTSLLRHMIYRGSPQPQERFGSDSRRNKDITASTRFSNGQQESYGACLNCTEN TGLCQLASPQNTACILTGSTYAEVTVQQLVDKQQPDFL* 0 >NEWR_pimPro Pimephales promelas 0 MGNVSETALFVSTISRQHDILMGSLYSVF 1 2 CVLSLLGNGMLLFVAYRKRSSLKPAEFFVINLSVSDLGMTLSLFPLAIPSALAHR 2 1 WLFGEVVCLCYAVCGVLFGLCSLTNLTALSSVCCLKVCCPNY 1 2 GNKFSSNHACVMVIGVWCYASVFAVGPLIRWGSFAPEPYGTACCINWYIPSHDALAMSYIISLFIFCYVVPCTIIILSYTFILLRVRGSRQAVQKHVSPKTKETNAHTLIVK 0 0 LSVAVCIGFVTAWSPYAVVAMWAAFSANEPVPPTAFALAAILAKSSTIYNPMVYLLFKPNFRKILSQDTQNIRHRMCVSHSKASPTPEIK-AQSSQQCKDATISTPFSSGQAESYGTCHIYAEAEPHFQQISPQRTVRILEGIIQSEISVRHMTDRMQNDLL* 0 >NEWR_calMil frag 0 1 2 GILSLVGNSVLLFVAYRKRQILKPAEYFVANLAVSDISMTVTLLPLAISSNFSHR 2 1 WLFVSKpCMYYGFCSMLFGICSLTNLTVLSTVCCMKVCFPAY 1 2 0 0 MSVVMIVMFLLAWSPYSIVCLWASFGNPKLIPPAMAIIAPLFAKSSTFYNPCIYVISYTMTVIAVNFVVPLSVMFFCYYNV