Opsin evolution: Difference between revisions
Tomemerald (talk | contribs) |
Line 3,412: | Line 3,412: | ||
=== GPCR outgroup sequences ===
It is sometimes convenient to have a close-in outgroup to opsins selected from the roughly 100,000 GPCR receptors available at GenBank. The set below of 29 non-opsin GPCRs serves satisfactorily as proxy for an exhaustive GPCR compilation. It was constructed by taking best-blast in turn of each human opsin against all other human GPCR then collating the lists, winnowing out repeated entries and too-recent gene family expansions. TACR2 (tachykinin receptor) and SSTR1 (somatostatin receptor) are the best single representatives. These are usefully supplemented with non-opsin GPCR having determined 3D structures, some nearest neighbors in recent GRAFS classification trees, and two astonishingly close pre-opsins (called UROPS1 and UROPS2 here) from Trichoplax, an early diverging eukaryote lacking opsins.
Conserved outgroup residues shared with opsins evidently describe commonalities needed for generic GPCR structure and signaling but not specifically for photobiology. Departures from the norm in certain opsin classes might indicate they are no longer signaling or are signaling constitutively. Opsins also have conserved diagnostic residues and even [[Opsin_evolution:_informative_indels#Indels_in_the_TM4-EC2-TM5_region|regions]] not found in any GPCR.
The aligned sequences below have been trimmed from full length at both ends to the earliest indications of conservation. This alignable region begins at the [[Opsin_evolution:_informative_indels#Alignment_in_TM2_region:_420_curated_opsins|GN region]] of TM1 and extends to the [[Opsin_evolution:_Cytoplasmic_face#The_carboxy-terminal_tail_and_VxPx_motif|FR motif]] just beyond TM7. Highly conserved residues are shown in red and less conserved residues in blue. The Schiff base lysine (position -16 relative to the FR end of TM7) does not occur outside of opsins. Note that many conserved patches in these GPCR are very similar to those of opsins, implying those residues have no utility in distinguishing opsins from non-opsins.
[[Image:OpsinOrigins.jpg|left]]
The [[Opsin_evolution:_orgins_of_opsins#Introduction:_the_origin_of_opsins|origin of opsins]] is not fully understood. Opsins are not the 'original' GPCR (which are trackable, barely, to yeast) even for the 'rhodopsin' group R (or even its Ralpha subgroup) within the [http://molpharm.aspetjournals.org/content/67/5/1414.long GRAFS classification] but rather form a specialized set that arose later as the rhodopsin gene class (which contains the AMIN cluster [adrenalin, serotonin, dopamine, and histamine receptors], MECA branch [peptide and lipid binding receptors] in addition to opsins) underwent significant expansions.
This expansion of the Ralpha class had largely taken place in the last common metazoan ancestor shared with Monosiga and Trichoplax (which do not contain opsins), implying the ancestral metazoan lacked them as well. The orphan receptors GPR21 and GPR52 form the immediate outgroup (within the 800 human GPCR) in an oft-cited [http://molpharm.aspetjournals.org/content/63/6/1256.long#F4 2003 study]. These have isoleucine at K296; their ligands are still not known as of Dec 2009. Conservation is high throughout deuterostomes; blast matches are restricted within opsins to molluscan melanopsins, suggesting Gq signaling.
The melatonin receptor MTNR1A emerges as a close relative to opsins. Curiously it plays a key role in circadian rhythms and so needs to coordinate with opsin photosensors. N-acetyl-5-methoxytryptamine, the ligand, bears no obvious relationship to cis-retinal however and K296 is lacking, making an immediate parent gene relationship problematic.
Another clue to the origin of opsins might be provided by examining [http://www.ncbi.nlm.nih.gov/pubmed/15302402,15302402 GPCR intron positions and phases] to see if they are shared with ancient introns in opsins. Many non-olfactory GPCR with sequence similarity to opsins have no introns or just one, suggesting the genes duplicated by retroprocessing, perhaps acquiring an intron at an unrelated position later. UROPS2 has an intron but it does not seem to correspond to one in any opsin. Cnidarian opsins are either intronless (Nematostella) or undetermined (just known from processed transcripts).
<br clear="all">
Closeness in the GRAFS tree does not fully accord with closeness of blastp hit and relatedness of diagnostic regions, suggesting (unsurprisingly) that its topology is slightly wrong at some internal nodes. On average rank in blastp top scores (or by average 5 best blast expectation values), as representatives of all opsin classes are aligned with the GPCR below, the highest scoring ones by far are the Trichoplax opsins followed by various peptide receptors:
Rank  Gene          Exp   Exons  Receptor      Ligand
4.2   UROPS2_triAd  e-29  2      orphan        histamine? (HRH2: best human non-opsin blast match)
5.4   UROPS1_triAd  e-28  1      orphan        peptide? (SSTR1: best human non-opsin blast match)
5.6   SSTR1_homSap  e-26  1      somatostatin  peptide
7.2   TACR2_homSap  e-25  5      tachykinin    peptide
Line 3,435: | Line 3,439: | ||
8.9   MTNR1A_homSa  e-23  2      melatonin     N-acetyl-5-methoxytryptamine
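The "average rank in blastp top scores" criterion used for the table can be sketched in a few lines of Python. This is only an illustration of the averaging step, not the procedure actually used to build the table: the query names and hit lists are invented placeholders, and a subject missing from a query's hit list is assumed to rank one place past the end of that list.

```python
from collections import defaultdict

def average_rank(hit_lists):
    """Rank GPCR subjects by their mean position across blastp hit lists.

    hit_lists maps each opsin query to its subject names, best hit first.
    A subject absent from a list is scored one past the end of that list
    (an assumption made for this sketch).
    """
    subjects = {s for hits in hit_lists.values() for s in hits}
    totals = defaultdict(float)
    for hits in hit_lists.values():
        position = {s: i + 1 for i, s in enumerate(hits)}
        missing = len(hits) + 1
        for s in subjects:
            totals[s] += position.get(s, missing)
    n = len(hit_lists)
    # Lowest average rank first, i.e. closest outgroup candidate first.
    return sorted((totals[s] / n, s) for s in subjects)

# Placeholder hit lists for three hypothetical opsin queries.
example_hits = {
    "query_rod": ["UROPS2", "SSTR1", "TACR2"],
    "query_melanopsin": ["UROPS2", "TACR2", "SSTR1"],
    "query_peropsin": ["SSTR1", "UROPS2", "TACR2"],
}
ranking = average_rank(example_hits)
for mean_rank, gene in ranking:
    print(f"{mean_rank:.1f} {gene}")
```

Averaging over queries from every opsin class keeps a single atypical query from dominating the choice of outgroup.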
The biological literature contains various scattered claims about 'opsins' in species such as Chlamydomonas (chlamyopsin Z48968), not to mention bacterial 'rhodopsins'. These do not have the seven transmembrane helices in the same arrangement as GPCR nor significant sequence homology and may represent independent evolution of photobiology (just as bat and butterfly wings represent independent origins of flying).
Trichoplax has two very curious 7-transmembrane proteins that emerge as its [http://www.ncbi.nlm.nih.gov//genomes/geblast.cgi?bact=off&gi=6008 best genomic match] to opsin queries. While lacking K296 for a Schiff base, their best back-blast to all of GenBank returns almost entirely opsins (rather than nesting within other GPCR receptors). While Trichoplax is 600+ million years removed from the common ancestor with eumetazoa, these genes could still offer clues about the immediate GPCR ancestor to opsins.
These Trichoplax genes retain [[Opsin_evolution:_informative_indels#Indels_in_EC2_region|uncanny similarities]] to opsins in otherwise rapidly changing regions. These two genes are not plausibly derived from an opsin expansion with subsequent loss of K296 because Trichoplax and other early diverging lineages lack opsins. Perhaps these genes should be considered opsins in spite of lacking K296. Recall here that Schiff base formation dramatically redshifts the absorption spectrum, yet non-covalently bound retinal still has significant absorption at optical wavelengths which might be further tuned by Trichoplax binding pocket residues.
Conversely, several cnidarian species exhibit far too many K296-type GPCR for their apparent photoreceptive needs and accompanying lack of overt photobiological anatomical specializations. These may represent divergent gene duplications of valid opsins that have evolved into some other type of GPCR; alternatively they could represent a lineage of pre-opsin GPCR that developed K296 but never acquired an opsinlike light-sensing role nor served as parental gene to bona fide opsins.
Together the Trichoplax pre-opsins lacking K296 and putative cnidarian non-opsins possessing K296 push the opsin-defining envelope to its limits. Given the immense time span separating contemporary genes from ancestral, we can anticipate their computed nesting arrangement within the opsin gene tree relative to a close-in GPCR outgroup with known non-retinal ligands will lack convincing statistical support at the critical nodes. The best way forward is additional sequencing of cubomedusae, ctenophores and sponges because these seem to contain conventional opsins that clarify the positions of the outliers.
[[Image:OpsinOutgroup.jpg]]
Revision as of 10:42, 27 December 2009
Introduction to Opsin Evolution
The Curated Set of Metazoan Opsins
Below is the largest set of phylogenetically dispersed hand-curated opsin sequences ever assembled. The sequences are organized into true orthology classes using coding indels, intron location and phase, synteny of flanking genes, diagnostic residues, blast clustering, and experimental characterization when available. The reference set of opsin sequences includes selected GenBank entries but mostly new opsins extracted from dozens of newly completed genome projects.
The set serves as a gene family classifier ... just uBlast an unknown candidate opsin against the full database below and look for consistent labelling of the top hits from the Opsin Classifier. Then validate the apparent orthology class using three independent classes of rare genomic events (indels, introns, signature residues) by including the new sequence in a full-width Multalinment, being sure to include a couple dozen non-opsin GPCRs as controls. Then check for residual bilateral flanking gene adjacency when a genome assembly is available. The whole process takes only 15 seconds per query!
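The first step of this workflow, looking for consistent labelling among the top blast hits, amounts to a majority vote. A minimal Python sketch follows, assuming the orthology-class label of each hit has already been parsed from its fasta header; the label strings and the 80% agreement threshold are illustrative assumptions, not settings of the Opsin Classifier.

```python
from collections import Counter

def classify(top_hit_labels, min_fraction=0.8):
    """Return the consensus orthology class of the top hits, or None.

    top_hit_labels: class labels of the best blast hits, best first.
    min_fraction: share of hits that must agree (hypothetical threshold).
    """
    if not top_hit_labels:
        return None
    label, count = Counter(top_hit_labels).most_common(1)[0]
    if count / len(top_hit_labels) >= min_fraction:
        return label
    return None  # inconsistent labelling: inspect indels, introns, residues

consistent = classify(["MEL1"] * 9 + ["ENCEPH"])
ambiguous = classify(["MEL1", "ENCEPH", "PER", "RGR"])
print(consistent, ambiguous)
```

A `None` result is the signal to fall back on the rare genomic events and synteny checks described above before assigning any class.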
The set of reference sequences is deliberately not exhaustive -- an exhaustive set would seriously oversample popular experimental clades such as vertebrates and insects. When a given clade has many similar sequences available, those in species with genome assemblies are chosen to represent the group, for example anole is preferred to gecko, and (rightly or wrongly) any experimental results from gecko are transferred over. This avoids the uninformative clutter of near-identical sequences. However if a clade reflects a very deep divergence especially important to opsin evolution (such as lamprey or amphioxus), all available sequences are needed to maximally break up long branches.
Sequences not available from GenBank were culled from trace archives, tiled contigs, and genome assemblies, typically by uBlastx against the growing set of reference sequences, as described in the annotation section. The level of error in the curated sequences is very low, declines with time as anomalies are revisited and repaired, but never reaches zero because of problems inherent to experimental data, imperfect sequencing reads, less-than-complete assemblies, and sequence manipulation.
Rare genomic changes can supplement (and even displace) traditional maximum likelihood and Bayesian inference in resolving polytomic divergence nodes in gene and species tree topologies. Rare genomic changes applicable to opsins include coding indels (deletions and insertions), intron placement (position and phase comparison), synteny (gene order along the chromosome), and gene copy number change (gene gain from retropositional, tandem, segmental, and whole genome duplications; gene loss from pseudogenization or deletion). Results from these methods must be evaluated for their susceptibility to homoplasy (misleading recurrent independent events that mimic a single event) and incomplete penetration at the population level at the time of speciation (lineage sorting).
Opsins are more informatively stored as proteins since nucleotide sequences are far too diverged at metazoan evolutionary distances and do not explicitly manifest residue properties that experience selection. These protein sequences are parsed into constituent exons using genomic information when available -- fortunately splicing mechanisms are exceedingly well conserved across animal phyla. When not directly available (eg the opsin originated as a cDNA in a species lacking a genome project), exon breaks have been inferred from the phylogenetically closest neighbors via heterologous alignment (but not in insects where intron turnover is too high). As an example, lamprey opsins from Geotria australis and Lethenteron japonicum can work as blastn queries to locate orthologs within the Petromyzon marinus genome project (which consists solely of 19 million traces as of mid-December 2007). Numbers flanking exons, namely 0 1 2, show the phasing of each intron, eg 12 means an overhang of 1 bp at the 3' end of an exon with the fragmentary codon completed by a 2 bp overhang at the beginning of the next exon.
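The phase notation can be derived mechanically from coding exon lengths. The Python sketch below assumes the first exon begins at the first base of the start codon and that lengths count coding bases only; the exon lengths in the example are invented for illustration.

```python
def intron_phase_pairs(exon_lengths):
    """Return the two-digit phase label for each intron.

    exon_lengths: coding lengths in bp, in order, starting at the ATG.
    The first digit is the codon overhang at the 3' end of the upstream
    exon; the second is the overhang that completes the codon at the 5'
    end of the downstream exon (so '12', '21' or '00').
    """
    pairs = []
    coding_so_far = 0
    for length in exon_lengths[:-1]:  # no intron follows the last exon
        coding_so_far += length
        left = coding_so_far % 3
        right = (3 - left) % 3
        pairs.append(f"{left}{right}")
    return pairs

# Hypothetical four-exon gene, 360 coding bp in total.
phases = intron_phase_pairs([100, 101, 101, 58])
print(phases)
```

Comparing such labels together with the intron positions is what allows intron patterns to be matched between distantly related opsins.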
Intron position and phasing are conserved over vast evolutionary distances -- human to sponge and beyond -- with the exception of certain rogue lineages (that unfortunately include two major model organisms). Informative conservation is still available long after protein percent identity has slid into the twilight zone of uncertainty (critical in opsins because they are readily confused with generic GPCR receptors). For example, no variation whatsoever in intron pattern has occurred in any vertebrate opsin class since lamprey divergence, no events in many billions of years of branch length. Prior to that, rare events are observed, for example LWS opsins gained an extra intron of phase 12. In a protein having 333 locations with 3 possible phases at each location, convergent evolution (homoplasy) is uncommon, that is, close examination establishes opsin introns are still highly informative even when sequences have greatly diverged.
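The homoplasy argument can be made roughly quantitative. With 333 locations and 3 phases there are 999 distinguishable intron states; the Python sketch below computes the chance that at least two of several independent intron gains coincide in both location and phase. It assumes every state is equally likely, which real intron insertion certainly is not, so the numbers are only an order-of-magnitude guide.

```python
def coincidence_probability(n_gains, positions=333, phases=3):
    """Probability that at least two of n_gains independent intron gains
    land on the same (position, phase) state, assuming a uniform choice
    among positions * phases states (a simplifying assumption)."""
    states = positions * phases
    p_all_distinct = 1.0
    for i in range(n_gains):
        p_all_distinct *= (states - i) / states
    return 1.0 - p_all_distinct

for n in (2, 5, 10):
    print(n, round(coincidence_probability(n), 4))
```

Under this model two independent gains coincide about once in a thousand trials, which is why a shared intron position and phase is usually read as shared ancestry.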
Insertions and deletions (coding indels) are sufficiently uncommon in opsins that they are a potentially phylogenetically informative class of rare genomic events. We'll harvest these from a massive opsin alignment and stratify by location and phylogenetic depth. Indels are more subject to homoplasy than intron gain or loss but the risk of that varies greatly by region. Indels are less affected by lineage-specific rates than other event classes.
Syntenic relationships can have great value in determining orthology relationships, though gene order is not often conserved over great evolutionary timescales due to chromosomal rearrangements. As with introns, certain rogue lineages lose almost all this information. Tracking synteny is nomenclaturally confused because many genes are unnamed and simply numbered by annotation pipeline procedures in each genome without homological consistency. Here, HUGO-convention names are used for two genes flanking each side of a given human opsin. Strand orientation is noted relative to a fixed convention of plus strand for the human opsin. Then each genome assembly is visited to determine the extent of conservation of these flanking genes and orientation. In the event humans lack the gene, synteny is defined by the nearest diverging species, typically platypus or chicken, that has the gene. Sometimes the original synteny is only partly retained (left or right half-synteny). For deeply diverging species such as amphioxus, flanking genes can be pushed forward into 'nearby' species. This can also be done without an assembly in species with sufficiently large contigs containing the opsin. Blast clustering can be uncertain because of diminishing percent identity or even loss of these flanking genes; common chromosomal displacements such as inversions eventually disrupt most syntenic arrangements.
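The full versus half-synteny distinction can be expressed as a simple comparison of flanking gene sets. In the Python sketch below the gene names are placeholders and strand orientation is ignored, a simplification relative to the procedure described above.

```python
def synteny_status(ref_left, ref_right, other_left, other_right):
    """Classify conservation of flanking genes around an opsin locus.

    ref_left / ref_right: the two reference (human) genes on each side.
    other_left / other_right: flanking genes found in the other genome.
    Strand orientation is not checked in this sketch.
    """
    left_kept = bool(set(ref_left) & set(other_left))
    right_kept = bool(set(ref_right) & set(other_right))
    if left_kept and right_kept:
        return "full synteny"
    if left_kept:
        return "left half-synteny"
    if right_kept:
        return "right half-synteny"
    return "no synteny"

# Placeholder gene names, not real HUGO symbols.
status_full = synteny_status(["GENE_A", "GENE_B"], ["GENE_C", "GENE_D"],
                             ["GENE_B", "GENE_X"], ["GENE_C", "GENE_Y"])
status_half = synteny_status(["GENE_A", "GENE_B"], ["GENE_C", "GENE_D"],
                             ["GENE_A", "GENE_B"], ["GENE_Y", "GENE_Z"])
print(status_full, status_half)
```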
The fasta header of each sequence serves as a miniature database that conveniently collects this and other basic information, with fields showing the opsin type, genus, species and common name, accession number, best PubMed citation, indels, intron pattern, sequence length, lambda max absorption, flanking synteny, and G protein type with which it interacts (all subject to availability and work-in-progress). These fasta headers by themselves provide a quick overview of the opsin reference set collection -- simply paste into a blank document and pull lines containing '>'.
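Pulling the header lines can also be scripted. A minimal Python sketch follows; the two records are invented, and their header layout is only a guess at the style of the reference set.

```python
def fasta_headers(fasta_text):
    """Return the header of every record, without the leading '>'."""
    return [line[1:].strip()
            for line in fasta_text.splitlines()
            if line.startswith(">")]

# Invented two-record example; field layout is hypothetical.
example_fasta = """>OPSIN_A genus species (common name) accession
MNGTEGPNFYVPFSNKTGVV
>OPSIN_B genus species (common name) accession
MDALCLSSGFHEAW
"""
headers = fasta_headers(example_fasta)
for header in headers:
    print(header)
```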
Thus the usual querying at GenBank does not remotely compare to the Opsin Classifier. Those sequences -- often mis-annotated by an unattended pipeline or well-intentioned individual with no qualifications in comparative genomics -- are spread out over separate databases not accessible by any single method, with difficult-to-interpret edge creep of genomic blast matches, uncorrected frameshifts, missing stop codons and erroneous amino terminals.
As a worst-case scenario, half-baked annotation of the sea urchin genome by pipelines and casual procedures has left an awful legacy of bogus opsin gene structures at GenBank, journal alignments, and genome browsers -- often mis-classified because of an inadequate set of reference sequences and non-opsin GPCR controls, chimeric confusion in tandem duplicates, and non-consideration of intron structure, indels, and synteny. These errors may ripple downstream to errors by subsequent scientists trusting GenBank nr as classifier and lead to a whole subculture taken up with non-existent "virtual" opsins and attendant vacuous speculation on echinoderm photoreception.
Melanopsins, the unexpected rhabdomeric-class Gq-coupled opsins recently found in upper deuterostomes, illustrate some of the difficulties of accurate annotation. They can be confused homologically due to various expansions and contractions. Mammals, human through platypus, have a single melanopsin. However a common ancestor to chicken, lizard, frog, teleost fish, and possibly cartilaginous fish had a multi-gene segmental duplication with both resulting melanopsins retained (though substantially diverged). In ray-finned fish, a processed retrogene arose that may be functional in zebrafish though lost in fugu and stickleback. After whole genome duplication, zebrafish also retained two copies of the original melanopsin. Chondrichthyes also have a second copy of the primary melanopsin but synteny -- which is essential for analysis since intron placement is uninformative in duplications and sequence alignment too dependent on unknown rates -- is not available in the current contig-level assembly.
Amphioxus contains two melanopsins from an apparently independent duplication. Flanking gene order today bears no relation to any vertebrate gene order. The lamprey situation awaits assembly of its traces or targeted transcript studies. At this time, only a four-exon fragmentary melanopsin can be recovered (however with high percent identity, 80%). Possibly orthologs of this melanopsin locus could be tracked into the highly derived tunicates, acorn worm, and sea urchins. The distinctive intron pattern may even allow melanopsin antecedents to be identified in Cnidaria and Protostomia. At this point, the best blastp match to insects stands at 37% with no evident syntenic or intronic support.
While clade-specific proliferations of melanopsins -- and implied role subfunctionalizations -- confound the situation for chordates, they have little impact on the opsin classifier described here. Unknown sequences readily find their place because of the extensive phylogenetic representation of reference sequence orthology classes and the inherent distance of melanopsins from the ciliary subcollection. At that level of alignment, the melanopsins serve as outgroup to ciliary opsins and so help define motifs specific to Gt-coupled signaling and other structure/function issues.
It appears however that far too much 'lumping' has taken place in nearly all non-imaging opsins, for example encephalopsins, melanopsins, and peropsins. The taxonomic counterpart is too much lumping of species outside of mammals. In all likelihood, additional opsin orthology classes need definition and distinctive naming. These opsins were belatedly discovered through whole-genome homology searches; they may differ one from another just as much as cone and rod opsins. However, it is currently difficult to disentangle phylogenetically short-lived expansions within known families from deep-rooted parallel-evolving subfamilies concentrated (superficially) within 'secondary' photoreception systems. Genome sampling density and completion efforts are overwhelmingly concentrated in a few model species and near human.
We should not be too hasty to write off peropsins as mere auxiliary retinal isomerases that replenish cis-retinal for 'real' opsins. This reaction is not a simple isomerization (like glucose isomerase) but photoisomerization with an evolutionarily tuned and conserved visual light action spectrum. There is no reason that trans-retinal cannot be the signaling agonist with conversion to all-cis retinal the waste product. Pairing two opposing types of opsins in a single cell or nearby cells then completes a full visual cycle with interesting opportunities for sensory capabilities. Of all the so-called retinal isomerases, peropsins retain more of the diagnostic residues of ciliary opsins; possibly they function similarly but in balance with another opsin class co-expressed in the same cell. Melanopsin, surprisingly, is fully capable of self-replenishment, so perhaps the first-studied ciliary opsins are the true anomaly with their need for an auxiliary replenishment cycle in the retinal epithelium.
It's abundantly clear from distinct ancestral introns and alignment clustering that peropsins together with neuropsins and RGR opsins comprise a distinctive subclade within the overall opsin gene tree. The comparative genomics of RGR illustrates the danger of phylogenetic undersampling: with much deeper sampling, we can observe that the E/DRY motif -- conserved across all classes of opsins (indeed GPCR) and critical to maintaining non-signalling conformation -- has become GRY in all boreoeutheran mammals (it's ERY in afrotheres, lost without residual debris in marsupials, and DRY from platypus through shark and even tunicate, and DRY in neuropsins and peropsins).
In other words, after several trillion years of branch length conservation as charged amino acid, a radical amino acid substitution has taken hold -- to glycine with its tiny non-interactive side chain. Yet within this subclade of placental mammals, this glycine has been conserved without exception for over two billion years of branch length. Given the importance of this motif for maintaining the non-signalling state, this suggests a major change in functional properties of RGR opsins within boreoeutheres, a change that does not tolerate a reduced alphabet at this site. That change might be breakdown to non-signaling isomerase within that clade. It remains unclear how marsupials cope without the gene at all or how Afrotheria and Xenarthra visual cycles differ from other placental mammals.
Comparative Genomics of DRY motifs in exon 3 of RGR Opsins:
1 RWPYGSDGCQAHGFQGFVTALASICSSAAIAWGRYHHYCT 1 human
1 RWPYGSDGCQAHGFQGFVTALASICSSAAIAWGRYHHYCT 1 macaque
1 RWPYGSGGCQAHGFQGFTTALASICGSAAIAWGRYHHYCT 1 lemur
1 RWPYGSDGCKVHGFQGFATALASISGSAAIAWGRYHQYCT 1 treeshrew
1 RWPHGSEGCQVHGFQGFATALASICGSAAVAWGRYHHYCT 1 mouse
1 RWPYGSDGCQAHGFQGFATALASICGSAAIAWGRYHHYCT 1 rabbit
1 RWPYGSDGCQAHGFQGFATALASICGSAAIAWGRYHHYCT 1 pika
1 RWPYGSEGCQAHGFQGFVTALASICSSAAVAWGRYHHFCT 1 cow
1 RWPYGSDGCQAHGFQGFVTALASICSSAAIAWGRYHHYCT 1 horse
1 RWPYGSNGCQAHGFQGFVTALASICSSAAIAWGRYHHYCS 1 cat
1 RWPYGPDGCQAHGFQGFATALASICSSAALAWGRYHHYCT 1 dog
1 RWPYGSGGCQAHGFQGFAAALASICGSAAVAWGRYHHYCT 1 bat
1 RWPFGPDGCQAHGFQGFATALASICSSAAIAWGRYHHYCT 1 shrew
1 RWPYGSDGCQAHGFQGFVTALASICSCAAIAWERYHHYCT 1 elephant
1 RWPYGSDGCQAHGFQGFVMALTSICSCAAIAWERYHHYCT 1 hyrax
1 HWPYGSGGCQAHGFQGFTVALASICSCAAIAWERYHHYCT 1 tenrec
1 RWPHGSDSCQAHSFQGFATALASISSSAAIAWERYRHHCT 1 sloth
1 RWPYGSGGCQAHGFQGFVTALASISSSAAIAWERCHRHCI 1 armadillo
1 HWPYGAEGCRLHGFQGFATALASISLSAAIGWDRYLRHCS 1 platypus
1 YWPYGSDGCQIHGFHGFLTALTSISSAAAVAWDRHHQYCT 1 lizard
1 YWPYGSEGCQIHGFQGFLTALASISSSAAVAWDRYHHYCT 1 chicken
1 YWPYGSEGCQIHGFQGFVAALSSIGSCAAIAWDRYHQYCT 1 frog
1 YWPYGSDGCQTHGFQGFVTALASIHFIAAIAWDRYHQYCT 1 stickleback
1 YWPYGSDGCQTHGFQGFMTALASIHFIAAIAWDRYHQYCT 1 zfish1
1 HWPFGSEGCQLHAFQGMVSILAAISFLGAVAWDRYHQYCT 1 zfish2
1 YWPYGSEGCQTHGFQGFVTALASIHFVAAIAWDRYHQYCT 1 tetraodon
1 YWPYGSDGCQTHGFQGFVTALASIHFVAAIAWDRYHQYCT 1 fugu
1 YWPYGSEGCQTHGFHGFLTALASIHFIAAIAWDRYHQYCT 1 medaka
1 YWPYGSDGCQTHGFQGFMTALASIHFIAAIAWDRYHQYCT 1 pimephales
1 YWPYGSDGCQTHGFQGFMTALASIHFVAAIAWDRYHQYCT 1 osmerus
1 YWPYGSEGCQTHGFHGFLMALASINACAAIAWDRYHQNCS 1 elephantshark
1 EWPFGSIGCQLDAFIGMAPTFISIAGAALVAKDKYYRICK 1 tunicate 1
1 RWLFGKFGCYFHGFAGMLFGLGSIGNLTVISIDRYIITCK 1 tunicate 2
1 QWPFGDLGCQVDAFIGMAPTFISIAGAALIAKDKYYRFCK 1 tunicate 2
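The motif column of this alignment can be tabulated directly. The Python sketch below uses five of the sequences listed above; the fixed column offset (residues 33 to 35 of the 40-residue block) is an assumption that holds only because the block is gapless.

```python
from collections import Counter

# Five exon 3 segments copied from the alignment above.
ALIGNMENT = {
    "human":    "RWPYGSDGCQAHGFQGFVTALASICSSAAIAWGRYHHYCT",
    "mouse":    "RWPHGSEGCQVHGFQGFATALASICGSAAVAWGRYHHYCT",
    "elephant": "RWPYGSDGCQAHGFQGFVTALASICSCAAIAWERYHHYCT",
    "platypus": "HWPYGAEGCRLHGFQGFATALASISLSAAIGWDRYLRHCS",
    "chicken":  "YWPYGSEGCQIHGFQGFLTALASISSSAAVAWDRYHHYCT",
}

MOTIF_START = 32  # 0-based column of the E/D/G residue in this block

def motif(seq, start=MOTIF_START):
    """Return the three residues at the E/DRY position."""
    return seq[start:start + 3]

motif_by_species = {name: motif(seq) for name, seq in ALIGNMENT.items()}
motif_counts = Counter(motif_by_species.values())
for name, m in motif_by_species.items():
    print(f"{name:10s} {m}")
```

Run over the full alignment, the same tabulation shows at a glance which clades carry GRY, ERY or DRY.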
Excellent recent publications have greatly added to our understanding of the evolutionary origin of light reception capabilities, yet only with the advent of genomic sequencing have we begun to get a full grip on opsins and a few other evo-devo photoreceptor components such as PAX6. Yet critical early-diverging metazoan genomes that might retain information on ancestral characters are greatly under-represented given the branch lengths involved. No purpose is served by elaborate gene tree computations that include a single token peropsin or parietopsin among hundreds of cone opsins; if opsins are central to the endeavor yet so haphazardly annotated, what is the status of dozens of lesser but still critical photoreceptor gene families?
The question of how many times eyes independently arose is ill-posed. We can only wonder how people with no grasp whatsoever of the underlying molecular biology of photoreception could pontificate so boldly on its evolution prior to even sequencing of the first opsin in 1980. First collect comprehensive comparative genomic data sets on all the components in all the relevant clades; then ask what it could mean to homologize two contemporary systems comprised of partially overlapping sets of heterogeneous paralogs from assorted ancient gene families having long complex histories of clade-specific expansion and contraction cycles. If four arrestins serve a thousand GPCR, if even specialized photoreceptor cells express dozens of barely related 7TM signaling molecules, it's difficult to imagine what a cognate arrestin or ectopic evo-devo cassette might be. It's difficult enough to reconstruct an ancestral gene or two; reconstructing ancestral systems biology interactions lies in the future.
After reviewing topics such as ciliary opsin in protostomes, rhabdomeric opsins in deuterostomes, the rich opsin repertoires in cnidarians and probable opsins in sponges, we will consider special topics such as the origin of image-forming eyes between amphioxus and lamprey divergences, noting throughout that our notion of 'eye' is much more nuanced than earlier. The reconstruction of Urbilaterian and Urmetazoan eyes awaits additional cnidarian genomes but no new ones are currently underway. However the plethora of new arthropod and lophotrochozoan genome assemblies has opened up new avenues of research as the realization grows that fly and nematode model species are exceedingly derived, with better ancestral characters retained in other lineages.
Numerous conflicting gene trees have been published for ciliary opsins. Some methodologies have bordered on the preposterous -- thin phylogenetic coverage, dimly related outgroups such as adrenergic receptors, and naive underlying mutational models assumed for maximum likelihood despite great diversity of species and many billions of years of branch length. Nonetheless the resultant trees have only moderate conflict, suggesting that a definitive opsin gene tree is not far off. We'll do this using the multiple types of evidence discussed above.
The first point to be understood in deuterostome ciliary opsin evolution is that jawless fish such as lamprey already exhibit a full-blown set of modern rod and cone opsins whereas earlier diverging clades as represented by hemichordate, echinoderm, amphioxus and tunicate genomes totally lack them and indeed lack imaging eyes altogether, while using their rhabdomeric opsins in a very distinct signaling system for their own photoreception systems. Despite 7 sequenced opsin mRNAs in the amphioxus Branchiostoma belcheri and an initial assembly in Branchiostoma floridae providing counterparts there, no rod/cone opsin can be located there or in earlier diverging deuterostomes with genome projects (3 tunicates, 2 urchins, 1 acorn worm). These species may have larval eye spots, ocelli, pigment cells, and related photoreceptors but lack imaging eyes.
Characters in extant (living) species should never be confused with ancestral characters present at the time of divergence nodes (last common ancestors); conceivably these early diverging deuterostomes have lost opsin genes, perhaps due to a habitat shift to deep water, burrowing habitat, or nocturnal lifestyle. However the molecular evidence is quite clear that full-blown pentachromatic color vision and most other modern ciliary opsin classes first appeared during the evolutionary stem preceding lamprey divergence. These are demonstrably not 'new' genes but derived from gene duplication and divergence of still older opsins of ciliary class.
The fossil record is unsatisfactory: less than 1 bilaterian in 10,000 in Chengjiang and Burgess Shale fossils is even a candidate for deuterostomy. Low numbers of specimens and poor preservation conspire with career pressures and impact-seeking journals to produce egregiously misinterpreted data, in the view of Hou, discoverer of the Chinese lagerstaette. Myllokunmingia is the best situation with 500 specimens; however Haikouichthys as supposed stem deuterostome, Metaspriggina as supposed post-Ediacaran, and Yunnanozoan generally are problematic, valid only in the eye of some beholder.
While signs of bilaterily disposed eyes are sometimes documentable, it does not follow these were image-forming eyes. Indeed contemporary Branchiostoma and tunicate larva have an eye-spot (ocellus); the genomes contain ciliary opsins but only clustering to ENCEPH and PPIN -- still a long ways from any imaging opsin. Echinoderms and hemichordates genomes also have opsins but even further diverged. Sea urchin genome encodes at least six opsins, four of these cluster classify to rhabdomeric, ciliary and Go-type. Tube feet are apparently the photosensory organ in adult urchins.
The oldest known fossil lamprey, Priscomyzon, dates at 360 myr to the Devonian. Molecular clocks place lamprey appearance at approximately 430 myr, some 100 million years after Chengjiang and Burgess Shales fossil Lagerstatte formed. Like most soft tissues, eyes seldom leave a good fossil record, though bilateral placement might be reflected in bone orbits. Hagfish, sister group to lamprey, have imaging eyes but have not been studied; their opsins situation may be derived due to deepwater marine habitat (similarly deepwater coelocanth opsins are adapted to less scattered wavelengths, centered at420 nm).
The next-diverging chondrichtyes have inadequate data at GenBank -- only a few rhodopsin genes from skates and dogfish. This makes even fragmentary opsins from the partially sequenced elephantfish Callorhyncus milii quite valuable. Those 9 fragments and 3 from the lamprey genome are provided in the data section -- the opsin classifier tool can reliably type a fragment from a single mid-sized exon. Full length genes are preferable but these fragments serve to prove existence of various gene class at the time of a given divergence node. Further, they can validate certain rare genomic events provided the fragment happens to overlap the region of interest.
On the other hand, thousands of high-quality Cambrian arthropod fossils unmistakably show stalked paired eyes. Fossil trilobite eyes have been much studied; these are better preserved because calcite served as the lens crystallin. Imaging eyes of contemporary arthropods and lophotrochozoans are rhabdomeric, utilizing a depolarizing Gq-type receptor, phospholipase C, phosphoinositol, diacylglycerol, and transient receptor potential (TRP and TRPL) channel signaling. However, their genomes can also contain ciliary opsins, using hyperpolarizing Gt-type transducins and phosphodiesterase cGMP second-messaging (as well as Go-type gustducin ciliary opsins in other types of photoreceptors).
Vertebrates are just the opposite, having crossed over to a near-exclusively ciliary opsin-based imaging system, while retaining rhabdomeric signaling in retinal ganglion cells and elsewhere. (A very recent report demonstrates very rare human and mouse cone cells expressing exclusively melanopsin; however melanopsin cannot have given rise to ciliary cone opsins.)
It must not be thought that bilaterians invented imaging eyes, because much earlier diverging cubomedusan jellyfish such as Carybdea marsupialis have 4 eyestalks each with 6 photoreceptors of 4 types: simple eyespots, pigment cups, complex pigment cups with lenses, and camera-type eyes with a cornea, lens, and retina. This jellyfish chases, captures, and eats 'more advanced' teleost fish. This wing of cnidarians very much needs a genome project.
Cnidarian opsins are becoming available from Hydra and Nematostella genomes and targeted hybridization experiments. Hydra may express a ciliary-type opsin in ectodermal sensory nerve cells, whereas Nematostella has opsins classifying between melanopsin and encephalopsin. These distant opsins are very difficult to distinguish bioinformatically from non-opsins in the rhodopsin superfamily within GPCR, so it is exceedingly important to include controls, because a somewhat unspecialized photoreceptor cell may also express non-opsins from other sensory signaling systems that are nonetheless genetically homologous.
In summary, there is no evidence whatsoever -- and every reason to doubt from genomic analysis -- that deuterostomes had imaging eyes during the Cambrian. Despite this, a BBC series "Walking With Monsters" portrayed a school of 25 mm Haikouichthys attacking and wounding an Anomalocaris twenty times their size. It is easy to guess at a scientific advisory panel that envisions deuterostomes triumphing over protostomes. This recurrent anthropocentric fantasy is echoed in museum imagery of early mammals nimbly preying on dinosaur nests -- dioramas quietly dismantled after the Yucatan meteorite discovery.
Imaging eyes are not essential to survival; even today subterranean mammals such as the blind mole rat flourish without them. Discounting ray-finned fish numbers, a very substantial proportion of all extant animal species lack imaging eyes 525 myr after the Cambrian. Of 33 animal phyla, one-third have no specialized organ for detecting light, one-third have light-sensitive organs, and the remaining 6 have imaging eyes (Cnidaria, Mollusca, Annelida, Onychophora, Arthropoda, and Chordata). Thus 82% of animal phyla (27 of 33) have survived well over 500 myr without imaging eyes despite the supposedly unrelenting competition and predation from animals having them.
Opsin Gene and Species Trees
The phylogenetic tree below shows the presence or absence of various opsin genes in clade-representative species, as reflected in the collected reference sequences. The purpose is to time the appearance (or disappearance) of a given class of opsin gene. For example, cone and rod opsins first appeared before lamprey divergence; they are absent from urochordates, cephalochordates, and earlier deuterostomes. Note however that a given gene might appear absent because of a genome project gap, lack of experimental effort, insufficient or outdated bioinformatics, or species idiosyncrasies (i.e. it may be present in a different species of that clade). In other cases (e.g. platypus SWS1) pseudogene remnants or a syntenically proven deletion establish that the gene is definitely absent. Y means yes (present), N means no (absent). The figure needs a few fixes.
Opsin retention at 18 genetic loci over 540 million years (Jan 2009 update)

  1 RHO1   petMar calMil takRub xenTro galGal anoCar ornAna monDom bosTau homSap RHO1
  2 RHO2   geoAus calMil danRer ...... galGal anoCar ...... ...... ...... ...... RHO2
  3 SWS1   geoAus ...... danRer xenLae galGal anoCar :::::: monDom bosTau homSap SWS1
  4 SWS2   geoAus ...... gasAcu xenTro galGal anoCar ornAna ...... ...... ...... SWS2
  5 LWS    petMar calMil danRer xenTro galGal anoCar ornAna monDom bosTau homSap LWS
  6 VAOP   petMar calMil danRer xenTro galGal anoCar ...... ...... ...... ...... VAOP
  7 PIN    ------ calMil ...... xenTro galGal utaSta ...... ...... ...... ...... PIN
  8 PPIN   petMar calMil danRer xenTro ...... anoCar ...... ...... ...... ...... PPIN
  9 PARIE  petMar ------ danRer xenTro ...... anoCar ...... ...... ...... ...... PARIE
 10 ENCEPH petMar calMil danRer xenTro galGal anoCar ...... monDom :::::: homSap ENCEPH
 11 TMT    ------ calMil danRer xenTro galGal anoCar ornAna monDom ...... ...... TMT
 12 MEL1   petMar calMil danRer xenTro galGal anoCar ornAna monDom bosTau homSap MEL1
 13 MEL2   ------ calMil danRer xenLae galGal anoCar ...... ...... ...... ...... MEL2
 14 NEUR1  petMar calMil danRer xenTro galGal anoCar ornAna monDom bosTau homSap NEUR1
 15 NEUR2  ------ calMil danRer xenTro galGal anoCar ...... ...... ...... ...... NEUR2
 16 NEUR3  petMar calMil danRer xenTro galGal anoCar ...... ...... ...... ...... NEUR3
 17 PER    ------ calMil danRer xenTro galGal anoCar ornAna monDom bosTau homSap PER
 18 RGR    petMar calMil danRer xenTro galGal anoCar ornAna ...... bosTau homSap RGR
           13     15     17     17     16     18     7      8      7      8

Notes: in low-coverage genomes, missing data was sought in closely related species. Absence of data (---) is distinguished from strong evidence of absence (...). In rare cases, pseudogene debris is still detectable at the syntenic location (:::). Gene loss is a mixture of deep stem loss (mammals) and recent lineage-specific loss.
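The retention table lends itself to a quick tally. The following Python is a minimal sketch, not part of the original analysis: it transcribes the 18 rows of the table, treats any six-letter species code as presence, and counts presence per column. The clade labels in `COLUMNS` and the function names are assumptions introduced here for illustration only.

```python
# Rows transcribed from the retention table. Cell codes:
#   six-letter species code = gene present
#   "------" = no data, "......" = evidence of absence, "::::::" = pseudogene debris
TABLE = """
RHO1   petMar calMil takRub xenTro galGal anoCar ornAna monDom bosTau homSap
RHO2   geoAus calMil danRer ...... galGal anoCar ...... ...... ...... ......
SWS1   geoAus ...... danRer xenLae galGal anoCar :::::: monDom bosTau homSap
SWS2   geoAus ...... gasAcu xenTro galGal anoCar ornAna ...... ...... ......
LWS    petMar calMil danRer xenTro galGal anoCar ornAna monDom bosTau homSap
VAOP   petMar calMil danRer xenTro galGal anoCar ...... ...... ...... ......
PIN    ------ calMil ...... xenTro galGal utaSta ...... ...... ...... ......
PPIN   petMar calMil danRer xenTro ...... anoCar ...... ...... ...... ......
PARIE  petMar ------ danRer xenTro ...... anoCar ...... ...... ...... ......
ENCEPH petMar calMil danRer xenTro galGal anoCar ...... monDom :::::: homSap
TMT    ------ calMil danRer xenTro galGal anoCar ornAna monDom ...... ......
MEL1   petMar calMil danRer xenTro galGal anoCar ornAna monDom bosTau homSap
MEL2   ------ calMil danRer xenLae galGal anoCar ...... ...... ...... ......
NEUR1  petMar calMil danRer xenTro galGal anoCar ornAna monDom bosTau homSap
NEUR2  ------ calMil danRer xenTro galGal anoCar ...... ...... ...... ......
NEUR3  petMar calMil danRer xenTro galGal anoCar ...... ...... ...... ......
PER    ------ calMil danRer xenTro galGal anoCar ornAna monDom bosTau homSap
RGR    petMar calMil danRer xenTro galGal anoCar ornAna ...... bosTau homSap
"""

# Hypothetical column labels (the table itself mixes species within a column).
COLUMNS = ["lamprey", "elephantfish", "teleost", "frog", "chicken",
           "lizard", "platypus", "opossum", "cow", "human"]

STATUS = {"------": "no_data", "......": "absent", "::::::": "pseudogene"}


def parse_table(text):
    """Return {gene: [status per column]} from the whitespace-separated rows."""
    rows = {}
    for line in text.strip().splitlines():
        gene, *cells = line.split()
        rows[gene] = [STATUS.get(cell, "present") for cell in cells]
    return rows


def count_status(rows, status):
    """Count, for each column, how many loci carry the given status."""
    return [sum(1 for cells in rows.values() if cells[i] == status)
            for i in range(len(COLUMNS))]


rows = parse_table(TABLE)
present = count_status(rows, "present")
for label, n in zip(COLUMNS, present):
    print(f"{label:13s} {n:2d} of {len(rows)} loci present")
```

Tallied from the rows as transcribed here, the per-column counts appear to match the totals row except for the platypus column, which comes out at 8 rather than 7; the rows and the totals may simply date from different updates.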
The opsin gene trees below illustrate only a few of the myriad possibilities, even beginning with commonsense ordering (blast nearest neighbors). Because these gene families originated long ago and are only known from remotely related representatives in extant species with wildly different mutational mechanisms and histories, the true tree cannot be reliably inferred by maximum likelihood. Indeed no two such attempts have ever come up with the same gene tree! Instead, we'll keep this set of gene trees in view until analysis of rare genomic events is completed.
Using the Opsin Classifier
Below is the primary collection of opsin protein sequences. Here "fields" in the fasta header show gene name, genus, species, common name, heterotrimeric G protein alpha subunit used in signaling, intron structure, synteny (2 flanking genes on each side of the opsin), indel status, sequence length, lambda max, and comment field. The 230-odd sequences are now organized into deuterostome, lophotrochozoan, and ecdysozoan divisions, each further subdivided into ciliary, rhabdomeric, or neither. Even with the full set copied into the uBlast, a novel opsin candidate can be classified in 6 seconds over just a conventional DSL internet connection.
On 26 Nov 07, I added 41 new sequences, mostly arthropod rhabdomeric imaging opsins, extracted from a 2007 pancrustacean opsin paper. I used the much-studied accessions in their Table 1, ordered phylogenetically according to their Fig. 3, with subsampling to avoid too-close sequences and narrow lineage-specific expansions. This involved replacing a few defective accessions and partial sequences with comparable complete ones, favoring species with completed or planned genome projects, whose sequences can be directly intronated and their synteny determined. Lambda max values were helpfully compiled for all these opsins by the original authors.
This significantly upgrades the resolving power of the Opsin Classifier vis-a-vis these new classes of protostome opsins. It raises thorny nomenclature issues because of short-sighted historic uses such as rhodopsin or LWS for both fruitfly and human genes. These are indeed vaguely homologous in the distant pre-bilaterian GPCR past but are certainly not orthologous as implied by the same common name. The definition of orthology requires genes under comparison in extant species to descend from a single parent gene in their last common ancestor (LCA). If this LCA requirement is relaxed, the entire set of a thousand GPCR all become orthologous.
Additional ecdysozoan and lophotrochozoan opsins are needed, whatever new ones can be extracted from invertebrate genome projects; some of these will prove ciliary, and conversely some deuterostome opsins will prove rhabdomeric. Melanopsin/encephalopsin appear at the heart of the Big Switchover that took place in chordates -- their imaging opsins did not arise from gene duplication and divergence of anything we see among contemporary protostomal imaging opsins or any reconstructed ancestor.
In fact, none of the opsin genes in Urbilatera destined to become rhabdomeric imaging opsins in living arthropods (even all of protostomia) seems to have descended directly to any deuterostome. It may turn out that none of the opsin genes in Urbilatera destined to become ciliary imaging opsins in living vertebrates (even all of deuterostomia) survived in any protostome. The pool of GPCR genes was already large and signalling diversified across gene copies. However, lophotrochozoan and basal ecdysozoan ciliary opsins are still largely unexplored. A similarly 'bad' Venn diagram could hold in Ureumetazoa. Here the only two cnidarians with sequence data (Hydra and Nematostella) were not the best choices for studying opsin evolution. We may have to wait for genome sequencing of a full-featured cubomedusan.
Please do not add text or edit sequences at this time, even though genomeWiki encourages participation; consider starting a fresh page to hold your contributions. After finishing the first round of articles, everything will be revisited and reorganized at a second pass.
Opsin Dataset
The set below is sufficiently phylogenetically representative for classification purposes. In species with major genomic projects and adequate phylogenetic separation, the opsin sets are exhaustive. Seven ortholog classes to date have greatly expanded collections within specialized articles:
- LWS reference sequences from 95 species
- Encephalopsin and TMT: yet more opsins lost in mammals
- RGR opsin: abrupt change in DRY motif in boreoeutheres
- Peropsin: phyloSNPs in peropsin evolution
- Neuropsin (OPN5): comparative genomics of 52 deuterostome genes and three new paralogs NEUR2, NEUR3, NEUR4
The fasta header lines below are themselves a flatfile database. The first field (all that is retained by most web alignment tools) gives the gene name together with the genus-species code. These gene names comply with international conventions but cannot always accommodate historic terminology (which can re-utilize an already-assigned gene name or call everything rhodopsin). Here each major branch of the gene tree is assigned a distinct name. While 6 letters for genus-species provides a better mnemonic than 2 or 3, the full genus and species names plus English-language common name are provided because of data from unfamiliar species.
Next fields include the Galpha signalling partner (when experimentally known or reliably inferable from homology), syntenic gene order context (when known and relevant), length of protein, lambda-max peak absorption (sometimes calculated from tuning residues), PubMed identifier (most recent article), GenBank identifier (reviewed RefSeq if available), and finally an unstructured comment field with alternative gene names, tissue of expression, percent identity to nearest neighbor, data source if genomic, and various miscellany that enable web browser text searching (which also works for sequence snippets provided they don't cross exon boundaries).
Adherence to rigid database format currently drops off after the early fields because the information is just not available for many entries recovered from low-coverage genome projects. However the structure still suffices for routine queries such as 'how many entries exist for human opsins' or 'which protostomes have ciliary opsins'. Note the fasta header lines are easily isolated by pasting the entire database into a spreadsheet and sorting for lines beginning with '>'. The fields themselves can be separated into columns fairly well by replacing spaces with tabs.
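The same kind of query can be scripted rather than done in a spreadsheet. The sketch below is only an illustration under stated assumptions: the regular expressions are guesses at the field layout inferred from the header lines in this dataset (gene_species code, genus, species, common name in parentheses, then optional Galpha, `synt(...)`, `NNN aa`, and `NNN nm` fields), and the names `header_lines` and `parse_header` are hypothetical. Fields missing from a header come back as `None`.

```python
import re

# Assumed layout: >GENE_code Genus species (common) [Galpha] [synt(...)] [NNN aa] [NNN nm] ...
HEADER_RE = re.compile(
    r"^>(?P<gene>[^_\s]+)_(?P<code>\S+)\s+(?P<genus>\S+)\s+(?P<species>\S+)"
    r"\s+\((?P<common>[^)]*)\)\s*(?P<rest>.*)$"
)


def header_lines(text):
    """Isolate fasta header lines, i.e. those beginning with '>'."""
    return [ln for ln in text.splitlines() if ln.startswith(">")]


def parse_header(line):
    """Split one header into early structured fields plus a free-text remainder."""
    m = HEADER_RE.match(line.strip())
    if m is None:
        raise ValueError(f"not a recognizable header: {line!r}")
    rest = m.group("rest")
    galpha = re.match(r"(G[a-z]{1,2})\b", rest)      # e.g. Gt, Gq, Go
    synt = re.search(r"synt\(([^)]*)\)", rest)       # flanking genes with strand
    length = re.search(r"\b(\d+)\s+aa\b", rest)      # protein length
    lam = re.search(r"\b(\d+)\s+nm\b", rest)         # lambda max
    return {
        "gene": m.group("gene"),
        "code": m.group("code"),
        "genus": m.group("genus"),
        "species": m.group("species"),
        "common": m.group("common"),
        "galpha": galpha.group(1) if galpha else None,
        "synteny": synt.group(1).split() if synt else None,
        "length_aa": int(length.group(1)) if length else None,
        "lambda_max_nm": int(lam.group(1)) if lam else None,
        "rest": rest,
    }


# Four headers copied from the dataset; sequence lines are stand-ins.
SAMPLE = """\
>RHO1_homSap Homo sapiens (human) Gt synt(-MBD4 +IFT122 +H1FOO -PLXND1) 349 aa 497 nm 16565402 NM_000539 rod RHO ciliary
MNGTEG
>RHO1_bosTau Bos taurus (cow) Gt synt(-MBD4 +IFT122 +H1FOO -PLXND1) 349 aa 497 nm 2145276 NM_001014890 rod RHO most studied
MNGTEG
>RHO1_angAng Anguilla anguilla (european_eel)
MNGTEG
>SWS2_ornAna Ornithorhynchus anatinus (platypus) Gt synt(-IRAK1 -MECP2 - +TKTL1) 364 aa 17339011 ABN43074 cone short blue
MHKTHR
"""

parsed = [parse_header(h) for h in header_lines(SAMPLE)]
human_entries = [p for p in parsed if p["code"] == "homSap"]
print(len(human_entries), "human entries in the sample")
```

Because adherence to the format drops off in later fields, only the early fields are pulled out as columns here; everything else stays in `rest` for plain text searching.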
The protein sequences themselves are broken into their constituent exons. The numbering indicates phase or basepair overhang, a deeply conserved invariant useful in distinguishing orthology classes and gene tree divisions. All opsins in all species use standard GT-AG splice junctions. The exon structures in genomic species have been determined individually by tblastn and those of non-genomic species predicted by homology.
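The exon-and-phase layout can be read mechanically. The Python below is a minimal sketch, assuming each exon is written as "leading overhang, peptide, trailing overhang" separated by whitespace, as in the records of this dataset. The example record uses the phase pattern of RHO1 with peptides shortened for illustration; the names `Exon`, `parse_exons`, `phase_signature`, and `junctions_consistent` are hypothetical. The check that overhangs on either side of an intron sum to 0 or 3 is an interpretation of the numbering adopted here, not something the text states.

```python
from typing import List, NamedTuple, Tuple


class Exon(NamedTuple):
    start_overhang: int   # bases borrowed from the previous exon's split codon
    peptide: str          # amino acids encoded by this exon
    end_overhang: int     # bases left over at the 3' end of the exon


def parse_exons(body: str) -> List[Exon]:
    """Parse '0 PEP 1 2 PEP 2 ...' into a list of Exon records."""
    tokens = body.split()
    if len(tokens) % 3 != 0:
        raise ValueError("expected repeating (number, peptide, number) triples")
    return [Exon(int(tokens[i]), tokens[i + 1], int(tokens[i + 2]))
            for i in range(0, len(tokens), 3)]


def phase_signature(exons: List[Exon]) -> Tuple[int, ...]:
    """Intron phases in order: the trailing overhang of every exon but the last."""
    return tuple(e.end_overhang for e in exons[:-1])


def junctions_consistent(exons: List[Exon]) -> bool:
    """Assumed rule: overhangs across an intron complete a codon (sum 0 or 3)."""
    return all((a.end_overhang + b.start_overhang) % 3 == 0
               for a, b in zip(exons, exons[1:]))


def full_protein(exons: List[Exon]) -> str:
    """Concatenate exon peptides and drop the trailing stop marker '*'."""
    return "".join(e.peptide for e in exons).rstrip("*")


# Phase pattern as in the RHO1 records; peptides truncated for illustration.
record = "0 MNGTEG 1 2 GEIALW 2 1 YIPEGL 0 0 AAAQQQ 0 0 FRNCML* 0"
exons = parse_exons(record)
print(len(exons), "exons; intron phases", phase_signature(exons))
print("consistent junctions:", junctions_consistent(exons))
print(full_protein(exons))
```

Comparing `phase_signature` tuples between two entries gives a quick first test of whether they share intron structure, which is the invariant the dataset relies on when separating orthology classes.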
Almost all entries are full-length genes beginning with an initial methionine (sometimes uncertain) and ending with a stop codon. However, fragmentary genes are included if the species occupies a critical position in the phylogenetic tree and the fragment is long enough to classify reliably. Fragmentary data often takes the form of entire exons missing from genomic coverage (rather than incomplete exons). Thus the Callorhinchus genome project, currently with 0.6 exon coverage, still suffices to establish that many gene duplications important to vertebrate photoreceptive diversity had already occurred by the chondrichthyan divergence node.
>RHO1_homSap Homo sapiens (human) Gt synt(-MBD4 +IFT122 +H1FOO -PLXND1) 349 aa 497 nm 16565402 NM_000539 rod RHO ciliary 0 MNGTEGPNFYVPFSNATGVVRSPFEYPQYYLAEPWQFSMLAAYMFLLIVLGFPINFLTLYVTVQHKKLRTPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTGCNLEGFFATLG 1 2 GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLAGWSR 2 1 YIPEGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIIIFFCYGQLVFTVKE 0 0 AAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTIPAFFAKSAAIYNPVIYIMMNKQ 0 0 FRNCMLTTICCGKNPLGDDEASATVSKTETSQVAPA* 0 >RHO1_bosTau Bos taurus (cow) Gt synt(-MBD4 +IFT122 +H1FOO -PLXND1) 349 aa 497 nm 2145276 NM_001014890 rod RHO most studied 0 MNGTEGPNFYVPFSNKTGVVRSPFEAPQYYLAEPWQFSMLAAYMFLLIMLGFPINFLTLYVTVQHKKLRTPLNYILLNLAVADLFMVFGGFTTTLYTSLHGYFVFGPTGCNLEGFFATLG 1 2 GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLVGWSR 2 1 YIPEGMQCSCGIDYYTPHEETNNESFVIYMFVVHFIIPLIVIFFCYGQLVFTVKE 0 0 AAAQQQESATTQKAEKEVTRMVIIMVIAFLICWLPYAGVAFYIFTHQGSDFGPIFMTIPAFFAKTSAVYNPVIYIMMNKQ 0 0 FRNCMVTTLCCGKNPLGDDEASTTVSKTETSQVAPA* 0 >RHO1_monDom Monodelphis domesticus (opossum) Gt synt(-MBD4 +IFT122 +H1FOO -PLXND1) 349 aa rod 0 MNGTEGPNFYVPFSNKTGTVRSPFEEPQYYLADPWQFSCLAAYMFMLIVLGFPINFLTLYVTIQHKKLRTPLNYILLNLAIADLFMVFGGFTMTLYTSLHGYFVFGPTGCNLEGFFATLG 1 2 GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIIGVAFTWVMALACAFPPLIGWSR 2 1 YIPEGMQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPLIVIFFCYGQLVFTVKE 0 0 AAAQQQESATTQKAEKEVTRMVIIMVIAFLICWLPYAGVAFYIFTHQGSNFGPIFMTIPAFFAKSSSVYNPVIYIMMNKQ 0 0 FRTCMITTLCCGKNPLGDDEASATASKTETSQVAPA* 0 >RHO1_ornAna Ornithorhynchus anatinus (platypus) Gt synt(+IFT122 - -PLXND1) 354 aa ABN43074 17339011 rod 0 MNGTEGQDFYIPMSNKTGVVRSPFEYPQYYLAEPWQYSVLAAYMFMLIMLGFPINFLTLYVTIQHKKLRTPLNYILLNLAFANHFMVLGGFTTTLYTSLHGYFVFGPTGCNIEGFFATLG 1 2 GEIALWSLVVLAIERYIVVCKPMSNFRFGENHAIMGVAFTWIMALACALPPLVGWSR 2 1 YIPEGMQCSCGIDYYTLRPEVNNESFVIYMFVVHFTIPMTIIFFCYGRLVFTVKE 0 0 AAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTVPAFFAKSSAIYNPVIYIMMNKQ 0 0 FRNCMLTTICCGKNPLGDDEASATASKTEQSSVSTSQVSPA* 0 >RHO1_galGal Gallus gallus (chicken) Gt -MBD4 
synt(+IFT122 +H1FOO -PLXND1) 352 aa 1385866 NM_205490 rod RH1 0 MNGTEGQDFYVPMSNKTGVVRSPFEYPQYYLAEPWKFSALAAYMFMLILLGFPVNFLTLYVTIQHKKLRTPLNYILLNLVVADLFMVFGGFTTTMYTSMNGYFVFGVTGCYIEGFFATLG 1 2 GEIALWSLVVLAVERYVVVCKPMSNFRFGENHAIMGVAFSWIMAMACAAPPLFGWSR 2 1 YIPEGMQCSCGIDYYTLKPEINNESFVIYMFVVHFMIPLAVIFFCYGNLVCTVKE 0 0 AAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTNQGSDFGPIFMTIPAFFAKSSAIYNPVIYIVMNKQ 0 0 FRNCMITTLCCGKNPLGDEDTSAGKTETSSVSTSQVSPA* 0 >RHO1_anoCar Anolis carolinensis (lizard) Gt synt(-MBD4 +IFT122 - -PLXND1) 353 aa rod 0 MNGTEGQNFYVPMSNKTGVVRNPFEYPQYYLADPWQFSALAAYMFLLILLGFPINFLTLFVTIQHKKLRTPLNYILLNLAVANLFMVLMGFTTTMYTSMNGYFIFGTVGCNIEGFFATLG 1 2 GEMGLWSLVVLAVERYVVICKPMSNFRFGETHALIGVSCTWIMALACAGPPLLGWSR 2 1 YIPEGMQCSCGVDYYTPTPEVHNESFVIYMFLVHFVTPLTIIFFCYGRLVCTVKA 0 0 AAAQQQESATTQKAEREVTRMVVIMVISFLVCWVPYASVAFYIFTHQGSDFGPVFMTIPAFFAKSSAIYNPVIYILMNKQ 0 0 FRNCMIMTLCCGKNPLGDEDTSAGTKTETSTVSTSQVSPA* 0 >RHO1_xenTro Xenopus tropicalis (frog) Gt synt(-MBD4 +IFT122 - -PLXND1) 355 aa rod 0 MNGTEGPNFYIPMSNKTGVVRSPFDYPQYYLAEPWKYSALAAYMFLLILLGFPINFMTLYVTIQHKKLRTPLNYILLNLVFANHFMVLCGFTVTMYTSMHGYFIFGQTGCYIEGFFATLG 1 2 GEMALWSLVVLAIERYVVVCKPMANFRFGENHAIMGVVFTWIMALSCAAPPLFGWSR 2 1 YIPEGMQCSCGVDYYTLKPEVNNESFVVYMFIVHFTIPLCVIFFCYGRLLCTVKE 0 0 AAAQQQESATTQKAEKEVTRMVVMMVIFFLICWVPYAYVAFYIFTHQGSDFGPVFMTVPAFFAKSSAIYNPVIYIVLNKQ 0 0 FRNCLITTLCCGKNPFGDEEGSSAASSKTEASSVSSSQVSPA* 0 >RHO1_neoFor Neoceratodus forsteri (lungfish) Gt 355 aa 17961206 EF526299 rod 0 MNGTEGPNFYVPMTNKTGVVRSPFEYPQYYLADPWKYSALAAYMFFLILTGFPINFLTLYVTVQHKKLRTPLNYILLNLAVADLFMVFGGFTTTMYTAMNGYFVFGVVGCNLEGFFATFG 1 2 GIIALWCLVVLAIERYIVVCKPISNFRFGENHAIMGVVFTWIMALACAGPPLFGWSR 2 1 YIPEGMQCSCGIDYYTLKPEVNNESFVIYMFIVHFTIPLIIIFFCYGRLMCTVKE 0 0 AAAQQQESATTQKAEKEVTRMVYIMVISYLVCWLPYASVSFYIFTHQGSDFGPVFMTVPAFFAKTASVYNPVIYILMNKQ 0 0 FRNCMITTLCCGKNPFGDEETTSAGTSKTEASSVSSSQVSPA* 0 >RHO1_latCha Latimeria chalumnae (coelacanth) Gt 354 aa 478 nm 10339578 AAD30519 rod 0 
MNGTEGPNFYVPMSNKTGVVRNPFEYPQYYLADPWKYSALAAYMFFLILVGFPINFLTLFVTIQHKKLRTPLNYILLDLAVADLCMVFGGFFVTMYSSMNGYFVLGPTGCNIEGFFATLG 1 2 GQVALWALVVLAIERYVVVCKPMSNFRFGENHAIMGVIFTWIMALSCAVPPLFGWSR 2 1 YIPEGMQSSCGVDYYTLKPEVNNESFVIYMFVVHFTIPLIVIFFCYGRLVCTVKD 0 0 AAAQQQESATTQKAEKEVTRMVIVMVISFLVCWVPYASVAAYIFFNQGSEFGPVFMTAPSFFAKSASFYNPVIYILLNKQ 0 0 FRNCMITTLCCGKNPFGDEDATSAAGSSKTEASSVSSSSVSPA* 0 >RHO1_angAng Anguilla anguilla (european_eel) 0 MNGTEGPNFYIPMSNITGVVRSPFEYPQYYLAEPWAYTILAAYMFTLILLGFPVNFLTLYVTIEHKKLRTPLNYILLNLAVANLFMVFGGFTTTVYTSMHGYFVFGETGCNLEGYFATLG 1 2 GEISLWSLVVLAIERWVVVCKPMSNFRFGENHAIMGLAFTWIMANSCAMPPLFGWSR 2 1 YIPEGMQCSCGVDYYTLKPEVNNESFVIYMFIVHFSVPLTIISFCYGRLVCTVKE 0 0 AAAQQQESETTQRAEREVTRMVVIMVIAFLVCWVPYASVAWYIFTHQGSTFGPVFMTVPSFFAKSSAIYNPLIYICLNSQ 0 0 FRNCMITTLFCGKNPFQEEEGASTTASKTEASSVSSVSPA* 0 >RHO1_conMyr Conger myriaster (conger_eel) AB043817 0 MNGTEGPNFYIPMSNATGVVRSPFEYPQYYLAEPWAFSALSAYMFFLIIAGFPINFLTLYVTIEHKKLRTPLNYILLNLAVADLFMVFGGFTTTMYTSMHGYFVFGPTGCNIEGFFATLG 1 2 GEIALWCLVVLAIERWMVVCKPVTNFRFGESHAIMGVMVTWTMALACALPPLFGWSR 2 1 YIPEGLQCSCGIDYYTRAPGINNESFVIYMFTCHFSIPLAVISFCYGRLVCTVKE 0 0 AAAQQQESETTQRAEREVTRMVVIMVISFLVCWVPYASVAWYIFTHQGSTFGPIFMTIPSFFAKSSALYNPMIYICMNKQ 0 0 FRHCMITTLCCGKNPFEEEDGASATSSKTEASSVSSSSVSPA* 0 >RHO1_takRub Takifugu rubripes (fugu) Gt synt(-MBD4 +IFT122 - -PLXND1) 355 aa 12783465 AF201472 rod 0 MNGTEGPNFYIPMSNKTGVVRSPFEYPQYYLAEPWKYSLVAAYMLFLIITAFPVNFLTLFVTVKHKKLRTPLNYVLLNLAVADLFMVIGGFTVTLYTALHAYFVLGVTGCNIEGFFATLG 1 2 GEIALWSLVVLAVERYIVVCKPMTNFRFGEKHAIAGLVFTWIMALTCATPPLLGWSR 2 1 YIPEGMQCSCGIDYYTPKPEINNTSFVIYMFILHFSIPLAIIFFCYSRLLCTVRA 0 0 AAALQQESETTQRAEKEVTRMVIVMVISFLVCWVPYASVAWYIFANQGTEFGPVFMTAPAFFAKSAALYNPVIYILLNRQ 0 0 FRNCMITTVCCGKNPFGDDDAATTVSKTQSSSVSSSQVAPA* 0 >RHO1_leuEri Leucoraja erinacea (skate) Gt 355 aa 9256070 U81514 rod 0 MNGTEGENFYVPMSNKTGVVRSPFDYPQYYLGEPWMFSALAAYMFFLILTGLPVNFLTLFVTIQHKKLRQPLNYILLNLAVSDLFMVFGGFTTTIITSMNGYFIFGPAGCNFEGFFATLG 1 2 GEVGLWCLVVLAIERYMVVCKPMANFRFGSQHAIIGVVFTWIMALSCAGPPLVGWSR 2 1 
YIPEGLQCSCGVDYYTMKPEVNNESFVIYMFVVHFTIPLIVIFFCYGRLVCTVKE 0 0 AAAQQQESESTQRAEREVTRMVIIMVVAFLICWVPYASVAFYIFINQGCDFTPFFMTVPAFFAKSSAVYNPLIYILMNKQ 0 0 FRNCMITTICLGKNPFEEEESTSASASKTEASSVSSSQVAPA* 0 >RHO1_calMil Callorhinchus milii (elephantfish) Gt 355 aa rod complete wgs 0 MNGTEGENFYIPMSNKTGVVRSPFEYPQYYLAEPWQFSILAAYMFFLIITCFPVNFLTLYVTFEHKKLRQPLNFILLNLAVADLFMVFGGFFITVYTSLHGYFVFGVTGCNFEGFFATLG 1 2 GEIGLWSLVVLAIERYVVVCKPMSNFRFGTNHAIMGVAFTWVMALACAVPPLMGWSR 2 1 YIPEGLQCSCGVDYYTLKPEINNESFVIYMFVVHFLIPLIIIFFCYGRLVCTVKE 0 0 AAAQQQESESTQRAEREVTRMVIIMVIFFLICWVPYASVAFFIFTNQGSEFGPIFMAVPAFFAKSSALYNPLIYILLNKQ 0 0 FRNCMITTLCCGKNPFEEDESTSAAASKTEASSVSSSQVSPA* 0 >RHO1_petMar Petromyzon marinus (lamprey) Gt 354 aa rod 0 MNGTEGENFYIPFSNKTGLARSPFEYPQYYLAEPWKYSVLAAYMFFLILVGFPVNFLTLFVTVQHKKLRTPLNYILLNLAVANLFMVLFGFTLTMYSSMNGYFVFGPTMCNFEGFFATLG 1 2 GEMSLWSLVVLAIERYIVICKPMGNFRFGSTHAYMGVAFTWFMALSCAAPPLVGWSR 2 1 YLPEGMQCSCGPDYYTLNPNFNNESFVIYMFLVHFIIPFIVIFFCYGRLLCTVKE 0 0 AAAAQQESASTQKAEKEVTRMVVLMVIGFLVCWVPYASVAFYIFTHQGSDFGATFMTVPAFFAKTSALYNPIIYILMNKQ 0 0 FRNCMITTLCCGKNPLGDEDSGASTSKTEVSSVSTSQVSPA* 0 >RHO1_geoAus Geotria australis (lamprey) Gt 354 aa 497 nm 17463225 AY366493 rod RhA 0 MNGTEGQNFYIPFSNKTDVARSPFEYPQYYLAEPWKFSALAAYMFFLILVGFPVNFLTLFVTVQHKKLRTPLNYILLNLAVSNLFMILFGFTTTMYTSMNGYFVFGPTMCSIEGFFATLG 1 2 GEVSLWSLVVLAIERYIVICKPMGNFRFGNTHAIMGVALTWVMALSCAAPPLLGWSR 2 1 YLPEGMQCSCGPDYYTMNPTYNNESFVIYMFIVHFTIPFVIIFFSYGRLLCTVKE 0 0 AAAAQQESASTQKAEKEVTRMVVLMVVGFLVCWVPYASVAFYIFTNQGSDFGATFMTLPAFFAKSSALYNPVIYILMNKQ 0 0 FRNCMITTLCCGKNPLGDDDSGASTSKTEVSSVSTSQVAPA* 0 >RHO1_letJap Lethenteron japonicum (lamprey) Gt 354 aa 15096614 AB116382 cone rhodopsin 0 MNGTEGDNFYVPFSNKTGLARSPYEYPQYYLAEPWKYSALAAYMFFLILVGFPVNFLTLFVTVQHKKLRTPLNYILLNLAMANLFMVLFGFTVTMYTSMNGYFVFGPTMCSIEGFFATLG 1 2 GEVALWSLVVLAIERYIVICKPMGNFRFGNTHAIMGVAFTWIMALACAAPPLVGWSR 2 1 YIPEGMQCSCGPDYYTLNPNFNNESYVVYMFVVHFLVPFVIIFFCYGRLLCTVKE 0 0 AAAAQQESASTQKAEKEVTRMVVLMVIGFLVCWVPYASVAFYIFTHQGSDFGATFMTLPAFFAKSSALYNPVIYILMNKQ 0 0 
FRNCMITTLCCGKNPLGDDESGASTSKTEVSSVSTSQVSPA* 0 >RHO2_galGal Gallus gallus (chicken) Gt synt(-IHPK3 -LEMD2 -GRM4 +HMGA1) 356 aa 2268324 NP_990771 cone rhodopsin 0 MNGTEGINFYVPMSNKTGVVRSPFEYPQYYLAEPWKYRLVCCYIFFLISTGLPINLLTLLVTFKHKKLRQPLNYILVNLAVADLFMACFGFTVTFYTAWNGYFVFGPVGCAVEGFFATLG 1 2 GQVALWSLVVLAIERYIVVCKPMGNFRFSATHAMMGIAFTWVMAFSCAAPPLFGWSR 2 1 YMPEGMQCSCGPDYYTHNPDYHNESYVLYMFVIHFIIPVVVIFFSYGRLICKVRE 0 0 AAAQQQESATTQKAEKEVTRMVILMVLGFMLAWTPYAVVAFWIFTNKGADFTATLMAVPAFFSKSSSLYNPIIYVLMNKQ 0 0 FRNCMITTICCGKNPFGDEDVSSTVSQSKTEVSSVSSSQVSPA* 0 >RHO2_taeGut Taeniopygia guttata (finch) NM_001076696 0 MNGTEGINFYVPMSNKTGVVRSPFEYPQYYLAEPWKYRLVCCYIFFLISTGFPINFLTLLVTFKHKKLRQPLNYILVNLAVADLCMACFGFTVTFYTAWNGYFVFGPIGCAVEGFFATLG 1 2 GQVALWSLVVLAIERYIVICKPMGNFRFSASHALMGIAFTWVMAISCAAPPLFGWSR 2 1 YIPEGMQCSCGPDYYTHNPDFHNESYVLYMFVIHFIIPVVIIFFSYGRLVCKVRE 0 0 AAAQQQESATTQKAEKEVTRMVILMVLGFMLAWTPYAVVAFWIFTNKGADFTATLMAVPAFFSKSSSLYNPIIYVLMNKQ 0 0 FRNCMITTICCGKNPFGDEETSSTVSQSKTEVSSVSSSQVSPA* 0 >RHO2_anoCar Anolis carolinensis (lizard) Gt synt(-IHPK3 -LEMD2 -MLN +rho2 -ERO1L +CHST7 -GRM4 +HMGA1) 356 aa cone rhodopsin 0 MNGTEGINFYVPLSNKTGLVRSPFEYPQYYLAEPWKYKVVCCYIFFLIFTGLPINILTLLVTFKHKKLRQPLNYILVNLAVADLFMACFGFTVTFYTAWNGYFIFGPIGCAIEGFFATLG 1 2 GQVALWSLVVLAIERYIVVCKPMGNFRFSATHALMGISFTWFMSFSCAAPPLLGWSR 2 1 YIPEGMQCSCGPDYYTLNPDYHNESYVLYMFGVHFVIPVVVIFFSYGRLICKVRE 0 0 AAAQQQESASTQKAEREVTRMVILMVLGFLLAWTPYAMVAFWIFTNKGVDFSATLMSVPAFFSKSSSLYNPIIYVLMNKQ 0 0 FRNCMITTICCGKNPFGDEDVSSSVSQSKTEVSSVSSSQVSPA* 0 >RHO2_gekGek Gekko gekko (gecko) Gt 356 aa 11591478 AY024356 cone rhodopsin in pure rod-retina 0 MNGTEGINFYVPLSNKTGLVRSPFEYPQYYLADPWKFKVLSFYMFFLIAAGMPLNGLTLFVTFQHKKLRQPLNYILVNLAAANLVTVCCGFTVTFYASWYAYFVFGPIGCAIEGFFATIG 1 2 GQVALWSLVVLAIERYIVICKPMGNFRFSATHAIMGIAFTWFMALACAGPPLFGWSR 2 1 FIPEGMQCSCGPDYYTLNPDFHNESYVIYMFIVHFTVPMVVIFFSYGRLVCKVRE 0 0 AAAQQQESATTQKAEKEVTRMVILMVLGFLLAWTPYAATAIWIFTNRGAAFSVTFMTIPAFFSKSSSIYNPIIYVLLNKQ 0 0 FRNCMVTTICCGKNPFGDEDVSSSVSQSKTEVSSVSSSQVAPA* 0 >RHO2_podSic Podarcis sicula (lizard) 
AY941829 0 MNGTEGINFYVPLSNKTGLVRSPFEYPQYYLAEPWKYKMVCCYIFFLISTGLPINLLTLLVTFKHKKLRQPLNYILVNLAVADLFMACFGFTVTFYTAWNGYFIFGPIGCAIEGFFATLG 1 2 GQVALWSLVVLAIERYIVVCKPMGNFRFSSSHALMGIAFTWVMSLSCACPPLFGWSR 2 1 YIPEGMQCSCGPDYYTLNPDYHNESYVVYMFVIHFVIPVVVIFFSYGRLICKVRE 0 0 AAAQQQESASTQKAEKEVTRMVILMVLGFMLAWTPYAVVAFWIFTNKGADFSATLMSVPAFFSKSSSLYNPIIYVLMNKQ 0 0 FRNCMITTICCGKNPFGDDDVSSTVSQSKTEVSSISSSQVSPA* 0 >RHO2_pheMad Phelsuma madagascariensis (lizard) AF074044 0 MNGTEGFNFYVPVSNRTGLVRSPYEYPQYYLAEPWKFKALSLYMFFLILVGLPLNGLTLFVTFQHKKLRQPLNYILVNLAVANLLMVICGFTVTFYTSWYGYFVFGPMGCAFEGFFATIG 1 2 GQVALWSLVVLAIERYIVICKPMGNFRFSSSHAMMGISFTWFMALCCGGPPLFGWSR 2 1 FIPEGMQCSCGPDYYTLNPDFHNESYVIYLFTVHFLTPMIIIFFSYGRLVCKVRE 0 0 AAAQQQESATTQKAEKEVTRMVILMVMGFLVAWTPYATVACWIFNNKGAEFSVTFMTVPAFFSKSSCIYNPIIYVLLNKQ 0 0 FRNCMVTTICCGKNPFGDEDASSSVSQSKTEVSSVSSSQVAPA* 0 >RHO2_neoFor Neoceratodus forsteri (lungfish) Gt 356 aa 17961206 EF526299 cone rhodopsin 0 MNGTEGINFYVPHSNKTGVVRSPFEYPQYYLADPWKYSIVCAYMFFLIITGLPINLLTLVVTFKHKKLRQPLNYILVNLAVADLFMVCFGFTVTFSTAINGYFIFGPRGCAIEGFMATLG 1 2 GEVALWSLVVLAIERYIVVCKPMGNFRFSNNHSIIGIVFTWLAALSCAAPPLFGWSR 2 1 YLPEGMQCSCGPDYYTMNPDYHNESFVIYMFVVHFFIPVIVIFVSYGRLICKVKE 0 0 AAAQQQESASTQKAEREVTRMVILMVIGFMTAWTPYATVAFWIFMNKGAEFGATFMAAPAFFSKSSALYNPIIYVLMNKQ 0 0 FRNCMVTTLCCGKNPFGDDDVSSSVSAGKTEVSSVSSSQVSPA* 0 >RHO2_latCha Latimeria chalumnae (coelacanth) Gt 355 aa 485 nm 10339578 AH007713 cone rhodopsin RH2 0 MNGTEGMNFYVPLSNRTGLVRSPFEYTQYYLAEPWKFSVLCAYMFLLIILGFPINFLTLLVTFKHKKLRQPLNYILVNLAVASLFMVVFGFTVTFYSSLNGYFVLGPMGCAMEGFFATLG 1 2 GQVALWSLVVLAIERYIVVCKPMGNFRFASSHAIMGIAFTWIMALACAAPPLVGWSR 2 1 YIPEGLQCSCGPDYYTLNPDFHNESYVMYLFLVHFLLPIIIIFFTYGRLICKVKE 0 0 AAAQQQESASTQKAEKEVTRMVILMVIGFLTAWVPYASAAFWIFCNRGAEFTATLMTVPAFFSKSSCLFNPIIYVLLNKQ 0 0 FRNCMITTLCCGKNPLGDDDTSSAVSQSKTDVSSVSSSQVSPA* 0 >RHO2_danRer Danio rerio (zebrafish) opn1mw1 Chinen PMID 15647516 ancestral 84% 85% 93% 95% 0 MNGTEGNNFYIPMSNRTGLVRSPYEYPQYYLAEPWQFKLLAVYMFFLICLGFPINGLTLLVTAQHKKLRQPLNFILVNLAVAGTIMVCFGFTVTFYSAINGYFVLGPTGCAIEGFMATLG 1 2 
GEVALWSLVVLAIERYIVVCKPMGSFKFSSSHAMAGIAFTWVMAMACAAPPLFGWSR 2 1 YIPEGMQCSCGPDYYTLNPEYNNESYVLYMFICHGIVPVTIIFFTYGRLVCTVKA 0 0 AAAQQQESESTQKAEREVTRMVILMVLGFLVAWTPYATVAAWIFFNKGAAFSAQFMAVPAFFSKSSALFNPIIYVLLNKQ 0 0 FRNCMLTTLFCGKNPLGDDESSTVSTSKTEVSSVSPA* 0 >RHO2a_danRer Danio rerio (zebrafish) opn1mw1 NM_131253 76% RHO2_latCha 0 MNGTEGSNFYIPMSNRTGLVRSPYDYTQYYLAEPWKFKALAFYMFLLIIFGFPINVLTLVVTAQHKKLRQPLNYILVNLAFAGTIMVIFGFTVSFYCSLVGYMALGPLGCVMEGFFATLG 1 2 GQVALWSLVVLAIERYIVVCKPMGSFKFSANHAMAGIAFTWFMACSCAVPPLFGWSR 2 1 YLPEGMQTSCGPDYYTLNPEYNNESYVMYMFSCHFCIPVTTIFFTYGSLVCTVKA 0 0 AAAQQQESESTQKAEREVTRMVILMVLGFLFAWVPYASFAAWIFFNRGAAFSAQAMAVPAFFSKTSAVFNPIIYVLLNKQ 0 0 FRSCMLNTLFCGKSPLGDDESSSVSTSKTEVSSVSPA* 0 >RHO2b_danRer Danio rerio (zebrafish) opn1mw2 NM_182891 92% RHO2a 76% RHO2_latCha highest expression 0 MNGTEGNNFYIPMSNRTGLVRSPYEYTQYYLADPWQFKALAFYMFFLICFGLPINVLTLLVTAQHKKLRQPLNYILVNLAFAGTIMAFFGFTVTFYCSINGYMALGPTGCAIEGFFATLG 1 2 GQVALWSLVVLAIERYIVVCKPMGSFKFSSNHAMAGIAFTWVMASSCAVPPLFGWSR 2 1 YIPEGMQTSCGPDYYTLNPEFNNESYVLYMFSCHFCVPVTTIFFTYGSLVCTVKA 0 0 AAAQQQESESTQKAEREVTRMVILMVLGFLVAWVPYASFAAWIFFNRGAAFSAQAMAIPAFFSKASALFNPIIYVLLNKQ 0 0 FRSCMLNTLFCGKSPLGDDESSSVSTSKTEVSSVSPA* 0 >RHO2c_danRer Danio rerio (zebrafish) opn1mw3 NM_182892 85% RHO2a 78% RHO2_latCha 0 MNGTEGNNFYIPMSNRTGLVRSPYEYPQYYLAEPWQFKLLAVYMFFLMCFGFPINGLTLVVTAQHKKLRQPLNFILVNLAVAGTIMVCFGFTVTFYTAINGYFVLGPTGCAIEGFMATLG 1 2 GQISLWSLVVLAIERYIVVCKPMGSFKFSSNHAFAGIGFTWIMALSCAAPPLVGWSR 2 1 YIPEGMQCSCGPDYYTLNPDYNNESYVLYMFCCHFIFPVTTIFFTYGRLVCTVKA 0 0 AAAQQQESESTQKAEREVTRMVILMVLGFLVAWTPYASVAAWIFFNRGAAFSAQFMAVPAFFSKSSSIFNPIIYVLLNKQ 0 0 FRNCMLTTLFCGKNPLGDDESSTVSTSKTEVSSVSPA* 0 >RHO2d_danRer Danio rerio (zebrafish) opn1mw4 NM_131254 83% RHO2a 77% RHO2_latCha 0 MNGTEGNNFYIPLSNRTGLARSPYEYPQYYLAEPWQFKLLAVYMFFLICLGFPINGLTLLVTAQHKKLRQPLNFILVNLAVAGTIMVCFGFTVTFYTAINGYFVLGPTGCAIEGFMATLG 1 2 GEVALWSLVVLAVERYIVVCKPMGSFKFSASHAFAGCAFTWVMAMACAAPPLVGWPR 2 1 YIPEGMQCSCGPDYYTLNPEYNNESYVLYMFICHFILPVTIIFFTYGRLVCTVKA 0 0 
AAAQQQESESTQKAEREVTRMVILMVLGFLIAWTPYATVAAWIFFNKGAAFSAQFMAVPAFFSKTSALYNPVIYVLLNKQ 0 0 FRNCMLTTLFCGKNPLGDDESSTVSTSKTEVSSVSPA* 0 >RHO2_takRub Takifugu rubripes (fugu) AF226989 0 MAWDGGIEPNGTEGKNFYIPMSNRTGIVRSPFEYPQYYLADPIMFKILALYMFFLICTGTPINGLTLLVTAQNKKLRQPLNYILVNLAVAGLIMCAFGFTITITSAVNGYFILGATACAVEGFMATLG 1 2 GEIALWSLVVLAVERYVVVCKPMGSFKFTGTHAAVGVAFTWIMAFACAAPPLFGWSR 2 1 YLPEGMQCSCGPDYYTLAPGYNNESYVIYMFSCHFFVPVITIFFTYGSLVLTVKA 0 0 AAAQQQESESTQKAQKEVTRMCILMVFGFLMAWTPYATFSAWIFMNKGAAFHPLTAAVCAFFAKSSALYNPVIYVLLNKQ 0 0 FRNCMLSTIGMGGAVDDETSVSASKTEVSSVS* 0 >RHO2_gasAcu Gasterosteus aculeatus (stickleback) genome (has 2nd near-identical tandem copy) 0 MAWEGGLEPNGTEGKNFYIPMSNRTGVVRSPFEYQQYYLADPIMFKILALYMFFLICTGTPINGLTLLVTAQNKKLRQPLNYILVNLAVAGLIMCAFGFTITITSAVNGYFILGATACAVEGFMATLG 1 2 GEVALWSLVVLAVERYIVVCKPMGSFKFSGTHAGAGVLFTWIMAMACAAPPLFGWSR 2 1 YLPEGMQCSCGPDYYTLAPGFNNESYVIYMFVVHFFTPVFIIFFTYGSLVLTVKA 0 0 AAAQQQESESTQKAEREVTRMCILMVFGFLVAWVPYASFAGWIFLNKGAPFSALTAAIPAFFAKSSALYNPVIYVLLNKQ 0 0 FRNCMLTTIGMGGMVEDETSVSASKTEVSSVS* 0 >RHO2_oryLat Oryzias latipes (medaka) AB001603 0 MENGTEGKNFYIPMNNRTGLVRSPYEYPQYYLADPWQFKLLGIYMFFLILTGFPINALTLVVTAQNKKLRQPLNFILVNLAVAGLIMVCFGFTVCIYSCMVGYFSLGPLGCTIEGFMATLG 1 2 GQVSLWSLVVLAIERYIVVCKPMGSFKFTATHSAAGCAFTWIMASSCAVPPLVGWSR 2 1 YIPEGIQVSCGPDYYTLAPGFNNESFVMYMFSCHFCVPVFTIFFTYGSLVMTVKA 0 0 AAAQQQDSASTQKAEKEVTRMCFLMVLGFLLAWVPYASYAAWIFFNRGAAFSAMSMAIPSFFSKSSALFNPIIYILLNKQ 0 0 FRNCMLATIGMGGMVEDETSVSTSKTEVSTAA* 0 >RHO2_oreNil Oreochromis niloticus (tilapia) AF247124 0 MAWEGGIEPNGTEGKNFYIPMSNRTGIVRNPFEYSQYYLADPIFFKLLAFYMFFLICTGTPINGLTLFVTAQNKKLRQPLNYILVNLAVAGLIMCCFGFTITITSAINGYFVLGTTFCAIEGFMATLG 1 2 GEVALWSLVVLAVERYIVVCKPMGSFKFTGAHAGAGVLFTWIMAMACAAPPLFGWSR 2 1 YIPEGMQCSCGPDYYTLAPGFNNESYVIYMFVVHFFVPVFIIFFTYGSLVMTVRA 0 0 AAAQQQDSASTQKAEKEVTRMCVLMVMGFLIAWTPYASFAGWIFLNKGAAFSALTAAIPAFFAKSSALYNPIIYVLMNKQ 0 0 FRNCMLSTIGMGGMVEDETSVSTSKTEVSSVS* 0 >RHO2_hipHip Hippoglossus hippoglossus (halibut) AF156263 0 
MVWDGGIEPNGTEGKNFYIPMSNRTGIVRSPFEYPQFYMVDSMMFKFLAFYMFFLVCTGTPINGLTLFVTAQNKKLRQPLNYILVNLAVAGLIMCCFGFTITITSAFNGYFILGATFCTIEGFMATLG 1 2 GEVALWSLVVLAVERYIVVCKPMGSFKFSGTHAGIGVLFTWVMAFACAGPPLFGWSR 2 1 YIPEGMQCSCGPDYYTLAPGFNNESYVIYMFVVHFFLPVFVIFFTYGSLVLTVKA 0 0 AAAQQQESESTQKAEKEVTRMCILMVFGFLFAWTPYATFAGWIFMNKGAAFTALTASIPAFFAKSSALYNPVIYVLLNKQ 0 0 FRNCMLSTIGMGGMVEDESSVSASKTEVSSVS* 0 >RHO2_mulSur Mullus surmuletus (mullet) Y18680 0 MNGTEGKNFYIPMSNRTGIVRSPFEYPQYYMVDPMIYKLLAFYMFFLICTGTPINGLTLLVTFQNKKLQQPLNYILVNLAVVGLIMCAFGFTITITSALNGYFILGPTFCAIEGFMATLG 1 2 GEVALWSLVVLAVERYIVVCKPMGSFKFSGTHAGAGVAFTWIMAFACAGPPLFGWSR 2 1 YLPEGMQCSCGPDYYTLAPGFNNESYVIYMFVVHFFVPVFVIFFTYGSLVLTVKA 0 0 AAAQQQESESTQKAEREVTRMCILMVIGFLVAWVPYATFAGWIFLNKGAAFTALTAALPAFFAKSSALYNPVIYVMMNKQ 0 0 FRNCILSAIGMGGMVEDETSVSTSKTEVSTAS* 0 >RHO2_pomMin Pomatoschistus minutus (sand_goby) Y18679 0 MNGTEGKNFYIPMSNRTGIVRSPYEYPQYYMVDPWIYKLLAFYMFFLICTGTPINALTLLVTFQNKKLRQPLNFILVNLAVAGLIMCAFGFTITITSALNGYFILGATFCAIEGFMATLG 1 2 GEVALWSLVVLAVERYIVVCKPMGSFKFSGAHAGAGVALTWIMAMACAAPPLFGWSR 2 1 YLPEGMQCSCGPDYYTLAPGFNNESYVMYMFVVHFFIPVFLIFFTYGSLVLTVKA 0 0 AAAQQQDSASTQKAEKEVTRMCFLMVMGFLVAWVPYASFAGWIFLNKGAAFTAMTAAIPAFFAKSSALYNPVIYVLMNKQ 0 0 FRNCMLSAVGMGGMVDDETSVSASKTEVSSVS* 0 >RHO2_calMil Callorhinchus milii (elephantfish) EF565168 0 MNGTKGSNFYIPMSNRTGVVRNPFEYPQYYLADRWLFSSISAYMFLLICAGLPINGLTLLVTVKHKKLRQPLNFILLNLAVADLFMVFGGFFITVYTSLHGYFVFGVTGCNFEGFFATLG 1 2 GEIGLWSLVVLAIERYVVVCKPMSNFRFGTSHALMGMGFTWFMALTAAVPPLVGWSR 2 1 FIPEGFQCSCTPDFYTTNPLYNNDSYLMYLFSVHFAFPVTLIFFSYGRLICKVKE 0 0 AAAQQQESATTQKAEKEVTRMVILMVIGFLTAWLPYASLSIWIFTHQGAWISPLLMTIPSFFSKSSVLYNPIIYILMNKQ 0 0 FRSSMITTVCCGKNPFGDDDSSSVTSQSKTEVSSVSTSQVSPA* 0 >RHO2_geoAus Geotria australis (lamprey) Gt 355 aa 492 nm 17463225 AY366494 cone rhodopsin RhB no petMar 0 MNGTEGANFYIPFHNRTGVVRSPYEYPQYYLADPWMYSAISAYVFTLILIGFPVNFMTLFVTFKLKKLRQPLNFILVNLCVADLLMIMFGFTTTFYTAMNGYFVFGPTGCNIEGFFATLG 1 2 GEVSLWSLVMLAIERYIVVCKPMGNFRFATTHAALGVVFTWVMASACAVPPLVGWSR 2 1 
YIPEGMQCSCGPDYYTLNPKYYNESYVIYLFLVHFLLPVTIIFFTYGRLICTVKE 0 0 AAAQQQESASTQKAEREVTRMVIIMVVGFLVCWVPYASFAFYLFMNKGILFSATAMTVPAFFSKSSVLYNPIIYVLLNKQ 0 0 FRTCMVTTLFCGKNPFGEDDSSMVSTSKTEVSSVSSSQVSPS* 0 >SWS2_ornAna Ornithorhynchus anatinus (platypus) Gt synt(-IRAK1 -MECP2 - +TKTL1) 364 aa 17339011 ABN43074 cone short blue tandem -FLNB--+MECP2 with MWS1 0 MHKTHRNLQNELPEDFFIPLPLDTDNITSLSPFLVPQTHLGGSGIFMSLAAFMFLLITLGFPINLLTVICTIKYKKLRSHLNYILVNLAVSNMLVVCVGSATAFYSFAHMYFVLGPTACKIEGFAATLG 1 2 GMVSLWSLAVIAFERFLVICKPLGNLSFRGTHAIFGCAATWVFGLAASLPPLFGWSR 2 1 YIPEGLQCSCGPDWYTTNNKWNNESYVIFLFSFCFGVPLSIIIFSYGRLLLTLRA 0 0 VAKQQEQSATTQKAEREVTKMVIVMVLGFLVCWLPYASFSLWVVTNRGQVFDLRMASIPSVFSKASTIYNPIIYVFMNKQ 0 0 FRSCMLKLVFCGKSPFGDEDEISGSSQATQVSSVSSSQVSPA* 0 >SWS2_galGal Gallus gallus (chicken) Gt 362 aa 7975342 NP_990848 cone short2 blue 0 MHPPRPTTDLPEDFYIPMALDAPNITALSPFLVPQTHLGSPGLFRAMAAFMFLLIALGVPINTLTIFCTARFRKLRSHLNYILVNLALANLLVILVGSTTACYSFSQMYFALGPTACKIEGFAATLG 1 2 GMVSLWSLAVVAFERFLVICKPLGNFTFRGSHAVLGCVATWVLGFVASAPPLFGWSR 2 1 YIPEGLQCSCGPDWYTTDNKWHNESYVLFLFTFCFGVPLAIIVFSYGRLLITLRA 0 0 VARQQEQSATTQKADREVTKMVVVMVLGFLVCWAPYTAFALWVVTHRGRSFEVGLASIPSVFSKSSTVYNPVIYVLMNKQ 0 0 FRSCMLKLLFCGRSPFGDDEDVSGSSQATQVSSVSSSHVAPA* 0 >SWS2_taeGut Taeniopygia guttata (finch) Gt 363 aa cone short2 0 MPKPREMRDELPEDFYIPMSLETPNLTALSPFLVPQTHLGSPGIFKAMAAFMFLLVLLGVPINALTVLCTAKYKKLRSHLNYILVNLAVANLLVVCVGSTTAFYSFSQMYFALGPLACKIEGFTATLG 1 2 GMVSLWSLAVVAFERFLVICKPLGNFTFRGSHAVLGCAITWIFGLIASLPPLFGWSR 2 1 YIPEGLQCSCGPDWYTTDNKWNNESYVIFLFCFCFGFPLTVIVFSYGRLLLTLRA 0 0 VAKQQEQSASTQKAEREVTKMVVVMVLGFLVCWLPYCSFALWVVTHRGHPFDLGLASIPSVFSKASTVYNPIIYVFMNKQ 0 0 FRSCMLKLVFCGRSPFGDEDDVSGSSQATQVSSVSSSQVSPA* 0 >SWS2_anoCar Anolis carolinensis (lizard) 0 MQKSRPDSRDNLPEDFFIPVPLDVANITTLSPFLVPQTHLGNPSLFMGMAAFMFILIVLGVPINVLTIFCTFKYKKLRSHLNYILVNLSVSNLLVVCVGSTTAFYSFSNMYFSLGPTACKIEGFSATLG 1 2 GMVSLWSLAVVAFERYLVICKPLGNFTFRGTHAIIGCAVTWMFGLAASLPPLFGWSR 2 1 YIPEGLQCSCGPDWYTTENKWNNESYVIFLFCFCFGVPLSVIIFSYGRLLLTLRA 0 0 
VAKQQEQSATTQKAEREVTKMVVVMVMGFLVCWLPYASFALWVVTHRGEPFDVRLASIPSVFSKASTVYNPVIYVLMNKQ 0 0 FRSCMLKLIFCGKSPFGDEDDVSGSSQATQVSSVSSSQVSPA* 0 >SWS2_utaSta Uta stansburiana (lizard) Gt 364 aa 16543463 DQ100326 cone short 0 MHNSRPHSRDDLPEDFFIPMPLDVANITTLSPFLVPQTHLGSPALFMGMAAFMFLLIILGVPINVLTIFCTFKYKKLRSHLNYILVNLAVSNLLVVCIGSTTAFYSFAQMYFSLGPTACKIEGFAATLG 1 2 GMVSLWSLAVVAFERFLVICKPLGNFSFRGTHAIIGCIITWVFGLVASLPPLFGWSR 2 1 YIPEGLQCSCGPDWYTTNNKWNNESYVLFLFSFCFGVPLSVIIFSYGRLLLTLRA 0 0 VAKQQEQSATTQKAEREVTKMVVVMVMGFLVCWLPYASFALWVVTHRGEPFDVRLATIPSVFSKASSVYNPVIYVFMNKQ 0 0 FRSCMLKLVFCGKSPFGDEDDVSGSSQTTQVSSVSSSQVSPA* 0 >SWS2_xenTro Xenopus tropicalis (frog) Gt -IRAK1 -MECP2 - - 363 aa cone short 0 MSKGRPDLRMEMPDEFYVPIPLETTNISSLSPFLVPQTHLGTPGIFMSISAFMLFTIIFGFPLNLLTIICTVKYKKLRSHLNYILVNLAVANLIVICFGSTTAFYSFSQMYFSLGTLACKIEGFTATLG 1 2 GIIGLWSLAVVAFERFLVICKPMGNFTFRESHAVLGCILTWVIGLVAAIPPLLGWSR 2 1 YIPEGLQCSCGPDWYTVNNKWNNESYVLFLFCFCFGFPLAIIVFSYGRLLLALHA 0 0 VAKQQEQSATTQKAEREVTRMVIVMVVGFLVCWLPYASFALWAVTHRGELFDLRMSSVPSVFSKASTVYNPFIYIFMNRQ 0 0 FRSCMMKMIFCGKNPLGDDEETSVSGSTQVSSVSSSQIAPS* 0 >SWS2_neoFor Neoceratodus forsteri (lungfish) Gt 364 aa 17961206 EF526299 cone short 0 MHRTKPDPQEDLPDDFYIPVSLNTNNITMLSPFLVPQTHLGSPSVFMVLSVFMFFLLITGIPINVLTIICTFKYKKLRSHLNYILVNLAVANLIVVGFGSTTAFYSFSQMYFAWGPLACKIEGFAATLG 1 2 GMVSLWSLAVVAFERFLVICKPLGNFTFRSTHAIIGCVATWVFGLISSAPPLFGWSR 2 1 YIPEGLQCSCGPDWYTTNNKWNNESYVIFLFCFCFGFPLSVIIFSYGRLLMTLRA 0 0 VAKQQEQSASTQKAEREVTKMVVVMVLGFLVCWLPYTVFSLWVVTHRGESFELALGSIPAVFSKSSTVYNPLIYVFMNKQ 0 0 FRSCMMKLIFCGKSPFGDEDDASSASQSTQVSSVSSSQVAPA* 0 >SWS2_takRub Takifugu rubripes (fugu) Gt 351 aa cone short2 0 MRGVRQHEFQEDFYIPIPLDVDNITALSPFLVPQDHLGSPAVFYGMSAFMFFLFVAGTGINVLTIACTIQYKKLRSHLNYILVNLAFSNLLVTTVGSFTCFCCFFVRYMIVGPLGCKIEGFAATLG 1 2 GMVSLWSLAVVAFERWLVVCKPLGNFIFKPDHAIVCCIFTWFFALIISAPPLFGWSR 2 1 YIPEGFQCSCGPDWYTTGNKYNNESYVWFIFGFGFAVPLFVIVFCYSQLLVMLKS 0 0 AKAQAESASTQKAEREVTRMVVVMILGFLVCWLPYASFALWVVNNRGTPFDLRLATIPACFSKASTVYNPIIYVVLNKQ 0 0 FRSCMKKMLGMSGGDDEESSSQSVTEVSKVSPS* 0 >SWS2_gasAcu Gasterosteus 
aculeatus (stickleback) Gt 359 aa cone short 0 MKHGRVPEIPEDFYIPISLDTDNITSLSPFLVPQDHLASKATFYSLAFYMFFILIVGTFINALTVACTVQNKKLRSHLNYILVNLAVSNLLVSGVGAFTAFLSFAARYFVLGTLACKVEGFLATLG 1 2 GMVSLWSLAVIAFERWLVICKPLGNFIFKPDHALVCCAFTWVFALAASAPPLVGWSR 2 1 YIPEGLQCSCGPDWYTTNNKYNNESYVLFLFGFCFAVPFCTICFCYSQLLFTMKMA 0 0 AKAQAESASTQKAEREVTRMVVLMVMGFLVCWMPYASFALWVVNNRGQTFDLRFASIPSVFSKSSAVYNPVIYVLLNKQ 0 0 FRSCMMKMLGMGGGDDEESSTSSVTEVSKVGPA* 0 >SWS2_geoAus Geotria australis (lamprey) Gt 362 aa 439 nm 17463225 AY366492 cone short2 blue retinal petMar ps 0 MYQGKSTQVDDLPEDFYIPIALNVKNMSELSPFLVPQVHLGDSFIFYGMSAFMLFLVLAGFPLNFLTVFVTIKYKKLRSHLNYILVNLAIANLIVVCCGSTLAFYSFMHKYFILGPLFCKMEGFTATLG 1 2 GMLSLWSLAVLAFERCLVICKPFGNIAFRGTHALIRCGFAWAAAIAASTPPLFGWSR 2 1 YIPEGLQCSCGPDWYTTNNKYNNESYVMFLFIFCFGTPFTIIIVSYSKLILTLRA 0 0 AAAQQQESASTQKAEKEVSRMVVIMVGGFLVCWLPYASLALWIVFNRGSPFDLRLATIPSVFSKASTVYNPVIYIFLNKQ 0 0 FRSCMMKTIFCGKNPLGDDEDATSTTTQVSSVSTSQVAPA* 0 >SWS1_homSap Homo sapiens (human) Gt synt(-FAM137A -CALU -NAG6 -FLNC) 348 aa 1385866 NP_990769 cone short 0 MRKMSEEEFYLFKNISSVGPWDGPQYHIAPVWAFYLQAAFMGTVFLIGFPLNAMVLVATLRYKKLRQPLNYILVNVSFGGFLLCIFSVFPVFVASCNGYFVFGRHVCALEGFLGTVA 1 2 GLVTGWSLAFLAFERYIVICKPFGNFRFSSKHALTVVLATWTIGIGVSIPPFFGWSR 2 1 FIPEGLQCSCGPDWYTVGTKYRSESYTWFLFIFCFIVPLSLICFSYTQLLRALKA 0 0 VAAQQQESATTQKAEREVSRMVVVMVGSFCVCYVPYAAFAMYMVNNRNHGLDLRLVTIPSFFSKSACIYNPIIYCFMNKQ 0 0 FQACIMKMVCGKAMTDESDTCSSQKTEVSTVSSTQVGPN* 0 >SWS1_monDom Monodelphis domesticus (opossum) Gt synt(-FAM137A -CALU -NAG6 -FLNC) 347 aa cone short 0 MSGDEEFYLFKNISSVGPWDGPQYHIAPAWAFHFQTVFMGFVFCAGTPLNAVVLVATLRYKKLRQPLNYILVNVSLCGFIFCIFAVFTVFISSSQGYFIFGRHVCAMEAFLGSVA 1 2 GLVTGWSLAFLAFERFIVICKPFGNFRFNSKHAMMVVLATWVIGIGVSIPPFFGWSR 2 1 FIPEGLQCSCGPDWYTVGTKYRSEYYTWFLFIFCFIMPLFLICFSYSQLLRALRA 0 0 VAAQQQESATTQKAEREVSRMVVMMVGSFCLCYVPYAALAMYMVNNQNHGLDLRLVTIPAFFSKSACVYNPIIYCFMNKQ 0 0 FHACIMEMVCRKPMTDDSDVSSSQKTEVSAVSSSQVGPT* 0 >SWS1_smiCra Sminthopsis crassicaudata (dunnart) AY442173 0 
MSGDEEFYLFKNISLVGPWDGPQYHLAPAWAFHFQTAFMGFVFFAGTSLNGVVLIATLRYKKLRQPLNYILVNISLAGFIFCVFSVFTVFVSSSQGYFVFGRHVCAMEGFLGSVA 1 2 GLVTGWSLAFLAFERFIVICKPFGNFRFNSKHAMMVVLATWIIGIGVSIPPFFGWSR 2 1 YIPEGLQCSCGPDWYTVGTKYRSEYYTWFLFIFCFIVPLSLICFSYSQLLGALRA 0 0 VAAQQQESATTQKAEREVSRMVVVMVGSFCLCYVPYAAMAMYMVNNRNHGLDLRLVTIPAFFSKSACVYNPIIYCFMNKQ 0 0 FHACIMEMICKKPMTDDSETTSSQKTEVSTVSSSQVGPS* 0 >SWS1_tarRos Tarsipes rostratus (honey_possum) AY772472 0 MSGDEEFYLFKDISSVGPWDGPQYHIAPAWAFHFQTTFMGFVFFAGTPLNAVVLIATLRYKKLRQPLNYILVNISLAGFIFCVISVFTVFISSSQGYFIFGRHVCAMEAFLGSVA 1 2 GLVTGWSLAFLAFERFIVICKPFGNFRFSSKHAMMVVLATWVIGIGVSIPPFFGWSR 2 1 YIPEGLQCSCGPDWYTVGTKYHSEYYTGFLFIFCFIVPLSLICFSYSQLLGALRA 0 0 VAAQQQESATTQKAEREVSRMVVVMVGSFCLCYVPYAALAMYMVNNRNHGLDLRLVTIPAFFSKSACVYNPIVYWFMNKQ 0 0 FHACIMEMVCRKPMTDDSEISSSQKTEVSTVSSSQVGPS* 0 >SWS1_galGal Gallus gallus (chicken) Gt 348 aa cone short1 violet 0 MSSDDDFYLFTNGSVPGPWDGPQYHIAPPWAFYLQTAFMGIVFAVGTPLNAVVLWVTVRYKRLRQPLNYILVNISASGFVSCVLSVFVVFVASARGYFVFGKRVCELEAFVGTHG 1 2 GLVTGWSLAFLAFERYIVICKPFGNFRFSSRHALLVVVATWLIGVGVGLPPFFGWSR 2 1 YMPEGLQCSCGPDWYTVGTKYRSEYYTWFLFIFCFIVPLSLIIFSYSQLLSALRA 0 0 VAAQQQESATTQKAEREVSRMVVVMVGSFCLCYVPYAALAMYMVNNRDHGLDLRLVTIPAFFSKSACVYNPIIYCFMNKQ 0 0 FRACIMETVCGKPLTDDSDASTSAQRTEVSSVSSSQVGPT* 0 >SWS1_taeGut Taeniopygia guttata (finch) Gt 347 aa cone short1 0 MDEEEFYLFKNQSSVGPWDGPQYHIAPMWAFYLQTIFMGLVFVAGTPLNAIVLIVTIKYKKLRQPLNYILVNISVSGLMCCVFCIFTVFIASSQGYFVFGKHMCAFEGFAGATG 1 2 GLVTGWSLAFLAFERYIVICKPFGNFRFNSRHALLVVAATWIIGVGVAIPPFFGWSR 2 1 YIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLSLIIFSYSQLLSALRA 0 0 VAAQQQESATTQKAEREVSRMVVVMVGSFCMCYVPYAALAMYMVNNREHGIDLRLVTIPAFFSKSSCVYNPIIYCFMNKQ 0 0 FRACIMETVCGRPMTDDSEVSSSAQRTEVSSVSSSQVGPS* 0 >SWS1_anoCar Anolis carolinensis (lizard) Gt synt(- -CALU - -) 347 aa cone short 0 MSGQEDFYLFENISSVGPWDGPQYHIAPMWAFYFQTAFMGFVFFAGTPLNAIILIVTVKYKKLRQPLNYILVNISFAGFLFCTFSVFTVFMASSQGYFFFGRHVCAMEAFLGSVA 1 2 GLVTGWSLAFLAFERYIVICKPFGNFRFNSRHALLVVAATWIIGVGVAIPPFFGWSR 2 1 YIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLTLIIFSYSQLLGALRA 0 0 
VAAQQQESATTQKAEREVSRMVVVMVGSFCLCYVPYASLAMYMVNNRDHGLDLRLVTIPAFFSKSSCVYNPIIYCFMNKQ 0 0 FRACILETVCGKPMSDESDVSSSAQKTEVSSVSSSQVSPS* 0 >SWS1_utaSta Uta stansburiana (lizard) Gt 348 aa 16543463 DQ100325 cone short 0 MSGEEDFYLFENISSVGPWDGPQYHIAPMWAFYFQTAFMGFVFFAGTPLNAIILIVTVKYKKLRQPLNYILVNISFAGFLFCVFSVFTVFLASSQGYFFFGRHICALEAFLGSVA 1 2 GLVTGWSLAFLAFERYIVICKPFGNFRFNSKHALLVVAATWFIGIGVSIPPFFGWSR 2 1 FIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIVPLTLIIFSYSQLLGALRA 0 0 VAAQQQESATTQKAEREVSRMVVVMVGSFCLCYVPYAALAMYMVNNRDHGIDLRLVTIPAFFSKSACVYNPIIYCFMNKQ 0 0 FRACIMETVCGKPMTDESDVSSSAQKTEVSSVSSSQVSPS* 0 >SWS1_xenLae Xenopus laevis (frog) Gt synt(- -CALU - -) 348 aa cone short 0 MLEEEDFYLFKNVSNVSPFDGPQYHIAPKWAFTLQAIFMGMVFLIGTPLNFIVLLVTIKYKKLRQPLNYILVNITVGGFLMCIFSIFPVFVSSSQGYFFFGRIACSIDAFVGTLT 1 2 GLVTGWSLAFLAFERYIVICKPMGNFNFSSSHALAVVICTWIIGIVVSVPPFLGWSR 2 1 YMPEGLQCSCGPDWYTVGTKYRSEYYTWFIFIFCFVIPLSLICFSYGRLLGALRA 0 0 VAAQQQESASTQKAEREVSRMVIFMVGSFCLCYVPYAAMAMYMVTNRNHGLDLRLVTIPAFFSKSSCVYNPIIYSFMNKQ 0 0 FRGCIMETVCGRPMSDDSSVSSTSQRTEVSTVSSSQVSPA* 0 >SWS1_neoFor Neoceratodus forsteri (lungfish) Gt 347 aa 17961206 EF526299 cone short 0 MSGEEEFYLFKNISSVGPWDGPQYHIAPKWAFFLQAAFMGFVLFVGTPLNAIVLFVTIKYKKLQQPLNYILVNISLAGFIFCFFGVFAVFIASCQGYFIFGKTVCALEGFTGSVA 1 2 GLVTGWSLAILAFERYLVICKPIGNFRFGSKHSMIAVVAAWVIGVGVSIPPFFGWSR 2 1 YIPEGLQCSCGPDWYTVGTKYKSEYYTWFLFIFCFIIPLFIICFSYSQLLGALRA 0 0 VAAQQQESATTQKAEREVSRMIIVMVGSFCVCYVPYAALAMYMVNNRDHGIDLRLVTIPAFFSKSSFVYNPIIYCFMNKQ 0 0 FRACIMQTVFGKPMTDDSDISSSGKTEVSSVSSSQVNPS* >SWS1_danRer Danio rerio (zebrafish) Gt synt(- -CALU - -) 337 aa cone short1 0 MDAWAVQFGNASKVSPFEGEQYHIAPKWAFYLQAAFMGFVFIVGTPMNGIVLFVTMKYKKLRQPLNYILVNISLAGFIFDTFSVSQVSVCAARGYYSLGYTLCSMEAAMGSIA 1 2 GLVTGWSLAVLAFERYVVICKPFGSFKFGQGQAVGAVVFTWIIGTACATPPFFGWSR 2 1 YIPEGLGTACGPDWYTKSEEYNSESYTYFLLITCFMMPMTIIIFSYSQLLGALRA 0 0 VAAQQAESESTQKAEREVSRMVVVMVGSFVLCYAPYAVTAMYFANSDEPNKDYRLVAIPAFFSKSSSVYNPLIYAFMNKQ 0 0 FNACIMETVFGKKIDESSEVSSKTETSSVSA* 0 >SWS1_oryLat Oryzias latipes (medaka) Gt synt(- - - -) 336 aa cone short1 0 
MGKYFYLYENISKVGPYDGPQYYLAPTWAFYLQAAFMGFVFFVGTPLNFVVLLATAKYKKLRVPLNYILVNITFAGFIFVTFSVSQVFLASVRGYYFFGQTLCALEAAVGAVA 1 2 GLVTSWSLAVLSFERYLVICKPFGAFKFGSNHALAAVIFTWFMGVGCACPPFFGWSR 2 1 YIPEGLGCSCGPDWYTNCEEFSCASYSKFLLVTCFICPITIIIFSYSQLLGALRA 0 0 VAAQQAESASTQKAEKEVSRMIIVMVASFVTCYGPYALTAQYYAYSQDENKDYRLVTIPAFFSKSSCVYNPLIYAFMNKQ 0 0 FNGCIMEMVFGKKMEEASEVSSKTEVSTDS* 0 >SWS1_petMar Petromyzon marinus (lamprey) recent exonic pseudogene with multiple frameshifts and internal stops, no synteny homSap 0 MSGDEEFYLFKNISKVGPLDGPHFHIATKWAFDFQAAFMGFVFLCGtPLNAIVLIVTVKCKKLRQPLTYMLVNISAAGLVFCLFSISTVFLFSTQGYFVFGPTVCALESLFGSMA 1 2 GLVTGWSLAFLAAERYIVICKPFGNFRFGSIHSLFAFCLTWVLGLGVALPPFFGWSR 2 1 YIPeGLQCSCSPDWNTVGTKYESEYCTYFLFVFCFFVQLSIIIFSYGKLLNTLra 0 0 VAVQqQESSLSSTQKAEREMSRMVIVMVGSFCTCYVAALALYVVTNRDHNIDLRFVTVPAFFSKASCVYNPLIYSFMNKQ 0 0 FRARIMETVCGKFITDESETSSSRTAVSSVSTSQVSPG* 0 >SWS1_geoAus Geotria australis (lamprey) Gt 346 aa 359 nm 17463225 AY366495 cone short1 UV retinal 0 MSGDEEFYLFKNISKVGPWDGPQFHIAPKWAFYLQAAFMGFVFICGTPLNAIVLVVTIKYKKLRQPLNYILVNISAAGLVFCLFSISTVFVASMQGYFFLGPTICALEAFFGSLA 1 2 GLVTGWSLAFLAAERYIVICKPFGNFRFGSKHALVAVGLTWMLGLSVALPPFFGWSR 2 1 YIPEGLQCSCGPDWYTVGTKYKSEYYTYFLFVFCFVVPLSIIIFSYGSLLGTLRA 0 0 VAAQQQESASTQKAEREVSRMVIMMVASFCTCYVPYAALAVYMVTNRDHNIDLRFVTVPAFFSKASCVYNPLIYSFMNKQ 0 0 FRACILETVCGKPITDESETSSSRTEVSSVSTTQMIPG* 0 >LWS_homSap Homo sapiens (human) Gt synt(-IRAK1 -MECP2 -TEX28 +TKTL1) 364 aa 530 nm 12853434 NP_000504 cone long OPN1MW deutan 0 MAQQWSLQRLAGRHPQDSYEDSTQSSIFTYTNSNSTR 1 2 GPFEGPNYHIAPRWVYHLTSVWMIFVVIASVFTNGLVLAATMKFKKLRHPLNWILVNLAVADLAETVIASTISVVNQVYGYFVLGHPMCVLEGYTVSLC 1 2 GITGLWSLAIISWERWMVVCKPFGNVRFDAKLAIVGIAFSWIWAAVWTAPPIFGWSR 2 1 YWPHGLKTSCGPDVFSGSSYPGVQSYMIVLMVTCCITPLSIIVLCYLQVWLAIRA 0 0 VAKQQKESESTQKAEKEVTRMVVVMVLAFCFCWGPYAFFACFAAANPGYPFHPLMAALPAFFAKSATIYNPVIYVFMNRQ 0 0 FRNCILQLFGKKVDDGSELSSASKTEVSSVSSVSPA* 0 >LWS_monDom Monodelphis domesticus (opossum) Didelphimorphia exon 1:4 residue insert 0 MTQAWDPAGFLARRRDVNEDDNDETTRSSLFVYTNSNNTR 1 2
GPFEGPNYHIAPRWVYNLTSLWMVFVVIASIFTNGLVLVATMKFKKLRHPLNWILVNLAVADLGETVIASTISVINQIYGYFILGHPLCVLEGYTVSLC 1 2 GITGLWSLAIISWERWVVVCKPFGNVKFDAKLAMVGIIFSWVWAAVWTAPPLFGWSR 2 1 YWPHGLKTSCGPDVFSGSSDPGVQSYMIVLMATCCIFPLSIILLCYVQVWLAIRA 0 0 VAKQQKESESTQKAEKEVSRMVVVMILAYCFCWGPYTLFACFAAANPGYSFHPLTASLPAYFAKSATIYNPIIYVFMNRQ 0 0 FRTCILQLFGKKVDDGSEVSSTSKTEGSSVSSVAPA* 0 >LWS_macEug Macropus eugenii (wallaby) Diprotodontia 0 MTQAWDPAGFLAWRRDENEETTRASLFVYTNSNNTK 1 2 GPFEGPNYHIAPRWVFNLTSLWMIFVVIASIFTNGLVLVATMKFKKLRHPLNWILVNLAVADLGETLIASTISVINQIYGYFILGHPMCVLEGYTVSLC 1 2 GITGLWSLAIISWERWVVVCKPFGNVKFDAKLAMVGIVFSWVWAAVWTAPPLFGWSR 2 1 YWPHGLKTSCGPDVFSGNSDPGVQSYMIVLMSTCCILPLSVIFLCYIQVWLAIRS 2 0 VAKQQKESESTQKAEKEVSRMVVVMILAFCFCWGPYAIFACFAAANPGYAFHPLTASLPAYFAKSATIYNPIIYVFMNRQ 0 0 FRTCILQLFGKKVDDGSEVSSTSRTEVSSVSSVAPA* 0 >LWS_ornAna Ornithorhynchus anatinus (platypus) Gt synt(-IRAK1 -MECP2 - -) 365 aa 17339011 ABN43074 cone long LWS green 0 MTPAWNSGVYAARRRFEDEEDTTRTSVFVYTNSNNTR 1 2 DPFEGPNYHIAPRWAYNVTSLWMIFVVIASVFTNGLVLVATMKFKKLRHPLNWILVNLAVADLGETLIASTISVINQIFGYFILGHPMCVLEGYTVSLC 1 2 GITGLWSLSIISWERWIVVCKPFGNVKFDAKLAMVGIVFSWVWAAVWTAPPIFGWSR 2 1 YWPHGLKTSCGPDVFSGSSDPGVQSYMIVLMSTCCILPLSIIVLCYLQVWLAIRA 0 0 VAKQQKESESTQKAEKEVSRMVVVMILAYCFCWGPYTIFACFAAANPGYAFHPLAAALPAYFAKSATIYNPIIYVFMNRQ 0 0 FRNCIMQLFGKKVDDGSELSSTSRTEVSSVSSVSPA* 0 >LWS_smiCra Sminthopsis crassicaudata (dunnart) Dasyuromorphia 0 MTQAWDPAGFLAWRRDENEETTRASLFVYTNSNNTK 1 2 GPFEGPNYHIAPRWVYNLTSLWMIFVVIASVFTNGLVLVATMKFKKLRHPLNWILVNLAVADLGETIIASTISVINQIYGYFILGHPMCVLEGYTVSLC 1 2 GITGLWSLAIISWERWVVVCKPFGNVKFDAKLAMVGIVFSWVWAAVWTAPPIFGWSR 2 1 YWPHGLKTSCGPDVFSGSSDPGVQSYMIVLMSTCCILPLSIIILCYIQVWLAIRA 0 0 VAKQQKESESTQKAEKEVSRMVMVMILAFCFCWGPYALFACFAAANPGYAFHPLTASLPAYFAKSATIYNPIIYVFMNRQ 0 0 FRTCILQLFGKKVDDGSEVSSTSRTEVSSVSSVAPA* 0 >LWS_galGal Gallus gallus (chicken) Gt 63 aa 12716987 NM_205438 cone long green iodopsin missing in assembly 0 MAAWEAAFAARRRHEEEDTTRDSVFTYTNSNNTR 1 2 
GPFEGPNYHIAPRWVYNLTSLWMIFVVAASVFTNGLVLVATWKFKKLRHPLNWILVNLAVADLGETVIASTISVINQISGYFILGHPMCVVEGYTVSAC 1 2 GITALWSLAIISWERWFVVCKPFGNIKFDGKLAVAGILFSWLWSCAWTAPPIFGWSR 2 1 YWPHGLKTSCGPDVFSGSSDPGVQSYMVVLMVTCCFFPLAIIILCYLQVSLAIRA 0 0 VAAQQKESESTQKAEKEVSRMVVVMIVAYCFCWGPYTFFACFAAANPGYAFHPLAAALPAYFAKSATIYNPIIYVFMNRQ 0 0 FRNCILQLFGKKVDDGSEVSTSRTEVSSVSNSSVSPA* 0 >LWS_anoCar Anolis carolinensis (lizard) Gt synt(- - -TEX28 +TKTL1) 366 aa cone long 0 MAGTVTEAWDVAVFAARRRNDEDDTTRDSLFTYTNSNNTR 1 2 GPFEGPNYHIAPRWVYNITSVWMIFVVIASIFTNGLVLVATAKFKKLRHPLNWILVNLAIADLGETVIASTISVINQISGYFILGHPMCVLEGYTVSTC 1 2 GISALWSLAVISWERWVVVCKPFGNVKFDAKLAVAGIVFSWVWSAVWTAPPVFGWSR 2 1 YWPHGLKTSCGPDVFSGSDDPGVLSYMIVLMITCCFIPLAVILLCYLQVWLAIRA 0 0 VAAQQKESESTQKAEKEVSRMVVVMIIAYCFCWGPYTVFACFAAANPGYAFHPLAAALPAYFAKSATIYNPIIYVFMNRQ 0 0 FRNCIMQLFGKKVDDGSELSSTSRTEVSSVSNSSVSPA* 0 >LWS_xenTro Xenopus tropicalis (frog) Gt synt(-IRAK1 -MECP2 - -) 370 aa cone long 0 MASHWNEAVFAARRRNDDDDTTRSSVFTYTNSNNTR 1 2 GPFEGPNYHIAPRWVYNISSLWMIFVVLASVFTNGLVLVATLKFKKLRHPLNWILVNMAIADLGETVIASTISVCNQIFGYFVLGHPMCILEGYTVSVC 1 2 GIAALWSLTVIAWERWFVVCKPFGNIKFDGKLAATGIIFSWVWAAGWCAPPIFGWSR 2 1 YWPHGLKTSCGPDVFSGSSDPGVQSYMLVLMITCCIIPLAIIVLCYMHVWLTIRQ 0 0 VAQQQKESESTQKAEREVSRMVVVMIIAYIFCWGPYTFFACFAAFNPGYNFHPLAAAMPAYFAKSATIYNPIIYVFMNRQ 0 0 FRNCIYQLFGKKVDDGSEVSSTSRTEVSSVSNSSVSPA* 0 >LWS_neoFor Neoceratodus forsteri (lungfish) Gt 365 aa 17961206 EF526299 cone long 0 MAEPWDAVLAARRRHQDEETTRSTIFVYTNSNNTR 1 2 GPFEGPNYHIAPRWVYNLTSLWMIFVVFASCFTNGLVLMATYKFKKLRHPLNWILVNLAIADLGETLIASTISVTNQIFGYFILGHPMCMLEGFTVATC 1 2 GITGLWSLTIIAWERWVVVCKPFGNIKFDGKWAAGGIIFSWVWSAFWCAMPLFGWSR 2 1 FWPHGLKTSCGPDVFSGEDKYGTRSFMIALMITCCIIPLGVIILCYIQVWWAIRT 0 0 VAKQQKESESTQKAEKEVSRMVVVMIFAYCFCWGPYTFMACFGAAYPGYAFHPLAAALPAYFAKSATIYNPIIYVFMNRQ 0 0 FRNCIYQLLGKKVDDGSELSSTSKTEVSSVSNSSVSPA* 0 >LWS_takRub Takifugu rubripes (fugu) Gt 358 aa cone long 0 MAEEWGKQSFAARRYHEDTTRGSAFVYTNSNHTR 1 2 DPFEGPNYHIAPRWVYNVATVWMFIVVVLSVFTNGLVLVATAKFKKLRHPLNWILVNLAIADLGETVFASTISVCNQFFGYFILGHPMCVFEGYTVSTC 1 2 
GIAALWSLTIISWERWVVVCKPFGNVKFDAKWATGGIVFSWVWAAVWCAPPIFGWSR 2 1 YWPHGLKTSCGPDVFSGSEDPGVQSYMIVLMITCCIIPLAIIILCYLAVWLAIRS 0 0 VAMQQKESESTQKAEKEVSRMVVVMIVAYCVCWGPYTFFACFAAANPGYAFHPLAAAMPAYFAKSATIYNPVIYVFMNRQ 0 0 FRVCIMKLFGKEVDDGSEVSTSKTEVSSVAPA* 0 >LWS_gasAcu Gasterosteus aculeatus (stickleback) Gt synt(- - - -) 358 aa cone long 0 MAEEWGKQAFAARRYNEDTTRGSMFVYTNSNNTK 1 2 DPFEGPNYHIAPRWVYNLSTLWMFIVVALSVFTNGLVLVATAKFKKLQHPLNWILVNLAIADLGETVFASTISVCNQFFGYFILGHPMCVFEGYVVSVC 1 2 GITALWSLTIISWERWIVVCKPFGNVKFDAKWATAGIVFSWIWSAVWCAPPIFGWSR 2 1 YWPHGLKTSCGPDVFSGSEDPGVQSYMIVLMITCCLIPLAIIILCYLAVWLAIRA 0 0 VAMQQKESESTQKAERDVSRMVVVMIVAYIVCWGPYTTFACFAAANPGYAFHPLAAAMPAYFAKSATIYNPVIYVFMNRQ 0 0 FRSCIMQLFGKEVDDGSEVSTsKTEVSSVAPA* 0 >LWS1_calMil Callorhinchus milii (elephantfish) EF565165 0 MTQSWELVAPAARRGFKYDEPTHSGIFVYTNSNQTR 1 2 GPFEGPNYHIAPRWAYNLTSVWMVGVVVASVFTNGLVLVATVRFKKLRHPLNWILVNMALADLGETVLASTVSVANQFFGYFILGHPLCVFEGFVVSLC 1 2 GITALWSLTIIAWERWVVVCKPFGNMKFDSKMAVAGIVFSWVWSAGWCLPPIFGWSR 2 1 YWPHGLKTSCGPDVFSGNEDPGVQSYMVALTLSCAVLPLLIIILCYFQVWWAIRA 0 0 VALQQKESESTQKAEKEVSRMVVVMVAAFCLCWGPYACFAMFSALNPGYAFHPLVASIPSYFAKSSTIYNPIIYVFMNRQ 0 0 FRNCILQLFGKKVDDGSELSSTSKTDVSSVSNSSVSPA* 0 >LWS2_calMil Callorhinchus milii (elephantfish) EF565166 0 MAEPRGSVAFAARRWNDHEGTTVGEFTYTNSNSTR 1 2 DPFEGPNYHIAPRWTYNLTSLWMVVVVILSVFTNGLVLVATWKFKKLRHPLNWILVNLAIADLGETLFASTISICNQVFGYFILGHPMCVFEGFTVSAC 1 2 GITALWSLTIIAWERWVVVCKPFGNVKFDGKWAAFGIIFSWVWSIGWCLPPVFGWSR 2 1 YWPHGLKTSCGPDVFSGSSDPGVKSYMVTLVITCAALPLTIIIVCLYQVWLAIRA 0 0 VAMQQKESESTQKAEKEVSRMVVVMIIAFCFCWGPYTSFAVFSALNPGYSFHPLMAALPAYFAKSSTIYNPIIYVFMNRQ 0 0 FRNCILQLFGKKVDDGSELSSTSKTDVSSVSNSSVSPA* 0 >LWS_petMar Petromyzon marinus (lamprey) Gt 366 aa cone traces key to intron 3 position and gapping 0 MTASWQGAMFAARRRQDDEDTTMESLFRYTNENNTK 1 2 DPFEGPNYHIAPRWVFNLTSVWMIIVVVLSLFSNGLVLVATVKFKKLRHPLNWIIVNLAIADILETIFASTISVCNQVYGYFILGHPMCVFEGYVVSTC 1 2 GIAGLWSLAIISWERWMVVCKPFGNIKFDGKIATILIVFSWVWPASWCSLPIFGWSR 2 1 YWPHGLKTSCGPDVFSGSTDPGVQSYMVVLMITCCFLPLSIIILCYLQVWLAIHS 0 0 
VAQQQKESETTQKAERDVSRMVVVMILAYVFCWGPYTFFACFAAANPGYSFHPIAAALPAYFAKGATIYNPIIYVFMNRQ 0 0 FRNCILQLFGKKVDDGSEVSSSSRTEVSSVSNSSVSPA* 0 >LWS_letJap Lethenteron japonicum (lamprey) Gt 365 aa 15096614 AB116381 cone long 0 MTASWHGAVFAARRRNDDEDTTKDSIFRYTNENNTR 1 2 DPFEGPNYHIAPRWMFNLTSVWMIIVVVLSLFTNGLVLVATMKFKKLRHPLNWILVNLAIADILETIFASTISVCNQVFGYFILGHPMCVFEGYVVSTC 1 2 GIAGLWSLAIISWERWMVVCKPFGNIKFDGKIAIILIVFSWVWPACWCSLPIFGWSR 2 1 YWPHGLKTSCGPDVFSGSSDPGVQSYMVVLMVTCCFLPLSVIILCYLQVWLAIHS 0 0 VAQQQKESETTQKAERDVSRMVVVMILAYIFCWGPYTFFACYAAANPGYAFHPLTAALPAYFAKSATIYNPVIYVFMNRQ 0 0 FRNCIMQLFGKKVDDGSEVSSASRTEVSSVSNSSISPA* >LWS_geoAus Geotria australis (lamprey) Gt 365 aa 560 nm 17463225 AY366491 cone long red retinal 0 MAQSWERAMFAARRRQDEDTTKGDLFRYTNENNTR 1 2 DPFEGPNYHIAPRWMYNLTSFWMIIVVILSLFTNGLVLVATLKFKKLRHPLNWILVNLAIADIGETIFASTVSVVNQIFGYFILGHPLCVFEGFTVSVC 1 2 GITALWSLAIISFERWMVVCKPFGNLKFDGKVAIVLIIFSWAWSAGWCAPPIFGWSR 2 1 YWPHGLKTSCGPDVFSGSTDPGVQSYMVVLMITCCFIPLALIIICYLQVWLAIHT 0 0 VAQQQKESETTQKAERDVSRMVVVMIFAYIFCWGPYTFFACFAAANPGYAFHPLAAALPAYFAKSATIYNPIIYVFMNRQ 0 0 FRNCIMQLFGKKVDDGSEVSSSARTEVSSVSNSSVSPA* 0 >PIN_galGal Gallus gallus (chicken) Gt 352 aa pinopsin pineal non-visual 0 MSSNSSQAPPNGTPGPFDGPQWPYQAPQSTYVGVAVLMGTVVACASVVNGLVIVVSICYKKLRSPLNYILVNLAVADLLVTLCGSSVSLSNNINGFFVFGRRMCELEGFMVSLT 1 2 GIVGLWSLAILALERYVVVCRPLGDFQFQRRHAVSGCAFTWGWALLWSTPPLLGWSSYVPE 1 2 GLRTSCGPNWYTGGSNNNSYILSLFVTCFVLPLSLILFSYTNLLLTLRA 0 0 AAAQQKEADTTQRAEREVTRMVIVMVMAFLLCWLPYSTFALVVATHKGIIIQPVLASLPSYFSKTATVYNPIIYVFMNKQ 0 0 FQSCLLEMLCCGYQPQRTGKASPGTPGPHADVTAAGLRNKVMPAHPV* 0 >PIN_colLiv Columba livia (pigeon) 0 MDPTNSPQEPPHTSTPGPFDGPQWPHQAPRGMYLSVAVLMGIVVISASVVNGLVIVVSIRYKKLRSPLNYILVNLAMADLLVTLCGSSVSFSNNINGFFVFGKRLCELEGFMVSLT 1 2 GIVGLWSLAILALERYVVVCRPLGDFRFQHRHAVTGCAFTWVWSLLWTTPPLLGWSSYVPE 1 2 GLRTSCGPNWYTGGSNNNSYILTLFVTCFVMPLSLILFSYANLLMTLRA 0 0 AAAQQQESDTTQQAERQVTRMVVAMVMAFLICWLPYTTFALVVATNKDIAIQPALASLPSYFSKTATVYNPIIYVFMNKQ 0 0 FQSCLLKMLCCGHHPRGTGRTAPAAPASPTDGLRNKVTPSHPV* 0 >PIN_taeGut Taeniopygia guttata (finch) 0 
MDSTQEPPNSSTPGPFDGPQWPHQAPRATYLVVAVLMGLVVASASLLNGLVIVVSVRHKRLRSPLNYILLNLAVANLLVTLCGSSVSLSNNISGFFVFGERLCQLEGFMVSLT 1 2 GIVGLWSLAILALERYLVVCRPLGDFRFQQQHAASGCAFTWGWSLLWTTPPLLGWSSYVPE 1 2 GLRTSCGPNWYTGGSNNSSYILALFVTCFVMPLSLILFSYTNLLLTLRA 0 0 AAARQQESDTTQQAEREVTRMVVAMVVAFLTCWLPYATFALVVATHKDIVIQPALASLPSYFSKTATVYNPIIYVFMNKQ 0 0 SCLLGMLCCGHHPRGMGKTSPAAPSPQVAAEGLRNKVTPSHPV* 0 >PIN_utaSta Uta stansburiana (lizard) Gt 359 aa 16543463 DQ100321 pinopsin pinopsin missing Anole genome 0 MVNEWSNATPGPFDGPQWPYLAPRSIYTSVAVLMGLVVVSAAFVNGLVIVVSIQYKKLRSPLNYILVNLAIADLLVTSFGSTLSFANNIYGFFVLGQTACEFEGFMVSLT 1 2 GIVGLWSLAILAFERYLVICKPVGDFRFQQRHAVFGCVFTWMWSLVWTLPPLFGWSSYVPE 1 2 GLRTSCGPNWYTGGSGNNSYIMALFVTCFALPLGMIIFSYASLLLTLRA 0 0 VATQQKEVETTQQAEKEVTRRVIAMVMAFLVCWLPYASFAMVVATNKDLVIQPALASLPSYFSKTATVYNPIIYVFMNKQ 0 0 FRSCLLSTMSCGHRPRGAQETTPAMISIPQGPTSALQGSRNKVTPSASEGSGNEAIPS* 0 >PIN_pheMad Phelsuma madagascariensis (gecko) Gt 358 aa no_ref AB022881 pinopsin 0 MHVQMANASQASLKNGTLSPFDGPQWPHRASRRVYTSLAALMGVVVLSASLANGLVIAVSVRFKRLRSPLNYILVNLATADLLVTFFGSIISFVNNAVGFFVFGKTACRFEGFMVSLT 1 2 GIVGLWSLAILAFERYLVICKPVGDFQFQRRHAVIGCLYTWGWSLIWTVPPLFGWSSYVPE 1 2 GLGTSCGPNWYMGGTNNNSYIVALFVTCFALPLSMILFSYANLLLTLRA 0 0 VAAQQKEQETTQRAEKEVTRMVITMVMAFLVCWLPYATFAMVVATTKDLSIQPGLASLPSYFSKTATVYNPIIYVFMNKQ 0 0 FRSCLLNTVSCGRIPQTMPGTPATTAVRGGFVLTSEGRGNKVASTELHS* 0 >PIN_podSic Podarcis sicula (lizard) Gt 354 aa 16688437 DQ013042 pinopsin mRNA 0 MQASNASWVEVRNRTPGPFEGPQWPYLAPQSTYISVAVLMGLVVISATLVNGLVIVVSVQFKKLRSPLNYVLVNLAVADLLVTFFGSTISFVNNAQGFFIFGQATCEFEGFMVSLT 1 2 GIVGLWSLAILAFERYLVICKPVGDFRFPARHAVLGCAFTWGWSFVWTVPPLLGWSSYVPE 1 2 GLRTSCGPNWYSGGSSNNSYIMTLFVTCFAMPLSTILFSYANLLMTLRT 0 0 VAAQQKEQETTQRAEREVTRMVVAMVAAFLVCWLPYASFAMVVATHKDLAIRPALASLPSYFSKTATVYNPIIYVFMNKQ 0 0 FRSCLLYKMSCGHRALSSQDTTPAGISLPGRLTTSASKGSRNQVSPS* 0 >PIN_xenTro Xenopus tropicalis (frog) Gt 346 aa pinopsin 0 MRAGNMSAYEAPGPYDGPQWPHLAPRSTFLTVAAVMCMVVILAFFVNGLVIVVTLKYKKLRSPLNYILVNLAIANLLVTIFGSSVSFSNNVVGYFFMGKTMCEFEGFMVSLT 1 2 
GIVGLWSLAILAFERYLVICKPMGDFRFQQKHAILGCSFTWVWSFIWTSPPLFGWCSYVPE 1 2 GLRTSCGPNWYTGGTNNNSYIMALFLTCFIMPLSTIIFSYSNLLMALRA 0 0 VAAQQKDSETTQRAEKEVTRMVIAMVLAFLICWLPYASFAVVVAVNKDVVIEPTVASLPSYFSKTATVYNPIIYVFMNKQ 0 0 FRNCLMTLLCCGRSFGDDETSSASGRTDVTSVSEAGGNKVTPA* 0 >PIN_bufJap Bufo japonicus (toad) Gt 347 aa 9537517 AF200433 pinopsin classifies oddly 0 MHSANMSALETPGPFEGPQWPHVAPRSTYLTVAVLMGMVVFLAFFVNGMVIVVSLKYKKLRSPLNYILVNLAVADILVTMFGSTVSFHNNIFGFFTLGKLVCELEGFVVSLT 1 2 GIVGLWSLAILAFERYIVICKPMGDFRFQQRHAVMGCAFTWIWAFLWTSPPLIGWCSYVPE 1 2 GLGTSCGPNWYTGGTNNNSYILALFTTCFMMPLTTIIFSYSNLLLALRA 0 0 VAAQQKESETTQRAEREVTRMVIAMVLAFLICWLPYAVFAIVMASNKNVVIDPTLASMPSYFSKTATVYNPVIYVFMNKQ 0 0 FRDCLTKLLCCGRNPFGEDETSTTSGRTDVTSVSEGGGNKVTPA* 0 >PIN_calMil Callorhinchus milii (elephantfish) Gt 093 aa fragment no petMar 0 FGSTVSFSNNINGYFVLGETVCQFEGFMVSLT 1 2 GIVGLWSLAILAFERYIVICKPMGDFRFQQKHAVWGCLFTWLWSLFWTLPPLFGWCSYVPE 1 2 GLRTSCGPNWYTGGANNSSYVVALFITCFTLPLSLIIFSYASLLVVLRA 0 >VAOP_galGal Gallus gallus (chicken) Gt synt(+INPP5A -NXK6 +C10orf61 +ALDH18A1) 393 aa TCTN3 exon 1 genbank error new stop *RKNGDEH... 
0 MDVFRALGNESLLSNSSGPARWDPFHHPLDSIQPWHFRLVAAVMFVVTSLSLAENLAVILVTFKFKQLRQPVNYVIVNLSVADFLVSLTGGTISFLANLKGYFYMGHWACVLEGFAVTFF 1 2 GIVALWSLALLAFERYIVICRPVGNMRLRGKHAAQGIAFVWTFSFIWTIPPTMGWSSYTTSKIGTTCEPNW 2 1 YSGAYNDRSYIIAFFTTCFIVPLLVILVSYGKLLQKLRK 0 0 VSNTQGRLRTARKPERQVTRMVVVMIIAFLICWMPYAVFSILATAYPSIELDPHLAAIPAFFSKTATVYNPIIYVFMNKQ 0 0 FRMCLIQMFKCSAIETAESNMNPTSERATLTQDKRDSQLSVMAVRSTIS* 0 >VAOP_taeGut Taeniopygia guttata (finch) last exon newly truncated 0 MHLEYLKMGHFCPVTLLLPDGDPFHRPLDSIQPWQFKLLAAVMLLVTSLSLAENLAVILVTFKFKQLRQPINYIIVNLSVADFLVSLTGGTISFLTNLKGYFFMGYWACVLEGFAVTFF 2 GIVALWSLALLAFERYIVICRPLRNARLRGKHAALGIVFVWSFSFIWTIPPTTGWSSYTTSKIGTTCEPNW 2 1 YSGAYADHTYIITFFTTCFIVPLLVILVSYGKLVWKLKK 0 0 VSDAQGRLGAARRPERQVTRMVVFMIVAFLICWMPYATFSILVTAYPSIELDPCVAAIPAFFSKTATVYNPVIYVFMNKQ 0 0 FRQCLIQMFSCSAIGTAESNMKLTSERAVLMQGRRGSKRTPMAVHSTVLKRKTGDEHRADDLWLF* 0 >VAOP_anoCar Anolis carolinensis (lizard) Gt synt(+INPP5A -NXK6 +GPR125 +KNDC1) 389 aa vertebrate ancient 0 MAGLRREAENDSWLFDPSSSSAPFDPFLQPLDIIEPWNFHLISALMFVVTLFSLSENFTVILVTIKFKQLRQPLNYVIVNLSVADFLVSLIGGTISFSTNLKGYFYMGHWACVLEGFAVTFF 1 2 GIVALWSLALLAFERYVVICRPLGNMRLNGKHAALGVAFVWIFSFIWTVPPTMGWSSYTTSKIGTTCEPNW 2 1 YSGDYNDHTFIITFFTTCFILPLLVILVSYGKLMRKLRK 0 0 VSDTQGRLGTTRKPERQVTGMVVIMILAFLICWSPYAAFSILVTACPSIELDPRLAAIPAFFSKTATVYNPVIYVFMNNQ 0 0 FRKCLVQLFQCSSQETMDANVNPISEKDTLTHTKHCGEMSTVAAHVIVFNPRSEDEQGSCQSFAQLAISENKVYPL* 0 >VAOP_xenTro Xenopus tropicalis (frog) Gt synt(- +GSTO2 -C10orf92 -) 383 aa vertebrate ancient new 0 MPTNVSLLATPENSTVWNPFTGPLKTIEAWNFHLLAALMFVVTSLSIAENFIVILVTAKFKQLRQPLNYIIVNLSVADFLVSVIGGTISIATNSRGYFYLGSWACVLEGFAVTFF 1 2 GIVALWSLSVLAFERYIVICRPLGNLRLQGKHSALAIIFVWVFSFVWTIPPTMGWSSYTTSKIGTTCEPNW 2 1 YSGEMRDHTYIITFLTTCFVFPLLVIFMSYGKLMRKLRK 0 0 VSDTQGRLGSTRKPEKEVTRMVVIMILAFLICWTPYAAFSILITAHPTIDLDPRLAAIPAFFAKTASMYNPIIYVYMNKQ 0 0 FRRCLYQMFNINDPEAKESNLNPTSERGVLTRNNNGGEMLAIATHITSSAVTNREEEKSSSNSFAHIPVSDNKVCPM* >VAOP_danRer Danio rerio (zebrafish) Gt 378 aa 17067577 NM_131586 vertebrate ancient valop vertebrate assembly missing exon 
3 0 MEASSAAVNAVSPAEDPFSAPLSSIAPWNYSVLAALMFVVTALSLSENFTVMLVTFRFQQLRQPLNYIIVNLSLADFLVSLTGGSISFLTNYHGYFFLGKWACVLEGFAVTFF 1 2 GIVALWSLAVLAFERFFVICRPLGNIRLRGKHAALGLVFVWSFSFIWTVPPVLGWSSYTVSRIGTTCEPNW 2 1 YSGNFHDHTFIITLFSTCFIFPLGVIIVCYCKLIRKLRK 0 0 VSNTHGRLGNARKPERQVTRMVVVMIVAFMVAWTPYAAFSIIITAHPSMHVDPRLAAIPAFVAKTAAVYNPIIYVFMNKQ 0 0 FRKCLVQLLSCSKVTVVEGNNNQTTERAGMTSGSNTGEMSAIAARVSVPKTEENPGDRSTFSHIPIPENKVCPM* >VAOP_takRub Takifugu rubripes (fugu) Gt synt(+INPP5A -NXK6 - +KNDC1) 362 aa vertebrate ancient 0 MESLSLSVNGVSYTVAAELAPTNDPFTGPINNIAQWNFTILAVLMFVVTSLSLCENFLVMFITFKFKQLRQPLNYIIVNLAIADFLVSLTGGLISFLTNARGYFFLGRWACVLEGFAVTYF 1 2 GIVAMWSLAVLSFERFFVICRPLGNMRLQAKHAAIGLLFVWTFSFVWTFPPVLGWNRYTVSKIGTTCEPDW 2 1 YSNNMTSHSYIITFFSTCFILPLGIIFFCYGKLLRKLRK 0 0 VSHGRLATARKPERQVTRMVVVMIVAFMVAWTPYATFAILVTIHPTIELDPRLASIPAFFSKTAAVYNPIIYVFMNKQ 0 0 FRKCLIQHFIGMGVMAESNMNPTSERPGITAESQTGEMSAIAARVPVGATAALHSDGSPTDCGSLAQLPIPENKVCPI* 0 >VAOP_rutRut Rutilus rutilus (minnow) Gt 383 aa 12906786 AY116411 vertebrate ancient vertebrate 0 MELFPVAVNGVSHAEDPFSGPLTFIAPWNYKVLATLMFVVTAASLSENFAVMLVTFRFTQLRKPLNYIIVNLSLADFLVSLTGGTISFLTNYHGYFFLGKWACVLEGFAVTYF 1 2 GIVALWSLAVLAFERFFVICRPLGNIRLRGKHAALGLLFVWTFSFIWTIPPVLGWSSYTVSKIGTTCEPNW 2 1 YSGNFHDHTFIIAFFITCFILPLGVIVVCYCKLIKKLRK 0 0 VSNTHGRLGNARKPERQVTRMVVVMIVAFMVAWTPYAAFSIVVTAHPSIHLDPRLAAAPAFFSKTAAVYNPVIYVFMNKQ 0 0 FRKCLVQLLRCRDVTIIEGNINQTSERQGMTNESHTGEMSTIASRIPKDGSIPEKTQEHPGERRSLAHIPIPENKVCPM* 0 >VAOP_calMil Callorhinchus milii (elephantfish) Gt 080 aa fragment 0 VASTQGRLGVARKPEKQVTRMVIVMILAFLFCWTPYAAFSITVTACPTIKLDPRLAAIPAFFSKTATVYNPIIYVFMNKQ 0 >VAOP_petMar Petromyzon marinus (lamprey) Gt 445 aa 9427550 U90667 vertebrate ancient exons 123 in traces pineal gland-specific 0 MDALQESPPSHHSLPSALPSATGGNGTVATMHNPFERPLEGIAPWNFTMLAALMGTITALSLGENFAVIVVTARFRQLRQPLNYVLVNLAAADLLVSAIGGSVSFFTNIKGYFFLGVHACVLEGFAVTYF 1 2 GVVALWSLALLAFERYFVICRPLGNFRLQSKHAVLGLAVVWVFSLACTLPPVLGWSSYRPSMIGTTCEPNW 2 1 YSGELHDHTFILMFFSTCFIFPLAVIFFSYGKLIQKLKK 0 0 
ASETQRGLESTRRAEQQVTRMVVVMILAFLVCWMPYATFSIVVTACPTIHLDPLLAAVPAFFSKTATVYNPVIYIFMNKQ 0 0 FRDCFVQVLPCKGLKKVSATQTAGAQDTEHTASVNTQSPGNRHNIALAAGSLRFTGAVAPSPATGVVEPTMSAAGSMGAPPNKSTAPCQQQGQQQQQQGTPIPAITHVQPLLTHSESVSKICPV* 0 >PPIN_anoCar Anolis carolinensis (lizard) Gt synt(-CPEB2 -CACNA2D3 +SELK +ACTR8) 346 aa parapinopsin syntenic deleted in chicken 0 MDSLDTNTLSPNASTVRVVLMPRIGYTIIAIIMATSCTLSVILNTAVIAITIKYRQLRQPINYSLVNLAIADLGAALLGGSLNVETNAVGYYNLGRVGCVTEGFAMAFF 1 2 GIVALCTIAVIAVDRAIVIAKPMGTITFTTRKAMIGVAVSWIWSLVWNTPPLFGWGGYQMEGVMTSCAPDWANSDPINVSYIICYFLFCFTIPFITILASYGYLIWTLRQ 0 0 VAKVGLAQRGSTTKAEAQVSRMVIVMVMAFLICWLPYATFALVVVGNPQIYINPIIATIPMYMAKSSTFYNPIIYIFMNKQ 0 0 FRDCLVRCLLCGRNPCASEQTDEDDLEVSTIAPAPSSRRGKVAPV* 0 >PPIN_xenTro Xenopus tropicalis (frog) Gt synt(- - +SELK -) 349 aa parapinopsin bistable UV lamprey pineal broken contigs 0 MADEALLPPMMNVTNEEMHPGKVLMPRIGYTILALIMAVFCAAALFLNVTVIVVTFKYRQLRHPINYSLVNLAIADLGVTVLGGALTVETNAVGYFNLGRVGCVIEGFAVAFF 1 2 GIAALCTIAVIALDRVFVVCKPMGTLTFTPKQALAGIAASWIWSLIWNTPPLFGWGSYELEGVMTSCAPNWYSADPVNMSYIVCYFSFCFAIPFLIIVGSYGYLMWTLRQ 0 0 VAKLGVAEGGTTSKAEVQVSRMVIVMILAFLVCWLPYAAFAMTVVANPGMHIDPIIATVPMYLTKTSTVYNPIIYIFMNKQ 0 0 FQECVIPFLFCGRNPWAAEKSSSMETSISVTSGTPTKRGQVAPA* 0 >PPINa_gasAcu Gasterosteus aculeatus (stickleback) DN691174 DN691173 adult eyes 0 MQPSHTFSNSSAYTGPHGEPPLSRTGFIILSIIMAGFTGPAIVLNATVIIVTLMHKQLRQPLNYALVNMALADLGTAMTGGVLSVVNNAQGYFSLGRSGCVMEGFAVSLF 1 2 GITSLCTVALIAVERMFVVCKPLGQIIFQKKHAVGGIAISWLWSLSWNLPPLFGWGRYELEGVGTSCAPDWHNQDPKNVSYIVAYFAVCFAVPFALILASYTKLMWTLHQ 0 0 VSKMTCLEGGAVAKGEMKVASMVVLMVLTFLISWLPYASLAMLVVYNPKVEIHALVGTVPVYLAKSSTVFNPIIYIYLNKQ 0 0 FRKYAVPFLLCCKEPLDDEEASEAATTVEISPSKVSPA* 0 >PPINa_takRub Takifugu rubripes (fugu) 0 MKPSAFYLNASLYLGPQGEPPLPRSGFIALSVIMALLTGPAIVLNATVIIVSLMHKQLRQPLNYALVNMAVADLGTAMTGGLLSVVNNAQGYFSLGRTGCVLEGFAVSLC 1 2 GIASLCTVALIAVERMFVICKPLGQMQFQKQHALGGIALAWLWSLTWNLPPLFGWGRYELEGVGTSCAPDWHSREPQNVSYVLAYFTVCFAAPFVIILVSYSKLMWTLHK 0 0 VTKMACMEGGAVAKSEMTVAYMVILMVVIFLISWLPYAGLSMLVVLSPDVKIHPLVGTVPVYLAKSSTVYNPIIYIYLNKQ 0 0 
FRKYAVPFLLCGRELEMEDELSMTTVETSNRVSPA* 0 >PPINa_tetNig Tetraodon nigroviridis (pufferfish) 0 MKPSVYLNTSLYLGPPEEPPLPRSGFIVLSILMALVTGPAIVLNATVIIVSLMHKQLRQPLNYALVNMAAADLGTAVSGGLLSVVNNAQGHFSLGRTGCVLEGFAVSLC 1 2 GIASLCTVALIAVERMFVICKPLGQMQFQKQHALAGISLSWLWSLTWNLPPLFGWGRYELEGVGTSCAPDWQSREPHNVSYVLAYFTVCFAAPFAIILVSYAKLMWTLHK 0 0 VTKMTCLEGGAVAKSEMKVAYMVVLMVATFLLSWLPYAGLSMLVVFKPDVEINPLVGTVPVYLAKSSTAYNPIIYIYLNKQ 0 0 FRKYALPFLLCRRALEAEDEVSETTVESSRRVSPS* 0 >PPIN_danRer Danio rerio (zebrafish) Gt synt(- - +SELK -) 338 aa no_ref XM_681591 parapinopsin parapinopsin 0 MESETSTAASGSIAEVMPRMGYTILAVIIGVFSVCGVILNVTVITVTLKYKQLRQPLNFALVNLAVADLGCAVFGGLPTVVTNAMGYFSLGRVGCVLEGFAVAFF 1 2 GIAALCSVAVIALERCMVVCRPVGSISFQTRHAVFGVAVSWLWSFIWNTPPLFGWGRLQLEGVRTSCAPDWYSRDLANVSFIVCYFLLCFALPFSVIVYSYTRLLWTLRQ 0 0 VSRLQVCEGGSAARAEAQVSCMVVVMILAFLLTWLPYASFALCVILIPELYIDPVIATVPMYLTKSSTVFNPIIYIFMNRQ 0 0 FRDRALPFLLCGRNPWAAEAEEEEEETTVSSVSRSTSVSPA* 0 >PPIN_ictPun Ictalurus punctatus (catfish) Gt 347 aa parapinopsin parapinopsin index sequence 0 MASIILINFSETDTLHLGSVNDHIMPRIGYTILSIIMALSSTFGIILNMVVIIVTVRYKQLRQPLNYALVNLAVADLGCPVFGGLLTAVTNAMGYFSLGRVGCVLEGFAVAFF 1 2 GIAGLCSVAVIAVDRYMVVCRPLGAVMFQTKHALAGVVFSWVWSFIWNTPPLFGWGSYQLEGVMTSCAPNWYRRDPVNVSYILCYFMLCFALPFATIIFSYMHLLHTLWQ 0 0 VAKLQVADSGSTAKVEVQVARMVVIMVMAFLLTWLPYAAFALTVIIDSNIYINPVIGTIPAYLAKSSTVFNPIIYIFMNRQ 0 0 FRDYALPCLLCGKNPWAAKEGRDSDTNTLTTTVSKNTSVSPL* 0 >PPIN_oncMyk Oncorhynchus mykiss (trout) Gt 347 aa parapinopsin 0 MDHQQLLPNLHGNISSSPGSVSEALLSRTGFTILAVIIGVFSVSGVCMNVLVIMVTMRHRKLRQPLNYALVNLAVADLGCALFGGLPTMVTNAMGYFSMGRLGCVLEGFAVAFF 1 2 GIAGLCSVAVIAVDRYVVVCRPMGAVMFQTRHAVGGVVLSWVWSFLWNTPPLFGWGSFELEGVRTSCSPNWYSREPGNMSYIILYFLLCFAIPFSIIMVSYARILFTLHQ 0 0 VSKLKVLEGNSTTRVEIQVVRMVVVMVMAFLLSWLPYAAFALSVILDPSLHINPLIATVPMYLAKSSTVYNPIIYVFMNRQ 0 0 FRDCAVPFLLCGLNPWASEPVGSEADTALSSVSKNPRVSPQ* >PPINb_takRub Takifugu rubripes (fugu) synt(+PCTK1 -PPIN +DDX20 -KCND2) absent medaka, zfish 0 
MEQGIQEGNSSSLSSVSSGTLSRTGYTVLAFIMGVLSAGGIILNVLVIVVTMKHRQLRQPLSYALVNLAICDLGCALFGGIPTTITSAMGYFSLGRVGCVLEGFAVAFF 1 2 GIASLCTIGVISVERYVVVSNPMGAVLFQTR 2 1 HAVAGVVFSWVWSFVWNTPPLFGWGSFDLEGVRTSCAPNWYSRDVGIMSYIVIYLLFCFAVPFTIITVSYSRLLWTLRQ 0 0 VTGLQVAEGGSTNRVEVQVARMVVVMVLAFLLTWLPYAAMALAVVMDSTLYINPIIATIPVYLAKSSTVYNPIIYIFMNRQ 0 0 FRGCAINTVLCGRRAWITDLQTSEGETTVASTSKSQKISPKGSLN* 0 >PPINb_tetNig Tetraodon nigroviridis (pufferfish) synt(-PPIN +DDX20) 0 MEQEIQDGNSSSLNSVSPGILSQTGFTVLAFIMGMLSVGGIILNVLVIVVTLKHRQLRQPLNYALVNLAICDLGCALFGGIPTTVTSAMGYFSLGRLGCVLEGFAVAFF 1 2 GIASLCTIGIISVERYIVVSNPMGAVLFQTR 2 1 HAVAGVVFSWVWSFVWNTPPLFGWGSFELEGVGTSCAPNWYSRDMGNMSYIIIYLLFCFAVPFSIIMVSYSRLLWTLHQ 0 0 VTKLHVAEGGSTNRVEVHVARMVVLMVLAFLLTWLPYASMALAVVMDSTLYIDPVTATIPVYLAKSSTVYNPIIYIFMNRQ 0 0 FRGYAINTILCGRRAWVSEQQTSEGETTVVSVSKSQKISPKGSLQ* 0 >PPINb_gasAcu Gasterosteus aculeatus (stickleback) synt(+DNMT2 -PPIN +DDX20 -TAZ) DW621258 21 day old larvae 0 MERDGNASADAQLLSPSGYAALAAVMGAFSVAGILLNALVIVVTARHRQLRQPLSYALVNLAVCDLGCAACGGLPTTVTSAMGYFSLGRAGCVLEAFAVAFF 1 2 GIASLCTIGVISVERYVVVCYPMGAVLFQTR 1 2 HAVAGVVLSWVWSFVWNTPPLFGWGSYELEGVKISCAPNWYSRDPGNVSYVTIYFLLCFAVPFSVIMVSYSRLLWTLRQ 0 0 VTKLQVSETGSTNRVEVQVARMIVVMVLAFLVTWLPYAAMALAVITDSTLHIDPVIATIPVYLAKSSTVYNPIIYIFMNRQ 0 0 FRGYAVPSILCGWNPWAEEQTSEEETVGSVMKSQRVSPKGSLQE* 0 >PPINb_mayZeb Maylandia zebra (cichlid) wgs frag 0 1 2 GIASLCTVAVISVERYVVVCYPMGAVLFQTR 2 1 HAVAGVVLSWVWSFVWSTPPLFGWGTFELEGVKTSCAPKWHSRDVGDMSYMIIFFFLCFALPFSVVMVSYSRLLWTLRQ 0 0 GMIVVMVLAFLVTWLPYAALALAVMIDSSLYVDPVIATIPVYFAKSSTVYNPIIYIFMNRQ 0 0 FRGYTVAAVLCGWDPWSSEPQTSENETTVPFFIKTPKKIVPKKSLE* 0 >PPIN_calMil Callorhinchus milii (elephantfish) Gt 109 aa fragment 0 MDPHNRSANLSEGPGLGGGGAVPGWGPSVRAPLSLVMAVISLSSIVLNSLAIAVVLRFQVLQQPLNYALLSLASADLGTAATGGVLSTVCTALGSFVLGRHSCVAEGFF 1 >PPINa_petMar Petromyzon marinus (lamprey) Gt 344 aa parapinopsin bistable pineal UV/green 0 MENLTSLDLLPNGEVPLMPRYGFTILAVIMAVFTLASLVLNSTVIIVTLRHRQLRHPLNFSLVNLAVADLGVTVFGASLVVETNAVGYFNLGRVGCVIEGFAVAFF 1 2
GIAALCTIAVIAVDRFVVVCKPLGTLMFTRRHALLGITWAWLWSFVWNTPPLFGWGSYKLEGVRTSCAPDWYSRDPANVSYIVSFFSFCFAIPFLVIVVAYGRLLWTLHQ 0 0 VAKLGMGESGSTAKAEAQVSRMVVVMVVAFLVCWLPYALFAMIVVAKPGVYIDPVIATLPMYLTKTSTVYNPIIYIFMNRQ 0 0 FRDCAVPFLLCGRNPWAEPSSESATTASTSATSVTLASVPGQVSPS* 0 >PPIN_letJap Lethenteron japonicum (lamprey) Gt 344 aa 14981504 AB116380 parapinopsin bistable pineal UV/green 0 MENLTSLDLLPNGEVPLMPRYGFTILAVIMAVFTIASLVLNSTVVIVTLRHRQLRHPLNFSLVNLAVADLGVTVFGASLVVETNAVGYFNLGRVGCVIEGFAVAFF 1 2 GIAALCTIAVIAVDRFVVVCKPLGTLMFTRRHALLGIAWAWLWSFVWNTPPLFGWGSYELEGVRTSCAPDWYSRDPANVSYITSYFAFCFAIPFLVIVVAYGRLMWTLHQ 0 0 VAKLGMGESGSTAKAEAQVSRMVVVMVVAFLVCWLPYALFAMIVVTKPDVYIDPVIATLPMYLTKTSTVYNPIIYIFMNRQ 0 0 FRDCAVPFLLCGRNPWAEPSSESATAASTSATSVTLASAPGQVSPS* 0 >PPINb_petMar Petromyzon marinus (lamprey) odd genomic frag 2 GMTALITVCVLAVERYVVVCKPLGGVHFGTQHGLCGVAISWTWALAWSAPPLFGWGRYHYEGVGTSCAPDWADSSPSGRSYMTTYFIFCFALPMIIILFCYTKLMIAIHK 0 0 VSKLGLSANDTAERKVGIMVVVMVFAFFLCWLPYAALAIAVVIKPDLK 0 0 VSPVTASIPVYLAKSSGAYNPIIYIFMHRQ 0 >PPINa_cioInt Ciona intestinalis (tunicate) Gt synt(-HOXB1 +HHEX +CUL4A -) 391 aa 11591373 NM_001032555 parapinopsin Ci-opsin1 odd exons larval ocellus 0 MDHDVTPTVDLTDGVPQCKDLNPYVLKGDGWVPQHISRANRSTYSFLCVYMTFVFLLSCSLNILVIVATLKNK 0 0 VLRQPLNYIIVNLAVVDLLSGFVGGFISIAANGAGYFFWGKTMCQIEGYFVSNF 1 2 GVTGLLSIAVMAFERYFVICKPFGPVRFEEKHSIF 1 2 GIVITWVWSMFWNTPPLIFWDGYDTEGLGTSCAPNWFVKEKRERLFIILYFVFCFVIPLAVIMICYGKLILTLRQ 0 0 IAKESSLSGGTSPEGEVTKMVVVMVTAFVFCWLPYAAFAMYNVVNPEAQ 0 0 IDYALGAAPAFFAKTATIYNPLIYIGLNRQ 0 0 FRDCVVRMIFNGRNPWVDELVGSQVSSTGSQLTAVSSNKVAPA* 0 >PPINb_cioInt Ciona intestinalis (tunicate) Gt synt(-TMEM165 +FUT4 - -) 353 aa parapinopsin jgi gene model wrong both ends 0 MTTAETTTECYEKNPYIRNEMGWVPKHILIAERHIYTILAVYMTFIFLLAVSLNGFVIIATMKNK 0 0 KLRQPLNYIIINLSIADFLSGLVGGFIGMISNSAGYFYFGKTVCILEGYIVSVA 1 2 GVCGLMSISVMAFERYFVVCKPYGPFTLTNTHAAL 1 2 GIGFTWTWSVLWSTPGLIWLDGYVPEGLGTSCAPNWFSKNK 2 1 SERIFIFVYFVFCFFIPLLVIIICYGKIVLFLKQ 0 0 ATRQSSASSNRQADNKVTKMVLVMISAFLICWTPYGVLSLYNAINPDKQ 0 0 LDYGLGAVPVFFAKTANIYNPLIYIGLNKQ 
0 0 FRDGVIKMVFRGRNPWAEEMSTQQRQRSTEAGQPIVSNEV* 0 >PPINa_cioSav Ciona savignyi (tunicate) 88% 0 MPTEASIAVDVSPTMGIPQCKDINPYVLKGDGWVPQHISRADRSVYSFLAVYMTFICLISCSLNILVITATLKNK 0 0 VLRQPLNYIIVNLAVVDLLSGLVGGVISIFANGAGYFFWGKFMCQVEGYTVSNF 1 2 GVTGLLSIAVMAFERYFVICKPFGPVRFEEKHAVI 1 2 GIAVTWIWAMFWNTPPLIFWDGYDTEGLGTSCAPNWFVKGNTERLFIILYFVFCFLIPLAIIVLCYGKLILQLRQ 0 0 IAKESSLSGGTSPEGEVTKMVVVMVTAFVICWLPYAAFAMYNVVNPEAQ 0 0 IDYALGAAPAFFAKTATIYNPLIYIGLNRQ 0 0 FRDCVVRMIFNGRNPWVDEMVGSQVSSSASQMTAVSSNKVAPA* 0 >PPINb_cioSav Ciona savignyi (tunicate) 59% 0 MSSIPQNYSNGNPYATTDSGWVPEHIEIANRSTYSGLCVFMSFVFVLAVPLNLLVIVATYKNK 0 0 DLRRPINYIIVNLAVADLTCSVVGGLLGVLNNGAGYYFLGKSVCIFEGYVMSVT 1 2 GVCGILSITVMAFERYFVVCKPFGQTNLKWSHAIT 1 2 GIVFTWTWSVIWHTPGLFFWNGYEPEGFGTSCAPNWFSQQK 2 1 SERIFIFAYFAFCFLTPLTIIFACYLKLILFIRK 0 0 VSKKSMVNEADRRDFEVTRMVFVMIAAFLICWLPYGCLSMYNAIHPDNL 0 0 LSYGIGSVPAFFAKTATIYNPIIYMGLNKK 0 0 FRDGVIRMLFKGRNPWLDGRNTTSSTSTRAQGSLINREVDI* 0 >PARIE_utaSta Uta stansburiana (lizard) Gd+Go 347 aa 522 nm 16543463 DQ100320 parietopsin shift in counterion Gt + Go 0 MENDSSLATELAEGAIVKPTIFPKAGYGVLAFLMFLNALFSIFNNSLVIAVTLKNPQLRNPINIFILNLSFSDLMMSLCGTTIVIATNYYGYFYLGRKFCIFQGFAVNYF 1 2 GIVSLWSLTILAYERYNVVCQPLGTLQMSTKRGYQLLGFIWVFCLFWAVVPLFGWSSYGPEGVQTSCSIGWEERSWSNYSYLIVYFLSCFFIPVLIIGFSYGNVIRSLHG 0 0 LNKKVEQLGGKSSPEEEFRAVIMVLVMVVAFLICWLPYTVFALIVVFNPALNISPLAATIPTYLSKTSPVYNPIIYIFLNKQ 0 0 FRDCAVEFITCGQVVLTSPEEDISTSAIPVEGKGPCKINQVTPV* 0 >PARIE_anoCar Anolis carolinensis (lizard) Gd+Go +EEA1 -FLJ46688 +BTG1 - 347 aa parietopsin Go like scallop, gusducin not transducin 0 MENESSLVLEGAEGYIVRPTIFPRAGYGVLAFLMFINALFSLFNNFLVIAVTLKNPQLRNPINIFILNLSFSDLMMSICGTTIVIATNYHGYFYLGRRFCIFQGFAVNYF 1 2 GIVSLWSLTILAYERYNVVCQPLGTLQMSTQRAYQLLGFIWVFCLFWAVVPLFGWSSYGPEGVQTSCSIGWEERSWNNYSYLIVYFLSCFFIPVLIIGFSYGNVIRSLHG 0 0 LNKKVEQLGGKSNPEEEFRAVIMVLVMVVAFLICWLPYTLFALTVVFNPALNISPLAATIPTYLSKTSPVYNPIIYIFLNKE 0 0 FRECAVEFITCGKVVLTSPEEDISTSAISDEGIAPCKINQVTPV* 0 >PARIE_xenTro Xenopus tropicalis (frog) Gd+Go -lum -DCN 16543463 DN070761 lung parietopsin 0 
MDGNSTTPGIAVNLTVMPTIFPRSGYSILSFLMFLNAVFSICNNAIVILVTLKHPQLRNPINIFILNLSFSDLMMALCGTTIVVSTNYHGYFYLGKQFCIFQGFAVNYF 1 2 GIVSLWSLTLLAYERYNVVCEPIGALKLSTKRGYQGLVFIWLFCLFWAIAPLFGWSSYGPEGVQTSCSIGWEERSWSNYSYIISYFLTCFIIPVGIIGFSYGSILRSLHQ 0 0 LNRKIEQQGGKTNPREEKRVVIMVLFMVLAFLICWLPYTVFALIVVINPQLYISPLAATLPTYFAKTSPVYNPIIYIFLNKQ 0 0 FRTYAVQCLTCGHINLDSLEEDTESVSAQAENMLTPKTNQVAPA* 0 >PARIE_takRub Takifugu rubripes (fugu) Gd+Go synt(-HSP90B1 +NT5DC2 -KCND3 -FLNC) 351 aa 16543463 genome parietopsin 0 MDSNSTPWSSPPAPLQAEAVTVAPTIFPRVGYSILSFLMFINTVLSVFNNSLAIAVMLKNPSLLQPINIFILSLAVSDLMIGLCGSLVVTITNYHGSFFIGHTACVFQGFAVNYF 1 2 GLVSLCTLTLLAYERYNVVCKPRAGLKLTMRRSIIGLLFVWTFCLFWAVTPLLGWSSYGPEGVQTSCSLAWEERSWNNYSYLILYTLLCFIFPVGVIIYCYCKVLTSMNK 0 0 LNKSVELQGGLSCRRENKHAINMVLAMIIAFFVCWLPYTALSVVVVVDPELHIPPLVATMPMYFAKTSPVYNPIIYFLSNKQ 0 0 FRDATLEVLSCSRYIPHASSRVSINMRSLNRRSVNTHSKVSPL* 0 >PARIE_tetNig Tetraodon nigroviridis (pufferfish) genomic frameshift error 0 MDSASTPWSPHPASGQAEAVTAAPTIFPRVGYSILSFLMFINTVLSIFNNGLAITVMLKNPALLQPINIFILSLAVSDLMIGLCGSLVVTITNYQGSFFIGHTACVFQGFAVNYF 1 2 GLVSLCTLTLLAYERYNVVCKPRAGLKLNMRRSLVGLLFVWTFCLFWAVTPLLGWSSYGPEGVQTSCSLAWEERSWNNYSYLILYTLLCFILPVGVIIYCYTKVLTSMNK 0 0 LNKSVELQGGRSCQKENDHAISMVLAMIIAFFVCWLPYTALSVVVVVDPELRIPPLVATMPMYFAKTSPVYNPIIYFLSNKQ 0 0 FRDATLEVLSCGRYIPHASTRVTFNMCAFNRRSRLPSLSRSINTHSKVSPL* 0 >PARIE_gasAcu Gasterosteus aculeatus (stickleback) Gd+Go synt(-HSP90B1 +NT5DC2 -KCND3 -FLNC) 361 parietopsin 0 MDSNSTLWSSGSPPPSIHGKMLTITPTIFPRVGYSILSFLMFINTVLTVFNNVLVITVLVRNPSLLQPMNVFILSLAVSDLMIGLCGSLVVTITNYHGSFFIGHTACIFQGFAVNYF 1 2 GLVSLCTLTLLSYERYNVVCRPRNALKLSMRRSIHGLLIVWTFCLFWAVAPLFGWSGYGPEGVQTSCSLAWEERSWSNYSYLVLYTLLCFIVPVAVIIYCYAKVLTSMNT 0 0 LNRSVEVQGGRSSQKENDHAVSMVLAMIIAFFSCWLPYTALSVVVVVDPTLYIPPLVATMPMYFAKTSPVYNPIIYFLSNKQ 0 0 FRDAALEMLSCGRYIAHMPNTVSINMRSLNRRSRLSSLSRNVNSHSKVLPL* 0 >PARIE_danRer Danio rerio (zebrafish) Gd+Go - +NT5DC2 +FBXL13 - 337 aa 16543463 genome parietopsin 0 
MENFAKTELTMMVQPTIFPRVGYSILSYLMFINTTLSVFNNVLVIAVMVKNLHFLNAMTVIIFSLAVSDLLIATCGSAIVTVTNYEGSFFLGDAFCVFQGFAVNYF 1 2 GLVSLCTLTLLAYERYNVVCKPMAGFKLNVGRSCQGLLLVWLYCLFWAVAPLLGWSSYGPEGVQTSCSLGWEERSWRNYSYLILYTLMCFILPTVIITYCYSNVLLTMRK 0 0 INKSIECQGGKNCAEDNEHAVRMVLAMIIAFFICWLPYTAISVLVVVNPEISIPPLIATMPMYFAKTSPVYNPIIYFLTNKR 0 0 FRESSLEVLSCGRYISRETGGPLMGSSMQRGQSRVNPV* 0 >PARIE_petMar Petromyzon marinus (lamprey) Gd+Go 082 aa fragment 0 LNKKIKRVGGHPDPREEMRATVMVLAMVGAFLACWLPYTVLALCVVLAPGTQIPPLVATLPMYFAKTSPIYNPIIYFFLNRQ 0 >ENCEPH_homSap Homo sapiens (human) Gt synt(-EXO1 -WDR64 -KMO +FH) 403 aa 12242008 NM_014322 parietopsin OPN3 with intron loss 0 MYSGNRSGGHGYWDGGGAAGAEGPAPAGTLSPAPLFSPGTYERLALLLGSIGLLGVGNNLLVLVLYYKFQRLRTPTHLLLVNISLSDLLVSLFGVTFTFVSCLRNGWVWDTVGCVWDGFSGSLF 1 2 GIVSIATLTVLAYERYIRVVHARVINFSWAWRAITYIWLYSLAWAGAPLLGWNRYILDVHGLGCTVDWKSKDANDSSFVLFLFLGCLVVPLGVIAHCYGHILYSIRM 0 0 LRCVEDLQTIQVIKILKYEKKLAKMCFLMIFTFLVCWMPYIVICFLVVNGHGHLVTPTISIVSYLFAKSNTVYNPVIYVFMIRK 0 0 FRRSLLQLLCLRLLRCQRPAKDLPAAGSEMQIRPIVMSQKDGDRPKKKVTFNSSSIIFIITSDESLSVDDSDKTNGSKVDVIQVRPL* 0 >ENCEPH_otoGar Otolemur garnettii (lemur) full 0 MYSGNRSGGQGFWEGGGAAGAEEPTPEGTLSPAPLFSPSAYERLALLLGSIGLLGVANNLLVLVLYYKFPRLRTPTHLFLVNISLSDLLVSLFGVTFTFVSCLRNGWVWDTVGCVWDGFSGSLF 1 2 GIVSIATLTVLAYERYIRVVHARVINFSWAWRAITYIWLYSLAWAGAPLLGWNRYILDVHGLGCTVDWKSKDANDSSFVLFLFLGCLVVPVGVVAHCYGHILYSIRM 0 0 LRCVEDLQTTQVIKILKYEKKVAKMCFFMIFTFLVCWMPLIVICFLVVNGQGHLVTPTVSIVSYLLAKSNTVYNPVIYIFMLRK 0 0 FRRSLLQLLCFRLLRCQRPAKDLPAAESEMQIRPIVMSQKDGDRPKKKVTFNSSSIIFIITSDESLSVDNSDKTNGSKVDVIQVRPL* 0 >ENCEPH_musMus Mus musculus (mouse) Opn3 Panopsin NM_010098 2aa del full 0 MYSGNRSGDQGYWEDGAGAEGAAPAGTRSPAPLFSPTAYERLALLLGCLALLGVGGNLLVLLLYSKFPRLRTPTHLFLVNLSLGDLLVSLFGVTFTFASCLRNGWVWDAVGCAWDGFSGSLF 1 2 GFVSITTLTVLAYERYIRVVHARVINFSWAWRAITYIWLYSLAWAGAPLLGWNRYILDIHGLGCTVDWRSKDANDSSFVLFLFLGCLVVPVGIIAHCYGHILYSVRM 0 0 LRCVEDLQTIQVIKMLRYEKKVAKMCFLMAFVFLTCWMPYIVTRFLVVNGYGHLVTPTVSIVSYLFAKSSTVYNPVIYIFMNRK 0 0 
FRRSLLQLLCFRLLRCQRPAKNLPAAESEMHIRPIVMSQKDGDRPKKKVTFNSSSIIFIITSDESLSVEDSDRSSASKVDVIQVRPL* 0 >ENCEPH_canFam Canis familiaris (dog) XP_854433 full 0 MMRRVKLTLIPAAVLDIESQAPKDESLYFSICHFCPQKGFLEFQRLRTPTHLLLVNLSLSDLLVSLFGVTFTFVSCLRNGWVWDSVGCVWDGFSSSLF 1 2 GIVSITTLTVLAYERYIRVVHARVINFSWAWRAITYIWLYSLAWSGAPLLGWNRYILDVHGLGCTVDWKSKDANDSFFVLFLFLGCLVVPMGVIVHCYGHILYSIRM 0 0 LRCVEDLQTIQVIKILRYEKKVAKMCFLMIFIFLIFWMPYIVICFLVVNGYGHLVTPTVSIVSYLFAKSSTVYNPVIYIIMIRK 0 0 FRRSLLQLLCFRPLRCQRPAKDLPANGSEMQIRPIVMSQKDGDRPKKKVTFNSSSIIFIITSDESVSIDDSDKTSVSKVDVIQVRPL* 0 >ENCEPH_pteVam Pteropus vampyrus (macrobat) 86%=homSap full 0 MHSGNRSGGLDSWEGGGAAGAEGPGLAGTLSPGSVFNPSTYERLALLLGSIGLLGVANNLLVLVFYYKFQQVRTPFYLFLVNISFSDLLVSFFGVTFTFVSCLRNGWVWDTVGCVWDGFSSSLF 1 2 GTVSMTTLTVLAYERYIRVVQARAIDFSWAWRTITYIWLYSLGWSGAPLLGWNRYILDVHGLGCAVDWKSKDANDSSFVLFLFLGCLVVPVVVIAHCYGHILYSVQM 0 0 LRCVEDLQTIQVIKILRYEKKMAKMCFLMIFTFLISWMPYIVICFLVVNGYGHLVTPTVSIVSYLFAKSSTVYNPVIYIFMIRK 0 0 FRRFVLQLLCFRPLRCRRPATDLPAGGSEMQIRPIVMSQKDGDRPKKKVTFNSSSIIFVITSDESLSVDDSDKINGSKADGIQVRPL* 0 >ENCEPH_loxAfr Loxodonta africana (elephant) 2 exons in browser, 1 2x full 0 MYSGNRSGGQDLWEGGGGSGGAGPAGTLSPAPVFRSGTYERLALLVGSIGLLGVGNNLLVLVLYYKFQRLRTPTHLFLVNISLSDLLVSLFGVTFTFVSCLRNGWVWDTVGCVWDGFSSSLF 1 2 GIASITTLTVLAYERYIRVVHARVINFSWAWRAITYIWLYSLAWSGAPLLGWNRYILDTHGLACTVDWKSNNSSDSSFVLFLFLGCLVVPVGVIAHCYGHILYSIRM 0 0 LRCVEDLQTIQVIKILRHEKKLAKMCLFMIFTFLICWMPYIVICFLVVNGYGHLVTPTISIVSYLFAKSSTVYNPVIYTFMIRK 0 0 FRRSLLQLLCFRLLRCQRPAKDLPVVGSEMQIRPIVMSQKDGDRPKKKVTFNSSSIIFIITSDESLSVNNIDKTNGSKADVIQIRPL* 0 >ENCEPH_monDom Monodelphis domestica (opossum) -EXO1 synt(-WDR64 -KMO +FH) 411 aa 0 MYSDNSSDDGGGGYWGSGRAGGASGTGVTGEPGPEGSPRQAPLFSPGTYELLALLIATIGLLGLCNNLLVLVLYYKFQRLRTPTHLFLVNISFNDLLVSLFGVTFTFVSCLRSGWVWDSVGCAWDGFSNTLF 1 2 GIVSIMTLTVLAYERYNRIVHAKVINFSWAWRAITYIWLYSLVWTGAPLLGWNRYTLEIHGLGCSVDWKSKDPNDSSFVIFLFFGCLMLPVGVMAYCYGHILYAIRM 0 LRCVEELQTIQVIKILRYEKKVAKMCFLMIAIFLFCWMPYAVICLLVANGYGSLVTPTVAIIASLFAKSSTAYNPIIYIFMSRK 0 0 
FRRCLLQLLCFRLLKFQQPKKDRPVIRTEKQIRPIVMSQKVGDRPKKKVTFSSSSIIFIITSDETQMIDENDKNSGTKVNVIQVRPL* 0 >ENCEPH_galGal Gallus gallus (chicken) Gt synt(-EXO1 -WDR64 -PIGM +RGS7) 396 aa encephalopsin OPN3 0 MHSGNGTGATSRPQLAAAGHEVPGERPLFSAGTYELLALLIATIGTLGVCNNLLVLVLYYKFKRLRTPTNLFLVNISLSDLLVSVCGVSLTFMSCLRSRWVWDAAGCVWDGFSNSLF 1 2 GIVSIMTLTVLAYERYIRVVHAKVIDFSWSWRAITYIWLYSLAWTGAPLLGWNRYTLEIHGLGCSMDWKSKDPNDTSFVLLFFLGCLVAPVVIMAYCYGHILYAVRM 0 0 LRCVEDFQTSQVIKLLKYEKKVAKMCFLMISTFLICWMPYAVVSLLVTYGYSNLVTPTVAIIPSFFAKSSTAYNPVIYIFMSRK 0 0 FRQCLLQLLCFRLMRFQRIMKEPSGAGNVKPIRPIVMSQKVGDRPKKKVTFSSSSIIFIIASDDTQQIDDNSKHNGTKVNVIQVKPL* 0 >ENCEPH_anoCar Anolis carolinensis (lizard) Gt synt(-EXO1 -WDR64 -PIGM +RGS7) 408 aa encephalopsin OPN3 0 MFSANGTRSGAGSDLEPGPGQQQQQREASEEEERGAGLSPFSAGTYELLALLVAAIGLLGLCNNLLVLVLYAKFKRLRTPTHLFLVNISLSDLLVSLFGVSFTFGSCLRHRWVWDAAGCVWDGFSNSLF 1 2 GIVSIMTLTVLAYERYIRVVHARVIDFSWSWRAITYIWLYSLAWTGAPLLGWNHYTLEIHGLGCSVDWQSKEPSDSSFVLFFFLGCLAAPVGIMAYCYGHILHAIRM 0 0 LRCVEDLQSIQVIKILRYEKKVAKMCFLMVTTFLICWMPYAVVSLLIAYGYGHLITPTVAIIPSFFAKSSTAYNPVIYIFMSRK 0 0 FRRCLVQLFCVQFLRFKRTLKEQPAIESNKPIRPIVMSQKVGDRPKKKVTFSSSSIIFIITSDDTEQIDVSTKCSDTKINVIQVKPL* 0 >ENCEPH_xenTro Xenopus tropicalis (frog) 45%=homSap recent pseudogene, I for Schiff K, loss of C-terminal conserved residues 0 MPVTNGSHNNSISWLHSKDMFTEDTYHFLALIVATVGFLGLVNNLLVLILYCKFKRLQTPTNLLFFNTSLCHFVFSLLAITFTFMSCVRGSWAFSVEMCVFHGFSKNLL 1 2 GIVSFGTLTVVAYERYARVVYGKYVNSSWSKRSITFVWVYSLAWTGFPLIGWNLYTFETHKLDCSFEWTATDPKDTAFVLLFFLACITLPLSIMAYCYGYILYEIQK 0 0 LRSVKNIQNFQEITILDYEIKMAKMCLLMMLTFLIGWMPYTILSLLVTSGYSKFITPTITVMPSLLAIASAAYNPVIHIFTIKK 0 0 FRQCLVQLLFHNFWRLLKNLNGRLAMKKVKPVLGKGRSHNRPEKKVFSSSDFFTRTTSDTGTHGITESTKGKRTNVRLIQVHPLYP* 0 >ENCEPH_danRer Danio rerio (zebrafish) NM_001111164 mrna 61%=homSap full 0 MNSFNETPTEAHLENYNYIFADETYKLLTFTIGSIGVLGFCNNIIVIILYSRYKRLRTPTNLLIVNISVSDLLVSLTGVNFTFVSCVKRRWVFNSATCVWDGFSNSLF 1 2 GIVSIMTLSGLAYERYIRVVHAKVVDFPWAWRAITHIWLYSLAWTGAPLLGWNRYTLEVHQLGCSLDWASKDPNDASFILFFLLGCFFVPVGVMVYCYGNILYTVKM 0 0 
LRSIQDLQTVQTIKILRYEKKVAVMFLMMISCFLVCWTPYAVVSMLEAFGKKSVVSPTVAIIPSLFAKSSTAYNPVIYAFMSRK 0 0 FRRCMLQMLCSRLTSLQHTIKDRPLSRIEHPIRPIVMSQSRTDRPKKRVTFSSSSIVFIIASHDTHPLDITSKCNDEPDINVIQVRPL* 0 >ENCEPH_takRub Takifugu rubripes (fugu) homSap=61% full 0 MNPANGSRSERSAEQLLFSGDTYRVLAFTIGTIGAFGFCNNFVVLALYCRFKRLRTPTNLLLVNISLSDLLVSLFGINFTFAACVQGRWTWTQATCVWDGFSNSLF 1 2 GIVSIMTLAALAYERYIRVVHAQVVDFPWAWRAIGHIWLYALAWTGAPLLGWNRYTLEIHRLGCSLDWASKDPNDASFILLFLLACFFVPVGIMIYCYGNILYAVQM 0 0 IRSIQDLQTVQIIKILRYEKKVSVMFFLMISCFLLCWTPYAVVSMMVAFGRRSMVSPTMAIIPSFFAKSSTAYNPLIYVFMSRK 0 0 FRHCLLQLLCSRLSWLQRSLKERPLAPVQRPIRPIVMSRPCGKGNRPKKKVTFSSSSIVFIITSDDFGQLDVTSKSGDSADVNAIQVRPL* 0 >ENCEPH_gasAcu Gasterosteus aculeatus (stickleback) 58%=homSap full 0 MNPDNGTREERSTDHSIFAVGTYKLLAFAIGTIGVFGFCNNVVVIVLYCKFKRLRTPTNLLVVNISLSDLLVSVIGINFTFVSCIRGGWTWSRATCIWDGFSNSLF 1 2 GIVSIMTLASLAYERYIRVVHAQVVDFPWAWRAIGHIWLYSLVWTGAPLLGWNRYTLEIHRLGCSLDWASKDPNDASFILLFLLACFFVPVGIMIYCYGNILYAVQM 0 0 LRSIQDLQTVQIIKILRYEKKVAVMFLLMISCFLLCWTPYAVVSMMEAFGRKNMVSPTVAIIPSFFAKSSTAYNPLICVFMSRK 0 0 FRRCLMQLLCSRVTCLQCNLKERPLAPVQRPIRPIVVSAACGGGRVRPKKRVTFSSSSIVFIITRNDIRHTDVTSNTRESSEANVFQVRPL* 0 >ENCEPH_oryLat Oryzias latipes (medaka) 58%=homSap full 0 MNPANESRAGRHEERSVFAVGTYKLLTVIIGTIGVFGFCNNLLVILLYCKFKRLRTPTSLLLVNISLSDLLVSVVGINFTLASCVKGRWMWSQATCVWDGFSNSLF 1 2 GIVSIMTLAALAYERYIRVVHAQVVDFPWAWRAIGHIWLYSLAWTGAPLLGWNRYTLEIHQLGCSLDWASKDPNDAAFILLFLLGCFFVPVGIMIYCYGNILYAVRM 0 0 LRSIEDLQTVQIIKILRYEKKVAAMFLLMISCFLVCWTPYAVVSMMEAFGKKSMVSPTVAIVPSFFAKSSTAYNPLIYVFMNRK 0 0 FRRCFLQLLGSRLCSKISWLQCTLKEHPLTPVERPIRPIVASTSCGSRHRPKKRVTFNSSSIVFMITGDEFQQLDVTSKSRNSSEANVFHVRPL* 0 >ENCEPH_calMil Callorhinchus milii (elephantfish) wgs frag YRRCLSQLFCSHLMSLQWSIKDPSSKARNDMPVKPIVLSQKGDRPKKRVTFSSSSIVFIITSDDTQELGSIAGSNATQISIVQVQPL* 0 MNPTNSTEPQEEHLFSPNTYKLLAVIIGTIGIVGFCNNILVLLLYYKFKRLRTPTNLLLVNISVSDLLVSVFGLSFTFVSCTQGRWGWDSAACVWDGSHSLF 1 2 GTVSIVTLTVLAYERYIRVVNAKATNFPWAWRAITYTWFYSLAWSGAPLV 0 0 0 0 >ENCEPH_squAca Squalus acanthias (dogfish) Gt 202 aa fragment 0 
MNAANSTDTREESLFSPGTYQVLAVIIGTIGVVGFCNNLLMLVLYCKFKRLRTPTNLFLVNISISDLLLSVFGVIFTFVSCVKGRWVWDSAACVWDGFSNCLF 1 2 GISSIMSLTVLAYERYIRVVNATAIDFSWAWRAITYIWLYSLAWTGAPLIGWNSYTLELHRLGCSVNWDSRNPSDTSFVLFLFLGCLLCPIGVIAYCYG >ENCEPH_petMar Petromyzon marinus (lamprey) Gt 293 aa fragment 0 MQSPKQDSLHYAGDTGAKAAPDSAQGNASALGSNFLLHGGDLGEGSTAFSAATFRLLAGVVGTIGVAGFLNNLLLVALFVGFKRLQTPTNLLLVNISLSDLLVSVFGNTLTLVSCVRRRWVWGNGGCVWDGFSNSLF 1 2 GIVSISTLTALSYERYARLIKAQVLDFSWAWRAVTYTWLYSAAWTGAPLLGWSRYVLEKHGLGCSIDWASSNPPDAAFVLFFFLGCLAAPLLVMGFCFGRIALAITQ 0 0 CWSPYAVASLFVASGFEHLVSPPVSIVPSLLAKSNAVCNPLLFLLMSGN 0 >ENCEPH4_braFlo Branchiostoma floridae (amphioxus) Gt synt(-ZFYVE1 +RTF1 -CES1 -POMT2) 402 aa 12435605 AB050608 Amphiop4 new exon 12 and 34 0 MALYNNTSSPSQDLLWDAPYSQGHIWDNSSASNSSEDVMDQGKVELQDFSDAGYTAIATCLALI 1 2 GFVGFTNNFVVILLIGCHRQLRTPFNLLLLNMSVADLLVSVCGNTLSFASAVRHRWLWGRPGCVWYGFANSLF 1 2 GIVSLVTLSALAFERYCVVVRSSDMLTYKSSLVVITFIWLYSLLWTSLPLLGWSSYQFEGHN 0 0 VGCSVNWVQHNPDNVSYIVTLMVTCFFVPMVVVCWSYAWIWRTVRM 0 0 SSEAKPECGNSQNAGRLVTTMVVVMIICFLVCWTPYAVMALIVTFGADHLVTPTASVIPSLVAKSSTAYNPIIYVLMNNQ 0 0 FREFLLARLQRVCCRQQAVPRVTPMDDNVHVRLGGEGPSQSQQFLPAGENVENVDMLEYVQENCKPKADSLSTISE* 0 >ENCEPH4_braBel Branchiostoma belcheri (amphioxus) AB050608 full Amphiop4 introns from braFlo PUBMED 12435605 0 MPLYNTSSGPTQGLPWDTPYSQDPIWNDSSPSNSSEDAVVDQGRGELQDFSDAGYTAIATGLALI 1 2 GLVGSMNNFVVILLIGCHRQLRTPFNLLLLNVSVADLLVSVCGNTLSFASAVQHRWLWGRPGCVWYGFANSLF 1 2 GIVSLVTLSALAFERYCVVVRSSEMLTYKSSLGMIAFIWMYSLLWTSLPLLGWSSYQFEGHS 0 0 VGCSVNWVKHNVNNVSYIITLMVTCFFVPMVVVCWSYACIWRTVRM 0 0 SAEMKSEFGNPQNTGRLVTTMVVVMIVCFLVCWTPYTVMALIVTFGADHLVTPTASVIPSLVAKSSTAYNPIIYVLMNNQ 0 0 FREFLLARLRTFCCRQPRMLRVTPMDDNAHARLVGEGPSHAQQVIPSEENGENVEMRKVQGNQLKADSLSTISE* 0 >ENCEPH_strPur Strongylocentrotus purpuratus (sea_urchin) GLEAN3_03451 modified terminal exon by extending penultimate to stop codon 0 MENFTSIVTDGTNEENTDGDAWPGYAHLLAGSFLTLVFIISIIGNSVVLFLFAWDRHLRTPTNMFLLSLTISDWLVTVVGIPFVTASIYAHRWLFAHVGCII 21 YAFIMTFLGLNSLMSHAVIAVDRYLVITKPHF 1 2 
GIVVTYPKAFLMISIPWVFSFAWAVFPLAGWGEFTYEGTGAWCSVRWDSDQPQIMSYVLAMMFLTFISSIVIMMYCYICIFLTTRRMPRWATSNSIKTHERNRRRR 2 1 EQKLLKTLIAIAIAFLVAWSPYAITSMIVVFGGSELLSLTATTLPSLFAKSSVMINPIIYAVTSRVFRKSLKK 0 0 MLTSFFPGCMTYIMTDKSPPSSSRPIQLGLCKYHFLY* 0 >TMT_monDom Monodelphis domestica (opossum) shortened final exon DFPEVSEKQLCLLS PEVWPQP synt(+NCK2 -UXS1 +TMT -ST6 GAL2_overlap) -RALY 0 MSNNLTTNLSLEALLSASEDKQRNGLSRTGHTIVAVFLGIILIFGSISNFIVLVLFCKFKVLRNPVNMLLLNISISDMLVCLSGTTLSFASSIQGRWIGGKHGCRWYGFANSCF 1 2 GIVSLISLAILSYERYRTLTLCPGQGADYQKALLAVAGSWLYSLVWTVPPLIGWSSYGTEGAGTSCSVHWTSKSVESVSYIMCLFIFCLVIPILVMVYFYGRLLYAVKQ 0 0 VGKIRKTAARKREYHVLFMVVTAVICYLICWVPYGMIALLATFGPPGVVSPVANVVPSILAKSSTVCNPIIYVLMNKQ 0 0 FYKCFLILFHCQPAQSGPDVSLCPSNVTVIQLGQRKNKDAPGSI* >TMT_macEug Macropus eugenii (wallaby) frag 0 MSINLTANLSFGTLLPDSEEKQRSGLSRTGHTVTAVFLGLILILGVINNFIVLVLFCKFKVLRNPVNMLLLNISISDMLVCLTGTTLSFASSIRGRWIAGYHGCRWYGFANSCF 1 2 GIVSLISLAVLSYERYRTLTLCPRQGTDYHKALLAVAGSWLYSLIWTVPPLIGWSSYGTEGAGTSCSVHWTSKSVESVSYIMCLFIFCLVIPILFMVYFYGRLLYTVKQ 0 0 VGKIRKSAARKREYHVLFMVVTAVICYLICWVPYGMIALLATFGPPGVVSPVANVVPSILAKSSTVCNPIIYILMNKQ 0 0 FYKCFLILFHCQPASSASDASLCPSKMTVIQLGQRKDKEVPCAIQDLPEVSKKQLCLLSPESNVAPSSGHPQEKMEEKPLSE* 0 >TMT_ornAna Ornithorhynchus anatinus (platypus) frag 0 GLSRTGHTMVAVFLGIILVFGFMNNLIVLILFCKFKALRNPVNMIMLNISASDMLVCVSGTTLSFASNISGRWIGGDPGCRWYGFVNSCL 1 2 GIVSLISLAVLSYERYRTLTLHPKQSTDYQKAVLAVGASWIYSLIWTIPPLLGWSSYGTEGAGTSCSVHWSSKSPVSVSYIVCLFIFCLVIPVLVMIYCYGRLLYAVKQ 0 0 IGKARKTAARKREYHVLFMVITTVICYLVCWMPYGVTALLATFGQPGTVSPEASVIPSILAKSSTVCNPIIYILMNKQ 0 0 FYKCFLILFHCQPPRAADAPSTYPSQVMVIQLNQRRSRETAGAPQVLLEMKHQTLHLLGPQLHETPSWERSTPVHPE* 0 >TMT_galGal Gallus gallus (chicken) XM_001234388 mRNA multiple tissue opsin full synt(+NCK2 -UXS1 +TMT -ST6 GAL2_overlap +SLC5A7 +SULT1C4) 0 MNHTWTYNLSFGAPTDPVEPRAGLSRNGHTVVAVFLGFILFFGFLNNLIVLILFCKFKTLRNPVNMLLLNISISDMLVCISGTTLSFASNIHGKWIGGEHGCRWYGFVNSCF 1 2 GIVSLISLAVLSYERYSTLTLCNKRSDDYRKALLAVGGSWVYSLLWTVPPLLGWSSYGIEGAGTSCSVRWSSETAESTSYIICLFIFCLVIPVMVMMYCYGRLLYAVKQ 0 0 
VGKIHKNTARKREYHVLFMVITTVICYLVCWIPYGVIALLATFGKPGVVTPVASIIPSILAKSSTVCNPIIYILMNKQ 0 0 FYKCFRQLFHCQPPSSTDGEPTCHSKVTVIQLNQKTDGGKLCNNKPRPETDNKVTSLLHPEPGLEPAAKTVPPM* 0 >TMT_taeGut Taeniopygia guttata (finch) 0 MNHTWMYNLSFGAPAHPVEPRAGLSRSGHTVVAVFLGLILFFGFLNNLIVLILFCKFKTLRNPVNMLLLNISVSDMLVCISGTTLSFASNIRGKWIGGDHACRWYGFVNSCF 1 2 GVVSLISLAVLSYERYNTLTLCHKRSDDFRKALLAVAGSWIYSLVWTVPPLLGWSSYGVEGAGTSCSVRWSSESAESTSYIICLFVFCLVVPVMVMMYCYGRLLYAVKQ 0 0 VGKIHKNAARKREYHVLFMVIPTVICYLVCWIPYGVIALLATFGKPGAVTPITSIIPSILAKSSTVCNPIIYILMNKQ 0 0 FYKCFRQLFHCQPPSSTDGEPTCHSKVTVIQLDQRADGGNMCNNEPHPETDSKMTSLLCPETTSKATPPTS* 0 >TMT_anoCar Anolis carolinensis (lizard) full synt(+TMT -ST6 GAL2_overlap +SLC5A7) 0 MSELSSNLTFNMSTSIEEPGSGLSRMGHNIVAVFLGLILVFGFLNNLVVLILFCKFKTLRNPVNMLLLNISASDMLVCISGTTLSFVSNIYGRWIGGEHGCRWYGFVNSCF 1 2 GIVSLISLAILSYERYSTLTQTNKRGSDYQKALLGVGGSWLYSLIWTVPPLIGWSSYGLEGAGTSCSVRWTSETLESVTYIICLFIFCLAIPVLVMIYCYARLFYAVKQ 0 0 VGKLRKTSARKREFHVLFMIITTIICYLICWMPYGVIALLATFGRPGLVSPVASVIPSILAKSSTVFNPIIYILMNKQ 0 0 FYKCFLMLLHCQPSSVADGETICQSKVMAIHQNQKAQGGVILKSQVVPQMDEKAICLLSPESSLDPVLESTPQLSKENSFL* 0 >TMT_xenTro Xenopus tropicalis (frog) full synt(-UXS1 +TMT -ST6 GAL2_overlap +SLC5A7) 0 MSTIKNWTTNISVENSMSYIENDLSLPTEAVLSRTGHTVVAIFLGFILIFGFLNNFVVLILFCKFKTLRTPVNMMLLNISASDMLVCVSGTTLSFTSSIKGKWIGGEYGCQWYGFVNSCF 1 2 GIVSLISLAILSYERYSTLTLYNKGGPNFKKALLAVASSWLYSLVWTVPPLLGWSSYGREGAGTSCSVRWTSESVESVSYIICLFIFCLALPVFVMLYCYGRLLYAVKQ 0 0 VGKIRKIAARKREYHVLFMVITTVICYLLCWLPYGVVALLATFGRPGVISPVASVVPSILAKSSTVFNPIIYILMNKQ 0 0 FYKCFLILFHCHPTSSADGKSICQSNYTVIQLNQKLNNIVAIPGQTQIPESVDKMPCIHRQNNESPSDQMPQSTTEHLISGT* 0 >TMT_danRer Danio rerio (zebrafish) synt(-UXS1 +TMT -ST6 GAL2_overlap +GPR89A -pdzk1l) 0 MFFEQADLNYSFNMSEEDRLTLLDEDWSDSPMETLSRAGFIALSVFLGFIMTFGFFNNLVVLVLFCKFKTLRTPVNMLLLNISISDMLVCMFGTTLSFASSVRGRWLLGRHGCMWYGFINSCF 1 2 GIVSLISLVVLSYDRYSTLTVYHKRAPDYRKPLLAVGGSWLYSLIWTVPPLLGWSSYGLEGAGTSCSVSWTQRTAESHAYIICLFVFCLGLPVLVMVYCYGRLLYAVKQ 0 0 VGKIRKTAARKREYHVLFMVITTVVCYLLCWMPYGVVAMMATFGRPGIISPVASVVPSLLAKSSTVINPLIYILMNKQ 0 0 
FYRCFRILFCCQRSLLQNGHSSMPSKTTVIQLNRRVNSNAVACTAQISTGTHNHDCSTHVTERSNPPEVIP* 0 >TMT_tetNig Tetraodon nigroviridis (pufferfish) synt(-UXS1 +TMT -ST6GAL2) multiply frameshifted assembly 0 MFSGQAGLNSSFNLSDGRGLEDAPAGRGRLSPTGFVVLSVVLGFIITFGFLNNFIVLLLFCKFKKLRTPVNVLLLNISVSDMLVCLFGTTLSFASSLRGRWLLGRSGCNWYGFINSCF 1 2 GIVSLISLVILSHDRYSTLTVYNKQGINYRKPLLAVGGTWLYSLLWTVPPLLGWSSYGIEGAGTSCSVSWTVQTAQSHAYIICLFIFCLGLPVLVMVYCYSRLLWAVKQ 0 0 VGKIRKTSARKREYHILFMVVTTAACYLVCWMPYGVVAMMATFGPPNIISPVASVVPSLLAKSSTVINPLIYILMNKQ 0 0 FYKCFLILFHCSHWSADNGTTSVPSKITVIQLNRRAYSNTVACADPLSTDALKQCCSAKNASTIEVKLS* 0 >TMT_takRub Takifugu rubripes (fugu) 0 MFSGQAGLNYSFNLSDDRELLDAPAGRAKLSPTGFVVLSVVLGFIMTFGFLNNFVVLLLFCKFKKLRTPVNMLLLNISVSDMLVCLFGTTLSFASSIRGRWLLGRIGCSWYGFINSCF 1 2 GIVSLISLVILSYDRYSTLTVYNKQGINYRKPLLAVGGTWLYSLFWTVPPLLGWSSYGIEGAGTSCSVSWTVQTAQSHAYIICLFTFCLGIPILVMIYCYSRLLWAVKQ 0 0 VGRIRKTAARKREYHILFMVVTTAACYLVCWMPYGVVAMMATFGPPNIISPVASVVPSLLAKSSTVINPLIYILMNKQ 0 0 FYKCFLILFHCGHWSADNGNTSMPSKTTAIQLNRRVYSNTVACADQLSTDALKQCCSANTISTKNTSTVEGKLS* 0 >TMT_gasAcu Gasterosteus aculeatus (stickleback) 0 MVFGQAGLNHSFNLSDDRELLDTSAGRAKLSPTGFVVLSVMLGFIMTFGFVNNLVVLLLFCKFKKLRTPVNMLLLNISVSDMLVCLFGTTLSFASSLRGKWLLGRSGCSWYGFINSCF 1 2 GIVSLISLVILSYDRYSTLTVYNKAGPDYRKPLLAIGGSWLYSLFWTVPPLLGWSSYGIEGAGTSCSVSWTVQTAQSHAYIICLFTFCLGLPMLVMIYCYSRLLLAVKQ 0 0 VGRIRKTAARRREYHILFMVLTTAACYMLCWMPYGVVAMMATFGPPNIISPVASVVPSLLAKSSTVINPLIYILMNKQ 0 0 FYRCFLILFHCKHWSAENHNTSMPSKTTVIHLNRRVCSNTLPCTAQASTDAANHFCSTSATKHTSPPLQGHGLSLNVLNMIRQENHSHDEAAKNQLDCLT* 0 >TMT_oryLat Oryzias latipes (medaka) 0 MFSGQTGLNFSFNQSDDRELEDTPAGSAKLSQAGFVVLSVVLGFIMTFGFLNNFVVLILFCKFKKLRTPVNMLLLNISVSDMLVCLFGTTLSFASSIRGRWLLGRGGCSWYGFINSCF 1 2 GIVSLISLVILSYDRYSTLTVYNKGGLNYRKPLLAVGGSWLYSLFWTVPPLLGWSSYGLEGAGTSCSVSWTANTAQSHAYIICLFIFCLGLPILVMIYCYSRLLLAVKQ 0 0 VGKIRKTAARKREYHILFMVLTTAACYLLCWMPYGVVAMMATFGPPNIISPVASVVPSLLAKSSTVINPLIYILMNKQ 0 0 FYRCFLILFHCDHWSSENGNTSVPSKTTVIPLNRRIYTNTVAQISTDNAN* 0 >TMT_ictPun Ictalurus punctatus (catfish) transcript from whole fry 0 WLLGRTGCMWYGFINSCF 1 2 
GIVSLISLMILSYERYSTMTVYNNQGPNYRKHLLAVGGSWLYSLIWTVPPLLGWSSYGLEGAGTSCSVSWTDHSPKSHAYIICLFIFCLGLPVLLMVYSYGRLLYAVKQ 0 0 LGKIHKTARRRDYHLLFMITTTVVCYLLCWTPYSVVALMASFGRPGIITPVASIIPSLLAKSSTVINPVIYIFMNKQ 0 0 FYRCFRTLLGYKERSAVPDDHSLMATKNTAIQLKCIMHNNPVPSPAHTPPPFF >TMTa_anoCar Anolis carolinensis (lizard) wgs 0 MTVPSISPSCIGVANGVWCSNGGSSSNSHHRHGQEQSLSPTGHLITAICLGVIGSLGFLNNLLVLVLFCRNKVLRSPINLLLMNISLSDLMICIVGTPFSFAASTQGKWLIGPAGCVWYGFANTFF 1 2 GTVSLISLAVLSYERYCTMMGTTEADATNYKKVWMGIFLSWIYSLFWSLPPLFGWSSYGPEGPGTTCSVNWHSRDANNISYIICLFIFCLVIPFIVIVYCYGKLLCAIKK 0 0 VSGVTQGMAQTREQRVLIMVVVMIICFLLCWLPYGIVALIATFGKPGLITPSASIIPSVLAKSSTVYNPVIYIFLNKQ 0 0 FYRCFCALLKCGKKSIASSNKCSSRSTRVCRSIRKQDNFTFVAASAGPPSSEHQDAILSIQNPEPPTDNSPKAKQRVLLVAHYSV* 0 >TMTa_xenTro Xenopus tropicalis (frog) wgs scaffold_55:1,749,966-1,816,542 0 MAFHSSSPSCSDSSSVTCRSSIQQYGPENHPNLSPTGHLLVAVFLGVIGSLGFFNNLVVLILFCQYKVLRSPINMLLMNISLSDLMVCILGTPFSFAASTQGHWLIGEIGCIWYGFVNTLF 1 2 GTVSLVSLAVLSYERYCTMLRSTEADLTNYKKAWLGILVSWIYSLVWTLPPLFGWSKYGPEGPGTTCSVNWHSRDANNISYIVCLFIFCLALPFAVIVYCYGRLLFAIKQ 0 0 VSGVSKSSSRAREQRVLIMVIVMVVCFLLCWLPYGVMALVATFGKPGIISPSASIIPSVLAKSSTVYNPIIYIFLNKQ 0 0 FYRCFTALIHCNKHPQVSSNKGSSKTTKIMLTARKLPDANFTVNAASNPPSSSVVKYEADSKTHNGDTKPFKTLVANYVI* 0 >TMTa1_danRer Danio rerio (zebrafish) NM_001118899 full synt(+PBX3 +TNK2 +TMTa1 -PAP2D +LPPR4) 0 MIVSNLSVLSCRRNSALCLGAVEGHLEASSSYRTLSPTGHILVAVSLGFIGTFGFLNNLLVLVLFGRYKVLRSPINFLLVNICLSDLLVCVLGTPFSFAASTQGRWLIGDTGCVWYGFANSLL 1 2 GIVSLISLAVLSYERYCTMMGSTEADATNYKKVIGGVLMSWIYSLIWTLPPLFGWSRYGPEGPGTTCSVDWTTKTANNISYIICLFIFCLIVPFLVIIFCYGKLLHAIKQ 0 0 VSSVNTSVSRKREHRVLLMVITMVVFYLLCWLPYGIMALLATFGAPGLVTAEASIVPSILAKSSTVINPVIYIFMNKQ 0 0 FYRCFRALLNCDKPQRGSSLKSSSKTKPFRPGRRTDNFTFMVASVGPNQTNPVEDGPPSADNTKPAVLSLVAHYNG* 0 >TMTa_takRub Takifugu rubripes (fugu) synt(-CALD1 +TNK2 -RAB18 +ABI1 12670711) AF402774 full 0 MIVSNVSLSGCAGVNGAVCAAEGHQAGGSDRSTLTPTGNLVVSVFLGFIGTFGLVNNLLVLVLFCRYKMLRSPINLLLMNISISDLLVCVLGTPFSFAASTQGRWLIGEAGCVWYGFANSLF 1 2 
GVVSLISLAVLSFERYSTMMTPTEADPSNYCKVCLGITLSWVYSLVWTVPPLFGWSSYGPEGPGTTCSVNWTAKTTNSISYIICLFVFCLIVPFLVIVFCYGKLLCAIRQ 0 0 VSGINASTSRKREQRVLCMVVIMVICYLLCWLPYGVVALLATFGPPDLVTPEASIIPSVLAKSSTVINPIIYVFMNKQ 0 0 FYRCFLALLCCQDPRSGSSMKSSSKVATKAKGVTPTGQRRTDFLYMVASLGRPAATIPQLGPSFDATNDFTKPPSSDTIKPVVVSLAAHCDG* 0 >TMTa_tetNig Tetraodon nigroviridis (pufferfish) full 0 MIASNASVSGCAGVHGAACAADAPPAGGSHRSSSSLTPTGNLVVSVFLGLIGTSGLVSNLLVLVLFCRFKVLRSPINLLLVNISVSDLLVCVLGTPFSFAASTQGRWLIGAAGCVWYGFVNSLF 1 2 GIVSLISLAVLSFERYSTMMTPTEADSSNYCKVCLGIGLSWVYSLLWTVPPLLGWSSYGPEGPGTTCSVNWTAKTANSVSYIICLFVFCLILPFLVIVFCYGKLLCAIRQ 0 0 VSGVNASMSRRREQRVLFMVVVMVICYLLCWLPYGVVALLATFGPPGLVTPAASIIPSILAKSSTVINPVIYVFMNKQ 0 0 FSRCFLSLLCCEDPRSSTSLRSSSRVTTKAVRGGTLTGQRRTNHLLYMVAALGRPVATAMPQLGPSFDATYDITKAPSSDNHQPVVVSLEAHG* 0 >TMTa_gasAcu Gasterosteus aculeatus (stickleback) synt(+TNK2 +ENC) full 0 MIVSNLSLSGCAGVSSALCAAAGEGHLSGGSHRNTLTPTGHLVVAVCLGFIGTLGLMNNLLVLVLFCRYKMLRSPINLLLINISISDLLVCVLGTPFSFAASTQGRWLIGEGGCVWYGFANSLF 1 2 GIVSLISLAVLSYERYSTMVAPTEADSSNYHKISLGITLSWVYSLIWTAPPLFGWSHYGPEGPGTTCSVDWTARTANSISYIICLFVFCLIVPFLVIVFCYGKLLCAIRQ 0 0 VSGINASLSRKREQRVLFMVVIMVVCYLLCWLPYGIMALMATFGPPGLITPVASIIPSVLAKTSTVINPVIYVFMNKQ 0 0 FYRCFKALLRCEAPRPSSSLKSSSKVPTKAMRGAAVTGPRHTNNFLFVVASLGRPVATIPQLGPSVEPTIDVTGGPSSDNNKPVIVSLVAQCDG* 0 >TMTa_oryLat Oryzias latipes (medaka) genome SLC12A3 two frags 0 MLVSNVSLGGCAEFNSALCAGAGEEHLGGGSYRTTLTPTGHLIVAVCLGFIGTFGLVNNLLVLVLFCRYKILRSPINLLLINISISDLLVCVLGTPFSFAASTQGRWLIGEGGCVWYGFANSLC 1 2 GIVSLISLAVLSYERYSTMMTPAEADSSNYRKISLGIILSWGYSLLWTLPPLFGWSHYGPEGPGTTCSVDWTAKTANNISYIICLFVFCLIVPFMVIVFCYGKLLYAIKQ 0 0 VSGINVSVSRKREQRVLFMVVIMVICYLLCWLPYGIMALLATFGPPDLVTPEASIIPSVLAKTSTAINPVIYVFMNKQ 0 0 * 0 >TMTa_pimPro Pimephales promelas (minnow) frag DT200813 GHLVVAVCLGFIGTfGFLNNTLVLILFCRYKVLRSPMNYLLVSIAVSDLLVCVLGTPFSFAASTQGRWLIGRAGCVWYGFINSCL 1 2 GVVSLISLAVLSYERYCTMMGATQADSTNYKKVAMGIAFSWIYSMVWTLPPLFGWSCYGPEGPGTTCSVNWAARTANNVSYIICLFFFCLILPFIVIVYSYGRLLQAITQ 0 0 
VSRINTVVSRKREQRVLFMVITMVVCYLLCWLPYGIMALLAAFGRPGLVTPAASIVPSVLAKTSTVINPIIYIFMNKQ 0 0 FCRCFHALIMCTTPQRGSSFKNSSKVTKTLRTVRRANGQNVTFAVASAGHPTICAPH >TMTa_oncMyk Oncorhynchus mykiss (trout) CU062745 testis new 0 MVVESGNLNFSTDDSSGTNLSPKDSVRDTLTSRQSDLGRTGHTVVAVFLGVIFLLGFLSNLFVLLVFARFQVLRTPINLILLNISVSDMLVCIFGTPFSFAASLYGRWLIGAHGCKWYGFANSLF 1 2 GIVSLVSLAILSYERYSTILCYTKADPSDYKKAWLAIAGAWLYSLVWTVPPFFGWSSYGPEGPGTTCSVQWHQRSSGNISYVTCLFIFCLLLPLLLMMFCYGKILFAIRG 0 0 VAKINQSSAQRRETHVLVMVVSMVSCYLLCWMPYGVVALLATFGQVGLVSPTTSIVPSILAKSSTFLNPVIYGLLNNQ 0 0 FYRCFLAFMSCGSEAAGSHTLHTLPSSRVVGPYGNSPAPEEPGTREPLDGSGTSSKTQTRGPQREKRDLVLVVHYTP* 0 >TMTb_danRer Danio rerio (zebrafish) synt(+TNK1 +TMTb -MYEOV2) 0 MIESNVSRSCEWCAGGGEGTGAHLDENHSDHSLSPTGHLVVAVCLGFIGTFGFLNNTLVLVLFCRYKVLRSPMNCLLISISVSDLLVCVLGTPFSFAASTQGRWLIGRAGCVWYGFINSFL 1 2 GVVSLISLAVLSYERYCTMMGSTQADSTNYRKVVIGIAFSWIYSMVWTLPPLFGWSCYGPEGPGTTCSVNWAARTPNNVSYIVCLFVFCLILPFIVIVYSYGRLLQAITQ 0 0 VSRINTVVSRKREQRVLFMVVTMVVCYLLCWLPYGIMALLATFGHPGLVTPAASIVPSLLAKSSTVINPIIYIFMNKQ 0 FCRCFHALIMCTTPERGSSFKNSSKVTKTLRTVRRANGQNVTFAVASAVHRTPYSDRQKSSSEGEKLPPATGQGTSKPVVSLVAYYNG* 0 >TMTb_takRub Takifugu rubripes (fugu) synt(+TFRC +TMTb +CHES1 -MYEOV2 -ARHGAP21) full 0 MIVCNVSLSCAHCPGEGTAANDAYAQASGSLATPTLSQRGHLVVAVCLGFIGTVGFLSNFLVLALFCRYRALRTPMNLMLVSISASDLLVSVLGTPFSFAASTQGRWLIGRAGCVWYGFVNACL 1 2 GIVSLISLAVLSYERYCTMVSSTIASNRDYRPVLGGICFSWFYSLAWTVPPLLGWSRYGPEGPGTTCSVDWRTQTPNNISYIVCLFTFCLLLPFFVILYSYGKLLHTIRQ 0 0 VRRVSSTVTRRREHRVLVMVVAMVVCYLICWLPYGVTALLATFGPPNLLTPEATITPSLLAKFSTVINPFIYIFMNKQ 0 0 FYRCFRAFLNCSTPKRDSTVRTFTRISLRALRQDQQQKGSALAPSSARPTPNSIHESSLKGSHSTPSNGGAAAAKSPAANRSKPKLILVAHYRE* 0 >TMTb_tetNig Tetraodon nigroviridis (pufferfish) 0 MIVCNLSLSCAHCPGGGAAATDAYAYAEAPGSLAPPTLSQRGHLVVAVCLGAIGTVGFLSNLLVLALFCRFRALRTPMNLMLVSISASDLLVSVLGTPFSFAASTQGRWLLGRAGCVWYGFVNACL 1 2 GIVSLISLAVLSYERYCTMMASTMASNRDYRPVLLGICFSWFYSLAWTVPPLLGWSRYGPEGPGTTCSVDWRTQTPNNISYIVCLFAFCLLLPFCVILYSYGKLLHTIRQ 0 0 VSSVSSAVTRRREHRVLVMVVAMVVCYLICWLPYGVTALLATFGPPNLLTPEATITPSLLAKFSTVINPFIYIFMNKQ 0 0 
FYRCFRAFLSCSSPERGSTVRTFTRISLRAVCQRKQQRVSAPAASSACPTPNSIHHSSRKGSHSASSNSGTAAAAKTPAANSSKPKLILVVHYRE* 0 >TMTb_gasAcu Gasterosteus aculeatus (stickleback) full 0 MIVCNVSLSCVHCPGGGAGGTAATATGAYEEVSDSLPAPSLSPKGHLVVAVCLGFIGTFGFLSNFLVLALFCRYRALRTPMNLLLVSISASDLLVSMVGTPFSFAASTQGRWLIGRAGCVWYGFVNACL 1 2 GIVSLISLAVLSFERYSTMVKPTVADGRDFRPALGGIAFSWLYSVAWTVPPLLGWSEYGPEGPGTTCSVDWKTQTANNISYIVCLFVFCLVLPFCVILYSYSRLLQAIRQ 0 0 VSVVSSVVTRHREQRVLAMVVVMVACYLVCWLPYGVAALLATFGPRDLLSPEASITPSLLAKFSTVVNPFIYIFMNKQ 0 0 FYRCFRAFLSCSTPERGSTLKTFSRPTKTLRAGRHEKGRRVSAAAPSTAQPTRNSAPRSSQGANHASATPPPSPADGRCAAAGAAKPKRTLVAHYRE* 0 >TMTb_oryLat Oryzias latipes (medaka) 0 MIVPNASLSCAHCDGDAAEQDAPGSAAAPSLSPTGHLVVAVCLGLIGTCGFLSNLLVLALFCRYRALRTPMNLLLVSISVSDLLVSVLGTPFSFAASTQGRWLIGRAGCVWYGFINACL 1 2 GIVSLISLAVLSYERYSTVMTPNMADGRDFRPALGGICFSWLYSVAWTVPPLLGWSRYGPEGPGTTCSVDWKTQTPNNISYIICLFTFCLLLPFGVIVYSYGKMLRVIRQ 0 0 VRSMSSVVTRRREQRVLVMVVTMVVCYLVCWLPYGIAALLATFGPRDLLTPAASITPSLLAKFSTVINPLIYIFMNKQ 0 0 FYRCFWAFFCCSTPEQVSTLRTFSRVTKTIRTFRQERELHVSAPAPSSGLPTPNSIQKGNNHVDPSSINQACAASDSPDSRKPKVVLVAHYQE* 0 >TMTa1_calMil Callorhinchus milii (elephantfish) wgs frag 0 MLNSSPNSSPSLPLSQVGWTGLSRTGLTVVAVCLGIIMVLGFLNNLLVLVLFCKYKVLRSPMNMLLLNISVSDMLVCICGTPFSFAASVQGRWLVGEQGCKWYGFANSLF 1 2 GIVSLMSLTILSYDRYITITGTTEADITNYNKTIVGIALSWIYSLMWTLPPLFGWSNYGPEGPGTTCSVNWQSKEVSSKSYIICLFIFCLLMPFLVIVYCYGKLVLAVRK 0 0 VGKINQMTAQTREHRILLMVISMVTFYLLCWLPYGTVALIGTFGNADLITPTCSVIPSILAKSSTVINPVIYVIMNKQ 0 >TMTa2_calMil Callorhinchus milii (elephantfish) wgs exons 1 and 4 frag 0 MMAHSANISTTLNASDHAPNLAGLSQSGHTTVAVFLGIILVLGCVNNLLVLLLFVCFKEIRTPLNMILLNISLSDLSVCVFGTPFSFAASIYRRWLIGHKGCKWYGFANSLF 1 0 VSANNSMGRTRENKLLIMVTFMIICFLLCWLPYGIVALLATFGSPGLITPTASIIPSVLAKTSTVYNPIIYIFMNKQ 0 >TMTx_braFlo Branchiostoma floridae (amphioxus) XM_002207814 frag with assembly duplication, no N-term even in v2.0 47% TMT5_braFlo + insect TMTs 0 VAAILALIGVLGIVNNSTTLYLVGRYKQLRTPFNILMVNLSVSDLLMCVLGTPFSFVSSLHGRWMFGHSGCEWYGFICNFL 1 2 GIVSLITLTVISYERYLLMKRLPNERILSYRAVALAVVFIWCYSLLWTAPPLVGWSSYGPE 00 
GYGISCSVNWESRTANDTSYIVAYFVGCLVFPVAIIVISYTRLILYMRQ 0 0 QAPSAPMQMLVRREKRVTKMVVVMIMGFTICWTPYTIVALIVTCGGEGIITPAAATVPALFAKSSVVYNAAIYVAMNNQ 0 0 FRKCFLRSLNCRSQPRDPSSQQYTLKTNQVGMSTSGSQAARTADRIKTVHVATANPQDHRSSSGQAVEDNGGFRKSLTHSLPLNSISTLLEAEK* 0 >TMT5_braFlo Branchiostoma floridae (amphioxus) extra 0 intron Amphiop5 0 MLGMHNVMNATDYDNNNATFAAWNFQRNGTTEEEVEFSGFDTVAVVIAAIGIAGFLSNGAVVLLFLKFRQLRTPFNMLLLNMSVADLLVSVCGNTLSFASAVRHRWLWGRPGCVWYGFANHLF 1 2 GLVSLISLAVISYERYRMVVKPKGPGSSYLTYNKVGLAIIFIYLYCLLWTTLPIVGWSSYQLE 0 0 GPKISCSVAWEEHSLSNTSYIVAIFIMCLLLPLLIIIYSYCRLWYKVKK 0 0 GSQNLPPAIRKSSQKEQKIARMVVVMITCFLVCWLPYGAMALVVSFGGESLISPTAAVVPSLLAKSSTCYNPLVYFAMNNQ 0 0 FRRYFQDLLCCGRRLFDASASVNTCNTSAMPRHSPVFQKPDSDQYNGIQKSREPQMRTTGQNAPYRQWIEMQTIAVVVKADEVNNKFGEVKT* 0 >TMT5_braBel Branchiostoma belcheri (amphioxus) AB050609 full introns from braFlo Amphiop5 extra Nfrag in mrna 0 MLGIYNVVNATEYGNNTTFAAWDFKRNGTGGEEEVEFFGYDAVAGVIAIIGVVGFVSNGAVVVLFLKFPQLRTPFNLLLLNMAVADLLVSVCGNTLSFASAVRHRWLWGRPGCVWYGFANHLF 1 2 GLVSLISLAVISFLRYRMVVKPKGPGSSYLTYTKVGLAILFIYLYCLLWTTLPIAGWSSYQLE 0 0 GPKIGCSVAWEEHSWSNTSYIVVLFITCLFAPLLIIVYSYYRLWHKVKQ 0 0 GSRNLPAAMRKSSQKEQKIAMMVIVMITCFMVCWLPYGAMALVVTFGGERLISHTAAVVPSLLAKSSTCYNPVVYFAMNSQ 0 0 FRRYFQDLLCCGRRLFDVSQSVVTGNTAMPRNNSQGFRKDDSDQKQDNGLPKQSEGPMCDHSSNESQMEGSRHNTAASQQWIEMQTIAVVVKAVEVDTSAANEP* 0 >TMTy_braFlo Branchiostoma floridae (amphioxus) FE572481 (to other allele) gastrula XM_002222645 flawed, allele dup, 39% ENCEPH4_braFlo new 0 MASAGQNVTFPAIDTMAPTPEALTSDPTTPAYFTTEQHLLMAVWLGFIGSFGFVTNLLTVLVFWCFKSLRTPFHLYLGGIALSDLLVAALGSPFAVASAVGERWLFGRAVCVWYAFVNYFL 1 2 SIVSIVTMATMSFSRYWVIIRPQSAPRLDTVYGACVVNALAWCYSFFWTIMPVLGWSRFTQ 0 0 VAAMTVCSLDWDHHTPLSKSYIPVAFLTCLFLPLGVIIFSVFKTTMHLRR 0 0 AAEVEDEVPNEVRAGRKTTRITLVMAGCWLVAWLPYACMALVIAAGGRVSPTVEVLATKFAKTSYIVNTIIYLVMEKE 0 0 FRKSLVLLLFCGRDPFDIQIEQPAYEKADVYVERLVTAEPMVEMEAVNVRPAQQEPARAPFGTPL* 0 >TMTPIN_strPur Strongylocentrotus purpuratus (sea_urchin) GLEAN3_05569 16311335 opsin1 PIN-type introns no cdna no sacKow 0 
MSNLMTGLVTNVNALSGIGNETPTTIGLSSLVVPVSRTTYNYLTVYTGFLTIFGILNNGIVMILFARFPSLRHPINSFLFNVSLSDLIISCLASPFTFASNFAGRWLFGDLGCTLYAFLVFVA 1 2 GTEQIVILAALSIQRCMLVVRPFTAQKMTHRWALFFISLTWIYSLIICVPPLFGWNRYTYEGPGT 1 2 ACSVAWNSPSPGDTSYIIFIFVLVLVIPFGIIIFCYGLLVYAVKK 0 0 ISRTQAALSSEAKADRKVSKMIFIMILFFLIAWTPYTGFSLYVTFGKNVVITPLAGTFPPFFAKLCTIHNPIIYFLLNKQ 0 0 FKDALIQLFCCGENPFDRDESEHEGRGGRHRHRTAPSATAHIGGRGRASSLPTATSMLDIPQAASTAASSSGKTQNKESLEKGPSTSETTNKRVFELSSKIQKFEISEKNNTPSSSELPGASSLSGALMPPRRAMKNQVGCLPPVDN* 0 >RGR1_homSap Homo sapiens (human) G? synt(+PCDH21 -LRIT1 -GRID1 -WAPAL) 296 aa _001012720 var2 retinal epithelium Mueller exon-skipping splice isoform 0 MAETSALPTGFGELEVLAVGMVLLVE 1 2 ALSGLSLNTLTIFSFCKTPELRTPCHLLVLSLALADSGISLNALVAATSSLLR 2 1 RWPYGSDGCQAHGFQGFVTALASICSSAAIAWGRYHHYCT 1 2 RSQLAWNSAVSLVLFVWLSSAFWAALPLLGWGHYDYEPLGTCCTLDYSKGDR 2 1 NFTSFLFTMSFFNFAMPLFITITSYSLMEQKLGKSGHLQ 0 0 VNTTLPARTLLLGWGPYAILYLYAVIADVTSISPKLQM 0 0 VPALIAKMVPTINAINYALGNEMVCRGIWQCLSPQKREKDRTK* 0 >RGR1_ornAna Ornithorhynchus anatinus (platypus) G? missing exon 1 retinal ganglia RGR last D in DRY motif, afros ERY, other placentals GRY 0 1 2 ALLGLCLNGLTIASFRKIKELRTPSNLLVVSLALADSGICLNALMAALSSFLR 2 1 HWPYGAEGCRLHGFQGFATALASISLSAAIGWDRYLRHCS 1 2 RSKPQWGTAVSTVLFAWGFSAFWSMMPILGWGQYDYEPLRTCCTLDYSKGDR 2 1 NFTTYLFAVAFFNFVIPLFIMLTSYQSIEQRFKKSGLFK 0 0 LNTRLPTRTLLFCWGPYALLCFYATVENVTFISPKLRM 0 0 IPALIAKTVPVIDAFTYALRNEDYRGGIWQFLTGQKIERVEVENKIK* 0 >RGR1_galGal Gallus gallus (chicken) G? synt(+PCDH21 -LRIT1 +CHAT -PARG) 296 aa 14985289 NM_001031216 retinal ganglia RGR 0 MVTSHPLPEGFTEIEVFAIGTALLVE 1 2 ALLGFCLNGLTIISFRKIKELRTPSNLLVLSIALADCGICINAFIAAFSSFLR 2 1 YWPYGSEGCQIHGFQGFLTALASISSSAAVAWDRYHHYCT 1 2 RSKLQWSTAISMMVFAWLFAAFWATMPLLGWGEYDYEPLRTCCTLDYSKGDR 2 1 NYITFLFALSIFNFMIPGFIMMTAYQSIHQKFKKSGHYK 0 0 FNTGLPLKTLVICWGPYCLLSFYAAIENVMFISPKYRM 0 0 IPAIIAKTVPTVDSFVYALGNENYRGGIWQFLTGQKIEKAEVDSKTK* 0 >RGR1_xenTro Xenopus tropicalis (frog) G? 
synt(+PCDH21 -LRIT1 +CHAT -PARG) 296 aa no_ref BC135113 retinal ganglia RGR 0 MVTSYPLPEGFTETEVFAIGTTLLVE 0 0 ALLGLLLNGLTLLSFYKIRELRTPSNLFIISLAVADTGLCLNAFVAAFSSFLR 2 1 YWPYGSEGCQIHGFQGFVAALSSIGSCAAIAWDRYHQYCT 1 2 RSKLHWSTAVSVVFFIWGFSAFWSAMPLFGWGEYDYEPLRTCCTLDYSKGDR 2 1 NYISYLFTMAFFEFLVPLFILMTAYQSIYQKMKKSGQIR 0 0 FNTSMPVKSLVFCWGPYCLLCFYAVIQDATILSPKLRM 0 0 IPALLAKTSPAVNAYVYGLGNENYRGGIWQYLTGQKLEKAETDNKTK* 0 >RGR1_gasAcu Gasterosteus aculeatus (stickleback) G? synt(+PCDH21 -LRIT1 +CHAT -PARG) 296 aa retinal ganglia RGR 0 MVSSYPLPDGFTDFDVFSLGSCLLVE 0 0 GLLGILLNAVTIAAFLKVRELRTPSNFLVFSLAVADIGISMNATIAAFSSFLR 2 1 YWPYGSDGCQTHGFQGFVTALASIHFIAAIAWDRYHQYCT 1 2 RTKLQWSSAITLAVFVWLFTAFWSAMPLIGWGEYDYEPLRTCCTLDYTKGDR 2 1 NYVSYLIPMAIFNMAIQVFVVMSSYQSIAQKFKKTGNPR 0 0 FNPNTPLKAMLFCWGPYGILAFYAAVENATLVSTKLRM 0 0 MAPILAKTSPTFNVFLYALGNENYRGGIWQLLTGEKIDVPQIENKSK* 0 >RGR1_calMil Callorhinchus milii (elephantfish) frag 0 EGFTDFEVFGLGTALLVE 1 2 GLVGLLLNGLTLLAFYKIKELRTPSNLLITSLALSDFGISMNAFIAAFSSFLR 2 1 YWPYGSEGCQTHGFHGFLMALASINACAAIAWDRYHQNCS 1 2 RSRLQWSSAITVTVFIWGIAAFWSAMPLLGWGVYDYEPLRTCCTLDYSKGDR 2 1 NFISFFIIMGSFEFIFPIFIMLSSYQSCKSKFKKNGQVK 0 0 FNTGLPVKTLIFCWGPYSLLCFYATIENITILSPKLRM 0 0 * 0 >RGRa_cioInt Ciona intestinalis (tunicate) Ci-opsin3 mRNA synt(PPT1 NOMO2+ FOXC2- FAHD1-) AB079882 12687683 289 aa larval visual cycle photoisomerase last 4 introns like RGR 0 MEVNDKRVYGVLMGLL 1 2 GLLTITGYSLLFVIFAKRPDLKKKNKFLLSLATSDLLITVHVFASTIAAFAPQWPFGDLGCQ 0 0 VDAFIGMAPTFISIAGAALIAKDKYYRFCKPKM 1 2 VGRNYSFHVYLTWTMGIIGGALPFIGFGRYGFETDDVTWRTGCLLDFKSISA 2 1 KYSFYIILISTVWFVWPVYKLVSSYMKISTKINKFYP 0 0 LLFVVPVQMAIGLLPYAIYAMVSITIGVSAVPYFCVVINN 0 0 LAAKVFVGSNPFIYIYFDPELRESCKQIFCSPPAPTNDKISEDSKDE* 0 >RGRb1_cioInt Ciona intestinalis (tunicate) AB078611 12373590 entire neural tube CiNut 0 MEIDFGFARTVYGVALLLM 1 2 VFITLLGYAVYFGAIWRSKTLQTRHIWLTSLACGDIIMMVHLILESLSSLGMGHRPRQNFECQ 0 0 VGALVGLFSGYVTIASITWIAIDRYYRQCKPEK 1 2 VGVNYCFYVIIVWAMSFLAASGPALGFGAYESAEENTVKCLIDLNKKDT 2 1 NSRLYIILVSAVWFVYPFVKMILYNKKLVQEAKEPQP 0 0 
MAFAVPLTFFLCYLPFAIYASLKITVGLPPLNSMVVASIY 0 0 MLPKVISVVNPYLYMRSDPELLAACRHVVGLTDGKKAV* 0 >RGRb2_cioInt Ciona intestinalis (tunicate) XM_002121277 0 MELEFGITRTAYGIAMLMM 1 2 AAVATFGYSVYILAIWSSKKLQTKHIWLTSLACADLLMMVHLFMDGLSSFHQGRRPKGIFECQ 0 0 VSAHMGLFSGFVSIASMTWICIDRYYRKFKPEK 1 2 VGVNYCFYVIIVWAMSFLAASGPALGFGAYESAEENTVKCLIDLENTDM 2 1 NTIKYFVVVGFLFFFYPIFKMIKYNTKFAYKSEEEKA 0 0 VVIAAPVSFVLGYLPYLVYACLKLTIGLPPLNQASIAFL 0 0 YLLPKFISVMNPYMYMRSDPELLRAAKRVVNFDFDQKIE* 0 >RGRb2_cioSav Ciona savignyi (tunicate) 80% 0 MELEFGIIRTAYGIAMLLM 1 2 AVVATFGYSIYLRAIWSSRKLQTRHIWLTSLACADLIMMVHLFMDGLSSFHQGRRPKGNFECQ 0 0 ASALMGLFSGFVSIGSVTWICIDRYYRRYKPEK 1 2 AGLNYCFYVIIVWGLSFLAASGPAMGFGTYESAQENTVKCLIDLDNTDG 2 1 NTIKYFMVIGVLYFFYPIFKMVKYNTLAQATEVEKT 0 0 VVIAAPATFVIGYLPYLSYAILKLTIGLPPVNEAFIAFV 0 0 ILPKFISVLNPYMYMRSDPELLRAAKDVVNFNFYTKMD* 0 >RGRa_cioSav Ciona savignyi (tunicate) Ci-opsin3 mRNA new 68% 288 aa larval visual cycle photoisomerase 0 MDFSSKRTYGIAMAAL 1 2 GFIAWVGYGLLFVIFAKSPDLKKKNRFLFSLAVSDLLITIHVVASVVASFQSEWPFGSIGCQ 0 0 LDAFIGMAPTFISIAGAALVAKDKYYRICKPKM 1 2 LVGRNYSFSIYANWTLGIIGGLLPFFGFGQYGFETDDLSLRTGCLLDFKTVSA 2 1 KYRFYIVFISLVWFVWPLYKLTSHYIKISAKLDRFHP 0 0 LMFVVPLQMLVSLLPYAIYAMISITVGVSSAPYYLVAVNN 0 0 IAAKVFIGTNPFIYIYFDPELRLACKNLFKYSSTPVQDQIQDKKDE* 0 >RGR2_danRer Danio rerio (zebrafish) NM_001024436 embryonic RPE, brain, gut, embryo PA 0 MASYPLPEGFTDFDMFAFGSALLVG 1 2 GLLGFFLNAISVLAFLRVREMQTPNNFFIFNLAVADLSLNINGLVAAYACYLR 2 1 HWPFGSEGCQLHAFQGMVSILAAISFLGAVAWDRYHQYCT 1 2 KQKMFWSTSITISCLIWILAVFWAAMPLPAIGWGVFDFEPLRTCCTLDYSQGDR 2 1 GYITYMLTITVLYLAFPVLVLQSSYSAIHAYFKKTHHYR 0 0 FNTGLPLKALLFCWGPYVVVCSLACFEDVSVLSPRLRM 0 0 VLPVLAKTSPIFHAVLYAYGNEFYRGGVWQFLTGQKSADKKK* 0 >RGR2_pimPro Pimephales promelas (minnow) liver brain PA 0 MASYALPEGFSDFDMFAFGSALLVG 1 2 GLLGFFLNLISVLAFLRVREIQTPNNFFIFNLAVADLSLNINGLVAAYASYLR 2 1 YWPFGSEGCQIHGFQGMVSILASISFLGAIAWDRYHLYCT 1 2 KQKMFWSTSGTISALIWILAVFWAALPLPAIGWGVFDFEPMRTCCTLDYTIGDR 2 1 NYISYMLTITVLYLAFPVLIMQSSYNGIYAHFKKTHHFK 0 0 FNTGLPLKMLLFCWGPYVLMCTYACFENASLVSPKLRM 0 0 
VLPVLAKTSPIFHAAMYAYGNEFYRGGIWQFLTGQKPADKKK* 0 >RGR2_tetNig Tetraodon nigroviridis (pufferfish) eye 0 MAAYTLPEGFTEFDMFTFGTALLVG 1 2 GVLGFFLNAISIVSFLTVKEMRNPSNFFVFNLALADISLNVNGLIAAYASYLR 2 1 YWPFGQDGCSYHAFHGMISVLASISFMAAIAWDRYHQYCT 1 2 RQKLFWSTTLTMSSIIWILSIFWSAVPLMGWGVYDFEPMRTCCTLDYTRGDR 2 1 DYVTYMLTLVVLYLTFPAATMWSCYDSIYKHFKKVHQHR 0 0 FNTSMPLRVLLVCWGPYVVMCVYACFENVKVVSPKLRM 0 0 LLPVIAKTNPIFNALLYTFGNEFYRGGVWHFLTGHKIVDPVLKKSK* 0 >RGR2_gasAcu Gasterosteus aculeatus (stickleback) eye 0 MAAFALPEGFTEFDMFTFGSALLVG 1 2 GLIGFFLNAISIASFLRVKEMWNPSNFFVFNLAVADICLNVNGLTAAYASYLR 2 1 YWPFGQDGCTFHAFQGMIAVLASISFMGVIAWDRYHQYCT 1 2 RQKLFWSTTLTMSAIIWILSIFWAAVPLMGWGVYDFEPMRTCCTLDYTKGDR 2 1 DYVTYMLTLVFLYLMFPALTMWSCYDAIHKHFKKIHLHK 0 0 FNTSTPLRVLLMCWGPYVLMCIYACFENVKVVSPKLRM 0 0 LLPVVAkTNPIFNALLYSFGNEFYRGGVWHFLTGQKMVDPVVKKSK* 0 >RGR2_oryLat Oryzias latipes (medaka) whole embryo 68% identical RGR2_danRer 0 MGTHTLPEGFTDFDMFTFGSALLVG 1 2 GLLGFFLNAISILAFLRVKEMRSPSSFLVFNLALADISLNINGLTAAYASYLR 2 1 YWPFGQEGCDYHGFQGMISVLASISFMAAIAWDRYHQYCT 1 2 RQKLFWSTSITISLIIWILSILWSAFPLMGWGVYDFEPMRIGCTLDYTKGDR 2 1 DYITYMLSLVFFYLMFPAFIMLSCYDAIYKHFKKIHYYR 0 0 FNTSLPLRVMLMCWGPYVLMCIYACFENVKLVSPKLRM 0 0 LPVIAKTNPFFNALLYSFGNEFYRGGVWNFLTGQKIVEPDVKKSKQK* 0 >PER1_homSap Homo sapiens (human) G? synt(-CFI +NOLA1 +EGF -ELOVL6) 338 aa 17167409 NM_006583 peropsin RRH retinal photoisomerase 0 MLRNNLGNSSDSKNEDGSVFSQTEHNIVATYLIMA 1 2 GMISIISNIIVLGIFIKYKELRTPTNAIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAGCQ 0 0 VYAGLNIFFGMASIGLLTVVAVDRYLTICLPDV 1 2 GRRMTTNTYIGLILGAWINGLFWALMPIIGWASYAPDPTGATCTINWRKNDR 2 1 SFVSYTMTVIAINFIVPLTVMFYCYYHVTLSIKHHTTSDCTESLNRDWSDQIDVTK 0 0 MSVIMICMFLVAWSPYSIVCLWASFGDPKKIPPPMAIIAPLFAKSSTFYNPCIYVVANKK 2 1 FRRAMLAMFKCQTHQTMPVTSILPMDVSQNPLASGRI* 0 >PER1_monDom Monodelphis domestica (opossum) G? 
synt(-CFI +NOLA1 +EGF -ELOVL6) 326 aa peropsin RRH 0 MFKNNSVKTLAPEKEGPSVFSPIEHKIVAAYLITA 1 2 GVISIVSNVIVLGIFVKYKALRTATNTIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYDGCQ 0 0 IYAGLNIFFGMASIGLLTAVAIDRYLTICQPDL 1 2 GRMTSYNYTLMILTAWVNGFFWALMPIVGWAGYAPDPTGATCTINWRKNDV 2 1 SFVSYTMTVITINFAMPLGVMFYCYYNVSQKMKQYSPSNCPDHINRDWSNQVAVTK 0 0 MSVVMILMFLLAWSPYSIVCLWASFGDPKEIPPAMAIVAPLFAKSSTFYNPCIYVAANKK 2 1 FRRAISAMIRCQTHQSMPISNALPMN* 0 >PER1_ornAna Ornithorhynchus anatinus (platypus) XM_001506366 peropsin RRH 0 MRRNDSANLLESEHHDRSAFSQTDHNIVAAYLITA 1 2 GIMSIVSNVIVLGIFVKFEELRTATNAIIINLAVTDIGVSGIGYPMSAASDLHGSWKFGHAGCQ 0 0 IYAGLNIFFGMSSIGLLTVVAVDRYLTICRPAI 1 2 GRKMTRSNYTAMILAAWMNGFFWASMPLLGWASYASDPTGATCTINWRKNDA 2 1 SFISYTMTVIAVNFAVPLIVMFYCYYNVSKAMRQYPASRVLENLNIDWSEQVDVTK 0 0 MSVVMILMFLMAWSPYSIVCLWSSFGDPKKISPAVAIMAPLFAKSSTFYNPCIYVVANKK 2 1 FRRAMLSMVQCQTHREITITDVLPMNRSRSPH* 0 >PER1_xenTro Xenopus tropicalis (frog) G? synt(-CFI +NOLA1 +EGF -ELOVL6) 347 aa peropsin RRH 0 METLAEVSTLLPAGTGTVNISDASSEVHSVFSQSEHNIVAAYLITA 1 2 GVISILSNIIVLGIFVKYKELRTATNAIIINLAFTDIGVSGIGYPMSAASDLHGSWKFGYVGCQ 0 0 IYAGLNIFFGMASIGLLTVVAIDRYLTICRPDI 1 2 GRRISGRHYTAMILAAWINAVFWSVMPVVGWSSYAPDPTGATCTINWRKNDV 2 1 SFVSYTMSVVAVNFVVPLMVMFYCYYNVSRTMKGYGSRSSLGGINADWSDQTDVTK 0 0 MSMVMIVMFLVAWSPYSIVCLWSSFGDPRKIPPAMAIIAPLFAKSSTFYNPCIYVIANKK 2 1 FRRAILSMVQCKSRQEVTLDNHFPMNVSQSTLTT* 0 >PER1_gasAcu Gasterosteus aculeatus (stickleback) G? synt(+GPR68 -GNPDA1 -ENPEP -C14orf100) 338 aa peropsin RRH 0 MGIDPEVNVTDDVTLYGGKSAFTQLEHNIVAGYLITA 1 2 GVISLFSNIVVLLMFWKFKELRTATNFIIINLAFTDIGVAGIGYPMSAASDIHGSWKFGYAGCQ 0 0 IYAALNIFFGMASIGLLTVVAIDRYLTICRPDI 1 2 GQKMTMQSYNLLILAAWLNAVFWSSMPVVGWASYAPDPTGATCTINWRQNDV 2 1 SFISYTMAVIAVNFVLPLSAMFYCYYNVSATVKRYKASNCLDSANIDWSDQMDVTK 0 0 MSIVMIIMFLVAWSPYSIVCLWASFGDPKTIPAPMAIIAPLFAKSSTFYNPCIYVIANKK 2 1 FRRAIIGMVRCQTRQRITINSQVPMTTSQQPLTQ* 0 >PER1_calMil Callorhinchus milii (elephantfish) G? 
151 aa fragment 1 LFVSYTMTVIAVNFVVPLSVMFFCYYNVSKTMSRFISSPSPENINLDWSDQLDVTK 0 0 MSVVMIVMFLLAWSPYSIVCLWASFGNPKLIPPAMAIIAPLFAKSSTFYNPCIYVIANKK 2 1 FRKAIMAMICCQNRQEITINHTLPMTISRVPLTE* 0 >PER1_braFlo Branchiostoma floridae (amphioxus) Go 391 aa 12435605 XM_002228504 Amphiop1 0 MNASPSSWLPSGELFTDSPENSSEWPWTDGPTDTAWHHHQTVDPVTYGGYLASAVYLTIT 1 2 GLIAFVGNIFAIIVFLTEKEFRKKEHNSFALNLAIADLSVCVFAYPSSTIS 1 2 GYAGEWMLGDVGCTIYGFLCFTFSLTSMVTLCAISVYRYIVICKPQY 1 2 AHLLTHRRTNYVILGIWLYALVFSVPPLFGVNRYTYEPIS 2 1 ITCSLDWNVQHVGETIYTAAVIIIVYVLNVSIMCFCYFNIIFKSANLKFAALASEKTRTAAKKDIWKTSM 0 0 MCLAMVVSFLIAWTPYAVSSTWDILTEEDLPIIATILPTMFAKSSCMMNPIIYSCCNGKFRQAALKTFSK 0 0 VGSSNKQNGQAQVEPRDPGFAVEPAGHQAFQMRVLPSSSAMTL* 0 >PER1_braBel Branchiostoma belcheri (amphioxus) AB050606 Amphiop1 introns from braFlo 0 MNASPSSWLSSGEFFTDSPENSSEWPWTDGPTDTTWRHHQSVDSVSYEGYLASAIYITLT 1 2 GLIAFFGNVITITVFLTEKEFRKKQQNGFVLNLAIADLSVCVFAYPSSAIA 1 2 GYAGRWVLGDVGCTIYGFLCFTFALVSMVTLCVISIYRYILICKPQY 1 2 AHLLTHRRTVYVIIGTWLYALVFTVPPLVGVKRYTYEPMQ 2 1 ITCSLDWNVQHPGEKAYIAAVLVIVYVLQVLIMCFCYFNIIFKSANLKFAALASEKTKMAAKKDTWKTSV 0 0 MCLTMVVSFLIAWTPYAVSSTWDILSAEDLPIIATILPSLFAKSSCMMNPIIYACCNTKFRQAAVKSFRK 0 0 LCGMCKQKVPLSTPQVVLAMQRNTEFTSTVEPTGQAFPMRVLPSISATHTAL* 0 >PER2_braFlo Branchiostoma floridae (amphioxus) 522 aa 12435605 XM_002209058 peropsin Amphiop2 PER/NEUR frag 72% 0 MIPTNNNTENNDLEWGLEKEHGVSATIMGVYLTIV 1 2 GLVSTVGNATVVLMFMLKWRQLCRKAPNLLIINLAAVDLCISVFGYPFSASSGFANQWLFSDAICT 0 0 LYGFSCFLLSMVSMHTLCLISAHRYITICRPEH 1 2 ASKLTMTRTILAVVGAWVYGISVAVPPLFGIA 1 2 GYTYESFGLSCTIDFHGTTVADMVYLSILIILCYVINVAVMGTCYFKIIRK 2 1 FSKHRFREVRDVRTSHQHSFERGVTL 0 0 RCILMTLFYLISWTPYTAVAVWTMVGPPPPVQLGMVAALTAKTHCAFNPILYMLMSE 0 0 VYRKLVLRTMCPCCFNKISNKLVRLPADDSKHSGNLDIFTVGYNTRDQAVQINKNAARRFCFVMET 0 0 ASDDLGIDDEVFAGQLGLCSRVKATEPGVEGFGGSEVPQSPSGTESEWSLSLLDFLPKRSSSKTAKASSLSETCSDNTVLSSAARK MAFLESSHQQSDREVCCIENRQAPEDTKPCKFAIESLGVRLPHKCCTASQVAGAPSRYAGMIETFTDSKGKTKKKAAVSLSEIDVKKP PPASKTWERRKTSKNTSRGQRVKRTFGKSRKHAYIVDC* 0 >PER2_braBel Branchiostoma belcheri (amphioxus) peropsin 
AB050607 Amphiop2 introns from braFlo 0 MIPTNNNTENNDLEWGLEKEHGVSATIMGVYLTIV 1 2 GLVATVGNATVVLMFIMKWRQLCRKAPNLLVINLAAANLCITIFGYPFSASSGYAHQWLFPDAICT 0 0 LYGFSCFLLSMVSMHTLCLISAHRYITICRPEH 1 2 ASKLTMNRTVLAVIGTWLYAIAVAVPPLFNIA 1 2 RYTYEPSGLSCTIDFRVTTVADLVYLGSLIVLCYVIHVAVMATCYFKIIRK 2 1 FSRHRFRQVRDIRTSHQRSFEMGVTM 0 0 RCILMTLFYLLSWTPYTAVCIWTMVGPPPPVVVSMAAALIAKTHCAFNPILYAFMSE 0 0 VYRKLVFRTMCPCCFNRISCKFVGTPTGGSKVSANPDIFTVDYNSRDQAVQINKAPSRRFCFVMET 0 0 SEDLGSDDTGLTGHSGLWRSGAEVEGLGGLQVTQSPSVSGSELSLSLLDFLPPKPSGRAVSAKLPSPPALNSERATCP ESSQQPSDRPATGLRQYQKGDTTRSSVGDLILTEDDVTNLPPASETWGRKKSENPLSYRQTTRRTFGRSRKHSYIVD* 0 >PER3_braFlo Branchiostoma floridae (amphioxus) 365 aa 12435605 Amphiop3 bblast 88% 0 MDIPTETPYGAGDDPAGTGWRWAETDQNGFHKYDHLIVGLYLFVI 1 2 GIIGTVENGITLATFTKFRSLRSPTTMLLVHLAIADLGICIFGYPFSGASSLR 0 0 SHWLFGGVGCQWYGFNGMFFGMANIGLLTCVAVDRYLVICRQDL 1 2 VDKVNYNTYGVMAALGWLFAAFWAALPLVGWAEYSLEPS 1 2 GTACTINWQKNDSLYISYVTSCFILGFALPLAVMMFCYWQ 0 0 ASCFVNKVLKGDISGDLTFPVAVNVDWEYQNHFSK 0 0 MCLAMVAAFVVAWTPYSVLFLFAAFGNPADIPAWITLLPPLIAKSSALYNPIIYIIANRRFRSAIFSMVKGQNPDVE 0 0 TLFARDFRISPIEDTGKEMSSMGNANA* 0 >PER3_braBel Branchiostoma belcheri (amphioxus) AB050610 Amphiop3 introns from braFlo 0 MDIPTETPYGAEEDIGESAGWRWTETDKNGFHKYDHLIVGLYLFVI 1 2 GIIGTIENGITLATFSKFRSLRSPTTMLLVHLAIADLGICIFGYPFSGASSLR 0 0 SHWLFGGVGCQWYGFNGMFFGMANIGLLTCVAVDRYLVICRHDL 1 2 VDKVNYNTYGVMAALGWLFAAFWAALPLVGWAEYALEPS 1 2 GTACTINFQKNDSLYISYVTSCFVLGFVVPLAVMAFCYWQ 0 ASCFVSKVLKGDIAGDLTFPVAANVDWEYQNHFSK 0 0 MCLAMVAAFVVAWTPYSVLFLFAAFWNPADIPAWLTLLPPLIAKSSALYNPIIYIIANRRFRNAICSMMKGQDPDVE 0 0 DDEHADEHRVRSIEDNDKEIISMVNLNMTV* 0 >PER2a_strPur Strongylocentrotus purpuratus (sea_urchin) Go GLEAN3_27634 overshoots iMet opsin3.2 XM_778236 spread across tandem inline, introns id PER2_patYes scallop2 0 MAASVTESSATEAISRLEPEYMVPLTRTGYLLTAIYLTIV 1 2 GSIATVGNITVICVLCRYRTFRKRSINLLLINMAASDLGVSVAGYPLTTVSGYWGRWLFGDVGCQFYAFCVYTLSCSTISTHAAIAVYRYIYIVKTDL 1 2 RPKLTANFTSGVIVVIWVYAFFWTVTPFVGWSSYIYEPFGTSCSVNWVGRTISDISYMVACTIGVYLLQIFIMLYCYIRVAKK 2 1 
IRGVDPGRTEEKDAGVVVFGRLRKREAKIDTHVTK 0 0 MCFMMMLTFIVVWAPYAVECLRAAHVHRISALSSVLPTMFAKSSCMVNPIIFLTSSSKFRQDLGKLWSRPSSQDSLQLEER NKTQRSLYVRHSELGSAHGNDTASVYYEKERIYIGEMRATSIQKEAELLQRDPELLSIASSTNSDVKFVVRDRPKRYTKR PVKPQGPRGPEMFTASGVTNKGSSTSDSGGQSTSSGTTGSKPKRSGRKASRQYSMKSQSEDTGEIFTLDGSALEMMSLRKL* 0 >PER2b_strPur Strongylocentrotus purpuratus (sea_urchin)Go 17067569 GLEAN3_27633 opsin3.1 RRH no cdna inline tandem partner of PER2_strPur, introns id PER2_patYes scallop2 0 MNSFSEESYVTDPTTTQPTLFLTPLSQTGYLLTALYLTLV 1 2 GIVSTIGNITVLCVLCRYGTFRKRSVNILLMNMAVSDLGVSVAGYPLTAISGYRGRWVFADIGCQFSGFCVYALSCSTISTHAVVAIYRYIYIVKPYH 1 2 RPRLSSSTSCLAILCIWTFTLFWTITPFFGWSSYTYEPFGTSCSINWYGKSLGDLTYIICCVVFVFILPIIIMLYCYIGVAKK 2 1 IKGIDPLRTEERDIAVVFGRLRKHETKIDTRVTK 0 0 ICFMMMASFIVVWTPYAVGSIWASKIGKISASASVLPTMFAKSSCMINPIIFLTSSSKFRADLGKLWNRPSSLEHTIRVEERSREQRSFF VRQSALPDAMVSRSASVYYDKERIYIGEMRAASIQKEADLLHRDPEAISIASSTSSSLQFVLKDRQNRYKKKAGEASKKGSNILHFPYDDTE GSMINNLMRPRSHSVTSDNISRVFAPSLKRPTKKRSMSHPDIPSTSADIFTVSPTTIKNLQKQ* 0 >PER1a_sacKol Saccoglossus kowalevskii (acornworm) 7 ests + ACQM01133041 0 MVTTDSLANSTDEPVPSILTLQQHYAASVTLLAL 1 2 AVIGTVLSSVNFRMLLSNPDYCSKAGNFFLSLAVTDL 1 2 CVCIFETPFSAFSHHAGFWIFGDTACQLYAFFGIFFGLVNIFMVTFISLDRYWATCSPVE 1 2 VELKSKYYTRMTALGWMVALFWAAAPVFGWSRYaMEPSMASCSIDYMTNDFSYVTYITCLTLTCYVVPIVVMVYCYVKASKNIKYTGKVTEWAHENNATK 0 0 ISRLCVLQLVFCWSLYGFNCMWTVVADDVETLPKMLTVLAPILAKTTPILNSGLYFLHNKKFRGAAVDMFKAKEE* 0 >PER1b_sacKol Saccoglossus kowalevskii (acornworm) frag 0 ests + ACQM01067921 0 1 2 gVLSVIGNSVVLEMFRRYKELLSPSAILLISLALADL 1 2 GLTIFGMSLSCVSSFAGRWLFGKFGCYFHGFAGMLFGLGSIGNLTVISIDRYIITCKRsL 1 2 QWSYRHYYALLAVAWSNALFWSMMPLFGWSSYALEPEGTSCTIDWMNNDNQYISYVSCVTVTCFILPCAVMTYDYLAAYMKMVKAGYTLSEETEKPNND 0 0 MCIALVAAFLLSWFPSATVFLWAAFGNPGNIPLSFTGVADAFTKIPAVFNPVIYVALNPEFRKYFGKTIGCRRKRKKPIAVRLNGSEQNVENTI* 0 >NEUR1_homSap Homo sapiens (human) OPN5 0 MALNHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 
GVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEGFR 2 1 LHTVTTVRKSSAVLEIHEEV* 0 >NEUR1_calJac Callithrix jacchus (marmoset) 0 MALNHTSLPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEDFR 2 1 LHTVTTVRKSSAVLEIHEEV* 0 >NEUR1_musMus Mus musculus (mouse) 0 MALNHTALPQDERLPHYLRDEDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWFGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASGGGQVFILSILFFCLLLPTAVIVFSYAKIIAKVKSSSKEVAHFDSRIHSSHVLEVKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYRFACCQAGGLRGTKKKSLEDFR 2 1 LHTVTTVRKSSAVLEIHQEV* 0 >NEUR1_ochPri Ochotona princeps (pika) 0 MALNDTALPQDEHLPHYFRDGDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRLYGWADFFFGCGSLITMTAVSLDRYLK 1 2 GVWLKRRHAYICLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHGSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFSCCRTGGLKQTKKKSLEDFR 2 1 LHTVTTVRKSSAVLEIHQEv* 0 >NEUR1_canFam Canis familiaris (dog) 0 MALNHTARPQDERLPHYLREGDPFASKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSSRRKKKLRPAEIMTINLAICDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASLGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGRLKATKKKSLEDFR 2 1 LNTVTTVRKSSAVLEIhQEV* 0 >NEUR1_bosTau Bos taurus (cow) 0 MALNHTAPPPDERRPPYLRDGDPFASKLSWEADLVAGFYLTII 1 2 
GILSTFGNGYVLYMSSRRKKKLRPAEIMTVNLAICDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GIWLKRKHAYICLAVIWAYAAFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEVKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLKATKKKSLEDFR 2 1 LHTVTTVRKSSAVLEVHQEv* 0 >NEUR1_loxAfr Loxodonta africana (elephant) 0 MTLNHTAPPQDDRLPQYLQDGDPFTSKLSWEADLVAGFYLTII 1 2 GILSTFGNGYVLYMSCRRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFVIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYAPEPFGTSCTLDWWLAQASVGGQIFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEIAHFDSRIHSSHMLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLRATKKKSLEGFR 2 1 LHTVTTVKKSSAVLEVHQEv* 0 >NEUR1_dasNov Dasypus novemcinctus (armadillo) 0 MALNHTALPQDDRLPHYLRDGDPFASKLSWEADLVAGFYLTII 1 2 gILSTFGNGYVLYMSSKRKKKLRPAEIMTINLAVCDLGIS 1 2 VVGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICYLSY 1 2 GVWLKRKHAYICLAVIWAYASFWTTMPLVGLGDYVPEPFGTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIYQVIDYKFACCQTGGLRATKKKSLEDFR 2 1 LHTVTTVRESSAVLEVHQEV* 0 >NEUR1_monDom Monodelphis domestica (opossum) 0 MALNHSVSPQDDYIPHYLRDGDPFASKLSWEADLVAGFYLTII 1 2 GVLSTLGNGYVIYMSSKRKKKLRPAEIMTVNLAVCDLGIS 1 2 VVGKPFTIISCFSHRWVFGWVGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLSY 1 2 GTWLKRHHAFICLALIWAYATFWATVPFAGVGSYAPEPFGTSCTLDWWLAQASVAGQAFVLSILFFCLLFPTAVIVFSYVKIILKVKSSTKEVAHYDTRIQNSHILEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGQPDSIPVQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCQSGGQKAAKKESLRTYR 2 1 LHTVTTVRRSSAVLEIHQEv* 0 >NEUR1_ornAna Ornithorhynchus anatinus (platypus) 0 MTNYSAPQLGDYLPHYLREGDPFVSKLSWEADLVAGVYLVII 1 2 GVLSTLGNGYVIYMSSRRKKKLRPAEIMTVNLAVCDLGIS 1 2 VVGKPFTIVSCFCHRWVFGWMGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLSY 1 2 GTWLKRHHAYICLAIIWAYASFWATMPLVGLGNYAPEPFGTSCTLDWWLAQASVAGQAFILNILFFCLLLPTAVIVFSYVKIIAKVKSSTKEVAHFDSRIQNSHVLEMKLTK 0 0 
VAMLICAGFLIAWIPYAVVSVWSAFGQPDSIPIQFSVVPTLLAKSAAMYNPIIYQVIDCRISCCRLGGPKTGKKESLKNSR 2 1 SHSMSTIRKPSAVSGPHQEV* 0 >NEUR1_galGal Gallus gallus (chicken) 0 MASDCNSSSQEEYLPHYMQQEDPFASKLSREADIIAGFYLTVI 1 2 GILSTLGNGYVIFMSSKRKKKLRPAEIMTVNLAVCDLGIS 1 2 VVGKPFSIISFFSHRWIFGWMGCRWYGWAGFFFGCGSLITMTAVSLDRYLKICHLAY 1 2 GTWLKRHHAFICLALIWAYATFWATVPFAGVGSYAPEPFGTSCTLDWWLAQASVAGQAFVLSILFFCLLFPTAVIVFSYVKIILKVKSSTKEVAHYDTRIQNSHILEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVWSAFGQPDSVPIQFSVVPTLLAKSAAMYNPIIYQVIDCKFACCRSGGPKTLQKKSSLKESR 2 1 MYTISSHRDSAALSGTQLEV* 0 >NEUR1_xenTro Xenopus tropicalis (frog) 0 MAGNSSYREESGYIPHYERDSDPFASKLSREADIFAGVYLMAI 1 2 GILSTLGNGYVIYMACSRKKKLRPAEIMTINLAVCDLGIS 1 2 VTGKPFAIVSCFSHRWVFGWNACRWYGWAGFFFGCGSLITLTVVSLDRYLKICHLRY 1 2 GTWLKRRHAFIALAVIWAYATLWATLPLVGVGNYAPEPFGTTCTLDWWLAQASVKGQIFVLSMLFFCLLFPTMVIVFSYAKIIAKVKSSAKEVAHFDTRNQNNHTLEIKLTK 0 0 VAMLICAGFLIAWFPYAVVSVWSAFGQPDSIPIELSVVPTMMAKSASMYNPIIYQVIDCKPACCKKDKSLQNTTSR 2 1 VYTISTFRKSTTSAR* 0 >NEUR1_danRer Danio rerio (zebrafish) 0 MENETSISSGYIPHYLLRGDPFASKLSKEADIVAAFYILVI 1 2 GILSATGNGYVMYMTFKRKTKLKPPEIMTLNLAIFDFGIS 1 2 VSGKPFFIVSSFSHRWLFGWQGCRYYGWAGFFFGCGSLITMTIVSFDRYLKICHLRY 1 2 gTWLKRHHAFLSVVFIWAYAAFWATMPVVGWGNYAPEPFGTSCTLDWWLTQASVSGQSFVMCMLFFCLIFPTVIIVFSYVMIIFKVKSSAKEVSHFDTRNKNNHSLEMKLTK 0 0 VAMLICAGFLIAWIPYAVVSVM >NEUR1_calMil Callorhinchus milii (elephantfish) G? 
209 aa fragment maybe petMar 0 MTAFDNSTALYSGYWLHDSLHGDPFVSKLSWEADIISACYLIVT 1 2 GLLSTLGNGYVIYLSITQKRKLKPPEILITNLAISDFGMS 1 2 VGGQPFLIISCFSHRWIFGWVGCRWHGWAGFFFGCGSLITMTVVSLDRYLKICHLQY 1 2 GSWLQRRHVFMSLAFIWFYAAFWATMPLVGWGNYAPEPFGTSCTLDWWLARVSVSGLIFVLTILFFCLLLPIIIIVFSYIKIIAKVKSSAKEVAHFDSRIQNHHSLEMNLTK 0 >NEUR1_petMar Petromyzon marinus (lamprey) frag 0 VAMMICAGFLLAWIPYAVVSVWSAFGAPDSVPVAVSMVPTMFAKSAAMYNPLIYQLLSRRGTGAHCCRCRKARGTLRRPR 2 >NEUR1a_braFlo Branchiostoma floridae (amphioxus) v2.0 allelic frameshifted cDNA FE548698 XM_002210516 last two exons missing has 95% allele 0 MATTPADRLDGLTPAGRGATTAETHADDFASKLSREADIVIGVYLILI 1 2 GTGAILGNGRVLWLSYRCRARLRPVEMFVVSLAVADVGLSLVGHPFAAASSLMGRWSFGSAGCTW 1 2 YGFVVFFLGIASIASMTLMSIMRFMIVYKRYP 1 2 GQYPTRRASCVLVTAAWLYGLFWACAPLA 1 2 GWSQYQPEPYGLSCSVDWGGFSSDAGGSSFIICMLLFCTAVPVVIMVTSYAAIFVIYRQAQKGVVLNLQVNATFGGKRQRTERKLTL 0 0 IALAVCGGFLLAWLPYAVVGLWASVAGVDAVPLALASAAPLFAKSNSLWNPIIYLGMNERFR 2 >NEUR1b_braFlo Branchiostoma floridae (amphioxus) from traces genome model XM_002202511 75% identical, separate allele 0 MATTPGLPLDGLAPTGRGVTAADTLDDDFASKLSREADIVIGVYLLLI 1 2 GTGSILGNGRVLWLSYRNWAKLRPVELFVVSLAVTDVGISVFGYPFAASSSLLGRWSFGSAGCTW 1 2 YGFTGFFFGLTSIANMALMSIMRFMIVYKGYP 1 2 GPYPSRRATSGLIAAAWLYGLFWACAPLA 1 2 GWSQYHVEPFGLSCTVDWGSFSRDAGGMSFIICLLVFCVAIPVTAIMASYVAISAIYRQAKKSIAGHLQDNSAMCKKRNKLERSITL 0 0 MALAVCGGFLLAWLPYAVVGLWSAVAGVDAVPLALASAAPLFAKSSSLWNPIIYLGMNDRFS 2 1 AELSREHSYLPATGVRTTGPPAEDNQAEGDDVSSPSLLFFRRLFWTSKT 1 2 SNAMRTLVVLTWISTIFPLLLAAPQADPRTTYKVSRWDEAWRPQRFGRSGRGDTKDGWRPQRFGRGRYEQGWRPQRFGRNEGLAGLREVLDGEAFPLLQMTR TDLHHDLPGIGYTPGTAHGSLRALPLLRLYDRGALQLINGAAKQSSTNPPSLRMIRAAAESLQGFARGGDQQDEEEMFAPPRSTDDWLAEIQRLGLRGRKRRDVS* 0 >NEUR_strPur Strongylocentrotus purpuratus (sea_urchin) XM_001197837 CX694910 CX690664 0 MDVNAKWWTNETLRTRDQFSDDHYTSVLSYEGDIWAGVYLMFI 1 2 SLIAFIGNISVIVISLRKREKLKPIDLLTINLAIADFLICVVSYPLPMISAFRHR 0 0 WSFGKFGCVWYGFTSFLFAVGSMATLMVIALLRYAKLCRENV 1 2 DQYQSRPFVIKVIVAIWGFAFFTTAPPLFGWS 2 1 
SYVPEPYHLSCTIDFADTSPSGLSYTYFTTIVVFFMPLMIIVLCYVAIARKMIHHNRRINVGHNAGRMLLEIRLLK 0 0 TACMITMAYTISWTPYAVIAMWVTYIPVNQIPDAFRILPAFCAKTSSVYNPIIYCIFNKSFRQDLSSLICCCACQCYTITINLDINSHAQQQFRRIEERR DEVGTYKRRPLMICSNPFAWSRDFHETWRQRRIRGIHRNCRNNVRVENINVNFRRDTDMVELNAPTPAEIHRPELNTASTRSGARTKSMATHLPALEEVPSG APQCSALLHNTPIPRSLQGTPLPYQPQPSTSDLHDEFLNPSVVSRNMCVIVVKPNIEEELSTD* 0 >NEUR2_galGal Gallus gallus (chicken) GenBank 5'UTR mistranslated as coding synt(-B4GALT6 -NEUR2_galGal -KIAA1012) 0 MDPSFANSTFQSKITEAADIVVGTCYMVF 1 2 GICSLCGNSILLYISYKKKHLLKPAEYFIINLAISDLAMTLTLYPLAVTSSLSHR 2 1 WLYGKHICLFYAFCGLFFGICSLSTLTLLSVVCCLKICFPAY 1 2 GNRFRRKHGQILIACAWTYAAIFACSPLAHWGEYGEEPYGTACCIDWQSTNVDVMSMSYTVVLFVLCFILPCGVIVTSYSLILVTVKESRKAVEQHVSGPTRINNVQTITAK 0 0 LSIAVCIGFFAAWSPYAIIAMWAAFGSIDKIPPLAFAIPAVFAKSSTLYNPIIHLLLKPNFRSNIAKDFTVIQQLCVRCCFCVKELQTYRSTFNTGLRTFKGKNESSCNALPIMEG CSYFPSEKGSHTFECFKSYPNCFQERLSTMGCHLQDCESLENDLQVEVTQGSRNSMKVVEQEEKSTELDNLEITLEAVPVSCTFTDL* 0 >NEUR2_anoCar Anolis carolinensis (lizard) 0 MESYFANTTFHSKITEAADVIVGVFYIVF 1 2 GICSFCGNSILLYVSYKKKNLLKPAEYFMINLAISDLGMTLTLYPLAVTSSLAHR 2 1 WLFGQQVCLFYAFCGVFFGVCSLTTLTLLSIVCCLKICFPVY 1 1 GNRFRPGHGWILIACAWVYAAIFAFSPLAHWGEYGAEPYGTACCIDWRISNMKKTAMSYTTALFVFCYIIPCGIIITSYTLILITVKDSRKAVEQHALGPTRMSSVHTITAK 0 0 LSIAVCIGFFVAWSPYAIIAMWAAFGSIDMIPPLAFAVPAVFAKSSTLYNPAMYLFLKPNFRSTIAKDLTVLHRLCLKSCFCPRGMQNCSYRSALEAPLKSFKGRNESSSNSVQIVGGCS YFPCEKCHDPFECFKNYPKCCQGRLNVMDHTPRESISVENNMQSKTKHASEKYIKVVIRGEKNTDIDNLEITLEHIPTDIKFANL* 0 >NEUR2_xenTro Xenopus tropicalis (frog) abundant transcripts 0 MGNKSDASAFYSSISETDDIVLGVLYSVF 1 2 GLLSLSGNSMLLLVAYRKRSILKPAEFFIVNLSISDLGMTGTLFPLAIPSLFAHR 2 1 WLFDKVTCNYYAFCGMLFGLCSLTNLTVLSSVCCLKVCYPAY 1 2 GNKFSTAHSRILLLGIWAYAGLFATAPLADWGKYGPEPYGTACCLDWEASYRERKALSYTISLFVFCYLIPSSLIFISYTLIFVTVKGARRAVQQHLSPQAKGSSIHSLIIK 0 0 LSIAVCIGFLIAWTPYAIVAMMAAFGDPTKIPSLVFALAAAFAKSSTIYNPVVYLLLKPNFLNVVTKDLTLFQTMCAVVCGWCRTPAVKTPCPHKD LKTTSKPPSSFKKSQGVCRNCVDTFECFRNYPRCCSVGNVDAAQPMAASLVRIPPANGAPQQTVQLVVSSSRTRSGVETVEVSTEAPMSDFIKDFI* 0 >NEUR2_danRer Danio rerio 
(zebrafish) acquired new intron 0 MGNVSKTALFMSTISRQHDILMGSLYSVF 1 2 FVLSLLGNGMLLFVAYRKRSSLKPAEFFVVNLSVSDLGMTLSLFPLAIPSALAHR 2 1 WLFGEITCLCYAVCGVLFGLCSLTNLTALSSVCCLKVCFPNY 1 2 GNKFSSSHACVMVIGVWCYASVFAVGPLVHWGSFGPEPYGTACCINW 2 1 YTPSHDALAMSYIISLFIFCYVVPCTIIILSYTFILVTVRGSQQAVQQHVSPQTKVTNAHALIVK 0 0 LSVAVCIGFLTAWSPYAIVAMWAAFSANEQVPPTAFALAAIMAKSSTIYNPMVYLLFKPNFRKSLSQDTQMFRHRICLSHSKASPSPGMKDQERQS SQQCNNKDGSISTPFSSGQAESYGACHVYAEAGPHYQQISRQITARVLEGSVQSEIPVKQLTEKMQNDLL* 0 >NEUR2_calMil Callorhinchus milii (elephantfish) frag novel paralog of neuropsin 0 1 2 GILSLVGNSVLLFVAYRKRQILKPAEYFVANLAVSDISMTVTLLPLAISSNFSHR 2 1 WLFVSKpCMYYGFCSMLFGICSLTNLTVLSTVCCMKVCFPAY 1 2 0 0 MSVVMIVMFLLAWSPYSIVCLWASFGNPKLIPPAMAIIAPLFAKSSTFYNPCIYVISYTMTVIAVNFVVPLSVMFFCYYNV >NEUR3_galGal Gallus gallus (chicken) cOpn5L2 mRNA for Opsin 5-like 2 AB368183 chr3 XM_420056 CN231992 testis exon 2^3 rel NEUR1/2 0 MEEQYISKLHPVVDYGAGVFLLII 1 2 AILTILGNSAVLATAVKRSSLLKSPELLTVNLAVADIGMAISMYPLAIASAWNHAWLGGDASCIYYALMGFLFGVCSMMTLCAMAVIRFLVTNSSKSN 1 2 SNKISKNTVHILITFIWLYSLLWAILPLVGWGYYGPEPFGISCTIAWSKFHSSSNGFSFILSMFLLCTVLPALTIVACYLGIAWKVHKAYQEIQNINRIPHAAKLEKKLTL 0 0 MAVLISVGFLSAWTPYAAASFWSIFNSSDSLQPIVTLLPCLFAKSSTAYNPFIYYIFSKTFRHEIKQLQCCWGWRVHFFSADNSAENSVSMMWSGRDNIRLSPTAKVESQGAARH* >NEUR3_taeGut Taeniopygia guttata (finch) ABQF01025032 0 MEEQYISKLHPVVDYGAGVFLLII 1 2 AILTILGNSAVLATAVKRSSLLKPPELLTVNLAVADIGMALSMYPLAIASAWSHAWLGGDASCVYYALMGFLLGVCSMMTLCAMAVIRFLVTNSPKSN 1 2 sNKITKNTVCILIAFIWLYSLLWAILPLVGWGYYGPEPFGISCTIAWSKFHNSSNGFSFILSMFLLCTVLPALTIVACYLGIAWKVHKAYQEIQNIDRIPNAAKLEKKLTL 0 0 MAVLISVGFLSSWTPYAATSFWSIFNSSHSLQPVVTLLPCLFAKSSTAYNPFIYYVFSKTFRCEVKRLQCCCAWRVHYFSSDNSVENPLSTMWSGRDNIRLSAAPQVQNPGAAAP* 0 >NEUR3_anoCar Anolis carolinensis (lizard) AAWZ01001057 0 MEEHYISKVHPVWDYGMGVFLLII 1 2 AILTILGNSMVLAVAVKRSSCLRSPELLTVNLAATDLGMGLSMYPLAIASAWNHAWLGGEATCIYYALMGFLFGVSSIMTLSAMAVIRFLVTFSSKPA 1 2 GHKINRKVMHICIMLIWAYAVLWAILPLLGWGHYGPEPFGTSCTIAWGQFHNSQKGFAFILSMFILCTFLPAITIIMCYLGIAWKFHKTHQEMQNLNRISSAAKLEKKLIL 0 0 
VAVLISVGFLGAWTPYAIVSFWSVFHSSESIPYIVTLLPCLFAKSSTAYNPFIYYTFSKTFRHEVKHLRCYSGQRAQENMKNSINSNVSFMWHGGGNICLSTRQIEMREIPNQ* 0 >NEUR3_xenTro Xenopus tropicalis (frog) cdna ovary embryo 0 MEERYLSKLHPLVDFGSGVFLLLV 1 2 AILTVLGNCAVLATAVKCSSHLKAPDLLSINLAVADLGMAISMYPLAIASAWNHAWLGGDASCLYYALMGFFFGVSSMMTLTVMAIIRYRVTSSFKYS 1 2 GCTIEKKAVCILIMCIWLYALLWAVLPLLGWGRYGPEPFGTSCTIAWGDFHHSSNGFSFIISMFILCTISPAVTIVVCYSGIAWKLHKAYQEIKNQDKIPNSTKVEKKLTL 0 0 LAILVSFGFLISWTPYAAVSFWSLFHSSKYIPPVVSLLPCLFAKSSTAFNPMIYYAFSKTFRRKVKHLKCCCGWRVHFLQSENSVENPRVSVIWTGKENVMVSSVPKLMKGVPGTPTGTQ* 0 >NEUR3a_danRer Danio rerio (zebrafish) 0 MDRYTSKLSPAVDYSAGTFLLVI 1 2 AILSILGNAAVLLTAAWRHSVLKAPELLTVNLAVTDIGMALSMYPLSIASAFNHAWIGGDPSCLYYGLMGMIFSVASIMTLAVMGLVRYLVTGNPPK 1 2 SGSKFRRKTISILIGVIWMYSLLWAVFPILGWGGYGPEPFGLACSVDWMGYQHSLNRSSFIMALAILCTLMPCVVILFSYSGIAWKLHKAYQSIQSNDNLPNSGAVERKVTL 0 0 MGILISTGFIVSWAPYVFVSLWTMFRSEGEDSVVPIVSLLPCLFAKCSTVYNPLVYYVFRKSFRREIHQIRICCFQGCWDAVSKMTRGDGPEETSGTHETDNI* 0 >NEUR3a_tetNig Tetraodon nigroviridis (pufferfish) 0 MDDKYMSKLSPPVDLWAGIYLVVI 1 2 ALLSVLGNASVLFSASRRLTPLKAPELLTVNLAVTDIGMALSMYPLSIASAFNHAWMGGDTACLYYGLMGMIFSITSIMTLAVMGMIRYLVTGSPPR 1 2 SGVQFQKKTICVVICAIWLYACLWAAFPLLGWGSYGPEPFGTACSIDWTGYGDSLNNATFIVAMSVLCTFLPCLVIFFTYFGIAWKLHKAYKSIKSSDFQYASVERRITL 0 0 IAVLISVGFLGSWAPYGLVSLWSILKDSSSIPPQVSLLPCLFAKSSTVYNPVIYYIFSQSFKLEVQQLFLCC* 0 >NEUR3_petMar Petromyzon marinus (lamprey) new exon frag 0 MAEQGEDDQFRSKLSPTADIAAGTFLLAV 1 2 AVLSLAGNGAVLGVAARRWAKLKAPELLSVNLALTDLGIAASIYPLAVASAWNHRWLGGQPVCTYYAFAGFFFGTASMGTLTAMAGVRYKGTSTQVH 1 2 VKQITKRAMLAVIVAVWAYALLWSCLPLLGWGR 2 1 YGVEPFGVSCTLAWAELQLTPGGVAFLYAMFVLCLLLPAIAIGLCYAGIVCKLRRAYREGRSKRRTPTARHVESRLTK 0 >NEUR3b_danRer Danio rerio (zebrafish) 0 MDIYSSKLSSAVDYGIGAFLLLI 1 2 TILSILGNLMVLVMAYKRSNHMKPPELLSVNLAVTDLGAAVTMYPLAVASAWNHHWIGGDVSCVYYGLMGFLFGAASMMTLTIMAIVRFIVSLTLQSP 1 2 KEKISKRNAKILVATTWLYALLWAIFPLIGWGKYGPEPFGLSCTLDWRDMKEHSQSFVITIFLMNLILPAIIIVSCYCGIALRLYVTYKSMDDSNHVPNMIKMQRRLMV 0 0 
IAVLISIGFVGCWAPYGIVSLWSIYRPGDSIPAEVSMLPCLFAKTSTVYNPFIYYIFSKTFKREVNQLSRFCGRSNICRPTDAKNRPENTIYLVCDVNKSKPGVEDLSLARSKENETQMLPNQDLHE* 0 >NEUR4_ornAna Ornithorhynchus anatinus (platypus) XM_001508128 0 MSLSHSLQVPWRNNLTFLNKEAQVSEQGETIIGIYLLAL 1 2 GWMSWFGNSMVIFILHRQRGILNPTDYLTFNLAVSDASVSVFGYSRGIIEIFNVFRDDGFLITSIWTCQ 0 0 VDGFLTLLFGLASINTLAMISVTRYIKGCHPHR 1 2 GHFINTANISVALILIWVSALFWSAGPVLGWGSYT 1 2 DRMYGTCEIDWAEANFSSICKSYIISIFFCCFFLPVSIMFFSYVSIIKMVKSSHTLAGADDPTDRQRRLDRDVTR 0 0 VSVVICTAFIVAWSPYAVISMWSAFGHSVPNLTSVLASLFAKSASFYNPIIYFGMNSKFRKDILVLLPCAKESKEPVKLKKFKNLRQKQGFTLQKPEKAHVLQVPDSGPMSLINTPPLGNRNSFDLACDNSDFECVRL* 0 >NEUR4_galGal Gallus gallus (chicken) genome gappy 0 MSLQLSPQAPWRNNNISFLSREAAVTEQGETIIGFYLLAL 1 2 GWMSWFGNSVVIFVLYKQRHLLQPTDYLTFNLAVSDASISVFGYSRGIIEIFNVFRDDGFIITSIWTCQ 0 0 VDGFLTLLFGLASINTLTVISVTRYIKGCHPER 1 2 AHCISNSSMTVAMVLIWIAAFFWSAAPLLGWGSYT 1 2 DRMYGTCEIDWAKANFSTIYKSYIISIFICCFFLPVTVMVFSYVSIINTVKLSHALTGLSDPTERQRRMERDVTR 0 0 IVICTAFIIAWSPYAVLLLWSAYGHPVPNLPLYLSSLFAKSASFYNPIIYFGMSSKFRRDIFILFHCAKEVKDPVKLKRFKNLKQKQEPSQKEEKYAAEMHPAPSPDSGVGSPTNTPPPANREEYFGILDTPSNSPDIECDRL* 0 >NEUR4_taeGut Taeniopygia guttata (finch) 0 MSVQFSAQAPWRNNNISFLTREAAVTEQGETIIGFYLLAL 1 2 GWLSWFGNSIVIFVLYKQRHVLQPTDYLTFNLAVSDASISVFGYSRGIIEIFNVFRDDGFIITSIWTCQ 0 0 VDGFLTLLFGLASINTLTVISVTRYIKGCHPER 1 2 GHCISNSSMSVALVLIWVAAFFWSAAPLLGWGSYT 1 2 DRMYGTCEIDWAKASFSTIYKSYIVSIFICCFFLPVTVMVFSYVSIINTVKLSHT LTGLGDPTDRQRRIERDVTR 0 0 VSLCTAFIIAWSPYAVISIWSAYGHPVPNLTSILASLFAKSASFYNPIIYFGMSSKFRRDIFIFHCAKELKDPVKLKRFKNLKPKQPQPSQKEEKYAPEMHPAPSPDSGVGSPTNSPPPANREVYFGILDTPSNNPNIECDRL* 0 >NEUR4_anocar Anolis carolinensis (lizard) 0 MSLQVSPQAPWRNNNVTFSNKEVPVSEQGETIIGFYLLAL 1 2 GWMSWFGNSIVIFVLYRQRAGLQPTDYLTFNLAVSDASVSVFGYSRGIIEIFNVFRDDGFLITSIWTCQ 0 0 VDGFLTLLFGLASINTLTVISVTRYIKGCHPDR 1 2 GKCISNSSISVALFLIWIAAFFWSVAPVLGWGSYr 1 2 DRMYGTCEIDWAKANFSTIYKSYIVSIFICCFFLPVSVMVFSYVSIINTVKSSHALSGVGDPTERQRRMERSVTR 0 0 
VSLVVICTAFITAWSPYAVISMWSAYGYTVPNLTSILASLFAKSASFYNPIIYFGMSSKFRKDIFVLLHCAKEIKDPVKLKRFKNLKQKQEVSPSQREEKYAADVQPALSPDSGVGRSNTPPPVNREVYFGAFDTFSNNPDVECDRL* 0 >NEUR4_xenTro Xenopus tropicalis (frog) 38% NEUR1_galGal 0 MSLQFPRPAPWRNNNLTLLQKENPLTEQGETIIGIYLLAL 1 2 GWLSWFGNSIVIFVLYKQRANLLPTDYLTFNLAVSDASTSVFGYSRGIIEIFNVFRDDGFLITSIWTCQ 0 0 VDGFLTLLFGLASINTLTLISVTRYIKGCHPQR 1 2 ANCISNGSITISLALIWIAALFWSVAPLLGWGSYR 1 2 DRMYGTCEIDWTKASFSTIYKSYIISIFICCFFLPVMVMVFCYVSIINTVKSSRALTSEGDLSERQRKMERDVTR 0 0 VSVVICTAFIVAWSPYAVISMWSACGYYVPSLTSILAALFAKSASFYNPLIYFGMSSKFRKDLCVVLPCAKAQKDPVKLKRYKDKKQGSAPRAREQTEIEQPVQLQPAPSQDSGVGSPSNTPPLRTKDVHIVDIDLVSDNPSYECDRL* 0 >NEUR4_danRer Danio rerio (zebrafish) 0 MSAQNPLQVVNIPWRNNNFSLMSRDPPLSDQGETIIGVYLLIL 1 2 GWLSWFGNSIVIFVLFRQRSTLQPTDYLTLNLAVSDASISVFGYSRGILEIFNIFKDSGYIISSVWTCQ 0 0 VDGFFTLVFGLSSINTLTVISITRFIKGCHPHK 1 2 AHCITNSTVAVCVVFIWIGAFFWSAAPVLGWGSYT 1 2 DRGYGTCEIDWVKANYSTIHKSYIISIFIFCFLVPVLLMLFCYISIINTVKRGNAMNADGDLSDRQRKIERDVTI 0 0 VSIVICTAFILAWSPYAVVSMWSAWGFHVPNLTSIFTRLFAKSASFYNPLIYFGLSSKFRKDVSVLLPCGREGRDPVRLKRFKRLRGRAEPPGAPAHTPHPQIALKNYNNHSKPHAGPAHCTGHAPSPDSGVGSHHETPPPQPRPQLFFIDVPEPEAESECVRL* 0 >NEUR4_tetNig Tetraodon nigroviridis (pufferfish) 0 MEPSRPWRNSSVLGGGAEPPLSEQGETIIGVYLLLL 1 2 GWLSWFGNTVVLFVLVRQRSSLQPTDLLTFNLAVSDASISVFGYSRGIIQIFNVFQDSGFIISSIWTCE 0 0 VDGFLTLIFGLSSINTLTVISITRYIKGCQPSR 1 2 AALISRSSVSVCLLLIWTTAGFWSGAPLLGWGSYT 1 2 DRGYGTCEIDWSKAASSGVYRSYIISIFIFCFFIPVFIMLFCYISIINTVKRGNALAADGHLSHRQRTMERDVTV 0 0 ISVVICTAFIMAWSPYAVVSMWSAWGFHVPSTTSIVTRLFAKSASFYNPLIYFGMSSKFRKDVSLILPCAKERREVVLLQRFKNIKPKAAAAPPPPPLPVYRPKEKNEDEPKLSVHDNDSGVNSPPETPPSDAQEVFPVDPPSQIETSEYWSDRL* 0 >NEUR4_gasAcu Gasterosteus aculeatus (stickleback) 0 PVKVVNIPWRNNNLSNLNTDPPLSEQGETFIGVYLLVL 1 2 GWLSWFGNSLVMFVLYRQRASLQSTDFLTLNLAISDASISIFGYSRGILEIFNIFNDDGYLINWIWTCQ 0 0 VDGFFTLLFGLASINTLTVISVTRYIKGCHPNK 1 2 AYCISTNTIAVSLICIWTGAVFWSVAPLLGWGSFT 1 2 DRGYGTCEVDWSKANYSTIHKSYIISILIFCFFIPVMIMLFSYVSIINTVKSTNAMSADGFLSTRQRKVERDVTRV 0 0 
ISIVICTAFITAWSPYAVVSMWSAWGFHVPSTTSIITRLFAKSASFYNPLIYFGMSSKFRKDVSVLVPCTRERREVVHLQHFKNIKPKAEAPPTPASLPVQKLGAKYAVPNPDADSGVNNPPQRPATDPQGDLNIDLPSHIETSEYWCDRL* 0 >NEUR4_calMil Callorhinchus milii (elephantfish) frag 0 MGCSLGWKVLLWFLHGILICPRPWRNHNSTFQPKEHPISEQGETIIGVYLLIL 1 2 GWLSWFGNSIVIFILYRQRLSLQPPDYLTLNLAVSDASISIFGYSRGIIEIFNVFRDDGFLITSIWTCQ 0 0 1 2 AVSISAGSIAASLVLIWIAAIFWSGAPLFNWGSYT 1 2 DRMYGTCEIDWSRASFSTIYKSYIISIFICCFFLPVFVMLFSYISIINTVKSSHAFAGNADLSDRQRRMEKDVTR 0 0 VSMVICTAFIIAWSPYAVISMWSASGYTVPQLTGIFASLFAKSASFYNPMIYFGLNSKFRKDIYILLPCVKEPKESVKLKRFKHLRHRPEQQQANKDRYAEELQQVASPDSGMGSPSKSPPLHNKDVFFVLWLRGLKK >PER1_lotGig Lottia gigantea (limpet) peropsin 0 MTAAEFSSFEHSIVGITYMVI 1 2 GISGTLLSLLVALTFIREKGLFKYGRAWLHISLAIANVGVVGAFPFSGSSSFSGR 2 1 WLYGSGMCTFYGFIGMFFGIAAIGNVFALCVERYLVSKKKDS 1 2 VDKVSNQFYWMITALVWINAFFWGIMPALGWTS 2 1 YDIEPSGTSCTIKWQNYDSGYPSFMAMLSLTCFLIPLPVALICLILSGTDKITEDKEEKTYFREDQLRS 0 0 TCTFLLILALIGWGPYCFICIWALFADTTQVSMLAAVIPPLAAKTMVLLYPVAYCQGNKRFKNAFLGMFIFNESPKQQ* 0 >PER1_aplCal Aplysia californica (sea_slug) peropsin from 8.8k contig 15 cdnas EB338056 to traces 0 MNDDGNALDGAGVDPAATAAHVSTALGFTKFEHASVGALYMLF 1 2 CIVGVTLNLLTALTFYKDTKLTKGSQPWLHILLALANVGVVAPSPFPASSSFSGR 2 1 WLYGSTMCQIYAFEGMFIGIAAIGAVIALCIERYIACQRSGA 1 2 NDTQGWFYGWSITLVLGNALFWAIMPLLGWSR 2 1 YSVEHTGTSCSIDWKNPDESFVSYIMTLEVFSFGIPMMSAFFCLISASPRPQPQGGATAAAGSGQEQQGEDTACKGCFSEDQLRL 0 0 LCYVFIGFVLVGWGPFAYLCTLAVFSDARGISMLAAAIPPLACKAMVSAYPLAYAVVSPRFRQSFLALIGGGEKKKE* 0 >PER1_todPac Todarodes pacificus (squid) retinochrome 2226795 peropsin 1991 seq dubious N-term to GDPAH 0 MFGNPAMTGLHQFTMWEHYFTGSIYLVL 1 2 GCVVFSLCGMCIIFLARQSPKPRRKYAILIHVLITAMAVNGGDPAHASSSIVGR 2 1 WLYGSVGCQLMGFWGFFGGMSHIWMLFAFAMERYMAVCHREF 1 2 YQQMPSVYYSIIVGLMYTFGTFWATMPLLGWAS 2 1 YGLEVHGTSCTINYSVSDESYQSYVFFLAIFSFIFPMVSGWYAISKAWSGLSAIPDAEKEKDKDILSEEQLTA 0 0 LAGAFILISLISWSGFGYVAIYSALTHGGAQLSHLRGHVPPIMSKTGCALFPLLIFLLTARSLPKSDTKKP* 0 >PER2_patYes Patinopecten yessoensis (scallop) Go depolarizing PM 9287291 AB006455 scop2 inner retina layer 0 
MPFPLNRTDTALVISPSEFRIIGIFISIC 1 2 CIIGVLGNLLIIIVFAKRRSVRRPINFFVLNLAVSDLIVALLGYPMTAASAFSNRWIFDNIGCKIYAFLCFNSGVISIMTHAALSFCRYIIICQYGY 1 2 RKKITQTTVLRTLFSIWSFAMFWTLSPLFGWSSYVIEVVPVSCSVNWYGHGLGDVSYTISVIVAVYVFPLSIIVFSYGMILQEKVCKDSRKNGIRAQQRYTPRFIQDIEQRVTF 0 0 ISFLMMAAFMVAWTPYAIMSALAIGSFNVENSFAALPTLFAKASCAYNPFIYAFTNANFRDTVVEIMAPWTTRRVGVSTLPWPQVTYYPRRRTS AVNTTDIEFPDDNIFIVNSSVNGPTVKREKIVQRNPINVRLGIKIEPRDSRAATENTFTADFSVI* 0 >PER_hasAda Hasarius adansoni (spider) Deut.Chelic.Arachn AB525082 Go full 19960196 peropsin first ecdysozoan peropsin not yet in GenBank 0 MDDNMSEIALADDMSTLSTQEPSENVYPYVFPLSTHTIVGTYLIII 1 2 GILGTLGNGLVLVTFLRFRVLVTPTTLLLVNLAVSDLGLILFGFPFSASSSLSAK 2 1 WIFGEGGCQWYAFMGFLFGSAHIGTLALLALDRYLIACRISL 1 2 RGKLTFKRYTQMITVVWTYAFFWALMPLLGWGR 2 1 YGLEPSVTTCTIDWQHNDSSYKSFLIVYFVLGFMVPFAIIAVSYIAIARRVGKKSKERPVVRDLWTNERSVTL 0 0 MAFILIVTFFVAWSPYAVLCLWTIFAEPNTVPPFLTLIPPLFAKSSTVVNPLIYFLSNPKLRTAILSTLSCCNEAPIQNIELPDSPERAANNDAI* 0 >PER_ixoSca Ixodes scapularis (tick) XM_002409761 embryonated eggs XM_002435011 ABJB010816023 contig frags 62% ? 
1 WLFGATGCQAYAFMGFLFGSAHIGTLTLLALDRYLATCRIGF 1 2 RSKPTFKRYFQLLLLVWLYGLFWAVMPLLGWAR 2 1 YGLEPSFQSCTIDWRHNDSSYKSFTLVYFVLGFLVPACIVLVCYRTSAIHIRAPKPKTVRRDVTDDYWASEEMVTV 0 0 MVALIVVTFFFAWTPYAVLCLWAVFADTKSVPHLLAMVPPLFAKTASTINPFIYFLSNPCIRADVLQLLGCRARSSPHMAISSDAVEEERCCQDQA* 0 >MEL1_homSap Homo sapiens (human) Gq -GRID1 -WAPAL +LDB3 +BMPR1A 483 aa NM_033282 melanopsin OPN4 0 MNPPSGPRVPPSPTQEPSCMATPAPPSWWDSSQSSISSLGRLPSISPT 0 0 APGTWAAAWVPLPTVDVPDHAHYTLGTVILLVGLTGMLGNLTVIYTFCR 2 1 SRSLRTPANMFIINLAVSDFLMSFTQAPVFFTSSLYKQWLFGET 1 2 GCEFYAFCGALFGISSMITLTAIALDRYLVITRPLATFGVASKRRAAFVLLGVWLYALAWSLPPFFGW 1 2 SAYVPEGLLTSCSWDYMSFTPAVRAYTMLLCCFVFFLPLLIIIYCYIFIFRAIRETGR 2 1 ALQTFGACKGNGESLWQRQRLQSECKMAKIMLLVILLFVLSWAPYSAVALVAFAG 2 1 YAHVLTPYMSSVPAVIAKASAIHNPIIYAITHPKYR 2 1 VAIAQHLPCLGVLLGVSRRHSRPYPSYRSTHRSTLTSHTSNLSWISIRRRQESLGSESEV 0 0 GWTHMEAAAVWGAAQQANGRSLYGQGLEDLEAKAPPRPQGHEAETPGK 0 0 TKGLIPSQDPRM* 0 >MEL1_panTro Pan troglodytes (chimp) 0 MNPPSGPRVPPSPTQEPSCMATPAPPSWWDSSQSSISSLGRLPSISPT 0 0 APGTWAAAWVPLPTVDVPDHAHYTLGTVILLVGLTGMLGNLTVIYTFCR 2 1 SRSLRTPANMFIINLAVSDFLMSFTQAPVFFTSSLYKQWLFGET 1 2 GCEFYAFCGALFGISSMITLTAIALDRYLVITRPLATFGVASKRRAAFVLLGVWLYALAWSLPPFFGW 1 2 SAYVPEGLLTSCSWDYMSFTPAVRAYTMLLCCFVFFLPLLIIIYCYIFIFRAIRETGR 2 1 ALQTFGACKGNGESLWQRQRLQSECKMAKIMLLVILLFVLSWAPYSAVALVAFAG 2 1 YAHVLTPYMSSVPAVIAKASAIHNPIIYAITHPKYR 2 1 VAIAQHLPCLGVLLGVSRRHSRPYPSYRSTHRSTLISHTSNLSWISIRRRQESLGSESEV 0 0 GWTHMEAAAVWGAAQQANGRSLYGQGLEDLEAKAPPRPQGHEAETPGK 0 0 TKGLIPSQDPRM* 0 >MEL1_gorGor Gorilla gorilla (gorilla) 0 MNPPSGPRVPPSPTQEPSCMATPAPPSWWDSSQSSISSLGRLPSISPT 0 0 APGTWAAAWVPLPTVDVPDHAHYTLGTVILLVGLTGMLGNLTVIYTFCR 2 1 SRSLRTPXNMFIINLAVSDFLMSFTQAPVFFTSSLYKQWLFGET 1 2 GCEFYAFCGALFGISSMITLXAIALDRYLVITRPLATFGVASKRRAAFVLLGVWLYALAWSLPPFFGW 1 2 SAXVPEGLLTSCSWDYMSFTPAVRAYTMLLCCFVFFLPLLIIIYCYIFIFRAIRETGR 2 1 ALQTFGACKGNGESLWQRQRLQSECKMAKIMLLVILLFVLSWAPYSAVALVAFAG 2 1 YAHVLTPYMSSVPXVIAKAXAIHNPIIYAITHPKYR 2 1 VAIAQHLPCLGVLLGVSRRHSRPYPSYRSTHRSTLISHTSNLSWISIRRRQESLGSESEV 0 0 
GWTHMEAAAVWGAAQQANGRSLYGQGLEDLEAKAPPRPQGHEAETPGK 0 0 TKGLIPSQDPRM* 0 >MEL1_ponAbe Pongo abelii (orang) 0 MNPPSGPRVPPSPTQEPSCMATPAPPSWWDSSQSSISSLGQLPSISPT 0 0 APGTWAAALVPFPTVDVPDHAHYTLGTVILLVGLTGMLGNLMVIYTFCR 2 1 SRGLRTPANMFIINLAVSDFLMSFTQAPVFFTSSLYKQWLFGET 1 2 GCEFYAFCGALFGISSMITLTAIALDRYLVITRPLATIGVASKRRAAFVLLGVWLYALAWSLPPFFGW 1 2 SAYVPEGLLTSCSWDYMSFTPAVRAYTMLLCCFVFFLPLLIIIYCYIFIFRAIRETGR 2 1 ALQTFGACKGNGESLWQRQRLQSECKMAKIMLLVILLFVLSWAPYSAVALVAFAG 2 1 YAHVLTPYMSSVPAVIAKASAIHNPIIYAITHPKYR 2 1 VAIAQHLPCLGVLLGVSRRHSRPYPSYRSTHRSTMISHTSNLSWISGRRRQESLGSESEV 0 0 GWTHMEAAAVWGAAQQANGRFLYDQGLEDLEAKAPPRPQGEEAETPGK 0 0 TKGLIPSQDPRM* 0 >MEL1_rheMac Rhesus macaca (rhesus) fill-in traces 0 MNPPSGPRVPPSPTQEPSCMATPAPPSRWDSSQSSISSLGQLPSVSPT 0 0 AAGTWAAAWVPFPTVDVPDHAHYTLGTVILLVGLTGMLGNLTVIYTFCR 2 1 SRGLRTPANMFIINLAISDFLMSFTQAPVFFASSLYKHWLFGET 1 2 GCEFYAFCGALFGISSMITLTAIALDRYLVITRPLATIGVASKRRAAFVLLGVWLYALAWSLPPFFGW 1 2 SAYVPEGLLTSCSWDYMSFTPAVRAYTMLLCCFVFFLPLLIIIYCYIFIFRAIRETGr 2 1 ALQTFGACKGSGESLWQRQRLQSECKMAKIMLLVILLFVLSWAPYSAVALVAFAG 2 1 YAHVLTPYMSSVPAVIAKASAIHNPIIYAITHPKYR 2 1 VAIAQHLPCLGVLLGVSRRHSHPYPSYRSTHRSTLISHTSNLSWISGRRRQESLGSESEV 0 0 GWTHMEAAAVWGAAQQANGRSLYGQGLEDLEAKAPPRPQGQEAETPGK 0 0 TKGLLPCKDSRM* 0 >MEL1_calJac Callithrix jacchus (marmoset) fill-in traces 0 MNLPTGSRVLPSPTQEPSCMTTPAPPSRWDSSQSSISSLSQLPSISPT 0 0 AAGTWAAAWIPFPTVDVPDHAHYTLGTVILLVGVTGMLGNLTVIYTFCR 2 1 SRGLRTPANMFIINLAVSDFLMSFTQAPVFFASSLYKRWLFGET 1 2 GCEFYAFCGALFGISSMITLMAIALDRYLVITRPLATIGVASTKRAAFVLLGVWLYALAWSLPPFFGW 1 2 SAYVPEGLLTSCSWDYMSFTPAVRAYTMLLCCFVFFLPLLIIIYCYIFIFRAIRETGR 2 1 ALQTFGACKGSGESLWQRQRLQSECKMAKIMLLVILLFVLSWAPYSAVALVAFAG 2 1 YAHVLTPYMSSVPAVIAKASAIHNPIIYAITHPKYR 2 1 VAIAQHLPCLGVLLGVSRRHSHPYPSYRSTHRSTLISHTSNLSWISGRRRQESLGSESEV 0 0 GWTHMEAAAAWGAAQQANGRSLYGHGLEDLEAKAPPRPQRQEAETPGK 0 0 tkGLLPSLDARW* 0 >MEL1_micMur Microcebus murinus (mouse_lemur) fill-in traces 4aa del exon 9 confirmed 0 MNPPSGPRMPPSPAQEPSCVATPALASSGDSSQNSVSSLGQLLPASPT 0 0 ATGAWAAAWVPFPTADVPDHAHYTLGTVILLVGLTGMLGNLTVIYTFCr 2 1 
SRSLRTPANMFVINLAVSDFLMSVTQAPVFFASSLYKRWLFGEA 1 2 gCEFYAFCGALFGISSMITLTAIALDRYLVITRPLASVGTASKRRAGLVLLGVWLYALAWSLPPFFGW 1 2 SAYVPEGLLTSCSWDYMTFTPSVRAYTMLLCCFVFFLPLLVIIYCYIFIFRAIRETGR 2 1 ALQTFGASKGTSESPRQQQRLQNEWKMAKIMLLVILLFLLSWAPYTAVALVAFAG 2 1 YAHILTPYMNSVPAVIAKASAIHNPIIYAITHPKYR 2 1 VAIAQHLPCVGVLLGVSRQHSRPYPSYRSTHRSTLSSQASDLSWISGRRRQESLGSENDM 0 0 GWTDMEVAAAWGAAPRVRGRCPYGQGPEDMEVKV QGQEAETPGK 0 0 AEGLPPCMDPRM* 0 >MEL1_otoGar Otolemur garnettii (lemur) fill-in traces 0 mnLPSAPRVPPSPAQASSCVATPAPHSRWDSSQTSISSLGQLRPVSPT 0 0 ASGAWAAGWIPFPTVDVPDHAHYTLGTVILLVGLTGMLGNLTVIYTFCR 2 1 VRGLRTPANMFVINLAVSDFLMSVTQAPVFFASSLYKQWLFGET 1 2 GCEFYAFCGALFGISSMITLTAIALDRYLVITRPLTTVGVASKRRAALVLLGVWLYSLAWSLPPFFGW 1 2 SAYVPEGLLTSCSWDYTSFTPSVRTYTMLLCCFVFFLPLLIIIYCYIFIFRAIRETGR 2 1 ALQTFGACKGSSESPRQRQRLQNEWKMAKITLLVILFFVLSWAPYTTVALVAFAG 2 1 YAHVLTPYMNSVPAVIAKASAIHNPIIYAITHPKYR 2 1 VAIAQHLPCLGLLLGVSRQHSRPYPSYRFTHHSTLSSQASDLSWISGRRRQESLGSESEV 0 0 GWTDMEAAATWGAALQVSGQCPYSQGLEDMEAKGPLRPQGPETKTSGK 0 0 TKGLLPSLDPRM* 0 >MEL1_musMus Mus musculus (mouse) AF147789 GenBank wrong readthru 0 MDSPSGPRVLSSLTQDPSFTTSPALQGIWNGTQNVSVRAQLLSVSPT 0 0 TSAHQAAAWVPFPTVDVPDHAHYTLGTVILLVGLTGMLGNLTVIYTFCR 2 1 NRGLRTPANMFIINLAVSDFLMSVTQAPVFFASSLYKKWLFGET 1 2 GCEFYAFCGAVFGITSMITLTAIAMDRYLVITRPLATIGRGSKRRTALVLLGVWLYALAWSLPPFFGW 1 2 SAYVPEGLLTSCSWDYMTFTPQVRAYTMLLFCFVFFLPLLIIIFCYIFIFRAIRETGR 2 1 ACEGCGESPLRQRRQWQRLQSEWKMAKVALIVILLFVLSWAPYSTVALVAFAG 2 1 YSHILTPYMSSVPAVIAKASAIHNPIIYAITHPKYR 2 1 VAIAQHLPCLGVLLGVSGQRSHPSLSYRSTHRSTLSSQSSDLSWISGRKRQESLGSESEV 0 0 GWTDTETTAAWGAAQQASGQSFCSQNLEDGELKASSSPQVQRSKTPK 0 0 TKGHLPSLDLGM* 0 >MEL1_ratNor Rattus norvegicus (rat) AY072689 0 MNSPSESRVPSSLTQDPSFTASPALLQGIWNSTQNISVRVQLLSVSPT 0 0 TPGLQAAAWVPFPTVDVPDHAHYTLGTVILLVGLTGMLGNLTVIYTFCR 2 1 NRGLRTPANMLIINLAVSDFLMSFTQAPVFFASSLYKKWLFGET 1 2 GCKFYAFCGAVFGIVSMITLTAIAMDRYLVITRPLATIGMRSKRRTALVLLGVWLYALAWSLPPFFGW 1 2 SAYVPEGLLTSCSWDYVTFTPLVRAYTMLLFCFVFFLPLLIIIFCYIFIFRAIRETGR 2 1 ACEGCGESPLRRRQWQRLQSEWKMAKVALIVILLFVLSWAPYSTVALVGFAG 2 1 
YSHILTPYMSSVPAVIAKASAIHNPIIYAITHPKYR 2 1 AAIAQHLPCLGVLLGVSGQRSHPSLSYRSTHRSTLSSQSSDLSWISGQKRQESLGSESEV 0 0 GWTDTETTAAWGAAQQASGQSFCSHDLEDGEVKAPSSPQEQKSKTPK 0 0 TKRHLPSLDRRM* 0 >MEL1_nanEhr Nannospalax ehrenbergi (molerat) 0 MNSPSGPRVPPGLAQKPSFMVTPVLPNQWISFQKNVSVGIQLPPASAT 0 0 ATGAQAASWVPFPTVDVPVHAHYTLGTVILLVGLTGMLGNLIVIYTFCR 2 1 SRGLRTRANMFTVNLAVSDFLMSFTQAPVFFASSLYKRWLFGEA 2 2 GCEFYAFCGAVSGITSMTTLTAIALDRYLVITRPLATIGVASKRRTALVLLGVWLYALAWSLPPFFGW 1 2 SAYVPEGLLTSCSWDYMTFTPSVRAYTMLLFCFVFFLPLLIIIFCYIFIFKAIRETGR 2 1 ACEGCGESPQRRRQWQRLQNEWKMAKVALLVIFLFVLSWAPYSTVALVAFAG 2 1 YSHILTPYMNSVPAVIAKASAIHNPIIYAITHPKYR 2 1 LAISQHLPCLGVLIGVSSQRSHPSLSYRSTHRSTLSSQASDLSWISGRKRQESLGSESEV 0 0 GWTDTEVTAAWGVAQEASGWSPYRHSLEDGEVKASPSPQGQEAKTSR 0 0 TKGQLPSLNLRM* 0 >MEL1_phoSun Phodopus sungorus (hamster) AY726733 3D: PUBMED 15698924 ends in error: AKGQLPSLDLGMQDAP 0 MDSPPGPTAPPGLTQGPSFMASTTLHSHWNSTQKVSTRAQLLAVSPT 0 0 ASGPEAAAWVPFPTVDVPDHAHYILGTVILLVGLTGMLGNLTVIYTFCR 2 1 SRSLRTPANMLIINLAVSDFLMSFTQAPVFFASSLYKKWLFGET 2 1 GCEFYAFCGAVLGITSMITLTAIALDRYLVITRPLATIGMGSKRRTALVLLGIWLYALAWSLPPFFGW 1 2 SAYVPEGLLTSCSWDYVTFTPQVRAYTMLLFCFVFFLPLLVIIFCYISIFRAIRETGR 2 1 ACEGWSESPQRRRQWHRLQSEWKMAKVALIVILLFVLSWAPYSTVALVAFAG 2 1 YSHILTPYMSSVPAVIAKASAIHNPIVYAITHPKYR 2 1 AAIAQHLPCLGVLLGVSSQRNRPSLSYRSTHRSTLSSQSSDLSWISAPKRQESLGSESEV 0 0 GWTDTEATAVWGAAQPASGQSSCGQNLEDGMVKAPSSPQ 0 * 0 >MEL1_bosTau Bos taurus (cow) 0 MNPPSGPRAPLGPVQESSCLATPASSSRWDSSRSSASSLGHPPSISPT 0 0 AVRAQAAAWVPFPTVDVPDHAHYTLGTVILLVGLTGMLGNLTVIYTFCR 2 1 SRGLRTPANMFIINLAVSDFLMSFTQAPVFFASSLYKQWLFGEA 1 2 GCEFYAFCGALFGITSMITLTAIALDRYLVITRPLATVGMVSKRRAALVLLGVWLYALAWSLPPFFGW 1 2 SAYVPEGLLTSCSWDYVSFTPSVRAYTMLLFCFVFFLPLLIIIYCYIFIFKAIRETGQ 2 1 ALQTFGTCEGGSECPRQRQRLQNEWKMAKIELLVILLFVLSWAPYSTVALMGFAG 2 1 YAHILTPYMNSVPAVIAKASAIYNPIIYAITHPKYR 2 1 LAIAQHLPCLGVLLGVSGQRTGLYTSYRSTHRSTLSSQASDLSWISGRRRQASLGSESEV 0 0 GWMDTEATAAWGAGQQVSGWSPCSQRLDDVEAKALPRPQGRDSEAPGK 0 0 AKGLLPNLDARM* 0 >MEL1_susScr Sus scrofa (pig) 0 MNPPSRPSAPPDPALESSCMATPASPSRWDSSQSSTSSLGHPLPISPT 0 0 
AARVRAAAWVPFPTVDVPDHAHYTLGTVILLVGLTGMLGNLTVIYTFCR 2 1 SRGLRTPANMFIINLAVSDFLMSFTQAPVFFASSLYKQWLFGEA 2 1 GCEFYAFCGAVFGITSMITLTAIALDRYLVITHPLATVGMVSKRRAALVLLGVWLYALAWSLPPFFGW 1 2 SAYVPEGLLTSCSWDYVSFTPKVRAYTMLLFCFVFLLPLLVIIYSYIFIFKAIRETGQ 2 1 ALQTFGACERRGKCPRQRQRLQNEWKMAKIELLVILLFVLSWAPYSTVALMGFAG 2 1 YGHVLTPYVNSVPAVIAKASAIYNPIIYAITHPKYR 2 1 MAIAQHLPCLGVLLGVSGQRTGLYTSYRSTHRSTLSSQASDLSWISGRRRQASLGSESEV 0 0 GWTDTEATAAWGAAQQVSRWSPCGQGLEDMEAKTSLGRLGWEAEAPGK 0 0 TKGLLPSLDPRM* 0 >MEL1_equCab Equus caballus (horse) bad GenBank 0 MNPPSEPQVPLGLAQEPGCVATPASPSRWSGSRSSTSSLGQPLPVGPT 0 0 AAGAQADAWVPFPTVDVPDHAHYTMGTVILLVGLTGMLGNLTVIYTFCR 2 1 SRGLRTPANMFIINLAVSDFLMSFTQAPVFFASSLYKQWLFGKA 1 1 GCEFYAFCGALFGITSMITLTAIALDRYLVITRPLATVGVVSKRWAALVLLGIWLYALAWSLPPFFGW 1 2 SAYVPEGLLTSCSWDYMTFTPSVRAYTMLLMCFVFFLPLLVIVYCYVFIFRAIRETGR 2 1 ALQTFGAWEGGGECPRQRQRLQSEWKMAKIVLLVILLFVLSWAPYSVVALVAFAG 2 1 YAHVLTPYMNSVPAVIAKASAIHNPIIYAIIHPKYR 2 1 MAIAQHLPCLGVLLGVSSQRTRPYTSYRSTHRSTLSSQGSDLSWISGRRRQASLGSESEV 0 0 GWMDTEAAAVWGAAQQMSGWSPCGQGLEDMEAKAPPRPQGWEGEALRK 0 0 IKGLLPSLDPRM* 0 >MEL1_felCat Felis catus (cat) AY382594 wrong readthru 0 MNPPSGPRTQEPSCVATPASPSRWDGYRSSTSSLDQPLPISPT 0 0 AARAQAAAWIPFPTVDVPDHAHYTLGTVILLVGLTGILGNLMVIYTFCR 2 1 SRGLRTPANMFIINLAVSDFFMSFTQAPVFFASSLHKRWLFGEA 1 2 GCEFYAFCGALFGITSMITLMAIALDRYLVITHPLATIGVVSKRRAALVLLGVWLYALAWSLPPFFGW 1 2 SAYVPEGLLTSCSWDYMSFTPSVRAYTMLLFCFVFFLPLLVIVYCYIFIFRAIRETGQ 2 1 ALQTFRACEGGGRSPRQRQRLQREWKMAKIELLVILLFVLSWAPYSIVALMAFAG 2 1 YAHVLTPYMNSVPAVIAKASAIHNPIIYAITHPKYR 2 1 MAIAQHLPCLGVLLGVSGQHTGPYASYRSTHRSTLSSQASDLSWISGRRRQASLGSESEV 0 0 GWMDTEAAAVWGAAQQVSGRFPCSQGLEDREAKAPVRPQGREAETPGQ 0 0 TKGLLPSQDPRM* 0 >MEL1_canFam Canis familiaris (dog) fixed 44-way 0 MNPPSGPGAQEPGCVATAASPGRWHGSPRSTVGLDQALPTGPT 0 0 AAGARAAAWAPFPTVDVPDHAHYILGTVILLVGLTGMLGNLMVIYTFCR 2 1 TRGLRTPSNMFIINLAVSDFFMSFTQAPVFFASSLHKRWLFGEA 1 2 GCEFYAFCGALFGITSMITLTAIALDRYLVITHPLAAVGVVSKRRAALVLLGVWLYALAWSLPPFFGW 1 2 SAYVPEGLLTSCSWDYMSFTPSVRAYTMLLFCFVFFLPLLVIVYCYVFIFRAIRETGQ 2 1 
ALQTFRACEGGARSPRQRQRLQREWKMAKMELLVILLFVLSWAPYSAVALTAFAG 2 1 YSHVLTPYMNSVPAVIAKASAIHNPIIYAITHPKYR 2 1 MAIAQHLPCLGVLLGVSGQRTGPYASYRSTHRSTLSSQASDLSWISGRRRQASLGSESEV 0 0 GWMDTEAAAVWGAAQPAGGRFLCTQGLEDAEAKAPLRPRGQAVETPGK 0 0 TKGRLPSLDPSRE* 0 >MEL1_myoLuc Myotis lucifugus (microbat) 7 aa del in exon 1 confirmed, 1 aa del in DRY cyto loop 0 MNPPSGPRGPLDPAREPGCVATPVSP SSTSSVDPPLPTSPT 0 0 AAEAQAAAWVSFPTVDVPAHAHYTLGIVILLVGLTGMLGNLTVIYTFCR 2 1 SRGLRTPANMFIINLAVSDFLMCFTQAPVVFASSIYKRWLFGEA 1 2 GCEFYAFCGALFGITSMITLTAIALDRYLVITRPL AIGVVSKRRAALVLLGVWLYALAWSLPPFFGW 1 2 SAYVPEGLLTSCSWDYMSFTPAVRSYTMLLFCFVFFLPLLVIIYCYVFIFRAIRETGQ 2 1 ALQTFGACEGRSELPRQRQGLQNEWKMAKIVLLVILLFVLSWAPYSTVALMAFAG 2 1 YAHVLTPYMNSVPAIIAKASAIHNPIIYAITHPKYR 2 1 MAIAQHLPCLGLLLGVSGQRTGPYASYRSTHRSTLSSQASDLSWVSRRRRQESLGSESEM 0 0 GWTDTEAAAMWGAAQQVSGPPPCSQGLEDVETKAPPKSQGHEAEDPRK 0 0 TKGLLPSPDPRM* 0 >MEL1_pteVam Pteropus vampyrus (macrobat) 0 MNLPVGPRVPLDPAQEPSCMATPASPS WDSSPSSASSLDQPLPISPT 0 0 AAGSQAAAWVPFPTVDVPDHAHYTLGTVILLVGLTGMLGNLTVIYTFCR 2 1 SRGLRTPANMFIINLAVSDFLMSFTQAPVVFISSLYKRWLFGQA 1 2 GCEFYAFCGALFGITSMITLTAIALDRYLVITRPLAAIGVVSKRRAALVLLGVWLYALAWSLPPFFGW 1 2 SAYVPEGLLTSCSWDYMSFTPSVRAYTMLLFCFVFFLPLLIIIYCYIFIFRAIRETGQ 2 1 ALQTFGACEGLSELPRQRQRLQSEWKMAKIVLLVILLFVLSWAPYSTVALMAFAG 2 1 YSHVLTPYMNSVPAIIAKASAIHNPIIYAITHPKYR 2 1 MAIAQHLPCLGVLLGMSGQHTGPYTSYRSTHRSTLSSQASDLSWISRRRRQASLGSESEM 0 0 GWTDTEAAATWGTAQQVSGPSLWGQDLEDVEAKAPPKPQGREAEAPRK 0 0 TKELLPSLDPRM* 0 >MEL1_eriEur Erinaceus europaeus (hedgehog) 0 MALSLGPRVPTSQALDPSCMDTPASPSRWDSSQNSTSSLAQLPLISLT 0 0 SVSPQATASAPFPTVNVPDHAHYTLGTVILLVGLTGMLGNLTVIYTFCR 2 1 SRSLRTPANMFIINLAVSDFLMSFTQTPVFFASSLYKQWLFGEA 1 2 GCEFYAFCGALFGITSMITLTAIALDRYLVITRPLATIGVVSKRRVALVLLGVWLYSLAWSLPPFFGW 1 2 SAYVPEGLLTSCSWDYMRFTPSFRAYTMLLFCFVFFLPLLVIIYCYIFIFRAIRETGQ 2 1 ALQTFRACEGSCDFPRQQQRLQSEWKMAKIILLVILLFVLSWAPYSTVALMAFAG 2 1 YAHVLTPYMNSVPAVIAKASAIHNPIIYAITHPKYR 2 1 MAIAQHLPCLRVLLGVSGQRDRPYTSYRSTHRSTLSSQISDLSWVSRRRRQASLGSESEV 0 0 GWTDTEVAAVWGT MSGHFPCGQGLDDMEAKAAHNPRGLEAETPGK 0 0 IKGLLPSLDPQM* 0 
>MEL1_loxAfr Loxodonta africana (elephant) fragment 0 MNPPWGPRVPSGRAQEPSCVATPASASRWNSSRASASSLGELPPSSPT 0 0 AARAHTAAWDPFPTVDVPDHAHYTLGAVILLVGLMGMLGNLMVIYIFFR 2 1 SRGLRTPANMFIINLAVSDFLMSFTQAPVFFASSLYKRWLFGEA 1 2 GCKFYAFCGALFGITSMITLTAIALDRYLVITRPLATIGVVSKRRAALVLLGIWLYALAWSLPPFFGW 1 2 SAYVPDGLLTSC 2 1 ALQTFGACEGASEPPRQWQRLQSEWKMAKIALLAILLYVLSWAPYSTVALVAFAG 2 1 YAHVLTPYMNSVPAVIAKASAIHNPIIYAITHPKYR 2 1 MAIAQHLPCLGVMLGVSGQRTRPYTSYHSTLHSTLSSQASDLSWISGRRRQASLGSESEV 0 0 GWTDTEAAAAWEGAQQVSGQASCSQALQNLEANTPPRPQGWGPETPRK 0 0 * 0 >MEL1_proCap Procavia capensis (rock_hyrax) 0 MNPPWGPRVPSRPAQEPSCMSTPASAGRWDSSQATASSLAELPPSSPT 0 0 EARTQTADWVPFPTVDVPDYAHYTLGTVILLVGLTGVLGNLMVIYIFFR 2 1 SRGLRTPANMFIINLAISDFLMSLTQAPVFFASSLYKRWLFGEA 1 2 GCEFYAFCGALFGITSMITLTAIALDRYLVITRPLATIGVVSKRRTALVLLGTWLYALAWSLPPFFGW 1 2 SAYVPDGLLTSCSWDYKSFMPSARTYTMLLCCFVFFLPLLVIIYCYVFIFKAIRETGR 2 1 ALQTFGACEGASETPRQWQRLQSEWKMAKIALLAILLYVLSWAPYSTVALVGFAG 2 1 YAHVLTPYMNSVPAVIAKASAIHNPIIYAITHPKYR 2 1 MAIAQHLPCLGVLLGVSDQHTRPYTSYRSTHHSTLSSQASDISWISGRRRQASLGSESEV 0 0 GWTDTEAAAAWEGAQQVSGRASCSQVLESMEANTPPRPQGWGPETPRK 0 0 VKGLPLLDPRA* 0 >MEL1_echTel Echinops telfairi (hedgehog) fragment 0 MNLPSGSRVPSGPAQEPSHVATAASASRWNS GSSLDAPPPSSPT 0 0 AARAPTAASAPFPIVDVPDHVHYTLGTVILLVGLTGMLGNLMVIYTFCR 2 1 SRSLRTPANMLIINLAVSDFLMSFTQAPVFFTSSLYKQWLFGEA 1 2 GCEFYAFCGALFGITSMITLTAIALDRYLVITRPLATIGVVSKRRAALVLLVIWLYALAWSLPPFFGW 1 2 SAYVPDGLLTSCSWDYMSFTPSVRAYTMLLFCFVFFLPLLVIIYCYIFIFRAIRETGR 2 1 ALQTFGACEGSRDSPRQRQRLQSEWKMAKIALLVISLFVLSWAPYSTVVLVAFAG 2 1 2 1 0 0 GWTDTEVTATSGYTQQVSGRCPRGQDLESMDANPTPRPRGWETETAQK 0 0 IKGLPLSLNPQA* 0 >MEL1_smiCra Sminthopsis crassicaudata (dunnart) DQ383281 0 MNPSPMLRHLSCPAQDSNCTKIMASISEWNNTEVDAYHLVDLPPITPT 0 0 AVVLPPYSQKVFPTADVPDYAHYTIGATILVVGFTGVLGNLLVIYTFCR 2 1 SRSLRTPANMFIINLAISDFFMSFTQAPVFFASSLYERWIFGEK 2 1 GCEFYAFCGALFGITSMITLMVIALDRYFVITRPLASIGMISKKKTGLILLGVWLYSLAWSLPPFFGW 1 2 SAYVPEGLLTSCSWDYTTFTPSVRAYTILLFCFVFFIPLTVIIYCYIFIFRAIKDTNK 2 1 AVQNIGSSEHTPSLRHFQRMKNEWKMAKIALVVILLFVLSWAPYSTVALVAFAG 2 1 
YSHVLTPYMNSVPAIIAKASAIHNPIIYAISHPKYR 2 1 MAIAQNFPCLRAVLGIRHPRTQSFSSYRFTHRSTTASQASDISWQSRGRRQLSLGSESEA 0 0 GWNNIETGLTLRSLEGSCGMDEETMDTRELSASTKAKGQSWETLAKTLEE 0 0 MDDLSLLEAGTLLSSLDLQI* 0 >MEL1_monDom Monodelphis domestica (opossum) Gq -GRID1 -WAPAL +LDB3 +BMPR1A 483 aa 0 MNPSPMLRGLSCPAQDTNCTKIMASMSEWNNTEEDAYHLVDLPSIAPT 0 0 AVVLPPSSQNIFPTVDVPDHAHYTIGAIILAVGITGMLGNFLVIYTFCR 2 1 SHSLRTPANMFIINLAISDFFMSFTQAPVFFASSMYKRWIFGEK 1 2 ACEFYAFCGALFGITSMITLMAIALDRYFVITRPLASIGVISKKKTGFILLGVWLYSLAWSLPPFFGW 1 2 SAYVPEGLLTSCSWDYTTFTPSVRAYTMLLFCFVFFIPLIVIIYCYIFIFRAIQDTNK 2 1 AVHSIGSGESTASPRHCQRMKNEWKMAKIALVVILLYVLSWAPYSTVALVAFAG 2 1 YSHILTPYMNSVPAIIAKASAIHNPIIYAISHPKYR 2 1 MAIAQNFPCLRALLCVRHPRTRSFSSYRFTRRSTMTSQASDISWLPRGRRQLSLGSESEI 0 0 GWNNMEAGTTSLTSRNQQGSCRMDQETMETRELAAIAKAKGRSWETLEK 0 0 TLEEMDDSSLLEVSVDMEQ* 0 >MEL1_ornAna Ornithorhynchus anatinus (platypus) Gq fragment 0 0 0 FPTADVPDHAHYTIGATILAVGFTGVLGNLLVIYTFCR 2 1 SRSLRTPANMFIINLSISDFFMSLTQAPVFFASSLHKRWIFGEK 1 2 GCQLYAFCGALFGITSMITLTVIALDRYFVITRPLASIGVISKKRALLILTGVWFYSLAWSLPPFFGW 1 2 sAYVPEGLLTSCSWDYMTFTPPVRAYTMLLFCFVFFIPLIMIIYCYFFIFRAIRGTNK 2 1 AVETIGSDDCRGSQRQCQRMKNEWKTAKIALMVILLYVISWCPYSVVALVAFAG 1 YSHLLTPYMNSVPAVIAKSSAIHNPIIYAITHPKYR 2 1 MAITKYIPCLGPLLRVSRQDSRSSSHYASSRRSTVTSQSLDGSWLPGRRRPLSSASDSES 0 0 0 0 * 0 >MEL1_anoCar Anolis carolinensis (lizard) diverged frag 0 0 0 ERTMFNLPDPFPTVDVPTHAHYTIGAVILVVGITGTLGNLLVIYVFFR 2 1 IRGLRTPANMFVINLAVSDFL 1 2 GCELYAFCGALFGIASMITLTVIALDRYFVITRPLASIGAMSTKKALLILSGVWLYSLAWSLPPFFGW 1 2 sAYVPEGLLTSCSWDYITFTPSVRAYTMLLFCFVFFIPLIAIIYSYVFIFIAIKNSNR 2 1 AVQRTNSDNSKEGQKLYQKLKNEWKMAKVALIVILLVISWSPYSVVALVAFAG 2 1 YSHLLTPYMNSVPAVIAKASVIHNPIIYAIVHPKYR 2 1 MAIAKFLPCLGSLLRVPRKDSSYPSTRRPTVTSQSSDINGVPRGHRRLSSVSDSES 0 0 DWTDTEADISSQNSRVASGSISYRIYEDTTETIKVKSKMRSHDSGIFER 0 0 0 0 TGEDLNAFGWRREESYSGPSTSSQIPSIIVTFSNVQRTDLPLESSSGALCSRNSSYSWEKDSNS* 0 >MEL1_taeGut Taeniopygia guttata (finch) short exon 1 -GRID1 +LDB3 synteny 0 MDLLLRAPT 0 0 KMTVQDVPRAFPTVDVPDHAHYTIGVVILIVGITGTLGNFLVFYAFCR 2 1 
SRSLQTPANILIINLAISDFLMSITQSPVFFTSSLYKHWIFGEK 1 2 GCELYAFCGALFGITSMITLMVIALDRYFVITKPLASVGVTSKKKALIILVGVWLYSLAWSLPPFFGW 1 2 sAYVPEGLLTSCSWDYMTFTPSVRAYTMLLFCFVFFIPLIAIIYSYVSIFEAIKKANK 2 1 SIQTFGCKRGNREFQKQYQRMKNEWKMAKIALIVILFFVISWSPYSVVALVAFAG 2 1 YSHVLTPFMNSIPAVIAKASVIHNPIIYAITHPKYR 2 1 KAIATYVPCLGPLLRVSPKDSRSFSSYHSSRRATISSQSSEISGLQERKRRLSSLSDSES 0 0 GCTETETDTPSMFSRLARRQISYKTDKDTTQTSDIRAKLTSQDSGNCGK 0 0 TAVDADDILMVELNVTEYMATPTVRTILLIFGNKN 0 0 KSESLNSIGQRREFHQGSSSAQIPSITITCSSVQGIELPSRYNSGFLYPKSSSHKQNKKSSS* 0 >MEL1_galGal Gallus gallus (chicken) Gq short exon 1 synt(-GRID1 -WAPAL +LDB3 +BMPR1A) 529 aa 16856781 AY88294 melanopsin OPN4m 0 MDLPPRAPT 0 0 KMTVKDVRGAFPTVDVPDHAHYTIGTVILIVGITGTLGNFLVIYAFCR 2 1 SRTLQKPANIFIINLAVSDFLMSITQSPVFFTNSLHKRWIFGEK 1 2 GCELYAFCGALFGITSMITLMVIALDRYFVITKPLASVRVMSKKKALIILVGVWLYSLAWSLPPFFGW 1 2 SAYVPEGLLTSCSWDYMTFTPSVRAYTMLLFCFVFFIPLIAIIYSYVFIFEAIKKANK 2 1 SVQTFGCKHGNRELQKQYHRMKNEWKLAKIALIVILLYVISWSPYSVVALVAFAG 2 1 YSHVLTPFMNSVPAVIAKASAIHNPIIYAITHPKYR 2 1 TAIATYVPCLGFLLRVSPKESRSFSSYPSSRRTTITSQSSETSGLQKGKRRLSSISDSES 0 0 GCTDTETDITSMISRPASSQVSYEMGEDTTQTSDLGGKPKVKSHDSGIFRK 0 0 TVVDADEIPMVEINDTEHSATSTCKTSEKCNVEEIQ 0 0 RSESLSGIGLREGESRHRTSASQIPSIIITYSNVQGVELHSGYSAGFLHPKNKSHKQNKSSNS* 0 >MEL1_xenTro Xenopus tropicalis (frog) Gq synt(-GRID1 -WAPAL +LDB3 +BMPR1A) 596 aa 16856781 DQ384639 melanopsin OPN4m 0 MNYQSVRKGITCPPQDANCSRILESLNSWNNSEVNSYKLVELPPIVTT 0 0 ETPQYEIHHVYPTVDVPDHVHYVVGAVILAVGITGMLGNFLVIYAFCR 2 1 SRSLRSPANMFIINLAITDFLMSVTQAPVFFATSLHKRWIFGEK 1 2 GCELYAFCGALFGITSMITLMVIAVDRYFVITRPLTSIGVMSKKRAVLILSGVWLYSLAWSLPPFFGW 1 2 SAYVPEGLLTSCTWDYMTFTPSVRAYTMLLFCFVFFIPLFIIIYCYIFIFKAIKNTNR 2 1 AVQKIGTDNNKESHKQYQKMKNEWKMAKIALIVILLYVVSWSPYSTVALLAFAG 2 1 YASILTPYMNSVPAVIAKASAIHNPIIYAITHPKYR 2 1 MAIAKYIPCLGSLLRVKRRDSRSYSSYPSSRRSTVTSHCSQSSDVGGHPKLKNHLPSVSDSES 0 0 GWTDTEADSSVNSRPASRQVSYEMGKDTTETNDLKSKAKLKSHDSGIFEK 0 0 TSMDADDISLVELGTVDRSSPIM 0 0 ANKHLNGLGQRKGDSFTRRSPSSRIPSIVVTHSNHQGSPAAVRHNSTLPGIKVSNSQDREKELKRQIEKVKQYVPIVTITSDTENSTGGFSNELLPANTS* 0 
>MEL1_danRer Danio rerio (zebrafish) Gq synt(- +USP54 +LDB3 +BMPR1A) 594 aa no_ref AY078161 melanopsin OPN4m 0 MMSGAAHSVRKGISCPTQDPNCTRIVESLSAWNDSVMSAYRLVDLPPTTTTTTSVA 0 0 MVEESVYPFPTVDVPDHAHYTIGAVILTVGITGMLGNFLVIYAFSR 2 1 SRTLRTPANLFIINLAITDFLMCATQAPIFFTTSMHKRWIFGEK 1 2 GCELYAFCGALFGICSMITLMVIAVDRYFVITRPLASIGVLSQKRALLILLVAWVYSLGWSLPPFFGW 1 2 SAYVPEGLLTSCTWDYMTFTPSVRAYTMLLFIFVFFIPLIVIIYCYFFIFRSIRTTNE 2 1 AVGKINGDNKRDSMKRFQRLKNEWKMAKIALIVILMYVISWSPYSTVALTAFAG 2 1 YSDFLTPYMNSVPAVIAKASAIHNPIIYAITHPKYR 2 1 LAIAKYIPCLRLLLCVPKRDLHSFHSSLMSTRRSTVTSQSSDMSGRFRRTSTGKSRLSSASDSES 0 0 GWTDTEADLSSMSSRPASRQVSCDISKDTAEMPDFKPCNSSSFKSKLKSHDSGIFEK 0 0 SSSDVDDVSVAGIIQPDRTLTN 0 0 AGDITDVPISRGAIGRIPSIVITSESSSLLPSVRPTYRISRSNVSTVGTNPARRDSRGGVQQGAAHLSNAAETPESGHIDNHRPQYL* 0 >MEL1_danRer Danio rerio (zebrafish) Gq synt(- +USP54 +LDB3 +BMPR1A) 473 aa melanopsin OPN4m 0 QVAMVQDVRHPFPTVDVPDHAHYTIGSVILAVGITGMVGNLLVMYAFCK 2 1 SRSLRTPANMFIINLAVTDFLMCVTQTPIFFTTSLHKRWIFGEK 1 2 GCELYAFCGALFGICSMITLMIIAVDRYFVITRPLASIGVMSRKRALLILSAAWAYSMGWSLPPFFGW 1 2 SGAYVPEGLLTSCSWDYMTFSPSVRAYTMLLFTFVFFIPLFVIIYCYFFIFKAIRETNR 2 1 AVGKINGEGGPRDSIKKIHRMKNEWKMAKIALIVILLYVISWSPYSCVALTAF 2 1 YADMLTPYMNSVPAVIAKASAIHNPIIYAITHPKYR 2 1 SAIAKYIPCLGVLLCVPRRDRFSSSSFISTRRSTLTSQSSETSSNLHRAGKARLSSVSDSES 0 0 GWTDTEADLSTASSRPASRQVSSEIRKDLCDIKHSSSLRLKVKSRDSGIFDR 0 0 0 0 QNDVSEKADEKRPLVRIPSIIVTSETCPAVLPAGHSSRLIPGAPAVTDS* 0 >MEL1_takRub Takifugu rubripes (fugu) Gq synt(- +USP54 +LDB3 +BMPR1A) 555 aa melanopsin OPN4m 0 MNFGKSALQPPAQQSVVSCGGGGPEPNCTLRLAVTVMMSVRLAELQLHAST 0 0 LQVAMVRPFPTVDVPDHAHYTIGSVILVIGITGMIGNFLVIYAFCR 2 1 SRSLRTPANMFIINLAVTDLLMCVTQTPIFFTTSMYKRWIFGEK 1 2 GCELYAFCGALFGICSMITLTVIAIDRYFVITRPLTSIGVLSRKRAFVILMTVWIYSLGWSLPPFFGW 1 2 SAYVPEGLLTSCTWDYMTFSPSVRAYTMLLFIFVFFLPLFIIIYCYFFIFRAIRATNK 2 1 AVGKVNGSVHSHSRRRESVKNFQRLQNEWKMAKIALMVILLYVISWSPYSCVALTAFAG 2 1 YADMLTPYMNSVPAVIAKASAIHNPIIYAITHPKYR 2 1 LALAKYIPCLGFLLCISPHELQSTSSSFMSLRRSTVTSQTSDISGQFRPQSKPRRSSASDSES 0 0 CLTDTEADLSSMGSRPASRQVSCDISRDTTELPEYKPASSFNSKVKSPDSGIFEK 0 0 
TSFDFDASMAASRERSSIPN 0 0 SGEFPEGHVMRRTLARIPSIIITSESSHFLPNGRKASSTTCIANGSDIKVGPR* 0 >MEL1_gasAcu Gasterosteus aculeatus (stickleback) Gq synt(- - +LDB3 +BMPR1A) 556 aa melanopsin OPN4m 0 MNAGESELLLPTQQSILPCGDHEPNCPVAQAETLALSAASANGSA 0 0 VQVAMVSRAPHPYPTVDVPDHAHYTIGSVILAIGITGIIGNVLVIYAFSK 2 1 SRSLRTPANMFIINLAITDLLMCVTQAPIFFTTSMHKRWIFGEK 1 2 GCELYAFCGALFGICSMITLTVIALDRYFVITRPLTSIGMMSRRRALLILMGAWTYSLGWSLPPFFGW 1 2 SAYVPEGLLTSCTWDYMTFTPSVRAYTMLLFIFVFFLPLFIIIYCYFFIFRAIRVTNR 2 1 AVGKMNGSIHSHGSGRDSTKNFHRLQNEWKMAKIALIVILLYVVSWSPYSAVALTAFAG 2 1 YADMLTPYMNSVPAVIAKASAIHNPIIYAITHPKYR 2 1 IALAKYIPFLGVLLCVPPRELRSASSSFRSTRRSTVTSQTSDVSSQQRRQGSRNSRLSSASDSES 0 0 CLTDTEADGSSVGSRPASRQVSCDIGRDTAELPEFKPSSSFKSKMKSHDSGIFEK 0 0 SYDTDISMAGVSERGSIPN 0 0 QTDFAEGRDRRSTIGRIPSIVITSETSPFLPTGRNGSCNGRPKTANSSHPGAGSG* 0 >MEL1_oryLat Oryzias latipes (medaka) Gq synt(- +USP54 +LDB3 +BMPR1A) 504 aa melanopsin OPN4m 0 LQVAMVPQTFHPFPTVDVPDHAHYTIGSVILAIGITGIIGNFLVIYAFSR 2 1 SRSLRTPANMFIINLAITDLLMCVTQSPIFFTTSMHKRWIFGEK 1 2 GCELYAFCGALFGICSMITLTVIAIDRYFVITRPLTSIGVLSRKRALLILSAAWAYSLGWSLPPFFGW 1 2 SAYVPEGLLTSCTWDYMTFTPSVRAYTMLLFIFVFFLPLFIIIYCYVFIFRAIRSTNR 2 1 AVGKINGNTRDAVKSFNRLQNEWKMAKIALIVILLYVISWSPYSTVALTAFAG 2 1 YADMLTPYMNSIPAVIAKASAIHNPIIYAITHPKYR 2 1 MALAKYIPGLGVLLCIHPKDLRSASSSFVSTRRSTVTSQSSDISSQLRRQSTFKSRLSSLSDSES 0 0 GLTDTEADLSSLSSRPASRQVSCEISRDTAELPDFKHTSSFKAKLKNNDSGIFEK 0 0 TSFDTVSIGGVSEHNSIPS 0 0 NRDFGDGNVTRATIGRIPSIVVTSEMSPFLPVGRNGSRTNRSKMANSSAGAGPV* 0 >MEL1_calMil Callorhinchus milii (elephantfish) Gq synt(- - - -) 369 aa melanopsin OPN4m 0 ASVTDAQHHHMFPTVDVPDHAHYIIGATILAVGVTGMVGNFLVIYAFLR 2 1 SRSLRTPANTFIINLAATDFLMSVTQSPIFFITSIHKRWIFGEK 1 2 GCELYAFCGALFGITSMITLMVIALDRYFVITRPLASIGVLSHRRAGLIILSLWLYSLAWSLPPFFGW 1 2 SAYVPEGLLTSCTWDYMTFTPSVRAYTMLLFCFVFFIPLGVIIYCYIFIFRAIKSTNK 2 1 KVGGSTNRESQKQHQRMKNEWKMAKIALIVILLFVISWSPYSTVALTAFAG 2 1 YADMLTPYMNSVPAVIAKASAIHNPIIYAITHPKYR 2 1 MAIAKYVPLLGLLLRVSRRDSRTSGQYYSTRRSTLTSQTSDLSGYPRGKGRLSSASDSES 0 >MEL1b_calMil Callorhinchus milii (elephantfish) Gq 113 aa no_ref 
EB687868 melanopsin OPN4m 1 SKSLRTPANMFIINLAISDFFMSATQPPVFFVTSLHKRWIFGEK 2 GCKLYAFCGALFGITSMITLMAISIDRYWVITKPLQSISSTTTKKNTLKVIILVWLYSLAWSLPPLLGW 1 >MEL1_petMar Petromyzon marinus (lamprey) first exon uncertain frag 0 MAPSIVPLMRKLSCPLSRKTPPKHNVSTSMFDGGDGVLGVIGSIP 0 0 VSFLPSQREKFPMLDIPHHVHHTIGSVVIAIGFTGIIGNFLVIYVFCR 2 0 VQVAMVSRAPHPYPTVDVPDHAHYTIGSVILAIGITGIIGNVLVIYAFSK 2 1 SKSLRSPANIFIINLAFADFFMSITQTPIFFVTSLHKRWIFGEK 1 2 GCELYAFCGALFGIASMVTLMVIATDRYLVLTRPLASIGAMSKRRAMYITAAVWFYSLAWSLPPFFGW 1 2 SAYVPEGLMTSCTWDYVTFTPAVRSYTMLLFCFVFFIPLIVIIFCYVRIFAAIKNTNR 2 1 AVKTLGDAHDSKESQKQQQRMNAEWKLAKIALIVILLYVVSWSPYSCVALVAWAG 2 1 YADMLTPYMNSVPAIIAKASAIHNPIVYAITHPKYR 2 >MEL1_cioInt Ciona intestinalis (tunicate) AABS01000008 3 transcripts BW447434, BW019524, BW048729 391 aa frag rough seq 0 DRRYPSCYKGEQVTFIYLIPIFLFRTLAFLSSVFVTMTPSAVLPLVTTEARPVHPINDVAMYTFGGLMLTAGTVAVVGNIMVMYTFLRR 0 1 PLHWFIVQLAVADFFVGLIVLWIGTFSSLFLDTVSLMSGIATYGVLAAATSTSTLGVLFIAVDRHFYILRHRRYKQIMTRLRVGTAIVVACVVPATFFVVVPAFGWN 1 2 PEYIEEPLIPSCIFDRFTNSLSNRLYIITMCTFVFFIPLVFICYCLYRIFWAVKSSS 0 2 SLSRAGSRKSQSSGKLSKSNSSRSKRGIQTIEFQILKSAVLLVVLFVSSWMPFTVAAIISIGSNQVSPYVILVSYLFAKASCVHSSFAYITNAHFRATIGLIRCKHTHRA* 0 >MEL1_cioSav Ciona savignyi (tunicate) AABS01000008 0 transcripts frag rough seq FTTLATLAVIPPTMVLNNSTHPIKVSALLAFGSLMLCAGIIAIVGNLVVMYTFLR PLNWFILQLAVADFFVGLIVLWIGVFSSLLLETVSLMSGVATYGVLAAATSTSTLGVLFIAVDRHFYILKHRRYKSIMTRVKVGVAIFFACVTPLAFFVAAPAAGWN 1 DYIAEPLIPSCIFDRFTTSLANQTYIITMCAFVFFVPLLFICYCLVRIYKAVKTSS QTVEFQILKSAVLLMVLFVSSWMPFTVAAMISVVAEQVDPYVILVSYLFAKASCVHSSFAYITNAHFRATLGVLRCKKRRSV* 0 >MELmop_braFlo Branchiostoma floridae (amphioxus) Gq 709 aa melanopsin Amphi-mop bblast 89% 12 exons 0 MTELPSFQPPTNSTEEENAVFPTALTEWISE 0 0 VGNQVGEAALKLLSGEGDGMEVTPTPGCTGNASVCNGTDSGGGVVWDIPPLAHYIVGTAVFCVGCCGMFGNAVVVYSFIK 2 1 SKGLRTPANFFIINLALSDFLMNLTNMPIFAVNSAFQRWLLSDF 1 2 ACELYGFAGGLFGCLSINTLMAISMDRYLVITKPFLVMRIVTKQR 0 0 VMFAILLLWIWSLVWALPPLFGWSAYVPEGF 1 2 GTSCTFDYMTPKLSYHIFTYIIFFTMYFIPMGVIIYCYYNIFATVKSGDKQFGKAVKEMAHEDVKNK 0 0 
AQQERQRKNEIKTAKIAFIVITLFLSAWTPYAVVSALGTLGYQDLVTPYLQSIPAVFAKSSAVYNPI 1 2 VYAITHPKFRAAVKKHIPCLSGCLPADEEETKTKTRGATTTASMSMTQTTAPTV 0 0 HDPQASVHSGSSVSVDDSSGVSRQDTMMVK 0 0 VEVDNRMEKAGGGAADTAPKDGTSVPTVSAQIEVRPSGNVNTKAEVIPSPQSAAVAHGASASPVPK 0 0 VAELSSSVSLESAAIPGKIPTPLPSQPIAAPIERHMAAMADDPPPKPRGVATTVNVRRSESGYERSQDSLRKK 0 0 AVSETRSRSFNSTKDHFASERQTSTTLNQPRDMYSGDMVKKTRQSPEKQEYDNPAFDAGIAEIDTDSENETEGSYDMLSVRFQAMAEEPPVETYRKASDMSINLGKASLMLTEAHDETVL* 0 >MELmop_braBel Branchiostoma belcheri (amphioxus) Gq 15936279 AB205400 Amphi-mop 0 MTEIPSFQPPINATEVEEENAVFPTALTEWFSE 0 0 VGNQVGEVALKLLSGEGDGMEVTPTPGCTGNGSVCNGTDSGGVVWDIPPLAHYIVGTAVFCIGCCGMFGNAVVVYSFIK 2 1 SKGLRTPANFFIINLALSDFLMNLTNMPIFAVNSAFQRWLLSDF 1 2 ACELYGFAGGLFGCLSINTLMAISMDRYLVITKPFLVMRIVTKQR 0 0 VMFAILLLWIWSLVWALPPLFGWSAYVSEGF 1 2 GTSCTFDYMTPKLSYHIFTYIIFFTMYFIPGGVMIYCYYNIFATVKSGDKQFGKAVKEMAHEDVKNK 0 0 AQQERQRKNEIKTAKIAFIVISLFMSAWTPYAVVSALGTLGYQDLVTPYLQSIPAMFAKSSAVYSPI 1 2 VYAITYPKFREAVKKHIPCLSGCLPASEEETKTKTRGQSSASASMSMTQTTAPV 0 0 HDPQASVDSGSSVSVDDSSGVSRQDTMMVK 0 0 VEVDKRMEKAGGGAADAAPQEGASVSTVSAQIEVRPSGKVTTKADVISTPQTAHGLSASPVPK 0 0 VAELGSSATLESAAIPGKIPTPLPSQPIAAPIERHMAAMADEPPPKPRGVATTVNVRRTESGYDRSQDSQRKK 0 0 VVGDTHRSRSFNTTKDHFASEQPAALIQPKELYSDDTTKKMARQSSEKHEYDNPAFDEGITEVDTDSENETEGSYDMLSVRFQAMAEEPPVETYRKASDLAINLGKASLMLSEAHDETVL* 0 >MEL6_braFlo Branchiostoma floridae (amphioxus) Gq 402 aa melanopsin Amphiop6 bblast 68% 0 MSPNLTNTSLLPNRTDRPELSPADVTMQLVFGSMMLVFGLIGVVGNAVALYAFCR 2 1 SRSLRRPKNYLIANLCLTDMVVCLVYSPIIVTRSLSHG 2 1 LPSKESCIVEGFVVGLGSIVSICSLAGIAVERYVTITQPIKSLSILTHRALLGAVSAVWVYAFLLAFPPLVGWGRYVSEESKISCTFDYLSTDDATRAHVIVLVIGAFGLPFSVITYCYV RSFATVRKCTKERKQMSPLAKSDSRSEVKAAVNSFVITTSFCLCWCPYAVVATMGVSGFTVHSHAVFIAALLAKLSVLFNPVAYVLSIPSFRKALFSSSNDRTKYQTAFTFESLAKTSPV ERKWCADSIERYRSSNVNIESTELTVPYSASRESCLLSRAATERLAGRSPSLTDIVREFGLQQTASHRETWV* 0 >MEL6_braBel Branchiostoma belcheri (amphioxus) Gq Amphiop6 AB050611 0 MSSNLTNVSLVANRTDQTELSPTDVTMQLIFGSMMLVFGLIGVVGNVVALYAFCR 2 1 TRSLRRPKNYVVANLCLTDMFVCLVYCPIVVSRSFSHG 2 1 
FPSKESCIVEGFMVGVGSIASICSLAAIAVERYLSVTQPLKSLTILTQRKLLVAVLTVWVYSLLLAFPPLVGWGRYVREETYISCTFDYLSTDDATRAYVITLVMGAFGFPLLTIAYCY IRVFTTARKHAEERKFMSPLKRPESRTEIKTAVTACVITTSFCLCWCPYAVVATLGISGVSVQQQTVFSAALLAKLTVIINPIVYVLSIPNFRKALFAQEREKYASEDVVLTSLPGKTRRMK KVERSQSSNSNVVIEVKESSMAYSTSRESCLLSRAATKRLAGKTKSIVDLVDEFGLQETAPHKESLV* 0 >MELx_braFlo Branchiostoma floridae (amphioxus) XP_002586120 0 MDRMSPNLTNTSLLPNRTDRPELTPADVTMQLVFGSMMLVFGLIGVVGNAVALYAFCS 2 1 TRKLRRPKNYVVANLCLTDLIMCIVYCPVIVISSFSGR 2 1 IPTDGACTMEGFVVGMASIASVGSL 0 0 VAIAVERFFSITRPMKSLTILTKRTFLGGVAVVWLYSLILVIPPLLGWGRYVREETKLSCSFDYLSTDDANRSYVIWLVIVAFGLPLLVIAYCYISVFITVKKCTKKRKLMSPHKKSRSEVKTAVNAFIMTTAFCLCWCPY AVVATMGISGSSVQGTVVFGAALLAKLSVLINPVAYVFSIPSFRKALFGHRKRGYGTSDGLANDSSSEKRRGKKHESDANSATEYLRLTVFYSMYSRTGQLSPWASKRLAGKTKSMLDLTTEYGRLERTAHKESWV* 0 >MEL2_galGal Gallus gallus (chicken) Gq synt(+GRID2+SMARCAD1 -PGDS -SEC24B +COL25A1) 544 aa 17977531 AY882944 0 MGTQPHSVTKSEIPDHVLYTVGTCVLVIGSIGIIGNLLVLYAFYS 2 1 NKKLRTPQNFFIMNLAVSDFLMSASQAPICFVNSLHREWILGDI 1 2 GCDLYAFCGALFGITSMMTLLAISVDRYLVITKPLRSIQWTSKKRTIQIIAAVWLYSLGWSVAPLLGW 1 2 SSYVPEGLMISCTWDYVTYSPANRSYTMILCCCVFFIPLIIILHCYLFMFLAIRSTGR 2 1 DVQKLGSCSRKSFLSQSMKNEWKLAKIAFVVIIVYVLSWSPYACVTLIAWAG 2 1 RGNTLTPYSKSVPAVIAKASAIYNPIIYAIIHPRYR 2 1 KTIHNAVPCLRFLIRISKNDLLRGSINESSFRTSLSSHQSLAGRTKNTCVSSVSTGEA 0 0 NWSDVELDTVEPAHEKLQPRRSHSFSSSLRQKRDLLPDSYSCSEETEEK 0 0 VSLSSSYLEKVLGRSAFPSSPVALVTSSLRAASLPVGLNSSSASRGAGSDISQMKTEESHNNGGLDSIVSNTVPQIIIIPTSETNLFQEEPEEEETELFHFHDKKNNLLDLEGLSSSTEFLEAVEKFLS* 0 >MEL2_anoCar Anolis carolinensis (lizard) Gq synt(+GRID2+ SMARCAD1 -ATOH1 +PDLIM5 +BMPR1B) 290 aa 0 MGPHHRTKVDVPDHVLYTVGSCVLVIGCIGITGNLLVLYAFYS 2 1 NKRLRTPPNYFIMNLAVSDFLMSATQAPICFLNSMHKEWVLGDI 1 2 GCNLYAFCGALFGITSMITLLAISVDRYCVITKPLQSIKRTSKKRTCIIIVFVWLYSLGWSVCPLFGW 1 2 SSYIPEGLMISCTWDYVTYSPANRSYTMMLCCCVFFIPLVIIFHCYIFMFLAIRSTGR 2 1 RKSSISHSIKSEWKLAKIAFVAIVVFVLSWSPYACVTLISWAG 2 1 YARTLTPYSKSVPAVIAKASAIYNPIIYAIIHPRYR 2 1 RTIRSAVPCLRFLIPISKSDLSTSSMSESSFRASVSSRHSFSYRNKSTYISSISAKET 0 0 
TWCDVELDPVESGHKKLQAYRSNSFSAKGVAEEESGLLLRTNNCNVPARKK 0 >MEL2_xenLae Xenopus laevis (frog) Gq synt(SMARCAD1 +PDLIM5 +BMPR1B) 535 aa melanopsin Xmop 21 0 0 0 MDLGKTVEYGTHRQDAIAQIDVPDQVLYTIGSFILIIGSVGIIGNMLVLYAFYR 2 1 NKKLRTAPNYFIINLAISDFLMSATQAPVCFLSSLHREWILGDI 1 2 GCNVYAFCGALFGITSMMTLLAISINRYIVITKPLQSIQWSSKKRTSQIIVLVWMYSLMWSLAPLLGW 1 2 SSYVPEGLRISCTWDYVTSTMSNRSYTMMLCCCVFFIPLIVISHCYLFMFLAIRSTGR 2 1 NVQKLGSYGRQSFLSQSMKNEWKMAKIAFVIIIVFVLSWSPYACVTLIAWAG 2 1 HGKSLTPYSKTVPAVIAKASAIYNPIIYGIIHPKYR 2 1 ETIHKTVPCLRFLIREPKKDIFESSVRGSIYGRQSASRKKNSFISTVSTAET 0 0 VSSHIWDNTPNGHWDRKSLSQTMSNLCSPLLQDPNSSHTLEQTLTWPDDPSPKEILLPSSLKSVTYPIGLESIVKDEHTNNSCVR NHRVDKSGGLDWIINATLPRIVIIPTSESNISETKEEHDNNSEEKSKRTEEEEDFFNFHVDTSLLNLEGLNSSTDLYEVVERFLS* 0 >MEL2_danRer Danio rerio (zebrafish) Gq synt(- +FLJ39155 +PDLIM5 -) 346 aa 0 MEPQRQIYKRLDVPDHVHYIIAFLILIIGTLGVSGNALVMFAFYR 2 1 NKKLRSLPNYFIMNLAVSDFLMAITQSPIFFINCLYKEWMFGEL 1 2 GCKIYAFCGALFGITSMINLLAISIDRYLVITKPLQTIQWNSKRRTGLAILCIWLYSLAWSLAPLIGW 1 2 GSYIPEGLMTSCTWDYVSPSPANKSYTMMLCCFVFFIPLSIILYCYLFMFLSVRQASR 2 1 QKSSFVKQQSMRSEWKLAKIAAVVIVVYVLSWAPYACVTLVAWAG 2 1 HQDVLTPYSKTLPAVLAKSSAIYNPFIYAIIHNKYR 2 1 RTLAEKVPGLSCLSRSQKDGLSSSTNSDASAQDSSVSRQSSVSKNRLHSTMVQ* 0 >MEL2_tetNig Tetraodon nigroviridis (pufferfish) Gq synt(- - - +BMPR1B) 404 aa 0 MEPKDTHITSSFFSKVDVPDHVHYIIAFFVFVIGILGITGNVLVIFAFYS 2 1 NKKLRSLPNYFIVNLAVSDLLMASTQSPIFFINLYKEWMFGET 1 2 ACKMYAFCGALFGITSMINLLAISVDRYVVITKPLQTIRRSSKRRTALAILMVWLYSLAWSLAPLVGW 1 2 GSYIPEGLMTSCTWDYVTYTLANRSYTMMLCCFVFFIPLAIILCCYLLMFLAIRKTSR 2 1 RKSTLIQQKSIRSEWKLAKIAFVVIVVYVLSWSPYACVTLISWAG 2 1 YGSTLTPYSKSVPAVIAKASAIYNPIIYAIIHPRYR 2 1 KTIRSAVPCLRFLIPISKSDLSTSSMSDSSFRSALSCRHSYRSRSTYISSISAKET 0 0 TWCDVELDPVESGHKKLQAYRSNSFSAKGVAEEESGLLLRTNNCNVPARKK 0 >MEL2_gasAcu Gasterosteus aculeatus (stickleback) Gq synt(KNTC2 +FLJ39155 +PDLIM5 +BMPR1B) 353 aa 0 MEPDNAHTQRSFINKVDVPDHAHYIVAVFVVVIGTLGITGNALVMLAVYS 2 1 NKKLRNLPNYFIMNLAVSDFLMAFTQSPIFFINCLYKEWAFGET 1 2 GCKIYAFCGALFGIASMINLLAISIDRYLVITKPLQAIHWGSKRRTTLAILLVWLYSLAWSLAPLVGW 1 2 
GSYIPEGLMTSCTWDYVTYTLANRSYTMMLCCFVFFIPLGIILYCYLFMFLAIRKTSR 2 1 RKSTLIKQKSMKSEWKLAKIAFVVIVVYVLSWSPYACVTLISWAG 2 1 HADILSPYSKAVPAIIAKASAIYNPFIYAIIHNKYR 2 1 MTLAAKFPCLRFLSPTPRKDTSSSISESSYRDSVISRQSTASRTHFITACPDTVN 0 >MEL1_strPur Strongylocentrotus purpuratus (sea_urchin) frag GLEAN3_22851 opsin4 no cdna losing introns, expressed in larval postoral arm 0 2 1 WTKSLRTPPNMLIVNLAISDFGMVITNFPLMFASTIYNRWLFGDA 1 2 GCQFYAFCGALFGIMSIANMTAIALDR 2 1 YYVICWSLEAVRSVTHRRSMIIIIIVWCYAIFWSIPPFFGVGSYVLEGYGLGCTFDFMTKDLNHYLHV SFLFASSFVVPVTIIIVCFTRIAITVRAHRHELNKMRTKLTEDKDKKHKSSIRRANKAKTEFQIAKVGFQVTIFYVLSWM PYSIVAVIGQYFDSDLLTPLGTVVPVIFAKCSAIWNPIIYCLSHEKFNAALKEKLMGMCGIEIPSKHRSMGSQESSVTGR RGMHRQNSSTLSESSVTSTVDQDAIELKDRKQGPATVKVQQEKVEGGTYRRNPGDVTFSKDAGVEVDEKRRGDQGQRDDR VRPQGEGQMDQWSQPPPAPASASAPTPGVNDKEYLTKM* 0 >MEL2_strPur Strongylocentrotus purpuratus (sea_urchin) cdna: S.droebachiensis DQ285097 retained intron: WLEKMKTTQILHKPVTFLRLKSSFEPRFKPRFKRRF tube feet 0 MPTTLMENSTPGWMADDSQMEETHPAFPLIGGYLLVVVLLGTAGNSLVIYTFLRFKKLHSPINLLIVNLSASDLLVATTG TPLSMVSSFYGRWLFGTNACAFYGFVNYYCGCISLNSLAAISVFRYIIVVRGQAQNNKLSLRSSIYAILVIHLYTLIFST PPLYGWNRFVLAGYHTSCDIDFHTKTPLFVSYICYMFFFLFFLPLGLISWSYFKIYQRVSKHSNSMRTSFTGVTKEINSDEKHA 2 1 NHRRTASTLFVTIVVFLFAWFPYCIVSLWVLIGDANSISKLSTTIPSLFAKSSVIYNPLIYVVLNSKFRKALIQTLSFLKCLSKHELSESS* 0 >TMT1_plaDum Platynereis dumerilii (ragworm) encephalopsin-class ciliary 310 aa 16311335 CT030681 htgs 5 exons 0 MDDLGFLGNSSVNYTVPLLQEDPLLLRILYFGPTSYVITAIYLCIVGVIGTLSNGVIMYLYFKDKSLRSPMNLLFVNLAMSDFTVAFFGAMFQFGLTCTRKYMSPGMALCDFYGFITFLG 1 2 GLASEMNLFIISVERYLAVVRPFDVGNLTNRRVIAGG 1 2 VFVWLYSLVFAGGPLVGWSSYRPEGLGTWCSISWQDRSMNTMSYVTAVFLGCYFFPVSIIIFCYFNVWRKVKE 0 0 AADAQGGAGTAGKAEKSIFRMSVIMVTCYLTAWTPYAIVCLIASYGPPNGLPIYAEVLPSLFAKSSQVYNPIIYVLMNKP 0 0 YRSALVSLVCRGRNPFDEAGGTAGGTTAKDETLGKGNKVAAA* 0 >TMT2_plaDum Platynereis dumerilii (ragworm) encephalopsin-class ciliary 355 aa 15514158 AY692353 lophotrochozoa polychaeta new genomic 0 
MDGENLTIPNPVTELMDTPINSTYFQNLNAETDGGNHYIYNAFTATDYNICAAYLFFIACLGVSLNVLVLVLFIKDRKLRSPNNFLYVSLALGDLLVAVFGTAFKFIITARKTLLREEDGFCKWYGFITYLG 1 2 GLAALMTLSVIAFVRCLAVLRLGSFTGLTTRMGVAAMA 1 2 FIWIYSLAFTLAPLLGWNHYIPEGLATWCSIDWLSDETSDKSYVFAIFIFCFLVPVLIIVVSYGLIYDKVRK 0 0 VAKTGGSVAKAEREVLRMTLLMVSLFMLAWSPYAVICMLASFGPKDLLHPVATVIPAMFAKSSTMYNPLIYVFMNKQ 0 0 FRRSLKVLLGMGVEDLNSESERATGGTATNQVAAT* 0 >MEL1_plaDum Platynereis dumerilii (ragworm) Gq 383 aa 11874910 AJ316544 rhabdomeric melanopsin unavailable genomically 0 MSRSEVLVPGSMSLDGLLTTAHPIGNDSI 0 0 ETILHPYWQQFDIENTIPDSWHYAVAAWMTFFGILGVSGNLLVVWTFLK 2 1 TKSLRTAPNMLLVNLAIGDMAFSAINGFPLLTISSINKRWVWGKL 1 2 WRELYAFVGGIFGLMSINTLAWIAIDRFYVITNPLGAAQTMTKKRAFIILTIIWANASLWALAPFFGW 1 2 GAYIPEGFQTSCTYDYLTQDMNNYTYVLGMYLFGFIFPVAIIFFCYLGIVRAIFAHHA 2 1 EMMATAKRMGANTGKADADKKSEIQIAKVAAMTIGTFMLSWTPYAVVGVFGMIK 2 1 PHSEMFIHPLLAEIPVMMAKASARYNPIIYALSHPKFR 2 1 AEIDKHFPWLLCCCKPKPKAQLPSSTTKGSIASKTEADTSV* 0 >MEL1_capCap Capitella capitata (polychaete_worm) jgi 119596 distally uncertain shares 2 introns with melanopsins 0 MMSYADGYLDNSTAPIEESPYLPHGTFFHPHWRPYREMLLNMNPLIYYGLGLYMAVVGIVGTLGNLVVITLFIK 2 1 TRSLRTPPNMFIINLALSDMGFCATNGFPLMTVASFQKLWRWGPV 1 2 ACELYALAGSITGFNSIATLALISMDRYMVIAKPFYAMKHVSHKRSLIQIILAWTWAFIWSAPPLLRMGYGRYIP EGFQVSCTFDYLSRDLKNLIFVWCLFVFGFFIPVLAIACSYVGIIRAVGAQSKEMRKTAEKMGAKTGKSDKEKKQDIAMAKVA AGTIGLFLMSWTPYAAVSMIGIAGNRSWITPYVSQIPVMFAKASAMWNPILYALSHPKFRAALEDHMPWLLVC >MEL1_helRob Helobdella robusta (leech) 0 MDSVTWYKDFHPHWWKYHDIIVNAPMAFYYFLGTFFAVVGFLGVFGNIIVVWVFSR 2 1 TPSLRTPSNVLVINLAICDILFSALIGFPMSALSCFQRHWIWGNF 1 2 YCQFYSFVAGITGLASINCLAVIAVDRYLVVGQPLAMLNQSHFRRSFYHVLIIWTWACVWSAMPLIGWGEYILEGFGVSCTFD YLTRTTWNISFNVCLFTFCFGMPVSVIILSYIGIIRSIAKNRKEFSSLTAENSSRARQEIKIAKVFAVCMTAFILCWVPYAT VAQLGIYGYDQMVSPYTAELPVMLAKTSALWNPIIYAFSHPKYRKCLKELPIFRQKRKFRFLGSKSHTKSSDGVGTIALTSTINRTATRH* 0 >MEL2_helRob Helobdella robusta (leech) frag from scaffold_39 1 TPILRTHANVLIINLALCDLIFSSLIGFPMTALSCFKRHWIWGDL 1 2 
GCDFYGFVAGWTGLGSITCLAFISIDRYMAIVHPFYMLNKKSSSVLTLLQIGAVWSWALIWSVMPLFGWGRFIPEAFGVSCTFDYLTRTWSNICFNYVLITCGFFLPVVIIVTSYIGIVIEVTKS 1 2 VEEDGMKDHMDAQNARCYVFVAVLSM 0 0 LKTAKVLACCFGAFLICWTPYAIVAQLGINGFAHLVTPFTSEVPVLFAKTSSIWNPLIYALSHPRYRRAV 0 >MEL1_schMed Schmidtea mediterranea (planaria) most like MEL1_plaDum extension of AF112361 0 eVYHYLVGVYISIVGISGVLGNLLVLYIFAR 2 1 AKSLRTPPNMFIMSLAIGDLTFSAVNGFPLLTISSFNTRWAWGKL 1 2 TCEIYGFIGGLFGFISINTMALISLDRYFVIAQPFQTMKSLTIKRAIIMLVFVWLYSLIWSTPPFFGY 1 2 GNYVPEGFQTSCTFDYLTQSKGNIIFNIGMYIGNFIIPVGIIIFCYYQIVKAVRVHELEMLKMAQKMNASHPTSMKTG 1 2 AKKADVQAAKISVIIVFLYMLSWTPYAIIALMALTGRRDHLNPYTAELPVLFAKTSAMYNPFIYAINHPKFRIQLEKKFPCLICCCPPKPK 0 0 * 0 >MEL1_schMan Schistosoma mansoni (trematode_worm) AF155134 11166392 381 aa 6 exons 0 MKQNLTFATLWPDDNDFASIVHSHWHKFIQPDPLYYYLVGIYIGIVGILAVMGNSLVITLFLL 2 1 CKQLRTPPNMLIVSLAISDFSFALINGFPLKTIAAFNHRWGWGKL 1 2 ACELYGFAGSIFGFISLTTMAFIALDRYLVIVQPFETFSRITYGKVIVMIFITWIWSALWSIPPFFGY 1 2 GSYIPEGFHTSCTFDYLSTDLPNLIFNAGLYILGFLCPVFIIIFSYYQIVKTVRLNELELMKMAQSLDLQNPSAMKTg 1 2 GDKKADIEAAKTSIILVLLYLMSWSPYAIVCLMTLIGSRDSLTPFHSELPVLFAKTSAVYNPIVYAVKHPKFRMEIEKRFPFLICCCPPKPK 0 0 ERLQNTIVSKIQVSQIGIGTVSGGNENTLNTVKRED* 0 >MEL2_schMan Schistosoma mansoni (trematode_worm) CD096414 12973350 (46%) 0 MSSNRTIEMLRPYMKDFDSIVLPYWYKFEQPNPYYQYAIGLFIAVVGITGMCLNLLVIVFFTM 2 1 FKSLRTPSNILVVNLAISDFGFSAVIGFPLKTMAAFNNFWPWGKL 1 2 ACDLYGLAGGLFGFVSLSTIAAVALDRYLVIATPFESVFQTTPRRTLLLMLFLWMWSLMWTIPPLFGF 1 2 GRYVTEGYQTSCTMDYISTDLNNRLFNIGLFGFGFLCPLFLSLFCYARIILIVRSRGKDFIEMAASSKGTNQKEKSANV 1 2 SSSKSDTFVSKSSAILLGVYLICWTPYSFVCLMALIGYADYITPLMVEIPCLCAKTANPCIYAFRYPKFRSLLQQRFGFLRLTKNRVSY 0 0 ERSQHAILSTIHVVTDCQYGTVSGGNENTLNTFILLD* 0 >MEL3_schMan Schistosoma mansoni (trematode_worm) Smp_180030 0 MSEIKNFTRSLLLYNRTFSMIKNNIHDSDIIMLNHWIKYTQPDPIYNYLVAIFVALIGIFGTITNLLVIFVFL 2 1 TPKSSISLQCALIINLAISDFGFSAVIGFPLKTIAAFNQYWPWGSV 1 2 ACQLYGFISATFGFLSLTTIAAISFDRYLVIVKDHKTTNFRVICTVIGFLWIWSIIWTIPPFFGF 1 2 GRYVLEGYQTSCTFDYISNDMPSLLFSGGMYIFGFMFPVLLCIYCYVNLLKIVRNNERVVLISLSNDGASKQRESVR 1 2 
NRKRLDIEATKSVILSLLFYLMSWTPYAMVCLISILGQSYFLTPTIAEMPHIFAKMAAIYNPILYAFTNRKFKNALGIRKTSSVIMQQQRLLSKGQLKPLVSLLFLVN* 0 >MEL1_lotGig Lottia gigantea (limpet) FC774055 ests long tail omitted 3 exons best: molluscan melanopsins 0 MTTLPPKENTTHLFDTWDDMFVHPHWKNFPPVSAAWHNFIGIFITFVGITGVIGNFVVIYTFSR 2 1 TKSLRTASNMFVVNLALSDLTFSAVNGFPLFSLSSFSHKWIFGRV 1 2 ACELYGLIGGIFGLMSINTMAMISIDRYLVITSPFTAMRNMTHKRAFLMIVGVWIWSILWAIPPIFGWGAYIPEGFQTSCTFDYLTRGDNRRSYIMCLYICGFVVPLGVIIFCYVFIIKSVMNHEKE MAKMADKLDAKDVRSTKEKAKAEIKIAKVSMTIILLYLMSWTPYAIVALIAQWGPALVVTPYVSEIPVLFAKASAMHNPVIYALSHPKFRDAVSKLMPWFLCCCGLTDAEKKARDELAKSKFTRQT >MEL1_sepOff Sepia officinalis (octopus) AF000947 PM 900420 492 Mollusca Cephalopoda complete [~identical X56788 Loligo forbesi) MGRDIPDNETWWYNPTMEVHPHWKQFNQVPDAVYYSLGIFIGICGIIGCTGNGIVIYLFTK 2 1 TKSLQTPANMFIINLAFSDFTFSLVNGFPLMTISCFIKKWVFGMA 1 2 ACKVYGFIGGIFGLMSIMTMSMISIDRYNVIGRPMAASKKMSHRRAFLMIIFVWMWSTLWSIGPIFGWGAYVLEGVLCNCSFDYITRDSATRSNIVCMYIFAFCFPILIIFF CYFNIVMAVSNHEKEMAAMAKRLNAKELRKAQAGASAEMKLAKISIVIVTQFLLSWSPYAVVALLAQFGPIEWVTPYAAQLPVMFAKASAIHNPLIYSVSHPKFREAIAENFPWII TCCQFDEKEVEDDKDAETEIPATEQSGGESADAAQMKEMMAMMQKMQQQQAAYPPQGAYPPQGGYPPQGYPPPPAQGGYPPQGYPPPPQGYPPAQGYPPQGYPPPQGAPPQGAPPQ AAPPQGVDNQAYQA* 0 >MEL1_todPac Todarodes pacificus (squid) Gq X70498 480 11106382 Mollusca 'squid rhodopsin' 3D: May 2008 Cys 337 palmitoyled 0 MGRDLRDNETWWYNPSIVVHPHWREFDQVPDAVYYSLGIFIGICGIIGCGGNGIVIYLFTK 2 1 TKSLQTPANMFIINLAFSDFTFSLVNGFPLMTISCFLKKWIFGFA 1 2 ACKVYGFIGGIFGFMSIMTMAMISIDRYNVIGRPMAASKKMSHRRAFIMIIFVWLWSVLWAIGPIFGWGAYTLEGVLCNCSFDYISRDSTTRSNILCMFILGFFGPILIIFF CYFNIVMSVSNHEKEMAAMAKRLNAKELRKAQAGANAEMRLAKISIVIVSQFLLSWSPYAVVALLAQFGPLEWVTPYAAQLPVMFAKASAIHNPMIYSVSHPKFREAISQTFPWVL TCCQFDDKETEDDKDAETEIPAGESSDAAPSADAAQMKEMMAMMQKMQQQQAAYPPQGYAPPPQGYPPQGYPPQGYPPQGYPPQGYPPPPQGAPPQGAPPAAPPQGVDNQAYQA* 0 >MEL1_entDof Enteroctopus dofleini (octupus) Gq X07797 475 Mollusca Cephalopoda complete 0 MVESTTLVNQTWWYNPTVDIHPHWAKFDPIPDAVYYSVGIFIGVVGIIGILGNGVVIYLFSK 2 1 TKSLQTPANMFIINLAMSDLSFSAINGFPLKTISAFMKKWIFGKV 1 
2 ACQLYGLLGGIFGFMSINTMAMISIDRYNVIGRPMAASKKMSHRRAFLMIIFVWMWSIVWSVGPVFNWGAYVPEGILTSCSFDYLSTDPSTRSFILCMYFCGFMLPIIIIA FCYFNIVMSVSNHEKEMAAMAKRLNAKELRKAQAGASAEMKLAKISMVIITQFMLSWSPYAIIALLAQFGPAEWVTPYAAELPVLFAKASAIHNPIVYSVSHPKFREAIQTTFPWL LTCCQFDEKECEDANDAEEEVVASERGGESRDAAQMKEMMAMMQKMQAQQAAYQPPPPPQGYPPQGYPPQGAYPPPQGYPPQGYPPQGYPPQGYPPQGAPPQVEAPQGAPPQGVDNQAYQA* 0 >MEL1_aplCal Aplysia californica (sea_hare) Gq 4 exons melanopsin AASC01108363 uncertainties 0 MNVSSSLTSQPYHELLHPHWLEHEEAPEGVHLSVGVFITLVGVLAVCGNSLVIITCIR 2 1 FKDLRTRSNILIINLAVGDLLMCLIDFPLLAAASFYGEWPYGRQ 1 2 VCQMYAFLTAIAGLVTINTLAVIAADRYWAVVRRPTPGQKLPKCVTSIAVASVWAYSISWALCPILGWGAYVLDGIRTTCTFDFLTRTWENRSFVIGMMIGNFVLPFALMVFSYFRIWVAVRKVKSG 2 1 NVFCAIRHNYNLALGSTLFVKQHRYRLHCEQKTVKIIMFLLIAFTVSWSPYLAVSIIGLFGDRSQLTYQNTLTASLIAKTSMVFNPILYSISHPKVRKRIANLACCYSVRRHQQQTSRIKTGRRSTSSATPSRS* 0 >MEL2_aplCal Aplysia californica (sea_hare) wgs AASC02005512 1st exon missing; new 3rd intron 42% identical MEL2_lotGig 0 2 1 HSSLRTSSNLLVVNLTVADLVMSSLDFPILAISSYKGCWVMGFL 1 2 GCQVYGVSSGVAGLVTINTLAAISVDRFVVVVHRLSPMHQMGKSTT 1 2 GVIIIAIWALSVIWAVLPITGVSSYRLEGMGTSCTFDYASRTSSNRWFFIALVIFNFFIPLALIIFSYWRIYASVRAVKRELKLLQSARTSILRKRFEVQAEI KTAITALVIISIFCLAWTPYVIIAFVGLYGPTTAIDPLVSMFPNILAKISTVSNPILYSIGHPEVRKKMKKLFLPGQQDSSWRATSATAGNSSPAQSENMNNISLPTYL* 0 >MEL2_lotGig Lottia gigantea (limpet) most like MEL_entDof e-60 84/222 (37%) 338 aa Gq-coupled 0 MSIASHVWTNSSTNHFNFSVLHQHWQNQTPLSTACQYTIGIFISTVAVIAVIGNSIVIWAHVR 2 1 IKSLSTTSNMLILNLCVGCLIMCIVDFPLYATSSFLQKWIFGHK 1 2 VCEIYATITGTAGLLIMNSYSAIAFDRFITVTRYNNPNYPRSKSATMCISGFVWIYSLSWSMAPVVGWSRYQLDGSGTT CTFDYLSTTWTNRSFILSIAFFNFVLPLCFILFAYSRILHLISSHSREMKSYRSAVIISKGKASIPKRFRSERKTAITLLI TVVVFCLSWVPYVIIALIGQFGNQSFITPQISVIPQLVAKLSTVTNPILYSLSHPVVRNKLFLRLRHELYRRPSDSVSSSRGIQMKNIEFI* 0 >MEL1_patYes Patinopecten yessoensis (scallop) Gq 9287291 AB006454 scop1 49% MOLL_RHO_entDof then MEL retina 0 MADNKSTLPGLPDINGTLNRSMTPNTGWEGPYDMSVHLHWTQFP PVTEEWHYIIGVYITIVGLLGIMGNTTVVYIFSN 2 1 TKSLRSPSNLFVVNLAVSDLIFSAVNGFPLLTVSSFHQKWIFGSL 1 2 
FCQLYGFVGGVFGLMSINTLTAISIDRYVVITKPLQASQTMTRRKVHLMIVIVWVLSILLSIPPFFGWGAYIPEGFQTSCTFDYLTKTARTRTYIVVLYLFGFLIPLIIIGVC YVLIIRGVRRHDQKMLTITRSMKTEDARANNKRARSELRISKIAMTVTCLFIISWSPYAIIALIAQFGPAHWITPLVSELPMMLAKSSSMHNPVVYALSHPKFRKALYQRVPWLF CCCKPKEKADFRTSVCSKRSVTRTESVNSDVSSVISNLSDSTTTLGLTSEGATRANRETSFRRSVSIIKGDEDPCTHPDTFLLAYKEVEVGNLFDMTDDQNRRDSNLHSLYIPTRVQHRPTTQSLGTTPGGVYIVDNGQRVNGLTFNS* 0 >TMT_apiMel Apis mellifera (bee) Gt ciliary 329 aa 16291092 NM_001039968 ciliary AmLop2 compound eye not ocelli pteropsin clock 0 MSLNRSTMEHVIYEDQVSPVMYIGAAIALGFIGFFGFTANLLVAIVIVKDAQILWTPVNVILFNLV 0 0 FGDFLVSIFGNPVAMVSAATGGWYWGYKMCLW 2 1 YAWFMSTLGFASIGNLTVMAVERWLLVARPMQALSIR 2 1 HAVILASFVWIYALSLSLPPLFGWGSYGPEAGNVSCSVSWEVHDPVTNSDTYIGFLFVLGLIVPVFTIVSSYAAIVLTLKKVRKRA 1 2 GASGRREAKITKMVALMITAFLLAWSPYAALAIAAQYFN 0 0 AKPSATVAVLPALLAKSSICYNPIIYAGLNNQFSRFLKKIFDARGSRTAVPDSQHTALTALNRQEQRK* 0 >TMT1_anoGam Anopheles gambiae (mosquito) Gt encephalopsin-class ciliary 461 aa no_ref XM_312503 encephalopsin GPROP11 adjacent head-to-head tandem GPROP12 0 MYDVTDAAAINSDHQELMAPWAYNGAAVTLFFIGFFGFFLNIFVIALMYKDVQ 0 0 LWTPMNIILFNLVCSDFSVSIIGNPLTLTSAISHRWLYGKSICVAYGFFMSLL 1 2 GIASITTLTVLSYERFCLISRPFAAQNRSKQGACLAVLFIWSYSFALTSPPLFGWGAYVNEAANIS 2 1 CSVNWESQTANATSYIIFLFIFGLILPLAVIIYSYINIVLEMRK 0 0 NSARVGRVNRAERRVTSMVAVMIVAFMVAWTPYAIFALIEQFGPPELIGPGLAVLPALVAKSSICYNPIIYVGMNTQ FRAAFWRIRRSNGVAGQPDSNNTNNSNRDKESARHTAKEGL ECSLDFCHWTVRGTRVSISSAERNVPAPAARERSGGHSVTGSREESRDRHVTLKTMLSVGPRSPSSVAPVAADCSTTDVPTSGDGSVRIVRQDSELSVIHDGGGGGGGSSSRVLVIKSQKPRSNML* 0 >TMT2_anoGam Anopheles gambiae (mosquito) Gt encephalopsin-class ciliary 434 aa no_ref XM_312502 encephalopsin GPROP12 0 MNDAPNDVAASAVDYEDLMAPWAYNASAVTLFFIGFFGFFLNLFVIALMCKDMQ 0 0 LWTPMNIILFNLVCSDFSVSIIGNPLTLTSAISHRWIFGRTLCVAYGFFMSLL 1 2 GITSITTLTVLSYERYCLISRPFSSRNLTRRGAFLAIFFIWGYSFALTSPPLFGWGAYVQEAANIS 2 1 CSVNWESQTKNATTYIIFLFVFGLVVPLIVIVYSYTNIIVNMRE 0 0 NSARVGRINRAEQRVTSMVAVMIVAFMVAWTPYAIFALIEQFGPPELIGPGLAVLPALVAKSSICYNPIIYVGMNTQ FRAAFSRVRNKGQQAAADQNTTTMQRELTKSSRDMVECSF 
DFCRKKSRFKISLVKPTAPLAVVDVSSTSHRDKGTSRSPLDQTVLNETNEDVGRERSGGGGGGGAYAGTRFVRPDFELSVINSGKSILIKSKNFRSNLL* 0 >TMT_aedAeg Aedes aegypti (mosquito) opsin XM_001650752 genome frameshifted encephalopsin-class ciliary 0 MESWAYVASAVTLFFIGFFGFFLNLFVIALMCKDVQ 0 0 LWTPINIILFNLVCSDFSVSIIGNPFTLTSAISRHWIFGRTVCIAYGFFMSLL 1 2 GITSITTLTVLSYERFCLISHPFSSRSLSRRGAVFAILFIWSYSFALTSPPLFGWGAYVNEAANIS 2 1 CSVNWESQTLNATSYIIFLFVFGLVVPLVVIVYSYTNIVVNMKR 0 0 NAARVGRINRAEKRVTRMVFVMVLAFMIAWTPYAVFALIEQFGPTDIISPALGVLPALIAKSSICYNPIIYVGMNTQFRAAFNRVRNNESVDNNTITNQKDITMNTSK EIVECSFDFCRKKRLKIKLQSNAKSKNNNNNSRNQSIADPSSTSNGDDLDQPSPAQTVLNSTVANSGSVASFGPKKRLRSDFELSVISSGKSILIKSNTFRSNLV* 0 >TMT_culPip Culex pipiens (mosquito) encephalopsin-class ciliary 0 MPPWAYVATAVVLFFIGFFGFFLNLFVIALMCKEVQ 0 0 LWTPMNIILLNLVCSDFSVSIVGNPFTLSSAISHRWLFGRKLCVAYGFFMSLL 1 2 GITSITTLTVLSYERFYLISRPFSSRSLSRRGALGAVLLIWCYSFALTSPPLFGWGAYVNEAANIS 2 1 CSVNWETQTLNATTYIIYLFVFGLVVPLTVIVYSYTNIIVNMKK 0 0 NAARVGRINRAEKRVTTMVAVMVIAFMVAWTPYSVFALMEQFGPPDVIGPGLAVLPALIAKSSICYNPIIYVGMNTQFRAAFNRVRHDPGDMANTTTNQKELT RSSRDVASVDCSFDFCRKKNRLKMKLHNSIHRSAAIKAGRSQSPHSESERSSSGATQERSTRVQTMLNATVDSRSISGSTGKLMKSDFELSVINSGKSILIKSNTFRSNLV* 0 >TMT_triCas Tribolium castaneum (flour_beetle) (60%)55 298 encephalopsin-class ciliary 0 MKNFNSTEIGDELLIPVEGYIAAAVVLFCIGFFGFSLNLTVIIFMLKERQ 0 0 LWSPLNIILFNLVVSDFLVSVLGNPWTFFSAINYGWIFGETGCTIYGFIMSLL 1 2 SITSITTLTVLAFERYLLIARPFRNNALNFHSAALSVFSIWLYSLSLTIPPLIGWGEYVHEAANLS 2 1 CSVNWEEKSPNSTSYILYLFAFGLFLPLVIITFSYVNIILTMRR 0 0 NAAFRVGQVSKAENKVAYMIFIMIIAFLTAWSPYAIMALIVQFGDAALVTPGMAVIPALLAKSSICYNPVIYIGLNAQVKGAKWVSGLIYLFQFQQAWMQKWKKNRR GSDALGTSRVMLETIHQACRDEKTDKLLEKKTKFCKDFETDVSML* 0 >TMT_rhoPro Rhodnius prolixus (kissing_bug) Insecta; Pterygota ciliary opsin frag ACPB01038514 + ACPB01038515 56% TMT_triCas 0 MLMPSAGFLAASIILFLIGFLGFFGNLIVIIIMCRDKN 0 0 LWTPVNFILFNVIVSDFSVAALGNPFTLASAIAKRWFFGQSMCVAYGFFMALL 1 2 GITSINSLTVLALERYLIVSQPVSHGSLSRPTASDIVGSIWLYSFVITiPPLVGWGEYGLEAANIS 2 1 CSINWETRSHSSTSYILFLFTFGFFIPIIVISYSYMNIILTMKK 0 0 
STMNAGRVNKAESRVTWMIFVMIFAFFLAWTPYAILALMIAFFDSNVSPAIATIPAIFAKTSICYNPFIYAGLNTQ 0 0 FR >TMT_acyPis Acyrthosiphon pisum (aphid) XM_001952259 wrong Ecdy.Insec.Hemip 0 MNGEYAQPQHISDAIYLGAAIVLSIIGIVGFIFNTCVIFIMIRDTR 0 0 LWTPQNVIIFNLATSDLAVSVLGNPVTLAAAITKGWIFGQTICVIYGFFMALF 1 2 GIASITTLTVLAYDRYLMIRYPFSSSRLTKETALYAIAGIWIYAFAVTGPPLFGWNRYVNESANIS 2 1 CSIDWESGEHSNYVIYIFVFGLFLPVTVIIYSYVSLVVTVRK 0 0 AEKIIGQATKAECRVAIMVAVMILAFLTAWMPYSVLALMIAFGGVHISPVVSIIPALCAKSSICWNPIIYIGLNTQ 0 0 FRSAWKRFLNIQDTLSEVSLDADITTGMTKLMTGHQELPAHPMNNGDASHPPGLIMCCLAHDEHRQSATYADRYECNLEMKSCNPQTLGRRPETDIGDVSL* 0 >TMT_bomMor Bombyx mori (moth) encephalopsin-class ciliary frag trace archive not provided 0 MPRWGYVASAFVLFLIGFFGFFLNLMVILLMFKDRQ 0 0 LWTPLNIILFNLVCSDFSVSVLGNPFTLISALFHRWIFGHTMCVLYGFFMALL 1 2 GITSITTLTVISFERYLMVTRPLTSRHLSSKGAVLSIMFIWTYSLALTTPPLLGWGNYVNEAANIQ 2 1 CSVNWHEQSTNTLTYIMFLFAMGQILPLSVITFSYVNIIRTLKR 0 0 NSQRLGRVSRAEARATAMVFIMIIAFTVAWTPYSLFALMEQ 2 >TMT_helVir Heliothis virescens (moth) frag GR968759 pheromone gland Ecdy.Insec.Lepid 0 MPRWGYVVSAFVLFLIGFFGFFLNLMVILLMFKDRQ 0 0 LWTPLNIILFNLVCSDFSVSVLGNPFTLISALFHRWIFGKTMCVLYGFFMALL 1 2 GITSITTLTVISFERYMMVTRPLNSRHLSSKGAIMSIVF >TMTa_dapPul Daphnia pulex (water_flea) ciliary 45% id TMT1_anoGam intron? 0 MPVWVYWSASAYLLFISIAGLFMNIVVVVIILNDSQ 0 0 KMTPLNWMLLNLACSDGAIAGFG 2 1 TPISAAAALKFTWPFSHELCVAYAMIMSTA 1 2 GIGSITTLTVLALWRCQHVVWCPTNRNSNFTDPNGRLDRRQGALLLTFIWTYTLIVTCPPLFGWGRYDREAAHIS 2 1 CSVNWESKMDNNRSYILYMFAMGLFIPLMAIFVSYISILLFIHK 0 0 SQQTSNNSDTVEKRVTFMVAVMIGAFLTAWTPYSIMALVETFTGDNVTNDSVSSEIKFYAGTISPAVATVPSLFAKTSAVLNPLIYGLLNTQ 0 0 FRTAWEKFSSRFLGRKKRHQRSQMAMGVSHKRRRDYLRTLLNRPASDEPAIVQHPSTKEMASSQAVSCVVVSNLDVPRAPNNSYVTVNDE* 0 >TMTb_dapPul Daphnia pulex (water_flea) ciliary long tail 60% identity to TMTa IGQDNHSTYYSSRINNATNFSSAFPDGDLSY intron? 
0 MPTWAYRLTAAYLLLISVLGLIMNVVVVIVILNDSQ 0 0 RMTPLNWMLLNLACSDGAIAGFG 2 1 TPISTAAALEFGWPFSQELCVAYAMIMSTA 0 0 GIGSITTLTALAIWRCQLVVCCPAKRKSAFTNHSGRLGCRQGVILLVIIWIYALAITCPPLFGWGRYDREAAHIs 2 1 CSVNWESKTNNNRSYILYMFCMGLVVPLAVIIISYVRILRVVQK 0 0 NQQQSGNVHRHRRDAAEKRVTMMVACMIAAFMAAWTPYSILALFETFIGQDNHSTYYSSRINNATNFSSAFPDGDLSYVGTISPAFATIPSLFAKTSAVLNPLIYGLLNTQ 0 0 FRLAWERFSLRFLGRFQCHRTQGVSGQHGANHHKTRRNVRKYLPNCYGDSRSLKPTPTVHLPMKEMVVSHAEQKVKTAQEQASSSVTKITTIPLISSDNQTIVSCPSSIMAN CQQHETNQANHQQAARPDKVVDHQHLLQPNRLSSLLSLSLPSVLISTPNLPCSAQRQSAAEDQAMATCQQMTSGRIRDQQQQSDSFVVVGLLSRSADCYHHHTGDVEQFVFLDSTVDELGLTARSASP* 0 >UV7_ixoSca Ixodes scapularis (tick) Chelicerata Arachnida exon 1 missing, exon 2 disjunct, K90 at EIP 0 2 1 RRRIRSQANLLVFNLALSDLLMVLEIPLLVYNSLKLRPALGVW 1 2 GCQLYGLMGGLSGTSAIFSIAALSLERYLALGRPRDPFARLTRSRAFALSLSSWIYALCFSAWPLLGVTSPYVPEGFLTSCSFHFLSDATSDRCFVWIFFVAAWCVPLVFVTTCYSGILVTVIRSRKALAQES RRSELRVAKVSLALVLLWTVAWTPYAIVALLGITGRRNLLTPWGSMAPAMFCKSAAVLDPFVYGLSHPSFRRELAIMLPCLRPRQRPVSLTLRAVVQLPKRPGPRSAGSSTSVPVTAPGTTKDNHCPTPPNVSR* 0 >UV7a_acyPis Acyrthosiphon pisum (pea_aphid) Hemiptera 3 exons SCAFFOLD4798:3246-5335 altered HEK CL3 52% K in in K90 0 MIDFKTKYPVNLWKDHGLYTDDYIKLINSHWLKFMPPNPTSHYVLGLLYTVIMVFGCTGNSLVIFMYFK 2 1 CRSLQTPANMLIINLAVSDFIMLAKASVFIYNSYYLGPALGKL 1 2 GCQVCGFLGGLTGTVSIMTLAAISLDRYYVIVCPLKAAVKTTKQRARIWIGLIWIYGFSFSIVPVLDLGYSRYVSEGYLTSCSFDYLSDNDQDKRFI LVFFTAAWCIPFTIILYCYVNILMAVWMTTEIVTSRVGQQEEKRKTDIRLGYMVIGALALWFVSWTPYAVVALLGVFDLKEYISPLSSMIPALFCKAAS CTDPWFYAITHPRFKKELMKLLTKSKSRKLVRNYGMKKGWVGSHLNKNGSVDFDNCLKTEYKEENTTIFMLESDDNNLHCQGSTSGHKTESTKEPETKFTASASQETLKYMLPS* 0 >UV7b_acyPis Acyrthosiphon pisum (pea_aphid) Hemiptera SCAFFOLD14504:180756-183351 72% UVV2a_acyPis altered HEK CL3 K in in K90 0 MSDFKTKYPIDTWKEHGFYTDDYMKLINSHWFKFMPPNATSHYILGFLYSVIMVLGCFGNSLVIFMYIK 2 1 CKSLQTPANVLIMNLAVSDFIMLAKTPVFIYNSFYQGPTLGKL 1 2 GCQIYGFFGGLTGTVSIMTLAAISLDRYYVIVHPLNAAVKTTKQRARVWIGLIWIYGFLFSIIPVMDLGYNRYVPEGYLTSCSFDYLSDDNQEKGFILVFFTAAWCIPFTTISYCYIKI 
LRAVWMTSEMAASRFGQEEEKRKTEIRLGYVVVGVIMLWFVSWTPYAMVALLGVFDRKDYITPLSSMIPAVLCKAASCMDPWIYAITHPRFKNELTKLMSRKKTRKLERDYGMKKNWGGQ SYSNKSGAGLRNLSSSEDECVEEVIVVIDPDDKKMKRQGSTSSHKTEETKALETKFPPTRQESLKYMPPSWYKLPRTTSKSSIMLDPKLTGDDNNK* 0 >UV7_rhoPro Rhodnius prolixus (kissing_bug) Hemiptera K90 at KMP, ortholog RH7 of droMel 0 mKYFHLYPIEQWKMHRFFTEEYLKLVNTHWFEYPPPNKQIHYIFAAVYFLVMLVGVSGNLLVIFMILR 2 1 FRTLRTSSNILILNLAVSDFLMVAKMPVFIYNSFYFGPVLGEM 1 2 GCHFYGFIGGLSGTASILTLAAIAMDRYLGIAHPLNFNQGRAKKRTIVWITFIWVYSITFASIPLSHIGVKTYVPEGFLTSCSFDYLSTDIQNRCFIFIYFVAAWCLPLLVIITSYVGICREVLRVSLIRKGQE REQRKREAKLSAILALATFLWFLSWTPYAAVALLGIFGYKNHITQLASMIPALFCKTAACVNPFIYGLNHPRLRQQLLKLCCKKRYNLEKTHFSRSWRNTSCSFKLKEQSLCNVSQSRLRRTSTVASEPSEHSTHFM* 0 >UV7_pedHum Pediculus humanus (louse) Phthiraptera AAZO01007270 0 mKTFKLKWPEEWKKLGLFDDEYLYKINKYWMKFPPPSPMSHYFMGIIYSVIMVVGVFGNFLIIYLFLR 2 1 KRSLRTPSNVFIFNLAVSDSLLLLKMPVFIINSFYLGPALGNL 1 2 GCSAYGFVGGLTGTVSIMTLAAIAFDRYQVIVHPLERKTKAAVYFQILLIWIYAIFFSIIPLLDVGLNKYVPEGYLTSCSFDYLTQDTASRLTIFVFFVAAWIVPLSIILGSYM ALYKVVLKARGTHFNTVMTRHCKDIEIQRPELKAAVTVICIVCLWTLSWTPYAVVALLGITGNEKYISPMSSMIPALFCKTASCIDPFVYAATNRRFRNELKRKYRKRSRYQPSLKTE RKDFFTLSEDNNDRGKGNTIRIREK* 0 >UV7_anoGam Anopheles gambiae (mosquito) Diptera XM_308329 K90=EAP 0 MGRQGSGNAVRISPSSRNQPYFSSAHLSFVVPFPVHSKYVVRSGYVLPVDPLFVAKINPFWLRFDPPSAGEHYGLAVFYFLMMLFGVIGNALVVFMFYR 2 1 YRSLRTPANYLVINLAVADFIIMMEAPMFIYNSIHQGPALGSI 1 2 GCTVYALMGAVGGTVAIATLTVISIDRYNVVVYPLNPNRSTTKLKCYFLIAFTWAYGLLFASFPALEIGLSRYTAEGYLTACSFDYLDRTYKARVFMFVYFVFAW LIPFAIISYCYARILIAVINANAIQSSKSKNKTEVKLAGVVVGIIGLWFAAWTPYAVVAMMGVFGYEQYLTPLNSMIPAVFAKIAASIDPYFYAMNHPRYRQMLER MFCNRGADQGNSQYQTSHYTRGASRGGDSEGGGGEESGGGGGVGRAPGGGNAGLGRGGTVRGGGGGGRLIAGKGGGGANATGSTGGGGVKALKKQISNGDETSLEVSLEM* 0 >UV7_aedAeg Aedes aegypti (mosquito) Diptera frag XM_001650694: wrong exon 1 K90=EAP 0 FPPNSRYMALSGYSGPTIEDAFRDRINPFWLQFDPPSRTAHYILGFIYFMMMMFGLCGNLLVILMFFR 2 1 FKSLRTPANYLVINLAIADFIIMLEAPLFVYNSYHQGPATGNVWCTIYALLGAVGGTVAIVTLTMISIDRYNVVVYPLNPKRSTTRLKVALMIVF 
AWIYGLVFSVIPALDIGLSRYTPEGFLTACSFDYLERTRDARLFMFLYFIFAWVVPIIAITFCYIQILRVVIGANSIQSSKNKSKTEVKLAGVVIGIIGLWFIAWTPYAIVAMMGV FGYESLLSPLGSMVPAILAKTAACIDPYFYAMNHPRYRQELRKMFGLNQQDLGNSQYQTSRYTRNASRMDDSEGGASERVTIGRQPGKTTTDEPEPSQQTEQGPQPTYSKNLAANS RGALQRAQSSISAADDTSLSVSIDLTETNPNSNH* >UV7_culQui Culex quinquefasciatus (mosquito) Diptera frag XM_001861603: wrong exon 1 K90=EAP 0 PEPVHPASKYISLSGYDGPPVEDAFRDRINPFWLQFEPPSPVAHYALGFVYFLMMVWGLFGNVLVIFMFFK 2 1 FKSLRTPANYLVINLAVADFLIMLEAPIFVYNSYHLGPAFGNTL 1 2 CTIYSLLGAIGGTVAIMTLTMISVDRYNVVVYPLNPNRSTTRLKVMLMIVFTWIYALVFSL MPALEIGLSRYTPEGFLTACSFDYLDRGWDARVFMFMYFVFAWVIPFLTISYCYVAILRVVVGAGSIQSSKNKNKQEVKLAGVVIGIIGLWFIAWTPYAVVAMLGVFGYEHLLTPL GSMIPAILAKTASCIDPYFYAMNHPRFRQELRKMFGKEQEMNHSQYQTSRYTRNASRNDSEAGPSERVQLGRAPGKDADPIPAVSSSVAQPNYSQNLASNRKGGLQRAQSSISAAD DTSLSCSIDLTETQPNNH* 0 >UV7_droMel Drosophila melanogaster (fruitfly) Diptera RH7 CG5638-RA long N-terminal has M comp genomics support, EC074058 CO302368, 2 of 3 exons novel 0 MEAIIMTTLPNLTTDAGDSSFWLTGALSLSEMLANSSHSHSTGSTTSTAGSSATESSAVNVGKDHDKHVNDSVSTGLS 2 1 NYSNYPSYIHYRDKYDLSYIAKVNPFWLQFEPPKSSTFLIMAALYCLISVVGCVGNAFVIFMFANRKSLRTPANILVMNLAICDFLMLIKCPIAIYNNIKEGPALGDI 1 2 ACRLYGFVGGLSGTCAIGTLTAIALDRYNVVVHPLQPLRRCSRLRSYLIILLIWCYSFLFAVMPALDIGLSVYVPEGFLTTCSFDYLNKEMPARIFMALFFVAAYCIPLTSIVYSYFYILKVVFTASRIQSNKDKAKTEQ KLAFIVAAIIGLWFLAWSPYAIVAMMGVFGLERHITPLGSMIPALFCKTAACVDPYLYAATHPRFRVEVRMLFYGRGVLRRVSTTRSSYMTRSRSSFTHRLRTSTTGEGGMGDHRMENYLMNNNLMMVPEETEENEEIVVVAEINNSISSVMEQSKF* 0 >UV7_droYak Drosophila yakuba (fruitfly) Diptera RH7 Diptera chr3L:12207286 12209654 0 MEAIIMTTLPALTTDAGDSSSFWLTGALSLSEMLANSSHGHSTGSTSSTAGSSATESSTVNVGKDHDVTKHVNDSVSTGLS 2 1 NYSNYPSYIHYRDKYDLSYIAKVNPFWLQFEPPKSSTFLVMAALYFLISVVGCVGNAFVIFMFANRKSLRTPANILVMNLAICDFLMLIKCPIAIYNNIKEGPALGDI 1 2 ACRLYGFVGGLSGTCAIGTLTAIALDRYNVVVHPLQPLRRCSRLRSYLIILLIWCYSFLFAVMPALDIGLSVYVPEGFLTTCSFDYLNKETPARIFMALFFVAAYCIPLTSIVYSYFYILKVVFTASRIQSNKDKAKTEQ 
KLAFIVAAIIGLWFLAWSPYAIVAMMGVFGLERHITPLGSMIPALFCKTAACVDPYLYAATHPRFRVEVRMLFYGRGVLRRVSTTRSSYMTRSRSSFTHRLRTSTTGDGGMGDHRMENYLMNNNLMMVPEETEENEEIVVVAEINNSVSSVMEQSKF* 0 >UV7_droAna Drosophila ananassae (fruitfly) Diptera RH7 scaffold_13337:1483455 1485125+ frameshifted 0 MEAIILSTLPSLTTNASGSSSHWLTGALSLPEILANSSGSPNTSSADTGSGINLSARDADRHFNISTEAR 2 1 NYSYYPGYIHYRDKYDLSYIAKVNPFWLQFEPPHSSTFLAMAALYCLISVVGCVGNAFVIFMFANRKSLRTPANILVMNLAICDFLMLVKCPIAIYNNIKEGPALGDV 1 2 ACRIYGFVGGLSGTCAIGTLTAIALDRYNVVVHPLQPLRRCSRLRSYLIILLIWCYSFLFAVMPALDIGLSVYVPEGFLTTCSFDYLNKETPARIFMALFFVAAYCFPLTAIVYSYFYILKVVFSAGRIQSNKDKAKTEQ KLAFIVAAIIGLWFLAWSPYAVVAMMGVFGLEKHITPLGSMIPALFCKTAACVDPYLYAATHPRFRVEVRMLFYGRGILRRVSTTRSSYMTRSRSSFTHPAGRADGGTGRDHRMETYLMNNNLMMVPEETEENEEIVVVAEINNSVSSAIEQSKF* 0 >UV7_droPse Drosophila pseudoobscura (fruitfly) Diptera RH7 chrXR_group6:2491547 2493151 0 MEALMAALPTLTTEAAGSSLWLTSALSLSEMLANSSTSPNASLVAATTSSAAVATASTTSAAEAVGKVPDKHEVNDNVSTVLS 2 1 TSSSYPGYIHYRDKYDLSYIARVNPFWLQFEPPKSSTFYLMAALYCLISVVGCVGNAFVIFMFANRKSLRTPANILVMNLAICDFLMLVKCPIAIYNNIKEGPALGDA 1 2 ACRIYGFVGGLSGTCAIGTLTAIALDRYNVVVHPLQPLRRCSRLRSYLIIFLIWSYSFLFAVMPALDIGLSVYVPEGFLTTCSFDYLNKETPARIFMALFFVAAYCVPLTTIVYSYFYILKVVFTASRIQSNKDKAKTEQ KLAFIVAAIIGLWFLAWSPYAIVAMMGVFGQERHITPLGSMIPALFCKTAACVDPYLYAATHPRFRVEVRMLFFGRGVLRRVSTTRSSYMTRSRSSFNRRLRPTPDAEHRVESYLMNNNLMMVPEETEENEEIVVVAEFNNSSYSGMEQSKF* 0 >UV7_droWil Drosophila willistoni (fruitfly) Diptera RH7 scaffold_180949:5140016 5141994+ 0 MDMDMALDMNDAATTTSLWITSAALSLSEILVNTTSHVVTTSPASTSTVETTAVAAVTATGKVVHDDEKHHHHHHHHHQDEVNDNNVTTVLR 2 1 NFSSYPGYIHYRDKYDLSYIAKVNPFWLQFEPPRSSTFYIMAALYCLISVVGCIGNAFVIFMFSNRKSLRTPANILVMNLAICDFLMLVKCPIAIYNNIKEGPALGDI 1 2 ACRIYGFVGGLSGTCAIGTLTAIALDRYNVVVHPLQPLRRCSRLRSYLVIVMIWCYSFLFAIMPALDVGLSVYVPEGYLTTCSFDYLNKETPARIFMALFFVAAYCVPLTCIMFSYFYILKVVFTANRIQSNKDKAKTEQ KLTFIVAAIIGLWFLAWSPYAVVAMMGVFGLEQHITPLGSMIPALFCKTAACVDPYLYAATHPRFRVEVRMLIYGRGVLRRVSTTRSSYITRSRSSFTRRLRTGSELDMRTEPYIMNNNLMMVPEETEENEEIVVVAEINNPSRCVSMHEHTSKF* 0 >UV7_droMoj Drosophila mojavensis (fruitfly) Diptera RH7 
scaffold_6680:4445619 4446890+ 0 METIMSTLPTLTADDGSLWITSALTELLASGANSSSGSSSVVADGTQNATFVAAATTTTTTVAAAAAAAAAAAVNASTATTANATKGHHKHPHGVNDSETDLR 2 1 LCSSYPGYIHYRDKYDLTYIAKVNPFWLQFEPPDTSTFYIMAALYCLISVVGCVGNAFVIFMFGSRKSLRTPANILVMNLAICDFLMLVKCPIAIYNNIQEGPALGDA 1 2 ACRLYGFVGGLSGTCAIGTLTAIALDRYNVVVHPLQPLRRYSRLRSYLIIFAIWCYSFLFAVMPALDVGLSVYVPEGFLTTCSFDYLNKETPARIFMALFFVAAYCIPLASIVYSYFYILKVVFTANRIQSSKDKAKTEQ KLTFIVAAIIGLWFIAWSPYAIVAMMGVFGQEQHITPLGSMIPALFCKTAACVDPYLYAATHPRFRVEVRMLMYGRGVLRRVSTTRSSYMTRSRSSFTHRLRPSSGDCENRAEPYTLNNNLMMVPEETEENEEIIVVAEINNSISGVMEQSKF* 0 >UV5_plePay Plexippus paykulli (jumping_spider) Chelicerata Arachnida PpRh3 kumopsin3 AB251851 PUBMED 18217181 MLNNTIPGPATLDDIGPPSWCYETRFNGWNAAPDIYVPDYWKQFRAPAPYLHYMLGFFYICLMSIAVV GNAIVMYIFFSAKTLRTPTNMFVIGLAMADLLMMSKTPVFIYNCFHLGPVFGQIGCDIYGIVGTYSGIGSAFCNAIIAYDRYRVIVHP FSKSGMSITKAIAFLVIIYLYITPFAILPALKIWSRYVPEGFLTSCSADFFMQDFNGRSYIV GTWFFGWFIPVAAIVFFYVQIFLAVKDHEEKIKEQARKMNVDSIRSNEAVKNSSAEVRIAKTAMCVFLMFLSSWAPYILVAFITGFSDPKLKRITPVISMVPAMTIKASACFDPFF YALSHPRYRLELQNRMPWLCINEKAEASGPADDSVSKTTEHVA* >UV5_hasAda Hasarius adansoni (jumping_spider) Chelicerata Arachnida HaRh3 kumopsin3 AB251848 MLNNTALQPAVLDDIGPPSWCYETRFNGWNAAPDILVPDYWKQFRAPAPYLHYILGCLYICLMSVALI GNAIVIYIFSVSKSLRTPTNMFVIGLAMADLLMMSKTPVFIYNCFHLGPVFGQLGCDIYAIVGTYSGIGSAFCNAVIAYDRYRVIVHP FSKSGMTMTKAIAILVIVYLYITPFAILPALKIWSRFVPEGFLTSCSSDFYMQDFNGRSYIV GTWFFGWFIPVAAIIFFYAQIFLAVKDHEEKIKEQARKMNVDSFRSNEALKNSSAEVRIAKTAMCVVLLFLTSWVPYILVAFIAGFSDPKLKRVTPVISMIPAMTIKGSACFDPFF YALSHPRYRLELQNKLPWLCINEKAEASGPADDSVSKTTEHVA* >UV5_braKug Branchinella kugenumaensis (fairy_shrimp) Branchiopoda BAG80984 60% MANVTGFDYYRYERRELGWNTPAEYMEFVHPHWKQFEAPNPFLHYMLGVFYIIFMFCSLIGNGVVIWVFASAKSLRTPSNLFVINLAVLDFLMMLKTPVFIVNSFNEGPIWGKTGCDFFALLGSYAGIGGATTNAAIAFD RYRTIAHPFDGKLSRGQAITLCMLCWLYATPFSLMPFFGIWGRFVPEGFLTTCSFDYITEDSSTRAFVGTIFFTSYVLPMILIIYFYSQIVGHVRQHEETLRAQAKKMNVATLRSGKDDQEQSAEVRIAKVCIGLFSMFV ISWTPYAAVALLCAFGNRAAVTPLVSMIPALTCKAVACIDPWIYAINHPRYRLELQKRLPWFCIHEPEPNNDSASVNSEKTVATTTPS >UV5_triLon Triops 
longicaudatus (tadpole_shrimp) Branchiopoda BAG80983 61% MAFLQNQTYDGTSPSFSFFRTSERIMLGHNTPKDYMEYVHPHWQTFEAPNPFLHYLLGVLYIGFMFCALVGNGVVIWIFSSAKSLRTPSNMFVINLAVLDFIMMMKTPVFIVNSFNEGPIWGKFGCDLFALMGSYSGIGG AMTNAAIAFDRYRTIARPFDGKLSRGKVLTICAGIWLWATPFSLMPLFGIWGRFVPEGFLTTCSFDYMTETSSIRWFVGCIFTYSYIIPLGLIIYYYSKIVGHVQEHERILREQARKMNVESLRSGKDQQEKSAEIRIAK VAIGLSLMFVVAWTPYALVALIAAFGNRAVLTPLVSMIPACCCKAVACIDPWIYAINHPRYRLELQKRMPWFCVHEPEPEMIDNSSAITEKTST >UV5_papXut Papilio xuthus (butterfly) Lepidoptera AB028218 --- Arthropoda Insecta Rh5 partial 0 MIPAAVMDNHTENNYNYGAYFAPYRLEGVELLGAGLTGEDLAAIPEHWLSYPAPPASAHTMLALVYVFFTAAALIGNGLVIFIFSASKSLRTPSNLLVVQLAVLDFLMMLKAPIFIYNSIK RGFASGVIGCQIFAFMGSVSGTAAGLTNACIAYDRHSTITRPLDGRLSRGKVLLMMVCVWLYTAPWAILPQLQIWGRYVPEGFLTSCTFDYLTTTFD NKLFVASMFVCVYIFPMIAILYFYSGIVKQVFAHEAALREQAKKMNVDSLRSNQNAAAESAEIRIAKAALTVCFLYVASWTPYGVMSLIGAFGDQNLLTPGVTMIPALACKGVACI DPWVYAISHPKYRQELQKRMPWLQIDEPDDNASNTTSNTANSSAPA* 0 >UV5_triGra Triops granarius (tadpole_shrimp) Branchiopoda BAG80978 MAFLQNQTHDGTSPSFSFYRTSERVMLGQYTPKDYMDYVHPHWQTFEAPNPFLHYLLGVLYIGFMFCALVGNGVVIWIFSSAKSLRTPSNMFVINLAVLDFIMMMKTPVFIVNSFNEGPIWGKFGCDMFALMGSYSGIGG AMTNAAIAFDRYRTIARPFDGKLSRGKVLTICAGIWLWATPFSLMPLFGIWGRFVPEGFLTTCSFDYMTETSSIRWFVGCVFTYSYIIPLGLIVYYYSKIVGHVQEHERILREQARKMNVESLRSGRDHQEKSAEIRIAK VAIGLSLMFVVAWTPYALVALIAAFGNRAVLTPLVSMIPACCCKAVACIDPWIYAINHPRYRLELQKRMPWFCVHEPEPEIIDNSSAITEKTST* >UV5a_dapPul Daphnia pulex (water_flea) Branchiopoda NCBI_GNO_176434 FE384049 0 MLGWNTPEDYMSYVHP 21 YWKTFEAPNPFLLYMIGFLYTIFMFCCVAGNGVVIWIFTN 2 1 CKSLRTPSNMLVVNLAILDMLMMLKSPVMIINSYNEGPIWGKLGCDVFGLMGSYNGIGSAVNNAAIAYDRHR 2 1 TISRPLDGKLSRKQVTLMIVAIWAWATPFSVMPFLGIWGRYVP 1 2 EGFLTTCTFDYMTEDASTRFFVGSIFVYSYVIPLAMLIFYYSKIVRSVGDHEKTLRDQAKKMNVTSLRSNRDQNEKSAEVRIAKVAIALATLFVFAWTPYAFVALTAAFGNR 2 1 SVLTPLLSTVPACCCKLVSCINPWIYAINHPRYR 2 1 MELQKKMPWFCIHEPVPTNDDSSVGSATTEMSGVSKETSS* 0 >UV5b_dapPul Daphnia pulex (water_flea) Branchiopoda penultimate intron lost, last intron has slid back 2 aa 0 
MNGWNTPADYKSYVHPHWLSYEEPNPMLHHLLGVLYIFFMIASCLGNGIVIYIFST 2 1 TKELKTPSNILILNLAICDFIMMIKTPIFIVNSFNEGPVFGRLGCSIFGLLGAYVGPCSAVTNAAIAYDRYR 2 1 CISDPMGKRWSKSQASLIVLGCWVYASPVSLLPFTEIVNRFVP 1 2 EGYLTSCTFDYMTDNLETKMFVFILWIWCWIMPLGVIIFSYGKITTQVMTHEARLKEQAKKMNVESLRSGANKDARNEIRVAKVGISLTTLFLLSWTPYFAIAFIGCYGNR SLLTPGLSMIPACTCKMAACVDPFVYAINHPK 2 1 YRLELMKRFPWLCVHEKDDSTRSENSTNATIASEAESRT* 0 >UV5_apiMel Apis mellifera (bee) Hymenoptera AF004169 353 nm 5 exons Arthropoda Insecta complete genNow 0 MSNDSIHWEARYLPAGPPRLLGWNVPAEELIHIPEHWLVYPEPNPSLHYLLALLYILFTFLALLGNGLVIWIFCA 2 1 AKSLRTPSNMFVVNLAICDFFMMIKTPIFIYNSFNTGFALGNLGCQIFAVIGSLTGIGAAITNAAIAYDRYS 2 1 TIARPLDGKLSRGQVILFIVLIWTYTIPWALMPVMGVWGRFVPEGFLTSCSFDYLTDTNEIRIFVATIFTFSYCIPMILIIYYYSQIVSHVVNHEKALREQAKKMNVDSLRSNANTSSQSAEIRIAK 0 0 AAITICFLYVLSWTPYGVMSMIGAFGNKALLTPGVTMIPACTCKAVACLDPYVYAISHPKYR 2 1 LELQKRLPWLELQEKPISDSTSTTTETVNTPPASS* 0 >UV5_nasVit Nasonia vitripennis (jewel_wasp) Hymenoptera XM_001608024 wrong, transcripts GE436449 GE390962 0 MPYYNWNGTDQTAGWPEARIQPAGAPRLLGWNVPPEELVHIPEHWLVYPEPNPALHYLLALLYILFTFVALLGNGLVIWIFCA 2 1 AKSLRTPSNMFVVNLAICDFMMMLKTPIFIYNSFHTGFALGNLGCQIFSFIGSLSGIGASITNAAIAYDRYS 2 1 TIARPLDGKLSRGQVMMLIVLIWMYTIPWALMPSMGVWGRFVP EGFLTSCTFDYITDSDEIRYFVGTIFTFSYAIPMTLIIYFYSQIVGHVVNHEKALREQAKKMNVESLRSGQNKDQASAEVRIAK 0 0 VALTICFLFVAAWTPYGVMSLIGAFGNK SLLTPGVTMIPACCCKAVACLDPYVYAISHPRYR 2 1 LELQKRMPWLELQEKPPASDATSTTTEAVPASS* 0 >UV5_lucCru Luciola cruciata (firefly) Coleoptera AB300329 MILHNATVFAAAQTQDDPDSIVHLLGWNVPKSELHHIPEHWLVYPEPEASIHYLLGIVYIFICFMGIVGNGLVLWIFSTSKSLKTASNMFVVNLAFCDFIMM MKMPIFVYNSFNRGYALGHIGCQIFGFVGSLSGIGAGMTNAFIAYDRYATISNPLEGKLTRTKALIMIFIIWGYTFPWAVLPMFEVWCRFVPEGFLTSCTFDYLTDTFDNDMFVAV IFICSYVIPMSMIIYFYSQIVKHVMHHEKALRDQAKKMNVESLRSNQSLQSQSIEIKIAKVAIMVCFLFVASWTPYAVLALIGGFGDQSLLTPGVTMVPALACKFVACLDPYVYAL SHPRYRMELQKRLPWLAIKEDAVSDAQSMVTTTTAAATPAATEQAPTA* >UV5_triCas Tribolium castaneum (flour_beetle) Coleoptera 0 MYVVHPFKIIRNKVTILRTMETMANHLGWNVPKDELIHIPQHWLVYPEPEASMHFLLALIYIGFFIMATIGNGLVIWIFST 2 1 
SKSLRTASNMFVVNLAICDFAMMIKTPIFIYNSFYRGFALGHLGCQIFAFIGSLSGIGAGMTNACIAYDRYT TITRPFDGKITRTKALVMIIFVWGYTIPWAVMPLLEIWGRFAP 1 2 EGFLTACSFDYLTDTFDNHMFVTSIFICSYVIPMSMIIYFYSQIVSKVFSHEKALREQ 0 0 AKKMNVESLRSNQSQQASQSAELRIAKAAIAICSLFVASWTPYAVLALIGAFGDQSLLTPGVTMVPACACKFVACLDPYVYAISHPKYR 2 1 LELQKRLPWLAIKETAASETQSTTTENTTTQSATTTT* 0 >UV5_anoGam Anopheles gambiae (mosquito) Diptera XM_556823 novel short exon 0 MGLVQLDNQTAYRPEALIGADQSGLRYLGWNVPPEELVHIPEHWLQFPEPEASLHYLLGLLYIAFTIFSLVGNGLVIWIFIA 2 1 AKSLRTPSNVFVINLAICDFFMMAKTPIFIYNSFTKGFTLGNLGCQIFGFVGSLT 1 2 GIGAGATNALIAYDR 2 1 YNTITRPFEGRLTQTKAIIFICLIWAYTIPWGVLPLLEIWGRYVP 1 2 EGFLTSCTFDYLSGTFDTRLFVASIFTFSYVLPMSLIIYYYSQIVSHVVNHEKSLREQAKKMNVESLRSNQNQKDASVEIRIAKAAITVC FLFVASWTPYAVLALIGAFGDKSLLTPGVTMFPACACKFVACLDPYVYAISHPRYRIELQKRLPWLAITETLPAENASTCTEQQDGNATTQS* 0 >UVB_anoGam Anopheles gambiae (mosquito) Diptera XM_312478 0 MFLGNESISEGAMLMPMARTAGEMPKLLGWNLPPEEQYLVHDHWKGFPSPPYYMHLMLAMIYFVLMNTSLIGNGIVLWIFGT 2 1 SKSLRNGSNMFIINLAIFDLLMMCEMPMFLVNSFSERLVGYGVGCSVYAALGSMSGIGGAISNAVIAFDRYRTISNPLDGRLSRVQAGLLICLTWLWTMPFTLLPLFEIWGRY IPEGYLTTCSFDYLTDDPDTRVFVGCIFTWAYVIPMIFICYFYARLFGHVRQHEMMLKNQARKMNVESLTANRSEKAQAVEMRIAKAAFTIFFLFVCAWTPYAIVTMIGAFGDR 2 1 TMLTPFVTMVPAVCCKIVSCLDPWVYAISHPKYRQELERRLPWMGIKEADDSVSTTES* 0 >UV5B_droMel Drosophila melanogaster (fruitfly) Diptera RH5 CG5279-RA two small introns also seen in Apis, Daphnia first in Aplysia, Platynereis and Homo 0 MHINGPSGPQAYVNDSLGDGSVFPMGHGYPAEYQHMVHAHWRGFREAPIYYHAGFYIAFIVLMLSSIFGNGLVIWIFST 2 1 SKSLRTPSNLLILNLAIFDLFMCTNMPHYLINATVGYIVGGDLGCDIYALNGGISGMGASITNAFIAFDRYKTISNPIDGRLSYGQIVLLILFTWLWATPFSVLPLFQIWGRYQP 1 2 EGFLTTCSFDYLTNTDENRLFVRTIFVWSYVIPMTMILVSYYKLFTHVRVHEKMLAEQAKKMNVKSLSANANADNMSVELRIAKAALIIYMLFILAWTPYSVVALI GCFGEQQLITPFVSMLPCLACKSVSCLDPWVYATSHPKYRLELERRLPWLGIREKHATSGTSGGQESVASVSGDTLALSVQN* >UV4_droMel Drosophila melanogaster (fruitfly) Diptera RH4 CG9668-RA one ancestral intron with intercolated genes 0 MEPLCNASEPPLRPEARSSGNGDLQFLGWNVPPDQIQYIPEHWLTQLEPPASMHYMLGVFYIFLFCASTVGNGMVIWIFST 
SKSLRTPSNMFVLNLAVFDLIMCLKAPIFIYNSFHRGFALGNTWCQIFASIGSYSGIGAGMTNAAIGYDRYNVITKPMNRNMTFTKAVIMNIIIWLYCTPWVVLPLTQFWDRFVP 1 2 EGYLTSCSFDYLSDNFDTRLFVGTIFFFSFVCPTLMILYYYSQIVGHVFSHEKALREQAKKMNVESLRSNVDKSKETAEIRIAKAAITICFLFFVSWTPYGVMSLI GAFGDKSLLTPGATMIPACTCKLVACIDPFVYAISHPRYRLELQKRCPWLGVNEKSGEISSAQSTTTQEQQQTTAA* 0 >UV3_droMel Drosophila melanogaster (fruitfly) Diptera RH3 CG10888-RA single exon 0 MESGNVSSSLFGNVSTALRPEARLSAETRLLGWNVPPEELRHIPEHWLTYPEPPESMNYLLGTLYIFFTLMSMLGNGLVIWVFSAAKSLRTPSNILVINLAFCDFMMMVKTPIFIYNSFH QGYALGHLGCQIFGIIGSYTGIAAGATNAFIAYDRFNVITRPMEGKMTHGKAIAMIIFIYMYATPWVVACYTETWGRFVPEGYLTSCTFDYLTDNFDTRLFVACIFFFSFVCPTTMITYY YSQIVGHVFSHEKALRDQAKKMNVESLRSNVDKNKETAEIRIAKAAITICFLFFCSWTPYGVMSLIGAFGDKTLLTPGATMIPACACKMVACIDPFVYAISHPRYRMELQKRCPWLALNEKAPESSAVASTSTTQEPQQTTAA* 0 >UV5_pedHum Pediculus humanus (louse) Phthiraptera AAZO01000117 best: exon 1 uncertain 0 MKITTESENNISLSYYQPF 1 2 IDKEESLIWNVDPSELVHIPDHWFNFSAPHPLSNYLLGFLYFIFFVISCTGNGIVIWIFTT 2 1 SKNLRTASNVFVVNLAIFDFIMMAKTPIMIYNSMNLGFECGFVWCQIFASAGALSGIGASITNTCIAYDRCE 2 1 TITNPLQ KSGKKKAFLLAAFTWIYALPWAVLPFLEIWGKFAPEGYLTTCTVDYLTDTSQTRMFIVTIFFAAYVLPLSLIIYFYTKIVLHVINHEKSLKAQ 0 0 AKKMNVESLRSDGNKNYAVEIRITKVAIAMCFLFVISWTPYAVVALIGCFGNK 2 1 HLITPLVSMIPACACKAVACIDPYIYAISHPRFR 2 1 VEVNKRFACLAGCLQEKELQDDAVSKNTVNAENVDT* 0 >UV5_acyPis Acyrthosiphon pisum (pea_aphid) Hemiptera 8 exons SCAFFOLD14509:41790,53815 76% identical UVVa_acyPis K in in K90 0 MDFNRTVSRPLAQLGs 2 1 SLMENEVGETHLLGWNLQAEDLIHIPEHWLKYQEPSSLQHYYLAFMYTIFMFVALFGNGLVIWVFCV 0 0 AKPLRTPSNIFVINLALCDFVMMAKAPIFILGSINRGYQ GHFLCQLFGTAGAFSGIGASATNAAIAYDRFS 2 1 TIAKPFDGRMTYGRAFFLIICIWTYTLPWGLLPLTEKWNRYVP 1 2 EGYLTSCTFDYLSPTDETRAFVGIMFVICYVIPVSLVIFFYSQIVSHVFNHEKALREQ 0 0 AKKMNVESLRSNQDANAQSAEVRIAKAAITICCLFIASWTPYAVVAMIGAFGDR 2 1 SLLTPGITMIPAIFCKTVACFDPYVYAISHPRYR 2 1 LELSKRVPCLGISEKPPPTASETQSTTTAA* 0 >UV5_rhoPro Rhodnius prolixus (kissing_bug) Hemiptera exon 1 missing, K90 at KTP 0 0 1 ASTSGNIRTLGWNLSPEDLKHIPEHWLSYPEPEPILNYALGVLYIFFMLIALIGNGLVIWIFST 2 1 
AKTLRTPSNIFVVNLAICDFLMMSKTPIFIYNSFKLGYALGHRACQIFALLGSFSGIGASATNAVIAYDRYR 2 1 VIATPFAPKLSRTKAVLYLALVWAYVTPWALLPLFEQWSRFVP 1 2 EGFLTSCTFDYLTPTSEIRNFVTVMFFICYVFPMSLIIYFYSQIVSHVIIHEHNLREQ 0 0 AKKMNVESLRSNANMHTQSAEIRIAKAAITICFLFVASWTPYAVLALIGAYGNQ 2 1 DLLTPAVTMIPACACKAVACVDPYVYAISHPRYR 2 1 QELSKKFPWLDIKEAPAPSSVDANSTATEMTLPTQTSPAEA* 0 >UV5_manSex Manduca sexta (moth) Lepidoptera L78081 357 Arthropoda Insecta complete Manop2 PMID: 9343857 0 MNNQSENYYHGAQFEALKSAGAIEMLGDGLTGDDLAAIPEHWLSYPAPPASAHTALALLYIFFTFAALVGNGMVIFIFSTTKSLRTSSNFLVLNLAILDFIM MAKAPIFIYNSAMRGFAVGTVGCQIFALMGAYSGIGAGMTNACIAYDRHSTITRPLDGRLSEGKVLLMVAFVWIYSTPWALLPLLKIWGRYVPEGYLTSCSFDYLTNTFDTKLFVA CIFTCSYVFPMSLIIYFYSGIVKQVFAHEAALREQAKKMNVESLRANQGGSSESAEIRIAKAALTVCFLFVASWTPYGVMALIGAFGNQQLLTPGVTMIPAVACKAVACISPWVYA IRHPMYRQELQRRMPWLQIDEPDDTVSTATSNTTNSAPPAATA* 0 >UV5_diaNig Dianemobius nigrofasciatus (cricket) Orthoptera MELQGSNVSNLSVWRPEARLATRLLGWNVPAEELIHIPEHWLTYPAPDAFSYYILGMLYVAFCFIALIGNGLVIWVFSSAKTLRTPSNIFVINLALYDFIMM LKTPIFIYNSFNLGFGLGQLGCQIFAFMGSVSGIGAAATNACIAYDRYRVIARPFDSKMSIKGATLLVLLVWMWALPWAILPLLEIWGRYAPEGYLTSCSFDYLTDTPENHMFVLC IFICSYVIPMSLIIYFYSQIVSHVVNHEKALKEQAKKMNVDSLRSNQQQNQTSAEIRIAKVAIGICFLFVASWTPYAVLALIGAFGNKALLTPGVTMIPACTCKAVACLDPYVYAI SHPRYRAELQKRLPWLCIKEESASDTTSNATTTSTNAGATST* 0 >UVB_apiMel Apis mellifera (bee) Hymenoptera AF004168 439 nm 8 exons Arthropoda Insecta complete genNow 0 MLLHNKTLAGKALAFIAEEG 2 1 YVPSMREKFLGWNVPPEYSDLVHPHWRAFPAPGKHFHIGLAIIYSMLLIMSLVGNCCVIWIFST 2 1 SKSLRTPSNMFIVSLAIFDIIMAFEMPMLVISSFMERMIGWEIGCDVYSVFGSISGMGQAMTNAAIAFDRYR 2 1 TISCPIDGRLNSKQAAVIIAFTWFWVTPFTVLPLLKVWGRYTT 1 2 EGFLTTCSFDFLTDDEDTKVFVTCIFIWAYVIPLIFIILFYSRLLSSIRNHEKMLREQ 0 0 AKKMNVKSLVSNQDKERSAEVRIAKVAFTIFFLFLLAWTPYATVALIGVYGNR 2 1 ELLTPVSTMLPAVFAKTVSCIDPWIYAINHPR 2 1 YRQELQKRCKWMGIHEPETTSDATSAQTEKIKTDE* 0 >UVB_acyPis Acyrthosiphon pisum (pea_aphid) Hemiptera 8 exons SCAFFOLD14509:21417-33525 62% UVV_apiMel V in K90 0 MDFNRSVSRPLSQLGS 2 1 SFMENEEELQLMGWNLTPEDLTHIPEHWLSYPEVRSLYHYILAFSYTILFCLGVIGNGLVLWIFCV 0 0 
SKPLRTPSNLFVLNLALCDFSMVLVLPILIYDSIDHKYP GHLQCQIFALCGSISGIGAGATNAAIAYDRYS 2 1 TIAKPFEGRMTYGKALILIICIWIYVLPWCLLPLTEKWNRFVP 1 2 EGFLTSCSFDYLTPTEETKAFVGTMFVICYVIPMSFIIYFYSQIVCHVFNHEKALREQ 0 2 AKKMNVESLRSNQDANAQSAEVRIAKAAITICFLFVAAWTPYAVVAMIGAFGDQ 2 1 SLLTPIASMLPAVFAKTVACFDPYVYAISHPKYR 2 1 LELSKRVPCLGITEKPLATSDTQSITTAA* 0 >UVB_nasVit Nasonia vitripennis (jewel_wasp) Hymenoptera XM_001604572 ES636068 0 MAFVGLNGAMGGMGPA 1 2 EKPLQRYSQGPQMQEHLLGWNHPPEHIDIVHPHWRGFLAPGKYWHIGLALIYFMLLVLSFVGNGCVVWIFST 2 1 SKVLRTPSNLFIINLALFDLVMALEIPMLIINSFIERMIGWGLGCDIYAALGSVSGIGSAITNAAIAYDRYR 2 1 TISCPIDGRLNGKQAAVMVAFTWFWTMPFTILPFAKIWGRYTT 1 2 EGFLTTCSFDFLSDDQDTKVFVAAIFSWSYCFPMVLIIYFYSQLIKSVRRHEKMLREQ 0 0 AKKMNVKSLSAQDKERSVEMRIAKVAFTIFFLFVCSWTPYAVVTMIAAFGNR 2 1 ELVTPFSSMLPAVFAKTVSCIDPWVYAINHPR 2 1 YRQELTKRCQWMGIHEPDSGPSQNNAEAVSVTTEKLKSDDA* 0 >UVB_manSex Manduca sexta (moth) Lepidoptera exons from 454 AD001674 450 Arthropoda Insecta complete MATNFTQELYEIGPMAYPLKMISKDVAEHMLGWNIPEEHQDLVHDHWRNFPAVSKYWHYVLALIYTMLMVTSLTGNGIVIWIFST SKSLRSASNMFVINLAVFDLMMMLEMPLLIMNSFYQRLVGYQLGCDVYAVLGSLSGIGGAITNAVIAFDRYK TISSPLDGRINTVQAGLLIAFTWFWALPFTILPAFRIWGRFVP EGFLTTCSFDYFTEDQDTEVFVACIFVWSYCIPMALICYFYSQLFGAVRLHERMLQEQ AKKMNVKSLASNKEDNSRSVEIRIAKVAFTIFFLFICAWTPYAFVTMTGAFGDR TLLTPIATMIPAVCCKVVSCIDPWVYAINHPR YRAELQKRLPWMGVREQDPDAVSTTTSVATAGFQPPAAEA* 0 >UVB_megVic Megoura viciae (vetch_aphid) Hemiptera AF189715 MDFNRSVSRPLSQLGSSFMENEDELQLMGWNLTPEDLTHIPEHWLSYPEVRSLYHYILAFSYTILFCIGVI GNGLVLWIFCVSKPLRTPSNLFVLNLALCDFSMVLVLPILIYDSIDHKYPGHLQCQIFALCGSISGIGAGATNAAIAYDRYSTIAKP FEGRMTYGKALILIICIWIYVLPWCLLPLTEKWNRFVPEGFLTSCSFDYLTPTEETKAFVGTMFVICYVIPMSFIIYFYSQIVCHVFNHEKALREQAKKMNVESLRSNQDANAQ SAEVRIAKAAITICFLFVAAWTPYAVVAMIGAFGDQSLLTPIASMLPAVFAKTVACFDPYVYAISHPKYRLELSKRVPCLGITEKPPAASDTQSITTAA* 0 >UVB_diaNig Dianemobius nigrofasciatus (cricket) Orthoptera MNSSVGLQGAPIALPYESYVAQMLGWNIPAEHIELVHSHWRGYEAPSKYWHYWFAFMYFCIMIMSCLGNGIVLWIFATTKSLRTPSNMFVVNQALLDLLMMI 
EMPMFVLNSLFYQRPIGWEMGCDIYALLGAVSGIGSAINNAAIAYDRYRTISFPLDGRLQFGHALAFIVGVWSWAMPFSLLPLLKVWGRYVPEGLLTTCSFDYLTDDEDTKVFTAS IFTWSYAFPLCLIVFFYCKLFKQVRLHEKMLQEQARKMNVKSLQTNQDVAQKSVEIRIAKVAFTIFFLFLCSWTPYATVAMIGAFGNRALLTPMSTMIPALFSKIVSCIDPWIYAI NHPRFRGELLKRAPWFGVEELKSSDVSSIGTDRTTATAAIETPAA* 0 >LMS1_droMel Drosophila melanogaster (fruitfly) Diptera CG4550-RA 0 ME 00 SFAVAAAQLGPHFAPLSNGSVVDKVTPDMAHLISPYWNQFPAMDPIWAKILTAYMIMIGMISWCGNGVVIYIFATTKSLRTPANLLVINLAISDFGIMITNTPMM GINLYFETWVLGPMMCDIYAGLGSAFGCSSIWSMCMISLDRYQVIVKGMAGRPMTIPLALGKIAYIWFMSSIWCLAPAFGWSR 2 1 YVPEGNLTSCGIDYLERDWNPRSYLIFYSIFVYYIPLFLICYSYWFIIA 0 0 AVSAHEKAMREQAKKMNVKSLRSSEDAEKSAEGKLAKVALVTITLWFMAWTPYLVINCMGLFKFEGLTPLNTIWGACFAKSAACYNPIVYGIS 2 1 HPKYRLALKEKCPCCVFGKVDDGKSSDAQSQATASEAESKA* 0 >LMS6_droMel Drosophila melanogaster (fruitfly) Diptera CG5192-RB gross genomic misassembly exon1 0 MASLHPPSFAYMRDGRNLSLAESVPAEIMHMVDPYWYQWPPLEPMWFGIIGFVIAILGTMSLAGNFIVMYIFTSSKGLRTPSNMFVVNLAFSDFMMMFTMFPPVVLNGFYGT WIMGPFLCELYGMFGSLFGCVSIWSMTLIAYDRYCVIVKGMARKPLTATAAVLRLMVVWTICGAWALM PLFGWNRYVPEGNMTACGTDYFAKDWWNRSYIIVYSLWVYLTPLLTIIFSYWHIMK 0 0 AVAAHEKAMREQAKKMNVASLRNSEADKSKAIEIKLAKVALTTISLWFFAWTPYTIINYAGIFESMHLSPLSTICGSVFAKANAVCNPIVYGLS 2 1 HPKYKQVLREKMPCLACGKDDLTSDSRTQATAEISESQA* 0 >LMS_anoGam Anopheles gambiae (mosquito) Diptera XM_319247 most introns obliterated 0 MPYYGPMQQPGLWGQPVANLTVVDKVPPEIMHLVDPHWSQFPPMNPLWHSIIGFVIFVLGVVSIIGNGMVIYIFSTAKSLRTPSNLFIVNLALSDFLMMGTN AFTMVYNCWFETWSLGLLMCDLYAFFGSLFGCCSIWTMTMIALDRHNVIVHGLSGKPLTNTGAILRILLCWLIGVVWGILPMLGWNRYVPEGNMTACGTDYLTDDWFHKSYILVYS VFVYYTPLFTIIYAYFFIIK 0 0 AVSAHEKNMREQAKRMNVQSLRSSDDGKSTEMKLAKVALVTISLWFMAWTPYTVINYTGVFKTASITPLATIWGSVFAKANAVYNPIVYGISHPKY RAALLRRFPSLACSDGPPADDKSLASEASGITSAGNPTTA* 0 >LMS_rhoPro Rhodnius prolixus (kissing_bug) Hemiptera 0 MAQPIGPSFAAYQWGQSANPSANRSVVDMVPPEMLSMVDAHW 2 1 YQFPPLNPLWHGILGFVIGVLGIISIVGNGMVIFIFSSTKTLRTPSNLLVVNLAFSDFLMMFTMSPPMVINCYNETWVL 1 2 GPLMCELYGMLGSLFGCASIWTMTMIALDRYNVIVK 0 0 GISAKPMTNKTAMLRILLVWAFSIMWTVFPFFGWNR 2 1 
YVPEGNMTACGTDYLTKNWVSRSYILVYSVFVYFLPLFTIIYSYFFILQ 0 0 AVSAHEKQMREQAKKMNVASLRSAEAANTSAEAKLAK VALMTISLWFMAWTPYLVINYSGIFETISISPLFTIWGSLFAKANAVYNPIVYAIR 2 1 HPKYKQALEKKFPSLSCASPQDDTTSVATGVTTSTDDKAPSA *0 >LMS_acyPis Acyrthosiphon pisum (pea_aphid) Hemiptera SCAFFOLD6053:23617,25535 0 MLNKIGSHYERQENWVAEGGFGNETVVDRVPADMMHLIDPSW 2 1 YQFPPMESMWYKWLGVTIFFLGILSVVGNGMVIYIFTCTKNLRTPSNLLIVNLAFSDFCLMFTMCPAMVWNCFYETWMF 1 2 GPFACELYAMFGSLFGVTSIWTMVFIALDRYNVIVK 0 0 GLSAKPMTTKLALLQIFCIYLHGLFWTLTPFFGWSR 2 1 YVPEANMTACGTDYLTLAWHSRSYVLVYAIFAYYLPLLVIIYAYYFIVK 0 0 AVASHEKSMREQAKKMNVSSLRSGDQSNTSAEFKLAKVALMTISLWFMAWTPYMVINFAGIFQLMTIDPLFTIWGSVFAKANAVYNPIVYAIS 2 1 HPKYRLALDKKFPCLVCGKLEDDRSDSKSVASAQTTISEDKV* 0 >LMS2_droMel Drosophila melanogaster (fruitfly) Diptera M12896 CG16740-RA Rh2 complete ocellar-specific 0 MERSHLPETPFDLAHSGPRFQAQSSGNGSVLDN 0 0 VLPDMAHLVNPYWSRFAPMDPMMSKILGLFTLAIMIISCCGNGVVVYIFGGTKSLRTPANLLVLNLAFSDFCMMASQSPVMIINFYYETWVLGPLWCDIYAGCGSLFGCVSIWSMC MIAFDRYNVIVKGINGTPMTIKTSIMKILFIWMMAVFWTVMPLIGWSAYVPEGNLTACSIDYMTRMWNPRSYLITYSLFVYYTPLFLICYSYWFIIAAVAAHEKAMREQAKKMNVKSL RSSEDCDKSAEGKLAKVALTTISLWFMAWTPYLVICYFGLFKIDGLTPLTTIWGATFAKTSAVYNPIVYGIS 2 1 HPKYRIVLKEKCPMCVFGNTDEPKPDAPASDTETTSEADSKA* 0 >LMS_meoOer Neogonodactylus oerstedii (mantis_shrimp) Malacostraca DQ646869 489 Rh1 complete 0 MSYWNSNKIVEEYSLPSTNPYGNFTVVDTVPENMLHMIHSHWYQFPPLNPMWYGILAFVVTVVGLCSICGNFVVIWVFMNTKALRSPANTLVVSLAVSDFIM MACMFPPLVLNCYWGTWIFGPLFCEVYAFIGNTVGCASIGNMIFITFDRYNVIVKGISGTPLSQKNTTLQVLFVWICSIMWCVFPFFGWNRYVPRGDMTACGTDYLTEDEFSRSYL YVYSVWVYIGPLALIIYCYFHIVSAVATHEKQMRDQAKKMGVKSLRTEEAKKTSAECRLAKVALTTVSLWFMAWTPYLIINWAGMFYPSVVSPLFSIWGSVFAKANAVYNPIVYAI SHPKYRAALYKKLPCLACSTESADEGSATNSATTTTAEKYESA* 0 >LMSa_nasVit Nasonia vitripennis (jewel_wasp) Hymenoptera XM_001606013 GE417061 22063-23541 - strand of AAZX01007316 --><-- 0 MGPSFLTLTAMAQRGGYGGGGGFGGGFNNQTVVDKAPPEIHHMIDPYWYQFPPMNPLWYGILGFVIGCLGCISVAGNGMVVYIFASTKSLRTPSNLLVINLAFSDFCMMFTMSPPM 0 0 
VINCYYETWVFGPLMCEIYALCGSIFGCGSIWTMCMIAFDRYNVIVKGLSAKPMTINGSLLRILGIWLMASIWTIAPMFGWNR 2 1 YVPEGNLTACGTDYFSKDWVSRSYIVVYSFFVYFLPLFMIIYSYYFIIKAVSAHEKNMREQAKKMNVASLRQGDSQSAENKLAK 0 0 IALMTISLWFMAWTPYLVINWAGIFDLARLTPLFTIWGSVFAKANAVYNPIVYGIS 2 1 HPKYRAALFARFPSLACAGDAPAGAASDAVSTTSGVTTLTDHDKSNA* 0 >LMSb_nasVit Nasonia vitripennis (jewel_wasp) Hymenoptera tandem pair to LWSa, fairly diverged 19237-21046 + strand of AAZX01007316 0 MEHPIVAAGVNATGEFDASSGSASSTTTMVTTAAVQVASTIGPHFARQVMRGFGNLTVVDKVPPEMLHLVGPHW 2 1 YQFPPLWPIWHKLLGVVMIFIGVLGWCGNGMVVYIFLVTPSLRTPSNLLVINLAFSDFVMMIIMSPPMVVNCWYETW 0 0 ILGPLMCDIYALIGSLCGGASIWTMTAIAYDRYNVIVK 0 0 GMSGTPLTIPRALVQIVLIWTHGLIWAMLPLFGWNR 2 1 YVPEGNMTSCGTDYVSDDWLGKSYILVYSIFVYYTPLFSIILCYWHIVS 0 0 AVAAHERGMREQAKKMNVASLRSGDQSGESAEVKLAK 0 0 VAVTTISLWFLAWTPYLVTNYMGIFAKQHVSPLFTIWASLFAKTNACYNPIVYGIS 2 1 HPKYRAGLKVKCPCLVFGDTEDKPKPAAATPAADAASTHSKA* 0 >LMS_manSex Manduca sexta (moth) Lepidoptera L78080 Manop1 520 Arthropoda Insecta meagre 454 coverage complete 0 MDPGPGLAALQAWAAKSPAYGAANQTVVDKVPPDMMHMIDPHWYQFPPMNPLWHALLGFTIGVLGFVSISGNGMVIYIFMSTKSLKTPSNLLVVNLAFSDFL MMCAMSPAMVVNCYYETWVWGPFACELYACAGSLFGCASIWTMTMIAFDRYNVIVKGIAAKPMTSNGALLRILGIWVFSLAWTLLPFFGWNRYVPEGNMTACGTDYLSKSWVSRSY ILIYSVFVYFLPLLLIIYSYFFIVQAVAAHEKAMREQAKKMNVASLRSSEAANTSAECKLAKVALMTISLWFMAWTPYLVINYTGVFESAPISPLATIWGSLFAKANAVYNPIVYG ISHPKYQAALYAKFPSLQCQSAPEDAGSVASGTTAVSEEKPAA* 0 >LMS_lucCru Luciola cruciata (firefly) Coleoptera AB300328 MSVLGEPSFAAWASQAGVMSSRFGGGNITVVDKVPPDMLHLIDAHWYQYPPLNPLWHAILGFMIGVLGCISVTGNGMVIYIFSTTKSLRSPSNLLVVNLAFS DFLMMFTMAPPMVINCYNETWVWGPLFCQIYGMLGSLFGCTSIWTMTMIALDRYNVIVKGLSAKPLTKQGALIRIFLVWVFSIGWTIAPVFGWNRYVPEGNMTACGTDYLSTGWFS RSYILFYSWFVYFIPLFAIIYSYWFIVQAVSAHEKAMREQAKKMNVASLRSSEAAQTSAECKLAKVALMTISLWFLAWTPYLVTNYAGIFDGSKISPLATIWSSLFAKANAVYNPI VYGISHPKYRQALQKKFPSLVCAAEPDDTVSQTTAATAASEEKAAA* >LMS_limPol Limulus polyphemus (horseshoe_crab) Chelicerata Merostomata L03781 520 lateral_eye complete 
MANQLSYSSLGWPYQPNASVVDTMPKEMLYMIHEHWYAFPPMNPLWYSILGVAMIILGIICVLGNGMVIYLMMTTKSLRTPTNLLVVNLAFSDFCMMAFMMP TMTSNCFAETWILGPFMCEVYGMAGSLFGCASIWSMVMITLDRYNVIVRGMAAAPLTHKKATLLLLFVWIWSGGWTILPFFGWSRYVPEGNLTSCTVDYLTKDWSSASYVVIYGLA VYFLPLITMIYCYFFIVHAVAEHEKQLREQAKKMNVASLRANADQQKQSAECRLAKVAMMTVGLWFMAWTPYLIISWAGVFSSGTRLTPLATIWGSVFAKANSCYNPIVYGISHPR YKAALYQRFPSLACGSGESGSDVKSEASATTTMEEKPKIPEA* >LMS_ixoSca Ixodes scapularis (tick) Chelicerata Arachnida ocellar TC19272 UP|OPSO_LIMPO 0 MGSEGQRTNMSLLDELASPYMKNGTLVESVPDEMLYMVHPHWYNFKPMNPLWHSLLGFAMVILGVISVVGNSMVIYIMTTSKSLRSPTNMLVVNLAFSDW 2 1 CMMAFMMPTMAANCFAETWILGPFMCEVYGMVGSLFGCGSIWSMVMITLDRYNVIVRGVAAAPLTHKRAALMIFFVWFWALTWTLLPFFGWSR 2 1 YVPEGNMTSCTIDYLTKALWSASYVVAYAGGVYWTPLFINIYCYSKIVRAVAQHEKQLRLQARKMNVASLRANAEQTKTSAEARLAK 0 0 IALMTVGLWFMAWTPYLTIAWAGIFSDGSKLTPLATIWGSVFAKANACYNPIVYGISHPKYRAALARRFPSLVCMPPGGDQLDTRSEASGITTIEDKVMTTET* 0 >LMSa_apiMel Apis mellifera (bee) Hymenoptera Gq 386 aa 16291092 NM_001077825 rhabdomeric AmLop2 long wavelength ocelli not compound 0 MDTLNITTSFFIEVMPSNISTLTTTGPQFARQLMRFNNQTVVSKVPEEMLHLIDLYW 2 1 YQFPPLDPLWHKILGLVMIILGIMGWCGNGVVVYVFIMTPSLRTPSNLLVVNLAFSDFIMMGFMCPPMVICCFYETW 0 0 VLGSLMCDIYAMVGSLCGCASIWTMTAIALDRYNVIVK 0 0 GMSGTPLTIKRAMLQILGIWLFGLIWTILPLVGWNR 2 1 YVPEGNMTACGTDYLSQDWTFKSYILVYSFFVYYTPLFTIIYSYYFIVS 0 0 AVAAHEKAMKEQAKKMNVTSLRSGDNQNTSAEAKLAK 0 0 VALTTISLWFMAWTPYLVINYIGIFNRSLITPLFTIWGSLFAKANAIYNPIVYGIS 2 1 HPKYRAALKEKLPFLVCGSTEDQTAATAGDKASEN* 0 >LMSb_apiMel Apis mellifera (bee) Hymenoptera U26026 529 5 exonsArthropoda Insecta 540 complete genNow 0 MIAVSGPSYEAFSYGGQARFNNQTVVDKVPPDMLHLIDANWYQYPPLNPMWHGILGFVIGMLGFVSVMGNGMVVYIFLSTKSLRTPSNLFVINLAISDFLMMFCMSPPM 0 0 VINCYYETWVLGPLFCQIYAMLGSLFGCGSIWTMTMIAFDRYNVIVKGLSGKPLSINGALIRIIAIWLFSLGWTIAPMFGWNR 2 1 YVPEGNMTACGTDYFNRGLLSASYLVCYGIWVYFVPLFLIIYSYWFIIQAVAAHEKNMREQAKKMNVASLRSSENQNTSAECKLAK 0 0 VALMTISLWFMAWTPYLVINFSGIFNLVKISPLFTIWGSLFAKANAVYNPIVYGIS 2 1 HPKYRAALFAKFPSLACAAEPSSDAVSTTSGTTTVTDNEKSNA* 0 >LMS_triCas Tribolium castaneum (flour_beetle) Coleoptera 
ES544655 3 exons from AAJJ01000967 5 fusion relative to bee 0 MSVMGEPNFIAWAAQRSGYGGGNLTVVDKVLPDMLHLVDAHWYQFPPMNPLWHGILGFVIGVLGFVSIVGNGMVIYIFSSTKALRTPSNLL VVNLAFSDFLMMlCMSPAMVINCYNETWVLGPLVCELYGMSGSLFGCASIWTMTFIALDRYNVIVKGLSAQPLTKKGAMLRILIIWVFSTLW TIAPFFGWNRYVPEGNMTACGTDYLTKDWVSRSYILVYAVWVYFVPLFTIIYSYWFIVQ 0 0 AVAAHEKSMREQAKKMNVASLRSSEAAQTSAECKLAKIALMTITLWFFAWTPYLVTNFTGIFEGAKISPLATIWCSLFAKANAVYNPIVYGIS 2 1 HPKYRQALQKKFPSLVCAGEPDDTTSTASGVTNVTTDEKPATA* 0 >LMS_papXut Papilio xuthus (butterfly) Lepidoptera AB007424 520 Arthropoda Insecta Rh2 complete 0 MAIANLEPGMGASEAWGGQAAAFGSNQTVVDKVTPDMMHLIDPHWYQFPPMNPMWHGLLGFTIGVLGFISITGNGMVVYIFTSTKSLKTPSNLLVVNLAFSD FLMMLCMAPPMLINCYYETWVFGPLACELYACAGSLFGSISIWTMTMIAFDRYNVIVKGIAAKPMTINGALLRILGIWLFSLAWTIAPMLGWNRYVPEGNMTACGTDYLSKSWLSR SYILVYSIFVYYTPLLLIIYSYFFIVQAVAAHEKAMREQAKKMNVASLRSSEAANTSAECKLAKVALMTISLWFMAWTPYLVINYTGVFETAPISPLATIWGSVFAKANAVYNPIV YGISHPKYRAALYQKFPSLACQPSAEETGSVASGATTACEEKPSA* 0 >LMS_homCoa Homalodisca coagulata (sharpshooter) Hemiptera AY588065 Paraneoptera MSLISEPSFSAYSWASQGGFGNQTVVDKVPPEMLYLVDAHWYQFPPMNPLWHSLLGFAMVVLGFIAVTGNGMVVYIFSCTKALRTPSNLLVVNLAFSDFLMM FTMAPPMVLNCYYETWVLGPFMCELYAMFGSILGCTSIWTMVMIANDRYNVIVKGLSAKPMTIKSALARILFCWAHSLIWCLAPFLGWGRYVPEGNMTACGTDYLTPDWISKSYIL VYSLFCYFMPLFLIIYSYWFIVQAVSAHEKAMREQAKKMNVASLRSSDAANTSAEHKLAKVALMTISLWFCAWTPYLVINYAGIFQALTISPLFTIWGSVFAKANACYNPIVYAIS HPKYRAALNKKFPSLVCGATEAPASTSDGASVASGATTLTEDKSAAA* >LMS_schGre Schistocerca gregaria (locust) Orthoptera X80071 520 Arthropoda Insecta complete 0 MASASLISEPSFSAYWGGSGGFANQTVVDKVPPEMLYLVDPHWYQFPPMNPLWHGLLGFVIGVLGVISVIGNGMVIYIFSTTKSLRTPSNLLVVNLAFSDFL MMFTMSAPMGINCYYETWVLGPFMCELYALFGSLFGCGSIWTMTMIALDRYNVIVKGLSAKPMTNKTAMLRILFIWAFSVAWTIMPLFGWNRYVPEGNMTACGTDYLTKDWVSRSY ILVYSFFVYLLPLGTIIYSYFFILQAVSAHEKQMREQRKKMNVASLRSAEASQTSAECKLAKVALMTISLWFFGWTPYLIINFTGIFETMKISPLLTIWGSLFAKANAVFNPIVYG ISHPKYRAALEKKFPSLACASSSDDNTSVASGATTVSDEKSEKSASA* 0 >LMS1_plePay Plexippus paykulli (jumping_spider) Chelicerata Arachnida PpRh1 kumopsin1 AB251849 
MLPQAAKMAARASSGVDGKNISIVDLLPEDMLYMIHEHWYKYPPMESTMHYLLGITIILIGIISVS GNSIVIYLMLSVKSLRTPANFLVTSLAVSDGGMLAFMAPTMPINCFAQTWVLGPFMCELYGMVGSLFGSASIWNMVMITLDRYNVIVRG MSGKPLTKVGALLRIIFVWVWSLGWTIAPMYGWSSYAPEGSMTGCTVDYLHTDISTMSYLIVY AIFVYFVPLFIIIYCYTYIVMQVAAHEKSLREQAKKMNIKSLRSNEDNKKASAEFRLAKVALMTICLWFMAWTPYLILSLLGIFSDREWLTPLTSIWGAVFAKAASAYNPIVYGIS HPKYRAALHEKFPCLNCATESPKGDSASTVAESDKGGD* >LMS2_plePay Plexippus paykulli (jumping_spider1) Chelicerata Arachnida PpRh2 kumopsin2 AB251850 MSSQIINGAYMVSRDALGLHLPTNLGGPLPQDNSYYPYLRNTTVVDTVPKEILHMIHDHWYQFPPLNPLWHSLLGIAMILLGIVSVI GNGMVMYLMNTTKSLKTPTNMLIVNLAFSDFCMMAFMMPTMAANCFAETWILGPFMCEIYGMAGSLFGCVSIWSMVMIAFDRYNVIVRG MNAEPLTTKKAAAQIFLIWAWAIMWTVLPFFGWSRYVPEGNM TSCTVDYLSEDLKSSSYVLIYGCAVYFIPLFTLIYNYTFIVRAVSIHEDNLREQAKKMNVTSLRANADQQKQSAECRLAKIALMTVGLWFIAWTPYLCIAWSGIFSSRKHLTPLAT IWGAVFAKAVAVYNPIVYGISHPKYRAALFQKFPSLACTTESDVIDNKSEVTFVTDEKPPKTQEA* >LMS1_hasAda Hasarius adansoni (jumping_spider) Chelicerata Arachnida HaRh1 kumopsin1 AB251846 MLPHAAKMAARVAGDHDGRNISIVDLLPEDMLPMIHEHWYKFPPMETSMHYILGMLIIVIGIISVS GNGVVMYLMMTVKNLRTPGNFLVLNLALSDFGMLFFMMPTMSINCFAETWVIGPFMCELYGMIGSLFGSASIWSLVMITLDRYNVIVKG MAGKPLTKVGALLRMLFVWIWSLGWTIAPMYGWSRYVPEGSMTSCTIDYIDTAINPMSYLIAY AIFVYFVPLFIIIYCYAFIVMQVAAHEKSLREQAKKMNIKSLRSNEDNKKASAEFRLAKVAFMTICCWFMAWTPYLTLSFL GIFSDRTWLTPMTSVWGAIFAKASACYNPIVYGISHPKYRAALHDKFPCLKCGSDSPKGDSASTVAESEKAGE* >LMS2_hasAda Hasarius adansoni (jumping_spider) Chelicerata Arachnida HaRh2 kumopsin2 AB251847 MSSHTINSAFMVPRDVLGLHLPNNLGGPLPHDNSYYPYLRNATVVDTVPKEILHMIHDHWYQFAPLNPLWHSLLGIAMIILGIVSVI GNGMVIYLMSTTKSLKTPTNMLIVNLAFSDFCMMAFMMPTMAANCFAETWILGPLMCEIYGMAGSLFGCVSIWSMVMIAFDRYNVIVRG MSAEPLTTKKAAAQIFFIWTWATTWTLFPFFGWSRYVPEGNMTSCTVDYLTEDLKSSSYVLIYGCAVYFTPLFTLIYNYTFIVRSVSIHENNLREQAKKM NVSSLRANADQQKQSAECRLAKIALMTVGLWFIAWTPYLSIAWSGIFSSRKHLTPLATIWGAVFAKAVAVYNPIVYGISHPKYRAALFEKFPSLACTTESDVTDNKSEVTLVTDEKPPKTQEA* >BCRa_hemSan Hemigrapsus sanguineus (crab) Malacostraca BcRh1 D50583 PUBMED 9318091 compound eye R1-R7 blue-green 480nm Crustacea complete 0 
MANVTGPQMAFYGSGAATFGYPEGMTVADFVPDRVKHMVLDHWYNYPPVNPMWHYLLGVVYLFLGVISIAGNGLVIYLYMKSQALKTPANMLIVNLALSDLI MLTTNFPPFCYNCFSGGRWMFSGTYCEIYAALGAITGVCSIWTLCMISFDRYNIICNGFNGPKLTQGKATFMCGLAWVISVGWSLPPFFGWGSYTLEGILDSCSYDYFTRDMNTIT YNICIFIFDFFLPASVIVFSYVFIVKAIFAHEAAMRAQAKKMNVTNLRSNEAETQRAEIRIAKTALVNVSLWFICWTPYAAITIQGLLGNAEGITPLLTTLPALLAKSCSCYNPFV YAISHPKFRLAITQHLPWFCVHEKDPNDVEENQSSNTQTQEKS* 0 >BCRb_hemSan Hemigrapsus sanguineus (crab) Malacostraca BcRh2 D50584 compound eye R1-R7 blue-green 480nm 75% BcRh1 identical Crustacea complete 0 MTNATGPQMAYYGAASMDFGYPEGVSIVDFVRPEIKPYVHQHWYNYPPVNPMWHYLLGVIYLFLGTVSIFGNGLVIYLFNKSAALRTPANILVVNLALSDLI MLTTNVPFFTYNCFSGGVWMFSPQYCEIYACLGAITGVCSIWLLCMISFDRYNIICNGFNGPKLTTGKAVVFALISWVIAIGCALPPFFGWGNYILEGILDSCSYDYLTQDFNTFS YNIFIFVFDYFLPAAIIVFSYVFIVKAIFAHEAAMRAQAKKMNVSTLRSNEADAQRAEIRIAKTALVNVSLWFICWTPYALISLKGVMGDTSGITPLVSTLPALLAKSCSCYNPFV YAISHPKYRLAITQHLPWFCVHETETKSNDDSQSNSTVAQDKA* 0 >BCR_triGra Triops granarius (tadpole_shrimp) Branchiopoda RhA BAG80976 AB293428 PUBMED 18984904 0 MAAYTEAWNASEEILVRMARAVPSVAWGYPAGVSIADLVPSDMKTMVHSHWNKFPPVNPMWHYLLGMVYIILGTVSIAGNSLVISLFTKTKELRTPANMFVVNLAFSDLCMMITQFPMFVYNCFNGGMWLFGPFLCELYA ATGAVFGLCSICTLACIAFDRYNLIVKGMSGPKMTSKRATILIAFCWAYAIGWSLPPFFGWGRYIPEGILDSCSFDYLTRDSSTKSFGLCLFFFDYVTPLSIIVFAYFHIVRAIFEHEKILREQAKKMNVTSLRSNADQN AQSAEIRIAKVALINISLWVAMWTPYATIVLQGLLGNQENITPLVSILPALIAKSASIYNPVIYAISHPRYRVALQQKLPWFCIHEEEKKPISDTDSAKTEASSS* 0 >BCR2_triLon Triops longicaudatus (tadpole_shrimp) Branchiopoda RhA BAG80981 AB293433 PUBMED 18984904 0 MATYTEAWNASEEILVRMVRAAPSVAWGYPTGVSIVDLVPSDMKTMVHSHWSKFPPVNPMWHYLLGLVYIVLGTVSIAGNSLVISLFTKTKELRTPANMFVVNLAFSDLCMMITQFPMFVYNCFNGGMWLFGPFLCELYA ATGAVFGLCSICTLACIAYDRYNLIVKGMSGPKMTSKRATILIAFCWSYAIGWSLPPFFGWGRYIPEGILDSCSFDYLTRDSSTKSFGLCLFFFDYITPLSIIVFAYFHIVRAIFEHEKILREQAKKMNVTSLRSNADQN AQSAEIRIAKVALINISLWVAMWTPYATIVLQGLLGNQENITPLVSILPALIAKSASIYNPVIYAISHPRYRIALQQKLPWFCIHEEEKKPISISDTDSAKTETSSS* 0 >BCR_porPel Portunus pelagicus (sand_crab) Malacostraca EF110527 horrible distal frameshifts 0 
MANSTGPQMAFYGSQDMTYGYPEGVSIVDFVRPEIKPYVHQHWYNYPPVNPMWHYLLGVIYLCLGFISIIGNGMVIYLFAKCQALRTPANILVVNLALSDLI MLTTNVPFFTYNCFNGGVWMFSATYCEIYGCLGAITGVTSTWLLCMISFDRYNIICNGFNGPKLTNGKAIILAFISWAISVGFGIAPLFGWGKYILEGILTSCSYDYLTQDFNTRS YNIIIFVFDYFLPAAIIIFSYVFIVKAIFAHEAAMRAQAKKMNVTnLRSGEAESQRAEIRIARTALVNVSLWFICwTPYALISLQgvlgdlsginlLVTTLPALLARSCSW >BCR_limPol Limulus polyphemus (horseshoe_crab) Chelicerata Merostomata FJ791252 ventral eye MSTGSYFIGNSTAPRSSGWWSYDPGLSVRDTAPENIKHLISDHWSKFPAVNPMWHYLLGLIYIVLGIASLTGQSVVLYLFAKTKPLRTPANMLIVNLAFSDF MMMITQFPVFIINCLGGGAWQLGPLLCEITGFAGGLFGYGSIVTLAVISIDRYNVIVRGFSASPLTHARSAVFILVIWAWTLGWALPPFFGWGRYVPEGILNSCSFDYLTRDWATV SYIMGCWICEYALPLMVIIYCYIFIVKAVCDHERHLREQAKKMNVASLRSNVDTQKASAEMRIAKVALVNVLLWVVSWTPYAAIAMIGIAGDQMLITPLRSALPALAGKAASVYNP IVYAISHPKFRLAMQKEIPCCCINEPQPQSDTSSEMSTKTSVATVNGEDSTAGGTTNN* >BCR1_triGra Triops granarius (tadpole_shrimp) Branchiopoda BAG80979 MANASHYEALQQEFNPWALPESFTLYAYAPEDVRAFLHPHWHNFPATHPAIYYLFGLVYLVLGVTSVG GNYLVLRIFTKFQELRRPSNVLVINLALSDMLLMLTLFPECVYNFLSGGPWRFGDLGCQIHAFCGALFGYNQ ITTLVFISYDRFNVIVRGMGGTPLTYARVSAMVAFSWLWATGWSVAPLVGWGGYALDGMLGTCSFDYVTR TWNNRSHILAATAFMWVIPVLIIAGCYWFIVQAVFKHEAELKAQAKKMNVASLRSNADQQQVSAEIRIAK VAITNVVLWLSAWTPFMVISNLGIWADPQQVTPLVSSLPVLLSKTSCSYNPLVYAISHPKYRECLKTLVPWICIVLPNDRRGGDNVSSSSSRTEASGKAETVDA* >BCR2_triGra Triops granarius (tadpole_shrimp) Branchiopoda BAG80977 MSSGVFNSTDPIALARVSAGSNAHQQVGYNILIKTDGLSVRDVAPLDMHHLLHSHWDAYPPADPRIHYLL GMLYFFLGIAACMGNVLVLHIFGKHKNLRSPTNTLLMNLAFCDLMIFIGLYPEMLGNIFMNDGTWMWGDV ACRIHAWFGLVFGFGQMQTLMYMSIDRYNVIVKGLSAQPLTYKKVTQWLAQVWIVSLFWGTAPFFGFGNF ALDGILNTCSFDYFSRDMLSMSYIVSACVWAYVIPLIVIIFCYTFIVRAVFEHEETLRQQAAKMNVTSLR SSANSEDTSAEFRIAKIAMINVCLWLWAWSPFTIVSFIGIFGNQAIITPYLSSLPVILAKTSSVYNPIVYALSHPRYQAALKEEFAWLCVKTNSGNSGSSDTKSSVTMESSQPA* >BCR3_triGra Triops granarius (tadpole_shrimp) Branchiopoda BAG80980 MMHNFSEPRYEAQVVRYGDFAPGVSVRDMAPENVRYMVHLHWEKFPPPDPRVHTALGALYLIMGVMSAVG NVLVLYIFGKYKSLRSPTNVLVMNLAFCDLGLFVGLYPELLGNIFINNGPWMWGDVACKIHAWCGLAFGF 
GQMQTLMFVSMDRYYVIVKGLKAPPLTYWKVSVWLAMVWIVSIFWATSPFFGFGNLSVDGLLNTCSYDYY TRDLPTVAYIVGSCVHAYVLPLAVIIFCYSYIVQAVFHHERQLREQAAKMNVASLRSSGGKQDEMSAEFR IAKIALINCCLWLWAWTPFTVISFMGVLHDDQSIINPYVSSLPVLLAKTSAVYNPIVYGLSHPKFQQCLREEFGWNIGLPKKKDNDSKSVTSVETAMT* >BCR1_triLon Triops longicaudatus (tadpole_shrimp) Branchiopoda BAG80982 47% CHEL_MWS_limPol MSSSGFNSTDPIALARVSAGSNAHQQVGYNILIKTDGLSVRDVAPLDMHHLLHSHWDSYPPADPRIHYLL GMLYFFLGIAACVGNVLVLHIFGKHKNLRSPTNTLLMNLAFCDLMIFIGLYPEMLGNIFMNDGTWMWGDI ACRLHAWFGLVFGFGQMQTLMYMSIDRYNVIVKGLSAQPLTYKKVTQWLAQVWIVSLFWGTAPFFGFGNF ALDGILNTCSFDYFTRDMPAMSYIVGACVSAYVIPLIVIIVCYTFIVRAVFEHEETLRQQAAKMNVTSLR SSASAEDTSAEFRIAKIAMINVCLWLWAWSPFTIVSFIGIFGNQAIITPYLSSLPVILAKTSSVYNPIVYALSHPKYQAALKEEFAWLCVKTNAGNSGSSDTKSSVTMESNQPA* >BCR2_braKug Branchinella kugenumaensis (fairy_shrimp) Branchiopoda Rhd AB293438 BAG80986 MLNNSEPSFAAYSVADGIWYPAGTKQIDGAPADVIAMTHAHWKQFPPSNPAWNYLFGVIYFFLWIVNHI GNGLVIWIFLKTKSLRTPSNMLIVNLAIADFFMMLTQSPLYIISAFTSRWWIWGHFWCRFYGYTGGITGIA AIFTMVFIGYDRYNVIVKGMNGTKITKGMAFIMILWTWIYANAFCLPAMLEVWGNFSPEGLLSTCSFDYL NDNKFHGYFYTMYIFTGAYCVPMLLLMFFYSQIVKAVWAHEASSRAQAKKMNVESLRSNADANAESAEMR IAKVALTNVLLWVCIWTPYAFVAVTGAFGNRQILTPLVAQLPSLICKMASCLNPLVYAISHPKYRQVLQK ELPWFCIHEPEDKKSDATSVGSATTTATA* >BCR3_braKug Branchinella kugenumaensis (fairy_shrimp) Branchiopoda BAG80985 MLNFSEPRFAAYSVAEGVWYPPGTTQIDGAPADIVALTHAHWKKFPPSNPAWNYLFACLYFFLWVINHI GNGLVIKIFLKTKSLRTPSNMLIVNLAIADFFMMLTQSPLFIISAFSSRWWIWGHFWCRFYGYTGGITGIA AIFTLVFIGYDRYNVIVKGMSGKRISKGMAFGMIVWTWVYANVFCLPPMLQVWGDFSPEGMLSTCSFDYL NENRLHGPIFTGYIFFGAYCVPMFLLFFFYSQIVKAVWAHEAALKAQAKKMNVESLRSNADANAESAEVR IAKVALTNVLLWICIWTPYAFVAVTGAFGNRQILTPLVAQLPSLICKCASSLNPIVYAISHPKFRQVIQK DYPWFCIHEPESSADTKSVTSGQTQVAA* >BCRa_dapPul Daphnia pulex (water_flea) Branchiopoda NCBI_GNO_149114 RhA AB293433 0 MSNNLSSGYSSVAYRSEGASVLWGYPPGLSIVDLVPDDMKEFIHPHWNKFPPVNPMWHYL 21 LGVIYVILGITSVT 1 2 GNSLVVHLFAKTRDLRTPANMFVINLAFSDLCMMITQFPMFVFNCFNGGVWLFGPLFCELYACTGSIFGLCSICTMAAISYDRYNVIVNGMNRRRMTY 1 2 GRAGGLILFCWIYAIGWSIPPFVGWGKYIPEGILDSCSFDYLTRDTM 0 0 
TISFTCCLFAFDYCVPLIIIIFCYYHIVRAIVHHEDALRDQAKKMNVSSLRSNADQKSQSAEIRVAKIAMMNITLWVAAWTPYAAICLQGAVGNQDKITPLVTILPALIAKSASIFNPVVYAISHPKYRL 0 0 ALQKALPWFCIHEKEEKEPPQDRREDSQSIATTNTNSSDVSLP* 0 >MEL1_dapPul Daphnia pulex (water_flea) Branchiopoda NCBI_GNO_366144 no close homologs 0 MTSSNDSAGYLWAINATIWIIDDSNETLGIDWDDWDVSLWTQEQRQLLEHGGIPRQVHVALGVLLSFIVLFGFAANSTILYVFSR 2 1 FKRLRTPANVFIINLTICDFLACCLHPLAVYSAFRGRWSFGQT 1 2 GCNWYGMGVAFFGLNSIVTLSAIACERYIVITSSSCRPVVAKWRITRRQAQK 0V 0 VCAGIWLHCAALVSPPLLFGWSSYLPEGVLVTCSWDYTSRTLSNRLYYFYLLFFGFFLPVSVLTFCYAAIFRFILRSSKEITRLIMTSDGTTSFSKSTVSFRKRRRQTDVRTALI ILSLAILCFTAWTPYTIVSLIGQFGPVDEDGELKLSPMVTSIPAFLAKTAIVFDPLVYGFSSPQFRNSVRQILRQQSISSSGNAGNRAGPNNMAMARTAIQNSRASSHATVSSF SRNARMFPKDPLSKKTPNDPFVSTPLAVQQIPHFRLPTDVDINEQQFRRGIYANKSVSYWIDIIVLLQLGENLRKSCMKRKNSFKIPAGSIPQKNKLSNSRCSLLEDVSTHSLA LRQMIFRKEGELYLFHHQPSHNAELAANKMDHQGNNKRIRRRFSEADMMHRSGKCRKNLPVSTSFDQ* 0 >TMT_triCys Tripedalia cystophora (box_jelly) cubomedusae Schiff lysine EU310498 mRNA with predicted introns 0 MADHGRNTTSNDTNPISKIPLDDFDPQRFYNFYTFMGSFIAGSACCSFLLNGLVIAVLIKYIRTITNTNIIVLSMSCANILIPLLGSPLSATSSLMRKWQFGNGGCTWYGFINTLS 1 2 GISGIYHLTFLSFERFITIVLPLKRDTILSTKNIYIGLGILWVAAIGVAGAPVFGWCEYIKE 12 GVRTSCSVAWSSKENMNVFSYNLFMIFTVFLLPMLVIIYCNYRFIKEVSIMSTRA 0 0 RGLQGGDSEMTASASKAEKQLTIMVITMIIAFNIAWLPYTVVSMVFLTGYGDVVGPMGASVPSVFAKTSVIYNPVIYCLLNRS 0 0 FRKMLCGNSVEPE* 0 >CUBOP_carRas Carybdea rastonii (sea_wasp) cubomedusae AB435549 cubop pubmed:18832159 MGANITEILSGFLACVVFLSISLNMIVLITFYRLRHKLAFKDALMASMAFSDVVQAIVGYPLEVFTVVDGKWTFGMELCQVAGFFITALGQVSIAHLTALAL DRYFTVCRPFVATAIHGSMRNAGMVIFVCWFYASFWAVLPLVGWSNYDVEGDGMRCSINWADDSPKSYSYRVCLFVFIYLIPVLLMVATYVLVQGEMKNMRGRAAQLFGSESEAAL KNIKAEKRHTRLVFVMILSFIVAWTPYTFVAMWVSFFTKQLGPIPLYVDTLAAMLAKSSAMFNPIIYCFLHKQFRRAVLRGVCGRIVGGNAIAPSSTAVEPGQTLASGTAES* >ENCEPHa_nemVec Nematostella vectensis (anemone) anthozoa Schiff lysine no cdna complete 1 exon 306 aa best:ENCEPH4_braFlo scaffold_465_Cont27987 alt: Nemve1:219988 Nem1 0 
MIDNALGRHEANIVLGYYIAIFVIGFVTNTIVVIIFISSQRLHTTPNLILFSMSVCDWLMATMAKSVGIYGNARYWPTVGKVTCDYYAFATSAIGYASILHLAALAVEKRMAVVSPMTNSFNGRRMLVIIATLWGFAILW AVFPLIGWSSYGPEPGYVSCSITWYTTDHNNVSYIICVSVLFFLIPIVTMTFCFASIYHTIRNLSHEATARWGSDARATQETIRAKAKTAKMAFLMVMCFLFAWTPYAVVSLWTTFGDTHRIPALLGVLPSLFAKLSSCY NPIIYFFMYTKFRCAGKALLYQEHH* 0 >ENCEPHb_nemVec Nematostella vectensis (anemone) anthozoa Schiff lysine NC-extended 1 exon 275 aa best:ENCEPH4_braFlo scaffold_273_Cont21871 alt:Nemve1:130042 Nem3 0 MSNEALRRHEANIVLGYYIAIFVIGFVTNTIVVITFIFSKRLHTTPNLILFSMSVCDWLMAAMAKSVGIYGNARYWPTVGKVTCDYYAFATSAIGYASILHLAALAVEKRMAVASPMTNSLNERRMLVIIATLWGFAILWAVFPLIGWSSYGPEPGYV SCSITWYTTDHNNVSYIICISVLFFFVPIVTMTFSFASIYKAIRNISHEAIARWGSHARATQETIKAKAKTAKMAFLMVMCFLFAWTPYAVVSLWTTFGGTHRNPALLGVLPSLFAKLSSCYNPIIYFFMYTKFRRAAKL LFIKKVIRPTEAERSRVLSGIRTRAASTFAVKLTVHAEKGQQIAPNITPQGAVKGPVESIEDNLQKIETITPCTSV* 0 >ENCEPHc_nemVec Nematostella vectensis (anemone) Anthozoa Schiff lysine C-extended 1 exon 289 aa best: ENCEPH5_braFlo scaffold_11_Cont2404alt: Nemve1:85309 Nem2 0 MELFTSYHAITVMYSLLAAGAFVLNGIVLIIFLATRSLRTIPNMILLSMAWADWLMACLADAVGAYANANNWPSMVGGLCVYYGFITTALGLTSMIHLTALSVERFVTVTIPMTRPITETQMLLVVTFLWAFSFLWAIFP LVGWSSYGPEPGYAACSIAWYRQDLNNMSYILCLFMFFFFLPIVIMIACFSSIYFTVRKLTRDSMRRWGA SSDSTQQTLAAERKTAWMSFIMVLAFLFAWVPYAVVSLYASFGGVTTIPKLMSTLPAMLAKTSACYNPII YFFMYSKFRKAFQRFFFKNVITPSQTGGSSTINSASVIPTSFPRASYAVRSSSVLPSSTASHSFRSRE* 0 >ENCEPH_aneVir Anemonia viridis (symbiotic_anemone) frag:202-338 pubmed:19627569 KDAVARWGNKSPPTQQTMQAQKKTIRMSLVMVFAYLLAWTPYALTSLYSSFIASDITPLLSVMPALFAKLSSCYNPIIYFFMYSKFR KAAKKMIRRNLVGHDSNSGQGVSNTFATSFPRPISFLRYKRSAVAPLSDIPQVSSVDLPQVGRENDVTVQQDKASEINT* >MEL1_acrMil Acropora millepora (stony_coral) anthozoa 454 transcriptome shotgun assembly EZ013658 + 454 blastn, 302aa frag 40%/63% ENCEPHc_nemVec; 35%/57% MEL1_homSap HHTISFLYFLLALFSFSLNSVVILTFLLDRSLLFPANLIILSIAISDWLMSVVPNIMGGVANASNDLPFTDWSCTVFAFVATLLGLSNMLHHAAFALDRYMVITRPMRANHSMTRILAVIAFLWCFALTWSLFPLVGWSAYVREAGDVACSVNW 
QSDNPSDTSYMVCLFFFFYFVPLAIIVYCYVFMIRSVRFMTKNAQKIWGVRSAAALETVQATWKMAKIGLIMVVGFFVAWTPYAVVSFIIAFDSVKDIPTIAEIVPSMFAKTASVYNPIIYFFSYKSFRESLVKSWRRYRNRNNVWPL >Opsin_plePil Pleurobrachia pileus (sea_gooseberry) Ctenophora CU419614 fragment YFLAFFFGIAPLVGWCEYGPEGYGVSNSLMWNNLNDNNASYIICAMIIGYFFPLIIIMFCYRAIYRLVQAQLNTSVVKMTVSANVTSLSTNRNHIMLVQE RQLASTITFVILSFFFAWTPYVFVNIINMFSTLILKYRILATIPALFAKSSTLWWIIVYCLMDVRIKKACRKTCLRFKRSFRVWYFS
GPCR outgroup sequences
It is sometimes convenient to have a close-in outgroup to opsins selected from the roughly 100,000 GPCRs available at GenBank. The set below of 29 non-opsin GPCRs serves satisfactorily as a proxy for an exhaustive GPCR compilation. It was constructed by taking the best blast hits of each human opsin in turn against all other human GPCRs, then collating the lists and winnowing out repeated entries and too-recent gene family expansions. TACR2 (tachykinin receptor) and SSTR1 (somatostatin receptor) are the best single representatives. These are usefully supplemented with non-opsin GPCRs having determined 3D structures, some nearest neighbors in recent GRAFS classification trees, and two astonishingly close pre-opsins (called UROPS1 and UROPS2 here) from Trichoplax, an early-diverging metazoan lacking opsins.
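The collation step described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure actually used to build the set: the function name `collate_outgroup`, the input layout (one best-first hit list per opsin query), and the toy hit lists are all hypothetical.

```python
def collate_outgroup(best_hits, opsin_names, recent_paralogs=None):
    """Merge per-opsin best-hit lists into one non-redundant candidate list.

    best_hits       -- dict: opsin query name -> list of hit names, best first
    opsin_names     -- set of names that are themselves opsins (excluded)
    recent_paralogs -- optional dict: member of a recent family expansion ->
                       the single representative kept in its place
    Returns a dict: candidate outgroup gene -> list of opsin queries that hit it,
    in order of first appearance.
    """
    recent_paralogs = recent_paralogs or {}
    candidates = {}
    for query, hits in best_hits.items():
        for hit in hits:
            if hit in opsin_names:
                continue  # an opsin cannot serve as non-opsin outgroup
            representative = recent_paralogs.get(hit, hit)
            supporters = candidates.setdefault(representative, [])
            if query not in supporters:
                supporters.append(query)  # winnow repeated entries
    return candidates


# Hypothetical toy input; real lists would come from parsed blastp output.
toy_hits = {
    "RHO": ["OPN1SW", "SSTR1", "TACR2"],
    "OPN4": ["TACR2", "GALR1", "SSTR1"],
    "RGR": ["SSTR1", "SSTR2"],
}
toy_opsins = {"RHO", "OPN1SW", "OPN4", "RGR"}
toy_recent = {"SSTR2": "SSTR1"}  # assumed recent expansion, collapsed

outgroup = collate_outgroup(toy_hits, toy_opsins, toy_recent)
for gene, queries in outgroup.items():
    print(gene, len(queries), queries)
```

Genes supported by many opsin queries are the natural candidates for single representatives. Deciding which family expansions count as too recent remains a manual judgment here.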
Conserved outgroup residues shared with opsins evidently describe commonalities needed for generic GPCR structure and signaling but not specifically for photobiology. Departures from the norm in certain opsin classes might indicate they are no longer signaling or are signaling constitutively. Opsins also have conserved diagnostic residues and even regions not found in any non-opsin GPCR.
The aligned sequences below have been trimmed from full length at both ends to the earliest indications of conservation. This alignable region begins at the GN region of TM1 and extends to the FR motif just beyond TM7. Highly conserved residues are shown in red and less conserved residues in blue. The Schiff base lysine (position -16 relative to the FR end of TM7) does not occur outside of opsins. Note that many conserved patches in these GPCR are very similar to those of opsins, implying those residues have no utility in distinguishing opsins from non-opsins.
The origin of opsins is not fully understood. Opsins are not the 'original' GPCR (which are traceable, barely, to yeast), even within the 'rhodopsin' group R (or its Ralpha subgroup) of the GRAFS classification. Rather, they form a specialized set that arose later as the rhodopsin gene class underwent significant expansions. Besides opsins, that class contains the AMIN cluster (adrenalin, serotonin, dopamine, and histamine receptors) and the MECA branch (peptide and lipid binding receptors).
This expansion of the Ralpha class had largely taken place in the last common metazoan ancestor shared with Monosiga and Trichoplax (which do not contain opsins), implying the ancestral metazoan lacked them as well. The orphan receptors GPR21 and GPR52 form the immediate outgroup (within the 800 human GPCR) in an oft-cited 2003 study. These have isoleucine at the position corresponding to K296; their ligands are still not known as of Dec 2009. Conservation is high throughout deuterostomes; blast matches within opsins are restricted to molluscan melanopsins, suggesting Gq signaling.
The melatonin receptor MTNR1A emerges as a close relative to opsins. Curiously, it plays a key role in circadian rhythms and so needs to coordinate with opsin photosensors. However, its ligand, N-acetyl-5-methoxytryptamine, bears no obvious relationship to cis-retinal and K296 is lacking, making an immediate parent gene relationship problematic.
Another clue to the origin of opsins might be provided by examining GPCR intron positions and phases to see if they are shared with ancient introns in opsins. Many non-olfactory GPCR with sequence similarity to opsins have no introns or just one, suggesting the genes duplicated by retroprocessing, perhaps acquiring an intron at an unrelated position later. UROPS2 has an intron but it does not seem to correspond to one in any opsin. Cnidarian opsins are either intronless (Nematostella) or undetermined (just known from processed transcripts).
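As a rough aid to such comparisons, intron positions and phases can be pulled out of the exon-annotated records used on this page, where peptide segments are separated by digit pairs such as `2 1` or `1 2`. The sketch below is a guess at that notation and not a documented parser: it assumes the first digit of each pair is the intron phase, that the leading digit and trailing `* 0` are terminal markers, and that positions are counted in residues of the unaligned protein. The function name `intron_sites` is hypothetical.

```python
import re


def intron_sites(annotated):
    """Return [(residues_before_intron, phase), ...] for one annotated record.

    Assumed notation: '0 EXON1 1 2 EXON2 2 1 EXON3* 0', where each digit pair
    between exons gives the phase at the exon end and at the next exon start.
    """
    # Drop the leading phase digit and the trailing stop marker / phase digit.
    body = re.sub(r"^\s*\d\s+", "", annotated)
    body = re.sub(r"\*?\s*\d?\s*$", "", body)
    # Split on phase pairs ('1 2', '2 1', or compact forms such as '00').
    parts = re.split(r"\s+([012])\s*([012])\s+", body)
    sites = []
    residues = 0
    # parts = [exon, end_phase, start_phase, exon, end_phase, start_phase, exon]
    for i in range(0, len(parts), 3):
        residues += len(re.sub(r"\s+", "", parts[i]))
        if i + 1 < len(parts):
            sites.append((residues, int(parts[i + 1])))
    return sites


# Toy records (not real genes) to show the output shape.
a = intron_sites("0 MAFVG 1 2 EKPLQRY 2 1 SKVL* 0")
b = intron_sites("0 MSFVG 1 2 EKPLQRYTT 0 0 SKVL* 0")
print(a)                 # [(5, 1), (12, 2)]
print(set(a) & set(b))   # introns at the same residue count and phase
```

A real comparison between an opsin and an outgroup GPCR would first have to map these residue counts onto a shared alignment coordinate, since raw protein positions differ between genes.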
Closeness in the GRAFS tree does not fully accord with closeness of blastp hit and relatedness of diagnostic regions, suggesting (unsurprisingly) that its topology is slightly wrong at some internal nodes. By average rank in blastp top scores (or by the average of the 5 best blast expectation values), as representatives of all opsin classes are aligned with the GPCR below, the highest scoring ones by far are the Trichoplax pre-opsins, followed by various peptide receptors:
Rank  Gene           Exp   Exons  Receptor      Ligand
4.2   UROPS2_triAd   e-29  2      orphan        histamine? (HRH2: best human non-opsin blast match)
5.4   UROPS1_triAd   e-28  1      orphan        peptide? (SSTR1: best human non-opsin blast match)
5.6   SSTR1_homSap   e-26  1      somatostatin  peptide
7.2   TACR2_homSap   e-25  5      tachykinin    peptide
8.1   GALR1_homSap   e-24  3      galanin       peptide
8.9   MTNR1A_homSap  e-23  2      melatonin     N-acetyl-5-methoxytryptamine
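The average-rank statistic in the table above can be illustrated with a short Python sketch. It is not the script behind the table: the function name `average_rank`, the penalty rule for a gene missing from a query's list (one past the end of that list), and the toy rankings are assumptions made only for illustration.

```python
from statistics import mean


def average_rank(ranked_hits):
    """Average the rank of each outgroup gene over all opsin queries.

    ranked_hits -- dict: opsin query -> list of non-opsin hits, best first
    A gene absent from a query's list is given rank len(list) + 1
    (an assumed penalty, so absent genes are not favored).
    Returns a list of (average_rank, gene), best first.
    """
    genes = {gene for hits in ranked_hits.values() for gene in hits}
    table = []
    for gene in genes:
        ranks = [
            hits.index(gene) + 1 if gene in hits else len(hits) + 1
            for hits in ranked_hits.values()
        ]
        table.append((mean(ranks), gene))
    return sorted(table)


# Hypothetical rankings for three opsin queries (not measured values).
toy = {
    "RHO1_homSap": ["UROPS2_triAd", "SSTR1_homSap", "TACR2_homSap"],
    "MEL1_homSap": ["UROPS2_triAd", "TACR2_homSap", "SSTR1_homSap"],
    "RGR_homSap": ["SSTR1_homSap", "UROPS2_triAd", "TACR2_homSap"],
}
for avg, gene in average_rank(toy):
    print(f"{avg:.1f}  {gene}")
```

Averaging expectation values instead of ranks would follow the same pattern, with the log of each e-value taking the place of the rank.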
The biological literature contains various scattered claims about 'opsins' in species such as Chlamydomonas (chlamyopsin Z48968), not to mention bacterial 'rhodopsins'. These have neither the seven transmembrane helices in the same arrangement as GPCR nor significant sequence homology, and may represent independent evolution of photobiology (just as bat and butterfly wings represent independent origins of flying).
Trichoplax has two very curious 7-transmembrane proteins that emerge as its best genomic match to opsin queries. While they lack K296 for a Schiff base, their best back-blast to all of GenBank returns almost entirely opsins rather than nesting within other GPCR families. While Trichoplax is 600+ million years removed from the common ancestor with eumetazoa, these genes could still offer clues about the immediate GPCR ancestor to opsins.
These Trichoplax genes retain uncanny similarities to opsins in otherwise rapidly changing regions. The two genes are not plausibly derived from an opsin expansion with subsequent loss of K296 because Trichoplax and other early diverging lineages lack opsins. Perhaps these genes should be considered opsins in spite of lacking K296. Recall that Schiff base formation dramatically redshifts the absorption spectrum, yet non-covalently bound retinal still has significant absorption at optical wavelengths, which might be further tuned by Trichoplax binding pocket residues.
Conversely, several cnidarian species exhibit far too many K296-type GPCR for their apparent photoreceptive needs and accompanying lack of overt photobiological anatomical specializations. These may represent divergent gene duplications of valid opsins that have evolved into some other type of GPCR; alternatively they could represent a lineage of pre-opsin GPCR that developed K296 but neither acquired an opsin-like light-sensing role nor served as parental gene to bona fide opsins.
Together the Trichoplax pre-opsins lacking K296 and putative cnidarian non-opsins possessing K296 push the opsin-defining envelope to its limits. Given the immense time span separating contemporary genes from ancestral, we can anticipate their computed nesting arrangement within the opsin gene tree relative to a close-in GPCR outgroup with known non-retinal ligands will lack convincing statistical support at the critical nodes. The best way forward is additional sequencing of cubomedusae, ctenophores and sponges because these seem to contain conventional opsins that clarify the positions of the outliers.
>GPR21_homSap Homo sapiens (human) orphan receptor 1 exon good match mollusc opsin IVFLTVLIISGNIIVIFVFHCAPLLNHHTTSYFIQTMAYADLFVGVSCVVPSLSLLHHPLPVEESLTCQIFGFVVSVLKSVSMASLACISIDRYIAITKPLTYNTLVTPWRLRLCIFLIWLYSTLVFLPSFFHWGKPGYHGDVFQWCAESWHTDSYFTLFIVMMLYAPAALIVCFTYFNIFRICQQHTKDISERQARFSSQSGETGEVQACPDKRYAMVLFRITSVFYILWLPYIIYFLLESSTGHSNRFASFLTTWLAISNSFCNCVIYSLSNSVFQR >GPR52_homSap Homo sapiens (human) orphan receptor immediate outgroup to opsins 1 exon IVLLTFLIIAGNLTVIFVFHCAPLLHHYTTSYFIQTMAYADLFVGVSCLVPTLSLLHYSTGVHESLTCQVFGYIISVLKSVSMACLACISVDRYLAITKPLSYNQLVTPCRLRICIILIWIYSCLIFLPSFFGWGKPGYHGDIFEWCATSWLTSAYFTGFIVCLLYAPAAFVVCFTYFHIFKICRQHTKEINDRRARFPSHEVDSSRETGHSPDRRYAMVLFRITSVFYMLWLPYIIYFLLESSRVLDNPTLSFLTTWLAISNSFCNCVIYSLSNSVFRL >MTNR1A_homSap Homo sapiens (human) melatonin receptor 2 exons circadian rhythm 2 exons LIFTIVVDILGNLLVILSVYRNKKLRNAGNIFVVSLAVADLVVAIYPYPLVLMSIFNNGWNLGYLHCQVSGFLMGLSVIGSIFNITGIAINRYCYICHSLKYDKLYSSKNSLCYVLLIWLLTLAAVLPNLRAGTLQYDPRIYSCTFAQSVSSAYTIAVVVFHFLVPMIIVIFCYLRIWILVLQVRQRVKPDRKPKLKPQDFRNFVTMFVVFVLFAICWAPLNFIGLAVASDPASMVPRIPEWLFVASYYMAYFNSCLNAIIYGLLNQNFRK >HRH2_homSap Homo sapiens (human) histamine receptor 2 exons LAVLILITVAGNVVVCLAVGLNRRLRNLTNCFIVSLAITDLLLGLLVLPFSAIYQLSCKWSFGKVFCNIYTSLDVMLCTASILNLFMISLDRYCAVMDPLRYPVLVTPVRVAISLVLIWVISITLSFLSIHLGWNSRNETSKGNHTTSKCKVQVNEVYGLVDGLVTFYLPLLIMCITYYRIFKVARDQAKRINHISSWKAATIREHKATVTLAAVMGAFIICWFPYFTAFVYRGLRGDDAINEVLEAIVLWLGYANSALNPILYAALNRDFRT >SSTR1_homSap Homo sapiens (human) somatostatin receptor SOG Rgamma class 1 exon YSVVCLVGLCGNSMVIYVILRYAKMKTATNIYILNLAIADELLMLSVPFLVTSTLLRHWPFGALLCRLVLSVDAVNMFTSIYCLTVLSVDRYVAVVHPIKAARYRRPTVAKVVNLGVWVLSLLVILPIVVFSRTAANSDGTVACNMLMPEPAQRWLVGFVLYTFLMGFLLPVGAICLCYVLIIAKMRMVALKAGWQQRKRSERKITLMVMMVVMVFVICWMPFYVVQLVNVFAEQDDATVSQLSVILGYANSCANPILYGFLSDNFKR >OPRL1_homSap Homo sapiens (human) opiate receptor-like 3 exons 
YLAVCVGGLLGNCLVMYVILRHTKMKTATNIYIFNLALADTLVLLTLPFQGTDILLGFWPFGNALCKTVIAIDYYNMFTSTFTLTAMSVDRYVAICHPIRALDVRTSSKAQAVNVAIWALASVVGVPVAIMGSAQVEDEEIECLVEIPTPQDYWGPVFAICIFLFSFIVPVLVISVCYSLMIRRLRGVRLLSGSREKDRNLRRITRLVLVVVAVFVGCWTPVQVFVLAQGLGVQPSSETAVAILRFCTALGYVNSCLNPILYAFLDENFKA >OPRM1_homSap Homo sapiens (human) opioid receptor mu 4 exons YSIVCVVGLFGNFLVMYVIVRYTKMKTATNIYIFNLALADALATSTLPFQSVNYLMGTWPFGTILCKIVISIDYYNMFTSIFTLCTMSVDRYIAVCHPVKALDFRTPRNAKIINVCNWILSSAIGLPVMFMATTKYRQGSIDCTLTFSHPTWYWENLLKICVFIFAFIMPVLIITVCYGLMILRLKSVRMLSGSKEKDRNLRRITRMVLVVVAVFIVCWTPIHIYVIIKALVTIPETTFQTVSWHFCIALGYTNSCLNPVLYAFLDENFKR >GALR1_homSap Homo sapiens (human) galanin receptor SOG Rgamma 3 exons FGLIFALGVLGNSLVITVLARSKPGKPRSTTNLFILNLSIADLAYLLFCIPFQATVYALPTWVLGAFICKFIHYFFTVSMLVSIFTLAAMSVDRYVAIVHSRRSSSLRVSRNALLGVGCIWALSIAMASPVAYHQGLFHPRASNQTFCWEQWPDPRHKKAYVVCTFVFGYLLPLLLICFCYAKVLNHLHKKLKNMSKKSEASKKKTAQTVLVVVVVFGISWLPHHIIHLWAEFGVFPLTPASFLFRITAHCLAYSNSSVNPIIYAFLSENFRK >CCR4_homSap Homo sapiens (human) chemokine (C-C motif) receptor 1 exon YSLVFVFGLLGNSVVVLVLFKYKRLRSMTDVYLLNLAISDLLFVFSLPFWGYYAADQWVFGLGLCKMISWMYLVGFYSGIFFVMLMSIDRYLAIVHAVFSLRARTLTYGVITSLATWSVAVFASLPGFLFSTCYTERNHTYCKTKYSLNSTTWKVLSSLEINILGLVIPLGIMLFCYSMIIRTLQHCKNEKKNKAVKMIFAVVVLFLGFWTPYNIVLFLETLVELEVLQDCTFERYLDYAIQATETLAFVHCCLNPIIYFFLGEKFRK >BDKRB2_homSap Homo sapiens (human) bradykinin receptor 2 exons LWVLFVLATLENIFVLSVFCLHKSSCTVAEIYLGNLAAADLILACGLPFWAITISNNFDWLFGETLCRVVNAIISMNLYSSICFLMLVSIDRYLALVKTMSMGRMRGVRWAKLYSLVIWGCTLLLSSPMLVFRTMKEYSDEGHNVTACVISYPSLIWEVFTNMLLNVVGFLLPLSVITFCTMQIMQVLRNNEMQKFKEIQTERRATVLVLVVLLLFIICWLPFQISTFLDTLHRLGILSSCQDERIIDVITQIASFMAYSNSCLNPLVYVIVGKRFRK >GPR17_homSap Homo sapiens (human) uracil/cys-leukotriene dual receptor 2 exons YLLDFILALVGNTLALWLFIRDHKSGTPANVFLMHLAVADLSCVLVLPTRLVYHFSGNHWPFGEIACRLTGFLFYLNMYASIYFLTCISADRFLAIVHPVKSLKLRRPLYAHLACAFLWVVVAVAMAPLLVSPQTVQTNHTVVCLQLYREKASHHALVSLAVAFTFPFITTVTCYLLIIRSLRQGLRVEKRLKTKAVRMIAIVLAIFLVCFVPYHVNRSVYVLHYRSHGASCATQRILALANRITSCLTSLNGALDPIMYFFVAEKFRH 
>CYSLTR1_homSap Homo sapiens (human) cys-leukotriene receptor 1 exon YSMISVVGFFGNGFVLYVLIKTYHKKSAFQVYMINLAVADLLCVCTLPLRVVYYVHKGIWLFGDFLCRLSTYALYVNLYCSIFFMTAMSFFRCIAIVFPVQNINLVTQKKARFVCVGIWIFVILTSSPFLMAKPQKDEKNNTKCFEPPQDNQTKNHVLVLHYVSLFVGFIIPFVIIIVCYTMIILTLLKKSMKKNLSSHKKAIGMIMVVTAAFLVSFMPYHIQRTIHLHFLHNETKPCDSVLRMQKSVVITLSLAASNCCFDPLLYFFSGGNFRK >P2RY8_homSap Homo sapiens (human) purinergic receptor AMIN Ralpha class 1 exon YSLVAAVSIPGNLFSLWVLCRRMGPRSPSVIFMINLSVTDLMLASVLPFQIYYHCNRHHWVFGVLLCNVVTVAFYANMYSSILTMTCISVERFLGVLYPLSSKRWRRRRYAVAACAGTWLLLLTALSPLARTDLTYPVHALGIITCFDVLKWTMLPSVAMWAVFLFTIFILLFLIPFVITVACYTATILKLLRTEEAHGREQRRRAVGLAAVVLLAFVTCFAPNNFVLLAHIVSRLFYGKSYYHVYKLTLCLSCLNNCLDPFVYYFASREFQL >HCRTR1_homSap Homo sapiens (human) orexin receptor Rbeta class 7 exons YVAVFVVALVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKMLMVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFRE >TACR2_homSap Homo sapiens (human) tachykinin receptor Gq-coupled SOG Rgamma class 5 exons YLALVLVAVTGNAIVIWIILAHRRMRTVTNYFIVNLALADLCMAAFNAAFNFVYASHNIWYFGRAFCYFQNLFPITAMFVSIYSMTAIAADRYMAIVHPFQPRLSAPSTKAVIAGIWLVALALASPQCFYSTVTMDQGATKCVVAWPEDSGGKTLLLYHLVVIALIYFLPLAVMFVAYSVIGLTLWRRAVPGHQAHGANLRHLQAMKKFVKTMVLVVLTFAICWLPYHLYFILGSFQEDIYCHKFIQQVYLALFWLAMSSTMYNPIIYCCLNHRFRS >NMUR2_homSap Homo sapiens (human) neuromedin U receptor 4 exons YVPIFVVGVIGNVLVCLVILQHQAMKTPTNYYLFSLAVSDLLVLLLGMPLEVYEMWRNYPFLFGPVGCYFKTALFETVCFASILSITTVSVERYVAILHPFRAKLQSTRRRALRILGIVWGFSVLFSLPNTSIHGIKFHYFPNGSLVPGSATCTVIKPMWIYNFIIQVTSFLFYLLPMTVISVLYYLMALRLKKDKSLEADEGNANIQRPCRKSVNKMLFVLVLVFAICWAPFHIDRLFFSFVEEWSESLAAVFNLVHVVSGVFFYLSSAVNPIIYNLLSRRFQA >QRFPR_homSap Homo sapiens (human) pyroglutamylated RFamide peptide receptor MEC Ralpha class 6 exons 
GVLIFALALFGNALVFYVVTRSKAMRTVTNIFICSLALSDLLITFFCIPVTMLQNISDNWLGGAFICKMVPFVQSTAVVTEILTMTCIAVERHQGLVHPFKMKWQYTNRRAFTMLGVVWLVAVIVGSPMWHVQQLEIKYDFLYEKEHICCLEEWTSPVHQKIYTTFILVILFLLPLMVMLILYSKIGYELWIKKRVGDGSVLRTIHGKEMSKIARKKKRAVIMMVTVVALFAVCWAPFHVVHMMIEYSNFEKEYDDVTIKMIFAIVQIIGFSNSICNPIVYAFMNENFKK
>GPR19_homSap Homo sapiens (human) orphan receptor 19 1 exon
FGILWLFSIFGNSLVCLVIHRSRRTQSTTNYFVVSMACADLLISVASTPFVLLQFTTGRWTLGSATCKVVRYFQYLTPGVQIYVLLSICIDRFYTIVYPLSFKVSREKAKKMIAASWIFDAGFVTPVLFFYGSNWDSHCNYFLPSSWEGTAYTVIHFLVGFVIPSVLIILFYQKVIKYIWRIGTDGRTVRRTMNIVPRTKVKTIKMFLILNLLFLLSWLPFHVAQLWHPHEQDYKKSSLVFTAITWISFSSSASKPTLYSIYNANFRR
>PPYR1_homSap Homo sapiens (human) pancreatic polypeptide receptor MEC Ralpha class 1 exon
YSIETVVGVLGNLCLMCVTVRQKEKANVTNLLIANLAFSDFLMCLLCQPLTAVYTIMDYWIFGETLCKMSAFIQCMSVTVSILSLVLVALERHQLIINPTGWKPSISQAYLGIVLIWVIACVLSLPFLANSILENVFHKNHSKALEFLADKVVCTESWPLAHHRTIYTTFLLLFQYCLPLGFILVCYARIYRRLQRQGRVFHKGTYSLRAGHMKQVNVVLVVMVVAFAVLWLPLHVFNSLEDWHHEAIPICHGNLIFLVCHLLAMASTCVNPFIYGFLNTNFKK
>NPY1R_homSap Homo sapiens (human) neuropeptide Y receptor Rbeta class 2 exons
YGAVIILGVSGNLALIIIILKQKEMRNVTNILIVNLSFSDLLVAIMCLPFTFVYTLMDHWVFGEAMCKLNPFVQCVSITVSIFSLVLIAVERHQLIINPRGWRPNNRHAYVGIAVIWVLAVASSLPFLIYQVMTDEPFQNVTLDAYKDKYVCFDQFPSDSHRLSYTTLLLVLQYFGPLCFIFICYFKIYIRLKRRNNMMDKMRDNKYRSSETKRINIMLLSIVVAFAVCWLPLTIFNTVFDWNHQIIATCNHNLLFLLCHLTAMISTCVNPIFYGFLNKNFQR
>PRLHR_homSap Homo sapiens (human) prolactin releasing hormone receptor 1 exon
YSVVVVVGLVGNCLLVLVIARVRRLHNVTNFLIGNLALSDVLMCTACVPLTLAYAFEPRGWVFGGGLCHLVFFLQPVTVYVSVFTLTTIAVDRYVVLVHPLRRRISLRLSAYAVLAIWALSAVLALPAAVHTYHVELKPHDVRLCEEFWGSQERQRQLYAWGLLLVTYLLPLLVILLSYVRVSVKLRNRVVPGCVTQSQADWDRARRRRTFCLLVVVVVVFAVCWLPLHVFNLLRDLDPHAIDPYAFGLVQLLCHWLAMSSACYNPFIYAWLHDSFRE
>GPR161_homSap Homo sapiens (human) orphan receptor 161 5 exons
IIVITIFVCLGNLVIVVTLYKKSYLLTLSNKFVFSLTLSNFLLSVLVLPFVVTSSIRREWIFGVVWCNFSALLYLLISSASMLTLGVIAIDRYYAVLYPMVYPMKITGNRAVMALVYIWLHSLIGCLPPLFGWSSVEFDEFKWMCVAAWHREPGYTAFWQIWCALFPFLVMLVCYGFIFRVARVKARKVHCGTVVIVEEDAQRTGRKNSSTSTSSSGSRRNAFQGVVYSANQCKALITILVVLGAFMVTWGPYMVVIASEALWGKSSVSPSLETWATWLSFASAVCHPLIYGLWNKTVRK
>ADRA1D_homSap Homo sapiens (human) alpha-1D-adrenergic receptor PUR Ralpha class 2 exons
LAAFILMAVAGNLLVILSVACNRHLQTVTNYFIVNLAVADLLLSATVLPFSATMEVLGFWAFGRAFCDVWAAVDVLCCTASILSLCTISVDRYVGVRHSLKYPAIMTERKAAAILALLWVVALVVSVGPLLGWKEPVPPDERFCGITEEAGYAVFSSVCSFYLPMAVIVVMYCRVYVVARSTTRSLEAGV VLRIHCRGAATGADGAHGMRSAKGHTFRSSLSVRLLKFSREKKAAKTLAIVVGVFVLCWFPFFFVLPLGSLFPQLKPSEGVFKVIFWLGYFNSCVNPLIYPCSSREFKR
>TRHR_homSap Homo sapiens (human) thyrotropin-releasing hormone receptor 2 exons
VLIICGLGIVGNIMVVLVVMRTKHMRTPTNCYLVSLAVADLMVLVAAGLPNITDSIYGSWVYGYVGCLCITYLQYLGINASSCSITAFTIERYIAICHPIKAQFLCTFSRAKKIIIFVWAFTSLYCMLWFFLLDLNISTYKDAIVISCGYKISRNYYSPIYLMDFGVFYVVPMILATVLYGFIARILFLNPIPSDPKENSKTWKNDSTHQNTNLNVNTSNRCFNSTVSSRKQVTKMLAVVVILFALLWMPYRTLVVVNSFLSSPFQENWFLLFCRICIYLNSAINPVIYNLMSQKFRA
>ADRB2_homSap Homo sapiens (human) beta2 adrenoceptor pdb:2R4R PUR Ralpha class 1 exon
MSLIVLAIVFGNVLVITAIAKFERLQTVTNYFITSLACADLVMGLAVVPFGAAHILMKMWTFGNFWCEFWTSIDVLCVTASIETLCVIAVDRYFAITSPFKYQSLLTKNKARVIILMVWIVSGLTSFLPIQMHWYRATHQEAINCYANETCCDFFTNQAYA IASSIVSFYVPLVIMVFVYSRVFQEAKRQLQKIDKSEGRFHVQNLSQVEQDGRTGHGLRRSSKFCLKEHKALKTLGIIMGTFTLCWLPFFIVNIVHVIQDNLIRKEVYILLNWIGYVNSGFNPLIYCRSPDFRI
>ADORA2A_homSap Homo sapiens (human) adenosine A2A receptor pdb:3EML PUR Ralpha class 2 exons
ELAIAVLAILGNVLVCWAVWLNSNLQNVTNYFVVSLAAADIAVGVLAIPFAITISTGFCAACHGCLFIACFVLVLTQSSIFSLLAIAIDRYIAIRIPLRYNGLVTGTRAKGIIAICWVLSFAIGLTPMLGWNNCGQPKEGKNHSQGCGEGQVACLFEDVVPMNYMVYFNFFAC VLVPLLLMLGVYLRIFLAARRQLKQMESQPLPGERARSTLQKEVHAAKSLAIIVGLFALCWLPLHIINCFTFFCPDCSHAPLWLMYLAIVLSHTNSVVNPFIYAYRIREFRQ
>ADRB1_melGal Meleagris gallopavo (turkey) beta1 adrenergic receptor pdb:2VT4 1 exon
MALVVLLIVAGNVLVIAAIGSTQRLQTLTNLFITSLACADLVVGLLVVPFGATLVVRGTWLWGSFLCELWTSLDVLCVTASIETLCVIAIDRYLAITSPFRYQSLMTRARAKVIICTVWAISALVSFLPIMMHWWRDEDPQALKCYQDPGCCDFVTNRAYAIASSIISFYIPLLIMIFVALRV YREAKEQIRKIDRASKRKRVMLMREHKALKTLGIIMGVFTLCWLPFFLVNIVNVFNRDLVPDWLFVAFNWLGYANSAMNPIIYCRSPDFRK
>UROPS1_triAdh Trichoplax adhaerens (trichoplax) XM_002114542 ends trimmed one exon
YSVLSLITILGNMLVFLTFYKHASLRTTSNLFIINLAITDLLTGGIKDTLFIYGLTSYNWPKSAILCSFVGFINCVCYVATVYTLTVIAVFRYLAIVCNLGRKIKRKHSILTIACVWIYSSACCLMPIIGWSRY IYEPTECTCVRSLDKKYYSYTIYILVADFLLPLSVVSFCYGNIFAKMKRNHHHIKRDLSDGQKLDVAEKLSRREEIVARRMFIILAEHVVCFLPYTIIVTMLAANGVDIEPVWYFIVGYLLNLNSALNPVTYIVVNPRLKK
>UROPS2_triAdh Trichoplax adhaerens (trichoplax) XM_002112401 ends trimmed single phase 21 intron
0 MTLFMLMSCIGNGAVLLVLRYHHDDIKSASNYFITNLALTDFLLGVLCMPCILISCLNGQWVFGQTLCSLTGFANSFFCINSMITLAAVSVEKYCAIASPLTY HHYMSKSKVTCVISIIWIHSAINASLPFLGWGEYVYLPFETICTVAWWSFPNYVGFIVGINFGLPTVIMSCTYFLILKIARKHSRRIGVST 2 1 RRIHYKTHIKATLMLLIVIGSFIVCWLPHLISMIYLTIYEISPLPCSFHQITTWLAMANSAFNPIIYGAMDTSIRK
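
Records in this set follow a loose FASTA-like pattern: a `>` marker, an identifier such as `TACR2_homSap`, free-text annotation, then the trimmed sequence. A minimal Python sketch for splitting such text into records is given here. It is not part of the original curation workflow, and it rests on two labeled assumptions: any whitespace-separated run of 20 or more uppercase letters is a sequence fragment, and bare digits sitting next to sequence fragments are intron-phase annotations that can be dropped. The function name `parse_records` and the `DEMO_triAdh` record are hypothetical, and the sample sequences are truncated for illustration.

```python
import re

# Assumption: a sequence fragment is a run of 20+ uppercase letters; shorter
# uppercase words in headers (e.g. "SOG", "PUR") count as description text.
FRAGMENT = re.compile(r"[A-Z]{20,}")


def parse_records(text):
    """Split '>ID description SEQUENCE' text into a list of dicts."""
    records = []
    for chunk in text.split(">")[1:]:
        tokens = chunk.split()
        if not tokens:
            continue
        # Walk back from the end over sequence fragments and bare digits
        # (assumed intron-phase marks); never consume the identifier itself.
        start = len(tokens)
        while start > 1 and (FRAGMENT.fullmatch(tokens[start - 1])
                             or tokens[start - 1].isdigit()):
            start -= 1
        # Join the fragments, discarding the digit annotations.
        sequence = "".join(t for t in tokens[start:] if not t.isdigit())
        records.append({
            "id": tokens[0],
            "description": " ".join(tokens[1:start]),
            "sequence": sequence,
        })
    return records


# Truncated, illustrative input; DEMO_triAdh is a made-up record.
sample = (
    ">CCR4_homSap Homo sapiens (human) chemokine (C-C motif) receptor 1 exon "
    "YSLVFVFGLLGNSVVVLVLFKYKRLRSMTDVYLLNLAISDLL "
    ">DEMO_triAdh hypothetical record single intron 0 "
    "MTLFMLMSCIGNGAVLLVLRYHHDD 2 1 RRIHYKTHIKATLMLLIVIGSFIV"
)

for rec in parse_records(sample):
    print(rec["id"], len(rec["sequence"]), rec["description"])
```

Because the parser splits on any whitespace, it behaves the same whether a header and its sequence share a line or sit on separate lines. A header whose last word is a bare number would lose that number to the sequence side, so the heuristic should be checked against each header before use.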