SECIS binding proteins: KIAA0256 and SBP2
Tomemerald (talk | contribs) |
Revision as of 14:31, 2 January 2009
KIAA0256: mis-annotated and forgotten ancestral SECIS binding protein
KIAA0256 originally arose in a GenBank submission package from a large-scale mRNA sequencing project at the Kaluza Institute. While high quality, it skips over a highly conserved exon 8, VGFRCRGHSTSSERRQNLQKRPDNKHLSSSQSHRSDPNSESLYFE. This does not alter the downstream reading frame since the skipped exon resides in a series of consecutive phase 00 splices. NCBI staff then confounded the record by posting experimentally unsupported gene predictions -- all exon-skipping -- from various genome assemblies lacking significant transcript programs, mis-labelling them as mRNAs and thus entrenching the incomplete variant as the normal form.
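The frame argument above can be checked mechanically. The sketch below is illustrative only: it assumes the phase markers flanking each peptide in the sequence listings on this page pair up as 0/0, 1/2 or 2/1 at every splice, and it takes the exon 7 and exon 8 labels from this page, numbering their neighbours 6 and 9 by adjacency. The function name is hypothetical.

```python
# Exons as (start_marker, peptide, end_marker), copied from the human
# KIAA0256 listing on this page. Keys 6 and 9 are assumed labels for the
# neighbours of the exons this page calls 7 and 8.
EXONS = {
    6: (0, "HVESSMCA", 1),
    7: (2, "GGVNWSNVTCQATQKKPWMEKNQTFSRGGRQTEQRNNSQ", 0),
    8: (0, "VGFRCRGHSTSSERRQNLQKRPDNKHLSSSQSHRSDPNSESLYFE", 0),
    9: (0, "DEDGFQELNENGNAKDENIQQKLSSKV", 0),
}


def skip_preserves_frame(exons, n):
    """Return True if splicing exon n-1 directly to exon n+1 keeps the frame.

    A splice is frame-compatible when the end marker of the upstream exon
    and the start marker of the downstream exon sum to a multiple of 3
    (the 0/0, 1/2 and 2/1 pairings seen in the listings).
    """
    end_before = exons[n - 1][2]
    start_after = exons[n + 1][0]
    return (end_before + start_after) % 3 == 0


if __name__ == "__main__":
    for n in (7, 8):
        verdict = "in frame" if skip_preserves_frame(EXONS, n) else "frameshift"
        print(f"skipping exon {n}: {verdict}")
    print("exon 8 length:", len(EXONS[8][1]), "residues")
```

Under these assumptions, skipping exon 8 joins two phase 0 boundaries and stays in frame, whereas skipping exon 7 would not.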
Two subsequent papers (http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=86695) featuring the twice-published Figure 1B (http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pubmed&pubmedid=10637234) aligned full length SBP2 to an intermediate region of the exon-skipping gene model of KIAA0256 that by coincidence began with a methionine, namely residues 422-849 of the 1101 residue protein (which did include the motif-bearing residues 632-829 of exons 14-16). SwissProt provides the proper full length protein Q93073 without a supporting accession.
SBP2 exhibits a fusion of two exons (2 ELSWTPMGYVVRQTLSTEL 00 SAAPKNVTSMINLKTIASS...) relative to KIAA0256 within an indel-rich area of the protein. The intronation of KIAA0256 is the ancestral form. The fusion occurred prior to teleost fish divergence but is hard to date earlier. After consideration of anchoring patches of semi-conserved residues, the alignment of human paralogs in this region is:
SBP2 exons 5-8 (showing fusion) paired with KIAA0256 exons 5-9:

SBP2      2 ELSWTPMGYVVRQTLSTEL 0
KIAA0256  2 GGVNWSNVTCQATQKKPWMEKNQTFSRGGRQTEQRNNSQ 0

SBP2      0 SAAPKNVTSMINLKTIASSADPKNVSIPSSEALSSDPSYNKEKHIIHPTQK 0
KIAA0256  0 VGFRCRGHSTSSERRQNLQKRPDNKHLSSSQSHRSDPNSESLYFE 0

SBP2      0 SKASQGSDLEQNEASRKNKKKKEKSTSKYEVLTVQEPPRIE 0
KIAA0256  0 DEDGFQELNENGNAKDENIQQKLSSKV 0

SBP2      0 DAEEFPNLAVASERRDRIETPKFQSKQQPQ 0
KIAA0256  0 LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ 0

SBP2      0 DNFKNNVKKSQLPVQLDLGGMLTALEKKQHSQHAKQSSKPVVVS 1
KIAA0256  0 EALSKAAGKKNKTPVQLDLGDMLAALEKQQQAMKARQITNTRPLSYT 1
Because of these early errors, early SECIS binding experiments used KIAA0256 protein lacking the immensely conserved exon 8. This variant unsurprisingly lacked relevant SECIS binding properties, which led to abandonment of further experimentation. Consequently we know nothing about the SECIS binding properties of full length KIAA0256 protein.
There is no reason to believe an odd fragment studied on a small subset of SECIS elements could accurately reflect the binding properties of the full length protein with regard to all 25 orthology classes of human SECIS elements. However these results remain accepted folklore within the selenocysteine research community even today.
An Irish family with an SBP2 compound mutation (paternal allele inactive, maternal allele a splice donor mutation leading to early truncation: K438stop/IVS8ds+29G/A) has been incorrectly described by these same authors as an SBP2 knockout; in fact 48% production of wildtype maternal allele still occurs. In addition, KIAA0256 may be able to partially compensate for reduction in level or loss of SECISBP2. Knockout mice for tRNA(Sec), unable to make any selenoproteins, die in utero.
While baboon also has experimental transcripts skipping this exon (FC145891, FC178616), mammalian transcripts almost always retain the exon, for example human (AK307480 but not BF055173, CN482709, BE930773, DW431473), macaque (CJ457866), mouse (AK145135), rat (CK602552), dog (CO708934), horse (CX604216), cow (CK846448), sheep (EE864720), and even chicken (DR417186), frogs, and fish. Thus inclusion of exon 8 is the ancestral state. Skipping is documented to date only in these two primates.
Transcriptional processing in mammals is error-rich, producing numerous defective mRNA variants that never amount to useful regulation or stable protein -- downstream quality controls quickly eliminate them. While exon-skipping in some cases may have adaptive significance, without significant comparative genomics support, the default hypothesis is they do not. Here it is difficult to distinguish between a weakened exon 8 splice acceptor in apes leading to a fraction of defective transcripts versus an innovative functional truncated form. These alternatives might be resolvable from 3D structural considerations -- deleting 45 conserved residues in an ancient exon is highly problematic.
KIAA0256 and SECISBP2 align only moderately well, but they do so over their entire lengths (http://genome-test.cse.ucsc.edu/cgi-bin/hgNear?near_search=uc004aqj.1&hgsid=1521383&near.do.affineAli=uc001zxd.1) and share 17 exactly comparable exons (trillion:1 odds for coincidence), meaning they reflect a segmental gene duplication (which can be dated to post-amphioxus, pre-chondrichthyes). It is imperative to enforce exon boundaries to achieve true homological alignment of two proteins this diverged and so gappy N-terminally; structure-based alignment has different rules (allowing convergent evolution) and different goals.
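Comparing exon boundaries between paralogs can be sketched as follows. This is a minimal illustration, assuming the flat "phase PEPTIDE phase" notation used in the sequence listings on this page; the function names are hypothetical, and only the first two exons of each human gene are included to keep the example short.

```python
def parse_exons(annotated):
    """Split 'phase PEPTIDE phase phase PEPTIDE phase ...' into tuples.

    Returns a list of (start_marker, peptide, end_marker), one per exon.
    """
    tokens = annotated.split()
    if len(tokens) % 3 != 0:
        raise ValueError("expected repeating (phase, peptide, phase) triples")
    exons = []
    for i in range(0, len(tokens), 3):
        start, peptide, end = tokens[i:i + 3]
        exons.append((int(start), peptide, int(end)))
    return exons


def phase_signature(exons):
    """Reduce each exon to its (start_marker, end_marker) pair."""
    return [(start, end) for start, _peptide, end in exons]


# First two exons of each human protein, copied from the listings on this page.
SBP2_START = ("0 MASEGPREPESE 0 "
              "0 GIKLSADVKPFVPRFAGLNVAWLESSEACVFPSSAATYYPFVQEPPVTE 2")
KIAA0256_START = ("0 MDRAPTEQ 0 "
                  "0 NVKLSAEVEPFIPQKKSPDTFMIPMALPNDNGSVSGVEPTPIPSYLITCYPFVQENQSNR 2")

if __name__ == "__main__":
    sbp2 = phase_signature(parse_exons(SBP2_START))
    kiaa = phase_signature(parse_exons(KIAA0256_START))
    print("SBP2    :", sbp2)
    print("KIAA0256:", kiaa)
    print("boundaries agree:", sbp2 == kiaa)
```

The peptides differ in length and sequence, yet the phase markers agree exon by exon. An exon-aware alignment would extend the same comparison across all exons and align peptides only within matched exons.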
The teleost fish Pimephales promelas has sufficient transcript coverage to allow recovery of an accurate full length KIAA0256 gene with a respectable 62% identity to human. No fish has sufficient transcripts to recover full length SBP2 as of December 2008. Some initial exons are quite well conserved over this billion years of branch length, strong evidence that they retain an unknown function under strong selection. However the gaps in other early exons are incompatible with retention of tertiary protein structure. No early Pfam domain can be found.
We have to wonder how sea urchin, which has a full length apparent ortholog of KIAA0256 on Scaffold18963 but nothing clustering to SBP2, can insert selenocysteine into its numerous selenoproteins (SEPHS1, SELU1, SELU2, SELM, SELO, SELW, SELN1, GPX3, GPX2, GPX4, GPX7,...). Unless a second copy has been lost, all SECIS interaction at the ribosome at sea urchin divergence appears to have been handled by KIAA0256.
The same can be said for amphioxus and tunicate. These species too have numerous selenoproteins yet their genome assemblies contain but a single homologous gene with vastly higher homology to KIAA0256. Lamprey genome lacks adequate coverage; elephant shark has fragments of both genes. It's difficult to extend orthologous annotation into protostomes and cnidaria because divergence is high even within the L7ae motif, though three long overlapping cDNAs from clam allow recovery of a long terminal fragment.
>SECISBP2_homSap Homo sapiens (human) full length 0 MASEGPREPESE 0 0 GIKLSADVKPFVPRFAGLNVAWLESSEACVFPSSAATYYPFVQEPPVTE 2 1 QKIYTEDMAFGASTFPPQYLSSEITLHPYAYSPYTLDSTQNVYSVPGSQYLYNQPSCYRGFQTVKHRNENTCPLPQEMKALFK 0 0 KKTYDEKKTYDQQKFDSERADGTISSEIKSARGSHHLSIYAENSLKS 1 2 DGYHKRTDRKSRIIAKNVSTSKPEFEFTTLDFPELQGAENNMSEIQKQPKWGPVHSVSTDISLLREVVKPAAVLSK 0 0 GEIVVKNNPNESVTANAATNSPSCTR 1 2 ELSWTPMGYVVRQTLSTELSAAPKNVTSMINLKTIASSADPKNVSIPSSEALSSDPSYNKEKHIIHPTQK 0 0 SKASQGSDLEQNEASRKNKKKKEKSTSKYEVLTVQEPPRIE 0 0 DAEEFPNLAVASERRDRIETPKFQSKQQPQ 0 0 DNFKNNVKKSQLPVQLDLGGMLTALEKKQHSQHAKQSSKPVVVS 1 2 VGAVPVLSKECASGERGRRMSQMKTPHNPLDSSAPLMKKGKQREIPKAKKPTSLKK 0 0 IILKERQERKQRLQENAVSPAFTSDDTQDGESGGDDQFPEQAELS 1 2 GPEGMDELISTPSVEDKSEEPPGTELQRDTEASHLAPNHTTFPKIHSRRFRD 2 1 YCSQMLSKEVDACVTDLLKELVRFQDRMYQKDPVKAKTKRRLVLGLREVLKHLKLKKLKCVIISPNCEKIQSK 1 2 GGLDDTLHTIIDYACEQNIPFVFALNRKALGRSLNKAVPVSVVGIFSYDGAQ 0 0 DQFHKMVELTVAARQAYKTMLENVQQELVGEPRPQAPPSLPTQGPSCPAEDGPPALKEKEEPHY 1 2 IEIWKKHLEAYSGCTLELEESLEASTSQMMNLNL* 0 407–525 domain required for U insertion but not SECIS binding (399–516 in rat) 540 R540Q allele of SBP2 decreases GPX1 and DIO2 650–752 L7Ae motif kink-turn binding motif 676 invariant glycine (669 in rat) >KIAA0256_homSap Homo sapiens (human) length=1101 0 MDRAPTEQ 0 0 NVKLSAEVEPFIPQKKSPDTFMIPMALPNDNGSVSGVEPTPIPSYLITCYPFVQENQSNR 2 1 QFPLYNNDIRWQQPNPNPTGPYFAYPIISAQPPVSTEYTYYQLMPAPCAQVMGFYHPFPTPYSNTFQAANTVNAITTECTERPSQLGQVFPLSSHRSRNSNRGSVVPK 0 0 QQLLQQHIKSKRPLVKNVATQKETNAAGPDSRSKIVLLVDASQQT 1 2 DFPSDIANKSLSETTATMLWKSKGRRRRASHPTAESSSEQGASEADIDSDSGYCSPKHSNNQPAAGALRNPDSGTMN 0 0 HVESSMCA 1 2 GGVNWSNVTCQATQKKPWMEKNQTFSRGGRQTEQRNNSQ 0 0 VGFRCRGHSTSSERRQNLQKRPDNKHLSSSQSHRSDPNSESLYFE 0 0 DEDGFQELNENGNAKDENIQQKLSSKV 0 0 LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ 0 0 EALSKAAGKKNKTPVQLDLGDMLAALEKQQQAMKARQITNTRPLSYT 1 2 VVTAASFHTKDSTNRKPLTKSQPCLTSFNSVDIASSKAKKGKEKEIAKLKRPTALKK 0 0 VILKEREEKKGRLTVDHNLLGSEEPTEMHLDFIDDLPQEIVSQE 1 2 DTGLSMPSDTSLSPASQNSPYCMTPVSQGSPASSGIGSPMASSTITKIHSKRFRE 2 1 
YCNQVLCKEIDECVTLLLQELVSFQERIYQKDPVRAKARRRLVMGLREVTKHMKLNKIKCVIISPNCEKIQSK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYFGAE 0 0 SLFNKLVELTEEARKAYKDMVAAMEQEQAEEALKNVKKVPHHMGHSRNPSAASAISFCSVISEPISEVNEKEY 1 2 ETNWRNMVETSDGLEASENEKEVSCKHSTSEKPSKLPFDTPPIGKQPSLVATGSTTSATSAGKSTASDKEEVKPDDLEWASQQSTETGSLDGSCRDLLNSSITSTTSTLVP GMLEEEEDEDEEEEEDYTHEPISVEVQLNSRIESWVSETQRTMETLQLGKTLNGSEEDNVEQSGEEEAEAPEVLEPGMDSEAWTADQQASPGQQKSSNCSSLNKEHSDSNYTTQTT* 0 phosphoserines predicted at SwissProt; no counterparts in SECISBP2 exon 8 skipped in RefSeq KIAA0256 exon 11 is very conserved KIAA0256 exon 8 conservation suggests functionality: VGFRCRGHSTSSERRQNLQKRPDNKHLSSSQSHRSDPNSESLYFE Homo sapiens VGFRCRGHSTSSERRQNLQKRPDNKHLSSSQSHRSDPNSESLYFE Macaca fascicularis VGFRCRGHSTSSERRQNLQKRQDNKQLNPSQSHRSDSNSESLYFE Tupaia belangeri VGFRCRGHSTSSERRQNLQKRQDNKHLNSTQSHRSDPNSESLYFE Mus musculus VGFRCRGHSTSSERRQNLQKRQDNKHLNSTQSHRSDPNSESLYFE Rattus norvegicus VGFRCRGHSTSSERRQNLPKRQDNNKQLNASQSHRGDSNSESLYFE Canis familiaris VGFRCRGHSTSSERRQNLQKRQDNKQLNPSQSHRGNPNSESLYFE Equus caballus VGFKCRGHSTSSERRQNLQKRQDNKQLNPNQSHRSDPNSESLYFE Myotis lucifugus VGFRCRGHSTSSERRQNLQKRQDNKQLNPSQSHRGDPNSESLYFE Bos taurus VGFRCRGHSTSSERRQNLQKRQDNKQLNPSQSHRGDPNSESLYFE Ovis aries VGFRCRGHSTSSERRQNLQKKQDNKQLNSSQSHRGDPNSESLYFE Dasypus novemcinctus VGFRCRGHSTSSERRQNLQKRQDNKQLNPIQSQRGDPNSESLYFE Loxodonta africana VGFRCRGHSTSSERRQSLQKRQDNKPL-GNHSHRVETSSDPLYFE Monodelphis domestica SGFRCRGHSTSSERRQNLQKRHE-KPLTTSQSSRAEQSPEPLYFE Gallus gallus PAFRCRGHSTSSERRQNLQKKPE-KPVSSSQSSKREQSPGSLYFE Anolis carolinensis LGYRLRGQSTSSERRHNLQRKQDNKTGTPASSNKSGQSPDHLYFE Xenopus tropicali KIAA0256 exon 11 conservation (weak in SBP2): LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Homo sapiens LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Pongo abelii LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Macaca mulatta LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Tarsius syrichta 
LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Otolemur garnettii LDDLPEiSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Tupaia belangeri LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Mus musculus LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Rattus norvegicus LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Dipodomys ordii LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Spermophilus tridecemlineatus LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Cavia porcellus LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Oryctolagus cuniculus LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Ochotona princeps LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Felis catus LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Canis familiaris LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Equus caballus LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Myotis lucifugus LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Pteropus vampyrus LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Tursiops truncatus LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Lama pacos LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Sorex araneus LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Echinops telfairi LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKiQ Monodelphis domestica LagLPENSPIsIVQTPIPITaSVPKRAKSQKKKALAAALATAQEYSEISMEQrKLQ Ornithorhynchus anatinus LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Gallus gallus LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Taeniopygia guttata LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQrKLQ Anolis carolinensis LNGLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ Xenopus tropicalis LDNLPENSPINIVQTPIPITTSVPKRAKSQRKKAMAAALATAQEYSEISMEQKKLQ Gasterosteus aculeatus LDNLPENSPISIVQTPIPITSSVPKRAKSQRKKALAAALATAQEYSEISMEQKKLQ Pimephales promelas 
LPGSQEPLNPATVVSTPVEVKKEGKNARKKRKKALLAAKAAAEEYSEITQVISENQ Branchiostoma floridae ....INSSAPYPSSAANLNEKSQAQKTKKRRKKAERAARAADEEYAEISKEQENIQ Ciona intestinalis ....NVQNQVYPPS..NSNEKAQAQKSKKRRKKAERAAKAADEEYAEISKEHENIQ Ciona savignyi DPSSIKPEELLSPANVMSTIKEG.KNARKRRKKAIMATQAAAKEYSEITEEQRQLH Strongylocentrotus purpuratus
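Conservation in listings like the exon 8 alignment above can be quantified with a simple ungapped percent identity. The sketch below is illustrative only: it uses the human and mouse exon 8 peptides as they appear on this page (equal length, so no gap handling is needed), and the helper names are hypothetical.

```python
def percent_identity(a, b):
    """Percent of positions with identical residues in two equal-length peptides."""
    if len(a) != len(b):
        raise ValueError("peptides must have equal length (no gap handling here)")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def differing_positions(a, b):
    """1-based positions where two equal-length peptides differ."""
    return [i for i, (x, y) in enumerate(zip(a, b), start=1) if x != y]


# KIAA0256 exon 8, copied from the conservation listing on this page.
HUMAN_EXON8 = "VGFRCRGHSTSSERRQNLQKRPDNKHLSSSQSHRSDPNSESLYFE"
MOUSE_EXON8 = "VGFRCRGHSTSSERRQNLQKRQDNKHLNSTQSHRSDPNSESLYFE"

if __name__ == "__main__":
    print(f"identity: {percent_identity(HUMAN_EXON8, MOUSE_EXON8):.1f}%")
    print("differences at:", differing_positions(HUMAN_EXON8, MOUSE_EXON8))
```

Sequences containing gap characters, such as the opossum and chicken rows, would need a gap-aware variant of this measure.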
Using the kink-turn binding motifs of the two human proteins in turn as blastp queries against both collections of deuterostome KIAA0256 and SECISBP2 sequences establishes KIAA0256 as the slower evolving protein by a wide margin. This fits KIAA0256 retaining ancestral function and its gene duplicate SECISBP2 specializing via neofunctionalization.
Blastp score ratio KIAA0256/SECISBP2 (human query); a ratio > 1 indicates slower evolution of KIAA0256:

species   ratio   identity
galGal    1.41    72%
anoCar    1.35
xenTro    1.41    68%
danRer    1.44
tetNig    1.59
takRub    1.45    64%
gasAcu    1.60
oryLat    1.52
calMil    1.43    65%
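A short sketch of how such a score ratio might be computed and summarized. The `score_ratio` helper is hypothetical and simply divides one blastp score by the other; the underlying scores are not given on this page, so only the listed ratios are used as data.

```python
from statistics import mean


def score_ratio(kiaa0256_score, secisbp2_score):
    """Ratio of blastp scores; > 1 means the KIAA0256 ortholog scored higher."""
    if secisbp2_score <= 0:
        raise ValueError("SECISBP2 score must be positive")
    return kiaa0256_score / secisbp2_score


# Ratios as listed on this page (human query, KIAA0256/SECISBP2).
RATIOS = {
    "galGal": 1.41, "anoCar": 1.35, "xenTro": 1.41,
    "danRer": 1.44, "tetNig": 1.59, "takRub": 1.45,
    "gasAcu": 1.60, "oryLat": 1.52, "calMil": 1.43,
}

if __name__ == "__main__":
    lowest = min(RATIOS, key=RATIOS.get)
    print("all ratios > 1:", all(r > 1 for r in RATIOS.values()))
    print(f"lowest: {lowest} ({RATIOS[lowest]:.2f})")
    print(f"mean ratio: {mean(RATIOS.values()):.2f}")
```

Every listed species has a ratio above 1, which is the pattern the text interprets as slower evolution of KIAA0256.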
Both proteins bristle with potential NxT/S glycosylation sites, 13 for KIAA0256 and 6 for SECISBP2, with implications for cellular localization. These do not lie in homologous positions, unsurprisingly in view of the deep divergence of these genes and the volatility of glycosylation sites in other gene families. These sites are conserved only to moderate depth (and that could be for reasons unrelated to glycosylation). Hence glycosylation sites do not provide reliable anchors in regions of poor sequence conservation. SwissProt predicts phosphoserine sites in exon 5 (of unknown functionality); those too have only moderate phylogenetic conservation.
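Potential NxT/S sites can be located with a short pattern scan. This is a minimal sketch, not the method used to obtain the counts above: it assumes the common convention that the middle residue x is anything except proline, and the function name is hypothetical. The two peptides are the human and mouse KIAA0256 exon 7 sequences listed on this page.

```python
import re

# N, then any residue except P (assumed convention), then S or T.
# The lookahead lets overlapping candidates such as "NNS" be examined.
SEQUON = re.compile(r"(?=N[^P][ST])")


def sequon_positions(peptide):
    """Return 1-based positions of the N in each potential N-x-S/T site."""
    return [m.start() + 1 for m in SEQUON.finditer(peptide)]


HUMAN_EXON7 = "GGVNWSNVTCQATQKKPWMEKNQTFSRGGRQTEQRNNSQ"
MOUSE_EXON7 = "GGVNWPKVTCQATQKRPWMEKNQAFSRGGRQTEQRNNLQ"

if __name__ == "__main__":
    print("human:", sequon_positions(HUMAN_EXON7))
    print("mouse:", sequon_positions(MOUSE_EXON7))
```

On these two peptides the scan finds four candidate sites in human and none in mouse, which illustrates how quickly such sites can turn over between species.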
Comparative genomics of 4 glycosylation sites in exon 7 of KIAA0256: GGVNWSNVTCQATQKKPWMEKNQTFSRGGRQTEQRNNSQ Homo sapiens (human) GGVNWSNVTCQATQKKPWMEKNQTFSRGGRQTEQRNNSQ Macaca mulatta (rhesus) GGVNWPKVTCQATQKRPWMEKNQAFSRGGRQTEQRNNLQ Mus musculus (mouse) GGVNWPKVTCQATQKRPWMEKNQAFSRGGRQTEQRNNSQ Rattus norvegicus (rat) GSVNWSNVTCQATQKKPWMEKNQTFSRGGRQTEQRNNSQ Canis familiaris (dog) GGVNWSNVTSQATQKKPWMEKNQTFSRGGRQAEQRNNSQ Sus scrofa (pig) GGVNWSNVTCQATQKKPWMEKNQTFSRGGRQTEQRNNSQ Equus caballus (horse) GGVNWSNVTCQGTQKKPWLEKNQTFSKGGRQMEQRNNSQ Dasypus novemcinctus (armadillo) GHVNWSNVTCQATQKKPWMEKHQTFSRGGRQTEQRNNAQ Loxodonta africana (elephant) GGASWSNVTSQATQKKPWMEKSQPFSRGGRQTEQRNNSQ Monodelphis domestica (opossum) .GVSWTNVNSQATQKKPWIEKTQTFIRGGRQAEQRNSSQ Gallus gallus (chicken) AGATWANVSSQATQKKPWMERTPAFSRGGRQAEQHNSSQ Anolis carolinensis (lizard) Potential for phosphoserine conservation in exon 5 of KIAA0256: DFPSDIANKSLSETTATMLWKSKGRRRRASHPTAESSSEQGASEADIDSDSGYCSPKHSNNQPAAGALRNPDSGTMN homSap .FPSDIANKSLSESTATMLWKAKGRRRRASHPAVESSSEQGASEADIDSDSGYCSPKH-NNQSAPGALRDPASGTMN musMus DFPSDIANKSLSESSATMLWKSKGRRRRASHPTAESSSEQGASEADIDSDSGYCSPKHSNNQPAAGALRNPDSSTMN canFam DFPSDIANKSLSESSSTMLWKSKGRRRRSSHPTAESSSEQGASEADIDSDSGYCSPKHSNNQATAMTSRNTDSGSIN monDom DFPLDIANKSLSESAATVLWKSKGRRRRASHPAAESSSEQGASEADIDSDSGYCSPKHGNNQAAGPAARSADSGPAN ornAna G insertion DFPSDIANKSLSESASTMLWKSKGRRRRASHPAAESSSEQGASEADIDSDSGYCSPKHGNNQAAAVTSRNADSCAMN galGal DFPSEIASKSLSESMSTMHWKPKTRRRRSSHP-AESSSEQGASEADIDSDSGYCSPKHS-NQAAAVTSRSVESAAGN anoCar DFPNEIANKTICESVGATPWKSKVRRRRLSHPAAESSSEQGASEADIDSDSGYCSPKHC--QAAAMCTRHADCGAV. xenTro DFPGEASGGVRCVSDQVSPQQWKNKPRRRRTSQQESSSEQGASEADIDSDSGYCSPKH--NQGAA............ danRer DFPGEVSGRCAAERASPQLWKNKTKRRRASHP-AENYSEQGASEADIDSDSGYCSPKH--NQAAGVTQR........ 
gasAcu DFPGEAAVRCVSDQASPQLWSNKARRRRTSQ--QESSSEQGVSEADIDSDSGYCSPKHSTNQPAAAV----DAGVM pimPro SGSG NQGANNT HT insertions DFPDDIADKSLRDKPSPLLRKSKARRLASRRPQDPSSTDSEEDEGGIDSDSGYSSPKHGRNQSA..............braFlo DFPEAIANKPLSDKTSNLTSRSKAKTRKKSQGNASSSSDSEVENTPHDSDSGYYSPLHAQQ................ strPur QTGRD insertion
Reference set of metazoan KIAA0256 full length sequences
It is very difficult to extract accurate full length genes from phylogenetically representative organisms in the case of KIAA0256. That's because the gene is twice average size (thus seldom tiled completely by transcripts), has two very short exons that do not emerge consistently from alignment methods, several consecutive poorly conserved exons rife with indels, and a run-on indeterminate carboxy terminus. Nearly every pipeline entry in GenBank non-redundant contains gross errors including gratuitous long internal repeats and severely truncated genes.
It will prove imperative to initiate massive cDNA programs in non-teleost species for this (and many other anomalous genes) for which homological modelling will never work. Tiled coverage will be necessary, not merely end-sequencing.
Why would a gene seemingly essential to making numerous essential selenoproteins evolve so erratically? The ribosome and SECIS elements with which it interacts are exceedingly conserved and its role must have been stable for over half a billion years. KIAA0256 has a dumbbell conservation structure, possibly suggesting a fusion of two proteins. Only the amino terminal region of the upstream partner was conserved, along with the SECIS and L7Ae motif of the downstream partner, with little selection on the run-on carboxy terminal tail (not uncommon in proteins).
Full length metazoan KIAA0256 sequences
>KIAA0256_homSap Homo sapiens (human) length=1101 0 MDRAPTEQ 0 0 NVKLSAEVEPFIPQKKSPDTFMIPMALPNDNGSVSGVEPTPIPSYLITCYPFVQENQSNR 2 1 QFPLYNNDIRWQQPNPNPTGPYFAYPIISAQPPVSTEYTYYQLMPAPCAQVMGFYHPFPTPYSNTFQAANTVNAITTECTERPSQLGQVFPLSSHRSRNSNRGSVVPK 0 0 QQLLQQHIKSKRPLVKNVATQKETNAAGPDSRSKIVLLVDASQQT 1 2 DFPSDIANKSLSETTATMLWKSKGRRRRASHPTAESSSEQGASEADIDSDSGYCSPKHSNNQPAAGALRNPDSGTMN 0 0 HVESSMCA 1 2 GGVNWSNVTCQATQKKPWMEKNQTFSRGGRQTEQRNNSQ 0 0 VGFRCRGHSTSSERRQNLQKRPDNKHLSSSQSHRSDPNSESLYFE 0 0 DEDGFQELNENGNAKDENIQQKLSSKV 0 0 LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ 0 0 EALSKAAGKKNKTPVQLDLGDMLAALEKQQQAMKARQITNTRPLSYT 1 2 VVTAASFHTKDSTNRKPLTKSQPCLTSFNSVDIASSKAKKGKEKEIAKLKRPTALKK 0 0 VILKEREEKKGRLTVDHNLLGSEEPTEMHLDFIDDLPQEIVSQE 1 2 DTGLSMPSDTSLSPASQNSPYCMTPVSQGSPASSGIGSPMASSTITKIHSKRFRE 2 1 YCNQVLCKEIDECVTLLLQELVSFQERIYQKDPVRAKARRRLVMGLREVTKHMKLNKIKCVIISPNCEKIQSK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYFGAE 0 0 SLFNKLVELTEEARKAYKDMVAAMEQEQAEEALKNVKKVPHHMGHSRNPSAASAISFCSVISEPISEVNEKEY 1 2 ETNWRNMVETSDGLEASENEKEVSCKHSTSEKPSKLPFDTPPIGKQPSLVATGSTTSATSAGKSTASDKEEVKPDDLEWASQQSTETGSLDGSCRDLLNSSITSTTSTLVP GMLEEEEDEDEEEEEDYTHEPISVEVQLNSRIESWVSETQRTMETLQLGKTLNGSEEDNVEQSGEEEAEAPEVLEPGMDSEAWTADQQASPGQQKSSNCSSLNKEHSDSNYTTQTT* 0 >KIAA0256_monDom Monodelphis domestica XM_001380435=flawed 0 MDRAAADQ 0 0 NVKLSAEVEPFVPQKKTPDTLMIPMALPGDSGSVSGVEPTPIPSYLITCYPFVQENQSNR 2 1 QFPLYNNDLRWQQPNPNPPGPYLAYPIISAQPPVSTEYTYYQLMPAPCAQVMGFYHPFPTPYSSTFPAANTLNTIPTECTDRPNQLGQVFPLSSHRSRSSNRGPIVQK 0 0 QQLLQQHVKTKRPPVKSVATQKETSAAGPDNRSKIVLLVDASQQT 1 2 DFPSDIANKSLSESSSTMLWKSKGRRRRSSHPTAESSSEQGASEADIDSDSGYCSPKHSNNQATAMTSRNTDSGSIN 0 0 LMEPSICS 1 2 GGASWSNVTSQATQKKPWMEKSQPFSRGGRQTEQRNNSQ 0 0 VGFRCRGHSTSSERRQSLQKRQDNKPLGNHSHRVETSSDPLYFE 0 0 DEDEFTELNETGSAKDENIQQKISAKV 0 0 LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKIQ 0 0 EALSKAAGKKSKTPVQLDLGDMLAALEKQQQAMKARQITNTRPLTYT 1 2 VVSAVPLQSKDSANRKSLTKSQPCLAPLNPLDTTSPKIKRGKEKEIAKLKRPTALKK 0 0 VILKEREEKKGRFTVDHSLLGSEEPIEMPLDFIDDLPQEIASQE 1 2 
DTGLSMPSDTSLSPASQNSPYCMTPVSQGSPASSGIGSPMASSAITKIHSKRFRE 2 1 YCNQVLCKEIDECVTLLLQELVSFQERIYQKDPVKAKARRRLVMGLREVTKHMKLNKIKCVIISPNCEKIQSK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYCGAE 0 0 SLFNKLVELTEEARKAYKDMVAAMEQEQAEEALKNVKKVPHHMGHSRNPSAASAISFCSVISEPISEVNEKEY 1 2 ETNWRNMVETSDGLETSENERDTSYKVISPETSNKVPNDKVLVNKQLPSVITGGTASTTNPGKCTVSDKEEVKPDDLEWASQQSTETGSLDGSCRDILNSSITSTTSTLVPGMLEE EEDEDDDEDEDYPHEPISVEVQLNSRIESWVSETQRTMETLQLGKTLNGAEEDNTEQSGEEEIEVPEQTDPVNDSEEWTADKQISNVQEKPNSCNSLNKEHSDSITT* 0 >KIAA0256_galGal Gallus gallus XM_413816=flawed 0 MDKADK 0 0 NVKLSAEVEPFIPQKKGPETLMIPMALPNDSGGINGVEPTPIPSYLITCYPFVQENQSNR 2 1 QFPLYNNDIRWQQPNPNPAGPYLAYPIISAQPPVSTEYTYYQLMPAPCAQVMGFYHPFPPPYSAPFQTANAVNTVTTECTERPNPPGQVFPLSTQRSRSSNRGPIIPK 0 0 QQQLQMHIKNKRPPVKNVATQKETSSSGPENRSKIVLLVDASQQT 1 2 DFPSDIANKSLSESASTMLWKSKGRRRRASHPAAESSSEQGASEADIDSDSGYCSPKHGNNQAAAVTSRNADSCAMN 0 0 VVEPSINA 1 2 TGVSWTNVNSQATQKKPWIEKTQTFIRGGRQAEQRNSSQ 0 0 SGFRCRGHSTSSERRQNLQKRHEKPLTTSQSSRAEQSPEPLYFE 0 0 DEDEFPELNSDNGNSKSSNIQQKISPKV 0 0 LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ 0 0 EALSKAAGKKSKTPVQLDLGDMLAALEKQQQAMKARQITNTRPLSYT 1 2 VGSAAPFHTKESANRKSLTKGQPSMGCLNPLDSTAPKVKRGKEREISKLKRPTALKK 0 0 IILKEREEKKGRLSVDHSLLGSDEQKQVHISLPTDQSQELASQE 1 2 ETGLSMPSDTSLSPASQNSPYCMTPVSQGSPASSGIGSPMASSAITKIHSKRFRE 2 1 YCNQVLSKEIDECVTLLLQELVSFQERIYQKDPMRAKARRRLVMGLREVTKHMKLNKIKCVIISPNCEKIQSK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYSGAE 0 0 DLFNKLVSLTEEARKAYRDMVAAMEQEQAEEALKNVKKAPHHMGHSRNPSAASAISFCSVISEPISEVNEKEY 1 2 ETNWRNMVETSDGLETSENERESSSQTAVPEKAANGQIAKSTLHKQPPLAATSTTSATNHGKATPGEKEEVKPDDNLEWASQQSTETGSLDGSCRDILNSSMISTTSTLVP GMLEEEDEEDEEDDEDYAHEPISVEVQLNSRIESWVSETQRTMETLQLGKTLSGAEEDNAEQSEEEEIETSEQVDPAVDSEEWTNDKHASNIQHKPTICGSLNKEHTDSIYMP* 0 >KIAA0256_taeGut zebrafinch 0 MDKSNKI 0 0 NVKLSAEVEPFIPQKKGPETLMIPMALPNDSGGINGMEPAPIPSYLITCYPFVQENQSNR 2 1 QFPLYNNDIRWQQPSPNPAGPYLAYPIISAQPPVSTEYTYYQLMPAPCAQVMGFYHPFPTPYPAPFQTANAVNTVTTECTERPSPSGQVFPLSTQRSRSSNRGPVIQK 0 0 
QQQLQMHIKSKRPPVKNVATQKETSSSGPENRSKIVLLVDASQQT 1 2 DFPSDIANKSLSESTSTMLWKSKGRRRRTSHPAAESSSEQGASEADIDSDSGYCSPKHGNNQAAAMASRNTDSCAMN 0 0 VVEPSINA 1 2 TGIGWTNVNSQATQKKPWIEKTLTFSRGGRQAEQRNNPQ 0 0 SGFRCRDHSTSSERMQSLQKREKPLAMSQASRTEQSPEPLYFE 0 0 DEDEFPELNDNGSSKSSSIQQKISPKV 0 0 LDDLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ 0 0 EALSKAAGKKSKTPVQLDLGDMLAALEKQQQAMKARQITNTRPLSYT 1 2 VGSAAPFHTKESASRKSITKGQPSMGCLNPLDSTAPKVKRGKEREIAKLKRPTALKK 0 0 IILKEREEKKGRLSADHSLLGSDEQKEAHLNLTADQSQELASQE 1 2 ETGLSMPSDTSLSPASQNSPYCMTPVSQGSPASSGIGSPMASSAITKIHSKRFRE 2 1 YCNQVLSKEIDECVTLLLQELVSFQERIYQKDPTRAKARRRLVMGLREVTKHMKLNKIKCVIISPNCEKIQSK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYSGAE 0 0 DLFNKLVSLTEEARKAYRDMVAAMEQEQAEEALKNVKKTPHHMGHSRNPSAASAISFCSVISEPISEVNEKEY 1 2 ETNWRNMVETSDGLETSENERESVCKAAVPEKAGNGQMEKTTLNKQQLATTGTTSATNHGKSTPGDKDEVKPDDLEWASQQSTETGSLDGSCRDLLNSSMTSTTSTLVP GMLEEEEEEEDDDDEDYAHEPISVEVQLNSRIESWVSETQRTMETLQLGKTLSGAEEDNAEQSEEEEMETSEQADPITDGEEWTNDKHASSTQHKPTICSSLNKEHTDSIYMP* 0 >KIAA0256_xenTro Xenopus tropicalis BC167330 0 MEMNEQ 0 0 NGKLSAEVEPFVPQKKGAEALAIPMALPSDGGSVGGLEPTPIPSYLITCYPFVQENQSNR 2 1 QFPSYNNDIRWQQSNSSPAGPYLAYPIISTQPPVSQDYMYYQLMPAPCAQVMGFYHPFPTPYTTPLQATNAVSVDCSERASQQSQINALTSQRNRNTRAPLIHK 0 0 PQPALPQPRCKRPPMKSVAIQKETCASSPETRSKIVLLVDACQQT 1 2 DFPNEIANKTICESVGATPWKSKVRRRRLSHPAAESSSEQGASEADIDSDSGYCSPKHCQAAAMCTRHADCGAVS 0 0 ISDPAVPA 1 2 AGGSWASVASQATQKRPWNEKGQTFSRGGRQTEIRNNAQ 0 0 LGYRLRGQSTSSERRHNLQRKQDNKTGTPASSNKSGQSPDHLYFE 0 0 DEDAFPELNSSNGARNDNAQTKIPTKV 0 0 LNGLPENSPINIVQTPIPITTSVPKRAKSQKKKALAAALATAQEYSEISMEQKKLQ 0 0 EALSKASGKKSKTPVQLDLGDMLAELERQQQAMKARQITNTRPLSYT 1 2 VGSAVPFHIKEHTNRNVFTKAQAVMGSPNPLDSTAPRVKRGKEKEVPKLKRPTALKK 0 0 IILKEREEKKGRLPVDPSVLGSEEQKDALSFADDQSEELASQE 1 2 EAGLSAPSDTSLSPASQNSPYCMTPVSQGSPASSGIGSPMATSTLTKIHSKRFRE 2 1 YCNQVLSKEIDECVTVLLQELVSFQERVYQKDPVKAKSKRRLVMGLREVTKHMKLNKIKCVIISPNCEKIQSK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFSYSGAE 0 0 SLFHNLVSLTEEARKAYKDMVSSMEQEQAEEALKNIKKVPHMGHSRNPSAASAISFCSVISEPISEVNEKDY 1 2 
ETNWRNMVETSDGLETSENEECSVTTTGSEQAASAPLVRNNTQKQEPKTASSTTSSATLEKPTPADKEEVKQDDNLEWASQQSTETGSWDGSGRDVLNSSMTSTASTLVP EMLEEDDDEEEDDDEYPQEPISVSRIESWVSETQRTMESLQLVNSNSPEEDNIEHSEEDEVGQCEQSEAADCKERTAEMHVRNGSHTQTGRKSSLKEKVNSTFM* 0 >KIAA0256_gasAcu Gasterosteus aculeatus (stickleback) 0 MDAGDIK 0 0 DVKLSAEVEPFIPQKKGMEGSQVSMSLSGEAGGGGSGGGSGGVETTPIPSYLITCYPFVQENQPNRY 2 1 QHPMYNGGELRWWQQPNPSPGGSYLAYPILSSPQPPVSNDYAYYQIMPAPCPPVMGFYQPFPGPYAGPVQAGVVNPVSAEVGERPLPLGPAYGMNSQRGRGMVRPNVPPN 0 0 QLGVCQPLRGRRPPTRSVAVQKEVCTLGPDGRTKTVMLVDAAQQT 1 2 DFPGEVSGRCAAERASPQLWKNKTKRRRASHPAENYSEQGASEADIDSDSGYCSPKHNQAAGVTQRSAENTAAPTV 0 0 AVETGVMT 1 2 AGTWVNVASQATQSWGDRNGHFHRADQRKNSEQRNFSQ 0 0 EFHTGYAGRGPPGLSHQRPQPAVVSGTQVSPHPLYFE 0 0 DEDEFPDLASGGAAQRCTKAESTSAQTHAQPKLPKNL 0 0 LDNLPENSPINIVQTPIPITTSVPKRAKSQRKKAMAAALATAQEYSEISMEQKKLQ 0 0 EAFTKAAGKKSKTSVELDLGDMLAALEKHQQAMKARQLNNTKPLSFT 1 2 VGTTAPFHGSGLVSLPSALKGHQQPYSVPHNSLDSTAPRIKRGKEREIPKVKRPTALKK 0 0 IILKEREGKKGKTSVEQESSGQEEHADESLHFTDDLAREPASQE 1 2 ETGLSMPSDASLSPASQNSPYSITPVSQGSPASSGIGSPMASNAITKIHSRRFRE 2 1 YCNQVLSKEIDESVTMLLQELVRFQERIYQKDPTKAKTKRRLVMGLREVTKHMKLNKIKCVLISPNCEKIQAK 1 2 GGLDEALYNVIAMARDQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYSGAE 0 0 GLFNRLVSLTEEARKAYKDMVSALEQEQAEEAQKNDKKLPHHMGHSRNHSAASAISFCSIFSEPISEVNEKEY 1 2 ETNWRSMVENSDALEPVESEPRRPAPPTSTPKVGEAAAATPPATSASTATPSSTAPQTARTAPPTLTQGNGERDEVRVDDRLELASQQSTETGSLDGSCRGPLNSSITSTTSTLVPGMLA EEEEEEDYTPEPIAVEVPTLSSRIEYWVSKTLENLQLGKSQESTEEEDEDEEEEEEEERGHSEEEEDLDSADIAETRSEDKDQVEVKKVQG* 0 >KIAA0256_pimPro Pimephales promelas tiled cDNAs 0 MDAGERK 0 0 DVKLSAEVEPFIPQKKGVEASLLPMSLCGEGGAEPTQIPSYLITCYPFVQENQSNSR 2 1 QLPMYNGGDQRWQQLNPSPGGPYLAYPILSSPQPPVTSDYATYYHAIMPTPCPPVMGFYQPFPGPFAGPVPAGVLNPVSDCSDRPTPQRGRGVPRTPVLH 0 0 KQPMAQPMRAKRPVMRSVAVQKEVCATGPDGRTKTVLLVDAAQQT 1 2 DFPGEASGSGAVRCVSDQASPQLWSNKARRRRTSQQESSSEQGVSEADIDSDSGYCSPKHNQGANNTSTNQHTPA 0 0 AAVDAGVM 1 2 TAVSWGNVSSQAVQKPWPDRNTPFFRGSRTPERSYTQDF 0 0 QMSFGCRAAGPRRSTPPETPNTHLTPEPLYFQ 0 0 DEDEFPDLATGGAAQRNKPDPVQPKLPKT 0 0 
LLDNLPENSPISIVQTPIPITSSVPKRAKSQRKKALAAALATAQEYSEISMEQKKLQ 0 0 EALSKAAGKKSRTPVQLDLGDMLAALEKQQQAMRARQLNNTKPLSYT 1 2 VGTVSSLHSKDCGSRVTGLKNTHTPPHNILDSSAPRIKRGKEREIPKVKKTTAMKK 0 0 IILQEREVKKGKSSADQGVSGADEQRDSLSFTDTLTQEQDENG 1 2 LSMPSDASLSPASQNSPYSITPVSQGSPASSGIGSPMAASAITKIHSRRFRE 2 1 YCNQVLSKDIDESVTLLLQELVRFQERVYQNEPSKAKAKRRLVMGLREVTKHMKLHKIKCVIISPNCEKIQAK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYSGAE 0 0 ALFNTLVSLTEEARRAYKEMVSALEQEQAEEALKNVKKVPHHMGHSRNPSAASAISFCSVISEPISEVNEKEY 1 2 ETNWRTMVENADAPEPPDSEPISRGNNRDQREVVSPPPQPTANQSLTPSPGVARAPDESRTDDRLEWASLSTETGSLDGSGRDRLNSSHHSTTSTLVPGMLEEE 0 >KIAA0256_calMil elephantfish fragments QDIQLSAEVEPFIPQKKGTETLVPMALPNDGNGSGVEAPPIPSYLITCYPFVQ ENQANRPVYNGDIRWQQANPNSPGPYLAYPILPTPQPPVSTDYAYYQLMPAPCTPMMGFY SPFPTPYTGTLPPASVVNAVSECSERP NPLDSTAPRVKRGKEKEIPKAKRPTALKKVKSFER YCNQVLSKDIDECVTLLLQELVRFQERVYQKDPIKAKMKRRLVMGLREVTKHMKLRKIKCVIISPNCEKIQSKG GGLDDALHNIISIACEQEIPFVFALNRKALGQCVNKPVPVSVLGIFSYDGAE FHQMVEITEEARKAYQEMLDALQQELEADEEKGDSEEQPLISSESSTIHFNNVTSQPFSEADEPEYGT DKEEGKTDDILEWASQQSTETGSLDGSCRDVLNSSMTSTTSTLVPDMLEEEEEEEEDDDDEDDEEEDYVHEPVSIAGTFSSRVDDWVSEAQKTLETLQLSKNIDSTEEDCDEQSDTEELDTVEQIDLTAESED >KIAA0256_petMar mediocre fragments LHKLRALIISPNCEKIQAKG 2 GGLDEALQTVIALASEQSVPFVFALNRKALGHCLNKKVPVSVVGVFHYGGAE 0 THFQRLVALTEEARSAYRNMVSSLQRQEAAATSEPTGHTEDPLEASA QVSCKHSRLPSALARTTPIPHPPQQLNTPPPPARRPQCPRELHYSLHSLALSLTAQPTSPHGRPPGKVPTVERCRRQRARVV >KIAA0256_braFlo Branchiostoma florida fragment with low support 1116 aa 0 0 0 VSQLSAEVEPFVPSALPLPTSDPSGGTQPHVLPRYVTSCYPFVQPPEVTP 2 1 EGYVQEVRWPSSVPNPQYNPYPPLSPQPHLPHYYPPHNTPPPPGPFLPHPSPYPPPLYAGYPPPPHLYPPPYGTRSPTQQ 0 0 TRRRSGSRSVQTKSIAVQKEASSNSPVHNRHRTIILLDASQQT 1 2 DFPDDIADKSLRDKPSPLLRKSKARRLASRRPQDPSSTDSEEDEGGIDSDSGYSSPKHGRNQSANSSTSEAATGTCI 0 0 1 2 0 0 0 0 0 0 LPGSQEPLNPATVVSTPVEVKKEGKNARKKRKKALLAAKAAAEEYSEITQVISENQ 0 0 EIQKKASGKKSKQPMQLDLGDMLAALEKRQQELKLKTAAGPAKTAAVSTGTVPVQ 1 2 DNKQWSGKKEASNVSMPHNPLDSHAPAVKRGKERETPHKKKPSALKK 0 0 VILKEREDKKMQKLMEDQAHSDAETGEVPSSTACYIPFSMEDSDGGTSQE 1 2 
GSELSPLSQAMSPINFSPLSSASPLSSGTGSPLCAPSPIGPKIHSRRFRE YCNQVLDKEIDATVTMLLQDLVRFQDRQYHK 0 0 DPIKAKAKRRIVMGLREVTKHLKLRKLKCIIIAPNLEKIQSK 1 2 GGLDDAIETILNLCMEQDVPFVFALGRKALGRAVNKLVPVSVVGVFNYDGAE 0 0 EHFKTMVELTTQARNAYIDMVTIYRQEWEQMQ 0 0 AMRNSGQPIYPAHLGHSRNPSAASAVSFSSVLSETISECHPE HDGDIEGPKIKVEAAKVTEESKLKGQEECTEQGAIKAEPTKESLNSETENNLKENSSESNSDRADVESESSEGPESVSRHSEIVEFPPAYDDVLTSSAAT TVVNGAESEVTDVVEEEGDVLNTSCSSSKLRVLDTGRIESWVVEASQCVEKLDLDPQQHAEEKIDPDQKKAESKVTSEQDSSKDLRADSKPDVDTRPGRNVSPTKQMDSSPAEEQNTANCDLSNLKQNVQLQSEGGEVSAEKK TIASKDEDSTAGQTGSLSEPADDVGKFNGTVSTEINDR* 0 >KIAA0256_cioInt Ciona intestinalis XM_002123197 FK199357 BW542841 FF776957 FF925374 BW008530 1128 aa 0 MFSPGSD 0 0 RTNLRAEVPPFVPRREWPEGSMEQHNGGPLPRYVTTCYPFVQDNQD 0 1 HPAQIGINMNQRMANQNMRNSYSSVNYLSNNPNPATQLTNANTQMGMVQQNFSAQLFSR 21 GNLADTAHITAVYGDSFQCQYPPNQTSMIVQKTSSLSR 0 0 SSSRGSGKKILKRNVGTQKEVSRRSPVSPEMVDSCQQT 1 2 DFPMSVACKSLTDHPSSLRRATKSRRRRETCSSQSGNCDSSSDHADADVDSDSGYYS 21 PKHRLGHKRNGGTSTNGLWSRNE 0 0 KDPVPQVIYITPTNVAPPVSLFQVHSSAHNHMFPNSLPPQVTPPPNSLLGYGHPGPPIILSPPPNQIL 12 ANRPTPPFPIHPSMIGNRNPNQ 0 0 CPSNNNWNINQLALPPGYWPNNSTHPPQHRHQTRNPSLDFRHQRNLKKFEFYGEQPYVSAAGSQFLQGHFDKRKHDKRKTVDEVPSRERSPVVMQTHEQPNTNNLHFHGDNSMIMLS 0 0 DTQEFPGLDGNFFSTSSSPSINAFSYSAAVMGKIPRPIAP 0 0 INSSAPYPSSAANLNEKSQAQKTKKRRKKAERAARAADEEYAEISKEQENIQ 0 0 KVLKKTASSRNKNKNQVLDLGEFLTSKFEEKKLLSDSPTKNTEVAKSWEEGHLVAKPPLDLNMK 2 1 IKPTGPPANALDSTAPLIKKGKEREVPKPKKPSALKK 0 0 VILKEREEKKEHHLKLKEREEKKEHHLKQLTTMLSPETDVPPYPPALYKIP 1 2 VPSDDEKSLGHDTNTEVSVSIPPTVPQIHSRRYRE 2 1 YCCQVLDKRVDEMSNQMLQRLVYFQDR 21 LYKTDPAKAKRKRRVVLGFREVTKHLKMKKLRCVIISPNLEKIESK 1 2 GGLDDVLHEILDLCKEQNIPYVFALGKKALGRAVSKTVPVSIVGVFDYSGAEAQ 0 0 FKQLVELVKEAQLQYKDMVQIYQKQVAEANKP 00 VQSAGPSKRYAYMTHSRNASATSHLSVTSIISEPISEMNE 1 2 GSNWRVIMDAGEDGLSPPPEDDVSEEEEVEESPGKEKPAPPVPVKGGEELSRKDSGSTVVECPHPELQPDDNFVSELPGEEESSSVDAED AEEMNLNHRRALDKSFSTCSTLKPEGGVSPRISTTSESSSLIPDDVSSQSSAQDRIQLWLEDATRSVVDLDLNDVVPDAEDVNSESKLVTPDVNESK* 0 0 
MTAMYYNAPSHQHQQQQHHHAPQPLHPHQHQQHHHQQTIPGMVPQPSPSQVVSGMLSEATAAMPGLKPPPPSQPQGGGGGGGGMQQYQTSSASAVATMNGKKVPLTELPRYITTCYPFVQDS 2 1 STGAAPATETWMGYPNSSQQPNQPHPQPQQHQHPPLPLPPTSQHPLSHQQPPQTTPMYAPPPPPPPGHQPPSAHLTQQQNQEYFPVHPGYN 0 0 QVPHQTPPPAAASPGGPLYQQGAYQQHGGTYQPHLTGTAPHHPTHHHHTQSPTPMPLASQSSMPA 1 2 GGVPVSHTPFAPPPMMTPPSQSPSPYPFVPPPPHGAATPGGYDAALPGTQPTLPSYGQYGYGAYPGPQVK 0 0 VRGQRPMNKDHRYPGGYQNKGREHYQAYVPPPTDLPKPKTKTVVFAEACAQT 1 2 DFPEAIANKPLSDKTSNLTSRSKAKTRKKSQGNQTGRDASSSSDSEVENTPHDSDSGYYSPLHAQQHNSTGLVSTYSTQTGKPTYSNVAMNNKSSPHQESR TVEQNTFTQNQPLVVPQGPPLGPQLGPAPVIQRGRFTPVQPGIPSFRPVMPMSYANMLTKPRAANPPPPPLANVGYPQRPPNVFPTQPPPTYRNMAVSPAPMLYQQQQQQQQRRMQSPVPAPQ 0 0 KPPVTPEDTPRKRKQKRTKGKKDGEVELEKPKMVNAATYAKPPQIQDKEEYPGLPLGSPAGNKFGMSTGGRPISYSSALQQRAPVQL 0 0 VNESSSEEEEEESGGDPSSIIKPEELLSPANVMSTIKEGKNARKRRKKAIMATQAAAK 0 0 EYSEITEEQRQLHENMKKQGKRTKMPIEFDLGDMLAALE 0 0 KQQQEIRAKQQQQQQLIQRGPVAPSRNVQFAPNVATMDPYSQSRPVKDVPR 0 0 GHNPLDMTAPVKRGKERELPAKKKPSALKRVILKEREEKKRLRTLEESRLSDD 2 1 DPVVSQGLSQGLSQGLSQGFSQGFSYGFSHGFSQGLSQD 1 2 APSDRGSFPGFNASQSDLSPLSQMSPLSMSPLSPGSPLSSGLSSPATGMGRSNPTQVATKIHSRRFRE 2 1 YCNQVLDKDIDGCCTTLLQTLVKFQDRQYHKDPAK 0 0 AKMKRRLVMGLREVTKHLKLKKIKCVVVSPNLERIQSK 1 2 GGLDEAMDRISSLASEQNVPLIFALGRKALGRAVNKVVPVSVVGIFNYDGAE 0 0 DTYKQLLDLSTRARNAYADMVRKFQQELEAANAASAARMAKHRHHMGHNRNLSGCSAISFSSVISEPISENYPNPEPEVDSQGREIEPDPPTTPTYSPQ GGGCSSDAGQQHPSAPMRSLSFTGTGSVISNSTDDTIHKEEKDGGGSSVGKDYVMSETSSRTLTAGEGDQDLEEGSKEDVGRVELEELEAGLVDQDHDEE EEDEEEEEDEDEDAEVIKANILLPEDGAPEKRVADWVAEAQQCIESLTVDDESGDDGGDAKKKGVGEKKDEKPSDANISPEQVGKMLTSLEV* 0 >Mytilus californianus ES395733 GE754305 to KIAA0256_homSap Identities = 156/269 (57%) 570 aa GSSAVGISYSAILQTVPVS 0 0 RPSTVERNKTSSSEDSPRKDNSSLEDKGTRASRRRRKRKDILNTAAEN 0 0 ELAEIGLEQQMLKEQCLKTQGQKSHKDEKGQTPGILKVNP 0 0 KAQNSGKKSKQNVSLDLGAVIDALEQKKTISLTSGARTEQKVKAEQPKNKEEQKSK 0 0 GSHNVLDASAPIKRGKERETPKAKKPSPLKKVILKEREEKKLLKMLEGT 2 1 ESGSTEAAVGIGVVSAESDLSQD 1 2 AMSTKSSIDYTGTPGSANLSPVSQTSPISMSPLSPGTSPLSSEVNSPIAGAVGKDVVKKIHSRRFRE 2 1 YCNQVLDKDIDECATTLLQDLVRFQDRMYHKDPSK 0 0 
AKLKRRLVLGLREVAKHLKLRKIKCDIISPNLEKIQSK 1 2 GGLDDALNNILTLCNEQNVPFVFALGRRALGRACAKMVPVSVVGIFNYSGSE 0 0 ENFKQLIDLTAKARESYGEMVAAIEIEIKEYPMKKQQPTIPHVFAHMGHSRTPSGASVLSFTSSILSEPISENYPHSEPETDSKGYEIVKDDALIKQGLPTDSSGYQTQMRI IHSNTKDDDGNEADNEEEGDRINRDYYRT* >Nematostella vectensis NZ_ABAV01022736 fragments 1 QGSTEQENQSVKKKKKRKKKKKPTETEGES 1 0 VFHNMLDSTAPVIKRGKEREVPKKKKPSALKR 0 0 IILKEREEKKKERENAEHEKTDDGDAS 1 1 YCDQVLDKELNTVTLKLLSELVRFQDRVYFKDPEK 0 0 AKAKRRYVVGLREVTKHLKLKKIKCVILSPNIEQIKSA 1 2 GGLDDALHNIISLAHTNRIPVVFSLRRQILGRAVCKKVPVSAVGIFNYDGAQ 0 0 DLFKNLMELTENGRKVYAERWNAAQEALREELDNEHPVISCNTEQGGP 1
SBP2 L7Ae motifs from 27 vertebrates
>SECISBP2_homSap Homo sapiens (human) 1 YCSQMLSKEVDACVTDLLKELVRFQDRMYQKDPVKAKTKRRLVLGLREVLKHLKLKKLKCVIISPNCEKIQSK 1 2 GGLDDTLHTIIDYACEQNIPFVFALNRKALGRSLNKAVPVSVVGIFSYDGAQ 0 0 DQFHKMVELTVAARQAYKTMLENVQQELVGEPRPQAPPSLPTQGPSCPAEDGPPALKEKEEPHY 1 >SECISBP2_panTro Pan troglodytes (chimp) 1 YCSQMLSKEVDACVTDLLKELVRFQDRMYQKDPVKAKTKRRLVLGLREVLKHLKLKKLKCVIISPNCEKIQSK 1 2 GGLDDTLHTIIDYACEQNIPFVFALNRKALGRSLNKAVPVSVVGIFSYDGAQ 0 0 DQFHKMVELTMAARQAYKTMLENVQQELVGEPRPQAPPSLPTQGPSCPAEDGPPALTEKEEPHY 1 >SECISBP2_macMul Macaca mulatta (rhesus) 1 YCSQMLSKEVDACVTDLLKELVRFQDRMYQKDPVKAKTKRRLVLGLREVLKHLKLKKLKCVIISPNCEKIQSK 1 2 GGLDDTLHTIIDYACEQNIPFVFALNRKALGRSLNKAVPVSVVGIFSYDGAQ 0 0 DQFHKMVELTMAARQAYKTMLENVQQELAGEPRPQAPPSPPTQGPSCPAEDGPPALTEKEEPHY 1 >SECISBP2_otoGar Otolemur garnettii (bushbaby) 1 YCSQMLSKEVDACVTDLLKELVRFQDRMYQKDPVKAKERRLVLGLREVLKHLKLKKLICVISPNCERQSK 1 2 GGLDDTLHTIIDYACEQNIPFVFALNRKALGRSLNKAVPVSVVGIFSYDGAQ 0 0 DQFHKMVELTMAARQAYKTMLENVQRELAGEPGPQVPSSLPMEGPSCSVEDSPPAPTEKEEPHY 1 >SECISBP2_tupBel Tupaia belangeri (treeShrew) 1 YCSQMLSKEVDACVTDLLKELVRFQDRMYQKDPVKAKTKRRVVLGLREVLKHLKLKKLKCVIISPIZEKIQSK 1 2 GGLDDTLHTIIAYACAQNIPFVFALNRKALGRSLNKAVPVSVVGIFSYDGAQ 0 0 DQFHKMVELTMEARQAYRSMLESARQELAGEPGLQAPPQPPVQGPRASSEGSAPAPTGRQEPHC 1 >SECISBP2_musMus Mus musculus (mouse) 1 YCSQMLSKEVDACVTGLLKELVRFQDRMYQKDPVKAKTKRRLVLGLREVLKHLKLRKLKCIIISPNCEKTQSK 1 2 GGLDDTLHTIIDCACEQNIPFVFALNRKALGRSLNKAVPVSIVGIFSYDGAQ 0 0 DQFHKMVELTMAARQAYKTMLETMRQEQAGEPGPQSPPSPPMQDPIPSTEEGTLPSTGEEPHY 1 >SECISBP2_ratNor Rattus norvegicus (rat) exons 1416 1 YCSQMLSKEVDACVTGLLKELVRFQDRMYQKDPVKAKTKRRLVLGLREVLKHLKLRKLKCIIISPNCEKTQSK 1 2 GGLDDTLHTIIDCACEQNIPFVFALNRKALGRSLNKAVPVSIVGIFSYDGAQDQ 0 0 FHKMVELTMAARQAYKTMLETMRQEQAGEPGPQTPPSPPMQDPIQSTDEGTLASTGEEPHY 1 >SECISBP2_cavPor Cavia porcellus (guineaPig) 1 YCSQMLSKEVDACVTDLLKELVRFQDRMYQKDPVKAKTKRRLVLGLREVLKHLKLKKLKCIIISP 1 2 GLDDTLHTIIDYACAQNIPFVFALNRKALGRSLNKTVPVSVVGIFSYDGAQ 0 0 DQFHKMVELTMAARQAYKTMLENVRQELAGEPRPQMPPDPPSEGPSSSLEDTAPDPSAEEPHY 1 >SECISBP2_oryCun Oryctolagus 
cuniculus (rabbit) 1 YCSQMLSKEVDACVTDLFKELVRFHDLMYQDPVKATTKCQFELRVGKALDHLRLKKLKCIIVFPKHKKQS 1 2 TIIDYACEQNIPFVFALNRKALGRSLNKAVPVSVVGIFSYDGAQ 0 0 DQFHKMVELTMAARQAYKTMLENMRHELAGEPGPPTPQPVQGPSCSAEDGPPAPTEGEVPHY 1 >SECISBP2_canFam Canis familiaris (dog) 1 YCSQMLSKEVDACVTDLLKELVRFQDRMYQKDPVKAKTKRRLVLGLREVLKHLKLRKLKCIIISPNCEKIQSK 1 2 GGLDDTLHTIIDYACEQNIPFVFALNRKALGRSLNKAVPVSVVGIFSYDGAQ 0 0 DQFHRMVELTMAARQAYKTMLENVRQELAGEPGTPALANPPMQGLGCSTQDSPPAPTEKEEPHY 1 >SECISBP2_felCat Felis catus (cat) 1 YCSQMLSKEVDACVTDLLRELVRFQDRMYQKDPVKAKTKRRLVLGLREVLKHLKLRKLKCIIISPNCEKIQSK 1 2 GGLDDTLHTIIGYACEQNIPFVFALNRKALGRSLNKAVPVSVVGIFSYDGAQ 0 0 DQFHRMVELTMAARQAYKTMLENARQELAGEPGPPAPGSPPPQPPAPAGRDEPRY >SECISBP2_equCab Equus caballus (horse) 1 YCSQILSKEVDACVTELLKELVRFQDRMYQKDPVKAKTKRRLVLGLREVLKHLKLRKLKCIIISPNCEKIQSK 1 2 GGLDDTLHTIIDYACEQNIPCVFALNRKALGRSLNKAVPVSVVGIFSYDGAQ 0 0 DQFHKMVELTKAARQAYKAMLENVHQELAGEPGPQAPASPPAQGPSCSTEGAPPAPTGKEEPHY 1 >SECISBP2_bosTau Bos taurus (cow) 1 YCSQMLSKEVDACVTDLLKELVRFQDRMYQKDPVKAKAKRRLVLGLREVLKHLKLRKLKCIIISPNCEKIQSK 1 2 GGLDDTLHTIIDYACDQNIPFVFALNRKALGRSLNKAVPVSVVGIFSYDGAQ 0 0 DQFHKMVELTMAARQAYRTMLENARQELPGELGPCAPVGPPSQGPGCPVEDSPLAPTEKEEPHY 1 >SECISBP2_eriEur Erinaceus europaeus (hedgehog) 1 YCSQMLSKEVDACVTDLLKELVRFQDRMYQKDPVKAKTKRRLVLGLREVLKHLKLKKLKCIIISPNCEKIQSK 1 2 GGLDETLHTIIDCACEQNIPFVFALNRKALGRSLNKGVPVSVVGIFSYDGAQ 0 0 DQFHKMVELTMAARQAYKALLENMRQELAEESGSPAPSSPPVQSPSEDGPPAPAEKEEPHY 1 >SECISBP2_dasNov Dasypus novemcinctus (armadillo) 1 YCSQVLSKEVDACVTDLLKELVRFQDRMYQKDPVKAKTKRRLVLGLREVLKHLKLKKLKCVIISPNCEKIQSK 1 2 GGELDDTLHTIIDYAASRHSICVALNRKALGRSLNKAVPVSVVGIFSYDGAQ 0 0 DQFHKMVELTMAARQAYKAMLENVRKELAGEPGPRSPPSPPALGPHSSAGDVHPTSAGKEEPHY 1 >SECISBP2_loxAfr Loxodonta africana (elephant) 1 YCSQMLSKEVDACVTDLLKELVRFQDRMYQKDPVKAKTKRRLVLGLREVLKHLKLKKLKCVIISPNCEKIQSK 1 2 GGLDDTLHTIIDYACEQNIPFVFALHRKALGRSLNKPVPVSVVGIFSYDRAQ 0 0 DQFHKMVELTMAARQEYKTMLESVRQELAEEPRAGSPPSPPTQGPGCSAEVPRPAPTEKEEPRY 1 >SECISBP2_monDom Monodelphis domestica (opossum) 1 
YCSQMLSKEVDDCVMDLLKELVRFQDRMYQKDPVKAKTKRRLVMGLREVLKHLKLKKLKCVIISPNCEKSKSK 1 2 GGLDETLHTIIDYACEQNVPFVFALNRKALGRSVNKVVPVSVVGIFSYDGAQ 0 0 DQFHKMIALTMEARQAYKIMLSTLKEEPALETENPPSPSLPRPSESCPSELGQTDPTQEEEPNY 1 >SECISBP2_triVul Trichosurus vulpecula (possum) 1 YCSQMLSKEVDDCVMDLLKELVRFQDRMYQKDPVKAKTKRRLVMGLREVLKHLKLKKLKCVIISPNCEKSKSK 1 2 GGLDETLHTIIDYACEQNVPFVFALNRKALGRSVNKVVPVSVVGIFSYDGAQ 0 0 DQFRKMIELTMEARQAYKVMLATLKEGAEALQTENPLPTSLTPQGQGCSSELSKTTDPTKEEEPNY 1 >SECISBP2_galGal Gallus gallus (chicken) 1 YCSQVLSKEVDSCVTDLLKELVRFQDRLYQKDPVKAKIKRRLVMGLREVLKHLRLKKLKCVIISPNCEKIQSK 1 2 GGLDETLHNIIDCACEQNIPFVFALNRKALGRCVNKAVPVSVVGIFSYDGAQ 0 0 DHFHRMVQLTTEARKAYKDMVAALEEELKELSKPLNZKSCLSETGKTSSTKEDIPNY 1 >SECISBP2_anoCar Anolis carolinensis (lizard) 1 YCTQVLSKEVDSCVTDLLKELVRFQDRLYQKDPVKAKTKRRLVMGLREVLKHLKLKKLKCVIISPNCEKIQSK 1 2 GGLDETLHLIIDSACEQNIPFVFALNRKALGRCLNKAVPVSVVGIFSYDGAQ 0 0 DYFHKMVELTMEARQAYKDMISALERELKKKTVRKKPLQSRPLDTVEASSTEEDVPDY 1 >SECISBP2_xenTro Xenopus tropicalis (frog) NM_001097262 1 YCSQVLSKDVDNCVMELLKELVRFQDRLFLKEPAKAKSKRRLVMGLREVLKHLKLQKLKCIIISPNCEKIQSK 1 2 GGLDDTLQTIISHACEQNVPFVFALNRKALGRCLNKAVPVSVVGVFSYDGAQ 0 0 DHFHKLCELTVQARQAYKDMIAAAQEQQSETEAGKNEEDPVAVNGQNKSDDMREESKAEEPDEPNY 1 >SECISBP2_danRer Danio rerio (zebrafish) 1 YCNQVLSKDVDECVSNLLKELVRFQDRLYQKDPMKARMKRRLVMGLREVLKHLKLKKVKCVIISPNCERIQSK 1 2 GGLDEALHNIIDTCRDQSVPFVFALSRKALGRCVNKAVPVSLVGIFNYDGAQ 0 0 DFYHKMIELSSEARTAYEVMLLNLEQTDAEEAQQTSPLAEKVETSSGDPQPEEPEY 1 >SECISBP2_tetNig Tetraodon nigroviridis (pufferfish) 1 YCNQVLSKEIDESVTLLLQELVRFQERVYQKDPTKAKSKRRLVMGLREVTKHMKLQTIKCVIISPNCEKIQAK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYSGAE 0 0 DFYHKMIELSSEARIAYEVMLSNLEQTSAEEEPQTCTLAEKINTSSEDAQPEEPEY 1 >SECISBP2_takRub Takifugu rubripes (fugu) 1 YCTQMLSKDVDECVTTLLKELVRFQDRLYQKDPIKARMKRRIVMGLREVQKHLKLRKLKCVIISPNCERIQSK 1 2 GGLDEALHTIIDTCREQAVPFVFALSRRALGRCVNKAVPVSLVGIFNYDGAQ 0 0 DFYHKMIELSSEARTAYEVMLLNLEQTDAEEAQQTSPLAEKVETSSGDPQPEEPEY 1 >SECISBP2_gasAcu Gasterosteus aculeatus (stickleback) 1 
YCNQVLSKEIDESVTMLLQELVRFQERIYQKDPTKAKTKRRLVMGLREVTKHMKLNKIKCVLISPNCEKIQAK 1 2 GGLDEALYNVIAMARDQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYSGAE 0 0 DFYHKMIELSSEARRAYEVMVSSLEQTGQADPESVEEKLQISSAAEEAELGRDITPPEEPEY 1 >SECISBP2_oryLap Oryzias latipes (medaka) 1 YCSQMLRKDVDECVTVLLKELVRFQDRLYHKDPIKARMKRRLVMGLREVLKHLKLRKVKCVIISPNCEQIQSK 1 2 GGLDEALHTIIQTCREQAVPFVFALSRKALGHCVNKAVPVSLVGIFNYDGAQ 0 0 DHYHKMIELSAEARKAYEVLVSSLERDQQEESHPDRGTCFGSVTAEPEKPHY 1 >SECISBP2_calMil Callorhinchus milii (elephantfish) AAVX01044988 1 YCSQVLSKDVDSCVTDLLKELVRFQDRLYQKDPIKAKKKRRIVMGLREVLKHLKLKRLKCIIISPNCEKIQSR 1 2 GGLDDALHNIISIACEQEIPFVFALNRKALGQCVNKPVPVSVLGIFSYDGAE 0 0 NQFHQMVEITEEARKAYQEMLDALQQELEADEEKGDSEEQPLISSESSTIHFNNVTSQPFSEADEPEY 1 >SECISBP2_braFlo Branchiostoma floridae (amphioxus) extra exon 1 YCNQVLDKEIDATVTMLLQDLVRFQDRQYHK 00 DPIKAKAKRRIVMGLREVTKHLKLRKLKCIIIAPNLEKIQSK 1 2 GGLDDAIETILNLCMEQDVPFVFALGRKALGRAVNKLVPVSVVGVFNYDGAE 0 0 1
KIAA0256 L7Ae motifs from 23 deuterostomes
>KIAA0256_homSap Homo sapiens (human) 1 YCNQVLCKEIDECVTLLLQELVSFQERIYQKDPVRAKARRRLVMGLREVTKHMKLNKIKCVIISPNCEKIQSK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYFGAE 0 0 SLFNKLVELTEEARKAYKDMVAAMEQEQAEEALKNVKKVPHHMGHSRNPSAASAISFCSVISEPISEVNEKEY 1 >KIAA0256_panTro Pan troglodytes (chimp) 1 YCNQVLCKEIDECVTLLLQELVSFQERIYQKDPVRAKARRRLVMGLREVTKHMKLNKIKCVIISPNCEKIQSK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYFGAE 0 0 SLFNKLVELTEEARKAYKDMVAAMEQEQAEEALKNVKKVPHHMGHSRNPSAASAISFCSVISEPISEVNEKEY 1 >KIAA0256_macMul Macaca mulatta (rhesus) 1 YCNQVLCKEIDECVTLLLQELVSFQERIYQKDPVRAKARRRLVMGLREVTKHMKLNKIKCVIISPNCEKIQSK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYFGAE 0 0 SLFNKLVELTEEARKAYKDMVAAMEQEQAEEALKNVKKVPHHMGHSRNPSAASAISFCSVISEPISEVNEKEY 1 >KIAA0256_tupBel Tupaia belangeri (treeShrew) 1 YCNQVLCKEIDECVTLLLQELVSFQERIYQKDPVRAKARRRLVMGLREVTKHMKLNKIKCVIISPNCEKIQSK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYFGAE 0 0 SLFNKLVELTEEARKAYKDMVAAMEQEQAEEALKNVKKVPHHMGHSRNPSAASAISFCSVISEPISEVNEKEY 1 >KIAA0256_musMus Mus musculus (mouse) 1 YCNQVLSKEIDECVTLLLQELVSFQERIYQKDPVRAKARRRLVMGLREVTKHMKLNKIKCVIISPNCEKIQSK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYFGAE 0 0 SLFNRLVELTEEARKAYKDMVAATEQEQAEEALRSVKTVPHHMGHSRNPSAASAISFCSVISEPISEVNEKEY 1 >KIAA0256_ratNor Rattus norvegicus (rat) 1 YCNQVLSKEIDECVTLLLQELVSFQERIYQKDPVRAKARRRLVMGLREVTKHMKLNKIKCVIISPNCEKIQSK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYFGAE 0 0 SLFNRLVELTEEARKAYKDMVAATEQEQAEEALRSVKAVPHHMGHSRNPSAASAISFCSVISEPISEVNEKEY 1 >KIAA0256_canFam Canis familiaris (dog) 1 YCNQVLCKEIDECVTLLLQELVSFQERIYQKDPVRAKARRRLVMGLREVTKHMKLNKIKCVIISPNCEKIQSK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYFGAE 0 0 SLFNKLVELTEEARKAYKDMVAAMEQEQAEEALKNVKKVPHHMGHSRNPSAASAISFCSVISEPISEVNEKEY 1 >KIAA0256_equCab Equus caballus (horse) 1 YCNQVLCKEIDECVTLLLQELVSFQERIYQKDPVRAKARRRLVMGLREVTKHMKLNKIKCVIISPNCEKIQSK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYFGAE 0 0 
SLFNKLVALTEEARRAYKDMVAALEQEQAEEASKNVKKGPHHMGHSRNPSAASAISFCSVISEPISEVNEKEY 1 >KIAA0256_dasNov Dasypus novemcinctus (armadillo) 1 YCNQVLCKEIDECVTLLLQELVSFQERIYQKDPVRAKARRRLVMGLREVTKHMKLNKIKCVIISPNCEKIQSKG 1 2 GLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYFGAE 0 0 SLFNKLVELTEEARKAYKDMVAAMEQEQAEEALKNVKKVPHHMGHSRNPSAASAISFCSVISEPISEVNEKEY 1 >KIAA0256_monDom Monodelphis domestica (opossum) 1 YCNQVLCKEIDECVTLLLQELVSFQERIYQKDPVKAKARRRLVMGLREVTKHMKLNKIKCVIISPNCEKIQSK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYCGAE 0 0 SLFNKLVELTEEARKAYKDMVAAMEQEQAEEALKNVKKVPHHMGHSRNPSAASAISFCSVISEPISEVNEKEY 1 >KIAA0256_galGal Gallus gallus (chicken) 1 YCNQVLSKEIDECVTLLLQELVSFQERIYQKDPMRAKARRRLVMGLREVTKHMKLNKIKCVIISPNCEKIQSK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYSGAE 0 0 DLFNKLVSLTEEARKAYRDMVAAMEQEQAEEALKNVKKAPHHMGHSRNPSAASAISFCSVISEPISEVNEKEY 1 >KIAA0256_anoCar Anolis carolinensis (lizard) 1 YCNQVLSKEIDECVTLLLQELVSFQEQIYQKDPMRAKAKRRLVMGLREVTKHMKLSKIKCVIISPNCEKIQSK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYSGAE 0 0 NLFNKLVSLTEEARKAYRDMVAAMEQEQEALKNVKKVPHHMGHSRNPSAASAISFCSVISEPISEVNEKEY 1 >KIAA0256_xenTro Xenopus tropicalis (frog) 1 YCNQVLSKEIDECVTVLLQELVSFQERVYQKDPVKAKSKRRLVMGLREVTKHMKLNKIKCVIISPNCEKIQSK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFSYSGAE 0 0 SLFHNLVSLTEEARKAYKDMVSSMEQEQAEEALKNIKKVHMGHSRNPSAASAISFCSVISEPISEVNEKDY 1 >KIAA0256_danRer Danio rerio (zebrafish) 1 YCNQVLSKEIDESVTLLLQELVRFQERVYQKEPSKAKAKRRLVMGLREVTKHMKLHKIKCVIISPNCEKIQAK 1 2 GGLDEALHNIIDTCRDQSVPFVFALSRKALGRCVNKAVPVSLVGIFNYDGAQ 0 0 GLFNKLVSLTEEARRAYKEMVSALEQEQAEEALKNVKKVPHHMGHSRNPSAASAISFCSVISEPISEVNEKEY 1 >KIAA0256_tetNig Tetraodon nigroviridis (pufferfish) 1 YCNQVLSKEIDESVTLLLQELVRFQERVYQKDPTKAKSKRRLVMGLREVTKHMKLQTIKCVIISPNCEKIQAK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYSGAE 0 0 SLFNQLVSLTEEARKAYKDMVSALEQEQTEEALKNEKKVPHQMGHYRNHSAASAVSFCSIFSEPISEVNEKEY 1 >KIAA0256_takRub Takifugu rubripes (fugu) 1 
YCNQVLSKEIDESVTLLLQELVRFQERVYQKDPTKAKSKRRLVMGLREVTKHMKLQTIKCVIISPNCEKIQAK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNFSGAE 0 0 SLFNQLVSLTEEARKAYKDMVSALEQEQTEEALKNEKKVPHQMGHYRNHSAASAVSFCSIFSEPISEVNEKEY 1 >KIAA0256_gasAcu Gasterosteus aculeatus (stickleback) 1 YCNQVLSKEIDESVTMLLQELVRFQERIYQKDPTKAKTKRRLVMGLREVTKHMKLNKIKCVLISPNCEKIQAK 1 2 GGLDEALYNVIAMARDQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYSGAE 0 0 GLFNRLVSLTEEARKAYKDMVSALEQEQAEEAQKNDKKLPHHMGHSRNHSAASAISFCSIFSEPISEVNEKEY 1 >KIAA0256_oryLap Oryzias latipes (medaka) 1 YCNQVLSKEIDESVTLLLQELVRFQERVYQKDPSKAKSKRRLVMGLREVTKHMKLHKIKCVIISPNCEKIQAK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYSGAE 0 0 GLFNQLVSLTEEARKAYKEMVSALEQEQAEEALKHDKKVPHHMGHSRNHSAASAISFCSILSEPISEVNEKEY 1 >KIAA0256_pimPro Pimephales promelas (minnow) based on transcript tiling; exons by homology; 62% identity 1 YCNQVLSKDIDESVTLLLQELVRFQERVYQNEPSKAKAKRRLVMGLREVTKHMKLHKIKCVIISPNCEKIQAK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYSGAE 0 0 ALFNTLVSLTEEARRAYKEMVSALEQEQAEEALKNVKKVPHHMGHSRNPSAASAISFCSVISEPISEVNEKEY 1 >KIAA0256_calMil Callorhinchus milii (elephantfish) AAVX01105236 1 YCNQVLSKDIDECVTLLLQELVRFQERVYQKDPIKAKMKRRLVMGLREVTKHMKLRKIKCVIISPNCEKIQSK 1 2 GGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYFGAE 0 0 1 >KIAA0256_petMar Petromyzon marinus (lamprey) 1 LHKLRALIISPNCEKIQAK 1 2 GGLDEALQTVIALASEQSVPFVFALNRKALGHCLNKKVPVSVVGVFHYGGAE 0 0 THFQRLVALTEEARSAYRNMVSSLQRQEAAATSEPTGHTEDPLEASASPPSVPAHDPTALLHLLRPQQGPREDDPAEASGRSPGRNA 1 >KIAA0256_cioInt Ciona intestinalis (tunicate) 1 YCCQVLDKRVDEMSNQMLQRLVYFQDR 21 RLYKTDPAKAKRKRRVVLGFREVTKHLKMKKLRCVIISPNLEKIESK 1 2 GGLDDVLHEILDLCKEQNIPYVFALGKKALGRAVSKTVPVSIVGVFDYSGAE 0 0 1 >KIAA0256_strPur Strongylocentrotus purpuratus (sea_urchin) 1 YCNQVLDKDIDGCCTTLLQTLVKFQDRQYHKDPAK 00 AKMKRRLVMGLREVTKHLKLKKIKCVVVSPNLERIQSK 1 2 GGLDEAMDRISSLASEQNVPLIFALGRKALGRAVNKVVPVSVVGIFNYDGAE 0 0 DTYKQLLDLSTRARNAYADMVRKFQQELEAANAASAARMAKHRHHMGHNRNLFKG 1
Ribosomal L30 L7Ae motifs from 10 deuterostomes
>L30_homSap Homo sapiens (human) 4 exons numerous pseudogenes 0 MVAAKKT 0 0 KKSLESINSRLQLVMKSGKYVLGYKQTLKMIRQGKAKLVILANNCPALR 2 1 KSEIEYYAMLAKTGVHHYSGNNIELGTACGKYYRVCTLAIIDP 1 2 GDSDIIRSMPEQTGEK* 0
>L30_tupBel Tupaia belangeri (treeShrew) 0 MVAAKKT 0 0 KKSLESINSQLQLAMKDGKYVLGYKQTLKMIRQGKAKLVILANNCPALR 2 1 KSEIEYYAMLAKTGVHHYSGNNIELGTACGKYYRACTLAIMDP 1 2 GDSDIIRSMPEQTGEK* 0
>L30_ratNor Rattus norvegicus (rat) Sep15 Gpx4 Gpx1 Dio1 quite weak homology 35% with BP2 exons 0 MVAAKKT 0 0 KKSLESINSRLQLVMKSGKYVLGYKQTLKMIRQGKAKLVILANNCPALR 2 1 KSEIEYYAMLAKTGVHHYSGNNIELGTACGKYYRVCTLAIIDP 1 2 GDSDIIRSMPEQTGEK* 0
>L30_myoLuc Myotis lucifugus (microbat) 0 MVAAKKT 0 0 KKSLESINSRLQLVMKSGKYLLGYKQTLKMIRQGKAKLVILANNCPALR 2 1 ISEIEYYAMLAKTGVHHYSGNNIELGTACGKYYRVCTLAIIDP 1 2 GDSD-IRSMPEQTGEK* 0
>L30_echTel Echinops telfairi (tenrec) 0 MVAAKKT 0 0 KNSLESINSRLQLVMKSGKYMLGYKQMLKMIRQGKAKLVVLANNCPALR 2 1 KSEIEYYAMLAKTGVHHYSGHNIELGTACGKSCRVCTLAITDP 1 2 GDADIIRSMPEQTGEK* 0
>L30_anoCar Anolis carolinensis (lizard) 0 MVAAKKT 0 0 KKSLESINSRLQLVMKSGKYVLGYKQTLKMIQQGKAKLVILANNCPALG 2 1 KSEIEYYAMLAKTGVHHYSGNNIEMGTACGKYYRVCTLAIIDP 1 2 GDSDIIRSMQEQTAEK* 0
>L30_danRer Danio rerio (zebrafish) 94% 0 MVAAKKT 0 0 KKSLESINSRLQLVMKSGKYVLGYKQSQKMIRQGKAKLVILANNCPALR 2 1 KSEIEYYAMLAKTGVHHYSGNNIELGTACGKYYRVCTLAIIDP 1 2 GDSDIIRSMPDQQQGGEK* 0
>L30_squAca Squalus acanthias (spiny dogfish) 97% 0 MVAAKKT 0 0 KKSLESINSRLQLVMKSGKYVLGYKQTLKMIRQGKAKLVILANNCPALR 2 1 KSEIEYYAMLAKTGVHHYSGNNIELGTACGKYYRVCTLAIIDP 1 2 GDSDIIRSMPEQISEK* 0
>L30_petMar Petromyzon marinus (lamprey) 94% 0 MSAKKT 0 0 KKAIESINSRLQLVMKSGKYCLGYRQTLKMIRQGKAKLVLLANNCPALR 2 1 KSEIEYYAMLAKTGVHHYSGNNIEMGTACGKYYRVCTLAIIDP 1 2 GDSDIIRSMPEQQQPQPGDK* 0
>L30_braFlo Branchiostoma floridae (amphioxus) 84% to homSap 0 MKQK 0 0 RKTMESINSRLQLVMKSGKYVLGLKETLKVLRQGKAKLIIIANNTPALR 2 1 KSEIEYYAMLAKTGVHHYSGNNIELGTACGKYFRVCTLAITDP 1 2 GDSDIIRSMPAEDKGESK* 0
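The sequence blocks on this page interleave exon peptides with small integers that appear to be the splice phases flanking each exon (the text above speaks of "phase 00 splices"). A minimal Python sketch for pulling such records apart might look like the following. It rests on several assumptions: the function name `parse_records` and the `EXON_TOKEN` pattern are hypothetical, an exon is taken to be any all-uppercase token (with `*` for stop and `-` for gap), and the numeric tokens are returned unmodified, so fused pairs such as `21` or `00` stay as written. It is not a validated parser for every record on the page.

```python
import re

# Heuristic (an assumption, not a documented format): an exon token is a run of
# uppercase residue letters that may also contain '*' (stop) or '-' (gap).
EXON_TOKEN = re.compile(r"^[A-Z][A-Z*-]*$")


def parse_records(text):
    """Split phase-annotated, FASTA-like text into one dict per record."""
    records = []
    for chunk in text.split(">")[1:]:
        tokens = chunk.split()
        if not tokens:
            continue
        # Positions of exon-like tokens, skipping the record name at index 0.
        exon_idx = [i for i, tok in enumerate(tokens[1:], start=1)
                    if EXON_TOKEN.match(tok)]
        # The body starts one token before the first exon, so that a leading
        # phase number is kept while digits in the description are ignored.
        start = max(exon_idx[0] - 1, 1) if exon_idx else len(tokens)
        body = tokens[start:]
        exons = [tok for tok in body if EXON_TOKEN.match(tok)]
        phases = [tok for tok in body if tok.isdigit()]
        # Join exons into one peptide, dropping stop and gap symbols.
        protein = "".join(exons).replace("*", "").replace("-", "")
        records.append({
            "name": tokens[0],
            "description": " ".join(tokens[1:start]),
            "exons": exons,
            "phases": phases,
            "protein": protein,
        })
    return records


# Two L30 records copied from the block above, left flattened on one line
# to show that the parser does not depend on line breaks.
SAMPLE = (
    ">L30_homSap Homo sapiens (human) 4 exons numerous pseudogenes "
    "0 MVAAKKT 0 0 KKSLESINSRLQLVMKSGKYVLGYKQTLKMIRQGKAKLVILANNCPALR 2 "
    "1 KSEIEYYAMLAKTGVHHYSGNNIELGTACGKYYRVCTLAIIDP 1 2 GDSDIIRSMPEQTGEK* 0 "
    ">L30_petMar Petromyzon marinus (lamprey) 94% "
    "0 MSAKKT 0 0 KKAIESINSRLQLVMKSGKYCLGYRQTLKMIRQGKAKLVLLANNCPALR 2 "
    "1 KSEIEYYAMLAKTGVHHYSGNNIEMGTACGKYYRVCTLAIIDP 1 2 GDSDIIRSMPEQQQPQPGDK* 0"
)

records = parse_records(SAMPLE)
for rec in records:
    print(rec["name"], len(rec["exons"]), "exons,", len(rec["protein"]), "residues")
```

Because the token heuristic relies on description words containing lowercase letters, digits, or punctuation, an all-uppercase word in a header would be misread as an exon; records with such headers would need manual checking.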