BLASTX 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= bmte10a06
         (752 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_Q5TVV1 Cluster: ENSANGP00000026760; n=2; Culicidae|Rep:...   190   2e-47
UniRef50_UPI00015B5B53 Cluster: PREDICTED: similar to ENSANGP000...   185   9e-46
UniRef50_Q9VNI7 Cluster: CG10999-PA, isoform A; n=3; Drosophila ...   184   2e-45
UniRef50_UPI0000D578E2 Cluster: PREDICTED: similar to CG10999-PA...   175   7e-43
UniRef50_A7SCJ1 Cluster: Predicted protein; n=2; Eumetazoa|Rep: ...   147   2e-34
UniRef50_UPI00006A0811 Cluster: Uncharacterized protein C14orf14...   146   7e-34
UniRef50_UPI0000611295 Cluster: Uncharacterized protein C14orf14...   134   3e-30
UniRef50_UPI0000503AD3 Cluster: RIKEN cDNA 2810002I04 gene; n=1;...   119   9e-26
UniRef50_Q5TFG8 Cluster: UPF0418 protein C6orf94; n=14; Theria|R...   119   9e-26
UniRef50_UPI0000E494DB Cluster: PREDICTED: similar to Chromosome...   115   1e-24
UniRef50_Q4QE35 Cluster: Putative uncharacterized protein; n=3; ...   114   2e-24
UniRef50_UPI0000ECC8F5 Cluster: UPF0418 protein C6orf94.; n=3; G...   113   3e-24
UniRef50_Q96GY0 Cluster: UPF0418 protein C8orf70; n=23; Euteleos...   112   8e-24
UniRef50_A7T301 Cluster: Predicted protein; n=2; Eumetazoa|Rep: ...   112   1e-23
UniRef50_A4H9B2 Cluster: Putative uncharacterized protein; n=1; ...   112   1e-23
UniRef50_Q5PPV5 Cluster: UPF0418 protein C8orf70 homolog; n=7; E...   111   2e-23
UniRef50_Q4CSD1 Cluster: Putative uncharacterized protein; n=2; ...   107   2e-22
UniRef50_A2DFG0 Cluster: Putative uncharacterized protein; n=1; ...   104   2e-21
UniRef50_A2DCT4 Cluster: Putative uncharacterized protein; n=1; ...    99   1e-19
UniRef50_A0BVD8 Cluster: Chromosome undetermined scaffold_13, wh...    90   5e-17
UniRef50_A0BQE5 Cluster: Chromosome undetermined scaffold_120, w...    90   5e-17
UniRef50_A2G025 Cluster: Zinc finger, C2H2 type family protein; ...    89   8e-17
UniRef50_Q22122 Cluster: UPF0418 protein T03G11.3; n=2; Caenorha...    89   1e-16
UniRef50_Q4DZW2 Cluster: Putative uncharacterized protein; n=2; ...    88   3e-16
UniRef50_Q22W47 Cluster: Zinc finger, C2H2 type family protein; ...    85   1e-15
UniRef50_A0D772 Cluster: Chromosome undetermined scaffold_4, who...    81   4e-14
UniRef50_A2DDQ6 Cluster: Putative uncharacterized protein; n=1; ...    80   5e-14
UniRef50_Q57XS3 Cluster: Putative uncharacterized protein; n=1; ...    80   7e-14
UniRef50_Q4T2E6 Cluster: Chromosome undetermined SCAF10284, whol...    79   1e-13
UniRef50_UPI0000E48303 Cluster: PREDICTED: hypothetical protein,...    78   2e-13
UniRef50_UPI0000587617 Cluster: PREDICTED: hypothetical protein;...    77   4e-13
UniRef50_Q4QJB7 Cluster: Putative uncharacterized protein; n=3; ...    77   6e-13
UniRef50_A0BPA2 Cluster: Chromosome undetermined scaffold_12, wh...    77   6e-13
UniRef50_UPI0000DB6EB3 Cluster: PREDICTED: similar to CG30460-PC...    75   1e-12
UniRef50_Q23G89 Cluster: Zinc finger, C2H2 type family protein; ...    74   3e-12
UniRef50_A0CCZ1 Cluster: Chromosome undetermined scaffold_169, w...    73   1e-11
UniRef50_A0E4A7 Cluster: Chromosome undetermined scaffold_78, wh...    72   2e-11
UniRef50_UPI0000D56C05 Cluster: PREDICTED: similar to CG10999-PA...    71   3e-11
UniRef50_Q22FZ1 Cluster: Putative uncharacterized protein; n=1; ...    68   2e-10
UniRef50_Q4Q229 Cluster: Putative uncharacterized protein; n=3; ...    66   9e-10
UniRef50_A0E7F3 Cluster: Chromosome undetermined scaffold_81, wh...    62   1e-08
UniRef50_A2F2X4 Cluster: Putative uncharacterized protein; n=2; ...    61   3e-08
UniRef50_Q4CZP9 Cluster: Putative uncharacterized protein; n=2; ...    60   6e-08
UniRef50_A0E456 Cluster: Chromosome undetermined scaffold_77, wh...    57   5e-07
UniRef50_Q38BB2 Cluster: Putative uncharacterized protein; n=3; ...    56   7e-07
UniRef50_Q22MG0 Cluster: Putative uncharacterized protein; n=1; ...    54   3e-06
UniRef50_Q24HM2 Cluster: Zinc finger, C2H2 type family protein; ...    53   7e-06
UniRef50_Q4T3R5 Cluster: Chromosome undetermined SCAF9936, whole...    51   3e-05
UniRef50_UPI0000F20570 Cluster: PREDICTED: hypothetical protein;...    49   1e-04
UniRef50_A0D6H1 Cluster: Chromosome undetermined scaffold_4, who...    48   2e-04
UniRef50_UPI00006CBD30 Cluster: Zinc finger, C2H2 type family pr...    46   8e-04
UniRef50_UPI0000E470FE Cluster: PREDICTED: hypothetical protein;...    44   0.003
UniRef50_Q7QT08 Cluster: GLP_675_33860_35197; n=1; Giardia lambl...    44   0.003
UniRef50_A1ZAP8 Cluster: CG30460-PC, isoform C; n=5; Drosophila ...    44   0.004
UniRef50_Q381C5 Cluster: Putative uncharacterized protein; n=1; ...    44   0.005
UniRef50_Q4FY30 Cluster: Putative uncharacterized protein; n=3; ...    39   0.15
UniRef50_Q387B6 Cluster: Putative uncharacterized protein; n=1; ...    38   0.35
UniRef50_Q7QXY3 Cluster: GLP_479_39609_38410; n=1; Giardia lambl...    37   0.46
UniRef50_Q7QRE7 Cluster: GLP_503_3295_2699; n=1; Giardia lamblia...    37   0.46
UniRef50_Q17BP6 Cluster: Putative uncharacterized protein; n=1; ...    36   0.81
UniRef50_UPI00006CBAC8 Cluster: hypothetical protein TTHERM_0050...    36   1.4
UniRef50_A0BJ34 Cluster: Chromosome undetermined scaffold_11, wh...    36   1.4
UniRef50_A3PYR3 Cluster: Putative uncharacterized protein; n=1; ...    35   1.9
UniRef50_Q17JM9 Cluster: Predicted protein; n=1; Aedes aegypti|R...    35   1.9
UniRef50_Q4AP45 Cluster: Radical SAM; n=3; Bacteria|Rep: Radical...    34   3.3
UniRef50_A7ACD8 Cluster: Putative uncharacterized protein; n=1; ...    34   3.3
UniRef50_Q4Q4S7 Cluster: Putative uncharacterized protein; n=3; ...    34   3.3
UniRef50_A2EVV5 Cluster: Putative uncharacterized protein; n=1; ...    34   3.3
UniRef50_Q8MRK6 Cluster: GH27233p; n=1; Drosophila melanogaster|...    34   4.3
UniRef50_A2E8N3 Cluster: Putative uncharacterized protein; n=2; ...
34 4.3 UniRef50_Q2H9A8 Cluster: Putative uncharacterized protein; n=2; ... 34 4.3 UniRef50_P39505 Cluster: Uncharacterized 9.4 kDa protein in nrdB... 34 4.3 UniRef50_P28698 Cluster: Myeloid zinc finger 1; n=19; Eutheria|R... 34 4.3 UniRef50_Q247Z8 Cluster: Putative uncharacterized protein; n=1; ... 33 5.7 UniRef50_A5DMI5 Cluster: Putative uncharacterized protein; n=1; ... 33 5.7 UniRef50_A5KAW1 Cluster: Merozoite surface protein 3 (MSP3), put... 33 7.6 UniRef50_A4RL95 Cluster: Predicted protein; n=1; Magnaporthe gri... 33 7.6 UniRef50_Q4D375 Cluster: Dispersed gene family protein 1 (DGF-1)... 33 10.0 UniRef50_Q227R4 Cluster: Zinc finger, C2H2 type family protein; ... 33 10.0 >UniRef50_Q5TVV1 Cluster: ENSANGP00000026760; n=2; Culicidae|Rep: ENSANGP00000026760 - Anopheles gambiae str. PEST Length = 384 Score = 190 bits (464), Expect = 2e-47 Identities = 87/151 (57%), Positives = 109/151 (72%), Gaps = 3/151 (1%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINK-LRKTTATPST 435 C +C R+FA++RI KH++IC+K +KKRK FD+ KHR+ GT+AE ++ K +K ++ PST Sbjct: 226 CDICSRNFATERIDKHRQICQKTKTKKRKVFDITKHRVQGTDAESYVLKGKKKQSSQPST 285 Query: 436 --TKVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCP 609 SNWR+KHEEFI IRAAK+++AHL GGKLSDL SENPDY+QCP Sbjct: 286 GAAAAAAAGSKQSNWRKKHEEFIATIRAAKEMKAHLARGGKLSDLPPPPPSENPDYIQCP 345 Query: 610 HCNRRFNQGAAERHIPKCANFQFNKPKPAAK 702 HC+RRFNQ AAERHIPKCA NKPKP K Sbjct: 346 HCSRRFNQTAAERHIPKCATMLHNKPKPKPK 376 >UniRef50_UPI00015B5B53 Cluster: PREDICTED: similar to ENSANGP00000026760; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to ENSANGP00000026760 - Nasonia vitripennis Length = 1097 Score = 185 bits (451), Expect = 9e-46 Identities = 96/202 (47%), Positives = 117/202 (57%), Gaps = 2/202 (0%) Frame = +1 Query: 109 TNKTRGAM-QRPANTTPRKPPVKANSAGSGTPKGRXXXXXXXXXXXXXGDACGVCGRHFA 285 TN++ G+ RP PP A TP + C +C R FA Sbjct: 910 TNRSHGSTASRPTGKPKAAPPTPA---ARSTPSSKGSAASNDDSL----STCKICNRRFA 962 Query: 286 
SDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINK-LRKTTATPSTTKVNKGKQL 462 +DRI H++IC K KKRK FD L HR+ GTE E F+ K ++K P Sbjct: 963 TDRIGLHEQICAKTSQKKRKQFDALTHRVKGTELESFVQKPVKKQVQYPQP--------- 1013 Query: 463 NSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCNRRFNQGAA 642 SNWR+KHE+FI AIR+AKQ+QAHL +GGKLSDL S+ DY+QCPHC+R+FNQGAA Sbjct: 1014 -SNWRRKHEDFINAIRSAKQMQAHLASGGKLSDLPPPPPSDTSDYIQCPHCSRKFNQGAA 1072 Query: 643 ERHIPKCANFQFNKPKPAAKRR 708 ERHIPKCAN Q NKP P A R Sbjct: 1073 ERHIPKCANMQHNKPNPRAPPR 1094 >UniRef50_Q9VNI7 Cluster: CG10999-PA, isoform A; n=3; Drosophila melanogaster|Rep: CG10999-PA, isoform A - Drosophila melanogaster (Fruit fly) Length = 383 Score = 184 bits (449), Expect = 2e-45 Identities = 90/157 (57%), Positives = 109/157 (69%), Gaps = 7/157 (4%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLR--KTTATPS 432 C CGRHF +DR+AKH+E+C++ + KRK FD K R+ GTEA F K + + +T S Sbjct: 227 CRYCGRHFNTDRLAKHEEVCQRMLTTKRKIFDASKQRIEGTEAAAFNMKSKGNRNRSTYS 286 Query: 433 TTKVNKGKQLN---SNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQ 603 + KG +NWR+KHE+FIQ+IRAAKQV+AHL GGKLSDL SENPDYVQ Sbjct: 287 SAAQQKGLTTGVKKNNWRKKHEDFIQSIRAAKQVKAHLARGGKLSDLPPPPPSENPDYVQ 346 Query: 604 CPHCNRRFNQGAAERHIPKCANFQFNKPK--PAAKRR 708 CPHC RRFN+ AAERHIPKC N NKP+ P AKRR Sbjct: 347 CPHCGRRFNEQAAERHIPKCVNMVHNKPRNGPPAKRR 383 >UniRef50_UPI0000D578E2 Cluster: PREDICTED: similar to CG10999-PA, isoform A; n=1; Tribolium castaneum|Rep: PREDICTED: similar to CG10999-PA, isoform A - Tribolium castaneum Length = 480 Score = 175 bits (427), Expect = 7e-43 Identities = 84/201 (41%), Positives = 113/201 (56%) Frame = +1 Query: 103 SETNKTRGAMQRPANTTPRKPPVKANSAGSGTPKGRXXXXXXXXXXXXXGDACGVCGRHF 282 S ++ + A P +PP A + P + + C C R F Sbjct: 278 SLSHTMQSAKSNPVKKGTPQPPQSARAPCKDRPSAKQSAKSPVARDDL--NECRFCNRRF 335 Query: 283 ASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTTKVNKGKQL 462 A+DR+ H+ IC K KKRK +D KHR+ GTE E ++ + + ++ S + + Sbjct: 336 AADRLQVHESICGKTAKKKRKIYDATKHRVEGTELEQYVRRGKNLSSKASNRQAPR---- 391 
Query: 463 NSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCNRRFNQGAA 642 +WR+ HEEFI AIRAAK QAH+ GGKL+DL S NPDYVQCPHC R+FN+ AA Sbjct: 392 -KDWRRTHEEFINAIRAAKMAQAHVAKGGKLADLPPPPPSSNPDYVQCPHCGRKFNEAAA 450 Query: 643 ERHIPKCANFQFNKPKPAAKR 705 ERHIPKCA ++FNKPKP A + Sbjct: 451 ERHIPKCATYEFNKPKPGANK 471 >UniRef50_A7SCJ1 Cluster: Predicted protein; n=2; Eumetazoa|Rep: Predicted protein - Nematostella vectensis Length = 139 Score = 147 bits (357), Expect = 2e-34 Identities = 74/143 (51%), Positives = 91/143 (63%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438 C CGR+FA DRIAKH+ IC+K +K+RK FD K R +GTEA + + P+ Sbjct: 6 CPNCGRNFAMDRIAKHETICRKTGTKQRKVFDSTKARTSGTEAAGYNRPGARKKPEPAVP 65 Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618 K NWR KH+EFI+AIR AK+V H+ +GGK+SDL SENPDYV C +C Sbjct: 66 K--------GNWRAKHQEFIRAIRDAKKVSQHIASGGKVSDLPPPQYSENPDYVLCRYCQ 117 Query: 619 RRFNQGAAERHIPKCANFQFNKP 687 RRFN AERHIPKCAN N+P Sbjct: 118 RRFNPTVAERHIPKCAN-TTNRP 139 >UniRef50_UPI00006A0811 Cluster: Uncharacterized protein C14orf140.; n=1; Xenopus tropicalis|Rep: Uncharacterized protein C14orf140. - Xenopus tropicalis Length = 360 Score = 146 bits (353), Expect = 7e-34 Identities = 74/149 (49%), Positives = 90/149 (60%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438 C +CGR F + R+ KH ++C+K RK FD K R GTE E ++ KT P+ Sbjct: 220 CNLCGRQFLAHRLEKHTQVCQKMQKSNRKVFDSSKARAKGTELEQYLQTKGKTR--PNVP 277 Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618 KV Q N+ WRQKHE F Q IR A+ VQ + GGKLSDL ENPDYV CPHCN Sbjct: 278 KV----QSNA-WRQKHESFQQTIRHARTVQQVIAKGGKLSDLPPPPPEENPDYVTCPHCN 332 Query: 619 RRFNQGAAERHIPKCANFQFNKPKPAAKR 705 RRF AERHIPKC N + +KP+P +R Sbjct: 333 RRFAPRVAERHIPKCENIK-SKPRPLRRR 360 >UniRef50_UPI0000611295 Cluster: Uncharacterized protein C14orf140.; n=3; Gallus gallus|Rep: Uncharacterized protein C14orf140. 
- Gallus gallus Length = 486 Score = 134 bits (323), Expect = 3e-30 Identities = 70/150 (46%), Positives = 86/150 (57%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438 C CGR F R+ KH IC K+ KRK FD K R GT+ E F + K++ P Sbjct: 344 CSFCGRKFLCARLKKHMSICSKSQGSKRKTFDSSKARARGTDLEEF--QQWKSSERPQ-- 399 Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618 NK + N NWRQ HE FIQ +R A+QVQ L+ GGK+SDL ENPDY CP+C Sbjct: 400 --NKPPRRN-NWRQNHEAFIQTLRHARQVQQVLSKGGKVSDLPPLPPIENPDYTACPYCR 456 Query: 619 RRFNQGAAERHIPKCANFQFNKPKPAAKRR 708 RRF AE HIPKC N + N+P +R+ Sbjct: 457 RRFAPQVAETHIPKCKNIK-NRPSLPPQRK 485 >UniRef50_UPI0000503AD3 Cluster: RIKEN cDNA 2810002I04 gene; n=1; Rattus norvegicus|Rep: RIKEN cDNA 2810002I04 gene - Rattus norvegicus Length = 449 Score = 119 bits (286), Expect = 9e-26 Identities = 60/131 (45%), Positives = 75/131 (57%) Frame = +1 Query: 301 KHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTTKVNKGKQLNSNWRQ 480 +H +C K KRK FD + R GTE E ++N P+T K + S WRQ Sbjct: 320 RHSTVCGKMQGSKRKVFDSSRARAKGTELEQYLN-----WRGPATDKAEPPPR-KSTWRQ 373 Query: 481 KHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCNRRFNQGAAERHIPK 660 KHE FI+ +R A+QVQ + GG SDL +ENPDYVQCPHC+R F AERHIPK Sbjct: 374 KHESFIRTLRHARQVQQVIARGGNPSDLPSILPAENPDYVQCPHCSRHFAPKVAERHIPK 433 Query: 661 CANFQFNKPKP 693 C + N+P P Sbjct: 434 CKTIK-NRPPP 443 >UniRef50_Q5TFG8 Cluster: UPF0418 protein C6orf94; n=14; Theria|Rep: UPF0418 protein C6orf94 - Homo sapiens (Human) Length = 222 Score = 119 bits (286), Expect = 9e-26 Identities = 69/152 (45%), Positives = 88/152 (57%), Gaps = 4/152 (2%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438 C VCGR FA+D + +H ICKK ++KRKPF LK RL GT+ P + K ++ + P Sbjct: 18 CEVCGRRFAADVLERHGPICKKLFNRKRKPFSSLKQRLQGTDI-PTVKKTPQSKSPP--- 73 Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618 V K SNWRQ+HE+FI AIR+AKQ + G L S NPDY+Q P+C Sbjct: 74 
-VRK-----SNWRQQHEDFINAIRSAKQCMLAIKEGRPLP--PPPPPSLNPDYIQRPYCM 125 Query: 619 RRFNQGAAERHIPKCANFQ----FNKPKPAAK 702 RRFN+ AAERH C + FN + AAK Sbjct: 126 RRFNESAAERHTNFCKDQSSRRVFNPAQTAAK 157 >UniRef50_UPI0000E494DB Cluster: PREDICTED: similar to Chromosome 8 open reading frame 70; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to Chromosome 8 open reading frame 70 - Strongylocentrotus purpuratus Length = 323 Score = 115 bits (277), Expect = 1e-24 Identities = 60/149 (40%), Positives = 81/149 (54%), Gaps = 2/149 (1%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438 CG CGR F D +A+H ++C+K KKRK FD K R GT+ P T Sbjct: 18 CGTCGRTFLPDTLARHAKVCRKTAKKKRKTFDSSKQRAEGTD----------IGTVPKTN 67 Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618 + + +NWRQKHE+FI+A+++AK V + G L NPDYVQCP C Sbjct: 68 ERDLPPSKKNNWRQKHEDFIEAMQSAKGVSKAIKTGAPLPP-PPAQKRINPDYVQCPSCE 126 Query: 619 RRFNQGAAERHIPKC--ANFQFNKPKPAA 699 R F++ A+ERHIP C N + +K P+A Sbjct: 127 RHFSESASERHIPWCKEKNKRIDKRTPSA 155 >UniRef50_Q4QE35 Cluster: Putative uncharacterized protein; n=3; Trypanosomatidae|Rep: Putative uncharacterized protein - Leishmania major Length = 723 Score = 114 bits (275), Expect = 2e-24 Identities = 62/150 (41%), Positives = 79/150 (52%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438 C CGR FAS+ +AKH+ IC KKR+ F+ K RL TA + Sbjct: 588 CSHCGRQFASESLAKHERIC--CSQKKRRVFNATKQRLP-----------EGATAAAKPS 634 Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618 ++ +W+ + E F +A+R A+QV L AGG DL S NPDYV CPHC Sbjct: 635 AGSQPAAPKRDWKAESESFRRALREARQVDQVLKAGGTAKDLPPPTYSTNPDYVPCPHCQ 694 Query: 619 RRFNQGAAERHIPKCANFQFNKPKPAAKRR 708 RRF A RHIP+CAN N+PKP +RR Sbjct: 695 RRFAPDVAARHIPRCAN-TVNRPKPPPRRR 723 >UniRef50_UPI0000ECC8F5 Cluster: UPF0418 protein C6orf94.; n=3; Gallus gallus|Rep: UPF0418 protein C6orf94. 
- Gallus gallus Length = 178 Score = 113 bits (273), Expect = 3e-24 Identities = 61/142 (42%), Positives = 80/142 (56%), Gaps = 7/142 (4%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINK--LRKTTATPS 432 C +CGR FA D + +H+ ICKK +KKRKPF+ K RL GT+ + L+ Sbjct: 22 CRICGRQFAPDVLMRHEPICKKVFNKKRKPFNSFKQRLQGTDIGTVKRQPPLKVRLMLEH 81 Query: 433 TTKVNKGKQLN-----SNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDY 597 T + + +LN SNWRQ H +FI AI++AKQV + G L S NPDY Sbjct: 82 TLSLLEAFRLNQPVKKSNWRQHHADFINAIQSAKQVTKAMQEGRPLP--PPPPPSINPDY 139 Query: 598 VQCPHCNRRFNQGAAERHIPKC 663 +QCP C RRFN+ AA +HI C Sbjct: 140 IQCPFCLRRFNEAAAAKHIKFC 161 >UniRef50_Q96GY0 Cluster: UPF0418 protein C8orf70; n=23; Euteleostomi|Rep: UPF0418 protein C8orf70 - Homo sapiens (Human) Length = 325 Score = 112 bits (270), Expect = 8e-24 Identities = 60/135 (44%), Positives = 75/135 (55%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438 C +CGR F + KH IC+K +KKRK FD + R GT+ P + L+ P Sbjct: 19 CKICGRTFFPVALKKHGPICQKTATKKRKTFDSSRQRAEGTDI-PTVKPLKPRPEPPKKP 77 Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618 SNWR+KHEEFI IRAAK + L GGKL S +PDY+QCP+C Sbjct: 78 ---------SNWRRKHEEFIATIRAAKGLDQALKEGGKLPP--PPPPSYDPDYIQCPYCQ 126 Query: 619 RRFNQGAAERHIPKC 663 RRFN+ AA+RHI C Sbjct: 127 RRFNENAADRHINFC 141 >UniRef50_A7T301 Cluster: Predicted protein; n=2; Eumetazoa|Rep: Predicted protein - Nematostella vectensis Length = 133 Score = 112 bits (269), Expect = 1e-23 Identities = 59/150 (39%), Positives = 81/150 (54%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438 C +CGR+F +DR+ KHQ++C K ++KRK FD+ K R AGTE E ++ Sbjct: 4 CSICGRNFQTDRLEKHQKVCAKNSTRKRKAFDMTKQRTAGTEHEKYVKA--------GAH 55 Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618 K K+++ WR +HE FI+AIR AK S ENP YVQCPHC Sbjct: 56 KQEPEKKVD--WRAQHESFIKAIRYAKG-----------SSDEPPPVMENPHYVQCPHCE 102 Query: 619 RRFNQGAAERHIPKCANFQFNKPKPAAKRR 708 R+FN 
AERHIP+C + + P + + Sbjct: 103 RKFNPETAERHIPRCKDIKARPAPPKGRNK 132 >UniRef50_A4H9B2 Cluster: Putative uncharacterized protein; n=1; Leishmania braziliensis|Rep: Putative uncharacterized protein - Leishmania braziliensis Length = 721 Score = 112 bits (269), Expect = 1e-23 Identities = 62/150 (41%), Positives = 78/150 (52%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438 C CGR F S+ + KH+ IC A KKR+ F+ K RLA TA + Sbjct: 586 CRHCGRRFVSESLGKHEHIC--ASLKKRRVFNATKQRLA-----------EGATAAAKVS 632 Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618 + K +W+ + F +AIR A+ V L AGG + DL S NPDYV CPHC Sbjct: 633 PAPQPKAPTRDWKAESVAFRRAIREARHVDQVLKAGGTIKDLPPPTYSINPDYVPCPHCQ 692 Query: 619 RRFNQGAAERHIPKCANFQFNKPKPAAKRR 708 RRF A RHIP+CAN N+PKP +RR Sbjct: 693 RRFAPDVAARHIPRCAN-TVNRPKPPPRRR 721 >UniRef50_Q5PPV5 Cluster: UPF0418 protein C8orf70 homolog; n=7; Eumetazoa|Rep: UPF0418 protein C8orf70 homolog - Xenopus laevis (African clawed frog) Length = 323 Score = 111 bits (267), Expect = 2e-23 Identities = 61/135 (45%), Positives = 77/135 (57%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438 C +CGR F + KH IC+K KKRK F+ + R GT+ IN ++ P Sbjct: 11 CKICGRTFFPATLKKHVPICQKTSVKKRKTFESSRQRAEGTD----INTVKPVKPRPEPP 66 Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618 K KQ SNW++KHEEFI IR+AK + L GG+L S +PDYVQCP+C Sbjct: 67 K----KQ--SNWKRKHEEFIATIRSAKGISQILKEGGELPP--PPPPSYDPDYVQCPYCQ 118 Query: 619 RRFNQGAAERHIPKC 663 RRFNQ AA+RHI C Sbjct: 119 RRFNQNAADRHINFC 133 >UniRef50_Q4CSD1 Cluster: Putative uncharacterized protein; n=2; Trypanosoma cruzi|Rep: Putative uncharacterized protein - Trypanosoma cruzi Length = 757 Score = 107 bits (258), Expect = 2e-22 Identities = 56/150 (37%), Positives = 80/150 (53%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438 C CGR FA + +++H+ +C KKR+ F++ RL+GT A+ + + Sbjct: 616 
CSNCGRTFALNVLSRHERVCTT--QKKRRVFNMRAMRLSGTGADQVAKSGSSGASAAAVA 673 Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618 K +WR + E F +++R A+QV L +GG + DL SEN Y CPHC Sbjct: 674 PAPK-----RDWRAESEAFRRSMRDARQVDKVLKSGGNVKDLPPPTYSENSHYTPCPHCG 728 Query: 619 RRFNQGAAERHIPKCANFQFNKPKPAAKRR 708 R+F AERHIP+CA NKPKP +RR Sbjct: 729 RKFAPDVAERHIPRCAT-TINKPKPPPRRR 757 >UniRef50_A2DFG0 Cluster: Putative uncharacterized protein; n=1; Trichomonas vaginalis G3|Rep: Putative uncharacterized protein - Trichomonas vaginalis G3 Length = 368 Score = 104 bits (250), Expect = 2e-21 Identities = 68/184 (36%), Positives = 93/184 (50%), Gaps = 10/184 (5%) Frame = +1 Query: 142 ANTTPRKP---PVKANSAGSGTPKGRXXXXXXXXXXXXXGD--ACGVCGRHFASDRIAKH 306 A+T P+ P P K SAG K D +C CGR FASDRI KH Sbjct: 176 ADTPPKSPKPAPAKKPSAGGALNKTLRRNAPPPAEADANDDRVSCSYCGRKFASDRIEKH 235 Query: 307 QEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTTKVNKGKQLNSNWRQKH 486 +EIC++ KK K FD K RL G EA F K+ K P +N + ++ +H Sbjct: 236 EEICRRQSMKKTKVFDSSKQRLEG-EAASFA-KVSKNKPKPKKETINGVPK----YKLQH 289 Query: 487 EEFIQAIRAAKQVQAHLNA---GGKLSDLXXXXXSE--NPDYVQCPHCNRRFNQGAAERH 651 +E ++A+RAA+++QA+ +A G + E + D VQCPHC R+F + A RH Sbjct: 290 QELVKAMRAARKLQAYQDAVERGEAVGPPPEMPKIELVDDDRVQCPHCGRKFGEEQARRH 349 Query: 652 IPKC 663 IP C Sbjct: 350 IPNC 353 >UniRef50_A2DCT4 Cluster: Putative uncharacterized protein; n=1; Trichomonas vaginalis G3|Rep: Putative uncharacterized protein - Trichomonas vaginalis G3 Length = 504 Score = 98.7 bits (235), Expect = 1e-19 Identities = 56/186 (30%), Positives = 91/186 (48%), Gaps = 4/186 (2%) Frame = +1 Query: 160 KPPVKANSAGSGTPKGRXXXXXXXXXXXXXGDACGVCGRHFASDRIAKHQEICKKAHSKK 339 +PP ++ S P + C +CGR FA+DRI KH+EIC+K+ +KK Sbjct: 324 RPPSRSTSRSQPAPVPQENPPSPYADDNVELVECSICGRRFAADRIQKHEEICRKSATKK 383 Query: 340 RKPFDVLKHRLAGTEAEPFINKLRKTTATPSTTKVNKGKQLNSNWRQKHEEFIQAIRAAK 519 +K FD+ RLA T AE +I +++ + + K K ++++HE+ ++A+R A+ Sbjct: 384 KKVFDITSKRLADTGAEEYIGQIK-----AAKDEKPKPKNEVPKYKKEHEKLVEAMRNAR 438 Query: 520 
QVQAHLN--AGGK--LSDLXXXXXSENPDYVQCPHCNRRFNQGAAERHIPKCANFQFNKP 687 ++Q + A GK ++ D V CP C R+F + A RH P C K Sbjct: 439 KIQQYEKDVAAGKNVKPPELAPIQMDDDDRVTCPICGRKFGKEALARHTPGCEKMNARKL 498 Query: 688 KPAAKR 705 +R Sbjct: 499 NTRGRR 504 >UniRef50_A0BVD8 Cluster: Chromosome undetermined scaffold_13, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_13, whole genome shotgun sequence - Paramecium tetraurelia Length = 416 Score = 90.2 bits (214), Expect = 5e-17 Identities = 59/149 (39%), Positives = 69/149 (46%), Gaps = 2/149 (1%) Frame = +1 Query: 268 CGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTTKVN 447 CGR FA + KH++IC K K+RK FD KHR+ E I K K Sbjct: 281 CGRSFAKLALQKHEKICVKVFQKQRKQFDAQKHRIISNEQISHIKNQDKIEQ-----KYE 335 Query: 448 KGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSE--NPDYVQCPHCNR 621 K NW+ + E F AI AAK GGKL+ E + VQC +C R Sbjct: 336 KALAKKQNWKNQSEAFRAAIIAAK--------GGKLTKDQKNAMQEASKSNLVQCNYCGR 387 Query: 622 RFNQGAAERHIPKCANFQFNKPKPAAKRR 708 FNQ AAERHIP CA PK KRR Sbjct: 388 SFNQQAAERHIPFCAQKSKIPPKQPQKRR 416 >UniRef50_A0BQE5 Cluster: Chromosome undetermined scaffold_120, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_120, whole genome shotgun sequence - Paramecium tetraurelia Length = 352 Score = 90.2 bits (214), Expect = 5e-17 Identities = 49/152 (32%), Positives = 78/152 (51%) Frame = +1 Query: 253 DACGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPS 432 + C +C R F ++RI KH+++C+KA K+ + ++K + K + Sbjct: 37 EQCEICSRKFHTERIGKHRQVCEKAQQKQMQREKLIKRKQQ--------QKAEHQQKLDA 88 Query: 433 TTKVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPH 612 K K K +N NWR++H +F + I K+ + N G + + +EN YVQC + Sbjct: 89 KEKQVKNKTVN-NWREQHRQFQEMIHCNKKEKEVQNEGEEEIAVKTLDLAENSLYVQCEY 147 Query: 613 CNRRFNQGAAERHIPKCANFQFNKPKPAAKRR 708 C R F++ AERHIPKC + KPKP K + Sbjct: 148 CKRSFDRYVAERHIPKCKEIK-AKPKPLKKNQ 178 Score = 34.7 bits (76), Expect = 2.5 Identities = 12/21 (57%), Positives = 
16/21 (76%) Frame = +1 Query: 601 QCPHCNRRFNQGAAERHIPKC 663 +CP+C R+FN AA RH+P C Sbjct: 250 ECPYCLRKFNPKAALRHVPIC 270 >UniRef50_A2G025 Cluster: Zinc finger, C2H2 type family protein; n=1; Trichomonas vaginalis G3|Rep: Zinc finger, C2H2 type family protein - Trichomonas vaginalis G3 Length = 340 Score = 89.4 bits (212), Expect = 8e-17 Identities = 53/153 (34%), Positives = 82/153 (53%), Gaps = 3/153 (1%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438 C CGR FA DR+ H+ IC K ++KR+PF+ HR++GTE ++ + + + ++ Sbjct: 195 CHYCGRKFAPDRLPVHERICAK--TRKRRPFNASMHRVSGTEMR-YVPRSSRAESKSNSR 251 Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSEN-PD-YVQCPH 612 K GK ++ +HE + A+RAA+ + A+ GK+ + ++ PD V+CP Sbjct: 252 KYINGK---PKYKIEHENLVAALRAARGMAAY--ESGKIKAMPKMPKMQDVPDGRVKCPV 306 Query: 613 CNRRFNQGAAERHIPKC-ANFQFNKPKPAAKRR 708 C R+F AERHIP C N P KRR Sbjct: 307 CGRKFGPEQAERHIPFCKRNAGIRPPARPVKRR 339 >UniRef50_Q22122 Cluster: UPF0418 protein T03G11.3; n=2; Caenorhabditis|Rep: UPF0418 protein T03G11.3 - Caenorhabditis elegans Length = 349 Score = 88.6 bits (210), Expect = 1e-16 Identities = 47/135 (34%), Positives = 68/135 (50%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438 C +C R F + KH+ C+K S RKPFD K R +G++ ++K + Sbjct: 23 CPICDRRFIKSSLEKHESACRKLASLHRKPFDSGKQRASGSDLT--YADIKKVQHEKNKN 80 Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618 + +NWR++H FI A+ ++K+V L G L + DYVQC +C+ Sbjct: 81 G-GVFPRPQTNWRERHGNFIDAVSSSKRVDYALKTGAPLPP--PPKTAVPSDYVQCEYCS 137 Query: 619 RRFNQGAAERHIPKC 663 R FN AAERHIP C Sbjct: 138 RNFNAAAAERHIPFC 152 >UniRef50_Q4DZW2 Cluster: Putative uncharacterized protein; n=2; Trypanosoma cruzi|Rep: Putative uncharacterized protein - Trypanosoma cruzi Length = 657 Score = 87.8 bits (208), Expect = 3e-16 Identities = 52/146 (35%), Positives = 74/146 (50%), Gaps = 11/146 (7%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438 CG+CGR 
F + +A+H+ C KKR FD + RL G E +R+ TA ++ Sbjct: 504 CGLCGRSFRASILARHESACSNLQ-KKRGVFDTKEQRLEGIEG------IREVTAPSNSV 556 Query: 439 KVNKGKQ-----LNSN------WRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSE 585 GK+ N+N W+ +HE+F A+RA +QV + +GG S Sbjct: 557 SQKGGKKHPVAVANTNPDKPPKWKIQHEQFQAAMRAMRQVNVNAPSGGGGSGGKQPMPEA 616 Query: 586 NPDYVQCPHCNRRFNQGAAERHIPKC 663 D V CPHC R+F + A+RHIPKC Sbjct: 617 YDDRVPCPHCGRKFAELTAQRHIPKC 642 >UniRef50_Q22W47 Cluster: Zinc finger, C2H2 type family protein; n=1; Tetrahymena thermophila SB210|Rep: Zinc finger, C2H2 type family protein - Tetrahymena thermophila SB210 Length = 1668 Score = 85.4 bits (202), Expect = 1e-15 Identities = 55/147 (37%), Positives = 73/147 (49%), Gaps = 2/147 (1%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438 C C R FASDRI+KH+ +CK +K+ L+ + A E KL K Sbjct: 1372 CRKCNRKFASDRISKHESVCKPGPTKQ-----ALRKQKA---LELKKQKLEKND------ 1417 Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSE--NPDYVQCPH 612 + + K N+NWRQ+HEEF ++ ++V GG + L S + QCPH Sbjct: 1418 RFYEQKLANNNWRQQHEEFQNQLKYMRKVGNVEKNGGDIRSLPPPPKSNAMRSNMKQCPH 1477 Query: 613 CNRRFNQGAAERHIPKCANFQFNKPKP 693 C R F+ AA RHIPKC NKPKP Sbjct: 1478 CLRNFSDEAAARHIPKCKT-TINKPKP 1503 >UniRef50_A0D772 Cluster: Chromosome undetermined scaffold_4, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_4, whole genome shotgun sequence - Paramecium tetraurelia Length = 775 Score = 80.6 bits (190), Expect = 4e-14 Identities = 51/151 (33%), Positives = 72/151 (47%) Frame = +1 Query: 256 ACGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPST 435 AC C R FA DRI KH ++CK +K F+ +H + + K KT Sbjct: 488 ACEKCDRRFAQDRIKKHMKVCKG-----KKYFEKKEHVVE-------VQKAPKT------ 529 Query: 436 TKVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHC 615 WR+ HEEFI ++ +QV+ GG + L S N +YVQCP+C Sbjct: 530 -----------GWRKYHEEFINTVKYNRQVKKIQEEGGDIKQLGPPPVSSNSNYVQCPYC 578 Query: 616 NRRFNQGAAERHIPKCANFQFNKPKPAAKRR 708 R+F+ AE+HI C N NKPK 
+++ Sbjct: 579 QRKFDPSKAEKHISICQNV-VNKPKTIQEKK 608 >UniRef50_A2DDQ6 Cluster: Putative uncharacterized protein; n=1; Trichomonas vaginalis G3|Rep: Putative uncharacterized protein - Trichomonas vaginalis G3 Length = 474 Score = 80.2 bits (189), Expect = 5e-14 Identities = 54/161 (33%), Positives = 80/161 (49%), Gaps = 5/161 (3%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438 C +C R FA DRI +H + CK ++S+K+K FD K R A +A + + + TP Sbjct: 321 CLICHRKFAEDRIDRHMQACKTSNSRKKKVFDSAKMRNADNDAMQYQGR----SETP--P 374 Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAH--LNAGGKL---SDLXXXXXSENPDYVQ 603 KV K SN++++HE+ + ++AA+ + A GK + D V+ Sbjct: 375 KVKK-----SNYKEQHEQLVANLKAARAATEYEKAKAEGKAVGPPPKMPEYKLPDDDRVE 429 Query: 604 CPHCNRRFNQGAAERHIPKCANFQFNKPKPAAKRR*PGNPK 726 CP+C R+F AA+RHIP C K K K PG K Sbjct: 430 CPYCGRKFGSNAAQRHIPFCEKSHAGK-KLNDKGGKPGTTK 469 >UniRef50_Q57XS3 Cluster: Putative uncharacterized protein; n=1; Trypanosoma brucei|Rep: Putative uncharacterized protein - Trypanosoma brucei Length = 651 Score = 79.8 bits (188), Expect = 7e-14 Identities = 60/169 (35%), Positives = 74/169 (43%), Gaps = 25/169 (14%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEA------EPFINKLRKTT 420 C +CGR F S +A+H+ C K KKR+ FD+ RL G E I+ R Sbjct: 478 CNLCGRTFRSSILARHEAACAKV-QKKRRVFDMKGQRLEGIEGIHDVAPSSHISHGRGDG 536 Query: 421 ATPSTTKVNKGKQLNS-----NWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXS- 582 T K N Q+ W+ +HE+F A+RA +QV GGK S Sbjct: 537 GTFGAGKQNTTAQMGGQAKLPKWKIQHEQFQAAMRAMRQVTPEDAPGGKSGAQSTGSKST 596 Query: 583 -------------ENPDYVQCPHCNRRFNQGAAERHIPKCANFQFNKPK 690 E D V CPHC R+F Q AERHIPKCA KPK Sbjct: 597 QQRQLSQPVPLPAEYDDRVPCPHCGRKFAQMTAERHIPKCAT-TIAKPK 644 >UniRef50_Q4T2E6 Cluster: Chromosome undetermined SCAF10284, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF10284, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 183 Score = 79.0 bits (186), Expect = 1e-13 Identities = 46/130 
(35%), Positives = 65/130 (50%) Frame = +1 Query: 301 KHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTTKVNKGKQLNSNWRQ 480 +H ICKK +KKRK FD + R GT+ F ++ + +P KQ +NW + Sbjct: 1 RHAVICKKLANKKRKVFDSSRQRAEGTDISLF-RPIKPESESPK-------KQ--TNWHK 50 Query: 481 KHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCNRRFNQGAAERHIPK 660 KH++ I RA K + + GG L + + DY+QCP+C R FNQ A ERHI Sbjct: 51 KHKDIIAHPRAVKPLTLTMKEGGSLPP-PPPPPTYDQDYIQCPYCQRTFNQHAGERHIEF 109 Query: 661 CANFQFNKPK 690 C P+ Sbjct: 110 CQEQAARMPR 119 >UniRef50_UPI0000E48303 Cluster: PREDICTED: hypothetical protein, partial; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: hypothetical protein, partial - Strongylocentrotus purpuratus Length = 290 Score = 78.2 bits (184), Expect = 2e-13 Identities = 37/83 (44%), Positives = 52/83 (62%), Gaps = 2/83 (2%) Frame = +1 Query: 466 SNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCNRRFNQGAAE 645 +NWRQKHE+FI+A+++AK V + G L NPDYVQCP C+R F++ A+E Sbjct: 3 NNWRQKHEDFIEAMQSAKGVSKAIKTGAPLPP-PPAQKRINPDYVQCPSCDRHFSESASE 61 Query: 646 RHIPKC--ANFQFNKPKPAAKRR 708 RHIP C N + +K P+A + Sbjct: 62 RHIPWCKEKNKRIDKRTPSAAEK 84 >UniRef50_UPI0000587617 Cluster: PREDICTED: hypothetical protein; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: hypothetical protein - Strongylocentrotus purpuratus Length = 332 Score = 77.4 bits (182), Expect = 4e-13 Identities = 40/83 (48%), Positives = 48/83 (57%), Gaps = 3/83 (3%) Frame = +1 Query: 466 SNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCNRRFNQGAAE 645 +NWR H +F+ AIR+A+Q Q +N G L S NPDY+QCPHC RRF+Q AA Sbjct: 40 TNWRNNHADFVNAIRSARQAQHAINTGQPLPP--PPPPSINPDYIQCPHCGRRFSQTAAA 97 Query: 646 RHIPKCA--NFQFNKP-KPAAKR 705 RHI C F P KP KR Sbjct: 98 RHINFCGERTNTFGAPVKPLNKR 120 >UniRef50_Q4QJB7 Cluster: Putative uncharacterized protein; n=3; Leishmania|Rep: Putative uncharacterized protein - Leishmania major Length = 664 Score = 76.6 bits (180), Expect = 6e-13 Identities = 52/165 (31%), Positives = 75/165 (45%), Gaps = 21/165 (12%) Frame = +1 Query: 259 
CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438 C CGR F + +H+ +C+ +K RK F++ + RL G E I ++++T A Sbjct: 498 CRTCGRRFRISVVMRHEALCRNQANKPRKVFNMREQRLDGVEG---IKEVQRTAARSGGG 554 Query: 439 KVNKG------------------KQLNSNWRQKHEEFIQAIRAAKQVQAHLNAG---GKL 555 +G K W+ +HE+F A+RA +Q Q G G++ Sbjct: 555 GGGRGAGGGGGRGGGADAAAGAAKGKLPKWKVQHEQFQAAMRAVRQ-QKEAGGGFGSGRM 613 Query: 556 SDLXXXXXSENPDYVQCPHCNRRFNQGAAERHIPKCANFQFNKPK 690 + E D V CPHC R+F Q A RHIPKCA KPK Sbjct: 614 APPPAPIPEEYDDRVPCPHCGRKFAQDVAARHIPKCAT-TIAKPK 657 >UniRef50_A0BPA2 Cluster: Chromosome undetermined scaffold_12, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_12, whole genome shotgun sequence - Paramecium tetraurelia Length = 348 Score = 76.6 bits (180), Expect = 6e-13 Identities = 42/151 (27%), Positives = 77/151 (50%) Frame = +1 Query: 253 DACGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPS 432 ++C +C R F +RI +H C+KA K+++ +++ + N+ +K Sbjct: 20 ESCDLCNRKFHPERIERHLIACQKAQQKQQERDKIIQKKKKQ-------NEQKKQQLQQV 72 Query: 433 TTKVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPH 612 ++ K +NWR++H++F + I+ ++++ G ++ L N +YV C + Sbjct: 73 DVEIVK-----TNWREEHQKFQEQIQYNRKLKQLETEGQDVNQLKPLETKVNSNYVFCEY 127 Query: 613 CNRRFNQGAAERHIPKCANFQFNKPKPAAKR 705 C R F++ AERHIPKC KPKP K+ Sbjct: 128 CERHFDKHVAERHIPKCKEI-IAKPKPPRKK 157 Score = 55.2 bits (127), Expect = 2e-06 Identities = 41/141 (29%), Positives = 65/141 (46%), Gaps = 6/141 (4%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKT-TATPST 435 C C RHF +H CK+ +K + P + ++P + + R+ +TPST Sbjct: 125 CEYCERHFDKHVAERHIPKCKEIIAKPKPPRKKTVEMIQ--PSQPSLQEKRQAQVSTPST 182 Query: 436 TKVNKGK-----QLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYV 600 + + K QL+ + +Q +Q A + +A+L G + ++ Sbjct: 183 SSQMERKPIIKKQLSDSSQQFRPTSLQKFIAEQSGKANLTNIGFIDCKARATAIQD---T 239 Query: 601 QCPHCNRRFNQGAAERHIPKC 663 +CPHCNRRF AAERHIP C Sbjct: 240 ECPHCNRRFISRAAERHIPIC 260 >UniRef50_UPI0000DB6EB3 Cluster: 
PREDICTED: similar to CG30460-PC, isoform C; n=1; Apis mellifera|Rep: PREDICTED: similar to CG30460-PC, isoform C - Apis mellifera Length = 1091 Score = 75.4 bits (177), Expect = 1e-12 Identities = 39/124 (31%), Positives = 64/124 (51%), Gaps = 1/124 (0%) Frame = +1 Query: 295 IAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTTKVNKGKQLNSNW 474 + KH IC+++ +KKRKPFD K R+ GTE F+ + K +P ++ + +W Sbjct: 5 LEKHARICERSANKKRKPFDSAKQRIQGTELAEFLPRQEKKRRSPE-------EKSSKSW 57 Query: 475 RQKHEEFIQAIRAAKQVQAHLNAGGKLS-DLXXXXXSENPDYVQCPHCNRRFNQGAAERH 651 +Q H++F++AIRAA+ + S + + + CP CNR F A +RH Sbjct: 58 KQTHDDFLRAIRAARNEIVDSTMQKQCSTTITSSAPTRANEQGMCPTCNRHFGVKAYDRH 117 Query: 652 IPKC 663 + C Sbjct: 118 VAWC 121 >UniRef50_Q23G89 Cluster: Zinc finger, C2H2 type family protein; n=1; Tetrahymena thermophila SB210|Rep: Zinc finger, C2H2 type family protein - Tetrahymena thermophila SB210 Length = 718 Score = 74.1 bits (174), Expect = 3e-12 Identities = 46/143 (32%), Positives = 69/143 (48%), Gaps = 9/143 (6%) Frame = +1 Query: 268 CGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFI------NKLRKTTATP 429 CGR FA + + KH +ICKK +KRK FD K R+ E E + K+R A+ Sbjct: 573 CGRRFAPEALEKHAKICKKVFQQKRKKFDTKKQRINDEEHEQILQQAQMEEKMRNQYASK 632 Query: 430 S--TTKVNKGKQ-LNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYV 600 + K N +Q S WR + E+F +R+ N GG+ + + D + Sbjct: 633 NKQPAKTNTQQQDKKSKWRMQSEQFRAVLRS--------NKGGEQAQ----DIPQYDDRI 680 Query: 601 QCPHCNRRFNQGAAERHIPKCAN 669 +CPHC R+F + + +H CAN Sbjct: 681 ECPHCKRKFQESSYNKHEQICAN 703 >UniRef50_A0CCZ1 Cluster: Chromosome undetermined scaffold_169, whole genome shotgun sequence; n=3; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_169, whole genome shotgun sequence - Paramecium tetraurelia Length = 530 Score = 72.5 bits (170), Expect = 1e-11 Identities = 49/143 (34%), Positives = 66/143 (46%), Gaps = 4/143 (2%) Frame = +1 Query: 268 CGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTTKVN 447 CGR F + + KH ++CKK KRK F+ HR E KL K + Sbjct: 406 
CGRRFKENVLDKHIKVCKKVFQSKRKEFNSKAHRQVNQEQV----KLEKQGLVKDKI-IE 460 Query: 448 KGKQLNSN----WRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHC 615 K KQ+ N W+++ E F Q I AAK G +D+ D V+CP C Sbjct: 461 KKKQMAQNGDPKWKKQSEAFRQMISAAKS--------GGTADI-----QPQDDLVECPGC 507 Query: 616 NRRFNQGAAERHIPKCANFQFNK 684 R+F++ AAERHIP C F + Sbjct: 508 GRKFSEQAAERHIPGCKKRNFKR 530 >UniRef50_A0E4A7 Cluster: Chromosome undetermined scaffold_78, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_78, whole genome shotgun sequence - Paramecium tetraurelia Length = 361 Score = 71.7 bits (168), Expect = 2e-11 Identities = 46/144 (31%), Positives = 63/144 (43%), Gaps = 12/144 (8%) Frame = +1 Query: 268 CGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINK------------LR 411 CGR F + KH +ICKK +KRK FD +HR+ + + K + Sbjct: 147 CGRKFKRSALQKHIKICKKVFQEKRKAFDTKEHRILNPDHAKLLQKQEQEDKIQQQQQQK 206 Query: 412 KTTATPSTTKVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENP 591 K A P Q W+ + E+F RAA ++ N G L+ E Sbjct: 207 KKQAQPKIDDRPLQGQKKPKWKLQSEQF----RAAMKI----NKGVPLTQQEQVAIEEVD 258 Query: 592 DYVQCPHCNRRFNQGAAERHIPKC 663 D VQC HC R+FN+ A +HIP C Sbjct: 259 DRVQCEHCGRKFNEQTALKHIPSC 282 >UniRef50_UPI0000D56C05 Cluster: PREDICTED: similar to CG10999-PA, isoform A; n=1; Tribolium castaneum|Rep: PREDICTED: similar to CG10999-PA, isoform A - Tribolium castaneum Length = 926 Score = 70.9 bits (166), Expect = 3e-11 Identities = 44/135 (32%), Positives = 62/135 (45%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438 C CGR F + KH IC+K +KKRK FD LK R+ GT+ F K S Sbjct: 15 CQTCGRTFLPLPLKKHAPICEKNATKKRKVFDSLKQRVEGTDLAQFHQKSYLKKPLESAP 74 Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618 K K + W + H++ + AIR+AK G +S + + + +CP C Sbjct: 75 KPQK-----NQWEENHQKLVDAIRSAK---------GNMSSVKKATPPPSLN-ERCPFCE 119 Query: 619 RRFNQGAAERHIPKC 663 R F A +RH+ C Sbjct: 120 RHFGPKAFDRHVEWC 134 >UniRef50_Q22FZ1 Cluster: Putative uncharacterized protein; 
n=1; Tetrahymena thermophila SB210|Rep: Putative uncharacterized protein - Tetrahymena thermophila SB210 Length = 1535 Score = 68.1 bits (159), Expect = 2e-10 Identities = 50/142 (35%), Positives = 72/142 (50%), Gaps = 10/142 (7%) Frame = +1 Query: 268 CGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRL--AGTEA--EP--FINKLRKTTATP 429 CGR F + + KH++ICKK +KRK FD RL +G + +P +K +K A Sbjct: 1353 CGRKFNQESLPKHEKICKKVFQQKRKQFDSQAARLNISGMQELDQPPQISSKQQKKNANQ 1412 Query: 430 STTKVNKG-KQLNSN---WRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDY 597 + K K K NSN W+++ E F ++ + +A A + S + E Y Sbjct: 1413 NQNKKEKNDKNSNSNKPSWKKQSEAFRMQLQQQRTGEA---ADPQSSAM----MQEALGY 1465 Query: 598 VQCPHCNRRFNQGAAERHIPKC 663 V C C R+FN+ AAERHIP C Sbjct: 1466 VGCNFCGRKFNKVAAERHIPFC 1487 >UniRef50_Q4Q229 Cluster: Putative uncharacterized protein; n=3; Leishmania|Rep: Putative uncharacterized protein - Leishmania major Length = 348 Score = 66.1 bits (154), Expect = 9e-10 Identities = 38/104 (36%), Positives = 53/104 (50%), Gaps = 13/104 (12%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFI------------- 399 C CGR F DRIA H+ +CK +KR+ FD K R AG+E + Sbjct: 103 CSKCGRTFNFDRIAYHESVCK--GDQKRRVFDSSKQRCAGSEGDDAYAGGAFGAPSGVRR 160 Query: 400 NKLRKTTATPSTTKVNKGKQLNSNWRQKHEEFIQAIRAAKQVQA 531 + +K +T++ +NWRQ+HEEFI AIR+AK+ A Sbjct: 161 GRTKKLGTANTTSRYTPAPATQTNWRQQHEEFIAAIRSAKRADA 204 >UniRef50_A0E7F3 Cluster: Chromosome undetermined scaffold_81, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_81, whole genome shotgun sequence - Paramecium tetraurelia Length = 417 Score = 62.1 bits (144), Expect = 1e-08 Identities = 46/140 (32%), Positives = 62/140 (44%), Gaps = 8/140 (5%) Frame = +1 Query: 268 CGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFIN-----KLRKTTATPS 432 CGR F + + KH +ICKK +KRK F+ K R EAE + R+ P Sbjct: 267 CGRSFNAKALEKHSKICKKVFQQKRKVFNSQKQR--QIEAEDNVKGRGGAMKRQVQKQPM 324 Query: 433 TTKVNKGKQLNS---NWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQ 603 + +Q+ S W+ + E F IR AK + 
LS D VQ Sbjct: 325 KQGQKQQQQVKSEKPKWKAQSEAFRAIIRQAKGQRLTKEEQTSLS----GAMESAQDLVQ 380 Query: 604 CPHCNRRFNQGAAERHIPKC 663 C CNR+FN AA++HI C Sbjct: 381 CKFCNRKFNTEAAKKHIVFC 400 >UniRef50_A2F2X4 Cluster: Putative uncharacterized protein; n=2; Trichomonas vaginalis G3|Rep: Putative uncharacterized protein - Trichomonas vaginalis G3 Length = 225 Score = 60.9 bits (141), Expect = 3e-08 Identities = 31/83 (37%), Positives = 44/83 (53%), Gaps = 4/83 (4%) Frame = +1 Query: 472 WRQKHEEFIQAIRAAKQV---QAHLNAGGKLSDLXXXXXSENP-DYVQCPHCNRRFNQGA 639 W++ H++ +++IRAA++ QA L AG + E P D VQCP C R+ ++ A Sbjct: 143 WQRDHDKMVESIRAARRYAKYQADLEAGKAVGPPPELPPIEEPPDLVQCPTCGRKMSEEA 202 Query: 640 AERHIPKCANFQFNKPKPAAKRR 708 A H P C NK A KRR Sbjct: 203 ARHHFPVCERMAMNKTYSAPKRR 225 >UniRef50_Q4CZP9 Cluster: Putative uncharacterized protein; n=2; Trypanosoma cruzi|Rep: Putative uncharacterized protein - Trypanosoma cruzi Length = 560 Score = 60.1 bits (139), Expect = 6e-08 Identities = 44/152 (28%), Positives = 65/152 (42%), Gaps = 17/152 (11%) Frame = +1 Query: 259 CGVCGRHF-ASDRIAKHQEICKKAHSKKR-------------KPFDVLKHRLAGTEAEPF 396 C CGRHF A R +H +C++ ++R KPF R + E+ F Sbjct: 398 CPHCGRHFFAETRWPRHVAVCEQQQQQQRQRKSQAESSRSVQKPFSQRVSRSSNMESMNF 457 Query: 397 INKLRKTTATPSTTKVNKGKQLNSNWRQKHEEFIQA---IRAAKQVQAHLNAGGKLSDLX 567 ++ TPS+ + G S +K ++ Q +R A Q+ + + L + Sbjct: 458 SGSFQEALQTPSSNRGKAGNAATSATEKKSSKWRQQRAQLRQALQLGSARSQNNSLKSVG 517 Query: 568 XXXXSENPDYVQCPHCNRRFNQGAAERHIPKC 663 E+ D V CP C RRF AERHIP C Sbjct: 518 DIDVFED-DRVACPACGRRFAPATAERHIPFC 548 >UniRef50_A0E456 Cluster: Chromosome undetermined scaffold_77, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_77, whole genome shotgun sequence - Paramecium tetraurelia Length = 566 Score = 56.8 bits (131), Expect = 5e-07 Identities = 39/142 (27%), Positives = 59/142 (41%), Gaps = 10/142 (7%) Frame = +1 Query: 268 CGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAE-------PFINKLRKTTAT 426 CGR F + KH ++C+K +KRK FD + R E E 
P + ++ Sbjct: 415 CGRSFNKKALEKHAKVCQKVFQQKRKVFDSQQQRQLDEEEEAYRPPPPPSKKQQQQQQQQ 474 Query: 427 PSTTKVNKGKQLNSN---WRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDY 597 + K K+ S+ W+ + + F I+ K Q L + D Sbjct: 475 QQQKQAQKQKESKSDKPKWKAQSDAFRAIIKQGKGEQLTKEEQVSLKN----AMDATQDL 530 Query: 598 VQCPHCNRRFNQGAAERHIPKC 663 VQC CNR+FN A++HI C Sbjct: 531 VQCKFCNRKFNSETAKKHIAFC 552 >UniRef50_Q38BB2 Cluster: Putative uncharacterized protein; n=3; Trypanosoma|Rep: Putative uncharacterized protein - Trypanosoma brucei Length = 301 Score = 56.4 bits (130), Expect = 7e-07 Identities = 36/96 (37%), Positives = 48/96 (50%), Gaps = 6/96 (6%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRL------AGTEAEPFINKLRKTT 420 C CGR F DRIA H+ +CK + KRK FD K R G P +K Sbjct: 98 CSRCGRKFLFDRIAYHESVCK--GNVKRKVFDSSKQRAIEGQYSGGCFGAPSAKGRKK-- 153 Query: 421 ATPSTTKVNKGKQLNSNWRQKHEEFIQAIRAAKQVQ 528 A P + G + WR++H EFI+A+RAA+Q + Sbjct: 154 AAPGASSPAPGVP-RTRWREQHREFIEAMRAARQAR 188 >UniRef50_Q22MG0 Cluster: Putative uncharacterized protein; n=1; Tetrahymena thermophila SB210|Rep: Putative uncharacterized protein - Tetrahymena thermophila SB210 Length = 575 Score = 54.4 bits (125), Expect = 3e-06 Identities = 48/168 (28%), Positives = 73/168 (43%), Gaps = 16/168 (9%) Frame = +1 Query: 253 DACGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLK--HRLAGTEAEPFIN----KLRK 414 + C C R F R+ HQ+ CK + K P ++LK + L+ + + + K +K Sbjct: 93 ETCNNCNRQFFQGRLNLHQKSCKPQNPLK--PLNMLKINNILSNSNEQSGLGSKQGKKKK 150 Query: 415 T------TATPSTTKVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLS----DL 564 ++ S T G S Q +F+Q +QV +L K L Sbjct: 151 LGYREVLSSRLSATPTTAGSVEKSVNNQNDLKFVQP---DEQVSTNLTTIPKWKIEHQSL 207 Query: 565 XXXXXSENPDYVQCPHCNRRFNQGAAERHIPKCANFQFNKPKPAAKRR 708 +YVQC +C R+F AE+HIP C N FN+PKP K++ Sbjct: 208 LLSIKPAQMNYVQCQYCLRKFKPQVAEQHIPNCKNI-FNRPKPPKKQQ 254 >UniRef50_Q24HM2 Cluster: Zinc finger, C2H2 type family protein; n=1; Tetrahymena thermophila SB210|Rep: Zinc finger, C2H2 type family protein - Tetrahymena thermophila SB210 Length = 1167 Score = 53.2 
bits (122), Expect = 7e-06 Identities = 33/96 (34%), Positives = 47/96 (48%), Gaps = 1/96 (1%) Frame = +1 Query: 424 TPSTTKVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQ 603 T TKV K K S W+ + E F +R A+ + ++ EN DY+Q Sbjct: 1063 TDENTKVGKKK---SKWQIQSEAFRAQMRMARGETTNSQYDNQI----VKEAFENNDYIQ 1115 Query: 604 CPHCNRRFNQGAAERHIPKC-ANFQFNKPKPAAKRR 708 C +C R+FN+ AA+RHIP C Q N+ K K + Sbjct: 1116 CEYCGRKFNEQAAQRHIPFCKTKSQQNQIKQGGKAK 1151 Score = 38.7 bits (86), Expect = 0.15 Identities = 16/41 (39%), Positives = 22/41 (53%) Frame = +1 Query: 268 CGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAE 390 CGR F + KH ++CKK KRK FD+ + R + E Sbjct: 466 CGRTFNEFALEKHVKVCKKVFQDKRKAFDITQKRQVAPQNE 506 >UniRef50_Q4T3R5 Cluster: Chromosome undetermined SCAF9936, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF9936, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 276 Score = 51.2 bits (117), Expect = 3e-05 Identities = 20/26 (76%), Positives = 22/26 (84%) Frame = +1 Query: 586 NPDYVQCPHCNRRFNQGAAERHIPKC 663 +PDYVQCP+C RRFNQ AAERHI C Sbjct: 70 DPDYVQCPYCQRRFNQHAAERHIKFC 95 Score = 50.0 bits (114), Expect = 6e-05 Identities = 20/27 (74%), Positives = 22/27 (81%) Frame = +1 Query: 583 ENPDYVQCPHCNRRFNQGAAERHIPKC 663 +N DYVQCP+C RRFNQ AAERHI C Sbjct: 201 QNLDYVQCPYCQRRFNQHAAERHIKFC 227 Score = 40.3 bits (90), Expect = 0.050 Identities = 16/42 (38%), Positives = 24/42 (57%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTE 384 C C R F + +H +C+K+ SKKR+ FD + R GT+ Sbjct: 5 CNTCKRSFNPKVLMRHSAVCQKSLSKKRRVFDSSRQRAEGTD 46 >UniRef50_UPI0000F20570 Cluster: PREDICTED: hypothetical protein; n=3; Danio rerio|Rep: PREDICTED: hypothetical protein - Danio rerio Length = 350 Score = 48.8 bits (111), Expect = 1e-04 Identities = 24/57 (42%), Positives = 33/57 (57%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATP 429 C VC R FA +R+ H +C+K +RK FD+ K+R GTE E F+ K + TP Sbjct: 281 
CSVCRRCFAPERLETHMRVCEKKR-PQRKVFDMSKYRARGTELEEFM-KTNSRSRTP 335 >UniRef50_A0D6H1 Cluster: Chromosome undetermined scaffold_4, whole genome shotgun sequence; n=4; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_4, whole genome shotgun sequence - Paramecium tetraurelia Length = 283 Score = 48.0 bits (109), Expect = 2e-04 Identities = 25/88 (28%), Positives = 46/88 (52%), Gaps = 1/88 (1%) Frame = +1 Query: 268 CGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTTKVN 447 CGR F SD + KH ++C++ +KR+ F+ + R+ + + R+ + Sbjct: 199 CGRQFKSDALEKHVKVCRQVFQQKRQEFNSKQARVVTNDQQKL---QRQGQIKEKQLQKK 255 Query: 448 KGK-QLNSNWRQKHEEFIQAIRAAKQVQ 528 +GK L+ NW+++ EE I+ +KQ Q Sbjct: 256 QGKAPLDPNWKKQSEELRNLIKESKQQQ 283 >UniRef50_UPI00006CBD30 Cluster: Zinc finger, C2H2 type family protein; n=1; Tetrahymena thermophila SB210|Rep: Zinc finger, C2H2 type family protein - Tetrahymena thermophila SB210 Length = 891 Score = 46.4 bits (105), Expect = 8e-04 Identities = 20/34 (58%), Positives = 23/34 (67%) Frame = +1 Query: 598 VQCPHCNRRFNQGAAERHIPKCANFQFNKPKPAA 699 VQCPHC R F + A+ERHIP C N N+P P A Sbjct: 725 VQCPHCERVFAKHASERHIPICKNV-LNRPNPLA 757 Score = 45.6 bits (103), Expect = 0.001 Identities = 42/152 (27%), Positives = 69/152 (45%), Gaps = 21/152 (13%) Frame = +1 Query: 301 KHQEICKKAHSKK--RKPFDVLKHRLAGTEAEPFINKLRKT-TATPSTTKVNKGKQLNSN 471 K QE +SK KP + K+ + T ++ N++++ T + K K+L Sbjct: 407 KEQEPTTNENSKIGINKPVNAQKN--STTPSQKANNQIKQLHTEQKNHNNDEKQKKL-PK 463 Query: 472 WRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXS-ENPDYVQCPHCNRRFNQ----- 633 W+ +H++F++ IR K+++ GG S + E Y+QC +C R+F + Sbjct: 464 WKIEHQQFLENIRYNKKIKQIEKEGGDKSQIERPVDDLEALGYIQCQYCQRKFAKVGIQQ 523 Query: 634 ------------GAAERHIPKCANFQFNKPKP 693 AERHIP C N N+PKP Sbjct: 524 QFLQINLYLLKLETAERHIPLCKNI-INRPKP 554 Score = 37.5 bits (83), Expect = 0.35 Identities = 28/85 (32%), Positives = 42/85 (49%), Gaps = 8/85 (9%) Frame = +1 Query: 475 RQKHEEF------IQAIRAA-KQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCNRRFNQ 633 R ++EEF + A+ A +Q ++ A G L+ +N DY 
CP+CNR+F Sbjct: 126 RSQYEEFQEKPIPLTAVLAEERQKNQYIQASGALN----AQSLQNDDYEFCPNCNRKFFS 181 Query: 634 GAAERHIPKCANFQFNKP-KPAAKR 705 G H+ C + NKP KP K+ Sbjct: 182 GRLNLHLKSC---KPNKPLKPIKKQ 203 Score = 37.5 bits (83), Expect = 0.35 Identities = 16/30 (53%), Positives = 19/30 (63%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKP 348 C +CGR F DRI KHQ C K S+K +P Sbjct: 300 CDICGRKFMQDRIEKHQVACSK--SQKARP 327 Score = 35.5 bits (78), Expect = 1.4 Identities = 40/160 (25%), Positives = 66/160 (41%), Gaps = 17/160 (10%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438 C C R F S R+ H + CK +K KP H ++ E + ++ R ++T Sbjct: 172 CPNCNRKFFSGRLNLHLKSCKP--NKPLKPIKKQSH-ISNEEDQQQLSPQRNNSSTVQFN 228 Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAK---QV-----------QAHLNAG---GKLSDLX 567 K ++ N+ Q E Q ++ K Q+ Q+ N K+++ Sbjct: 229 KNSEASSTNNIALQNKENDEQMNKSQKNKSQINPNTEKEENFQQSKYNLDIFEHKINNQH 288 Query: 568 XXXXSENPDYVQCPHCNRRFNQGAAERHIPKCANFQFNKP 687 SE+ + VQC C R+F Q E+H C+ Q +P Sbjct: 289 DEQQSED-NRVQCDICGRKFMQDRIEKHQVACSKSQKARP 327 >UniRef50_UPI0000E470FE Cluster: PREDICTED: hypothetical protein; n=2; Deuterostomia|Rep: PREDICTED: hypothetical protein - Strongylocentrotus purpuratus Length = 589 Score = 44.4 bits (100), Expect = 0.003 Identities = 48/188 (25%), Positives = 70/188 (37%), Gaps = 19/188 (10%) Frame = +1 Query: 157 RKPPVKANSAGSGTPKGRXXXXXXXXXXXXXGDA---CGVCGRHFASDRIAKHQEICK-- 321 R PP K + G G P G C CGR FA DRI KH+ IC Sbjct: 250 RPPPKKPQALGQGQPMGAGAEAYNAIASQSASSQLAPCSRCGRTFALDRIEKHESICSVK 309 Query: 322 --KAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTTKVNK--GKQLNSNWRQKHE 489 A S+ + P + +L+ +++ + + + T V G++ S HE Sbjct: 310 SGTAPSRGKTPSEG-NQQLSSSKSRSGPQSFQPSPPSKPRTLVCYICGREFGSKSLPIHE 368 Query: 490 E------FIQAIRAAKQVQAHL----NAGGKLSDLXXXXXSENPDYVQCPHCNRRFNQGA 639 IQ + K+ + L +A G S + N + V C C R FN Sbjct: 369 PQCLQKWKIQNSKLPKEHRKQLPRKPDASGGKSANEAAMDAANANLVACKKCGRTFNPDR 428 Query: 640 AERHIPKC 663 E+H C Sbjct: 429 IEKHQSIC 436 Score = 35.1 bits (77), Expect = 1.9 
Identities = 14/22 (63%), Positives = 15/22 (68%) Frame = +1 Query: 256 ACGVCGRHFASDRIAKHQEICK 321 AC CGR F DRI KHQ IC+ Sbjct: 416 ACKKCGRTFNPDRIEKHQSICR 437 >UniRef50_Q7QT08 Cluster: GLP_675_33860_35197; n=1; Giardia lamblia ATCC 50803|Rep: GLP_675_33860_35197 - Giardia lamblia ATCC 50803 Length = 445 Score = 44.4 bits (100), Expect = 0.003 Identities = 37/148 (25%), Positives = 55/148 (37%), Gaps = 4/148 (2%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHS---KKRKPFDVLKHRLAGTEAEPFINKLRKTTATP 429 C CGR FA DRI +H+ IC K + + K D + E +P K TT Sbjct: 214 CHRCGRKFAPDRITQHERICNKLKALPDEVDKAADGDTNYARTREPDPSRFKKNGTTGFN 273 Query: 430 STTKVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAG-GKLSDLXXXXXSENPDYVQC 606 V K + + + + AA + + G G + N V+C Sbjct: 274 KANIVPKTLTSSGDHDSRSPPIKSSSNAAARGKPFGGKGMGGSAPFGGGGGGMNDGRVEC 333 Query: 607 PHCNRRFNQGAAERHIPKCANFQFNKPK 690 C R+F ++H C N Q P+ Sbjct: 334 RRCGRKFAPDRIDKHESICKNIQNMDPR 361 Score = 39.5 bits (88), Expect = 0.087 Identities = 25/63 (39%), Positives = 29/63 (46%), Gaps = 8/63 (12%) Frame = +1 Query: 157 RKPPVKA--NSAGSGTP---KGRXXXXXXXXXXXXXGDA---CGVCGRHFASDRIAKHQE 312 R PP+K+ N+A G P KG D C CGR FA DRI KH+ Sbjct: 291 RSPPIKSSSNAAARGKPFGGKGMGGSAPFGGGGGGMNDGRVECRRCGRKFAPDRIDKHES 350 Query: 313 ICK 321 ICK Sbjct: 351 ICK 353 >UniRef50_A1ZAP8 Cluster: CG30460-PC, isoform C; n=5; Drosophila melanogaster|Rep: CG30460-PC, isoform C - Drosophila melanogaster (Fruit fly) Length = 1868 Score = 44.0 bits (99), Expect = 0.004 Identities = 22/47 (46%), Positives = 27/47 (57%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFI 399 C C R FA D + KH IC+KA SKKRK FD + R GT ++ Sbjct: 254 CPCCSRTFAVDTLRKHVVICEKA-SKKRKIFDSSRQRRDGTALSTYV 299 Score = 33.9 bits (74), Expect = 4.3 Identities = 12/25 (48%), Positives = 16/25 (64%) Frame = +1 Query: 589 PDYVQCPHCNRRFNQGAAERHIPKC 663 P +CPHC+R FN A +RH+ C Sbjct: 410 PPCDRCPHCDRTFNPKAFDRHVEWC 434 >UniRef50_Q381C5 Cluster: Putative uncharacterized protein; n=1; Trypanosoma brucei|Rep: Putative 
uncharacterized protein - Trypanosoma brucei Length = 616 Score = 43.6 bits (98), Expect = 0.005 Identities = 28/83 (33%), Positives = 37/83 (44%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438 C CGR F DR+ HQ CK + +P R A P NK R+ A P+T Sbjct: 185 CETCGRTFLPDRLEVHQRSCKPGSASASRPVG----RAVAKSATP--NKTRRLAAEPATA 238 Query: 439 KVNKGKQLNSNWRQKHEEFIQAI 507 + K K + + Q EE I A+ Sbjct: 239 R-RKEKLIPKAFPQDKEEEIDAV 260 >UniRef50_Q4FY30 Cluster: Putative uncharacterized protein; n=3; Leishmania|Rep: Putative uncharacterized protein - Leishmania major strain Friedlin Length = 404 Score = 38.7 bits (86), Expect = 0.15 Identities = 36/139 (25%), Positives = 52/139 (37%), Gaps = 2/139 (1%) Frame = +1 Query: 253 DACGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKH-RLAGTEAEPFINKLRKTTATP 429 + C CGR FA R+ +H C++ + K +K R + +P A Sbjct: 264 EPCPHCGRTFAPARLERHVVTCERHRNTLPKTKGDMKSCRAFSSRKKPDRTAGGDGAAAS 323 Query: 430 STTKVNK-GKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQC 606 + N G R + I+ + KQ NA S + + D V C Sbjct: 324 AAATANTAGAAPGGPLRWSTDTAIKPEKWRKQSAQLRNAMAGAS------VAVDDDRVLC 377 Query: 607 PHCNRRFNQGAAERHIPKC 663 P C R F+ A RHIP C Sbjct: 378 PSCGRHFSDDVAARHIPIC 396 Score = 32.7 bits (71), Expect = 10.0 Identities = 12/29 (41%), Positives = 14/29 (48%) Frame = +1 Query: 604 CPHCNRRFNQGAAERHIPKCANFQFNKPK 690 CPHC R F ERH+ C + PK Sbjct: 266 CPHCGRTFAPARLERHVVTCERHRNTLPK 294 >UniRef50_Q387B6 Cluster: Putative uncharacterized protein; n=1; Trypanosoma brucei|Rep: Putative uncharacterized protein - Trypanosoma brucei Length = 568 Score = 37.5 bits (83), Expect = 0.35 Identities = 15/24 (62%), Positives = 15/24 (62%) Frame = +1 Query: 592 DYVQCPHCNRRFNQGAAERHIPKC 663 D V CP C RRF AERHIP C Sbjct: 533 DRVPCPSCGRRFATHVAERHIPHC 556 >UniRef50_Q7QXY3 Cluster: GLP_479_39609_38410; n=1; Giardia lamblia ATCC 50803|Rep: GLP_479_39609_38410 - Giardia lamblia ATCC 50803 Length = 399 Score = 37.1 bits (82), Expect = 0.46 Identities = 30/142 (21%), Positives = 53/142 (37%), 
Gaps = 3/142 (2%) Frame = +1 Query: 250 GDACGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATP 429 G AC C + F + + H IC+ ++ AG + + + R+ Sbjct: 3 GIACPFCQKKFQAQDLITHCRICRALQAEASS---------AGQQTKTDTQRSRQPVGAG 53 Query: 430 STTKVNKGKQLNSNWRQKHEE--FIQAIRAAKQVQAH-LNAGGKLSDLXXXXXSENPDYV 600 T + ++ Q + ++Q + + + H N + SD S Sbjct: 54 ERTSNRVSEDFSTRKVQDSSKRSYVQQEQESSPSRNHSANTPARTSDPAEGEESRE---- 109 Query: 601 QCPHCNRRFNQGAAERHIPKCA 666 +CPHC RRF E+H+ CA Sbjct: 110 ECPHCGRRFISSRLEKHVSACA 131 Score = 32.7 bits (71), Expect = 10.0 Identities = 20/77 (25%), Positives = 32/77 (41%), Gaps = 3/77 (3%) Frame = +1 Query: 253 DACGVCGRHFASDRIAKHQEICKKAHSKKRKPFDV--LKHRLAGTEAEPFINKLRKTT-A 423 + C CGR F S R+ KH C K +++ F+ + R E +N+ +T Sbjct: 109 EECPHCGRRFISSRLEKHVSACAKLSTRRVPSFNPHDQRWRNVSNEDRQLVNEAEPSTPM 168 Query: 424 TPSTTKVNKGKQLNSNW 474 + S K + NW Sbjct: 169 SRSMVKSKTPVRKKLNW 185 >UniRef50_Q7QRE7 Cluster: GLP_503_3295_2699; n=1; Giardia lamblia ATCC 50803|Rep: GLP_503_3295_2699 - Giardia lamblia ATCC 50803 Length = 198 Score = 37.1 bits (82), Expect = 0.46 Identities = 15/30 (50%), Positives = 19/30 (63%) Frame = +1 Query: 256 ACGVCGRHFASDRIAKHQEICKKAHSKKRK 345 +C CGR FA DRI KH+++C K K K Sbjct: 139 SCEYCGRGFAPDRIDKHRQVCNKHPDKIAK 168 >UniRef50_Q17BP6 Cluster: Putative uncharacterized protein; n=1; Aedes aegypti|Rep: Putative uncharacterized protein - Aedes aegypti (Yellowfever mosquito) Length = 492 Score = 36.3 bits (80), Expect = 0.81 Identities = 32/131 (24%), Positives = 59/131 (45%), Gaps = 7/131 (5%) Frame = +1 Query: 259 CGVCGRHFA-SDRIAKHQEICKKAHSKKRKPF-----DVLKHRLAGTEAEPFINKLRKTT 420 C VCG+ F+ S +AKH+ K+ HSK R PF D + + ++ +++ Sbjct: 121 CDVCGKSFSESGNLAKHK---KQVHSKDR-PFKCEICDKSYPQKKDLQGHMLVHTMKRFA 176 Query: 421 ATPSTTKVNKGKQLNSNWRQKH-EEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDY 597 + + K ++ ++ + KH + I+ + A N+ K S+ N Sbjct: 177 CSICKEEFAKIEEKRAHVKAKHPNDSIERSFSCVLCNAVFNSKTKYSNHCLTHGERN--- 233 Query: 598 VQCPHCNRRFN 630 QCPHC ++F+ Sbjct: 234 FQCPHCTKKFH 244 >UniRef50_UPI00006CBAC8 
Cluster: hypothetical protein TTHERM_00502700; n=1; Tetrahymena thermophila SB210|Rep: hypothetical protein TTHERM_00502700 - Tetrahymena thermophila SB210 Length = 417 Score = 35.5 bits (78), Expect = 1.4 Identities = 13/24 (54%), Positives = 15/24 (62%) Frame = +1 Query: 592 DYVQCPHCNRRFNQGAAERHIPKC 663 D V C C R+F G AE+HIP C Sbjct: 368 DLVYCECCKRKFKPGPAEKHIPSC 391 >UniRef50_A0BJ34 Cluster: Chromosome undetermined scaffold_11, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_11, whole genome shotgun sequence - Paramecium tetraurelia Length = 354 Score = 35.5 bits (78), Expect = 1.4 Identities = 31/98 (31%), Positives = 43/98 (43%) Frame = +1 Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438 C CGR F DRI KH+ +C D+ K + E + NK P T Sbjct: 115 CRKCGRRFNPDRIRKHESVCIGPEP------DIQKIK----EQQQEQNKRAAKYLKPKKT 164 Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGK 552 GK W+Q+H EF QA+R ++V+ A G+ Sbjct: 165 ----GK-----WKQEHLEFQQAMREMRKVRQQEIAEGR 193 >UniRef50_A3PYR3 Cluster: Putative uncharacterized protein; n=1; Mycobacterium sp. JLS|Rep: Putative uncharacterized protein - Mycobacterium sp. 
(strain JLS) Length = 606 Score = 35.1 bits (77), Expect = 1.9 Identities = 21/37 (56%), Positives = 22/37 (59%), Gaps = 1/37 (2%) Frame = -2 Query: 667 LRTSECVARQRPG*TCDC-NEDTARSPDSPTAEGAAD 560 LR SE V Q D +EDT RSPD TAEGAAD Sbjct: 400 LRLSEQVLNQHARQNSDSVSEDTYRSPDPATAEGAAD 436 >UniRef50_Q17JM9 Cluster: Predicted protein; n=1; Aedes aegypti|Rep: Predicted protein - Aedes aegypti (Yellowfever mosquito) Length = 1131 Score = 35.1 bits (77), Expect = 1.9 Identities = 25/80 (31%), Positives = 35/80 (43%), Gaps = 2/80 (2%) Frame = +1 Query: 259 CGVCGRHFAS-DRIAKHQEICKKAHSKKRKPFDVLKHRL-AGTEAEPFINKLRKTTATPS 432 CG CG+ FA + + KHQ A KKR P + + + NKL K PS Sbjct: 509 CGECGKRFAEPNLVRKHQATVHSADKKKRAPVKITSSLVQLHRHVQMHTNKL-KCPKCPS 567 Query: 433 TTKVNKGKQLNSNWRQKHEE 492 + NK + L + KH + Sbjct: 568 --RFNKKRSLTEHVLTKHSK 585 >UniRef50_Q4AP45 Cluster: Radical SAM; n=3; Bacteria|Rep: Radical SAM - Chlorobium phaeobacteroides BS1 Length = 1005 Score = 34.3 bits (75), Expect = 3.3 Identities = 18/60 (30%), Positives = 28/60 (46%), Gaps = 1/60 (1%) Frame = +1 Query: 256 ACGVCGRHFASDRI-AKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPS 432 +CG CGR ++S + K +C HS +V+KH TE I + + + PS Sbjct: 800 SCGYCGRKYSSSSLCGKGHYVCDTCHSD--DAVEVIKHICLATEETDMIELMERIRSHPS 857 >UniRef50_A7ACD8 Cluster: Putative uncharacterized protein; n=1; Parabacteroides merdae ATCC 43184|Rep: Putative uncharacterized protein - Parabacteroides merdae ATCC 43184 Length = 393 Score = 34.3 bits (75), Expect = 3.3 Identities = 17/52 (32%), Positives = 28/52 (53%) Frame = -1 Query: 524 TCFAARIAWMNSSCFCRQLLFNCLPLFTLVVDGVAVVLRNLLIKGSASVPAK 369 T FA +I + + C C L + +PL+ ++D V+ N L+ +AS P K Sbjct: 254 TPFAKKIELLGNDCLCMSLRYEQVPLYLSIIDAGFVLRHNSLVNINAS-PTK 304 >UniRef50_Q4Q4S7 Cluster: Putative uncharacterized protein; n=3; Leishmania|Rep: Putative uncharacterized protein - Leishmania major Length = 558 Score = 34.3 bits (75), Expect = 3.3 Identities = 21/61 (34%), Positives = 28/61 (45%) Frame = +1 Query: 259 
CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438 C CGR F DR+ H + CK K KP R+A A P T+A+P++ Sbjct: 135 CPNCGRTFLPDRLQVHMKSCKP--GKTAKPVPTAASRVAPPVATP---SAATTSASPASA 189 Query: 439 K 441 K Sbjct: 190 K 190 >UniRef50_A2EVV5 Cluster: Putative uncharacterized protein; n=1; Trichomonas vaginalis G3|Rep: Putative uncharacterized protein - Trichomonas vaginalis G3 Length = 303 Score = 34.3 bits (75), Expect = 3.3 Identities = 13/31 (41%), Positives = 18/31 (58%) Frame = +1 Query: 583 ENPDYVQCPHCNRRFNQGAAERHIPKCANFQ 675 ++ D V C +C R+F AA RHIP C + Sbjct: 268 DSSDRVVCQYCGRKFLPDAARRHIPVCGRIR 298 >UniRef50_Q8MRK6 Cluster: GH27233p; n=1; Drosophila melanogaster|Rep: GH27233p - Drosophila melanogaster (Fruit fly) Length = 1006 Score = 33.9 bits (74), Expect = 4.3 Identities = 12/25 (48%), Positives = 16/25 (64%) Frame = +1 Query: 589 PDYVQCPHCNRRFNQGAAERHIPKC 663 P +CPHC+R FN A +RH+ C Sbjct: 62 PPCDRCPHCDRTFNPKAFDRHVEWC 86 >UniRef50_A2E8N3 Cluster: Putative uncharacterized protein; n=2; Trichomonas vaginalis G3|Rep: Putative uncharacterized protein - Trichomonas vaginalis G3 Length = 227 Score = 33.9 bits (74), Expect = 4.3 Identities = 14/32 (43%), Positives = 17/32 (53%) Frame = +1 Query: 580 SENPDYVQCPHCNRRFNQGAAERHIPKCANFQ 675 +E V C +C RR AA RHIP CA + Sbjct: 190 TEGDGKVTCQYCGRRLAPDAARRHIPVCAKIR 221 >UniRef50_Q2H9A8 Cluster: Putative uncharacterized protein; n=2; Chaetomium globosum|Rep: Putative uncharacterized protein - Chaetomium globosum (Soil fungus) Length = 633 Score = 33.9 bits (74), Expect = 4.3 Identities = 25/113 (22%), Positives = 42/113 (37%), Gaps = 3/113 (2%) Frame = +1 Query: 379 TEAEPFI-NKLRKTTATPSTTKVNKGKQLNSNWRQKHEEFIQAIRAAKQ--VQAHLNAGG 549 T + P + K++ A P K K+ N K + AI AA V L+ Sbjct: 21 TPSRPQVFGKIKLKKAPPKQAKPGNWKEANIIEEDKKKSKDNAITAASPSPVTIQLDDAS 80 Query: 550 KLSDLXXXXXSENPDYVQCPHCNRRFNQGAAERHIPKCANFQFNKPKPAAKRR 708 + + + D QC HC + + A H+ +C + K + + R Sbjct: 81 RENFQTGRPLEDQLDMFQCKHCKKVITRSAGGEHVARCLKIKKEKAQRKKEAR 133 >UniRef50_P39505 
Cluster: Uncharacterized 9.4 kDa protein in nrdB-nrdA intergenic region; n=5; Viruses|Rep: Uncharacterized 9.4 kDa protein in nrdB-nrdA intergenic region - Bacteriophage T4 Length = 83 Score = 33.9 bits (74), Expect = 4.3 Identities = 14/36 (38%), Positives = 19/36 (52%), Gaps = 1/36 (2%) Frame = +1 Query: 559 DLXXXXXSENPDYVQCPHCNRRFNQGAAER-HIPKC 663 DL + +Y CPHC ++ N+G A R H KC Sbjct: 42 DLISLRTKQGAEYPPCPHCGKKVNKGNALRWHYDKC 77 >UniRef50_P28698 Cluster: Myeloid zinc finger 1; n=19; Eutheria|Rep: Myeloid zinc finger 1 - Homo sapiens (Human) Length = 734 Score = 33.9 bits (74), Expect = 4.3 Identities = 20/77 (25%), Positives = 33/77 (42%), Gaps = 9/77 (11%) Frame = +1 Query: 196 TPKGRXXXXXXXXXXXXXGDACGVCGRHFAS-DRIAKHQEI--------CKKAHSKKRKP 348 +P+GR G C VCG+ F+ + +HQ+I C + + Sbjct: 337 SPRGRSRGRPSTGGGVVRGGRCDVCGKVFSQRSNLLRHQKIHTGERPFVCSECGRSFSRS 396 Query: 349 FDVLKHRLAGTEAEPFI 399 +L+H+L TE PF+ Sbjct: 397 SHLLRHQLTHTEERPFV 413 >UniRef50_Q247Z8 Cluster: Putative uncharacterized protein; n=1; Tetrahymena thermophila SB210|Rep: Putative uncharacterized protein - Tetrahymena thermophila SB210 Length = 619 Score = 33.5 bits (73), Expect = 5.7 Identities = 15/30 (50%), Positives = 20/30 (66%), Gaps = 2/30 (6%) Frame = +1 Query: 601 QCPH-CNRRFNQGAAERHIPKC-ANFQFNK 684 QCP C R+FN+ A +HIP+C +FQ K Sbjct: 499 QCPEGCGRKFNKNALAKHIPQCKKHFQPKK 528 >UniRef50_A5DMI5 Cluster: Putative uncharacterized protein; n=1; Pichia guilliermondii|Rep: Putative uncharacterized protein - Pichia guilliermondii (Yeast) (Candida guilliermondii) Length = 216 Score = 33.5 bits (73), Expect = 5.7 Identities = 13/42 (30%), Positives = 22/42 (52%) Frame = +1 Query: 601 QCPHCNRRFNQGAAERHIPKCANFQFNKPKPAAKRR*PGNPK 726 +CP CN++F Q ERH+ C + + + ++R P K Sbjct: 4 ECPICNKKFPQSLIERHVNSCLDSREAENTSKRRKRSPDTEK 45 >UniRef50_A5KAW1 Cluster: Merozoite surface protein 3 (MSP3), putative; n=1; Plasmodium vivax|Rep: Merozoite surface protein 3 (MSP3), putative - Plasmodium vivax Length = 382 Score = 33.1 bits (72), Expect 
= 7.6 Identities = 25/81 (30%), Positives = 36/81 (44%), Gaps = 1/81 (1%) Frame = +1 Query: 283 ASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT-KVNKGKQ 459 AS+ AK + K+A K + + K + A EA + + TP TT K + Q Sbjct: 71 ASEETAKFADEAKEAFKKAQSLAEEAKEKAA--EAAKAVGAMNGEKDTPPTTEKAQRASQ 128 Query: 460 LNSNWRQKHEEFIQAIRAAKQ 522 S QK E A+R AK+ Sbjct: 129 AASAAEQKSNEAQAAVRTAKE 149 >UniRef50_A4RL95 Cluster: Predicted protein; n=1; Magnaporthe grisea|Rep: Predicted protein - Magnaporthe grisea (Rice blast fungus) (Pyricularia grisea) Length = 593 Score = 33.1 bits (72), Expect = 7.6 Identities = 19/61 (31%), Positives = 30/61 (49%) Frame = -2 Query: 706 VV*RPASVC*TESLRTSECVARQRPG*TCDCNEDTARSPDSPTAEGAADRSAYLQHSNVP 527 V+ + AS C S RTS V+ PG C+ ++ S S ++ G++ S L +N Sbjct: 17 VIRKAASACTYPSRRTSRPVSATSPGQLSTCSSSSSGSSGSSSSSGSSRSSGSLSDTNTA 76 Query: 526 A 524 A Sbjct: 77 A 77 >UniRef50_Q4D375 Cluster: Dispersed gene family protein 1 (DGF-1), putative; n=383; Trypanosoma cruzi|Rep: Dispersed gene family protein 1 (DGF-1), putative - Trypanosoma cruzi Length = 3520 Score = 32.7 bits (71), Expect = 10.0 Identities = 21/59 (35%), Positives = 30/59 (50%), Gaps = 4/59 (6%) Frame = -1 Query: 521 CFAARIAWMNSSCFCRQLLF----NCLPLFTLVVDGVAVVLRNLLIKGSASVPAKRCLS 357 CFAA M+ SC CR +CLP++ VDG L L+ +A++ A R L+ Sbjct: 2798 CFAAATRAMSGSCRCRCAEGGYGRDCLPVYLPHVDGCNRTLEKPLLSHTATLTATRSLT 2856 >UniRef50_Q227R4 Cluster: Zinc finger, C2H2 type family protein; n=7; Tetrahymena thermophila SB210|Rep: Zinc finger, C2H2 type family protein - Tetrahymena thermophila SB210 Length = 363 Score = 32.7 bits (71), Expect = 10.0 Identities = 11/29 (37%), Positives = 17/29 (58%) Frame = +1 Query: 583 ENPDYVQCPHCNRRFNQGAAERHIPKCAN 669 +NPD+ QC C + F++ +HI C N Sbjct: 181 DNPDFFQCEICLKAFHKSNCAKHIKVCGN 209 Database: uniref50 Posted date: Oct 5, 2007 11:19 AM Number of letters in database: 575,637,011 Number of sequences in database: 1,657,284 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.279 0.0580 0.190 Matrix: BLOSUM62 Gap 
Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 663,840,328
Number of Sequences: 1657284
Number of extensions: 12206966
Number of successful extensions: 42778
Number of sequences better than 10.0: 79
Number of HSP's better than 10.0 without gapping: 40452
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 42671
length of database: 575,637,011
effective HSP length: 99
effective length of database: 411,565,895
effective search space used: 62146450145
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)