SilkBase

Last updated: 2022/11/18
BLASTX 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= bmte10a06
         (752 letters)

Database: uniref50 
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_Q5TVV1 Cluster: ENSANGP00000026760; n=2; Culicidae|Rep:...   190   2e-47
UniRef50_UPI00015B5B53 Cluster: PREDICTED: similar to ENSANGP000...   185   9e-46
UniRef50_Q9VNI7 Cluster: CG10999-PA, isoform A; n=3; Drosophila ...   184   2e-45
UniRef50_UPI0000D578E2 Cluster: PREDICTED: similar to CG10999-PA...   175   7e-43
UniRef50_A7SCJ1 Cluster: Predicted protein; n=2; Eumetazoa|Rep: ...   147   2e-34
UniRef50_UPI00006A0811 Cluster: Uncharacterized protein C14orf14...   146   7e-34
UniRef50_UPI0000611295 Cluster: Uncharacterized protein C14orf14...   134   3e-30
UniRef50_UPI0000503AD3 Cluster: RIKEN cDNA 2810002I04 gene; n=1;...   119   9e-26
UniRef50_Q5TFG8 Cluster: UPF0418 protein C6orf94; n=14; Theria|R...   119   9e-26
UniRef50_UPI0000E494DB Cluster: PREDICTED: similar to Chromosome...   115   1e-24
UniRef50_Q4QE35 Cluster: Putative uncharacterized protein; n=3; ...   114   2e-24
UniRef50_UPI0000ECC8F5 Cluster: UPF0418 protein C6orf94.; n=3; G...   113   3e-24
UniRef50_Q96GY0 Cluster: UPF0418 protein C8orf70; n=23; Euteleos...   112   8e-24
UniRef50_A7T301 Cluster: Predicted protein; n=2; Eumetazoa|Rep: ...   112   1e-23
UniRef50_A4H9B2 Cluster: Putative uncharacterized protein; n=1; ...   112   1e-23
UniRef50_Q5PPV5 Cluster: UPF0418 protein C8orf70 homolog; n=7; E...   111   2e-23
UniRef50_Q4CSD1 Cluster: Putative uncharacterized protein; n=2; ...   107   2e-22
UniRef50_A2DFG0 Cluster: Putative uncharacterized protein; n=1; ...   104   2e-21
UniRef50_A2DCT4 Cluster: Putative uncharacterized protein; n=1; ...    99   1e-19
UniRef50_A0BVD8 Cluster: Chromosome undetermined scaffold_13, wh...    90   5e-17
UniRef50_A0BQE5 Cluster: Chromosome undetermined scaffold_120, w...    90   5e-17
UniRef50_A2G025 Cluster: Zinc finger, C2H2 type family protein; ...    89   8e-17
UniRef50_Q22122 Cluster: UPF0418 protein T03G11.3; n=2; Caenorha...    89   1e-16
UniRef50_Q4DZW2 Cluster: Putative uncharacterized protein; n=2; ...    88   3e-16
UniRef50_Q22W47 Cluster: Zinc finger, C2H2 type family protein; ...    85   1e-15
UniRef50_A0D772 Cluster: Chromosome undetermined scaffold_4, who...    81   4e-14
UniRef50_A2DDQ6 Cluster: Putative uncharacterized protein; n=1; ...    80   5e-14
UniRef50_Q57XS3 Cluster: Putative uncharacterized protein; n=1; ...    80   7e-14
UniRef50_Q4T2E6 Cluster: Chromosome undetermined SCAF10284, whol...    79   1e-13
UniRef50_UPI0000E48303 Cluster: PREDICTED: hypothetical protein,...    78   2e-13
UniRef50_UPI0000587617 Cluster: PREDICTED: hypothetical protein;...    77   4e-13
UniRef50_Q4QJB7 Cluster: Putative uncharacterized protein; n=3; ...    77   6e-13
UniRef50_A0BPA2 Cluster: Chromosome undetermined scaffold_12, wh...    77   6e-13
UniRef50_UPI0000DB6EB3 Cluster: PREDICTED: similar to CG30460-PC...    75   1e-12
UniRef50_Q23G89 Cluster: Zinc finger, C2H2 type family protein; ...    74   3e-12
UniRef50_A0CCZ1 Cluster: Chromosome undetermined scaffold_169, w...    73   1e-11
UniRef50_A0E4A7 Cluster: Chromosome undetermined scaffold_78, wh...    72   2e-11
UniRef50_UPI0000D56C05 Cluster: PREDICTED: similar to CG10999-PA...    71   3e-11
UniRef50_Q22FZ1 Cluster: Putative uncharacterized protein; n=1; ...    68   2e-10
UniRef50_Q4Q229 Cluster: Putative uncharacterized protein; n=3; ...    66   9e-10
UniRef50_A0E7F3 Cluster: Chromosome undetermined scaffold_81, wh...    62   1e-08
UniRef50_A2F2X4 Cluster: Putative uncharacterized protein; n=2; ...    61   3e-08
UniRef50_Q4CZP9 Cluster: Putative uncharacterized protein; n=2; ...    60   6e-08
UniRef50_A0E456 Cluster: Chromosome undetermined scaffold_77, wh...    57   5e-07
UniRef50_Q38BB2 Cluster: Putative uncharacterized protein; n=3; ...    56   7e-07
UniRef50_Q22MG0 Cluster: Putative uncharacterized protein; n=1; ...    54   3e-06
UniRef50_Q24HM2 Cluster: Zinc finger, C2H2 type family protein; ...    53   7e-06
UniRef50_Q4T3R5 Cluster: Chromosome undetermined SCAF9936, whole...    51   3e-05
UniRef50_UPI0000F20570 Cluster: PREDICTED: hypothetical protein;...    49   1e-04
UniRef50_A0D6H1 Cluster: Chromosome undetermined scaffold_4, who...    48   2e-04
UniRef50_UPI00006CBD30 Cluster: Zinc finger, C2H2 type family pr...    46   8e-04
UniRef50_UPI0000E470FE Cluster: PREDICTED: hypothetical protein;...    44   0.003
UniRef50_Q7QT08 Cluster: GLP_675_33860_35197; n=1; Giardia lambl...    44   0.003
UniRef50_A1ZAP8 Cluster: CG30460-PC, isoform C; n=5; Drosophila ...    44   0.004
UniRef50_Q381C5 Cluster: Putative uncharacterized protein; n=1; ...    44   0.005
UniRef50_Q4FY30 Cluster: Putative uncharacterized protein; n=3; ...    39   0.15 
UniRef50_Q387B6 Cluster: Putative uncharacterized protein; n=1; ...    38   0.35 
UniRef50_Q7QXY3 Cluster: GLP_479_39609_38410; n=1; Giardia lambl...    37   0.46 
UniRef50_Q7QRE7 Cluster: GLP_503_3295_2699; n=1; Giardia lamblia...    37   0.46 
UniRef50_Q17BP6 Cluster: Putative uncharacterized protein; n=1; ...    36   0.81 
UniRef50_UPI00006CBAC8 Cluster: hypothetical protein TTHERM_0050...    36   1.4  
UniRef50_A0BJ34 Cluster: Chromosome undetermined scaffold_11, wh...    36   1.4  
UniRef50_A3PYR3 Cluster: Putative uncharacterized protein; n=1; ...    35   1.9  
UniRef50_Q17JM9 Cluster: Predicted protein; n=1; Aedes aegypti|R...    35   1.9  
UniRef50_Q4AP45 Cluster: Radical SAM; n=3; Bacteria|Rep: Radical...    34   3.3  
UniRef50_A7ACD8 Cluster: Putative uncharacterized protein; n=1; ...    34   3.3  
UniRef50_Q4Q4S7 Cluster: Putative uncharacterized protein; n=3; ...    34   3.3  
UniRef50_A2EVV5 Cluster: Putative uncharacterized protein; n=1; ...    34   3.3  
UniRef50_Q8MRK6 Cluster: GH27233p; n=1; Drosophila melanogaster|...    34   4.3  
UniRef50_A2E8N3 Cluster: Putative uncharacterized protein; n=2; ...    34   4.3  
UniRef50_Q2H9A8 Cluster: Putative uncharacterized protein; n=2; ...    34   4.3  
UniRef50_P39505 Cluster: Uncharacterized 9.4 kDa protein in nrdB...    34   4.3  
UniRef50_P28698 Cluster: Myeloid zinc finger 1; n=19; Eutheria|R...    34   4.3  
UniRef50_Q247Z8 Cluster: Putative uncharacterized protein; n=1; ...    33   5.7  
UniRef50_A5DMI5 Cluster: Putative uncharacterized protein; n=1; ...    33   5.7  
UniRef50_A5KAW1 Cluster: Merozoite surface protein 3 (MSP3), put...    33   7.6  
UniRef50_A4RL95 Cluster: Predicted protein; n=1; Magnaporthe gri...    33   7.6  
UniRef50_Q4D375 Cluster: Dispersed gene family protein 1 (DGF-1)...    33   10.0 
UniRef50_Q227R4 Cluster: Zinc finger, C2H2 type family protein; ...    33   10.0 

>UniRef50_Q5TVV1 Cluster: ENSANGP00000026760; n=2; Culicidae|Rep:
           ENSANGP00000026760 - Anopheles gambiae str. PEST
          Length = 384

 Score =  190 bits (464), Expect = 2e-47
 Identities = 87/151 (57%), Positives = 109/151 (72%), Gaps = 3/151 (1%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINK-LRKTTATPST 435
           C +C R+FA++RI KH++IC+K  +KKRK FD+ KHR+ GT+AE ++ K  +K ++ PST
Sbjct: 226 CDICSRNFATERIDKHRQICQKTKTKKRKVFDITKHRVQGTDAESYVLKGKKKQSSQPST 285

Query: 436 --TKVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCP 609
                       SNWR+KHEEFI  IRAAK+++AHL  GGKLSDL     SENPDY+QCP
Sbjct: 286 GAAAAAAAGSKQSNWRKKHEEFIATIRAAKEMKAHLARGGKLSDLPPPPPSENPDYIQCP 345

Query: 610 HCNRRFNQGAAERHIPKCANFQFNKPKPAAK 702
           HC+RRFNQ AAERHIPKCA    NKPKP  K
Sbjct: 346 HCSRRFNQTAAERHIPKCATMLHNKPKPKPK 376


>UniRef50_UPI00015B5B53 Cluster: PREDICTED: similar to
            ENSANGP00000026760; n=1; Nasonia vitripennis|Rep:
            PREDICTED: similar to ENSANGP00000026760 - Nasonia
            vitripennis
          Length = 1097

 Score =  185 bits (451), Expect = 9e-46
 Identities = 96/202 (47%), Positives = 117/202 (57%), Gaps = 2/202 (0%)
 Frame = +1

Query: 109  TNKTRGAM-QRPANTTPRKPPVKANSAGSGTPKGRXXXXXXXXXXXXXGDACGVCGRHFA 285
            TN++ G+   RP       PP  A      TP  +                C +C R FA
Sbjct: 910  TNRSHGSTASRPTGKPKAAPPTPA---ARSTPSSKGSAASNDDSL----STCKICNRRFA 962

Query: 286  SDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINK-LRKTTATPSTTKVNKGKQL 462
            +DRI  H++IC K   KKRK FD L HR+ GTE E F+ K ++K    P           
Sbjct: 963  TDRIGLHEQICAKTSQKKRKQFDALTHRVKGTELESFVQKPVKKQVQYPQP--------- 1013

Query: 463  NSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCNRRFNQGAA 642
             SNWR+KHE+FI AIR+AKQ+QAHL +GGKLSDL     S+  DY+QCPHC+R+FNQGAA
Sbjct: 1014 -SNWRRKHEDFINAIRSAKQMQAHLASGGKLSDLPPPPPSDTSDYIQCPHCSRKFNQGAA 1072

Query: 643  ERHIPKCANFQFNKPKPAAKRR 708
            ERHIPKCAN Q NKP P A  R
Sbjct: 1073 ERHIPKCANMQHNKPNPRAPPR 1094


>UniRef50_Q9VNI7 Cluster: CG10999-PA, isoform A; n=3; Drosophila
           melanogaster|Rep: CG10999-PA, isoform A - Drosophila
           melanogaster (Fruit fly)
          Length = 383

 Score =  184 bits (449), Expect = 2e-45
 Identities = 90/157 (57%), Positives = 109/157 (69%), Gaps = 7/157 (4%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLR--KTTATPS 432
           C  CGRHF +DR+AKH+E+C++  + KRK FD  K R+ GTEA  F  K +  +  +T S
Sbjct: 227 CRYCGRHFNTDRLAKHEEVCQRMLTTKRKIFDASKQRIEGTEAAAFNMKSKGNRNRSTYS 286

Query: 433 TTKVNKGKQLN---SNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQ 603
           +    KG       +NWR+KHE+FIQ+IRAAKQV+AHL  GGKLSDL     SENPDYVQ
Sbjct: 287 SAAQQKGLTTGVKKNNWRKKHEDFIQSIRAAKQVKAHLARGGKLSDLPPPPPSENPDYVQ 346

Query: 604 CPHCNRRFNQGAAERHIPKCANFQFNKPK--PAAKRR 708
           CPHC RRFN+ AAERHIPKC N   NKP+  P AKRR
Sbjct: 347 CPHCGRRFNEQAAERHIPKCVNMVHNKPRNGPPAKRR 383


>UniRef50_UPI0000D578E2 Cluster: PREDICTED: similar to CG10999-PA,
           isoform A; n=1; Tribolium castaneum|Rep: PREDICTED:
           similar to CG10999-PA, isoform A - Tribolium castaneum
          Length = 480

 Score =  175 bits (427), Expect = 7e-43
 Identities = 84/201 (41%), Positives = 113/201 (56%)
 Frame = +1

Query: 103 SETNKTRGAMQRPANTTPRKPPVKANSAGSGTPKGRXXXXXXXXXXXXXGDACGVCGRHF 282
           S ++  + A   P      +PP  A +     P  +              + C  C R F
Sbjct: 278 SLSHTMQSAKSNPVKKGTPQPPQSARAPCKDRPSAKQSAKSPVARDDL--NECRFCNRRF 335

Query: 283 ASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTTKVNKGKQL 462
           A+DR+  H+ IC K   KKRK +D  KHR+ GTE E ++ + +  ++  S  +  +    
Sbjct: 336 AADRLQVHESICGKTAKKKRKIYDATKHRVEGTELEQYVRRGKNLSSKASNRQAPR---- 391

Query: 463 NSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCNRRFNQGAA 642
             +WR+ HEEFI AIRAAK  QAH+  GGKL+DL     S NPDYVQCPHC R+FN+ AA
Sbjct: 392 -KDWRRTHEEFINAIRAAKMAQAHVAKGGKLADLPPPPPSSNPDYVQCPHCGRKFNEAAA 450

Query: 643 ERHIPKCANFQFNKPKPAAKR 705
           ERHIPKCA ++FNKPKP A +
Sbjct: 451 ERHIPKCATYEFNKPKPGANK 471


>UniRef50_A7SCJ1 Cluster: Predicted protein; n=2; Eumetazoa|Rep:
           Predicted protein - Nematostella vectensis
          Length = 139

 Score =  147 bits (357), Expect = 2e-34
 Identities = 74/143 (51%), Positives = 91/143 (63%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438
           C  CGR+FA DRIAKH+ IC+K  +K+RK FD  K R +GTEA  +     +    P+  
Sbjct: 6   CPNCGRNFAMDRIAKHETICRKTGTKQRKVFDSTKARTSGTEAAGYNRPGARKKPEPAVP 65

Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618
           K         NWR KH+EFI+AIR AK+V  H+ +GGK+SDL     SENPDYV C +C 
Sbjct: 66  K--------GNWRAKHQEFIRAIRDAKKVSQHIASGGKVSDLPPPQYSENPDYVLCRYCQ 117

Query: 619 RRFNQGAAERHIPKCANFQFNKP 687
           RRFN   AERHIPKCAN   N+P
Sbjct: 118 RRFNPTVAERHIPKCAN-TTNRP 139


>UniRef50_UPI00006A0811 Cluster: Uncharacterized protein C14orf140.;
           n=1; Xenopus tropicalis|Rep: Uncharacterized protein
           C14orf140. - Xenopus tropicalis
          Length = 360

 Score =  146 bits (353), Expect = 7e-34
 Identities = 74/149 (49%), Positives = 90/149 (60%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438
           C +CGR F + R+ KH ++C+K     RK FD  K R  GTE E ++    KT   P+  
Sbjct: 220 CNLCGRQFLAHRLEKHTQVCQKMQKSNRKVFDSSKARAKGTELEQYLQTKGKTR--PNVP 277

Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618
           KV    Q N+ WRQKHE F Q IR A+ VQ  +  GGKLSDL      ENPDYV CPHCN
Sbjct: 278 KV----QSNA-WRQKHESFQQTIRHARTVQQVIAKGGKLSDLPPPPPEENPDYVTCPHCN 332

Query: 619 RRFNQGAAERHIPKCANFQFNKPKPAAKR 705
           RRF    AERHIPKC N + +KP+P  +R
Sbjct: 333 RRFAPRVAERHIPKCENIK-SKPRPLRRR 360


>UniRef50_UPI0000611295 Cluster: Uncharacterized protein C14orf140.;
           n=3; Gallus gallus|Rep: Uncharacterized protein
           C14orf140. - Gallus gallus
          Length = 486

 Score =  134 bits (323), Expect = 3e-30
 Identities = 70/150 (46%), Positives = 86/150 (57%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438
           C  CGR F   R+ KH  IC K+   KRK FD  K R  GT+ E F  +  K++  P   
Sbjct: 344 CSFCGRKFLCARLKKHMSICSKSQGSKRKTFDSSKARARGTDLEEF--QQWKSSERPQ-- 399

Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618
             NK  + N NWRQ HE FIQ +R A+QVQ  L+ GGK+SDL      ENPDY  CP+C 
Sbjct: 400 --NKPPRRN-NWRQNHEAFIQTLRHARQVQQVLSKGGKVSDLPPLPPIENPDYTACPYCR 456

Query: 619 RRFNQGAAERHIPKCANFQFNKPKPAAKRR 708
           RRF    AE HIPKC N + N+P    +R+
Sbjct: 457 RRFAPQVAETHIPKCKNIK-NRPSLPPQRK 485


>UniRef50_UPI0000503AD3 Cluster: RIKEN cDNA 2810002I04 gene; n=1;
           Rattus norvegicus|Rep: RIKEN cDNA 2810002I04 gene -
           Rattus norvegicus
          Length = 449

 Score =  119 bits (286), Expect = 9e-26
 Identities = 60/131 (45%), Positives = 75/131 (57%)
 Frame = +1

Query: 301 KHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTTKVNKGKQLNSNWRQ 480
           +H  +C K    KRK FD  + R  GTE E ++N        P+T K     +  S WRQ
Sbjct: 320 RHSTVCGKMQGSKRKVFDSSRARAKGTELEQYLN-----WRGPATDKAEPPPR-KSTWRQ 373

Query: 481 KHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCNRRFNQGAAERHIPK 660
           KHE FI+ +R A+QVQ  +  GG  SDL     +ENPDYVQCPHC+R F    AERHIPK
Sbjct: 374 KHESFIRTLRHARQVQQVIARGGNPSDLPSILPAENPDYVQCPHCSRHFAPKVAERHIPK 433

Query: 661 CANFQFNKPKP 693
           C   + N+P P
Sbjct: 434 CKTIK-NRPPP 443


>UniRef50_Q5TFG8 Cluster: UPF0418 protein C6orf94; n=14; Theria|Rep:
           UPF0418 protein C6orf94 - Homo sapiens (Human)
          Length = 222

 Score =  119 bits (286), Expect = 9e-26
 Identities = 69/152 (45%), Positives = 88/152 (57%), Gaps = 4/152 (2%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438
           C VCGR FA+D + +H  ICKK  ++KRKPF  LK RL GT+  P + K  ++ + P   
Sbjct: 18  CEVCGRRFAADVLERHGPICKKLFNRKRKPFSSLKQRLQGTDI-PTVKKTPQSKSPP--- 73

Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618
            V K     SNWRQ+HE+FI AIR+AKQ    +  G  L        S NPDY+Q P+C 
Sbjct: 74  -VRK-----SNWRQQHEDFINAIRSAKQCMLAIKEGRPLP--PPPPPSLNPDYIQRPYCM 125

Query: 619 RRFNQGAAERHIPKCANFQ----FNKPKPAAK 702
           RRFN+ AAERH   C +      FN  + AAK
Sbjct: 126 RRFNESAAERHTNFCKDQSSRRVFNPAQTAAK 157


>UniRef50_UPI0000E494DB Cluster: PREDICTED: similar to Chromosome 8
           open reading frame 70; n=1; Strongylocentrotus
           purpuratus|Rep: PREDICTED: similar to Chromosome 8 open
           reading frame 70 - Strongylocentrotus purpuratus
          Length = 323

 Score =  115 bits (277), Expect = 1e-24
 Identities = 60/149 (40%), Positives = 81/149 (54%), Gaps = 2/149 (1%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438
           CG CGR F  D +A+H ++C+K   KKRK FD  K R  GT+              P T 
Sbjct: 18  CGTCGRTFLPDTLARHAKVCRKTAKKKRKTFDSSKQRAEGTD----------IGTVPKTN 67

Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618
           + +      +NWRQKHE+FI+A+++AK V   +  G  L          NPDYVQCP C 
Sbjct: 68  ERDLPPSKKNNWRQKHEDFIEAMQSAKGVSKAIKTGAPLPP-PPAQKRINPDYVQCPSCE 126

Query: 619 RRFNQGAAERHIPKC--ANFQFNKPKPAA 699
           R F++ A+ERHIP C   N + +K  P+A
Sbjct: 127 RHFSESASERHIPWCKEKNKRIDKRTPSA 155


>UniRef50_Q4QE35 Cluster: Putative uncharacterized protein; n=3;
           Trypanosomatidae|Rep: Putative uncharacterized protein -
           Leishmania major
          Length = 723

 Score =  114 bits (275), Expect = 2e-24
 Identities = 62/150 (41%), Positives = 79/150 (52%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438
           C  CGR FAS+ +AKH+ IC     KKR+ F+  K RL               TA    +
Sbjct: 588 CSHCGRQFASESLAKHERIC--CSQKKRRVFNATKQRLP-----------EGATAAAKPS 634

Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618
             ++      +W+ + E F +A+R A+QV   L AGG   DL     S NPDYV CPHC 
Sbjct: 635 AGSQPAAPKRDWKAESESFRRALREARQVDQVLKAGGTAKDLPPPTYSTNPDYVPCPHCQ 694

Query: 619 RRFNQGAAERHIPKCANFQFNKPKPAAKRR 708
           RRF    A RHIP+CAN   N+PKP  +RR
Sbjct: 695 RRFAPDVAARHIPRCAN-TVNRPKPPPRRR 723


>UniRef50_UPI0000ECC8F5 Cluster: UPF0418 protein C6orf94.; n=3;
           Gallus gallus|Rep: UPF0418 protein C6orf94. - Gallus
           gallus
          Length = 178

 Score =  113 bits (273), Expect = 3e-24
 Identities = 61/142 (42%), Positives = 80/142 (56%), Gaps = 7/142 (4%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINK--LRKTTATPS 432
           C +CGR FA D + +H+ ICKK  +KKRKPF+  K RL GT+      +  L+       
Sbjct: 22  CRICGRQFAPDVLMRHEPICKKVFNKKRKPFNSFKQRLQGTDIGTVKRQPPLKVRLMLEH 81

Query: 433 TTKVNKGKQLN-----SNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDY 597
           T  + +  +LN     SNWRQ H +FI AI++AKQV   +  G  L        S NPDY
Sbjct: 82  TLSLLEAFRLNQPVKKSNWRQHHADFINAIQSAKQVTKAMQEGRPLP--PPPPPSINPDY 139

Query: 598 VQCPHCNRRFNQGAAERHIPKC 663
           +QCP C RRFN+ AA +HI  C
Sbjct: 140 IQCPFCLRRFNEAAAAKHIKFC 161


>UniRef50_Q96GY0 Cluster: UPF0418 protein C8orf70; n=23;
           Euteleostomi|Rep: UPF0418 protein C8orf70 - Homo sapiens
           (Human)
          Length = 325

 Score =  112 bits (270), Expect = 8e-24
 Identities = 60/135 (44%), Positives = 75/135 (55%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438
           C +CGR F    + KH  IC+K  +KKRK FD  + R  GT+  P +  L+     P   
Sbjct: 19  CKICGRTFFPVALKKHGPICQKTATKKRKTFDSSRQRAEGTDI-PTVKPLKPRPEPPKKP 77

Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618
                    SNWR+KHEEFI  IRAAK +   L  GGKL        S +PDY+QCP+C 
Sbjct: 78  ---------SNWRRKHEEFIATIRAAKGLDQALKEGGKLPP--PPPPSYDPDYIQCPYCQ 126

Query: 619 RRFNQGAAERHIPKC 663
           RRFN+ AA+RHI  C
Sbjct: 127 RRFNENAADRHINFC 141


>UniRef50_A7T301 Cluster: Predicted protein; n=2; Eumetazoa|Rep:
           Predicted protein - Nematostella vectensis
          Length = 133

 Score =  112 bits (269), Expect = 1e-23
 Identities = 59/150 (39%), Positives = 81/150 (54%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438
           C +CGR+F +DR+ KHQ++C K  ++KRK FD+ K R AGTE E ++             
Sbjct: 4   CSICGRNFQTDRLEKHQKVCAKNSTRKRKAFDMTKQRTAGTEHEKYVKA--------GAH 55

Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618
           K    K+++  WR +HE FI+AIR AK            S        ENP YVQCPHC 
Sbjct: 56  KQEPEKKVD--WRAQHESFIKAIRYAKG-----------SSDEPPPVMENPHYVQCPHCE 102

Query: 619 RRFNQGAAERHIPKCANFQFNKPKPAAKRR 708
           R+FN   AERHIP+C + +     P  + +
Sbjct: 103 RKFNPETAERHIPRCKDIKARPAPPKGRNK 132


>UniRef50_A4H9B2 Cluster: Putative uncharacterized protein; n=1;
           Leishmania braziliensis|Rep: Putative uncharacterized
           protein - Leishmania braziliensis
          Length = 721

 Score =  112 bits (269), Expect = 1e-23
 Identities = 62/150 (41%), Positives = 78/150 (52%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438
           C  CGR F S+ + KH+ IC  A  KKR+ F+  K RLA              TA    +
Sbjct: 586 CRHCGRRFVSESLGKHEHIC--ASLKKRRVFNATKQRLA-----------EGATAAAKVS 632

Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618
              + K    +W+ +   F +AIR A+ V   L AGG + DL     S NPDYV CPHC 
Sbjct: 633 PAPQPKAPTRDWKAESVAFRRAIREARHVDQVLKAGGTIKDLPPPTYSINPDYVPCPHCQ 692

Query: 619 RRFNQGAAERHIPKCANFQFNKPKPAAKRR 708
           RRF    A RHIP+CAN   N+PKP  +RR
Sbjct: 693 RRFAPDVAARHIPRCAN-TVNRPKPPPRRR 721


>UniRef50_Q5PPV5 Cluster: UPF0418 protein C8orf70 homolog; n=7;
           Eumetazoa|Rep: UPF0418 protein C8orf70 homolog - Xenopus
           laevis (African clawed frog)
          Length = 323

 Score =  111 bits (267), Expect = 2e-23
 Identities = 61/135 (45%), Positives = 77/135 (57%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438
           C +CGR F    + KH  IC+K   KKRK F+  + R  GT+    IN ++     P   
Sbjct: 11  CKICGRTFFPATLKKHVPICQKTSVKKRKTFESSRQRAEGTD----INTVKPVKPRPEPP 66

Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618
           K    KQ  SNW++KHEEFI  IR+AK +   L  GG+L        S +PDYVQCP+C 
Sbjct: 67  K----KQ--SNWKRKHEEFIATIRSAKGISQILKEGGELPP--PPPPSYDPDYVQCPYCQ 118

Query: 619 RRFNQGAAERHIPKC 663
           RRFNQ AA+RHI  C
Sbjct: 119 RRFNQNAADRHINFC 133


>UniRef50_Q4CSD1 Cluster: Putative uncharacterized protein; n=2;
            Trypanosoma cruzi|Rep: Putative uncharacterized protein -
            Trypanosoma cruzi
          Length = 757

 Score =  107 bits (258), Expect = 2e-22
 Identities = 56/150 (37%), Positives = 80/150 (53%)
 Frame = +1

Query: 259  CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438
            C  CGR FA + +++H+ +C     KKR+ F++   RL+GT A+          +  +  
Sbjct: 616  CSNCGRTFALNVLSRHERVCTT--QKKRRVFNMRAMRLSGTGADQVAKSGSSGASAAAVA 673

Query: 439  KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618
               K      +WR + E F +++R A+QV   L +GG + DL     SEN  Y  CPHC 
Sbjct: 674  PAPK-----RDWRAESEAFRRSMRDARQVDKVLKSGGNVKDLPPPTYSENSHYTPCPHCG 728

Query: 619  RRFNQGAAERHIPKCANFQFNKPKPAAKRR 708
            R+F    AERHIP+CA    NKPKP  +RR
Sbjct: 729  RKFAPDVAERHIPRCAT-TINKPKPPPRRR 757


>UniRef50_A2DFG0 Cluster: Putative uncharacterized protein; n=1;
           Trichomonas vaginalis G3|Rep: Putative uncharacterized
           protein - Trichomonas vaginalis G3
          Length = 368

 Score =  104 bits (250), Expect = 2e-21
 Identities = 68/184 (36%), Positives = 93/184 (50%), Gaps = 10/184 (5%)
 Frame = +1

Query: 142 ANTTPRKP---PVKANSAGSGTPKGRXXXXXXXXXXXXXGD--ACGVCGRHFASDRIAKH 306
           A+T P+ P   P K  SAG    K                D  +C  CGR FASDRI KH
Sbjct: 176 ADTPPKSPKPAPAKKPSAGGALNKTLRRNAPPPAEADANDDRVSCSYCGRKFASDRIEKH 235

Query: 307 QEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTTKVNKGKQLNSNWRQKH 486
           +EIC++   KK K FD  K RL G EA  F  K+ K    P    +N   +    ++ +H
Sbjct: 236 EEICRRQSMKKTKVFDSSKQRLEG-EAASFA-KVSKNKPKPKKETINGVPK----YKLQH 289

Query: 487 EEFIQAIRAAKQVQAHLNA---GGKLSDLXXXXXSE--NPDYVQCPHCNRRFNQGAAERH 651
           +E ++A+RAA+++QA+ +A   G  +         E  + D VQCPHC R+F +  A RH
Sbjct: 290 QELVKAMRAARKLQAYQDAVERGEAVGPPPEMPKIELVDDDRVQCPHCGRKFGEEQARRH 349

Query: 652 IPKC 663
           IP C
Sbjct: 350 IPNC 353


>UniRef50_A2DCT4 Cluster: Putative uncharacterized protein; n=1;
           Trichomonas vaginalis G3|Rep: Putative uncharacterized
           protein - Trichomonas vaginalis G3
          Length = 504

 Score = 98.7 bits (235), Expect = 1e-19
 Identities = 56/186 (30%), Positives = 91/186 (48%), Gaps = 4/186 (2%)
 Frame = +1

Query: 160 KPPVKANSAGSGTPKGRXXXXXXXXXXXXXGDACGVCGRHFASDRIAKHQEICKKAHSKK 339
           +PP ++ S     P  +                C +CGR FA+DRI KH+EIC+K+ +KK
Sbjct: 324 RPPSRSTSRSQPAPVPQENPPSPYADDNVELVECSICGRRFAADRIQKHEEICRKSATKK 383

Query: 340 RKPFDVLKHRLAGTEAEPFINKLRKTTATPSTTKVNKGKQLNSNWRQKHEEFIQAIRAAK 519
           +K FD+   RLA T AE +I +++      +  +  K K     ++++HE+ ++A+R A+
Sbjct: 384 KKVFDITSKRLADTGAEEYIGQIK-----AAKDEKPKPKNEVPKYKKEHEKLVEAMRNAR 438

Query: 520 QVQAHLN--AGGK--LSDLXXXXXSENPDYVQCPHCNRRFNQGAAERHIPKCANFQFNKP 687
           ++Q +    A GK            ++ D V CP C R+F + A  RH P C      K 
Sbjct: 439 KIQQYEKDVAAGKNVKPPELAPIQMDDDDRVTCPICGRKFGKEALARHTPGCEKMNARKL 498

Query: 688 KPAAKR 705
               +R
Sbjct: 499 NTRGRR 504


>UniRef50_A0BVD8 Cluster: Chromosome undetermined scaffold_13, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_13,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 416

 Score = 90.2 bits (214), Expect = 5e-17
 Identities = 59/149 (39%), Positives = 69/149 (46%), Gaps = 2/149 (1%)
 Frame = +1

Query: 268 CGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTTKVN 447
           CGR FA   + KH++IC K   K+RK FD  KHR+   E    I    K        K  
Sbjct: 281 CGRSFAKLALQKHEKICVKVFQKQRKQFDAQKHRIISNEQISHIKNQDKIEQ-----KYE 335

Query: 448 KGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSE--NPDYVQCPHCNR 621
           K      NW+ + E F  AI AAK        GGKL+        E    + VQC +C R
Sbjct: 336 KALAKKQNWKNQSEAFRAAIIAAK--------GGKLTKDQKNAMQEASKSNLVQCNYCGR 387

Query: 622 RFNQGAAERHIPKCANFQFNKPKPAAKRR 708
            FNQ AAERHIP CA      PK   KRR
Sbjct: 388 SFNQQAAERHIPFCAQKSKIPPKQPQKRR 416


>UniRef50_A0BQE5 Cluster: Chromosome undetermined scaffold_120,
           whole genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_120,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 352

 Score = 90.2 bits (214), Expect = 5e-17
 Identities = 49/152 (32%), Positives = 78/152 (51%)
 Frame = +1

Query: 253 DACGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPS 432
           + C +C R F ++RI KH+++C+KA  K+ +   ++K +           K        +
Sbjct: 37  EQCEICSRKFHTERIGKHRQVCEKAQQKQMQREKLIKRKQQ--------QKAEHQQKLDA 88

Query: 433 TTKVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPH 612
             K  K K +N NWR++H +F + I   K+ +   N G +   +     +EN  YVQC +
Sbjct: 89  KEKQVKNKTVN-NWREQHRQFQEMIHCNKKEKEVQNEGEEEIAVKTLDLAENSLYVQCEY 147

Query: 613 CNRRFNQGAAERHIPKCANFQFNKPKPAAKRR 708
           C R F++  AERHIPKC   +  KPKP  K +
Sbjct: 148 CKRSFDRYVAERHIPKCKEIK-AKPKPLKKNQ 178



 Score = 34.7 bits (76), Expect = 2.5
 Identities = 12/21 (57%), Positives = 16/21 (76%)
 Frame = +1

Query: 601 QCPHCNRRFNQGAAERHIPKC 663
           +CP+C R+FN  AA RH+P C
Sbjct: 250 ECPYCLRKFNPKAALRHVPIC 270


>UniRef50_A2G025 Cluster: Zinc finger, C2H2 type family protein;
           n=1; Trichomonas vaginalis G3|Rep: Zinc finger, C2H2
           type family protein - Trichomonas vaginalis G3
          Length = 340

 Score = 89.4 bits (212), Expect = 8e-17
 Identities = 53/153 (34%), Positives = 82/153 (53%), Gaps = 3/153 (1%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438
           C  CGR FA DR+  H+ IC K  ++KR+PF+   HR++GTE   ++ +  +  +  ++ 
Sbjct: 195 CHYCGRKFAPDRLPVHERICAK--TRKRRPFNASMHRVSGTEMR-YVPRSSRAESKSNSR 251

Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSEN-PD-YVQCPH 612
           K   GK     ++ +HE  + A+RAA+ + A+    GK+  +      ++ PD  V+CP 
Sbjct: 252 KYINGK---PKYKIEHENLVAALRAARGMAAY--ESGKIKAMPKMPKMQDVPDGRVKCPV 306

Query: 613 CNRRFNQGAAERHIPKC-ANFQFNKPKPAAKRR 708
           C R+F    AERHIP C  N     P    KRR
Sbjct: 307 CGRKFGPEQAERHIPFCKRNAGIRPPARPVKRR 339


>UniRef50_Q22122 Cluster: UPF0418 protein T03G11.3; n=2;
           Caenorhabditis|Rep: UPF0418 protein T03G11.3 -
           Caenorhabditis elegans
          Length = 349

 Score = 88.6 bits (210), Expect = 1e-16
 Identities = 47/135 (34%), Positives = 68/135 (50%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438
           C +C R F    + KH+  C+K  S  RKPFD  K R +G++       ++K     +  
Sbjct: 23  CPICDRRFIKSSLEKHESACRKLASLHRKPFDSGKQRASGSDLT--YADIKKVQHEKNKN 80

Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618
                 +  +NWR++H  FI A+ ++K+V   L  G  L        +   DYVQC +C+
Sbjct: 81  G-GVFPRPQTNWRERHGNFIDAVSSSKRVDYALKTGAPLPP--PPKTAVPSDYVQCEYCS 137

Query: 619 RRFNQGAAERHIPKC 663
           R FN  AAERHIP C
Sbjct: 138 RNFNAAAAERHIPFC 152


>UniRef50_Q4DZW2 Cluster: Putative uncharacterized protein; n=2;
           Trypanosoma cruzi|Rep: Putative uncharacterized protein
           - Trypanosoma cruzi
          Length = 657

 Score = 87.8 bits (208), Expect = 3e-16
 Identities = 52/146 (35%), Positives = 74/146 (50%), Gaps = 11/146 (7%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438
           CG+CGR F +  +A+H+  C     KKR  FD  + RL G E       +R+ TA  ++ 
Sbjct: 504 CGLCGRSFRASILARHESACSNLQ-KKRGVFDTKEQRLEGIEG------IREVTAPSNSV 556

Query: 439 KVNKGKQ-----LNSN------WRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSE 585
               GK+      N+N      W+ +HE+F  A+RA +QV  +  +GG  S         
Sbjct: 557 SQKGGKKHPVAVANTNPDKPPKWKIQHEQFQAAMRAMRQVNVNAPSGGGGSGGKQPMPEA 616

Query: 586 NPDYVQCPHCNRRFNQGAAERHIPKC 663
             D V CPHC R+F +  A+RHIPKC
Sbjct: 617 YDDRVPCPHCGRKFAELTAQRHIPKC 642


>UniRef50_Q22W47 Cluster: Zinc finger, C2H2 type family protein; n=1;
            Tetrahymena thermophila SB210|Rep: Zinc finger, C2H2 type
            family protein - Tetrahymena thermophila SB210
          Length = 1668

 Score = 85.4 bits (202), Expect = 1e-15
 Identities = 55/147 (37%), Positives = 73/147 (49%), Gaps = 2/147 (1%)
 Frame = +1

Query: 259  CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438
            C  C R FASDRI+KH+ +CK   +K+      L+ + A    E    KL K        
Sbjct: 1372 CRKCNRKFASDRISKHESVCKPGPTKQ-----ALRKQKA---LELKKQKLEKND------ 1417

Query: 439  KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSE--NPDYVQCPH 612
            +  + K  N+NWRQ+HEEF   ++  ++V      GG +  L     S     +  QCPH
Sbjct: 1418 RFYEQKLANNNWRQQHEEFQNQLKYMRKVGNVEKNGGDIRSLPPPPKSNAMRSNMKQCPH 1477

Query: 613  CNRRFNQGAAERHIPKCANFQFNKPKP 693
            C R F+  AA RHIPKC     NKPKP
Sbjct: 1478 CLRNFSDEAAARHIPKCKT-TINKPKP 1503


>UniRef50_A0D772 Cluster: Chromosome undetermined scaffold_4, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_4,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 775

 Score = 80.6 bits (190), Expect = 4e-14
 Identities = 51/151 (33%), Positives = 72/151 (47%)
 Frame = +1

Query: 256 ACGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPST 435
           AC  C R FA DRI KH ++CK      +K F+  +H +        + K  KT      
Sbjct: 488 ACEKCDRRFAQDRIKKHMKVCKG-----KKYFEKKEHVVE-------VQKAPKT------ 529

Query: 436 TKVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHC 615
                       WR+ HEEFI  ++  +QV+     GG +  L     S N +YVQCP+C
Sbjct: 530 -----------GWRKYHEEFINTVKYNRQVKKIQEEGGDIKQLGPPPVSSNSNYVQCPYC 578

Query: 616 NRRFNQGAAERHIPKCANFQFNKPKPAAKRR 708
            R+F+   AE+HI  C N   NKPK   +++
Sbjct: 579 QRKFDPSKAEKHISICQNV-VNKPKTIQEKK 608


>UniRef50_A2DDQ6 Cluster: Putative uncharacterized protein; n=1;
           Trichomonas vaginalis G3|Rep: Putative uncharacterized
           protein - Trichomonas vaginalis G3
          Length = 474

 Score = 80.2 bits (189), Expect = 5e-14
 Identities = 54/161 (33%), Positives = 80/161 (49%), Gaps = 5/161 (3%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438
           C +C R FA DRI +H + CK ++S+K+K FD  K R A  +A  +  +    + TP   
Sbjct: 321 CLICHRKFAEDRIDRHMQACKTSNSRKKKVFDSAKMRNADNDAMQYQGR----SETP--P 374

Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAH--LNAGGKL---SDLXXXXXSENPDYVQ 603
           KV K     SN++++HE+ +  ++AA+    +    A GK              + D V+
Sbjct: 375 KVKK-----SNYKEQHEQLVANLKAARAATEYEKAKAEGKAVGPPPKMPEYKLPDDDRVE 429

Query: 604 CPHCNRRFNQGAAERHIPKCANFQFNKPKPAAKRR*PGNPK 726
           CP+C R+F   AA+RHIP C      K K   K   PG  K
Sbjct: 430 CPYCGRKFGSNAAQRHIPFCEKSHAGK-KLNDKGGKPGTTK 469


>UniRef50_Q57XS3 Cluster: Putative uncharacterized protein; n=1;
           Trypanosoma brucei|Rep: Putative uncharacterized protein
           - Trypanosoma brucei
          Length = 651

 Score = 79.8 bits (188), Expect = 7e-14
 Identities = 60/169 (35%), Positives = 74/169 (43%), Gaps = 25/169 (14%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEA------EPFINKLRKTT 420
           C +CGR F S  +A+H+  C K   KKR+ FD+   RL G E          I+  R   
Sbjct: 478 CNLCGRTFRSSILARHEAACAKV-QKKRRVFDMKGQRLEGIEGIHDVAPSSHISHGRGDG 536

Query: 421 ATPSTTKVNKGKQLNS-----NWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXS- 582
            T    K N   Q+        W+ +HE+F  A+RA +QV      GGK         S 
Sbjct: 537 GTFGAGKQNTTAQMGGQAKLPKWKIQHEQFQAAMRAMRQVTPEDAPGGKSGAQSTGSKST 596

Query: 583 -------------ENPDYVQCPHCNRRFNQGAAERHIPKCANFQFNKPK 690
                        E  D V CPHC R+F Q  AERHIPKCA     KPK
Sbjct: 597 QQRQLSQPVPLPAEYDDRVPCPHCGRKFAQMTAERHIPKCAT-TIAKPK 644


>UniRef50_Q4T2E6 Cluster: Chromosome undetermined SCAF10284, whole
           genome shotgun sequence; n=1; Tetraodon
           nigroviridis|Rep: Chromosome undetermined SCAF10284,
           whole genome shotgun sequence - Tetraodon nigroviridis
           (Green puffer)
          Length = 183

 Score = 79.0 bits (186), Expect = 1e-13
 Identities = 46/130 (35%), Positives = 65/130 (50%)
 Frame = +1

Query: 301 KHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTTKVNKGKQLNSNWRQ 480
           +H  ICKK  +KKRK FD  + R  GT+   F   ++  + +P        KQ  +NW +
Sbjct: 1   RHAVICKKLANKKRKVFDSSRQRAEGTDISLF-RPIKPESESPK-------KQ--TNWHK 50

Query: 481 KHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCNRRFNQGAAERHIPK 660
           KH++ I   RA K +   +  GG L        + + DY+QCP+C R FNQ A ERHI  
Sbjct: 51  KHKDIIAHPRAVKPLTLTMKEGGSLPP-PPPPPTYDQDYIQCPYCQRTFNQHAGERHIEF 109

Query: 661 CANFQFNKPK 690
           C       P+
Sbjct: 110 CQEQAARMPR 119


>UniRef50_UPI0000E48303 Cluster: PREDICTED: hypothetical protein,
           partial; n=1; Strongylocentrotus purpuratus|Rep:
           PREDICTED: hypothetical protein, partial -
           Strongylocentrotus purpuratus
          Length = 290

 Score = 78.2 bits (184), Expect = 2e-13
 Identities = 37/83 (44%), Positives = 52/83 (62%), Gaps = 2/83 (2%)
 Frame = +1

Query: 466 SNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCNRRFNQGAAE 645
           +NWRQKHE+FI+A+++AK V   +  G  L          NPDYVQCP C+R F++ A+E
Sbjct: 3   NNWRQKHEDFIEAMQSAKGVSKAIKTGAPLPP-PPAQKRINPDYVQCPSCDRHFSESASE 61

Query: 646 RHIPKC--ANFQFNKPKPAAKRR 708
           RHIP C   N + +K  P+A  +
Sbjct: 62  RHIPWCKEKNKRIDKRTPSAAEK 84


>UniRef50_UPI0000587617 Cluster: PREDICTED: hypothetical protein;
           n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
           hypothetical protein - Strongylocentrotus purpuratus
          Length = 332

 Score = 77.4 bits (182), Expect = 4e-13
 Identities = 40/83 (48%), Positives = 48/83 (57%), Gaps = 3/83 (3%)
 Frame = +1

Query: 466 SNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCNRRFNQGAAE 645
           +NWR  H +F+ AIR+A+Q Q  +N G  L        S NPDY+QCPHC RRF+Q AA 
Sbjct: 40  TNWRNNHADFVNAIRSARQAQHAINTGQPLPP--PPPPSINPDYIQCPHCGRRFSQTAAA 97

Query: 646 RHIPKCA--NFQFNKP-KPAAKR 705
           RHI  C      F  P KP  KR
Sbjct: 98  RHINFCGERTNTFGAPVKPLNKR 120


>UniRef50_Q4QJB7 Cluster: Putative uncharacterized protein; n=3;
           Leishmania|Rep: Putative uncharacterized protein -
           Leishmania major
          Length = 664

 Score = 76.6 bits (180), Expect = 6e-13
 Identities = 52/165 (31%), Positives = 75/165 (45%), Gaps = 21/165 (12%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438
           C  CGR F    + +H+ +C+   +K RK F++ + RL G E    I ++++T A     
Sbjct: 498 CRTCGRRFRISVVMRHEALCRNQANKPRKVFNMREQRLDGVEG---IKEVQRTAARSGGG 554

Query: 439 KVNKG------------------KQLNSNWRQKHEEFIQAIRAAKQVQAHLNAG---GKL 555
              +G                  K     W+ +HE+F  A+RA +Q Q     G   G++
Sbjct: 555 GGGRGAGGGGGRGGGADAAAGAAKGKLPKWKVQHEQFQAAMRAVRQ-QKEAGGGFGSGRM 613

Query: 556 SDLXXXXXSENPDYVQCPHCNRRFNQGAAERHIPKCANFQFNKPK 690
           +        E  D V CPHC R+F Q  A RHIPKCA     KPK
Sbjct: 614 APPPAPIPEEYDDRVPCPHCGRKFAQDVAARHIPKCAT-TIAKPK 657


>UniRef50_A0BPA2 Cluster: Chromosome undetermined scaffold_12, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_12,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 348

 Score = 76.6 bits (180), Expect = 6e-13
 Identities = 42/151 (27%), Positives = 77/151 (50%)
 Frame = +1

Query: 253 DACGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPS 432
           ++C +C R F  +RI +H   C+KA  K+++   +++ +          N+ +K      
Sbjct: 20  ESCDLCNRKFHPERIERHLIACQKAQQKQQERDKIIQKKKKQ-------NEQKKQQLQQV 72

Query: 433 TTKVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPH 612
             ++ K     +NWR++H++F + I+  ++++     G  ++ L       N +YV C +
Sbjct: 73  DVEIVK-----TNWREEHQKFQEQIQYNRKLKQLETEGQDVNQLKPLETKVNSNYVFCEY 127

Query: 613 CNRRFNQGAAERHIPKCANFQFNKPKPAAKR 705
           C R F++  AERHIPKC      KPKP  K+
Sbjct: 128 CERHFDKHVAERHIPKCKEI-IAKPKPPRKK 157



 Score = 55.2 bits (127), Expect = 2e-06
 Identities = 41/141 (29%), Positives = 65/141 (46%), Gaps = 6/141 (4%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKT-TATPST 435
           C  C RHF      +H   CK+  +K + P       +    ++P + + R+   +TPST
Sbjct: 125 CEYCERHFDKHVAERHIPKCKEIIAKPKPPRKKTVEMIQ--PSQPSLQEKRQAQVSTPST 182

Query: 436 TKVNKGK-----QLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYV 600
           +   + K     QL+ + +Q     +Q   A +  +A+L   G +         ++    
Sbjct: 183 SSQMERKPIIKKQLSDSSQQFRPTSLQKFIAEQSGKANLTNIGFIDCKARATAIQD---T 239

Query: 601 QCPHCNRRFNQGAAERHIPKC 663
           +CPHCNRRF   AAERHIP C
Sbjct: 240 ECPHCNRRFISRAAERHIPIC 260


>UniRef50_UPI0000DB6EB3 Cluster: PREDICTED: similar to CG30460-PC,
           isoform C; n=1; Apis mellifera|Rep: PREDICTED: similar
           to CG30460-PC, isoform C - Apis mellifera
          Length = 1091

 Score = 75.4 bits (177), Expect = 1e-12
 Identities = 39/124 (31%), Positives = 64/124 (51%), Gaps = 1/124 (0%)
 Frame = +1

Query: 295 IAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTTKVNKGKQLNSNW 474
           + KH  IC+++ +KKRKPFD  K R+ GTE   F+ +  K   +P        ++ + +W
Sbjct: 5   LEKHARICERSANKKRKPFDSAKQRIQGTELAEFLPRQEKKRRSPE-------EKSSKSW 57

Query: 475 RQKHEEFIQAIRAAKQVQAHLNAGGKLS-DLXXXXXSENPDYVQCPHCNRRFNQGAAERH 651
           +Q H++F++AIRAA+          + S  +     +   +   CP CNR F   A +RH
Sbjct: 58  KQTHDDFLRAIRAARNEIVDSTMQKQCSTTITSSAPTRANEQGMCPTCNRHFGVKAYDRH 117

Query: 652 IPKC 663
           +  C
Sbjct: 118 VAWC 121


>UniRef50_Q23G89 Cluster: Zinc finger, C2H2 type family protein;
           n=1; Tetrahymena thermophila SB210|Rep: Zinc finger,
           C2H2 type family protein - Tetrahymena thermophila SB210
          Length = 718

 Score = 74.1 bits (174), Expect = 3e-12
 Identities = 46/143 (32%), Positives = 69/143 (48%), Gaps = 9/143 (6%)
 Frame = +1

Query: 268 CGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFI------NKLRKTTATP 429
           CGR FA + + KH +ICKK   +KRK FD  K R+   E E  +       K+R   A+ 
Sbjct: 573 CGRRFAPEALEKHAKICKKVFQQKRKKFDTKKQRINDEEHEQILQQAQMEEKMRNQYASK 632

Query: 430 S--TTKVNKGKQ-LNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYV 600
           +    K N  +Q   S WR + E+F   +R+        N GG+ +        +  D +
Sbjct: 633 NKQPAKTNTQQQDKKSKWRMQSEQFRAVLRS--------NKGGEQAQ----DIPQYDDRI 680

Query: 601 QCPHCNRRFNQGAAERHIPKCAN 669
           +CPHC R+F + +  +H   CAN
Sbjct: 681 ECPHCKRKFQESSYNKHEQICAN 703


>UniRef50_A0CCZ1 Cluster: Chromosome undetermined scaffold_169,
           whole genome shotgun sequence; n=3; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_169,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 530

 Score = 72.5 bits (170), Expect = 1e-11
 Identities = 49/143 (34%), Positives = 66/143 (46%), Gaps = 4/143 (2%)
 Frame = +1

Query: 268 CGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTTKVN 447
           CGR F  + + KH ++CKK    KRK F+   HR    E      KL K         + 
Sbjct: 406 CGRRFKENVLDKHIKVCKKVFQSKRKEFNSKAHRQVNQEQV----KLEKQGLVKDKI-IE 460

Query: 448 KGKQLNSN----WRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHC 615
           K KQ+  N    W+++ E F Q I AAK         G  +D+         D V+CP C
Sbjct: 461 KKKQMAQNGDPKWKKQSEAFRQMISAAKS--------GGTADI-----QPQDDLVECPGC 507

Query: 616 NRRFNQGAAERHIPKCANFQFNK 684
            R+F++ AAERHIP C    F +
Sbjct: 508 GRKFSEQAAERHIPGCKKRNFKR 530


>UniRef50_A0E4A7 Cluster: Chromosome undetermined scaffold_78, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_78,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 361

 Score = 71.7 bits (168), Expect = 2e-11
 Identities = 46/144 (31%), Positives = 63/144 (43%), Gaps = 12/144 (8%)
 Frame = +1

Query: 268 CGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINK------------LR 411
           CGR F    + KH +ICKK   +KRK FD  +HR+   +    + K             +
Sbjct: 147 CGRKFKRSALQKHIKICKKVFQEKRKAFDTKEHRILNPDHAKLLQKQEQEDKIQQQQQQK 206

Query: 412 KTTATPSTTKVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENP 591
           K  A P         Q    W+ + E+F    RAA ++    N G  L+        E  
Sbjct: 207 KKQAQPKIDDRPLQGQKKPKWKLQSEQF----RAAMKI----NKGVPLTQQEQVAIEEVD 258

Query: 592 DYVQCPHCNRRFNQGAAERHIPKC 663
           D VQC HC R+FN+  A +HIP C
Sbjct: 259 DRVQCEHCGRKFNEQTALKHIPSC 282


>UniRef50_UPI0000D56C05 Cluster: PREDICTED: similar to CG10999-PA,
           isoform A; n=1; Tribolium castaneum|Rep: PREDICTED:
           similar to CG10999-PA, isoform A - Tribolium castaneum
          Length = 926

 Score = 70.9 bits (166), Expect = 3e-11
 Identities = 44/135 (32%), Positives = 62/135 (45%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438
           C  CGR F    + KH  IC+K  +KKRK FD LK R+ GT+   F  K        S  
Sbjct: 15  CQTCGRTFLPLPLKKHAPICEKNATKKRKVFDSLKQRVEGTDLAQFHQKSYLKKPLESAP 74

Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCN 618
           K  K     + W + H++ + AIR+AK         G +S +       + +  +CP C 
Sbjct: 75  KPQK-----NQWEENHQKLVDAIRSAK---------GNMSSVKKATPPPSLN-ERCPFCE 119

Query: 619 RRFNQGAAERHIPKC 663
           R F   A +RH+  C
Sbjct: 120 RHFGPKAFDRHVEWC 134


>UniRef50_Q22FZ1 Cluster: Putative uncharacterized protein; n=1;
            Tetrahymena thermophila SB210|Rep: Putative
            uncharacterized protein - Tetrahymena thermophila SB210
          Length = 1535

 Score = 68.1 bits (159), Expect = 2e-10
 Identities = 50/142 (35%), Positives = 72/142 (50%), Gaps = 10/142 (7%)
 Frame = +1

Query: 268  CGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRL--AGTEA--EP--FINKLRKTTATP 429
            CGR F  + + KH++ICKK   +KRK FD    RL  +G +   +P    +K +K  A  
Sbjct: 1353 CGRKFNQESLPKHEKICKKVFQQKRKQFDSQAARLNISGMQELDQPPQISSKQQKKNANQ 1412

Query: 430  STTKVNKG-KQLNSN---WRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDY 597
            +  K  K  K  NSN   W+++ E F   ++  +  +A   A  + S +      E   Y
Sbjct: 1413 NQNKKEKNDKNSNSNKPSWKKQSEAFRMQLQQQRTGEA---ADPQSSAM----MQEALGY 1465

Query: 598  VQCPHCNRRFNQGAAERHIPKC 663
            V C  C R+FN+ AAERHIP C
Sbjct: 1466 VGCNFCGRKFNKVAAERHIPFC 1487


>UniRef50_Q4Q229 Cluster: Putative uncharacterized protein; n=3;
           Leishmania|Rep: Putative uncharacterized protein -
           Leishmania major
          Length = 348

 Score = 66.1 bits (154), Expect = 9e-10
 Identities = 38/104 (36%), Positives = 53/104 (50%), Gaps = 13/104 (12%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFI------------- 399
           C  CGR F  DRIA H+ +CK    +KR+ FD  K R AG+E +                
Sbjct: 103 CSKCGRTFNFDRIAYHESVCK--GDQKRRVFDSSKQRCAGSEGDDAYAGGAFGAPSGVRR 160

Query: 400 NKLRKTTATPSTTKVNKGKQLNSNWRQKHEEFIQAIRAAKQVQA 531
            + +K     +T++        +NWRQ+HEEFI AIR+AK+  A
Sbjct: 161 GRTKKLGTANTTSRYTPAPATQTNWRQQHEEFIAAIRSAKRADA 204


>UniRef50_A0E7F3 Cluster: Chromosome undetermined scaffold_81, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_81,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 417

 Score = 62.1 bits (144), Expect = 1e-08
 Identities = 46/140 (32%), Positives = 62/140 (44%), Gaps = 8/140 (5%)
 Frame = +1

Query: 268 CGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFIN-----KLRKTTATPS 432
           CGR F +  + KH +ICKK   +KRK F+  K R    EAE  +        R+    P 
Sbjct: 267 CGRSFNAKALEKHSKICKKVFQQKRKVFNSQKQR--QIEAEDNVKGRGGAMKRQVQKQPM 324

Query: 433 TTKVNKGKQLNS---NWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQ 603
                + +Q+ S    W+ + E F   IR AK  +        LS           D VQ
Sbjct: 325 KQGQKQQQQVKSEKPKWKAQSEAFRAIIRQAKGQRLTKEEQTSLS----GAMESAQDLVQ 380

Query: 604 CPHCNRRFNQGAAERHIPKC 663
           C  CNR+FN  AA++HI  C
Sbjct: 381 CKFCNRKFNTEAAKKHIVFC 400


>UniRef50_A2F2X4 Cluster: Putative uncharacterized protein; n=2;
           Trichomonas vaginalis G3|Rep: Putative uncharacterized
           protein - Trichomonas vaginalis G3
          Length = 225

 Score = 60.9 bits (141), Expect = 3e-08
 Identities = 31/83 (37%), Positives = 44/83 (53%), Gaps = 4/83 (4%)
 Frame = +1

Query: 472 WRQKHEEFIQAIRAAKQV---QAHLNAGGKLSDLXXXXXSENP-DYVQCPHCNRRFNQGA 639
           W++ H++ +++IRAA++    QA L AG  +         E P D VQCP C R+ ++ A
Sbjct: 143 WQRDHDKMVESIRAARRYAKYQADLEAGKAVGPPPELPPIEEPPDLVQCPTCGRKMSEEA 202

Query: 640 AERHIPKCANFQFNKPKPAAKRR 708
           A  H P C     NK   A KRR
Sbjct: 203 ARHHFPVCERMAMNKTYSAPKRR 225


>UniRef50_Q4CZP9 Cluster: Putative uncharacterized protein; n=2;
           Trypanosoma cruzi|Rep: Putative uncharacterized protein
           - Trypanosoma cruzi
          Length = 560

 Score = 60.1 bits (139), Expect = 6e-08
 Identities = 44/152 (28%), Positives = 65/152 (42%), Gaps = 17/152 (11%)
 Frame = +1

Query: 259 CGVCGRHF-ASDRIAKHQEICKKAHSKKR-------------KPFDVLKHRLAGTEAEPF 396
           C  CGRHF A  R  +H  +C++   ++R             KPF     R +  E+  F
Sbjct: 398 CPHCGRHFFAETRWPRHVAVCEQQQQQQRQRKSQAESSRSVQKPFSQRVSRSSNMESMNF 457

Query: 397 INKLRKTTATPSTTKVNKGKQLNSNWRQKHEEFIQA---IRAAKQVQAHLNAGGKLSDLX 567
               ++   TPS+ +   G    S   +K  ++ Q    +R A Q+ +  +    L  + 
Sbjct: 458 SGSFQEALQTPSSNRGKAGNAATSATEKKSSKWRQQRAQLRQALQLGSARSQNNSLKSVG 517

Query: 568 XXXXSENPDYVQCPHCNRRFNQGAAERHIPKC 663
                E+ D V CP C RRF    AERHIP C
Sbjct: 518 DIDVFED-DRVACPACGRRFAPATAERHIPFC 548


>UniRef50_A0E456 Cluster: Chromosome undetermined scaffold_77, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_77,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 566

 Score = 56.8 bits (131), Expect = 5e-07
 Identities = 39/142 (27%), Positives = 59/142 (41%), Gaps = 10/142 (7%)
 Frame = +1

Query: 268 CGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAE-------PFINKLRKTTAT 426
           CGR F    + KH ++C+K   +KRK FD  + R    E E       P   + ++    
Sbjct: 415 CGRSFNKKALEKHAKVCQKVFQQKRKVFDSQQQRQLDEEEEAYRPPPPPSKKQQQQQQQQ 474

Query: 427 PSTTKVNKGKQLNSN---WRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDY 597
               +  K K+  S+   W+ + + F   I+  K  Q        L +          D 
Sbjct: 475 QQQKQAQKQKESKSDKPKWKAQSDAFRAIIKQGKGEQLTKEEQVSLKN----AMDATQDL 530

Query: 598 VQCPHCNRRFNQGAAERHIPKC 663
           VQC  CNR+FN   A++HI  C
Sbjct: 531 VQCKFCNRKFNSETAKKHIAFC 552


>UniRef50_Q38BB2 Cluster: Putative uncharacterized protein; n=3;
           Trypanosoma|Rep: Putative uncharacterized protein -
           Trypanosoma brucei
          Length = 301

 Score = 56.4 bits (130), Expect = 7e-07
 Identities = 36/96 (37%), Positives = 48/96 (50%), Gaps = 6/96 (6%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRL------AGTEAEPFINKLRKTT 420
           C  CGR F  DRIA H+ +CK   + KRK FD  K R        G    P     +K  
Sbjct: 98  CSRCGRKFLFDRIAYHESVCK--GNVKRKVFDSSKQRAIEGQYSGGCFGAPSAKGRKK-- 153

Query: 421 ATPSTTKVNKGKQLNSNWRQKHEEFIQAIRAAKQVQ 528
           A P  +    G    + WR++H EFI+A+RAA+Q +
Sbjct: 154 AAPGASSPAPGVP-RTRWREQHREFIEAMRAARQAR 188


>UniRef50_Q22MG0 Cluster: Putative uncharacterized protein; n=1;
           Tetrahymena thermophila SB210|Rep: Putative
           uncharacterized protein - Tetrahymena thermophila SB210
          Length = 575

 Score = 54.4 bits (125), Expect = 3e-06
 Identities = 48/168 (28%), Positives = 73/168 (43%), Gaps = 16/168 (9%)
 Frame = +1

Query: 253 DACGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLK--HRLAGTEAEPFIN----KLRK 414
           + C  C R F   R+  HQ+ CK  +  K  P ++LK  + L+ +  +  +     K +K
Sbjct: 93  ETCNNCNRQFFQGRLNLHQKSCKPQNPLK--PLNMLKINNILSNSNEQSGLGSKQGKKKK 150

Query: 415 T------TATPSTTKVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLS----DL 564
                  ++  S T    G    S   Q   +F+Q     +QV  +L    K       L
Sbjct: 151 LGYREVLSSRLSATPTTAGSVEKSVNNQNDLKFVQP---DEQVSTNLTTIPKWKIEHQSL 207

Query: 565 XXXXXSENPDYVQCPHCNRRFNQGAAERHIPKCANFQFNKPKPAAKRR 708
                    +YVQC +C R+F    AE+HIP C N  FN+PKP  K++
Sbjct: 208 LLSIKPAQMNYVQCQYCLRKFKPQVAEQHIPNCKNI-FNRPKPPKKQQ 254


>UniRef50_Q24HM2 Cluster: Zinc finger, C2H2 type family protein; n=1;
            Tetrahymena thermophila SB210|Rep: Zinc finger, C2H2 type
            family protein - Tetrahymena thermophila SB210
          Length = 1167

 Score = 53.2 bits (122), Expect = 7e-06
 Identities = 33/96 (34%), Positives = 47/96 (48%), Gaps = 1/96 (1%)
 Frame = +1

Query: 424  TPSTTKVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQ 603
            T   TKV K K   S W+ + E F   +R A+    +     ++         EN DY+Q
Sbjct: 1063 TDENTKVGKKK---SKWQIQSEAFRAQMRMARGETTNSQYDNQI----VKEAFENNDYIQ 1115

Query: 604  CPHCNRRFNQGAAERHIPKC-ANFQFNKPKPAAKRR 708
            C +C R+FN+ AA+RHIP C    Q N+ K   K +
Sbjct: 1116 CEYCGRKFNEQAAQRHIPFCKTKSQQNQIKQGGKAK 1151



 Score = 38.7 bits (86), Expect = 0.15
 Identities = 16/41 (39%), Positives = 22/41 (53%)
 Frame = +1

Query: 268 CGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAE 390
           CGR F    + KH ++CKK    KRK FD+ + R    + E
Sbjct: 466 CGRTFNEFALEKHVKVCKKVFQDKRKAFDITQKRQVAPQNE 506


>UniRef50_Q4T3R5 Cluster: Chromosome undetermined SCAF9936, whole
           genome shotgun sequence; n=1; Tetraodon
           nigroviridis|Rep: Chromosome undetermined SCAF9936,
           whole genome shotgun sequence - Tetraodon nigroviridis
           (Green puffer)
          Length = 276

 Score = 51.2 bits (117), Expect = 3e-05
 Identities = 20/26 (76%), Positives = 22/26 (84%)
 Frame = +1

Query: 586 NPDYVQCPHCNRRFNQGAAERHIPKC 663
           +PDYVQCP+C RRFNQ AAERHI  C
Sbjct: 70  DPDYVQCPYCQRRFNQHAAERHIKFC 95



 Score = 50.0 bits (114), Expect = 6e-05
 Identities = 20/27 (74%), Positives = 22/27 (81%)
 Frame = +1

Query: 583 ENPDYVQCPHCNRRFNQGAAERHIPKC 663
           +N DYVQCP+C RRFNQ AAERHI  C
Sbjct: 201 QNLDYVQCPYCQRRFNQHAAERHIKFC 227



 Score = 40.3 bits (90), Expect = 0.050
 Identities = 16/42 (38%), Positives = 24/42 (57%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTE 384
           C  C R F    + +H  +C+K+ SKKR+ FD  + R  GT+
Sbjct: 5   CNTCKRSFNPKVLMRHSAVCQKSLSKKRRVFDSSRQRAEGTD 46


>UniRef50_UPI0000F20570 Cluster: PREDICTED: hypothetical protein;
           n=3; Danio rerio|Rep: PREDICTED: hypothetical protein -
           Danio rerio
          Length = 350

 Score = 48.8 bits (111), Expect = 1e-04
 Identities = 24/57 (42%), Positives = 33/57 (57%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATP 429
           C VC R FA +R+  H  +C+K    +RK FD+ K+R  GTE E F+ K    + TP
Sbjct: 281 CSVCRRCFAPERLETHMRVCEKKR-PQRKVFDMSKYRARGTELEEFM-KTNSRSRTP 335


>UniRef50_A0D6H1 Cluster: Chromosome undetermined scaffold_4, whole
           genome shotgun sequence; n=4; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_4,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 283

 Score = 48.0 bits (109), Expect = 2e-04
 Identities = 25/88 (28%), Positives = 46/88 (52%), Gaps = 1/88 (1%)
 Frame = +1

Query: 268 CGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTTKVN 447
           CGR F SD + KH ++C++   +KR+ F+  + R+   + +      R+        +  
Sbjct: 199 CGRQFKSDALEKHVKVCRQVFQQKRQEFNSKQARVVTNDQQKL---QRQGQIKEKQLQKK 255

Query: 448 KGK-QLNSNWRQKHEEFIQAIRAAKQVQ 528
           +GK  L+ NW+++ EE    I+ +KQ Q
Sbjct: 256 QGKAPLDPNWKKQSEELRNLIKESKQQQ 283


>UniRef50_UPI00006CBD30 Cluster: Zinc finger, C2H2 type family
           protein; n=1; Tetrahymena thermophila SB210|Rep: Zinc
           finger, C2H2 type family protein - Tetrahymena
           thermophila SB210
          Length = 891

 Score = 46.4 bits (105), Expect = 8e-04
 Identities = 20/34 (58%), Positives = 23/34 (67%)
 Frame = +1

Query: 598 VQCPHCNRRFNQGAAERHIPKCANFQFNKPKPAA 699
           VQCPHC R F + A+ERHIP C N   N+P P A
Sbjct: 725 VQCPHCERVFAKHASERHIPICKNV-LNRPNPLA 757



 Score = 45.6 bits (103), Expect = 0.001
 Identities = 42/152 (27%), Positives = 69/152 (45%), Gaps = 21/152 (13%)
 Frame = +1

Query: 301 KHQEICKKAHSKK--RKPFDVLKHRLAGTEAEPFINKLRKT-TATPSTTKVNKGKQLNSN 471
           K QE     +SK    KP +  K+  + T ++   N++++  T   +     K K+L   
Sbjct: 407 KEQEPTTNENSKIGINKPVNAQKN--STTPSQKANNQIKQLHTEQKNHNNDEKQKKL-PK 463

Query: 472 WRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXS-ENPDYVQCPHCNRRFNQ----- 633
           W+ +H++F++ IR  K+++     GG  S +       E   Y+QC +C R+F +     
Sbjct: 464 WKIEHQQFLENIRYNKKIKQIEKEGGDKSQIERPVDDLEALGYIQCQYCQRKFAKVGIQQ 523

Query: 634 ------------GAAERHIPKCANFQFNKPKP 693
                         AERHIP C N   N+PKP
Sbjct: 524 QFLQINLYLLKLETAERHIPLCKNI-INRPKP 554



 Score = 37.5 bits (83), Expect = 0.35
 Identities = 28/85 (32%), Positives = 42/85 (49%), Gaps = 8/85 (9%)
 Frame = +1

Query: 475 RQKHEEF------IQAIRAA-KQVQAHLNAGGKLSDLXXXXXSENPDYVQCPHCNRRFNQ 633
           R ++EEF      + A+ A  +Q   ++ A G L+        +N DY  CP+CNR+F  
Sbjct: 126 RSQYEEFQEKPIPLTAVLAEERQKNQYIQASGALN----AQSLQNDDYEFCPNCNRKFFS 181

Query: 634 GAAERHIPKCANFQFNKP-KPAAKR 705
           G    H+  C   + NKP KP  K+
Sbjct: 182 GRLNLHLKSC---KPNKPLKPIKKQ 203



 Score = 37.5 bits (83), Expect = 0.35
 Identities = 16/30 (53%), Positives = 19/30 (63%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKP 348
           C +CGR F  DRI KHQ  C K  S+K +P
Sbjct: 300 CDICGRKFMQDRIEKHQVACSK--SQKARP 327



 Score = 35.5 bits (78), Expect = 1.4
 Identities = 40/160 (25%), Positives = 66/160 (41%), Gaps = 17/160 (10%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438
           C  C R F S R+  H + CK   +K  KP     H ++  E +  ++  R  ++T    
Sbjct: 172 CPNCNRKFFSGRLNLHLKSCKP--NKPLKPIKKQSH-ISNEEDQQQLSPQRNNSSTVQFN 228

Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAK---QV-----------QAHLNAG---GKLSDLX 567
           K ++    N+   Q  E   Q  ++ K   Q+           Q+  N      K+++  
Sbjct: 229 KNSEASSTNNIALQNKENDEQMNKSQKNKSQINPNTEKEENFQQSKYNLDIFEHKINNQH 288

Query: 568 XXXXSENPDYVQCPHCNRRFNQGAAERHIPKCANFQFNKP 687
               SE+ + VQC  C R+F Q   E+H   C+  Q  +P
Sbjct: 289 DEQQSED-NRVQCDICGRKFMQDRIEKHQVACSKSQKARP 327


>UniRef50_UPI0000E470FE Cluster: PREDICTED: hypothetical protein;
           n=2; Deuterostomia|Rep: PREDICTED: hypothetical protein
           - Strongylocentrotus purpuratus
          Length = 589

 Score = 44.4 bits (100), Expect = 0.003
 Identities = 48/188 (25%), Positives = 70/188 (37%), Gaps = 19/188 (10%)
 Frame = +1

Query: 157 RKPPVKANSAGSGTPKGRXXXXXXXXXXXXXGDA---CGVCGRHFASDRIAKHQEICK-- 321
           R PP K  + G G P G                    C  CGR FA DRI KH+ IC   
Sbjct: 250 RPPPKKPQALGQGQPMGAGAEAYNAIASQSASSQLAPCSRCGRTFALDRIEKHESICSVK 309

Query: 322 --KAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTTKVNK--GKQLNSNWRQKHE 489
              A S+ + P +    +L+ +++       + +  +   T V    G++  S     HE
Sbjct: 310 SGTAPSRGKTPSEG-NQQLSSSKSRSGPQSFQPSPPSKPRTLVCYICGREFGSKSLPIHE 368

Query: 490 E------FIQAIRAAKQVQAHL----NAGGKLSDLXXXXXSENPDYVQCPHCNRRFNQGA 639
                   IQ  +  K+ +  L    +A G  S       + N + V C  C R FN   
Sbjct: 369 PQCLQKWKIQNSKLPKEHRKQLPRKPDASGGKSANEAAMDAANANLVACKKCGRTFNPDR 428

Query: 640 AERHIPKC 663
            E+H   C
Sbjct: 429 IEKHQSIC 436



 Score = 35.1 bits (77), Expect = 1.9
 Identities = 14/22 (63%), Positives = 15/22 (68%)
 Frame = +1

Query: 256 ACGVCGRHFASDRIAKHQEICK 321
           AC  CGR F  DRI KHQ IC+
Sbjct: 416 ACKKCGRTFNPDRIEKHQSICR 437


>UniRef50_Q7QT08 Cluster: GLP_675_33860_35197; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_675_33860_35197 - Giardia lamblia
           ATCC 50803
          Length = 445

 Score = 44.4 bits (100), Expect = 0.003
 Identities = 37/148 (25%), Positives = 55/148 (37%), Gaps = 4/148 (2%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHS---KKRKPFDVLKHRLAGTEAEPFINKLRKTTATP 429
           C  CGR FA DRI +H+ IC K  +   +  K  D   +     E +P   K   TT   
Sbjct: 214 CHRCGRKFAPDRITQHERICNKLKALPDEVDKAADGDTNYARTREPDPSRFKKNGTTGFN 273

Query: 430 STTKVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAG-GKLSDLXXXXXSENPDYVQC 606
               V K    + +   +      +  AA + +     G G  +         N   V+C
Sbjct: 274 KANIVPKTLTSSGDHDSRSPPIKSSSNAAARGKPFGGKGMGGSAPFGGGGGGMNDGRVEC 333

Query: 607 PHCNRRFNQGAAERHIPKCANFQFNKPK 690
             C R+F     ++H   C N Q   P+
Sbjct: 334 RRCGRKFAPDRIDKHESICKNIQNMDPR 361



 Score = 39.5 bits (88), Expect = 0.087
 Identities = 25/63 (39%), Positives = 29/63 (46%), Gaps = 8/63 (12%)
 Frame = +1

Query: 157 RKPPVKA--NSAGSGTP---KGRXXXXXXXXXXXXXGDA---CGVCGRHFASDRIAKHQE 312
           R PP+K+  N+A  G P   KG               D    C  CGR FA DRI KH+ 
Sbjct: 291 RSPPIKSSSNAAARGKPFGGKGMGGSAPFGGGGGGMNDGRVECRRCGRKFAPDRIDKHES 350

Query: 313 ICK 321
           ICK
Sbjct: 351 ICK 353


>UniRef50_A1ZAP8 Cluster: CG30460-PC, isoform C; n=5; Drosophila
           melanogaster|Rep: CG30460-PC, isoform C - Drosophila
           melanogaster (Fruit fly)
          Length = 1868

 Score = 44.0 bits (99), Expect = 0.004
 Identities = 22/47 (46%), Positives = 27/47 (57%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFI 399
           C  C R FA D + KH  IC+KA SKKRK FD  + R  GT    ++
Sbjct: 254 CPCCSRTFAVDTLRKHVVICEKA-SKKRKIFDSSRQRRDGTALSTYV 299



 Score = 33.9 bits (74), Expect = 4.3
 Identities = 12/25 (48%), Positives = 16/25 (64%)
 Frame = +1

Query: 589 PDYVQCPHCNRRFNQGAAERHIPKC 663
           P   +CPHC+R FN  A +RH+  C
Sbjct: 410 PPCDRCPHCDRTFNPKAFDRHVEWC 434


>UniRef50_Q381C5 Cluster: Putative uncharacterized protein; n=1;
           Trypanosoma brucei|Rep: Putative uncharacterized protein
           - Trypanosoma brucei
          Length = 616

 Score = 43.6 bits (98), Expect = 0.005
 Identities = 28/83 (33%), Positives = 37/83 (44%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438
           C  CGR F  DR+  HQ  CK   +   +P      R     A P  NK R+  A P+T 
Sbjct: 185 CETCGRTFLPDRLEVHQRSCKPGSASASRPVG----RAVAKSATP--NKTRRLAAEPATA 238

Query: 439 KVNKGKQLNSNWRQKHEEFIQAI 507
           +  K K +   + Q  EE I A+
Sbjct: 239 R-RKEKLIPKAFPQDKEEEIDAV 260


>UniRef50_Q4FY30 Cluster: Putative uncharacterized protein; n=3;
           Leishmania|Rep: Putative uncharacterized protein -
           Leishmania major strain Friedlin
          Length = 404

 Score = 38.7 bits (86), Expect = 0.15
 Identities = 36/139 (25%), Positives = 52/139 (37%), Gaps = 2/139 (1%)
 Frame = +1

Query: 253 DACGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKH-RLAGTEAEPFINKLRKTTATP 429
           + C  CGR FA  R+ +H   C++  +   K    +K  R   +  +P         A  
Sbjct: 264 EPCPHCGRTFAPARLERHVVTCERHRNTLPKTKGDMKSCRAFSSRKKPDRTAGGDGAAAS 323

Query: 430 STTKVNK-GKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDYVQC 606
           +    N  G       R   +  I+  +  KQ     NA    S       + + D V C
Sbjct: 324 AAATANTAGAAPGGPLRWSTDTAIKPEKWRKQSAQLRNAMAGAS------VAVDDDRVLC 377

Query: 607 PHCNRRFNQGAAERHIPKC 663
           P C R F+   A RHIP C
Sbjct: 378 PSCGRHFSDDVAARHIPIC 396



 Score = 32.7 bits (71), Expect = 10.0
 Identities = 12/29 (41%), Positives = 14/29 (48%)
 Frame = +1

Query: 604 CPHCNRRFNQGAAERHIPKCANFQFNKPK 690
           CPHC R F     ERH+  C   +   PK
Sbjct: 266 CPHCGRTFAPARLERHVVTCERHRNTLPK 294


>UniRef50_Q387B6 Cluster: Putative uncharacterized protein; n=1;
           Trypanosoma brucei|Rep: Putative uncharacterized protein
           - Trypanosoma brucei
          Length = 568

 Score = 37.5 bits (83), Expect = 0.35
 Identities = 15/24 (62%), Positives = 15/24 (62%)
 Frame = +1

Query: 592 DYVQCPHCNRRFNQGAAERHIPKC 663
           D V CP C RRF    AERHIP C
Sbjct: 533 DRVPCPSCGRRFATHVAERHIPHC 556


>UniRef50_Q7QXY3 Cluster: GLP_479_39609_38410; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_479_39609_38410 - Giardia lamblia
           ATCC 50803
          Length = 399

 Score = 37.1 bits (82), Expect = 0.46
 Identities = 30/142 (21%), Positives = 53/142 (37%), Gaps = 3/142 (2%)
 Frame = +1

Query: 250 GDACGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATP 429
           G AC  C + F +  +  H  IC+   ++            AG + +    + R+     
Sbjct: 3   GIACPFCQKKFQAQDLITHCRICRALQAEASS---------AGQQTKTDTQRSRQPVGAG 53

Query: 430 STTKVNKGKQLNSNWRQKHEE--FIQAIRAAKQVQAH-LNAGGKLSDLXXXXXSENPDYV 600
             T     +  ++   Q   +  ++Q  + +   + H  N   + SD      S      
Sbjct: 54  ERTSNRVSEDFSTRKVQDSSKRSYVQQEQESSPSRNHSANTPARTSDPAEGEESRE---- 109

Query: 601 QCPHCNRRFNQGAAERHIPKCA 666
           +CPHC RRF     E+H+  CA
Sbjct: 110 ECPHCGRRFISSRLEKHVSACA 131



 Score = 32.7 bits (71), Expect = 10.0
 Identities = 20/77 (25%), Positives = 32/77 (41%), Gaps = 3/77 (3%)
 Frame = +1

Query: 253 DACGVCGRHFASDRIAKHQEICKKAHSKKRKPFDV--LKHRLAGTEAEPFINKLRKTT-A 423
           + C  CGR F S R+ KH   C K  +++   F+    + R    E    +N+   +T  
Sbjct: 109 EECPHCGRRFISSRLEKHVSACAKLSTRRVPSFNPHDQRWRNVSNEDRQLVNEAEPSTPM 168

Query: 424 TPSTTKVNKGKQLNSNW 474
           + S  K     +   NW
Sbjct: 169 SRSMVKSKTPVRKKLNW 185


>UniRef50_Q7QRE7 Cluster: GLP_503_3295_2699; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_503_3295_2699 - Giardia lamblia ATCC
           50803
          Length = 198

 Score = 37.1 bits (82), Expect = 0.46
 Identities = 15/30 (50%), Positives = 19/30 (63%)
 Frame = +1

Query: 256 ACGVCGRHFASDRIAKHQEICKKAHSKKRK 345
           +C  CGR FA DRI KH+++C K   K  K
Sbjct: 139 SCEYCGRGFAPDRIDKHRQVCNKHPDKIAK 168


>UniRef50_Q17BP6 Cluster: Putative uncharacterized protein; n=1;
           Aedes aegypti|Rep: Putative uncharacterized protein -
           Aedes aegypti (Yellowfever mosquito)
          Length = 492

 Score = 36.3 bits (80), Expect = 0.81
 Identities = 32/131 (24%), Positives = 59/131 (45%), Gaps = 7/131 (5%)
 Frame = +1

Query: 259 CGVCGRHFA-SDRIAKHQEICKKAHSKKRKPF-----DVLKHRLAGTEAEPFINKLRKTT 420
           C VCG+ F+ S  +AKH+   K+ HSK R PF     D    +    +    ++ +++  
Sbjct: 121 CDVCGKSFSESGNLAKHK---KQVHSKDR-PFKCEICDKSYPQKKDLQGHMLVHTMKRFA 176

Query: 421 ATPSTTKVNKGKQLNSNWRQKH-EEFIQAIRAAKQVQAHLNAGGKLSDLXXXXXSENPDY 597
            +    +  K ++  ++ + KH  + I+   +     A  N+  K S+        N   
Sbjct: 177 CSICKEEFAKIEEKRAHVKAKHPNDSIERSFSCVLCNAVFNSKTKYSNHCLTHGERN--- 233

Query: 598 VQCPHCNRRFN 630
            QCPHC ++F+
Sbjct: 234 FQCPHCTKKFH 244


>UniRef50_UPI00006CBAC8 Cluster: hypothetical protein
           TTHERM_00502700; n=1; Tetrahymena thermophila SB210|Rep:
           hypothetical protein TTHERM_00502700 - Tetrahymena
           thermophila SB210
          Length = 417

 Score = 35.5 bits (78), Expect = 1.4
 Identities = 13/24 (54%), Positives = 15/24 (62%)
 Frame = +1

Query: 592 DYVQCPHCNRRFNQGAAERHIPKC 663
           D V C  C R+F  G AE+HIP C
Sbjct: 368 DLVYCECCKRKFKPGPAEKHIPSC 391


>UniRef50_A0BJ34 Cluster: Chromosome undetermined scaffold_11, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_11,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 354

 Score = 35.5 bits (78), Expect = 1.4
 Identities = 31/98 (31%), Positives = 43/98 (43%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438
           C  CGR F  DRI KH+ +C           D+ K +    E +   NK       P  T
Sbjct: 115 CRKCGRRFNPDRIRKHESVCIGPEP------DIQKIK----EQQQEQNKRAAKYLKPKKT 164

Query: 439 KVNKGKQLNSNWRQKHEEFIQAIRAAKQVQAHLNAGGK 552
               GK     W+Q+H EF QA+R  ++V+    A G+
Sbjct: 165 ----GK-----WKQEHLEFQQAMREMRKVRQQEIAEGR 193


>UniRef50_A3PYR3 Cluster: Putative uncharacterized protein; n=1;
           Mycobacterium sp. JLS|Rep: Putative uncharacterized
           protein - Mycobacterium sp. (strain JLS)
          Length = 606

 Score = 35.1 bits (77), Expect = 1.9
 Identities = 21/37 (56%), Positives = 22/37 (59%), Gaps = 1/37 (2%)
 Frame = -2

Query: 667 LRTSECVARQRPG*TCDC-NEDTARSPDSPTAEGAAD 560
           LR SE V  Q      D  +EDT RSPD  TAEGAAD
Sbjct: 400 LRLSEQVLNQHARQNSDSVSEDTYRSPDPATAEGAAD 436


>UniRef50_Q17JM9 Cluster: Predicted protein; n=1; Aedes aegypti|Rep:
           Predicted protein - Aedes aegypti (Yellowfever mosquito)
          Length = 1131

 Score = 35.1 bits (77), Expect = 1.9
 Identities = 25/80 (31%), Positives = 35/80 (43%), Gaps = 2/80 (2%)
 Frame = +1

Query: 259 CGVCGRHFAS-DRIAKHQEICKKAHSKKRKPFDVLKHRL-AGTEAEPFINKLRKTTATPS 432
           CG CG+ FA  + + KHQ     A  KKR P  +    +      +   NKL K    PS
Sbjct: 509 CGECGKRFAEPNLVRKHQATVHSADKKKRAPVKITSSLVQLHRHVQMHTNKL-KCPKCPS 567

Query: 433 TTKVNKGKQLNSNWRQKHEE 492
             + NK + L  +   KH +
Sbjct: 568 --RFNKKRSLTEHVLTKHSK 585


>UniRef50_Q4AP45 Cluster: Radical SAM; n=3; Bacteria|Rep: Radical
           SAM - Chlorobium phaeobacteroides BS1
          Length = 1005

 Score = 34.3 bits (75), Expect = 3.3
 Identities = 18/60 (30%), Positives = 28/60 (46%), Gaps = 1/60 (1%)
 Frame = +1

Query: 256 ACGVCGRHFASDRI-AKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPS 432
           +CG CGR ++S  +  K   +C   HS      +V+KH    TE    I  + +  + PS
Sbjct: 800 SCGYCGRKYSSSSLCGKGHYVCDTCHSD--DAVEVIKHICLATEETDMIELMERIRSHPS 857


>UniRef50_A7ACD8 Cluster: Putative uncharacterized protein; n=1;
           Parabacteroides merdae ATCC 43184|Rep: Putative
           uncharacterized protein - Parabacteroides merdae ATCC
           43184
          Length = 393

 Score = 34.3 bits (75), Expect = 3.3
 Identities = 17/52 (32%), Positives = 28/52 (53%)
 Frame = -1

Query: 524 TCFAARIAWMNSSCFCRQLLFNCLPLFTLVVDGVAVVLRNLLIKGSASVPAK 369
           T FA +I  + + C C  L +  +PL+  ++D   V+  N L+  +AS P K
Sbjct: 254 TPFAKKIELLGNDCLCMSLRYEQVPLYLSIIDAGFVLRHNSLVNINAS-PTK 304


>UniRef50_Q4Q4S7 Cluster: Putative uncharacterized protein; n=3;
           Leishmania|Rep: Putative uncharacterized protein -
           Leishmania major
          Length = 558

 Score = 34.3 bits (75), Expect = 3.3
 Identities = 21/61 (34%), Positives = 28/61 (45%)
 Frame = +1

Query: 259 CGVCGRHFASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT 438
           C  CGR F  DR+  H + CK    K  KP      R+A   A P       T+A+P++ 
Sbjct: 135 CPNCGRTFLPDRLQVHMKSCKP--GKTAKPVPTAASRVAPPVATP---SAATTSASPASA 189

Query: 439 K 441
           K
Sbjct: 190 K 190


>UniRef50_A2EVV5 Cluster: Putative uncharacterized protein; n=1;
           Trichomonas vaginalis G3|Rep: Putative uncharacterized
           protein - Trichomonas vaginalis G3
          Length = 303

 Score = 34.3 bits (75), Expect = 3.3
 Identities = 13/31 (41%), Positives = 18/31 (58%)
 Frame = +1

Query: 583 ENPDYVQCPHCNRRFNQGAAERHIPKCANFQ 675
           ++ D V C +C R+F   AA RHIP C   +
Sbjct: 268 DSSDRVVCQYCGRKFLPDAARRHIPVCGRIR 298


>UniRef50_Q8MRK6 Cluster: GH27233p; n=1; Drosophila
           melanogaster|Rep: GH27233p - Drosophila melanogaster
           (Fruit fly)
          Length = 1006

 Score = 33.9 bits (74), Expect = 4.3
 Identities = 12/25 (48%), Positives = 16/25 (64%)
 Frame = +1

Query: 589 PDYVQCPHCNRRFNQGAAERHIPKC 663
           P   +CPHC+R FN  A +RH+  C
Sbjct: 62  PPCDRCPHCDRTFNPKAFDRHVEWC 86


>UniRef50_A2E8N3 Cluster: Putative uncharacterized protein; n=2;
           Trichomonas vaginalis G3|Rep: Putative uncharacterized
           protein - Trichomonas vaginalis G3
          Length = 227

 Score = 33.9 bits (74), Expect = 4.3
 Identities = 14/32 (43%), Positives = 17/32 (53%)
 Frame = +1

Query: 580 SENPDYVQCPHCNRRFNQGAAERHIPKCANFQ 675
           +E    V C +C RR    AA RHIP CA  +
Sbjct: 190 TEGDGKVTCQYCGRRLAPDAARRHIPVCAKIR 221


>UniRef50_Q2H9A8 Cluster: Putative uncharacterized protein; n=2;
           Chaetomium globosum|Rep: Putative uncharacterized
           protein - Chaetomium globosum (Soil fungus)
          Length = 633

 Score = 33.9 bits (74), Expect = 4.3
 Identities = 25/113 (22%), Positives = 42/113 (37%), Gaps = 3/113 (2%)
 Frame = +1

Query: 379 TEAEPFI-NKLRKTTATPSTTKVNKGKQLNSNWRQKHEEFIQAIRAAKQ--VQAHLNAGG 549
           T + P +  K++   A P   K    K+ N     K +    AI AA    V   L+   
Sbjct: 21  TPSRPQVFGKIKLKKAPPKQAKPGNWKEANIIEEDKKKSKDNAITAASPSPVTIQLDDAS 80

Query: 550 KLSDLXXXXXSENPDYVQCPHCNRRFNQGAAERHIPKCANFQFNKPKPAAKRR 708
           + +        +  D  QC HC +   + A   H+ +C   +  K +   + R
Sbjct: 81  RENFQTGRPLEDQLDMFQCKHCKKVITRSAGGEHVARCLKIKKEKAQRKKEAR 133


>UniRef50_P39505 Cluster: Uncharacterized 9.4 kDa protein in
           nrdB-nrdA intergenic region; n=5; Viruses|Rep:
           Uncharacterized 9.4 kDa protein in nrdB-nrdA intergenic
           region - Bacteriophage T4
          Length = 83

 Score = 33.9 bits (74), Expect = 4.3
 Identities = 14/36 (38%), Positives = 19/36 (52%), Gaps = 1/36 (2%)
 Frame = +1

Query: 559 DLXXXXXSENPDYVQCPHCNRRFNQGAAER-HIPKC 663
           DL      +  +Y  CPHC ++ N+G A R H  KC
Sbjct: 42  DLISLRTKQGAEYPPCPHCGKKVNKGNALRWHYDKC 77


>UniRef50_P28698 Cluster: Myeloid zinc finger 1; n=19; Eutheria|Rep:
           Myeloid zinc finger 1 - Homo sapiens (Human)
          Length = 734

 Score = 33.9 bits (74), Expect = 4.3
 Identities = 20/77 (25%), Positives = 33/77 (42%), Gaps = 9/77 (11%)
 Frame = +1

Query: 196 TPKGRXXXXXXXXXXXXXGDACGVCGRHFAS-DRIAKHQEI--------CKKAHSKKRKP 348
           +P+GR             G  C VCG+ F+    + +HQ+I        C +      + 
Sbjct: 337 SPRGRSRGRPSTGGGVVRGGRCDVCGKVFSQRSNLLRHQKIHTGERPFVCSECGRSFSRS 396

Query: 349 FDVLKHRLAGTEAEPFI 399
             +L+H+L  TE  PF+
Sbjct: 397 SHLLRHQLTHTEERPFV 413


>UniRef50_Q247Z8 Cluster: Putative uncharacterized protein; n=1;
           Tetrahymena thermophila SB210|Rep: Putative
           uncharacterized protein - Tetrahymena thermophila SB210
          Length = 619

 Score = 33.5 bits (73), Expect = 5.7
 Identities = 15/30 (50%), Positives = 20/30 (66%), Gaps = 2/30 (6%)
 Frame = +1

Query: 601 QCPH-CNRRFNQGAAERHIPKC-ANFQFNK 684
           QCP  C R+FN+ A  +HIP+C  +FQ  K
Sbjct: 499 QCPEGCGRKFNKNALAKHIPQCKKHFQPKK 528


>UniRef50_A5DMI5 Cluster: Putative uncharacterized protein; n=1;
           Pichia guilliermondii|Rep: Putative uncharacterized
           protein - Pichia guilliermondii (Yeast) (Candida
           guilliermondii)
          Length = 216

 Score = 33.5 bits (73), Expect = 5.7
 Identities = 13/42 (30%), Positives = 22/42 (52%)
 Frame = +1

Query: 601 QCPHCNRRFNQGAAERHIPKCANFQFNKPKPAAKRR*PGNPK 726
           +CP CN++F Q   ERH+  C + +  +     ++R P   K
Sbjct: 4   ECPICNKKFPQSLIERHVNSCLDSREAENTSKRRKRSPDTEK 45


>UniRef50_A5KAW1 Cluster: Merozoite surface protein 3 (MSP3),
           putative; n=1; Plasmodium vivax|Rep: Merozoite surface
           protein 3 (MSP3), putative - Plasmodium vivax
          Length = 382

 Score = 33.1 bits (72), Expect = 7.6
 Identities = 25/81 (30%), Positives = 36/81 (44%), Gaps = 1/81 (1%)
 Frame = +1

Query: 283 ASDRIAKHQEICKKAHSKKRKPFDVLKHRLAGTEAEPFINKLRKTTATPSTT-KVNKGKQ 459
           AS+  AK  +  K+A  K +   +  K + A  EA   +  +     TP TT K  +  Q
Sbjct: 71  ASEETAKFADEAKEAFKKAQSLAEEAKEKAA--EAAKAVGAMNGEKDTPPTTEKAQRASQ 128

Query: 460 LNSNWRQKHEEFIQAIRAAKQ 522
             S   QK  E   A+R AK+
Sbjct: 129 AASAAEQKSNEAQAAVRTAKE 149


>UniRef50_A4RL95 Cluster: Predicted protein; n=1; Magnaporthe
           grisea|Rep: Predicted protein - Magnaporthe grisea (Rice
           blast fungus) (Pyricularia grisea)
          Length = 593

 Score = 33.1 bits (72), Expect = 7.6
 Identities = 19/61 (31%), Positives = 30/61 (49%)
 Frame = -2

Query: 706 VV*RPASVC*TESLRTSECVARQRPG*TCDCNEDTARSPDSPTAEGAADRSAYLQHSNVP 527
           V+ + AS C   S RTS  V+   PG    C+  ++ S  S ++ G++  S  L  +N  
Sbjct: 17  VIRKAASACTYPSRRTSRPVSATSPGQLSTCSSSSSGSSGSSSSSGSSRSSGSLSDTNTA 76

Query: 526 A 524
           A
Sbjct: 77  A 77


>UniRef50_Q4D375 Cluster: Dispersed gene family protein 1 (DGF-1),
            putative; n=383; Trypanosoma cruzi|Rep: Dispersed gene
            family protein 1 (DGF-1), putative - Trypanosoma cruzi
          Length = 3520

 Score = 32.7 bits (71), Expect = 10.0
 Identities = 21/59 (35%), Positives = 30/59 (50%), Gaps = 4/59 (6%)
 Frame = -1

Query: 521  CFAARIAWMNSSCFCRQLLF----NCLPLFTLVVDGVAVVLRNLLIKGSASVPAKRCLS 357
            CFAA    M+ SC CR        +CLP++   VDG    L   L+  +A++ A R L+
Sbjct: 2798 CFAAATRAMSGSCRCRCAEGGYGRDCLPVYLPHVDGCNRTLEKPLLSHTATLTATRSLT 2856


>UniRef50_Q227R4 Cluster: Zinc finger, C2H2 type family protein;
           n=7; Tetrahymena thermophila SB210|Rep: Zinc finger,
           C2H2 type family protein - Tetrahymena thermophila SB210
          Length = 363

 Score = 32.7 bits (71), Expect = 10.0
 Identities = 11/29 (37%), Positives = 17/29 (58%)
 Frame = +1

Query: 583 ENPDYVQCPHCNRRFNQGAAERHIPKCAN 669
           +NPD+ QC  C + F++    +HI  C N
Sbjct: 181 DNPDFFQCEICLKAFHKSNCAKHIKVCGN 209


  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 663,840,328
Number of Sequences: 1657284
Number of extensions: 12206966
Number of successful extensions: 42778
Number of sequences better than 10.0: 79
Number of HSP's better than 10.0 without gapping: 40452
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 42671
length of database: 575,637,011
effective HSP length: 99
effective length of database: 411,565,895
effective search space used: 62146450145
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
