SilkBase

Last updated: 2022/11/18
BLASTX 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= bmov11b16
         (621 letters)

Database: uniref50 
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endop...   422   e-117
UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca...    61   2e-08
UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep...    59   7e-08
UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto...    57   3e-07
UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M...    57   4e-07
UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica...    56   5e-07
UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3...    56   7e-07
UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h...    54   2e-06
UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome s...    54   2e-06
UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;...    53   5e-06
UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep: L...    53   5e-06
UniRef50_Q207N1 Cluster: Cathepsin S; n=2; Clupeocephala|Rep: Ca...    53   5e-06
UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No...    53   6e-06
UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei...    53   6e-06
UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schist...    52   1e-05
UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar ...    50   3e-05
UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ...    50   3e-05
UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]...    50   6e-05
UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata...    50   6e-05
UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat...    49   8e-05
UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C...    48   2e-04
UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain...    47   3e-04
UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n...    47   3e-04
UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcopt...    46   7e-04
UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;...    46   7e-04
UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ...    46   7e-04
UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr...    46   7e-04
UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi...    46   7e-04
UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Re...    46   7e-04
UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy...    45   0.001
UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ...    45   0.002
UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ...    45   0.002
UniRef50_UPI00006CBCFB Cluster: Papain family cysteine protease ...    45   0.002
UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster...    45   0.002
UniRef50_UPI0000DA404B Cluster: PREDICTED: similar to cathepsin ...    44   0.002
UniRef50_UPI00015B5D85 Cluster: PREDICTED: similar to cathepsin ...    44   0.003
UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis...    44   0.003
UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er...    44   0.003
UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ...    44   0.003
UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n...    44   0.004
UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n...    43   0.005
UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ...    43   0.007
UniRef50_A7LFV2 Cluster: Cathepsin L protease inhibitor 1; n=1; ...    43   0.007
UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:...    42   0.009
UniRef50_UPI0000D566EC Cluster: PREDICTED: similar to CG10460-PA...    42   0.012
UniRef50_Q70SU8 Cluster: Cystein proteinase inhibitor protein pr...    42   0.012
UniRef50_P12400 Cluster: Protein CTLA-2-beta; n=6; Mus musculus|...    42   0.012
UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dv...    42   0.016
UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, wh...    42   0.016
UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ...    41   0.021
UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ...    41   0.021
UniRef50_Q4N5Z7 Cluster: Cysteine proteinase, putative; n=2; The...    41   0.021
UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome s...    41   0.027
UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:...    41   0.027
UniRef50_Q23H06 Cluster: Papain family cysteine protease contain...    41   0.027
UniRef50_UPI0000E492F4 Cluster: PREDICTED: similar to cathepsin ...    40   0.036
UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1...    40   0.036
UniRef50_A7LFV3 Cluster: Cathepsin L protease inhibitor 2; n=1; ...    40   0.036
UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG...    40   0.063
UniRef50_UPI00001CC928 Cluster: PREDICTED: similar to CTLA-2-bet...    39   0.084
UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat...    39   0.084
UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;...    39   0.084
UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ...    39   0.084
UniRef50_UPI0000D566ED Cluster: PREDICTED: similar to CTLA-2-alp...    39   0.11 
UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaste...    38   0.15 
UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted...    38   0.15 
UniRef50_A0EI50 Cluster: Chromosome undetermined scaffold_98, wh...    38   0.15 
UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ...    38   0.15 
UniRef50_Q23H10 Cluster: Papain family cysteine protease contain...    38   0.19 
UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicate...    38   0.19 
UniRef50_A0M6M0 Cluster: Protein containing DUF28; n=3; Flavobac...    37   0.34 
UniRef50_UPI000155C322 Cluster: PREDICTED: similar to cathepsin ...    37   0.45 
UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve...    37   0.45 
UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve...    37   0.45 
UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh...    37   0.45 
UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w...    37   0.45 
UniRef50_A1ZBK7 Cluster: CG10460-PA; n=1; Drosophila melanogaste...    36   0.59 
UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio...    36   0.78 
UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox...    36   0.78 
UniRef50_A5Z9W8 Cluster: Putative uncharacterized protein; n=1; ...    36   1.0  
UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio...    36   1.0  
UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n...    36   1.0  
UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine prot...    36   1.0  
UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi...    35   1.4  
UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain...    35   1.4  
UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin ...    34   2.4  
UniRef50_A4L250 Cluster: Putative uncharacterized protein; n=1; ...    34   3.1  
UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae...    34   3.1  
UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca...    34   3.1  
UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j...    34   3.1  
UniRef50_Q23ZE2 Cluster: Putative uncharacterized protein; n=1; ...    34   3.1  
UniRef50_Q42312 Cluster: Cysteine protease; n=1; Arabidopsis tha...    33   4.2  
UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|...    33   4.2  
UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip...    33   4.2  
UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain...    33   4.2  
UniRef50_A0DV90 Cluster: Chromosome undetermined scaffold_65, wh...    33   4.2  
UniRef50_A2YT27 Cluster: Putative uncharacterized protein; n=1; ...    33   5.5  
UniRef50_Q950M8 Cluster: Orf511; n=1; Rhizophydium sp. 136|Rep: ...    33   5.5  
UniRef50_Q9Y244 Cluster: Proteasome maturation protein; n=29; Eu...    33   5.5  
UniRef50_Q5NIG4 Cluster: CCA-adding enzyme; n=12; Francisella tu...    33   5.5  
UniRef50_UPI00006CA466 Cluster: oxidoreductase, zinc-binding deh...    33   7.3  
UniRef50_Q5GZY7 Cluster: Putative uncharacterized protein; n=1; ...    33   7.3  
UniRef50_A3IAT4 Cluster: Acyl-CoA dehydrogenase-like protein; n=...    33   7.3  
UniRef50_Q22A69 Cluster: Papain family cysteine protease contain...    33   7.3  
UniRef50_UPI00004D962D Cluster: UPI00004D962D related cluster; n...    32   9.6  
UniRef50_Q47W97 Cluster: TPR domain/sulfotransferase domain prot...    32   9.6  
UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph...    32   9.6  
UniRef50_Q5K600 Cluster: Env protein; n=3; Drosophila melanogast...    32   9.6  
UniRef50_Q231X3 Cluster: Papain family cysteine protease contain...    32   9.6  
UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina...    32   9.6  
UniRef50_Q9URY3 Cluster: GTPase activating protein; n=1; Schizos...    32   9.6  
UniRef50_A3GHI4 Cluster: Predicted protein; n=4; Saccharomycetal...    32   9.6  
UniRef50_Q0PA12 Cluster: DNA translocase ftsK; n=17; Epsilonprot...    32   9.6  
UniRef50_Q8NEC5 Cluster: Cation channel sperm-associated protein...    32   9.6  

>UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2;
           Endopterygota|Rep: Cathepsin L-like proteinase - Bombyx
           mori (Silk moth)
          Length = 402

 Score =  422 bits (1040), Expect = e-117
 Identities = 192/195 (98%), Positives = 195/195 (100%)
 Frame = +3

Query: 36  MMLIVLLLQVSLVNCVNITENTLKIKSLIMKYKNNRVHRSPMTSLDGFVQDALMFFDRTE 215
           MMLIVLLLQ+SLVNCVNITENTLKIKSLIMKYKNNRVHRSPMTSLDGFVQDALMFFDRTE
Sbjct: 1   MMLIVLLLQISLVNCVNITENTLKIKSLIMKYKNNRVHRSPMTSLDGFVQDALMFFDRTE 60

Query: 216 RFKDVLNVYNYSECVGDEGLMEKHVLKGLLVHEHLPRRHWHEYKAIHNKLYSSTHHEMAA 395
           RFKDVLNVYNYSECVGDEGLMEKHVLKGLL+HEHLPRRHWHEYKAIHNKLYSSTHHEMAA
Sbjct: 61  RFKDVLNVYNYSECVGDEGLMEKHVLKGLLIHEHLPRRHWHEYKAIHNKLYSSTHHEMAA 120

Query: 396 LIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTEYFGKVLKLIKAFPLFDPAEDH 575
           L+KWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTEYFGKVLKLIKAFPLFDPAEDH
Sbjct: 121 LMKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTEYFGKVLKLIKAFPLFDPAEDH 180

Query: 576 HKTAYRHNRRCKVPK 620
           HKTAYRHNRRCKVPK
Sbjct: 181 HKTAYRHNRRCKVPK 195


>UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep:
           Cathepsin K - Danio rerio (Zebrafish) (Brachydanio
           rerio)
          Length = 333

 Score = 60.9 bits (141), Expect = 2e-08
 Identities = 29/79 (36%), Positives = 43/79 (54%), Gaps = 1/79 (1%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W  +K  H + Y+  + E      W +N+  +  HN+EY  GI +Y L +NHFGDM + E
Sbjct: 30  WESWKITHKREYNGLNEESIRRTIWEKNMLFIEAHNKEYELGIHTYDLGMNHFGDMTLEE 89

Query: 513 YFGKVLKLIKAFPLF-DPA 566
              KV+ L    P++ DPA
Sbjct: 90  VAEKVMGL--QMPMYRDPA 106


>UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin
           L - Misgurnus mizolepis (Mud loach)
          Length = 337

 Score = 59.3 bits (137), Expect = 7e-08
 Identities = 25/62 (40%), Positives = 36/62 (58%)
 Frame = +3

Query: 330 HWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVT 509
           HW ++K  H K Y         +I W +NLR++  HN E+  GI +Y L +NHFGDM+  
Sbjct: 28  HWEQWKTWHGKNYHEKEEGWRRMI-WEKNLRKIQFHNLEHSMGIHTYRLGMNHFGDMNHE 86

Query: 510 EY 515
           E+
Sbjct: 87  EF 88


>UniRef50_P25774 Cluster: Cathepsin S precursor; n=78;
           Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens
           (Human)
          Length = 331

 Score = 57.2 bits (132), Expect = 3e-07
 Identities = 23/57 (40%), Positives = 32/57 (56%)
 Frame = +3

Query: 330 HWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDM 500
           HWH +K  + K Y   + E    + W +NL+ V  HN E+  G+ SY L +NH GDM
Sbjct: 27  HWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDM 83


>UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15)
           (Major excreted protein) (MEP) [Contains: Cathepsin L
           heavy chain; Cathepsin L light chain]; n=19;
           Eukaryota|Rep: Cathepsin L precursor (EC 3.4.22.15)
           (Major excreted protein) (MEP) [Contains: Cathepsin L
           heavy chain; Cathepsin L light chain] - Homo sapiens
           (Human)
          Length = 333

 Score = 56.8 bits (131), Expect = 4e-07
 Identities = 25/81 (30%), Positives = 44/81 (54%)
 Frame = +3

Query: 288 VLKGLLVHEHLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQS 467
           +    L  +H     W ++KA+HN+LY          + W +N++ +  HN+EY  G  S
Sbjct: 14  IASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAV-WEKNMKMIELHNQEYREGKHS 72

Query: 468 YSLHLNHFGDMHVTEYFGKVL 530
           +++ +N FGDM  +E F +V+
Sbjct: 73  FTMAMNAFGDM-TSEEFRQVM 92


>UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus
           tropicalis|Rep: LOC594890 protein - Xenopus tropicalis
           (Western clawed frog) (Silurana tropicalis)
          Length = 355

 Score = 56.4 bits (130), Expect = 5e-07
 Identities = 23/70 (32%), Positives = 35/70 (50%)
 Frame = +3

Query: 330 HWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVT 509
           HW  +   H K+Y +   E+A  + W   L+ +  HN EY  G+ +Y + +NH GDM   
Sbjct: 51  HWRLWVQTHKKIYKNEGEELARRLIWEDTLKFIMLHNLEYSMGLHTYEVGMNHLGDMVAE 110

Query: 510 EYFGKVLKLI 539
           E   K +  I
Sbjct: 111 EMTDKQMNFI 120


>UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3;
           Bilateria|Rep: Cathepsin L-like cysteine protease -
           Neobenedenia melleni
          Length = 335

 Score = 56.0 bits (129), Expect = 7e-07
 Identities = 26/72 (36%), Positives = 40/72 (55%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W ++K  + K Y S+  E+  L+ W +NL  V +HN  Y  G +SY+L +NH  D+   E
Sbjct: 27  WSQWKVKYQKDYLSSEDELNKLLTWSKNLETVRKHNELYAQGKKSYTLAMNHMADLSSEE 86

Query: 513 YFGKVLKLIKAF 548
           +  K L L+  F
Sbjct: 87  F--KALYLVPKF 96


>UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin
           heavy chain; n=3; Amniota|Rep: PREDICTED: similar to
           ferritin heavy chain - Ornithorhynchus anatinus
          Length = 338

 Score = 54.4 bits (125), Expect = 2e-06
 Identities = 23/55 (41%), Positives = 30/55 (54%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGD 497
           W  +K +H K YS    E+     W +N+R + RHN E   G  SY L +NHFGD
Sbjct: 28  WWRWKVLHGKNYSVEAEEVFRRAAWEKNVRVIERHNEEMSQGKHSYRLAMNHFGD 82


>UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome
           shotgun sequence; n=3; Tetraodontidae|Rep: Chromosome 12
           SCAF14996, whole genome shotgun sequence - Tetraodon
           nigroviridis (Green puffer)
          Length = 362

 Score = 54.4 bits (125), Expect = 2e-06
 Identities = 23/63 (36%), Positives = 35/63 (55%)
 Frame = +3

Query: 327 RHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHV 506
           +HW  +K  H+K Y         ++ W +NL+++  HN E+  G  SY L +NHFGDM  
Sbjct: 26  QHWELWKGWHSKQYHEKEEGWRRMV-WEKNLKKIELHNLEHSMGQHSYRLGMNHFGDMTH 84

Query: 507 TEY 515
            E+
Sbjct: 85  EEF 87


>UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;
           n=1; Nasonia vitripennis|Rep: PREDICTED: similar to
           CG5367-PA - Nasonia vitripennis
          Length = 362

 Score = 53.2 bits (122), Expect = 5e-06
 Identities = 25/70 (35%), Positives = 35/70 (50%)
 Frame = +3

Query: 330 HWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVT 509
           +WH YK  HNK Y+ T  E      W  NL ++  HN    AG   Y L  NH  D+  +
Sbjct: 60  YWHLYKMRHNKTYTGTL-EAVRREAWEDNLLKIYEHNLLAAAGHHEYILRDNHIADLSTS 118

Query: 510 EYFGKVLKLI 539
            Y  +++KL+
Sbjct: 119 SYMRELVKLV 128


>UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep:
           LOC443661 protein - Xenopus laevis (African clawed frog)
          Length = 346

 Score = 53.2 bits (122), Expect = 5e-06
 Identities = 20/57 (35%), Positives = 29/57 (50%)
 Frame = +3

Query: 330 HWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDM 500
           HW  +   H K+Y     E A    W + L+ +  HN EY  G+ +Y + +NH GDM
Sbjct: 50  HWQLWVKTHQKIYKDAEEERARRTIWEETLKFITVHNLEYSLGLHTYEVGMNHLGDM 106


>UniRef50_Q207N1 Cluster: Cathepsin S; n=2; Clupeocephala|Rep:
           Cathepsin S - Ictalurus punctatus (Channel catfish)
          Length = 84

 Score = 53.2 bits (122), Expect = 5e-06
 Identities = 23/57 (40%), Positives = 32/57 (56%)
 Frame = +3

Query: 330 HWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDM 500
           HW  +K  H+K Y+S   E+     W +NLR +  HN E   G+ +Y L +NH GDM
Sbjct: 25  HWLMWKKNHSKTYTSELEELGRREIWERNLRLITVHNLEASLGMHTYHLGMNHMGDM 81


>UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel
           protein - Danio rerio (Zebrafish) (Brachydanio rerio)
          Length = 328

 Score = 52.8 bits (121), Expect = 6e-06
 Identities = 23/60 (38%), Positives = 31/60 (51%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W  +K+ HNK Y +T  E      W+QNL+ +  HN     G+ SY+L LN   DM   E
Sbjct: 27  WTTWKSQHNKTYRNTREERLRRSVWKQNLQDILLHNEAAAVGLHSYTLGLNQLSDMTADE 86


>UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine
           proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like
           midgut cysteine proteinase - Tenebrio molitor (Yellow
           mealworm)
          Length = 330

 Score = 52.8 bits (121), Expect = 6e-06
 Identities = 24/76 (31%), Positives = 41/76 (53%)
 Frame = +3

Query: 300 LLVHEHLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLH 479
           L + + L +  W ++K  H K YSS   E+   + ++ N+ ++A HN ++  G  +YS  
Sbjct: 17  LALPKSLFQEQWSQFKLTHKKSYSSPIEEIRRQLIFKDNVAKIAEHNAKFEKGEVTYSKA 76

Query: 480 LNHFGDMHVTEYFGKV 527
           +N FGDM   E+   V
Sbjct: 77  MNQFGDMSKEEFLAYV 92


>UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3;
           Schistosoma japonicum|Rep: Clone ZZD209 mRNA sequence -
           Schistosoma japonicum (Blood fluke)
          Length = 339

 Score = 51.6 bits (118), Expect = 1e-05
 Identities = 19/61 (31%), Positives = 35/61 (57%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W  +K +H+K Y++ H E+     W +NL ++  HN  Y  G+++Y + L+ F D+   E
Sbjct: 31  WKIWKRLHDKHYTNRHEEVVRRRNWNENLVKIHLHNLRYDLGVETYEIGLSRFSDVDWNE 90

Query: 513 Y 515
           +
Sbjct: 91  F 91


>UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar to
           human SRY (sex determining region Y)-box 30
           (SOX30),transcript variant 1,; n=3; Mammalia|Rep: Testis
           cDNA clone: QtsA-12228, similar to human SRY (sex
           determining region Y)-box 30 (SOX30),transcript variant
           1, - Macaca fascicularis (Crab eating macaque)
           (Cynomolgus monkey)
          Length = 433

 Score = 50.4 bits (115), Expect = 3e-05
 Identities = 21/66 (31%), Positives = 38/66 (57%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W+++KA H +LY ++       + W +N++ +  HN EY  G   +++ +N FGDM   E
Sbjct: 29  WYQWKATHRRLYGASEEGWRRAV-WEKNMKMIELHNGEYSQGKHGFAMAMNAFGDM-TNE 86

Query: 513 YFGKVL 530
            F +V+
Sbjct: 87  EFRQVM 92


>UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor;
           n=35; Fasciola|Rep: Cathepsin L-like proteinase
           precursor - Fasciola hepatica (Liver fluke)
          Length = 326

 Score = 50.4 bits (115), Expect = 3e-05
 Identities = 22/66 (33%), Positives = 37/66 (56%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           WH++K ++NK Y+    +    I W +N++ +  HN  +  G+ +Y+L LN F DM   E
Sbjct: 21  WHQWKRMYNKEYNGADDQHRRNI-WEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEE 79

Query: 513 YFGKVL 530
           +  K L
Sbjct: 80  FKAKYL 85


>UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1];
           n=11; Eutheria|Rep: Testin-2 precursor [Contains:
           Testin-1] - Mus musculus (Mouse)
          Length = 333

 Score = 49.6 bits (113), Expect = 6e-05
 Identities = 19/61 (31%), Positives = 34/61 (55%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W+E++  H K Y+     +   + W +N + +  HN EYL G   +++ +N FGD+  TE
Sbjct: 29  WNEWRTKHGKAYNVNEERLRRAV-WEKNFKMIELHNWEYLEGKHDFTMTMNAFGDLTNTE 87

Query: 513 Y 515
           +
Sbjct: 88  F 88


>UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42;
           Coelomata|Rep: Cathepsin L2 precursor - Homo sapiens
           (Human)
          Length = 334

 Score = 49.6 bits (113), Expect = 6e-05
 Identities = 19/61 (31%), Positives = 34/61 (55%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W+++KA H +LY +        + W +N++ +  HN EY  G   +++ +N FGDM   E
Sbjct: 29  WYQWKATHRRLYGANEEGWRRAV-WEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEE 87

Query: 513 Y 515
           +
Sbjct: 88  F 88


>UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes
           abbreviatus|Rep: Cathepsin L 2 - Diaprepes abbreviatus
           (Sugarcane rootstalk borer weevil)
          Length = 348

 Score = 49.2 bits (112), Expect = 8e-05
 Identities = 20/69 (28%), Positives = 36/69 (52%)
 Frame = +3

Query: 309 HEHLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNH 488
           ++ L +  W ++K  H K+Y S          + +NL ++  HN+ Y  G+ SY + +NH
Sbjct: 20  YQVLVQEQWEQFKLEHGKVYESESENEYRQSVFMENLFQINEHNKLYEMGLSSYQMAMNH 79

Query: 489 FGDMHVTEY 515
            GD+   E+
Sbjct: 80  LGDLTKDEF 88


>UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15)
           [Contains: Cathepsin L heavy chain; Cathepsin L light
           chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC
           3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain] - Sarcophaga peregrina (Flesh fly)
           (Boettcherisca peregrina)
          Length = 339

 Score = 48.0 bits (109), Expect = 2e-04
 Identities = 21/61 (34%), Positives = 33/61 (54%)
 Frame = +3

Query: 318 LPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGD 497
           L +  WH YK  H K Y++   E   +  + +N  ++A+HN+ +  G  SY L LN + D
Sbjct: 23  LIKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYAD 82

Query: 498 M 500
           M
Sbjct: 83  M 83


>UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain;
           n=9; Cucujiformia|Rep: Digestive cysteine proteinase
           intestain - Leptinotarsa decemlineata (Colorado potato
           beetle)
          Length = 326

 Score = 47.2 bits (107), Expect = 3e-04
 Identities = 25/84 (29%), Positives = 40/84 (47%)
 Frame = +3

Query: 300 LLVHEHLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLH 479
           L V+    +  W  +K  H K Y S   E      ++ NLR++  HN +Y  G +SY L 
Sbjct: 12  LAVNALTDKDQWVAFKQTHGKTYKSLLEERTRFGIFQSNLRKIEEHNAKYDKGEESYFLG 71

Query: 480 LNHFGDMHVTEYFGKVLKLIKAFP 551
           +  F D+   E+  ++ + IK  P
Sbjct: 72  VTPFADLTHDEFKDELRRQIKTKP 95


>UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3;
           Bilateria|Rep: Cathepsin L-like cysteine proteinase -
           Longidorus elongatus
          Length = 358

 Score = 47.2 bits (107), Expect = 3e-04
 Identities = 26/81 (32%), Positives = 41/81 (50%), Gaps = 1/81 (1%)
 Frame = +3

Query: 276 MEKHVLKGLLV-HEHLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYL 452
           +++H    LL+ H + P   W  +K  H K Y +   E+     +  N + + +HN EY 
Sbjct: 25  IQEHPRNNLLINHPYYPV--WTNFKLKHAKSYKTKDEELLRFQVFASNHKVIEQHNIEYE 82

Query: 453 AGIQSYSLHLNHFGDMHVTEY 515
           AG  S++L LN F DM   E+
Sbjct: 83  AGQHSFALSLNKFADMTNAEF 103


>UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcoptes
           scabiei type hominis|Rep: Cathepsin L-like protease -
           Sarcoptes scabiei type hominis
          Length = 245

 Score = 46.0 bits (104), Expect = 7e-04
 Identities = 19/68 (27%), Positives = 40/68 (58%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W  +KA +N+ + + + E+   + +++N   + +HN +Y AG+ +Y L +N F D+   E
Sbjct: 33  WTVFKAKYNRQFRTVYDELLRKLIFQRNYIYIRKHNEKYEAGLSTYELGVNQFTDLTNKE 92

Query: 513 YFGKVLKL 536
           Y  ++ +L
Sbjct: 93  YNDQMNRL 100


>UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           L-like proteinase" precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 315

 Score = 46.0 bits (104), Expect = 7e-04
 Identities = 19/61 (31%), Positives = 35/61 (57%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W  +KA HNK Y+    ++   + ++ NL+++  HN +Y +G ++Y L +N F D    E
Sbjct: 24  WTSFKATHNKSYNVIEDKLRFAV-FQDNLKKIEEHNAKYESGEETYYLAVNKFADWSSAE 82

Query: 513 Y 515
           +
Sbjct: 83  F 83


>UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           L-like proteinase precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 317

 Score = 46.0 bits (104), Expect = 7e-04
 Identities = 20/63 (31%), Positives = 32/63 (50%)
 Frame = +3

Query: 327 RHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHV 506
           + W ++K  H+K Y     E      + QNL+++ +HN  Y  G  S+ L +N F DM  
Sbjct: 14  QQWAQFKVNHSKKYGHLKEEQVRFQVFSQNLQKIEQHNARYQNGEVSFYLGVNQFADMTS 73

Query: 507 TEY 515
            E+
Sbjct: 74  EEF 76


>UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase
           precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin
           L-like cysteine proteinase precursor - Acanthoscelides
           obtectus (Bean weevil)
          Length = 321

 Score = 46.0 bits (104), Expect = 7e-04
 Identities = 22/70 (31%), Positives = 36/70 (51%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W ++K  H + Y +   E      ++ NLR +  HN  Y  G +++ + +N FGDM   E
Sbjct: 23  WQQFKIQHGRTYRTLLEEKRRFEIFKFNLRTIEEHNERYHNGEETFEMGINQFGDM-TQE 81

Query: 513 YFGKVLKLIK 542
            F ++L L K
Sbjct: 82  EFKRMLALQK 91


>UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18;
           Schistosoma|Rep: Preprocathepsin cathepsin L -
           Schistosoma japonicum (Blood fluke)
          Length = 331

 Score = 46.0 bits (104), Expect = 7e-04
 Identities = 20/78 (25%), Positives = 42/78 (53%), Gaps = 3/78 (3%)
 Frame = +3

Query: 288 VLKGLLVHEHLPRRH---WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAG 458
           ++  + V +H  +++   W ++K  +NK Y+S   EM   + + + + ++  HN  +  G
Sbjct: 9   IITAITVAQHYDKQYDEIWRQWKLKYNKTYTSNDDEMRRKMIFMRRIGKIQEHNLRHDLG 68

Query: 459 IQSYSLHLNHFGDMHVTE 512
           ++ Y++ LN F DM   E
Sbjct: 69  LEGYTMGLNQFCDMEWEE 86


>UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Rep:
           Cathepsin R precursor - Mus musculus (Mouse)
          Length = 334

 Score = 46.0 bits (104), Expect = 7e-04
 Identities = 23/81 (28%), Positives = 41/81 (50%)
 Frame = +3

Query: 288 VLKGLLVHEHLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQS 467
           V  G+ V +      W ++K  +NK YS    ++  ++ W + L+ +  HNRE   G   
Sbjct: 14  VASGVPVLDSSLDAEWQDWKIKYNKSYSLKEEKLKRVV-WEEKLKMIKLHNRENSLGKNG 72

Query: 468 YSLHLNHFGDMHVTEYFGKVL 530
           +++ +N FGD    E F K++
Sbjct: 73  FTMKMNEFGD-QTDEEFRKMM 92


>UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2;
           Platyhelminthes|Rep: Cathepsin L-like proteinase -
           Echinococcus multilocularis
          Length = 338

 Score = 45.2 bits (102), Expect = 0.001
 Identities = 22/68 (32%), Positives = 34/68 (50%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W  +K  +NK Y++   E   +  +  N   V  HN  Y  G+++YS  LN F D+ + E
Sbjct: 30  WRGWKVANNKTYATLREEHLRMRIFINNYLFVRWHNERYYLGLETYSTALNAFADLTLEE 89

Query: 513 YFGKVLKL 536
           +  K L L
Sbjct: 90  FAEKYLTL 97


>UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin
           L-like protease; n=1; Nasonia vitripennis|Rep:
           PREDICTED: similar to cathepsin L-like protease -
           Nasonia vitripennis
          Length = 353

 Score = 44.8 bits (101), Expect = 0.002
 Identities = 19/61 (31%), Positives = 33/61 (54%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W  +K  + K Y+    E      + +N R++A HN+++  G+ +Y + +N FGDM   E
Sbjct: 40  WAAFKLRYKKNYNGDVEENFRRSVFHENQRKIAEHNQKHDLGLFTYKVRINQFGDMMFEE 99

Query: 513 Y 515
           Y
Sbjct: 100 Y 100


>UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin L
           preproprotein; n=1; Monodelphis domestica|Rep:
           PREDICTED: similar to cathepsin L preproprotein -
           Monodelphis domestica
          Length = 356

 Score = 44.8 bits (101), Expect = 0.002
 Identities = 20/61 (32%), Positives = 31/61 (50%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W  +K  + K YS         + W +NL+ +  HNR +  G +SY + +N FGDM   E
Sbjct: 29  WEAWKTTYGKNYSEKEESFRRQV-WEKNLKLINDHNRLFKEGKKSYFMGMNQFGDMTDKE 87

Query: 513 Y 515
           +
Sbjct: 88  F 88


>UniRef50_UPI00006CBCFB Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 393

 Score = 44.8 bits (101), Expect = 0.002
 Identities = 23/86 (26%), Positives = 47/86 (54%), Gaps = 5/86 (5%)
 Frame = +3

Query: 273 LMEKHVLKGLLVHEHLPRRHWHEYKAIHNK-----LYSSTHHEMAALIKWRQNLRRVARH 437
           L E  +L+  L H  L + H+ ++K+ H +     L  S + E   L  +++NL +++ H
Sbjct: 30  LREVFILQSELSHAEL-KEHYEQWKSKHQQTKQTLLGDSEYSETYRLTNFKENLLKISEH 88

Query: 438 NREYLAGIQSYSLHLNHFGDMHVTEY 515
           N++++ G  S+++ LN F  +   E+
Sbjct: 89  NKKFIDGHYSFTMKLNQFAHLSSEEF 114


>UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila
           melanogaster|Rep: CG5367-PA - Drosophila melanogaster
           (Fruit fly)
          Length = 338

 Score = 44.8 bits (101), Expect = 0.002
 Identities = 21/74 (28%), Positives = 38/74 (51%)
 Frame = +3

Query: 324 RRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMH 503
           +  + ++K  +N+ Y  T+ EM +   + +N + +  HN+ Y  G  S+ L  N F DM 
Sbjct: 33  KSEFEKFKNNNNRKYLRTYDEMRSYKAFEENFKVIEEHNQNYKEGQTSFRLKPNIFADMS 92

Query: 504 VTEYFGKVLKLIKA 545
              Y    L+L+K+
Sbjct: 93  TDGYLKGFLRLLKS 106


>UniRef50_UPI0000DA404B Cluster: PREDICTED: similar to cathepsin 8;
           n=2; Rattus norvegicus|Rep: PREDICTED: similar to
           cathepsin 8 - Rattus norvegicus
          Length = 336

 Score = 44.4 bits (100), Expect = 0.002
 Identities = 20/61 (32%), Positives = 34/61 (55%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W E+K  ++K YS         + W +N++ V +HN EY  G  ++++ +N FGDM   E
Sbjct: 29  WQEWKIKYDKNYSLEEEGQRRAV-WEENMKVVKQHNIEYDQGKNNFTMKVNAFGDMTGEE 87

Query: 513 Y 515
           +
Sbjct: 88  F 88


>UniRef50_UPI00015B5D85 Cluster: PREDICTED: similar to cathepsin
           L-like proteinase; n=1; Nasonia vitripennis|Rep:
           PREDICTED: similar to cathepsin L-like proteinase -
           Nasonia vitripennis
          Length = 96

 Score = 44.0 bits (99), Expect = 0.003
 Identities = 22/70 (31%), Positives = 35/70 (50%)
 Frame = +3

Query: 288 VLKGLLVHEHLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQS 467
           +L  ++V + L    W +YK   NK Y++   E      +    ++V  HN +Y  G  S
Sbjct: 8   LLMAVVVVQVLADDEWEQYKIKFNKKYANPEEEQRRYKIYLDTKKKVEEHNVKYNNGEVS 67

Query: 468 YSLHLNHFGD 497
           +SL +NHF D
Sbjct: 68  FSLGINHFAD 77


>UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides
           sonorensis|Rep: Cathepsin L - Culicoides sonorensis
          Length = 331

 Score = 44.0 bits (99), Expect = 0.003
 Identities = 19/61 (31%), Positives = 33/61 (54%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W ++K  +NK+Y  +  E      + +NL  V  HN  YL+G+++Y   +N F D+   E
Sbjct: 27  WKKFKLEYNKVYPLSTEENLRKGIFERNLADVMEHNARYLSGMETYEKGVNQFSDLTYEE 86

Query: 513 Y 515
           +
Sbjct: 87  F 87


>UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra
           erinaceieuropaei|Rep: Cysteine proteinase - Spirometra
           erinaceieuropaei (Tapeworm)
          Length = 336

 Score = 44.0 bits (99), Expect = 0.003
 Identities = 26/84 (30%), Positives = 40/84 (47%), Gaps = 1/84 (1%)
 Frame = +3

Query: 288 VLKGLLVHEHLPRRH-WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQ 464
           V +G    E   RR  W  +K    K Y S+  E+     +  NL  + RHN+ Y   ++
Sbjct: 16  VCRGSTESETYVRRELWKAWKLAFKKEYFSSEEELHRKRAFFNNLDFIIRHNQRYYQQLE 75

Query: 465 SYSLHLNHFGDMHVTEYFGKVLKL 536
           SY++ LN F D+   E+  + L L
Sbjct: 76  SYAVRLNDFSDLTPGEFAERYLCL 99


>UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor;
           n=6; Chrysomelinae|Rep: Cathepsin L-like proteinase
           precursor - Phaedon cochleariae (Mustard beetle)
          Length = 324

 Score = 44.0 bits (99), Expect = 0.003
 Identities = 20/67 (29%), Positives = 34/67 (50%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W ++K  H + Y S   E      ++  LR++A HN +Y  G  +Y L +N F D+   E
Sbjct: 23  WADFKKTHARTYKSLREEKLRFNIFQDTLRQIAEHNVKYENGESTYYLAINKFSDITDEE 82

Query: 513 YFGKVLK 533
           +   ++K
Sbjct: 83  FRDMLMK 89


>UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n=1;
           Xenopus tropicalis|Rep: UPI00006A2275 UniRef100 entry -
           Xenopus tropicalis
          Length = 272

 Score = 43.6 bits (98), Expect = 0.004
 Identities = 17/45 (37%), Positives = 26/45 (57%)
 Frame = +3

Query: 366 YSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDM 500
           Y+S   E A    W + L+ ++ HN EY  G+ +Y + +NH GDM
Sbjct: 1   YNSQEEERARRTIWEETLKFISVHNLEYSLGLHTYEVGMNHLGDM 45


>UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2;
           Taeniidae|Rep: Cathepsin L-like cysteine proteinase -
           Taenia solium (Pork tapeworm)
          Length = 339

 Score = 43.2 bits (97), Expect = 0.005
 Identities = 23/76 (30%), Positives = 38/76 (50%)
 Frame = +3

Query: 303 LVHEHLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHL 482
           L+ E    R W  +K  H ++YS         + + +NL  +   NR + AG++SYS  L
Sbjct: 25  LLTERELSRQWAGWKLQHGRVYSGKEEAYRRGV-FARNLLYIKGQNRRFNAGLESYSTGL 83

Query: 483 NHFGDMHVTEYFGKVL 530
           N F D+  +E+  + L
Sbjct: 84  NQFADLESSEFSERFL 99


>UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine
           protease; n=11; Callosobruchus maculatus|Rep: Putative
           gut cathepsin L-like cysteine protease - Callosobruchus
           maculatus (Southern cowpea weevil) (Pulse bruchid)
          Length = 326

 Score = 42.7 bits (96), Expect = 0.007
 Identities = 22/68 (32%), Positives = 35/68 (51%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W ++K  H K Y S   E      +++NL  +  HN++Y  G +S++  +  F DM   E
Sbjct: 23  WQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADM-THE 81

Query: 513 YFGKVLKL 536
            F  +LKL
Sbjct: 82  EFLDLLKL 89


>UniRef50_A7LFV2 Cluster: Cathepsin L protease inhibitor 1; n=1;
           Diaprepes abbreviatus|Rep: Cathepsin L protease
           inhibitor 1 - Diaprepes abbreviatus (Sugarcane rootstalk
           borer weevil)
          Length = 109

 Score = 42.7 bits (96), Expect = 0.007
 Identities = 19/62 (30%), Positives = 31/62 (50%)
 Frame = +3

Query: 330 HWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVT 509
           HW+ +K   N+ Y S   E      ++ NL+ +  H ++Y AG  SY   +N F D+   
Sbjct: 34  HWNNFKTKFNRNYESPEEESKRFEIFKNNLKDIQAHQKKYEAGEVSYQQGVNDFTDLTHE 93

Query: 510 EY 515
           E+
Sbjct: 94  EF 95


>UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:
           Cathepsin - Petromyzon marinus (Sea lamprey)
          Length = 333

 Score = 42.3 bits (95), Expect = 0.009
 Identities = 21/66 (31%), Positives = 34/66 (51%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W  +K+ + K Y S   +      + QNL+RV +HN     G  S+ L +N + D+ + E
Sbjct: 27  WDTWKSTYGKHYGSEQEDAHRRDVFEQNLKRVLQHNLLADEGNVSFHLGINKYSDLELHE 86

Query: 513 YFGKVL 530
           Y  KV+
Sbjct: 87  YHEKVV 92


>UniRef50_UPI0000D566EC Cluster: PREDICTED: similar to CG10460-PA;
           n=1; Tribolium castaneum|Rep: PREDICTED: similar to
           CG10460-PA - Tribolium castaneum
          Length = 80

 Score = 41.9 bits (94), Expect = 0.012
 Identities = 19/62 (30%), Positives = 30/62 (48%)
 Frame = +3

Query: 312 EHLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHF 491
           E      W+E+KA + K Y+    E      +  NL+ V  HN +Y  G+ +Y + +N F
Sbjct: 7   EEFIEEKWNEFKAKYRKNYTDAEEESYRKSLFVANLQMVESHNEKYEDGLVNYKMGINQF 66

Query: 492 GD 497
            D
Sbjct: 67  AD 68


>UniRef50_Q70SU8 Cluster: Cystein proteinase inhibitor protein
           precursor; n=4; Salmonidae|Rep: Cystein proteinase
           inhibitor protein precursor - Salmo salar (Atlantic
           salmon)
          Length = 342

 Score = 41.9 bits (94), Expect = 0.012
 Identities = 30/104 (28%), Positives = 46/104 (44%), Gaps = 4/104 (3%)
 Frame = +3

Query: 273 LMEKHVLKGLLV----HEHLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHN 440
           L  + V KGLL      E    + +  +K  + K Y+ST  E      W     RV  HN
Sbjct: 89  LTTEEVPKGLLPMPRPEEEEVDKEFEMWKTHNGKTYNSTEEEAKRKEIWLATRARVMEHN 148

Query: 441 REYLAGIQSYSLHLNHFGDMHVTEYFGKVLKLIKAFPLFDPAED 572
           +    G +S+++ +N+F DM   E      +L+  FP  D  E+
Sbjct: 149 KRAENGSESFTMGINYFSDMTFEEI--PKARLMVVFPTRDGGEE 190



 Score = 41.9 bits (94), Expect = 0.012
 Identities = 20/69 (28%), Positives = 31/69 (44%)
 Frame = +3

Query: 327 RHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHV 506
           + +  +K  H K Y ST  E      W     RV  HN+    G +S+++ +NH  D   
Sbjct: 195 KEFETWKVQHGKNYGSTEEEAKRKGIWLATRTRVMEHNKRAETGSESFTMGMNHLSDKTT 254

Query: 507 TEYFGKVLK 533
            E  G+ L+
Sbjct: 255 AEVTGRRLQ 263



 Score = 39.1 bits (87), Expect = 0.084
 Identities = 17/62 (27%), Positives = 29/62 (46%)
 Frame = +3

Query: 327 RHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHV 506
           + +  +K  + K Y ST  E      W    ++V  HN     G++SY++ +NH  D+  
Sbjct: 32  KEFETWKVKYGKSYPSTEEEAKRKEMWLATRKKVMEHNTRAGNGLESYTMAVNHLADLTT 91

Query: 507 TE 512
            E
Sbjct: 92  EE 93



 Score = 38.7 bits (86), Expect = 0.11
 Identities = 17/62 (27%), Positives = 30/62 (48%)
 Frame = +3

Query: 327 RHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHV 506
           + +  +K  + K Y ST  E      W    + V  HN+    G++S+++ +NHF D+  
Sbjct: 272 KEFETWKVKYGKTYPSTVEEAKRKEIWLATRKMVMEHNKRAENGLESFTMGVNHFADLTA 331

Query: 507 TE 512
            E
Sbjct: 332 EE 333


>UniRef50_P12400 Cluster: Protein CTLA-2-beta; n=6; Mus
           musculus|Rep: Protein CTLA-2-beta - Mus musculus (Mouse)
          Length = 113

 Score = 41.9 bits (94), Expect = 0.012
 Identities = 19/61 (31%), Positives = 30/61 (49%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W E+K    K YS        L+ W +N +++  HN +Y  G  S+ + LN F D+   E
Sbjct: 16  WKEWKTTFAKAYSLDEERHRRLM-WEENKKKIEAHNADYERGKTSFYMGLNQFSDLTPEE 74

Query: 513 Y 515
           +
Sbjct: 75  F 75


>UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep:
           Dvir_CG5367 - Drosophila virilis (Fruit fly)
          Length = 298

 Score = 41.5 bits (93), Expect = 0.016
 Identities = 21/70 (30%), Positives = 35/70 (50%)
 Frame = +3

Query: 351 IHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTEYFGKVL 530
           I+N+ Y+ +H EM +   + +N   V  HN  Y  G  S+ L  N   DM+   Y    L
Sbjct: 2   INNRSYARSHDEMRSYEAYEENQIIVNEHNTYYETGKSSFRLATNTMADMNTDSYLKGYL 61

Query: 531 KLIKAFPLFD 560
           +L+++  + D
Sbjct: 62  RLLRSPEISD 71


>UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_79,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 324

 Score = 41.5 bits (93), Expect = 0.016
 Identities = 20/68 (29%), Positives = 38/68 (55%)
 Frame = +3

Query: 327 RHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHV 506
           R ++++K  +NK +SS   EM   + ++QN + +  HN +  +G  +Y++  N F D+  
Sbjct: 34  RQFNDWKIQYNKKFSSEKEEMYRYLVFQQNAQLIEAHNND-KSGKYTYTMETNQFADLTE 92

Query: 507 TEYFGKVL 530
            E+  K L
Sbjct: 93  QEFAQKYL 100


>UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin K
           precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2);
           n=2; Tribolium castaneum|Rep: PREDICTED: similar to
           Cathepsin K precursor (Cathepsin O) (Cathepsin X)
           (Cathepsin O2) - Tribolium castaneum
          Length = 332

 Score = 41.1 bits (92), Expect = 0.021
 Identities = 18/65 (27%), Positives = 33/65 (50%)
 Frame = +3

Query: 303 LVHEHLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHL 482
           L+ E+L    W+++KA+H + +     E      + +NL  V  HN  +  G ++Y + +
Sbjct: 17  LLPENLVEEEWNKFKAMHARAFFDPLEETFRKSLFTKNLEIVEEHNERFRNGSETYEMGV 76

Query: 483 NHFGD 497
           N F D
Sbjct: 77  NKFSD 81


>UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor;
           n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase
           precursor - Diabrotica virgifera virgifera (western corn
           rootworm)
          Length = 326

 Score = 41.1 bits (92), Expect = 0.021
 Identities = 18/70 (25%), Positives = 35/70 (50%)
 Frame = +3

Query: 306 VHEHLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLN 485
           VH    +  W ++K  +NK Y +   E      ++ +LR++  HN +Y  G+ ++ L + 
Sbjct: 14  VHALSDKEEWVQFKVRNNKSYRNYIEEQKRFTIFQGSLRKIENHNDKYDHGLSTFKLGVT 73

Query: 486 HFGDMHVTEY 515
            F D+   E+
Sbjct: 74  KFADLTEKEF 83


>UniRef50_Q4N5Z7 Cluster: Cysteine proteinase, putative; n=2;
           Theileria|Rep: Cysteine proteinase, putative - Theileria
           parva
          Length = 440

 Score = 41.1 bits (92), Expect = 0.021
 Identities = 24/94 (25%), Positives = 48/94 (51%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           ++++   H+K +++  H+  +   +R NL  +  HN +      SY+ ++NHFGD+   +
Sbjct: 156 FNDFNKQHDKKHNNYRHKKTSYTNFRNNLNDINEHNAK---PNLSYTKNMNHFGDISSKD 212

Query: 513 YFGKVLKLIKAFPLFDPAEDHHKTAYRHNRRCKV 614
           +  +  K +    L +  +DH  T Y +NR   V
Sbjct: 213 FMKRYTKKV----LLNLPKDHVST-YNNNRPMSV 241


>UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome
           shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
           Chromosome 21 SCAF14577, whole genome shotgun sequence -
           Tetraodon nigroviridis (Green puffer)
          Length = 406

 Score = 40.7 bits (91), Expect = 0.027
 Identities = 17/32 (53%), Positives = 21/32 (65%)
 Frame = +3

Query: 405 WRQNLRRVARHNREYLAGIQSYSLHLNHFGDM 500
           W +N R VARHN E  AG  S++L LNH  D+
Sbjct: 74  WERNARLVARHNLEASAGKHSFTLELNHLADL 105


>UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:
           Silicatein beta - Suberites domuncula (Sponge)
          Length = 383

 Score = 40.7 bits (91), Expect = 0.027
 Identities = 18/61 (29%), Positives = 30/61 (49%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W ++   H+K+YS     +     WR N   + +HN+   A    Y+L +N FGD+   E
Sbjct: 56  WKQWTTDHHKVYSDVRERVDKYTVWRANKEYIDQHNQN--AQRLGYTLKMNKFGDLTTKE 113

Query: 513 Y 515
           +
Sbjct: 114 F 114


>UniRef50_Q23H06 Cluster: Papain family cysteine protease containing
           protein; n=18; Tetrahymena thermophila|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 349

 Score = 40.7 bits (91), Expect = 0.027
 Identities = 18/66 (27%), Positives = 38/66 (57%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           ++++ + H ++Y + H ++   + + +NL+++  HN        +YS+HLN F DM   E
Sbjct: 29  YNKWSSEHQRVYLNEHEKLFRQMVFFENLQKIQDHNSN---PNNTYSIHLNQFSDMTKQE 85

Query: 513 YFGKVL 530
           +  K+L
Sbjct: 86  FAEKIL 91


>UniRef50_UPI0000E492F4 Cluster: PREDICTED: similar to cathepsin
           L-like proteinase; n=2; Strongylocentrotus
           purpuratus|Rep: PREDICTED: similar to cathepsin L-like
           proteinase - Strongylocentrotus purpuratus
          Length = 329

 Score = 40.3 bits (90), Expect = 0.036
 Identities = 18/64 (28%), Positives = 33/64 (51%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W  +KA +++ Y +   E+     W +N R V  +NR Y  G +S+ + +N F D  +++
Sbjct: 28  WTSWKAQYSRRYYTKEEELVRWKSWVKNNRLVDENNRAYDEGRRSFKMAMNEFADQDMSK 87

Query: 513 YFGK 524
              K
Sbjct: 88  VRNK 91


>UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1;
           Dictyostelium discoideum AX4|Rep: Counting factor
           associated protein - Dictyostelium discoideum AX4
          Length = 531

 Score = 40.3 bits (90), Expect = 0.036
 Identities = 30/93 (32%), Positives = 42/93 (45%)
 Frame = +3

Query: 300 LLVHEHLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLH 479
           LL  E      + EYKA +NK YSS        I ++   + +A HN    A   SY L 
Sbjct: 214 LLAKEEQASNLFKEYKAQYNKEYSSQDEHDERFINFKAARKIIATHN----AKESSYKLG 269

Query: 480 LNHFGDMHVTEYFGKVLKLIKAFPLFDPAEDHH 578
           +NH+ D+   E F  ++K   A P    A+  H
Sbjct: 270 MNHYADLSNKE-FNTLVKPKVARPSVTGADSVH 301


>UniRef50_A7LFV3 Cluster: Cathepsin L protease inhibitor 2; n=1;
           Diaprepes abbreviatus|Rep: Cathepsin L protease
           inhibitor 2 - Diaprepes abbreviatus (Sugarcane rootstalk
           borer weevil)
          Length = 91

 Score = 40.3 bits (90), Expect = 0.036
 Identities = 18/76 (23%), Positives = 38/76 (50%)
 Frame = +3

Query: 324 RRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMH 503
           +  W ++K   N+ Y S+  E      ++QNL+ +  HN ++  G  +++  +N F D+ 
Sbjct: 14  QEEWEKFKTGFNRNYDSSDEEAKRFNIFQQNLQSIREHNEKFERGETTFTQGINQFTDLT 73

Query: 504 VTEYFGKVLKLIKAFP 551
             E+  +   L++  P
Sbjct: 74  KEEFKARHTGLLRRPP 89


>UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep:
           MGC107932 protein - Xenopus tropicalis (Western clawed
           frog) (Silurana tropicalis)
          Length = 333

 Score = 39.5 bits (88), Expect = 0.063
 Identities = 18/70 (25%), Positives = 33/70 (47%)
 Frame = +3

Query: 315 HLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFG 494
           H   + W+ +K+ + K Y +   E+     W     +V +HN+    G++SY + +N F 
Sbjct: 21  HFLDQEWNAWKSKYEKKYVTLDKELNRRKAWEATWEKVQKHNQLADQGLKSYRMAMNQFA 80

Query: 495 DMHVTEYFGK 524
           D+   E   K
Sbjct: 81  DLTDNERSSK 90


>UniRef50_UPI00001CC928 Cluster: PREDICTED: similar to CTLA-2-beta
           protein precursor; n=2; Rattus norvegicus|Rep:
           PREDICTED: similar to CTLA-2-beta protein precursor -
           Rattus norvegicus
          Length = 113

 Score = 39.1 bits (87), Expect = 0.084
 Identities = 17/61 (27%), Positives = 28/61 (45%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W E+K    K YS         + W ++ + +  HN +Y  G  S+ + LN F D+   E
Sbjct: 16  WEEWKKKFGKTYSPDEERHRRAV-WEESKKTIEAHNADYKQGKTSFYMGLNQFSDLTTEE 74

Query: 513 Y 515
           +
Sbjct: 75  F 75


>UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate
           cathepsin L; n=4; Danio rerio|Rep: Novel protein similar
           to vertebrate cathepsin L - Danio rerio (Zebrafish)
           (Brachydanio rerio)
          Length = 334

 Score = 39.1 bits (87), Expect = 0.084
 Identities = 13/61 (21%), Positives = 31/61 (50%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W+ +K  H   Y     ++     W  N++++ ++N ++  G+  + + +N +GD+   E
Sbjct: 26  WNLWKKKHEISYDEESEDVHRKTIWETNMQKIWKNNNDFSFGLSMFKMAMNKYGDLTSVE 85

Query: 513 Y 515
           Y
Sbjct: 86  Y 86


>UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;
           Brugia malayi|Rep: Cahepsin L-like cysteine protease -
           Brugia malayi (Filarial nematode worm)
          Length = 371

 Score = 39.1 bits (87), Expect = 0.084
 Identities = 22/81 (27%), Positives = 39/81 (48%)
 Frame = +3

Query: 273 LMEKHVLKGLLVHEHLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYL 452
           LMEK +    L+ ++   R    YK  +NK     + E    + + +N++ + +HN  Y 
Sbjct: 37  LMEKKLGSKRLIKQYASYRL---YKRKYNKRDEEINLEHRRFMTYLKNVKEIEKHNERYE 93

Query: 453 AGIQSYSLHLNHFGDMHVTEY 515
              ++Y L +NH  DM   E+
Sbjct: 94  RNEETYELAINHLADMLPEEF 114


>UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain;
           n=16; Chrysomelidae|Rep: Digestive cysteine protease
           intestain - Leptinotarsa decemlineata (Colorado potato
           beetle)
          Length = 326

 Score = 39.1 bits (87), Expect = 0.084
 Identities = 20/67 (29%), Positives = 32/67 (47%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W  +K  H K Y +   E      +++NL ++  HN  Y  G ++Y L +  F D+   E
Sbjct: 23  WIAFKQTHGKTYKNLLEEKTRFGIFQRNLIKIKEHNARYDKGEETYLLGVTRFADL-THE 81

Query: 513 YFGKVLK 533
            F  +LK
Sbjct: 82  EFKDILK 88


>UniRef50_UPI0000D566ED Cluster: PREDICTED: similar to CTLA-2-alpha
           protein precursor; n=1; Tribolium castaneum|Rep:
           PREDICTED: similar to CTLA-2-alpha protein precursor -
           Tribolium castaneum
          Length = 101

 Score = 38.7 bits (86), Expect = 0.11
 Identities = 15/56 (26%), Positives = 32/56 (57%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDM 500
           ++E+K  + K Y+  + E      + +NL ++  HN++Y  G  +Y++ +N F D+
Sbjct: 29  FNEFKTKYGKTYADANEENFRKQLFAKNLEKIEEHNKKYEQGQVTYTMGVNQFSDL 84


>UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila
           melanogaster|Rep: CG11459-PA - Drosophila melanogaster
           (Fruit fly)
          Length = 336

 Score = 38.3 bits (85), Expect = 0.15
 Identities = 20/55 (36%), Positives = 29/55 (52%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGD 497
           W +YKA +NK Y +      AL  + Q +  V  HN+ YL G  ++ + LN F D
Sbjct: 30  WDQYKAKYNKQYRNRDKYHRAL--YEQRVLAVESHNQLYLQGKVAFKMGLNKFSD 82


>UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted,
           possible transmembrane domain near N-terminus; n=4;
           Cryptosporidium|Rep: Cryptopain-cysteine proteinase
           secreted, possible transmembrane domain near N-terminus
           - Cryptosporidium parvum Iowa II
          Length = 401

 Score = 38.3 bits (85), Expect = 0.15
 Identities = 21/73 (28%), Positives = 36/73 (49%)
 Frame = +3

Query: 324 RRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMH 503
           R+ + E+K  ++K+YSS   E      ++QN+  +   N +      SY L +N FGD+ 
Sbjct: 83  RKSFEEFKKKYHKVYSSMEEENQRFEIYKQNMNFIKTTNSQGF----SYVLEMNEFGDLS 138

Query: 504 VTEYFGKVLKLIK 542
             E+  +    IK
Sbjct: 139 KEEFMARFTGYIK 151


>UniRef50_A0EI50 Cluster: Chromosome undetermined scaffold_98, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_98,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 336

 Score = 38.3 bits (85), Expect = 0.15
 Identities = 21/73 (28%), Positives = 35/73 (47%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           + ++K  H KLY          I + QNL+ V  HN  Y  G++++ +  N F D+   E
Sbjct: 29  YSKWKQHHQKLYQGVEDTYRKQI-FHQNLQIVNDHNARYNQGLENFEIEANQFADLTFDE 87

Query: 513 YFGKVLKLIKAFP 551
           +    L L  ++P
Sbjct: 88  F--SSLYLYSSYP 98


>UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor;
           n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 355

 Score = 38.3 bits (85), Expect = 0.15
 Identities = 22/63 (34%), Positives = 32/63 (50%)
 Frame = +3

Query: 354 HNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTEYFGKVLK 533
           H+K Y S   ++     +R+NL  + + N E    I SY L LN F D+   E+ G+ L 
Sbjct: 58  HSKAYKSVEEKVHRFEVFRENLMHIDQRNNE----INSYWLGLNEFADLTHEEFKGRYLG 113

Query: 534 LIK 542
           L K
Sbjct: 114 LAK 116


>UniRef50_Q23H10 Cluster: Papain family cysteine protease containing
           protein; n=14; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 336

 Score = 37.9 bits (84), Expect = 0.19
 Identities = 16/66 (24%), Positives = 38/66 (57%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           ++++ + + ++Y + H ++   + + +N +++  HN +      +YS+HLN F DM   E
Sbjct: 29  YNQWSSQNQRVYLNEHEKLFRQMVFFENFQKIQEHNSD---PNNTYSVHLNQFSDMTKEE 85

Query: 513 YFGKVL 530
           +  K+L
Sbjct: 86  FAEKIL 91


>UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicatein
           a3 - Lubomirskia baicalensis
          Length = 344

 Score = 37.9 bits (84), Expect = 0.19
 Identities = 20/68 (29%), Positives = 31/68 (45%)
 Frame = +3

Query: 327 RHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHV 506
           + W  +K  H + Y S   EM     W  N + +  HN    A +  Y+L +N FGD+  
Sbjct: 42  QEWSVWKGHHQRSYESQLQEMERHSIWVANKKYIEHHNAN--ADLFGYTLAMNGFGDLMS 99

Query: 507 TEYFGKVL 530
            E+  + L
Sbjct: 100 AEFTERYL 107


>UniRef50_A0M6M0 Cluster: Protein containing DUF28; n=3;
           Flavobacteriaceae|Rep: Protein containing DUF28 -
           Gramella forsetii (strain KT0803)
          Length = 252

 Score = 37.1 bits (82), Expect = 0.34
 Identities = 22/62 (35%), Positives = 38/62 (61%), Gaps = 3/62 (4%)
 Frame = +3

Query: 105 KIKSLIMKYKNNRVHRSPMTSLDGFVQDALMFFDRTERFK---DVLNVYNYSECVGDEGL 275
           K++ L ++ KN+ V R P+ +++  V+DA    D  E+F+   DV NVY+  E + DE +
Sbjct: 188 KLEDLEIESKNSEVQRIPLNTVELPVEDAQKILDLVEKFEDDDDVQNVYHNLE-ITDELI 246

Query: 276 ME 281
           +E
Sbjct: 247 LE 248


>UniRef50_UPI000155C322 Cluster: PREDICTED: similar to cathepsin L,
           partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED:
           similar to cathepsin L, partial - Ornithorhynchus
           anatinus
          Length = 197

 Score = 36.7 bits (81), Expect = 0.45
 Identities = 14/32 (43%), Positives = 20/32 (62%)
 Frame = +3

Query: 405 WRQNLRRVARHNREYLAGIQSYSLHLNHFGDM 500
           W  NLRR+  HN E+  G  ++ L +N FGD+
Sbjct: 35  WEDNLRRIEAHNLEHGLGRTTFRLAINRFGDL 66


>UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 514

 Score = 36.7 bits (81), Expect = 0.45
 Identities = 22/63 (34%), Positives = 30/63 (47%)
 Frame = +3

Query: 327 RHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHV 506
           R + +Y+  HNK Y S H        +R N+R +   NR+ L     Y L  NHF D+  
Sbjct: 218 RMYRKYQGQHNKQYDSEHEVSKRKHIFRHNMRYIRSINRKNL----KYKLAPNHFVDLTD 273

Query: 507 TEY 515
            EY
Sbjct: 274 GEY 276


>UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 513

 Score = 36.7 bits (81), Expect = 0.45
 Identities = 24/87 (27%), Positives = 41/87 (47%)
 Frame = +3

Query: 240 YNYSECVGDEGLMEKHVLKGLLVHEHLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNL 419
           +++ + VG  G  E + +   + H  +    ++ +KA + K Y S H        +R N+
Sbjct: 181 FSHEKNVGAVG--EINPMFEFMPHTAVQHHLFNAFKASYRKRYPSAHEHEKRKDIYRHNM 238

Query: 420 RRVARHNREYLAGIQSYSLHLNHFGDM 500
           R +   NR++L     YSL  NH  DM
Sbjct: 239 RFIKSRNRQHL----GYSLKPNHMADM 261


>UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_21,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 349

 Score = 36.7 bits (81), Expect = 0.45
 Identities = 18/67 (26%), Positives = 35/67 (52%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           +  ++  H K Y+   +     I +++N + +  H +   AG++++ L LN F D+ V E
Sbjct: 40  YQNWQKEHGKRYTQFENSHRFGI-FKKNYQYIQEHQQRVEAGLETFELGLNDFADLSVEE 98

Query: 513 YFGKVLK 533
           +  K LK
Sbjct: 99  FEAKYLK 105


>UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184,
           whole genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_184,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 331

 Score = 36.7 bits (81), Expect = 0.45
 Identities = 19/65 (29%), Positives = 33/65 (50%)
 Frame = +3

Query: 342 YKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTEYFG 521
           +K +H K YS    E+     + QNL  V  HN ++  G ++++L +N + D+   E+  
Sbjct: 37  WKQLHGKRYSD-FEEVHRFSVFAQNLAVVMEHNSKFELGQETFTLGMNQYADLTPEEFQA 95

Query: 522 KVLKL 536
             L L
Sbjct: 96  SFLTL 100


>UniRef50_A1ZBK7 Cluster: CG10460-PA; n=1; Drosophila
           melanogaster|Rep: CG10460-PA - Drosophila melanogaster
           (Fruit fly)
          Length = 79

 Score = 36.3 bits (80), Expect = 0.59
 Identities = 17/61 (27%), Positives = 32/61 (52%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W EYK+  +K Y +    M   I + ++  R+  HNR++  G  ++ + +NH  D+   E
Sbjct: 9   WVEYKSKFDKNYEAEEDLMRRRI-YAESKARIEEHNRKFEKGEVTWKMGINHLADLTPEE 67

Query: 513 Y 515
           +
Sbjct: 68  F 68


>UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3;
           Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays
           (Maize)
          Length = 493

 Score = 35.9 bits (79), Expect = 0.78
 Identities = 16/42 (38%), Positives = 24/42 (57%)
 Frame = +3

Query: 405 WRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTEYFGKVL 530
           +R NLR +  HN E  AG+  + L L  F D+ + EY  ++L
Sbjct: 96  FRDNLRYIDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLL 137


>UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep:
           Toxopain-2 - Toxoplasma gondii
          Length = 422

 Score = 35.9 bits (79), Expect = 0.78
 Identities = 20/68 (29%), Positives = 36/68 (52%)
 Frame = +3

Query: 342 YKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTEYFG 521
           ++A++ K Y++   +      ++ NL  +  HN++      SYSL +NHFGD+   E+  
Sbjct: 120 FQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY----SYSLKMNHFGDLSRDEFRR 175

Query: 522 KVLKLIKA 545
           K L   K+
Sbjct: 176 KYLGFKKS 183


>UniRef50_A5Z9W8 Cluster: Putative uncharacterized protein; n=1;
           Eubacterium ventriosum ATCC 27560|Rep: Putative
           uncharacterized protein - Eubacterium ventriosum ATCC
           27560
          Length = 460

 Score = 35.5 bits (78), Expect = 1.0
 Identities = 26/118 (22%), Positives = 53/118 (44%), Gaps = 8/118 (6%)
 Frame = +3

Query: 93  ENTLKI----KSLIMKYKNNRVHRSPMTSLDGFVQDALMFFDRTERFKDVLNVYNYSECV 260
           EN++K     K  +MKYK   ++    ++  G+  D  ++  + +++    N+   +E +
Sbjct: 228 ENSIKAANETKGTVMKYKGKLINAYYFSTSWGYTTDYRIWGIKKQKYLKETNLTTITENI 287

Query: 261 GDEGLMEKHVL-KGLLVHEHLPRRHWHEY---KAIHNKLYSSTHHEMAALIKWRQNLR 422
            DE + +K++  K   V +  P   W  Y   K I N +Y +    +  + +   N R
Sbjct: 288 SDEKIFDKYIKEKPKSVEKKSPFYRWTTYLTTKQIENSIYKNMAVNVGTINRMEINKR 345


>UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2;
           Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio
           molitor (Yellow mealworm)
          Length = 336

 Score = 35.5 bits (78), Expect = 1.0
 Identities = 16/56 (28%), Positives = 27/56 (48%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDM 500
           W  +K  + + Y +   E      +++ L     HN +Y  G+ SY+L +N F DM
Sbjct: 27  WENFKTTYARSYVNAKEETFRKQIFQKKLETFEEHNEKYRQGLVSYTLGVNLFTDM 82


>UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase;
           n=21; Bilateria|Rep: Cathepsin L-like cysteine
           proteinase - Globodera pallida
          Length = 379

 Score = 35.5 bits (78), Expect = 1.0
 Identities = 16/62 (25%), Positives = 31/62 (50%), Gaps = 1/62 (1%)
 Frame = +3

Query: 333 WHEYKAIHN-KLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVT 509
           W+ YK  H  K Y+    E   ++ +    + + +HN+ Y+ G  ++ +  NH  D+  +
Sbjct: 70  WNAYKQKHGRKAYADQDVENERMLTYLSAKQFIDKHNQAYIEGKVTFRVGENHIADLPFS 129

Query: 510 EY 515
           EY
Sbjct: 130 EY 131


>UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine
           protease; n=1; Maconellicoccus hirsutus|Rep: Putative
           cathepsin L-like cysteine protease - Maconellicoccus
           hirsutus (hibiscus mealybug)
          Length = 339

 Score = 35.5 bits (78), Expect = 1.0
 Identities = 16/71 (22%), Positives = 37/71 (52%)
 Frame = +3

Query: 315 HLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFG 494
           +L    W  +K  ++K Y++   +   +  +  N  R+A+HN+ +  G+ ++   +N + 
Sbjct: 23  NLFHEEWQLFKTQYSKKYTTDIEDRLRMKIFIDNKYRIAQHNKLFHKGLVTFEQGINEYS 82

Query: 495 DMHVTEYFGKV 527
           DM  +E+  K+
Sbjct: 83  DMLQSEFNEKM 93


>UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsin L
           - Suberites domuncula (Sponge)
          Length = 324

 Score = 35.1 bits (77), Expect = 1.4
 Identities = 17/61 (27%), Positives = 30/61 (49%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W  +K  H+K Y+    E+     W+ N + +  HN   ++    Y+L +N FGD+   E
Sbjct: 23  WVAWKQEHSKEYTEELEELRRHTIWQSNKKFIDSHNS--VSDKFGYTLEMNEFGDLSGVE 80

Query: 513 Y 515
           +
Sbjct: 81  F 81


>UniRef50_Q23FR0 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 394

 Score = 35.1 bits (77), Expect = 1.4
 Identities = 16/66 (24%), Positives = 38/66 (57%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           +++++  H ++Y + H ++   + + +NL +V  HN++  A   +Y++ LN F D    E
Sbjct: 36  YNQWRNKHQRVYLNEHEQLFRQLIFLENLAKVNEHNQKSNA---TYTIGLNKFSDFTQEE 92

Query: 513 YFGKVL 530
           +  ++L
Sbjct: 93  FKHRIL 98


>UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin S
           preproprotein; n=2; Tribolium castaneum|Rep: PREDICTED:
           similar to cathepsin S preproprotein - Tribolium
           castaneum
          Length = 525

 Score = 34.3 bits (75), Expect = 2.4
 Identities = 12/57 (21%), Positives = 25/57 (43%)
 Frame = +3

Query: 327 RHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGD 497
           + W  +K  + + Y +   E      + +  + +  HN  Y  G+++Y L +N   D
Sbjct: 223 KEWENFKRKYERRYPNLEEENFRRAIFEKTFQEIKHHNERYRKGLETYYLRINDLSD 279


>UniRef50_A4L250 Cluster: Putative uncharacterized protein; n=1;
           Gryllus bimaculatus nudivirus|Rep: Putative
           uncharacterized protein - Gryllus bimaculatus nudivirus
          Length = 287

 Score = 33.9 bits (74), Expect = 3.1
 Identities = 18/73 (24%), Positives = 35/73 (47%), Gaps = 1/73 (1%)
 Frame = +3

Query: 273 LMEKHV-LKGLLVHEHLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREY 449
           L++K+V   G++ +    + H    +  H KLY S  H+    ++    LR+ A H R  
Sbjct: 192 LVKKYVTFYGVIQNNAFIQLHHDNQEVRHTKLYDSLQHKKHVKLEKEHVLRKTATHKRNI 251

Query: 450 LAGIQSYSLHLNH 488
                + ++H++H
Sbjct: 252 AIDELTKAIHVHH 264


>UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3;
           Curculionidae|Rep: Cysteine proteinase - Hypera postica
           (alfalfa weevil)
          Length = 324

 Score = 33.9 bits (74), Expect = 3.1
 Identities = 18/58 (31%), Positives = 25/58 (43%)
 Frame = +3

Query: 342 YKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTEY 515
           +K  H K Y +   E      +  N+R +  HN  Y  G  SY   +N F DM   E+
Sbjct: 29  FKLEHGKTYLNQAEESKRFNIFTDNVRAIEAHNALYEQGKVSYKKGINKFTDMSQEEF 86


>UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep:
           Cathepsin - Geodia cydonium (Sponge)
          Length = 322

 Score = 33.9 bits (74), Expect = 3.1
 Identities = 17/61 (27%), Positives = 29/61 (47%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W ++K  +NK YSS   +      W  NL+ V   + E     + Y++ +N F D+   E
Sbjct: 19  WEQWKLKYNKQYSSQEEDYLRQRVWLSNLKFVEEFDSER----EGYTVAMNEFADLDPRE 74

Query: 513 Y 515
           +
Sbjct: 75  F 75


>UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC06231 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 372

 Score = 33.9 bits (74), Expect = 3.1
 Identities = 17/61 (27%), Positives = 28/61 (45%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W  +K    + Y +   E    + +  N  ++  HNR Y  G  +Y + +N+F D   TE
Sbjct: 62  WKFFKINFKRAYGNVMEETKRFLIFGTNFIKMMEHNRAYQEGKATYKMGVNNFTDK--TE 119

Query: 513 Y 515
           Y
Sbjct: 120 Y 120


>UniRef50_Q23ZE2 Cluster: Putative uncharacterized protein; n=1;
            Tetrahymena thermophila SB210|Rep: Putative
            uncharacterized protein - Tetrahymena thermophila SB210
          Length = 1108

 Score = 33.9 bits (74), Expect = 3.1
 Identities = 34/156 (21%), Positives = 71/156 (45%), Gaps = 6/156 (3%)
 Frame = +3

Query: 69   LVNCVNITENTLKIKSLIMKYKN-NRVHRSPMTSLDGFVQDALMFFDRTERFKDVLNVY- 242
            L+NC+   E  ++ K +   Y+N ++++ + +   +  +Q +       + F  + N+Y 
Sbjct: 639  LLNCIKKNEQNIQSKKIHQNYQNSSQINLNCIQKNEQNIQ-SKKIHQNHQNFSQI-NIYS 696

Query: 243  -NYSECVGDEGLMEK-HVLKGLLVHEHLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQN 416
             N+ E + +  L E  + +K LL++         + K IH    +S+   +  + K  QN
Sbjct: 697  RNHQELLNENYLKEYFYCIKYLLLNCIKKNEQNIQSKKIHQNHQNSSQINLNCIKKNEQN 756

Query: 417  LRRVARH-NREYLAGIQSYSL-HLNHFGDMHVTEYF 518
            ++    H N +  + I  YS  H     + ++ EYF
Sbjct: 757  IQSKKIHQNHQNSSQINIYSRNHQELLNENYLKEYF 792


>UniRef50_Q42312 Cluster: Cysteine protease; n=1; Arabidopsis
           thaliana|Rep: Cysteine protease - Arabidopsis thaliana
           (Mouse-ear cress)
          Length = 105

 Score = 33.5 bits (73), Expect = 4.2
 Identities = 20/54 (37%), Positives = 26/54 (48%)
 Frame = +3

Query: 354 HNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTEY 515
           H K+Y S   +   L  +  NLR +   N E L    SY L L  FGD+ + EY
Sbjct: 56  HGKVYGSVAEKERRLTIFEDNLRFINNRNAENL----SYRLGLTGFGDLSLHEY 105


>UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum
           aestivum|Rep: Thiol protease - Triticum aestivum (Wheat)
          Length = 374

 Score = 33.5 bits (73), Expect = 4.2
 Identities = 20/73 (27%), Positives = 34/73 (46%)
 Frame = +3

Query: 297 GLLVHEHLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSL 476
           G L    L    +H + A H K Y+    ++     +R+N+  +   NR+   G  SY+L
Sbjct: 38  GALEDSLLMMERFHGWMAKHGKSYAGVEEKLRRFDIFRRNVEFIEAANRD---GRLSYTL 94

Query: 477 HLNHFGDMHVTEY 515
            +N F D+   E+
Sbjct: 95  GVNQFADLTHEEF 107


>UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1;
           Rhipicephalus appendiculatus|Rep: Midgut cysteine
           proteinase 4 - Rhipicephalus appendiculatus (Brown ear
           tick)
          Length = 345

 Score = 33.5 bits (73), Expect = 4.2
 Identities = 14/56 (25%), Positives = 29/56 (51%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDM 500
           W +++ I+NK Y ++   +     +R+    +   + ++  G   YS+ +NHF DM
Sbjct: 38  WDKFRKIYNKTYGTSEETVYREQVFRRTFNFLRTVDEKFKNGTLLYSVAVNHFADM 93


>UniRef50_Q24FA8 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 335

 Score = 33.5 bits (73), Expect = 4.2
 Identities = 20/83 (24%), Positives = 46/83 (55%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           +++++  + K+YSS   ++     + +N + V  HN+       +YS+ +N F D+ + E
Sbjct: 32  YNKWREENGKVYSSEAEKIYRQSVFLENYQSVQEHNKN---SNHTYSVGINQFSDITLQE 88

Query: 513 YFGKVLKLIKAFPLFDPAEDHHK 581
           Y  ++  L+K  PL + A++ ++
Sbjct: 89  YQQRI--LMKNSPLNELAKNKNR 109


>UniRef50_A0DV90 Cluster: Chromosome undetermined scaffold_65, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_65,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 581

 Score = 33.5 bits (73), Expect = 4.2
 Identities = 23/89 (25%), Positives = 43/89 (48%), Gaps = 9/89 (10%)
 Frame = +3

Query: 57  LQVSLVNCVNITENTLKIKSLIMKYKNNRVHRSPMTSL-DGFVQDALMFF----DRTERF 221
           L+  ++NC+N   NTL+  S  ++YKN +  +     L + F+Q     F    D T+ F
Sbjct: 190 LRCDILNCLNSVVNTLQPTSNRIRYKNTQNQQQQFNLLKNDFIQSLNCLFQKLTDLTQEF 249

Query: 222 KDV----LNVYNYSECVGDEGLMEKHVLK 296
           + V     ++Y +    G + L +  V++
Sbjct: 250 QSVEDMKRSLYQFGSLDGSQSLKQSQVVQ 278


>UniRef50_A2YT27 Cluster: Putative uncharacterized protein; n=1;
           Oryza sativa (indica cultivar-group)|Rep: Putative
           uncharacterized protein - Oryza sativa subsp. indica
           (Rice)
          Length = 93

 Score = 33.1 bits (72), Expect = 5.5
 Identities = 23/71 (32%), Positives = 34/71 (47%)
 Frame = +3

Query: 324 RRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMH 503
           RR + E+KA + K Y+S   E      +R+  R V +HN    AG  SY + LN   D H
Sbjct: 28  RRMFVEWKAKYAKAYASIAEEECRYAVFRETRRAVDQHN----AGFHSYRVGLNAV-DQH 82

Query: 504 VTEYFGKVLKL 536
              +   +L +
Sbjct: 83  NAGFHSSMLAM 93


>UniRef50_Q950M8 Cluster: Orf511; n=1; Rhizophydium sp. 136|Rep:
           Orf511 - Rhizophydium sp. 136
          Length = 511

 Score = 33.1 bits (72), Expect = 5.5
 Identities = 21/109 (19%), Positives = 49/109 (44%), Gaps = 1/109 (0%)
 Frame = +3

Query: 24  SITKMMLIVLLLQVSLVNCVNITENTLKIKSLIMKY-KNNRVHRSPMTSLDGFVQDALMF 200
           +I  + LI L   + + + + +  +    K+      KNN +H  P+T  D     AL F
Sbjct: 26  NIINLDLIELRFNIDINSNIEVLNSIFLCKNFSSSTNKNNLLHPIPITGDDRITLPALEF 85

Query: 201 FDRTERFKDVLNVYNYSECVGDEGLMEKHVLKGLLVHEHLPRRHWHEYK 347
            ++ + +K++   Y ++ C     L+ +    G+ + ++   + + + K
Sbjct: 86  KEKLKEYKNLPGCYIFTNCNNGYQLIGESKDLGIRLKDYFSEKEFRKRK 134


>UniRef50_Q9Y244 Cluster: Proteasome maturation protein; n=29;
           Euteleostomi|Rep: Proteasome maturation protein - Homo
           sapiens (Human)
          Length = 141

 Score = 33.1 bits (72), Expect = 5.5
 Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 1/64 (1%)
 Frame = +3

Query: 117 LIMKYKN-NRVHRSPMTSLDGFVQDALMFFDRTERFKDVLNVYNYSECVGDEGLMEKHVL 293
           L M++K   +V R P  S      D L   D T  F+D+LN  + SE +G+  LM ++ L
Sbjct: 79  LQMEFKAVQQVQRLPFLSSSNLSLDVLRGNDETIGFEDILNDPSQSEVMGEPHLMVEYKL 138

Query: 294 KGLL 305
            GLL
Sbjct: 139 -GLL 141


>UniRef50_Q5NIG4 Cluster: CCA-adding enzyme; n=12; Francisella
           tularensis|Rep: CCA-adding enzyme - Francisella
           tularensis subsp. tularensis
          Length = 360

 Score = 33.1 bits (72), Expect = 5.5
 Identities = 30/88 (34%), Positives = 46/88 (52%), Gaps = 1/88 (1%)
 Frame = +3

Query: 138 NRVHRSPMTSLDGFVQDALMFFDRTERFKDVLNVYNYSECVGDEGLMEKHVLKGLLVHEH 317
           NR+ R   TS+  F++D L    R  RFK  L+ +N+S       L+++ V  G L H  
Sbjct: 118 NRILRH--TSI-AFIEDPLRVV-RLARFKAQLSNFNFSIAQEMLALIKELVKTGELNHLT 173

Query: 318 LPRRHWHEYKAIHN-KLYSSTHHEMAAL 398
             R H    KA++N K++ +T  E+ AL
Sbjct: 174 RERLHIEFVKALNNPKIFFTTLKELEAL 201


>UniRef50_UPI00006CA466 Cluster: oxidoreductase, zinc-binding
           dehydrogenase family protein; n=1; Tetrahymena
           thermophila SB210|Rep: oxidoreductase, zinc-binding
           dehydrogenase family protein - Tetrahymena thermophila
           SB210
          Length = 330

 Score = 32.7 bits (71), Expect = 7.3
 Identities = 21/67 (31%), Positives = 35/67 (52%), Gaps = 2/67 (2%)
 Frame = +3

Query: 369 SSTHHEMAALIKWRQNLRRVAR-HNREYLAGIQSY-SLHLNHFGDMHVTEYFGKVLKLIK 542
           SS    MA  +  ++ ++ +A  H + YL  I+   + H+ H  D H+ E+  +V+   K
Sbjct: 154 SSAVARMAIKLFHQEGIKSIAIVHEKNYLEEIKEIGATHVFHDQDEHLVEHLQEVIAKEK 213

Query: 543 AFPLFDP 563
           A  LFDP
Sbjct: 214 AKMLFDP 220


>UniRef50_Q5GZY7 Cluster: Putative uncharacterized protein; n=1;
           Xanthomonas oryzae pv. oryzae|Rep: Putative
           uncharacterized protein - Xanthomonas oryzae pv. oryzae
          Length = 356

 Score = 32.7 bits (71), Expect = 7.3
 Identities = 20/65 (30%), Positives = 32/65 (49%)
 Frame = +3

Query: 309 HEHLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNH 488
           H H  RR  H+++A H +L   T H +AA+   RQ  R+   H  ++ A  +     + H
Sbjct: 64  HGHDHRRDHHDHRANHVRL-GQTFHRLAAVDVQRQR-RQEEHHRGDHRARERLVDRTIEH 121

Query: 489 FGDMH 503
           F  +H
Sbjct: 122 FQRLH 126


>UniRef50_A3IAT4 Cluster: Acyl-CoA dehydrogenase-like protein; n=1;
           Bacillus sp. B14905|Rep: Acyl-CoA dehydrogenase-like
           protein - Bacillus sp. B14905
          Length = 343

 Score = 32.7 bits (71), Expect = 7.3
 Identities = 22/73 (30%), Positives = 39/73 (53%), Gaps = 2/73 (2%)
 Frame = +3

Query: 78  CVNITENTLKIKSLIMKYKNNRVHRSP-MTSLDGFVQDALMFFDRTER-FKDVLNVYNYS 251
           C+ ITEN L+   +++K +N+  HR+  M +L   +      F + E+ F   L+VY + 
Sbjct: 218 CLGITENFLEEAFILIKQRNHDAHRAERMGALQFLLMQQQKHFKQFEKQFYTTLSVY-WQ 276

Query: 252 ECVGDEGLMEKHV 290
           +   DE L E+ +
Sbjct: 277 KHQRDESLTEEEL 289


>UniRef50_Q22A69 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 330

 Score = 32.7 bits (71), Expect = 7.3
 Identities = 20/59 (33%), Positives = 32/59 (54%), Gaps = 3/59 (5%)
 Frame = +3

Query: 339 EYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNR--EYLAGIQSYS-LHLNHFGDMHV 506
           ++   +NK YSS  H  A L  +++NLRR+   N+  E   GI  ++ L    F DM++
Sbjct: 32  KFTQTYNKKYSSEEHYNARLSIFKENLRRIELFNKNDEAQHGITQFADLTHEEFADMYL 90


>UniRef50_UPI00004D962D Cluster: UPI00004D962D related cluster; n=1;
           Xenopus tropicalis|Rep: UPI00004D962D UniRef100 entry -
           Xenopus tropicalis
          Length = 725

 Score = 32.3 bits (70), Expect = 9.6
 Identities = 23/87 (26%), Positives = 40/87 (45%)
 Frame = +3

Query: 192 LMFFDRTERFKDVLNVYNYSECVGDEGLMEKHVLKGLLVHEHLPRRHWHEYKAIHNKLYS 371
           + F +   RF  +  ++ +S+   D  +  K++  GL  HE   R H   Y  +H+K Y 
Sbjct: 457 ISFTNSHSRFDKIETLFQHSQDQHDVEINMKYIC-GLCDHEKDFRGH---YNGLHSKEYG 512

Query: 372 STHHEMAALIKWRQNLRRVARHNREYL 452
               +M +LIK  +    +A+ N   L
Sbjct: 513 FMPEQMQSLIKNEEQPLSIAKPNENCL 539


>UniRef50_Q47W97 Cluster: TPR domain/sulfotransferase domain
           protein; n=1; Colwellia psychrerythraea 34H|Rep: TPR
           domain/sulfotransferase domain protein - Colwellia
           psychrerythraea (strain 34H / ATCC BAA-681)
           (Vibrio psychroerythus)
          Length = 527

 Score = 32.3 bits (70), Expect = 9.6
 Identities = 16/57 (28%), Positives = 31/57 (54%), Gaps = 1/57 (1%)
 Frame = +3

Query: 363 LYSSTHHEMAALIKWRQNLRRVARHNREYLAGI-QSYSLHLNHFGDMHVTEYFGKVL 530
           + +S HH +A  I+  +    +A +N EYL+ + + Y+L  NH   ++  E   K++
Sbjct: 47  MIASAHHNLAKAIELIKQANSLAPNNPEYLSQLAKHYALENNHVEALYFAELAAKLI 103


>UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3;
           Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber
           officinale (Ginger)
          Length = 475

 Score = 32.3 bits (70), Expect = 9.6
 Identities = 18/67 (26%), Positives = 31/67 (46%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           + E++  H    +  +     L  +++NLR V  HN     G  +Y L +N F D+   E
Sbjct: 52  YQEWRVKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNRFADLTNEE 111

Query: 513 YFGKVLK 533
           Y  + L+
Sbjct: 112 YRARFLR 118


>UniRef50_Q5K600 Cluster: Env protein; n=3; Drosophila
           melanogaster|Rep: Env protein - Drosophila melanogaster
           (Fruit fly)
          Length = 550

 Score = 32.3 bits (70), Expect = 9.6
 Identities = 38/169 (22%), Positives = 74/169 (43%), Gaps = 3/169 (1%)
 Frame = +3

Query: 87  ITENTLKIKSLIMKYKNNRVHRSPMTSLDGFVQDALMFFDRTERFKDVLNVYNYSECVGD 266
           +T+  +K+K+L   Y+N R   + + SL   V   +   D  E  +++ N+   SE   D
Sbjct: 91  LTQAQVKLKALTPSYRNKRGLINGLGSLVKVVTGNMDANDNKEIHEELDNIKKNSEVSND 150

Query: 267 EGLMEKHVL--KGLLVHEHLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHN 440
              ++K V+    +L+       H +  + + +K + ++ ++    I    NL+      
Sbjct: 151 N--LQKQVMFNNEILIRFENITDHINNEQILISKFFDTSQNK----IYKHLNLQDTLLEE 204

Query: 441 REYLAGIQ-SYSLHLNHFGDMHVTEYFGKVLKLIKAFPLFDPAEDHHKT 584
            +YL  I  +  L +NH  D+  +    K+  +I  F L +   D  KT
Sbjct: 205 IQYLNRINYNIELFINHLNDITESMLLAKI-NIIPKFILNEQEMDKIKT 252


>UniRef50_Q231X3 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 323

 Score = 32.3 bits (70), Expect = 9.6
 Identities = 23/80 (28%), Positives = 36/80 (45%)
 Frame = +3

Query: 333 WHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTE 512
           W ++K  HNK Y +T +E   L  + +NL        E +   Q+ +  +  F D+   E
Sbjct: 39  WSQWKQKHNKRYENTDYESYRLEVFAENL--------EVVKNDQTGTYGITKFLDLTDDE 90

Query: 513 YFGKVLKLIKAFPLFDPAED 572
           + G  L L   +P    AED
Sbjct: 91  FAGNFLNLKAQYPEDSIAED 110


>UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteinase
           B; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like
           tick cysteine proteinase B - Haemaphysalis longicornis
           (Bush tick)
          Length = 332

 Score = 32.3 bits (70), Expect = 9.6
 Identities = 15/47 (31%), Positives = 23/47 (48%)
 Frame = +3

Query: 309 HEHLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNREY 449
           H+ L    W  +KA+H K  S         I + +N  ++ARHN +Y
Sbjct: 19  HQELVGAEWSAFKALHGKDTSRKQKSTTGWI-YMENRLKIARHNAKY 64


>UniRef50_Q9URY3 Cluster: GTPase activating protein; n=1;
           Schizosaccharomyces pombe|Rep: GTPase activating protein
           - Schizosaccharomyces pombe (Fission yeast)
          Length = 619

 Score = 32.3 bits (70), Expect = 9.6
 Identities = 30/120 (25%), Positives = 52/120 (43%), Gaps = 11/120 (9%)
 Frame = +3

Query: 207 RTERFKDVLNVYNYSECVG-----DEGLMEKHVLKG---LLVHEHLP--RRHWHEYKAIH 356
           R E+FKD+LN        G      +G+ +++ L+    +L+ E LP  R +W      H
Sbjct: 6   RIEKFKDILNSEEPISLPGLCSLCIQGIPDEYSLRAKAWMLMLEFLPTDRSNWQSVLEKH 65

Query: 357 NKLYSSTHHEMAALIKWRQ-NLRRVARHNREYLAGIQSYSLHLNHFGDMHVTEYFGKVLK 533
            K Y+S   E+  +  WR+  L   +  N ++       S    +F D  + E   K ++
Sbjct: 66  RKTYTSFVQEL-LIDPWRKLTLHEESGENSDHPLNTSDDSKWKEYFDDNQILEQIDKDIR 124


>UniRef50_A3GHI4 Cluster: Predicted protein; n=4;
           Saccharomycetales|Rep: Predicted protein - Pichia
           stipitis (Yeast)
          Length = 635

 Score = 32.3 bits (70), Expect = 9.6
 Identities = 16/54 (29%), Positives = 31/54 (57%)
 Frame = -2

Query: 356 VNCLILVPVPSRQMLVNEQTFQNMLLHQPFISNALTIIIHIQYILKPFSSVEEH 195
           V+C  L+    + +L NE+   N+LL  P +SN L ++I + Y L+  +++  +
Sbjct: 467 VSCYQLISKKFQDLLYNEKIVYNLLL--PNLSNELNLMIDLHYHLQSLNNISSN 518


>UniRef50_Q0PA12 Cluster: DNA translocase ftsK; n=17;
           Epsilonproteobacteria|Rep: DNA translocase ftsK -
           Campylobacter jejuni
          Length = 946

 Score = 32.3 bits (70), Expect = 9.6
 Identities = 14/49 (28%), Positives = 24/49 (48%)
 Frame = -3

Query: 298 PFKTCFSINPSSPTHSL*LYTFNTSLNLSVLSKNIRASCTNPSKDVIGL 152
           P  T F   PS+      +      L +++++K+IR     P KDV+G+
Sbjct: 534 PVVTTFEFRPSADVKVSRILNLQDDLTMALMAKSIRIQAPIPGKDVVGI 582


>UniRef50_Q8NEC5 Cluster: Cation channel sperm-associated protein 1;
           n=23; Eutheria|Rep: Cation channel sperm-associated
           protein 1 - Homo sapiens (Human)
          Length = 780

 Score = 32.3 bits (70), Expect = 9.6
 Identities = 21/71 (29%), Positives = 30/71 (42%), Gaps = 2/71 (2%)
 Frame = +3

Query: 309 HEHLPRRHWHEYKAIHNKLYSSTHHEMAALIKWRQNLRRVARHNR--EYLAGIQSYSLHL 482
           H  +P R W  +  +H+    S HHE     K   +   ++ H+    Y  GI  Y    
Sbjct: 209 HHQVPHRGWPHHHQVHHH-GRSRHHEAHQHGKSPHHGETISPHSSVGSYQRGISDYHSEY 267

Query: 483 NHFGDMHVTEY 515
            H GD H +EY
Sbjct: 268 -HQGDHHPSEY 277


  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 599,307,855
Number of Sequences: 1657284
Number of extensions: 11856071
Number of successful extensions: 32323
Number of sequences better than 10.0: 114
Number of HSP's better than 10.0 without gapping: 31133
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 32273
length of database: 575,637,011
effective HSP length: 97
effective length of database: 414,880,463
effective search space used: 45221970467
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
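
The parameters above are enough to relate the raw scores, bit scores, and Expect values printed for each hit. The following is a minimal sketch using the standard Karlin-Altschul formulas with the gapped Lambda and K from this report; the function names are illustrative only. Because the printed parameters are rounded and the program may apply corrections not shown in the trailer, the computed Expect for a raw score of 77 comes out somewhat below the reported 1.4, so treat it as an approximation.

```python
import math

# Values copied from the trailer of this report (gapped parameters).
LAMBDA_GAPPED = 0.279
K_GAPPED = 0.0580
EFFECTIVE_DB_LENGTH = 414_880_463
EFFECTIVE_SEARCH_SPACE = 45_221_970_467


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=EFFECTIVE_SEARCH_SPACE,
                 lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Approximate E-value: K * (search space) * exp(-lambda * S)."""
    return k * search_space * math.exp(-lam * raw_score)


# Raw score 77 is printed as "35.1 bits (77)" for several hits above.
print(round(bit_score(77), 1))
print(expect_value(77))

# The search space divided by the effective database length gives the
# effective query length implied by the trailer.
print(EFFECTIVE_SEARCH_SPACE // EFFECTIVE_DB_LENGTH)
```

The same conversion, with the ungapped Lambda of 0.318, accounts for the bit values printed next to X1, while X2 and X3 follow from the gapped Lambda.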
