SilkBase

Last updated: 2022/11/18
BLASTX 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= I10A02NGRL0001_F06
         (589 letters)

Database: uniref50 
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpw...   271   7e-72
UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin ...   244   1e-63
UniRef50_A1XG92 Cluster: Putative cathepsin B-like like proteina...   242   5e-63
UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Ca...   231   9e-60
UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome sh...   212   5e-54
UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=...   208   7e-53
UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase pr...   202   6e-51
UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=...   193   3e-48
UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep...   191   1e-47
UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 ...   189   4e-47
UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor; ...   187   1e-46
UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|...   187   1e-46
UniRef50_Q23FP9 Cluster: Papain family cysteine protease contain...   184   2e-45
UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase pr...   180   2e-44
UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n...   177   1e-43
UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n...   172   4e-42
UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Catheps...   172   6e-42
UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep: Ca...   169   4e-41
UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep: ...   167   2e-40
UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep: Ca...   165   7e-40
UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep...   163   2e-39
UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8....   161   1e-38
UniRef50_Q237A1 Cluster: Papain family cysteine protease contain...   157   2e-37
UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2 ...   157   2e-37
UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1 ...   155   5e-37
UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4 ...   153   3e-36
UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator ame...   153   4e-36
UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7; Rhabditida|...   153   4e-36
UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3 ...   151   9e-36
UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2...   151   2e-35
UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA...   146   3e-34
UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8; Leishma...   144   1e-33
UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoid...   144   2e-33
UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8....   143   2e-33
UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115, w...   143   3e-33
UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4; Ancylostomati...   142   4e-33
UniRef50_P91991 Cluster: Putative uncharacterized protein; n=2; ...   138   9e-32
UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7...   136   5e-31
UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8...   134   1e-30
UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin ...   134   2e-30
UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus co...   133   3e-30
UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyl...   133   3e-30
UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1; Rhip...   132   8e-30
UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep...   129   5e-29
UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep...   127   2e-28
UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gamb...   127   2e-28
UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: C...   120   3e-26
UniRef50_Q5DEC9 Cluster: SJCHGC06356 protein; n=1; Schistosoma j...   120   3e-26
UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2; ...   118   1e-25
UniRef50_Q00W63 Cluster: Cysteine proteinase; n=2; Ostreococcus|...   105   1e-21
UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|R...   101   1e-20
UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1; ...    96   6e-19
UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1; O...    92   1e-17
UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella ve...    92   1e-17
UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma j...    86   5e-16
UniRef50_A4RYN2 Cluster: Predicted protein; n=1; Ostreococcus lu...    85   2e-15
UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomus...    84   3e-15
UniRef50_Q625X3 Cluster: Putative uncharacterized protein CBG011...    79   6e-14
UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|R...    78   2e-13
UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|R...    75   9e-13
UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA,...    73   7e-12
UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Gi...    72   9e-12
UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like prote...    71   2e-11
UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;...    71   2e-11
UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Gi...    70   4e-11
UniRef50_UPI0000E4866E Cluster: PREDICTED: similar to oxidized-L...    69   6e-11
UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia...    68   2e-10
UniRef50_O02470 Cluster: Cysteine proteinase; n=2; Chromadorea|R...    68   2e-10
UniRef50_UPI0000E815AE Cluster: PREDICTED: similar to glucocorti...    67   2e-10
UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcop...    67   2e-10
UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy...    67   3e-10
UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p; ...    66   6e-10
UniRef50_Q9UJW2 Cluster: Tubulointerstitial nephritis antigen; n...    65   1e-09
UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n...    64   2e-09
UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;...    63   4e-09
UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru...    63   5e-09
UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ...    62   7e-09
UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - ...    62   7e-09
UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ...    62   7e-09
UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadida...    62   7e-09
UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-li...    62   7e-09
UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ...    62   7e-09
UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain...    62   9e-09
UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re...    62   1e-08
UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35...    62   1e-08
UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ...    61   2e-08
UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio...    61   2e-08
UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1...    61   2e-08
UniRef50_Q23RT7 Cluster: Papain family cysteine protease contain...    61   2e-08
UniRef50_Q24E33 Cluster: Papain family cysteine protease contain...    61   2e-08
UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr...    61   2e-08
UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=...    61   2e-08
UniRef50_Q9GZM7-2 Cluster: Isoform 2 of Q9GZM7 ; n=1; Homo sapie...    60   3e-08
UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen...    60   3e-08
UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2; cel...    60   3e-08
UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag...    60   4e-08
UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R...    60   4e-08
UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium...    60   4e-08
UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: ...    60   5e-08
UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy...    60   5e-08
UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ...    60   5e-08
UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali...    60   5e-08
UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; The...    60   5e-08
UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa...    59   7e-08
UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, wh...    59   7e-08
UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease ...    59   9e-08
UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e...    59   9e-08
UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18...    59   9e-08
UniRef50_Q23TW3 Cluster: Papain family cysteine protease contain...    58   1e-07
UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain...    58   1e-07
UniRef50_Q18740 Cluster: Putative uncharacterized protein tag-32...    58   1e-07
UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1...    58   2e-07
UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1; ...    58   2e-07
UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy...    58   2e-07
UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n...    58   2e-07
UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; The...    58   2e-07
UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG...    57   3e-07
UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted...    57   3e-07
UniRef50_Q235G6 Cluster: Papain family cysteine protease contain...    57   3e-07
UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, wh...    57   3e-07
UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lambl...    57   4e-07
UniRef50_Q23FS6 Cluster: Papain family cysteine protease contain...    57   4e-07
UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus...    57   4e-07
UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain ...    57   4e-07
UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er...    57   4e-07
UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Pl...    57   4e-07
UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto...    57   4e-07
UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin ...    56   5e-07
UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Try...    56   5e-07
UniRef50_Q24HI2 Cluster: Papain family cysteine protease contain...    56   5e-07
UniRef50_O16454 Cluster: Temporarily assigned gene name protein ...    56   5e-07
UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy...    56   5e-07
UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ...    56   5e-07
UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ...    56   6e-07
UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestin...    56   6e-07
UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=...    56   6e-07
UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease ...    56   8e-07
UniRef50_Q231X3 Cluster: Papain family cysteine protease contain...    56   8e-07
UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis...    56   8e-07
UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ...    56   8e-07
UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cy...    56   8e-07
UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, wh...    56   8e-07
UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;...    56   8e-07
UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;...    55   1e-06
UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big...    55   1e-06
UniRef50_Q23H32 Cluster: Papain family cysteine protease contain...    55   1e-06
UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, w...    55   1e-06
UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi...    55   1e-06
UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel...    55   1e-06
UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep...    55   1e-06
UniRef50_Q6XM39 Cluster: FirrV-1-A48 precursor; n=1; Feldmannia ...    55   1e-06
UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j...    55   1e-06
UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ...    55   1e-06
UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ...    54   2e-06
UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus...    54   2e-06
UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ...    54   2e-06
UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma...    54   2e-06
UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re...    54   3e-06
UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcop...    54   3e-06
UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foet...    54   3e-06
UniRef50_Q23H10 Cluster: Papain family cysteine protease contain...    54   3e-06
UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cy...    54   3e-06
UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh...    54   3e-06
UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; M...    54   3e-06
UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Pl...    54   3e-06
UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata...    54   3e-06
UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ...    53   4e-06
UniRef50_Q22W19 Cluster: Papain family cysteine protease contain...    53   4e-06
UniRef50_Q22DX2 Cluster: Papain family cysteine protease contain...    53   4e-06
UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:...    53   4e-06
UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease ...    53   6e-06
UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n...    53   6e-06
UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; ...    53   6e-06
UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Pir...    53   6e-06
UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ...    52   8e-06
UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3; ...    52   8e-06
UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n...    52   8e-06
UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhip...    52   8e-06
UniRef50_Q54VR1 Cluster: Putative uncharacterized protein; n=1; ...    52   8e-06
UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M...    52   8e-06
UniRef50_P81494 Cluster: Cathepsin B; n=2; Phasianidae|Rep: Cath...    52   8e-06
UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ...    52   1e-05
UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar ...    52   1e-05
UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca...    52   1e-05
UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip...    52   1e-05
UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb...    52   1e-05
UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n...    52   1e-05
UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Re...    52   1e-05
UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precurs...    52   1e-05
UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr...    52   1e-05
UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ...    52   1e-05
UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ...    52   1e-05
UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ...    52   1e-05
UniRef50_Q3L7L6 Cluster: Sar s 1 allergen Yv5032C08; n=1; Sarcop...    52   1e-05
UniRef50_Q23EG5 Cluster: Papain family cysteine protease contain...    52   1e-05
UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, wh...    52   1e-05
UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov...    52   1e-05
UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin ...    51   2e-05
UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio...    51   2e-05
UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum...    51   2e-05
UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathep...    51   2e-05
UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh...    51   2e-05
UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi...    51   2e-05
UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3....    51   2e-05
UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t...    51   2e-05
UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-...    51   2e-05
UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, wh...    51   2e-05
UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; ...    51   2e-05
UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C...    51   2e-05
UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia ...    50   3e-05
UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein...    50   3e-05
UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei...    50   3e-05
UniRef50_Q7QVE9 Cluster: GLP_542_3431_1206; n=1; Giardia lamblia...    50   3e-05
UniRef50_Q23H06 Cluster: Papain family cysteine protease contain...    50   3e-05
UniRef50_Q239L8 Cluster: Papain family cysteine protease contain...    50   3e-05
UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, who...    50   3e-05
UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcopt...    50   4e-05
UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia...    50   4e-05
UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4; Caenorhabdi...    50   4e-05
UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; ...    50   4e-05
UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryo...    50   4e-05
UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [C...    50   4e-05
UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph...    50   5e-05
UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt...    50   5e-05
UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ...    50   5e-05
UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain...    50   5e-05
UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;...    50   5e-05
UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain...    50   5e-05
UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic...    50   5e-05
UniRef50_UPI00006CB77B Cluster: Papain family cysteine protease ...    49   7e-05
UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae...    49   7e-05
UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain...    49   7e-05
UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like cy...    49   7e-05
UniRef50_A2DPZ6 Cluster: Clan CA, family C1, cathepsin B-like cy...    49   7e-05
UniRef50_A0CYS7 Cluster: Chromosome undetermined scaffold_31, wh...    49   7e-05
UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h...    49   9e-05
UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n...    49   9e-05
UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No...    49   9e-05
UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba hi...    49   9e-05
UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n...    49   9e-05
UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;...    49   9e-05
UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis...    49   9e-05
UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain...    49   9e-05
UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; ...    49   9e-05
UniRef50_A4VE98 Cluster: Cathepsin z; n=3; Tetrahymena thermophi...    49   9e-05
UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precu...    49   9e-05
UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:...    49   9e-05
UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine p...    48   1e-04
UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca...    48   1e-04
UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; ...    48   1e-04
UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G...    48   1e-04
UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3; Theil...    48   1e-04
UniRef50_Q22A69 Cluster: Papain family cysteine protease contain...    48   1e-04
UniRef50_A0DIY3 Cluster: Chromosome undetermined scaffold_52, wh...    48   1e-04
UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin ...    48   2e-04
UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat...    48   2e-04
UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schist...    48   2e-04
UniRef50_Q248G1 Cluster: Papain family cysteine protease contain...    48   2e-04
UniRef50_A0CGA3 Cluster: Chromosome undetermined scaffold_179, w...    48   2e-04
UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D...    48   2e-04
UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|...    48   2e-04
UniRef50_Q58HK5 Cluster: Hc58; n=1; Haemonchus contortus|Rep: Hc...    48   2e-04
UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep...    48   2e-04
UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3...    48   2e-04
UniRef50_Q649T1 Cluster: Cathepsin C; n=1; uncultured archaeon G...    48   2e-04
UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ...    48   2e-04
UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ...    47   3e-04
UniRef50_Q01FU9 Cluster: Cathepsin Z; n=2; Ostreococcus|Rep: Cat...    47   3e-04
UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Pl...    47   3e-04
UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ...    47   3e-04
UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr...    47   3e-04
UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, wh...    47   3e-04
UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto...    47   3e-04
UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ...    47   4e-04
UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa...    47   4e-04
UniRef50_Q9XXQ7 Cluster: Putative uncharacterized protein; n=2; ...    47   4e-04
UniRef50_Q5CM16 Cluster: P3ECSL-related; n=2; Cryptosporidium|Re...    47   4e-04
UniRef50_Q5C1E8 Cluster: SJCHGC04937 protein; n=1; Schistosoma j...    47   4e-04
UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcop...    47   4e-04
UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ...    47   4e-04
UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2...    47   4e-04
UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole...    46   5e-04
UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|...    46   5e-04
UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi...    46   5e-04
UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat...    46   5e-04
UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve...    46   5e-04
UniRef50_A2E6N1 Cluster: Clan CA, family C1, cathepsin B-like cy...    46   5e-04
UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ...    46   7e-04
UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus ...    46   7e-04
UniRef50_Q9XZI2 Cluster: Cathepsin Z1 preproprotein; n=3; Bilate...    46   7e-04
UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia...    46   7e-04
UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain...    46   7e-04
UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheir...    46   7e-04
UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs...    46   7e-04
UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; D...    46   7e-04
UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep: L...    46   9e-04
UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain...    46   9e-04
UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteina...    46   9e-04
UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory...    46   9e-04
UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|...    46   9e-04
UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal...    45   0.001
UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox...    45   0.001
UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc...    45   0.001
UniRef50_Q94715 Cluster: Putative cathepsin L2 precursor; n=4; P...    45   0.001
UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster...    45   0.002
UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep...    45   0.002
UniRef50_Q3L7L2 Cluster: Sar s 1 allergen SMIPP-C Yv6008G08; n=2...    45   0.002
UniRef50_Q23H15 Cluster: Papain family cysteine protease contain...    45   0.002
UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi...    45   0.002
UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|...    45   0.002
UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 p...    44   0.002
UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ...    44   0.002
UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome s...    44   0.002
UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty...    44   0.002
UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa...    44   0.002
UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaste...    44   0.002
UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R...    44   0.002
UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Re...    44   0.003
UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ...    44   0.003
UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; ...    44   0.003
UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ...    44   0.003
UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl...    44   0.003
UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain...    44   0.003
UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C...    44   0.003
UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:...    44   0.003
UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ...    44   0.003
UniRef50_A0LQG9 Cluster: Peptidase C1A, papain; n=1; Syntrophoba...    44   0.003
UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lambli...    44   0.003
UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi...    44   0.003
UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w...    44   0.003
UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Re...    44   0.003
UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy...    43   0.005
UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|...    43   0.005
UniRef50_UPI00006CBB5F Cluster: Papain family cysteine protease ...    43   0.006
UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckei...    43   0.006
UniRef50_Q6A1H9 Cluster: Cathepsin X/O; n=1; Suberites domuncula...    43   0.006
UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like cy...    43   0.006
UniRef50_UPI000155D183 Cluster: PREDICTED: similar to Cathepsin ...    42   0.008
UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ...    42   0.008
UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ...    42   0.008
UniRef50_Q8I1Y2 Cluster: Protease, putative; n=1; Plasmodium fal...    42   0.008
UniRef50_A5KAP8 Cluster: Protease, putative; n=1; Plasmodium viv...    42   0.008
UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-li...    42   0.008
UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, w...    42   0.008
UniRef50_Q7QQ83 Cluster: GLP_42_16392_14707; n=1; Giardia lambli...    42   0.011
UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve...    42   0.011
UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica...    42   0.014
UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dv...    42   0.014
UniRef50_A7APS9 Cluster: Papain family cysteine protease contain...    42   0.014
UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ...    42   0.014
UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; ...    41   0.019
UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n...    41   0.019
UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:...    41   0.019
UniRef50_Q8I0V1 Cluster: Preprocathepsin c, putative; n=1; Plasm...    41   0.019
UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H...    41   0.019
UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa...    41   0.025
UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil...    41   0.025
UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; ...    41   0.025
UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like cy...    41   0.025
UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-li...    41   0.025
UniRef50_UPI00006CBCFB Cluster: Papain family cysteine protease ...    40   0.033
UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ...    40   0.033
UniRef50_Q01E11 Cluster: Cysteine proteinase Cathepsin F; n=3; O...    40   0.033
UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ...    40   0.033
UniRef50_Q5CKC2 Cluster: Preprocathepsin c; n=2; Cryptosporidium...    40   0.033
UniRef50_Q3L7L7 Cluster: Sar s 1 allergen Yv5020C01; n=1; Sarcop...    40   0.033
UniRef50_A0CS14 Cluster: Chromosome undetermined scaffold_26, wh...    40   0.033
UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome s...    40   0.043
UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome s...    40   0.043
UniRef50_A6GBK8 Cluster: Peptidase C1A, papain; n=1; Plesiocysti...    40   0.043
UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestiv...    40   0.043
UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-...    40   0.043
UniRef50_Q54R55 Cluster: Putative uncharacterized protein; n=1; ...    40   0.043
UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lambli...    40   0.057
UniRef50_Q75JH0 Cluster: Similar to Dictyostelium discoideum (Sl...    40   0.057
UniRef50_Q5CYC8 Cluster: Cathepsin like thiol protease possibly ...    40   0.057
UniRef50_A5K8Y0 Cluster: Preprocathepsin c, putative; n=1; Plasm...    40   0.057
UniRef50_Q8IIJ9 Cluster: Probable cathepsin C precursor; n=11; P...    40   0.057
UniRef50_A0LES1 Cluster: Peptidase C1A, papain precursor; n=2; S...    39   0.075
UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryz...    39   0.075
UniRef50_Q7RQM7 Cluster: Dipeptidyl-peptidase i; n=6; Plasmodium...    39   0.075
UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lambl...    39   0.075
UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intest...    39   0.075
UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n...    39   0.075
UniRef50_Q60EQ9 Cluster: Putative uncharacterized protein OJ1280...    39   0.100
UniRef50_Q01FL5 Cluster: Cysteine protease-5; n=3; Ostreococcus|...    39   0.100
UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Ac...    39   0.100
UniRef50_Q93512 Cluster: Putative uncharacterized protein; n=2; ...    39   0.100
UniRef50_Q8I8D3 Cluster: Cysteine protease 15; n=2; Entamoeba hi...    39   0.100
UniRef50_Q22NW9 Cluster: Papain family cysteine protease contain...    39   0.100
UniRef50_A7AX75 Cluster: Preprocathepsin c, putative; n=1; Babes...    39   0.100
UniRef50_Q2FLC7 Cluster: Periplasmic copper-binding precursor; n...    39   0.100
UniRef50_UPI000155637A Cluster: PREDICTED: similar to ENSANGP000...    38   0.13 
UniRef50_Q8QNJ8 Cluster: EsV-1-75; n=1; Ectocarpus siliculosus v...    38   0.13 
UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v...    38   0.13 
UniRef50_UPI00006CB653 Cluster: Papain family cysteine protease ...    38   0.17 
UniRef50_Q015J8 Cluster: Cathepsin; n=2; Ostreococcus|Rep: Cathe...    38   0.17 
UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ...    38   0.17 
UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicate...    38   0.17 
UniRef50_Q9UBR2 Cluster: Cathepsin Z precursor; n=40; Bilateria|...    38   0.17 
UniRef50_Q0JP65 Cluster: Os01g0240900 protein; n=3; Oryza sativa...    38   0.23 
UniRef50_Q5YER6 Cluster: Cathepsin Z; n=1; Bigelowiella natans|R...    37   0.30 
UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endop...    37   0.30 
UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|...    37   0.40 
UniRef50_Q17IZ6 Cluster: Procathepsin L3, putative; n=2; Culicid...    37   0.40 
UniRef50_Q3A8I3 Cluster: Putative serine protease; n=1; Pelobact...    36   0.53 
UniRef50_A4S004 Cluster: Predicted protein; n=2; Ostreococcus|Re...    36   0.53 
UniRef50_Q22RC9 Cluster: Papain family cysteine protease contain...    36   0.53 
UniRef50_A5KBM2 Cluster: Serine-repeat antigen; n=3; Plasmodium|...    36   0.53 
UniRef50_A0DCA5 Cluster: Chromosome undetermined scaffold_45, wh...    36   0.53 
UniRef50_A5UP12 Cluster: Adhesin-like protein; n=1; Methanobrevi...    36   0.53 
UniRef50_A0EAG2 Cluster: Chromosome undetermined scaffold_86, wh...    36   0.70 
UniRef50_A0D5R4 Cluster: Chromosome undetermined scaffold_39, wh...    36   0.70 
UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,...    36   0.93 
UniRef50_UPI00006CBAC7 Cluster: Papain family cysteine protease ...    36   0.93 
UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; R...    36   0.93 
UniRef50_Q7R0G3 Cluster: GLP_29_33036_32140; n=1; Giardia lambli...    36   0.93 
UniRef50_Q4N3V5 Cluster: Cathepsin C, putative; n=1; Theileria p...    36   0.93 
UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina...    36   0.93 
UniRef50_A0BZE1 Cluster: Chromosome undetermined scaffold_139, w...    36   0.93 
UniRef50_UPI0001509FF9 Cluster: Papain family cysteine protease ...    35   1.2  
UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin ...    35   1.2  
UniRef50_Q9NHY1 Cluster: Cysteine protease cp2; n=1; Theileria c...    35   1.2  
UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theil...    35   1.2  
UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1; ...    35   1.2  
UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryz...    35   1.6  
UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba hi...    35   1.6  
UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1; ...    35   1.6  
UniRef50_Q4YCM9 Cluster: Cysteine protease, putative; n=5; Plasm...    35   1.6  
UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;...    34   2.1  
UniRef50_Q02CM0 Cluster: 4Fe-4S ferredoxin, iron-sulfur binding ...    34   2.1  
UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like cy...    34   2.1  
UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2; ...    34   2.8  
UniRef50_A5HC50 Cluster: Cathepsin W; n=2; Theria|Rep: Cathepsin...    34   2.8  
UniRef50_A2F4G0 Cluster: Clan CA, family C1, cathepsin L-like cy...    34   2.8  
UniRef50_Q9NHY2 Cluster: Cysteine protease cp1; n=2; Theileria c...    33   3.7  
UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n...    33   3.7  
UniRef50_Q0GBZ7 Cluster: Membrane-associated protein 29; n=4; Sc...    33   3.7  
UniRef50_Q7M4N9 Cluster: Dipeptidyl-peptidase I; n=1; Homo sapie...    33   3.7  
UniRef50_UPI0000DB78AE Cluster: PREDICTED: similar to C25E10.7; ...    33   4.9  
UniRef50_O96163 Cluster: Cysteine protease, putative; n=5; Plasm...    33   4.9  
UniRef50_A7ASR7 Cluster: Cathepsin C, putative; n=1; Babesia bov...    33   4.9  
UniRef50_A5KBM1 Cluster: Serine-repeat antigen; n=1; Plasmodium ...    33   4.9  
UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease ...    33   6.5  
UniRef50_Q0E4Y7 Cluster: 50 kDa Cathepsin B; n=2; Ascovirus|Rep:...    33   6.5  
UniRef50_A7M7G2 Cluster: ParC; n=1; Serratia entomophila|Rep: Pa...    33   6.5  
UniRef50_A3J1C4 Cluster: B-glycosyltransferase-related protein, ...    33   6.5  
UniRef50_Q0E4N0 Cluster: Os02g0109400 protein; n=3; Oryza sativa...    33   6.5  
UniRef50_Q54MB6 Cluster: Putative uncharacterized protein; n=1; ...    33   6.5  
UniRef50_A6N590 Cluster: Extracellular cysteine protease 8; n=10...    33   6.5  
UniRef50_A5KBM0 Cluster: Serine-repeat antigen (SERA), putative;...    33   6.5  
UniRef50_A0DZB1 Cluster: Chromosome undetermined scaffold_7, who...    33   6.5  
UniRef50_Q2H7E7 Cluster: Putative uncharacterized protein; n=1; ...    33   6.5  
UniRef50_UPI0000F2D780 Cluster: PREDICTED: similar to WW domain ...    32   8.6  
UniRef50_A1SVF0 Cluster: Peptidase C1A, papain; n=1; Psychromona...    32   8.6  
UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sa...    32   8.6  
UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryz...    32   8.6  
UniRef50_Q2QS15 Cluster: Papain family cysteine protease contain...    32   8.6  
UniRef50_Q8I8D7 Cluster: Cysteine protease 11; n=4; Entamoeba hi...    32   8.6  
UniRef50_Q7QWN9 Cluster: GLP_26_50243_51811; n=2; Giardia lambli...    32   8.6  
UniRef50_Q26153 Cluster: V-SERA 4; n=1; Plasmodium vivax|Rep: V-...    32   8.6  
UniRef50_Q26015 Cluster: Serine rich protein homologue; n=4; Pla...    32   8.6  
UniRef50_Q1AMF3 Cluster: Cathepsin C1; n=1; Toxoplasma gondii|Re...    32   8.6  
UniRef50_Q1AMF2 Cluster: Cathepsin C2; n=1; Toxoplasma gondii|Re...    32   8.6  
UniRef50_A0DKP6 Cluster: Chromosome undetermined scaffold_54, wh...    32   8.6  
UniRef50_Q1DTN0 Cluster: Predicted protein; n=1; Coccidioides im...    32   8.6  
UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]...    32   8.6  

>UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep:
           Parcxpwnx02 - Periplaneta americana (American cockroach)
          Length = 343

 Score =  271 bits (665), Expect = 7e-72
 Identities = 114/191 (59%), Positives = 139/191 (72%)
 Frame = +2

Query: 14  TWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPT 193
           TWKA RNF    P   IK LMG  +     +LP+ + + ++   +PE FDPR++WPECPT
Sbjct: 50  TWKAHRNFGNDIPLREIKKLMGVRRSLENFRLPEKSME-DIDIEIPEEFDPREQWPECPT 108

Query: 194 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPT 373
           L EIRDQGSCGSCWAFGAVEAM+DRVCI+S    HFHFSAEDL++CC  CG GCNGG P 
Sbjct: 109 LKEIRDQGSCGSCWAFGAVEAMSDRVCIHSKGKTHFHFSAEDLLTCCSSCGFGCNGGEPG 168

Query: 374 LAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNCESSYN 553
            AW+YW   G+VSGG+YNS QGC+PY I PCEHHV G R PC G+  TP+C K CE  Y+
Sbjct: 169 AAWDYWVSTGIVSGGSYNSHQGCQPYAIEPCEHHVNGTRKPC-GEGDTPRCVKRCEEGYD 227

Query: 554 VPFKKEQRYGK 586
           VP+ K++ +GK
Sbjct: 228 VPYGKDRHFGK 238


>UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin B;
           n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
           similar to cathepsin B - Strongylocentrotus purpuratus
          Length = 346

 Score =  244 bits (597), Expect = 1e-63
 Identities = 102/191 (53%), Positives = 137/191 (71%)
 Frame = +2

Query: 8   QNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPEC 187
           + TWKAG NF         + ++GALK+ N  +LPK+ +    I +LPENFD R+ WP C
Sbjct: 35  KTTWKAGINFEGWQ-LDDFRRMLGALKNPNG-RLPKLENQTR-IKDLPENFDARENWPNC 91

Query: 188 PTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGM 367
           PT+ E+RDQGSCGSCWAFGAVEA++DR+CI S      H SAEDL++CC  CG GCNGG 
Sbjct: 92  PTIKEVRDQGSCGSCWAFGAVEAISDRICIKSKGQTQVHISAEDLMTCCKTCGNGCNGGF 151

Query: 368 PTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNCESS 547
           P  AWEY+K  G+V+GG +NSSQGC+PY+I  C+HHV G + PC G+  TP+C+  CE+S
Sbjct: 152 PGSAWEYYKDTGIVTGGQWNSSQGCQPYQIKSCDHHVNGTKGPCQGEGPTPECKHKCEAS 211

Query: 548 YNVPFKKEQRY 580
           Y+ P+++++ Y
Sbjct: 212 YSTPYEQDKHY 222


>UniRef50_A1XG92 Cluster: Putative cathepsin B-like like proteinase;
           n=1; Tenebrio molitor|Rep: Putative cathepsin B-like
           like proteinase - Tenebrio molitor (Yellow mealworm)
          Length = 301

 Score =  242 bits (592), Expect = 5e-63
 Identities = 106/195 (54%), Positives = 135/195 (69%), Gaps = 2/195 (1%)
 Frame = +2

Query: 5   KQNTWKAGRNFPTHTPFAHIKILMGAL-KDDNILKLPKVTHDAELIANLPENFDPRDKWP 181
           KQ TWKAGRNF  +TP +H++ L+G L K  N  KLP  TH   L A +PE+FD R+ WP
Sbjct: 37  KQTTWKAGRNFDVNTPISHVRRLLGVLPKKANAPKLPVKTHAVNLDA-IPESFDAREAWP 95

Query: 182 ECPTL-NEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCN 358
           EC ++  EIRDQ SCGSCWAFGAVEAM+DR+CI+S+A+     SAEDL  CC  CG GCN
Sbjct: 96  ECTSIIGEIRDQASCGSCWAFGAVEAMSDRICIHSDASVKVRISAEDLNDCCYDCGDGCN 155

Query: 359 GGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNC 538
           GG P LAW YW   G+V+GG Y   +GC+ Y I PC+HHV GN  PC    +TP C+K+C
Sbjct: 156 GGWPDLAWSYWSSTGIVTGGLYGVDEGCKAYSIKPCDHHVDGNLGPCGDIQRTPACKKSC 215

Query: 539 ESSYNVPFKKEQRYG 583
           +S+ ++ +K + R G
Sbjct: 216 DSTSDLEYKSDLRRG 230


>UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1)
           (Cathepsin B1) (APP secretase) (APPS) [Contains:
           Cathepsin B light chain; Cathepsin B heavy chain]; n=85;
           Eukaryota|Rep: Cathepsin B precursor (EC 3.4.22.1)
           (Cathepsin B1) (APP secretase) (APPS) [Contains:
           Cathepsin B light chain; Cathepsin B heavy chain] - Homo
           sapiens (Human)
          Length = 339

 Score =  231 bits (565), Expect = 9e-60
 Identities = 103/195 (52%), Positives = 132/195 (67%), Gaps = 1/195 (0%)
 Frame = +2

Query: 2   KKQNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWP 181
           K+  TW+AG NF  +   +++K L G        K P+     E +  LP +FD R++WP
Sbjct: 36  KRNTTWQAGHNF-YNVDMSYLKRLCGTFLGGP--KPPQRVMFTEDL-KLPASFDAREQWP 91

Query: 182 ECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCN 358
           +CPT+ EIRDQGSCGSCWAFGAVEA++DR+CI++NA      SAEDL++CC  +CG GCN
Sbjct: 92  QCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCN 151

Query: 359 GGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNC 538
           GG P  AW +W   GLVSGG Y S  GCRPY IPPCEHHV G+R PC G+  TPKC K C
Sbjct: 152 GGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKIC 211

Query: 539 ESSYNVPFKKEQRYG 583
           E  Y+  +K+++ YG
Sbjct: 212 EPGYSPTYKQDKHYG 226


>UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome
           shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome 5
           SCAF15026, whole genome shotgun sequence - Tetraodon
           nigroviridis (Green puffer)
          Length = 351

 Score =  212 bits (518), Expect = 5e-54
 Identities = 106/217 (48%), Positives = 136/217 (62%), Gaps = 22/217 (10%)
 Frame = +2

Query: 2   KKQNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWP 181
           K  +TW AG NF  +  ++++K L G L      KLP +   A  I  LP+ FD R++WP
Sbjct: 35  KLNSTWTAGHNFH-NVDYSYVKKLCGTLLKGP--KLPLMIRYAGDI-KLPKEFDSREQWP 90

Query: 182 ECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNG 361
            CPTL EIRDQGSCGSCWAFGA EAM+DRVCI+SNA      SA+DL++CC  CG+GCNG
Sbjct: 91  NCPTLKEIRDQGSCGSCWAFGASEAMSDRVCIHSNAKVSVELSAQDLLTCCNSCGMGCNG 150

Query: 362 GMPTLAWEYWKHVGLVSGGNYNS---------------------SQGCRPYEIPPCEHHV 478
           G P+ AW +W   GLVSGG Y+S                     S GCRPY IPPCEHHV
Sbjct: 151 GYPSSAWNFWVSDGLVSGGLYDSHIGRIQVSLCVLLLAVDRDFVSPGCRPYTIPPCEHHV 210

Query: 479 PGNRMPCNGD-TKTPKCQKNCESSYNVPFKKEQRYGK 586
            G+R  C+G+   TP+C   CE+ Y+  +K+++ +GK
Sbjct: 211 NGSRPSCSGEGGDTPECIFRCEAGYSPSYKQDKHFGK 247


>UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=1;
           Biomphalaria glabrata|Rep: Cathepsin B preproprotein
           precursor - Biomphalaria glabrata (Bloodfluke planorb)
          Length = 333

 Score =  208 bits (508), Expect = 7e-53
 Identities = 88/192 (45%), Positives = 118/192 (61%), Gaps = 1/192 (0%)
 Frame = +2

Query: 14  TWKAGRNF-PTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECP 190
           TWKAGRNF P     A   + +   ++    ++       +   +LP+NFDPR KWP+C 
Sbjct: 42  TWKAGRNFHPAEIKRARALLGVNMAENKAYNRIHLKYKQVQPRNDLPDNFDPRTKWPDCA 101

Query: 191 TLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMP 370
           +LNEIRDQ +CGSCWAFG+ EAMTDR+CI      + H SAED+  CC  CG+GCNGG P
Sbjct: 102 SLNEIRDQANCGSCWAFGSAEAMTDRICIAGKG--NIHISAEDINDCCKSCGMGCNGGYP 159

Query: 371 TLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNCESSY 550
             AWE++   G+VSGG Y +++GC PY +P C+HH  G   PC     TPKC+K C + Y
Sbjct: 160 AAAWEWYVDTGVVSGGQYGTNEGCMPYSLPHCDHHTTGKYQPCPAVVPTPKCEKKCLTGY 219

Query: 551 NVPFKKEQRYGK 586
              +  ++  GK
Sbjct: 220 PKSYSNDKTRGK 231


>UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase
           precursor; n=28; Bilateria|Rep: Cathepsin B-like
           cysteine proteinase precursor - Schistosoma japonicum
           (Blood fluke)
          Length = 342

 Score =  202 bits (492), Expect = 6e-51
 Identities = 89/193 (46%), Positives = 120/193 (62%), Gaps = 4/193 (2%)
 Frame = +2

Query: 17  WKAGRNFPTHTPFAHIKILMGALKDDNILKL---PKVTHDAELIANLPENFDPRDKWPEC 187
           WKA ++   H+     +ILMGA K+D  +K    P V H  +L   +P  FD R KWP C
Sbjct: 46  WKADKSDRFHS-LDDARILMGARKEDAEMKRNRRPTVDHH-DLNVEIPSQFDSRKKWPHC 103

Query: 188 PTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGM 367
            ++++IRDQ  CGSCWAFGAVEAMTDR+CI S   +    SA DL+SCC  CG GC GG 
Sbjct: 104 KSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLISCCKDCGDGCQGGF 163

Query: 368 PTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDT-KTPKCQKNCES 544
           P +AW+YW   G+V+GG+  +  GC+PY  P CEHH  G    C     KTP+C++ C+ 
Sbjct: 164 PGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQK 223

Query: 545 SYNVPFKKEQRYG 583
            Y  P+++++ YG
Sbjct: 224 GYKTPYEQDKHYG 236


>UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=1;
           Nilaparvata lugens|Rep: Cathepsin B-like protease
           precursor - Nilaparvata lugens (Brown planthopper)
          Length = 347

 Score =  193 bits (470), Expect = 3e-48
 Identities = 86/200 (43%), Positives = 123/200 (61%), Gaps = 7/200 (3%)
 Frame = +2

Query: 8   QNTWKAGRNFPTHTPFAHIKILMGALK-DDNILKLPKVTHDAELIAN----LPENFDPRD 172
           ++TWKAG NF   TP ++++ L+G  + + N+  L K     E   N    +P+ FD R 
Sbjct: 41  KSTWKAGHNFHPDTPMSYLQGLLGVSELESNLADLDKYEEMEENEENKKIKVPKYFDARK 100

Query: 173 KWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLG 352
           KW +C +L EIRDQG+CGSCWA     A  DR+CI SNA  + H S+ +L+SCC  CG G
Sbjct: 101 KWKKCKSLREIRDQGNCGSCWAVSVAAAFADRLCIASNAKWNGHISSRELMSCCSYCGFG 160

Query: 353 CNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGD--TKTPKC 526
           C GG P  AW + K  GLV+GG+Y+S  GC+PY I PCEHH+ G++  C+      TP C
Sbjct: 161 CEGGFPDAAWVFIKRHGLVTGGDYHSHDGCQPYPIAPCEHHMEGSKPNCSASPTEPTPAC 220

Query: 527 QKNCESSYNVPFKKEQRYGK 586
           +  C    ++ ++K+++ GK
Sbjct: 221 ETTCTHGSSLAYQKDRQKGK 240


>UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep:
           Cathepsin B - Pandalus borealis (Northern red shrimp)
          Length = 328

 Score =  191 bits (465), Expect = 1e-47
 Identities = 83/193 (43%), Positives = 111/193 (57%)
 Frame = +2

Query: 5   KQNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPE 184
           KQ TWKAGRNF        +K L    K+ +I KLP    +      +P  FD R++WP 
Sbjct: 31  KQMTWKAGRNFAKDISKDFLKSLNCVRKNPDIPKLP--LKNVTPTKEIPVEFDAREQWPH 88

Query: 185 CPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGG 364
           CP ++EIRDQG+CGSCWA  A   MTDR CI +     F FS+E++ +CC  CG  C GG
Sbjct: 89  CPCIDEIRDQGNCGSCWAVSAASVMTDRTCIDTEGLVDFRFSSENVAACCTECGNACYGG 148

Query: 365 MPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNCES 544
               A+ +W   G VSGG +NS++GC+PY +  CEHH+ G R PC GD     C + C  
Sbjct: 149 DEDTAFTHWVTKGFVSGGRHNSNEGCQPYSVEECEHHIEGPRPPCEGDMPELVCSETCHE 208

Query: 545 SYNVPFKKEQRYG 583
            Y   ++++  YG
Sbjct: 209 EYGKTYEEDLEYG 221


>UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6
           precursor; n=11; Bilateria|Rep: Cathepsin B-like
           cysteine proteinase 6 precursor - Caenorhabditis elegans
          Length = 379

 Score =  189 bits (461), Expect = 4e-47
 Identities = 79/152 (51%), Positives = 99/152 (65%), Gaps = 2/152 (1%)
 Frame = +2

Query: 131 ELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFS 310
           +L  ++PE+FD RD WP+C ++  IRDQ SCGSCWAFGAVEAM+DR+CI S+       S
Sbjct: 100 DLDLDIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLS 159

Query: 311 AEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNR 490
           A+DL+SCC  CG GCNGG P  AW YW   G+V+G NY ++ GC+PY  PPCEHH     
Sbjct: 160 ADDLLSCCKSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTH 219

Query: 491 M-PCNGDT-KTPKCQKNCESSYNVPFKKEQRY 580
             PC  D   TPKC+K C S Y      E ++
Sbjct: 220 FDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKF 251


>UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           B-like proteinase precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 331

 Score =  187 bits (456), Expect = 1e-46
 Identities = 85/195 (43%), Positives = 115/195 (58%), Gaps = 2/195 (1%)
 Frame = +2

Query: 5   KQNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPE 184
           KQ+TW AG+NF  +     IK L+GA K   +    + TH  ++   +P +FD R+ W E
Sbjct: 35  KQSTWVAGKNFDENLSIQEIKNLLGA-KKGKLGVAKEFTHSEDI--QVPNSFDARENWKE 91

Query: 185 CP-TLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNG 361
           C   ++ + DQ  CGSCWA  A  AM+DR CI S        SAE+L+SCC  CG GC G
Sbjct: 92  CSDVISTVVDQSDCGSCWAVAAASAMSDRRCIASQGKLKVPVSAENLLSCCDSCGYGCEG 151

Query: 362 GMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNG-DTKTPKCQKNC 538
           G PT+AW YW   G+ +GG Y S QGC+PY + PCEHH  GN++ C+  D  TP C+  C
Sbjct: 152 GYPTMAWSYWIDTGITTGGLYGSKQGCQPYSLQPCEHHTEGNKVQCSTLDYDTPSCKHKC 211

Query: 539 ESSYNVPFKKEQRYG 583
           + S  + +K E  +G
Sbjct: 212 DDS-ALNYKSELTFG 225


>UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis
           sinensis|Rep: Cathepsin B5 - Clonorchis sinensis
          Length = 343

 Score =  187 bits (456), Expect = 1e-46
 Identities = 84/179 (46%), Positives = 110/179 (61%), Gaps = 3/179 (1%)
 Frame = +2

Query: 17  WKAGRNFPTHTPFAHIKILMGALKDDNILKL--PKVTHDAELIANLPENFDPRDKWPECP 190
           W +GR  P       +  + GA ++    K   P + HD      LP+NFD R  WP C 
Sbjct: 42  WISGR-LPKRFESGDLIHMFGAKRETREQKAQRPTLRHDGFDNMRLPKNFDARKTWPHCS 100

Query: 191 TLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMP 370
           +++EIRDQ SCGSCWAFGAVEAM+DR+CI+SN   +   SA DL+SCC  CG GC GG P
Sbjct: 101 SISEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLLSCCKDCGFGCRGGYP 160

Query: 371 TLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDT-KTPKCQKNCES 544
            +AW+YWK  G+V+GG+     GCR Y  P CEHHV G+  PC  +   TP+C + C++
Sbjct: 161 AVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEHHVQGHYPPCPRELYPTPECVQQCDT 219


>UniRef50_Q23FP9 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 340

 Score =  184 bits (447), Expect = 2e-45
 Identities = 82/184 (44%), Positives = 109/184 (59%), Gaps = 4/184 (2%)
 Frame = +2

Query: 11  NTWKAGRNFPTHTPFAHIKIL--MGALKDDNILKLPKVTHDAELIAN-LPENFDPRDKWP 181
           +TWKA R +P        ++L  +G+L + + +KLP    D    A+ +PE FD R++WP
Sbjct: 41  STWKAAR-YPHFEKMTREQLLGHLGSLDEPDWVKLPTKEFDPNANADPIPEFFDAREQWP 99

Query: 182 ECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP-ICGLGCN 358
            C ++  IRDQ +CGSCWAF A E  +DR+CI SN T     S+EDL+ CC   CG+GC 
Sbjct: 100 NCQSIKLIRDQSTCGSCWAFAATETFSDRICIASNQTLQTSISSEDLLECCADYCGMGCK 159

Query: 359 GGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNC 538
           GG P+ AW Y K  G+ +GG Y     C+PY  PPC+HHV G   PC     TP+C K C
Sbjct: 160 GGYPSAAWGYMKRQGVSTGGLYGDDTSCKPYIFPPCDHHVTGQYQPCGPIQPTPQCVKEC 219

Query: 539 ESSY 550
            S Y
Sbjct: 220 NSEY 223


>UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase
           precursor; n=29; Schistosomatidae|Rep: Cathepsin B-like
           cysteine proteinase precursor - Schistosoma mansoni
           (Blood fluke)
          Length = 340

 Score =  180 bits (439), Expect = 2e-44
 Identities = 80/194 (41%), Positives = 114/194 (58%), Gaps = 4/194 (2%)
 Frame = +2

Query: 17  WKAGRNFPTHTPFAHIKILMGALKDDNILKL---PKVTHDAELIANLPENFDPRDKWPEC 187
           W+A ++   H+     +I MGA +++  L+    P V H+ +    +P NFD R KWP C
Sbjct: 45  WRAEKSNRFHS-LDDARIQMGARREEPDLRRKRRPTVDHN-DWNVEIPSNFDSRKKWPGC 102

Query: 188 PTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGM 367
            ++  IRDQ  CGSCW+FGAVEAM+DR CI S   ++   SA DL++CC  CGLGC GG+
Sbjct: 103 KSIATIRDQSRCGSCWSFGAVEAMSDRSCIQSGGKQNVELSAVDLLTCCESCGLGCEGGI 162

Query: 368 PTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDT-KTPKCQKNCES 544
              AW+YW   G+V+  +  +  GC PY  P CEHH  G   PC      TP+C++ C+ 
Sbjct: 163 LGPAWDYWVKEGIVTASSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYNTPRCKQTCQR 222

Query: 545 SYNVPFKKEQRYGK 586
            Y  P+ +++  GK
Sbjct: 223 KYKTPYTQDKHRGK 236


>UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n=8;
           Strongylida|Rep: Cathepsin B-like cysteine protease 2 -
           Parelaphostrongylus tenuis
          Length = 344

 Score =  177 bits (432), Expect = 1e-43
 Identities = 75/162 (46%), Positives = 101/162 (62%), Gaps = 1/162 (0%)
 Frame = +2

Query: 104 KLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYS 283
           K P+V    E    +P++FD R +WP CP+++ IRDQ  CGSCWAFG+ EAM+DRVCI S
Sbjct: 80  KKPRVDEIGEEGFKIPDSFDARVQWPHCPSISYIRDQSQCGSCWAFGSAEAMSDRVCIAS 139

Query: 284 NATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPP 463
           +  K    SA+D++SCC  CG GC+GG P  AWEY+   G+V+GG Y +   CRPYEIPP
Sbjct: 140 HGNKTVELSADDILSCCYDCGDGCDGGYPISAWEYFVETGVVTGGLYGTKDSCRPYEIPP 199

Query: 464 CEHHVPGNRM-PCNGDTKTPKCQKNCESSYNVPFKKEQRYGK 586
           C HH        C     TP C   C++ Y + +  ++ +GK
Sbjct: 200 CGHHRNETFYGNCTQIADTPDCVTTCQAGYPISYDDDKTFGK 241


>UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n=4;
           Tenebrionidae|Rep: Putative cathepsin B-like proteinase
           - Tenebrio molitor (Yellow mealworm)
          Length = 321

 Score =  172 bits (419), Expect = 4e-42
 Identities = 83/196 (42%), Positives = 120/196 (61%), Gaps = 2/196 (1%)
 Frame = +2

Query: 8   QNTWKAGRNFPTHTPFAHIKILMG--ALKDDNILKLPKVTHDAELIANLPENFDPRDKWP 181
           Q++W AGRNFP +T   ++  L G   L  D   K P + H      ++PE+FD R KWP
Sbjct: 36  QSSWVAGRNFPENTTNEYLYKLNGFIGLHPDPNYKPPVLVHTFNA-RDVPESFDARTKWP 94

Query: 182 ECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNG 361
            C +LN IRDQG+CGSCWAF ++E+M+DR+CI+S+ +  F FS EDL+SCC  CG  C G
Sbjct: 95  NCDSLNRIRDQGACGSCWAFASIESMSDRICIHSSGSAQFMFSPEDLLSCCTSCG-DCGG 153

Query: 362 GMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNCE 541
           G    A +++ + G+VSGG+ NS++GCRPY     + H  G         +TP C K+C 
Sbjct: 154 GYMMSALDFYINEGIVSGGDVNSNEGCRPY---TADAHDQG---------QTPACTKSCR 201

Query: 542 SSYNVPFKKEQRYGKH 589
           + Y+  +  ++ YG +
Sbjct: 202 NGYSTSYSADKHYGSN 217


>UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Cathepsin
           B - Fasciola gigantica (Giant liver fluke)
          Length = 339

 Score =  172 bits (418), Expect = 6e-42
 Identities = 81/196 (41%), Positives = 116/196 (59%), Gaps = 6/196 (3%)
 Frame = +2

Query: 14  TWKAGRNFPTHTPFAHIKILMGALKDD----NILKLPKVTHDAELIANLPENFDPRDKWP 181
           +WKA R+    +   H K+ +GAL +     N L+ P + HD     +LPE+FD R +WP
Sbjct: 41  SWKAARS-TRFSNVDHFKLHLGALSETPEERNALR-PTIKHDISK-NDLPESFDARSQWP 97

Query: 182 ECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNG 361
           +C T++EIRDQ SCGSCWA  A  AM+DRVCI+SN       +A D +SCC  CG GC G
Sbjct: 98  QCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAADPLSCCTYCGQGCRG 157

Query: 362 GMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMP-CNGDT-KTPKCQKN 535
           G P  AW+YW   G+V+GG + +  GC+P+    C+H     +   C   T  TP C + 
Sbjct: 158 GYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYTYPTPPCARA 217

Query: 536 CESSYNVPFKKEQRYG 583
           C++ YN  +++++ YG
Sbjct: 218 CQTGYNKTYEQDKFYG 233


>UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep:
           Cathepsin b - Aedes aegypti (Yellowfever mosquito)
          Length = 386

 Score =  169 bits (411), Expect = 4e-41
 Identities = 89/193 (46%), Positives = 109/193 (56%), Gaps = 2/193 (1%)
 Frame = +2

Query: 14  TWKAGRNFPTHTPFAHIKILMGALKDDNILKLPK-VTHDAELIANLPENFDPRDKWPECP 190
           TW+AG N P         + M  L+     KLP  +  D E + +LP+ FD R+KWPECP
Sbjct: 85  TWRAGSN-PKPPAGYRSGVNMADLERT---KLPLGIMADVEDL-DLPDTFDAREKWPECP 139

Query: 191 TLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMP 370
           +L EIRDQG CGSCWA  A  AMTDR C+ S   + F F + DL+SCC  CG GC GG  
Sbjct: 140 SLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLLSCCHSCGQGCRGGTL 199

Query: 371 TLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNCESSY 550
             AW++W   GL SGG  NS QGC PY I  C   +PG       D  TPKC   C S Y
Sbjct: 200 GPAWQFWVEKGLSSGGPLNSRQGCHPYPIGEC--RIPGE------DEDTPKCSNKCRSGY 251

Query: 551 NV-PFKKEQRYGK 586
           NV    +++ YG+
Sbjct: 252 NVTDVWQDRHYGR 264


>UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep:
           Cathepsin B - Apriona germari
          Length = 324

 Score =  167 bits (405), Expect = 2e-40
 Identities = 85/196 (43%), Positives = 118/196 (60%), Gaps = 3/196 (1%)
 Frame = +2

Query: 2   KKQNTWKAGRNFPTHTPFAHIKIL---MGALKDDNILKLPKVTHDAELIANLPENFDPRD 172
           +K  TW A +NF   TP   +K L   +G  +D N+  LP V H+A  I+ +P++FD R+
Sbjct: 37  EKATTWTARKNFEGRTP-EQLKALADVIGINRDPNVT-LPVVFHEA--ISGIPDSFDARE 92

Query: 173 KWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLG 352
           +WP C ++  IRD+G+CGSCWAF AVE M+DR+C+ S   K F FSAE++VSCC  CG G
Sbjct: 93  QWPFCESIRTIRDEGACGSCWAFAAVEVMSDRLCLASEGRKKFIFSAEEVVSCCTACGGG 152

Query: 353 CNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQK 532
           C GG     ++YW   G+ SGG+Y S  GC+PY        V G         +TP+CQK
Sbjct: 153 CRGGFLNEPYKYWVTNGIPSGGDYGSKLGCKPYTAA-----VSG---------ETPQCQK 198

Query: 533 NCESSYNVPFKKEQRY 580
            C S Y   ++K+ R+
Sbjct: 199 ACVSGYEKSWEKDLRH 214


>UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep:
           Cathepsin b - Aedes aegypti (Yellowfever mosquito)
          Length = 332

 Score =  165 bits (401), Expect = 7e-40
 Identities = 74/191 (38%), Positives = 109/191 (57%), Gaps = 1/191 (0%)
 Frame = +2

Query: 14  TWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPT 193
           TW     F     F + + + G  +     +LP   HD     ++PE FD R+KWP C +
Sbjct: 41  TWTPDATFRDGIRFENFQNMKGIFESKIGFRLPTKRHDVAYNMDIPEFFDAREKWPYCKS 100

Query: 194 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGG-MP 370
           ++ I++QG CG+CWA  AV  M+DR+CI+S        +AEDL+ CC  CG GCNGG + 
Sbjct: 101 ISTIKNQGLCGACWAVAAVSVMSDRLCIHSEGKFDVELAAEDLMGCCKDCGNGCNGGFLD 160

Query: 371 TLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNCESSY 550
             +++YW  VGLVSG  YNS+ GC+PY   PC +   G    C+ + KTP C  +C   Y
Sbjct: 161 GTSFQYWVDVGLVSGAAYNSTDGCKPYPFKPCLYPFVG----CHPE-KTPSCTHHCTEGY 215

Query: 551 NVPFKKEQRYG 583
           +  +++++ YG
Sbjct: 216 DGTYRRDKYYG 226


>UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep:
           Cathepsin B - Uronema marinum
          Length = 350

 Score =  163 bits (397), Expect = 2e-39
 Identities = 84/194 (43%), Positives = 108/194 (55%), Gaps = 14/194 (7%)
 Frame = +2

Query: 11  NTWKAGRNFPTH-TPFAHIKILMGALKDDNILKLPKVTHDA-ELIANL--PENFDPRDKW 178
           +TWKAG N       F  I+ +MG +    +  +P   +   E I NL  PE+FD R+ +
Sbjct: 38  STWKAGYNKRFEGMSFDQIQAMMGTIATP-VHMIPDERYTPFETIQNLSLPESFDLREAY 96

Query: 179 PECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP---ICGL 349
           P+C +L ++RDQ +CGSCWAFG VEA++DR+CI S        S+E+L+SCC     CG+
Sbjct: 97  PKCESLQQVRDQSNCGSCWAFGTVEAISDRICIASGQKDQTRISSENLLSCCRGTFACGM 156

Query: 350 GCNGGMPTLAWEYWKHVGLVSGG-----NYNSSQGCRPYEIPPCEHHVPGNRMPCNG--D 508
           GCNGG    AW Y+   GLVSG      N NS   C+PY  PPC HHV G    C     
Sbjct: 157 GCNGGYTAGAWNYYVKTGLVSGNLYTDDNQNSKTECQPYSFPPCSHHVQGEYQACTDLPQ 216

Query: 509 TKTPKCQKNCESSY 550
             TPKC   C S Y
Sbjct: 217 FNTPKCYTECNSQY 230


>UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8.4;
           n=1; Caenorhabditis elegans|Rep: Putative
           uncharacterized protein W07B8.4 - Caenorhabditis elegans
          Length = 335

 Score =  161 bits (390), Expect = 1e-38
 Identities = 71/159 (44%), Positives = 100/159 (62%), Gaps = 7/159 (4%)
 Frame = +2

Query: 128 AELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHF 307
           AE   ++P+++D RD WP+C ++N IRDQ  CGSCWA  A EA++DR CI SN   +   
Sbjct: 67  AETADSIPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGDVNTLL 126

Query: 308 SAEDLVSCCP---ICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHV 478
           SAED+++CC     CG GC GG P  AW YW   GLV+GG++ S  GC+PY I PC   +
Sbjct: 127 SAEDILTCCTGKFNCGDGCEGGYPIQAWRYWVKNGLVTGGSFESQYGCKPYSIAPCGETI 186

Query: 479 PGNRMP-CNGD-TKTPKCQKNC--ESSYNVPFKKEQRYG 583
            G   P C    + TPKC+ +C   +SY +P+ +++ +G
Sbjct: 187 DGVTWPECPMKISDTPKCEHHCTGNNSYPIPYDQDKHFG 225


>UniRef50_Q237A1 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 346

 Score =  157 bits (381), Expect = 2e-37
 Identities = 78/197 (39%), Positives = 117/197 (59%), Gaps = 6/197 (3%)
 Frame = +2

Query: 11  NTWKAGRNFP-THTPFAHIKILMGA-LKDDNILKLPKVTHDAELIANLPENFDPRDKWPE 184
           +TWKAG N    ++  A +K  MG  L  ++ +KL  V+  A     LPE FD R +W +
Sbjct: 49  STWKAGENTKWINSDIAGVKAHMGVKLGQESGIKLETVSAQAN---GLPEEFDARVQWGD 105

Query: 185 -CPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNG 361
            C +L E+RDQ +CGSCWAFGA E+++DR CI+    +    S ++L++CC  CG GC+G
Sbjct: 106 KCSSLWEVRDQSTCGSCWAFGAAESLSDRHCIHLG--QDIRLSTQNLLTCCAACGDGCDG 163

Query: 362 GMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGN-RMPCNGDTKTPKCQKNC 538
           G P  A +Y+ + GLV+G  Y ++  C+ Y   PC HHV  +   PC G+  TP C  +C
Sbjct: 164 GWPEAAMDYYVNTGLVTGDLYGNNSWCQAYTFAPCAHHVTSDIYPPCTGELPTPPCINSC 223

Query: 539 E--SSYNVPFKKEQRYG 583
           +  S++ +P+ K+   G
Sbjct: 224 DSNSTHTIPYSKDIHRG 240


>UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2
           precursor; n=8; Haemonchus contortus|Rep: Cathepsin
           B-like cysteine proteinase 2 precursor - Haemonchus
           contortus (Barber pole worm)
          Length = 342

 Score =  157 bits (380), Expect = 2e-37
 Identities = 77/184 (41%), Positives = 104/184 (56%), Gaps = 4/184 (2%)
 Frame = +2

Query: 47  TPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCG 226
           TP    KI+    K   +  + K   D E+  ++P ++DPRD W  C T   IRDQ +CG
Sbjct: 56  TPDFEQKIMSIKYKHQKLNLMVKEDPDPEV--DIPPSYDPRDVWKNCTTFY-IRDQANCG 112

Query: 227 SCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAWEYWKHVG 403
           SCWA     A++DR+CI S A K  + SA D+++CC P CG GC GG P  AW+Y+ + G
Sbjct: 113 SCWAVSTAAAISDRICIASKAEKQVNISATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDG 172

Query: 404 LVSGGNYNSSQGCRPYEIPPCEHHVPGNRM---PCNGDTKTPKCQKNCESSYNVPFKKEQ 574
           +VSGG Y +   CRPY I PC HH  GN      C G   TP C++ C       ++ ++
Sbjct: 173 VVSGGEYLTKDVCRPYPIHPCGHH--GNDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDK 230

Query: 575 RYGK 586
           RYGK
Sbjct: 231 RYGK 234


>UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1
           precursor; n=3; Haemonchidae|Rep: Cathepsin B-like
           cysteine proteinase 1 precursor - Ostertagia ostertagi
          Length = 341

 Score =  155 bits (377), Expect = 5e-37
 Identities = 78/189 (41%), Positives = 106/189 (56%), Gaps = 7/189 (3%)
 Frame = +2

Query: 41  THTPFAHIKILMGALKDDNILKLP-KVTHDAELIAN---LPENFDPRDKWPECPTLNEIR 208
           T TP  + K  +  LK  +   +P +   D EL  N   +PE++DPR +W  C +L  I 
Sbjct: 52  TATPVPYFKQRLMDLKYIDQNNIPDEEVEDEELEENNDDIPESYDPRIQWANCSSLFHIP 111

Query: 209 DQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEY 388
           DQ +CGSCWA  +  AM+DR+CI S   K    SA+D+VSCC  CG GC GG P  A+ +
Sbjct: 112 DQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVVSCCTWCGDGCEGGWPISAFRF 171

Query: 389 WKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRM---PCNGDTKTPKCQKNCESSYNVP 559
               G+V+GG+YN+   CRPYEI PC HH  GN      C G   TP+C++ C   Y   
Sbjct: 172 HADEGVVTGGDYNTKGSCRPYEIHPCGHH--GNETYYGECVGMADTPRCKRRCLLGYPKS 229

Query: 560 FKKEQRYGK 586
           +  ++ Y K
Sbjct: 230 YPSDRYYKK 238


>UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4
           precursor; n=5; Caenorhabditis|Rep: Cathepsin B-like
           cysteine proteinase 4 precursor - Caenorhabditis elegans
          Length = 335

 Score =  153 bits (371), Expect = 3e-36
 Identities = 77/198 (38%), Positives = 102/198 (51%), Gaps = 5/198 (2%)
 Frame = +2

Query: 5   KQNTWKAGRNFPTHTPFAHIK--ILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKW 178
           KQ+ WKA    P       +K  ++       +   +  V HD      +P  FD R +W
Sbjct: 35  KQSLWKA--EIPKDITIEQVKKRLMRTEFVAPHTPDVEVVKHDINE-DTIPATFDARTQW 91

Query: 179 PECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCN 358
           P C ++N IRDQ  CGSCWAF A EA +DR CI SN   +   SAED++SCC  CG GC 
Sbjct: 92  PNCMSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSCCSNCGYGCE 151

Query: 359 GGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMP-CNGD-TKTPKCQK 532
           GG P  AW+Y    G  +GG+Y +  GC+PY + PC   V     P C  D   TP C  
Sbjct: 152 GGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPACVN 211

Query: 533 NC-ESSYNVPFKKEQRYG 583
            C   +YNV +  ++ +G
Sbjct: 212 KCTNKNYNVAYTADKHFG 229


>UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator
           americanus|Rep: Cysteine proteinase 4 - Necator
           americanus (Human hookworm)
          Length = 339

 Score =  153 bits (370), Expect = 4e-36
 Identities = 73/187 (39%), Positives = 104/187 (55%), Gaps = 3/187 (1%)
 Frame = +2

Query: 38  PTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQG 217
           PT+  F   +I+      +   K P+      L   LPE FD R+KWP C ++  IRD  
Sbjct: 54  PTNEQFVKARIMDIKYMTEASHKYPR--KGINLNVELPERFDAREKWPHCASIGLIRDHS 111

Query: 218 SCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAWEYWK 394
           +CGSCWA  A   M+DR+CI +N T     S+ D+++CC   CG GC GG P  A+ Y +
Sbjct: 112 ACGSCWAVSAASVMSDRLCIQTNGTNQKILSSADILACCGEDCGSGCEGGYPIQAYFYLE 171

Query: 395 HVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPC--NGDTKTPKCQKNCESSYNVPFKK 568
           + G+ SGG Y     C+PY   PC+    GN  PC   G   TPKC+K C+  Y VP+++
Sbjct: 172 NTGVCSGGEYREKNVCKPYPFYPCD----GNYGPCPKEGAFDTPKCRKICQFRYPVPYEE 227

Query: 569 EQRYGKH 589
           ++ +GK+
Sbjct: 228 DKVFGKN 234


>UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7;
           Rhabditida|Rep: Cysteine proteinase 3 - Necator
           americanus (Human hookworm)
          Length = 360

 Score =  153 bits (370), Expect = 4e-36
 Identities = 64/153 (41%), Positives = 91/153 (59%), Gaps = 1/153 (0%)
 Frame = +2

Query: 125 DAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH 304
           D +    +P +FD RDKWP+C ++  IRDQ  CGSCWA  + E M+DR+C+ SN T    
Sbjct: 83  DMDFSEEIPVSFDARDKWPKCTSIGFIRDQSHCGSCWAVSSAETMSDRLCVQSNGTIKVL 142

Query: 305 FSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPG 484
            S  D+++CCP CG GC GG    AWEY+K+ G+ +GG Y +   C+PY   PC+    G
Sbjct: 143 LSDTDILACCPNCGAGCGGGHTIRAWEYFKNTGVCTGGLYGTKDSCKPYAFYPCKDESYG 202

Query: 485 NRMPCNGDT-KTPKCQKNCESSYNVPFKKEQRY 580
               C  D+  TPKC+K C+  Y+  +  ++ Y
Sbjct: 203 K---CPKDSFPTPKCRKICQYKYSKKYADDKYY 232


>UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3
           precursor; n=4; Caenorhabditis|Rep: Cathepsin B-like
           cysteine proteinase 3 precursor - Caenorhabditis elegans
          Length = 370

 Score =  151 bits (367), Expect = 9e-36
 Identities = 68/148 (45%), Positives = 90/148 (60%), Gaps = 2/148 (1%)
 Frame = +2

Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325
           LP+ FD R+KWP+C T+  IR+Q +CGSCWAFGA E ++DRVCI SN T+    S ED++
Sbjct: 92  LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 151

Query: 326 SCC-PICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCN 502
           SCC   CG GC GG    A  +W   G V+GG+Y    GC PY   PC  + P       
Sbjct: 152 SCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDY-GGHGCMPYSFAPCTKNCP------- 203

Query: 503 GDTKTPKCQKNCESSYNV-PFKKEQRYG 583
            ++ TP C+  C+SSY    +KK++ YG
Sbjct: 204 -ESTTPSCKTTCQSSYKTEEYKKDKHYG 230


>UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2;
           Arthropoda|Rep: Cathepsin B-like cysteine protease -
           Callosobruchus maculatus (Southern cowpea weevil) (Pulse
           bruchid)
          Length = 330

 Score =  151 bits (365), Expect = 2e-35
 Identities = 71/197 (36%), Positives = 104/197 (52%), Gaps = 3/197 (1%)
 Frame = +2

Query: 5   KQNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPE 184
           K   WKAGRNF   T   +I+ L+     +   +   + H+ +   +LPE FD R +W +
Sbjct: 35  KNLPWKAGRNFERDTSLYNIQRLLSVGTINPPSEFETIFHEDDG-KDLPEEFDARKQWSK 93

Query: 185 CPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGL---GC 355
           C ++ EIRDQ  CGSCWA  +   M+DR+CI S+       SA D++ CC  C     GC
Sbjct: 94  CESIKEIRDQSGCGSCWAVSSASVMSDRICIQSDQKNQLRISAADMIECCESCTFSVDGC 153

Query: 356 NGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKN 535
           +GG+P+  +  WK  G VSGG YNS+ GC  Y +P C    P     C      P C+K 
Sbjct: 154 HGGIPSFTFTEWKDSGFVSGGEYNSTNGCMSYPLPRCN---PS----CKTLYDAPTCKKE 206

Query: 536 CESSYNVPFKKEQRYGK 586
           C+    + +++++ Y K
Sbjct: 207 CDKGSPLKYEEDKHYAK 223


>UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA;
           n=1; Tribolium castaneum|Rep: PREDICTED: similar to
           CG10992-PA - Tribolium castaneum
          Length = 325

 Score =  147 bits (355), Expect = 3e-34
 Identities = 70/152 (46%), Positives = 94/152 (61%), Gaps = 3/152 (1%)
 Frame = +2

Query: 5   KQNTWKAGRNFPTHTPFAHIKILMG--ALKDDNILKLPKVTHDAELIANLPENFDPRDKW 178
           +Q +WKA  N         IK  +G   L  D   K+    H    I ++PE+FD R+KW
Sbjct: 33  EQISWKAETNC------LDIKSRLGFLGLHPDPNYKIQTKQHKISRIISIPESFDAREKW 86

Query: 179 PECP-TLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355
           PEC   + +IR+QG+CGSCWAF + E MTDR+CI S     F FS E+L++CC  CG GC
Sbjct: 87  PECKDVIGKIRNQGNCGSCWAFASTEVMTDRLCISSKGKIKFVFSPENLLTCCKDCGCGC 146

Query: 356 NGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPY 451
            GG    AW+Y+ + G+ SGG+YNSS+GC+PY
Sbjct: 147 KGGYIKNAWDYYINEGIASGGDYNSSEGCQPY 178


>UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8;
           Leishmania|Rep: Cathepsin B-like protease - Leishmania
           major
          Length = 340

 Score =  144 bits (350), Expect = 1e-33
 Identities = 73/185 (39%), Positives = 97/185 (52%), Gaps = 5/185 (2%)
 Frame = +2

Query: 2   KKQNTWKAGRN---FPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRD 172
           K +  W A  N     T      ++ LMG          P+     EL  +LPE FD  +
Sbjct: 47  KAKGQWTASANNGYLVTGKSLGEVRKLMGVTDMSTEAVPPRNFSVEELQQDLPEFFDAAE 106

Query: 173 KWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLG 352
            WP C T++EIRDQ +CGSCWA  AVEA++DR C +         S  +L+SCC ICGLG
Sbjct: 107 HWPMCLTISEIRDQSNCGSCWAIAAVEAISDRYCTFGGVPDR-RMSTSNLLSCCFICGLG 165

Query: 353 CNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDT--KTPKC 526
           C+GG+PT+AW +W  VG+       +++ C+PY   PC HH    + P    T   TPKC
Sbjct: 166 CHGGIPTVAWLWWVWVGI-------ATEDCQPYPFDPCSHHGNSEKYPPCPSTIYDTPKC 218

Query: 527 QKNCE 541
              CE
Sbjct: 219 NTTCE 223


>UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15;
           Aphidoidea|Rep: Cathepsin B-S precursor - Tuberaphis
           styraci
          Length = 349

 Score =  144 bits (348), Expect = 2e-33
 Identities = 67/193 (34%), Positives = 98/193 (50%), Gaps = 4/193 (2%)
 Frame = +2

Query: 14  TWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIA--NLPENFDPRDKWPEC 187
           TWKA R FP +T   +   L+G+    N     ++     L    N P+ FD R+ W  C
Sbjct: 39  TWKAERYFPANTSEEYFIGLLGSRGYKNYTNEVEIKKYDPLYVENNSPKQFDSRENWKSC 98

Query: 188 PTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGM 367
             +  IRDQG+CGSCW+F    A  DR+C+ +    +   S E+L  CC  CG GC GG 
Sbjct: 99  KQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEELAFCCMDCGKGCGGGY 158

Query: 368 PTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNCESS 547
           P  AW+Y++  G+ +GG+Y++ +GC PY++PPC      N        +  +C K C   
Sbjct: 159 PIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCYDEQGKNTCGGKPMERNHQCPKTCYGK 218

Query: 548 YNVP--FKKEQRY 580
             V   +K +  Y
Sbjct: 219 TTVQDRYKTKNEY 231


>UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8.1;
           n=1; Caenorhabditis elegans|Rep: Putative
           uncharacterized protein W07B8.1 - Caenorhabditis elegans
          Length = 335

 Score =  143 bits (347), Expect = 2e-33
 Identities = 67/155 (43%), Positives = 92/155 (59%), Gaps = 7/155 (4%)
 Frame = +2

Query: 140 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319
           ++L  +FD R++WPEC ++ +I D   C + WAF A E+M+DR+CI S   K+   SAE+
Sbjct: 74  SDLSPSFDARERWPECMSIPQINDISECKTSWAFAAAESMSDRLCINSGGFKNTILSAEE 133

Query: 320 LVSCCP---ICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNR 490
           L+SCC     CG GC GG P  AW+Y +  G+ +GG+Y S  GC+PY IPPC   V    
Sbjct: 134 LLSCCTGMFSCGEGCEGGNPFKAWQYIQKHGIPTGGSYESQFGCKPYSIPPCGKTVGNVT 193

Query: 491 MP-CNGDTK-TPKCQKNCES--SYNVPFKKEQRYG 583
            P C   T  TP C+K C S   Y +   K++ YG
Sbjct: 194 YPACTNTTSPTPSCEKKCTSRIGYPIDIDKDRHYG 228


>UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115,
           whole genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_115,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 332

 Score =  143 bits (346), Expect = 3e-33
 Identities = 68/152 (44%), Positives = 87/152 (57%), Gaps = 11/152 (7%)
 Frame = +2

Query: 131 ELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFS 310
           E + NLP +F  ++KWP CP++  I DQG+CGSCWA  A   M+DR+CI S  T     S
Sbjct: 66  EKLENLPPSFSAQEKWPGCPSIELIPDQGNCGSCWAVSAASTMSDRLCIASGQTDKRQIS 125

Query: 311 AEDLVSCCPI-CGL----GCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEH- 472
           AEDL+SCC I C L    GC+GG P  AW+Y +  G+V+GG YN    C+PY  PPC H 
Sbjct: 126 AEDLLSCCGINCELDGNGGCDGGYPYGAWKYLRVDGIVTGGTYNDFSLCKPYSFPPCSHG 185

Query: 473 HVPGNRMPCNGD-----TKTPKCQKNCESSYN 553
           +  G    C  D       TP C K C   ++
Sbjct: 186 NDSGKYSKCENDFFMLTEVTPSCTKKCHPQFS 217


>UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4;
           Ancylostomatidae|Rep: Cysteine proteinase - Ancylostoma
           ceylanicum
          Length = 348

 Score =  142 bits (345), Expect = 4e-33
 Identities = 63/156 (40%), Positives = 89/156 (57%), Gaps = 3/156 (1%)
 Frame = +2

Query: 116 VTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATK 295
           V  + E+  ++P+ FD RD+WP C ++  IRDQ SCGSCWA  A  AM+DRVC  +N   
Sbjct: 84  VLANTEMKVDIPDTFDARDRWPNCTSMKHIRDQSSCGSCWAVAAASAMSDRVCALTNGRI 143

Query: 296 HFHFSAEDLVSCC-PICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEH 472
           +   S  +++SCC   CG GC GG P  A+ Y    GL +GG Y     C+PY   PC +
Sbjct: 144 NRILSDTEVLSCCFGSCGFGCKGGYPARAFGYAWRYGLSTGGPYGEKDACQPYAFYPCGN 203

Query: 473 HVPGNRM-PCNGDT-KTPKCQKNCESSYNVPFKKEQ 574
           H       PC  +   TP C++ C+  Y +PF+K++
Sbjct: 204 HAHEPYYGPCPDELWPTPTCRRTCQLGYPIPFEKDK 239


>UniRef50_P91991 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 356

 Score =  138 bits (334), Expect = 9e-32
 Identities = 77/206 (37%), Positives = 111/206 (53%), Gaps = 11/206 (5%)
 Frame = +2

Query: 2   KKQNTWKAGRNFPT-HTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKW 178
           KKQ  WKA  +  T     A  K +     +D + +  K  +D  L+ ++P +FD R KW
Sbjct: 46  KKQKLWKAETSRMTFQEKMARAKSIKFIKSNDEVSE--KTGNDNVLV-DIPSSFDSRQKW 102

Query: 179 PECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC----PIC- 343
           P C  +  +RDQ  CGS     AVE  +DR CI SN T ++  SA+D +SCC     IC 
Sbjct: 103 PSCSQIGAVRDQSDCGSAAHLVAVEIASDRTCIASNGTFNWPLSAQDPLSCCVGLMSICG 162

Query: 344 -GLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPG--NRMPCNGDTK 514
            G GC+G  P    ++W+  GL +GGNYN   GC+PY I PC+         +PC G   
Sbjct: 163 DGWGCDGSWPKDILKWWQTHGLCTGGNYNDQFGCKPYSIYPCDKKYANGTTSVPCPG-YH 221

Query: 515 TPKCQKNCESSYNVP--FKKEQRYGK 586
           TP C+++C S+   P  +K+++ +GK
Sbjct: 222 TPTCEEHCTSNITWPIAYKQDKHFGK 247


>UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7;
           n=2; Haemonchidae|Rep: Cathepsin B-like cysteine
           protease GCP7 - Haemonchus contortus (Barber pole worm)
          Length = 348

 Score =  136 bits (328), Expect = 5e-31
 Identities = 60/163 (36%), Positives = 92/163 (56%), Gaps = 2/163 (1%)
 Frame = +2

Query: 92  DNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRV 271
           +N+L +  +T + ++    PE+FD R+KW +CP+L  I DQ +CGSCWA  A + M+DR+
Sbjct: 82  ENVLPIANITSNDDI----PESFDSREKWKDCPSLRVIPDQSNCGSCWAVSAAQCMSDRL 137

Query: 272 CIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRP 448
           CI+S   K    SA D+++CC   CG GC+GG    AW++    G+V+GG Y     C+P
Sbjct: 138 CIHSQGRKKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGGAYKEKGNCKP 197

Query: 449 YEIPPCEHHVPGNRMPC-NGDTKTPKCQKNCESSYNVPFKKEQ 574
           Y  P C  H       C +    TP C+  C+  Y   ++ ++
Sbjct: 198 YVFPQCGAHKGKAFNNCPSHPYATPACKPYCQYGYGKRYENDK 240


>UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8;
           Trypanosoma|Rep: Cathepsin B-like cysteine protease -
           Trypanosoma brucei
          Length = 340

 Score =  134 bits (324), Expect = 1e-30
 Identities = 71/164 (43%), Positives = 89/164 (54%), Gaps = 5/164 (3%)
 Frame = +2

Query: 65  KILMGALKDDNILK-LPKVTH-DAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWA 238
           K L G +K +N    LPK    + E  A LP +FD  + WP CPT+ +I DQ +CGSCWA
Sbjct: 65  KRLNGVIKKNNNASILPKRRFTEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWA 124

Query: 239 FGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGG 418
             A  AM+DR C      +  H SA DL++CC  CG GCNGG P  AW Y+   GLVS  
Sbjct: 125 VAAASAMSDRFCT-MGGVQDVHISAGDLLACCSDCGDGCNGGDPDRAWAYFSSTGLVS-- 181

Query: 419 NYNSSQGCRPYEIPPCEHHVPGNR--MPCNG-DTKTPKCQKNCE 541
           +Y     C+PY  P C HH        PC+  +  TPKC   C+
Sbjct: 182 DY-----CQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCNYTCD 220


>UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin
           B-like cysteine proteinase 4 precursor (Cysteine
           protease-related 4); n=2; Tribolium castaneum|Rep:
           PREDICTED: similar to Cathepsin B-like cysteine
           proteinase 4 precursor (Cysteine protease-related 4) -
           Tribolium castaneum
          Length = 360

 Score =  134 bits (323), Expect = 2e-30
 Identities = 76/201 (37%), Positives = 102/201 (50%), Gaps = 8/201 (3%)
 Frame = +2

Query: 5   KQNTWKAGRNFPTHTPFAHIKILMGAL---KDDNI---LKLPKVTHDAELIANLPENFDP 166
           +Q+ W AG N     PF  I+  +G L    D N    +K P+ T +      +PE FD 
Sbjct: 29  QQSAWTAGIN-----PFDDIESRLGFLGIHPDPNFKPEIKEPQATQNV-----IPETFDA 78

Query: 167 RDKWPECPTL-NEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPIC 343
           R+ WPEC  +   IR+QG C S WAF A E M+DR+CI +N       S EDL+ CC  C
Sbjct: 79  REYWPECADIIGNIRNQGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPEDLIDCCHYC 138

Query: 344 GLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPK 523
           G  C GG    AW Y+   GLVSGG+YN+S GC+PY                N    TP 
Sbjct: 139 GNQCKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQPYS-------------ELNYYRITPP 185

Query: 524 CQKNCES-SYNVPFKKEQRYG 583
           C   C++  Y +P+  ++ +G
Sbjct: 186 CNTTCQNDKYPIPYVSDKHFG 206


>UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus
           contortus|Rep: Cysteine proteinase - Haemonchus
           contortus (Barber pole worm)
          Length = 350

 Score =  133 bits (322), Expect = 3e-30
 Identities = 66/169 (39%), Positives = 89/169 (52%), Gaps = 5/169 (2%)
 Frame = +2

Query: 95  NILKLPKVTHDAELIAN--LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDR 268
           N  KL KV    E   N  +PE+FD R  W  C ++  +RDQ  CGSCWA  A   M+DR
Sbjct: 75  NARKLYKVKKAEEQTTNEDIPESFDSRIVWKNCSSITYVRDQSRCGSCWAVSAASTMSDR 134

Query: 269 VCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCR 445
           +C+ +        S  D++SCC  +CG GC GG   LAWE+ +  G+V+GG Y     CR
Sbjct: 135 ICVQTKGKLQTILSDTDILSCCGRMCGDGCEGGYDHLAWEWVQRFGVVTGGPYQQKGVCR 194

Query: 446 PYEIPPCEHHVPGNRMPCNGD--TKTPKCQKNCESSYNVPFKKEQRYGK 586
           PY   PC  H  G R  C  D    TP C+  C+  Y   ++K++ + K
Sbjct: 195 PYAFHPCGLH-HGRRYDCPWDHSFSTPACKPYCQFGYGKRYEKDKFFVK 242


>UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core
           eudicotyledons|Rep: At1g02300/T6A9_10 - Arabidopsis
           thaliana (Mouse-ear cress)
          Length = 362

 Score =  133 bits (321), Expect = 3e-30
 Identities = 78/194 (40%), Positives = 103/194 (53%), Gaps = 5/194 (2%)
 Frame = +2

Query: 17  WKAGRNFP-THTPFAHIKILMGA--LKDDNILKLPKVTHDAELIANLPENFDPRDKWPEC 187
           WKA  N    +   A  K L+G         L +P V+HD  L   LP+ FD R  W +C
Sbjct: 62  WKASFNDRFANATVAEFKRLLGVKPTPKTEFLGVPIVSHDISL--KLPKEFDARTAWSQC 119

Query: 188 PTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP-ICGLGCNGG 364
            ++  I DQG CGSCWAFGAVE+++DR CI  N   +   S  DL++CC  +CG GCNGG
Sbjct: 120 TSIGRILDQGHCGSCWAFGAVESLSDRFCIKYN--MNVSLSVNDLLACCGFLCGQGCNGG 177

Query: 365 MPTLAWEYWKHVGLVSGGNYNSSQGCRPY-EIPPCEHHVPGNRMPCNGDTKTPKCQKNCE 541
            P  AW Y+KH G+V       ++ C PY +   C H  PG    C     TPKC + C 
Sbjct: 178 YPIAAWRYFKHHGVV-------TEECDPYFDNTGCSH--PG----CEPAYPTPKCARKCV 224

Query: 542 SSYNVPFKKEQRYG 583
           S  N  +++ + YG
Sbjct: 225 SG-NQLWRESKHYG 237


>UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1;
           Rhipicephalus appendiculatus|Rep: Midgut cysteine
           proteinase 1 - Rhipicephalus appendiculatus (Brown ear
           tick)
          Length = 332

 Score =  132 bits (318), Expect = 8e-30
 Identities = 68/196 (34%), Positives = 101/196 (51%), Gaps = 4/196 (2%)
 Frame = +2

Query: 14  TWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTH----DAELIANLPENFDPRDKWP 181
           TWKAGRNF      +H   + G      +      +H    + +     PE+F PR+ W 
Sbjct: 41  TWKAGRNFDEKR--SHSDCVQGGDGASVLTATSTSSHFTSYEEDSRWTCPESFTPREYWS 98

Query: 182 ECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNG 361
            C ++  IRDQ +CGSCWAF A E+++DR+CI++N     + SAEDL++CC  CG GC+G
Sbjct: 99  HCSSIRVIRDQSACGSCWAFAAAESISDRICIHTNGKVQVNISAEDLLACCHTCGHGCDG 158

Query: 362 GMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNCE 541
                +    +   LV      +  GC+PY +PPC   VP     C     TPKCQ  C 
Sbjct: 159 RCHCSSVAILQGRRLVP-EPVRTEDGCQPYSLPPC---VPN----CTHPEPTPKCQHVCR 210

Query: 542 SSYNVPFKKEQRYGKH 589
             Y   +++++ + K+
Sbjct: 211 KGYEKSYEEDKHFAKN 226


>UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep:
           Thiol protease - Trichuris suis
          Length = 348

 Score =  129 bits (311), Expect = 5e-29
 Identities = 65/166 (39%), Positives = 88/166 (53%), Gaps = 12/166 (7%)
 Frame = +2

Query: 125 DAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH 304
           D  L  ++P +FD R  W  C +LN IRDQ  CGSCWA  A E M+DR+C+ SN +    
Sbjct: 77  DRSLALSIPPSFDVRSLWHVC-SLNLIRDQAKCGSCWAVSAAETMSDRICVQSNCSIKAC 135

Query: 305 FSAEDLVSCCPI-CGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYE-IPPCEHHV 478
            S  D++SCC + CG GCNGG P  AW ++   G  +GG      GC+PY+   P   H+
Sbjct: 136 ISDTDILSCCGLYCGYGCNGGFPIEAWRHFTVAGNCTGGKTIDKYGCKPYKPTGPIGRHL 195

Query: 479 PGN-RMPCNGDT---------KTPKCQKNCESSYNVPFKKEQRYGK 586
             N   PC  DT          TP+C++ C   Y   +  ++ YGK
Sbjct: 196 KRNDYAPCPNDTYYGECVGMADTPRCKRRCLLGYPKSYPSDRYYGK 241


>UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep:
           Cysteine proteinase - Toxoplasma gondii
          Length = 569

 Score =  127 bits (307), Expect = 2e-28
 Identities = 59/142 (41%), Positives = 83/142 (58%), Gaps = 10/142 (7%)
 Frame = +2

Query: 146 LPENFDPRDKWPECP-TLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322
           +P +FD R  +P C   +  +RDQG CGSCWAF + EA  DR+CI S   +    SA+  
Sbjct: 274 VPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKRLMPLSAQHT 333

Query: 323 VSCC---PICGLGCNGGMPTLAWEYWKHVGLVSGGNYNS-SQG--CRPYEIPPCEHHVPG 484
            SCC        GCNGG P +AW +++  G+V+GG++++  +G  C PYE+P C HH   
Sbjct: 334 TSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAHHAKA 393

Query: 485 NRMPCNG---DTKTPKCQKNCE 541
               C+      KTPKC+K+CE
Sbjct: 394 PFPDCDATLVPRKTPKCRKDCE 415


>UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gambiae
           str. PEST|Rep: ENSANGP00000012227 - Anopheles gambiae
           str. PEST
          Length = 218

 Score =  127 bits (306), Expect = 2e-28
 Identities = 53/97 (54%), Positives = 69/97 (71%), Gaps = 1/97 (1%)
 Frame = +2

Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325
           +PE+FD R+ WP C +L  IR+QG+CGSCWA  A   M+DRVCI+SN T +   +AEDL+
Sbjct: 1   IPESFDARNHWPNCESLRAIRNQGTCGSCWAVAAASVMSDRVCIHSNGTINVALAAEDLM 60

Query: 326 SCCPICGLGCNGG-MPTLAWEYWKHVGLVSGGNYNSS 433
            CC  CG GCNGG +   +++YW   GLVSGG YNS+
Sbjct: 61  GCCVDCGNGCNGGFLDGTSFQYWVDAGLVSGGAYNST 97


>UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep:
           Cathepsin B - Triticum aestivum (Wheat)
          Length = 353

 Score =  120 bits (288), Expect = 3e-26
 Identities = 61/148 (41%), Positives = 85/148 (57%), Gaps = 2/148 (1%)
 Frame = +2

Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322
           +LP+ FD R +W  C T+  I DQG CG+CWAF AVEA+ DR CI+ N +     S  DL
Sbjct: 96  DLPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVEALQDRFCIHLNMS--VSLSVNDL 153

Query: 323 VSCCP-ICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPY-EIPPCEHHVPGNRMP 496
           ++CC  +CG GCNGG P  AW Y++  G+V       ++ C PY +   C+H  PG    
Sbjct: 154 LACCGFLCGSGCNGGYPISAWRYFRRSGVV-------TEECDPYFDQTGCQH--PG---- 200

Query: 497 CNGDTKTPKCQKNCESSYNVPFKKEQRY 580
           C     TPKCQ+ C+   N  +K+ + +
Sbjct: 201 CEPAYPTPKCQRKCKVE-NQAWKENKHF 227


>UniRef50_Q5DEC9 Cluster: SJCHGC06356 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC06356 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 279

 Score =  120 bits (288), Expect = 3e-26
 Identities = 52/158 (32%), Positives = 84/158 (53%), Gaps = 1/158 (0%)
 Frame = +2

Query: 116 VTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATK 295
           ++H++ +   +P +FD R  W  C T+ +I D+  C + WA   V++++DR+CI SN   
Sbjct: 19  ISHNS-INMEIPRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSISDRICIRSNGRI 77

Query: 296 HFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHH 475
               SA D +SC      GC  G       YW   G+V+GG+Y    GC+PY +P C +H
Sbjct: 78  SVQLSARDAISCG--FSPGCFHGSEVEVLVYWITYGIVTGGSYEDQSGCQPYPLPKCSYH 135

Query: 476 VPGNRMPCNGDT-KTPKCQKNCESSYNVPFKKEQRYGK 586
                + CN +T + P+C   C+  YN  +  ++ YG+
Sbjct: 136 PESRFLDCNNNTFEFPQCTNECQDGYNKTYDDDKFYGE 173


>UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 421

 Score =  118 bits (283), Expect = 1e-25
 Identities = 55/137 (40%), Positives = 76/137 (55%)
 Frame = +2

Query: 140 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319
           +++P+NFD R KWP CP+++ + +QG CGSC+A  A    +DR CI+SN T     S ED
Sbjct: 136 SDVPKNFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSNGTFKSLLSEED 195

Query: 320 LVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPC 499
           ++ CC +CG  C GG P  A  YW + GLV+GG      GCRPY        VP +    
Sbjct: 196 IIGCCSVCG-NCYGGDPLKALTYWVNQGLVTGGR----DGCRPYSF-DLSCGVPCSPATF 249

Query: 500 NGDTKTPKCQKNCESSY 550
               +   C K C++ Y
Sbjct: 250 FEAEEKRTCMKRCQNIY 266


>UniRef50_Q00W63 Cluster: Cysteine proteinase; n=2;
           Ostreococcus|Rep: Cysteine proteinase - Ostreococcus
           tauri
          Length = 362

 Score =  105 bits (251), Expect = 1e-21
 Identities = 60/148 (40%), Positives = 75/148 (50%), Gaps = 14/148 (9%)
 Frame = +2

Query: 146 LPENFDPRDKWPECPTL-NEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322
           LP+ FD R+KWP+C  L +E  DQG+CGSCWA    +AMTDR+CI +N   + H SA  L
Sbjct: 88  LPDTFDVREKWPKCAALVSEAVDQGACGSCWAVAPAKAMTDRLCIATNGAVNTHVSAIQL 147

Query: 323 VSCCP-----------ICG--LGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPP 463
           +SC             + G   GC GG PT A+E    VG+VSGG       C PY   P
Sbjct: 148 LSCNSHSNSAYTYDENLAGGSGGCMGGYPTEAYETAHRVGVVSGGLNGDQDTCMPYPFAP 207

Query: 464 CEHHVPGNRMPCNGDTKTPKCQKNCESS 547
           C H       PC        C + C+ S
Sbjct: 208 CHH-------PCE-PNHNAVCPRTCQRS 227


>UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|Rep:
           Cathepsin B - Streblomastix strix
          Length = 312

 Score =  101 bits (242), Expect = 1e-20
 Identities = 52/132 (39%), Positives = 71/132 (53%)
 Frame = +2

Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 316
           +ANLP+ FD R  WP C  + +I DQG CGSCWA  + E + DR CI S   +    S +
Sbjct: 73  VANLPDEFDSRTNWPNCQLIGKIYDQGHCGSCWAMSSFEVLQDRFCIKSEGKQTPELSPQ 132

Query: 317 DLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMP 496
            L SC P C  GCNGG  + A+ + +  G++        + C PY++  C+H  PG    
Sbjct: 133 HLTSCTPGCS-GCNGGWMSTAFGFMQSNGIL-------GEDCIPYQMGKCKH--PG---- 178

Query: 497 CNGDTKTPKCQK 532
           C+    TPKC K
Sbjct: 179 CS-TWPTPKCNK 189


>UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 314

 Score = 95.9 bits (228), Expect = 6e-19
 Identities = 56/137 (40%), Positives = 79/137 (57%), Gaps = 3/137 (2%)
 Frame = +2

Query: 5   KQNTWKAGRN--FPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKW 178
           K+++W A RN  F   T F  I  +MG  K     KL +  +  EL  ++P +FD R +W
Sbjct: 42  KKSSWTAHRNKNFEGKT-FGDIIGMMGTKKTAAPFKLTE--NGEELKGSIPTSFDSRVQW 98

Query: 179 PECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYS-NATKHFHFSAEDLVSCCPICGLGC 355
           P+C  ++ I +Q  CGSCWAF + E ++DR+CI S N T     S + LV+C      GC
Sbjct: 99  PDC--IHPILNQEQCGSCWAFSSSEVLSDRLCIASNNKTNPGALSPQTLVACDVYGNDGC 156

Query: 356 NGGMPTLAWEYWKHVGL 406
           +GG+P LAWEY +  GL
Sbjct: 157 SGGIPQLAWEYMELKGL 173


>UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1;
           Ostreococcus tauri|Rep: Cysteine proteinase Cathepsin F
           - Ostreococcus tauri
          Length = 498

 Score = 91.9 bits (218), Expect = 1e-17
 Identities = 60/154 (38%), Positives = 80/154 (51%), Gaps = 6/154 (3%)
 Frame = +2

Query: 41  THTPFAHIKILMGALK-DDNILKLPKVTHDAELIANLPENFDPRDKWPECPTL-NEIRDQ 214
           T +P+A      GA   D   + L +V  DA L  +LP +FD RD++P+C  L   +RDQ
Sbjct: 222 TLSPYASSDETHGAHPFDRKAVGLGRVKWDA-LKHSLPRHFDARDEYPKCARLIGTVRDQ 280

Query: 215 GSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGG--MPTLAWEY 388
           G CGSCWA  A E M DR+CI S   +    S +  +SC    G GC GG  + TL    
Sbjct: 281 GKCGSCWAVAATEIMNDRLCISSGGKEVAELSPQFALSCYN-SGAGCEGGDVVDTLTLAL 339

Query: 389 WKHVGLVSGGNYNSSQGCRPYEIPPCEH--HVPG 484
            K  G+  GG  +    C PY+  PC+H   +PG
Sbjct: 340 AK--GVPHGGMLDKG-ACLPYQFEPCDHPCMIPG 370


>UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 311

 Score = 91.9 bits (218), Expect = 1e-17
 Identities = 51/147 (34%), Positives = 72/147 (48%)
 Frame = +2

Query: 125 DAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH 304
           +  +  N+PENFD R +WP   +++ IR+QG CGSCWAFGA E ++DR  I S    +  
Sbjct: 76  EVRVAENIPENFDARKQWPG--SIHPIRNQGQCGSCWAFGASEVLSDRFAIASKNQIYVT 133

Query: 305 FSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPG 484
            SA+ LV  C +   GC+GG P  AW Y    GL++   Y       PY        +  
Sbjct: 134 LSAQQLVD-CDLDNSGCSGGWPINAWNYMVKTGLLTEQCYG------PYYAKQYTCRLTA 186

Query: 485 NRMPCNGDTKTPKCQKNCESSYNVPFK 565
           N   C           + +S+Y +P K
Sbjct: 187 NTTDCPWQPGVKARFYHAKSAYKLPAK 213


>UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC02853 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 181

 Score = 86.2 bits (204), Expect = 5e-16
 Identities = 43/92 (46%), Positives = 56/92 (60%), Gaps = 3/92 (3%)
 Frame = +2

Query: 17  WKAGRNFPTHTPFAHIKILMGALK---DDNILKLPKVTHDAELIANLPENFDPRDKWPEC 187
           WKA R     T   H K +MG L    D + L  P + H+ ++   LP+ FD R  W  C
Sbjct: 38  WKADRT-KRFTSIHHAKSMMGVLLNSVDQHKLHHPIIHHN-DINIKLPKYFDSRKYWKNC 95

Query: 188 PTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYS 283
            ++  IRDQ SCGSCWAFGAVE+M+DR+CI+S
Sbjct: 96  SSIRTIRDQSSCGSCWAFGAVESMSDRICIHS 127


>UniRef50_A4RYN2 Cluster: Predicted protein; n=1; Ostreococcus
           lucimarinus CCE9901|Rep: Predicted protein -
           Ostreococcus lucimarinus CCE9901
          Length = 330

 Score = 84.6 bits (200), Expect = 2e-15
 Identities = 52/143 (36%), Positives = 71/143 (49%), Gaps = 4/143 (2%)
 Frame = +2

Query: 146 LPENFDPRDKWPECPTL-NEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322
           LP +FD R  +P+C  L   +RDQG CGSCWA  A E M DR+C+ ++       S +  
Sbjct: 112 LPTSFDARVAYPKCSRLLGAVRDQGRCGSCWAVAATEVMNDRLCVATDGENADELSPQYA 171

Query: 323 VSCCPICGLGCNGG--MPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMP 496
           +SC    G GC+GG  + TL   + K  G+  GG  +S+  C PYE   C+H       P
Sbjct: 172 LSCFD-SGSGCDGGDVLDTLRIAFTK--GIPYGGMLDSN-ACLPYEFEACDH-------P 220

Query: 497 CNGDTKTPK-CQKNCESSYNVPF 562
           C     TP+ C   C     + F
Sbjct: 221 CMVAGTTPQSCPAKCADGSALSF 243


>UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella
           histriomuscorum|Rep: Cathepsin B - Oxytricha trifallax
           (Sterkiella histriomuscorum)
          Length = 294

 Score = 83.8 bits (198), Expect = 3e-15
 Identities = 50/123 (40%), Positives = 59/123 (47%), Gaps = 2/123 (1%)
 Frame = +2

Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 316
           I  +PENFD R +W     ++ IRDQ  CGSCWAFGA EA +DR  I     K    S E
Sbjct: 73  IMTVPENFDARQQWGS--KIHAIRDQQQCGSCWAFGATEAFSDRFAING---KDVILSPE 127

Query: 317 DLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGG--NYNSSQGCRPYEIPPCEHHVPGNR 490
           DLVS C     GCNGG   +AWEY    G  +     Y++  G  P     C       R
Sbjct: 128 DLVS-CDTNDYGCNGGYMDVAWEYLADHGAATDSCFPYSAGSGFAPACSDKCADGSAMQR 186

Query: 491 MPC 499
             C
Sbjct: 187 FKC 189


>UniRef50_Q625X3 Cluster: Putative uncharacterized protein CBG01102;
           n=1; Caenorhabditis briggsae|Rep: Putative
           uncharacterized protein CBG01102 - Caenorhabditis
           briggsae
          Length = 374

 Score = 79.4 bits (187), Expect = 6e-14
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
 Frame = +2

Query: 353 CNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMP-C-NGDTKTPKC 526
           C GG    AW+YW+  GL +GG+Y S  GC+PY I PC+  +     P C N   +TP C
Sbjct: 189 CAGGNVFKAWQYWQKHGLPTGGSYESQFGCKPYSISPCDTVIGNITFPGCLNSTVQTPSC 248

Query: 527 QKNCESSYNVPFKKEQRYG 583
           +K C+S Y V   K++ YG
Sbjct: 249 EKKCKSGYPVELDKDRHYG 267



 Score = 68.9 bits (161), Expect = 8e-11
 Identities = 28/59 (47%), Positives = 39/59 (66%)
 Frame = +2

Query: 158 FDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC 334
           FD R++WPEC ++  I D   C S WAF A E+M+DR+CI S    +   SA++L+SCC
Sbjct: 85  FDARERWPECSSIPIINDISDCKSSWAFSAAESMSDRLCINSGGMINTVLSAQELLSCC 143


>UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|Rep:
           Cysteine protease - Giardia muris
          Length = 301

 Score = 77.8 bits (183), Expect = 2e-13
 Identities = 42/109 (38%), Positives = 59/109 (54%), Gaps = 4/109 (3%)
 Frame = +2

Query: 92  DNILKLPKVTHDAEL----IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAM 259
           +N+  L   TH ++L       LP+++DPR +   C  L E+ DQ SCGSCWAF AV   
Sbjct: 55  ENLRSLRTETHVSQLNLGKTKELPKDYDPRVERAHC--LPEVADQASCGSCWAFSAVATF 112

Query: 260 TDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGL 406
            DR C Y   +K  H+S + +VSC    G  CNGG  +  W++    G+
Sbjct: 113 ADRRCAYGLDSKQVHYSEQYVVSCDFGDG-ACNGGWLSNVWKFLTKTGV 160


>UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|Rep:
           Cathepsin B - Streblomastix strix
          Length = 283

 Score = 75.4 bits (177), Expect = 9e-13
 Identities = 36/89 (40%), Positives = 52/89 (58%)
 Frame = +2

Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325
           +P+ FD R+KWP+   +  +RDQG CGSCWAF   E + DR+ +          + EDLV
Sbjct: 63  VPDTFDAREKWPDA--ILPVRDQGECGSCWAFSIAETIGDRLGVL--GCSRGDIAPEDLV 118

Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVGLVS 412
           S C I   GC+GG   +AW++ +  GL +
Sbjct: 119 S-CDIFDDGCDGGFIDMAWDWCQENGLTT 146


>UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA,
           isoform A; n=2; Endopterygota|Rep: PREDICTED: similar to
           CG3074-PA, isoform A - Tribolium castaneum
          Length = 445

 Score = 72.5 bits (170), Expect = 7e-12
 Identities = 44/126 (34%), Positives = 59/126 (46%), Gaps = 1/126 (0%)
 Frame = +2

Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322
           +LP  FD   KWP    ++EI+DQG CGS WA       +DR  I S   +    SA+ L
Sbjct: 196 SLPREFDSEFKWPGW--MSEIQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQHL 253

Query: 323 VSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGN-RMPC 499
           +SC       CNGG    AW Y + +GLV    +  S       IP     V  N ++P 
Sbjct: 254 LSCDRRGQQSCNGGYLDRAWSYIRKIGLVDEQCFPYSATNEKCRIPRRGDLVTANCQLPT 313

Query: 500 NGDTKT 517
           N D ++
Sbjct: 314 NVDRRS 319


>UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3;
           Giardia intestinalis|Rep: Cathepsin B-like CP1 precursor
           - Giardia lamblia (Giardia intestinalis)
          Length = 303

 Score = 72.1 bits (169), Expect = 9e-12
 Identities = 35/96 (36%), Positives = 50/96 (52%)
 Frame = +2

Query: 116 VTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATK 295
           +T   EL+  +P  FD RD++P+C  +    DQGSCGSCWAF A+    DR C      +
Sbjct: 69  ITEVQELVDPIPPQFDFRDEYPQC--VKPALDQGSCGSCWAFSAIGVFGDRRCAMGIDKE 126

Query: 296 HFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVG 403
              +S + L+S C +   GC+GG     W +    G
Sbjct: 127 AVSYSQQHLIS-CSLENFGCDGGDFQPTWSFLTFTG 161


>UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like protein
           F26E4.3; n=2; Caenorhabditis|Rep: Uncharacterized
           peptidase C1-like protein F26E4.3 - Caenorhabditis
           elegans
          Length = 491

 Score = 71.3 bits (167), Expect = 2e-11
 Identities = 35/93 (37%), Positives = 49/93 (52%)
 Frame = +2

Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325
           LPE+FD RDKW   P ++ + DQG CGS W+       +DR+ I S    +   S++ L+
Sbjct: 223 LPEHFDARDKWG--PLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQLL 280

Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424
           SC      GC GG    AW Y + +G+V    Y
Sbjct: 281 SCNQHRQKGCEGGYLDRAWWYIRKLGVVGDHCY 313


>UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;
           n=1; Strongylocentrotus purpuratus|Rep: PREDICTED:
           hypothetical protein - Strongylocentrotus purpuratus
          Length = 450

 Score = 70.9 bits (166), Expect = 2e-11
 Identities = 37/100 (37%), Positives = 49/100 (49%)
 Frame = +2

Query: 140 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319
           A LPE FD R+ WP    ++E+ DQG CGS WA       +DR+ I S    +   S + 
Sbjct: 195 ARLPETFDARENWPGL--IDEVIDQGKCGSSWAISTASVASDRLAIQSMGEINPRLSEQH 252

Query: 320 LVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQG 439
           L+SC      GC+GG    AW + +  G VS   Y    G
Sbjct: 253 LLSCNIRGQRGCSGGYLDRAWYHLRRAGAVSRACYPYHSG 292


>UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4;
           Giardia intestinalis|Rep: Cathepsin B-like CP2 precursor
           - Giardia lamblia (Giardia intestinalis)
          Length = 300

 Score = 70.1 bits (164), Expect = 4e-11
 Identities = 33/87 (37%), Positives = 49/87 (56%)
 Frame = +2

Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322
           ++PE+FD R+++P C  + E+ DQG CGSCWAF +V    DR C+     K   +S + +
Sbjct: 74  DVPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVAGLDKKPVKYSPQYV 131

Query: 323 VSCCPICGLGCNGGMPTLAWEYWKHVG 403
           VS C    + CNGG     W++    G
Sbjct: 132 VS-CDHGDMACNGGWLPNVWKFLTKTG 157


>UniRef50_UPI0000E4866E Cluster: PREDICTED: similar to oxidized-LDL
           responsive gene 2, partial; n=1; Strongylocentrotus
           purpuratus|Rep: PREDICTED: similar to oxidized-LDL
           responsive gene 2, partial - Strongylocentrotus
           purpuratus
          Length = 363

 Score = 69.3 bits (162), Expect = 6e-11
 Identities = 38/110 (34%), Positives = 58/110 (52%), Gaps = 1/110 (0%)
 Frame = +2

Query: 98  ILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCI 277
           +L + ++ +D    A +PE FD R +WP    +  +++QG+C S WA       +DR+ I
Sbjct: 207 VLTMHQIQNDMPPEA-IPEEFDARAQWPGL--VEGVQNQGNCASSWAMSTAATASDRLAI 263

Query: 278 YSNAT-KHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424
            SN T K+ H S + L+SC      GC GG    AW Y +  G+V+   Y
Sbjct: 264 QSNGTFKYMHLSPQHLLSCNVKRQQGCAGGHLDRAWWYMRKRGIVTEDCY 313


>UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_113_4299_5381 - Giardia lamblia ATCC
           50803
          Length = 360

 Score = 67.7 bits (158), Expect = 2e-10
 Identities = 33/87 (37%), Positives = 48/87 (55%)
 Frame = +2

Query: 149 PENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVS 328
           PE++D RD++P C T  E+ DQG+CGSCWAF +V+   D  C          +S + ++ 
Sbjct: 141 PESYDFRDEYPHCIT--EVVDQGNCGSCWAFSSVQTFADHRCRSGLDATGVSYSVQYVLD 198

Query: 329 CCPICGLGCNGGMPTLAWEYWKHVGLV 409
            C     GCNGG P  A+ +  + G V
Sbjct: 199 -CDRKDHGCNGGEPVNAFNFLHNTGTV 224


>UniRef50_O02470 Cluster: Cysteine proteinase; n=2; Chromadorea|Rep:
           Cysteine proteinase - Globodera pallida
          Length = 53

 Score = 67.7 bits (158), Expect = 2e-10
 Identities = 28/52 (53%), Positives = 34/52 (65%), Gaps = 1/52 (1%)
 Frame = +2

Query: 212 QGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPI-CGLGCNGG 364
           QG CG CWAF   E ++DR CI SN T+    S  DL++CC + CG GCNGG
Sbjct: 1   QGQCGRCWAFSTAEVISDRTCIASNGTQQPIISPTDLLTCCGMSCGEGCNGG 52


>UniRef50_UPI0000E815AE Cluster: PREDICTED: similar to
           glucocorticoid-inducible protein; n=1; Gallus
           gallus|Rep: PREDICTED: similar to
           glucocorticoid-inducible protein - Gallus gallus
          Length = 307

 Score = 67.3 bits (157), Expect = 2e-10
 Identities = 42/136 (30%), Positives = 60/136 (44%), Gaps = 1/136 (0%)
 Frame = +2

Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325
           LP +FD   KWP    ++E  DQG+C   WAF      +DR+ I+S        S ++L+
Sbjct: 153 LPRHFDAATKWPGM--IHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMTPSLSPQNLL 210

Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYN-SSQGCRPYEIPPCEHHVPGNRMPCN 502
           SC      GC+GG    AW Y +  G+V+   Y  +SQ  +P   P   H     R    
Sbjct: 211 SCDTRNQRGCSGGRLDGAWWYLRRRGVVTDECYPFTSQDSQPAAQPCMMHSRSTGRGKRQ 270

Query: 503 GDTKTPKCQKNCESSY 550
              + P  Q +    Y
Sbjct: 271 ATARCPNPQTHANDIY 286


>UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcoptes
           scabiei type hominis|Rep: Sar s 1 allergen Yv9053H09 -
           Sarcoptes scabiei type hominis
          Length = 253

 Score = 67.3 bits (157), Expect = 2e-10
 Identities = 39/98 (39%), Positives = 54/98 (55%), Gaps = 4/98 (4%)
 Frame = +2

Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAV----EAMTDRVCIYSNATKHFHFS 310
           +LPE FD RD       L++IR+QG CG+CWAF A+     A   R  I  N T+  HFS
Sbjct: 36  DLPEKFDLRD----LGYLSKIRNQGRCGACWAFAALASVESAYNRRTRIVHNRTRKHHFS 91

Query: 311 AEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424
            ++LV C P    GC+G + +   +Y +  G+V   NY
Sbjct: 92  EQELVDCSPNTE-GCSGNIISNGLKYVQLRGVVKSANY 128


>UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 234

 Score = 66.9 bits (156), Expect = 3e-10
 Identities = 35/87 (40%), Positives = 50/87 (57%)
 Frame = +2

Query: 134 LIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSA 313
           ++ ++P+  D R K      +NEI+DQ  CGSCWAFG+  AM     +       +  S 
Sbjct: 14  IVGDIPDEIDYRTKG----AVNEIKDQKHCGSCWAFGSCAAMESSWFLKHGTL--YSLSE 67

Query: 314 EDLVSCCPICGLGCNGGMPTLAWEYWK 394
           + LV CC  C LGC+G +P+LA+EY K
Sbjct: 68  QCLVDCCHDC-LGCHGCLPSLAFEYVK 93


>UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p;
           n=1; Nasonia vitripennis|Rep: PREDICTED: similar to
           GM06507p - Nasonia vitripennis
          Length = 483

 Score = 66.1 bits (154), Expect = 6e-10
 Identities = 36/115 (31%), Positives = 53/115 (46%), Gaps = 1/115 (0%)
 Frame = +2

Query: 83  LKDDNILKLPKVTHDAELIAN-LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAM 259
           L   +I ++PK      +  N LP  FD R +W     +  ++DQG CG+ WA   V+  
Sbjct: 214 LHSTDIFQIPKQNKQQWINPNDLPREFDSRIQWGN--DITPVQDQGWCGASWAISTVDVA 271

Query: 260 TDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424
           +DR  I S   +    S + L+SC      GC GG    AW + +  G+V    Y
Sbjct: 272 SDRFAIMSKGIEKVQLSGQHLISCNNRGQRGCKGGYLDRAWLFMRKFGVVDEDCY 326


>UniRef50_Q9UJW2 Cluster: Tubulointerstitial nephritis antigen;
           n=20; Amniota|Rep: Tubulointerstitial nephritis antigen
           - Homo sapiens (Human)
          Length = 476

 Score = 64.9 bits (151), Expect = 1e-09
 Identities = 37/109 (33%), Positives = 51/109 (46%)
 Frame = +2

Query: 98  ILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCI 277
           +L + ++T       +LPE F    KWP   T   + DQ +C + WAF       DR+ I
Sbjct: 201 LLSMNEMTASLPATTDLPEFFVASYKWPGW-THGPL-DQKNCAASWAFSTASVAADRIAI 258

Query: 278 YSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424
            S      + S ++L+SCC     GCN G    AW Y +  GLVS   Y
Sbjct: 259 QSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRAWWYLRKRGLVSHACY 307


>UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase;
           n=21; Bilateria|Rep: Cathepsin L-like cysteine
           proteinase - Globodera pallida
          Length = 379

 Score = 64.1 bits (149), Expect = 2e-09
 Identities = 46/134 (34%), Positives = 67/134 (50%), Gaps = 7/134 (5%)
 Frame = +2

Query: 14  TWKAGRNFPTHTPFAHIKILMG--ALKDDNILKLPKVTHDAELIANLPENFDPRDK-WPE 184
           T++ G N     PF+  K L G   L  DN+ +          + +LPE+ D RDK W  
Sbjct: 115 TFRVGENHIADLPFSEYKKLNGYRRLLGDNLRRNASTFLAPMNVGDLPESVDWRDKGW-- 172

Query: 185 CPTLNEIRDQGSCGSCWAF---GAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG-LG 352
              + E+++QG CGSCWAF   GA+EA   R        +    S ++L+ C    G +G
Sbjct: 173 ---VTEVKNQGMCGSCWAFSSTGALEAQHAR-----QTGQLISLSEQNLIDCSKKYGNMG 224

Query: 353 CNGGMPTLAWEYWK 394
           CNGG+   A++Y K
Sbjct: 225 CNGGIMDNAFQYIK 238


>UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           L-like proteinase" precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 315

 Score = 63.3 bits (147), Expect = 4e-09
 Identities = 31/83 (37%), Positives = 44/83 (53%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355
           W +   L  ++DQG CGSCWAF    ++  ++ I+ N  +    S ++LV C      GC
Sbjct: 117 WRDSAVLG-VKDQGQCGSCWAFSTTGSLEGQLAIHKN--QRVPLSEQELVDCDTSRNAGC 173

Query: 356 NGGMPTLAWEYWKHVGLVSGGNY 424
           NGG+ T A+ Y K  GL S   Y
Sbjct: 174 NGGLMTDAFNYVKRHGLSSESQY 196


>UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheirus
           salmonis|Rep: Cysteine proteinase - Lepeophtheirus
           salmonis (salmon louse)
          Length = 372

 Score = 62.9 bits (146), Expect = 5e-09
 Identities = 36/106 (33%), Positives = 54/106 (50%), Gaps = 5/106 (4%)
 Frame = +2

Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 316
           I +LPE+ D    W E   + ++++QGSCGSCW F AVE +   V I +N T     S +
Sbjct: 112 IKDLPESVD----WREKGVITDVKNQGSCGSCWVFSAVEQIESYVAIENNMTSPPLLSTQ 167

Query: 317 DLVSCCP---ICG--LGCNGGMPTLAWEYWKHVGLVSGGNYNSSQG 439
            + SC      CG   GC G +  +A+ Y +  G+ +   Y  + G
Sbjct: 168 QITSCSSNPYSCGGSGGCKGAINEIAYMYTQLYGIETEKEYPYTSG 213


>UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin C;
           n=1; Strongylocentrotus purpuratus|Rep: PREDICTED:
           similar to cathepsin C - Strongylocentrotus purpuratus
          Length = 482

 Score = 62.5 bits (145), Expect = 7e-09
 Identities = 43/131 (32%), Positives = 60/131 (45%), Gaps = 8/131 (6%)
 Frame = +2

Query: 140 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319
           +NLPE FD RD       ++ +RDQG CGSC+AF +      R+ + +N       S ++
Sbjct: 247 SNLPEKFDWRDVGG-IDYVSPVRDQGICGSCYAFASTATQESRLRVMTNNNVKVVMSPQE 305

Query: 320 LVSCCPICGLGCNGGMPTL-AWEYWKHVGLVSGGNYNSSQGCRPYEIPPCE-------HH 475
           +VSC      GC GG P L A +Y +  GLV    Y   +   P     C        H+
Sbjct: 306 VVSCSEY-AQGCEGGFPYLIAGKYGQDFGLVDETCYPYRERDAPCRQVSCRRFRTSEYHY 364

Query: 476 VPGNRMPCNGD 508
           + G    CN D
Sbjct: 365 IGGFYGACNED 375


>UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p -
           Drosophila melanogaster (Fruit fly)
          Length = 431

 Score = 62.5 bits (145), Expect = 7e-09
 Identities = 32/97 (32%), Positives = 47/97 (48%)
 Frame = +2

Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325
           LP +F+  DKW     ++E+ DQG CG+ W        +DR  I S   ++   SA++++
Sbjct: 187 LPSSFNALDKWSSY--ISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNIL 244

Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQ 436
           SC      GC GG    AW Y    G+V    Y  +Q
Sbjct: 245 SCTR-RQQGCEGGHLDAAWRYLHKKGVVDENCYPYTQ 280


>UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor;
           n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase
           precursor - Diabrotica virgifera virgifera (western corn
           rootworm)
          Length = 326

 Score = 62.5 bits (145), Expect = 7e-09
 Identities = 35/104 (33%), Positives = 48/104 (46%)
 Frame = +2

Query: 110 PKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNA 289
           P+V H    + +LP  FD    W E   + E++DQGSCGSCW+F      T     +   
Sbjct: 98  PRVIHSLTPVKDLPSKFD----WREKGAVTEVKDQGSCGSCWSFSTTG--TVEGAYFLKT 151

Query: 290 TKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGN 421
            K    S ++LV C      GC+GG    A EY +  G +   N
Sbjct: 152 GKLVSLSEQNLVDCAKEDCYGCSGGYMDKALEYIETAGGIMSEN 195


>UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11;
           Trichomonadidae|Rep: Cysteine protease - Tritrichomonas
           foetus (Trichomonas foetus)
          Length = 315

 Score = 62.5 bits (145), Expect = 7e-09
 Identities = 31/78 (39%), Positives = 45/78 (57%)
 Frame = +2

Query: 155 NFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC 334
           N D  D W E   +NEI+DQ +CGSCWAF A++A      I +   +   +S ++LV C 
Sbjct: 100 NVDSID-WREKGVVNEIKDQAACGSCWAFSAIQAAESAYAISTGTLE--SYSEQNLVDCV 156

Query: 335 PICGLGCNGGMPTLAWEY 388
             C  GC+GG+   A++Y
Sbjct: 157 QGC-YGCSGGLMDYAYKY 173


>UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-like
           precursor; n=26; Euteleostomi|Rep: Tubulointerstitial
           nephritis antigen-like precursor - Homo sapiens (Human)
          Length = 467

 Score = 62.5 bits (145), Expect = 7e-09
 Identities = 39/112 (34%), Positives = 52/112 (46%), Gaps = 2/112 (1%)
 Frame = +2

Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325
           LP  F+  +KWP    ++E  DQG+C   WAF      +DRV I+S        S ++L+
Sbjct: 203 LPTAFEASEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLL 260

Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPY--EIPPCEHH 475
           SC      GC GG    AW + +  G+VS   Y  S   R      PPC  H
Sbjct: 261 SCDTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMH 312


>UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor;
           n=35; Fasciola|Rep: Cathepsin L-like proteinase
           precursor - Fasciola hepatica (Liver fluke)
          Length = 326

 Score = 62.5 bits (145), Expect = 7e-09
 Identities = 31/84 (36%), Positives = 43/84 (51%), Gaps = 1/84 (1%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLG 352
           W E   + E++DQG+CGSCWAF     M  +     N      FS + LV C  P    G
Sbjct: 114 WRESGYVTEVKDQGNCGSCWAFSTTGTMEGQ--YMKNERTSISFSEQQLVDCSGPWGNNG 171

Query: 353 CNGGMPTLAWEYWKHVGLVSGGNY 424
           C+GG+   A++Y K  GL +  +Y
Sbjct: 172 CSGGLMENAYQYLKQFGLETESSY 195


>UniRef50_Q24FN2 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 367

 Score = 62.1 bits (144), Expect = 9e-09
 Identities = 32/87 (36%), Positives = 49/87 (56%), Gaps = 4/87 (4%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC----PIC 343
           W +   ++ +++QGSCGSCWAF AV A+ + V +  N +    +S ++LV C        
Sbjct: 161 WRQSGAVSPVKNQGSCGSCWAFSAV-ALAESVNLLRNNSLAL-YSEQELVDCTYKNPQYY 218

Query: 344 GLGCNGGMPTLAWEYWKHVGLVSGGNY 424
             GC GG P++A+ Y K  G+ S  NY
Sbjct: 219 NYGCQGGWPSVAYRYIKDQGISSQQNY 245


>UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Rep:
           Cathepsin L - Stylonychia lemnae
          Length = 340

 Score = 61.7 bits (143), Expect = 1e-08
 Identities = 34/100 (34%), Positives = 48/100 (48%)
 Frame = +2

Query: 104 KLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYS 283
           K  K  +    + ++PE+ D    W E   +N ++DQG CGSCWAF  + ++  R  I +
Sbjct: 111 KTGKEVYSTPNLKDIPESID----WREKGAVNAVKDQGQCGSCWAFSTIASLESRYFIET 166

Query: 284 NATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVG 403
              K    S + LV C      GCNGG   LA +Y    G
Sbjct: 167 G--KLQSLSEQQLVDCSKNGNEGCNGGDMGLAMDYIASAG 204


>UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35;
           Viridiplantae|Rep: Cysteine proteinase 15A precursor -
           Pisum sativum (Garden pea)
          Length = 363

 Score = 61.7 bits (143), Expect = 1e-08
 Identities = 40/106 (37%), Positives = 52/106 (49%), Gaps = 10/106 (9%)
 Frame = +2

Query: 101 LKLPKVTHDAELI--ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVC 274
           L+LP     A ++   NLPE+FD R+K    P    ++DQGSCGSCWAF    A+     
Sbjct: 115 LRLPAHAQKAPILPTTNLPEDFDWREKGAVTP----VKDQGSCGSCWAFSTTGALEG--A 168

Query: 275 IYSNATKHFHFSAEDLVSCCPI--------CGLGCNGGMPTLAWEY 388
            Y    K    S + LV C  +        C  GCNGG+   A+EY
Sbjct: 169 HYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEY 214


>UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           L-like proteinase precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 317

 Score = 61.3 bits (142), Expect = 2e-08
 Identities = 44/134 (32%), Positives = 60/134 (44%), Gaps = 5/134 (3%)
 Frame = +2

Query: 80  ALKDDNILKLPKVTHDAELIAN----LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGA 247
           A+ D  ++  PK    +  +A+    +PE+ D    W E   +N +RDQ  CGSCWAF A
Sbjct: 78  AMLDSQLIHKPKRDITSRFVADPQLTVPESID----WREKGAVNPVRDQEQCGSCWAFSA 133

Query: 248 VEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424
             A+  +   +    K    S + LV C       GCNGG P  A++Y K  GL     Y
Sbjct: 134 AGALEGQ--RFLKEGKLEVLSTQQLVDCSRDYKNEGCNGGWPHWAYDYIKDNGLCLESKY 191

Query: 425 NSSQGCRPYEIPPC 466
              QG   Y    C
Sbjct: 192 -KYQGYDGYYCKEC 204


>UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2;
           Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio
           molitor (Yellow mealworm)
          Length = 336

 Score = 61.3 bits (142), Expect = 2e-08
 Identities = 43/129 (33%), Positives = 63/129 (48%), Gaps = 2/129 (1%)
 Frame = +2

Query: 113 KVTHDAELIANL--PENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSN 286
           K   D  L A++  P +FD RD+    P    +++QGSCGSCWAF +  A+  ++ I + 
Sbjct: 108 KTREDLGLNASVRYPASFDWRDQGMVSP----VKNQGSCGSCWAFSSTGAIESQMKIANG 163

Query: 287 ATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPC 466
           A      S + LV C P   LGC+GG    A+ Y    G +       S+G  PYE+   
Sbjct: 164 AGYDSSVSEQQLVDCVP-NALGCSGGWMNDAFTYVAQNGGI------DSEGAYPYEMADG 216

Query: 467 EHHVPGNRM 493
             H   N++
Sbjct: 217 NCHYDPNQV 225


>UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1;
           Dictyostelium discoideum AX4|Rep: Counting factor
           associated protein - Dictyostelium discoideum AX4
          Length = 531

 Score = 61.3 bits (142), Expect = 2e-08
 Identities = 34/103 (33%), Positives = 54/103 (52%), Gaps = 2/103 (1%)
 Frame = +2

Query: 122 HDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHF 301
           HD E + ++P   D R++   C T   ++DQG CGSCW FG+  ++    C+ +   +  
Sbjct: 301 HDDESLRSIPSTVDWRNQ--NCVT--PVKDQGICGSCWTFGSTGSLEGTNCVTNG--ELV 354

Query: 302 HFSAEDLVSCCPICG-LGCNGGMPTLAWEYWKHVG-LVSGGNY 424
             S + LV C  + G  GC GG  + A++Y   +G L +  NY
Sbjct: 355 SLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGSLATESNY 397


>UniRef50_Q23RT7 Cluster: Papain family cysteine protease containing
           protein; n=7; Hymenostomatida|Rep: Papain family
           cysteine protease containing protein - Tetrahymena
           thermophila SB210
          Length = 387

 Score = 61.3 bits (142), Expect = 2e-08
 Identities = 42/134 (31%), Positives = 61/134 (45%), Gaps = 5/134 (3%)
 Frame = +2

Query: 47  TPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCG 226
           T   + K +  A    N+ +  K T D   + +LP++ D    W +   +  ++DQG CG
Sbjct: 101 TTLGYSKTVKNAANKQNMFRNLK-TSDKINVKDLPKSVD----WRDAGVVTPVKDQGHCG 155

Query: 227 SCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP---ICG--LGCNGGMPTLAWEYW 391
           SCWAF     +     I +   K    S + LVSC      CG   GCNG +  LA+ Y 
Sbjct: 156 SCWAFATTAVIESYAAIATGQLK--TLSTQQLVSCVQNSYQCGGQGGCNGAVSELAYNYV 213

Query: 392 KHVGLVSGGNYNSS 433
           +  GL S   Y+ S
Sbjct: 214 QLFGLTSEYKYSYS 227


>UniRef50_Q24E33 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 328

 Score = 60.9 bits (141), Expect = 2e-08
 Identities = 38/142 (26%), Positives = 65/142 (45%), Gaps = 2/142 (1%)
 Frame = +2

Query: 5   KQNTWKAGRNFPTH-TPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWP 181
           K NT+K   N     T   +  + +   + ++I     +  D E + ++P   +    W 
Sbjct: 79  KNNTFKLAINIMAILTDEEYSSLYLNLDQQESIDIFDSLVDDNETVGDIPSEVN----WT 134

Query: 182 ECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPI-CGLGCN 358
               +  +++QGSCGSCWAF    A+     + +N  +   FS + LV C  +   +GCN
Sbjct: 135 AQGAVTPVKNQGSCGSCWAFSTTGALEGSYFLKNN--QLISFSEQQLVDCSRLYLNMGCN 192

Query: 359 GGMPTLAWEYWKHVGLVSGGNY 424
           GG+   A+ Y K  G+ +   Y
Sbjct: 193 GGLMPRAFRYVKAHGITTEEEY 214


>UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase
           precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine
           proteinase precursor - Heterodera glycines (Soybean cyst
           nematode worm)
          Length = 353

 Score = 60.9 bits (141), Expect = 2e-08
 Identities = 33/84 (39%), Positives = 45/84 (53%), Gaps = 1/84 (1%)
 Frame = +2

Query: 140 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319
           + LPE  D    W E   + E++DQG CGSCWAF A  A+ +       A+K    S ++
Sbjct: 133 STLPEKLD----WREKGAVTEVKDQGDCGSCWAFSATGAI-EGALAQKKASKIISLSEQN 187

Query: 320 LVSCCPICG-LGCNGGMPTLAWEY 388
           LV C    G  GC+GG+   A+EY
Sbjct: 188 LVDCSSKYGNEGCDGGLMDSAFEY 211


>UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=15;
           Magnoliophyta|Rep: Cysteine proteinase RD19a precursor -
           Arabidopsis thaliana (Mouse-ear cress)
          Length = 368

 Score = 60.9 bits (141), Expect = 2e-08
 Identities = 43/118 (36%), Positives = 59/118 (50%), Gaps = 11/118 (9%)
 Frame = +2

Query: 104 KLPKVTHDAELIA--NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCI 277
           KLPK  + A ++   NLPE+FD RD     P    +++QGSCGSCW+F A  A+     +
Sbjct: 119 KLPKDANKAPILPTENLPEDFDWRDHGAVTP----VKNQGSCGSCWSFSATGALEGANFL 174

Query: 278 YSNATKHFHFSAEDLVSC--------CPICGLGCNGGMPTLAWEY-WKHVGLVSGGNY 424
            +   K    S + LV C           C  GCNGG+   A+EY  K  GL+   +Y
Sbjct: 175 ATG--KLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDY 230


>UniRef50_Q9GZM7-2 Cluster: Isoform 2 of Q9GZM7 ; n=1; Homo
           sapiens|Rep: Isoform 2 of Q9GZM7 - Homo sapiens (Human)
          Length = 283

 Score = 60.5 bits (140), Expect = 3e-08
 Identities = 32/98 (32%), Positives = 47/98 (47%)
 Frame = +2

Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325
           LP  F+  +KWP    ++E  DQG+C   WAF      +DRV I+S        S ++L+
Sbjct: 69  LPTAFEASEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLL 126

Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQG 439
           SC      GC GG    AW + +  G  + G+    +G
Sbjct: 127 SCDTHQQQGCRGGRLDGAWWFLRRRGYAATGDVGREEG 164


>UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia
           medanensis|Rep: Sui m 1 allergen - Suidasia medanensis
          Length = 336

 Score = 60.5 bits (140), Expect = 3e-08
 Identities = 45/142 (31%), Positives = 65/142 (45%), Gaps = 11/142 (7%)
 Frame = +2

Query: 125 DAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH 304
           ++++   LP  FD R +W        +R+QG CGSCWAF     +  +  I  N   H  
Sbjct: 108 ESDISVALPAAFDWRQQWNTA-----VRNQGQCGSCWAFATAATVEAQYAIRKNV--HVT 160

Query: 305 FSAEDLVSC--CPICGL----GCNGGMPTLAWEYWKHVGLV--SGGNYNSSQG-CRPYEI 457
            S + LV C   P  G     GC GG P +A+ Y +  GLV  S   Y +  G C+   +
Sbjct: 161 LSEQQLVDCDHRPFQGQYEDHGCQGGNPIIAYAYVQQTGLVEESAYPYQARDGQCQSSTV 220

Query: 458 PPCE-HHV-PGNRMPCNGDTKT 517
              + +HV  G  +P N   +T
Sbjct: 221 NGHQRYHVSAGRELPFNATDET 242


>UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2;
           cellular organisms|Rep: Cysteine proteinase, putative -
           Archaeoglobus fulgidus
          Length = 1088

 Score = 60.5 bits (140), Expect = 3e-08
 Identities = 29/72 (40%), Positives = 41/72 (56%)
 Frame = +2

Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 316
           +A+LP  FD    W +   L+ +RDQGSCGSCWA  AV A+   + + S A+     S +
Sbjct: 591 MASLPSRFD----WRDYTGLSAVRDQGSCGSCWAHSAVAALESALIVESGASSSIDLSEQ 646

Query: 317 DLVSCCPICGLG 352
            L+SC   C +G
Sbjct: 647 HLLSCEQDCEVG 658


>UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag-RP
           - Bombyx mori (Silk moth)
          Length = 404

 Score = 60.1 bits (139), Expect = 4e-08
 Identities = 44/129 (34%), Positives = 62/129 (48%), Gaps = 5/129 (3%)
 Frame = +2

Query: 83  LKDDNILKLPKVTHDAELIA-----NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGA 247
           LKD  I KL     +  +I+       P+ FD R +W     ++ I DQ  CGS WA   
Sbjct: 159 LKDGLIYKLGTFPLNVTVISYSKDGQYPDEFDARREW--YGYISPIADQDWCGSDWAVSI 216

Query: 248 VEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYN 427
              + DR  I S  T++   S++ L+SC      GCNGG   +A+++ K  GLV      
Sbjct: 217 ASIVGDRFSIQSFGTENVRMSSQTLLSCHLKGQRGCNGGNLDIAFDFVKTHGLV------ 270

Query: 428 SSQGCRPYE 454
            S+ C PYE
Sbjct: 271 -SEQCFPYE 278


>UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|Rep:
           Cathepsin L precursor - Schistosoma mansoni (Blood
           fluke)
          Length = 319

 Score = 60.1 bits (139), Expect = 4e-08
 Identities = 35/97 (36%), Positives = 51/97 (52%), Gaps = 1/97 (1%)
 Frame = +2

Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 316
           + N+P+NFD    W E   + E+++QG CGSCWAF     +  +   +    K    S +
Sbjct: 102 VNNIPKNFD----WREKGAVTEVKNQGMCGSCWAFSTTGNVESQ--WFRKTGKLLSLSEQ 155

Query: 317 DLVSCCPICGLGCNGGMPTLAWE-YWKHVGLVSGGNY 424
            LV C  +   GCNGG+P+ A+E   K  GL+   NY
Sbjct: 156 QLVDCDGLDD-GCNGGLPSNAYESIIKMGGLMLEDNY 191


>UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium
           tetraurelia|Rep: Cathepsin L1 precursor - Paramecium
           tetraurelia
          Length = 314

 Score = 60.1 bits (139), Expect = 4e-08
 Identities = 32/75 (42%), Positives = 41/75 (54%), Gaps = 1/75 (1%)
 Frame = +2

Query: 203 IRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLA 379
           +++QGSCGSCWAF AV A+     I  N  + +  S +DLV C  P    GCNGG    A
Sbjct: 126 VKNQGSCGSCWAFSAVGALEINTDIELN--RKYELSEQDLVDCSGPYDNDGCNGGWMDSA 183

Query: 380 WEYWKHVGLVSGGNY 424
           +EY    GL    +Y
Sbjct: 184 FEYVADNGLAEAKDY 198


>UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep:
           Vivapain-4 - Plasmodium vivax
          Length = 484

 Score = 59.7 bits (138), Expect = 5e-08
 Identities = 35/100 (35%), Positives = 53/100 (53%), Gaps = 2/100 (2%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355
           W E   ++EI++Q  CGSCWAFGAV A+  +  I  N  +H   S ++LV C      GC
Sbjct: 268 WREHNAVSEIKNQNLCGSCWAFGAVGAVESQYAIRKN--QHVLISEQELVDCSD-KNFGC 324

Query: 356 NGGMPTLAWEYWKHVGLVSGGNYNSSQGCRP--YEIPPCE 469
            GG+ +LA++    +G +   +     G +P   EI  C+
Sbjct: 325 FGGLASLAFDDMIDLGYLCSESDYPYVGFKPRKCEIKKCK 364


>UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan
           CA, family C1, cathepsin L-like cysteine peptidase -
           Trichomonas vaginalis G3
          Length = 306

 Score = 59.7 bits (138), Expect = 5e-08
 Identities = 28/82 (34%), Positives = 44/82 (53%)
 Frame = +2

Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322
           N+  +      W E   +N+I++QG+CGSCWAF A++ +  +V    N  + +  S ++L
Sbjct: 83  NIKNDVPTEIDWREQGIVNKIKNQGACGSCWAFSAIQVIESQVA--KNQKQLYDLSEQNL 140

Query: 323 VSCCPICGLGCNGGMPTLAWEY 388
           + C   C  GC GG    A EY
Sbjct: 141 LDCVTSC-FGCGGGWSPGALEY 161


>UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep:
           Cathepsin L - Kudoa thyrsites
          Length = 300

 Score = 59.7 bits (138), Expect = 5e-08
 Identities = 36/116 (31%), Positives = 58/116 (50%), Gaps = 2/116 (1%)
 Frame = +2

Query: 110 PKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNA 289
           PK T   ++ + LP + D    W     +  +++QG CGSCW+F A  A+     I +  
Sbjct: 90  PKETATKDIKSTLPSSVD----WKALGKVTSVKNQGHCGSCWSFSAAGAIESAYAIKTG- 144

Query: 290 TKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGN--YNSSQGCRPY 451
            +  +FS + LV C      GCNGG+P +A+ Y  + G++   +  Y + QG   Y
Sbjct: 145 -ELVNFSEQQLVDCSTE-NHGCNGGLPEIAFLYVINNGIMKLKDYPYTAKQGTCQY 198


>UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia
           tropicalis|Rep: Blo t 1 allergen - Blomia tropicalis
           (Mite)
          Length = 333

 Score = 59.7 bits (138), Expect = 5e-08
 Identities = 43/124 (34%), Positives = 55/124 (44%), Gaps = 10/124 (8%)
 Frame = +2

Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322
           +LP+NFD    W +   L  IR QGSCGSCWAF A         I     +    S ++L
Sbjct: 112 SLPQNFD----WRQKARLTRIRQQGSCGSCWAFAAAGVAESLYSIQKQ--QSIELSEQEL 165

Query: 323 VSC-------CPICGLGCNGGMPTLAWEYWKHVGLVSGGNY---NSSQGCRPYEIPPCEH 472
           V C          C  GC  G  T A++Y    GLV   NY     +Q C P ++    +
Sbjct: 166 VDCTYNRYDSSYQCN-GCGSGYSTEAFKYMIRTGLVEEENYPYNMRTQWCNP-DVEGQRY 223

Query: 473 HVPG 484
           HV G
Sbjct: 224 HVSG 227


>UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5;
           Theileria|Rep: Cysteine proteinase precursor - Theileria
           parva
          Length = 440

 Score = 59.7 bits (138), Expect = 5e-08
 Identities = 35/122 (28%), Positives = 54/122 (44%)
 Frame = +2

Query: 95  NILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVC 274
           N+ K      D +L     EN D    W    ++  ++DQ +CG CWAF  V ++     
Sbjct: 212 NLKKALNTDEDVDLAKLTGENLD----WRRSSSVTSVKDQSNCGGCWAFSTVGSVEG--Y 265

Query: 275 IYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYE 454
             S+  K +  S ++L+ C      GC GG+   A+EY +  GLVS  +       R   
Sbjct: 266 YMSHFDKSYELSVQELLDCDSFSN-GCQGGLLESAYEYVRKYGLVSAKDLPFVDKARRCS 324

Query: 455 IP 460
           +P
Sbjct: 325 VP 326


>UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1;
           Trypanosoma cruzi|Rep: Cysteine protease, putative -
           Trypanosoma cruzi
          Length = 434

 Score = 59.3 bits (137), Expect = 7e-08
 Identities = 34/83 (40%), Positives = 41/83 (49%), Gaps = 7/83 (8%)
 Frame = +2

Query: 176 WPEC--PTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSC---CPI 340
           W E   P L  ++DQGSCGSCWA  A E++     I S   K    S + + SC      
Sbjct: 131 WQEAKNPVLTPVKDQGSCGSCWAHAATESVESMYAISSG--KLLTLSTQQITSCVNNTRK 188

Query: 341 CG--LGCNGGMPTLAWEYWKHVG 403
           CG   GC GG   LAWEY  + G
Sbjct: 189 CGGSGGCGGGTAQLAWEYIMNTG 211


>UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_56,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 314

 Score = 59.3 bits (137), Expect = 7e-08
 Identities = 28/77 (36%), Positives = 40/77 (51%)
 Frame = +2

Query: 194 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPT 373
           +  +++QG+CGSCWAF AV A+   + I    +K    S + LV C      GCNGG   
Sbjct: 122 ITSVKNQGNCGSCWAFSAVGAVETLLTIKGVISKDLWLSEQQLVDCDKGTNNGCNGGFEN 181

Query: 374 LAWEYWKHVGLVSGGNY 424
           L  ++ K  GL +   Y
Sbjct: 182 LGIQWAKKNGLTTDKQY 198


>UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 280

 Score = 58.8 bits (136), Expect = 9e-08
 Identities = 31/98 (31%), Positives = 51/98 (52%), Gaps = 3/98 (3%)
 Frame = +2

Query: 140 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319
           ++LP+ FD    W     + ++++QG+CGSCWAF  +  + + + +  N T    +S ++
Sbjct: 66  SSLPQQFD----WRNLGKVTQVKNQGNCGSCWAF-TITGLFESINLIRNKTVEL-YSEQE 119

Query: 320 LVSCCP---ICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424
           L+ C         GC GG P LA+EY K  G+     Y
Sbjct: 120 LLDCSSNGIYRNSGCQGGWPHLAFEYSKKNGISLSSQY 157


>UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean
           endopeptidase) (Cysteine proteinase)
           (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
           Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor
           (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase)
           (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
           Vignain-2] - Vigna mungo (Rice bean) (Black gram)
          Length = 362

 Score = 58.8 bits (136), Expect = 9e-08
 Identities = 33/99 (33%), Positives = 52/99 (52%), Gaps = 1/99 (1%)
 Frame = +2

Query: 131 ELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFS 310
           E + ++P + D    W +   + +++DQG CGSCWAF  + A+     I +N  K    S
Sbjct: 123 EKVGSVPASVD----WRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTN--KLVSLS 176

Query: 311 AEDLVSCCPICGLGCNGGMPTLAWEYWKHV-GLVSGGNY 424
            ++LV C      GCNGG+   A+E+ K   G+ +  NY
Sbjct: 177 EQELVDCDKEENQGCNGGLMESAFEFIKQKGGITTESNY 215


>UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18;
           Magnoliophyta|Rep: Thiol protease aleurain precursor -
           Arabidopsis thaliana (Mouse-ear cress)
          Length = 358

 Score = 58.8 bits (136), Expect = 9e-08
 Identities = 33/89 (37%), Positives = 43/89 (48%), Gaps = 1/89 (1%)
 Frame = +2

Query: 140 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319
           A LPE  D    W E   ++ ++DQG CGSCW F    A+      +    K    S + 
Sbjct: 139 AALPETKD----WREDGIVSPVKDQGGCGSCWTFSTTGAL--EAAYHQAFGKGISLSEQQ 192

Query: 320 LVSCC-PICGLGCNGGMPTLAWEYWKHVG 403
           LV C       GCNGG+P+ A+EY K  G
Sbjct: 193 LVDCAGAFNNYGCNGGLPSQAFEYIKSNG 221


>UniRef50_Q23TW3 Cluster: Papain family cysteine protease containing
           protein; n=5; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 437

 Score = 58.4 bits (135), Expect = 1e-07
 Identities = 31/91 (34%), Positives = 46/91 (50%), Gaps = 2/91 (2%)
 Frame = +2

Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGS-CGSCWAFGAVEAMTDRVCIYSNATKHFHFSA 313
           ++ LP+  D    W E   + +++ QG  CGSCWAF AV A+     +     K   FS 
Sbjct: 202 LSQLPQYVD----WREKGVVTQVKSQGKDCGSCWAFAAVAALESHYAL-KTGKKPIQFSE 256

Query: 314 EDLVSCC-PICGLGCNGGMPTLAWEYWKHVG 403
           + LV C       GC+GG+P+  +EY  + G
Sbjct: 257 QQLVDCARKFDTKGCSGGLPSKGFEYLAYAG 287


>UniRef50_Q23FR0 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 394

 Score = 58.4 bits (135), Expect = 1e-07
 Identities = 32/80 (40%), Positives = 39/80 (48%), Gaps = 3/80 (3%)
 Frame = +2

Query: 194 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGL---GCNGG 364
           LN ++DQG CGSCW FGA   M     I +   K   FS + LV C    G    GCNGG
Sbjct: 196 LNPVKDQGQCGSCWTFGAAGVMESFNAITNGVLK--SFSEQQLVDCVHQAGFSSDGCNGG 253

Query: 365 MPTLAWEYWKHVGLVSGGNY 424
             +   EY    G+V+   Y
Sbjct: 254 FQSDGVEYAIKFGIVTEDKY 273


>UniRef50_Q18740 Cluster: Putative uncharacterized protein tag-329;
           n=2; Caenorhabditis|Rep: Putative uncharacterized
           protein tag-329 - Caenorhabditis elegans
          Length = 374

 Score = 58.4 bits (135), Expect = 1e-07
 Identities = 32/94 (34%), Positives = 44/94 (46%), Gaps = 1/94 (1%)
 Frame = +2

Query: 146 LPENFDPRDKWPECP-TLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322
           LP+ FD R+K       +  I+ Q SC  CW F A       + ++    K  + S +++
Sbjct: 140 LPKTFDLRNKKVGGHYIIGPIKTQDSCACCWGFAATAVAEAALTVHLK--KAMNLSEQEV 197

Query: 323 VSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424
             C P  G GCNGG P    EY K +GL  G  Y
Sbjct: 198 CDCAPKHGPGCNGGDPVDGLEYIKEMGLTGGKEY 231


>UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1;
           Uronema marinum|Rep: Cathepsin L-like cysteine protease
           - Uronema marinum
          Length = 333

 Score = 58.0 bits (134), Expect = 2e-07
 Identities = 33/83 (39%), Positives = 43/83 (51%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355
           W     +  +++QG CGSCWAF AV ++     I  N  K   FS + LVSC P    GC
Sbjct: 126 WVSKGAVQGVQNQGVCGSCWAFSAVCSLERLYKI--NTGKLLSFSEQQLVSCEP-KSYGC 182

Query: 356 NGGMPTLAWEYWKHVGLVSGGNY 424
           +GG P  A+ Y    GL S  +Y
Sbjct: 183 DGGWPEAAFAYSATHGLESSASY 205


>UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 323

 Score = 58.0 bits (134), Expect = 2e-07
 Identities = 34/107 (31%), Positives = 52/107 (48%), Gaps = 8/107 (7%)
 Frame = +2

Query: 116 VTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATK 295
           +++    +  +P +FD R  W +C  ++ +R+Q SCGSCWA      + DR+CI S+   
Sbjct: 36  ISYSQNELDTIPASFDVRTNWGDC--MSPVREQQSCGSCWAQVTSGILADRMCIESDKNI 93

Query: 296 HFHFSAEDLVSC---CPI-----CGLGCNGGMPTLAWEYWKHVGLVS 412
               S + L+ C   C       C  GC GG   LA     + G+VS
Sbjct: 94  KMLLSPQYLMDCDGSCVSDGVSGCNNGCKGGFVGLALTRLINEGIVS 140


>UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2;
           Platyhelminthes|Rep: Cathepsin L-like proteinase -
           Echinococcus multilocularis
          Length = 338

 Score = 58.0 bits (134), Expect = 2e-07
 Identities = 31/84 (36%), Positives = 40/84 (47%), Gaps = 1/84 (1%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG-LG 352
           W +   +  I+DQG CGSCWAF A  A+  +  +     K    S + LV C    G  G
Sbjct: 128 WRKKGLVTPIKDQGDCGSCWAFSATGALEGQ--LKRKTGKLISLSEQQLVDCSTYTGNEG 185

Query: 353 CNGGMPTLAWEYWKHVGLVSGGNY 424
           CNGG    A+ YW   G  S  +Y
Sbjct: 186 CNGGDMNDAFRYWMRNGAESESDY 209


>UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1;
           Brugia malayi|Rep: Cathepsin F-like cysteine proteinase
           - Brugia malayi (Filarial nematode worm)
          Length = 461

 Score = 57.6 bits (133), Expect = 2e-07
 Identities = 32/89 (35%), Positives = 44/89 (49%)
 Frame = +2

Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 316
           I NLP  FD    W     +  ++DQGSCGSCWAF     +     I +   K    S +
Sbjct: 245 IYNLPSKFD----WRTEGVVTPVKDQGSCGSCWAFSVTGNIESLWAIKTG--KLISLSEQ 298

Query: 317 DLVSCCPICGLGCNGGMPTLAWEYWKHVG 403
           +L+  C +   GCNGG+P  A+   K +G
Sbjct: 299 ELID-CDVIDKGCNGGLPINAFREIKRMG 326


>UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3;
           Theileria|Rep: Cysteine proteinase precursor - Theileria
           annulata
          Length = 441

 Score = 57.6 bits (133), Expect = 2e-07
 Identities = 27/72 (37%), Positives = 41/72 (56%), Gaps = 1/72 (1%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGS-CGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLG 352
           W     ++ I+DQG  CGSCWAF ++ ++     +Y N  K +  S ++LV+ C    +G
Sbjct: 233 WARTDAVSPIKDQGDHCGSCWAFSSIASVESLYRLYKN--KSYFLSEQELVN-CDKSSMG 289

Query: 353 CNGGMPTLAWEY 388
           C GG+P  A EY
Sbjct: 290 CAGGLPITALEY 301


>UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep:
           MGC107932 protein - Xenopus tropicalis (Western clawed
           frog) (Silurana tropicalis)
          Length = 333

 Score = 57.2 bits (132), Expect = 3e-07
 Identities = 32/88 (36%), Positives = 43/88 (48%), Gaps = 1/88 (1%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGS-CGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLG 352
           W +   +  +++QG+ CGSCWAF  V  M  R CI     +  + S + LV C  I   G
Sbjct: 121 WRKSNCVTPVKNQGTFCGSCWAFATVGVMESRYCI--RTKELLNLSEQQLVDCDEI-NEG 177

Query: 353 CNGGMPTLAWEYWKHVGLVSGGNYNSSQ 436
           C GG P  A EY    G++    Y  SQ
Sbjct: 178 CCGGFPIKALEYVAQHGVMRNKEYEYSQ 205


>UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted,
           possible transmembrane domain near N-terminus; n=4;
           Cryptosporidium|Rep: Cryptopain-cysteine proteinase
           secreted, possible transmembrane domain near N-terminus
           - Cryptosporidium parvum Iowa II
          Length = 401

 Score = 57.2 bits (132), Expect = 3e-07
 Identities = 32/82 (39%), Positives = 42/82 (51%), Gaps = 3/82 (3%)
 Frame = +2

Query: 152 ENFDPRDK--WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325
           E F P +   W E   +N IR+Q +CGSCWAF AV A+    C  +N       S +  V
Sbjct: 172 EEFVPPNSINWVEAGCVNPIRNQKNCGSCWAFSAVAALEGATCAQTNRGLP-SLSEQQFV 230

Query: 326 SCCPICG-LGCNGGMPTLAWEY 388
            C    G  GC+GG   LA++Y
Sbjct: 231 DCSKQNGNFGCDGGTMGLAFQY 252


>UniRef50_Q235G6 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 325

 Score = 57.2 bits (132), Expect = 3e-07
 Identities = 33/129 (25%), Positives = 56/129 (43%)
 Frame = +2

Query: 41  THTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGS 220
           THT FA + +      D+ I  L  + H+ +++ +          W E   +  +++QG 
Sbjct: 88  THTEFAELYLNPAENIDEEIDSLQPIQHNEDIVID----------WVEKGAVTPVKNQGG 137

Query: 221 CGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHV 400
           CG CW+F     +     +Y N     + S + L+  C     GC GG+  +A  Y K  
Sbjct: 138 CGGCWSFATTGGVEGANFVYKNVLP--NLSQQQLID-CNTQNKGCGGGLRDIALNYVKET 194

Query: 401 GLVSGGNYN 427
           GL +   Y+
Sbjct: 195 GLTTEEEYS 203


>UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_36,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 307

 Score = 57.2 bits (132), Expect = 3e-07
 Identities = 30/77 (38%), Positives = 40/77 (51%)
 Frame = +2

Query: 194 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPT 373
           +N I++QG+CGSCW F A+ A+   + I          S + LV C    G GCNGG   
Sbjct: 118 MNPIKNQGNCGSCWTFSAIGAVEGFLAIRKGFKG--VLSEQQLVDCAVDAGEGCNGGNSD 175

Query: 374 LAWEYWKHVGLVSGGNY 424
           LA +Y   VG V   +Y
Sbjct: 176 LALDYIAEVGSVYERDY 192


>UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_217_11853_10927 - Giardia lamblia
           ATCC 50803
          Length = 308

 Score = 56.8 bits (131), Expect = 4e-07
 Identities = 27/87 (31%), Positives = 49/87 (56%)
 Frame = +2

Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325
           +P++FD R+++P+C T  E+ D G C S WA+ AV+A + R C+     +   +SA+ ++
Sbjct: 75  VPDHFDFREEYPQCIT--EVIDIGLCSSSWAYSAVDAFSHRRCLTGLDQEATRYSAQYIL 132

Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVGL 406
           SC    G        ++AW++    G+
Sbjct: 133 SCSSTNGCFGFSTRESIAWDFIATTGI 159


>UniRef50_Q23FS6 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 330

 Score = 56.8 bits (131), Expect = 4e-07
 Identities = 31/97 (31%), Positives = 43/97 (44%), Gaps = 3/97 (3%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355
           W     ++ ++DQG CGSCWAF    ++   + I   A +    S + LV  C     GC
Sbjct: 123 WVTRGKVSAVKDQGQCGSCWAFSTTGSVESALIIAGYANQTIDLSEQQLVD-CSATNYGC 181

Query: 356 NGGMPTLAWEYWKHVGLVSGGNY---NSSQGCRPYEI 457
            GG    A+EY +   L +  NY      Q C   EI
Sbjct: 182 GGGWMDNAFEYIEESPLTTNSNYPYVAVDQACNSTEI 218


>UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes
           vastus|Rep: Cathepsin L - Aphrocallistes vastus
          Length = 329

 Score = 56.8 bits (131), Expect = 4e-07
 Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 1/85 (1%)
 Frame = +2

Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322
           +LP   D R K    P    +++QG CGSCW+F A  ++  +  I S   K   FS ++L
Sbjct: 114 DLPTTVDWRSKGVVTP----VKNQGQCGSCWSFSATGSLEGQYAIKSG--KLVSFSEQEL 167

Query: 323 VSCCPICG-LGCNGGMPTLAWEYWK 394
           V C    G  GC GG+   A++YW+
Sbjct: 168 VDCSTSLGNHGCQGGLMDYAFKYWE 192


>UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain -
           Tetrahymena pyriformis
          Length = 330

 Score = 56.8 bits (131), Expect = 4e-07
 Identities = 29/86 (33%), Positives = 38/86 (44%), Gaps = 3/86 (3%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGL-- 349
           W     L  +++Q  CGSCWAF     +     I+ +      FS + LV CC   G   
Sbjct: 126 WTAKNVLPPVKNQQQCGSCWAFSTAGMLEGVYNIHESPQTPISFSEQQLVDCCGAQGFGC 185

Query: 350 -GCNGGMPTLAWEYWKHVGLVSGGNY 424
            GCNG  PT A  Y +  G+V    Y
Sbjct: 186 EGCNGAWPTDAVAYTQKFGIVQESQY 211


>UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra
           erinaceieuropaei|Rep: Cysteine proteinase - Spirometra
           erinaceieuropaei (Tapeworm)
          Length = 336

 Score = 56.8 bits (131), Expect = 4e-07
 Identities = 32/111 (28%), Positives = 54/111 (48%), Gaps = 1/111 (0%)
 Frame = +2

Query: 107 LPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSN 286
           L K+     +   L EN      W E   +  +++QG CGSCW+F A  A+   + I + 
Sbjct: 104 LTKLRRKEAVSVPLKENLPDSVNWRERGAVTSVKNQGQCGSCWSFSANGAIEGAIQIKTG 163

Query: 287 ATKHFHFSAEDLVSCCPICG-LGCNGGMPTLAWEYWKHVGLVSGGNYNSSQ 436
           A +    S + L+ C    G  GCNGG+   A++Y +  G+ +  +Y  ++
Sbjct: 164 ALR--SLSEQQLMDCSWDYGNQGCNGGLMPQAFQYAQRYGVEAEVDYRYTE 212


>UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18;
           Plasmodium|Rep: Cysteine proteinase precursor -
           Plasmodium vivax (strain Salvador I)
          Length = 583

 Score = 56.8 bits (131), Expect = 4e-07
 Identities = 34/101 (33%), Positives = 54/101 (53%), Gaps = 2/101 (1%)
 Frame = +2

Query: 128 AELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKH--F 301
           A L+A++PE  D R+K      ++E +DQG CGSCWAF +V  +    C+Y+        
Sbjct: 333 ANLLADVPEILDYREKG----IVHEPKDQGLCGSCWAFASVGNVE---CMYAKEHNKTIL 385

Query: 302 HFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424
             S +++V C  +   GC+GG P  ++ Y    G+  G +Y
Sbjct: 386 TLSEQEVVDCSKL-NFGCDGGHPFYSFIYAIENGICMGDDY 425


>UniRef50_P25774 Cluster: Cathepsin S precursor; n=78;
           Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens
           (Human)
          Length = 331

 Score = 56.8 bits (131), Expect = 4e-07
 Identities = 34/83 (40%), Positives = 49/83 (59%), Gaps = 2/83 (2%)
 Frame = +2

Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325
           LP++ D R+K   C T  E++ QGSCG+CWAF AV A+  ++ + +   K    SA++LV
Sbjct: 115 LPDSVDWREKG--CVT--EVKYQGSCGACWAFSAVGALEAQLKLKTG--KLVSLSAQNLV 168

Query: 326 SCC--PICGLGCNGGMPTLAWEY 388
            C        GCNGG  T A++Y
Sbjct: 169 DCSTEKYGNKGCNGGFMTTAFQY 191


>UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin O
           precursor; n=2; Apocrita|Rep: PREDICTED: similar to
           Cathepsin O precursor - Apis mellifera
          Length = 374

 Score = 56.4 bits (130), Expect = 5e-07
 Identities = 28/74 (37%), Positives = 38/74 (51%)
 Frame = +2

Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322
           ++P  FD RDK    P    +R QGSCG+CWAF  +E +     I  N T H   S +++
Sbjct: 154 SIPLRFDWRDKGVITP----VRSQGSCGACWAFSTIEVIESMFAI-KNGTLH-SLSVQEM 207

Query: 323 VSCCPICGLGCNGG 364
           + C      GC GG
Sbjct: 208 IDCAKNSNFGCEGG 221


>UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2;
           Trypanosoma cruzi|Rep: Cysteine proteinase, putative -
           Trypanosoma cruzi
          Length = 392

 Score = 56.4 bits (130), Expect = 5e-07
 Identities = 39/100 (39%), Positives = 48/100 (48%), Gaps = 6/100 (6%)
 Frame = +2

Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH-FSAEDL 322
           +P+  D R+  P    L  ++DQG CGSCWA GA E M     I    T   H  S + L
Sbjct: 141 IPDEVDYRNSSPAI--LTAVKDQGRCGSCWAHGAAEEMESHFAI---LTGRLHVLSQQQL 195

Query: 323 VSCCP---ICG--LGCNGGMPTLAWEYWKHVGLVSGGNYN 427
            SC P    CG   GC G    LA+EY K  G+ S   Y+
Sbjct: 196 TSCAPNPKKCGGTGGCYGSTADLAYEYAKQ-GITSEWVYS 234


>UniRef50_Q24HI2 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 356

 Score = 56.4 bits (130), Expect = 5e-07
 Identities = 26/83 (31%), Positives = 44/83 (53%), Gaps = 1/83 (1%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLG 352
           W +   ++ ++DQ +CGSCW F    A+     I+ +  +    S + L+ C       G
Sbjct: 133 WKDLNKVSPVKDQQNCGSCWTFSTTGAIESHYAIFED-VEPTSLSEQQLIDCAGAFNNNG 191

Query: 353 CNGGMPTLAWEYWKHVGLVSGGN 421
           C+GG+P+ A+EY K+ G +S  N
Sbjct: 192 CSGGLPSQAFEYIKYNGGISYEN 214


>UniRef50_O16454 Cluster: Temporarily assigned gene name protein
           196; n=4; Bilateria|Rep: Temporarily assigned gene name
           protein 196 - Caenorhabditis elegans
          Length = 477

 Score = 56.4 bits (130), Expect = 5e-07
 Identities = 30/81 (37%), Positives = 46/81 (56%)
 Frame = +2

Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322
           +LPE+FD    W E   + ++++QG+CGSCWAF     +     I  N  K    S ++L
Sbjct: 263 DLPESFD----WREKGAVTQVKNQGNCGSCWAFSTTGNVEGAWFIAKN--KLVSLSEQEL 316

Query: 323 VSCCPICGLGCNGGMPTLAWE 385
           V C  +   GCNGG+P+ A++
Sbjct: 317 VDCDSM-DQGCNGGLPSNAYK 336


>UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 291

 Score = 56.4 bits (130), Expect = 5e-07
 Identities = 40/130 (30%), Positives = 58/130 (44%), Gaps = 1/130 (0%)
 Frame = +2

Query: 2   KKQNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWP 181
           K    +K   N  +H      + L+G   D N++   K       I + P   D R    
Sbjct: 32  KANANYKLSLNSLSHLTPTEYQSLLGTKIDKNLVSQGKKVRPQ--IKDSPGILDYR---- 85

Query: 182 ECPTLNEIRDQGSCGSCWAFGAVEAM-TDRVCIYSNATKHFHFSAEDLVSCCPICGLGCN 358
           E   +N IRDQ  CGSCWAFG V A  ++   +YSN  +    S ++++ C   C  GC 
Sbjct: 86  EMGVVNPIRDQKQCGSCWAFGTVAACESNYALLYSNLPQ---LSEQNIIDCATTC-YGCG 141

Query: 359 GGMPTLAWEY 388
           GG+   A  +
Sbjct: 142 GGIIQAAMSF 151


>UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor;
           n=6; Chrysomelinae|Rep: Cathepsin L-like proteinase
           precursor - Phaedon cochleariae (Mustard beetle)
          Length = 324

 Score = 56.4 bits (130), Expect = 5e-07
 Identities = 35/93 (37%), Positives = 44/93 (47%), Gaps = 1/93 (1%)
 Frame = +2

Query: 149 PENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVS 328
           PE+ D R K    P    +R+QG CGSCWA     A+  +  I S +      S + LV 
Sbjct: 111 PESIDWRSKGVVLP----VRNQGECGSCWALSTAAAIESQSAIKSGS--KVPLSPQQLVD 164

Query: 329 CCPICG-LGCNGGMPTLAWEYWKHVGLVSGGNY 424
           C    G  GCNGG     +EY K  GL S  +Y
Sbjct: 165 CSTSYGNHGCNGGFAVNGFEYVKDNGLESDADY 197


>UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F
            like protease; n=1; Nasonia vitripennis|Rep: PREDICTED:
            similar to cathepsin F like protease - Nasonia
            vitripennis
          Length = 1036

 Score = 56.0 bits (129), Expect = 6e-07
 Identities = 36/112 (32%), Positives = 54/112 (48%), Gaps = 1/112 (0%)
 Frame = +2

Query: 71   LMGALKDDNILKLPKVT-HDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGA 247
            L   LK +N + +P  T  D EL    P ++D    W     +  ++DQGSCGSCWAF  
Sbjct: 795  LKPTLKSENDIPMPMATIPDIEL----PSDYD----WRHHNVVTPVKDQGSCGSCWAFSV 846

Query: 248  VEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVG 403
               +  +  I     +    S ++LV C  +   GCNGG+P  A+   + +G
Sbjct: 847  TGNIEGQYAIKHG--ELLSLSEQELVDCDKL-DSGCNGGLPDTAYRAIEELG 895


>UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia
           intestinalis|Rep: GLP_41_8294_9919 - Giardia lamblia
           ATCC 50803
          Length = 541

 Score = 56.0 bits (129), Expect = 6e-07
 Identities = 33/97 (34%), Positives = 51/97 (52%), Gaps = 4/97 (4%)
 Frame = +2

Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH----FSA 313
           LP++FD RD       +  + DQG+CGSC+ FGAV+AM  R+ I +N T         S 
Sbjct: 241 LPDDFDWRDV-NGVSYIPGVLDQGACGSCFTFGAVQAMNSRIMIATNRTDPVGTKTILST 299

Query: 314 EDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424
           E  +  C +   GC+GG P     + +  G+++  +Y
Sbjct: 300 EHALD-CNVYSQGCDGGFPEHVLRFAETNGIMTEDDY 335


>UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor;
           n=176; Viridiplantae|Rep: Cysteine proteinase RD21a
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 462

 Score = 56.0 bits (129), Expect = 6e-07
 Identities = 32/105 (30%), Positives = 52/105 (49%)
 Frame = +2

Query: 74  MGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVE 253
           +GA  +    +   + ++A +   LPE+ D    W +   + E++DQG CGSCWAF  + 
Sbjct: 113 LGAKMEKKGERRTSLRYEARVGDELPESID----WRKKGAVAEVKDQGGCGSCWAFSTIG 168

Query: 254 AMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEY 388
           A+     I +        S ++LV C      GCNGG+   A+E+
Sbjct: 169 AVEGINQIVTGDL--ITLSEQELVDCDTSYNEGCNGGLMDYAFEF 211


>UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 336

 Score = 55.6 bits (128), Expect = 8e-07
 Identities = 34/105 (32%), Positives = 51/105 (48%), Gaps = 4/105 (3%)
 Frame = +2

Query: 122 HDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHF 301
           H A+ +  LP +FD    W +   L++++DQG CGSCWAF +   + + +    N  K  
Sbjct: 118 HTAQDV-QLPASFD----WRDYGILSDVKDQGQCGSCWAF-STTGILEALYFMENRQK-I 170

Query: 302 HFSAEDLVSCCP----ICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424
            FS + LV C          GC+GG P  A +Y    G++    Y
Sbjct: 171 SFSEQQLVDCATNSNGFNSYGCSGGWPEEALKYVAKFGILKEEQY 215


>UniRef50_Q231X3 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 323

 Score = 55.6 bits (128), Expect = 8e-07
 Identities = 26/84 (30%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG-LG 352
           W E   ++ ++ QG+CGSCWAF A  ++   + I     K    S + L+ C    G  G
Sbjct: 121 WVEAGKVSNVKSQGNCGSCWAFSATASVESALIIAGKVDKSISLSEQQLIDCSGDYGNYG 180

Query: 353 CNGGMPTLAWEYWKHVGLVSGGNY 424
           C  G    A  Y K   + +  NY
Sbjct: 181 CAAGQKEQALVYIKRYSITTEQNY 204


>UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia
           bovis|Rep: Cysteine protease 2 - Babesia bovis
          Length = 445

 Score = 55.6 bits (128), Expect = 8e-07
 Identities = 30/84 (35%), Positives = 41/84 (48%)
 Frame = +2

Query: 155 NFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC 334
           NF+  D W     +  ++DQG CGSCWAF AV ++     +          S ++LVS C
Sbjct: 236 NFEDID-WRRADAVTPVKDQGMCGSCWAFAAVGSVES---LLKRQKTDVRLSEQELVS-C 290

Query: 335 PICGLGCNGGMPTLAWEYWKHVGL 406
            +   GCNGG    A  Y K  G+
Sbjct: 291 QLGNQGCNGGYSDYALNYIKFNGI 314


>UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain;
           n=16; Chrysomelidae|Rep: Digestive cysteine protease
           intestain - Leptinotarsa decemlineata (Colorado potato
           beetle)
          Length = 326

 Score = 55.6 bits (128), Expect = 8e-07
 Identities = 41/130 (31%), Positives = 60/130 (46%), Gaps = 2/130 (1%)
 Frame = +2

Query: 41  THTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGS 220
           TH  F  I  L G +K+   L         +L   +P++ D    W E   + E++DQ  
Sbjct: 79  THEEFKDI--LKGQIKNKPRLNATPTVFPEDL--EVPDSID----WTEKGAVLEVKDQNP 130

Query: 221 CGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLG-C-NGGMPTLAWEYWK 394
           CGSCWAF A  A+  +  I +N       S + L+ C    G G C  GG  + A+EY +
Sbjct: 131 CGSCWAFSATGALEGQNAILNNV--KISLSEQQLLDCSAAYGNGNCKEGGDMSAAFEYVR 188

Query: 395 HVGLVSGGNY 424
             G+ S  +Y
Sbjct: 189 DYGIQSEKSY 198


>UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like
           cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin B-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 288

 Score = 55.6 bits (128), Expect = 8e-07
 Identities = 38/131 (29%), Positives = 61/131 (46%), Gaps = 2/131 (1%)
 Frame = +2

Query: 2   KKQNTWKAGRN--FPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDK 175
           +K   W AG N  F   T F    ++ G         +P +    ++  ++P +++  ++
Sbjct: 20  EKDLPWVAGENERFKGMT-FKDASVISGNAHKLRPDTIP-LARPPKINISIPMSYNFTER 77

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355
           +P+C     + DQG CGSCW+F   ++ + R C   N  K   FS   LV+ C     GC
Sbjct: 78  FPQCDF--GVLDQGKCGSCWSFAVSKSFSHRYCRKYN--KPVLFSQSHLVA-CDRRNSGC 132

Query: 356 NGGMPTLAWEY 388
            GG+   AW Y
Sbjct: 133 GGGIEVNAWRY 143


>UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, whole
           genome shotgun sequence; n=7; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_22,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 350

 Score = 55.6 bits (128), Expect = 8e-07
 Identities = 31/92 (33%), Positives = 43/92 (46%), Gaps = 2/92 (2%)
 Frame = +2

Query: 155 NFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC 334
           N  P D W     +N+++DQG CGSCWAF     +     + +        S + LV C 
Sbjct: 142 NATPID-WRTRGAVNKVKDQGQCGSCWAFSTTGVLEGFYKVQTGELP--DLSEQQLVDCS 198

Query: 335 PICGL--GCNGGMPTLAWEYWKHVGLVSGGNY 424
            +     GC+GGMP+ A  Y K  GL +   Y
Sbjct: 199 TLIDFNQGCDGGMPSRALNYVKRNGLTTQDAY 230


>UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;
           n=17; Magnoliophyta|Rep: Thiol protease aleurain-like
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 358

 Score = 55.6 bits (128), Expect = 8e-07
 Identities = 28/82 (34%), Positives = 42/82 (51%), Gaps = 1/82 (1%)
 Frame = +2

Query: 161 DPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-P 337
           D +D W E   ++ +++QG CGSCW F    A+      +    K    S + LV C   
Sbjct: 143 DTKD-WREDGIVSPVKEQGHCGSCWTFSTTGAL--EAAYHQAFGKGISLSEQQLVDCAGT 199

Query: 338 ICGLGCNGGMPTLAWEYWKHVG 403
               GC+GG+P+ A+EY K+ G
Sbjct: 200 FNNFGCHGGLPSQAFEYIKYNG 221


>UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;
           n=1; Strongylocentrotus purpuratus|Rep: PREDICTED:
           hypothetical protein - Strongylocentrotus purpuratus
          Length = 331

 Score = 55.2 bits (127), Expect = 1e-06
 Identities = 33/99 (33%), Positives = 48/99 (48%), Gaps = 3/99 (3%)
 Frame = +2

Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 316
           +  +P  +D R   P  P +  +++Q SCG+CWAF  VE M  ++ +     +    SA+
Sbjct: 124 LKTMPLVYDLRSIKP--PVVTPVKNQKSCGACWAFSVVETMETQIAL--KTKRLTQLSAQ 179

Query: 317 DLVSCCPICG-LGCNGGMP--TLAWEYWKHVGLVSGGNY 424
           +LV C    G  GC GG+P  TL W       LV    Y
Sbjct: 180 ELVDCGTAAGDGGCRGGIPCKTLDWLNRTKTSLVPESTY 218


>UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1;
           Bigelowiella natans|Rep: Digestive cysteine proteinase -
           Bigelowiella natans (Pedinomonas minutissima)
           (Chlorarachnion sp.(strain CCMP 621))
          Length = 360

 Score = 55.2 bits (127), Expect = 1e-06
 Identities = 33/86 (38%), Positives = 41/86 (47%), Gaps = 3/86 (3%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNAT--KHFHFSAEDLVSCCPICGL 349
           W +   L  ++DQG CGSCWAF A +A+     I  N T       S E LV  C     
Sbjct: 115 WRDFNALTPVKDQGGCGSCWAFSATQALESAHYIKHNDTLDSPIALSTEQLVE-CDQHDY 173

Query: 350 GCNGGMPTLAWEYWKHV-GLVSGGNY 424
            C GG P  A +Y K   GLV+  +Y
Sbjct: 174 ACYGGFPRDAMKYIKESGGLVAEADY 199


>UniRef50_Q23H32 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 365

 Score = 55.2 bits (127), Expect = 1e-06
 Identities = 34/97 (35%), Positives = 52/97 (53%), Gaps = 3/97 (3%)
 Frame = +2

Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKH-FHFSAED 319
           ++PE+ D R+K      +  ++ QG CGSCWAF  V A+      Y+  T +   FS ++
Sbjct: 134 SVPESVDWREK-----LVAPVQKQGGCGSCWAFSTVIALEG---AYAKQTGNVIKFSEQN 185

Query: 320 LVSCCPICGLGCNGGMPTLAWEYWKHV--GLVSGGNY 424
           L+ CC I   GCNGG P  A +   +V  G++   +Y
Sbjct: 186 LIDCCRIENNGCNGGDPEPALDCVMNVLKGIMKNQDY 222


>UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101,
           whole genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_101,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 306

 Score = 55.2 bits (127), Expect = 1e-06
 Identities = 37/104 (35%), Positives = 52/104 (50%), Gaps = 3/104 (2%)
 Frame = +2

Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322
           NLPE+ D   K      +N +++QG+CGS W+F AV A  +   I+   T HF +S ++L
Sbjct: 109 NLPESVDWSSK------MNPVKNQGTCGSGWSFSAVGAF-EAFFIFVKGT-HFQYSEQNL 160

Query: 323 VSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY---NSSQGCR 445
           V  C     GC+GG P  A +Y    G      Y    S + CR
Sbjct: 161 VD-CDTNSHGCDGGYPAKAIDYLNKNGAFLESEYPYVASKEKCR 203


>UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis
           zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa
           zea single nucleocapsid nuclear polyhedrosis virus)
          Length = 367

 Score = 55.2 bits (127), Expect = 1e-06
 Identities = 30/80 (37%), Positives = 44/80 (55%)
 Frame = +2

Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325
           LP+ +D    W +   +  I+DQG CGSCWAF A+  +  +  I  N  K    S + L+
Sbjct: 156 LPDYYD----WRDTNKVTPIKDQGVCGSCWAFVAIGNIESQYAIRHN--KLIDLSEQQLL 209

Query: 326 SCCPICGLGCNGGMPTLAWE 385
            C  +  LGCNGG+  LA++
Sbjct: 210 DCDEV-DLGCNGGLMHLAFQ 228


>UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16;
           Bromeliaceae|Rep: Fruit bromelain precursor - Ananas
           comosus (Pineapple)
          Length = 351

 Score = 55.2 bits (127), Expect = 1e-06
 Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 2/143 (1%)
 Frame = +2

Query: 2   KKQNTWKAGRN-FPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKW 178
           + +N++  G N F   T    +    G     NI + P V+ D   I+ +P++ D    W
Sbjct: 74  RNENSYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVSFDDVNISAVPQSID----W 129

Query: 179 PECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCN 358
            +   +NE+++Q  CGSCW+F A+  +     IY   T +    +E  V  C +   GC 
Sbjct: 130 RDYGAVNEVKNQNPCGSCWSFAAIATVEG---IYKIKTGYLVSLSEQEVLDCAV-SYGCK 185

Query: 359 GGMPTLAWEY-WKHVGLVSGGNY 424
           GG    A+++   + G+ +  NY
Sbjct: 186 GGWVNKAYDFIISNNGVTTEENY 208


>UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin
           L - Misgurnus mizolepis (Mud loach)
          Length = 337

 Score = 54.8 bits (126), Expect = 1e-06
 Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 1/74 (1%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLG 352
           W E   +  ++DQG CGSCWAF    AM  +  ++    K    S ++LV C  P    G
Sbjct: 122 WREKGYVTPVKDQGECGSCWAFSTTGAMEGQ--MFRKQGKLVSLSEQNLVDCSRPEGNEG 179

Query: 353 CNGGMPTLAWEYWK 394
           CNGG+   A++Y K
Sbjct: 180 CNGGLMDQAFQYIK 193


>UniRef50_Q6XM39 Cluster: FirrV-1-A48 precursor; n=1; Feldmannia
           irregularis virus a|Rep: FirrV-1-A48 precursor -
           Feldmannia irregularis virus a
          Length = 373

 Score = 54.8 bits (126), Expect = 1e-06
 Identities = 27/78 (34%), Positives = 42/78 (53%), Gaps = 2/78 (2%)
 Frame = +2

Query: 209 DQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP-ICGLGCN-GGMPTLAW 382
           DQGSC SCW+   V+ + DRV + +N       S ++++SC     GL C+ GG+P  A+
Sbjct: 80  DQGSCASCWSISVVQMLADRVSVSTNGKIKLKLSVQEMISCWDGHDGLACSKGGVPEKAY 139

Query: 383 EYWKHVGLVSGGNYNSSQ 436
           +Y    G+    +Y   Q
Sbjct: 140 QYIIENGIGLAEDYPYEQ 157


>UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC06231 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 372

 Score = 54.8 bits (126), Expect = 1e-06
 Identities = 38/127 (29%), Positives = 58/127 (45%), Gaps = 2/127 (1%)
 Frame = +2

Query: 14  TWKAG-RNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECP 190
           T+K G  NF   T +  ++ L G      I K    T  +   A LP+  D    W    
Sbjct: 106 TYKMGVNNFTDKTEY-ELRKLRGYRSACRIAKPKGSTFISSEHAKLPDRVD----WRRNG 160

Query: 191 TLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG-LGCNGGM 367
            +  +++QG CGSCWAF +  A+  +   Y    +  + S + L+ C    G  GC GG+
Sbjct: 161 AVTPVKNQGQCGSCWAFSSTGAIEGQ--HYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGL 218

Query: 368 PTLAWEY 388
             LA++Y
Sbjct: 219 MDLAFQY 225


>UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor;
           n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 355

 Score = 54.8 bits (126), Expect = 1e-06
 Identities = 30/84 (35%), Positives = 43/84 (51%)
 Frame = +2

Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 316
           I +LP++ D R K    P    ++DQG CGSCWAF  V A+     I +        S +
Sbjct: 134 ITDLPKSVDWRKKGAVAP----VKDQGQCGSCWAFSTVAAVEGINQITTGNLS--SLSEQ 187

Query: 317 DLVSCCPICGLGCNGGMPTLAWEY 388
           +L+ C      GCNGG+   A++Y
Sbjct: 188 ELIDCDTTFNSGCNGGLMDYAFQY 211


>UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep:
           Cysteine protease - Saprolegnia parasitica
          Length = 523

 Score = 54.4 bits (125), Expect = 2e-06
 Identities = 35/113 (30%), Positives = 54/113 (47%), Gaps = 9/113 (7%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355
           W E   +  +++QG CGSCWAF    A+     + S   +    S ++LV C     +GC
Sbjct: 122 WVEQGGVTPVKNQGMCGSCWAFSTTGAIEGAAFVSSK--QLVSVSEQELVDCDHNGDMGC 179

Query: 356 NGGMPTLAWEYWK-HVGLVSGGN--YNSSQG------CRPYEIPPCEHHVPGN 487
           NGG+   A+++ K H GL    +  Y++ +G      C+P       H VP N
Sbjct: 180 NGGLMDNAFKWVKTHKGLCKEEDYPYHAKEGTCALKKCKPVTKVTAFHDVPAN 232


>UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella
           histriomuscorum|Rep: Cathepsin H - Oxytricha trifallax
           (Sterkiella histriomuscorum)
          Length = 366

 Score = 54.0 bits (124), Expect = 2e-06
 Identities = 29/89 (32%), Positives = 45/89 (50%), Gaps = 1/89 (1%)
 Frame = +2

Query: 140 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319
           AN+P  +D    W     ++ +++QG CGSCW F  V  +     +   A +  + S + 
Sbjct: 133 ANIPTEWD----WRTFGVVSPVKNQGKCGSCWTFSTVGCVESHYLLKYGAFR--NLSEQQ 186

Query: 320 LVSCC-PICGLGCNGGMPTLAWEYWKHVG 403
           LV C       GC+GG+P+ A+EY K  G
Sbjct: 187 LVDCAGDYDNHGCSGGLPSHAFEYIKDNG 215


>UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46;
           Eukaryota|Rep: Cysteine proteinase 1 precursor - Zea
           mays (Maize)
          Length = 371

 Score = 54.0 bits (124), Expect = 2e-06
 Identities = 41/135 (30%), Positives = 61/135 (45%), Gaps = 12/135 (8%)
 Frame = +2

Query: 35  FPTHTPFAHIKILMGALKDDNIL--KLPKVTHDAELIAN--LPENFDPRDKWPECPTLNE 202
           F   TP    +  +G  K    L  +L +  H+A ++    LP++FD    W +   +  
Sbjct: 96  FSDLTPAEFRRTYLGLRKSRRALLRELGESAHEAPVLPTDGLPDDFD----WRDHGAVGP 151

Query: 203 IRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSC---C-----PICGLGCN 358
           +++QGSCGSCW+F A  A+      Y    K    S +  V C   C       C  GCN
Sbjct: 152 VKNQGSCGSCWSFSASGALEG--AHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCN 209

Query: 359 GGMPTLAWEYWKHVG 403
           GG+ T A+ Y +  G
Sbjct: 210 GGLMTTAFSYLQKAG 224


>UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6;
           Schistosoma|Rep: Cathepsin C precursor - Schistosoma
           mansoni (Blood fluke)
          Length = 454

 Score = 54.0 bits (124), Expect = 2e-06
 Identities = 36/97 (37%), Positives = 49/97 (50%), Gaps = 6/97 (6%)
 Frame = +2

Query: 134 LIANLPENFDPRDKWPECPT-----LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKH 298
           L  NLP  FD    W   P      +  IR+QG CGSC+A  +  A+  R+ + SN ++ 
Sbjct: 214 LTGNLPLEFD----WTSPPDGSRSPVTPIRNQGICGSCYASPSAAALEARIRLVSNFSEQ 269

Query: 299 FHFSAEDLVSCCPICGLGCNGGMPTL-AWEYWKHVGL 406
              S + +V C P    GCNGG P L A +Y +  GL
Sbjct: 270 PILSPQTVVDCSPY-SEGCNGGFPFLIAGKYGEDFGL 305


>UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Rep:
           Cysteine protease - Solanum lycopersicum (Tomato)
           (Lycopersicon esculentum)
          Length = 345

 Score = 53.6 bits (123), Expect = 3e-06
 Identities = 30/95 (31%), Positives = 51/95 (53%), Gaps = 2/95 (2%)
 Frame = +2

Query: 110 PKVTHDAELIANLPENFDPRD-KWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSN 286
           P  + + + I +L +++ P +  W E   + +++ QG CG CWAF AV ++      Y  
Sbjct: 114 PMSSTEFKKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEG---AYKI 170

Query: 287 ATKH-FHFSAEDLVSCCPICGLGCNGGMPTLAWEY 388
           AT +   FS ++L+  C     GCNGG  T A+++
Sbjct: 171 ATGNLMEFSEQELLD-CTTNNYGCNGGFMTNAFDF 204


>UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcoptes
           scabiei type hominis|Rep: Sar s 1 allergen Yv6030H07 -
           Sarcoptes scabiei type hominis
          Length = 322

 Score = 53.6 bits (123), Expect = 3e-06
 Identities = 30/89 (33%), Positives = 43/89 (48%), Gaps = 4/89 (4%)
 Frame = +2

Query: 194 LNEIRDQGSCGSCWAFGAV-EAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMP 370
           L  IR+QG+CGSCWAF  +  A ++ +         +  S + LV C      GC+G  P
Sbjct: 117 LTPIREQGACGSCWAFSTICTAESNYLTTRQAPLNKWTLSEQQLVDCA--SPKGCDGEKP 174

Query: 371 TLAWEYWKHVGLVSGGNY---NSSQGCRP 448
           T  ++Y    G+ +G  Y      Q CRP
Sbjct: 175 TTGFKYLLEKGVTTGDRYPYVGKVQPCRP 203


>UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas
           foetus|Rep: TFCP2 protein - Tritrichomonas foetus
           (Trichomonas foetus)
          Length = 270

 Score = 53.6 bits (123), Expect = 3e-06
 Identities = 32/91 (35%), Positives = 43/91 (47%), Gaps = 2/91 (2%)
 Frame = +2

Query: 122 HDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHF 301
           H+     + P +FD    W     +N I++QGSCGSCWAF A+ A     C      +  
Sbjct: 42  HERIQYKDTPTSFD----WRSEGKVNPIKNQGSCGSCWAFSAIAAQES--CHAIATGELL 95

Query: 302 HFSAEDLVSC--CPICGLGCNGGMPTLAWEY 388
            FS + LV C        GC+GG P  A +Y
Sbjct: 96  RFSEQSLVDCVTSDYSCQGCSGGWPDQAMKY 126


>UniRef50_Q23H10 Cluster: Papain family cysteine protease containing
           protein; n=14; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 336

 Score = 53.6 bits (123), Expect = 3e-06
 Identities = 45/140 (32%), Positives = 63/140 (45%), Gaps = 12/140 (8%)
 Frame = +2

Query: 41  THTPFAHIKILMGALKDDNILK--LPKVTH------DAELIANLPENFDPRDKWPECPTL 196
           T   FA  KILM +   D+++K    + TH      + +L +N     D  D W     +
Sbjct: 82  TKEEFAE-KILMKSDLVDHLMKGISQEATHNDTNNNETQLSSNSLTLADSID-WRTKGAV 139

Query: 197 NEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSC-CPICG---LGCNGG 364
             +++QG CGSCW+F A   M     I + A     FS + LV C  P  G    GCNGG
Sbjct: 140 TSVKNQGGCGSCWSFSAAAVMESFNFIQNKAL--VDFSEQQLVDCVIPANGYNSYGCNGG 197

Query: 365 MPTLAWEYWKHVGLVSGGNY 424
            P    +Y   VG+ +   Y
Sbjct: 198 WPVQCLDYASKVGITTLDKY 217


>UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=17; Trichomonas vaginalis|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 318

 Score = 53.6 bits (123), Expect = 3e-06
 Identities = 24/71 (33%), Positives = 37/71 (52%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355
           W     +N I+DQ  CGSCWAF  V+A   +  +     +    + +++V C   C  GC
Sbjct: 106 WRNAKIVNPIKDQAQCGSCWAFSVVQAQESQWALKKG--QLLSLAEQNMVDCVDTC-YGC 162

Query: 356 NGGMPTLAWEY 388
           +GG   LA++Y
Sbjct: 163 DGGDEYLAYDY 173


>UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_23,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 321

 Score = 53.6 bits (123), Expect = 3e-06
 Identities = 36/110 (32%), Positives = 49/110 (44%), Gaps = 6/110 (5%)
 Frame = +2

Query: 77  GALKDDNILKLPKVTHDAELIANLPENFDP-----RDKWPECPTLNEIRDQGSCGSCWAF 241
           G L D   L +         + N+ +N +P        W +   +  I+DQG CGSCWAF
Sbjct: 87  GDLTDQEFLTIYLNLQMPARVKNIQKNEEPFLVQEEVDWVQKGKVPAIKDQGDCGSCWAF 146

Query: 242 GAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAWEY 388
            AV A+     I  N  +    S +DLV C  P    GC+GG    A +Y
Sbjct: 147 SAVGALEINTKIQFN--EIVDLSEQDLVDCAGPYGNAGCDGGWMESALDY 194


>UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1;
           Methanospirillum hungatei JF-1|Rep: Peptidase C1A,
           papain precursor - Methanospirillum hungatei (strain
           JF-1 / DSM 864)
          Length = 1096

 Score = 53.6 bits (123), Expect = 3e-06
 Identities = 44/132 (33%), Positives = 64/132 (48%), Gaps = 1/132 (0%)
 Frame = +2

Query: 59  HIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWA 238
           H+K L   LK   I+    +T     +  LP +FD R+   +  T   I++QGSCGSCWA
Sbjct: 296 HLKGLRHDLKSSTIVSGAGITP----MEGLPTSFDWRNNGGDYTT--PIKNQGSCGSCWA 349

Query: 239 FGAVEAMTDRVCIYS-NATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSG 415
           F    A      I S N   +  ++ + LV+C      GCNGG+ T A  Y+ +   +SG
Sbjct: 350 FATTGAFESYKEIKSGNPGMNPDYAEQYLVNCAG-DQRGCNGGLFT-AMAYFVNKAGLSG 407

Query: 416 GNYNSSQGCRPY 451
           G    ++   PY
Sbjct: 408 GVGTVTEANYPY 419


>UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16;
           Plasmodium (Vinckeia)|Rep: Cysteine proteinase precursor
           - Plasmodium vinckei
          Length = 506

 Score = 53.6 bits (123), Expect = 3e-06
 Identities = 37/120 (30%), Positives = 58/120 (48%), Gaps = 6/120 (5%)
 Frame = +2

Query: 83  LKDDNILKLPKVTHDAELIA------NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFG 244
           LK   I+ L K   +  LI+      + P++ D R K+   P     +DQG+CGSCWAF 
Sbjct: 236 LKSKYIVPLKKHLANTNLISVDNKSKDFPDSRDYRSKFNFLPP----KDQGNCGSCWAFA 291

Query: 245 AVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424
           A+    + + +++       FS + +V C      GC+GG P  A+ Y  + G+  G  Y
Sbjct: 292 AI-GNFEYLYVHTRHEMPISFSEQQMVDCSTE-NYGCDGGNPFYAFLYMINNGVCLGDEY 349


>UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42;
           Coelomata|Rep: Cathepsin L2 precursor - Homo sapiens
           (Human)
          Length = 334

 Score = 53.6 bits (123), Expect = 3e-06
 Identities = 37/112 (33%), Positives = 55/112 (49%), Gaps = 1/112 (0%)
 Frame = +2

Query: 71  LMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAV 250
           +MG  ++    K  KV  +  L  +LP++ D R K    P    +++Q  CGSCWAF A 
Sbjct: 91  MMGCFRNQKFRK-GKVFREP-LFLDLPKSVDWRKKGYVTP----VKNQKQCGSCWAFSAT 144

Query: 251 EAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAWEYWKHVG 403
            A+  +  ++    K    S ++LV C  P    GCNGG    A++Y K  G
Sbjct: 145 GALEGQ--MFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENG 194


>UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 360

 Score = 53.2 bits (122), Expect = 4e-06
 Identities = 34/98 (34%), Positives = 46/98 (46%)
 Frame = +2

Query: 89  DDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDR 268
           DDN  K P +  D     NLP +FD RDK    P    ++ Q  CG CWAF  V+++   
Sbjct: 117 DDNKNKQPHLPTD-----NLPASFDWRDKGAITP----VKVQNGCGGCWAFSTVQSIEG- 166

Query: 269 VCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAW 382
              +    K    S + ++ CC I   GC GG P  A+
Sbjct: 167 -LYFLKTGKLESLSTQQVIDCCRIDESGCLGGDPEPAF 203


>UniRef50_Q22W19 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 332

 Score = 53.2 bits (122), Expect = 4e-06
 Identities = 29/99 (29%), Positives = 47/99 (47%)
 Frame = +2

Query: 128 AELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHF 307
           + LI +L  +  P   W +   +  +++QG CGSCWAF  V  +      Y+ AT +   
Sbjct: 113 SHLIYSLKGDVAPSIDWRQKNAVTPVKNQGQCGSCWAFSTVGGLEG---AYAIATGNLTS 169

Query: 308 SAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424
            +E  +  C     GCNGG    A++Y    G+ +  +Y
Sbjct: 170 FSEQQIVDCSKANAGCNGGDLPPAYKYVVQNGIETEADY 208


>UniRef50_Q22DX2 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 358

 Score = 53.2 bits (122), Expect = 4e-06
 Identities = 32/94 (34%), Positives = 46/94 (48%)
 Frame = +2

Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322
           ++P ++D R   P    L  + +QG CGSCWAF    A+        N T   + S + L
Sbjct: 146 SIPSSWDIRTDGPGL--LQPVENQGQCGSCWAFSTSGAVESYYSAKKNIT--LNLSKQQL 201

Query: 323 VSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424
           V C    G GC+GG    A++Y + VG+V    Y
Sbjct: 202 VDCVYDHG-GCDGGWFNDAFKYIQSVGIVLNATY 234


>UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:
           Aca s 1 allergen - Acarus siro (Dust mite)
          Length = 331

 Score = 53.2 bits (122), Expect = 4e-06
 Identities = 35/100 (35%), Positives = 45/100 (45%), Gaps = 6/100 (6%)
 Frame = +2

Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322
           NLPE FD R K      L  I +QG CG+CWAF ++  +     I  N   H   S ++L
Sbjct: 108 NLPETFDWRSK------LGPIENQGRCGACWAFASLATVEAAFAIKYNT--HIRLSKQEL 159

Query: 323 VSC------CPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424
           V C       P    GC GG    A +Y +  G+V    Y
Sbjct: 160 VECTRESDHTPYENSGCQGGYSWEALKYVQVTGVVEEAAY 199


>UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 350

 Score = 52.8 bits (121), Expect = 6e-06
 Identities = 48/165 (29%), Positives = 68/165 (41%), Gaps = 15/165 (9%)
 Frame = +2

Query: 11  NTWKAGRN-FP--THTPFAHIKILMGALK-----DDNILKLPKVTHDAELIANLPENFDP 166
           NT+K   N F   T   FAH ++L   LK          + P++    +   N  + FD 
Sbjct: 87  NTYKLQHNQFSDMTKDEFAH-RVLNSQLKTSASSSSQPAQTPQLRGSVDASLNASQGFDW 145

Query: 167 RDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP--- 337
           R+       L  +++QG CGSCW F     +     +     +   FS +D+V C     
Sbjct: 146 RNYQG---VLGNVKNQGQCGSCWTFATAGVLESYYAL--KYQQSLIFSEQDIVDCASRSY 200

Query: 338 -ICGLGCNGGMPTLAWEYWKHVGLVSGGNYN--SSQG-CRPYEIP 460
                GCNGG P+   +Y   VGLV    Y   + QG CR    P
Sbjct: 201 GYQSDGCNGGFPSEGLQYASTVGLVQSDYYPYVAVQGTCRQVNAP 245


>UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2;
           Taeniidae|Rep: Cathepsin L-like cysteine proteinase -
           Taenia solium (Pork tapeworm)
          Length = 339

 Score = 52.8 bits (121), Expect = 6e-06
 Identities = 31/84 (36%), Positives = 43/84 (51%), Gaps = 1/84 (1%)
 Frame = +2

Query: 140 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319
           A LP+  D RDK      + E+++QG+CGSCWAF +  A+           K    S + 
Sbjct: 122 AGLPDTVDWRDK----NLVTEVKNQGNCGSCWAFSSTGALEG--AFAKKTGKLISLSEQQ 175

Query: 320 LVSCCPICGL-GCNGGMPTLAWEY 388
           LV C    G  GCNGG  + A++Y
Sbjct: 176 LVDCSLKNGNDGCNGGYMSYAFKY 199


>UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 395

 Score = 52.8 bits (121), Expect = 6e-06
 Identities = 24/76 (31%), Positives = 41/76 (53%), Gaps = 2/76 (2%)
 Frame = +2

Query: 203 IRDQGSCGSCWAFGAVEAMTDRVCIYSNATKH--FHFSAEDLVSCCPICGLGCNGGMPTL 376
           +RDQG C SCW FG++ A+  R  I +  ++    H SA++ ++C      GC  G P  
Sbjct: 201 VRDQGECKSCWVFGSLAALESRYLIKNGVSEKSTLHLSAQNAMNCIT---SGCESGWPAN 257

Query: 377 AWEYWKHVGLVSGGNY 424
            ++Y++  G+    +Y
Sbjct: 258 VFDYFESSGIAFEKDY 273


>UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5;
           Piroplasmida|Rep: Cysteine proteinase, putative -
           Theileria parva
          Length = 460

 Score = 52.8 bits (121), Expect = 6e-06
 Identities = 36/120 (30%), Positives = 59/120 (49%), Gaps = 3/120 (2%)
 Frame = +2

Query: 56  AHIKILMGAL-KDDNILKLPKVTHDAELIANLPENFDPRD-KWPECPTLNEIRDQG-SCG 226
           +H+  LM  +  D+  LK  K   + +   + P+N       W +   +++I++QG  CG
Sbjct: 214 SHVDRLMARMVSDETYLKNLKKALNTDKDVD-PKNITGEGLDWRKADGVSKIKNQGLECG 272

Query: 227 SCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGL 406
           SCWAF +V ++     IY N T     S ++LV  C     GC GG    A +Y ++ G+
Sbjct: 273 SCWAFASVSSVESLYKIYRNVT--LDLSEQELVD-CETSSKGCEGGFGDTALKYIQNKGV 329


>UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease;
           n=23; Magnoliophyta|Rep: Senescence-specific cysteine
           protease - Arabidopsis thaliana (Mouse-ear cress)
          Length = 346

 Score = 52.4 bits (120), Expect = 8e-06
 Identities = 31/84 (36%), Positives = 41/84 (48%), Gaps = 1/84 (1%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355
           W +   +  I++QGSCG CWAF AV A+     I     K    S + LV  C     GC
Sbjct: 136 WRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKG--KLISLSEQQLVD-CDTNDFGC 192

Query: 356 NGGMPTLAWEYWKHV-GLVSGGNY 424
            GG+   A+E+ K   GL +  NY
Sbjct: 193 EGGLMDTAFEHIKATGGLTTESNY 216


>UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3;
           Eukaryota|Rep: Cathepsin-like cysteine protease -
           Phytophthora infestans (Potato late blight fungus)
          Length = 635

 Score = 52.4 bits (120), Expect = 8e-06
 Identities = 34/97 (35%), Positives = 52/97 (53%), Gaps = 4/97 (4%)
 Frame = +2

Query: 122 HDAELIANLPENFDPRD-KWPECPTLNEIRDQGS-CGSCWAFGAVEAMTDRVCIYSNAT- 292
           H+   + +LP+++D RD       T ++ +     CGSCWA G   A++DR+ I  NA+ 
Sbjct: 354 HETMDVTDLPKSWDWRDVNGKNYVTWDKNQHIPKYCGSCWAQGTTSALSDRISILRNASW 413

Query: 293 KHFHFSAEDLVSCCPICGLGCNGGMPTLAWEY-WKHV 400
                S + L++C    G  CNGG P L +EY  +HV
Sbjct: 414 PEIALSPQVLINC--HAGGTCNGGNPGLVYEYAHRHV 448



 Score = 41.1 bits (92), Expect = 0.019
 Identities = 33/111 (29%), Positives = 49/111 (44%), Gaps = 12/111 (10%)
 Frame = +2

Query: 122 HDAELIANLPENFDPRDKWPECPTLNEIRDQGS---CGSCWAFGAVEAMTDRVCIYSNAT 292
           HD   ++ LP+NFD R+       ++  R+Q     CGSCW+F A  A+ DR+ I+    
Sbjct: 48  HDYIDVSKLPKNFDWRNV-NGTRYVSISRNQHIPHYCGSCWSFAATSALADRILIFKERN 106

Query: 293 KHFHFSAE---------DLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGG 418
                S E          ++  C     GC+GG    A+ Y K  G+   G
Sbjct: 107 PGNKPSVEVHRGVVLSPQVILNCDKKDNGCHGGDQLEAYRYIKEHGVPEEG 157


>UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4;
           core eudicotyledons|Rep: Papain-like cysteine peptidase
           XBCP3 - Arabidopsis thaliana (Mouse-ear cress)
          Length = 437

 Score = 52.4 bits (120), Expect = 8e-06
 Identities = 24/71 (33%), Positives = 37/71 (52%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355
           W +   +  ++DQGSCG+CW+F A  AM     I +        S ++L+ C      GC
Sbjct: 124 WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDL--ISLSEQELIDCDKSYNAGC 181

Query: 356 NGGMPTLAWEY 388
           NGG+   A+E+
Sbjct: 182 NGGLMDYAFEF 192


>UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1;
           Rhipicephalus appendiculatus|Rep: Midgut cysteine
           proteinase 2 - Rhipicephalus appendiculatus (Brown ear
           tick)
          Length = 564

 Score = 52.4 bits (120), Expect = 8e-06
 Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 3/129 (2%)
 Frame = +2

Query: 62  IKILMGAL--KDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCW 235
           I +L G L  KD +    P   H     A LP+  D    W     +  ++DQ  CGSCW
Sbjct: 317 ISVLRGRLQSKDGSSRAEPFPRH--RFTAKLPDQID----WRPYGAVTPVKDQAVCGSCW 370

Query: 236 AFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG-LGCNGGMPTLAWEYWKHVGLVS 412
           +FG V  +      +    +    S + LV C    G  GC+GG    A+EY    GL S
Sbjct: 371 SFGTVGELEG--AYFRKTGRLVRLSEQQLVDCSWNNGNNGCDGGEDFRAYEYIADHGLAS 428

Query: 413 GGNYNSSQG 439
             +Y +  G
Sbjct: 429 DEDYGAYIG 437


>UniRef50_Q54VR1 Cluster: Putative uncharacterized protein; n=1;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 291

 Score = 52.4 bits (120), Expect = 8e-06
 Identities = 35/117 (29%), Positives = 48/117 (41%)
 Frame = +2

Query: 221 CGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHV 400
           CGSCWA G   A+ DR+ I    T      A  ++  C      C+GG PT A+ Y    
Sbjct: 76  CGSCWAHGTTSALGDRIKIGRKGTFPEVVLAPQVLLNCAGPDNTCDGGDPTEAYAYMAAK 135

Query: 401 GLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNCESSYNVPFKKE 571
           G+       + + C PYE    E +  G    CN D   P      + +Y   F +E
Sbjct: 136 GI-------TDETCAPYEAIDNECNAEGICKNCNFDLSNPTADCFAQPTYTTYFVEE 185


>UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15)
           (Major excreted protein) (MEP) [Contains: Cathepsin L
           heavy chain; Cathepsin L light chain]; n=19;
           Eukaryota|Rep: Cathepsin L precursor (EC 3.4.22.15)
           (Major excreted protein) (MEP) [Contains: Cathepsin L
           heavy chain; Cathepsin L light chain] - Homo sapiens
           (Human)
          Length = 333

 Score = 52.4 bits (120), Expect = 8e-06
 Identities = 25/72 (34%), Positives = 39/72 (54%), Gaps = 1/72 (1%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLG 352
           W E   +  +++QG CGSCWAF A  A+  +  ++    +    S ++LV C  P    G
Sbjct: 120 WREKGYVTPVKNQGQCGSCWAFSATGALEGQ--MFRKTGRLISLSEQNLVDCSGPQGNEG 177

Query: 353 CNGGMPTLAWEY 388
           CNGG+   A++Y
Sbjct: 178 CNGGLMDYAFQY 189


>UniRef50_P81494 Cluster: Cathepsin B; n=2; Phasianidae|Rep:
           Cathepsin B - Coturnix coturnix japonica (Japanese
           quail)
          Length = 48

 Score = 52.4 bits (120), Expect = 8e-06
 Identities = 32/72 (44%), Positives = 39/72 (54%), Gaps = 1/72 (1%)
 Frame = +2

Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325
           LP+ FD R +WP CPT++EIRDQGS        +VE                  SAEDL+
Sbjct: 1   LPDTFDSRKQWPNCPTISEIRDQGSV-------SVEV-----------------SAEDLL 36

Query: 326 SCCPI-CGLGCN 358
           SCC   CG+GCN
Sbjct: 37  SCCGFECGMGCN 48


>UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin W,
           partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED:
           similar to Cathepsin W, partial - Ornithorhynchus
           anatinus
          Length = 229

 Score = 52.0 bits (119), Expect = 1e-05
 Identities = 36/110 (32%), Positives = 50/110 (45%), Gaps = 3/110 (2%)
 Frame = +2

Query: 128 AELIANLPENFDPRDK--WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHF 301
           A  +A++PE    ++   W +   +  +++QGSCGSCWAF AV         Y  A K  
Sbjct: 56  ANQMASIPEGPLRKETCDWRKRGAITSVKNQGSCGSCWAFAAVG--NAESMWYLRAGKRL 113

Query: 302 HFSAEDLVSCCPICGLGCNGGMPTLAW-EYWKHVGLVSGGNYNSSQGCRP 448
              +   V  C  C  GC GG P  A+   W + GL S  +Y      RP
Sbjct: 114 VSLSVQEVLDCGRCRDGCQGGYPEDAFVTMWFNRGLASEKDYPYKVRARP 163


>UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar to
           human SRY (sex determining region Y)-box 30
           (SOX30), transcript variant 1,; n=3; Mammalia|Rep: Testis
           cDNA clone: QtsA-12228, similar to human SRY (sex
           determining region Y)-box 30 (SOX30), transcript variant
           1, - Macaca fascicularis (Crab eating macaque)
           (Cynomolgus monkey)
          Length = 433

 Score = 52.0 bits (119), Expect = 1e-05
 Identities = 36/112 (32%), Positives = 55/112 (49%), Gaps = 1/112 (0%)
 Frame = +2

Query: 71  LMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAV 250
           +MG  ++  + K  K+  +  L  +LP++ D R K    P    +++Q  CGSCWAF A 
Sbjct: 91  VMGCFRNQKLRK-GKLFREP-LFLDLPKSVDWRKKGYVTP----VKNQKQCGSCWAFSAT 144

Query: 251 EAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAWEYWKHVG 403
            A+  +  ++    K    S ++LV C  P    GCNGG    A+ Y K  G
Sbjct: 145 GALEGQ--MFRKTGKLVSLSEQNLVDCSHPQGNQGCNGGFMNSAFRYVKENG 194


>UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep:
           Cathepsin - Geodia cydonium (Sponge)
          Length = 322

 Score = 52.0 bits (119), Expect = 1e-05
 Identities = 36/101 (35%), Positives = 55/101 (54%), Gaps = 3/101 (2%)
 Frame = +2

Query: 131 ELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNAT-KHFHF 307
           E ++ LP   D R K      +  +++QG CGSCWAF A  ++  +   + NAT K    
Sbjct: 98  EDVSALPTTVDWRTKG----YVTGVKNQGQCGSCWAFSATGSLEGQ---HFNATGKLVSL 150

Query: 308 SAEDLVSCCPICG-LGCNGGMPTLAWEY-WKHVGLVSGGNY 424
           S ++LV C    G  GCNGG+P  A++Y  K+ G+ +  +Y
Sbjct: 151 SEQNLVDCSSAEGNEGCNGGLPDDAFKYVIKNGGIDTEASY 191


>UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1;
           Rhipicephalus appendiculatus|Rep: Midgut cysteine
           proteinase 4 - Rhipicephalus appendiculatus (Brown ear
           tick)
          Length = 345

 Score = 52.0 bits (119), Expect = 1e-05
 Identities = 39/148 (26%), Positives = 66/148 (44%), Gaps = 3/148 (2%)
 Frame = +2

Query: 86  KDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTD 265
           K  +  +L ++   A L  + PE  +    W E   +  +++QG CGSCWAF +  A+  
Sbjct: 106 KPPSAQQLAEIPLYAPLFGDTPEFIE----WRENGFVTPVKNQGQCGSCWAFSSTGALEG 161

Query: 266 RVCIYSNATKHFHFSAEDLVSCC--PICGLGCNGGMPTLAWEYWKHV-GLVSGGNYNSSQ 436
           +V  +    +    S ++L+ C        GCNGG    A++Y +   GL +   Y   Q
Sbjct: 162 QV--FKRTRRLISLSEQNLMDCAGQRYGNNGCNGGQMPGAFQYVQDAGGLDTEARYPYRQ 219

Query: 437 GCRPYEIPPCEHHVPGNRMPCNGDTKTP 520
           G   ++     +     R+  NG T+ P
Sbjct: 220 GTN-FQC-QFSNSFEARRVSVNGHTRVP 245


>UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gambiae
           str. PEST|Rep: ENSANGP00000018713 - Anopheles gambiae
           str. PEST
          Length = 559

 Score = 52.0 bits (119), Expect = 1e-05
 Identities = 30/94 (31%), Positives = 50/94 (53%), Gaps = 1/94 (1%)
 Frame = +2

Query: 125 DAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH 304
           D   + +LP +FD    W +   + E+++QGSCGSCWAF AV  +     ++   TK   
Sbjct: 332 DVAGVGDLPRSFD----WRDHGAVTEVKNQGSCGSCWAFSAVGNVEG---LHQIKTKKLE 384

Query: 305 -FSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVG 403
            +S ++L+ C  +   GC GG    A++  + +G
Sbjct: 385 SYSEQELIDCDKVDN-GCGGGYMDDAFKAIEQLG 417


>UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3;
           Bilateria|Rep: Cathepsin L-like cysteine proteinase -
           Longidorus elongatus
          Length = 358

 Score = 52.0 bits (119), Expect = 1e-05
 Identities = 31/89 (34%), Positives = 47/89 (52%), Gaps = 4/89 (4%)
 Frame = +2

Query: 134 LIANLPENFDPRDK--WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHF 307
           +I  +P+N    D   W +   + +++DQGSCGSCWAF A  ++  +   Y    K    
Sbjct: 129 MIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCWAFSATGSLEGQ--HYKQTGKLVSL 186

Query: 308 SAEDLVSCCPICG--LGCNGGMPTLAWEY 388
           S ++LV  C + G   GCNGG    A++Y
Sbjct: 187 SEQNLVD-CDVNGDDEGCNGGYMDGAFQY 214


>UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Rep:
           Cysteine protease - Babesia equi
          Length = 438

 Score = 52.0 bits (119), Expect = 1e-05
 Identities = 27/77 (35%), Positives = 40/77 (51%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355
           W +   +  ++DQG+CGSCWAF AV ++     I     +    S ++LV+C      GC
Sbjct: 230 WRKLNGVTPVKDQGNCGSCWAFAAVGSVESLYLIKKG--QALDLSEQELVNCEENSN-GC 286

Query: 356 NGGMPTLAWEYWKHVGL 406
            G +P  A EY K  G+
Sbjct: 287 EGDLPNKALEYIKAKGI 303


>UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precursor;
           n=3; Plasmodium (Laverania)|Rep: Trophozoite cysteine
           proteinase precursor - Plasmodium falciparum
          Length = 569

 Score = 52.0 bits (119), Expect = 1e-05
 Identities = 41/134 (30%), Positives = 64/134 (47%), Gaps = 5/134 (3%)
 Frame = +2

Query: 38  PTHTPFAHIKILMGALKDDNILKLPKVTH----DAELIANLPENFDPRDKWPECPTLNEI 205
           P H    + K     LKD NIL     T+    + ++ + +PE  D R+K      ++E 
Sbjct: 294 PNHMIEKYSKPFENHLKD-NILISEFYTNGKRNEKDIFSKVPEILDYREKG----IVHEP 348

Query: 206 RDQGSCGSCWAFGAVEAMTDRVCIYSNATKH-FHFSAEDLVSCCPICGLGCNGGMPTLAW 382
           +DQG CGSCWAF +V  +     +++   K+   FS +++V C      GC+GG P  ++
Sbjct: 349 KDQGLCGSCWAFASVGNIES---VFAKKNKNILSFSEQEVVDCSK-DNFGCDGGHPFYSF 404

Query: 383 EYWKHVGLVSGGNY 424
            Y     L  G  Y
Sbjct: 405 LYVLQNELCLGDEY 418


>UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163
           precursor; n=4; Schizophora|Rep: Putative cysteine
           proteinase CG12163 precursor - Drosophila melanogaster
           (Fruit fly)
          Length = 614

 Score = 52.0 bits (119), Expect = 1e-05
 Identities = 27/86 (31%), Positives = 44/86 (51%)
 Frame = +2

Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325
           LP+ FD    W +   + ++++QGSCGSCWAF     +     + +   K   FS ++L+
Sbjct: 394 LPKEFD----WRQKDAVTQVKNQGSCGSCWAFSVTGNIEGLYAVKTGELK--EFSEQELL 447

Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVG 403
             C      CNGG+   A++  K +G
Sbjct: 448 D-CDTTDSACNGGLMDNAYKAIKDIG 472


>UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin
           L-like protease; n=1; Nasonia vitripennis|Rep:
           PREDICTED: similar to cathepsin L-like protease -
           Nasonia vitripennis
          Length = 353

 Score = 51.6 bits (118), Expect = 1e-05
 Identities = 33/98 (33%), Positives = 50/98 (51%), Gaps = 3/98 (3%)
 Frame = +2

Query: 143 NLPENFDPRDKWPECPTLNEIRDQG-SCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319
           N+PE+ D    W +   +  +RDQG +CGSCWAF A  A+  +   +         SA++
Sbjct: 131 NVPEHVD----WRQRGAVTPVRDQGLTCGSCWAFSAAGALEAQ--YFKKTGVLTALSAQN 184

Query: 320 LVSCCPICG-LGCNGGMPTLAWEY-WKHVGLVSGGNYN 427
           L+ C    G LGC GG   L++++     GL    NY+
Sbjct: 185 LIDCTMEYGNLGCGGGSAALSFQFVVDQKGLEPEANYS 222


>UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 344

 Score = 51.6 bits (118), Expect = 1e-05
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 6/104 (5%)
 Frame = +2

Query: 152 ENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSC 331
           +N  P D W     +  ++ QG CGSCW F A  A+ +      N     +FS + ++ C
Sbjct: 134 KNAPPMD-WRNASAITPVKQQGKCGSCWTF-ASTAVLESFSFIKNGAPLTNFSEQQILDC 191

Query: 332 CPICGL---GCNGGMPTLAWEYWKHVGLVSGGNY---NSSQGCR 445
               G    GCNGG  + A  Y    G+     Y      QGC+
Sbjct: 192 VYGSGYYSNGCNGGFGSEALNYAIQNGIAPLSQYPYVGKQQGCK 235


>UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1;
           Phytophthora infestans|Rep: Cathepsin-like cysteine
           protease - Phytophthora infestans (Potato late blight
           fungus)
          Length = 376

 Score = 51.6 bits (118), Expect = 1e-05
 Identities = 29/79 (36%), Positives = 42/79 (53%), Gaps = 1/79 (1%)
 Frame = +2

Query: 131 ELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFS 310
           E + +LP  +D    W E  T+  +++QG CGSCWAF AV AM    C Y+ +T      
Sbjct: 128 ENVEDLPATWD----WREHSTVTPVKNQGQCGSCWAFSAVAAME---CAYALSTGTLESL 180

Query: 311 AEDLVSCCPICGLG-CNGG 364
           +E  +  C + G+  CN G
Sbjct: 181 SEQELVDCTLNGIDTCNHG 199


>UniRef50_Q3L7L6 Cluster: Sar s 1 allergen Yv5032C08; n=1; Sarcoptes
           scabiei type hominis|Rep: Sar s 1 allergen Yv5032C08 -
           Sarcoptes scabiei type hominis
          Length = 340

 Score = 51.6 bits (118), Expect = 1e-05
 Identities = 30/80 (37%), Positives = 44/80 (55%), Gaps = 3/80 (3%)
 Frame = +2

Query: 194 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATK---HFHFSAEDLVSCCPICGLGCNGG 364
           + +IR+Q +CGSCWAF +V A  + + + SN T+   +   S + LV C      GCNG 
Sbjct: 126 VTKIREQLACGSCWAF-SVTANVESLLLGSNCTRWSTNDWLSPQQLVDCA--SDHGCNGE 182

Query: 365 MPTLAWEYWKHVGLVSGGNY 424
             +   EY +H G+V  G Y
Sbjct: 183 KTSTGLEYVQHKGIVKEGVY 202


>UniRef50_Q23EG5 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 429

 Score = 51.6 bits (118), Expect = 1e-05
 Identities = 26/81 (32%), Positives = 42/81 (51%), Gaps = 5/81 (6%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGS----CGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PI 340
           W E   ++ ++DQ +    CGSCW F A  A+   + + +     F+ S + LV C    
Sbjct: 128 WREKGIVSSVKDQDAVGDDCGSCWTFSATGAIESHLALKTGKAP-FNLSQQQLVDCAGKF 186

Query: 341 CGLGCNGGMPTLAWEYWKHVG 403
              GC+GG+P+ A+EY  + G
Sbjct: 187 DNQGCDGGLPSRAFEYIAYAG 207


>UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_79,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 324

 Score = 51.6 bits (118), Expect = 1e-05
 Identities = 42/119 (35%), Positives = 57/119 (47%), Gaps = 7/119 (5%)
 Frame = +2

Query: 110 PKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGS-CGSCWAFGAVEAMTDRVCIYSN 286
           PK T+ ++    +P N   RD W E   +  I+DQGS CGS WAF AV  +     I SN
Sbjct: 104 PKSTNKSKSTDYVP-NGQARD-WVEEGKVPPIKDQGSSCGSSWAFSAVGVLE----INSN 157

Query: 287 ATKHFH--FSAEDLVSCC-PICGLGCNGGMPTLAWEYWKHVGLVSGGNY---NSSQGCR 445
                    S +D++ C  P    GC+GG     +EY +  G+ +G  Y    S Q CR
Sbjct: 158 IEFGLETTLSEQDMLDCSGPYGNQGCSGGWMDSGFEYVRDHGIANGSVYPYVGSDQTCR 216


>UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Lymantria
           dispar multicapsid nuclear polyhedrosis virus (LdMNPV)
          Length = 356

 Score = 51.6 bits (118), Expect = 1e-05
 Identities = 27/91 (29%), Positives = 47/91 (51%), Gaps = 1/91 (1%)
 Frame = +2

Query: 134 LIANLPENFDPRD-KWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFS 310
           +I N P +  P    W E   +  I++QG+CG+CWAF  + ++  +  +  N  +    S
Sbjct: 135 IILNQPPDKGPLHFDWREQNKVTSIKNQGACGACWAFATLASVESQFAMRHN--RLIDLS 192

Query: 311 AEDLVSCCPICGLGCNGGMPTLAWEYWKHVG 403
            + L+ C  +  +GCNGG+   A+E    +G
Sbjct: 193 EQQLIDCDSV-DMGCNGGLLHTAFEEIMRMG 222


>UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin O;
           n=1; Danio rerio|Rep: PREDICTED: similar to cathepsin O
           - Danio rerio
          Length = 327

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 30/92 (32%), Positives = 40/92 (43%), Gaps = 2/92 (2%)
 Frame = +2

Query: 155 NFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC 334
           N  PR  W +   +  + +QGSCG CWAF  VEA+     + +   +     +   V  C
Sbjct: 119 NNPPRFDWRDHGVVGPVHNQGSCGGCWAFSIVEAIES---VSAKVGEKLQQLSVQQVIDC 175

Query: 335 PICGLGCNGGMP--TLAWEYWKHVGLVSGGNY 424
                GCNGG P   L W     + LVS   Y
Sbjct: 176 SYQNQGCNGGSPVEALYWLTQSKLKLVSEAEY 207


>UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3;
           Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays
           (Maize)
          Length = 493

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 27/84 (32%), Positives = 44/84 (52%), Gaps = 1/84 (1%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355
           W E   + E++DQG CG CWAF AV A+     I + +      S ++L+ C      GC
Sbjct: 170 WRERGAVAEVKDQGQCGGCWAFSAVAAVEGINKIVTGSL--ISLSEQELIDCDKFQDQGC 227

Query: 356 NGGMPTLAWEYW-KHVGLVSGGNY 424
           +GG+   A+ +  K+ G+ +  +Y
Sbjct: 228 DGGLMDNAFVFMIKNGGIDTEADY 251


>UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium
           falciparum|Rep: Falcipain 2 - Plasmodium falciparum
          Length = 484

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 31/93 (33%), Positives = 47/93 (50%), Gaps = 2/93 (2%)
 Frame = +2

Query: 152 ENFDPRD-KWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVS 328
           ENFD     W     +  ++DQ +CGSCWAF ++ ++  +  I  N  K    S ++LV 
Sbjct: 258 ENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKN--KLITLSEQELVD 315

Query: 329 CCPICGLGCNGGMPTLAWEYWKHV-GLVSGGNY 424
            C     GCNGG+   A+E    + G+   G+Y
Sbjct: 316 -CSFKNYGCNGGLINNAFEDMIELGGICPDGDY 347


>UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathepsin
           o - Aedes aegypti (Yellowfever mosquito)
          Length = 375

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 29/94 (30%), Positives = 43/94 (45%)
 Frame = +2

Query: 83  LKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMT 262
           +KDD I    K   D +++  LP+  D RDK    P    +R QGSCG+CWA   V+ +T
Sbjct: 134 MKDDIIFSRAK--RDLKILDYLPKVVDWRDKGVVAP----VRSQGSCGACWAISVVDTIT 187

Query: 263 DRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGG 364
             +              + +++C      GC GG
Sbjct: 188 S-ISAIKRQQNFSELCLDQVINCAGNGNFGCEGG 220


>UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_21,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 349

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 26/80 (32%), Positives = 40/80 (50%), Gaps = 3/80 (3%)
 Frame = +2

Query: 194 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC---PICGLGCNGG 364
           ++E+++QGSCGSCWAF AV A+     +     K+   S ++LV C         GC+GG
Sbjct: 137 VSEVKNQGSCGSCWAFSAVAAL--ETALRQGGVKNVELSEQELVDCAVKDEFESEGCDGG 194

Query: 365 MPTLAWEYWKHVGLVSGGNY 424
                ++Y    G+     Y
Sbjct: 195 EMYDGFQYASKYGIAIRSEY 214


>UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Spodoptera
           litura multicapsid nucleopolyhedrovirus (SpltMNPV)
          Length = 337

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 27/88 (30%), Positives = 47/88 (53%)
 Frame = +2

Query: 140 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319
           A  PE+FD    W +   + ++++QG CGSCWAF A+  +  +  I  ++      S + 
Sbjct: 124 ARTPESFD----WRKLNKVTKVKEQGVCGSCWAFAAIGNIESQYAIMHDSL--IDLSEQQ 177

Query: 320 LVSCCPICGLGCNGGMPTLAWEYWKHVG 403
           L+ C  +   GC+GG+  LA++    +G
Sbjct: 178 LLDCDRV-DQGCDGGLMHLAFQEIIRIG 204


>UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC
           3.4.14.1) (Dipeptidyl-peptidase I) (DPP-I) (DPPI)
           (Cathepsin C) (Cathepsin J) (Dipeptidyl transferase)
           [Contains: Dipeptidyl-peptidase 1 exclusion domain chain
           (Dipeptidyl- peptidase I exclusion domain chain);
           Dipeptidyl-peptidase 1 heavy chain (Dipeptidyl-peptidase
           I heavy chain); Dipeptidyl-peptidase 1 light chain
           (Dipeptidyl-peptidase I light chain)]; n=50;
           Coelomata|Rep: Dipeptidyl-peptidase 1 precursor (EC
           3.4.14.1) (Dipeptidyl-peptidase I) (DPP-I) (DPPI)
           (Cathepsin C) (Cathepsin J) (Dipeptidyl transferase)
           [Contains: Dipeptidyl-peptidase 1 exclusion domain chain
           (Dipeptidyl- peptidase I exclusion domain chain);
           Dipeptidyl-peptidase 1 heavy chain (Dipeptidyl-peptidase
           I heavy chain); Dipeptidyl-peptidase 1 light chain
           (Dipeptidyl-peptidase I light chain)] - Homo sapiens
           (Human)
          Length = 463

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 32/101 (31%), Positives = 55/101 (54%), Gaps = 1/101 (0%)
 Frame = +2

Query: 110 PKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNA 289
           P      + I +LP ++D R+       ++ +R+Q SCGSC++F ++  +  R+ I +N 
Sbjct: 219 PLTAEIQQKILHLPTSWDWRNVHG-INFVSPVRNQASCGSCYSFASMGMLEARIRILTNN 277

Query: 290 TKHFHFSAEDLVSCCPICGLGCNGGMPTL-AWEYWKHVGLV 409
           ++    S +++VSC      GC GG P L A +Y +  GLV
Sbjct: 278 SQTPILSPQEVVSCSQY-AQGCEGGFPYLIAGKYAQDFGLV 317


>UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis
           thaliana|Rep: Cysteine proteinase - Arabidopsis thaliana
           (Mouse-ear cress)
          Length = 348

 Score = 50.8 bits (116), Expect = 2e-05
 Identities = 33/104 (31%), Positives = 50/104 (48%), Gaps = 4/104 (3%)
 Frame = +2

Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322
           N+ +N +  D W +   +  ++ QG CG CWAF AV A+     I     +    S + L
Sbjct: 124 NVSDNGESMD-WRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKG--ELVSLSEQQL 180

Query: 323 VSCCPICGLGCNGGMPTLAWEY-WKHVGLVSGGNY---NSSQGC 442
           + C      GC GG+ + A+EY  K+ G+ +  NY    S Q C
Sbjct: 181 LDCDRDYNQGCRGGIMSKAFEYIIKNQGITTEDNYPYQESQQTC 224


>UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-ear
           cress). SAG12 protein; n=2; Dictyostelium
           discoideum|Rep: Similar to Arabidopsis thaliana
           (Mouse-ear cress). SAG12 protein - Dictyostelium
           discoideum (Slime mold)
          Length = 358

 Score = 50.8 bits (116), Expect = 2e-05
 Identities = 28/79 (35%), Positives = 37/79 (46%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355
           W +   +  ++DQG CGSC+ F AVE +           K    S +  V C P  G  C
Sbjct: 151 WRKKGLVTPVKDQGQCGSCYIFSAVEQI--ETAWIKAGNKPILLSEQQAVDCDPYDG-QC 207

Query: 356 NGGMPTLAWEYWKHVGLVS 412
            GG P   +EY+  VG VS
Sbjct: 208 GGGDPYTVYEYFSQVGGVS 226


>UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_46,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 336

 Score = 50.8 bits (116), Expect = 2e-05
 Identities = 28/81 (34%), Positives = 39/81 (48%)
 Frame = +2

Query: 194 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPT 373
           + +++DQG C  CWAFGAV A      + +  T     S + L+  C     GCNGG   
Sbjct: 151 ITQVKDQGQCSGCWAFGAVGAAEAWFYVKNKTT--VLLSEQQLID-CDTQSFGCNGGYQN 207

Query: 374 LAWEYWKHVGLVSGGNYNSSQ 436
           LA +Y  + GL     Y  +Q
Sbjct: 208 LALKYIANHGLNDARVYPYTQ 228


>UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11;
           Entamoeba|Rep: Cysteine proteinase 2 precursor -
           Entamoeba histolytica
          Length = 315

 Score = 50.8 bits (116), Expect = 2e-05
 Identities = 30/94 (31%), Positives = 46/94 (48%), Gaps = 2/94 (2%)
 Frame = +2

Query: 149 PENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKH-FHFSAEDLV 325
           PE+ D    W +   +  IRDQ  CGSC+ FG++ A+  R+ I      +    S E +V
Sbjct: 95  PESVD----WRKEGKVTPIRDQAQCGSCYTFGSLAALEGRLLIEKGGDANTLDLSEEHMV 150

Query: 326 SCCPICG-LGCNGGMPTLAWEYWKHVGLVSGGNY 424
            C    G  GCNGG+ +  ++Y    G+    +Y
Sbjct: 151 QCTRDNGNNGCNGGLGSNVYDYIIEHGVAKESDY 184


>UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15)
           [Contains: Cathepsin L heavy chain; Cathepsin L light
           chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC
           3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain] - Sarcophaga peregrina (Flesh fly)
           (Boettcherisca peregrina)
          Length = 339

 Score = 50.8 bits (116), Expect = 2e-05
 Identities = 28/77 (36%), Positives = 39/77 (50%), Gaps = 1/77 (1%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG-LG 352
           W E   +  ++DQG CGSCWAF +  A+  +   +  A      S ++LV C    G  G
Sbjct: 128 WREHGAVTGVKDQGHCGSCWAFSSTGALEGQ--HFRKAGVLVSLSEQNLVDCSTKYGNNG 185

Query: 353 CNGGMPTLAWEYWKHVG 403
           CNGG+   A+ Y K  G
Sbjct: 186 CNGGLMDNAFRYIKDNG 202


>UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia
           theta|Rep: Cathepsin H precursor - Guillardia theta
           (Cryptomonas phi)
          Length = 353

 Score = 50.4 bits (115), Expect = 3e-05
 Identities = 29/89 (32%), Positives = 47/89 (52%), Gaps = 2/89 (2%)
 Frame = +2

Query: 152 ENFDPRDKW-PECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVS 328
           + FD R++   E   ++ +++QG+CGSCW F    A+     I +   +    S + LV 
Sbjct: 120 DEFDWRNQTCGETSCVSMVKNQGTCGSCWTFSTAAALESLHAIKTG--EMVLLSEQQLVD 177

Query: 329 C-CPICGLGCNGGMPTLAWEYWKHVGLVS 412
           C       GCNGG+P+ A+EY  + G +S
Sbjct: 178 CAADFKNNGCNGGLPSQAFEYIMYNGGLS 206


>UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein A;
           n=2; Dictyostelium discoideum|Rep: Gamete and
           mating-type specific protein A - Dictyostelium
           discoideum (Slime mold)
          Length = 448

 Score = 50.4 bits (115), Expect = 3e-05
 Identities = 28/70 (40%), Positives = 37/70 (52%), Gaps = 2/70 (2%)
 Frame = +2

Query: 203 IRDQGSCGSCWAFGAVEAMTDRVCI-YSNATKH-FHFSAEDLVSCCPICGLGCNGGMPTL 376
           IRDQG CGSCWAF +  A+  R  I Y  A K     S ++ V+C      GCNGG    
Sbjct: 253 IRDQGQCGSCWAFASSAALESRYLIKYGTAQKSTLQLSNQNAVNC---IASGCNGGWSGN 309

Query: 377 AWEYWKHVGL 406
            + ++K  G+
Sbjct: 310 YFNFFKTPGI 319


>UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine
           proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like
           midgut cysteine proteinase - Tenebrio molitor (Yellow
           mealworm)
          Length = 330

 Score = 50.4 bits (115), Expect = 3e-05
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 1/78 (1%)
 Frame = +2

Query: 194 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG-LGCNGGMP 370
           ++E++DQG CGSCW+F    A+  ++ +     +    S ++L+ C    G  GC+GG  
Sbjct: 128 VSEVKDQGQCGSCWSFSTTGAVEGQLALQRG--RLTSLSEQNLIDCSSSYGNAGCDGGWM 185

Query: 371 TLAWEYWKHVGLVSGGNY 424
             A+ Y    G++S   Y
Sbjct: 186 DSAFSYIHDYGIMSESAY 203


>UniRef50_Q7QVE9 Cluster: GLP_542_3431_1206; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_542_3431_1206 - Giardia lamblia ATCC
           50803
          Length = 741

 Score = 50.4 bits (115), Expect = 3e-05
 Identities = 43/133 (32%), Positives = 62/133 (46%), Gaps = 6/133 (4%)
 Frame = +2

Query: 89  DDNILKLPKVTHDAELI-ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTD 265
           +D   +LP    +A+L  A LP NF  R     C    +I +QGSCG C+A  AVE +T 
Sbjct: 40  EDEYNELPDGPDNADLTRAALPTNFTYRGH--RCI---QIINQGSCGCCYAAAAVEMVTA 94

Query: 266 RVCIYSNATKHFHFSAEDLVSC-----CPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNS 430
           R C+  N ++    S EDLV+C       I   GC GG    + ++ +  G+V     + 
Sbjct: 95  RRCLQLNDSR--LVSLEDLVTCDHTKYLNIQNNGCRGGNSLASLKFGETTGMVYDTCEDY 152

Query: 431 SQGCRPYEIPPCE 469
                PY    C+
Sbjct: 153 WNRTYPYPTETCK 165


>UniRef50_Q23H06 Cluster: Papain family cysteine protease containing
           protein; n=18; Tetrahymena thermophila|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 349

 Score = 50.4 bits (115), Expect = 3e-05
 Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 4/87 (4%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSC-CPICGL- 349
           W     + +++ QG+CG+CWAF A   M     I + A     FS + L+ C  P  G  
Sbjct: 147 WRSRGAVTQVKWQGNCGACWAFSATGVMESFNFIQNKAL--VEFSEQQLLDCVIPANGYP 204

Query: 350 --GCNGGMPTLAWEYWKHVGLVSGGNY 424
             GC+GG P    +Y   VG+++   Y
Sbjct: 205 SSGCHGGWPVQCIDYASKVGILNQDRY 231


>UniRef50_Q239L8 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 323

 Score = 50.4 bits (115), Expect = 3e-05
 Identities = 24/83 (28%), Positives = 39/83 (46%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355
           W     +  ++DQG CGSCW+F    A+     ++ +  K    S + LV C      GC
Sbjct: 129 WTTKGAVTPVKDQGQCGSCWSFSTTGAVEG--ALFLSTKKLTSLSEQYLVDCSKDGNEGC 186

Query: 356 NGGMPTLAWEYWKHVGLVSGGNY 424
           NGG+   A+++    G+ +   Y
Sbjct: 187 NGGLMDTAFDFISQHGIPTEAAY 209


>UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, whole
           genome shotgun sequence; n=3; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_2,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 376

 Score = 50.4 bits (115), Expect = 3e-05
 Identities = 25/77 (32%), Positives = 39/77 (50%)
 Frame = +2

Query: 194 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPT 373
           + E++ QG CGSCWAF   + +  R+ I +N  K    S   L+ C      GC+GG  +
Sbjct: 175 VTEVQQQGRCGSCWAFAVQDVVISRLAI-ANKNKLDQLSKTHLIDCADGNTEGCDGGSVS 233

Query: 374 LAWEYWKHVGLVSGGNY 424
            A+++    G V   +Y
Sbjct: 234 DAFDFINKYGTVYEKDY 250


>UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcoptes
           scabiei type hominis|Rep: Cathepsin L-like protease -
           Sarcoptes scabiei type hominis
          Length = 245

 Score = 50.0 bits (114), Expect = 4e-05
 Identities = 37/115 (32%), Positives = 58/115 (50%), Gaps = 5/115 (4%)
 Frame = +2

Query: 125 DAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH 304
           D E +++LP+  D    W     +  I+DQ  CGSCWAF AV +M  +  + +   +   
Sbjct: 113 DNEDVSDLPDEVD----WTLKNVVAPIKDQKQCGSCWAFSAVASMESQNALKTG--QLVE 166

Query: 305 FSAEDLVSCCPICG-LGCNGGMPTLAWEY-WKHVGLVSGGNY---NSSQGCRPYE 454
            S ++LV C    G  GC+GG    A+E+  K  G+ +  +Y     +Q CR Y+
Sbjct: 167 LSEQELVDCSVGEGNEGCDGGWMDSAFEFVIKADGIDTEKSYPYHGVNQVCRSYQ 221


>UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_567_6496_7413 - Giardia lamblia ATCC
           50803
          Length = 305

 Score = 50.0 bits (114), Expect = 4e-05
 Identities = 35/117 (29%), Positives = 49/117 (41%), Gaps = 3/117 (2%)
 Frame = +2

Query: 140 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319
           A  P+  D R   PEC    E  DQ  C  C+AF  + A++ R CI     +    S + 
Sbjct: 79  AGSPDRLDYRQTHPEC--FFEPEDQKECSCCYAFATLGALSTRRCIAKLDPQAVSLSVQH 136

Query: 320 LVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGG--NYNSSQGCRPYEIP-PCEHHVP 481
           +VS C     GC GG    +W + +  G V      Y S +  +  E P  C+   P
Sbjct: 137 MVS-CDSGEAGCQGGEFESSWAFLETEGAVKSDCLPYTSGETGKSGECPTTCQDGTP 192


>UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4;
           Caenorhabditis|Rep: Cathepsin z protein 1 -
           Caenorhabditis elegans
          Length = 306

 Score = 50.0 bits (114), Expect = 4e-05
 Identities = 29/81 (35%), Positives = 41/81 (50%), Gaps = 4/81 (4%)
 Frame = +2

Query: 221 CGSCWAFGAVEAMTDRVCI-YSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKH 397
           CGSCWAFGA  A+ DR+ I   NA    + S ++++ C    G    GG P   ++Y   
Sbjct: 92  CGSCWAFGATSALADRINIKRKNAWPQAYLSVQEVIDCSG-AGTCVMGGEPGGVYKYAHE 150

Query: 398 VGL--VSGGNYNSSQG-CRPY 451
            G+   +  NY +  G C PY
Sbjct: 151 HGIPHETCNNYQARDGKCDPY 171


>UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10;
           Dictyostelium discoideum|Rep: Cysteine proteinase 7
           precursor - Dictyostelium discoideum (Slime mold)
          Length = 460

 Score = 50.0 bits (114), Expect = 4e-05
 Identities = 26/73 (35%), Positives = 38/73 (52%), Gaps = 2/73 (2%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHF-HFSAEDLVSCCPICG-L 349
           W     +  I++QG CG CW+F    A T+     +N  K+    S ++L+ C    G  
Sbjct: 116 WRTQGAVTPIKNQGQCGGCWSFSTTGA-TEGAQYLANGKKNLVSLSEQNLIDCSGSYGNN 174

Query: 350 GCNGGMPTLAWEY 388
           GC GG+ TLA+EY
Sbjct: 175 GCEGGLMTLAFEY 187


>UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27;
           Eukaryota|Rep: Cysteine proteinase GP-I - Zingiber
           officinale (Ginger)
          Length = 221

 Score = 50.0 bits (114), Expect = 4e-05
 Identities = 32/97 (32%), Positives = 48/97 (49%)
 Frame = +2

Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325
           LP++ D R+K    P    +++QG CGSCWAF A+ A+     I +        S + LV
Sbjct: 3   LPDSIDWREKGAVVP----VKNQGGCGSCWAFDAIAAVEGINQIVTGDL--ISLSEQQLV 56

Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQ 436
             C     GC GG P  A++Y     +++ G  NS +
Sbjct: 57  D-CSTRNHGCEGGWPYRAFQY-----IINNGGINSEE 87


>UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16)
           [Contains: Cathepsin H mini chain; Cathepsin H heavy
           chain; Cathepsin H light chain]; n=37; Eukaryota|Rep:
           Cathepsin H precursor (EC 3.4.22.16) [Contains:
           Cathepsin H mini chain; Cathepsin H heavy chain;
           Cathepsin H light chain] - Homo sapiens (Human)
          Length = 335

 Score = 50.0 bits (114), Expect = 4e-05
 Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 1/66 (1%)
 Frame = +2

Query: 194 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMP 370
           ++ +++QG+CGSCW F    A+   + I +   K    + + LV C       GC GG+P
Sbjct: 129 VSPVKNQGACGSCWTFSTTGALESAIAIATG--KMLSLAEQQLVDCAQDFNNHGCQGGLP 186

Query: 371 TLAWEY 388
           + A+EY
Sbjct: 187 SQAFEY 192


>UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3;
           Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber
           officinale (Ginger)
          Length = 475

 Score = 49.6 bits (113), Expect = 5e-05
 Identities = 31/97 (31%), Positives = 47/97 (48%)
 Frame = +2

Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325
           LP++ D    W E   +  +++QG CGSCWAF A+ A+     I +        S + LV
Sbjct: 143 LPDSID----WREKGAVVAVKNQGRCGSCWAFAAIAAVEGINQIVTGDL--ISLSEQQLV 196

Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQ 436
             C     GC GG P  A++Y     +++ G  NS +
Sbjct: 197 D-CSTRNYGCEGGWPYRAFQY-----IINNGGVNSEE 227


>UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24;
           Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa
           (Rice)
          Length = 339

 Score = 49.6 bits (113), Expect = 5e-05
 Identities = 37/99 (37%), Positives = 49/99 (49%), Gaps = 3/99 (3%)
 Frame = +2

Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 316
           I  LP   D R K    P    I+DQG CG CWAF AV AM   V +  +  K    S +
Sbjct: 120 IDTLPATVDWRTKGAVTP----IKDQGQCGCCWAFSAVAAMEGIVKL--STGKLISLSEQ 173

Query: 317 DLVSCCPICG--LGCNGGMPTLAWEY-WKHVGLVSGGNY 424
           +LV  C + G   GC GG+   A+++  K+ GL +   Y
Sbjct: 174 ELVD-CDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKY 211


>UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine
           protease; n=11; Callosobruchus maculatus|Rep: Putative
           gut cathepsin L-like cysteine protease - Callosobruchus
           maculatus (Southern cowpea weevil) (Pulse bruchid)
          Length = 326

 Score = 49.6 bits (113), Expect = 5e-05
 Identities = 28/85 (32%), Positives = 44/85 (51%), Gaps = 2/85 (2%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC--PICGL 349
           W E   +  ++DQ +CGSCWAF AV A+  +     N T     SA++LV C        
Sbjct: 118 WREEGAVTPVKDQANCGSCWAFSAVGAIEGQF-FKKNGTL-VSLSAQELVDCATEDYGNN 175

Query: 350 GCNGGMPTLAWEYWKHVGLVSGGNY 424
           GC GG+   A+++ +  G+ +  +Y
Sbjct: 176 GCKGGLMGQAFDFVQDEGIQTEESY 200


>UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain;
           n=9; Cucujiformia|Rep: Digestive cysteine proteinase
           intestain - Leptinotarsa decemlineata (Colorado potato
           beetle)
          Length = 326

 Score = 49.6 bits (113), Expect = 5e-05
 Identities = 35/130 (26%), Positives = 60/130 (46%), Gaps = 4/130 (3%)
 Frame = +2

Query: 47  TPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDK--WPECPTLNEIRDQGS 220
           TPFA +       KD+   ++    +    +A  PE  +  D   W +   + +++ QG 
Sbjct: 73  TPFADLT--HDEFKDELRRQIKTKPNVEATLAVFPEGLEVPDSIDWTQKGAVLDVKYQGG 130

Query: 221 CGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGC-NGGMPTLAWEYWK 394
           CGSCWAF A  A+  +  I +N       S + L+ C  P     C +GG+ + A++Y  
Sbjct: 131 CGSCWAFSATGALEGQNAIVNNV--KIPLSEQQLLDCSKPYGNDDCEHGGLMSFAFDYVL 188

Query: 395 HVGLVSGGNY 424
             G+ +  +Y
Sbjct: 189 DKGIEADSSY 198


>UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;
           Theileria|Rep: Cysteine protease, tacP, putative -
           Theileria annulata
          Length = 461

 Score = 49.6 bits (113), Expect = 5e-05
 Identities = 29/87 (33%), Positives = 44/87 (50%), Gaps = 1/87 (1%)
 Frame = +2

Query: 149 PENFDPRDKWPECPTLNEIRDQG-SCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325
           PE+ D    W     + +++DQG  C SCWAF +V A+     +  +       S + L+
Sbjct: 237 PEDLD----WRRPDVVTKVKDQGLDCSSCWAFASVAAVESIFQLLQDV--DLDLSEQHLI 290

Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVGL 406
           +C   C  GC+GG   LA +Y K+ GL
Sbjct: 291 NCETRCS-GCSGGYADLALDYVKNKGL 316


>UniRef50_Q22AB1 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 344

 Score = 49.6 bits (113), Expect = 5e-05
 Identities = 27/88 (30%), Positives = 43/88 (48%)
 Frame = +2

Query: 140 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319
           ++LPE+FD RDK    P     + Q +CGSCW F     +  +  +      HF   +E 
Sbjct: 129 SDLPESFDWRDKGIITPA----KFQNTCGSCWTFATTGVIESQYALKYGELLHF---SEQ 181

Query: 320 LVSCCPICGLGCNGGMPTLAWEYWKHVG 403
           ++  C     GC GG+ T A+++ +  G
Sbjct: 182 MLLDCDNINQGCRGGLMTDAYQFLQQSG 209


>UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core
           eudicotyledons|Rep: Chymopapain precursor - Carica
           papaya (Papaya)
          Length = 352

 Score = 49.6 bits (113), Expect = 5e-05
 Identities = 37/138 (26%), Positives = 62/138 (44%)
 Frame = +2

Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 316
           + N P++ D R K    P    +++QG+CGSCWAF  +  +     I +        S +
Sbjct: 132 VTNYPQSIDWRAKGAVTP----VKNQGACGSCWAFSTIATVEGINKIVTG--NLLELSEQ 185

Query: 317 DLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMP 496
           +LV  C     GC GG  T + +Y  + G+ +   Y      + Y+    +   PG ++ 
Sbjct: 186 ELVD-CDKHSYGCKGGYQTTSLQYVANNGVHTSKVY--PYQAKQYKCRATDK--PGPKVK 240

Query: 497 CNGDTKTPKCQKNCESSY 550
             G  + P    NCE+S+
Sbjct: 241 ITGYKRVP---SNCETSF 255


>UniRef50_UPI00006CB77B Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 382

 Score = 49.2 bits (112), Expect = 7e-05
 Identities = 32/112 (28%), Positives = 54/112 (48%), Gaps = 6/112 (5%)
 Frame = +2

Query: 155 NFDPRDKWPECPTLNEIRDQGS-CGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSC 331
           +F+   K+P+C  +  I +QG  C + ++  AV ++ DR+C+ S    +F  SA+  +SC
Sbjct: 128 SFNFHTKYPQC--VRPIANQGKDCSASYSIAAVSSVADRLCMASEGDFNFGLSAQPTISC 185

Query: 332 CPICGLGCNGGMPTLAWEYWKHVGLVSG-----GNYNSSQGCRPYEIPPCEH 472
                  C GG  +  ++  K  G V          +S++GC    I  CEH
Sbjct: 186 YENQSYKCEGGYVSKTFQKGKTTGFVKEECLPYHGTDSNEGCS--LIDKCEH 235


>UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3;
           Curculionidae|Rep: Cysteine proteinase - Hypera postica
           (alfalfa weevil)
          Length = 324

 Score = 49.2 bits (112), Expect = 7e-05
 Identities = 26/83 (31%), Positives = 40/83 (48%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355
           W +   +  ++DQG CGSCWAF ++   T+      +  K    S + L+ CC     GC
Sbjct: 118 WRKEGRVTGVKDQGDCGSCWAF-SITGSTEGAYARKSG-KLVSLSEQQLIDCCTDTSAGC 175

Query: 356 NGGMPTLAWEYWKHVGLVSGGNY 424
           +GG     ++Y    GL S  +Y
Sbjct: 176 DGGSLDDNFKYVMKDGLQSEESY 198


>UniRef50_Q23FQ5 Cluster: Papain family cysteine protease containing
           protein; n=4; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 332

 Score = 49.2 bits (112), Expect = 7e-05
 Identities = 28/93 (30%), Positives = 41/93 (44%), Gaps = 3/93 (3%)
 Frame = +2

Query: 155 NFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC 334
           N+     W     LN I++QG CGSC AFG    +      Y  + +   FS + L+ C 
Sbjct: 124 NYPTSVDWRNSGALNPIQNQGQCGSCAAFGTAGVLES--FYYLKSKQLLKFSEQQLLDCA 181

Query: 335 PICGL---GCNGGMPTLAWEYWKHVGLVSGGNY 424
              G    GC+G      ++Y    G+V G +Y
Sbjct: 182 RQAGFDTYGCDGAWQQEYFKYAIKYGIVQGSSY 214


>UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 452

 Score = 49.2 bits (112), Expect = 7e-05
 Identities = 30/84 (35%), Positives = 44/84 (52%), Gaps = 2/84 (2%)
 Frame = +2

Query: 119 THDAELIANLPENFDPRDKWPECPTLNEI-RDQGSCGSCWAFGAVEAMTDRVCIYSNATK 295
           T+D ++I NLPE+F     W   P + E   DQ  CG+C+AFGA EA+  +  + +N  +
Sbjct: 216 TYDQKVIQNLPESFS----WRNVPYVLEYPHDQAVCGTCFAFGASEAINGQFSLRAN--R 269

Query: 296 HFHFSAEDLVSCC-PICGLGCNGG 364
               S + LV C        C+GG
Sbjct: 270 SIITSVQQLVDCTWGTINYACDGG 293


>UniRef50_A2DPZ6 Cluster: Clan CA, family C1, cathepsin B-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin B-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 253

 Score = 49.2 bits (112), Expect = 7e-05
 Identities = 33/119 (27%), Positives = 56/119 (47%)
 Frame = +2

Query: 128 AELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHF 307
           + +   LPE ++  +++PEC     I+    CG C+ + A++++  R C      +   F
Sbjct: 22  SNISVELPEYYNFLEEYPECDFGPLIQH---CGCCYVYSALKSLAHRYC--RALRRRIQF 76

Query: 308 SAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPG 484
           SA+ ++S C +  LGCNGG     + Y +  G V        +G R Y    C+  V G
Sbjct: 77  SAQYIIS-CDLFNLGCNGGNEKAVFYYLEQHG-VPELECQPWRGIRGYNQEVCKKCVNG 133


>UniRef50_A0CYS7 Cluster: Chromosome undetermined scaffold_31, whole
           genome shotgun sequence; n=3; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_31,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 358

 Score = 49.2 bits (112), Expect = 7e-05
 Identities = 30/89 (33%), Positives = 42/89 (47%)
 Frame = +2

Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325
           +PE+++ R+  PEC     I  QG+C S ++  AV A +DR+C   N       S +  +
Sbjct: 131 IPESYNFREAQPECA--QPIYFQGNCSSSYSIAAVSATSDRLCKSKNGEFQDQLSPQSPI 188

Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVGLVS 412
           S C      C GG  T   E  K  G VS
Sbjct: 189 S-CDDKNYKCGGGSVTRVLEVGKKQGFVS 216


>UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin
           heavy chain; n=3; Amniota|Rep: PREDICTED: similar to
           ferritin heavy chain - Ornithorhynchus anatinus
          Length = 338

 Score = 48.8 bits (111), Expect = 9e-05
 Identities = 30/81 (37%), Positives = 40/81 (49%), Gaps = 1/81 (1%)
 Frame = +2

Query: 149 PENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVS 328
           PE  D R K    P    +++QG CGSCWAF A  A+     ++    K    S ++LV 
Sbjct: 121 PEEVDWRTKGYVTP----VKNQGLCGSCWAFSATGAL--EALVFKTTGKMVSLSEQNLVD 174

Query: 329 CCPICG-LGCNGGMPTLAWEY 388
           C    G +GC GG    A+EY
Sbjct: 175 CSWRQGNVGCRGGQYIGAFEY 195


>UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n=1;
           Xenopus tropicalis|Rep: UPI00006A2275 UniRef100 entry -
           Xenopus tropicalis
          Length = 272

 Score = 48.8 bits (111), Expect = 9e-05
 Identities = 33/90 (36%), Positives = 44/90 (48%), Gaps = 3/90 (3%)
 Frame = +2

Query: 164 PRDKWPECPTLNEIRDQGS-CGSCWAFGAVEAMTDRVCIYSNATKHF-HFSAEDLVSCCP 337
           P   W     +  +RDQGS C SC+AF AV A+    C +   T     FS ++LV C  
Sbjct: 81  PSIDWRTQNCVTPVRDQGSFCRSCYAFSAVGALE---CQWKKKTVRLVTFSPQELVDCSD 137

Query: 338 ICGL-GCNGGMPTLAWEYWKHVGLVSGGNY 424
             G  GCNGG    A++Y K  G++    Y
Sbjct: 138 GEGNHGCNGGKIEKAFKYMKKYGVMEESAY 167


>UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel
           protein - Danio rerio (Zebrafish) (Brachydanio rerio)
          Length = 328

 Score = 48.8 bits (111), Expect = 9e-05
 Identities = 37/122 (30%), Positives = 57/122 (46%), Gaps = 3/122 (2%)
 Frame = +2

Query: 167 RDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG 346
           R  W E   ++ +++QG CGSCWAF AV ++  ++   + A      SA++L+ C    G
Sbjct: 116 RVNWTEHGMVSPVQNQGPCGSCWAFSAVGSLEAQMKRRTAAL--VPLSAQNLLDCSVSLG 173

Query: 347 -LGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPP--CEHHVPGNRMPCNGDTKT 517
             GC GG  + A+ Y     ++     +SS    PYE     C + V G    C G    
Sbjct: 174 NRGCKGGFLSRAFLY-----VIQNRGIDSST-FYPYEHKEGVCRYSVSGRAGYCTGFRIV 227

Query: 518 PK 523
           P+
Sbjct: 228 PR 229


>UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba
           histolytica|Rep: Cysteine protease 19 - Entamoeba
           histolytica
          Length = 324

 Score = 48.8 bits (111), Expect = 9e-05
 Identities = 27/81 (33%), Positives = 45/81 (55%), Gaps = 4/81 (4%)
 Frame = +2

Query: 194 LNEIRDQGSCGSCWAFGAVEAMTDRVCI-YSN-ATKHFHFSAEDLVSCC--PICGLGCNG 361
           +  ++DQG+CGSC+AF +V  M   V + Y + +  ++  S  ++VSCC  P    GC G
Sbjct: 112 MTPVKDQGNCGSCYAFSSVALMETAVLLSYDDLSPSNYALSTAEIVSCCYDPSECRGCEG 171

Query: 362 GMPTLAWEYWKHVGLVSGGNY 424
           G    A +Y +  G+ S  ++
Sbjct: 172 GSIGGALKYAQDNGMQSESSF 192


>UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n=1;
           Myxobolus cerebralis|Rep: Cathepsin Z-like cysteine
           proteinase - Myxobolus cerebralis
          Length = 297

 Score = 48.8 bits (111), Expect = 9e-05
 Identities = 29/86 (33%), Positives = 44/86 (51%), Gaps = 4/86 (4%)
 Frame = +2

Query: 143 NLPENFDPRDKWPECPTLNEIRDQGS---CGSCWAFGAVEAMTDRVCIYSNATKHFHFS- 310
           N+P++FD    W E   L+ +++Q     CGSCWAF +   + DR+ I  N +   HFS 
Sbjct: 49  NMPKSFD----WRENAYLSSVKNQHLPTYCGSCWAFASTSTIADRIYIAKNLSHFDHFSL 104

Query: 311 AEDLVSCCPICGLGCNGGMPTLAWEY 388
           +  +V  C   G    GG  +  +EY
Sbjct: 105 SVQVVIACAQSGDCKLGGFASGVYEY 130


>UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;
           Brugia malayi|Rep: Cahepsin L-like cysteine protease -
           Brugia malayi (Filarial nematode worm)
          Length = 371

 Score = 48.8 bits (111), Expect = 9e-05
 Identities = 27/83 (32%), Positives = 43/83 (51%), Gaps = 2/83 (2%)
 Frame = +2

Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325
           LP++ D    W     + +++DQG CGSCW F AV A+  +  + +   K    S ++L+
Sbjct: 143 LPKSID----WRTSGAVTKVKDQGYCGSCWTFSAVGALEGQHFLQTG--KLVELSMQNLL 196

Query: 326 SCC--PICGLGCNGGMPTLAWEY 388
            C        GC+GG+   A+EY
Sbjct: 197 DCSDDTYGNYGCDGGLMMEAFEY 219


>UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides
           sonorensis|Rep: Cathepsin L - Culicoides sonorensis
          Length = 331

 Score = 48.8 bits (111), Expect = 9e-05
 Identities = 24/68 (35%), Positives = 38/68 (55%)
 Frame = +2

Query: 203 IRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAW 382
           +++Q  CGSCWAF +V ++  R   + N  K +  + ++LV  C     GC+GG   LA 
Sbjct: 131 VKNQAQCGSCWAFASVASVEMRYKRFHN--KSYTLAEQELVD-CETTSHGCSGGWSDLAL 187

Query: 383 EYWKHVGL 406
           +Y +  GL
Sbjct: 188 QYMRDNGL 195


>UniRef50_Q24FA8 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 335

 Score = 48.8 bits (111), Expect = 9e-05
 Identities = 24/87 (27%), Positives = 41/87 (47%), Gaps = 4/87 (4%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPI----C 343
           W +   ++ +++QG CG CW F A   M     I++       +S + L+ C  +     
Sbjct: 130 WRKKGGVSPVKNQGECGGCWTFSATGLMESFNLIHNKPQNVSLYSQQQLLDCVTLENGYF 189

Query: 344 GLGCNGGMPTLAWEYWKHVGLVSGGNY 424
             GC GG+P+ A +Y    G++S   Y
Sbjct: 190 SEGCEGGVPSDAVQYAADFGVLSDNEY 216


>UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 383

 Score = 48.8 bits (111), Expect = 9e-05
 Identities = 29/83 (34%), Positives = 40/83 (48%)
 Frame = +2

Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355
           W E   L  I++QG CGSCWAF  V ++  +  I     K    S +++V  C     GC
Sbjct: 174 WREQGKLTPIKNQGQCGSCWAFATVASVEAQNAIKKG--KLVSLSEQEMVD-CDGRNNGC 230

Query: 356 NGGMPTLAWEYWKHVGLVSGGNY 424
           +GG    A ++ K  GL S   Y
Sbjct: 231 SGGYRPYAMKFVKENGLESEKEY 253


>UniRef50_A4VE98 Cluster: Cathepsin z; n=3; Tetrahymena thermophila
           SB210|Rep: Cathepsin z - Tetrahymena thermophila SB210
          Length = 585

 Score = 48.8 bits (111), Expect = 9e-05
 Identities = 40/138 (28%), Positives = 62/138 (44%), Gaps = 5/138 (3%)
 Frame = +2

Query: 8   QNTWKAGRNFPTHTP-FAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPE 184
           +NT K       HT  F H   +  + K+   L    + H+    A+LP N+D R+    
Sbjct: 290 RNTTKVTEVSNNHTNNFRHTTCIRESNKNSTQLITGPLPHEYINAASLPANWDWRNI-NG 348

Query: 185 CPTLNEIRDQGS---CGSCWAFGAVEAMTDRVCIYSNAT-KHFHFSAEDLVSCCPICGLG 352
              L+  R+Q     CGSCWA G   ++ DR+ I  N T      S + +++C    G  
Sbjct: 349 VNYLSFTRNQHIPQYCGSCWAHGTTSSLADRINIARNRTWPDIALSVQVVLNC--QAGGS 406

Query: 353 CNGGMPTLAWEYWKHVGL 406
           CNGG P   +++    G+
Sbjct: 407 CNGGQPMGVYQFANKQGI 424



 Score = 40.7 bits (91), Expect = 0.025
 Identities = 41/163 (25%), Positives = 63/163 (38%), Gaps = 5/163 (3%)
 Frame = +2

Query: 110 PKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGS---CGSCWAFGAVEAMTDRVCIY 280
           P V  +AE  + LP NF  ++       L  +R+Q     CGSCWA  A   + DR+ I 
Sbjct: 31  PYVISNAEFNSVLPSNFTWQNV-NGTDYLTLVRNQHIPQYCGSCWAQAASSTLADRIKIA 89

Query: 281 SNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIP 460
             A       A  ++  C     GC+GG    A+++ K   +       + + C PY+  
Sbjct: 90  RKAQWPDVVIAPQVLVSCDEYSNGCHGGNSGTAFQWIKEHNI-------TDETCSPYQA- 141

Query: 461 PCEHHVPGNRMPCNGDTKTPKC--QKNCESSYNVPFKKEQRYG 583
               +   N + C+       C   K C +  N        YG
Sbjct: 142 ----YGHDNGLGCSAQIMCKNCMPNKGCWAQENAKVYTVAEYG 180


>UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1
           precursor; n=20; Psoroptidia|Rep: Major mite fecal
           allergen Der f 1 precursor - Dermatophagoides farinae
           (House-dust mite)
          Length = 321

 Score = 48.8 bits (111), Expect = 9e-05
 Identities = 32/94 (34%), Positives = 41/94 (43%)
 Frame = +2

Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322
           N+P   D R       T+  IR QG CGSCWAF  V A       Y N +     S ++L
Sbjct: 108 NVPSELDLRS----LRTVTPIRMQGGCGSCWAFSGVAATESAYLAYRNTS--LDLSEQEL 161

Query: 323 VSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424
           V C      GC+G       EY +  G+V   +Y
Sbjct: 162 VDCA--SQHGCHGDTIPRGIEYIQQNGVVEERSY 193


>UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:
           Viral cathepsin - Xestia c-nigrum granulosis virus
           (XnGV) (Xestia c-nigrumgranulovirus)
          Length = 346

 Score = 48.8 bits (111), Expect = 9e-05
 Identities = 28/80 (35%), Positives = 43/80 (53%)
 Frame = +2

Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325
           +P++FD RD+     ++  ++ Q  CGSCWAF AV  +     I  N +     S + LV
Sbjct: 133 VPDSFDWRDR----NSVTSVKMQKECGSCWAFSAVANIESLYHIKHNVS--LDLSEQQLV 186

Query: 326 SCCPICGLGCNGGMPTLAWE 385
            C  +   GCNGG+ + A+E
Sbjct: 187 DCDKV-NNGCNGGLMSWAFE 205


>UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine
           protease; n=1; Strongylocentrotus purpuratus|Rep:
           PREDICTED: similar to cysteine protease -
           Strongylocentrotus purpuratus
          Length = 494

 Score = 48.4 bits (110), Expect = 1e-04
 Identities = 39/128 (30%), Positives = 56/128 (43%), Gaps = 1/128 (0%)
 Frame = +2

Query: 5   KQNTWKAG-RNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWP 181
           +Q T K G   F   T     K+  G LK   I K   +         +PE +D    W 
Sbjct: 197 EQGTAKYGPTKFADMTEAEFRKLQSGPLKKTGIKKQAAIPQGP-----VPEEYD----WR 247

Query: 182 ECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNG 361
               +  +++QG CGSCWAF A+  M  +  I     +    S ++LV C  + G GC G
Sbjct: 248 THGAVTPVKNQGMCGSCWAFSAIGNMEGQWQIKKG--ELISLSEQELVDCDKVDG-GCEG 304

Query: 362 GMPTLAWE 385
           G  + A+E
Sbjct: 305 GEMSDAYE 312


>UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep:
           Cathepsin K - Danio rerio (Zebrafish) (Brachydanio
           rerio)
          Length = 333

 Score = 48.4 bits (110), Expect = 1e-04
 Identities = 37/131 (28%), Positives = 58/131 (44%), Gaps = 2/131 (1%)
 Frame = +2

Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 316
           +  LP++ D R    +   +  +++QGSCGSCWAF +V A+  +  +     +    S +
Sbjct: 115 VGKLPKSIDYR----KLGYVTSVKNQGSCGSCWAFSSVGALEGQ--LMKTKGQLVDLSPQ 168

Query: 317 DLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPY--EIPPCEHHVPGNR 490
           +LV C      GC GG  T A+ Y      VS      S+   PY      C ++  G  
Sbjct: 169 NLVDCVTE-NDGCGGGYMTNAFRY------VSNNQGIDSEESYPYVGTDQQCAYNTSGVA 221

Query: 491 MPCNGDTKTPK 523
             C G  + P+
Sbjct: 222 ASCRGYKEIPQ 232


>UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 343

 Score = 48.4 bits (110), Expect = 1e-04
 Identities = 30/94 (31%), Positives = 44/94 (46%)
 Frame = +2

Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322
           NLP + D R+       +  I+ QG CGSCWAF    A+   V I     +    S++ L
Sbjct: 134 NLPNSVDWRNV-NGTNHVTGIKYQGPCGSCWAFATAAAIESAVSISGGGLQ--SLSSQQL 190

Query: 323 VSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424
           + C  +    C GG P  A +Y +  G+ +  NY
Sbjct: 191 LDCTVVSD-KCGGGEPVEALKYAQSHGITTAHNY 223


  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 603,931,243
Number of Sequences: 1657284
Number of extensions: 12407851
Number of successful extensions: 34860
Number of sequences better than 10.0: 468
Number of HSP's better than 10.0 without gapping: 33339
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 34539
length of database: 575,637,011
effective HSP length: 97
effective length of database: 414,880,463
effective search space used: 40658285374
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
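
The bit scores and Expect values in the alignment headers above can be approximately reproduced from the gapped Lambda and K and the effective search space listed in this statistics block. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul relations, bit score = (Lambda * S - ln K) / ln 2 and E = search space * 2^(-bit score), and the function and constant names are illustrative only. Because the report prints Lambda and K rounded to three significant figures, the recomputed E-values may differ slightly from the printed ones.

```python
import math

# Values copied from the "Gapped" Lambda/K line and the
# "effective search space used" line of this report.
GAPPED_LAMBDA = 0.279
GAPPED_K = 0.0580
EFFECTIVE_SEARCH_SPACE = 40658285374


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score (Karlin-Altschul)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=EFFECTIVE_SEARCH_SPACE,
                 lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Estimate the E-value for a raw score over the given search space."""
    return search_space * 2.0 ** (-bit_score(raw_score, lam, k))


if __name__ == "__main__":
    # Raw scores are the numbers in parentheses on the "Score =" lines.
    for raw in (113, 112, 111, 110, 91):
        print(f"raw={raw:4d}  bits={bit_score(raw):5.1f}  "
              f"E~{expect_value(raw):.1e}")
```

For example, the raw score of 113 reported for the chymopapain hit (UniRef50_P14080) works out to roughly 49.6 bits and an E-value near 5e-05 under these assumptions, in line with the header of that alignment.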
