SilkBase

Last updated: 2022/11/18
BLASTX 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= I10A02NGRL0003_K10
         (548 letters)

Database: uniref50 
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpw...   190   1e-47
UniRef50_A1XG92 Cluster: Putative cathepsin B-like like proteina...   187   1e-46
UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome sh...   165   6e-40
UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Ca...   160   2e-38
UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin ...   160   2e-38
UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=...   153   3e-36
UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n...   144   1e-33
UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase pr...   143   2e-33
UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep: ...   140   1e-32
UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor; ...   138   8e-32
UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep...   136   3e-31
UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=...   135   6e-31
UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Catheps...   130   2e-29
UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|...   130   2e-29
UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 ...   129   5e-29
UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase pr...   128   6e-29
UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n...   128   8e-29
UniRef50_Q23FP9 Cluster: Papain family cysteine protease contain...   125   6e-28
UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1; Rhip...   122   4e-27
UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4 ...   121   1e-26
UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA...   120   2e-26
UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep: Ca...   119   4e-26
UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2...   118   9e-26
UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8...   118   1e-25
UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep: Ca...   118   1e-25
UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8; Leishma...   118   1e-25
UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8....   115   8e-25
UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7; Rhabditida|...   114   1e-24
UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoid...   113   3e-24
UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyl...   111   8e-24
UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep...   111   8e-24
UniRef50_Q237A1 Cluster: Papain family cysteine protease contain...   109   5e-23
UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3 ...   107   2e-22
UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gamb...   106   4e-22
UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin ...   105   5e-22
UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator ame...   105   5e-22
UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4; Ancylostomati...   104   2e-21
UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7...   104   2e-21
UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: C...   103   4e-21
UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1 ...   101   8e-21
UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1; ...   101   1e-20
UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2 ...   101   1e-20
UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115, w...   100   3e-20
UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma j...   100   4e-20
UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8....    99   6e-20
UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep...    98   1e-19
UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus co...    94   2e-18
UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2; ...    93   3e-18
UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|R...    89   6e-17
UniRef50_P91991 Cluster: Putative uncharacterized protein; n=2; ...    89   6e-17
UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomus...    87   3e-16
UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep...    85   1e-15
UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella ve...    85   1e-15
UniRef50_Q00W63 Cluster: Cysteine proteinase; n=2; Ostreococcus|...    80   4e-14
UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1; O...    79   9e-14
UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|R...    75   8e-13
UniRef50_Q625X3 Cluster: Putative uncharacterized protein CBG011...    75   1e-12
UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Gi...    75   1e-12
UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|R...    72   1e-11
UniRef50_A4RYN2 Cluster: Predicted protein; n=1; Ostreococcus lu...    69   5e-11
UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Gi...    69   7e-11
UniRef50_O02470 Cluster: Cysteine proteinase; n=2; Chromadorea|R...    68   2e-10
UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like prote...    65   9e-10
UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;...    64   2e-09
UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia...    64   3e-09
UniRef50_UPI0000E4866E Cluster: PREDICTED: similar to oxidized-L...    63   4e-09
UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy...    63   4e-09
UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ...    63   5e-09
UniRef50_Q5DEC9 Cluster: SJCHGC06356 protein; n=1; Schistosoma j...    62   6e-09
UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p; ...    62   1e-08
UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA,...    62   1e-08
UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: ...    61   2e-08
UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n...    60   3e-08
UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcop...    60   3e-08
UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy...    60   3e-08
UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2; cel...    60   3e-08
UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadida...    60   4e-08
UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re...    59   6e-08
UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag...    59   6e-08
UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35...    59   8e-08
UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R...    58   1e-07
UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ...    58   1e-07
UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr...    58   1e-07
UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru...    58   2e-07
UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-li...    58   2e-07
UniRef50_Q9UJW2 Cluster: Tubulointerstitial nephritis antigen; n...    58   2e-07
UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma...    58   2e-07
UniRef50_UPI0000E815AE Cluster: PREDICTED: similar to glucocorti...    57   2e-07
UniRef50_Q24E33 Cluster: Papain family cysteine protease contain...    57   2e-07
UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=...    57   2e-07
UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=...    57   2e-07
UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1; ...    57   3e-07
UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy...    57   3e-07
UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin ...    56   4e-07
UniRef50_O16454 Cluster: Temporarily assigned gene name protein ...    56   4e-07
UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ...    56   4e-07
UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re...    56   5e-07
UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - ...    56   5e-07
UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18...    56   7e-07
UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio...    55   9e-07
UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa...    55   9e-07
UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, w...    55   9e-07
UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi...    55   9e-07
UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ...    55   1e-06
UniRef50_Q9GZM7-2 Cluster: Isoform 2 of Q9GZM7 ; n=1; Homo sapie...    55   1e-06
UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lambl...    55   1e-06
UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n...    55   1e-06
UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1...    55   1e-06
UniRef50_Q23H32 Cluster: Papain family cysteine protease contain...    55   1e-06
UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; The...    55   1e-06
UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;...    54   2e-06
UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted...    54   2e-06
UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestin...    54   2e-06
UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;...    54   2e-06
UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e...    54   2e-06
UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto...    54   2e-06
UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium...    54   2e-06
UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel...    54   2e-06
UniRef50_Q23TW3 Cluster: Papain family cysteine protease contain...    54   3e-06
UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, wh...    54   3e-06
UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ...    53   4e-06
UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ...    53   4e-06
UniRef50_Q23RT7 Cluster: Papain family cysteine protease contain...    53   4e-06
UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cy...    53   4e-06
UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain...    53   5e-06
UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ...    53   5e-06
UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease ...    52   7e-06
UniRef50_P81494 Cluster: Cathepsin B; n=2; Phasianidae|Rep: Cath...    52   7e-06
UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j...    52   9e-06
UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Try...    52   9e-06
UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain...    52   9e-06
UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ...    52   9e-06
UniRef50_Q6XM39 Cluster: FirrV-1-A48 precursor; n=1; Feldmannia ...    52   1e-05
UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ...    52   1e-05
UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb...    52   1e-05
UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh...    52   1e-05
UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Pl...    52   1e-05
UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep...    51   2e-05
UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n...    51   2e-05
UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1...    51   2e-05
UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathep...    51   2e-05
UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er...    51   2e-05
UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis...    51   2e-05
UniRef50_Q649T1 Cluster: Cathepsin C; n=1; uncultured archaeon G...    51   2e-05
UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov...    51   2e-05
UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease ...    51   2e-05
UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foet...    51   2e-05
UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cy...    51   2e-05
UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen...    51   2e-05
UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi...    51   2e-05
UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum...    50   3e-05
UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, wh...    50   3e-05
UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; The...    50   3e-05
UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio...    50   4e-05
UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n...    50   4e-05
UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain ...    50   4e-05
UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;...    50   4e-05
UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ...    50   5e-05
UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG...    50   5e-05
UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus...    50   5e-05
UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n...    50   5e-05
UniRef50_Q22W19 Cluster: Papain family cysteine protease contain...    50   5e-05
UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precurs...    50   5e-05
UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ...    50   5e-05
UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M...    50   5e-05
UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3; ...    49   6e-05
UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia...    49   6e-05
UniRef50_Q24HI2 Cluster: Papain family cysteine protease contain...    49   6e-05
UniRef50_Q231X3 Cluster: Papain family cysteine protease contain...    49   6e-05
UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus...    49   6e-05
UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like cy...    49   6e-05
UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine p...    49   8e-05
UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big...    49   8e-05
UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca...    49   8e-05
UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Pir...    49   8e-05
UniRef50_Q23FS6 Cluster: Papain family cysteine protease contain...    49   8e-05
UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy...    49   8e-05
UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:...    49   8e-05
UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, wh...    49   8e-05
UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh...    49   8e-05
UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr...    49   8e-05
UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:...    49   8e-05
UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata...    49   8e-05
UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3....    49   8e-05
UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin ...    48   1e-04
UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ...    48   1e-04
UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein...    48   1e-04
UniRef50_Q235G6 Cluster: Papain family cysteine protease contain...    48   1e-04
UniRef50_A4VE98 Cluster: Cathepsin z; n=3; Tetrahymena thermophi...    48   1e-04
UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ...    48   1e-04
UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; M...    48   1e-04
UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin ...    48   1e-04
UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Pl...    48   1e-04
UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia ...    48   2e-04
UniRef50_Q58HK5 Cluster: Hc58; n=1; Haemonchus contortus|Rep: Hc...    48   2e-04
UniRef50_Q239L8 Cluster: Papain family cysteine protease contain...    48   2e-04
UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali...    48   2e-04
UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ...    47   3e-04
UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar ...    47   3e-04
UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Pl...    47   3e-04
UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcopt...    47   3e-04
UniRef50_Q7QVE9 Cluster: GLP_542_3431_1206; n=1; Giardia lamblia...    47   3e-04
UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr...    47   3e-04
UniRef50_Q23H10 Cluster: Papain family cysteine protease contain...    47   3e-04
UniRef50_Q23EG5 Cluster: Papain family cysteine protease contain...    47   3e-04
UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain...    47   3e-04
UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; ...    47   3e-04
UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [C...    47   3e-04
UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt...    47   3e-04
UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae...    47   3e-04
UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Re...    47   3e-04
UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, who...    47   3e-04
UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ...    47   3e-04
UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C...    47   3e-04
UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ...    47   3e-04
UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ...    46   4e-04
UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi...    46   4e-04
UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba hi...    46   4e-04
UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n...    46   4e-04
UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; ...    46   4e-04
UniRef50_Q18740 Cluster: Putative uncharacterized protein tag-32...    46   4e-04
UniRef50_A2DPZ6 Cluster: Clan CA, family C1, cathepsin B-like cy...    46   4e-04
UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, wh...    46   4e-04
UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; ...    46   4e-04
UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2...    46   4e-04
UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ...    46   6e-04
UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h...    46   6e-04
UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;...    46   6e-04
UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ...    46   6e-04
UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia...    46   6e-04
UniRef50_A2E6N1 Cluster: Clan CA, family C1, cathepsin B-like cy...    46   6e-04
UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa...    46   8e-04
UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G...    46   8e-04
UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip...    46   8e-04
UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schist...    46   8e-04
UniRef50_Q22A69 Cluster: Papain family cysteine protease contain...    46   8e-04
UniRef50_A0CGA3 Cluster: Chromosome undetermined scaffold_179, w...    46   8e-04
UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory...    46   8e-04
UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryo...    46   8e-04
UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ...    45   0.001
UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No...    45   0.001
UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph...    45   0.001
UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus ...    45   0.001
UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ...    45   0.001
UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhip...    45   0.001
UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain...    45   0.001
UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster...    44   0.002
UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain...    44   0.002
UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3; Theil...    44   0.002
UniRef50_Q22DX2 Cluster: Papain family cysteine protease contain...    44   0.002
UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R...    44   0.002
UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precu...    44   0.002
UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D...    44   0.002
UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto...    44   0.002
UniRef50_UPI00006CB77B Cluster: Papain family cysteine protease ...    44   0.002
UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t...    44   0.002
UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei...    44   0.002
UniRef50_Q5C1E8 Cluster: SJCHGC04937 protein; n=1; Schistosoma j...    44   0.002
UniRef50_A0CYS7 Cluster: Chromosome undetermined scaffold_31, wh...    44   0.002
UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, wh...    44   0.002
UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic...    44   0.002
UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|...    44   0.002
UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ...    44   0.003
UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease ...    44   0.003
UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat...    44   0.003
UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ...    44   0.003
UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa...    44   0.003
UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|...    44   0.003
UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep...    44   0.003
UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheir...    44   0.003
UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 p...    43   0.004
UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca...    43   0.004
UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome s...    43   0.004
UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis...    43   0.004
UniRef50_Q23H06 Cluster: Papain family cysteine protease contain...    43   0.004
UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4; Caenorhabdi...    43   0.004
UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3...    43   0.004
UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ...    43   0.005
UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Re...    43   0.005
UniRef50_Q9XZI2 Cluster: Cathepsin Z1 preproprotein; n=3; Bilate...    43   0.005
UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; ...    43   0.005
UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckei...    43   0.005
UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lambli...    43   0.005
UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox...    43   0.005
UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain...    43   0.005
UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C...    43   0.005
UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteina...    43   0.005
UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:...    43   0.005
UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal...    42   0.007
UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy...    42   0.007
UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ...    42   0.007
UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;...    42   0.007
UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcop...    42   0.007
UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; ...    42   0.007
UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat...    42   0.007
UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-li...    42   0.007
UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs...    42   0.007
UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; D...    42   0.007
UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ...    42   0.007
UniRef50_UPI000155637A Cluster: PREDICTED: similar to ENSANGP000...    42   0.009
UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|...    42   0.009
UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-...    42   0.009
UniRef50_Q5CM16 Cluster: P3ECSL-related; n=2; Cryptosporidium|Re...    42   0.009
UniRef50_A5KAP8 Cluster: Protease, putative; n=1; Plasmodium viv...    42   0.009
UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like cy...    42   0.009
UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ...    42   0.012
UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ...    42   0.012
UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ...    42   0.012
UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi...    42   0.012
UniRef50_Q6A1H9 Cluster: Cathepsin X/O; n=1; Suberites domuncula...    42   0.012
UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep...    42   0.012
UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain...    42   0.012
UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain...    42   0.012
UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dv...    42   0.012
UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|...    42   0.012
UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|...    41   0.016
UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi...    41   0.016
UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H...    41   0.016
UniRef50_Q01FU9 Cluster: Cathepsin Z; n=2; Ostreococcus|Rep: Cat...    41   0.022
UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve...    41   0.022
UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like cy...    41   0.022
UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-li...    41   0.022
UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, wh...    41   0.022
UniRef50_Q8IIJ9 Cluster: Probable cathepsin C precursor; n=11; P...    41   0.022
UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ...    40   0.029
UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; ...    40   0.029
UniRef50_Q8I1Y2 Cluster: Protease, putative; n=1; Plasmodium fal...    40   0.029
UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl...    40   0.029
UniRef50_UPI00006CBB5F Cluster: Papain family cysteine protease ...    40   0.038
UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-...    40   0.038
UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; ...    40   0.038
UniRef50_Q3L7L2 Cluster: Sar s 1 allergen SMIPP-C Yv6008G08; n=2...    40   0.038
UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ...    40   0.050
UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lambli...    40   0.050
UniRef50_Q3L7L6 Cluster: Sar s 1 allergen Yv5032C08; n=1; Sarcop...    40   0.050
UniRef50_A5K8Y0 Cluster: Preprocathepsin c, putative; n=1; Plasm...    40   0.050
UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc...    40   0.050
UniRef50_UPI000155D183 Cluster: PREDICTED: similar to Cathepsin ...    39   0.066
UniRef50_A6GBK8 Cluster: Peptidase C1A, papain; n=1; Plesiocysti...    39   0.066
UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa...    39   0.066
UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestiv...    39   0.066
UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lambl...    39   0.066
UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intest...    39   0.066
UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, w...    39   0.066
UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:...    39   0.088
UniRef50_Q60EQ9 Cluster: Putative uncharacterized protein OJ1280...    39   0.088
UniRef50_Q9XXQ7 Cluster: Putative uncharacterized protein; n=2; ...    39   0.088
UniRef50_Q8I8D3 Cluster: Cysteine protease 15; n=2; Entamoeba hi...    39   0.088
UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve...    39   0.088
UniRef50_Q2FLC7 Cluster: Periplasmic copper-binding precursor; n...    39   0.088
UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w...    38   0.12 
UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep: L...    38   0.15 
UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica...    38   0.15 
UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome s...    38   0.15 
UniRef50_A0LES1 Cluster: Peptidase C1A, papain precursor; n=2; S...    38   0.15 
UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil...    38   0.15 
UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty...    38   0.15 
UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ...    38   0.15 
UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaste...    38   0.15 
UniRef50_Q8I0V1 Cluster: Preprocathepsin c, putative; n=1; Plasm...    38   0.15 
UniRef50_Q75JH0 Cluster: Similar to Dictyostelium discoideum (Sl...    38   0.15 
UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcop...    38   0.15 
UniRef50_Q248G1 Cluster: Papain family cysteine protease contain...    38   0.15 
UniRef50_Q23H15 Cluster: Papain family cysteine protease contain...    38   0.15 
UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain...    38   0.15 
UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v...    38   0.15 
UniRef50_A0DIY3 Cluster: Chromosome undetermined scaffold_52, wh...    38   0.15 
UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ...    38   0.15 
UniRef50_Q0JP65 Cluster: Os01g0240900 protein; n=3; Oryza sativa...    38   0.20 
UniRef50_Q01FL5 Cluster: Cysteine protease-5; n=3; Ostreococcus|...    38   0.20 
UniRef50_Q54VR1 Cluster: Putative uncharacterized protein; n=1; ...    38   0.20 
UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n...    37   0.27 
UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole...    37   0.27 
UniRef50_Q5YER6 Cluster: Cathepsin Z; n=1; Bigelowiella natans|R...    37   0.27 
UniRef50_Q7RQM7 Cluster: Dipeptidyl-peptidase i; n=6; Plasmodium...    37   0.27 
UniRef50_Q3L7L7 Cluster: Sar s 1 allergen Yv5020C01; n=1; Sarcop...    37   0.27 
UniRef50_A0LQG9 Cluster: Peptidase C1A, papain; n=1; Syntrophoba...    37   0.35 
UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Ac...    37   0.35 
UniRef50_Q93512 Cluster: Putative uncharacterized protein; n=2; ...    37   0.35 
UniRef50_Q5CYC8 Cluster: Cathepsin like thiol protease possibly ...    37   0.35 
UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n...    37   0.35 
UniRef50_A7APS9 Cluster: Papain family cysteine protease contain...    37   0.35 
UniRef50_Q3A8I3 Cluster: Putative serine protease; n=1; Pelobact...    36   0.47 
UniRef50_Q7QQ83 Cluster: GLP_42_16392_14707; n=1; Giardia lambli...    36   0.47 
UniRef50_A5KBM2 Cluster: Serine-repeat antigen; n=3; Plasmodium|...    36   0.47 
UniRef50_A5UP12 Cluster: Adhesin-like protein; n=1; Methanobrevi...    36   0.47 
UniRef50_Q8QNJ8 Cluster: EsV-1-75; n=1; Ectocarpus siliculosus v...    36   0.62 
UniRef50_Q01E11 Cluster: Cysteine proteinase Cathepsin F; n=3; O...    36   0.62 
UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicate...    36   0.62 
UniRef50_A0EAG2 Cluster: Chromosome undetermined scaffold_86, wh...    36   0.62 
UniRef50_A0D5R4 Cluster: Chromosome undetermined scaffold_39, wh...    36   0.62 
UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Re...    36   0.62 
UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,...    36   0.82 
UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ...    36   0.82 
UniRef50_UPI00006CBAC7 Cluster: Papain family cysteine protease ...    36   0.82 
UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; R...    36   0.82 
UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina...    36   0.82 
UniRef50_Q54R55 Cluster: Putative uncharacterized protein; n=1; ...    35   1.1  
UniRef50_Q22NW9 Cluster: Papain family cysteine protease contain...    35   1.1  
UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1; ...    35   1.1  
UniRef50_A7AX75 Cluster: Preprocathepsin c, putative; n=1; Babes...    35   1.1  
UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like cy...    35   1.1  
UniRef50_UPI00006CBCFB Cluster: Papain family cysteine protease ...    35   1.4  
UniRef50_UPI00006CB653 Cluster: Papain family cysteine protease ...    35   1.4  
UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryz...    35   1.4  
UniRef50_Q5CKC2 Cluster: Preprocathepsin c; n=2; Cryptosporidium...    35   1.4  
UniRef50_Q4YCM9 Cluster: Cysteine protease, putative; n=5; Plasm...    35   1.4  
UniRef50_Q9UBR2 Cluster: Cathepsin Z precursor; n=40; Bilateria|...    35   1.4  
UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin ...    34   1.9  
UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; ...    34   1.9  
UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|...    34   1.9  
UniRef50_Q17IZ6 Cluster: Procathepsin L3, putative; n=2; Culicid...    34   1.9  
UniRef50_A0CS14 Cluster: Chromosome undetermined scaffold_26, wh...    34   1.9  
UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;...    34   2.5  
UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2; ...    34   2.5  
UniRef50_A5HC50 Cluster: Cathepsin W; n=2; Theria|Rep: Cathepsin...    34   2.5  
UniRef50_A2F4G0 Cluster: Clan CA, family C1, cathepsin L-like cy...    34   2.5  
UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryz...    33   3.3  
UniRef50_Q015J8 Cluster: Cathepsin; n=2; Ostreococcus|Rep: Cathe...    33   3.3  
UniRef50_A4S004 Cluster: Predicted protein; n=2; Ostreococcus|Re...    33   3.3  
UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n...    33   3.3  
UniRef50_Q5C5Q2 Cluster: SJCHGC05915 protein; n=1; Schistosoma j...    33   3.3  
UniRef50_Q0GBZ7 Cluster: Membrane-associated protein 29; n=4; Sc...    33   3.3  
UniRef50_A6N590 Cluster: Extracellular cysteine protease 8; n=10...    33   3.3  
UniRef50_Q7M4N9 Cluster: Dipeptidyl-peptidase I; n=1; Homo sapie...    33   3.3  
UniRef50_A5DIN6 Cluster: Putative uncharacterized protein; n=1; ...    33   3.3  
UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theil...    33   4.4  
UniRef50_O96163 Cluster: Cysteine protease, putative; n=5; Plasm...    33   4.4  
UniRef50_A7SHX2 Cluster: Predicted protein; n=1; Nematostella ve...    33   4.4  
UniRef50_A5KBM1 Cluster: Serine-repeat antigen; n=1; Plasmodium ...    33   4.4  
UniRef50_A2DKX2 Cluster: Clan CA, family C1, cathepsin L-like cy...    33   4.4  
UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease ...    33   5.8  
UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome s...    33   5.8  
UniRef50_A7M7G2 Cluster: ParC; n=1; Serratia entomophila|Rep: Pa...    33   5.8  
UniRef50_Q0E4N0 Cluster: Os02g0109400 protein; n=3; Oryza sativa...    33   5.8  
UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba hi...    33   5.8  
UniRef50_Q7R0G3 Cluster: GLP_29_33036_32140; n=1; Giardia lambli...    33   5.8  
UniRef50_Q54MB6 Cluster: Putative uncharacterized protein; n=1; ...    33   5.8  
UniRef50_A5KBM0 Cluster: Serine-repeat antigen (SERA), putative;...    33   5.8  
UniRef50_Q4WAY3 Cluster: Polyketide synthase, putative; n=1; Asp...    33   5.8  
UniRef50_Q2H7E7 Cluster: Putative uncharacterized protein; n=1; ...    33   5.8  
UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sa...    32   7.6  
UniRef50_Q2QS15 Cluster: Papain family cysteine protease contain...    32   7.6  
UniRef50_Q8I8D7 Cluster: Cysteine protease 11; n=4; Entamoeba hi...    32   7.6  
UniRef50_Q7QWN9 Cluster: GLP_26_50243_51811; n=2; Giardia lambli...    32   7.6  
UniRef50_Q26153 Cluster: V-SERA 4; n=1; Plasmodium vivax|Rep: V-...    32   7.6  
UniRef50_Q26015 Cluster: Serine rich protein homologue; n=4; Pla...    32   7.6  
UniRef50_Q22DA9 Cluster: Putative uncharacterized protein; n=1; ...    32   7.6  
UniRef50_Q1AMF3 Cluster: Cathepsin C1; n=1; Toxoplasma gondii|Re...    32   7.6  
UniRef50_Q1DTN0 Cluster: Predicted protein; n=1; Coccidioides im...    32   7.6  

>UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep:
           Parcxpwnx02 - Periplaneta americana (American cockroach)
          Length = 343

 Score =  190 bits (464), Expect = 1e-47
 Identities = 85/144 (59%), Positives = 101/144 (70%)
 Frame = +1

Query: 115 SDLPHPLSDAFINLINKKQNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAE 294
           S L  PLSD FI+ IN    TWKA RNF    P   IK LMG  +     +LP+ + + +
Sbjct: 30  SVLVDPLSDDFIDHINSLNTTWKAHRNFGNDIPLREIKKLMGVRRSLENFRLPEKSME-D 88

Query: 295 LIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSA 474
           +   +PE FDPR++WPECPTL EIRDQGSCGSCWAFGAVEAM+DRVCI+S    HFHFSA
Sbjct: 89  IDIEIPEEFDPREQWPECPTLKEIRDQGSCGSCWAFGAVEAMSDRVCIHSKGKTHFHFSA 148

Query: 475 EDLVSCCPICGLGCNGGMPTLAWE 546
           EDL++CC  CG GCNGG P  AW+
Sbjct: 149 EDLLTCCSSCGFGCNGGEPGAAWD 172


>UniRef50_A1XG92 Cluster: Putative cathepsin B-like like proteinase;
           n=1; Tenebrio molitor|Rep: Putative cathepsin B-like
           like proteinase - Tenebrio molitor (Yellow mealworm)
          Length = 301

 Score =  187 bits (456), Expect = 1e-46
 Identities = 90/156 (57%), Positives = 106/156 (67%), Gaps = 2/156 (1%)
 Frame = +1

Query: 82  VALACILAVVASDLPHPLSDAFINLINKKQNTWKAGRNFPTHTPFAHIKILMGAL-KDDN 258
           V LA +         HPLSD FIN IN KQ TWKAGRNF  +TP +H++ L+G L K  N
Sbjct: 9   VVLASVALSYGGVKLHPLSDEFINEINSKQTTWKAGRNFDVNTPISHVRRLLGVLPKKAN 68

Query: 259 ILKLPKVTHDAELIANLPENFDPRDKWPECPTL-NEIRDQGSCGSCWAFGAVEAMTDRVC 435
             KLP  TH   L A +PE+FD R+ WPEC ++  EIRDQ SCGSCWAFGAVEAM+DR+C
Sbjct: 69  APKLPVKTHAVNLDA-IPESFDAREAWPECTSIIGEIRDQASCGSCWAFGAVEAMSDRIC 127

Query: 436 IYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAW 543
           I+S+A+     SAEDL  CC  CG GCNGG P LAW
Sbjct: 128 IHSDASVKVRISAEDLNDCCYDCGDGCNGGWPDLAW 163


>UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome
           shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome 5
           SCAF15026, whole genome shotgun sequence - Tetraodon
           nigroviridis (Green puffer)
          Length = 351

 Score =  165 bits (401), Expect = 6e-40
 Identities = 82/162 (50%), Positives = 105/162 (64%)
 Frame = +1

Query: 58  MAPSCALYVALACILAVVASDLPHPLSDAFINLINKKQNTWKAGRNFPTHTPFAHIKILM 237
           M P+  L++A A   ++    L  PLS   +N INK  +TW AG NF  +  ++++K L 
Sbjct: 1   MWPAAFLFLAAAWSSSLARPHLK-PLSSEMVNYINKLNSTWTAGHNFH-NVDYSYVKKLC 58

Query: 238 GALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEA 417
           G L      KLP +   A  I  LP+ FD R++WP CPTL EIRDQGSCGSCWAFGA EA
Sbjct: 59  GTLLKGP--KLPLMIRYAGDI-KLPKEFDSREQWPNCPTLKEIRDQGSCGSCWAFGASEA 115

Query: 418 MTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAW 543
           M+DRVCI+SNA      SA+DL++CC  CG+GCNGG P+ AW
Sbjct: 116 MSDRVCIHSNAKVSVELSAQDLLTCCNSCGMGCNGGYPSSAW 157


>UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1)
           (Cathepsin B1) (APP secretase) (APPS) [Contains:
           Cathepsin B light chain; Cathepsin B heavy chain]; n=85;
           Eukaryota|Rep: Cathepsin B precursor (EC 3.4.22.1)
           (Cathepsin B1) (APP secretase) (APPS) [Contains:
           Cathepsin B light chain; Cathepsin B heavy chain] - Homo
           sapiens (Human)
          Length = 339

 Score =  160 bits (389), Expect = 2e-38
 Identities = 78/160 (48%), Positives = 105/160 (65%), Gaps = 4/160 (2%)
 Frame = +1

Query: 76  LYVALACILAVV-ASDLP--HPLSDAFINLINKKQNTWKAGRNFPTHTPFAHIKILMGAL 246
           L+ +L C+L +  A   P  HPLSD  +N +NK+  TW+AG NF  +   +++K L G  
Sbjct: 4   LWASLCCLLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLCGTF 62

Query: 247 KDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTD 426
                 K P+     E +  LP +FD R++WP+CPT+ EIRDQGSCGSCWAFGAVEA++D
Sbjct: 63  LGGP--KPPQRVMFTEDL-KLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISD 119

Query: 427 RVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAW 543
           R+CI++NA      SAEDL++CC  +CG GCNGG P  AW
Sbjct: 120 RICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAW 159


>UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin B;
           n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
           similar to cathepsin B - Strongylocentrotus purpuratus
          Length = 346

 Score =  160 bits (388), Expect = 2e-38
 Identities = 76/157 (48%), Positives = 99/157 (63%)
 Frame = +1

Query: 76  LYVALACILAVVASDLPHPLSDAFINLINKKQNTWKAGRNFPTHTPFAHIKILMGALKDD 255
           L VA    + +  +DL   +    +  +N  + TWKAG NF         + ++GALK+ 
Sbjct: 5   LIVASLLAVGMAMTDLDI-MQATVVQKVNSLKTTWKAGINFEGWQ-LDDFRRMLGALKNP 62

Query: 256 NILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVC 435
           N  +LPK+ +    I +LPENFD R+ WP CPT+ E+RDQGSCGSCWAFGAVEA++DR+C
Sbjct: 63  NG-RLPKLENQTR-IKDLPENFDARENWPNCPTIKEVRDQGSCGSCWAFGAVEAISDRIC 120

Query: 436 IYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWE 546
           I S      H SAEDL++CC  CG GCNGG P  AWE
Sbjct: 121 IKSKGQTQVHISAEDLMTCCKTCGNGCNGGFPGSAWE 157


>UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=1;
           Biomphalaria glabrata|Rep: Cathepsin B preproprotein
           precursor - Biomphalaria glabrata (Bloodfluke planorb)
          Length = 333

 Score =  153 bits (370), Expect = 3e-36
 Identities = 77/161 (47%), Positives = 98/161 (60%), Gaps = 4/161 (2%)
 Frame = +1

Query: 76  LYVALACILAVVASDLPH--PLSDAFINLINKKQNT-WKAGRNF-PTHTPFAHIKILMGA 243
           + VA+  +LAV  +   H  PLSDA I  IN   NT WKAGRNF P     A   + +  
Sbjct: 6   ILVAICGLLAVALATPFHIEPLSDAEIFYINHVANTTWKAGRNFHPAEIKRARALLGVNM 65

Query: 244 LKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMT 423
            ++    ++       +   +LP+NFDPR KWP+C +LNEIRDQ +CGSCWAFG+ EAMT
Sbjct: 66  AENKAYNRIHLKYKQVQPRNDLPDNFDPRTKWPDCASLNEIRDQANCGSCWAFGSAEAMT 125

Query: 424 DRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWE 546
           DR+CI      + H SAED+  CC  CG+GCNGG P  AWE
Sbjct: 126 DRICIAGKG--NIHISAEDINDCCKSCGMGCNGGYPAAAWE 164


>UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n=4;
           Tenebrionidae|Rep: Putative cathepsin B-like proteinase
           - Tenebrio molitor (Yellow mealworm)
          Length = 321

 Score =  144 bits (349), Expect = 1e-33
 Identities = 69/154 (44%), Positives = 100/154 (64%), Gaps = 4/154 (2%)
 Frame = +1

Query: 76  LYVALACILAVVASDLPH--PLSDAFINLINKKQNTWKAGRNFPTHTPFAHIKILMG--A 243
           ++++   ++AV+++ L     LS  FI+ IN+ Q++W AGRNFP +T   ++  L G   
Sbjct: 3   IFLSFVVLVAVLSASLAEIDVLSSEFIDSINRIQSSWVAGRNFPENTTNEYLYKLNGFIG 62

Query: 244 LKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMT 423
           L  D   K P + H      ++PE+FD R KWP C +LN IRDQG+CGSCWAF ++E+M+
Sbjct: 63  LHPDPNYKPPVLVHTFNA-RDVPESFDARTKWPNCDSLNRIRDQGACGSCWAFASIESMS 121

Query: 424 DRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGG 525
           DR+CI+S+ +  F FS EDL+SCC  CG  C GG
Sbjct: 122 DRICIHSSGSAQFMFSPEDLLSCCTSCG-DCGGG 154


>UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase
           precursor; n=28; Bilateria|Rep: Cathepsin B-like
           cysteine proteinase precursor - Schistosoma japonicum
           (Blood fluke)
          Length = 342

 Score =  143 bits (347), Expect = 2e-33
 Identities = 70/143 (48%), Positives = 90/143 (62%), Gaps = 4/143 (2%)
 Frame = +1

Query: 130 PLSDAFINLINKKQNT-WKAGRNFPTHTPFAHIKILMGALKDDNILKL---PKVTHDAEL 297
           PLSD  I+ IN+  +  WKA ++   H+     +ILMGA K+D  +K    P V H  +L
Sbjct: 29  PLSDEMISFINEHPDAGWKADKSDRFHS-LDDARILMGARKEDAEMKRNRRPTVDHH-DL 86

Query: 298 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 477
              +P  FD R KWP C ++++IRDQ  CGSCWAFGAVEAMTDR+CI S   +    SA 
Sbjct: 87  NVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSAL 146

Query: 478 DLVSCCPICGLGCNGGMPTLAWE 546
           DL+SCC  CG GC GG P +AW+
Sbjct: 147 DLISCCKDCGDGCQGGFPGVAWD 169


>UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep:
           Cathepsin B - Apriona germari
          Length = 324

 Score =  140 bits (340), Expect = 1e-32
 Identities = 66/133 (49%), Positives = 90/133 (67%), Gaps = 3/133 (2%)
 Frame = +1

Query: 136 SDAFINLINKKQNTWKAGRNFPTHTPFAHIKIL---MGALKDDNILKLPKVTHDAELIAN 306
           ++AFI  IN+K  TW A +NF   TP   +K L   +G  +D N+  LP V H+A  I+ 
Sbjct: 28  TEAFIQSINEKATTWTARKNFEGRTP-EQLKALADVIGINRDPNVT-LPVVFHEA--ISG 83

Query: 307 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 486
           +P++FD R++WP C ++  IRD+G+CGSCWAF AVE M+DR+C+ S   K F FSAE++V
Sbjct: 84  IPDSFDAREQWPFCESIRTIRDEGACGSCWAFAAVEVMSDRLCLASEGRKKFIFSAEEVV 143

Query: 487 SCCPICGLGCNGG 525
           SCC  CG GC GG
Sbjct: 144 SCCTACGGGCRGG 156


>UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           B-like proteinase precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 331

 Score =  138 bits (334), Expect = 8e-32
 Identities = 67/158 (42%), Positives = 90/158 (56%), Gaps = 1/158 (0%)
 Frame = +1

Query: 73  ALYVALACILAVVASDLPHPLSDAFINLINKKQNTWKAGRNFPTHTPFAHIKILMGALKD 252
           A  + L   + +     P+PLS+ FIN IN KQ+TW AG+NF  +     IK L+GA K 
Sbjct: 4   AFIITLLLPIVLSYKGSPNPLSNDFINYINSKQSTWVAGKNFDENLSIQEIKNLLGA-KK 62

Query: 253 DNILKLPKVTHDAELIANLPENFDPRDKWPECP-TLNEIRDQGSCGSCWAFGAVEAMTDR 429
             +    + TH  ++   +P +FD R+ W EC   ++ + DQ  CGSCWA  A  AM+DR
Sbjct: 63  GKLGVAKEFTHSEDI--QVPNSFDARENWKECSDVISTVVDQSDCGSCWAVAAASAMSDR 120

Query: 430 VCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAW 543
            CI S        SAE+L+SCC  CG GC GG PT+AW
Sbjct: 121 RCIASQGKLKVPVSAENLLSCCDSCGYGCEGGYPTMAW 158


>UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep:
           Cathepsin B - Pandalus borealis (Northern red shrimp)
          Length = 328

 Score =  136 bits (329), Expect = 3e-31
 Identities = 66/148 (44%), Positives = 83/148 (56%)
 Frame = +1

Query: 82  VALACILAVVASDLPHPLSDAFINLINKKQNTWKAGRNFPTHTPFAHIKILMGALKDDNI 261
           V L   L   AS    PLSD F+ L+  KQ TWKAGRNF        +K L    K+ +I
Sbjct: 3   VLLLLALVAAASAELDPLSDEFLELLQSKQMTWKAGRNFAKDISKDFLKSLNCVRKNPDI 62

Query: 262 LKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIY 441
            KLP    +      +P  FD R++WP CP ++EIRDQG+CGSCWA  A   MTDR CI 
Sbjct: 63  PKLP--LKNVTPTKEIPVEFDAREQWPHCPCIDEIRDQGNCGSCWAVSAASVMTDRTCID 120

Query: 442 SNATKHFHFSAEDLVSCCPICGLGCNGG 525
           +     F FS+E++ +CC  CG  C GG
Sbjct: 121 TEGLVDFRFSSENVAACCTECGNACYGG 148


>UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=1;
           Nilaparvata lugens|Rep: Cathepsin B-like protease
           precursor - Nilaparvata lugens (Brown planthopper)
          Length = 347

 Score =  135 bits (327), Expect = 6e-31
 Identities = 66/165 (40%), Positives = 96/165 (58%), Gaps = 7/165 (4%)
 Frame = +1

Query: 70  CALYVALACILAVVASD-LPHPLSDAFINLINKK-QNTWKAGRNFPTHTPFAHIKILMGA 243
           C L+  ++ I A+   +     +++ +I+ IN   ++TWKAG NF   TP ++++ L+G 
Sbjct: 6   CLLFAVVSAISALPDQENTVREIANKWIDAINNNPKSTWKAGHNFHPDTPMSYLQGLLGV 65

Query: 244 LK-DDNILKLPKVTHDAELIAN----LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGA 408
            + + N+  L K     E   N    +P+ FD R KW +C +L EIRDQG+CGSCWA   
Sbjct: 66  SELESNLADLDKYEEMEENEENKKIKVPKYFDARKKWKKCKSLREIRDQGNCGSCWAVSV 125

Query: 409 VEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAW 543
             A  DR+CI SNA  + H S+ +L+SCC  CG GC GG P  AW
Sbjct: 126 AAAFADRLCIASNAKWNGHISSRELMSCCSYCGFGCEGGFPDAAW 170


>UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Cathepsin
           B - Fasciola gigantica (Giant liver fluke)
          Length = 339

 Score =  130 bits (315), Expect = 2e-29
 Identities = 69/166 (41%), Positives = 96/166 (57%), Gaps = 10/166 (6%)
 Frame = +1

Query: 79  YVALACILAVVASDLPHP-----LSDAFINLINKKQN-TWKAGRNFPTHTPFAHIKILMG 240
           ++ +  I+AVV +   H       SD  I  +N++   +WKA R+    +   H K+ +G
Sbjct: 3   WLIVFAIIAVVQAKPNHKPQFEAFSDELIRFVNEESGASWKAARS-TRFSNVDHFKLHLG 61

Query: 241 ALKDD----NILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGA 408
           AL +     N L+ P + HD     +LPE+FD R +WP+C T++EIRDQ SCGSCWA  A
Sbjct: 62  ALSETPEERNALR-PTIKHDISK-NDLPESFDARSQWPQCWTISEIRDQASCGSCWATAA 119

Query: 409 VEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWE 546
             AM+DRVCI+SN       +A D +SCC  CG GC GG P  AW+
Sbjct: 120 ASAMSDRVCIHSNGQMRPRLAAADPLSCCTYCGQGCRGGYPPKAWD 165


>UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis
           sinensis|Rep: Cathepsin B5 - Clonorchis sinensis
          Length = 343

 Score =  130 bits (314), Expect = 2e-29
 Identities = 60/125 (48%), Positives = 77/125 (61%), Gaps = 2/125 (1%)
 Frame = +1

Query: 178 WKAGRNFPTHTPFAHIKILMGALKDDNILKL--PKVTHDAELIANLPENFDPRDKWPECP 351
           W +GR  P       +  + GA ++    K   P + HD      LP+NFD R  WP C 
Sbjct: 42  WISGR-LPKRFESGDLIHMFGAKRETREQKAQRPTLRHDGFDNMRLPKNFDARKTWPHCS 100

Query: 352 TLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMP 531
           +++EIRDQ SCGSCWAFGAVEAM+DR+CI+SN   +   SA DL+SCC  CG GC GG P
Sbjct: 101 SISEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLLSCCKDCGFGCRGGYP 160

Query: 532 TLAWE 546
            +AW+
Sbjct: 161 AVAWD 165


>UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6
           precursor; n=11; Bilateria|Rep: Cathepsin B-like
           cysteine proteinase 6 precursor - Caenorhabditis elegans
          Length = 379

 Score =  129 bits (311), Expect = 5e-29
 Identities = 60/140 (42%), Positives = 82/140 (58%), Gaps = 5/140 (3%)
 Frame = +1

Query: 139 DAFINLINKKQNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLP-----KVTHDAELIA 303
           D  I+ +N+ QN W A +     + +         L   N ++L       ++   +L  
Sbjct: 44  DDLIDYVNENQNLWTAKKQRRFSSVYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDL 103

Query: 304 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 483
           ++PE+FD RD WP+C ++  IRDQ SCGSCWAFGAVEAM+DR+CI S+       SA+DL
Sbjct: 104 DIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDL 163

Query: 484 VSCCPICGLGCNGGMPTLAW 543
           +SCC  CG GCNGG P  AW
Sbjct: 164 LSCCKSCGFGCNGGDPLAAW 183


>UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase
           precursor; n=29; Schistosomatidae|Rep: Cathepsin B-like
           cysteine proteinase precursor - Schistosoma mansoni
           (Blood fluke)
          Length = 340

 Score =  128 bits (310), Expect = 6e-29
 Identities = 64/143 (44%), Positives = 88/143 (61%), Gaps = 4/143 (2%)
 Frame = +1

Query: 130 PLSDAFINLINKKQNT-WKAGRNFPTHTPFAHIKILMGALKDDNILKL---PKVTHDAEL 297
           PLSD  I+ IN+  N  W+A ++   H+     +I MGA +++  L+    P V H+ + 
Sbjct: 28  PLSDDIISYINEHPNAGWRAEKSNRFHS-LDDARIQMGARREEPDLRRKRRPTVDHN-DW 85

Query: 298 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 477
              +P NFD R KWP C ++  IRDQ  CGSCW+FGAVEAM+DR CI S   ++   SA 
Sbjct: 86  NVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCWSFGAVEAMSDRSCIQSGGKQNVELSAV 145

Query: 478 DLVSCCPICGLGCNGGMPTLAWE 546
           DL++CC  CGLGC GG+   AW+
Sbjct: 146 DLLTCCESCGLGCEGGILGPAWD 168


>UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n=8;
           Strongylida|Rep: Cathepsin B-like cysteine protease 2 -
           Parelaphostrongylus tenuis
          Length = 344

 Score =  128 bits (309), Expect = 8e-29
 Identities = 65/150 (43%), Positives = 89/150 (59%), Gaps = 3/150 (2%)
 Frame = +1

Query: 106 VVASDLPHPLSDAFINLINKKQNTWKAGRNFPTHTPFAHIKIL-MGALKDDNIL--KLPK 276
           V+  +   P  DA ++ +N +Q  +KA        P A I+ L M  +K   I   K P+
Sbjct: 31  VITPETQVPTGDALVDYVNNQQQLFKA-------EPAAAIEELRMKIMKSKFISRSKKPR 83

Query: 277 VTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATK 456
           V    E    +P++FD R +WP CP+++ IRDQ  CGSCWAFG+ EAM+DRVCI S+  K
Sbjct: 84  VDEIGEEGFKIPDSFDARVQWPHCPSISYIRDQSQCGSCWAFGSAEAMSDRVCIASHGNK 143

Query: 457 HFHFSAEDLVSCCPICGLGCNGGMPTLAWE 546
               SA+D++SCC  CG GC+GG P  AWE
Sbjct: 144 TVELSADDILSCCYDCGDGCDGGYPISAWE 173


>UniRef50_Q23FP9 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 340

 Score =  125 bits (302), Expect = 6e-28
 Identities = 66/168 (39%), Positives = 92/168 (54%), Gaps = 6/168 (3%)
 Frame = +1

Query: 58  MAPSCALYVALACILAVVASDLPHPLSDAFINL-INKKQN-TWKAGRNFPTHTPFAHIKI 231
           M  S    + L C+ +  A+         FI   +N   N TWKA R +P        ++
Sbjct: 1   MRKSILSILILGCLFSTSANCFKFGEMSPFIVFEVNSNPNSTWKAAR-YPHFEKMTREQL 59

Query: 232 L--MGALKDDNILKLPKVTHDAELIAN-LPENFDPRDKWPECPTLNEIRDQGSCGSCWAF 402
           L  +G+L + + +KLP    D    A+ +PE FD R++WP C ++  IRDQ +CGSCWAF
Sbjct: 60  LGHLGSLDEPDWVKLPTKEFDPNANADPIPEFFDAREQWPNCQSIKLIRDQSTCGSCWAF 119

Query: 403 GAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAW 543
            A E  +DR+CI SN T     S+EDL+ CC   CG+GC GG P+ AW
Sbjct: 120 AATETFSDRICIASNQTLQTSISSEDLLECCADYCGMGCKGGYPSAAW 167


>UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1;
           Rhipicephalus appendiculatus|Rep: Midgut cysteine
           proteinase 1 - Rhipicephalus appendiculatus (Brown ear
           tick)
          Length = 332

 Score =  122 bits (295), Expect = 4e-27
 Identities = 61/155 (39%), Positives = 86/155 (55%), Gaps = 4/155 (2%)
 Frame = +1

Query: 70  CALYVALACILAVVASDLPHPLSDAFINLINKKQNTWKAGRNFPTHTPFAHIKILMGALK 249
           C L+V  A    +V S +  PLS+  IN IN    TWKAGRNF      +H   + G   
Sbjct: 7   CVLFVVAAQGRLMVPSSV-EPLSEEMINFINSINTTWKAGRNFDEKR--SHSDCVQGGDG 63

Query: 250 DDNILKLPKVTH----DAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEA 417
              +      +H    + +     PE+F PR+ W  C ++  IRDQ +CGSCWAF A E+
Sbjct: 64  ASVLTATSTSSHFTSYEEDSRWTCPESFTPREYWSHCSSIRVIRDQSACGSCWAFAAAES 123

Query: 418 MTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNG 522
           ++DR+CI++N     + SAEDL++CC  CG GC+G
Sbjct: 124 ISDRICIHTNGKVQVNISAEDLLACCHTCGHGCDG 158


>UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4
           precursor; n=5; Caenorhabditis|Rep: Cathepsin B-like
           cysteine proteinase 4 precursor - Caenorhabditis elegans
          Length = 335

 Score =  121 bits (291), Expect = 1e-26
 Identities = 65/162 (40%), Positives = 85/162 (52%), Gaps = 6/162 (3%)
 Frame = +1

Query: 79  YVALACILAVVASDLPHPL----SDAFINLINKKQNTWKAGRNFPTHTPFAHIK--ILMG 240
           Y+ LA ++AV A  L  PL     +A    +N KQ+ WKA    P       +K  ++  
Sbjct: 3   YLILAALVAVTAG-LVIPLVPKTQEAITEYVNSKQSLWKA--EIPKDITIEQVKKRLMRT 59

Query: 241 ALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAM 420
                +   +  V HD      +P  FD R +WP C ++N IRDQ  CGSCWAF A EA 
Sbjct: 60  EFVAPHTPDVEVVKHDINE-DTIPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEAA 118

Query: 421 TDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWE 546
           +DR CI SN   +   SAED++SCC  CG GC GG P  AW+
Sbjct: 119 SDRFCIASNGAVNTLLSAEDVLSCCSNCGYGCEGGYPINAWK 160


>UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA;
           n=1; Tribolium castaneum|Rep: PREDICTED: similar to
           CG10992-PA - Tribolium castaneum
          Length = 325

 Score =  120 bits (289), Expect = 2e-26
 Identities = 65/159 (40%), Positives = 86/159 (54%), Gaps = 4/159 (2%)
 Frame = +1

Query: 82  VALACILAVVAS-DLPHPLSDAFINLINKKQNTWKAGRNFPTHTPFAHIKILMG--ALKD 252
           +   C L +  S   P+  S   I  IN +Q +WKA  N         IK  +G   L  
Sbjct: 4   ITFLCALTLPLSWSKPNTSSLQVIQEINSEQISWKAETNC------LDIKSRLGFLGLHP 57

Query: 253 DNILKLPKVTHDAELIANLPENFDPRDKWPECP-TLNEIRDQGSCGSCWAFGAVEAMTDR 429
           D   K+    H    I ++PE+FD R+KWPEC   + +IR+QG+CGSCWAF + E MTDR
Sbjct: 58  DPNYKIQTKQHKISRIISIPESFDAREKWPECKDVIGKIRNQGNCGSCWAFASTEVMTDR 117

Query: 430 VCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWE 546
           +CI S     F FS E+L++CC  CG GC GG    AW+
Sbjct: 118 LCISSKGKIKFVFSPENLLTCCKDCGCGCKGGYIKNAWD 156


>UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep:
           Cathepsin b - Aedes aegypti (Yellowfever mosquito)
          Length = 332

 Score =  119 bits (287), Expect = 4e-26
 Identities = 50/132 (37%), Positives = 73/132 (55%)
 Frame = +1

Query: 130 PLSDAFINLINKKQNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANL 309
           P +D F+  + +   TW     F     F + + + G  +     +LP   HD     ++
Sbjct: 26  PFNDGFLAQVQRHAKTWTPDATFRDGIRFENFQNMKGIFESKIGFRLPTKRHDVAYNMDI 85

Query: 310 PENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVS 489
           PE FD R+KWP C +++ I++QG CG+CWA  AV  M+DR+CI+S        +AEDL+ 
Sbjct: 86  PEFFDAREKWPYCKSISTIKNQGLCGACWAVAAVSVMSDRLCIHSEGKFDVELAAEDLMG 145

Query: 490 CCPICGLGCNGG 525
           CC  CG GCNGG
Sbjct: 146 CCKDCGNGCNGG 157


>UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2;
           Arthropoda|Rep: Cathepsin B-like cysteine protease -
           Callosobruchus maculatus (Southern cowpea weevil) (Pulse
           bruchid)
          Length = 330

 Score =  118 bits (284), Expect = 9e-26
 Identities = 58/161 (36%), Positives = 86/161 (53%), Gaps = 7/161 (4%)
 Frame = +1

Query: 82  VALACILAVVASDLPHP----LSDAFINLINKKQNTWKAGRNFPTHTPFAHIKILMGALK 249
           +A   + AVV+     P    LSD +I  +N K   WKAGRNF   T   +I+ L+    
Sbjct: 3   LAFIALAAVVSCTFAQPELDFLSDEYIEQLNSKNLPWKAGRNFERDTSLYNIQRLLSVGT 62

Query: 250 DDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDR 429
            +   +   + H+ +   +LPE FD R +W +C ++ EIRDQ  CGSCWA  +   M+DR
Sbjct: 63  INPPSEFETIFHEDDG-KDLPEEFDARKQWSKCESIKEIRDQSGCGSCWAVSSASVMSDR 121

Query: 430 VCIYSNATKHFHFSAEDLVSCCPICGL---GCNGGMPTLAW 543
           +CI S+       SA D++ CC  C     GC+GG+P+  +
Sbjct: 122 ICIQSDQKNQLRISAADMIECCESCTFSVDGCHGGIPSFTF 162


>UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8;
           Trypanosoma|Rep: Cathepsin B-like cysteine protease -
           Trypanosoma brucei
          Length = 340

 Score =  118 bits (283), Expect = 1e-25
 Identities = 66/163 (40%), Positives = 88/163 (53%), Gaps = 5/163 (3%)
 Frame = +1

Query: 70  CALYVALACI-LAVVASDLPHPLSDAFINLINK-KQNTWKAGRN-FPTHTPFAHIKILMG 240
           C    A+  +  A+VA D P  LS AF++ +N+  +  WKA  +    +      K L G
Sbjct: 11  CIASTAVVAVNAALVAEDAP-VLSKAFVDRVNRLNRGIWKAKYDGVMQNITLREAKRLNG 69

Query: 241 ALKDDNILK-LPKVTH-DAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVE 414
            +K +N    LPK    + E  A LP +FD  + WP CPT+ +I DQ +CGSCWA  A  
Sbjct: 70  VIKKNNNASILPKRRFTEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAAS 129

Query: 415 AMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAW 543
           AM+DR C      +  H SA DL++CC  CG GCNGG P  AW
Sbjct: 130 AMSDRFCT-MGGVQDVHISAGDLLACCSDCGDGCNGGDPDRAW 171


>UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep:
           Cathepsin b - Aedes aegypti (Yellowfever mosquito)
          Length = 386

 Score =  118 bits (283), Expect = 1e-25
 Identities = 60/125 (48%), Positives = 74/125 (59%), Gaps = 1/125 (0%)
 Frame = +1

Query: 175 TWKAGRNFPTHTPFAHIKILMGALKDDNILKLPK-VTHDAELIANLPENFDPRDKWPECP 351
           TW+AG N P         + M  L+     KLP  +  D E + +LP+ FD R+KWPECP
Sbjct: 85  TWRAGSN-PKPPAGYRSGVNMADLERT---KLPLGIMADVEDL-DLPDTFDAREKWPECP 139

Query: 352 TLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMP 531
           +L EIRDQG CGSCWA  A  AMTDR C+ S   + F F + DL+SCC  CG GC GG  
Sbjct: 140 SLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLLSCCHSCGQGCRGGTL 199

Query: 532 TLAWE 546
             AW+
Sbjct: 200 GPAWQ 204


>UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8;
           Leishmania|Rep: Cathepsin B-like protease - Leishmania
           major
          Length = 340

 Score =  118 bits (283), Expect = 1e-25
 Identities = 61/147 (41%), Positives = 81/147 (55%), Gaps = 4/147 (2%)
 Frame = +1

Query: 115 SDLPHPLSDAFINLINKK-QNTWKAGRN---FPTHTPFAHIKILMGALKDDNILKLPKVT 282
           SD P  L  +F+  +N K +  W A  N     T      ++ LMG          P+  
Sbjct: 31  SDFPL-LGKSFVAEVNSKAKGQWTASANNGYLVTGKSLGEVRKLMGVTDMSTEAVPPRNF 89

Query: 283 HDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHF 462
              EL  +LPE FD  + WP C T++EIRDQ +CGSCWA  AVEA++DR C +       
Sbjct: 90  SVEELQQDLPEFFDAAEHWPMCLTISEIRDQSNCGSCWAIAAVEAISDRYCTFGGVPDR- 148

Query: 463 HFSAEDLVSCCPICGLGCNGGMPTLAW 543
             S  +L+SCC ICGLGC+GG+PT+AW
Sbjct: 149 RMSTSNLLSCCFICGLGCHGGIPTVAW 175


>UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8.4;
           n=1; Caenorhabditis elegans|Rep: Putative
           uncharacterized protein W07B8.4 - Caenorhabditis elegans
          Length = 335

 Score =  115 bits (276), Expect = 8e-25
 Identities = 68/159 (42%), Positives = 86/159 (54%), Gaps = 3/159 (1%)
 Frame = +1

Query: 76  LYVALACILAVVASDLPHPLSDAFINLINKKQNTWKAGRNFPTHTPFAHIKILMGALKDD 255
           L  +L  ILA  A  LP   +  FIN IN  Q  W A       TPF  +K LM    + 
Sbjct: 4   LLPSLLFILAASAVVLPR--NKLFINHINSAQKLWTAEHYT---TPF-EVKNLMKV--EH 55

Query: 256 NILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVC 435
               L K    AE   ++P+++D RD WP+C ++N IRDQ  CGSCWA  A EA++DR C
Sbjct: 56  VAAHLDKDIKLAETADSIPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTC 115

Query: 436 IYSNATKHFHFSAEDLVSCCP---ICGLGCNGGMPTLAW 543
           I SN   +   SAED+++CC     CG GC GG P  AW
Sbjct: 116 IASNGDVNTLLSAEDILTCCTGKFNCGDGCEGGYPIQAW 154


>UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7;
           Rhabditida|Rep: Cysteine proteinase 3 - Necator
           americanus (Human hookworm)
          Length = 360

 Score =  114 bits (274), Expect = 1e-24
 Identities = 51/136 (37%), Positives = 77/136 (56%)
 Frame = +1

Query: 139 DAFINLINKKQNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPEN 318
           +AF   +NK+Q+ + A +  P       ++++     D+   ++ K   D +    +P +
Sbjct: 36  EAFAEFLNKRQSFFTA-KYTPNALNILKMRVMESRFLDNEEGEMLK-EEDMDFSEEIPVS 93

Query: 319 FDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP 498
           FD RDKWP+C ++  IRDQ  CGSCWA  + E M+DR+C+ SN T     S  D+++CCP
Sbjct: 94  FDARDKWPKCTSIGFIRDQSHCGSCWAVSSAETMSDRLCVQSNGTIKVLLSDTDILACCP 153

Query: 499 ICGLGCNGGMPTLAWE 546
            CG GC GG    AWE
Sbjct: 154 NCGAGCGGGHTIRAWE 169


>UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15;
           Aphidoidea|Rep: Cathepsin B-S precursor - Tuberaphis
           styraci
          Length = 349

 Score =  113 bits (272), Expect = 3e-24
 Identities = 59/163 (36%), Positives = 81/163 (49%), Gaps = 5/163 (3%)
 Frame = +1

Query: 73  ALYVALACILAV---VASDLPHPLSDAFINLINKKQNTWKAGRNFPTHTPFAHIKILMGA 243
           A +V + C + V   +A      LSD  I  IN+   TWKA R FP +T   +   L+G+
Sbjct: 2   AKFVTIVCAIFVSVYLAEPTLQFLSDERIKYINEVAKTWKAERYFPANTSEEYFIGLLGS 61

Query: 244 LKDDNILKLPKVTHDAELIA--NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEA 417
               N     ++     L    N P+ FD R+ W  C  +  IRDQG+CGSCW+F    A
Sbjct: 62  RGYKNYTNEVEIKKYDPLYVENNSPKQFDSRENWKSCKQIGHIRDQGNCGSCWSFSTTGA 121

Query: 418 MTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWE 546
             DR+C+ +    +   S E+L  CC  CG GC GG P  AW+
Sbjct: 122 FADRLCVSTGGKFNQLLSPEELAFCCMDCGKGCGGGYPIKAWK 164


>UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core
           eudicotyledons|Rep: At1g02300/T6A9_10 - Arabidopsis
           thaliana (Mouse-ear cress)
          Length = 362

 Score =  111 bits (268), Expect = 8e-24
 Identities = 59/142 (41%), Positives = 78/142 (54%), Gaps = 5/142 (3%)
 Frame = +1

Query: 133 LSDAFINLINKKQNT-WKAGRNFP-THTPFAHIKILMGA--LKDDNILKLPKVTHDAELI 300
           L +  +  +N+  N  WKA  N    +   A  K L+G         L +P V+HD  L 
Sbjct: 46  LQNEIVKEVNENPNAGWKASFNDRFANATVAEFKRLLGVKPTPKTEFLGVPIVSHDISL- 104

Query: 301 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 480
             LP+ FD R  W +C ++  I DQG CGSCWAFGAVE+++DR CI  N   +   S  D
Sbjct: 105 -KLPKEFDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYN--MNVSLSVND 161

Query: 481 LVSCCP-ICGLGCNGGMPTLAW 543
           L++CC  +CG GCNGG P  AW
Sbjct: 162 LLACCGFLCGQGCNGGYPIAAW 183


>UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep:
           Cathepsin B - Uronema marinum
          Length = 350

 Score =  111 bits (268), Expect = 8e-24
 Identities = 57/131 (43%), Positives = 79/131 (60%), Gaps = 7/131 (5%)
 Frame = +1

Query: 172 NTWKAGRNFPTH-TPFAHIKILMGALKDDNILKLPKVTHDA-ELIANL--PENFDPRDKW 339
           +TWKAG N       F  I+ +MG +    +  +P   +   E I NL  PE+FD R+ +
Sbjct: 38  STWKAGYNKRFEGMSFDQIQAMMGTIATP-VHMIPDERYTPFETIQNLSLPESFDLREAY 96

Query: 340 PECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP---ICGL 510
           P+C +L ++RDQ +CGSCWAFG VEA++DR+CI S        S+E+L+SCC     CG+
Sbjct: 97  PKCESLQQVRDQSNCGSCWAFGTVEAISDRICIASGQKDQTRISSENLLSCCRGTFACGM 156

Query: 511 GCNGGMPTLAW 543
           GCNGG    AW
Sbjct: 157 GCNGGYTAGAW 167


>UniRef50_Q237A1 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 346

 Score =  109 bits (261), Expect = 5e-23
 Identities = 57/143 (39%), Positives = 82/143 (57%), Gaps = 3/143 (2%)
 Frame = +1

Query: 127 HPLSDAFINLINKKQNTWKAGRNFP-THTPFAHIKILMGA-LKDDNILKLPKVTHDAELI 300
           H      I  +N   +TWKAG N    ++  A +K  MG  L  ++ +KL  V+  A   
Sbjct: 34  HDKLKQIIQKVNSSNSTWKAGENTKWINSDIAGVKAHMGVKLGQESGIKLETVSAQAN-- 91

Query: 301 ANLPENFDPRDKWPE-CPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 477
             LPE FD R +W + C +L E+RDQ +CGSCWAFGA E+++DR CI+    +    S +
Sbjct: 92  -GLPEEFDARVQWGDKCSSLWEVRDQSTCGSCWAFGAAESLSDRHCIHLG--QDIRLSTQ 148

Query: 478 DLVSCCPICGLGCNGGMPTLAWE 546
           +L++CC  CG GC+GG P  A +
Sbjct: 149 NLLTCCAACGDGCDGGWPEAAMD 171


>UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3
           precursor; n=4; Caenorhabditis|Rep: Cathepsin B-like
           cysteine proteinase 3 precursor - Caenorhabditis elegans
          Length = 370

 Score =  107 bits (256), Expect = 2e-22
 Identities = 53/132 (40%), Positives = 75/132 (56%), Gaps = 6/132 (4%)
 Frame = +1

Query: 148 INLINKKQNTWKAGRN----FPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIAN-LP 312
           ++ +N  Q +W A  N    F        +K      KD ++    ++    E++   LP
Sbjct: 36  VDHVNTVQTSWVAEHNEISEFEMKFKVMDVKFAEPLEKDSDVAS--ELFVRGEIVPEPLP 93

Query: 313 ENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSC 492
           + FD R+KWP+C T+  IR+Q +CGSCWAFGA E ++DRVCI SN T+    S ED++SC
Sbjct: 94  DTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDILSC 153

Query: 493 C-PICGLGCNGG 525
           C   CG GC GG
Sbjct: 154 CGTTCGYGCKGG 165


>UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gambiae
           str. PEST|Rep: ENSANGP00000012227 - Anopheles gambiae
           str. PEST
          Length = 218

 Score =  106 bits (254), Expect = 4e-22
 Identities = 42/73 (57%), Positives = 53/73 (72%)
 Frame = +1

Query: 307 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 486
           +PE+FD R+ WP C +L  IR+QG+CGSCWA  A   M+DRVCI+SN T +   +AEDL+
Sbjct: 1   IPESFDARNHWPNCESLRAIRNQGTCGSCWAVAAASVMSDRVCIHSNGTINVALAAEDLM 60

Query: 487 SCCPICGLGCNGG 525
            CC  CG GCNGG
Sbjct: 61  GCCVDCGNGCNGG 73


>UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin
           B-like cysteine proteinase 4 precursor (Cysteine
           protease-related 4); n=2; Tribolium castaneum|Rep:
           PREDICTED: similar to Cathepsin B-like cysteine
           proteinase 4 precursor (Cysteine protease-related 4) -
           Tribolium castaneum
          Length = 360

 Score =  105 bits (253), Expect = 5e-22
 Identities = 58/141 (41%), Positives = 74/141 (52%), Gaps = 7/141 (4%)
 Frame = +1

Query: 142 AFINLINKKQNTWKAGRNFPTHTPFAHIKILMGAL---KDDNI---LKLPKVTHDAELIA 303
           + IN IN +Q+ W AG N     PF  I+  +G L    D N    +K P+ T +     
Sbjct: 21  SLINQINSQQSAWTAGIN-----PFDDIESRLGFLGIHPDPNFKPEIKEPQATQNV---- 71

Query: 304 NLPENFDPRDKWPECPTL-NEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 480
            +PE FD R+ WPEC  +   IR+QG C S WAF A E M+DR+CI +N       S ED
Sbjct: 72  -IPETFDAREYWPECADIIGNIRNQGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPED 130

Query: 481 LVSCCPICGLGCNGGMPTLAW 543
           L+ CC  CG  C GG    AW
Sbjct: 131 LIDCCHYCGNQCKGGYTYYAW 151


>UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator
           americanus|Rep: Cysteine proteinase 4 - Necator
           americanus (Human hookworm)
          Length = 339

 Score =  105 bits (253), Expect = 5e-22
 Identities = 61/170 (35%), Positives = 87/170 (51%), Gaps = 8/170 (4%)
 Frame = +1

Query: 58  MAPSCALYVALACILAVVASDL------PHPLS-DAFINLINKKQNTWKAGRNFPTHTPF 216
           M  + AL V L  I  + A +L       H LS  A ++ +N  Q+ +K   + PT+  F
Sbjct: 1   MKANFALVVVLLAINQLYADELLHKQESEHGLSGQALVDYVNSHQSLFKTEYS-PTNEQF 59

Query: 217 AHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCW 396
              +I+      +   K P+      L   LPE FD R+KWP C ++  IRD  +CGSCW
Sbjct: 60  VKARIMDIKYMTEASHKYPR--KGINLNVELPERFDAREKWPHCASIGLIRDHSACGSCW 117

Query: 397 AFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAW 543
           A  A   M+DR+CI +N T     S+ D+++CC   CG GC GG P  A+
Sbjct: 118 AVSAASVMSDRLCIQTNGTNQKILSSADILACCGEDCGSGCEGGYPIQAY 167


>UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4;
           Ancylostomatidae|Rep: Cysteine proteinase - Ancylostoma
           ceylanicum
          Length = 348

 Score =  104 bits (249), Expect = 2e-21
 Identities = 52/136 (38%), Positives = 77/136 (56%), Gaps = 2/136 (1%)
 Frame = +1

Query: 142 AFINLINKKQNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLPK-VTHDAELIANLPEN 318
           AF++ IN++Q+ ++A  + P    F   +I+      D     P  V  + E+  ++P+ 
Sbjct: 39  AFVDYINQQQSFFRAEYS-PDAEEFVRNRIMDVKFAVDPEKTEPNYVLANTEMKVDIPDT 97

Query: 319 FDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC- 495
           FD RD+WP C ++  IRDQ SCGSCWA  A  AM+DRVC  +N   +   S  +++SCC 
Sbjct: 98  FDARDRWPNCTSMKHIRDQSSCGSCWAVAAASAMSDRVCALTNGRINRILSDTEVLSCCF 157

Query: 496 PICGLGCNGGMPTLAW 543
             CG GC GG P  A+
Sbjct: 158 GSCGFGCKGGYPARAF 173


>UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7;
           n=2; Haemonchidae|Rep: Cathepsin B-like cysteine
           protease GCP7 - Haemonchus contortus (Barber pole worm)
          Length = 348

 Score =  104 bits (249), Expect = 2e-21
 Identities = 43/99 (43%), Positives = 64/99 (64%), Gaps = 1/99 (1%)
 Frame = +1

Query: 253 DNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRV 432
           +N+L +  +T + ++    PE+FD R+KW +CP+L  I DQ +CGSCWA  A + M+DR+
Sbjct: 82  ENVLPIANITSNDDI----PESFDSREKWKDCPSLRVIPDQSNCGSCWAVSAAQCMSDRL 137

Query: 433 CIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAWE 546
           CI+S   K    SA D+++CC   CG GC+GG    AW+
Sbjct: 138 CIHSQGRKKVLLSATDILACCGKFCGYGCDGGYNARAWK 176


>UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep:
           Cathepsin B - Triticum aestivum (Wheat)
          Length = 353

 Score =  103 bits (246), Expect = 4e-21
 Identities = 57/136 (41%), Positives = 72/136 (52%), Gaps = 4/136 (2%)
 Frame = +1

Query: 148 INLINKKQNT-WKAGRN--FPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPEN 318
           I  +NK  N  W AG N  F  +T     K ++G       L L  V        +LP+ 
Sbjct: 43  IQTVNKHPNAGWTAGHNPYFANYT-IEQFKHILGVKPTPPGL-LAGVPIKIHPEMDLPKE 100

Query: 319 FDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP 498
           FD R +W  C T+  I DQG CG+CWAF AVEA+ DR CI+ N +     S  DL++CC 
Sbjct: 101 FDARTQWSSCSTIGNILDQGHCGACWAFAAVEALQDRFCIHLNMS--VSLSVNDLLACCG 158

Query: 499 -ICGLGCNGGMPTLAW 543
            +CG GCNGG P  AW
Sbjct: 159 FLCGSGCNGGYPISAW 174


>UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1
           precursor; n=3; Haemonchidae|Rep: Cathepsin B-like
           cysteine proteinase 1 precursor - Ostertagia ostertagi
          Length = 341

 Score =  101 bits (243), Expect = 8e-21
 Identities = 51/118 (43%), Positives = 68/118 (57%), Gaps = 4/118 (3%)
 Frame = +1

Query: 202 THTPFAHIKILMGALKDDNILKLP-KVTHDAELIAN---LPENFDPRDKWPECPTLNEIR 369
           T TP  + K  +  LK  +   +P +   D EL  N   +PE++DPR +W  C +L  I 
Sbjct: 52  TATPVPYFKQRLMDLKYIDQNNIPDEEVEDEELEENNDDIPESYDPRIQWANCSSLFHIP 111

Query: 370 DQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAW 543
           DQ +CGSCWA  +  AM+DR+CI S   K    SA+D+VSCC  CG GC GG P  A+
Sbjct: 112 DQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVVSCCTWCGDGCEGGWPISAF 169


>UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 314

 Score =  101 bits (241), Expect = 1e-20
 Identities = 64/164 (39%), Positives = 90/164 (54%), Gaps = 5/164 (3%)
 Frame = +1

Query: 70  CALYVALACILAVVASDLPHP-LSDAFINLINK-KQNTWKAGRN--FPTHTPFAHIKILM 237
           C ++V+       + S L  P L D  IN IN  K+++W A RN  F   T F  I  +M
Sbjct: 8   CLIFVSFYFASVCLGSFLDKPVLDDNLINSINNNKKSSWTAHRNKNFEGKT-FGDIIGMM 66

Query: 238 GALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEA 417
           G  K     KL +  +  EL  ++P +FD R +WP+C  ++ I +Q  CGSCWAF + E 
Sbjct: 67  GTKKTAAPFKLTE--NGEELKGSIPTSFDSRVQWPDC--IHPILNQEQCGSCWAFSSSEV 122

Query: 418 MTDRVCIYS-NATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWE 546
           ++DR+CI S N T     S + LV+C      GC+GG+P LAWE
Sbjct: 123 LSDRLCIASNNKTNPGALSPQTLVACDVYGNDGCSGGIPQLAWE 166


>UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2
           precursor; n=8; Haemonchus contortus|Rep: Cathepsin
           B-like cysteine proteinase 2 precursor - Haemonchus
           contortus (Barber pole worm)
          Length = 342

 Score =  101 bits (241), Expect = 1e-20
 Identities = 49/114 (42%), Positives = 66/114 (57%), Gaps = 1/114 (0%)
 Frame = +1

Query: 208 TPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCG 387
           TP    KI+    K   +  + K   D E+  ++P ++DPRD W  C T   IRDQ +CG
Sbjct: 56  TPDFEQKIMSIKYKHQKLNLMVKEDPDPEV--DIPPSYDPRDVWKNCTTFY-IRDQANCG 112

Query: 388 SCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAWE 546
           SCWA     A++DR+CI S A K  + SA D+++CC P CG GC GG P  AW+
Sbjct: 113 SCWAVSTAAAISDRICIASKAEKQVNISATDIMTCCRPQCGDGCEGGWPIEAWK 166


>UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115,
           whole genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_115,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 332

 Score =  100 bits (239), Expect = 3e-20
 Identities = 46/90 (51%), Positives = 58/90 (64%), Gaps = 5/90 (5%)
 Frame = +1

Query: 292 ELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFS 471
           E + NLP +F  ++KWP CP++  I DQG+CGSCWA  A   M+DR+CI S  T     S
Sbjct: 66  EKLENLPPSFSAQEKWPGCPSIELIPDQGNCGSCWAVSAASTMSDRLCIASGQTDKRQIS 125

Query: 472 AEDLVSCCPI-CGL----GCNGGMPTLAWE 546
           AEDL+SCC I C L    GC+GG P  AW+
Sbjct: 126 AEDLLSCCGINCELDGNGGCDGGYPYGAWK 155


>UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC02853 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 181

 Score = 99.5 bits (237), Expect = 4e-20
 Identities = 52/109 (47%), Positives = 66/109 (60%), Gaps = 4/109 (3%)
 Frame = +1

Query: 130 PLSDAFINLINKKQNT-WKAGRNFPTHTPFAHIKILMGALK---DDNILKLPKVTHDAEL 297
           PLSD  I  INK+ N  WKA R     T   H K +MG L    D + L  P + H+ ++
Sbjct: 21  PLSDELITFINKQPNIEWKADRT-KRFTSIHHAKSMMGVLLNSVDQHKLHHPIIHHN-DI 78

Query: 298 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYS 444
              LP+ FD R  W  C ++  IRDQ SCGSCWAFGAVE+M+DR+CI+S
Sbjct: 79  NIKLPKYFDSRKYWKNCSSIRTIRDQSSCGSCWAFGAVESMSDRICIHS 127


>UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8.1;
           n=1; Caenorhabditis elegans|Rep: Putative
           uncharacterized protein W07B8.1 - Caenorhabditis elegans
          Length = 335

 Score = 99.1 bits (236), Expect = 6e-20
 Identities = 55/158 (34%), Positives = 86/158 (54%), Gaps = 5/158 (3%)
 Frame = +1

Query: 88  LACILAVV--ASDLPHPLSDAFINLINKKQNTWKAGRNFPTHTPFAHIKILMGALKDDNI 261
           L C++ V+  A  +P    D  I+ +N ++ TW AG   P  +  + +K L+        
Sbjct: 5   LICLIGVLFQADGVPPSEIDRIIHYVNSQKTTWTAG--IPALSRNSMLKTLVTDAATIGF 62

Query: 262 LKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIY 441
            K+      ++  ++L  +FD R++WPEC ++ +I D   C + WAF A E+M+DR+CI 
Sbjct: 63  -KIQNFGV-SQANSDLSPSFDARERWPECMSIPQINDISECKTSWAFAAAESMSDRLCIN 120

Query: 442 SNATKHFHFSAEDLVSCCP---ICGLGCNGGMPTLAWE 546
           S   K+   SAE+L+SCC     CG GC GG P  AW+
Sbjct: 121 SGGFKNTILSAEELLSCCTGMFSCGEGCEGGNPFKAWQ 158


>UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep:
           Thiol protease - Trichuris suis
          Length = 348

 Score = 97.9 bits (233), Expect = 1e-19
 Identities = 43/87 (49%), Positives = 54/87 (62%), Gaps = 1/87 (1%)
 Frame = +1

Query: 286 DAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH 465
           D  L  ++P +FD R  W  C +LN IRDQ  CGSCWA  A E M+DR+C+ SN +    
Sbjct: 77  DRSLALSIPPSFDVRSLWHVC-SLNLIRDQAKCGSCWAVSAAETMSDRICVQSNCSIKAC 135

Query: 466 FSAEDLVSCCPI-CGLGCNGGMPTLAW 543
            S  D++SCC + CG GCNGG P  AW
Sbjct: 136 ISDTDILSCCGLYCGYGCNGGFPIEAW 162


>UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus
           contortus|Rep: Cysteine proteinase - Haemonchus
           contortus (Barber pole worm)
          Length = 350

 Score = 93.9 bits (223), Expect = 2e-18
 Identities = 56/154 (36%), Positives = 76/154 (49%), Gaps = 4/154 (2%)
 Frame = +1

Query: 97  ILAVVASDLPHPLS-DAFINLINKKQNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLP 273
           +LA   SD    L+ +A +  +NK Q+  +   +       AH   LM      N  KL 
Sbjct: 25  LLAQQTSDDSDTLTGEALVEYVNKHQSFSRLNTS-KAEERMAH---LMKTDYIRNARKLY 80

Query: 274 KVTHDAELIAN--LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSN 447
           KV    E   N  +PE+FD R  W  C ++  +RDQ  CGSCWA  A   M+DR+C+ + 
Sbjct: 81  KVKKAEEQTTNEDIPESFDSRIVWKNCSSITYVRDQSRCGSCWAVSAASTMSDRICVQTK 140

Query: 448 ATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAWE 546
                  S  D++SCC  +CG GC GG   LAWE
Sbjct: 141 GKLQTILSDTDILSCCGRMCGDGCEGGYDHLAWE 174


>UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 421

 Score = 93.5 bits (222), Expect = 3e-18
 Identities = 37/80 (46%), Positives = 52/80 (65%)
 Frame = +1

Query: 301 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 480
           +++P+NFD R KWP CP+++ + +QG CGSC+A  A    +DR CI+SN T     S ED
Sbjct: 136 SDVPKNFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSNGTFKSLLSEED 195

Query: 481 LVSCCPICGLGCNGGMPTLA 540
           ++ CC +CG  C GG P  A
Sbjct: 196 IIGCCSVCG-NCYGGDPLKA 214


>UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|Rep:
           Cathepsin B - Streblomastix strix
          Length = 312

 Score = 89.0 bits (211), Expect = 6e-17
 Identities = 38/82 (46%), Positives = 48/82 (58%)
 Frame = +1

Query: 298 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 477
           +ANLP+ FD R  WP C  + +I DQG CGSCWA  + E + DR CI S   +    S +
Sbjct: 73  VANLPDEFDSRTNWPNCQLIGKIYDQGHCGSCWAMSSFEVLQDRFCIKSEGKQTPELSPQ 132

Query: 478 DLVSCCPICGLGCNGGMPTLAW 543
            L SC P C  GCNGG  + A+
Sbjct: 133 HLTSCTPGCS-GCNGGWMSTAF 153


>UniRef50_P91991 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 356

 Score = 89.0 bits (211), Expect = 6e-17
 Identities = 51/132 (38%), Positives = 70/132 (53%), Gaps = 7/132 (5%)
 Frame = +1

Query: 157 INKKQNTWKAGRNFPT-HTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRD 333
           +NKKQ  WKA  +  T     A  K +     +D + +  K  +D  L+ ++P +FD R 
Sbjct: 44  VNKKQKLWKAETSRMTFQEKMARAKSIKFIKSNDEVSE--KTGNDNVLV-DIPSSFDSRQ 100

Query: 334 KWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC----PI 501
           KWP C  +  +RDQ  CGS     AVE  +DR CI SN T ++  SA+D +SCC     I
Sbjct: 101 KWPSCSQIGAVRDQSDCGSAAHLVAVEIASDRTCIASNGTFNWPLSAQDPLSCCVGLMSI 160

Query: 502 C--GLGCNGGMP 531
           C  G GC+G  P
Sbjct: 161 CGDGWGCDGSWP 172


>UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella
           histriomuscorum|Rep: Cathepsin B - Oxytricha trifallax
           (Sterkiella histriomuscorum)
          Length = 294

 Score = 87.0 bits (206), Expect = 3e-16
 Identities = 55/155 (35%), Positives = 74/155 (47%)
 Frame = +1

Query: 82  VALACILAVVASDLPHPLSDAFINLINKKQNTWKAGRNFPTHTPFAHIKILMGALKDDNI 261
           V +  I+AV  +   HP+++  +  I  K + W+      T  PF ++       K    
Sbjct: 5   VIIGTIVAVAVAT--HPINEEMVAHIKAKTSLWQPHET--TTNPFNNMTKEQLLAKCGTY 60

Query: 262 LKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIY 441
           +      +    I  +PENFD R +W     ++ IRDQ  CGSCWAFGA EA +DR  I 
Sbjct: 61  IVPANKEYPGSKIMTVPENFDARQQWGS--KIHAIRDQQQCGSCWAFGATEAFSDRFAIN 118

Query: 442 SNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWE 546
               K    S EDLVS C     GCNGG   +AWE
Sbjct: 119 G---KDVILSPEDLVS-CDTNDYGCNGGYMDVAWE 149


>UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep:
           Cysteine proteinase - Toxoplasma gondii
          Length = 569

 Score = 85.0 bits (201), Expect = 1e-15
 Identities = 37/83 (44%), Positives = 47/83 (56%), Gaps = 4/83 (4%)
 Frame = +1

Query: 307 LPENFDPRDKWPECP-TLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 483
           +P +FD R  +P C   +  +RDQG CGSCWAF + EA  DR+CI S   +    SA+  
Sbjct: 274 VPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKRLMPLSAQHT 333

Query: 484 VSCC---PICGLGCNGGMPTLAW 543
            SCC        GCNGG P +AW
Sbjct: 334 TSCCNAIHCASFGCNGGQPGMAW 356


>UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 311

 Score = 85.0 bits (201), Expect = 1e-15
 Identities = 39/86 (45%), Positives = 53/86 (61%)
 Frame = +1

Query: 286 DAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH 465
           +  +  N+PENFD R +WP   +++ IR+QG CGSCWAFGA E ++DR  I S    +  
Sbjct: 76  EVRVAENIPENFDARKQWPG--SIHPIRNQGQCGSCWAFGASEVLSDRFAIASKNQIYVT 133

Query: 466 FSAEDLVSCCPICGLGCNGGMPTLAW 543
            SA+ LV  C +   GC+GG P  AW
Sbjct: 134 LSAQQLVD-CDLDNSGCSGGWPINAW 158


>UniRef50_Q00W63 Cluster: Cysteine proteinase; n=2;
           Ostreococcus|Rep: Cysteine proteinase - Ostreococcus
           tauri
          Length = 362

 Score = 79.8 bits (188), Expect = 4e-14
 Identities = 43/94 (45%), Positives = 55/94 (58%), Gaps = 14/94 (14%)
 Frame = +1

Query: 307 LPENFDPRDKWPECPTL-NEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 483
           LP+ FD R+KWP+C  L +E  DQG+CGSCWA    +AMTDR+CI +N   + H SA  L
Sbjct: 88  LPDTFDVREKWPKCAALVSEAVDQGACGSCWAVAPAKAMTDRLCIATNGAVNTHVSAIQL 147

Query: 484 VSCCP-----------ICGL--GCNGGMPTLAWE 546
           +SC             + G   GC GG PT A+E
Sbjct: 148 LSCNSHSNSAYTYDENLAGGSGGCMGGYPTEAYE 181


>UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1;
           Ostreococcus tauri|Rep: Cysteine proteinase Cathepsin F
           - Ostreococcus tauri
          Length = 498

 Score = 78.6 bits (185), Expect = 9e-14
 Identities = 46/110 (41%), Positives = 60/110 (54%), Gaps = 2/110 (1%)
 Frame = +1

Query: 202 THTPFAHIKILMGALK-DDNILKLPKVTHDAELIANLPENFDPRDKWPECPTL-NEIRDQ 375
           T +P+A      GA   D   + L +V  DA L  +LP +FD RD++P+C  L   +RDQ
Sbjct: 222 TLSPYASSDETHGAHPFDRKAVGLGRVKWDA-LKHSLPRHFDARDEYPKCARLIGTVRDQ 280

Query: 376 GSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGG 525
           G CGSCWA  A E M DR+CI S   +    S +  +SC    G GC GG
Sbjct: 281 GKCGSCWAVAATEIMNDRLCISSGGKEVAELSPQFALSCYN-SGAGCEGG 329


>UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|Rep:
           Cysteine protease - Giardia muris
          Length = 301

 Score = 75.4 bits (177), Expect = 8e-13
 Identities = 41/102 (40%), Positives = 56/102 (54%), Gaps = 4/102 (3%)
 Frame = +1

Query: 253 DNILKLPKVTHDAEL----IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAM 420
           +N+  L   TH ++L       LP+++DPR +   C  L E+ DQ SCGSCWAF AV   
Sbjct: 55  ENLRSLRTETHVSQLNLGKTKELPKDYDPRVERAHC--LPEVADQASCGSCWAFSAVATF 112

Query: 421 TDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWE 546
            DR C Y   +K  H+S + +VSC    G  CNGG  +  W+
Sbjct: 113 ADRRCAYGLDSKQVHYSEQYVVSCDFGDG-ACNGGWLSNVWK 153


>UniRef50_Q625X3 Cluster: Putative uncharacterized protein CBG01102;
           n=1; Caenorhabditis briggsae|Rep: Putative
           uncharacterized protein CBG01102 - Caenorhabditis
           briggsae
          Length = 374

 Score = 74.9 bits (176), Expect = 1e-12
 Identities = 47/144 (32%), Positives = 71/144 (49%), Gaps = 4/144 (2%)
 Frame = +1

Query: 76  LYVALACILAVVASDLPHPLSDAFINLINKKQNTWKAGRNFPTHTPFAHIKILMGALKDD 255
           L+  L     V ASD     S   IN +N +++ W AG   P  +    +K L    +  
Sbjct: 4   LFFLLVFFTFVWASDFSD--STKIINYVNSQKSLWTAGN--PKISKDYMLKTLTTDPETV 59

Query: 256 NILKLPKVTHDAELIA--NLPEN--FDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMT 423
               L    +   + +  NL ++  FD R++WPEC ++  I D   C S WAF A E+M+
Sbjct: 60  GFRNLGPTFYSKNIFSPENLDDSNFFDARERWPECSSIPIINDISDCKSSWAFSAAESMS 119

Query: 424 DRVCIYSNATKHFHFSAEDLVSCC 495
           DR+CI S    +   SA++L+SCC
Sbjct: 120 DRLCINSGGMINTVLSAQELLSCC 143


>UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3;
           Giardia intestinalis|Rep: Cathepsin B-like CP1 precursor
           - Giardia lamblia (Giardia intestinalis)
          Length = 303

 Score = 74.9 bits (176), Expect = 1e-12
 Identities = 54/154 (35%), Positives = 77/154 (50%), Gaps = 6/154 (3%)
 Frame = +1

Query: 82  VALACILAVVASDLPHPL-SDAFINLINKKQNTWKAG--RNFP--THTPFAHIKILMGAL 246
           +AL+ +LAVV +    PL S A +  I      WKAG  + F   T   F  + I    L
Sbjct: 1   MALSLLLAVVCAK---PLVSRAELRRIQALNPPWKAGMPKRFENVTEDEFRSMLIRPDRL 57

Query: 247 KDDNILKLP-KVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMT 423
           +  +    P  +T   EL+  +P  FD RD++P+C  +    DQGSCGSCWAF A+    
Sbjct: 58  RARSGSLPPISITEVQELVDPIPPQFDFRDEYPQC--VKPALDQGSCGSCWAFSAIGVFG 115

Query: 424 DRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGG 525
           DR C      +   +S + L+S C +   GC+GG
Sbjct: 116 DRRCAMGIDKEAVSYSQQHLIS-CSLENFGCDGG 148


>UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|Rep:
           Cathepsin B - Streblomastix strix
          Length = 283

 Score = 71.7 bits (168), Expect = 1e-11
 Identities = 34/80 (42%), Positives = 47/80 (58%)
 Frame = +1

Query: 307 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 486
           +P+ FD R+KWP+   +  +RDQG CGSCWAF   E + DR+ +          + EDLV
Sbjct: 63  VPDTFDAREKWPDA--ILPVRDQGECGSCWAFSIAETIGDRLGVL--GCSRGDIAPEDLV 118

Query: 487 SCCPICGLGCNGGMPTLAWE 546
           S C I   GC+GG   +AW+
Sbjct: 119 S-CDIFDDGCDGGFIDMAWD 137


>UniRef50_A4RYN2 Cluster: Predicted protein; n=1; Ostreococcus
           lucimarinus CCE9901|Rep: Predicted protein -
           Ostreococcus lucimarinus CCE9901
          Length = 330

 Score = 69.3 bits (162), Expect = 5e-11
 Identities = 32/74 (43%), Positives = 43/74 (58%), Gaps = 1/74 (1%)
 Frame = +1

Query: 307 LPENFDPRDKWPECPTL-NEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 483
           LP +FD R  +P+C  L   +RDQG CGSCWA  A E M DR+C+ ++       S +  
Sbjct: 112 LPTSFDARVAYPKCSRLLGAVRDQGRCGSCWAVAATEVMNDRLCVATDGENADELSPQYA 171

Query: 484 VSCCPICGLGCNGG 525
           +SC    G GC+GG
Sbjct: 172 LSCFD-SGSGCDGG 184


>UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4;
           Giardia intestinalis|Rep: Cathepsin B-like CP2 precursor
           - Giardia lamblia (Giardia intestinalis)
          Length = 300

 Score = 68.9 bits (161), Expect = 7e-11
 Identities = 50/153 (32%), Positives = 73/153 (47%), Gaps = 3/153 (1%)
 Frame = +1

Query: 97  ILAVVASDLPHPLSDAFINLINKKQNTWKAG--RNFPTHTPFAHIKILMGALKDDNIL-K 267
           +LA  A   P  L+ + +N I      WKAG  + F   T      +LM      N    
Sbjct: 5   LLAAAAFSAP-ALTVSELNHIKSLNPRWKAGIPKRFEGLTKDEISSLLMPVSFLKNAKGA 63

Query: 268 LPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSN 447
            P+ T   +   ++PE+FD R+++P C  + E+ DQG CGSCWAF +V    DR C+   
Sbjct: 64  APRGTFTDK--DDVPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVAGL 119

Query: 448 ATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWE 546
             K   +S + +VS C    + CNGG     W+
Sbjct: 120 DKKPVKYSPQYVVS-CDHGDMACNGGWLPNVWK 151


>UniRef50_O02470 Cluster: Cysteine proteinase; n=2; Chromadorea|Rep:
           Cysteine proteinase - Globodera pallida
          Length = 53

 Score = 67.7 bits (158), Expect = 2e-10
 Identities = 28/52 (53%), Positives = 34/52 (65%), Gaps = 1/52 (1%)
 Frame = +1

Query: 373 QGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPI-CGLGCNGG 525
           QG CG CWAF   E ++DR CI SN T+    S  DL++CC + CG GCNGG
Sbjct: 1   QGQCGRCWAFSTAEVISDRTCIASNGTQQPIISPTDLLTCCGMSCGEGCNGG 52


>UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like protein
           F26E4.3; n=2; Caenorhabditis|Rep: Uncharacterized
           peptidase C1-like protein F26E4.3 - Caenorhabditis
           elegans
          Length = 491

 Score = 65.3 bits (152), Expect = 9e-10
 Identities = 31/79 (39%), Positives = 42/79 (53%)
 Frame = +1

Query: 307 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 486
           LPE+FD RDKW   P ++ + DQG CGS W+       +DR+ I S    +   S++ L+
Sbjct: 223 LPEHFDARDKWG--PLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQLL 280

Query: 487 SCCPICGLGCNGGMPTLAW 543
           SC      GC GG    AW
Sbjct: 281 SCNQHRQKGCEGGYLDRAW 299


>UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;
           n=1; Strongylocentrotus purpuratus|Rep: PREDICTED:
           hypothetical protein - Strongylocentrotus purpuratus
          Length = 450

 Score = 64.5 bits (150), Expect = 2e-09
 Identities = 32/81 (39%), Positives = 42/81 (51%)
 Frame = +1

Query: 301 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 480
           A LPE FD R+ WP    ++E+ DQG CGS WA       +DR+ I S    +   S + 
Sbjct: 195 ARLPETFDARENWPGL--IDEVIDQGKCGSSWAISTASVASDRLAIQSMGEINPRLSEQH 252

Query: 481 LVSCCPICGLGCNGGMPTLAW 543
           L+SC      GC+GG    AW
Sbjct: 253 LLSCNIRGQRGCSGGYLDRAW 273


>UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_113_4299_5381 - Giardia lamblia ATCC
           50803
          Length = 360

 Score = 63.7 bits (148), Expect = 3e-09
 Identities = 31/78 (39%), Positives = 44/78 (56%)
 Frame = +1

Query: 310 PENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVS 489
           PE++D RD++P C T  E+ DQG+CGSCWAF +V+   D  C          +S + ++ 
Sbjct: 141 PESYDFRDEYPHCIT--EVVDQGNCGSCWAFSSVQTFADHRCRSGLDATGVSYSVQYVLD 198

Query: 490 CCPICGLGCNGGMPTLAW 543
            C     GCNGG P  A+
Sbjct: 199 -CDRKDHGCNGGEPVNAF 215


>UniRef50_UPI0000E4866E Cluster: PREDICTED: similar to oxidized-LDL
           responsive gene 2, partial; n=1; Strongylocentrotus
           purpuratus|Rep: PREDICTED: similar to oxidized-LDL
           responsive gene 2, partial - Strongylocentrotus
           purpuratus
          Length = 363

 Score = 63.3 bits (147), Expect = 4e-09
 Identities = 34/96 (35%), Positives = 51/96 (53%), Gaps = 1/96 (1%)
 Frame = +1

Query: 259 ILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCI 438
           +L + ++ +D    A +PE FD R +WP    +  +++QG+C S WA       +DR+ I
Sbjct: 207 VLTMHQIQNDMPPEA-IPEEFDARAQWPGL--VEGVQNQGNCASSWAMSTAATASDRLAI 263

Query: 439 YSNAT-KHFHFSAEDLVSCCPICGLGCNGGMPTLAW 543
            SN T K+ H S + L+SC      GC GG    AW
Sbjct: 264 QSNGTFKYMHLSPQHLLSCNVKRQQGCAGGHLDRAW 299


>UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 234

 Score = 63.3 bits (147), Expect = 4e-09
 Identities = 33/84 (39%), Positives = 48/84 (57%)
 Frame = +1

Query: 295 LIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSA 474
           ++ ++P+  D R K      +NEI+DQ  CGSCWAFG+  AM     +       +  S 
Sbjct: 14  IVGDIPDEIDYRTKG----AVNEIKDQKHCGSCWAFGSCAAMESSWFLKHGTL--YSLSE 67

Query: 475 EDLVSCCPICGLGCNGGMPTLAWE 546
           + LV CC  C LGC+G +P+LA+E
Sbjct: 68  QCLVDCCHDC-LGCHGCLPSLAFE 90


>UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin C;
           n=1; Strongylocentrotus purpuratus|Rep: PREDICTED:
           similar to cathepsin C - Strongylocentrotus purpuratus
          Length = 482

 Score = 62.9 bits (146), Expect = 5e-09
 Identities = 43/140 (30%), Positives = 64/140 (45%), Gaps = 3/140 (2%)
 Frame = +1

Query: 127 HPLSDAFINLINKKQNTWKAG-RNFPTHTPFAHIKILMGALKDDNILK--LPKVTHDAEL 297
           H  +D FI  INK Q++WKA   +   +     ++   G      +     P      + 
Sbjct: 186 HRRNDKFIEGINKHQDSWKATYYDRYVNLTLGDMRRRAGGKLWKRVWPDVSPTDERTKQA 245

Query: 298 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 477
            +NLPE FD RD       ++ +RDQG CGSC+AF +      R+ + +N       S +
Sbjct: 246 ASNLPEKFDWRDV-GGIDYVSPVRDQGICGSCYAFASTATQESRLRVMTNNNVKVVMSPQ 304

Query: 478 DLVSCCPICGLGCNGGMPTL 537
           ++VSC      GC GG P L
Sbjct: 305 EVVSCSEY-AQGCEGGFPYL 323


>UniRef50_Q5DEC9 Cluster: SJCHGC06356 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC06356 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 279

 Score = 62.5 bits (145), Expect = 6e-09
 Identities = 25/72 (34%), Positives = 42/72 (58%)
 Frame = +1

Query: 277 VTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATK 456
           ++H++ +   +P +FD R  W  C T+ +I D+  C + WA   V++++DR+CI SN   
Sbjct: 19  ISHNS-INMEIPRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSISDRICIRSNGRI 77

Query: 457 HFHFSAEDLVSC 492
               SA D +SC
Sbjct: 78  SVQLSARDAISC 89


>UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p;
           n=1; Nasonia vitripennis|Rep: PREDICTED: similar to
           GM06507p - Nasonia vitripennis
          Length = 483

 Score = 61.7 bits (143), Expect = 1e-08
 Identities = 33/101 (32%), Positives = 47/101 (46%), Gaps = 1/101 (0%)
 Frame = +1

Query: 244 LKDDNILKLPKVTHDAELIAN-LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAM 420
           L   +I ++PK      +  N LP  FD R +W     +  ++DQG CG+ WA   V+  
Sbjct: 214 LHSTDIFQIPKQNKQQWINPNDLPREFDSRIQWGN--DITPVQDQGWCGASWAISTVDVA 271

Query: 421 TDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAW 543
           +DR  I S   +    S + L+SC      GC GG    AW
Sbjct: 272 SDRFAIMSKGIEKVQLSGQHLISCNNRGQRGCKGGYLDRAW 312


>UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA,
           isoform A; n=2; Endopterygota|Rep: PREDICTED: similar to
           CG3074-PA, isoform A - Tribolium castaneum
          Length = 445

 Score = 61.7 bits (143), Expect = 1e-08
 Identities = 32/80 (40%), Positives = 40/80 (50%)
 Frame = +1

Query: 304 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 483
           +LP  FD   KWP    ++EI+DQG CGS WA       +DR  I S   +    SA+ L
Sbjct: 196 SLPREFDSEFKWPGW--MSEIQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQHL 253

Query: 484 VSCCPICGLGCNGGMPTLAW 543
           +SC       CNGG    AW
Sbjct: 254 LSCDRRGQQSCNGGYLDRAW 273


>UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep:
           Vivapain-4 - Plasmodium vivax
          Length = 484

 Score = 60.9 bits (141), Expect = 2e-08
 Identities = 49/143 (34%), Positives = 71/143 (49%), Gaps = 8/143 (5%)
 Frame = +1

Query: 142 AFINLINKKQNT-WKAGRNFPTHTPFAHIKILMGALKDDNILKL---PKVT-HDAELIAN 306
           A IN  N K N  +K G N  +   F   +  M  L+ D   KL   P V+ +D  L   
Sbjct: 195 ARINSHNSKANILYKKGTNQYSDISFEEFRKTMLTLRFDLKKKLANSPYVSNYDDVLKKY 254

Query: 307 LPENF---DPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 477
            P +    + +  W E   ++EI++Q  CGSCWAFGAV A+  +  I  N  +H   S +
Sbjct: 255 KPADAVVDNEKYDWREHNAVSEIKNQNLCGSCWAFGAVGAVESQYAIRKN--QHVLISEQ 312

Query: 478 DLVSCCPICGLGCNGGMPTLAWE 546
           +LV C      GC GG+ +LA++
Sbjct: 313 ELVDCSD-KNFGCFGGLASLAFD 334


>UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase;
           n=21; Bilateria|Rep: Cathepsin L-like cysteine
           proteinase - Globodera pallida
          Length = 379

 Score = 60.5 bits (140), Expect = 3e-08
 Identities = 44/131 (33%), Positives = 65/131 (49%), Gaps = 7/131 (5%)
 Frame = +1

Query: 175 TWKAGRNFPTHTPFAHIKILMG--ALKDDNILKLPKVTHDAELIANLPENFDPRDK-WPE 345
           T++ G N     PF+  K L G   L  DN+ +          + +LPE+ D RDK W  
Sbjct: 115 TFRVGENHIADLPFSEYKKLNGYRRLLGDNLRRNASTFLAPMNVGDLPESVDWRDKGW-- 172

Query: 346 CPTLNEIRDQGSCGSCWAF---GAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG-LG 513
              + E+++QG CGSCWAF   GA+EA   R        +    S ++L+ C    G +G
Sbjct: 173 ---VTEVKNQGMCGSCWAFSSTGALEAQHAR-----QTGQLISLSEQNLIDCSKKYGNMG 224

Query: 514 CNGGMPTLAWE 546
           CNGG+   A++
Sbjct: 225 CNGGIMDNAFQ 235


>UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcoptes
           scabiei type hominis|Rep: Sar s 1 allergen Yv9053H09 -
           Sarcoptes scabiei type hominis
          Length = 253

 Score = 60.5 bits (140), Expect = 3e-08
 Identities = 34/79 (43%), Positives = 45/79 (56%), Gaps = 4/79 (5%)
 Frame = +1

Query: 304 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAV----EAMTDRVCIYSNATKHFHFS 471
           +LPE FD RD       L++IR+QG CG+CWAF A+     A   R  I  N T+  HFS
Sbjct: 36  DLPEKFDLRD----LGYLSKIRNQGRCGACWAFAALASVESAYNRRTRIVHNRTRKHHFS 91

Query: 472 AEDLVSCCPICGLGCNGGM 528
            ++LV C P    GC+G +
Sbjct: 92  EQELVDCSPNTE-GCSGNI 109


>UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 291

 Score = 60.5 bits (140), Expect = 3e-08
 Identities = 41/129 (31%), Positives = 59/129 (45%), Gaps = 1/129 (0%)
 Frame = +1

Query: 145 FINLINKKQNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFD 324
           F+   NK    +K   N  +H      + L+G   D N++   K       I + P   D
Sbjct: 26  FVETHNKANANYKLSLNSLSHLTPTEYQSLLGTKIDKNLVSQGKKVRPQ--IKDSPGILD 83

Query: 325 PRDKWPECPTLNEIRDQGSCGSCWAFGAVEAM-TDRVCIYSNATKHFHFSAEDLVSCCPI 501
            R    E   +N IRDQ  CGSCWAFG V A  ++   +YSN  +    S ++++ C   
Sbjct: 84  YR----EMGVVNPIRDQKQCGSCWAFGTVAACESNYALLYSNLPQ---LSEQNIIDCATT 136

Query: 502 CGLGCNGGM 528
           C  GC GG+
Sbjct: 137 C-YGCGGGI 144


>UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2;
           cellular organisms|Rep: Cysteine proteinase, putative -
           Archaeoglobus fulgidus
          Length = 1088

 Score = 60.5 bits (140), Expect = 3e-08
 Identities = 29/72 (40%), Positives = 41/72 (56%)
 Frame = +1

Query: 298 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 477
           +A+LP  FD    W +   L+ +RDQGSCGSCWA  AV A+   + + S A+     S +
Sbjct: 591 MASLPSRFD----WRDYTGLSAVRDQGSCGSCWAHSAVAALESALIVESGASSSIDLSEQ 646

Query: 478 DLVSCCPICGLG 513
            L+SC   C +G
Sbjct: 647 HLLSCEQDCEVG 658


>UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11;
           Trichomonadidae|Rep: Cysteine protease - Tritrichomonas
           foetus (Trichomonas foetus)
          Length = 315

 Score = 59.7 bits (138), Expect = 4e-08
 Identities = 30/77 (38%), Positives = 44/77 (57%)
 Frame = +1

Query: 316 NFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC 495
           N D  D W E   +NEI+DQ +CGSCWAF A++A      I +   +   +S ++LV C 
Sbjct: 100 NVDSID-WREKGVVNEIKDQAACGSCWAFSAIQAAESAYAISTGTLE--SYSEQNLVDCV 156

Query: 496 PICGLGCNGGMPTLAWE 546
             C  GC+GG+   A++
Sbjct: 157 QGC-YGCSGGLMDYAYK 172


>UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Rep:
           Cathepsin L - Stylonychia lemnae
          Length = 340

 Score = 59.3 bits (137), Expect = 6e-08
 Identities = 43/137 (31%), Positives = 63/137 (45%), Gaps = 2/137 (1%)
 Frame = +1

Query: 142 AFINLINKKQN--TWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPE 315
           AFIN  N + +  ++  G N          K ++G  K  N  K  K  +    + ++PE
Sbjct: 71  AFINNHNSQNDGTSFTLGPNHLADYTHDEYKKMLG-YKPRN--KTGKEVYSTPNLKDIPE 127

Query: 316 NFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC 495
           + D    W E   +N ++DQG CGSCWAF  + ++  R  I +   K    S + LV C 
Sbjct: 128 SID----WREKGAVNAVKDQGQCGSCWAFSTIASLESRYFIETG--KLQSLSEQQLVDCS 181

Query: 496 PICGLGCNGGMPTLAWE 546
                GCNGG   LA +
Sbjct: 182 KNGNEGCNGGDMGLAMD 198


>UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag-RP
           - Bombyx mori (Silk moth)
          Length = 404

 Score = 59.3 bits (137), Expect = 6e-08
 Identities = 39/138 (28%), Positives = 62/138 (44%)
 Frame = +1

Query: 133 LSDAFINLINKKQNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLP 312
           +S+  +N +N++  TW+A     T+  F   K+  G +       L             P
Sbjct: 131 MSEDLVNDVNQQGTTWRA----TTYPEFNEKKLKDGLIYKLGTFPLNVTVISYSKDGQYP 186

Query: 313 ENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSC 492
           + FD R +W     ++ I DQ  CGS WA      + DR  I S  T++   S++ L+SC
Sbjct: 187 DEFDARREW--YGYISPIADQDWCGSDWAVSIASIVGDRFSIQSFGTENVRMSSQTLLSC 244

Query: 493 CPICGLGCNGGMPTLAWE 546
                 GCNGG   +A++
Sbjct: 245 HLKGQRGCNGGNLDIAFD 262


>UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35;
           Viridiplantae|Rep: Cysteine proteinase 15A precursor -
           Pisum sativum (Garden pea)
          Length = 363

 Score = 58.8 bits (136), Expect = 8e-08
 Identities = 39/105 (37%), Positives = 51/105 (48%), Gaps = 10/105 (9%)
 Frame = +1

Query: 262 LKLPKVTHDAELI--ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVC 435
           L+LP     A ++   NLPE+FD R+K    P    ++DQGSCGSCWAF    A+     
Sbjct: 115 LRLPAHAQKAPILPTTNLPEDFDWREKGAVTP----VKDQGSCGSCWAFSTTGALEG--A 168

Query: 436 IYSNATKHFHFSAEDLVSCCPI--------CGLGCNGGMPTLAWE 546
            Y    K    S + LV C  +        C  GCNGG+   A+E
Sbjct: 169 HYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFE 213


>UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|Rep:
           Cathepsin L precursor - Schistosoma mansoni (Blood
           fluke)
          Length = 319

 Score = 58.4 bits (135), Expect = 1e-07
 Identities = 30/83 (36%), Positives = 45/83 (54%)
 Frame = +1

Query: 298 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 477
           + N+P+NFD    W E   + E+++QG CGSCWAF     +  +   +    K    S +
Sbjct: 102 VNNIPKNFD----WREKGAVTEVKNQGMCGSCWAFSTTGNVESQ--WFRKTGKLLSLSEQ 155

Query: 478 DLVSCCPICGLGCNGGMPTLAWE 546
            LV C  +   GCNGG+P+ A+E
Sbjct: 156 QLVDCDGLDD-GCNGGLPSNAYE 177


>UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor;
           n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase
           precursor - Diabrotica virgifera virgifera (western corn
           rootworm)
          Length = 326

 Score = 58.0 bits (134), Expect = 1e-07
 Identities = 32/92 (34%), Positives = 43/92 (46%)
 Frame = +1

Query: 271 PKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNA 450
           P+V H    + +LP  FD    W E   + E++DQGSCGSCW+F      T     +   
Sbjct: 98  PRVIHSLTPVKDLPSKFD----WREKGAVTEVKDQGSCGSCWSFSTTG--TVEGAYFLKT 151

Query: 451 TKHFHFSAEDLVSCCPICGLGCNGGMPTLAWE 546
            K    S ++LV C      GC+GG    A E
Sbjct: 152 GKLVSLSEQNLVDCAKEDCYGCSGGYMDKALE 183


>UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase
           precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine
           proteinase precursor - Heterodera glycines (Soybean cyst
           nematode worm)
          Length = 353

 Score = 58.0 bits (134), Expect = 1e-07
 Identities = 32/83 (38%), Positives = 44/83 (53%), Gaps = 1/83 (1%)
 Frame = +1

Query: 301 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 480
           + LPE  D    W E   + E++DQG CGSCWAF A  A+ +       A+K    S ++
Sbjct: 133 STLPEKLD----WREKGAVTEVKDQGDCGSCWAFSATGAI-EGALAQKKASKIISLSEQN 187

Query: 481 LVSCCPICG-LGCNGGMPTLAWE 546
           LV C    G  GC+GG+   A+E
Sbjct: 188 LVDCSSKYGNEGCDGGLMDSAFE 210


>UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheirus
           salmonis|Rep: Cysteine proteinase - Lepeophtheirus
           salmonis (salmon louse)
          Length = 372

 Score = 57.6 bits (133), Expect = 2e-07
 Identities = 32/87 (36%), Positives = 46/87 (52%), Gaps = 5/87 (5%)
 Frame = +1

Query: 298 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 477
           I +LPE+ D    W E   + ++++QGSCGSCW F AVE +   V I +N T     S +
Sbjct: 112 IKDLPESVD----WREKGVITDVKNQGSCGSCWVFSAVEQIESYVAIENNMTSPPLLSTQ 167

Query: 478 DLVSCCP---ICG--LGCNGGMPTLAW 543
            + SC      CG   GC G +  +A+
Sbjct: 168 QITSCSSNPYSCGGSGGCKGAINEIAY 194


>UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-like
           precursor; n=26; Euteleostomi|Rep: Tubulointerstitial
           nephritis antigen-like precursor - Homo sapiens (Human)
          Length = 467

 Score = 57.6 bits (133), Expect = 2e-07
 Identities = 39/135 (28%), Positives = 62/135 (45%), Gaps = 3/135 (2%)
 Frame = +1

Query: 148 INLINKKQNTWKAGRN--FPTHTPFAHIKILMGALK-DDNILKLPKVTHDAELIANLPEN 318
           I  IN+    W+AG +  F   T    I+  +G ++   +++ + ++         LP  
Sbjct: 147 IKAINQGNYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNPGEVLPTA 206

Query: 319 FDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP 498
           F+  +KWP    ++E  DQG+C   WAF      +DRV I+S        S ++L+SC  
Sbjct: 207 FEASEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDT 264

Query: 499 ICGLGCNGGMPTLAW 543
               GC GG    AW
Sbjct: 265 HQQQGCRGGRLDGAW 279


>UniRef50_Q9UJW2 Cluster: Tubulointerstitial nephritis antigen;
           n=20; Amniota|Rep: Tubulointerstitial nephritis antigen
           - Homo sapiens (Human)
          Length = 476

 Score = 57.6 bits (133), Expect = 2e-07
 Identities = 41/135 (30%), Positives = 56/135 (41%), Gaps = 3/135 (2%)
 Frame = +1

Query: 148 INLINKKQNTWKAGR--NFPTHTPFAHIKILMGALKDDN-ILKLPKVTHDAELIANLPEN 318
           I  +NK    W A     F   T     K  +G L     +L + ++T       +LPE 
Sbjct: 161 IEQVNKGDYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPATTDLPEF 220

Query: 319 FDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP 498
           F    KWP   T   + DQ +C + WAF       DR+ I S      + S ++L+SCC 
Sbjct: 221 FVASYKWPGW-THGPL-DQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA 278

Query: 499 ICGLGCNGGMPTLAW 543
               GCN G    AW
Sbjct: 279 KNRHGCNSGSIDRAW 293


>UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6;
           Schistosoma|Rep: Cathepsin C precursor - Schistosoma
           mansoni (Blood fluke)
          Length = 454

 Score = 57.6 bits (133), Expect = 2e-07
 Identities = 45/147 (30%), Positives = 71/147 (48%), Gaps = 12/147 (8%)
 Frame = +1

Query: 133 LSDAFINLINKKQNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKV----THDAELI 300
           ++ +F+  IN  Q +W+ G  +P  + +   ++   A    +++  P V    T   ELI
Sbjct: 154 INPSFVGKINAHQKSWR-GEIYPELSKYTIDELRNRAGGVKSMVTRPSVLNRKTPSKELI 212

Query: 301 A---NLPENFDPRDKWPECPT-----LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATK 456
           +   NLP  FD    W   P      +  IR+QG CGSC+A  +  A+  R+ + SN ++
Sbjct: 213 SLTGNLPLEFD----WTSPPDGSRSPVTPIRNQGICGSCYASPSAAALEARIRLVSNFSE 268

Query: 457 HFHFSAEDLVSCCPICGLGCNGGMPTL 537
               S + +V C P    GCNGG P L
Sbjct: 269 QPILSPQTVVDCSPY-SEGCNGGFPFL 294


>UniRef50_UPI0000E815AE Cluster: PREDICTED: similar to
           glucocorticoid-inducible protein; n=1; Gallus
           gallus|Rep: PREDICTED: similar to
           glucocorticoid-inducible protein - Gallus gallus
          Length = 307

 Score = 57.2 bits (132), Expect = 2e-07
 Identities = 29/79 (36%), Positives = 40/79 (50%)
 Frame = +1

Query: 307 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 486
           LP +FD   KWP    ++E  DQG+C   WAF      +DR+ I+S        S ++L+
Sbjct: 153 LPRHFDAATKWPGM--IHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMTPSLSPQNLL 210

Query: 487 SCCPICGLGCNGGMPTLAW 543
           SC      GC+GG    AW
Sbjct: 211 SCDTRNQRGCSGGRLDGAW 229


>UniRef50_Q24E33 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 328

 Score = 57.2 bits (132), Expect = 2e-07
 Identities = 35/130 (26%), Positives = 60/130 (46%), Gaps = 2/130 (1%)
 Frame = +1

Query: 145 FINLINKKQNTWKAGRNFPTH-TPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENF 321
           +I   N K NT+K   N     T   +  + +   + ++I     +  D E + ++P   
Sbjct: 72  YIQSENAKNNTFKLAINIMAILTDEEYSSLYLNLDQQESIDIFDSLVDDNETVGDIPSEV 131

Query: 322 DPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPI 501
           +    W     +  +++QGSCGSCWAF    A+     + +N  +   FS + LV C  +
Sbjct: 132 N----WTAQGAVTPVKNQGSCGSCWAFSTTGALEGSYFLKNN--QLISFSEQQLVDCSRL 185

Query: 502 -CGLGCNGGM 528
              +GCNGG+
Sbjct: 186 YLNMGCNGGL 195


>UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor;
           n=176; Viridiplantae|Rep: Cysteine proteinase RD21a
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 462

 Score = 57.2 bits (132), Expect = 2e-07
 Identities = 38/135 (28%), Positives = 64/135 (47%), Gaps = 1/135 (0%)
 Frame = +1

Query: 145 FINLINKKQNTWKAG-RNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENF 321
           F++  N+K  +++ G   F   T   +    +GA  +    +   + ++A +   LPE+ 
Sbjct: 82  FVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSLRYEARVGDELPESI 141

Query: 322 DPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPI 501
           D    W +   + E++DQG CGSCWAF  + A+     I +        S ++LV C   
Sbjct: 142 D----WRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDL--ITLSEQELVDCDTS 195

Query: 502 CGLGCNGGMPTLAWE 546
              GCNGG+   A+E
Sbjct: 196 YNEGCNGGLMDYAFE 210


>UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=15;
           Magnoliophyta|Rep: Cysteine proteinase RD19a precursor -
           Arabidopsis thaliana (Mouse-ear cress)
          Length = 368

 Score = 57.2 bits (132), Expect = 2e-07
 Identities = 38/104 (36%), Positives = 52/104 (50%), Gaps = 10/104 (9%)
 Frame = +1

Query: 265 KLPKVTHDAELIA--NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCI 438
           KLPK  + A ++   NLPE+FD RD     P    +++QGSCGSCW+F A  A+     +
Sbjct: 119 KLPKDANKAPILPTENLPEDFDWRDHGAVTP----VKNQGSCGSCWSFSATGALEGANFL 174

Query: 439 YSNATKHFHFSAEDLVSC--------CPICGLGCNGGMPTLAWE 546
            +   K    S + LV C           C  GCNGG+   A+E
Sbjct: 175 ATG--KLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFE 216


>UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 323

 Score = 56.8 bits (131), Expect = 3e-07
 Identities = 29/87 (33%), Positives = 46/87 (52%), Gaps = 4/87 (4%)
 Frame = +1

Query: 277 VTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATK 456
           +++    +  +P +FD R  W +C  ++ +R+Q SCGSCWA      + DR+CI S+   
Sbjct: 36  ISYSQNELDTIPASFDVRTNWGDC--MSPVREQQSCGSCWAQVTSGILADRMCIESDKNI 93

Query: 457 HFHFSAEDLVSC---CPICGL-GCNGG 525
               S + L+ C   C   G+ GCN G
Sbjct: 94  KMLLSPQYLMDCDGSCVSDGVSGCNNG 120


>UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan
           CA, family C1, cathepsin L-like cysteine peptidase -
           Trichomonas vaginalis G3
          Length = 306

 Score = 56.8 bits (131), Expect = 3e-07
 Identities = 25/74 (33%), Positives = 41/74 (55%)
 Frame = +1

Query: 304 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 483
           N+  +      W E   +N+I++QG+CGSCWAF A++ +  +V    N  + +  S ++L
Sbjct: 83  NIKNDVPTEIDWREQGIVNKIKNQGACGSCWAFSAIQVIESQVA--KNQKQLYDLSEQNL 140

Query: 484 VSCCPICGLGCNGG 525
           + C   C  GC GG
Sbjct: 141 LDCVTSC-FGCGGG 153


>UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin O
           precursor; n=2; Apocrita|Rep: PREDICTED: similar to
           Cathepsin O precursor - Apis mellifera
          Length = 374

 Score = 56.4 bits (130), Expect = 4e-07
 Identities = 28/74 (37%), Positives = 38/74 (51%)
 Frame = +1

Query: 304 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 483
           ++P  FD RDK    P    +R QGSCG+CWAF  +E +     I  N T H   S +++
Sbjct: 154 SIPLRFDWRDKGVITP----VRSQGSCGACWAFSTIEVIESMFAI-KNGTLH-SLSVQEM 207

Query: 484 VSCCPICGLGCNGG 525
           + C      GC GG
Sbjct: 208 IDCAKNSNFGCEGG 221


>UniRef50_O16454 Cluster: Temporarily assigned gene name protein
           196; n=4; Bilateria|Rep: Temporarily assigned gene name
           protein 196 - Caenorhabditis elegans
          Length = 477

 Score = 56.4 bits (130), Expect = 4e-07
 Identities = 30/81 (37%), Positives = 46/81 (56%)
 Frame = +1

Query: 304 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 483
           +LPE+FD    W E   + ++++QG+CGSCWAF     +     I  N  K    S ++L
Sbjct: 263 DLPESFD----WREKGAVTQVKNQGNCGSCWAFSTTGNVEGAWFIAKN--KLVSLSEQEL 316

Query: 484 VSCCPICGLGCNGGMPTLAWE 546
           V C  +   GCNGG+P+ A++
Sbjct: 317 VDCDSM-DQGCNGGLPSNAYK 336


>UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep:
           Cathepsin L - Kudoa thyrsites
          Length = 300

 Score = 56.4 bits (130), Expect = 4e-07
 Identities = 30/91 (32%), Positives = 47/91 (51%)
 Frame = +1

Query: 271 PKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNA 450
           PK T   ++ + LP + D    W     +  +++QG CGSCW+F A  A+     I +  
Sbjct: 90  PKETATKDIKSTLPSSVD----WKALGKVTSVKNQGHCGSCWSFSAAGAIESAYAIKTG- 144

Query: 451 TKHFHFSAEDLVSCCPICGLGCNGGMPTLAW 543
            +  +FS + LV C      GCNGG+P +A+
Sbjct: 145 -ELVNFSEQQLVDCSTE-NHGCNGGLPEIAF 173


>UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Rep:
           Cysteine protease - Solanum lycopersicum (Tomato)
           (Lycopersicon esculentum)
          Length = 345

 Score = 56.0 bits (129), Expect = 5e-07
 Identities = 41/139 (29%), Positives = 68/139 (48%), Gaps = 5/139 (3%)
 Frame = +1

Query: 145 FINLINKKQN-TWKAGRN-FPTHTPFAHIKILMGA-LKDDNILKLPKVTHDAELIANLPE 315
           FI  +NK  N ++K G N F   T    +    G  + +  +   P  + + + I +L +
Sbjct: 69  FIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKKINDLSD 128

Query: 316 NFDPRD-KWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKH-FHFSAEDLVS 489
           ++ P +  W E   + +++ QG CG CWAF AV ++      Y  AT +   FS ++L+ 
Sbjct: 129 DYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEG---AYKIATGNLMEFSEQELLD 185

Query: 490 CCPICGLGCNGGMPTLAWE 546
            C     GCNGG  T A++
Sbjct: 186 -CTTNNYGCNGGFMTNAFD 203


>UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p -
           Drosophila melanogaster (Fruit fly)
          Length = 431

 Score = 56.0 bits (129), Expect = 5e-07
 Identities = 27/79 (34%), Positives = 40/79 (50%)
 Frame = +1

Query: 307 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 486
           LP +F+  DKW     ++E+ DQG CG+ W        +DR  I S   ++   SA++++
Sbjct: 187 LPSSFNALDKWSSY--ISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNIL 244

Query: 487 SCCPICGLGCNGGMPTLAW 543
           SC      GC GG    AW
Sbjct: 245 SCTR-RQQGCEGGHLDAAW 262


>UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18;
           Magnoliophyta|Rep: Thiol protease aleurain precursor -
           Arabidopsis thaliana (Mouse-ear cress)
          Length = 358

 Score = 55.6 bits (128), Expect = 7e-07
 Identities = 44/135 (32%), Positives = 60/135 (44%), Gaps = 2/135 (1%)
 Frame = +1

Query: 148 INLINKKQNTWKAGRN-FPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFD 324
           I   NKK  ++K G N F   T     +  +GA +  N     K +H     A LPE  D
Sbjct: 90  IRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQ--NCSATLKGSHKVTEAA-LPETKD 146

Query: 325 PRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PI 501
               W E   ++ ++DQG CGSCW F    A+      +    K    S + LV C    
Sbjct: 147 ----WREDGIVSPVKDQGGCGSCWTFSTTGAL--EAAYHQAFGKGISLSEQQLVDCAGAF 200

Query: 502 CGLGCNGGMPTLAWE 546
              GCNGG+P+ A+E
Sbjct: 201 NNYGCNGGLPSQAFE 215


>UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2;
           Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio
           molitor (Yellow mealworm)
          Length = 336

 Score = 55.2 bits (127), Expect = 9e-07
 Identities = 33/86 (38%), Positives = 47/86 (54%), Gaps = 2/86 (2%)
 Frame = +1

Query: 274 KVTHDAELIANL--PENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSN 447
           K   D  L A++  P +FD RD+    P    +++QGSCGSCWAF +  A+  ++ I + 
Sbjct: 108 KTREDLGLNASVRYPASFDWRDQGMVSP----VKNQGSCGSCWAFSSTGAIESQMKIANG 163

Query: 448 ATKHFHFSAEDLVSCCPICGLGCNGG 525
           A      S + LV C P   LGC+GG
Sbjct: 164 AGYDSSVSEQQLVDCVP-NALGCSGG 188


>UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1;
           Trypanosoma cruzi|Rep: Cysteine protease, putative -
           Trypanosoma cruzi
          Length = 434

 Score = 55.2 bits (127), Expect = 9e-07
 Identities = 32/77 (41%), Positives = 38/77 (49%), Gaps = 7/77 (9%)
 Frame = +1

Query: 337 WPEC--PTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSC---CPI 501
           W E   P L  ++DQGSCGSCWA  A E++     I S   K    S + + SC      
Sbjct: 131 WQEAKNPVLTPVKDQGSCGSCWAHAATESVESMYAISSG--KLLTLSTQQITSCVNNTRK 188

Query: 502 CG--LGCNGGMPTLAWE 546
           CG   GC GG   LAWE
Sbjct: 189 CGGSGGCGGGTAQLAWE 205


>UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101,
           whole genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_101,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 306

 Score = 55.2 bits (127), Expect = 9e-07
 Identities = 42/129 (32%), Positives = 65/129 (50%), Gaps = 1/129 (0%)
 Frame = +1

Query: 157 INKKQNTWKAGRN-FPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRD 333
           +N +Q+++  G N F T T     +I +G      I    ++    + I NLPE+ D   
Sbjct: 64  VNSRQSSYTLGINQFATLTDEEFEQIYLGRADSSPI----EIDESIDSI-NLPESVDWSS 118

Query: 334 KWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLG 513
           K      +N +++QG+CGS W+F AV A  +   I+   T HF +S ++LV  C     G
Sbjct: 119 K------MNPVKNQGTCGSGWSFSAVGAF-EAFFIFVKGT-HFQYSEQNLVD-CDTNSHG 169

Query: 514 CNGGMPTLA 540
           C+GG P  A
Sbjct: 170 CDGGYPAKA 178


>UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis
           zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa
           zea single nucleocapsid nuclear polyhedrosis virus)
          Length = 367

 Score = 55.2 bits (127), Expect = 9e-07
 Identities = 30/80 (37%), Positives = 44/80 (55%)
 Frame = +1

Query: 307 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 486
           LP+ +D    W +   +  I+DQG CGSCWAF A+  +  +  I  N  K    S + L+
Sbjct: 156 LPDYYD----WRDTNKVTPIKDQGVCGSCWAFVAIGNIESQYAIRHN--KLIDLSEQQLL 209

Query: 487 SCCPICGLGCNGGMPTLAWE 546
            C  +  LGCNGG+  LA++
Sbjct: 210 DCDEV-DLGCNGGLMHLAFQ 228


>UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F
            like protease; n=1; Nasonia vitripennis|Rep: PREDICTED:
            similar to cathepsin F like protease - Nasonia
            vitripennis
          Length = 1036

 Score = 54.8 bits (126), Expect = 1e-06
 Identities = 35/105 (33%), Positives = 51/105 (48%), Gaps = 1/105 (0%)
 Frame = +1

Query: 232  LMGALKDDNILKLPKVT-HDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGA 408
            L   LK +N + +P  T  D EL    P ++D    W     +  ++DQGSCGSCWAF  
Sbjct: 795  LKPTLKSENDIPMPMATIPDIEL----PSDYD----WRHHNVVTPVKDQGSCGSCWAFSV 846

Query: 409  VEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAW 543
               +  +  I     +    S ++LV C  +   GCNGG+P  A+
Sbjct: 847  TGNIEGQYAIKHG--ELLSLSEQELVDCDKL-DSGCNGGLPDTAY 888


>UniRef50_Q9GZM7-2 Cluster: Isoform 2 of Q9GZM7; n=1; Homo
           sapiens|Rep: Isoform 2 of Q9GZM7 - Homo sapiens (Human)
          Length = 283

 Score = 54.8 bits (126), Expect = 1e-06
 Identities = 29/79 (36%), Positives = 39/79 (49%)
 Frame = +1

Query: 307 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 486
           LP  F+  +KWP    ++E  DQG+C   WAF      +DRV I+S        S ++L+
Sbjct: 69  LPTAFEASEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLL 126

Query: 487 SCCPICGLGCNGGMPTLAW 543
           SC      GC GG    AW
Sbjct: 127 SCDTHQQQGCRGGRLDGAW 145


>UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_217_11853_10927 - Giardia lamblia
           ATCC 50803
          Length = 308

 Score = 54.8 bits (126), Expect = 1e-06
 Identities = 26/80 (32%), Positives = 46/80 (57%)
 Frame = +1

Query: 307 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 486
           +P++FD R+++P+C T  E+ D G C S WA+ AV+A + R C+     +   +SA+ ++
Sbjct: 75  VPDHFDFREEYPQCIT--EVIDIGLCSSSWAYSAVDAFSHRRCLTGLDQEATRYSAQYIL 132

Query: 487 SCCPICGLGCNGGMPTLAWE 546
           SC    G        ++AW+
Sbjct: 133 SCSSTNGCFGFSTRESIAWD 152


>UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1;
           Brugia malayi|Rep: Cathepsin F-like cysteine proteinase
           - Brugia malayi (Filarial nematode worm)
          Length = 461

 Score = 54.8 bits (126), Expect = 1e-06
 Identities = 30/82 (36%), Positives = 41/82 (50%)
 Frame = +1

Query: 298 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 477
           I NLP  FD    W     +  ++DQGSCGSCWAF     +     I +   K    S +
Sbjct: 245 IYNLPSKFD----WRTEGVVTPVKDQGSCGSCWAFSVTGNIESLWAIKTG--KLISLSEQ 298

Query: 478 DLVSCCPICGLGCNGGMPTLAW 543
           +L+  C +   GCNGG+P  A+
Sbjct: 299 ELID-CDVIDKGCNGGLPINAF 319


>UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1;
           Dictyostelium discoideum AX4|Rep: Counting factor
           associated protein - Dictyostelium discoideum AX4
          Length = 531

 Score = 54.8 bits (126), Expect = 1e-06
 Identities = 35/130 (26%), Positives = 59/130 (45%), Gaps = 1/130 (0%)
 Frame = +1

Query: 160 NKKQNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKW 339
           N K++++K G N            L+        +      HD E + ++P   D R++ 
Sbjct: 260 NAKESSYKLGMNHYADLSNKEFNTLVKPKVARPSVTGADSVHDDESLRSIPSTVDWRNQ- 318

Query: 340 PECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG-LGC 516
             C T   ++DQG CGSCW FG+  ++    C+ +   +    S + LV C  + G  GC
Sbjct: 319 -NCVT--PVKDQGICGSCWTFGSTGSLEGTNCVTNG--ELVSLSEQQLVDCAILTGSQGC 373

Query: 517 NGGMPTLAWE 546
            GG  + A++
Sbjct: 374 GGGFASSAFQ 383


>UniRef50_Q23H32 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 365

 Score = 54.8 bits (126), Expect = 1e-06
 Identities = 30/77 (38%), Positives = 43/77 (55%), Gaps = 1/77 (1%)
 Frame = +1

Query: 304 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKH-FHFSAED 480
           ++PE+ D R+K      +  ++ QG CGSCWAF  V A+      Y+  T +   FS ++
Sbjct: 134 SVPESVDWREK-----LVAPVQKQGGCGSCWAFSTVIALEG---AYAKQTGNVIKFSEQN 185

Query: 481 LVSCCPICGLGCNGGMP 531
           L+ CC I   GCNGG P
Sbjct: 186 LIDCCRIENNGCNGGDP 202


>UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3;
           Theileria|Rep: Cysteine proteinase precursor - Theileria
           annulata
          Length = 441

 Score = 54.8 bits (126), Expect = 1e-06
 Identities = 26/71 (36%), Positives = 40/71 (56%), Gaps = 1/71 (1%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGS-CGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLG 513
           W     ++ I+DQG  CGSCWAF ++ ++     +Y N  K +  S ++LV+ C    +G
Sbjct: 233 WARTDAVSPIKDQGDHCGSCWAFSSIASVESLYRLYKN--KSYFLSEQELVN-CDKSSMG 289

Query: 514 CNGGMPTLAWE 546
           C GG+P  A E
Sbjct: 290 CAGGLPITALE 300


>UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;
           n=1; Strongylocentrotus purpuratus|Rep: PREDICTED:
           hypothetical protein - Strongylocentrotus purpuratus
          Length = 331

 Score = 54.4 bits (125), Expect = 2e-06
 Identities = 30/85 (35%), Positives = 45/85 (52%), Gaps = 3/85 (3%)
 Frame = +1

Query: 298 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 477
           +  +P  +D R   P  P +  +++Q SCG+CWAF  VE M  ++ +     +    SA+
Sbjct: 124 LKTMPLVYDLRSIKP--PVVTPVKNQKSCGACWAFSVVETMETQIAL--KTKRLTQLSAQ 179

Query: 478 DLVSCCPICG-LGCNGGMP--TLAW 543
           +LV C    G  GC GG+P  TL W
Sbjct: 180 ELVDCGTAAGDGGCRGGIPCKTLDW 204


>UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted,
           possible transmembrane domain near N-terminus; n=4;
           Cryptosporidium|Rep: Cryptopain-cysteine proteinase
           secreted, possible transmembrane domain near N-terminus
           - Cryptosporidium parvum Iowa II
          Length = 401

 Score = 54.4 bits (125), Expect = 2e-06
 Identities = 31/81 (38%), Positives = 41/81 (50%), Gaps = 3/81 (3%)
 Frame = +1

Query: 313 ENFDPRDK--WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 486
           E F P +   W E   +N IR+Q +CGSCWAF AV A+    C  +N       S +  V
Sbjct: 172 EEFVPPNSINWVEAGCVNPIRNQKNCGSCWAFSAVAALEGATCAQTNRGLP-SLSEQQFV 230

Query: 487 SCCPICG-LGCNGGMPTLAWE 546
            C    G  GC+GG   LA++
Sbjct: 231 DCSKQNGNFGCDGGTMGLAFQ 251


>UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia
           intestinalis|Rep: GLP_41_8294_9919 - Giardia lamblia
           ATCC 50803
          Length = 541

 Score = 54.0 bits (124), Expect = 2e-06
 Identities = 31/79 (39%), Positives = 43/79 (54%), Gaps = 4/79 (5%)
 Frame = +1

Query: 307 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH----FSA 474
           LP++FD RD       +  + DQG+CGSC+ FGAV+AM  R+ I +N T         S 
Sbjct: 241 LPDDFDWRDV-NGVSYIPGVLDQGACGSCFTFGAVQAMNSRIMIATNRTDPVGTKTILST 299

Query: 475 EDLVSCCPICGLGCNGGMP 531
           E  +  C +   GC+GG P
Sbjct: 300 EHALD-CNVYSQGCDGGFP 317


>UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           L-like proteinase precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 315

 Score = 54.0 bits (124), Expect = 2e-06
 Identities = 25/69 (36%), Positives = 38/69 (55%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 516
           W +   L  ++DQG CGSCWAF    ++  ++ I+ N  +    S ++LV C      GC
Sbjct: 117 WRDSAVLG-VKDQGQCGSCWAFSTTGSLEGQLAIHKN--QRVPLSEQELVDCDTSRNAGC 173

Query: 517 NGGMPTLAW 543
           NGG+ T A+
Sbjct: 174 NGGLMTDAF 182


>UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean
           endopeptidase) (Cysteine proteinase)
           (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
           Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor
           (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase)
           (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
           Vignain-2] - Vigna mungo (Rice bean) (Black gram)
          Length = 362

 Score = 54.0 bits (124), Expect = 2e-06
 Identities = 29/85 (34%), Positives = 45/85 (52%)
 Frame = +1

Query: 292 ELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFS 471
           E + ++P + D    W +   + +++DQG CGSCWAF  + A+     I +N  K    S
Sbjct: 123 EKVGSVPASVD----WRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTN--KLVSLS 176

Query: 472 AEDLVSCCPICGLGCNGGMPTLAWE 546
            ++LV C      GCNGG+   A+E
Sbjct: 177 EQELVDCDKEENQGCNGGLMESAFE 201


>UniRef50_P25774 Cluster: Cathepsin S precursor; n=78;
           Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens
           (Human)
          Length = 331

 Score = 54.0 bits (124), Expect = 2e-06
 Identities = 33/82 (40%), Positives = 48/82 (58%), Gaps = 2/82 (2%)
 Frame = +1

Query: 307 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 486
           LP++ D R+K   C T  E++ QGSCG+CWAF AV A+  ++ + +   K    SA++LV
Sbjct: 115 LPDSVDWREKG--CVT--EVKYQGSCGACWAFSAVGALEAQLKLKTG--KLVSLSAQNLV 168

Query: 487 SCC--PICGLGCNGGMPTLAWE 546
            C        GCNGG  T A++
Sbjct: 169 DCSTEKYGNKGCNGGFMTTAFQ 190


>UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium
           tetraurelia|Rep: Cathepsin L1 precursor - Paramecium
           tetraurelia
          Length = 314

 Score = 54.0 bits (124), Expect = 2e-06
 Identities = 28/62 (45%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
 Frame = +1

Query: 364 IRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLA 540
           +++QGSCGSCWAF AV A+     I  N  + +  S +DLV C  P    GCNGG    A
Sbjct: 126 VKNQGSCGSCWAFSAVGALEINTDIELN--RKYELSEQDLVDCSGPYDNDGCNGGWMDSA 183

Query: 541 WE 546
           +E
Sbjct: 184 FE 185


>UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16;
           Bromeliaceae|Rep: Fruit bromelain precursor - Ananas
           comosus (Pineapple)
          Length = 351

 Score = 54.0 bits (124), Expect = 2e-06
 Identities = 36/130 (27%), Positives = 61/130 (46%), Gaps = 1/130 (0%)
 Frame = +1

Query: 160 NKKQNTWKAGRN-FPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDK 336
           ++ +N++  G N F   T    +    G     NI + P V+ D   I+ +P++ D    
Sbjct: 73  SRNENSYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVSFDDVNISAVPQSID---- 128

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 516
           W +   +NE+++Q  CGSCW+F A+  +     IY   T +    +E  V  C +   GC
Sbjct: 129 WRDYGAVNEVKNQNPCGSCWSFAAIATVEG---IYKIKTGYLVSLSEQEVLDCAV-SYGC 184

Query: 517 NGGMPTLAWE 546
            GG    A++
Sbjct: 185 KGGWVNKAYD 194


>UniRef50_Q23TW3 Cluster: Papain family cysteine protease containing
           protein; n=5; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 437

 Score = 53.6 bits (123), Expect = 3e-06
 Identities = 29/85 (34%), Positives = 43/85 (50%), Gaps = 2/85 (2%)
 Frame = +1

Query: 298 IANLPENFDPRDKWPECPTLNEIRDQGS-CGSCWAFGAVEAMTDRVCIYSNATKHFHFSA 474
           ++ LP+  D    W E   + +++ QG  CGSCWAF AV A+     +     K   FS 
Sbjct: 202 LSQLPQYVD----WREKGVVTQVKSQGKDCGSCWAFAAVAALESHYAL-KTGKKPIQFSE 256

Query: 475 EDLVSCC-PICGLGCNGGMPTLAWE 546
           + LV C       GC+GG+P+  +E
Sbjct: 257 QQLVDCARKFDTKGCSGGLPSKGFE 281


>UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_56,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 314

 Score = 53.6 bits (123), Expect = 3e-06
 Identities = 24/61 (39%), Positives = 33/61 (54%)
 Frame = +1

Query: 355 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPT 534
           +  +++QG+CGSCWAF AV A+   + I    +K    S + LV C      GCNGG   
Sbjct: 122 ITSVKNQGNCGSCWAFSAVGAVETLLTIKGVISKDLWLSEQQLVDCDKGTNNGCNGGFEN 181

Query: 535 L 537
           L
Sbjct: 182 L 182


>UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 360

 Score = 53.2 bits (122), Expect = 4e-06
 Identities = 34/98 (34%), Positives = 46/98 (46%)
 Frame = +1

Query: 250 DDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDR 429
           DDN  K P +  D     NLP +FD RDK    P    ++ Q  CG CWAF  V+++   
Sbjct: 117 DDNKNKQPHLPTD-----NLPASFDWRDKGAITP----VKVQNGCGGCWAFSTVQSIEG- 166

Query: 430 VCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAW 543
              +    K    S + ++ CC I   GC GG P  A+
Sbjct: 167 -LYFLKTGKLESLSTQQVIDCCRIDESGCLGGDPEPAF 203


>UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           L-like proteinase precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 317

 Score = 53.2 bits (122), Expect = 4e-06
 Identities = 35/107 (32%), Positives = 51/107 (47%), Gaps = 5/107 (4%)
 Frame = +1

Query: 241 ALKDDNILKLPKVTHDAELIAN----LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGA 408
           A+ D  ++  PK    +  +A+    +PE+ D    W E   +N +RDQ  CGSCWAF A
Sbjct: 78  AMLDSQLIHKPKRDITSRFVADPQLTVPESID----WREKGAVNPVRDQEQCGSCWAFSA 133

Query: 409 VEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAWE 546
             A+  +   +    K    S + LV C       GCNGG P  A++
Sbjct: 134 AGALEGQ--RFLKEGKLEVLSTQQLVDCSRDYKNEGCNGGWPHWAYD 178


>UniRef50_Q23RT7 Cluster: Papain family cysteine protease containing
           protein; n=7; Hymenostomatida|Rep: Papain family
           cysteine protease containing protein - Tetrahymena
           thermophila SB210
          Length = 387

 Score = 53.2 bits (122), Expect = 4e-06
 Identities = 36/117 (30%), Positives = 53/117 (45%), Gaps = 5/117 (4%)
 Frame = +1

Query: 208 TPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCG 387
           T   + K +  A    N+ +  K T D   + +LP++ D    W +   +  ++DQG CG
Sbjct: 101 TTLGYSKTVKNAANKQNMFRNLK-TSDKINVKDLPKSVD----WRDAGVVTPVKDQGHCG 155

Query: 388 SCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP---ICG--LGCNGGMPTLAW 543
           SCWAF     +     I +   K    S + LVSC      CG   GCNG +  LA+
Sbjct: 156 SCWAFATTAVIESYAAIATGQLK--TLSTQQLVSCVQNSYQCGGQGGCNGAVSELAY 210


>UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like
           cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin B-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 288

 Score = 53.2 bits (122), Expect = 4e-06
 Identities = 38/132 (28%), Positives = 61/132 (46%), Gaps = 2/132 (1%)
 Frame = +1

Query: 154 LINKKQNTWKAGRN--FPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDP 327
           L  +K   W AG N  F   T F    ++ G         +P +    ++  ++P +++ 
Sbjct: 17  LKGEKDLPWVAGENERFKGMT-FKDASVISGNAHKLRPDTIP-LARPPKINISIPMSYNF 74

Query: 328 RDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG 507
            +++P+C     + DQG CGSCW+F   ++ + R C   N  K   FS   LV+ C    
Sbjct: 75  TERFPQCDF--GVLDQGKCGSCWSFAVSKSFSHRYCRKYN--KPVLFSQSHLVA-CDRRN 129

Query: 508 LGCNGGMPTLAW 543
            GC GG+   AW
Sbjct: 130 SGCGGGIEVNAW 141


>UniRef50_Q23FR0 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 394

 Score = 52.8 bits (121), Expect = 5e-06
 Identities = 27/60 (45%), Positives = 31/60 (51%), Gaps = 3/60 (5%)
 Frame = +1

Query: 355 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGL---GCNGG 525
           LN ++DQG CGSCW FGA   M     I +   K   FS + LV C    G    GCNGG
Sbjct: 196 LNPVKDQGQCGSCWTFGAAGVMESFNAITNGVLK--SFSEQQLVDCVHQAGFSSDGCNGG 253


>UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor;
           n=35; Fasciola|Rep: Cathepsin L-like proteinase
           precursor - Fasciola hepatica (Liver fluke)
          Length = 326

 Score = 52.8 bits (121), Expect = 5e-06
 Identities = 26/71 (36%), Positives = 36/71 (50%), Gaps = 1/71 (1%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLG 513
           W E   + E++DQG+CGSCWAF     M  +     N      FS + LV C  P    G
Sbjct: 114 WRESGYVTEVKDQGNCGSCWAFSTTGTMEGQ--YMKNERTSISFSEQQLVDCSGPWGNNG 171

Query: 514 CNGGMPTLAWE 546
           C+GG+   A++
Sbjct: 172 CSGGLMENAYQ 182


>UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 280

 Score = 52.4 bits (120), Expect = 7e-06
 Identities = 27/85 (31%), Positives = 46/85 (54%), Gaps = 3/85 (3%)
 Frame = +1

Query: 301 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 480
           ++LP+ FD    W     + ++++QG+CGSCWAF  +  + + + +  N T    +S ++
Sbjct: 66  SSLPQQFD----WRNLGKVTQVKNQGNCGSCWAF-TITGLFESINLIRNKTVEL-YSEQE 119

Query: 481 LVSCCP---ICGLGCNGGMPTLAWE 546
           L+ C         GC GG P LA+E
Sbjct: 120 LLDCSSNGIYRNSGCQGGWPHLAFE 144


>UniRef50_P81494 Cluster: Cathepsin B; n=2; Phasianidae|Rep:
           Cathepsin B - Coturnix coturnix japonica (Japanese
           quail)
          Length = 48

 Score = 52.4 bits (120), Expect = 7e-06
 Identities = 32/72 (44%), Positives = 39/72 (54%), Gaps = 1/72 (1%)
 Frame = +1

Query: 307 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 486
           LP+ FD R +WP CPT++EIRDQGS        +VE                  SAEDL+
Sbjct: 1   LPDTFDSRKQWPNCPTISEIRDQGSV-------SVEV-----------------SAEDLL 36

Query: 487 SCCPI-CGLGCN 519
           SCC   CG+GCN
Sbjct: 37  SCCGFECGMGCN 48


>UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC06231 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 372

 Score = 52.0 bits (119), Expect = 9e-06
 Identities = 37/126 (29%), Positives = 57/126 (45%), Gaps = 2/126 (1%)
 Frame = +1

Query: 175 TWKAG-RNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECP 351
           T+K G  NF   T +  ++ L G      I K    T  +   A LP+  D    W    
Sbjct: 106 TYKMGVNNFTDKTEY-ELRKLRGYRSACRIAKPKGSTFISSEHAKLPDRVD----WRRNG 160

Query: 352 TLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG-LGCNGGM 528
            +  +++QG CGSCWAF +  A+  +   Y    +  + S + L+ C    G  GC GG+
Sbjct: 161 AVTPVKNQGQCGSCWAFSSTGAIEGQ--HYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGL 218

Query: 529 PTLAWE 546
             LA++
Sbjct: 219 MDLAFQ 224


>UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2;
           Trypanosoma cruzi|Rep: Cysteine proteinase, putative -
           Trypanosoma cruzi
          Length = 392

 Score = 52.0 bits (119), Expect = 9e-06
 Identities = 34/86 (39%), Positives = 41/86 (47%), Gaps = 6/86 (6%)
 Frame = +1

Query: 307 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH-FSAEDL 483
           +P+  D R+  P    L  ++DQG CGSCWA GA E M     I    T   H  S + L
Sbjct: 141 IPDEVDYRNSSPAI--LTAVKDQGRCGSCWAHGAAEEMESHFAI---LTGRLHVLSQQQL 195

Query: 484 VSCCP---ICG--LGCNGGMPTLAWE 546
            SC P    CG   GC G    LA+E
Sbjct: 196 TSCAPNPKKCGGTGGCYGSTADLAYE 221


>UniRef50_Q24FN2 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 367

 Score = 52.0 bits (119), Expect = 9e-06
 Identities = 26/73 (35%), Positives = 42/73 (57%), Gaps = 4/73 (5%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC----PIC 504
           W +   ++ +++QGSCGSCWAF AV A+ + V +  N +    +S ++LV C        
Sbjct: 161 WRQSGAVSPVKNQGSCGSCWAFSAV-ALAESVNLLRNNSLAL-YSEQELVDCTYKNPQYY 218

Query: 505 GLGCNGGMPTLAW 543
             GC GG P++A+
Sbjct: 219 NYGCQGGWPSVAY 231


>UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor;
           n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 355

 Score = 52.0 bits (119), Expect = 9e-06
 Identities = 29/83 (34%), Positives = 42/83 (50%)
 Frame = +1

Query: 298 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 477
           I +LP++ D R K    P    ++DQG CGSCWAF  V A+     I +        S +
Sbjct: 134 ITDLPKSVDWRKKGAVAP----VKDQGQCGSCWAFSTVAAVEGINQITTGNLS--SLSEQ 187

Query: 478 DLVSCCPICGLGCNGGMPTLAWE 546
           +L+ C      GCNGG+   A++
Sbjct: 188 ELIDCDTTFNSGCNGGLMDYAFQ 210


>UniRef50_Q6XM39 Cluster: FirrV-1-A48 precursor; n=1; Feldmannia
           irregularis virus a|Rep: FirrV-1-A48 precursor -
           Feldmannia irregularis virus a
          Length = 373

 Score = 51.6 bits (118), Expect = 1e-05
 Identities = 23/61 (37%), Positives = 36/61 (59%), Gaps = 2/61 (3%)
 Frame = +1

Query: 370 DQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP-ICGLGCN-GGMPTLAW 543
           DQGSC SCW+   V+ + DRV + +N       S ++++SC     GL C+ GG+P  A+
Sbjct: 80  DQGSCASCWSISVVQMLADRVSVSTNGKIKLKLSVQEMISCWDGHDGLACSKGGVPEKAY 139

Query: 544 E 546
           +
Sbjct: 140 Q 140


>UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1;
           Phytophthora infestans|Rep: Cathepsin-like cysteine
           protease - Phytophthora infestans (Potato late blight
           fungus)
          Length = 376

 Score = 51.6 bits (118), Expect = 1e-05
 Identities = 29/79 (36%), Positives = 42/79 (53%), Gaps = 1/79 (1%)
 Frame = +1

Query: 292 ELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFS 471
           E + +LP  +D    W E  T+  +++QG CGSCWAF AV AM    C Y+ +T      
Sbjct: 128 ENVEDLPATWD----WREHSTVTPVKNQGQCGSCWAFSAVAAME---CAYALSTGTLESL 180

Query: 472 AEDLVSCCPICGLG-CNGG 525
           +E  +  C + G+  CN G
Sbjct: 181 SEQELVDCTLNGIDTCNHG 199


>UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gambiae
           str. PEST|Rep: ENSANGP00000018713 - Anopheles gambiae
           str. PEST
          Length = 559

 Score = 51.6 bits (118), Expect = 1e-05
 Identities = 28/81 (34%), Positives = 44/81 (54%), Gaps = 1/81 (1%)
 Frame = +1

Query: 286 DAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH 465
           D   + +LP +FD    W +   + E+++QGSCGSCWAF AV  +     ++   TK   
Sbjct: 332 DVAGVGDLPRSFD----WRDHGAVTEVKNQGSCGSCWAFSAVGNVEG---LHQIKTKKLE 384

Query: 466 -FSAEDLVSCCPICGLGCNGG 525
            +S ++L+ C  +   GC GG
Sbjct: 385 SYSEQELIDCDKVDN-GCGGG 404


>UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_23,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 321

 Score = 51.6 bits (118), Expect = 1e-05
 Identities = 34/102 (33%), Positives = 46/102 (45%), Gaps = 6/102 (5%)
 Frame = +1

Query: 238 GALKDDNILKLPKVTHDAELIANLPENFDP-----RDKWPECPTLNEIRDQGSCGSCWAF 402
           G L D   L +         + N+ +N +P        W +   +  I+DQG CGSCWAF
Sbjct: 87  GDLTDQEFLTIYLNLQMPARVKNIQKNEEPFLVQEEVDWVQKGKVPAIKDQGDCGSCWAF 146

Query: 403 GAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGG 525
            AV A+     I  N  +    S +DLV C  P    GC+GG
Sbjct: 147 SAVGALEINTKIQFN--EIVDLSEQDLVDCAGPYGNAGCDGG 186


>UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18;
           Plasmodium|Rep: Cysteine proteinase precursor -
           Plasmodium vivax (strain Salvador I)
          Length = 583

 Score = 51.6 bits (118), Expect = 1e-05
 Identities = 30/83 (36%), Positives = 46/83 (55%), Gaps = 2/83 (2%)
 Frame = +1

Query: 289 AELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKH--F 462
           A L+A++PE  D R+K      ++E +DQG CGSCWAF +V  +    C+Y+        
Sbjct: 333 ANLLADVPEILDYREKG----IVHEPKDQGLCGSCWAFASVGNVE---CMYAKEHNKTIL 385

Query: 463 HFSAEDLVSCCPICGLGCNGGMP 531
             S +++V C  +   GC+GG P
Sbjct: 386 TLSEQEVVDCSKL-NFGCDGGHP 407


>UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin
           L - Misgurnus mizolepis (Mud loach)
          Length = 337

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 26/71 (36%), Positives = 37/71 (52%), Gaps = 1/71 (1%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLG 513
           W E   +  ++DQG CGSCWAF    AM  +  ++    K    S ++LV C  P    G
Sbjct: 122 WREKGYVTPVKDQGECGSCWAFSTTGAMEGQ--MFRKQGKLVSLSEQNLVDCSRPEGNEG 179

Query: 514 CNGGMPTLAWE 546
           CNGG+   A++
Sbjct: 180 CNGGLMDQAFQ 190


>UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4;
           core eudicotyledons|Rep: Papain-like cysteine peptidase
           XBCP3 - Arabidopsis thaliana (Mouse-ear cress)
          Length = 437

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 24/70 (34%), Positives = 36/70 (51%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 516
           W +   +  ++DQGSCG+CW+F A  AM     I +        S ++L+ C      GC
Sbjct: 124 WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDL--ISLSEQELIDCDKSYNAGC 181

Query: 517 NGGMPTLAWE 546
           NGG+   A+E
Sbjct: 182 NGGLMDYAFE 191


>UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1;
           Uronema marinum|Rep: Cathepsin L-like cysteine protease
           - Uronema marinum
          Length = 333

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 28/69 (40%), Positives = 37/69 (53%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 516
           W     +  +++QG CGSCWAF AV ++     I  N  K   FS + LVSC P    GC
Sbjct: 126 WVSKGAVQGVQNQGVCGSCWAFSAVCSLERLYKI--NTGKLLSFSEQQLVSCEP-KSYGC 182

Query: 517 NGGMPTLAW 543
           +GG P  A+
Sbjct: 183 DGGWPEAAF 191


>UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathepsin
           o - Aedes aegypti (Yellowfever mosquito)
          Length = 375

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 29/94 (30%), Positives = 43/94 (45%)
 Frame = +1

Query: 244 LKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMT 423
           +KDD I    K   D +++  LP+  D RDK    P    +R QGSCG+CWA   V+ +T
Sbjct: 134 MKDDIIFSRAK--RDLKILDYLPKVVDWRDKGVVAP----VRSQGSCGACWAISVVDTIT 187

Query: 424 DRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGG 525
             +              + +++C      GC GG
Sbjct: 188 S-ISAIKRQQNFSELCLDQVINCAGNGNFGCEGG 220


>UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra
           erinaceieuropaei|Rep: Cysteine proteinase - Spirometra
           erinaceieuropaei (Tapeworm)
          Length = 336

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 29/94 (30%), Positives = 45/94 (47%), Gaps = 1/94 (1%)
 Frame = +1

Query: 268 LPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSN 447
           L K+     +   L EN      W E   +  +++QG CGSCW+F A  A+   + I + 
Sbjct: 104 LTKLRRKEAVSVPLKENLPDSVNWRERGAVTSVKNQGQCGSCWSFSANGAIEGAIQIKTG 163

Query: 448 ATKHFHFSAEDLVSCCPICG-LGCNGGMPTLAWE 546
           A +    S + L+ C    G  GCNGG+   A++
Sbjct: 164 ALR--SLSEQQLMDCSWDYGNQGCNGGLMPQAFQ 195


>UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia
           bovis|Rep: Cysteine protease 2 - Babesia bovis
          Length = 445

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 26/70 (37%), Positives = 36/70 (51%)
 Frame = +1

Query: 316 NFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC 495
           NF+  D W     +  ++DQG CGSCWAF AV ++     +          S ++LVS C
Sbjct: 236 NFEDID-WRRADAVTPVKDQGMCGSCWAFAAVGSVES---LLKRQKTDVRLSEQELVS-C 290

Query: 496 PICGLGCNGG 525
            +   GCNGG
Sbjct: 291 QLGNQGCNGG 300


>UniRef50_Q649T1 Cluster: Cathepsin C; n=1; uncultured archaeon
           GZfos34G5|Rep: Cathepsin C - uncultured archaeon
           GZfos34G5
          Length = 760

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 43/144 (29%), Positives = 64/144 (44%), Gaps = 4/144 (2%)
 Frame = +1

Query: 106 VVASDLPHPLSDAFINLINKKQNTWKAGRNFPTHTPFAHIKILMG--ALKDDNILKLPKV 279
           + A++   P S+    +I +K   W AG    +   F   K+L G  +L    IL   + 
Sbjct: 236 ITATNKTKPSSEEIQRVIEEKGAKWTAGETSVSDLTFEEKKMLCGIKSLYGLRILSTEER 295

Query: 280 THDAELIANLP-ENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCI-YSNAT 453
                L A++P   FD RDK      +  +++QGSCGSC AFG + A+   + I  +N +
Sbjct: 296 VRVVALDASVPIGTFDWRDK-DGANWITSVKEQGSCGSCVAFGTIGALEPLIRIDKNNPS 354

Query: 454 KHFHFSAEDLVSCCPICGLGCNGG 525
                S   L  C    G  C GG
Sbjct: 355 MPMDLSEAHLFFC---GGGTCTGG 375


>UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Lymantria
           dispar multicapsid nuclear polyhedrosis virus (LdMNPV)
          Length = 356

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 26/85 (30%), Positives = 45/85 (52%), Gaps = 1/85 (1%)
 Frame = +1

Query: 295 LIANLPENFDPRD-KWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFS 471
           +I N P +  P    W E   +  I++QG+CG+CWAF  + ++  +  +  N  +    S
Sbjct: 135 IILNQPPDKGPLHFDWREQNKVTSIKNQGACGACWAFATLASVESQFAMRHN--RLIDLS 192

Query: 472 AEDLVSCCPICGLGCNGGMPTLAWE 546
            + L+ C  +  +GCNGG+   A+E
Sbjct: 193 EQQLIDCDSV-DMGCNGGLLHTAFE 216


>UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 336

 Score = 50.8 bits (116), Expect = 2e-05
 Identities = 30/87 (34%), Positives = 44/87 (50%), Gaps = 4/87 (4%)
 Frame = +1

Query: 283 HDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHF 462
           H A+ +  LP +FD    W +   L++++DQG CGSCWAF +   + + +    N  K  
Sbjct: 118 HTAQDV-QLPASFD----WRDYGILSDVKDQGQCGSCWAF-STTGILEALYFMENRQK-I 170

Query: 463 HFSAEDLVSCCP----ICGLGCNGGMP 531
            FS + LV C          GC+GG P
Sbjct: 171 SFSEQQLVDCATNSNGFNSYGCSGGWP 197


>UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas
           foetus|Rep: TFCP2 protein - Tritrichomonas foetus
           (Trichomonas foetus)
          Length = 270

 Score = 50.8 bits (116), Expect = 2e-05
 Identities = 31/88 (35%), Positives = 41/88 (46%), Gaps = 2/88 (2%)
 Frame = +1

Query: 283 HDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHF 462
           H+     + P +FD    W     +N I++QGSCGSCWAF A+ A     C      +  
Sbjct: 42  HERIQYKDTPTSFD----WRSEGKVNPIKNQGSCGSCWAFSAIAAQES--CHAIATGELL 95

Query: 463 HFSAEDLVSC--CPICGLGCNGGMPTLA 540
            FS + LV C        GC+GG P  A
Sbjct: 96  RFSEQSLVDCVTSDYSCQGCSGGWPDQA 123


>UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=17; Trichomonas vaginalis|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 318

 Score = 50.8 bits (116), Expect = 2e-05
 Identities = 23/70 (32%), Positives = 36/70 (51%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 516
           W     +N I+DQ  CGSCWAF  V+A   +  +     +    + +++V C   C  GC
Sbjct: 106 WRNAKIVNPIKDQAQCGSCWAFSVVQAQESQWALKKG--QLLSLAEQNMVDCVDTC-YGC 162

Query: 517 NGGMPTLAWE 546
           +GG   LA++
Sbjct: 163 DGGDEYLAYD 172


>UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia
           medanensis|Rep: Sui m 1 allergen - Suidasia medanensis
          Length = 336

 Score = 50.8 bits (116), Expect = 2e-05
 Identities = 31/92 (33%), Positives = 43/92 (46%), Gaps = 6/92 (6%)
 Frame = +1

Query: 286 DAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH 465
           ++++   LP  FD R +W        +R+QG CGSCWAF     +  +  I  N   H  
Sbjct: 108 ESDISVALPAAFDWRQQWNTA-----VRNQGQCGSCWAFATAATVEAQYAIRKNV--HVT 160

Query: 466 FSAEDLVSC--CPICGL----GCNGGMPTLAW 543
            S + LV C   P  G     GC GG P +A+
Sbjct: 161 LSEQQLVDCDHRPFQGQYEDHGCQGGNPIIAY 192


>UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Spodoptera
           litura multicapsid nucleopolyhedrovirus (SpltMNPV)
          Length = 337

 Score = 50.8 bits (116), Expect = 2e-05
 Identities = 26/82 (31%), Positives = 45/82 (54%)
 Frame = +1

Query: 301 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 480
           A  PE+FD    W +   + ++++QG CGSCWAF A+  +  +  I  ++      S + 
Sbjct: 124 ARTPESFD----WRKLNKVTKVKEQGVCGSCWAFAAIGNIESQYAIMHDSL--IDLSEQQ 177

Query: 481 LVSCCPICGLGCNGGMPTLAWE 546
           L+ C  +   GC+GG+  LA++
Sbjct: 178 LLDCDRV-DQGCDGGLMHLAFQ 198


>UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium
           falciparum|Rep: Falcipain 2 - Plasmodium falciparum
          Length = 484

 Score = 50.4 bits (115), Expect = 3e-05
 Identities = 28/79 (35%), Positives = 41/79 (51%), Gaps = 1/79 (1%)
 Frame = +1

Query: 313 ENFDPRD-KWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVS 489
           ENFD     W     +  ++DQ +CGSCWAF ++ ++  +  I  N  K    S ++LV 
Sbjct: 258 ENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKN--KLITLSEQELVD 315

Query: 490 CCPICGLGCNGGMPTLAWE 546
            C     GCNGG+   A+E
Sbjct: 316 -CSFKNYGCNGGLINNAFE 333


>UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_36,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 307

 Score = 50.4 bits (115), Expect = 3e-05
 Identities = 25/62 (40%), Positives = 33/62 (53%)
 Frame = +1

Query: 355 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPT 534
           +N I++QG+CGSCW F A+ A+   + I          S + LV C    G GCNGG   
Sbjct: 118 MNPIKNQGNCGSCWTFSAIGAVEGFLAIRKGFKG--VLSEQQLVDCAVDAGEGCNGGNSD 175

Query: 535 LA 540
           LA
Sbjct: 176 LA 177


>UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5;
           Theileria|Rep: Cysteine proteinase precursor - Theileria
           parva
          Length = 440

 Score = 50.4 bits (115), Expect = 3e-05
 Identities = 28/97 (28%), Positives = 44/97 (45%)
 Frame = +1

Query: 256 NILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVC 435
           N+ K      D +L     EN D    W    ++  ++DQ +CG CWAF  V ++     
Sbjct: 212 NLKKALNTDEDVDLAKLTGENLD----WRRSSSVTSVKDQSNCGGCWAFSTVGSVEG--Y 265

Query: 436 IYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWE 546
             S+  K +  S ++L+ C      GC GG+   A+E
Sbjct: 266 YMSHFDKSYELSVQELLDCDSFSN-GCQGGLLESAYE 301


>UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3;
           Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays
           (Maize)
          Length = 493

 Score = 50.0 bits (114), Expect = 4e-05
 Identities = 23/64 (35%), Positives = 34/64 (53%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 516
           W E   + E++DQG CG CWAF AV A+     I + +      S ++L+ C      GC
Sbjct: 170 WRERGAVAEVKDQGQCGGCWAFSAVAAVEGINKIVTGSL--ISLSEQELIDCDKFQDQGC 227

Query: 517 NGGM 528
           +GG+
Sbjct: 228 DGGL 231


>UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2;
           Taeniidae|Rep: Cathepsin L-like cysteine proteinase -
           Taenia solium (Pork tapeworm)
          Length = 339

 Score = 50.0 bits (114), Expect = 4e-05
 Identities = 30/83 (36%), Positives = 42/83 (50%), Gaps = 1/83 (1%)
 Frame = +1

Query: 301 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 480
           A LP+  D RDK      + E+++QG+CGSCWAF +  A+           K    S + 
Sbjct: 122 AGLPDTVDWRDK----NLVTEVKNQGNCGSCWAFSSTGALEG--AFAKKTGKLISLSEQQ 175

Query: 481 LVSCCPICGL-GCNGGMPTLAWE 546
           LV C    G  GCNGG  + A++
Sbjct: 176 LVDCSLKNGNDGCNGGYMSYAFK 198


>UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain -
           Tetrahymena pyriformis
          Length = 330

 Score = 50.0 bits (114), Expect = 4e-05
 Identities = 24/69 (34%), Positives = 31/69 (44%), Gaps = 3/69 (4%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGL-- 510
           W     L  +++Q  CGSCWAF     +     I+ +      FS + LV CC   G   
Sbjct: 126 WTAKNVLPPVKNQQQCGSCWAFSTAGMLEGVYNIHESPQTPISFSEQQLVDCCGAQGFGC 185

Query: 511 -GCNGGMPT 534
            GCNG  PT
Sbjct: 186 EGCNGAWPT 194


>UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;
           n=17; Magnoliophyta|Rep: Thiol protease aleurain-like
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 358

 Score = 50.0 bits (114), Expect = 4e-05
 Identities = 25/76 (32%), Positives = 38/76 (50%), Gaps = 1/76 (1%)
 Frame = +1

Query: 322 DPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-P 498
           D +D W E   ++ +++QG CGSCW F    A+      +    K    S + LV C   
Sbjct: 143 DTKD-WREDGIVSPVKEQGHCGSCWTFSTTGAL--EAAYHQAFGKGISLSEQQLVDCAGT 199

Query: 499 ICGLGCNGGMPTLAWE 546
               GC+GG+P+ A+E
Sbjct: 200 FNNFGCHGGLPSQAFE 215


>UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin
           L-like protease; n=1; Nasonia vitripennis|Rep:
           PREDICTED: similar to cathepsin L-like protease -
           Nasonia vitripennis
          Length = 353

 Score = 49.6 bits (113), Expect = 5e-05
 Identities = 29/83 (34%), Positives = 44/83 (53%), Gaps = 2/83 (2%)
 Frame = +1

Query: 304 NLPENFDPRDKWPECPTLNEIRDQG-SCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 480
           N+PE+ D    W +   +  +RDQG +CGSCWAF A  A+  +   +         SA++
Sbjct: 131 NVPEHVD----WRQRGAVTPVRDQGLTCGSCWAFSAAGALEAQ--YFKKTGVLTALSAQN 184

Query: 481 LVSCCPICG-LGCNGGMPTLAWE 546
           L+ C    G LGC GG   L+++
Sbjct: 185 LIDCTMEYGNLGCGGGSAALSFQ 207


>UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep:
           MGC107932 protein - Xenopus tropicalis (Western clawed
           frog) (Silurana tropicalis)
          Length = 333

 Score = 49.6 bits (113), Expect = 5e-05
 Identities = 27/71 (38%), Positives = 36/71 (50%), Gaps = 1/71 (1%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGS-CGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLG 513
           W +   +  +++QG+ CGSCWAF  V  M  R CI     +  + S + LV C  I   G
Sbjct: 121 WRKSNCVTPVKNQGTFCGSCWAFATVGVMESRYCI--RTKELLNLSEQQLVDCDEI-NEG 177

Query: 514 CNGGMPTLAWE 546
           C GG P  A E
Sbjct: 178 CCGGFPIKALE 188


>UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella
           histriomuscorum|Rep: Cathepsin H - Oxytricha trifallax
           (Sterkiella histriomuscorum)
          Length = 366

 Score = 49.6 bits (113), Expect = 5e-05
 Identities = 26/83 (31%), Positives = 42/83 (50%), Gaps = 1/83 (1%)
 Frame = +1

Query: 301 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 480
           AN+P  +D    W     ++ +++QG CGSCW F  V  +     +   A +  + S + 
Sbjct: 133 ANIPTEWD----WRTFGVVSPVKNQGKCGSCWTFSTVGCVESHYLLKYGAFR--NLSEQQ 186

Query: 481 LVSCC-PICGLGCNGGMPTLAWE 546
           LV C       GC+GG+P+ A+E
Sbjct: 187 LVDCAGDYDNHGCSGGLPSHAFE 209


>UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3;
           Bilateria|Rep: Cathepsin L-like cysteine proteinase -
           Longidorus elongatus
          Length = 358

 Score = 49.6 bits (113), Expect = 5e-05
 Identities = 29/81 (35%), Positives = 43/81 (53%), Gaps = 4/81 (4%)
 Frame = +1

Query: 295 LIANLPENFDPRDK--WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHF 468
           +I  +P+N    D   W +   + +++DQGSCGSCWAF A  ++  +   Y    K    
Sbjct: 129 MIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCWAFSATGSLEGQ--HYKQTGKLVSL 186

Query: 469 SAEDLVSCCPICG--LGCNGG 525
           S ++LV  C + G   GCNGG
Sbjct: 187 SEQNLVD-CDVNGDDEGCNGG 206


>UniRef50_Q22W19 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 332

 Score = 49.6 bits (113), Expect = 5e-05
 Identities = 25/79 (31%), Positives = 38/79 (48%)
 Frame = +1

Query: 289 AELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHF 468
           + LI +L  +  P   W +   +  +++QG CGSCWAF  V  +      Y+ AT +   
Sbjct: 113 SHLIYSLKGDVAPSIDWRQKNAVTPVKNQGQCGSCWAFSTVGGLEG---AYAIATGNLTS 169

Query: 469 SAEDLVSCCPICGLGCNGG 525
            +E  +  C     GCNGG
Sbjct: 170 FSEQQIVDCSKANAGCNGG 188


>UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precursor;
           n=3; Plasmodium (Laverania)|Rep: Trophozoite cysteine
           proteinase precursor - Plasmodium falciparum
          Length = 569

 Score = 49.6 bits (113), Expect = 5e-05
 Identities = 37/116 (31%), Positives = 58/116 (50%), Gaps = 5/116 (4%)
 Frame = +1

Query: 199 PTHTPFAHIKILMGALKDDNILKLPKVTH----DAELIANLPENFDPRDKWPECPTLNEI 366
           P H    + K     LKD NIL     T+    + ++ + +PE  D R+K      ++E 
Sbjct: 294 PNHMIEKYSKPFENHLKD-NILISEFYTNGKRNEKDIFSKVPEILDYREKG----IVHEP 348

Query: 367 RDQGSCGSCWAFGAVEAMTDRVCIYSNATKH-FHFSAEDLVSCCPICGLGCNGGMP 531
           +DQG CGSCWAF +V  +     +++   K+   FS +++V C      GC+GG P
Sbjct: 349 KDQGLCGSCWAFASVGNIES---VFAKKNKNILSFSEQEVVDCSK-DNFGCDGGHP 400


>UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46;
           Eukaryota|Rep: Cysteine proteinase 1 precursor - Zea
           mays (Maize)
          Length = 371

 Score = 49.6 bits (113), Expect = 5e-05
 Identities = 39/128 (30%), Positives = 58/128 (45%), Gaps = 12/128 (9%)
 Frame = +1

Query: 196 FPTHTPFAHIKILMGALKDDNIL--KLPKVTHDAELIAN--LPENFDPRDKWPECPTLNE 363
           F   TP    +  +G  K    L  +L +  H+A ++    LP++FD    W +   +  
Sbjct: 96  FSDLTPAEFRRTYLGLRKSRRALLRELGESAHEAPVLPTDGLPDDFD----WRDHGAVGP 151

Query: 364 IRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSC---C-----PICGLGCN 519
           +++QGSCGSCW+F A  A+      Y    K    S +  V C   C       C  GCN
Sbjct: 152 VKNQGSCGSCWSFSASGALEG--AHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCN 209

Query: 520 GGMPTLAW 543
           GG+ T A+
Sbjct: 210 GGLMTTAF 217


>UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15)
           (Major excreted protein) (MEP) [Contains: Cathepsin L
           heavy chain; Cathepsin L light chain]; n=19;
           Eukaryota|Rep: Cathepsin L precursor (EC 3.4.22.15)
           (Major excreted protein) (MEP) [Contains: Cathepsin L
           heavy chain; Cathepsin L light chain] - Homo sapiens
           (Human)
          Length = 333

 Score = 49.6 bits (113), Expect = 5e-05
 Identities = 24/71 (33%), Positives = 38/71 (53%), Gaps = 1/71 (1%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLG 513
           W E   +  +++QG CGSCWAF A  A+  +  ++    +    S ++LV C  P    G
Sbjct: 120 WREKGYVTPVKNQGQCGSCWAFSATGALEGQ--MFRKTGRLISLSEQNLVDCSGPQGNEG 177

Query: 514 CNGGMPTLAWE 546
           CNGG+   A++
Sbjct: 178 CNGGLMDYAFQ 188


>UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3;
           Eukaryota|Rep: Cathepsin-like cysteine protease -
           Phytophthora infestans (Potato late blight fungus)
          Length = 635

 Score = 49.2 bits (112), Expect = 6e-05
 Identities = 31/91 (34%), Positives = 48/91 (52%), Gaps = 3/91 (3%)
 Frame = +1

Query: 283 HDAELIANLPENFDPRD-KWPECPTLNEIRDQGS-CGSCWAFGAVEAMTDRVCIYSNAT- 453
           H+   + +LP+++D RD       T ++ +     CGSCWA G   A++DR+ I  NA+ 
Sbjct: 354 HETMDVTDLPKSWDWRDVNGKNYVTWDKNQHIPKYCGSCWAQGTTSALSDRISILRNASW 413

Query: 454 KHFHFSAEDLVSCCPICGLGCNGGMPTLAWE 546
                S + L++C    G  CNGG P L +E
Sbjct: 414 PEIALSPQVLINC--HAGGTCNGGNPGLVYE 442



 Score = 37.9 bits (84), Expect = 0.15
 Identities = 21/56 (37%), Positives = 32/56 (57%), Gaps = 3/56 (5%)
 Frame = +1

Query: 283 HDAELIANLPENFDPRDKWPECPTLNEIRDQGS---CGSCWAFGAVEAMTDRVCIY 441
           HD   ++ LP+NFD R+       ++  R+Q     CGSCW+F A  A+ DR+ I+
Sbjct: 48  HDYIDVSKLPKNFDWRNV-NGTRYVSISRNQHIPHYCGSCWSFAATSALADRILIF 102


>UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_567_6496_7413 - Giardia lamblia ATCC
           50803
          Length = 305

 Score = 49.2 bits (112), Expect = 6e-05
 Identities = 47/159 (29%), Positives = 63/159 (39%), Gaps = 7/159 (4%)
 Frame = +1

Query: 88  LACILAVVASDLPHPLSDAFINLINKKQNT-WKAG--RNFPTHTPFAHIKILMG-ALKDD 255
           L  +L V     P   S   +  +NKK+N  W+AG    F   T     K+    A    
Sbjct: 2   LFAVLVVAVLSTPF-YSPHLLKYLNKKENKLWEAGIPAKFANRTHDEVTKMFFPHAFLRP 60

Query: 256 NILKLPKVT---HDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTD 426
           NI +   V     D    A  P+  D R   PEC    E  DQ  C  C+AF  + A++ 
Sbjct: 61  NIPRYYGVNITEDDLYPPAGSPDRLDYRQTHPEC--FFEPEDQKECSCCYAFATLGALST 118

Query: 427 RVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAW 543
           R CI     +    S + +VS C     GC GG    +W
Sbjct: 119 RRCIAKLDPQAVSLSVQHMVS-CDSGEAGCQGGEFESSW 156


>UniRef50_Q24HI2 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 356

 Score = 49.2 bits (112), Expect = 6e-05
 Identities = 21/71 (29%), Positives = 37/71 (52%), Gaps = 1/71 (1%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLG 513
           W +   ++ ++DQ +CGSCW F    A+     I+ +  +    S + L+ C       G
Sbjct: 133 WKDLNKVSPVKDQQNCGSCWTFSTTGAIESHYAIFED-VEPTSLSEQQLIDCAGAFNNNG 191

Query: 514 CNGGMPTLAWE 546
           C+GG+P+ A+E
Sbjct: 192 CSGGLPSQAFE 202


>UniRef50_Q231X3 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 323

 Score = 49.2 bits (112), Expect = 6e-05
 Identities = 21/64 (32%), Positives = 31/64 (48%), Gaps = 1/64 (1%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG-LG 513
           W E   ++ ++ QG+CGSCWAF A  ++   + I     K    S + L+ C    G  G
Sbjct: 121 WVEAGKVSNVKSQGNCGSCWAFSATASVESALIIAGKVDKSISLSEQQLIDCSGDYGNYG 180

Query: 514 CNGG 525
           C  G
Sbjct: 181 CAAG 184


>UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes
           vastus|Rep: Cathepsin L - Aphrocallistes vastus
          Length = 329

 Score = 49.2 bits (112), Expect = 6e-05
 Identities = 29/82 (35%), Positives = 42/82 (51%), Gaps = 1/82 (1%)
 Frame = +1

Query: 304 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 483
           +LP   D R K    P    +++QG CGSCW+F A  ++  +  I S   K   FS ++L
Sbjct: 114 DLPTTVDWRSKGVVTP----VKNQGQCGSCWSFSATGSLEGQYAIKSG--KLVSFSEQEL 167

Query: 484 VSCCPICG-LGCNGGMPTLAWE 546
           V C    G  GC GG+   A++
Sbjct: 168 VDCSTSLGNHGCQGGLMDYAFK 189


>UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 452

 Score = 49.2 bits (112), Expect = 6e-05
 Identities = 30/84 (35%), Positives = 44/84 (52%), Gaps = 2/84 (2%)
 Frame = +1

Query: 280 THDAELIANLPENFDPRDKWPECPTLNEI-RDQGSCGSCWAFGAVEAMTDRVCIYSNATK 456
           T+D ++I NLPE+F     W   P + E   DQ  CG+C+AFGA EA+  +  + +N  +
Sbjct: 216 TYDQKVIQNLPESFS----WRNVPYVLEYPHDQAVCGTCFAFGASEAINGQFSLRAN--R 269

Query: 457 HFHFSAEDLVSCC-PICGLGCNGG 525
               S + LV C        C+GG
Sbjct: 270 SIITSVQQLVDCTWGTINYACDGG 293


>UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine
           protease; n=1; Strongylocentrotus purpuratus|Rep:
           PREDICTED: similar to cysteine protease -
           Strongylocentrotus purpuratus
          Length = 494

 Score = 48.8 bits (111), Expect = 8e-05
 Identities = 40/135 (29%), Positives = 60/135 (44%), Gaps = 2/135 (1%)
 Frame = +1

Query: 148 INLINK-KQNTWKAG-RNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENF 321
           + + N+ +Q T K G   F   T     K+  G LK   I K   +         +PE +
Sbjct: 190 VEMFNQFEQGTAKYGPTKFADMTEAEFRKLQSGPLKKTGIKKQAAIPQGP-----VPEEY 244

Query: 322 DPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPI 501
           D    W     +  +++QG CGSCWAF A+  M  +  I     +    S ++LV C  +
Sbjct: 245 D----WRTHGAVTPVKNQGMCGSCWAFSAIGNMEGQWQIKKG--ELISLSEQELVDCDKV 298

Query: 502 CGLGCNGGMPTLAWE 546
            G GC GG  + A+E
Sbjct: 299 DG-GCEGGEMSDAYE 312


>UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1;
           Bigelowiella natans|Rep: Digestive cysteine proteinase -
           Bigelowiella natans (Pedinomonas minutissima)
           (Chlorarachnion sp.(strain CCMP 621))
          Length = 360

 Score = 48.8 bits (111), Expect = 8e-05
 Identities = 26/67 (38%), Positives = 31/67 (46%), Gaps = 2/67 (2%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNAT--KHFHFSAEDLVSCCPICGL 510
           W +   L  ++DQG CGSCWAF A +A+     I  N T       S E LV  C     
Sbjct: 115 WRDFNALTPVKDQGGCGSCWAFSATQALESAHYIKHNDTLDSPIALSTEQLVE-CDQHDY 173

Query: 511 GCNGGMP 531
            C GG P
Sbjct: 174 ACYGGFP 180


>UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep:
           Cathepsin - Geodia cydonium (Sponge)
          Length = 322

 Score = 48.8 bits (111), Expect = 8e-05
 Identities = 32/87 (36%), Positives = 47/87 (54%), Gaps = 2/87 (2%)
 Frame = +1

Query: 292 ELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNAT-KHFHF 468
           E ++ LP   D R K      +  +++QG CGSCWAF A  ++  +   + NAT K    
Sbjct: 98  EDVSALPTTVDWRTKG----YVTGVKNQGQCGSCWAFSATGSLEGQ---HFNATGKLVSL 150

Query: 469 SAEDLVSCCPICG-LGCNGGMPTLAWE 546
           S ++LV C    G  GCNGG+P  A++
Sbjct: 151 SEQNLVDCSSAEGNEGCNGGLPDDAFK 177


>UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5;
           Piroplasmida|Rep: Cysteine proteinase, putative -
           Theileria parva
          Length = 460

 Score = 48.8 bits (111), Expect = 8e-05
 Identities = 33/106 (31%), Positives = 52/106 (49%), Gaps = 3/106 (2%)
 Frame = +1

Query: 217 AHIKILMGAL-KDDNILKLPKVTHDAELIANLPENFDPRD-KWPECPTLNEIRDQG-SCG 387
           +H+  LM  +  D+  LK  K   + +   + P+N       W +   +++I++QG  CG
Sbjct: 214 SHVDRLMARMVSDETYLKNLKKALNTDKDVD-PKNITGEGLDWRKADGVSKIKNQGLECG 272

Query: 388 SCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGG 525
           SCWAF +V ++     IY N T     S ++LV  C     GC GG
Sbjct: 273 SCWAFASVSSVESLYKIYRNVT--LDLSEQELVD-CETSSKGCEGG 315


>UniRef50_Q23FS6 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 330

 Score = 48.8 bits (111), Expect = 8e-05
 Identities = 23/70 (32%), Positives = 33/70 (47%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 516
           W     ++ ++DQG CGSCWAF    ++   + I   A +    S + LV  C     GC
Sbjct: 123 WVTRGKVSAVKDQGQCGSCWAFSTTGSVESALIIAGYANQTIDLSEQQLVD-CSATNYGC 181

Query: 517 NGGMPTLAWE 546
            GG    A+E
Sbjct: 182 GGGWMDNAFE 191


>UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2;
           Platyhelminthes|Rep: Cathepsin L-like proteinase -
           Echinococcus multilocularis
          Length = 338

 Score = 48.8 bits (111), Expect = 8e-05
 Identities = 25/64 (39%), Positives = 32/64 (50%), Gaps = 1/64 (1%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG-LG 513
           W +   +  I+DQG CGSCWAF A  A+  +  +     K    S + LV C    G  G
Sbjct: 128 WRKKGLVTPIKDQGDCGSCWAFSATGALEGQ--LKRKTGKLISLSEQQLVDCSTYTGNEG 185

Query: 514 CNGG 525
           CNGG
Sbjct: 186 CNGG 189


>UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:
           Aca s 1 allergen - Acarus siro (Dust mite)
          Length = 331

 Score = 48.8 bits (111), Expect = 8e-05
 Identities = 32/87 (36%), Positives = 40/87 (45%), Gaps = 6/87 (6%)
 Frame = +1

Query: 304 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 483
           NLPE FD R K      L  I +QG CG+CWAF ++  +     I  N   H   S ++L
Sbjct: 108 NLPETFDWRSK------LGPIENQGRCGACWAFASLATVEAAFAIKYNT--HIRLSKQEL 159

Query: 484 VSC------CPICGLGCNGGMPTLAWE 546
           V C       P    GC GG    +WE
Sbjct: 160 VECTRESDHTPYENSGCQGG---YSWE 183


>UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, whole
           genome shotgun sequence; n=7; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_22,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 350

 Score = 48.8 bits (111), Expect = 8e-05
 Identities = 26/77 (33%), Positives = 37/77 (48%), Gaps = 2/77 (2%)
 Frame = +1

Query: 316 NFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC 495
           N  P D W     +N+++DQG CGSCWAF     +     + +        S + LV C 
Sbjct: 142 NATPID-WRTRGAVNKVKDQGQCGSCWAFSTTGVLEGFYKVQTGELP--DLSEQQLVDCS 198

Query: 496 PICGL--GCNGGMPTLA 540
            +     GC+GGMP+ A
Sbjct: 199 TLIDFNQGCDGGMPSRA 215


>UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_21,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 349

 Score = 48.8 bits (111), Expect = 8e-05
 Identities = 23/60 (38%), Positives = 34/60 (56%), Gaps = 3/60 (5%)
 Frame = +1

Query: 355 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC---PICGLGCNGG 525
           ++E+++QGSCGSCWAF AV A+     +     K+   S ++LV C         GC+GG
Sbjct: 137 VSEVKNQGSCGSCWAFSAVAAL--ETALRQGGVKNVELSEQELVDCAVKDEFESEGCDGG 194


>UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163
           precursor; n=4; Schizophora|Rep: Putative cysteine
           proteinase CG12163 precursor - Drosophila melanogaster
           (Fruit fly)
          Length = 614

 Score = 48.8 bits (111), Expect = 8e-05
 Identities = 25/80 (31%), Positives = 41/80 (51%)
 Frame = +1

Query: 307 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 486
           LP+ FD    W +   + ++++QGSCGSCWAF     +     + +   K   FS ++L+
Sbjct: 394 LPKEFD----WRQKDAVTQVKNQGSCGSCWAFSVTGNIEGLYAVKTGELK--EFSEQELL 447

Query: 487 SCCPICGLGCNGGMPTLAWE 546
             C      CNGG+   A++
Sbjct: 448 D-CDTTDSACNGGLMDNAYK 466


>UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:
           Viral cathepsin - Xestia c-nigrum granulosis virus
           (XnGV) (Xestia c-nigrum granulovirus)
          Length = 346

 Score = 48.8 bits (111), Expect = 8e-05
 Identities = 28/80 (35%), Positives = 43/80 (53%)
 Frame = +1

Query: 307 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 486
           +P++FD RD+     ++  ++ Q  CGSCWAF AV  +     I  N +     S + LV
Sbjct: 133 VPDSFDWRDR----NSVTSVKMQKECGSCWAFSAVANIESLYHIKHNVS--LDLSEQQLV 186

Query: 487 SCCPICGLGCNGGMPTLAWE 546
            C  +   GCNGG+ + A+E
Sbjct: 187 DCDKV-NNGCNGGLMSWAFE 205


>UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42;
           Coelomata|Rep: Cathepsin L2 precursor - Homo sapiens
           (Human)
          Length = 334

 Score = 48.8 bits (111), Expect = 8e-05
 Identities = 34/106 (32%), Positives = 52/106 (49%), Gaps = 1/106 (0%)
 Frame = +1

Query: 232 LMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAV 411
           +MG  ++    K  KV  +  L  +LP++ D R K    P    +++Q  CGSCWAF A 
Sbjct: 91  MMGCFRNQKFRK-GKVFREP-LFLDLPKSVDWRKKGYVTP----VKNQKQCGSCWAFSAT 144

Query: 412 EAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAWE 546
            A+  +  ++    K    S ++LV C  P    GCNGG    A++
Sbjct: 145 GALEGQ--MFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQ 188


>UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC
           3.4.14.1) (Dipeptidyl-peptidase I) (DPP-I) (DPPI)
           (Cathepsin C) (Cathepsin J) (Dipeptidyl transferase)
           [Contains: Dipeptidyl-peptidase 1 exclusion domain chain
           (Dipeptidyl- peptidase I exclusion domain chain);
           Dipeptidyl-peptidase 1 heavy chain (Dipeptidyl-peptidase
           I heavy chain); Dipeptidyl-peptidase 1 light chain
           (Dipeptidyl-peptidase I light chain)]; n=50;
           Coelomata|Rep: Dipeptidyl-peptidase 1 precursor (EC
           3.4.14.1) (Dipeptidyl-peptidase I) (DPP-I) (DPPI)
           (Cathepsin C) (Cathepsin J) (Dipeptidyl transferase)
           [Contains: Dipeptidyl-peptidase 1 exclusion domain chain
           (Dipeptidyl- peptidase I exclusion domain chain);
           Dipeptidyl-peptidase 1 heavy chain (Dipeptidyl-peptidase
           I heavy chain); Dipeptidyl-peptidase 1 light chain
           (Dipeptidyl-peptidase I light chain)] - Homo sapiens
           (Human)
          Length = 463

 Score = 48.8 bits (111), Expect = 8e-05
 Identities = 38/134 (28%), Positives = 64/134 (47%), Gaps = 3/134 (2%)
 Frame = +1

Query: 145 FINLINKKQNTWKAGR--NFPTHTPFAHIKILMG-ALKDDNILKLPKVTHDAELIANLPE 315
           F+  IN  Q +W A     + T T    I+   G + K       P      + I +LP 
Sbjct: 174 FVKAINAIQKSWTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILHLPT 233

Query: 316 NFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC 495
           ++D R+       ++ +R+Q SCGSC++F ++  +  R+ I +N ++    S +++VSC 
Sbjct: 234 SWDWRNVHG-INFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCS 292

Query: 496 PICGLGCNGGMPTL 537
                GC GG P L
Sbjct: 293 QY-AQGCEGGFPYL 305


>UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin O;
           n=1; Danio rerio|Rep: PREDICTED: similar to cathepsin O
           - Danio rerio
          Length = 327

 Score = 48.4 bits (110), Expect = 1e-04
 Identities = 25/75 (33%), Positives = 34/75 (45%)
 Frame = +1

Query: 316 NFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC 495
           N  PR  W +   +  + +QGSCG CWAF  VEA+     + +   +     +   V  C
Sbjct: 119 NNPPRFDWRDHGVVGPVHNQGSCGGCWAFSIVEAIES---VSAKVGEKLQQLSVQQVIDC 175

Query: 496 PICGLGCNGGMPTLA 540
                GCNGG P  A
Sbjct: 176 SYQNQGCNGGSPVEA 190


>UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep:
           Cysteine protease - Saprolegnia parasitica
          Length = 523

 Score = 48.4 bits (110), Expect = 1e-04
 Identities = 39/135 (28%), Positives = 60/135 (44%), Gaps = 3/135 (2%)
 Frame = +1

Query: 133 LSDAFINLINKK-QNTWKAGRNFPTHTPFAHIKILMGALK--DDNILKLPKVTHDAELIA 303
           L+D  I   NK   +++  G N  +H  F   K L   L+     I    K    A  + 
Sbjct: 53  LNDQRIEAHNKDASSSFTMGHNEYSHLTFDEFKKLRTGLRVSPSYIQSRAKYALMAPAV- 111

Query: 304 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 483
           N+ +  +  D W E   +  +++QG CGSCWAF    A+     + S   +    S ++L
Sbjct: 112 NMTDVPNEMD-WVEQGGVTPVKNQGMCGSCWAFSTTGAIEGAAFVSSK--QLVSVSEQEL 168

Query: 484 VSCCPICGLGCNGGM 528
           V C     +GCNGG+
Sbjct: 169 VDCDHNGDMGCNGGL 183


>UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein A;
           n=2; Dictyostelium discoideum|Rep: Gamete and
           mating-type specific protein A - Dictyostelium
           discoideum (Slime mold)
          Length = 448

 Score = 48.4 bits (110), Expect = 1e-04
 Identities = 26/56 (46%), Positives = 31/56 (55%), Gaps = 2/56 (3%)
 Frame = +1

Query: 364 IRDQGSCGSCWAFGAVEAMTDRVCI-YSNATKH-FHFSAEDLVSCCPICGLGCNGG 525
           IRDQG CGSCWAF +  A+  R  I Y  A K     S ++ V+C      GCNGG
Sbjct: 253 IRDQGQCGSCWAFASSAALESRYLIKYGTAQKSTLQLSNQNAVNC---IASGCNGG 305


>UniRef50_Q235G6 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 325

 Score = 48.4 bits (110), Expect = 1e-04
 Identities = 28/113 (24%), Positives = 49/113 (43%)
 Frame = +1

Query: 202 THTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGS 381
           THT FA + +      D+ I  L  + H+ +++ +          W E   +  +++QG 
Sbjct: 88  THTEFAELYLNPAENIDEEIDSLQPIQHNEDIVID----------WVEKGAVTPVKNQGG 137

Query: 382 CGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLA 540
           CG CW+F     +     +Y N     + S + L+  C     GC GG+  +A
Sbjct: 138 CGGCWSFATTGGVEGANFVYKNVLP--NLSQQQLID-CNTQNKGCGGGLRDIA 187


>UniRef50_A4VE98 Cluster: Cathepsin z; n=3; Tetrahymena thermophila
           SB210|Rep: Cathepsin z - Tetrahymena thermophila SB210
          Length = 585

 Score = 48.4 bits (110), Expect = 1e-04
 Identities = 40/129 (31%), Positives = 58/129 (44%), Gaps = 5/129 (3%)
 Frame = +1

Query: 160 NKKQNTWKAGRNFPTHTP-FAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDK 336
           N  +NT K       HT  F H   +  + K+   L    + H+    A+LP N+D R+ 
Sbjct: 287 NDVRNTTKVTEVSNNHTNNFRHTTCIRESNKNSTQLITGPLPHEYINAASLPANWDWRNI 346

Query: 337 WPECPTLNEIRDQGS---CGSCWAFGAVEAMTDRVCIYSNAT-KHFHFSAEDLVSCCPIC 504
                 L+  R+Q     CGSCWA G   ++ DR+ I  N T      S + +++C    
Sbjct: 347 -NGVNYLSFTRNQHIPQYCGSCWAHGTTSSLADRINIARNRTWPDIALSVQVVLNC--QA 403

Query: 505 GLGCNGGMP 531
           G  CNGG P
Sbjct: 404 GGSCNGGQP 412



 Score = 37.5 bits (83), Expect = 0.20
 Identities = 29/95 (30%), Positives = 42/95 (44%), Gaps = 3/95 (3%)
 Frame = +1

Query: 271 PKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGS---CGSCWAFGAVEAMTDRVCIY 441
           P V  +AE  + LP NF  ++       L  +R+Q     CGSCWA  A   + DR+ I 
Sbjct: 31  PYVISNAEFNSVLPSNFTWQNV-NGTDYLTLVRNQHIPQYCGSCWAQAASSTLADRIKIA 89

Query: 442 SNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWE 546
             A       A  ++  C     GC+GG    A++
Sbjct: 90  RKAQWPDVVIAPQVLVSCDEYSNGCHGGNSGTAFQ 124


>UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain;
           n=16; Chrysomelidae|Rep: Digestive cysteine protease
           intestain - Leptinotarsa decemlineata (Colorado potato
           beetle)
          Length = 326

 Score = 48.4 bits (110), Expect = 1e-04
 Identities = 37/117 (31%), Positives = 53/117 (45%), Gaps = 2/117 (1%)
 Frame = +1

Query: 202 THTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGS 381
           TH  F  I  L G +K+   L         +L   +P++ D    W E   + E++DQ  
Sbjct: 79  THEEFKDI--LKGQIKNKPRLNATPTVFPEDL--EVPDSID----WTEKGAVLEVKDQNP 130

Query: 382 CGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLG-C-NGGMPTLAWE 546
           CGSCWAF A  A+  +  I +N       S + L+ C    G G C  GG  + A+E
Sbjct: 131 CGSCWAFSATGALEGQNAILNNV--KISLSEQQLLDCSAAYGNGNCKEGGDMSAAFE 185


>UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1;
           Methanospirillum hungatei JF-1|Rep: Peptidase C1A,
           papain precursor - Methanospirillum hungatei (strain
           JF-1 / DSM 864)
          Length = 1096

 Score = 48.4 bits (110), Expect = 1e-04
 Identities = 37/106 (34%), Positives = 52/106 (49%), Gaps = 1/106 (0%)
 Frame = +1

Query: 220 HIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWA 399
           H+K L   LK   I+    +T     +  LP +FD R+   +  T   I++QGSCGSCWA
Sbjct: 296 HLKGLRHDLKSSTIVSGAGITP----MEGLPTSFDWRNNGGDYTT--PIKNQGSCGSCWA 349

Query: 400 FGAVEAMTDRVCIYS-NATKHFHFSAEDLVSCCPICGLGCNGGMPT 534
           F    A      I S N   +  ++ + LV+C      GCNGG+ T
Sbjct: 350 FATTGAFESYKEIKSGNPGMNPDYAEQYLVNCAG-DQRGCNGGLFT 394


>UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin O
           precursor; n=1; Tribolium castaneum|Rep: PREDICTED:
           similar to Cathepsin O precursor - Tribolium castaneum
          Length = 326

 Score = 48.0 bits (109), Expect = 1e-04
 Identities = 22/63 (34%), Positives = 33/63 (52%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 516
           W E   +  I +QGSCG+CWA+  +E +     I +N  K    S ++++ C      GC
Sbjct: 127 WREKNAVTRIYNQGSCGACWAYSVIETVESMNAIKTN--KSEELSVQEIIDCAG-NNKGC 183

Query: 517 NGG 525
           NGG
Sbjct: 184 NGG 186


>UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16;
           Plasmodium (Vinckeia)|Rep: Cysteine proteinase precursor
           - Plasmodium vinckei
          Length = 506

 Score = 48.0 bits (109), Expect = 1e-04
 Identities = 33/106 (31%), Positives = 52/106 (49%), Gaps = 6/106 (5%)
 Frame = +1

Query: 244 LKDDNILKLPKVTHDAELIA------NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFG 405
           LK   I+ L K   +  LI+      + P++ D R K+   P     +DQG+CGSCWAF 
Sbjct: 236 LKSKYIVPLKKHLANTNLISVDNKSKDFPDSRDYRSKFNFLPP----KDQGNCGSCWAFA 291

Query: 406 AVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAW 543
           A+    + + +++       FS + +V C      GC+GG P  A+
Sbjct: 292 AI-GNFEYLYVHTRHEMPISFSEQQMVDCSTE-NYGCDGGNPFYAF 335


>UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia
           theta|Rep: Cathepsin H precursor - Guillardia theta
           (Cryptomonas phi)
          Length = 353

 Score = 47.6 bits (108), Expect = 2e-04
 Identities = 42/143 (29%), Positives = 67/143 (46%), Gaps = 10/143 (6%)
 Frame = +1

Query: 148 INLINKKQNT-WKAGRNFP---THTPFAHIKILM----GALKDDNILKLPKVTHDAELIA 303
           +  IN +  T W+A  N     T   F H K++     GA     + KL K+     ++A
Sbjct: 64  VEAINSRPGTTWRAALNQYSDLTWEEFKHAKLMAEQNCGATVTTPVEKLVKMG----IVA 119

Query: 304 NLPENFDPRDKW-PECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 480
              + FD R++   E   ++ +++QG+CGSCW F    A+     I +   +    S + 
Sbjct: 120 ---DEFDWRNQTCGETSCVSMVKNQGTCGSCWTFSTAAALESLHAIKTG--EMVLLSEQQ 174

Query: 481 LVSC-CPICGLGCNGGMPTLAWE 546
           LV C       GCNGG+P+ A+E
Sbjct: 175 LVDCAADFKNNGCNGGLPSQAFE 197


>UniRef50_Q58HK5 Cluster: Hc58; n=1; Haemonchus contortus|Rep: Hc58
           - Haemonchus contortus (Barber pole worm)
          Length = 241

 Score = 47.6 bits (108), Expect = 2e-04
 Identities = 18/27 (66%), Positives = 21/27 (77%)
 Frame = +1

Query: 364 IRDQGSCGSCWAFGAVEAMTDRVCIYS 444
           IRDQ +CGSCWA  A E M+DR CI+S
Sbjct: 108 IRDQSNCGSCWAVSAAETMSDRACIHS 134


>UniRef50_Q239L8 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 323

 Score = 47.6 bits (108), Expect = 2e-04
 Identities = 22/70 (31%), Positives = 34/70 (48%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 516
           W     +  ++DQG CGSCW+F    A+     ++ +  K    S + LV C      GC
Sbjct: 129 WTTKGAVTPVKDQGQCGSCWSFSTTGAVEG--ALFLSTKKLTSLSEQYLVDCSKDGNEGC 186

Query: 517 NGGMPTLAWE 546
           NGG+   A++
Sbjct: 187 NGGLMDTAFD 196


>UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia
           tropicalis|Rep: Blo t 1 allergen - Blomia tropicalis
           (Mite)
          Length = 333

 Score = 47.6 bits (108), Expect = 2e-04
 Identities = 25/63 (39%), Positives = 31/63 (49%)
 Frame = +1

Query: 304 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 483
           +LP+NFD    W +   L  IR QGSCGSCWAF A         I     +    S ++L
Sbjct: 112 SLPQNFD----WRQKARLTRIRQQGSCGSCWAFAAAGVAESLYSIQKQ--QSIELSEQEL 165

Query: 484 VSC 492
           V C
Sbjct: 166 VDC 168


>UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease;
           n=23; Magnoliophyta|Rep: Senescence-specific cysteine
           protease - Arabidopsis thaliana (Mouse-ear cress)
          Length = 346

 Score = 47.2 bits (107), Expect = 3e-04
 Identities = 26/70 (37%), Positives = 34/70 (48%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 516
           W +   +  I++QGSCG CWAF AV A+     I     K    S + LV  C     GC
Sbjct: 136 WRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKG--KLISLSEQQLVD-CDTNDFGC 192

Query: 517 NGGMPTLAWE 546
            GG+   A+E
Sbjct: 193 EGGLMDTAFE 202


>UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar to
           human SRY (sex determining region Y)-box 30
           (SOX30),transcript variant 1,; n=3; Mammalia|Rep: Testis
           cDNA clone: QtsA-12228, similar to human SRY (sex
           determining region Y)-box 30 (SOX30),transcript variant
           1, - Macaca fascicularis (Crab eating macaque)
           (Cynomolgus monkey)
          Length = 433

 Score = 47.2 bits (107), Expect = 3e-04
 Identities = 33/105 (31%), Positives = 52/105 (49%), Gaps = 1/105 (0%)
 Frame = +1

Query: 232 LMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAV 411
           +MG  ++  + K  K+  +  L  +LP++ D R K    P    +++Q  CGSCWAF A 
Sbjct: 91  VMGCFRNQKLRK-GKLFREP-LFLDLPKSVDWRKKGYVTP----VKNQKQCGSCWAFSAT 144

Query: 412 EAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAW 543
            A+  +  ++    K    S ++LV C  P    GCNGG    A+
Sbjct: 145 GALEGQ--MFRKTGKLVSLSEQNLVDCSHPQGNQGCNGGFMNSAF 187


>UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13;
           Plasmodium|Rep: Cysteine protease falcipain-3 -
           Plasmodium falciparum
          Length = 492

 Score = 47.2 bits (107), Expect = 3e-04
 Identities = 24/61 (39%), Positives = 35/61 (57%)
 Frame = +1

Query: 364 IRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAW 543
           ++DQ  CGSCWAF +V ++  +  I   A   F FS ++LV  C +   GC GG  T A+
Sbjct: 284 VKDQALCGSCWAFSSVGSVESQYAIRKKAL--FLFSEQELVD-CSVKNNGCYGGYITNAF 340

Query: 544 E 546
           +
Sbjct: 341 D 341


>UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcoptes
           scabiei type hominis|Rep: Cathepsin L-like protease -
           Sarcoptes scabiei type hominis
          Length = 245

 Score = 47.2 bits (107), Expect = 3e-04
 Identities = 30/88 (34%), Positives = 45/88 (51%), Gaps = 1/88 (1%)
 Frame = +1

Query: 286 DAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH 465
           D E +++LP+  D    W     +  I+DQ  CGSCWAF AV +M  +  + +   +   
Sbjct: 113 DNEDVSDLPDEVD----WTLKNVVAPIKDQKQCGSCWAFSAVASMESQNALKTG--QLVE 166

Query: 466 FSAEDLVSCCPICG-LGCNGGMPTLAWE 546
            S ++LV C    G  GC+GG    A+E
Sbjct: 167 LSEQELVDCSVGEGNEGCDGGWMDSAFE 194


>UniRef50_Q7QVE9 Cluster: GLP_542_3431_1206; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_542_3431_1206 - Giardia lamblia ATCC
           50803
          Length = 741

 Score = 47.2 bits (107), Expect = 3e-04
 Identities = 38/98 (38%), Positives = 50/98 (51%), Gaps = 6/98 (6%)
 Frame = +1

Query: 250 DDNILKLPKVTHDAELI-ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTD 426
           +D   +LP    +A+L  A LP NF  R     C    +I +QGSCG C+A  AVE +T 
Sbjct: 40  EDEYNELPDGPDNADLTRAALPTNFTYRGH--RCI---QIINQGSCGCCYAAAAVEMVTA 94

Query: 427 RVCIYSNATKHFHFSAEDLVSC-----CPICGLGCNGG 525
           R C+  N ++    S EDLV+C       I   GC GG
Sbjct: 95  RRCLQLNDSR--LVSLEDLVTCDHTKYLNIQNNGCRGG 130


>UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase
           precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin
           L-like cysteine proteinase precursor - Acanthoscelides
           obtectus (Bean weevil)
          Length = 321

 Score = 47.2 bits (107), Expect = 3e-04
 Identities = 21/55 (38%), Positives = 34/55 (61%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPI 501
           W E   + E++ QG+CGSCWAF AV ++  +V + + + +    SA++LV C  I
Sbjct: 116 WREKGAVTEVKKQGNCGSCWAFSAVGSIEGQVFLKNGSLE--SLSAQNLVDCAGI 168


>UniRef50_Q23H10 Cluster: Papain family cysteine protease containing
           protein; n=14; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 336

 Score = 47.2 bits (107), Expect = 3e-04
 Identities = 41/122 (33%), Positives = 56/122 (45%), Gaps = 12/122 (9%)
 Frame = +1

Query: 202 THTPFAHIKILMGALKDDNILK--LPKVTH------DAELIANLPENFDPRDKWPECPTL 357
           T   FA  KILM +   D+++K    + TH      + +L +N     D  D W     +
Sbjct: 82  TKEEFAE-KILMKSDLVDHLMKGISQEATHNDTNNNETQLSSNSLTLADSID-WRTKGAV 139

Query: 358 NEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSC-CPICG---LGCNGG 525
             +++QG CGSCW+F A   M     I + A     FS + LV C  P  G    GCNGG
Sbjct: 140 TSVKNQGGCGSCWSFSAAAVMESFNFIQNKAL--VDFSEQQLVDCVIPANGYNSYGCNGG 197

Query: 526 MP 531
            P
Sbjct: 198 WP 199


>UniRef50_Q23EG5 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 429

 Score = 47.2 bits (107), Expect = 3e-04
 Identities = 24/75 (32%), Positives = 39/75 (52%), Gaps = 5/75 (6%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGS----CGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PI 501
           W E   ++ ++DQ +    CGSCW F A  A+   + + +     F+ S + LV C    
Sbjct: 128 WREKGIVSSVKDQDAVGDDCGSCWTFSATGAIESHLALKTGKAP-FNLSQQQLVDCAGKF 186

Query: 502 CGLGCNGGMPTLAWE 546
              GC+GG+P+ A+E
Sbjct: 187 DNQGCDGGLPSRAFE 201


>UniRef50_Q22AB1 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 344

 Score = 47.2 bits (107), Expect = 3e-04
 Identities = 26/82 (31%), Positives = 40/82 (48%)
 Frame = +1

Query: 301 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 480
           ++LPE+FD RDK    P     + Q +CGSCW F     +  +  +      HF   +E 
Sbjct: 129 SDLPESFDWRDKGIITPA----KFQNTCGSCWTFATTGVIESQYALKYGELLHF---SEQ 181

Query: 481 LVSCCPICGLGCNGGMPTLAWE 546
           ++  C     GC GG+ T A++
Sbjct: 182 MLLDCDNINQGCRGGLMTDAYQ 203


>UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10;
           Dictyostelium discoideum|Rep: Cysteine proteinase 7
           precursor - Dictyostelium discoideum (Slime mold)
          Length = 460

 Score = 47.2 bits (107), Expect = 3e-04
 Identities = 25/72 (34%), Positives = 37/72 (51%), Gaps = 2/72 (2%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHF-HFSAEDLVSCCPICG-L 510
           W     +  I++QG CG CW+F    A T+     +N  K+    S ++L+ C    G  
Sbjct: 116 WRTQGAVTPIKNQGQCGGCWSFSTTGA-TEGAQYLANGKKNLVSLSEQNLIDCSGSYGNN 174

Query: 511 GCNGGMPTLAWE 546
           GC GG+ TLA+E
Sbjct: 175 GCEGGLMTLAFE 186


>UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16)
           [Contains: Cathepsin H mini chain; Cathepsin H heavy
           chain; Cathepsin H light chain]; n=37; Eukaryota|Rep:
           Cathepsin H precursor (EC 3.4.22.16) [Contains:
           Cathepsin H mini chain; Cathepsin H heavy chain;
           Cathepsin H light chain] - Homo sapiens (Human)
          Length = 335

 Score = 47.2 bits (107), Expect = 3e-04
 Identities = 21/65 (32%), Positives = 35/65 (53%), Gaps = 1/65 (1%)
 Frame = +1

Query: 355 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMP 531
           ++ +++QG+CGSCW F    A+   + I +   K    + + LV C       GC GG+P
Sbjct: 129 VSPVKNQGACGSCWTFSTTGALESAIAIATG--KMLSLAEQQLVDCAQDFNNHGCQGGLP 186

Query: 532 TLAWE 546
           + A+E
Sbjct: 187 SQAFE 191


>UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24;
           Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa
           (Rice)
          Length = 339

 Score = 46.8 bits (106), Expect = 3e-04
 Identities = 32/79 (40%), Positives = 39/79 (49%), Gaps = 2/79 (2%)
 Frame = +1

Query: 298 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 477
           I  LP   D R K    P    I+DQG CG CWAF AV AM   V +  +  K    S +
Sbjct: 120 IDTLPATVDWRTKGAVTP----IKDQGQCGCCWAFSAVAAMEGIVKL--STGKLISLSEQ 173

Query: 478 DLVSCCPICG--LGCNGGM 528
           +LV  C + G   GC GG+
Sbjct: 174 ELVD-CDVHGEDQGCEGGL 191


>UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3;
           Curculionidae|Rep: Cysteine proteinase - Hypera postica
           (alfalfa weevil)
          Length = 324

 Score = 46.8 bits (106), Expect = 3e-04
 Identities = 21/63 (33%), Positives = 32/63 (50%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 516
           W +   +  ++DQG CGSCWAF ++   T+      +  K    S + L+ CC     GC
Sbjct: 118 WRKEGRVTGVKDQGDCGSCWAF-SITGSTEGAYARKSG-KLVSLSEQQLIDCCTDTSAGC 175

Query: 517 NGG 525
           +GG
Sbjct: 176 DGG 178


>UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Rep:
           Cysteine protease - Babesia equi
          Length = 438

 Score = 46.8 bits (106), Expect = 3e-04
 Identities = 24/70 (34%), Positives = 36/70 (51%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 516
           W +   +  ++DQG+CGSCWAF AV ++     I     +    S ++LV+C      GC
Sbjct: 230 WRKLNGVTPVKDQGNCGSCWAFAAVGSVESLYLIKKG--QALDLSEQELVNCEENSN-GC 286

Query: 517 NGGMPTLAWE 546
            G +P  A E
Sbjct: 287 EGDLPNKALE 296


>UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, whole
           genome shotgun sequence; n=3; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_2,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 376

 Score = 46.8 bits (106), Expect = 3e-04
 Identities = 22/64 (34%), Positives = 34/64 (53%)
 Frame = +1

Query: 355 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPT 534
           + E++ QG CGSCWAF   + +  R+ I +N  K    S   L+ C      GC+GG  +
Sbjct: 175 VTEVQQQGRCGSCWAFAVQDVVISRLAI-ANKNKLDQLSKTHLIDCADGNTEGCDGGSVS 233

Query: 535 LAWE 546
            A++
Sbjct: 234 DAFD 237


>UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61;
           Leishmania|Rep: Cysteine proteinase 2 precursor -
           Leishmania pifanoi
          Length = 444

 Score = 46.8 bits (106), Expect = 3e-04
 Identities = 25/70 (35%), Positives = 38/70 (54%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 516
           W E   +  ++DQG+CGSCWAF AV  +  +   Y    +    S + LVSC  +   GC
Sbjct: 132 WREKGAVTPVKDQGACGSCWAFSAVGNIEGQ--WYLAGHELVSLSEQQLVSCDDM-NDGC 188

Query: 517 NGGMPTLAWE 546
           +GG+   A++
Sbjct: 189 DGGLMLQAFD 198


>UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15)
           [Contains: Cathepsin L heavy chain; Cathepsin L light
           chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC
           3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain] - Sarcophaga peregrina (Flesh fly)
           (Boettcherisca peregrina)
          Length = 339

 Score = 46.8 bits (106), Expect = 3e-04
 Identities = 24/65 (36%), Positives = 34/65 (52%), Gaps = 1/65 (1%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG-LG 513
           W E   +  ++DQG CGSCWAF +  A+  +   +  A      S ++LV C    G  G
Sbjct: 128 WREHGAVTGVKDQGHCGSCWAFSSTGALEGQ--HFRKAGVLVSLSEQNLVDCSTKYGNNG 185

Query: 514 CNGGM 528
           CNGG+
Sbjct: 186 CNGGL 190


>UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor;
           n=6; Chrysomelinae|Rep: Cathepsin L-like proteinase
           precursor - Phaedon cochleariae (Mustard beetle)
          Length = 324

 Score = 46.8 bits (106), Expect = 3e-04
 Identities = 29/80 (36%), Positives = 37/80 (46%), Gaps = 1/80 (1%)
 Frame = +1

Query: 310 PENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVS 489
           PE+ D R K    P    +R+QG CGSCWA     A+  +  I S +      S + LV 
Sbjct: 111 PESIDWRSKGVVLP----VRNQGECGSCWALSTAAAIESQSAIKSGS--KVPLSPQQLVD 164

Query: 490 CCPICG-LGCNGGMPTLAWE 546
           C    G  GCNGG     +E
Sbjct: 165 CSTSYGNHGCNGGFAVNGFE 184


>UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 344

 Score = 46.4 bits (105), Expect = 4e-04
 Identities = 24/74 (32%), Positives = 34/74 (45%), Gaps = 3/74 (4%)
 Frame = +1

Query: 313 ENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSC 492
           +N  P D W     +  ++ QG CGSCW F A  A+ +      N     +FS + ++ C
Sbjct: 134 KNAPPMD-WRNASAITPVKQQGKCGSCWTF-ASTAVLESFSFIKNGAPLTNFSEQQILDC 191

Query: 493 CPICGL---GCNGG 525
               G    GCNGG
Sbjct: 192 VYGSGYYSNGCNGG 205


>UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba
           histolytica|Rep: Cysteine protease 17 - Entamoeba
           histolytica
          Length = 420

 Score = 46.4 bits (105), Expect = 4e-04
 Identities = 29/83 (34%), Positives = 41/83 (49%), Gaps = 5/83 (6%)
 Frame = +1

Query: 292 ELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNAT-----K 456
           +++  LPE  D R    +   L  IR+Q  CG CW+F +V A+  R  I  N T     +
Sbjct: 162 DIVKELPEGIDFR----KFGKLTYIREQTGCGGCWSFASVCALESRYLIDYNLTVDDVGR 217

Query: 457 HFHFSAEDLVSCCPICGLGCNGG 525
            +  S + L+ CC I   GC GG
Sbjct: 218 TWALSEQQLLDCC-IENNGCEGG 239


>UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba
           histolytica|Rep: Cysteine protease 19 - Entamoeba
           histolytica
          Length = 324

 Score = 46.4 bits (105), Expect = 4e-04
 Identities = 23/61 (37%), Positives = 36/61 (59%), Gaps = 4/61 (6%)
 Frame = +1

Query: 355 LNEIRDQGSCGSCWAFGAVEAMTDRVCI-YSN-ATKHFHFSAEDLVSCC--PICGLGCNG 522
           +  ++DQG+CGSC+AF +V  M   V + Y + +  ++  S  ++VSCC  P    GC G
Sbjct: 112 MTPVKDQGNCGSCYAFSSVALMETAVLLSYDDLSPSNYALSTAEIVSCCYDPSECRGCEG 171

Query: 523 G 525
           G
Sbjct: 172 G 172


>UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n=1;
           Myxobolus cerebralis|Rep: Cathepsin Z-like cysteine
           proteinase - Myxobolus cerebralis
          Length = 297

 Score = 46.4 bits (105), Expect = 4e-04
 Identities = 26/77 (33%), Positives = 41/77 (53%), Gaps = 7/77 (9%)
 Frame = +1

Query: 304 NLPENFDPRDKWPECPTLNEIRDQGS---CGSCWAFGAVEAMTDRVCIYSNAT--KHFHF 468
           N+P++FD    W E   L+ +++Q     CGSCWAF +   + DR+ I  N +   HF  
Sbjct: 49  NMPKSFD----WRENAYLSSVKNQHLPTYCGSCWAFASTSTIADRIYIAKNLSHFDHFSL 104

Query: 469 SAEDLVSCCPI--CGLG 513
           S + +++C     C LG
Sbjct: 105 SVQVVIACAQSGDCKLG 121


>UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 395

 Score = 46.4 bits (105), Expect = 4e-04
 Identities = 21/58 (36%), Positives = 32/58 (55%), Gaps = 2/58 (3%)
 Frame = +1

Query: 364 IRDQGSCGSCWAFGAVEAMTDRVCIYSNATKH--FHFSAEDLVSCCPICGLGCNGGMP 531
           +RDQG C SCW FG++ A+  R  I +  ++    H SA++ ++C      GC  G P
Sbjct: 201 VRDQGECKSCWVFGSLAALESRYLIKNGVSEKSTLHLSAQNAMNCIT---SGCESGWP 255


>UniRef50_Q18740 Cluster: Putative uncharacterized protein tag-329;
           n=2; Caenorhabditis|Rep: Putative uncharacterized
           protein tag-329 - Caenorhabditis elegans
          Length = 374

 Score = 46.4 bits (105), Expect = 4e-04
 Identities = 25/76 (32%), Positives = 36/76 (47%), Gaps = 1/76 (1%)
 Frame = +1

Query: 307 LPENFDPRDKWPECP-TLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 483
           LP+ FD R+K       +  I+ Q SC  CW F A       + ++    K  + S +++
Sbjct: 140 LPKTFDLRNKKVGGHYIIGPIKTQDSCACCWGFAATAVAEAALTVHLK--KAMNLSEQEV 197

Query: 484 VSCCPICGLGCNGGMP 531
             C P  G GCNGG P
Sbjct: 198 CDCAPKHGPGCNGGDP 213


>UniRef50_A2DPZ6 Cluster: Clan CA, family C1, cathepsin B-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin B-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 253

 Score = 46.4 bits (105), Expect = 4e-04
 Identities = 24/79 (30%), Positives = 43/79 (54%)
 Frame = +1

Query: 289 AELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHF 468
           + +   LPE ++  +++PEC     I+    CG C+ + A++++  R C      +   F
Sbjct: 22  SNISVELPEYYNFLEEYPECDFGPLIQH---CGCCYVYSALKSLAHRYC--RALRRRIQF 76

Query: 469 SAEDLVSCCPICGLGCNGG 525
           SA+ ++S C +  LGCNGG
Sbjct: 77  SAQYIIS-CDLFNLGCNGG 94


>UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_46,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 336

 Score = 46.4 bits (105), Expect = 4e-04
 Identities = 23/62 (37%), Positives = 31/62 (50%)
 Frame = +1

Query: 355 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPT 534
           + +++DQG C  CWAFGAV A      + +  T     S + L+  C     GCNGG   
Sbjct: 151 ITQVKDQGQCSGCWAFGAVGAAEAWFYVKNKTT--VLLSEQQLID-CDTQSFGCNGGYQN 207

Query: 535 LA 540
           LA
Sbjct: 208 LA 209


>UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11;
           Entamoeba|Rep: Cysteine proteinase 2 precursor -
           Entamoeba histolytica
          Length = 315

 Score = 46.4 bits (105), Expect = 4e-04
 Identities = 27/75 (36%), Positives = 38/75 (50%), Gaps = 2/75 (2%)
 Frame = +1

Query: 310 PENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKH-FHFSAEDLV 486
           PE+ D    W +   +  IRDQ  CGSC+ FG++ A+  R+ I      +    S E +V
Sbjct: 95  PESVD----WRKEGKVTPIRDQAQCGSCYTFGSLAALEGRLLIEKGGDANTLDLSEEHMV 150

Query: 487 SCCPICG-LGCNGGM 528
            C    G  GCNGG+
Sbjct: 151 QCTRDNGNNGCNGGL 165


>UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2;
           Entamoeba|Rep: Cysteine proteinase ACP1 precursor -
           Entamoeba histolytica
          Length = 308

 Score = 46.4 bits (105), Expect = 4e-04
 Identities = 23/60 (38%), Positives = 30/60 (50%)
 Frame = +1

Query: 355 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPT 534
           +N  +DQG CGSCW F     +  RV    +  K + FS + LV  C     GC GG P+
Sbjct: 103 MNPAKDQGQCGSCWTFCTTAVLEGRV--NKDLGKLYSFSEQQLVD-CDASDNGCEGGHPS 159


>UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin W,
           partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED:
           similar to Cathepsin W, partial - Ornithorhynchus
           anatinus
          Length = 229

 Score = 46.0 bits (104), Expect = 6e-04
 Identities = 28/83 (33%), Positives = 39/83 (46%), Gaps = 2/83 (2%)
 Frame = +1

Query: 289 AELIANLPENFDPRDK--WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHF 462
           A  +A++PE    ++   W +   +  +++QGSCGSCWAF AV         Y  A K  
Sbjct: 56  ANQMASIPEGPLRKETCDWRKRGAITSVKNQGSCGSCWAFAAVG--NAESMWYLRAGKRL 113

Query: 463 HFSAEDLVSCCPICGLGCNGGMP 531
              +   V  C  C  GC GG P
Sbjct: 114 VSLSVQEVLDCGRCRDGCQGGYP 136


>UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin
           heavy chain; n=3; Amniota|Rep: PREDICTED: similar to
           ferritin heavy chain - Ornithorhynchus anatinus
          Length = 338

 Score = 46.0 bits (104), Expect = 6e-04
 Identities = 29/80 (36%), Positives = 39/80 (48%), Gaps = 1/80 (1%)
 Frame = +1

Query: 310 PENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVS 489
           PE  D R K    P    +++QG CGSCWAF A  A+     ++    K    S ++LV 
Sbjct: 121 PEEVDWRTKGYVTP----VKNQGLCGSCWAFSATGAL--EALVFKTTGKMVSLSEQNLVD 174

Query: 490 CCPICG-LGCNGGMPTLAWE 546
           C    G +GC GG    A+E
Sbjct: 175 CSWRQGNVGCRGGQYIGAFE 194


>UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;
           Brugia malayi|Rep: Cahepsin L-like cysteine protease -
           Brugia malayi (Filarial nematode worm)
          Length = 371

 Score = 46.0 bits (104), Expect = 6e-04
 Identities = 26/82 (31%), Positives = 42/82 (51%), Gaps = 2/82 (2%)
 Frame = +1

Query: 307 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 486
           LP++ D    W     + +++DQG CGSCW F AV A+  +  + +   K    S ++L+
Sbjct: 143 LPKSID----WRTSGAVTKVKDQGYCGSCWTFSAVGALEGQHFLQTG--KLVELSMQNLL 196

Query: 487 SCC--PICGLGCNGGMPTLAWE 546
            C        GC+GG+   A+E
Sbjct: 197 DCSDDTYGNYGCDGGLMMEAFE 218


>UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine
           protease; n=11; Callosobruchus maculatus|Rep: Putative
           gut cathepsin L-like cysteine protease - Callosobruchus
           maculatus (Southern cowpea weevil) (Pulse bruchid)
          Length = 326

 Score = 46.0 bits (104), Expect = 6e-04
 Identities = 26/72 (36%), Positives = 37/72 (51%), Gaps = 2/72 (2%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC--PICGL 510
           W E   +  ++DQ +CGSCWAF AV A+  +     N T     SA++LV C        
Sbjct: 118 WREEGAVTPVKDQANCGSCWAFSAVGAIEGQF-FKKNGTL-VSLSAQELVDCATEDYGNN 175

Query: 511 GCNGGMPTLAWE 546
           GC GG+   A++
Sbjct: 176 GCKGGLMGQAFD 187


>UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia
           circumcincta|Rep: Secreted cathepsin F - Teladorsagia
           circumcincta
          Length = 364

 Score = 46.0 bits (104), Expect = 6e-04
 Identities = 25/80 (31%), Positives = 40/80 (50%)
 Frame = +1

Query: 307 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 486
           LPE+FD    W E   + +++ +G C +CWAF     +  +  +     K    SA+ L+
Sbjct: 153 LPESFD----WREHGAVTKVKTEGHCAACWAFSVTGNIEGQWFLAKK--KLVSLSAQQLL 206

Query: 487 SCCPICGLGCNGGMPTLAWE 546
             C +   GCNGG P  A++
Sbjct: 207 D-CDVVDEGCNGGFPLDAYK 225


>UniRef50_A2E6N1 Cluster: Clan CA, family C1, cathepsin B-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin B-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 255

 Score = 46.0 bits (104), Expect = 6e-04
 Identities = 27/95 (28%), Positives = 51/95 (53%)
 Frame = +1

Query: 241 ALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAM 420
           A  D++I   P+     ++  ++P+ ++   ++P C  L  +  +  CG C+A+G ++AM
Sbjct: 14  AFVDESIRSFPE-----DISIDIPDEYNFLQEYPHCD-LGPLTQE--CGCCYAYGPIKAM 65

Query: 421 TDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGG 525
           + R+C   N  K    SA+ +V+ C +   GC GG
Sbjct: 66  SHRICKAKN--KKTFLSAQFIVA-CDLLESGCEGG 97


>UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza
           sativa|Rep: Os09g0497500 protein - Oryza sativa subsp.
           japonica (Rice)
          Length = 349

 Score = 45.6 bits (103), Expect = 8e-04
 Identities = 28/80 (35%), Positives = 44/80 (55%)
 Frame = +1

Query: 307 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 486
           LP++ D    W +   + E+++QG CGSCWAF AV A+ + +    N  +    S ++LV
Sbjct: 122 LPKSVD----WRKKGAVVEVKNQGDCGSCWAFSAVAAI-EGINQIKNG-ELVSLSEQELV 175

Query: 487 SCCPICGLGCNGGMPTLAWE 546
            C     +GC GG  + A+E
Sbjct: 176 DCDDE-AVGCGGGYMSWAFE 194


>UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease
           Gip1p; n=4; Tetrahymena thermophila|Rep:
           Granule-biosynthesis induced protease Gip1p -
           Tetrahymena thermophila
          Length = 345

 Score = 45.6 bits (103), Expect = 8e-04
 Identities = 30/93 (32%), Positives = 45/93 (48%), Gaps = 3/93 (3%)
 Frame = +1

Query: 256 NILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVC 435
           N+   P V++      NLP + D    W +   LN +++QG+CGSCW F A   + +   
Sbjct: 116 NLAADPAVSNLVFPTNNLPLSVD----WRKRGVLNPVKNQGTCGSCWTF-ATAGILESFN 170

Query: 436 IYSNATKHFHFSAEDLVSCCPICGL---GCNGG 525
              N  +   FS + LV C  + G    GC+GG
Sbjct: 171 QIKN-KQLLKFSEQQLVDCVSLAGYDSDGCDGG 202


>UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1;
           Rhipicephalus appendiculatus|Rep: Midgut cysteine
           proteinase 4 - Rhipicephalus appendiculatus (Brown ear
           tick)
          Length = 345

 Score = 45.6 bits (103), Expect = 8e-04
 Identities = 29/98 (29%), Positives = 47/98 (47%), Gaps = 3/98 (3%)
 Frame = +1

Query: 247 KDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTD 426
           K  +  +L ++   A L  + PE  +    W E   +  +++QG CGSCWAF +  A+  
Sbjct: 106 KPPSAQQLAEIPLYAPLFGDTPEFIE----WRENGFVTPVKNQGQCGSCWAFSSTGALEG 161

Query: 427 RVCIYSNATKHFHFSAEDLVSCC--PICGLGCNGG-MP 531
           +V  +    +    S ++L+ C        GCNGG MP
Sbjct: 162 QV--FKRTRRLISLSEQNLMDCAGQRYGNNGCNGGQMP 197


>UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3;
           Schistosoma japonicum|Rep: Clone ZZD209 mRNA sequence -
           Schistosoma japonicum (Blood fluke)
          Length = 339

 Score = 45.6 bits (103), Expect = 8e-04
 Identities = 23/82 (28%), Positives = 41/82 (50%), Gaps = 1/82 (1%)
 Frame = +1

Query: 283 HDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHF 462
           +D   +   P+++D    W     +NE RDQGSC   +AF    +   +  +++  + H 
Sbjct: 113 YDVNNVGWTPDSYD----WRHLNIVNEPRDQGSCIGSYAFAVTASTESQYALHT--SNHM 166

Query: 463 HFSAEDLVSCCPICG-LGCNGG 525
           + S +  + C  I G +GC+GG
Sbjct: 167 NLSVQQFIDCTRIYGNMGCHGG 188


>UniRef50_Q22A69 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 330

 Score = 45.6 bits (103), Expect = 8e-04
 Identities = 23/66 (34%), Positives = 33/66 (50%), Gaps = 2/66 (3%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIY--SNATKHFHFSAEDLVSCCPICGL 510
           W     +  +++QGSCGSCWAF    ++  +  +    N T    FS + LV C      
Sbjct: 118 WTTKGAVTPVKNQGSCGSCWAFSTTGSIEGQYVLQLKQNLTS---FSEQQLVDCDTKEDQ 174

Query: 511 GCNGGM 528
           GCNGG+
Sbjct: 175 GCNGGL 180


>UniRef50_A0CGA3 Cluster: Chromosome undetermined scaffold_179,
           whole genome shotgun sequence; n=3; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_179,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 339

 Score = 45.6 bits (103), Expect = 8e-04
 Identities = 23/72 (31%), Positives = 42/72 (58%)
 Frame = +1

Query: 310 PENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVS 489
           P  ++ ++ +P+C   +++ +QG+C S ++     + +DRVC   N T+    SA++L+S
Sbjct: 126 PVYYNFKEAYPQCN--HQVYNQGNCSSSYSIAVSSSFSDRVC-KQNQTQ--QLSAQNLLS 180

Query: 490 CCPICGLGCNGG 525
           C     LGC GG
Sbjct: 181 CDGKLNLGCKGG 192


>UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Oryza
           sativa|Rep: Cysteine protease 1 precursor - Oryza sativa
           subsp. japonica (Rice)
          Length = 490

 Score = 45.6 bits (103), Expect = 8e-04
 Identities = 30/83 (36%), Positives = 43/83 (51%), Gaps = 1/83 (1%)
 Frame = +1

Query: 283 HDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHF 462
           HD   +  LP++ D RDK      +  +++QG CGSCWAF AV A+     I +   +  
Sbjct: 149 HDG--VEALPDSVDWRDKGA---VVAPVKNQGQCGSCWAFSAVAAVEGINKIVTG--ELV 201

Query: 463 HFSAEDLVSCCPI-CGLGCNGGM 528
             S ++LV C       GCNGG+
Sbjct: 202 SLSEQELVECARNGQNSGCNGGI 224


>UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27;
           Eukaryota|Rep: Cysteine proteinase GP-I - Zingiber
           officinale (Ginger)
          Length = 221

 Score = 45.6 bits (103), Expect = 8e-04
 Identities = 28/80 (35%), Positives = 40/80 (50%)
 Frame = +1

Query: 307 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 486
           LP++ D R+K    P    +++QG CGSCWAF A+ A+     I +        S + LV
Sbjct: 3   LPDSIDWREKGAVVP----VKNQGGCGSCWAFDAIAAVEGINQIVTGDL--ISLSEQQLV 56

Query: 487 SCCPICGLGCNGGMPTLAWE 546
             C     GC GG P  A++
Sbjct: 57  D-CSTRNHGCEGGWPYRAFQ 75


>UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin l;
           n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
           similar to cathepsin l - Strongylocentrotus purpuratus
          Length = 489

 Score = 45.2 bits (102), Expect = 0.001
 Identities = 22/64 (34%), Positives = 33/64 (51%), Gaps = 1/64 (1%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG-LG 513
           W     ++ ++DQ  CGSCW+FG+ E +   V + S   K    S + L+ C    G  G
Sbjct: 273 WNVLGAVSPVKDQAVCGSCWSFGSAETIEGAVFMQSG--KRVRLSQQMLMDCTWAAGNNG 330

Query: 514 CNGG 525
           C+GG
Sbjct: 331 CDGG 334


>UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel
           protein - Danio rerio (Zebrafish) (Brachydanio rerio)
          Length = 328

 Score = 45.2 bits (102), Expect = 0.001
 Identities = 25/73 (34%), Positives = 40/73 (54%), Gaps = 1/73 (1%)
 Frame = +1

Query: 328 RDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG 507
           R  W E   ++ +++QG CGSCWAF AV ++  ++   + A      SA++L+ C    G
Sbjct: 116 RVNWTEHGMVSPVQNQGPCGSCWAFSAVGSLEAQMKRRTAAL--VPLSAQNLLDCSVSLG 173

Query: 508 -LGCNGGMPTLAW 543
             GC GG  + A+
Sbjct: 174 NRGCKGGFLSRAF 186


>UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3;
           Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber
           officinale (Ginger)
          Length = 475

 Score = 45.2 bits (102), Expect = 0.001
 Identities = 27/80 (33%), Positives = 39/80 (48%)
 Frame = +1

Query: 307 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 486
           LP++ D    W E   +  +++QG CGSCWAF A+ A+     I +        S + LV
Sbjct: 143 LPDSID----WREKGAVVAVKNQGRCGSCWAFAAIAAVEGINQIVTGDL--ISLSEQQLV 196

Query: 487 SCCPICGLGCNGGMPTLAWE 546
             C     GC GG P  A++
Sbjct: 197 D-CSTRNYGCEGGWPYRAFQ 215


>UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus
           tauri|Rep: Cysteine protease-1 - Ostreococcus tauri
          Length = 430

 Score = 45.2 bits (102), Expect = 0.001
 Identities = 26/72 (36%), Positives = 37/72 (51%), Gaps = 3/72 (4%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAF---GAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG 507
           W E   +   ++QG CGSCWAF   GAVE +T          +    S +++VSC     
Sbjct: 207 WVELGAVTPPKNQGQCGSCWAFSTTGAVEGITK-----IRTGRLVSLSEQEMVSCSK-QN 260

Query: 508 LGCNGGMPTLAW 543
           +GCNGG+   A+
Sbjct: 261 MGCNGGLMDYAF 272


>UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh
           fly) (Boettcherisca peregrina). Cathepsin L; n=2;
           Dictyostelium discoideum|Rep: Similar to Sarcophaga
           peregrina (Flesh fly) (Boettcherisca peregrina).
           Cathepsin L - Dictyostelium discoideum (Slime mold)
          Length = 265

 Score = 45.2 bits (102), Expect = 0.001
 Identities = 27/91 (29%), Positives = 44/91 (48%), Gaps = 1/91 (1%)
 Frame = +1

Query: 256 NILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVC 435
           N +K     H+    A +P++FD    W +   + ++++QGSC SCW+F A+ A+     
Sbjct: 32  NDIKATPFKHNVN--ATIPKSFD----WRDHGAVGKVKNQGSCASCWSFSALGALEGH-- 83

Query: 436 IYSNATKHFHFSAEDLVSCC-PICGLGCNGG 525
            Y    +    S ++LV C  P    GC  G
Sbjct: 84  YYIKYGELLDLSEQNLVDCATPFGPKGCKTG 114


>UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1;
           Rhipicephalus appendiculatus|Rep: Midgut cysteine
           proteinase 2 - Rhipicephalus appendiculatus (Brown ear
           tick)
          Length = 564

 Score = 44.8 bits (101), Expect = 0.001
 Identities = 38/137 (27%), Positives = 54/137 (39%), Gaps = 3/137 (2%)
 Frame = +1

Query: 145 FINLINKKQNTWKAGRNFPTHTPFAHIKILMGAL--KDDNILKLPKVTHDAELIANLPEN 318
           FI+  N+    +    N         I +L G L  KD +    P   H     A LP+ 
Sbjct: 291 FIDSKNRANLGYNLAVNHLADRTREEISVLRGRLQSKDGSSRAEPFPRH--RFTAKLPDQ 348

Query: 319 FDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP 498
            D    W     +  ++DQ  CGSCW+FG V  +      +    +    S + LV C  
Sbjct: 349 ID----WRPYGAVTPVKDQAVCGSCWSFGTVGELEG--AYFRKTGRLVRLSEQQLVDCSW 402

Query: 499 ICG-LGCNGGMPTLAWE 546
             G  GC+GG    A+E
Sbjct: 403 NNGNNGCDGGEDFRAYE 419


>UniRef50_Q22RR8 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 332

 Score = 44.8 bits (101), Expect = 0.001
 Identities = 30/103 (29%), Positives = 47/103 (45%), Gaps = 1/103 (0%)
 Frame = +1

Query: 220 HIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWA 399
           ++++    +K  N    PK   +A+L  N+    D    W +   +  ++DQ  CGSCWA
Sbjct: 97  YLRLKTNTIKRQNFKSNPK---NAQL--NMKLGDDIIIDWTKKGAVTPVKDQEQCGSCWA 151

Query: 400 FGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG-LGCNGG 525
           F A  A+     I +        S ++LV C    G  GC+GG
Sbjct: 152 FSATGALESATFISTGTLP--SLSEQELVDCSTSYGNEGCDGG 192


>UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila
           melanogaster|Rep: CG5367-PA - Drosophila melanogaster
           (Fruit fly)
          Length = 338

 Score = 44.4 bits (100), Expect = 0.002
 Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 1/78 (1%)
 Frame = +1

Query: 295 LIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSA 474
           L+AN+PE+ D R K    P  N++    SCGSC+AF   E++  +V  +    K    S 
Sbjct: 123 LMANVPESLDWRSKGFITPPYNQL----SCGSCYAFSIAESIMGQV--FKRTGKILSLSK 176

Query: 475 EDLVSCCPICG-LGCNGG 525
           + +V C    G  GC GG
Sbjct: 177 QQIVDCSVSHGNQGCVGG 194


>UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain;
           n=9; Cucujiformia|Rep: Digestive cysteine proteinase
           intestain - Leptinotarsa decemlineata (Colorado potato
           beetle)
          Length = 326

 Score = 44.4 bits (100), Expect = 0.002
 Identities = 32/117 (27%), Positives = 54/117 (46%), Gaps = 4/117 (3%)
 Frame = +1

Query: 208 TPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDK--WPECPTLNEIRDQGS 381
           TPFA +       KD+   ++    +    +A  PE  +  D   W +   + +++ QG 
Sbjct: 73  TPFADLT--HDEFKDELRRQIKTKPNVEATLAVFPEGLEVPDSIDWTQKGAVLDVKYQGG 130

Query: 382 CGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGC-NGGMPTLAWE 546
           CGSCWAF A  A+  +  I +N       S + L+ C  P     C +GG+ + A++
Sbjct: 131 CGSCWAFSATGALEGQNAIVNNV--KIPLSEQQLLDCSKPYGNDDCEHGGLMSFAFD 185


>UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3;
           Theileria|Rep: Cysteine protease, putative - Theileria
           annulata
          Length = 580

 Score = 44.4 bits (100), Expect = 0.002
 Identities = 21/52 (40%), Positives = 29/52 (55%)
 Frame = +1

Query: 337 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSC 492
           W E   +NE+ +QGSCGSCWA  + +  +    I  N  K   FS++ LV C
Sbjct: 370 WRESGFVNEVVNQGSCGSCWAIASEDIFSTFKSIKKN--KLMKFSSQQLVDC 419


>UniRef50_Q22DX2 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 358

 Score = 44.4 bits (100), Expect = 0.002
 Identities = 26/74 (35%), Positives = 36/74 (48%)
 Frame = +1

Query: 304 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 483
           ++P ++D R   P    L  + +QG CGSCWAF    A+        N T   + S + L
Sbjct: 146 SIPSSWDIRTDGPGL--LQPVENQGQCGSCWAFSTSGAVESYYSAKKNIT--LNLSKQQL 201

Query: 484 VSCCPICGLGCNGG 525
           V C    G GC+GG
Sbjct: 202 VDCVYDHG-GCDGG 214


  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 527,693,012
Number of Sequences: 1657284
Number of extensions: 10427706
Number of successful extensions: 29421
Number of sequences better than 10.0: 451
Number of HSP's better than 10.0 without gapping: 28420
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 29252
length of database: 575,637,011
effective HSP length: 96
effective length of database: 416,537,747
effective search space used: 35822246242
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
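
Note on the scores above: each hit reports a bit score followed by the raw score in parentheses, for example "47.2 bits (107)". The two appear to be related by the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, using the gapped Lambda and K values listed in this report. The sketch below is only an illustration under that assumption. The function name bit_score and the constant names are hypothetical and are not part of the BLAST output.

```python
import math

# Gapped Karlin-Altschul parameters copied from the statistics block of this
# report (the "Gapped / Lambda K H" lines). The names are illustrative only.
GAPPED_LAMBDA = 0.279
GAPPED_K = 0.0580


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # Raw scores that appear in parentheses in the alignments above.
    for raw in (107, 106, 105, 104, 103, 102, 101, 100):
        print(f"raw {raw:3d} -> {bit_score(raw):.1f} bits")
```

With these parameters a raw score of 107 comes out at about 47.2 bits and a raw score of 100 at about 44.4 bits, which agrees with the values printed for the hits in this section. The Expect values are not reproduced here, because they also depend on the effective search space and on length adjustments that this sketch does not model.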
