SilkBase

Last updated: 2022/11/18
BLASTX 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= fe100P02_F_E16
         (651 letters)

Database: uniref50 
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpw...   161   1e-38
UniRef50_A1XG92 Cluster: Putative cathepsin B-like like proteina...   152   8e-36
UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n...   136   6e-31
UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome sh...   127   3e-28
UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep...   123   4e-27
UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Ca...   123   4e-27
UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=...   120   3e-26
UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin ...   119   7e-26
UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep: ...   108   1e-22
UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor; ...   105   9e-22
UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma j...   104   2e-21
UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2...   103   4e-21
UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Catheps...   101   1e-20
UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase pr...   101   1e-20
UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA...   101   2e-20
UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 ...    99   1e-19
UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1; Rhip...    98   1e-19
UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep: Ca...    95   1e-18
UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase pr...    95   1e-18
UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|...    95   2e-18
UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=...    93   4e-18
UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4 ...    93   4e-18
UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep: Ca...    91   3e-17
UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoid...    90   5e-17
UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8....    89   1e-16
UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8...    88   2e-16
UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n...    87   3e-16
UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3 ...    87   3e-16
UniRef50_Q23FP9 Cluster: Papain family cysteine protease contain...    85   1e-15
UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyl...    85   2e-15
UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8; Leishma...    85   2e-15
UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115, w...    83   4e-15
UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep...    82   1e-14
UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gamb...    81   2e-14
UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: C...    80   5e-14
UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4; Ancylostomati...    80   5e-14
UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7; Rhabditida|...    80   5e-14
UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin ...    79   1e-13
UniRef50_Q237A1 Cluster: Papain family cysteine protease contain...    79   1e-13
UniRef50_Q00W63 Cluster: Cysteine proteinase; n=2; Ostreococcus|...    78   2e-13
UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7...    78   2e-13
UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator ame...    76   6e-13
UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1; ...    75   2e-12
UniRef50_P91991 Cluster: Putative uncharacterized protein; n=2; ...    73   5e-12
UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2; ...    73   6e-12
UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8....    73   6e-12
UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1 ...    71   2e-11
UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|R...    67   3e-10
UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella ve...    67   3e-10
UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2 ...    67   3e-10
UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1; O...    66   5e-10
UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomus...    65   1e-09
UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep...    64   4e-09
UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|R...    64   4e-09
UniRef50_Q625X3 Cluster: Putative uncharacterized protein CBG011...    63   5e-09
UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep...    63   6e-09
UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus co...    63   6e-09
UniRef50_A4RYN2 Cluster: Predicted protein; n=1; Ostreococcus lu...    62   1e-08
UniRef50_Q5DEC9 Cluster: SJCHGC06356 protein; n=1; Schistosoma j...    62   1e-08
UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|R...    58   1e-07
UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia...    56   7e-07
UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Gi...    56   1e-06
UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Gi...    55   1e-06
UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re...    52   1e-05
UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA,...    52   2e-05
UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2; cel...    52   2e-05
UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;...    51   2e-05
UniRef50_UPI0000E4866E Cluster: PREDICTED: similar to oxidized-L...    51   3e-05
UniRef50_Q649T1 Cluster: Cathepsin C; n=1; uncultured archaeon G...    51   3e-05
UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted...    50   4e-05
UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like prote...    50   4e-05
UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1...    50   5e-05
UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1; ...    49   8e-05
UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ...    48   3e-04
UniRef50_Q58HK5 Cluster: Hc58; n=1; Haemonchus contortus|Rep: Hc...    48   3e-04
UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru...    48   3e-04
UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ...    47   3e-04
UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lambl...    47   3e-04
UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin ...    47   5e-04
UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestin...    47   5e-04
UniRef50_P81494 Cluster: Cathepsin B; n=2; Phasianidae|Rep: Cath...    47   5e-04
UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb...    46   6e-04
UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali...    46   6e-04
UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re...    46   8e-04
UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35...    46   8e-04
UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio...    46   0.001
UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: ...    46   0.001
UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcop...    46   0.001
UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cy...    46   0.001
UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadida...    45   0.001
UniRef50_O02470 Cluster: Cysteine proteinase; n=2; Chromadorea|R...    45   0.001
UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel...    45   0.001
UniRef50_UPI0000E815AE Cluster: PREDICTED: similar to glucocorti...    45   0.002
UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - ...    45   0.002
UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-li...    45   0.002
UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18...    45   0.002
UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ...    44   0.002
UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy...    44   0.002
UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=...    44   0.002
UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p; ...    44   0.003
UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n...    44   0.003
UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Re...    44   0.003
UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, wh...    44   0.003
UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin ...    44   0.004
UniRef50_Q9GZM7-2 Cluster: Isoform 2 of Q9GZM7 ; n=1; Homo sapie...    44   0.004
UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big...    44   0.004
UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr...    44   0.004
UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh...    44   0.004
UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease ...    43   0.006
UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia ...    43   0.006
UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy...    43   0.006
UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag...    43   0.006
UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory...    43   0.006
UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ...    43   0.007
UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v...    43   0.007
UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ...    42   0.010
UniRef50_Q23RT7 Cluster: Papain family cysteine protease contain...    42   0.010
UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=...    42   0.010
UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic...    42   0.010
UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto...    42   0.010
UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Try...    42   0.013
UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n...    42   0.017
UniRef50_Q24E33 Cluster: Papain family cysteine protease contain...    42   0.017
UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh...    42   0.017
UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precurs...    42   0.017
UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ...    42   0.017
UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa...    41   0.022
UniRef50_Q22W19 Cluster: Papain family cysteine protease contain...    41   0.022
UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-li...    41   0.022
UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ...    41   0.022
UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Pl...    41   0.022
UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ...    41   0.030
UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; ...    41   0.030
UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ...    41   0.030
UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr...    41   0.030
UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ...    41   0.030
UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, wh...    41   0.030
UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; M...    41   0.030
UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R...    41   0.030
UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ...    40   0.039
UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine p...    40   0.039
UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease ...    40   0.039
UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio...    40   0.039
UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa...    40   0.039
UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n...    40   0.039
UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foet...    40   0.039
UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain...    40   0.039
UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis...    40   0.039
UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ...    40   0.052
UniRef50_Q6XM39 Cluster: FirrV-1-A48 precursor; n=1; Feldmannia ...    40   0.052
UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;...    40   0.052
UniRef50_O16454 Cluster: Temporarily assigned gene name protein ...    40   0.052
UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cy...    40   0.052
UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen...    40   0.052
UniRef50_A0CYS7 Cluster: Chromosome undetermined scaffold_31, wh...    40   0.052
UniRef50_Q2FLC7 Cluster: Periplasmic copper-binding precursor; n...    40   0.052
UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ...    40   0.052
UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e...    40   0.052
UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi...    40   0.052
UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt...    40   0.068
UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ...    40   0.068
UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H...    40   0.068
UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy...    40   0.068
UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryo...    40   0.068
UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ...    40   0.068
UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ...    39   0.090
UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;...    39   0.090
UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ...    39   0.090
UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lambl...    39   0.090
UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n...    39   0.090
UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain...    39   0.090
UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve...    39   0.090
UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like cy...    39   0.090
UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi...    39   0.090
UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 p...    39   0.12 
UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca...    39   0.12 
UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No...    39   0.12 
UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n...    39   0.12 
UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;...    39   0.12 
UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; ...    39   0.12 
UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathep...    39   0.12 
UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ...    39   0.12 
UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr...    39   0.12 
UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;...    39   0.12 
UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ...    38   0.16 
UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome s...    38   0.16 
UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil...    38   0.16 
UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n...    38   0.16 
UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ...    38   0.16 
UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ...    38   0.16 
UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain...    38   0.16 
UniRef50_Q23TW3 Cluster: Papain family cysteine protease contain...    38   0.16 
UniRef50_A2E6N1 Cluster: Clan CA, family C1, cathepsin B-like cy...    38   0.16 
UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, wh...    38   0.16 
UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto...    38   0.16 
UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma...    38   0.16 
UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhip...    38   0.21 
UniRef50_Q7QVE9 Cluster: GLP_542_3431_1206; n=1; Giardia lamblia...    38   0.21 
UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ...    38   0.21 
UniRef50_Q22DX2 Cluster: Papain family cysteine protease contain...    38   0.21 
UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precu...    38   0.21 
UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ...    38   0.28 
UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ...    38   0.28 
UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep...    38   0.28 
UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; R...    38   0.28 
UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph...    38   0.28 
UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j...    38   0.28 
UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er...    38   0.28 
UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like cy...    38   0.28 
UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, w...    38   0.28 
UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Pl...    38   0.28 
UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:...    38   0.28 
UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium...    38   0.28 
UniRef50_A6GBK8 Cluster: Peptidase C1A, papain; n=1; Plesiocysti...    37   0.36 
UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa...    37   0.36 
UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein...    37   0.36 
UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lambli...    37   0.36 
UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox...    37   0.36 
UniRef50_Q231X3 Cluster: Papain family cysteine protease contain...    37   0.36 
UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy...    37   0.36 
UniRef50_A5KBM2 Cluster: Serine-repeat antigen; n=3; Plasmodium|...    37   0.36 
UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like cy...    37   0.36 
UniRef50_A0D5R4 Cluster: Chromosome undetermined scaffold_39, wh...    37   0.36 
UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ...    37   0.48 
UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG...    37   0.48 
UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ...    37   0.48 
UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa...    37   0.48 
UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|...    37   0.48 
UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intest...    37   0.48 
UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|...    37   0.48 
UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C...    37   0.48 
UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata...    37   0.48 
UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|...    37   0.48 
UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ...    36   0.64 
UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal...    36   0.64 
UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcopt...    36   0.64 
UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi...    36   0.64 
UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip...    36   0.64 
UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei...    36   0.64 
UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi...    36   0.64 
UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain...    36   0.64 
UniRef50_Q23H32 Cluster: Papain family cysteine protease contain...    36   0.64 
UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi...    36   0.64 
UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R...    36   0.64 
UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, wh...    36   0.64 
UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w...    36   0.64 
UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; The...    36   0.64 
UniRef50_Q3A8I3 Cluster: Putative serine protease; n=1; Pelobact...    36   0.84 
UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ...    36   0.84 
UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus ...    36   0.84 
UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy...    36   0.84 
UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lambli...    36   0.84 
UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3; Theil...    36   0.84 
UniRef50_Q23FS6 Cluster: Papain family cysteine protease contain...    36   0.84 
UniRef50_Q235G6 Cluster: Papain family cysteine protease contain...    36   0.84 
UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; ...    36   0.84 
UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4; Caenorhabdi...    36   0.84 
UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:...    36   0.84 
UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ...    36   0.84 
UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:...    36   0.84 
UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M...    36   0.84 
UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [C...    36   0.84 
UniRef50_Q8ZRX7 Cluster: Putative viral protein; n=1; Salmonella...    36   1.1  
UniRef50_A0LES1 Cluster: Peptidase C1A, papain precursor; n=2; S...    36   1.1  
UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryz...    36   1.1  
UniRef50_Q24HI2 Cluster: Papain family cysteine protease contain...    36   1.1  
UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D...    36   1.1  
UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ...    36   1.1  
UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,...    35   1.5  
UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ...    35   1.5  
UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin ...    35   1.5  
UniRef50_Q60EQ9 Cluster: Putative uncharacterized protein OJ1280...    35   1.5  
UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Pir...    35   1.5  
UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcop...    35   1.5  
UniRef50_Q239L8 Cluster: Papain family cysteine protease contain...    35   1.5  
UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain...    35   1.5  
UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C...    35   1.5  
UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve...    35   1.5  
UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; The...    35   1.5  
UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h...    35   1.9  
UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica...    35   1.9  
UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat...    35   1.9  
UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar ...    35   1.9  
UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster...    35   1.9  
UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum...    35   1.9  
UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G...    35   1.9  
UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; ...    35   1.9  
UniRef50_Q4YCM9 Cluster: Cysteine protease, putative; n=5; Plasm...    35   1.9  
UniRef50_Q23H10 Cluster: Papain family cysteine protease contain...    35   1.9  
UniRef50_Q22A69 Cluster: Papain family cysteine protease contain...    35   1.9  
UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina...    35   1.9  
UniRef50_A7APS9 Cluster: Papain family cysteine protease contain...    35   1.9  
UniRef50_A5KBM1 Cluster: Serine-repeat antigen; n=1; Plasmodium ...    35   1.9  
UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, who...    35   1.9  
UniRef50_Q9UJW2 Cluster: Tubulointerstitial nephritis antigen; n...    35   1.9  
UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs...    35   1.9  
UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; ...    35   1.9  
UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2...    35   1.9  
UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3; ...    34   2.6  
UniRef50_Q0JP65 Cluster: Os01g0240900 protein; n=3; Oryza sativa...    34   2.6  
UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca...    34   2.6  
UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus...    34   2.6  
UniRef50_Q7QQ83 Cluster: GLP_42_16392_14707; n=1; Giardia lambli...    34   2.6  
UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1...    34   2.6  
UniRef50_Q3L7L7 Cluster: Sar s 1 allergen Yv5020C01; n=1; Sarcop...    34   2.6  
UniRef50_Q23H15 Cluster: Papain family cysteine protease contain...    34   2.6  
UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain...    34   2.6  
UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain...    34   2.6  
UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat...    34   2.6  
UniRef50_O96163 Cluster: Cysteine protease, putative; n=5; Plasm...    34   2.6  
UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-li...    34   2.6  
UniRef50_A5UP12 Cluster: Adhesin-like protein; n=1; Methanobrevi...    34   2.6  
UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc...    34   2.6  
UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ...    34   3.4  
UniRef50_A0LQG9 Cluster: Peptidase C1A, papain; n=1; Syntrophoba...    34   3.4  
UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t...    34   3.4  
UniRef50_Q01E11 Cluster: Cysteine proteinase Cathepsin F; n=3; O...    34   3.4  
UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-...    34   3.4  
UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba hi...    34   3.4  
UniRef50_Q7QQ92 Cluster: GLP_243_18349_20043; n=1; Giardia lambl...    34   3.4  
UniRef50_Q26015 Cluster: Serine rich protein homologue; n=4; Pla...    34   3.4  
UniRef50_Q23H06 Cluster: Papain family cysteine protease contain...    34   3.4  
UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus...    34   3.4  
UniRef50_Q3YJ15 Cluster: Putative galactosyl transferase; n=1; H...    33   4.5  
UniRef50_Q5YER6 Cluster: Cathepsin Z; n=1; Bigelowiella natans|R...    33   4.5  
UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ...    33   4.5  
UniRef50_Q01FU9 Cluster: Cathepsin Z; n=2; Ostreococcus|Rep: Cat...    33   4.5  
UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2; ...    33   4.5  
UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae...    33   4.5  
UniRef50_Q9XZI2 Cluster: Cathepsin Z1 preproprotein; n=3; Bilate...    33   4.5  
UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ...    33   4.5  
UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|...    33   4.5  
UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-...    33   4.5  
UniRef50_Q6A1H9 Cluster: Cathepsin X/O; n=1; Suberites domuncula...    33   4.5  
UniRef50_Q61CG7 Cluster: Putative uncharacterized protein CBG129...    33   4.5  
UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia...    33   4.5  
UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl...    33   4.5  
UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain...    33   4.5  
UniRef50_A4VE98 Cluster: Cathepsin z; n=3; Tetrahymena thermophi...    33   4.5  
UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov...    33   4.5  
UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome s...    33   5.9  
UniRef50_Q89Z69 Cluster: Putative uncharacterized protein; n=1; ...    33   5.9  
UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty...    33   5.9  
UniRef50_Q7QWN9 Cluster: GLP_26_50243_51811; n=2; Giardia lambli...    33   5.9  
UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;...    33   5.9  
UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n...    33   5.9  
UniRef50_Q26153 Cluster: V-SERA 4; n=1; Plasmodium vivax|Rep: V-...    33   5.9  
UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dv...    33   5.9  
UniRef50_Q1AMF3 Cluster: Cathepsin C1; n=1; Toxoplasma gondii|Re...    33   5.9  
UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, w...    33   5.9  
UniRef50_Q59RI2 Cluster: Putative uncharacterized protein; n=1; ...    33   5.9  
UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis...    33   7.8  
UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteina...    33   7.8  
UniRef50_A0CGA3 Cluster: Chromosome undetermined scaffold_179, w...    33   7.8  
UniRef50_Q1DTN0 Cluster: Predicted protein; n=1; Coccidioides im...    33   7.8  
UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; D...    33   7.8  

>UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep:
           Parcxpwnx02 - Periplaneta americana (American cockroach)
          Length = 343

 Score =  161 bits (392), Expect = 1e-38
 Identities = 74/144 (51%), Positives = 90/144 (62%)
 Frame = +3

Query: 219 LPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLI 398
           L  PLSD+FI+ IN    +WKA RNF  D     +KK+MGV        LP K+ + D+ 
Sbjct: 32  LVDPLSDDFIDHINSLNTTWKAHRNFGNDIPLREIKKLMGVRRSLENFRLPEKSME-DID 90

Query: 399 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 578
             +PE FDPR++WP+CPTL E+RDQGSCGSCWAFGAVEAM+DRVC +S G  HFHFSAED
Sbjct: 91  IEIPEEFDPREQWPECPTLKEIRDQGSCGSCWAFGAVEAMSDRVCIHSKGKTHFHFSAED 150

Query: 579 XXXXXXXXXXXXXXXXXXXAWEYW 650
                              AW+YW
Sbjct: 151 LLTCCSSCGFGCNGGEPGAAWDYW 174


>UniRef50_A1XG92 Cluster: Putative cathepsin B-like like proteinase;
           n=1; Tenebrio molitor|Rep: Putative cathepsin B-like
           like proteinase - Tenebrio molitor (Yellow mealworm)
          Length = 301

 Score =  152 bits (368), Expect = 8e-36
 Identities = 70/144 (48%), Positives = 95/144 (65%), Gaps = 2/144 (1%)
 Frame = +3

Query: 225 HPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFAT-LPIKTHKIDLIA 401
           HPLSDEFIN IN KQ +WKAGRNF  +T  +H+++++GV+  +  A  LP+KTH ++L A
Sbjct: 24  HPLSDEFINEINSKQTTWKAGRNFDVNTPISHVRRLLGVLPKKANAPKLPVKTHAVNLDA 83

Query: 402 SLPENFDPRDKWPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 578
            +PE+FD R+ WP+C ++  E+RDQ SCGSCWAFGAVEAM+DR+C +S+ +     SAED
Sbjct: 84  -IPESFDAREAWPECTSIIGEIRDQASCGSCWAFGAVEAMSDRICIHSDASVKVRISAED 142

Query: 579 XXXXXXXXXXXXXXXXXXXAWEYW 650
                              AW YW
Sbjct: 143 LNDCCYDCGDGCNGGWPDLAWSYW 166


>UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n=4;
           Tenebrionidae|Rep: Putative cathepsin B-like proteinase
           - Tenebrio molitor (Yellow mealworm)
          Length = 321

 Score =  136 bits (328), Expect = 6e-31
 Identities = 67/144 (46%), Positives = 97/144 (67%), Gaps = 3/144 (2%)
 Frame = +3

Query: 156 KMFISRAAYVTLVCVLAAAKDLPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIM 335
           K+F+S   +V LV VL+A+      LS EFI++IN  Q+SW AGRNFP +T+  +L K+ 
Sbjct: 2   KIFLS---FVVLVAVLSASLAEIDVLSSEFIDSINRIQSSWVAGRNFPENTTNEYLYKLN 58

Query: 336 GVI---EDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGA 506
           G I    D ++   P+  H  +    +PE+FD R KWP+C +LN +RDQG+CGSCWAF +
Sbjct: 59  GFIGLHPDPNYKP-PVLVHTFNA-RDVPESFDARTKWPNCDSLNRIRDQGACGSCWAFAS 116

Query: 507 VEAMTDRVCTYSNGTKHFHFSAED 578
           +E+M+DR+C +S+G+  F FS ED
Sbjct: 117 IESMSDRICIHSSGSAQFMFSPED 140


>UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome
           shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome 5
           SCAF15026, whole genome shotgun sequence - Tetraodon
           nigroviridis (Green puffer)
          Length = 351

 Score =  127 bits (306), Expect = 3e-28
 Identities = 64/161 (39%), Positives = 91/161 (56%), Gaps = 2/161 (1%)
 Frame = +3

Query: 174 AAYVTLVCVLAAAKDLPH--PLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIE 347
           AA++ L    +++   PH  PLS E +N IN   ++W AG NF  +  ++++KK+ G + 
Sbjct: 4   AAFLFLAAAWSSSLARPHLKPLSSEMVNYINKLNSTWTAGHNF-HNVDYSYVKKLCGTLL 62

Query: 348 DEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 527
                 L I+ +  D+   LP+ FD R++WP+CPTL E+RDQGSCGSCWAFGA EAM+DR
Sbjct: 63  KGPKLPLMIR-YAGDI--KLPKEFDSREQWPNCPTLKEIRDQGSCGSCWAFGASEAMSDR 119

Query: 528 VCTYSNGTKHFHFSAEDXXXXXXXXXXXXXXXXXXXAWEYW 650
           VC +SN       SA+D                   AW +W
Sbjct: 120 VCIHSNAKVSVELSAQDLLTCCNSCGMGCNGGYPSSAWNFW 160


>UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep:
           Cathepsin B - Pandalus borealis (Northern red shrimp)
          Length = 328

 Score =  123 bits (296), Expect = 4e-27
 Identities = 57/130 (43%), Positives = 79/130 (60%)
 Frame = +3

Query: 189 LVCVLAAAKDLPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFATL 368
           L+ ++AAA     PLSDEF+  +  KQ +WKAGRNF +D S   LK +  V ++     L
Sbjct: 6   LLALVAAASAELDPLSDEFLELLQSKQMTWKAGRNFAKDISKDFLKSLNCVRKNPDIPKL 65

Query: 369 PIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 548
           P+K   +     +P  FD R++WP CP ++E+RDQG+CGSCWA  A   MTDR C  + G
Sbjct: 66  PLKN--VTPTKEIPVEFDAREQWPHCPCIDEIRDQGNCGSCWAVSAASVMTDRTCIDTEG 123

Query: 549 TKHFHFSAED 578
              F FS+E+
Sbjct: 124 LVDFRFSSEN 133


>UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1)
           (Cathepsin B1) (APP secretase) (APPS) [Contains:
           Cathepsin B light chain; Cathepsin B heavy chain]; n=85;
           Eukaryota|Rep: Cathepsin B precursor (EC 3.4.22.1)
           (Cathepsin B1) (APP secretase) (APPS) [Contains:
           Cathepsin B light chain; Cathepsin B heavy chain] - Homo
           sapiens (Human)
          Length = 339

 Score =  123 bits (296), Expect = 4e-27
 Identities = 60/137 (43%), Positives = 84/137 (61%), Gaps = 4/137 (2%)
 Frame = +3

Query: 180 YVTLVC--VLAAAKDLP--HPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIE 347
           + +L C  VLA A+  P  HPLSDE +N +N +  +W+AG NF  +   ++LK++ G   
Sbjct: 5   WASLCCLLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLCGTFL 63

Query: 348 DEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 527
                  P +         LP +FD R++WP CPT+ E+RDQGSCGSCWAFGAVEA++DR
Sbjct: 64  G---GPKPPQRVMFTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDR 120

Query: 528 VCTYSNGTKHFHFSAED 578
           +C ++N       SAED
Sbjct: 121 ICIHTNAHVSVEVSAED 137


>UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=1;
           Biomphalaria glabrata|Rep: Cathepsin B preproprotein
           precursor - Biomphalaria glabrata (Bloodfluke planorb)
          Length = 333

 Score =  120 bits (289), Expect = 3e-26
 Identities = 66/161 (40%), Positives = 88/161 (54%), Gaps = 5/161 (3%)
 Frame = +3

Query: 183 VTLVCVLAAAKDLP---HPLSDEFINTINLKQNS-WKAGRNF-PRDTSFAHLKKIMGVIE 347
           V +  +LA A   P    PLSD  I  IN   N+ WKAGRNF P +   A     + + E
Sbjct: 8   VAICGLLAVALATPFHIEPLSDAEIFYINHVANTTWKAGRNFHPAEIKRARALLGVNMAE 67

Query: 348 DEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 527
           ++ +  + +K  ++     LP+NFDPR KWPDC +LNE+RDQ +CGSCWAFG+ EAMTDR
Sbjct: 68  NKAYNRIHLKYKQVQPRNDLPDNFDPRTKWPDCASLNEIRDQANCGSCWAFGSAEAMTDR 127

Query: 528 VCTYSNGTKHFHFSAEDXXXXXXXXXXXXXXXXXXXAWEYW 650
           +C    G  + H SAED                   AWE++
Sbjct: 128 ICIAGKG--NIHISAEDINDCCKSCGMGCNGGYPAAAWEWY 166


>UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin B;
           n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
           similar to cathepsin B - Strongylocentrotus purpuratus
          Length = 346

 Score =  119 bits (286), Expect = 7e-26
 Identities = 57/135 (42%), Positives = 76/135 (56%)
 Frame = +3

Query: 246 INTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLPENFDP 425
           +  +N  + +WKAG NF         ++++G +++ +   LP K      I  LPENFD 
Sbjct: 28  VQKVNSLKTTWKAGINF-EGWQLDDFRRMLGALKNPN-GRLP-KLENQTRIKDLPENFDA 84

Query: 426 RDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDXXXXXXXXX 605
           R+ WP+CPT+ EVRDQGSCGSCWAFGAVEA++DR+C  S G    H SAED         
Sbjct: 85  RENWPNCPTIKEVRDQGSCGSCWAFGAVEAISDRICIKSKGQTQVHISAEDLMTCCKTCG 144

Query: 606 XXXXXXXXXXAWEYW 650
                     AWEY+
Sbjct: 145 NGCNGGFPGSAWEYY 159


>UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep:
           Cathepsin B - Apriona germari
          Length = 324

 Score =  108 bits (260), Expect = 1e-22
 Identities = 50/117 (42%), Positives = 75/117 (64%), Gaps = 2/117 (1%)
 Frame = +3

Query: 234 SDEFINTINLKQNSWKAGRNFPRDT--SFAHLKKIMGVIEDEHFATLPIKTHKIDLIASL 407
           ++ FI +IN K  +W A +NF   T      L  ++G+  D +  TLP+  H  + I+ +
Sbjct: 28  TEAFIQSINEKATTWTARKNFEGRTPEQLKALADVIGINRDPN-VTLPVVFH--EAISGI 84

Query: 408 PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 578
           P++FD R++WP C ++  +RD+G+CGSCWAF AVE M+DR+C  S G K F FSAE+
Sbjct: 85  PDSFDAREQWPFCESIRTIRDEGACGSCWAFAAVEVMSDRLCLASEGRKKFIFSAEE 141


>UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           B-like proteinase precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 331

 Score =  105 bits (252), Expect = 9e-22
 Identities = 56/163 (34%), Positives = 84/163 (51%), Gaps = 3/163 (1%)
 Frame = +3

Query: 171 RAAYVT--LVCVLAAAKDLPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVI 344
           +AA++   L+ ++ + K  P+PLS++FIN IN KQ++W AG+NF  + S   +K ++G  
Sbjct: 2   KAAFIITLLLPIVLSYKGSPNPLSNDFINYINSKQSTWVAGKNFDENLSIQEIKNLLGAK 61

Query: 345 EDEHFATLPIKTHKIDLIASLPENFDPRDKWPDC-PTLNEVRDQGSCGSCWAFGAVEAMT 521
           + +        TH  D+   +P +FD R+ W +C   ++ V DQ  CGSCWA  A  AM+
Sbjct: 62  KGK-LGVAKEFTHSEDI--QVPNSFDARENWKECSDVISTVVDQSDCGSCWAVAAASAMS 118

Query: 522 DRVCTYSNGTKHFHFSAEDXXXXXXXXXXXXXXXXXXXAWEYW 650
           DR C  S G      SAE+                   AW YW
Sbjct: 119 DRRCIASQGKLKVPVSAENLLSCCDSCGYGCEGGYPTMAWSYW 161


>UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC02853 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 181

 Score =  104 bits (249), Expect = 2e-21
 Identities = 54/109 (49%), Positives = 68/109 (62%), Gaps = 4/109 (3%)
 Frame = +3

Query: 228 PLSDEFINTINLKQN-SWKAGRNFPRDTSFAHLKKIMGVI---EDEHFATLPIKTHKIDL 395
           PLSDE I  IN + N  WKA R   R TS  H K +MGV+    D+H    PI  H  D+
Sbjct: 21  PLSDELITFINKQPNIEWKADRT-KRFTSIHHAKSMMGVLLNSVDQHKLHHPIIHHN-DI 78

Query: 396 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYS 542
              LP+ FD R  W +C ++  +RDQ SCGSCWAFGAVE+M+DR+C +S
Sbjct: 79  NIKLPKYFDSRKYWKNCSSIRTIRDQSSCGSCWAFGAVESMSDRICIHS 127


>UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2;
           Arthropoda|Rep: Cathepsin B-like cysteine protease -
           Callosobruchus maculatus (Southern cowpea weevil) (Pulse
           bruchid)
          Length = 330

 Score =  103 bits (247), Expect = 4e-21
 Identities = 51/138 (36%), Positives = 74/138 (53%), Gaps = 2/138 (1%)
 Frame = +3

Query: 171 RAAYVTLVCVLAAAKDLPHP--LSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVI 344
           + A++ L  V++     P    LSDE+I  +N K   WKAGRNF RDTS  ++++++ V 
Sbjct: 2   KLAFIALAAVVSCTFAQPELDFLSDEYIEQLNSKNLPWKAGRNFERDTSLYNIQRLLSVG 61

Query: 345 EDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTD 524
                +      H+ D    LPE FD R +W  C ++ E+RDQ  CGSCWA  +   M+D
Sbjct: 62  TINPPSEFETIFHEDDG-KDLPEEFDARKQWSKCESIKEIRDQSGCGSCWAVSSASVMSD 120

Query: 525 RVCTYSNGTKHFHFSAED 578
           R+C  S+       SA D
Sbjct: 121 RICIQSDQKNQLRISAAD 138


>UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Cathepsin
           B - Fasciola gigantica (Giant liver fluke)
          Length = 339

 Score =  101 bits (242), Expect = 1e-20
 Identities = 56/143 (39%), Positives = 73/143 (51%), Gaps = 4/143 (2%)
 Frame = +3

Query: 234 SDEFINTINLKQN-SWKAGRNFPRDTSFAHLKKIMGVIED---EHFATLPIKTHKIDLIA 401
           SDE I  +N +   SWKA R+  R ++  H K  +G + +   E  A  P   H I    
Sbjct: 27  SDELIRFVNEESGASWKAARS-TRFSNVDHFKLHLGALSETPEERNALRPTIKHDISK-N 84

Query: 402 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDX 581
            LPE+FD R +WP C T++E+RDQ SCGSCWA  A  AM+DRVC +SNG      +A D 
Sbjct: 85  DLPESFDARSQWPQCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAADP 144

Query: 582 XXXXXXXXXXXXXXXXXXAWEYW 650
                             AW+YW
Sbjct: 145 LSCCTYCGQGCRGGYPPKAWDYW 167


>UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase
           precursor; n=28; Bilateria|Rep: Cathepsin B-like
           cysteine proteinase precursor - Schistosoma japonicum
           (Blood fluke)
          Length = 342

 Score =  101 bits (242), Expect = 1e-20
 Identities = 58/145 (40%), Positives = 74/145 (51%), Gaps = 4/145 (2%)
 Frame = +3

Query: 228 PLSDEFINTINLKQNS-WKAGRNFPRDTSFAHLKKIMGVI-EDEHFAT--LPIKTHKIDL 395
           PLSDE I+ IN   ++ WKA ++  R  S    + +MG   ED        P   H  DL
Sbjct: 29  PLSDEMISFINEHPDAGWKADKS-DRFHSLDDARILMGARKEDAEMKRNRRPTVDHH-DL 86

Query: 396 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 575
              +P  FD R KWP C +++++RDQ  CGSCWAFGAVEAMTDR+C  S G +    SA 
Sbjct: 87  NVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSAL 146

Query: 576 DXXXXXXXXXXXXXXXXXXXAWEYW 650
           D                   AW+YW
Sbjct: 147 DLISCCKDCGDGCQGGFPGVAWDYW 171


>UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA;
           n=1; Tribolium castaneum|Rep: PREDICTED: similar to
           CG10992-PA - Tribolium castaneum
          Length = 325

 Score =  101 bits (241), Expect = 2e-20
 Identities = 56/159 (35%), Positives = 81/159 (50%), Gaps = 3/159 (1%)
 Frame = +3

Query: 183 VTLVCVLAA--AKDLPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEH 356
           +T +C L    +   P+  S + I  IN +Q SWKA  N             +G+  D +
Sbjct: 4   ITFLCALTLPLSWSKPNTSSLQVIQEINSEQISWKAETNC---LDIKSRLGFLGLHPDPN 60

Query: 357 FATLPIKTHKIDLIASLPENFDPRDKWPDCP-TLNEVRDQGSCGSCWAFGAVEAMTDRVC 533
           +  +  K HKI  I S+PE+FD R+KWP+C   + ++R+QG+CGSCWAF + E MTDR+C
Sbjct: 61  YK-IQTKQHKISRIISIPESFDAREKWPECKDVIGKIRNQGNCGSCWAFASTEVMTDRLC 119

Query: 534 TYSNGTKHFHFSAEDXXXXXXXXXXXXXXXXXXXAWEYW 650
             S G   F FS E+                   AW+Y+
Sbjct: 120 ISSKGKIKFVFSPENLLTCCKDCGCGCKGGYIKNAWDYY 158


>UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6
           precursor; n=11; Bilateria|Rep: Cathepsin B-like
           cysteine proteinase 6 precursor - Caenorhabditis elegans
          Length = 379

 Score = 98.7 bits (235), Expect = 1e-19
 Identities = 50/143 (34%), Positives = 69/143 (48%), Gaps = 5/143 (3%)
 Frame = +3

Query: 237 DEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFATLPIK-----THKIDLIA 401
           D+ I+ +N  QN W A +     + +    K    +   +   L +K     +   DL  
Sbjct: 44  DDLIDYVNENQNLWTAKKQRRFSSVYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDL 103

Query: 402 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDX 581
            +PE+FD RD WP C ++  +RDQ SCGSCWAFGAVEAM+DR+C  S+G      SA+D 
Sbjct: 104 DIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDL 163

Query: 582 XXXXXXXXXXXXXXXXXXAWEYW 650
                             AW YW
Sbjct: 164 LSCCKSCGFGCNGGDPLAAWRYW 186


>UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1;
           Rhipicephalus appendiculatus|Rep: Midgut cysteine
           proteinase 1 - Rhipicephalus appendiculatus (Brown ear
           tick)
          Length = 332

 Score = 98.3 bits (234), Expect = 1e-19
 Identities = 48/121 (39%), Positives = 68/121 (56%), Gaps = 4/121 (3%)
 Frame = +3

Query: 228 PLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFATLPIKTH----KIDL 395
           PLS+E IN IN    +WKAGRNF  D   +H   + G             +H    + D 
Sbjct: 26  PLSEEMINFINSINTTWKAGRNF--DEKRSHSDCVQGGDGASVLTATSTSSHFTSYEEDS 83

Query: 396 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 575
             + PE+F PR+ W  C ++  +RDQ +CGSCWAF A E+++DR+C ++NG    + SAE
Sbjct: 84  RWTCPESFTPREYWSHCSSIRVIRDQSACGSCWAFAAAESISDRICIHTNGKVQVNISAE 143

Query: 576 D 578
           D
Sbjct: 144 D 144


>UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep:
           Cathepsin b - Aedes aegypti (Yellowfever mosquito)
          Length = 332

 Score = 95.1 bits (226), Expect = 1e-18
 Identities = 43/130 (33%), Positives = 69/130 (53%), Gaps = 1/130 (0%)
 Frame = +3

Query: 192 VCVLAAAKDL-PHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFATL 368
           V V+A ++ L   P +D F+  +     +W     F     F + + + G+ E +    L
Sbjct: 13  VVVIARSERLGDDPFNDGFLAQVQRHAKTWTPDATFRDGIRFENFQNMKGIFESKIGFRL 72

Query: 369 PIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 548
           P K H +     +PE FD R+KWP C +++ +++QG CG+CWA  AV  M+DR+C +S G
Sbjct: 73  PTKRHDVAYNMDIPEFFDAREKWPYCKSISTIKNQGLCGACWAVAAVSVMSDRLCIHSEG 132

Query: 549 TKHFHFSAED 578
                 +AED
Sbjct: 133 KFDVELAAED 142


>UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase
           precursor; n=29; Schistosomatidae|Rep: Cathepsin B-like
           cysteine proteinase precursor - Schistosoma mansoni
           (Blood fluke)
          Length = 340

 Score = 95.1 bits (226), Expect = 1e-18
 Identities = 54/145 (37%), Positives = 71/145 (48%), Gaps = 4/145 (2%)
 Frame = +3

Query: 228 PLSDEFINTINLKQNS-WKAGRNFPRDTSFAHLKKIMGVIEDE---HFATLPIKTHKIDL 395
           PLSD+ I+ IN   N+ W+A ++  R  S    +  MG   +E        P   H  D 
Sbjct: 28  PLSDDIISYINEHPNAGWRAEKS-NRFHSLDDARIQMGARREEPDLRRKRRPTVDHN-DW 85

Query: 396 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 575
              +P NFD R KWP C ++  +RDQ  CGSCW+FGAVEAM+DR C  S G ++   SA 
Sbjct: 86  NVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCWSFGAVEAMSDRSCIQSGGKQNVELSAV 145

Query: 576 DXXXXXXXXXXXXXXXXXXXAWEYW 650
           D                   AW+YW
Sbjct: 146 DLLTCCESCGLGCEGGILGPAWDYW 170


>UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis
           sinensis|Rep: Cathepsin B5 - Clonorchis sinensis
          Length = 343

 Score = 94.7 bits (225), Expect = 2e-18
 Identities = 50/127 (39%), Positives = 64/127 (50%), Gaps = 2/127 (1%)
 Frame = +3

Query: 276 WKAGRNFPRDTSFAHLKKIMGVIED--EHFATLPIKTHKIDLIASLPENFDPRDKWPDCP 449
           W +GR  P+      L  + G   +  E  A  P   H       LP+NFD R  WP C 
Sbjct: 42  WISGR-LPKRFESGDLIHMFGAKRETREQKAQRPTLRHDGFDNMRLPKNFDARKTWPHCS 100

Query: 450 TLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDXXXXXXXXXXXXXXXXX 629
           +++E+RDQ SCGSCWAFGAVEAM+DR+C +SNG  +   SA D                 
Sbjct: 101 SISEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLLSCCKDCGFGCRGGYP 160

Query: 630 XXAWEYW 650
             AW+YW
Sbjct: 161 AVAWDYW 167


>UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=1;
           Nilaparvata lugens|Rep: Cathepsin B-like protease
           precursor - Nilaparvata lugens (Brown planthopper)
          Length = 347

 Score = 93.5 bits (222), Expect = 4e-18
 Identities = 48/140 (34%), Positives = 79/140 (56%), Gaps = 6/140 (4%)
 Frame = +3

Query: 177 AYVTLVCVLAAAKDLPHPLSDEFINTINLKQNS-WKAGRNFPRDTSFAHLKKIMGVIE-D 350
           A V+ +  L   ++    +++++I+ IN    S WKAG NF  DT  ++L+ ++GV E +
Sbjct: 10  AVVSAISALPDQENTVREIANKWIDAINNNPKSTWKAGHNFHPDTPMSYLQGLLGVSELE 69

Query: 351 EHFATLP----IKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
            + A L     ++ ++ +    +P+ FD R KW  C +L E+RDQG+CGSCWA     A 
Sbjct: 70  SNLADLDKYEEMEENEENKKIKVPKYFDARKKWKKCKSLREIRDQGNCGSCWAVSVAAAF 129

Query: 519 TDRVCTYSNGTKHFHFSAED 578
            DR+C  SN   + H S+ +
Sbjct: 130 ADRLCIASNAKWNGHISSRE 149


>UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4
           precursor; n=5; Caenorhabditis|Rep: Cathepsin B-like
           cysteine proteinase 4 precursor - Caenorhabditis elegans
          Length = 335

 Score = 93.5 bits (222), Expect = 4e-18
 Identities = 54/161 (33%), Positives = 77/161 (47%), Gaps = 5/161 (3%)
 Frame = +3

Query: 180 YVTLVCVLAAAKDLPHPL----SDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIE 347
           Y+ L  ++A    L  PL     +     +N KQ+ WKA    P+D +   +KK +   E
Sbjct: 3   YLILAALVAVTAGLVIPLVPKTQEAITEYVNSKQSLWKA--EIPKDITIEQVKKRLMRTE 60

Query: 348 DEHFATLPIKTHKIDLIA-SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTD 524
                T  ++  K D+   ++P  FD R +WP+C ++N +RDQ  CGSCWAF A EA +D
Sbjct: 61  FVAPHTPDVEVVKHDINEDTIPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEAASD 120

Query: 525 RVCTYSNGTKHFHFSAEDXXXXXXXXXXXXXXXXXXXAWEY 647
           R C  SNG  +   SAED                   AW+Y
Sbjct: 121 RFCIASNGAVNTLLSAEDVLSCCSNCGYGCEGGYPINAWKY 161


>UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep:
           Cathepsin b - Aedes aegypti (Yellowfever mosquito)
          Length = 386

 Score = 90.6 bits (215), Expect = 3e-17
 Identities = 48/126 (38%), Positives = 62/126 (49%)
 Frame = +3

Query: 273 SWKAGRNFPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPT 452
           +W+AG N P+  +       M  +E      L I     DL   LP+ FD R+KWP+CP+
Sbjct: 85  TWRAGSN-PKPPAGYRSGVNMADLERTKLP-LGIMADVEDL--DLPDTFDAREKWPECPS 140

Query: 453 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDXXXXXXXXXXXXXXXXXX 632
           L E+RDQG CGSCWA  A  AMTDR C  S G + F F + D                  
Sbjct: 141 LREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLLSCCHSCGQGCRGGTLG 200

Query: 633 XAWEYW 650
            AW++W
Sbjct: 201 PAWQFW 206


>UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15;
           Aphidoidea|Rep: Cathepsin B-S precursor - Tuberaphis
           styraci
          Length = 349

 Score = 89.8 bits (213), Expect = 5e-17
 Identities = 52/166 (31%), Positives = 80/166 (48%), Gaps = 7/166 (4%)
 Frame = +3

Query: 174 AAYVTLVCVLAAAKDLPHP----LSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGV 341
           A +VT+VC +  +  L  P    LSDE I  IN    +WKA R FP +TS  +   ++G 
Sbjct: 2   AKFVTIVCAIFVSVYLAEPTLQFLSDERIKYINEVAKTWKAERYFPANTSEEYFIGLLGS 61

Query: 342 IEDEHFATLPIKTHKIDLIA---SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVE 512
              +++ T  ++  K D +    + P+ FD R+ W  C  +  +RDQG+CGSCW+F    
Sbjct: 62  RGYKNY-TNEVEIKKYDPLYVENNSPKQFDSRENWKSCKQIGHIRDQGNCGSCWSFSTTG 120

Query: 513 AMTDRVCTYSNGTKHFHFSAEDXXXXXXXXXXXXXXXXXXXAWEYW 650
           A  DR+C  + G  +   S E+                   AW+Y+
Sbjct: 121 AFADRLCVSTGGKFNQLLSPEELAFCCMDCGKGCGGGYPIKAWKYF 166


>UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8.4;
           n=1; Caenorhabditis elegans|Rep: Putative
           uncharacterized protein W07B8.4 - Caenorhabditis elegans
          Length = 335

 Score = 88.6 bits (210), Expect = 1e-16
 Identities = 54/132 (40%), Positives = 74/132 (56%), Gaps = 1/132 (0%)
 Frame = +3

Query: 186 TLVCVLAAAKDLPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFAT 365
           +L+ +LAA+  +  P +  FIN IN  Q  W A       T+   +K +M V   EH A 
Sbjct: 7   SLLFILAASA-VVLPRNKLFINHINSAQKLWTAEHY----TTPFEVKNLMKV---EHVAA 58

Query: 366 LPIKTHKIDLIA-SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYS 542
              K  K+   A S+P+++D RD WP C ++N +RDQ  CGSCWA  A EA++DR C  S
Sbjct: 59  HLDKDIKLAETADSIPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIAS 118

Query: 543 NGTKHFHFSAED 578
           NG  +   SAED
Sbjct: 119 NGDVNTLLSAED 130


>UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8;
           Trypanosoma|Rep: Cathepsin B-like cysteine protease -
           Trypanosoma brucei
          Length = 340

 Score = 87.8 bits (208), Expect = 2e-16
 Identities = 52/140 (37%), Positives = 79/140 (56%), Gaps = 6/140 (4%)
 Frame = +3

Query: 177 AYVTLVCVLAA--AKDLPHPLSDEFINTIN-LKQNSWKAGRN-FPRDTSFAHLKKIMGVI 344
           A   +V V AA  A+D P  LS  F++ +N L +  WKA  +   ++ +    K++ GVI
Sbjct: 13  ASTAVVAVNAALVAEDAP-VLSKAFVDRVNRLNRGIWKAKYDGVMQNITLREAKRLNGVI 71

Query: 345 EDEHFATLPIKTH--KIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           +  + A++  K    + +  A LP +FD  + WP+CPT+ ++ DQ +CGSCWA  A  AM
Sbjct: 72  KKNNNASILPKRRFTEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAM 131

Query: 519 TDRVCTYSNGTKHFHFSAED 578
           +DR CT   G +  H SA D
Sbjct: 132 SDRFCT-MGGVQDVHISAGD 150


>UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n=8;
           Strongylida|Rep: Cathepsin B-like cysteine protease 2 -
           Parelaphostrongylus tenuis
          Length = 344

 Score = 87.4 bits (207), Expect = 3e-16
 Identities = 36/82 (43%), Positives = 49/82 (59%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDXX 584
           +P++FD R +WP CP+++ +RDQ  CGSCWAFG+ EAM+DRVC  S+G K    SA+D  
Sbjct: 94  IPDSFDARVQWPHCPSISYIRDQSQCGSCWAFGSAEAMSDRVCIASHGNKTVELSADDIL 153

Query: 585 XXXXXXXXXXXXXXXXXAWEYW 650
                            AWEY+
Sbjct: 154 SCCYDCGDGCDGGYPISAWEYF 175


>UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3
           precursor; n=4; Caenorhabditis|Rep: Cathepsin B-like
           cysteine proteinase 3 precursor - Caenorhabditis elegans
          Length = 370

 Score = 87.0 bits (206), Expect = 3e-16
 Identities = 45/116 (38%), Positives = 63/116 (54%), Gaps = 5/116 (4%)
 Frame = +3

Query: 246 INTINLKQNSWKAGRNFPRDTSFAHLKKIMGV-----IEDEHFATLPIKTHKIDLIASLP 410
           ++ +N  Q SW A  N    + F    K+M V     +E +      +      +   LP
Sbjct: 36  VDHVNTVQTSWVAEHN--EISEFEMKFKVMDVKFAEPLEKDSDVASELFVRGEIVPEPLP 93

Query: 411 ENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 578
           + FD R+KWPDC T+  +R+Q +CGSCWAFGA E ++DRVC  SNGT+    S ED
Sbjct: 94  DTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVED 149


>UniRef50_Q23FP9 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 340

 Score = 85.0 bits (201), Expect = 1e-15
 Identities = 45/120 (37%), Positives = 67/120 (55%), Gaps = 4/120 (3%)
 Frame = +3

Query: 231 LSDEFINTINLKQNS-WKAGR--NFPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIA 401
           +S   +  +N   NS WKA R  +F + T    L   +G +++  +  LP K    +  A
Sbjct: 27  MSPFIVFEVNSNPNSTWKAARYPHFEKMTR-EQLLGHLGSLDEPDWVKLPTKEFDPNANA 85

Query: 402 S-LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 578
             +PE FD R++WP+C ++  +RDQ +CGSCWAF A E  +DR+C  SN T     S+ED
Sbjct: 86  DPIPEFFDAREQWPNCQSIKLIRDQSTCGSCWAFAATETFSDRICIASNQTLQTSISSED 145


>UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core
           eudicotyledons|Rep: At1g02300/T6A9_10 - Arabidopsis
           thaliana (Mouse-ear cress)
          Length = 362

 Score = 84.6 bits (200), Expect = 2e-15
 Identities = 43/109 (39%), Positives = 61/109 (55%), Gaps = 4/109 (3%)
 Frame = +3

Query: 231 LSDEFINTINLKQNS-WKAGRNFP-RDTSFAHLKKIMGV--IEDEHFATLPIKTHKIDLI 398
           L +E +  +N   N+ WKA  N    + + A  K+++GV       F  +PI +H I L 
Sbjct: 46  LQNEIVKEVNENPNAGWKASFNDRFANATVAEFKRLLGVKPTPKTEFLGVPIVSHDISL- 104

Query: 399 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 545
             LP+ FD R  W  C ++  + DQG CGSCWAFGAVE+++DR C   N
Sbjct: 105 -KLPKEFDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYN 152


>UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8;
           Leishmania|Rep: Cathepsin B-like protease - Leishmania
           major
          Length = 340

 Score = 84.6 bits (200), Expect = 2e-15
 Identities = 46/122 (37%), Positives = 64/122 (52%), Gaps = 4/122 (3%)
 Frame = +3

Query: 186 TLVCVLAAAKDLPHPLSDEFINTINLK-QNSWKAGRN---FPRDTSFAHLKKIMGVIEDE 353
           T+  + A   D P  L   F+  +N K +  W A  N        S   ++K+MGV +  
Sbjct: 22  TVSGLYAKPSDFPL-LGKSFVAEVNSKAKGQWTASANNGYLVTGKSLGEVRKLMGVTDMS 80

Query: 354 HFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 533
             A  P      +L   LPE FD  + WP C T++E+RDQ +CGSCWA  AVEA++DR C
Sbjct: 81  TEAVPPRNFSVEELQQDLPEFFDAAEHWPMCLTISEIRDQSNCGSCWAIAAVEAISDRYC 140

Query: 534 TY 539
           T+
Sbjct: 141 TF 142


>UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115,
           whole genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_115,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 332

 Score = 83.4 bits (197), Expect = 4e-15
 Identities = 45/131 (34%), Positives = 69/131 (52%), Gaps = 2/131 (1%)
 Frame = +3

Query: 192 VCVLAAAKDLPHPLSDEFINTINLKQNSWKAGRNFPR--DTSFAHLKKIMGVIEDEHFAT 365
           +C++ +     +P    F+N+I   + +W A  N+ R  + S     K   VI D H   
Sbjct: 5   ICLIISLVSARNPFITAFVNSI---KTTWTA-TNYERWNEKSDGFYSKYFNVIVD-HSEP 59

Query: 366 LPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 545
           +  K H  + + +LP +F  ++KWP CP++  + DQG+CGSCWA  A   M+DR+C  S 
Sbjct: 60  VEYKYH--EKLENLPPSFSAQEKWPGCPSIELIPDQGNCGSCWAVSAASTMSDRLCIASG 117

Query: 546 GTKHFHFSAED 578
            T     SAED
Sbjct: 118 QTDKRQISAED 128


>UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep:
           Cathepsin B - Uronema marinum
          Length = 350

 Score = 81.8 bits (193), Expect = 1e-14
 Identities = 45/116 (38%), Positives = 65/116 (56%), Gaps = 3/116 (2%)
 Frame = +3

Query: 240 EFINTINLKQNSWKAGRNFPRD-TSFAHLKKIMGVIEDEHFATLPIKTHKIDLIA--SLP 410
           E +N  N   ++WKAG N   +  SF  ++ +MG I          +    + I   SLP
Sbjct: 29  EEVNNYNTG-STWKAGYNKRFEGMSFDQIQAMMGTIATPVHMIPDERYTPFETIQNLSLP 87

Query: 411 ENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 578
           E+FD R+ +P C +L +VRDQ +CGSCWAFG VEA++DR+C  S        S+E+
Sbjct: 88  ESFDLREAYPKCESLQQVRDQSNCGSCWAFGTVEAISDRICIASGQKDQTRISSEN 143


>UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gambiae
           str. PEST|Rep: ENSANGP00000012227 - Anopheles gambiae
           str. PEST
          Length = 218

 Score = 81.4 bits (192), Expect = 2e-14
 Identities = 31/58 (53%), Positives = 43/58 (74%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 578
           +PE+FD R+ WP+C +L  +R+QG+CGSCWA  A   M+DRVC +SNGT +   +AED
Sbjct: 1   IPESFDARNHWPNCESLRAIRNQGTCGSCWAVAAASVMSDRVCIHSNGTINVALAAED 58


>UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep:
           Cathepsin B - Triticum aestivum (Wheat)
          Length = 353

 Score = 79.8 bits (188), Expect = 5e-14
 Identities = 42/109 (38%), Positives = 58/109 (53%), Gaps = 4/109 (3%)
 Frame = +3

Query: 231 LSDEFINTINLKQNS-WKAGRN-FPRDTSFAHLKKIMGVIEDEH--FATLPIKTHKIDLI 398
           +  + I T+N   N+ W AG N +  + +    K I+GV        A +PIK H     
Sbjct: 38  IQKDIIQTVNKHPNAGWTAGHNPYFANYTIEQFKHILGVKPTPPGLLAGVPIKIHPE--- 94

Query: 399 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 545
             LP+ FD R +W  C T+  + DQG CG+CWAF AVEA+ DR C + N
Sbjct: 95  MDLPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVEALQDRFCIHLN 143


>UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4;
           Ancylostomatidae|Rep: Cysteine proteinase - Ancylostoma
           ceylanicum
          Length = 348

 Score = 79.8 bits (188), Expect = 5e-14
 Identities = 43/108 (39%), Positives = 63/108 (58%), Gaps = 6/108 (5%)
 Frame = +3

Query: 243 FINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIAS------ 404
           F++ IN +Q+ ++A  + P    F    +IM    D  FA  P KT    ++A+      
Sbjct: 40  FVDYINQQQSFFRAEYS-PDAEEFVR-NRIM----DVKFAVDPEKTEPNYVLANTEMKVD 93

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 548
           +P+ FD RD+WP+C ++  +RDQ SCGSCWA  A  AM+DRVC  +NG
Sbjct: 94  IPDTFDARDRWPNCTSMKHIRDQSSCGSCWAVAAASAMSDRVCALTNG 141


>UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7;
           Rhabditida|Rep: Cysteine proteinase 3 - Necator
           americanus (Human hookworm)
          Length = 360

 Score = 79.8 bits (188), Expect = 5e-14
 Identities = 35/93 (37%), Positives = 46/93 (49%)
 Frame = +3

Query: 372 IKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT 551
           +K   +D    +P +FD RDKWP C ++  +RDQ  CGSCWA  + E M+DR+C  SNGT
Sbjct: 79  LKEEDMDFSEEIPVSFDARDKWPKCTSIGFIRDQSHCGSCWAVSSAETMSDRLCVQSNGT 138

Query: 552 KHFHFSAEDXXXXXXXXXXXXXXXXXXXAWEYW 650
                S  D                   AWEY+
Sbjct: 139 IKVLLSDTDILACCPNCGAGCGGGHTIRAWEYF 171


>UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin
           B-like cysteine proteinase 4 precursor (Cysteine
           protease-related 4); n=2; Tribolium castaneum|Rep:
           PREDICTED: similar to Cathepsin B-like cysteine
           proteinase 4 precursor (Cysteine protease-related 4) -
           Tribolium castaneum
          Length = 360

 Score = 78.6 bits (185), Expect = 1e-13
 Identities = 47/136 (34%), Positives = 65/136 (47%), Gaps = 1/136 (0%)
 Frame = +3

Query: 246 INTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLPENFDP 425
           IN IN +Q++W AG N P D   + L   +G+  D +F    IK  +      +PE FD 
Sbjct: 23  INQINSQQSAWTAGIN-PFDDIESRLG-FLGIHPDPNFKP-EIKEPQATQNV-IPETFDA 78

Query: 426 RDKWPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDXXXXXXXX 602
           R+ WP+C  +   +R+QG C S WAF A E M+DR+C  +NG      S ED        
Sbjct: 79  REYWPECADIIGNIRNQGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPEDLIDCCHYC 138

Query: 603 XXXXXXXXXXXAWEYW 650
                      AW Y+
Sbjct: 139 GNQCKGGYTYYAWNYF 154


>UniRef50_Q237A1 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 346

 Score = 78.6 bits (185), Expect = 1e-13
 Identities = 44/108 (40%), Positives = 61/108 (56%), Gaps = 3/108 (2%)
 Frame = +3

Query: 225 HPLSDEFINTINLKQNSWKAGRNFPR-DTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIA 401
           H    + I  +N   ++WKAG N    ++  A +K  MGV   +      IK   +   A
Sbjct: 34  HDKLKQIIQKVNSSNSTWKAGENTKWINSDIAGVKAHMGVKLGQESG---IKLETVSAQA 90

Query: 402 S-LPENFDPRDKWPD-CPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTY 539
           + LPE FD R +W D C +L EVRDQ +CGSCWAFGA E+++DR C +
Sbjct: 91  NGLPEEFDARVQWGDKCSSLWEVRDQSTCGSCWAFGAAESLSDRHCIH 138


>UniRef50_Q00W63 Cluster: Cysteine proteinase; n=2;
           Ostreococcus|Rep: Cysteine proteinase - Ostreococcus
           tauri
          Length = 362

 Score = 78.2 bits (184), Expect = 2e-13
 Identities = 40/90 (44%), Positives = 50/90 (55%), Gaps = 2/90 (2%)
 Frame = +3

Query: 309 SFAHLKKI-MGVIEDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTL-NEVRDQGSC 482
           SF   K   MG +ED    T      K+     LP+ FD R+KWP C  L +E  DQG+C
Sbjct: 55  SFGRRKSARMGSLEDRLAKTWDPTKIKLHAGGRLPDTFDVREKWPKCAALVSEAVDQGAC 114

Query: 483 GSCWAFGAVEAMTDRVCTYSNGTKHFHFSA 572
           GSCWA    +AMTDR+C  +NG  + H SA
Sbjct: 115 GSCWAVAPAKAMTDRLCIATNGAVNTHVSA 144


>UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7;
           n=2; Haemonchidae|Rep: Cathepsin B-like cysteine
           protease GCP7 - Haemonchus contortus (Barber pole worm)
          Length = 348

 Score = 78.2 bits (184), Expect = 2e-13
 Identities = 30/58 (51%), Positives = 40/58 (68%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 578
           +PE+FD R+KW DCP+L  + DQ +CGSCWA  A + M+DR+C +S G K    SA D
Sbjct: 96  IPESFDSREKWKDCPSLRVIPDQSNCGSCWAVSAAQCMSDRLCIHSQGRKKVLLSATD 153


>UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator
           americanus|Rep: Cysteine proteinase 4 - Necator
           americanus (Human hookworm)
          Length = 339

 Score = 76.2 bits (179), Expect = 6e-13
 Identities = 44/121 (36%), Positives = 64/121 (52%), Gaps = 3/121 (2%)
 Frame = +3

Query: 225 HPLSDE-FINTINLKQNSWKAGRNFPRDTSF--AHLKKIMGVIEDEHFATLPIKTHKIDL 395
           H LS +  ++ +N  Q+ +K   + P +  F  A +  I  + E  H    P K   I+L
Sbjct: 30  HGLSGQALVDYVNSHQSLFKTEYS-PTNEQFVKARIMDIKYMTEASH--KYPRKG--INL 84

Query: 396 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 575
              LPE FD R+KWP C ++  +RD  +CGSCWA  A   M+DR+C  +NGT     S+ 
Sbjct: 85  NVELPERFDAREKWPHCASIGLIRDHSACGSCWAVSAASVMSDRLCIQTNGTNQKILSSA 144

Query: 576 D 578
           D
Sbjct: 145 D 145


>UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 314

 Score = 74.5 bits (175), Expect = 2e-12
 Identities = 46/124 (37%), Positives = 69/124 (55%), Gaps = 2/124 (1%)
 Frame = +3

Query: 180 YVTLVCVLAAAKDLPHPLSDEFINTINL-KQNSWKAGRNFPRD-TSFAHLKKIMGVIEDE 353
           Y   VC L +  D P  L D  IN+IN  K++SW A RN   +  +F  +  +MG  +  
Sbjct: 15  YFASVC-LGSFLDKP-VLDDNLINSINNNKKSSWTAHRNKNFEGKTFGDIIGMMGTKKTA 72

Query: 354 HFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 533
             A   +  +  +L  S+P +FD R +WPDC  ++ + +Q  CGSCWAF + E ++DR+C
Sbjct: 73  --APFKLTENGEELKGSIPTSFDSRVQWPDC--IHPILNQEQCGSCWAFSSSEVLSDRLC 128

Query: 534 TYSN 545
             SN
Sbjct: 129 IASN 132


>UniRef50_P91991 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 356

 Score = 73.3 bits (172), Expect = 5e-12
 Identities = 39/109 (35%), Positives = 55/109 (50%), Gaps = 1/109 (0%)
 Frame = +3

Query: 255 INLKQNSWKAGRN-FPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLPENFDPRD 431
           +N KQ  WKA  +        A  K I  +  ++  +    KT   +++  +P +FD R 
Sbjct: 44  VNKKQKLWKAETSRMTFQEKMARAKSIKFIKSNDEVSE---KTGNDNVLVDIPSSFDSRQ 100

Query: 432 KWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 578
           KWP C  +  VRDQ  CGS     AVE  +DR C  SNGT ++  SA+D
Sbjct: 101 KWPSCSQIGAVRDQSDCGSAAHLVAVEIASDRTCIASNGTFNWPLSAQD 149


>UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 421

 Score = 72.9 bits (171), Expect = 6e-12
 Identities = 29/60 (48%), Positives = 40/60 (66%)
 Frame = +3

Query: 399 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 578
           + +P+NFD R KWP+CP+++ V +QG CGSC+A  A    +DR C +SNGT     S ED
Sbjct: 136 SDVPKNFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSNGTFKSLLSEED 195


>UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8.1;
           n=1; Caenorhabditis elegans|Rep: Putative
           uncharacterized protein W07B8.1 - Caenorhabditis elegans
          Length = 335

 Score = 72.9 bits (171), Expect = 6e-12
 Identities = 42/137 (30%), Positives = 70/137 (51%), Gaps = 1/137 (0%)
 Frame = +3

Query: 171 RAAYVTLVCVLAAAKDLPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIED 350
           R   + L+ VL  A  +P    D  I+ +N ++ +W AG   P  +  + LK +   + D
Sbjct: 2   RKILICLIGVLFQADGVPPSEIDRIIHYVNSQKTTWTAG--IPALSRNSMLKTL---VTD 56

Query: 351 EHFATLPIKTHKIDLIAS-LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 527
                  I+   +    S L  +FD R++WP+C ++ ++ D   C + WAF A E+M+DR
Sbjct: 57  AATIGFKIQNFGVSQANSDLSPSFDARERWPECMSIPQINDISECKTSWAFAAAESMSDR 116

Query: 528 VCTYSNGTKHFHFSAED 578
           +C  S G K+   SAE+
Sbjct: 117 LCINSGGFKNTILSAEE 133


>UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1
           precursor; n=3; Haemonchidae|Rep: Cathepsin B-like
           cysteine proteinase 1 precursor - Ostertagia ostertagi
          Length = 341

 Score = 71.3 bits (167), Expect = 2e-11
 Identities = 27/58 (46%), Positives = 39/58 (67%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 578
           +PE++DPR +W +C +L  + DQ +CGSCWA  +  AM+DR+C  S G K    SA+D
Sbjct: 91  IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQD 148


>UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|Rep:
           Cathepsin B - Streblomastix strix
          Length = 312

 Score = 67.3 bits (157), Expect = 3e-10
 Identities = 24/51 (47%), Positives = 33/51 (64%)
 Frame = +3

Query: 396 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 548
           +A+LP+ FD R  WP+C  + ++ DQG CGSCWA  + E + DR C  S G
Sbjct: 73  VANLPDEFDSRTNWPNCQLIGKIYDQGHCGSCWAMSSFEVLQDRFCIKSEG 123


>UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 311

 Score = 67.3 bits (157), Expect = 3e-10
 Identities = 37/117 (31%), Positives = 65/117 (55%), Gaps = 2/117 (1%)
 Frame = +3

Query: 231 LSDEFINTINLKQNSWKAGRNFPR--DTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIAS 404
           +S + ++ IN     W+A   +P+  + +F   K ++G        +LP +  ++ +  +
Sbjct: 25  ISRDLVDKINTLNVGWEATL-YPQFENLTFESAKSMLGSRGAWPEGSLPPEI-EVRVAEN 82

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 575
           +PENFD R +WP   +++ +R+QG CGSCWAFGA E ++DR    S    +   SA+
Sbjct: 83  IPENFDARKQWPG--SIHPIRNQGQCGSCWAFGASEVLSDRFAIASKNQIYVTLSAQ 137


>UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2
           precursor; n=8; Haemonchus contortus|Rep: Cathepsin
           B-like cysteine proteinase 2 precursor - Haemonchus
           contortus (Barber pole worm)
          Length = 342

 Score = 67.3 bits (157), Expect = 3e-10
 Identities = 32/85 (37%), Positives = 45/85 (52%)
 Frame = +3

Query: 324 KKIMGVIEDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFG 503
           +KIM +        L +K    D    +P ++DPRD W +C T   +RDQ +CGSCWA  
Sbjct: 61  QKIMSIKYKHQKLNLMVKEDP-DPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVS 118

Query: 504 AVEAMTDRVCTYSNGTKHFHFSAED 578
              A++DR+C  S   K  + SA D
Sbjct: 119 TAAAISDRICIASKAEKQVNISATD 143


>UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1;
           Ostreococcus tauri|Rep: Cysteine proteinase Cathepsin F
           - Ostreococcus tauri
          Length = 498

 Score = 66.5 bits (155), Expect = 5e-10
 Identities = 29/50 (58%), Positives = 33/50 (66%), Gaps = 1/50 (2%)
 Frame = +3

Query: 402 SLPENFDPRDKWPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 548
           SLP +FD RD++P C  L   VRDQG CGSCWA  A E M DR+C  S G
Sbjct: 256 SLPRHFDARDEYPKCARLIGTVRDQGKCGSCWAVAATEIMNDRLCISSGG 305


>UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella
           histriomuscorum|Rep: Cathepsin B - Oxytricha trifallax
           (Sterkiella histriomuscorum)
          Length = 294

 Score = 65.3 bits (152), Expect = 1e-09
 Identities = 34/115 (29%), Positives = 57/115 (49%)
 Frame = +3

Query: 183 VTLVCVLAAAKDLPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFA 362
           + ++  + A     HP+++E +  I  K + W+          F ++ K   + +   + 
Sbjct: 4   LVIIGTIVAVAVATHPINEEMVAHIKAKTSLWQPHET--TTNPFNNMTKEQLLAKCGTYI 61

Query: 363 TLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 527
               K +    I ++PENFD R +W     ++ +RDQ  CGSCWAFGA EA +DR
Sbjct: 62  VPANKEYPGSKIMTVPENFDARQQWGS--KIHAIRDQQQCGSCWAFGATEAFSDR 114


>UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep:
           Thiol protease - Trichuris suis
          Length = 348

 Score = 63.7 bits (148), Expect = 4e-09
 Identities = 27/51 (52%), Positives = 33/51 (64%)
 Frame = +3

Query: 393 LIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 545
           L  S+P +FD R  W  C +LN +RDQ  CGSCWA  A E M+DR+C  SN
Sbjct: 80  LALSIPPSFDVRSLWHVC-SLNLIRDQAKCGSCWAVSAAETMSDRICVQSN 129


>UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|Rep:
           Cysteine protease - Giardia muris
          Length = 301

 Score = 63.7 bits (148), Expect = 4e-09
 Identities = 34/83 (40%), Positives = 45/83 (54%), Gaps = 4/83 (4%)
 Frame = +3

Query: 339 VIEDEHFATLPIKTHKIDL----IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGA 506
           +I  E+  +L  +TH   L       LP+++DPR +   C  L EV DQ SCGSCWAF A
Sbjct: 51  LIPVENLRSLRTETHVSQLNLGKTKELPKDYDPRVERAHC--LPEVADQASCGSCWAFSA 108

Query: 507 VEAMTDRVCTYSNGTKHFHFSAE 575
           V    DR C Y   +K  H+S +
Sbjct: 109 VATFADRRCAYGLDSKQVHYSEQ 131


>UniRef50_Q625X3 Cluster: Putative uncharacterized protein CBG01102;
           n=1; Caenorhabditis briggsae|Rep: Putative
           uncharacterized protein CBG01102 - Caenorhabditis
           briggsae
          Length = 374

 Score = 63.3 bits (147), Expect = 5e-09
 Identities = 38/121 (31%), Positives = 62/121 (51%), Gaps = 6/121 (4%)
 Frame = +3

Query: 234 SDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLPE 413
           S + IN +N +++ W AG   P+ +    LK +    E   F  L    +  ++ +  PE
Sbjct: 22  STKIINYVNSQKSLWTAGN--PKISKDYMLKTLTTDPETVGFRNLGPTFYSKNIFS--PE 77

Query: 414 N------FDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 575
           N      FD R++WP+C ++  + D   C S WAF A E+M+DR+C  S G  +   SA+
Sbjct: 78  NLDDSNFFDARERWPECSSIPIINDISDCKSSWAFSAAESMSDRLCINSGGMINTVLSAQ 137

Query: 576 D 578
           +
Sbjct: 138 E 138


>UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep:
           Cysteine proteinase - Toxoplasma gondii
          Length = 569

 Score = 62.9 bits (146), Expect = 6e-09
 Identities = 37/101 (36%), Positives = 51/101 (50%), Gaps = 9/101 (8%)
 Frame = +3

Query: 300 RDTSFAHLKKIMGVI----EDEHFAT---LPIKTHKIDLIAS-LPENFDPRDKWPDCP-T 452
           R  S    KK+MG      + E F T   +P+   + +     +P +FD R  +P C   
Sbjct: 231 RYLSLKDAKKLMGTFLVNTKVEGFPTPKGMPLPAKEFENATEPVPAHFDARTAFPACKDV 290

Query: 453 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 575
           +  VRDQG CGSCWAF + EA  DR+C  S G +    SA+
Sbjct: 291 VGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKRLMPLSAQ 331


>UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus
           contortus|Rep: Cysteine proteinase - Haemonchus
           contortus (Barber pole worm)
          Length = 350

 Score = 62.9 bits (146), Expect = 6e-09
 Identities = 23/48 (47%), Positives = 31/48 (64%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 548
           +PE+FD R  W +C ++  VRDQ  CGSCWA  A   M+DR+C  + G
Sbjct: 94  IPESFDSRIVWKNCSSITYVRDQSRCGSCWAVSAASTMSDRICVQTKG 141


>UniRef50_A4RYN2 Cluster: Predicted protein; n=1; Ostreococcus
           lucimarinus CCE9901|Rep: Predicted protein -
           Ostreococcus lucimarinus CCE9901
          Length = 330

 Score = 62.1 bits (144), Expect = 1e-08
 Identities = 32/69 (46%), Positives = 39/69 (56%), Gaps = 4/69 (5%)
 Frame = +3

Query: 354 HFATLPIKTHKIDLIAS---LPENFDPRDKWPDCPTL-NEVRDQGSCGSCWAFGAVEAMT 521
           HF T      K++L A    LP +FD R  +P C  L   VRDQG CGSCWA  A E M 
Sbjct: 92  HFLTRLPALGKVELRAKDNRLPTSFDARVAYPKCSRLLGAVRDQGRCGSCWAVAATEVMN 151

Query: 522 DRVCTYSNG 548
           DR+C  ++G
Sbjct: 152 DRLCVATDG 160


>UniRef50_Q5DEC9 Cluster: SJCHGC06356 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC06356 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 279

 Score = 61.7 bits (143), Expect = 1e-08
 Identities = 29/80 (36%), Positives = 44/80 (55%), Gaps = 1/80 (1%)
 Frame = +3

Query: 342 IEDEHFATLPIKTHKIDLI-ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           IE E+  T  IKT   + I   +P +FD R  W +C T+ ++ D+  C + WA   V+++
Sbjct: 6   IETENIQTKHIKTISHNSINMEIPRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSI 65

Query: 519 TDRVCTYSNGTKHFHFSAED 578
           +DR+C  SNG      SA D
Sbjct: 66  SDRICIRSNGRISVQLSARD 85


>UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|Rep:
           Cathepsin B - Streblomastix strix
          Length = 283

 Score = 58.4 bits (135), Expect = 1e-07
 Identities = 35/101 (34%), Positives = 49/101 (48%), Gaps = 1/101 (0%)
 Frame = +3

Query: 231 LSDEFINTINLKQNSWKAGRNFPRDT-SFAHLKKIMGVIEDEHFATLPIKTHKIDLIASL 407
           L++    TIN   NS     ++P    S   L+  +G     H       ++K+      
Sbjct: 10  LAESIPETINRNPNSTWVAIDYPASVISHEKLRSKLGARFTPHRVRPYRDSNKV------ 63

Query: 408 PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 530
           P+ FD R+KWPD   +  VRDQG CGSCWAF   E + DR+
Sbjct: 64  PDTFDAREKWPDA--ILPVRDQGECGSCWAFSIAETIGDRL 102


>UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_113_4299_5381 - Giardia lamblia ATCC
           50803
          Length = 360

 Score = 56.0 bits (129), Expect = 7e-07
 Identities = 27/75 (36%), Positives = 42/75 (56%)
 Frame = +3

Query: 309 SFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGS 488
           S   +K + G + D     + ++      + + PE++D RD++P C T  EV DQG+CGS
Sbjct: 109 SLDEVKAMFGPLVDTSRPAITMRRSTTPPVGA-PESYDFRDEYPHCIT--EVVDQGNCGS 165

Query: 489 CWAFGAVEAMTDRVC 533
           CWAF +V+   D  C
Sbjct: 166 CWAFSSVQTFADHRC 180


>UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3;
           Giardia intestinalis|Rep: Cathepsin B-like CP1 precursor
           - Giardia lamblia (Giardia intestinalis)
          Length = 303

 Score = 55.6 bits (128), Expect = 1e-06
 Identities = 25/56 (44%), Positives = 34/56 (60%), Gaps = 1/56 (1%)
 Frame = +3

Query: 369 PIKTHKI-DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 533
           PI   ++ +L+  +P  FD RD++P C  +    DQGSCGSCWAF A+    DR C
Sbjct: 66  PISITEVQELVDPIPPQFDFRDEYPQC--VKPALDQGSCGSCWAFSAIGVFGDRRC 119


>UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4;
           Giardia intestinalis|Rep: Cathepsin B-like CP2 precursor
           - Giardia lamblia (Giardia intestinalis)
          Length = 300

 Score = 55.2 bits (127), Expect = 1e-06
 Identities = 25/57 (43%), Positives = 34/57 (59%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 575
           +PE+FD R+++P C  + EV DQG CGSCWAF +V    DR C      K   +S +
Sbjct: 75  VPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVAGLDKKPVKYSPQ 129


>UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Rep:
           Cysteine protease - Solanum lycopersicum (Tomato)
           (Lycopersicon esculentum)
          Length = 345

 Score = 52.0 bits (119), Expect = 1e-05
 Identities = 31/97 (31%), Positives = 52/97 (53%), Gaps = 4/97 (4%)
 Frame = +3

Query: 240 EFINTINLKQN-SWKAGRN-FPRDTSFAHLKKIMGV-IEDEHFATLPIKTHKIDLIASLP 410
           +FI ++N   N S+K G N F   TS   L K  G+ I + + +  P+ + +   I  L 
Sbjct: 68  KFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKKINDLS 127

Query: 411 ENFDPRD-KWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           +++ P +  W +   + +V+ QG CG CWAF AV ++
Sbjct: 128 DDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSL 164


>UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA,
           isoform A; n=2; Endopterygota|Rep: PREDICTED: similar to
           CG3074-PA, isoform A - Tribolium castaneum
          Length = 445

 Score = 51.6 bits (118), Expect = 2e-05
 Identities = 49/184 (26%), Positives = 71/184 (38%), Gaps = 4/184 (2%)
 Frame = +3

Query: 108 IYPSIR--KKVCYNRKTKKMFISRAAYVTLVCVLAAAKDLPHPLSDEFINTINLKQNSWK 281
           +YP  +  KK C   K +KM  ++A    ++C     + L  P   E IN+ N     W 
Sbjct: 100 VYPLNKQIKKNCNVCKCEKMGQNQA---DMLC--EQHQCLIEPSITEAINS-NYANYGWS 153

Query: 282 AGR--NFPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTL 455
           A     F        +K  +G ++ + F        +I    SLP  FD   KWP    +
Sbjct: 154 ASNYSKFWGHKLEEGIKLRLGTLQPQRFVMHMNPVRRIYDPNSLPREFDSEFKWPGW--M 211

Query: 456 NEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDXXXXXXXXXXXXXXXXXXX 635
           +E++DQG CGS WA       +DR    S G +    SA+                    
Sbjct: 212 SEIQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQHLLSCDRRGQQSCNGGYLDR 271

Query: 636 AWEY 647
           AW Y
Sbjct: 272 AWSY 275


>UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2;
           cellular organisms|Rep: Cysteine proteinase, putative -
           Archaeoglobus fulgidus
          Length = 1088

 Score = 51.6 bits (118), Expect = 2e-05
 Identities = 26/60 (43%), Positives = 32/60 (53%)
 Frame = +3

Query: 396 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 575
           +ASLP  FD    W D   L+ VRDQGSCGSCWA  AV A+   +   S  +     S +
Sbjct: 591 MASLPSRFD----WRDYTGLSAVRDQGSCGSCWAHSAVAALESALIVESGASSSIDLSEQ 646


>UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;
           n=1; Strongylocentrotus purpuratus|Rep: PREDICTED:
           hypothetical protein - Strongylocentrotus purpuratus
          Length = 450

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 23/50 (46%), Positives = 28/50 (56%)
 Frame = +3

Query: 399 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 548
           A LPE FD R+ WP    ++EV DQG CGS WA       +DR+   S G
Sbjct: 195 ARLPETFDARENWPGL--IDEVIDQGKCGSSWAISTASVASDRLAIQSMG 242


>UniRef50_UPI0000E4866E Cluster: PREDICTED: similar to oxidized-LDL
           responsive gene 2, partial; n=1; Strongylocentrotus
           purpuratus|Rep: PREDICTED: similar to oxidized-LDL
           responsive gene 2, partial - Strongylocentrotus
           purpuratus
          Length = 363

 Score = 50.8 bits (116), Expect = 3e-05
 Identities = 23/59 (38%), Positives = 34/59 (57%), Gaps = 1/59 (1%)
 Frame = +3

Query: 402 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT-KHFHFSAE 575
           ++PE FD R +WP    +  V++QG+C S WA       +DR+   SNGT K+ H S +
Sbjct: 221 AIPEEFDARAQWPGL--VEGVQNQGNCASSWAMSTAATASDRLAIQSNGTFKYMHLSPQ 277


>UniRef50_Q649T1 Cluster: Cathepsin C; n=1; uncultured archaeon
           GZfos34G5|Rep: Cathepsin C - uncultured archaeon
           GZfos34G5
          Length = 760

 Score = 50.8 bits (116), Expect = 3e-05
 Identities = 33/100 (33%), Positives = 47/100 (47%), Gaps = 3/100 (3%)
 Frame = +3

Query: 228 PLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGV--IEDEHFATLPIKTHKIDLIA 401
           P S+E    I  K   W AG     D +F   K + G+  +      +   +   + L A
Sbjct: 244 PSSEEIQRVIEEKGAKWTAGETSVSDLTFEEKKMLCGIKSLYGLRILSTEERVRVVALDA 303

Query: 402 SLP-ENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           S+P   FD RDK      +  V++QGSCGSC AFG + A+
Sbjct: 304 SVPIGTFDWRDK-DGANWITSVKEQGSCGSCVAFGTIGAL 342


>UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted,
           possible transmembrane domain near N-terminus; n=4;
           Cryptosporidium|Rep: Cryptopain-cysteine proteinase
           secreted, possible transmembrane domain near N-terminus
           - Cryptosporidium parvum Iowa II
          Length = 401

 Score = 50.4 bits (115), Expect = 4e-05
 Identities = 31/103 (30%), Positives = 46/103 (44%), Gaps = 2/103 (1%)
 Frame = +3

Query: 243 FINTINLKQNSWKAGRNFPRDTSFAH-LKKIMGVIEDEHFATLPIKTHKIDLIASLPENF 419
           FI T N +  S+    N   D S    + +  G I+D        K+ ++    S  E  
Sbjct: 116 FIKTTNSQGFSYVLEMNEFGDLSKEEFMARFTGYIKDSKDDERVFKSSRVSASESEEEFV 175

Query: 420 DPRD-KWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 545
            P    W +   +N +R+Q +CGSCWAF AV A+    C  +N
Sbjct: 176 PPNSINWVEAGCVNPIRNQKNCGSCWAFSAVAALEGATCAQTN 218


>UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like protein
           F26E4.3; n=2; Caenorhabditis|Rep: Uncharacterized
           peptidase C1-like protein F26E4.3 - Caenorhabditis
           elegans
          Length = 491

 Score = 50.4 bits (115), Expect = 4e-05
 Identities = 22/48 (45%), Positives = 28/48 (58%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 548
           LPE+FD RDKW   P ++ V DQG CGS W+       +DR+   S G
Sbjct: 223 LPEHFDARDKWG--PLIHPVADQGDCGSSWSVSTTAISSDRLAIISEG 268


>UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1;
           Dictyostelium discoideum AX4|Rep: Counting factor
           associated protein - Dictyostelium discoideum AX4
          Length = 531

 Score = 50.0 bits (114), Expect = 5e-05
 Identities = 32/103 (31%), Positives = 49/103 (47%)
 Frame = +3

Query: 240 EFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLPENF 419
           + I T N K++S+K G N   D S      ++         T     H  + + S+P   
Sbjct: 254 KIIATHNAKESSYKLGMNHYADLSNKEFNTLVKPKVARPSVTGADSVHDDESLRSIPSTV 313

Query: 420 DPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 548
           D R++  +C T   V+DQG CGSCW FG+  ++    C  +NG
Sbjct: 314 DWRNQ--NCVT--PVKDQGICGSCWTFGSTGSLEGTNCV-TNG 351


>UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 323

 Score = 49.2 bits (112), Expect = 8e-05
 Identities = 21/48 (43%), Positives = 30/48 (62%)
 Frame = +3

Query: 402 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 545
           ++P +FD R  W DC  ++ VR+Q SCGSCWA      + DR+C  S+
Sbjct: 45  TIPASFDVRTNWGDC--MSPVREQQSCGSCWAQVTSGILADRMCIESD 90


>UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1;
           Phytophthora infestans|Rep: Cathepsin-like cysteine
           protease - Phytophthora infestans (Potato late blight
           fungus)
          Length = 376

 Score = 47.6 bits (108), Expect = 3e-04
 Identities = 30/96 (31%), Positives = 49/96 (51%), Gaps = 1/96 (1%)
 Frame = +3

Query: 267 QNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFATLPIKTH-KIDLIASLPENFDPRDKWPD 443
           ++S+  G N   D + A  K+++     +  ++   +T  K + +  LP  +D    W +
Sbjct: 86  EHSFTLGLNDLADLADAEYKQLLSYRTRDSKSSSASETFVKPENVEDLPATWD----WRE 141

Query: 444 CPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT 551
             T+  V++QG CGSCWAF AV AM    C Y+  T
Sbjct: 142 HSTVTPVKNQGQCGSCWAFSAVAAME---CAYALST 174


>UniRef50_Q58HK5 Cluster: Hc58; n=1; Haemonchus contortus|Rep: Hc58
           - Haemonchus contortus (Barber pole worm)
          Length = 241

 Score = 47.6 bits (108), Expect = 3e-04
 Identities = 17/29 (58%), Positives = 21/29 (72%)
 Frame = +3

Query: 462 VRDQGSCGSCWAFGAVEAMTDRVCTYSNG 548
           +RDQ +CGSCWA  A E M+DR C +S G
Sbjct: 108 IRDQSNCGSCWAVSAAETMSDRACIHSKG 136


>UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheirus
           salmonis|Rep: Cysteine proteinase - Lepeophtheirus
           salmonis (salmon louse)
          Length = 372

 Score = 47.6 bits (108), Expect = 3e-04
 Identities = 35/98 (35%), Positives = 46/98 (46%), Gaps = 3/98 (3%)
 Frame = +3

Query: 267 QNSWKAGRN-FPRDTSFAHLKKIMGVIEDEHFATLPIKTH--KIDLIASLPENFDPRDKW 437
           + +W  G N F   T      K MG       A L  +T   K   I  LPE+ D R+K 
Sbjct: 66  KRTWDMGINEFSDLTDEEFESKYMGYSPMSSSAGLVTRTAAPKQGNIKDLPESVDWREKG 125

Query: 438 PDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT 551
                + +V++QGSCGSCW F AVE +   V   +N T
Sbjct: 126 ----VITDVKNQGSCGSCWVFSAVEQIESYVAIENNMT 159


>UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin C;
           n=1; Strongylocentrotus purpuratus|Rep: PREDICTED:
           similar to cathepsin C - Strongylocentrotus purpuratus
          Length = 482

 Score = 47.2 bits (107), Expect = 3e-04
 Identities = 34/110 (30%), Positives = 51/110 (46%), Gaps = 3/110 (2%)
 Frame = +3

Query: 225 HPLSDEFINTINLKQNSWKAGR--NFPRDTSFAHLKKIMGVIEDEHFATL-PIKTHKIDL 395
           H  +D+FI  IN  Q+SWKA     +   T     ++  G +    +  + P        
Sbjct: 186 HRRNDKFIEGINKHQDSWKATYYDRYVNLTLGDMRRRAGGKLWKRVWPDVSPTDERTKQA 245

Query: 396 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 545
            ++LPE FD RD       ++ VRDQG CGSC+AF +      R+   +N
Sbjct: 246 ASNLPEKFDWRDVG-GIDYVSPVRDQGICGSCYAFASTATQESRLRVMTN 294


>UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_217_11853_10927 - Giardia lamblia
           ATCC 50803
          Length = 308

 Score = 47.2 bits (107), Expect = 3e-04
 Identities = 24/81 (29%), Positives = 38/81 (46%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDXX 584
           +P++FD R+++P C T  EV D G C S WA+ AV+A + R C      +   +SA+   
Sbjct: 75  VPDHFDFREEYPQCIT--EVIDIGLCSSSWAYSAVDAFSHRRCLTGLDQEATRYSAQYIL 132

Query: 585 XXXXXXXXXXXXXXXXXAWEY 647
                            AW++
Sbjct: 133 SCSSTNGCFGFSTRESIAWDF 153


>UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin O
           precursor; n=2; Apocrita|Rep: PREDICTED: similar to
           Cathepsin O precursor - Apis mellifera
          Length = 374

 Score = 46.8 bits (106), Expect = 5e-04
 Identities = 24/52 (46%), Positives = 30/52 (57%)
 Frame = +3

Query: 402 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKH 557
           S+P  FD RDK    P    VR QGSCG+CWAF  +E + + +    NGT H
Sbjct: 154 SIPLRFDWRDKGVITP----VRSQGSCGACWAFSTIEVI-ESMFAIKNGTLH 200


>UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia
           intestinalis|Rep: GLP_41_8294_9919 - Giardia lamblia
           ATCC 50803
          Length = 541

 Score = 46.8 bits (106), Expect = 5e-04
 Identities = 23/50 (46%), Positives = 32/50 (64%)
 Frame = +3

Query: 402 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT 551
           +LP++FD RD       +  V DQG+CGSC+ FGAV+AM  R+   +N T
Sbjct: 240 TLPDDFDWRDV-NGVSYIPGVLDQGACGSCFTFGAVQAMNSRIMIATNRT 288


>UniRef50_P81494 Cluster: Cathepsin B; n=2; Phasianidae|Rep:
           Cathepsin B - Coturnix coturnix japonica (Japanese
           quail)
          Length = 48

 Score = 46.8 bits (106), Expect = 5e-04
 Identities = 16/25 (64%), Positives = 22/25 (88%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGS 479
           LP+ FD R +WP+CPT++E+RDQGS
Sbjct: 1   LPDTFDSRKQWPNCPTISEIRDQGS 25


>UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gambiae
           str. PEST|Rep: ENSANGP00000018713 - Anopheles gambiae
           str. PEST
          Length = 559

 Score = 46.4 bits (105), Expect = 6e-04
 Identities = 20/38 (52%), Positives = 25/38 (65%)
 Frame = +3

Query: 396 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 509
           +  LP +FD    W D   + EV++QGSCGSCWAF AV
Sbjct: 336 VGDLPRSFD----WRDHGAVTEVKNQGSCGSCWAFSAV 369


>UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia
           tropicalis|Rep: Blo t 1 allergen - Blomia tropicalis
           (Mite)
          Length = 333

 Score = 46.4 bits (105), Expect = 6e-04
 Identities = 22/40 (55%), Positives = 25/40 (62%)
 Frame = +3

Query: 387 IDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGA 506
           I+   SLP+NFD R K      L  +R QGSCGSCWAF A
Sbjct: 107 INTYGSLPQNFDWRQK----ARLTRIRQQGSCGSCWAFAA 142


>UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Rep:
           Cathepsin L - Stylonychia lemnae
          Length = 340

 Score = 46.0 bits (104), Expect = 8e-04
 Identities = 29/97 (29%), Positives = 46/97 (47%), Gaps = 2/97 (2%)
 Frame = +3

Query: 243 FINTINLKQN--SWKAGRNFPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLPEN 416
           FIN  N + +  S+  G N   D +    KK++G            + +    +  +PE+
Sbjct: 72  FINNHNSQNDGTSFTLGPNHLADYTHDEYKKMLGYKPRNKTGK---EVYSTPNLKDIPES 128

Query: 417 FDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 527
            D R+K      +N V+DQG CGSCWAF  + ++  R
Sbjct: 129 IDWREKG----AVNAVKDQGQCGSCWAFSTIASLESR 161


>UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35;
           Viridiplantae|Rep: Cysteine proteinase 15A precursor -
           Pisum sativum (Garden pea)
          Length = 363

 Score = 46.0 bits (104), Expect = 8e-04
 Identities = 24/53 (45%), Positives = 31/53 (58%), Gaps = 2/53 (3%)
 Frame = +3

Query: 366 LPIKTHKIDLI--ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           LP    K  ++   +LPE+FD R+K    P    V+DQGSCGSCWAF    A+
Sbjct: 117 LPAHAQKAPILPTTNLPEDFDWREKGAVTP----VKDQGSCGSCWAFSTTGAL 165


>UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2;
           Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio
           molitor (Yellow mealworm)
          Length = 336

 Score = 45.6 bits (103), Expect = 0.001
 Identities = 26/64 (40%), Positives = 39/64 (60%), Gaps = 3/64 (4%)
 Frame = +3

Query: 348 DEHFATLPIKTHK-IDLIASL--PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           D H   +PIKT + + L AS+  P +FD    W D   ++ V++QGSCGSCWAF +  A+
Sbjct: 99  DLHKNGIPIKTREDLGLNASVRYPASFD----WRDQGMVSPVKNQGSCGSCWAFSSTGAI 154

Query: 519 TDRV 530
             ++
Sbjct: 155 ESQM 158


>UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep:
           Vivapain-4 - Plasmodium vivax
          Length = 484

 Score = 45.6 bits (103), Expect = 0.001
 Identities = 32/108 (29%), Positives = 52/108 (48%), Gaps = 8/108 (7%)
 Frame = +3

Query: 246 INTINLKQNS-WKAGRNFPRDTSFAHLKKIMGVIE---DEHFATLPIKTHKIDLIASL-P 410
           IN+ N K N  +K G N   D SF   +K M  +     +  A  P  ++  D++    P
Sbjct: 197 INSHNSKANILYKKGTNQYSDISFEEFRKTMLTLRFDLKKKLANSPYVSNYDDVLKKYKP 256

Query: 411 ENF---DPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 545
            +    + +  W +   ++E+++Q  CGSCWAFGAV A+  +     N
Sbjct: 257 ADAVVDNEKYDWREHNAVSEIKNQNLCGSCWAFGAVGAVESQYAIRKN 304


>UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcoptes
           scabiei type hominis|Rep: Sar s 1 allergen Yv9053H09 -
           Sarcoptes scabiei type hominis
          Length = 253

 Score = 45.6 bits (103), Expect = 0.001
 Identities = 25/62 (40%), Positives = 34/62 (54%), Gaps = 4/62 (6%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV----EAMTDRVCTYSNGTKHFHFSA 572
           LPE FD RD       L+++R+QG CG+CWAF A+     A   R     N T+  HFS 
Sbjct: 37  LPEKFDLRD----LGYLSKIRNQGRCGACWAFAALASVESAYNRRTRIVHNRTRKHHFSE 92

Query: 573 ED 578
           ++
Sbjct: 93  QE 94


>UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like
           cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin B-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 288

 Score = 45.6 bits (103), Expect = 0.001
 Identities = 29/96 (30%), Positives = 47/96 (48%), Gaps = 2/96 (2%)
 Frame = +3

Query: 264 KQNSWKAGRNFP-RDTSFAHLKKIMGVIEDEHFATLPI-KTHKIDLIASLPENFDPRDKW 437
           K   W AG N   +  +F     I G        T+P+ +  KI++  S+P +++  +++
Sbjct: 21  KDLPWVAGENERFKGMTFKDASVISGNAHKLRPDTIPLARPPKINI--SIPMSYNFTERF 78

Query: 438 PDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 545
           P C     V DQG CGSCW+F   ++ + R C   N
Sbjct: 79  PQCDF--GVLDQGKCGSCWSFAVSKSFSHRYCRKYN 112


>UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11;
           Trichomonadidae|Rep: Cysteine protease - Tritrichomonas
           foetus (Trichomonas foetus)
          Length = 315

 Score = 45.2 bits (102), Expect = 0.001
 Identities = 23/60 (38%), Positives = 33/60 (55%)
 Frame = +3

Query: 372 IKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT 551
           +K  K+      P N D  D W +   +NE++DQ +CGSCWAF A++A  +     S GT
Sbjct: 87  MKAEKVSRGMKKP-NVDSID-WREKGVVNEIKDQAACGSCWAFSAIQA-AESAYAISTGT 143


>UniRef50_O02470 Cluster: Cysteine proteinase; n=2; Chromadorea|Rep:
           Cysteine proteinase - Globodera pallida
          Length = 53

 Score = 45.2 bits (102), Expect = 0.001
 Identities = 18/36 (50%), Positives = 21/36 (58%)
 Frame = +3

Query: 471 QGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 578
           QG CG CWAF   E ++DR C  SNGT+    S  D
Sbjct: 1   QGQCGRCWAFSTAEVISDRTCIASNGTQQPIISPTD 36


>UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16;
           Bromeliaceae|Rep: Fruit bromelain precursor - Ananas
           comosus (Pineapple)
          Length = 351

 Score = 45.2 bits (102), Expect = 0.001
 Identities = 28/90 (31%), Positives = 46/90 (51%), Gaps = 2/90 (2%)
 Frame = +3

Query: 246 INTINLK-QNSWKAGRN-FPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLPENF 419
           I T N + +NS+  G N F   T    + +  GV    +    P+ +     I+++P++ 
Sbjct: 68  IETFNSRNENSYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVSFDDVNISAVPQSI 127

Query: 420 DPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 509
           D    W D   +NEV++Q  CGSCW+F A+
Sbjct: 128 D----WRDYGAVNEVKNQNPCGSCWSFAAI 153


>UniRef50_UPI0000E815AE Cluster: PREDICTED: similar to
           glucocorticoid-inducible protein; n=1; Gallus
           gallus|Rep: PREDICTED: similar to
           glucocorticoid-inducible protein - Gallus gallus
          Length = 307

 Score = 44.8 bits (101), Expect = 0.002
 Identities = 19/48 (39%), Positives = 26/48 (54%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 548
           LP +FD   KWP    ++E  DQG+C   WAF      +DR+  +S G
Sbjct: 153 LPRHFDAATKWPGM--IHEPLDQGNCAGSWAFSTAAVASDRISIHSMG 198


>UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p -
           Drosophila melanogaster (Fruit fly)
          Length = 431

 Score = 44.8 bits (101), Expect = 0.002
 Identities = 20/58 (34%), Positives = 30/58 (51%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 578
           LP +F+  DKW     ++EV DQG CG+ W        +DR    S G ++   SA++
Sbjct: 187 LPSSFNALDKWSSY--ISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQN 242


>UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-like
           precursor; n=26; Euteleostomi|Rep: Tubulointerstitial
           nephritis antigen-like precursor - Homo sapiens (Human)
          Length = 467

 Score = 44.8 bits (101), Expect = 0.002
 Identities = 32/115 (27%), Positives = 53/115 (46%), Gaps = 6/115 (5%)
 Frame = +3

Query: 222 PHPLSDEFINTINLKQNSWKAGRN--FPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDL 395
           P  +  + I  IN     W+AG +  F   T    ++  +G I     ++  +  H+I  
Sbjct: 139 PCLVDPDMIKAINQGNYGWQAGNHSAFWGMTLDEGIRYRLGTIRP---SSSVMNMHEIYT 195

Query: 396 IAS----LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 548
           + +    LP  F+  +KWP+   ++E  DQG+C   WAF      +DRV  +S G
Sbjct: 196 VLNPGEVLPTAFEASEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRVSIHSLG 248


>UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18;
           Magnoliophyta|Rep: Thiol protease aleurain precursor -
           Arabidopsis thaliana (Mouse-ear cress)
          Length = 358

 Score = 44.8 bits (101), Expect = 0.002
 Identities = 33/95 (34%), Positives = 48/95 (50%), Gaps = 3/95 (3%)
 Frame = +3

Query: 240 EFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLPENF 419
           + I + N K  S+K G N   D ++   ++          ATL   +HK+   A+LPE  
Sbjct: 88  DLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNCSATLK-GSHKVTE-AALPETK 145

Query: 420 DPRDKWPDCPTLNEVRDQGSCGSCWAF---GAVEA 515
           D    W +   ++ V+DQG CGSCW F   GA+EA
Sbjct: 146 D----WREDGIVSPVKDQGGCGSCWTFSTTGALEA 176


>UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor;
           n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase
           precursor - Diabrotica virgifera virgifera (western corn
           rootworm)
          Length = 326

 Score = 44.4 bits (100), Expect = 0.002
 Identities = 20/44 (45%), Positives = 26/44 (59%)
 Frame = +3

Query: 369 PIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 500
           P   H +  +  LP  FD R+K      + EV+DQGSCGSCW+F
Sbjct: 98  PRVIHSLTPVKDLPSKFDWREKG----AVTEVKDQGSCGSCWSF 137


>UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan
           CA, family C1, cathepsin L-like cysteine peptidase -
           Trichomonas vaginalis G3
          Length = 306

 Score = 44.4 bits (100), Expect = 0.002
 Identities = 23/72 (31%), Positives = 41/72 (56%), Gaps = 2/72 (2%)
 Frame = +3

Query: 321 LKKIMGVIEDEHFATLPIKT-HKIDLIASLPENFDPRD-KWPDCPTLNEVRDQGSCGSCW 494
           L +   + E+E+ + L  K  HK   I    +N  P +  W +   +N++++QG+CGSCW
Sbjct: 54  LNRFAHLTENEYRSMLGYKYGHKSYPITKNIKNDVPTEIDWREQGIVNKIKNQGACGSCW 113

Query: 495 AFGAVEAMTDRV 530
           AF A++ +  +V
Sbjct: 114 AFSAIQVIESQV 125


>UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=15;
           Magnoliophyta|Rep: Cysteine proteinase RD19a precursor -
           Arabidopsis thaliana (Mouse-ear cress)
          Length = 368

 Score = 44.4 bits (100), Expect = 0.002
 Identities = 22/53 (41%), Positives = 32/53 (60%), Gaps = 2/53 (3%)
 Frame = +3

Query: 366 LPIKTHKIDLIAS--LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           LP   +K  ++ +  LPE+FD    W D   +  V++QGSCGSCW+F A  A+
Sbjct: 120 LPKDANKAPILPTENLPEDFD----WRDHGAVTPVKNQGSCGSCWSFSATGAL 168


>UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p;
           n=1; Nasonia vitripennis|Rep: PREDICTED: similar to
           GM06507p - Nasonia vitripennis
          Length = 483

 Score = 44.0 bits (99), Expect = 0.003
 Identities = 20/57 (35%), Positives = 29/57 (50%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 575
           LP  FD R +W +   +  V+DQG CG+ WA   V+  +DR    S G +    S +
Sbjct: 236 LPREFDSRIQWGN--DITPVQDQGWCGASWAISTVDVASDRFAIMSKGIEKVQLSGQ 290


>UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase;
           n=21; Bilateria|Rep: Cathepsin L-like cysteine
           proteinase - Globodera pallida
          Length = 379

 Score = 44.0 bits (99), Expect = 0.003
 Identities = 32/92 (34%), Positives = 47/92 (51%), Gaps = 7/92 (7%)
 Frame = +3

Query: 273 SWKAGRNFPRDTSFAHLKKIMG---VIEDEHFATLPIKTHKIDLIASLPENFDPRDK-WP 440
           +++ G N   D  F+  KK+ G   ++ D            ++ +  LPE+ D RDK W 
Sbjct: 115 TFRVGENHIADLPFSEYKKLNGYRRLLGDNLRRNASTFLAPMN-VGDLPESVDWRDKGW- 172

Query: 441 DCPTLNEVRDQGSCGSCWAF---GAVEAMTDR 527
               + EV++QG CGSCWAF   GA+EA   R
Sbjct: 173 ----VTEVKNQGMCGSCWAFSSTGALEAQHAR 200


>UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Rep:
           Cysteine protease - Babesia equi
          Length = 438

 Score = 44.0 bits (99), Expect = 0.003
 Identities = 27/70 (38%), Positives = 35/70 (50%), Gaps = 3/70 (4%)
 Frame = +3

Query: 309 SFAHLKKIMGVIEDEHFATLPIKTHKIDLIASL--PENFDPRD-KWPDCPTLNEVRDQGS 479
           S   LKK + V   E F T P    K+ +   L   ++ D  D  W     +  V+DQG+
Sbjct: 186 SVEELKKSLEVSASEEF-TSPEHLDKVRIAKGLGVEDSVDGEDLDWRKLNGVTPVKDQGN 244

Query: 480 CGSCWAFGAV 509
           CGSCWAF AV
Sbjct: 245 CGSCWAFAAV 254


>UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, whole
           genome shotgun sequence; n=7; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_22,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 350

 Score = 44.0 bits (99), Expect = 0.003
 Identities = 32/90 (35%), Positives = 42/90 (46%), Gaps = 6/90 (6%)
 Frame = +3

Query: 249 NTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFAT--LPIKTHKIDLIASLPE--- 413
           N  ++K  + K GR    +T F  L        DE FA   L +K +  DL     +   
Sbjct: 88  NLADIKARNQKLGREIFGETQFTDLT-------DEEFAATYLTLKVNPDDLEVPKAQFEN 140

Query: 414 -NFDPRDKWPDCPTLNEVRDQGSCGSCWAF 500
            N  P D W     +N+V+DQG CGSCWAF
Sbjct: 141 VNATPID-WRTRGAVNKVKDQGQCGSCWAF 169


>UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin O;
           n=1; Danio rerio|Rep: PREDICTED: similar to cathepsin O
           - Danio rerio
          Length = 327

 Score = 43.6 bits (98), Expect = 0.004
 Identities = 18/35 (51%), Positives = 21/35 (60%)
 Frame = +3

Query: 414 NFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           N  PR  W D   +  V +QGSCG CWAF  VEA+
Sbjct: 119 NNPPRFDWRDHGVVGPVHNQGSCGGCWAFSIVEAI 153


>UniRef50_Q9GZM7-2 Cluster: Isoform 2 of Q9GZM7 ; n=1; Homo
           sapiens|Rep: Isoform 2 of Q9GZM7 - Homo sapiens (Human)
          Length = 283

 Score = 43.6 bits (98), Expect = 0.004
 Identities = 19/48 (39%), Positives = 27/48 (56%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 548
           LP  F+  +KWP+   ++E  DQG+C   WAF      +DRV  +S G
Sbjct: 69  LPTAFEASEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRVSIHSLG 114


>UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1;
           Bigelowiella natans|Rep: Digestive cysteine proteinase -
           Bigelowiella natans (Pedinomonas minutissima)
           (Chlorarachnion sp.(strain CCMP 621))
          Length = 360

 Score = 43.6 bits (98), Expect = 0.004
 Identities = 16/28 (57%), Positives = 19/28 (67%)
 Frame = +3

Query: 435 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           W D   L  V+DQG CGSCWAF A +A+
Sbjct: 115 WRDFNALTPVKDQGGCGSCWAFSATQAL 142


>UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase
           precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine
           proteinase precursor - Heterodera glycines (Soybean cyst
           nematode worm)
          Length = 353

 Score = 43.6 bits (98), Expect = 0.004
 Identities = 20/40 (50%), Positives = 26/40 (65%)
 Frame = +3

Query: 399 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           ++LPE  D R+K      + EV+DQG CGSCWAF A  A+
Sbjct: 133 STLPEKLDWREKG----AVTEVKDQGDCGSCWAFSATGAI 168


>UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_23,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 321

 Score = 43.6 bits (98), Expect = 0.004
 Identities = 27/104 (25%), Positives = 48/104 (46%), Gaps = 6/104 (5%)
 Frame = +3

Query: 225 HPLSDEFINTINL-KQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIA 401
           +P  +E I   ++ +QN  K      ++ S+       G + D+ F T+ +       + 
Sbjct: 49  YPTQNEQIYRFSIYQQNIMKIEDFNSQNNSYKQKINKFGDLTDQEFLTIYLNLQMPARVK 108

Query: 402 SLPENFDP-----RDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           ++ +N +P        W     +  ++DQG CGSCWAF AV A+
Sbjct: 109 NIQKNEEPFLVQEEVDWVQKGKVPAIKDQGDCGSCWAFSAVGAL 152


>UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 336

 Score = 43.2 bits (97), Expect = 0.006
 Identities = 21/41 (51%), Positives = 27/41 (65%), Gaps = 3/41 (7%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF---GAVEAM 518
           LP +FD    W D   L++V+DQG CGSCWAF   G +EA+
Sbjct: 125 LPASFD----WRDYGILSDVKDQGQCGSCWAFSTTGILEAL 161


>UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia
           theta|Rep: Cathepsin H precursor - Guillardia theta
           (Cryptomonas phi)
          Length = 353

 Score = 43.2 bits (97), Expect = 0.006
 Identities = 24/93 (25%), Positives = 45/93 (48%), Gaps = 2/93 (2%)
 Frame = +3

Query: 246 INTINLKQNS-WKAGRNFPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLPENFD 422
           +  IN +  + W+A  N   D ++   K    + E    AT+     K+  +  + + FD
Sbjct: 64  VEAINSRPGTTWRAALNQYSDLTWEEFKHAKLMAEQNCGATVTTPVEKLVKMGIVADEFD 123

Query: 423 PRDKW-PDCPTLNEVRDQGSCGSCWAFGAVEAM 518
            R++   +   ++ V++QG+CGSCW F    A+
Sbjct: 124 WRNQTCGETSCVSMVKNQGTCGSCWTFSTAAAL 156


>UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 234

 Score = 43.2 bits (97), Expect = 0.006
 Identities = 18/42 (42%), Positives = 26/42 (61%)
 Frame = +3

Query: 393 LIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           ++  +P+  D R K      +NE++DQ  CGSCWAFG+  AM
Sbjct: 14  IVGDIPDEIDYRTKG----AVNEIKDQKHCGSCWAFGSCAAM 51


>UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag-RP
           - Bombyx mori (Silk moth)
          Length = 404

 Score = 43.2 bits (97), Expect = 0.006
 Identities = 29/115 (25%), Positives = 50/115 (43%)
 Frame = +3

Query: 231 LSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLP 410
           +S++ +N +N +  +W+A   +P    F   K   G+I       L +           P
Sbjct: 131 MSEDLVNDVNQQGTTWRA-TTYPE---FNEKKLKDGLIYKLGTFPLNVTVISYSKDGQYP 186

Query: 411 ENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 575
           + FD R +W     ++ + DQ  CGS WA      + DR    S GT++   S++
Sbjct: 187 DEFDARREWYGY--ISPIADQDWCGSDWAVSIASIVGDRFSIQSFGTENVRMSSQ 239


>UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Oryza
           sativa|Rep: Cysteine protease 1 precursor - Oryza sativa
           subsp. japonica (Rice)
          Length = 490

 Score = 43.2 bits (97), Expect = 0.006
 Identities = 20/48 (41%), Positives = 31/48 (64%)
 Frame = +3

Query: 375 KTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           + ++ D + +LP++ D RDK      +  V++QG CGSCWAF AV A+
Sbjct: 145 EAYRHDGVEALPDSVDWRDKGA---VVAPVKNQGQCGSCWAFSAVAAV 189


>UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1;
           Oryza sativa (japonica cultivar-group)|Rep: Putative
           uncharacterized protein - Oryza sativa subsp. japonica
           (Rice)
          Length = 326

 Score = 42.7 bits (96), Expect = 0.007
 Identities = 17/41 (41%), Positives = 25/41 (60%)
 Frame = +3

Query: 396 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           +A++  +  P   W +   +  V+DQG CGSCWAF  VEA+
Sbjct: 110 LAAVAGDAPPAWDWREHGAVTRVKDQGPCGSCWAFSVVEAV 150


>UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 392

 Score = 42.7 bits (96), Expect = 0.007
 Identities = 30/96 (31%), Positives = 45/96 (46%), Gaps = 5/96 (5%)
 Frame = +3

Query: 243 FINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLPENFD 422
           +I ++N +   +K   N   D +    K   G ++DE    +      ID   S    F+
Sbjct: 118 YIRSMNRRSLPYKLEPNHFADLTDDEFKSYKGALDDESKDVMNDHDDVIDDDRS-KRMFE 176

Query: 423 PRDK--WPDCPTLNEVRDQGSCGSCWAF---GAVEA 515
             D+  W +   +N  + QG+CGSCWAF   GAVEA
Sbjct: 177 VPDQLDWRNYGAVNPAKGQGTCGSCWAFATAGAVEA 212


>UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh
           fly) (Boettcherisca peregrina). Cathepsin L; n=2;
           Dictyostelium discoideum|Rep: Similar to Sarcophaga
           peregrina (Flesh fly) (Boettcherisca peregrina).
           Cathepsin L - Dictyostelium discoideum (Slime mold)
          Length = 265

 Score = 42.3 bits (95), Expect = 0.010
 Identities = 18/45 (40%), Positives = 31/45 (68%)
 Frame = +3

Query: 384 KIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           K ++ A++P++FD    W D   + +V++QGSC SCW+F A+ A+
Sbjct: 40  KHNVNATIPKSFD----WRDHGAVGKVKNQGSCASCWSFSALGAL 80


>UniRef50_Q23RT7 Cluster: Papain family cysteine protease containing
           protein; n=7; Hymenostomatida|Rep: Papain family
           cysteine protease containing protein - Tetrahymena
           thermophila SB210
          Length = 387

 Score = 42.3 bits (95), Expect = 0.010
 Identities = 24/83 (28%), Positives = 40/83 (48%)
 Frame = +3

Query: 300 RDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGS 479
           R+T+  + K +      ++       + KI+ +  LP++ D    W D   +  V+DQG 
Sbjct: 99  RETTLGYSKTVKNAANKQNMFRNLKTSDKIN-VKDLPKSVD----WRDAGVVTPVKDQGH 153

Query: 480 CGSCWAFGAVEAMTDRVCTYSNG 548
           CGSCWAF A  A+ +     + G
Sbjct: 154 CGSCWAF-ATTAVIESYAAIATG 175


>UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor;
           n=176; Viridiplantae|Rep: Cysteine proteinase RD21a
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 462

 Score = 42.3 bits (95), Expect = 0.010
 Identities = 28/93 (30%), Positives = 45/93 (48%), Gaps = 1/93 (1%)
 Frame = +3

Query: 243 FINTINLKQNSWKAG-RNFPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLPENF 419
           F++  N K  S++ G   F   T+  +  K +G   ++         ++  +   LPE+ 
Sbjct: 82  FVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSLRYEARVGDELPESI 141

Query: 420 DPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           D R K      + EV+DQG CGSCWAF  + A+
Sbjct: 142 DWRKKG----AVAEVKDQGGCGSCWAFSTIGAV 170


>UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core
           eudicotyledons|Rep: Chymopapain precursor - Carica
           papaya (Papaya)
          Length = 352

 Score = 42.3 bits (95), Expect = 0.010
 Identities = 34/95 (35%), Positives = 47/95 (49%), Gaps = 6/95 (6%)
 Frame = +3

Query: 243 FINTINLKQNSWKAGRNFPRDTSFAHLKK--IMGVIED----EHFATLPIKTHKIDLIAS 404
           +I+  N K NS+  G N   D S    KK  +  V ED    EHF      T+K   + +
Sbjct: 78  YIDETNKKNNSYWLGLNGFADLSNDEFKKKYVGFVAEDFTGLEHFDNEDF-TYKH--VTN 134

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 509
            P++ D R K    P    V++QG+CGSCWAF  +
Sbjct: 135 YPQSIDWRAKGAVTP----VKNQGACGSCWAFSTI 165


>UniRef50_P25774 Cluster: Cathepsin S precursor; n=78;
           Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens
           (Human)
          Length = 331

 Score = 42.3 bits (95), Expect = 0.010
 Identities = 23/47 (48%), Positives = 31/47 (65%)
 Frame = +3

Query: 378 THKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           T+K +    LP++ D R+K   C T  EV+ QGSCG+CWAF AV A+
Sbjct: 106 TYKSNPNRILPDSVDWREK--GCVT--EVKYQGSCGACWAFSAVGAL 148


>UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2;
           Trypanosoma cruzi|Rep: Cysteine proteinase, putative -
           Trypanosoma cruzi
          Length = 392

 Score = 41.9 bits (94), Expect = 0.013
 Identities = 19/38 (50%), Positives = 23/38 (60%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           +P+  D R+  P    L  V+DQG CGSCWA GA E M
Sbjct: 141 IPDEVDYRNSSP--AILTAVKDQGRCGSCWAHGAAEEM 176


>UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2;
           Taeniidae|Rep: Cathepsin L-like cysteine proteinase -
           Taenia solium (Pork tapeworm)
          Length = 339

 Score = 41.5 bits (93), Expect = 0.017
 Identities = 19/40 (47%), Positives = 26/40 (65%)
 Frame = +3

Query: 399 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           A LP+  D RDK      + EV++QG+CGSCWAF +  A+
Sbjct: 122 AGLPDTVDWRDK----NLVTEVKNQGNCGSCWAFSSTGAL 157


>UniRef50_Q24E33 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 328

 Score = 41.5 bits (93), Expect = 0.017
 Identities = 24/82 (29%), Positives = 43/82 (52%), Gaps = 9/82 (10%)
 Frame = +3

Query: 300 RDTSFAHLKKIMGVIEDEHFATLPI---KTHKIDLIASLPENFD-----PRD-KWPDCPT 452
           ++ +F     IM ++ DE +++L +   +   ID+  SL ++ +     P +  W     
Sbjct: 79  KNNTFKLAINIMAILTDEEYSSLYLNLDQQESIDIFDSLVDDNETVGDIPSEVNWTAQGA 138

Query: 453 LNEVRDQGSCGSCWAFGAVEAM 518
           +  V++QGSCGSCWAF    A+
Sbjct: 139 VTPVKNQGSCGSCWAFSTTGAL 160


>UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_21,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 349

 Score = 41.5 bits (93), Expect = 0.017
 Identities = 18/42 (42%), Positives = 26/42 (61%)
 Frame = +3

Query: 453 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 578
           ++EV++QGSCGSCWAF AV A+         G K+   S ++
Sbjct: 137 VSEVKNQGSCGSCWAFSAVAAL--ETALRQGGVKNVELSEQE 176


>UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precursor;
           n=3; Plasmodium (Laverania)|Rep: Trophozoite cysteine
           proteinase precursor - Plasmodium falciparum
          Length = 569

 Score = 41.5 bits (93), Expect = 0.017
 Identities = 19/45 (42%), Positives = 29/45 (64%)
 Frame = +3

Query: 375 KTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 509
           K ++ D+ + +PE  D R+K      ++E +DQG CGSCWAF +V
Sbjct: 323 KRNEKDIFSKVPEILDYREKG----IVHEPKDQGLCGSCWAFASV 363


>UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46;
           Eukaryota|Rep: Cysteine proteinase 1 precursor - Zea
           mays (Maize)
          Length = 371

 Score = 41.5 bits (93), Expect = 0.017
 Identities = 18/38 (47%), Positives = 25/38 (65%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           LP++FD    W D   +  V++QGSCGSCW+F A  A+
Sbjct: 137 LPDDFD----WRDHGAVGPVKNQGSCGSCWSFSASGAL 170


>UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza
           sativa|Rep: Os09g0497500 protein - Oryza sativa subsp.
           japonica (Rice)
          Length = 349

 Score = 41.1 bits (92), Expect = 0.022
 Identities = 31/99 (31%), Positives = 47/99 (47%), Gaps = 6/99 (6%)
 Frame = +3

Query: 240 EFINTINLKQNSWKAGRN-FPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLP-E 413
           E + T N   N +K   N F   T+     K++G        T+P  ++      ++P E
Sbjct: 60  ELVETFNSMSNGYKLADNKFADLTNEEFRAKMLGF---RPHVTIPQISNTCSADIAMPGE 116

Query: 414 NFD---PRD-KWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           + D   P+   W     + EV++QG CGSCWAF AV A+
Sbjct: 117 SSDDILPKSVDWRKKGAVVEVKNQGDCGSCWAFSAVAAI 155


>UniRef50_Q22W19 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 332

 Score = 41.1 bits (92), Expect = 0.022
 Identities = 19/45 (42%), Positives = 23/45 (51%)
 Frame = +3

Query: 375 KTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 509
           K  K  LI SL  +  P   W     +  V++QG CGSCWAF  V
Sbjct: 109 KRQKSHLIYSLKGDVAPSIDWRQKNAVTPVKNQGQCGSCWAFSTV 153


>UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-like
           cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L or H-like cysteine
           peptidase - Trichomonas vaginalis G3
          Length = 435

 Score = 41.1 bits (92), Expect = 0.022
 Identities = 21/59 (35%), Positives = 32/59 (54%), Gaps = 1/59 (1%)
 Frame = +3

Query: 378 THKIDLIASLPENFDPRDKWPDCPTLNEV-RDQGSCGSCWAFGAVEAMTDRVCTYSNGT 551
           T  ID    LPE+F     W + P +  + RDQ +CGSCWA  A  +++ ++   +N T
Sbjct: 204 TKHIDFKGDLPESFS----WRNLPNVVAMPRDQANCGSCWAQAAATSISSQISMRTNKT 258


>UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor;
           n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 355

 Score = 41.1 bits (92), Expect = 0.022
 Identities = 30/93 (32%), Positives = 44/93 (47%), Gaps = 2/93 (2%)
 Frame = +3

Query: 246 INTINLKQNSWKAGRNFPRDTSFAHLK-KIMGVIEDEHFATL-PIKTHKIDLIASLPENF 419
           I+  N + NS+  G N   D +    K + +G+ + +      P    +   I  LP++ 
Sbjct: 82  IDQRNNEINSYWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDITDLPKSV 141

Query: 420 DPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           D R K    P    V+DQG CGSCWAF  V A+
Sbjct: 142 DWRKKGAVAP----VKDQGQCGSCWAFSTVAAV 170


>UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18;
           Plasmodium|Rep: Cysteine proteinase precursor -
           Plasmodium vivax (strain Salvador I)
          Length = 583

 Score = 41.1 bits (92), Expect = 0.022
 Identities = 19/40 (47%), Positives = 27/40 (67%)
 Frame = +3

Query: 390 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 509
           +L+A +PE  D R+K      ++E +DQG CGSCWAF +V
Sbjct: 334 NLLADVPEILDYREKG----IVHEPKDQGLCGSCWAFASV 369


>UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue of
           Sarcophaga 26,29kDa proteinase; n=1; Nasonia
           vitripennis|Rep: PREDICTED: similar to homologue of
           Sarcophaga 26,29kDa proteinase - Nasonia vitripennis
          Length = 553

 Score = 40.7 bits (91), Expect = 0.030
 Identities = 28/94 (29%), Positives = 42/94 (44%), Gaps = 2/94 (2%)
 Frame = +3

Query: 243 FINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFAT--LPIKTHKIDLIASLPEN 416
           FI++IN     +    N   D + A LK + G    +H     +P         A +P++
Sbjct: 278 FIHSINRANLGFTLDVNHLADRNEAELKVLRGKQYTQHGYNGGMPFPHDVEKEKADVPDS 337

Query: 417 FDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           FD    W     +  V+DQ  CGSCW+FG   A+
Sbjct: 338 FD----WRLYGAVTPVKDQSVCGSCWSFGTTGAV 367


>UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 328

 Score = 40.7 bits (91), Expect = 0.030
 Identities = 20/45 (44%), Positives = 27/45 (60%), Gaps = 1/45 (2%)
 Frame = +3

Query: 405 LPENFDPRDKWPD-CPTLNEVRDQGSCGSCWAFGAVEAMTDRVCT 536
           +P+ FD RD + D  P +  V+DQ  CG CWAF A  A+T+   T
Sbjct: 97  IPDYFDLRDIYVDGSPVVGPVKDQEQCGCCWAF-ATTAITEAANT 140


>UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           L-like proteinase precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 317

 Score = 40.7 bits (91), Expect = 0.030
 Identities = 19/39 (48%), Positives = 25/39 (64%)
 Frame = +3

Query: 402 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           ++PE+ D R+K      +N VRDQ  CGSCWAF A  A+
Sbjct: 103 TVPESIDWREKG----AVNPVRDQEQCGSCWAFSAAGAL 137


>UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase
           precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin
           L-like cysteine proteinase precursor - Acanthoscelides
           obtectus (Bean weevil)
          Length = 321

 Score = 40.7 bits (91), Expect = 0.030
 Identities = 19/47 (40%), Positives = 29/47 (61%)
 Frame = +3

Query: 390 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 530
           D +  +P+  D R+K      + EV+ QG+CGSCWAF AV ++  +V
Sbjct: 105 DNVNDIPKTVDWREKG----AVTEVKKQGNCGSCWAFSAVGSIEGQV 147


>UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep:
           Cathepsin L - Kudoa thyrsites
          Length = 300

 Score = 40.7 bits (91), Expect = 0.030
 Identities = 21/66 (31%), Positives = 33/66 (50%)
 Frame = +3

Query: 321 LKKIMGVIEDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 500
           LK  + V+        P +T   D+ ++LP + D    W     +  V++QG CGSCW+F
Sbjct: 74  LKPKLPVVSTPTHGITPKETATKDIKSTLPSSVD----WKALGKVTSVKNQGHCGSCWSF 129

Query: 501 GAVEAM 518
            A  A+
Sbjct: 130 SAAGAI 135


>UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_56,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 314

 Score = 40.7 bits (91), Expect = 0.030
 Identities = 23/61 (37%), Positives = 33/61 (54%), Gaps = 2/61 (3%)
 Frame = +3

Query: 342 IEDEHFATLPI--KTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEA 515
           + +E FA L +  K   ++L A L     P     D   +  V++QG+CGSCWAF AV A
Sbjct: 83  LTNEEFAALLLTRKESPMNLDAELYVPQGPLKASADWSKITSVKNQGNCGSCWAFSAVGA 142

Query: 516 M 518
           +
Sbjct: 143 V 143


>UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1;
           Methanospirillum hungatei JF-1|Rep: Peptidase C1A,
           papain precursor - Methanospirillum hungatei (strain
           JF-1 / DSM 864)
          Length = 1096

 Score = 40.7 bits (91), Expect = 0.030
 Identities = 27/81 (33%), Positives = 39/81 (48%)
 Frame = +3

Query: 273 SWKAGRNFPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPT 452
           SW A  N     S    + + G+  D   +T+ +    I  +  LP +FD R+   D  T
Sbjct: 278 SWTAAVNPIMLMSPEEREHLKGLRHDLKSSTI-VSGAGITPMEGLPTSFDWRNNGGDYTT 336

Query: 453 LNEVRDQGSCGSCWAFGAVEA 515
              +++QGSCGSCWAF    A
Sbjct: 337 --PIKNQGSCGSCWAFATTGA 355


>UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|Rep:
           Cathepsin L precursor - Schistosoma mansoni (Blood
           fluke)
          Length = 319

 Score = 40.7 bits (91), Expect = 0.030
 Identities = 17/35 (48%), Positives = 25/35 (71%)
 Frame = +3

Query: 396 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 500
           + ++P+NFD R+K      + EV++QG CGSCWAF
Sbjct: 102 VNNIPKNFDWREKG----AVTEVKNQGMCGSCWAF 132


>UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin l;
           n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
           similar to cathepsin l - Strongylocentrotus purpuratus
          Length = 489

 Score = 40.3 bits (90), Expect = 0.039
 Identities = 28/98 (28%), Positives = 44/98 (44%), Gaps = 1/98 (1%)
 Frame = +3

Query: 240 EFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFAT-LPIKTHKIDLIASLPEN 416
           E I++IN     +    N   D S   LK++ G +        LP     +   A +P++
Sbjct: 212 EMIHSINRANLGYVLDINHMADQSHQELKRMRGRLRQTRPNNGLPYDGSDVSDDA-VPDH 270

Query: 417 FDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 530
            D    W     ++ V+DQ  CGSCW+FG+ E +   V
Sbjct: 271 ID----WNVLGAVSPVKDQAVCGSCWSFGSAETIEGAV 304


>UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine
           protease; n=1; Strongylocentrotus purpuratus|Rep:
           PREDICTED: similar to cysteine protease -
           Strongylocentrotus purpuratus
          Length = 494

 Score = 40.3 bits (90), Expect = 0.039
 Identities = 19/51 (37%), Positives = 28/51 (54%), Gaps = 1/51 (1%)
 Frame = +3

Query: 369 PIKTHKIDLIASLPENFDPRD-KWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           P+K   I   A++P+   P +  W     +  V++QG CGSCWAF A+  M
Sbjct: 223 PLKKTGIKKQAAIPQGPVPEEYDWRTHGAVTPVKNQGMCGSCWAFSAIGNM 273


>UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 280

 Score = 40.3 bits (90), Expect = 0.039
 Identities = 16/34 (47%), Positives = 24/34 (70%)
 Frame = +3

Query: 399 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 500
           +SLP+ FD    W +   + +V++QG+CGSCWAF
Sbjct: 66  SSLPQQFD----WRNLGKVTQVKNQGNCGSCWAF 95


>UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3;
           Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays
           (Maize)
          Length = 493

 Score = 40.3 bits (90), Expect = 0.039
 Identities = 15/28 (53%), Positives = 19/28 (67%)
 Frame = +3

Query: 435 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           W +   + EV+DQG CG CWAF AV A+
Sbjct: 170 WRERGAVAEVKDQGQCGGCWAFSAVAAV 197


>UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1;
           Trypanosoma cruzi|Rep: Cysteine protease, putative -
           Trypanosoma cruzi
          Length = 434

 Score = 40.3 bits (90), Expect = 0.039
 Identities = 15/24 (62%), Positives = 18/24 (75%)
 Frame = +3

Query: 447 PTLNEVRDQGSCGSCWAFGAVEAM 518
           P L  V+DQGSCGSCWA  A E++
Sbjct: 137 PVLTPVKDQGSCGSCWAHAATESV 160


>UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3;
           Bilateria|Rep: Cathepsin L-like cysteine proteinase -
           Longidorus elongatus
          Length = 358

 Score = 40.3 bits (90), Expect = 0.039
 Identities = 18/44 (40%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
 Frame = +3

Query: 393 LIASLPENFDPRDK--WPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           +I  +P+N    D   W     + +V+DQGSCGSCWAF A  ++
Sbjct: 129 MIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCWAFSATGSL 172


>UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas
           foetus|Rep: TFCP2 protein - Tritrichomonas foetus
           (Trichomonas foetus)
          Length = 270

 Score = 40.3 bits (90), Expect = 0.039
 Identities = 17/36 (47%), Positives = 23/36 (63%)
 Frame = +3

Query: 408 PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEA 515
           P +FD    W     +N +++QGSCGSCWAF A+ A
Sbjct: 51  PTSFD----WRSEGKVNPIKNQGSCGSCWAFSAIAA 82


>UniRef50_Q23FR0 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 394

 Score = 40.3 bits (90), Expect = 0.039
 Identities = 15/22 (68%), Positives = 16/22 (72%)
 Frame = +3

Query: 453 LNEVRDQGSCGSCWAFGAVEAM 518
           LN V+DQG CGSCW FGA   M
Sbjct: 196 LNPVKDQGQCGSCWTFGAAGVM 217


>UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia
           bovis|Rep: Cysteine protease 2 - Babesia bovis
          Length = 445

 Score = 40.3 bits (90), Expect = 0.039
 Identities = 17/32 (53%), Positives = 20/32 (62%)
 Frame = +3

Query: 414 NFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 509
           NF+  D W     +  V+DQG CGSCWAF AV
Sbjct: 236 NFEDID-WRRADAVTPVKDQGMCGSCWAFAAV 266


>UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F
            like protease; n=1; Nasonia vitripennis|Rep: PREDICTED:
            similar to cathepsin F like protease - Nasonia
            vitripennis
          Length = 1036

 Score = 39.9 bits (89), Expect = 0.052
 Identities = 31/101 (30%), Positives = 47/101 (46%), Gaps = 5/101 (4%)
 Frame = +3

Query: 213  KDLPHPLSDEFINTIN-LKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFATLPIKTHKI 389
            K++   +  + +N I  L++N    GR     T F  L K     +  H    P    + 
Sbjct: 748  KEMRFQIFKDNLNLIEELQRNEMGTGRYGV--TQFTDLTK--AEFKARHLGLKPTLKSEN 803

Query: 390  DL---IASLPENFDPRD-KWPDCPTLNEVRDQGSCGSCWAF 500
            D+   +A++P+   P D  W     +  V+DQGSCGSCWAF
Sbjct: 804  DIPMPMATIPDIELPSDYDWRHHNVVTPVKDQGSCGSCWAF 844


>UniRef50_Q6XM39 Cluster: FirrV-1-A48 precursor; n=1; Feldmannia
           irregularis virus a|Rep: FirrV-1-A48 precursor -
           Feldmannia irregularis virus a
          Length = 373

 Score = 39.9 bits (89), Expect = 0.052
 Identities = 15/37 (40%), Positives = 21/37 (56%)
 Frame = +3

Query: 468 DQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 578
           DQGSC SCW+   V+ + DRV   +NG      S ++
Sbjct: 80  DQGSCASCWSISVVQMLADRVSVSTNGKIKLKLSVQE 116


>UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;
           Brugia malayi|Rep: Cahepsin L-like cysteine protease -
           Brugia malayi (Filarial nematode worm)
          Length = 371

 Score = 39.9 bits (89), Expect = 0.052
 Identities = 18/47 (38%), Positives = 27/47 (57%)
 Frame = +3

Query: 378 THKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           T ++ +   LP++ D    W     + +V+DQG CGSCW F AV A+
Sbjct: 134 TIRMKINGPLPKSID----WRTSGAVTKVKDQGYCGSCWTFSAVGAL 176


>UniRef50_O16454 Cluster: Temporarily assigned gene name protein
           196; n=4; Bilateria|Rep: Temporarily assigned gene name
           protein 196 - Caenorhabditis elegans
          Length = 477

 Score = 39.9 bits (89), Expect = 0.052
 Identities = 17/32 (53%), Positives = 24/32 (75%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 500
           LPE+FD R+K      + +V++QG+CGSCWAF
Sbjct: 264 LPESFDWREKG----AVTQVKNQGNCGSCWAF 291


>UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=17; Trichomonas vaginalis|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 318

 Score = 39.9 bits (89), Expect = 0.052
 Identities = 13/27 (48%), Positives = 18/27 (66%)
 Frame = +3

Query: 435 WPDCPTLNEVRDQGSCGSCWAFGAVEA 515
           W +   +N ++DQ  CGSCWAF  V+A
Sbjct: 106 WRNAKIVNPIKDQAQCGSCWAFSVVQA 132


>UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia
           medanensis|Rep: Sui m 1 allergen - Suidasia medanensis
          Length = 336

 Score = 39.9 bits (89), Expect = 0.052
 Identities = 18/43 (41%), Positives = 25/43 (58%)
 Frame = +3

Query: 372 IKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 500
           ++  + D+  +LP  FD R +W        VR+QG CGSCWAF
Sbjct: 104 VQVPESDISVALPAAFDWRQQWNTA-----VRNQGQCGSCWAF 141


>UniRef50_A0CYS7 Cluster: Chromosome undetermined scaffold_31, whole
           genome shotgun sequence; n=3; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_31,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 358

 Score = 39.9 bits (89), Expect = 0.052
 Identities = 17/48 (35%), Positives = 29/48 (60%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 548
           +PE+++ R+  P+C     +  QG+C S ++  AV A +DR+C   NG
Sbjct: 131 IPESYNFREAQPECA--QPIYFQGNCSSSYSIAAVSATSDRLCKSKNG 176


>UniRef50_Q2FLC7 Cluster: Periplasmic copper-binding precursor; n=1;
           Methanospirillum hungatei JF-1|Rep: Periplasmic
           copper-binding precursor - Methanospirillum hungatei
           (strain JF-1 / DSM 864)
          Length = 1092

 Score = 39.9 bits (89), Expect = 0.052
 Identities = 18/48 (37%), Positives = 26/48 (54%)
 Frame = +3

Query: 375 KTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           K   + ++A  P  FD RD       +  +RDQG  GSCW F AV+++
Sbjct: 77  KIRSLSILADYPSKFDLRDS----KRVPAIRDQGQSGSCWDFAAVKSL 120


>UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61;
           Leishmania|Rep: Cysteine proteinase 2 precursor -
           Leishmania pifanoi
          Length = 444

 Score = 39.9 bits (89), Expect = 0.052
 Identities = 18/38 (47%), Positives = 26/38 (68%)
 Frame = +3

Query: 396 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 509
           ++++P+  D R+K    P    V+DQG+CGSCWAF AV
Sbjct: 123 LSAVPDAVDWREKGAVTP----VKDQGACGSCWAFSAV 156


>UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean
           endopeptidase) (Cysteine proteinase)
           (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
           Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor
           (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase)
           (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
           Vignain-2] - Vigna mungo (Rice bean) (Black gram)
          Length = 362

 Score = 39.9 bits (89), Expect = 0.052
 Identities = 18/47 (38%), Positives = 27/47 (57%)
 Frame = +3

Query: 378 THKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           T   + + S+P + D R K      + +V+DQG CGSCWAF  + A+
Sbjct: 119 TFMYEKVGSVPASVDWRKKG----AVTDVKDQGQCGSCWAFSTIVAV 161


>UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Spodoptera
           litura multicapsid nucleopolyhedrovirus (SpltMNPV)
          Length = 337

 Score = 39.9 bits (89), Expect = 0.052
 Identities = 17/37 (45%), Positives = 23/37 (62%)
 Frame = +3

Query: 399 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 509
           A  PE+FD    W     + +V++QG CGSCWAF A+
Sbjct: 124 ARTPESFD----WRKLNKVTKVKEQGVCGSCWAFAAI 156


>UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24;
           Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa
           (Rice)
          Length = 339

 Score = 39.5 bits (88), Expect = 0.068
 Identities = 20/41 (48%), Positives = 23/41 (56%)
 Frame = +3

Query: 396 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           I +LP   D R K    P    ++DQG CG CWAF AV AM
Sbjct: 120 IDTLPATVDWRTKGAVTP----IKDQGQCGCCWAFSAVAAM 156


>UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain;
           n=16; Chrysomelidae|Rep: Digestive cysteine protease
           intestain - Leptinotarsa decemlineata (Colorado potato
           beetle)
          Length = 326

 Score = 39.5 bits (88), Expect = 0.068
 Identities = 18/48 (37%), Positives = 26/48 (54%), Gaps = 2/48 (4%)
 Frame = +3

Query: 408 PENFDPRDK--WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 545
           PE+ +  D   W +   + EV+DQ  CGSCWAF A  A+  +    +N
Sbjct: 105 PEDLEVPDSIDWTEKGAVLEVKDQNPCGSCWAFSATGALEGQNAILNN 152


>UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or
           H-like cysteine peptidase; n=1; Trichomonas vaginalis
           G3|Rep: Clan CA, family C1, cathepsin L, S or H-like
           cysteine peptidase - Trichomonas vaginalis G3
          Length = 473

 Score = 39.5 bits (88), Expect = 0.068
 Identities = 15/33 (45%), Positives = 22/33 (66%), Gaps = 1/33 (3%)
 Frame = +3

Query: 435 WPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRV 530
           W D P +  + RDQ +CGSCWAFG  E++  ++
Sbjct: 257 WRDVPNVVGKPRDQVACGSCWAFGTAESLESQL 289


>UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 291

 Score = 39.5 bits (88), Expect = 0.068
 Identities = 17/32 (53%), Positives = 21/32 (65%), Gaps = 1/32 (3%)
 Frame = +3

Query: 453 LNEVRDQGSCGSCWAFGAVEAM-TDRVCTYSN 545
           +N +RDQ  CGSCWAFG V A  ++    YSN
Sbjct: 90  VNPIRDQKQCGSCWAFGTVAACESNYALLYSN 121


>UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27;
           Eukaryota|Rep: Cysteine proteinase GP-I - Zingiber
           officinale (Ginger)
          Length = 221

 Score = 39.5 bits (88), Expect = 0.068
 Identities = 18/38 (47%), Positives = 25/38 (65%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           LP++ D R+K    P    V++QG CGSCWAF A+ A+
Sbjct: 3   LPDSIDWREKGAVVP----VKNQGGCGSCWAFDAIAAV 36


>UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor;
           n=35; Fasciola|Rep: Cathepsin L-like proteinase
           precursor - Fasciola hepatica (Liver fluke)
          Length = 326

 Score = 39.5 bits (88), Expect = 0.068
 Identities = 14/28 (50%), Positives = 18/28 (64%)
 Frame = +3

Query: 435 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           W +   + EV+DQG+CGSCWAF     M
Sbjct: 114 WRESGYVTEVKDQGNCGSCWAFSTTGTM 141


>UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin W,
           partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED:
           similar to Cathepsin W, partial - Ornithorhynchus
           anatinus
          Length = 229

 Score = 39.1 bits (87), Expect = 0.090
 Identities = 18/40 (45%), Positives = 25/40 (62%), Gaps = 2/40 (5%)
 Frame = +3

Query: 396 IASLPENFDPRDK--WPDCPTLNEVRDQGSCGSCWAFGAV 509
           +AS+PE    ++   W     +  V++QGSCGSCWAF AV
Sbjct: 59  MASIPEGPLRKETCDWRKRGAITSVKNQGSCGSCWAFAAV 98


>UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;
           n=1; Strongylocentrotus purpuratus|Rep: PREDICTED:
           hypothetical protein - Strongylocentrotus purpuratus
          Length = 331

 Score = 39.1 bits (87), Expect = 0.090
 Identities = 17/45 (37%), Positives = 27/45 (60%)
 Frame = +3

Query: 396 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 530
           + ++P  +D R   P  P +  V++Q SCG+CWAF  VE M  ++
Sbjct: 124 LKTMPLVYDLRSIKP--PVVTPVKNQKSCGACWAFSVVETMETQI 166


>UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba
           culbertsoni|Rep: Cysteine proteinase - Acanthamoeba
           culbertsoni
          Length = 482

 Score = 39.1 bits (87), Expect = 0.090
 Identities = 22/43 (51%), Positives = 27/43 (62%), Gaps = 3/43 (6%)
 Frame = +3

Query: 399 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF---GAVEAM 518
           AS+P N+D R K    P    V++QGSC SCWAF   GAVE +
Sbjct: 154 ASIPANWDWRTKGAVTP----VKNQGSCASCWAFVATGAVEGV 192


>UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_163_69918_68548 - Giardia lamblia
           ATCC 50803
          Length = 456

 Score = 39.1 bits (87), Expect = 0.090
 Identities = 17/44 (38%), Positives = 25/44 (56%)
 Frame = +3

Query: 378 THKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 509
           T  +  +  +P ++D R+     P    V+DQG CGSCWAFG +
Sbjct: 68  TDPLSTLPEIPTSYDLREAGLQVP----VKDQGVCGSCWAFGTM 107


>UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1;
           Brugia malayi|Rep: Cathepsin F-like cysteine proteinase
           - Brugia malayi (Filarial nematode worm)
          Length = 461

 Score = 39.1 bits (87), Expect = 0.090
 Identities = 18/35 (51%), Positives = 21/35 (60%)
 Frame = +3

Query: 396 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 500
           I +LP  FD    W     +  V+DQGSCGSCWAF
Sbjct: 245 IYNLPSKFD----WRTEGVVTPVKDQGSCGSCWAF 275


>UniRef50_Q24FN2 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 367

 Score = 39.1 bits (87), Expect = 0.090
 Identities = 17/39 (43%), Positives = 24/39 (61%)
 Frame = +3

Query: 435 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT 551
           W     ++ V++QGSCGSCWAF AV A+ + V    N +
Sbjct: 161 WRQSGAVSPVKNQGSCGSCWAFSAV-ALAESVNLLRNNS 198


>UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 513

 Score = 39.1 bits (87), Expect = 0.090
 Identities = 25/94 (26%), Positives = 40/94 (42%), Gaps = 2/94 (2%)
 Frame = +3

Query: 243 FINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEH--FATLPIKTHKIDLIASLPEN 416
           FI + N +   +    N   D + A + ++ G++ +E       P      D    LP +
Sbjct: 240 FIKSRNRQHLGYSLKPNHMADMTDAEVNRMKGLLHEEPPLIGDSPFSIPDKDRGVPLPPH 299

Query: 417 FDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
            D    W     +N V+ QG CGSC+AF    A+
Sbjct: 300 VD----WRKAGAVNSVKSQGICGSCYAFAVAGAL 329


>UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 462

 Score = 39.1 bits (87), Expect = 0.090
 Identities = 15/29 (51%), Positives = 21/29 (72%)
 Frame = +3

Query: 462 VRDQGSCGSCWAFGAVEAMTDRVCTYSNG 548
           VRDQ +CGSCWA  A EA++ ++  +S G
Sbjct: 242 VRDQANCGSCWAQSAGEAISSQISLHSKG 270


>UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis
           zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa
           zeasingle nucleocapsid nuclear polyhedrosis virus)
          Length = 367

 Score = 39.1 bits (87), Expect = 0.090
 Identities = 16/35 (45%), Positives = 22/35 (62%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 509
           LP+ +D    W D   +  ++DQG CGSCWAF A+
Sbjct: 156 LPDYYD----WRDTNKVTPIKDQGVCGSCWAFVAI 186


>UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823
           protein, partial; n=1; Ornithorhynchus anatinus|Rep:
           PREDICTED: similar to MGC81823 protein, partial -
           Ornithorhynchus anatinus
          Length = 361

 Score = 38.7 bits (86), Expect = 0.12
 Identities = 14/24 (58%), Positives = 17/24 (70%)
 Frame = +3

Query: 435 WPDCPTLNEVRDQGSCGSCWAFGA 506
           W D   +  V+DQG CGSCWAFG+
Sbjct: 196 WRDHGYVTPVKDQGRCGSCWAFGS 219


>UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep:
           Cathepsin K - Danio rerio (Zebrafish) (Brachydanio
           rerio)
          Length = 333

 Score = 38.7 bits (86), Expect = 0.12
 Identities = 18/43 (41%), Positives = 26/43 (60%)
 Frame = +3

Query: 390 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           D +  LP++ D R        +  V++QGSCGSCWAF +V A+
Sbjct: 113 DRVGKLPKSIDYRK----LGYVTSVKNQGSCGSCWAFSSVGAL 151


>UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel
           protein - Danio rerio (Zebrafish) (Brachydanio rerio)
          Length = 328

 Score = 38.7 bits (86), Expect = 0.12
 Identities = 24/83 (28%), Positives = 44/83 (53%)
 Frame = +3

Query: 270 NSWKAGRNFPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCP 449
           +S+  G N   D +   +  + G++E++ F  +   T     + +LP+    R  W +  
Sbjct: 70  HSYTLGLNQLSDMTADEVNDMNGLLEED-FPDVNA-TFSPPSLQTLPQ----RVNWTEHG 123

Query: 450 TLNEVRDQGSCGSCWAFGAVEAM 518
            ++ V++QG CGSCWAF AV ++
Sbjct: 124 MVSPVQNQGPCGSCWAFSAVGSL 146


>UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n=1;
           Myxobolus cerebralis|Rep: Cathepsin Z-like cysteine
           proteinase - Myxobolus cerebralis
          Length = 297

 Score = 38.7 bits (86), Expect = 0.12
 Identities = 20/59 (33%), Positives = 32/59 (54%), Gaps = 3/59 (5%)
 Frame = +3

Query: 402 SLPENFDPRDKWPDCPTLNEVRDQGS---CGSCWAFGAVEAMTDRVCTYSNGTKHFHFS 569
           ++P++FD    W +   L+ V++Q     CGSCWAF +   + DR+    N +   HFS
Sbjct: 49  NMPKSFD----WRENAYLSSVKNQHLPTYCGSCWAFASTSTIADRIYIAKNLSHFDHFS 103


>UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           L-like proteinase" precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 315

 Score = 38.7 bits (86), Expect = 0.12
 Identities = 15/37 (40%), Positives = 21/37 (56%)
 Frame = +3

Query: 435 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 545
           W D   L  V+DQG CGSCWAF    ++  ++  + N
Sbjct: 117 WRDSAVLG-VKDQGQCGSCWAFSTTGSLEGQLAIHKN 152


>UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 395

 Score = 38.7 bits (86), Expect = 0.12
 Identities = 21/51 (41%), Positives = 26/51 (50%), Gaps = 3/51 (5%)
 Frame = +3

Query: 435 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKH---FHFSAED 578
           W D  T   VRDQG C SCW FG++ A+  R     NG       H SA++
Sbjct: 194 WSDYQT--PVRDQGECKSCWVFGSLAALESRY-LIKNGVSEKSTLHLSAQN 241


>UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathepsin
           o - Aedes aegypti (Yellowfever mosquito)
          Length = 375

 Score = 38.7 bits (86), Expect = 0.12
 Identities = 19/45 (42%), Positives = 26/45 (57%)
 Frame = +3

Query: 387 IDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMT 521
           + ++  LP+  D RDK    P    VR QGSCG+CWA   V+ +T
Sbjct: 147 LKILDYLPKVVDWRDKGVVAP----VRSQGSCGACWAISVVDTIT 187


>UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14;
           Leishmania|Rep: Cysteine proteinase 1 precursor -
           Leishmania pifanoi
          Length = 354

 Score = 38.7 bits (86), Expect = 0.12
 Identities = 20/48 (41%), Positives = 26/48 (54%), Gaps = 2/48 (4%)
 Frame = +3

Query: 372 IKTHKIDLIA--SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 509
           +K HK D+    S P      D W D   +  V++QG CGSCWAF A+
Sbjct: 113 LKDHKEDVHVDDSAPSGVMSVD-WRDKGAVTPVKNQGLCGSCWAFSAI 159


>UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163
           precursor; n=4; Schizophora|Rep: Putative cysteine
           proteinase CG12163 precursor - Drosophila melanogaster
           (Fruit fly)
          Length = 614

 Score = 38.7 bits (86), Expect = 0.12
 Identities = 17/32 (53%), Positives = 22/32 (68%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 500
           LP+ FD R K      + +V++QGSCGSCWAF
Sbjct: 394 LPKEFDWRQK----DAVTQVKNQGSCGSCWAF 421


>UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;
           n=17; Magnoliophyta|Rep: Thiol protease aleurain-like
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 358

 Score = 38.7 bits (86), Expect = 0.12
 Identities = 30/95 (31%), Positives = 47/95 (49%), Gaps = 3/95 (3%)
 Frame = +3

Query: 240 EFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLPENF 419
           + I + N K  S+K   N   D ++   ++          ATL   +HKI   A++P+  
Sbjct: 88  DLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAAQNCSATLK-GSHKITE-ATVPDTK 145

Query: 420 DPRDKWPDCPTLNEVRDQGSCGSCWAF---GAVEA 515
           D    W +   ++ V++QG CGSCW F   GA+EA
Sbjct: 146 D----WREDGIVSPVKEQGHCGSCWTFSTTGALEA 176


>UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 344

 Score = 38.3 bits (85), Expect = 0.16
 Identities = 19/55 (34%), Positives = 27/55 (49%)
 Frame = +3

Query: 411 ENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 575
           +N  P D W +   +  V+ QG CGSCW F A  A+ +      NG    +FS +
Sbjct: 134 KNAPPMD-WRNASAITPVKQQGKCGSCWTF-ASTAVLESFSFIKNGAPLTNFSEQ 186


>UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome
           shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
           Chromosome 20 SCAF14744, whole genome shotgun sequence -
           Tetraodon nigroviridis (Green puffer)
          Length = 175

 Score = 38.3 bits (85), Expect = 0.16
 Identities = 18/41 (43%), Positives = 23/41 (56%)
 Frame = +3

Query: 396 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           I  LP  FD    W D   +  V++Q +CGSCWAF  V A+
Sbjct: 56  IKGLPARFD----WRDNAVVGPVQNQQACGSCWAFSVVGAV 92


>UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10;
           Liliopsida|Rep: Putative cysteine proteinase - Oryza
           sativa subsp. japonica (Rice)
          Length = 416

 Score = 38.3 bits (85), Expect = 0.16
 Identities = 30/97 (30%), Positives = 45/97 (46%), Gaps = 5/97 (5%)
 Frame = +3

Query: 243 FINTINLKQN--SWKAGRNFPRDTSFAHLK-KIMGV-IEDEHFATLPIKTHKIDLIASLP 410
           +I+  N K    S+  G N   D ++     K  GV ++   FAT    +   +L   +P
Sbjct: 55  YIHEFNQKSKGMSYVLGLNKFSDLTYEEFAAKYTGVKVDASAFATATTSSPDEELPVGVP 114

Query: 411 E-NFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
              +D    W     + +V+DQG CGSCW F AV A+
Sbjct: 115 PATWD----WRLNGAVTDVKDQGQCGSCWVFSAVGAV 147


>UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4;
           core eudicotyledons|Rep: Papain-like cysteine peptidase
           XBCP3 - Arabidopsis thaliana (Mouse-ear cress)
          Length = 437

 Score = 38.3 bits (85), Expect = 0.16
 Identities = 14/28 (50%), Positives = 18/28 (64%)
 Frame = +3

Query: 435 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           W     +  V+DQGSCG+CW+F A  AM
Sbjct: 124 WRKKGAVTNVKDQGSCGACWSFSATGAM 151


>UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza
           sativa|Rep: Os01g0347600 protein - Oryza sativa subsp.
           japonica (Rice)
          Length = 343

 Score = 38.3 bits (85), Expect = 0.16
 Identities = 14/19 (73%), Positives = 17/19 (89%)
 Frame = +3

Query: 462 VRDQGSCGSCWAFGAVEAM 518
           V+DQG+CGSCWAF AV A+
Sbjct: 140 VKDQGACGSCWAFAAVAAI 158


>UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1;
           Oryza sativa (japonica cultivar-group)|Rep: Putative
           uncharacterized protein - Oryza sativa subsp. japonica
           (Rice)
          Length = 289

 Score = 38.3 bits (85), Expect = 0.16
 Identities = 14/19 (73%), Positives = 17/19 (89%)
 Frame = +3

Query: 462 VRDQGSCGSCWAFGAVEAM 518
           V+DQG+CGSCWAF AV A+
Sbjct: 139 VKDQGACGSCWAFAAVAAI 157


>UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain;
           n=9; Cucujiformia|Rep: Digestive cysteine proteinase
           intestain - Leptinotarsa decemlineata (Colorado potato
           beetle)
          Length = 326

 Score = 38.3 bits (85), Expect = 0.16
 Identities = 21/69 (30%), Positives = 32/69 (46%), Gaps = 2/69 (2%)
 Frame = +3

Query: 345 EDEHFATLPIKTHKIDLIASLPENFDPRDK--WPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           +DE    +  K +    +A  PE  +  D   W     + +V+ QG CGSCWAF A  A+
Sbjct: 84  KDELRRQIKTKPNVEATLAVFPEGLEVPDSIDWTQKGAVLDVKYQGGCGSCWAFSATGAL 143

Query: 519 TDRVCTYSN 545
             +    +N
Sbjct: 144 EGQNAIVNN 152


>UniRef50_Q23TW3 Cluster: Papain family cysteine protease containing
           protein; n=5; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 437

 Score = 38.3 bits (85), Expect = 0.16
 Identities = 25/65 (38%), Positives = 33/65 (50%), Gaps = 1/65 (1%)
 Frame = +3

Query: 384 KIDLIASLPENFDPRDKWPDCPTLNEVRDQGS-CGSCWAFGAVEAMTDRVCTYSNGTKHF 560
           K DL + LP+  D R+K      + +V+ QG  CGSCWAF AV A+         G K  
Sbjct: 199 KYDL-SQLPQYVDWREKG----VVTQVKSQGKDCGSCWAFAAVAALESHY-ALKTGKKPI 252

Query: 561 HFSAE 575
            FS +
Sbjct: 253 QFSEQ 257


>UniRef50_A2E6N1 Cluster: Clan CA, family C1, cathepsin B-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin B-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 255

 Score = 38.3 bits (85), Expect = 0.16
 Identities = 17/63 (26%), Positives = 34/63 (53%)
 Frame = +3

Query: 357 FATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCT 536
           F    I++   D+   +P+ ++   ++P C  L  +  +  CG C+A+G ++AM+ R+C 
Sbjct: 15  FVDESIRSFPEDISIDIPDEYNFLQEYPHCD-LGPLTQE--CGCCYAYGPIKAMSHRICK 71

Query: 537 YSN 545
             N
Sbjct: 72  AKN 74


>UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_46,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 336

 Score = 38.3 bits (85), Expect = 0.16
 Identities = 19/43 (44%), Positives = 23/43 (53%)
 Frame = +3

Query: 387 IDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEA 515
           ID +    EN D      D   + +V+DQG C  CWAFGAV A
Sbjct: 130 IDELQKTQEN-DKTINSVDWRKITQVKDQGQCSGCWAFGAVGA 171


>UniRef50_P43234 Cluster: Cathepsin O precursor; n=22;
           Euteleostomi|Rep: Cathepsin O precursor - Homo sapiens
           (Human)
          Length = 321

 Score = 38.3 bits (85), Expect = 0.16
 Identities = 19/39 (48%), Positives = 23/39 (58%)
 Frame = +3

Query: 402 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           SLP  FD RDK      + +VR+Q  CG CWAF  V A+
Sbjct: 107 SLPLRFDWRDK----QVVTQVRNQQMCGGCWAFSVVGAV 141


>UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6;
           Schistosoma|Rep: Cathepsin C precursor - Schistosoma
           mansoni (Blood fluke)
          Length = 454

 Score = 38.3 bits (85), Expect = 0.16
 Identities = 32/114 (28%), Positives = 51/114 (44%), Gaps = 9/114 (7%)
 Frame = +3

Query: 231 LSDEFINTINLKQNSWKAGRNFPRDTSFA--HLKKIMGVIED--EHFATLPIKTHKIDLI 398
           ++  F+  IN  Q SW+ G  +P  + +    L+   G ++      + L  KT   +LI
Sbjct: 154 INPSFVGKINAHQKSWR-GEIYPELSKYTIDELRNRAGGVKSMVTRPSVLNRKTPSKELI 212

Query: 399 ASLPENFDPRDKWPDCPT-----LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 545
            SL  N      W   P      +  +R+QG CGSC+A  +  A+  R+   SN
Sbjct: 213 -SLTGNLPLEFDWTSPPDGSRSPVTPIRNQGICGSCYASPSAAALEARIRLVSN 265


>UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1;
           Rhipicephalus appendiculatus|Rep: Midgut cysteine
           proteinase 2 - Rhipicephalus appendiculatus (Brown ear
           tick)
          Length = 564

 Score = 37.9 bits (84), Expect = 0.21
 Identities = 24/91 (26%), Positives = 40/91 (43%), Gaps = 2/91 (2%)
 Frame = +3

Query: 243 FINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFATL--PIKTHKIDLIASLPEN 416
           FI++ N     +    N   D +   +  + G ++ +  ++   P   H+    A LP+ 
Sbjct: 291 FIDSKNRANLGYNLAVNHLADRTREEISVLRGRLQSKDGSSRAEPFPRHRFT--AKLPDQ 348

Query: 417 FDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 509
            D    W     +  V+DQ  CGSCW+FG V
Sbjct: 349 ID----WRPYGAVTPVKDQAVCGSCWSFGTV 375


>UniRef50_Q7QVE9 Cluster: GLP_542_3431_1206; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_542_3431_1206 - Giardia lamblia ATCC
           50803
          Length = 741

 Score = 37.9 bits (84), Expect = 0.21
 Identities = 27/71 (38%), Positives = 36/71 (50%), Gaps = 1/71 (1%)
 Frame = +3

Query: 345 EDEHFATLPIKTHKIDLI-ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMT 521
           EDE +  LP      DL  A+LP NF  R          ++ +QGSCG C+A  AVE +T
Sbjct: 40  EDE-YNELPDGPDNADLTRAALPTNFTYRGH-----RCIQIINQGSCGCCYAAAAVEMVT 93

Query: 522 DRVCTYSNGTK 554
            R C   N ++
Sbjct: 94  ARRCLQLNDSR 104


>UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine
           protease; n=11; Callosobruchus maculatus|Rep: Putative
           gut cathepsin L-like cysteine protease - Callosobruchus
           maculatus (Southern cowpea weevil) (Pulse bruchid)
          Length = 326

 Score = 37.9 bits (84), Expect = 0.21
 Identities = 17/39 (43%), Positives = 23/39 (58%)
 Frame = +3

Query: 435 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT 551
           W +   +  V+DQ +CGSCWAF AV A+  +     NGT
Sbjct: 118 WREEGAVTPVKDQANCGSCWAFSAVGAIEGQFFK-KNGT 155


>UniRef50_Q22DX2 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 358

 Score = 37.9 bits (84), Expect = 0.21
 Identities = 20/41 (48%), Positives = 25/41 (60%), Gaps = 3/41 (7%)
 Frame = +3

Query: 402 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF---GAVEA 515
           S+P ++D R   P    L  V +QG CGSCWAF   GAVE+
Sbjct: 146 SIPSSWDIRTDGPGL--LQPVENQGQCGSCWAFSTSGAVES 184


>UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1
           precursor; n=20; Psoroptidia|Rep: Major mite fecal
           allergen Der f 1 precursor - Dermatophagoides farinae
           (House-dust mite)
          Length = 321

 Score = 37.9 bits (84), Expect = 0.21
 Identities = 15/32 (46%), Positives = 17/32 (53%)
 Frame = +3

Query: 450 TLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 545
           T+  +R QG CGSCWAF  V A       Y N
Sbjct: 120 TVTPIRMQGGCGSCWAFSGVAATESAYLAYRN 151


>UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin K
           precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2);
           n=2; Tribolium castaneum|Rep: PREDICTED: similar to
           Cathepsin K precursor (Cathepsin O) (Cathepsin X)
           (Cathepsin O2) - Tribolium castaneum
          Length = 332

 Score = 37.5 bits (83), Expect = 0.28
 Identities = 22/82 (26%), Positives = 36/82 (43%)
 Frame = +3

Query: 273 SWKAGRNFPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPT 452
           +++ G N   D +   L  + G+     F   P+   +  L+ SL         W     
Sbjct: 71  TYEMGVNKFSDFTDEELSNLTGLQVPLEFEQ-PLNETEDPLLPSLGRGISASLDWRQRGG 129

Query: 453 LNEVRDQGSCGSCWAFGAVEAM 518
           +  V++QG CGSCWAF  + A+
Sbjct: 130 VTPVKNQGQCGSCWAFATIGAI 151


>UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease
           containing protein; n=2; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 332

 Score = 37.5 bits (83), Expect = 0.28
 Identities = 17/38 (44%), Positives = 24/38 (63%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           LPE+ D    W     ++ VRDQG+CGSC+AF +  A+
Sbjct: 127 LPESVD----WRKLGAVSPVRDQGNCGSCYAFASTGAL 160


>UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin
           L - Misgurnus mizolepis (Mud loach)
          Length = 337

 Score = 37.5 bits (83), Expect = 0.28
 Identities = 14/28 (50%), Positives = 17/28 (60%)
 Frame = +3

Query: 435 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           W +   +  V+DQG CGSCWAF    AM
Sbjct: 122 WREKGYVTPVKDQGECGSCWAFSTTGAM 149


>UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2;
           Roseiflexus|Rep: Peptidase C1A, papain precursor -
           Roseiflexus sp. RS-1
          Length = 1202

 Score = 37.5 bits (83), Expect = 0.28
 Identities = 17/35 (48%), Positives = 20/35 (57%), Gaps = 3/35 (8%)
 Frame = +3

Query: 435 WPDCPTLNEVRDQGSCGSCWAF---GAVEAMTDRV 530
           W D      V+DQG CGSCWAF   G VE+   R+
Sbjct: 175 WCDQGACTPVKDQGVCGSCWAFATTGVVESALKRI 209


>UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3;
           Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber
           officinale (Ginger)
          Length = 475

 Score = 37.5 bits (83), Expect = 0.28
 Identities = 17/38 (44%), Positives = 25/38 (65%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           LP++ D R+K      +  V++QG CGSCWAF A+ A+
Sbjct: 143 LPDSIDWREKG----AVVAVKNQGRCGSCWAFAAIAAV 176


>UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC06231 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 372

 Score = 37.5 bits (83), Expect = 0.28
 Identities = 25/82 (30%), Positives = 36/82 (43%)
 Frame = +3

Query: 273 SWKAGRNFPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPT 452
           ++K G N   D +   L+K+ G       A     T      A LP+  D    W     
Sbjct: 106 TYKMGVNNFTDKTEYELRKLRGYRSACRIAKPKGSTFISSEHAKLPDRVD----WRRNGA 161

Query: 453 LNEVRDQGSCGSCWAFGAVEAM 518
           +  V++QG CGSCWAF +  A+
Sbjct: 162 VTPVKNQGQCGSCWAFSSTGAI 183


>UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra
           erinaceieuropaei|Rep: Cysteine proteinase - Spirometra
           erinaceieuropaei (Tapeworm)
          Length = 336

 Score = 37.5 bits (83), Expect = 0.28
 Identities = 14/34 (41%), Positives = 19/34 (55%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGA 506
           L EN      W +   +  V++QG CGSCW+F A
Sbjct: 117 LKENLPDSVNWRERGAVTSVKNQGQCGSCWSFSA 150


>UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 452

 Score = 37.5 bits (83), Expect = 0.28
 Identities = 21/57 (36%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
 Frame = +3

Query: 378 THKIDLIASLPENFDPRDKWPDCPTLNEV-RDQGSCGSCWAFGAVEAMTDRVCTYSN 545
           T+   +I +LPE+F     W + P + E   DQ  CG+C+AFGA EA+  +    +N
Sbjct: 216 TYDQKVIQNLPESFS----WRNVPYVLEYPHDQAVCGTCFAFGASEAINGQFSLRAN 268


>UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101,
           whole genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_101,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 306

 Score = 37.5 bits (83), Expect = 0.28
 Identities = 25/64 (39%), Positives = 37/64 (57%)
 Frame = +3

Query: 387 IDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHF 566
           ID I +LPE+ D   K      +N V++QG+CGS W+F AV A  +    +  GT HF +
Sbjct: 105 IDSI-NLPESVDWSSK------MNPVKNQGTCGSGWSFSAVGAF-EAFFIFVKGT-HFQY 155

Query: 567 SAED 578
           S ++
Sbjct: 156 SEQN 159


>UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16;
           Plasmodium (Vinckeia)|Rep: Cysteine proteinase precursor
           - Plasmodium vinckei
          Length = 506

 Score = 37.5 bits (83), Expect = 0.28
 Identities = 26/82 (31%), Positives = 43/82 (52%), Gaps = 9/82 (10%)
 Frame = +3

Query: 291 NFPRDTSFAHLKKIMGVIED-EHFATLPIKTH--KIDLIA------SLPENFDPRDKWPD 443
           +F ++    + KK++ V  D +    +P+K H    +LI+        P++ D R K+  
Sbjct: 216 DFSKEEFDNYFKKLLSVPMDLKSKYIVPLKKHLANTNLISVDNKSKDFPDSRDYRSKFNF 275

Query: 444 CPTLNEVRDQGSCGSCWAFGAV 509
            P     +DQG+CGSCWAF A+
Sbjct: 276 LPP----KDQGNCGSCWAFAAI 293


>UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:
           Viral cathepsin - Xestia c-nigrum granulosis virus
           (XnGV) (Xestia c-nigrum granulovirus)
          Length = 346

 Score = 37.5 bits (83), Expect = 0.28
 Identities = 17/40 (42%), Positives = 23/40 (57%)
 Frame = +3

Query: 390 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 509
           D    +P++FD    W D  ++  V+ Q  CGSCWAF AV
Sbjct: 128 DSSGKVPDSFD----WRDRNSVTSVKMQKECGSCWAFSAV 163


>UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium
           tetraurelia|Rep: Cathepsin L1 precursor - Paramecium
           tetraurelia
          Length = 314

 Score = 37.5 bits (83), Expect = 0.28
 Identities = 14/19 (73%), Positives = 17/19 (89%)
 Frame = +3

Query: 462 VRDQGSCGSCWAFGAVEAM 518
           V++QGSCGSCWAF AV A+
Sbjct: 126 VKNQGSCGSCWAFSAVGAL 144


>UniRef50_A6GBK8 Cluster: Peptidase C1A, papain; n=1; Plesiocystis
           pacifica SIR-1|Rep: Peptidase C1A, papain - Plesiocystis
           pacifica SIR-1
          Length = 650

 Score = 37.1 bits (82), Expect = 0.36
 Identities = 13/22 (59%), Positives = 17/22 (77%)
 Frame = +3

Query: 453 LNEVRDQGSCGSCWAFGAVEAM 518
           L  +R+QG+CGSCWAF AV  +
Sbjct: 176 LGAIRNQGACGSCWAFAAVSTI 197


>UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza
           sativa|Rep: Cysteine proteinase-like - Oryza sativa
           subsp. japonica (Rice)
          Length = 360

 Score = 37.1 bits (82), Expect = 0.36
 Identities = 15/27 (55%), Positives = 18/27 (66%)
 Frame = +3

Query: 435 WPDCPTLNEVRDQGSCGSCWAFGAVEA 515
           W     + EV++Q SCGSCWAF AV A
Sbjct: 143 WRARGAVTEVKNQRSCGSCWAFAAVAA 169


>UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein A;
           n=2; Dictyostelium discoideum|Rep: Gamete and
           mating-type specific protein A - Dictyostelium
           discoideum (Slime mold)
          Length = 448

 Score = 37.1 bits (82), Expect = 0.36
 Identities = 13/22 (59%), Positives = 16/22 (72%)
 Frame = +3

Query: 462 VRDQGSCGSCWAFGAVEAMTDR 527
           +RDQG CGSCWAF +  A+  R
Sbjct: 253 IRDQGQCGSCWAFASSAALESR 274


>UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_26_47548_45815 - Giardia lamblia
           ATCC 50803
          Length = 577

 Score = 37.1 bits (82), Expect = 0.36
 Identities = 16/42 (38%), Positives = 23/42 (54%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 530
           LP+  D    W     +N  +DQ +CGSCW FGA+  +  R+
Sbjct: 344 LPQELD----WRVRGIMNMAKDQVACGSCWTFGAIGTIEGRI 381


>UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep:
           Toxopain-2 - Toxoplasma gondii
          Length = 422

 Score = 37.1 bits (82), Expect = 0.36
 Identities = 30/101 (29%), Positives = 47/101 (46%), Gaps = 4/101 (3%)
 Frame = +3

Query: 243 FINTINLKQNSWKAGRNFPRDTSFAHLK-KIMGVIEDEHFAT--LPIKTHKIDLIAS-LP 410
           +I+T N +  S+    N   D S    + K +G  +  +  +  L + T  ++++ S LP
Sbjct: 147 YIHTHNQQGYSYSLKMNHFGDLSRDEFRRKYLGFKKSRNLKSHHLGVATELLNVLPSELP 206

Query: 411 ENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 533
              D R +   C T   V+DQ  CGSCWAF    A+    C
Sbjct: 207 AGVDWRSR--GCVT--PVKDQRDCGSCWAFSTTGALEGAHC 243


>UniRef50_Q231X3 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 323

 Score = 37.1 bits (82), Expect = 0.36
 Identities = 12/28 (42%), Positives = 19/28 (67%)
 Frame = +3

Query: 435 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           W +   ++ V+ QG+CGSCWAF A  ++
Sbjct: 121 WVEAGKVSNVKSQGNCGSCWAFSATASV 148


>UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2;
           Platyhelminthes|Rep: Cathepsin L-like proteinase -
           Echinococcus multilocularis
          Length = 338

 Score = 37.1 bits (82), Expect = 0.36
 Identities = 17/38 (44%), Positives = 23/38 (60%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           +P++ D R K    P    ++DQG CGSCWAF A  A+
Sbjct: 122 VPDSIDWRKKGLVTP----IKDQGDCGSCWAFSATGAL 155


>UniRef50_A5KBM2 Cluster: Serine-repeat antigen; n=3;
           Plasmodium|Rep: Serine-repeat antigen - Plasmodium vivax
          Length = 1014

 Score = 37.1 bits (82), Expect = 0.36
 Identities = 20/56 (35%), Positives = 28/56 (50%), Gaps = 3/56 (5%)
 Frame = +3

Query: 414 NFDPRDKWPD---CPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSA 572
           N++  D+W D   C +  EV +QG+CG CW F +   +    C    G  HF  SA
Sbjct: 555 NYEYCDRWKDKTSCISNIEVEEQGNCGLCWVFASKLHLETIRC--MRGYGHFRSSA 608


>UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 493

 Score = 37.1 bits (82), Expect = 0.36
 Identities = 19/53 (35%), Positives = 27/53 (50%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFH 563
           LP  F  R+   +   + + RDQ +CGSCWAFG  E +      +   +K FH
Sbjct: 266 LPRTFSWRN---NTQVVGKPRDQVACGSCWAFGTAEVLEG---AFGIASKEFH 312


>UniRef50_A0D5R4 Cluster: Chromosome undetermined scaffold_39, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_39,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 133

 Score = 37.1 bits (82), Expect = 0.36
 Identities = 18/39 (46%), Positives = 24/39 (61%)
 Frame = +3

Query: 402 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           SLP++ D +D          V++QGSCGSCWAF A  A+
Sbjct: 92  SLPDSVDSKDGLT-------VKNQGSCGSCWAFAAAAAL 123


>UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin
           L-like protease; n=1; Nasonia vitripennis|Rep:
           PREDICTED: similar to cathepsin L-like protease -
           Nasonia vitripennis
          Length = 353

 Score = 36.7 bits (81), Expect = 0.48
 Identities = 17/37 (45%), Positives = 20/37 (54%), Gaps = 1/37 (2%)
 Frame = +3

Query: 411 ENFDPRDKWPDCPTLNEVRDQG-SCGSCWAFGAVEAM 518
           EN      W     +  VRDQG +CGSCWAF A  A+
Sbjct: 130 ENVPEHVDWRQRGAVTPVRDQGLTCGSCWAFSAAGAL 166


>UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep:
           MGC107932 protein - Xenopus tropicalis (Western clawed
           frog) (Silurana tropicalis)
          Length = 333

 Score = 36.7 bits (81), Expect = 0.48
 Identities = 19/57 (33%), Positives = 28/57 (49%), Gaps = 2/57 (3%)
 Frame = +3

Query: 369 PIKTHKIDLIA-SLPENFDPRDKWPDCPTLNEVRDQGS-CGSCWAFGAVEAMTDRVC 533
           P+K       + ++P+  D    W     +  V++QG+ CGSCWAF  V  M  R C
Sbjct: 102 PVKAESYSYTSITIPKEVD----WRKSNCVTPVKNQGTFCGSCWAFATVGVMESRYC 154


>UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease;
           n=23; Magnoliophyta|Rep: Senescence-specific cysteine
           protease - Arabidopsis thaliana (Mouse-ear cress)
          Length = 346

 Score = 36.7 bits (81), Expect = 0.48
 Identities = 18/39 (46%), Positives = 24/39 (61%)
 Frame = +3

Query: 402 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           +LP + D R K    P    +++QGSCG CWAF AV A+
Sbjct: 129 ALPVSVDWRKKGAVTP----IKNQGSCGCCWAFSAVAAI 163


>UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa
           (japonica cultivar-group)|Rep: Os09g0562700 protein -
           Oryza sativa subsp. japonica (Rice)
          Length = 235

 Score = 36.7 bits (81), Expect = 0.48
 Identities = 13/19 (68%), Positives = 15/19 (78%)
 Frame = +3

Query: 453 LNEVRDQGSCGSCWAFGAV 509
           + EV+DQG CGSCWAF  V
Sbjct: 21  VTEVKDQGRCGSCWAFSTV 39


>UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4;
           Hymenostomatida|Rep: Phospholipase A1 - Tetrahymena
           thermophila
          Length = 320

 Score = 36.7 bits (81), Expect = 0.48
 Identities = 19/69 (27%), Positives = 33/69 (47%)
 Frame = +3

Query: 351 EHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 530
           + F TL  K +  ++  +  E  +    W     +  V++QGSCGSCWAF  + A+   +
Sbjct: 92  QQFLTLHEKVNSTEVYRAQGEATEV--DWTAKGKVTPVKNQGSCGSCWAFSTIGAVESAL 149

Query: 531 CTYSNGTKH 557
                G ++
Sbjct: 150 WIAGQGEQN 158


>UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia
           intestinalis|Rep: GLP_90_15278_13989 - Giardia lamblia
           ATCC 50803
          Length = 429

 Score = 36.7 bits (81), Expect = 0.48
 Identities = 22/49 (44%), Positives = 27/49 (55%)
 Frame = +3

Query: 369 PIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEA 515
           PIK    D   +LP++ D R+     P    VR+QG CGSCWAF  V A
Sbjct: 51  PIKVAAED---NLPQSVDLREYGLMTP----VRNQGKCGSCWAFATVAA 92


>UniRef50_P25779 Cluster: Cruzipain precursor; n=54;
           Trypanosoma|Rep: Cruzipain precursor - Trypanosoma cruzi
          Length = 467

 Score = 36.7 bits (81), Expect = 0.48
 Identities = 13/25 (52%), Positives = 16/25 (64%)
 Frame = +3

Query: 435 WPDCPTLNEVRDQGSCGSCWAFGAV 509
           W     +  V+DQG CGSCWAF A+
Sbjct: 129 WRARGAVTAVKDQGQCGSCWAFSAI 153


>UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15)
           [Contains: Cathepsin L heavy chain; Cathepsin L light
           chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC
           3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain] - Sarcophaga peregrina (Flesh fly)
           (Boettcherisca peregrina)
          Length = 339

 Score = 36.7 bits (81), Expect = 0.48
 Identities = 13/28 (46%), Positives = 18/28 (64%)
 Frame = +3

Query: 435 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           W +   +  V+DQG CGSCWAF +  A+
Sbjct: 128 WREHGAVTGVKDQGHCGSCWAFSSTGAL 155


>UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42;
           Coelomata|Rep: Cathepsin L2 precursor - Homo sapiens
           (Human)
          Length = 334

 Score = 36.7 bits (81), Expect = 0.48
 Identities = 23/72 (31%), Positives = 35/72 (48%)
 Frame = +3

Query: 303 DTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSC 482
           D +    +++MG   ++ F     K  +  L   LP++ D R K    P    V++Q  C
Sbjct: 82  DMTNEEFRQMMGCFRNQKFRKG--KVFREPLFLDLPKSVDWRKKGYVTP----VKNQKQC 135

Query: 483 GSCWAFGAVEAM 518
           GSCWAF A  A+
Sbjct: 136 GSCWAFSATGAL 147


>UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19;
           Bilateria|Rep: Cathepsin F precursor - Homo sapiens
           (Human)
          Length = 484

 Score = 36.7 bits (81), Expect = 0.48
 Identities = 19/60 (31%), Positives = 30/60 (50%), Gaps = 7/60 (11%)
 Frame = +3

Query: 342 IEDEHFATLPIKT-------HKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 500
           + +E F T+ + T       +K+    S+ +   P   W     + +V+DQG CGSCWAF
Sbjct: 239 LTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAF 298


>UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 360

 Score = 36.3 bits (80), Expect = 0.64
 Identities = 17/39 (43%), Positives = 23/39 (58%)
 Frame = +3

Query: 402 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           +LP +FD RDK    P    V+ Q  CG CWAF  V+++
Sbjct: 130 NLPASFDWRDKGAITP----VKVQNGCGGCWAFSTVQSI 164


>UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis
           thaliana|Rep: F9P14.12 protein - Arabidopsis thaliana
           (Mouse-ear cress)
          Length = 343

 Score = 36.3 bits (80), Expect = 0.64
 Identities = 13/28 (46%), Positives = 17/28 (60%)
 Frame = +3

Query: 435 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           W     +  +R+QG CG CWAF AV A+
Sbjct: 133 WRTQGAVTPIRNQGKCGGCWAFSAVAAI 160


>UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcoptes
           scabiei type hominis|Rep: Cathepsin L-like protease -
           Sarcoptes scabiei type hominis
          Length = 245

 Score = 36.3 bits (80), Expect = 0.64
 Identities = 16/41 (39%), Positives = 23/41 (56%)
 Frame = +3

Query: 396 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           ++ LP+  D    W     +  ++DQ  CGSCWAF AV +M
Sbjct: 117 VSDLPDEVD----WTLKNVVAPIKDQKQCGSCWAFSAVASM 153


>UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba
           histolytica|Rep: Cysteine protease 17 - Entamoeba
           histolytica
          Length = 420

 Score = 36.3 bits (80), Expect = 0.64
 Identities = 18/48 (37%), Positives = 25/48 (52%)
 Frame = +3

Query: 384 KIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 527
           K D++  LPE  D R        L  +R+Q  CG CW+F +V A+  R
Sbjct: 160 KKDIVKELPEGIDFRK----FGKLTYIREQTGCGGCWSFASVCALESR 203


>UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1;
           Rhipicephalus appendiculatus|Rep: Midgut cysteine
           proteinase 4 - Rhipicephalus appendiculatus (Brown ear
           tick)
          Length = 345

 Score = 36.3 bits (80), Expect = 0.64
 Identities = 13/33 (39%), Positives = 21/33 (63%)
 Frame = +3

Query: 432 KWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 530
           +W +   +  V++QG CGSCWAF +  A+  +V
Sbjct: 131 EWRENGFVTPVKNQGQCGSCWAFSSTGALEGQV 163


>UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine
           proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like
           midgut cysteine proteinase - Tenebrio molitor (Yellow
           mealworm)
          Length = 330

 Score = 36.3 bits (80), Expect = 0.64
 Identities = 15/23 (65%), Positives = 19/23 (82%), Gaps = 3/23 (13%)
 Frame = +3

Query: 453 LNEVRDQGSCGSCWAF---GAVE 512
           ++EV+DQG CGSCW+F   GAVE
Sbjct: 128 VSEVKDQGQCGSCWSFSTTGAVE 150


>UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsin L
           - Suberites domuncula (Sponge)
          Length = 324

 Score = 36.3 bits (80), Expect = 0.64
 Identities = 12/28 (42%), Positives = 19/28 (67%)
 Frame = +3

Query: 435 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           W     ++EV++QG CGSCW+F A  ++
Sbjct: 114 WRQKGVVSEVKNQGQCGSCWSFSATGSL 141


>UniRef50_Q23VA1 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 397

 Score = 36.3 bits (80), Expect = 0.64
 Identities = 26/98 (26%), Positives = 43/98 (43%)
 Frame = +3

Query: 258 NLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHFATLPIKTHKIDLIASLPENFDPRDKW 437
           N K NS     N  ++ S +       +  ++   T P     ++ +  +P++ D    W
Sbjct: 134 NNKNNSTNTNNNNNKNNSTSSSNSTNTINNNK---TNPNPNPPVNQLKVVPQSVD----W 186

Query: 438 PDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT 551
                ++ V+DQG CG CWAF A  A+ + V    N T
Sbjct: 187 RIQGKVSPVKDQGRCGCCWAFSAT-ALAESVNLMRNNT 223


>UniRef50_Q23H32 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 365

 Score = 36.3 bits (80), Expect = 0.64
 Identities = 18/39 (46%), Positives = 24/39 (61%)
 Frame = +3

Query: 402 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           S+PE+ D R+K      +  V+ QG CGSCWAF  V A+
Sbjct: 134 SVPESVDWREK-----LVAPVQKQGGCGSCWAFSTVIAL 167


>UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18;
           Schistosoma|Rep: Preprocathepsin cathepsin L -
           Schistosoma japonicum (Blood fluke)
          Length = 331

 Score = 36.3 bits (80), Expect = 0.64
 Identities = 14/28 (50%), Positives = 17/28 (60%)
 Frame = +3

Query: 435 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           W D   +  V+ QG CGSCWAF A  A+
Sbjct: 122 WRDHGAVTAVKHQGLCGSCWAFSATGAI 149


>UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|Rep:
           CG4847-PD, isoform D - Drosophila melanogaster (Fruit
           fly)
          Length = 420

 Score = 36.3 bits (80), Expect = 0.64
 Identities = 19/44 (43%), Positives = 26/44 (59%), Gaps = 3/44 (6%)
 Frame = +3

Query: 405 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF---GAVEAMTDR 527
           +P+ FD    W +   +  V+ QG+CGSCWAF   GA+E  T R
Sbjct: 203 IPDAFD----WREHGGVTPVKFQGTCGSCWAFATTGAIEGHTFR 242


>UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_36,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 307

 Score = 36.3 bits (80), Expect = 0.64
 Identities = 11/22 (50%), Positives = 18/22 (81%)
 Frame = +3

Query: 453 LNEVRDQGSCGSCWAFGAVEAM 518
           +N +++QG+CGSCW F A+ A+
Sbjct: 118 MNPIKNQGNCGSCWTFSAIGAV 139


>UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184,
           whole genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_184,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 331

 Score = 36.3 bits (80), Expect = 0.64
 Identities = 19/39 (48%), Positives = 23/39 (58%)
 Frame = +3

Query: 402 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 518
           S P+  D    W D  T   V++QGSCGSCWAF A  A+
Sbjct: 117 SFPDTVD----WKDGLT---VKNQGSCGSCWAFAAAAAI 148


>UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5;
           Theileria|Rep: Cysteine proteinase precursor - Theileria
           parva
          Length = 440

 Score = 36.3 bits (80), Expect = 0.64
 Identities = 16/41 (39%), Positives = 21/41 (51%)
 Frame = +3

Query: 387 IDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 509
           +DL     EN D    W    ++  V+DQ +CG CWAF  V
Sbjct: 223 VDLAKLTGENLD----WRRSSSVTSVKDQSNCGGCWAFSTV 259


>UniRef50_Q3A8I3 Cluster: Putative serine protease; n=1; Pelobacter
           carbinolicus DSM 2380|Rep: Putative serine protease -
           Pelobacter carbinolicus (strain DSM 2380 / Gra Bd 1)
          Length = 1066

 Score = 35.9 bits (79), Expect = 0.84
 Identities = 17/39 (43%), Positives = 23/39 (58%)
 Frame = +3

Query: 399 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEA 515
           A LP +FD R+       +  VR+Q  CGSCW+FG + A
Sbjct: 22  ADLPSSFDLRNI-DGRSYIGPVRNQKKCGSCWSFGTLAA 59


>UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza
           sativa|Rep: Putative cysteine protease - Oryza sativa
           subsp. japonica (Rice)
          Length = 357

 Score = 35.9 bits (79), Expect = 0.84
 Identities = 14/19 (73%), Positives = 16/19 (84%)
 Frame = +3

Query: 462 VRDQGSCGSCWAFGAVEAM 518
           V+DQG+CGS WAF AV AM
Sbjct: 148 VKDQGACGSSWAFAAVAAM 166


>UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus
           tauri|Rep: Cysteine protease-1 - Ostreococcus tauri
          Length = 430

 Score = 35.9 bits (79), Expect = 0.84
 Identities = 15/32 (46%), Positives = 20/32 (62%), Gaps = 3/32 (9%)
 Frame = +3

Query: 435 WPDCPTLNEVRDQGSCGSCWAF---GAVEAMT 521
           W +   +   ++QG CGSCWAF   GAVE +T
Sbjct: 207 WVELGAVTPPKNQGQCGSCWAFSTTGAVEGIT 238


  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 659,370,683
Number of Sequences: 1657284
Number of extensions: 13096216
Number of successful extensions: 32537
Number of sequences better than 10.0: 356
Number of HSP's better than 10.0 without gapping: 31560
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 32488
length of database: 575,637,011
effective HSP length: 98
effective length of database: 413,223,179
effective search space used: 48760335122
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
