
Last updated: 2022/11/18
BLASTX 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= heS00028
         (846 letters)

Database: uniref50 
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpw...   131   2e-29
UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Ca...   127   4e-28
UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin ...   118   2e-25
UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n...   111   2e-23
UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome sh...   108   2e-22
UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=...   107   3e-22
UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep: ...   105   2e-21
UniRef50_A1XG92 Cluster: Putative cathepsin B-like like proteina...   104   3e-21
UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|...   103   4e-21
UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 ...   103   4e-21
UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep: Ca...   100   5e-20
UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase pr...   100   5e-20
UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA...    99   2e-19
UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n...    97   4e-19
UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3 ...    97   4e-19
UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase pr...    97   5e-19
UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=...    96   9e-19
UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor; ...    96   9e-19
UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Catheps...    94   3e-18
UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gamb...    93   1e-17
UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7; Rhabditida|...    93   1e-17
UniRef50_Q23FP9 Cluster: Papain family cysteine protease contain...    92   1e-17
UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4 ...    92   1e-17
UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep...    92   2e-17
UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1; Rhip...    91   3e-17
UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep: Ca...    90   6e-17
UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7...    90   6e-17
UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115, w...    90   7e-17
UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8....    89   2e-16
UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep...    88   2e-16
UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4; Ancylostomati...    88   3e-16
UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8...    87   4e-16
UniRef50_Q625X3 Cluster: Putative uncharacterized protein CBG011...    87   7e-16
UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2; ...    86   9e-16
UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8; Leishma...    86   1e-15
UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2 ...    85   2e-15
UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator ame...    84   4e-15
UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8....    83   6e-15
UniRef50_Q5DEC9 Cluster: SJCHGC06356 protein; n=1; Schistosoma j...    83   9e-15
UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1 ...    83   9e-15
UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyl...    81   3e-14
UniRef50_Q00W63 Cluster: Cysteine proteinase; n=2; Ostreococcus|...    81   3e-14
UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma j...    81   5e-14
UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2...    80   6e-14
UniRef50_P91991 Cluster: Putative uncharacterized protein; n=2; ...    80   6e-14
UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin ...    79   1e-13
UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: C...    78   3e-13
UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|R...    77   4e-13
UniRef50_Q237A1 Cluster: Papain family cysteine protease contain...    77   6e-13
UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus co...    77   7e-13
UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep...    73   7e-12
UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoid...    72   2e-11
UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1; O...    72   2e-11
UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep...    71   4e-11
UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|R...    69   1e-10
UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella ve...    68   3e-10
UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomus...    65   2e-09
UniRef50_A4RYN2 Cluster: Predicted protein; n=1; Ostreococcus lu...    64   3e-09
UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1; ...    63   7e-09
UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|R...    62   1e-08
UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2; cel...    62   2e-08
UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Gi...    62   2e-08
UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Gi...    60   7e-08
UniRef50_UPI0000E4866E Cluster: PREDICTED: similar to oxidized-L...    59   2e-07
UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia...    59   2e-07
UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like prote...    59   2e-07
UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;...    58   2e-07
UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA,...    56   8e-07
UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1; ...    56   8e-07
UniRef50_O02470 Cluster: Cysteine proteinase; n=2; Chromadorea|R...    56   1e-06
UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcop...    54   6e-06
UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35...    54   6e-06
UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lambl...    53   1e-05
UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - ...    52   1e-05
UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin ...    52   2e-05
UniRef50_UPI0000E815AE Cluster: PREDICTED: similar to glucocorti...    52   2e-05
UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p; ...    51   3e-05
UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=...    51   3e-05
UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru...    51   4e-05
UniRef50_Q9GZM7-2 Cluster: Isoform 2 of Q9GZM7 ; n=1; Homo sapie...    50   1e-04
UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadida...    50   1e-04
UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-li...    50   1e-04
UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio...    49   1e-04
UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy...    49   1e-04
UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr...    48   2e-04
UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy...    48   2e-04
UniRef50_Q58HK5 Cluster: Hc58; n=1; Haemonchus contortus|Rep: Hc...    48   4e-04
UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr...    48   4e-04
UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh...    48   4e-04
UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ...    47   5e-04
UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestin...    47   5e-04
UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Try...    47   5e-04
UniRef50_Q6XM39 Cluster: FirrV-1-A48 precursor; n=1; Feldmannia ...    47   7e-04
UniRef50_P81494 Cluster: Cathepsin B; n=2; Phasianidae|Rep: Cath...    47   7e-04
UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ...    46   0.001
UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb...    46   0.001
UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease ...    46   0.002
UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re...    46   0.002
UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big...    46   0.002
UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: ...    46   0.002
UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory...    46   0.002
UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease ...    45   0.002
UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted...    45   0.002
UniRef50_Q23H32 Cluster: Papain family cysteine protease contain...    45   0.002
UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cy...    45   0.002
UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali...    45   0.002
UniRef50_Q9UJW2 Cluster: Tubulointerstitial nephritis antigen; n...    45   0.002
UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin ...    45   0.003
UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag...    45   0.003
UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ...    44   0.004
UniRef50_Q7QVE9 Cluster: GLP_542_3431_1206; n=1; Giardia lamblia...    44   0.004
UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ...    44   0.005
UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ...    44   0.005
UniRef50_Q23TW3 Cluster: Papain family cysteine protease contain...    44   0.005
UniRef50_Q23RT7 Cluster: Papain family cysteine protease contain...    44   0.005
UniRef50_A0CYS7 Cluster: Chromosome undetermined scaffold_31, wh...    44   0.005
UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ...    44   0.005
UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto...    44   0.005
UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt...    44   0.006
UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re...    44   0.006
UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa...    44   0.006
UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain...    44   0.006
UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ...    43   0.008
UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1...    43   0.008
UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; ...    43   0.008
UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n...    43   0.008
UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen...    43   0.008
UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ...    43   0.008
UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ...    43   0.011
UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n...    43   0.011
UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1...    43   0.011
UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n...    43   0.011
UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foet...    43   0.011
UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain...    43   0.011
UniRef50_Q231X3 Cluster: Papain family cysteine protease contain...    43   0.011
UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like cy...    43   0.011
UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, wh...    43   0.011
UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh...    43   0.011
UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, w...    43   0.011
UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ...    43   0.011
UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr...    43   0.011
UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;...    42   0.015
UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy...    42   0.015
UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi...    42   0.015
UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium...    42   0.015
UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n...    42   0.019
UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis...    42   0.019
UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel...    42   0.019
UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ...    42   0.026
UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precurs...    42   0.026
UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e...    42   0.026
UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ...    41   0.034
UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;...    41   0.034
UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;...    41   0.034
UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain...    41   0.034
UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-li...    41   0.034
UniRef50_A2E6N1 Cluster: Clan CA, family C1, cathepsin B-like cy...    41   0.034
UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ...    41   0.034
UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ...    41   0.034
UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; The...    41   0.034
UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; The...    41   0.034
UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Pl...    41   0.034
UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa...    41   0.045
UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; ...    41   0.045
UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba hi...    41   0.045
UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ...    41   0.045
UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=...    41   0.045
UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R...    41   0.045
UniRef50_UPI00006CB77B Cluster: Papain family cysteine protease ...    40   0.059
UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio...    40   0.059
UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3; Theil...    40   0.059
UniRef50_Q22W19 Cluster: Papain family cysteine protease contain...    40   0.059
UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:...    40   0.059
UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precu...    40   0.059
UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi...    40   0.059
UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG...    40   0.079
UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa...    40   0.079
UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae...    40   0.079
UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox...    40   0.079
UniRef50_O16454 Cluster: Temporarily assigned gene name protein ...    40   0.079
UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cy...    40   0.079
UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ...    40   0.10 
UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine p...    40   0.10 
UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep...    40   0.10 
UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi...    40   0.10 
UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n...    40   0.10 
UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4; Caenorhabdi...    40   0.10 
UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H...    40   0.10 
UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like cy...    40   0.10 
UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, wh...    40   0.10 
UniRef50_Q2FLC7 Cluster: Periplasmic copper-binding precursor; n...    40   0.10 
UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryo...    40   0.10 
UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ...    39   0.14 
UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ...    39   0.14 
UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G...    39   0.14 
UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lambl...    39   0.14 
UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain...    39   0.14 
UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain...    39   0.14 
UniRef50_Q22DX2 Cluster: Papain family cysteine protease contain...    39   0.14 
UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like cy...    39   0.14 
UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R...    39   0.14 
UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; M...    39   0.14 
UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ...    39   0.18 
UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 p...    39   0.18 
UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca...    39   0.18 
UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No...    39   0.18 
UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|...    39   0.18 
UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhip...    39   0.18 
UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip...    39   0.18 
UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; ...    39   0.18 
UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus...    39   0.18 
UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Re...    39   0.18 
UniRef50_A5KBM2 Cluster: Serine-repeat antigen; n=3; Plasmodium|...    39   0.18 
UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D...    39   0.18 
UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18...    39   0.18 
UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin ...    38   0.24 
UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome s...    38   0.24 
UniRef50_A6GBK8 Cluster: Peptidase C1A, papain; n=1; Plesiocysti...    38   0.24 
UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa...    38   0.24 
UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n...    38   0.24 
UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ...    38   0.24 
UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy...    38   0.24 
UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ...    38   0.24 
UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia...    38   0.24 
UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j...    38   0.24 
UniRef50_Q24E33 Cluster: Papain family cysteine protease contain...    38   0.24 
UniRef50_Q23FS6 Cluster: Papain family cysteine protease contain...    38   0.24 
UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain...    38   0.24 
UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathep...    38   0.24 
UniRef50_O96163 Cluster: Cysteine protease, putative; n=5; Plasm...    38   0.24 
UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er...    38   0.24 
UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Pl...    38   0.24 
UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto...    38   0.24 
UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|...    38   0.24 
UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2...    38   0.24 
UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ...    38   0.32 
UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; ...    38   0.32 
UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum...    38   0.32 
UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein...    38   0.32 
UniRef50_Q24HI2 Cluster: Papain family cysteine protease contain...    38   0.32 
UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy...    38   0.32 
UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, who...    38   0.32 
UniRef50_A0CGA3 Cluster: Chromosome undetermined scaffold_179, w...    38   0.32 
UniRef50_Q649T1 Cluster: Cathepsin C; n=1; uncultured archaeon G...    38   0.32 
UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ...    38   0.42 
UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat...    38   0.42 
UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; R...    38   0.42 
UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestiv...    38   0.42 
UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph...    38   0.42 
UniRef50_Q0JP65 Cluster: Os01g0240900 protein; n=3; Oryza sativa...    38   0.42 
UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Pir...    38   0.42 
UniRef50_Q26015 Cluster: Serine rich protein homologue; n=4; Pla...    38   0.42 
UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, wh...    38   0.42 
UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; ...    38   0.42 
UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ...    38   0.42 
UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov...    38   0.42 
UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:...    38   0.42 
UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ...    38   0.42 
UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h...    37   0.55 
UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica...    37   0.55 
UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil...    37   0.55 
UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lambli...    37   0.55 
UniRef50_Q7QSU1 Cluster: GLP_127_20145_14275; n=1; Giardia lambl...    37   0.55 
UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcop...    37   0.55 
UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain...    37   0.55 
UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain ...    37   0.55 
UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v...    37   0.55 
UniRef50_A0D5R4 Cluster: Chromosome undetermined scaffold_39, wh...    37   0.55 
UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, wh...    37   0.55 
UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; ...    37   0.55 
UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M...    37   0.55 
UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ...    37   0.73 
UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ...    37   0.73 
UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ...    37   0.73 
UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3; ...    37   0.73 
UniRef50_Q9XZI2 Cluster: Cathepsin Z1 preproprotein; n=3; Bilate...    37   0.73 
UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Pl...    37   0.73 
UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcopt...    37   0.73 
UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis...    37   0.73 
UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain...    37   0.73 
UniRef50_Q23H15 Cluster: Papain family cysteine protease contain...    37   0.73 
UniRef50_Q23H06 Cluster: Papain family cysteine protease contain...    37   0.73 
UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1; ...    37   0.73 
UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w...    37   0.73 
UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|...    37   0.73 
UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C...    37   0.73 
UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata...    37   0.73 
UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3....    37   0.73 
UniRef50_A0LES1 Cluster: Peptidase C1A, papain precursor; n=2; S...    36   0.97 
UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal...    36   0.97 
UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster...    36   0.97 
UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-...    36   0.97 
UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca...    36   0.97 
UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei...    36   0.97 
UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intest...    36   0.97 
UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi...    36   0.97 
UniRef50_Q4YCM9 Cluster: Cysteine protease, putative; n=5; Plasm...    36   0.97 
UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n...    36   0.97 
UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi...    36   0.97 
UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat...    36   0.97 
UniRef50_A5KBM1 Cluster: Serine-repeat antigen; n=1; Plasmodium ...    36   0.97 
UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ...    36   0.97 
UniRef50_Q3A8I3 Cluster: Putative serine protease; n=1; Pelobact...    36   1.3  
UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t...    36   1.3  
UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ...    36   1.3  
UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus ...    36   1.3  
UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar ...    36   1.3  
UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-...    36   1.3  
UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lambli...    36   1.3  
UniRef50_Q23H10 Cluster: Papain family cysteine protease contain...    36   1.3  
UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; ...    36   1.3  
UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheir...    36   1.3  
UniRef50_A4VE98 Cluster: Cathepsin z; n=3; Tetrahymena thermophi...    36   1.3  
UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic...    36   1.3  
UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:...    36   1.3  
UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma...    36   1.3  
UniRef50_UPI0001509FF9 Cluster: Papain family cysteine protease ...    36   1.7  
UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease ...    36   1.7  
UniRef50_Q8ZRX7 Cluster: Putative viral protein; n=1; Salmonella...    36   1.7  
UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryz...    36   1.7  
UniRef50_Q7R0G3 Cluster: GLP_29_33036_32140; n=1; Giardia lambli...    36   1.7  
UniRef50_Q3L7L7 Cluster: Sar s 1 allergen Yv5020C01; n=1; Sarcop...    36   1.7  
UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-li...    36   1.7  
UniRef50_A5UP12 Cluster: Adhesin-like protein; n=1; Methanobrevi...    36   1.7  
UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,...    35   2.2  
UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ...    35   2.2  
UniRef50_Q8QNJ8 Cluster: EsV-1-75; n=1; Ectocarpus siliculosus v...    35   2.2  
UniRef50_Q60EQ9 Cluster: Putative uncharacterized protein OJ1280...    35   2.2  
UniRef50_Q01E11 Cluster: Cysteine proteinase Cathepsin F; n=3; O...    35   2.2  
UniRef50_Q239L8 Cluster: Papain family cysteine protease contain...    35   2.2  
UniRef50_Q235G6 Cluster: Papain family cysteine protease contain...    35   2.2  
UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C...    35   2.2  
UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve...    35   2.2  
UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, w...    35   2.2  
UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc...    35   2.2  
UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba hi...    35   3.0  
UniRef50_Q6A1H9 Cluster: Cathepsin X/O; n=1; Suberites domuncula...    35   3.0  
UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcop...    35   3.0  
UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia...    35   3.0  
UniRef50_Q26153 Cluster: V-SERA 4; n=1; Plasmodium vivax|Rep: V-...    35   3.0  
UniRef50_Q22A69 Cluster: Papain family cysteine protease contain...    35   3.0  
UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina...    35   3.0  
UniRef50_A7APS9 Cluster: Papain family cysteine protease contain...    35   3.0  
UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs...    35   3.0  
UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;...    35   3.0  
UniRef50_UPI000155D183 Cluster: PREDICTED: similar to Cathepsin ...    34   3.9  
UniRef50_A0IYD1 Cluster: Putative outer membrane adhesin like pr...    34   3.9  
UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ...    34   3.9  
UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus...    34   3.9  
UniRef50_Q7QQ83 Cluster: GLP_42_16392_14707; n=1; Giardia lambli...    34   3.9  
UniRef50_Q23EG5 Cluster: Papain family cysteine protease contain...    34   3.9  
UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve...    34   3.9  
UniRef50_A2DPZ6 Cluster: Clan CA, family C1, cathepsin B-like cy...    34   3.9  
UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, wh...    34   3.9  
UniRef50_A0LQG9 Cluster: Peptidase C1A, papain; n=1; Syntrophoba...    34   5.2  
UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|...    34   5.2  
UniRef50_Q01FL5 Cluster: Cysteine protease-5; n=3; Ostreococcus|...    34   5.2  
UniRef50_Q015J8 Cluster: Cathepsin; n=2; Ostreococcus|Rep: Cathe...    34   5.2  
UniRef50_Q93512 Cluster: Putative uncharacterized protein; n=2; ...    34   5.2  
UniRef50_Q8N0R5 Cluster: Cycle like factor BmCyc b; n=4; Obtecto...    34   5.2  
UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schist...    34   5.2  
UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dv...    34   5.2  
UniRef50_Q3YJ15 Cluster: Putative galactosyl transferase; n=1; H...    33   6.8  
UniRef50_Q5YER6 Cluster: Cathepsin Z; n=1; Bigelowiella natans|R...    33   6.8  
UniRef50_Q01FU9 Cluster: Cathepsin Z; n=2; Ostreococcus|Rep: Cat...    33   6.8  
UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2; ...    33   6.8  
UniRef50_Q9XXQ7 Cluster: Putative uncharacterized protein; n=2; ...    33   6.8  
UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ...    33   6.8  
UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|...    33   6.8  
UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep...    33   6.8  
UniRef50_Q54MB6 Cluster: Putative uncharacterized protein; n=1; ...    33   6.8  
UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;...    33   6.8  
UniRef50_Q3L7L6 Cluster: Sar s 1 allergen Yv5032C08; n=1; Sarcop...    33   6.8  
UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl...    33   6.8  
UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain...    33   6.8  
UniRef50_A2ERV3 Cluster: Putative uncharacterized protein; n=1; ...    33   6.8  
UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like cy...    33   6.8  
UniRef50_A0EAG2 Cluster: Chromosome undetermined scaffold_86, wh...    33   6.8  
UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [C...    33   6.8  
UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ...    33   9.0  
UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome s...    33   9.0  
UniRef50_Q89Z69 Cluster: Putative uncharacterized protein; n=1; ...    33   9.0  
UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty...    33   9.0  
UniRef50_A3B2E1 Cluster: Putative uncharacterized protein; n=1; ...    33   9.0  
UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n...    33   9.0  
UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckei...    33   9.0  
UniRef50_Q54JE9 Cluster: Putative uncharacterized protein; n=1; ...    33   9.0  
UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theil...    33   9.0  
UniRef50_Q248G1 Cluster: Papain family cysteine protease contain...    33   9.0  
UniRef50_Q1AMF3 Cluster: Cathepsin C1; n=1; Toxoplasma gondii|Re...    33   9.0  
UniRef50_A7S9N1 Cluster: Predicted protein; n=1; Nematostella ve...    33   9.0  

>UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep:
           Parcxpwnx02 - Periplaneta americana (American cockroach)
          Length = 343

 Score =  131 bits (316), Expect = 2e-29
 Identities = 54/79 (68%), Positives = 64/79 (81%)
 Frame = +2

Query: 272 LPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 451
           LP K+   D+   +PE FDPR++WP+CPTL E+RDQGSCGSCWAFGAVEAM+DRVC +S 
Sbjct: 81  LPEKSME-DIDIEIPEEFDPREQWPECPTLKEIRDQGSCGSCWAFGAVEAMSDRVCIHSK 139

Query: 452 GTKHFHFSAEDLLSCCPIC 508
           G  HFHFSAEDLL+CC  C
Sbjct: 140 GKTHFHFSAEDLLTCCSSC 158



 Score =  128 bits (309), Expect = 2e-28
 Identities = 51/85 (60%), Positives = 61/85 (71%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693
           GC+GG P  AW+YW   G+VSGGSYNS QGC+PY I PCEHHV G R PC G+  TP+C 
Sbjct: 161 GCNGGEPGAAWDYWVSTGIVSGGSYNSHQGCQPYAIEPCEHHVNGTRKPC-GEGDTPRCV 219

Query: 694 KKCESGYDVNYKQDKQYGKHVYTCP 768
           K+CE GYDV Y +D+ +GK  Y  P
Sbjct: 220 KRCEEGYDVPYGKDRHFGKSAYAVP 244



 Score = 46.4 bits (105), Expect = 0.001
 Identities = 21/41 (51%), Positives = 26/41 (63%)
 Frame = +3

Query: 126 LPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGV 248
           L  PLSD+FI+ IN    +WKA RNF  D     +KK+MGV
Sbjct: 32  LVDPLSDDFIDHINSLNTTWKAHRNFGNDIPLREIKKLMGV 72


>UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1)
           (Cathepsin B1) (APP secretase) (APPS) [Contains:
           Cathepsin B light chain; Cathepsin B heavy chain]; n=85;
           Eukaryota|Rep: Cathepsin B precursor (EC 3.4.22.1)
           (Cathepsin B1) (APP secretase) (APPS) [Contains:
           Cathepsin B light chain; Cathepsin B heavy chain] - Homo
           sapiens (Human)
          Length = 339

 Score =  127 bits (306), Expect = 4e-28
 Identities = 51/83 (61%), Positives = 59/83 (71%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693
           GC+GG P  AW +W   GLVSGG Y S  GCRPY IPPCEHHV G+R PC+G+  TPKC+
Sbjct: 149 GCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCS 208

Query: 694 KKCESGYDVNYKQDKQYGKHVYT 762
           K CE GY   YKQDK YG + Y+
Sbjct: 209 KICEPGYSPTYKQDKHYGYNSYS 231



 Score =  104 bits (249), Expect = 3e-21
 Identities = 40/63 (63%), Positives = 51/63 (80%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           LP +FD R++WP CPT+ E+RDQGSCGSCWAFGAVEA++DR+C ++N       SAEDLL
Sbjct: 80  LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139

Query: 491 SCC 499
           +CC
Sbjct: 140 TCC 142



 Score = 46.0 bits (104), Expect = 0.001
 Identities = 23/57 (40%), Positives = 36/57 (63%), Gaps = 4/57 (7%)
 Frame = +3

Query: 87  YVTLVC--VLAAAKDLP--HPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMG 245
           + +L C  VLA A+  P  HPLSDE +N +N +  +W+AG NF  +   ++LK++ G
Sbjct: 5   WASLCCLLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLCG 60



 Score = 42.7 bits (96), Expect = 0.011
 Identities = 18/27 (66%), Positives = 22/27 (81%)
 Frame = +3

Query: 762 LSGDEDHIRAELFKNGPVEGAFTVYSD 842
           +S  E  I AE++KNGPVEGAF+VYSD
Sbjct: 232 VSNSEKDIMAEIYKNGPVEGAFSVYSD 258


>UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin B;
           n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
           similar to cathepsin B - Strongylocentrotus purpuratus
          Length = 346

 Score =  118 bits (284), Expect = 2e-25
 Identities = 48/76 (63%), Positives = 56/76 (73%)
 Frame = +2

Query: 281 KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460
           K  N   I  LPENFD R+ WP+CPT+ EVRDQGSCGSCWAFGAVEA++DR+C  S G  
Sbjct: 68  KLENQTRIKDLPENFDARENWPNCPTIKEVRDQGSCGSCWAFGAVEAISDRICIKSKGQT 127

Query: 461 HFHFSAEDLLSCCPIC 508
             H SAEDL++CC  C
Sbjct: 128 QVHISAEDLMTCCKTC 143



 Score =  114 bits (274), Expect = 3e-24
 Identities = 44/81 (54%), Positives = 57/81 (70%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693
           GC+GG P  AWEY+K  G+V+GG +NSSQGC+PY+I  C+HHV G + PC G+  TP+C 
Sbjct: 146 GCNGGFPGSAWEYYKDTGIVTGGQWNSSQGCQPYQIKSCDHHVNGTKGPCQGEGPTPECK 205

Query: 694 KKCESGYDVNYKQDKQYGKHV 756
            KCE+ Y   Y+QDK Y   V
Sbjct: 206 HKCEASYSTPYEQDKHYALSV 226


>UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n=4;
           Tenebrionidae|Rep: Putative cathepsin B-like proteinase
           - Tenebrio molitor (Yellow mealworm)
          Length = 321

 Score =  111 bits (268), Expect = 2e-23
 Identities = 43/78 (55%), Positives = 59/78 (75%)
 Frame = +2

Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454
           P+  H F+    +PE+FD R KWP+C +LN +RDQG+CGSCWAF ++E+M+DR+C +S+G
Sbjct: 72  PVLVHTFNA-RDVPESFDARTKWPNCDSLNRIRDQGACGSCWAFASIESMSDRICIHSSG 130

Query: 455 TKHFHFSAEDLLSCCPIC 508
           +  F FS EDLLSCC  C
Sbjct: 131 SAQFMFSPEDLLSCCTSC 148



 Score = 64.5 bits (150), Expect = 3e-09
 Identities = 32/81 (39%), Positives = 44/81 (54%)
 Frame = +1

Query: 517 CSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCTK 696
           C GG    A +++ + G+VSGG  NS++GCRPY     + H  G         +TP CTK
Sbjct: 151 CGGGYMMSALDFYINEGIVSGGDVNSNEGCRPY---TADAHDQG---------QTPACTK 198

Query: 697 KCESGYDVNYKQDKQYGKHVY 759
            C +GY  +Y  DK YG + Y
Sbjct: 199 SCRNGYSTSYSADKHYGSNDY 219



 Score = 54.0 bits (124), Expect = 5e-06
 Identities = 30/66 (45%), Positives = 44/66 (66%)
 Frame = +3

Query: 63  KMFISRAAYVTLVCVLAAAKDLPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIM 242
           K+F+S   +V LV VL+A+      LS EFI++IN  Q+SW AGRNFP +T+  +L K+ 
Sbjct: 2   KIFLS---FVVLVAVLSASLAEIDVLSSEFIDSINRIQSSWVAGRNFPENTTNEYLYKLN 58

Query: 243 GVIEMN 260
           G I ++
Sbjct: 59  GFIGLH 64


>UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome
           shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome 5
           SCAF15026, whole genome shotgun sequence - Tetraodon
           nigroviridis (Green puffer)
          Length = 351

 Score =  108 bits (259), Expect = 2e-22
 Identities = 43/66 (65%), Positives = 52/66 (78%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           LP+ FD R++WP+CPTL E+RDQGSCGSCWAFGA EAM+DRVC +SN       SA+DLL
Sbjct: 79  LPKEFDSREQWPNCPTLKEIRDQGSCGSCWAFGASEAMSDRVCIHSNAKVSVELSAQDLL 138

Query: 491 SCCPIC 508
           +CC  C
Sbjct: 139 TCCNSC 144



 Score =  100 bits (240), Expect = 4e-20
 Identities = 50/106 (47%), Positives = 62/106 (58%), Gaps = 22/106 (20%)
 Frame = +1

Query: 511 LGCSGGMPRLAWEYWKHFGLVSGGSYNS---------------------SQGCRPYEIPP 627
           +GC+GG P  AW +W   GLVSGG Y+S                     S GCRPY IPP
Sbjct: 146 MGCNGGYPSSAWNFWVSDGLVSGGLYDSHIGRIQVSLCVLLLAVDRDFVSPGCRPYTIPP 205

Query: 628 CEHHVPGNRMPCSGD-TKTPKCTKKCESGYDVNYKQDKQYGKHVYT 762
           CEHHV G+R  CSG+   TP+C  +CE+GY  +YKQDK +GK  Y+
Sbjct: 206 CEHHVNGSRPSCSGEGGDTPECIFRCEAGYSPSYKQDKHFGKTSYS 251



 Score = 48.4 bits (110), Expect = 2e-04
 Identities = 20/32 (62%), Positives = 25/32 (78%)
 Frame = +3

Query: 747 KTCIYLSGDEDHIRAELFKNGPVEGAFTVYSD 842
           KT   +S +ED I+ E++KNGPVEGAFTVY D
Sbjct: 247 KTSYSVSSEEDEIKQEIYKNGPVEGAFTVYED 278



 Score = 41.9 bits (94), Expect = 0.019
 Identities = 20/59 (33%), Positives = 35/59 (59%), Gaps = 2/59 (3%)
 Frame = +3

Query: 81  AAYVTLVCVLAAAKDLPH--PLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVI 251
           AA++ L    +++   PH  PLS E +N IN   ++W AG NF  +  ++++KK+ G +
Sbjct: 4   AAFLFLAAAWSSSLARPHLKPLSSEMVNYINKLNSTWTAGHNF-HNVDYSYVKKLCGTL 61


>UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=1;
           Biomphalaria glabrata|Rep: Cathepsin B preproprotein
           precursor - Biomphalaria glabrata (Bloodfluke planorb)
          Length = 333

 Score =  107 bits (257), Expect = 3e-22
 Identities = 48/101 (47%), Positives = 62/101 (61%)
 Frame = +2

Query: 206 ARHIVRAS*ENNGSYRDEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGS 385
           AR ++  +   N +Y   H     ++  N      LP+NFDPR KWPDC +LNE+RDQ +
Sbjct: 57  ARALLGVNMAENKAYNRIHLKYKQVQPRN-----DLPDNFDPRTKWPDCASLNEIRDQAN 111

Query: 386 CGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPIC 508
           CGSCWAFG+ EAMTDR+C    G  + H SAED+  CC  C
Sbjct: 112 CGSCWAFGSAEAMTDRICIAGKG--NIHISAEDINDCCKSC 150



 Score =  104 bits (250), Expect = 2e-21
 Identities = 40/83 (48%), Positives = 52/83 (62%)
 Frame = +1

Query: 511 LGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKC 690
           +GC+GG P  AWE++   G+VSGG Y +++GC PY +P C+HH  G   PC     TPKC
Sbjct: 152 MGCNGGYPAAAWEWYVDTGVVSGGQYGTNEGCMPYSLPHCDHHTTGKYQPCPAVVPTPKC 211

Query: 691 TKKCESGYDVNYKQDKQYGKHVY 759
            KKC +GY  +Y  DK  GK  Y
Sbjct: 212 EKKCLTGYPKSYSNDKTRGKKSY 234


>UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep:
           Cathepsin B - Apriona germari
          Length = 324

 Score =  105 bits (251), Expect = 2e-21
 Identities = 44/89 (49%), Positives = 63/89 (70%)
 Frame = +2

Query: 242 GSYRDEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEA 421
           G  RD +  TLP+  H  + I+ +P++FD R++WP C ++  +RD+G+CGSCWAF AVE 
Sbjct: 64  GINRDPN-VTLPVVFH--EAISGIPDSFDAREQWPFCESIRTIRDEGACGSCWAFAAVEV 120

Query: 422 MTDRVCTYSNGTKHFHFSAEDLLSCCPIC 508
           M+DR+C  S G K F FSAE+++SCC  C
Sbjct: 121 MSDRLCLASEGRKKFIFSAEEVVSCCTAC 149



 Score = 53.6 bits (123), Expect = 6e-06
 Identities = 28/82 (34%), Positives = 41/82 (50%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693
           GC GG     ++YW   G+ SGG Y S  GC+PY                SG+  TP+C 
Sbjct: 152 GCRGGFLNEPYKYWVTNGIPSGGDYGSKLGCKPY------------TAAVSGE--TPQCQ 197

Query: 694 KKCESGYDVNYKQDKQYGKHVY 759
           K C SGY+ ++++D ++    Y
Sbjct: 198 KACVSGYEKSWEKDLRHATSAY 219


>UniRef50_A1XG92 Cluster: Putative cathepsin B-like like proteinase;
           n=1; Tenebrio molitor|Rep: Putative cathepsin B-like
           like proteinase - Tenebrio molitor (Yellow mealworm)
          Length = 301

 Score =  104 bits (249), Expect = 3e-21
 Identities = 45/80 (56%), Positives = 59/80 (73%), Gaps = 1/80 (1%)
 Frame = +2

Query: 272 LPIKTHNFDLIASLPENFDPRDKWPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRVCTYS 448
           LP+KTH  +L A +PE+FD R+ WP+C ++  E+RDQ SCGSCWAFGAVEAM+DR+C +S
Sbjct: 72  LPVKTHAVNLDA-IPESFDAREAWPECTSIIGEIRDQASCGSCWAFGAVEAMSDRICIHS 130

Query: 449 NGTKHFHFSAEDLLSCCPIC 508
           + +     SAEDL  CC  C
Sbjct: 131 DASVKVRISAEDLNDCCYDC 150



 Score =  104 bits (249), Expect = 3e-21
 Identities = 41/89 (46%), Positives = 56/89 (62%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693
           GC+GG P LAW YW   G+V+GG Y   +GC+ Y I PC+HHV GN  PC    +TP C 
Sbjct: 153 GCNGGWPDLAWSYWSSTGIVTGGLYGVDEGCKAYSIKPCDHHVDGNLGPCGDIQRTPACK 212

Query: 694 KKCESGYDVNYKQDKQYGKHVYTCPETKT 780
           K C+S  D+ YK D + G   Y+ P++++
Sbjct: 213 KSCDSTSDLEYKSDLRRGS-AYSIPKSES 240



 Score = 61.3 bits (142), Expect = 3e-08
 Identities = 24/40 (60%), Positives = 33/40 (82%)
 Frame = +3

Query: 132 HPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVI 251
           HPLSDEFIN IN KQ +WKAGRNF  +T  +H+++++GV+
Sbjct: 24  HPLSDEFINEINSKQTTWKAGRNFDVNTPISHVRRLLGVL 63


>UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis
           sinensis|Rep: Cathepsin B5 - Clonorchis sinensis
          Length = 343

 Score =  103 bits (248), Expect = 4e-21
 Identities = 48/85 (56%), Positives = 59/85 (69%), Gaps = 1/85 (1%)
 Frame = +2

Query: 257 EHFATLPIKTHN-FDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 433
           E  A  P   H+ FD +  LP+NFD R  WP C +++E+RDQ SCGSCWAFGAVEAM+DR
Sbjct: 68  EQKAQRPTLRHDGFDNMR-LPKNFDARKTWPHCSSISEIRDQSSCGSCWAFGAVEAMSDR 126

Query: 434 VCTYSNGTKHFHFSAEDLLSCCPIC 508
           +C +SNG  +   SA DLLSCC  C
Sbjct: 127 LCIHSNGAFNKSLSAVDLLSCCKDC 151



 Score = 90.6 bits (215), Expect = 4e-17
 Identities = 37/76 (48%), Positives = 49/76 (64%), Gaps = 1/76 (1%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDT-KTPKC 690
           GC GG P +AW+YWK  G+V+GGS     GCR Y  P CEHHV G+  PC  +   TP+C
Sbjct: 154 GCRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEHHVQGHYPPCPRELYPTPEC 213

Query: 691 TKKCESGYDVNYKQDK 738
            ++C++  DV Y +DK
Sbjct: 214 VQQCDTP-DVGYLEDK 228


>UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6
           precursor; n=11; Bilateria|Rep: Cathepsin B-like
           cysteine proteinase 6 precursor - Caenorhabditis elegans
          Length = 379

 Score =  103 bits (248), Expect = 4e-21
 Identities = 44/76 (57%), Positives = 54/76 (71%)
 Frame = +2

Query: 281 KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460
           KT + DL   +PE+FD RD WP C ++  +RDQ SCGSCWAFGAVEAM+DR+C  S+G  
Sbjct: 97  KTKDLDL--DIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGEL 154

Query: 461 HFHFSAEDLLSCCPIC 508
               SA+DLLSCC  C
Sbjct: 155 QVTLSADDLLSCCKSC 170



 Score = 94.7 bits (225), Expect = 3e-18
 Identities = 41/85 (48%), Positives = 50/85 (58%), Gaps = 3/85 (3%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRM-PCSGDT-KTPK 687
           GC+GG P  AW YW   G+V+G +Y ++ GC+PY  PPCEHH       PC  D   TPK
Sbjct: 173 GCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPK 232

Query: 688 CTKKCESGY-DVNYKQDKQYGKHVY 759
           C KKC S Y D  Y +DK +G   Y
Sbjct: 233 CEKKCVSDYTDKTYSEDKFFGASAY 257


>UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep:
           Cathepsin b - Aedes aegypti (Yellowfever mosquito)
          Length = 386

 Score =  100 bits (239), Expect = 5e-20
 Identities = 40/66 (60%), Positives = 47/66 (71%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           LP+ FD R+KWP+CP+L E+RDQG CGSCWA  A  AMTDR C  S G + F F + DLL
Sbjct: 125 LPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLL 184

Query: 491 SCCPIC 508
           SCC  C
Sbjct: 185 SCCHSC 190



 Score = 81.0 bits (191), Expect = 3e-14
 Identities = 40/86 (46%), Positives = 49/86 (56%), Gaps = 1/86 (1%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693
           GC GG    AW++W   GL SGG  NS QGC PY I  C   +PG       D  TPKC+
Sbjct: 193 GCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGEC--RIPGE------DEDTPKCS 244

Query: 694 KKCESGYDV-NYKQDKQYGKHVYTCP 768
            KC SGY+V +  QD+ YG+  Y+ P
Sbjct: 245 NKCRSGYNVTDVWQDRHYGRVAYSLP 270


>UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase
           precursor; n=28; Bilateria|Rep: Cathepsin B-like
           cysteine proteinase precursor - Schistosoma japonicum
           (Blood fluke)
          Length = 342

 Score =  100 bits (239), Expect = 5e-20
 Identities = 42/78 (53%), Positives = 52/78 (66%)
 Frame = +2

Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454
           P   H+ DL   +P  FD R KWP C +++++RDQ  CGSCWAFGAVEAMTDR+C  S G
Sbjct: 79  PTVDHH-DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGG 137

Query: 455 TKHFHFSAEDLLSCCPIC 508
            +    SA DL+SCC  C
Sbjct: 138 GQSAELSALDLISCCKDC 155



 Score = 99.1 bits (236), Expect = 1e-19
 Identities = 40/95 (42%), Positives = 52/95 (54%), Gaps = 1/95 (1%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDT-KTPKC 690
           GC GG P +AW+YW   G+V+GGS  +  GC+PY  P CEHH  G    C     KTP+C
Sbjct: 158 GCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQC 217

Query: 691 TKKCESGYDVNYKQDKQYGKHVYTCPETKTTSARN 795
            + C+ GY   Y+QDK YG   Y     +    R+
Sbjct: 218 KQTCQKGYKTPYEQDKHYGDESYNVQNNEKVIQRD 252


>UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA;
           n=1; Tribolium castaneum|Rep: PREDICTED: similar to
           CG10992-PA - Tribolium castaneum
          Length = 325

 Score = 98.7 bits (235), Expect = 2e-19
 Identities = 42/90 (46%), Positives = 59/90 (65%), Gaps = 1/90 (1%)
 Frame = +2

Query: 242 GSYRDEHFATLPIKTHNFDLIASLPENFDPRDKWPDCP-TLNEVRDQGSCGSCWAFGAVE 418
           G + D ++  +  K H    I S+PE+FD R+KWP+C   + ++R+QG+CGSCWAF + E
Sbjct: 54  GLHPDPNYK-IQTKQHKISRIISIPESFDAREKWPECKDVIGKIRNQGNCGSCWAFASTE 112

Query: 419 AMTDRVCTYSNGTKHFHFSAEDLLSCCPIC 508
            MTDR+C  S G   F FS E+LL+CC  C
Sbjct: 113 VMTDRLCISSKGKIKFVFSPENLLTCCKDC 142



 Score = 52.4 bits (120), Expect = 1e-05
 Identities = 19/34 (55%), Positives = 26/34 (76%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPY 615
           GC GG  + AW+Y+ + G+ SGG YNSS+GC+PY
Sbjct: 145 GCKGGYIKNAWDYYINEGIASGGDYNSSEGCQPY 178


>UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n=8;
           Strongylida|Rep: Cathepsin B-like cysteine protease 2 -
           Parelaphostrongylus tenuis
          Length = 344

 Score = 97.5 bits (232), Expect = 4e-19
 Identities = 37/66 (56%), Positives = 50/66 (75%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           +P++FD R +WP CP+++ +RDQ  CGSCWAFG+ EAM+DRVC  S+G K    SA+D+L
Sbjct: 94  IPDSFDARVQWPHCPSISYIRDQSQCGSCWAFGSAEAMSDRVCIASHGNKTVELSADDIL 153

Query: 491 SCCPIC 508
           SCC  C
Sbjct: 154 SCCYDC 159



 Score = 93.9 bits (223), Expect = 5e-18
 Identities = 40/90 (44%), Positives = 51/90 (56%), Gaps = 1/90 (1%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRM-PCSGDTKTPKC 690
           GC GG P  AWEY+   G+V+GG Y +   CRPYEIPPC HH        C+    TP C
Sbjct: 162 GCDGGYPISAWEYFVETGVVTGGLYGTKDSCRPYEIPPCGHHRNETFYGNCTQIADTPDC 221

Query: 691 TKKCESGYDVNYKQDKQYGKHVYTCPETKT 780
              C++GY ++Y  DK +GK  YT   + T
Sbjct: 222 VTTCQAGYPISYDDDKTFGKDSYTIESSVT 251


>UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3
           precursor; n=4; Caenorhabditis|Rep: Cathepsin B-like
           cysteine proteinase 3 precursor - Caenorhabditis elegans
          Length = 370

 Score = 97.5 bits (232), Expect = 4e-19
 Identities = 38/63 (60%), Positives = 48/63 (76%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           LP+ FD R+KWPDC T+  +R+Q +CGSCWAFGA E ++DRVC  SNGT+    S ED+L
Sbjct: 92  LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 151

Query: 491 SCC 499
           SCC
Sbjct: 152 SCC 154



 Score = 65.3 bits (152), Expect = 2e-09
 Identities = 33/92 (35%), Positives = 42/92 (45%), Gaps = 1/92 (1%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693
           GC GG    A  +W   G V+GG Y    GC PY   PC  + P        ++ TP C 
Sbjct: 161 GCKGGYSIEALRFWASSGAVTGGDY-GGHGCMPYSFAPCTKNCP--------ESTTPSCK 211

Query: 694 KKCESGYDV-NYKQDKQYGKHVYTCPETKTTS 786
             C+S Y    YK+DK YG   Y    TK+ +
Sbjct: 212 TTCQSSYKTEEYKKDKHYGASAYKVTTTKSVT 243


>UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase
           precursor; n=29; Schistosomatidae|Rep: Cathepsin B-like
           cysteine proteinase precursor - Schistosoma mansoni
           (Blood fluke)
          Length = 340

 Score = 97.1 bits (231), Expect = 5e-19
 Identities = 41/78 (52%), Positives = 50/78 (64%)
 Frame = +2

Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454
           P   HN D    +P NFD R KWP C ++  +RDQ  CGSCW+FGAVEAM+DR C  S G
Sbjct: 78  PTVDHN-DWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCWSFGAVEAMSDRSCIQSGG 136

Query: 455 TKHFHFSAEDLLSCCPIC 508
            ++   SA DLL+CC  C
Sbjct: 137 KQNVELSAVDLLTCCESC 154



 Score = 86.2 bits (204), Expect = 9e-16
 Identities = 36/84 (42%), Positives = 44/84 (52%), Gaps = 1/84 (1%)
 Frame = +1

Query: 511 LGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDT-KTPK 687
           LGC GG+   AW+YW   G+V+  S  +  GC PY  P CEHH  G   PC      TP+
Sbjct: 156 LGCEGGILGPAWDYWVKEGIVTASSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYNTPR 215

Query: 688 CTKKCESGYDVNYKQDKQYGKHVY 759
           C + C+  Y   Y QDK  GK  Y
Sbjct: 216 CKQTCQRKYKTPYTQDKHRGKSSY 239



 Score = 34.7 bits (76), Expect = 3.0
 Identities = 15/32 (46%), Positives = 20/32 (62%)
 Frame = +3

Query: 747 KTCIYLSGDEDHIRAELFKNGPVEGAFTVYSD 842
           K+   +  DE  I+ E+ K GPVE +FTVY D
Sbjct: 236 KSSYNVKNDEKAIQKEIMKYGPVEASFTVYED 267


>UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=1;
           Nilaparvata lugens|Rep: Cathepsin B-like protease
           precursor - Nilaparvata lugens (Brown planthopper)
          Length = 347

 Score = 96.3 bits (229), Expect = 9e-19
 Identities = 41/91 (45%), Positives = 52/91 (57%), Gaps = 2/91 (2%)
 Frame = +1

Query: 502 YL*LGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGD--T 675
           Y   GC GG P  AW + K  GLV+GG Y+S  GC+PY I PCEHH+ G++  CS     
Sbjct: 156 YCGFGCEGGFPDAAWVFIKRHGLVTGGDYHSHDGCQPYPIAPCEHHMEGSKPNCSASPTE 215

Query: 676 KTPKCTKKCESGYDVNYKQDKQYGKHVYTCP 768
            TP C   C  G  + Y++D+Q GK  Y  P
Sbjct: 216 PTPACETTCTHGSSLAYQKDRQKGKSAYLVP 246



 Score = 83.4 bits (197), Expect = 6e-15
 Identities = 32/66 (48%), Positives = 42/66 (63%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           +P+ FD R KW  C +L E+RDQG+CGSCWA     A  DR+C  SN   + H S+ +L+
Sbjct: 92  VPKYFDARKKWKKCKSLREIRDQGNCGSCWAVSVAAAFADRLCIASNAKWNGHISSRELM 151

Query: 491 SCCPIC 508
           SCC  C
Sbjct: 152 SCCSYC 157



 Score = 38.7 bits (86), Expect = 0.18
 Identities = 19/59 (32%), Positives = 35/59 (59%), Gaps = 1/59 (1%)
 Frame = +3

Query: 84  AYVTLVCVLAAAKDLPHPLSDEFINTINLKQNS-WKAGRNFPRDTSFAHLKKIMGVIEM 257
           A V+ +  L   ++    +++++I+ IN    S WKAG NF  DT  ++L+ ++GV E+
Sbjct: 10  AVVSAISALPDQENTVREIANKWIDAINNNPKSTWKAGHNFHPDTPMSYLQGLLGVSEL 68


>UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           B-like proteinase precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 331

 Score = 96.3 bits (229), Expect = 9e-19
 Identities = 38/79 (48%), Positives = 49/79 (62%), Gaps = 1/79 (1%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSG-DTKTPKC 690
           GC GG P +AW YW   G+ +GG Y S QGC+PY + PCEHH  GN++ CS  D  TP C
Sbjct: 148 GCEGGYPTMAWSYWIDTGITTGGLYGSKQGCQPYSLQPCEHHTEGNKVQCSTLDYDTPSC 207

Query: 691 TKKCESGYDVNYKQDKQYG 747
             KC+    +NYK +  +G
Sbjct: 208 KHKCDDS-ALNYKSELTFG 225



 Score = 76.2 bits (179), Expect = 1e-12
 Identities = 35/78 (44%), Positives = 46/78 (58%), Gaps = 1/78 (1%)
 Frame = +2

Query: 284 THNFDLIASLPENFDPRDKWPDCP-TLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460
           TH+ D+   +P +FD R+ W +C   ++ V DQ  CGSCWA  A  AM+DR C  S G  
Sbjct: 72  THSEDI--QVPNSFDARENWKECSDVISTVVDQSDCGSCWAVAAASAMSDRRCIASQGKL 129

Query: 461 HFHFSAEDLLSCCPICDW 514
               SAE+LLSCC  C +
Sbjct: 130 KVPVSAENLLSCCDSCGY 147



 Score = 52.4 bits (120), Expect = 1e-05
 Identities = 23/58 (39%), Positives = 40/58 (68%), Gaps = 2/58 (3%)
 Frame = +3

Query: 78  RAAYVT--LVCVLAAAKDLPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMG 245
           +AA++   L+ ++ + K  P+PLS++FIN IN KQ++W AG+NF  + S   +K ++G
Sbjct: 2   KAAFIITLLLPIVLSYKGSPNPLSNDFINYINSKQSTWVAGKNFDENLSIQEIKNLLG 59


>UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Cathepsin
           B - Fasciola gigantica (Giant liver fluke)
          Length = 339

 Score = 94.3 bits (224), Expect = 3e-18
 Identities = 42/85 (49%), Positives = 52/85 (61%)
 Frame = +2

Query: 254 DEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 433
           +E  A  P   H+      LPE+FD R +WP C T++E+RDQ SCGSCWA  A  AM+DR
Sbjct: 68  EERNALRPTIKHDISK-NDLPESFDARSQWPQCWTISEIRDQASCGSCWATAAASAMSDR 126

Query: 434 VCTYSNGTKHFHFSAEDLLSCCPIC 508
           VC +SNG      +A D LSCC  C
Sbjct: 127 VCIHSNGQMRPRLAAADPLSCCTYC 151



 Score = 80.6 bits (190), Expect = 5e-14
 Identities = 36/105 (34%), Positives = 56/105 (53%), Gaps = 2/105 (1%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMP-CSGDT-KTPK 687
           GC GG P  AW+YW   G+V+GG++ +  GC+P+    C+H     +   C   T  TP 
Sbjct: 154 GCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYTYPTPP 213

Query: 688 CTKKCESGYDVNYKQDKQYGKHVYTCPETKTTSARNCSRMVPSKV 822
           C + C++GY+  Y+QDK YG   Y   E ++   +   +  P +V
Sbjct: 214 CARACQTGYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEV 258


>UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gambiae
           str. PEST|Rep: ENSANGP00000012227 - Anopheles gambiae
           str. PEST
          Length = 218

 Score = 92.7 bits (220), Expect = 1e-17
 Identities = 35/66 (53%), Positives = 48/66 (72%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           +PE+FD R+ WP+C +L  +R+QG+CGSCWA  A   M+DRVC +SNGT +   +AEDL+
Sbjct: 1   IPESFDARNHWPNCESLRAIRNQGTCGSCWAVAAASVMSDRVCIHSNGTINVALAAEDLM 60

Query: 491 SCCPIC 508
            CC  C
Sbjct: 61  GCCVDC 66



 Score = 36.7 bits (81), Expect(2) = 0.010
 Identities = 15/29 (51%), Positives = 22/29 (75%), Gaps = 1/29 (3%)
 Frame = +1

Query: 514 GCSGG-MPRLAWEYWKHFGLVSGGSYNSS 597
           GC+GG +   +++YW   GLVSGG+YNS+
Sbjct: 69  GCNGGFLDGTSFQYWVDAGLVSGGAYNST 97



 Score = 25.4 bits (53), Expect(2) = 0.010
 Identities = 9/22 (40%), Positives = 14/22 (63%)
 Frame = +1

Query: 703 ESGYDVNYKQDKQYGKHVYTCP 768
           + G D +Y +DK +GK  Y+ P
Sbjct: 98  DDGVDRHYSKDKLFGKVAYSVP 119


>UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7;
           Rhabditida|Rep: Cysteine proteinase 3 - Necator
           americanus (Human hookworm)
          Length = 360

 Score = 92.7 bits (220), Expect = 1e-17
 Identities = 36/77 (46%), Positives = 48/77 (62%)
 Frame = +2

Query: 278 IKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT 457
           +K  + D    +P +FD RDKWP C ++  +RDQ  CGSCWA  + E M+DR+C  SNGT
Sbjct: 79  LKEEDMDFSEEIPVSFDARDKWPKCTSIGFIRDQSHCGSCWAVSSAETMSDRLCVQSNGT 138

Query: 458 KHFHFSAEDLLSCCPIC 508
                S  D+L+CCP C
Sbjct: 139 IKVLLSDTDILACCPNC 155



 Score = 77.0 bits (181), Expect = 6e-13
 Identities = 35/90 (38%), Positives = 46/90 (51%), Gaps = 1/90 (1%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDT-KTPKC 690
           GC GG    AWEY+K+ G+ +GG Y +   C+PY   PC+    G    C  D+  TPKC
Sbjct: 158 GCGGGHTIRAWEYFKNTGVCTGGLYGTKDSCKPYAFYPCKDESYGK---CPKDSFPTPKC 214

Query: 691 TKKCESGYDVNYKQDKQYGKHVYTCPETKT 780
            K C+  Y   Y  DK Y    Y  P+ +T
Sbjct: 215 RKICQYKYSKKYADDKYYANSAYRIPQNET 244


>UniRef50_Q23FP9 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 340

 Score = 92.3 bits (219), Expect = 1e-17
 Identities = 37/88 (42%), Positives = 48/88 (54%), Gaps = 1/88 (1%)
 Frame = +1

Query: 502 YL*LGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKT 681
           Y  +GC GG P  AW Y K  G+ +GG Y     C+PY  PPC+HHV G   PC     T
Sbjct: 153 YCGMGCKGGYPSAAWGYMKRQGVSTGGLYGDDTSCKPYIFPPCDHHVTGQYQPCGPIQPT 212

Query: 682 PKCTKKCESGYDVN-YKQDKQYGKHVYT 762
           P+C K+C S Y  N Y++D  +    Y+
Sbjct: 213 PQCVKECNSEYTQNTYEKDLHFASQTYS 240



 Score = 89.8 bits (213), Expect = 7e-17
 Identities = 39/87 (44%), Positives = 54/87 (62%), Gaps = 1/87 (1%)
 Frame = +2

Query: 242 GSYRDEHFATLPIKTHNFDLIAS-LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVE 418
           GS  +  +  LP K  + +  A  +PE FD R++WP+C ++  +RDQ +CGSCWAF A E
Sbjct: 64  GSLDEPDWVKLPTKEFDPNANADPIPEFFDAREQWPNCQSIKLIRDQSTCGSCWAFAATE 123

Query: 419 AMTDRVCTYSNGTKHFHFSAEDLLSCC 499
             +DR+C  SN T     S+EDLL CC
Sbjct: 124 TFSDRICIASNQTLQTSISSEDLLECC 150


>UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4
           precursor; n=5; Caenorhabditis|Rep: Cathepsin B-like
           cysteine proteinase 4 precursor - Caenorhabditis elegans
          Length = 335

 Score = 92.3 bits (219), Expect = 1e-17
 Identities = 36/69 (52%), Positives = 47/69 (68%)
 Frame = +2

Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
           ++P  FD R +WP+C ++N +RDQ  CGSCWAF A EA +DR C  SNG  +   SAED+
Sbjct: 80  TIPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDV 139

Query: 488 LSCCPICDW 514
           LSCC  C +
Sbjct: 140 LSCCSNCGY 148



 Score = 72.1 bits (169), Expect = 2e-11
 Identities = 35/85 (41%), Positives = 42/85 (49%), Gaps = 3/85 (3%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMP-CSGD-TKTPK 687
           GC GG P  AW+Y    G  +GGSY +  GC+PY + PC   V     P C  D   TP 
Sbjct: 149 GCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPA 208

Query: 688 CTKKC-ESGYDVNYKQDKQYGKHVY 759
           C  KC    Y+V Y  DK +G   Y
Sbjct: 209 CVNKCTNKNYNVAYTADKHFGSTAY 233


>UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep:
           Cathepsin B - Pandalus borealis (Northern red shrimp)
          Length = 328

 Score = 91.9 bits (218), Expect = 2e-17
 Identities = 37/79 (46%), Positives = 50/79 (63%)
 Frame = +2

Query: 272 LPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 451
           LP+K  N      +P  FD R++WP CP ++E+RDQG+CGSCWA  A   MTDR C  + 
Sbjct: 65  LPLK--NVTPTKEIPVEFDAREQWPHCPCIDEIRDQGNCGSCWAVSAASVMTDRTCIDTE 122

Query: 452 GTKHFHFSAEDLLSCCPIC 508
           G   F FS+E++ +CC  C
Sbjct: 123 GLVDFRFSSENVAACCTEC 141



 Score = 91.5 bits (217), Expect = 2e-17
 Identities = 36/88 (40%), Positives = 50/88 (56%)
 Frame = +1

Query: 517 CSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCTK 696
           C GG    A+ +W   G VSGG +NS++GC+PY +  CEHH+ G R PC GD     C++
Sbjct: 145 CYGGDEDTAFTHWVTKGFVSGGRHNSNEGCQPYSVEECEHHIEGPRPPCEGDMPELVCSE 204

Query: 697 KCESGYDVNYKQDKQYGKHVYTCPETKT 780
            C   Y   Y++D +YG   Y  P+  T
Sbjct: 205 TCHEEYGKTYEEDLEYGLEAYVLPQDVT 232



 Score = 48.4 bits (110), Expect = 2e-04
 Identities = 23/48 (47%), Positives = 31/48 (64%)
 Frame = +3

Query: 96  LVCVLAAAKDLPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKI 239
           L+ ++AAA     PLSDEF+  +  KQ +WKAGRNF +D S   LK +
Sbjct: 6   LLALVAAASAELDPLSDEFLELLQSKQMTWKAGRNFAKDISKDFLKSL 53


>UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1;
           Rhipicephalus appendiculatus|Rep: Midgut cysteine
           proteinase 1 - Rhipicephalus appendiculatus (Brown ear
           tick)
          Length = 332

 Score = 91.1 bits (216), Expect = 3e-17
 Identities = 34/73 (46%), Positives = 48/73 (65%)
 Frame = +2

Query: 314 PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLS 493
           PE+F PR+ W  C ++  +RDQ +CGSCWAF A E+++DR+C ++NG    + SAEDLL+
Sbjct: 88  PESFTPREYWSHCSSIRVIRDQSACGSCWAFAAAESISDRICIHTNGKVQVNISAEDLLA 147

Query: 494 CCPICDWDAAEEC 532
           CC  C       C
Sbjct: 148 CCHTCGHGCDGRC 160



 Score = 55.2 bits (127), Expect = 2e-06
 Identities = 27/72 (37%), Positives = 38/72 (52%), Gaps = 4/72 (5%)
 Frame = +1

Query: 592 SSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCTKKCESGYDVNYKQDKQYGKHVY---- 759
           +  GC+PY +PPC   VP     C+    TPKC   C  GY+ +Y++DK + K+VY    
Sbjct: 180 TEDGCQPYSLPPC---VPN----CTHPEPTPKCQHVCRKGYEKSYEEDKHFAKNVYRLLK 232

Query: 760 TCPETKTTSARN 795
            C   KT   +N
Sbjct: 233 KCDAIKTDIYKN 244



 Score = 36.3 bits (80), Expect = 0.97
 Identities = 16/28 (57%), Positives = 18/28 (64%)
 Frame = +3

Query: 135 PLSDEFINTINLKQNSWKAGRNFPRDTS 218
           PLS+E IN IN    +WKAGRNF    S
Sbjct: 26  PLSEEMINFINSINTTWKAGRNFDEKRS 53



 Score = 34.7 bits (76), Expect = 3.0
 Identities = 13/22 (59%), Positives = 18/22 (81%)
 Frame = +3

Query: 777 DHIRAELFKNGPVEGAFTVYSD 842
           D I+ +++KNGPVE AF VY+D
Sbjct: 235 DAIKTDIYKNGPVESAFFVYAD 256


>UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep:
           Cathepsin b - Aedes aegypti (Yellowfever mosquito)
          Length = 332

 Score = 90.2 bits (214), Expect = 6e-17
 Identities = 35/79 (44%), Positives = 50/79 (63%)
 Frame = +2

Query: 272 LPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 451
           LP K H+      +PE FD R+KWP C +++ +++QG CG+CWA  AV  M+DR+C +S 
Sbjct: 72  LPTKRHDVAYNMDIPEFFDAREKWPYCKSISTIKNQGLCGACWAVAAVSVMSDRLCIHSE 131

Query: 452 GTKHFHFSAEDLLSCCPIC 508
           G      +AEDL+ CC  C
Sbjct: 132 GKFDVELAAEDLMGCCKDC 150



 Score = 83.4 bits (197), Expect = 6e-15
 Identities = 38/86 (44%), Positives = 50/86 (58%), Gaps = 1/86 (1%)
 Frame = +1

Query: 514 GCSGG-MPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKC 690
           GC+GG +   +++YW   GLVSG +YNS+ GC+PY   PC +   G    C  + KTP C
Sbjct: 153 GCNGGFLDGTSFQYWVDVGLVSGAAYNSTDGCKPYPFKPCLYPFVG----CHPE-KTPSC 207

Query: 691 TKKCESGYDVNYKQDKQYGKHVYTCP 768
           T  C  GYD  Y++DK YG   Y  P
Sbjct: 208 THHCTEGYDGTYRRDKYYGSAAYKLP 233



 Score = 34.7 bits (76), Expect = 3.0
 Identities = 15/28 (53%), Positives = 18/28 (64%)
 Frame = +3

Query: 762 LSGDEDHIRAELFKNGPVEGAFTVYSDL 845
           L  DE  I+ E+  NGPVE  F+VY DL
Sbjct: 232 LPNDERMIQLEIMTNGPVESGFSVYQDL 259


>UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7;
           n=2; Haemonchidae|Rep: Cathepsin B-like cysteine
           protease GCP7 - Haemonchus contortus (Barber pole worm)
          Length = 348

 Score = 90.2 bits (214), Expect = 6e-17
 Identities = 39/85 (45%), Positives = 53/85 (62%)
 Frame = +2

Query: 245 SYRDEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
           SY  E+   +   T N D+    PE+FD R+KW DCP+L  + DQ +CGSCWA  A + M
Sbjct: 78  SYNQENVLPIANITSNDDI----PESFDSREKWKDCPSLRVIPDQSNCGSCWAVSAAQCM 133

Query: 425 TDRVCTYSNGTKHFHFSAEDLLSCC 499
           +DR+C +S G K    SA D+L+CC
Sbjct: 134 SDRLCIHSQGRKKVLLSATDILACC 158



 Score = 62.1 bits (144), Expect = 2e-08
 Identities = 31/91 (34%), Positives = 41/91 (45%), Gaps = 1/91 (1%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPC-SGDTKTPKC 690
           GC GG    AW++    G+V+GG+Y     C+PY  P C  H       C S    TP C
Sbjct: 165 GCDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCGAHKGKAFNNCPSHPYATPAC 224

Query: 691 TKKCESGYDVNYKQDKQYGKHVYTCPETKTT 783
              C+ GY   Y+ DK   +  Y  P  + T
Sbjct: 225 KPYCQYGYGKRYENDKIKARTWYWLPNDERT 255


>UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115,
           whole genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_115,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 332

 Score = 89.8 bits (213), Expect = 7e-17
 Identities = 38/87 (43%), Positives = 53/87 (60%), Gaps = 1/87 (1%)
 Frame = +2

Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454
           P++    + + +LP +F  ++KWP CP++  + DQG+CGSCWA  A   M+DR+C  S  
Sbjct: 59  PVEYKYHEKLENLPPSFSAQEKWPGCPSIELIPDQGNCGSCWAVSAASTMSDRLCIASGQ 118

Query: 455 TKHFHFSAEDLLSCCPI-CDWDAAEEC 532
           T     SAEDLLSCC I C+ D    C
Sbjct: 119 TDKRQISAEDLLSCCGINCELDGNGGC 145



 Score = 72.9 bits (171), Expect = 9e-12
 Identities = 34/81 (41%), Positives = 42/81 (51%), Gaps = 6/81 (7%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEH-HVPGNRMPCSGD-----T 675
           GC GG P  AW+Y +  G+V+GG+YN    C+PY  PPC H +  G    C  D      
Sbjct: 144 GCDGGYPYGAWKYLRVDGIVTGGTYNDFSLCKPYSFPPCSHGNDSGKYSKCENDFFMLTE 203

Query: 676 KTPKCTKKCESGYDVNYKQDK 738
            TP CTKKC   +   Y  DK
Sbjct: 204 VTPSCTKKCHPQFSRTYDVDK 224


>UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8.4;
           n=1; Caenorhabditis elegans|Rep: Putative
           uncharacterized protein W07B8.4 - Caenorhabditis elegans
          Length = 335

 Score = 88.6 bits (210), Expect = 2e-16
 Identities = 34/64 (53%), Positives = 46/64 (71%)
 Frame = +2

Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
           S+P+++D RD WP C ++N +RDQ  CGSCWA  A EA++DR C  SNG  +   SAED+
Sbjct: 72  SIPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGDVNTLLSAEDI 131

Query: 488 LSCC 499
           L+CC
Sbjct: 132 LTCC 135



 Score = 81.8 bits (193), Expect = 2e-14
 Identities = 38/86 (44%), Positives = 46/86 (53%), Gaps = 4/86 (4%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMP-CSGD-TKTPK 687
           GC GG P  AW YW   GLV+GGS+ S  GC+PY I PC   + G   P C    + TPK
Sbjct: 144 GCEGGYPIQAWRYWVKNGLVTGGSFESQYGCKPYSIAPCGETIDGVTWPECPMKISDTPK 203

Query: 688 CTKKC--ESGYDVNYKQDKQYGKHVY 759
           C   C   + Y + Y QDK +G   Y
Sbjct: 204 CEHHCTGNNSYPIPYDQDKHFGASAY 229


>UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep:
           Cathepsin B - Uronema marinum
          Length = 350

 Score = 88.2 bits (209), Expect = 2e-16
 Identities = 36/64 (56%), Positives = 47/64 (73%)
 Frame = +2

Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
           SLPE+FD R+ +P C +L +VRDQ +CGSCWAFG VEA++DR+C  S        S+E+L
Sbjct: 85  SLPESFDLREAYPKCESLQQVRDQSNCGSCWAFGTVEAISDRICIASGQKDQTRISSENL 144

Query: 488 LSCC 499
           LSCC
Sbjct: 145 LSCC 148



 Score = 80.2 bits (189), Expect = 6e-14
 Identities = 40/97 (41%), Positives = 51/97 (52%), Gaps = 8/97 (8%)
 Frame = +1

Query: 511 LGCSGGMPRLAWEYWKHFGLVSGGSY-----NSSQGCRPYEIPPCEHHVPGNRMPCSG-- 669
           +GC+GG    AW Y+   GLVSG  Y     NS   C+PY  PPC HHV G    C+   
Sbjct: 156 MGCNGGYTAGAWNYYVKTGLVSGNLYTDDNQNSKTECQPYSFPPCSHHVQGEYQACTDLP 215

Query: 670 DTKTPKCTKKCESGYDVN-YKQDKQYGKHVYTCPETK 777
              TPKC  +C S Y  N Y+QD   G   Y+ P+++
Sbjct: 216 QFNTPKCYTECNSQYTQNSYEQDLHKGVSSYSVPKSE 252


>UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4;
           Ancylostomatidae|Rep: Cysteine proteinase - Ancylostoma
           ceylanicum
          Length = 348

 Score = 87.8 bits (208), Expect = 3e-16
 Identities = 39/88 (44%), Positives = 53/88 (60%), Gaps = 6/88 (6%)
 Frame = +2

Query: 254 DEHFATLPIKTH------NFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 415
           D  FA  P KT       N ++   +P+ FD RD+WP+C ++  +RDQ SCGSCWA  A 
Sbjct: 69  DVKFAVDPEKTEPNYVLANTEMKVDIPDTFDARDRWPNCTSMKHIRDQSSCGSCWAVAAA 128

Query: 416 EAMTDRVCTYSNGTKHFHFSAEDLLSCC 499
            AM+DRVC  +NG  +   S  ++LSCC
Sbjct: 129 SAMSDRVCALTNGRINRILSDTEVLSCC 156



 Score = 66.1 bits (154), Expect = 1e-09
 Identities = 29/84 (34%), Positives = 42/84 (50%), Gaps = 2/84 (2%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRM-PCSGDT-KTPK 687
           GC GG P  A+ Y   +GL +GG Y     C+PY   PC +H       PC  +   TP 
Sbjct: 163 GCKGGYPARAFGYAWRYGLSTGGPYGEKDACQPYAFYPCGNHAHEPYYGPCPDELWPTPT 222

Query: 688 CTKKCESGYDVNYKQDKQYGKHVY 759
           C + C+ GY + +++DK +    Y
Sbjct: 223 CRRTCQLGYPIPFEKDKIFNDQTY 246


>UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8;
           Trypanosoma|Rep: Cathepsin B-like cysteine protease -
           Trypanosoma brucei
          Length = 340

 Score = 87.4 bits (207), Expect = 4e-16
 Identities = 35/68 (51%), Positives = 45/68 (66%)
 Frame = +2

Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
           A LP +FD  + WP+CPT+ ++ DQ +CGSCWA  A  AM+DR CT   G +  H SA D
Sbjct: 92  APLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCT-MGGVQDVHISAGD 150

Query: 485 LLSCCPIC 508
           LL+CC  C
Sbjct: 151 LLACCSDC 158



 Score = 50.4 bits (115), Expect = 6e-05
 Identities = 32/82 (39%), Positives = 38/82 (46%), Gaps = 5/82 (6%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNR--MPCSG-DTKTP 684
           GC+GG P  AW Y+   GLVS   Y     C+PY  P C HH        PCS  +  TP
Sbjct: 161 GCNGGDPDRAWAYFSSTGLVS--DY-----CQPYPFPHCSHHSKSKNGYPPCSQFNFDTP 213

Query: 685 KCTKKCESGY--DVNYKQDKQY 744
           KC   C+      VNY+    Y
Sbjct: 214 KCNYTCDDPTIPVVNYRSWTSY 235


>UniRef50_Q625X3 Cluster: Putative uncharacterized protein CBG01102;
           n=1; Caenorhabditis briggsae|Rep: Putative
           uncharacterized protein CBG01102 - Caenorhabditis
           briggsae
          Length = 374

 Score = 86.6 bits (205), Expect = 7e-16
 Identities = 39/86 (45%), Positives = 50/86 (58%), Gaps = 2/86 (2%)
 Frame = +1

Query: 517 CSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMP-CSGDT-KTPKC 690
           C+GG    AW+YW+  GL +GGSY S  GC+PY I PC+  +     P C   T +TP C
Sbjct: 189 CAGGNVFKAWQYWQKHGLPTGGSYESQFGCKPYSISPCDTVIGNITFPGCLNSTVQTPSC 248

Query: 691 TKKCESGYDVNYKQDKQYGKHVYTCP 768
            KKC+SGY V   +D+ YG  V   P
Sbjct: 249 EKKCKSGYPVELDKDRHYGVSVDQLP 274



 Score = 68.9 bits (161), Expect = 1e-10
 Identities = 27/59 (45%), Positives = 39/59 (66%)
 Frame = +2

Query: 323 FDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC 499
           FD R++WP+C ++  + D   C S WAF A E+M+DR+C  S G  +   SA++LLSCC
Sbjct: 85  FDARERWPECSSIPIINDISDCKSSWAFSAAESMSDRLCINSGGMINTVLSAQELLSCC 143


>UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 421

 Score = 86.2 bits (204), Expect = 9e-16
 Identities = 32/68 (47%), Positives = 46/68 (67%)
 Frame = +2

Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
           + +P+NFD R KWP+CP+++ V +QG CGSC+A  A    +DR C +SNGT     S ED
Sbjct: 136 SDVPKNFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSNGTFKSLLSEED 195

Query: 485 LLSCCPIC 508
           ++ CC +C
Sbjct: 196 IIGCCSVC 203



 Score = 46.8 bits (106), Expect = 7e-04
 Identities = 31/108 (28%), Positives = 47/108 (43%), Gaps = 2/108 (1%)
 Frame = +1

Query: 517 CSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCTK 696
           C GG P  A  YW + GLV+GG      GCRPY        VP +        +   C K
Sbjct: 206 CYGGDPLKALTYWVNQGLVTGG----RDGCRPYSF-DLSCGVPCSPATFFEAEEKRTCMK 260

Query: 697 KCES-GYDVNYKQDKQYGKHVYTC-PETKTTSARNCSRMVPSKVLSQY 834
           +C++  Y   Y++DK +    Y+  P + T S     R+    ++  +
Sbjct: 261 RCQNIYYQQKYEEDKHFATFAYSMYPRSMTVSPDGKERVKVPTIIGHF 308


>UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8;
           Leishmania|Rep: Cathepsin B-like protease - Leishmania
           major
          Length = 340

 Score = 85.8 bits (203), Expect = 1e-15
 Identities = 37/71 (52%), Positives = 47/71 (66%)
 Frame = +2

Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFS 475
           +L   LPE FD  + WP C T++E+RDQ +CGSCWA  AVEA++DR CT+  G      S
Sbjct: 93  ELQQDLPEFFDAAEHWPMCLTISEIRDQSNCGSCWAIAAVEAISDRYCTF-GGVPDRRMS 151

Query: 476 AEDLLSCCPIC 508
             +LLSCC IC
Sbjct: 152 TSNLLSCCFIC 162



 Score = 55.6 bits (128), Expect = 1e-06
 Identities = 30/82 (36%), Positives = 40/82 (48%), Gaps = 4/82 (4%)
 Frame = +1

Query: 511 LGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDT--KTP 684
           LGC GG+P +AW +W   G+       +++ C+PY   PC HH    + P    T   TP
Sbjct: 164 LGCHGGIPTVAWLWWVWVGI-------ATEDCQPYPFDPCSHHGNSEKYPPCPSTIYDTP 216

Query: 685 KCTKKCE-SGYD-VNYKQDKQY 744
           KC   CE +  D V YK    Y
Sbjct: 217 KCNTTCERNEMDLVKYKGSTSY 238


>UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2
           precursor; n=8; Haemonchus contortus|Rep: Cathepsin
           B-like cysteine proteinase 2 precursor - Haemonchus
           contortus (Barber pole worm)
          Length = 342

 Score = 85.4 bits (202), Expect = 2e-15
 Identities = 40/85 (47%), Positives = 48/85 (56%), Gaps = 3/85 (3%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRM---PCSGDTKTP 684
           GC GG P  AW+Y+ + G+VSGG Y +   CRPY I PC HH  GN      C G   TP
Sbjct: 155 GCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHH--GNDTYYGECRGTAPTP 212

Query: 685 KCTKKCESGYDVNYKQDKQYGKHVY 759
            C +KC  G    Y+ DK+YGK  Y
Sbjct: 213 PCKRKCRPGVRKMYRIDKRYGKDAY 237



 Score = 75.8 bits (178), Expect = 1e-12
 Identities = 30/67 (44%), Positives = 43/67 (64%), Gaps = 1/67 (1%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           +P ++DPRD W +C T   +RDQ +CGSCWA     A++DR+C  S   K  + SA D++
Sbjct: 87  IPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIM 145

Query: 491 SCC-PIC 508
           +CC P C
Sbjct: 146 TCCRPQC 152


>UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator
           americanus|Rep: Cysteine proteinase 4 - Necator
           americanus (Human hookworm)
          Length = 339

 Score = 84.2 bits (199), Expect = 4e-15
 Identities = 32/68 (47%), Positives = 44/68 (64%)
 Frame = +2

Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFS 475
           +L   LPE FD R+KWP C ++  +RD  +CGSCWA  A   M+DR+C  +NGT     S
Sbjct: 83  NLNVELPERFDAREKWPHCASIGLIRDHSACGSCWAVSAASVMSDRLCIQTNGTNQKILS 142

Query: 476 AEDLLSCC 499
           + D+L+CC
Sbjct: 143 SADILACC 150



 Score = 72.9 bits (171), Expect = 9e-12
 Identities = 35/82 (42%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPC--SGDTKTPK 687
           GC GG P  A+ Y ++ G+ SGG Y     C+PY   PC+    GN  PC   G   TPK
Sbjct: 157 GCEGGYPIQAYFYLENTGVCSGGEYREKNVCKPYPFYPCD----GNYGPCPKEGAFDTPK 212

Query: 688 CTKKCESGYDVNYKQDKQYGKH 753
           C K C+  Y V Y++DK +GK+
Sbjct: 213 CRKICQFRYPVPYEEDKVFGKN 234


>UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8.1;
           n=1; Caenorhabditis elegans|Rep: Putative
           uncharacterized protein W07B8.1 - Caenorhabditis elegans
          Length = 335

 Score = 83.4 bits (197), Expect = 6e-15
 Identities = 40/92 (43%), Positives = 52/92 (56%), Gaps = 4/92 (4%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMP-CSGDTK-TPK 687
           GC GG P  AW+Y +  G+ +GGSY S  GC+PY IPPC   V     P C+  T  TP 
Sbjct: 147 GCEGGNPFKAWQYIQKHGIPTGGSYESQFGCKPYSIPPCGKTVGNVTYPACTNTTSPTPS 206

Query: 688 CTKKCES--GYDVNYKQDKQYGKHVYTCPETK 777
           C KKC S  GY ++  +D+ YG  V   P ++
Sbjct: 207 CEKKCTSRIGYPIDIDKDRHYGVSVDQLPNSQ 238



 Score = 78.2 bits (184), Expect = 2e-13
 Identities = 34/81 (41%), Positives = 51/81 (62%), Gaps = 3/81 (3%)
 Frame = +2

Query: 266 ATLPIKTHNFDLI---ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 436
           AT+  K  NF +    + L  +FD R++WP+C ++ ++ D   C + WAF A E+M+DR+
Sbjct: 58  ATIGFKIQNFGVSQANSDLSPSFDARERWPECMSIPQINDISECKTSWAFAAAESMSDRL 117

Query: 437 CTYSNGTKHFHFSAEDLLSCC 499
           C  S G K+   SAE+LLSCC
Sbjct: 118 CINSGGFKNTILSAEELLSCC 138


>UniRef50_Q5DEC9 Cluster: SJCHGC06356 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC06356 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 279

 Score = 83.0 bits (196), Expect = 9e-15
 Identities = 32/83 (38%), Positives = 48/83 (57%), Gaps = 1/83 (1%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDT-KTPKC 690
           GC  G       YW  +G+V+GGSY    GC+PY +P C +H     + C+ +T + P+C
Sbjct: 94  GCFHGSEVEVLVYWITYGIVTGGSYEDQSGCQPYPLPKCSYHPESRFLDCNNNTFEFPQC 153

Query: 691 TKKCESGYDVNYKQDKQYGKHVY 759
           T +C+ GY+  Y  DK YG+ +Y
Sbjct: 154 TNECQDGYNKTYDDDKFYGERIY 176



 Score = 65.3 bits (152), Expect = 2e-09
 Identities = 29/81 (35%), Positives = 46/81 (56%), Gaps = 1/81 (1%)
 Frame = +2

Query: 257 EHFATLPIKTHNFDLI-ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 433
           E+  T  IKT + + I   +P +FD R  W +C T+ ++ D+  C + WA   V++++DR
Sbjct: 9   ENIQTKHIKTISHNSINMEIPRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSISDR 68

Query: 434 VCTYSNGTKHFHFSAEDLLSC 496
           +C  SNG      SA D +SC
Sbjct: 69  ICIRSNGRISVQLSARDAISC 89


>UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1
           precursor; n=3; Haemonchidae|Rep: Cathepsin B-like
           cysteine proteinase 1 precursor - Ostertagia ostertagi
          Length = 341

 Score = 83.0 bits (196), Expect = 9e-15
 Identities = 31/66 (46%), Positives = 45/66 (68%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           +PE++DPR +W +C +L  + DQ +CGSCWA  +  AM+DR+C  S G K    SA+D++
Sbjct: 91  IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 150

Query: 491 SCCPIC 508
           SCC  C
Sbjct: 151 SCCTWC 156



 Score = 78.6 bits (185), Expect = 2e-13
 Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 3/82 (3%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRM---PCSGDTKTP 684
           GC GG P  A+ +    G+V+GG YN+   CRPYEI PC HH  GN      C G   TP
Sbjct: 159 GCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPYEIHPCGHH--GNETYYGECVGMADTP 216

Query: 685 KCTKKCESGYDVNYKQDKQYGK 750
           +C ++C  GY  +Y  D+ Y K
Sbjct: 217 RCKRRCLLGYPKSYPSDRYYKK 238


>UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core
           eudicotyledons|Rep: At1g02300/T6A9_10 - Arabidopsis
           thaliana (Mouse-ear cress)
          Length = 362

 Score = 81.4 bits (192), Expect = 3e-14
 Identities = 36/79 (45%), Positives = 48/79 (60%)
 Frame = +2

Query: 263 FATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCT 442
           F  +PI +H+  L   LP+ FD R  W  C ++  + DQG CGSCWAFGAVE+++DR C 
Sbjct: 92  FLGVPIVSHDISL--KLPKEFDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCI 149

Query: 443 YSNGTKHFHFSAEDLLSCC 499
             N   +   S  DLL+CC
Sbjct: 150 KYN--MNVSLSVNDLLACC 166



 Score = 55.6 bits (128), Expect = 1e-06
 Identities = 32/83 (38%), Positives = 43/83 (51%), Gaps = 1/83 (1%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVPGNRMPCSGDTKTPKC 690
           GC+GG P  AW Y+KH G+V       ++ C PY +   C H  PG    C     TPKC
Sbjct: 173 GCNGGYPIAAWRYFKHHGVV-------TEECDPYFDNTGCSH--PG----CEPAYPTPKC 219

Query: 691 TKKCESGYDVNYKQDKQYGKHVY 759
            +KC SG  + +++ K YG   Y
Sbjct: 220 ARKCVSGNQL-WRESKHYGVSAY 241



 Score = 37.5 bits (83), Expect = 0.42
 Identities = 16/22 (72%), Positives = 18/22 (81%)
 Frame = +3

Query: 777 DHIRAELFKNGPVEGAFTVYSD 842
           D I AE++KNGPVE AFTVY D
Sbjct: 248 DDIMAEVYKNGPVEVAFTVYED 269


>UniRef50_Q00W63 Cluster: Cysteine proteinase; n=2;
           Ostreococcus|Rep: Cysteine proteinase - Ostreococcus
           tauri
          Length = 362

 Score = 81.0 bits (191), Expect = 3e-14
 Identities = 35/63 (55%), Positives = 43/63 (68%), Gaps = 1/63 (1%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
           LP+ FD R+KWP C  L +E  DQG+CGSCWA    +AMTDR+C  +NG  + H SA  L
Sbjct: 88  LPDTFDVREKWPKCAALVSEAVDQGACGSCWAVAPAKAMTDRLCIATNGAVNTHVSAIQL 147

Query: 488 LSC 496
           LSC
Sbjct: 148 LSC 150



 Score = 43.2 bits (97), Expect = 0.008
 Identities = 18/41 (43%), Positives = 20/41 (48%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEH 636
           GC GG P  A+E     G+VSGG       C PY   PC H
Sbjct: 170 GCMGGYPTEAYETAHRVGVVSGGLNGDQDTCMPYPFAPCHH 210


>UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC02853 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 181

 Score = 80.6 bits (190), Expect = 5e-14
 Identities = 34/65 (52%), Positives = 45/65 (69%)
 Frame = +2

Query: 254 DEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 433
           D+H    PI  HN D+   LP+ FD R  W +C ++  +RDQ SCGSCWAFGAVE+M+DR
Sbjct: 64  DQHKLHHPIIHHN-DINIKLPKYFDSRKYWKNCSSIRTIRDQSSCGSCWAFGAVESMSDR 122

Query: 434 VCTYS 448
           +C +S
Sbjct: 123 ICIHS 127



 Score = 37.1 bits (82), Expect = 0.55
 Identities = 21/40 (52%), Positives = 24/40 (60%), Gaps = 1/40 (2%)
 Frame = +3

Query: 135 PLSDEFINTINLKQN-SWKAGRNFPRDTSFAHLKKIMGVI 251
           PLSDE I  IN + N  WKA R   R TS  H K +MGV+
Sbjct: 21  PLSDELITFINKQPNIEWKADRT-KRFTSIHHAKSMMGVL 59


>UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2;
           Arthropoda|Rep: Cathepsin B-like cysteine protease -
           Callosobruchus maculatus (Southern cowpea weevil) (Pulse
           bruchid)
          Length = 330

 Score = 80.2 bits (189), Expect = 6e-14
 Identities = 29/66 (43%), Positives = 39/66 (59%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           LPE FD R +W  C ++ E+RDQ  CGSCWA  +   M+DR+C  S+       SA D++
Sbjct: 81  LPEEFDARKQWSKCESIKEIRDQSGCGSCWAVSSASVMSDRICIQSDQKNQLRISAADMI 140

Query: 491 SCCPIC 508
            CC  C
Sbjct: 141 ECCESC 146



 Score = 73.7 bits (173), Expect = 5e-12
 Identities = 33/82 (40%), Positives = 42/82 (51%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693
           GC GG+P   +  WK  G VSGG YNS+ GC  Y +P C    P     C      P C 
Sbjct: 152 GCHGGIPSFTFTEWKDSGFVSGGEYNSTNGCMSYPLPRCN---PS----CKTLYDAPTCK 204

Query: 694 KKCESGYDVNYKQDKQYGKHVY 759
           K+C+ G  + Y++DK Y K  Y
Sbjct: 205 KECDKGSPLKYEEDKHYAKQAY 226



 Score = 49.6 bits (113), Expect = 1e-04
 Identities = 24/63 (38%), Positives = 38/63 (60%), Gaps = 2/63 (3%)
 Frame = +3

Query: 78  RAAYVTLVCVLAAAKDLPHP--LSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVI 251
           + A++ L  V++     P    LSDE+I  +N K   WKAGRNF RDTS  ++++++ V 
Sbjct: 2   KLAFIALAAVVSCTFAQPELDFLSDEYIEQLNSKNLPWKAGRNFERDTSLYNIQRLLSVG 61

Query: 252 EMN 260
            +N
Sbjct: 62  TIN 64


>UniRef50_P91991 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 356

 Score = 80.2 bits (189), Expect = 6e-14
 Identities = 35/73 (47%), Positives = 45/73 (61%)
 Frame = +2

Query: 281 KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460
           KT N +++  +P +FD R KWP C  +  VRDQ  CGS     AVE  +DR C  SNGT 
Sbjct: 82  KTGNDNVLVDIPSSFDSRQKWPSCSQIGAVRDQSDCGSAAHLVAVEIASDRTCIASNGTF 141

Query: 461 HFHFSAEDLLSCC 499
           ++  SA+D LSCC
Sbjct: 142 NWPLSAQDPLSCC 154



 Score = 73.3 bits (172), Expect = 7e-12
 Identities = 35/93 (37%), Positives = 49/93 (52%), Gaps = 4/93 (4%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPG--NRMPCSGDTKTPK 687
           GC G  P+   ++W+  GL +GG+YN   GC+PY I PC+         +PC G   TP 
Sbjct: 166 GCDGSWPKDILKWWQTHGLCTGGNYNDQFGCKPYSIYPCDKKYANGTTSVPCPG-YHTPT 224

Query: 688 CTKKCESG--YDVNYKQDKQYGKHVYTCPETKT 780
           C + C S   + + YKQDK +GK  Y   +  T
Sbjct: 225 CEEHCTSNITWPIAYKQDKHFGKAHYNVGKKMT 257


>UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin
           B-like cysteine proteinase 4 precursor (Cysteine
           protease-related 4); n=2; Tribolium castaneum|Rep:
           PREDICTED: similar to Cathepsin B-like cysteine
           proteinase 4 precursor (Cysteine protease-related 4) -
           Tribolium castaneum
          Length = 360

 Score = 79.4 bits (187), Expect = 1e-13
 Identities = 31/67 (46%), Positives = 41/67 (61%), Gaps = 1/67 (1%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
           +PE FD R+ WP+C  +   +R+QG C S WAF A E M+DR+C  +NG      S EDL
Sbjct: 72  IPETFDAREYWPECADIIGNIRNQGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPEDL 131

Query: 488 LSCCPIC 508
           + CC  C
Sbjct: 132 IDCCHYC 138



 Score = 58.8 bits (136), Expect = 2e-07
 Identities = 32/89 (35%), Positives = 44/89 (49%), Gaps = 1/89 (1%)
 Frame = +1

Query: 517 CSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCTK 696
           C GG    AW Y+   GLVSGG YN+S GC+PY        +   R+       TP C  
Sbjct: 142 CKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQPYS------ELNYYRI-------TPPCNT 188

Query: 697 KCESG-YDVNYKQDKQYGKHVYTCPETKT 780
            C++  Y + Y  DK +G  +Y  P+ +T
Sbjct: 189 TCQNDKYPIPYVSDKHFGDSIYYIPQNET 217


>UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep:
           Cathepsin B - Triticum aestivum (Wheat)
          Length = 353

 Score = 77.8 bits (183), Expect = 3e-13
 Identities = 36/78 (46%), Positives = 45/78 (57%)
 Frame = +2

Query: 266 ATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTY 445
           A +PIK H       LP+ FD R +W  C T+  + DQG CG+CWAF AVEA+ DR C +
Sbjct: 85  AGVPIKIHPE---MDLPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVEALQDRFCIH 141

Query: 446 SNGTKHFHFSAEDLLSCC 499
            N       S  DLL+CC
Sbjct: 142 LN--MSVSLSVNDLLACC 157



 Score = 45.6 bits (103), Expect = 0.002
 Identities = 32/111 (28%), Positives = 49/111 (44%), Gaps = 1/111 (0%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVPGNRMPCSGDTKTPKC 690
           GC+GG P  AW Y++  G+V       ++ C PY +   C+H  PG    C     TPKC
Sbjct: 164 GCNGGYPISAWRYFRRSGVV-------TEECDPYFDQTGCQH--PG----CEPAYPTPKC 210

Query: 691 TKKCESGYDVNYKQDKQYGKHVYTCPETKTTSARNCSRMVPSKVLSQYIQI 843
            +KC+      +K++K +  + Y              +  P +V   Y QI
Sbjct: 211 QRKCKVENQA-WKENKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTYCQI 260


>UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|Rep:
           Cathepsin B - Streblomastix strix
          Length = 312

 Score = 77.4 bits (182), Expect = 4e-13
 Identities = 30/69 (43%), Positives = 41/69 (59%)
 Frame = +2

Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481
           +A+LP+ FD R  WP+C  + ++ DQG CGSCWA  + E + DR C  S G +    S +
Sbjct: 73  VANLPDEFDSRTNWPNCQLIGKIYDQGHCGSCWAMSSFEVLQDRFCIKSEGKQTPELSPQ 132

Query: 482 DLLSCCPIC 508
            L SC P C
Sbjct: 133 HLTSCTPGC 141


>UniRef50_Q237A1 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 346

 Score = 77.0 bits (181), Expect = 6e-13
 Identities = 34/67 (50%), Positives = 45/67 (67%), Gaps = 1/67 (1%)
 Frame = +2

Query: 311 LPENFDPRDKWPD-CPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
           LPE FD R +W D C +L EVRDQ +CGSCWAFGA E+++DR C +    +    S ++L
Sbjct: 93  LPEEFDARVQWGDKCSSLWEVRDQSTCGSCWAFGAAESLSDRHCIHLG--QDIRLSTQNL 150

Query: 488 LSCCPIC 508
           L+CC  C
Sbjct: 151 LTCCAAC 157



 Score = 72.1 bits (169), Expect = 2e-11
 Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 3/85 (3%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGN-RMPCSGDTKTPKC 690
           GC GG P  A +Y+ + GLV+G  Y ++  C+ Y   PC HHV  +   PC+G+  TP C
Sbjct: 160 GCDGGWPEAAMDYYVNTGLVTGDLYGNNSWCQAYTFAPCAHHVTSDIYPPCTGELPTPPC 219

Query: 691 TKKCESG--YDVNYKQDKQYGKHVY 759
              C+S   + + Y +D   G   Y
Sbjct: 220 INSCDSNSTHTIPYSKDIHRGSKAY 244



 Score = 36.7 bits (81), Expect = 0.73
 Identities = 15/27 (55%), Positives = 20/27 (74%)
 Frame = +3

Query: 762 LSGDEDHIRAELFKNGPVEGAFTVYSD 842
           ++ DE  I AE++KNGP+E A TVY D
Sbjct: 246 IAKDEKAIMAEIYKNGPIEVALTVYED 272


>UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus
           contortus|Rep: Cysteine proteinase - Haemonchus
           contortus (Barber pole worm)
          Length = 350

 Score = 76.6 bits (180), Expect = 7e-13
 Identities = 36/84 (42%), Positives = 44/84 (52%), Gaps = 2/84 (2%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGD--TKTPK 687
           GC GG   LAWE+ + FG+V+GG Y     CRPY   PC  H  G R  C  D    TP 
Sbjct: 163 GCEGGYDHLAWEWVQRFGVVTGGPYQQKGVCRPYAFHPCGLH-HGRRYDCPWDHSFSTPA 221

Query: 688 CTKKCESGYDVNYKQDKQYGKHVY 759
           C   C+ GY   Y++DK + K  Y
Sbjct: 222 CKPYCQFGYGKRYEKDKFFVKSTY 245



 Score = 73.7 bits (173), Expect = 5e-12
 Identities = 29/63 (46%), Positives = 38/63 (60%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           +PE+FD R  W +C ++  VRDQ  CGSCWA  A   M+DR+C  + G      S  D+L
Sbjct: 94  IPESFDSRIVWKNCSSITYVRDQSRCGSCWAVSAASTMSDRICVQTKGKLQTILSDTDIL 153

Query: 491 SCC 499
           SCC
Sbjct: 154 SCC 156



 Score = 35.1 bits (77), Expect = 2.2
 Identities = 15/32 (46%), Positives = 19/32 (59%)
 Frame = +3

Query: 747 KTCIYLSGDEDHIRAELFKNGPVEGAFTVYSD 842
           K+   L  DE  I+ E+ KNGPV+ AF  Y D
Sbjct: 242 KSTYILDNDEKVIQREMMKNGPVQAAFITYED 273


>UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep:
           Thiol protease - Trichuris suis
          Length = 348

 Score = 73.3 bits (172), Expect = 7e-12
 Identities = 33/67 (49%), Positives = 41/67 (61%)
 Frame = +2

Query: 299 LIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSA 478
           L  S+P +FD R  W  C +LN +RDQ  CGSCWA  A E M+DR+C  SN +     S 
Sbjct: 80  LALSIPPSFDVRSLWHVC-SLNLIRDQAKCGSCWAVSAAETMSDRICVQSNCSIKACISD 138

Query: 479 EDLLSCC 499
            D+LSCC
Sbjct: 139 TDILSCC 145



 Score = 63.7 bits (148), Expect = 6e-09
 Identities = 36/115 (31%), Positives = 51/115 (44%), Gaps = 11/115 (9%)
 Frame = +1

Query: 502 YL*LGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYE-IPPCEHHVPGNRM-PCSGDT 675
           Y   GC+GG P  AW ++   G  +GG      GC+PY+   P   H+  N   PC  DT
Sbjct: 148 YCGYGCNGGFPIEAWRHFTVAGNCTGGKTIDKYGCKPYKPTGPIGRHLKRNDYAPCPNDT 207

Query: 676 ---------KTPKCTKKCESGYDVNYKQDKQYGKHVYTCPETKTTSARNCSRMVP 813
                     TP+C ++C  GY  +Y  D+ YGK  Y   ++     R   +  P
Sbjct: 208 YYGECVGMADTPRCKRRCLLGYPKSYPSDRYYGKSAYIVKQSVKAIQREIMKNGP 262


>UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15;
           Aphidoidea|Rep: Cathepsin B-S precursor - Tuberaphis
           styraci
          Length = 349

 Score = 72.1 bits (169), Expect = 2e-11
 Identities = 27/65 (41%), Positives = 37/65 (56%)
 Frame = +2

Query: 314 PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLS 493
           P+ FD R+ W  C  +  +RDQG+CGSCW+F    A  DR+C  + G  +   S E+L  
Sbjct: 86  PKQFDSRENWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEELAF 145

Query: 494 CCPIC 508
           CC  C
Sbjct: 146 CCMDC 150



 Score = 63.3 bits (147), Expect = 7e-09
 Identities = 29/89 (32%), Positives = 44/89 (49%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693
           GC GG P  AW+Y++  G+ +GG Y++ +GC PY++PPC      N        +  +C 
Sbjct: 153 GCGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCYDEQGKNTCGGKPMERNHQCP 212

Query: 694 KKCESGYDVNYKQDKQYGKHVYTCPETKT 780
           K C   Y     QD+   K+ Y     +T
Sbjct: 213 KTC---YGKTTVQDRYKTKNEYVINSIET 238



 Score = 39.5 bits (88), Expect = 0.10
 Identities = 23/59 (38%), Positives = 32/59 (54%), Gaps = 4/59 (6%)
 Frame = +3

Query: 81  AAYVTLVCVLAAAKDLPHP----LSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMG 245
           A +VT+VC +  +  L  P    LSDE I  IN    +WKA R FP +TS  +   ++G
Sbjct: 2   AKFVTIVCAIFVSVYLAEPTLQFLSDERIKYINEVAKTWKAERYFPANTSEEYFIGLLG 60


>UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1;
           Ostreococcus tauri|Rep: Cysteine proteinase Cathepsin F
           - Ostreococcus tauri
          Length = 498

 Score = 71.7 bits (168), Expect = 2e-11
 Identities = 33/64 (51%), Positives = 39/64 (60%), Gaps = 1/64 (1%)
 Frame = +2

Query: 308 SLPENFDPRDKWPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
           SLP +FD RD++P C  L   VRDQG CGSCWA  A E M DR+C  S G +    S + 
Sbjct: 256 SLPRHFDARDEYPKCARLIGTVRDQGKCGSCWAVAATEIMNDRLCISSGGKEVAELSPQF 315

Query: 485 LLSC 496
            LSC
Sbjct: 316 ALSC 319


>UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep:
           Cysteine proteinase - Toxoplasma gondii
          Length = 569

 Score = 70.9 bits (166), Expect = 4e-11
 Identities = 32/78 (41%), Positives = 43/78 (55%), Gaps = 2/78 (2%)
 Frame = +2

Query: 272 LPIKTHNFDLIAS-LPENFDPRDKWPDCP-TLNEVRDQGSCGSCWAFGAVEAMTDRVCTY 445
           +P+    F+     +P +FD R  +P C   +  VRDQG CGSCWAF + EA  DR+C  
Sbjct: 260 MPLPAKEFENATEPVPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIR 319

Query: 446 SNGTKHFHFSAEDLLSCC 499
           S G +    SA+   SCC
Sbjct: 320 SQGKRLMPLSAQHTTSCC 337



 Score = 64.1 bits (149), Expect = 4e-09
 Identities = 29/70 (41%), Positives = 41/70 (58%), Gaps = 6/70 (8%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNS-SQG--CRPYEIPPCEHHVPGNRMPCSG---DT 675
           GC+GG P +AW +++  G+V+GG +++  +G  C PYE+P C HH       C       
Sbjct: 346 GCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAHHAKAPFPDCDATLVPR 405

Query: 676 KTPKCTKKCE 705
           KTPKC K CE
Sbjct: 406 KTPKCRKDCE 415


>UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|Rep:
           Cysteine protease - Giardia muris
          Length = 301

 Score = 69.3 bits (162), Expect = 1e-10
 Identities = 35/84 (41%), Positives = 47/84 (55%), Gaps = 4/84 (4%)
 Frame = +2

Query: 257 EHFATLPIKTH----NFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
           E+  +L  +TH    N      LP+++DPR +   C  L EV DQ SCGSCWAF AV   
Sbjct: 55  ENLRSLRTETHVSQLNLGKTKELPKDYDPRVERAHC--LPEVADQASCGSCWAFSAVATF 112

Query: 425 TDRVCTYSNGTKHFHFSAEDLLSC 496
            DR C Y   +K  H+S + ++SC
Sbjct: 113 ADRRCAYGLDSKQVHYSEQYVVSC 136


>UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 311

 Score = 68.1 bits (159), Expect = 3e-10
 Identities = 28/63 (44%), Positives = 41/63 (65%)
 Frame = +2

Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
           ++PENFD R +WP   +++ +R+QG CGSCWAFGA E ++DR    S    +   SA+ L
Sbjct: 82  NIPENFDARKQWPG--SIHPIRNQGQCGSCWAFGASEVLSDRFAIASKNQIYVTLSAQQL 139

Query: 488 LSC 496
           + C
Sbjct: 140 VDC 142


>UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella
           histriomuscorum|Rep: Cathepsin B - Oxytricha trifallax
           (Sterkiella histriomuscorum)
          Length = 294

 Score = 65.3 bits (152), Expect = 2e-09
 Identities = 33/65 (50%), Positives = 41/65 (63%)
 Frame = +2

Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481
           I ++PENFD R +W     ++ +RDQ  CGSCWAFGA EA +DR     NG K    S E
Sbjct: 73  IMTVPENFDARQQWGS--KIHAIRDQQQCGSCWAFGATEAFSDRFAI--NG-KDVILSPE 127

Query: 482 DLLSC 496
           DL+SC
Sbjct: 128 DLVSC 132



 Score = 33.9 bits (74), Expect = 5.2
 Identities = 13/20 (65%), Positives = 18/20 (90%)
 Frame = +3

Query: 783 IRAELFKNGPVEGAFTVYSD 842
           I++E+  +GPVEGAFTVY+D
Sbjct: 203 IQSEIVSHGPVEGAFTVYTD 222


>UniRef50_A4RYN2 Cluster: Predicted protein; n=1; Ostreococcus
           lucimarinus CCE9901|Rep: Predicted protein -
           Ostreococcus lucimarinus CCE9901
          Length = 330

 Score = 64.5 bits (150), Expect = 3e-09
 Identities = 30/63 (47%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
           LP +FD R  +P C  L   VRDQG CGSCWA  A E M DR+C  ++G      S +  
Sbjct: 112 LPTSFDARVAYPKCSRLLGAVRDQGRCGSCWAVAATEVMNDRLCVATDGENADELSPQYA 171

Query: 488 LSC 496
           LSC
Sbjct: 172 LSC 174



 Score = 37.5 bits (83), Expect = 0.42
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 3/104 (2%)
 Frame = +1

Query: 514 GCSGG--MPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPK 687
           GC GG  +  L   + K  G+  GG  +S+  C PYE   C+H       PC     TP+
Sbjct: 180 GCDGGDVLDTLRIAFTK--GIPYGGMLDSN-ACLPYEFEACDH-------PCMVAGTTPQ 229

Query: 688 -CTKKCESGYDVNYKQDKQYGKHVYTCPETKTTSARNCSRMVPS 816
            C  KC  G  +++          YTCP+   T   +    VP+
Sbjct: 230 SCPAKCADGSALSFVHPT---SEPYTCPKGDVTHTGSGVYTVPN 270


>UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 314

 Score = 63.3 bits (147), Expect = 7e-09
 Identities = 28/68 (41%), Positives = 43/68 (63%), Gaps = 1/68 (1%)
 Frame = +2

Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG-TKHFHF 472
           +L  S+P +FD R +WPDC  ++ + +Q  CGSCWAF + E ++DR+C  SN  T     
Sbjct: 83  ELKGSIPTSFDSRVQWPDC--IHPILNQEQCGSCWAFSSSEVLSDRLCIASNNKTNPGAL 140

Query: 473 SAEDLLSC 496
           S + L++C
Sbjct: 141 SPQTLVAC 148



 Score = 34.3 bits (75), Expect = 3.9
 Identities = 13/19 (68%), Positives = 16/19 (84%)
 Frame = +1

Query: 514 GCSGGMPRLAWEYWKHFGL 570
           GCSGG+P+LAWEY +  GL
Sbjct: 155 GCSGGIPQLAWEYMELKGL 173


>UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|Rep:
           Cathepsin B - Streblomastix strix
          Length = 283

 Score = 62.5 bits (145), Expect = 1e-08
 Identities = 29/62 (46%), Positives = 37/62 (59%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           +P+ FD R+KWPD   +  VRDQG CGSCWAF   E + DR+     G      + EDL+
Sbjct: 63  VPDTFDAREKWPDA--ILPVRDQGECGSCWAFSIAETIGDRLGVL--GCSRGDIAPEDLV 118

Query: 491 SC 496
           SC
Sbjct: 119 SC 120


>UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2;
           cellular organisms|Rep: Cysteine proteinase, putative -
           Archaeoglobus fulgidus
          Length = 1088

 Score = 62.1 bits (144), Expect = 2e-08
 Identities = 32/77 (41%), Positives = 40/77 (51%)
 Frame = +2

Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481
           +ASLP  FD    W D   L+ VRDQGSCGSCWA  AV A+   +   S  +     S +
Sbjct: 591 MASLPSRFD----WRDYTGLSAVRDQGSCGSCWAHSAVAALESALIVESGASSSIDLSEQ 646

Query: 482 DLLSCCPICDWDAAEEC 532
            LLSC   C+    + C
Sbjct: 647 HLLSCEQDCEVGIGDWC 663


>UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3;
           Giardia intestinalis|Rep: Cathepsin B-like CP1 precursor
           - Giardia lamblia (Giardia intestinalis)
          Length = 303

 Score = 62.1 bits (144), Expect = 2e-08
 Identities = 27/67 (40%), Positives = 38/67 (56%)
 Frame = +2

Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFS 475
           +L+  +P  FD RD++P C  +    DQGSCGSCWAF A+    DR C      +   +S
Sbjct: 74  ELVDPIPPQFDFRDEYPQC--VKPALDQGSCGSCWAFSAIGVFGDRRCAMGIDKEAVSYS 131

Query: 476 AEDLLSC 496
            + L+SC
Sbjct: 132 QQHLISC 138


>UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4;
           Giardia intestinalis|Rep: Cathepsin B-like CP2 precursor
           - Giardia lamblia (Giardia intestinalis)
          Length = 300

 Score = 60.1 bits (139), Expect = 7e-08
 Identities = 27/62 (43%), Positives = 38/62 (61%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           +PE+FD R+++P C  + EV DQG CGSCWAF +V    DR C      K   +S + ++
Sbjct: 75  VPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVAGLDKKPVKYSPQYVV 132

Query: 491 SC 496
           SC
Sbjct: 133 SC 134


>UniRef50_UPI0000E4866E Cluster: PREDICTED: similar to oxidized-LDL
           responsive gene 2, partial; n=1; Strongylocentrotus
           purpuratus|Rep: PREDICTED: similar to oxidized-LDL
           responsive gene 2, partial - Strongylocentrotus
           purpuratus
          Length = 363

 Score = 58.8 bits (136), Expect = 2e-07
 Identities = 27/64 (42%), Positives = 38/64 (59%), Gaps = 1/64 (1%)
 Frame = +2

Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT-KHFHFSAED 484
           ++PE FD R +WP    +  V++QG+C S WA       +DR+   SNGT K+ H S + 
Sbjct: 221 AIPEEFDARAQWPGL--VEGVQNQGNCASSWAMSTAATASDRLAIQSNGTFKYMHLSPQH 278

Query: 485 LLSC 496
           LLSC
Sbjct: 279 LLSC 282


>UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_113_4299_5381 - Giardia lamblia ATCC
           50803
          Length = 360

 Score = 58.8 bits (136), Expect = 2e-07
 Identities = 26/61 (42%), Positives = 36/61 (59%)
 Frame = +2

Query: 314 PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLS 493
           PE++D RD++P C T  EV DQG+CGSCWAF +V+   D  C          +S + +L 
Sbjct: 141 PESYDFRDEYPHCIT--EVVDQGNCGSCWAFSSVQTFADHRCRSGLDATGVSYSVQYVLD 198

Query: 494 C 496
           C
Sbjct: 199 C 199


>UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like protein
           F26E4.3; n=2; Caenorhabditis|Rep: Uncharacterized
           peptidase C1-like protein F26E4.3 - Caenorhabditis
           elegans
          Length = 491

 Score = 58.8 bits (136), Expect = 2e-07
 Identities = 27/62 (43%), Positives = 36/62 (58%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           LPE+FD RDKW   P ++ V DQG CGS W+       +DR+   S G  +   S++ LL
Sbjct: 223 LPEHFDARDKWG--PLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQLL 280

Query: 491 SC 496
           SC
Sbjct: 281 SC 282


>UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;
           n=1; Strongylocentrotus purpuratus|Rep: PREDICTED:
           hypothetical protein - Strongylocentrotus purpuratus
          Length = 450

 Score = 58.4 bits (135), Expect = 2e-07
 Identities = 28/64 (43%), Positives = 35/64 (54%)
 Frame = +2

Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
           A LPE FD R+ WP    ++EV DQG CGS WA       +DR+   S G  +   S + 
Sbjct: 195 ARLPETFDARENWPGL--IDEVIDQGKCGSSWAISTASVASDRLAIQSMGEINPRLSEQH 252

Query: 485 LLSC 496
           LLSC
Sbjct: 253 LLSC 256


>UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA,
           isoform A; n=2; Endopterygota|Rep: PREDICTED: similar to
           CG3074-PA, isoform A - Tribolium castaneum
          Length = 445

 Score = 56.4 bits (130), Expect = 8e-07
 Identities = 27/63 (42%), Positives = 34/63 (53%)
 Frame = +2

Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
           SLP  FD   KWP    ++E++DQG CGS WA       +DR    S G +    SA+ L
Sbjct: 196 SLPREFDSEFKWPGW--MSEIQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQHL 253

Query: 488 LSC 496
           LSC
Sbjct: 254 LSC 256


>UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 323

 Score = 56.4 bits (130), Expect = 8e-07
 Identities = 27/77 (35%), Positives = 39/77 (50%)
 Frame = +2

Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
           ++P +FD R  W DC  ++ VR+Q SCGSCWA      + DR+C  S+       S + L
Sbjct: 45  TIPASFDVRTNWGDC--MSPVREQQSCGSCWAQVTSGILADRMCIESDKNIKMLLSPQYL 102

Query: 488 LSCCPICDWDAAEECRD 538
           + C   C  D    C +
Sbjct: 103 MDCDGSCVSDGVSGCNN 119


>UniRef50_O02470 Cluster: Cysteine proteinase; n=2; Chromadorea|Rep:
           Cysteine proteinase - Globodera pallida
          Length = 53

 Score = 56.0 bits (129), Expect = 1e-06
 Identities = 22/41 (53%), Positives = 26/41 (63%)
 Frame = +2

Query: 377 QGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC 499
           QG CG CWAF   E ++DR C  SNGT+    S  DLL+CC
Sbjct: 1   QGQCGRCWAFSTAEVISDRTCIASNGTQQPIISPTDLLTCC 41


>UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcoptes
           scabiei type hominis|Rep: Sar s 1 allergen Yv9053H09 -
           Sarcoptes scabiei type hominis
          Length = 253

 Score = 53.6 bits (123), Expect = 6e-06
 Identities = 28/68 (41%), Positives = 38/68 (55%), Gaps = 4/68 (5%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV----EAMTDRVCTYSNGTKHFHFSA 478
           LPE FD RD       L+++R+QG CG+CWAF A+     A   R     N T+  HFS 
Sbjct: 37  LPEKFDLRD----LGYLSKIRNQGRCGACWAFAALASVESAYNRRTRIVHNRTRKHHFSE 92

Query: 479 EDLLSCCP 502
           ++L+ C P
Sbjct: 93  QELVDCSP 100


>UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35;
           Viridiplantae|Rep: Cysteine proteinase 15A precursor -
           Pisum sativum (Garden pea)
          Length = 363

 Score = 53.6 bits (123), Expect = 6e-06
 Identities = 30/75 (40%), Positives = 39/75 (52%)
 Frame = +2

Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
           +LPE+FD R+K    P    V+DQGSCGSCWAF    A+      Y    K    S + L
Sbjct: 131 NLPEDFDWREKGAVTP----VKDQGSCGSCWAFSTTGALEG--AHYLATGKLVSLSEQQL 184

Query: 488 LSCCPICDWDAAEEC 532
           + C  +CD + A  C
Sbjct: 185 VDCDHVCDPEQAGSC 199


>UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_217_11853_10927 - Giardia lamblia
           ATCC 50803
          Length = 308

 Score = 52.8 bits (121), Expect = 1e-05
 Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 1/86 (1%)
 Frame = +2

Query: 242 GSYRDEHFATLPIKT-HNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVE 418
           GS R +     P++   N D +   P++FD R+++P C T  EV D G C S WA+ AV+
Sbjct: 54  GSPRTQSSIVRPVRVPENEDPV---PDHFDFREEYPQCIT--EVIDIGLCSSSWAYSAVD 108

Query: 419 AMTDRVCTYSNGTKHFHFSAEDLLSC 496
           A + R C      +   +SA+ +LSC
Sbjct: 109 AFSHRRCLTGLDQEATRYSAQYILSC 134


>UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p -
           Drosophila melanogaster (Fruit fly)
          Length = 431

 Score = 52.4 bits (120), Expect = 1e-05
 Identities = 23/62 (37%), Positives = 34/62 (54%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           LP +F+  DKW     ++EV DQG CG+ W        +DR    S G ++   SA+++L
Sbjct: 187 LPSSFNALDKWSSY--ISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNIL 244

Query: 491 SC 496
           SC
Sbjct: 245 SC 246


>UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin O
           precursor; n=2; Apocrita|Rep: PREDICTED: similar to
           Cathepsin O precursor - Apis mellifera
          Length = 374

 Score = 52.0 bits (119), Expect = 2e-05
 Identities = 32/93 (34%), Positives = 44/93 (47%)
 Frame = +2

Query: 218 VRAS*ENNGSYRDEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSC 397
           +R     N SY  +H     I         S+P  FD RDK    P    VR QGSCG+C
Sbjct: 128 IRGEKHMNASYHRKH----QISIDRMKRSISIPLRFDWRDKGVITP----VRSQGSCGAC 179

Query: 398 WAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           WAF  +E + + +    NGT H   S ++++ C
Sbjct: 180 WAFSTIEVI-ESMFAIKNGTLH-SLSVQEMIDC 210


>UniRef50_UPI0000E815AE Cluster: PREDICTED: similar to
           glucocorticoid-inducible protein; n=1; Gallus
           gallus|Rep: PREDICTED: similar to
           glucocorticoid-inducible protein - Gallus gallus
          Length = 307

 Score = 51.6 bits (118), Expect = 2e-05
 Identities = 24/62 (38%), Positives = 33/62 (53%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           LP +FD   KWP    ++E  DQG+C   WAF      +DR+  +S G      S ++LL
Sbjct: 153 LPRHFDAATKWPGM--IHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMTPSLSPQNLL 210

Query: 491 SC 496
           SC
Sbjct: 211 SC 212


>UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p;
           n=1; Nasonia vitripennis|Rep: PREDICTED: similar to
           GM06507p - Nasonia vitripennis
          Length = 483

 Score = 51.2 bits (117), Expect = 3e-05
 Identities = 23/62 (37%), Positives = 33/62 (53%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           LP  FD R +W +   +  V+DQG CG+ WA   V+  +DR    S G +    S + L+
Sbjct: 236 LPREFDSRIQWGN--DITPVQDQGWCGASWAISTVDVASDRFAIMSKGIEKVQLSGQHLI 293

Query: 491 SC 496
           SC
Sbjct: 294 SC 295


>UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=15;
           Magnoliophyta|Rep: Cysteine proteinase RD19a precursor -
           Arabidopsis thaliana (Mouse-ear cress)
          Length = 368

 Score = 51.2 bits (117), Expect = 3e-05
 Identities = 27/75 (36%), Positives = 39/75 (52%)
 Frame = +2

Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
           +LPE+FD    W D   +  V++QGSCGSCW+F A  A+      +    K    S + L
Sbjct: 134 NLPEDFD----WRDHGAVTPVKNQGSCGSCWSFSATGALEG--ANFLATGKLVSLSEQQL 187

Query: 488 LSCCPICDWDAAEEC 532
           + C   CD + A+ C
Sbjct: 188 VDCDHECDPEEADSC 202


>UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheirus
           salmonis|Rep: Cysteine proteinase - Lepeophtheirus
           salmonis (salmon louse)
          Length = 372

 Score = 50.8 bits (116), Expect = 4e-05
 Identities = 26/65 (40%), Positives = 36/65 (55%)
 Frame = +2

Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481
           I  LPE+ D R+K      + +V++QGSCGSCW F AVE +   V   +N T     S +
Sbjct: 112 IKDLPESVDWREKG----VITDVKNQGSCGSCWVFSAVEQIESYVAIENNMTSPPLLSTQ 167

Query: 482 DLLSC 496
            + SC
Sbjct: 168 QITSC 172


>UniRef50_Q9GZM7-2 Cluster: Isoform 2 of Q9GZM7 ; n=1; Homo
           sapiens|Rep: Isoform 2 of Q9GZM7 - Homo sapiens (Human)
          Length = 283

 Score = 49.6 bits (113), Expect = 1e-04
 Identities = 24/62 (38%), Positives = 34/62 (54%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           LP  F+  +KWP+   ++E  DQG+C   WAF      +DRV  +S G      S ++LL
Sbjct: 69  LPTAFEASEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLL 126

Query: 491 SC 496
           SC
Sbjct: 127 SC 128


>UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11;
           Trichomonadidae|Rep: Cysteine protease - Tritrichomonas
           foetus (Trichomonas foetus)
          Length = 315

 Score = 49.6 bits (113), Expect = 1e-04
 Identities = 24/63 (38%), Positives = 36/63 (57%)
 Frame = +2

Query: 320 NFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC 499
           N D  D W +   +NE++DQ +CGSCWAF A++A  +     S GT    +S ++L+ C 
Sbjct: 100 NVDSID-WREKGVVNEIKDQAACGSCWAFSAIQA-AESAYAISTGTLE-SYSEQNLVDCV 156

Query: 500 PIC 508
             C
Sbjct: 157 QGC 159


>UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-like
           precursor; n=26; Euteleostomi|Rep: Tubulointerstitial
           nephritis antigen-like precursor - Homo sapiens (Human)
          Length = 467

 Score = 49.6 bits (113), Expect = 1e-04
 Identities = 24/62 (38%), Positives = 34/62 (54%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           LP  F+  +KWP+   ++E  DQG+C   WAF      +DRV  +S G      S ++LL
Sbjct: 203 LPTAFEASEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLL 260

Query: 491 SC 496
           SC
Sbjct: 261 SC 262


>UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2;
           Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio
           molitor (Yellow mealworm)
          Length = 336

 Score = 49.2 bits (112), Expect = 1e-04
 Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
 Frame = +2

Query: 254 DEHFATLPIKTH-NFDLIASL--PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
           D H   +PIKT  +  L AS+  P +FD    W D   ++ V++QGSCGSCWAF +  A+
Sbjct: 99  DLHKNGIPIKTREDLGLNASVRYPASFD----WRDQGMVSPVKNQGSCGSCWAFSSTGAI 154

Query: 425 TDRVCTYSNGTKHFHFSAEDLLSCCP 502
             ++   +        S + L+ C P
Sbjct: 155 ESQMKIANGAGYDSSVSEQQLVDCVP 180


>UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan
           CA, family C1, cathepsin L-like cysteine peptidase -
           Trichomonas vaginalis G3
          Length = 306

 Score = 49.2 bits (112), Expect = 1e-04
 Identities = 19/56 (33%), Positives = 34/56 (60%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPIC 508
           W +   +N++++QG+CGSCWAF A++ +  +V    N  + +  S ++LL C   C
Sbjct: 94  WREQGIVNKIKNQGACGSCWAFSAIQVIESQVA--KNQKQLYDLSEQNLLDCVTSC 147


>UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase
           precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin
           L-like cysteine proteinase precursor - Acanthoscelides
           obtectus (Bean weevil)
          Length = 321

 Score = 48.4 bits (110), Expect = 2e-04
 Identities = 27/72 (37%), Positives = 42/72 (58%)
 Frame = +2

Query: 290 NFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFH 469
           +FD +  +P+  D R+K      + EV+ QG+CGSCWAF AV ++  +V    NG+    
Sbjct: 103 SFDNVNDIPKTVDWREKG----AVTEVKKQGNCGSCWAFSAVGSIEGQV-FLKNGSLE-S 156

Query: 470 FSAEDLLSCCPI 505
            SA++L+ C  I
Sbjct: 157 LSAQNLVDCAGI 168


>UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 234

 Score = 48.4 bits (110), Expect = 2e-04
 Identities = 25/70 (35%), Positives = 38/70 (54%)
 Frame = +2

Query: 299 LIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSA 478
           ++  +P+  D R K      +NE++DQ  CGSCWAFG+  AM +      +GT  +  S 
Sbjct: 14  IVGDIPDEIDYRTKG----AVNEIKDQKHCGSCWAFGSCAAM-ESSWFLKHGTL-YSLSE 67

Query: 479 EDLLSCCPIC 508
           + L+ CC  C
Sbjct: 68  QCLVDCCHDC 77


>UniRef50_Q58HK5 Cluster: Hc58; n=1; Haemonchus contortus|Rep: Hc58
           - Haemonchus contortus (Barber pole worm)
          Length = 241

 Score = 47.6 bits (108), Expect = 4e-04
 Identities = 17/29 (58%), Positives = 21/29 (72%)
 Frame = +2

Query: 368 VRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454
           +RDQ +CGSCWA  A E M+DR C +S G
Sbjct: 108 IRDQSNCGSCWAVSAAETMSDRACIHSKG 136


>UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase
           precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine
           proteinase precursor - Heterodera glycines (Soybean cyst
           nematode worm)
          Length = 353

 Score = 47.6 bits (108), Expect = 4e-04
 Identities = 22/70 (31%), Positives = 33/70 (47%)
 Frame = +2

Query: 287 HNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHF 466
           HN   +A        +  W +   + EV+DQG CGSCWAF A  A+ +        +K  
Sbjct: 123 HNMATLAGNSSTLPEKLDWREKGAVTEVKDQGDCGSCWAFSATGAI-EGALAQKKASKII 181

Query: 467 HFSAEDLLSC 496
             S ++L+ C
Sbjct: 182 SLSEQNLVDC 191


>UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_21,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 349

 Score = 47.6 bits (108), Expect = 4e-04
 Identities = 23/58 (39%), Positives = 34/58 (58%)
 Frame = +2

Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICDWDAAEEC 532
           ++EV++QGSCGSCWAF AV A+         G K+   S ++L+ C  + D   +E C
Sbjct: 137 VSEVKNQGSCGSCWAFSAVAAL--ETALRQGGVKNVELSEQELVDCA-VKDEFESEGC 191


>UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1;
           Phytophthora infestans|Rep: Cathepsin-like cysteine
           protease - Phytophthora infestans (Potato late blight
           fungus)
          Length = 376

 Score = 47.2 bits (107), Expect = 5e-04
 Identities = 31/96 (32%), Positives = 46/96 (47%)
 Frame = +2

Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481
           +  LP  +D    W +  T+  V++QG CGSCWAF AV AM    C Y+  T      +E
Sbjct: 130 VEDLPATWD----WREHSTVTPVKNQGQCGSCWAFSAVAAME---CAYALSTGTLESLSE 182

Query: 482 DLLSCCPICDWDAAEECRD*LGNIGSTSV*YQEVVT 589
             L  C +   +  + C     + G  S  Y+E++T
Sbjct: 183 QELVDCTL---NGIDTC----NHGGEMSEGYEEIIT 211


>UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia
           intestinalis|Rep: GLP_41_8294_9919 - Giardia lamblia
           ATCC 50803
          Length = 541

 Score = 47.2 bits (107), Expect = 5e-04
 Identities = 29/68 (42%), Positives = 38/68 (55%), Gaps = 5/68 (7%)
 Frame = +2

Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN-----GTKHFHF 472
           +LP++FD RD       +  V DQG+CGSC+ FGAV+AM  R+   +N     GTK    
Sbjct: 240 TLPDDFDWRDV-NGVSYIPGVLDQGACGSCFTFGAVQAMNSRIMIATNRTDPVGTKTI-L 297

Query: 473 SAEDLLSC 496
           S E  L C
Sbjct: 298 STEHALDC 305


>UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2;
           Trypanosoma cruzi|Rep: Cysteine proteinase, putative -
           Trypanosoma cruzi
          Length = 392

 Score = 47.2 bits (107), Expect = 5e-04
 Identities = 26/64 (40%), Positives = 32/64 (50%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           +P+  D R+  P    L  V+DQG CGSCWA GA E M       + G  H   S + L 
Sbjct: 141 IPDEVDYRNSSP--AILTAVKDQGRCGSCWAHGAAEEMESHFAILT-GRLHV-LSQQQLT 196

Query: 491 SCCP 502
           SC P
Sbjct: 197 SCAP 200


>UniRef50_Q6XM39 Cluster: FirrV-1-A48 precursor; n=1; Feldmannia
           irregularis virus a|Rep: FirrV-1-A48 precursor -
           Feldmannia irregularis virus a
          Length = 373

 Score = 46.8 bits (106), Expect = 7e-04
 Identities = 17/41 (41%), Positives = 25/41 (60%)
 Frame = +2

Query: 374 DQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           DQGSC SCW+   V+ + DRV   +NG      S ++++SC
Sbjct: 80  DQGSCASCWSISVVQMLADRVSVSTNGKIKLKLSVQEMISC 120


>UniRef50_P81494 Cluster: Cathepsin B; n=2; Phasianidae|Rep:
           Cathepsin B - Coturnix coturnix japonica (Japanese
           quail)
          Length = 48

 Score = 46.8 bits (106), Expect = 7e-04
 Identities = 16/25 (64%), Positives = 22/25 (88%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGS 385
           LP+ FD R +WP+CPT++E+RDQGS
Sbjct: 1   LPDTFDSRKQWPNCPTISEIRDQGS 25


>UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin C;
           n=1; Strongylocentrotus purpuratus|Rep: PREDICTED:
           similar to cathepsin C - Strongylocentrotus purpuratus
          Length = 482

 Score = 46.4 bits (105), Expect = 0.001
 Identities = 23/64 (35%), Positives = 35/64 (54%)
 Frame = +2

Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
           ++LPE FD RD       ++ VRDQG CGSC+AF +      R+   +N       S ++
Sbjct: 247 SNLPEKFDWRDVG-GIDYVSPVRDQGICGSCYAFASTATQESRLRVMTNNNVKVVMSPQE 305

Query: 485 LLSC 496
           ++SC
Sbjct: 306 VVSC 309


>UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gambiae
           str. PEST|Rep: ENSANGP00000018713 - Anopheles gambiae
           str. PEST
          Length = 559

 Score = 46.4 bits (105), Expect = 0.001
 Identities = 20/38 (52%), Positives = 25/38 (65%)
 Frame = +2

Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 415
           +  LP +FD    W D   + EV++QGSCGSCWAF AV
Sbjct: 336 VGDLPRSFD----WRDHGAVTEVKNQGSCGSCWAFSAV 369


>UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 336

 Score = 45.6 bits (103), Expect = 0.002
 Identities = 22/62 (35%), Positives = 31/62 (50%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           LP +FD    W D   L++V+DQG CGSCWAF     +      +    +   FS + L+
Sbjct: 125 LPASFD----WRDYGILSDVKDQGQCGSCWAFSTTGIL--EALYFMENRQKISFSEQQLV 178

Query: 491 SC 496
            C
Sbjct: 179 DC 180


>UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Rep:
           Cysteine protease - Solanum lycopersicum (Tomato)
           (Lycopersicon esculentum)
          Length = 345

 Score = 45.6 bits (103), Expect = 0.002
 Identities = 24/83 (28%), Positives = 42/83 (50%), Gaps = 2/83 (2%)
 Frame = +2

Query: 254 DEHFATLPIKTHNFDLIASLPENFDPRD-KWPDCPTLNEVRDQGSCGSCWAFGAVEAMTD 430
           + + +  P+ +  F  I  L +++ P +  W +   + +V+ QG CG CWAF AV ++  
Sbjct: 107 NSYLSPSPMSSTEFKKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEG 166

Query: 431 RVCTYSNGTKH-FHFSAEDLLSC 496
               Y   T +   FS ++LL C
Sbjct: 167 ---AYKIATGNLMEFSEQELLDC 186


>UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1;
           Bigelowiella natans|Rep: Digestive cysteine proteinase -
           Bigelowiella natans (Pedinomonas minutissima)
           (Chlorarachnion sp.(strain CCMP 621))
          Length = 360

 Score = 45.6 bits (103), Expect = 0.002
 Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 2/54 (3%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT--KHFHFSAEDLLSC 496
           W D   L  V+DQG CGSCWAF A +A+        N T       S E L+ C
Sbjct: 115 WRDFNALTPVKDQGGCGSCWAFSATQALESAHYIKHNDTLDSPIALSTEQLVEC 168


>UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep:
           Vivapain-4 - Plasmodium vivax
          Length = 484

 Score = 45.6 bits (103), Expect = 0.002
 Identities = 19/52 (36%), Positives = 31/52 (59%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           W +   ++E+++Q  CGSCWAFGAV A+  +     N  +H   S ++L+ C
Sbjct: 268 WREHNAVSEIKNQNLCGSCWAFGAVGAVESQYAIRKN--QHVLISEQELVDC 317


>UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Oryza
           sativa|Rep: Cysteine protease 1 precursor - Oryza sativa
           subsp. japonica (Rice)
          Length = 490

 Score = 45.6 bits (103), Expect = 0.002
 Identities = 25/69 (36%), Positives = 37/69 (53%), Gaps = 7/69 (10%)
 Frame = +2

Query: 239 NGSYRDEHFATLPI-------KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSC 397
           NG +R  +  T P        + +  D + +LP++ D RDK      +  V++QG CGSC
Sbjct: 124 NGEFRATYLGTTPAGRGRRVGEAYRHDGVEALPDSVDWRDKGA---VVAPVKNQGQCGSC 180

Query: 398 WAFGAVEAM 424
           WAF AV A+
Sbjct: 181 WAFSAVAAV 189


>UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 280

 Score = 45.2 bits (102), Expect = 0.002
 Identities = 22/64 (34%), Positives = 37/64 (57%)
 Frame = +2

Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
           +SLP+ FD    W +   + +V++QG+CGSCWAF  +  + + +    N T    +S ++
Sbjct: 66  SSLPQQFD----WRNLGKVTQVKNQGNCGSCWAF-TITGLFESINLIRNKTVEL-YSEQE 119

Query: 485 LLSC 496
           LL C
Sbjct: 120 LLDC 123


>UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted,
           possible transmembrane domain near N-terminus; n=4;
           Cryptosporidium|Rep: Cryptopain-cysteine proteinase
           secreted, possible transmembrane domain near N-terminus
           - Cryptosporidium parvum Iowa II
          Length = 401

 Score = 45.2 bits (102), Expect = 0.002
 Identities = 19/47 (40%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
 Frame = +2

Query: 317 ENFDPRDK--WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 451
           E F P +   W +   +N +R+Q +CGSCWAF AV A+    C  +N
Sbjct: 172 EEFVPPNSINWVEAGCVNPIRNQKNCGSCWAFSAVAALEGATCAQTN 218


>UniRef50_Q23H32 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 365

 Score = 45.2 bits (102), Expect = 0.002
 Identities = 26/72 (36%), Positives = 37/72 (51%)
 Frame = +2

Query: 290 NFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFH 469
           +F L  S+PE+ D R+K      +  V+ QG CGSCWAF  V A+       +       
Sbjct: 128 SFLLSDSVPESVDWREK-----LVAPVQKQGGCGSCWAFSTVIALEGAYAKQTGNV--IK 180

Query: 470 FSAEDLLSCCPI 505
           FS ++L+ CC I
Sbjct: 181 FSEQNLIDCCRI 192


>UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like
           cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin B-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 288

 Score = 45.2 bits (102), Expect = 0.002
 Identities = 22/63 (34%), Positives = 35/63 (55%)
 Frame = +2

Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
           S+P +++  +++P C     V DQG CGSCW+F   ++ + R C   N  K   FS   L
Sbjct: 67  SIPMSYNFTERFPQCDF--GVLDQGKCGSCWSFAVSKSFSHRYCRKYN--KPVLFSQSHL 122

Query: 488 LSC 496
           ++C
Sbjct: 123 VAC 125


>UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia
           tropicalis|Rep: Blo t 1 allergen - Blomia tropicalis
           (Mite)
          Length = 333

 Score = 45.2 bits (102), Expect = 0.002
 Identities = 21/35 (60%), Positives = 23/35 (65%)
 Frame = +2

Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGA 412
           SLP+NFD R K      L  +R QGSCGSCWAF A
Sbjct: 112 SLPQNFDWRQK----ARLTRIRQQGSCGSCWAFAA 142


>UniRef50_Q9UJW2 Cluster: Tubulointerstitial nephritis antigen;
           n=20; Amniota|Rep: Tubulointerstitial nephritis antigen
           - Homo sapiens (Human)
          Length = 476

 Score = 45.2 bits (102), Expect = 0.002
 Identities = 23/72 (31%), Positives = 32/72 (44%)
 Frame = +2

Query: 284 THNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKH 463
           T +      LPE F    KWP     +   DQ +C + WAF       DR+   S G   
Sbjct: 208 TASLPATTDLPEFFVASYKWPGWT--HGPLDQKNCAASWAFSTASVAADRIAIQSKGRYT 265

Query: 464 FHFSAEDLLSCC 499
            + S ++L+SCC
Sbjct: 266 ANLSPQNLISCC 277


>UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin O;
           n=1; Danio rerio|Rep: PREDICTED: similar to cathepsin O
           - Danio rerio
          Length = 327

 Score = 44.8 bits (101), Expect = 0.003
 Identities = 22/59 (37%), Positives = 29/59 (49%)
 Frame = +2

Query: 320 NFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           N  PR  W D   +  V +QGSCG CWAF  VEA+     +   G K    S + ++ C
Sbjct: 119 NNPPRFDWRDHGVVGPVHNQGSCGGCWAFSIVEAIES--VSAKVGEKLQQLSVQQVIDC 175


>UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag-RP
           - Bombyx mori (Silk moth)
          Length = 404

 Score = 44.8 bits (101), Expect = 0.003
 Identities = 22/61 (36%), Positives = 32/61 (52%)
 Frame = +2

Query: 314 PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLS 493
           P+ FD R +W     ++ + DQ  CGS WA      + DR    S GT++   S++ LLS
Sbjct: 186 PDEFDARREWYGY--ISPIADQDWCGSDWAVSIASIVGDRFSIQSFGTENVRMSSQTLLS 243

Query: 494 C 496
           C
Sbjct: 244 C 244


>UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 344

 Score = 44.4 bits (100), Expect = 0.004
 Identities = 21/60 (35%), Positives = 30/60 (50%)
 Frame = +2

Query: 317 ENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           +N  P D W +   +  V+ QG CGSCW F A  A+ +      NG    +FS + +L C
Sbjct: 134 KNAPPMD-WRNASAITPVKQQGKCGSCWTF-ASTAVLESFSFIKNGAPLTNFSEQQILDC 191


>UniRef50_Q7QVE9 Cluster: GLP_542_3431_1206; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_542_3431_1206 - Giardia lamblia ATCC
           50803
          Length = 741

 Score = 44.4 bits (100), Expect = 0.004
 Identities = 30/82 (36%), Positives = 43/82 (52%), Gaps = 1/82 (1%)
 Frame = +2

Query: 254 DEHFATLPIKTHNFDLI-ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTD 430
           ++ +  LP    N DL  A+LP NF  R          ++ +QGSCG C+A  AVE +T 
Sbjct: 40  EDEYNELPDGPDNADLTRAALPTNFTYRGH-----RCIQIINQGSCGCCYAAAAVEMVTA 94

Query: 431 RVCTYSNGTKHFHFSAEDLLSC 496
           R C   N ++    S EDL++C
Sbjct: 95  RRCLQLNDSR--LVSLEDLVTC 114


>UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine
           protease; n=11; Callosobruchus maculatus|Rep: Putative
           gut cathepsin L-like cysteine protease - Callosobruchus
           maculatus (Southern cowpea weevil) (Pulse bruchid)
          Length = 326

 Score = 44.0 bits (99), Expect = 0.005
 Identities = 25/75 (33%), Positives = 36/75 (48%)
 Frame = +2

Query: 272 LPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 451
           LP    +FD    +         W +   +  V+DQ +CGSCWAF AV A+  +     N
Sbjct: 95  LPSNAVHFDNFEDIDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAIEGQFFK-KN 153

Query: 452 GTKHFHFSAEDLLSC 496
           GT     SA++L+ C
Sbjct: 154 GTL-VSLSAQELVDC 167


>UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor;
           n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase
           precursor - Diabrotica virgifera virgifera (western corn
           rootworm)
          Length = 326

 Score = 44.0 bits (99), Expect = 0.005
 Identities = 20/44 (45%), Positives = 26/44 (59%)
 Frame = +2

Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 406
           P   H+   +  LP  FD R+K      + EV+DQGSCGSCW+F
Sbjct: 98  PRVIHSLTPVKDLPSKFDWREKG----AVTEVKDQGSCGSCWSF 137


>UniRef50_Q23TW3 Cluster: Papain family cysteine protease containing
           protein; n=5; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 437

 Score = 44.0 bits (99), Expect = 0.005
 Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 3/80 (3%)
 Frame = +2

Query: 266 ATLPIKTHNFDL--IASLPENFDPRDKWPDCPTLNEVRDQGS-CGSCWAFGAVEAMTDRV 436
           AT    T +F    ++ LP+  D R+K      + +V+ QG  CGSCWAF AV A+    
Sbjct: 188 ATAQANTRSFRKYDLSQLPQYVDWREKG----VVTQVKSQGKDCGSCWAFAAVAALESHY 243

Query: 437 CTYSNGTKHFHFSAEDLLSC 496
                G K   FS + L+ C
Sbjct: 244 -ALKTGKKPIQFSEQQLVDC 262


>UniRef50_Q23RT7 Cluster: Papain family cysteine protease containing
           protein; n=7; Hymenostomatida|Rep: Papain family
           cysteine protease containing protein - Tetrahymena
           thermophila SB210
          Length = 387

 Score = 44.0 bits (99), Expect = 0.005
 Identities = 25/73 (34%), Positives = 37/73 (50%)
 Frame = +2

Query: 278 IKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT 457
           +KT +   +  LP++ D    W D   +  V+DQG CGSCWAF A  A+ +     + G 
Sbjct: 122 LKTSDKINVKDLPKSVD----WRDAGVVTPVKDQGHCGSCWAF-ATTAVIESYAAIATGQ 176

Query: 458 KHFHFSAEDLLSC 496
                S + L+SC
Sbjct: 177 LK-TLSTQQLVSC 188


>UniRef50_A0CYS7 Cluster: Chromosome undetermined scaffold_31, whole
           genome shotgun sequence; n=3; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_31,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 358

 Score = 44.0 bits (99), Expect = 0.005
 Identities = 20/62 (32%), Positives = 34/62 (54%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           +PE+++ R+  P+C     +  QG+C S ++  AV A +DR+C   NG      S +  +
Sbjct: 131 IPESYNFREAQPECA--QPIYFQGNCSSSYSIAAVSATSDRLCKSKNGEFQDQLSPQSPI 188

Query: 491 SC 496
           SC
Sbjct: 189 SC 190


>UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46;
           Eukaryota|Rep: Cysteine proteinase 1 precursor - Zea
           mays (Maize)
          Length = 371

 Score = 44.0 bits (99), Expect = 0.005
 Identities = 25/74 (33%), Positives = 35/74 (47%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           LP++FD    W D   +  V++QGSCGSCW+F A  A+      Y    K    S +  +
Sbjct: 137 LPDDFD----WRDHGAVGPVKNQGSCGSCWSFSASGALEG--AHYLATGKLEVLSEQQFV 190

Query: 491 SCCPICDWDAAEEC 532
            C   CD    + C
Sbjct: 191 DCDHECDSSEPDSC 204


>UniRef50_P25774 Cluster: Cathepsin S precursor; n=78;
           Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens
           (Human)
          Length = 331

 Score = 44.0 bits (99), Expect = 0.005
 Identities = 26/62 (41%), Positives = 38/62 (61%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           LP++ D R+K   C T  EV+ QGSCG+CWAF AV A+  ++   +   K    SA++L+
Sbjct: 115 LPDSVDWREK--GCVT--EVKYQGSCGACWAFSAVGALEAQLKLKTG--KLVSLSAQNLV 168

Query: 491 SC 496
            C
Sbjct: 169 DC 170


>UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24;
           Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa
           (Rice)
          Length = 339

 Score = 43.6 bits (98), Expect = 0.006
 Identities = 26/65 (40%), Positives = 34/65 (52%)
 Frame = +2

Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481
           I +LP   D R K    P    ++DQG CG CWAF AV AM + +   S G K    S +
Sbjct: 120 IDTLPATVDWRTKGAVTP----IKDQGQCGCCWAFSAVAAM-EGIVKLSTG-KLISLSEQ 173

Query: 482 DLLSC 496
           +L+ C
Sbjct: 174 ELVDC 178


>UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Rep:
           Cathepsin L - Stylonychia lemnae
          Length = 340

 Score = 43.6 bits (98), Expect = 0.006
 Identities = 18/44 (40%), Positives = 27/44 (61%)
 Frame = +2

Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 433
           +  +PE+ D R+K      +N V+DQG CGSCWAF  + ++  R
Sbjct: 122 LKDIPESIDWREKG----AVNAVKDQGQCGSCWAFSTIASLESR 161


>UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1;
           Trypanosoma cruzi|Rep: Cysteine protease, putative -
           Trypanosoma cruzi
          Length = 434

 Score = 43.6 bits (98), Expect = 0.006
 Identities = 21/48 (43%), Positives = 29/48 (60%)
 Frame = +2

Query: 353 PTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           P L  V+DQGSCGSCWA  A E++ + +   S+G K    S + + SC
Sbjct: 137 PVLTPVKDQGSCGSCWAHAATESV-ESMYAISSG-KLLTLSTQQITSC 182


>UniRef50_Q23FR0 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 394

 Score = 43.6 bits (98), Expect = 0.006
 Identities = 21/46 (45%), Positives = 26/46 (56%)
 Frame = +2

Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           LN V+DQG CGSCW FGA   M +     +NG     FS + L+ C
Sbjct: 196 LNPVKDQGQCGSCWTFGAAGVM-ESFNAITNGVLK-SFSEQQLVDC 239


>UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh
           fly) (Boettcherisca peregrina). Cathepsin L; n=2;
           Dictyostelium discoideum|Rep: Similar to Sarcophaga
           peregrina (Flesh fly) (Boettcherisca peregrina).
           Cathepsin L - Dictyostelium discoideum (Slime mold)
          Length = 265

 Score = 43.2 bits (97), Expect = 0.008
 Identities = 25/74 (33%), Positives = 41/74 (55%)
 Frame = +2

Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454
           P K HN +  A++P++FD    W D   + +V++QGSC SCW+F A+ A+      Y   
Sbjct: 38  PFK-HNVN--ATIPKSFD----WRDHGAVGKVKNQGSCASCWSFSALGALEGHY--YIKY 88

Query: 455 TKHFHFSAEDLLSC 496
            +    S ++L+ C
Sbjct: 89  GELLDLSEQNLVDC 102


>UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1;
           Dictyostelium discoideum AX4|Rep: Counting factor
           associated protein - Dictyostelium discoideum AX4
          Length = 531

 Score = 43.2 bits (97), Expect = 0.008
 Identities = 24/70 (34%), Positives = 39/70 (55%)
 Frame = +2

Query: 287 HNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHF 466
           H+ + + S+P   D R++  +C T   V+DQG CGSCW FG+  ++    C  +NG +  
Sbjct: 301 HDDESLRSIPSTVDWRNQ--NCVT--PVKDQGICGSCWTFGSTGSLEGTNCV-TNG-ELV 354

Query: 467 HFSAEDLLSC 496
             S + L+ C
Sbjct: 355 SLSEQQLVDC 364


>UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 395

 Score = 43.2 bits (97), Expect = 0.008
 Identities = 22/55 (40%), Positives = 29/55 (52%), Gaps = 3/55 (5%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKH---FHFSAEDLLSC 496
           W D  T   VRDQG C SCW FG++ A+  R     NG       H SA++ ++C
Sbjct: 194 WSDYQT--PVRDQGECKSCWVFGSLAALESRY-LIKNGVSEKSTLHLSAQNAMNC 245


>UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3;
           Bilateria|Rep: Cathepsin L-like cysteine proteinase -
           Longidorus elongatus
          Length = 358

 Score = 43.2 bits (97), Expect = 0.008
 Identities = 23/68 (33%), Positives = 35/68 (51%), Gaps = 2/68 (2%)
 Frame = +2

Query: 299 LIASLPENFDPRDK--WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHF 472
           +I  +P+N    D   W     + +V+DQGSCGSCWAF A  ++  +   Y    K    
Sbjct: 129 MIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCWAFSATGSLEGQ--HYKQTGKLVSL 186

Query: 473 SAEDLLSC 496
           S ++L+ C
Sbjct: 187 SEQNLVDC 194


>UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia
           medanensis|Rep: Sui m 1 allergen - Suidasia medanensis
          Length = 336

 Score = 43.2 bits (97), Expect = 0.008
 Identities = 23/67 (34%), Positives = 31/67 (46%)
 Frame = +2

Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFS 475
           D+  +LP  FD R +W        VR+QG CGSCWAF     +  +     N   H   S
Sbjct: 110 DISVALPAAFDWRQQWNTA-----VRNQGQCGSCWAFATAATVEAQYAIRKN--VHVTLS 162

Query: 476 AEDLLSC 496
            + L+ C
Sbjct: 163 EQQLVDC 169


>UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor;
           n=35; Fasciola|Rep: Cathepsin L-like proteinase
           precursor - Fasciola hepatica (Liver fluke)
          Length = 326

 Score = 43.2 bits (97), Expect = 0.008
 Identities = 19/52 (36%), Positives = 26/52 (50%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           W +   + EV+DQG+CGSCWAF     M  +     N      FS + L+ C
Sbjct: 114 WRESGYVTEVKDQGNCGSCWAFSTTGTMEGQY--MKNERTSISFSEQQLVDC 163


>UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1;
           Oryza sativa (japonica cultivar-group)|Rep: Putative
           uncharacterized protein - Oryza sativa subsp. japonica
           (Rice)
          Length = 326

 Score = 42.7 bits (96), Expect = 0.011
 Identities = 17/41 (41%), Positives = 25/41 (60%)
 Frame = +2

Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
           +A++  +  P   W +   +  V+DQG CGSCWAF  VEA+
Sbjct: 110 LAAVAGDAPPAWDWREHGAVTRVKDQGPCGSCWAFSVVEAV 150


>UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1;
           Brugia malayi|Rep: Cathepsin F-like cysteine proteinase
           - Brugia malayi (Filarial nematode worm)
          Length = 461

 Score = 42.7 bits (96), Expect = 0.011
 Identities = 26/71 (36%), Positives = 38/71 (53%), Gaps = 1/71 (1%)
 Frame = +2

Query: 287 HNFDL-IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKH 463
           ++F+L I +LP  FD    W     +  V+DQGSCGSCWAF +V    + +     G K 
Sbjct: 239 NDFNLSIYNLPSKFD----WRTEGVVTPVKDQGSCGSCWAF-SVTGNIESLWAIKTG-KL 292

Query: 464 FHFSAEDLLSC 496
              S ++L+ C
Sbjct: 293 ISLSEQELIDC 303


>UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1;
           Uronema marinum|Rep: Cathepsin L-like cysteine protease
           - Uronema marinum
          Length = 333

 Score = 42.7 bits (96), Expect = 0.011
 Identities = 22/54 (40%), Positives = 32/54 (59%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCP 502
           W     +  V++QG CGSCWAF AV ++ +R+   + G K   FS + L+SC P
Sbjct: 126 WVSKGAVQGVQNQGVCGSCWAFSAVCSL-ERLYKINTG-KLLSFSEQQLVSCEP 177


>UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase;
           n=21; Bilateria|Rep: Cathepsin L-like cysteine
           proteinase - Globodera pallida
          Length = 379

 Score = 42.7 bits (96), Expect = 0.011
 Identities = 24/48 (50%), Positives = 30/48 (62%), Gaps = 4/48 (8%)
 Frame = +2

Query: 302 IASLPENFDPRDK-WPDCPTLNEVRDQGSCGSCWAF---GAVEAMTDR 433
           +  LPE+ D RDK W     + EV++QG CGSCWAF   GA+EA   R
Sbjct: 158 VGDLPESVDWRDKGW-----VTEVKNQGMCGSCWAFSSTGALEAQHAR 200


>UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas
           foetus|Rep: TFCP2 protein - Tritrichomonas foetus
           (Trichomonas foetus)
          Length = 270

 Score = 42.7 bits (96), Expect = 0.011
 Identities = 22/61 (36%), Positives = 31/61 (50%)
 Frame = +2

Query: 314 PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLS 493
           P +FD    W     +N +++QGSCGSCWAF A+ A     C      +   FS + L+ 
Sbjct: 51  PTSFD----WRSEGKVNPIKNQGSCGSCWAFSAIAAQES--CHAIATGELLRFSEQSLVD 104

Query: 494 C 496
           C
Sbjct: 105 C 105


>UniRef50_Q24FN2 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 367

 Score = 42.7 bits (96), Expect = 0.011
 Identities = 20/52 (38%), Positives = 31/52 (59%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           W     ++ V++QGSCGSCWAF AV A+ + V    N +    +S ++L+ C
Sbjct: 161 WRQSGAVSPVKNQGSCGSCWAFSAV-ALAESVNLLRNNSLAL-YSEQELVDC 210


>UniRef50_Q231X3 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 323

 Score = 42.7 bits (96), Expect = 0.011
 Identities = 16/52 (30%), Positives = 26/52 (50%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           W +   ++ V+ QG+CGSCWAF A  ++   +       K    S + L+ C
Sbjct: 121 WVEAGKVSNVKSQGNCGSCWAFSATASVESALIIAGKVDKSISLSEQQLIDC 172


>UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 462

 Score = 42.7 bits (96), Expect = 0.011
 Identities = 18/43 (41%), Positives = 28/43 (65%)
 Frame = +2

Query: 368 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           VRDQ +CGSCWA  A EA++ ++  +S G  +F  S + ++ C
Sbjct: 242 VRDQANCGSCWAQSAGEAISSQISLHSKG--NFTVSIQQIMDC 282


>UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_56,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 314

 Score = 42.7 bits (96), Expect = 0.011
 Identities = 27/83 (32%), Positives = 39/83 (46%), Gaps = 2/83 (2%)
 Frame = +2

Query: 254 DEHFATLPI--KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMT 427
           +E FA L +  K    +L A L     P     D   +  V++QG+CGSCWAF AV A+ 
Sbjct: 85  NEEFAALLLTRKESPMNLDAELYVPQGPLKASADWSKITSVKNQGNCGSCWAFSAVGAVE 144

Query: 428 DRVCTYSNGTKHFHFSAEDLLSC 496
             +      +K    S + L+ C
Sbjct: 145 TLLTIKGVISKDLWLSEQQLVDC 167


>UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_23,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 321

 Score = 42.7 bits (96), Expect = 0.011
 Identities = 25/90 (27%), Positives = 41/90 (45%), Gaps = 5/90 (5%)
 Frame = +2

Query: 242 GSYRDEHFATLPIKTHNFDLIASLPENFDP-----RDKWPDCPTLNEVRDQGSCGSCWAF 406
           G   D+ F T+ +       + ++ +N +P        W     +  ++DQG CGSCWAF
Sbjct: 87  GDLTDQEFLTIYLNLQMPARVKNIQKNEEPFLVQEEVDWVQKGKVPAIKDQGDCGSCWAF 146

Query: 407 GAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
            AV A+   + T     +    S +DL+ C
Sbjct: 147 SAVGAL--EINTKIQFNEIVDLSEQDLVDC 174


>UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101,
           whole genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_101,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 306

 Score = 42.7 bits (96), Expect = 0.011
 Identities = 24/63 (38%), Positives = 37/63 (58%)
 Frame = +2

Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
           +LPE+ D   K      +N V++QG+CGS W+F AV A  +    +  GT HF +S ++L
Sbjct: 109 NLPESVDWSSK------MNPVKNQGTCGSGWSFSAVGAF-EAFFIFVKGT-HFQYSEQNL 160

Query: 488 LSC 496
           + C
Sbjct: 161 VDC 163


>UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61;
           Leishmania|Rep: Cysteine proteinase 2 precursor -
           Leishmania pifanoi
          Length = 444

 Score = 42.7 bits (96), Expect = 0.011
 Identities = 24/65 (36%), Positives = 37/65 (56%)
 Frame = +2

Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481
           ++++P+  D R+K    P    V+DQG+CGSCWAF AV  +  +   Y  G +    S +
Sbjct: 123 LSAVPDAVDWREKGAVTP----VKDQGACGSCWAFSAVGNIEGQ--WYLAGHELVSLSEQ 176

Query: 482 DLLSC 496
            L+SC
Sbjct: 177 QLVSC 181


>UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163
           precursor; n=4; Schizophora|Rep: Putative cysteine
           proteinase CG12163 precursor - Drosophila melanogaster
           (Fruit fly)
          Length = 614

 Score = 42.7 bits (96), Expect = 0.011
 Identities = 23/62 (37%), Positives = 32/62 (51%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           LP+ FD R K      + +V++QGSCGSCWAF     +       +   K   FS ++LL
Sbjct: 394 LPKEFDWRQK----DAVTQVKNQGSCGSCWAFSVTGNIEGLYAVKTGELK--EFSEQELL 447

Query: 491 SC 496
            C
Sbjct: 448 DC 449


>UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;
           n=1; Strongylocentrotus purpuratus|Rep: PREDICTED:
           hypothetical protein - Strongylocentrotus purpuratus
          Length = 331

 Score = 42.3 bits (95), Expect = 0.015
 Identities = 21/65 (32%), Positives = 36/65 (55%)
 Frame = +2

Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481
           + ++P  +D R   P  P +  V++Q SCG+CWAF  VE M  ++   +   +    SA+
Sbjct: 124 LKTMPLVYDLRSIKP--PVVTPVKNQKSCGACWAFSVVETMETQIALKTK--RLTQLSAQ 179

Query: 482 DLLSC 496
           +L+ C
Sbjct: 180 ELVDC 184


>UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 291

 Score = 42.3 bits (95), Expect = 0.015
 Identities = 20/51 (39%), Positives = 29/51 (56%), Gaps = 1/51 (1%)
 Frame = +2

Query: 359 LNEVRDQGSCGSCWAFGAVEAM-TDRVCTYSNGTKHFHFSAEDLLSCCPIC 508
           +N +RDQ  CGSCWAFG V A  ++    YSN  +    S ++++ C   C
Sbjct: 90  VNPIRDQKQCGSCWAFGTVAACESNYALLYSNLPQ---LSEQNIIDCATTC 137


>UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis
           zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa
           zea single nucleocapsid nuclear polyhedrosis virus)
          Length = 367

 Score = 42.3 bits (95), Expect = 0.015
 Identities = 22/62 (35%), Positives = 31/62 (50%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           LP+ +D    W D   +  ++DQG CGSCWAF A+  +  +     N  K    S + LL
Sbjct: 156 LPDYYD----WRDTNKVTPIKDQGVCGSCWAFVAIGNIESQYAIRHN--KLIDLSEQQLL 209

Query: 491 SC 496
            C
Sbjct: 210 DC 211


>UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium
           tetraurelia|Rep: Cathepsin L1 precursor - Paramecium
           tetraurelia
          Length = 314

 Score = 42.3 bits (95), Expect = 0.015
 Identities = 19/43 (44%), Positives = 27/43 (62%)
 Frame = +2

Query: 368 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           V++QGSCGSCWAF AV A+   + T     + +  S +DL+ C
Sbjct: 126 VKNQGSCGSCWAFSAVGAL--EINTDIELNRKYELSEQDLVDC 166


>UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2;
           Taeniidae|Rep: Cathepsin L-like cysteine proteinase -
           Taenia solium (Pork tapeworm)
          Length = 339

 Score = 41.9 bits (94), Expect = 0.019
 Identities = 23/64 (35%), Positives = 33/64 (51%)
 Frame = +2

Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
           A LP+  D RDK      + EV++QG+CGSCWAF +  A+       +   K    S + 
Sbjct: 122 AGLPDTVDWRDK----NLVTEVKNQGNCGSCWAFSSTGALEGAFAKKTG--KLISLSEQQ 175

Query: 485 LLSC 496
           L+ C
Sbjct: 176 LVDC 179


>UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia
           bovis|Rep: Cysteine protease 2 - Babesia bovis
          Length = 445

 Score = 41.9 bits (94), Expect = 0.019
 Identities = 21/59 (35%), Positives = 31/59 (52%)
 Frame = +2

Query: 320 NFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           NF+  D W     +  V+DQG CGSCWAF AV ++   +       +    S ++L+SC
Sbjct: 236 NFEDID-WRRADAVTPVKDQGMCGSCWAFAAVGSVESLLKRQKTDVR---LSEQELVSC 290


>UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16;
           Bromeliaceae|Rep: Fruit bromelain precursor - Ananas
           comosus (Pineapple)
          Length = 351

 Score = 41.9 bits (94), Expect = 0.019
 Identities = 21/65 (32%), Positives = 36/65 (55%)
 Frame = +2

Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481
           I+++P++ D    W D   +NEV++Q  CGSCW+F A+ A  + +     G      S +
Sbjct: 120 ISAVPQSID----WRDYGAVNEVKNQNPCGSCWSFAAI-ATVEGIYKIKTGYL-VSLSEQ 173

Query: 482 DLLSC 496
           ++L C
Sbjct: 174 EVLDC 178


>UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain;
           n=16; Chrysomelidae|Rep: Digestive cysteine protease
           intestain - Leptinotarsa decemlineata (Colorado potato
           beetle)
          Length = 326

 Score = 41.5 bits (93), Expect = 0.026
 Identities = 22/63 (34%), Positives = 31/63 (49%), Gaps = 2/63 (3%)
 Frame = +2

Query: 314 PENFDPRDK--WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
           PE+ +  D   W +   + EV+DQ  CGSCWAF A  A+  +    +N       S + L
Sbjct: 105 PEDLEVPDSIDWTEKGAVLEVKDQNPCGSCWAFSATGALEGQNAILNN--VKISLSEQQL 162

Query: 488 LSC 496
           L C
Sbjct: 163 LDC 165


>UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precursor;
           n=3; Plasmodium (Laverania)|Rep: Trophozoite cysteine
           proteinase precursor - Plasmodium falciparum
          Length = 569

 Score = 41.5 bits (93), Expect = 0.026
 Identities = 24/72 (33%), Positives = 38/72 (52%)
 Frame = +2

Query: 281 KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460
           K +  D+ + +PE  D R+K      ++E +DQG CGSCWAF +V    + V    N   
Sbjct: 323 KRNEKDIFSKVPEILDYREKG----IVHEPKDQGLCGSCWAFASV-GNIESVFAKKN-KN 376

Query: 461 HFHFSAEDLLSC 496
              FS ++++ C
Sbjct: 377 ILSFSEQEVVDC 388


>UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean
           endopeptidase) (Cysteine proteinase)
           (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
           Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor
           (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase)
           (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
           Vignain-2] - Vigna mungo (Rice bean) (Black gram)
          Length = 362

 Score = 41.5 bits (93), Expect = 0.026
 Identities = 23/71 (32%), Positives = 37/71 (52%)
 Frame = +2

Query: 284 THNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKH 463
           T  ++ + S+P + D R K      + +V+DQG CGSCWAF  + A+       +N  K 
Sbjct: 119 TFMYEKVGSVPASVDWRKKG----AVTDVKDQGQCGSCWAFSTIVAVEGINQIKTN--KL 172

Query: 464 FHFSAEDLLSC 496
              S ++L+ C
Sbjct: 173 VSLSEQELVDC 183


>UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin W,
           partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED:
           similar to Cathepsin W, partial - Ornithorhynchus
           anatinus
          Length = 229

 Score = 41.1 bits (92), Expect = 0.034
 Identities = 22/67 (32%), Positives = 35/67 (52%), Gaps = 2/67 (2%)
 Frame = +2

Query: 302 IASLPENFDPRDK--WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFS 475
           +AS+PE    ++   W     +  V++QGSCGSCWAF AV    + +     G +    S
Sbjct: 59  MASIPEGPLRKETCDWRKRGAITSVKNQGSCGSCWAFAAV-GNAESMWYLRAGKRLVSLS 117

Query: 476 AEDLLSC 496
            +++L C
Sbjct: 118 VQEVLDC 124


>UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;
           Brugia malayi|Rep: Cahepsin L-like cysteine protease -
           Brugia malayi (Filarial nematode worm)
          Length = 371

 Score = 41.1 bits (92), Expect = 0.034
 Identities = 22/62 (35%), Positives = 32/62 (51%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           LP++ D    W     + +V+DQG CGSCW F AV A+  +   +    K    S ++LL
Sbjct: 143 LPKSID----WRTSGAVTKVKDQGYCGSCWTFSAVGALEGQ--HFLQTGKLVELSMQNLL 196

Query: 491 SC 496
            C
Sbjct: 197 DC 198


>UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           L-like proteinase" precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 315

 Score = 41.1 bits (92), Expect = 0.034
 Identities = 18/52 (34%), Positives = 28/52 (53%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           W D   L  V+DQG CGSCWAF    ++  ++  + N  +    S ++L+ C
Sbjct: 117 WRDSAVLG-VKDQGQCGSCWAFSTTGSLEGQLAIHKN--QRVPLSEQELVDC 165


>UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain;
           n=9; Cucujiformia|Rep: Digestive cysteine proteinase
           intestain - Leptinotarsa decemlineata (Colorado potato
           beetle)
          Length = 326

 Score = 41.1 bits (92), Expect = 0.034
 Identities = 25/85 (29%), Positives = 38/85 (44%), Gaps = 2/85 (2%)
 Frame = +2

Query: 248 YRDEHFATLPIKTHNFDLIASLPENFDPRDK--WPDCPTLNEVRDQGSCGSCWAFGAVEA 421
           ++DE    +  K +    +A  PE  +  D   W     + +V+ QG CGSCWAF A  A
Sbjct: 83  FKDELRRQIKTKPNVEATLAVFPEGLEVPDSIDWTQKGAVLDVKYQGGCGSCWAFSATGA 142

Query: 422 MTDRVCTYSNGTKHFHFSAEDLLSC 496
           +  +    +N       S + LL C
Sbjct: 143 LEGQNAIVNN--VKIPLSEQQLLDC 165


>UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-like
           cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L or H-like cysteine
           peptidase - Trichomonas vaginalis G3
          Length = 435

 Score = 41.1 bits (92), Expect = 0.034
 Identities = 22/72 (30%), Positives = 37/72 (51%), Gaps = 1/72 (1%)
 Frame = +2

Query: 284 THNFDLIASLPENFDPRDKWPDCPTLNEV-RDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460
           T + D    LPE+F     W + P +  + RDQ +CGSCWA  A  +++ ++   +N T 
Sbjct: 204 TKHIDFKGDLPESFS----WRNLPNVVAMPRDQANCGSCWAQAAATSISSQISMRTNKTT 259

Query: 461 HFHFSAEDLLSC 496
               S + ++ C
Sbjct: 260 --KVSVQQIVDC 269


>UniRef50_A2E6N1 Cluster: Clan CA, family C1, cathepsin B-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin B-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 255

 Score = 41.1 bits (92), Expect = 0.034
 Identities = 21/78 (26%), Positives = 42/78 (53%)
 Frame = +2

Query: 263 FATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCT 442
           F    I++   D+   +P+ ++   ++P C  L  +  +  CG C+A+G ++AM+ R+C 
Sbjct: 15  FVDESIRSFPEDISIDIPDEYNFLQEYPHCD-LGPLTQE--CGCCYAYGPIKAMSHRICK 71

Query: 443 YSNGTKHFHFSAEDLLSC 496
             N  K    SA+ +++C
Sbjct: 72  AKN--KKTFLSAQFIVAC 87


>UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep:
           Cathepsin L - Kudoa thyrsites
          Length = 300

 Score = 41.1 bits (92), Expect = 0.034
 Identities = 22/74 (29%), Positives = 37/74 (50%)
 Frame = +2

Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454
           P +T   D+ ++LP + D    W     +  V++QG CGSCW+F A  A+       +  
Sbjct: 90  PKETATKDIKSTLPSSVD----WKALGKVTSVKNQGHCGSCWSFSAAGAIESAYAIKTG- 144

Query: 455 TKHFHFSAEDLLSC 496
            +  +FS + L+ C
Sbjct: 145 -ELVNFSEQQLVDC 157


>UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor;
           n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 355

 Score = 41.1 bits (92), Expect = 0.034
 Identities = 24/65 (36%), Positives = 34/65 (52%)
 Frame = +2

Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481
           I  LP++ D R K    P    V+DQG CGSCWAF  V A+ + +   + G      S +
Sbjct: 134 ITDLPKSVDWRKKGAVAP----VKDQGQCGSCWAFSTVAAV-EGINQITTGNLS-SLSEQ 187

Query: 482 DLLSC 496
           +L+ C
Sbjct: 188 ELIDC 192


>UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5;
           Theileria|Rep: Cysteine proteinase precursor - Theileria
           parva
          Length = 440

 Score = 41.1 bits (92), Expect = 0.034
 Identities = 21/67 (31%), Positives = 33/67 (49%)
 Frame = +2

Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFS 475
           DL     EN D    W    ++  V+DQ +CG CWAF  V ++     ++ +  K +  S
Sbjct: 224 DLAKLTGENLD----WRRSSSVTSVKDQSNCGGCWAFSTVGSVEGYYMSHFD--KSYELS 277

Query: 476 AEDLLSC 496
            ++LL C
Sbjct: 278 VQELLDC 284


>UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3;
           Theileria|Rep: Cysteine proteinase precursor - Theileria
           annulata
          Length = 441

 Score = 41.1 bits (92), Expect = 0.034
 Identities = 17/53 (32%), Positives = 30/53 (56%), Gaps = 1/53 (1%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGS-CGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           W     ++ ++DQG  CGSCWAF ++ ++      Y N  K +  S ++L++C
Sbjct: 233 WARTDAVSPIKDQGDHCGSCWAFSSIASVESLYRLYKN--KSYFLSEQELVNC 283


>UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18;
           Plasmodium|Rep: Cysteine proteinase precursor -
           Plasmodium vivax (strain Salvador I)
          Length = 583

 Score = 41.1 bits (92), Expect = 0.034
 Identities = 19/40 (47%), Positives = 27/40 (67%)
 Frame = +2

Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 415
           +L+A +PE  D R+K      ++E +DQG CGSCWAF +V
Sbjct: 334 NLLADVPEILDYREKG----IVHEPKDQGLCGSCWAFASV 369


>UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza
           sativa|Rep: Os09g0497500 protein - Oryza sativa subsp.
           japonica (Rice)
          Length = 349

 Score = 40.7 bits (91), Expect = 0.045
 Identities = 24/62 (38%), Positives = 36/62 (58%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           LP++ D R K      + EV++QG CGSCWAF AV A+ + +    NG +    S ++L+
Sbjct: 122 LPKSVDWRKKG----AVVEVKNQGDCGSCWAFSAVAAI-EGINQIKNG-ELVSLSEQELV 175

Query: 491 SC 496
            C
Sbjct: 176 DC 177


>UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 328

 Score = 40.7 bits (91), Expect = 0.045
 Identities = 20/45 (44%), Positives = 27/45 (60%), Gaps = 1/45 (2%)
 Frame = +2

Query: 311 LPENFDPRDKWPD-CPTLNEVRDQGSCGSCWAFGAVEAMTDRVCT 442
           +P+ FD RD + D  P +  V+DQ  CG CWAF A  A+T+   T
Sbjct: 97  IPDYFDLRDIYVDGSPVVGPVKDQEQCGCCWAF-ATTAITEAANT 140


>UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba
           histolytica|Rep: Cysteine protease 19 - Entamoeba
           histolytica
          Length = 324

 Score = 40.7 bits (91), Expect = 0.045
 Identities = 18/49 (36%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
 Frame = +2

Query: 359 LNEVRDQGSCGSCWAFGAVEAM-TDRVCTYSN-GTKHFHFSAEDLLSCC 499
           +  V+DQG+CGSC+AF +V  M T  + +Y +    ++  S  +++SCC
Sbjct: 112 MTPVKDQGNCGSCYAFSSVALMETAVLLSYDDLSPSNYALSTAEIVSCC 160


>UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           L-like proteinase precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 317

 Score = 40.7 bits (91), Expect = 0.045
 Identities = 19/39 (48%), Positives = 25/39 (64%)
 Frame = +2

Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
           ++PE+ D R+K      +N VRDQ  CGSCWAF A  A+
Sbjct: 103 TVPESIDWREKG----AVNPVRDQEQCGSCWAFSAAGAL 137


>UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor;
           n=176; Viridiplantae|Rep: Cysteine proteinase RD21a
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 462

 Score = 40.7 bits (91), Expect = 0.045
 Identities = 19/38 (50%), Positives = 24/38 (63%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
           LPE+ D R K      + EV+DQG CGSCWAF  + A+
Sbjct: 137 LPESIDWRKKG----AVAEVKDQGGCGSCWAFSTIGAV 170


>UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|Rep:
           Cathepsin L precursor - Schistosoma mansoni (Blood
           fluke)
          Length = 319

 Score = 40.7 bits (91), Expect = 0.045
 Identities = 17/35 (48%), Positives = 25/35 (71%)
 Frame = +2

Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 406
           + ++P+NFD R+K      + EV++QG CGSCWAF
Sbjct: 102 VNNIPKNFDWREKG----AVTEVKNQGMCGSCWAF 132


>UniRef50_UPI00006CB77B Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 382

 Score = 40.3 bits (90), Expect = 0.059
 Identities = 19/60 (31%), Positives = 34/60 (56%), Gaps = 1/60 (1%)
 Frame = +2

Query: 320 NFDPRDKWPDCPTLNEVRDQGS-CGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           +F+   K+P C  +  + +QG  C + ++  AV ++ DR+C  S G  +F  SA+  +SC
Sbjct: 128 SFNFHTKYPQC--VRPIANQGKDCSASYSIAAVSSVADRLCMASEGDFNFGLSAQPTISC 185


>UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3;
           Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays
           (Maize)
          Length = 493

 Score = 40.3 bits (90), Expect = 0.059
 Identities = 15/28 (53%), Positives = 19/28 (67%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
           W +   + EV+DQG CG CWAF AV A+
Sbjct: 170 WRERGAVAEVKDQGQCGGCWAFSAVAAV 197


>UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3;
           Theileria|Rep: Cysteine protease, putative - Theileria
           annulata
          Length = 580

 Score = 40.3 bits (90), Expect = 0.059
 Identities = 19/52 (36%), Positives = 28/52 (53%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           W +   +NEV +QGSCGSCWA  + +  +       N  K   FS++ L+ C
Sbjct: 370 WRESGFVNEVVNQGSCGSCWAIASEDIFSTFKSIKKN--KLMKFSSQQLVDC 419


>UniRef50_Q22W19 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 332

 Score = 40.3 bits (90), Expect = 0.059
 Identities = 23/73 (31%), Positives = 33/73 (45%), Gaps = 1/73 (1%)
 Frame = +2

Query: 281 KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460
           K     LI SL  +  P   W     +  V++QG CGSCWAF  V  +      Y+  T 
Sbjct: 109 KRQKSHLIYSLKGDVAPSIDWRQKNAVTPVKNQGQCGSCWAFSTVGGLEG---AYAIATG 165

Query: 461 HF-HFSAEDLLSC 496
           +   FS + ++ C
Sbjct: 166 NLTSFSEQQIVDC 178


>UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:
           Aca s 1 allergen - Acarus siro (Dust mite)
          Length = 331

 Score = 40.3 bits (90), Expect = 0.059
 Identities = 21/63 (33%), Positives = 31/63 (49%)
 Frame = +2

Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
           +LPE FD R K      L  + +QG CG+CWAF ++  +        N   H   S ++L
Sbjct: 108 NLPETFDWRSK------LGPIENQGRCGACWAFASLATVEAAFAIKYN--THIRLSKQEL 159

Query: 488 LSC 496
           + C
Sbjct: 160 VEC 162


>UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1
           precursor; n=20; Psoroptidia|Rep: Major mite fecal
           allergen Der f 1 precursor - Dermatophagoides farinae
           (House-dust mite)
          Length = 321

 Score = 40.3 bits (90), Expect = 0.059
 Identities = 18/47 (38%), Positives = 24/47 (51%)
 Frame = +2

Query: 356 TLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           T+  +R QG CGSCWAF  V A       Y N +     S ++L+ C
Sbjct: 120 TVTPIRMQGGCGSCWAFSGVAATESAYLAYRNTS--LDLSEQELVDC 164


>UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Spodoptera
           litura multicapsid nucleopolyhedrovirus (SpltMNPV)
          Length = 337

 Score = 40.3 bits (90), Expect = 0.059
 Identities = 21/64 (32%), Positives = 31/64 (48%)
 Frame = +2

Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
           A  PE+FD    W     + +V++QG CGSCWAF A+  +  +     +       S + 
Sbjct: 124 ARTPESFD----WRKLNKVTKVKEQGVCGSCWAFAAIGNIESQYAIMHDSL--IDLSEQQ 177

Query: 485 LLSC 496
           LL C
Sbjct: 178 LLDC 181


>UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep:
           MGC107932 protein - Xenopus tropicalis (Western clawed
           frog) (Silurana tropicalis)
          Length = 333

 Score = 39.9 bits (89), Expect = 0.079
 Identities = 22/76 (28%), Positives = 38/76 (50%), Gaps = 2/76 (2%)
 Frame = +2

Query: 275 PIKTHNFDLIA-SLPENFDPRDKWPDCPTLNEVRDQGS-CGSCWAFGAVEAMTDRVCTYS 448
           P+K  ++   + ++P+  D    W     +  V++QG+ CGSCWAF  V  M  R C  +
Sbjct: 102 PVKAESYSYTSITIPKEVD----WRKSNCVTPVKNQGTFCGSCWAFATVGVMESRYCIRT 157

Query: 449 NGTKHFHFSAEDLLSC 496
              +  + S + L+ C
Sbjct: 158 K--ELLNLSEQQLVDC 171


>UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa
           (japonica cultivar-group)|Rep: Os09g0562700 protein -
           Oryza sativa subsp. japonica (Rice)
          Length = 235

 Score = 39.9 bits (89), Expect = 0.079
 Identities = 19/46 (41%), Positives = 27/46 (58%)
 Frame = +2

Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           + EV+DQG CGSCWAF  V A+ + +     G K    S ++L+ C
Sbjct: 21  VTEVKDQGRCGSCWAFSTV-AVVEGIQKIKKG-KLVSLSEQELVDC 64


>UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3;
           Curculionidae|Rep: Cysteine proteinase - Hypera postica
           (alfalfa weevil)
          Length = 324

 Score = 39.9 bits (89), Expect = 0.079
 Identities = 18/44 (40%), Positives = 25/44 (56%)
 Frame = +2

Query: 368 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC 499
           V+DQG CGSCWAF ++   T+      +G K    S + L+ CC
Sbjct: 127 VKDQGDCGSCWAF-SITGSTEGAYARKSG-KLVSLSEQQLIDCC 168


>UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep:
           Toxopain-2 - Toxoplasma gondii
          Length = 422

 Score = 39.9 bits (89), Expect = 0.079
 Identities = 20/67 (29%), Positives = 29/67 (43%)
 Frame = +2

Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFS 475
           +L+  LP        W     +  V+DQ  CGSCWAF    A+    C  +   K    S
Sbjct: 196 ELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEGAHCAKTG--KLVSLS 253

Query: 476 AEDLLSC 496
            ++L+ C
Sbjct: 254 EQELMDC 260


>UniRef50_O16454 Cluster: Temporarily assigned gene name protein
           196; n=4; Bilateria|Rep: Temporarily assigned gene name
           protein 196 - Caenorhabditis elegans
          Length = 477

 Score = 39.9 bits (89), Expect = 0.079
 Identities = 17/32 (53%), Positives = 24/32 (75%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 406
           LPE+FD R+K      + +V++QG+CGSCWAF
Sbjct: 264 LPESFDWREKG----AVTQVKNQGNCGSCWAF 291


>UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=17; Trichomonas vaginalis|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 318

 Score = 39.9 bits (89), Expect = 0.079
 Identities = 13/27 (48%), Positives = 18/27 (66%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEA 421
           W +   +N ++DQ  CGSCWAF  V+A
Sbjct: 106 WRNAKIVNPIKDQAQCGSCWAFSVVQA 132


>UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin l;
           n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
           similar to cathepsin l - Strongylocentrotus purpuratus
          Length = 489

 Score = 39.5 bits (88), Expect = 0.10
 Identities = 19/63 (30%), Positives = 32/63 (50%)
 Frame = +2

Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
           ++P++ D    W     ++ V+DQ  CGSCW+FG+ E +   V  +    K    S + L
Sbjct: 266 AVPDHID----WNVLGAVSPVKDQAVCGSCWSFGSAETIEGAV--FMQSGKRVRLSQQML 319

Query: 488 LSC 496
           + C
Sbjct: 320 MDC 322


>UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine
           protease; n=1; Strongylocentrotus purpuratus|Rep:
           PREDICTED: similar to cysteine protease -
           Strongylocentrotus purpuratus
          Length = 494

 Score = 39.5 bits (88), Expect = 0.10
 Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 1/51 (1%)
 Frame = +2

Query: 275 PIKTHNFDLIASLPENFDPRD-KWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
           P+K       A++P+   P +  W     +  V++QG CGSCWAF A+  M
Sbjct: 223 PLKKTGIKKQAAIPQGPVPEEYDWRTHGAVTPVKNQGMCGSCWAFSAIGNM 273


>UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin
           L - Misgurnus mizolepis (Mud loach)
          Length = 337

 Score = 39.5 bits (88), Expect = 0.10
 Identities = 18/52 (34%), Positives = 27/52 (51%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           W +   +  V+DQG CGSCWAF    AM  ++  +    K    S ++L+ C
Sbjct: 122 WREKGYVTPVKDQGECGSCWAFSTTGAMEGQM--FRKQGKLVSLSEQNLVDC 171


>UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba
           histolytica|Rep: Cysteine protease 17 - Entamoeba
           histolytica
          Length = 420

 Score = 39.5 bits (88), Expect = 0.10
 Identities = 24/73 (32%), Positives = 34/73 (46%), Gaps = 5/73 (6%)
 Frame = +2

Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT-----K 460
           D++  LPE  D R        L  +R+Q  CG CW+F +V A+  R     N T     +
Sbjct: 162 DIVKELPEGIDFRK----FGKLTYIREQTGCGGCWSFASVCALESRYLIDYNLTVDDVGR 217

Query: 461 HFHFSAEDLLSCC 499
            +  S + LL CC
Sbjct: 218 TWALSEQQLLDCC 230


>UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n=1;
           Myxobolus cerebralis|Rep: Cathepsin Z-like cysteine
           proteinase - Myxobolus cerebralis
          Length = 297

 Score = 39.5 bits (88), Expect = 0.10
 Identities = 21/68 (30%), Positives = 37/68 (54%), Gaps = 5/68 (7%)
 Frame = +2

Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGS---CGSCWAFGAVEAMTDRVCTYSNGT--KHFHF 472
           ++P++FD    W +   L+ V++Q     CGSCWAF +   + DR+    N +   HF  
Sbjct: 49  NMPKSFD----WRENAYLSSVKNQHLPTYCGSCWAFASTSTIADRIYIAKNLSHFDHFSL 104

Query: 473 SAEDLLSC 496
           S + +++C
Sbjct: 105 SVQVVIAC 112


>UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4;
           Caenorhabditis|Rep: Cathepsin z protein 1 -
           Caenorhabditis elegans
          Length = 306

 Score = 39.5 bits (88), Expect = 0.10
 Identities = 26/79 (32%), Positives = 39/79 (49%), Gaps = 7/79 (8%)
 Frame = +2

Query: 281 KTHNFDLIASLPENFDPRDKWPDCPTLNEV---RDQGS---CGSCWAFGAVEAMTDRV-C 439
           +T +FD    LP+ +D    W D   +N     R+Q     CGSCWAFGA  A+ DR+  
Sbjct: 56  ETEDFDS-EDLPKTWD----WRDANGINYASADRNQHIPQYCGSCWAFGATSALADRINI 110

Query: 440 TYSNGTKHFHFSAEDLLSC 496
              N     + S ++++ C
Sbjct: 111 KRKNAWPQAYLSVQEVIDC 129


>UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or
           H-like cysteine peptidase; n=1; Trichomonas vaginalis
           G3|Rep: Clan CA, family C1, cathepsin L, S or H-like
           cysteine peptidase - Trichomonas vaginalis G3
          Length = 473

 Score = 39.5 bits (88), Expect = 0.10
 Identities = 15/33 (45%), Positives = 22/33 (66%), Gaps = 1/33 (3%)
 Frame = +2

Query: 341 WPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRV 436
           W D P +  + RDQ +CGSCWAFG  E++  ++
Sbjct: 257 WRDVPNVVGKPRDQVACGSCWAFGTAESLESQL 289


>UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 452

 Score = 39.5 bits (88), Expect = 0.10
 Identities = 24/72 (33%), Positives = 39/72 (54%), Gaps = 1/72 (1%)
 Frame = +2

Query: 284 THNFDLIASLPENFDPRDKWPDCPTLNEV-RDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460
           T++  +I +LPE+F     W + P + E   DQ  CG+C+AFGA EA+  +    +N  +
Sbjct: 216 TYDQKVIQNLPESFS----WRNVPYVLEYPHDQAVCGTCFAFGASEAINGQFSLRAN--R 269

Query: 461 HFHFSAEDLLSC 496
               S + L+ C
Sbjct: 270 SIITSVQQLVDC 281


>UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, whole
           genome shotgun sequence; n=7; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_22,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 350

 Score = 39.5 bits (88), Expect = 0.10
 Identities = 23/57 (40%), Positives = 32/57 (56%), Gaps = 6/57 (10%)
 Frame = +2

Query: 254 DEHFA----TLPIKTHNFDLIASLPENFD--PRDKWPDCPTLNEVRDQGSCGSCWAF 406
           DE FA    TL +   + ++  +  EN +  P D W     +N+V+DQG CGSCWAF
Sbjct: 114 DEEFAATYLTLKVNPDDLEVPKAQFENVNATPID-WRTRGAVNKVKDQGQCGSCWAF 169


>UniRef50_Q2FLC7 Cluster: Periplasmic copper-binding precursor; n=1;
           Methanospirillum hungatei JF-1|Rep: Periplasmic
           copper-binding precursor - Methanospirillum hungatei
           (strain JF-1 / DSM 864)
          Length = 1092

 Score = 39.5 bits (88), Expect = 0.10
 Identities = 18/48 (37%), Positives = 26/48 (54%)
 Frame = +2

Query: 281 KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
           K  +  ++A  P  FD RD       +  +RDQG  GSCW F AV+++
Sbjct: 77  KIRSLSILADYPSKFDLRDS----KRVPAIRDQGQSGSCWDFAAVKSL 120


>UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27;
           Eukaryota|Rep: Cysteine proteinase GP-I - Zingiber
           officinale (Ginger)
          Length = 221

 Score = 39.5 bits (88), Expect = 0.10
 Identities = 18/38 (47%), Positives = 25/38 (65%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
           LP++ D R+K    P    V++QG CGSCWAF A+ A+
Sbjct: 3   LPDSIDWREKGAVVP----VKNQGGCGSCWAFDAIAAV 36


>UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 360

 Score = 39.1 bits (87), Expect = 0.14
 Identities = 23/66 (34%), Positives = 34/66 (51%)
 Frame = +2

Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
           +LP +FD RDK    P    V+ Q  CG CWAF  V+++ + +     G K    S + +
Sbjct: 130 NLPASFDWRDKGAITP----VKVQNGCGGCWAFSTVQSI-EGLYFLKTG-KLESLSTQQV 183

Query: 488 LSCCPI 505
           + CC I
Sbjct: 184 IDCCRI 189


>UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba
           culbertsoni|Rep: Cysteine proteinase - Acanthamoeba
           culbertsoni
          Length = 482

 Score = 39.1 bits (87), Expect = 0.14
 Identities = 22/43 (51%), Positives = 27/43 (62%), Gaps = 3/43 (6%)
 Frame = +2

Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF---GAVEAM 424
           AS+P N+D R K    P    V++QGSC SCWAF   GAVE +
Sbjct: 154 ASIPANWDWRTKGAVTP----VKNQGSCASCWAFVATGAVEGV 192


>UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease
           Gip1p; n=4; Tetrahymena thermophila|Rep:
           Granule-biosynthesis induced protease Gip1p -
           Tetrahymena thermophila
          Length = 345

 Score = 39.1 bits (87), Expect = 0.14
 Identities = 19/60 (31%), Positives = 30/60 (50%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICDWDA 520
           W     LN V++QG+CGSCW F A   + +      N  +   FS + L+ C  +  +D+
Sbjct: 139 WRKRGVLNPVKNQGTCGSCWTF-ATAGILESFNQIKN-KQLLKFSEQQLVDCVSLAGYDS 196


>UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_163_69918_68548 - Giardia lamblia
           ATCC 50803
          Length = 456

 Score = 39.1 bits (87), Expect = 0.14
 Identities = 21/59 (35%), Positives = 32/59 (54%)
 Frame = +2

Query: 239 NGSYRDEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 415
           +G+ R  +  T P+ T     +  +P ++D R+     P    V+DQG CGSCWAFG +
Sbjct: 58  SGTCRQVYTLTDPLST-----LPEIPTSYDLREAGLQVP----VKDQGVCGSCWAFGTM 107


>UniRef50_Q23VA1 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 397

 Score = 39.1 bits (87), Expect = 0.14
 Identities = 18/46 (39%), Positives = 27/46 (58%)
 Frame = +2

Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           ++ V+DQG CG CWAF A  A+ + V    N T    +S ++L+ C
Sbjct: 192 VSPVKDQGRCGCCWAFSAT-ALAESVNLMRNNTLQ-QYSEQELVDC 235


>UniRef50_Q23FQ5 Cluster: Papain family cysteine protease containing
           protein; n=4; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 332

 Score = 39.1 bits (87), Expect = 0.14
 Identities = 20/66 (30%), Positives = 29/66 (43%)
 Frame = +2

Query: 320 NFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC 499
           N+     W +   LN +++QG CGSC AFG    +      Y    +   FS + LL C 
Sbjct: 124 NYPTSVDWRNSGALNPIQNQGQCGSCAAFGTAGVLES--FYYLKSKQLLKFSEQQLLDCA 181

Query: 500 PICDWD 517
               +D
Sbjct: 182 RQAGFD 187


>UniRef50_Q22DX2 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 358

 Score = 39.1 bits (87), Expect = 0.14
 Identities = 22/63 (34%), Positives = 30/63 (47%)
 Frame = +2

Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
           S+P ++D R   P    L  V +QG CGSCWAF    A+        N T   + S + L
Sbjct: 146 SIPSSWDIRTDGPGL--LQPVENQGQCGSCWAFSTSGAVESYYSAKKNIT--LNLSKQQL 201

Query: 488 LSC 496
           + C
Sbjct: 202 VDC 204


>UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 493

 Score = 39.1 bits (87), Expect = 0.14
 Identities = 21/63 (33%), Positives = 31/63 (49%), Gaps = 1/63 (1%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFH-FSAEDL 487
           LP  F  R+   +   + + RDQ +CGSCWAFG  E +      +   +K FH  S   +
Sbjct: 266 LPRTFSWRN---NTQVVGKPRDQVACGSCWAFGTAEVLEG---AFGIASKEFHEVSTNQI 319

Query: 488 LSC 496
           + C
Sbjct: 320 MDC 322


>UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|Rep:
           CG4847-PD, isoform D - Drosophila melanogaster (Fruit
           fly)
          Length = 420

 Score = 39.1 bits (87), Expect = 0.14
 Identities = 21/68 (30%), Positives = 35/68 (51%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           +P+ FD    W +   +  V+ QG+CGSCWAF    A+     T+       + S ++L+
Sbjct: 203 IPDAFD----WREHGGVTPVKFQGTCGSCWAFATTGAIEGH--TFRKTGSLPNLSEQNLV 256

Query: 491 SCCPICDW 514
            C P+ D+
Sbjct: 257 DCGPVEDF 264


>UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1;
           Methanospirillum hungatei JF-1|Rep: Peptidase C1A,
           papain precursor - Methanospirillum hungatei (strain
           JF-1 / DSM 864)
          Length = 1096

 Score = 39.1 bits (87), Expect = 0.14
 Identities = 18/37 (48%), Positives = 23/37 (62%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEA 421
           LP +FD R+   D  T   +++QGSCGSCWAF    A
Sbjct: 321 LPTSFDWRNNGGDYTT--PIKNQGSCGSCWAFATTGA 355


>UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F
           like protease; n=1; Nasonia vitripennis|Rep: PREDICTED:
           similar to cathepsin F like protease - Nasonia
           vitripennis
          Length = 1036

 Score = 38.7 bits (86), Expect = 0.18
 Identities = 17/36 (47%), Positives = 23/36 (63%), Gaps = 1/36 (2%)
 Frame = +2

Query: 302 IASLPENFDPRD-KWPDCPTLNEVRDQGSCGSCWAF 406
           +A++P+   P D  W     +  V+DQGSCGSCWAF
Sbjct: 809 MATIPDIELPSDYDWRHHNVVTPVKDQGSCGSCWAF 844


>UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823
           protein, partial; n=1; Ornithorhynchus anatinus|Rep:
           PREDICTED: similar to MGC81823 protein, partial -
           Ornithorhynchus anatinus
          Length = 361

 Score = 38.7 bits (86), Expect = 0.18
 Identities = 14/24 (58%), Positives = 17/24 (70%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAFGA 412
           W D   +  V+DQG CGSCWAFG+
Sbjct: 196 WRDHGYVTPVKDQGRCGSCWAFGS 219


>UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep:
           Cathepsin K - Danio rerio (Zebrafish) (Brachydanio
           rerio)
          Length = 333

 Score = 38.7 bits (86), Expect = 0.18
 Identities = 18/43 (41%), Positives = 26/43 (60%)
 Frame = +2

Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
           D +  LP++ D R        +  V++QGSCGSCWAF +V A+
Sbjct: 113 DRVGKLPKSIDYRK----LGYVTSVKNQGSCGSCWAFSSVGAL 151


>UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel
           protein - Danio rerio (Zebrafish) (Brachydanio rerio)
          Length = 328

 Score = 38.7 bits (86), Expect = 0.18
 Identities = 19/55 (34%), Positives = 31/55 (56%)
 Frame = +2

Query: 332 RDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           R  W +   ++ V++QG CGSCWAF AV ++  ++   +        SA++LL C
Sbjct: 116 RVNWTEHGMVSPVQNQGPCGSCWAFSAVGSLEAQMKRRTAAL--VPLSAQNLLDC 168


>UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4;
           Hymenostomatida|Rep: Phospholipase A1 - Tetrahymena
           thermophila
          Length = 320

 Score = 38.7 bits (86), Expect = 0.18
 Identities = 23/93 (24%), Positives = 45/93 (48%), Gaps = 1/93 (1%)
 Frame = +2

Query: 257 EHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 436
           + F TL  K ++ ++  +  E  +    W     +  V++QGSCGSCWAF  + A+   +
Sbjct: 92  QQFLTLHEKVNSTEVYRAQGEATEV--DWTAKGKVTPVKNQGSCGSCWAFSTIGAVESAL 149

Query: 437 CTYSNGTKH-FHFSAEDLLSCCPICDWDAAEEC 532
                G ++  + + ++ + C     +D +E C
Sbjct: 150 WIAGQGEQNTLNLAEQEQVDCAKSPKYD-SEGC 181


>UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1;
           Rhipicephalus appendiculatus|Rep: Midgut cysteine
           proteinase 2 - Rhipicephalus appendiculatus (Brown ear
           tick)
          Length = 564

 Score = 38.7 bits (86), Expect = 0.18
 Identities = 19/47 (40%), Positives = 23/47 (48%)
 Frame = +2

Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 415
           P   H F   A LP+  D    W     +  V+DQ  CGSCW+FG V
Sbjct: 335 PFPRHRFT--AKLPDQID----WRPYGAVTPVKDQAVCGSCWSFGTV 375


>UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1;
           Rhipicephalus appendiculatus|Rep: Midgut cysteine
           proteinase 4 - Rhipicephalus appendiculatus (Brown ear
           tick)
          Length = 345

 Score = 38.7 bits (86), Expect = 0.18
 Identities = 16/53 (30%), Positives = 29/53 (54%)
 Frame = +2

Query: 338 KWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           +W +   +  V++QG CGSCWAF +  A+  +V  +    +    S ++L+ C
Sbjct: 131 EWRENGFVTPVKNQGQCGSCWAFSSTGALEGQV--FKRTRRLISLSEQNLMDC 181


>UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 664

 Score = 38.7 bits (86), Expect = 0.18
 Identities = 17/52 (32%), Positives = 28/52 (53%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           W     +++V++QGSCGSC+AF  V A+      Y    +    S ++L+ C
Sbjct: 476 WRTWGMVSKVKNQGSCGSCYAFSTVGALESHY--YRKNNRMLDLSEQNLVDC 525


>UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes
           vastus|Rep: Cathepsin L - Aphrocallistes vastus
          Length = 329

 Score = 38.7 bits (86), Expect = 0.18
 Identities = 17/52 (32%), Positives = 27/52 (51%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           W     +  V++QG CGSCW+F A  ++  +    S   K   FS ++L+ C
Sbjct: 121 WRSKGVVTPVKNQGQCGSCWSFSATGSLEGQYAIKSG--KLVSFSEQELVDC 170


>UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Rep:
           Cysteine protease - Babesia equi
          Length = 438

 Score = 38.7 bits (86), Expect = 0.18
 Identities = 18/52 (34%), Positives = 30/52 (57%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           W     +  V+DQG+CGSCWAF AV ++ + +     G +    S ++L++C
Sbjct: 230 WRKLNGVTPVKDQGNCGSCWAFAAVGSV-ESLYLIKKG-QALDLSEQELVNC 279


>UniRef50_A5KBM2 Cluster: Serine-repeat antigen; n=3;
           Plasmodium|Rep: Serine-repeat antigen - Plasmodium vivax
          Length = 1014

 Score = 38.7 bits (86), Expect = 0.18
 Identities = 21/62 (33%), Positives = 31/62 (50%), Gaps = 3/62 (4%)
 Frame = +2

Query: 320 NFDPRDKWPD---CPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           N++  D+W D   C +  EV +QG+CG CW F +   +    C    G  HF  SA  + 
Sbjct: 555 NYEYCDRWKDKTSCISNIEVEEQGNCGLCWVFASKLHLETIRC--MRGYGHFRSSALYVA 612

Query: 491 SC 496
           +C
Sbjct: 613 NC 614


>UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3;
           Dictyostelium discoideum|Rep: Cysteine proteinase 1
           precursor - Dictyostelium discoideum (Slime mold)
          Length = 343

 Score = 38.7 bits (86), Expect = 0.18
 Identities = 25/89 (28%), Positives = 42/89 (47%), Gaps = 2/89 (2%)
 Frame = +2

Query: 272 LPIKTHNFD-LIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYS 448
           LP+  +  D  I S+P  FD    W     +  V++QG CGSCW+F     +  +   + 
Sbjct: 104 LPVADYLDDEFINSIPTAFD----WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQ--HFI 157

Query: 449 NGTKHFHFSAEDLLSCCPIC-DWDAAEEC 532
           +  K    S ++L+ C   C +++  E C
Sbjct: 158 SQNKLVSLSEQNLVDCDHECMEYEGEEAC 186


>UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18;
           Magnoliophyta|Rep: Thiol protease aleurain precursor -
           Arabidopsis thaliana (Mouse-ear cress)
          Length = 358

 Score = 38.7 bits (86), Expect = 0.18
 Identities = 20/42 (47%), Positives = 26/42 (61%), Gaps = 3/42 (7%)
 Frame = +2

Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF---GAVEA 421
           A+LPE  D    W +   ++ V+DQG CGSCW F   GA+EA
Sbjct: 139 AALPETKD----WREDGIVSPVKDQGGCGSCWTFSTTGALEA 176


>UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin O
           precursor; n=1; Tribolium castaneum|Rep: PREDICTED:
           similar to Cathepsin O precursor - Tribolium castaneum
          Length = 326

 Score = 38.3 bits (85), Expect = 0.24
 Identities = 18/64 (28%), Positives = 33/64 (51%)
 Frame = +2

Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
           A++P   D R+K      +  + +QGSCG+CWA+  +E +       +N  K    S ++
Sbjct: 119 ATVPNKVDWREK----NAVTRIYNQGSCGACWAYSVIETVESMNAIKTN--KSEELSVQE 172

Query: 485 LLSC 496
           ++ C
Sbjct: 173 IIDC 176


>UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome
           shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
           Chromosome 20 SCAF14744, whole genome shotgun sequence -
           Tetraodon nigroviridis (Green puffer)
          Length = 175

 Score = 38.3 bits (85), Expect = 0.24
 Identities = 18/41 (43%), Positives = 23/41 (56%)
 Frame = +2

Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
           I  LP  FD    W D   +  V++Q +CGSCWAF  V A+
Sbjct: 56  IKGLPARFD----WRDNAVVGPVQNQQACGSCWAFSVVGAV 92


>UniRef50_A6GBK8 Cluster: Peptidase C1A, papain; n=1; Plesiocystis
           pacifica SIR-1|Rep: Peptidase C1A, papain - Plesiocystis
           pacifica SIR-1
          Length = 650

 Score = 38.3 bits (85), Expect = 0.24
 Identities = 18/46 (39%), Positives = 24/46 (52%)
 Frame = +2

Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           L  +R+QG+CGSCWAF AV  +       + G      S +  LSC
Sbjct: 176 LGAIRNQGACGSCWAFAAVSTIEASNAIVNGGRS--DLSEQHALSC 219


>UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza
           sativa|Rep: Cysteine proteinase-like - Oryza sativa
           subsp. japonica (Rice)
          Length = 360

 Score = 38.3 bits (85), Expect = 0.24
 Identities = 20/52 (38%), Positives = 28/52 (53%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           W     + EV++Q SCGSCWAF AV A T+ +   + G      S + +L C
Sbjct: 143 WRARGAVTEVKNQRSCGSCWAFAAV-AATEGLVQLATGNL-VSLSEQQVLDC 192


>UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4;
           core eudicotyledons|Rep: Papain-like cysteine peptidase
           XBCP3 - Arabidopsis thaliana (Mouse-ear cress)
          Length = 437

 Score = 38.3 bits (85), Expect = 0.24
 Identities = 14/28 (50%), Positives = 18/28 (64%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
           W     +  V+DQGSCG+CW+F A  AM
Sbjct: 124 WRKKGAVTNVKDQGSCGACWSFSATGAM 151


>UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza
           sativa|Rep: Os01g0347600 protein - Oryza sativa subsp.
           japonica (Rice)
          Length = 343

 Score = 38.3 bits (85), Expect = 0.24
 Identities = 14/19 (73%), Positives = 17/19 (89%)
 Frame = +2

Query: 368 VRDQGSCGSCWAFGAVEAM 424
           V+DQG+CGSCWAF AV A+
Sbjct: 140 VKDQGACGSCWAFAAVAAI 158


>UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6;
           Magnoliophyta|Rep: Cysteine protease Cp5 - Actinidia
           deliciosa (Kiwi)
          Length = 509

 Score = 38.3 bits (85), Expect = 0.24
 Identities = 18/52 (34%), Positives = 28/52 (53%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           W     +  V+DQG CGSCWAF +  A+ + +   +NG      S ++L+ C
Sbjct: 153 WRKYGIVTGVKDQGDCGSCWAFSSTGAI-EGINALANGDL-ISLSEQELVDC 202


>UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1;
           Oryza sativa (japonica cultivar-group)|Rep: Putative
           uncharacterized protein - Oryza sativa subsp. japonica
           (Rice)
          Length = 289

 Score = 38.3 bits (85), Expect = 0.24
 Identities = 14/19 (73%), Positives = 17/19 (89%)
 Frame = +2

Query: 368 VRDQGSCGSCWAFGAVEAM 424
           V+DQG+CGSCWAF AV A+
Sbjct: 139 VKDQGACGSCWAFAAVAAI 157


>UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_567_6496_7413 - Giardia lamblia ATCC
           50803
          Length = 305

 Score = 38.3 bits (85), Expect = 0.24
 Identities = 19/64 (29%), Positives = 29/64 (45%)
 Frame = +2

Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
           A  P+  D R   P+C    E  DQ  C  C+AF  + A++ R C      +    S + 
Sbjct: 79  AGSPDRLDYRQTHPEC--FFEPEDQKECSCCYAFATLGALSTRRCIAKLDPQAVSLSVQH 136

Query: 485 LLSC 496
           ++SC
Sbjct: 137 MVSC 140


>UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC06231 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 372

 Score = 38.3 bits (85), Expect = 0.24
 Identities = 20/64 (31%), Positives = 31/64 (48%)
 Frame = +2

Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
           A LP+  D    W     +  V++QG CGSCWAF +  A+  +   Y    +  + S + 
Sbjct: 148 AKLPDRVD----WRRNGAVTPVKNQGQCGSCWAFSSTGAIEGQ--HYRKTNRLVNLSEQQ 201

Query: 485 LLSC 496
           L+ C
Sbjct: 202 LIDC 205


>UniRef50_Q24E33 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 328

 Score = 38.3 bits (85), Expect = 0.24
 Identities = 18/52 (34%), Positives = 26/52 (50%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           W     +  V++QGSCGSCWAF    A+       +N  +   FS + L+ C
Sbjct: 133 WTAQGAVTPVKNQGSCGSCWAFSTTGALEGSYFLKNN--QLISFSEQQLVDC 182


>UniRef50_Q23FS6 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 330

 Score = 38.3 bits (85), Expect = 0.24
 Identities = 15/52 (28%), Positives = 24/52 (46%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           W     ++ V+DQG CGSCWAF    ++   +       +    S + L+ C
Sbjct: 123 WVTRGKVSAVKDQGQCGSCWAFSTTGSVESALIIAGYANQTIDLSEQQLVDC 174


>UniRef50_Q22AB1 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 344

 Score = 38.3 bits (85), Expect = 0.24
 Identities = 24/64 (37%), Positives = 32/64 (50%)
 Frame = +2

Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
           + LPE+FD RDK    P     + Q +CGSCW F A   + +       G +  HFS + 
Sbjct: 129 SDLPESFDWRDKGIITPA----KFQNTCGSCWTF-ATTGVIESQYALKYG-ELLHFSEQM 182

Query: 485 LLSC 496
           LL C
Sbjct: 183 LLDC 186


>UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathepsin
           o - Aedes aegypti (Yellowfever mosquito)
          Length = 375

 Score = 38.3 bits (85), Expect = 0.24
 Identities = 19/39 (48%), Positives = 23/39 (58%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMT 427
           LP+  D RDK    P    VR QGSCG+CWA   V+ +T
Sbjct: 153 LPKVVDWRDKGVVAP----VRSQGSCGACWAISVVDTIT 187


>UniRef50_O96163 Cluster: Cysteine protease, putative; n=5;
           Plasmodium|Rep: Cysteine protease, putative - Plasmodium
           falciparum (isolate 3D7)
          Length = 946

 Score = 38.3 bits (85), Expect = 0.24
 Identities = 29/87 (33%), Positives = 40/87 (45%), Gaps = 3/87 (3%)
 Frame = +2

Query: 335 DKWPD---CPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPI 505
           D+W D   C +  EV +QG+CG CW F +        C    G  HF  SA  + +C   
Sbjct: 529 DRWKDKTGCISKIEVEEQGNCGLCWIFASKLHFETIRC--MRGYGHFRSSALYVANC--- 583

Query: 506 CDWDAAEECRD*LGNIGSTSV*YQEVV 586
            D D+ E C      +GS  V + E+V
Sbjct: 584 SDRDSDEIC-----FVGSNPVEFLEIV 605


>UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra
           erinaceieuropaei|Rep: Cysteine proteinase - Spirometra
           erinaceieuropaei (Tapeworm)
          Length = 336

 Score = 38.3 bits (85), Expect = 0.24
 Identities = 18/62 (29%), Positives = 29/62 (46%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           L EN      W +   +  V++QG CGSCW+F A  A+   +   +   +    S + L+
Sbjct: 117 LKENLPDSVNWRERGAVTSVKNQGQCGSCWSFSANGAIEGAIQIKTGALR--SLSEQQLM 174

Query: 491 SC 496
            C
Sbjct: 175 DC 176


>UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16;
           Plasmodium (Vinckeia)|Rep: Cysteine proteinase precursor
           - Plasmodium vinckei
          Length = 506

 Score = 38.3 bits (85), Expect = 0.24
 Identities = 25/83 (30%), Positives = 42/83 (50%), Gaps = 8/83 (9%)
 Frame = +2

Query: 272 LPIKTH--NFDLIA------SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMT 427
           +P+K H  N +LI+        P++ D R K+   P     +DQG+CGSCWAF A+    
Sbjct: 242 VPLKKHLANTNLISVDNKSKDFPDSRDYRSKFNFLPP----KDQGNCGSCWAFAAI-GNF 296

Query: 428 DRVCTYSNGTKHFHFSAEDLLSC 496
           + +  ++       FS + ++ C
Sbjct: 297 EYLYVHTRHEMPISFSEQQMVDC 319


>UniRef50_P43234 Cluster: Cathepsin O precursor; n=22;
           Euteleostomi|Rep: Cathepsin O precursor - Homo sapiens
           (Human)
          Length = 321

 Score = 38.3 bits (85), Expect = 0.24
 Identities = 19/39 (48%), Positives = 23/39 (58%)
 Frame = +2

Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
           SLP  FD RDK      + +VR+Q  CG CWAF  V A+
Sbjct: 107 SLPLRFDWRDK----QVVTQVRNQQMCGGCWAFSVVGAV 141


>UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19;
           Bilateria|Rep: Cathepsin F precursor - Homo sapiens
           (Human)
          Length = 484

 Score = 38.3 bits (85), Expect = 0.24
 Identities = 20/56 (35%), Positives = 28/56 (50%)
 Frame = +2

Query: 329 PRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           P   W     + +V+DQG CGSCWAF +V    +     + GT     S ++LL C
Sbjct: 273 PEWDWRSKGAVTKVKDQGMCGSCWAF-SVTGNVEGQWFLNQGTL-LSLSEQELLDC 326


>UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2;
           Entamoeba|Rep: Cysteine proteinase ACP1 precursor -
           Entamoeba histolytica
          Length = 308

 Score = 38.3 bits (85), Expect = 0.24
 Identities = 17/46 (36%), Positives = 24/46 (52%)
 Frame = +2

Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           +N  +DQG CGSCW F     +  RV    +  K + FS + L+ C
Sbjct: 103 MNPAKDQGQCGSCWTFCTTAVLEGRV--NKDLGKLYSFSEQQLVDC 146


>UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease;
           n=23; Magnoliophyta|Rep: Senescence-specific cysteine
           protease - Arabidopsis thaliana (Mouse-ear cress)
          Length = 346

 Score = 37.9 bits (84), Expect = 0.32
 Identities = 23/63 (36%), Positives = 31/63 (49%)
 Frame = +2

Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
           +LP + D R K    P    +++QGSCG CWAF AV A+     T     K    S + L
Sbjct: 129 ALPVSVDWRKKGAVTP----IKNQGSCGCCWAFSAVAAIEG--ATQIKKGKLISLSEQQL 182

Query: 488 LSC 496
           + C
Sbjct: 183 VDC 185


>UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 343

 Score = 37.9 bits (84), Expect = 0.32
 Identities = 17/48 (35%), Positives = 24/48 (50%)
 Frame = +2

Query: 368 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICD 511
           ++ QG CGSCWAF    A+   V     G +    S++ LL C  + D
Sbjct: 153 IKYQGPCGSCWAFATAAAIESAVSISGGGLQ--SLSSQQLLDCTVVSD 198


>UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium
           falciparum|Rep: Falcipain 2 - Plasmodium falciparum
          Length = 484

 Score = 37.9 bits (84), Expect = 0.32
 Identities = 20/61 (32%), Positives = 31/61 (50%), Gaps = 1/61 (1%)
 Frame = +2

Query: 317 ENFDPRD-KWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLS 493
           ENFD     W     +  V+DQ +CGSCWAF ++ ++  +     N  K    S ++L+ 
Sbjct: 258 ENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKN--KLITLSEQELVD 315

Query: 494 C 496
           C
Sbjct: 316 C 316


>UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein A;
           n=2; Dictyostelium discoideum|Rep: Gamete and
           mating-type specific protein A - Dictyostelium
           discoideum (Slime mold)
          Length = 448

 Score = 37.9 bits (84), Expect = 0.32
 Identities = 17/45 (37%), Positives = 25/45 (55%), Gaps = 2/45 (4%)
 Frame = +2

Query: 368 VRDQGSCGSCWAFGAVEAMTDR-VCTYSNGTKH-FHFSAEDLLSC 496
           +RDQG CGSCWAF +  A+  R +  Y    K     S ++ ++C
Sbjct: 253 IRDQGQCGSCWAFASSAALESRYLIKYGTAQKSTLQLSNQNAVNC 297


>UniRef50_Q24HI2 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 356

 Score = 37.9 bits (84), Expect = 0.32
 Identities = 20/74 (27%), Positives = 35/74 (47%)
 Frame = +2

Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454
           P+K  N   +  +PE+ +    W D   ++ V+DQ +CGSCW F    A+      + + 
Sbjct: 116 PMKIQNKKNV-QVPESIN----WKDLNKVSPVKDQQNCGSCWTFSTTGAIESHYAIFED- 169

Query: 455 TKHFHFSAEDLLSC 496
            +    S + L+ C
Sbjct: 170 VEPTSLSEQQLIDC 183


>UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2;
           Platyhelminthes|Rep: Cathepsin L-like proteinase -
           Echinococcus multilocularis
          Length = 338

 Score = 37.9 bits (84), Expect = 0.32
 Identities = 21/62 (33%), Positives = 32/62 (51%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
           +P++ D R K    P    ++DQG CGSCWAF A  A+  ++   +   K    S + L+
Sbjct: 122 VPDSIDWRKKGLVTP----IKDQGDCGSCWAFSATGALEGQLKRKTG--KLISLSEQQLV 175

Query: 491 SC 496
            C
Sbjct: 176 DC 177


>UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, whole
           genome shotgun sequence; n=3; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_2,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 376

 Score = 37.9 bits (84), Expect = 0.32
 Identities = 17/46 (36%), Positives = 24/46 (52%)
 Frame = +2

Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           + EV+ QG CGSCWAF   + +  R+   +N  K    S   L+ C
Sbjct: 175 VTEVQQQGRCGSCWAFAVQDVVISRL-AIANKNKLDQLSKTHLIDC 219


>UniRef50_A0CGA3 Cluster: Chromosome undetermined scaffold_179,
           whole genome shotgun sequence; n=3; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_179,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 339

 Score = 37.9 bits (84), Expect = 0.32
 Identities = 20/83 (24%), Positives = 44/83 (53%)
 Frame = +2

Query: 248 YRDEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMT 427
           ++++    + ++      +   P  ++ ++ +P C   ++V +QG+C S ++     + +
Sbjct: 104 FKNDFTQQINVEKCKLSFMDETPVYYNFKEAYPQCN--HQVYNQGNCSSSYSIAVSSSFS 161

Query: 428 DRVCTYSNGTKHFHFSAEDLLSC 496
           DRVC   N T+    SA++LLSC
Sbjct: 162 DRVCK-QNQTQ--QLSAQNLLSC 181


>UniRef50_Q649T1 Cluster: Cathepsin C; n=1; uncultured archaeon
           GZfos34G5|Rep: Cathepsin C - uncultured archaeon
           GZfos34G5
          Length = 760

 Score = 37.9 bits (84), Expect = 0.32
 Identities = 21/43 (48%), Positives = 27/43 (62%), Gaps = 1/43 (2%)
 Frame = +2

Query: 299 LIASLP-ENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
           L AS+P   FD RDK      +  V++QGSCGSC AFG + A+
Sbjct: 301 LDASVPIGTFDWRDK-DGANWITSVKEQGSCGSCVAFGTIGAL 342


>UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease
           containing protein; n=2; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 332

 Score = 37.5 bits (83), Expect = 0.42
 Identities = 17/38 (44%), Positives = 24/38 (63%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
           LPE+ D    W     ++ VRDQG+CGSC+AF +  A+
Sbjct: 127 LPESVD----WRKLGAVSPVRDQGNCGSCYAFASTGAL 160


>UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate
           cathepsin L; n=4; Danio rerio|Rep: Novel protein similar
           to vertebrate cathepsin L - Danio rerio (Zebrafish)
           (Brachydanio rerio)
          Length = 334

 Score = 37.5 bits (83), Expect = 0.42
 Identities = 16/46 (34%), Positives = 26/46 (56%)
 Frame = +2

Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           + EV+DQG CGSCW+F    A+  ++  Y +  +    S + L+ C
Sbjct: 130 VTEVKDQGYCGSCWSFSTTGAIEGQM--YKHTGRLVSLSEQQLVDC 173


>UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2;
           Roseiflexus|Rep: Peptidase C1A, papain precursor -
           Roseiflexus sp. RS-1
          Length = 1202

 Score = 37.5 bits (83), Expect = 0.42
 Identities = 17/35 (48%), Positives = 20/35 (57%), Gaps = 3/35 (8%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAF---GAVEAMTDRV 436
           W D      V+DQG CGSCWAF   G VE+   R+
Sbjct: 175 WCDQGACTPVKDQGVCGSCWAFATTGVVESALKRI 209


>UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum
           aestivum|Rep: Cysteine protease - Triticum aestivum
           (Wheat)
          Length = 371

 Score = 37.5 bits (83), Expect = 0.42
 Identities = 20/61 (32%), Positives = 28/61 (45%)
 Frame = +2

Query: 314 PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLS 493
           P  FD    W +   +   + QG+CG CWAF A  A T       NG +    S ++L+ 
Sbjct: 154 PRQFD----WREHGVVTPAKQQGACGCCWAFAA--AATVESLNKINGGELVDLSVQELVD 207

Query: 494 C 496
           C
Sbjct: 208 C 208


>UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3;
           Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber
           officinale (Ginger)
          Length = 475

 Score = 37.5 bits (83), Expect = 0.42
 Identities = 17/38 (44%), Positives = 25/38 (65%)
 Frame = +2

Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
           LP++ D R+K      +  V++QG CGSCWAF A+ A+
Sbjct: 143 LPDSIDWREKG----AVVAVKNQGRCGSCWAFAAIAAV 176


>UniRef50_Q0JP65 Cluster: Os01g0240900 protein; n=3; Oryza
           sativa|Rep: Os01g0240900 protein - Oryza sativa subsp.
           japonica (Rice)
          Length = 166

 Score = 37.5 bits (83), Expect = 0.42
 Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
 Frame = +2

Query: 341 WPDCPTLNEVRDQGSCGSCWAF---GAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
           W D   + +V+ QG+C SCWAF   GAVE   D      N     + S + L++C
Sbjct: 104 WRDRGAVTDVKMQGTCASCWAFSTTGAVEG--DNFLASGNLRNLLNLSEQQLVNC 156


  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 878,143,735
Number of Sequences: 1657284
Number of extensions: 18445955
Number of successful extensions: 54072
Number of sequences better than 10.0: 391
Number of HSP's better than 10.0 without gapping: 50872
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 53967
length of database: 575,637,011
effective HSP length: 100
effective length of database: 409,908,611
effective search space used: 74193458591
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
