SilkBase

Last updated: 2022/11/18
BLASTX 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= ps4M0472.Seq
         (657 letters)

Database: uniref50 
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C...   120   2e-26
UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n...   109   6e-23
UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|...   101   1e-20
UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr...   101   2e-20
UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina...   100   4e-20
UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3...    99   1e-19
UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs...    97   4e-19
UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata...    97   4e-19
UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi...    95   1e-18
UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep...    94   2e-18
UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cy...    93   4e-18
UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ...    92   1e-17
UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ...    92   1e-17
UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus...    92   1e-17
UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat...    90   4e-17
UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy...    90   4e-17
UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine prot...    90   5e-17
UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M...    89   7e-17
UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j...    88   2e-16
UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadida...    88   2e-16
UniRef50_UPI0000E492F4 Cluster: PREDICTED: similar to cathepsin ...    87   3e-16
UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei...    87   3e-16
UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er...    87   3e-16
UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca...    86   6e-16
UniRef50_Q4TI44 Cluster: Chromosome undetermined SCAF2412, whole...    86   8e-16
UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicate...    86   8e-16
UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:...    85   1e-15
UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ...    85   1e-15
UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No...    85   2e-15
UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ...    84   2e-15
UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ...    84   3e-15
UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome s...    83   4e-15
UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto...    83   4e-15
UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|...    83   6e-15
UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Re...    83   8e-15
UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep...    82   1e-14
UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi...    82   1e-14
UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy...    82   1e-14
UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca...    82   1e-14
UniRef50_Q0MYX5 Cluster: Putative cysteine protease; n=1; Emilia...    82   1e-14
UniRef50_UPI0000E224BB Cluster: PREDICTED: hypothetical protein;...    81   2e-14
UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy...    81   2e-14
UniRef50_A2F4G0 Cluster: Clan CA, family C1, cathepsin L-like cy...    81   2e-14
UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]...    81   2e-14
UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica...    81   2e-14
UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n...    81   2e-14
UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e...    79   7e-14
UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18...    79   9e-14
UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1; ...    79   1e-13
UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schist...    78   2e-13
UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ...    77   3e-13
UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip...    77   3e-13
UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaste...    77   4e-13
UniRef50_Q7KWP5 Cluster: Similar to Dictyostelium discoideum (Sl...    77   4e-13
UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat...    77   5e-13
UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy...    77   5e-13
UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-li...    77   5e-13
UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory...    76   9e-13
UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:...    75   1e-12
UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae...    75   1e-12
UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=...    75   2e-12
UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ...    74   3e-12
UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc...    74   3e-12
UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n...    73   5e-12
UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1...    73   5e-12
UniRef50_Q24HI2 Cluster: Papain family cysteine protease contain...    73   5e-12
UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio...    73   6e-12
UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ...    73   8e-12
UniRef50_A3EXS2 Cluster: Cathepsin L-like cysteine proteinase-li...    72   1e-11
UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus...    71   2e-11
UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lambli...    71   2e-11
UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R...    71   2e-11
UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome s...    71   3e-11
UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n...    71   3e-11
UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph...    70   4e-11
UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster...    70   4e-11
UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ...    70   4e-11
UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ...    70   4e-11
UniRef50_Q9LUX8 Cluster: Cysteine protease; n=1; Pyrus pyrifolia...    70   6e-11
UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like cy...    70   6e-11
UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin ...    69   8e-11
UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt...    69   8e-11
UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Ac...    69   8e-11
UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h...    69   1e-10
UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n...    69   1e-10
UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve...    69   1e-10
UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Gi...    69   1e-10
UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ...    68   2e-10
UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;...    68   2e-10
UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; ...    68   2e-10
UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryz...    68   2e-10
UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio...    68   2e-10
UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2...    68   2e-10
UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ...    67   3e-10
UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ...    67   4e-10
UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lambli...    67   4e-10
UniRef50_Q22A69 Cluster: Papain family cysteine protease contain...    66   5e-10
UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dv...    66   7e-10
UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba hi...    66   9e-10
UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ...    66   9e-10
UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [C...    66   9e-10
UniRef50_A2DKX2 Cluster: Clan CA, family C1, cathepsin L-like cy...    65   1e-09
UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;...    65   1e-09
UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ...    65   2e-09
UniRef50_P25773 Cluster: Cathepsin L; n=38; Eukaryota|Rep: Cathe...    65   2e-09
UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus ...    64   2e-09
UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t...    64   3e-09
UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal...    64   3e-09
UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia ...    64   3e-09
UniRef50_UPI000155637A Cluster: PREDICTED: similar to ENSANGP000...    64   4e-09
UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox...    64   4e-09
UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ...    63   7e-09
UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty...    63   7e-09
UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|R...    63   7e-09
UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re...    62   9e-09
UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ...    62   1e-08
UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-...    62   1e-08
UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG...    62   2e-08
UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ...    62   2e-08
UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ...    62   2e-08
UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl...    61   2e-08
UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2; Arab...    61   3e-08
UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big...    61   3e-08
UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi...    61   3e-08
UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain...    60   3e-08
UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endop...    60   3e-08
UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; D...    60   3e-08
UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryz...    60   5e-08
UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;...    60   5e-08
UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy...    60   6e-08
UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel...    60   6e-08
UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ...    59   8e-08
UniRef50_Q5BTK3 Cluster: SJCHGC00358 protein; n=1; Schistosoma j...    59   8e-08
UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain...    59   8e-08
UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr...    59   8e-08
UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryz...    59   1e-07
UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8; Magnoliophyt...    59   1e-07
UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic...    59   1e-07
UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3; ...    58   1e-07
UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1; ...    58   1e-07
UniRef50_UPI00006CBB5F Cluster: Papain family cysteine protease ...    58   2e-07
UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted...    58   2e-07
UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia...    58   2e-07
UniRef50_O16454 Cluster: Temporarily assigned gene name protein ...    58   2e-07
UniRef50_UPI0000498E2F Cluster: cysteine proteinase; n=1; Entamo...    58   2e-07
UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole...    58   2e-07
UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil...    57   3e-07
UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb...    57   4e-07
UniRef50_Q22RC9 Cluster: Papain family cysteine protease contain...    57   4e-07
UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; ...    57   4e-07
UniRef50_Q650X9 Cluster: Putative cysteine proteinase; n=1; Oryz...    56   6e-07
UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza sativa...    56   7e-07
UniRef50_Q9VNK7 Cluster: CG1075-PA; n=1; Drosophila melanogaster...    56   7e-07
UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re...    56   7e-07
UniRef50_Q23RT7 Cluster: Papain family cysteine protease contain...    56   7e-07
UniRef50_Q23H32 Cluster: Papain family cysteine protease contain...    56   7e-07
UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; ...    56   7e-07
UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lambl...    56   1e-06
UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:...    56   1e-06
UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve...    56   1e-06
UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen...    56   1e-06
UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; ...    55   1e-06
UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n...    55   1e-06
UniRef50_Q24E33 Cluster: Papain family cysteine protease contain...    55   1e-06
UniRef50_Q239L8 Cluster: Papain family cysteine protease contain...    55   1e-06
UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Gi...    55   1e-06
UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa...    55   2e-06
UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia...    55   2e-06
UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr...    55   2e-06
UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Pir...    55   2e-06
UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D...    55   2e-06
UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v...    54   2e-06
UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foet...    54   3e-06
UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|...    54   3e-06
UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sa...    54   4e-06
UniRef50_Q9XZI2 Cluster: Cathepsin Z1 preproprotein; n=3; Bilate...    54   4e-06
UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;...    54   4e-06
UniRef50_Q23FS6 Cluster: Papain family cysteine protease contain...    54   4e-06
UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4; Caenorhabdi...    54   4e-06
UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa...    53   5e-06
UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n...    53   5e-06
UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis...    53   5e-06
UniRef50_A4VE98 Cluster: Cathepsin z; n=3; Tetrahymena thermophi...    53   5e-06
UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ...    53   7e-06
UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ...    53   7e-06
UniRef50_Q54VR1 Cluster: Putative uncharacterized protein; n=1; ...    53   7e-06
UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Try...    53   7e-06
UniRef50_A7APS9 Cluster: Papain family cysteine protease contain...    53   7e-06
UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w...    53   7e-06
UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia...    52   9e-06
UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov...    52   9e-06
UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R...    52   9e-06
UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyl...    52   1e-05
UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa...    52   1e-05
UniRef50_Q015J8 Cluster: Cathepsin; n=2; Ostreococcus|Rep: Cathe...    52   1e-05
UniRef50_Q01FU9 Cluster: Cathepsin Z; n=2; Ostreococcus|Rep: Cat...    52   2e-05
UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35...    52   2e-05
UniRef50_Q9BL26 Cluster: Putative uncharacterized protein; n=4; ...    51   2e-05
UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa...    51   2e-05
UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=...    51   2e-05
UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; The...    51   2e-05
UniRef50_A2WPN4 Cluster: Putative uncharacterized protein; n=2; ...    51   3e-05
UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba hi...    51   3e-05
UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - ...    51   3e-05
UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep...    51   3e-05
UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; ...    50   4e-05
UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3....    50   4e-05
UniRef50_UPI00006CBCFB Cluster: Papain family cysteine protease ...    50   5e-05
UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Catheps...    50   5e-05
UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n...    50   5e-05
UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi...    50   5e-05
UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:...    50   5e-05
UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: C...    50   6e-05
UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoid...    50   6e-05
UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain...    50   6e-05
UniRef50_Q22NW9 Cluster: Papain family cysteine protease contain...    50   6e-05
UniRef50_A0DIY3 Cluster: Chromosome undetermined scaffold_52, wh...    50   6e-05
UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Pl...    36   7e-05
UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1; Rhip...    49   9e-05
UniRef50_Q7JYA0 Cluster: RE20049p; n=2; Sophophora|Rep: RE20049p...    49   9e-05
UniRef50_Q4UCF5 Cluster: Cysteine proteinase, tacP, putative; n=...    49   9e-05
UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ...    49   1e-04
UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma...    49   1e-04
UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ...    48   1e-04
UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep...    48   1e-04
UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|R...    48   1e-04
UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ...    48   1e-04
UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ...    48   2e-04
UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ...    48   2e-04
UniRef50_Q26988 Cluster: Cysteine proteinase 4; n=1; Tritrichomo...    48   2e-04
UniRef50_Q23H15 Cluster: Papain family cysteine protease contain...    48   2e-04
UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathep...    48   2e-04
UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestiv...    48   3e-04
UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; ...    48   3e-04
UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|...    48   3e-04
UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8....    48   3e-04
UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ...    48   3e-04
UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryo...    48   3e-04
UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:...    48   3e-04
UniRef50_Q84SA7 Cluster: Thiol protease; n=1; Aster tripolium|Re...    47   3e-04
UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomus...    47   3e-04
UniRef50_Q7QQ92 Cluster: GLP_243_18349_20043; n=1; Giardia lambl...    47   3e-04
UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;...    47   3e-04
UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; ...    47   3e-04
UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh...    47   3e-04
UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi...    47   3e-04
UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome sh...    47   5e-04
UniRef50_O48605 Cluster: Putative thiol protease; n=1; Hordeum v...    47   5e-04
UniRef50_A7QDM1 Cluster: Chromosome chr10 scaffold_81, whole gen...    47   5e-04
UniRef50_Q23EG5 Cluster: Papain family cysteine protease contain...    47   5e-04
UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C...    47   5e-04
UniRef50_Q962W3 Cluster: Cysteine protease; n=3; Giardia intesti...    46   6e-04
UniRef50_Q7R0G3 Cluster: GLP_29_33036_32140; n=1; Giardia lambli...    46   6e-04
UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1; ...    46   6e-04
UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru...    46   6e-04
UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ...    46   8e-04
UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein...    46   8e-04
UniRef50_Q5CYC8 Cluster: Cathepsin like thiol protease possibly ...    46   8e-04
UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1...    46   8e-04
UniRef50_Q23TW3 Cluster: Papain family cysteine protease contain...    46   8e-04
UniRef50_Q6A1H9 Cluster: Cathepsin X/O; n=1; Suberites domuncula...    46   0.001
UniRef50_Q237A1 Cluster: Papain family cysteine protease contain...    46   0.001
UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep: Ca...    46   0.001
UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA,...    45   0.001
UniRef50_Q7QWN9 Cluster: GLP_26_50243_51811; n=2; Giardia lambli...    45   0.001
UniRef50_Q7QVE9 Cluster: GLP_542_3431_1206; n=1; Giardia lamblia...    45   0.001
UniRef50_Q7QRX3 Cluster: GLP_549_24108_24914; n=1; Giardia lambl...    45   0.001
UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gamb...    45   0.001
UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain...    45   0.001
UniRef50_Q23H06 Cluster: Papain family cysteine protease contain...    45   0.001
UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain...    45   0.001
UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep: Ca...    45   0.001
UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA...    45   0.002
UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n...    45   0.002
UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis...    45   0.002
UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theil...    45   0.002
UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain...    45   0.002
UniRef50_A3FQ13 Cluster: Cathepsin like thiol protease possibly ...    45   0.002
UniRef50_P05993 Cluster: Cysteine proteinase; n=7; Eukaryota|Rep...    45   0.002
UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto...    45   0.002
UniRef50_Q23H10 Cluster: Papain family cysteine protease contain...    44   0.002
UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep...    44   0.002
UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella ve...    44   0.002
UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, wh...    44   0.002
UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4 ...    44   0.002
UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|...    44   0.002
UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease ...    44   0.003
UniRef50_Q22W19 Cluster: Papain family cysteine protease contain...    44   0.003
UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like prote...    44   0.003
UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; The...    44   0.003
UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3 ...    44   0.003
UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Ca...    44   0.003
UniRef50_A1SVF0 Cluster: Peptidase C1A, papain; n=1; Psychromona...    44   0.004
UniRef50_Q9XXQ7 Cluster: Putative uncharacterized protein; n=2; ...    44   0.004
UniRef50_A2DPZ6 Cluster: Clan CA, family C1, cathepsin B-like cy...    44   0.004
UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali...    44   0.004
UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh...    44   0.004
UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ...    43   0.006
UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; ...    43   0.006
UniRef50_Q26987 Cluster: Cysteine proteinase 3; n=1; Tritrichomo...    43   0.006
UniRef50_Q84YE7 Cluster: Cysteine proteinase-like protein; n=1; ...    43   0.007
UniRef50_Q24F16 Cluster: Papain family cysteine protease contain...    43   0.007
UniRef50_O62484 Cluster: Putative uncharacterized protein; n=1; ...    43   0.007
UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2 ...    43   0.007
UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Re...    42   0.010
UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|...    42   0.010
UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intest...    42   0.010
UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2...    42   0.010
UniRef50_Q4UC83 Cluster: Cysteine proteinase, putative; n=2; The...    42   0.010
UniRef50_Q23FP9 Cluster: Papain family cysteine protease contain...    42   0.010
UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;...    42   0.013
UniRef50_Q5YER6 Cluster: Cathepsin Z; n=1; Bigelowiella natans|R...    42   0.013
UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep...    42   0.013
UniRef50_Q4N5Z7 Cluster: Cysteine proteinase, putative; n=2; The...    42   0.013
UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n...    42   0.013
UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8; Leishma...    42   0.013
UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, wh...    42   0.013
UniRef50_UPI0000E46171 Cluster: PREDICTED: hypothetical protein;...    42   0.017
UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4; Ancylostomati...    42   0.017
UniRef50_Q5CKC2 Cluster: Preprocathepsin c; n=2; Cryptosporidium...    42   0.017
UniRef50_Q22DX2 Cluster: Papain family cysteine protease contain...    42   0.017
UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Re...    42   0.017
UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, wh...    42   0.017
UniRef50_A0CHZ5 Cluster: Chromosome undetermined scaffold_186, w...    42   0.017
UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ...    41   0.023
UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1; ...    41   0.023
UniRef50_Q4UFL9 Cluster: Cathepsin-like cysteine protease, putat...    41   0.023
UniRef50_Q231X3 Cluster: Papain family cysteine protease contain...    41   0.023
UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, w...    41   0.023
UniRef50_A0BZE1 Cluster: Chromosome undetermined scaffold_139, w...    41   0.023
UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=...    41   0.023
UniRef50_Q9UBR2 Cluster: Cathepsin Z precursor; n=40; Bilateria|...    41   0.023
UniRef50_Q93512 Cluster: Putative uncharacterized protein; n=2; ...    41   0.030
UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=...    41   0.030
UniRef50_Q7R5X2 Cluster: GLP_81_104117_102504; n=1; Giardia lamb...    41   0.030
UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lambl...    41   0.030
UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3; Theil...    41   0.030
UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain ...    41   0.030
UniRef50_A0DZB1 Cluster: Chromosome undetermined scaffold_7, who...    41   0.030
UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; M...    41   0.030
UniRef50_Q01E11 Cluster: Cysteine proteinase Cathepsin F; n=3; O...    40   0.040
UniRef50_Q26993 Cluster: Cysteine proteinase 9; n=1; Tritrichomo...    40   0.040
UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain...    40   0.040
UniRef50_Q1AMF3 Cluster: Cathepsin C1; n=1; Toxoplasma gondii|Re...    40   0.040
UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7...    40   0.040
UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|...    40   0.040
UniRef50_A2GCC2 Cluster: Clan CA, family C1, cathepsin B-like cy...    40   0.040
UniRef50_A0DCA5 Cluster: Chromosome undetermined scaffold_45, wh...    40   0.040
UniRef50_A0CYS7 Cluster: Chromosome undetermined scaffold_31, wh...    40   0.040
UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-li...    40   0.040
UniRef50_UPI0000E49DA9 Cluster: PREDICTED: similar to cathepsin ...    40   0.053
UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ...    40   0.053
UniRef50_Q7M1Q7 Cluster: Actinidain; n=1; Actinidia chinensis|Re...    40   0.053
UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor; ...    40   0.053
UniRef50_Q4N3V5 Cluster: Cathepsin C, putative; n=1; Theileria p...    40   0.053
UniRef50_Q235G6 Cluster: Papain family cysteine protease contain...    40   0.053
UniRef50_Q0PWU8 Cluster: Cathepsin B preproprotein-like protein;...    40   0.053
UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p; ...    40   0.069
UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin ...    40   0.069
UniRef50_Q01FL5 Cluster: Cysteine protease-5; n=3; Ostreococcus|...    40   0.069
UniRef50_Q9XW98 Cluster: Putative uncharacterized protein; n=1; ...    40   0.069
UniRef50_Q9NHY2 Cluster: Cysteine protease cp1; n=2; Theileria c...    40   0.069
UniRef50_Q8I8D3 Cluster: Cysteine protease 15; n=2; Entamoeba hi...    40   0.069
UniRef50_Q7Q9Y5 Cluster: ENSANGP00000012222; n=1; Anopheles gamb...    40   0.069
UniRef50_UPI00006CB653 Cluster: Papain family cysteine protease ...    39   0.092
UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G...    39   0.092
UniRef50_Q8I8D7 Cluster: Cysteine protease 11; n=4; Entamoeba hi...    39   0.092
UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpw...    39   0.092
UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8....    39   0.092
UniRef50_A7AX75 Cluster: Preprocathepsin c, putative; n=1; Babes...    39   0.092
UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like cy...    39   0.092
UniRef50_Q9TY95 Cluster: Serine-repeat antigen protein precursor...    39   0.092
UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease ...    39   0.12 
UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease ...    39   0.12 
UniRef50_A6N590 Cluster: Extracellular cysteine protease 8; n=10...    39   0.12 
UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-li...    39   0.12 
UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precurs...    39   0.12 
UniRef50_Q75JH0 Cluster: Similar to Dictyostelium discoideum (Sl...    38   0.16 
UniRef50_Q248G1 Cluster: Papain family cysteine protease contain...    38   0.16 
UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator ame...    38   0.16 
UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115, w...    38   0.16 
UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium...    38   0.16 
UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep: ...    38   0.21 
UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcop...    38   0.21 
UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase pr...    38   0.21 
UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 ...    38   0.21 
UniRef50_A4S004 Cluster: Predicted protein; n=2; Ostreococcus|Re...    38   0.28 
UniRef50_Q8I8D5 Cluster: Cysteine protease 13; n=2; Entamoeba hi...    38   0.28 
UniRef50_Q5CM16 Cluster: P3ECSL-related; n=2; Cryptosporidium|Re...    38   0.28 
UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase pr...    38   0.28 
UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;...    37   0.37 
UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; R...    37   0.37 
UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cy...    37   0.37 
UniRef50_Q625X3 Cluster: Putative uncharacterized protein CBG011...    37   0.49 
UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag...    37   0.49 
UniRef50_A0EI50 Cluster: Chromosome undetermined scaffold_98, wh...    37   0.49 
UniRef50_P56202 Cluster: Cathepsin W precursor; n=15; Eutheria|R...    37   0.49 
UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,...    36   0.65 
UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep...    36   0.65 
UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestin...    36   0.65 
UniRef50_Q1AMF2 Cluster: Cathepsin C2; n=1; Toxoplasma gondii|Re...    36   0.65 
UniRef50_A7SNM3 Cluster: Predicted protein; n=1; Nematostella ve...    36   0.65 
UniRef50_A2E6N1 Cluster: Clan CA, family C1, cathepsin B-like cy...    36   0.86 
UniRef50_Q2H7E7 Cluster: Putative uncharacterized protein; n=1; ...    36   0.86 
UniRef50_A2XHS0 Cluster: Putative uncharacterized protein; n=2; ...    36   1.1  
UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2; ...    36   1.1  
UniRef50_Q4XZE6 Cluster: Preprocathepsin c, putative; n=6; Plasm...    36   1.1  
UniRef50_O96167 Cluster: Cysteine protease, putative; n=1; Plasm...    36   1.1  
UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteina...    36   1.1  
UniRef50_A7ASR7 Cluster: Cathepsin C, putative; n=1; Babesia bov...    36   1.1  
UniRef50_Q06VH9 Cluster: Putative uncharacterized protein; n=1; ...    35   1.5  
UniRef50_A5K8Y0 Cluster: Preprocathepsin c, putative; n=1; Plasm...    35   1.5  
UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like cy...    35   1.5  
UniRef50_A0LQG9 Cluster: Peptidase C1A, papain; n=1; Syntrophoba...    35   2.0  
UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1; O...    35   2.0  
UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain...    35   2.0  
UniRef50_A7TZ14 Cluster: Cysteine proteinase; n=1; Lepeophtheiru...    35   2.0  
UniRef50_Q8IIJ9 Cluster: Probable cathepsin C precursor; n=11; P...    35   2.0  
UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ...    34   2.6  
UniRef50_Q8I8D6 Cluster: Cysteine protease 12; n=1; Entamoeba hi...    34   2.6  
UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckei...    34   2.6  
UniRef50_Q7R6M0 Cluster: GLP_170_106076_104580; n=1; Giardia lam...    34   2.6  
UniRef50_Q7QQ83 Cluster: GLP_42_16392_14707; n=1; Giardia lambli...    34   2.6  
UniRef50_Q1AMF1 Cluster: Cathepsin C3; n=1; Toxoplasma gondii|Re...    34   2.6  
UniRef50_O96163 Cluster: Cysteine protease, putative; n=5; Plasm...    34   2.6  
UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precu...    34   2.6  
UniRef50_Q8I8D2 Cluster: Cysteine protease 16; n=2; Entamoeba hi...    34   3.5  
UniRef50_Q8I8D0 Cluster: Cysteine protease 18; n=2; Entamoeba hi...    34   3.5  
UniRef50_Q7R6L4 Cluster: GLP_170_114230_115951; n=1; Giardia lam...    34   3.5  
UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, wh...    34   3.5  
UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ...    33   4.6  
UniRef50_UPI0000E4622C Cluster: PREDICTED: hypothetical protein;...    33   4.6  
UniRef50_A0BNM1 Cluster: Chromosome undetermined scaffold_119, w...    33   4.6  
UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum...    33   6.0  
UniRef50_Q3L7L6 Cluster: Sar s 1 allergen Yv5032C08; n=1; Sarcop...    33   6.0  
UniRef50_O96166 Cluster: Cysteine protease, putative; n=1; Plasm...    33   6.0  
UniRef50_O96165 Cluster: Cysteine protease, putative; n=1; Plasm...    33   6.0  
UniRef50_A1SAN0 Cluster: Putative uncharacterized protein; n=1; ...    33   8.0  
UniRef50_Q8WQ50 Cluster: Zerknuellt protein; n=1; Haematopota pl...    33   8.0  
UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n...    33   8.0  

>UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15)
           [Contains: Cathepsin L heavy chain; Cathepsin L light
           chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC
           3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain] - Sarcophaga peregrina (Flesh fly)
           (Boettcherisca peregrina)
          Length = 339

 Score =  120 bits (290), Expect = 2e-26
 Identities = 52/78 (66%), Positives = 60/78 (76%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           H SFQLYS GVYNE EC   +LDHGVLVVGYGTDE G+DYW  +   G  WGE GYIKM 
Sbjct: 262 HESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMA 321

Query: 477 RNKNNRCGIASSASYXXV 424
           RN+NN+CGIA+++SY  V
Sbjct: 322 RNQNNQCGIATASSYPTV 339


>UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase;
           n=21; Bilateria|Rep: Cathepsin L-like cysteine
           proteinase - Globodera pallida
          Length = 379

 Score =  109 bits (262), Expect = 6e-23
 Identities = 50/78 (64%), Positives = 56/78 (71%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           H SFQLY+ GVY E+ECS  +LDHGVLVVGYGTD Q  DYW  +   G  WGE GYI+M 
Sbjct: 302 HRSFQLYTHGVYFEKECSPENLDHGVLVVGYGTDAQQGDYWIVKNSWGAHWGEQGYIRMA 361

Query: 477 RNKNNRCGIASSASYXXV 424
           RN+ N CGIAS ASY  V
Sbjct: 362 RNRKNNCGIASHASYPLV 379


>UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2;
           Acanthamoeba|Rep: Cysteine proteinase - Acanthamoeba
           healyi
          Length = 330

 Score =  101 bits (243), Expect = 1e-20
 Identities = 47/75 (62%), Positives = 54/75 (72%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           H SFQ YS GVY E  CSST LDHGVLVVG+G+ E G D+W  +   G  WG  GYIKM 
Sbjct: 254 HNSFQFYSGGVYYESACSSTQLDHGVLVVGWGS-ENGQDFWWVKNSWGASWGLNGYIKMS 312

Query: 477 RNKNNRCGIASSASY 433
           RN+NN CGIA++ASY
Sbjct: 313 RNQNNNCGIATAASY 327


>UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase
           precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine
           proteinase precursor - Heterodera glycines (Soybean cyst
           nematode worm)
          Length = 353

 Score =  101 bits (242), Expect = 2e-20
 Identities = 46/76 (60%), Positives = 53/76 (69%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           SFQ Y +GVY E  CS+  LDHGVL+VGYGTDE   DYW  +   GP WGE GYI++ RN
Sbjct: 278 SFQFYKTGVYYERWCSNRYLDHGVLLVGYGTDETHGDYWLVKNSWGPHWGENGYIRIARN 337

Query: 471 KNNRCGIASSASYXXV 424
           K N CGIA+ ASY  V
Sbjct: 338 KQNHCGIATMASYPVV 353


>UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteinase
           B; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like
           tick cysteine proteinase B - Haemaphysalis longicornis
           (Bush tick)
          Length = 332

 Score =  100 bits (239), Expect = 4e-20
 Identities = 46/74 (62%), Positives = 54/74 (72%)
 Frame = -1

Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466
           Q YS G+Y+E ECSS  LDHGVLVVGYGT + G DYW  +   G  WG+ GYI M RN++
Sbjct: 260 QFYSEGIYDEPECSSEQLDHGVLVVGYGTKD-GKDYWLVKNSWGTTWGDEGYIYMTRNQD 318

Query: 465 NRCGIASSASYXXV 424
           N+CGIASSASY  V
Sbjct: 319 NQCGIASSASYPLV 332


>UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3;
           Bilateria|Rep: Cathepsin L-like cysteine protease -
           Neobenedenia melleni
          Length = 335

 Score = 98.7 bits (235), Expect = 1e-19
 Identities = 44/76 (57%), Positives = 54/76 (71%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           SF LY SGVY+EE+CS T L+HGVL VGYGT  +G+DYW  +      WG  GYI M RN
Sbjct: 260 SFHLYDSGVYDEEDCSQTMLNHGVLAVGYGTTPEGLDYWKVKNSWTNTWGMEGYILMSRN 319

Query: 471 KNNRCGIASSASYXXV 424
           K+N+CG+A+ ASY  V
Sbjct: 320 KDNQCGVATVASYPIV 335


>UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precursor;
           n=3; Metazoa|Rep: Digestive cysteine proteinase 2
           precursor - Homarus americanus (American lobster)
          Length = 323

 Score = 96.7 bits (230), Expect = 4e-19
 Identities = 45/78 (57%), Positives = 53/78 (67%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           H+SFQ YSSGVY E  CS + LDH VL VGYG+ E G D+W  +      WG+ GYIKM 
Sbjct: 247 HSSFQFYSSGVYYEPSCSPSYLDHAVLAVGYGS-EGGQDFWLVKNSWATSWGDAGYIKMS 305

Query: 477 RNKNNRCGIASSASYXXV 424
           RN+NN CGIA+ ASY  V
Sbjct: 306 RNRNNNCGIATVASYPLV 323


>UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42;
           Coelomata|Rep: Cathepsin L2 precursor - Homo sapiens
           (Human)
          Length = 334

 Score = 96.7 bits (230), Expect = 4e-19
 Identities = 43/81 (53%), Positives = 55/81 (67%), Gaps = 3/81 (3%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGY---GTDEQGVDYWXREELVGPLWGELGYI 487
           H+SFQ Y SG+Y E +CSS +LDHGVLVVGY   G +     YW  +   GP WG  GY+
Sbjct: 254 HSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYV 313

Query: 486 KMIRNKNNRCGIASSASYXXV 424
           K+ ++KNN CGIA++ASY  V
Sbjct: 314 KIAKDKNNHCGIATAASYPNV 334


>UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsin L
           - Suberites domuncula (Sponge)
          Length = 324

 Score = 95.5 bits (227), Expect = 1e-18
 Identities = 47/78 (60%), Positives = 52/78 (66%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           H SFQ Y +GVY E  CSS+ LDHGVLVVGYGT E G DY+  +   G  WG  GYI M 
Sbjct: 248 HRSFQFYKNGVYYEPSCSSSRLDHGVLVVGYGT-EGGQDYFIVKNSWGTRWGMDGYIMMS 306

Query: 477 RNKNNRCGIASSASYXXV 424
           RN+ N CGIAS ASY  V
Sbjct: 307 RNRRNNCGIASQASYPIV 324


>UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin
           L - Misgurnus mizolepis (Mud loach)
          Length = 337

 Score = 94.3 bits (224), Expect = 2e-18
 Identities = 43/81 (53%), Positives = 55/81 (67%), Gaps = 3/81 (3%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVD---YWXREELVGPLWGELGYI 487
           H SFQ Y SG+Y E+ECSS +LDHGVLVVGYG + + VD   YW  +      WG+ GYI
Sbjct: 257 HESFQFYQSGIYFEKECSSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSESWGDKGYI 316

Query: 486 KMIRNKNNRCGIASSASYXXV 424
            M +++ N CGIA++ASY  V
Sbjct: 317 YMAKDRKNHCGIATAASYPLV 337


>UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=17; Trichomonas vaginalis|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 318

 Score = 93.5 bits (222), Expect = 4e-18
 Identities = 42/68 (61%), Positives = 49/68 (72%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           FQLYSSG+YN + CSST LDH V +VGYGT E  VDYW      G  WGE GYI+MIRN 
Sbjct: 244 FQLYSSGIYNPKSCSSTFLDHAVGLVGYGT-ENKVDYWIVRNSWGTSWGEKGYIRMIRNN 302

Query: 468 NNRCGIAS 445
            N+CG+A+
Sbjct: 303 GNKCGVAT 310


>UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin
           L-like protease; n=1; Nasonia vitripennis|Rep:
           PREDICTED: similar to cathepsin L-like protease -
           Nasonia vitripennis
          Length = 353

 Score = 91.9 bits (218), Expect = 1e-17
 Identities = 39/79 (49%), Positives = 52/79 (65%), Gaps = 1/79 (1%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQ-GVDYWXREELVGPLWGELGYIKM 481
           H +F+ YS GVY + EC+  DLDH VL+VGYGTD +   D+W  +   G  WGE GY K+
Sbjct: 275 HDTFRFYSEGVYYQPECNEDDLDHAVLIVGYGTDNRTDQDFWLVKNSWGETWGEGGYFKV 334

Query: 480 IRNKNNRCGIASSASYXXV 424
            RN+ N CGIA++A Y  +
Sbjct: 335 ARNRRNHCGIAAAAVYPVI 353


>UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba
           culbertsoni|Rep: Cysteine proteinase - Acanthamoeba
           culbertsoni
          Length = 482

 Score = 91.9 bits (218), Expect = 1e-17
 Identities = 41/71 (57%), Positives = 48/71 (67%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           SF  YS G Y +  CSST+L+H VLVVG+GTD Q  DYW  +   G  WG+ GY+ M RN
Sbjct: 298 SFMFYSGGYYYDPTCSSTNLNHAVLVVGWGTDPQRGDYWIAKNEWGTAWGDDGYVYMARN 357

Query: 471 KNNRCGIASSA 439
           KNN CGIAS A
Sbjct: 358 KNNNCGIASLA 368


>UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes
           vastus|Rep: Cathepsin L - Aphrocallistes vastus
          Length = 329

 Score = 91.9 bits (218), Expect = 1e-17
 Identities = 44/75 (58%), Positives = 51/75 (68%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           HTSFQ+Y SG+Y    CS T LDHGVLVVGYGTD  GVDYW  +   G  WG  GY K I
Sbjct: 254 HTSFQMYHSGIYTPFLCSKTKLDHGVLVVGYGTD-NGVDYWLIKNSWGMAWGMDGYFK-I 311

Query: 477 RNKNNRCGIASSASY 433
             K+++CGI + ASY
Sbjct: 312 EMKSDKCGICTQASY 326


>UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate
           cathepsin L; n=4; Danio rerio|Rep: Novel protein similar
           to vertebrate cathepsin L - Danio rerio (Zebrafish)
           (Brachydanio rerio)
          Length = 334

 Score = 90.2 bits (214), Expect = 4e-17
 Identities = 41/76 (53%), Positives = 51/76 (67%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           SF  YSSG+Y E  C+  +L+H VLVVGYG++E G DYW  +   G  WGE GY++MIRN
Sbjct: 260 SFLFYSSGIYKESNCNPNNLNHAVLVVGYGSEE-GTDYWIIKNSWGTGWGEGGYMRMIRN 318

Query: 471 KNNRCGIASSASYXXV 424
             N CGIAS A Y  +
Sbjct: 319 GKNTCGIASYALYPII 334


>UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan
           CA, family C1, cathepsin L-like cysteine peptidase -
           Trichomonas vaginalis G3
          Length = 306

 Score = 90.2 bits (214), Expect = 4e-17
 Identities = 38/71 (53%), Positives = 49/71 (69%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           SF LY  G+Y+E +CS  DLDH V  VGYG + +  DYW      G +WGE GY++MIRN
Sbjct: 231 SFMLYKEGIYDEPKCSEEDLDHAVGCVGYGVEGEK-DYWIVRNSWGEVWGEKGYVRMIRN 289

Query: 471 KNNRCGIASSA 439
           KNN+CG+A+ A
Sbjct: 290 KNNQCGVATEA 300


>UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine
           protease; n=1; Maconellicoccus hirsutus|Rep: Putative
           cathepsin L-like cysteine protease - Maconellicoccus
           hirsutus (hibiscus mealybug)
          Length = 339

 Score = 89.8 bits (213), Expect = 5e-17
 Identities = 42/76 (55%), Positives = 52/76 (68%), Gaps = 2/76 (2%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIK 484
           H SF  Y  G+Y E +C +   +++HGVLVVGYG+ E G DYW  +   G  WGE GYI+
Sbjct: 261 HQSFHSYKGGIYFEPDCGNKKDEVNHGVLVVGYGS-ENGQDYWIVKNSYGTDWGEDGYIR 319

Query: 483 MIRNKNNRCGIASSAS 436
           M RNKNN CGIA+SAS
Sbjct: 320 MARNKNNHCGIATSAS 335


>UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15)
           (Major excreted protein) (MEP) [Contains: Cathepsin L
           heavy chain; Cathepsin L light chain]; n=19;
           Eukaryota|Rep: Cathepsin L precursor (EC 3.4.22.15)
           (Major excreted protein) (MEP) [Contains: Cathepsin L
           heavy chain; Cathepsin L light chain] - Homo sapiens
           (Human)
          Length = 333

 Score = 89.4 bits (212), Expect = 7e-17
 Identities = 41/81 (50%), Positives = 51/81 (62%), Gaps = 3/81 (3%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYG---TDEQGVDYWXREELVGPLWGELGYI 487
           H SF  Y  G+Y E +CSS D+DHGVLVVGYG   T+     YW  +   G  WG  GY+
Sbjct: 253 HESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYV 312

Query: 486 KMIRNKNNRCGIASSASYXXV 424
           KM +++ N CGIAS+ASY  V
Sbjct: 313 KMAKDRRNHCGIASAASYPTV 333


>UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC06231 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 372

 Score = 88.2 bits (209), Expect = 2e-16
 Identities = 39/78 (50%), Positives = 54/78 (69%), Gaps = 2/78 (2%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           SF +Y SG+Y++ EC+S   DLDHGVL+VGYG  E G  YW  +   G  WG+ GY+K++
Sbjct: 296 SFSMYKSGIYSDPECASASEDLDHGVLLVGYGI-EDGKPYWLIKNSWGEDWGDKGYVKIL 354

Query: 477 RNKNNRCGIASSASYXXV 424
           ++  N CG+AS+ASY  V
Sbjct: 355 KDSKNMCGVASAASYPLV 372


>UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11;
           Trichomonadidae|Rep: Cysteine protease - Tritrichomonas
           foetus (Trichomonas foetus)
          Length = 315

 Score = 87.8 bits (208), Expect = 2e-16
 Identities = 41/75 (54%), Positives = 53/75 (70%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           H SFQLY SG+Y+E ECS+T L+HGV  +G+G+D     YW      G  WGE GYI++I
Sbjct: 240 HQSFQLYKSGIYDEPECSATFLNHGVGCIGFGSDND-TKYWIVPNSWGLTWGEEGYIRII 298

Query: 477 RNKNNRCGIASSASY 433
           R K+NRCGIA+SA +
Sbjct: 299 R-KDNRCGIAASACF 312


>UniRef50_UPI0000E492F4 Cluster: PREDICTED: similar to cathepsin
           L-like proteinase; n=2; Strongylocentrotus
           purpuratus|Rep: PREDICTED: similar to cathepsin L-like
           proteinase - Strongylocentrotus purpuratus
          Length = 329

 Score = 87.0 bits (206), Expect = 3e-16
 Identities = 39/73 (53%), Positives = 47/73 (64%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           SFQLY SGVY++  CSST LD  +L+VGYG    G +YW      G  WG+ GYI + RN
Sbjct: 253 SFQLYVSGVYSDPNCSSTLLDLSLLLVGYGVSSVGTEYWICRNTWGEEWGDNGYINIARN 312

Query: 471 KNNRCGIASSASY 433
            NN CGIA+ A Y
Sbjct: 313 HNNMCGIATDAIY 325


>UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine
           proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like
           midgut cysteine proteinase - Tenebrio molitor (Yellow
           mealworm)
          Length = 330

 Score = 87.0 bits (206), Expect = 3e-16
 Identities = 37/71 (52%), Positives = 50/71 (70%)
 Frame = -1

Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466
           Q YS G++ ++ C+ +DL+HGVLVVGYG+D  G DYW  +   G  WGE GY + +RN  
Sbjct: 258 QFYSGGLFYDQTCNQSDLNHGVLVVGYGSDN-GQDYWILKNSWGSGWGESGYWRQVRNYG 316

Query: 465 NRCGIASSASY 433
           N CGIA++ASY
Sbjct: 317 NNCGIATAASY 327


>UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra
           erinaceieuropaei|Rep: Cysteine proteinase - Spirometra
           erinaceieuropaei (Tapeworm)
          Length = 336

 Score = 87.0 bits (206), Expect = 3e-16
 Identities = 42/75 (56%), Positives = 48/75 (64%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           F  YS GV+  + CS   +DHGVLVVGYG  E G  YW  +   G  WGE GY+KM RN+
Sbjct: 263 FMSYSHGVFVSKTCSPYAIDHGVLVVGYGA-ENGDAYWLVKNSWGSSWGEDGYLKMARNR 321

Query: 468 NNRCGIASSASYXXV 424
           NN CGIAS ASY  V
Sbjct: 322 NNMCGIASMASYPTV 336


>UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep:
           Cathepsin - Geodia cydonium (Sponge)
          Length = 322

 Score = 86.2 bits (204), Expect = 6e-16
 Identities = 41/78 (52%), Positives = 50/78 (64%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           H  FQLY  GVY+ + CS T LDHGVLVVGYG  ++  DYW  +   G  WG  G + M 
Sbjct: 243 HLGFQLYDGGVYHSDLCSQTRLDHGVLVVGYGVYKE-KDYWMVKNSWGTNWGISGDMMMS 301

Query: 477 RNKNNRCGIASSASYXXV 424
           RN++N CGIA+ ASY  V
Sbjct: 302 RNRDNNCGIATMASYPVV 319


>UniRef50_Q4TI44 Cluster: Chromosome undetermined SCAF2412, whole
           genome shotgun sequence; n=1; Tetraodon
           nigroviridis|Rep: Chromosome undetermined SCAF2412,
           whole genome shotgun sequence - Tetraodon nigroviridis
           (Green puffer)
          Length = 123

 Score = 85.8 bits (203), Expect = 8e-16
 Identities = 37/74 (50%), Positives = 48/74 (64%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           T+F LYS GVY + +C+  D++H VL+VGYG   +G  YW  +   G  WG  GYI M R
Sbjct: 47  TTFHLYSKGVYYDPDCNPEDINHAVLLVGYGVTRRGQQYWIVKNSWGTGWGTEGYILMAR 106

Query: 474 NKNNRCGIASSASY 433
           N+ N CGIA+ ASY
Sbjct: 107 NRGNLCGIANLASY 120


>UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicatein
           a3 - Lubomirskia baicalensis
          Length = 344

 Score = 85.8 bits (203), Expect = 8e-16
 Identities = 38/73 (52%), Positives = 49/73 (67%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           +F  Y SGV++   CS++ L+H +LV GYG+   G DYW  +   G  WGE GYIKM+RN
Sbjct: 270 AFMFYQSGVFDSSTCSTSKLNHAMLVTGYGSTN-GKDYWLVKNSWGTGWGESGYIKMVRN 328

Query: 471 KNNRCGIASSASY 433
           K N+CGIAS A Y
Sbjct: 329 KYNQCGIASDALY 341


>UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:
           Silicatein beta - Suberites domuncula (Sponge)
          Length = 383

 Score = 85.4 bits (202), Expect = 1e-15
 Identities = 39/74 (52%), Positives = 49/74 (66%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           TSFQ YS GV N   CSS+ L H ++V+GYG    G DYW  +   GP WG  GY K+ R
Sbjct: 308 TSFQFYSDGVLNVPYCSSSTLSHALVVIGYGK-YSGQDYWLVKNSWGPNWGVRGYGKLAR 366

Query: 474 NKNNRCGIASSASY 433
           NK N+CGIA++AS+
Sbjct: 367 NKGNKCGIATAASF 380


>UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor;
           n=35; Fasciola|Rep: Cathepsin L-like proteinase
           precursor - Fasciola hepatica (Liver fluke)
          Length = 326

 Score = 85.0 bits (201), Expect = 1e-15
 Identities = 38/77 (49%), Positives = 48/77 (62%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           + F +Y SG+Y  + CS   ++H VL VGYGT + G DYW  +   G  WGE GYI+M R
Sbjct: 247 SDFMMYRSGIYQSQTCSPLRVNHAVLAVGYGT-QGGTDYWIVKNSWGTYWGERGYIRMAR 305

Query: 474 NKNNRCGIASSASYXXV 424
           N+ N CGIAS AS   V
Sbjct: 306 NRGNMCGIASLASLPMV 322


>UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel
           protein - Danio rerio (Zebrafish) (Brachydanio rerio)
          Length = 328

 Score = 84.6 bits (200), Expect = 2e-15
 Identities = 40/76 (52%), Positives = 50/76 (65%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           SF  Y SG+YN+ +CSS  ++H VLVVGYG+ E G DYW  +   G  WGE GYI+M RN
Sbjct: 255 SFHRYRSGIYNDPKCSSALINHAVLVVGYGS-ENGQDYWLVKNSWGTAWGENGYIRMARN 313

Query: 471 KNNRCGIASSASYXXV 424
           K N CGI+S   Y  +
Sbjct: 314 K-NMCGISSFGIYPTI 328


>UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin K
           precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2);
           n=2; Tribolium castaneum|Rep: PREDICTED: similar to
           Cathepsin K precursor (Cathepsin O) (Cathepsin X)
           (Cathepsin O2) - Tribolium castaneum
          Length = 332

 Score = 84.2 bits (199), Expect = 2e-15
 Identities = 38/73 (52%), Positives = 49/73 (67%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           +F  Y SGVYN   C    L+H V++VGYG  E+GVDYW  +   G  WG+ GY+KM RN
Sbjct: 259 TFHKYKSGVYNNPSCRG-GLNHAVVIVGYGR-ERGVDYWLVKNSWGAGWGQKGYVKMARN 316

Query: 471 KNNRCGIASSASY 433
           + N+CGIA+ ASY
Sbjct: 317 RRNQCGIATHASY 329


>UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor;
           n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase
           precursor - Diabrotica virgifera virgifera (western corn
           rootworm)
          Length = 326

 Score = 83.8 bits (198), Expect = 3e-15
 Identities = 40/78 (51%), Positives = 53/78 (67%), Gaps = 2/78 (2%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSS--TDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           +FQLY SG+ ++  C S    L+HGVLVVGYGT+++  DYW  +   G  WG  GYI M 
Sbjct: 250 NFQLYDSGILDDSSCYSDFNSLNHGVLVVGYGTEKEQ-DYWIVKNSWGADWGMDGYIWMS 308

Query: 477 RNKNNRCGIASSASYXXV 424
           RNKNN+CGIA+ A+Y  +
Sbjct: 309 RNKNNQCGIATDATYPTI 326


>UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome
           shotgun sequence; n=3; Tetraodontidae|Rep: Chromosome 12
           SCAF14996, whole genome shotgun sequence - Tetraodon
           nigroviridis (Green puffer)
          Length = 362

 Score = 83.4 bits (197), Expect = 4e-15
 Identities = 38/73 (52%), Positives = 47/73 (64%), Gaps = 3/73 (4%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVD---YWXREELVGPLWGELGYI 487
           H SFQ Y SG+Y E+ECSS +LDHGVLVVGYG   + VD   +W  +      WG  GYI
Sbjct: 289 HESFQFYQSGIYYEKECSSEELDHGVLVVGYGFQGEDVDGKKFWIVKNSWSENWGNKGYI 348

Query: 486 KMIRNKNNRCGIA 448
            M +++ N CGIA
Sbjct: 349 YMAKDRKNHCGIA 361


>UniRef50_P25774 Cluster: Cathepsin S precursor; n=78;
           Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens
           (Human)
          Length = 331

 Score = 83.4 bits (197), Expect = 4e-15
 Identities = 42/78 (53%), Positives = 51/78 (65%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           H SF LY SGVY E  C+  +++HGVLVVGYG D  G +YW  +   G  +GE GYI+M 
Sbjct: 256 HPSFFLYRSGVYYEPSCTQ-NVNHGVLVVGYG-DLNGKEYWLVKNSWGHNFGEEGYIRMA 313

Query: 477 RNKNNRCGIASSASYXXV 424
           RNK N CGIAS  SY  +
Sbjct: 314 RNKGNHCGIASFPSYPEI 331


>UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila
           melanogaster|Rep: LD36817p - Drosophila melanogaster
           (Fruit fly)
          Length = 352

 Score = 83.0 bits (196), Expect = 6e-15
 Identities = 36/73 (49%), Positives = 48/73 (65%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           SF+ YS G+Y +EEC+  +L+H V VVGYGT E G DYW  +      WGE G+++++RN
Sbjct: 278 SFEQYSGGIYEDEECNQGELNHSVTVVGYGT-ENGRDYWIIKNSYSQNWGEGGFMRILRN 336

Query: 471 KNNRCGIASSASY 433
               CGIAS  SY
Sbjct: 337 AGGFCGIASECSY 349


>UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Rep:
           Cathepsin R precursor - Mus musculus (Mouse)
          Length = 334

 Score = 82.6 bits (195), Expect = 8e-15
 Identities = 38/81 (46%), Positives = 49/81 (60%), Gaps = 3/81 (3%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYG---TDEQGVDYWXREELVGPLWGELGYI 487
           H SF+ Y  G+Y+E  CSS  + HGVLVVGYG    +  G  YW  +   G  WG  GY+
Sbjct: 254 HESFKNYKGGIYHEPNCSSDTVTHGVLVVGYGFKGIETDGNHYWLIKNSWGKRWGIRGYM 313

Query: 486 KMIRNKNNRCGIASSASYXXV 424
           K+ ++KNN CGIAS A Y  +
Sbjct: 314 KLAKDKNNHCGIASYAHYPTI 334


>UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep:
           Cysteine proteinase - Entamoeba histolytica
          Length = 320

 Score = 82.2 bits (194), Expect = 1e-14
 Identities = 39/73 (53%), Positives = 47/73 (64%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           SFQLY SGVY+E +C    L+H V  VGYG+ + G DY+      G  WG  GYI M RN
Sbjct: 240 SFQLYKSGVYDEPKCKKVILNHAVCAVGYGSQD-GQDYYIVRNSWGTSWGMDGYILMSRN 298

Query: 471 KNNRCGIASSASY 433
           KNN+CGIA+ A Y
Sbjct: 299 KNNQCGIANDAIY 311


>UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18;
           Schistosoma|Rep: Preprocathepsin cathepsin L -
           Schistosoma japonicum (Blood fluke)
          Length = 331

 Score = 82.2 bits (194), Expect = 1e-14
 Identities = 36/73 (49%), Positives = 48/73 (65%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           S  LY SG+Y  ++C   D++HGVL VGYG  E G DYW  +   G LWG  GY K+ RN
Sbjct: 256 SLILYKSGIYESKDCKYADINHGVLAVGYGR-ENGKDYWLIKNSWGDLWGMNGYFKLRRN 314

Query: 471 KNNRCGIASSASY 433
           K + CGI+S++S+
Sbjct: 315 KPHMCGISSNSSF 327


>UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 234

 Score = 82.2 bits (194), Expect = 1e-14
 Identities = 38/70 (54%), Positives = 46/70 (65%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           F+LYSSGV++  +C    LDH V V+GYG  E G DYW      G  WG  GYIKM RNK
Sbjct: 160 FRLYSSGVFDNPKCGKIILDHVVTVIGYGV-EDGKDYWLVRNSWGKYWGLEGYIKMSRNK 218

Query: 468 NNRCGIASSA 439
           +N+CGIA+ A
Sbjct: 219 DNQCGIATEA 228


>UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep:
           Cathepsin K - Danio rerio (Zebrafish) (Brachydanio
           rerio)
          Length = 333

 Score = 81.8 bits (193), Expect = 1e-14
 Identities = 34/74 (45%), Positives = 47/74 (63%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           ++F  Y SGVY +  C+  D++H VL VGYG   +G  YW  +   G  WG+ GY+ M R
Sbjct: 257 STFLYYKSGVYYDPNCNKEDVNHAVLAVGYGATPRGKKYWIVKNSWGEEWGKKGYVLMAR 316

Query: 474 NKNNRCGIASSASY 433
           N+NN CGIA+ AS+
Sbjct: 317 NRNNACGIANLASF 330


>UniRef50_Q0MYX5 Cluster: Putative cysteine protease; n=1; Emiliania
           huxleyi|Rep: Putative cysteine protease - Emiliania
           huxleyi
          Length = 276

 Score = 81.8 bits (193), Expect = 1e-14
 Identities = 39/75 (52%), Positives = 51/75 (68%), Gaps = 1/75 (1%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTD-EQGVDYWXREELVGPLWGELGYIKMI 478
           ++FQLY SGV +   C   +LDHGVLVVGYGTD   G DYW  +   G  WGE G+++++
Sbjct: 74  SAFQLYQSGVIDSASCGK-ELDHGVLVVGYGTDTATGKDYWKIKNSWGGTWGEEGFVRVV 132

Query: 477 RNKNNRCGIASSASY 433
           + K N CGI+S ASY
Sbjct: 133 QGK-NMCGISSQASY 146


>UniRef50_UPI0000E224BB Cluster: PREDICTED: hypothetical protein;
           n=1; Pan troglodytes|Rep: PREDICTED: hypothetical
           protein - Pan troglodytes
          Length = 143

 Score = 81.4 bits (192), Expect = 2e-14
 Identities = 38/81 (46%), Positives = 46/81 (56%), Gaps = 3/81 (3%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGY---GTDEQGVDYWXREELVGPLWGELGYI 487
           H SFQ Y  G+Y E  C    LDH +LVVGY   G D     YW  +   G  WG  GYI
Sbjct: 63  HVSFQFYKKGIYFEPRCDPEGLDHAMLVVGYSYEGADSDNNKYWLVKNSWGKNWGMDGYI 122

Query: 486 KMIRNKNNRCGIASSASYXXV 424
           KM +++ N CGIA++ASY  V
Sbjct: 123 KMAKDRRNNCGIATAASYPTV 143


>UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2;
           Platyhelminthes|Rep: Cathepsin L-like proteinase -
           Echinococcus multilocularis
          Length = 338

 Score = 81.4 bits (192), Expect = 2e-14
 Identities = 36/77 (46%), Positives = 45/77 (58%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           + F LY  G+Y +  CS   LDH VLVVGY  D+    YW  +   G  WG+ GYI M R
Sbjct: 262 SGFMLYKKGIYQDNTCSQQYLDHAVLVVGYDADKTRQKYWIVKNSWGEDWGQRGYIWMAR 321

Query: 474 NKNNRCGIASSASYXXV 424
           +K N CGIA+ ASY  +
Sbjct: 322 DKGNMCGIATMASYPLI 338


>UniRef50_A2F4G0 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 203

 Score = 81.4 bits (192), Expect = 2e-14
 Identities = 36/70 (51%), Positives = 47/70 (67%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           TSFQLY SG+Y E +CS+  +D  +  VGYGT E   +YW  +   G  WGE GYI+MI+
Sbjct: 127 TSFQLYQSGIYYEPDCSTETMDLSMACVGYGT-EGTTNYWIVKNCFGDKWGEQGYIRMIK 185

Query: 474 NKNNRCGIAS 445
           +KNN C IA+
Sbjct: 186 DKNNNCAIAT 195


>UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1];
           n=11; Eutheria|Rep: Testin-2 precursor [Contains:
           Testin-1] - Mus musculus (Mouse)
          Length = 333

 Score = 81.4 bits (192), Expect = 2e-14
 Identities = 38/81 (46%), Positives = 48/81 (59%), Gaps = 3/81 (3%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGY---GTDEQGVDYWXREELVGPLWGELGYI 487
           H SFQ Y SG+Y E +C    L+H VLVVGY   G +  G  YW  +   G  WG  GYI
Sbjct: 253 HDSFQFYDSGIYYEPQCKRVHLNHAVLVVGYGFEGEESDGNSYWLVKNSWGEEWGMKGYI 312

Query: 486 KMIRNKNNRCGIASSASYXXV 424
           K+ ++ NN CGIA+ A+Y  V
Sbjct: 313 KIAKDWNNHCGIATLATYPIV 333


>UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus
           tropicalis|Rep: LOC594890 protein - Xenopus tropicalis
           (Western clawed frog) (Silurana tropicalis)
          Length = 355

 Score = 81.0 bits (191), Expect = 2e-14
 Identities = 37/76 (48%), Positives = 50/76 (65%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           +F++Y +GVY +  CSS+  DH VLVVGYG  E GV+YW  +   G  +G+ GYIKM RN
Sbjct: 281 TFRMYKNGVYYDPNCSSSTPDHSVLVVGYGA-EDGVEYWLVKNSWGTSFGDEGYIKMARN 339

Query: 471 KNNRCGIASSASYXXV 424
            +N CGIA+   +  V
Sbjct: 340 HHNNCGIANFGCFPVV 355


>UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3;
           Bilateria|Rep: Cathepsin L-like cysteine proteinase -
           Longidorus elongatus
          Length = 358

 Score = 81.0 bits (191), Expect = 2e-14
 Identities = 38/75 (50%), Positives = 45/75 (60%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           FQ YS GVY +  CS   LDHGVL VGY + + G  Y+  +      WG+ GYI M R K
Sbjct: 282 FQFYSHGVYYDRSCSPEYLDHGVLAVGYNSTKDGKQYYIVKNSWSEDWGDDGYILMSRRK 341

Query: 468 NNRCGIASSASYXXV 424
           NN CGIA+ ASY  V
Sbjct: 342 NNNCGIATMASYPFV 356


>UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean
           endopeptidase) (Cysteine proteinase)
           (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
           Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor
           (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase)
           (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
           Vignain-2] - Vigna mungo (Rice bean) (Black gram)
          Length = 362

 Score = 79.4 bits (187), Expect = 7e-14
 Identities = 40/77 (51%), Positives = 48/77 (62%), Gaps = 3/77 (3%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           + FQ YS GV+  + C+ TDL+HGV +VGYGT   G +YW      GP WGE GYI+M R
Sbjct: 268 SDFQFYSEGVFTGD-CN-TDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQR 325

Query: 474 N---KNNRCGIASSASY 433
           N   K   CGIA  ASY
Sbjct: 326 NISKKEGLCGIAMMASY 342


>UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18;
           Magnoliophyta|Rep: Thiol protease aleurain precursor -
           Arabidopsis thaliana (Mouse-ear cress)
          Length = 358

 Score = 79.0 bits (186), Expect = 9e-14
 Identities = 41/78 (52%), Positives = 48/78 (61%), Gaps = 2/78 (2%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           SF+LY SGVY +  C ST  D++H VL VGYG  E GV YW  +   G  WG+ GY KM 
Sbjct: 282 SFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGV-EDGVPYWLIKNSWGADWGDKGYFKME 340

Query: 477 RNKNNRCGIASSASYXXV 424
             K N CGIA+ ASY  V
Sbjct: 341 MGK-NMCGIATCASYPVV 357


>UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 339

 Score = 78.6 bits (185), Expect = 1e-13
 Identities = 36/70 (51%), Positives = 46/70 (65%), Gaps = 1/70 (1%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYG-TDEQGVDYWXREELVGPLWGELGYIKMIR 475
           SF LY SGVY +  CSST L+HG+L +G+G T E G +Y+  +   G  WG  GYI + R
Sbjct: 263 SFMLYKSGVYKDPSCSSTILNHGILNIGFGVTPENGNEYYILKNSFGSKWGMKGYIYLSR 322

Query: 474 NKNNRCGIAS 445
           N NN CGI+S
Sbjct: 323 NFNNHCGISS 332


>UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3;
           Schistosoma japonicum|Rep: Clone ZZD209 mRNA sequence -
           Schistosoma japonicum (Blood fluke)
          Length = 339

 Score = 78.2 bits (184), Expect = 2e-13
 Identities = 33/70 (47%), Positives = 44/70 (62%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           F  Y SG+Y  + C+  +L+  +L+VGYG D  G+DYW  +   G  WGE GY+K+ RN 
Sbjct: 262 FLHYKSGIYQSDTCTHYNLNQSMLLVGYGYDNDGIDYWIVQNSWGKKWGESGYVKVRRNN 321

Query: 468 NNRCGIASSA 439
            N CGIAS A
Sbjct: 322 WNMCGIASLA 331


>UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin l;
           n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
           similar to cathepsin l - Strongylocentrotus purpuratus
          Length = 489

 Score = 77.4 bits (182), Expect = 3e-13
 Identities = 36/75 (48%), Positives = 45/75 (60%), Gaps = 2/75 (2%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           SF  YS G Y +  C +T  DLDH VL VGYGTD  G DYW  +      WG  GY+  I
Sbjct: 410 SFSFYSYGTYYDASCGNTVDDLDHAVLAVGYGTDSSGQDYWLIKNSWSTHWGNNGYV-AI 468

Query: 477 RNKNNRCGIASSASY 433
             K+N CG+A++A+Y
Sbjct: 469 SMKDNNCGVATAATY 483


>UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1;
           Rhipicephalus appendiculatus|Rep: Midgut cysteine
           proteinase 4 - Rhipicephalus appendiculatus (Brown ear
           tick)
          Length = 345

 Score = 77.4 bits (182), Expect = 3e-13
 Identities = 34/73 (46%), Positives = 48/73 (65%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           +F  Y +G+Y E  C    L+H VL+VGYG +E+GV YW  +   GP WGE GYIK++RN
Sbjct: 272 TFMFYKNGIYGEPNCDPRGLNHAVLLVGYG-EERGVPYWIVKNSWGPGWGEGGYIKILRN 330

Query: 471 KNNRCGIASSASY 433
           + N CG++   S+
Sbjct: 331 R-NVCGMSQDPSF 342


>UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila
           melanogaster|Rep: CG11459-PA - Drosophila melanogaster
           (Fruit fly)
          Length = 336

 Score = 77.0 bits (181), Expect = 4e-13
 Identities = 36/77 (46%), Positives = 44/77 (57%), Gaps = 2/77 (2%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIK 484
           H  F  YS GV +   C S   DL H VL+VG+GT  +  DYW  +   G  WGE GY+K
Sbjct: 257 HEEFDQYSGGVLSIPACRSKRQDLTHSVLLVGFGTHRKWGDYWIIKNSYGTDWGESGYLK 316

Query: 483 MIRNKNNRCGIASSASY 433
           + RN NN CG+AS   Y
Sbjct: 317 LARNANNMCGVASLPQY 333


>UniRef50_Q7KWP5 Cluster: Similar to Dictyostelium discoideum (Slime
           mold). Cysteine proteinase 5; n=2; Dictyostelium
           discoideum|Rep: Similar to Dictyostelium discoideum
           (Slime mold). Cysteine proteinase 5 - Dictyostelium
           discoideum (Slime mold)
          Length = 345

 Score = 77.0 bits (181), Expect = 4e-13
 Identities = 37/85 (43%), Positives = 53/85 (62%), Gaps = 8/85 (9%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYG------TD--EQGVDYWXREELVGPLWGE 499
           +SFQ YSSG+Y E  C+STDL+H +L+VG+       TD  +   +YW  +   G  WGE
Sbjct: 261 SSFQFYSSGIYYEPSCNSTDLNHSILIVGFSDFSTTPTDSLKHSSNYWIVQNSFGKNWGE 320

Query: 498 LGYIKMIRNKNNRCGIASSASYXXV 424
            GYI M +++++ CGI+  ASY  V
Sbjct: 321 NGYIFMSKDRDDNCGISKMASYVIV 345


>UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes
           abbreviatus|Rep: Cathepsin L 2 - Diaprepes abbreviatus
           (Sugarcane rootstalk borer weevil)
          Length = 348

 Score = 76.6 bits (180), Expect = 5e-13
 Identities = 36/75 (48%), Positives = 50/75 (66%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           FQ Y SGVY+E +C  + L+H +L VGYG+   G ++W  +   G  WG+ GYI+M ++K
Sbjct: 276 FQFYHSGVYDEPQCGHS-LNHAMLAVGYGS-MGGKNFWLVKNSWGTGWGDQGYIRMAKDK 333

Query: 468 NNRCGIASSASYXXV 424
           NN+CGIA  ASY  V
Sbjct: 334 NNQCGIALMASYPGV 348


>UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 291

 Score = 76.6 bits (180), Expect = 5e-13
 Identities = 35/72 (48%), Positives = 47/72 (65%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           SFQLYSSG+Y++  CSS +LDH + VVGY        YW      G  WGE GY+++ ++
Sbjct: 220 SFQLYSSGIYSDPCCSSQNLDHAMNVVGYSD-----SYWIIRNSWGTSWGESGYMRLAKD 274

Query: 471 KNNRCGIASSAS 436
           KNN CG+A+ AS
Sbjct: 275 KNNMCGVATMAS 286


>UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-like
           cysteine peptidase; n=3; Trichomonas vaginalis|Rep: Clan
           CA, family C1, cathepsin L or K-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 320

 Score = 76.6 bits (180), Expect = 5e-13
 Identities = 33/69 (47%), Positives = 44/69 (63%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           SF  Y SG+Y++ +C  T LDH V +VGYG+ E G++YW      G  WGE GYI++I N
Sbjct: 245 SFMQYKSGIYDDTKCDPTQLDHYVNLVGYGS-ESGINYWIIRNSWGEAWGESGYIRIINN 303

Query: 471 KNNRCGIAS 445
             N CG+ S
Sbjct: 304 AANVCGVLS 312


>UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Oryza
           sativa|Rep: Cysteine protease 1 precursor - Oryza sativa
           subsp. japonica (Rice)
          Length = 490

 Score = 75.8 bits (178), Expect = 9e-13
 Identities = 41/76 (53%), Positives = 47/76 (61%), Gaps = 4/76 (5%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQ-GVDYWXREELVGPLWGELGYIKMIRN 472
           FQLY SGV+    C  T+LDHGV+ VGYGTD   G  YW      GP WGE GYI+M RN
Sbjct: 299 FQLYDSGVFTGR-CG-TNLDHGVVAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERN 356

Query: 471 ---KNNRCGIASSASY 433
              +  +CGIA  ASY
Sbjct: 357 VTARTGKCGIAMMASY 372


>UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:
           Cathepsin - Petromyzon marinus (Sea lamprey)
          Length = 333

 Score = 75.4 bits (177), Expect = 1e-12
 Identities = 36/76 (47%), Positives = 49/76 (64%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           +F+ Y SG++NE  C  +  +H +LVVGYG+   G D+W  +   G  WGE GYI MIRN
Sbjct: 260 TFKHYKSGLFNEPSCDKSP-NHAMLVVGYGS-LSGNDFWIVKNSWGEDWGEKGYIYMIRN 317

Query: 471 KNNRCGIASSASYXXV 424
           K+N+CGIAS   Y  +
Sbjct: 318 KDNQCGIASIGIYPII 333


>UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3;
           Curculionidae|Rep: Cysteine proteinase - Hypera postica
           (alfalfa weevil)
          Length = 324

 Score = 75.4 bits (177), Expect = 1e-12
 Identities = 32/72 (44%), Positives = 44/72 (61%)
 Frame = -1

Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNR 460
           Y SG+Y +++CS   L+H +L VGYGT E G DYW  +   G  WGE GY ++ R K N+
Sbjct: 254 YDSGIYEDQDCSPAGLNHAILAVGYGT-ENGKDYWIIKNSWGASWGEQGYFRLARGK-NQ 311

Query: 459 CGIASSASYXXV 424
           CGI+    Y  +
Sbjct: 312 CGISEDTVYPTI 323


>UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor;
           n=176; Viridiplantae|Rep: Cysteine proteinase RD21a
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 462

 Score = 74.5 bits (175), Expect = 2e-12
 Identities = 38/76 (50%), Positives = 47/76 (61%), Gaps = 3/76 (3%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           +FQLY SG++ +  C  T LDHGV+ VGYGT E G DYW      G  WGE GY++M RN
Sbjct: 278 AFQLYDSGIF-DGSCG-TQLDHGVVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLRMARN 334

Query: 471 ---KNNRCGIASSASY 433
               + +CGIA   SY
Sbjct: 335 IASSSGKCGIAIEPSY 350


>UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh
           fly) (Boettcherisca peregrina). Cathepsin L; n=2;
           Dictyostelium discoideum|Rep: Similar to Sarcophaga
           peregrina (Flesh fly) (Boettcherisca peregrina).
           Cathepsin L - Dictyostelium discoideum (Slime mold)
          Length = 265

 Score = 73.7 bits (173), Expect = 3e-12
 Identities = 32/75 (42%), Positives = 43/75 (57%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           FQ  S G+Y  + C   +  H VL +GYGTDE GVDY+  +   G  WG  G+ K+ R  
Sbjct: 190 FQHLSGGIYYSDSCDPWNTIHAVLAIGYGTDENGVDYFLMKNSWGKSWGTNGFFKVKRGV 249

Query: 468 NNRCGIASSASYXXV 424
             +CGI ++ASY  V
Sbjct: 250 KGKCGIVTAASYPIV 264


>UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9;
           Onchocercidae|Rep: Cathepsin L-like precursor - Brugia
           pahangi (Filarial nematode worm)
          Length = 395

 Score = 73.7 bits (173), Expect = 3e-12
 Identities = 35/73 (47%), Positives = 43/73 (58%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           SF+ Y  GVY+E  C   D  H VL VGYGT     DYW  +   G  WG+ GY+ M RN
Sbjct: 323 SFRFYKDGVYSEGNCGRPD--HAVLAVGYGTHPSYGDYWIVKNSWGTDWGKDGYVYMARN 380

Query: 471 KNNRCGIASSASY 433
           + N C IAS+AS+
Sbjct: 381 RGNMCHIASAASF 393


>UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n=1;
           Rattus norvegicus|Rep: UPI0000501FDB UniRef100 entry -
           Rattus norvegicus
          Length = 338

 Score = 73.3 bits (172), Expect = 5e-12
 Identities = 33/81 (40%), Positives = 51/81 (62%), Gaps = 3/81 (3%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYG---TDEQGVDYWXREELVGPLWGELGYI 487
           H+S + Y  G+Y+E +C++  ++H VLVVGYG    +  G +YW  +   G  WG  GY+
Sbjct: 259 HSSLRFYKKGIYHEPKCNNY-VNHAVLVVGYGFEGNETDGNNYWLIQNSWGERWGLNGYM 317

Query: 486 KMIRNKNNRCGIASSASYXXV 424
           K+ +++NN CGIA+ A Y  V
Sbjct: 318 KIAKDRNNHCGIATFAQYPIV 338


>UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1;
           Dictyostelium discoideum AX4|Rep: Counting factor
           associated protein - Dictyostelium discoideum AX4
          Length = 531

 Score = 73.3 bits (172), Expect = 5e-12
 Identities = 36/74 (48%), Positives = 45/74 (60%), Gaps = 2/74 (2%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSS--TDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           F+ Y SGVYN   C +   DLDH VL +GYGT  QG DY+  +      WG  GY+ M R
Sbjct: 453 FRYYMSGVYNNPACKNGLDDLDHEVLAIGYGT-YQGQDYFLVKNSWSTNWGMDGYVYMAR 511

Query: 474 NKNNRCGIASSASY 433
           N NN CG++S A+Y
Sbjct: 512 NDNNLCGVSSQATY 525


>UniRef50_Q24HI2 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 356

 Score = 73.3 bits (172), Expect = 5e-12
 Identities = 36/74 (48%), Positives = 48/74 (64%), Gaps = 2/74 (2%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTD--LDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           F+LY SGVY+  +CSS+   ++H VL VGYG+ E GVDYW  +      WG+ GY K+ R
Sbjct: 271 FKLYKSGVYSNPDCSSSPQTVNHAVLAVGYGS-ENGVDYWYVKNSWSEFWGDEGYFKIQR 329

Query: 474 NKNNRCGIASSASY 433
              N CG+A+ ASY
Sbjct: 330 GV-NMCGVATCASY 342


>UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2;
           Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio
           molitor (Yellow mealworm)
          Length = 336

 Score = 72.9 bits (171), Expect = 6e-12
 Identities = 35/71 (49%), Positives = 40/71 (56%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           F  YS GVY    C +    H VL+VGYG +E G DYW  +   G  WG  GY K+ RN 
Sbjct: 263 FGSYSGGVYYNPTCETNKFTHAVLIVGYG-NENGQDYWLVKNSWGDGWGLDGYFKIARNA 321

Query: 468 NNRCGIASSAS 436
           NN CGIA  AS
Sbjct: 322 NNHCGIAGVAS 332


>UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1;
           Phytophthora infestans|Rep: Cathepsin-like cysteine
           protease - Phytophthora infestans (Potato late blight
           fungus)
          Length = 376

 Score = 72.5 bits (170), Expect = 8e-12
 Identities = 36/75 (48%), Positives = 46/75 (61%), Gaps = 2/75 (2%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTD--LDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           +FQLY  GVY+   C +    LDHGV   GYG  ++  DYW  +   G  WG  GYI M 
Sbjct: 278 TFQLYRHGVYSWPLCGNAPDALDHGVAAAGYGVYKKK-DYWLVKNSWGNSWGMKGYIMMS 336

Query: 477 RNKNNRCGIASSASY 433
           RNK+N+CGIA+ A+Y
Sbjct: 337 RNKDNQCGIATDATY 351


>UniRef50_A3EXS2 Cluster: Cathepsin L-like cysteine proteinase-like
           protein; n=1; Maconellicoccus hirsutus|Rep: Cathepsin
           L-like cysteine proteinase-like protein -
           Maconellicoccus hirsutus (hibiscus mealybug)
          Length = 253

 Score = 72.1 bits (169), Expect = 1e-11
 Identities = 31/70 (44%), Positives = 46/70 (65%), Gaps = 2/70 (2%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           SF+ Y   +Y++ +C ++  +  + VLVVGYGTD    DYW  +  +G  WGE GY+++ 
Sbjct: 176 SFKHYKGDIYDDPQCDNSRHESSYAVLVVGYGTDNN-TDYWLIKNSLGTSWGEKGYMRLA 234

Query: 477 RNKNNRCGIA 448
           RN+NN CGIA
Sbjct: 235 RNRNNLCGIA 244


>UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella
           histriomuscorum|Rep: Cathepsin H - Oxytricha trifallax
           (Sterkiella histriomuscorum)
          Length = 366

 Score = 70.9 bits (166), Expect = 2e-11
 Identities = 35/74 (47%), Positives = 45/74 (60%), Gaps = 2/74 (2%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSS--TDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           F+ Y SGVY  E C++   D++H VL VG+GTDE  VDYW  +   G  WG+ G+ KM R
Sbjct: 277 FRDYKSGVYAVEGCANGPNDVNHAVLAVGFGTDENKVDYWIIKNSWGAAWGDQGFFKMKR 336

Query: 474 NKNNRCGIASSASY 433
              N CGI +  SY
Sbjct: 337 GV-NMCGIQNCNSY 349


>UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_26_47548_45815 - Giardia lamblia
           ATCC 50803
          Length = 577

 Score = 70.9 bits (166), Expect = 2e-11
 Identities = 37/78 (47%), Positives = 42/78 (53%), Gaps = 2/78 (2%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEEC--SSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           S   YS GVYN+  C     DL H VL VGYGTD+   DYW       PLWG  GY   +
Sbjct: 496 SLLFYSGGVYNDPACPYKYDDLSHAVLAVGYGTDDTYGDYWIVRNSWSPLWGMDGYF-YL 554

Query: 477 RNKNNRCGIASSASYXXV 424
             K+N CGI + ASY  V
Sbjct: 555 SMKDNICGILTDASYAVV 572


>UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|Rep:
           CG4847-PD, isoform D - Drosophila melanogaster (Fruit
           fly)
          Length = 420

 Score = 70.9 bits (166), Expect = 2e-11
 Identities = 32/72 (44%), Positives = 45/72 (62%)
 Frame = -1

Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNR 460
           Y+ G+YN++EC+  + +H +LVVGYG+ E+G DYW  +      WGE GY ++ R K N 
Sbjct: 351 YAGGIYNDDECNKGEPNHSILVVGYGS-EKGQDYWIVKNSWDDTWGEKGYFRLPRGK-NY 408

Query: 459 CGIASSASYXXV 424
           C IA   SY  V
Sbjct: 409 CFIAEECSYPVV 420


>UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome
           shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
           Chromosome 21 SCAF14577, whole genome shotgun sequence -
           Tetraodon nigroviridis (Green puffer)
          Length = 478

 Score = 70.5 bits (165), Expect = 3e-11
 Identities = 36/77 (46%), Positives = 45/77 (58%), Gaps = 2/77 (2%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIK 484
           H SF  YS+GVY E  C ST  DLDH VL VGYG +  G  YW  +      WG  GYI 
Sbjct: 400 HRSFVFYSNGVYYEPACGSTVEDLDHAVLAVGYG-NLNGEPYWLIKNSWSTYWGNDGYI- 457

Query: 483 MIRNKNNRCGIASSASY 433
           ++  K+N CG+ + A+Y
Sbjct: 458 LMSMKDNNCGVTTDATY 474


>UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2;
           Taeniidae|Rep: Cathepsin L-like cysteine proteinase -
           Taenia solium (Pork tapeworm)
          Length = 339

 Score = 70.5 bits (165), Expect = 3e-11
 Identities = 32/75 (42%), Positives = 42/75 (56%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           F  Y  G+Y    CSS  L+HGVL +GYG  + G  YW  +   G  WG  GYI M ++ 
Sbjct: 266 FMFYRHGIYKSHWCSSKFLNHGVLAIGYGKQD-GKPYWLVKNSWGTRWGMKGYIMMAKDY 324

Query: 468 NNRCGIASSASYXXV 424
           +N CG+AS A +  V
Sbjct: 325 HNMCGVASLADFPYV 339


>UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3;
           Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber
           officinale (Ginger)
          Length = 475

 Score = 70.1 bits (164), Expect = 4e-11
 Identities = 39/76 (51%), Positives = 47/76 (61%), Gaps = 3/76 (3%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           +FQLY SG++    C+ T L+HGV VVGYGT E G DYW  +   G  WG  GYI M RN
Sbjct: 283 NFQLYHSGIFTGS-CN-TSLNHGVTVVGYGT-ENGNDYWIVKNSWGENWGNSGYILMERN 339

Query: 471 ---KNNRCGIASSASY 433
               + +CGIA S SY
Sbjct: 340 IAESSGKCGIAISPSY 355


>UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila
           melanogaster|Rep: CG5367-PA - Drosophila melanogaster
           (Fruit fly)
          Length = 338

 Score = 70.1 bits (164), Expect = 4e-11
 Identities = 34/76 (44%), Positives = 48/76 (63%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           +FQLYS G+Y++  CSS  ++H ++V+G+G      DYW  +   G  WGE GYI+ IR 
Sbjct: 269 TFQLYSDGIYDDPLCSSASVNHAMVVIGFGK-----DYWILKNWWGQNWGENGYIR-IRK 322

Query: 471 KNNRCGIASSASYXXV 424
             N CGIA+ A+Y  V
Sbjct: 323 GVNMCGIANYAAYAIV 338


>UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain;
           n=16; Chrysomelidae|Rep: Digestive cysteine protease
           intestain - Leptinotarsa decemlineata (Colorado potato
           beetle)
          Length = 326

 Score = 70.1 bits (164), Expect = 4e-11
 Identities = 35/74 (47%), Positives = 45/74 (60%), Gaps = 3/74 (4%)
 Frame = -1

Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQG---VDYWXREELVGPLWGELGYIKMIR 475
           QLY SG+ + + CS  DLDHGVLVVGYG   Q      +W  +   G +WGE GY ++ R
Sbjct: 251 QLYYSGIISGKGCSH-DLDHGVLVVGYGKASQWSGETKFWRVKNSWGKIWGENGYFRIKR 309

Query: 474 NKNNRCGIASSASY 433
           + NN CGIA   +Y
Sbjct: 310 DANNLCGIADDPTY 323


>UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor;
           n=6; Chrysomelinae|Rep: Cathepsin L-like proteinase
           precursor - Phaedon cochleariae (Mustard beetle)
          Length = 324

 Score = 70.1 bits (164), Expect = 4e-11
 Identities = 30/69 (43%), Positives = 42/69 (60%)
 Frame = -1

Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNR 460
           Y  G++++  C   +L HGV VVGYG  E G  YW  +   G  WGE GYI++IR+ ++ 
Sbjct: 253 YGGGIFDDSSCLGDNLHHGVNVVGYGI-ENGQKYWIIKNTWGADWGESGYIRLIRDTDHS 311

Query: 459 CGIASSASY 433
           CG+   ASY
Sbjct: 312 CGVEKMASY 320


>UniRef50_Q9LUX8 Cluster: Cysteine protease; n=1; Pyrus
           pyrifolia|Rep: Cysteine protease - Pyrus pyrifolia
           (Japanese pear) (Pyrus serotina)
          Length = 147

 Score = 69.7 bits (163), Expect = 6e-11
 Identities = 39/71 (54%), Positives = 43/71 (60%), Gaps = 4/71 (5%)
 Frame = -1

Query: 633 SGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNR-- 460
           SGV+    C  TDLDHGV VVGYGTD+ G+DYW      G  WGE GYI+M RN  N   
Sbjct: 1   SGVFTGR-CG-TDLDHGVTVVGYGTDK-GLDYWIVRNSWGESWGEKGYIRMQRNLGNTAN 57

Query: 459 --CGIASSASY 433
             CGIA   SY
Sbjct: 58  GICGIAMEPSY 68


>UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 293

 Score = 69.7 bits (163), Expect = 6e-11
 Identities = 28/66 (42%), Positives = 42/66 (63%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           F+ YSS VY+  +C    + H +++ GYGTD  G DYW  +   G  WG  GYI+++RNK
Sbjct: 219 FEWYSSCVYDNPDCDPWGICHWMMICGYGTDA-GKDYWLAKNSFGSTWGMEGYIELVRNK 277

Query: 468 NNRCGI 451
           + +CG+
Sbjct: 278 DGQCGV 283


>UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin S
           preproprotein; n=2; Tribolium castaneum|Rep: PREDICTED:
           similar to cathepsin S preproprotein - Tribolium
           castaneum
          Length = 525

 Score = 69.3 bits (162), Expect = 8e-11
 Identities = 31/72 (43%), Positives = 44/72 (61%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           F+ YS GV+  + C+     H  ++VGYGT E G D+W  +   GP WG  GY+K+ RN+
Sbjct: 452 FKSYSGGVFYNKTCTRMKT-HVAVLVGYGT-ENGEDFWLVKNSYGPQWGLDGYVKIARNR 509

Query: 468 NNRCGIASSASY 433
           NN CGI +  +Y
Sbjct: 510 NNHCGITNRITY 521


>UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24;
           Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa
           (Rice)
          Length = 339

 Score = 69.3 bits (162), Expect = 8e-11
 Identities = 33/76 (43%), Positives = 43/76 (56%), Gaps = 3/76 (3%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKM--- 481
           +FQ YS GV     C  TDLDHG++ +GYG D  G  YW  +   G  WGE G+++M   
Sbjct: 263 TFQFYSGGVMTGS-CG-TDLDHGIVAIGYGKDGDGTQYWLLKNSWGTTWGENGFLRMEKD 320

Query: 480 IRNKNNRCGIASSASY 433
           I +K   CG+A   SY
Sbjct: 321 ISDKRGMCGLAMEPSY 336


>UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep:
           Actinidin Act3a - Actinidia eriantha
          Length = 380

 Score = 69.3 bits (162), Expect = 8e-11
 Identities = 34/74 (45%), Positives = 44/74 (59%), Gaps = 2/74 (2%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN- 472
           F+ Y SG++    C +T L+H V ++GYGT E G+DYW  +   G  WGE GY K+ RN 
Sbjct: 269 FRFYQSGIFTGGSCGTT-LNHAVTIIGYGT-ENGIDYWIVKNSYGTQWGESGYGKVQRNV 326

Query: 471 -KNNRCGIASSASY 433
               RCGIAS   Y
Sbjct: 327 GGEGRCGIASYPFY 340


>UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin
           heavy chain; n=3; Amniota|Rep: PREDICTED: similar to
           ferritin heavy chain - Ornithorhynchus anatinus
          Length = 338

 Score = 68.9 bits (161), Expect = 1e-10
 Identities = 29/75 (38%), Positives = 46/75 (61%), Gaps = 3/75 (4%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQ---GVDYWXREELVGPLWGELGYIKMI 478
           F  Y SG+++   C+   ++H +L VGYGT ++   G DYW  +      WGE GY++++
Sbjct: 262 FFFYHSGIFSSHSCTQK-VNHAMLAVGYGTSKEPGGGQDYWILKNSWSERWGEQGYMRLL 320

Query: 477 RNKNNRCGIASSASY 433
           +  NN CG+AS AS+
Sbjct: 321 KGANNHCGVASVASF 335


>UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4;
           core eudicotyledons|Rep: Papain-like cysteine peptidase
           XBCP3 - Arabidopsis thaliana (Mouse-ear cress)
          Length = 437

 Score = 68.9 bits (161), Expect = 1e-10
 Identities = 38/76 (50%), Positives = 48/76 (63%), Gaps = 3/76 (3%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           +FQLYSSG+++   CS T LDH VL+VGYG+ + GVDYW  +   G  WG  G++ M RN
Sbjct: 259 AFQLYSSGIFSGP-CS-TSLDHAVLIVGYGS-QNGVDYWIVKNSWGKSWGMDGFMHMQRN 315

Query: 471 KNNR---CGIASSASY 433
             N    CGI   ASY
Sbjct: 316 TENSDGVCGINMLASY 331


>UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 513

 Score = 68.9 bits (161), Expect = 1e-10
 Identities = 32/71 (45%), Positives = 45/71 (63%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           +F+ Y SG+Y + +C+   LDH  L VGYG +E+GV YW  +     +WGE GYIK I  
Sbjct: 439 TFKFYGSGIYYDTQCTHA-LDHAALAVGYG-EEKGVSYWIVKNSWSAMWGEEGYIK-IAM 495

Query: 471 KNNRCGIASSA 439
           K++ CG+A  A
Sbjct: 496 KDDNCGVAQKA 506


>UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4;
           Giardia intestinalis|Rep: Cathepsin B-like CP2 precursor
           - Giardia lamblia (Giardia intestinalis)
          Length = 300

 Score = 68.9 bits (161), Expect = 1e-10
 Identities = 34/73 (46%), Positives = 42/73 (57%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           H+ F  Y SGVY +      +  H V +VGYGTD+ GVDYW  +   GP WGE GY +MI
Sbjct: 223 HSDFMYYESGVY-QHTYGYMEGGHAVEMVGYGTDDDGVDYWIIKNSWGPDWGEDGYFRMI 281

Query: 477 RNKNNRCGIASSA 439
           R  N+ C I   A
Sbjct: 282 RGIND-CSIEEQA 293


>UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue of
           Sarcophaga 26,29kDa proteinase; n=1; Nasonia
           vitripennis|Rep: PREDICTED: similar to homologue of
           Sarcophaga 26,29kDa proteinase - Nasonia vitripennis
          Length = 553

 Score = 68.1 bits (159), Expect = 2e-10
 Identities = 34/77 (44%), Positives = 45/77 (58%), Gaps = 2/77 (2%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTD--LDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIK 484
           H +F  YS+GVY E  C +T+  LDH VL VGYGT   G  +W  +      WG  GYI 
Sbjct: 475 HKTFSFYSNGVYYEPACGNTENSLDHAVLAVGYGT-INGKGFWLIKNSWSNYWGNDGYIL 533

Query: 483 MIRNKNNRCGIASSASY 433
           M + KNN CG+ ++ +Y
Sbjct: 534 MAQ-KNNNCGVMTAPTY 549


>UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;
           n=1; Nasonia vitripennis|Rep: PREDICTED: similar to
           CG5367-PA - Nasonia vitripennis
          Length = 362

 Score = 68.1 bits (159), Expect = 2e-10
 Identities = 32/76 (42%), Positives = 49/76 (64%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           +FQLY SG+Y++  CSS  ++H +L+VGY       +YW  +   G  WGE GY+++ + 
Sbjct: 293 TFQLYHSGIYDDPTCSSDLVNHAMLIVGYTP-----NYWILKNWWGASWGENGYMRLRKG 347

Query: 471 KNNRCGIASSASYXXV 424
           K NRCG+A+ A+Y  V
Sbjct: 348 K-NRCGVANYAAYAKV 362


>UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326;
           n=2; Danio rerio|Rep: hypothetical protein LOC550326 -
           Danio rerio
          Length = 531

 Score = 68.1 bits (159), Expect = 2e-10
 Identities = 35/77 (45%), Positives = 43/77 (55%), Gaps = 2/77 (2%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSS--TDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIK 484
           H SF  YS+GVY E EC +   DLDH VL VGYG       YW  +      WG  GYI 
Sbjct: 453 HRSFAFYSNGVYYEPECKNGINDLDHAVLAVGYGI-MNNESYWLVKNSWSSYWGNDGYI- 510

Query: 483 MIRNKNNRCGIASSASY 433
           ++  K+N CG+A+ A Y
Sbjct: 511 LMSMKDNNCGVATDAIY 527


>UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryza
           sativa (japonica cultivar-group)|Rep: Putative cysteine
           proteinase - Oryza sativa subsp. japonica (Rice)
          Length = 357

 Score = 68.1 bits (159), Expect = 2e-10
 Identities = 33/76 (43%), Positives = 44/76 (57%), Gaps = 5/76 (6%)
 Frame = -1

Query: 645 QLYSSGVYN--EEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR- 475
           Q +SSGV+   + E  +TDL+H +  VGYGTDE G  YW  +   G  WGE GY+K+ R 
Sbjct: 279 QFFSSGVFGAMQNETCTTDLNHAMTAVGYGTDEHGTKYWLMKNSWGTDWGEGGYMKIARD 338

Query: 474 --NKNNRCGIASSASY 433
             +    CG+A   SY
Sbjct: 339 VASNTGLCGLAMQPSY 354


>UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3;
           Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays
           (Maize)
          Length = 493

 Score = 67.7 bits (158), Expect = 2e-10
 Identities = 38/76 (50%), Positives = 46/76 (60%), Gaps = 3/76 (3%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           +FQLYSSG++ +  C  T LDHGV VVGYG+ E G DYW  +   G  WGE GY++M RN
Sbjct: 305 AFQLYSSGIF-DGRCG-TYLDHGVTVVGYGS-EGGKDYWIVKNSWGTQWGEAGYVRMARN 361

Query: 471 KNNR---CGIASSASY 433
              R    GIA    Y
Sbjct: 362 VRVRPPSAGIAMEPLY 377


>UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2;
           Entamoeba|Rep: Cysteine proteinase ACP1 precursor -
           Entamoeba histolytica
          Length = 308

 Score = 67.7 bits (158), Expect = 2e-10
 Identities = 30/74 (40%), Positives = 43/74 (58%), Gaps = 1/74 (1%)
 Frame = -1

Query: 651 SFQLYSSG-VYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           SFQLY  G +Y++ +C S  ++H V  VGYG++  G  YW      G  WG+ GY  + R
Sbjct: 229 SFQLYKKGTIYSDTKCRSRMMNHCVTAVGYGSNSNG-KYWIIRNSWGTSWGDAGYFLLAR 287

Query: 474 NKNNRCGIASSASY 433
           + NN CGI   ++Y
Sbjct: 288 DSNNMCGIGRDSNY 301


>UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep:
           Cysteine protease - Saprolegnia parasitica
          Length = 523

 Score = 67.3 bits (157), Expect = 3e-10
 Identities = 36/75 (48%), Positives = 46/75 (61%), Gaps = 3/75 (4%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN- 472
           FQ Y SGV+ ++ C  T LDHGVLVVGYG +E G  YW  +   G  WG+ GYIK+ R  
Sbjct: 257 FQFYKSGVF-DKSCG-TKLDHGVLVVGYG-EEGGKKYWKVKNSWGADWGDKGYIKLAREF 313

Query: 471 --KNNRCGIASSASY 433
             +  +CG+A   SY
Sbjct: 314 GPETGQCGVAMVPSY 328


>UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease;
           n=23; Magnoliophyta|Rep: Senescence-specific cysteine
           protease - Arabidopsis thaliana (Mouse-ear cress)
          Length = 346

 Score = 66.9 bits (156), Expect = 4e-10
 Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 3/78 (3%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKM---I 478
           FQ YSSGV+  E C+ T LDH V  +GYG    G  YW  +   G  WGE GY+++   +
Sbjct: 271 FQFYSSGVFTGE-CT-TYLDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGYMRIQKDV 328

Query: 477 RNKNNRCGIASSASYXXV 424
           ++K   CG+A  ASY  +
Sbjct: 329 KDKQGLCGLAMKASYPTI 346


>UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_26_49243_47612 - Giardia lamblia
           ATCC 50803
          Length = 543

 Score = 66.9 bits (156), Expect = 4e-10
 Identities = 34/78 (43%), Positives = 44/78 (56%), Gaps = 2/78 (2%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSS--TDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           +F  YS GV+N+  C+S   DL H VL+VG+GTDE   DYW         WG  GY+  +
Sbjct: 466 TFSWYSGGVFNDPACASGVDDLAHAVLLVGWGTDEVAGDYWIVRNSWSNAWGIDGYM-YL 524

Query: 477 RNKNNRCGIASSASYXXV 424
             KNN CG+ + A Y  V
Sbjct: 525 SMKNNICGVLTCADYVMV 542


>UniRef50_Q22A69 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 330

 Score = 66.5 bits (155), Expect = 5e-10
 Identities = 30/73 (41%), Positives = 44/73 (60%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           + Q Y+ G+ N   C+   L+HGVL+VG G+ E G D+W  +   G  WGE GY +++R 
Sbjct: 256 NLQFYAGGISNPLICNPNGLNHGVLIVGLGS-ENGKDFWKVKNSWGASWGEKGYFRIVRG 314

Query: 471 KNNRCGIASSASY 433
           K  +CGI  + SY
Sbjct: 315 K-GKCGINRAVSY 326


>UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep:
           Dvir_CG5367 - Drosophila virilis (Fruit fly)
          Length = 298

 Score = 66.1 bits (154), Expect = 7e-10
 Identities = 30/76 (39%), Positives = 49/76 (64%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           +FQLYS G+Y++  C+ST ++H +L++G+       ++W  +   G LWGE G+++M R 
Sbjct: 229 TFQLYSEGIYDDVSCTSTSVNHAMLLIGFDK-----NFWILKNWWGELWGEAGFMRM-RK 282

Query: 471 KNNRCGIASSASYXXV 424
             N CGIA+ A+Y  V
Sbjct: 283 GINLCGIANYAAYAIV 298


>UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba
           histolytica|Rep: Cysteine protease 19 - Entamoeba
           histolytica
          Length = 324

 Score = 65.7 bits (153), Expect = 9e-10
 Identities = 29/74 (39%), Positives = 43/74 (58%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           +SF LY  G+YN+++C S      V++VGYG D+    Y+      GP WGE GY + I 
Sbjct: 244 SSFLLYHGGIYNDKKCRSDKSTIAVVIVGYGIDKNNGKYFIVRNSWGPYWGEQGYFR-IS 302

Query: 474 NKNNRCGIASSASY 433
           + NN CG+++   Y
Sbjct: 303 SDNNLCGLSNDIYY 316


>UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor;
           n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 355

 Score = 65.7 bits (153), Expect = 9e-10
 Identities = 37/75 (49%), Positives = 44/75 (58%), Gaps = 3/75 (4%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN- 472
           FQ Y  GV+N + C  TDLDHGV  VGYG+ + G DY   +   GP WGE G+I+M RN 
Sbjct: 279 FQFYKGGVFNGK-CG-TDLDHGVAAVGYGSSK-GSDYVIVKNSWGPRWGEKGFIRMKRNT 335

Query: 471 --KNNRCGIASSASY 433
                 CGI   ASY
Sbjct: 336 GKPEGLCGINKMASY 350


>UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16)
           [Contains: Cathepsin H mini chain; Cathepsin H heavy
           chain; Cathepsin H light chain]; n=37; Eukaryota|Rep:
           Cathepsin H precursor (EC 3.4.22.16) [Contains:
           Cathepsin H mini chain; Cathepsin H heavy chain;
           Cathepsin H light chain] - Homo sapiens (Human)
          Length = 335

 Score = 65.7 bits (153), Expect = 9e-10
 Identities = 31/74 (41%), Positives = 44/74 (59%), Gaps = 2/74 (2%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTD--LDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           F +Y +G+Y+   C  T   ++H VL VGYG ++ G+ YW  +   GP WG  GY  + R
Sbjct: 259 FMMYRTGIYSSTSCHKTPDKVNHAVLAVGYG-EKNGIPYWIVKNSWGPQWGMNGYFLIER 317

Query: 474 NKNNRCGIASSASY 433
            K N CG+A+ ASY
Sbjct: 318 GK-NMCGLAACASY 330


>UniRef50_A2DKX2 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 317

 Score = 65.3 bits (152), Expect = 1e-09
 Identities = 27/66 (40%), Positives = 40/66 (60%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           F+ Y  GVY  ++CS+  +DH + +VGYGT   G DYW  +   G  WG+ GY  + RN+
Sbjct: 244 FEYYYQGVYYSDDCSAWGIDHWMTIVGYGT-YNGDDYWLVKNSFGKGWGQQGYGMVARNR 302

Query: 468 NNRCGI 451
           +  CG+
Sbjct: 303 DGACGV 308


>UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;
           n=17; Magnoliophyta|Rep: Thiol protease aleurain-like
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 358

 Score = 65.3 bits (152), Expect = 1e-09
 Identities = 32/77 (41%), Positives = 44/77 (57%), Gaps = 2/77 (2%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           F+ Y  GV+    C +T  D++H VL VGYG ++  V YW  +   G  WG+ GY KM  
Sbjct: 283 FRFYKKGVFTSNTCGNTPMDVNHAVLAVGYGVEDD-VPYWLIKNSWGGEWGDNGYFKMEM 341

Query: 474 NKNNRCGIASSASYXXV 424
            K N CG+A+ +SY  V
Sbjct: 342 GK-NMCGVATCSSYPVV 357


>UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           L-like proteinase precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 317

 Score = 64.9 bits (151), Expect = 2e-09
 Identities = 30/73 (41%), Positives = 44/73 (60%), Gaps = 1/73 (1%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTD-LDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           +QLYS G+   + C   + ++H VL VGYG+ E G D+W  +      WGE GY++++R 
Sbjct: 244 WQLYSGGILESQSCPGGESINHAVLAVGYGS-ENGKDFWLIKNSWNTYWGEEGYLRIVRG 302

Query: 471 KNNRCGIASSASY 433
           K N+CGI   A Y
Sbjct: 303 K-NQCGINEVADY 314


>UniRef50_P25773 Cluster: Cathepsin L; n=38; Eukaryota|Rep:
           Cathepsin L - Felis silvestris catus (Cat)
          Length = 139

 Score = 64.9 bits (151), Expect = 2e-09
 Identities = 29/64 (45%), Positives = 40/64 (62%), Gaps = 3/64 (4%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGY---GTDEQGVDYWXREELVGPLWGELGYIKM 481
           +F+ Y  G+Y +  CSS D+DHGVLVVGY   GT+ +   YW  +   G  WG  GYIKM
Sbjct: 76  TFRFYKEGIYYDPSCSSEDVDHGVLVVGYGADGTETENKKYWIIKNSWGTDWGMDGYIKM 135

Query: 480 IRNK 469
            +++
Sbjct: 136 AKDR 139


>UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus
           tauri|Rep: Cysteine protease-1 - Ostreococcus tauri
          Length = 430

 Score = 64.5 bits (150), Expect = 2e-09
 Identities = 36/86 (41%), Positives = 50/86 (58%), Gaps = 13/86 (15%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDE----------QGVDYWXREELVGPLWG 502
           SFQLY  GVY+ +EC S  +DHGVLVVGYG D+          +   +W  +   G  WG
Sbjct: 341 SFQLYDGGVYDSKECGS-QVDHGVLVVGYGFDDTHHNATKHHKRHRHFWKVKNSWGGTWG 399

Query: 501 ELGYIKMIR---NKNNRCGIASSASY 433
           E G+I+M R   ++  +CGI ++ SY
Sbjct: 400 EGGFIRMARRISDETGQCGITTAPSY 425


>UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis
           thaliana|Rep: Cysteine proteinase - Arabidopsis thaliana
           (Mouse-ear cress)
          Length = 348

 Score = 64.1 bits (149), Expect = 3e-09
 Identities = 33/76 (43%), Positives = 44/76 (57%), Gaps = 3/76 (3%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           +F+ YS GV+N E C  TDL H V +VGYG  E+G  YW  +   G  WGE GY+++ R+
Sbjct: 272 AFRHYSGGVFNGE-CG-TDLHHAVTIVGYGMSEEGTKYWVVKNSWGETWGENGYMRIKRD 329

Query: 471 ---KNNRCGIASSASY 433
                  CG+A  A Y
Sbjct: 330 VDAPQGMCGLAILAFY 345


>UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis
           thaliana|Rep: F9P14.12 protein - Arabidopsis thaliana
           (Mouse-ear cress)
          Length = 343

 Score = 64.1 bits (149), Expect = 3e-09
 Identities = 37/75 (49%), Positives = 44/75 (58%), Gaps = 3/75 (4%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKM---I 478
           FQLYSSGV+    C  T+L+HGV VVGYG  E    YW  +   G  WGE GYI+M   +
Sbjct: 269 FQLYSSGVFTNY-CG-TNLNHGVTVVGYGV-EGDQKYWIVKNSWGTGWGEEGYIRMERGV 325

Query: 477 RNKNNRCGIASSASY 433
                +CGIA  ASY
Sbjct: 326 SEDTGKCGIAMMASY 340


>UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia
           theta|Rep: Cathepsin H precursor - Guillardia theta
           (Cryptomonas phi)
          Length = 353

 Score = 64.1 bits (149), Expect = 3e-09
 Identities = 33/71 (46%), Positives = 44/71 (61%), Gaps = 2/71 (2%)
 Frame = -1

Query: 639 YSSGVYNEEECSSTD--LDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466
           YSSGVY+   C  T   ++H VL VGYGT E G+ YW  +   G  WG+ GY K I+  +
Sbjct: 279 YSSGVYSSPTCVGTPDKVNHAVLAVGYGT-EGGIPYWTIKNSWGFAWGDNGYFK-IQRGS 336

Query: 465 NRCGIASSASY 433
           N+CGI+  AS+
Sbjct: 337 NKCGISVCASF 347


>UniRef50_UPI000155637A Cluster: PREDICTED: similar to
           ENSANGP00000013730, partial; n=1; Ornithorhynchus
           anatinus|Rep: PREDICTED: similar to ENSANGP00000013730,
           partial - Ornithorhynchus anatinus
          Length = 229

 Score = 63.7 bits (148), Expect = 4e-09
 Identities = 30/75 (40%), Positives = 46/75 (61%), Gaps = 2/75 (2%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           SF  Y++G+Y E +C      L+H VL+VGYG   QG  +W  +    PLWG  GY+ ++
Sbjct: 153 SFAFYANGIYYEPQCRHKLEQLNHAVLLVGYGV-LQGQAFWLLKNSWSPLWGNSGYM-LL 210

Query: 477 RNKNNRCGIASSASY 433
             K+N CG+ ++A+Y
Sbjct: 211 AMKDNDCGVTTAATY 225


>UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep:
           Toxopain-2 - Toxoplasma gondii
          Length = 422

 Score = 63.7 bits (148), Expect = 4e-09
 Identities = 34/75 (45%), Positives = 44/75 (58%), Gaps = 3/75 (4%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTD-EQGVDYWXREELVGPLWGELGYIKMIRN 472
           FQ Y  GV+ +  C  TDLDHGVL+VGYGTD E   D+W  +   G  WG  GY+ M  +
Sbjct: 347 FQFYHEGVF-DASCG-TDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMH 404

Query: 471 K--NNRCGIASSASY 433
           K    +CG+   AS+
Sbjct: 405 KGEEGQCGLLLDASF 419


>UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 360

 Score = 62.9 bits (146), Expect = 7e-09
 Identities = 33/69 (47%), Positives = 43/69 (62%), Gaps = 2/69 (2%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTD-LDHGVLVVGYGTDEQ-GVDYWXREELVGPLWGELGYIKM 481
           TSF+ Y SGV  E E    D  DH +L+VGYG DE+  VDYW  +   G  WGE GY+++
Sbjct: 269 TSFKYYKSGVITECEDGPYDGPDHCLLLVGYGHDEELKVDYWLIKNQWGTTWGEEGYVRI 328

Query: 480 IRNKNNRCG 454
           IR+ N+  G
Sbjct: 329 IRDDNDHKG 337


>UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core
           eudicotyledons|Rep: Cysteine proteinase -
           Mesembryanthemum crystallinum (Common ice plant)
          Length = 367

 Score = 62.9 bits (146), Expect = 7e-09
 Identities = 32/74 (43%), Positives = 41/74 (55%), Gaps = 2/74 (2%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           +  Y  GV+    C  T L+HGV  VGYGT   G DYW  +   G  WGE GY++M+R  
Sbjct: 269 WMFYFQGVFTGP-CG-TKLNHGVTAVGYGTTNDGYDYWIIKNSWGETWGERGYMRMLRGV 326

Query: 468 N--NRCGIASSASY 433
           +    CGIA  AS+
Sbjct: 327 SPYGLCGIAMQASF 340


>UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|Rep:
           Cysteine protease - Giardia muris
          Length = 301

 Score = 62.9 bits (146), Expect = 7e-09
 Identities = 32/73 (43%), Positives = 40/73 (54%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           ++ F  YSSGVY        +  H V +VGYG DE G+ YW      GP WGE GY ++I
Sbjct: 224 YSDFGYYSSGVYQHVN-GMMEGGHAVEMVGYGIDESGLKYWIIRNSWGPDWGEGGYFRII 282

Query: 477 RNKNNRCGIASSA 439
           R + N CGI   A
Sbjct: 283 R-RVNECGIEEQA 294


>UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Rep:
           Cysteine protease - Solanum lycopersicum (Tomato)
           (Lycopersicon esculentum)
          Length = 345

 Score = 62.5 bits (145), Expect = 9e-09
 Identities = 30/77 (38%), Positives = 44/77 (57%), Gaps = 3/77 (3%)
 Frame = -1

Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK- 469
           Q Y+ G Y +  C+   ++H V  +GYGTDE+G  YW  +   G  WGE GY+K+IR+  
Sbjct: 270 QFYAGGTY-DGNCADR-INHAVTAIGYGTDEEGQKYWLLKNSWGTSWGENGYMKIIRDSG 327

Query: 468 --NNRCGIASSASYXXV 424
             +  C IA  +SY  +
Sbjct: 328 DPSGLCDIAKMSSYPNI 344


>UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1;
           Oryza sativa (japonica cultivar-group)|Rep: Putative
           uncharacterized protein - Oryza sativa subsp. japonica
           (Rice)
          Length = 326

 Score = 62.1 bits (144), Expect = 1e-08
 Identities = 33/72 (45%), Positives = 40/72 (55%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           F +Y  GV++   C  T+L+H VLVVGY   E G  YW  +   G  WGE GYI+MIRN 
Sbjct: 238 FMIYQGGVFSGP-CG-TELNHAVLVVGYDETEDGTPYWIVKNSWGAGWGESGYIRMIRNI 295

Query: 468 NNRCGIASSASY 433
               GI   A Y
Sbjct: 296 PAPEGICGIAMY 307


>UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-PA
           - Drosophila melanogaster (Fruit fly)
          Length = 549

 Score = 62.1 bits (144), Expect = 1e-08
 Identities = 33/75 (44%), Positives = 40/75 (53%), Gaps = 2/75 (2%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           +F  YS GVY E  C +    LDH VL VGYG+   G DYW  +      WG  GYI M 
Sbjct: 474 TFSFYSHGVYYEPTCKNDVDGLDHAVLAVGYGSIN-GEDYWLVKNSWSTYWGNDGYILMS 532

Query: 477 RNKNNRCGIASSASY 433
             KNN CG+ +  +Y
Sbjct: 533 AKKNN-CGVMTMPTY 546


>UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep:
           MGC107932 protein - Xenopus tropicalis (Western clawed
           frog) (Silurana tropicalis)
          Length = 333

 Score = 61.7 bits (143), Expect = 2e-08
 Identities = 33/79 (41%), Positives = 48/79 (60%), Gaps = 6/79 (7%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTD------EQGVDYWXREELVGPLWGELG 493
           + FQLYS G++ E +C+ +  +H V++VGYGT+      E+  DYW  +   G  WGE G
Sbjct: 252 SDFQLYSEGIF-EGDCAESP-NHAVIIVGYGTEHANDKEEEDKDYWIIKNSWGKEWGEDG 309

Query: 492 YIKMIRNKNNRCGIASSAS 436
           Y+KM RN  N+C I   A+
Sbjct: 310 YVKMKRN-INQCSITEMAA 327


>UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep:
           Cathepsin L - Kudoa thyrsites
          Length = 300

 Score = 61.7 bits (143), Expect = 2e-08
 Identities = 29/63 (46%), Positives = 38/63 (60%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           SFQ Y  G+Y++   SS  LDH VL+VGYG  +   +YW  +   GP WGE GYI + R+
Sbjct: 239 SFQFYGGGIYSDPWASSYPLDHAVLLVGYGY-KNTENYWHVKNSWGPWWGEQGYINIKRD 297

Query: 471 KNN 463
             N
Sbjct: 298 GKN 300


>UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960
           precursor; n=2; Arabidopsis thaliana|Rep: Probable
           cysteine proteinase At3g43960 precursor - Arabidopsis
           thaliana (Mouse-ear cress)
          Length = 376

 Score = 61.7 bits (143), Expect = 2e-08
 Identities = 31/72 (43%), Positives = 40/72 (55%), Gaps = 3/72 (4%)
 Frame = -1

Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN---K 469
           Y SGVY +  CS+   DH VL+VGYGT     DYW      GP WGE GY+++ RN    
Sbjct: 274 YKSGVY-KGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEP 332

Query: 468 NNRCGIASSASY 433
             +C +A +  Y
Sbjct: 333 TGKCAVAVAPVY 344


>UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1;
           Naegleria fowleri|Rep: Cysteine proteinase homolog -
           Naegleria fowleri
          Length = 347

 Score = 61.3 bits (142), Expect = 2e-08
 Identities = 31/74 (41%), Positives = 44/74 (59%), Gaps = 4/74 (5%)
 Frame = -1

Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGV----DYWXREELVGPLWGELGYIKMI 478
           Q Y+SG+ +   C+  DLDHGVL+VGYG  +  +    +YW  +   G  WGE GY ++I
Sbjct: 271 QYYTSGISDPWFCNPQDLDHGVLIVGYGVGKSWLGSEENYWIVKNSWGSDWGEDGYFRII 330

Query: 477 RNKNNRCGIASSAS 436
           R K  +CG+ S  S
Sbjct: 331 RGK-GKCGLNSVPS 343


>UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2;
           Arabidopsis thaliana|Rep: Putative cysteine proteinase -
           Arabidopsis thaliana (Mouse-ear cress)
          Length = 365

 Score = 60.9 bits (141), Expect = 3e-08
 Identities = 32/76 (42%), Positives = 44/76 (57%), Gaps = 3/76 (3%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           SF  Y  GVY   +C  TD++H V +VGYGT   G++YW  +   G  WGE GY+++ R+
Sbjct: 289 SFGHYKGGVYAGLDCG-TDVNHAVTIVGYGT-MSGLNYWVLKNSWGESWGENGYMRIRRD 346

Query: 471 ---KNNRCGIASSASY 433
                  CGIA  A+Y
Sbjct: 347 VEWPQGMCGIAQVAAY 362


>UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1;
           Bigelowiella natans|Rep: Digestive cysteine proteinase -
           Bigelowiella natans (Pedinomonas minutissima)
           (Chlorarachnion sp.(strain CCMP 621))
          Length = 360

 Score = 60.9 bits (141), Expect = 3e-08
 Identities = 30/65 (46%), Positives = 37/65 (56%)
 Frame = -1

Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466
           Q Y  GV N   CS T L+H VL+VG+G D  G  +W  +   G  WGE GY ++IR K 
Sbjct: 288 QFYKHGVANPRFCSKTSLNHAVLLVGFGVD-GGKAFWIVKNSWGEKWGENGYFRLIRGK- 345

Query: 465 NRCGI 451
             CGI
Sbjct: 346 GACGI 350


>UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba
           histolytica|Rep: Cysteine protease 17 - Entamoeba
           histolytica
          Length = 420

 Score = 60.9 bits (141), Expect = 3e-08
 Identities = 28/73 (38%), Positives = 43/73 (58%), Gaps = 1/73 (1%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGP-LWGELGYIKMIRN 472
           F+ Y  G+Y  EEC+   L H + +VGYGT ++G  Y+      G   WGE GY+++ R 
Sbjct: 316 FKHYRGGIYYNEECTRRGLSHAMNLVGYGTTKEGQKYYIIRNSWGDWKWGEDGYMRLYRG 375

Query: 471 KNNRCGIASSASY 433
             N CG+A++A +
Sbjct: 376 -GNHCGVATNAFF 387


>UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain;
           n=9; Cucujiformia|Rep: Digestive cysteine proteinase
           intestain - Leptinotarsa decemlineata (Colorado potato
           beetle)
          Length = 326

 Score = 60.5 bits (140), Expect = 3e-08
 Identities = 31/74 (41%), Positives = 43/74 (58%), Gaps = 3/74 (4%)
 Frame = -1

Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQ---GVDYWXREELVGPLWGELGYIKMIR 475
           QLY  G+ +   C+  +L+HGVL VGYG ++       +W  +   G  WGE GY ++ R
Sbjct: 251 QLYFGGILDGLFCTH-NLNHGVLAVGYGEEDHLFGKKKFWKVKNSWGKDWGEQGYFRIKR 309

Query: 474 NKNNRCGIASSASY 433
           + NN CGIA  ASY
Sbjct: 310 DANNLCGIADKASY 323


>UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2;
           Endopterygota|Rep: Cathepsin L-like proteinase - Bombyx
           mori (Silk moth)
          Length = 402

 Score = 60.5 bits (140), Expect = 3e-08
 Identities = 34/73 (46%), Positives = 45/73 (61%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           +FQLYS GVY++  C S  L+H +L+VGY       DYW      G  WGE GY++ IR 
Sbjct: 334 TFQLYS-GVYDDPFCVSWHLNHAMLLVGYTQ-----DYWILLNWWGRNWGEDGYMR-IRR 386

Query: 471 KNNRCGIASSASY 433
             NRCG+A+ A+Y
Sbjct: 387 GLNRCGVANMATY 399


>UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2;
           Dictyostelium discoideum|Rep: Cysteine proteinase 2
           precursor - Dictyostelium discoideum (Slime mold)
          Length = 376

 Score = 60.5 bits (140), Expect = 3e-08
 Identities = 34/74 (45%), Positives = 47/74 (63%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           H SFQLY+SG+Y E +CS T+LDHGVLVVGYG   QG D    +E  GP+      I + 
Sbjct: 263 HNSFQLYTSGIYYEPKCSPTELDHGVLVVGYGV--QGKD----DE--GPVLNRKQTIVIH 314

Query: 477 RNKNNRCGIASSAS 436
           +N++N+   +  +S
Sbjct: 315 KNEDNKVESSDDSS 328



 Score = 41.1 bits (92), Expect = 0.023
 Identities = 17/37 (45%), Positives = 23/37 (62%)
 Frame = -1

Query: 543 DYWXREELVGPLWGELGYIKMIRNKNNRCGIASSASY 433
           +YW  +   G  WG  GYI M +++ N CGIAS +SY
Sbjct: 337 NYWIVKNSWGTSWGIKGYILMSKDRKNNCGIASVSSY 373


>UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryza
           sativa|Rep: Putative cysteine proteinase - Oryza sativa
           subsp. japonica (Rice)
          Length = 352

 Score = 60.1 bits (139), Expect = 5e-08
 Identities = 31/80 (38%), Positives = 45/80 (56%), Gaps = 5/80 (6%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGV---DYWXREELVGPLWGELGYIKMI 478
           F+ Y SGV+  + C  T LDH V VVGYG +  G     YW  +   G  WG+ GY+K+ 
Sbjct: 272 FRHYGSGVFTADSCG-TKLDHAVAVVGYGAEADGSGGGGYWIIKNSWGTTWGDGGYMKLE 330

Query: 477 RNKNNR--CGIASSASYXXV 424
           ++  ++  CG+A + SY  V
Sbjct: 331 KDVGSQGACGVAMAPSYPVV 350


>UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;
           Brugia malayi|Rep: Cahepsin L-like cysteine protease -
           Brugia malayi (Filarial nematode worm)
          Length = 371

 Score = 60.1 bits (139), Expect = 5e-08
 Identities = 30/82 (36%), Positives = 44/82 (53%), Gaps = 8/82 (9%)
 Frame = -1

Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDE--------QGVDYWXREELVGPLWGELGY 490
           + Y  G+++  +C+ T + H +L VGYGT+E        + VDYW  +      WG  GY
Sbjct: 286 KFYRRGIFSTSKCT-TRMGHALLAVGYGTEEVKLQNGTKKSVDYWLLKNSWSKRWGIGGY 344

Query: 489 IKMIRNKNNRCGIASSASYXXV 424
           +K+ RN+ N CGI   A Y  V
Sbjct: 345 LKLARNQENMCGIGFYACYPLV 366


>UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6;
           Magnoliophyta|Rep: Cysteine protease Cp5 - Actinidia
           deliciosa (Kiwi)
          Length = 509

 Score = 59.7 bits (138), Expect = 6e-08
 Identities = 34/77 (44%), Positives = 44/77 (57%), Gaps = 5/77 (6%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           FQLY+ G+Y + +CS    D+DH VLVVGYG  E G +YW  +   G  WG  GY  + R
Sbjct: 287 FQLYTGGIY-DGDCSDDPDDIDHAVLVVGYGA-ESGEEYWIIKNSWGTDWGMKGYAYIKR 344

Query: 474 NKNNR---CGIASSASY 433
           N +     C I + ASY
Sbjct: 345 NTSKDYGVCAINAMASY 361


>UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16;
           Bromeliaceae|Rep: Fruit bromelain precursor - Ananas
           comosus (Pineapple)
          Length = 351

 Score = 59.7 bits (138), Expect = 6e-08
 Identities = 26/71 (36%), Positives = 39/71 (54%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           +FQ Y+ GV++   C  T L+H + ++GYG D  G  YW      G  WGE GY++M R 
Sbjct: 260 NFQYYNGGVFSGP-CG-TSLNHAITIIGYGQDSSGTKYWIVRNSWGSSWGEGGYVRMARG 317

Query: 471 KNNRCGIASSA 439
            ++  G+   A
Sbjct: 318 VSSSSGVCGIA 328


>UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine
           protease; n=11; Callosobruchus maculatus|Rep: Putative
           gut cathepsin L-like cysteine protease - Callosobruchus
           maculatus (Southern cowpea weevil) (Pulse bruchid)
          Length = 326

 Score = 59.3 bits (137), Expect = 8e-08
 Identities = 32/77 (41%), Positives = 43/77 (55%), Gaps = 3/77 (3%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEE-ECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIK 484
           +    Y  G+ +E   CS+   DL+HGVLVVGYG+ E GVDYW  +   G  WGE GY +
Sbjct: 248 SQLSFYDKGIVDERCRCSNKREDLNHGVLVVGYGS-ENGVDYWIVKNSWGADWGEKGYFR 306

Query: 483 MIRNKNNRCGIASSASY 433
            ++     CGI    +Y
Sbjct: 307 -LKKDVKACGIGYYNTY 322


>UniRef50_Q5BTK3 Cluster: SJCHGC00358 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC00358 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 78

 Score = 59.3 bits (137), Expect = 8e-08
 Identities = 29/75 (38%), Positives = 42/75 (56%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           F  Y SGV    +C   +    VL+VGYG +++   YW  +  +G  +G+ GYIK+ RN 
Sbjct: 5   FLAYESGVLIPTDCQDKEAFESVLLVGYGIEDE-TPYWLIKFSLGTEFGDQGYIKLARNH 63

Query: 468 NNRCGIASSASYXXV 424
           +N C IAS A Y  +
Sbjct: 64  SNMCHIASYAYYPVI 78


>UniRef50_Q22AB1 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 344

 Score = 59.3 bits (137), Expect = 8e-08
 Identities = 29/76 (38%), Positives = 43/76 (56%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           + Q Y  G+ + + C    ++H VL+VGYG +E G+ YW  +   G  WG  G+ K+IR 
Sbjct: 269 TLQFYEGGIVDPKNCDDK-INHAVLIVGYGVEE-GIPYWLIKNQWGAEWGIKGFFKLIRG 326

Query: 471 KNNRCGIASSASYXXV 424
           K  +CGI + AS   V
Sbjct: 327 K-KQCGIHTYASIAYV 341


>UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163
           precursor; n=4; Schizophora|Rep: Putative cysteine
           proteinase CG12163 precursor - Drosophila melanogaster
           (Fruit fly)
          Length = 614

 Score = 59.3 bits (137), Expect = 8e-08
 Identities = 31/79 (39%), Positives = 45/79 (56%), Gaps = 7/79 (8%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEE--CSSTDLDHGVLVVGYGTDE-----QGVDYWXREELVGPLWGELG 493
           + Q Y  GV +  +  CS  +LDHGVLVVGYG  +     + + YW  +   GP WGE G
Sbjct: 532 AMQFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQG 591

Query: 492 YIKMIRNKNNRCGIASSAS 436
           Y ++ R  +N CG++  A+
Sbjct: 592 YYRVYRG-DNTCGVSEMAT 609


>UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryza
           sativa (japonica cultivar-group)|Rep: Putative cysteine
           proteinase - Oryza sativa subsp. japonica (Rice)
          Length = 385

 Score = 58.8 bits (136), Expect = 1e-07
 Identities = 32/74 (43%), Positives = 41/74 (55%), Gaps = 4/74 (5%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLD-HGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKM--- 481
           F+ Y  GV+     S+ ++D H VLVVGYG     + YW  +   G  WGE GYI+M   
Sbjct: 290 FRSYRGGVFRGPCGSNPNVDNHVVLVVGYGVTTDNIKYWIIKNSWGKTWGEYGYIRMERD 349

Query: 480 IRNKNNRCGIASSA 439
           I NKN  CGI + A
Sbjct: 350 ILNKNGICGITTWA 363


>UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8;
           Magnoliophyta|Rep: Os12g0273800 protein - Oryza sativa
           subsp. japonica (Rice)
          Length = 504

 Score = 58.8 bits (136), Expect = 1e-07
 Identities = 27/65 (41%), Positives = 37/65 (56%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           + FQ Y  GV    EC  T LDHGV V+GYG    G  YW  +   G  WGE GY++M +
Sbjct: 263 SKFQFYGGGVM-AGECG-TSLDHGVTVIGYGAASDGTKYWLVKNSWGTTWGEAGYLRMEK 320

Query: 474 NKNNR 460
           + +++
Sbjct: 321 DIDDK 325


>UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core
           eudicotyledons|Rep: Chymopapain precursor - Carica
           papaya (Papaya)
          Length = 352

 Score = 58.8 bits (136), Expect = 1e-07
 Identities = 33/75 (44%), Positives = 43/75 (57%), Gaps = 3/75 (4%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR-- 475
           FQLY SGV+ +  C  T LDH V  VGYGT + G +Y   +   GP WGE GY+++ R  
Sbjct: 275 FQLYKSGVF-DGPCG-TKLDHAVTAVGYGTSD-GKNYIIIKNSWGPNWGEKGYMRLKRQS 331

Query: 474 -NKNNRCGIASSASY 433
            N    CG+  S+ Y
Sbjct: 332 GNSQGTCGVYKSSYY 346


>UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3;
           Eukaryota|Rep: Cathepsin-like cysteine protease -
           Phytophthora infestans (Potato late blight fungus)
          Length = 635

 Score = 58.4 bits (135), Expect = 1e-07
 Identities = 23/62 (37%), Positives = 41/62 (66%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           F  YS G++ +++ ++TD+DH + +VG+G +E GV +W      G  WGE G+++++R  
Sbjct: 230 FLKYSGGIF-DDKTNATDVDHAISIVGWG-EENGVPFWVLRNSWGSFWGESGWMRLVRGV 287

Query: 468 NN 463
           NN
Sbjct: 288 NN 289



 Score = 44.4 bits (100), Expect = 0.002
 Identities = 20/65 (30%), Positives = 35/65 (53%), Gaps = 1/65 (1%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQ-GVDYWXREELVGPLWGELGYIKMI 478
           + F+ Y+ G+Y+E       ++H + V G+G DE+   +YW      G  WGE G+ ++ 
Sbjct: 526 SKFESYTGGIYSEHVMFPL-INHEISVAGWGYDEETDTEYWIGRNSWGTYWGENGWFRIQ 584

Query: 477 RNKNN 463
            + NN
Sbjct: 585 MHHNN 589


>UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1;
           Caenorhabditis elegans|Rep: Putative uncharacterized
           protein - Caenorhabditis elegans
          Length = 299

 Score = 58.4 bits (135), Expect = 1e-07
 Identities = 31/74 (41%), Positives = 42/74 (56%), Gaps = 2/74 (2%)
 Frame = -1

Query: 651 SFQLYSSGVYN--EEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           SF  Y +G+YN  +EEC + +    + +VGYG D     YW  +   G  WGE GY+K+ 
Sbjct: 220 SFFHYKTGIYNPTKEECGNANEARSLAIVGYGKDG-AEKYWIVKGSFGTSWGEHGYMKLA 278

Query: 477 RNKNNRCGIASSAS 436
           RN  N CG+A S S
Sbjct: 279 RNV-NACGMAESIS 291


>UniRef50_UPI00006CBB5F Cluster: Papain family cysteine protease
            containing protein; n=2; Tetrahymena thermophila
            SB210|Rep: Papain family cysteine protease containing
            protein - Tetrahymena thermophila SB210
          Length = 1367

 Score = 58.0 bits (134), Expect = 2e-07
 Identities = 27/75 (36%), Positives = 41/75 (54%)
 Frame = -1

Query: 648  FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
            F+ Y+ G+ N  + S   + H + +VG+G DE+   YW     +G  WGE G+I++IR K
Sbjct: 956  FRNYTGGILNPPD-SPVQITHSLSIVGWGEDEKQTKYWIARNSLGTFWGENGFIRIIRGK 1014

Query: 468  NNRCGIASSASYXXV 424
             N   I S  SY  +
Sbjct: 1015 -NALKIESDCSYGRI 1028



 Score = 44.4 bits (100), Expect = 0.002
 Identities = 21/59 (35%), Positives = 33/59 (55%)
 Frame = -1

Query: 639  YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNN 463
            Y+ G+Y+E+       +H V VVG+G   +G +YW      G  WGE G+ K+  +K+N
Sbjct: 1294 YTGGIYSEKVKLPIP-NHYVSVVGWGQTLEGEEYWIVRNSWGTYWGEEGFFKLKMHKDN 1351


>UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted,
           possible transmembrane domain near N-terminus; n=4;
           Cryptosporidium|Rep: Cryptopain-cysteine proteinase
           secreted, possible transmembrane domain near N-terminus
           - Cryptosporidium parvum Iowa II
          Length = 401

 Score = 58.0 bits (134), Expect = 2e-07
 Identities = 32/80 (40%), Positives = 41/80 (51%), Gaps = 3/80 (3%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQ-GVDYWXREELVGPLWGELGYIKMI 478
           T FQ Y SGV+ +  C  T ++HGV++VGY  DE    +YW      G  WGE GYIK+ 
Sbjct: 320 TPFQFYKSGVF-DAPCG-TKVNHGVVLVGYDMDEDTNKEYWLVRNSWGEAWGEKGYIKLA 377

Query: 477 --RNKNNRCGIASSASYXXV 424
               K   CGI     Y  +
Sbjct: 378 LHSGKKGTCGILVEPVYPVI 397


>UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia
           circumcincta|Rep: Secreted cathepsin F - Teladorsagia
           circumcincta
          Length = 364

 Score = 58.0 bits (134), Expect = 2e-07
 Identities = 25/60 (41%), Positives = 34/60 (56%)
 Frame = -1

Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466
           Q Y  GV     C  + + HG L+VGYG  E+ + YW  +   GP WGE GY +M+R +N
Sbjct: 292 QFYKGGVSRPTTCRLSSMIHGALLVGYGV-EKNIPYWIIKNSWGPNWGEDGYYRMVRGEN 350


>UniRef50_O16454 Cluster: Temporarily assigned gene name protein
           196; n=4; Bilateria|Rep: Temporarily assigned gene name
           protein 196 - Caenorhabditis elegans
          Length = 477

 Score = 58.0 bits (134), Expect = 2e-07
 Identities = 31/74 (41%), Positives = 41/74 (55%), Gaps = 2/74 (2%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEE--CSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           + Q Y  GV +  +  C    L+HGVL+VGYG D +   YW  +   GP WGE GY K+ 
Sbjct: 401 TLQFYRHGVVHPFKIFCEPFMLNHGVLIVGYGKDGRK-PYWIVKNSWGPNWGEAGYFKLY 459

Query: 477 RNKNNRCGIASSAS 436
           R K N CG+   A+
Sbjct: 460 RGK-NVCGVQEMAT 472


>UniRef50_UPI0000498E2F Cluster: cysteine proteinase; n=1; Entamoeba
           histolytica HM-1:IMSS|Rep: cysteine proteinase -
           Entamoeba histolytica HM-1:IMSS
          Length = 317

 Score = 57.6 bits (133), Expect = 2e-07
 Identities = 28/67 (41%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
 Frame = -1

Query: 630 GVY-NEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNRCG 454
           G++ N EECS +    G+L++GYG    G+ YW  +   G  WG  GY+ + RNK N CG
Sbjct: 245 GIFENIEECSQSSPRIGLLLIGYGKTINGIPYWILKNCWGSSWGSNGYLYLKRNK-NVCG 303

Query: 453 IASSASY 433
           I S  +Y
Sbjct: 304 IYSYGTY 310


>UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole
           genome shotgun sequence; n=1; Tetraodon
           nigroviridis|Rep: Chromosome undetermined SCAF6860,
           whole genome shotgun sequence - Tetraodon nigroviridis
           (Green puffer)
          Length = 251

 Score = 57.6 bits (133), Expect = 2e-07
 Identities = 39/103 (37%), Positives = 50/103 (48%), Gaps = 28/103 (27%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPL---------- 508
           H+SF  YSSG+Y E  C+  +L H VL+VGYG+ E G DYW  +   G            
Sbjct: 148 HSSFLFYSSGIYEESNCNPNNLSHAVLLVGYGS-EGGQDYWLIKNRWGTTRQTAPAVAND 206

Query: 507 --------------WG----ELGYIKMIRNKNNRCGIASSASY 433
                         WG    E GY+++IR+  N CGIAS A Y
Sbjct: 207 HFLIKTLCLFCFFSWGSSWGEGGYMRLIRDGKNSCGIASYALY 249


>UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10;
           Liliopsida|Rep: Putative cysteine proteinase - Oryza
           sativa subsp. japonica (Rice)
          Length = 416

 Score = 57.2 bits (132), Expect = 3e-07
 Identities = 29/69 (42%), Positives = 37/69 (53%), Gaps = 3/69 (4%)
 Frame = -1

Query: 630 GVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN---KNNR 460
           GVYN   C  T ++H V  VGYG  +  ++YW      GP WGE GYI+M R+   K   
Sbjct: 332 GVYNGP-CG-TSVNHAVTTVGYGVTQDNINYWIARNSWGPRWGESGYIRMKRDIAAKEGL 389

Query: 459 CGIASSASY 433
           CGI+    Y
Sbjct: 390 CGISMYGVY 398



 Score = 50.0 bits (114), Expect = 5e-05
 Identities = 29/72 (40%), Positives = 37/72 (51%), Gaps = 5/72 (6%)
 Frame = -1

Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYG--TDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           Q Y  GV+    C +  L+HGV+VVGYG  T      YW  +   G  WGE GYI+M R+
Sbjct: 259 QHYKKGVFTGR-CKTAPLNHGVVVVGYGVNTTPDKTKYWIVKNSWGKGWGEGGYIRMKRD 317

Query: 471 ---KNNRCGIAS 445
                  CGI +
Sbjct: 318 VGTPGGLCGITT 329


>UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gambiae
           str. PEST|Rep: ENSANGP00000018713 - Anopheles gambiae
           str. PEST
          Length = 559

 Score = 56.8 bits (131), Expect = 4e-07
 Identities = 29/79 (36%), Positives = 43/79 (54%), Gaps = 7/79 (8%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEE--CSSTDLDHGVLVVGYGTDE-----QGVDYWXREELVGPLWGELG 493
           + Q Y  G+ +     C+   +DHGVL+VGYG  E     + + YW  +   GP WGE G
Sbjct: 477 AMQFYRGGISHPWHPLCNHKSIDHGVLIVGYGIKEYPMFNKTLPYWIIKNSWGPRWGEQG 536

Query: 492 YIKMIRNKNNRCGIASSAS 436
           Y ++ R  +N CG++  AS
Sbjct: 537 YYRIYRG-DNSCGVSEMAS 554


>UniRef50_Q22RC9 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 389

 Score = 56.8 bits (131), Expect = 4e-07
 Identities = 27/65 (41%), Positives = 36/65 (55%)
 Frame = -1

Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466
           Q Y  G+   + CS T L+H VL+ GYG D  GV++W  +   G  WGE GY ++ R   
Sbjct: 299 QFYKKGISAPKFCSKTTLNHAVLLTGYGID-NGVEFWNVKNSWGAKWGEQGYFRLKRGV- 356

Query: 465 NRCGI 451
             CGI
Sbjct: 357 GMCGI 361


>UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10;
           Dictyostelium discoideum|Rep: Cysteine proteinase 7
           precursor - Dictyostelium discoideum (Slime mold)
          Length = 460

 Score = 56.8 bits (131), Expect = 4e-07
 Identities = 24/31 (77%), Positives = 26/31 (83%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGT 559
           SFQLY SG+YNE  CSST LDHGVL VG+GT
Sbjct: 254 SFQLYVSGIYNEPACSSTQLDHGVLAVGFGT 284



 Score = 42.3 bits (95), Expect = 0.010
 Identities = 18/36 (50%), Positives = 22/36 (61%)
 Frame = -1

Query: 543 DYWXREELVGPLWGELGYIKMIRNKNNRCGIASSAS 436
           DYW  +   G  WG  GYI M +  NN+CGIA+ AS
Sbjct: 417 DYWIVKNSWGTSWGMDGYILMTKGNNNQCGIATMAS 452


>UniRef50_Q650X9 Cluster: Putative cysteine proteinase; n=1; Oryza
           sativa (japonica cultivar-group)|Rep: Putative cysteine
           proteinase - Oryza sativa subsp. japonica (Rice)
          Length = 361

 Score = 56.4 bits (130), Expect = 6e-07
 Identities = 32/73 (43%), Positives = 41/73 (56%), Gaps = 3/73 (4%)
 Frame = -1

Query: 642 LYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN--- 472
           +   GV+ +  CSST ++H VLVVGYG      DYW  +   G  WGE GYI++ RN   
Sbjct: 274 ILKGGVF-DGYCSSTKVNHNVLVVGYGE-----DYWIIKNSWGIYWGENGYIRLKRNVPA 327

Query: 471 KNNRCGIASSASY 433
           K  +CGI   A Y
Sbjct: 328 KQGKCGITLQAWY 340


>UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza
           sativa|Rep: Os09g0381400 protein - Oryza sativa subsp.
           japonica (Rice)
          Length = 362

 Score = 56.0 bits (129), Expect = 7e-07
 Identities = 31/77 (40%), Positives = 39/77 (50%), Gaps = 3/77 (3%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTD-EQGVDYWXREELVGPLWGELGYIKMI 478
           +  Q Y  GVY    C  T L H V VVGYGTD   G  YW  +   G  WGE GYI+++
Sbjct: 283 SGMQFYKGGVYTGP-CG-TRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRIL 340

Query: 477 RNKN--NRCGIASSASY 433
           R+      CG+    +Y
Sbjct: 341 RDVGGPGLCGVTLDIAY 357


>UniRef50_Q9VNK7 Cluster: CG1075-PA; n=1; Drosophila
           melanogaster|Rep: CG1075-PA - Drosophila melanogaster
           (Fruit fly)
          Length = 274

 Score = 56.0 bits (129), Expect = 7e-07
 Identities = 26/64 (40%), Positives = 33/64 (51%), Gaps = 2/64 (3%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIK 484
           H  F  Y  G+     C +T  DL H VL+VG+ T  +  DYW  +   G  WGE GY K
Sbjct: 187 HEEFDQYFGGILRTPSCRNTNYDLKHSVLLVGFETHPKWGDYWIIKNSYGTEWGESGYFK 246

Query: 483 MIRN 472
           + RN
Sbjct: 247 LARN 250


>UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Rep:
           Cathepsin L - Stylonychia lemnae
          Length = 340

 Score = 56.0 bits (129), Expect = 7e-07
 Identities = 30/68 (44%), Positives = 36/68 (52%), Gaps = 2/68 (2%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN- 472
           FQ Y SG+++   C  T+LDHGV  VGYG D  G  Y+         WG  GYI +I N 
Sbjct: 266 FQFYRSGIFDSSWCG-TNLDHGVAAVGYGVD-NGKQYYIVRNSWSDSWGLKGYINIIANG 323

Query: 471 -KNNRCGI 451
             N  CGI
Sbjct: 324 DGNGMCGI 331


>UniRef50_Q23RT7 Cluster: Papain family cysteine protease containing
           protein; n=7; Hymenostomatida|Rep: Papain family
           cysteine protease containing protein - Tetrahymena
           thermophila SB210
          Length = 387

 Score = 56.0 bits (129), Expect = 7e-07
 Identities = 26/67 (38%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
 Frame = -1

Query: 654 TSFQLYSSGVYNE-EECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           ++F  Y SGV++  +   + D++H V++VGYGTDE+  DYW      G  +GE GYI++ 
Sbjct: 280 SNFHDYESGVFHGCDGADNVDINHAVVLVGYGTDEKEGDYWIVRNSWGTRFGENGYIRVK 339

Query: 477 RNKNNRC 457
           R     C
Sbjct: 340 REATPTC 346


>UniRef50_Q23H32 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 365

 Score = 56.0 bits (129), Expect = 7e-07
 Identities = 29/75 (38%), Positives = 42/75 (56%), Gaps = 6/75 (8%)
 Frame = -1

Query: 651 SFQLYSSGVYNEE--ECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           +F+ Y  G+ +E+  EC     DH + +VGYG+ E G  YW  +   G  WGE GYI+++
Sbjct: 276 NFKFYKGGIADEKLLECDPQYTDHCLGIVGYGS-ENGKQYWILKNSWGENWGEKGYIRLL 334

Query: 477 R----NKNNRCGIAS 445
           R    N    CGIA+
Sbjct: 335 RSDSSNTQGTCGIAT 349


>UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11;
           Entamoeba|Rep: Cysteine proteinase 2 precursor -
           Entamoeba histolytica
          Length = 315

 Score = 56.0 bits (129), Expect = 7e-07
 Identities = 29/74 (39%), Positives = 40/74 (54%), Gaps = 2/74 (2%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           FQLY SG Y + +C +    L+H V  VGYG  + G + W      G  WG+ GYI M+ 
Sbjct: 237 FQLYKSGAYTDTKCKNNYFALNHEVCAVGYGVVD-GKECWIVRNSWGTGWGDKGYINMV- 294

Query: 474 NKNNRCGIASSASY 433
            + N CG+A+   Y
Sbjct: 295 IEGNTCGVATDPLY 308


>UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_217_11853_10927 - Giardia lamblia
           ATCC 50803
          Length = 308

 Score = 55.6 bits (128), Expect = 1e-06
 Identities = 29/81 (35%), Positives = 44/81 (54%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           +  F  Y  G+Y+    +       V +VGYGT ++G DYW  +   GP WGE GY +++
Sbjct: 223 YEDFTYYLEGIYSYTYGNRVGF-LSVEIVGYGTSDEGQDYWIVKNYWGPGWGEDGYFRIV 281

Query: 477 RNKNNRCGIASSASYXXV*TP 415
           R + N C I +SA Y  + +P
Sbjct: 282 RGQ-NECQIENSA-YGAIISP 300


>UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:
           Aca s 1 allergen - Acarus siro (Dust mite)
          Length = 331

 Score = 55.6 bits (128), Expect = 1e-06
 Identities = 26/65 (40%), Positives = 37/65 (56%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           H  F+ Y SGV       +T+++H + +VG+G  E G+DYW      G  WGE GY K+ 
Sbjct: 255 HNGFKHYKSGVIRLTRGGTTEVNHVINIVGWGR-ENGLDYWLIRNSWGTHWGEAGYGKVE 313

Query: 477 RNKNN 463
           R+ NN
Sbjct: 314 RHHNN 318


>UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 514

 Score = 55.6 bits (128), Expect = 1e-06
 Identities = 31/68 (45%), Positives = 37/68 (54%), Gaps = 1/68 (1%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECS-STDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           S + YS G+Y++ EC   T   H VLVVGYG  E G  YW  +      WG  GYIK I 
Sbjct: 445 SLKFYSWGLYDDPECGRDTAAVHSVLVVGYGV-EDGEPYWLVKNSWSTTWGMDGYIK-IA 502

Query: 474 NKNNRCGI 451
            K N CG+
Sbjct: 503 WKRNTCGV 510


>UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia
           medanensis|Rep: Sui m 1 allergen - Suidasia medanensis
          Length = 336

 Score = 55.6 bits (128), Expect = 1e-06
 Identities = 25/61 (40%), Positives = 36/61 (59%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           F+ Y +GV      +S  ++H V +VG+GT E G DYW  +   GP WGE GY ++ R+ 
Sbjct: 264 FRFYRNGVIQNLRPNSRQINHAVTLVGWGT-EDGQDYWIVKNSWGPSWGESGYFRLGRHH 322

Query: 468 N 466
           N
Sbjct: 323 N 323


>UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 328

 Score = 55.2 bits (127), Expect = 1e-06
 Identities = 31/74 (41%), Positives = 40/74 (54%), Gaps = 2/74 (2%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEEC-SSTDLD-HGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           +F+ Y+SGV   E+C   T  + H V +VGYGT + GV YW         WG  GY+K I
Sbjct: 250 NFEWYTSGVLQSEDCYQMTPAEWHSVAIVGYGTSDDGVPYWLVRNSWNSDWGLHGYVK-I 308

Query: 477 RNKNNRCGIASSAS 436
           R   N C I S A+
Sbjct: 309 RRGVNWCLIESHAA 322


>UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1;
           Brugia malayi|Rep: Cathepsin F-like cysteine proteinase
           - Brugia malayi (Filarial nematode worm)
          Length = 461

 Score = 55.2 bits (127), Expect = 1e-06
 Identities = 26/70 (37%), Positives = 40/70 (57%), Gaps = 2/70 (2%)
 Frame = -1

Query: 639 YSSGVYN--EEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466
           Y SG+ +  +  C  + ++HGVL+ GYG  E  + YW  +   G  WGE GY +++R K 
Sbjct: 389 YKSGILHPSKSRCPPSKINHGVLITGYGI-ENNLPYWTIKNSWGEQWGENGYFQLMRGK- 446

Query: 465 NRCGIASSAS 436
           N CG++   S
Sbjct: 447 NICGVSDLVS 456


>UniRef50_Q24E33 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 328

 Score = 55.2 bits (127), Expect = 1e-06
 Identities = 29/74 (39%), Positives = 42/74 (56%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           T+F+ Y+SGV+  + C    L+HGVL  GY  D     YW  +   G  WG+ GYI +  
Sbjct: 262 TNFKFYTSGVF--DNCKKK-LNHGVLATGYTAD-----YWIIKNSWGTAWGQNGYINL-- 311

Query: 474 NKNNRCGIASSASY 433
            + N CG+ ++ASY
Sbjct: 312 KRGNTCGVCNTASY 325


>UniRef50_Q239L8 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 323

 Score = 55.2 bits (127), Expect = 1e-06
 Identities = 28/73 (38%), Positives = 43/73 (58%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           +FQ Y   ++++  C  T+LDHGVL+VGY    +   YW  +   GP WGE G+I++   
Sbjct: 256 NFQYYQKDIFSD--CG-TELDHGVLLVGYSASGK---YWKVKNSWGPNWGESGFIRLA-- 307

Query: 471 KNNRCGIASSASY 433
             N CG+ + AS+
Sbjct: 308 AGNTCGLCNMASF 320


>UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3;
           Giardia intestinalis|Rep: Cathepsin B-like CP1 precursor
           - Giardia lamblia (Giardia intestinalis)
          Length = 303

 Score = 55.2 bits (127), Expect = 1e-06
 Identities = 24/64 (37%), Positives = 33/64 (51%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           +     Y SGVY     +     H + +VGYGT + G DYW  +   GP WGE GY +++
Sbjct: 226 YADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIKNSWGPDWGENGYFRIV 285

Query: 477 RNKN 466
           R  N
Sbjct: 286 RGVN 289


>UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza
           sativa|Rep: Cysteine proteinase-like - Oryza sativa
           subsp. japonica (Rice)
          Length = 360

 Score = 54.8 bits (126), Expect = 2e-06
 Identities = 30/75 (40%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYG-TDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           F+ Y SGVY         L+H V VVGYG   + G +YW  +   G  WGE GY+++ R 
Sbjct: 281 FRHYRSGVYAGSAACGRRLNHAVTVVGYGAAADGGGEYWLVKNQWGTWWGEGGYMRVARG 340

Query: 471 --KNNRCGIASSASY 433
                 CGIA+ A Y
Sbjct: 341 GAAGGNCGIATYAFY 355


>UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_113_4299_5381 - Giardia lamblia ATCC
           50803
          Length = 360

 Score = 54.8 bits (126), Expect = 2e-06
 Identities = 26/66 (39%), Positives = 35/66 (53%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           F  Y SGVY +         H V ++GYG  + G+DYW      GP WGE GY +++R  
Sbjct: 286 FMYYKSGVY-QHRWGLWLGGHAVEIIGYGVTDSGLDYWTVRNSWGPDWGEDGYFRIVRG- 343

Query: 468 NNRCGI 451
            + CGI
Sbjct: 344 GDECGI 349


>UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase
           precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin
           L-like cysteine proteinase precursor - Acanthoscelides
           obtectus (Bean weevil)
          Length = 321

 Score = 54.8 bits (126), Expect = 2e-06
 Identities = 25/58 (43%), Positives = 35/58 (60%)
 Frame = -1

Query: 606 SSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNRCGIASSASY 433
           S  DL+HGVL+VGYG       YW  +   G +WGE GY ++ ++  N CG+A+  SY
Sbjct: 266 SEKDLNHGVLLVGYGDG-----YWIVKNSWGRIWGEQGYFRLKKDAGNTCGVATWPSY 318


>UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5;
           Piroplasmida|Rep: Cysteine proteinase, putative -
           Theileria parva
          Length = 460

 Score = 54.8 bits (126), Expect = 2e-06
 Identities = 33/69 (47%), Positives = 43/69 (62%), Gaps = 3/69 (4%)
 Frame = -1

Query: 642 LYSSGVYNEEECSSTDLDHGVLVVGYGTDEQ-GVDYWXREELVGPLWGELGYIKMIR-NK 469
           +Y +GVYN E C S  L+H VL+VG G DE     YW  +   GP WGE GY+++ R NK
Sbjct: 387 MYQAGVYNGE-CGSA-LNHAVLLVGEGYDEVLDKRYWVIKNSWGPDWGEDGYLRLERTNK 444

Query: 468 -NNRCGIAS 445
             ++CGI S
Sbjct: 445 GEDKCGILS 453


>UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3;
           Dictyostelium discoideum|Rep: Cysteine proteinase 1
           precursor - Dictyostelium discoideum (Slime mold)
          Length = 343

 Score = 54.8 bits (126), Expect = 2e-06
 Identities = 28/75 (37%), Positives = 41/75 (54%), Gaps = 4/75 (5%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDE----QGVDYWXREELVGPLWGELGYIKM 481
           +Q Y  GV+ +  C+   LDHG+L+VGY        + + YW  +   G  WGE GYI +
Sbjct: 267 WQFYIGGVF-DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYL 325

Query: 480 IRNKNNRCGIASSAS 436
            R KN  CG+++  S
Sbjct: 326 RRGKNT-CGVSNFVS 339


>UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 392

 Score = 54.4 bits (125), Expect = 2e-06
 Identities = 27/67 (40%), Positives = 39/67 (58%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472
           S + YS G+ +++ CS+   DH VL++GYG+D  GV YW  +      WG  G+IK+   
Sbjct: 324 SLKFYSDGIMSDKHCSNKT-DHAVLLIGYGSD-NGVPYWLIKNSWSHKWGNNGFIKI--- 378

Query: 471 KNNRCGI 451
           K   CGI
Sbjct: 379 KQGLCGI 385


>UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas
           foetus|Rep: TFCP2 protein - Tritrichomonas foetus
           (Trichomonas foetus)
          Length = 270

 Score = 54.0 bits (124), Expect = 3e-06
 Identities = 29/76 (38%), Positives = 40/76 (52%), Gaps = 1/76 (1%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDL-DHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKM 481
           H SFQLY  G+Y    C +  + +H + +VGYG  E   +YW      G  WGE GYI+ 
Sbjct: 193 HYSFQLYQGGIYWSWFCRTQYIYNHAMGIVGYGV-EGSEEYWIVRNSWGESWGEQGYIRY 251

Query: 480 IRNKNNRCGIASSASY 433
           +   +N C IA   +Y
Sbjct: 252 LLG-SNVCNIADYVTY 266


>UniRef50_P25779 Cluster: Cruzipain precursor; n=54;
           Trypanosoma|Rep: Cruzipain precursor - Trypanosoma cruzi
          Length = 467

 Score = 54.0 bits (124), Expect = 3e-06
 Identities = 31/80 (38%), Positives = 42/80 (52%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           +S+  Y+ GV     C S  LDHGVL+VGY  D   V YW  +      WGE GYI++ +
Sbjct: 264 SSWMTYTGGVMTS--CVSEQLDHGVLLVGY-NDSAAVPYWIIKNSWTTQWGEEGYIRIAK 320

Query: 474 NKNNRCGIASSASYXXV*TP 415
             +N+C +   AS   V  P
Sbjct: 321 G-SNQCLVKEEASSAVVGGP 339


>UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza
           sativa|Rep: OSJNBb0085F13.15 protein - Oryza sativa
           subsp. japonica (Rice)
          Length = 383

 Score = 53.6 bits (123), Expect = 4e-06
 Identities = 25/73 (34%), Positives = 43/73 (58%), Gaps = 3/73 (4%)
 Frame = -1

Query: 648 FQLY-SSGVYNEEECSSTDLDHGVLVVGYGTD--EQGVDYWXREELVGPLWGELGYIKMI 478
           FQ Y  +GVY      ST+++H + +VGYGT+  + G +YW  +   G LWG+ G++ + 
Sbjct: 302 FQNYRGNGVYKGGTGCSTNVNHALTIVGYGTNHPDTGENYWIAKNSYGNLWGDNGFVYLA 361

Query: 477 RNKNNRCGIASSA 439
           ++  +R G+   A
Sbjct: 362 KDTADRTGVCGLA 374


>UniRef50_Q9XZI2 Cluster: Cathepsin Z1 preproprotein; n=3;
           Bilateria|Rep: Cathepsin Z1 preproprotein - Toxocara
           canis (Canine roundworm)
          Length = 307

 Score = 53.6 bits (123), Expect = 4e-06
 Identities = 23/67 (34%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTD-EQGVDYWXREELVGPLWGELGYIKMIR 475
           +F++YS G+Y EE  +S ++DH + V G+G D +  V YW      G  WGE G+ +++ 
Sbjct: 225 AFEMYSGGIYTEE--TSEEIDHIIAVYGWGVDHDSSVPYWIGRNSWGTPWGESGWFRVVT 282

Query: 474 NKNNRCG 454
           ++    G
Sbjct: 283 SEYKHAG 289


>UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           L-like proteinase" precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 315

 Score = 53.6 bits (123), Expect = 4e-06
 Identities = 32/74 (43%), Positives = 43/74 (58%), Gaps = 1/74 (1%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGV-DYWXREELVGPLWGELGYIKMIR 475
           ++QLY  G++N + C  T+L+HGVL VGY  D   V + W      G  WGE GYI++ R
Sbjct: 247 TWQLYGGGLFNNKNCR-TNLNHGVLAVGYTKDAFIVKNSW------GTSWGEQGYIRVAR 299

Query: 474 NKNNRCGIASSASY 433
            + N CGI    SY
Sbjct: 300 GE-NLCGINLMNSY 312


>UniRef50_Q23FS6 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 330

 Score = 53.6 bits (123), Expect = 4e-06
 Identities = 30/76 (39%), Positives = 45/76 (59%)
 Frame = -1

Query: 642 LYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNN 463
           LY+SG+++   C   +L+HGVL+VG+ + E     W  +   G  WGE GYI++     N
Sbjct: 259 LYNSGIFSN--CGQ-NLNHGVLLVGFNSTEGS---WLVKNSWGTSWGEQGYIRLA--DGN 310

Query: 462 RCGIASSASYXXV*TP 415
            CG+A++ASY  V  P
Sbjct: 311 TCGLANAASYPTVVPP 326


>UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4;
           Caenorhabditis|Rep: Cathepsin z protein 1 -
           Caenorhabditis elegans
          Length = 306

 Score = 53.6 bits (123), Expect = 4e-06
 Identities = 24/67 (35%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTD-EQGVDYWXREELVGPLWGELGYIKMIR 475
           +F+ Y+ G+Y  +E +  D+DH + V G+G D E GV+YW      G  WGE G+ K++ 
Sbjct: 224 AFETYAGGIY--KEVTDEDIDHIISVHGWGVDHESGVEYWIGRNSWGEPWGEHGWFKIVT 281

Query: 474 NKNNRCG 454
           ++    G
Sbjct: 282 SQYKNAG 288


>UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza
           sativa|Rep: Os09g0497500 protein - Oryza sativa subsp.
           japonica (Rice)
          Length = 349

 Score = 53.2 bits (122), Expect = 5e-06
 Identities = 36/86 (41%), Positives = 44/86 (51%), Gaps = 14/86 (16%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVD----------YWXREELVGPLWGE 499
           FQLY SGVY    C++ D++HGV VVGYG  E   D          YW  +   G  WG+
Sbjct: 263 FQLYGSGVYTGP-CTA-DVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGD 320

Query: 498 LGYIKMIRN----KNNRCGIASSASY 433
            GYI M R+     +  CGIA   SY
Sbjct: 321 AGYILMQRDVAGLASGLCGIALLPSY 346


>UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n=3;
           Brugia malayi|Rep: Cathepsin L-like cysteine proteinase
           - Brugia malayi (Filarial nematode worm)
          Length = 353

 Score = 53.2 bits (122), Expect = 5e-06
 Identities = 27/69 (39%), Positives = 42/69 (60%), Gaps = 2/69 (2%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEEC--SSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           SF  Y SG+YN+ +C  ++  ++H V+ VGYG  + G++Y+  +   GP WG+ GY + I
Sbjct: 277 SFVAYRSGIYNDPKCPTNAEKVNHAVIAVGYGV-QNGMEYFIIKNSWGPTWGQKGYGR-I 334

Query: 477 RNKNNRCGI 451
           R     CGI
Sbjct: 335 RAGVFMCGI 343


>UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia
           bovis|Rep: Cysteine protease 2 - Babesia bovis
          Length = 445

 Score = 53.2 bits (122), Expect = 5e-06
 Identities = 28/68 (41%), Positives = 39/68 (57%), Gaps = 3/68 (4%)
 Frame = -1

Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQ-GVDYWXREELVGPLWGELGYIKMIRNK-- 469
           YS GV+N E CS ++L+H VL+VG G D      YW  +   G  WGE GY ++ R    
Sbjct: 373 YSGGVFNGE-CSDSELNHAVLLVGEGYDSALKKRYWLLKNSWGTSWGEDGYFRLERTNTP 431

Query: 468 NNRCGIAS 445
            ++CG+ S
Sbjct: 432 TDKCGVLS 439


>UniRef50_A4VE98 Cluster: Cathepsin z; n=3; Tetrahymena thermophila
           SB210|Rep: Cathepsin z - Tetrahymena thermophila SB210
          Length = 585

 Score = 53.2 bits (122), Expect = 5e-06
 Identities = 24/63 (38%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQ-GVDYWXREELVGPLWGELGYIKMIRN 472
           F+ Y+ G+Y E       ++H + VVG+GTD Q GV+YW      G  WGE G+ ++  +
Sbjct: 509 FEAYTGGIYKESTAFPM-INHEIAVVGWGTDPQTGVEYWIGRNSWGTYWGENGFFRIQMH 567

Query: 471 KNN 463
           K N
Sbjct: 568 KQN 570



 Score = 41.9 bits (94), Expect = 0.013
 Identities = 20/58 (34%), Positives = 30/58 (51%)
 Frame = -1

Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466
           Y+ G+YN+   S    +H + VVG+G +E    YW      G  WGE G+ + +R  N
Sbjct: 212 YTGGIYNDTS-SYPGTNHVIEVVGWG-EENNEKYWIIRNSWGSYWGEKGFYRQLRGVN 267


>UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F
            like protease; n=1; Nasonia vitripennis|Rep: PREDICTED:
            similar to cathepsin F like protease - Nasonia
            vitripennis
          Length = 1036

 Score = 52.8 bits (121), Expect = 7e-06
 Identities = 28/74 (37%), Positives = 40/74 (54%), Gaps = 7/74 (9%)
 Frame = -1

Query: 651  SFQLYSSGVYNEEE--CSSTDLDHGVLVVGYGTD-----EQGVDYWXREELVGPLWGELG 493
            + Q Y  GV +  +  CS   LDHGVL+VGYG       ++ + YW  +   GP WGE G
Sbjct: 954  AMQFYMGGVSHPFKFLCSPDSLDHGVLIVGYGVKFYPIFKKTMPYWIIKNSWGPRWGEQG 1013

Query: 492  YIKMIRNKNNRCGI 451
            Y ++ R  +  CG+
Sbjct: 1014 YYRVYRG-DGTCGV 1026


>UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin L
           preproprotein; n=1; Monodelphis domestica|Rep:
           PREDICTED: similar to cathepsin L preproprotein -
           Monodelphis domestica
          Length = 356

 Score = 52.8 bits (121), Expect = 7e-06
 Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 17/90 (18%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECS---STDLDHGVLVVGYGT------DEQGVD--------YWXREE 523
           SF+ Y  G Y E  C     ++++H +LVVGYG       +E G+         +W  + 
Sbjct: 264 SFRYYQGGPYIEPRCRLSYMSNMNHALLVVGYGPLERSKYEEFGLQAYMHKDNKFWIAKN 323

Query: 522 LVGPLWGELGYIKMIRNKNNRCGIASSASY 433
             G  WG+ GYI + +++ N+CGIAS+A+Y
Sbjct: 324 SWGEQWGDRGYIYIPKDRYNQCGIASNANY 353


>UniRef50_Q54VR1 Cluster: Putative uncharacterized protein; n=1;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 291

 Score = 52.8 bits (121), Expect = 7e-06
 Identities = 22/59 (37%), Positives = 38/59 (64%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           +F+ Y+SGV+     S+ +++H + ++G+GT E GVDYW      G  +GELG+ ++ R
Sbjct: 214 AFESYTSGVFTSSVGSTGEINHEISIIGWGT-ENGVDYWIGRNSWGTYFGELGFFRIQR 271


>UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2;
           Trypanosoma cruzi|Rep: Cysteine proteinase, putative -
           Trypanosoma cruzi
          Length = 392

 Score = 52.8 bits (121), Expect = 7e-06
 Identities = 24/69 (34%), Positives = 38/69 (55%), Gaps = 2/69 (2%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSST-DLDHGVLVVGYGTDEQ-GVDYWXREELVGPLWGELGYIKM 481
           T +  Y+ G++N  + S    ++H V +VGYG D +  +DYW       P WGE GY+++
Sbjct: 288 TYWSAYAGGIFNGCDYSKNITINHVVQLVGYGHDNKLNLDYWILRNSWSPSWGENGYMRL 347

Query: 480 IRNKNNRCG 454
           +R     CG
Sbjct: 348 LRTDKAECG 356


>UniRef50_A7APS9 Cluster: Papain family cysteine protease containing
           protein; n=1; Babesia bovis|Rep: Papain family cysteine
           protease containing protein - Babesia bovis
          Length = 435

 Score = 52.8 bits (121), Expect = 7e-06
 Identities = 26/78 (33%), Positives = 44/78 (56%), Gaps = 3/78 (3%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           +  +Q YSSG+   + C+  +++H V++ G G D+ G  +W  +   G  WGE GY+++ 
Sbjct: 359 NVDWQFYSSGIL--DSCAD-EINHAVVLAGVGQDDDG-PFWLIKNSWGTSWGEEGYVRLA 414

Query: 477 RNK---NNRCGIASSASY 433
           R     +N CG+A  A Y
Sbjct: 415 RGSSAFDNECGLAHMALY 432


>UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184,
           whole genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_184,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 331

 Score = 52.8 bits (121), Expect = 7e-06
 Identities = 32/77 (41%), Positives = 42/77 (54%), Gaps = 5/77 (6%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGV-DYWXREELVGPLWGELGYIK-- 484
           FQLYS GVY+    + T  DL+HGVL VGY  D   + + W      G  WGE GY++  
Sbjct: 258 FQLYSGGVYSRSCTAKTIDDLNHGVLAVGYAKDSYTIKNSW------GASWGEKGYMRLG 311

Query: 483 MIRNKNNRCGIASSASY 433
           ++  K  +CGI    SY
Sbjct: 312 LVAAKEGQCGIHWVPSY 328


>UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_567_6496_7413 - Giardia lamblia ATCC
           50803
          Length = 305

 Score = 52.4 bits (120), Expect = 9e-06
 Identities = 28/73 (38%), Positives = 38/73 (52%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           H  F  Y  G+Y++   +S    H VL+VGYG+     DYW      G  WGE GY +++
Sbjct: 230 HEDFLYYVGGIYHKVYGTSLG-GHAVLIVGYGSMNNH-DYWIVRNSWGSDWGENGYFRIL 287

Query: 477 RNKNNRCGIASSA 439
           R   N CGI  +A
Sbjct: 288 RG-TNECGIEKNA 299


>UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Lymantria
           dispar multicapsid nuclear polyhedrosis virus (LdMNPV)
          Length = 356

 Score = 52.4 bits (120), Expect = 9e-06
 Identities = 27/65 (41%), Positives = 36/65 (55%)
 Frame = -1

Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNR 460
           Y  GV +   C +  L+H VL+VGYG  E GV YW  +   G  WGE GY + +R   N 
Sbjct: 287 YYRGVISS--CENNGLNHAVLLVGYGV-ENGVPYWVFKNTWGDDWGENGYFR-VRQNVNA 342

Query: 459 CGIAS 445
           CG+ +
Sbjct: 343 CGMVN 347


>UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|Rep:
           Cathepsin L precursor - Schistosoma mansoni (Blood
           fluke)
          Length = 319

 Score = 52.4 bits (120), Expect = 9e-06
 Identities = 25/58 (43%), Positives = 33/58 (56%)
 Frame = -1

Query: 609 CSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNRCGIASSAS 436
           CS   LDH VL+VGYG  E+   +W  +   G  WGE GY +M R  +  CGI + A+
Sbjct: 258 CSKYLLDHAVLLVGYGVSEKNEPFWIVKNSWGVEWGENGYFRMYRG-DGSCGINTVAT 314


>UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core
           eudicotyledons|Rep: At1g02300/T6A9_10 - Arabidopsis
           thaliana (Mouse-ear cress)
          Length = 362

 Score = 52.0 bits (119), Expect = 1e-05
 Identities = 27/70 (38%), Positives = 37/70 (52%), Gaps = 1/70 (1%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLD-HGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKM 481
           +  F  Y SGVY  +  + T++  H V ++G+GT + G DYW         WG+ GY K 
Sbjct: 267 YEDFAHYKSGVY--KHITGTNIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFK- 323

Query: 480 IRNKNNRCGI 451
           IR   N CGI
Sbjct: 324 IRRGTNECGI 333


>UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa
           (japonica cultivar-group)|Rep: Os09g0562700 protein -
           Oryza sativa subsp. japonica (Rice)
          Length = 235

 Score = 52.0 bits (119), Expect = 1e-05
 Identities = 34/85 (40%), Positives = 44/85 (51%), Gaps = 12/85 (14%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVD--------YWXREELVGPLWGEL 496
           +FQ Y  GVY +  C  T L+HGV VVGYG +E   D        YW  +   G  WG+ 
Sbjct: 150 NFQHYRKGVY-DGPCG-TRLNHGVTVVGYGQEEAAADGGAAGGDKYWIIKNSWGKNWGDQ 207

Query: 495 GYIKMIRNKNNR----CGIASSASY 433
           GYIKM ++   +    CGIA   S+
Sbjct: 208 GYIKMKKDVAGKPEGLCGIAIRPSF 232


>UniRef50_Q015J8 Cluster: Cathepsin; n=2; Ostreococcus|Rep:
           Cathepsin - Ostreococcus tauri
          Length = 556

 Score = 52.0 bits (119), Expect = 1e-05
 Identities = 22/64 (34%), Positives = 35/64 (54%), Gaps = 5/64 (7%)
 Frame = -1

Query: 645 QLYSSGVYNEEECSS-----TDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKM 481
           Q Y  GV   ++C       + ++H VLVVG+G  + G+ YW  +   GP WG+ G+ K+
Sbjct: 315 QAYDDGVIMMDDCHPLGRGISSINHAVLVVGWGVTKDGIKYWELKNSYGPKWGDQGFFKL 374

Query: 480 IRNK 469
            R +
Sbjct: 375 ERGR 378


>UniRef50_Q01FU9 Cluster: Cathepsin Z; n=2; Ostreococcus|Rep:
           Cathepsin Z - Ostreococcus tauri
          Length = 387

 Score = 51.6 bits (118), Expect = 2e-05
 Identities = 22/58 (37%), Positives = 34/58 (58%)
 Frame = -1

Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466
           Y  G+Y  ++  S +++H V +VG+GT + G  YW      G  WGE+GY ++IR  N
Sbjct: 278 YVGGIY--KDTPSFEINHIVSIVGWGTAKDGTKYWIVRNSWGQYWGEMGYFRIIRGVN 333


>UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35;
           Viridiplantae|Rep: Cysteine proteinase 15A precursor -
           Pisum sativum (Garden pea)
          Length = 363

 Score = 51.6 bits (118), Expect = 2e-05
 Identities = 30/76 (39%), Positives = 39/76 (51%), Gaps = 6/76 (7%)
 Frame = -1

Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDE------QGVDYWXREELVGPLWGELGYIK 484
           Q Y SGV     C+ + LDHGVL+VG+G         +   YW  +   G  WGE GY K
Sbjct: 280 QTYMSGVSCPYVCAKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIIKNSWGQNWGEQGYYK 339

Query: 483 MIRNKNNRCGIASSAS 436
           + R + N CG+ S  S
Sbjct: 340 ICRGR-NVCGVDSMVS 354


>UniRef50_Q9BL26 Cluster: Putative uncharacterized protein; n=4;
           Caenorhabditis elegans|Rep: Putative uncharacterized
           protein - Caenorhabditis elegans
          Length = 345

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 26/68 (38%), Positives = 40/68 (58%), Gaps = 2/68 (2%)
 Frame = -1

Query: 639 YSSGVYNE--EECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466
           Y  G+YN   EEC+ST     +++VGYG + +   YW  +   G  WGE GY+K+ R+  
Sbjct: 226 YKIGIYNPSIEECTSTHEIRSMVIVGYGIEGEQ-KYWIVKGSFGTSWGEQGYMKLARDV- 283

Query: 465 NRCGIASS 442
           N C +A++
Sbjct: 284 NACAMATT 291


>UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1;
           Trypanosoma cruzi|Rep: Cysteine protease, putative -
           Trypanosoma cruzi
          Length = 434

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 24/75 (32%), Positives = 43/75 (57%), Gaps = 3/75 (4%)
 Frame = -1

Query: 654 TSFQLYSSGVYNE--EECSSTDLDHGVLVVGYGTDEQ-GVDYWXREELVGPLWGELGYIK 484
           + +  Y+ GV++   ++  +  + H V +VGYGTD +   DYW      G  WGE G+I+
Sbjct: 275 SDWMFYTGGVFDGCGKDGENITISHAVQLVGYGTDNKTNQDYWVVRNSWGEGWGENGFIR 334

Query: 483 MIRNKNNRCGIASSA 439
           ++R K+N   + ++A
Sbjct: 335 LLRKKHNELCVFNNA 349


>UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=1;
           Biomphalaria glabrata|Rep: Cathepsin B preproprotein
           precursor - Biomphalaria glabrata (Bloodfluke planorb)
          Length = 333

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 27/72 (37%), Positives = 38/72 (52%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           ++ F  Y +GVY      S +  H V ++GYGT E G DYW         WG+ G+ K+ 
Sbjct: 259 YSDFLSYKTGVYRHTT-GSYEGGHAVKIIGYGT-ESGQDYWLVANSWNEDWGDKGFFKIA 316

Query: 477 RNKNNRCGIASS 442
           + K + CGI SS
Sbjct: 317 KGK-DECGIESS 327


>UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3;
           Theileria|Rep: Cysteine proteinase precursor - Theileria
           annulata
          Length = 441

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 27/68 (39%), Positives = 42/68 (61%), Gaps = 3/68 (4%)
 Frame = -1

Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTD-EQGVDYWXREELVGPLWGELGYIKMIRNK 469
           +LYS G++  + C   +L+H VL+VG G D E G+ YW  +   G  WGE G++++ R K
Sbjct: 364 KLYSGGIFTGK-CGG-ELNHAVLLVGEGVDHETGMRYWIIKNSWGEDWGENGFLRLQRTK 421

Query: 468 N--NRCGI 451
              ++CGI
Sbjct: 422 KGLDKCGI 429


>UniRef50_A2WPN4 Cluster: Putative uncharacterized protein; n=2;
           Oryza sativa (indica cultivar-group)|Rep: Putative
           uncharacterized protein - Oryza sativa subsp. indica
           (Rice)
          Length = 325

 Score = 50.8 bits (116), Expect = 3e-05
 Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 3/78 (3%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN- 472
           FQ Y  GVY +  C+   ++H V +VGY  +  G  YW  +      WGE GY+ + ++ 
Sbjct: 249 FQFYKGGVY-KGPCNPGSVNHAVTIVGYCENFGGEKYWIAKNSWSNDWGEQGYVYLAKDV 307

Query: 471 --KNNRCGIASSASYXXV 424
                 CG+A+S  Y  V
Sbjct: 308 WWPQGTCGLATSPFYPTV 325


>UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba
           histolytica|Rep: Cysteine protease 10 - Entamoeba
           histolytica
          Length = 297

 Score = 50.8 bits (116), Expect = 3e-05
 Identities = 23/50 (46%), Positives = 29/50 (58%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWG 502
           SFQ Y  G+Y+E  C    +DH V VVGYGT E+  D+W  +   G  WG
Sbjct: 250 SFQFYEGGIYDEPNCKW--VDHIVTVVGYGTTEEHQDFWVVKNSYGNEWG 297


>UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p -
           Drosophila melanogaster (Fruit fly)
          Length = 431

 Score = 50.8 bits (116), Expect = 3e-05
 Identities = 26/68 (38%), Positives = 34/68 (50%), Gaps = 2/68 (2%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLD--HGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           F  YS GVY E   +       H V +VG+G +  G  YW      G  WGE GY +++R
Sbjct: 345 FFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGYFRILR 404

Query: 474 NKNNRCGI 451
             +N CGI
Sbjct: 405 G-SNECGI 411


>UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep:
           Cysteine proteinase - Cryptobia salmositica
          Length = 443

 Score = 50.8 bits (116), Expect = 3e-05
 Identities = 25/73 (34%), Positives = 41/73 (56%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           +++Q Y+ G+ +   C    +DHGVL+VG+  D     YW  +      WGE GYI++ +
Sbjct: 257 STWQSYAGGIMSY--CPQDQIDHGVLIVGF-DDTASTPYWIIKNSWTANWGEEGYIRVAK 313

Query: 474 NKNNRCGIASSAS 436
             +N+CG+ S  S
Sbjct: 314 G-SNQCGLTSHPS 325


>UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 395

 Score = 50.4 bits (115), Expect = 4e-05
 Identities = 27/68 (39%), Positives = 41/68 (60%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           T+FQ Y+ G+Y+  E    D++H VL+VGY   ++  D W  +  +G  WGELGY + I 
Sbjct: 322 TAFQSYAGGIYDSVE-EYKDVNHIVLLVGY---DKPTDSWKIKNSLGTKWGELGYAR-IT 376

Query: 474 NKNNRCGI 451
             N++ GI
Sbjct: 377 ASNDKLGI 384


>UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC
           3.4.14.1) (Dipeptidyl-peptidase I) (DPP-I) (DPPI)
           (Cathepsin C) (Cathepsin J) (Dipeptidyl transferase)
           [Contains: Dipeptidyl-peptidase 1 exclusion domain chain
           (Dipeptidyl-peptidase I exclusion domain chain);
           Dipeptidyl-peptidase 1 heavy chain (Dipeptidyl-peptidase
           I heavy chain); Dipeptidyl-peptidase 1 light chain
           (Dipeptidyl-peptidase I light chain)]; n=50;
           Coelomata|Rep: Dipeptidyl-peptidase 1 precursor (EC
           3.4.14.1) (Dipeptidyl-peptidase I) (DPP-I) (DPPI)
           (Cathepsin C) (Cathepsin J) (Dipeptidyl transferase)
           [Contains: Dipeptidyl-peptidase 1 exclusion domain chain
           (Dipeptidyl-peptidase I exclusion domain chain);
           Dipeptidyl-peptidase 1 heavy chain (Dipeptidyl-peptidase
           I heavy chain); Dipeptidyl-peptidase 1 light chain
           (Dipeptidyl-peptidase I light chain)] - Homo sapiens
           (Human)
          Length = 463

 Score = 50.4 bits (115), Expect = 4e-05
 Identities = 30/79 (37%), Positives = 42/79 (53%), Gaps = 6/79 (7%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNE----EECSSTDL-DHGVLVVGYGTDE-QGVDYWXREELVGPLWGEL 496
           +  F  Y  G+Y+     +  +  +L +H VL+VGYGTD   G+DYW  +   G  WGE 
Sbjct: 377 YDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEN 436

Query: 495 GYIKMIRNKNNRCGIASSA 439
           GY + IR   + C I S A
Sbjct: 437 GYFR-IRRGTDECAIESIA 454


>UniRef50_UPI00006CBCFB Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 393

 Score = 50.0 bits (114), Expect = 5e-05
 Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 4/80 (5%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECS--STDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           S Q+Y SG+Y +  CS    D +H V++VGY ++     Y+      GP WGE G+ K+ 
Sbjct: 318 SLQMYGSGIY-DFPCSIDRNDANHAVVIVGYTSE-----YFLIRNSWGPHWGEEGHFKVR 371

Query: 477 RNKNNR--CGIASSASYXXV 424
           +  NN+  CG+ +  SY  +
Sbjct: 372 KESNNKGTCGLYNDMSYPYI 391


>UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Cathepsin
           B - Fasciola gigantica (Giant liver fluke)
          Length = 339

 Score = 50.0 bits (114), Expect = 5e-05
 Identities = 26/68 (38%), Positives = 36/68 (52%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           F +Y SG+Y+          H V ++G+G  E GV+YW         WGE GY +M+R +
Sbjct: 266 FGVYRSGIYHHVAGKFIGR-HAVRMIGWGV-ENGVNYWLMANSWNEEWGENGYFRMVRGR 323

Query: 468 NNRCGIAS 445
            N CGI S
Sbjct: 324 -NECGIES 330


>UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n=4;
           Tenebrionidae|Rep: Putative cathepsin B-like proteinase
           - Tenebrio molitor (Yellow mealworm)
          Length = 321

 Score = 50.0 bits (114), Expect = 5e-05
 Identities = 28/66 (42%), Positives = 34/66 (51%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           F  Y SGVY      S    H V +VG+G  E GV YW      G  WG+ G+ KM+R +
Sbjct: 248 FYNYVSGVYRHVSGESVGF-HVVKIVGWGV-ENGVPYWLIANSWGSSWGDHGFFKMLRGQ 305

Query: 468 NNRCGI 451
            N CGI
Sbjct: 306 -NECGI 310


>UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis
           zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa
           zea single nucleocapsid nuclear polyhedrosis virus)
          Length = 367

 Score = 50.0 bits (114), Expect = 5e-05
 Identities = 25/63 (39%), Positives = 37/63 (58%)
 Frame = -1

Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNR 460
           Y  G+ N+  C   DL+H VL++G+G  E  V YW  +   G  WGE G++++ RN  N 
Sbjct: 298 YRRGILNQ--CHIYDLNHAVLLIGWGI-ENNVPYWIIKNSWGEDWGENGFLRVRRNV-NA 353

Query: 459 CGI 451
           CG+
Sbjct: 354 CGL 356


>UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:
           Viral cathepsin - Xestia c-nigrum granulosis virus
           (XnGV) (Xestia c-nigrum granulovirus)
          Length = 346

 Score = 50.0 bits (114), Expect = 5e-05
 Identities = 29/64 (45%), Positives = 38/64 (59%), Gaps = 1/64 (1%)
 Frame = -1

Query: 639 YSSGVYNEEECS-STDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNN 463
           Y SGV   + CS    L+HGVL+VGYG  E  V YW  +   G  WGE G+ ++ R+ N+
Sbjct: 273 YKSGV--AKHCSVDHGLNHGVLLVGYG-QENDVKYWTLKNSWGSDWGEQGFFRIKRDVNS 329

Query: 462 RCGI 451
            CGI
Sbjct: 330 -CGI 332


>UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep:
           Cathepsin B - Triticum aestivum (Wheat)
          Length = 353

 Score = 49.6 bits (113), Expect = 6e-05
 Identities = 26/66 (39%), Positives = 34/66 (51%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           F  Y SGVY +         H V ++G+GT + G DYW         WG+ GY K+IR +
Sbjct: 263 FAHYKSGVY-KHITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGE 321

Query: 468 NNRCGI 451
            N CGI
Sbjct: 322 -NECGI 326


>UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15;
           Aphidoidea|Rep: Cathepsin B-S precursor - Tuberaphis
           styraci
          Length = 349

 Score = 49.6 bits (113), Expect = 6e-05
 Identities = 21/69 (30%), Positives = 36/69 (52%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           +  F +Y SG+Y +   +  +  H + ++G+G +E G  YW         WG+ G  K+I
Sbjct: 256 YDDFSVYKSGIYRKTPKAKYEGGHSIKIIGWG-EENGTPYWLAVNSWSKFWGDHGTFKII 314

Query: 477 RNKNNRCGI 451
           + + N CGI
Sbjct: 315 KGR-NECGI 322


>UniRef50_Q22RR8 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 332

 Score = 49.6 bits (113), Expect = 6e-05
 Identities = 30/79 (37%), Positives = 42/79 (53%), Gaps = 2/79 (2%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           T++Q Y  G +N+  C   +L+HGVL+VGY +       W  +   G  WGE GYI++  
Sbjct: 260 TNWQYYEFGTFND--CFD-NLNHGVLLVGYNSK---THQWKVKNSWGTSWGEDGYIRLGA 313

Query: 474 NKN--NRCGIASSASYXXV 424
           +    N CGI   ASY  V
Sbjct: 314 STKYLNTCGICEQASYPIV 332


>UniRef50_Q22NW9 Cluster: Papain family cysteine protease containing
            protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
            family cysteine protease containing protein - Tetrahymena
            thermophila SB210
          Length = 894

 Score = 49.6 bits (113), Expect = 6e-05
 Identities = 31/72 (43%), Positives = 43/72 (59%), Gaps = 1/72 (1%)
 Frame = -1

Query: 645  QLYSSGVYNEEEC-SSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
            Q Y SG+  +  C SS +L+HGVL+VGY T+    D++  +   G  WGE GY ++   K
Sbjct: 825  QRYHSGIIGD--CGSSVNLNHGVLIVGY-TE----DFFIVKNSWGTNWGEDGYFRI--TK 875

Query: 468  NNRCGIASSASY 433
             N CGI  +ASY
Sbjct: 876  TNTCGICEAASY 887


>UniRef50_A0DIY3 Cluster: Chromosome undetermined scaffold_52, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_52,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 512

 Score = 49.6 bits (113), Expect = 6e-05
 Identities = 23/59 (38%), Positives = 33/59 (55%)
 Frame = -1

Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNN 463
           Y  G    ++ + T L+H V VVG+G  E GV+YW      G  WG++GY KM  + +N
Sbjct: 441 YEGGYIFSQKTNKTILNHYVSVVGWGV-EDGVEYWIVRNSWGSYWGDMGYAKMKMHSDN 498


>UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16;
           Plasmodium (Vinckeia)|Rep: Cysteine proteinase precursor
           - Plasmodium vinckei
          Length = 506

 Score = 36.3 bits (80), Expect(2) = 7e-05
 Identities = 17/45 (37%), Positives = 24/45 (53%), Gaps = 3/45 (6%)
 Frame = -1

Query: 558 DEQGVDYWXREELVGPLWGELGYIKMIRNK---NNRCGIASSASY 433
           D+  + YW      GP WGE GYI++ RNK   +  CG+ S   +
Sbjct: 459 DDDIIYYWIVRNSWGPNWGEGGYIRIKRNKAGDDGFCGVGSDVFF 503



 Score = 32.7 bits (71), Expect(2) = 7e-05
 Identities = 16/29 (55%), Positives = 22/29 (75%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYG 562
           F LYS GV+ + EC+  +L+H VL+VGYG
Sbjct: 401 FVLYSGGVF-DGECNP-ELNHSVLLVGYG 427


>UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1;
           Rhipicephalus appendiculatus|Rep: Midgut cysteine
           proteinase 1 - Rhipicephalus appendiculatus (Brown ear
           tick)
          Length = 332

 Score = 49.2 bits (112), Expect = 9e-05
 Identities = 25/69 (36%), Positives = 36/69 (52%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           +  F  Y SGVY +       + H + ++G+GT E GV YW         WG+ GY K++
Sbjct: 254 YADFPSYKSGVYQQHMIKFMGV-HAIKILGWGT-EDGVPYWLVANSWNVGWGDKGYFKIL 311

Query: 477 RNKNNRCGI 451
           R K + CGI
Sbjct: 312 RGK-DECGI 319


>UniRef50_Q7JYA0 Cluster: RE20049p; n=2; Sophophora|Rep: RE20049p -
           Drosophila melanogaster (Fruit fly)
          Length = 340

 Score = 49.2 bits (112), Expect = 9e-05
 Identities = 26/64 (40%), Positives = 36/64 (56%), Gaps = 3/64 (4%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHG--VLVVGYGTD-EQGVDYWXREELVGPLWGELGYIKMI 478
           F  YSSGVY +E  + T+      ++VVGY  D +  +DYW      G  WGE GYI+++
Sbjct: 263 FMQYSSGVYVQETRALTNPKSSQFLVVVGYDHDVDSNLDYWRCLNSFGDTWGEEGYIRIV 322

Query: 477 RNKN 466
           R  N
Sbjct: 323 RRSN 326


>UniRef50_Q4UCF5 Cluster: Cysteine proteinase, tacP, putative; n=8;
           Theileria|Rep: Cysteine proteinase, tacP, putative -
           Theileria annulata
          Length = 498

 Score = 49.2 bits (112), Expect = 9e-05
 Identities = 28/74 (37%), Positives = 41/74 (55%), Gaps = 3/74 (4%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVD-YWXREELVGPLWGELGYIKM 481
           H  F  Y  G+Y +  C+  +L+H VL+VG G DE+    YW  +   G  WGE GY ++
Sbjct: 367 HREFLSYKGGLY-DGPCAK-NLNHYVLLVGEGYDEETKSRYWIIKNTFGQSWGENGYARI 424

Query: 480 IR--NKNNRCGIAS 445
           +R   K ++C I S
Sbjct: 425 VRTDEKFDKCDILS 438


>UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46;
           Eukaryota|Rep: Cysteine proteinase 1 precursor - Zea
           mays (Maize)
          Length = 371

 Score = 48.8 bits (111), Expect = 1e-04
 Identities = 31/78 (39%), Positives = 37/78 (47%), Gaps = 8/78 (10%)
 Frame = -1

Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDE------QGVDYWXREELVGPLWGELGYIK 484
           Q Y  GV     C    LDHGVL+VGYG         +   YW  +   G  WGE GY K
Sbjct: 285 QTYIGGVSCPYICGR-HLDHGVLLVGYGASGFAPIRLKDKPYWIIKNSWGENWGENGYYK 343

Query: 483 MIRNKN--NRCGIASSAS 436
           + R  N  N+CG+ S  S
Sbjct: 344 ICRGSNVRNKCGVDSMVS 361


>UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6;
           Schistosoma|Rep: Cathepsin C precursor - Schistosoma
           mansoni (Blood fluke)
          Length = 454

 Score = 48.8 bits (111), Expect = 1e-04
 Identities = 26/80 (32%), Positives = 39/80 (48%), Gaps = 9/80 (11%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLD--------HGVLVVGYGTDE-QGVDYWXREELVGPLW 505
           +  FQ Y  G+Y+     +   +        H VL+VGYG D+  G  YW  +   G  W
Sbjct: 367 YEDFQFYKEGIYHHTTVQTDHYNFNPFELTNHAVLLVGYGVDKLSGEPYWKVKNSWGVEW 426

Query: 504 GELGYIKMIRNKNNRCGIAS 445
           GE GY +++R   + CG+ S
Sbjct: 427 GEQGYFRILRG-TDECGVES 445


>UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza
           sativa|Rep: Os01g0347600 protein - Oryza sativa subsp.
           japonica (Rice)
          Length = 343

 Score = 48.4 bits (110), Expect = 1e-04
 Identities = 29/80 (36%), Positives = 40/80 (50%), Gaps = 4/80 (5%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTD-EQGVDYWXREELVGPLWGELGYI---K 484
           +FQ Y SGV+    C ++  +H V +VGY  D   G  YW  +   G  WG+ GYI   K
Sbjct: 266 AFQFYKSGVF-PGPCGASS-NHAVTLVGYCQDGASGKKYWVAKNSWGKTWGQQGYILLEK 323

Query: 483 MIRNKNNRCGIASSASYXXV 424
            +   +  CG+A S  Y  V
Sbjct: 324 DVLQPHGTCGLAVSPFYPTV 343


>UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep:
           Cathepsin B - Uronema marinum
          Length = 350

 Score = 48.4 bits (110), Expect = 1e-04
 Identities = 26/71 (36%), Positives = 36/71 (50%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           ++ F  YSSGVY     S     H + ++G+G  E G  YW         WGE G+ K++
Sbjct: 272 YSDFLTYSSGVYQNTSGSYMG-GHAIKMLGWGV-ENGTPYWLCANSWNSSWGENGFFKIL 329

Query: 477 RNKNNRCGIAS 445
           R  +N CGI S
Sbjct: 330 RG-SNECGIES 339


>UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|Rep:
           Cathepsin B - Streblomastix strix
          Length = 283

 Score = 48.4 bits (110), Expect = 1e-04
 Identities = 25/76 (32%), Positives = 41/76 (53%), Gaps = 1/76 (1%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           ++ F  Y SGVY   +    +  H VL+VG+G +++ V YW  +   G  WGE G+ K++
Sbjct: 205 YSDFMSYKSGVY-VHQAGYIEGGHAVLIVGWGVEDE-VPYWLVQNSWGTDWGENGFFKIL 262

Query: 477 RNKNN-RCGIASSASY 433
           R  ++  C    +A Y
Sbjct: 263 RGSDHCECESNVTAGY 278


>UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61;
           Leishmania|Rep: Cysteine proteinase 2 precursor -
           Leishmania pifanoi
          Length = 444

 Score = 48.4 bits (110), Expect = 1e-04
 Identities = 25/63 (39%), Positives = 34/63 (53%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           +SF  Y SGV     C    L+HGVL+VGY    + V YW  +   G  WGE GY++++ 
Sbjct: 269 SSFMSYKSGVLTA--CIGKQLNHGVLLVGYDMTGE-VPYWVIKNSWGGDWGEQGYVRVVM 325

Query: 474 NKN 466
             N
Sbjct: 326 GVN 328


>UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza
           sativa|Rep: Putative cysteine protease - Oryza sativa
           subsp. japonica (Rice)
          Length = 357

 Score = 48.0 bits (109), Expect = 2e-04
 Identities = 28/81 (34%), Positives = 40/81 (49%), Gaps = 5/81 (6%)
 Frame = -1

Query: 651 SFQLYSSGVY-NEEECSSTDLDHGVLVVGYGTD-EQGVDYWXREELVGPLWGELGYI--- 487
           +FQ Y SGV+      ++   +H V +VGY  D   G  YW  +   G  WG+ GYI   
Sbjct: 277 AFQFYGSGVFPGPRGTAAPKPNHAVTLVGYCQDGASGKKYWIAKNSWGKTWGQQGYILLE 336

Query: 486 KMIRNKNNRCGIASSASYXXV 424
           K + + +  CG+A S  Y  V
Sbjct: 337 KDVASPHGTCGLAVSPFYPTV 357


>UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1;
           Oryza sativa (japonica cultivar-group)|Rep: Putative
           uncharacterized protein - Oryza sativa subsp. japonica
           (Rice)
          Length = 289

 Score = 48.0 bits (109), Expect = 2e-04
 Identities = 28/82 (34%), Positives = 40/82 (48%), Gaps = 4/82 (4%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTD-EQGVDYWXREELVGPLWGELGYI-- 487
           H ++  Y SGV+    C ++  +H V +VGY  D   G  YW  +   G  WG+ GYI  
Sbjct: 210 HATYPFYKSGVF-PGPCGASS-NHAVTLVGYCQDGASGKKYWVAKNSWGKTWGQQGYILL 267

Query: 486 -KMIRNKNNRCGIASSASYXXV 424
            K +   +  CG+A S  Y  V
Sbjct: 268 EKDVLQPHGTCGLAVSPFYPTV 289


>UniRef50_Q26988 Cluster: Cysteine proteinase 4; n=1; Tritrichomonas
           foetus|Rep: Cysteine proteinase 4 - Tritrichomonas
           foetus (Trichomonas foetus)
          Length = 152

 Score = 48.0 bits (109), Expect = 2e-04
 Identities = 19/32 (59%), Positives = 24/32 (75%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTD 556
           SF  YSSG+YN+ +CSST LDH V  +GYG +
Sbjct: 118 SFNSYSSGIYNDRQCSSTVLDHAVGCIGYGAE 149


>UniRef50_Q23H15 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 370

 Score = 48.0 bits (109), Expect = 2e-04
 Identities = 27/63 (42%), Positives = 36/63 (57%)
 Frame = -1

Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNR 460
           Y SG++N  + S   L+H VL VGY  D+QG   W  +   GP WGE GY+++    NN 
Sbjct: 305 YQSGIFNGCDQSLIILNHAVLAVGY--DKQG--NWIVKNSWGPYWGENGYMRLA--PNNT 358

Query: 459 CGI 451
           C I
Sbjct: 359 CSI 361


>UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathepsin
           o - Aedes aegypti (Yellowfever mosquito)
          Length = 375

 Score = 48.0 bits (109), Expect = 2e-04
 Identities = 30/75 (40%), Positives = 44/75 (58%), Gaps = 2/75 (2%)
 Frame = -1

Query: 651 SFQLYSSGV--YNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           S++ Y  GV  Y+ EE    DL+H V +VGY  + Q + Y+  +   GP +G+ GYIK I
Sbjct: 300 SWKYYLGGVIQYHCEEAYE-DLNHAVEIVGYNLESQ-IPYYLVKNSWGPRFGDRGYIK-I 356

Query: 477 RNKNNRCGIASSASY 433
           +   N CGIA+  S+
Sbjct: 357 QVGKNLCGIANRVSF 371


>UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum
           aestivum|Rep: Cysteine protease - Triticum aestivum
           (Wheat)
          Length = 371

 Score = 47.6 bits (108), Expect = 3e-04
 Identities = 26/71 (36%), Positives = 35/71 (49%)
 Frame = -1

Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466
           Q Y SGVY    C+ T  +H V VVGYG    G +YW  +   G  WG+ G+  + R  +
Sbjct: 297 QDYKSGVYRGP-CT-TSQNHVVTVVGYGVTGAGEEYWIAKNSWGQTWGQKGFFFVRRGAD 354

Query: 465 NRCGIASSASY 433
              G+   A Y
Sbjct: 355 GPRGLCGIAMY 365


>UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 343

 Score = 47.6 bits (108), Expect = 3e-04
 Identities = 23/65 (35%), Positives = 37/65 (56%)
 Frame = -1

Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466
           + Y SG+  + +C  T+  H ++V+GYG D     YW  +     +WGE GY+++ R+  
Sbjct: 276 RFYHSGIAEDPDCG-TEPTHALIVIGYGPD-----YWILKNTYSKVWGEKGYMRVKRDV- 328

Query: 465 NRCGI 451
           N CGI
Sbjct: 329 NWCGI 333


>UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4;
           Hymenostomatida|Rep: Phospholipase A1 - Tetrahymena
           thermophila
          Length = 320

 Score = 47.6 bits (108), Expect = 3e-04
 Identities = 29/78 (37%), Positives = 45/78 (57%), Gaps = 1/78 (1%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGV-DYWXREELVGPLWGELGYIKMI 478
           T+FQ Y+SGV+  + C + +L+HGVL+V        + + W      GP WGE G+I++ 
Sbjct: 254 TNFQFYTSGVF--KNCKA-NLNHGVLLVANVDSSLKIKNSW------GPSWGEKGFIRLA 304

Query: 477 RNKNNRCGIASSASYXXV 424
               N CG+ ++ASY  V
Sbjct: 305 --AGNTCGVCNAASYPIV 320


>UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8.4;
           n=1; Caenorhabditis elegans|Rep: Putative
           uncharacterized protein W07B8.4 - Caenorhabditis elegans
          Length = 335

 Score = 47.6 bits (108), Expect = 3e-04
 Identities = 24/73 (32%), Positives = 36/73 (49%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           +  F LY +G+Y           H V ++G+G D  G  YW        +WGE GY +++
Sbjct: 255 YEDFYLYKTGIYTHVAGGELG-GHAVKMLGWGVDN-GTPYWLAANSWNTVWGEKGYFRIL 312

Query: 477 RNKNNRCGIASSA 439
           R   + CGI S+A
Sbjct: 313 RGV-DECGIESAA 324


>UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14;
           Leishmania|Rep: Cysteine proteinase 1 precursor -
           Leishmania pifanoi
          Length = 354

 Score = 47.6 bits (108), Expect = 3e-04
 Identities = 24/63 (38%), Positives = 36/63 (57%)
 Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           T++QLY  GV +   C +  L+HGVL+VG+  + +   YW  +   G  WGE GYI++  
Sbjct: 269 TTWQLYFGGVVSL--CLAWSLNHGVLIVGFNKNAKP-PYWIVKNSWGSSWGEKGYIRLAM 325

Query: 474 NKN 466
             N
Sbjct: 326 GSN 328


>UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27;
           Eukaryota|Rep: Cysteine proteinase GP-I - Zingiber
           officinale (Ginger)
          Length = 221

 Score = 47.6 bits (108), Expect = 3e-04
 Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 3/75 (4%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN- 472
           FQLY +G++    C+ +  +H   V G  T E   DYW  +   G  WGE GYI++ RN 
Sbjct: 143 FQLYRNGIFTGS-CNIS-ANHYRTVGGRET-ENDKDYWTVKNSWGKNWGESGYIRVERNI 199

Query: 471 --KNNRCGIASSASY 433
              + +CGIA S SY
Sbjct: 200 AESSGKCGIAISPSY 214


>UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:
           Viral cathepsin - Cydia pomonella granulosis virus
           (CpGV) (Cydia pomonella granulovirus)
          Length = 333

 Score = 47.6 bits (108), Expect = 3e-04
 Identities = 25/63 (39%), Positives = 39/63 (61%)
 Frame = -1

Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNR 460
           Y +G+ +  E ++  L+H VL+VGYG  +  V YW  +   G  WGE GY ++ R+KN+ 
Sbjct: 264 YKAGIADICE-NNEGLNHAVLLVGYGV-KNDVPYWILKNSWGAEWGEEGYFRVQRDKNS- 320

Query: 459 CGI 451
           CG+
Sbjct: 321 CGM 323


>UniRef50_Q84SA7 Cluster: Thiol protease; n=1; Aster tripolium|Rep:
           Thiol protease - Aster tripolium (Sea aster)
          Length = 188

 Score = 47.2 bits (107), Expect = 3e-04
 Identities = 30/76 (39%), Positives = 37/76 (48%), Gaps = 6/76 (7%)
 Frame = -1

Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVD------YWXREELVGPLWGELGYIK 484
           Q Y   V     CS   LDHGVL+VGYG+            YW  +   GP WGE GY K
Sbjct: 107 QTYIGKVSCPYVCSKKPLDHGVLLVGYGSAGYAPSRLKEKPYWIIKNSWGPDWGEDGYYK 166

Query: 483 MIRNKNNRCGIASSAS 436
            I + +N CG+ +  S
Sbjct: 167 -ICSGHNLCGMDTMVS 181


>UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella
           histriomuscorum|Rep: Cathepsin B - Oxytricha trifallax
           (Sterkiella histriomuscorum)
          Length = 294

 Score = 47.2 bits (107), Expect = 3e-04
 Identities = 27/71 (38%), Positives = 36/71 (50%), Gaps = 2/71 (2%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDL--DHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIK 484
           +T F  Y SGVY     ++TD+   H + ++GYG  E G  YW      GP WG  G+ K
Sbjct: 220 YTDFFNYQSGVYTP---TTTDVAGGHAIKILGYGV-ENGTPYWLCANSWGPAWGMSGFFK 275

Query: 483 MIRNKNNRCGI 451
           +   K   CGI
Sbjct: 276 I---KQGECGI 283


>UniRef50_Q7QQ92 Cluster: GLP_243_18349_20043; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_243_18349_20043 - Giardia lamblia
           ATCC 50803
          Length = 564

 Score = 47.2 bits (107), Expect = 3e-04
 Identities = 24/64 (37%), Positives = 30/64 (46%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           F  Y  G++N+  CS T LDH V+  GYG   QGV+ W      G  WG  G+       
Sbjct: 377 FSNYKGGIFNKP-CSKTGLDHQVMFAGYGY-YQGVEVWVMRNSWGEQWGSYGHFYTPIGN 434

Query: 468 NNRC 457
           N  C
Sbjct: 435 NVLC 438


>UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;
           Theileria|Rep: Cysteine protease, tacP, putative -
           Theileria annulata
          Length = 461

 Score = 47.2 bits (107), Expect = 3e-04
 Identities = 28/70 (40%), Positives = 42/70 (60%), Gaps = 3/70 (4%)
 Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVD-YWXREELVGPLWGELGYIKMIR 475
           SF  Y SG+Y + +CS  +L+H VL+VG G D +    YW  +   G  WGE G++++ R
Sbjct: 371 SFFDYKSGIY-DGDCS-VNLNHAVLLVGEGYDPKTKKRYWIIKNSWGRDWGEDGFMRLER 428

Query: 474 NK--NNRCGI 451
               N++CGI
Sbjct: 429 TNEGNDKCGI 438


>UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 383

 Score = 47.2 bits (107), Expect = 3e-04
 Identities = 22/69 (31%), Positives = 39/69 (56%), Gaps = 3/69 (4%)
 Frame = -1

Query: 639 YSSGVYNE--EECSSTDLD-HGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           Y SG++N   E+C+   +  H + ++GYG + +   YW  +   G  WG  GY ++ R  
Sbjct: 310 YRSGIFNPSVEDCTEKSMGAHALTIIGYGGEGESA-YWIVKNSWGTSWGASGYFRLARGV 368

Query: 468 NNRCGIASS 442
           N+ CG+A++
Sbjct: 369 NS-CGLANT 376


>UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_21,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 349

 Score = 47.2 bits (107), Expect = 3e-04
 Identities = 27/76 (35%), Positives = 40/76 (52%), Gaps = 3/76 (3%)
 Frame = -1

Query: 651 SFQLYSSGVYNEE-ECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           +FQLY  G+Y+ + + S   L+HGV  VGY  D     Y+  +   G  WGE GYI+  R
Sbjct: 267 NFQLYKKGIYSAKCDGSKPALNHGVTNVGYAPD-----YYLIKNSWGQSWGESGYIRFAR 321

Query: 474 --NKNNRCGIASSASY 433
             +K  +CG     ++
Sbjct: 322 IADKAGQCGAQQEVNF 337


>UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Spodoptera
           litura multicapsid nucleopolyhedrovirus (SpltMNPV)
          Length = 337

 Score = 47.2 bits (107), Expect = 3e-04
 Identities = 26/63 (41%), Positives = 33/63 (52%)
 Frame = -1

Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNR 460
           Y SG+     C+   L+H VL+VGYG  E    YW  +   G  WGE GY +  RN  N 
Sbjct: 268 YRSGIATV--CNDNGLNHAVLLVGYGI-ENDTPYWIFKNSWGSNWGENGYFRARRN-INA 323

Query: 459 CGI 451
           CG+
Sbjct: 324 CGM 326


>UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome
           shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome 5
           SCAF15026, whole genome shotgun sequence - Tetraodon
           nigroviridis (Green puffer)
          Length = 351

 Score = 46.8 bits (106), Expect = 5e-04
 Identities = 25/71 (35%), Positives = 38/71 (53%)
 Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           +  F LY SGVY     S+    H + ++G+G +E GV YW         WG+ G+ K++
Sbjct: 276 YEDFVLYKSGVYQHVSGSALG-GHAIKMLGWG-EENGVPYWLCANSWNTDWGDNGFFKIL 333

Query: 477 RNKNNRCGIAS 445
           R  ++ CGI S
Sbjct: 334 RGADH-CGIES 343


>UniRef50_O48605 Cluster: Putative thiol protease; n=1; Hordeum
           vulgare|Rep: Putative thiol protease - Hordeum vulgare
           (Barley)
          Length = 91

 Score = 46.8 bits (106), Expect = 5e-04
 Identities = 23/69 (33%), Positives = 35/69 (50%), Gaps = 2/69 (2%)
 Frame = -1

Query: 633 SGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN--NR 460
           SGVY +  C +   +H + +VGYGT   G  YW  +      WG+ G+I ++R+      
Sbjct: 22  SGVYIKGACKTAQ-NHAMALVGYGTKPDGTKYWIGKNSWTAKWGDKGFIYLLRDSPPLGL 80

Query: 459 CGIASSASY 433
           CG+A    Y
Sbjct: 81  CGLAKLPVY 89


>UniRef50_A7QDM1 Cluster: Chromosome chr10 scaffold_81, whole genome
           shotgun sequence; n=1; Vitis vinifera|Rep: Chromosome
           chr10 scaffold_81, whole genome shotgun sequence - Vitis
           vinifera (Grape)
          Length = 98

 Score = 46.8 bits (106), Expect = 5e-04
 Identities = 23/61 (37%), Positives = 29/61 (47%), Gaps = 3/61 (4%)
 Frame = -1

Query: 606 SSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKM---IRNKNNRCGIASSAS 436
           S  DLD+GV   GYG    G  +W  +   G  WGE GY +M   ++     CG    AS
Sbjct: 35  SGNDLDYGVTTDGYGRSADGKKHWLVKNSWGTDWGENGYTRMERGVKATTGLCGFTMQAS 94

Query: 435 Y 433
           Y
Sbjct: 95  Y 95


>UniRef50_Q23EG5 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 429

 Score = 46.8 bits (106), Expect = 5e-04
 Identities = 27/74 (36%), Positives = 39/74 (52%), Gaps = 2/74 (2%)
 Frame = -1

Query: 648 FQLYSSGVYNEEECSS--TDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           F+ Y  G+Y+  ECS+   +++H VL VGY    +   Y+  +   G  WG  GY   I 
Sbjct: 269 FENYEGGIYSNPECSTDPQEVNHAVLAVGYNLTGR---YYIVKNSWGKDWGMDGYF-YIE 324

Query: 474 NKNNRCGIASSASY 433
             +N CG+A  ASY
Sbjct: 325 LGSNMCGLADCASY 338


  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 519,672,949
Number of Sequences: 1657284
Number of extensions: 8876468
Number of successful extensions: 22674
Number of sequences better than 10.0: 440
Number of HSP's better than 10.0 without gapping: 21812
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 22384
length of database: 575,637,011
effective HSP length: 98
effective length of database: 413,223,179
effective search space used: 49586781480
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
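
Note on the statistics above: the bit scores in this report can be reproduced from the raw scores (the numbers in parentheses after "bits") using the gapped Lambda and K values printed in the footer, via the standard Karlin-Altschul conversion S' = (Lambda * S - ln K) / ln 2. The sketch below is illustrative only. The function names are hypothetical and are not part of BLAST. The relation "effective database length = total letters - sequences * effective HSP length" is an assumption that happens to agree with the figures printed here; it is not taken from the BLAST source.

```python
import math

# Gapped Karlin-Altschul parameters, copied from the report footer.
LAMBDA_GAPPED = 0.279
K_GAPPED = 0.0580


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def effective_db_length(total_letters, num_sequences, hsp_length):
    """Assumed relation: subtract one effective HSP length per sequence."""
    return total_letters - num_sequences * hsp_length


if __name__ == "__main__":
    # Raw scores 106 and 107 are the ones reported for the hits above.
    for raw in (106, 107):
        print(raw, round(bit_score(raw), 1))
    # Values from the "Database" and "effective HSP length" lines.
    print(effective_db_length(575_637_011, 1_657_284, 98))
```

With the values from this report, raw scores 107 and 106 round to 47.2 and 46.8 bits, and the assumed database-length relation gives 413,223,179, matching the "effective length of database" line.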
