SilkBase

Last updated: 2022/11/18
BLASTX 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= S06A01NCLL0001_H21
         (515 letters)

Database: uniref50 
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ...   268   7e-71
UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-...   252   3e-66
UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; ...   225   5e-58
UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome s...   220   1e-56
UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ...   197   1e-49
UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve...   164   9e-40
UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1...   155   7e-37
UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v...   143   2e-33
UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep...   142   3e-33
UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi...   142   4e-33
UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs...   140   2e-32
UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; ...   136   2e-31
UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve...   134   1e-30
UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C...   132   3e-30
UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome s...   132   5e-30
UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata...   131   8e-30
UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n...   130   2e-29
UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca...   129   4e-29
UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhip...   127   1e-28
UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j...   127   2e-28
UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er...   126   4e-28
UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine prot...   126   4e-28
UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ...   125   5e-28
UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|...   125   5e-28
UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep...   124   9e-28
UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M...   124   9e-28
UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica...   123   2e-27
UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; D...    89   3e-27
UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]...   122   5e-27
UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy...   121   9e-27
UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae...   121   1e-26
UniRef50_UPI000155637A Cluster: PREDICTED: similar to ENSANGP000...   120   2e-26
UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ...   120   2e-26
UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto...   120   3e-26
UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;...   120   3e-26
UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr...   118   8e-26
UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ...   117   2e-25
UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Re...   117   2e-25
UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n...   116   2e-25
UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus...   116   3e-25
UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca...   116   4e-25
UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei...   116   4e-25
UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18...   115   6e-25
UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ...   114   1e-24
UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph...   114   1e-24
UniRef50_Q24HI2 Cluster: Papain family cysteine protease contain...   114   1e-24
UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [C...   114   1e-24
UniRef50_Q4TI44 Cluster: Chromosome undetermined SCAF2412, whole...   113   2e-24
UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip...   113   2e-24
UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lambli...   113   3e-24
UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy...   113   3e-24
UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ...   112   5e-24
UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadida...   112   5e-24
UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicate...   112   5e-24
UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No...   111   7e-24
UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi...   111   7e-24
UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cy...   111   7e-24
UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lambli...   111   9e-24
UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n...   111   9e-24
UniRef50_Q23EG5 Cluster: Papain family cysteine protease contain...   111   1e-23
UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=...   109   4e-23
UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:...   109   5e-23
UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3...   108   6e-23
UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc...   108   6e-23
UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy...   108   8e-23
UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio...   108   8e-23
UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat...   107   1e-22
UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ...   107   1e-22
UniRef50_Q7KWP5 Cluster: Similar to Dictyostelium discoideum (Sl...   107   1e-22
UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat...   107   1e-22
UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n...   106   3e-22
UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ...   105   5e-22
UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n...   105   5e-22
UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox...   105   6e-22
UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina...   104   1e-21
UniRef50_P25773 Cluster: Cathepsin L; n=38; Eukaryota|Rep: Cathe...   104   1e-21
UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h...   103   2e-21
UniRef50_UPI0000E224BB Cluster: PREDICTED: hypothetical protein;...   103   2e-21
UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain...   103   2e-21
UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H...   102   4e-21
UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaste...   102   6e-21
UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n...   102   6e-21
UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia ...   101   1e-20
UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2...   101   1e-20
UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endop...   101   1e-20
UniRef50_Q23TW3 Cluster: Papain family cysteine protease contain...   100   2e-20
UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e...   100   2e-20
UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy...   100   2e-20
UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt...    99   3e-20
UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio...    99   3e-20
UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl...    99   3e-20
UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ...    99   3e-20
UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1; ...   100   4e-20
UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ...    99   5e-20
UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster...    99   5e-20
UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus...    99   5e-20
UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ...    99   7e-20
UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re...    98   9e-20
UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory...    98   9e-20
UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;...    98   1e-19
UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Ac...    98   1e-19
UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ...    98   1e-19
UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ...    97   2e-19
UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n...    97   2e-19
UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ...    96   4e-19
UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba hi...    96   5e-19
UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:...    96   5e-19
UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr...    96   5e-19
UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryz...    95   6e-19
UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ...    95   1e-18
UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel...    95   1e-18
UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryz...    94   1e-18
UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;...    94   2e-18
UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;...    94   2e-18
UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dv...    94   2e-18
UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted...    93   3e-18
UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foet...    93   3e-18
UniRef50_Q24E33 Cluster: Papain family cysteine protease contain...    93   3e-18
UniRef50_A2F4G0 Cluster: Clan CA, family C1, cathepsin L-like cy...    93   3e-18
UniRef50_Q22W19 Cluster: Papain family cysteine protease contain...    93   5e-18
UniRef50_A2WPN4 Cluster: Putative uncharacterized protein; n=2; ...    92   6e-18
UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ...    92   6e-18
UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy...    91   1e-17
UniRef50_Q0MYX5 Cluster: Putative cysteine protease; n=1; Emilia...    91   1e-17
UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal...    90   2e-17
UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ...    90   2e-17
UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryo...    90   3e-17
UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza sativa...    89   4e-17
UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ...    89   4e-17
UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain...    89   4e-17
UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2; Arab...    89   6e-17
UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like cy...    89   6e-17
UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like cy...    89   6e-17
UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-li...    89   6e-17
UniRef50_UPI0000E492F4 Cluster: PREDICTED: similar to cathepsin ...    88   1e-16
UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8; Magnoliophyt...    88   1e-16
UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic...    88   1e-16
UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole...    88   1e-16
UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty...    88   1e-16
UniRef50_A2DKX2 Cluster: Clan CA, family C1, cathepsin L-like cy...    88   1e-16
UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35...    88   1e-16
UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ...    87   2e-16
UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin ...    87   2e-16
UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi...    87   2e-16
UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re...    87   2e-16
UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ...    87   3e-16
UniRef50_O16454 Cluster: Temporarily assigned gene name protein ...    87   3e-16
UniRef50_Q9VNK7 Cluster: CG1075-PA; n=1; Drosophila melanogaster...    86   4e-16
UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr...    86   4e-16
UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov...    86   4e-16
UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|...    86   5e-16
UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:...    86   5e-16
UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa...    85   7e-16
UniRef50_Q22RC9 Cluster: Papain family cysteine protease contain...    85   7e-16
UniRef50_A3EXS2 Cluster: Cathepsin L-like cysteine proteinase-li...    85   7e-16
UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus ...    85   9e-16
UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ...    85   9e-16
UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schist...    85   1e-15
UniRef50_Q23H32 Cluster: Papain family cysteine protease contain...    85   1e-15
UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R...    85   1e-15
UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen...    84   2e-15
UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n...    84   2e-15
UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia...    83   3e-15
UniRef50_Q22A69 Cluster: Papain family cysteine protease contain...    83   3e-15
UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:...    83   3e-15
UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryz...    83   4e-15
UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-li...    83   4e-15
UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w...    83   4e-15
UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D...    83   4e-15
UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t...    83   5e-15
UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Try...    83   5e-15
UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|...    83   5e-15
UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; ...    82   6e-15
UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=...    82   6e-15
UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi...    82   8e-15
UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain...    81   1e-14
UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R...    81   1e-14
UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba hi...    81   1e-14
UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ...    81   2e-14
UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb...    81   2e-14
UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like cy...    81   2e-14
UniRef50_Q248G1 Cluster: Papain family cysteine protease contain...    80   3e-14
UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ...    79   4e-14
UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big...    79   6e-14
UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa...    79   6e-14
UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep...    79   6e-14
UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n...    79   6e-14
UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali...    79   6e-14
UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG...    79   8e-14
UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh...    78   1e-13
UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|...    78   1e-13
UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ...    78   1e-13
UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ...    77   2e-13
UniRef50_Q23RT7 Cluster: Papain family cysteine protease contain...    77   2e-13
UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma...    77   2e-13
UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease ...    77   2e-13
UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; ...    77   2e-13
UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; ...    77   2e-13
UniRef50_Q23FS6 Cluster: Papain family cysteine protease contain...    77   3e-13
UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain...    77   3e-13
UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil...    76   4e-13
UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi...    76   4e-13
UniRef50_Q239L8 Cluster: Papain family cysteine protease contain...    75   7e-13
UniRef50_UPI0000498E2F Cluster: cysteine proteinase; n=1; Entamo...    75   1e-12
UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis...    75   1e-12
UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia...    75   1e-12
UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3....    75   1e-12
UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:...    74   2e-12
UniRef50_Q23H10 Cluster: Papain family cysteine protease contain...    74   2e-12
UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto...    74   2e-12
UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ...    73   3e-12
UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease ...    73   3e-12
UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8...    73   3e-12
UniRef50_Q23H06 Cluster: Papain family cysteine protease contain...    73   3e-12
UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like cy...    73   3e-12
UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa...    73   4e-12
UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-...    73   4e-12
UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ...    73   4e-12
UniRef50_UPI00006CBB5F Cluster: Papain family cysteine protease ...    73   5e-12
UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, wh...    73   5e-12
UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa...    72   7e-12
UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain...    72   7e-12
UniRef50_UPI00006CBCFB Cluster: Papain family cysteine protease ...    72   9e-12
UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain...    72   9e-12
UniRef50_Q237A1 Cluster: Papain family cysteine protease contain...    71   1e-11
UniRef50_Q22NW9 Cluster: Papain family cysteine protease contain...    71   2e-11
UniRef50_Q5CYC8 Cluster: Cathepsin like thiol protease possibly ...    71   2e-11
UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain ...    71   2e-11
UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|R...    71   2e-11
UniRef50_A3FQ13 Cluster: Cathepsin like thiol protease possibly ...    71   2e-11
UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease ...    70   3e-11
UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; M...    70   3e-11
UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Catheps...    70   4e-11
UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|R...    70   4e-11
UniRef50_A7APS9 Cluster: Papain family cysteine protease contain...    70   4e-11
UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - ...    69   5e-11
UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoid...    69   6e-11
UniRef50_Q26988 Cluster: Cysteine proteinase 4; n=1; Tritrichomo...    69   6e-11
UniRef50_Q231X3 Cluster: Papain family cysteine protease contain...    69   6e-11
UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathep...    69   6e-11
UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, wh...    69   6e-11
UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Ca...    69   6e-11
UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3; ...    68   1e-10
UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|...    68   1e-10
UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain...    68   1e-10
UniRef50_A0DZB1 Cluster: Chromosome undetermined scaffold_7, who...    68   1e-10
UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain...    68   1e-10
UniRef50_Q235G6 Cluster: Papain family cysteine protease contain...    68   1e-10
UniRef50_A0DIY3 Cluster: Chromosome undetermined scaffold_52, wh...    67   2e-10
UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sa...    67   3e-10
UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|...    67   3e-10
UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum...    67   3e-10
UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru...    67   3e-10
UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ...    66   3e-10
UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G...    66   3e-10
UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C...    66   3e-10
UniRef50_Q7QRX3 Cluster: GLP_549_24108_24914; n=1; Giardia lambl...    66   4e-10
UniRef50_Q22DX2 Cluster: Papain family cysteine protease contain...    66   4e-10
UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag...    66   4e-10
UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, wh...    66   4e-10
UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4; Ancylostomati...    66   6e-10
UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7...    66   6e-10
UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=...    66   6e-10
UniRef50_A0BZE1 Cluster: Chromosome undetermined scaffold_139, w...    66   6e-10
UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2 ...    66   6e-10
UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1 ...    66   6e-10
UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Re...    65   8e-10
UniRef50_Q5CKC2 Cluster: Preprocathepsin c; n=2; Cryptosporidium...    65   8e-10
UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cy...    65   8e-10
UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine p...    65   1e-09
UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;...    65   1e-09
UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1...    65   1e-09
UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; ...    65   1e-09
UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8; Leishma...    65   1e-09
UniRef50_UPI0000D566F0 Cluster: PREDICTED: similar to CathePsin ...    64   1e-09
UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2...    64   1e-09
UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis...    64   1e-09
UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Gi...    64   1e-09
UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin ...    64   2e-09
UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyl...    64   2e-09
UniRef50_Q9XZM9 Cluster: Cysteine proteinase CPW2; n=1; Acantham...    64   2e-09
UniRef50_Q23H15 Cluster: Papain family cysteine protease contain...    64   2e-09
UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep: Ca...    64   2e-09
UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep...    64   2e-09
UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, wh...    64   2e-09
UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ...    64   2e-09
UniRef50_Q84SA7 Cluster: Thiol protease; n=1; Aster tripolium|Re...    64   2e-09
UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lambl...    64   2e-09
UniRef50_Q1AMF3 Cluster: Cathepsin C1; n=1; Toxoplasma gondii|Re...    64   2e-09
UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8....    64   2e-09
UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, w...    64   2e-09
UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase pr...    64   2e-09
UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 ...    64   2e-09
UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ...    63   3e-09
UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4 ...    63   3e-09
UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium...    63   3e-09
UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep...    63   4e-09
UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n...    63   4e-09
UniRef50_Q23FP9 Cluster: Papain family cysteine protease contain...    63   4e-09
UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus co...    62   6e-09
UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|R...    62   6e-09
UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome sh...    62   7e-09
UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestiv...    62   7e-09
UniRef50_Q01E11 Cluster: Cysteine proteinase Cathepsin F; n=3; O...    62   7e-09
UniRef50_Q9XZI2 Cluster: Cathepsin Z1 preproprotein; n=3; Bilate...    62   7e-09
UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=...    62   7e-09
UniRef50_Q8I8D3 Cluster: Cysteine protease 15; n=2; Entamoeba hi...    62   7e-09
UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheir...    62   7e-09
UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n...    62   7e-09
UniRef50_A0CYS7 Cluster: Chromosome undetermined scaffold_31, wh...    62   7e-09
UniRef50_A0CGA3 Cluster: Chromosome undetermined scaffold_179, w...    62   7e-09
UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: C...    62   1e-08
UniRef50_Q015J8 Cluster: Cathepsin; n=2; Ostreococcus|Rep: Cathe...    62   1e-08
UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|...    62   1e-08
UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like prote...    62   1e-08
UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia...    61   1e-08
UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1; Rhip...    61   2e-08
UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theil...    61   2e-08
UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4; Caenorhabdi...    61   2e-08
UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precurs...    61   2e-08
UniRef50_Q9UBR2 Cluster: Cathepsin Z precursor; n=40; Bilateria|...    61   2e-08
UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestin...    60   2e-08
UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Pir...    60   2e-08
UniRef50_UPI00006CB653 Cluster: Papain family cysteine protease ...    60   3e-08
UniRef50_Q84YE7 Cluster: Cysteine proteinase-like protein; n=1; ...    60   3e-08
UniRef50_P05993 Cluster: Cysteine proteinase; n=7; Eukaryota|Rep...    60   3e-08
UniRef50_Q9LUX8 Cluster: Cysteine protease; n=1; Pyrus pyrifolia...    60   4e-08
UniRef50_Q8I8D5 Cluster: Cysteine protease 13; n=2; Entamoeba hi...    60   4e-08
UniRef50_Q0PWU8 Cluster: Cathepsin B preproprotein-like protein;...    60   4e-08
UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1; ...    60   4e-08
UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh...    60   4e-08
UniRef50_Q9BL26 Cluster: Putative uncharacterized protein; n=4; ...    59   5e-08
UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella ve...    59   5e-08
UniRef50_A4VE98 Cluster: Cathepsin z; n=3; Tetrahymena thermophi...    59   5e-08
UniRef50_UPI0000E49DA9 Cluster: PREDICTED: similar to cathepsin ...    59   7e-08
UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease ...    59   7e-08
UniRef50_A7AX75 Cluster: Preprocathepsin c, putative; n=1; Babes...    59   7e-08
UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin ...    58   9e-08
UniRef50_Q962W3 Cluster: Cysteine protease; n=3; Giardia intesti...    58   9e-08
UniRef50_Q58HK5 Cluster: Hc58; n=1; Haemonchus contortus|Rep: Hc...    58   9e-08
UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase pr...    58   9e-08
UniRef50_Q8IIJ9 Cluster: Probable cathepsin C precursor; n=11; P...    58   9e-08
UniRef50_A6GBK8 Cluster: Peptidase C1A, papain; n=1; Plesiocysti...    58   1e-07
UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; R...    58   1e-07
UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ...    58   1e-07
UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain...    58   1e-07
UniRef50_A2E6N1 Cluster: Clan CA, family C1, cathepsin B-like cy...    58   1e-07
UniRef50_UPI0000498719 Cluster: cysteine protease 18-related; n=...    58   2e-07
UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; ...    58   2e-07
UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomus...    58   2e-07
UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpw...    58   2e-07
UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7; Rhabditida|...    58   2e-07
UniRef50_P56202 Cluster: Cathepsin W precursor; n=15; Eutheria|R...    58   2e-07
UniRef50_A0LQG9 Cluster: Peptidase C1A, papain; n=1; Syntrophoba...    57   2e-07
UniRef50_Q01FU9 Cluster: Cathepsin Z; n=2; Ostreococcus|Rep: Cat...    57   2e-07
UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Pl...    57   2e-07
UniRef50_Q8I8D2 Cluster: Cysteine protease 16; n=2; Entamoeba hi...    57   2e-07
UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3 ...    57   2e-07
UniRef50_Q9XXQ7 Cluster: Putative uncharacterized protein; n=2; ...    57   3e-07
UniRef50_Q4UFL9 Cluster: Cathepsin-like cysteine protease, putat...    57   3e-07
UniRef50_Q24F16 Cluster: Papain family cysteine protease contain...    57   3e-07
UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precu...    57   3e-07
UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep...    56   4e-07
UniRef50_Q8I8D0 Cluster: Cysteine protease 18; n=2; Entamoeba hi...    56   4e-07
UniRef50_Q6A1H9 Cluster: Cathepsin X/O; n=1; Suberites domuncula...    56   4e-07
UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, w...    56   4e-07
UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; The...    56   4e-07
UniRef50_Q94715 Cluster: Putative cathepsin L2 precursor; n=4; P...    56   4e-07
UniRef50_Q9GU75 Cluster: Thiolproteinase; n=2; Babesia|Rep: Thio...    56   5e-07
UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gamb...    56   5e-07
UniRef50_Q26987 Cluster: Cysteine proteinase 3; n=1; Tritrichomo...    56   5e-07
UniRef50_A0DKP6 Cluster: Chromosome undetermined scaffold_54, wh...    56   5e-07
UniRef50_A0CS14 Cluster: Chromosome undetermined scaffold_26, wh...    56   5e-07
UniRef50_Q1AMF1 Cluster: Cathepsin C3; n=1; Toxoplasma gondii|Re...    56   6e-07
UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p; ...    55   8e-07
UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lambl...    55   8e-07
UniRef50_Q54R55 Cluster: Putative uncharacterized protein; n=1; ...    55   8e-07
UniRef50_Q4N3V5 Cluster: Cathepsin C, putative; n=1; Theileria p...    55   8e-07
UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Re...    55   8e-07
UniRef50_A7SNM3 Cluster: Predicted protein; n=1; Nematostella ve...    55   8e-07
UniRef50_Q7R0G3 Cluster: GLP_29_33036_32140; n=1; Giardia lambli...    55   1e-06
UniRef50_Q26989 Cluster: Cysteine proteinase 5; n=1; Tritrichomo...    55   1e-06
UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8....    55   1e-06
UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, wh...    55   1e-06
UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA...    54   1e-06
UniRef50_Q26993 Cluster: Cysteine proteinase 9; n=1; Tritrichomo...    54   1e-06
UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep: Ca...    54   1e-06
UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Pl...    54   1e-06
UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep...    54   2e-06
UniRef50_Q7R5X2 Cluster: GLP_81_104117_102504; n=1; Giardia lamb...    54   2e-06
UniRef50_Q625X3 Cluster: Putative uncharacterized protein CBG011...    54   2e-06
UniRef50_Q54MB6 Cluster: Putative uncharacterized protein; n=1; ...    54   2e-06
UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteina...    54   2e-06
UniRef50_O48605 Cluster: Putative thiol protease; n=1; Hordeum v...    54   3e-06
UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ...    54   3e-06
UniRef50_Q5VUI9 Cluster: Tubulointerstitial nephritis antigen; n...    54   3e-06
UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein...    53   3e-06
UniRef50_Q7QWN9 Cluster: GLP_26_50243_51811; n=2; Giardia lambli...    53   3e-06
UniRef50_Q75JH0 Cluster: Similar to Dictyostelium discoideum (Sl...    53   3e-06
UniRef50_A2DPZ6 Cluster: Clan CA, family C1, cathepsin B-like cy...    53   3e-06
UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Gi...    53   3e-06
UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intest...    53   4e-06
UniRef50_A0BLR4 Cluster: Chromosome undetermined scaffold_115, w...    52   8e-06
UniRef50_A5KAP8 Cluster: Protease, putative; n=1; Plasmodium viv...    38   9e-06
UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA,...    52   1e-05
UniRef50_Q53K53 Cluster: Cysteine protease 1, putative; n=5; Ory...    52   1e-05
UniRef50_Q7QQ92 Cluster: GLP_243_18349_20043; n=1; Giardia lambl...    52   1e-05
UniRef50_Q54VR1 Cluster: Putative uncharacterized protein; n=1; ...    52   1e-05
UniRef50_Q8I8D7 Cluster: Cysteine protease 11; n=4; Entamoeba hi...    51   1e-05
UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;...    51   1e-05
UniRef50_Q3L7L7 Cluster: Sar s 1 allergen Yv5020C01; n=1; Sarcop...    51   1e-05
UniRef50_Q9UJW2 Cluster: Tubulointerstitial nephritis antigen; n...    51   1e-05
UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckei...    51   2e-05
UniRef50_A5K8Y0 Cluster: Preprocathepsin c, putative; n=1; Plasm...    51   2e-05
UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator ame...    51   2e-05
UniRef50_Q5YER6 Cluster: Cathepsin Z; n=1; Bigelowiella natans|R...    50   2e-05
UniRef50_O96166 Cluster: Cysteine protease, putative; n=1; Plasm...    50   2e-05
UniRef50_A2GCC2 Cluster: Clan CA, family C1, cathepsin B-like cy...    50   2e-05
UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, wh...    50   2e-05
UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, who...    50   2e-05
UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcop...    50   3e-05
UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; The...    50   3e-05
UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor; ...    50   4e-05
UniRef50_Q6E7B6 Cluster: Cathepsin L-like cysteine proteinase; n...    50   4e-05
UniRef50_O96164 Cluster: Cysteine protease, putative; n=1; Plasm...    50   4e-05
UniRef50_A0CHZ5 Cluster: Chromosome undetermined scaffold_186, w...    50   4e-05
UniRef50_Q00W63 Cluster: Cysteine proteinase; n=2; Ostreococcus|...    49   6e-05
UniRef50_Q8I8D6 Cluster: Cysteine protease 12; n=1; Entamoeba hi...    49   6e-05
UniRef50_Q7QVE9 Cluster: GLP_542_3431_1206; n=1; Giardia lamblia...    49   6e-05
UniRef50_Q5BTK3 Cluster: SJCHGC00358 protein; n=1; Schistosoma j...    49   6e-05
UniRef50_O96167 Cluster: Cysteine protease, putative; n=1; Plasm...    49   6e-05
UniRef50_Q61CG7 Cluster: Putative uncharacterized protein CBG129...    49   7e-05
UniRef50_Q1AMF2 Cluster: Cathepsin C2; n=1; Toxoplasma gondii|Re...    49   7e-05
UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Pl...    49   7e-05
UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;...    48   1e-04
UniRef50_Q9LFI9 Cluster: Putative uncharacterized protein F2K13_...    48   1e-04
UniRef50_Q7RMW5 Cluster: Papain family cysteine protease, putati...    48   1e-04
UniRef50_Q7JYA0 Cluster: RE20049p; n=2; Sophophora|Rep: RE20049p...    48   1e-04
UniRef50_Q3L7L2 Cluster: Sar s 1 allergen SMIPP-C Yv6008G08; n=2...    48   1e-04
UniRef50_A1SVF0 Cluster: Peptidase C1A, papain; n=1; Psychromona...    48   1e-04
UniRef50_A7QDM1 Cluster: Chromosome chr10 scaffold_81, whole gen...    48   1e-04
UniRef50_Q7Q9Y5 Cluster: ENSANGP00000012222; n=1; Anopheles gamb...    48   1e-04
UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1; ...    48   1e-04
UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3; Theil...    48   1e-04
UniRef50_Q7QQ83 Cluster: GLP_42_16392_14707; n=1; Giardia lambli...    48   2e-04
UniRef50_Q4UCF5 Cluster: Cysteine proteinase, tacP, putative; n=...    48   2e-04
UniRef50_Q4N5Z7 Cluster: Cysteine proteinase, putative; n=2; The...    48   2e-04
UniRef50_Q3L7L6 Cluster: Sar s 1 allergen Yv5032C08; n=1; Sarcop...    48   2e-04
UniRef50_Q18740 Cluster: Putative uncharacterized protein tag-32...    48   2e-04
UniRef50_Q650X9 Cluster: Putative cysteine proteinase; n=1; Oryz...    47   2e-04
UniRef50_A6N590 Cluster: Extracellular cysteine protease 8; n=10...    47   2e-04
UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115, w...    47   2e-04
UniRef50_Q9TY95 Cluster: Serine-repeat antigen protein precursor...    47   2e-04
UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin ...    47   3e-04
UniRef50_O96165 Cluster: Cysteine protease, putative; n=1; Plasm...    47   3e-04
UniRef50_O62484 Cluster: Putative uncharacterized protein; n=1; ...    47   3e-04
UniRef50_Q649T1 Cluster: Cathepsin C; n=1; uncultured archaeon G...    47   3e-04
UniRef50_Q3A8I3 Cluster: Putative serine protease; n=1; Pelobact...    46   4e-04
UniRef50_Q9U7F7 Cluster: Cysteine protease; n=2; Entamoeba histo...    46   4e-04
UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep: ...    46   4e-04
UniRef50_Q4XZE6 Cluster: Preprocathepsin c, putative; n=6; Plasm...    46   4e-04
UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-li...    46   4e-04
UniRef50_O65214 Cluster: Cysteine protease; n=2; Volvox carteri ...    46   5e-04
UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: ...    46   5e-04
UniRef50_Q5CM16 Cluster: P3ECSL-related; n=2; Cryptosporidium|Re...    46   5e-04
UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1; O...    46   7e-04
UniRef50_Q9XW98 Cluster: Putative uncharacterized protein; n=1; ...    46   7e-04
UniRef50_Q4UC83 Cluster: Cysteine proteinase, putative; n=2; The...    46   7e-04
UniRef50_Q01FL5 Cluster: Cysteine protease-5; n=3; Ostreococcus|...    45   9e-04
UniRef50_Q93512 Cluster: Putative uncharacterized protein; n=2; ...    45   9e-04
UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; ...    45   9e-04
UniRef50_Q4YCM9 Cluster: Cysteine protease, putative; n=5; Plasm...    45   9e-04
UniRef50_A5KBM0 Cluster: Serine-repeat antigen (SERA), putative;...    45   9e-04
UniRef50_UPI0000E4622C Cluster: PREDICTED: hypothetical protein;...    45   0.001
UniRef50_A4S004 Cluster: Predicted protein; n=2; Ostreococcus|Re...    45   0.001
UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n...    44   0.002
UniRef50_Q7RSR1 Cluster: Papain family cysteine protease, putati...    44   0.002
UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcop...    44   0.002
UniRef50_Q8QNJ8 Cluster: EsV-1-75; n=1; Ectocarpus siliculosus v...    44   0.002
UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2; ...    44   0.002
UniRef50_Q5DEC9 Cluster: SJCHGC06356 protein; n=1; Schistosoma j...    44   0.002
UniRef50_Q9NHY1 Cluster: Cysteine protease cp2; n=1; Theileria c...    44   0.003
UniRef50_UPI00006CB77B Cluster: Papain family cysteine protease ...    43   0.005
UniRef50_Q3L7L0 Cluster: Sar s 1 allergen SMIPP-C Yv5009F04; n=3...    43   0.005
UniRef50_A0DCA5 Cluster: Chromosome undetermined scaffold_45, wh...    43   0.005
UniRef50_O96163 Cluster: Cysteine protease, putative; n=5; Plasm...    42   0.006
UniRef50_A5KBM7 Cluster: Serine-repeat antigen 4; n=1; Plasmodiu...    42   0.006
UniRef50_A0LES1 Cluster: Peptidase C1A, papain precursor; n=2; S...    42   0.008
UniRef50_Q8I3C0 Cluster: Papain family cysteine protease, putati...    42   0.008
UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1; ...    42   0.008
UniRef50_Q26015 Cluster: Serine rich protein homologue; n=4; Pla...    42   0.008
UniRef50_A5KBM1 Cluster: Serine-repeat antigen; n=1; Plasmodium ...    42   0.008
UniRef50_Q8TQM7 Cluster: Putative uncharacterized protein; n=1; ...    42   0.008
UniRef50_UPI000155C322 Cluster: PREDICTED: similar to cathepsin ...    42   0.011
UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,...    42   0.011
UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ...    42   0.011
UniRef50_A5VDP2 Cluster: Peptidase C1A, papain; n=1; Sphingomona...    42   0.011
UniRef50_A2XHS0 Cluster: Putative uncharacterized protein; n=2; ...    42   0.011
UniRef50_A5KBM2 Cluster: Serine-repeat antigen; n=3; Plasmodium|...    42   0.011
UniRef50_UPI00006CBAC7 Cluster: Papain family cysteine protease ...    41   0.015

>UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue of
           Sarcophaga 26,29kDa proteinase; n=1; Nasonia
           vitripennis|Rep: PREDICTED: similar to homologue of
           Sarcophaga 26,29kDa proteinase - Nasonia vitripennis
          Length = 553

 Score =  268 bits (656), Expect = 7e-71
 Identities = 114/152 (75%), Positives = 132/152 (86%), Gaps = 1/152 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GGEDFR+YQWI+KHG LPTEE+YGGYLGQDGYCHI NVT I K+ G+VNV TNN +A+KL
Sbjct: 400 GGEDFRSYQWIIKHGGLPTEEEYGGYLGQDGYCHIKNVTQIAKLKGFVNVDTNNVDAMKL 459

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
           ALFKHGPISVAIDA+HKTFSFYSNGVY+EP C N  + LDHAVLAVGYG +NG  +WL+K
Sbjct: 460 ALFKHGPISVAIDASHKTFSFYSNGVYYEPACGNTENSLDHAVLAVGYGTINGKGFWLIK 519

Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYVL 462
           NSWSN WGNDGY+LM+ + NNCGV +APTY +
Sbjct: 520 NSWSNYWGNDGYILMAQKNNNCGVMTAPTYAI 551


>UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-PA
           - Drosophila melanogaster (Fruit fly)
          Length = 549

 Score =  252 bits (618), Expect = 3e-66
 Identities = 108/151 (71%), Positives = 128/151 (84%), Gaps = 1/151 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GGEDFR YQW+++ G +PTEE+YG YLGQDGYCH++NVT +  I G+VNVT+N+ NA KL
Sbjct: 397 GGEDFRVYQWMLQSGGVPTEEEYGPYLGQDGYCHVNNVTLVAPIKGFVNVTSNDPNAFKL 456

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
           AL KHGP+SVAIDA+ KTFSFYS+GVY+EP CKN VD LDHAVLAVGYG +NG  YWLVK
Sbjct: 457 ALLKHGPLSVAIDASPKTFSFYSHGVYYEPTCKNDVDGLDHAVLAVGYGSINGEDYWLVK 516

Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYV 459
           NSWS  WGNDGY+LMS ++NNCGV + PTYV
Sbjct: 517 NSWSTYWGNDGYILMSAKKNNCGVMTMPTYV 547


>UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326;
           n=2; Danio rerio|Rep: hypothetical protein LOC550326 -
           Danio rerio
          Length = 531

 Score =  225 bits (550), Expect = 5e-58
 Identities = 90/153 (58%), Positives = 124/153 (81%), Gaps = 1/153 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GGE++RA++WIMKHG + T E YG Y+G +G CH D  + + ++TG+ NVT+ +  ALK 
Sbjct: 378 GGEEWRAFEWIMKHGGISTAESYGAYMGMNGLCHYDKTSMVAQLTGYTNVTSGDILALKA 437

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
           A+FK GP++V+IDAAH++F+FYSNGVY+EP+CKN +++LDHAVLAVGYG++N   YWLVK
Sbjct: 438 AIFKFGPVAVSIDAAHRSFAFYSNGVYYEPECKNGINDLDHAVLAVGYGIMNNESYWLVK 497

Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           NSWS+ WGNDGY+LMSM++NNCGV +   Y  +
Sbjct: 498 NSWSSYWGNDGYILMSMKDNNCGVATDAIYATL 530


>UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome
           shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
           Chromosome 21 SCAF14577, whole genome shotgun sequence -
           Tetraodon nigroviridis (Green puffer)
          Length = 478

 Score =  220 bits (538), Expect = 1e-56
 Identities = 92/153 (60%), Positives = 123/153 (80%), Gaps = 1/153 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GGE++RAY+WIMKHG + + E YG YLG +G+CH+++     +I  + NVT+ +  ALKL
Sbjct: 325 GGEEWRAYEWIMKHGGIASAETYGPYLGMNGFCHVNSSELTAQIQSYTNVTSGDALALKL 384

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
           ALFK+GP++V+IDA+H++F FYSNGVY+EP C + V++LDHAVLAVGYG LNG  YWL+K
Sbjct: 385 ALFKNGPVAVSIDASHRSFVFYSNGVYYEPACGSTVEDLDHAVLAVGYGNLNGEPYWLIK 444

Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           NSWS  WGNDGY+LMSM++NNCGV +  TYV +
Sbjct: 445 NSWSTYWGNDGYILMSMKDNNCGVTTDATYVTL 477



 Score = 46.0 bits (104), Expect = 5e-04
 Identities = 19/31 (61%), Positives = 24/31 (77%), Gaps = 1/31 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDG 99
           GGE++RAY+WIMKH G+ + E YG YLG  G
Sbjct: 271 GGEEWRAYEWIMKHGGIASAETYGPYLGMTG 301


>UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin l;
           n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
           similar to cathepsin l - Strongylocentrotus purpuratus
          Length = 489

 Score =  197 bits (480), Expect = 1e-49
 Identities = 88/152 (57%), Positives = 111/152 (73%), Gaps = 2/152 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GGE++R Y+W+MK+G +P EE YG YLGQ+G CH D   A+  I  + NVT+ N+  LK 
Sbjct: 333 GGEEWRVYEWLMKNGGIPLEETYGPYLGQNGMCHYDKSKAVASIKKYYNVTSGNQKDLKK 392

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLV 363
           AL   GPI+V IDAA  +FSFYS G Y++  C N VD+LDHAVLAVGYG   +G  YWL+
Sbjct: 393 ALATKGPIAVGIDAAVPSFSFYSYGTYYDASCGNTVDDLDHAVLAVGYGTDSSGQDYWLI 452

Query: 364 KNSWSNMWGNDGYVLMSMRENNCGVQSAPTYV 459
           KNSWS  WGN+GYV +SM++NNCGV +A TYV
Sbjct: 453 KNSWSTHWGNNGYVAISMKDNNCGVATAATYV 484


>UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 513

 Score =  164 bits (399), Expect = 9e-40
 Identities = 69/144 (47%), Positives = 100/144 (69%), Gaps = 1/144 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG  +RA QWI+KHG L TEE YG YL Q+GYCH  N +   ++  ++++   N + LKL
Sbjct: 362 GGYPYRAMQWILKHGGLATEESYGRYLAQEGYCHFKNTSIGARLDKYMSIRQGNTSQLKL 421

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
           A+  +GP+S+ ++   KTF FY +G+Y++ +C +    LDHA LAVGYG   G  YW+VK
Sbjct: 422 AVAFYGPVSILVNTQPKTFKFYGSGIYYDTQCTH---ALDHAALAVGYGEEKGVSYWIVK 478

Query: 367 NSWSNMWGNDGYVLMSMRENNCGV 438
           NSWS MWG +GY+ ++M+++NCGV
Sbjct: 479 NSWSAMWGEEGYIKIAMKDDNCGV 502


>UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1;
           Dictyostelium discoideum AX4|Rep: Counting factor
           associated protein - Dictyostelium discoideum AX4
          Length = 531

 Score =  155 bits (375), Expect = 7e-37
 Identities = 78/153 (50%), Positives = 102/153 (66%), Gaps = 3/153 (1%)
 Frame = +1

Query: 7   GGGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAI-TKITGWVNVTTNNENAL 180
           GGG    A+Q++M+ G L TE +Y  YL Q+G C    VT     ITG+VNVT+ +E+AL
Sbjct: 374 GGGFASSAFQYVMEIGSLATESNYP-YLMQNGLCRDRTVTPSGVSITGYVNVTSGSESAL 432

Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360
           + A+   GP+++AIDA+   F +Y +GVY  P CKN +D+LDH VLA+GYG   G  Y+L
Sbjct: 433 QNAIATTGPVAIAIDASVDDFRYYMSGVYNNPACKNGLDDLDHEVLAIGYGTYQGQDYFL 492

Query: 361 VKNSWSNMWGNDGYVLMSMRENN-CGVQSAPTY 456
           VKNSWS  WG DGYV M+  +NN CGV S  TY
Sbjct: 493 VKNSWSTNWGMDGYVYMARNDNNLCGVSSQATY 525


>UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 392

 Score =  143 bits (347), Expect = 2e-33
 Identities = 64/150 (42%), Positives = 94/150 (62%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG  ++A+ W+ K G+ T + YG Y GQ+G+C   N+T   +IT +  V   N  ALK A
Sbjct: 248 GGWTWKAFSWVKKFGIATTKSYGHYRGQEGFCKTSNLTVGARITSYRRVKRFNPIALKKA 307

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           L  HGP +++I+A  K+  FYS+G+  +  C NK    DHAVL +GYG  NG  YWL+KN
Sbjct: 308 LSYHGPATISINANPKSLKFYSDGIMSDKHCSNKT---DHAVLLIGYGSDNGVPYWLIKN 364

Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTYV 459
           SWS+ WGN+G++   +++  CG++  P  V
Sbjct: 365 SWSHKWGNNGFI--KIKQGLCGIEKRPFVV 392


>UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin
           L - Misgurnus mizolepis (Mud loach)
          Length = 337

 Score =  142 bits (345), Expect = 3e-33
 Identities = 73/159 (45%), Positives = 104/159 (65%), Gaps = 7/159 (4%)
 Frame = +1

Query: 10  GGEDFRAYQWIM-KHGLPTEEDYGGYLGQDGY-CHIDNVTAITKITGWVNVTTNNENALK 183
           GG   +A+Q+I   +GL +EE Y  YLG D   CH D        TG+V++ +  E+AL 
Sbjct: 182 GGLMDQAFQYIKDNNGLDSEEAYP-YLGTDDQPCHYDPKYNAANDTGFVDIPSGKEHALM 240

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV----LNGHK 351
            A+   GP+SVAIDA H++F FY +G+YFE +C +  +ELDH VL VGYG     ++G K
Sbjct: 241 KAVASVGPVSVAIDAGHESFQFYQSGIYFEKECSS--EELDHGVLVVGYGFEGEDVDGKK 298

Query: 352 YWLVKNSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465
           YW+VKNSWS  WG+ GY+ M+  R+N+CG+ +A +Y L+
Sbjct: 299 YWIVKNSWSESWGDKGYIYMAKDRKNHCGIATAASYPLV 337


>UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsin L
           - Suberites domuncula (Sponge)
          Length = 324

 Score =  142 bits (344), Expect = 4e-33
 Identities = 68/155 (43%), Positives = 92/155 (59%), Gaps = 1/155 (0%)
 Frame = +1

Query: 4   RGGGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALK 183
           +GG  D      I  HG+ TE  Y  Y  +DGYC  +        T + ++   +E++L 
Sbjct: 173 KGGIMDDAFRYVISNHGVDTESSYP-YTAKDGYCRFNQNNVGATETSYRDIARGSESSLT 231

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363
            A  + GPISVAIDA+H++F FY NGVY+EP C +    LDH VL VGYG   G  Y++V
Sbjct: 232 QASAQIGPISVAIDASHRSFQFYKNGVYYEPSCSS--SRLDHGVLVVGYGTEGGQDYFIV 289

Query: 364 KNSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465
           KNSW   WG DGY++MS  R NNCG+ S  +Y ++
Sbjct: 290 KNSWGTRWGMDGYIMMSRNRRNNCGIASQASYPIV 324


>UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precursor;
           n=3; Metazoa|Rep: Digestive cysteine proteinase 2
           precursor - Homarus americanus (American lobster)
          Length = 323

 Score =  140 bits (339), Expect = 2e-32
 Identities = 70/154 (45%), Positives = 94/154 (61%), Gaps = 2/154 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIM-KHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG    A+ +I   +G+ TE  Y  Y  +DG C  D+ +     +G  N+ + +E  L+ 
Sbjct: 173 GGWMNDAFDYIKANNGIDTEAAYP-YEARDGSCRFDSNSVAATCSGHTNIASGSETGLQQ 231

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
           A+   GPISV IDAAH +F FYS+GVY+EP C      LDHAVLAVGYG   G  +WLVK
Sbjct: 232 AVRDIGPISVTIDAAHSSFQFYSSGVYYEPSCSPSY--LDHAVLAVGYGSEGGQDFWLVK 289

Query: 367 NSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465
           NSW+  WG+ GY+ MS  R NNCG+ +  +Y L+
Sbjct: 290 NSWATSWGDAGYIKMSRNRNNNCGIATVASYPLV 323


>UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11;
           Entamoeba|Rep: Cysteine proteinase 2 precursor -
           Entamoeba histolytica
          Length = 315

 Score =  136 bits (330), Expect = 2e-31
 Identities = 66/149 (44%), Positives = 87/149 (58%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG     Y +I++HG+  E DY  Y G D  C   NV +  KITG+  V  NNE  LK A
Sbjct: 163 GGLGSNVYDYIIEHGVAKESDYP-YTGSDSTCKT-NVKSFAKITGYTKVPRNNEAELKAA 220

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           L   G + V+IDA+   F  Y +G Y + KCKN    L+H V AVGYGV++G + W+V+N
Sbjct: 221 L-SQGLVDVSIDASSAKFQLYKSGAYTDTKCKNNYFALNHEVCAVGYGVVDGKECWIVRN 279

Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTY 456
           SW   WG+ GY+ M +  N CGV + P Y
Sbjct: 280 SWGTGWGDKGYINMVIEGNTCGVATDPLY 308


>UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 514

 Score =  134 bits (324), Expect = 1e-30
 Identities = 62/147 (42%), Positives = 86/147 (58%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG   +A  WI  HG+ + E YG YLGQ+G C I+ +     I  +  V   N  ALK++
Sbjct: 369 GGYYNKAMSWIYLHGIASAESYGPYLGQEGTCRIEGLRRAAAIDAFAFVPKYNNTALKIS 428

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           + + GP  V+I+    +  FYS G+Y +P+C      + H+VL VGYGV +G  YWLVKN
Sbjct: 429 VARFGPAVVSINENPLSLKFYSWGLYDDPECGRDTAAV-HSVLVVGYGVEDGEPYWLVKN 487

Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAP 450
           SWS  WG DGY+ ++ + N CGV   P
Sbjct: 488 SWSTTWGMDGYIKIAWKRNTCGVTRNP 514


>UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15)
           [Contains: Cathepsin L heavy chain; Cathepsin L light
           chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC
           3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain] - Sarcophaga peregrina (Flesh fly)
           (Boettcherisca peregrina)
          Length = 339

 Score =  132 bits (320), Expect = 3e-30
 Identities = 67/152 (44%), Positives = 93/152 (61%), Gaps = 3/152 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG    A+++I  +G + TE+ Y  Y G D  CH +  T     TG+V++   +E  +K 
Sbjct: 188 GGLMDNAFRYIKDNGGIDTEKSYP-YEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKK 246

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLV 363
           A+   GP+SVAIDA+H++F  YS GVY EP+C  +   LDH VL VGYG   +G  YWLV
Sbjct: 247 AVATMGPVSVAIDASHESFQLYSEGVYNEPECDEQ--NLDHGVLVVGYGTDESGMDYWLV 304

Query: 364 KNSWSNMWGNDGYVLMSMRENN-CGVQSAPTY 456
           KNSW   WG  GY+ M+  +NN CG+ +A +Y
Sbjct: 305 KNSWGTTWGEQGYIKMARNQNNQCGIATASSY 336


>UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome
           shotgun sequence; n=3; Tetraodontidae|Rep: Chromosome 12
           SCAF14996, whole genome shotgun sequence - Tetraodon
           nigroviridis (Green puffer)
          Length = 362

 Score =  132 bits (319), Expect = 5e-30
 Identities = 66/149 (44%), Positives = 94/149 (63%), Gaps = 6/149 (4%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG   +A+Q+I  +G L +E  Y      D  CH D        TG+V+V + +E AL  
Sbjct: 214 GGLMDQAFQYIKDNGGLDSEASYPYLATDDQPCHYDPSNNSANETGFVDVPSGSERALMK 273

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV----LNGHKY 354
           A+   GP+SVAIDA H++F FY +G+Y+E +C +  +ELDH VL VGYG     ++G K+
Sbjct: 274 AVASVGPVSVAIDAGHESFQFYQSGIYYEKECSS--EELDHGVLVVGYGFQGEDVDGKKF 331

Query: 355 WLVKNSWSNMWGNDGYVLMSM-RENNCGV 438
           W+VKNSWS  WGN GY+ M+  R+N+CG+
Sbjct: 332 WIVKNSWSENWGNKGYIYMAKDRKNHCGI 360


>UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42;
           Coelomata|Rep: Cathepsin L2 precursor - Homo sapiens
           (Human)
          Length = 334

 Score =  131 bits (317), Expect = 8e-30
 Identities = 69/155 (44%), Positives = 93/155 (60%), Gaps = 6/155 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG   RA+Q++ ++G L +EE Y  Y+  D  C      ++   TG+  V    E AL  
Sbjct: 180 GGFMARAFQYVKENGGLDSEESYP-YVAVDEICKYRPENSVANDTGFTVVAPGKEKALMK 238

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV----LNGHKY 354
           A+   GPISVA+DA H +F FY +G+YFEP C +K   LDH VL VGYG      N  KY
Sbjct: 239 AVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSK--NLDHGVLVVGYGFEGANSNNSKY 296

Query: 355 WLVKNSWSNMWGNDGYVLMSMRENN-CGVQSAPTY 456
           WLVKNSW   WG++GYV ++  +NN CG+ +A +Y
Sbjct: 297 WLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 331


>UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3;
           Bilateria|Rep: Cathepsin L-like cysteine proteinase -
           Longidorus elongatus
          Length = 358

 Score =  130 bits (314), Expect = 2e-29
 Identities = 68/155 (43%), Positives = 96/155 (61%), Gaps = 3/155 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG    A+Q++  + G+ TE  Y  Y G+DG C   +       TG+V++   NE  L+ 
Sbjct: 205 GGYMDGAFQYVETNKGIDTEASYP-YKGRDGRCRFKSEDVGATDTGFVDIPEGNETLLEA 263

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGY-GVLNGHKYWLV 363
           A+   GP+SVAIDAA   F FYS+GVY++  C    + LDH VLAVGY    +G +Y++V
Sbjct: 264 AIATVGPVSVAIDAASFKFQFYSHGVYYDRSC--SPEYLDHGVLAVGYNSTKDGKQYYIV 321

Query: 364 KNSWSNMWGNDGYVLMSMRE-NNCGVQSAPTYVLI 465
           KNSWS  WG+DGY+LMS R+ NNCG+ +  +Y  +
Sbjct: 322 KNSWSEDWGDDGYILMSRRKNNNCGIATMASYPFV 356


>UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep:
           Cathepsin K - Danio rerio (Zebrafish) (Brachydanio
           rerio)
          Length = 333

 Score =  129 bits (311), Expect = 4e-29
 Identities = 64/153 (41%), Positives = 91/153 (59%), Gaps = 3/153 (1%)
 Frame = +1

Query: 7   GGGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALK 183
           GGG    A++++  + G+ +EE Y  Y+G D  C  +         G+  +   NE AL 
Sbjct: 181 GGGYMTNAFRYVSNNQGIDSEESYP-YVGTDQQCAYNTSGVAASCRGYKEIPQGNERALT 239

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL-NGHKYWL 360
            A+   GP+SV IDA   TF +Y +GVY++P C NK ++++HAVLAVGYG    G KYW+
Sbjct: 240 AAVANVGPVSVGIDAMQSTFLYYKSGVYYDPNC-NK-EDVNHAVLAVGYGATPRGKKYWI 297

Query: 361 VKNSWSNMWGNDGYVLMSMRENN-CGVQSAPTY 456
           VKNSW   WG  GYVLM+   NN CG+ +  ++
Sbjct: 298 VKNSWGEEWGKKGYVLMARNRNNACGIANLASF 330


>UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1;
           Rhipicephalus appendiculatus|Rep: Midgut cysteine
           proteinase 2 - Rhipicephalus appendiculatus (Brown ear
           tick)
          Length = 564

 Score =  127 bits (307), Expect = 1e-28
 Identities = 72/154 (46%), Positives = 91/154 (59%), Gaps = 4/154 (2%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNV-TAITKITGWVNVTTNNENALKL 186
           GGEDFRAY++I  HGL ++EDYG Y+GQDG CH   V + I+ I  +VN+T  N + L  
Sbjct: 411 GGEDFRAYEYIADHGLASDEDYGAYIGQDGVCHDSKVNSTISSIKSYVNIT--NRDDLPT 468

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVL-AVGYGVLNGHKYWLV 363
           AL   GP+SV+IDAA ++FSFY       P      D LDH+VL       L G  YW V
Sbjct: 469 ALANVGPVSVSIDAALRSFSFYPTVSSMIPTAAMDTDSLDHSVLRQSATRTLQGEPYWGV 528

Query: 364 KNSWSNMWGN-DGYVLMSMRENNC-GVQSAPTYV 459
           KNSW  + G   GYVL+S +     GV +  TYV
Sbjct: 529 KNSWVYLLGEMMGYVLISPKGTTTGGVATQGTYV 562


>UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC06231 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 372

 Score =  127 bits (306), Expect = 2e-28
 Identities = 62/158 (39%), Positives = 98/158 (62%), Gaps = 6/158 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDG----YCHIDNVTAITKITGWVNVTTNNEN 174
           GG    A+Q++  + G+ +E  Y  Y+  DG     C  ++   + ++TG++N+   +E 
Sbjct: 216 GGLMDLAFQYVRDNKGIDSEISYP-YISGDGDENVRCLFNSTNIMAQVTGYINIHEGDER 274

Query: 175 ALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKY 354
           AL  A+   GP+SVAI+A   +FS Y +G+Y +P+C +  ++LDH VL VGYG+ +G  Y
Sbjct: 275 ALMNAVATIGPVSVAINAGLPSFSMYKSGIYSDPECASASEDLDHGVLLVGYGIEDGKPY 334

Query: 355 WLVKNSWSNMWGNDGYV-LMSMRENNCGVQSAPTYVLI 465
           WL+KNSW   WG+ GYV ++   +N CGV SA +Y L+
Sbjct: 335 WLIKNSWGEDWGDKGYVKILKDSKNMCGVASAASYPLV 372


>UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra
           erinaceieuropaei|Rep: Cysteine proteinase - Spirometra
           erinaceieuropaei (Tapeworm)
          Length = 336

 Score =  126 bits (303), Expect = 4e-28
 Identities = 62/150 (41%), Positives = 84/150 (56%), Gaps = 1/150 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG   +A+Q+  ++G+  E DY  Y  +DG C       +  +TG+  +   +E  L+ A
Sbjct: 187 GGLMPQAFQYAQRYGVEAEVDYR-YTERDGVCRYRQDLVVANVTGYAELPEGDEGGLQRA 245

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           +   GPISV IDAA   F  YS+GV+    C      +DH VL VGYG  NG  YWLVKN
Sbjct: 246 VATIGPISVGIDAADPGFMSYSHGVFVSKTCSPYA--IDHGVLVVGYGAENGDAYWLVKN 303

Query: 370 SWSNMWGNDGYVLMSMRENN-CGVQSAPTY 456
           SW + WG DGY+ M+   NN CG+ S  +Y
Sbjct: 304 SWGSSWGEDGYLKMARNRNNMCGIASMASY 333


>UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine
           protease; n=1; Maconellicoccus hirsutus|Rep: Putative
           cathepsin L-like cysteine protease - Maconellicoccus
           hirsutus (hibiscus mealybug)
          Length = 339

 Score =  126 bits (303), Expect = 4e-28
 Identities = 55/153 (35%), Positives = 87/153 (56%), Gaps = 1/153 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    +Y +I K G   ++    Y   +  C       +T+++G + +    E  L  +
Sbjct: 187 GGNQHHSYFYIYKQGGVDDDVSYPYKDAEEPCAFKKENVVTRVSGEITLPDGYETNLHES 246

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           +  +GP++  IDA H++F  Y  G+YFEP C NK DE++H VL VGYG  NG  YW+VKN
Sbjct: 247 VAVYGPVAATIDATHQSFHSYKGGIYFEPDCGNKKDEVNHGVLVVGYGSENGQDYWIVKN 306

Query: 370 SWSNMWGNDGYVLMSMRENN-CGVQSAPTYVLI 465
           S+   WG DGY+ M+  +NN CG+ ++ +  ++
Sbjct: 307 SYGTDWGEDGYIRMARNKNNHCGIATSASVPML 339


>UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1;
           Phytophthora infestans|Rep: Cathepsin-like cysteine
           protease - Phytophthora infestans (Potato late blight
           fungus)
          Length = 376

 Score =  125 bits (302), Expect = 5e-28
 Identities = 66/156 (42%), Positives = 87/156 (55%), Gaps = 4/156 (2%)
 Frame = +1

Query: 10  GGEDFRAYQWIM---KHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENAL 180
           GGE    Y+ I+   K  +  EE Y       G C+  +  AI   T + NVT+ +E AL
Sbjct: 199 GGEMSEGYEEIITNHKGKIDREEVYRYTAESKGVCNAKDDKAIGHFTSYANVTSGDEAAL 258

Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360
           + A+   G  +VAIDA+  TF  Y +GVY  P C N  D LDH V A GYGV     YWL
Sbjct: 259 QAAIATKGVQAVAIDASSFTFQLYRHGVYSWPLCGNAPDALDHGVAAAGYGVYKKKDYWL 318

Query: 361 VKNSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465
           VKNSW N WG  GY++MS  ++N CG+ +  TY ++
Sbjct: 319 VKNSWGNSWGMKGYIMMSRNKDNQCGIATDATYPIM 354


>UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2;
           Acanthamoeba|Rep: Cysteine proteinase - Acanthamoeba
           healyi
          Length = 330

 Score =  125 bits (302), Expect = 5e-28
 Identities = 65/152 (42%), Positives = 92/152 (60%), Gaps = 2/152 (1%)
 Frame = +1

Query: 7   GGGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALK 183
           GG  D+ A+++I+ + G+ TE  Y         C  +       +TG+ +VT+ +ENAL 
Sbjct: 180 GGLMDY-AFEYIINNRGIDTEASYPYQTAGPLTCQYNAANKGGSLTGYTDVTSGDENALL 238

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363
            A  K  P+SVAIDA+H +F FYS GVY+E  C +   +LDH VL VG+G  NG  +W V
Sbjct: 239 NAAVKE-PVSVAIDASHNSFQFYSGGVYYESACSST--QLDHGVLVVGWGSENGQDFWWV 295

Query: 364 KNSWSNMWGNDGYVLMSMRE-NNCGVQSAPTY 456
           KNSW   WG +GY+ MS  + NNCG+ +A +Y
Sbjct: 296 KNSWGASWGLNGYIKMSRNQNNNCGIATAASY 327


>UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep:
           Cysteine proteinase - Entamoeba histolytica
          Length = 320

 Score =  124 bits (300), Expect = 9e-28
 Identities = 65/150 (43%), Positives = 88/150 (58%), Gaps = 1/150 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG     + +  ++G+  E+DY  Y   +G C  D    I K  G V V   NE AL  A
Sbjct: 166 GGSILYVFAYTKRNGVIEEKDYP-YTATNGTCQYDADKIIVKNAGQVIVEQRNEVALVEA 224

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           + + GP++VAIDA   +F  Y +GVY EPKCK  +  L+HAV AVGYG  +G  Y++V+N
Sbjct: 225 IAE-GPVAVAIDAGQASFQLYKSGVYDEPKCKKVI--LNHAVCAVGYGSQDGQDYYIVRN 281

Query: 370 SWSNMWGNDGYVLMSMRENN-CGVQSAPTY 456
           SW   WG DGY+LMS  +NN CG+ +   Y
Sbjct: 282 SWGTSWGMDGYILMSRNKNNQCGIANDAIY 311


>UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15)
           (Major excreted protein) (MEP) [Contains: Cathepsin L
           heavy chain; Cathepsin L light chain]; n=19;
           Eukaryota|Rep: Cathepsin L precursor (EC 3.4.22.15)
           (Major excreted protein) (MEP) [Contains: Cathepsin L
           heavy chain; Cathepsin L light chain] - Homo sapiens
           (Human)
          Length = 333

 Score =  124 bits (300), Expect = 9e-28
 Identities = 68/156 (43%), Positives = 95/156 (60%), Gaps = 6/156 (3%)
 Frame = +1

Query: 7   GGGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALK 183
           GG  D+ A+Q++  +G L +EE Y  Y   +  C  +   ++   TG+V++    E AL 
Sbjct: 180 GGLMDY-AFQYVQDNGGLDSEESYP-YEATEESCKYNPKYSVANDTGFVDIP-KQEKALM 236

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV----LNGHK 351
            A+   GPISVAIDA H++F FY  G+YFEP C +  +++DH VL VGYG      + +K
Sbjct: 237 KAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSS--EDMDHGVLVVGYGFESTESDNNK 294

Query: 352 YWLVKNSWSNMWGNDGYVLMSM-RENNCGVQSAPTY 456
           YWLVKNSW   WG  GYV M+  R N+CG+ SA +Y
Sbjct: 295 YWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASY 330


>UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus
           tropicalis|Rep: LOC594890 protein - Xenopus tropicalis
           (Western clawed frog) (Silurana tropicalis)
          Length = 355

 Score =  123 bits (297), Expect = 2e-27
 Identities = 59/144 (40%), Positives = 87/144 (60%), Gaps = 1/144 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    ++++I+ +G+  E +Y  Y G+DG C    V   +  T +  +   +E  LK  
Sbjct: 206 GGWVVSSFRYIIDNGIELESNYP-YQGKDGKCSYTPVKKASVCTSYRQLPYGDEATLKQV 264

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           +   GP+SVAIDA+ KTF  Y NGVY++P C +     DH+VL VGYG  +G +YWLVKN
Sbjct: 265 VGLMGPVSVAIDASRKTFRMYKNGVYYDPNCSSSTP--DHSVLVVGYGAEDGVEYWLVKN 322

Query: 370 SWSNMWGNDGYVLMSM-RENNCGV 438
           SW   +G++GY+ M+    NNCG+
Sbjct: 323 SWGTSFGDEGYIKMARNHHNNCGI 346


>UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2;
           Dictyostelium discoideum|Rep: Cysteine proteinase 2
           precursor - Dictyostelium discoideum (Slime mold)
          Length = 376

 Score = 88.6 bits (210), Expect(2) = 3e-27
 Identities = 49/111 (44%), Positives = 70/111 (63%), Gaps = 2/111 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAI-TKITGWVNVTTNNENALK 183
           GG    A+ +I+K+ G+ TE  Y  Y  + G   + N + I   I G+VN+T  +E +L+
Sbjct: 189 GGLMNNAFDYIIKNKGIDTESSYP-YTAETGSTCLFNKSDIGATIKGYVNITAGSEISLE 247

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV 336
               +HGP+SVAIDA+H +F  Y++G+Y+EPKC     ELDH VL VGYGV
Sbjct: 248 NGA-QHGPVSVAIDASHNSFQLYTSGIYYEPKC--SPTELDHGVLVVGYGV 295



 Score = 55.6 bits (128), Expect(2) = 3e-27
 Identities = 22/40 (55%), Positives = 28/40 (70%), Gaps = 1/40 (2%)
 Frame = +1

Query: 346 HKYWLVKNSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVL 462
           + YW+VKNSW   WG  GY+LMS  R+NNCG+ S  +Y L
Sbjct: 336 NNYWIVKNSWGTSWGIKGYILMSKDRKNNCGIASVSSYPL 375


>UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1];
           n=11; Eutheria|Rep: Testin-2 precursor [Contains:
           Testin-1] - Mus musculus (Mouse)
          Length = 333

 Score =  122 bits (294), Expect = 5e-27
 Identities = 65/158 (41%), Positives = 92/158 (58%), Gaps = 6/158 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG    A+Q++  +G L TEE Y  Y+G    C      +   +  +V +    E AL  
Sbjct: 180 GGFMQNAFQYVKDNGGLATEESYP-YIGPGRKCRYHAENSAANVRDFVQIP-GREEALMK 237

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV----LNGHKY 354
           A+ K GPISVA+DA+H +F FY +G+Y+EP+CK     L+HAVL VGYG      +G+ Y
Sbjct: 238 AVAKVGPISVAVDASHDSFQFYDSGIYYEPQCKRV--HLNHAVLVVGYGFEGEESDGNSY 295

Query: 355 WLVKNSWSNMWGNDGYVLMSMRENN-CGVQSAPTYVLI 465
           WLVKNSW   WG  GY+ ++   NN CG+ +  TY ++
Sbjct: 296 WLVKNSWGEEWGMKGYIKIAKDWNNHCGIATLATYPIV 333


>UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2;
           Platyhelminthes|Rep: Cathepsin L-like proteinase -
           Echinococcus multilocularis
          Length = 338

 Score =  121 bits (292), Expect = 9e-27
 Identities = 61/154 (39%), Positives = 89/154 (57%), Gaps = 2/154 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG+   A+++ M++G  +E DY  Y   DG C  ++   +TK++ +V V    E+ LKL+
Sbjct: 188 GGDMNDAFRYWMRNGAESESDYP-YTAMDGKCKFNSSKVVTKVSKFVKVPKKREDQLKLS 246

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLVK 366
           + + GP+SVAIDA    F  Y  G+Y +  C  +   LDHAVL VGY       KYW+VK
Sbjct: 247 VAQVGPVSVAIDATSSGFMLYKKGIYQDNTCSQQY--LDHAVLVVGYDADKTRQKYWIVK 304

Query: 367 NSWSNMWGNDGYVLMSMRENN-CGVQSAPTYVLI 465
           NSW   WG  GY+ M+  + N CG+ +  +Y LI
Sbjct: 305 NSWGEDWGQRGYIWMARDKGNMCGIATMASYPLI 338


>UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3;
           Curculionidae|Rep: Cysteine proteinase - Hypera postica
           (alfalfa weevil)
          Length = 324

 Score =  121 bits (291), Expect = 1e-26
 Identities = 57/152 (37%), Positives = 89/152 (58%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG     ++++MK GL +EE Y  Y G+DG C  +  + +TK++ + ++   +E+AL  A
Sbjct: 177 GGSLDDNFKYVMKDGLQSEESYT-YKGEDGACKYNVASVVTKVSKYTSIPAEDEDALLEA 235

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           +   GP+SV +DA++   S Y +G+Y +  C      L+HA+LAVGYG  NG  YW++KN
Sbjct: 236 VATVGPVSVGMDASY--LSSYDSGIYEDQDCSPA--GLNHAILAVGYGTENGKDYWIIKN 291

Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           SW   WG  GY  ++  +N CG+     Y  I
Sbjct: 292 SWGASWGEQGYFRLARGKNQCGISEDTVYPTI 323


>UniRef50_UPI000155637A Cluster: PREDICTED: similar to
           ENSANGP00000013730, partial; n=1; Ornithorhynchus
           anatinus|Rep: PREDICTED: similar to ENSANGP00000013730,
           partial - Ornithorhynchus anatinus
          Length = 229

 Score =  120 bits (289), Expect = 2e-26
 Identities = 43/77 (55%), Positives = 68/77 (88%)
 Frame = +1

Query: 235 KTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYVLMS 414
           ++F+FY+NG+Y+EP+C++K+++L+HAVL VGYGVL G  +WL+KNSWS +WGN GY+L++
Sbjct: 152 RSFAFYANGIYYEPQCRHKLEQLNHAVLLVGYGVLQGQAFWLLKNSWSPLWGNSGYMLLA 211

Query: 415 MRENNCGVQSAPTYVLI 465
           M++N+CGV +A TY ++
Sbjct: 212 MKDNDCGVTTAATYPIL 228


>UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep:
           Cysteine protease - Saprolegnia parasitica
          Length = 523

 Score =  120 bits (289), Expect = 2e-26
 Identities = 66/154 (42%), Positives = 87/154 (56%), Gaps = 5/154 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG    A++W+  H GL  EEDY  Y  ++G C +     +TK+T + +V  N+E ALK 
Sbjct: 181 GGLMDNAFKWVKTHKGLCKEEDYP-YHAKEGTCALKKCKPVTKVTAFHDVPANDEQALKA 239

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
           A+ K  P+SVAI+A    F FY +GV F+  C  K   LDH VL VGYG   G KYW VK
Sbjct: 240 AVAKQ-PVSVAIEADQPEFQFYKSGV-FDKSCGTK---LDHGVLVVGYGEEGGKKYWKVK 294

Query: 367 NSWSNMWGNDGYVLMSM----RENNCGVQSAPTY 456
           NSW   WG+ GY+ ++         CGV   P+Y
Sbjct: 295 NSWGADWGDKGYIKLAREFGPETGQCGVAMVPSY 328


>UniRef50_P25774 Cluster: Cathepsin S precursor; n=78;
           Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens
           (Human)
          Length = 331

 Score =  120 bits (288), Expect = 3e-26
 Identities = 59/151 (39%), Positives = 88/151 (58%), Gaps = 2/151 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG    A+Q+I+ + G+ ++  Y  Y   D  C  D+       + +  +    E+ LK 
Sbjct: 182 GGFMTTAFQYIIDNKGIDSDASYP-YKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKE 240

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
           A+   GP+SV +DA H +F  Y +GVY+EP C   V+   H VL VGYG LNG +YWLVK
Sbjct: 241 AVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVN---HGVLVVGYGDLNGKEYWLVK 297

Query: 367 NSWSNMWGNDGYVLMSMRE-NNCGVQSAPTY 456
           NSW + +G +GY+ M+  + N+CG+ S P+Y
Sbjct: 298 NSWGHNFGEEGYIRMARNKGNHCGIASFPSY 328


>UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;
           n=17; Magnoliophyta|Rep: Thiol protease aleurain-like
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 358

 Score =  120 bits (288), Expect = 3e-26
 Identities = 62/153 (40%), Positives = 89/153 (58%), Gaps = 1/153 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG   +A+++I  +G L TEE Y  Y G+DG C         ++   VN+T   E+ LK 
Sbjct: 207 GGLPSQAFEYIKYNGGLDTEEAYP-YTGKDGGCKFSAKNIGVQVRDSVNITLGAEDELKH 265

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
           A+    P+SVA +  H+ F FY  GV+    C N   +++HAVLAVGYGV +   YWL+K
Sbjct: 266 AVGLVRPVSVAFEVVHE-FRFYKKGVFTSNTCGNTPMDVNHAVLAVGYGVEDDVPYWLIK 324

Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           NSW   WG++GY  M M +N CGV +  +Y ++
Sbjct: 325 NSWGGEWGDNGYFKMEMGKNMCGVATCSSYPVV 357


>UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase
           precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine
           proteinase precursor - Heterodera glycines (Soybean cyst
           nematode worm)
          Length = 353

 Score =  118 bits (284), Expect = 8e-26
 Identities = 61/155 (39%), Positives = 91/155 (58%), Gaps = 3/155 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIM-KHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG    A++++   +GL TEE Y  Y    G C   N T    +  + ++   +E  LK+
Sbjct: 202 GGLMDSAFEYVRDNNGLDTEESYP-YEAVTGKCQFKNETVGGTVVSFKDLKKGDEEQLKI 260

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGH-KYWLV 363
           A+   GPISVA+DA++ +F FY  GVY+E  C N+   LDH VL VGYG    H  YWLV
Sbjct: 261 AVATIGPISVALDASNLSFQFYKTGVYYERWCSNRY--LDHGVLLVGYGTDETHGDYWLV 318

Query: 364 KNSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465
           KNSW   WG +GY+ ++  ++N+CG+ +  +Y ++
Sbjct: 319 KNSWGPHWGENGYIRIARNKQNHCGIATMASYPVV 353


>UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor;
           n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase
           precursor - Diabrotica virgifera virgifera (western corn
           rootworm)
          Length = 326

 Score =  117 bits (281), Expect = 2e-25
 Identities = 62/154 (40%), Positives = 86/154 (55%), Gaps = 2/154 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWI-MKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG   +A ++I    G+ +E DY  Y G D  C  D+     KI+ +  +  N+E+ LK 
Sbjct: 175 GGYMDKALEYIETAGGIMSENDYP-YEGIDDKCRFDSSKVAAKISNFTYIKKNDEDDLKN 233

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
           A+   GPISVAIDA+   F  Y +G+  +  C +  + L+H VL VGYG      YW+VK
Sbjct: 234 AVIAKGPISVAIDASFN-FQLYDSGILDDSSCYSDFNSLNHGVLVVGYGTEKEQDYWIVK 292

Query: 367 NSWSNMWGNDGYVLMSMRENN-CGVQSAPTYVLI 465
           NSW   WG DGY+ MS  +NN CG+ +  TY  I
Sbjct: 293 NSWGADWGMDGYIWMSRNKNNQCGIATDATYPTI 326


>UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Rep:
           Cathepsin R precursor - Mus musculus (Mouse)
          Length = 334

 Score =  117 bits (281), Expect = 2e-25
 Identities = 61/158 (38%), Positives = 94/158 (59%), Gaps = 6/158 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG+ + A+Q+++ +G L +E  Y  Y G+DG C  +   +  +ITG+V++   +E+ L  
Sbjct: 181 GGDTYNAFQYVLHNGGLESEATYP-YEGKDGPCRYNPKNSKAEITGFVSLP-QSEDILMA 238

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV----LNGHKY 354
           A+   GPI+  IDA+H++F  Y  G+Y EP C +  D + H VL VGYG      +G+ Y
Sbjct: 239 AVATIGPITAGIDASHESFKNYKGGIYHEPNCSS--DTVTHGVLVVGYGFKGIETDGNHY 296

Query: 355 WLVKNSWSNMWGNDGYVLMSMRENN-CGVQSAPTYVLI 465
           WL+KNSW   WG  GY+ ++  +NN CG+ S   Y  I
Sbjct: 297 WLIKNSWGKRWGIRGYMKLAKDKNNHCGIASYAHYPTI 334


>UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase;
           n=21; Bilateria|Rep: Cathepsin L-like cysteine
           proteinase - Globodera pallida
          Length = 379

 Score =  116 bits (280), Expect = 2e-25
 Identities = 64/156 (41%), Positives = 88/156 (56%), Gaps = 4/156 (2%)
 Frame = +1

Query: 10  GGEDFRAYQWIM-KHGLPTEEDYGGYLGQDGY-CHIDNVTAITKITGWVNVTTNNENALK 183
           GG    A+Q+I   +G+  E DY  Y  + G  C           TG+ ++   +E  LK
Sbjct: 227 GGIMDNAFQYIKDNNGVDKELDYP-YKAKTGKKCLFKRNDVGATDTGFFDIAEGDEEKLK 285

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWL 360
           +A+   GP SVAIDA H++F  Y++GVYFE +C    + LDH VL VGYG       YW+
Sbjct: 286 IAVATQGPASVAIDAGHRSFQLYTHGVYFEKEC--SPENLDHGVLVVGYGTDAQQGDYWI 343

Query: 361 VKNSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465
           VKNSW   WG  GY+ M+  R+NNCG+ S  +Y L+
Sbjct: 344 VKNSWGAHWGEQGYIRMARNRKNNCGIASHASYPLV 379


>UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes
           vastus|Rep: Cathepsin L - Aphrocallistes vastus
          Length = 329

 Score =  116 bits (279), Expect = 3e-25
 Identities = 57/151 (37%), Positives = 88/151 (58%)
 Frame = +1

Query: 4   RGGGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALK 183
           +GG  D+ A+++   +    E DY  Y  ++G C  +    +TK + + ++ + N +ALK
Sbjct: 180 QGGLMDY-AFKYWETNLAEKESDYT-YTAKNGKCKYNAQLGVTKDSSFTDIPSENCDALK 237

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363
            A+   GPI+VA+DA+H +F  Y +G+Y    C     +LDH VL VGYG  NG  YWL+
Sbjct: 238 EAVANKGPIAVAMDASHTSFQMYHSGIYTPFLCSKT--KLDHGVLVVGYGTDNGVDYWLI 295

Query: 364 KNSWSNMWGNDGYVLMSMRENNCGVQSAPTY 456
           KNSW   WG DGY  + M+ + CG+ +  +Y
Sbjct: 296 KNSWGMAWGMDGYFKIEMKSDKCGICTQASY 326


>UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep:
           Cathepsin - Geodia cydonium (Sponge)
          Length = 322

 Score =  116 bits (278), Expect = 4e-25
 Identities = 58/154 (37%), Positives = 87/154 (56%), Gaps = 2/154 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG    A+++++K+G + TE  Y  Y+ +D  CH  +    +  + +V++ + +E  L++
Sbjct: 169 GGLPDDAFKYVIKNGGIDTEASYP-YVARDEKCHYSSANIGSTCSSYVDIESKSEAQLQV 227

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
           A    GPI V IDA+H  F  Y  GVY    C      LDH VL VGYGV     YW+VK
Sbjct: 228 ASATVGPIPVGIDASHLGFQLYDGGVYHSDLCSQT--RLDHGVLVVGYGVYKEKDYWMVK 285

Query: 367 NSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465
           NSW   WG  G ++MS  R+NNCG+ +  +Y ++
Sbjct: 286 NSWGTNWGISGDMMMSRNRDNNCGIATMASYPVV 319


>UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine
           proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like
           midgut cysteine proteinase - Tenebrio molitor (Yellow
           mealworm)
          Length = 330

 Score =  116 bits (278), Expect = 4e-25
 Identities = 56/150 (37%), Positives = 89/150 (59%), Gaps = 1/150 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    A+ +I  +G+ +E  Y  Y  Q  YC  D+  ++T ++G+ ++ + +EN+L  A
Sbjct: 182 GGWMDSAFSYIHDYGIMSESAYP-YEAQGDYCRFDSSQSVTTLSGYYDLPSGDENSLADA 240

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           + + GP++VAIDA  +   FYS G++++  C     +L+H VL VGYG  NG  YW++KN
Sbjct: 241 VGQAGPVAVAIDATDE-LQFYSGGLFYDQTCNQS--DLNHGVLVVGYGSDNGQDYWILKN 297

Query: 370 SWSNMWGNDGYVLMSMR-ENNCGVQSAPTY 456
           SW + WG  GY        NNCG+ +A +Y
Sbjct: 298 SWGSGWGESGYWRQVRNYGNNCGIATAASY 327


>UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18;
           Magnoliophyta|Rep: Thiol protease aleurain precursor -
           Arabidopsis thaliana (Mouse-ear cress)
          Length = 358

 Score =  115 bits (277), Expect = 6e-25
 Identities = 58/153 (37%), Positives = 89/153 (58%), Gaps = 1/153 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG   +A+++I  +G L TE+ Y  Y G+D  C         ++   VN+T   E+ LK 
Sbjct: 207 GGLPSQAFEYIKSNGGLDTEKAYP-YTGKDETCKFSAENVGVQVLNSVNITLGAEDELKH 265

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
           A+    P+S+A +  H +F  Y +GVY +  C +   +++HAVLAVGYGV +G  YWL+K
Sbjct: 266 AVGLVRPVSIAFEVIH-SFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIK 324

Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           NSW   WG+ GY  M M +N CG+ +  +Y ++
Sbjct: 325 NSWGADWGDKGYFKMEMGKNMCGIATCASYPVV 357


>UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin
           L-like protease; n=1; Nasonia vitripennis|Rep:
           PREDICTED: similar to cathepsin L-like protease -
           Nasonia vitripennis
          Length = 353

 Score =  114 bits (275), Expect = 1e-24
 Identities = 63/159 (39%), Positives = 90/159 (56%), Gaps = 6/159 (3%)
 Frame = +1

Query: 7   GGGEDFRAYQWIM-KHGLPTEEDYGGYLGQDGYCHIDNVTAITKI--TGWVNVTTNNENA 177
           GGG    ++Q+++ + GL  E +Y  Y G+   C  +      +     ++ V   +E  
Sbjct: 198 GGGSAALSFQFVVDQKGLEPEANYS-YEGRTKECPYNTSDDEDEELDASFIYVNGGDEAT 256

Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN--GHK 351
           LK+A+   GP S AID +H TF FYS GVY++P+C    D+LDHAVL VGYG  N     
Sbjct: 257 LKVAVATVGPFSAAIDGSHDTFRFYSEGVYYQPECNE--DDLDHAVLIVGYGTDNRTDQD 314

Query: 352 YWLVKNSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465
           +WLVKNSW   WG  GY  ++  R N+CG+ +A  Y +I
Sbjct: 315 FWLVKNSWGETWGEGGYFKVARNRRNHCGIAAAAVYPVI 353


>UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3;
           Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber
           officinale (Ginger)
          Length = 475

 Score =  114 bits (275), Expect = 1e-24
 Identities = 59/154 (38%), Positives = 85/154 (55%), Gaps = 5/154 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTA-ITKITGWVNVTTNNENALKL 186
           GG  +RA+Q+I+ +G    E++  Y G +G C+     A +  I  + NV +N+E +L+ 
Sbjct: 207 GGWPYRAFQYIINNGGVNSEEHYPYTGTNGTCNTTKENAHVVSIDSYRNVPSNDEKSLQK 266

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
           A     PISV IDA+ + F  Y +G+ F   C      L+H V  VGYG  NG+ YW+VK
Sbjct: 267 AAANQ-PISVGIDASGRNFQLYHSGI-FTGSCNTS---LNHGVTVVGYGTENGNDYWIVK 321

Query: 367 NSWSNMWGNDGYVLMSMR----ENNCGVQSAPTY 456
           NSW   WGN GY+LM          CG+  +P+Y
Sbjct: 322 NSWGENWGNSGYILMERNIAESSGKCGIAISPSY 355


>UniRef50_Q24HI2 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 356

 Score =  114 bits (275), Expect = 1e-24
 Identities = 58/153 (37%), Positives = 84/153 (54%), Gaps = 1/153 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGW-VNVTTNNENALKL 186
           GG   +A+++I  +G  + E+   Y+ QD  C     T   ++ G   N+T  +E+ LK 
Sbjct: 194 GGLPSQAFEYIKYNGGISYENSYYYIAQDQECQFSPETVGARVRGGSFNITQGDEDQLKQ 253

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
           A+   GP+S+A       F  Y +GVY  P C +    ++HAVLAVGYG  NG  YW VK
Sbjct: 254 AVGTVGPVSIAFQVMGD-FKLYKSGVYSNPDCSSSPQTVNHAVLAVGYGSENGVDYWYVK 312

Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           NSWS  WG++GY  +    N CGV +  +Y L+
Sbjct: 313 NSWSEFWGDEGYFKIQRGVNMCGVATCASYPLL 345


>UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16)
           [Contains: Cathepsin H mini chain; Cathepsin H heavy
           chain; Cathepsin H light chain]; n=37; Eukaryota|Rep:
           Cathepsin H precursor (EC 3.4.22.16) [Contains:
           Cathepsin H mini chain; Cathepsin H heavy chain;
           Cathepsin H light chain] - Homo sapiens (Human)
          Length = 335

 Score =  114 bits (274), Expect = 1e-24
 Identities = 56/150 (37%), Positives = 85/150 (56%), Gaps = 1/150 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG   +A+++I+ + G+  E+ Y  Y G+DGYC      AI  +    N+T  +E A+  
Sbjct: 183 GGLPSQAFEYILYNKGIMGEDTYP-YQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVE 241

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
           A+  + P+S A +   + F  Y  G+Y    C    D+++HAVLAVGYG  NG  YW+VK
Sbjct: 242 AVALYNPVSFAFEVT-QDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVK 300

Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTY 456
           NSW   WG +GY L+   +N CG+ +  +Y
Sbjct: 301 NSWGPQWGMNGYFLIERGKNMCGLAACASY 330


>UniRef50_Q4TI44 Cluster: Chromosome undetermined SCAF2412, whole
           genome shotgun sequence; n=1; Tetraodon
           nigroviridis|Rep: Chromosome undetermined SCAF2412,
           whole genome shotgun sequence - Tetraodon nigroviridis
           (Green puffer)
          Length = 123

 Score =  113 bits (272), Expect = 2e-24
 Identities = 50/105 (47%), Positives = 70/105 (66%), Gaps = 2/105 (1%)
 Frame = +1

Query: 157 TTNNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV 336
           +  NE  L  ALFKHGP+++ IDA   TF  YS GVY++P C    ++++HAVL VGYGV
Sbjct: 21  SAGNEKLLAYALFKHGPVAIGIDATLTTFHLYSKGVYYDPDC--NPEDINHAVLLVGYGV 78

Query: 337 L-NGHKYWLVKNSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465
              G +YW+VKNSW   WG +GY+LM+  R N CG+ +  +Y ++
Sbjct: 79  TRRGQQYWIVKNSWGTGWGTEGYILMARNRGNLCGIANLASYPIM 123


>UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1;
           Rhipicephalus appendiculatus|Rep: Midgut cysteine
           proteinase 4 - Rhipicephalus appendiculatus (Brown ear
           tick)
          Length = 345

 Score =  113 bits (272), Expect = 2e-24
 Identities = 60/152 (39%), Positives = 83/152 (54%), Gaps = 3/152 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKIT--GWVNVTTNNENAL 180
           GG+   A+Q++   G L TE  Y    G +  C   N     +++  G   V   NE  L
Sbjct: 193 GGQMPGAFQYVQDAGGLDTEARYPYRQGTNFQCQFSNSFEARRVSVNGHTRVPPRNERVL 252

Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360
           + A+   GPIS+AI+A+ +TF FY NG+Y EP C  +   L+HAVL VGYG   G  YW+
Sbjct: 253 QDAVANVGPISIAINASPQTFMFYKNGIYGEPNCDPR--GLNHAVLLVGYGEERGVPYWI 310

Query: 361 VKNSWSNMWGNDGYVLMSMRENNCGVQSAPTY 456
           VKNSW   WG  GY+ +    N CG+   P++
Sbjct: 311 VKNSWGPGWGEGGYIKILRNRNVCGMSQDPSF 342


>UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_26_47548_45815 - Giardia lamblia
           ATCC 50803
          Length = 577

 Score =  113 bits (271), Expect = 3e-24
 Identities = 60/158 (37%), Positives = 97/158 (61%), Gaps = 6/158 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG---LPTEEDYGGYLGQDGYCH--IDNVTAITKITGWVNVTTNNEN 174
           GG+   A +W++++    +  E +Y  YLGQ+  C   + +  +   +TG+  V   +  
Sbjct: 418 GGDTLAALKWLVENNGGRVAFESEYP-YLGQNDLCKEALFDHESFYFVTGYSAVKQYSIP 476

Query: 175 ALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGH-K 351
           +LK AL + GP++V+I    ++  FYS GVY +P C  K D+L HAVLAVGYG  + +  
Sbjct: 477 SLKAAL-QDGPVAVSIGIT-ESLLFYSGGVYNDPACPYKYDDLSHAVLAVGYGTDDTYGD 534

Query: 352 YWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           YW+V+NSWS +WG DGY  +SM++N CG+ +  +Y ++
Sbjct: 535 YWIVRNSWSPLWGMDGYFYLSMKDNICGILTDASYAVV 572


>UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 234

 Score =  113 bits (271), Expect = 3e-24
 Identities = 57/146 (39%), Positives = 81/146 (55%), Gaps = 1/146 (0%)
 Frame = +1

Query: 31  YQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGPI 210
           Y  I  HGL   ED   Y  +   C  D    + K+TG+ +   +NE+ LK  +  +GP 
Sbjct: 91  YVKIFMHGLFETEDNYPYQAEHHSCKFDKTRGVGKLTGY-HKCKSNEDQLKTEVAANGPY 149

Query: 211 SVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWG 390
           +V I+A  + F  YS+GV+  PKC   +  LDH V  +GYGV +G  YWLV+NSW   WG
Sbjct: 150 AVMINADSEQFRLYSSGVFDNPKCGKII--LDHVVTVIGYGVEDGKDYWLVRNSWGKYWG 207

Query: 391 NDGYVLMSM-RENNCGVQSAPTYVLI 465
            +GY+ MS  ++N CG+ +     LI
Sbjct: 208 LEGYIKMSRNKDNQCGIATEAVIPLI 233


>UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           L-like proteinase precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 317

 Score =  112 bits (269), Expect = 5e-24
 Identities = 59/152 (38%), Positives = 85/152 (55%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    AY +I  +GL  E  Y  Y G DGY   + + AI KI G+ ++    E ALK A
Sbjct: 170 GGWPHWAYDYIKDNGLCLESKYK-YQGYDGYYCKECIPAIKKINGYSSIN-QTEEALKEA 227

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           +   GPI+V ++A +  +  YS G+     C    + ++HAVLAVGYG  NG  +WL+KN
Sbjct: 228 VGTAGPIAVCVNA-NDDWQLYSGGILESQSCPGG-ESINHAVLAVGYGSENGKDFWLIKN 285

Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           SW+  WG +GY+ +   +N CG+     Y L+
Sbjct: 286 SWNTYWGEEGYLRIVRGKNQCGINEVADYPLL 317


>UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11;
           Trichomonadidae|Rep: Cysteine protease - Tritrichomonas
           foetus (Trichomonas foetus)
          Length = 315

 Score =  112 bits (269), Expect = 5e-24
 Identities = 54/155 (34%), Positives = 87/155 (56%), Gaps = 3/155 (1%)
 Frame = +1

Query: 7   GGGEDFRAYQWIM---KHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENA 177
           GG  D+ AY++I+   K  +  E DY  Y   DG C       +  +  ++ +  N+E  
Sbjct: 164 GGLMDY-AYKYIIDRQKGKMILESDYV-YTALDGVCKFAQFQTVGNVASFLYIAENDEED 221

Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357
           L   +  HGP++VAIDA+H++F  Y +G+Y EP+C      L+H V  +G+G  N  KYW
Sbjct: 222 LAANVETHGPVAVAIDASHQSFQLYKSGIYDEPEC--SATFLNHGVGCIGFGSDNDTKYW 279

Query: 358 LVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVL 462
           +V NSW   WG +GY+ +  ++N CG+ ++  + L
Sbjct: 280 IVPNSWGLTWGEEGYIRIIRKDNRCGIAASACFPL 314


>UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicatein
           a3 - Lubomirskia baicalensis
          Length = 344

 Score =  112 bits (269), Expect = 5e-24
 Identities = 60/156 (38%), Positives = 89/156 (57%), Gaps = 4/156 (2%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHID--NVTAITKITGWVNVTTNNENAL 180
           GG+ + A+++++ +G + TE  Y  Y G+   C  +  NV AI+  TG V + + +E  L
Sbjct: 194 GGDVYTAFKYVVDNGGIDTESSYP-YKGKKSSCQYNSKNVGAIS--TGVVKIASGSETDL 250

Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360
             A+   GPI+VA+DA+   F FY +GV+    C     +L+HA+L  GYG  NG  YWL
Sbjct: 251 LSAVASVGPIAVAVDASVNAFMFYQSGVFDSSTCSTS--KLNHAMLVTGYGSTNGKDYWL 308

Query: 361 VKNSWSNMWGNDGYVLMSMRE-NNCGVQSAPTYVLI 465
           VKNSW   WG  GY+ M   + N CG+ S   Y ++
Sbjct: 309 VKNSWGTGWGESGYIKMVRNKYNQCGIASDALYPML 344


>UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel
           protein - Danio rerio (Zebrafish) (Brachydanio rerio)
          Length = 328

 Score =  111 bits (268), Expect = 7e-24
 Identities = 57/152 (37%), Positives = 85/152 (55%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG   RA+ +++++       +  Y  ++G C           TG+  V  +NE AL+ A
Sbjct: 179 GGFLSRAFLYVIQNRGIDSSTFYPYEHKEGVCRYSVSGRAGYCTGFRIVPRHNEAALQSA 238

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           +   GP+SV I+A   +F  Y +G+Y +PKC + +  ++HAVL VGYG  NG  YWLVKN
Sbjct: 239 VANIGPVSVGINAKLLSFHRYRSGIYNDPKCSSAL--INHAVLVVGYGSENGQDYWLVKN 296

Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           SW   WG +GY+ M+  +N CG+ S   Y  I
Sbjct: 297 SWGTAWGENGYIRMARNKNMCGISSFGIYPTI 328


>UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18;
           Schistosoma|Rep: Preprocathepsin cathepsin L -
           Schistosoma japonicum (Blood fluke)
          Length = 331

 Score =  111 bits (268), Expect = 7e-24
 Identities = 55/150 (36%), Positives = 86/150 (57%), Gaps = 1/150 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    A+ ++ KH + +E DY  YLG D  CH      + K+  + ++   +E  L+ A
Sbjct: 182 GGTMDLAFNYLEKHYIESENDYK-YLGHDANCHYRKSKGVVKVKKFGDLPARDEKTLEKA 240

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           ++++GPISV I  A  +   Y +G+Y    CK    +++H VLAVGYG  NG  YWL+KN
Sbjct: 241 VYQYGPISVGI-VALDSLILYKSGIYESKDCKYA--DINHGVLAVGYGRENGKDYWLIKN 297

Query: 370 SWSNMWGNDGYV-LMSMRENNCGVQSAPTY 456
           SW ++WG +GY  L   + + CG+ S  ++
Sbjct: 298 SWGDLWGMNGYFKLRRNKPHMCGISSNSSF 327


>UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=17; Trichomonas vaginalis|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 318

 Score =  111 bits (268), Expect = 7e-24
 Identities = 61/148 (41%), Positives = 80/148 (54%), Gaps = 5/148 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKH--GL-PTEEDYGGYLGQDGYCHIDNVTAITKITGWVN-VTTNNENA 177
           GG+++ AY +++KH  GL   E DY  Y  +DG C       +T    +V   TT NE+ 
Sbjct: 164 GGDEYLAYDYVIKHQKGLWMLETDYP-YTARDGSCKFKAAKGVTLTKSYVRPTTTQNEDE 222

Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357
           LK    K G +S+AIDA+   F  YS+G+Y    C +    LDHAV  VGYG  N   YW
Sbjct: 223 LKAGCAKGGVVSIAIDASGYDFQLYSSGIYNPKSCSSTF--LDHAVGLVGYGTENKVDYW 280

Query: 358 LVKNSWSNMWGNDGYVLMSMRE-NNCGV 438
           +V+NSW   WG  GY+ M     N CGV
Sbjct: 281 IVRNSWGTSWGEKGYIRMIRNNGNKCGV 308


>UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_26_49243_47612 - Giardia lamblia
           ATCC 50803
          Length = 543

 Score =  111 bits (267), Expect = 9e-24
 Identities = 57/130 (43%), Positives = 82/130 (63%), Gaps = 2/130 (1%)
 Frame = +1

Query: 82  YLGQDGYCHIDNVTAIT-KITGWVNVTTNNENALKLALFKHGPISVAIDAAHKTFSFYSN 258
           YLG +  C+    T+   +I G  +V   +  A+K AL   GP+S+A+ A  +TFS+YS 
Sbjct: 415 YLGVESLCNESIFTSDHGRIRGVAHVKEYDIGAMKYALLS-GPVSIAV-AVTETFSWYSG 472

Query: 259 GVYFEPKCKNKVDELDHAVLAVGYGVLN-GHKYWLVKNSWSNMWGNDGYVLMSMRENNCG 435
           GV+ +P C + VD+L HAVL VG+G       YW+V+NSWSN WG DGY+ +SM+ N CG
Sbjct: 473 GVFNDPACASGVDDLAHAVLLVGWGTDEVAGDYWIVRNSWSNAWGIDGYMYLSMKNNICG 532

Query: 436 VQSAPTYVLI 465
           V +   YV++
Sbjct: 533 VLTCADYVMV 542


>UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2;
           Taeniidae|Rep: Cathepsin L-like cysteine proteinase -
           Taenia solium (Pork tapeworm)
          Length = 339

 Score =  111 bits (267), Expect = 9e-24
 Identities = 58/146 (39%), Positives = 82/146 (56%), Gaps = 1/146 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    A++++ +H +  E  Y  Y   DG C  +    +  +T   ++   NE AL  A
Sbjct: 190 GGYMSYAFKYLEEHFIEPESAYP-YRATDGPCRYNESLGVGTVTDIGDIPEGNETALMEA 248

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           +   GPIS+AIDA+   F FY +G+Y    C +K   L+H VLA+GYG  +G  YWLVKN
Sbjct: 249 VATVGPISIAIDASSLGFMFYRHGIYKSHWCSSKF--LNHGVLAIGYGKQDGKPYWLVKN 306

Query: 370 SWSNMWGNDGYVLMSMRENN-CGVQS 444
           SW   WG  GY++M+   +N CGV S
Sbjct: 307 SWGTRWGMKGYIMMAKDYHNMCGVAS 332


>UniRef50_Q23EG5 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 429

 Score =  111 bits (266), Expect = 1e-23
 Identities = 57/153 (37%), Positives = 83/153 (54%), Gaps = 1/153 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIM-KHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG   RA+++I    G+ +  DY  Y G+DG C       + K+    N+T  +EN L  
Sbjct: 193 GGLPSRAFEYIAYAGGIESSRDYP-YKGKDGKCKFKPQKVVAKVQSSFNITFQDENELIY 251

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
            L K+GP+S+A       F  Y  G+Y  P+C     E++HAVLAVGY  L G +Y++VK
Sbjct: 252 HLAKNGPVSIAYQVTDD-FENYEGGIYSNPECSTDPQEVNHAVLAVGYN-LTG-RYYIVK 308

Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           NSW   WG DGY  + +  N CG+    +Y ++
Sbjct: 309 NSWGKDWGMDGYFYIELGSNMCGLADCASYPIL 341


>UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor;
           n=176; Viridiplantae|Rep: Cysteine proteinase RD21a
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 462

 Score =  109 bits (262), Expect = 4e-23
 Identities = 61/156 (39%), Positives = 90/156 (57%), Gaps = 6/156 (3%)
 Frame = +1

Query: 7   GGGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYC-HIDNVTAITKITGWVNVTTNNENAL 180
           GG  D+ A+++I+K+G + T++DY  Y G DG C  I     +  I  + +V T +E +L
Sbjct: 202 GGLMDY-AFEFIIKNGGIDTDKDYP-YKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESL 259

Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360
           K A+  H PIS+AI+A  + F  Y +G+ F+  C     +LDH V+AVGYG  NG  YW+
Sbjct: 260 KKAV-AHQPISIAIEAGGRAFQLYDSGI-FDGSCGT---QLDHGVVAVGYGTENGKDYWI 314

Query: 361 VKNSWSNMWGNDGYVLMSMR----ENNCGVQSAPTY 456
           V+NSW   WG  GY+ M+         CG+   P+Y
Sbjct: 315 VRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSY 350


>UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:
           Cathepsin - Petromyzon marinus (Sea lamprey)
          Length = 333

 Score =  109 bits (261), Expect = 5e-23
 Identities = 58/155 (37%), Positives = 87/155 (56%), Gaps = 3/155 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIM-KHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVT-TNNENALK 183
           GG   RA Q+I+  +G+ +E  Y  Y   DG C        TK + +  V  ++NE  L+
Sbjct: 183 GGRSERALQYIIDNNGIDSELSYP-YEHADGKCRFKPANVATKCSSYQFVEPSSNEEVLR 241

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363
            A+   GPI++A++A   TF  Y +G++ EP C    +   HA+L VGYG L+G+ +W+V
Sbjct: 242 QAVASVGPIAIAMNADLDTFKHYKSGLFNEPSCDKSPN---HAMLVVGYGSLSGNDFWIV 298

Query: 364 KNSWSNMWGNDGYVLM-SMRENNCGVQSAPTYVLI 465
           KNSW   WG  GY+ M   ++N CG+ S   Y +I
Sbjct: 299 KNSWGEDWGEKGYIYMIRNKDNQCGIASIGIYPII 333


>UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3;
           Bilateria|Rep: Cathepsin L-like cysteine protease -
           Neobenedenia melleni
          Length = 335

 Score =  108 bits (260), Expect = 6e-23
 Identities = 61/155 (39%), Positives = 85/155 (54%), Gaps = 2/155 (1%)
 Frame = +1

Query: 7   GGGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG  D      I   GL +E  Y  Y  Q   C      +   I+ + +V+  +E  LK 
Sbjct: 184 GGIMDNSFNYLIHNKGLESEASYP-YEAQKKECRYKKALSKGTISSFTDVSQFDEKDLKR 242

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL-NGHKYWLV 363
           A+   GP+S+AIDA+  +F  Y +GVY E  C   +  L+H VLAVGYG    G  YW V
Sbjct: 243 AVGLVGPVSIAIDASQFSFHLYDSGVYDEEDCSQTM--LNHGVLAVGYGTTPEGLDYWKV 300

Query: 364 KNSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465
           KNSW+N WG +GY+LMS  ++N CGV +  +Y ++
Sbjct: 301 KNSWTNTWGMEGYILMSRNKDNQCGVATVASYPIV 335


>UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9;
           Onchocercidae|Rep: Cathepsin L-like precursor - Brugia
           pahangi (Filarial nematode worm)
          Length = 395

 Score =  108 bits (260), Expect = 6e-23
 Identities = 59/151 (39%), Positives = 82/151 (54%), Gaps = 2/151 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    A+Q+  ++G+  E  Y  Y+G +  C      A+    G+  +   +E ALK A
Sbjct: 248 GGYMPTAFQYASRYGIAMESRYP-YVGTEQRCRWQQSIAVVTDNGFNEIQPGDELALKHA 306

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGH-KYWLVK 366
           + K GP+ V I  + ++F FY +GVY E  C       DHAVLAVGYG    +  YW+VK
Sbjct: 307 VAKRGPVVVGISGSKRSFRFYKDGVYSEGNCGRP----DHAVLAVGYGTHPSYGDYWIVK 362

Query: 367 NSWSNMWGNDGYVLMSM-RENNCGVQSAPTY 456
           NSW   WG DGYV M+  R N C + SA ++
Sbjct: 363 NSWGTDWGKDGYVYMARNRGNMCHIASAASF 393


>UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6;
           Magnoliophyta|Rep: Cysteine protease Cp5 - Actinidia
           deliciosa (Kiwi)
          Length = 509

 Score =  108 bits (259), Expect = 8e-23
 Identities = 59/156 (37%), Positives = 86/156 (55%), Gaps = 6/156 (3%)
 Frame = +1

Query: 7   GGGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNV-TAITKITGWVNVTTNNENAL 180
           GG  D+ A++W+M +G + TE DY  Y G+DG C+     T    I G+ +V    E+AL
Sbjct: 211 GGYMDY-AFEWVMSNGGIDTETDYP-YTGEDGTCNTTKEETKAVSIDGYEDVA-EEESAL 267

Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360
             A+ K  PISV ID     F  Y+ G+Y +  C +  D++DHAVL VGYG  +G +YW+
Sbjct: 268 FCAVLKQ-PISVGIDGGAIDFQLYTGGIY-DGDCSDDPDDIDHAVLVVGYGAESGEEYWI 325

Query: 361 VKNSWSNMWGNDGYVLMSMRENN----CGVQSAPTY 456
           +KNSW   WG  GY  +    +     C + +  +Y
Sbjct: 326 IKNSWGTDWGMKGYAYIKRNTSKDYGVCAINAMASY 361


>UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2;
           Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio
           molitor (Yellow mealworm)
          Length = 336

 Score =  108 bits (259), Expect = 8e-23
 Identities = 60/152 (39%), Positives = 82/152 (53%), Gaps = 4/152 (2%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    A+ ++ ++G    E    Y   DG CH D      +++G+V ++  +EN L   
Sbjct: 187 GGWMNDAFTYVAQNGGIDSEGAYPYEMADGNCHYDPNQVAARLSGYVYLSGPDENMLADM 246

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           +   GP++VA DA    F  YS GVY+ P C+   ++  HAVL VGYG  NG  YWLVKN
Sbjct: 247 VATKGPVAVAFDA-DDPFGSYSGGVYYNPTCET--NKFTHAVLIVGYGNENGQDYWLVKN 303

Query: 370 SWSNMWGNDGYVLMSMRENN----CGVQSAPT 453
           SW + WG DGY  ++   NN     GV S PT
Sbjct: 304 SWGDGWGLDGYFKIARNANNHCGIAGVASVPT 335


>UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate
           cathepsin L; n=4; Danio rerio|Rep: Novel protein similar
           to vertebrate cathepsin L - Danio rerio (Zebrafish)
           (Brachydanio rerio)
          Length = 334

 Score =  107 bits (258), Expect = 1e-22
 Identities = 58/148 (39%), Positives = 82/148 (55%), Gaps = 2/148 (1%)
 Frame = +1

Query: 28  AYQWIMKHGLPTEEDYGGYLGQDGY-CHIDNVTAITKITGWVNVTTNNENALKLALFKHG 204
           AY +++ + L + + Y  Y   D   C  +   A+  I+ +  V   NE AL  A+   G
Sbjct: 190 AYDYVINNALESSDTYP-YTSVDTQPCFYEKNLAMAGISDYRFVPAGNEQALADAVATVG 248

Query: 205 PISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNM 384
           P+SVAIDA + +F FYS+G+Y E  C    + L+HAVL VGYG   G  YW++KNSW   
Sbjct: 249 PVSVAIDADNPSFLFYSSGIYKESNCNP--NNLNHAVLVVGYGSEEGTDYWIIKNSWGTG 306

Query: 385 WGNDGYVLMSMR-ENNCGVQSAPTYVLI 465
           WG  GY+ M    +N CG+ S   Y +I
Sbjct: 307 WGEGGYMRMIRNGKNTCGIASYALYPII 334


>UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor;
           n=35; Fasciola|Rep: Cathepsin L-like proteinase
           precursor - Fasciola hepatica (Liver fluke)
          Length = 326

 Score =  107 bits (258), Expect = 1e-22
 Identities = 56/153 (36%), Positives = 80/153 (52%), Gaps = 1/153 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    AYQ++ + GL TE  Y  Y   +G C  +    + K+TG+  V + +E  LK  
Sbjct: 174 GGLMENAYQYLKQFGLETESSYP-YTAVEGQCRYNKQLGVAKVTGYYTVHSGSEVELKNL 232

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           +    P +VA+D     F  Y +G+Y    C      ++HAVLAVGYG   G  YW+VKN
Sbjct: 233 VGARRPAAVAVDV-ESDFMMYRSGIYQSQTCSPL--RVNHAVLAVGYGTQGGTDYWIVKN 289

Query: 370 SWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465
           SW   WG  GY+ M+  R N CG+ S  +  ++
Sbjct: 290 SWGTYWGERGYIRMARNRGNMCGIASLASLPMV 322


>UniRef50_Q7KWP5 Cluster: Similar to Dictyostelium discoideum (Slime
           mold). Cysteine proteinase 5; n=2; Dictyostelium
           discoideum|Rep: Similar to Dictyostelium discoideum
           (Slime mold). Cysteine proteinase 5 - Dictyostelium
           discoideum (Slime mold)
          Length = 345

 Score =  107 bits (257), Expect = 1e-22
 Identities = 54/157 (34%), Positives = 97/157 (61%), Gaps = 11/157 (7%)
 Frame = +1

Query: 28  AYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHG 204
           A+Q+I+++G + +EE Y    G+ G C  ++  ++ KIT +  V + +E++L+ A+    
Sbjct: 192 AFQYIIENGGIDSEESYKFSGGEPGKCKYNSSNSVAKITSYEKVKSGSESSLESAVSLK- 250

Query: 205 PISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGY---------GVLNGHKYW 357
           P++  IDA+  +F FYS+G+Y+EP C N  D L+H++L VG+          + +   YW
Sbjct: 251 PVAAYIDASLSSFQFYSSGIYYEPSC-NSTD-LNHSILIVGFSDFSTTPTDSLKHSSNYW 308

Query: 358 LVKNSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465
           +V+NS+   WG +GY+ MS  R++NCG+    +YV++
Sbjct: 309 IVQNSFGKNWGENGYIFMSKDRDDNCGISKMASYVIV 345


>UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes
           abbreviatus|Rep: Cathepsin L 2 - Diaprepes abbreviatus
           (Sugarcane rootstalk borer weevil)
          Length = 348

 Score =  107 bits (257), Expect = 1e-22
 Identities = 59/151 (39%), Positives = 86/151 (56%), Gaps = 2/151 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG    A+ +I ++G + TE+ Y  Y  +DG C          ++  + V    EN L  
Sbjct: 201 GGWMHWAFGYIKENGGIDTEQSYP-YTAKDGRCAYKPGNKAATVSQVIMVP-RGENQLAA 258

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
            +   GPIS+A + +HK F FY +GVY EP+C +    L+HA+LAVGYG + G  +WLVK
Sbjct: 259 KVSSVGPISIAAEVSHK-FQFYHSGVYDEPQCGHS---LNHAMLAVGYGSMGGKNFWLVK 314

Query: 367 NSWSNMWGNDGYVLMSMRENN-CGVQSAPTY 456
           NSW   WG+ GY+ M+  +NN CG+    +Y
Sbjct: 315 NSWGTGWGDQGYIRMAKDKNNQCGIALMASY 345


>UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4;
           core eudicotyledons|Rep: Papain-like cysteine peptidase
           XBCP3 - Arabidopsis thaliana (Mouse-ear cress)
          Length = 437

 Score =  106 bits (255), Expect = 3e-22
 Identities = 61/156 (39%), Positives = 87/156 (55%), Gaps = 6/156 (3%)
 Frame = +1

Query: 7   GGGEDFRAYQWIMK-HGLPTEEDYGGYLGQDGYCHIDNVTA-ITKITGWVNVTTNNENAL 180
           GG  D+ A+++++K HG+ TE+DY  Y  +DG C  D +   +  I  +  V +N+E AL
Sbjct: 183 GGLMDY-AFEFVIKNHGIDTEKDYP-YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKAL 240

Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360
             A+    P+SV I  + + F  YS+G++  P C      LDHAVL VGYG  NG  YW+
Sbjct: 241 MEAVAAQ-PVSVGICGSERAFQLYSSGIFSGP-CSTS---LDHAVLIVGYGSQNGVDYWI 295

Query: 361 VKNSWSNMWGNDGYVLMSMRENN----CGVQSAPTY 456
           VKNSW   WG DG++ M     N    CG+    +Y
Sbjct: 296 VKNSWGKSWGMDGFMHMQRNTENSDGVCGINMLASY 331


>UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin K
           precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2);
           n=2; Tribolium castaneum|Rep: PREDICTED: similar to
           Cathepsin K precursor (Cathepsin O) (Cathepsin X)
           (Cathepsin O2) - Tribolium castaneum
          Length = 332

 Score =  105 bits (253), Expect = 5e-22
 Identities = 60/153 (39%), Positives = 82/153 (53%), Gaps = 3/153 (1%)
 Frame = +1

Query: 7   GGGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNN-ENAL 180
           GGG    AY +I ++ G+    DY  YLG++G C   +      I  +  +  NN E  +
Sbjct: 181 GGGWIPTAYSYIARNKGVNYNRDYP-YLGRNGKCRYRSSKPHIAIRSYAAINNNNNEERV 239

Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360
           +  +   GP+SVAI    +TF  Y +GVY  P C+     L+HAV+ VGYG   G  YWL
Sbjct: 240 RRLVATKGPVSVAIHVDSRTFHKYKSGVYNNPSCRGG---LNHAVVIVGYGRERGVDYWL 296

Query: 361 VKNSWSNMWGNDGYVLMSM-RENNCGVQSAPTY 456
           VKNSW   WG  GYV M+  R N CG+ +  +Y
Sbjct: 297 VKNSWGAGWGQKGYVKMARNRRNQCGIATHASY 329


>UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n=1;
           Rattus norvegicus|Rep: UPI0000501FDB UniRef100 entry -
           Rattus norvegicus
          Length = 338

 Score =  105 bits (253), Expect = 5e-22
 Identities = 57/158 (36%), Positives = 88/158 (55%), Gaps = 6/158 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG  + A+Q+++++G L +E  Y  Y G++G C   N  +  KIT        NE+ L  
Sbjct: 187 GGTTYNAFQYVLQNGGLESEATYP-YEGKEGLCRY-NPNSSAKITXICAPPQKNEDVLMD 244

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV----LNGHKY 354
           A+    P++  I   H +  FY  G+Y EPKC N V+   HAVL VGYG      +G+ Y
Sbjct: 245 AVATK-PVAAGIHVVHSSLRFYKKGIYHEPKCNNYVN---HAVLVVGYGFEGNETDGNNY 300

Query: 355 WLVKNSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465
           WL++NSW   WG +GY+ ++  R N+CG+ +   Y ++
Sbjct: 301 WLIQNSWGERWGLNGYMKIAKDRNNHCGIATFAQYPIV 338


>UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep:
           Toxopain-2 - Toxoplasma gondii
          Length = 422

 Score =  105 bits (252), Expect = 6e-22
 Identities = 59/148 (39%), Positives = 81/148 (54%), Gaps = 5/148 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GGE   A+Q+++  G    ED   YL +D  C   +   + KI G+ +V   +E A+K A
Sbjct: 271 GGEMNDAFQYVLDSGGICSEDAYPYLARDEECRAQSCEKVVKILGFKDVPRRSEAAMKAA 330

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHK--YWLV 363
           L K  P+S+AI+A    F FY  GV F+  C     +LDH VL VGYG     K  +W++
Sbjct: 331 LAK-SPVSIAIEADQMPFQFYHEGV-FDASCGT---DLDHGVLLVGYGTDKESKKDFWIM 385

Query: 364 KNSWSNMWGNDGYVLMSM---RENNCGV 438
           KNSW   WG DGY+ M+M    E  CG+
Sbjct: 386 KNSWGTGWGRDGYMYMAMHKGEEGQCGL 413


>UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteinase
           B; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like
           tick cysteine proteinase B - Haemaphysalis longicornis
           (Bush tick)
          Length = 332

 Score =  104 bits (250), Expect = 1e-21
 Identities = 46/90 (51%), Positives = 64/90 (71%), Gaps = 2/90 (2%)
 Frame = +1

Query: 202 GPISVAIDAAHKTFS-FYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWS 378
           GP+SVAIDA   + S FYS G+Y EP+C +  ++LDH VL VGYG  +G  YWLVKNSW 
Sbjct: 245 GPVSVAIDAQPTSHSQFYSEGIYDEPECSS--EQLDHGVLVVGYGTKDGKDYWLVKNSWG 302

Query: 379 NMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465
             WG++GY+ M+  ++N CG+ S+ +Y L+
Sbjct: 303 TTWGDEGYIYMTRNQDNQCGIASSASYPLV 332


>UniRef50_P25773 Cluster: Cathepsin L; n=38; Eukaryota|Rep:
           Cathepsin L - Felis silvestris catus (Cat)
          Length = 139

 Score =  104 bits (249), Expect = 1e-21
 Identities = 54/140 (38%), Positives = 79/140 (56%), Gaps = 5/140 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG    A+Q++  +G L +EE Y  Y  Q   C      ++  +T + ++ +  EN L +
Sbjct: 1   GGLIDDAFQYVKDNGGLDSEESYP-YHAQGDSCKYRPENSVANVTDYWDIPSK-ENELMI 58

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV----LNGHKY 354
            L   GPIS AIDA+  TF FY  G+Y++P C +  +++DH VL VGYG         KY
Sbjct: 59  TLAAVGPISAAIDASLDTFRFYKEGIYYDPSCSS--EDVDHGVLVVGYGADGTETENKKY 116

Query: 355 WLVKNSWSNMWGNDGYVLMS 414
           W++KNSW   WG DGY+ M+
Sbjct: 117 WIIKNSWGTDWGMDGYIKMA 136


>UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin
           heavy chain; n=3; Amniota|Rep: PREDICTED: similar to
           ferritin heavy chain - Ornithorhynchus anatinus
          Length = 338

 Score =  103 bits (248), Expect = 2e-21
 Identities = 58/158 (36%), Positives = 87/158 (55%), Gaps = 6/158 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGY-CHIDNVTAITKITGWVNVTTNNENALKL 186
           GG+   A++++  +G    ED   YLG+D   C           T ++ V  +NE AL+ 
Sbjct: 186 GGQYIGAFEYVRANGGIDAEDLYPYLGRDDISCRYSLQGKAGNCTSYMVVDQDNEQALEQ 245

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN----GHKY 354
           A+   GP+SVA+DA  + F FY +G++    C  KV+   HA+LAVGYG       G  Y
Sbjct: 246 AVATVGPVSVAVDA--RPFFFYHSGIFSSHSCTQKVN---HAMLAVGYGTSKEPGGGQDY 300

Query: 355 WLVKNSWSNMWGNDGYV-LMSMRENNCGVQSAPTYVLI 465
           W++KNSWS  WG  GY+ L+    N+CGV S  ++ ++
Sbjct: 301 WILKNSWSERWGEQGYMRLLKGANNHCGVASVASFPVL 338


>UniRef50_UPI0000E224BB Cluster: PREDICTED: hypothetical protein;
           n=1; Pan troglodytes|Rep: PREDICTED: hypothetical
           protein - Pan troglodytes
          Length = 143

 Score =  103 bits (247), Expect = 2e-21
 Identities = 47/90 (52%), Positives = 61/90 (67%), Gaps = 5/90 (5%)
 Frame = +1

Query: 202 GPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV----LNGHKYWLVKN 369
           GPISVA+ A+H +F FY  G+YFEP+C    + LDHA+L VGY       + +KYWLVKN
Sbjct: 53  GPISVAVGASHVSFQFYKKGIYFEPRC--DPEGLDHAMLVVGYSYEGADSDNNKYWLVKN 110

Query: 370 SWSNMWGNDGYVLMSM-RENNCGVQSAPTY 456
           SW   WG DGY+ M+  R NNCG+ +A +Y
Sbjct: 111 SWGKNWGMDGYIKMAKDRRNNCGIATAASY 140


>UniRef50_Q22AB1 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 344

 Score =  103 bits (247), Expect = 2e-21
 Identities = 52/146 (35%), Positives = 79/146 (54%), Gaps = 1/146 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG    AYQ++ + G + T + YG Y  +   C+ D      K+  W  +   NE  ++ 
Sbjct: 195 GGLMTDAYQFLQQSGGIQTADTYGDYKNKKDICNFDKAKVKAKVVDWYQIP-ENEETIRR 253

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
            L K+GP++V I+A  +T  FY  G+  +PK  N  D+++HAVL VGYGV  G  YWL+K
Sbjct: 254 ELVKNGPVAVGINA--RTLQFYEGGIV-DPK--NCDDKINHAVLIVGYGVEEGIPYWLIK 308

Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQS 444
           N W   WG  G+  +   +  CG+ +
Sbjct: 309 NQWGAEWGIKGFFKLIRGKKQCGIHT 334


>UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or
           H-like cysteine peptidase; n=1; Trichomonas vaginalis
           G3|Rep: Clan CA, family C1, cathepsin L, S or H-like
           cysteine peptidase - Trichomonas vaginalis G3
          Length = 473

 Score =  102 bits (245), Expect = 4e-21
 Identities = 53/150 (35%), Positives = 88/150 (58%), Gaps = 4/150 (2%)
 Frame = +1

Query: 1   ARGGGEDFRAYQWIMKHG--LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNEN 174
           A GGGE   A++ ++     L  E+DY  Y+G  GYC+ +    + ++   + +  + + 
Sbjct: 315 ACGGGEAGPAFRSLINQNFKLFLEKDYP-YIGVAGYCNRNPEHPVARVVDCIAIDKSTQ- 372

Query: 175 ALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKY 354
           ALK AL+++GP S+ I+   ++ SFY+ G   +P C    D+L H VL  G+ +++G + 
Sbjct: 373 ALKEALYQYGPASIGINVI-ESMSFYTKGAVNDPTCTGAADDLVHEVLLTGWKIVDGIEC 431

Query: 355 WLVKNSWSNMWGNDGYVLMSM--RENNCGV 438
           W +KNSWS  WGN+GY+ +    +E NCGV
Sbjct: 432 WEIKNSWSTHWGNEGYIYIQAENQEYNCGV 461


>UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila
           melanogaster|Rep: CG11459-PA - Drosophila melanogaster
           (Fruit fly)
          Length = 336

 Score =  102 bits (244), Expect = 6e-21
 Identities = 53/151 (35%), Positives = 81/151 (53%), Gaps = 2/151 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    A+ +   HG+ T+E Y  Y    G C   +  +   ++G+V +   +E  L   
Sbjct: 184 GGWVSVAFNYTRDHGIATKESYP-YEPVSGECLWKSDRSAGTLSGYVTLGNYDERELAEV 242

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLVK 366
           ++  GP++V+ID  H+ F  YS GV   P C++K  +L H+VL VG+G       YW++K
Sbjct: 243 VYNIGPVAVSIDHLHEEFDQYSGGVLSIPACRSKRQDLTHSVLLVGFGTHRKWGDYWIIK 302

Query: 367 NSWSNMWGNDGYVLMSMRENN-CGVQSAPTY 456
           NS+   WG  GY+ ++   NN CGV S P Y
Sbjct: 303 NSYGTDWGESGYLKLARNANNMCGVASLPQY 333


>UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n=3;
           Brugia malayi|Rep: Cathepsin L-like cysteine proteinase
           - Brugia malayi (Filarial nematode worm)
          Length = 353

 Score =  102 bits (244), Expect = 6e-21
 Identities = 48/145 (33%), Positives = 81/145 (55%), Gaps = 2/145 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITK--ITGWVNVTTNNENALK 183
           GG +   ++W+ +HG+ T++ Y  Y   D      N     K  +     +  +NE  LK
Sbjct: 200 GGNEPAVFRWVAEHGVKTDKSYP-YKENDSVSCPRNTPQRRKYGLADAFYLPPSNEQILK 258

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363
             L  +GP+ V++ ++ ++F  Y +G+Y +PKC    ++++HAV+AVGYGV NG +Y+++
Sbjct: 259 KILALYGPVCVSLHSSLQSFVAYRSGIYNDPKCPTNAEKVNHAVIAVGYGVQNGMEYFII 318

Query: 364 KNSWSNMWGNDGYVLMSMRENNCGV 438
           KNSW   WG  GY  +      CG+
Sbjct: 319 KNSWGPTWGQKGYGRIRAGVFMCGI 343


>UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia
           theta|Rep: Cathepsin H precursor - Guillardia theta
           (Cryptomonas phi)
          Length = 353

 Score =  101 bits (242), Expect = 1e-20
 Identities = 57/161 (35%), Positives = 86/161 (53%), Gaps = 12/161 (7%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHI-------DNV----TAITKITGWVN 153
           GG   +A+++IM +G L   E+Y  Y+  DG+C++       D V    +   K++   N
Sbjct: 189 GGLPSQAFEYIMYNGGLSKMEEYP-YVCGDGHCNVTGGPCAFDPVGKPWSVGAKVSKVAN 247

Query: 154 VTTNNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYG 333
            T  +E ++K  +  H PISVA +        YS+GVY  P C    D+++HAVLAVGYG
Sbjct: 248 FTPGDEISMKTVVGSHNPISVAFEVV-ADLRHYSSGVYSSPTCVGTPDKVNHAVLAVGYG 306

Query: 334 VLNGHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTY 456
              G  YW +KNSW   WG++GY  +    N CG+    ++
Sbjct: 307 TEGGIPYWTIKNSWGFAWGDNGYFKIQRGSNKCGISVCASF 347


>UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2;
           Entamoeba|Rep: Cysteine proteinase ACP1 precursor -
           Entamoeba histolytica
          Length = 308

 Score =  101 bits (242), Expect = 1e-20
 Identities = 52/152 (34%), Positives = 85/152 (55%), Gaps = 3/152 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG    + ++I ++ GL  E DY  Y    G C    V  +  +TG   VT  +E  L+ 
Sbjct: 155 GGHPSNSLKFIQENNGLGLESDYP-YKAVAGTCK--KVKNVATVTGSRRVTDGSETGLQT 211

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNG-VYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363
            + ++GP++V +DA+  +F  Y  G +Y + KC++++  ++H V AVGYG  +  KYW++
Sbjct: 212 IIAENGPVAVGMDASRPSFQLYKKGTIYSDTKCRSRM--MNHCVTAVGYGSNSNGKYWII 269

Query: 364 KNSWSNMWGNDGYVLMSMRENN-CGVQSAPTY 456
           +NSW   WG+ GY L++   NN CG+     Y
Sbjct: 270 RNSWGTSWGDAGYFLLARDSNNMCGIGRDSNY 301


>UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2;
           Endopterygota|Rep: Cathepsin L-like proteinase - Bombyx
           mori (Silk moth)
          Length = 402

 Score =  101 bits (241), Expect = 1e-20
 Identities = 56/150 (37%), Positives = 81/150 (54%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    A ++  + GL  E  Y  Y+G+ GYC  D+     +   W  + + +E A++ A
Sbjct: 259 GGSLRGALRYAAREGLVMESHYP-YVGKKGYCRYDSNLVRARPRRWATLPSGDEEAMEKA 317

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           L   GP++VA++AA  TF  YS GVY +P C +    L+HA+L VGY       YW++ N
Sbjct: 318 LATVGPLAVAVNAAPFTFQLYS-GVYDDPFCVSW--HLNHAMLLVGY----TQDYWILLN 370

Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTYV 459
            W   WG DGY+ +    N CGV +  TYV
Sbjct: 371 WWGRNWGEDGYMRIRRGLNRCGVANMATYV 400


>UniRef50_Q23TW3 Cluster: Papain family cysteine protease containing
           protein; n=5; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 437

 Score =  100 bits (240), Expect = 2e-20
 Identities = 49/153 (32%), Positives = 82/153 (53%), Gaps = 1/153 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIM-KHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG   + ++++    G+  E DY  Y G+D  C  ++   + ++    N+T  +EN L  
Sbjct: 273 GGLPSKGFEYLAYAGGIQNEADYP-YEGEDKNCRFNSSKTVVQVQKSYNITFQDENELIY 331

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
            L  +GP+++A    +  F  Y NGV+    C    ++++HAVLAVGY +    KY++ K
Sbjct: 332 HLANYGPVTIAYQV-NSDFDNYKNGVFTSSNCSKDPEDVNHAVLAVGYNMTG--KYFIAK 388

Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           NSW N WG +GY  + +  N CG+    +Y +I
Sbjct: 389 NSWGNDWGMNGYFYIELGSNMCGLADCASYPII 421


>UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean
           endopeptidase) (Cysteine proteinase)
           (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
           Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor
           (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase)
           (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
           Vignain-2] - Vigna mungo (Rice bean) (Black gram)
          Length = 362

 Score =  100 bits (240), Expect = 2e-20
 Identities = 63/156 (40%), Positives = 83/156 (53%), Gaps = 7/156 (4%)
 Frame = +1

Query: 10  GGEDFRAYQWI-MKHGLPTEEDYGGYLGQDGYCHIDNVTAIT-KITGWVNVTTNNENALK 183
           GG    A+++I  K G+ TE +Y  Y  Q+G C    V  +   I G  NV  N+ENAL 
Sbjct: 193 GGLMESAFEFIKQKGGITTESNYP-YTAQEGTCDESKVNDLAVSIDGHENVPVNDENALL 251

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWL 360
            A+    P+SVAIDA    F FYS GV F   C     +L+H V  VGYG  ++G  YW+
Sbjct: 252 KAVANQ-PVSVAIDAGGSDFQFYSEGV-FTGDCNT---DLNHGVAIVGYGTTVDGTNYWI 306

Query: 361 VKNSWSNMWGNDGYVLM----SMRENNCGVQSAPTY 456
           V+NSW   WG  GY+ M    S +E  CG+    +Y
Sbjct: 307 VRNSWGPEWGEQGYIRMQRNISKKEGLCGIAMMASY 342


>UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan
           CA, family C1, cathepsin L-like cysteine peptidase -
           Trichomonas vaginalis G3
          Length = 306

 Score =  100 bits (239), Expect = 2e-20
 Identities = 53/132 (40%), Positives = 71/132 (53%), Gaps = 4/132 (3%)
 Frame = +1

Query: 82  YLGQDGYCHIDNVTAITKITGWVN---VTTNNENALKLALFKHGPISVAIDAAHKTFSFY 252
           Y    G C  DN  A  K  G +    V+  +E  L  A+  +GP  ++IDA+  +F  Y
Sbjct: 178 YTAVQGTCKYDNKKA--KYFGMLELAGVSRKSETELAKAVATYGPAMISIDASQHSFMLY 235

Query: 253 SNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYVLMSMRENN- 429
             G+Y EPKC    ++LDHAV  VGYGV     YW+V+NSW  +WG  GYV M   +NN 
Sbjct: 236 KEGIYDEPKCSE--EDLDHAVGCVGYGVEGEKDYWIVRNSWGEVWGEKGYVRMIRNKNNQ 293

Query: 430 CGVQSAPTYVLI 465
           CGV +    V +
Sbjct: 294 CGVATEAYNVFV 305


>UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24;
           Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa
           (Rice)
          Length = 339

 Score =   99 bits (238), Expect = 3e-20
 Identities = 60/154 (38%), Positives = 86/154 (55%), Gaps = 5/154 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    A+++I+K+G  T E    Y   DG C+  + +A T I G+ +V  NNE AL  A
Sbjct: 189 GGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSNSAAT-IKGYEDVPANNEAALMKA 247

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL-NGHKYWLVK 366
           +    P+SVA+D    TF FYS GV     C     +LDH ++A+GYG   +G +YWL+K
Sbjct: 248 VANQ-PVSVAVDGGDMTFQFYSGGV-MTGSCGT---DLDHGIVAIGYGKDGDGTQYWLLK 302

Query: 367 NSWSNMWGNDGYVLM----SMRENNCGVQSAPTY 456
           NSW   WG +G++ M    S +   CG+   P+Y
Sbjct: 303 NSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSY 336


>UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3;
           Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays
           (Maize)
          Length = 493

 Score =   99 bits (238), Expect = 3e-20
 Identities = 58/155 (37%), Positives = 84/155 (54%), Gaps = 6/155 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHID-NVTAITKITGWVNVTTNNENALK 183
           GG    A+ +++K+G + TE DY  + G DG C +    T +  I  +  V  N E AL+
Sbjct: 229 GGLMDNAFVFMIKNGGIDTEADYP-FTGHDGTCDLKLKNTRVVSIDSFERVPINYERALQ 287

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363
            A+  H P+S +I+A+ + F  YS+G+ F+ +C      LDH V  VGYG   G  YW+V
Sbjct: 288 KAV-AHQPVSASIEASRRAFQLYSSGI-FDGRCGTY---LDHGVTVVGYGSEGGKDYWIV 342

Query: 364 KNSWSNMWGNDGYVLMS----MRENNCGVQSAPTY 456
           KNSW   WG  GYV M+    +R  + G+   P Y
Sbjct: 343 KNSWGTQWGEAGYVRMARNVRVRPPSAGIAMEPLY 377


>UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1;
           Naegleria fowleri|Rep: Cysteine proteinase homolog -
           Naegleria fowleri
          Length = 347

 Score =   99 bits (238), Expect = 3e-20
 Identities = 57/158 (36%), Positives = 89/158 (56%), Gaps = 6/158 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG  + A+Q+++K+G L TE+ Y  Y G D  C  +       I+ W +++++ EN +  
Sbjct: 196 GGLMWSAFQYVIKNGGLDTEDSYP-YEGVDDTCRFNKSNVAATISSWTSISSD-ENQMAA 253

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNG-----HK 351
            L  +GPIS+AI+A  +   +Y++G+  +P   N  D LDH VL VGYGV          
Sbjct: 254 WLAANGPISIAINA--EWLQYYTSGIS-DPWFCNPQD-LDHGVLIVGYGVGKSWLGSEEN 309

Query: 352 YWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           YW+VKNSW + WG DGY  +   +  CG+ S P+  ++
Sbjct: 310 YWIVKNSWGSDWGEDGYFRIIRGKGKCGLNSVPSSSIV 347


>UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor;
           n=6; Chrysomelinae|Rep: Cathepsin L-like proteinase
           precursor - Phaedon cochleariae (Mustard beetle)
          Length = 324

 Score =   99 bits (238), Expect = 3e-20
 Identities = 51/154 (33%), Positives = 85/154 (55%), Gaps = 2/154 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHI-DNVTAITKITGWVNVTTNNENALKL 186
           GG     ++++  +GL ++ DY  Y G++  C   D   ++ ++TG+  VT + E +LK 
Sbjct: 176 GGFAVNGFEYVKDNGLESDADYP-YSGKEDKCKANDKSRSVVELTGYKKVTAS-ETSLKE 233

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
           A+   GPIS  +    K    Y  G++ +  C    D L H V  VGYG+ NG KYW++K
Sbjct: 234 AVGTIGPISAVVFG--KPMKSYGGGIFDDSSCLG--DNLHHGVNVVGYGIENGQKYWIIK 289

Query: 367 NSWSNMWGNDGYV-LMSMRENNCGVQSAPTYVLI 465
           N+W   WG  GY+ L+   +++CGV+   +Y ++
Sbjct: 290 NTWGADWGESGYIRLIRDTDHSCGVEKMASYPIL 323


>UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 339

 Score = 99.5 bits (237), Expect = 4e-20
 Identities = 57/162 (35%), Positives = 94/162 (58%), Gaps = 10/162 (6%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKH-GLPTEEDYG--GYLGQD----GYCHIDNVTAITKITGWVNVTTNN 168
           GG    A+ +I+K  G+ +E +Y   GYL +     G C  ++  +   I+ ++ +   N
Sbjct: 181 GGLALIAFDYIIKQKGIDSEFNYPYEGYLIEPYEGRGRCRYNSFYSKASISSYIEIERFN 240

Query: 169 ENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL--N 342
           EN L  +L K  P+SV IDA+  +F  Y +GVY +P C + +  L+H +L +G+GV   N
Sbjct: 241 ENELTQSLIK-SPVSVMIDASQLSFMLYKSGVYKDPSCSSTI--LNHGILNIGFGVTPEN 297

Query: 343 GHKYWLVKNSWSNMWGNDGYVLMSMRENN-CGVQSAPTYVLI 465
           G++Y+++KNS+ + WG  GY+ +S   NN CG+ S    V+I
Sbjct: 298 GNEYYILKNSFGSKWGMKGYIYLSRNFNNHCGISSVGISVVI 339


>UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease;
           n=23; Magnoliophyta|Rep: Senescence-specific cysteine
           protease - Arabidopsis thaliana (Mouse-ear cress)
          Length = 346

 Score = 99.1 bits (236), Expect = 5e-20
 Identities = 58/144 (40%), Positives = 78/144 (54%), Gaps = 6/144 (4%)
 Frame = +1

Query: 52  GLPTEEDYGGYLGQDGYCHIDNVTA-ITKITGWVNVTTNNENALKLALFKHGPISVAIDA 228
           GL TE +Y  Y G+D  C+        T ITG+ +V  N+E AL  A+  H P+SV I+ 
Sbjct: 209 GLTTESNYP-YKGEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAV-AHQPVSVGIEG 266

Query: 229 AHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLVKNSWSNMWGNDGYV 405
               F FYS+GV F  +C      LDHAV A+GYG   NG KYW++KNSW   WG  GY+
Sbjct: 267 GGFDFQFYSSGV-FTGECTTY---LDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGYM 322

Query: 406 LMSM----RENNCGVQSAPTYVLI 465
            +      ++  CG+    +Y  I
Sbjct: 323 RIQKDVKDKQGLCGLAMKASYPTI 346


>UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila
           melanogaster|Rep: CG5367-PA - Drosophila melanogaster
           (Fruit fly)
          Length = 338

 Score = 99.1 bits (236), Expect = 5e-20
 Identities = 41/138 (29%), Positives = 79/138 (57%)
 Frame = +1

Query: 52  GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAA 231
           G+  ++DY  Y+ + G C      ++  +T W  +   +E A++ A+   GP++++I+A+
Sbjct: 208 GIMRDQDYP-YVARKGKCQFVPDLSVVNVTSWAILPVRDEQAIQAAVTHIGPVAISINAS 266

Query: 232 HKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYVLM 411
            KTF  YS+G+Y +P C +    ++HA++ +G+    G  YW++KN W   WG +GY+ +
Sbjct: 267 PKTFQLYSDGIYDDPLCSSA--SVNHAMVVIGF----GKDYWILKNWWGQNWGENGYIRI 320

Query: 412 SMRENNCGVQSAPTYVLI 465
               N CG+ +   Y ++
Sbjct: 321 RKGVNMCGIANYAAYAIV 338


>UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella
           histriomuscorum|Rep: Cathepsin H - Oxytricha trifallax
           (Sterkiella histriomuscorum)
          Length = 366

 Score = 99.1 bits (236), Expect = 5e-20
 Identities = 56/151 (37%), Positives = 79/151 (52%), Gaps = 2/151 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG    A+++I  +G L  E  Y  Y   +G C I        I G     + NE+ LK 
Sbjct: 201 GGLPSHAFEYIKDNGGLALETTYP-YKAANGQCSIQKGQQSVGIRGGAVNISLNEDDLKQ 259

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLV 363
           A++ HGP+SVA       F  Y +GVY    C N  ++++HAVLAVG+G   N   YW++
Sbjct: 260 AIYLHGPVSVAFRVIDG-FRDYKSGVYAVEGCANGPNDVNHAVLAVGFGTDENKVDYWII 318

Query: 364 KNSWSNMWGNDGYVLMSMRENNCGVQSAPTY 456
           KNSW   WG+ G+  M    N CG+Q+  +Y
Sbjct: 319 KNSWGAAWGDQGFFKMKRGVNMCGIQNCNSY 349


>UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor;
           n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 355

 Score = 98.7 bits (235), Expect = 7e-20
 Identities = 63/157 (40%), Positives = 89/157 (56%), Gaps = 7/157 (4%)
 Frame = +1

Query: 7   GGGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHI--DNVTAITKITGWVNVTTNNENA 177
           GG  D+ A+Q+I+  G L  E+DY  YL ++G C    ++V  +T I+G+ +V  N++ +
Sbjct: 202 GGLMDY-AFQYIISTGGLHKEDDYP-YLMEEGICQEQKEDVERVT-ISGYEDVPENDDES 258

Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357
           L  AL  H P+SVAI+A+ + F FY  GV F  KC     +LDH V AVGYG   G  Y 
Sbjct: 259 LVKAL-AHQPVSVAIEASGRDFQFYKGGV-FNGKCGT---DLDHGVAAVGYGSSKGSDYV 313

Query: 358 LVKNSWSNMWGNDGYVLMSMR----ENNCGVQSAPTY 456
           +VKNSW   WG  G++ M       E  CG+    +Y
Sbjct: 314 IVKNSWGPRWGEKGFIRMKRNTGKPEGLCGINKMASY 350


>UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Rep:
           Cathepsin L - Stylonychia lemnae
          Length = 340

 Score = 98.3 bits (234), Expect = 9e-20
 Identities = 56/151 (37%), Positives = 85/151 (56%), Gaps = 4/151 (2%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG+   A  +I   G + TE+DY  Y+G+D  C  +    +    G +N+       L+ 
Sbjct: 190 GGDMGLAMDYIASAGGVETEKDYP-YVGKDQTCAFEASKEVATDKGHINIVPGKFATLQA 248

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
           A+ + GP+SVAI+A    F FY +G++    C      LDH V AVGYGV NG +Y++V+
Sbjct: 249 AIAE-GPVSVAIEADSLFFQFYRSGIFDSSWCGTN---LDHGVAAVGYGVDNGKQYYIVR 304

Query: 367 NSWSNMWGNDGYV-LMSMRENN--CGVQSAP 450
           NSWS+ WG  GY+ +++  + N  CG+Q  P
Sbjct: 305 NSWSDSWGLKGYINIIANGDGNGMCGIQMEP 335


>UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Oryza
           sativa|Rep: Cysteine protease 1 precursor - Oryza sativa
           subsp. japonica (Rice)
          Length = 490

 Score = 98.3 bits (234), Expect = 9e-20
 Identities = 61/157 (38%), Positives = 86/157 (54%), Gaps = 8/157 (5%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTA-ITKITGWVNVTTNNENALK 183
           GG    A+ +I ++G L TEEDY  Y   DG C++   +  +  I G+ +V  N+E +L+
Sbjct: 222 GGIMDDAFAFIARNGGLDTEEDYP-YTAMDGKCNLAKRSRKVVSIDGFEDVPENDELSLQ 280

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV--LNGHKYW 357
            A+  H P+SVAIDA  + F  Y +GV F  +C      LDH V+AVGYG     G  YW
Sbjct: 281 KAV-AHQPVSVAIDAGGREFQLYDSGV-FTGRCGTN---LDHGVVAVGYGTDAATGAAYW 335

Query: 358 LVKNSWSNMWGNDGYVLM----SMRENNCGVQSAPTY 456
            V+NSW   WG +GY+ M    + R   CG+    +Y
Sbjct: 336 TVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASY 372


>UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;
           n=1; Nasonia vitripennis|Rep: PREDICTED: similar to
           CG5367-PA - Nasonia vitripennis
          Length = 362

 Score = 97.9 bits (233), Expect = 1e-19
 Identities = 46/138 (33%), Positives = 73/138 (52%)
 Frame = +1

Query: 52  GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAA 231
           GL T+  Y  Y    G C      ++  +T W  +   +E AL+ A+   GPI+ +I+A 
Sbjct: 232 GLMTDATYP-YTAHQGVCKFQRKLSVVNVTSWAILPARDERALEAAVATIGPIAASINAG 290

Query: 232 HKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYVLM 411
            +TF  Y +G+Y +P C +  D ++HA+L VGY       YW++KN W   WG +GY+ +
Sbjct: 291 PRTFQLYHSGIYDDPTCSS--DLVNHAMLIVGY----TPNYWILKNWWGASWGENGYMRL 344

Query: 412 SMRENNCGVQSAPTYVLI 465
              +N CGV +   Y  +
Sbjct: 345 RKGKNRCGVANYAAYAKV 362


>UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep:
           Actinidin Act3a - Actinidia eriantha
          Length = 380

 Score = 97.9 bits (233), Expect = 1e-19
 Identities = 62/157 (39%), Positives = 86/157 (54%), Gaps = 6/157 (3%)
 Frame = +1

Query: 4   RGGGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHID--NVTAITKITGWVNVTTNNEN 174
           +GG  D  AY++I+ +G + TEE+Y  Y+GQD  C     N   +T I  +  V  N+E 
Sbjct: 191 KGGFMD-DAYEFIINNGGINTEENYP-YIGQDDQCDEPKKNQNYVT-IDSYEQVPPNDEL 247

Query: 175 ALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKY 354
           A+K A+  + P+SVAIDA    F FY +G++    C      L+HAV  +GYG  NG  Y
Sbjct: 248 AMKRAV-AYQPVSVAIDAYCLGFRFYQSGIFTGGSCGTT---LNHAVTIIGYGTENGIDY 303

Query: 355 WLVKNSWSNMWGNDGYVLMSMR---ENNCGVQSAPTY 456
           W+VKNS+   WG  GY  +      E  CG+ S P Y
Sbjct: 304 WIVKNSYGTQWGESGYGKVQRNVGGEGRCGIASYPFY 340


>UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain;
           n=16; Chrysomelidae|Rep: Digestive cysteine protease
           intestain - Leptinotarsa decemlineata (Colorado potato
           beetle)
          Length = 326

 Score = 97.9 bits (233), Expect = 1e-19
 Identities = 56/157 (35%), Positives = 87/157 (55%), Gaps = 5/157 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG+   A++++  +G+ +E+ Y  Y+ +   C  D    I KI G+ NVTT+ E  L+ A
Sbjct: 177 GGDMSAAFEYVRDYGIQSEKSYP-YIRKQTECQYDASKTILKIKGYKNVTTSEEG-LRKA 234

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN---GH-KYW 357
           +   GPIS+A+++       Y +G+     C +   +LDH VL VGYG  +   G  K+W
Sbjct: 235 VGAIGPISIAMNS--DPLQLYYSGIISGKGCSH---DLDHGVLVVGYGKASQWSGETKFW 289

Query: 358 LVKNSWSNMWGNDGYVLMSMRENN-CGVQSAPTYVLI 465
            VKNSW  +WG +GY  +    NN CG+   PTY ++
Sbjct: 290 RVKNSWGKIWGENGYFRIKRDANNLCGIADDPTYPVL 326


>UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba
           culbertsoni|Rep: Cysteine proteinase - Acanthamoeba
           culbertsoni
          Length = 482

 Score = 97.1 bits (231), Expect = 2e-19
 Identities = 53/149 (35%), Positives = 79/149 (53%), Gaps = 4/149 (2%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG--LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALK 183
           GG     Y+W++ +   L T+  Y  Y+ +   C       +  I   + V   +E+ L 
Sbjct: 222 GGNVEITYRWMISNNARLMTQASYP-YIARQSTCRYVPSQGVQGIRNIMRVRAGSESDL- 279

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWL 360
           LA     P++VAID + ++F FYS G Y++P C +    L+HAVL VG+G       YW+
Sbjct: 280 LAKAAIAPVTVAIDGSKRSFMFYSGGYYYDPTCSST--NLNHAVLVVGWGTDPQRGDYWI 337

Query: 361 VKNSWSNMWGNDGYVLMSM-RENNCGVQS 444
            KN W   WG+DGYV M+  + NNCG+ S
Sbjct: 338 AKNEWGTAWGDDGYVYMARNKNNNCGIAS 366


>UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1;
           Brugia malayi|Rep: Cathepsin F-like cysteine proteinase
           - Brugia malayi (Filarial nematode worm)
          Length = 461

 Score = 97.1 bits (231), Expect = 2e-19
 Identities = 49/152 (32%), Positives = 75/152 (49%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    A++ I + G    ED   Y  ++G CH+        I   V +   NE  +K  
Sbjct: 312 GGLPINAFREIKRMGGLEPEDQYPYEAKNGTCHLVRAQIAVSIDDAVEIP-RNETVMKAW 370

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           + + GP+SV IDA  +  S+Y +G+    K +    +++H VL  GYG+ N   YW +KN
Sbjct: 371 IAQRGPLSVGIDA--ELLSYYKSGILHPSKSRCPPSKINHGVLITGYGIENNLPYWTIKN 428

Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           SW   WG +GY  +   +N CGV    +  +I
Sbjct: 429 SWGEQWGENGYFQLMRGKNICGVSDLVSSAII 460


>UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine
           protease; n=11; Callosobruchus maculatus|Rep: Putative
           gut cathepsin L-like cysteine protease - Callosobruchus
           maculatus (Southern cowpea weevil) (Pulse bruchid)
          Length = 326

 Score = 96.3 bits (229), Expect = 4e-19
 Identities = 56/150 (37%), Positives = 78/150 (52%), Gaps = 1/150 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG   +A+ ++   G+ TEE Y  Y G+   C       +TK+  +V      E A  +A
Sbjct: 179 GGLMGQAFDFVQDEGIQTEESYP-YEGRRSSCKKSG-EYVTKVKTYVFPLDEQEMARTVA 236

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEP-KCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
               GP++VAI+A+    SFY  G+  E  +C NK ++L+H VL VGYG  NG  YW+VK
Sbjct: 237 A--KGPVAVAIEASQ--LSFYDKGIVDERCRCSNKREDLNHGVLVVGYGSENGVDYWIVK 292

Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTY 456
           NSW   WG  GY  +      CG+    TY
Sbjct: 293 NSWGADWGEKGYFRLKKDVKACGIGYYNTY 322


>UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba
           histolytica|Rep: Cysteine protease 10 - Entamoeba
           histolytica
          Length = 297

 Score = 95.9 bits (228), Expect = 5e-19
 Identities = 47/122 (38%), Positives = 70/122 (57%), Gaps = 1/122 (0%)
 Frame = +1

Query: 28  AYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGP 207
           ++ ++  HG+  E DY  Y G+   C ID    + KI  +  V    E  LK+A++ H P
Sbjct: 182 SFNYVRDHGILLERDYP-YTGKANNCSIDGKKPVIKIKDYSFVFPQTEENLKIAVY-HQP 239

Query: 208 ISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHK-YWLVKNSWSNM 384
           ++V+ID++  +F FY  G+Y EP CK     +DH V  VGYG    H+ +W+VKNS+ N 
Sbjct: 240 VAVSIDSSQLSFQFYEGGIYDEPNCK----WVDHIVTVVGYGTTEEHQDFWVVKNSYGNE 295

Query: 385 WG 390
           WG
Sbjct: 296 WG 297


>UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:
           Silicatein beta - Suberites domuncula (Sponge)
          Length = 383

 Score = 95.9 bits (228), Expect = 5e-19
 Identities = 54/154 (35%), Positives = 83/154 (53%), Gaps = 6/154 (3%)
 Frame = +1

Query: 13  GEDFRAYQWIMKH-GLPTEEDY--GG--YLGQDGYCHIDNVTAITKITGWVNVTTNNENA 177
           G+  RA  +++++ G+ T + Y  GG  Y  +   C  +         G V++ + +EN 
Sbjct: 229 GDVNRALLYVIENDGVDTWKGYPSGGDPYRSKQYSCKYERQYRGASARGIVSLASGDENT 288

Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357
           L  A+   GP+SV +DA   +F FYS+GV   P C +    L HA++ +GYG  +G  YW
Sbjct: 289 LLTAVANSGPVSVYVDATSTSFQFYSDGVLNVPYCSSST--LSHALVVIGYGKYSGQDYW 346

Query: 358 LVKNSWSNMWGNDGY-VLMSMRENNCGVQSAPTY 456
           LVKNSW   WG  GY  L   + N CG+ +A ++
Sbjct: 347 LVKNSWGPNWGVRGYGKLARNKGNKCGIATAASF 380


>UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163
           precursor; n=4; Schizophora|Rep: Putative cysteine
           proteinase CG12163 precursor - Drosophila melanogaster
           (Fruit fly)
          Length = 614

 Score = 95.9 bits (228), Expect = 5e-19
 Identities = 59/162 (36%), Positives = 79/162 (48%), Gaps = 7/162 (4%)
 Frame = +1

Query: 1   ARGGGEDFRAYQWIMK-HGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENA 177
           A  GG    AY+ I    GL  E +Y  Y  +   CH +   +  ++ G+V++   NE A
Sbjct: 455 ACNGGLMDNAYKAIKDIGGLEYEAEYP-YKAKKNQCHFNRTLSHVQVAGFVDLPKGNETA 513

Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL---NGH 348
           ++  L  +GPIS+ I+A      FY  GV    K       LDH VL VGYGV    N H
Sbjct: 514 MQEWLLANGPISIGINA--NAMQFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSDYPNFH 571

Query: 349 K---YWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           K   YW+VKNSW   WG  GY  +   +N CGV    T  ++
Sbjct: 572 KTLPYWIVKNSWGPRWGEQGYYRVYRGDNTCGVSEMATSAVL 613


>UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryza
           sativa (japonica cultivar-group)|Rep: Putative cysteine
           proteinase - Oryza sativa subsp. japonica (Rice)
          Length = 357

 Score = 95.5 bits (227), Expect = 6e-19
 Identities = 56/154 (36%), Positives = 80/154 (51%), Gaps = 6/154 (3%)
 Frame = +1

Query: 13  GEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           G+   A+++I  +G +  E DY       G C          I G+  V  NNE AL LA
Sbjct: 202 GDMDEAFRYITSNGGIAAESDYPYEDRALGTCRASGKPVAASIRGFQYVPPNNETALLLA 261

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLVK 366
           +  H P+SVA+D   K   F+S+GV+   + +    +L+HA+ AVGYG   +G KYWL+K
Sbjct: 262 V-AHQPVSVALDGVGKVSQFFSSGVFGAMQNETCTTDLNHAMTAVGYGTDEHGTKYWLMK 320

Query: 367 NSWSNMWGNDGY--VLMSMRENN--CGVQSAPTY 456
           NSW   WG  GY  +   +  N   CG+   P+Y
Sbjct: 321 NSWGTDWGEGGYMKIARDVASNTGLCGLAMQPSY 354


>UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh
           fly) (Boettcherisca peregrina). Cathepsin L; n=2;
           Dictyostelium discoideum|Rep: Similar to Sarcophaga
           peregrina (Flesh fly) (Boettcherisca peregrina).
           Cathepsin L - Dictyostelium discoideum (Slime mold)
          Length = 265

 Score = 94.7 bits (225), Expect = 1e-18
 Identities = 49/148 (33%), Positives = 80/148 (54%), Gaps = 2/148 (1%)
 Frame = +1

Query: 28  AYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGP 207
           A+++I+  G    E    Y G+D  C  +      K++G+V +   +E+AL  A+  +GP
Sbjct: 119 AFKYIISSGGVNLESQYPYTGKDEVCKFNQSEKEAKVSGFVMIPKFDESALMEAIALYGP 178

Query: 208 ISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLVKNSWSNM 384
           ++V ID + K F   S G+Y+   C        HAVLA+GYG   NG  Y+L+KNSW   
Sbjct: 179 VAVPIDTSTKEFQHLSGGIYYSDSCDPW--NTIHAVLAIGYGTDENGVDYFLMKNSWGKS 236

Query: 385 WGNDGYVLMSMR-ENNCGVQSAPTYVLI 465
           WG +G+  +    +  CG+ +A +Y ++
Sbjct: 237 WGTNGFFKVKRGVKGKCGIVTAASYPIV 264


>UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16;
           Bromeliaceae|Rep: Fruit bromelain precursor - Ananas
           comosus (Pineapple)
          Length = 351

 Score = 94.7 bits (225), Expect = 1e-18
 Identities = 57/155 (36%), Positives = 87/155 (56%), Gaps = 6/155 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG   +AY +I+ + G+ TEE+Y  YL   G C+ ++      ITG+  V  N+E ++  
Sbjct: 186 GGWVNKAYDFIISNNGVTTEENYP-YLAYQGTCNANSFPNSAYITGYSYVRRNDERSMMY 244

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLV 363
           A+  + PI+  IDA+ + F +Y+ GV+  P C      L+HA+  +GYG   +G KYW+V
Sbjct: 245 AV-SNQPIAALIDAS-ENFQYYNGGVFSGP-CGTS---LNHAITIIGYGQDSSGTKYWIV 298

Query: 364 KNSWSNMWGNDGYVLM----SMRENNCGVQSAPTY 456
           +NSW + WG  GYV M    S     CG+  AP +
Sbjct: 299 RNSWGSSWGEGGYVRMARGVSSSSGVCGIAMAPLF 333


>UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryza
           sativa|Rep: Putative cysteine proteinase - Oryza sativa
           subsp. japonica (Rice)
          Length = 352

 Score = 94.3 bits (224), Expect = 1e-18
 Identities = 56/163 (34%), Positives = 81/163 (49%), Gaps = 11/163 (6%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAIT----KITGWVNVTTNNENA 177
           GG    A+Q++   G  T E    Y G  G C  D  ++ +     I+G+  V  N+E +
Sbjct: 192 GGSLDNAFQYMANSGGVTTEAAYAYQGAQGACQFDASSSASGVAATISGYQRVNPNDEGS 251

Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV----LNG 345
           L  A+    P+SVAI+ +   F  Y +GV+    C  K   LDHAV  VGYG       G
Sbjct: 252 LAAAVASQ-PVSVAIEGSGAMFRHYGSGVFTADSCGTK---LDHAVAVVGYGAEADGSGG 307

Query: 346 HKYWLVKNSWSNMWGNDGYVLMSM---RENNCGVQSAPTYVLI 465
             YW++KNSW   WG+ GY+ +      +  CGV  AP+Y ++
Sbjct: 308 GGYWIIKNSWGTTWGDGGYMKLEKDVGSQGACGVAMAPSYPVV 350


>UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;
           Brugia malayi|Rep: Cahepsin L-like cysteine protease -
           Brugia malayi (Filarial nematode worm)
          Length = 371

 Score = 93.9 bits (223), Expect = 2e-18
 Identities = 61/163 (37%), Positives = 85/163 (52%), Gaps = 11/163 (6%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG    A+++++K+ G+ TE+ Y  Y G    C   N T  T       +   +E  L+ 
Sbjct: 210 GGLMMEAFEYVVKNDGIDTEKSYP-YQGYQNTCRYSNSTRGTTAYAGKLLPEGDELQLQA 268

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYG-----VLNGHK 351
           A+   GPISVA+DA  K   FY  G++   KC  +   + HA+LAVGYG     + NG K
Sbjct: 269 AIATIGPISVAVDA--KLMKFYRRGIFSTSKCTTR---MGHALLAVGYGTEEVKLQNGTK 323

Query: 352 ----YWLVKNSWSNMWGNDGYV-LMSMRENNCGVQSAPTYVLI 465
               YWL+KNSWS  WG  GY+ L   +EN CG+     Y L+
Sbjct: 324 KSVDYWLLKNSWSKRWGIGGYLKLARNQENMCGIGFYACYPLV 366


>UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           L-like proteinase" precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 315

 Score = 93.9 bits (223), Expect = 2e-18
 Identities = 52/149 (34%), Positives = 80/149 (53%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    A+ ++ +HGL +E  Y  Y G+D  C       ++ I+G+V + T  E+AL  A
Sbjct: 175 GGLMTDAFNYVKRHGLSSESQYA-YTGRDDRCKNVENKPLSSISGYVELETT-EDALASA 232

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           +   GP+S+A+DA   T+  Y  G++    C+     L+H VLAVGY        ++VKN
Sbjct: 233 VASVGPVSIAVDA--DTWQLYGGGLFNNKNCRTN---LNHGVLAVGYT----KDAFIVKN 283

Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTY 456
           SW   WG  GY+ ++  EN CG+    +Y
Sbjct: 284 SWGTSWGEQGYIRVARGENLCGINLMNSY 312


>UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep:
           Dvir_CG5367 - Drosophila virilis (Fruit fly)
          Length = 298

 Score = 93.9 bits (223), Expect = 2e-18
 Identities = 43/138 (31%), Positives = 75/138 (54%)
 Frame = +1

Query: 52  GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAA 231
           GL    DY  Y  + G C   +  A+  +T W  +   +ENA++ A+   GP++V+I+A+
Sbjct: 168 GLMRSLDYK-YASKKGECQFVSELAVVNVTSWAILPAKDENAIQAAVAHIGPVAVSINAS 226

Query: 232 HKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYVLM 411
            KTF  YS G+Y +  C +    ++HA+L +G+       +W++KN W  +WG  G++ M
Sbjct: 227 PKTFQLYSEGIYDDVSCTS--TSVNHAMLLIGF----DKNFWILKNWWGELWGEAGFMRM 280

Query: 412 SMRENNCGVQSAPTYVLI 465
               N CG+ +   Y ++
Sbjct: 281 RKGINLCGIANYAAYAIV 298


>UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted,
           possible transmembrane domain near N-terminus; n=4;
           Cryptosporidium|Rep: Cryptopain-cysteine proteinase
           secreted, possible transmembrane domain near N-terminus
           - Cryptosporidium parvum Iowa II
          Length = 401

 Score = 93.5 bits (222), Expect = 3e-18
 Identities = 62/161 (38%), Positives = 85/161 (52%), Gaps = 9/161 (5%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYC---HIDNVTAITKITGWVNVTTNNENA 177
           GG    A+Q+ +K+  L T +DY  Y  ++  C     +N   I  +  +  V   N NA
Sbjct: 243 GGTMGLAFQYAIKNKYLCTNDDYP-YFAEEKTCMDSFCENYIEIP-VKAYKYVFPRNINA 300

Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV--LNGHK 351
           LK AL K+GPISVAI A    F FY +GV F+  C  KV   +H V+ VGY +      +
Sbjct: 301 LKTALAKYGPISVAIQADQTPFQFYKSGV-FDAPCGTKV---NHGVVLVGYDMDEDTNKE 356

Query: 352 YWLVKNSWSNMWGNDGYV---LMSMRENNCGVQSAPTYVLI 465
           YWLV+NSW   WG  GY+   L S ++  CG+   P Y +I
Sbjct: 357 YWLVRNSWGEAWGEKGYIKLALHSGKKGTCGILVEPVYPVI 397


>UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas
           foetus|Rep: TFCP2 protein - Tritrichomonas foetus
           (Trichomonas foetus)
          Length = 270

 Score = 93.5 bits (222), Expect = 3e-18
 Identities = 44/131 (33%), Positives = 67/131 (51%)
 Frame = +1

Query: 64  EEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAAHKTF 243
           EE+Y  Y G  G C  D  + ++ I        ++E  LK  +  +GP+S  +DA H +F
Sbjct: 138 EENYQ-YSGHKGACLYDEKSKVSNIVAVTMFPQSDEQNLKGHIAANGPVSCNVDAGHYSF 196

Query: 244 SFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYVLMSMRE 423
             Y  G+Y+   C+ +    +HA+  VGYGV    +YW+V+NSW   WG  GY+   +  
Sbjct: 197 QLYQGGIYWSWFCRTQYI-YNHAMGIVGYGVEGSEEYWIVRNSWGESWGEQGYIRYLLGS 255

Query: 424 NNCGVQSAPTY 456
           N C +    TY
Sbjct: 256 NVCNIADYVTY 266


>UniRef50_Q24E33 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 328

 Score = 93.1 bits (221), Expect = 3e-18
 Identities = 56/149 (37%), Positives = 81/149 (54%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG   RA++++  HG+ TEE+Y  Y  +DG C         KI  +  V   N + L  A
Sbjct: 193 GGLMPRAFRYVKAHGITTEEEYP-YTAKDGKCQTKQ--GQYKIKSFSTVPRGNCDKLAAA 249

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           + +  P+SV +DA +  F FY++GV+    CK K   L+H VLA GY       YW++KN
Sbjct: 250 IAQQ-PVSVGVDATN--FKFYTSGVF--DNCKKK---LNHGVLATGYTA----DYWIIKN 297

Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTY 456
           SW   WG +GY+ +  R N CGV +  +Y
Sbjct: 298 SWGTAWGQNGYINLK-RGNTCGVCNTASY 325


>UniRef50_A2F4G0 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 203

 Score = 93.1 bits (221), Expect = 3e-18
 Identities = 51/149 (34%), Positives = 77/149 (51%), Gaps = 5/149 (3%)
 Frame = +1

Query: 7   GGGEDFRAYQWIMKHGLPT---EEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTN-NEN 174
           GGG     Y+ IMK    T   + DY  Y  + G C  D++     I      TT  NE 
Sbjct: 48  GGGWPSGTYKSIMKQFNGTFILDSDYP-YTAKRGVCKFDSMPKAAPIMTTYGTTTKYNET 106

Query: 175 ALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKY 354
           AL LA+   G  +V++DA+  +F  Y +G+Y+EP C    + +D ++  VGYG      Y
Sbjct: 107 ALALAVSLVGVATVSVDASRTSFQLYQSGIYYEPDC--STETMDLSMACVGYGTEGTTNY 164

Query: 355 WLVKNSWSNMWGNDGYV-LMSMRENNCGV 438
           W+VKN + + WG  GY+ ++  + NNC +
Sbjct: 165 WIVKNCFGDKWGEQGYIRMIKDKNNNCAI 193


>UniRef50_Q22W19 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 332

 Score = 92.7 bits (220), Expect = 5e-18
 Identities = 55/153 (35%), Positives = 81/153 (52%), Gaps = 4/153 (2%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG+   AY++++++G+ TE DY  Y G +  C  D    + K   +V VT N+ + L +A
Sbjct: 187 GGDLPPAYKYVVQNGIETEADYP-YKGVNQKCAYDASKVVFKPKSFVQVTPNSPDQLAIA 245

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           L K  P+ + I+A  K F FY++G+     C      LDH VLAVGY   +    W+VKN
Sbjct: 246 LNKE-PVPICIEADQKAFQFYTSGI-ISSGCGTN---LDHCVLAVGYDADS----WIVKN 296

Query: 370 SWSNMWGNDGYVLMSMRENN----CGVQSAPTY 456
           SW   WG +GYV ++         CG+   P Y
Sbjct: 297 SWGASWGENGYVRIARTTAKGPGVCGIYEEPVY 329


>UniRef50_A2WPN4 Cluster: Putative uncharacterized protein; n=2;
           Oryza sativa (indica cultivar-group)|Rep: Putative
           uncharacterized protein - Oryza sativa subsp. indica
           (Rice)
          Length = 325

 Score = 92.3 bits (219), Expect = 6e-18
 Identities = 55/156 (35%), Positives = 78/156 (50%), Gaps = 7/156 (4%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTA--ITKITGWVNVTTNNENALK 183
           GG    A   +   G  T E+   Y G  G C +  +       ++G+  V  N+E  L 
Sbjct: 171 GGHSDTALNLVASRGGITSEEKYPYTGVQGSCDVGKLLFDHSASVSGFAAVPPNDERQLA 230

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWL 360
           LA+ +  P++V IDA+ + F FY  GVY  P C      ++HAV  VGY     G KYW+
Sbjct: 231 LAVARQ-PVTVYIDASAQEFQFYKGGVYKGP-CNP--GSVNHAVTIVGYCENFGGEKYWI 286

Query: 361 VKNSWSNMWGNDGYVLMS----MRENNCGVQSAPTY 456
            KNSWSN WG  GYV ++      +  CG+ ++P Y
Sbjct: 287 AKNSWSNDWGEQGYVYLAKDVWWPQGTCGLATSPFY 322


>UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep:
           Cathepsin L - Kudoa thyrsites
          Length = 300

 Score = 92.3 bits (219), Expect = 6e-18
 Identities = 47/132 (35%), Positives = 74/132 (56%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    A+ +++ +G+   +DY  Y  + G C   +   + +I+ +  V  N E+ ++ +
Sbjct: 166 GGLPEIAFLYVINNGIMKLKDYP-YTAKQGTCQY-SPEDVVRISSFKCVENNEESVME-S 222

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           +  +GP S+ I+AA ++F FY  G+Y +P   +    LDHAVL VGYG  N   YW VKN
Sbjct: 223 VANNGPNSIGINAASRSFQFYGGGIYSDPWASSY--PLDHAVLLVGYGYKNTENYWHVKN 280

Query: 370 SWSNMWGNDGYV 405
           SW   WG  GY+
Sbjct: 281 SWGPWWGEQGYI 292


>UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 291

 Score = 91.5 bits (217), Expect = 1e-17
 Identities = 43/120 (35%), Positives = 69/120 (57%), Gaps = 1/120 (0%)
 Frame = +1

Query: 82  YLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAAHKTFSFYSNG 261
           Y G DG C  D  TA+   + +V+V + +E  L   ++++G   V +D +  +F  YS+G
Sbjct: 168 YQGVDGACKFDAKTAMPVTSNFVSVPSGSERDLANYVYQYGVAVVVLDCSRISFQLYSSG 227

Query: 262 VYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYVLMSMRENN-CGV 438
           +Y +P C ++   LDHA+  VGY       YW+++NSW   WG  GY+ ++  +NN CGV
Sbjct: 228 IYSDPCCSSQ--NLDHAMNVVGY----SDSYWIIRNSWGTSWGESGYMRLAKDKNNMCGV 281


>UniRef50_Q0MYX5 Cluster: Putative cysteine protease; n=1; Emiliania
           huxleyi|Rep: Putative cysteine protease - Emiliania
           huxleyi
          Length = 276

 Score = 91.1 bits (216), Expect = 1e-17
 Identities = 54/148 (36%), Positives = 75/148 (50%), Gaps = 5/148 (3%)
 Frame = +1

Query: 28  AYQWIMK-HGLPTEEDYG--GYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFK 198
           A++WI + + L TE  Y      G  G C          +T   +V + +E+AL+ A+ K
Sbjct: 4   AFEWIAEGNPLCTESTYPYTSGAGLTGTCK-KACNGEVSLTSHKDVPSGDEDALRAAVAK 62

Query: 199 HGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV--LNGHKYWLVKNS 372
             P+SVAI+A    F  Y +GV     C     ELDH VL VGYG     G  YW +KNS
Sbjct: 63  Q-PVSVAIEADKSAFQLYQSGVIDSASCGK---ELDHGVLVVGYGTDTATGKDYWKIKNS 118

Query: 373 WSNMWGNDGYVLMSMRENNCGVQSAPTY 456
           W   WG +G+V +   +N CG+ S  +Y
Sbjct: 119 WGGTWGEEGFVRVVQGKNMCGISSQASY 146


>UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis
           thaliana|Rep: F9P14.12 protein - Arabidopsis thaliana
           (Mouse-ear cress)
          Length = 343

 Score = 90.2 bits (214), Expect = 2e-17
 Identities = 60/157 (38%), Positives = 81/157 (51%), Gaps = 6/157 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHID-NVTAITKITGWVNVTTNNENALK 183
           GG    A+++I  +G L TE DY  Y G +G C  + +   +  I G+  V   NE +L+
Sbjct: 193 GGLMETAFEFIKTNGGLATETDYP-YTGIEGTCDQEKSKNKVVTIQGYQKVA-QNEASLQ 250

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363
           +A  +  P+SV IDA    F  YS+GV F   C      L+H V  VGYGV    KYW+V
Sbjct: 251 IAAAQQ-PVSVGIDAGGFIFQLYSSGV-FTNYCGTN---LNHGVTVVGYGVEGDQKYWIV 305

Query: 364 KNSWSNMWGNDGYVLM----SMRENNCGVQSAPTYVL 462
           KNSW   WG +GY+ M    S     CG+    +Y L
Sbjct: 306 KNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASYPL 342


>UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza
           sativa|Rep: Os01g0347600 protein - Oryza sativa subsp.
           japonica (Rice)
          Length = 343

 Score = 90.2 bits (214), Expect = 2e-17
 Identities = 56/159 (35%), Positives = 81/159 (50%), Gaps = 9/159 (5%)
 Frame = +1

Query: 7   GGGEDFRAYQWIM-KHGLPTEEDYGGYLGQDGYCHIDNV--TAITKITGWVNVTTNNENA 177
           GGG   RA++ +  K G+  E DY  Y G  G C +D++      +I G+  V  N+E  
Sbjct: 188 GGGHTDRAFELVASKGGITAESDYR-YEGFQGKCRVDDMLFNHAARIGGYRAVPPNDERQ 246

Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGY--GVLNGHK 351
           L  A+ +  P++V IDA+   F FY +GV+  P   +     +HAV  VGY     +G K
Sbjct: 247 LATAVARQ-PVTVYIDASGPAFQFYKSGVFPGPCGASS----NHAVTLVGYCQDGASGKK 301

Query: 352 YWLVKNSWSNMWGNDGYVLMS----MRENNCGVQSAPTY 456
           YW+ KNSW   WG  GY+L+          CG+  +P Y
Sbjct: 302 YWVAKNSWGKTWGQQGYILLEKDVLQPHGTCGLAVSPFY 340


>UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27;
           Eukaryota|Rep: Cysteine proteinase GP-I - Zingiber
           officinale (Ginger)
          Length = 221

 Score = 89.8 bits (213), Expect = 3e-17
 Identities = 49/153 (32%), Positives = 75/153 (49%), Gaps = 4/153 (2%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG  +RA+Q+I+ +G    E++  Y G +G C       +  I  + NV +N+E +L+ A
Sbjct: 67  GGWPYRAFQYIINNGGINSEEHYPYTGTNGTCDTKENAHVVSIDSYRNVPSNDEKSLQKA 126

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           +  + P+SV +DAA + F  Y NG+ F   C       +H     G    N   YW VKN
Sbjct: 127 V-ANQPVSVTMDAAGRDFQLYRNGI-FTGSCNISA---NHYRTVGGRETENDKDYWTVKN 181

Query: 370 SWSNMWGNDGYVLMSMR----ENNCGVQSAPTY 456
           SW   WG  GY+ +          CG+  +P+Y
Sbjct: 182 SWGKNWGESGYIRVERNIAESSGKCGIAISPSY 214


>UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza
           sativa|Rep: Os09g0381400 protein - Oryza sativa subsp.
           japonica (Rice)
          Length = 362

 Score = 89.4 bits (212), Expect = 4e-17
 Identities = 54/131 (41%), Positives = 73/131 (55%), Gaps = 4/131 (3%)
 Frame = +1

Query: 25  RAYQWIMKHG-LPTEEDYGGYLGQDGYCH-IDNVTAITKITGWVNVTTNNENALKLALFK 198
           RAY+W++++G L TE DY  Y  + G C+   +     KITG+  V   NE AL+ A+ +
Sbjct: 214 RAYKWVVENGGLTTEADYP-YTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVAR 272

Query: 199 HGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV--LNGHKYWLVKNS 372
             P++VAI+       FY  GVY  P C  +   L HAV  VGYG    +G KYW +KNS
Sbjct: 273 Q-PVAVAIEVG-SGMQFYKGGVYTGP-CGTR---LAHAVTVVGYGTDASSGAKYWTIKNS 326

Query: 373 WSNMWGNDGYV 405
           W   WG  GY+
Sbjct: 327 WGQSWGERGYI 337


>UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1;
           Oryza sativa (japonica cultivar-group)|Rep: Putative
           uncharacterized protein - Oryza sativa subsp. japonica
           (Rice)
          Length = 326

 Score = 89.4 bits (212), Expect = 4e-17
 Identities = 54/134 (40%), Positives = 69/134 (51%), Gaps = 6/134 (4%)
 Frame = +1

Query: 73  YGGYLGQDGYCHID-NVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAAHKTFSF 249
           Y  Y      C  D N   I KI  +  V  N+E ALK A++  GP+SV I+A+++ F  
Sbjct: 182 YPAYEAVQEPCRFDPNKAPIVKIDSYSFVDPNDEEALKQAVYSQGPVSVLIEASYE-FMI 240

Query: 250 YSNGVYFEPKCKNKVDELDHAVLAVGYGVL-NGHKYWLVKNSWSNMWGNDGYVLMSMR-- 420
           Y  GV+  P C     EL+HAVL VGY    +G  YW+VKNSW   WG  GY+ M     
Sbjct: 241 YQGGVFSGP-CGT---ELNHAVLVVGYDETEDGTPYWIVKNSWGAGWGESGYIRMIRNIP 296

Query: 421 --ENNCGVQSAPTY 456
             E  CG+   P Y
Sbjct: 297 APEGICGIAMYPIY 310


>UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain;
           n=9; Cucujiformia|Rep: Digestive cysteine proteinase
           intestain - Leptinotarsa decemlineata (Colorado potato
           beetle)
          Length = 326

 Score = 89.4 bits (212), Expect = 4e-17
 Identities = 56/157 (35%), Positives = 78/157 (49%), Gaps = 5/157 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    A+ +++  G+  +  Y  Y G D  C  D    + KI G+ NV+ N+E  LK A
Sbjct: 177 GGLMSFAFDYVLDKGIEADSSYP-YKGIDTPCQYDAKKTVLKIKGYKNVS-NSEEELKKA 234

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYG----VLNGHKYW 357
           +   GP+SVAIDA       Y  G+     C +    L+H VLAVGYG    +    K+W
Sbjct: 235 VGTVGPVSVAIDA--DPIQLYFGGILDGLFCTHN---LNHGVLAVGYGEEDHLFGKKKFW 289

Query: 358 LVKNSWSNMWGNDGYVLMSMRENN-CGVQSAPTYVLI 465
            VKNSW   WG  GY  +    NN CG+    +Y ++
Sbjct: 290 KVKNSWGKDWGEQGYFRIKRDANNLCGIADKASYPIL 326


>UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2;
           Arabidopsis thaliana|Rep: Putative cysteine proteinase -
           Arabidopsis thaliana (Mouse-ear cress)
          Length = 365

 Score = 89.0 bits (211), Expect = 6e-17
 Identities = 54/154 (35%), Positives = 79/154 (51%), Gaps = 5/154 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAI-TKITGWVNVTTNNENALKL 186
           GGE   A+++I+K+G  + E    Y  +   C  +   A  T+I G+  V ++NE AL L
Sbjct: 213 GGEFEEAFKYIIKNGGVSLETEYPYQVKKESCRANARRAPHTQIRGFQMVPSHNERAL-L 271

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
              +  P+SV IDA   +F  Y  GVY    C   V+   HAV  VGYG ++G  YW++K
Sbjct: 272 EAVRRQPVSVLIDARADSFGHYKGGVYAGLDCGTDVN---HAVTIVGYGTMSGLNYWVLK 328

Query: 367 NSWSNMWGNDGYVL----MSMRENNCGVQSAPTY 456
           NSW   WG +GY+     +   +  CG+     Y
Sbjct: 329 NSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAY 362


>UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 462

 Score = 89.0 bits (211), Expect = 6e-17
 Identities = 54/166 (32%), Positives = 87/166 (52%), Gaps = 8/166 (4%)
 Frame = +1

Query: 1   ARGGGEDFRAYQWI--MKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNEN 174
           A  GGE + AY  +  ++  L TEE+Y  YLG  G+C  +    I K+TG   +  ++ N
Sbjct: 296 ACAGGEGYDAYGKLAELQLNLTTEEEYP-YLGVSGHCQKNFGKTIGKVTGCYQIMRDSSN 354

Query: 175 A---LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDE---LDHAVLAVGYGV 336
               +  AL+K+GP+ + I A    F  Y+ G +   +    +D+    DH VL  G+  
Sbjct: 355 KDINVLRALYKYGPLMIYIRAGTAPFVAYTGGSFNNHEVCGGIDDHDKTDHGVLLTGWKT 414

Query: 337 LNGHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI*IS 474
           ++G  ++ + NSWS  WG +G+  +S  EN+CGV   P   L+ I+
Sbjct: 415 IDGVIHYEIMNSWSTFWGEEGFAYIS-SENDCGVPVMPLLPLVEIN 459


>UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 452

 Score = 89.0 bits (211), Expect = 6e-17
 Identities = 47/146 (32%), Positives = 79/146 (54%), Gaps = 3/146 (2%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLP-TEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GGE    Y+ + +  +  T ED   YLG   YC  +    +  + G   +  ++   LK 
Sbjct: 292 GGEHDEIYRILRESKMELTLEDEYPYLGVGSYCGKNFKHTVGYVKGCYKIPEHDNEKLKS 351

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKC--KNKVDELDHAVLAVGYGVLNGHKYWL 360
           ALF+HGP++V I A    F   ++ +Y    C   +KV ++DH+VL  G+  +NG   W 
Sbjct: 352 ALFEHGPLAVGIIADQDGFGTLTDNIYDNANCYVHDKV-KIDHSVLLTGWKRINGVDAWE 410

Query: 361 VKNSWSNMWGNDGYVLMSMRENNCGV 438
           + NSWS++WG+ G+  + M +++CG+
Sbjct: 411 IMNSWSDVWGDHGFGYIVMGDHDCGI 436


>UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-like
           cysteine peptidase; n=3; Trichomonas vaginalis|Rep: Clan
           CA, family C1, cathepsin L or K-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 320

 Score = 89.0 bits (211), Expect = 6e-17
 Identities = 47/131 (35%), Positives = 71/131 (54%), Gaps = 1/131 (0%)
 Frame = +1

Query: 61  TEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAAHKT 240
           T  DY  Y+ +   C  D   ++ K TG+  V   + +AL  A+ +    S+ IDA+  +
Sbjct: 188 TAADYP-YIARASICKFDKTKSVAKTTGFERVKPGSSDALIEAV-QTSVCSLLIDASINS 245

Query: 241 FSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYV-LMSM 417
           F  Y +G+Y + KC     +LDH V  VGYG  +G  YW+++NSW   WG  GY+ +++ 
Sbjct: 246 FMQYKSGIYDDTKCDPT--QLDHYVNLVGYGSESGINYWIIRNSWGEAWGESGYIRIINN 303

Query: 418 RENNCGVQSAP 450
             N CGV S P
Sbjct: 304 AANVCGVLSHP 314


>UniRef50_UPI0000E492F4 Cluster: PREDICTED: similar to cathepsin
           L-like proteinase; n=2; Strongylocentrotus
           purpuratus|Rep: PREDICTED: similar to cathepsin L-like
           proteinase - Strongylocentrotus purpuratus
          Length = 329

 Score = 88.2 bits (209), Expect = 1e-16
 Identities = 44/120 (36%), Positives = 72/120 (60%), Gaps = 2/120 (1%)
 Frame = +1

Query: 103 CHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKC 282
           C+  +  A+       +VT  NE+AL  A++   P+ VAIDA+  +F  Y +GVY +P C
Sbjct: 209 CNNASCKAVASSNVGKSVTQGNESALAEAVY-FTPVVVAIDASQPSFQLYVSGVYSDPNC 267

Query: 283 KNKVDELDHAVLAVGYGVLN-GHKYWLVKNSWSNMWGNDGYVLMSMRENN-CGVQSAPTY 456
            + +  LD ++L VGYGV + G +YW+ +N+W   WG++GY+ ++   NN CG+ +   Y
Sbjct: 268 SSTL--LDLSLLLVGYGVSSVGTEYWICRNTWGEEWGDNGYINIARNHNNMCGIATDAIY 325


>UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8;
           Magnoliophyta|Rep: Os12g0273800 protein - Oryza sativa
           subsp. japonica (Rice)
          Length = 504

 Score = 88.2 bits (209), Expect = 1e-16
 Identities = 54/136 (39%), Positives = 72/136 (52%), Gaps = 2/136 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAIT-KITGWVNVTTNNENALKL 186
           GGE   A+Q+I+ +G  T E    Y  +DG C       +   I G+ +V  N+E +L  
Sbjct: 190 GGEIDGAFQFILSNGGLTAEANYPYTAEDGRCKTTAAADVAASIRGYEDVPANDEPSLMK 249

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN-GHKYWLV 363
           A+    P+SVA+DA+   F FY  GV    +C      LDH V  +GYG  + G KYWLV
Sbjct: 250 AVAGQ-PVSVAVDASK--FQFYGGGV-MAGECGTS---LDHGVTVIGYGAASDGTKYWLV 302

Query: 364 KNSWSNMWGNDGYVLM 411
           KNSW   WG  GY+ M
Sbjct: 303 KNSWGTTWGEAGYLRM 318


>UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core
           eudicotyledons|Rep: Chymopapain precursor - Carica
           papaya (Papaya)
          Length = 352

 Score = 88.2 bits (209), Expect = 1e-16
 Identities = 54/154 (35%), Positives = 77/154 (50%), Gaps = 5/154 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHI-DNVTAITKITGWVNVTTNNENALKL 186
           GG    + Q++  +G+ T + Y  Y  +   C   D      KITG+  V +N E +  L
Sbjct: 199 GGYQTTSLQYVANNGVHTSKVYP-YQAKQYKCRATDKPGPKVKITGYKRVPSNCETSF-L 256

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
               + P+SV ++A  K F  Y +GV+  P C  K   LDHAV AVGYG  +G  Y ++K
Sbjct: 257 GALANQPLSVLVEAGGKPFQLYKSGVFDGP-CGTK---LDHAVTAVGYGTSDGKNYIIIK 312

Query: 367 NSWSNMWGNDGYVLMSMRENN----CGVQSAPTY 456
           NSW   WG  GY+ +  +  N    CGV  +  Y
Sbjct: 313 NSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYY 346


>UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole
           genome shotgun sequence; n=1; Tetraodon
           nigroviridis|Rep: Chromosome undetermined SCAF6860,
           whole genome shotgun sequence - Tetraodon nigroviridis
           (Green puffer)
          Length = 251

 Score = 87.8 bits (208), Expect = 1e-16
 Identities = 45/99 (45%), Positives = 59/99 (59%), Gaps = 1/99 (1%)
 Frame = +1

Query: 82  YLGQDGY-CHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAAHKTFSFYSN 258
           +L QD   C+ DN  A+  I  +  +   +E AL  A+   GPI+VAIDA+H +F FYS+
Sbjct: 97  FLQQDTQPCYYDNKRAVGTIRDYRFIPKGDEQALADAVATIGPITVAIDASHSSFLFYSS 156

Query: 259 GVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSW 375
           G+Y E  C    + L HAVL VGYG   G  YWL+KN W
Sbjct: 157 GIYEESNC--NPNNLSHAVLLVGYGSEGGQDYWLIKNRW 193


>UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core
           eudicotyledons|Rep: Cysteine proteinase -
           Mesembryanthemum crystallinum (Common ice plant)
          Length = 367

 Score = 87.8 bits (208), Expect = 1e-16
 Identities = 53/139 (38%), Positives = 74/139 (53%), Gaps = 5/139 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAIT-KITGWVNVTTNNENALKL 186
           GG   RA+++I + G  T E    Y  Q G C  + +   T  I G+ N+  + +  LK+
Sbjct: 190 GGTMGRAFEYIKQRGGITSEANYPYKAQAGMCKNNLIQRPTVSIDGYYNIRRSEDAVLKI 249

Query: 187 ALFKHGPISVAIDA---AHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN-GHKY 354
               H P+SVA+DA   +   + FY  GV+  P C  K   L+H V AVGYG  N G+ Y
Sbjct: 250 --LAHQPVSVAVDATTWSSLDWMFYFQGVFTGP-CGTK---LNHGVTAVGYGTTNDGYDY 303

Query: 355 WLVKNSWSNMWGNDGYVLM 411
           W++KNSW   WG  GY+ M
Sbjct: 304 WIIKNSWGETWGERGYMRM 322


>UniRef50_A2DKX2 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 317

 Score = 87.8 bits (208), Expect = 1e-16
 Identities = 55/149 (36%), Positives = 75/149 (50%), Gaps = 6/149 (4%)
 Frame = +1

Query: 10  GGEDFRAYQWIM-----KHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNEN 174
           GGE  RA  +I+     K GL  E DY       GYC  D    +TK    VN T  +E 
Sbjct: 166 GGEADRAVGYIVTDQDGKFGL--ESDYPYKSESMGYCEFDPSKGVTKALA-VNYT-RDEA 221

Query: 175 ALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKY 354
            +K+ +   GP+    D++ + F +Y  GVY+   C      +DH +  VGYG  NG  Y
Sbjct: 222 DMKVRVATTGPLICGYDSSSEDFEYYYQGVYYSDDCS--AWGIDHWMTIVGYGTYNGDDY 279

Query: 355 WLVKNSWSNMWGNDGYVLMSM-RENNCGV 438
           WLVKNS+   WG  GY +++  R+  CGV
Sbjct: 280 WLVKNSFGKGWGQQGYGMVARNRDGACGV 308


>UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35;
           Viridiplantae|Rep: Cysteine proteinase 15A precursor -
           Pisum sativum (Garden pea)
          Length = 363

 Score = 87.8 bits (208), Expect = 1e-16
 Identities = 53/153 (34%), Positives = 79/153 (51%), Gaps = 8/153 (5%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG    A++++++ G +  E+DY  Y G+DG C  D    +  ++ + +V T +E+ +  
Sbjct: 205 GGLMNNAFEYLLESGGVVQEKDYA-YTGRDGSCKFDKSKVVASVSNF-SVVTLDEDQIAA 262

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-------LNG 345
            L K+GP++VAI+AA      Y +GV     C      LDH VL VG+G        L  
Sbjct: 263 NLVKNGPLAVAINAAW--MQTYMSGVSCPYVCAKS--RLDHGVLLVGFGKGAYAPIRLKE 318

Query: 346 HKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQS 444
             YW++KNSW   WG  GY  +    N CGV S
Sbjct: 319 KPYWIIKNSWGQNWGEQGYYKICRGRNVCGVDS 351


>UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F
            like protease; n=1; Nasonia vitripennis|Rep: PREDICTED:
            similar to cathepsin F like protease - Nasonia
            vitripennis
          Length = 1036

 Score = 87.4 bits (207), Expect = 2e-16
 Identities = 56/159 (35%), Positives = 74/159 (46%), Gaps = 7/159 (4%)
 Frame = +1

Query: 10   GGEDFRAYQWIMK-HGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
            GG    AY+ I +  GL  E DY  Y  +D  CH +       I   +N+T+N E  +  
Sbjct: 881  GGLPDTAYRAIEELGGLELESDYP-YDAEDEKCHFNKNKVKVNIVSGLNITSN-ETQMAQ 938

Query: 187  ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL------NGH 348
             L K+GP+S+ I+A      FY  GV    K     D LDH VL VGYGV          
Sbjct: 939  WLVKNGPMSIGINA--NAMQFYMGGVSHPFKFLCSPDSLDHGVLIVGYGVKFYPIFKKTM 996

Query: 349  KYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
             YW++KNSW   WG  GY  +   +  CGV    T  ++
Sbjct: 997  PYWIIKNSWGPRWGEQGYYRVYRGDGTCGVNKMVTSAVV 1035


>UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin S
           preproprotein; n=2; Tribolium castaneum|Rep: PREDICTED:
           similar to cathepsin S preproprotein - Tribolium
           castaneum
          Length = 525

 Score = 87.4 bits (207), Expect = 2e-16
 Identities = 48/144 (33%), Positives = 72/144 (50%), Gaps = 2/144 (1%)
 Frame = +1

Query: 31  YQWIMK-HGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGP 207
           Y++I+K  G+  ++DY  Y    G C             +  +T  +E  L+  +   GP
Sbjct: 382 YKYIVKSEGINYDQDYR-YQSAPGTCRFRADKPKITFRKYAYLTAISEEDLQWIVANVGP 440

Query: 208 ISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMW 387
           ++V+ D   K F  YS GV++   C        H  + VGYG  NG  +WLVKNS+   W
Sbjct: 441 VTVSFDGRGKQFKSYSGGVFYNKTCTRMKT---HVAVLVGYGTENGEDFWLVKNSYGPQW 497

Query: 388 GNDGYVLMSM-RENNCGVQSAPTY 456
           G DGYV ++  R N+CG+ +  TY
Sbjct: 498 GLDGYVKIARNRNNHCGITNRITY 521



 Score = 40.3 bits (90), Expect = 0.025
 Identities = 26/101 (25%), Positives = 43/101 (42%), Gaps = 1/101 (0%)
 Frame = +1

Query: 31  YQWIMK-HGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGP 207
           Y++I+  +G+  ++DY  Y    G C             +  +   +E  L+  + K GP
Sbjct: 106 YEYIINSNGINYDQDYR-YESAPGSCRFKPNKPTVTFKKYAYLAEISEEDLQWIVAKIGP 164

Query: 208 ISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGY 330
            +V+ DA       YS G+Y+   C      L H  + VGY
Sbjct: 165 ATVSFDARGSQLKSYSGGIYYNRTC---TKTLTHVAVVVGY 202


>UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis
           zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa
           zea single nucleocapsid nuclear polyhedrosis virus)
          Length = 367

 Score = 87.4 bits (207), Expect = 2e-16
 Identities = 47/144 (32%), Positives = 76/144 (52%), Gaps = 1/144 (0%)
 Frame = +1

Query: 10  GGEDFRAYQ-WIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG    A+Q  ++  G+ TE DY  Y G +  C +DN     K+         +EN LK 
Sbjct: 220 GGLMHLAFQELLLMGGVETEADYP-YQGSEQMCTLDNRKIAVKLNSCFKYDIRDENKLKE 278

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
            ++  GP+++A+DA       Y  G+  +  C   + +L+HAVL +G+G+ N   YW++K
Sbjct: 279 LVYTTGPVAIAVDAMD--IINYRRGILNQ--CH--IYDLNHAVLLIGWGIENNVPYWIIK 332

Query: 367 NSWSNMWGNDGYVLMSMRENNCGV 438
           NSW   WG +G++ +    N CG+
Sbjct: 333 NSWGEDWGENGFLRVRRNVNACGL 356


>UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Rep:
           Cysteine protease - Solanum lycopersicum (Tomato)
           (Lycopersicon esculentum)
          Length = 345

 Score = 87.0 bits (206), Expect = 2e-16
 Identities = 52/134 (38%), Positives = 74/134 (55%), Gaps = 2/134 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG    A+ +I+++G +  E DY  YLGQ   C     TA  +I+ +  V    E +L  
Sbjct: 195 GGFMTNAFDFIIENGGISRESDYE-YLGQQYTCRSQEKTAAVQISSY-QVVPEGETSLLQ 252

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLV 363
           A+ K  P+S+ I AA +   FY+ G Y +  C    D ++HAV A+GYG    G KYWL+
Sbjct: 253 AVTKQ-PVSIGI-AASQDLQFYAGGTY-DGNC---ADRINHAVTAIGYGTDEEGQKYWLL 306

Query: 364 KNSWSNMWGNDGYV 405
           KNSW   WG +GY+
Sbjct: 307 KNSWGTSWGENGYM 320


>UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza
           sativa|Rep: Putative cysteine protease - Oryza sativa
           subsp. japonica (Rice)
          Length = 357

 Score = 86.6 bits (205), Expect = 3e-16
 Identities = 51/159 (32%), Positives = 81/159 (50%), Gaps = 9/159 (5%)
 Frame = +1

Query: 7   GGGEDFRAYQWIM-KHGLPTEEDYGGYLGQDGYCHIDNV--TAITKITGWVNVTTNNENA 177
           GGG    A+Q ++ K G+  E +Y  Y G  G C +D++      ++ G+  V   +E  
Sbjct: 199 GGGHTDAAFQLVVDKGGITAESEYR-YEGYKGRCRVDDMLFNHAARVGGYRAVPPADERQ 257

Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGY--GVLNGHK 351
           L  A+ +  P++  +DA+   F FY +GV+  P+      + +HAV  VGY     +G K
Sbjct: 258 LATAVARQ-PVTAYVDASGPAFQFYGSGVFPGPR-GTAAPKPNHAVTLVGYCQDGASGKK 315

Query: 352 YWLVKNSWSNMWGNDGYVLM----SMRENNCGVQSAPTY 456
           YW+ KNSW   WG  GY+L+    +     CG+  +P Y
Sbjct: 316 YWIAKNSWGKTWGQQGYILLEKDVASPHGTCGLAVSPFY 354


>UniRef50_O16454 Cluster: Temporarily assigned gene name protein
           196; n=4; Bilateria|Rep: Temporarily assigned gene name
           protein 196 - Caenorhabditis elegans
          Length = 477

 Score = 86.6 bits (205), Expect = 3e-16
 Identities = 54/152 (35%), Positives = 74/152 (48%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    AY+ I++ G    ED   Y G+   CH+        I G V +  ++E  ++  
Sbjct: 328 GGLPSNAYKEIIRMGGLEPEDAYPYDGRGETCHLVRKDIAVYINGSVELP-HDEVEMQKW 386

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           L   GPIS+ ++A   T  FY +GV    K   +   L+H VL VGYG      YW+VKN
Sbjct: 387 LVTKGPISIGLNA--NTLQFYRHGVVHPFKIFCEPFMLNHGVLIVGYGKDGRKPYWIVKN 444

Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           SW   WG  GY  +   +N CGVQ   T  L+
Sbjct: 445 SWGPNWGEAGYFKLYRGKNVCGVQEMATSALV 476


>UniRef50_Q9VNK7 Cluster: CG1075-PA; n=1; Drosophila
           melanogaster|Rep: CG1075-PA - Drosophila melanogaster
           (Fruit fly)
          Length = 274

 Score = 86.2 bits (204), Expect = 4e-16
 Identities = 41/126 (32%), Positives = 68/126 (53%), Gaps = 1/126 (0%)
 Frame = +1

Query: 28  AYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGP 207
           A+ +   +G+ ++E Y  Y  ++G C  D   +   +  +V +T+N+E  L   ++K GP
Sbjct: 120 AFNFKRDYGIASKESYP-YKPENGECRWDRRKSTGTLREYVTLTSNDERELAKVVYKIGP 178

Query: 208 ISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLVKNSWSNM 384
           + V+ID  H+ F  Y  G+   P C+N   +L H+VL VG+        YW++KNS+   
Sbjct: 179 VEVSIDHLHEEFDQYFGGILRTPSCRNTNYDLKHSVLLVGFETHPKWGDYWIIKNSYGTE 238

Query: 385 WGNDGY 402
           WG  GY
Sbjct: 239 WGESGY 244


>UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase
           precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin
           L-like cysteine proteinase precursor - Acanthoscelides
           obtectus (Bean weevil)
          Length = 321

 Score = 86.2 bits (204), Expect = 4e-16
 Identities = 48/113 (42%), Positives = 65/113 (57%), Gaps = 2/113 (1%)
 Frame = +1

Query: 133 KITGWVNVTTNNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFE-PKCKNKVDELDH 309
           KITG+  V+  +E  L  A+   GPIS+A+D  H    FY  G+  +   CKN   +L+H
Sbjct: 215 KITGYQAVSKGDEVVLAQAVATIGPISIALDGNH--IMFYRRGIVSKWCGCKNSEKDLNH 272

Query: 310 AVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYV-LMSMRENNCGVQSAPTYVLI 465
            VL VGYG  +G  YW+VKNSW  +WG  GY  L     N CGV + P+Y ++
Sbjct: 273 GVLLVGYG--DG--YWIVKNSWGRIWGEQGYFRLKKDAGNTCGVATWPSYPIL 321


>UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Lymantria
           dispar multicapsid nuclear polyhedrosis virus (LdMNPV)
          Length = 356

 Score = 86.2 bits (204), Expect = 4e-16
 Identities = 56/145 (38%), Positives = 77/145 (53%), Gaps = 2/145 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTA-ITKITGWVNVTTNNENALK 183
           GG    A++ IM+ G + TE DY  ++G++  C +D     +  + G       NE  LK
Sbjct: 208 GGLLHTAFEEIMRMGGVQTELDYP-FVGRNRRCGLDRHRPYVVSLVGCYRYVMVNEEKLK 266

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363
             L   GPI +AIDAA      Y  GV     C+N  + L+HAVL VGYGV NG  YW+ 
Sbjct: 267 DLLRAVGPIPMAIDAAD--IVNYYRGVI--SSCEN--NGLNHAVLLVGYGVENGVPYWVF 320

Query: 364 KNSWSNMWGNDGYVLMSMRENNCGV 438
           KN+W + WG +GY  +    N CG+
Sbjct: 321 KNTWGDDWGENGYFRVRQNVNACGM 345


>UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila
           melanogaster|Rep: LD36817p - Drosophila melanogaster
           (Fruit fly)
          Length = 352

 Score = 85.8 bits (203), Expect = 5e-16
 Identities = 48/161 (29%), Positives = 84/161 (52%), Gaps = 8/161 (4%)
 Frame = +1

Query: 7   GGGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTA-------ITKITGWVNVTTN 165
           GG +++  +++I  HG+     Y  Y   +  C   N TA       + KI  +  +T  
Sbjct: 197 GGFQEY-GFEYIRDHGVTLANKYP-YTQTEMQCR-QNETAGRPPRESLVKIRDYATITPG 253

Query: 166 NENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNG 345
           +E  +K  +   GP++ +++A   +F  YS G+Y + +C     EL+H+V  VGYG  NG
Sbjct: 254 DEEKMKEVIATLGPLACSMNADTISFEQYSGGIYEDEECNQ--GELNHSVTVVGYGTENG 311

Query: 346 HKYWLVKNSWSNMWGNDGYV-LMSMRENNCGVQSAPTYVLI 465
             YW++KNS+S  WG  G++ ++      CG+ S  +Y ++
Sbjct: 312 RDYWIIKNSYSQNWGEGGFMRILRNAGGFCGIASECSYPIL 352


>UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:
           Viral cathepsin - Cydia pomonella granulosis virus
           (CpGV) (Cydia pomonella granulovirus)
          Length = 333

 Score = 85.8 bits (203), Expect = 5e-16
 Identities = 53/145 (36%), Positives = 73/145 (50%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    A + I++ G     +   Y G DG C          I+G       NEN L+  
Sbjct: 188 GGLMHWALESILQEGGVVSAENEPYYGFDGVCKKSPFEL--SISGSRRYVLQNENKLREL 245

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           L  +GPISVAID +      Y  G+     C+N  + L+HAVL VGYGV N   YW++KN
Sbjct: 246 LVVNGPISVAIDVSD--LINYKAGI--ADICENN-EGLNHAVLLVGYGVKNDVPYWILKN 300

Query: 370 SWSNMWGNDGYVLMSMRENNCGVQS 444
           SW   WG +GY  +   +N+CG+ +
Sbjct: 301 SWGAEWGEEGYFRVQRDKNSCGMMN 325


>UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza
           sativa|Rep: Os09g0497500 protein - Oryza sativa subsp.
           japonica (Rice)
          Length = 349

 Score = 85.4 bits (202), Expect = 7e-16
 Identities = 58/168 (34%), Positives = 81/168 (48%), Gaps = 18/168 (10%)
 Frame = +1

Query: 7   GGGEDFRAYQWIM-KHGLPTEEDYGGYLGQDGYCHIDNVT-AITKITGWVNVTTNNENAL 180
           GGG    A+++++  HGL TE  Y  Y   +G C    +  +   I G+ NVT ++E  L
Sbjct: 185 GGGYMSWAFEFVVGNHGLTTEASYP-YHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDL 243

Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYG--------- 333
             A     P+SVA+D     F  Y +GVY  P C     +++H V  VGYG         
Sbjct: 244 ARAAAAQ-PVSVAVDGGSFMFQLYGSGVYTGP-C---TADVNHGVTVVGYGESEPKTDGG 298

Query: 334 --VLNGHKYWLVKNSWSNMWGNDGYVLM-----SMRENNCGVQSAPTY 456
                G KYW+VKNSW   WG+ GY+LM      +    CG+   P+Y
Sbjct: 299 GAAKGGEKYWIVKNSWGAEWGDAGYILMQRDVAGLASGLCGIALLPSY 346


>UniRef50_Q22RC9 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 389

 Score = 85.4 bits (202), Expect = 7e-16
 Identities = 44/114 (38%), Positives = 63/114 (55%)
 Frame = +1

Query: 103 CHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKC 282
           C    V    KI  W    + +E+++K  LF+ GP+SVA+DA++    FY  G+   PK 
Sbjct: 255 CRQGQVPIAAKIEDW-KALSKDEDSIKQQLFEIGPLSVALDASY--LQFYKKGIS-APKF 310

Query: 283 KNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQS 444
            +K   L+HAVL  GYG+ NG ++W VKNSW   WG  GY  +      CG+ +
Sbjct: 311 CSKTT-LNHAVLLTGYGIDNGVEFWNVKNSWGAKWGEQGYFRLKRGVGMCGINT 363


>UniRef50_A3EXS2 Cluster: Cathepsin L-like cysteine proteinase-like
           protein; n=1; Maconellicoccus hirsutus|Rep: Cathepsin
           L-like cysteine proteinase-like protein -
           Maconellicoccus hirsutus (hibiscus mealybug)
          Length = 253

 Score = 85.4 bits (202), Expect = 7e-16
 Identities = 47/146 (32%), Positives = 72/146 (49%), Gaps = 3/146 (2%)
 Frame = +1

Query: 10  GGEDFRAYQWIM-KHGLPTEEDY-GGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALK 183
           GG     Y+++    G+  E+ Y   +   +  C  D+  +   I  +    TN E  LK
Sbjct: 100 GGNLENTYKYVNHSRGIEKEDSYRDNFRHINSRCQYDSTKSAVSIKNFSRCQTN-EAHLK 158

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363
           + +    P+SV I+   ++F  Y   +Y +P+C N   E  +AVL VGYG  N   YWL+
Sbjct: 159 MQVVGR-PVSVYINPTLESFKHYKGDIYDDPQCDNSRHESSYAVLVVGYGTDNNTDYWLI 217

Query: 364 KNSWSNMWGNDGYVLMSMRENN-CGV 438
           KNS    WG  GY+ ++   NN CG+
Sbjct: 218 KNSLGTSWGEKGYMRLARNRNNLCGI 243


>UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus
           tauri|Rep: Cysteine protease-1 - Ostreococcus tauri
          Length = 430

 Score = 85.0 bits (201), Expect = 9e-16
 Identities = 55/166 (33%), Positives = 85/166 (51%), Gaps = 16/166 (9%)
 Frame = +1

Query: 7   GGGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTA-ITKITGWVNVTTNNENALK 183
           GG  D+ A++WI+K+G    E    Y  +   C+   +   +  I G+ +V   +E  L+
Sbjct: 265 GGLMDY-AFRWIVKNGGIDSEFQYPYSAEALACNRWKLQLHVATIDGFKDVPPGDEKELE 323

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGH----- 348
            A+ +  P+S+AI+A  K+F  Y  GVY   +C ++V   DH VL VGYG  + H     
Sbjct: 324 KAVSQQ-PVSIAIEADTKSFQLYDGGVYDSKECGSQV---DHGVLVVGYGFDDTHHNATK 379

Query: 349 ------KYWLVKNSWSNMWGNDGYVLMSMR----ENNCGVQSAPTY 456
                  +W VKNSW   WG  G++ M+ R       CG+ +AP+Y
Sbjct: 380 HHKRHRHFWKVKNSWGGTWGEGGFIRMARRISDETGQCGITTAPSY 425


>UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46;
           Eukaryota|Rep: Cysteine proteinase 1 precursor - Zea
           mays (Maize)
          Length = 371

 Score = 85.0 bits (201), Expect = 9e-16
 Identities = 55/156 (35%), Positives = 76/156 (48%), Gaps = 11/156 (7%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG    A+ ++ K G L +E+DY  Y G DG C  D    +  +  + +V + +E  +  
Sbjct: 210 GGLMTTAFSYLQKAGGLESEKDYP-YTGSDGKCKFDKSKIVASVQNF-SVVSVDEAQISA 267

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-------LNG 345
            L KHGP+++ I+AA+     Y  GV     C      LDH VL VGYG        L  
Sbjct: 268 NLIKHGPLAIGINAAY--MQTYIGGVSCPYICGR---HLDHGVLLVGYGASGFAPIRLKD 322

Query: 346 HKYWLVKNSWSNMWGNDGYVLM---SMRENNCGVQS 444
             YW++KNSW   WG +GY  +   S   N CGV S
Sbjct: 323 KPYWIIKNSWGENWGENGYYKICRGSNVRNKCGVDS 358


>UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3;
           Schistosoma japonicum|Rep: Clone ZZD209 mRNA sequence -
           Schistosoma japonicum (Blood fluke)
          Length = 339

 Score = 84.6 bits (200), Expect = 1e-15
 Identities = 48/147 (32%), Positives = 74/147 (50%), Gaps = 2/147 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG  F  + ++   GL TE+ Y  + G+D  C  ++   + +  G+       E  LK A
Sbjct: 187 GGYTFTLFIYLQSFGLETEQMYP-FTGEDQDCMANSSDVVVQSIGYKFHRHGYETILKWA 245

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN-GHKYWLVK 366
           L+  GP  ++++   K F  Y +G+Y    C +    L+ ++L VGYG  N G  YW+V+
Sbjct: 246 LYNEGPYVISMNIDEK-FLHYKSGIYQSDTCTHY--NLNQSMLLVGYGYDNDGIDYWIVQ 302

Query: 367 NSWSNMWGNDGYVLMSMRE-NNCGVQS 444
           NSW   WG  GYV +     N CG+ S
Sbjct: 303 NSWGKKWGESGYVKVRRNNWNMCGIAS 329


>UniRef50_Q23H32 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 365

 Score = 84.6 bits (200), Expect = 1e-15
 Identities = 49/157 (31%), Positives = 72/157 (45%), Gaps = 7/157 (4%)
 Frame = +1

Query: 10  GGEDFRAYQWIMK--HGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALK 183
           GG+   A   +M    G+   +DY         C  D    +    G+ N+  NNE A+K
Sbjct: 199 GGDPEPALDCVMNVLKGIMKNQDYPYQAITRKECDHDQSKNVFSPDGYENIPINNELAIK 258

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363
            A+ +  PIS  I  + + F FY  G+  E   +      DH +  VGYG  NG +YW++
Sbjct: 259 EAVSRQ-PISACISGSSQNFKFYKGGIADEKLLECDPQYTDHCLGIVGYGSENGKQYWIL 317

Query: 364 KNSWSNMWGNDGYVLM-----SMRENNCGVQSAPTYV 459
           KNSW   WG  GY+ +     S  +  CG+ + P  V
Sbjct: 318 KNSWGENWGEKGYIRLLRSDSSNTQGTCGIATEPRIV 354


>UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|Rep:
           CG4847-PD, isoform D - Drosophila melanogaster (Fruit
           fly)
          Length = 420

 Score = 84.6 bits (200), Expect = 1e-15
 Identities = 46/154 (29%), Positives = 79/154 (51%), Gaps = 2/154 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWI--MKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALK 183
           GG    A+ +I  ++ G+  E  Y  Y+   G C  D   +   + G+  +   +E  LK
Sbjct: 271 GGFQEAAFCFIDEVQKGVSQEGAYP-YIDNKGTCKYDGSKSGATLQGFAAIPPKDEEQLK 329

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363
             +   GP++ +++   +T   Y+ G+Y + +C NK  E +H++L VGYG   G  YW+V
Sbjct: 330 KVVATLGPVACSVNGL-ETLKNYAGGIYNDDEC-NK-GEPNHSILVVGYGSEKGQDYWIV 386

Query: 364 KNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           KNSW + WG  GY  +   +N C +    +Y ++
Sbjct: 387 KNSWDDTWGEKGYFRLPRGKNYCFIAEECSYPVV 420


>UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia
           medanensis|Rep: Sui m 1 allergen - Suidasia medanensis
          Length = 336

 Score = 84.2 bits (199), Expect = 2e-15
 Identities = 47/155 (30%), Positives = 76/155 (49%), Gaps = 3/155 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITK--ITGWVNVTTN-NENAL 180
           GG    AY ++ + GL  E  Y  Y  +DG C    V    +  ++    +  N  +  +
Sbjct: 185 GGNPIIAYAYVQQTGLVEESAYP-YQARDGQCQSSTVNGHQRYHVSAGRELPFNATDETI 243

Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360
             +L + GP++V I A+   F FY NGV    +  ++  +++HAV  VG+G  +G  YW+
Sbjct: 244 MNSLHQIGPMAVLIFASDNEFRFYRNGVIQNLRPNSR--QINHAVTLVGWGTEDGQDYWI 301

Query: 361 VKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           VKNSW   WG  GY  +    N  G+ +   Y ++
Sbjct: 302 VKNSWGPSWGESGYFRLGRHHNLIGINNYVFYPVL 336


>UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n=1;
           Xenopus tropicalis|Rep: UPI00006A2275 UniRef100 entry -
           Xenopus tropicalis
          Length = 272

 Score = 83.8 bits (198), Expect = 2e-15
 Identities = 49/129 (37%), Positives = 72/129 (55%), Gaps = 1/129 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG+  +A++++ K+G+  E  Y  Y GQ G C       I  +    ++ + NE  L   
Sbjct: 146 GGKIEKAFKYMKKYGVMEESAYP-YTGQKGLCRKKQPGNIGVVKAIHDLPSGNETLLMNT 204

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKC-KNKVDELDHAVLAVGYGVLNGHKYWLVK 366
           +   GP+SV+I+A+ + F  + +GVY+ P C  NKV   +HAVL VGYG  NG  YWLVK
Sbjct: 205 VGTIGPVSVSINASSEKFHQFKSGVYYNPDCLPNKV---NHAVLVVGYGKENGMDYWLVK 261

Query: 367 NSWSNMWGN 393
           N     WG+
Sbjct: 262 NR-RVAWGS 269


>UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia
           circumcincta|Rep: Secreted cathepsin F - Teladorsagia
           circumcincta
          Length = 364

 Score = 83.4 bits (197), Expect = 3e-15
 Identities = 50/152 (32%), Positives = 69/152 (45%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    AY+ I++ G    ED   Y  +   C +        I G V +  ++E  ++  
Sbjct: 217 GGFPLDAYKEIVRMGGLEPEDKYPYEAKAEQCRLVPSDIAVYINGSVELP-HDEEKMRAW 275

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           L K GPIS+ I        FY  GV     C+  +  + H  L VGYGV     YW++KN
Sbjct: 276 LVKKGPISIGITV--DDIQFYKGGVSRPTTCR--LSSMIHGALLVGYGVEKNIPYWIIKN 331

Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           SW   WG DGY  M   EN C +   PT  ++
Sbjct: 332 SWGPNWGEDGYYRMVRGENACRINRFPTSAVV 363


>UniRef50_Q22A69 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 330

 Score = 83.4 bits (197), Expect = 3e-15
 Identities = 50/157 (31%), Positives = 75/157 (47%), Gaps = 5/157 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNV-----TTNNEN 174
           GG    A+ ++    L TE  Y  Y   DG C  +    +  +  +V++       + EN
Sbjct: 178 GGLMDNAFTYLESAKLETESAYP-YTAVDGSCKYNQSLGVVGVASFVDIEQGKTVADTEN 236

Query: 175 ALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKY 354
            + +AL   GP+SVAI+A      FY+ G+     C    + L+H VL VG G  NG  +
Sbjct: 237 TMGVALDNIGPLSVAINA--NNLQFYAGGISNPLICNP--NGLNHGVLIVGLGSENGKDF 292

Query: 355 WLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           W VKNSW   WG  GY  +   +  CG+  A +Y ++
Sbjct: 293 WKVKNSWGASWGEKGYFRIVRGKGKCGINRAVSYPVL 329


>UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:
           Viral cathepsin - Xestia c-nigrum granulosis virus
           (XnGV) (Xestia c-nigrum granulovirus)
          Length = 346

 Score = 83.4 bits (197), Expect = 3e-15
 Identities = 53/144 (36%), Positives = 73/144 (50%), Gaps = 1/144 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    A++ I++ G  + E    Y G DG C   N T   +++G       +E  L+  
Sbjct: 197 GGLMSWAFEGIIRAGGISYEAPYPYTGVDGVCK--NTTRYVQLSGCYAYDLRSEKKLRQV 254

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDE-LDHAVLAVGYGVLNGHKYWLVK 366
           L + GP+SVAID    T   Y +GV     C   VD  L+H VL VGYG  N  KYW +K
Sbjct: 255 LHEKGPVSVAIDVVDLTN--YKSGV--AKHCS--VDHGLNHGVLLVGYGQENDVKYWTLK 308

Query: 367 NSWSNMWGNDGYVLMSMRENNCGV 438
           NSW + WG  G+  +    N+CG+
Sbjct: 309 NSWGSDWGEQGFFRIKRDVNSCGI 332


>UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryza
           sativa (japonica cultivar-group)|Rep: Putative cysteine
           proteinase - Oryza sativa subsp. japonica (Rice)
          Length = 385

 Score = 83.0 bits (196), Expect = 4e-15
 Identities = 54/154 (35%), Positives = 77/154 (50%), Gaps = 11/154 (7%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEED-----YGGYLGQDGYCHID-NVTAITKITGWVNVTTNNE 171
           GG  + A+ +++K G+  +       Y  Y  Q   C  D       KI G   V + NE
Sbjct: 209 GGYPYDAFDYVIKTGISLDNRGNPPYYPPYENQKQKCRFDPRKPPFVKIDGECLVPSGNE 268

Query: 172 NALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGH- 348
            ALKLA+    P+SV I  + + F  Y  GV+  P C +  +  +H VL VGYGV   + 
Sbjct: 269 TALKLAVLSQ-PVSVVITISDE-FRSYRGGVFRGP-CGSNPNVDNHVVLVVGYGVTTDNI 325

Query: 349 KYWLVKNSWSNMWGNDGYVLMS---MRENN-CGV 438
           KYW++KNSW   WG  GY+ M    + +N  CG+
Sbjct: 326 KYWIIKNSWGKTWGEYGYIRMERDILNKNGICGI 359


>UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-like
           cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L or H-like cysteine
           peptidase - Trichomonas vaginalis G3
          Length = 435

 Score = 83.0 bits (196), Expect = 4e-15
 Identities = 47/121 (38%), Positives = 68/121 (56%), Gaps = 1/121 (0%)
 Frame = +1

Query: 55  LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAAH 234
           L  E+DY  Y+G  GYC  +N +    +     V   +   LK AL+ +GP++VAI A  
Sbjct: 297 LVLEDDYP-YIGLGGYCPTNNHSMNVIVKDCWQVEPKDVEQLKRALYLYGPVAVAI-ATD 354

Query: 235 KTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL-NGHKYWLVKNSWSNMWGNDGYVLM 411
            +F+ Y     F  K    +D+L HAV   G+GV  +G KYW ++NSWS+ WG DGY L+
Sbjct: 355 SSFAKYQGPGVFPGKSAT-LDDLTHAVTLTGWGVAKDGTKYWEIQNSWSDFWGIDGYGLI 413

Query: 412 S 414
           +
Sbjct: 414 N 414


>UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184,
           whole genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_184,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 331

 Score = 83.0 bits (196), Expect = 4e-15
 Identities = 53/155 (34%), Positives = 80/155 (51%), Gaps = 3/155 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    A+ + + +G+ TEE+Y  Y G D  C          I+ +V+V   + +AL  A
Sbjct: 184 GGWMDDAFDYTVNYGVTTEEEYP-YKGVDQPCP-SGFKKKHFISSFVDVEPLSSDALHEA 241

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           + K  P++VAI A    F  YS GVY        +D+L+H VLAVGY        + +KN
Sbjct: 242 IAKT-PVAVAIKADGILFQLYSGGVYSRSCTAKTIDDLNHGVLAVGY----AKDSYTIKN 296

Query: 370 SWSNMWGNDGYV---LMSMRENNCGVQSAPTYVLI 465
           SW   WG  GY+   L++ +E  CG+   P+Y ++
Sbjct: 297 SWGASWGEKGYMRLGLVAAKEGQCGIHWVPSYPVL 331


>UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3;
           Dictyostelium discoideum|Rep: Cysteine proteinase 1
           precursor - Dictyostelium discoideum (Slime mold)
          Length = 343

 Score = 83.0 bits (196), Expect = 4e-15
 Identities = 51/152 (33%), Positives = 75/152 (49%), Gaps = 7/152 (4%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDG-YCHIDNVTAITKITGWVNVTTNNENALK 183
           GG    AY +I+K+G + TE  Y  Y  + G  C+ ++     KI+ +  +   NE  + 
Sbjct: 192 GGLQPNAYNYIIKNGGIQTESSYP-YTAETGTQCNFNSANIGAKISNFTMIP-KNETVMA 249

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN-----GH 348
             +   GP+++A DA    + FY  GV+  P   N    LDH +L VGY   N       
Sbjct: 250 GYIVSTGPLAIAADAVE--WQFYIGGVFDIPCNPNS---LDHGILIVGYSAKNTIFRKNM 304

Query: 349 KYWLVKNSWSNMWGNDGYVLMSMRENNCGVQS 444
            YW+VKNSW   WG  GY+ +   +N CGV +
Sbjct: 305 PYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN 336


>UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis
           thaliana|Rep: Cysteine proteinase - Arabidopsis thaliana
           (Mouse-ear cress)
          Length = 348

 Score = 82.6 bits (195), Expect = 5e-15
 Identities = 51/137 (37%), Positives = 76/137 (55%), Gaps = 5/137 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITK---ITGWVNVTTNNENA 177
           GG   +A+++I+K+ G+ TE++Y     Q        +++  +   I+G+  V  NNE A
Sbjct: 193 GGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATISGYETVPMNNEEA 252

Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN-GHKY 354
           L  A+ +  P+SV I+     F  YS GV F  +C     +L HAV  VGYG+   G KY
Sbjct: 253 LLQAVSQQ-PVSVGIEGTGAAFRHYSGGV-FNGECGT---DLHHAVTIVGYGMSEEGTKY 307

Query: 355 WLVKNSWSNMWGNDGYV 405
           W+VKNSW   WG +GY+
Sbjct: 308 WVVKNSWGETWGENGYM 324


>UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2;
           Trypanosoma cruzi|Rep: Cysteine proteinase, putative -
           Trypanosoma cruzi
          Length = 392

 Score = 82.6 bits (195), Expect = 5e-15
 Identities = 49/142 (34%), Positives = 83/142 (58%), Gaps = 6/142 (4%)
 Frame = +1

Query: 28  AYQWIMKHGLPTE--EDYGGYLGQDGYCHID-NVTAITKITGWVNVTTNNENALKLALFK 198
           AY++  K G+ +E    Y  Y G+ G C  + +V A+ ++  +V + +N+++A+  AL K
Sbjct: 219 AYEYA-KQGITSEWVYSYTSYRGETGDCRNELDVIAVAQVQSYVKIPSNDQDAVMEALAK 277

Query: 199 HGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN--GHKYWLVKNS 372
           +GP+SV +DA +  +S Y+ G+ F     +K   ++H V  VGYG  N     YW+++NS
Sbjct: 278 NGPLSVNVDATY--WSAYAGGI-FNGCDYSKNITINHVVQLVGYGHDNKLNLDYWILRNS 334

Query: 373 WSNMWGNDGYV-LMSMRENNCG 435
           WS  WG +GY+ L+   +  CG
Sbjct: 335 WSPSWGENGYMRLLRTDKAECG 356


>UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19;
           Bilateria|Rep: Cathepsin F precursor - Homo sapiens
           (Human)
          Length = 484

 Score = 82.6 bits (195), Expect = 5e-15
 Identities = 56/155 (36%), Positives = 73/155 (47%), Gaps = 3/155 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMK-HGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG    AY  I    GL TE+DY  Y G    C+     A   I   V ++  NE  L  
Sbjct: 335 GGLPSNAYSAIKNLGGLETEDDYS-YQGHMQSCNFSAEKAKVYINDSVELS-QNEQKLAA 392

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVY--FEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360
            L K GPISVAI+A      FY +G+     P C   +  +DHAVL VGYG  +   +W 
Sbjct: 393 WLAKRGPISVAINAFG--MQFYRHGISRPLRPLCSPWL--IDHAVLLVGYGNRSDVPFWA 448

Query: 361 VKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           +KNSW   WG  GY  +      CGV +  +  ++
Sbjct: 449 IKNSWGTDWGEKGYYYLHRGSGACGVNTMASSAVV 483


>UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 343

 Score = 82.2 bits (194), Expect = 6e-15
 Identities = 45/146 (30%), Positives = 75/146 (51%)
 Frame = +1

Query: 7   GGGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GGGE   A ++   HG+ T  +Y  Y      C  + V  + +I+ W+   + +E A  +
Sbjct: 201 GGGEPVEALKYAQSHGITTAHNYPYYFWTTK-CR-ETVPTVARISSWMKAESEDEMAQIV 258

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
           AL  +GP+ V  + A     FY +G+  +P C     E  HA++ +GYG      YW++K
Sbjct: 259 AL--NGPMIVCANFATNKNRFYHSGIAEDPDCGT---EPTHALIVIGYGP----DYWILK 309

Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQS 444
           N++S +WG  GY+ +    N CG+ +
Sbjct: 310 NTYSKVWGEKGYMRVKRDVNWCGINT 335


>UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=15;
           Magnoliophyta|Rep: Cysteine proteinase RD19a precursor -
           Arabidopsis thaliana (Mouse-ear cress)
          Length = 368

 Score = 82.2 bits (194), Expect = 6e-15
 Identities = 55/165 (33%), Positives = 83/165 (50%), Gaps = 9/165 (5%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGY-CHIDNVTAITKITGWVNVTTNNENALK 183
           GG    A+++ +K G L  EEDY  Y G+DG  C +D    +  ++ + +V + +E  + 
Sbjct: 208 GGLMNSAFEYTLKTGGLMKEEDYP-YTGKDGKTCKLDKSKIVASVSNF-SVISIDEEQIA 265

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-------LN 342
             L K+GP++VAI+A +     Y  GV     C  +   L+H VL VGYG          
Sbjct: 266 ANLVKNGPLAVAINAGY--MQTYIGGVSCPYICTRR---LNHGVLLVGYGAAGYAPARFK 320

Query: 343 GHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI*IST 477
              YW++KNSW   WG +G+  +    N CGV S  + V   +ST
Sbjct: 321 EKPYWIIKNSWGETWGENGFYKICKGRNICGVDSMVSTVAATVST 365


>UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba
           histolytica|Rep: Cysteine protease 17 - Entamoeba
           histolytica
          Length = 420

 Score = 81.8 bits (193), Expect = 8e-15
 Identities = 38/101 (37%), Positives = 56/101 (55%), Gaps = 2/101 (1%)
 Frame = +1

Query: 142 GWVNVTTNNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLA 321
           G+  V   NE AL  A+ K G + + +D   K F  Y  G+Y+  +C  +   L HA+  
Sbjct: 283 GYALVLRGNERALMSAIHKFGVLGIGLDTRSKLFKHYRGGIYYNEECTRR--GLSHAMNL 340

Query: 322 VGYGVLN-GHKYWLVKNSWSNM-WGNDGYVLMSMRENNCGV 438
           VGYG    G KY++++NSW +  WG DGY+ +    N+CGV
Sbjct: 341 VGYGTTKEGQKYYIIRNSWGDWKWGEDGYMRLYRGGNHCGV 381


>UniRef50_Q24FN2 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 367

 Score = 81.4 bits (192), Expect = 1e-14
 Identities = 55/151 (36%), Positives = 81/151 (53%), Gaps = 4/151 (2%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGW--VNVTTNNENALK 183
           GG    AY++I   G+ ++++Y  Y+GQ+  C I++ +          +   TNN N   
Sbjct: 224 GGWPSVAYRYIKDQGISSQQNYP-YIGQNRNCSINSASPPKAFYAKDPIYYYTNNGNQTN 282

Query: 184 LALF--KHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357
           L  +     PISV +DA +  +S YS GV+    C N    ++HAVL VGY   +G+  W
Sbjct: 283 LVQYAVNQAPISVLVDATN--WSSYSQGVF--NNCGNVT--INHAVLLVGYDT-SGN--W 333

Query: 358 LVKNSWSNMWGNDGYVLMSMRENNCGVQSAP 450
           LVKNSW   WG  GY+ ++   N C VQS+P
Sbjct: 334 LVKNSWGTNWGQKGYITLA-PGNTCNVQSSP 363


>UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|Rep:
           Cathepsin L precursor - Schistosoma mansoni (Blood
           fluke)
          Length = 319

 Score = 81.4 bits (192), Expect = 1e-14
 Identities = 52/153 (33%), Positives = 75/153 (49%), Gaps = 1/153 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    AY+ I+K G    ED   Y  ++  CH+        I   VN+T  +E  L   
Sbjct: 169 GGLPSNAYESIIKMGGLMLEDNYPYDAKNEKCHLKTDGVAVYINSSVNLT-QDETELAAW 227

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHK-YWLVK 366
           L+ +  ISV ++A      FY +G+            LDHAVL VGYGV   ++ +W+VK
Sbjct: 228 LYHNSTISVGMNAL--LLQFYQHGISHPWWIFCSKYLLDHAVLLVGYGVSEKNEPFWIVK 285

Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           NSW   WG +GY  M   + +CG+ +  T  +I
Sbjct: 286 NSWGVEWGENGYFRMYRGDGSCGINTVATSAMI 318


>UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba
           histolytica|Rep: Cysteine protease 19 - Entamoeba
           histolytica
          Length = 324

 Score = 81.0 bits (191), Expect = 1e-14
 Identities = 45/152 (29%), Positives = 79/152 (51%), Gaps = 2/152 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYC-HIDNVTAITKITGWVNVTTNNENALKL 186
           GG    A ++   +G+ +E  +  Y   + +C     V  + K T   + T  ++  ++ 
Sbjct: 171 GGSIGGALKYAQDNGMQSESSFP-YKPFEQHCLQNQKVMKVKKYTH--SDTKGDDEKVRS 227

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLV 363
            +  +GP+  A+DA+  +F  Y  G+Y + KC++  D+   AV+ VGYG+  N  KY++V
Sbjct: 228 EILSYGPVGSAMDASRSSFLLYHGGIYNDKKCRS--DKSTIAVVIVGYGIDKNNGKYFIV 285

Query: 364 KNSWSNMWGNDGYVLMSMRENNCGVQSAPTYV 459
           +NSW   WG  GY  +S   N CG+ +   Y+
Sbjct: 286 RNSWGPYWGEQGYFRISSDNNLCGLSNDIYYI 317


>UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin L
           preproprotein; n=1; Monodelphis domestica|Rep:
           PREDICTED: similar to cathepsin L preproprotein -
           Monodelphis domestica
          Length = 356

 Score = 80.6 bits (190), Expect = 2e-14
 Identities = 44/128 (34%), Positives = 68/128 (53%), Gaps = 17/128 (13%)
 Frame = +1

Query: 133 KITGWVNVTTNNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCK-NKVDELDH 309
           +I  +V + + +E AL  A+   GP++VAI A   +F +Y  G Y EP+C+ + +  ++H
Sbjct: 230 RIRDYVTLPSGDERALMQAVATVGPVAVAIHAP-PSFRYYQGGPYIEPRCRLSYMSNMNH 288

Query: 310 AVLAVGYGVLNGHKY---------------WLVKNSWSNMWGNDGYVLMSM-RENNCGVQ 441
           A+L VGYG L   KY               W+ KNSW   WG+ GY+ +   R N CG+ 
Sbjct: 289 ALLVVGYGPLERSKYEEFGLQAYMHKDNKFWIAKNSWGEQWGDRGYIYIPKDRYNQCGIA 348

Query: 442 SAPTYVLI 465
           S   Y ++
Sbjct: 349 SNANYPIL 356


>UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gambiae
           str. PEST|Rep: ENSANGP00000018713 - Anopheles gambiae
           str. PEST
          Length = 559

 Score = 80.6 bits (190), Expect = 2e-14
 Identities = 50/162 (30%), Positives = 78/162 (48%), Gaps = 9/162 (5%)
 Frame = +1

Query: 7   GGGEDFRAYQWIMK-HGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALK 183
           GGG    A++ I +  GL  E DY         CH +   +  ++ G V++   NE  + 
Sbjct: 402 GGGYMDDAFKAIEQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAVDMP-KNETYIA 460

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVY--FEPKCKNKVDELDHAVLAVGYGVLNGHK-- 351
             L K+GPI++ ++A      FY  G+   + P C +K   +DH VL VGYG+       
Sbjct: 461 KYLIKNGPIAIGLNA--NAMQFYRGGISHPWHPLCNHK--SIDHGVLIVGYGIKEYPMFN 516

Query: 352 ----YWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
               YW++KNSW   WG  GY  +   +N+CGV    +  ++
Sbjct: 517 KTLPYWIIKNSWGPRWGEQGYYRIYRGDNSCGVSEMASSAIL 558


>UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 493

 Score = 80.6 bits (190), Expect = 2e-14
 Identities = 43/128 (33%), Positives = 66/128 (51%), Gaps = 3/128 (2%)
 Frame = +1

Query: 64  EEDYGGYLGQDG-YCHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAAHKT 240
           E DY  YLG    +C  +    +  +TG   +     + ++ A++  GP+ +AI+   + 
Sbjct: 355 ESDYP-YLGASSQFCDNNKDDYLGTVTGCYKIEQRTRSVME-AIYTFGPLGIAINVI-EP 411

Query: 241 FSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYVLMSMR 420
              Y+NGV  +  C     +L HAVL  G+  ++G   W VKNSWS  WG DGY+ + M 
Sbjct: 412 MMLYTNGVIDDETCTGAQSDLVHAVLLTGWAEIDGKLAWEVKNSWSTYWGWDGYIYIQME 471

Query: 421 E--NNCGV 438
           +   NCGV
Sbjct: 472 DQTKNCGV 479


>UniRef50_Q248G1 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 334

 Score = 80.2 bits (189), Expect = 3e-14
 Identities = 53/146 (36%), Positives = 84/146 (57%), Gaps = 3/146 (2%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKI--TGW-VNVTTNNENAL 180
           GG    A Q+++++G+   E Y  Y+   G C  D    + K    GW VN+   +E AL
Sbjct: 191 GGWPEEALQYVIEYGIVKSEVYP-YVAVQGKCR-DIPYDVPKYYPEGWYVNLDQTSE-AL 247

Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360
           K A+ K  P+SV +DA+  T+ FY +G+ F        D+L+HA++AVGY   +G+  W+
Sbjct: 248 KAAIAK-APVSVCVDAS--TWKFYKSGI-FSGCGPTTEDDLNHAIVAVGYDA-DGN--WI 300

Query: 361 VKNSWSNMWGNDGYVLMSMRENNCGV 438
           ++NSW+  WG +GY+ ++   N CGV
Sbjct: 301 IRNSWATKWGENGYIRLA-AGNTCGV 325


>UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61;
           Leishmania|Rep: Cysteine proteinase 2 precursor -
           Leishmania pifanoi
          Length = 444

 Score = 79.4 bits (187), Expect = 4e-14
 Identities = 53/155 (34%), Positives = 83/155 (53%), Gaps = 8/155 (5%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG---LPTEEDYGGYLGQDGY---CHIDNVTAIT--KITGWVNVTTN 165
           GG   +A+ W++++    L TE+ Y  Y+  +GY   C   +   +   +I G V + ++
Sbjct: 190 GGLMLQAFDWLLQNTNGHLHTEDSYP-YVSGNGYVPECSNSSEELVVGAQIDGHVLIGSS 248

Query: 166 NENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNG 345
            E A+   L K+GPI++A+DA+  +F  Y +GV     C  K  +L+H VL VGY +   
Sbjct: 249 -EKAMAAWLAKNGPIAIALDAS--SFMSYKSGVL--TACIGK--QLNHGVLLVGYDMTGE 301

Query: 346 HKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAP 450
             YW++KNSW   WG  GYV + M  N C +   P
Sbjct: 302 VPYWVIKNSWGGDWGEQGYVRVVMGVNACLLSEYP 336


>UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1;
           Bigelowiella natans|Rep: Digestive cysteine proteinase -
           Bigelowiella natans (Pedinomonas minutissima)
           (Chlorarachnion sp.(strain CCMP 621))
          Length = 360

 Score = 79.0 bits (186), Expect = 6e-14
 Identities = 40/108 (37%), Positives = 58/108 (53%), Gaps = 2/108 (1%)
 Frame = +1

Query: 127 ITKITGWVNVTTNNENALKLALFKHGPISVAIDAAH--KTFSFYSNGVYFEPKCKNKVDE 300
           +  IT W  V ++ +        KH P+SV+IDA        FY +GV   P+  +K   
Sbjct: 248 VASITDWEQVPSDEDKIASYLALKH-PLSVSIDAGEGLSWMQFYKHGVA-NPRFCSKTS- 304

Query: 301 LDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQS 444
           L+HAVL VG+GV  G  +W+VKNSW   WG +GY  +   +  CG+ +
Sbjct: 305 LNHAVLLVGFGVDGGKAFWIVKNSWGEKWGENGYFRLIRGKGACGINT 352


>UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa
           (japonica cultivar-group)|Rep: Os09g0562700 protein -
           Oryza sativa subsp. japonica (Rice)
          Length = 235

 Score = 79.0 bits (186), Expect = 6e-14
 Identities = 54/168 (32%), Positives = 76/168 (45%), Gaps = 16/168 (9%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTA-ITKITGWVNVTTNNENALK 183
           GG  +RA +WI  +G + T +DY         C    +      I G   V T +E +L 
Sbjct: 73  GGVSYRALEWITANGGITTRDDYPYTAAASAACDRAKLGHHAATIAGLRRVATRSEASLA 132

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYG---------V 336
            A     P++V+I+A    F  Y  GVY  P C  +   L+H V  VGYG          
Sbjct: 133 NAAAAQ-PVAVSIEAGGDNFQHYRKGVYDGP-CGTR---LNHGVTVVGYGQEEAAADGGA 187

Query: 337 LNGHKYWLVKNSWSNMWGNDGYVLM-----SMRENNCGVQSAPTYVLI 465
             G KYW++KNSW   WG+ GY+ M        E  CG+   P++ L+
Sbjct: 188 AGGDKYWIIKNSWGKNWGDQGYIKMKKDVAGKPEGLCGIAIRPSFPLM 235


>UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep:
           Cysteine proteinase - Cryptobia salmositica
          Length = 443

 Score = 79.0 bits (186), Expect = 6e-14
 Identities = 35/95 (36%), Positives = 56/95 (58%)
 Frame = +1

Query: 169 ENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGH 348
           E  +   +FKHGP+S+ +DA+  T+  Y+ G+     C    D++DH VL VG+      
Sbjct: 237 EEDMAAFVFKHGPLSIGVDAS--TWQSYAGGIM--SYCPQ--DQIDHGVLIVGFDDTAST 290

Query: 349 KYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPT 453
            YW++KNSW+  WG +GY+ ++   N CG+ S P+
Sbjct: 291 PYWIIKNSWTANWGEEGYIRVAKGSNQCGLTSHPS 325


>UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n=1;
           Toxocara canis|Rep: Cathepsin L-like cysteine proteinase
           - Toxocara canis (Canine roundworm)
          Length = 360

 Score = 79.0 bits (186), Expect = 6e-14
 Identities = 50/156 (32%), Positives = 77/156 (49%), Gaps = 5/156 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG+  +A +++   GL  E DY     +   C +   T   K   +++    +E ++   
Sbjct: 209 GGDVDKALRYVYDEGLMREYDYPYVAHRQDTCQLRGETTRIKAAVFLH---QDEASIIDW 265

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPK--CKNKVDELDHAVLAVGYGVLNG--HKYW 357
           L  +GP++V I+        Y  GVY   K  C+NK+    H++  VGYG  N    KYW
Sbjct: 266 LLHYGPVNVGINVT-ADMKAYKGGVYTPDKWECENKIIGT-HSINIVGYGTWNATNQKYW 323

Query: 358 LVKNSWSNMWG-NDGYVLMSMRENNCGVQSAPTYVL 462
           +VKNSW   +G  DGYV  +   N+CG++  P  VL
Sbjct: 324 IVKNSWGQSYGIEDGYVYFARGINSCGIEDEPVGVL 359


>UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia
           tropicalis|Rep: Blo t 1 allergen - Blomia tropicalis
           (Mite)
          Length = 333

 Score = 79.0 bits (186), Expect = 6e-14
 Identities = 41/154 (26%), Positives = 78/154 (50%), Gaps = 1/154 (0%)
 Frame = +1

Query: 7   GGGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVT-TNNENALK 183
           G G    A++++++ GL  EE+Y  Y  +  +C+ D       ++G+  +   +++  + 
Sbjct: 183 GSGYSTEAFKYMIRTGLVEEENYP-YNMRTQWCNPDVEGQRYHVSGYQQLRYQSSDEDVM 241

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363
             + +HGP+ + +  ++  F    NGV       +     DHAV+ VG+G + G  YW++
Sbjct: 242 YTIQQHGPVVIYMHGSNNYFRNLGNGVLRGVAYNDAYT--DHAVILVGWGTVQGVDYWII 299

Query: 364 KNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           +NSW   WGN GY  +    N+ G+ +  TY  +
Sbjct: 300 RNSWGTGWGNGGYGYVERGHNSLGINNFVTYATL 333


>UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep:
           MGC107932 protein - Xenopus tropicalis (Western clawed
           frog) (Silurana tropicalis)
          Length = 333

 Score = 78.6 bits (185), Expect = 8e-14
 Identities = 47/150 (31%), Positives = 72/150 (48%), Gaps = 7/150 (4%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG   +A +++ +HG+   ++Y  Y  +   C  D+  AI        +    EN +  +
Sbjct: 180 GGFPIKALEYVAQHGVMRNKEYE-YSQKKATCEYDSDKAIHMNVSKFYILPGEEN-MATS 237

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHK------ 351
           +   GPI+V I  +   F  YS G+ FE  C    +  +HAV+ VGYG  + +       
Sbjct: 238 VAIEGPITVGIGVS-SDFQLYSEGI-FEGDC---AESPNHAVIIVGYGTEHANDKEEEDK 292

Query: 352 -YWLVKNSWSNMWGNDGYVLMSMRENNCGV 438
            YW++KNSW   WG DGYV M    N C +
Sbjct: 293 DYWIIKNSWGKEWGEDGYVKMKRNINQCSI 322


>UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_21,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 349

 Score = 78.2 bits (184), Expect = 1e-13
 Identities = 46/154 (29%), Positives = 69/154 (44%), Gaps = 3/154 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GGE +  +Q+  K+G+    +Y  Y G D  C         +  G+V+V   +  A   A
Sbjct: 193 GGEMYDGFQYASKYGIAIRSEYP-YAGVDQKCAAKQTKTRYQFAGYVDVEPLSAQAYVEA 251

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
             +H  +S+ I+A+   F  Y  G+Y   KC      L+H V  VGY       Y+L+KN
Sbjct: 252 ASEHA-LSIGINASGINFQLYKKGIY-SAKCDGSKPALNHGVTNVGYAP----DYYLIKN 305

Query: 370 SWSNMWGNDGYVLMSM---RENNCGVQSAPTYVL 462
           SW   WG  GY+  +    +   CG Q    + L
Sbjct: 306 SWGQSWGESGYIRFARIADKAGQCGAQQEVNFPL 339


>UniRef50_P25779 Cluster: Cruzipain precursor; n=54;
           Trypanosoma|Rep: Cruzipain precursor - Trypanosoma cruzi
          Length = 467

 Score = 78.2 bits (184), Expect = 1e-13
 Identities = 49/157 (31%), Positives = 80/157 (50%), Gaps = 5/157 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMK--HGLPTEEDYGGYLGQDGY---CHIDNVTAITKITGWVNVTTNNEN 174
           GG    A++WI++  +G    ED   Y   +G    C     T    ITG V +   +E 
Sbjct: 187 GGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELP-QDEA 245

Query: 175 ALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKY 354
            +   L  +GP++VA+DA+  ++  Y+ GV     C +  ++LDH VL VGY       Y
Sbjct: 246 QIAAWLAVNGPVAVAVDAS--SWMTYTGGVM--TSCVS--EQLDHGVLLVGYNDSAAVPY 299

Query: 355 WLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           W++KNSW+  WG +GY+ ++   N C V+   +  ++
Sbjct: 300 WIIKNSWTTQWGEEGYIRIAKGSNQCLVKEEASSAVV 336


>UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960
           precursor; n=2; Arabidopsis thaliana|Rep: Probable
           cysteine proteinase At3g43960 precursor - Arabidopsis
           thaliana (Mouse-ear cress)
          Length = 376

 Score = 78.2 bits (184), Expect = 1e-13
 Identities = 56/157 (35%), Positives = 78/157 (49%), Gaps = 8/157 (5%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQD-GYCHIDNV--TAITKITGWVNVTTNNENAL 180
           GG    A+++I ++G    ++  GY G+D   C    +  T +  I G   V  N+E +L
Sbjct: 194 GGGAVWAFEFIKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSL 253

Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGH-KYW 357
           K A+  + PISV I AA+   S Y +GVY +  C N     DH VL VGYG  +    YW
Sbjct: 254 KKAV-AYQPISVMISAAN--MSDYKSGVY-KGACSNLWG--DHNVLIVGYGTSSDEGDYW 307

Query: 358 LVKNSWSNMWGNDGYVLMSMR----ENNCGVQSAPTY 456
           L++NSW   WG  GY+ +          C V  AP Y
Sbjct: 308 LIRNSWGPEWGEGGYLRLQRNFHEPTGKCAVAVAPVY 344


>UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 360

 Score = 77.4 bits (182), Expect = 2e-13
 Identities = 47/144 (32%), Positives = 82/144 (56%), Gaps = 4/144 (2%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG+   A++ I  +G + TE +Y  Y+ +   C  D      +I G+++V ++ ++ +K 
Sbjct: 196 GGDPEPAFRCIQNNGGIMTETEYP-YIAKQQSCKFDEDKPTFQIGGYIDVPSD-QSQVKA 253

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKN-KVDELDHAVLAVGYGVLNGHK--YW 357
           AL    P+S+ ++++  +F +Y +GV  E  C++   D  DH +L VGYG     K  YW
Sbjct: 254 ALLIQ-PLSICLNSSDTSFKYYKSGVITE--CEDGPYDGPDHCLLLVGYGHDEELKVDYW 310

Query: 358 LVKNSWSNMWGNDGYVLMSMRENN 429
           L+KN W   WG +GYV + +R++N
Sbjct: 311 LIKNQWGTTWGEEGYVRI-IRDDN 333


>UniRef50_Q23RT7 Cluster: Papain family cysteine protease containing
           protein; n=7; Hymenostomatida|Rep: Papain family
           cysteine protease containing protein - Tetrahymena
           thermophila SB210
          Length = 387

 Score = 77.4 bits (182), Expect = 2e-13
 Identities = 44/131 (33%), Positives = 71/131 (54%), Gaps = 5/131 (3%)
 Frame = +1

Query: 28  AYQWIMKHGLPTEE--DYGGYLGQDGYCHIDNVTAITKIT--GWVNVTTNNENALKLALF 195
           AY ++   GL +E    Y  Y GQ G C  D      ++T  G++ V  N+  +L  A+ 
Sbjct: 209 AYNYVQLFGLTSEYKYSYSSYQGQTGNCTFDPTQQPIEVTIDGYLKVPENDYASLMNAVA 268

Query: 196 KHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGH-KYWLVKNS 372
             GP+ +++DA++  F  Y +GV+      + VD ++HAV+ VGYG       YW+V+NS
Sbjct: 269 TQGPLVISVDASN--FHDYESGVFHGCDGADNVD-INHAVVLVGYGTDEKEGDYWIVRNS 325

Query: 373 WSNMWGNDGYV 405
           W   +G +GY+
Sbjct: 326 WGTRFGENGYI 336


>UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6;
           Schistosoma|Rep: Cathepsin C precursor - Schistosoma
           mansoni (Blood fluke)
          Length = 454

 Score = 77.4 bits (182), Expect = 2e-13
 Identities = 52/152 (34%), Positives = 71/152 (46%), Gaps = 8/152 (5%)
 Frame = +1

Query: 13  GEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLAL 192
           GEDF   Q I+   +P   +  G       C     T  + I G+   T  NE  ++L L
Sbjct: 300 GEDFGLPQKIV---IPYTGEDTGKCTVSKNCTRYYTTDYSYIGGYYGAT--NEKLMQLEL 354

Query: 193 FKHGPISVAIDAAHKTFSFYSNGVYFEPKCK------NKVDELDHAVLAVGYGV--LNGH 348
             +GP  V  +  ++ F FY  G+Y     +      N  +  +HAVL VGYGV  L+G 
Sbjct: 355 ISNGPFPVGFEV-YEDFQFYKEGIYHHTTVQTDHYNFNPFELTNHAVLLVGYGVDKLSGE 413

Query: 349 KYWLVKNSWSNMWGNDGYVLMSMRENNCGVQS 444
            YW VKNSW   WG  GY  +    + CGV+S
Sbjct: 414 PYWKVKNSWGVEWGEQGYFRILRGTDECGVES 445


>UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 350

 Score = 77.0 bits (181), Expect = 2e-13
 Identities = 48/144 (33%), Positives = 79/144 (54%), Gaps = 1/144 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG      Q+    GL  + DY  Y+   G C   N      +  + +V  ++ +AL+ A
Sbjct: 209 GGFPSEGLQYASTVGL-VQSDYYPYVAVQGTCRQVNAPRYQLLDQYYSVQQSS-SALQYA 266

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKC-KNKVDELDHAVLAVGYGVLNGHKYWLVK 366
           + +  P +V +DA+  T+ FY++GVY    C K + ++L+HAV+AVGY   + +  W+++
Sbjct: 267 ITR-APTAVGVDAS--TWQFYNSGVY--NGCGKTQRNQLNHAVIAVGY---DAYGNWIIR 318

Query: 367 NSWSNMWGNDGYVLMSMRENNCGV 438
           NSW   WG  GY+ ++ R N CGV
Sbjct: 319 NSWGTSWGQSGYITLA-RGNTCGV 341


>UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 664

 Score = 77.0 bits (181), Expect = 2e-13
 Identities = 41/119 (34%), Positives = 65/119 (54%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG     Y +I ++G   +E    Y G+ G C  ++  A ++I+ +V +  ++E  L   
Sbjct: 538 GGWMHNCYSYIQENGGINQESTYPYEGKFGQCRYNSGDAQSRISKFVMIKQHDEEDLADT 597

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
           +   GP+SVA DA+ + F +YS G+Y+   C NK     HAV+ VGY   NG  YW++K
Sbjct: 598 VASVGPVSVAYDASTREFMYYSRGIYYSDNC-NKY-RTTHAVVVVGYDNENGVDYWIIK 654


>UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10;
           Dictyostelium discoideum|Rep: Cysteine proteinase 7
           precursor - Dictyostelium discoideum (Slime mold)
          Length = 460

 Score = 77.0 bits (181), Expect = 2e-13
 Identities = 44/114 (38%), Positives = 69/114 (60%), Gaps = 2/114 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGY-CHIDNVTAITKITGWVNVTTNNENALK 183
           GG    A+++I+ + G+ TE  Y  Y  +DG  C  +      +++ +VNVT+ +E+ L 
Sbjct: 178 GGLMTLAFEYIINNKGIDTESSYP-YTAEDGKKCKFNPKNVAAQLSSYVNVTSGSESDLA 236

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNG 345
            A    GP SVAIDA++++F  Y +G+Y EP C +   +LDH VLAVG+G  +G
Sbjct: 237 -AKVTQGPTSVAIDASNQSFQLYVSGIYNEPACSS--TQLDHGVLAVGFGTGSG 287



 Score = 49.6 bits (113), Expect = 4e-05
 Identities = 22/40 (55%), Positives = 26/40 (65%), Gaps = 4/40 (10%)
 Frame = +1

Query: 352 YWLVKNSWSNMWGNDGYVLMSMRENN-CGV---QSAPTYV 459
           YW+VKNSW   WG DGY+LM+   NN CG+    S PT V
Sbjct: 418 YWIVKNSWGTSWGMDGYILMTKGNNNQCGIATMASRPTAV 457


>UniRef50_Q23FS6 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 330

 Score = 76.6 bits (180), Expect = 3e-13
 Identities = 47/151 (31%), Positives = 80/151 (52%), Gaps = 1/151 (0%)
 Frame = +1

Query: 7   GGGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAIT-KITGWVNVTTNNENALK 183
           GGG    A+++I +  L T  +Y  Y+  D  C+   +  +   ++ + +V + N   LK
Sbjct: 182 GGGWMDNAFEYIEESPLTTNSNYP-YVAVDQACNSTEIYGVLYSLSNYTDVESGNTVQLK 240

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363
             L +  P+S+A+DA++  +  Y++G++      N    L+H VL VG+    G   WLV
Sbjct: 241 QYL-QQQPLSIAVDASY--WYLYNSGIF-----SNCGQNLNHGVLLVGFNSTEGS--WLV 290

Query: 364 KNSWSNMWGNDGYVLMSMRENNCGVQSAPTY 456
           KNSW   WG  GY+ ++   N CG+ +A +Y
Sbjct: 291 KNSWGTSWGEQGYIRLA-DGNTCGLANAASY 320


>UniRef50_Q23FR0 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 394

 Score = 76.6 bits (180), Expect = 3e-13
 Identities = 47/143 (32%), Positives = 77/143 (53%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG      ++ +K G+ TE+ Y  Y    G C I N T                + LK +
Sbjct: 252 GGFQSDGVEYAIKFGIVTEDKYP-YTAVGGDCQISNPTTDGFYPKTYRKLQQTVDDLKAS 310

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           L    P++V++DA++  ++ Y +G+ F+   +   D+L+HAV+AVGY   +G+  W+++N
Sbjct: 311 L-NFSPVTVSVDASN--WNSYESGI-FDNCGETTQDQLNHAVIAVGYDT-DGN--WIIRN 363

Query: 370 SWSNMWGNDGYVLMSMRENNCGV 438
           SWS  WG DGY+ ++   N CGV
Sbjct: 364 SWSTSWGEDGYIRLA-AGNTCGV 385


>UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10;
           Liliopsida|Rep: Putative cysteine proteinase - Oryza
           sativa subsp. japonica (Rice)
          Length = 416

 Score = 76.2 bits (179), Expect = 4e-13
 Identities = 54/143 (37%), Positives = 73/143 (51%), Gaps = 9/143 (6%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEED-----YGGYLGQDGYCH-IDNVTAITKITGWVNVTTNNE 171
           GG+   A Q+I+K+G+  ++      Y GY  +   C  +     I K+   V    N E
Sbjct: 178 GGDPRAALQYIVKNGVTLDQCGKLPYYPGYEAKKLACRTVAGKPPIVKVDA-VKPVANTE 236

Query: 172 NALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL---N 342
            AL L +F+  PISV IDA+      Y  GV F  +CK     L+H V+ VGYGV    +
Sbjct: 237 AALLLKVFQQ-PISVGIDAS-ADLQHYKKGV-FTGRCKTA--PLNHGVVVVGYGVNTTPD 291

Query: 343 GHKYWLVKNSWSNMWGNDGYVLM 411
             KYW+VKNSW   WG  GY+ M
Sbjct: 292 KTKYWIVKNSWGKGWGEGGYIRM 314



 Score = 51.6 bits (118), Expect = 1e-05
 Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 5/71 (7%)
 Frame = +1

Query: 259 GVYFEPKCKNKVDELDHAVLAVGYGVLNGH-KYWLVKNSWSNMWGNDGYVLM----SMRE 423
           GVY  P C   V+   HAV  VGYGV   +  YW+ +NSW   WG  GY+ M    + +E
Sbjct: 332 GVYNGP-CGTSVN---HAVTTVGYGVTQDNINYWIARNSWGPRWGESGYIRMKRDIAAKE 387

Query: 424 NNCGVQSAPTY 456
             CG+     Y
Sbjct: 388 GLCGISMYGVY 398


>UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Spodoptera
           litura multicapsid nucleopolyhedrovirus (SpltMNPV)
          Length = 337

 Score = 76.2 bits (179), Expect = 4e-13
 Identities = 49/144 (34%), Positives = 72/144 (50%), Gaps = 1/144 (0%)
 Frame = +1

Query: 10  GGEDFRAYQWIMK-HGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG    A+Q I++  G+  E DY  Y G +  C +       +++        +E  L  
Sbjct: 190 GGLMHLAFQEIIRIGGVEHEIDYP-YQGIEYACRLAPSKLAVRLSHCYQYDLRDERKLLE 248

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
            L+K+GPI+VAID        Y +G+     C +  + L+HAVL VGYG+ N   YW+ K
Sbjct: 249 LLYKNGPIAVAIDCVD--IIDYRSGI--ATVCND--NGLNHAVLLVGYGIENDTPYWIFK 302

Query: 367 NSWSNMWGNDGYVLMSMRENNCGV 438
           NSW + WG +GY       N CG+
Sbjct: 303 NSWGSNWGENGYFRARRNINACGM 326


>UniRef50_Q239L8 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 323

 Score = 75.4 bits (177), Expect = 7e-13
 Identities = 49/143 (34%), Positives = 73/143 (51%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    A+ +I +HG+PTE  Y  Y   DG C +   +   KI+   ++   N+   K+ 
Sbjct: 188 GGLMDTAFDFISQHGIPTEAAYP-YKAVDGTCKM--TSGPYKISSHTDIQDCNDLLNKI- 243

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
             +  PI++A+DA    F +Y   ++ +  C     ELDH VL VGY      KYW VKN
Sbjct: 244 --QKQPIAIAVDA--NNFQYYQKDIFSD--CGT---ELDHGVLLVGYSASG--KYWKVKN 292

Query: 370 SWSNMWGNDGYVLMSMRENNCGV 438
           SW   WG  G++ ++   N CG+
Sbjct: 293 SWGPNWGESGFIRLA-AGNTCGL 314


>UniRef50_UPI0000498E2F Cluster: cysteine proteinase; n=1; Entamoeba
           histolytica HM-1:IMSS|Rep: cysteine proteinase -
           Entamoeba histolytica HM-1:IMSS
          Length = 317

 Score = 74.9 bits (176), Expect = 1e-12
 Identities = 43/144 (29%), Positives = 72/144 (50%), Gaps = 3/144 (2%)
 Frame = +1

Query: 37  WIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISV 216
           ++ + G   EEDY     + G C  ++     K+     ++  N++ L + + K+ PI V
Sbjct: 173 YLQRFGFMKEEDYPE-TSEKGICQYNSTRIFGKVNKRRYLSVFNDDEL-IEVIKNTPIIV 230

Query: 217 AIDAAHKTFSFYSNGVYFE--PKCKNKVDELDHAVLAVGYG-VLNGHKYWLVKNSWSNMW 387
            ID    T  +Y     FE   +C      +   +L +GYG  +NG  YW++KN W + W
Sbjct: 231 NIDMP-PTMPYYDGEGIFENIEECSQSSPRI--GLLLIGYGKTINGIPYWILKNCWGSSW 287

Query: 388 GNDGYVLMSMRENNCGVQSAPTYV 459
           G++GY+ +   +N CG+ S  TYV
Sbjct: 288 GSNGYLYLKRNKNVCGIYSYGTYV 311


>UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides
           sonorensis|Rep: Cathepsin L - Culicoides sonorensis
          Length = 331

 Score = 74.9 bits (176), Expect = 1e-12
 Identities = 47/149 (31%), Positives = 70/149 (46%), Gaps = 2/149 (1%)
 Frame = +1

Query: 7   GGGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GG  D  A Q++  +GL  E+DY  Y G+D  CH  N          V  T  +E + K 
Sbjct: 180 GGWSDL-ALQYMRDNGLSFEKDYP-YKGKDEKCHASNENKSPVKVVNVCSTPKDEVSYKD 237

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
             +++GP+ V        F  Y  G++    C  +   ++HAV+ +GYG     KYWLV+
Sbjct: 238 HFYQYGPL-VVYYFVDNNFKQYKGGIFSSKTCNVENAGINHAVVLMGYGSEKDVKYWLVR 296

Query: 367 NSWSNMWGNDGY--VLMSMRENNCGVQSA 447
           NSW   +G  G+  +L      N G  +A
Sbjct: 297 NSWGKSFGESGHFRILRDAHMCNLGYHNA 325


>UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_567_6496_7413 - Giardia lamblia ATCC
           50803
          Length = 305

 Score = 74.5 bits (175), Expect = 1e-12
 Identities = 33/93 (35%), Positives = 51/93 (54%)
 Frame = +1

Query: 163 NNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN 342
           +N N + ++L   GP+       H+ F +Y  G+Y +    +      HAVL VGYG +N
Sbjct: 208 SNYNEIMVSLLADGPVQTGF-YVHEDFLYYVGGIYHKVYGTSLGG---HAVLIVGYGSMN 263

Query: 343 GHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQ 441
            H YW+V+NSW + WG +GY  +    N CG++
Sbjct: 264 NHDYWIVRNSWGSDWGENGYFRILRGTNECGIE 296


>UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC
           3.4.14.1) (Dipeptidyl-peptidase I) (DPP-I) (DPPI)
           (Cathepsin C) (Cathepsin J) (Dipeptidyl transferase)
           [Contains: Dipeptidyl-peptidase 1 exclusion domain chain
           (Dipeptidyl- peptidase I exclusion domain chain);
           Dipeptidyl-peptidase 1 heavy chain (Dipeptidyl-peptidase
           I heavy chain); Dipeptidyl-peptidase 1 light chain
           (Dipeptidyl-peptidase I light chain)]; n=50;
           Coelomata|Rep: Dipeptidyl-peptidase 1 precursor (EC
           3.4.14.1) (Dipeptidyl-peptidase I) (DPP-I) (DPPI)
           (Cathepsin C) (Cathepsin J) (Dipeptidyl transferase)
           [Contains: Dipeptidyl-peptidase 1 exclusion domain chain
           (Dipeptidyl- peptidase I exclusion domain chain);
           Dipeptidyl-peptidase 1 heavy chain (Dipeptidyl-peptidase
           I heavy chain); Dipeptidyl-peptidase 1 light chain
           (Dipeptidyl-peptidase I light chain)] - Homo sapiens
           (Human)
          Length = 463

 Score = 74.5 bits (175), Expect = 1e-12
 Identities = 37/98 (37%), Positives = 53/98 (54%), Gaps = 5/98 (5%)
 Frame = +1

Query: 166 NENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCK---NKVDELDHAVLAVGYGV 336
           NE  +KL L  HGP++VA +  +  F  Y  G+Y     +   N  +  +HAVL VGYG 
Sbjct: 356 NEALMKLELVHHGPMAVAFEV-YDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGT 414

Query: 337 --LNGHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQS 444
              +G  YW+VKNSW   WG +GY  +    + C ++S
Sbjct: 415 DSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIES 452


>UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:
           Aca s 1 allergen - Acarus siro (Dust mite)
          Length = 331

 Score = 74.1 bits (174), Expect = 2e-12
 Identities = 33/90 (36%), Positives = 48/90 (53%)
 Frame = +1

Query: 169 ENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGH 348
           + ++   L  HGP++V IDA H  F  Y +GV      +    E++H +  VG+G  NG 
Sbjct: 234 DESIMTVLKTHGPVAVDIDADHNGFKHYKSGVI--RLTRGGTTEVNHVINIVGWGRENGL 291

Query: 349 KYWLVKNSWSNMWGNDGYVLMSMRENNCGV 438
            YWL++NSW   WG  GY  +    NN G+
Sbjct: 292 DYWLIRNSWGTHWGEAGYGKVERHHNNMGI 321


>UniRef50_Q23H10 Cluster: Papain family cysteine protease containing
           protein; n=14; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 336

 Score = 73.7 bits (173), Expect = 2e-12
 Identities = 47/145 (32%), Positives = 72/145 (49%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG   +   +  K G+ T + Y  Y+     C++       K   W+ +  N  N LK A
Sbjct: 196 GGWPVQCLDYASKVGITTLDKYP-YVAVQKNCNVTGTDNGFKPKSWIQIP-NTSNDLKSA 253

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           L    P+SV +DA+  T+  Y +G++    C      L+HAVLAVGY        W++KN
Sbjct: 254 L-NFSPVSVLVDAS--TWGNYYSGIF--NGCDQTHISLNHAVLAVGYDQQGN---WIIKN 305

Query: 370 SWSNMWGNDGYVLMSMRENNCGVQS 444
           SWS  WG +G++ ++   N CG+ S
Sbjct: 306 SWSTYWGENGFMRLA-PNNTCGILS 329


>UniRef50_P43234 Cluster: Cathepsin O precursor; n=22;
           Euteleostomi|Rep: Cathepsin O precursor - Homo sapiens
           (Human)
          Length = 321

 Score = 73.7 bits (173), Expect = 2e-12
 Identities = 45/156 (28%), Positives = 75/156 (48%), Gaps = 4/156 (2%)
 Frame = +1

Query: 10  GGEDFRAYQWI--MKHGLPTEEDYGGYLGQDGYCH-IDNVTAITKITGWVNVT-TNNENA 177
           GG    A  W+  M+  L  + +Y  +  Q+G CH      +   I G+     ++ E+ 
Sbjct: 172 GGSTLNALNWLNKMQVKLVKDSEYP-FKAQNGLCHYFSGSHSGFSIKGYSAYDFSDQEDE 230

Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357
           +  AL   GP+ V +DA   ++  Y  G+  +  C +   E +HAVL  G+       YW
Sbjct: 231 MAKALLTFGPLVVIVDAV--SWQDYLGGI-IQHHCSS--GEANHAVLITGFDKTGSTPYW 285

Query: 358 LVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           +V+NSW + WG DGY  + M  N CG+  + + + +
Sbjct: 286 IVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV 321


>UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin O;
           n=1; Monodelphis domestica|Rep: PREDICTED: similar to
           cathepsin O - Monodelphis domestica
          Length = 414

 Score = 73.3 bits (172), Expect = 3e-12
 Identities = 45/155 (29%), Positives = 69/155 (44%), Gaps = 3/155 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYG-GYLGQDGYCH-IDNVTAITKITGWVNVT-TNNENAL 180
           GG    A  W+ K  +   +D    +  Q G CH      A   I  + +   +  EN +
Sbjct: 265 GGSTVNALNWLNKTQVRLVKDSEYSFKAQTGLCHYFSGSHAGVSIKDYSSYDFSGKENEM 324

Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360
              L   GP++V +DA   ++  Y  G+  +  C +   E +HAVL  G+       YW+
Sbjct: 325 ANVLLAFGPLAVIVDAV--SWQDYLGGI-IQHHCSS--GEANHAVLITGFDRTGNTPYWI 379

Query: 361 VKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           V+NSW   WG DGY  + M  N CG+    + V +
Sbjct: 380 VRNSWGTSWGVDGYAFVKMGANVCGIADLVSAVFV 414


>UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 336

 Score = 73.3 bits (172), Expect = 3e-12
 Identities = 50/149 (33%), Positives = 81/149 (54%), Gaps = 2/149 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAIT-KITGWVNVTTNNENALKL 186
           GG    A +++ K G+  EE Y  YL  D  C + + T+   K+  +  +     +ALK 
Sbjct: 194 GGWPEEALKYVAKFGILKEEQYP-YLAVDSKCKVSSPTSDGFKVQSFYFID-KTADALKN 251

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKN-KVDELDHAVLAVGYGVLNGHKYWLV 363
            + +  P+SV +DA+  T+  YS+GVY    C N +   L+HAV+A+GY        W++
Sbjct: 252 TVARI-PVSVLVDAS--TWGSYSSGVY--NGCGNTQTYNLNHAVVAIGYDEQGN---WII 303

Query: 364 KNSWSNMWGNDGYVLMSMRENNCGVQSAP 450
           +NSWS  WG DG++ ++   N CG+  +P
Sbjct: 304 RNSWSTSWGMDGHMKLA-PGNTCGILLSP 331


>UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8;
           Trypanosoma|Rep: Cathepsin B-like cysteine protease -
           Trypanosoma brucei
          Length = 340

 Score = 73.3 bits (172), Expect = 3e-12
 Identities = 40/132 (30%), Positives = 58/132 (43%)
 Frame = +1

Query: 46  KHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAID 225
           K+G P    +     +  Y   D    +     W +     E+     LF  GP  VA D
Sbjct: 199 KNGYPPCSQFNFDTPKCNYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAFD 258

Query: 226 AAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYV 405
             ++ F  Y++GVY     +       HAV  VG+G  NG  YW + NSW+  WG DGY 
Sbjct: 259 V-YEDFIAYNSGVYHHVSGQYLGG---HAVRLVGWGTSNGVPYWKIANSWNTEWGMDGYF 314

Query: 406 LMSMRENNCGVQ 441
           L+    + CG++
Sbjct: 315 LIRRGSSECGIE 326


>UniRef50_Q23H06 Cluster: Papain family cysteine protease containing
           protein; n=18; Tetrahymena thermophila|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 349

 Score = 73.3 bits (172), Expect = 3e-12
 Identities = 50/140 (35%), Positives = 69/140 (49%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG   +   +  K G+  ++ Y  Y G    C +       K   WV +  NN +ALK A
Sbjct: 210 GGWPVQCIDYASKVGILNQDRYY-YFGVQMQCRVTGTNNGFKPKSWVQIP-NNSDALKTA 267

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           L    P+SVA+D  + T   Y +GV+    C + V  L+HAVL VGY        W++KN
Sbjct: 268 L-NFSPVSVAVDGTNWTD--YKSGVF--NGCDSHVS-LNHAVLVVGYDEQGN---WIIKN 318

Query: 370 SWSNMWGNDGYVLMSMRENN 429
           SWS +WG  GY  M +  NN
Sbjct: 319 SWSTLWGEGGY--MRLAPNN 336


>UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 293

 Score = 73.3 bits (172), Expect = 3e-12
 Identities = 47/148 (31%), Positives = 78/148 (52%), Gaps = 4/148 (2%)
 Frame = +1

Query: 7   GGGEDFRAYQWIM-KHGL-PTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVT-TNNENA 177
           GG  D  +Y  ++ ++G+  ++ DY  +    G C  D+  A +K   +V +T T NE  
Sbjct: 142 GGSSDGASYFVLLNQYGMWMSDSDYP-FKPYVGECKFDSSMAQSK---FVQLTYTKNETD 197

Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357
           + + +  HG ++   DA+   F +YS+ VY  P C      + H ++  GYG   G  YW
Sbjct: 198 MAVTVATHGVLACGYDASAADFEWYSSCVYDNPDCDPW--GICHWMMICGYGTDAGKDYW 255

Query: 358 LVKNSWSNMWGNDGYV-LMSMRENNCGV 438
           L KNS+ + WG +GY+ L+  ++  CGV
Sbjct: 256 LAKNSFGSTWGMEGYIELVRNKDGQCGV 283


>UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza
           sativa|Rep: Cysteine proteinase-like - Oryza sativa
           subsp. japonica (Rice)
          Length = 360

 Score = 72.9 bits (171), Expect = 4e-12
 Identities = 53/160 (33%), Positives = 72/160 (45%), Gaps = 11/160 (6%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITG-----WVNVTTNNE 171
           GG+   A ++I   G L TE  Y  Y GQ G C      A           W  +   +E
Sbjct: 201 GGDVSAALRYIAASGGLQTEAAYA-YGGQQGACRAGGFAAPNSAAAVGGARWARLY-GDE 258

Query: 172 NALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL--NG 345
            AL+ AL    P+ V ++A+   F  Y +GVY       +   L+HAV  VGYG     G
Sbjct: 259 GALQ-ALAAGQPVVVVVEASEPDFRHYRSGVYAGSAACGR--RLNHAVTVVGYGAAADGG 315

Query: 346 HKYWLVKNSWSNMWGNDGYVLMS---MRENNCGVQSAPTY 456
            +YWLVKN W   WG  GY+ ++       NCG+ +   Y
Sbjct: 316 GEYWLVKNQWGTWWGEGGYMRVARGGAAGGNCGIATYAFY 355


>UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-ear
           cress). SAG12 protein; n=2; Dictyostelium
           discoideum|Rep: Similar to Arabidopsis thaliana
           (Mouse-ear cress). SAG12 protein - Dictyostelium
           discoideum (Slime mold)
          Length = 358

 Score = 72.9 bits (171), Expect = 4e-12
 Identities = 48/158 (30%), Positives = 77/158 (48%), Gaps = 7/158 (4%)
 Frame = +1

Query: 7   GGGEDFRAYQWIMK-HGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTN-NENAL 180
           GGG+ +  Y++  +  G+ T   Y  Y   DG C   N++    +  +  VT   +EN L
Sbjct: 208 GGGDPYTVYEYFSQVGGVSTNAQYP-YTATDGTCV--NMSRAVPVVSYHYVTQGGDENTL 264

Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-----LNG 345
              +   GP+S+ +DA+  T+  YS G+      KN    +DH V  VG  V      N 
Sbjct: 265 IKTIVNDGPVSICVDAS--TWQSYSGGIITTGCGKN----IDHCVQVVGLEVDKTDPSNP 318

Query: 346 HKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYV 459
            +Y++++NSW   WG DGY+ ++   + CG+    T V
Sbjct: 319 VQYYIIRNSWGTDWGIDGYIYVATGSDLCGITYESTMV 356


>UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14;
           Leishmania|Rep: Cysteine proteinase 1 precursor -
           Leishmania pifanoi
          Length = 354

 Score = 72.9 bits (171), Expect = 4e-12
 Identities = 53/154 (34%), Positives = 77/154 (50%), Gaps = 7/154 (4%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKH---GLPTEEDY----GGYLGQDGYCHIDNVTAITKITGWVNVTTNN 168
           GG   +A  WIM+     + TE  Y    GG  G    CH D      KITG++++  + 
Sbjct: 193 GGLMDQAMNWIMQSHNGSVFTEASYPYTSGG--GTRPPCH-DEGEVGAKITGFLSLPHDE 249

Query: 169 ENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGH 348
           E   +  + K GP++VA+DA   T+  Y  GV     C      L+H VL VG+      
Sbjct: 250 ERIAEW-VEKRGPVAVAVDAT--TWQLYFGGVV--SLCL--AWSLNHGVLIVGFNKNAKP 302

Query: 349 KYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAP 450
            YW+VKNSW + WG  GY+ ++M  N C +++ P
Sbjct: 303 PYWIVKNSWGSSWGEKGYIRLAMGSNQCMLKNYP 336


>UniRef50_UPI00006CBB5F Cluster: Papain family cysteine protease
            containing protein; n=2; Tetrahymena thermophila
            SB210|Rep: Papain family cysteine protease containing
            protein - Tetrahymena thermophila SB210
          Length = 1367

 Score = 72.5 bits (170), Expect = 5e-12
 Identities = 33/89 (37%), Positives = 52/89 (58%), Gaps = 1/89 (1%)
 Frame = +1

Query: 178  LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYG-VLNGHKY 354
            +K  ++  GPIS  IDA     + Y+ G+Y E   K K+   +H V  VG+G  L G +Y
Sbjct: 1270 MKSEIYSRGPISCTIDATDNLENNYTGGIYSE---KVKLPIPNHYVSVVGWGQTLEGEEY 1326

Query: 355  WLVKNSWSNMWGNDGYVLMSMRENNCGVQ 441
            W+V+NSW   WG +G+  + M ++N G++
Sbjct: 1327 WIVRNSWGTYWGEEGFFKLKMHKDNLGLE 1355



 Score = 55.6 bits (128), Expect = 6e-07
 Identities = 44/169 (26%), Positives = 77/169 (45%), Gaps = 20/169 (11%)
 Frame = +1

Query: 10   GGEDFRAYQWIMKHGLPTEEDYGGYLGQD---GY----------------CHIDNVTAIT 132
            GG    AY++I+++ + T+E    Y G+D   GY                C   +   I 
Sbjct: 864  GGSPQTAYEYILRNNI-TDETCSPYTGRDFRDGYQCSSLTVCMECWPKVGCKARDDAYIY 922

Query: 133  KITGWVNVTTNNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHA 312
             I  W  V    E  ++  +F HGPIS  I++  + F  Y+ G+   P   +   ++ H+
Sbjct: 923  SIESWDQV--KGEEDMQQEIFNHGPISCVINST-EDFRNYTGGILNPP---DSPVQITHS 976

Query: 313  VLAVGYGVLNGH-KYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTY 456
            +  VG+G      KYW+ +NS    WG +G++ +   +N   ++S  +Y
Sbjct: 977  LSIVGWGEDEKQTKYWIARNSLGTFWGENGFIRIIRGKNALKIESDCSY 1025


>UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_56,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 314

 Score = 72.5 bits (170), Expect = 5e-12
 Identities = 51/144 (35%), Positives = 73/144 (50%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG +    QW  K+GL T++ Y     Q+  C     T   K +G+  V  +N   +  A
Sbjct: 177 GGFENLGIQWAKKNGLTTDKQYPYDGVQNKQCKYS--TGQYKPSGYQVVAADN---MYTA 231

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           L  + PI+VA+DA   ++  Y +GV+   KC  K   L+HAVLA G+        W++KN
Sbjct: 232 L-SYQPITVAVDA--NSWQNYKSGVF--TKCTYK--SLNHAVLATGF---QEDGVWIIKN 281

Query: 370 SWSNMWGNDGYVLMSMRENNCGVQ 441
           SW   WG  GY+ +    N CGVQ
Sbjct: 282 SWGTSWGEAGYIRLPATGNPCGVQ 305


>UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1;
           Trypanosoma cruzi|Rep: Cysteine protease, putative -
           Trypanosoma cruzi
          Length = 434

 Score = 72.1 bits (169), Expect = 7e-12
 Identities = 44/148 (29%), Positives = 78/148 (52%), Gaps = 7/148 (4%)
 Frame = +1

Query: 7   GGGEDFRAYQWIMKHG---LPTEEDY-GGYLGQDGYCHID-NVTAITKITGWVNVTTNNE 171
           GGG    A+++IM  G   L  E  Y  G     G C ++ ++  +  + G+ ++  N+ 
Sbjct: 196 GGGTAQLAWEYIMNTGGITLDAEYPYVSGETSVTGRCVLNRSMPRVVNVYGYASLPHNDY 255

Query: 172 NALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN--G 345
            A+  AL + GP++V++ A+   + FY+ GV+       +   + HAV  VGYG  N   
Sbjct: 256 EAVIEALVQKGPLAVSVAASD--WMFYTGGVFDGCGKDGENITISHAVQLVGYGTDNKTN 313

Query: 346 HKYWLVKNSWSNMWGNDGYVLMSMRENN 429
             YW+V+NSW   WG +G++ +  +++N
Sbjct: 314 QDYWVVRNSWGEGWGENGFIRLLRKKHN 341


>UniRef50_Q23VA1 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 397

 Score = 72.1 bits (169), Expect = 7e-12
 Identities = 50/160 (31%), Positives = 82/160 (51%), Gaps = 7/160 (4%)
 Frame = +1

Query: 7   GGGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGW-------VNVTTN 165
           GGG  + A  ++ + G+  E  Y  Y  Q+G C+  N T+ ++   +       ++ + N
Sbjct: 250 GGGWAYNALVYMQRKGIFLESQYP-YKAQNGVCN--NATSASRQKAFFAKDQIIIDTSVN 306

Query: 166 NENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNG 345
             N+L+ AL K  P+SV +D+ +  ++ YS+GV+    C +    +DH VL VGY    G
Sbjct: 307 ITNSLQYALSKQ-PVSVKVDSRY--WNSYSSGVF--SNCLSDGWYVDHVVLLVGY-TKEG 360

Query: 346 HKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           +  W+VKNSW   WG  GY+ ++   N C +   P    I
Sbjct: 361 N--WIVKNSWGTNWGQSGYIYLA-PGNTCNLSVTPVITSI 397


>UniRef50_UPI00006CBCFB Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 393

 Score = 71.7 bits (168), Expect = 9e-12
 Identities = 49/157 (31%), Positives = 83/157 (52%), Gaps = 5/157 (3%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHID--NVTAITKITGWVNVTTNNENALK 183
           GG   + Y +    G+  EE+Y     Q   C ++  + +   KI+ + +V +N E+ L+
Sbjct: 244 GGIPQKVYSYAAYLGITYEEEYPYIQRQRTGCGVNYNDTSKRVKISTYYDVQSNAES-LE 302

Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363
            AL K+ P++ AIDA  K+   Y +G+Y  P C    ++ +HAV+ VGY      +Y+L+
Sbjct: 303 TAL-KYAPVTAAIDA--KSLQMYGSGIYDFP-CSIDRNDANHAVVIVGYT----SEYFLI 354

Query: 364 KNSWSNMWGNDGYVLMSMRENN---CGVQSAPTYVLI 465
           +NSW   WG +G+  +    NN   CG+ +  +Y  I
Sbjct: 355 RNSWGPHWGEEGHFKVRKESNNKGTCGLYNDMSYPYI 391


>UniRef50_Q22RR8 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 332

 Score = 71.7 bits (168), Expect = 9e-12
 Identities = 48/155 (30%), Positives = 79/155 (50%), Gaps = 3/155 (1%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG+   A+++I  + + TE++Y  Y G D  C          ++ +V+V + +E    +A
Sbjct: 191 GGDMDAAFKFIHDNNIATEKEYT-YRGFDQKCKGTQYPTTYGLSSFVDVQSCDE---LVA 246

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
             +  P+SVA+DA +  + +Y  G + +  C    D L+H VL VGY     H+ W VKN
Sbjct: 247 AIQQQPVSVAVDATN--WQYYEFGTFND--C---FDNLNHGVLLVGYNSKT-HQ-WKVKN 297

Query: 370 SWSNMWGNDGYVLMSMRE---NNCGVQSAPTYVLI 465
           SW   WG DGY+ +       N CG+    +Y ++
Sbjct: 298 SWGTSWGEDGYIRLGASTKYLNTCGICEQASYPIV 332


>UniRef50_Q237A1 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 346

 Score = 71.3 bits (167), Expect = 1e-11
 Identities = 37/95 (38%), Positives = 54/95 (56%), Gaps = 1/95 (1%)
 Frame = +1

Query: 166 NENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELD-HAVLAVGYGVLN 342
           +E A+   ++K+GPI VA+   ++ F  Y  GVY         DEL  HAV  VG+GV N
Sbjct: 249 DEKAIMAEIYKNGPIEVAL-TVYEDFLTYKTGVYQHVTG----DELGGHAVKMVGWGVEN 303

Query: 343 GHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSA 447
           G  YW + NSW+  WG+ G   +   +N CG++S+
Sbjct: 304 GTPYWTIVNSWNESWGDKGTFKILRGKNECGIESS 338


>UniRef50_Q22NW9 Cluster: Papain family cysteine protease containing
            protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
            family cysteine protease containing protein - Tetrahymena
            thermophila SB210
          Length = 894

 Score = 70.9 bits (166), Expect = 2e-11
 Identities = 50/151 (33%), Positives = 81/151 (53%), Gaps = 2/151 (1%)
 Frame = +1

Query: 10   GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGY-CHIDNVTAIT-KITGWVNVTTNNENALK 183
            GG    A+ +++++G+  E DY  Y G   + C  +N    + KI G+ N+   +   L+
Sbjct: 749  GGFMENAFDFVIENGILQENDYP-YEGHANFKCKKNNSNQQSYKIQGYYNINKYDCRGLQ 807

Query: 184  LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363
             A+ +  P+SVAID   K    Y +G+  +  C + V+ L+H VL VGY       +++V
Sbjct: 808  QAVAQQ-PVSVAIDG--KFLQRYHSGIIGD--CGSSVN-LNHGVLIVGYT----EDFFIV 857

Query: 364  KNSWSNMWGNDGYVLMSMRENNCGVQSAPTY 456
            KNSW   WG DGY  ++ + N CG+  A +Y
Sbjct: 858  KNSWGTNWGEDGYFRIT-KTNTCGICEAASY 887


>UniRef50_Q5CYC8 Cluster: Cathepsin like thiol protease possibly
           membrane associated; n=2; Cryptosporidium|Rep: Cathepsin
           like thiol protease possibly membrane associated -
           Cryptosporidium parvum Iowa II
          Length = 673

 Score = 70.5 bits (165), Expect = 2e-11
 Identities = 33/74 (44%), Positives = 50/74 (67%), Gaps = 1/74 (1%)
 Frame = +1

Query: 196 KHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL-NGHKYWLVKNS 372
           K G IS++I++    FS YS+G+Y  PKC     EL+HAV+ +GYG+  NG KY++++NS
Sbjct: 531 KVGSISLSINSNLPGFSSYSDGIYKAPKCTTH-SELNHAVIMIGYGINDNGDKYYVIQNS 589

Query: 373 WSNMWGNDGYVLMS 414
           W   WG  G++ +S
Sbjct: 590 WGVSWGIGGFMNVS 603


>UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain -
           Tetrahymena pyriformis
          Length = 330

 Score = 70.5 bits (165), Expect = 2e-11
 Identities = 48/143 (33%), Positives = 68/143 (47%)
 Frame = +1

Query: 28  AYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGP 207
           A  +  K G+  E  Y  Y  +DG C         K +    V   +  AL+ AL +  P
Sbjct: 196 AVAYTQKFGIVQESQYA-YTAKDGSCKTALQGTGYKPSAQFQVAATDA-ALQAAL-QVQP 252

Query: 208 ISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMW 387
           IS+ +DA+   +S YS G++    C  K    DHAVL VG   LN    W V+NSW   W
Sbjct: 253 ISICVDASK--WSSYSKGIF--SNCSAKPSAADHAVLLVG---LNADNTWKVRNSWGTSW 305

Query: 388 GNDGYVLMSMRENNCGVQSAPTY 456
           G  GY+ ++   N CG+++   Y
Sbjct: 306 GQSGYITLA-AGNTCGLENYAIY 327


>UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|Rep:
           Cysteine protease - Giardia muris
          Length = 301

 Score = 70.5 bits (165), Expect = 2e-11
 Identities = 36/86 (41%), Positives = 50/86 (58%), Gaps = 1/86 (1%)
 Frame = +1

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLV 363
           AL   GP+ VA    +  F +YS+GVY   +  N + E  HAV  VGYG+  +G KYW++
Sbjct: 210 ALVYDGPLQVAF-VVYSDFGYYSSGVY---QHVNGMMEGGHAVEMVGYGIDESGLKYWII 265

Query: 364 KNSWSNMWGNDGYVLMSMRENNCGVQ 441
           +NSW   WG  GY  +  R N CG++
Sbjct: 266 RNSWGPDWGEGGYFRIIRRVNECGIE 291


>UniRef50_A3FQ13 Cluster: Cathepsin like thiol protease possibly
           membrane associated, putative; n=1; Cryptosporidium
           parvum Iowa II|Rep: Cathepsin like thiol protease
           possibly membrane associated, putative - Cryptosporidium
           parvum Iowa II
          Length = 298

 Score = 70.5 bits (165), Expect = 2e-11
 Identities = 46/166 (27%), Positives = 82/166 (49%), Gaps = 13/166 (7%)
 Frame = +1

Query: 1   ARGGGEDFRAYQWIMKHGLPTEEDYGGYL---GQDGYC--HIDNVTAITKI----TGWVN 153
           A  GG+ F  + + +K  + T + Y       G+ G C  + +    I       TG   
Sbjct: 107 ACSGGQTFEVFNYAIKSKVCTRDSYPSTTHKTGKLGECKSNCNECVGIKNFKWSYTGSSI 166

Query: 154 VTTNNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKN---KVDELDHAVLAV 324
           +  +  + +  A++ +GP++V++ +    F+ YS G Y  P C +      ++DHAV  +
Sbjct: 167 LYEDPWDVITDAIYNYGPVTVSVCSLMPGFNLYSGGYYEPPTCGSIWCGTRQVDHAVTLI 226

Query: 325 GYGVL-NGHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYV 459
           GYGV  +G +Y+++KNSW   WGN G+  M++  + C     P +V
Sbjct: 227 GYGVSESGKRYYIMKNSWGLSWGNKGF--MNISADMCSTFFNPGWV 270


>UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 325

 Score = 70.1 bits (164), Expect = 3e-11
 Identities = 49/146 (33%), Positives = 78/146 (53%)
 Frame = +1

Query: 28  AYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGP 207
           ++++I+K+ +    DY  Y   +G C   +      I+ +V+V + +  AL  AL  H P
Sbjct: 194 SFKYIIKNKISKAADYP-YTAVEGKCKDTSSFEKYAISSYVDVPSGDCKALLTALQDH-P 251

Query: 208 ISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMW 387
           +SVAIDA  K   +Y++GVY      N  D L HAVL VGY          +KNSW   +
Sbjct: 252 VSVAIDA--KNLQYYTSGVY-----SNCSDNLTHAVLLVGYS----SSALKLKNSWGTQF 300

Query: 388 GNDGYVLMSMRENNCGVQSAPTYVLI 465
           G +GY  +++  N CGV +A ++ ++
Sbjct: 301 GENGYFRLAV-GNTCGVCNAASFPVL 325


>UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1;
           Methanospirillum hungatei JF-1|Rep: Peptidase C1A,
           papain precursor - Methanospirillum hungatei (strain
           JF-1 / DSM 864)
          Length = 1096

 Score = 70.1 bits (164), Expect = 3e-11
 Identities = 42/128 (32%), Positives = 62/128 (48%), Gaps = 11/128 (8%)
 Frame = +1

Query: 52  GLPTEEDYGGYLGQDGYCH-IDNVTAITKITG----WVNVTTNNE------NALKLALFK 198
           G  TE +Y  Y G DG C  +   T  +  T     W  V   NE      +A+K A++ 
Sbjct: 410 GTVTEANYP-YTGSDGTCKSLSGYTRYSVDTAAGETWGYVGGGNEWSIPSDDAIKTAIYL 468

Query: 199 HGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWS 378
           +GP++  +  A  TF  Y +G+       +     +HA++ VG+G LNG  YW+ KNSW 
Sbjct: 469 YGPVAAGV-YAESTFDSYRSGIL---DSTSSASYANHAIIIVGWGTLNGRTYWICKNSWG 524

Query: 379 NMWGNDGY 402
             WG  G+
Sbjct: 525 TSWGESGW 532


>UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Cathepsin
           B - Fasciola gigantica (Giant liver fluke)
          Length = 339

 Score = 69.7 bits (163), Expect = 4e-11
 Identities = 34/93 (36%), Positives = 51/93 (54%)
 Frame = +1

Query: 166 NENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNG 345
           +E+ +   + K+GP+ V   A  + F  Y +G+Y     K  +    HAV  +G+GV NG
Sbjct: 242 HESYIMQEIMKNGPVEVTF-AIFQDFGVYRSGIYHHVAGKF-IGR--HAVRMIGWGVENG 297

Query: 346 HKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQS 444
             YWL+ NSW+  WG +GY  M    N CG++S
Sbjct: 298 VNYWLMANSWNEEWGENGYFRMVRGRNECGIES 330


>UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|Rep:
           Cathepsin B - Streblomastix strix
          Length = 312

 Score = 69.7 bits (163), Expect = 4e-11
 Identities = 32/94 (34%), Positives = 57/94 (60%)
 Frame = +1

Query: 163 NNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN 342
           +NE  ++  ++++GP++ +  A ++  S Y +GVY         + L HA+  VG+G+L+
Sbjct: 213 SNEADIQKEIYENGPVTASF-AVYEDLSVYQSGVY--QHVTGGFEGL-HAIKVVGWGILD 268

Query: 343 GHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQS 444
           G KYW + NSW+  WG DG +L+    + CG++S
Sbjct: 269 GVKYWTIVNSWAEDWGFDGLLLIRRGVDECGIES 302


>UniRef50_A7APS9 Cluster: Papain family cysteine protease containing
           protein; n=1; Babesia bovis|Rep: Papain family cysteine
           protease containing protein - Babesia bovis
          Length = 435

 Score = 69.7 bits (163), Expect = 4e-11
 Identities = 45/152 (29%), Positives = 75/152 (49%), Gaps = 4/152 (2%)
 Frame = +1

Query: 13  GEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLAL 192
           G  + AY++I  HG+     Y  Y  + G C ++ +           ++ N +  L   L
Sbjct: 291 GNSYFAYEYIRDHGVYRLASYP-YTAKSGPC-VEPLNEPRLTISRFGLSENPD--LPQLL 346

Query: 193 FKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNS 372
            ++GP++V + A +  + FYS+G+     C    DE++HAV+  G G  +   +WL+KNS
Sbjct: 347 KQYGPLTVYV-AVNVDWQFYSSGIL--DSC---ADEINHAVVLAGVGQDDDGPFWLIKNS 400

Query: 373 WSNMWGNDGYVLM----SMRENNCGVQSAPTY 456
           W   WG +GYV +    S  +N CG+     Y
Sbjct: 401 WGTSWGEEGYVRLARGSSAFDNECGLAHMALY 432


>UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p -
           Drosophila melanogaster (Fruit fly)
          Length = 431

 Score = 69.3 bits (162), Expect = 5e-11
 Identities = 33/94 (35%), Positives = 46/94 (48%), Gaps = 1/94 (1%)
 Frame = +1

Query: 163 NNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL- 339
           N E  +   +F  GP+   +   ++ F  YS GVY E     K     H+V  VG+G   
Sbjct: 320 NREADIMAEIFHSGPVQATM-RVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEH 378

Query: 340 NGHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQ 441
           NG KYW+  NSW + WG  GY  +    N CG++
Sbjct: 379 NGEKYWIAANSWGSWWGEHGYFRILRGSNECGIE 412


>UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15;
           Aphidoidea|Rep: Cathepsin B-S precursor - Tuberaphis
           styraci
          Length = 349

 Score = 68.9 bits (161), Expect = 6e-11
 Identities = 34/98 (34%), Positives = 52/98 (53%), Gaps = 1/98 (1%)
 Frame = +1

Query: 163 NNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFE-PKCKNKVDELDHAVLAVGYGVL 339
           N+   ++  L  +GP+  + D  +  FS Y +G+Y + PK K    E  H++  +G+G  
Sbjct: 234 NSIETIEQDLMTYGPVEASFDV-YDDFSVYKSGIYRKTPKAKY---EGGHSIKIIGWGEE 289

Query: 340 NGHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPT 453
           NG  YWL  NSWS  WG+ G   +    N CG++ A T
Sbjct: 290 NGTPYWLAVNSWSKFWGDHGTFKIIKGRNECGIERAVT 327


>UniRef50_Q26988 Cluster: Cysteine proteinase 4; n=1; Tritrichomonas
           foetus|Rep: Cysteine proteinase 4 - Tritrichomonas
           foetus (Trichomonas foetus)
          Length = 152

 Score = 68.9 bits (161), Expect = 6e-11
 Identities = 42/115 (36%), Positives = 62/115 (53%), Gaps = 3/115 (2%)
 Frame = +1

Query: 10  GGEDFRAYQWIMK--HGLPTEEDYGGYLGQD-GYCHIDNVTAITKITGWVNVTTNNENAL 180
           GG  F A+ +I +  +G    ED   Y G D   C  D      +ITG+++V   +E  L
Sbjct: 39  GGSPFSAFMFISRTQNGQINLEDDYPYTGTDTNDCKFDPSKGYGRITGFMSVQAQSEEDL 98

Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNG 345
              +   GPI+V IDA+  +F+ YS+G+Y + +C + V  LDHAV  +GYG   G
Sbjct: 99  FKCVASVGPIAVCIDASLASFNSYSSGIYNDRQCSSTV--LDHAVGCIGYGAEGG 151


>UniRef50_Q231X3 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 323

 Score = 68.9 bits (161), Expect = 6e-11
 Identities = 51/153 (33%), Positives = 81/153 (52%), Gaps = 2/153 (1%)
 Frame = +1

Query: 13  GEDFRAYQWIMKHGLPTEEDYGGYLGQDGY-CHIDNVTAITKIT-GWVNVTTNNENALKL 186
           G+  +A  +I ++ + TE++Y  Y  +D   C+ DN   I   T   + +   + N L  
Sbjct: 184 GQKEQALVYIKRYSITTEQNYP-YTEKDVQKCYFDNTKHIPNYTISDIKIVKASTNDLVE 242

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
           AL K  P++V++DA +  + +Y  GV+ +  CK      +HAVL VG+   NG   WLVK
Sbjct: 243 AL-KIQPVAVSVDATN--WKYYKGGVFSD--CKTYYH--NHAVLLVGFQ--NGT--WLVK 291

Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           NS+   WG +GY+ +    N CGV + P   +I
Sbjct: 292 NSYGTNWGENGYIRLK-NGNTCGVANQPYQPII 323


>UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathepsin
           o - Aedes aegypti (Yellowfever mosquito)
          Length = 375

 Score = 68.9 bits (161), Expect = 6e-11
 Identities = 33/99 (33%), Positives = 60/99 (60%)
 Frame = +1

Query: 163 NNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN 342
           + E+ +   L  HGPI  A++AA  ++ +Y  GV  +  C+   ++L+HAV  VGY + +
Sbjct: 277 DREHLMLRYLATHGPIVAAVNAA--SWKYYLGGV-IQYHCEEAYEDLNHAVEIVGYNLES 333

Query: 343 GHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYV 459
              Y+LVKNSW   +G+ GY+ + + +N CG+ +  +++
Sbjct: 334 QIPYYLVKNSWGPRFGDRGYIKIQVGKNLCGIANRVSFI 372


>UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_36,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 307

 Score = 68.9 bits (161), Expect = 6e-11
 Identities = 47/145 (32%), Positives = 75/145 (51%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    A  +I + G   E DY  Y  +DG C +       +I G  N   N E+A+K  
Sbjct: 171 GGNSDLALDYIAEVGSVYERDYE-YTAKDGVCKVKQ--GKVRIAGRENYGPN-EDAIKKG 226

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           +  + P+SV++DA +  + FY+ GVY +  C++  D  +HAV+AVG+        W ++N
Sbjct: 227 IQNY-PLSVSVDATY--WKFYNQGVY-DGACRD--DFHNHAVVAVGFDYAGN---WKIRN 277

Query: 370 SWSNMWGNDGYVLMSMRENNCGVQS 444
           SW   WG  G++ +    N+C V +
Sbjct: 278 SWGEGWGEQGHIWLK-PGNSCAVMT 301


>UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1)
           (Cathepsin B1) (APP secretase) (APPS) [Contains:
           Cathepsin B light chain; Cathepsin B heavy chain]; n=85;
           Eukaryota|Rep: Cathepsin B precursor (EC 3.4.22.1)
           (Cathepsin B1) (APP secretase) (APPS) [Contains:
           Cathepsin B light chain; Cathepsin B heavy chain] - Homo
           sapiens (Human)
          Length = 339

 Score = 68.9 bits (161), Expect = 6e-11
 Identities = 32/97 (32%), Positives = 58/97 (59%), Gaps = 2/97 (2%)
 Frame = +1

Query: 160 TNNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDEL--DHAVLAVGYG 333
           +N+E  +   ++K+GP+  A  + +  F  Y +GVY     ++   E+   HA+  +G+G
Sbjct: 233 SNSEKDIMAEIYKNGPVEGAF-SVYSDFLLYKSGVY-----QHVTGEMMGGHAIRILGWG 286

Query: 334 VLNGHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQS 444
           V NG  YWLV NSW+  WG++G+  +   +++CG++S
Sbjct: 287 VENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIES 323


>UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3;
           Eukaryota|Rep: Cathepsin-like cysteine protease -
           Phytophthora infestans (Potato late blight fungus)
          Length = 635

 Score = 68.1 bits (159), Expect = 1e-10
 Identities = 31/95 (32%), Positives = 53/95 (55%)
 Frame = +1

Query: 157 TTNNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV 336
           TT  E  +   ++  GPI+ ++ A    F  YS G++ +   K    ++DHA+  VG+G 
Sbjct: 203 TTLGEQQMMAEIYARGPIACSV-AVTDGFLKYSGGIFDD---KTNATDVDHAISIVGWGE 258

Query: 337 LNGHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQ 441
            NG  +W+++NSW + WG  G++ +    NN GV+
Sbjct: 259 ENGVPFWVLRNSWGSFWGESGWMRLVRGVNNVGVE 293



 Score = 59.3 bits (137), Expect = 5e-08
 Identities = 27/88 (30%), Positives = 45/88 (51%)
 Frame = +1

Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357
           +K  ++K GPI   + A  K F  Y+ G+Y E      ++  + +V   GY      +YW
Sbjct: 508 MKAEIYKRGPIGCGVHATSK-FESYTGGIYSEHVMFPLINH-EISVAGWGYDEETDTEYW 565

Query: 358 LVKNSWSNMWGNDGYVLMSMRENNCGVQ 441
           + +NSW   WG +G+  + M  NN G++
Sbjct: 566 IGRNSWGTYWGENGWFRIQMHHNNLGIE 593


>UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4;
           Hymenostomatida|Rep: Phospholipase A1 - Tetrahymena
           thermophila
          Length = 320

 Score = 68.1 bits (159), Expect = 1e-10
 Identities = 47/152 (30%), Positives = 77/152 (50%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG     +++I+ + +    +Y  Y  +DG C   +      I+ +  +   + N+L  A
Sbjct: 183 GGWMVEGFKYIIDNKISQTANYP-YTAKDGKCKDTSSFKKFSISKYAEIPQGDCNSLNSA 241

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           L + GPISVA+DA +  F FY++GV+     KN    L+H VL V     N      +KN
Sbjct: 242 L-EQGPISVAVDATN--FQFYTSGVF-----KNCKANLNHGVLLVA----NVDSSLKIKN 289

Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
           SW   WG  G++ ++   N CGV +A +Y ++
Sbjct: 290 SWGPSWGEKGFIRLA-AGNTCGVCNAASYPIV 320


>UniRef50_Q24FA8 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 335

 Score = 68.1 bits (159), Expect = 1e-10
 Identities = 50/143 (34%), Positives = 78/143 (54%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189
           GG    A Q+    G+ ++ +Y  Y G  G C+I + T   +   +  +    E  L+ A
Sbjct: 195 GGVPSDAVQYAADFGVLSDNEYP-YTGIQGQCNITSKTNGFQPVQFSYLDGTAEG-LRKA 252

Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369
           L  +GP+SVA+DA++     Y++GV+    C +K   L+HAVLAVGY    G+  W++KN
Sbjct: 253 L-NYGPVSVAMDASN--MKEYTSGVF--NNCTSKQFNLNHAVLAVGYDE-EGN--WIIKN 304

Query: 370 SWSNMWGNDGYVLMSMRENNCGV 438
           S    WG +GY L++   N CG+
Sbjct: 305 SKGPNWGMEGYFLLA-PGNTCGI 326


>UniRef50_A0DZB1 Cluster: Chromosome undetermined scaffold_7, whole
           genome shotgun sequence; n=4; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_7,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 500

 Score = 68.1 bits (159), Expect = 1e-10
 Identities = 41/159 (25%), Positives = 80/159 (50%), Gaps = 14/159 (8%)
 Frame = +1

Query: 10  GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVT-------TNN 168
           GG  F   ++  +  L TE+ Y  Y G  G C   + +  +K+ G  N          +N
Sbjct: 315 GGYPFLVEKFASEQYLVTEQQYP-YKGDVGTCKKIDFSQSSKVYGAKNYKYIGGGYGLSN 373

Query: 169 ENALKLALFKHGPISVAIDAAHKTFSFYSNGVY-------FEPKCKNKVDELDHAVLAVG 327
           E  + + L+ +GP+ +  + ++  F +Y +G+Y       +  + + + +++DH+VL  G
Sbjct: 374 ERDIMMELYTNGPVIMNFEPSYD-FMYYESGIYHSVAEHDWSTQERPEWEKVDHSVLCYG 432

Query: 328 YGVLNGHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQS 444
           +G  +G K+WL++NSW + WG +G   M    +   ++S
Sbjct: 433 WGEEDGVKFWLLQNSWGSQWGENGSFRMKRGVDESAIES 471


>UniRef50_Q23FQ5 Cluster: Papain family cysteine protease containing
           protein; n=4; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 332

 Score = 67.7 bits (158), Expect = 1e-10
 Identities = 47/145 (32%), Positives = 78/145 (53%), Gaps = 1/145 (0%)
 Frame = +1

Query: 31  YQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTN-NENALKLALFKHGP 207
           +++ +K+G+     Y  Y+G    C   N + ++K         N N + +K A+   GP
Sbjct: 200 FKYAIKYGIVQGSSYP-YVGYQTTCK--NTSNLSKYFPQSFKFINPNASDVKAAI-SQGP 255

Query: 208 ISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMW 387
           ISV +DA+  T+S YS G++    C + + +L+HAV+AVGY     +   +++N W   W
Sbjct: 256 ISVTVDAS--TWSSYSGGIF--NGCNSNI-QLNHAVIAVGYDTQGNY---IIRNHWGTGW 307

Query: 388 GNDGYVLMSMRENNCGVQSAPTYVL 462
           G  GY+ +S   NNCGV ++   VL
Sbjct: 308 GEKGYMRLS-ANNNCGVLTSVIQVL 331


>UniRef50_Q235G6 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 325

 Score = 67.7 bits (158), Expect = 1e-10
 Identities = 45/141 (31%), Positives = 75/141 (53%)
 Frame = +1

Query: 7   GGGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
           GGG    A  ++ + GL TEE+Y  Y  ++G C +   +    I+G+  +   ++  L  
Sbjct: 180 GGGLRDIALNYVKETGLTTEEEYS-YEAKNGKCRLQGKSNPYTISGFTAIKQCSD--LVN 236

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
           A+ K  P++V ID+++    FY+NG++    C  K++   H VL VGY  +   + W VK
Sbjct: 237 AIQK-APVTVGIDSSN--LQFYTNGIF--SNCGTKIN---HGVLLVGYDSVK--EAWKVK 286

Query: 367 NSWSNMWGNDGYVLMSMRENN 429
           NSW   +G  GY+ +S +  N
Sbjct: 287 NSWGPKFGEGGYIYLSAKITN 307


>UniRef50_A0DIY3 Cluster: Chromosome undetermined scaffold_52, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_52,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 512

 Score = 67.3 bits (157), Expect = 2e-10
 Identities = 31/84 (36%), Positives = 46/84 (54%)
 Frame = +1

Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357
           +K+ +F  GPI   + A  +    Y  G  F  K    +  L+H V  VG+GV +G +YW
Sbjct: 418 MKIEIFNRGPIVCGVYATQE-LDDYEGGYIFSQKTNKTI--LNHYVSVVGWGVEDGVEYW 474

Query: 358 LVKNSWSNMWGNDGYVLMSMRENN 429
           +V+NSW + WG+ GY  M M  +N
Sbjct: 475 IVRNSWGSYWGDMGYAKMKMHSDN 498


>UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza
           sativa|Rep: OSJNBb0085F13.15 protein - Oryza sativa
           subsp. japonica (Rice)
          Length = 383

 Score = 66.9 bits (156), Expect = 3e-10
 Identities = 52/167 (31%), Positives = 84/167 (50%), Gaps = 12/167 (7%)
 Frame = +1

Query: 1   ARGGGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYC--HIDNVTAITKITGWVNVTTNNE 171
           ++G  +D  A+ W+ K+ G+ ++  Y  Y+G    C   +  V   T + G V +  N E
Sbjct: 224 SKGYSDD--AFLWVSKNKGIASDLIYP-YVGHKESCKKQLLGVHNAT-VRGVVTLPENRE 279

Query: 172 NALKLALFKHGPISVAIDAAHKTFSFY-SNGVYFEPK-CKNKVDELDHAVLAVGYGVLN- 342
           + +  A+ +  P++V  DA    F  Y  NGVY     C   V+   HA+  VGYG  + 
Sbjct: 280 DLIMAAVARQ-PVAVVFDAGDPLFQNYRGNGVYKGGTGCSTNVN---HALTIVGYGTNHP 335

Query: 343 --GHKYWLVKNSWSNMWGNDGYVLMSM----RENNCGVQSAPTYVLI 465
             G  YW+ KNS+ N+WG++G+V ++     R   CG+   PT+  I
Sbjct: 336 DTGENYWIAKNSYGNLWGDNGFVYLAKDTADRTGVCGLAIWPTFPTI 382


  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 530,075,216
Number of Sequences: 1657284
Number of extensions: 10171785
Number of successful extensions: 28752
Number of sequences better than 10.0: 500
Number of HSP's better than 10.0 without gapping: 27642
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 28309
length of database: 575,637,011
effective HSP length: 95
effective length of database: 418,195,031
effective search space used: 31782822356
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
