Last updated: 2022/11/18
BLASTX 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= fner12p14f
         (584 letters)

Database: uniref50 
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-...   283   2e-75
UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ...   276   3e-73
UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhip...   243   3e-63
UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; ...   225   5e-58
UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome s...   221   7e-57
UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ...   193   2e-48
UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve...   187   1e-46
UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve...   169   6e-41
UniRef50_UPI000155637A Cluster: PREDICTED: similar to ENSANGP000...   165   7e-40
UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v...   162   5e-39
UniRef50_Q24E33 Cluster: Papain family cysteine protease contain...   134   1e-30
UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox...   128   7e-29
UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n...   126   3e-28
UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei...   126   4e-28
UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; ...   125   7e-28
UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j...   125   9e-28
UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt...   124   2e-27
UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ...   124   2e-27
UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1...   124   2e-27
UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi...   124   2e-27
UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ...   120   2e-26
UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs...   120   3e-26
UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h...   119   4e-26
UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=...   119   6e-26
UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata...   119   6e-26
UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Ac...   118   8e-26
UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ...   118   8e-26
UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re...   118   1e-25
UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=...   118   1e-25
UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [C...   118   1e-25
UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ...   117   2e-25
UniRef50_Q239L8 Cluster: Papain family cysteine protease contain...   117   2e-25
UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar ...   116   3e-25
UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;...   116   3e-25
UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ...   116   3e-25
UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re...   116   4e-25
UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy...   116   4e-25
UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C...   116   4e-25
UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;...   116   5e-25
UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ...   115   7e-25
UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca...   115   9e-25
UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35...   114   1e-24
UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory...   114   1e-24
UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy...   114   2e-24
UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat...   114   2e-24
UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr...   114   2e-24
UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ...   114   2e-24
UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18...   114   2e-24
UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ...   113   2e-24
UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae...   113   3e-24
UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|...   113   3e-24
UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryz...   112   5e-24
UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep...   112   7e-24
UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa...   112   7e-24
UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryz...   112   7e-24
UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; D...   112   7e-24
UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel...   112   7e-24
UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph...   111   1e-23
UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ...   110   3e-23
UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi...   110   3e-23
UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er...   110   3e-23
UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal...   109   4e-23
UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ...   109   4e-23
UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ...   109   4e-23
UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ...   108   8e-23
UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e...   108   8e-23
UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty...   107   1e-22
UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|...   107   1e-22
UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3...   107   2e-22
UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n...   107   3e-22
UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; ...   107   3e-22
UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ...   107   3e-22
UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Re...   106   3e-22
UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:...   106   4e-22
UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus...   106   4e-22
UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;...   105   6e-22
UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; ...   105   8e-22
UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio...   105   1e-21
UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain...   105   1e-21
UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ...   105   1e-21
UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n...   104   1e-21
UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M...   104   1e-21
UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep...   103   2e-21
UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc...   103   2e-21
UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t...   103   3e-21
UniRef50_Q22A69 Cluster: Papain family cysteine protease contain...   103   4e-21
UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil...   102   5e-21
UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip...   102   5e-21
UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcopt...   102   7e-21
UniRef50_Q22RC9 Cluster: Papain family cysteine protease contain...   102   7e-21
UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ...   101   9e-21
UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n...   101   9e-21
UniRef50_Q22W19 Cluster: Papain family cysteine protease contain...   101   9e-21
UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic...   101   9e-21
UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ...   101   1e-20
UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ...   101   1e-20
UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|...   101   1e-20
UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat...   101   2e-20
UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa...   100   3e-20
UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr...   100   3e-20
UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ...    99   4e-20
UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted...    99   4e-20
UniRef50_Q23EG5 Cluster: Papain family cysteine protease contain...    99   4e-20
UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|...   100   5e-20
UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb...   100   5e-20
UniRef50_Q24HI2 Cluster: Papain family cysteine protease contain...    99   7e-20
UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|...    99   7e-20
UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca...    99   9e-20
UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine p...    98   1e-19
UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, w...    98   1e-19
UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza sativa...    98   2e-19
UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n...    98   2e-19
UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n...    98   2e-19
UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cy...    98   2e-19
UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus...    97   2e-19
UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio...    97   2e-19
UniRef50_Q23RT7 Cluster: Papain family cysteine protease contain...    97   4e-19
UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicate...    97   4e-19
UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ...    97   4e-19
UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto...    97   4e-19
UniRef50_O16454 Cluster: Temporarily assigned gene name protein ...    96   5e-19
UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh...    96   6e-19
UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi...    96   6e-19
UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin ...    95   8e-19
UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease ...    95   8e-19
UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum...    95   8e-19
UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, wh...    95   8e-19
UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ...    95   1e-18
UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ...    95   1e-18
UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr...    95   1e-18
UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C...    95   1e-18
UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ...    94   2e-18
UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryo...    94   2e-18
UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:...    94   2e-18
UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep: L...    94   3e-18
UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ...    94   3e-18
UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R...    94   3e-18
UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia ...    93   3e-18
UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, wh...    93   3e-18
UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R...    93   4e-18
UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No...    93   6e-18
UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl...    93   6e-18
UniRef50_Q248G1 Cluster: Papain family cysteine protease contain...    93   6e-18
UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheir...    93   6e-18
UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen...    93   6e-18
UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Pl...    92   8e-18
UniRef50_Q23TW3 Cluster: Papain family cysteine protease contain...    92   8e-18
UniRef50_Q235G6 Cluster: Papain family cysteine protease contain...    92   8e-18
UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG...    92   1e-17
UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lambli...    92   1e-17
UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, wh...    92   1e-17
UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 p...    91   1e-17
UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain...    91   1e-17
UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy...    91   1e-17
UniRef50_Q231X3 Cluster: Papain family cysteine protease contain...    91   2e-17
UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica...    91   2e-17
UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain ...    91   2e-17
UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; ...    90   3e-17
UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov...    90   3e-17
UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain...    90   4e-17
UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lambli...    89   5e-17
UniRef50_Q22NW9 Cluster: Papain family cysteine protease contain...    89   5e-17
UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh...    89   7e-17
UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia...    89   9e-17
UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba hi...    88   1e-16
UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n...    88   1e-16
UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:...    88   2e-16
UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8; Magnoliophyt...    87   2e-16
UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis...    87   2e-16
UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]...    87   2e-16
UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome s...    87   3e-16
UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestiv...    87   3e-16
UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D...    87   3e-16
UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadida...    87   4e-16
UniRef50_Q23H32 Cluster: Papain family cysteine protease contain...    87   4e-16
UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain...    85   9e-16
UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ...    85   9e-16
UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus ...    85   1e-15
UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ...    85   2e-15
UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ...    85   2e-15
UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa...    84   2e-15
UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Try...    84   2e-15
UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foet...    84   2e-15
UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep...    83   4e-15
UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium...    83   4e-15
UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G...    83   5e-15
UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain...    83   5e-15
UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ...    83   6e-15
UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sa...    83   6e-15
UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto...    83   6e-15
UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain...    82   8e-15
UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease ...    82   1e-14
UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ...    82   1e-14
UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckei...    82   1e-14
UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru...    82   1e-14
UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ...    81   1e-14
UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin ...    81   1e-14
UniRef50_Q23H10 Cluster: Papain family cysteine protease contain...    81   1e-14
UniRef50_A7APS9 Cluster: Papain family cysteine protease contain...    81   1e-14
UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; ...    81   2e-14
UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1; ...    81   3e-14
UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain...    81   3e-14
UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; The...    81   3e-14
UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ...    80   3e-14
UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like cy...    80   3e-14
UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endop...    80   3e-14
UniRef50_P56202 Cluster: Cathepsin W precursor; n=15; Eutheria|R...    80   3e-14
UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: ...    80   4e-14
UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy...    80   4e-14
UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w...    80   4e-14
UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin ...    79   6e-14
UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Re...    79   6e-14
UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaste...    79   8e-14
UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteina...    79   8e-14
UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precu...    79   8e-14
UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome s...    79   1e-13
UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali...    79   1e-13
UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, who...    79   1e-13
UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2...    79   1e-13
UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa...    78   1e-13
UniRef50_Q23894 Cluster: Cysteine proteinase 3; n=2; Dictyosteli...    78   1e-13
UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain...    78   2e-13
UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, wh...    78   2e-13
UniRef50_Q23FS6 Cluster: Papain family cysteine protease contain...    77   2e-13
UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,...    77   3e-13
UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|...    77   3e-13
UniRef50_Q3L7L2 Cluster: Sar s 1 allergen SMIPP-C Yv6008G08; n=2...    77   3e-13
UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy...    77   3e-13
UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, wh...    77   3e-13
UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precurs...    77   3e-13
UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;...    77   4e-13
UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dv...    77   4e-13
UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, w...    77   4e-13
UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease ...    76   5e-13
UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine prot...    76   7e-13
UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Re...    75   1e-12
UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi...    75   1e-12
UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ...    75   2e-12
UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain...    74   2e-12
UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:...    74   2e-12
UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1...    74   3e-12
UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H...    74   3e-12
UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-li...    74   3e-12
UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, wh...    73   4e-12
UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Pl...    73   4e-12
UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-...    73   5e-12
UniRef50_A3EXS2 Cluster: Cathepsin L-like cysteine proteinase-li...    73   5e-12
UniRef50_A2DKX2 Cluster: Clan CA, family C1, cathepsin L-like cy...    73   5e-12
UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n...    73   7e-12
UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis...    73   7e-12
UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2; Arab...    71   2e-11
UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster...    71   2e-11
UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n...    71   2e-11
UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:...    71   2e-11
UniRef50_Q23H06 Cluster: Papain family cysteine protease contain...    71   2e-11
UniRef50_A0CHZ5 Cluster: Chromosome undetermined scaffold_186, w...    71   2e-11
UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryz...    71   3e-11
UniRef50_Q5C1E8 Cluster: SJCHGC04937 protein; n=1; Schistosoma j...    71   3e-11
UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theil...    71   3e-11
UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole...    70   4e-11
UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schist...    70   4e-11
UniRef50_Q23H15 Cluster: Papain family cysteine protease contain...    70   5e-11
UniRef50_Q5YER1 Cluster: Cysteine proteinase; n=1; Bigelowiella ...    69   6e-11
UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big...    69   6e-11
UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like cy...    69   6e-11
UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2; cel...    69   8e-11
UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella ve...    69   1e-10
UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like cy...    69   1e-10
UniRef50_A0CS14 Cluster: Chromosome undetermined scaffold_26, wh...    69   1e-10
UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease ...    68   1e-10
UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;...    67   2e-10
UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1; ...    67   2e-10
UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Pl...    67   2e-10
UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin ...    67   3e-10
UniRef50_Q3L7L6 Cluster: Sar s 1 allergen Yv5032C08; n=1; Sarcop...    67   3e-10
UniRef50_Q6DGW1 Cluster: 26-29kD-proteinase protein; n=23; Danio...    66   4e-10
UniRef50_Q7KWP5 Cluster: Similar to Dictyostelium discoideum (Sl...    66   4e-10
UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin ...    66   8e-10
UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyl...    66   8e-10
UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Pir...    66   8e-10
UniRef50_Q22DX2 Cluster: Papain family cysteine protease contain...    66   8e-10
UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus co...    65   1e-09
UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Ca...    65   1e-09
UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; ...    64   2e-09
UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathep...    64   2e-09
UniRef50_A0BLR4 Cluster: Chromosome undetermined scaffold_115, w...    64   2e-09
UniRef50_UPI0001509FF9 Cluster: Papain family cysteine protease ...    64   2e-09
UniRef50_Q9XXQ7 Cluster: Putative uncharacterized protein; n=2; ...    64   2e-09
UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi...    64   3e-09
UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2; ...    63   4e-09
UniRef50_Q650W8 Cluster: Putative cysteine proteinase; n=2; Oryz...    63   5e-09
UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n...    63   5e-09
UniRef50_Q17IZ6 Cluster: Procathepsin L3, putative; n=2; Culicid...    62   7e-09
UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; M...    62   7e-09
UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 ...    62   7e-09
UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;...    62   9e-09
UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcop...    62   9e-09
UniRef50_A0EI50 Cluster: Chromosome undetermined scaffold_98, wh...    62   9e-09
UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: C...    62   1e-08
UniRef50_A5HC50 Cluster: Cathepsin W; n=2; Theria|Rep: Cathepsin...    62   1e-08
UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoid...    62   1e-08
UniRef50_Q0JP65 Cluster: Os01g0240900 protein; n=3; Oryza sativa...    61   2e-08
UniRef50_Q18740 Cluster: Putative uncharacterized protein tag-32...    61   2e-08
UniRef50_Q9BL26 Cluster: Putative uncharacterized protein; n=4; ...    61   2e-08
UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcop...    61   2e-08
UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|...    61   2e-08
UniRef50_A0DKP6 Cluster: Chromosome undetermined scaffold_54, wh...    61   2e-08
UniRef50_A0LES1 Cluster: Peptidase C1A, papain precursor; n=2; S...    60   3e-08
UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpw...    60   3e-08
UniRef50_Q60EQ9 Cluster: Putative uncharacterized protein OJ1280...    60   4e-08
UniRef50_Q2QS15 Cluster: Papain family cysteine protease contain...    60   4e-08
UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep...    60   4e-08
UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba hi...    60   4e-08
UniRef50_A0EAG2 Cluster: Chromosome undetermined scaffold_86, wh...    60   4e-08
UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase pr...    60   4e-08
UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Gi...    60   4e-08
UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; R...    60   5e-08
UniRef50_Q237A1 Cluster: Papain family cysteine protease contain...    60   5e-08
UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8....    60   5e-08
UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-li...    60   5e-08
UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag...    60   5e-08
UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1; ...    59   7e-08
UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like cy...    59   7e-08
UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p; ...    59   9e-08
UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; The...    59   9e-08
UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3....    59   9e-08
UniRef50_A2Q4E7 Cluster: Peptidase C1A, papain; n=1; Medicago tr...    58   1e-07
UniRef50_Q93512 Cluster: Putative uncharacterized protein; n=2; ...    58   2e-07
UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4; Ancylostomati...    58   2e-07
UniRef50_Q3L7L7 Cluster: Sar s 1 allergen Yv5020C01; n=1; Sarcop...    58   2e-07
UniRef50_A6N590 Cluster: Extracellular cysteine protease 8; n=10...    58   2e-07
UniRef50_A1XG92 Cluster: Putative cathepsin B-like like proteina...    58   2e-07
UniRef50_UPI000155D183 Cluster: PREDICTED: similar to Cathepsin ...    58   2e-07
UniRef50_Q75JH0 Cluster: Similar to Dictyostelium discoideum (Sl...    58   2e-07
UniRef50_A0BNM1 Cluster: Chromosome undetermined scaffold_119, w...    58   2e-07
UniRef50_Q0AY53 Cluster: Putative uncharacterized protein; n=1; ...    57   3e-07
UniRef50_Q3L7L0 Cluster: Sar s 1 allergen SMIPP-C Yv5009F04; n=3...    57   3e-07
UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep: ...    57   4e-07
UniRef50_Q54VR1 Cluster: Putative uncharacterized protein; n=1; ...    57   4e-07
UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7...    57   4e-07
UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=...    57   4e-07
UniRef50_Q94715 Cluster: Putative cathepsin L2 precursor; n=4; P...    57   4e-07
UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome sh...    56   5e-07
UniRef50_Q01E11 Cluster: Cysteine proteinase Cathepsin F; n=3; O...    56   5e-07
UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=...    56   5e-07
UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|R...    56   6e-07
UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7; Rhabditida|...    56   6e-07
UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3; ...    56   8e-07
UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase pr...    56   8e-07
UniRef50_Q9XZI2 Cluster: Cathepsin Z1 preproprotein; n=3; Bilate...    55   1e-06
UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ...    55   1e-06
UniRef50_Q23FP9 Cluster: Papain family cysteine protease contain...    55   1e-06
UniRef50_Q650X9 Cluster: Putative cysteine proteinase; n=1; Oryz...    54   2e-06
UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestin...    54   2e-06
UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome s...    54   3e-06
UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; ...    53   4e-06
UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8; Leishma...    52   8e-06
UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1; Rhip...    52   1e-05
UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep...    52   1e-05
UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcop...    52   1e-05
UniRef50_A0DZB1 Cluster: Chromosome undetermined scaffold_7, who...    52   1e-05
UniRef50_Q015J8 Cluster: Cathepsin; n=2; Ostreococcus|Rep: Cathe...    52   1e-05
UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lambl...    51   2e-05
UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4; Caenorhabdi...    51   2e-05
UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115, w...    51   2e-05
UniRef50_Q8TQM7 Cluster: Putative uncharacterized protein; n=1; ...    51   2e-05
UniRef50_Q9UBR2 Cluster: Cathepsin Z precursor; n=40; Bilateria|...    51   2e-05
UniRef50_UPI00006CBCFB Cluster: Papain family cysteine protease ...    51   2e-05
UniRef50_UPI00006CB653 Cluster: Papain family cysteine protease ...    51   2e-05
UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3 ...    51   2e-05
UniRef50_A6GBK8 Cluster: Peptidase C1A, papain; n=1; Plesiocysti...    50   3e-05
UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein...    50   3e-05
UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8...    50   3e-05
UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|R...    50   3e-05
UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8....    50   3e-05
UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator ame...    50   3e-05
UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4 ...    50   3e-05
UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;...    50   4e-05
UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomus...    50   4e-05
UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n...    50   4e-05
UniRef50_UPI0000E815AE Cluster: PREDICTED: similar to glucocorti...    50   5e-05
UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma...    50   5e-05
UniRef50_UPI0000E492F4 Cluster: PREDICTED: similar to cathepsin ...    49   7e-05
UniRef50_Q6A1H9 Cluster: Cathepsin X/O; n=1; Suberites domuncula...    49   7e-05
UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3; Theil...    49   7e-05
UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1 ...    49   7e-05
UniRef50_Q3A8I3 Cluster: Putative serine protease; n=1; Pelobact...    49   9e-05
UniRef50_Q53K53 Cluster: Cysteine protease 1, putative; n=5; Ory...    49   9e-05
UniRef50_A4VE98 Cluster: Cathepsin z; n=3; Tetrahymena thermophi...    49   9e-05
UniRef50_Q6XM39 Cluster: FirrV-1-A48 precursor; n=1; Feldmannia ...    48   1e-04
UniRef50_Q945E4 Cluster: Cysteine proteinase; n=1; Vasconcellea ...    48   1e-04
UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Catheps...    48   1e-04
UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intest...    48   1e-04
UniRef50_Q54R55 Cluster: Putative uncharacterized protein; n=1; ...    48   1e-04
UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep: Ca...    48   1e-04
UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina...    48   1e-04
UniRef50_Q8PS79 Cluster: Putative uncharacterized protein; n=1; ...    48   1e-04
UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA,...    48   2e-04
UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like prote...    48   2e-04
UniRef50_Q5CYC8 Cluster: Cathepsin like thiol protease possibly ...    47   3e-04
UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|R...    47   3e-04
UniRef50_UPI0000498E2F Cluster: cysteine proteinase; n=1; Entamo...    47   4e-04
UniRef50_Q8QNJ8 Cluster: EsV-1-75; n=1; Ectocarpus siliculosus v...    46   5e-04
UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1; ...    46   5e-04
UniRef50_UPI0000E4866E Cluster: PREDICTED: similar to oxidized-L...    46   7e-04
UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n...    46   7e-04
UniRef50_Q649T1 Cluster: Cathepsin C; n=1; uncultured archaeon G...    46   7e-04
UniRef50_UPI00015B5D85 Cluster: PREDICTED: similar to cathepsin ...    46   9e-04
UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin ...    46   9e-04
UniRef50_UPI00006CBB5F Cluster: Papain family cysteine protease ...    46   9e-04
UniRef50_Q9XZM9 Cluster: Cysteine proteinase CPW2; n=1; Acantham...    46   9e-04
UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2...    46   9e-04
UniRef50_A0LQG9 Cluster: Peptidase C1A, papain; n=1; Syntrophoba...    45   0.001
UniRef50_Q7QQ83 Cluster: GLP_42_16392_14707; n=1; Giardia lambli...    45   0.001
UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma j...    45   0.001
UniRef50_A0BZE1 Cluster: Chromosome undetermined scaffold_139, w...    45   0.001
UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor; ...    45   0.002
UniRef50_Q4N5Z7 Cluster: Cysteine proteinase, putative; n=2; The...    45   0.002
UniRef50_A3FQ13 Cluster: Cathepsin like thiol protease possibly ...    45   0.002
UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cy...    45   0.002
UniRef50_UPI00006CBAC7 Cluster: Papain family cysteine protease ...    44   0.002
UniRef50_Q2NG83 Cluster: Member of asn/thr-rich large protein fa...    44   0.003
UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2 ...    44   0.003
UniRef50_Q9GZM7-2 Cluster: Isoform 2 of Q9GZM7 ; n=1; Homo sapie...    44   0.004
UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gamb...    44   0.004
UniRef50_A5K8Y0 Cluster: Preprocathepsin c, putative; n=1; Plasm...    44   0.004
UniRef50_Q2FLD5 Cluster: PKD precursor; n=1; Methanospirillum hu...    44   0.004
UniRef50_Q3LFN3 Cluster: Cysteine proteinase; n=1; Dianthus cary...    43   0.005
UniRef50_Q4UCF5 Cluster: Cysteine proteinase, tacP, putative; n=...    43   0.006
UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-li...    43   0.006
UniRef50_Q7M1Q7 Cluster: Actinidain; n=1; Actinidia chinensis|Re...    42   0.011
UniRef50_Q01FL5 Cluster: Cysteine protease-5; n=3; Ostreococcus|...    42   0.011
UniRef50_A0DTZ2 Cluster: Chromosome undetermined scaffold_63, wh...    42   0.011
UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia...    42   0.014
UniRef50_Q4PGS1 Cluster: Putative uncharacterized protein; n=1; ...    42   0.014
UniRef50_Q70SU8 Cluster: Cystein proteinase inhibitor protein pr...    41   0.019
UniRef50_A4S004 Cluster: Predicted protein; n=2; Ostreococcus|Re...    41   0.019
UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep...    41   0.019
UniRef50_Q8TKH5 Cluster: Cell surface protein; n=3; Methanosarci...    41   0.019
UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA...    41   0.025
UniRef50_Q5YER6 Cluster: Cathepsin Z; n=1; Bigelowiella natans|R...    41   0.025
UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1; O...    40   0.033
UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2; ...    40   0.033
UniRef50_Q8I0V1 Cluster: Preprocathepsin c, putative; n=1; Plasm...    40   0.033
UniRef50_Q7M4N9 Cluster: Dipeptidyl-peptidase I; n=1; Homo sapie...    40   0.033
UniRef50_Q8TMY7 Cluster: Cell surface protein; n=2; Methanosarci...    40   0.033
UniRef50_Q9SIE8 Cluster: Putative cysteine proteinase; n=1; Arab...    40   0.043
UniRef50_A4RYN2 Cluster: Predicted protein; n=1; Ostreococcus lu...    40   0.043
UniRef50_Q962W3 Cluster: Cysteine protease; n=3; Giardia intesti...    40   0.043
UniRef50_Q7R5X2 Cluster: GLP_81_104117_102504; n=1; Giardia lamb...    40   0.043
UniRef50_Q26989 Cluster: Cysteine proteinase 5; n=1; Tritrichomo...    40   0.043
UniRef50_UPI0000E48EBC Cluster: PREDICTED: hypothetical protein;...    40   0.057
UniRef50_Q42312 Cluster: Cysteine protease; n=1; Arabidopsis tha...    40   0.057
UniRef50_UPI0000E46ABB Cluster: PREDICTED: similar to SCO-spondi...    39   0.076
UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - ...    39   0.076
UniRef50_A2F4G0 Cluster: Clan CA, family C1, cathepsin L-like cy...    39   0.076
UniRef50_Q8TQ91 Cluster: Putative uncharacterized protein; n=1; ...    39   0.076
UniRef50_A1SVF0 Cluster: Peptidase C1A, papain; n=1; Psychromona...    39   0.100
UniRef50_A7LFV3 Cluster: Cathepsin L protease inhibitor 2; n=1; ...    39   0.100
UniRef50_A5Z7Z2 Cluster: Putative uncharacterized protein; n=1; ...    38   0.13 
UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia...    38   0.13 
UniRef50_Q5DEC9 Cluster: SJCHGC06356 protein; n=1; Schistosoma j...    38   0.13 
UniRef50_Q9NHY1 Cluster: Cysteine protease cp2; n=1; Theileria c...    38   0.17 
UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep...    38   0.17 
UniRef50_Q6CPZ4 Cluster: Kluyveromyces lactis strain NRRL Y-1140...    38   0.17 
UniRef50_Q61CG7 Cluster: Putative uncharacterized protein CBG129...    38   0.23 
UniRef50_Q5Y801 Cluster: Cysteine proteinase; n=1; Petunia x hyb...    37   0.30 
UniRef50_Q01FU9 Cluster: Cathepsin Z; n=2; Ostreococcus|Rep: Cat...    37   0.30 
UniRef50_Q9VTF1 Cluster: CG32071-PA; n=2; Drosophila melanogaste...    37   0.30 
UniRef50_Q54MB6 Cluster: Putative uncharacterized protein; n=1; ...    37   0.30 
UniRef50_Q2FLC7 Cluster: Periplasmic copper-binding precursor; n...    37   0.30 
UniRef50_UPI00006CFA59 Cluster: Papain family cysteine protease ...    37   0.40 
UniRef50_Q84YE7 Cluster: Cysteine proteinase-like protein; n=1; ...    37   0.40 
UniRef50_A5K7H0 Cluster: Putative uncharacterized protein; n=1; ...    37   0.40 
UniRef50_A0DIY3 Cluster: Chromosome undetermined scaffold_52, wh...    37   0.40 
UniRef50_A0DCA5 Cluster: Chromosome undetermined scaffold_45, wh...    37   0.40 
UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Gi...    37   0.40 
UniRef50_UPI0000DB7B97 Cluster: PREDICTED: hypothetical protein,...    36   0.53 
UniRef50_O65214 Cluster: Cysteine protease; n=2; Volvox carteri ...    36   0.53 
UniRef50_Q1AMF2 Cluster: Cathepsin C2; n=1; Toxoplasma gondii|Re...    36   0.53 
UniRef50_A2E6N1 Cluster: Clan CA, family C1, cathepsin B-like cy...    36   0.53 
UniRef50_UPI0000ECA2BE Cluster: UPI0000ECA2BE related cluster; n...    36   0.70 
UniRef50_A6W7B6 Cluster: Methyl-accepting chemotaxis sensory tra...    36   0.70 
UniRef50_Q7JYA0 Cluster: RE20049p; n=2; Sophophora|Rep: RE20049p...    36   0.70 
UniRef50_Q4XZE6 Cluster: Preprocathepsin c, putative; n=6; Plasm...    36   0.70 
UniRef50_P91991 Cluster: Putative uncharacterized protein; n=2; ...    36   0.70 
UniRef50_A0CGA3 Cluster: Chromosome undetermined scaffold_179, w...    36   0.70 
UniRef50_Q7MTY9 Cluster: Cysteine peptidase, putative; n=8; Bact...    36   0.93 
UniRef50_A7NM03 Cluster: Putative uncharacterized protein precur...    36   0.93 
UniRef50_A7DL96 Cluster: Putative uncharacterized protein precur...    36   0.93 
UniRef50_Q8I3C0 Cluster: Papain family cysteine protease, putati...    36   0.93 
UniRef50_Q55CB6 Cluster: Putative uncharacterized protein; n=1; ...    36   0.93 
UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep: Ca...    36   0.93 
UniRef50_Q4PH90 Cluster: Putative uncharacterized protein; n=1; ...    36   0.93 
UniRef50_P21381 Cluster: Thaumatopain; n=10; Eukaryota|Rep: Thau...    36   0.93 
UniRef50_Q6M7X4 Cluster: Conserved secreted protein; n=3; Coryne...    35   1.2  
UniRef50_Q1CXI7 Cluster: Putative uncharacterized protein; n=1; ...    35   1.2  
UniRef50_A6LML6 Cluster: Peptidase C1A, papain precursor; n=1; T...    35   1.2  
UniRef50_A1ZZE0 Cluster: Aminopeptidase C; n=1; Microscilla mari...    35   1.2  
UniRef50_Q7QWN9 Cluster: GLP_26_50243_51811; n=2; Giardia lambli...    35   1.2  

>UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-PA
           - Drosophila melanogaster (Fruit fly)
          Length = 549

 Score =  283 bits (695), Expect = 2e-75
 Identities = 128/195 (65%), Positives = 154/195 (78%), Gaps = 1/195 (0%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
           K KH   Y SD EHE R NIFRQ+LRYIHS NRA   +T++VNHLAD+T++EL A RG +
Sbjct: 249 KRKHGVAYHSDTEHEHRKNIFRQNLRYIHSKNRAKLTYTLAVNHLADKTEEELKARRGYK 308

Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
            SG    G PFPY   + ++   ++P ++DWRL+GAVTPVKDQSVCGSCWSFGT+G +EG
Sbjct: 309 SSGIYNTGKPFPYDVPKYKD---EIPDQYDWRLYGAVTPVKDQSVCGSCWSFGTIGHLEG 365

Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYLGQ 538
           A FL NGG+LVRLSQQALIDCSW +GNNGCDGGEDFR Y+W ++  G+PTEE+YG YLGQ
Sbjct: 366 AFFLKNGGNLVRLSQQALIDCSWAYGNNGCDGGEDFRVYQWMLQSGGVPTEEEYGPYLGQ 425

Query: 539 DGYCHVDNVTAVTSI 583
           DGYCHV+NVT V  I
Sbjct: 426 DGYCHVNNVTLVAPI 440


>UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue of
           Sarcophaga 26,29kDa proteinase; n=1; Nasonia
           vitripennis|Rep: PREDICTED: similar to homologue of
           Sarcophaga 26,29kDa proteinase - Nasonia vitripennis
          Length = 553

 Score =  276 bits (676), Expect = 3e-73
 Identities = 127/196 (64%), Positives = 151/196 (77%), Gaps = 2/196 (1%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
           K  H + YA DLEH++R   FR +LR+IHS NRAN GFT+ VNHLADR + EL  LRG++
Sbjct: 252 KKTHNKNYAHDLEHKQRKEHFRHNLRFIHSINRANLGFTLDVNHLADRNEAELKVLRGKQ 311

Query: 182 YSGPSPHG-LPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
           Y+    +G +PFP+    VE+    +P   DWRL+GAVTPVKDQSVCGSCWSFGT GAVE
Sbjct: 312 YTQHGYNGGMPFPHD---VEKEKADVPDSFDWRLYGAVTPVKDQSVCGSCWSFGTTGAVE 368

Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLG 535
           GA F+     LVRLSQQALIDCSWGFGNNGCDGGEDFR+Y+WI +H GLPTEE+YGGYLG
Sbjct: 369 GAYFMKYK-KLVRLSQQALIDCSWGFGNNGCDGGEDFRSYQWIIKHGGLPTEEEYGGYLG 427

Query: 536 QDGYCHVDNVTAVTSI 583
           QDGYCH+ NVT +  +
Sbjct: 428 QDGYCHIKNVTQIAKL 443


>UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1;
           Rhipicephalus appendiculatus|Rep: Midgut cysteine
           proteinase 2 - Rhipicephalus appendiculatus (Brown ear
           tick)
          Length = 564

 Score =  243 bits (594), Expect = 3e-63
 Identities = 114/194 (58%), Positives = 139/194 (71%), Gaps = 1/194 (0%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
           K  H+R Y  D EH++R +IFRQ+LR+I S NRAN G+ ++VNHLADRT +E++ LRGR 
Sbjct: 265 KETHKRTYELDTEHDRRRDIFRQNLRFIDSKNRANLGYNLAVNHLADRTREEISVLRGRL 324

Query: 182 YSGP-SPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
            S   S    PFP  +      + KLP + DWR +GAVTPVKDQ+VCGSCWSFGTVG +E
Sbjct: 325 QSKDGSSRAEPFPRHR-----FTAKLPDQIDWRPYGAVTPVKDQAVCGSCWSFGTVGELE 379

Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQ 538
           GA F    G LVRLS+Q L+DCSW  GNNGCDGGEDFRAYE+I  HGL ++EDYG Y+GQ
Sbjct: 380 GAYF-RKTGRLVRLSEQQLVDCSWNNGNNGCDGGEDFRAYEYIADHGLASDEDYGAYIGQ 438

Query: 539 DGYCHVDNVTAVTS 580
           DG CH   V +  S
Sbjct: 439 DGVCHDSKVNSTIS 452


>UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326;
           n=2; Danio rerio|Rep: hypothetical protein LOC550326 -
           Danio rerio
          Length = 531

 Score =  225 bits (551), Expect = 5e-58
 Identities = 103/195 (52%), Positives = 132/195 (67%), Gaps = 1/195 (0%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
           K K  RQY S+ EHE+R N+F  + R++HSNNRA   +++ +NH AD+T +ELA + G  
Sbjct: 233 KEKFNRQYESEKEHEERENLFLHTFRFVHSNNRAGLTYSVGINHFADKTKEELARMTGGL 292

Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
                    PFP S+ R    S+  P   DWRL+GAVTPVKDQ+VCGSCWSF T G +EG
Sbjct: 293 LPKKEEKAQPFP-SEIR----SIATPNSVDWRLYGAVTPVKDQAVCGSCWSFATTGTLEG 347

Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLGQ 538
           ALFL   G L  LSQQ L+DC+WGFGNNGCDGGE++RA+EWI +H G+ T E YG Y+G 
Sbjct: 348 ALFLKT-GQLTSLSQQMLVDCTWGFGNNGCDGGEEWRAFEWIMKHGGISTAESYGAYMGM 406

Query: 539 DGYCHVDNVTAVTSI 583
           +G CH D  + V  +
Sbjct: 407 NGLCHYDKTSMVAQL 421


>UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome
           shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
           Chromosome 21 SCAF14577, whole genome shotgun sequence -
           Tetraodon nigroviridis (Green puffer)
          Length = 478

 Score =  221 bits (541), Expect = 7e-57
 Identities = 105/186 (56%), Positives = 127/186 (68%), Gaps = 1/186 (0%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
           K K QRQY  D EHE R   F  +LRY+HS NRA   +T+ +N L+DRT  ELA +RGR+
Sbjct: 125 KEKFQRQYEDDKEHELRQQAFIHNLRYVHSKNRAGLSYTLGLNSLSDRTMSELATMRGRK 184

Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
               +  GLPFP+   +     V++P   DWRL+GAVTPVKDQ++CGSCWSF T G +EG
Sbjct: 185 QRKTTNAGLPFPFKLYQ----HVEVPESLDWRLYGAVTPVKDQAICGSCWSFATTGTIEG 240

Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLGQ 538
           ALFL  G  L  LSQQ LIDCSWGFGNN CDGGE++RAYEWI +H G+ + E YG YLG 
Sbjct: 241 ALFLKTGS-LQVLSQQMLIDCSWGFGNNACDGGEEWRAYEWIMKHGGIASAETYGPYLGM 299

Query: 539 DGYCHV 556
            G   V
Sbjct: 300 TGSLQV 305



 Score = 97.1 bits (231), Expect = 3e-19
 Identities = 41/66 (62%), Positives = 51/66 (77%), Gaps = 1/66 (1%)
 Frame = +2

Query: 368 FLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLGQDG 544
           +L   G L  LSQQ LIDCSWGFGNN CDGGE++RAYEWI +H G+ + E YG YLG +G
Sbjct: 296 YLGMTGSLQVLSQQMLIDCSWGFGNNACDGGEEWRAYEWIMKHGGIASAETYGPYLGMNG 355

Query: 545 YCHVDN 562
           +CHV++
Sbjct: 356 FCHVNS 361


>UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin l;
           n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
           similar to cathepsin l - Strongylocentrotus purpuratus
          Length = 489

 Score =  193 bits (471), Expect = 2e-48
 Identities = 92/169 (54%), Positives = 118/169 (69%), Gaps = 2/169 (1%)
 Frame = +2

Query: 83  IHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPP 262
           IHS NRAN G+ + +NH+AD++  EL  +RGR       +GLP  Y  S V + +V   P
Sbjct: 214 IHSINRANLGYVLDINHMADQSHQELKRMRGRLRQTRPNNGLP--YDGSDVSDDAV---P 268

Query: 263 EH-DWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFG 439
           +H DW + GAV+PVKDQ+VCGSCWSFG+   +EGA+F+ +G   VRLSQQ L+DC+W  G
Sbjct: 269 DHIDWNVLGAVSPVKDQAVCGSCWSFGSAETIEGAVFMQSGKR-VRLSQQMLMDCTWAAG 327

Query: 440 NNGCDGGEDFRAYEWI-KRHGLPTEEDYGGYLGQDGYCHVDNVTAVTSI 583
           NNGCDGGE++R YEW+ K  G+P EE YG YLGQ+G CH D   AV SI
Sbjct: 328 NNGCDGGEEWRVYEWLMKNGGIPLEETYGPYLGQNGMCHYDKSKAVASI 376


>UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 513

 Score =  187 bits (456), Expect = 1e-46
 Identities = 88/190 (46%), Positives = 123/190 (64%), Gaps = 1/190 (0%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
           K  ++++Y S  EHEKR +I+R ++R+I S NR + G+++  NH+AD TD E+  ++G  
Sbjct: 214 KASYRKRYPSAHEHEKRKDIYRHNMRFIKSRNRQHLGYSLKPNHMADMTDAEVNRMKGLL 273

Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
           +  P   G   P+S    ++  V LPP  DWR  GAV  VK Q +CGSC++F   GA+EG
Sbjct: 274 HEEPPLIG-DSPFSIPD-KDRGVPLPPHVDWRKAGAVNSVKSQGICGSCYAFAVAGALEG 331

Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLGQ 538
           A F+  G  L  LS+Q ++DC+WGFGN GC GG  +RA +WI +H GL TEE YG YL Q
Sbjct: 332 AHFIKTGLKL-DLSEQQIVDCTWGFGNRGCKGGYPYRAMQWILKHGGLATEESYGRYLAQ 390

Query: 539 DGYCHVDNVT 568
           +GYCH  N +
Sbjct: 391 EGYCHFKNTS 400


>UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 514

 Score =  169 bits (410), Expect = 6e-41
 Identities = 85/192 (44%), Positives = 113/192 (58%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
           +H +QY S+ E  KR +IFR ++RYI S NR N  + ++ NH  D TD E       ++ 
Sbjct: 226 QHNKQYDSEHEVSKRKHIFRHNMRYIRSINRKNLKYKLAPNHFVDLTDGEYD-----QHK 280

Query: 188 GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGAL 367
           G S   L  PYS        V +P E DWR +GAV+PV+ Q +CGSC++   VGAVEGA 
Sbjct: 281 GDSIITLYGPYSNMSHVLQRVDVPDELDWRDYGAVSPVRGQGICGSCYALAAVGAVEGAY 340

Query: 368 FLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGY 547
           F+  G  L  LS Q +IDCSWG GN GC GG   +A  WI  HG+ + E YG YLGQ+G 
Sbjct: 341 FMKTG-KLKELSAQQVIDCSWGSGNRGCKGGYYNKAMSWIYLHGIASAESYGPYLGQEGT 399

Query: 548 CHVDNVTAVTSI 583
           C ++ +    +I
Sbjct: 400 CRIEGLRRAAAI 411


>UniRef50_UPI000155637A Cluster: PREDICTED: similar to
           ENSANGP00000013730, partial; n=1; Ornithorhynchus
           anatinus|Rep: PREDICTED: similar to ENSANGP00000013730,
           partial - Ornithorhynchus anatinus
          Length = 229

 Score =  165 bits (401), Expect = 7e-40
 Identities = 80/133 (60%), Positives = 94/133 (70%)
 Frame = +2

Query: 80  YIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLP 259
           +I S+NRANR F ++ NHL DRT  ELAALRGR  S    HG PFP+ +      +V LP
Sbjct: 1   FIDSHNRANRPFRLAPNHLTDRTPGELAALRGRLRSSRPNHGQPFPHEQLA----NVALP 56

Query: 260 PEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFG 439
              DWRL+GAVTPVKDQ+VCGSCWSF T G +EGALFL     LV LSQQ LIDCSW  G
Sbjct: 57  ESLDWRLYGAVTPVKDQAVCGSCWSFATTGTLEGALFLKVTVQLVPLSQQMLIDCSWDVG 116

Query: 440 NNGCDGGEDFRAY 478
           N GCDGG +++A+
Sbjct: 117 NFGCDGGLEWQAF 129


>UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 392

 Score =  162 bits (394), Expect = 5e-39
 Identities = 78/194 (40%), Positives = 113/194 (58%), Gaps = 7/194 (3%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR-- 181
           +H + Y  D EH +R +IFR ++RYI S NR +  + +  NH AD TDDE  + +G    
Sbjct: 94  QHDKVYEDDSEHRRRKHIFRHNVRYIRSMNRRSLPYKLEPNHFADLTDDEFKSYKGALDD 153

Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
            S    +         R + +  ++P + DWR +GAV P K Q  CGSCW+F T GAVE 
Sbjct: 154 ESKDVMNDHDDVIDDDRSKRM-FEVPDQLDWRNYGAVNPAKGQGTCGSCWAFATAGAVEA 212

Query: 362 ALFLHNGGHLVRLSQQALIDCSWG-----FGNNGCDGGEDFRAYEWIKRHGLPTEEDYGG 526
           A F+   G L+ L++Q L+DC+W       GNNGC GG  ++A+ W+K+ G+ T + YG 
Sbjct: 213 AHFIQK-GELLNLAEQQLLDCTWSTPGVYHGNNGCLGGWTWKAFSWVKKFGIATTKSYGH 271

Query: 527 YLGQDGYCHVDNVT 568
           Y GQ+G+C   N+T
Sbjct: 272 YRGQEGFCKTSNLT 285


>UniRef50_Q24E33 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 328

 Score =  134 bits (324), Expect = 1e-30
 Identities = 73/183 (39%), Positives = 105/183 (57%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
           K++H   + +  E   R NIF Q++RYI S N  N  F +++N +A  TD+E ++L    
Sbjct: 46  KLEHNIVFQNSEEDLYRQNIFFQNVRYIQSENAKNNTFKLAINIMAILTDEEYSSLY--- 102

Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
            +      +    S     E    +P E +W   GAVTPVK+Q  CGSCW+F T GA+EG
Sbjct: 103 LNLDQQESIDIFDSLVDDNETVGDIPSEVNWTAQGAVTPVKNQGSCGSCWAFSTTGALEG 162

Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQD 541
           + FL N   L+  S+Q L+DCS  + N GC+GG   RA+ ++K HG+ TEE+Y  Y  +D
Sbjct: 163 SYFLKN-NQLISFSEQQLVDCSRLYLNMGCNGGLMPRAFRYVKAHGITTEEEY-PYTAKD 220

Query: 542 GYC 550
           G C
Sbjct: 221 GKC 223


>UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep:
           Toxopain-2 - Toxoplasma gondii
          Length = 422

 Score =  128 bits (310), Expect = 7e-29
 Identities = 74/195 (37%), Positives = 104/195 (53%), Gaps = 4/195 (2%)
 Frame = +2

Query: 11  HQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSG 190
           + + YA++ E ++R  IF+ +L YIH++N+    +++ +NH  D + DE      R+Y G
Sbjct: 124 YAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGYSYSLKMNHFGDLSRDEFR----RKYLG 179

Query: 191 -PSPHGLPFPYSKSRVEELSV---KLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
                 L   +     E L+V   +LP   DWR  G VTPVKDQ  CGSCW+F T GA+E
Sbjct: 180 FKKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALE 239

Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQ 538
           GA      G LV LS+Q L+DCS   GN  C GGE   A++++   G    ED   YL +
Sbjct: 240 GA-HCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLAR 298

Query: 539 DGYCHVDNVTAVTSI 583
           D  C   +   V  I
Sbjct: 299 DEECRAQSCEKVVKI 313


>UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4;
           core eudicotyledons|Rep: Papain-like cysteine peptidase
           XBCP3 - Arabidopsis thaliana (Mouse-ear cress)
          Length = 437

 Score =  126 bits (305), Expect = 3e-28
 Identities = 78/189 (41%), Positives = 107/189 (56%), Gaps = 3/189 (1%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNR-ANRGFTMSVNHLADRTDDELAALR-GRR 181
           KH + Y S+ E ++R+ IF+ +  ++  +N   N  +++S+N  AD T  E  A R G  
Sbjct: 38  KHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKASRLGLS 97

Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
            S PS        SK +    SVK+P   DWR  GAVT VKDQ  CG+CWSF   GA+EG
Sbjct: 98  VSAPSV----IMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEG 153

Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYLGQ 538
              +   G L+ LS+Q LIDC   + N GC+GG    A+E+ IK HG+ TE+DY  Y  +
Sbjct: 154 INQIVT-GDLISLSEQELIDCDKSY-NAGCNGGLMDYAFEFVIKNHGIDTEKDY-PYQER 210

Query: 539 DGYCHVDNV 565
           DG C  D +
Sbjct: 211 DGTCKKDKL 219


>UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine
           proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like
           midgut cysteine proteinase - Tenebrio molitor (Yellow
           mealworm)
          Length = 330

 Score =  126 bits (304), Expect = 4e-28
 Identities = 76/200 (38%), Positives = 111/200 (55%), Gaps = 6/200 (3%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRG---FTMSVNHLADRTDDELAAL 169
           K+ H++ Y+S +E  +R  IF+ ++  I  +N +  +G   ++ ++N   D + +E  A 
Sbjct: 32  KLTHKKSYSSPIEEIRRQLIFKDNVAKIAEHNAKFEKGEVTYSKAMNQFGDMSKEEFLAY 91

Query: 170 --RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGT 343
             RG+      P  L  PY  S+ + L+  +    DWR   AV+ VKDQ  CGSCWSF T
Sbjct: 92  VNRGKAQKPKHPENLRMPYVSSK-KPLAASV----DWRS-NAVSEVKDQGQCGSCWSFST 145

Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYG 523
            GAVEG L L   G L  LS+Q LIDCS  +GN GCDGG    A+ +I  +G+ +E  Y 
Sbjct: 146 TGAVEGQLALQR-GRLTSLSEQNLIDCSSSYGNAGCDGGWMDSAFSYIHDYGIMSESAY- 203

Query: 524 GYLGQDGYCHVDNVTAVTSI 583
            Y  Q  YC  D+  +VT++
Sbjct: 204 PYEAQGDYCRFDSSQSVTTL 223


>UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10;
           Dictyostelium discoideum|Rep: Cysteine proteinase 7
           precursor - Dictyostelium discoideum (Slime mold)
          Length = 460

 Score =  125 bits (302), Expect = 7e-28
 Identities = 75/184 (40%), Positives = 101/184 (54%), Gaps = 4/184 (2%)
 Frame = +2

Query: 5   VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRY 184
           + HQR Y+S+ E   R NIF+ ++ Y++  N       + +N  AD +++E  A     Y
Sbjct: 35  IAHQRHYSSE-EFNGRYNIFKANMDYVNEWNTKGSETVLGLNVFADISNEEYRAT----Y 89

Query: 185 SGPSPHGLPFPYSKSRVEELS--VKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
            G      PF  S   + E         + DWR  GAVTP+K+Q  CG CWSF T GA E
Sbjct: 90  LGT-----PFDASSLEMTESDKIFDASAQVDWRTQGAVTPIKNQGQCGGCWSFSTTGATE 144

Query: 359 GALFLHNG-GHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYL 532
           GA +L NG  +LV LS+Q LIDCS  +GNNGC+GG    A+E+ I   G+ TE  Y  Y 
Sbjct: 145 GAQYLANGKKNLVSLSEQNLIDCSGSYGNNGCEGGLMTLAFEYIINNKGIDTESSY-PYT 203

Query: 533 GQDG 544
            +DG
Sbjct: 204 AEDG 207


>UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC06231 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 372

 Score =  125 bits (301), Expect = 9e-28
 Identities = 72/186 (38%), Positives = 103/186 (55%), Gaps = 5/186 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRG----FTMSVNHLADRTDDELAAL 169
           K+  +R Y + +E  KR  IF  +   +  +NRA +     + M VN+  D+T+ EL  L
Sbjct: 66  KINFKRAYGNVMEETKRFLIFGTNFIKMMEHNRAYQEGKATYKMGVNNFTDKTEYELRKL 125

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
           RG R    S   +  P   + +     KLP   DWR  GAVTPVK+Q  CGSCW+F + G
Sbjct: 126 RGYR----SACRIAKPKGSTFISSEHAKLPDRVDWRRNGAVTPVKNQGQCGSCWAFSSTG 181

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIK-RHGLPTEEDYGG 526
           A+EG  +      LV LS+Q LIDCS  +GNNGC+GG    A+++++   G+ +E  Y  
Sbjct: 182 AIEGQHY-RKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDLAFQYVRDNKGIDSEISY-P 239

Query: 527 YLGQDG 544
           Y+  DG
Sbjct: 240 YISGDG 245


>UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24;
           Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa
           (Rice)
          Length = 339

 Score =  124 bits (299), Expect = 2e-27
 Identities = 74/192 (38%), Positives = 105/192 (54%), Gaps = 2/192 (1%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
           ++ R Y    E  +R  IF+ ++ +I S N  N  F +SVN  AD T+ E  A +  +  
Sbjct: 43  QYGRVYKDATEKARRFEIFKANVAFIESFNAGNHKFWLSVNQFADLTNYEFRATKTNKGF 102

Query: 188 GPSPHGLPFPYSKSRVEELSVK-LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
            PS   +P  +   R E +S+  LP   DWR  GAVTP+KDQ  CG CW+F  V A+EG 
Sbjct: 103 IPSTVRVPTTF---RYENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGI 159

Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYLGQD 541
           + L + G L+ LS+Q L+DC     + GC+GG    A+++ IK  GL TE  Y  Y   D
Sbjct: 160 VKL-STGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKY-PYTAAD 217

Query: 542 GYCHVDNVTAVT 577
           G C+  + +A T
Sbjct: 218 GKCNGGSNSAAT 229


>UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           L-like proteinase precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 317

 Score =  124 bits (299), Expect = 2e-27
 Identities = 74/198 (37%), Positives = 102/198 (51%), Gaps = 4/198 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRG---FTMSVNHLADRTDDELAAL 169
           KV H ++Y    E + R  +F Q+L+ I  +N R   G   F + VN  AD T +E  A+
Sbjct: 20  KVNHSKKYGHLKEEQVRFQVFSQNLQKIEQHNARYQNGEVSFYLGVNQFADMTSEEFKAM 79

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
              +        +   +    V +  + +P   DWR  GAV PV+DQ  CGSCW+F   G
Sbjct: 80  LDSQLIHKPKRDITSRF----VADPQLTVPESIDWREKGAVNPVRDQEQCGSCWAFSAAG 135

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGY 529
           A+EG  FL   G L  LS Q L+DCS  + N GC+GG    AY++IK +GL  E  Y  Y
Sbjct: 136 ALEGQRFLKE-GKLEVLSTQQLVDCSRDYKNEGCNGGWPHWAYDYIKDNGLCLESKY-KY 193

Query: 530 LGQDGYCHVDNVTAVTSI 583
            G DGY   + + A+  I
Sbjct: 194 QGYDGYYCKECIPAIKKI 211


>UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1;
           Dictyostelium discoideum AX4|Rep: Counting factor
           associated protein - Dictyostelium discoideum AX4
          Length = 531

 Score =  124 bits (299), Expect = 2e-27
 Identities = 68/190 (35%), Positives = 106/190 (55%), Gaps = 1/190 (0%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
           K ++ ++Y+S  EH++R   F+ + + I ++N     + + +NH AD ++ E   L   +
Sbjct: 229 KAQYNKEYSSQDEHDERFINFKAARKIIATHNAKESSYKLGMNHYADLSNKEFNTLVKPK 288

Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
            + PS  G    +     +E    +P   DWR    VTPVKDQ +CGSCW+FG+ G++EG
Sbjct: 289 VARPSVTGADSVHD----DESLRSIPSTVDWRNQNCVTPVKDQGICGSCWTFGSTGSLEG 344

Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHG-LPTEEDYGGYLGQ 538
              + N G LV LS+Q L+DC+   G+ GC GG    A++++   G L TE +Y  YL Q
Sbjct: 345 TNCVTN-GELVSLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGSLATESNY-PYLMQ 402

Query: 539 DGYCHVDNVT 568
           +G C    VT
Sbjct: 403 NGLCRDRTVT 412


>UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18;
           Schistosoma|Rep: Preprocathepsin cathepsin L -
           Schistosoma japonicum (Blood fluke)
          Length = 331

 Score =  124 bits (298), Expect = 2e-27
 Identities = 67/198 (33%), Positives = 106/198 (53%), Gaps = 4/198 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRAN----RGFTMSVNHLADRTDDELAAL 169
           K+K+ + Y S+ +  +R  IF + +  I  +N  +     G+TM +N   D   +E+  +
Sbjct: 31  KLKYNKTYTSNDDEMRRKMIFMRRIGKIQEHNLRHDLGLEGYTMGLNQFCDMEWEEVNRI 90

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
              +  G SP    +    + +E  +  +P   DWR  GAVT VK Q +CGSCW+F   G
Sbjct: 91  MFPKVFGNSPL---WNDDGNELELTNKPVPSTWDWRDHGAVTAVKHQGLCGSCWAFSATG 147

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGY 529
           A+EG L       LV+LS+Q L+DC + +GN+GC+GG    A+ ++++H + +E DY  Y
Sbjct: 148 AIEGQL-RRKHKKLVKLSEQQLVDCRYNYGNDGCEGGTMDLAFNYLEKHYIESENDY-KY 205

Query: 530 LGQDGYCHVDNVTAVTSI 583
           LG D  CH      V  +
Sbjct: 206 LGHDANCHYRKSKGVVKV 223


>UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine
           protease; n=11; Callosobruchus maculatus|Rep: Putative
           gut cathepsin L-like cysteine protease - Callosobruchus
           maculatus (Southern cowpea weevil) (Pulse bruchid)
          Length = 326

 Score =  120 bits (290), Expect = 2e-26
 Identities = 71/189 (37%), Positives = 105/189 (55%), Gaps = 6/189 (3%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDE-LAA 166
           K+ H + Y S +E ++R ++F+++L  I  +N    R    F   V   AD T +E L  
Sbjct: 27  KLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEEFLDL 86

Query: 167 LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
           L+ +       + + F       E++ ++     DWR  GAVTPVKDQ+ CGSCW+F  V
Sbjct: 87  LKLQGVPALPSNAVHF----DNFEDIDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAV 142

Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSW-GFGNNGCDGGEDFRAYEWIKRHGLPTEEDYG 523
           GA+EG  F  N G LV LS Q L+DC+   +GNNGC GG   +A+++++  G+ TEE Y 
Sbjct: 143 GAIEGQFFKKN-GTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQDEGIQTEESY- 200

Query: 524 GYLGQDGYC 550
            Y G+   C
Sbjct: 201 PYEGRRSSC 209


>UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precursor;
           n=3; Metazoa|Rep: Digestive cysteine proteinase 2
           precursor - Homarus americanus (American lobster)
          Length = 323

 Score =  120 bits (289), Expect = 3e-26
 Identities = 76/199 (38%), Positives = 107/199 (53%), Gaps = 7/199 (3%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHS-NNRANRG---FTMSVNHLADRTDDEL-AA 166
           K K+ RQY    E   R  IF Q+ +YI   N +   G   F +++N   D T +E  A 
Sbjct: 24  KGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEFNAV 83

Query: 167 LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
           ++G      +P  + +P  ++  +   V      DWR  GAVTPVKDQ  CGSCW+F T 
Sbjct: 84  MKGNIPRRSAPVSVFYPKKETGPQATEV------DWRTKGAVTPVKDQGQCGSCWAFSTT 137

Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIK-RHGLPTEEDYG 523
           G++EG  FL  G  L+ L++Q L+DCS  +G  GC+GG    A+++IK  +G+ TE  Y 
Sbjct: 138 GSLEGQHFLKTGS-LISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAY- 195

Query: 524 GYLGQDGYCHVD-NVTAVT 577
            Y  +DG C  D N  A T
Sbjct: 196 PYEARDGSCRFDSNSVAAT 214


>UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin
           heavy chain; n=3; Amniota|Rep: PREDICTED: similar to
           ferritin heavy chain - Ornithorhynchus anatinus
          Length = 338

 Score =  119 bits (287), Expect = 4e-26
 Identities = 70/187 (37%), Positives = 103/187 (55%), Gaps = 7/187 (3%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDEL-AA 166
           KV H + Y+ + E   R   + +++R I  +N    +    + +++NH  D+T++EL   
Sbjct: 32  KVLHGKNYSVEAEEVFRRAAWEKNVRVIERHNEEMSQGKHSYRLAMNHFGDQTNEELHER 91

Query: 167 LRGRR--YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFG 340
           L G R    G    G      +S+    S + P E DWR  G VTPVK+Q +CGSCW+F 
Sbjct: 92  LNGFRPDLGGALRSGREQARFRSKT---SWEGPEEVDWRTKGYVTPVKNQGLCGSCWAFS 148

Query: 341 TVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
             GA+E AL     G +V LS+Q L+DCSW  GN GC GG+   A+E+++ +G    ED 
Sbjct: 149 ATGALE-ALVFKTTGKMVSLSEQNLVDCSWRQGNVGCRGGQYIGAFEYVRANGGIDAEDL 207

Query: 521 GGYLGQD 541
             YLG+D
Sbjct: 208 YPYLGRD 214


>UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor;
           n=176; Viridiplantae|Rep: Cysteine proteinase RD21a
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 462

 Score =  119 bits (286), Expect = 6e-26
 Identities = 70/186 (37%), Positives = 103/186 (55%), Gaps = 4/186 (2%)
 Frame = +2

Query: 5   VKHQRQYASD--LEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGR 178
           VKH +  + +  +E ++R  IF+ +LR++  +N  N  + + +   AD T+DE  +    
Sbjct: 55  VKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRS---- 110

Query: 179 RYSGPSPHGLPFPYSKSRVE-ELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
           +Y G          +  R E  +  +LP   DWR  GAV  VKDQ  CGSCW+F T+GAV
Sbjct: 111 KYLGAKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAV 170

Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYL 532
           EG   +   G L+ LS+Q L+DC   + N GC+GG    A+E+ IK  G+ T++DY  Y 
Sbjct: 171 EGINQIVT-GDLITLSEQELVDCDTSY-NEGCNGGLMDYAFEFIIKNGGIDTDKDY-PYK 227

Query: 533 GQDGYC 550
           G DG C
Sbjct: 228 GVDGTC 233


>UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42;
           Coelomata|Rep: Cathepsin L2 precursor - Homo sapiens
           (Human)
          Length = 334

 Score =  119 bits (286), Expect = 6e-26
 Identities = 71/188 (37%), Positives = 104/188 (55%), Gaps = 5/188 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDELAAL 169
           K  H+R Y ++ E  +R  ++ ++++ I  +N    +   GFTM++N   D T++E   +
Sbjct: 33  KATHRRLYGANEEGWRRA-VWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQM 91

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
            G   +     G  F       E L + LP   DWR  G VTPVK+Q  CGSCW+F   G
Sbjct: 92  MGCFRNQKFRKGKVFR------EPLFLDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATG 145

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGG 526
           A+EG +F    G LV LS+Q L+DCS   GN GC+GG   RA++++K + GL +EE Y  
Sbjct: 146 ALEGQMF-RKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESY-P 203

Query: 527 YLGQDGYC 550
           Y+  D  C
Sbjct: 204 YVAVDEIC 211


>UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep:
           Actinidin Act3a - Actinidia eriantha
          Length = 380

 Score =  118 bits (285), Expect = 8e-26
 Identities = 72/184 (39%), Positives = 101/184 (54%), Gaps = 2/184 (1%)
 Frame = +2

Query: 5   VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNR-ANRGFTMSVNHLADRTDDELAALRGRR 181
           VK+ + Y S  E E R+ IF+++LR+I  +N   NR +T+ +N  AD TD+E  +     
Sbjct: 47  VKYGKSYNSLGEREMRIEIFKENLRFIDEHNADPNRSYTVGLNQFADLTDEEYRST---- 102

Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
           Y G     L    S   + ++   LP   DWR  GAV  VK+Q +C SCW+F T+  VE 
Sbjct: 103 YLG-FKSSLKSKVSNRYMPQVGEVLPDYVDWRTTGAVVDVKNQGLCSSCWAFATIATVES 161

Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYLGQ 538
              +   G L+ LS+Q L+DC+    N GC GG    AYE+ I   G+ TEE+Y  Y+GQ
Sbjct: 162 INQIIT-GDLISLSEQELVDCNRTPINEGCKGGFMDDAYEFIINNGGINTEENY-PYIGQ 219

Query: 539 DGYC 550
           D  C
Sbjct: 220 DDQC 223


>UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba
           culbertsoni|Rep: Cysteine proteinase - Acanthamoeba
           culbertsoni
          Length = 482

 Score =  118 bits (285), Expect = 8e-26
 Identities = 71/188 (37%), Positives = 100/188 (53%), Gaps = 7/188 (3%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
           +H R Y++D E  +R N +R+++ +I   NR N  FT+++N   D T +E A L   + S
Sbjct: 70  RHARSYSND-EFLERYNTWRENMDFIEEFNRGNHTFTVAMNEHGDLTPEEFARLYMGQVS 128

Query: 188 GPSPHGLPFPYS-KSRVEE----LSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGA 352
             S   L    + +S +E+        +P   DWR  GAVTPVK+Q  C SCW+F   GA
Sbjct: 129 PASEQELQERIAAESAMEDEHHHTRASIPANWDWRTKGAVTPVKNQGSCASCWAFVATGA 188

Query: 353 VEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHG--LPTEEDYGG 526
           VEG   +  GG LV LS Q L+DC+ G GN GC GG     Y W+  +   L T+  Y  
Sbjct: 189 VEGVRKI-AGGSLVSLSDQMLLDCAVGTGNQGCSGGNVEITYRWMISNNARLMTQASY-P 246

Query: 527 YLGQDGYC 550
           Y+ +   C
Sbjct: 247 YIARQSTC 254


>UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Rep:
           Cathepsin L - Stylonychia lemnae
          Length = 340

 Score =  118 bits (284), Expect = 1e-25
 Identities = 71/184 (38%), Positives = 103/184 (55%), Gaps = 3/184 (1%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRG--FTMSVNHLADRTDDELAALRGRR 181
           +  + Y S  E E RL  ++ ++ +I+++N  N G  FT+  NHLAD T DE   + G  
Sbjct: 48  RFSKAYKSKEEFEMRLQQYKSNIAFINNHNSQNDGTSFTLGPNHLADYTHDEYKKMLG-- 105

Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
           Y   +  G    YS   ++++    P   DWR  GAV  VKDQ  CGSCW+F T+ ++E 
Sbjct: 106 YKPRNKTGKEV-YSTPNLKDI----PESIDWREKGAVNAVKDQGQCGSCWAFSTIASLES 160

Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI-KRHGLPTEEDYGGYLGQ 538
             F+   G L  LS+Q L+DCS   GN GC+GG+   A ++I    G+ TE+DY  Y+G+
Sbjct: 161 RYFIET-GKLQSLSEQQLVDCSKN-GNEGCNGGDMGLAMDYIASAGGVETEKDY-PYVGK 217

Query: 539 DGYC 550
           D  C
Sbjct: 218 DQTC 221


>UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=15;
           Magnoliophyta|Rep: Cysteine proteinase RD19a precursor -
           Arabidopsis thaliana (Mouse-ear cress)
          Length = 368

 Score =  118 bits (283), Expect = 1e-25
 Identities = 78/203 (38%), Positives = 107/203 (52%), Gaps = 9/203 (4%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
           K K  + YAS+ EH+ R ++F+ +LR    + + +   T  V   +D T  E    R + 
Sbjct: 55  KRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEF---RKKH 111

Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
               S   LP   +K+ +      LP + DWR  GAVTPVK+Q  CGSCWSF   GA+EG
Sbjct: 112 LGVRSGFKLPKDANKAPILPTE-NLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEG 170

Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFG-------NNGCDGGEDFRAYEW-IKRHGLPTEED 517
           A FL   G LV LS+Q L+DC            ++GC+GG    A+E+ +K  GL  EED
Sbjct: 171 ANFLAT-GKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEED 229

Query: 518 YGGYLGQDG-YCHVDNVTAVTSI 583
           Y  Y G+DG  C +D    V S+
Sbjct: 230 Y-PYTGKDGKTCKLDKSKIVASV 251


>UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16)
           [Contains: Cathepsin H mini chain; Cathepsin H heavy
           chain; Cathepsin H light chain]; n=37; Eukaryota|Rep:
           Cathepsin H precursor (EC 3.4.22.16) [Contains:
           Cathepsin H mini chain; Cathepsin H heavy chain;
           Cathepsin H light chain] - Homo sapiens (Human)
          Length = 335

 Score =  118 bits (283), Expect = 1e-25
 Identities = 66/183 (36%), Positives = 103/183 (56%), Gaps = 2/183 (1%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
           KH++ Y+++ E+  RL  F  + R I+++N  N  F M++N  +D +  E+      +Y 
Sbjct: 41  KHRKTYSTE-EYHHRLQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIK----HKYL 95

Query: 188 GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGA-VTPVKDQSVCGSCWSFGTVGAVEGA 364
              P       +KS     +   PP  DWR  G  V+PVK+Q  CGSCW+F T GA+E A
Sbjct: 96  WSEPQNCSA--TKSNYLRGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESA 153

Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI-KRHGLPTEEDYGGYLGQD 541
           + +  G  ++ L++Q L+DC+  F N+GC GG   +A+E+I    G+  E+ Y  Y G+D
Sbjct: 154 IAIATG-KMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTY-PYQGKD 211

Query: 542 GYC 550
           GYC
Sbjct: 212 GYC 214


>UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor;
           n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase
           precursor - Diabrotica virgifera virgifera (western corn
           rootworm)
          Length = 326

 Score =  117 bits (282), Expect = 2e-25
 Identities = 79/199 (39%), Positives = 108/199 (54%), Gaps = 5/199 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRG---FTMSVNHLADRTDDELAAL 169
           KV++ + Y + +E +KR  IF+ SLR I ++N + + G   F + V   AD T+ E + +
Sbjct: 27  KVRNNKSYRNYIEEQKRFTIFQGSLRKIENHNDKYDHGLSTFKLGVTKFADLTEKEFSDM 86

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
            G   S  S       +S + V++L    P + DWR  GAVT VKDQ  CGSCWSF T G
Sbjct: 87  LGISRSTKSSRPRVI-HSLTPVKDL----PSKFDWREKGAVTEVKDQGSCGSCWSFSTTG 141

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIK-RHGLPTEEDYGG 526
            VEGA FL   G LV LS+Q L+DC+      GC GG   +A E+I+   G+ +E DY  
Sbjct: 142 TVEGAYFLKT-GKLVSLSEQNLVDCA-KEDCYGCSGGYMDKALEYIETAGGIMSENDY-P 198

Query: 527 YLGQDGYCHVDNVTAVTSI 583
           Y G D  C  D+      I
Sbjct: 199 YEGIDDKCRFDSSKVAAKI 217


>UniRef50_Q239L8 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 323

 Score =  117 bits (281), Expect = 2e-25
 Identities = 71/183 (38%), Positives = 99/183 (54%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
           K K+ ++YA       R+ IF ++L+ + SN + N G T  ++   +        L+ + 
Sbjct: 52  KTKYNKKYADPDFERYRIEIFTENLKVVESNTK-NYGITQFMDITREEFKQTYLTLKMKN 110

Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
               SP         ++  +  V++    DW   GAVTPVKDQ  CGSCWSF T GAVEG
Sbjct: 111 GLKASPF--------AKFNDAGVEI----DWTTKGAVTPVKDQGQCGSCWSFSTTGAVEG 158

Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQD 541
           ALFL +   L  LS+Q L+DCS   GN GC+GG    A+++I +HG+PTE  Y  Y   D
Sbjct: 159 ALFL-STKKLTSLSEQYLVDCSKD-GNEGCNGGLMDTAFDFISQHGIPTEAAY-PYKAVD 215

Query: 542 GYC 550
           G C
Sbjct: 216 GTC 218


>UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar to
           human SRY (sex determining region Y)-box 30
           (SOX30),transcript variant 1,; n=3; Mammalia|Rep: Testis
           cDNA clone: QtsA-12228, similar to human SRY (sex
           determining region Y)-box 30 (SOX30),transcript variant
           1, - Macaca fascicularis (Crab eating macaque)
           (Cynomolgus monkey)
          Length = 433

 Score =  116 bits (280), Expect = 3e-25
 Identities = 70/188 (37%), Positives = 101/188 (53%), Gaps = 5/188 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDELAAL 169
           K  H+R Y +  E  +R  ++ ++++ I  +N    +   GF M++N   D T++E   +
Sbjct: 33  KATHRRLYGASEEGWRRA-VWEKNMKMIELHNGEYSQGKHGFAMAMNAFGDMTNEEFRQV 91

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
            G   +     G  F       E L + LP   DWR  G VTPVK+Q  CGSCW+F   G
Sbjct: 92  MGCFRNQKLRKGKLFR------EPLFLDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATG 145

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGG 526
           A+EG +F    G LV LS+Q L+DCS   GN GC+GG    A+ ++K + GL +EE Y  
Sbjct: 146 ALEGQMF-RKTGKLVSLSEQNLVDCSHPQGNQGCNGGFMNSAFRYVKENGGLDSEESY-P 203

Query: 527 YLGQDGYC 550
           Y+  DG C
Sbjct: 204 YVAMDGIC 211


>UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;
           Brugia malayi|Rep: Cahepsin L-like cysteine protease -
           Brugia malayi (Filarial nematode worm)
          Length = 371

 Score =  116 bits (280), Expect = 3e-25
 Identities = 72/196 (36%), Positives = 104/196 (53%), Gaps = 6/196 (3%)
 Frame = +2

Query: 11  HQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDELAALRGR 178
           ++R    +LEH +R   + ++++ I  +N    R    + +++NHLAD   +E   L G 
Sbjct: 62  NKRDEEINLEH-RRFMTYLKNVKEIEKHNERYERNEETYELAINHLADMLPEEFRKLHGF 120

Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
           +    +       +  +   +++  LP   DWR  GAVT VKDQ  CGSCW+F  VGA+E
Sbjct: 121 QSRKITSKN---NFKNTIRMKINGPLPKSIDWRTSGAVTKVKDQGYCGSCWTFSAVGALE 177

Query: 359 GALFLHNGGHLVRLSQQALIDCSWG-FGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYL 532
           G  FL   G LV LS Q L+DCS   +GN GCDGG    A+E+ +K  G+ TE+ Y  Y 
Sbjct: 178 GQHFLQT-GKLVELSMQNLLDCSDDTYGNYGCDGGLMMEAFEYVVKNDGIDTEKSY-PYQ 235

Query: 533 GQDGYCHVDNVTAVTS 580
           G    C   N T  T+
Sbjct: 236 GYQNTCRYSNSTRGTT 251


>UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor;
           n=6; Chrysomelinae|Rep: Cathepsin L-like proteinase
           precursor - Phaedon cochleariae (Mustard beetle)
          Length = 324

 Score =  116 bits (280), Expect = 3e-25
 Identities = 69/193 (35%), Positives = 102/193 (52%), Gaps = 6/193 (3%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDELA-A 166
           K  H R Y S  E + R NIF+ +LR I  +N         + +++N  +D TD+E    
Sbjct: 27  KKTHARTYKSLREEKLRFNIFQDTLRQIAEHNVKYENGESTYYLAINKFSDITDEEFRDM 86

Query: 167 LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSFGT 343
           L     S P+  GL        V +L+V   PE  DWR  G V PV++Q  CGSCW+  T
Sbjct: 87  LMKNEASRPNLEGL-------EVADLTVGAAPESIDWRSKGVVLPVRNQGECGSCWALST 139

Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYG 523
             A+E    + +G   V LS Q L+DCS  +GN+GC+GG     +E++K +GL ++ DY 
Sbjct: 140 AAAIESQSAIKSGSK-VPLSPQQLVDCSTSYGNHGCNGGFAVNGFEYVKDNGLESDADY- 197

Query: 524 GYLGQDGYCHVDN 562
            Y G++  C  ++
Sbjct: 198 PYSGKEDKCKAND 210


>UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Rep:
           Cysteine protease - Solanum lycopersicum (Tomato)
           (Lycopersicon esculentum)
          Length = 345

 Score =  116 bits (279), Expect = 4e-25
 Identities = 73/199 (36%), Positives = 109/199 (54%), Gaps = 7/199 (3%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRA-NRGFTMSVNHLADRTDDE-LAALRGRR 181
           +H R Y  ++E  +R  IF++++++I S N+A N  + + +N  AD T  E LA   G  
Sbjct: 45  RHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLN 104

Query: 182 YSGPSPHGLPFPYSKS---RVEELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSFGTVG 349
              P+ +  P P S +   ++ +LS    P + DWR  GAVT VK Q  CG CW+F  VG
Sbjct: 105 I--PNSYLSPSPMSSTEFKKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 162

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGG 526
           ++EGA  +   G+L+  S+Q L+DC+    N GC+GG    A+++ I+  G+  E DY  
Sbjct: 163 SLEGAYKIAT-GNLMEFSEQELLDCT--TNNYGCNGGFMTNAFDFIIENGGISRESDY-E 218

Query: 527 YLGQDGYCHVDNVTAVTSI 583
           YLGQ   C     TA   I
Sbjct: 219 YLGQQYTCRSQEKTAAVQI 237


>UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6;
           Magnoliophyta|Rep: Cysteine protease Cp5 - Actinidia
           deliciosa (Kiwi)
          Length = 509

 Score =  116 bits (279), Expect = 4e-25
 Identities = 68/191 (35%), Positives = 100/191 (52%), Gaps = 9/191 (4%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNR---ANRGFTMSVNHLADRTDDELAALRGR 178
           KH + Y    E EK+   FR +LRY+   N    A+ G  + +N  AD +++E   +   
Sbjct: 57  KHGKVYKHGQEVEKKFQNFRDNLRYVMEKNGERGASGGHLVGLNKFADMSNEEFREVYVS 116

Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKL-----PPEHDWRLFGAVTPVKDQSVCGSCWSFGT 343
           +   P+   +     +      +  +     P   DWR +G VT VKDQ  CGSCW+F +
Sbjct: 117 KVKKPTSKRMAIERRRQGKAAAAKAVAACDGPTSLDWRKYGIVTGVKDQGDCGSCWAFSS 176

Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDY 520
            GA+EG   L N G L+ LS+Q L+DC     N+GC+GG    A+EW+  + G+ TE DY
Sbjct: 177 TGAIEGINALAN-GDLISLSEQELVDCD--STNDGCEGGYMDYAFEWVMSNGGIDTETDY 233

Query: 521 GGYLGQDGYCH 553
             Y G+DG C+
Sbjct: 234 -PYTGEDGTCN 243


>UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15)
           [Contains: Cathepsin L heavy chain; Cathepsin L light
           chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC
           3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain] - Sarcophaga peregrina (Flesh fly)
           (Boettcherisca peregrina)
          Length = 339

 Score =  116 bits (279), Expect = 4e-25
 Identities = 69/196 (35%), Positives = 103/196 (52%), Gaps = 7/196 (3%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNR----ANRGFTMSVNHLADRTDDELA-A 166
           K++H++ YA+++E   R+ IF ++   I  +N+        + + +N  AD    E    
Sbjct: 32  KLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADMLHHEFKET 91

Query: 167 LRGRRYS-GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGT 343
           + G  ++              + +    V +P   DWR  GAVT VKDQ  CGSCW+F +
Sbjct: 92  MNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHCGSCWAFSS 151

Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDY 520
            GA+EG  F    G LV LS+Q L+DCS  +GNNGC+GG    A+ +IK + G+ TE+ Y
Sbjct: 152 TGALEGQHF-RKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSY 210

Query: 521 GGYLGQDGYCHVDNVT 568
             Y G D  CH +  T
Sbjct: 211 -PYEGIDDSCHFNKAT 225


>UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;
           n=17; Magnoliophyta|Rep: Thiol protease aleurain-like
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 358

 Score =  116 bits (278), Expect = 5e-25
 Identities = 69/183 (37%), Positives = 103/183 (56%), Gaps = 2/183 (1%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
           ++ ++Y S  E + R ++F+++L  I S N+    + +S+N  AD T  E      +RY 
Sbjct: 65  RYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEF-----QRYK 119

Query: 188 -GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
            G + +         ++ E +V  P   DWR  G V+PVK+Q  CGSCW+F T GA+E A
Sbjct: 120 LGAAQNCSATLKGSHKITEATV--PDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAA 177

Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLGQD 541
            +    G  + LS+Q L+DC+  F N GC GG   +A+E+IK + GL TEE Y  Y G+D
Sbjct: 178 -YHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAY-PYTGKD 235

Query: 542 GYC 550
           G C
Sbjct: 236 GGC 238


>UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep:
           Cysteine protease - Saprolegnia parasitica
          Length = 523

 Score =  115 bits (277), Expect = 7e-25
 Identities = 70/185 (37%), Positives = 98/185 (52%), Gaps = 2/185 (1%)
 Frame = +2

Query: 35  LEHEKRLNIFRQSLRYIHSNNR-ANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLP 211
           LE   R  +F  + + I ++N+ A+  FTM  N  +  T DE   LR      PS     
Sbjct: 42  LEWVHRFEVFILNDQRIEAHNKDASSSFTMGHNEYSHLTFDEFKKLRTGLRVSPSYIQSR 101

Query: 212 FPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHL 391
             Y+          +P E DW   G VTPVK+Q +CGSCW+F T GA+EGA F+ +   L
Sbjct: 102 AKYALMAPAVNMTDVPNEMDWVEQGGVTPVKNQGMCGSCWAFSTTGAIEGAAFV-SSKQL 160

Query: 392 VRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLGQDGYCHVDNVT 568
           V +S+Q L+DC    G+ GC+GG    A++W+K H GL  EEDY  Y  ++G C +    
Sbjct: 161 VSVSEQELVDCDHN-GDMGCNGGLMDNAFKWVKTHKGLCKEEDY-PYHAKEGTCALKKCK 218

Query: 569 AVTSI 583
            VT +
Sbjct: 219 PVTKV 223


>UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep:
           Cathepsin - Geodia cydonium (Sponge)
          Length = 322

 Score =  115 bits (276), Expect = 9e-25
 Identities = 68/186 (36%), Positives = 101/186 (54%), Gaps = 2/186 (1%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAA-LRGR 178
           K+K+ +QY+S  E   R  ++  +L+++   +    G+T+++N  AD    E  +   G 
Sbjct: 23  KLKYNKQYSSQEEDYLRQRVWLSNLKFVEEFDSEREGYTVAMNEFADLDPREFVSHYNGL 82

Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
           R    +  G P        E++S  LP   DWR  G VT VK+Q  CGSCW+F   G++E
Sbjct: 83  RRRPHTSSGEPCTLG----EDVSA-LPTTVDWRTKGYVTGVKNQGQCGSCWAFSATGSLE 137

Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYLG 535
           G  F +  G LV LS+Q L+DCS   GN GC+GG    A+++ IK  G+ TE  Y  Y+ 
Sbjct: 138 GQHF-NATGKLVSLSEQNLVDCSSAEGNEGCNGGLPDDAFKYVIKNGGIDTEASY-PYVA 195

Query: 536 QDGYCH 553
           +D  CH
Sbjct: 196 RDEKCH 201


>UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35;
           Viridiplantae|Rep: Cysteine proteinase 15A precursor -
           Pisum sativum (Garden pea)
          Length = 363

 Score =  114 bits (275), Expect = 1e-24
 Identities = 76/203 (37%), Positives = 106/203 (52%), Gaps = 9/203 (4%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
           K K  + YA+  EH+ R  +F+ +L  I +    NR  T    H   +  D  A+   R+
Sbjct: 52  KSKFSKSYATKEEHDYRFGVFKSNL--IKAKLHQNRDPT--AEHGITKFSDLTASEFRRQ 107

Query: 182 YSGPSPHGLPFPYSKSRVEEL-SVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
           + G     L  P    +   L +  LP + DWR  GAVTPVKDQ  CGSCW+F T GA+E
Sbjct: 108 FLGLKKR-LRLPAHAQKAPILPTTNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALE 166

Query: 359 GALFLHNGGHLVRLSQQALIDCSW-------GFGNNGCDGGEDFRAYEW-IKRHGLPTEE 514
           GA +L   G LV LS+Q L+DC         G  ++GC+GG    A+E+ ++  G+  E+
Sbjct: 167 GAHYLAT-GKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLESGGVVQEK 225

Query: 515 DYGGYLGQDGYCHVDNVTAVTSI 583
           DY  Y G+DG C  D    V S+
Sbjct: 226 DY-AYTGRDGSCKFDKSKVVASV 247


>UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Oryza
           sativa|Rep: Cysteine protease 1 precursor - Oryza sativa
           subsp. japonica (Rice)
          Length = 490

 Score =  114 bits (275), Expect = 1e-24
 Identities = 71/178 (39%), Positives = 101/178 (56%), Gaps = 5/178 (2%)
 Frame = +2

Query: 38  EHEKRLNIFRQSLRYIHSNN-RANR--GFTMSVNHLADRTDDELAALRGRRYSGPSPHGL 208
           EHE+R  +F  +L+++ ++N RA+   GF + +N  AD T+ E  A     Y G +P G 
Sbjct: 84  EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRAT----YLGTTPAGR 139

Query: 209 PFPYSKSRVEELSVKLPPEHDWRLFGAVT-PVKDQSVCGSCWSFGTVGAVEGALFLHNGG 385
                ++   +    LP   DWR  GAV  PVK+Q  CGSCW+F  V AVEG   +  G 
Sbjct: 140 GRRVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTG- 198

Query: 386 HLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLGQDGYCHV 556
            LV LS+Q L++C+    N+GC+GG    A+ +I R+ GL TEEDY  Y   DG C++
Sbjct: 199 ELVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDY-PYTAMDGKCNL 255


>UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2;
           Platyhelminthes|Rep: Cathepsin L-like proteinase -
           Echinococcus multilocularis
          Length = 338

 Score =  114 bits (274), Expect = 2e-24
 Identities = 72/199 (36%), Positives = 102/199 (51%), Gaps = 5/199 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIH-SNNRANRG---FTMSVNHLADRTDDELAAL 169
           KV + + YA+  E   R+ IF  +  ++   N R   G   ++ ++N  AD T +E A  
Sbjct: 34  KVANNKTYATLREEHLRMRIFINNYLFVRWHNERYYLGLETYSTALNAFADLTLEEFAEK 93

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSFGTV 346
                  P   G+    S   VE  +  L P+  DWR  G VTP+KDQ  CGSCW+F   
Sbjct: 94  YLTLKQTPM-EGIWQDMSTQYVERPTRMLVPDSIDWRKKGLVTPIKDQGDCGSCWAFSAT 152

Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGG 526
           GA+EG L     G L+ LS+Q L+DCS   GN GC+GG+   A+ +  R+G  +E DY  
Sbjct: 153 GALEGQL-KRKTGKLISLSEQQLVDCSTYTGNEGCNGGDMNDAFRYWMRNGAESESDY-P 210

Query: 527 YLGQDGYCHVDNVTAVTSI 583
           Y   DG C  ++   VT +
Sbjct: 211 YTAMDGKCKFNSSKVVTKV 229


>UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes
           abbreviatus|Rep: Cathepsin L 2 - Diaprepes abbreviatus
           (Sugarcane rootstalk borer weevil)
          Length = 348

 Score =  114 bits (274), Expect = 2e-24
 Identities = 74/203 (36%), Positives = 106/203 (52%), Gaps = 20/203 (9%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANR----GFTMSVNHLADRTDDELAAL 169
           K++H + Y S+ E+E R ++F ++L  I+ +N+        + M++NHL D T DE   +
Sbjct: 32  KLEHGKVYESESENEYRQSVFMENLFQINEHNKLYEMGLSSYQMAMNHLGDLTKDEFMRI 91

Query: 170 ---------RGRRYSGPSPH-GLPFPYSKSRVEEL-----SVKLPPEHDWRLFGAVTPVK 304
                    +    S   P   LP          L      V LP + DWR  GAVTPVK
Sbjct: 92  YTVNMPQLPQSENLSDSEPWLDLPQDLQGFVTYALPTNLDEVDLPTDIDWRQKGAVTPVK 151

Query: 305 DQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW 484
           +Q  CGSCWSF   GA+E A +      L+ LS+Q L+DCS  +GN+GC GG    A+ +
Sbjct: 152 NQRNCGSCWSFSATGALE-AQWFKKTNKLISLSEQQLVDCSGRYGNHGCHGGWMHWAFGY 210

Query: 485 IKRH-GLPTEEDYGGYLGQDGYC 550
           IK + G+ TE+ Y  Y  +DG C
Sbjct: 211 IKENGGIDTEQSY-PYTAKDGRC 232


>UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase
           precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine
           proteinase precursor - Heterodera glycines (Soybean cyst
           nematode worm)
          Length = 353

 Score =  114 bits (274), Expect = 2e-24
 Identities = 68/182 (37%), Positives = 99/182 (54%), Gaps = 5/182 (2%)
 Frame = +2

Query: 38  EHEKRLNIFRQSLRYIHSNNRA-NRG---FTMSVNHLADRTDDELAALRGRRYSGPSPHG 205
           E  +R+N F ++ ++I ++N A  +G   F ++ NHL   T  +   +RG +        
Sbjct: 64  EKMERMNEFIKAKKFIDAHNLAFEKGEVSFKVAPNHLMHFTPAQYNRIRGLQMRSNRQR- 122

Query: 206 LPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGG 385
               ++ + +   S  LP + DWR  GAVT VKDQ  CGSCW+F   GA+EGAL      
Sbjct: 123 ----HNMATLAGNSSTLPEKLDWREKGAVTEVKDQGDCGSCWAFSATGAIEGALAQKKAS 178

Query: 386 HLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIK-RHGLPTEEDYGGYLGQDGYCHVDN 562
            ++ LS+Q L+DCS  +GN GCDGG    A+E+++  +GL TEE Y  Y    G C   N
Sbjct: 179 KIISLSEQNLVDCSSKYGNEGCDGGLMDSAFEYVRDNNGLDTEESY-PYEAVTGKCQFKN 237

Query: 563 VT 568
            T
Sbjct: 238 ET 239


>UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor;
           n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 355

 Score =  114 bits (274), Expect = 2e-24
 Identities = 70/182 (38%), Positives = 97/182 (53%), Gaps = 1/182 (0%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
           +H + Y S  E   R  +FR++L +I   N     + + +N  AD T +E    R    +
Sbjct: 57  EHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFKG-RYLGLA 115

Query: 188 GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGAL 367
            P       P +  R  +++  LP   DWR  GAV PVKDQ  CGSCW+F TV AVEG  
Sbjct: 116 KPQFSRKRQPSANFRYRDIT-DLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGIN 174

Query: 368 FLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYLGQDG 544
            +   G+L  LS+Q LIDC   F N+GC+GG    A+++ I   GL  E+DY  YL ++G
Sbjct: 175 QI-TTGNLSSLSEQELIDCDTTF-NSGCNGGLMDYAFQYIISTGGLHKEDDY-PYLMEEG 231

Query: 545 YC 550
            C
Sbjct: 232 IC 233


>UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18;
           Magnoliophyta|Rep: Thiol protease aleurain precursor -
           Arabidopsis thaliana (Mouse-ear cress)
          Length = 358

 Score =  114 bits (274), Expect = 2e-24
 Identities = 68/182 (37%), Positives = 100/182 (54%), Gaps = 1/182 (0%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
           ++ ++Y +  E + R +IF+++L  I S N+    + + VN  AD T  E      R   
Sbjct: 65  RYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQ----RTKL 120

Query: 188 GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGAL 367
           G + +         +V E +  LP   DWR  G V+PVKDQ  CGSCW+F T GA+E A 
Sbjct: 121 GAAQNCSATLKGSHKVTEAA--LPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAA- 177

Query: 368 FLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLGQDG 544
           +    G  + LS+Q L+DC+  F N GC+GG   +A+E+IK + GL TE+ Y  Y G+D 
Sbjct: 178 YHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAY-PYTGKDE 236

Query: 545 YC 550
            C
Sbjct: 237 TC 238


>UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor;
           n=35; Fasciola|Rep: Cathepsin L-like proteinase
           precursor - Fasciola hepatica (Liver fluke)
          Length = 326

 Score =  113 bits (273), Expect = 2e-24
 Identities = 68/201 (33%), Positives = 105/201 (52%), Gaps = 7/201 (3%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRG---FTMSVNHLADRTDDELAA- 166
           K  + ++Y +  + + R NI+ +++++I  +N R + G   +T+ +N   D T +E  A 
Sbjct: 25  KRMYNKEY-NGADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAK 83

Query: 167 --LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFG 340
                 R S    HG+P+  +   V       P + DWR  G VT VKDQ  CGSCW+F 
Sbjct: 84  YLTEMSRASDILSHGVPYEANNRAV-------PDKIDWRESGYVTEVKDQGNCGSCWAFS 136

Query: 341 TVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
           T G +EG  ++ N    +  S+Q L+DCS  +GNNGC GG    AY+++K+ GL TE  Y
Sbjct: 137 TTGTMEGQ-YMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQFGLETESSY 195

Query: 521 GGYLGQDGYCHVDNVTAVTSI 583
             Y   +G C  +    V  +
Sbjct: 196 -PYTAVEGQCRYNKQLGVAKV 215


>UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3;
           Curculionidae|Rep: Cysteine proteinase - Hypera postica
           (alfalfa weevil)
          Length = 324

 Score =  113 bits (272), Expect = 3e-24
 Identities = 69/198 (34%), Positives = 100/198 (50%), Gaps = 4/198 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDELAAL 169
           K++H + Y +  E  KR NIF  ++R I ++N    +    +   +N   D + +E   +
Sbjct: 30  KLEHGKTYLNQAEESKRFNIFTDNVRAIEAHNALYEQGKVSYKKGINKFTDMSQEEFKTM 89

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
                S   P      Y K+ VE     +P   DWR  G VT VKDQ  CGSCW+F   G
Sbjct: 90  LTLSASR-KPTLETTSYVKTGVE-----IPSSVDWRKEGRVTGVKDQGDCGSCWAFSITG 143

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGY 529
           + EGA +    G LV LS+Q LIDC     + GCDGG     ++++ + GL +EE Y  Y
Sbjct: 144 STEGA-YARKSGKLVSLSEQQLIDCCTD-TSAGCDGGSLDDNFKYVMKDGLQSEESY-TY 200

Query: 530 LGQDGYCHVDNVTAVTSI 583
            G+DG C  +  + VT +
Sbjct: 201 KGEDGACKYNVASVVTKV 218


>UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2;
           Acanthamoeba|Rep: Cysteine proteinase - Acanthamoeba
           healyi
          Length = 330

 Score =  113 bits (272), Expect = 3e-24
 Identities = 68/161 (42%), Positives = 89/161 (55%), Gaps = 7/161 (4%)
 Frame = +2

Query: 59  IFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSK---- 226
           I+R ++     +NR N+ + +++N   D T+ E   L           GL F YSK    
Sbjct: 52  IYRWNVWRDEEHNRQNKSYFLAMNQFGDLTNAEFNRLF---------KGLAFDYSKHAKI 102

Query: 227 --SRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRL 400
             +  E  +  +P E DWR  GAVT VK+Q  CGSCWSF T G+ EGA FL   G LV L
Sbjct: 103 HTAAPEAPATGIPSEFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKT-GRLVSL 161

Query: 401 SQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDY 520
           S+Q LIDCS  +GNNGC+GG    A+E+ I   G+ TE  Y
Sbjct: 162 SEQNLIDCSVSYGNNGCNGGLMDYAFEYIINNRGIDTEASY 202


>UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryza
           sativa|Rep: Putative cysteine proteinase - Oryza sativa
           subsp. japonica (Rice)
          Length = 352

 Score =  112 bits (270), Expect = 5e-24
 Identities = 67/195 (34%), Positives = 100/195 (51%), Gaps = 3/195 (1%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRA-NRGFTMSVNHLADRTDDELAALRGRRY 184
           +H R Y    E  +R  +F+ ++  I  +N A N+ + ++ N   D TD E AA+    Y
Sbjct: 48  EHGRTYKDAAEKARRFRVFKANVDLIDRSNAAGNKRYRLATNRFTDLTDAEFAAM----Y 103

Query: 185 SGPSPHGLPFPYSKS--RVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
           +G +P    +  + +  R+     + P E DWR  GAVT VK+Q  CG CW+F TV AVE
Sbjct: 104 TGYNPANTMYAAANATTRLSSEDDQQPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAAVE 163

Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQ 538
           G +     G LV LS+Q L+DC+    N GC GG    A++++   G  T E    Y G 
Sbjct: 164 G-IHQITTGELVSLSEQQLLDCA---DNGGCTGGSLDNAFQYMANSGGVTTEAAYAYQGA 219

Query: 539 DGYCHVDNVTAVTSI 583
            G C  D  ++ + +
Sbjct: 220 QGACQFDASSSASGV 234


>UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin
           L - Misgurnus mizolepis (Mud loach)
          Length = 337

 Score =  112 bits (269), Expect = 7e-24
 Identities = 73/193 (37%), Positives = 104/193 (53%), Gaps = 7/193 (3%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRAN----RGFTMSVNHLADRTDDELAAL 169
           K  H + Y    E  +R+ I+ ++LR I  +N  +      + + +NH  D   +E    
Sbjct: 33  KTWHGKNYHEKEEGWRRM-IWEKNLRKIQFHNLEHSMGIHTYRLGMNHFGDMNHEEF--- 88

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELS-VKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
             R+      H     +  S   E + +++P + DWR  G VTPVKDQ  CGSCW+F T 
Sbjct: 89  --RQVMNGYKHKTERKFKGSLFMEPNFLEVPSKLDWREKGYVTPVKDQGECGSCWAFSTT 146

Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIK-RHGLPTEEDYG 523
           GA+EG +F    G LV LS+Q L+DCS   GN GC+GG   +A+++IK  +GL +EE Y 
Sbjct: 147 GAMEGQMF-RKQGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNNGLDSEEAY- 204

Query: 524 GYLGQDGY-CHVD 559
            YLG D   CH D
Sbjct: 205 PYLGTDDQPCHYD 217


>UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza
           sativa|Rep: Cysteine proteinase-like - Oryza sativa
           subsp. japonica (Rice)
          Length = 360

 Score =  112 bits (269), Expect = 7e-24
 Identities = 72/195 (36%), Positives = 97/195 (49%), Gaps = 7/195 (3%)
 Frame = +2

Query: 17  RQYASDLEHEKRLNIFRQSLRYIHSNNRA--NRGFTMSVNHLADRTDDELAALR-GRRYS 187
           R YA   E  +R+ +F  +   + + NRA  +R +T+ +N  +D TDDE A    G  ++
Sbjct: 52  RAYADAAEKARRMEVFAANAERVDAANRAGGDRTYTLGLNQFSDLTDDEFAQTHLGYSWA 111

Query: 188 GPSP---HGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
            P P   HG       +        +P   DWR  GAVT VK+Q  CGSCW+F  V A E
Sbjct: 112 PPPPSHRHGHRAENGTAAAAADDTDVPDSVDWRARGAVTEVKNQRSCGSCWAFAAVAATE 171

Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLG 535
           G + L   G+LV LS+Q ++DC+   G N C GG+   A  +I    GL TE  Y  Y G
Sbjct: 172 GLVQLAT-GNLVSLSEQQVLDCTG--GANTCSGGDVSAALRYIAASGGLQTEAAY-AYGG 227

Query: 536 QDGYCHVDNVTAVTS 580
           Q G C      A  S
Sbjct: 228 QQGACRAGGFAAPNS 242


>UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryza
           sativa (japonica cultivar-group)|Rep: Putative cysteine
           proteinase - Oryza sativa subsp. japonica (Rice)
          Length = 357

 Score =  112 bits (269), Expect = 7e-24
 Identities = 66/173 (38%), Positives = 91/173 (52%), Gaps = 3/173 (1%)
 Frame = +2

Query: 11  HQRQYASDLEHEKRLNIFRQSLRYIHSNNRAN--RGFTMSVNHLADRTDDELAALRGRRY 184
           H R Y   LE  +R  +FR +  +I S N A   +   ++ N  AD T++E A   GR +
Sbjct: 56  HGRTYKDSLEKARRFEVFRTNALFIDSFNAAGGKKSPRLTTNKFADLTNEEFAEYYGRPF 115

Query: 185 SGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
           S P   G  F Y   R  ++    P   +WR  GAVT VK+Q  C SCW+F  V AVEG 
Sbjct: 116 STPVIGGSGFMYGNVRTSDV----PANINWRDRGAVTQVKNQKDCASCWAFSAVAAVEG- 170

Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDY 520
           +      +LV LS Q L+DCS G  N+GC+ G+   A+ +I  + G+  E DY
Sbjct: 171 IHQIRSHNLVALSTQQLLDCSTGRNNHGCNRGDMDEAFRYITSNGGIAAESDY 223


>UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2;
           Dictyostelium discoideum|Rep: Cysteine proteinase 2
           precursor - Dictyostelium discoideum (Slime mold)
          Length = 376

 Score =  112 bits (269), Expect = 7e-24
 Identities = 69/175 (39%), Positives = 96/175 (54%), Gaps = 3/175 (1%)
 Frame = +2

Query: 5   VKHQRQYASDLEHEKRLNIFRQSLRYIHS-NNRANRGFTMSVNHLADRTDDELA-ALRGR 178
           +K  RQY+S  E   R +IF+ ++ Y+ + N++ +    + +N+ AD T++E      G 
Sbjct: 41  LKFNRQYSSS-EFSNRYSIFKSNMDYVDNWNSKGDSQTVLGLNNFADITNEEYRKTYLGT 99

Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
           R +  S +G         VE+L    P   DWR   AVTP+KDQ  CGSCWSF T G+ E
Sbjct: 100 RVNAHSYNGYD-GREVLNVEDLQTN-PKSIDWRTKNAVTPIKDQGQCGSCWSFSTTGSTE 157

Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDY 520
           GA  L     LV LS+Q L+DCS    N GCDGG    A+++ IK  G+ TE  Y
Sbjct: 158 GAHALKT-KKLVSLSEQNLVDCSGPEENFGCDGGLMNNAFDYIIKNKGIDTESSY 211


>UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16;
           Bromeliaceae|Rep: Fruit bromelain precursor - Ananas
           comosus (Pineapple)
          Length = 351

 Score =  112 bits (269), Expect = 7e-24
 Identities = 66/191 (34%), Positives = 109/191 (57%), Gaps = 6/191 (3%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHS-NNRANRGFTMSVNHLADRTDDELAALRGRRY 184
           ++ R Y  D E  +R  IF+ ++++I + N+R    +T+ +N   D T  E  A    +Y
Sbjct: 43  EYGRVYKDDDEKMRRFQIFKNNVKHIETFNSRNENSYTLGINQFTDMTKSEFVA----QY 98

Query: 185 SGPSPHGLPFPYSKSRV---EELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSFGTVGA 352
           +G S   LP    +  V   +++++   P+  DWR +GAV  VK+Q+ CGSCWSF  +  
Sbjct: 99  TGVS---LPLNIEREPVVSFDDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWSFAAIAT 155

Query: 353 VEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGY 529
           VEG ++    G+LV LS+Q ++DC+  +   GC GG   +AY++ I  +G+ TEE+Y  Y
Sbjct: 156 VEG-IYKIKTGYLVSLSEQEVLDCAVSY---GCKGGWVNKAYDFIISNNGVTTEENY-PY 210

Query: 530 LGQDGYCHVDN 562
           L   G C+ ++
Sbjct: 211 LAYQGTCNANS 221


>UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3;
           Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber
           officinale (Ginger)
          Length = 475

 Score =  111 bits (267), Expect = 1e-23
 Identities = 74/201 (36%), Positives = 110/201 (54%), Gaps = 7/201 (3%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRA-NRG---FTMSVNHLADRTDDELAA- 166
           +VKH+         + RL +F+++LR++  +N A +RG   + + +N  AD T++E  A 
Sbjct: 56  RVKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNRFADLTNEEYRAR 115

Query: 167 -LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGT 343
            LR     G S  G     ++ R+ E  V LP   DWR  GAV  VK+Q  CGSCW+F  
Sbjct: 116 FLRDLSRLGRSTSGEIS--NQYRLREGDV-LPDSIDWREKGAVVAVKNQGRCGSCWAFAA 172

Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYG 523
           + AVEG   +   G L+ LS+Q L+DCS    N GC+GG  +RA+++I  +G    E++ 
Sbjct: 173 IAAVEGINQIVT-GDLISLSEQQLVDCS--TRNYGCEGGWPYRAFQYIINNGGVNSEEHY 229

Query: 524 GYLGQDGYCHVDNVTA-VTSI 583
            Y G +G C+     A V SI
Sbjct: 230 PYTGTNGTCNTTKENAHVVSI 250


>UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza
           sativa|Rep: Os01g0347600 protein - Oryza sativa subsp.
           japonica (Rice)
          Length = 343

 Score =  110 bits (264), Expect = 3e-23
 Identities = 70/188 (37%), Positives = 93/188 (49%), Gaps = 2/188 (1%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRGFTMSVNHLADRTDDELAALRGRRY 184
           K  + Y    E E R  IFR ++ +I     +      + +N  AD T+DE  A     Y
Sbjct: 50  KFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVAT----Y 105

Query: 185 SGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
           +G  P   P P    R  +  +  P   DWR  GAVT VKDQ  CGSCW+F  V A+EG 
Sbjct: 106 TGAKP---PHPKEAPRPVD-PIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGL 161

Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI-KRHGLPTEEDYGGYLGQD 541
             +   G L  LS+Q L+DC     +NGC GG   RA+E +  + G+  E DY  Y G  
Sbjct: 162 TKIRT-GQLTPLSEQELVDCD--TNSNGCGGGHTDRAFELVASKGGITAESDY-RYEGFQ 217

Query: 542 GYCHVDNV 565
           G C VD++
Sbjct: 218 GKCRVDDM 225


>UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsin L
           - Suberites domuncula (Sponge)
          Length = 324

 Score =  110 bits (264), Expect = 3e-23
 Identities = 67/188 (35%), Positives = 103/188 (54%), Gaps = 5/188 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANR--GFTMSVNHLADRTDDELAALRG 175
           K +H ++Y  +LE  +R  I++ + ++I S+N  +   G+T+ +N   D +  E   +  
Sbjct: 27  KQEHSKEYTEELEELRRHTIWQSNKKFIDSHNSVSDKFGYTLEMNEFGDLSGVEFKQI-- 84

Query: 176 RRYSGPSPHGLPFPYSKSRVEELSVKLPPEH--DWRLFGAVTPVKDQSVCGSCWSFGTVG 349
             Y+G   + +    + +++   S  + P    DWR  G V+ VK+Q  CGSCWSF   G
Sbjct: 85  --YNG---YIMQERANDTKLFTASPYMEPAASVDWRQKGVVSEVKNQGQCGSCWSFSATG 139

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGG 526
           ++EG   L  G  LV LS+Q L+DCS  FGN+GC GG    A+ + I  HG+ TE  Y  
Sbjct: 140 SLEGQHALKMG-RLVSLSEQNLMDCSSRFGNHGCKGGIMDDAFRYVISNHGVDTESSY-P 197

Query: 527 YLGQDGYC 550
           Y  +DGYC
Sbjct: 198 YTAKDGYC 205


>UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra
           erinaceieuropaei|Rep: Cysteine proteinase - Spirometra
           erinaceieuropaei (Tapeworm)
          Length = 336

 Score =  110 bits (264), Expect = 3e-23
 Identities = 70/199 (35%), Positives = 101/199 (50%), Gaps = 5/199 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRAN----RGFTMSVNHLADRTDDELAAL 169
           K+  +++Y S  E   R   F  +L +I  +N+        + + +N  +D T  E A  
Sbjct: 36  KLAFKKEYFSSEEELHRKRAFFNNLDFIIRHNQRYYQQLESYAVRLNDFSDLTPGEFA-- 93

Query: 170 RGRRYSGPSPHGLPFPYSKSRVE-ELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
              RY       L     K  V   L   LP   +WR  GAVT VK+Q  CGSCWSF   
Sbjct: 94  --ERYLCLRGIVLTKLRRKEAVSVPLKENLPDSVNWRERGAVTSVKNQGQCGSCWSFSAN 151

Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGG 526
           GA+EGA+ +  G  L  LS+Q L+DCSW +GN GC+GG   +A+++ +R+G+  E DY  
Sbjct: 152 GAIEGAIQIKTGA-LRSLSEQQLMDCSWDYGNQGCNGGLMPQAFQYAQRYGVEAEVDY-R 209

Query: 527 YLGQDGYCHVDNVTAVTSI 583
           Y  +DG C       V ++
Sbjct: 210 YTERDGVCRYRQDLVVANV 228


>UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis
           thaliana|Rep: F9P14.12 protein - Arabidopsis thaliana
           (Mouse-ear cress)
          Length = 343

 Score =  109 bits (263), Expect = 4e-23
 Identities = 66/181 (36%), Positives = 94/181 (51%), Gaps = 1/181 (0%)
 Frame = +2

Query: 11  HQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSG 190
           H + Y    E   R  I++ +++ I   N  +  F ++ N  AD T+ E  A     + G
Sbjct: 50  HSKLYGGRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEFKA----HFLG 105

Query: 191 PSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALF 370
            +   L     +  V + +  +P   DWR  GAVTP+++Q  CG CW+F  V A+EG   
Sbjct: 106 LNTSSLRLHKKQRPVCDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINK 165

Query: 371 LHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLGQDGY 547
           +   G+LV LS+Q LIDC  G  N GC GG    A+E+IK + GL TE DY  Y G +G 
Sbjct: 166 IKT-GNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKTNGGLATETDY-PYTGIEGT 223

Query: 548 C 550
           C
Sbjct: 224 C 224


>UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease;
           n=23; Magnoliophyta|Rep: Senescence-specific cysteine
           protease - Arabidopsis thaliana (Mouse-ear cress)
          Length = 346

 Score =  109 bits (263), Expect = 4e-23
 Identities = 72/187 (38%), Positives = 99/187 (52%), Gaps = 5/187 (2%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYI-HSNN-RANRGFTMSVNHLADRTDDELAAL-RGR 178
           KH R YA   E   R  +F+ ++  I H N+  A R F ++VN  AD T+DE  ++  G 
Sbjct: 44  KHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFRSMYTGF 103

Query: 179 RYSGPSPHGLPFPYSKSRVEELSV-KLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
           +             S  R + +S   LP   DWR  GAVTP+K+Q  CG CW+F  V A+
Sbjct: 104 KGVSALSSQSQTKMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAI 163

Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIK-RHGLPTEEDYGGYL 532
           EGA  +   G L+ LS+Q L+DC     + GC+GG    A+E IK   GL TE +Y  Y 
Sbjct: 164 EGATQIKK-GKLISLSEQQLVDCD--TNDFGCEGGLMDTAFEHIKATGGLTTESNY-PYK 219

Query: 533 GQDGYCH 553
           G+D  C+
Sbjct: 220 GEDATCN 226


>UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960
           precursor; n=2; Arabidopsis thaliana|Rep: Probable
           cysteine proteinase At3g43960 precursor - Arabidopsis
           thaliana (Mouse-ear cress)
          Length = 376

 Score =  109 bits (263), Expect = 4e-23
 Identities = 72/182 (39%), Positives = 97/182 (53%), Gaps = 3/182 (1%)
 Frame = +2

Query: 5   VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNR-ANRGFTMSVNHLADRTDDEL-AALRGR 178
           V++ + Y    E E+R  IF+ +L+ I  +N   NR +   +N  +D T DE  A+  G 
Sbjct: 46  VENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQASYLGG 105

Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTP-VKDQSVCGSCWSFGTVGAV 355
           +    S   +   Y   + +E  V LP E DWR  GAV P VK Q  CGSCW+F   GAV
Sbjct: 106 KMEKKSLSDVAERY---QYKEGDV-LPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAV 161

Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLG 535
           EG   +  G  LV LS+Q LIDC  G  N GC GG    A+E+IK +G    ++  GY G
Sbjct: 162 EGINQITTG-ELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEVYGYTG 220

Query: 536 QD 541
           +D
Sbjct: 221 ED 222


>UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza
           sativa|Rep: Putative cysteine protease - Oryza sativa
           subsp. japonica (Rice)
          Length = 357

 Score =  108 bits (260), Expect = 8e-23
 Identities = 68/189 (35%), Positives = 95/189 (50%), Gaps = 3/189 (1%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRGFTMSVNHLADRTDDE-LAALRGRR 181
           K  + Y    E E R  +FR ++R+I S    A     + +N  AD T+ E +A   G +
Sbjct: 50  KFGKTYKCHGEKEHRFAVFRDNVRFIRSYRPEATYDSAVRINQFADLTNGEFVATYTGVK 109

Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
              P+ H  P P    R  +  + +P   DWR  GAVT VKDQ  CGS W+F  V A+EG
Sbjct: 110 QPPPATHPHPHPEEAPRPVD-PIWMPCCIDWRFKGAVTGVKDQGACGSSWAFAAVAAMEG 168

Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFG-NNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQ 538
            + +   G L  LS+Q L+DC  G G ++GC GG    A++ +   G  T E    Y G 
Sbjct: 169 LMKIRT-GQLTPLSEQELVDCVDGGGDSDGCGGGHTDAAFQLVVDKGGITAESEYRYEGY 227

Query: 539 DGYCHVDNV 565
            G C VD++
Sbjct: 228 KGRCRVDDM 236


>UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean
           endopeptidase) (Cysteine proteinase)
           (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
           Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor
           (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase)
           (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
           Vignain-2] - Vigna mungo (Rice bean) (Black gram)
          Length = 362

 Score =  108 bits (260), Expect = 8e-23
 Identities = 63/177 (35%), Positives = 93/177 (52%), Gaps = 1/177 (0%)
 Frame = +2

Query: 38  EHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDEL-AALRGRRYSGPSPHGLPF 214
           E  KR N+F+ ++ ++H+ N+ ++ + + +N  AD T+ E  +   G + +         
Sbjct: 55  EKHKRFNVFKANVMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHHKMFRGSQ 114

Query: 215 PYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLV 394
             S + + E    +P   DWR  GAVT VKDQ  CGSCW+F T+ AVEG   +     LV
Sbjct: 115 HGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKT-NKLV 173

Query: 395 RLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYCHVDNV 565
            LS+Q L+DC     N GC+GG    A+E+IK+ G  T E    Y  Q+G C    V
Sbjct: 174 SLSEQELVDCD-KEENQGCNGGLMESAFEFIKQKGGITTESNYPYTAQEGTCDESKV 229


>UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core
           eudicotyledons|Rep: Cysteine proteinase -
           Mesembryanthemum crystallinum (Common ice plant)
          Length = 367

 Score =  107 bits (258), Expect = 1e-22
 Identities = 62/173 (35%), Positives = 91/173 (52%), Gaps = 2/173 (1%)
 Frame = +2

Query: 38  EHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAAL--RGRRYSGPSPHGLP 211
           E + R ++F+++++YI+  N+ ++ + + +N   D T  E A      +   G       
Sbjct: 59  EKQNRFHVFKENVKYINEVNKMDKPYKLRLNQFGDLTPSEFARTYANSKIIEGTRNESGG 118

Query: 212 FPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHL 391
           F Y        +V++P   DWR+ GAVTPVK+Q  CG CW+F    AVEG   +   G L
Sbjct: 119 FMYE-------NVEVPRSIDWRVKGAVTPVKNQGRCGGCWAFSAAAAVEGINQI-TTGQL 170

Query: 392 VRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYC 550
           + LS+Q LIDC     N+GC GG   RA+E+IK+ G  T E    Y  Q G C
Sbjct: 171 ISLSEQQLIDCD--TQNSGCRGGTMGRAFEYIKQRGGITSEANYPYKAQAGMC 221


>UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila
           melanogaster|Rep: LD36817p - Drosophila melanogaster
           (Fruit fly)
          Length = 352

 Score =  107 bits (258), Expect = 1e-22
 Identities = 71/171 (41%), Positives = 88/171 (51%), Gaps = 7/171 (4%)
 Frame = +2

Query: 29  SDLEHEKRLNIFRQSLRYIH-SNNRANRG---FTMSVNHLADRTDDELAALRGRRYS--G 190
           SD E   R +IF   +  I  SN  A+ G   F + VN LAD T  E+A L G + S  G
Sbjct: 50  SDEERVYRESIFAAKMSLITLSNKNADNGVSGFRLGVNTLADMTRKEIATLLGSKISEFG 109

Query: 191 PSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSV-CGSCWSFGTVGAVEGAL 367
                    +  +R    S  LP   DWR  G VTP   Q V CG+CWSF T GA+EG L
Sbjct: 110 ERYTNGHINFVTAR-NPASANLPEMFDWREKGGVTPPGFQGVGCGACWSFATTGALEGHL 168

Query: 368 FLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
           F   G  L  LSQQ L+DC+  +GN GCDGG     +E+I+ HG+     Y
Sbjct: 169 FRRTGV-LASLSQQNLVDCADDYGNMGCDGGFQEYGFEYIRDHGVTLANKY 218


>UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3;
           Bilateria|Rep: Cathepsin L-like cysteine protease -
           Neobenedenia melleni
          Length = 335

 Score =  107 bits (257), Expect = 2e-22
 Identities = 63/181 (34%), Positives = 99/181 (54%), Gaps = 8/181 (4%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDELAAL 169
           KVK+Q+ Y S  +   +L  + ++L  +  +N    +  + +T+++NH+AD + +E  AL
Sbjct: 31  KVKYQKDYLSSEDELNKLLTWSKNLETVRKHNELYAQGKKSYTLAMNHMADLSSEEFKAL 90

Query: 170 RGRRYSGPSPHGLPFPYS-KSRVEELSVKLPP--EHDWRLFGAVTPVKDQSVCGSCWSFG 340
               Y  P       P   K+  E   +K  P  E DW   G VT VK+Q+ CGSCW+F 
Sbjct: 91  ----YLVPKFDATKVPRKGKAAGEHRQIKNDPPSEIDWVRKGHVTAVKNQAQCGSCWAFS 146

Query: 341 TVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEED 517
           + G++EGA+     G L+  S+Q L+DCS  FGN+GC+GG    ++ + I   GL +E  
Sbjct: 147 STGSIEGAV-KRATGKLISFSEQQLVDCSTAFGNHGCNGGIMDNSFNYLIHNKGLESEAS 205

Query: 518 Y 520
           Y
Sbjct: 206 Y 206


>UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n=1;
           Xenopus tropicalis|Rep: UPI00006A2275 UniRef100 entry -
           Xenopus tropicalis
          Length = 272

 Score =  107 bits (256), Expect = 3e-22
 Identities = 69/182 (37%), Positives = 100/182 (54%), Gaps = 6/182 (3%)
 Frame = +2

Query: 23  YASDLEHEKRLNIFRQSLRYIHSNN-RANRG---FTMSVNHLADRTDDELAA-LRGRRYS 187
           Y S  E   R  I+ ++L++I  +N   + G   + + +NHL D T +E+AA + G   S
Sbjct: 1   YNSQEEERARRTIWEETLKFISVHNLEYSLGLHTYEVGMNHLGDMTGEEVAATMTGYTGS 60

Query: 188 GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQ-SVCGSCWSFGTVGAVEGA 364
           G S   +    S    E L    PP  DWR    VTPV+DQ S C SC++F  VGA+E  
Sbjct: 61  GDSLANM----SHVPKEILEALAPPSIDWRTQNCVTPVRDQGSFCRSCYAFSAVGALE-C 115

Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDG 544
            +      LV  S Q L+DCS G GN+GC+GG+  +A++++K++G+  E  Y  Y GQ G
Sbjct: 116 QWKKKTVRLVTFSPQELVDCSDGEGNHGCNGGKIEKAFKYMKKYGVMEESAY-PYTGQKG 174

Query: 545 YC 550
            C
Sbjct: 175 LC 176


>UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 383

 Score =  107 bits (256), Expect = 3e-22
 Identities = 61/173 (35%), Positives = 91/173 (52%), Gaps = 1/173 (0%)
 Frame = +2

Query: 5   VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAAL-RGRR 181
           +K  R+Y S  E E R  IF +++    +    N G  + VN   D TD+EL  + +  +
Sbjct: 87  LKFDRKYTSVEEFEYRYQIFLRNVIEFEAEEERNLGLDLDVNEFTDWTDEELQKMVQENK 146

Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
           Y+    +    P  +    E  V  P   DWR  G +TP+K+Q  CGSCW+F TV +VE 
Sbjct: 147 YT---KYDFDTPKFEGSYLETGVIRPASIDWREQGKLTPIKNQGQCGSCWAFATVASVEA 203

Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
              +   G LV LS+Q ++DC     NNGC GG    A +++K +GL +E++Y
Sbjct: 204 QNAIKK-GKLVSLSEQEMVDCDG--RNNGCSGGYRPYAMKFVKENGLESEKEY 253


>UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61;
           Leishmania|Rep: Cysteine proteinase 2 precursor -
           Leishmania pifanoi
          Length = 444

 Score =  107 bits (256), Expect = 3e-22
 Identities = 67/187 (35%), Positives = 97/187 (51%), Gaps = 5/187 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAA--LRG 175
           K  + R Y +  E ++RL  F ++L  +  +   N      +    D ++ E AA  L G
Sbjct: 42  KRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAARYLNG 101

Query: 176 RRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
             Y   +       Y K+R +  +V  P   DWR  GAVTPVKDQ  CGSCW+F  VG +
Sbjct: 102 AAYFAAAKRHAAQHYRKARADLSAV--PDAVDWREKGAVTPVKDQGACGSCWAFSAVGNI 159

Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH---GLPTEEDYGG 526
           EG  +L  G  LV LS+Q L+ C     N+GCDGG   +A++W+ ++    L TE+ Y  
Sbjct: 160 EGQWYL-AGHELVSLSEQQLVSCD--DMNDGCDGGLMLQAFDWLLQNTNGHLHTEDSY-P 215

Query: 527 YLGQDGY 547
           Y+  +GY
Sbjct: 216 YVSGNGY 222


>UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Rep:
           Cathepsin R precursor - Mus musculus (Mouse)
          Length = 334

 Score =  106 bits (255), Expect = 3e-22
 Identities = 69/188 (36%), Positives = 100/188 (53%), Gaps = 5/188 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRAN----RGFTMSVNHLADRTDDELAAL 169
           K+K+ + Y+   E  KR+ ++ + L+ I  +NR N     GFTM +N   D+TD+E   +
Sbjct: 33  KIKYNKSYSLKEEKLKRV-VWEEKLKMIKLHNRENSLGKNGFTMKMNEFGDQTDEEFRKM 91

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
                      G     S  + E  S+ LP   DWR  G VTPV+ Q  C +CW+F   G
Sbjct: 92  MIEISVWTHREGK----SIMKREAGSI-LPKFVDWRKKGYVTPVRRQGDCDACWAFAVTG 146

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGG 526
           A+E A  +   G L  LS Q L+DCS   GNNGC GG+ + A++++  + GL +E  Y  
Sbjct: 147 AIE-AQAIWQTGKLTPLSVQNLVDCSKPQGNNGCLGGDTYNAFQYVLHNGGLESEATY-P 204

Query: 527 YLGQDGYC 550
           Y G+DG C
Sbjct: 205 YEGKDGPC 212


>UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:
           Cathepsin - Petromyzon marinus (Sea lamprey)
          Length = 333

 Score =  106 bits (254), Expect = 4e-22
 Identities = 68/201 (33%), Positives = 100/201 (49%), Gaps = 9/201 (4%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDEL-AA 166
           K  + + Y S+ E   R ++F Q+L+ +  +N      N  F + +N  +D    E    
Sbjct: 31  KSTYGKHYGSEQEDAHRRDVFEQNLKRVLQHNLLADEGNVSFHLGINKYSDLELHEYHEK 90

Query: 167 LRGRRYS---GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSF 337
           + GR ++   G    G PFP            LP + DWRL G VTPVK+Q +CGS W+F
Sbjct: 91  VVGRFWNLRNGTRRRGAPFPLRSMD------NLPEQVDWRLKGYVTPVKEQGLCGSSWAF 144

Query: 338 GTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEE 514
              G++EG  F    G+L  LS+Q L+DC+  + NNGC+GG   RA ++ I  +G+ +E 
Sbjct: 145 SATGSLEGQHFAAT-GNLTSLSEQQLVDCTKSYYNNGCNGGRSERALQYIIDNNGIDSEL 203

Query: 515 DYGGYLGQDGYCHVDNVTAVT 577
            Y  Y   DG C        T
Sbjct: 204 SY-PYEHADGKCRFKPANVAT 223


>UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes
           vastus|Rep: Cathepsin L - Aphrocallistes vastus
          Length = 329

 Score =  106 bits (254), Expect = 4e-22
 Identities = 65/195 (33%), Positives = 96/195 (49%), Gaps = 3/195 (1%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
           K+K+ R Y   L+ E R  I+  ++ Y+   N     + ++ N  AD T+ E   +    
Sbjct: 34  KLKYNRSYG--LDEELRKKIWANNMLYVKEFNAEGHSYKLAANQFADLTNLEYRQI---- 87

Query: 182 YSGPSPHGLPFPYSKSRVEELSVK---LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGA 352
           Y G           + +V +  +K   LP   DWR  G VTPVK+Q  CGSCWSF   G+
Sbjct: 88  YLGYDNEARLSRKREGKVFQRKMKDEDLPTTVDWRSKGVVTPVKNQGQCGSCWSFSATGS 147

Query: 353 VEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYL 532
           +EG  +    G LV  S+Q L+DCS   GN+GC GG    A+++ + +    E DY  Y 
Sbjct: 148 LEGQ-YAIKSGKLVSFSEQELVDCSTSLGNHGCQGGLMDYAFKYWETNLAEKESDY-TYT 205

Query: 533 GQDGYCHVDNVTAVT 577
            ++G C  +    VT
Sbjct: 206 AKNGKCKYNAQLGVT 220


>UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           L-like proteinase" precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 315

 Score =  105 bits (253), Expect = 6e-22
 Identities = 66/192 (34%), Positives = 100/192 (52%), Gaps = 5/192 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNR----ANRGFTMSVNHLADRTDDELAAL 169
           K  H + Y + +E + R  +F+ +L+ I  +N         + ++VN  AD +  E  A+
Sbjct: 28  KATHNKSY-NVIEDKLRFAVFQDNLKKIEEHNAKYESGEETYYLAVNKFADWSSAEFQAM 86

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
             R+ +          +    V + +V+   E DWR   AV  VKDQ  CGSCW+F T G
Sbjct: 87  LARQMANKPKQS----FIAKHVADPNVQAVEEVDWR-DSAVLGVKDQGQCGSCWAFSTTG 141

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGY 529
           ++EG L +H     V LS+Q L+DC     N GC+GG    A+ ++KRHGL +E  Y  Y
Sbjct: 142 SLEGQLAIHK-NQRVPLSEQELVDCDTS-RNAGCNGGLMTDAFNYVKRHGLSSESQY-AY 198

Query: 530 LGQDGYC-HVDN 562
            G+D  C +V+N
Sbjct: 199 TGRDDRCKNVEN 210


>UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11;
           Entamoeba|Rep: Cysteine proteinase 2 precursor -
           Entamoeba histolytica
          Length = 315

 Score =  105 bits (252), Expect = 8e-22
 Identities = 62/195 (31%), Positives = 105/195 (53%), Gaps = 3/195 (1%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNH-LADRTDDELAALRGRRY 184
           K+ + + + +E  +R  IF  + +++ S N+    F +SV+   A  T++E   L   + 
Sbjct: 22  KNNKHFTA-IEKLRRRAIFNMNAKFVDSFNKIG-SFKLSVDGPFAAMTNEEYRTLLKSKR 79

Query: 185 SGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
           +              +V+ L+++ P   DWR  G VTP++DQ+ CGSC++FG++ A+EG 
Sbjct: 80  TTEE---------NGQVKYLNIQAPESVDWRKEGKVTPIRDQAQCGSCYTFGSLAALEGR 130

Query: 365 LFLHNGG--HLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQ 538
           L +  GG  + + LS++ ++ C+   GNNGC+GG     Y++I  HG+  E DY  Y G 
Sbjct: 131 LLIEKGGDANTLDLSEEHMVQCTRDNGNNGCNGGLGSNVYDYIIEHGVAKESDY-PYTGS 189

Query: 539 DGYCHVDNVTAVTSI 583
           D  C   NV +   I
Sbjct: 190 DSTCKT-NVKSFAKI 203


>UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3;
           Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays
           (Maize)
          Length = 493

 Score =  105 bits (251), Expect = 1e-21
 Identities = 76/195 (38%), Positives = 101/195 (51%), Gaps = 9/195 (4%)
 Frame = +2

Query: 26  ASDLEHEKRLNIFRQSLRYIHSNNRAN----RGFTMSVNHLADRTDDELAA--LRGRRYS 187
           A + +  +RL +FR +LRYI ++N        GF + +   AD T +E  A  L G R  
Sbjct: 84  AGEDDDARRLEVFRDNLRYIDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSRGR 143

Query: 188 GPSPHGLPFPYSKSRVEELS-VKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
             +  G+     + R   L+  +LP   DWR  GAV  VKDQ  CG CW+F  V AVEG 
Sbjct: 144 NGTAVGV---VGRRRYLPLAGEQLPDAVDWRERGAVAEVKDQGQCGGCWAFSAVAAVEGI 200

Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYLGQD 541
             +  G  L+ LS+Q LIDC   F + GCDGG    A+ + IK  G+ TE DY  + G D
Sbjct: 201 NKIVTGS-LISLSEQELIDCD-KFQDQGCDGGLMDNAFVFMIKNGGIDTEADY-PFTGHD 257

Query: 542 GYCHVD-NVTAVTSI 583
           G C +    T V SI
Sbjct: 258 GTCDLKLKNTRVVSI 272


>UniRef50_Q22RR8 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 332

 Score =  105 bits (251), Expect = 1e-21
 Identities = 69/185 (37%), Positives = 100/185 (54%), Gaps = 4/185 (2%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
           KH   Y +  E   R  +FR +L+ I  ++  N G T  +    D T +E      +RY 
Sbjct: 49  KHSITYKTIEEKLHRFAVFRDNLKKIEGHS--NYGITKFM----DLTSEEFQ----QRYL 98

Query: 188 GPSPHGLPFPYSKSRVE--ELSVKLPPEH--DWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
               + +     KS  +  +L++KL  +   DW   GAVTPVKDQ  CGSCW+F   GA+
Sbjct: 99  RLKTNTIKRQNFKSNPKNAQLNMKLGDDIIIDWTKKGAVTPVKDQEQCGSCWAFSATGAL 158

Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLG 535
           E A F+ + G L  LS+Q L+DCS  +GN GCDGG+   A+++I  + + TE++Y  Y G
Sbjct: 159 ESATFI-STGTLPSLSEQELVDCSTSYGNEGCDGGDMDAAFKFIHDNNIATEKEY-TYRG 216

Query: 536 QDGYC 550
            D  C
Sbjct: 217 FDQKC 221


>UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain;
           n=16; Chrysomelidae|Rep: Digestive cysteine protease
           intestain - Leptinotarsa decemlineata (Colorado potato
           beetle)
          Length = 326

 Score =  105 bits (251), Expect = 1e-21
 Identities = 67/200 (33%), Positives = 106/200 (53%), Gaps = 6/200 (3%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRG---FTMSVNHLADRTDDELA-A 166
           K  H + Y + LE + R  IF+++L  I  +N R ++G   + + V   AD T +E    
Sbjct: 27  KQTHGKTYKNLLEEKTRFGIFQRNLIKIKEHNARYDKGEETYLLGVTRFADLTHEEFKDI 86

Query: 167 LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
           L+G+  + P  +  P  +     E+L V  P   DW   GAV  VKDQ+ CGSCW+F   
Sbjct: 87  LKGQIKNKPRLNATPTVFP----EDLEV--PDSIDWTEKGAVLEVKDQNPCGSCWAFSAT 140

Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGC-DGGEDFRAYEWIKRHGLPTEEDYG 523
           GA+EG   + N    + LS+Q L+DCS  +GN  C +GG+   A+E+++ +G+ +E+ Y 
Sbjct: 141 GALEGQNAILNNVK-ISLSEQQLLDCSAAYGNGNCKEGGDMSAAFEYVRDYGIQSEKSY- 198

Query: 524 GYLGQDGYCHVDNVTAVTSI 583
            Y+ +   C  D    +  I
Sbjct: 199 PYIRKQTECQYDASKTILKI 218


>UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2;
           Taeniidae|Rep: Cathepsin L-like cysteine proteinase -
           Taenia solium (Pork tapeworm)
          Length = 339

 Score =  104 bits (250), Expect = 1e-21
 Identities = 69/201 (34%), Positives = 101/201 (50%), Gaps = 7/201 (3%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRA-NRG---FTMSVNHLADRTDDELAAL 169
           K++H R Y S  E   R  +F ++L YI   NR  N G   ++  +N  AD    E +  
Sbjct: 39  KLQHGRVY-SGKEEAYRRGVFARNLLYIKGQNRRFNAGLESYSTGLNQFADLESSEFS-- 95

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEEL---SVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFG 340
              R+ G  P        + R+ +    +  LP   DWR    VT VK+Q  CGSCW+F 
Sbjct: 96  --ERFLGTRPESR-VAGRRGRIWKALASAAGLPDTVDWRDKNLVTEVKNQGNCGSCWAFS 152

Query: 341 TVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
           + GA+EGA F    G L+ LS+Q L+DCS   GN+GC+GG    A+++++ H +  E  Y
Sbjct: 153 STGALEGA-FAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYMSYAFKYLEEHFIEPESAY 211

Query: 521 GGYLGQDGYCHVDNVTAVTSI 583
             Y   DG C  +    V ++
Sbjct: 212 -PYRATDGPCRYNESLGVGTV 231


>UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15)
           (Major excreted protein) (MEP) [Contains: Cathepsin L
           heavy chain; Cathepsin L light chain]; n=19;
           Eukaryota|Rep: Cathepsin L precursor (EC 3.4.22.15)
           (Major excreted protein) (MEP) [Contains: Cathepsin L
           heavy chain; Cathepsin L light chain] - Homo sapiens
           (Human)
          Length = 333

 Score =  104 bits (250), Expect = 1e-21
 Identities = 63/178 (35%), Positives = 94/178 (52%), Gaps = 5/178 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANR----GFTMSVNHLADRTDDELAAL 169
           K  H R Y  + E  +R  ++ ++++ I  +N+  R     FTM++N   D T +E   +
Sbjct: 33  KAMHNRLYGMNEEGWRRA-VWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQV 91

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
                +     G  F       E L  + P   DWR  G VTPVK+Q  CGSCW+F   G
Sbjct: 92  MNGFQNRKPRKGKVFQ------EPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATG 145

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDY 520
           A+EG +F    G L+ LS+Q L+DCS   GN GC+GG    A+++++ + GL +EE Y
Sbjct: 146 ALEGQMF-RKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESY 202


>UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep:
           Cysteine proteinase - Cryptobia salmositica
          Length = 443

 Score =  103 bits (248), Expect = 2e-21
 Identities = 65/177 (36%), Positives = 87/177 (49%), Gaps = 4/177 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
           K  H R YAS  E  KR  IF  +++     NR N   T   N  AD T +E        
Sbjct: 29  KAAHARNYASPDEERKRFEIFAGNMKKAAVLNRKNPMATFGPNEFADMTSEEFQTRHNAA 88

Query: 182 YSGPSPHGLPFPYSKS-RVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
               +    P   +K+   EE+   +  + DWRL GAVTPVK+Q  CGSCWSF T G +E
Sbjct: 89  RHYAAAKARPPKNTKTFTAEEIKAAVGQQIDWRLKGAVTPVKNQGACGSCWSFSTTGNIE 148

Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRH--GLPTEEDY 520
           G   +   G LV +S+Q L+ C     ++GC+GG    A+ W I  H   + TE +Y
Sbjct: 149 GQHAIAT-GQLVAVSEQELVSCD--PIDDGCNGGLMDNAFGWLISAHKGQIATEANY 202


>UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9;
           Onchocercidae|Rep: Cathepsin L-like precursor - Brugia
           pahangi (Filarial nematode worm)
          Length = 395

 Score =  103 bits (248), Expect = 2e-21
 Identities = 64/185 (34%), Positives = 95/185 (51%), Gaps = 4/185 (2%)
 Frame = +2

Query: 38  EHEKRLNIFRQS-LRYIHSNNRANRG---FTMSVNHLADRTDDELAALRGRRYSGPSPHG 205
           E+  R+ IF  + L     N +  +G   +T ++N LAD TD+E     G R    +   
Sbjct: 106 ENNFRMAIFESNELMTERINKKYEQGLVSYTTALNDLADLTDEEFMVRNGLRLPNQTDLR 165

Query: 206 LPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGG 385
                S+    + S +LP + DWR  GAVTPV++Q  CGSC++F T  A+E A      G
Sbjct: 166 GKRQTSEFYRYDKSERLPDQVDWRTKGAVTPVRNQGECGSCYAFATAAALE-AYHKQMTG 224

Query: 386 HLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYCHVDNV 565
            L+ LS Q ++DC+   GNNGC GG    A+++  R+G+  E  Y  Y+G +  C     
Sbjct: 225 RLLDLSPQNIVDCTRNLGNNGCSGGYMPTAFQYASRYGIAMESRY-PYVGTEQRCRWQQS 283

Query: 566 TAVTS 580
            AV +
Sbjct: 284 IAVVT 288


>UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis
           thaliana|Rep: Cysteine proteinase - Arabidopsis thaliana
           (Mouse-ear cress)
          Length = 348

 Score =  103 bits (247), Expect = 3e-21
 Identities = 64/181 (35%), Positives = 93/181 (51%), Gaps = 10/181 (5%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANR-GFTMSVNHLADRTDDELAALRG--- 175
           +  R Y+ + E   R NIF+++L ++ + N  N+  + + +N  +D TD+E  A      
Sbjct: 41  RFNRVYSDETEKRNRFNIFKKNLEFVQNFNMNNKITYKVDINEFSDLTDEEFRATHTGLV 100

Query: 176 -----RRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFG 340
                 R S  S      P+    V +    +    DWR  GAVTPVK Q  CG CW+F 
Sbjct: 101 VPEAITRISTLSSGKNTVPFRYGNVSDNGESM----DWRQEGAVTPVKYQGRCGGCWAFS 156

Query: 341 TVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEED 517
            V AVEG   +   G LV LS+Q L+DC   + N GC GG   +A+E+ IK  G+ TE++
Sbjct: 157 AVAAVEGITKI-TKGELVSLSEQQLLDCDRDY-NQGCRGGIMSKAFEYIIKNQGITTEDN 214

Query: 518 Y 520
           Y
Sbjct: 215 Y 215


>UniRef50_Q22A69 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 330

 Score =  103 bits (246), Expect = 4e-21
 Identities = 63/191 (32%), Positives = 94/191 (49%)
 Frame = +2

Query: 11  HQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSG 190
           + ++Y+S+  +  RL+IF+++LR I   N+ N      +   AD T +E A +    Y G
Sbjct: 37  YNKKYSSEEHYNARLSIFKENLRRIELFNK-NDEAQHGITQFADLTHEEFADM----YLG 91

Query: 191 PSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALF 370
             P  L    +K  +       P   DW   GAVTPVK+Q  CGSCW+F T G++EG   
Sbjct: 92  YKPQ-LRNSQAKVSLSSTPFTAPTAIDWTTKGAVTPVKNQGSCGSCWAFSTTGSIEGQYV 150

Query: 371 LHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYC 550
           L    +L   S+Q L+DC     + GC+GG    A+ +++   L TE  Y  Y   DG C
Sbjct: 151 LQLKQNLTSFSEQQLVDCDTK-EDQGCNGGLMDNAFTYLESAKLETESAY-PYTAVDGSC 208

Query: 551 HVDNVTAVTSI 583
             +    V  +
Sbjct: 209 KYNQSLGVVGV 219


>UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10;
           Liliopsida|Rep: Putative cysteine proteinase - Oryza
           sativa subsp. japonica (Rice)
          Length = 416

 Score =  102 bits (245), Expect = 5e-21
 Identities = 63/163 (38%), Positives = 92/163 (56%), Gaps = 6/163 (3%)
 Frame = +2

Query: 44  EKRLNIFRQSLRYIHSNNRANRG--FTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFP 217
           E R  +F+ + RYIH  N+ ++G  + + +N  +D T +E AA    +Y+G       F 
Sbjct: 43  ESRFEVFKANARYIHEFNQKSKGMSYVLGLNKFSDLTYEEFAA----KYTGVKVDASAFA 98

Query: 218 YS--KSRVEELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGH 388
            +   S  EEL V +PP   DWRL GAVT VKDQ  CGSCW F  VGAVEG   +   G+
Sbjct: 99  TATTSSPDEELPVGVPPATWDWRLNGAVTDVKDQGQCGSCWVFSAVGAVEGINAIMT-GN 157

Query: 389 LVRLSQQALIDCSWGFGNNGC-DGGEDFRAYEWIKRHGLPTEE 514
           L+ LS+Q ++DCS       C  GG+   A ++I ++G+  ++
Sbjct: 158 LLTLSEQQVLDCS---NTGDCLKGGDPRAALQYIVKNGVTLDQ 197


>UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1;
           Rhipicephalus appendiculatus|Rep: Midgut cysteine
           proteinase 4 - Rhipicephalus appendiculatus (Brown ear
           tick)
          Length = 345

 Score =  102 bits (245), Expect = 5e-21
 Identities = 63/190 (33%), Positives = 99/190 (52%), Gaps = 6/190 (3%)
 Frame = +2

Query: 11  HQRQYASDLEHEKRLNIFRQSLRYIHS-NNRANRG---FTMSVNHLADRTDDELAALRGR 178
           + + Y +  E   R  +FR++  ++ + + +   G   ++++VNH AD T DE+ A    
Sbjct: 45  YNKTYGTSEETVYREQVFRRTFNFLRTVDEKFKNGTLLYSVAVNHFADMTPDEVVA---- 100

Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
            Y+G  P              L    P   +WR  G VTPVK+Q  CGSCW+F + GA+E
Sbjct: 101 NYTGYKPPSAQQLAEIPLYAPLFGDTPEFIEWRENGFVTPVKNQGQCGSCWAFSSTGALE 160

Query: 359 GALFLHNGGHLVRLSQQALIDCS-WGFGNNGCDGGEDFRAYEWIK-RHGLPTEEDYGGYL 532
           G +F      L+ LS+Q L+DC+   +GNNGC+GG+   A+++++   GL TE  Y    
Sbjct: 161 GQVFKRT-RRLISLSEQNLMDCAGQRYGNNGCNGGQMPGAFQYVQDAGGLDTEARYPYRQ 219

Query: 533 GQDGYCHVDN 562
           G +  C   N
Sbjct: 220 GTNFQCQFSN 229


>UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcoptes
           scabiei type hominis|Rep: Cathepsin L-like protease -
           Sarcoptes scabiei type hominis
          Length = 245

 Score =  102 bits (244), Expect = 7e-21
 Identities = 68/188 (36%), Positives = 94/188 (50%), Gaps = 5/188 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRG----FTMSVNHLADRTDDELAAL 169
           K K+ RQ+ +  +   R  IF+++  YI  +N         + + VN   D T+ E    
Sbjct: 37  KAKYNRQFRTVYDELLRKLIFQRNYIYIRKHNEKYEAGLSTYELGVNQFTDLTNKEYNDQ 96

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
             R       H +   +     E++S  LP E DW L   V P+KDQ  CGSCW+F  V 
Sbjct: 97  MNRL---KVKHDVQSEHVFDN-EDVS-DLPDEVDWTLKNVVAPIKDQKQCGSCWAFSAVA 151

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGG 526
           ++E    L   G LV LS+Q L+DCS G GN GCDGG    A+E+ IK  G+ TE+ Y  
Sbjct: 152 SMESQNALKT-GQLVELSEQELVDCSVGEGNEGCDGGWMDSAFEFVIKADGIDTEKSY-P 209

Query: 527 YLGQDGYC 550
           Y G +  C
Sbjct: 210 YHGVNQVC 217


>UniRef50_Q22RC9 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 389

 Score =  102 bits (244), Expect = 7e-21
 Identities = 66/190 (34%), Positives = 100/190 (52%), Gaps = 9/190 (4%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFT-MSVNHLADRTDDELAALRGR 178
           K +H++ Y + LE ++R  IFRQ+L  I   N+   G     +   +D T +E  +    
Sbjct: 44  KAEHKKFY-NFLEEQRRFEIFRQNLDIISELNQVEEGTAEYGITQFSDMTTEEFKS---- 98

Query: 179 RYSGPSPHGLPFPYSKSR-VEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
           +   PS +   F  S+    +++S   P  +DWR  GAVTPVK+Q   G+CW+F T G +
Sbjct: 99  QILIPSTYARNFTGSRYHGFQKISQDAPTSYDWRDHGAVTPVKNQGTVGTCWTFSTTGNI 158

Query: 356 EGALFLHNGGHLVRLSQQALIDC------SWGFGNNGCDGGEDFRAYEW-IKRHGLPTEE 514
           EG  FL  G  LV LS++ ++DC      S G  + G  GG  + A+++ I   GLP+EE
Sbjct: 159 EGQWFL-AGNPLVSLSEEQIVDCDGSQEPSTGHADCGVFGGWPYLAFDYVINAGGLPSEE 217

Query: 515 DYGGYLGQDG 544
            Y   +G  G
Sbjct: 218 TYPYCVGNGG 227


>UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin K
           precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2);
           n=2; Tribolium castaneum|Rep: PREDICTED: similar to
           Cathepsin K precursor (Cathepsin O) (Cathepsin X)
           (Cathepsin O2) - Tribolium castaneum
          Length = 332

 Score =  101 bits (243), Expect = 9e-21
 Identities = 67/189 (35%), Positives = 97/189 (51%), Gaps = 6/189 (3%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRG----FTMSVNHLADRTDDELAAL 169
           K  H R +   LE   R ++F ++L  +  +N   R     + M VN  +D TD+EL+ L
Sbjct: 31  KAMHARAFFDPLEETFRKSLFTKNLEIVEEHNERFRNGSETYEMGVNKFSDFTDEELSNL 90

Query: 170 RGRRYSGPSPHGLPFPYSKSRV-EELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
            G +   P     P   ++  +   L   +    DWR  G VTPVK+Q  CGSCW+F T+
Sbjct: 91  TGLQV--PLEFEQPLNETEDPLLPSLGRGISASLDWRQRGGVTPVKNQGQCGSCWAFATI 148

Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYG 523
           GA+E    + +    + LS+Q L+DC  G G  GC GG    AY +I R+ G+    DY 
Sbjct: 149 GAIESHYKIRH-KRAISLSEQQLVDCV-GRG-GGCGGGWIPTAYSYIARNKGVNYNRDY- 204

Query: 524 GYLGQDGYC 550
            YLG++G C
Sbjct: 205 PYLGRNGKC 213


>UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3;
           Bilateria|Rep: Cathepsin L-like cysteine proteinase -
           Longidorus elongatus
          Length = 358

 Score =  101 bits (243), Expect = 9e-21
 Identities = 63/192 (32%), Positives = 96/192 (50%), Gaps = 9/192 (4%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDELAA- 166
           K+KH + Y +  E   R  +F  + + I  +N         F +S+N  AD T+ E    
Sbjct: 47  KLKHAKSYKTKDEELLRFQVFASNHKVIEQHNIEYEAGQHSFALSLNKFADMTNAEFRQR 106

Query: 167 LRGRRYSGPSPHGLPFPYSKS-RVEEL--SVKLPPEHDWRLFGAVTPVKDQSVCGSCWSF 337
           + G +           P  +   + E+  +V +P   DWR  G VT VKDQ  CGSCW+F
Sbjct: 107 MNGFKLPAKRKLAKSQPLKEDGMIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCWAF 166

Query: 338 GTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEE 514
              G++EG  +    G LV LS+Q L+DC     + GC+GG    A+++++ + G+ TE 
Sbjct: 167 SATGSLEGQHYKQT-GKLVSLSEQNLVDCDVNGDDEGCNGGYMDGAFQYVETNKGIDTEA 225

Query: 515 DYGGYLGQDGYC 550
            Y  Y G+DG C
Sbjct: 226 SY-PYKGRDGRC 236


>UniRef50_Q22W19 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 332

 Score =  101 bits (243), Expect = 9e-21
 Identities = 65/184 (35%), Positives = 96/184 (52%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
           K+  ++A +++ + R +IF Q+   +   N  N G   ++N  A  T DE   L      
Sbjct: 50  KYGFKFADEVQLQYRRSIFYQNKDLVEQLNSENNGTFHTLNAFAIYTKDEFNQLFKGYQK 109

Query: 188 GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGAL 367
               H +   YS      L   + P  DWR   AVTPVK+Q  CGSCW+F TVG +EGA 
Sbjct: 110 RQKSHLI---YS------LKGDVAPSIDWRQKNAVTPVKNQGQCGSCWAFSTVGGLEGAY 160

Query: 368 FLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGY 547
            +   G+L   S+Q ++DCS    N GC+GG+   AY+++ ++G+ TE DY  Y G +  
Sbjct: 161 AIAT-GNLTSFSEQQIVDCS--KANAGCNGGDLPPAYKYVVQNGIETEADY-PYKGVNQK 216

Query: 548 CHVD 559
           C  D
Sbjct: 217 CAYD 220


>UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core
           eudicotyledons|Rep: Chymopapain precursor - Carica
           papaya (Papaya)
          Length = 352

 Score =  101 bits (243), Expect = 9e-21
 Identities = 58/172 (33%), Positives = 88/172 (51%)
 Frame = +2

Query: 5   VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRY 184
           +KH + Y S  E   R  IFR +L YI   N+ N  + + +N  AD ++DE    +   +
Sbjct: 53  LKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKK-KYVGF 111

Query: 185 SGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
                 GL    ++    +     P   DWR  GAVTPVK+Q  CGSCW+F T+  VEG 
Sbjct: 112 VAEDFTGLEHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGI 171

Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
             +   G+L+ LS+Q L+DC     + GC GG    + +++  +G+ T + Y
Sbjct: 172 NKIVT-GNLLELSEQELVDCD--KHSYGCKGGYQTTSLQYVANNGVHTSKVY 220


>UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F
            like protease; n=1; Nasonia vitripennis|Rep: PREDICTED:
            similar to cathepsin F like protease - Nasonia
            vitripennis
          Length = 1036

 Score =  101 bits (242), Expect = 1e-20
 Identities = 65/185 (35%), Positives = 92/185 (49%), Gaps = 3/185 (1%)
 Frame = +2

Query: 8    KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGF-TMSVNHLADRTDDELAALR-GRR 181
            K+++ Y +  E E R  IF+ +L  I    R   G     V    D T  E  A   G +
Sbjct: 737  KYKKMYHNKEEKEMRFQIFKDNLNLIEELQRNEMGTGRYGVTQFTDLTKAEFKARHLGLK 796

Query: 182  YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
             +  S + +P P +        ++LP ++DWR    VTPVKDQ  CGSCW+F   G +EG
Sbjct: 797  PTLKSENDIPMPMATIP----DIELPSDYDWRHHNVVTPVKDQGSCGSCWAFSVTGNIEG 852

Query: 362  ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKR-HGLPTEEDYGGYLGQ 538
               + + G L+ LS+Q L+DC     ++GC+GG    AY  I+   GL  E DY  Y  +
Sbjct: 853  QYAIKH-GELLSLSEQELVDCD--KLDSGCNGGLPDTAYRAIEELGGLELESDY-PYDAE 908

Query: 539  DGYCH 553
            D  CH
Sbjct: 909  DEKCH 913


>UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 360

 Score =  101 bits (242), Expect = 1e-20
 Identities = 66/189 (34%), Positives = 97/189 (51%), Gaps = 3/189 (1%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAAL-RGR 178
           KVK+ + Y  D E + R ++F  +   I+ +N+      + VN  AD T +E  AL  G 
Sbjct: 49  KVKYAKTYKDDTEEQYRFSVFTNNYVEIYRHNKFLVFSKVGVNQFADLTHEEFKALYTGH 108

Query: 179 RYSGPSPHGLPFPYSKSRVEELSV-KLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
           ++S           +K++   L    LP   DWR  GA+TPVK Q+ CG CW+F TV ++
Sbjct: 109 KHSKDDDDD----DNKNKQPHLPTDNLPASFDWRDKGAITPVKVQNGCGGCWAFSTVQSI 164

Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYL 532
           EG  FL   G L  LS Q +IDC      +GC GG+   A+  I+ + G+ TE +Y  Y+
Sbjct: 165 EGLYFLKT-GKLESLSTQQVIDCC-RIDESGCLGGDPEPAFRCIQNNGGIMTETEY-PYI 221

Query: 533 GQDGYCHVD 559
            +   C  D
Sbjct: 222 AKQQSCKFD 230


>UniRef50_P25779 Cluster: Cruzipain precursor; n=54;
           Trypanosoma|Rep: Cruzipain precursor - Trypanosoma cruzi
          Length = 467

 Score =  101 bits (242), Expect = 1e-20
 Identities = 68/184 (36%), Positives = 91/184 (49%), Gaps = 3/184 (1%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
           K KH R Y S  E   RL++FR++L     +  AN   T  V   +D T +E    R R 
Sbjct: 42  KQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEF---RSRY 98

Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
           ++G +        ++  V+   V  P   DWR  GAVT VKDQ  CGSCW+F  +G VE 
Sbjct: 99  HNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVEC 158

Query: 362 ALFLHNGGH-LVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI--KRHGLPTEEDYGGYL 532
             FL   GH L  LS+Q L+ C     ++GC GG    A+EWI  + +G    ED   Y 
Sbjct: 159 QWFL--AGHPLTNLSEQMLVSCD--KTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYA 214

Query: 533 GQDG 544
             +G
Sbjct: 215 SGEG 218


>UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate
           cathepsin L; n=4; Danio rerio|Rep: Novel protein similar
           to vertebrate cathepsin L - Danio rerio (Zebrafish)
           (Brachydanio rerio)
          Length = 334

 Score =  101 bits (241), Expect = 2e-20
 Identities = 62/179 (34%), Positives = 90/179 (50%), Gaps = 6/179 (3%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRA-NRG---FTMSVNHLADRTDDELAAL 169
           K KH+  Y  + E   R  I+  +++ I  NN   + G   F M++N   D T  E   L
Sbjct: 30  KKKHEISYDEESEDVHRKTIWETNMQKIWKNNNDFSFGLSMFKMAMNKYGDLTSVEYKRL 89

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKL--PPEHDWRLFGAVTPVKDQSVCGSCWSFGT 343
            G +  G          + +++  L+ K       D+R  G VT VKDQ  CGSCWSF T
Sbjct: 90  LGSKIKGTGNR--KGKITSAQMLRLNAKRLGVTNIDYRAKGYVTEVKDQGYCGSCWSFST 147

Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
            GA+EG ++ H  G LV LS+Q L+DCS  +G  GC G     AY+++  + L + + Y
Sbjct: 148 TGAIEGQMYKHT-GRLVSLSEQQLVDCSRSYGTYGCSGAWMANAYDYVINNALESSDTY 205


>UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza
           sativa|Rep: Os09g0497500 protein - Oryza sativa subsp.
           japonica (Rice)
          Length = 349

 Score =  100 bits (239), Expect = 3e-20
 Identities = 64/190 (33%), Positives = 97/190 (51%), Gaps = 8/190 (4%)
 Frame = +2

Query: 5   VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRY 184
           ++H R Y    E ++R  ++R+++  + + N  + G+ ++ N  AD T++E  A    + 
Sbjct: 36  IRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNGYKLADNKFADLTNEEFRA----KM 91

Query: 185 SGPSPHGLPFPYSKSRVEELSVK-------LPPEHDWRLFGAVTPVKDQSVCGSCWSFGT 343
            G  PH      S +   ++++        LP   DWR  GAV  VK+Q  CGSCW+F  
Sbjct: 92  LGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDCGSCWAFSA 151

Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDY 520
           V A+EG   + N G LV LS+Q L+DC       GC GG    A+E+ +  HGL TE  Y
Sbjct: 152 VAAIEGINQIKN-GELVSLSEQELVDCD--DEAVGCGGGYMSWAFEFVVGNHGLTTEASY 208

Query: 521 GGYLGQDGYC 550
             Y   +G C
Sbjct: 209 -PYHAANGAC 217


>UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase
           precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin
           L-like cysteine proteinase precursor - Acanthoscelides
           obtectus (Bean weevil)
          Length = 321

 Score =  100 bits (239), Expect = 3e-20
 Identities = 61/156 (39%), Positives = 84/156 (53%), Gaps = 5/156 (3%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRG---FTMSVNHLADRTDDELAAL 169
           K++H R Y + LE ++R  IF+ +LR I  +N R + G   F M +N   D T +E    
Sbjct: 27  KIQHGRTYRTLLEEKRRFEIFKFNLRTIEEHNERYHNGEETFEMGINQFGDMTQEEFK-- 84

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
             R  +   P  +P P       +    +P   DWR  GAVT VK Q  CGSCW+F  VG
Sbjct: 85  --RMLALQKPQ-MPLPRGDEVSFDNVNDIPKTVDWREKGAVTEVKKQGNCGSCWAFSAVG 141

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSW-GFGNNGCD 454
           ++EG +FL NG  L  LS Q L+DC+   +GN GC+
Sbjct: 142 SIEGQVFLKNGS-LESLSAQNLVDCAGIEYGNFGCE 176


>UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin L
           preproprotein; n=1; Monodelphis domestica|Rep:
           PREDICTED: similar to cathepsin L preproprotein -
           Monodelphis domestica
          Length = 356

 Score =   99 bits (238), Expect = 4e-20
 Identities = 61/177 (34%), Positives = 94/177 (53%), Gaps = 4/177 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNR----ANRGFTMSVNHLADRTDDELAAL 169
           K  + + Y S+ E   R  ++ ++L+ I+ +NR      + + M +N   D TD E  + 
Sbjct: 33  KTTYGKNY-SEKEESFRRQVWEKNLKLINDHNRLFKEGKKSYFMGMNQFGDMTDKEFESR 91

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
              R +   P      Y+  R   +  +LP   DWR  G VTP+++Q  CG+CW+F T+G
Sbjct: 92  LNLRIA---PVRTRRNYTFKR--RIYYRLPKSVDWRTHGYVTPIRNQGECGACWAFSTIG 146

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
           ++EG LF    G LV LS+Q LIDCS   G   C GG    A ++I+R+G+ +E  Y
Sbjct: 147 SLEGQLF-RKTGRLVELSKQMLIDCS---GYYTCMGGSLTGALDFIRRYGVVSERCY 199


>UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted,
           possible transmembrane domain near N-terminus; n=4;
           Cryptosporidium|Rep: Cryptopain-cysteine proteinase
           secreted, possible transmembrane domain near N-terminus
           - Cryptosporidium parvum Iowa II
          Length = 401

 Score =   99 bits (238), Expect = 4e-20
 Identities = 57/186 (30%), Positives = 89/186 (47%), Gaps = 3/186 (1%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
           K K+ + Y+S  E  +R  I++Q++ +I + N     + + +N   D + +E  A     
Sbjct: 90  KKKYHKVYSSMEEENQRFEIYKQNMNFIKTTNSQGFSYVLEMNEFGDLSKEEFMARFTGY 149

Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEH--DWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
                     F  S+    E   +  P +  +W   G V P+++Q  CGSCW+F  V A+
Sbjct: 150 IKDSKDDERVFKSSRVSASESEEEFVPPNSINWVEAGCVNPIRNQKNCGSCWAFSAVAAL 209

Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYL 532
           EGA        L  LS+Q  +DCS   GN GCDGG    A+++ IK   L T +DY  Y 
Sbjct: 210 EGATCAQTNRGLPSLSEQQFVDCSKQNGNFGCDGGTMGLAFQYAIKNKYLCTNDDY-PYF 268

Query: 533 GQDGYC 550
            ++  C
Sbjct: 269 AEEKTC 274


>UniRef50_Q23EG5 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 429

 Score =   99 bits (238), Expect = 4e-20
 Identities = 61/178 (34%), Positives = 94/178 (52%), Gaps = 7/178 (3%)
 Frame = +2

Query: 38  EHEKRLNIFRQSLRYIHSNN-RANRGFTMSVNHLADRTDDELAALRG-RRYSGPSPHGLP 211
           ++ +R  +F++ +  I  +N   N+ +T  ++     T++E++ L+G +  S  +     
Sbjct: 53  QNSERFQLFKKRVAKIAEHNLNPNKKYTQKISKFTFYTNEEISKLKGSQNCSATAKENTR 112

Query: 212 FPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSV----CGSCWSFGTVGAVEGALFLHN 379
                 +  +LS ++P   DWR  G V+ VKDQ      CGSCW+F   GA+E  L L  
Sbjct: 113 I----LQTYDLS-EIPDYVDWREKGIVSSVKDQDAVGDDCGSCWTFSATGAIESHLALKT 167

Query: 380 GGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIK-RHGLPTEEDYGGYLGQDGYC 550
           G     LSQQ L+DC+  F N GCDGG   RA+E+I    G+ +  DY  Y G+DG C
Sbjct: 168 GKAPFNLSQQQLVDCAGKFDNQGCDGGLPSRAFEYIAYAGGIESSRDY-PYKGKDGKC 224


>UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum
           aestivum|Rep: Thiol protease - Triticum aestivum (Wheat)
          Length = 374

 Score = 99.5 bits (237), Expect = 5e-20
 Identities = 64/188 (34%), Positives = 91/188 (48%), Gaps = 7/188 (3%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANR-GFTMSVNHLADRTDDELAALRGRRY 184
           KH + YA   E  +R +IFR+++ +I + NR  R  +T+ VN  AD T +E  A    R 
Sbjct: 56  KHGKSYAGVEEKLRRFDIFRRNVEFIEAANRDGRLSYTLGVNQFADLTHEEFLATHTSRR 115

Query: 185 SGPSPHGLPFPYSKSRVEELSVK-----LPPEHDWRLFGAVTPVKDQS-VCGSCWSFGTV 346
             PS   +    +   VE  + +     +P   +W     VTPVK+Q  VCG+CW+F  V
Sbjct: 116 VVPSEEMVITTRAGVVVEGANCQPAPNAVPRSINWVNQSKVTPVKNQGKVCGACWAFSAV 175

Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGG 526
             +E A  +   G    LS+Q LIDC     + GC  GE + AY W+ R+G         
Sbjct: 176 ATIESAYAIAKRGEPPVLSEQELIDCD--TFDRGCTSGEMYNAYFWVLRNGGIANSSTYP 233

Query: 527 YLGQDGYC 550
           Y   DG C
Sbjct: 234 YKETDGKC 241


>UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gambiae
           str. PEST|Rep: ENSANGP00000018713 - Anopheles gambiae
           str. PEST
          Length = 559

 Score = 99.5 bits (237), Expect = 5e-20
 Identities = 73/184 (39%), Positives = 86/184 (46%), Gaps = 3/184 (1%)
 Frame = +2

Query: 11  HQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTM-SVNHLADRTDDELAALRGRRYS 187
           H+RQYAS +EHE R NIFR +L  I   N+  RG     V   AD T  E  A  G    
Sbjct: 256 HRRQYASSMEHEMRFNIFRNNLFKIEQLNKFERGTAKYGVTKFADMTVAEYRAHTGLVVP 315

Query: 188 GPSPHGLPFPYSKSRVEELSV-KLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
                        S  +   V  LP   DWR  GAVT VK+Q  CGSCW+F  VG VEG 
Sbjct: 316 KHDRANHVGNRVASEEDVAGVGDLPRSFDWRDHGAVTEVKNQGSCGSCWAFSAVGNVEG- 374

Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKR-HGLPTEEDYGGYLGQD 541
           L       L   S+Q LIDC     +NGC GG    A++ I++  GL  E DY       
Sbjct: 375 LHQIKTKKLESYSEQELIDCD--KVDNGCGGGYMDDAFKAIEQLGGLELENDYPYEAKAQ 432

Query: 542 GYCH 553
             CH
Sbjct: 433 KSCH 436


>UniRef50_Q24HI2 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 356

 Score = 99.1 bits (236), Expect = 7e-20
 Identities = 56/167 (33%), Positives = 90/167 (53%), Gaps = 4/167 (2%)
 Frame = +2

Query: 62  FRQSLRYIHSNNR-ANRGFTMSVNH-LADRTDDELAA--LRGRRYSGPSPHGLPFPYSKS 229
           F++S+R +  +N+  N  +T+S++   A  +D++     L  +  S  +   L  P    
Sbjct: 61  FKESVRRVREHNKKVNATYTLSIDSPFAFMSDEQFVTEYLGSQDCSATAELTLKKPMKIQ 120

Query: 230 RVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQ 409
             +  +V++P   +W+    V+PVKDQ  CGSCW+F T GA+E    +        LS+Q
Sbjct: 121 NKK--NVQVPESINWKDLNKVSPVKDQQNCGSCWTFSTTGAIESHYAIFEDVEPTSLSEQ 178

Query: 410 ALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYC 550
            LIDC+  F NNGC GG   +A+E+IK +G  + E+   Y+ QD  C
Sbjct: 179 QLIDCAGAFNNNGCSGGLPSQAFEYIKYNGGISYENSYYYIAQDQEC 225


>UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19;
           Bilateria|Rep: Cathepsin F precursor - Homo sapiens
           (Human)
          Length = 484

 Score = 99.1 bits (236), Expect = 7e-20
 Identities = 70/185 (37%), Positives = 92/185 (49%), Gaps = 2/185 (1%)
 Frame = +2

Query: 5   VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTM-SVNHLADRTDDELAALRGRR 181
           + + R Y S  E   RL++F  ++         +RG     V   +D T++E   +    
Sbjct: 192 ITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNT 251

Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
                P G     +KS V +L+   PPE DWR  GAVT VKDQ +CGSCW+F   G VEG
Sbjct: 252 LLRKEP-GNKMKQAKS-VGDLA---PPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEG 306

Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKR-HGLPTEEDYGGYLGQ 538
             FL N G L+ LS+Q L+DC     +  C GG    AY  IK   GL TE+DY  Y G 
Sbjct: 307 QWFL-NQGTLLSLSEQELLDCD--KMDKACMGGLPSNAYSAIKNLGGLETEDDY-SYQGH 362

Query: 539 DGYCH 553
              C+
Sbjct: 363 MQSCN 367


>UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep:
           Cathepsin K - Danio rerio (Zebrafish) (Brachydanio
           rerio)
          Length = 333

 Score = 98.7 bits (235), Expect = 9e-20
 Identities = 63/188 (33%), Positives = 98/188 (52%), Gaps = 5/188 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANR----GFTMSVNHLADRTDDELAAL 169
           K+ H+R+Y    E   R  I+ +++ +I ++N+        + + +NH  D T +E+A  
Sbjct: 34  KITHKREYNGLNEESIRRTIWEKNMLFIEAHNKEYELGIHTYDLGMNHFGDMTLEEVA-- 91

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
              +  G        P +    ++   KLP   D+R  G VT VK+Q  CGSCW+F +VG
Sbjct: 92  --EKVMGLQMPMYRDPANTFVPDDRVGKLPKSIDYRKLGYVTSVKNQGSCGSCWAFSSVG 149

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGG 526
           A+EG L +   G LV LS Q L+DC     N+GC GG    A+ ++  + G+ +EE Y  
Sbjct: 150 ALEGQL-MKTKGQLVDLSPQNLVDCV--TENDGCGGGYMTNAFRYVSNNQGIDSEESY-P 205

Query: 527 YLGQDGYC 550
           Y+G D  C
Sbjct: 206 YVGTDQQC 213


>UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine
           protease; n=1; Strongylocentrotus purpuratus|Rep:
           PREDICTED: similar to cysteine protease -
           Strongylocentrotus purpuratus
          Length = 494

 Score = 98.3 bits (234), Expect = 1e-19
 Identities = 70/191 (36%), Positives = 98/191 (51%), Gaps = 2/191 (1%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTM-SVNHLADRTDDELAALRGRRY 184
           +  RQ     E+E R ++F Q++  +   N+  +G         AD T+ E   L+    
Sbjct: 165 REYRQNDGTNEYEYRYSVFVQNMLTVEMFNQFEQGTAKYGPTKFADMTEAEFRKLQ---- 220

Query: 185 SGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
           SGP    L     K +       +P E+DWR  GAVTPVK+Q +CGSCW+F  +G +EG 
Sbjct: 221 SGP----LKKTGIKKQAAIPQGPVPEEYDWRTHGAVTPVKNQGMCGSCWAFSAIGNMEGQ 276

Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYE-WIKRHGLPTEEDYGGYLGQD 541
             +   G L+ LS+Q L+DC    G  GC+GGE   AYE  IK  G  +EE Y  Y G++
Sbjct: 277 WQIKK-GELISLSEQELVDCDKVDG--GCEGGEMSDAYEAIIKLGGAMSEEKY-PYRGEN 332

Query: 542 GYCHVDNVTAV 574
             C   N+T V
Sbjct: 333 EKCKF-NMTDV 342


>UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101,
           whole genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_101,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 306

 Score = 98.3 bits (234), Expect = 1e-19
 Identities = 58/173 (33%), Positives = 86/173 (49%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
           K K+Q +Y S  E E R  IF+Q+  Y    N     +T+ +N  A  TD+E   +    
Sbjct: 34  KQKYQTRYTSQFEDEYRFEIFKQNYNYYQEVNSRQSSYTLGINQFATLTDEEFEQI---- 89

Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
           Y G +    P    +S ++  S+ LP   DW     + PVK+Q  CGS WSF  VGA E 
Sbjct: 90  YLGRADSS-PIEIDES-ID--SINLPESVDWS--SKMNPVKNQGTCGSGWSFSAVGAFEA 143

Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
                 G H  + S+Q L+DC     ++GCDGG   +A +++ ++G   E +Y
Sbjct: 144 FFIFVKGTHF-QYSEQNLVDCD--TNSHGCDGGYPAKAIDYLNKNGAFLESEY 193


>UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza
           sativa|Rep: Os09g0381400 protein - Oryza sativa subsp.
           japonica (Rice)
          Length = 362

 Score = 97.9 bits (233), Expect = 2e-19
 Identities = 66/188 (35%), Positives = 97/188 (51%), Gaps = 7/188 (3%)
 Frame = +2

Query: 11  HQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRGFTMSVNHLADRTDDELAALRGRRYS 187
           H R Y S  E  +R +++R++  +I + N R +  + ++ N  AD T++E  A     Y+
Sbjct: 58  HNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEEEFLATYTGYYA 117

Query: 188 GPSPHGLPFPYSKSRVEELS----VKLPPEHDWRLFGAVTPVKDQ-SVCGSCWSFGTVGA 352
           G  P       + +   + S    V +P   DWR  GAV P K Q S C SCW+F T   
Sbjct: 118 GDGPVDDSVITTGAGDVDASFSYRVDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAAT 177

Query: 353 VEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGY 529
           +E +L +   G LV LS+Q L+DC    G  GC+ G   RAY+W ++  GL TE DY  Y
Sbjct: 178 IE-SLNMIKTGKLVSLSEQQLVDCDSYDG--GCNLGSYGRAYKWVVENGGLTTEADY-PY 233

Query: 530 LGQDGYCH 553
             + G C+
Sbjct: 234 TARRGPCN 241


>UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1;
           Brugia malayi|Rep: Cathepsin F-like cysteine proteinase
           - Brugia malayi (Filarial nematode worm)
          Length = 461

 Score = 97.9 bits (233), Expect = 2e-19
 Identities = 63/195 (32%), Positives = 93/195 (47%), Gaps = 3/195 (1%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTM-SVNHLADRTDDELAA--LRGR 178
           K +R+Y+S  E   R  I+ Q++ +        +G  +      +D T +E     L   
Sbjct: 165 KFKREYSSIEEQLDRFRIYLQNMNFAKKLQFEEKGTAIYGATKFSDMTAEEFQKIMLPSI 224

Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
            +     +G+ F  +   +   +  LP + DWR  G VTPVKDQ  CGSCW+F   G +E
Sbjct: 225 WWDRVESNGITFNLNDFNLSIYN--LPSKFDWRTEGVVTPVKDQGSCGSCWAFSVTGNIE 282

Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQ 538
            +L+    G L+ LS+Q LIDC     + GC+GG    A+  IKR G    ED   Y  +
Sbjct: 283 -SLWAIKTGKLISLSEQELIDCD--VIDKGCNGGLPINAFREIKRMGGLEPEDQYPYEAK 339

Query: 539 DGYCHVDNVTAVTSI 583
           +G CH+       SI
Sbjct: 340 NGTCHLVRAQIAVSI 354


>UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase;
           n=21; Bilateria|Rep: Cathepsin L-like cysteine
           proteinase - Globodera pallida
          Length = 379

 Score = 97.9 bits (233), Expect = 2e-19
 Identities = 70/182 (38%), Positives = 102/182 (56%), Gaps = 9/182 (4%)
 Frame = +2

Query: 2   KVKHQRQ-YAS-DLEHEKRLNIFRQSLRYIHSNNRAN-RG---FTMSVNHLADRTDDELA 163
           K KH R+ YA  D+E+E+ L  +  + ++I  +N+A   G   F +  NH+AD    E  
Sbjct: 74  KQKHGRKAYADQDVENERMLT-YLSAKQFIDKHNQAYIEGKVTFRVGENHIADLPFSEYK 132

Query: 164 ALRG-RRYSGPSPHGLPFPYSKSRVEELSV-KLPPEHDWRLFGAVTPVKDQSVCGSCWSF 337
            L G RR  G +        + + +  ++V  LP   DWR  G VT VK+Q +CGSCW+F
Sbjct: 133 KLNGYRRLLGDNLRR----NASTFLAPMNVGDLPESVDWRDKGWVTEVKNQGMCGSCWAF 188

Query: 338 GTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIK-RHGLPTEE 514
            + GA+E A      G L+ LS+Q LIDCS  +GN GC+GG    A+++IK  +G+  E 
Sbjct: 189 SSTGALE-AQHARQTGQLISLSEQNLIDCSKKYGNMGCNGGIMDNAFQYIKDNNGVDKEL 247

Query: 515 DY 520
           DY
Sbjct: 248 DY 249


>UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=17; Trichomonas vaginalis|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 318

 Score = 97.9 bits (233), Expect = 2e-19
 Identities = 60/183 (32%), Positives = 92/183 (50%), Gaps = 3/183 (1%)
 Frame = +2

Query: 38  EHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFP 217
           E+  RL ++  + R +  +NRAN G+ +++NHL+  T  E   L G + +          
Sbjct: 37  EYHFRLGVYNTNKRRVQEHNRANSGYQLTMNHLSCMTPSEYKVLLGHKQTKKI------- 89

Query: 218 YSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVR 397
             +   +     +P   DWR    V P+KDQ+ CGSCW+F  V A E    L   G L+ 
Sbjct: 90  --EGEAKIFKGDVPDAVDWRNAKIVNPIKDQAQCGSCWAFSVVQAQESQWALKK-GQLLS 146

Query: 398 LSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH--GL-PTEEDYGGYLGQDGYCHVDNVT 568
           L++Q ++DC       GCDGG+++ AY+++ +H  GL   E DY  Y  +DG C      
Sbjct: 147 LAEQNMVDCV--DTCYGCDGGDEYLAYDYVIKHQKGLWMLETDY-PYTARDGSCKFKAAK 203

Query: 569 AVT 577
            VT
Sbjct: 204 GVT 206


>UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella
           histriomuscorum|Rep: Cathepsin H - Oxytricha trifallax
           (Sterkiella histriomuscorum)
          Length = 366

 Score = 97.5 bits (232), Expect = 2e-19
 Identities = 57/159 (35%), Positives = 79/159 (49%), Gaps = 1/159 (0%)
 Frame = +2

Query: 83  IHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPP 262
           I  N+     +   +N  +D TD+E        Y+  +         KS     +  +P 
Sbjct: 83  IKHNSDGTNTYKKGLNAFSDMTDEEFFDY----YNIKAEQNCSATNRKS-FGNSNANIPT 137

Query: 263 EHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGN 442
           E DWR FG V+PVK+Q  CGSCW+F TVG VE    L  G     LS+Q L+DC+  + N
Sbjct: 138 EWDWRTFGVVSPVKNQGKCGSCWTFSTVGCVESHYLLKYGA-FRNLSEQQLVDCAGDYDN 196

Query: 443 NGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLGQDGYCHV 556
           +GC GG    A+E+IK + GL  E  Y  Y   +G C +
Sbjct: 197 HGCSGGLPSHAFEYIKDNGGLALETTY-PYKAANGQCSI 234


>UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2;
           Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio
           molitor (Yellow mealworm)
          Length = 336

 Score = 97.5 bits (232), Expect = 2e-19
 Identities = 62/193 (32%), Positives = 90/193 (46%), Gaps = 7/193 (3%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANR----GFTMSVNHLADRTDDELAAL 169
           K  + R Y +  E   R  IF++ L     +N   R     +T+ VN   D T +E+ A 
Sbjct: 31  KTTYARSYVNAKEETFRKQIFQKKLETFEEHNEKYRQGLVSYTLGVNLFTDMTPEEMKAY 90

Query: 170 RGRRYSGPSPH--GLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGT 343
                     H  G+P    +      SV+ P   DWR  G V+PVK+Q  CGSCW+F +
Sbjct: 91  THGLIMPADLHKNGIPIKTREDLGLNASVRYPASFDWRDQGMVSPVKNQGSCGSCWAFSS 150

Query: 344 VGAVEGALFLHNG-GHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
            GA+E  + + NG G+   +S+Q L+DC       GC GG    A+ ++ ++G    E  
Sbjct: 151 TGAIESQMKIANGAGYDSSVSEQQLVDCV--PNALGCSGGWMNDAFTYVAQNGGIDSEGA 208

Query: 521 GGYLGQDGYCHVD 559
             Y   DG CH D
Sbjct: 209 YPYEMADGNCHYD 221


>UniRef50_Q23RT7 Cluster: Papain family cysteine protease containing
           protein; n=7; Hymenostomatida|Rep: Papain family
           cysteine protease containing protein - Tetrahymena
           thermophila SB210
          Length = 387

 Score = 96.7 bits (230), Expect = 4e-19
 Identities = 63/185 (34%), Positives = 88/185 (47%), Gaps = 11/185 (5%)
 Frame = +2

Query: 38  EHEKRLNIFRQSLRYIHS-NNRANRGFTMSVNHLADRTDDELAALR---GRRYSGPSPHG 205
           E+ +R  IF Q L+ I + N+ +  G+   +N   DRT +EL        +     +   
Sbjct: 57  EYNQRKRIFEQKLKEIKAFNSNSENGYKKGINQFTDRTAEELRETTLGYSKTVKNAANKQ 116

Query: 206 LPFPYSKSRVEELSVK-LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNG 382
             F   K+  ++++VK LP   DWR  G VTPVKDQ  CGSCW+F T   +E    +   
Sbjct: 117 NMFRNLKTS-DKINVKDLPKSVDWRDAGVVTPVKDQGHCGSCWAFATTAVIESYAAIAT- 174

Query: 383 GHLVRLSQQALIDCSWGF----GNNGCDGGEDFRAYEWIKRHGLPTE--EDYGGYLGQDG 544
           G L  LS Q L+ C        G  GC+G     AY +++  GL +E    Y  Y GQ G
Sbjct: 175 GQLKTLSTQQLVSCVQNSYQCGGQGGCNGAVSELAYNYVQLFGLTSEYKYSYSSYQGQTG 234

Query: 545 YCHVD 559
            C  D
Sbjct: 235 NCTFD 239


>UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicatein
           a3 - Lubomirskia baicalensis
          Length = 344

 Score = 96.7 bits (230), Expect = 4e-19
 Identities = 68/198 (34%), Positives = 100/198 (50%), Gaps = 5/198 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYI-HSNNRANR-GFTMSVNHLADRTDDELAALRG 175
           K  HQR Y S L+  +R +I+  + +YI H N  A+  G+T+++N   D    E    R 
Sbjct: 48  KGHHQRSYESQLQEMERHSIWVANKKYIEHHNANADLFGYTLAMNGFGDLMSAEFTE-RY 106

Query: 176 RRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
             +      GL    S        V      DWR  G VT V+ Q  CGS ++F   GA+
Sbjct: 107 LTHKHSQRSGLQTFESPK-----GVTYADSLDWRTRGVVTSVQSQGQCGSSYAFAAAGAL 161

Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYL 532
           EGA  L     LV LS+Q +IDCS  +GN+GC GG+ + A+++ +   G+ TE  Y  Y 
Sbjct: 162 EGATALA-ADKLVALSEQNIIDCSVPYGNHGCSGGDVYTAFKYVVDNGGIDTESSY-PYK 219

Query: 533 GQDGYCHVD--NVTAVTS 580
           G+   C  +  NV A+++
Sbjct: 220 GKKSSCQYNSKNVGAIST 237


>UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep:
           Cathepsin L - Kudoa thyrsites
          Length = 300

 Score = 96.7 bits (230), Expect = 4e-19
 Identities = 63/188 (33%), Positives = 101/188 (53%), Gaps = 5/188 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSV-NHLADRTDDELAA---L 169
           K++H   + S  E  +RL  F+++ ++IH+ N  N  +     NHL+  + +E  A   L
Sbjct: 15  KLEHNIIFDSIEEERRRLCNFKENHQFIHNFNLHNTHYHYCRHNHLSHWSHEEYMAWLTL 74

Query: 170 RGRRYSGPSP-HGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
           + +     +P HG+  P  ++  +++   LP   DW+  G VT VK+Q  CGSCWSF   
Sbjct: 75  KPKLPVVSTPTHGIT-P-KETATKDIKSTLPSSVDWKALGKVTSVKNQGHCGSCWSFSAA 132

Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGG 526
           GA+E A  +   G LV  S+Q L+DCS    N+GC+GG    A+ ++  +G+   +DY  
Sbjct: 133 GAIESAYAIKT-GELVNFSEQQLVDCS--TENHGCNGGLPEIAFLYVINNGIMKLKDY-P 188

Query: 527 YLGQDGYC 550
           Y  + G C
Sbjct: 189 YTAKQGTC 196


>UniRef50_P25774 Cluster: Cathepsin S precursor; n=78;
           Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens
           (Human)
          Length = 331

 Score = 96.7 bits (230), Expect = 4e-19
 Identities = 63/194 (32%), Positives = 97/194 (50%), Gaps = 7/194 (3%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRAN----RGFTMSVNHLADRTDDELAAL 169
           K  + +QY    E   R  I+ ++L+++  +N  +      + + +NHL D T +E+ +L
Sbjct: 32  KKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSL 91

Query: 170 RGR-RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
               R        + +  + +R+      LP   DWR  G VT VK Q  CG+CW+F  V
Sbjct: 92  MSSLRVPSQWQRNITYKSNPNRI------LPDSVDWREKGCVTEVKYQGSCGACWAFSAV 145

Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSW-GFGNNGCDGGEDFRAYEW-IKRHGLPTEEDY 520
           GA+E  L L   G LV LS Q L+DCS   +GN GC+GG    A+++ I   G+ ++  Y
Sbjct: 146 GALEAQLKLKT-GKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASY 204

Query: 521 GGYLGQDGYCHVDN 562
             Y   D  C  D+
Sbjct: 205 -PYKAMDQKCQYDS 217


>UniRef50_O16454 Cluster: Temporarily assigned gene name protein
           196; n=4; Bilateria|Rep: Temporarily assigned gene name
           protein 196 - Caenorhabditis elegans
          Length = 477

 Score = 96.3 bits (229), Expect = 5e-19
 Identities = 64/189 (33%), Positives = 93/189 (49%), Gaps = 6/189 (3%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTM-SVNHLADRTDDELAALRGRRY 184
           +H+++Y +  E  KR  +F+++ + I    +  +G  +      +D T  E   +    Y
Sbjct: 180 RHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFKKIM-LPY 238

Query: 185 SGPSPHGLPFPYSKSRVEELSVK-----LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
               P    +P  ++  E+  V      LP   DWR  GAVT VK+Q  CGSCW+F T G
Sbjct: 239 QWEQP---VYPMEQANFEKHDVTINEEDLPESFDWREKGAVTQVKNQGNCGSCWAFSTTG 295

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGY 529
            VEGA F+     LV LS+Q L+DC     + GC+GG    AY+ I R G    ED   Y
Sbjct: 296 NVEGAWFIAK-NKLVSLSEQELVDCD--SMDQGCNGGLPSNAYKEIIRMGGLEPEDAYPY 352

Query: 530 LGQDGYCHV 556
            G+   CH+
Sbjct: 353 DGRGETCHL 361


>UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_23,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 321

 Score = 95.9 bits (228), Expect = 6e-19
 Identities = 58/181 (32%), Positives = 87/181 (48%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
           K+ ++Y +  E   R +I++Q++  I   N  N  +   +N   D TD E   +    Y 
Sbjct: 44  KYNKRYPTQNEQIYRFSIYQQNIMKIEDFNSQNNSYKQKINKFGDLTDQEFLTI----YL 99

Query: 188 GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGAL 367
                 +P      +  E    +  E DW   G V  +KDQ  CGSCW+F  VGA+E   
Sbjct: 100 NLQ---MPARVKNIQKNEEPFLVQEEVDWVQKGKVPAIKDQGDCGSCWAFSAVGALEINT 156

Query: 368 FLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGY 547
            +     +V LS+Q L+DC+  +GN GCDGG    A ++I   G+   + Y  Y G+DG 
Sbjct: 157 KI-QFNEIVDLSEQDLVDCAGPYGNAGCDGGWMESALDYIIDSGIAETKVY-PYKGEDGI 214

Query: 548 C 550
           C
Sbjct: 215 C 215


>UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis
           zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa
           zea single nucleocapsid nuclear polyhedrosis virus)
          Length = 367

 Score = 95.9 bits (228), Expect = 6e-19
 Identities = 63/200 (31%), Positives = 101/200 (50%), Gaps = 15/200 (7%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRAN------------RGFTMSVNHLADRTD 151
           ++ + Y    E++ R N+F+ +L  I+S NR N                  VN  +D+T 
Sbjct: 63  QYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNKFSDKTP 122

Query: 152 DELAALRGRRYSGPSPHGLPFPYSKSRVEELS--VKLPPEHDWRLFGAVTPVKDQSVCGS 325
           DE+       +   S H   +   ++R+ + +  ++LP  +DWR    VTP+KDQ VCGS
Sbjct: 123 DEVLHSNTGFFLNLSQH---YTLCENRIVKGAPDIRLPDYYDWRDTNKVTPIKDQGVCGS 179

Query: 326 CWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAY-EWIKRHGL 502
           CW+F  +G +E    + +   L+ LS+Q L+DC     + GC+GG    A+ E +   G+
Sbjct: 180 CWAFVAIGNIESQYAIRH-NKLIDLSEQQLLDCD--EVDLGCNGGLMHLAFQELLLMGGV 236

Query: 503 PTEEDYGGYLGQDGYCHVDN 562
            TE DY  Y G +  C +DN
Sbjct: 237 ETEADY-PYQGSEQMCTLDN 255


>UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin S
           preproprotein; n=2; Tribolium castaneum|Rep: PREDICTED:
           similar to cathepsin S preproprotein - Tribolium
           castaneum
          Length = 525

 Score = 95.5 bits (227), Expect = 8e-19
 Identities = 65/188 (34%), Positives = 96/188 (51%), Gaps = 5/188 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYI-HSNNRANRG---FTMSVNHLADRTDDELAAL 169
           K K++R+Y +  E   R  IF ++ + I H N R  +G   + + +N L+D TD+E++  
Sbjct: 229 KRKYERRYPNLEEENFRRAIFEKTFQEIKHHNERYRKGLETYYLRINDLSDYTDEEMSCC 288

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
              +   PS   LP   + SR       LP   DWRL G VTPVK Q  CG+CW+F  +G
Sbjct: 289 -SEKAPKPSITILPNVSTSSRQN-----LPKMVDWRLRGVVTPVKHQGKCGTCWAFAIIG 342

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI-KRHGLPTEEDYGG 526
           A E    +H G  ++ LS+Q L+DC      + C G      Y++I K  G+  ++DY  
Sbjct: 343 ATEAQYRIHRGSFVI-LSEQQLVDCVREV--SSCRGVYLHETYKYIVKSEGINYDQDY-R 398

Query: 527 YLGQDGYC 550
           Y    G C
Sbjct: 399 YQSAPGTC 406



 Score = 74.5 bits (175), Expect = 2e-12
 Identities = 48/121 (39%), Positives = 62/121 (51%), Gaps = 1/121 (0%)
 Frame = +2

Query: 191 PSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALF 370
           P+P  + FP   +R +     LP   DWRL G VTPVK Q  CGSCW+F  +GA E A +
Sbjct: 17  PNPSIVIFPNMSARPQS---DLPDMVDWRLQGVVTPVKRQGKCGSCWAFAILGATE-AHY 72

Query: 371 LHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYLGQDGY 547
               G  V LS+Q L+DC    G   C G      YE+ I  +G+  ++DY  Y    G 
Sbjct: 73  RKQRGSFVILSEQQLVDCVREVGT--CKGVWLDEVYEYIINSNGINYDQDY-RYESAPGS 129

Query: 548 C 550
           C
Sbjct: 130 C 130


>UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 336

 Score = 95.5 bits (227), Expect = 8e-19
 Identities = 61/195 (31%), Positives = 99/195 (50%), Gaps = 8/195 (4%)
 Frame = +2

Query: 11  HQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRGFTMSVNHLADRTDDELAALRGRRYS 187
           ++R Y S  E + R  IF ++   I ++N      + ++ N  +D   +E A+    + S
Sbjct: 39  NKRTYFSLEEQQFRQQIFFETHERIQNHNSNPEATYKLAHNQFSDMPQEEFASRVLMKSS 98

Query: 188 GPSP-HGLPFPYSKSRVEELS---VKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
              P + +    + S  ++ +   V+LP   DWR +G ++ VKDQ  CGSCW+F T G +
Sbjct: 99  QLIPRNAVQAQNNNSTTQQHTAQDVQLPASFDWRDYGILSDVKDQGQCGSCWAFSTTGIL 158

Query: 356 EGALFLHNGGHLVRLSQQALIDC---SWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGG 526
           E   F+ N    +  S+Q L+DC   S GF + GC GG    A +++ + G+  EE Y  
Sbjct: 159 EALYFMEN-RQKISFSEQQLVDCATNSNGFNSYGCSGGWPEEALKYVAKFGILKEEQY-P 216

Query: 527 YLGQDGYCHVDNVTA 571
           YL  D  C V + T+
Sbjct: 217 YLAVDSKCKVSSPTS 231


>UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium
           falciparum|Rep: Falcipain 2 - Plasmodium falciparum
          Length = 484

 Score = 95.5 bits (227), Expect = 8e-19
 Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 10/196 (5%)
 Frame = +2

Query: 11  HQRQYASDLEHEKRLNIFRQSLRYIHS-NNRANRGFTMSVNHLADRTDDELA-ALRGRRY 184
           + +QY S  E ++R  +F Q+   ++  NN  N  +   +N  AD T  E        R 
Sbjct: 172 NNKQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKELNRFADLTYHEFKNKYLSLRS 231

Query: 185 SGPSPHGLPFPYSKSRVEELSVKLPPE-------HDWRLFGAVTPVKDQSVCGSCWSFGT 343
           S P  +   +   +   EE+  K   E       +DWRL   VTPVKDQ  CGSCW+F +
Sbjct: 232 SKPLKNS-KYLLDQMNYEEVIKKYRGEENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSS 290

Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYE-WIKRHGLPTEEDY 520
           +G+VE    +     L+ LS+Q L+DCS  F N GC+GG    A+E  I+  G+  + DY
Sbjct: 291 IGSVESQYAIRK-NKLITLSEQELVDCS--FKNYGCNGGLINNAFEDMIELGGICPDGDY 347

Query: 521 GGYLGQDGYCHVDNVT 568
                    C++D  T
Sbjct: 348 PYVSDAPNLCNIDRCT 363


>UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, whole
           genome shotgun sequence; n=7; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_22,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 350

 Score = 95.5 bits (227), Expect = 8e-19
 Identities = 63/169 (37%), Positives = 85/169 (50%), Gaps = 1/169 (0%)
 Frame = +2

Query: 17  RQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPS 196
           +QY+   E   RL ++  +L  I + N+             D TD+E AA        P 
Sbjct: 71  KQYSGS-ELLYRLQVYEANLADIKARNQKLGREIFGETQFTDLTDEEFAATYLTLKVNPD 129

Query: 197 PHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLH 376
              +P    K++ E  +V   P  DWR  GAV  VKDQ  CGSCW+F T G +EG  +  
Sbjct: 130 DLEVP----KAQFE--NVNATPI-DWRTRGAVNKVKDQGQCGSCWAFSTTGVLEG-FYKV 181

Query: 377 NGGHLVRLSQQALIDCSWGFG-NNGCDGGEDFRAYEWIKRHGLPTEEDY 520
             G L  LS+Q L+DCS     N GCDGG   RA  ++KR+GL T++ Y
Sbjct: 182 QTGELPDLSEQQLVDCSTLIDFNQGCDGGMPSRALNYVKRNGLTTQDAY 230


>UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin
           L-like protease; n=1; Nasonia vitripennis|Rep:
           PREDICTED: similar to cathepsin L-like protease -
           Nasonia vitripennis
          Length = 353

 Score = 95.1 bits (226), Expect = 1e-18
 Identities = 60/190 (31%), Positives = 96/190 (50%), Gaps = 7/190 (3%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANR----GFTMSVNHLADRTDDELAAL 169
           K+++++ Y  D+E   R ++F ++ R I  +N+ +      + + +N   D   +E    
Sbjct: 44  KLRYKKNYNGDVEENFRRSVFHENQRKIAEHNQKHDLGLFTYKVRINQFGDMMFEEYKNY 103

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEH-DWRLFGAVTPVKDQSV-CGSCWSFGT 343
                +         P     ++  S +  PEH DWR  GAVTPV+DQ + CGSCW+F  
Sbjct: 104 M-HAANNTITQLKRIPRGDEFIKPKSAENVPEHVDWRQRGAVTPVRDQGLTCGSCWAFSA 162

Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDY 520
            GA+E A +    G L  LS Q LIDC+  +GN GC GG    ++++ + + GL  E +Y
Sbjct: 163 AGALE-AQYFKKTGVLTALSAQNLIDCTMEYGNLGCGGGSAALSFQFVVDQKGLEPEANY 221

Query: 521 GGYLGQDGYC 550
             Y G+   C
Sbjct: 222 -SYEGRTKEC 230


>UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 344

 Score = 95.1 bits (226), Expect = 1e-18
 Identities = 58/200 (29%), Positives = 94/200 (47%), Gaps = 11/200 (5%)
 Frame = +2

Query: 2   KVKHQRQYA-SDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDEL------ 160
           K  H  +Y  S +E  ++        + I  N+  +  +T+  NHL+D T +E       
Sbjct: 42  KKTHNVKYEDSSIEAYRKAIFLDNHNKIIEHNSDPSHSYTLGHNHLSDMTHEEFSLYQLN 101

Query: 161 -AALRGRRYSGPSPHGLPFPYSKSRVEE-LSVKLPPEHDWRLFGAVTPVKDQSVCGSCWS 334
            A    +   G +  G     S   V+  ++ K  P  DWR   A+TPVK Q  CGSCW+
Sbjct: 102 PARTASKSSKGGNNSGNSSGSSNPYVDPPITTKNAPPMDWRNASAITPVKQQGKCGSCWT 161

Query: 335 FGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFG--NNGCDGGEDFRAYEWIKRHGLPT 508
           F +   +E   F+ NG  L   S+Q ++DC +G G  +NGC+GG    A  +  ++G+  
Sbjct: 162 FASTAVLESFSFIKNGAPLTNFSEQQILDCVYGSGYYSNGCNGGFGSEALNYAIQNGIAP 221

Query: 509 EEDYGGYLGQDGYCHVDNVT 568
              Y  Y+G+   C  ++ +
Sbjct: 222 LSQY-PYVGKQQGCKYNSTS 240


>UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163
           precursor; n=4; Schizophora|Rep: Putative cysteine
           proteinase CG12163 precursor - Drosophila melanogaster
           (Fruit fly)
          Length = 614

 Score = 95.1 bits (226), Expect = 1e-18
 Identities = 67/188 (35%), Positives = 90/188 (47%), Gaps = 4/188 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTM-SVNHLADRTDDELAALRG- 175
           +V+  R+Y S  E + RL IFRQ+L+ I   N    G     +   AD T  E     G 
Sbjct: 312 QVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSEYKERTGL 371

Query: 176 -RRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGA 352
            +R    +  G     S + V     +LP E DWR   AVT VK+Q  CGSCW+F   G 
Sbjct: 372 WQRDEAKATGG-----SAAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCGSCWAFSVTGN 426

Query: 353 VEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKR-HGLPTEEDYGGY 529
           +EG L+    G L   S+Q L+DC     ++ C+GG    AY+ IK   GL  E +Y  Y
Sbjct: 427 IEG-LYAVKTGELKEFSEQELLDCD--TTDSACNGGLMDNAYKAIKDIGGLEYEAEY-PY 482

Query: 530 LGQDGYCH 553
             +   CH
Sbjct: 483 KAKKNQCH 490


>UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep:
           Cysteine protease - Clonorchis sinensis
          Length = 328

 Score = 94.7 bits (225), Expect = 1e-18
 Identities = 65/189 (34%), Positives = 91/189 (48%), Gaps = 3/189 (1%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTM-SVNHLADRTDDELAALRGR 178
           K+K+++ Y++D + E R  IF+ +L          +G     V   +D T +E      R
Sbjct: 36  KLKYKKTYSND-DDELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYLR 94

Query: 179 -RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
            R+ GP     P P     ++        + DWR  GAV PV DQ  CGSCW+F  +G V
Sbjct: 95  MRFDGPIVSEDPSPEEDVTMDN------EKFDWREHGAVGPVLDQGKCGSCWAFSVIGNV 148

Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAY-EWIKRHGLPTEEDYGGYL 532
           EG  F    G L+ LS+Q L+DC       GC+GG   + Y E  K  GL    DY  Y 
Sbjct: 149 EGQWF-RKTGDLLALSEQQLVDCD--HLEKGCNGGYPPKTYGEIEKMGGLELASDY-PYT 204

Query: 533 GQDGYCHVD 559
           G DG C+++
Sbjct: 205 GVDGICYMN 213


>UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease
           containing protein; n=2; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 332

 Score = 94.3 bits (224), Expect = 2e-18
 Identities = 56/182 (30%), Positives = 92/182 (50%), Gaps = 2/182 (1%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRGFTMSVNHLADRTDDELAALRGR 178
           K ++  ++    E + RL +F ++ + I  +N  ++ GF   +N  +  T +E  A    
Sbjct: 43  KNRYNLEFNDIQEEQYRLFVFHENFKQIELDNMNSDNGFISGINKFSHLTKEEFKAKYLN 102

Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
           R   P+          S+ ++   KLP   DWR  GAV+PV+DQ  CGSC++F + GA+E
Sbjct: 103 RPQRPASEMKTNSILSSQ-QKTDEKLPESVDWRKLGAVSPVRDQGNCGSCYAFASTGALE 161

Query: 359 GALFLHNGGHLVRLSQQALIDCS-WGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLG 535
           G L+    G L   S Q ++DC+   F   GC GG     + ++K +G+  E  Y  Y G
Sbjct: 162 G-LYQIKTGKLEVFSPQYIVDCAKHQFSRGGCHGGYSSGVFTFVKENGMNLESRY-PYKG 219

Query: 536 QD 541
           ++
Sbjct: 220 EE 221


>UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27;
           Eukaryota|Rep: Cysteine proteinase GP-I - Zingiber
           officinale (Ginger)
          Length = 221

 Score = 94.3 bits (224), Expect = 2e-18
 Identities = 48/110 (43%), Positives = 65/110 (59%)
 Frame = +2

Query: 254 LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWG 433
           LP   DWR  GAV PVK+Q  CGSCW+F  + AVEG   +   G L+ LS+Q L+DCS  
Sbjct: 3   LPDSIDWREKGAVVPVKNQGGCGSCWAFDAIAAVEGINQIVT-GDLISLSEQQLVDCS-- 59

Query: 434 FGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYCHVDNVTAVTSI 583
             N+GC+GG  +RA+++I  +G    E++  Y G +G C       V SI
Sbjct: 60  TRNHGCEGGWPYRAFQYIINNGGINSEEHYPYTGTNGTCDTKENAHVVSI 109


>UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:
           Viral cathepsin - Xestia c-nigrum granulosis virus
           (XnGV) (Xestia c-nigrum granulovirus)
          Length = 346

 Score = 94.3 bits (224), Expect = 2e-18
 Identities = 67/186 (36%), Positives = 90/186 (48%), Gaps = 4/186 (2%)
 Frame = +2

Query: 5   VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAA-LRGRR 181
           VK+ + Y  D E E R  IF+Q+L  I++ N         +N  AD + +EL   L G +
Sbjct: 48  VKYNKVYKDDQEKEARFEIFKQNLADINARNALEDSAMFEINSRADISSNELLQKLTGLK 107

Query: 182 YS---GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGA 352
            S   G   +    P   S   + S K+P   DWR   +VT VK Q  CGSCW+F  V  
Sbjct: 108 LSLMRGEKKNSFCTPTVISG--DSSGKVPDSFDWRDRNSVTSVKMQKECGSCWAFSAVAN 165

Query: 353 VEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYL 532
           +E    + +   L  LS+Q L+DC     NNGC+GG    A+E I R G  + E    Y 
Sbjct: 166 IESLYHIKHNVSL-DLSEQQLVDCD--KVNNGCNGGLMSWAFEGIIRAGGISYEAPYPYT 222

Query: 533 GQDGYC 550
           G DG C
Sbjct: 223 GVDGVC 228


>UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep:
           LOC443661 protein - Xenopus laevis (African clawed frog)
          Length = 346

 Score = 93.9 bits (223), Expect = 3e-18
 Identities = 60/185 (32%), Positives = 96/185 (51%), Gaps = 5/185 (2%)
 Frame = +2

Query: 11  HQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRG---FTMSVNHLADRTDDEL-AALRG 175
           HQ+ Y    E   R  I+ ++L++I  +N   + G   + + +NHL D T +E+ A + G
Sbjct: 58  HQKIYKDAEEERARRTIWEETLKFITVHNLEYSLGLHTYEVGMNHLGDMTGEEVEATMTG 117

Query: 176 RRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
              S  S   +    ++   + L  + P   DWR  G VT V+ Q  CGSC++F  VGA+
Sbjct: 118 YTSSDDSLANM----TRVPKKLLEAQPPASIDWRTKGCVTSVRRQRKCGSCYAFSAVGAL 173

Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLG 535
           E   +    G LV  S Q L+DCS+  GN GC GG    ++ ++K+ G+  + +Y  Y G
Sbjct: 174 E-CQWKKKKGTLVTFSPQELVDCSYSEGNKGCKGGSIRSSFTYMKKSGVMEDFNY-PYTG 231

Query: 536 QDGYC 550
           ++  C
Sbjct: 232 KEEKC 236


>UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46;
           Eukaryota|Rep: Cysteine proteinase 1 precursor - Zea
           mays (Maize)
          Length = 371

 Score = 93.9 bits (223), Expect = 3e-18
 Identities = 70/202 (34%), Positives = 95/202 (47%), Gaps = 13/202 (6%)
 Frame = +2

Query: 17  RQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSG-- 190
           + Y    EH  RL++F+ +LR    +   +      V   +D T  E      R Y G  
Sbjct: 57  KSYKDADEHAYRLSVFKDNLRRARRHQLLDPSAEHGVTKFSDLTPAEFR----RTYLGLR 112

Query: 191 PSPHGLPFPYSKSRVEELSVK---LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
            S   L     +S  E   +    LP + DWR  GAV PVK+Q  CGSCWSF   GA+EG
Sbjct: 113 KSRRALLRELGESAHEAPVLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEG 172

Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFG-------NNGCDGGEDFRAYEWI-KRHGLPTEED 517
           A +L   G L  LS+Q  +DC            ++GC+GG    A+ ++ K  GL +E+D
Sbjct: 173 AHYLAT-GKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLMTTAFSYLQKAGGLESEKD 231

Query: 518 YGGYLGQDGYCHVDNVTAVTSI 583
           Y  Y G DG C  D    V S+
Sbjct: 232 Y-PYTGSDGKCKFDKSKIVASV 252


>UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|Rep:
           Cathepsin L precursor - Schistosoma mansoni (Blood
           fluke)
          Length = 319

 Score = 93.9 bits (223), Expect = 3e-18
 Identities = 70/188 (37%), Positives = 99/188 (52%), Gaps = 3/188 (1%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTM-SVNHLADRTDDELAALR-G 175
           K+K+++QY  + E E R NIF+ ++          RG  +  V   +D T DE A     
Sbjct: 24  KLKYRKQY-HETEDEIRFNIFKSNILKAQLYQVFVRGSAIYGVTPYSDLTTDEFARTHLT 82

Query: 176 RRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
             +  PS      P S  +  E++  +P   DWR  GAVT VK+Q +CGSCW+F T G V
Sbjct: 83  ASWVVPSSRSNT-PTSLGK--EVN-NIPKNFDWREKGAVTEVKNQGMCGSCWAFSTTGNV 138

Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYE-WIKRHGLPTEEDYGGYL 532
           E   F    G L+ LS+Q L+DC  G  ++GC+GG    AYE  IK  GL  E++Y  Y 
Sbjct: 139 ESQWF-RKTGKLLSLSEQQLVDCD-GL-DDGCNGGLPSNAYESIIKMGGLMLEDNY-PYD 194

Query: 533 GQDGYCHV 556
            ++  CH+
Sbjct: 195 AKNEKCHL 202


>UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia
           theta|Rep: Cathepsin H precursor - Guillardia theta
           (Cryptomonas phi)
          Length = 353

 Score = 93.5 bits (222), Expect = 3e-18
 Identities = 65/192 (33%), Positives = 103/192 (53%), Gaps = 9/192 (4%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHS-NNRANRGFTMSVNHLADRTDDEL--AALRGR 178
           KH + Y  D  + +RL  F  SL+ + + N+R    +  ++N  +D T +E   A L   
Sbjct: 39  KHSKVYEDDTTYLRRLASFCVSLKEVEAINSRPGTTWRAALNQYSDLTWEEFKHAKLMAE 98

Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRL-----FGAVTPVKDQSVCGSCWSFGT 343
           +  G +   +  P  K  + ++ + +  E DWR         V+ VK+Q  CGSCW+F T
Sbjct: 99  QNCGAT---VTTPVEK--LVKMGI-VADEFDWRNQTCGETSCVSMVKNQGTCGSCWTFST 152

Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDY 520
             A+E +L     G +V LS+Q L+DC+  F NNGC+GG   +A+E+I  + GL   E+Y
Sbjct: 153 AAALE-SLHAIKTGEMVLLSEQQLVDCAADFKNNGCNGGLPSQAFEYIMYNGGLSKMEEY 211

Query: 521 GGYLGQDGYCHV 556
             Y+  DG+C+V
Sbjct: 212 -PYVCGDGHCNV 222


>UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_36,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 307

 Score = 93.5 bits (222), Expect = 3e-18
 Identities = 62/184 (33%), Positives = 92/184 (50%), Gaps = 1/184 (0%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRGFTMSVNHLADRTDDELAALRGRRY 184
           K+  ++ +  E   R  IF Q++  I+ +N   N+ ++M+VN  AD TD+E  ++    Y
Sbjct: 34  KNFNKFYTSNEETYRQVIFNQNVELINKHNSNPNKSYSMAVNQFADLTDEEFQSM----Y 89

Query: 185 SGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
            G      P       +E        + DW     + P+K+Q  CGSCW+F  +GAVEG 
Sbjct: 90  LGK-----PTYVKIDNIELSKGNTLGDADWA--SKMNPIKNQGNCGSCWTFSAIGAVEGF 142

Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDG 544
           L +  G   V LS+Q L+DC+   G  GC+GG    A ++I   G   E DY  Y  +DG
Sbjct: 143 LAIRKGFKGV-LSEQQLVDCAVDAG-EGCNGGNSDLALDYIAEVGSVYERDY-EYTAKDG 199

Query: 545 YCHV 556
            C V
Sbjct: 200 VCKV 203


>UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|Rep:
           CG4847-PD, isoform D - Drosophila melanogaster (Fruit
           fly)
          Length = 420

 Score = 93.1 bits (221), Expect = 4e-18
 Identities = 62/155 (40%), Positives = 78/155 (50%), Gaps = 6/155 (3%)
 Frame = +2

Query: 113 FTMSVNHLADRTDDE-LAALRGRRYSGPSPHGLPFPYSKSRVEELSVK-LPPEHDWRLFG 286
           F  +VN  AD T  E L+ L G + S   P       +  ++  L  K +P   DWR  G
Sbjct: 157 FKQAVNAFADLTHSEFLSQLTGLKRS---PEAKARAAASLKLVNLPAKPIPDAFDWREHG 213

Query: 287 AVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCS--WGFGNNGCDGG 460
            VTPVK Q  CGSCW+F T GA+EG  F    G L  LS+Q L+DC     FG NGCDGG
Sbjct: 214 GVTPVKFQGTCGSCWAFATTGAIEGHTF-RKTGSLPNLSEQNLVDCGPVEDFGLNGCDGG 272

Query: 461 EDFRAYEWIK--RHGLPTEEDYGGYLGQDGYCHVD 559
               A+ +I   + G+  E  Y  Y+   G C  D
Sbjct: 273 FQEAAFCFIDEVQKGVSQEGAY-PYIDNKGTCKYD 306


>UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel
           protein - Danio rerio (Zebrafish) (Brachydanio rerio)
          Length = 328

 Score = 92.7 bits (220), Expect = 6e-18
 Identities = 62/189 (32%), Positives = 96/189 (50%), Gaps = 6/189 (3%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRAN----RGFTMSVNHLADRTDDELAAL 169
           K +H + Y +  E   R ++++Q+L+ I  +N A       +T+ +N L+D T DE+  +
Sbjct: 31  KSQHNKTYRNTREERLRRSVWKQNLQDILLHNEAAAVGLHSYTLGLNQLSDMTADEVNDM 90

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVK-LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
            G            FP   +     S++ LP   +W   G V+PV++Q  CGSCW+F  V
Sbjct: 91  NGLLEED-------FPDVNATFSPPSLQTLPQRVNWTEHGMVSPVQNQGPCGSCWAFSAV 143

Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYG 523
           G++E A        LV LS Q L+DCS   GN GC GG   RA+ + I+  G+ +   Y 
Sbjct: 144 GSLE-AQMKRRTAALVPLSAQNLLDCSVSLGNRGCKGGFLSRAFLYVIQNRGIDSSTFY- 201

Query: 524 GYLGQDGYC 550
            Y  ++G C
Sbjct: 202 PYEHKEGVC 210


>UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1;
           Naegleria fowleri|Rep: Cysteine proteinase homolog -
           Naegleria fowleri
          Length = 347

 Score = 92.7 bits (220), Expect = 6e-18
 Identities = 67/204 (32%), Positives = 95/204 (46%), Gaps = 13/204 (6%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
           K+ + Y ++ EH  R  IF+ ++      N   +     +   +D T +E   +   +  
Sbjct: 39  KYAKVYGTE-EHNNRYQIFKANVEKSRYYNHVGKRENFGITKFSDLTPEEFKRMFLMKTY 97

Query: 188 GPSPHG--LPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
            P      L  P      E+     P   DWR  GAVT VK+Q  CGSCW+F T G VEG
Sbjct: 98  TPEEAKKILAAPQHAVLSEKEVQTAPTSFDWRQHGAVTRVKNQGACGSCWTFSTTGNVEG 157

Query: 362 ALFLHNGGHLVRLSQQALIDCSWG---FGN-----NGCDGGEDFRAYEW-IKRHGLPTEE 514
              +   G LV LS+Q L+DC      + N     +GC+GG  + A+++ IK  GL TE+
Sbjct: 158 QWAIKK-GKLVSLSEQQLVDCDHNCVTYQNQQACDSGCNGGLMWSAFQYVIKNGGLDTED 216

Query: 515 DYGGYLGQDGYCHVD--NVTAVTS 580
            Y  Y G D  C  +  NV A  S
Sbjct: 217 SY-PYEGVDDTCRFNKSNVAATIS 239


>UniRef50_Q248G1 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 334

 Score = 92.7 bits (220), Expect = 6e-18
 Identities = 59/182 (32%), Positives = 91/182 (50%), Gaps = 4/182 (2%)
 Frame = +2

Query: 17  RQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPS 196
           R Y S+ E   R  IF ++ R + S+N  N  FT S+N  AD TD+E    + R  +   
Sbjct: 45  RVYNSEEEQFFRQLIFVENKRQVDSHNSQNPTFTQSLNQFADFTDEEF---KYRVLNTKV 101

Query: 197 PHGLPFPYSKSRVEELSVKLPPEHDWR-LFGAVTPVKDQSVCGSCWSFGTVGAVEGALFL 373
               P    +     L  ++P   DWR +   V P+K+Q  CGSCW+F   G VE    L
Sbjct: 102 SQTRPKKGRRLESRVLDQQIPESVDWRNVTNVVGPIKNQGHCGSCWTFSIAGIVESHYVL 161

Query: 374 HNGGHLVRLSQQALIDC---SWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDG 544
            +G + V  ++Q ++DC   S G+ ++GC+GG    A +++  +G+   E Y  Y+   G
Sbjct: 162 KHGSY-VSYAEQEILDCVSVSAGYQSDGCNGGWPEEALQYVIEYGIVKSEVY-PYVAVQG 219

Query: 545 YC 550
            C
Sbjct: 220 KC 221


>UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheirus
           salmonis|Rep: Putative cathepsin L - Lepeophtheirus
           salmonis (salmon louse)
          Length = 257

 Score = 92.7 bits (220), Expect = 6e-18
 Identities = 49/114 (42%), Positives = 64/114 (56%), Gaps = 1/114 (0%)
 Frame = +2

Query: 245 SVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDC 424
           S  +P   +W   GAVT VKDQ  CGSCW+F T G+VEG  F+ N   L+  S+Q L+DC
Sbjct: 35  SAPVPSYVNWTKNGAVTAVKDQKDCGSCWAFSTTGSVEGQYFIKN-KKLLSFSEQQLVDC 93

Query: 425 SWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYLGQDGYCHVDNVTAVTSI 583
           S  F N GC+GG    A+++ I   G+ TE+ Y  Y   DG C  +   A   I
Sbjct: 94  SSDFRNEGCNGGWMDNAFKYLIANKGIATEDTY-PYTATDGVCVYNKTMAAGRI 146


>UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia
           medanensis|Rep: Sui m 1 allergen - Suidasia medanensis
          Length = 336

 Score = 92.7 bits (220), Expect = 6e-18
 Identities = 65/198 (32%), Positives = 99/198 (50%), Gaps = 10/198 (5%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
           K  + +QY ++ E ++R  IF ++LR+I  N+    G  + VN  AD T +E +++    
Sbjct: 32  KELYGKQYTAEEEPQRRA-IFEENLRWIQENH-GKHGAGLEVNEHADLTAEEFSSM---- 85

Query: 182 YSGPSPHG-LPFPYSKSRVE----ELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
           Y+  +    L  P  K  V+    ++SV LP   DWR     T V++Q  CGSCW+F T 
Sbjct: 86  YATLNQEAFLKSPLHKEFVQVPESDISVALPAAFDWRQQWN-TAVRNQGQCGSCWAFATA 144

Query: 347 GAVEGALFLHNGGHLVRLSQQALIDC-----SWGFGNNGCDGGEDFRAYEWIKRHGLPTE 511
             VE    +    H V LS+Q L+DC        + ++GC GG    AY ++++ GL  E
Sbjct: 145 ATVEAQYAIRKNVH-VTLSEQQLVDCDHRPFQGQYEDHGCQGGNPIIAYAYVQQTGLVEE 203

Query: 512 EDYGGYLGQDGYCHVDNV 565
             Y  Y  +DG C    V
Sbjct: 204 SAY-PYQARDGQCQSSTV 220


>UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13;
           Plasmodium|Rep: Cysteine protease falcipain-3 -
           Plasmodium falciparum
          Length = 492

 Score = 92.3 bits (219), Expect = 8e-18
 Identities = 64/187 (34%), Positives = 98/187 (52%), Gaps = 16/187 (8%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNR-ANRGFTMSVNHLADRTDDELAALRGRRY 184
           ++ ++Y +  E +KR  IF ++ R I  +N+  N  +   +N   D + +E  +    +Y
Sbjct: 177 ENNKKYETSEEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKFGDLSPEEFRS----KY 232

Query: 185 SGPSPHGLPF-----PYS-KSRVEELSVKLPPE--------HDWRLFGAVTPVKDQSVCG 322
                HG PF     P S ++  E++  K  P         +DWRL G VTPVKDQ++CG
Sbjct: 233 LNLKTHG-PFKTLSPPVSYEANYEDVIKKYKPADAKLDRIAYDWRLHGGVTPVKDQALCG 291

Query: 323 SCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAY-EWIKRHG 499
           SCW+F +VG+VE    +     L   S+Q L+DCS    NNGC GG    A+ + I   G
Sbjct: 292 SCWAFSSVGSVESQYAIRKKA-LFLFSEQELVDCS--VKNNGCYGGYITNAFDDMIDLGG 348

Query: 500 LPTEEDY 520
           L +++DY
Sbjct: 349 LCSQDDY 355


>UniRef50_Q23TW3 Cluster: Papain family cysteine protease containing
           protein; n=5; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 437

 Score = 92.3 bits (219), Expect = 8e-18
 Identities = 57/184 (30%), Positives = 90/184 (48%), Gaps = 3/184 (1%)
 Frame = +2

Query: 41  HEKRLNIFRQSL-RYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFP 217
           + +R  +F+  L + I  N+  ++ ++  +N L  +TD EL   R  +    +       
Sbjct: 137 NSERFQLFKSRLAKIIEHNSNPDKKYSQIINKLTFQTDLELKKFRASQNCSATAQANTRS 196

Query: 218 YSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSV-CGSCWSFGTVGAVEGALFLHNGGHLV 394
           + K    +LS +LP   DWR  G VT VK Q   CGSCW+F  V A+E    L  G   +
Sbjct: 197 FRKY---DLS-QLPQYVDWREKGVVTQVKSQGKDCGSCWAFAAVAALESHYALKTGKKPI 252

Query: 395 RLSQQALIDCSWGFGNNGCDGGEDFRAYEWIK-RHGLPTEEDYGGYLGQDGYCHVDNVTA 571
           + S+Q L+DC+  F   GC GG   + +E++    G+  E DY  Y G+D  C  ++   
Sbjct: 253 QFSEQQLVDCARKFDTKGCSGGLPSKGFEYLAYAGGIQNEADY-PYEGEDKNCRFNSSKT 311

Query: 572 VTSI 583
           V  +
Sbjct: 312 VVQV 315


>UniRef50_Q235G6 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 325

 Score = 92.3 bits (219), Expect = 8e-18
 Identities = 70/193 (36%), Positives = 95/193 (49%), Gaps = 8/193 (4%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEK--RLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRG 175
           K  + ++YA   + E+  R+N+F  +L +   +       TM V    D T  E A L  
Sbjct: 44  KQTYNKKYADQDDDEEVYRMNVFFDNLEFTKKDP------TMGVTKFMDLTHTEFAEL-- 95

Query: 176 RRYSGPSPHGLPFPYSKSRVEELSVKLPPEH------DWRLFGAVTPVKDQSVCGSCWSF 337
             Y  P+         ++  EE+    P +H      DW   GAVTPVK+Q  CG CWSF
Sbjct: 96  --YLNPA---------ENIDEEIDSLQPIQHNEDIVIDWVEKGAVTPVKNQGGCGGCWSF 144

Query: 338 GTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEED 517
            T G VEGA F++    L  LSQQ LIDC+    N GC GG    A  ++K  GL TEE+
Sbjct: 145 ATTGGVEGANFVYK-NVLPNLSQQQLIDCN--TQNKGCGGGLRDIALNYVKETGLTTEEE 201

Query: 518 YGGYLGQDGYCHV 556
           Y  Y  ++G C +
Sbjct: 202 Y-SYEAKNGKCRL 213


>UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep:
           MGC107932 protein - Xenopus tropicalis (Western clawed
           frog) (Silurana tropicalis)
          Length = 333

 Score = 91.9 bits (218), Expect = 1e-17
 Identities = 60/196 (30%), Positives = 99/196 (50%), Gaps = 5/196 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNR-ANRG---FTMSVNHLADRTDDELAAL 169
           K K++++Y +  +   R   +  +   +  +N+ A++G   + M++N  AD TD+E ++ 
Sbjct: 31  KSKYEKKYVTLDKELNRRKAWEATWEKVQKHNQLADQGLKSYRMAMNQFADLTDNERSS- 89

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSV-CGSCWSFGTV 346
             +    P    L  P         S+ +P E DWR    VTPVK+Q   CGSCW+F TV
Sbjct: 90  --KSCLLPREKSLN-PVKAESYSYTSITIPKEVDWRKSNCVTPVKNQGTFCGSCWAFATV 146

Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGG 526
           G +E    +     L+ LS+Q L+DC     N GC GG   +A E++ +HG+   ++Y  
Sbjct: 147 GVMESRYCIRT-KELLNLSEQQLVDCD--EINEGCCGGFPIKALEYVAQHGVMRNKEY-E 202

Query: 527 YLGQDGYCHVDNVTAV 574
           Y  +   C  D+  A+
Sbjct: 203 YSQKKATCEYDSDKAI 218


>UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_26_47548_45815 - Giardia lamblia
           ATCC 50803
          Length = 577

 Score = 91.9 bits (218), Expect = 1e-17
 Identities = 46/109 (42%), Positives = 66/109 (60%), Gaps = 10/109 (9%)
 Frame = +2

Query: 254 LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGAL----FLHNG-GH--LVRLSQQA 412
           LP E DWR+ G +   KDQ  CGSCW+FG +G +EG +     +  G  H  L   S+Q+
Sbjct: 344 LPQELDWRVRGIMNMAKDQVACGSCWTFGAIGTIEGRINKLRVVEEGLRHEPLKAYSEQS 403

Query: 413 LIDCSWGFGNNGCDGGEDFRAYEW-IKRHG--LPTEEDYGGYLGQDGYC 550
           ++DC WGFG+ GCDGG+   A +W ++ +G  +  E +Y  YLGQ+  C
Sbjct: 404 IVDCYWGFGSFGCDGGDTLAALKWLVENNGGRVAFESEY-PYLGQNDLC 451


>UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_79,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 324

 Score = 91.9 bits (218), Expect = 1e-17
 Identities = 60/187 (32%), Positives = 95/187 (50%), Gaps = 4/187 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRG---FTMSVNHLADRTDDELAALR 172
           K+++ ++++S+ E   R  +F+Q+ + I ++N    G   +TM  N  AD T+ E A   
Sbjct: 40  KIQYNKKFSSEKEEMYRYLVFQQNAQLIEAHNNDKSGKYTYTMETNQFADLTEQEFA--- 96

Query: 173 GRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQ-SVCGSCWSFGTVG 349
            ++Y    P       +KS+  +  V      DW   G V P+KDQ S CGS W+F  VG
Sbjct: 97  -QKYLTFRPKST----NKSKSTDY-VPNGQARDWVEEGKVPPIKDQGSSCGSSWAFSAVG 150

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGY 529
            +E    +  G     LS+Q ++DCS  +GN GC GG     +E+++ HG+     Y  Y
Sbjct: 151 VLEINSNIEFGLETT-LSEQDMLDCSGPYGNQGCSGGWMDSGFEYVRDHGIANGSVY-PY 208

Query: 530 LGQDGYC 550
           +G D  C
Sbjct: 209 VGSDQTC 215


>UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823
           protein, partial; n=1; Ornithorhynchus anatinus|Rep:
           PREDICTED: similar to MGC81823 protein, partial -
           Ornithorhynchus anatinus
          Length = 361

 Score = 91.5 bits (217), Expect = 1e-17
 Identities = 45/90 (50%), Positives = 60/90 (66%), Gaps = 2/90 (2%)
 Frame = +2

Query: 257 PPEH-DWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWG 433
           PPE  DWR  G VTPVKDQ  CGSCW+FG+ G +EG LF    G L  +S+Q L+DCS  
Sbjct: 190 PPEALDWRDHGYVTPVKDQGRCGSCWAFGSTGVLEGQLF-RRTGRLAAVSEQNLMDCSRK 248

Query: 434 FGNNGCDGGEDFRAYEWIKRH-GLPTEEDY 520
            GN GCDGG   +++ +++ + G+ +EE Y
Sbjct: 249 QGNRGCDGGLMQQSFLYVRDNGGVDSEEAY 278


>UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain;
           n=9; Cucujiformia|Rep: Digestive cysteine proteinase
           intestain - Leptinotarsa decemlineata (Colorado potato
           beetle)
          Length = 326

 Score = 91.5 bits (217), Expect = 1e-17
 Identities = 64/200 (32%), Positives = 94/200 (47%), Gaps = 6/200 (3%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDELA-A 166
           K  H + Y S LE   R  IF+ +LR I  +N    +    + + V   AD T DE    
Sbjct: 27  KQTHGKTYKSLLEERTRFGIFQSNLRKIEEHNAKYDKGEESYFLGVTPFADLTHDEFKDE 86

Query: 167 LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
           LR +  + P+       + +       +++P   DW   GAV  VK Q  CGSCW+F   
Sbjct: 87  LRRQIKTKPNVEATLAVFPEG------LEVPDSIDWTQKGAVLDVKYQGGCGSCWAFSAT 140

Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCD-GGEDFRAYEWIKRHGLPTEEDYG 523
           GA+EG   + N    + LS+Q L+DCS  +GN+ C+ GG    A++++   G+  +  Y 
Sbjct: 141 GALEGQNAIVNNVK-IPLSEQQLLDCSKPYGNDDCEHGGLMSFAFDYVLDKGIEADSSY- 198

Query: 524 GYLGQDGYCHVDNVTAVTSI 583
            Y G D  C  D    V  I
Sbjct: 199 PYKGIDTPCQYDAKKTVLKI 218


>UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan
           CA, family C1, cathepsin L-like cysteine peptidase -
           Trichomonas vaginalis G3
          Length = 306

 Score = 91.5 bits (217), Expect = 1e-17
 Identities = 59/181 (32%), Positives = 87/181 (48%), Gaps = 3/181 (1%)
 Frame = +2

Query: 38  EHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFP 217
           E+  RL I+  + RY+   NR N GFT+++N  A  T++E  ++ G +Y   S     +P
Sbjct: 25  EYHFRLGIWLSNKRYVQEKNRVNLGFTLALNRFAHLTENEYRSMLGYKYGHKS-----YP 79

Query: 218 YSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVR 397
            +K+    +   +P E DWR  G V  +K+Q  CGSCW+F  +  +E  +   N   L  
Sbjct: 80  ITKN----IKNDVPTEIDWREQGIVNKIKNQGACGSCWAFSAIQVIESQV-AKNQKQLYD 134

Query: 398 LSQQALIDCSWGFGNNGCDGGEDFRAYEWI---KRHGLPTEEDYGGYLGQDGYCHVDNVT 568
           LS+Q L+DC       GC GG    A E++   +        DY  Y    G C  DN  
Sbjct: 135 LSEQNLLDCVTSC--FGCGGGWSPGALEYVYEKQNSKFMLTTDY-PYTAVQGTCKYDNKK 191

Query: 569 A 571
           A
Sbjct: 192 A 192


>UniRef50_Q231X3 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 323

 Score = 91.1 bits (216), Expect = 2e-17
 Identities = 61/190 (32%), Positives = 93/190 (48%), Gaps = 3/190 (1%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
           K KH ++Y +      RL +F ++L  + ++     G T  +    D TDDE A      
Sbjct: 43  KQKHNKRYENTDYESYRLEVFAENLEVVKNDQTGTYGITKFL----DLTDDEFAG----- 93

Query: 182 YSGPSPHGLPFPYSKSRV-EELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
               +   L   Y +  + E++ V      +W   G V+ VK Q  CGSCW+F    +VE
Sbjct: 94  ----NFLNLKAQYPEDSIAEDIEVDPKININWVEAGKVSNVKSQGNCGSCWAFSATASVE 149

Query: 359 GALFLHNG-GHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLG 535
            AL +       + LS+Q LIDCS  +GN GC  G+  +A  +IKR+ + TE++Y  Y  
Sbjct: 150 SALIIAGKVDKSISLSEQQLIDCSGDYGNYGCAAGQKEQALVYIKRYSITTEQNY-PYTE 208

Query: 536 QD-GYCHVDN 562
           +D   C+ DN
Sbjct: 209 KDVQKCYFDN 218


>UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus
           tropicalis|Rep: LOC594890 protein - Xenopus tropicalis
           (Western clawed frog) (Silurana tropicalis)
          Length = 355

 Score = 90.6 bits (215), Expect = 2e-17
 Identities = 60/185 (32%), Positives = 92/185 (49%), Gaps = 5/185 (2%)
 Frame = +2

Query: 11  HQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRG---FTMSVNHLADRTDDELAALRGR 178
           H++ Y ++ E   R  I+  +L++I  +N   + G   + + +NHL D   +E+   +  
Sbjct: 59  HKKIYKNEGEELARRLIWEDTLKFIMLHNLEYSMGLHTYEVGMNHLGDMVAEEMTDKQMN 118

Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
                  +    P       E+S   PPE  DWR    VT VKDQ  C + W+F ++GA+
Sbjct: 119 FIPQVIANITDVPV------EISKSSPPESIDWRNKNCVTSVKDQGSCIASWAFSSIGAL 172

Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLG 535
           E        G L  LS Q L+DCS  +GNNGC GG    ++ +I  +G+  E +Y  Y G
Sbjct: 173 ECQNMKRRTGKLESLSVQNLLDCSQTYGNNGCKGGWVVSSFRYIIDNGIELESNY-PYQG 231

Query: 536 QDGYC 550
           +DG C
Sbjct: 232 KDGKC 236


>UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain -
           Tetrahymena pyriformis
          Length = 330

 Score = 90.6 bits (215), Expect = 2e-17
 Identities = 60/181 (33%), Positives = 87/181 (48%), Gaps = 5/181 (2%)
 Frame = +2

Query: 23  YASDLEHEKRLNIFRQSLRYIHSNNRANRGFTM--SVNHLADRTDDELAALRGRRYSGPS 196
           Y +  E   RL++F ++L+ I +NN AN   T    VN   D T++E AA R      P 
Sbjct: 47  YKNQGEESYRLSVFLENLKSIEANN-ANPLSTHVEEVNSFTDLTEEEFAA-RYLMKDLPQ 104

Query: 197 PHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLH 376
                 P     +E  ++  P   DW     + PVK+Q  CGSCW+F T G +EG   +H
Sbjct: 105 QMNKDLPI----LEMETLAAPQVIDWTAKNVLPPVKNQQQCGSCWAFSTAGMLEGVYNIH 160

Query: 377 NGGHL-VRLSQQALIDC--SWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGY 547
                 +  S+Q L+DC  + GFG  GC+G     A  + ++ G+  E  Y  Y  +DG 
Sbjct: 161 ESPQTPISFSEQQLVDCCGAQGFGCEGCNGAWPTDAVAYTQKFGIVQESQY-AYTAKDGS 219

Query: 548 C 550
           C
Sbjct: 220 C 220


>UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 343

 Score = 90.2 bits (214), Expect = 3e-17
 Identities = 59/180 (32%), Positives = 93/180 (51%), Gaps = 5/180 (2%)
 Frame = +2

Query: 5   VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRG-FTMSVNHLADRTDDELAALRGRR 181
           VK+ R+Y ++ E  KR  IF ++L  +   N+ + G  T  +N  +D T++E      + 
Sbjct: 56  VKYLREYPNEYEIVKRFTIFSRNLDLVERYNKEDAGKVTYELNDFSDLTEEEWK----KY 111

Query: 182 YSGPSP-HGLPFPYSKSRVEELSVKLPPEHDWRLFGA---VTPVKDQSVCGSCWSFGTVG 349
              P P H       K+ +++ +  LP   DWR       VT +K Q  CGSCW+F T  
Sbjct: 112 LMTPKPDHSEKSLKPKTLIDKKN--LPNSVDWRNVNGTNHVTGIKYQGPCGSCWAFATAA 169

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGY 529
           A+E A+ + +GG L  LS Q L+DC+    ++ C GGE   A ++ + HG+ T  +Y  Y
Sbjct: 170 AIESAVSI-SGGGLQSLSSQQLLDCT--VVSDKCGGGEPVEALKYAQSHGITTAHNYPYY 226


>UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Lymantria
           dispar multicapsid nuclear polyhedrosis virus (LdMNPV)
          Length = 356

 Score = 90.2 bits (214), Expect = 3e-17
 Identities = 64/188 (34%), Positives = 98/188 (52%), Gaps = 5/188 (2%)
 Frame = +2

Query: 11  HQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRG--FTMSVNHLADRTDDELAALRGRR 181
           + + Y SD E  KR +IF+ +L  I++ N  A  G   T  +N  +D +  EL A    +
Sbjct: 63  YNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKINKFSDLSKSELIA----K 118

Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
           ++G S       + K+ +        P H DWR    VT +K+Q  CG+CW+F T+ +VE
Sbjct: 119 FTGLSIPERVSNFCKTIILNQPPDKGPLHFDWREQNKVTSIKNQGACGACWAFATLASVE 178

Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKR-HGLPTEEDYGGYLG 535
            + F      L+ LS+Q LIDC     + GC+GG    A+E I R  G+ TE DY  ++G
Sbjct: 179 -SQFAMRHNRLIDLSEQQLIDCD--SVDMGCNGGLLHTAFEEIMRMGGVQTELDY-PFVG 234

Query: 536 QDGYCHVD 559
           ++  C +D
Sbjct: 235 RNRRCGLD 242


>UniRef50_Q22LI1 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 987

 Score = 89.8 bits (213), Expect = 4e-17
 Identities = 61/199 (30%), Positives = 98/199 (49%), Gaps = 12/199 (6%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRGFTMSVNHLADRTDDELA------A 166
           KH + +  + + + RL+IF ++ + I  +N  ++  F + +N  A  T  E A      +
Sbjct: 37  KHNKVFDPE-QLKYRLSIFAENYKKIKEHNYNSSNTFQLGLNEYAHMTSQEFAEVFLTPS 95

Query: 167 LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
           +   +   P P   P P+  +     +V + P  DWR  GAVT VK Q  CGSCWSF   
Sbjct: 96  ISKSQQKQPKPKPQPQPHPNNSTNT-TVTITPI-DWRNKGAVTSVKRQGKCGSCWSFSAA 153

Query: 347 GAVEGALFLHNGGHLVRLSQQALIDC-----SWGFGNNGCDGGEDFRAYEWIKRHGLPTE 511
           G +E   +    G+L+ LS+Q L+DC        + +NGC+GG    A E+  ++G+   
Sbjct: 154 GLMEAFQYFKT-GNLIDLSEQQLVDCDNSSFDKSYYSNGCNGGYPQEAVEYASKYGIVPL 212

Query: 512 EDYGGYLGQDGYCHVDNVT 568
            DY  Y+ Q   C + + T
Sbjct: 213 TDY-PYVKQQQPCAIKSPT 230


>UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_26_49243_47612 - Giardia lamblia
           ATCC 50803
          Length = 543

 Score = 89.4 bits (212), Expect = 5e-17
 Identities = 48/117 (41%), Positives = 65/117 (55%), Gaps = 8/117 (6%)
 Frame = +2

Query: 227 SRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG---ALFLHNGGH--- 388
           S   +  V+ P + DWR+ G +TPVKDQ+ CGSCWSFG  G +EG   AL    G     
Sbjct: 307 SEENQKRVQFPRQLDWRVRGVITPVKDQAACGSCWSFGAAGTIEGRLNALKWKRGERDTP 366

Query: 389 LVRLSQQALIDCSWGFGNNGCDGGEDFRAY-EWIKR-HGLPTEEDYGGYLGQDGYCH 553
           L+R+S+Q++I C W   NNGC+GG  + A   +I    G    E    YLG +  C+
Sbjct: 367 LLRVSEQSIISCVWNEDNNGCNGGLTYEALTAYINEFSGRIAYEMDSPYLGVESLCN 423


>UniRef50_Q22NW9 Cluster: Papain family cysteine protease containing
            protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
            family cysteine protease containing protein - Tetrahymena
            thermophila SB210
          Length = 894

 Score = 89.4 bits (212), Expect = 5e-17
 Identities = 58/164 (35%), Positives = 91/164 (55%), Gaps = 3/164 (1%)
 Frame = +2

Query: 38   EHEKRLNIFRQSLRYIHSNNR-ANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPF 214
            E+  RLNIF ++L+ I ++N+ +N+ +   +N     T++E      + Y       L  
Sbjct: 617  EYMYRLNIFAKNLQNIKNHNQISNKPYIEGINQFTHLTEEEFE----QTYLT-----LQI 667

Query: 215  PYSKS-RVEE-LSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGH 388
            P SK  + +E L  ++P   DWR   AVTPVK+Q  CGS ++F T GA+EG +   +G  
Sbjct: 668  PASKQYKTQEFLGDEVPSSIDWRDLNAVTPVKNQGSCGSGYAFSTTGALEG-IHKISGKD 726

Query: 389  LVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
                S+Q +IDCS   GN+GC GG    A++++  +G+  E DY
Sbjct: 727  WKGFSEQQIIDCSRKQGNSGCHGGFMENAFDFVIENGILQENDY 770


>UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_21,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 349

 Score = 89.0 bits (211), Expect = 7e-17
 Identities = 59/187 (31%), Positives = 94/187 (50%), Gaps = 6/187 (3%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRG---FTMSVNHLADRTDDELAALRG 175
           +H ++Y +  E+  R  IF+++ +YI  +  R   G   F + +N  AD + +E  A + 
Sbjct: 46  EHGKRY-TQFENSHRFGIFKKNYQYIQEHQQRVEAGLETFELGLNDFADLSVEEFEA-KY 103

Query: 176 RRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
            +Y        P   +         ++P E D R  G V+ VK+Q  CGSCW+F  V A+
Sbjct: 104 LKY-----RSTPREQTNQVYRRTGKQVPIEVDLRKDGVVSEVKNQGSCGSCWAFSAVAAL 158

Query: 356 EGALFLHNGGHLVRLSQQALIDCSW--GFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGY 529
           E AL    G   V LS+Q L+DC+    F + GCDGGE +  +++  ++G+    +Y  Y
Sbjct: 159 ETAL-RQGGVKNVELSEQELVDCAVKDEFESEGCDGGEMYDGFQYASKYGIAIRSEY-PY 216

Query: 530 LGQDGYC 550
            G D  C
Sbjct: 217 AGVDQKC 223


>UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia
           circumcincta|Rep: Secreted cathepsin F - Teladorsagia
           circumcincta
          Length = 364

 Score = 88.6 bits (210), Expect = 9e-17
 Identities = 59/180 (32%), Positives = 87/180 (48%), Gaps = 9/180 (5%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTM-SVNHLADRTDDELAALRGRRY 184
           +H + Y ++ E  KR  IF+++L  I S    ++G  +  +N  AD + +E         
Sbjct: 70  RHDKVYRNESEALKRFGIFKRNLEIIRSAQENDKGTAIYGINQFADLSPEEFKKTH---- 125

Query: 185 SGPSPHGLPFPYSKSRVEELSVK-------LPPEHDWRLFGAVTPVKDQSVCGSCWSFGT 343
               PH    P   +R+ +L+ +       LP   DWR  GAVT VK +  C +CW+F  
Sbjct: 126 ---LPHTWKQPDHPNRIVDLAAEGVDPKEPLPESFDWREHGAVTKVKTEGHCAACWAFSV 182

Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAY-EWIKRHGLPTEEDY 520
            G +EG  FL     LV LS Q L+DC     + GC+GG    AY E ++  GL  E+ Y
Sbjct: 183 TGNIEGQWFLAK-KKLVSLSAQQLLDCD--VVDEGCNGGFPLDAYKEIVRMGGLEPEDKY 239


>UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba
           histolytica|Rep: Cysteine protease 10 - Entamoeba
           histolytica
          Length = 297

 Score = 88.2 bits (209), Expect = 1e-16
 Identities = 58/197 (29%), Positives = 97/197 (49%), Gaps = 3/197 (1%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMS-VNHLADRTDDELAALRGR 178
           K+K+  +Y+   E  +R  IF Q+ + I   N+ N  FT++     +  T++E   L  R
Sbjct: 24  KIKYNTKYSGS-EALRRRAIFLQNSKLIQMINKQNLSFTVTNEGPFSVLTNEEYRMLHHR 82

Query: 179 RYSGPSPHGLPFPYSKSRVEELSVK-LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
                    L   +  + V+++  K +    DWR  G VTPVK+Q  C SC++FG++  +
Sbjct: 83  IDIEKEIKQLK-SHRMNLVKKMDNKEVLDSIDWRSEGKVTPVKNQRKCASCYAFGSIATI 141

Query: 356 EGALFLHNGGHLVRLSQQALIDCSWG-FGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYL 532
           E  +        + LS+Q ++DCS G + N GC  G    ++ +++ HG+  E DY  Y 
Sbjct: 142 ESLIMQETSIKEIDLSEQQIVDCSQGEYSNWGCTCGNVGNSFNYVRDHGILLERDY-PYT 200

Query: 533 GQDGYCHVDNVTAVTSI 583
           G+   C +D    V  I
Sbjct: 201 GKANNCSIDGKKPVIKI 217


>UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n=1;
           Toxocara canis|Rep: Cathepsin L-like cysteine proteinase
           - Toxocara canis (Canine roundworm)
          Length = 360

 Score = 88.2 bits (209), Expect = 1e-16
 Identities = 43/90 (47%), Positives = 54/90 (60%)
 Frame = +2

Query: 251 KLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSW 430
           ++P   DWR +  VTPVK Q  CGSCW+F TVG VE A  L   G L  LS+Q L+DC+ 
Sbjct: 144 EIPDHFDWRPYNVVTPVKSQFKCGSCWAFATVGTVESAYAL-GTGELRSLSEQQLLDCN- 201

Query: 431 GFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
              NN CDGG+  +A  ++   GL  E DY
Sbjct: 202 -LENNACDGGDVDKALRYVYDEGLMREYDY 230


>UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:
           Viral cathepsin - Cydia pomonella granulosis virus
           (CpGV) (Cydia pomonella granulovirus)
          Length = 333

 Score = 87.8 bits (208), Expect = 2e-16
 Identities = 58/186 (31%), Positives = 87/186 (46%), Gaps = 4/186 (2%)
 Frame = +2

Query: 5   VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRY 184
           +K+ + Y SD E   +L  F+ +L+ I+  N A++     +N  +D   + L        
Sbjct: 37  IKYNKTYVSDEERAIKLENFKNNLKMINEKNMASKYAVFDINEYSDLNKNALLRRTTGFR 96

Query: 185 SGPSPHGLPFPYSKSRV----EELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGA 352
            G   +   F  ++  V    +E    LP   DWR    VTPVK+Q  CGSCW+F T+  
Sbjct: 97  LGLKKNPSAFTMTECSVVVIKDEPQALLPETLDWRDKHGVTPVKNQMECGSCWAFSTIAN 156

Query: 353 VEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYL 532
           +E +L+       + LS+Q L++C     NNGC GG    A E I + G     +   Y 
Sbjct: 157 IE-SLYNIKYDKALNLSEQHLVNCD--NINNGCAGGLMHWALESILQEGGVVSAENEPYY 213

Query: 533 GQDGYC 550
           G DG C
Sbjct: 214 GFDGVC 219


>UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8;
           Magnoliophyta|Rep: Os12g0273800 protein - Oryza sativa
           subsp. japonica (Rice)
          Length = 504

 Score = 87.4 bits (207), Expect = 2e-16
 Identities = 59/183 (32%), Positives = 88/183 (48%), Gaps = 2/183 (1%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRG-FTMSVNHLADRTDDELAALRGRRY 184
           +H R Y    E  +RL +F+ ++ +I S N   +  + + VN  AD T +E  A      
Sbjct: 50  QHGRVYKDAAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSK 109

Query: 185 SGPSPHGLPFPYSKSRVEELSVK-LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
              +P+      +  + E +S   LP   DWR  GAVT +KDQ  C          A+EG
Sbjct: 110 GFSTPNNGVRVSTGFKYENVSADALPASVDWRTKGAVTRIKDQGQC----------AMEG 159

Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQD 541
            + L  G  L+ LS+Q L+DC     + GC+GGE   A+++I  +G  T E    Y  +D
Sbjct: 160 FVKLSTG-KLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAED 218

Query: 542 GYC 550
           G C
Sbjct: 219 GRC 221


>UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia
           bovis|Rep: Cysteine protease 2 - Babesia bovis
          Length = 445

 Score = 87.4 bits (207), Expect = 2e-16
 Identities = 45/94 (47%), Positives = 56/94 (59%)
 Frame = +2

Query: 269 DWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNG 448
           DWR   AVTPVKDQ +CGSCW+F  VG+VE  L        VRLS+Q L+ C    GN G
Sbjct: 241 DWRRADAVTPVKDQGMCGSCWAFAAVGSVESLLKRQKTD--VRLSEQELVSCQ--LGNQG 296

Query: 449 CDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYC 550
           C+GG    A  +IK +G+   E++  YL  DG C
Sbjct: 297 CNGGYSDYALNYIKFNGIHRSEEW-PYLAADGKC 329


>UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1];
           n=11; Eutheria|Rep: Testin-2 precursor [Contains:
           Testin-1] - Mus musculus (Mouse)
          Length = 333

 Score = 87.4 bits (207), Expect = 2e-16
 Identities = 62/199 (31%), Positives = 95/199 (47%), Gaps = 9/199 (4%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDELAAL 169
           + KH + Y  + E  +R  ++ ++ + I  +N         FTM++N   D T+ E   +
Sbjct: 33  RTKHGKAYNVNEERLRRA-VWEKNFKMIELHNWEYLEGKHDFTMTMNAFGDLTNTEFVKM 91

Query: 170 RG--RRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGT 343
               RR      H           +   + +P   DWR+ G VTPVK+Q  C S W+F  
Sbjct: 92  MTGFRRQKIKRMHVFQ--------DHQFLYVPKYVDWRMLGYVTPVKNQGYCASSWAFSA 143

Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDY 520
            G++EG +F    G LV LS+Q L+DC      + C GG    A++++K + GL TEE Y
Sbjct: 144 TGSLEGQMF-KKTGRLVPLSEQNLLDCMGSNVTHDCSGGFMQNAFQYVKDNGGLATEESY 202

Query: 521 GGYLGQDGYC--HVDNVTA 571
             Y+G    C  H +N  A
Sbjct: 203 -PYIGPGRKCRYHAENSAA 220


>UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome
           shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
           Chromosome 21 SCAF14577, whole genome shotgun sequence -
           Tetraodon nigroviridis (Green puffer)
          Length = 406

 Score = 87.0 bits (206), Expect = 3e-16
 Identities = 41/88 (46%), Positives = 56/88 (63%)
 Frame = +2

Query: 236 EELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQAL 415
           E+L  + PP  DWR  G V+PV++Q  C SCW+F ++GA+EG +     G LV LS Q L
Sbjct: 149 EKLGFETPPSVDWRKAGLVSPVQNQGFCNSCWAFSSLGALEGQM-KKRTGFLVPLSPQNL 207

Query: 416 IDCSWGFGNNGCDGGEDFRAYEWIKRHG 499
           +DCS   GN GC GG   ++Y +I R+G
Sbjct: 208 LDCSISDGNLGCRGGYISKSYSYIIRNG 235


>UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum
           aestivum|Rep: Cysteine protease - Triticum aestivum
           (Wheat)
          Length = 371

 Score = 87.0 bits (206), Expect = 3e-16
 Identities = 47/109 (43%), Positives = 60/109 (55%), Gaps = 1/109 (0%)
 Frame = +2

Query: 257 PPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGF 436
           P + DWR  G VTP K Q  CG CW+F     VE +L   NGG LV LS Q L+DCS G 
Sbjct: 154 PRQFDWREHGVVTPAKQQGACGCCWAFAAAATVE-SLNKINGGELVDLSVQELVDCSTGV 212

Query: 437 GNNGCDGGEDFRAYEWIK-RHGLPTEEDYGGYLGQDGYCHVDNVTAVTS 580
            ++ C  G    A  WIK + GL TE +Y  Y+ + G C V +   V++
Sbjct: 213 FSSPCGYGWPKSALAWIKSKGGLLTEAEY-PYMAKRGRCAVHDTARVSA 260


>UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3;
           Dictyostelium discoideum|Rep: Cysteine proteinase 1
           precursor - Dictyostelium discoideum (Slime mold)
          Length = 343

 Score = 87.0 bits (206), Expect = 3e-16
 Identities = 65/186 (34%), Positives = 89/186 (47%), Gaps = 15/186 (8%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHS------NNRANRGFTMSVNHLADRTDDELAAL 169
           K  ++Y+ + E+ +R  IF+ +L  I        N++A+  F   VN  AD + DE    
Sbjct: 35  KFNKKYSHE-EYLERFEIFKSNLGKIEELNLIAINHKADTKF--GVNKFADLSSDEFKNY 91

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
                       LP   +    +E    +P   DWR  GAVTPVK+Q  CGSCWSF T G
Sbjct: 92  YLNNKEAIFTDDLPV--ADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTG 149

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSW--------GFGNNGCDGGEDFRAYEW-IKRHGL 502
            VEG  F+ +   LV LS+Q L+DC             + GC+GG    AY + IK  G+
Sbjct: 150 NVEGQHFI-SQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGI 208

Query: 503 PTEEDY 520
            TE  Y
Sbjct: 209 QTESSY 214


>UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11;
           Trichomonadidae|Rep: Cysteine protease - Tritrichomonas
           foetus (Trichomonas foetus)
          Length = 315

 Score = 86.6 bits (205), Expect = 4e-16
 Identities = 58/185 (31%), Positives = 89/185 (48%), Gaps = 3/185 (1%)
 Frame = +2

Query: 5   VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRY 184
           +++  Q+    E++ R  IF  + RY+  +N  +  FT+S+N  A  T  E   + G + 
Sbjct: 26  MRNTNQFYVGNEYQLRFGIFLSNARYVQEHNAGDSKFTVSLNKFAALTPSEYKVMLGYK- 84

Query: 185 SGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
           +G     +     K  V+ +        DWR  G V  +KDQ+ CGSCW+F  + A E A
Sbjct: 85  TGMKAEKVSRGMKKPNVDSI--------DWREKGVVNEIKDQAACGSCWAFSAIQAAESA 136

Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI---KRHGLPTEEDYGGYLG 535
            +  + G L   S+Q L+DC  G    GC GG    AY++I   ++  +  E DY  Y  
Sbjct: 137 -YAISTGTLESYSEQNLVDCVQGC--YGCSGGLMDYAYKYIIDRQKGKMILESDY-VYTA 192

Query: 536 QDGYC 550
            DG C
Sbjct: 193 LDGVC 197


>UniRef50_Q23H32 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 365

 Score = 86.6 bits (205), Expect = 4e-16
 Identities = 61/182 (33%), Positives = 90/182 (49%), Gaps = 9/182 (4%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRG----FTMSVNHLADRTDDELAAL 169
           K   +++YA D E + R  IF ++  YIH+ N+ N        + VN  AD +  E   L
Sbjct: 46  KKTFRKRYA-DSEGDYRFQIFAENYNYIHNYNQINENSQDNIQLEVNEFADLSLQEFREL 104

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEE---LSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFG 340
               Y+    H      S   + +   LS  +P   DWR    V PV+ Q  CGSCW+F 
Sbjct: 105 YFG-YNSSKKHNNQQNGSTKNLRQSFLLSDSVPESVDWRE-KLVAPVQKQGGCGSCWAFS 162

Query: 341 TVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKR--HGLPTEE 514
           TV A+EGA +    G++++ S+Q LIDC     NNGC+GG+   A + +     G+   +
Sbjct: 163 TVIALEGA-YAKQTGNVIKFSEQNLIDCC-RIENNGCNGGDPEPALDCVMNVLKGIMKNQ 220

Query: 515 DY 520
           DY
Sbjct: 221 DY 222


>UniRef50_Q24FN2 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 367

 Score = 85.4 bits (202), Expect = 9e-16
 Identities = 41/103 (39%), Positives = 61/103 (59%), Gaps = 3/103 (2%)
 Frame = +2

Query: 269 DWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSW---GFG 439
           DWR  GAV+PVK+Q  CGSCW+F  V   E    L N   L   S+Q L+DC++    + 
Sbjct: 160 DWRQSGAVSPVKNQGSCGSCWAFSAVALAESVNLLRNNS-LALYSEQELVDCTYKNPQYY 218

Query: 440 NNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYCHVDNVT 568
           N GC GG    AY +IK  G+ ++++Y  Y+GQ+  C +++ +
Sbjct: 219 NYGCQGGWPSVAYRYIKDQGISSQQNY-PYIGQNRNCSINSAS 260


>UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14;
           Leishmania|Rep: Cysteine proteinase 1 precursor -
           Leishmania pifanoi
          Length = 354

 Score = 85.4 bits (202), Expect = 9e-16
 Identities = 56/165 (33%), Positives = 81/165 (49%), Gaps = 3/165 (1%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVN-HLADRTDDELAALRGR 178
           K +H + +  D E   R N F+Q+++  +  N  N      V+   AD T  E A L   
Sbjct: 46  KKRHGKAFGGDAEEGHRFNAFKQNMQTAYFLNTQNPHAHYDVSGKFADLTPQEFAKL--- 102

Query: 179 RYSGPSPHGLPFPYSKS--RVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGA 352
            Y  P  +       K    V++ +       DWR  GAVTPVK+Q +CGSCW+F  +G 
Sbjct: 103 -YLNPDYYARHLKDHKEDVHVDDSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGN 161

Query: 353 VEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI 487
           +EG  +  +G  LV LS+Q L+ C     + GC+GG   +A  WI
Sbjct: 162 IEGQ-WAASGHSLVSLSEQMLVSCD--NIDEGCNGGLMDQAMNWI 203


>UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus
           tauri|Rep: Cysteine protease-1 - Ostreococcus tauri
          Length = 430

 Score = 85.0 bits (201), Expect = 1e-15
 Identities = 63/175 (36%), Positives = 84/175 (48%), Gaps = 14/175 (8%)
 Frame = +2

Query: 38  EHEKRLNIFRQSLRYIHSNNR----ANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHG 205
           E+ KRL  F ++  Y+  +N           + +N LA  T +E  AL G +    S   
Sbjct: 116 EYAKRLATFAENAAYVVEHNALYAIGEVSHWVGLNSLAATTREEYRALLGYKPELRSSGD 175

Query: 206 LPF--PYSKSRVEELSVKL------PPEH-DWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
                  S  +VE+           PPE  DW   GAVTP K+Q  CGSCW+F T GAVE
Sbjct: 176 AEMLEATSTDKVEQYKASWEYASVDPPEAIDWVELGAVTPPKNQGQCGSCWAFSTTGAVE 235

Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI-KRHGLPTEEDY 520
           G   +   G LV LS+Q ++ CS    N GC+GG    A+ WI K  G+ +E  Y
Sbjct: 236 GITKIRT-GRLVSLSEQEMVSCS--KQNMGCNGGLMDYAFRWIVKNGGIDSEFQY 287


>UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep:
           Cysteine proteinase - Paragonimus westermani
          Length = 272

 Score = 84.6 bits (200), Expect = 2e-15
 Identities = 49/120 (40%), Positives = 66/120 (55%), Gaps = 2/120 (1%)
 Frame = +2

Query: 230 RVEELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQ 406
           RV    +K  PE  DWR  GAVT V++Q  CGSCW+F T G VEG  F+   G LV LS+
Sbjct: 45  RVRPTGLKAAPERIDWRAKGAVTAVENQGSCGSCWAFSTAGNVEGQWFIKT-GQLVSLSK 103

Query: 407 QALIDCSWGFGNNGCDGGEDFRAY-EWIKRHGLPTEEDYGGYLGQDGYCHVDNVTAVTSI 583
           Q L+DC      +GC+GG    +Y E +   GL +++DY  Y G    C ++    +  I
Sbjct: 104 QQLVDCD--RAADGCNGGWPASSYLEIMHMGGLESQDDY-PYAGVKEQCFMEKERLLAKI 160


>UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh
           fly) (Boettcherisca peregrina). Cathepsin L; n=2;
           Dictyostelium discoideum|Rep: Similar to Sarcophaga
           peregrina (Flesh fly) (Boettcherisca peregrina).
           Cathepsin L - Dictyostelium discoideum (Slime mold)
          Length = 265

 Score = 84.6 bits (200), Expect = 2e-15
 Identities = 51/146 (34%), Positives = 71/146 (48%), Gaps = 2/146 (1%)
 Frame = +2

Query: 119 MSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRV--EELSVKLPPEHDWRLFGAV 292
           M +N  +D T  E A     +   P P   P    K+      ++  +P   DWR  GAV
Sbjct: 1   MDLNEYSDLTQKEFADKFFEKLV-PEPRSGPINDIKATPFKHNVNATIPKSFDWRDHGAV 59

Query: 293 TPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFR 472
             VK+Q  C SCWSF  +GA+EG  ++   G L+ LS+Q L+DC+  FG  GC  G    
Sbjct: 60  GKVKNQGSCASCWSFSALGALEGHYYI-KYGELLDLSEQNLVDCATPFGPKGCKTGWMHD 118

Query: 473 AYEWIKRHGLPTEEDYGGYLGQDGYC 550
           A+++I   G    E    Y G+D  C
Sbjct: 119 AFKYIISSGGVNLESQYPYTGKDEVC 144


>UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa
           (japonica cultivar-group)|Rep: Os09g0562700 protein -
           Oryza sativa subsp. japonica (Rice)
          Length = 235

 Score = 84.2 bits (199), Expect = 2e-15
 Identities = 46/93 (49%), Positives = 57/93 (61%), Gaps = 1/93 (1%)
 Frame = +2

Query: 245 SVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDC 424
           S  LP +H     GAVT VKDQ  CGSCW+F TV  VEG   +   G LV LS+Q L+DC
Sbjct: 10  SCLLPVDHG----GAVTEVKDQGRCGSCWAFSTVAVVEGIQKIKK-GKLVSLSEQELVDC 64

Query: 425 SWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDY 520
                ++GCDGG  +RA EWI  + G+ T +DY
Sbjct: 65  D--TLDSGCDGGVSYRALEWITANGGITTRDDY 95


>UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2;
           Trypanosoma cruzi|Rep: Cysteine proteinase, putative -
           Trypanosoma cruzi
          Length = 392

 Score = 84.2 bits (199), Expect = 2e-15
 Identities = 66/192 (34%), Positives = 97/192 (50%), Gaps = 10/192 (5%)
 Frame = +2

Query: 38  EHEKRLNIFRQSLRYIHSNNRA-NRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPF 214
           E+ +R  +F Q+L  + ++N A N  + M +NH++D T +ELA+L G R    S H L  
Sbjct: 70  EYVRRRALFEQTLARVRTHNEAGNHLYVMGINHMSDWTPEELASLNGARPRMMS-H-LAQ 127

Query: 215 PYSKSRVEELSVKLPPEHDWRLFGA--VTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGH 388
              + R +    ++P E D+R      +T VKDQ  CGSCW+ G    +E + F    G 
Sbjct: 128 KSLQRRYQSSGGRIPDEVDYRNSSPAILTAVKDQGRCGSCWAHGAAEEME-SHFAILTGR 186

Query: 389 LVRLSQQALIDCSWG----FGNNGCDGGEDFRAYEWIKRHGLPTE--EDYGGYLGQDGYC 550
           L  LSQQ L  C+       G  GC G     AYE+ K+ G+ +E    Y  Y G+ G C
Sbjct: 187 LHVLSQQQLTSCAPNPKKCGGTGGCYGSTADLAYEYAKQ-GITSEWVYSYTSYRGETGDC 245

Query: 551 HVD-NVTAVTSI 583
             + +V AV  +
Sbjct: 246 RNELDVIAVAQV 257


>UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas
           foetus|Rep: TFCP2 protein - Tritrichomonas foetus
           (Trichomonas foetus)
          Length = 270

 Score = 84.2 bits (199), Expect = 2e-15
 Identities = 54/167 (32%), Positives = 81/167 (48%), Gaps = 4/167 (2%)
 Frame = +2

Query: 95  NRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVK-LPPEHD 271
           N    G+T+S+ H A  T  E A+L        S H        S  E +  K  P   D
Sbjct: 3   NSKGHGYTLSLYHFATYTSSEYASLLNVPSGRMSSH-------HSHHERIQYKDTPTSFD 55

Query: 272 WRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDC-SWGFGNNG 448
           WR  G V P+K+Q  CGSCW+F  + A E    +   G L+R S+Q+L+DC +  +   G
Sbjct: 56  WRSEGKVNPIKNQGSCGSCWAFSAIAAQESCHAIAT-GELLRFSEQSLVDCVTSDYSCQG 114

Query: 449 CDGGEDFRAYEWI--KRHGLPTEEDYGGYLGQDGYCHVDNVTAVTSI 583
           C GG   +A +++  +++G    E+   Y G  G C  D  + V++I
Sbjct: 115 CSGGWPDQAMKYVIEQQNGKFILEENYQYSGHKGACLYDEKSKVSNI 161


>UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep:
           Cysteine proteinase - Entamoeba histolytica
          Length = 320

 Score = 83.4 bits (197), Expect = 4e-15
 Identities = 42/107 (39%), Positives = 64/107 (59%), Gaps = 5/107 (4%)
 Frame = +2

Query: 254 LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFL-----HNGGHLVRLSQQALI 418
           +P   DWR  G +TP++D + CGSC+SFG++ A+E  L +     +N  +L  LS+Q ++
Sbjct: 97  IPTAIDWRAEGKLTPIRDHTQCGSCYSFGSLAAIESRLLIGGSQTYNADNL-DLSEQQIV 155

Query: 419 DCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYCHVD 559
           DCS    NNGC+GG     + + KR+G+  E+DY  Y   +G C  D
Sbjct: 156 DCS--NKNNGCNGGSILYVFAYTKRNGVIEEKDY-PYTATNGTCQYD 199


>UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium
           tetraurelia|Rep: Cathepsin L1 precursor - Paramecium
           tetraurelia
          Length = 314

 Score = 83.4 bits (197), Expect = 4e-15
 Identities = 59/192 (30%), Positives = 92/192 (47%), Gaps = 9/192 (4%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHS--NNRANRGFTMSVNHLADRTDDELA---- 163
           K+K+ R+Y +  +   R  +F  +L YI +   +     FT+ +N  AD +  E A    
Sbjct: 30  KMKYNRRYTNQRDEMYRYKVFTDNLNYIRAFYESPEEATFTLELNQFADMSQQEFAQTYL 89

Query: 164 ALRGRRYSGPSPHGLPFPYSKSRVE---ELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWS 334
           +L+  R +  +     F Y  + V+      VK P             VK+Q  CGSCW+
Sbjct: 90  SLKVPRTAKLNAANSNFQYKGAEVDWTDNKKVKYPA------------VKNQGSCGSCWA 137

Query: 335 FGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEE 514
           F  VGA+E    +        LS+Q L+DCS  + N+GC+GG    A+E++  +GL   +
Sbjct: 138 FSAVGALEINTDIELN-RKYELSEQDLVDCSGPYDNDGCNGGWMDSAFEYVADNGLAEAK 196

Query: 515 DYGGYLGQDGYC 550
           DY  Y  +DG C
Sbjct: 197 DY-PYTAKDGTC 207


>UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease
           Gip1p; n=4; Tetrahymena thermophila|Rep:
           Granule-biosynthesis induced protease Gip1p -
           Tetrahymena thermophila
          Length = 345

 Score = 83.0 bits (196), Expect = 5e-15
 Identities = 53/198 (26%), Positives = 95/198 (47%), Gaps = 8/198 (4%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAA----- 166
           +  ++R Y ++ E   R  +F ++L  ++ +  +++ ++  +N  +D T +E        
Sbjct: 44  RFNYKRVYLNEEEQIYRQIVFFENLASVNKHP-SHKSYSKGLNQFSDMTKEEFKQRVLNK 102

Query: 167 -LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGT 343
            +  +  S      L    + S +   +  LP   DWR  G + PVK+Q  CGSCW+F T
Sbjct: 103 KISKKASSNKGGRNLAADPAVSNLVFPTNNLPLSVDWRKRGVLNPVKNQGTCGSCWTFAT 162

Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDC--SWGFGNNGCDGGEDFRAYEWIKRHGLPTEED 517
            G +E    + N   L++ S+Q L+DC    G+ ++GCDGG       +   +G+     
Sbjct: 163 AGILESFNQIKN-KQLLKFSEQQLVDCVSLAGYDSDGCDGGFQEDGVRYAIEYGIVQSYK 221

Query: 518 YGGYLGQDGYCHVDNVTA 571
           Y  Y+G  G C V + T+
Sbjct: 222 Y-PYVGYQGRCKVTSPTS 238


>UniRef50_Q24FA8 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 335

 Score = 83.0 bits (196), Expect = 5e-15
 Identities = 56/194 (28%), Positives = 97/194 (50%), Gaps = 12/194 (6%)
 Frame = +2

Query: 23  YASDLEHEKRLNIFRQSLRYIHSNNR-ANRGFTMSVNHLADRTDDELAALRGRRYSGPSP 199
           Y+S+ E   R ++F ++ + +  +N+ +N  +++ +N  +D T   L   + R     SP
Sbjct: 43  YSSEAEKIYRQSVFLENYQSVQEHNKNSNHTYSVGINQFSDIT---LQEYQQRILMKNSP 99

Query: 200 HGLPFPYSKSRVEELS-------VKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
                  +K+R+ + S        ++    DWR  G V+PVK+Q  CG CW+F   G +E
Sbjct: 100 LN-ELAKNKNRLLQSSPIQNSNDTQIASSIDWRKKGGVSPVKNQGECGGCWTFSATGLME 158

Query: 359 GALFLHNGGHLVRL-SQQALIDC---SWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGG 526
               +HN    V L SQQ L+DC     G+ + GC+GG    A ++    G+ ++ +Y  
Sbjct: 159 SFNLIHNKPQNVSLYSQQQLLDCVTLENGYFSEGCEGGVPSDAVQYAADFGVLSDNEY-P 217

Query: 527 YLGQDGYCHVDNVT 568
           Y G  G C++ + T
Sbjct: 218 YTGIQGQCNITSKT 231


>UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin O;
           n=1; Monodelphis domestica|Rep: PREDICTED: similar to
           cathepsin O - Monodelphis domestica
          Length = 414

 Score = 82.6 bits (195), Expect = 6e-15
 Identities = 55/176 (31%), Positives = 82/176 (46%), Gaps = 6/176 (3%)
 Frame = +2

Query: 44  EKRLNIFRQSLR---YIHS-NNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLP 211
           E R   FR+SL+   Y++S ++  N      +N  +    +E   +    Y    P  LP
Sbjct: 131 ENRSTAFRESLKRHHYLNSFSSSDNTSAIYGINQFSYLFPEEFKDI----YLRSKPSVLP 186

Query: 212 FPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHL 391
                 ++    + LP   DWR    VT V++Q +CG CW+F  VG++E A +   G  L
Sbjct: 187 LYSEALKMPTTHMPLPVRFDWRDKHVVTKVRNQQMCGGCWAFSVVGSIESA-YAIKGESL 245

Query: 392 VRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH--GLPTEEDYGGYLGQDGYCH 553
             LS Q +IDCS  + N GC GG    A  W+ +    L  + +Y  +  Q G CH
Sbjct: 246 EDLSVQQVIDCS--YNNFGCSGGSTVNALNWLNKTQVRLVKDSEY-SFKAQTGLCH 298


>UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza
           sativa|Rep: OSJNBb0085F13.15 protein - Oryza sativa
           subsp. japonica (Rice)
          Length = 383

 Score = 82.6 bits (195), Expect = 6e-15
 Identities = 60/197 (30%), Positives = 88/197 (44%), Gaps = 17/197 (8%)
 Frame = +2

Query: 11  HQRQYASDLEHEKRLNIFRQSLRYIHSNNR-ANRGFTMSVNHLADRTDDE-LAALRGRRY 184
           H R YAS  E  +R  ++R ++ +I + NR  +  F +      D T +E LA   G   
Sbjct: 63  HNRSYASADEKLRRFEVYRSNMEFIEATNRNGSLTFKLGETPFTDLTHEEFLATYTGDVR 122

Query: 185 SGPSPHGLPFPYSK--------------SRVEELSVKLPPEHDWRLFGAVTPVKDQSVCG 322
             P   G+     +              +     +V +P   DWR  GAVTP K Q  C 
Sbjct: 123 LPPERRGMQDDSDEEDAVITTSAGYVAGAGAGRRTVAVPESVDWRKEGAVTPAKHQGQCA 182

Query: 323 SCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI-KRHG 499
           +CW+F  V A+E +L    GG L+ LS+Q L+DC    G   C  G    A+ W+ K  G
Sbjct: 183 ACWAFAAVAAIE-SLHKIKGGDLISLSEQELVDCD-DTGEATCSKGYSDDAFLWVSKNKG 240

Query: 500 LPTEEDYGGYLGQDGYC 550
           + ++  Y  Y+G    C
Sbjct: 241 IASDLIY-PYVGHKESC 256


>UniRef50_P43234 Cluster: Cathepsin O precursor; n=22;
           Euteleostomi|Rep: Cathepsin O precursor - Homo sapiens
           (Human)
          Length = 321

 Score = 82.6 bits (195), Expect = 6e-15
 Identities = 56/175 (32%), Positives = 83/175 (47%), Gaps = 5/175 (2%)
 Frame = +2

Query: 44  EKRLNIFRQSL---RYIHSNNRANRGFTM-SVNHLADRTDDELAALRGRRYSGPSPHGLP 211
           E+    FR+SL   RY++S   +        +N  +    +E  A+    Y    P   P
Sbjct: 38  EREAAAFRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAI----YLRSKPSKFP 93

Query: 212 FPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHL 391
              ++  +   +V LP   DWR    VT V++Q +CG CW+F  VGAVE A +   G  L
Sbjct: 94  RYSAEVHMSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESA-YAIKGKPL 152

Query: 392 VRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYG-GYLGQDGYCH 553
             LS Q +IDCS  + N GC+GG    A  W+ +  +   +D    +  Q+G CH
Sbjct: 153 EDLSVQQVIDCS--YNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCH 205


>UniRef50_Q22AB1 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 344

 Score = 82.2 bits (194), Expect = 8e-15
 Identities = 41/103 (39%), Positives = 57/103 (55%), Gaps = 1/103 (0%)
 Frame = +2

Query: 254 LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWG 433
           LP   DWR  G +TP K Q+ CGSCW+F T G +E    L   G L+  S+Q L+DC   
Sbjct: 131 LPESFDWRDKGIITPAKFQNTCGSCWTFATTGVIESQYAL-KYGELLHFSEQMLLDCD-- 187

Query: 434 FGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLGQDGYCHVD 559
             N GC GG    AY+++++  G+ T + YG Y  +   C+ D
Sbjct: 188 NINQGCRGGLMTDAYQFLQQSGGIQTADTYGDYKNKKDICNFD 230


>UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 350

 Score = 81.8 bits (193), Expect = 1e-14
 Identities = 57/190 (30%), Positives = 89/190 (46%), Gaps = 8/190 (4%)
 Frame = +2

Query: 17  RQYASDLEHEKRLNIFRQSLRYIHSNNR-ANRGFTMSVNHLADRTDDELA--ALRGRRYS 187
           R Y S+ E   R  +F Q+ + I  +N  +N  + +  N  +D T DE A   L  +  +
Sbjct: 56  RTYLSEEERTYRQIVFLQNDQNIQKHNSDSNNTYKLQHNQFSDMTKDEFAHRVLNSQLKT 115

Query: 188 GPSPHGLPFPYSKSRVE-ELSVKLPPEHDWRLF-GAVTPVKDQSVCGSCWSFGTVGAVEG 361
             S    P    + R   + S+      DWR + G +  VK+Q  CGSCW+F T G +E 
Sbjct: 116 SASSSSQPAQTPQLRGSVDASLNASQGFDWRNYQGVLGNVKNQGQCGSCWTFATAGVLES 175

Query: 362 ALFLHNGGHLVRLSQQALIDC---SWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYL 532
              L     L+  S+Q ++DC   S+G+ ++GC+GG      ++    GL  + DY  Y+
Sbjct: 176 YYALKYQQSLI-FSEQDIVDCASRSYGYQSDGCNGGFPSEGLQYASTVGL-VQSDYYPYV 233

Query: 533 GQDGYCHVDN 562
              G C   N
Sbjct: 234 AVQGTCRQVN 243


>UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1;
           Phytophthora infestans|Rep: Cathepsin-like cysteine
           protease - Phytophthora infestans (Potato late blight
           fungus)
          Length = 376

 Score = 81.8 bits (193), Expect = 1e-14
 Identities = 67/200 (33%), Positives = 93/200 (46%), Gaps = 12/200 (6%)
 Frame = +2

Query: 11  HQRQYASDL-EHEK---RLNIFRQSLRYIHSNNRA-NRG---FTMSVNHLADRTDDELAA 166
           +++ Y +D  +H+    R   F  +L  I ++N A  RG   FT+ +N LAD  D E   
Sbjct: 47  YEKSYRNDANDHDVVQLRFRSFATNLERIQTHNEAYERGEHSFTLGLNDLADLADAEYKQ 106

Query: 167 LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
           L   R            + K    E    LP   DWR    VTPVK+Q  CGSCW+F  V
Sbjct: 107 LLSYRTRDSKSSSASETFVKPENVE---DLPATWDWREHSTVTPVKNQGQCGSCWAFSAV 163

Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCD-GGEDFRAYEWI---KRHGLPTEE 514
            A+E A  L + G L  LS+Q L+DC+   G + C+ GGE    YE I    +  +  EE
Sbjct: 164 AAMECAYAL-STGTLESLSEQELVDCTLN-GIDTCNHGGEMSEGYEEIITNHKGKIDREE 221

Query: 515 DYGGYLGQDGYCHVDNVTAV 574
            Y       G C+  +  A+
Sbjct: 222 VYRYTAESKGVCNAKDDKAI 241


>UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium
           (Vinckeia)|Rep: Berghepain-2 - Plasmodium yoelii yoelii
          Length = 472

 Score = 81.8 bits (193), Expect = 1e-14
 Identities = 55/182 (30%), Positives = 92/182 (50%), Gaps = 7/182 (3%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDE--LAALRGR- 178
           K+ ++Y+S  E ++R  IF + L+ I  +N+ N  +T  +N  +D   +E  +  L  + 
Sbjct: 162 KYNKEYSSAEEMQERFYIFSEKLKKIEKHNKENHLYTKGINAFSDMRHEEFKMKYLNNKL 221

Query: 179 --RYSGPSPHGLPFPYSKSRVEELSVKLP-PEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
              +     H +P+  + ++ +  + ++     DWR   A+  +KDQ  C SCW+F T G
Sbjct: 222 KENHQIDLRHLIPYTIAINKYKSPTDQINYTSFDWRDHNAIIDIKDQQKCASCWAFATAG 281

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYE-WIKRHGLPTEEDYGG 526
            V  A +       V LS+Q L+DC+    N GCDGG    A+E  I  +GL  E+ Y  
Sbjct: 282 VV-AAQYAIRKNQKVSLSEQQLVDCAQ--NNFGCDGGILPYAFEDLIDMNGL-CEDKYYP 337

Query: 527 YL 532
           Y+
Sbjct: 338 YV 339


>UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheirus
           salmonis|Rep: Cysteine proteinase - Lepeophtheirus
           salmonis (salmon louse)
          Length = 372

 Score = 81.8 bits (193), Expect = 1e-14
 Identities = 59/202 (29%), Positives = 97/202 (48%), Gaps = 12/202 (5%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRGFTMSVNHLADRTDDELAALRGRRY 184
           ++ + Y +      +L +F  +LR I  +N    R + M +N  +D TD+E  +    +Y
Sbjct: 33  EYSKSYHNRALRSLKLKVFVDNLREIEEHNANPKRTWDMGINEFSDLTDEEFES----KY 88

Query: 185 SGPSP--HGLPFPYSKSRVEELSVK-LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
            G SP           +  ++ ++K LP   DWR  G +T VK+Q  CGSCW F  V  +
Sbjct: 89  MGYSPMSSSAGLVTRTAAPKQGNIKDLPESVDWREKGVITDVKNQGSCGSCWVFSAVEQI 148

Query: 356 EGALFLHNG-GHLVRLSQQALIDCSWG----FGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
           E  + + N       LS Q +  CS       G+ GC G  +  AY + + +G+ TE++Y
Sbjct: 149 ESYVAIENNMTSPPLLSTQQITSCSSNPYSCGGSGGCKGAINEIAYMYTQLYGIETEKEY 208

Query: 521 ---GGYLGQDGYCHVDNVTAVT 577
               G+  + G C + N ++VT
Sbjct: 209 PYTSGFTEESGEC-LYNASSVT 229


>UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin W,
           partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED:
           similar to Cathepsin W, partial - Ornithorhynchus
           anatinus
          Length = 229

 Score = 81.4 bits (192), Expect = 1e-14
 Identities = 56/162 (34%), Positives = 77/162 (47%), Gaps = 4/162 (2%)
 Frame = +2

Query: 47  KRLNIFRQSLRYIHSNNRANRGFT-MSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYS 223
           +R  IF Q+L         + G     V   +D +++E  +L   R+      G+P  ++
Sbjct: 3   RRFKIFVQNLARARKLQEEDLGTAEYGVTPFSDLSEEEFLSLYAPRF------GMPSGWA 56

Query: 224 KSRVEELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRL 400
                     L  E  DWR  GA+T VK+Q  CGSCW+F  VG  E   +L  G  LV L
Sbjct: 57  NQMASIPEGPLRKETCDWRKRGAITSVKNQGSCGSCWAFAAVGNAESMWYLRAGKRLVSL 116

Query: 401 SQQALIDCSWGFGNNGCDGG--EDFRAYEWIKRHGLPTEEDY 520
           S Q ++DC  G   +GC GG  ED     W  R GL +E+DY
Sbjct: 117 SVQEVLDC--GRCRDGCQGGYPEDAFVTMWFNR-GLASEKDY 155


>UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin O
           precursor; n=2; Apocrita|Rep: PREDICTED: similar to
           Cathepsin O precursor - Apis mellifera
          Length = 374

 Score = 81.4 bits (192), Expect = 1e-14
 Identities = 55/204 (26%), Positives = 98/204 (48%), Gaps = 13/204 (6%)
 Frame = +2

Query: 5   VKHQRQYASD-LEHEKRLNIFRQSLRYIHSNN---RANRGFTMSVNHLADRTDDELAA-- 166
           +++ + Y ++  E+E+R   F++SL++I   N    +       +   +D +++E     
Sbjct: 62  IRYNKSYRNNPSEYEERFKRFQRSLQHIERMNGLRSSQESAYYGLTEFSDMSENEFLLHT 121

Query: 167 ------LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSC 328
                 +RG ++   S H      S  R++  S+ +P   DWR  G +TPV+ Q  CG+C
Sbjct: 122 LLPDLPIRGEKHMNASYHR-KHQISIDRMKR-SISIPLRFDWRDKGVITPVRSQGSCGAC 179

Query: 329 WSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLP- 505
           W+F T+  +E ++F    G L  LS Q +IDC+    N GC+GG+      W+    +  
Sbjct: 180 WAFSTIEVIE-SMFAIKNGTLHSLSVQEMIDCAKN-SNFGCEGGDICSLLSWLLISKVQI 237

Query: 506 TEEDYGGYLGQDGYCHVDNVTAVT 577
            +E     +G  G C +  +T  T
Sbjct: 238 LQESIYPLVGMTGTCKLGKMTDKT 261


>UniRef50_Q23H10 Cluster: Papain family cysteine protease containing
           protein; n=14; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 336

 Score = 81.4 bits (192), Expect = 1e-14
 Identities = 57/196 (29%), Positives = 95/196 (48%), Gaps = 13/196 (6%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNR-ANRGFTMSVNHLADRTDDELAA------ 166
           ++QR Y ++ E   R  +F ++ + I  +N   N  +++ +N  +D T +E A       
Sbjct: 35  QNQRVYLNEHEKLFRQMVFFENFQKIQEHNSDPNNTYSVHLNQFSDMTKEEFAEKILMKS 94

Query: 167 -LRGRRYSGPSPHGL--PFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSF 337
            L      G S          +++++   S+ L    DWR  GAVT VK+Q  CGSCWSF
Sbjct: 95  DLVDHLMKGISQEATHNDTNNNETQLSSNSLTLADSIDWRTKGAVTSVKNQGGCGSCWSF 154

Query: 338 GTVGAVEGALFLHNGGHLVRLSQQALIDC---SWGFGNNGCDGGEDFRAYEWIKRHGLPT 508
                +E   F+ N   LV  S+Q L+DC   + G+ + GC+GG   +  ++  + G+ T
Sbjct: 155 SAAAVMESFNFIQNKA-LVDFSEQQLVDCVIPANGYNSYGCNGGWPVQCLDYASKVGITT 213

Query: 509 EEDYGGYLGQDGYCHV 556
            + Y  Y+     C+V
Sbjct: 214 LDKY-PYVAVQKNCNV 228


>UniRef50_A7APS9 Cluster: Papain family cysteine protease containing
           protein; n=1; Babesia bovis|Rep: Papain family cysteine
           protease containing protein - Babesia bovis
          Length = 435

 Score = 81.4 bits (192), Expect = 1e-14
 Identities = 61/186 (32%), Positives = 86/186 (46%), Gaps = 17/186 (9%)
 Frame = +2

Query: 44  EKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPS-PHGLPF-- 214
           E+    +R   R    N   ++ +TM +N  AD T ++  +L+G R S      G+P   
Sbjct: 139 ERFATFYRNVTRIREFNMNVHKTYTMKINQFADMTPEQFMSLQGTRASKIRVSKGIPDSQ 198

Query: 215 ---------PYSKSRVEELSVK---LPPEH--DWRLFGAVTPVKDQSVCGSCWSFGTVGA 352
                    P  KS V +   +   + PE   D R    +TPVKDQ  CGSCW+F  +G 
Sbjct: 199 VAAVGNQKGPNLKSEVRQTGNRFADISPEDFIDLRKDNYMTPVKDQGNCGSCWAFSLIGV 258

Query: 353 VEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYL 532
            E   F H     V LS+Q L+DC      +GCD G  + AYE+I+ HG+     Y  Y 
Sbjct: 259 AE-PFFKHKRDIDVVLSEQNLVDCVKEC--HGCDYGNSYFAYEYIRDHGVYRLASY-PYT 314

Query: 533 GQDGYC 550
            + G C
Sbjct: 315 AKSGPC 320


>UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 664

 Score = 81.0 bits (191), Expect = 2e-14
 Identities = 47/123 (38%), Positives = 68/123 (55%), Gaps = 2/123 (1%)
 Frame = +2

Query: 221 SKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRL 400
           SKSR+  L    P   DWR +G V+ VK+Q  CGSC++F TVGA+E   +  N   ++ L
Sbjct: 461 SKSRL--LKWSRPISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALESHYYRKN-NRMLDL 517

Query: 401 SQQALIDC--SWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYCHVDNVTAV 574
           S+Q L+DC  S  + N GC GG     Y +I+ +G   +E    Y G+ G C  ++  A 
Sbjct: 518 SEQNLVDCTASNKYRNGGCSGGWMHNCYSYIQENGGINQESTYPYEGKFGQCRYNSGDAQ 577

Query: 575 TSI 583
           + I
Sbjct: 578 SRI 580


>UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 339

 Score = 80.6 bits (190), Expect = 3e-14
 Identities = 42/92 (45%), Positives = 61/92 (66%), Gaps = 4/92 (4%)
 Frame = +2

Query: 269 DWRLFGAVTPVKDQSVC-GSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNN 445
           DWR F AVTPVK+Q +C G+ +SF  +G +E + F+ N   L+ LS+Q +IDC+   GNN
Sbjct: 119 DWRNFDAVTPVKNQGLCSGAGYSFSAIGVIESSHFIKNK-ELITLSEQNIIDCTTDMGNN 177

Query: 446 GCDGGEDFRAYEW-IKRHGLPTEED--YGGYL 532
           GC GG    A+++ IK+ G+ +E +  Y GYL
Sbjct: 178 GCMGGLALIAFDYIIKQKGIDSEFNYPYEGYL 209


>UniRef50_Q23FQ5 Cluster: Papain family cysteine protease containing
           protein; n=4; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 332

 Score = 80.6 bits (190), Expect = 3e-14
 Identities = 47/183 (25%), Positives = 90/183 (49%), Gaps = 3/183 (1%)
 Frame = +2

Query: 11  HQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRG-FTMSVNHLADRTDDELAALRGRRYS 187
           ++R + ++ E   R  +F ++L+ + ++ +     +T+S+N  +D + +E       ++ 
Sbjct: 43  YRRVFLNEDEETYRQLVFFENLQKLKTHEKNTEATYTVSLNQFSDYSQEEFVQRILNKHI 102

Query: 188 GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGAL 367
             S   +      +     +V  P   DWR  GA+ P+++Q  CGSC +FGT G +E   
Sbjct: 103 SRSDADIQKEQEPNGNLRKAVNYPTSVDWRNSGALNPIQNQGQCGSCAAFGTAGVLESFY 162

Query: 368 FLHNGGHLVRLSQQALIDCS--WGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQD 541
           +L     L++ S+Q L+DC+   GF   GCDG      +++  ++G+     Y  Y+G  
Sbjct: 163 YL-KSKQLLKFSEQQLLDCARQAGFDTYGCDGAWQQEYFKYAIKYGIVQGSSY-PYVGYQ 220

Query: 542 GYC 550
             C
Sbjct: 221 TTC 223


>UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5;
           Theileria|Rep: Cysteine proteinase precursor - Theileria
           parva
          Length = 440

 Score = 80.6 bits (190), Expect = 3e-14
 Identities = 61/187 (32%), Positives = 92/187 (49%), Gaps = 17/187 (9%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAAL------ 169
           K+ R++A+  E   RL  FR +   +    + +  +   +N  +D T+ E   L      
Sbjct: 131 KYNRRHATQQERLNRLVTFRSNYLEV-KEQKGDEPYVKGINRFSDLTEREFYKLFPVMKP 189

Query: 170 RGRRYSGPS---PHGLPFPYSKSRVEELSV-------KLPPEH-DWRLFGAVTPVKDQSV 316
               YS       H     Y K+  + L+        KL  E+ DWR   +VT VKDQS 
Sbjct: 190 PKATYSNGYYLLSHMANKTYLKNLKKALNTDEDVDLAKLTGENLDWRRSSSVTSVKDQSN 249

Query: 317 CGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH 496
           CG CW+F TVG+VEG  ++ +      LS Q L+DC   F +NGC GG    AYE+++++
Sbjct: 250 CGGCWAFSTVGSVEG-YYMSHFDKSYELSVQELLDCD-SF-SNGCQGGLLESAYEYVRKY 306

Query: 497 GLPTEED 517
           GL + +D
Sbjct: 307 GLVSAKD 313


>UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1;
           Oryza sativa (japonica cultivar-group)|Rep: Putative
           uncharacterized protein - Oryza sativa subsp. japonica
           (Rice)
          Length = 326

 Score = 80.2 bits (189), Expect = 3e-14
 Identities = 47/124 (37%), Positives = 63/124 (50%), Gaps = 4/124 (3%)
 Frame = +2

Query: 50  RLNIFRQSLRYIHSNNRAN-RGFTMSVNHLADRTDDELAALRGRRYSGPSPH---GLPFP 217
           R  +F+++ RYIH  NR     + + +N  AD T +E  A    +Y+G +P    GL   
Sbjct: 49  RFEVFKKNARYIHDFNRKKGMSYKLGLNKFADLTLEEFTA----KYTGANPGPITGLKNG 104

Query: 218 YSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVR 397
                +  ++   PP  DWR  GAVT VKDQ  CGSCW+F  V AVEG   +  G  L  
Sbjct: 105 TGSPPLAAVAGDAPPAWDWREHGAVTRVKDQGPCGSCWAFSVVEAVEGINEIMTGNFLTL 164

Query: 398 LSQQ 409
             QQ
Sbjct: 165 SEQQ 168


>UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 293

 Score = 80.2 bits (189), Expect = 3e-14
 Identities = 52/181 (28%), Positives = 88/181 (48%), Gaps = 3/181 (1%)
 Frame = +2

Query: 38  EHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFP 217
           E+  RL I+  ++RYI  +N+A   + +  N  A  T  E  ++  +      P  L   
Sbjct: 12  EYAFRLGIYLSNMRYIKEHNKAGSSYKLEGNRFAAFTPAEYRSMLSK------PKSLAKK 65

Query: 218 YSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVR 397
           +  + ++     +P E DWR  G VTPV+ Q  CG+ W+F +    E    +++ G L  
Sbjct: 66  FESAPLKHKEGAIPAEFDWRTKGVVTPVRYQEGCGAGWAFASAALQESMWAIYDRG-LAH 124

Query: 398 LSQQALIDCSWGFGNNGCDGGEDFRA--YEWIKRHGL-PTEEDYGGYLGQDGYCHVDNVT 568
           LS Q L+DC   + ++GCDGG    A  +  + ++G+  ++ DY  +    G C  D+  
Sbjct: 125 LSVQQLLDCD--YNDDGCDGGSSDGASYFVLLNQYGMWMSDSDY-PFKPYVGECKFDSSM 181

Query: 569 A 571
           A
Sbjct: 182 A 182


>UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2;
           Endopterygota|Rep: Cathepsin L-like proteinase - Bombyx
           mori (Silk moth)
          Length = 402

 Score = 80.2 bits (189), Expect = 3e-14
 Identities = 58/196 (29%), Positives = 89/196 (45%), Gaps = 9/196 (4%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRAN----RGFTMSVNHLADRTDDE---- 157
           K  H + Y+S       L  +RQ+LR +  +NR      + +++ +NH  D    E    
Sbjct: 104 KAIHNKLYSSTHHEMAALMKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTEYFGK 163

Query: 158 -LAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWS 334
            L  ++      P+       Y  +R      K+P   DWR  G     ++Q  CG+C++
Sbjct: 164 VLKLIKAFPLFDPAEDHHKTAYRHNR----RCKVPKRIDWRDQGFKPRREEQWQCGACYA 219

Query: 335 FGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEE 514
           F    A++  L+  +G     LS Q ++DCS   GN GCDGG    A  +  R GL  E 
Sbjct: 220 FAVTHALQAQLYKRHG-EWNELSPQQIVDCSIKDGNMGCDGGSLRGALRYAAREGLVMES 278

Query: 515 DYGGYLGQDGYCHVDN 562
            Y  Y+G+ GYC  D+
Sbjct: 279 HY-PYVGKKGYCRYDS 293


>UniRef50_P56202 Cluster: Cathepsin W precursor; n=15; Eutheria|Rep:
           Cathepsin W precursor - Homo sapiens (Human)
          Length = 376

 Score = 80.2 bits (189), Expect = 3e-14
 Identities = 57/177 (32%), Positives = 86/177 (48%), Gaps = 4/177 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFT-MSVNHLADRTDDELAALRG- 175
           +++  R Y S  EH  RL+IF  +L         + G     V   +D T++E   L G 
Sbjct: 46  QIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQLYGY 105

Query: 176 RRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWR-LFGAVTPVKDQSVCGSCWSFGTVGA 352
           RR +G    G+P    + R EE    +P   DWR + GA++P+KDQ  C  CW+    G 
Sbjct: 106 RRAAG----GVPSMGREIRSEEPEESVPFSCDWRKVAGAISPIKDQKNCNCCWAMAAAGN 161

Query: 353 VEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAY-EWIKRHGLPTEEDY 520
           +E  L+  +    V +S   L+DC  G   +GC GG  + A+   +   GL +E+DY
Sbjct: 162 IE-TLWRISFWDFVDVSVHELLDC--GRCGDGCHGGFVWDAFITVLNNSGLASEKDY 215


>UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep:
           Vivapain-4 - Plasmodium vivax
          Length = 484

 Score = 79.8 bits (188), Expect = 4e-14
 Identities = 59/185 (31%), Positives = 98/185 (52%), Gaps = 9/185 (4%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRGFTMSVNHLADRTDDE----LAALR 172
           +H ++Y ++ E ++R   F ++L  I+S+N +AN  +    N  +D + +E    +  LR
Sbjct: 172 EHGKKYKTEEEMQQRYLAFTENLARINSHNSKANILYKKGTNQYSDISFEEFRKTMLTLR 231

Query: 173 G--RRYSGPSPHGLPFPYSKSRVEELSVKLPPE-HDWRLFGAVTPVKDQSVCGSCWSFGT 343
              ++    SP+   +     + +     +  E +DWR   AV+ +K+Q++CGSCW+FG 
Sbjct: 232 FDLKKKLANSPYVSNYDDVLKKYKPADAVVDNEKYDWREHNAVSEIKNQNLCGSCWAFGA 291

Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAY-EWIKRHGLPTEEDY 520
           VGAVE    +    H V +S+Q L+DCS    N GC GG    A+ + I    L +E DY
Sbjct: 292 VGAVESQYAIRKNQH-VLISEQELVDCS--DKNFGCFGGLASLAFDDMIDLGYLCSESDY 348

Query: 521 GGYLG 535
             Y+G
Sbjct: 349 -PYVG 352


>UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 291

 Score = 79.8 bits (188), Expect = 4e-14
 Identities = 59/181 (32%), Positives = 88/181 (48%), Gaps = 2/181 (1%)
 Frame = +2

Query: 38  EHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFP 217
           E++ R  I+  +  ++ ++N+AN  + +S+N L+  T  E  +L G +        L   
Sbjct: 12  EYKFRFGIWMANKNFVETHNKANANYKLSLNSLSHLTPTEYQSLLGTKID----KNLVSQ 67

Query: 218 YSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVR 397
             K R +      P   D+R  G V P++DQ  CGSCW+FGTV A E    L    +L +
Sbjct: 68  GKKVRPQIKDS--PGILDYREMGVVNPIRDQKQCGSCWAFGTVAACESNYALLY-SNLPQ 124

Query: 398 LSQQALIDCSWGFGNNGCDGGEDFRAYEWI--KRHGLPTEEDYGGYLGQDGYCHVDNVTA 571
           LS+Q +IDC+      GC GG    A  +I  K+ G   +     Y G DG C  D  TA
Sbjct: 125 LSEQNIIDCATTC--YGCGGGIIQAAMSFIINKQGGAIMKLSDYPYQGVDGACKFDAKTA 182

Query: 572 V 574
           +
Sbjct: 183 M 183


>UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184,
           whole genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_184,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 331

 Score = 79.8 bits (188), Expect = 4e-14
 Identities = 59/190 (31%), Positives = 91/190 (47%), Gaps = 7/190 (3%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNR----ANRGFTMSVNHLADRTDDELAAL 169
           K  H ++Y SD E   R ++F Q+L  +  +N         FT+ +N  AD T +E  A 
Sbjct: 38  KQLHGKRY-SDFEEVHRFSVFAQNLAVVMEHNSKFELGQETFTLGMNQYADLTPEEFQAS 96

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
                +          YS        +  P   DW+    +T VK+Q  CGSCW+F    
Sbjct: 97  FLTLKTKVQDRKNVKSYS-------GLSFPDTVDWK--DGLT-VKNQGSCGSCWAFAAAA 146

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCS---WGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
           A+E A F H+  + V +S+Q  +DC+    G+ + GC+GG    A+++   +G+ TEE+Y
Sbjct: 147 AIE-AGFQHHKKNKVNISEQEFVDCTTEKLGYESQGCNGGWMDDAFDYTVNYGVTTEEEY 205

Query: 521 GGYLGQDGYC 550
             Y G D  C
Sbjct: 206 -PYKGVDQPC 214


>UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin O;
           n=1; Danio rerio|Rep: PREDICTED: similar to cathepsin O
           - Danio rerio
          Length = 327

 Score = 79.4 bits (187), Expect = 6e-14
 Identities = 45/115 (39%), Positives = 62/115 (53%), Gaps = 2/115 (1%)
 Frame = +2

Query: 212 FPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHL 391
           F  SKS ++ +    PP  DWR  G V PV +Q  CG CW+F  V A+E ++    G  L
Sbjct: 107 FDQSKSEIK-VKANNPPRFDWRDHGVVGPVHNQGSCGGCWAFSIVEAIE-SVSAKVGEKL 164

Query: 392 VRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLP--TEEDYGGYLGQDGYC 550
            +LS Q +IDCS  + N GC+GG    A  W+ +  L   +E +Y  + G DG C
Sbjct: 165 QQLSVQQVIDCS--YQNQGCNGGSPVEALYWLTQSKLKLVSEAEY-PFKGADGVC 216


>UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Rep:
           Cysteine protease - Babesia equi
          Length = 438

 Score = 79.4 bits (187), Expect = 6e-14
 Identities = 38/83 (45%), Positives = 52/83 (62%)
 Frame = +2

Query: 269 DWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNG 448
           DWR    VTPVKDQ  CGSCW+F  VG+VE +L+L   G  + LS+Q L++C     +NG
Sbjct: 229 DWRKLNGVTPVKDQGNCGSCWAFAAVGSVE-SLYLIKKGQALDLSEQELVNCE--ENSNG 285

Query: 449 CDGGEDFRAYEWIKRHGLPTEED 517
           C+G    +A E+IK  G+   +D
Sbjct: 286 CEGDLPNKALEYIKAKGISHSKD 308


>UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila
           melanogaster|Rep: CG11459-PA - Drosophila melanogaster
           (Fruit fly)
          Length = 336

 Score = 79.0 bits (186), Expect = 8e-14
 Identities = 58/183 (31%), Positives = 94/183 (51%), Gaps = 10/183 (5%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNR----ANRGFTMSVNHLADRTDDELAAL 169
           K K+ +QY +  ++ + L  + Q +  + S+N+        F M +N  +D TD  +  L
Sbjct: 34  KAKYNKQYRNRDKYHRAL--YEQRVLAVESHNQLYLQGKVAFKMGLNKFSD-TDQRI--L 88

Query: 170 RGRRYSGPSP-----HGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSV-CGSCW 331
              R S P+P     + L    +  R ++++  +    DWR +G ++PV DQ   C SCW
Sbjct: 89  FNYRSSIPAPLETSTNALTETVNYKRYDQITEGI----DWRQYGYISPVGDQGTECLSCW 144

Query: 332 SFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTE 511
           +F T G +E A      G+LV LS + L+DC   + NNGC GG    A+ + + HG+ T+
Sbjct: 145 AFSTSGVLE-AHMAKKYGNLVPLSPKHLVDCV-PYPNNGCSGGWVSVAFNYTRDHGIATK 202

Query: 512 EDY 520
           E Y
Sbjct: 203 ESY 205


>UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteinase
           A; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like
           tick cysteine proteinase A - Haemaphysalis longicornis
           (Bush tick)
          Length = 312

 Score = 79.0 bits (186), Expect = 8e-14
 Identities = 38/88 (43%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
 Frame = +2

Query: 254 LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWG 433
           LP   DW   G+  PVK+Q  CGSCW+F T G++EG  F      +    +Q L+DCS  
Sbjct: 93  LPTTVDWAQEGSRAPVKNQGQCGSCWAFSTTGSLEGQHFRKTESRVT--GEQNLVDCSDD 150

Query: 434 FGNNGCDGGEDFRAYEWIKRH-GLPTEE 514
           FGN GC+GG     +++IK + G+ TEE
Sbjct: 151 FGNQGCNGGLMDNGFQYIKANGGIDTEE 178


>UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1
           precursor; n=20; Psoroptidia|Rep: Major mite fecal
           allergen Der f 1 precursor - Dermatophagoides farinae
           (House-dust mite)
          Length = 321

 Score = 79.0 bits (186), Expect = 8e-14
 Identities = 60/191 (31%), Positives = 91/191 (47%), Gaps = 4/191 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELA---ALR 172
           K    + YA+  E E     F +SL+Y+     AN+G   ++NHL+D + DE      + 
Sbjct: 30  KKAFNKNYATVEEEEVARKNFLESLKYVE----ANKG---AINHLSDLSLDEFKNRYLMS 82

Query: 173 GRRYSG-PSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
              +    +   L    S  R+   SV +P E D R    VTP++ Q  CGSCW+F  V 
Sbjct: 83  AEAFEQLKTQFDLNAETSACRIN--SVNVPSELDLRSLRTVTPIRMQGGCGSCWAFSGVA 140

Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGY 529
           A E A   +    L  LS+Q L+DC+     +GC G    R  E+I+++G+  E  Y  Y
Sbjct: 141 ATESAYLAYRNTSL-DLSEQELVDCA---SQHGCHGDTIPRGIEYIQQNGVVEERSY-PY 195

Query: 530 LGQDGYCHVDN 562
           + ++  C   N
Sbjct: 196 VAREQRCRRPN 206


>UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome
           shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
           Chromosome 20 SCAF14744, whole genome shotgun sequence -
           Tetraodon nigroviridis (Green puffer)
          Length = 175

 Score = 78.6 bits (185), Expect = 1e-13
 Identities = 37/78 (47%), Positives = 47/78 (60%)
 Frame = +2

Query: 254 LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWG 433
           LP   DWR    V PV++Q  CGSCW+F  VGAV+ ++       LV LS Q ++DCS  
Sbjct: 59  LPARFDWRDNAVVGPVQNQQACGSCWAFSVVGAVQ-SVHAIGSSPLVELSVQQVLDCS-- 115

Query: 434 FGNNGCDGGEDFRAYEWI 487
           F NNGCDGG    A +W+
Sbjct: 116 FQNNGCDGGTPINALKWL 133


>UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia
           tropicalis|Rep: Blo t 1 allergen - Blomia tropicalis
           (Mite)
          Length = 333

 Score = 78.6 bits (185), Expect = 1e-13
 Identities = 53/189 (28%), Positives = 91/189 (48%), Gaps = 10/189 (5%)
 Frame = +2

Query: 23  YASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPH 202
           Y +  E  +R + F++ L+++  +N  + G   ++N  +D ++ E +       SG    
Sbjct: 39  YRNAEEEARREHHFKEQLKWVEEHNGID-GVEYAINEYSDMSEQEFSF----HLSGG--- 90

Query: 203 GLPFPYSKSRVEELSV-----KLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGAL 367
           GL F Y K    +  +      LP   DWR    +T ++ Q  CGSCW+F   G  E +L
Sbjct: 91  GLNFTYMKMEAAKEPLINTYGSLPQNFDWRQKARLTRIRQQGSCGSCWAFAAAGVAE-SL 149

Query: 368 FLHNGGHLVRLSQQALIDCSW-----GFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYL 532
           +       + LS+Q L+DC++      +  NGC  G    A++++ R GL  EE+Y  Y 
Sbjct: 150 YSIQKQQSIELSEQELVDCTYNRYDSSYQCNGCGSGYSTEAFKYMIRTGLVEEENY-PYN 208

Query: 533 GQDGYCHVD 559
            +  +C+ D
Sbjct: 209 MRTQWCNPD 217


>UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, whole
           genome shotgun sequence; n=3; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_2,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 376

 Score = 78.6 bits (185), Expect = 1e-13
 Identities = 41/108 (37%), Positives = 59/108 (54%)
 Frame = +2

Query: 227 SRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQ 406
           ++ E+ + K PP  DW     VT V+ Q  CGSCW+F     V   L + N   L +LS+
Sbjct: 156 TKTEKATPKNPPSLDW--LKQVTEVQQQGRCGSCWAFAVQDVVISRLAIANKNKLDQLSK 213

Query: 407 QALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYC 550
             LIDC+ G    GCDGG    A+++I ++G   E+DY  Y  ++G C
Sbjct: 214 THLIDCADG-NTEGCDGGSVSDAFDFINKYGTVYEKDYREYDQKEGQC 260


>UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2;
           Entamoeba|Rep: Cysteine proteinase ACP1 precursor -
           Entamoeba histolytica
          Length = 308

 Score = 78.6 bits (185), Expect = 1e-13
 Identities = 64/193 (33%), Positives = 93/193 (48%), Gaps = 4/193 (2%)
 Frame = +2

Query: 11  HQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALR-GRRYS 187
           H + +A+  E+  R  +F  + +++ +N  AN      +N  AD T +E      G  Y 
Sbjct: 25  HNKVFANRAEYLYRFAVFLDNKKFVEAN--ANT----ELNVFADMTHEEFIQTHLGMTYE 78

Query: 188 GPSPHGLPFPYSKSRVEELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
            P         + S V+  +VK  PE  DWR    + P KDQ  CGSCW+F T   +EG 
Sbjct: 79  VPE--------TTSNVKA-AVKAAPESVDWR--SIMNPAKDQGQCGSCWTFCTTAVLEGR 127

Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI-KRHGLPTEEDYGGYLGQD 541
           +   + G L   S+Q L+DC     +NGC+GG    + ++I + +GL  E DY  Y    
Sbjct: 128 V-NKDLGKLYSFSEQQLVDCD--ASDNGCEGGHPSNSLKFIQENNGLGLESDY-PYKAVA 183

Query: 542 GYC-HVDNVTAVT 577
           G C  V NV  VT
Sbjct: 184 GTCKKVKNVATVT 196


>UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1;
           Trypanosoma cruzi|Rep: Cysteine protease, putative -
           Trypanosoma cruzi
          Length = 434

 Score = 78.2 bits (184), Expect = 1e-13
 Identities = 50/168 (29%), Positives = 80/168 (47%), Gaps = 7/168 (4%)
 Frame = +2

Query: 17  RQYASDLEHEKRLNIFRQSLRYIHSNNRA-NRGFTMSVNHLADRTDDELAALRGRRYSGP 193
           ++YA   EH KR  IF+++L  + + N A  R + + +N  +D T +E  A    R + P
Sbjct: 48  KRYADPEEHRKRAAIFKENLAKVRAFNGALGRSYRLGINKFSDMTKEEFNAKFNGRVAAP 107

Query: 194 SPHGLPFPYSKSRVEELSVKLPPEHDWRLFG--AVTPVKDQSVCGSCWSFGTVGAVEGAL 367
                P    ++  +      P   +W+      +TPVKDQ  CGSCW+     +VE ++
Sbjct: 108 QSTQSP---QRAPYKRTKATFPEALNWQEAKNPVLTPVKDQGSCGSCWAHAATESVE-SM 163

Query: 368 FLHNGGHLVRLSQQALIDCSWGF----GNNGCDGGEDFRAYEWIKRHG 499
           +  + G L+ LS Q +  C        G+ GC GG    A+E+I   G
Sbjct: 164 YAISSGKLLTLSTQQITSCVNNTRKCGGSGGCGGGTAQLAWEYIMNTG 211


>UniRef50_Q23894 Cluster: Cysteine proteinase 3; n=2; Dictyostelium
           discoideum|Rep: Cysteine proteinase 3 - Dictyostelium
           discoideum (Slime mold)
          Length = 151

 Score = 78.2 bits (184), Expect = 1e-13
 Identities = 52/145 (35%), Positives = 70/145 (48%), Gaps = 4/145 (2%)
 Frame = +2

Query: 38  EHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFP 217
           E   R   F++++ Y+H+ N       + +N  AD +++E        Y G   H     
Sbjct: 4   EFMPRYEEFKKNMDYVHNWNSKGSKTVLGLNQHADLSNEEYRL----NYLGTRAHIKLNG 59

Query: 218 YSKS----RVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGG 385
           Y K     R+     K P   DWR   AVTPVKDQ  CGSC    T G+VEG   +   G
Sbjct: 60  YHKRNLGLRLNRPHFKQPLNVDWREKDAVTPVKDQGQCGSC-IISTTGSVEGVTAIKT-G 117

Query: 386 HLVRLSQQALIDCSWGFGNNGCDGG 460
            LV LS+Q ++  S  FGN GC+GG
Sbjct: 118 KLVSLSEQNILRLSSSFGNEGCNGG 142


>UniRef50_Q23FR0 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 394

 Score = 77.8 bits (183), Expect = 2e-13
 Identities = 41/103 (39%), Positives = 56/103 (54%), Gaps = 3/103 (2%)
 Frame = +2

Query: 269 DWR-LFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDC--SWGFG 439
           DWR +   + PVKDQ  CGSCW+FG  G +E    + N G L   S+Q L+DC    GF 
Sbjct: 188 DWRNVKNVLNPVKDQGQCGSCWTFGAAGVMESFNAITN-GVLKSFSEQQLVDCVHQAGFS 246

Query: 440 NNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYCHVDNVT 568
           ++GC+GG      E+  + G+ TE+ Y  Y    G C + N T
Sbjct: 247 SDGCNGGFQSDGVEYAIKFGIVTEDKY-PYTAVGGDCQISNPT 288


>UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_56,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 314

 Score = 77.8 bits (183), Expect = 2e-13
 Identities = 54/169 (31%), Positives = 84/169 (49%), Gaps = 2/169 (1%)
 Frame = +2

Query: 20  QYASDLEHEKRLNIFRQSLRYIHS-NNRANRGFTMSVNHLADRTDDELAALRGRRYSGPS 196
           ++ +  E   R  +++ +++ I   N+  N           D T++E AAL   R    S
Sbjct: 41  KFYTPAERAYRFQVYQDAMKQIQILNSEENSTTVFGETQFTDLTNEEFAALLLTRKE--S 98

Query: 197 PHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLH 376
           P  L    ++  V +  +K     DW     +T VK+Q  CGSCW+F  VGAVE  L + 
Sbjct: 99  PMNLD---AELYVPQGPLKASA--DW---SKITSVKNQGNCGSCWAFSAVGAVETLLTIK 150

Query: 377 NG-GHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
                 + LS+Q L+DC  G  NNGC+GG +    +W K++GL T++ Y
Sbjct: 151 GVISKDLWLSEQQLVDCDKG-TNNGCNGGFENLGIQWAKKNGLTTDKQY 198


>UniRef50_Q23FS6 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 330

 Score = 77.4 bits (182), Expect = 2e-13
 Identities = 60/196 (30%), Positives = 88/196 (44%), Gaps = 5/196 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAA----L 169
           K +  ++YA  +    RL +F  +   + S+       T  V    D T++E AA    L
Sbjct: 44  KSRFNKRYADPITESYRLQVFASNYLRVLSDVTG----TFGVTQFFDLTEEEFAATYLTL 99

Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
           R +R    +      P  +  V           +W   G V+ VKDQ  CGSCW+F T G
Sbjct: 100 RVQRNVNATVSSPSTPKGQYDV-----------NWVTRGKVSAVKDQGQCGSCWAFSTTG 148

Query: 350 AVEGALFLHN-GGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGG 526
           +VE AL +       + LS+Q L+DCS    N GC GG    A+E+I+   L T  +Y  
Sbjct: 149 SVESALIIAGYANQTIDLSEQQLVDCS--ATNYGCGGGWMDNAFEYIEESPLTTNSNY-P 205

Query: 527 YLGQDGYCHVDNVTAV 574
           Y+  D  C+   +  V
Sbjct: 206 YVAVDQACNSTEIYGV 221


>UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,
           partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED:
           hypothetical protein, partial - Ornithorhynchus anatinus
          Length = 224

 Score = 77.0 bits (181), Expect = 3e-13
 Identities = 52/163 (31%), Positives = 78/163 (47%), Gaps = 4/163 (2%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFT-MSVNHLADRTDDELAALRGR 178
           ++++ + Y    EH +R  IF Q+L         ++G     V   +D ++DE  +L   
Sbjct: 51  QIRYNKSYEDQAEHARRFEIFVQNLARARKLQEEDQGTAEFGVTPFSDLSEDEFLSLYAP 110

Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
           R+  P+     +    +R+    ++     DWR  GAVTPVK+Q  CGSCW+F  VG VE
Sbjct: 111 RFRMPTS----WVNQTARIPAGPLRAET-CDWRKEGAVTPVKNQGDCGSCWAFAAVGNVE 165

Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNN-GCDGGED--FRAY 478
              +L     LV LS+Q      W   N+ G + GE   FR Y
Sbjct: 166 SMWYLRASNRLVSLSEQDGGYPQWILKNSWGPEWGEKGYFRLY 208


>UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4;
           Hymenostomatida|Rep: Phospholipase A1 - Tetrahymena
           thermophila
          Length = 320

 Score = 77.0 bits (181), Expect = 3e-13
 Identities = 39/100 (39%), Positives = 59/100 (59%), Gaps = 4/100 (4%)
 Frame = +2

Query: 263 EHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGG--HLVRLSQQALIDC--SW 430
           E DW   G VTPVK+Q  CGSCW+F T+GAVE AL++   G  + + L++Q  +DC  S 
Sbjct: 115 EVDWTAKGKVTPVKNQGSCGSCWAFSTIGAVESALWIAGQGEQNTLNLAEQEQVDCAKSP 174

Query: 431 GFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYC 550
            + + GC+GG     +++I  + +    +Y  Y  +DG C
Sbjct: 175 KYDSEGCNGGWMVEGFKYIIDNKISQTANY-PYTAKDGKC 213


>UniRef50_Q3L7L2 Cluster: Sar s 1 allergen SMIPP-C Yv6008G08; n=2;
           Sarcoptes scabiei type hominis|Rep: Sar s 1 allergen
           SMIPP-C Yv6008G08 - Sarcoptes scabiei type hominis
          Length = 341

 Score = 77.0 bits (181), Expect = 3e-13
 Identities = 42/104 (40%), Positives = 58/104 (55%), Gaps = 3/104 (2%)
 Frame = +2

Query: 251 KLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALF--LHNGGHLVRLSQQALIDC 424
           KLP E D R    + PV++Q  C + W+FG +GAVE AL    H      +LS Q L+DC
Sbjct: 114 KLPKEFDLRKLKVIPPVRNQKRCNASWAFGPLGAVESALIHRFHLPHRHFQLSTQELVDC 173

Query: 425 SWGFGNNGCDGGEDF-RAYEWIKRHGLPTEEDYGGYLGQDGYCH 553
           +   GN GC GG D  +A+ ++   G+ TE +Y  Y  + G CH
Sbjct: 174 A---GNQGCRGGVDVTQAFSYLMEKGVVTEFEY-PYTAKKGICH 213


>UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 234

 Score = 77.0 bits (181), Expect = 3e-13
 Identities = 45/116 (38%), Positives = 61/116 (52%), Gaps = 2/116 (1%)
 Frame = +2

Query: 233 VEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQA 412
           +E +   +P E D+R  GAV  +KDQ  CGSCW+FG+  A+E + FL + G L  LS+Q 
Sbjct: 11  LETIVGDIPDEIDYRTKGAVNEIKDQKHCGSCWAFGSCAAMESSWFLKH-GTLYSLSEQC 69

Query: 413 LIDCSWGFGNNGCDGGEDFRAYEWIK--RHGLPTEEDYGGYLGQDGYCHVDNVTAV 574
           L+DC       GC G     A+E++K   HGL   ED   Y  +   C  D    V
Sbjct: 70  LVDCC--HDCLGCHGCLPSLAFEYVKIFMHGLFETEDNYPYQAEHHSCKFDKTRGV 123


>UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_26,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 358

 Score = 77.0 bits (181), Expect = 3e-13
 Identities = 53/189 (28%), Positives = 83/189 (43%), Gaps = 14/189 (7%)
 Frame = +2

Query: 5   VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR- 181
           +++ + Y +D     R+ IF ++ + I  +N     +   +N  +D+  DEL+       
Sbjct: 48  LEYGKSYDNDFTAIHRMQIFMRNKKNIEKHNHVGAKYKAKLNEFSDQDYDELSLKMFMHL 107

Query: 182 -YSGPS-PHGLPFPYSKSRVEEL-----------SVKLPPEHDWRLFGAVTPVKDQSVCG 322
            +S      G P  +SK  ++EL             +     DW     VTP + Q  CG
Sbjct: 108 DFSDDDFKFGNPHFFSKEDIKELRNHPILTQMREQARKGDSLDWTK--QVTPSRPQGTCG 165

Query: 323 SCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGL 502
           SCW+F +       L L     L +LS+  LIDC  G  N GC+GG    AY++I  +G 
Sbjct: 166 SCWAFSSSDVAISRLALKGKEDLTQLSKTHLIDCCVGDKNKGCNGGSPIGAYKFINENGA 225

Query: 503 PTEEDYGGY 529
             E +Y  Y
Sbjct: 226 LKENEYREY 234


>UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precursor;
           n=3; Plasmodium (Laverania)|Rep: Trophozoite cysteine
           proteinase precursor - Plasmodium falciparum
          Length = 569

 Score = 77.0 bits (181), Expect = 3e-13
 Identities = 53/201 (26%), Positives = 98/201 (48%), Gaps = 20/201 (9%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRG--FTMSVNHLADRTDDELAALRG-- 175
           +H + Y +  E  ++  IF+ +   I ++N+ N+   +   VN  +D +++EL       
Sbjct: 231 EHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTL 290

Query: 176 --------RRYSGPSPHGLP--------FPYSKSRVEELSVKLPPEHDWRLFGAVTPVKD 307
                    +YS P  + L         +   K   +++  K+P   D+R  G V   KD
Sbjct: 291 LHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKD 350

Query: 308 QSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI 487
           Q +CGSCW+F +VG +E ++F     +++  S+Q ++DCS    N GCDGG  F ++ ++
Sbjct: 351 QGLCGSCWAFASVGNIE-SVFAKKNKNILSFSEQEVVDCS--KDNFGCDGGHPFYSFLYV 407

Query: 488 KRHGLPTEEDYGGYLGQDGYC 550
            ++ L   ++Y      D +C
Sbjct: 408 LQNELCLGDEYKYKAKDDMFC 428


>UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;
           n=1; Nasonia vitripennis|Rep: PREDICTED: similar to
           CG5367-PA - Nasonia vitripennis
          Length = 362

 Score = 76.6 bits (180), Expect = 4e-13
 Identities = 57/200 (28%), Positives = 92/200 (46%), Gaps = 6/200 (3%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDELAAL 169
           K++H + Y   LE  +R   +  +L  I+ +N      +  + +  NH+AD +      +
Sbjct: 65  KMRHNKTYTGTLEAVRR-EAWEDNLLKIYEHNLLAAAGHHEYILRDNHIADLSTSSY--M 121

Query: 170 RGRRYSGPSPHG-LPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
           R      PS    L      + V     ++P   DWR  G VT  ++Q  CGSC+++   
Sbjct: 122 RELVKLVPSRRRRLDDDEMVAAVLHDPRRIPKSLDWREKGFVTKPENQRDCGSCYAYSIA 181

Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKR-HGLPTEEDYG 523
           G++ G +F    G +V LS+Q L+DCS   GN GC GG       +++R  GL T+  Y 
Sbjct: 182 GSIAGQIF-RQTGIVVPLSEQQLVDCSTQTGNLGCSGGSLRNTLRYLERSKGLMTDATY- 239

Query: 524 GYLGQDGYCHVDNVTAVTSI 583
            Y    G C      +V ++
Sbjct: 240 PYTAHQGVCKFQRKLSVVNV 259


>UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep:
           Dvir_CG5367 - Drosophila virilis (Fruit fly)
          Length = 298

 Score = 76.6 bits (180), Expect = 4e-13
 Identities = 54/197 (27%), Positives = 87/197 (44%), Gaps = 6/197 (3%)
 Frame = +2

Query: 11  HQRQYASDLEHEKRLNIFRQSLRYIHSNNR----ANRGFTMSVNHLADRTDDELAALRGR 178
           + R YA   +  +    + ++   ++ +N         F ++ N +AD   D    L+G 
Sbjct: 3   NNRSYARSHDEMRSYEAYEENQIIVNEHNTYYETGKSSFRLATNTMADMNTDSY--LKGY 60

Query: 179 RYSGPSPHGLPFPYSKSRV-EELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
                SP           V   L   +P   DWR  G +TP+ +Q  CGSC++F    ++
Sbjct: 61  LRLLRSPEISDSDNIADIVGSPLMNNVPESFDWRKKGFITPLYNQQSCGSCYAFSIAQSI 120

Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIK-RHGLPTEEDYGGYL 532
           EG +F    G +V LS+Q ++DCS   GN GC GG       +++   GL    DY  Y 
Sbjct: 121 EGQVFKRT-GKIVALSEQQIVDCSVSHGNQGCIGGSLRNTLRYLQATGGLMRSLDY-KYA 178

Query: 533 GQDGYCHVDNVTAVTSI 583
            + G C   +  AV ++
Sbjct: 179 SKKGECQFVSELAVVNV 195


>UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158,
           whole genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_158,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 308

 Score = 76.6 bits (180), Expect = 4e-13
 Identities = 58/182 (31%), Positives = 85/182 (46%), Gaps = 1/182 (0%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNR-ANRGFTMSVNHLADRTDDELAALRGRRY 184
           K+Q+ Y    E   R  IF + ++   ++N    + FTM  N   D T +E  A+  RR 
Sbjct: 38  KYQKFYGPS-EKIYRAKIFEERIKLFEAHNADKTQTFTMGENQFTDLTQEEFKAIYLRRR 96

Query: 185 SGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
           S   P  L    ++  V      L   + W     +T VKDQ  CG+ W+F  +GAVE  
Sbjct: 97  S---PQKL---VNEKYVPTNEANLTSAN-W---AGLTSVKDQGYCGAAWAFAAIGAVESV 146

Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDG 544
           L +++  +L  LS+Q LIDC     N GC+ G    +  W + +G+ T   Y  Y GQ  
Sbjct: 147 LRINSVTNL-DLSEQQLIDCD--LENQGCEDGNLNNSLNWAQNNGVTTSASY-PYTGQTD 202

Query: 545 YC 550
            C
Sbjct: 203 GC 204


>UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 280

 Score = 76.2 bits (179), Expect = 5e-13
 Identities = 43/108 (39%), Positives = 57/108 (52%), Gaps = 3/108 (2%)
 Frame = +2

Query: 254 LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRL-SQQALIDCSW 430
           LP + DWR  G VT VK+Q  CGSCW+F   G  E    + N    V L S+Q L+DCS 
Sbjct: 68  LPQQFDWRNLGKVTQVKNQGNCGSCWAFTITGLFESINLIRN--KTVELYSEQELLDCSS 125

Query: 431 G--FGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYCHVDNVT 568
              + N+GC GG    A+E+ K++G+     Y  Y G    C V+  T
Sbjct: 126 NGIYRNSGCQGGWPHLAFEYSKKNGISLSSQY-PYKGIQENCTVNQQT 172


>UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine
           protease; n=1; Maconellicoccus hirsutus|Rep: Putative
           cathepsin L-like cysteine protease - Maconellicoccus
           hirsutus (hibiscus mealybug)
          Length = 339

 Score = 75.8 bits (178), Expect = 7e-13
 Identities = 57/202 (28%), Positives = 90/202 (44%), Gaps = 8/202 (3%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRA-NRG---FTMSVNHLADRTDDELAAL 169
           K ++ ++Y +D+E   R+ IF  +   I  +N+  ++G   F   +N  +D    E    
Sbjct: 33  KTQYSKKYTTDIEDRLRMKIFIDNKYRIAQHNKLFHKGLVTFEQGINEYSDMLQSEFNEK 92

Query: 170 RGRRYSGP---SPHGLPFPYSKSRVEELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSF 337
            G++ S       +GLP      R   L    PP+  DWR  G V PV  Q  C S +++
Sbjct: 93  MGQKSSNQRNTEANGLP----SIRFTPLHNVNPPDSVDWRTKGLVGPVGKQVNCSSGYAW 148

Query: 338 GTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEED 517
             +GA+EG L   +      +S Q +IDCS   GN GC GG    +Y +I + G   ++ 
Sbjct: 149 SAIGALEGQL-ASDKKKFQGISVQNVIDCSESTGNKGCSGGNQHHSYFYIYKQGGVDDDV 207

Query: 518 YGGYLGQDGYCHVDNVTAVTSI 583
              Y   +  C       VT +
Sbjct: 208 SYPYKDAEEPCAFKKENVVTRV 229


>UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Rep:
           Cathepsin W - Xenopus tropicalis (Western clawed frog)
           (Silurana tropicalis)
          Length = 303

 Score = 74.9 bits (176), Expect = 1e-12
 Identities = 54/183 (29%), Positives = 84/183 (45%), Gaps = 1/183 (0%)
 Frame = +2

Query: 5   VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTM-SVNHLADRTDDELAALRGRR 181
           +++ R Y +  E + RL IF ++L+      R   G     V   +D TD+E +      
Sbjct: 2   LQYNRSYKTREEFKYRLRIFSENLKEASRLQREELGTAQYGVTKFSDLTDEEFSI----- 56

Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
           Y  P+ + LP P    + EE+ +  P   DWR    ++  K+Q  C SCW+F  V  +E 
Sbjct: 57  YHLPT-NILPTPPILKQSEEV-LPFPTSCDWRTQNVISKAKNQRTCHSCWAFAAVANIEA 114

Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQD 541
              +   G  + LS+Q +IDC+     NGC GG  + A+  + + G  T E    Y G  
Sbjct: 115 QWAIL--GQTISLSEQQVIDCN--TCRNGCSGGYAWDAFMTVLQQGGLTSEKSYPYTGHV 170

Query: 542 GYC 550
             C
Sbjct: 171 SNC 173


>UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Spodoptera
           litura multicapsid nucleopolyhedrovirus (SpltMNPV)
          Length = 337

 Score = 74.9 bits (176), Expect = 1e-12
 Identities = 54/190 (28%), Positives = 90/190 (47%), Gaps = 7/190 (3%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTD----DELAALRG 175
           +H ++Y +  + +     F+++L  +++ N  +      +N  +D       +E A L  
Sbjct: 39  QHNKEYTTPDQRDAAFVNFKRNLADMNAMNNVSNQAVYGINKFSDIDKITFVNEHAGLVS 98

Query: 176 RRYSGPSPHGLPFPYSKS-RVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGA 352
              +    +  P+   +   V   S + P   DWR    VT VK+Q VCGSCW+F  +G 
Sbjct: 99  NLINSTDSNFDPYRLCEYVTVAGPSARTPESFDWRKLNKVTKVKEQGVCGSCWAFAAIGN 158

Query: 353 VEGA-LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAY-EWIKRHGLPTEEDYGG 526
           +E     +H+   L+ LS+Q L+DC     + GCDGG    A+ E I+  G+  E DY  
Sbjct: 159 IESQYAIMHDS--LIDLSEQQLLDCD--RVDQGCDGGLMHLAFQEIIRIGGVEHEIDY-P 213

Query: 527 YLGQDGYCHV 556
           Y G +  C +
Sbjct: 214 YQGIEYACRL 223


>UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1;
           Oryza sativa (japonica cultivar-group)|Rep: Putative
           uncharacterized protein - Oryza sativa subsp. japonica
           (Rice)
          Length = 289

 Score = 74.5 bits (175), Expect = 2e-12
 Identities = 54/162 (33%), Positives = 73/162 (45%), Gaps = 2/162 (1%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRGFTMSVNHLADRTDDELAALRGRRY 184
           K  + Y    E E R  IFR ++ +I     +      + +N  AD T+DE  A     Y
Sbjct: 49  KFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVAT----Y 104

Query: 185 SGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
           +G  P   P P    R  +  +  P   DWR  GAVT VKDQ  CGSCW+F  V A+EG 
Sbjct: 105 TGAKP---PHPKEAPRPVD-PIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGL 160

Query: 365 LFLHNGGHLVRLSQ-QALIDCSWGFGNNGCDGGEDFRAYEWI 487
             +   G L  LS  + L++           G  D RA+E +
Sbjct: 161 TKIRT-GQLTPLSDARTLVELRNQHATGAAAGTPD-RAFELV 200


>UniRef50_Q23VA1 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 397

 Score = 74.1 bits (174), Expect = 2e-12
 Identities = 41/118 (34%), Positives = 60/118 (50%), Gaps = 5/118 (4%)
 Frame = +2

Query: 215 PYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLV 394
           P     V +L V +P   DWR+ G V+PVKDQ  CG CW+F      E    + N   L 
Sbjct: 168 PNPNPPVNQLKV-VPQSVDWRIQGKVSPVKDQGRCGCCWAFSATALAESVNLMRN-NTLQ 225

Query: 395 RLSQQALIDCS-----WGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYCH 553
           + S+Q L+DC+       + + GC GG  + A  +++R G+  E  Y  Y  Q+G C+
Sbjct: 226 QYSEQELVDCTNNQYQEDYSSLGCGGGWAYNALVYMQRKGIFLESQY-PYKAQNGVCN 282


>UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:
           Aca s 1 allergen - Acarus siro (Dust mite)
          Length = 331

 Score = 74.1 bits (174), Expect = 2e-12
 Identities = 49/171 (28%), Positives = 83/171 (48%), Gaps = 5/171 (2%)
 Frame = +2

Query: 23  YASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPH 202
           YA+  E   R   F  SL++I  N+R + G  ++VN  AD   +E   +      G +  
Sbjct: 37  YATPEEESIRRANFEASLKWIQENDRKDGGAHLAVNQFADLGANESVGVNLTARRGEA-- 94

Query: 203 GLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNG 382
              F  + +        LP   DWR    + P+++Q  CG+CW+F ++  VE A  +   
Sbjct: 95  ---FFEAVTIHVTPEGNLPETFDWR--SKLGPIENQGRCGACWAFASLATVEAAFAIKYN 149

Query: 383 GHLVRLSQQALIDCS-----WGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
            H +RLS+Q L++C+       + N+GC GG  + A ++++  G+  E  Y
Sbjct: 150 TH-IRLSKQELVECTRESDHTPYENSGCQGGYSWEALKYVQVTGVVEEAAY 199


>UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1;
           Uronema marinum|Rep: Cathepsin L-like cysteine protease
           - Uronema marinum
          Length = 333

 Score = 73.7 bits (173), Expect = 3e-12
 Identities = 56/175 (32%), Positives = 84/175 (48%), Gaps = 2/175 (1%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSV-NHLADRTDDELAALRGR 178
           K  H   Y+S  E   R  ++ ++ +++   N AN  FT+ V N  A  T++E  A   +
Sbjct: 40  KQNHNLVYSSS-EDAYRFQVYFENFQFVEEFN-ANNSFTLGVENQFAAMTNEEFKA---Q 94

Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
             S     G  +      V E +V  P    +W   GAV  V++Q VCGSCW+F  V ++
Sbjct: 95  FTSEIISEGYNYQQVDRNVYE-AVNAPSGSVNWVSKGAVQGVQNQGVCGSCWAFSAVCSL 153

Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
           E  L+  N G L+  S+Q L+ C     + GCDGG    A+ +   HGL +   Y
Sbjct: 154 E-RLYKINTGKLLSFSEQQLVSCE--PKSYGCDGGWPEAAFAYSATHGLESSASY 205


>UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or
           H-like cysteine peptidase; n=1; Trichomonas vaginalis
           G3|Rep: Clan CA, family C1, cathepsin L, S or H-like
           cysteine peptidase - Trichomonas vaginalis G3
          Length = 473

 Score = 73.7 bits (173), Expect = 3e-12
 Identities = 40/103 (38%), Positives = 56/103 (54%), Gaps = 3/103 (2%)
 Frame = +2

Query: 254 LPPEHDWR-LFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSW 430
           LP E  WR +   V   +DQ  CGSCW+FGT  ++E  L L   G    LS   ++DC+W
Sbjct: 251 LPAEFSWRDVPNVVGKPRDQVACGSCWAFGTAESLESQLALKT-GVFRELSVNQIMDCTW 309

Query: 431 GFGNNGCDGGEDFRAYEWI--KRHGLPTEEDYGGYLGQDGYCH 553
            + N+ C GGE   A+  +  +   L  E+DY  Y+G  GYC+
Sbjct: 310 DYNNSACGGGEAGPAFRSLINQNFKLFLEKDY-PYIGVAGYCN 351


>UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-like
           cysteine peptidase; n=3; Trichomonas vaginalis|Rep: Clan
           CA, family C1, cathepsin L or K-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 320

 Score = 73.7 bits (173), Expect = 3e-12
 Identities = 58/189 (30%), Positives = 85/189 (44%), Gaps = 5/189 (2%)
 Frame = +2

Query: 23  YASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPH 202
           Y  D E   R  IF  + R++   N  NR + +S+N  +  T+ E  +L G + S  +  
Sbjct: 33  YVGD-EFHFRFGIFLANKRFVQEQNSINRNYRLSLNQFSFLTNSEYKSLLGGKVSSKNND 91

Query: 203 G--LPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLH 376
              L  P SK   E          DWR  G + P+++Q  CG CW+F T+  VE A +  
Sbjct: 92  DSHLFSPQSKKSSEVT-------FDWRTKGIINPIRNQGQCGLCWAFSTICCVE-ARWAQ 143

Query: 377 NGGHLVRLSQQALIDCSWGFGNNGCDGG--EDFRAYEWIKRHG-LPTEEDYGGYLGQDGY 547
               L++LS+Q L+DC       GC GG  +D  A+      G   T  DY  Y+ +   
Sbjct: 144 AYNTLLQLSEQMLVDCV--DTCYGCMGGYADDAAAFVIENYEGKFMTAADY-PYIARASI 200

Query: 548 CHVDNVTAV 574
           C  D   +V
Sbjct: 201 CKFDKTKSV 209


>UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_46,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 336

 Score = 73.3 bits (172), Expect = 4e-12
 Identities = 50/187 (26%), Positives = 82/187 (43%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
           K+++ + Y+   E  +  N      +    N+  N+ + M +N  +D + +E + +    
Sbjct: 58  KIEYGKSYSGQQEVFRFFNFQINRNKVNKHNSDPNKTYFMKMNQFSDLSQEEFSLIYLTH 117

Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
            +            + +  + + K     DWR    +T VKDQ  C  CW+FG VGA E 
Sbjct: 118 DNAEEVMEQNLIIDELQKTQENDKTINSVDWR---KITQVKDQGQCSGCWAFGAVGAAEA 174

Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQD 541
             ++ N    V LS+Q LIDC     + GC+GG    A ++I  HGL     Y     Q 
Sbjct: 175 WFYVKN-KTTVLLSEQQLIDCD--TQSFGCNGGYQNLALKYIANHGLNDARVYPYTQKQS 231

Query: 542 GYCHVDN 562
            YC  ++
Sbjct: 232 AYCKYES 238


>UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18;
           Plasmodium|Rep: Cysteine proteinase precursor -
           Plasmodium vivax (strain Salvador I)
          Length = 583

 Score = 73.3 bits (172), Expect = 4e-12
 Identities = 51/185 (27%), Positives = 85/185 (45%), Gaps = 14/185 (7%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
           K++R Y    E  ++   F+ +   I  +N  N+ + M VN  +D +  +  +   +   
Sbjct: 243 KYKRSYKDINEQMEKYKNFKMNYLKIKKHNETNQMYKMKVNQFSDYSKKDFESYFRKLVP 302

Query: 188 GPS----PHGLPFP----------YSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGS 325
            P      + +PF            + S    L   +P   D+R  G V   KDQ +CGS
Sbjct: 303 IPDHLKKKYVVPFSSMNNGKGKNVVTSSSGANLLADVPEILDYREKGIVHEPKDQGLCGS 362

Query: 326 CWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLP 505
           CW+F +VG VE      +   ++ LS+Q ++DCS    N GCDGG  F ++ +   +G+ 
Sbjct: 363 CWAFASVGNVECMYAKEHNKTILTLSEQEVVDCS--KLNFGCDGGHPFYSFIYAIENGIC 420

Query: 506 TEEDY 520
             +DY
Sbjct: 421 MGDDY 425


>UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-ear
           cress). SAG12 protein; n=2; Dictyostelium
           discoideum|Rep: Similar to Arabidopsis thaliana
           (Mouse-ear cress). SAG12 protein - Dictyostelium
           discoideum (Slime mold)
          Length = 358

 Score = 72.9 bits (171), Expect = 5e-12
 Identities = 55/195 (28%), Positives = 86/195 (44%), Gaps = 14/195 (7%)
 Frame = +2

Query: 8   KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFT-MSVNHLADRTDDELAALR-GRR 181
           KH + Y   +E E R + F+++++     N  + G      N  +D +++E +     + 
Sbjct: 50  KHSKIYKDSIEMENRFSNFKENMKKNIELNSMHAGKAKFESNGFSDLSEEEFSNFHLNKA 109

Query: 182 YSGPSPH------GLPFPYSK-----SRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSC 328
           + G   H        P P+         +E   +      DWR  G VTPVKDQ  CGSC
Sbjct: 110 FKGKPSHLRNSIKPQPTPHHSLINGYKEMENGDLNELYSIDWRKKGLVTPVKDQGQCGSC 169

Query: 329 WSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKR-HGLP 505
           + F  V  +E A ++  G   + LS+Q  +DC    G   C GG+ +  YE+  +  G+ 
Sbjct: 170 YIFSAVEQIETA-WIKAGNKPILLSEQQAVDCDPYDGQ--CGGGDPYTVYEYFSQVGGVS 226

Query: 506 TEEDYGGYLGQDGYC 550
           T   Y  Y   DG C
Sbjct: 227 TNAQY-PYTATDGTC 240


>UniRef50_A3EXS2 Cluster: Cathepsin L-like cysteine proteinase-like
           protein; n=1; Maconellicoccus hirsutus|Rep: Cathepsin
           L-like cysteine proteinase-like protein -
           Maconellicoccus hirsutus (hibiscus mealybug)
          Length = 253

 Score = 72.9 bits (171), Expect = 5e-12
 Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 2/113 (1%)
 Frame = +2

Query: 251 KLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSW 430
           ++P E +W   G VTPV +Q  C   W+F   GA+E    +      V+LS+Q LI+CS 
Sbjct: 32  EIPNEINWVAKGKVTPVGNQGKCNVGWAFSVTGALESEKAIKYEAAPVKLSEQNLIECSG 91

Query: 431 GFGNNGCDGGEDFRAYEWIKR-HGLPTEEDY-GGYLGQDGYCHVDNVTAVTSI 583
           GFGN  C GG     Y+++    G+  E+ Y   +   +  C  D+  +  SI
Sbjct: 92  GFGNKRCSGGNLENTYKYVNHSRGIEKEDSYRDNFRHINSRCQYDSTKSAVSI 144


>UniRef50_A2DKX2 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 317

 Score = 72.9 bits (171), Expect = 5e-12
 Identities = 61/183 (33%), Positives = 83/183 (45%), Gaps = 3/183 (1%)
 Frame = +2

Query: 38  EHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFP 217
           E+  RL I+  + RYI   NR  R  T++ N  +  T  E  AL     S P  H  P  
Sbjct: 36  EYAFRLGIYLTTDRYIKQFNRGKRSHTLAHNKFSAYTHAEYKALLN---SKPI-H--PRN 89

Query: 218 YSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVR 397
             KS++    V++P   DWR   A  PV+DQ  C S ++F +    E    ++    L  
Sbjct: 90  VQKSQITTQKVQVPDTWDWRDRVAFNPVRDQMECASGFAFASCACQEVTWNIYY-NKLYL 148

Query: 398 LSQQALIDCSWGFGNNGCDGGEDFRAYEWI--KRHG-LPTEEDYGGYLGQDGYCHVDNVT 568
           LS Q ++DC+  +   GCDGGE  RA  +I   + G    E DY       GYC  D   
Sbjct: 149 LSPQNMLDCA--YNEEGCDGGEADRAVGYIVTDQDGKFGLESDYPYKSESMGYCEFDPSK 206

Query: 569 AVT 577
            VT
Sbjct: 207 GVT 209


>UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n=1;
           Rattus norvegicus|Rep: UPI0000501FDB UniRef100 entry -
           Rattus norvegicus
          Length = 338

 Score = 72.5 bits (170), Expect = 7e-12
 Identities = 36/82 (43%), Positives = 50/82 (60%), Gaps = 1/82 (1%)
 Frame = +2

Query: 308 QSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI 487
           Q  C SCW+F  VGA+EG +F   G  L  LS Q L+DCS   GN GC GG  + A++++
Sbjct: 139 QGRCNSCWAFPVVGAIEGQMFKKTG-KLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYV 197

Query: 488 KRH-GLPTEEDYGGYLGQDGYC 550
            ++ GL +E  Y  Y G++G C
Sbjct: 198 LQNGGLESEATY-PYEGKEGLC 218


>UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides
           sonorensis|Rep: Cathepsin L - Culicoides sonorensis
          Length = 331

 Score = 72.5 bits (170), Expect = 7e-12
 Identities = 61/195 (31%), Positives = 96/195 (49%), Gaps = 8/195 (4%)
 Frame = +2

Query: 2   KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRG---FTMSVNHLADRTDDELAAL 169
           K+++ + Y    E   R  IF ++L  +  +N R   G   +   VN  +D T +E A L
Sbjct: 31  KLEYNKVYPLSTEENLRKGIFERNLADVMEHNARYLSGMETYEKGVNQFSDLTYEEFAKL 90

Query: 170 R-GRRYSGPSPHGLPFPYSKSRVEE-LSVKLPPE-HDWRLFGAVTPVKDQSVCGSCWSFG 340
             G + S           +   +E+ L  +L PE + W       PVK+Q+ CGSCW+F 
Sbjct: 91  YLGEKIS----FNELMTNADGWIEKPLRRQLAPESYAWDTKDV--PVKNQAQCGSCWAFA 144

Query: 341 TVGAVEGAL-FLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEED 517
           +V +VE      HN  +   L++Q L+DC     ++GC GG    A ++++ +GL  E+D
Sbjct: 145 SVASVEMRYKRFHNKSY--TLAEQELVDCE--TTSHGCSGGWSDLALQYMRDNGLSFEKD 200

Query: 518 YGGYLGQDGYCHVDN 562
           Y  Y G+D  CH  N
Sbjct: 201 Y-PYKGKDEKCHASN 214


  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 574,653,192
Number of Sequences: 1657284
Number of extensions: 12517963
Number of successful extensions: 65452
Number of sequences better than 10.0: 500
Number of HSP's better than 10.0 without gapping: 57911
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 64504
length of database: 575,637,011
effective HSP length: 96
effective length of database: 416,537,747
effective search space used: 40820699206
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
