SilkBase

Last updated: 2022/11/18
BLASTX 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= e96h0134
         (708 letters)

Database: uniref50 
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C...   128   1e-28
UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n...    91   2e-17
UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ...    89   7e-17
UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep...    88   2e-16
UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j...    79   8e-14
UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n...    75   2e-12
UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M...    74   3e-12
UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h...    74   4e-12
UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae...    74   4e-12
UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ...    74   4e-12
UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei...    73   5e-12
UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;...    73   5e-12
UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e...    73   9e-12
UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n...    72   2e-11
UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio...    72   2e-11
UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ...    71   4e-11
UniRef50_Q23RT7 Cluster: Papain family cysteine protease contain...    69   9e-11
UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto...    69   9e-11
UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs...    69   1e-10
UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ...    69   1e-10
UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca...    68   3e-10
UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcopt...    68   3e-10
UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat...    68   3e-10
UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar ...    67   3e-10
UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ...    67   5e-10
UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata...    67   5e-10
UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ...    66   6e-10
UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome s...    66   6e-10
UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:...    66   8e-10
UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R...    66   8e-10
UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=...    66   8e-10
UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ...    66   1e-09
UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ...    66   1e-09
UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy...    66   1e-09
UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re...    65   1e-09
UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph...    65   1e-09
UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain...    65   1e-09
UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc...    65   2e-09
UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr...    64   2e-09
UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;...    64   4e-09
UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi...    63   6e-09
UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ...    63   6e-09
UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No...    63   7e-09
UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio...    63   7e-09
UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep: L...    62   1e-08
UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er...    62   1e-08
UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re...    62   2e-08
UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G...    61   2e-08
UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18...    61   2e-08
UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat...    61   3e-08
UniRef50_Q23H32 Cluster: Papain family cysteine protease contain...    61   3e-08
UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ...    61   3e-08
UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus...    60   4e-08
UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox...    60   4e-08
UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ...    60   5e-08
UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|...    60   5e-08
UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 p...    60   7e-08
UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ...    60   7e-08
UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine prot...    60   7e-08
UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w...    60   7e-08
UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy...    59   9e-08
UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh...    59   9e-08
UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG...    59   1e-07
UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty...    59   1e-07
UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa...    58   2e-07
UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Ac...    58   2e-07
UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease ...    58   2e-07
UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n...    58   2e-07
UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus...    58   2e-07
UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ...    58   2e-07
UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease ...    58   3e-07
UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n...    58   3e-07
UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|...    58   3e-07
UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; D...    58   3e-07
UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ...    57   4e-07
UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa...    57   4e-07
UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sa...    57   4e-07
UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1...    57   4e-07
UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckei...    57   5e-07
UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb...    57   5e-07
UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: ...    57   5e-07
UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteina...    57   5e-07
UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr...    57   5e-07
UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v...    57   5e-07
UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ...    56   6e-07
UniRef50_Q17IZ6 Cluster: Procathepsin L3, putative; n=2; Culicid...    56   6e-07
UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ...    56   9e-07
UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ...    56   9e-07
UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine p...    56   1e-06
UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome s...    56   1e-06
UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhip...    56   1e-06
UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi...    56   1e-06
UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil...    55   1e-06
UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster...    55   1e-06
UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Pl...    55   1e-06
UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ...    55   1e-06
UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru...    55   1e-06
UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endop...    55   1e-06
UniRef50_Q5NE16 Cluster: Putative cathepsin L-like protein 3; n=...    55   1e-06
UniRef50_UPI000155637A Cluster: PREDICTED: similar to ENSANGP000...    55   2e-06
UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica...    55   2e-06
UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:...    55   2e-06
UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted...    55   2e-06
UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]...    55   2e-06
UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|...    54   3e-06
UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel...    54   3e-06
UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain...    54   3e-06
UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain...    54   3e-06
UniRef50_O16454 Cluster: Temporarily assigned gene name protein ...    54   3e-06
UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3...    54   3e-06
UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;...    54   3e-06
UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal...    54   5e-06
UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheir...    54   5e-06
UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ...    54   5e-06
UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=...    53   6e-06
UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic...    53   6e-06
UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D...    53   6e-06
UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus ...    53   8e-06
UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh...    53   8e-06
UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi...    53   8e-06
UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin ...    52   1e-05
UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-...    52   1e-05
UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip...    52   1e-05
UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35...    52   1e-05
UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory...    52   1e-05
UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Re...    52   1e-05
UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ...    52   2e-05
UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease ...    52   2e-05
UniRef50_Q5C1E8 Cluster: SJCHGC04937 protein; n=1; Schistosoma j...    52   2e-05
UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryz...    51   2e-05
UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt...    51   2e-05
UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia...    51   2e-05
UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina...    51   2e-05
UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|...    51   2e-05
UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ...    51   2e-05
UniRef50_Q5YER1 Cluster: Cysteine proteinase; n=1; Bigelowiella ...    51   3e-05
UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca...    51   3e-05
UniRef50_Q23H10 Cluster: Papain family cysteine protease contain...    51   3e-05
UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain...    51   3e-05
UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dv...    51   3e-05
UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep...    50   4e-05
UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicate...    50   4e-05
UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Re...    50   4e-05
UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; ...    50   6e-05
UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl...    50   6e-05
UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n...    50   7e-05
UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:...    50   7e-05
UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t...    49   1e-04
UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ...    49   1e-04
UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ...    49   1e-04
UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C...    49   1e-04
UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr...    49   1e-04
UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestiv...    49   1e-04
UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ...    49   1e-04
UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ...    49   1e-04
UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n...    49   1e-04
UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ...    48   2e-04
UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8; Magnoliophyt...    48   2e-04
UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; ...    48   2e-04
UniRef50_Q650X9 Cluster: Putative cysteine proteinase; n=1; Oryz...    48   2e-04
UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ...    48   2e-04
UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Pl...    48   2e-04
UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryo...    48   2e-04
UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|...    48   2e-04
UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin ...    48   3e-04
UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza sativa...    48   3e-04
UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lambli...    48   3e-04
UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis...    48   3e-04
UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve...    48   3e-04
UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ...    47   4e-04
UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,...    47   4e-04
UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaste...    47   4e-04
UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ...    46   7e-04
UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum...    46   7e-04
UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadida...    46   7e-04
UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa...    46   7e-04
UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain...    46   7e-04
UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; ...    46   7e-04
UniRef50_Q22A69 Cluster: Papain family cysteine protease contain...    46   7e-04
UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; ...    46   7e-04
UniRef50_Q23894 Cluster: Cysteine proteinase 3; n=2; Dictyosteli...    46   7e-04
UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain...    46   0.001
UniRef50_Q23H15 Cluster: Papain family cysteine protease contain...    46   0.001
UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain...    46   0.001
UniRef50_Q22RC9 Cluster: Papain family cysteine protease contain...    46   0.001
UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy...    46   0.001
UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi...    46   0.001
UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R...    46   0.001
UniRef50_Q24HI2 Cluster: Papain family cysteine protease contain...    46   0.001
UniRef50_Q22W19 Cluster: Papain family cysteine protease contain...    46   0.001
UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ...    46   0.001
UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, wh...    46   0.001
UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome s...    45   0.002
UniRef50_Q23TW3 Cluster: Papain family cysteine protease contain...    45   0.002
UniRef50_Q239L8 Cluster: Papain family cysteine protease contain...    45   0.002
UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big...    45   0.002
UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain...    45   0.002
UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis...    45   0.002
UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve...    44   0.003
UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ...    44   0.003
UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome s...    44   0.004
UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2; ...    44   0.004
UniRef50_Q24E33 Cluster: Papain family cysteine protease contain...    44   0.004
UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali...    44   0.004
UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2...    44   0.004
UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lambli...    44   0.005
UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intest...    44   0.005
UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy...    44   0.005
UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ...    43   0.006
UniRef50_Q2QS15 Cluster: Papain family cysteine protease contain...    43   0.006
UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy...    43   0.006
UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-li...    43   0.006
UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precurs...    43   0.006
UniRef50_UPI0000E492F4 Cluster: PREDICTED: similar to cathepsin ...    43   0.009
UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foet...    43   0.009
UniRef50_Q23H06 Cluster: Papain family cysteine protease contain...    43   0.009
UniRef50_Q23FS6 Cluster: Papain family cysteine protease contain...    43   0.009
UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; The...    43   0.009
UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin ...    42   0.011
UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2; Arab...    42   0.011
UniRef50_Q5ZC39 Cluster: CRK1 protein-like; n=2; Oryza sativa (j...    42   0.015
UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-...    42   0.015
UniRef50_A6N590 Cluster: Extracellular cysteine protease 8; n=10...    42   0.015
UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cy...    42   0.015
UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n...    42   0.020
UniRef50_A0BNM1 Cluster: Chromosome undetermined scaffold_119, w...    42   0.020
UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin ...    41   0.026
UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryz...    41   0.026
UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen...    41   0.026
UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; The...    41   0.026
UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [C...    41   0.026
UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryz...    41   0.034
UniRef50_Q248G1 Cluster: Papain family cysteine protease contain...    41   0.034
UniRef50_Q22NW9 Cluster: Papain family cysteine protease contain...    41   0.034
UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathep...    41   0.034
UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; M...    41   0.034
UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:...    41   0.034
UniRef50_Q94715 Cluster: Putative cathepsin L2 precursor; n=4; P...    41   0.034
UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Pir...    40   0.045
UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2; cel...    40   0.045
UniRef50_Q9SIE8 Cluster: Putative cysteine proteinase; n=1; Arab...    40   0.060
UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|...    40   0.060
UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma j...    40   0.060
UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, wh...    40   0.060
UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, who...    40   0.060
UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Pl...    40   0.060
UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov...    40   0.060
UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; R...    40   0.079
UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; ...    40   0.079
UniRef50_Q23EG5 Cluster: Papain family cysteine protease contain...    40   0.079
UniRef50_Q235G6 Cluster: Papain family cysteine protease contain...    40   0.079
UniRef50_A7LFV2 Cluster: Cathepsin L protease inhibitor 1; n=1; ...    40   0.079
UniRef50_Q60EQ9 Cluster: Putative uncharacterized protein OJ1280...    39   0.10 
UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi...    39   0.10 
UniRef50_P21381 Cluster: Thaumatopain; n=10; Eukaryota|Rep: Thau...    39   0.10 
UniRef50_UPI0000D566EC Cluster: PREDICTED: similar to CG10460-PA...    39   0.14 
UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; ...    39   0.14 
UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;...    39   0.14 
UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain ...    39   0.14 
UniRef50_Q962W3 Cluster: Cysteine protease; n=3; Giardia intesti...    38   0.18 
UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto...    38   0.18 
UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium...    38   0.18 
UniRef50_Q93512 Cluster: Putative uncharacterized protein; n=2; ...    38   0.24 
UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcop...    38   0.24 
UniRef50_Q3L7L7 Cluster: Sar s 1 allergen Yv5020C01; n=1; Sarcop...    38   0.24 
UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep...    38   0.24 
UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain...    38   0.24 
UniRef50_Q5KH32 Cluster: Putative uncharacterized protein; n=2; ...    38   0.24 
UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase pr...    38   0.24 
UniRef50_P56202 Cluster: Cathepsin W precursor; n=15; Eutheria|R...    38   0.24 
UniRef50_Q75JH0 Cluster: Similar to Dictyostelium discoideum (Sl...    38   0.32 
UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcop...    38   0.32 
UniRef50_A7APS9 Cluster: Papain family cysteine protease contain...    38   0.32 
UniRef50_A2SQ75 Cluster: Cysteine protease-like protein; n=1; Me...    38   0.32 
UniRef50_Q0AY53 Cluster: Putative uncharacterized protein; n=1; ...    37   0.42 
UniRef50_Q01E11 Cluster: Cysteine proteinase Cathepsin F; n=3; O...    37   0.42 
UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Try...    37   0.42 
UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:...    37   0.42 
UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, wh...    37   0.42 
UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome sh...    37   0.56 
UniRef50_Q75ZL3 Cluster: Putative uncharacterized protein; n=1; ...    37   0.56 
UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa...    37   0.56 
UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba hi...    37   0.56 
UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lambl...    37   0.56 
UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella ve...    37   0.56 
UniRef50_UPI0000D566ED Cluster: PREDICTED: similar to CTLA-2-alp...    36   0.74 
UniRef50_A5HC50 Cluster: Cathepsin W; n=2; Theria|Rep: Cathepsin...    36   0.74 
UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schist...    36   0.74 
UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpw...    36   0.74 
UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3; Theil...    36   0.74 
UniRef50_Q231X3 Cluster: Papain family cysteine protease contain...    36   0.74 
UniRef50_A1ZBK7 Cluster: CG10460-PA; n=1; Drosophila melanogaste...    36   0.74 
UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator ame...    36   0.74 
UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase pr...    36   0.74 
UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 ...    36   0.74 
UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Gi...    36   0.74 
UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ...    36   0.98 
UniRef50_Q53K53 Cluster: Cysteine protease 1, putative; n=5; Ory...    36   0.98 
UniRef50_Q7QWN9 Cluster: GLP_26_50243_51811; n=2; Giardia lambli...    36   0.98 
UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1...    36   0.98 
UniRef50_A1Z9I0 Cluster: CG6357-PA; n=3; Drosophila melanogaster...    36   0.98 
UniRef50_A0D5R4 Cluster: Chromosome undetermined scaffold_39, wh...    36   0.98 
UniRef50_P84789 Cluster: Philibertain g 1; n=5; core eudicotyled...    36   0.98 
UniRef50_Q3W780 Cluster: Peptidase S1, chymotrypsin:PDZ/DHR/GLGF...    36   1.3  
UniRef50_A0LES1 Cluster: Peptidase C1A, papain precursor; n=2; S...    36   1.3  
UniRef50_Q945E4 Cluster: Cysteine proteinase; n=1; Vasconcellea ...    36   1.3  
UniRef50_Q0JP65 Cluster: Os01g0240900 protein; n=3; Oryza sativa...    36   1.3  
UniRef50_Q8I5D0 Cluster: Putative uncharacterized protein; n=2; ...    36   1.3  
UniRef50_Q7QQ83 Cluster: GLP_42_16392_14707; n=1; Giardia lambli...    36   1.3  
UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcop...    36   1.3  
UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Ca...    36   1.3  
UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin ...    35   1.7  
UniRef50_UPI0000DA404B Cluster: PREDICTED: similar to cathepsin ...    35   1.7  
UniRef50_Q55FL7 Cluster: Putative uncharacterized protein; n=1; ...    35   1.7  
UniRef50_Q4YNP3 Cluster: Putative uncharacterized protein; n=1; ...    35   1.7  
UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theil...    35   1.7  
UniRef50_Q3L7L6 Cluster: Sar s 1 allergen Yv5032C08; n=1; Sarcop...    35   1.7  
UniRef50_Q2FLC7 Cluster: Periplasmic copper-binding precursor; n...    35   1.7  
UniRef50_P12400 Cluster: Protein CTLA-2-beta; n=6; Mus musculus|...    35   1.7  
UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ...    35   2.3  
UniRef50_UPI0000DA2FCA Cluster: PREDICTED: similar to alpha 3 ty...    35   2.3  
UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease ...    35   2.3  
UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyl...    35   2.3  
UniRef50_Q7KWP5 Cluster: Similar to Dictyostelium discoideum (Sl...    35   2.3  
UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep...    35   2.3  
UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|...    35   2.3  
UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, w...    35   2.3  
UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precu...    35   2.3  
UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;...    34   3.0  
UniRef50_UPI0000D9BE07 Cluster: PREDICTED: hypothetical protein;...    34   3.0  
UniRef50_UPI0000D9B393 Cluster: PREDICTED: hypothetical protein;...    34   3.0  
UniRef50_UPI00001CC928 Cluster: PREDICTED: similar to CTLA-2-bet...    34   3.0  
UniRef50_Q4SUM3 Cluster: Ephrin receptor; n=4; Tetraodon nigrovi...    34   3.0  
UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=...    34   3.0  
UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoid...    34   3.0  
UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|R...    34   3.0  
UniRef50_A0EAG2 Cluster: Chromosome undetermined scaffold_86, wh...    34   3.0  
UniRef50_A0BV23 Cluster: Chromosome undetermined scaffold_13, wh...    34   3.0  
UniRef50_UPI00015B5D85 Cluster: PREDICTED: similar to cathepsin ...    34   3.9  
UniRef50_Q207N1 Cluster: Cathepsin S; n=2; Clupeocephala|Rep: Ca...    34   3.9  
UniRef50_A6GAX3 Cluster: Putative uncharacterized protein; n=1; ...    34   3.9  
UniRef50_A4G7B4 Cluster: Putative uncharacterized protein; n=1; ...    34   3.9  
UniRef50_Q4Y2Z9 Cluster: Putative uncharacterized protein; n=3; ...    34   3.9  
UniRef50_Q3L7L2 Cluster: Sar s 1 allergen SMIPP-C Yv6008G08; n=2...    34   3.9  
UniRef50_A0DCA5 Cluster: Chromosome undetermined scaffold_45, wh...    34   3.9  
UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, wh...    34   3.9  
UniRef50_Q2H7E7 Cluster: Putative uncharacterized protein; n=1; ...    34   3.9  
UniRef50_Q0TZH4 Cluster: Predicted protein; n=1; Phaeosphaeria n...    34   3.9  
UniRef50_Q8TKH5 Cluster: Cell surface protein; n=3; Methanosarci...    34   3.9  
UniRef50_UPI0000499884 Cluster: hypothetical protein 25.t00008; ...    33   5.2  
UniRef50_UPI000023E712 Cluster: hypothetical protein FG04225.1; ...    33   5.2  
UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestin...    33   5.2  
UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1; ...    33   5.2  
UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H...    33   5.2  
UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, wh...    33   5.2  
UniRef50_UPI0000DB6CBD Cluster: PREDICTED: similar to rhinoceros...    33   6.9  
UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein...    33   6.9  
UniRef50_Q8IKV2 Cluster: Putative uncharacterized protein; n=1; ...    33   6.9  
UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep...    33   6.9  
UniRef50_Q7R5X2 Cluster: GLP_81_104117_102504; n=1; Giardia lamb...    33   6.9  
UniRef50_Q292E5 Cluster: GA10327-PA; n=1; Drosophila pseudoobscu...    33   6.9  
UniRef50_Q0IEH6 Cluster: Putative uncharacterized protein; n=1; ...    33   6.9  
UniRef50_A7SW33 Cluster: Predicted protein; n=3; Eumetazoa|Rep: ...    33   6.9  
UniRef50_A0DKP6 Cluster: Chromosome undetermined scaffold_54, wh...    33   6.9  
UniRef50_A3LZM2 Cluster: Predicted protein; n=1; Pichia stipitis...    33   6.9  
UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Re...    33   9.1  
UniRef50_Q489L3 Cluster: Putative uncharacterized protein; n=1; ...    33   9.1  
UniRef50_Q9BL26 Cluster: Putative uncharacterized protein; n=4; ...    33   9.1  
UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cy...    33   9.1  
UniRef50_Q6CS17 Cluster: Similarities with sp|Q25662 Plasmodium ...    33   9.1  
UniRef50_A4RJ84 Cluster: Putative uncharacterized protein; n=2; ...    33   9.1  

>UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15)
           [Contains: Cathepsin L heavy chain; Cathepsin L light
           chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC
           3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain] - Sarcophaga peregrina (Flesh fly)
           (Boettcherisca peregrina)
          Length = 339

 Score =  128 bits (309), Expect = 1e-28
 Identities = 57/99 (57%), Positives = 72/99 (72%)
 Frame = +2

Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVR 436
           E++H IAKHNQ +  G VSYKLG+NKY DMLHHEF +TMNG+N T    + L  +   + 
Sbjct: 54  ENRHKIAKHNQLFAQGKVSYKLGLNKYADMLHHEFKETMNGYNHTL---RQLMRERTGLV 110

Query: 437 GAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           GA +I PA+V +P+ VDWR+HGAV   KDQG CGSCW+F
Sbjct: 111 GATYIPPAHVTVPKSVDWREHGAVTGVKDQGHCGSCWAF 149



 Score = 53.6 bits (123), Expect = 5e-06
 Identities = 21/31 (67%), Positives = 26/31 (83%)
 Frame = +3

Query: 162 DLVKEEWSAFKLQHRLNYESEVEDNFRMKIY 254
           DL+KEEW  +KLQHR NY +EVE+ FRMKI+
Sbjct: 22  DLIKEEWHTYKLQHRKNYANEVEERFRMKIF 52



 Score = 43.2 bits (97), Expect = 0.006
 Identities = 24/42 (57%), Positives = 29/42 (69%)
 Frame = +1

Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLI 690
           GALEGQHFR++G LVS L  +  +DCS    GNNG   GGL+
Sbjct: 153 GALEGQHFRKAGVLVS-LSEQNLVDCS-TKYGNNGCN-GGLM 191


>UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3;
           Bilateria|Rep: Cathepsin L-like cysteine proteinase -
           Longidorus elongatus
          Length = 358

 Score = 91.1 bits (216), Expect = 2e-17
 Identities = 46/98 (46%), Positives = 58/98 (59%)
 Frame = +2

Query: 260 HKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRG 439
           HK +I +HN +YE G  S+ L +NK+ DM + EF + MNGF   AK  K    +     G
Sbjct: 71  HK-VIEQHNIEYEAGQHSFALSLNKFADMTNAEFRQRMNGFKLPAKR-KLAKSQPLKEDG 128

Query: 440 AKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
             F  P NV +P+ VDWRK G V   KDQG CGSCW+F
Sbjct: 129 MIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCWAF 166



 Score = 34.7 bits (76), Expect = 2.3
 Identities = 18/37 (48%), Positives = 25/37 (67%), Gaps = 2/37 (5%)
 Frame = +1

Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDC--SGAVTGNNG 669
           G+LEGQH++Q+G LVS L  +  +DC  +G   G NG
Sbjct: 170 GSLEGQHYKQTGKLVS-LSEQNLVDCDVNGDDEGCNG 205


>UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin
           L-like protease; n=1; Nasonia vitripennis|Rep:
           PREDICTED: similar to cathepsin L-like protease -
           Nasonia vitripennis
          Length = 353

 Score = 89.4 bits (212), Expect = 7e-17
 Identities = 43/101 (42%), Positives = 65/101 (64%), Gaps = 2/101 (1%)
 Frame = +2

Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVR 436
           E++  IA+HNQK+++GL +YK+ +N++GDM+  E+   M+  N T    K +       R
Sbjct: 66  ENQRKIAEHNQKHDLGLFTYKVRINQFGDMMFEEYKNYMHAANNTITQLKRI------PR 119

Query: 437 GAKFISPANVK-LPEQVDWRKHGAVPTFKDQG-KCGSCWSF 553
           G +FI P + + +PE VDWR+ GAV   +DQG  CGSCW+F
Sbjct: 120 GDEFIKPKSAENVPEHVDWRQRGAVTPVRDQGLTCGSCWAF 160



 Score = 38.7 bits (86), Expect = 0.14
 Identities = 12/27 (44%), Positives = 22/27 (81%)
 Frame = +3

Query: 174 EEWSAFKLQHRLNYESEVEDNFRMKIY 254
           ++W+AFKL+++ NY  +VE+NFR  ++
Sbjct: 38  DDWAAFKLRYKKNYNGDVEENFRRSVF 64


>UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin
           L - Misgurnus mizolepis (Mud loach)
          Length = 337

 Score = 87.8 bits (208), Expect = 2e-16
 Identities = 40/91 (43%), Positives = 58/91 (63%)
 Frame = +2

Query: 281 HNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPA 460
           HN ++ MG+ +Y+LGMN +GDM H EF + MNG+    KH      KG     + F+ P 
Sbjct: 62  HNLEHSMGIHTYRLGMNHFGDMNHEEFRQVMNGY----KHKTERKFKG-----SLFMEPN 112

Query: 461 NVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            +++P ++DWR+ G V   KDQG+CGSCW+F
Sbjct: 113 FLEVPSKLDWREKGYVTPVKDQGECGSCWAF 143



 Score = 35.9 bits (79), Expect = 0.98
 Identities = 24/48 (50%), Positives = 29/48 (60%)
 Frame = +1

Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLIGXTAFQ 708
           GA+EGQ FR+ G LVS L  +  +DCS    GN G   GGL+   AFQ
Sbjct: 147 GAMEGQMFRKQGKLVS-LSEQNLVDCS-RPEGNEGCN-GGLM-DQAFQ 190


>UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC06231 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 372

 Score = 79.4 bits (187), Expect = 8e-14
 Identities = 36/92 (39%), Positives = 57/92 (61%)
 Frame = +2

Query: 278 KHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISP 457
           +HN+ Y+ G  +YK+G+N + D   +E  K + G+    +  K         +G+ FIS 
Sbjct: 95  EHNRAYQEGKATYKMGVNNFTDKTEYELRK-LRGYRSACRIAKP--------KGSTFISS 145

Query: 458 ANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            + KLP++VDWR++GAV   K+QG+CGSCW+F
Sbjct: 146 EHAKLPDRVDWRRNGAVTPVKNQGQCGSCWAF 177



 Score = 39.5 bits (88), Expect = 0.079
 Identities = 24/48 (50%), Positives = 33/48 (68%)
 Frame = +1

Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLIGXTAFQ 708
           GA+EGQH+R++  LV+ L  +  IDCS +  GNNG + GGL+   AFQ
Sbjct: 181 GAIEGQHYRKTNRLVN-LSEQQLIDCSKSY-GNNGCE-GGLM-DLAFQ 224


>UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase;
           n=21; Bilateria|Rep: Cathepsin L-like cysteine
           proteinase - Globodera pallida
          Length = 379

 Score = 74.5 bits (175), Expect = 2e-12
 Identities = 43/122 (35%), Positives = 62/122 (50%), Gaps = 9/122 (7%)
 Frame = +2

Query: 215 RKRGRRQFPHEDIPEH--------KHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKT 370
           +K GR+ +  +D+           K  I KHNQ Y  G V++++G N   D+   E+ K 
Sbjct: 75  QKHGRKAYADQDVENERMLTYLSAKQFIDKHNQAYIEGKVTFRVGENHIADLPFSEY-KK 133

Query: 371 MNGFNKTAKHNKNLYMKGGSVRGAKFISPANV-KLPEQVDWRKHGAVPTFKDQGKCGSCW 547
           +NG+ +    N            + F++P NV  LPE VDWR  G V   K+QG CGSCW
Sbjct: 134 LNGYRRLLGDNLRR-------NASTFLAPMNVGDLPESVDWRDKGWVTEVKNQGMCGSCW 186

Query: 548 SF 553
           +F
Sbjct: 187 AF 188



 Score = 36.3 bits (80), Expect = 0.74
 Identities = 24/48 (50%), Positives = 29/48 (60%)
 Frame = +1

Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLIGXTAFQ 708
           GALE QH RQ+G L+S L  +  IDCS    GN G   GG++   AFQ
Sbjct: 192 GALEAQHARQTGQLIS-LSEQNLIDCSKKY-GNMGCN-GGIM-DNAFQ 235


>UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15)
           (Major excreted protein) (MEP) [Contains: Cathepsin L
           heavy chain; Cathepsin L light chain]; n=19;
           Eukaryota|Rep: Cathepsin L precursor (EC 3.4.22.15)
           (Major excreted protein) (MEP) [Contains: Cathepsin L
           heavy chain; Cathepsin L light chain] - Homo sapiens
           (Human)
          Length = 333

 Score = 74.1 bits (174), Expect = 3e-12
 Identities = 37/95 (38%), Positives = 49/95 (51%)
 Frame = +2

Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
           +I  HNQ+Y  G  S+ + MN +GDM   EF + MNGF                 +G  F
Sbjct: 58  MIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPR-----------KGKVF 106

Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
             P   + P  VDWR+ G V   K+QG+CGSCW+F
Sbjct: 107 QEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAF 141



 Score = 39.9 bits (89), Expect = 0.060
 Identities = 25/48 (52%), Positives = 31/48 (64%)
 Frame = +1

Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLIGXTAFQ 708
           GALEGQ FR++G L+S L  +  +DCSG   GN G   GGL+   AFQ
Sbjct: 145 GALEGQMFRKTGRLIS-LSEQNLVDCSGP-QGNEGCN-GGLMDY-AFQ 188


>UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin
           heavy chain; n=3; Amniota|Rep: PREDICTED: similar to
           ferritin heavy chain - Ornithorhynchus anatinus
          Length = 338

 Score = 73.7 bits (173), Expect = 4e-12
 Identities = 36/95 (37%), Positives = 56/95 (58%)
 Frame = +2

Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
           +I +HN++   G  SY+L MN +GD  + E  + +NGF    + +    ++ G  + A+F
Sbjct: 58  VIERHNEEMSQGKHSYRLAMNHFGDQTNEELHERLNGF----RPDLGGALRSGREQ-ARF 112

Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            S  + + PE+VDWR  G V   K+QG CGSCW+F
Sbjct: 113 RSKTSWEGPEEVDWRTKGYVTPVKNQGLCGSCWAF 147


>UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3;
           Curculionidae|Rep: Cysteine proteinase - Hypera postica
           (alfalfa weevil)
          Length = 324

 Score = 73.7 bits (173), Expect = 4e-12
 Identities = 43/96 (44%), Positives = 51/96 (53%), Gaps = 2/96 (2%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNL--YMKGGSVRGAK 445
           I  HN  YE G VSYK G+NK+ DM   EF KTM   + + K       Y+K G      
Sbjct: 57  IEAHNALYEQGKVSYKKGINKFTDMSQEEF-KTMLTLSASRKPTLETTSYVKTG------ 109

Query: 446 FISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
                 V++P  VDWRK G V   KDQG CGSCW+F
Sbjct: 110 ------VEIPSSVDWRKEGRVTGVKDQGDCGSCWAF 139


>UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor;
           n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase
           precursor - Diabrotica virgifera virgifera (western corn
           rootworm)
          Length = 326

 Score = 73.7 bits (173), Expect = 4e-12
 Identities = 38/94 (40%), Positives = 53/94 (56%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           I  HN KY+ GL ++KLG+ K+ D+   EF   M G +++ K ++         R    +
Sbjct: 54  IENHNDKYDHGLSTFKLGVTKFADLTEKEF-SDMLGISRSTKSSRP--------RVIHSL 104

Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           +P    LP + DWR+ GAV   KDQG CGSCWSF
Sbjct: 105 TPVK-DLPSKFDWREKGAVTEVKDQGSCGSCWSF 137


>UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine
           proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like
           midgut cysteine proteinase - Tenebrio molitor (Yellow
           mealworm)
          Length = 330

 Score = 73.3 bits (172), Expect = 5e-12
 Identities = 42/95 (44%), Positives = 56/95 (58%), Gaps = 1/95 (1%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMN-GFNKTAKHNKNLYMKGGSVRGAKF 448
           IA+HN K+E G V+Y   MN++GDM   EF+  +N G  +  KH +NL M         +
Sbjct: 59  IAEHNAKFEKGEVTYSKAMNQFGDMSKEEFLAYVNRGKAQKPKHPENLRM--------PY 110

Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           +S +   L   VDWR + AV   KDQG+CGSCWSF
Sbjct: 111 VS-SKKPLAASVDWRSN-AVSEVKDQGQCGSCWSF 143



 Score = 33.1 bits (72), Expect = 6.9
 Identities = 13/30 (43%), Positives = 20/30 (66%)
 Frame = +3

Query: 165 LVKEEWSAFKLQHRLNYESEVEDNFRMKIY 254
           L +E+WS FKL H+ +Y S +E+  R  I+
Sbjct: 23  LFQEQWSQFKLTHKKSYSSPIEEIRRQLIF 52


>UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;
           Brugia malayi|Rep: Cahepsin L-like cysteine protease -
           Brugia malayi (Filarial nematode worm)
          Length = 371

 Score = 73.3 bits (172), Expect = 5e-12
 Identities = 39/94 (41%), Positives = 53/94 (56%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           I KHN++YE    +Y+L +N   DML  EF K ++GF      +KN +    ++R     
Sbjct: 85  IEKHNERYERNEETYELAINHLADMLPEEFRK-LHGFQSRKITSKNNFKN--TIR----- 136

Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
              N  LP+ +DWR  GAV   KDQG CGSCW+F
Sbjct: 137 MKINGPLPKSIDWRTSGAVTKVKDQGYCGSCWTF 170



 Score = 40.3 bits (90), Expect = 0.045
 Identities = 22/43 (51%), Positives = 27/43 (62%)
 Frame = +1

Query: 562 LGALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLI 690
           +GALEGQHF Q+G LV  L  +  +DCS    GN G   GGL+
Sbjct: 173 VGALEGQHFLQTGKLVE-LSMQNLLDCSDDTYGNYGCD-GGLM 213


>UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean
           endopeptidase) (Cysteine proteinase)
           (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
           Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor
           (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase)
           (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
           Vignain-2] - Vigna mungo (Rice bean) (Black gram)
          Length = 362

 Score = 72.5 bits (170), Expect = 9e-12
 Identities = 35/80 (43%), Positives = 45/80 (56%)
 Frame = +2

Query: 314 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 493
           YKL +NK+ DM +HEF  T  G    +K N +   +G       F+      +P  VDWR
Sbjct: 80  YKLKLNKFADMTNHEFRSTYAG----SKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWR 135

Query: 494 KHGAVPTFKDQGKCGSCWSF 553
           K GAV   KDQG+CGSCW+F
Sbjct: 136 KKGAVTDVKDQGQCGSCWAF 155


>UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4;
           core eudicotyledons|Rep: Papain-like cysteine peptidase
           XBCP3 - Arabidopsis thaliana (Mouse-ear cress)
          Length = 437

 Score = 71.7 bits (168), Expect = 2e-11
 Identities = 35/81 (43%), Positives = 49/81 (60%)
 Frame = +2

Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
           +Y L +N + D+ HHEF  +  G + +A  +  +  KG S+ G+       VK+P+ VDW
Sbjct: 73  TYSLSLNAFADLTHHEFKASRLGLSVSAP-SVIMASKGQSLGGS-------VKVPDSVDW 124

Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
           RK GAV   KDQG CG+CWSF
Sbjct: 125 RKKGAVTNVKDQGSCGACWSF 145


>UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2;
           Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio
           molitor (Yellow mealworm)
          Length = 336

 Score = 71.7 bits (168), Expect = 2e-11
 Identities = 37/93 (39%), Positives = 51/93 (54%), Gaps = 1/93 (1%)
 Frame = +2

Query: 278 KHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFIS- 454
           +HN+KY  GLVSY LG+N + DM   E     +G    A  +KN    G  ++  + +  
Sbjct: 60  EHNEKYRQGLVSYTLGVNLFTDMTPEEMKAYTHGLIMPADLHKN----GIPIKTREDLGL 115

Query: 455 PANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            A+V+ P   DWR  G V   K+QG CGSCW+F
Sbjct: 116 NASVRYPASFDWRDQGMVSPVKNQGSCGSCWAF 148


>UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           L-like proteinase precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 317

 Score = 70.5 bits (165), Expect = 4e-11
 Identities = 34/94 (36%), Positives = 55/94 (58%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           I +HN +Y+ G VS+ LG+N++ DM   EF K M       K  +++         ++F+
Sbjct: 47  IEQHNARYQNGEVSFYLGVNQFADMTSEEF-KAMLDSQLIHKPKRDIT--------SRFV 97

Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           +   + +PE +DWR+ GAV   +DQ +CGSCW+F
Sbjct: 98  ADPQLTVPESIDWREKGAVNPVRDQEQCGSCWAF 131


>UniRef50_Q23RT7 Cluster: Papain family cysteine protease containing
           protein; n=7; Hymenostomatida|Rep: Papain family
           cysteine protease containing protein - Tetrahymena
           thermophila SB210
          Length = 387

 Score = 69.3 bits (162), Expect = 9e-11
 Identities = 36/81 (44%), Positives = 45/81 (55%), Gaps = 1/81 (1%)
 Frame = +2

Query: 314 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK-LPEQVDW 490
           YK G+N++ D    E  +T  G++KT K+  N   K    R  K     NVK LP+ VDW
Sbjct: 83  YKKGINQFTDRTAEELRETTLGYSKTVKNAAN---KQNMFRNLKTSDKINVKDLPKSVDW 139

Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
           R  G V   KDQG CGSCW+F
Sbjct: 140 RDAGVVTPVKDQGHCGSCWAF 160


>UniRef50_P25774 Cluster: Cathepsin S precursor; n=78;
           Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens
           (Human)
          Length = 331

 Score = 69.3 bits (162), Expect = 9e-11
 Identities = 35/91 (38%), Positives = 48/91 (52%)
 Frame = +2

Query: 281 HNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPA 460
           HN ++ MG+ SY LGMN  GDM   E +  M+     ++  +N+  K          S  
Sbjct: 62  HNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNITYK----------SNP 111

Query: 461 NVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           N  LP+ VDWR+ G V   K QG CG+CW+F
Sbjct: 112 NRILPDSVDWREKGCVTEVKYQGSCGACWAF 142



 Score = 32.7 bits (71), Expect = 9.1
 Identities = 22/49 (44%), Positives = 29/49 (59%)
 Frame = +1

Query: 562 LGALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLIGXTAFQ 708
           +GALE Q   ++G LVSL  ++  +DCS    GN G   GG +  TAFQ
Sbjct: 145 VGALEAQLKLKTGKLVSL-SAQNLVDCSTEKYGNKGCN-GGFM-TTAFQ 190


>UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precursor;
           n=3; Metazoa|Rep: Digestive cysteine proteinase 2
           precursor - Homarus americanus (American lobster)
          Length = 323

 Score = 68.5 bits (160), Expect = 1e-10
 Identities = 48/134 (35%), Positives = 66/134 (49%), Gaps = 12/134 (8%)
 Frame = +2

Query: 188 LQAAAPSQLRKRGR--RQFPHEDIPEHKHIIAKHNQKY--------EMGLVSYKLGMNKY 337
           L AA+PS    +G+  RQ+   +   ++ +I + NQKY        E G V++ L MNK+
Sbjct: 13  LAAASPSWEHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKF 72

Query: 338 GDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPE--QVDWRKHGAVP 511
           GDM   EF   M G         N+  +   V       P     P+  +VDWR  GAV 
Sbjct: 73  GDMTLEEFNAVMKG---------NIPRRSAPV---SVFYPKKETGPQATEVDWRTKGAVT 120

Query: 512 TFKDQGKCGSCWSF 553
             KDQG+CGSCW+F
Sbjct: 121 PVKDQGQCGSCWAF 134



 Score = 32.7 bits (71), Expect = 9.1
 Identities = 14/27 (51%), Positives = 20/27 (74%)
 Frame = +1

Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCS 645
           G+LEGQHF ++G L+SL   +  +DCS
Sbjct: 138 GSLEGQHFLKTGSLISLAEQQ-LVDCS 163


>UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor;
           n=35; Fasciola|Rep: Cathepsin L-like proteinase
           precursor - Fasciola hepatica (Liver fluke)
          Length = 326

 Score = 68.5 bits (160), Expect = 1e-10
 Identities = 35/97 (36%), Positives = 53/97 (54%)
 Frame = +2

Query: 263 KHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGA 442
           KHI  +HN ++++GLV+Y LG+N++ DM   EF          AK+   +      +   
Sbjct: 49  KHI-QEHNLRHDLGLVTYTLGLNQFTDMTFEEF---------KAKYLTEMSRASDILSHG 98

Query: 443 KFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
                 N  +P+++DWR+ G V   KDQG CGSCW+F
Sbjct: 99  VPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAF 135


>UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep:
           Cathepsin K - Danio rerio (Zebrafish) (Brachydanio
           rerio)
          Length = 333

 Score = 67.7 bits (158), Expect = 3e-10
 Identities = 36/95 (37%), Positives = 53/95 (55%), Gaps = 1/95 (1%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           I  HN++YE+G+ +Y LGMN +GDM   E  + + G          +Y    +     F+
Sbjct: 61  IEAHNKEYELGIHTYDLGMNHFGDMTLEEVAEKVMGLQMP------MYRDPANT----FV 110

Query: 452 SPANV-KLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
               V KLP+ +D+RK G V + K+QG CGSCW+F
Sbjct: 111 PDDRVGKLPKSIDYRKLGYVTSVKNQGSCGSCWAF 145


>UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcoptes
           scabiei type hominis|Rep: Cathepsin L-like protease -
           Sarcoptes scabiei type hominis
          Length = 245

 Score = 67.7 bits (158), Expect = 3e-10
 Identities = 36/94 (38%), Positives = 54/94 (57%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           I KHN+KYE GL +Y+LG+N++ D+ + E+   MN      KH+    ++   V   + +
Sbjct: 64  IRKHNEKYEAGLSTYELGVNQFTDLTNKEYNDQMNRLK--VKHD----VQSEHVFDNEDV 117

Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           S     LP++VDW     V   KDQ +CGSCW+F
Sbjct: 118 S----DLPDEVDWTLKNVVAPIKDQKQCGSCWAF 147


>UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes
           abbreviatus|Rep: Cathepsin L 2 - Diaprepes abbreviatus
           (Sugarcane rootstalk borer weevil)
          Length = 348

 Score = 67.7 bits (158), Expect = 3e-10
 Identities = 39/104 (37%), Positives = 54/104 (51%), Gaps = 10/104 (9%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGG------SV 433
           I +HN+ YEMGL SY++ MN  GD+   EF++           ++NL            +
Sbjct: 59  INEHNKLYEMGLSSYQMAMNHLGDLTKDEFMRIYTVNMPQLPQSENLSDSEPWLDLPQDL 118

Query: 434 RG-AKFISPAN---VKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           +G   +  P N   V LP  +DWR+ GAV   K+Q  CGSCWSF
Sbjct: 119 QGFVTYALPTNLDEVDLPTDIDWRQKGAVTPVKNQRNCGSCWSF 162



 Score = 38.3 bits (85), Expect = 0.18
 Identities = 14/31 (45%), Positives = 22/31 (70%)
 Frame = +3

Query: 165 LVKEEWSAFKLQHRLNYESEVEDNFRMKIYL 257
           LV+E+W  FKL+H   YESE E+ +R  +++
Sbjct: 23  LVQEQWEQFKLEHGKVYESESENEYRQSVFM 53


>UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar to
           human SRY (sex determining region Y)-box 30
           (SOX30),transcript variant 1,; n=3; Mammalia|Rep: Testis
           cDNA clone: QtsA-12228, similar to human SRY (sex
           determining region Y)-box 30 (SOX30),transcript variant
           1, - Macaca fascicularis (Crab eating macaque)
           (Cynomolgus monkey)
          Length = 433

 Score = 67.3 bits (157), Expect = 3e-10
 Identities = 36/95 (37%), Positives = 50/95 (52%)
 Frame = +2

Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
           +I  HN +Y  G   + + MN +GDM + EF + M  F      N+ L       +G  F
Sbjct: 58  MIELHNGEYSQGKHGFAMAMNAFGDMTNEEFRQVMGCFR-----NQKLR------KGKLF 106

Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
             P  + LP+ VDWRK G V   K+Q +CGSCW+F
Sbjct: 107 REPLFLDLPKSVDWRKKGYVTPVKNQKQCGSCWAF 141



 Score = 35.5 bits (78), Expect = 1.3
 Identities = 20/39 (51%), Positives = 24/39 (61%)
 Frame = +1

Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPG 681
           GALEGQ FR++G LVS L  +  +DCS    GN G   G
Sbjct: 145 GALEGQMFRKTGKLVS-LSEQNLVDCSHP-QGNQGCNGG 181


>UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain;
           n=16; Chrysomelidae|Rep: Digestive cysteine protease
           intestain - Leptinotarsa decemlineata (Colorado potato
           beetle)
          Length = 326

 Score = 66.9 bits (156), Expect = 5e-10
 Identities = 32/94 (34%), Positives = 51/94 (54%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           I +HN +Y+ G  +Y LG+ ++ D+ H EF   + G  K    NK        +     +
Sbjct: 54  IKEHNARYDKGEETYLLGVTRFADLTHEEFKDILKGQIK----NKP------RLNATPTV 103

Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            P ++++P+ +DW + GAV   KDQ  CGSCW+F
Sbjct: 104 FPEDLEVPDSIDWTEKGAVLEVKDQNPCGSCWAF 137


>UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42;
           Coelomata|Rep: Cathepsin L2 precursor - Homo sapiens
           (Human)
          Length = 334

 Score = 66.9 bits (156), Expect = 5e-10
 Identities = 36/95 (37%), Positives = 50/95 (52%)
 Frame = +2

Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
           +I  HN +Y  G   + + MN +GDM + EF + M  F       +N   + G V    F
Sbjct: 58  MIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCF-------RNQKFRKGKV----F 106

Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
             P  + LP+ VDWRK G V   K+Q +CGSCW+F
Sbjct: 107 REPLFLDLPKSVDWRKKGYVTPVKNQKQCGSCWAF 141



 Score = 35.9 bits (79), Expect = 0.98
 Identities = 24/48 (50%), Positives = 29/48 (60%)
 Frame = +1

Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLIGXTAFQ 708
           GALEGQ FR++G LVS L  +  +DCS    GN G   GG +   AFQ
Sbjct: 145 GALEGQMFRKTGKLVS-LSEQNLVDCS-RPQGNQGCN-GGFMA-RAFQ 188


>UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin L
           preproprotein; n=1; Monodelphis domestica|Rep:
           PREDICTED: similar to cathepsin L preproprotein -
           Monodelphis domestica
          Length = 356

 Score = 66.5 bits (155), Expect = 6e-10
 Identities = 34/95 (35%), Positives = 52/95 (54%)
 Frame = +2

Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
           +I  HN+ ++ G  SY +GMN++GDM   EF   +N      +  +N   K    R   +
Sbjct: 58  LINDHNRLFKEGKKSYFMGMNQFGDMTDKEFESRLNLRIAPVRTRRNYTFK----RRIYY 113

Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
                 +LP+ VDWR HG V   ++QG+CG+CW+F
Sbjct: 114 ------RLPKSVDWRTHGYVTPIRNQGECGACWAF 142



 Score = 36.7 bits (81), Expect = 0.56
 Identities = 20/42 (47%), Positives = 26/42 (61%)
 Frame = +1

Query: 562 LGALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGL 687
           +G+LEGQ FR++G LV L + +  IDCSG  T   G   G L
Sbjct: 145 IGSLEGQLFRKTGRLVELSK-QMLIDCSGYYTCMGGSLTGAL 185


>UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome
           shotgun sequence; n=3; Tetraodontidae|Rep: Chromosome 12
           SCAF14996, whole genome shotgun sequence - Tetraodon
           nigroviridis (Green puffer)
          Length = 362

 Score = 66.5 bits (155), Expect = 6e-10
 Identities = 37/87 (42%), Positives = 47/87 (54%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           I  HN ++ MG  SY+LGMN +GDM H EF + MNG+    KH           RG+ F+
Sbjct: 58  IELHNLEHSMGQHSYRLGMNHFGDMTHEEFRQIMNGY----KHKPQ-----RKFRGSLFM 108

Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGK 532
            P  ++ P  VDWR  G V   KDQ K
Sbjct: 109 EPNFLEAPRAVDWRDKGYVTPVKDQLK 135



 Score = 42.7 bits (96), Expect = 0.009
 Identities = 30/62 (48%), Positives = 35/62 (56%)
 Frame = +1

Query: 523 PREVWLMLVLSARLGALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLIGXTA 702
           P  VWL+L L    G   GQHFRQ+G LVS L  +  +DCS    GN G   GGL+   A
Sbjct: 166 PGSVWLLLGLQHHRGP-GGQHFRQTGKLVS-LSEQNLVDCS-RPEGNEGCN-GGLM-DQA 220

Query: 703 FQ 708
           FQ
Sbjct: 221 FQ 222


>UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:
           Cathepsin - Petromyzon marinus (Sea lamprey)
          Length = 333

 Score = 66.1 bits (154), Expect = 8e-10
 Identities = 39/94 (41%), Positives = 49/94 (52%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           + +HN   + G VS+ LG+NKY D+  HE+        K      NL   G   RGA F 
Sbjct: 58  VLQHNLLADEGNVSFHLGINKYSDLELHEY------HEKVVGRFWNL-RNGTRRRGAPFP 110

Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
             +   LPEQVDWR  G V   K+QG CGS W+F
Sbjct: 111 LRSMDNLPEQVDWRLKGYVTPVKEQGLCGSSWAF 144


>UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|Rep:
           CG4847-PD, isoform D - Drosophila melanogaster (Fruit
           fly)
          Length = 420

 Score = 66.1 bits (154), Expect = 8e-10
 Identities = 28/97 (28%), Positives = 50/97 (51%)
 Frame = +2

Query: 263 KHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGA 442
           K+++   N  +  G+ ++K  +N + D+ H EF+  + G  ++ +       K  +    
Sbjct: 140 KNLVEAGNAAFAQGVHTFKQAVNAFADLTHSEFLSQLTGLKRSPE------AKARAAASL 193

Query: 443 KFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           K ++     +P+  DWR+HG V   K QG CGSCW+F
Sbjct: 194 KLVNLPAKPIPDAFDWREHGGVTPVKFQGTCGSCWAF 230


>UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor;
           n=176; Viridiplantae|Rep: Cysteine proteinase RD21a
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 462

 Score = 66.1 bits (154), Expect = 8e-10
 Identities = 40/121 (33%), Positives = 64/121 (52%)
 Frame = +2

Query: 191 QAAAPSQLRKRGRRQFPHEDIPEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKT 370
           +A + + L ++ RR    E   ++   + +HN+K     +SY+LG+ ++ D+ + E+   
Sbjct: 59  KAQSQNSLVEKDRR---FEIFKDNLRFVDEHNEKN----LSYRLGLTRFADLTNDEYRSK 111

Query: 371 MNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWS 550
             G    AK  K    KG      ++ +    +LPE +DWRK GAV   KDQG CGSCW+
Sbjct: 112 YLG----AKMEK----KGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWA 163

Query: 551 F 553
           F
Sbjct: 164 F 164


>UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1;
           Phytophthora infestans|Rep: Cathepsin-like cysteine
           protease - Phytophthora infestans (Potato late blight
           fungus)
          Length = 376

 Score = 65.7 bits (153), Expect = 1e-09
 Identities = 34/95 (35%), Positives = 50/95 (52%), Gaps = 1/95 (1%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           I  HN+ YE G  S+ LG+N   D+   E+ + ++   + +K          S     F+
Sbjct: 75  IQTHNEAYERGEHSFTLGLNDLADLADAEYKQLLSYRTRDSK---------SSSASETFV 125

Query: 452 SPANVK-LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            P NV+ LP   DWR+H  V   K+QG+CGSCW+F
Sbjct: 126 KPENVEDLPATWDWREHSTVTPVKNQGQCGSCWAF 160


>UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1;
           Oryza sativa (japonica cultivar-group)|Rep: Putative
           uncharacterized protein - Oryza sativa subsp. japonica
           (Rice)
          Length = 326

 Score = 65.7 bits (153), Expect = 1e-09
 Identities = 42/120 (35%), Positives = 59/120 (49%)
 Frame = +2

Query: 194 AAAPSQLRKRGRRQFPHEDIPEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTM 373
           +++P  L  +G R    E   ++   I   N+K  M   SYKLG+NK+ D+   EF    
Sbjct: 37  SSSPRDLADKGSR---FEVFKKNARYIHDFNRKKGM---SYKLGLNKFADLTLEEFTAKY 90

Query: 374 NGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            G N          +K G+  G+  ++      P   DWR+HGAV   KDQG CGSCW+F
Sbjct: 91  TGANPGPITG----LKNGT--GSPPLAAVAGDAPPAWDWREHGAVTRVKDQGPCGSCWAF 144


>UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2;
           Platyhelminthes|Rep: Cathepsin L-like proteinase -
           Echinococcus multilocularis
          Length = 338

 Score = 65.7 bits (153), Expect = 1e-09
 Identities = 31/91 (34%), Positives = 45/91 (49%)
 Frame = +2

Query: 281 HNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPA 460
           HN++Y +GL +Y   +N + D+   EF +      +T        M    V       P 
Sbjct: 64  HNERYYLGLETYSTALNAFADLTLEEFAEKYLTLKQTPMEGIWQDMSTQYVE-----RPT 118

Query: 461 NVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            + +P+ +DWRK G V   KDQG CGSCW+F
Sbjct: 119 RMLVPDSIDWRKKGLVTPIKDQGDCGSCWAF 149



 Score = 34.3 bits (75), Expect = 3.0
 Identities = 19/41 (46%), Positives = 25/41 (60%)
 Frame = +1

Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGL 687
           GALEGQ  R++G L+S L  +  +DCS   TGN G   G +
Sbjct: 153 GALEGQLKRKTGKLIS-LSEQQLVDCS-TYTGNEGCNGGDM 191


>UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Rep:
           Cysteine protease - Solanum lycopersicum (Tomato)
           (Lycopersicon esculentum)
          Length = 345

 Score = 65.3 bits (152), Expect = 1e-09
 Identities = 38/116 (32%), Positives = 56/116 (48%), Gaps = 5/116 (4%)
 Frame = +2

Query: 221 RGRRQFPHEDIPEHKHIIAKHNQKY-----EMGLVSYKLGMNKYGDMLHHEFVKTMNGFN 385
           R  R +  E     + +I K N K+     + G +SYKLGMN++ D+   EF+    G N
Sbjct: 45  RHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLN 104

Query: 386 KTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
               +     M   S    K    ++  +P  +DWR+ GAV   K QG+CG CW+F
Sbjct: 105 IPNSYLSPSPMS--STEFKKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAF 158


>UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3;
           Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber
           officinale (Ginger)
          Length = 475

 Score = 65.3 bits (152), Expect = 1e-09
 Identities = 31/104 (29%), Positives = 56/104 (53%), Gaps = 1/104 (0%)
 Frame = +2

Query: 245 EDIPEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEF-VKTMNGFNKTAKHNKNLYMK 421
           E   E+   + +HN   + G  +Y+LGMN++ D+ + E+  + +   ++  +        
Sbjct: 74  EVFKENLRFVDEHNAAADRGEHAYRLGMNRFADLTNEEYRARFLRDLSRLGRSTS----- 128

Query: 422 GGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            G +     +   +V LP+ +DWR+ GAV   K+QG+CGSCW+F
Sbjct: 129 -GEISNQYRLREGDV-LPDSIDWREKGAVVAVKNQGRCGSCWAF 170


>UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain;
           n=9; Cucujiformia|Rep: Digestive cysteine proteinase
           intestain - Leptinotarsa decemlineata (Colorado potato
           beetle)
          Length = 326

 Score = 65.3 bits (152), Expect = 1e-09
 Identities = 35/94 (37%), Positives = 49/94 (52%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           I +HN KY+ G  SY LG+  + D+ H EF   +    KT K N         V     +
Sbjct: 54  IEEHNAKYDKGEESYFLGVTPFADLTHDEFKDELRRQIKT-KPN---------VEATLAV 103

Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            P  +++P+ +DW + GAV   K QG CGSCW+F
Sbjct: 104 FPEGLEVPDSIDWTQKGAVLDVKYQGGCGSCWAF 137


>UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9;
           Onchocercidae|Rep: Cathepsin L-like precursor - Brugia
           pahangi (Filarial nematode worm)
          Length = 395

 Score = 64.9 bits (151), Expect = 2e-09
 Identities = 33/90 (36%), Positives = 50/90 (55%)
 Frame = +2

Query: 284 NQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPAN 463
           N+KYE GLVSY   +N   D+   EF+   NG     + +    ++G       +    +
Sbjct: 125 NKKYEQGLVSYTTALNDLADLTDEEFM-VRNGLRLPNQTD----LRGKRQTSEFYRYDKS 179

Query: 464 VKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            +LP+QVDWR  GAV   ++QG+CGSC++F
Sbjct: 180 ERLPDQVDWRTKGAVTPVRNQGECGSCYAF 209


>UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase
           precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin
           L-like cysteine proteinase precursor - Acanthoscelides
           obtectus (Bean weevil)
          Length = 321

 Score = 64.5 bits (150), Expect = 2e-09
 Identities = 34/95 (35%), Positives = 53/95 (55%), Gaps = 1/95 (1%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           I +HN++Y  G  ++++G+N++GDM   EF + +      A     + +  G       +
Sbjct: 54  IEEHNERYHNGEETFEMGINQFGDMTQEEFKRML------ALQKPQMPLPRGDE-----V 102

Query: 452 SPANVK-LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           S  NV  +P+ VDWR+ GAV   K QG CGSCW+F
Sbjct: 103 SFDNVNDIPKTVDWREKGAVTEVKKQGNCGSCWAF 137


>UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           L-like proteinase" precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 315

 Score = 63.7 bits (148), Expect = 4e-09
 Identities = 37/94 (39%), Positives = 51/94 (54%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           I +HN KYE G  +Y L +NK+ D    EF   +    + A   K  ++       AK +
Sbjct: 54  IEEHNAKYESGEETYYLAVNKFADWSSAEFQAMLA--RQMANKPKQSFI-------AKHV 104

Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           +  NV+  E+VDWR   AV   KDQG+CGSCW+F
Sbjct: 105 ADPNVQAVEEVDWRD-SAVLGVKDQGQCGSCWAF 137


>UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18;
           Schistosoma|Rep: Preprocathepsin cathepsin L -
           Schistosoma japonicum (Blood fluke)
          Length = 331

 Score = 63.3 bits (147), Expect = 6e-09
 Identities = 35/94 (37%), Positives = 50/94 (53%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           I +HN ++++GL  Y +G+N++ DM   E  + M  F K    N  L+   G+      +
Sbjct: 58  IQEHNLRHDLGLEGYTMGLNQFCDMEWEEVNRIM--FPKVFG-NSPLWNDDGNE-----L 109

Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
              N  +P   DWR HGAV   K QG CGSCW+F
Sbjct: 110 ELTNKPVPSTWDWRDHGAVTAVKHQGLCGSCWAF 143


>UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor;
           n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 355

 Score = 63.3 bits (147), Expect = 6e-09
 Identities = 34/81 (41%), Positives = 41/81 (50%)
 Frame = +2

Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
           SY LG+N++ D+ H EF     G  K     K           A F       LP+ VDW
Sbjct: 91  SYWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQ-------PSANFRYRDITDLPKSVDW 143

Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
           RK GAV   KDQG+CGSCW+F
Sbjct: 144 RKKGAVAPVKDQGQCGSCWAF 164


>UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel
           protein - Danio rerio (Zebrafish) (Brachydanio rerio)
          Length = 328

 Score = 62.9 bits (146), Expect = 7e-09
 Identities = 36/94 (38%), Positives = 50/94 (53%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           I  HN+   +GL SY LG+N+  DM   E V  MNG  +    + N          A F 
Sbjct: 58  ILLHNEAAAVGLHSYTLGLNQLSDMTADE-VNDMNGLLEEDFPDVN----------ATFS 106

Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            P+   LP++V+W +HG V   ++QG CGSCW+F
Sbjct: 107 PPSLQTLPQRVNWTEHGMVSPVQNQGPCGSCWAF 140


>UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3;
           Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays
           (Maize)
          Length = 493

 Score = 62.9 bits (146), Expect = 7e-09
 Identities = 34/98 (34%), Positives = 52/98 (53%), Gaps = 4/98 (4%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTM----NGFNKTAKHNKNLYMKGGSVRG 439
           I  HN + + GL  ++LG+ ++ D+   E+   +     G N TA          G V  
Sbjct: 103 IDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSRGRNGTAV---------GVVGR 153

Query: 440 AKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            +++  A  +LP+ VDWR+ GAV   KDQG+CG CW+F
Sbjct: 154 RRYLPLAGEQLPDAVDWRERGAVAEVKDQGQCGGCWAF 191


>UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep:
           LOC443661 protein - Xenopus laevis (African clawed frog)
          Length = 346

 Score = 62.1 bits (144), Expect = 1e-08
 Identities = 34/94 (36%), Positives = 49/94 (52%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           I  HN +Y +GL +Y++GMN  GDM   E   TM G+  +     N+       R  K +
Sbjct: 82  ITVHNLEYSLGLHTYEVGMNHLGDMTGEEVEATMTGYTSSDDSLANM------TRVPKKL 135

Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
             A  + P  +DWR  G V + + Q KCGSC++F
Sbjct: 136 LEA--QPPASIDWRTKGCVTSVRRQRKCGSCYAF 167


>UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra
           erinaceieuropaei|Rep: Cysteine proteinase - Spirometra
           erinaceieuropaei (Tapeworm)
          Length = 336

 Score = 62.1 bits (144), Expect = 1e-08
 Identities = 36/97 (37%), Positives = 50/97 (51%), Gaps = 3/97 (3%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKT---MNGFNKTAKHNKNLYMKGGSVRGA 442
           I +HNQ+Y   L SY + +N + D+   EF +    + G   T    K       SV   
Sbjct: 63  IIRHNQRYYQQLESYAVRLNDFSDLTPGEFAERYLCLRGIVLTKLRRKEAV----SV--- 115

Query: 443 KFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
               P    LP+ V+WR+ GAV + K+QG+CGSCWSF
Sbjct: 116 ----PLKENLPDSVNWRERGAVTSVKNQGQCGSCWSF 148


>UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Rep:
           Cathepsin L - Stylonychia lemnae
          Length = 340

 Score = 61.7 bits (143), Expect = 2e-08
 Identities = 37/95 (38%), Positives = 49/95 (51%), Gaps = 1/95 (1%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           I  HN + +    S+ LG N   D  H E+ K M G+    K  K +Y            
Sbjct: 73  INNHNSQNDG--TSFTLGPNHLADYTHDEY-KKMLGYKPRNKTGKEVY------------ 117

Query: 452 SPANVK-LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           S  N+K +PE +DWR+ GAV   KDQG+CGSCW+F
Sbjct: 118 STPNLKDIPESIDWREKGAVNAVKDQGQCGSCWAF 152


>UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease
           Gip1p; n=4; Tetrahymena thermophila|Rep:
           Granule-biosynthesis induced protease Gip1p -
           Tetrahymena thermophila
          Length = 345

 Score = 61.3 bits (142), Expect = 2e-08
 Identities = 31/83 (37%), Positives = 42/83 (50%), Gaps = 2/83 (2%)
 Frame = +2

Query: 311 SYKLGMNKYGDMLHHEFVKTM--NGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQV 484
           SY  G+N++ DM   EF + +     +K A  NK           +  + P N  LP  V
Sbjct: 79  SYSKGLNQFSDMTKEEFKQRVLNKKISKKASSNKGGRNLAADPAVSNLVFPTN-NLPLSV 137

Query: 485 DWRKHGAVPTFKDQGKCGSCWSF 553
           DWRK G +   K+QG CGSCW+F
Sbjct: 138 DWRKRGVLNPVKNQGTCGSCWTF 160


>UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18;
           Magnoliophyta|Rep: Thiol protease aleurain precursor -
           Arabidopsis thaliana (Mouse-ear cress)
          Length = 358

 Score = 61.3 bits (142), Expect = 2e-08
 Identities = 39/99 (39%), Positives = 51/99 (51%)
 Frame = +2

Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVR 436
           E+  +I   N+K   GL SYKLG+N++ D+   EF +T  G    A  N +  +KG    
Sbjct: 85  ENLDLIRSTNKK---GL-SYKLGVNQFADLTWQEFQRTKLG----AAQNCSATLKGSH-- 134

Query: 437 GAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
                      LPE  DWR+ G V   KDQG CGSCW+F
Sbjct: 135 -----KVTEAALPETKDWREDGIVSPVKDQGGCGSCWTF 168


>UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate
           cathepsin L; n=4; Danio rerio|Rep: Novel protein similar
           to vertebrate cathepsin L - Danio rerio (Zebrafish)
           (Brachydanio rerio)
          Length = 334

 Score = 60.9 bits (141), Expect = 3e-08
 Identities = 34/95 (35%), Positives = 47/95 (49%), Gaps = 1/95 (1%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVR-GAKF 448
           I K+N  +  GL  +K+ MNKYGD+   E+ + +    K   + K        +R  AK 
Sbjct: 57  IWKNNNDFSFGLSMFKMAMNKYGDLTSVEYKRLLGSKIKGTGNRKGKITSAQMLRLNAKR 116

Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           +   N+      D+R  G V   KDQG CGSCWSF
Sbjct: 117 LGVTNI------DYRAKGYVTEVKDQGYCGSCWSF 145


>UniRef50_Q23H32 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 365

 Score = 60.9 bits (141), Expect = 3e-08
 Identities = 33/99 (33%), Positives = 52/99 (52%)
 Frame = +2

Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVR 436
           E+ + I  +NQ  E    + +L +N++ D+   EF +   G+N + KHN     + GS +
Sbjct: 67  ENYNYIHNYNQINENSQDNIQLEVNEFADLSLQEFRELYFGYNSSKKHNN---QQNGSTK 123

Query: 437 GAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
             +     +  +PE VDWR+    P  K QG CGSCW+F
Sbjct: 124 NLRQSFLLSDSVPESVDWREKLVAPVQK-QGGCGSCWAF 161


>UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor;
           n=6; Chrysomelinae|Rep: Cathepsin L-like proteinase
           precursor - Phaedon cochleariae (Mustard beetle)
          Length = 324

 Score = 60.9 bits (141), Expect = 3e-08
 Identities = 33/93 (35%), Positives = 49/93 (52%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           IA+HN KYE G  +Y L +NK+ D+   EF + M   N+ ++ N         + G +  
Sbjct: 54  IAEHNVKYENGESTYYLAINKFSDITDEEF-RDMLMKNEASRPN---------LEGLEVA 103

Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWS 550
                  PE +DWR  G V   ++QG+CGSCW+
Sbjct: 104 DLTVGAAPESIDWRSKGVVLPVRNQGECGSCWA 136


>UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella
           histriomuscorum|Rep: Cathepsin H - Oxytricha trifallax
           (Sterkiella histriomuscorum)
          Length = 366

 Score = 60.5 bits (140), Expect = 4e-08
 Identities = 35/94 (37%), Positives = 46/94 (48%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           I KHN     G  +YK G+N + DM   EF    + +N  A+ N        S    K  
Sbjct: 82  IIKHNSD---GTNTYKKGLNAFSDMTDEEF---FDYYNIKAEQNC-------SATNRKSF 128

Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
             +N  +P + DWR  G V   K+QGKCGSCW+F
Sbjct: 129 GNSNANIPTEWDWRTFGVVSPVKNQGKCGSCWTF 162


>UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep:
           Toxopain-2 - Toxoplasma gondii
          Length = 422

 Score = 60.5 bits (140), Expect = 4e-08
 Identities = 38/113 (33%), Positives = 56/113 (49%), Gaps = 5/113 (4%)
 Frame = +2

Query: 230 RQFPHEDIPEHKHIIAKHNQKY-----EMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTA 394
           + +  E+  + ++ I K+N  Y     + G  SY L MN +GD+   EF +   GF K+ 
Sbjct: 126 KSYATEEEKQRRYAIFKNNLVYIHTHNQQGY-SYSLKMNHFGDLSRDEFRRKYLGFKKS- 183

Query: 395 KHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
              +NL      V   + ++    +LP  VDWR  G V   KDQ  CGSCW+F
Sbjct: 184 ---RNLKSHHLGV-ATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAF 232


>UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease;
           n=23; Magnoliophyta|Rep: Senescence-specific cysteine
           protease - Arabidopsis thaliana (Mouse-ear cress)
          Length = 346

 Score = 60.1 bits (139), Expect = 5e-08
 Identities = 30/81 (37%), Positives = 41/81 (50%)
 Frame = +2

Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
           ++KL +N++ D+ + EF     GF   +  +     K    R     S A   LP  VDW
Sbjct: 80  TFKLAVNQFADLTNDEFRSMYTGFKGVSALSSQSQTKMSPFRYQNVSSGA---LPVSVDW 136

Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
           RK GAV   K+QG CG CW+F
Sbjct: 137 RKKGAVTPIKNQGSCGCCWAF 157


>UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2;
           Acanthamoeba|Rep: Cysteine proteinase - Acanthamoeba
           healyi
          Length = 330

 Score = 60.1 bits (139), Expect = 5e-08
 Identities = 32/81 (39%), Positives = 44/81 (54%)
 Frame = +2

Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
           SY L MN++GD+ + EF +   G           Y K   +  A   +PA   +P + DW
Sbjct: 69  SYFLAMNQFGDLTNAEFNRLFKGLAFD-------YSKHAKIHTAAPEAPAT-GIPSEFDW 120

Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
           R+ GAV   K+QG+CGSCWSF
Sbjct: 121 RQKGAVTHVKNQGQCGSCWSF 141


>UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823
           protein, partial; n=1; Ornithorhynchus anatinus|Rep:
           PREDICTED: similar to MGC81823 protein, partial -
           Ornithorhynchus anatinus
          Length = 361

 Score = 59.7 bits (138), Expect = 7e-08
 Identities = 28/66 (42%), Positives = 37/66 (56%)
 Frame = +2

Query: 356 EFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKC 535
           EF   MNG+ K A+  +       S   + F+ P   + PE +DWR HG V   KDQG+C
Sbjct: 157 EFAAAMNGY-KAARGVE----ASASASASAFLGPNGTEPPEALDWRDHGYVTPVKDQGRC 211

Query: 536 GSCWSF 553
           GSCW+F
Sbjct: 212 GSCWAF 217


>UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine
           protease; n=11; Callosobruchus maculatus|Rep: Putative
           gut cathepsin L-like cysteine protease - Callosobruchus
           maculatus (Southern cowpea weevil) (Pulse bruchid)
          Length = 326

 Score = 59.7 bits (138), Expect = 7e-08
 Identities = 32/94 (34%), Positives = 48/94 (51%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           I +HN+KYE G  S+   + ++ DM H EF+  +      A       +   +V    F 
Sbjct: 54  IQEHNKKYERGEESFAKKVTQFADMTHEEFLDLLKLQGVPA-------LPSNAVHFDNF- 105

Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
              +++  + VDWR+ GAV   KDQ  CGSCW+F
Sbjct: 106 EDIDMEEKDAVDWREEGAVTPVKDQANCGSCWAF 139



 Score = 41.9 bits (94), Expect = 0.015
 Identities = 21/44 (47%), Positives = 32/44 (72%)
 Frame = +1

Query: 562 LGALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLIG 693
           +GA+EGQ F+++G LVS L ++  +DC+    GNNG + GGL+G
Sbjct: 142 VGAIEGQFFKKNGTLVS-LSAQELVDCATEDYGNNGCK-GGLMG 183


>UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine
           protease; n=1; Maconellicoccus hirsutus|Rep: Putative
           cathepsin L-like cysteine protease - Maconellicoccus
           hirsutus (hibiscus mealybug)
          Length = 339

 Score = 59.7 bits (138), Expect = 7e-08
 Identities = 34/100 (34%), Positives = 53/100 (53%), Gaps = 2/100 (2%)
 Frame = +2

Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVR 436
           ++K+ IA+HN+ +  GLV+++ G+N+Y DML  EF + M    + + + +N    G  + 
Sbjct: 55  DNKYRIAQHNKLFHKGLVTFEQGINEYSDMLQSEFNEKM---GQKSSNQRNTEANG--LP 109

Query: 437 GAKFISPANVKLPEQVDWRKHGAVPTFKDQGKC--GSCWS 550
             +F    NV  P+ VDWR  G V     Q  C  G  WS
Sbjct: 110 SIRFTPLHNVNPPDSVDWRTKGLVGPVGKQVNCSSGYAWS 149



 Score = 37.9 bits (84), Expect = 0.24
 Identities = 14/32 (43%), Positives = 21/32 (65%)
 Frame = +3

Query: 162 DLVKEEWSAFKLQHRLNYESEVEDNFRMKIYL 257
           +L  EEW  FK Q+   Y +++ED  RMKI++
Sbjct: 23  NLFHEEWQLFKTQYSKKYTTDIEDRLRMKIFI 54


>UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184,
           whole genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_184,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 331

 Score = 59.7 bits (138), Expect = 7e-08
 Identities = 32/95 (33%), Positives = 48/95 (50%)
 Frame = +2

Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
           ++ +HN K+E+G  ++ LGMN+Y D+   EF  +        +  KN+    G       
Sbjct: 63  VVMEHNSKFELGQETFTLGMNQYADLTPEEFQASFLTLKTKVQDRKNVKSYSG------- 115

Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
                +  P+ VDW K G   T K+QG CGSCW+F
Sbjct: 116 -----LSFPDTVDW-KDGL--TVKNQGSCGSCWAF 142


>UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6;
           Magnoliophyta|Rep: Cysteine protease Cp5 - Actinidia
           deliciosa (Kiwi)
          Length = 509

 Score = 59.3 bits (137), Expect = 9e-08
 Identities = 32/99 (32%), Positives = 52/99 (52%), Gaps = 2/99 (2%)
 Frame = +2

Query: 263 KHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEF--VKTMNGFNKTAKHNKNLYMKGGSVR 436
           ++++ K+ ++   G   + +G+NK+ DM + EF  V        T+K       + G   
Sbjct: 80  RYVMEKNGERGASG--GHLVGLNKFADMSNEEFREVYVSKVKKPTSKRMAIERRRQGKAA 137

Query: 437 GAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            AK ++  +   P  +DWRK+G V   KDQG CGSCW+F
Sbjct: 138 AAKAVAACDG--PTSLDWRKYGIVTGVKDQGDCGSCWAF 174


>UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_21,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 349

 Score = 59.3 bits (137), Expect = 9e-08
 Identities = 32/95 (33%), Positives = 49/95 (51%), Gaps = 1/95 (1%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKN-LYMKGGSVRGAKF 448
           I +H Q+ E GL +++LG+N + D+   EF      +  T +   N +Y + G       
Sbjct: 70  IQEHQQRVEAGLETFELGLNDFADLSVEEFEAKYLKYRSTPREQTNQVYRRTGK------ 123

Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
                 ++P +VD RK G V   K+QG CGSCW+F
Sbjct: 124 ------QVPIEVDLRKDGVVSEVKNQGSCGSCWAF 152


>UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep:
           MGC107932 protein - Xenopus tropicalis (Western clawed
           frog) (Silurana tropicalis)
          Length = 333

 Score = 58.8 bits (136), Expect = 1e-07
 Identities = 33/95 (34%), Positives = 53/95 (55%), Gaps = 1/95 (1%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           + KHNQ  + GL SY++ MN++ D+  +E     +  +      K+L      V+ A+  
Sbjct: 58  VQKHNQLADQGLKSYRMAMNQFADLTDNE----RSSKSCLLPREKSL----NPVK-AESY 108

Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGK-CGSCWSF 553
           S  ++ +P++VDWRK   V   K+QG  CGSCW+F
Sbjct: 109 SYTSITIPKEVDWRKSNCVTPVKNQGTFCGSCWAF 143


>UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core
           eudicotyledons|Rep: Cysteine proteinase -
           Mesembryanthemum crystallinum (Common ice plant)
          Length = 367

 Score = 58.8 bits (136), Expect = 1e-07
 Identities = 35/103 (33%), Positives = 56/103 (54%), Gaps = 4/103 (3%)
 Frame = +2

Query: 257 EHKHIIAKHNQKY--EMGLVS--YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKG 424
           +++  + K N KY  E+  +   YKL +N++GD+   EF +T    +K  +  +N    G
Sbjct: 61  QNRFHVFKENVKYINEVNKMDKPYKLRLNQFGDLTPSEFARTYAN-SKIIEGTRN--ESG 117

Query: 425 GSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           G +         NV++P  +DWR  GAV   K+QG+CG CW+F
Sbjct: 118 GFMY-------ENVEVPRSIDWRVKGAVTPVKNQGRCGGCWAF 153


>UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza
           sativa|Rep: Os09g0497500 protein - Oryza sativa subsp.
           japonica (Rice)
          Length = 349

 Score = 58.4 bits (135), Expect = 2e-07
 Identities = 33/82 (40%), Positives = 43/82 (52%), Gaps = 2/82 (2%)
 Frame = +2

Query: 314 YKLGMNKYGDMLHHEFVKTMNGFNK--TAKHNKNLYMKGGSVRGAKFISPANVKLPEQVD 487
           YKL  NK+ D+ + EF   M GF    T     N      ++ G      ++  LP+ VD
Sbjct: 72  YKLADNKFADLTNEEFRAKMLGFRPHVTIPQISNTCSADIAMPGES----SDDILPKSVD 127

Query: 488 WRKHGAVPTFKDQGKCGSCWSF 553
           WRK GAV   K+QG CGSCW+F
Sbjct: 128 WRKKGAVVEVKNQGDCGSCWAF 149


>UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep:
           Actinidin Act3a - Actinidia eriantha
          Length = 380

 Score = 58.4 bits (135), Expect = 2e-07
 Identities = 39/112 (34%), Positives = 53/112 (47%), Gaps = 2/112 (1%)
 Frame = +2

Query: 224 GRRQFPHEDIPEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHN 403
           G R+   E   E+   I +HN        SY +G+N++ D+   E+  T  GF  + K  
Sbjct: 57  GEREMRIEIFKENLRFIDEHNADPNR---SYTVGLNQFADLTDEEYRSTYLGFKSSLKSK 113

Query: 404 -KNLYM-KGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
             N YM + G V            LP+ VDWR  GAV   K+QG C SCW+F
Sbjct: 114 VSNRYMPQVGEV------------LPDYVDWRTTGAVVDVKNQGLCSSCWAF 153


>UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 280

 Score = 58.0 bits (134), Expect = 2e-07
 Identities = 33/96 (34%), Positives = 51/96 (53%), Gaps = 4/96 (4%)
 Frame = +2

Query: 278 KHNQKYEMGLVSYKLGMNKYGDMLHHEFVK-TMNG--FNKTAKHNKNLYMKGGSVRGAKF 448
           +HNQ+      SY++GMN++ D+   EF   ++N   FN  ++  +N+  +         
Sbjct: 3   QHNQEKNN---SYQIGMNQFSDLTIEEFQSISLNQQLFNSESRKLENIKNENQQADFYLQ 59

Query: 449 ISPANVK-LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           +   N   LP+Q DWR  G V   K+QG CGSCW+F
Sbjct: 60  LLKTNASSLPQQFDWRNLGKVTQVKNQGNCGSCWAF 95


>UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2;
           Taeniidae|Rep: Cathepsin L-like cysteine proteinase -
           Taenia solium (Pork tapeworm)
          Length = 339

 Score = 58.0 bits (134), Expect = 2e-07
 Identities = 32/94 (34%), Positives = 48/94 (51%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           I   N+++  GL SY  G+N++ D+   EF +   G    ++      + G   R  K +
Sbjct: 65  IKGQNRRFNAGLESYSTGLNQFADLESSEFSERFLGTRPESR------VAGRRGRIWKAL 118

Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           + A   LP+ VDWR    V   K+QG CGSCW+F
Sbjct: 119 ASA-AGLPDTVDWRDKNLVTEVKNQGNCGSCWAF 151


>UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes
           vastus|Rep: Cathepsin L - Aphrocallistes vastus
          Length = 329

 Score = 58.0 bits (134), Expect = 2e-07
 Identities = 30/81 (37%), Positives = 45/81 (55%)
 Frame = +2

Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
           SYKL  N++ D+ + E+ +   G++  A+ ++    + G V   K     +  LP  VDW
Sbjct: 68  SYKLAANQFADLTNLEYRQIYLGYDNEARLSRK---REGKVFQRKM---KDEDLPTTVDW 121

Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
           R  G V   K+QG+CGSCWSF
Sbjct: 122 RSKGVVTPVKNQGQCGSCWSF 142



 Score = 33.5 bits (73), Expect = 5.2
 Identities = 20/42 (47%), Positives = 29/42 (69%)
 Frame = +1

Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLI 690
           G+LEGQ+  +SG LVS    +  +DCS ++ GN+G Q GGL+
Sbjct: 146 GSLEGQYAIKSGKLVS-FSEQELVDCSTSL-GNHGCQ-GGLM 184


>UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46;
           Eukaryota|Rep: Cysteine proteinase 1 precursor - Zea
           mays (Maize)
          Length = 371

 Score = 58.0 bits (134), Expect = 2e-07
 Identities = 32/77 (41%), Positives = 42/77 (54%)
 Frame = +2

Query: 323 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHG 502
           G+ K+ D+   EF +T  G  K+ +    L   G S   A  + P +  LP+  DWR HG
Sbjct: 92  GVTKFSDLTPAEFRRTYLGLRKSRR--ALLRELGESAHEAPVL-PTD-GLPDDFDWRDHG 147

Query: 503 AVPTFKDQGKCGSCWSF 553
           AV   K+QG CGSCWSF
Sbjct: 148 AVGPVKNQGSCGSCWSF 164


>UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 336

 Score = 57.6 bits (133), Expect = 3e-07
 Identities = 33/108 (30%), Positives = 50/108 (46%)
 Frame = +2

Query: 230 RQFPHEDIPEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKN 409
           +QF  +   E    I  HN   E    +YKL  N++ DM   EF   +     +    +N
Sbjct: 49  QQFRQQIFFETHERIQNHNSNPE---ATYKLAHNQFSDMPQEEFASRVL-MKSSQLIPRN 104

Query: 410 LYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
                 +    +  +  +V+LP   DWR +G +   KDQG+CGSCW+F
Sbjct: 105 AVQAQNNNSTTQQHTAQDVQLPASFDWRDYGILSDVKDQGQCGSCWAF 152


>UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n=1;
           Xenopus tropicalis|Rep: UPI00006A2275 UniRef100 entry -
           Xenopus tropicalis
          Length = 272

 Score = 57.6 bits (133), Expect = 3e-07
 Identities = 41/131 (31%), Positives = 57/131 (43%), Gaps = 4/131 (3%)
 Frame = +2

Query: 206 SQLRKRGRRQFPHEDIPEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFN 385
           SQ  +R RR    E +      I+ HN +Y +GL +Y++GMN  GDM   E   TM G+ 
Sbjct: 3   SQEEERARRTIWEETLK----FISVHNLEYSLGLHTYEVGMNHLGDMTGEEVAATMTGYT 58

Query: 386 KTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGK-CGSCWSFQHD 562
            +     N+      +  A          P  +DWR    V   +DQG  C SC++F   
Sbjct: 59  GSGDSLANMSHVPKEILEA--------LAPPSIDWRTQNCVTPVRDQGSFCRSCYAFSAV 110

Query: 563 WEL---WKDST 586
             L   WK  T
Sbjct: 111 GALECQWKKKT 121


>UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum
           aestivum|Rep: Thiol protease - Triticum aestivum (Wheat)
          Length = 374

 Score = 57.6 bits (133), Expect = 3e-07
 Identities = 27/85 (31%), Positives = 41/85 (48%), Gaps = 1/85 (1%)
 Frame = +2

Query: 302 GLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQ 481
           G +SY LG+N++ D+ H EF+ T             +  + G V       PA   +P  
Sbjct: 88  GRLSYTLGVNQFADLTHEEFLATHTSRRVVPSEEMVITTRAGVVVEGANCQPAPNAVPRS 147

Query: 482 VDWRKHGAVPTFKDQGK-CGSCWSF 553
           ++W     V   K+QGK CG+CW+F
Sbjct: 148 INWVNQSKVTPVKNQGKVCGACWAF 172


>UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2;
           Dictyostelium discoideum|Rep: Cysteine proteinase 2
           precursor - Dictyostelium discoideum (Slime mold)
          Length = 376

 Score = 57.6 bits (133), Expect = 3e-07
 Identities = 31/78 (39%), Positives = 42/78 (53%)
 Frame = +2

Query: 320 LGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKH 499
           LG+N + D+ + E+ KT  G    A H+ N Y  G  V   + +       P+ +DWR  
Sbjct: 79  LGLNNFADITNEEYRKTYLGTRVNA-HSYNGY-DGREVLNVEDLQTN----PKSIDWRTK 132

Query: 500 GAVPTFKDQGKCGSCWSF 553
            AV   KDQG+CGSCWSF
Sbjct: 133 NAVTPIKDQGQCGSCWSF 150


>UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 344

 Score = 57.2 bits (132), Expect = 4e-07
 Identities = 31/84 (36%), Positives = 41/84 (48%), Gaps = 3/84 (3%)
 Frame = +2

Query: 311 SYKLGMNKYGDMLHHEF-VKTMNGFNKTAKHNKNLYMKGGSVRGAK-FISPA-NVKLPEQ 481
           SY LG N   DM H EF +  +N     +K +K     G S   +  ++ P    K    
Sbjct: 79  SYTLGHNHLSDMTHEEFSLYQLNPARTASKSSKGGNNSGNSSGSSNPYVDPPITTKNAPP 138

Query: 482 VDWRKHGAVPTFKDQGKCGSCWSF 553
           +DWR   A+   K QGKCGSCW+F
Sbjct: 139 MDWRNASAITPVKQQGKCGSCWTF 162


>UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza
           sativa|Rep: Cysteine proteinase-like - Oryza sativa
           subsp. japonica (Rice)
          Length = 360

 Score = 57.2 bits (132), Expect = 4e-07
 Identities = 26/81 (32%), Positives = 42/81 (51%)
 Frame = +2

Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
           +Y LG+N++ D+   EF +T  G++       + +       G    +  +  +P+ VDW
Sbjct: 85  TYTLGLNQFSDLTDDEFAQTHLGYSWAPPPPSHRHGHRAE-NGTAAAAADDTDVPDSVDW 143

Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
           R  GAV   K+Q  CGSCW+F
Sbjct: 144 RARGAVTEVKNQRSCGSCWAF 164


>UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza
           sativa|Rep: OSJNBb0085F13.15 protein - Oryza sativa
           subsp. japonica (Rice)
          Length = 383

 Score = 57.2 bits (132), Expect = 4e-07
 Identities = 32/95 (33%), Positives = 45/95 (47%), Gaps = 11/95 (11%)
 Frame = +2

Query: 302 GLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLY-----------MKGGSVRGAKF 448
           G +++KLG   + D+ H EF+ T  G  +     + +               G V GA  
Sbjct: 94  GSLTFKLGETPFTDLTHEEFLATYTGDVRLPPERRGMQDDSDEEDAVITTSAGYVAGAG- 152

Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
                V +PE VDWRK GAV   K QG+C +CW+F
Sbjct: 153 AGRRTVAVPESVDWRKEGAVTPAKHQGQCAACWAF 187


>UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1;
           Dictyostelium discoideum AX4|Rep: Counting factor
           associated protein - Dictyostelium discoideum AX4
          Length = 531

 Score = 57.2 bits (132), Expect = 4e-07
 Identities = 39/99 (39%), Positives = 47/99 (47%), Gaps = 2/99 (2%)
 Frame = +2

Query: 263 KHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGA 442
           + IIA HN K      SYKLGMN Y D+ + EF   +    K A+          SV GA
Sbjct: 253 RKIIATHNAKES----SYKLGMNHYADLSNKEFNTLVKP--KVARP---------SVTGA 297

Query: 443 KFISPANV--KLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
             +        +P  VDWR    V   KDQG CGSCW+F
Sbjct: 298 DSVHDDESLRSIPSTVDWRNQNCVTPVKDQGICGSCWTF 336


>UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium
           (Vinckeia)|Rep: Berghepain-2 - Plasmodium yoelii yoelii
          Length = 472

 Score = 56.8 bits (131), Expect = 5e-07
 Identities = 33/97 (34%), Positives = 47/97 (48%), Gaps = 3/97 (3%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKG---GSVRGA 442
           I KHN++  +    Y  G+N + DM H EF   M   N   K N  + ++     ++   
Sbjct: 187 IEKHNKENHL----YTKGINAFSDMRHEEF--KMKYLNNKLKENHQIDLRHLIPYTIAIN 240

Query: 443 KFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           K+ SP +       DWR H A+   KDQ KC SCW+F
Sbjct: 241 KYKSPTDQINYTSFDWRDHNAIIDIKDQQKCASCWAF 277


>UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gambiae
           str. PEST|Rep: ENSANGP00000018713 - Anopheles gambiae
           str. PEST
          Length = 559

 Score = 56.8 bits (131), Expect = 5e-07
 Identities = 32/88 (36%), Positives = 46/88 (52%)
 Frame = +2

Query: 290 KYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK 469
           K+E G   Y  G+ K+ DM   E+ +   G     KH++  ++ G  V   + ++     
Sbjct: 285 KFERGTAKY--GVTKFADMTVAEY-RAHTGL-VVPKHDRANHV-GNRVASEEDVAGVG-D 338

Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           LP   DWR HGAV   K+QG CGSCW+F
Sbjct: 339 LPRSFDWRDHGAVTEVKNQGSCGSCWAF 366


>UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep:
           Vivapain-4 - Plasmodium vivax
          Length = 484

 Score = 56.8 bits (131), Expect = 5e-07
 Identities = 36/97 (37%), Positives = 49/97 (50%), Gaps = 3/97 (3%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNG--FNKTAKHNKNLYMKGGSVRGAK 445
           I  HN K     + YK G N+Y D+   EF KTM    F+   K   + Y+        K
Sbjct: 197 INSHNSKAN---ILYKKGTNQYSDISFEEFRKTMLTLRFDLKKKLANSPYVSNYDDVLKK 253

Query: 446 FISPANVKLP-EQVDWRKHGAVPTFKDQGKCGSCWSF 553
           +  PA+  +  E+ DWR+H AV   K+Q  CGSCW+F
Sbjct: 254 Y-KPADAVVDNEKYDWREHNAVSEIKNQNLCGSCWAF 289


>UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteinase
           A; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like
           tick cysteine proteinase A - Haemaphysalis longicornis
           (Bush tick)
          Length = 312

 Score = 56.8 bits (131), Expect = 5e-07
 Identities = 40/126 (31%), Positives = 59/126 (46%), Gaps = 4/126 (3%)
 Frame = +2

Query: 188 LQAAAPSQLRKRGRRQFPHEDIPEHKHIIAKHNQKYEMGLVSYKLG-MNKYGDMLHHEFV 364
           LQ AA S ++   RR    +   E+  ++AKHN KY  GL   ++G     GD     +V
Sbjct: 4   LQIAAQSGVQFPRRRTIEVKIFTENTLLVAKHNAKYAKGLGVLQVGPWTSLGDFAA-AWV 62

Query: 365 KTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK---LPEQVDWRKHGAVPTFKDQGKC 535
           +    ++  A   +N         G      AN+    LP  VDW + G+    K+QG+C
Sbjct: 63  RQNGQWDTAASRTRN--------SGPHLFHQANLNDSSLPTTVDWAQEGSRAPVKNQGQC 114

Query: 536 GSCWSF 553
           GSCW+F
Sbjct: 115 GSCWAF 120


>UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase
           precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine
           proteinase precursor - Heterodera glycines (Soybean cyst
           nematode worm)
          Length = 353

 Score = 56.8 bits (131), Expect = 5e-07
 Identities = 36/98 (36%), Positives = 49/98 (50%), Gaps = 1/98 (1%)
 Frame = +2

Query: 263 KHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGA 442
           K  I  HN  +E G VS+K+  N    ++H     T   +N+     + L M+    R  
Sbjct: 76  KKFIDAHNLAFEKGEVSFKVAPNH---LMHF----TPAQYNRI----RGLQMRSNRQRHN 124

Query: 443 KFISPANVK-LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
                 N   LPE++DWR+ GAV   KDQG CGSCW+F
Sbjct: 125 MATLAGNSSTLPEKLDWREKGAVTEVKDQGDCGSCWAF 162


>UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 392

 Score = 56.8 bits (131), Expect = 5e-07
 Identities = 37/111 (33%), Positives = 55/111 (49%), Gaps = 7/111 (6%)
 Frame = +2

Query: 242 HEDIPEH---KHIIAKHNQKYEMGL----VSYKLGMNKYGDMLHHEFVKTMNGFNKTAKH 400
           +ED  EH   KHI  +HN +Y   +    + YKL  N + D+   EF       +  +K 
Sbjct: 99  YEDDSEHRRRKHIF-RHNVRYIRSMNRRSLPYKLEPNHFADLTDDEFKSYKGALDDESKD 157

Query: 401 NKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
             N +     +   +  S    ++P+Q+DWR +GAV   K QG CGSCW+F
Sbjct: 158 VMNDH--DDVIDDDR--SKRMFEVPDQLDWRNYGAVNPAKGQGTCGSCWAF 204


>UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F
            like protease; n=1; Nasonia vitripennis|Rep: PREDICTED:
            similar to cathepsin F like protease - Nasonia
            vitripennis
          Length = 1036

 Score = 56.4 bits (130), Expect = 6e-07
 Identities = 30/89 (33%), Positives = 43/89 (48%)
 Frame = +2

Query: 287  QKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANV 466
            Q+ EMG   Y  G+ ++ D+   EF     G   T K   ++ M   ++         ++
Sbjct: 766  QRNEMGTGRY--GVTQFTDLTKAEFKARHLGLKPTLKSENDIPMPMATI--------PDI 815

Query: 467  KLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            +LP   DWR H  V   KDQG CGSCW+F
Sbjct: 816  ELPSDYDWRHHNVVTPVKDQGSCGSCWAF 844


>UniRef50_Q17IZ6 Cluster: Procathepsin L3, putative; n=2;
           Culicidae|Rep: Procathepsin L3, putative - Aedes aegypti
           (Yellowfever mosquito)
          Length = 313

 Score = 56.4 bits (130), Expect = 6e-07
 Identities = 27/100 (27%), Positives = 48/100 (48%), Gaps = 6/100 (6%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK------NLYMKGGSV 433
           I +HN  YE G  ++++G+N+  DM    ++K M        H K      +  ++  + 
Sbjct: 62  IEEHNANYEQGKSTFQMGVNELADMDKSSYLKKMVRMTDAIDHRKLDVDFNDEMLQATNA 121

Query: 434 RGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            G +F+      +P+ +DWR  G      +Q  CGSC++F
Sbjct: 122 FGEEFVQATQNSMPDSLDWRDKGFTTMAVNQKTCGSCYAF 161


>UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin K
           precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2);
           n=2; Tribolium castaneum|Rep: PREDICTED: similar to
           Cathepsin K precursor (Cathepsin O) (Cathepsin X)
           (Cathepsin O2) - Tribolium castaneum
          Length = 332

 Score = 56.0 bits (129), Expect = 9e-07
 Identities = 26/95 (27%), Positives = 48/95 (50%)
 Frame = +2

Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
           I+ +HN+++  G  +Y++G+NK+ D    E +  + G     +  + L     +      
Sbjct: 57  IVEEHNERFRNGSETYEMGVNKFSDFTDEE-LSNLTGLQVPLEFEQPL-----NETEDPL 110

Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           +      +   +DWR+ G V   K+QG+CGSCW+F
Sbjct: 111 LPSLGRGISASLDWRQRGGVTPVKNQGQCGSCWAF 145


>UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease
           containing protein; n=2; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 332

 Score = 56.0 bits (129), Expect = 9e-07
 Identities = 34/91 (37%), Positives = 48/91 (52%), Gaps = 1/91 (1%)
 Frame = +2

Query: 284 NQKYEMGLVSYKLGMNKYGDMLHHEF-VKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPA 460
           N   + G +S   G+NK+  +   EF  K +N   + A       MK  S+  ++     
Sbjct: 74  NMNSDNGFIS---GINKFSHLTKEEFKAKYLNRPQRPASE-----MKTNSILSSQ--QKT 123

Query: 461 NVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           + KLPE VDWRK GAV   +DQG CGSC++F
Sbjct: 124 DEKLPESVDWRKLGAVSPVRDQGNCGSCYAF 154


>UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine
           protease; n=1; Strongylocentrotus purpuratus|Rep:
           PREDICTED: similar to cysteine protease -
           Strongylocentrotus purpuratus
          Length = 494

 Score = 55.6 bits (128), Expect = 1e-06
 Identities = 33/88 (37%), Positives = 41/88 (46%)
 Frame = +2

Query: 290 KYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK 469
           ++E G   Y  G  K+ DM   EF K  +G  K     K   +  G V            
Sbjct: 195 QFEQGTAKY--GPTKFADMTEAEFRKLQSGPLKKTGIKKQAAIPQGPV------------ 240

Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            PE+ DWR HGAV   K+QG CGSCW+F
Sbjct: 241 -PEEYDWRTHGAVTPVKNQGMCGSCWAF 267


>UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome
           shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
           Chromosome 21 SCAF14577, whole genome shotgun sequence -
           Tetraodon nigroviridis (Green puffer)
          Length = 478

 Score = 55.6 bits (128), Expect = 1e-06
 Identities = 40/118 (33%), Positives = 55/118 (46%), Gaps = 5/118 (4%)
 Frame = +2

Query: 215 RKRGRRQFPHEDIPEHKHIIAKHNQKY-----EMGLVSYKLGMNKYGDMLHHEFVKTMNG 379
           +++ +RQ+  +   E +     HN +Y       GL SY LG+N   D    E   TM G
Sbjct: 125 KEKFQRQYEDDKEHELRQQAFIHNLRYVHSKNRAGL-SYTLGLNSLSDRTMSELA-TMRG 182

Query: 380 FNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
             +    N  L           F    +V++PE +DWR +GAV   KDQ  CGSCWSF
Sbjct: 183 RKQRKTTNAGLPFP--------FKLYQHVEVPESLDWRLYGAVTPVKDQAICGSCWSF 232


>UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1;
           Rhipicephalus appendiculatus|Rep: Midgut cysteine
           proteinase 2 - Rhipicephalus appendiculatus (Brown ear
           tick)
          Length = 564

 Score = 55.6 bits (128), Expect = 1e-06
 Identities = 26/49 (53%), Positives = 29/49 (59%), Gaps = 1/49 (2%)
 Frame = +2

Query: 410 LYMKGGSVRGAKFISPA-NVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           L  K GS R   F       KLP+Q+DWR +GAV   KDQ  CGSCWSF
Sbjct: 324 LQSKDGSSRAEPFPRHRFTAKLPDQIDWRPYGAVTPVKDQAVCGSCWSF 372



 Score = 33.1 bits (72), Expect = 6.9
 Identities = 18/40 (45%), Positives = 24/40 (60%)
 Frame = +1

Query: 562 LGALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPG 681
           +G LEG +FR++G LV  L  +  +DCS    GNNG   G
Sbjct: 375 VGELEGAYFRKTGRLVR-LSEQQLVDCSWN-NGNNGCDGG 412


>UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsin L
           - Suberites domuncula (Sponge)
          Length = 324

 Score = 55.6 bits (128), Expect = 1e-06
 Identities = 33/98 (33%), Positives = 47/98 (47%)
 Frame = +2

Query: 260 HKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRG 439
           +K  I  HN   +     Y L MN++GD+   EF +  NG+    + N            
Sbjct: 50  NKKFIDSHNSVSDK--FGYTLEMNEFGDLSGVEFKQIYNGYIMQERANDTKLFTA----- 102

Query: 440 AKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           + ++ PA       VDWR+ G V   K+QG+CGSCWSF
Sbjct: 103 SPYMEPA-----ASVDWRQKGVVSEVKNQGQCGSCWSF 135


>UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10;
           Liliopsida|Rep: Putative cysteine proteinase - Oryza
           sativa subsp. japonica (Rice)
          Length = 416

 Score = 55.2 bits (127), Expect = 1e-06
 Identities = 35/94 (37%), Positives = 50/94 (53%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           I + NQK + G+ SY LG+NK+ D+ + EF     G     K + + +    +    + +
Sbjct: 56  IHEFNQKSK-GM-SYVLGLNKFSDLTYEEFAAKYTG----VKVDASAFATATTSSPDEEL 109

Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            P  V  P   DWR +GAV   KDQG+CGSCW F
Sbjct: 110 -PVGVP-PATWDWRLNGAVTDVKDQGQCGSCWVF 141


>UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila
           melanogaster|Rep: CG5367-PA - Drosophila melanogaster
           (Fruit fly)
          Length = 338

 Score = 55.2 bits (127), Expect = 1e-06
 Identities = 32/100 (32%), Positives = 53/100 (53%), Gaps = 1/100 (1%)
 Frame = +2

Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVR 436
           E+  +I +HNQ Y+ G  S++L  N + DM    ++K   GF +  K N    ++  +  
Sbjct: 62  ENFKVIEEHNQNYKEGQTSFRLKPNIFADMSTDGYLK---GFLRLLKSN----IEDSADN 114

Query: 437 GAKFI-SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            A+ + SP    +PE +DWR  G +    +Q  CGSC++F
Sbjct: 115 MAEIVGSPLMANVPESLDWRSKGFITPPYNQLSCGSCYAF 154


>UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13;
           Plasmodium|Rep: Cysteine protease falcipain-3 -
           Plasmodium falciparum
          Length = 492

 Score = 55.2 bits (127), Expect = 1e-06
 Identities = 41/106 (38%), Positives = 49/106 (46%), Gaps = 7/106 (6%)
 Frame = +2

Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEF------VKTMNGFNKTAKHNKNLYM 418
           E+   I  HN+K       YK GMNK+GD+   EF      +KT   F KT     +   
Sbjct: 197 ENYRKIELHNKKTNS---LYKRGMNKFGDLSPEEFRSKYLNLKTHGPF-KTLSPPVSYEA 252

Query: 419 KGGSVRGAKFISPANVKLPE-QVDWRKHGAVPTFKDQGKCGSCWSF 553
               V   K   PA+ KL     DWR HG V   KDQ  CGSCW+F
Sbjct: 253 NYEDV--IKKYKPADAKLDRIAYDWRLHGGVTPVKDQALCGSCWAF 296


>UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh
           fly) (Boettcherisca peregrina). Cathepsin L; n=2;
           Dictyostelium discoideum|Rep: Similar to Sarcophaga
           peregrina (Flesh fly) (Boettcherisca peregrina).
           Cathepsin L - Dictyostelium discoideum (Slime mold)
          Length = 265

 Score = 55.2 bits (127), Expect = 1e-06
 Identities = 26/78 (33%), Positives = 38/78 (48%)
 Frame = +2

Query: 320 LGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKH 499
           + +N+Y D+   EF      F K     ++  +    ++   F    N  +P+  DWR H
Sbjct: 1   MDLNEYSDLTQKEFADKF--FEKLVPEPRSGPIN--DIKATPFKHNVNATIPKSFDWRDH 56

Query: 500 GAVPTFKDQGKCGSCWSF 553
           GAV   K+QG C SCWSF
Sbjct: 57  GAVGKVKNQGSCASCWSF 74


>UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheirus
           salmonis|Rep: Cysteine proteinase - Lepeophtheirus
           salmonis (salmon louse)
          Length = 372

 Score = 55.2 bits (127), Expect = 1e-06
 Identities = 27/82 (32%), Positives = 41/82 (50%), Gaps = 1/82 (1%)
 Frame = +2

Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK-LPEQVD 487
           ++ +G+N++ D+   EF     G++  +          G V         N+K LPE VD
Sbjct: 68  TWDMGINEFSDLTDEEFESKYMGYSPMSS-------SAGLVTRTAAPKQGNIKDLPESVD 120

Query: 488 WRKHGAVPTFKDQGKCGSCWSF 553
           WR+ G +   K+QG CGSCW F
Sbjct: 121 WREKGVITDVKNQGSCGSCWVF 142


>UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2;
           Endopterygota|Rep: Cathepsin L-like proteinase - Bombyx
           mori (Silk moth)
          Length = 402

 Score = 55.2 bits (127), Expect = 1e-06
 Identities = 30/96 (31%), Positives = 50/96 (52%), Gaps = 2/96 (2%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKN--LYMKGGSVRGAK 445
           +A+HN++Y  G+ SY L +N +GDM   E+      F K  K  K   L+          
Sbjct: 131 VARHNREYLAGIQSYSLHLNHFGDMHVTEY------FGKVLKLIKAFPLFDPAEDHHKTA 184

Query: 446 FISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           +      K+P+++DWR  G  P  ++Q +CG+C++F
Sbjct: 185 YRHNRRCKVPKRIDWRDQGFKPRREEQWQCGACYAF 220


>UniRef50_Q5NE16 Cluster: Putative cathepsin L-like protein 3; n=3;
           Homo sapiens|Rep: Putative cathepsin L-like protein 3 -
           Homo sapiens (Human)
          Length = 218

 Score = 55.2 bits (127), Expect = 1e-06
 Identities = 39/120 (32%), Positives = 55/120 (45%)
 Frame = +2

Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
           +I +HNQ+Y  G  S+ + MN +G+M   EF + +NGF +  KH K          G   
Sbjct: 3   MIEQHNQEYREGKHSFTMAMNAFGEMTSEEFRQVVNGF-QNQKHRK----------GKVL 51

Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSFQHDWELWKDSTSVSPATWCRFFGAK 628
             P    + + VDWR+ G V   KDQ   G   S + D    +   S+S  TW    G K
Sbjct: 52  QEPLLHDIRKSVDWREKGYVTPVKDQCNWG---SVRTDVRKTEKLVSLSVQTWWTALGFK 108


>UniRef50_UPI000155637A Cluster: PREDICTED: similar to
           ENSANGP00000013730, partial; n=1; Ornithorhynchus
           anatinus|Rep: PREDICTED: similar to ENSANGP00000013730,
           partial - Ornithorhynchus anatinus
          Length = 229

 Score = 54.8 bits (126), Expect = 2e-06
 Identities = 22/32 (68%), Positives = 24/32 (75%)
 Frame = +2

Query: 458 ANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           ANV LPE +DWR +GAV   KDQ  CGSCWSF
Sbjct: 51  ANVALPESLDWRLYGAVTPVKDQAVCGSCWSF 82


>UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus
           tropicalis|Rep: LOC594890 protein - Xenopus tropicalis
           (Western clawed frog) (Silurana tropicalis)
          Length = 355

 Score = 54.8 bits (126), Expect = 2e-06
 Identities = 33/95 (34%), Positives = 50/95 (52%), Gaps = 1/95 (1%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFV-KTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
           I  HN +Y MGL +Y++GMN  GDM+  E   K MN   +   +  ++ ++         
Sbjct: 83  IMLHNLEYSMGLHTYEVGMNHLGDMVAEEMTDKQMNFIPQVIANITDVPVE--------- 133

Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           IS ++   PE +DWR    V + KDQG C + W+F
Sbjct: 134 ISKSSP--PESIDWRNKNCVTSVKDQGSCIASWAF 166


>UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:
           Silicatein beta - Suberites domuncula (Sponge)
          Length = 383

 Score = 54.8 bits (126), Expect = 2e-06
 Identities = 36/109 (33%), Positives = 53/109 (48%), Gaps = 11/109 (10%)
 Frame = +2

Query: 260 HKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVK---------TMNGFNKTAKHNKNL 412
           +K  I +HNQ  +   + Y L MNK+GD+   EF++           N  +   KH  + 
Sbjct: 83  NKEYIDQHNQNAQR--LGYTLKMNKFGDLTTKEFIEGYHCVQDYQPTNASHLNKKHKTHA 140

Query: 413 YMKGGS-VRGAKFISPANV-KLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           ++  G  VRG        V  +PE +DWR  G V   KDQ +CGS ++F
Sbjct: 141 FVDYGDFVRGGTGEGVRGVGNMPETMDWRTSGVVTKVKDQLRCGSSYAF 189


>UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted,
           possible transmembrane domain near N-terminus; n=4;
           Cryptosporidium|Rep: Cryptopain-cysteine proteinase
           secreted, possible transmembrane domain near N-terminus
           - Cryptosporidium parvum Iowa II
          Length = 401

 Score = 54.8 bits (126), Expect = 2e-06
 Identities = 26/81 (32%), Positives = 44/81 (54%)
 Frame = +2

Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
           SY L MN++GD+   EF+    G+ K +K ++ ++ K   V  ++  S      P  ++W
Sbjct: 126 SYVLEMNEFGDLSKEEFMARFTGYIKDSKDDERVF-KSSRVSASE--SEEEFVPPNSINW 182

Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
            + G V   ++Q  CGSCW+F
Sbjct: 183 VEAGCVNPIRNQKNCGSCWAF 203


>UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1];
           n=11; Eutheria|Rep: Testin-2 precursor [Contains:
           Testin-1] - Mus musculus (Mouse)
          Length = 333

 Score = 54.8 bits (126), Expect = 2e-06
 Identities = 32/95 (33%), Positives = 49/95 (51%)
 Frame = +2

Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
           +I  HN +Y  G   + + MN +GD+ + EFVK M GF +  +  K +++         F
Sbjct: 58  MIELHNWEYLEGKHDFTMTMNAFGDLTNTEFVKMMTGFRR--QKIKRMHV---------F 106

Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
                + +P+ VDWR  G V   K+QG C S W+F
Sbjct: 107 QDHQFLYVPKYVDWRMLGYVTPVKNQGYCASSWAF 141


>UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila
           melanogaster|Rep: LD36817p - Drosophila melanogaster
           (Fruit fly)
          Length = 352

 Score = 54.4 bits (125), Expect = 3e-06
 Identities = 32/97 (32%), Positives = 51/97 (52%), Gaps = 2/97 (2%)
 Frame = +2

Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
           +I   N+  + G+  ++LG+N   DM   E + T+ G +K ++  +      G +     
Sbjct: 67  LITLSNKNADNGVSGFRLGVNTLADMTRKE-IATLLG-SKISEFGERY--TNGHINFVTA 122

Query: 449 ISPANVKLPEQVDWRKHGAV--PTFKDQGKCGSCWSF 553
            +PA+  LPE  DWR+ G V  P F+  G CG+CWSF
Sbjct: 123 RNPASANLPEMFDWREKGGVTPPGFQGVG-CGACWSF 158


>UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16;
           Bromeliaceae|Rep: Fruit bromelain precursor - Ananas
           comosus (Pineapple)
          Length = 351

 Score = 54.4 bits (125), Expect = 3e-06
 Identities = 30/81 (37%), Positives = 41/81 (50%)
 Frame = +2

Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
           SY LG+N++ DM   EFV    G +      +   +    V     IS     +P+ +DW
Sbjct: 78  SYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVSFDDVN----ISA----VPQSIDW 129

Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
           R +GAV   K+Q  CGSCWSF
Sbjct: 130 RDYGAVNEVKNQNPCGSCWSF 150


>UniRef50_Q24FA8 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 335

 Score = 54.0 bits (124), Expect = 3e-06
 Identities = 28/103 (27%), Positives = 55/103 (53%), Gaps = 4/103 (3%)
 Frame = +2

Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTM----NGFNKTAKHNKNLYMKG 424
           E+   + +HN+       +Y +G+N++ D+   E+ + +    +  N+ AK NKN  ++ 
Sbjct: 58  ENYQSVQEHNKNSNH---TYSVGINQFSDITLQEYQQRILMKNSPLNELAK-NKNRLLQS 113

Query: 425 GSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
             ++ +      + ++   +DWRK G V   K+QG+CG CW+F
Sbjct: 114 SPIQNSN-----DTQIASSIDWRKKGGVSPVKNQGECGGCWTF 151


>UniRef50_Q22AB1 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 344

 Score = 54.0 bits (124), Expect = 3e-06
 Identities = 36/122 (29%), Positives = 55/122 (45%), Gaps = 6/122 (4%)
 Frame = +2

Query: 206 SQLRKRGRRQFPHEDIPEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFN 385
           S+  K    +  H     +     +H  K++M   + K G  K+ DM   EF   M  F+
Sbjct: 38  SKFNKYYHNEHEHHSSFHNYKTSREHIVKHQMENPNAKFGHTKFSDMSPEEFENKMLNFD 97

Query: 386 ----KTAKHNKNLYMKGGSVRG--AKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCW 547
               K AK ++ + +K   ++G   +  +  N  LPE  DWR  G +   K Q  CGSCW
Sbjct: 98  FSLFKKAK-SQGIKLKAEPMKGYLRQGENVDNSDLPESFDWRDKGIITPAKFQNTCGSCW 156

Query: 548 SF 553
           +F
Sbjct: 157 TF 158


>UniRef50_O16454 Cluster: Temporarily assigned gene name protein
           196; n=4; Bilateria|Rep: Temporarily assigned gene name
           protein 196 - Caenorhabditis elegans
          Length = 477

 Score = 54.0 bits (124), Expect = 3e-06
 Identities = 32/94 (34%), Positives = 45/94 (47%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           + +  QK E G   Y  G  K+ DM   EF K M  +    +  + +Y    +      +
Sbjct: 204 VIRELQKNEQGTAVY--GFTKFSDMTTMEFKKIMLPY----QWEQPVYPMEQANFEKHDV 257

Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           +     LPE  DWR+ GAV   K+QG CGSCW+F
Sbjct: 258 TINEEDLPESFDWREKGAVTQVKNQGNCGSCWAF 291


>UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3;
           Bilateria|Rep: Cathepsin L-like cysteine protease -
           Neobenedenia melleni
          Length = 335

 Score = 54.0 bits (124), Expect = 3e-06
 Identities = 39/141 (27%), Positives = 55/141 (39%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           + KHN+ Y  G  SY L MN   D+   EF           K +     + G   G    
Sbjct: 58  VRKHNELYAQGKKSYTLAMNHMADLSSEEF----KALYLVPKFDATKVPRKGKAAGEH-- 111

Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSFQHDWELWKDSTSVSPATWCRFFGAKP 631
                  P ++DW + G V   K+Q +CGSCW+F     +     +V  AT      ++ 
Sbjct: 112 RQIKNDPPSEIDWVRKGHVTAVKNQAQCGSCWAFSSTGSI---EGAVKRATGKLISFSEQ 168

Query: 632 SSTAREQLRGTTGCNRGGSLD 694
                    G  GCN GG +D
Sbjct: 169 QLVDCSTAFGNHGCN-GGIMD 188


>UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;
           n=17; Magnoliophyta|Rep: Thiol protease aleurain-like
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 358

 Score = 54.0 bits (124), Expect = 3e-06
 Identities = 37/99 (37%), Positives = 52/99 (52%)
 Frame = +2

Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVR 436
           E+  +I   N+K   GL SYKL +N++ D+   EF +   G    A  N +  +KG    
Sbjct: 85  ENLDLIRSTNKK---GL-SYKLSLNQFADLTWQEFQRYKLG----AAQNCSATLKGSHK- 135

Query: 437 GAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
               I+ A V  P+  DWR+ G V   K+QG CGSCW+F
Sbjct: 136 ----ITEATV--PDTKDWREDGIVSPVKEQGHCGSCWTF 168


>UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis
           thaliana|Rep: F9P14.12 protein - Arabidopsis thaliana
           (Mouse-ear cress)
          Length = 343

 Score = 53.6 bits (123), Expect = 5e-06
 Identities = 30/80 (37%), Positives = 42/80 (52%)
 Frame = +2

Query: 314 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 493
           +KL  N++ DM + EF     G N ++     L+ K   V       PA   +P+ VDWR
Sbjct: 84  FKLTDNRFADMTNSEFKAHFLGLNTSSLR---LHKKQRPV-----CDPAG-NVPDAVDWR 134

Query: 494 KHGAVPTFKDQGKCGSCWSF 553
             GAV   ++QGKCG CW+F
Sbjct: 135 TQGAVTPIRNQGKCGGCWAF 154


>UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheirus
           salmonis|Rep: Putative cathepsin L - Lepeophtheirus
           salmonis (salmon louse)
          Length = 257

 Score = 53.6 bits (123), Expect = 5e-06
 Identities = 28/76 (36%), Positives = 38/76 (50%)
 Frame = +2

Query: 326 MNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGA 505
           MN+YGD+L  EF++   G  K +    N  +   S             +P  V+W K+GA
Sbjct: 1   MNQYGDLLQSEFLQGYTGLAKGSYSGDNTVILDNSA-----------PVPSYVNWTKNGA 49

Query: 506 VPTFKDQGKCGSCWSF 553
           V   KDQ  CGSCW+F
Sbjct: 50  VTAVKDQKDCGSCWAF 65


>UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960
           precursor; n=2; Arabidopsis thaliana|Rep: Probable
           cysteine proteinase At3g43960 precursor - Arabidopsis
           thaliana (Mouse-ear cress)
          Length = 376

 Score = 53.6 bits (123), Expect = 5e-06
 Identities = 32/82 (39%), Positives = 44/82 (53%), Gaps = 1/82 (1%)
 Frame = +2

Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
           SY+ G+NK+ D+   EF  +  G     K  K    K  S    ++       LP++VDW
Sbjct: 82  SYERGLNKFSDLTADEFQASYLG----GKMEK----KSLSDVAERYQYKEGDVLPDEVDW 133

Query: 491 RKHGAV-PTFKDQGKCGSCWSF 553
           R+ GAV P  K QG+CGSCW+F
Sbjct: 134 RERGAVVPRVKRQGECGSCWAF 155


>UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=15;
           Magnoliophyta|Rep: Cysteine proteinase RD19a precursor -
           Arabidopsis thaliana (Mouse-ear cress)
          Length = 368

 Score = 53.2 bits (122), Expect = 6e-06
 Identities = 30/77 (38%), Positives = 37/77 (48%)
 Frame = +2

Query: 323 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHG 502
           G+ ++ D+   EF K   G     K  K+          A  +   N  LPE  DWR HG
Sbjct: 95  GVTQFSDLTRSEFRKKHLGVRSGFKLPKD-------ANKAPILPTEN--LPEDFDWRDHG 145

Query: 503 AVPTFKDQGKCGSCWSF 553
           AV   K+QG CGSCWSF
Sbjct: 146 AVTPVKNQGSCGSCWSF 162


>UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core
           eudicotyledons|Rep: Chymopapain precursor - Carica
           papaya (Papaya)
          Length = 352

 Score = 53.2 bits (122), Expect = 6e-06
 Identities = 30/81 (37%), Positives = 41/81 (50%)
 Frame = +2

Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
           SY LG+N + D+ + EF K   GF   A+    L          K ++      P+ +DW
Sbjct: 88  SYWLGLNGFADLSNDEFKKKYVGF--VAEDFTGLEHFDNEDFTYKHVT----NYPQSIDW 141

Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
           R  GAV   K+QG CGSCW+F
Sbjct: 142 RAKGAVTPVKNQGACGSCWAF 162


>UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3;
           Dictyostelium discoideum|Rep: Cysteine proteinase 1
           precursor - Dictyostelium discoideum (Slime mold)
          Length = 343

 Score = 53.2 bits (122), Expect = 6e-06
 Identities = 38/122 (31%), Positives = 59/122 (48%), Gaps = 7/122 (5%)
 Frame = +2

Query: 209 QLRKRGRRQFPHEDIPEHKHIIAKHNQKYE-MGLVSY------KLGMNKYGDMLHHEFVK 367
           + + +  +++ HE+  E   I   +  K E + L++       K G+NK+ D+   EF K
Sbjct: 31  EFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEF-K 89

Query: 368 TMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCW 547
                NK A    +L +        +FI+     +P   DWR  GAV   K+QG+CGSCW
Sbjct: 90  NYYLNNKEAIFTDDLPV--ADYLDDEFIN----SIPTAFDWRTRGAVTPVKNQGQCGSCW 143

Query: 548 SF 553
           SF
Sbjct: 144 SF 145


>UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus
           tauri|Rep: Cysteine protease-1 - Ostreococcus tauri
          Length = 430

 Score = 52.8 bits (121), Expect = 8e-06
 Identities = 32/104 (30%), Positives = 53/104 (50%), Gaps = 5/104 (4%)
 Frame = +2

Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGS-- 430
           E+   + +HN  Y +G VS+ +G+N        E+ + + G+    + + +  M   +  
Sbjct: 126 ENAAYVVEHNALYAIGEVSHWVGLNSLAATTREEY-RALLGYKPELRSSGDAEMLEATST 184

Query: 431 --VRGAKFI-SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
             V   K     A+V  PE +DW + GAV   K+QG+CGSCW+F
Sbjct: 185 DKVEQYKASWEYASVDPPEAIDWVELGAVTPPKNQGQCGSCWAF 228


>UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_23,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 321

 Score = 52.8 bits (121), Expect = 8e-06
 Identities = 32/81 (39%), Positives = 42/81 (51%)
 Frame = +2

Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
           SYK  +NK+GD+   EF+         A+  KN+          K   P  V+  E+VDW
Sbjct: 78  SYKQKINKFGDLTDQEFLTIYLNLQMPARV-KNIQ---------KNEEPFLVQ--EEVDW 125

Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
            + G VP  KDQG CGSCW+F
Sbjct: 126 VQKGKVPAIKDQGDCGSCWAF 146


>UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis
           zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa
           zea single nucleocapsid nuclear polyhedrosis virus)
          Length = 367

 Score = 52.8 bits (121), Expect = 8e-06
 Identities = 28/81 (34%), Positives = 41/81 (50%)
 Frame = +2

Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
           S + G+NK+ D    E + +  GF      +  L  +   V+GA      +++LP+  DW
Sbjct: 109 SAQFGVNKFSDKTPDEVLHSNTGFFLNLSQHYTL-CENRIVKGAP-----DIRLPDYYDW 162

Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
           R    V   KDQG CGSCW+F
Sbjct: 163 RDTNKVTPIKDQGVCGSCWAF 183


>UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin S
           preproprotein; n=2; Tribolium castaneum|Rep: PREDICTED:
           similar to cathepsin S preproprotein - Tribolium
           castaneum
          Length = 525

 Score = 52.4 bits (120), Expect = 1e-05
 Identities = 36/117 (30%), Positives = 51/117 (43%)
 Frame = +2

Query: 203 PSQLRKRGRRQFPHEDIPEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGF 382
           P+   +  RR    +   E KH    HN++Y  GL +Y L +N   D    E    M+  
Sbjct: 237 PNLEEENFRRAIFEKTFQEIKH----HNERYRKGLETYYLRINDLSDYTDEE----MSCC 288

Query: 383 NKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           ++ A       +   S    +        LP+ VDWR  G V   K QGKCG+CW+F
Sbjct: 289 SEKAPKPSITILPNVSTSSRQ-------NLPKMVDWRLRGVVTPVKHQGKCGTCWAF 338



 Score = 48.0 bits (109), Expect = 2e-04
 Identities = 18/28 (64%), Positives = 20/28 (71%)
 Frame = +2

Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           LP+ VDWR  G V   K QGKCGSCW+F
Sbjct: 35  LPDMVDWRLQGVVTPVKRQGKCGSCWAF 62


>UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-PA
           - Drosophila melanogaster (Fruit fly)
          Length = 549

 Score = 52.4 bits (120), Expect = 1e-05
 Identities = 35/109 (32%), Positives = 50/109 (45%), Gaps = 5/109 (4%)
 Frame = +2

Query: 242 HEDIP-EHKHIIAKHNQKY----EMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 406
           H D   EH+  I + N +Y        ++Y L +N   D    E +K   G+  +  +N 
Sbjct: 257 HSDTEHEHRKNIFRQNLRYIHSKNRAKLTYTLAVNHLADKTEEE-LKARRGYKSSGIYNT 315

Query: 407 NLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
               K       K+      ++P+Q DWR +GAV   KDQ  CGSCWSF
Sbjct: 316 G---KPFPYDVPKYKD----EIPDQYDWRLYGAVTPVKDQSVCGSCWSF 357


>UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1;
           Rhipicephalus appendiculatus|Rep: Midgut cysteine
           proteinase 4 - Rhipicephalus appendiculatus (Brown ear
           tick)
          Length = 345

 Score = 52.0 bits (119), Expect = 1e-05
 Identities = 25/90 (27%), Positives = 44/90 (48%)
 Frame = +2

Query: 284 NQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPAN 463
           ++K++ G + Y + +N + DM   E V    G+   +            +      +P  
Sbjct: 73  DEKFKNGTLLYSVAVNHFADMTPDEVVANYTGYKPPSAQQ---------LAEIPLYAPLF 123

Query: 464 VKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
              PE ++WR++G V   K+QG+CGSCW+F
Sbjct: 124 GDTPEFIEWRENGFVTPVKNQGQCGSCWAF 153



 Score = 38.7 bits (86), Expect = 0.14
 Identities = 22/48 (45%), Positives = 30/48 (62%)
 Frame = +1

Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLIGXTAFQ 708
           GALEGQ F+++  L+SL   +  +DC+G   GNNG   G + G  AFQ
Sbjct: 157 GALEGQVFKRTRRLISL-SEQNLMDCAGQRYGNNGCNGGQMPG--AFQ 201


>UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35;
           Viridiplantae|Rep: Cysteine proteinase 15A precursor -
           Pisum sativum (Garden pea)
          Length = 363

 Score = 52.0 bits (119), Expect = 1e-05
 Identities = 28/77 (36%), Positives = 38/77 (49%)
 Frame = +2

Query: 323 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHG 502
           G+ K+ D+   EF +   G  K  +   +        + A  +   N  LPE  DWR+ G
Sbjct: 92  GITKFSDLTASEFRRQFLGLKKRLRLPAH-------AQKAPILPTTN--LPEDFDWREKG 142

Query: 503 AVPTFKDQGKCGSCWSF 553
           AV   KDQG CGSCW+F
Sbjct: 143 AVTPVKDQGSCGSCWAF 159


>UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Oryza
           sativa|Rep: Cysteine protease 1 precursor - Oryza sativa
           subsp. japonica (Rice)
          Length = 490

 Score = 52.0 bits (119), Expect = 1e-05
 Identities = 29/81 (35%), Positives = 41/81 (50%), Gaps = 1/81 (1%)
 Frame = +2

Query: 314 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 493
           ++LGMN++ D+ + EF  T  G     +         G   G  +       LP+ VDWR
Sbjct: 112 FRLGMNRFADLTNGEFRATYLGTTPAGR---------GRRVGEAYRHDGVEALPDSVDWR 162

Query: 494 KHGAVPT-FKDQGKCGSCWSF 553
             GAV    K+QG+CGSCW+F
Sbjct: 163 DKGAVVAPVKNQGQCGSCWAF 183


>UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Rep:
           Cathepsin R precursor - Mus musculus (Mouse)
          Length = 334

 Score = 52.0 bits (119), Expect = 1e-05
 Identities = 33/99 (33%), Positives = 46/99 (46%)
 Frame = +2

Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVR 436
           E   +I  HN++  +G   + + MN++GD    EF K M   +          MK    R
Sbjct: 54  EKLKMIKLHNRENSLGKNGFTMKMNEFGDQTDEEFRKMMIEISVWTHREGKSIMK----R 109

Query: 437 GAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            A  I      LP+ VDWRK G V   + QG C +CW+F
Sbjct: 110 EAGSI------LPKFVDWRKKGYVTPVRRQGDCDACWAF 142


>UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin l;
           n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
           similar to cathepsin l - Strongylocentrotus purpuratus
          Length = 489

 Score = 51.6 bits (118), Expect = 2e-05
 Identities = 29/82 (35%), Positives = 39/82 (47%)
 Frame = +2

Query: 308 VSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVD 487
           + Y L +N   D  H E +K M G  +  + N  L   G  V        ++  +P+ +D
Sbjct: 222 LGYVLDINHMADQSHQE-LKRMRGRLRQTRPNNGLPYDGSDV--------SDDAVPDHID 272

Query: 488 WRKHGAVPTFKDQGKCGSCWSF 553
           W   GAV   KDQ  CGSCWSF
Sbjct: 273 WNVLGAVSPVKDQAVCGSCWSF 294


>UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 350

 Score = 51.6 bits (118), Expect = 2e-05
 Identities = 31/96 (32%), Positives = 50/96 (52%), Gaps = 2/96 (2%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFV-KTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
           I KHN        +YKL  N++ DM   EF  + +N   KT+  + +   +   +RG+  
Sbjct: 78  IQKHNSDSNN---TYKLQHNQFSDMTKDEFAHRVLNSQLKTSASSSSQPAQTPQLRGSV- 133

Query: 449 ISPANVKLPEQVDWRKH-GAVPTFKDQGKCGSCWSF 553
              A++   +  DWR + G +   K+QG+CGSCW+F
Sbjct: 134 --DASLNASQGFDWRNYQGVLGNVKNQGQCGSCWTF 167


>UniRef50_Q5C1E8 Cluster: SJCHGC04937 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC04937 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 235

 Score = 51.6 bits (118), Expect = 2e-05
 Identities = 29/99 (29%), Positives = 49/99 (49%), Gaps = 5/99 (5%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSV---RGA 442
           I  HN  Y++ LV+Y LG+N++ D+   E + T      +   NKN  +   ++   +  
Sbjct: 90  IGLHNLHYDLNLVTYTLGINQFSDLTWIE-LSTFYLHELSVNLNKNKLLNSLNMFKLQSY 148

Query: 443 KFISP--ANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            F +   + + +P+  DWR    V   K+Q KCG  W+F
Sbjct: 149 NFTTTLLSTLNIPDNFDWRTKNVVTNVKNQEKCGCGWAF 187


>UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryza
           sativa|Rep: Putative cysteine proteinase - Oryza sativa
           subsp. japonica (Rice)
          Length = 352

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 25/80 (31%), Positives = 39/80 (48%)
 Frame = +2

Query: 314 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 493
           Y+L  N++ D+   EF     G+N        +Y    +      +S  + + P +VDWR
Sbjct: 84  YRLATNRFTDLTDAEFAAMYTGYNPA----NTMY---AAANATTRLSSEDDQQPAEVDWR 136

Query: 494 KHGAVPTFKDQGKCGSCWSF 553
           + GAV   K+Q  CG CW+F
Sbjct: 137 QQGAVTGVKNQRSCGCCWAF 156


>UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24;
           Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa
           (Rice)
          Length = 339

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 31/92 (33%), Positives = 44/92 (47%), Gaps = 3/92 (3%)
 Frame = +2

Query: 287 QKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANV 466
           + +  G   + L +N++ D+ ++EF        +  K NK       +VR        NV
Sbjct: 69  ESFNAGNHKFWLSVNQFADLTNYEF--------RATKTNKGFIPS--TVRVPTTFRYENV 118

Query: 467 K---LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
               LP  VDWR  GAV   KDQG+CG CW+F
Sbjct: 119 SIDTLPATVDWRTKGAVTPIKDQGQCGCCWAF 150


>UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia
           circumcincta|Rep: Secreted cathepsin F - Teladorsagia
           circumcincta
          Length = 364

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 30/94 (31%), Positives = 45/94 (47%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           I +  Q+ + G   Y  G+N++ D+   EF KT          + N  +       A+ +
Sbjct: 94  IIRSAQENDKGTAIY--GINQFADLSPEEFKKTHLPHTWKQPDHPNRIVD----LAAEGV 147

Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            P    LPE  DWR+HGAV   K +G C +CW+F
Sbjct: 148 DPKE-PLPESFDWREHGAVTKVKTEGHCAACWAF 180


>UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteinase
           B; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like
           tick cysteine proteinase B - Haemaphysalis longicornis
           (Bush tick)
          Length = 332

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 36/104 (34%), Positives = 54/104 (51%), Gaps = 6/104 (5%)
 Frame = +2

Query: 257 EHKHIIAKHNQKY-EMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLY--MKGG 427
           E++  IA+HN KY   GLV  +           HE V  +    +  +H + L   + G 
Sbjct: 52  ENRLKIARHNAKYANNGLVQAR-----------HERVWRLVA-PRVCEHPQRLQAQLPGP 99

Query: 428 SVRGAKFISPANVK---LPEQVDWRKHGAVPTFKDQGKCGSCWS 550
              G+ +I P  ++   LP+ +DWRK GAV   K+QG+CGSCW+
Sbjct: 100 PTWGSTYIEPEGLEDEHLPKTMDWRKKGAVTPVKNQGQCGSCWA 143


>UniRef50_P25779 Cluster: Cruzipain precursor; n=54;
           Trypanosoma|Rep: Cruzipain precursor - Trypanosoma cruzi
          Length = 467

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 26/77 (33%), Positives = 35/77 (45%)
 Frame = +2

Query: 323 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHG 502
           G+  + D+   EF        ++  HN   +      R    +    V  P  VDWR  G
Sbjct: 82  GVTPFSDLTREEF--------RSRYHNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARG 133

Query: 503 AVPTFKDQGKCGSCWSF 553
           AV   KDQG+CGSCW+F
Sbjct: 134 AVTAVKDQGQCGSCWAF 150


>UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61;
           Leishmania|Rep: Cysteine proteinase 2 precursor -
           Leishmania pifanoi
          Length = 444

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 30/83 (36%), Positives = 42/83 (50%), Gaps = 4/83 (4%)
 Frame = +2

Query: 317 KLGMNKYGDMLHHEFV-KTMNG---FNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQV 484
           + G+ K+ D+   EF  + +NG   F    +H    Y K  +   A         +P+ V
Sbjct: 80  QFGITKFFDLSEAEFAARYLNGAAYFAAAKRHAAQHYRKARADLSA---------VPDAV 130

Query: 485 DWRKHGAVPTFKDQGKCGSCWSF 553
           DWR+ GAV   KDQG CGSCW+F
Sbjct: 131 DWREKGAVTPVKDQGACGSCWAF 153


>UniRef50_Q5YER1 Cluster: Cysteine proteinase; n=1; Bigelowiella
           natans|Rep: Cysteine proteinase - Bigelowiella natans
           (Pedinomonas minutissima) (Chlorarachnion sp.(strain
           CCMP 621))
          Length = 140

 Score = 50.8 bits (116), Expect = 3e-05
 Identities = 27/88 (30%), Positives = 44/88 (50%)
 Frame = +2

Query: 290 KYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK 469
           ++ +G  SY + +N++ D+ + EF    +G    A+             G +  +  + K
Sbjct: 61  RHNVGGYSYTVELNEFADLTNAEFRSLYHGLKPNAQ-------------GPRRTANLSTK 107

Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
             + VDW   GAV   K+QG+CGSCWSF
Sbjct: 108 SADSVDWVSKGAVTPVKNQGQCGSCWSF 135


>UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep:
           Cathepsin - Geodia cydonium (Sponge)
          Length = 322

 Score = 50.8 bits (116), Expect = 3e-05
 Identities = 30/92 (32%), Positives = 45/92 (48%)
 Frame = +2

Query: 278 KHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISP 457
           K  ++++     Y + MN++ D+   EFV   NG  +   H  +    G      + +S 
Sbjct: 48  KFVEEFDSEREGYTVAMNEFADLDPREFVSHYNGLRRRP-HTSS----GEPCTLGEDVSA 102

Query: 458 ANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
               LP  VDWR  G V   K+QG+CGSCW+F
Sbjct: 103 ----LPTTVDWRTKGYVTGVKNQGQCGSCWAF 130



 Score = 38.7 bits (86), Expect = 0.14
 Identities = 22/41 (53%), Positives = 26/41 (63%)
 Frame = +1

Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGL 687
           G+LEGQHF  +G LVS L  +  +DCS A  GN G   GGL
Sbjct: 134 GSLEGQHFNATGKLVS-LSEQNLVDCSSA-EGNEGCN-GGL 171


>UniRef50_Q23H10 Cluster: Papain family cysteine protease containing
           protein; n=14; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 336

 Score = 50.8 bits (116), Expect = 3e-05
 Identities = 26/85 (30%), Positives = 44/85 (51%), Gaps = 4/85 (4%)
 Frame = +2

Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKH-NKNLYMKG---GSVRGAKFISPANVKLPE 478
           +Y + +N++ DM   EF + +   +    H  K +  +     +      +S  ++ L +
Sbjct: 70  TYSVHLNQFSDMTKEEFAEKILMKSDLVDHLMKGISQEATHNDTNNNETQLSSNSLTLAD 129

Query: 479 QVDWRKHGAVPTFKDQGKCGSCWSF 553
            +DWR  GAV + K+QG CGSCWSF
Sbjct: 130 SIDWRTKGAVTSVKNQGGCGSCWSF 154


>UniRef50_Q22LI1 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 987

 Score = 50.8 bits (116), Expect = 3e-05
 Identities = 27/81 (33%), Positives = 39/81 (48%)
 Frame = +2

Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
           +++LG+N+Y  M   EF +     + +    K    K          +   V +   +DW
Sbjct: 71  TFQLGLNEYAHMTSQEFAEVFLTPSISKSQQKQPKPKPQPQPHPNNSTNTTVTITP-IDW 129

Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
           R  GAV + K QGKCGSCWSF
Sbjct: 130 RNKGAVTSVKRQGKCGSCWSF 150


>UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep:
           Dvir_CG5367 - Drosophila virilis (Fruit fly)
          Length = 298

 Score = 50.8 bits (116), Expect = 3e-05
 Identities = 32/105 (30%), Positives = 51/105 (48%), Gaps = 1/105 (0%)
 Frame = +2

Query: 242 HEDIPEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMK 421
           +E   E++ I+ +HN  YE G  S++L  N   DM    ++K   G+ +  +  +     
Sbjct: 17  YEAYEENQIIVNEHNTYYETGKSSFRLATNTMADMNTDSYLK---GYLRLLRSPEI---- 69

Query: 422 GGSVRGAKFI-SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
             S   A  + SP    +PE  DWRK G +    +Q  CGSC++F
Sbjct: 70  SDSDNIADIVGSPLMNNVPESFDWRKKGFITPLYNQQSCGSCYAF 114


>UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep:
           Cysteine proteinase - Cryptobia salmositica
          Length = 443

 Score = 50.4 bits (115), Expect = 4e-05
 Identities = 29/79 (36%), Positives = 39/79 (49%), Gaps = 2/79 (2%)
 Frame = +2

Query: 323 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK--LPEQVDWRK 496
           G N++ DM   EF    N     A+H      K    +  K  +   +K  + +Q+DWR 
Sbjct: 69  GPNEFADMTSEEFQTRHNA----ARHYAAA--KARPPKNTKTFTAEEIKAAVGQQIDWRL 122

Query: 497 HGAVPTFKDQGKCGSCWSF 553
            GAV   K+QG CGSCWSF
Sbjct: 123 KGAVTPVKNQGACGSCWSF 141


>UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicatein
           a3 - Lubomirskia baicalensis
          Length = 344

 Score = 50.4 bits (115), Expect = 4e-05
 Identities = 37/121 (30%), Positives = 57/121 (47%)
 Frame = +2

Query: 191 QAAAPSQLRKRGRRQFPHEDIPEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKT 370
           Q +  SQL++  R    H     +K  I  HN   +  L  Y L MN +GD++  EF + 
Sbjct: 52  QRSYESQLQEMER----HSIWVANKKYIEHHNANAD--LFGYTLAMNGFGDLMSAEFTER 105

Query: 371 MNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWS 550
                 T KH++   ++        F SP  V   + +DWR  G V + + QG+CGS ++
Sbjct: 106 Y----LTHKHSQRSGLQ-------TFESPKGVTYADSLDWRTRGVVTSVQSQGQCGSSYA 154

Query: 551 F 553
           F
Sbjct: 155 F 155


>UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Rep:
           Cysteine protease - Babesia equi
          Length = 438

 Score = 50.4 bits (115), Expect = 4e-05
 Identities = 33/98 (33%), Positives = 45/98 (45%), Gaps = 10/98 (10%)
 Frame = +2

Query: 290 KYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGS----------VRG 439
           K + G  SY+ G+NK+ DM   EF       +   +  K+L +              VR 
Sbjct: 155 KAQTGEESYEKGINKFSDMTDEEFNLRFPALS-VEELKKSLEVSASEEFTSPEHLDKVRI 213

Query: 440 AKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           AK +   +    E +DWRK   V   KDQG CGSCW+F
Sbjct: 214 AKGLGVEDSVDGEDLDWRKLNGVTPVKDQGNCGSCWAF 251


>UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326;
           n=2; Danio rerio|Rep: hypothetical protein LOC550326 -
           Danio rerio
          Length = 531

 Score = 50.0 bits (114), Expect = 6e-05
 Identities = 33/118 (27%), Positives = 51/118 (43%), Gaps = 5/118 (4%)
 Frame = +2

Query: 215 RKRGRRQFPHEDIPEHKHIIAKHNQKY-----EMGLVSYKLGMNKYGDMLHHEFVKTMNG 379
           +++  RQ+  E   E +  +  H  ++       GL +Y +G+N + D    E  +   G
Sbjct: 233 KEKFNRQYESEKEHEERENLFLHTFRFVHSNNRAGL-TYSVGINHFADKTKEELARMTGG 291

Query: 380 FNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
                K  +        +R        ++  P  VDWR +GAV   KDQ  CGSCWSF
Sbjct: 292 L--LPKKEEKAQPFPSEIR--------SIATPNSVDWRLYGAVTPVKDQAVCGSCWSF 339


>UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1;
           Naegleria fowleri|Rep: Cysteine proteinase homolog -
           Naegleria fowleri
          Length = 347

 Score = 50.0 bits (114), Expect = 6e-05
 Identities = 28/78 (35%), Positives = 38/78 (48%), Gaps = 1/78 (1%)
 Frame = +2

Query: 323 GMNKYGDMLHHEFVKTMNGFNKTAKHNKN-LYMKGGSVRGAKFISPANVKLPEQVDWRKH 499
           G+ K+ D+   EF +       T +  K  L     +V   K +  A    P   DWR+H
Sbjct: 76  GITKFSDLTPEEFKRMFLMKTYTPEEAKKILAAPQHAVLSEKEVQTA----PTSFDWRQH 131

Query: 500 GAVPTFKDQGKCGSCWSF 553
           GAV   K+QG CGSCW+F
Sbjct: 132 GAVTRVKNQGACGSCWTF 149


>UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1;
           Brugia malayi|Rep: Cathepsin F-like cysteine proteinase
           - Brugia malayi (Filarial nematode worm)
          Length = 461

 Score = 49.6 bits (113), Expect = 7e-05
 Identities = 33/93 (35%), Positives = 39/93 (41%)
 Frame = +2

Query: 275 AKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFIS 454
           AK  Q  E G   Y  G  K+ DM   EF K M       +   N     G        +
Sbjct: 190 AKKLQFEEKGTAIY--GATKFSDMTAEEFQKIMLPSIWWDRVESN-----GITFNLNDFN 242

Query: 455 PANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            +   LP + DWR  G V   KDQG CGSCW+F
Sbjct: 243 LSIYNLPSKFDWRTEGVVTPVKDQGSCGSCWAF 275


>UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:
           Viral cathepsin - Cydia pomonella granulosis virus
           (CpGV) (Cydia pomonella granulovirus)
          Length = 333

 Score = 49.6 bits (113), Expect = 7e-05
 Identities = 29/78 (37%), Positives = 41/78 (52%), Gaps = 2/78 (2%)
 Frame = +2

Query: 326 MNKYGDMLHHEFVKTMNGFNKTAKHNKNLY-MKGGSVRGAKFISPANVKLPEQVDWR-KH 499
           +N+Y D+  +  ++   GF    K N + + M   SV   K        LPE +DWR KH
Sbjct: 77  INEYSDLNKNALLRRTTGFRLGLKKNPSAFTMTECSVVVIK--DEPQALLPETLDWRDKH 134

Query: 500 GAVPTFKDQGKCGSCWSF 553
           G  P  K+Q +CGSCW+F
Sbjct: 135 GVTPV-KNQMECGSCWAF 151


>UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis
           thaliana|Rep: Cysteine proteinase - Arabidopsis thaliana
           (Mouse-ear cress)
          Length = 348

 Score = 49.2 bits (112), Expect = 1e-04
 Identities = 35/121 (28%), Positives = 53/121 (43%), Gaps = 6/121 (4%)
 Frame = +2

Query: 209 QLRKRGRRQFPHEDIPEHKHIIAKHN----QKYEMG-LVSYKLGMNKYGDMLHHEFVKTM 373
           Q   R  R +  E    ++  I K N    Q + M   ++YK+ +N++ D+   EF  T 
Sbjct: 37  QWMARFNRVYSDETEKRNRFNIFKKNLEFVQNFNMNNKITYKVDINEFSDLTDEEFRATH 96

Query: 374 NGFNKTAKHNKNLYMKGGSVRGAKFISPANVK-LPEQVDWRKHGAVPTFKDQGKCGSCWS 550
            G        +   +  G  +        NV    E +DWR+ GAV   K QG+CG CW+
Sbjct: 97  TGLVVPEAITRISTLSSG--KNTVPFRYGNVSDNGESMDWRQEGAVTPVKYQGRCGGCWA 154

Query: 551 F 553
           F
Sbjct: 155 F 155


>UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba
           culbertsoni|Rep: Cysteine proteinase - Acanthamoeba
           culbertsoni
          Length = 482

 Score = 49.2 bits (112), Expect = 1e-04
 Identities = 24/89 (26%), Positives = 38/89 (42%)
 Frame = +2

Query: 287 QKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANV 466
           +++  G  ++ + MN++GD+   EF +   G    A   +                    
Sbjct: 95  EEFNRGNHTFTVAMNEHGDLTPEEFARLYMGQVSPASEQELQERIAAESAMEDEHHHTRA 154

Query: 467 KLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            +P   DWR  GAV   K+QG C SCW+F
Sbjct: 155 SIPANWDWRTKGAVTPVKNQGSCASCWAF 183


>UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep:
           Cysteine proteinase - Paragonimus westermani
          Length = 272

 Score = 49.2 bits (112), Expect = 1e-04
 Identities = 19/38 (50%), Positives = 26/38 (68%), Gaps = 1/38 (2%)
 Frame = +2

Query: 443 KFISPANVKL-PEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           K + P  +K  PE++DWR  GAV   ++QG CGSCW+F
Sbjct: 44  KRVRPTGLKAAPERIDWRAKGAVTAVENQGSCGSCWAF 81


>UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep:
           Cysteine protease - Clonorchis sinensis
          Length = 328

 Score = 49.2 bits (112), Expect = 1e-04
 Identities = 18/26 (69%), Positives = 21/26 (80%)
 Frame = +2

Query: 476 EQVDWRKHGAVPTFKDQGKCGSCWSF 553
           E+ DWR+HGAV    DQGKCGSCW+F
Sbjct: 117 EKFDWREHGAVGPVLDQGKCGSCWAF 142


>UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163
           precursor; n=4; Schizophora|Rep: Putative cysteine
           proteinase CG12163 precursor - Drosophila melanogaster
           (Fruit fly)
          Length = 614

 Score = 49.2 bits (112), Expect = 1e-04
 Identities = 31/86 (36%), Positives = 45/86 (52%)
 Frame = +2

Query: 296 EMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLP 475
           EMG  S K G+ ++ DM   E+ K   G  +  +        GGS   A  +   + +LP
Sbjct: 346 EMG--SAKYGITEFADMTSSEY-KERTGLWQRDEAKAT----GGS---AAVVPAYHGELP 395

Query: 476 EQVDWRKHGAVPTFKDQGKCGSCWSF 553
           ++ DWR+  AV   K+QG CGSCW+F
Sbjct: 396 KEFDWRQKDAVTQVKNQGSCGSCWAF 421


>UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum
           aestivum|Rep: Cysteine protease - Triticum aestivum
           (Wheat)
          Length = 371

 Score = 48.8 bits (111), Expect = 1e-04
 Identities = 32/95 (33%), Positives = 46/95 (48%), Gaps = 13/95 (13%)
 Frame = +2

Query: 308 VSYKLGMNKYGDMLHHEFV-KTMNGFNKTAKHNKNLY--MKGGSVRGAKFISPA-----N 463
           + Y+LG N++ D+ + EF+ + + G    A     L   + G  V GA     A     N
Sbjct: 86  LGYELGENEFTDLTNEEFMARYVGGAYGGAGDGGGLITTLAGDVVEGAASSKNAIEEDRN 145

Query: 464 VKL-----PEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           + +     P Q DWR+HG V   K QG CG CW+F
Sbjct: 146 LTMTASDPPRQFDWREHGVVTPAKQQGACGCCWAF 180


>UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza
           sativa|Rep: Os01g0347600 protein - Oryza sativa subsp.
           japonica (Rice)
          Length = 343

 Score = 48.8 bits (111), Expect = 1e-04
 Identities = 35/104 (33%), Positives = 50/104 (48%), Gaps = 5/104 (4%)
 Frame = +2

Query: 257 EHKHIIAKHNQKYEMGL---VSYK--LGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMK 421
           EH+  I + N  +  G    V+Y   +G+N++ D+ + EFV T  G      H K     
Sbjct: 62  EHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPP--HPKE---- 115

Query: 422 GGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
                  + + P  +  P  +DWR  GAV   KDQG CGSCW+F
Sbjct: 116 -----APRPVDP--IWTPCCIDWRFRGAVTGVKDQGACGSCWAF 152


>UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1;
           Oryza sativa (japonica cultivar-group)|Rep: Putative
           uncharacterized protein - Oryza sativa subsp. japonica
           (Rice)
          Length = 289

 Score = 48.8 bits (111), Expect = 1e-04
 Identities = 35/104 (33%), Positives = 50/104 (48%), Gaps = 5/104 (4%)
 Frame = +2

Query: 257 EHKHIIAKHNQKYEMGL---VSYK--LGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMK 421
           EH+  I + N  +  G    V+Y   +G+N++ D+ + EFV T  G      H K     
Sbjct: 61  EHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPP--HPKE---- 114

Query: 422 GGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
                  + + P  +  P  +DWR  GAV   KDQG CGSCW+F
Sbjct: 115 -----APRPVDP--IWTPCCIDWRFRGAVTGVKDQGACGSCWAF 151


>UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n=3;
           Brugia malayi|Rep: Cathepsin L-like cysteine proteinase
           - Brugia malayi (Filarial nematode worm)
          Length = 353

 Score = 48.8 bits (111), Expect = 1e-04
 Identities = 30/95 (31%), Positives = 46/95 (48%)
 Frame = +2

Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
           +I +HNQ+Y  GL +YK+ +NK  D    E  + + G+      N   Y +G   R  + 
Sbjct: 74  MIDEHNQRYSKGLETYKVDLNKMSDWTEEE-KERLRGYYP----NLTEYAEGDLSRIIR- 127

Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
                  +P+  D+RK   V    DQG+CG C+ F
Sbjct: 128 -GNITTTIPKSFDYRKKITVLPASDQGRCGVCFIF 161


>UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 360

 Score = 48.4 bits (110), Expect = 2e-04
 Identities = 28/83 (33%), Positives = 40/83 (48%)
 Frame = +2

Query: 305 LVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQV 484
           LV  K+G+N++ D+ H EF     G     KH+K+        +  +   P +  LP   
Sbjct: 83  LVFSKVGVNQFADLTHEEFKALYTGH----KHSKD--DDDDDNKNKQPHLPTD-NLPASF 135

Query: 485 DWRKHGAVPTFKDQGKCGSCWSF 553
           DWR  GA+   K Q  CG CW+F
Sbjct: 136 DWRDKGAITPVKVQNGCGGCWAF 158


>UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8;
           Magnoliophyta|Rep: Os12g0273800 protein - Oryza sativa
           subsp. japonica (Rice)
          Length = 504

 Score = 48.4 bits (110), Expect = 2e-04
 Identities = 26/74 (35%), Positives = 37/74 (50%)
 Frame = +2

Query: 314 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 493
           Y LG+N++ D+   EF  TM      +  N  + +      G K+ + +   LP  VDWR
Sbjct: 86  YWLGVNQFADLTSEEFKATMTNSKGFSTPNNGVRVS----TGFKYENVSADALPASVDWR 141

Query: 494 KHGAVPTFKDQGKC 535
             GAV   KDQG+C
Sbjct: 142 TKGAVTRIKDQGQC 155


>UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11;
           Entamoeba|Rep: Cysteine proteinase 2 precursor -
           Entamoeba histolytica
          Length = 315

 Score = 48.4 bits (110), Expect = 2e-04
 Identities = 17/31 (54%), Positives = 23/31 (74%)
 Frame = +2

Query: 461 NVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           N++ PE VDWRK G V   +DQ +CGSC++F
Sbjct: 91  NIQAPESVDWRKEGKVTPIRDQAQCGSCYTF 121


>UniRef50_Q650X9 Cluster: Putative cysteine proteinase; n=1; Oryza
           sativa (japonica cultivar-group)|Rep: Putative cysteine
           proteinase - Oryza sativa subsp. japonica (Rice)
          Length = 361

 Score = 48.0 bits (109), Expect = 2e-04
 Identities = 31/79 (39%), Positives = 35/79 (44%), Gaps = 1/79 (1%)
 Frame = +2

Query: 308 VSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANV-KLPEQV 484
           +SYKLG+NK+ DM   EF     G    A            V  A    P  V   P   
Sbjct: 78  MSYKLGLNKFSDMTVEEFAAKYTGVQVDAG--------AAVVTSAPDEQPVLVGDAPPVW 129

Query: 485 DWRKHGAVPTFKDQGKCGS 541
           DWR HGAV   KDQG CG+
Sbjct: 130 DWRDHGAVTPVKDQGSCGT 148


>UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep:
           Cysteine protease - Saprolegnia parasitica
          Length = 523

 Score = 48.0 bits (109), Expect = 2e-04
 Identities = 24/81 (29%), Positives = 40/81 (49%)
 Frame = +2

Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
           S+ +G N+Y  +   EF K   G   +  +   +  +      A  ++  +V  P ++DW
Sbjct: 68  SFTMGHNEYSHLTFDEFKKLRTGLRVSPSY---IQSRAKYALMAPAVNMTDV--PNEMDW 122

Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
            + G V   K+QG CGSCW+F
Sbjct: 123 VEQGGVTPVKNQGMCGSCWAF 143


>UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18;
           Plasmodium|Rep: Cysteine proteinase precursor -
           Plasmodium vivax (strain Salvador I)
          Length = 583

 Score = 48.0 bits (109), Expect = 2e-04
 Identities = 34/103 (33%), Positives = 50/103 (48%), Gaps = 9/103 (8%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSV----RG 439
           I KHN+  +M    YK+ +N++ D    +F            H K  Y+   S     +G
Sbjct: 268 IKKHNETNQM----YKMKVNQFSDYSKKDFESYFRKLVPIPDHLKKKYVVPFSSMNNGKG 323

Query: 440 AKFI---SPANV--KLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
              +   S AN+   +PE +D+R+ G V   KDQG CGSCW+F
Sbjct: 324 KNVVTSSSGANLLADVPEILDYREKGIVHEPKDQGLCGSCWAF 366


>UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27;
           Eukaryota|Rep: Cysteine proteinase GP-I - Zingiber
           officinale (Ginger)
          Length = 221

 Score = 48.0 bits (109), Expect = 2e-04
 Identities = 17/28 (60%), Positives = 22/28 (78%)
 Frame = +2

Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           LP+ +DWR+ GAV   K+QG CGSCW+F
Sbjct: 3   LPDSIDWREKGAVVPVKNQGGCGSCWAF 30


>UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19;
           Bilateria|Rep: Cathepsin F precursor - Homo sapiens
           (Human)
          Length = 484

 Score = 48.0 bits (109), Expect = 2e-04
 Identities = 32/94 (34%), Positives = 44/94 (46%), Gaps = 1/94 (1%)
 Frame = +2

Query: 275 AKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMK-GGSVRGAKFI 451
           A+  Q  + G   Y  G+ K+ D+   EF        +T   N  L  + G  ++ AK +
Sbjct: 218 AQKIQALDRGTAQY--GVTKFSDLTEEEF--------RTIYLNTLLRKEPGNKMKQAKSV 267

Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
                  P + DWR  GAV   KDQG CGSCW+F
Sbjct: 268 GDL---APPEWDWRSKGAVTKVKDQGMCGSCWAF 298


>UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin O
           precursor; n=1; Tribolium castaneum|Rep: PREDICTED:
           similar to Cathepsin O precursor - Tribolium castaneum
          Length = 326

 Score = 47.6 bits (108), Expect = 3e-04
 Identities = 28/90 (31%), Positives = 42/90 (46%)
 Frame = +2

Query: 284 NQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPAN 463
           N K   G   Y  G+ K+ D+L  EF +T    N + K + N   +    R         
Sbjct: 70  NSKKRNGSALY--GLTKFSDLLPEEFFQTYLQSNLSQKTHSNEPKRHHHKRAT------- 120

Query: 464 VKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
             +P +VDWR+  AV    +QG CG+CW++
Sbjct: 121 --VPNKVDWREKNAVTRIYNQGSCGACWAY 148


>UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza
           sativa|Rep: Os09g0381400 protein - Oryza sativa subsp.
           japonica (Rice)
          Length = 362

 Score = 47.6 bits (108), Expect = 3e-04
 Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 2/86 (2%)
 Frame = +2

Query: 302 GLVSYKLGMNKYGDMLHHEFVKTMNGFNK-TAKHNKNLYMKGGSVRGAKFISPANVKLPE 478
           G ++Y+L  N++ D+   EF+ T  G+       + ++   G     A F     V +P 
Sbjct: 89  GDLTYQLAENEFADLTEEEFLATYTGYYAGDGPVDDSVITTGAGDVDASF--SYRVDVPA 146

Query: 479 QVDWRKHGAVPTFKDQ-GKCGSCWSF 553
            VDWR  GAV   K Q   C SCW+F
Sbjct: 147 SVDWRAQGAVVPPKSQTSTCSSCWAF 172


>UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_26_49243_47612 - Giardia lamblia
           ATCC 50803
          Length = 543

 Score = 47.6 bits (108), Expect = 3e-04
 Identities = 17/30 (56%), Positives = 20/30 (66%)
 Frame = +2

Query: 464 VKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           V+ P Q+DWR  G +   KDQ  CGSCWSF
Sbjct: 314 VQFPRQLDWRVRGVITPVKDQAACGSCWSF 343


>UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides
           sonorensis|Rep: Cathepsin L - Culicoides sonorensis
          Length = 331

 Score = 47.6 bits (108), Expect = 3e-04
 Identities = 29/95 (30%), Positives = 48/95 (50%), Gaps = 1/95 (1%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
           + +HN +Y  G+ +Y+ G+N++ D+ + EF K   G  +    N+ +    G +      
Sbjct: 58  VMEHNARYLSGMETYEKGVNQFSDLTYEEFAKLYLG--EKISFNELMTNADGWIE----- 110

Query: 452 SPANVKL-PEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            P   +L PE   W     VP  K+Q +CGSCW+F
Sbjct: 111 KPLRRQLAPESYAWDTKD-VPV-KNQAQCGSCWAF 143


>UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 513

 Score = 47.6 bits (108), Expect = 3e-04
 Identities = 37/116 (31%), Positives = 52/116 (44%), Gaps = 7/116 (6%)
 Frame = +2

Query: 227 RRQFPHEDIPEHKHIIAKHNQKYEMGL----VSYKLGMNKYGDMLHHEFVKTMNGFNKTA 394
           R+++P     E +  I +HN ++        + Y L  N   DM   E V  M G     
Sbjct: 218 RKRYPSAHEHEKRKDIYRHNMRFIKSRNRQHLGYSLKPNHMADMTDAE-VNRMKGL---- 272

Query: 395 KHNKNLYMKGGSVRGAKFISP---ANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
                L+ +   +  + F  P     V LP  VDWRK GAV + K QG CGSC++F
Sbjct: 273 -----LHEEPPLIGDSPFSIPDKDRGVPLPPHVDWRKAGAVNSVKSQGICGSCYAF 323


>UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue of
           Sarcophaga 26,29kDa proteinase; n=1; Nasonia
           vitripennis|Rep: PREDICTED: similar to homologue of
           Sarcophaga 26,29kDa proteinase - Nasonia vitripennis
          Length = 553

 Score = 47.2 bits (107), Expect = 4e-04
 Identities = 31/119 (26%), Positives = 53/119 (44%), Gaps = 4/119 (3%)
 Frame = +2

Query: 209 QLRKRGRRQFPHEDIPEHKHIIAKHNQKYEMGL----VSYKLGMNKYGDMLHHEFVKTMN 376
           + +K   + + H+   + +    +HN ++   +    + + L +N   D    E +K + 
Sbjct: 250 RFKKTHNKNYAHDLEHKQRKEHFRHNLRFIHSINRANLGFTLDVNHLADRNEAE-LKVLR 308

Query: 377 GFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           G   T +H  N     G +     +      +P+  DWR +GAV   KDQ  CGSCWSF
Sbjct: 309 GKQYT-QHGYN-----GGMPFPHDVEKEKADVPDSFDWRLYGAVTPVKDQSVCGSCWSF 361


>UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,
           partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED:
           hypothetical protein, partial - Ornithorhynchus anatinus
          Length = 224

 Score = 47.2 bits (107), Expect = 4e-04
 Identities = 19/33 (57%), Positives = 21/33 (63%)
 Frame = +2

Query: 455 PANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           PA     E  DWRK GAV   K+QG CGSCW+F
Sbjct: 126 PAGPLRAETCDWRKEGAVTPVKNQGDCGSCWAF 158


>UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila
           melanogaster|Rep: CG11459-PA - Drosophila melanogaster
           (Fruit fly)
          Length = 336

 Score = 47.2 bits (107), Expect = 4e-04
 Identities = 30/113 (26%), Positives = 56/113 (49%), Gaps = 2/113 (1%)
 Frame = +2

Query: 221 RGRRQFPHEDIPEHKHI-IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAK 397
           R R ++ H  + E + + +  HNQ Y  G V++K+G+NK+ D          +      +
Sbjct: 42  RNRDKY-HRALYEQRVLAVESHNQLYLQGKVAFKMGLNKFSDTDQRILFNYRSSIPAPLE 100

Query: 398 HNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQG-KCGSCWSF 553
            + N   +  +V   ++      ++ E +DWR++G +    DQG +C SCW+F
Sbjct: 101 TSTNALTE--TVNYKRYD-----QITEGIDWRQYGYISPVGDQGTECLSCWAF 146


>UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin W,
           partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED:
           similar to Cathepsin W, partial - Ornithorhynchus
           anatinus
          Length = 229

 Score = 46.4 bits (105), Expect = 7e-04
 Identities = 16/26 (61%), Positives = 20/26 (76%)
 Frame = +2

Query: 476 EQVDWRKHGAVPTFKDQGKCGSCWSF 553
           E  DWRK GA+ + K+QG CGSCW+F
Sbjct: 70  ETCDWRKRGAITSVKNQGSCGSCWAF 95


>UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium
           falciparum|Rep: Falcipain 2 - Plasmodium falciparum
          Length = 484

 Score = 46.4 bits (105), Expect = 7e-04
 Identities = 29/101 (28%), Positives = 42/101 (41%), Gaps = 2/101 (1%)
 Frame = +2

Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGF--NKTAKHNKNLYMKGGS 430
           ++ H +  HN         YK  +N++ D+ +HEF         +K  K++K L  +   
Sbjct: 191 QNAHKVNMHNNNKNS---LYKKELNRFADLTYHEFKNKYLSLRSSKPLKNSKYLLDQMNY 247

Query: 431 VRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
               K             DWR H  V   KDQ  CGSCW+F
Sbjct: 248 EEVIKKYRGEENFDHAAYDWRLHSGVTPVKDQKNCGSCWAF 288


>UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11;
           Trichomonadidae|Rep: Cysteine protease - Tritrichomonas
           foetus (Trichomonas foetus)
          Length = 315

 Score = 46.4 bits (105), Expect = 7e-04
 Identities = 30/93 (32%), Positives = 45/93 (48%)
 Frame = +2

Query: 275 AKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFIS 454
           A++ Q++  G   + + +NK+  +   E+ K M G+    K  K         RG K   
Sbjct: 49  ARYVQEHNAGDSKFTVSLNKFAALTPSEY-KVMLGYKTGMKAEK-------VSRGMK--- 97

Query: 455 PANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
             NV   + +DWR+ G V   KDQ  CGSCW+F
Sbjct: 98  KPNV---DSIDWREKGVVNEIKDQAACGSCWAF 127


>UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1;
           Trypanosoma cruzi|Rep: Cysteine protease, putative -
           Trypanosoma cruzi
          Length = 434

 Score = 46.4 bits (105), Expect = 7e-04
 Identities = 27/82 (32%), Positives = 39/82 (47%), Gaps = 2/82 (2%)
 Frame = +2

Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
           SY+LG+NK+ DM   EF    NG  + A        +    +  K         PE ++W
Sbjct: 80  SYRLGINKFSDMTKEEFNAKFNG--RVAAPQSTQSPQRAPYKRTK------ATFPEALNW 131

Query: 491 R--KHGAVPTFKDQGKCGSCWS 550
           +  K+  +   KDQG CGSCW+
Sbjct: 132 QEAKNPVLTPVKDQGSCGSCWA 153


>UniRef50_Q23FQ5 Cluster: Papain family cysteine protease containing
           protein; n=4; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 332

 Score = 46.4 bits (105), Expect = 7e-04
 Identities = 36/118 (30%), Positives = 56/118 (47%), Gaps = 5/118 (4%)
 Frame = +2

Query: 215 RKRGRRQFPHEDIPEHKHIIAKHN-QK---YEMGL-VSYKLGMNKYGDMLHHEFVKTMNG 379
           R   RR F +ED   ++ ++   N QK   +E     +Y + +N++ D    EFV+ +  
Sbjct: 40  RSSYRRVFLNEDEETYRQLVFFENLQKLKTHEKNTEATYTVSLNQFSDYSQEEFVQRI-- 97

Query: 380 FNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            NK    +     K     G   +  A V  P  VDWR  GA+   ++QG+CGSC +F
Sbjct: 98  LNKHISRSDADIQKEQEPNGN--LRKA-VNYPTSVDWRNSGALNPIQNQGQCGSCAAF 152


>UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 383

 Score = 46.4 bits (105), Expect = 7e-04
 Identities = 26/78 (33%), Positives = 40/78 (51%)
 Frame = +2

Query: 320 LGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKH 499
           L +N++ D    E  K +   NK  K++ +     GS      I PA++      DWR+ 
Sbjct: 125 LDVNEFTDWTDEELQKMVQE-NKYTKYDFDTPKFEGSYLETGVIRPASI------DWREQ 177

Query: 500 GAVPTFKDQGKCGSCWSF 553
           G +   K+QG+CGSCW+F
Sbjct: 178 GKLTPIKNQGQCGSCWAF 195


>UniRef50_Q22A69 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 330

 Score = 46.4 bits (105), Expect = 7e-04
 Identities = 23/77 (29%), Positives = 37/77 (48%)
 Frame = +2

Query: 323 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHG 502
           G+ ++ D+ H EF     G+    ++++       S+    F +P        +DW   G
Sbjct: 73  GITQFADLTHEEFADMYLGYKPQLRNSQAKV----SLSSTPFTAPT------AIDWTTKG 122

Query: 503 AVPTFKDQGKCGSCWSF 553
           AV   K+QG CGSCW+F
Sbjct: 123 AVTPVKNQGSCGSCWAF 139


>UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10;
           Dictyostelium discoideum|Rep: Cysteine proteinase 7
           precursor - Dictyostelium discoideum (Slime mold)
          Length = 460

 Score = 46.4 bits (105), Expect = 7e-04
 Identities = 17/25 (68%), Positives = 19/25 (76%)
 Frame = +2

Query: 479 QVDWRKHGAVPTFKDQGKCGSCWSF 553
           QVDWR  GAV   K+QG+CG CWSF
Sbjct: 113 QVDWRTQGAVTPIKNQGQCGGCWSF 137


>UniRef50_Q23894 Cluster: Cysteine proteinase 3; n=2; Dictyostelium
           discoideum|Rep: Cysteine proteinase 3 - Dictyostelium
           discoideum (Slime mold)
          Length = 151

 Score = 46.4 bits (105), Expect = 7e-04
 Identities = 28/75 (37%), Positives = 39/75 (52%)
 Frame = +2

Query: 320 LGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKH 499
           LG+N++ D+ + E+   +N     A    N Y K     G +   P + K P  VDWR+ 
Sbjct: 31  LGLNQHADLSNEEY--RLNYLGTRAHIKLNGYHKRNL--GLRLNRP-HFKQPLNVDWREK 85

Query: 500 GAVPTFKDQGKCGSC 544
            AV   KDQG+CGSC
Sbjct: 86  DAVTPVKDQGQCGSC 100


>UniRef50_Q23VA1 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 397

 Score = 46.0 bits (104), Expect = 0.001
 Identities = 16/28 (57%), Positives = 20/28 (71%)
 Frame = +2

Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           +P+ VDWR  G V   KDQG+CG CW+F
Sbjct: 180 VPQSVDWRIQGKVSPVKDQGRCGCCWAF 207


>UniRef50_Q23H15 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 370

 Score = 46.0 bits (104), Expect = 0.001
 Identities = 17/28 (60%), Positives = 20/28 (71%)
 Frame = +2

Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           L   +DWR  GAV + K+QG CGSCWSF
Sbjct: 162 LAASIDWRTKGAVTSVKNQGNCGSCWSF 189


>UniRef50_Q22RR8 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 332

 Score = 46.0 bits (104), Expect = 0.001
 Identities = 34/117 (29%), Positives = 54/117 (46%), Gaps = 2/117 (1%)
 Frame = +2

Query: 260 HKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRG 439
           H+  + + N K   G  +Y  G+ K+ D+   EF +            +N      + + 
Sbjct: 62  HRFAVFRDNLKKIEGHSNY--GITKFMDLTSEEFQQRYLRLKTNTIKRQNFK---SNPKN 116

Query: 440 AKFISPANVKLPEQV--DWRKHGAVPTFKDQGKCGSCWSFQHDWELWKDSTSVSPAT 604
           A+     N+KL + +  DW K GAV   KDQ +CGSCW+F     L + +T +S  T
Sbjct: 117 AQL----NMKLGDDIIIDWTKKGAVTPVKDQEQCGSCWAFSATGAL-ESATFISTGT 168


>UniRef50_Q22RC9 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 389

 Score = 46.0 bits (104), Expect = 0.001
 Identities = 34/95 (35%), Positives = 44/95 (46%)
 Frame = +2

Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
           II++ NQ  E G   Y  G+ ++ DM   EF K+      T   N      G    G + 
Sbjct: 69  IISELNQ-VEEGTAEY--GITQFSDMTTEEF-KSQILIPSTYARN----FTGSRYHGFQK 120

Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           IS      P   DWR HGAV   K+QG  G+CW+F
Sbjct: 121 ISQ---DAPTSYDWRDHGAVTPVKNQGTVGTCWTF 152


>UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan
           CA, family C1, cathepsin L-like cysteine peptidase -
           Trichomonas vaginalis G3
          Length = 306

 Score = 46.0 bits (104), Expect = 0.001
 Identities = 17/33 (51%), Positives = 24/33 (72%), Gaps = 2/33 (6%)
 Frame = +2

Query: 461 NVK--LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           N+K  +P ++DWR+ G V   K+QG CGSCW+F
Sbjct: 83  NIKNDVPTEIDWREQGIVNKIKNQGACGSCWAF 115


>UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Spodoptera
           litura multicapsid nucleopolyhedrovirus (SpltMNPV)
          Length = 337

 Score = 46.0 bits (104), Expect = 0.001
 Identities = 23/77 (29%), Positives = 36/77 (46%)
 Frame = +2

Query: 323 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHG 502
           G+NK+ D+    FV    G      ++ +       +     ++  + + PE  DWRK  
Sbjct: 77  GINKFSDIDKITFVNEHAGLVSNLINSTDSNFDPYRLCEYVTVAGPSARTPESFDWRKLN 136

Query: 503 AVPTFKDQGKCGSCWSF 553
            V   K+QG CGSCW+F
Sbjct: 137 KVTKVKEQGVCGSCWAF 153


>UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|Rep:
           Cathepsin L precursor - Schistosoma mansoni (Blood
           fluke)
          Length = 319

 Score = 46.0 bits (104), Expect = 0.001
 Identities = 16/28 (57%), Positives = 21/28 (75%)
 Frame = +2

Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           +P+  DWR+ GAV   K+QG CGSCW+F
Sbjct: 105 IPKNFDWREKGAVTEVKNQGMCGSCWAF 132


>UniRef50_Q24HI2 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 356

 Score = 45.6 bits (103), Expect = 0.001
 Identities = 36/140 (25%), Positives = 62/140 (44%), Gaps = 2/140 (1%)
 Frame = +2

Query: 272 IAKHNQKYEMGLVSYKLGMNK-YGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
           + +HN+K      +Y L ++  +  M   +FV    G ++       L +K    +  K 
Sbjct: 68  VREHNKKVN---ATYTLSIDSPFAFMSDEQFVTEYLG-SQDCSATAELTLK----KPMKI 119

Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSFQHDWELWKDSTSVSPATWCRFFGAK 628
            +  NV++PE ++W+    V   KDQ  CGSCW+F         +T    + +  F   +
Sbjct: 120 QNKKNVQVPESINWKDLNKVSPVKDQQNCGSCWTF--------STTGAIESHYAIFEDVE 171

Query: 629 PSSTAREQLRGTTGC-NRGG 685
           P+S + +QL    G  N  G
Sbjct: 172 PTSLSEQQLIDCAGAFNNNG 191


>UniRef50_Q22W19 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 332

 Score = 45.6 bits (103), Expect = 0.001
 Identities = 23/63 (36%), Positives = 31/63 (49%), Gaps = 1/63 (1%)
 Frame = +2

Query: 368 TMNGFNKTAKHNKNLYMKGGSVRG-AKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSC 544
           T+N F    K   N   KG   R  +  I      +   +DWR+  AV   K+QG+CGSC
Sbjct: 88  TLNAFAIYTKDEFNQLFKGYQKRQKSHLIYSLKGDVAPSIDWRQKNAVTPVKNQGQCGSC 147

Query: 545 WSF 553
           W+F
Sbjct: 148 WAF 150


>UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep:
           Cathepsin L - Kudoa thyrsites
          Length = 300

 Score = 45.6 bits (103), Expect = 0.001
 Identities = 17/28 (60%), Positives = 20/28 (71%)
 Frame = +2

Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           LP  VDW+  G V + K+QG CGSCWSF
Sbjct: 102 LPSSVDWKALGKVTSVKNQGHCGSCWSF 129


>UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, whole
           genome shotgun sequence; n=7; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_22,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 350

 Score = 45.6 bits (103), Expect = 0.001
 Identities = 16/24 (66%), Positives = 19/24 (79%)
 Frame = +2

Query: 482 VDWRKHGAVPTFKDQGKCGSCWSF 553
           +DWR  GAV   KDQG+CGSCW+F
Sbjct: 146 IDWRTRGAVNKVKDQGQCGSCWAF 169


>UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome
           shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
           Chromosome 20 SCAF14744, whole genome shotgun sequence -
           Tetraodon nigroviridis (Green puffer)
          Length = 175

 Score = 45.2 bits (102), Expect = 0.002
 Identities = 25/81 (30%), Positives = 38/81 (46%)
 Frame = +2

Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
           S K G+N++ D+   EF              K+LY++  + R   F       LP + DW
Sbjct: 20  SAKYGINQFSDLSEREF--------------KDLYLRASADRAPVFTGQKIKGLPARFDW 65

Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
           R +  V   ++Q  CGSCW+F
Sbjct: 66  RDNAVVGPVQNQQACGSCWAF 86


>UniRef50_Q23TW3 Cluster: Papain family cysteine protease containing
           protein; n=5; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 437

 Score = 45.2 bits (102), Expect = 0.002
 Identities = 18/30 (60%), Positives = 22/30 (73%), Gaps = 1/30 (3%)
 Frame = +2

Query: 467 KLPEQVDWRKHGAVPTFKDQGK-CGSCWSF 553
           +LP+ VDWR+ G V   K QGK CGSCW+F
Sbjct: 204 QLPQYVDWREKGVVTQVKSQGKDCGSCWAF 233


>UniRef50_Q239L8 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 323

 Score = 45.2 bits (102), Expect = 0.002
 Identities = 16/25 (64%), Positives = 19/25 (76%)
 Frame = +2

Query: 479 QVDWRKHGAVPTFKDQGKCGSCWSF 553
           ++DW   GAV   KDQG+CGSCWSF
Sbjct: 126 EIDWTTKGAVTPVKDQGQCGSCWSF 150


>UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1;
           Bigelowiella natans|Rep: Digestive cysteine proteinase -
           Bigelowiella natans (Pedinomonas minutissima)
           (Chlorarachnion sp.(strain CCMP 621))
          Length = 360

 Score = 44.8 bits (101), Expect = 0.002
 Identities = 32/100 (32%), Positives = 45/100 (45%), Gaps = 1/100 (1%)
 Frame = +2

Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVR 436
           E++ II   N+  E+G   Y  G  ++ DM   +F   M  F       +N         
Sbjct: 50  ENERIIQGLNEN-ELGSAVY--GHTRFSDMSPEQFRAMMTPFKYHTDEAEN--------- 97

Query: 437 GAKFISPAN-VKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
            A +    N VK+ +  DWR   A+   KDQG CGSCW+F
Sbjct: 98  -AAYDQNKNAVKVTDSFDWRDFNALTPVKDQGGCGSCWAF 136


>UniRef50_Q24FN2 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 367

 Score = 44.8 bits (101), Expect = 0.002
 Identities = 15/26 (57%), Positives = 20/26 (76%)
 Frame = +2

Query: 476 EQVDWRKHGAVPTFKDQGKCGSCWSF 553
           + +DWR+ GAV   K+QG CGSCW+F
Sbjct: 157 QSIDWRQSGAVSPVKNQGSCGSCWAF 182


>UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia
           bovis|Rep: Cysteine protease 2 - Babesia bovis
          Length = 445

 Score = 44.8 bits (101), Expect = 0.002
 Identities = 16/26 (61%), Positives = 19/26 (73%)
 Frame = +2

Query: 476 EQVDWRKHGAVPTFKDQGKCGSCWSF 553
           E +DWR+  AV   KDQG CGSCW+F
Sbjct: 238 EDIDWRRADAVTPVKDQGMCGSCWAF 263


>UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 514

 Score = 44.4 bits (100), Expect = 0.003
 Identities = 31/108 (28%), Positives = 54/108 (50%), Gaps = 1/108 (0%)
 Frame = +2

Query: 230 RQFPHE-DIPEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 406
           +Q+  E ++ + KHI  +HN +Y   +    L   KY    +H FV   +G     K + 
Sbjct: 229 KQYDSEHEVSKRKHIF-RHNMRYIRSINRKNL---KYKLAPNH-FVDLTDGEYDQHKGDS 283

Query: 407 NLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWS 550
            + + G     +  +    V +P+++DWR +GAV   + QG CGSC++
Sbjct: 284 IITLYGPYSNMSHVLQ--RVDVPDELDWRDYGAVSPVRGQGICGSCYA 329


>UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14;
           Leishmania|Rep: Cysteine proteinase 1 precursor -
           Leishmania pifanoi
          Length = 354

 Score = 44.4 bits (100), Expect = 0.003
 Identities = 27/74 (36%), Positives = 39/74 (52%)
 Frame = +2

Query: 332 KYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVP 511
           K+ D+   EF K     +  A+H K+ + +   V  +   +P+ V     VDWR  GAV 
Sbjct: 90  KFADLTPQEFAKLYLNPDYYARHLKD-HKEDVHVDDS---APSGVM---SVDWRDKGAVT 142

Query: 512 TFKDQGKCGSCWSF 553
             K+QG CGSCW+F
Sbjct: 143 PVKNQGLCGSCWAF 156


>UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome
           shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
           Chromosome 21 SCAF14577, whole genome shotgun sequence -
           Tetraodon nigroviridis (Green puffer)
          Length = 406

 Score = 44.0 bits (99), Expect = 0.004
 Identities = 27/103 (26%), Positives = 47/103 (45%), Gaps = 8/103 (7%)
 Frame = +2

Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAK-------HNKNLYMKGG 427
           ++A+HN +   G  S+ L +N   D++    +   +  ++  +          NL ++  
Sbjct: 80  LVARHNLEASAGKHSFTLELNHLADLVRRVLLLQPSLASERVRLTAEEINEMNNLKVEER 139

Query: 428 S-VRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           + VR          + P  VDWRK G V   ++QG C SCW+F
Sbjct: 140 APVRNGTSEEKLGFETPPSVDWRKAGLVSPVQNQGFCNSCWAF 182


>UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2;
           Oryza sativa|Rep: Putative uncharacterized protein -
           Oryza sativa subsp. indica (Rice)
          Length = 149

 Score = 44.0 bits (99), Expect = 0.004
 Identities = 16/28 (57%), Positives = 20/28 (71%)
 Frame = +2

Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           +P+ +DWRK GAV   K Q  CGSCW+F
Sbjct: 17  MPKSIDWRKKGAVVEVKYQEDCGSCWAF 44


>UniRef50_Q24E33 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 328

 Score = 44.0 bits (99), Expect = 0.004
 Identities = 15/28 (53%), Positives = 20/28 (71%)
 Frame = +2

Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           +P +V+W   GAV   K+QG CGSCW+F
Sbjct: 127 IPSEVNWTAQGAVTPVKNQGSCGSCWAF 154


>UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia
           tropicalis|Rep: Blo t 1 allergen - Blomia tropicalis
           (Mite)
          Length = 333

 Score = 44.0 bits (99), Expect = 0.004
 Identities = 24/84 (28%), Positives = 39/84 (46%)
 Frame = +2

Query: 302 GLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQ 481
           G+   +  +N+Y DM   EF      F+ +       YMK  + +     +  +  LP+ 
Sbjct: 64  GIDGVEYAINEYSDMSEQEF-----SFHLSGGGLNFTYMKMEAAKEPLINTYGS--LPQN 116

Query: 482 VDWRKHGAVPTFKDQGKCGSCWSF 553
            DWR+   +   + QG CGSCW+F
Sbjct: 117 FDWRQKARLTRIRQQGSCGSCWAF 140


>UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2;
           Entamoeba|Rep: Cysteine proteinase ACP1 precursor -
           Entamoeba histolytica
          Length = 308

 Score = 44.0 bits (99), Expect = 0.004
 Identities = 27/76 (35%), Positives = 38/76 (50%)
 Frame = +2

Query: 326 MNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGA 505
           +N + DM H EF++T  G         +      +V+ A  +  A    PE VDWR    
Sbjct: 57  LNVFADMTHEEFIQTHLGMTYEVPETTS------NVKAA--VKAA----PESVDWR--SI 102

Query: 506 VPTFKDQGKCGSCWSF 553
           +   KDQG+CGSCW+F
Sbjct: 103 MNPAKDQGQCGSCWTF 118


>UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_26_47548_45815 - Giardia lamblia
           ATCC 50803
          Length = 577

 Score = 43.6 bits (98), Expect = 0.005
 Identities = 16/31 (51%), Positives = 21/31 (67%)
 Frame = +2

Query: 461 NVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           N  LP+++DWR  G +   KDQ  CGSCW+F
Sbjct: 341 NEDLPQELDWRVRGIMNMAKDQVACGSCWTF 371


>UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia
           intestinalis|Rep: GLP_90_15278_13989 - Giardia lamblia
           ATCC 50803
          Length = 429

 Score = 43.6 bits (98), Expect = 0.005
 Identities = 16/32 (50%), Positives = 23/32 (71%)
 Frame = +2

Query: 458 ANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           A   LP+ VD R++G +   ++QGKCGSCW+F
Sbjct: 56  AEDNLPQSVDLREYGLMTPVRNQGKCGSCWAF 87


>UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 291

 Score = 43.6 bits (98), Expect = 0.005
 Identities = 29/98 (29%), Positives = 46/98 (46%)
 Frame = +2

Query: 260 HKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRG 439
           +K+ +  HN+       +YKL +N    +   E+   +       K +KNL  +G  VR 
Sbjct: 23  NKNFVETHNKAN----ANYKLSLNSLSHLTPTEYQSLLG-----TKIDKNLVSQGKKVR- 72

Query: 440 AKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
                P     P  +D+R+ G V   +DQ +CGSCW+F
Sbjct: 73  -----PQIKDSPGILDYREMGVVNPIRDQKQCGSCWAF 105


>UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza
           sativa|Rep: Putative cysteine protease - Oryza sativa
           subsp. japonica (Rice)
          Length = 357

 Score = 43.2 bits (97), Expect = 0.006
 Identities = 24/76 (31%), Positives = 37/76 (48%)
 Frame = +2

Query: 326 MNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGA 505
           +N++ D+ + EFV T  G  +        +         + + P  + +P  +DWR  GA
Sbjct: 90  INQFADLTNGEFVATYTGVKQPPPAT---HPHPHPEEAPRPVDP--IWMPCCIDWRFKGA 144

Query: 506 VPTFKDQGKCGSCWSF 553
           V   KDQG CGS W+F
Sbjct: 145 VTGVKDQGACGSSWAF 160


>UniRef50_Q2QS15 Cluster: Papain family cysteine protease containing
           protein; n=1; Oryza sativa (japonica
           cultivar-group)|Rep: Papain family cysteine protease
           containing protein - Oryza sativa subsp. japonica (Rice)
          Length = 351

 Score = 43.2 bits (97), Expect = 0.006
 Identities = 17/28 (60%), Positives = 19/28 (67%)
 Frame = +2

Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           LP+ VDWRK GAV   K    CGSCW+F
Sbjct: 145 LPKSVDWRKKGAVVEVKYHEDCGSCWAF 172


>UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 234

 Score = 43.2 bits (97), Expect = 0.006
 Identities = 15/28 (53%), Positives = 21/28 (75%)
 Frame = +2

Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           +P+++D+R  GAV   KDQ  CGSCW+F
Sbjct: 18  IPDEIDYRTKGAVNEIKDQKHCGSCWAF 45


>UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-like
           cysteine peptidase; n=3; Trichomonas vaginalis|Rep: Clan
           CA, family C1, cathepsin L or K-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 320

 Score = 43.2 bits (97), Expect = 0.006
 Identities = 23/82 (28%), Positives = 45/82 (54%), Gaps = 1/82 (1%)
 Frame = +2

Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQV-D 487
           +Y+L +N++  + + E+ K++ G   ++K+N + ++           SP + K  E   D
Sbjct: 61  NYRLSLNQFSFLTNSEY-KSLLGGKVSSKNNDDSHL----------FSPQSKKSSEVTFD 109

Query: 488 WRKHGAVPTFKDQGKCGSCWSF 553
           WR  G +   ++QG+CG CW+F
Sbjct: 110 WRTKGIINPIRNQGQCGLCWAF 131


>UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precursor;
           n=3; Plasmodium (Laverania)|Rep: Trophozoite cysteine
           proteinase precursor - Plasmodium falciparum
          Length = 569

 Score = 43.2 bits (97), Expect = 0.006
 Identities = 17/29 (58%), Positives = 22/29 (75%)
 Frame = +2

Query: 467 KLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           K+PE +D+R+ G V   KDQG CGSCW+F
Sbjct: 332 KVPEILDYREKGIVHEPKDQGLCGSCWAF 360


>UniRef50_UPI0000E492F4 Cluster: PREDICTED: similar to cathepsin
           L-like proteinase; n=2; Strongylocentrotus
           purpuratus|Rep: PREDICTED: similar to cathepsin L-like
           proteinase - Strongylocentrotus purpuratus
          Length = 329

 Score = 42.7 bits (96), Expect = 0.009
 Identities = 27/95 (28%), Positives = 48/95 (50%)
 Frame = +2

Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVR 436
           ++  ++ ++N+ Y+ G  S+K+ MN++ D    +  K  N F+  A    NL +     R
Sbjct: 54  KNNRLVDENNRAYDEGRRSFKMAMNEFADQ---DMSKVRNKFDVQA----NL-LNAERKR 105

Query: 437 GAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGS 541
            +   S ++  LP   DWRK G V   ++QG+  S
Sbjct: 106 KSSGTSSSSSTLPSSWDWRKEGKVNPVRNQGQMNS 140


>UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas
           foetus|Rep: TFCP2 protein - Tritrichomonas foetus
           (Trichomonas foetus)
          Length = 270

 Score = 42.7 bits (96), Expect = 0.009
 Identities = 15/27 (55%), Positives = 17/27 (62%)
 Frame = +2

Query: 473 PEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           P   DWR  G V   K+QG CGSCW+F
Sbjct: 51  PTSFDWRSEGKVNPIKNQGSCGSCWAF 77


>UniRef50_Q23H06 Cluster: Papain family cysteine protease containing
           protein; n=18; Tetrahymena thermophila|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 349

 Score = 42.7 bits (96), Expect = 0.009
 Identities = 15/35 (42%), Positives = 21/35 (60%)
 Frame = +2

Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           ++  N  +   +DWR  GAV   K QG CG+CW+F
Sbjct: 134 LNSKNFTIATSIDWRSRGAVTQVKWQGNCGACWAF 168


>UniRef50_Q23FS6 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 330

 Score = 42.7 bits (96), Expect = 0.009
 Identities = 17/34 (50%), Positives = 21/34 (61%)
 Frame = +2

Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           SP+  K    V+W   G V   KDQG+CGSCW+F
Sbjct: 111 SPSTPKGQYDVNWVTRGKVSAVKDQGQCGSCWAF 144


>UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5;
           Theileria|Rep: Cysteine proteinase precursor - Theileria
           parva
          Length = 440

 Score = 42.7 bits (96), Expect = 0.009
 Identities = 27/101 (26%), Positives = 45/101 (44%), Gaps = 13/101 (12%)
 Frame = +2

Query: 290 KYEMGLVSYKLGMNKYGDMLHHEFVKTM-----------NGFNKTAKHNKNLYMKG--GS 430
           K + G   Y  G+N++ D+   EF K             NG+   +      Y+K    +
Sbjct: 157 KEQKGDEPYVKGINRFSDLTEREFYKLFPVMKPPKATYSNGYYLLSHMANKTYLKNLKKA 216

Query: 431 VRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           +   + +  A +   E +DWR+  +V + KDQ  CG CW+F
Sbjct: 217 LNTDEDVDLAKLT-GENLDWRRSSSVTSVKDQSNCGGCWAF 256


>UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin O;
           n=1; Danio rerio|Rep: PREDICTED: similar to cathepsin O
           - Danio rerio
          Length = 327

 Score = 42.3 bits (95), Expect = 0.011
 Identities = 21/58 (36%), Positives = 29/58 (50%), Gaps = 5/58 (8%)
 Frame = +2

Query: 395 KHNKNLYMKGGSVRGAKFI-SPANVKL----PEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           K  K  Y+   +    KF  S + +K+    P + DWR HG V    +QG CG CW+F
Sbjct: 90  KQFKEQYLTARAEAAPKFDQSKSEIKVKANNPPRFDWRDHGVVGPVHNQGSCGGCWAF 147


>UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2;
           Arabidopsis thaliana|Rep: Putative cysteine proteinase -
           Arabidopsis thaliana (Mouse-ear cress)
          Length = 365

 Score = 42.3 bits (95), Expect = 0.011
 Identities = 33/107 (30%), Positives = 46/107 (42%), Gaps = 5/107 (4%)
 Frame = +2

Query: 230 RQFPHEDIPEHKHIIAKHNQKY-----EMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTA 394
           R +  E   E +  + K N K+      MG  SY LG+N++ D    EF+ T  G     
Sbjct: 47  RVYKDESEKEMRLKVFKKNLKFIENFNNMGNQSYTLGVNEFTDWKTEEFLATHTGLRVNV 106

Query: 395 KHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKC 535
                L+ K    R    +S  +++  E  DWR  GAV   K QG C
Sbjct: 107 TSLSELFNKTKPSRNWN-MSDIDME-DESKDWRDEGAVTPVKYQGAC 151


>UniRef50_Q5ZC39 Cluster: CRK1 protein-like; n=2; Oryza sativa
           (japonica cultivar-group)|Rep: CRK1 protein-like - Oryza
           sativa subsp. japonica (Rice)
          Length = 374

 Score = 41.9 bits (94), Expect = 0.015
 Identities = 24/65 (36%), Positives = 31/65 (47%)
 Frame = +3

Query: 489 GGSTAPSRHSRTKGSVAHAGPFSTTGSFGRTALPSVRLPGVASSEQNLHRLLGSSYGEQR 668
           GG+ APS  S + G    A P S+ G+   TA P     G A  E+ L   +G   GE+R
Sbjct: 287 GGTAAPSSSSSSAGQSRSAVPSSSAGAAPATAGPMPASAGAAKRERGLEPTMGEREGERR 346

Query: 669 AATGG 683
            A  G
Sbjct: 347 GAGDG 351


>UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-ear
           cress). SAG12 protein; n=2; Dictyostelium
           discoideum|Rep: Similar to Arabidopsis thaliana
           (Mouse-ear cress). SAG12 protein - Dictyostelium
           discoideum (Slime mold)
          Length = 358

 Score = 41.9 bits (94), Expect = 0.015
 Identities = 17/41 (41%), Positives = 24/41 (58%)
 Frame = +2

Query: 431 VRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           + G K +   ++     +DWRK G V   KDQG+CGSC+ F
Sbjct: 132 INGYKEMENGDLNELYSIDWRKKGLVTPVKDQGQCGSCYIF 172


>UniRef50_A6N590 Cluster: Extracellular cysteine protease 8; n=10;
           Eukaryota|Rep: Extracellular cysteine protease 8 -
           Tritrichomonas foetus (Trichomonas foetus)
          Length = 315

 Score = 41.9 bits (94), Expect = 0.015
 Identities = 27/78 (34%), Positives = 38/78 (48%)
 Frame = +2

Query: 314 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 493
           +  G+NK+  M   E+ K + GF       K       +V+  K    A+V   E +DWR
Sbjct: 62  FTTGLNKFAAMTPSEY-KALLGFRMDLAQRK-------AVKSTK---KASV---ESLDWR 107

Query: 494 KHGAVPTFKDQGKCGSCW 547
           + G V   KDQ +CGSCW
Sbjct: 108 EKGVVNPIKDQAQCGSCW 125


>UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=17; Trichomonas vaginalis|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 318

 Score = 41.9 bits (94), Expect = 0.015
 Identities = 15/28 (53%), Positives = 19/28 (67%)
 Frame = +2

Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           +P+ VDWR    V   KDQ +CGSCW+F
Sbjct: 100 VPDAVDWRNAKIVNPIKDQAQCGSCWAF 127


>UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n=1;
           Toxocara canis|Rep: Cathepsin L-like cysteine proteinase
           - Toxocara canis (Canine roundworm)
          Length = 360

 Score = 41.5 bits (93), Expect = 0.020
 Identities = 14/29 (48%), Positives = 19/29 (65%)
 Frame = +2

Query: 467 KLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           ++P+  DWR +  V   K Q KCGSCW+F
Sbjct: 144 EIPDHFDWRPYNVVTPVKSQFKCGSCWAF 172


>UniRef50_A0BNM1 Cluster: Chromosome undetermined scaffold_119,
           whole genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_119,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 341

 Score = 41.5 bits (93), Expect = 0.020
 Identities = 25/99 (25%), Positives = 54/99 (54%), Gaps = 1/99 (1%)
 Frame = +2

Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKT-MNGFNKTAKHNKNLYMKGGSV 433
           ++K +I +HN++ E    ++ +G N++  + + EFV   +N  +   ++ ++  ++  + 
Sbjct: 54  QNKQMIEEHNKRSEF---TFLMGENQFMAITNEEFVSLYLNPISPEKQNEQDQIIRKTNP 110

Query: 434 RGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWS 550
           +  + I   N+K  + VDWR +  V   K+ G CGS W+
Sbjct: 111 KSPEPIREYNLK--DDVDWRGYAPV---KNSGNCGSSWA 144


>UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin O
           precursor; n=2; Apocrita|Rep: PREDICTED: similar to
           Cathepsin O precursor - Apis mellifera
          Length = 374

 Score = 41.1 bits (92), Expect = 0.026
 Identities = 12/31 (38%), Positives = 20/31 (64%)
 Frame = +2

Query: 461 NVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           ++ +P + DWR  G +   + QG CG+CW+F
Sbjct: 152 SISIPLRFDWRDKGVITPVRSQGSCGACWAF 182


>UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryza
           sativa (japonica cultivar-group)|Rep: Putative cysteine
           proteinase - Oryza sativa subsp. japonica (Rice)
          Length = 357

 Score = 41.1 bits (92), Expect = 0.026
 Identities = 26/84 (30%), Positives = 39/84 (46%)
 Frame = +2

Query: 302 GLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQ 481
           G  S +L  NK+ D+ + EF +       T        + GGS  G  + +     +P  
Sbjct: 88  GKKSPRLTTNKFADLTNEEFAEYYGRPFSTP-------VIGGS--GFMYGNVRTSDVPAN 138

Query: 482 VDWRKHGAVPTFKDQGKCGSCWSF 553
           ++WR  GAV   K+Q  C SCW+F
Sbjct: 139 INWRDRGAVTQVKNQKDCASCWAF 162


>UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia
           medanensis|Rep: Sui m 1 allergen - Suidasia medanensis
          Length = 336

 Score = 41.1 bits (92), Expect = 0.026
 Identities = 29/117 (24%), Positives = 54/117 (46%), Gaps = 2/117 (1%)
 Frame = +2

Query: 209 QLRKRGRRQFPHEDIPEHKHIIAKHNQKYEMGLVSYKLGM--NKYGDMLHHEFVKTMNGF 382
           Q ++   +Q+  E+ P+ + I  ++ +  +     +  G+  N++ D+   EF       
Sbjct: 30  QFKELYGKQYTAEEEPQRRAIFEENLRWIQENHGKHGAGLEVNEHADLTAEEFSSMYATL 89

Query: 383 NKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           N+ A     L+ +   V      S  +V LP   DWR+       ++QG+CGSCW+F
Sbjct: 90  NQEAFLKSPLHKEFVQVPE----SDISVALPAAFDWRQQWNTAV-RNQGQCGSCWAF 141


>UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3;
           Theileria|Rep: Cysteine proteinase precursor - Theileria
           annulata
          Length = 441

 Score = 41.1 bits (92), Expect = 0.026
 Identities = 29/96 (30%), Positives = 44/96 (45%), Gaps = 16/96 (16%)
 Frame = +2

Query: 314 YKLGMNKYGDMLHHEFV---------KTMNGFNKTAK-----HNKNLYM-KGGSVRGAKF 448
           Y L +NK+ D+   EF          KT    +K  +     H   +Y+ K    +G + 
Sbjct: 160 YSLDLNKFSDLSDEEFKALYPVITPPKTYTSLSKHLEFKKMSHKNPIYISKLKKAKGIEE 219

Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQG-KCGSCWSF 553
           I   ++   E ++W +  AV   KDQG  CGSCW+F
Sbjct: 220 IKDLSLITGENLNWARTDAVSPIKDQGDHCGSCWAF 255


>UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16)
           [Contains: Cathepsin H mini chain; Cathepsin H heavy
           chain; Cathepsin H light chain]; n=37; Eukaryota|Rep:
           Cathepsin H precursor (EC 3.4.22.16) [Contains:
           Cathepsin H mini chain; Cathepsin H heavy chain;
           Cathepsin H light chain] - Homo sapiens (Human)
          Length = 335

 Score = 41.1 bits (92), Expect = 0.026
 Identities = 17/28 (60%), Positives = 19/28 (67%), Gaps = 1/28 (3%)
 Frame = +2

Query: 473 PEQVDWRKHGA-VPTFKDQGKCGSCWSF 553
           P  VDWRK G  V   K+QG CGSCW+F
Sbjct: 117 PPSVDWRKKGNFVSPVKNQGACGSCWTF 144


>UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryza
           sativa (japonica cultivar-group)|Rep: Putative cysteine
           proteinase - Oryza sativa subsp. japonica (Rice)
          Length = 385

 Score = 40.7 bits (91), Expect = 0.034
 Identities = 29/92 (31%), Positives = 44/92 (47%), Gaps = 10/92 (10%)
 Frame = +2

Query: 308 VSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVD 487
           ++Y+LG+N++ DM   EF     G  +T     +L  + G+V   K   PA   +P   +
Sbjct: 87  MTYRLGLNQFSDMTFEEFAGKFTG-GRTGSIAGDL--RDGAVTYCK--PPAVGYVPPSWN 141

Query: 488 WRKHGAVPTFKDQGKC----------GSCWSF 553
           W K+G V   K+Q  C          GSCW+F
Sbjct: 142 WTKYGVVTPVKNQLTCVNTIKMSMYEGSCWAF 173


>UniRef50_Q248G1 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 334

 Score = 40.7 bits (91), Expect = 0.034
 Identities = 16/30 (53%), Positives = 20/30 (66%), Gaps = 1/30 (3%)
 Frame = +2

Query: 467 KLPEQVDWRK-HGAVPTFKDQGKCGSCWSF 553
           ++PE VDWR     V   K+QG CGSCW+F
Sbjct: 120 QIPESVDWRNVTNVVGPIKNQGHCGSCWTF 149


>UniRef50_Q22NW9 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 894

 Score = 40.7 bits (91), Expect = 0.034
 Identities = 23/72 (31%), Positives = 36/72 (50%)
 Frame = +2

Query: 467 KLPEQVDWRKHGAVPTFKDQGKCGSCWSFQHDWELWKDSTSVSPATWCRFFGAKPSSTAR 646
           ++P  +DWR   AV   K+QG CGS ++F     L +    +S   W  F   +    +R
Sbjct: 682 EVPSSIDWRDLNAVTPVKNQGSCGSGYAFSTTGAL-EGIHKISGKDWKGFSEQQIIDCSR 740

Query: 647 EQLRGTTGCNRG 682
           +Q  G +GC+ G
Sbjct: 741 KQ--GNSGCHGG 750


>UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathepsin
           o - Aedes aegypti (Yellowfever mosquito)
          Length = 375

 Score = 40.7 bits (91), Expect = 0.034
 Identities = 14/27 (51%), Positives = 18/27 (66%)
 Frame = +2

Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWS 550
           LP+ VDWR  G V   + QG CG+CW+
Sbjct: 153 LPKVVDWRDKGVVAPVRSQGSCGACWA 179


>UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1;
           Methanospirillum hungatei JF-1|Rep: Peptidase C1A,
           papain precursor - Methanospirillum hungatei (strain
           JF-1 / DSM 864)
          Length = 1096

 Score = 40.7 bits (91), Expect = 0.034
 Identities = 23/65 (35%), Positives = 34/65 (52%), Gaps = 4/65 (6%)
 Frame = +2

Query: 416 MKGGSVRGAKFISPANVKLPEQVDWRKHGAVPT--FKDQGKCGSCWSFQHD--WELWKDS 583
           +K  ++     I+P    LP   DWR +G   T   K+QG CGSCW+F     +E +K+ 
Sbjct: 304 LKSSTIVSGAGITPME-GLPTSFDWRNNGGDYTTPIKNQGSCGSCWAFATTGAFESYKEI 362

Query: 584 TSVSP 598
            S +P
Sbjct: 363 KSGNP 367


>UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:
           Viral cathepsin - Xestia c-nigrum granulosis virus
           (XnGV) (Xestia c-nigrum granulovirus)
          Length = 346

 Score = 40.7 bits (91), Expect = 0.034
 Identities = 14/29 (48%), Positives = 20/29 (68%)
 Frame = +2

Query: 467 KLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           K+P+  DWR   +V + K Q +CGSCW+F
Sbjct: 132 KVPDSFDWRDRNSVTSVKMQKECGSCWAF 160


>UniRef50_Q94715 Cluster: Putative cathepsin L2 precursor; n=4;
           Paramecium tetraurelia|Rep: Putative cathepsin L2
           precursor - Paramecium tetraurelia
          Length = 294

 Score = 40.7 bits (91), Expect = 0.034
 Identities = 30/107 (28%), Positives = 53/107 (49%), Gaps = 2/107 (1%)
 Frame = +2

Query: 260 HKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRG 439
           +K +I +HNQ+ +   V+Y++G N++  + H EFV          K + ++ + G S   
Sbjct: 41  NKRMIEEHNQRED---VTYQMGENQFMTLSHEEFVDLY-----LQKSDSSVNIMGAS--- 89

Query: 440 AKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF--QHDWELW 574
              +    ++    VDWR +    T K+QG+C S W+F   +  E W
Sbjct: 90  ---LPEVQLEGLGAVDWRNY---TTVKEQGQCASGWAFSVSNSLEAW 130


>UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5;
           Piroplasmida|Rep: Cysteine proteinase, putative -
           Theileria parva
          Length = 460

 Score = 40.3 bits (90), Expect = 0.045
 Identities = 32/105 (30%), Positives = 44/105 (41%), Gaps = 17/105 (16%)
 Frame = +2

Query: 290 KYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFN-----KTAKHNKNL---------YMKG- 424
           K   G  +Y   +N + DM   EF K            T  H   L         Y+K  
Sbjct: 174 KIHQGHETYSREINSFADMTEEEFNKLFPPIKVPESKSTTSHVDRLMARMVSDETYLKNL 233

Query: 425 -GSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQG-KCGSCWSF 553
             ++   K + P N+   E +DWRK   V   K+QG +CGSCW+F
Sbjct: 234 KKALNTDKDVDPKNIT-GEGLDWRKADGVSKIKNQGLECGSCWAF 277


>UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2;
           cellular organisms|Rep: Cysteine proteinase, putative -
           Archaeoglobus fulgidus
          Length = 1088

 Score = 40.3 bits (90), Expect = 0.045
 Identities = 13/27 (48%), Positives = 18/27 (66%)
 Frame = +2

Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWS 550
           LP + DWR +  +   +DQG CGSCW+
Sbjct: 594 LPSRFDWRDYTGLSAVRDQGSCGSCWA 620


>UniRef50_Q9SIE8 Cluster: Putative cysteine proteinase; n=1;
           Arabidopsis thaliana|Rep: Putative cysteine proteinase -
           Arabidopsis thaliana (Mouse-ear cress)
          Length = 105

 Score = 39.9 bits (89), Expect = 0.060
 Identities = 25/72 (34%), Positives = 35/72 (48%)
 Frame = +2

Query: 314 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 493
           YKL +NK+ ++   EFV     F+ +  H K L  K        F      + P+ +DWR
Sbjct: 35  YKLKLNKFANLTDVEFVNAHTCFDMS-DHKKILDSK-------PFFYENMTQAPDSLDWR 86

Query: 494 KHGAVPTFKDQG 529
           + GAV   KDQG
Sbjct: 87  EKGAVTNVKDQG 98


>UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4;
           Hymenostomatida|Rep: Phospholipase A1 - Tetrahymena
           thermophila
          Length = 320

 Score = 39.9 bits (89), Expect = 0.060
 Identities = 14/25 (56%), Positives = 17/25 (68%)
 Frame = +2

Query: 479 QVDWRKHGAVPTFKDQGKCGSCWSF 553
           +VDW   G V   K+QG CGSCW+F
Sbjct: 115 EVDWTAKGKVTPVKNQGSCGSCWAF 139


>UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC02853 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 181

 Score = 39.9 bits (89), Expect = 0.060
 Identities = 15/35 (42%), Positives = 22/35 (62%), Gaps = 4/35 (11%)
 Frame = +2

Query: 461 NVKLPEQVD----WRKHGAVPTFKDQGKCGSCWSF 553
           N+KLP+  D    W+   ++ T +DQ  CGSCW+F
Sbjct: 79  NIKLPKYFDSRKYWKNCSSIRTIRDQSSCGSCWAF 113


>UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_79,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 324

 Score = 39.9 bits (89), Expect = 0.060
 Identities = 29/102 (28%), Positives = 42/102 (41%), Gaps = 3/102 (2%)
 Frame = +2

Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFN--KTAKHNKNLYMKGGS 430
           ++  +I  HN   + G  +Y +  N++ D+   EF +    F    T K     Y+  G 
Sbjct: 62  QNAQLIEAHNND-KSGKYTYTMETNQFADLTEQEFAQKYLTFRPKSTNKSKSTDYVPNGQ 120

Query: 431 VRGAKFISPANVKLPEQVDWRKHGAVPTFKDQG-KCGSCWSF 553
            R                DW + G VP  KDQG  CGS W+F
Sbjct: 121 AR----------------DWVEEGKVPPIKDQGSSCGSSWAF 146


>UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, whole
           genome shotgun sequence; n=3; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_2,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 376

 Score = 39.9 bits (89), Expect = 0.060
 Identities = 23/74 (31%), Positives = 36/74 (48%)
 Frame = +2

Query: 332 KYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVP 511
           K   + H+ F++     +KT K  K +  K  +    +  +P N   P  +DW K   V 
Sbjct: 126 KSDSLSHNSFLQA----DKTVKVVKKVVKKASATTKTEKATPKN---PPSLDWLKQ--VT 176

Query: 512 TFKDQGKCGSCWSF 553
             + QG+CGSCW+F
Sbjct: 177 EVQQQGRCGSCWAF 190


>UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16;
           Plasmodium (Vinckeia)|Rep: Cysteine proteinase precursor
           - Plasmodium vinckei
          Length = 506

 Score = 39.9 bits (89), Expect = 0.060
 Identities = 29/97 (29%), Positives = 41/97 (42%), Gaps = 5/97 (5%)
 Frame = +2

Query: 278 KHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKG--GSVRGAKFI 451
           KHN+      ++Y   +N+Y D    EF              K+ Y+      +     I
Sbjct: 195 KHNEMVGKNGLTYVQKVNQYSDFSKEEFDNYFKKLLSVPMDLKSKYIVPLKKHLANTNLI 254

Query: 452 SPANVK--LPEQVDWR-KHGAVPTFKDQGKCGSCWSF 553
           S  N     P+  D+R K   +P  KDQG CGSCW+F
Sbjct: 255 SVDNKSKDFPDSRDYRSKFNFLPP-KDQGNCGSCWAF 290


>UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Lymantria
           dispar multicapsid nuclear polyhedrosis virus (LdMNPV)
          Length = 356

 Score = 39.9 bits (89), Expect = 0.060
 Identities = 14/29 (48%), Positives = 19/29 (65%)
 Frame = +2

Query: 467 KLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           K P   DWR+   V + K+QG CG+CW+F
Sbjct: 143 KGPLHFDWREQNKVTSIKNQGACGACWAF 171


>UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2;
           Roseiflexus|Rep: Peptidase C1A, papain precursor -
           Roseiflexus sp. RS-1
          Length = 1202

 Score = 39.5 bits (88), Expect = 0.079
 Identities = 15/28 (53%), Positives = 17/28 (60%)
 Frame = +2

Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
           LP   +W   GA    KDQG CGSCW+F
Sbjct: 169 LPAAFNWCDQGACTPVKDQGVCGSCWAF 196


>UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 343

 Score = 39.5 bits (88), Expect = 0.079
 Identities = 31/98 (31%), Positives = 48/98 (48%), Gaps = 3/98 (3%)
 Frame = +2

Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
           ++ ++N K + G V+Y+L  N + D+   E+ K +        H++       S++    
Sbjct: 81  LVERYN-KEDAGKVTYEL--NDFSDLTEEEWKKYL--MTPKPDHSEK------SLKPKTL 129

Query: 449 ISPANVKLPEQVDWRK-HGA--VPTFKDQGKCGSCWSF 553
           I   N  LP  VDWR  +G   V   K QG CGSCW+F
Sbjct: 130 IDKKN--LPNSVDWRNVNGTNHVTGIKYQGPCGSCWAF 165


>UniRef50_Q23EG5 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 429

 Score = 39.5 bits (88), Expect = 0.079
 Identities = 16/33 (48%), Positives = 22/33 (66%), Gaps = 4/33 (12%)
 Frame = +2

Query: 467 KLPEQVDWRKHGAVPTFKDQ----GKCGSCWSF 553
           ++P+ VDWR+ G V + KDQ      CGSCW+F
Sbjct: 121 EIPDYVDWREKGIVSSVKDQDAVGDDCGSCWTF 153


  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 707,051,345
Number of Sequences: 1657284
Number of extensions: 15102863
Number of successful extensions: 55030
Number of sequences better than 10.0: 371
Number of HSP's better than 10.0 without gapping: 51322
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 54798
length of database: 575,637,011
effective HSP length: 98
effective length of database: 413,223,179
effective search space used: 56611575523
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
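
Note on the statistics above: the bit scores and Expect values listed for each
hit can be approximately recomputed from the gapped Lambda and K and the
"effective search space used" line. The sketch below is illustrative only and
is not part of the BLAST output. It uses the standard Karlin-Altschul
relations, bits = (Lambda*S - ln K) / ln 2 and E = K * N * exp(-Lambda*S),
where S is the raw score shown in parentheses and N is the effective search
space. Dividing E by (1 - 0.1) is an assumption: it treats the "decay const"
of 0.1 printed above as a correction applied to single HSPs. The function
names are hypothetical.

```python
import math

# Values copied from the report trailer (the "Gapped" Lambda/K row).
LAMBDA_GAPPED = 0.279
K_GAPPED = 0.0580
SEARCH_SPACE = 56611575523  # "effective search space used"

# Assumption: the printed "decay const" (0.1) is applied as a divisor
# of (1 - decay) to the E-value of a single HSP.
DECAY_CONST = 0.1


def bit_score(raw_score):
    """Convert a raw alignment score to a normalized bit score."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect(raw_score, decay=DECAY_CONST):
    """Estimate the Expect value for a single HSP with this raw score."""
    e_value = K_GAPPED * SEARCH_SPACE * math.exp(-LAMBDA_GAPPED * raw_score)
    return e_value / (1.0 - decay)


if __name__ == "__main__":
    # Raw scores taken from the hits listed in this part of the report.
    for raw in (92, 91, 90, 89, 88):
        print(f"raw={raw}  bits={bit_score(raw):.1f}  E={expect(raw):.3f}")
```

Under these assumptions, a raw score of 92 should come out near the reported
41.1 bits and Expect = 0.026. Small differences are expected because Lambda
and K are printed with limited precision.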
