SilkBase

Last updated: 2022/11/18
BLASTX 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= tesV0482.Seq
         (797 letters)

Database: uniref50 
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C...   112   8e-24
UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ...    86   1e-15
UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n...    84   4e-15
UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j...    81   3e-14
UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M...    81   4e-14
UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep...    80   6e-14
UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;...    80   6e-14
UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 p...    79   1e-13
UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi...    79   1e-13
UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs...    79   1e-13
UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n...    79   2e-13
UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca...    78   2e-13
UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar ...    77   5e-13
UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhip...    77   5e-13
UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata...    77   5e-13
UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|...    76   9e-13
UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ...    76   9e-13
UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae...    76   1e-12
UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy...    76   1e-12
UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D...    76   1e-12
UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei...    75   3e-12
UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ...    74   5e-12
UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re...    74   5e-12
UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=...    74   5e-12
UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ...    73   6e-12
UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R...    73   6e-12
UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35...    73   6e-12
UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e...    73   6e-12
UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n...    73   8e-12
UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ...    73   8e-12
UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C...    73   8e-12
UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; D...    73   8e-12
UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus...    72   2e-11
UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr...    71   3e-11
UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; ...    71   3e-11
UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat...    71   3e-11
UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox...    71   3e-11
UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine p...    71   4e-11
UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R...    71   4e-11
UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel...    71   4e-11
UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio...    70   6e-11
UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl...    70   6e-11
UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ...    70   6e-11
UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa...    70   8e-11
UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip...    70   8e-11
UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; ...    69   1e-10
UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal...    69   1e-10
UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt...    69   1e-10
UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ...    69   1e-10
UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain...    69   1e-10
UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat...    69   1e-10
UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ...    69   1e-10
UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|...    69   1e-10
UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy...    69   2e-10
UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb...    69   2e-10
UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h...    68   2e-10
UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome s...    68   2e-10
UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sa...    68   2e-10
UniRef50_Q23H15 Cluster: Papain family cysteine protease contain...    68   2e-10
UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:...    68   3e-10
UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph...    68   3e-10
UniRef50_Q22W19 Cluster: Papain family cysteine protease contain...    68   3e-10
UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, wh...    68   3e-10
UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ...    68   3e-10
UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ...    67   4e-10
UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil...    67   4e-10
UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=...    67   4e-10
UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ...    67   6e-10
UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin ...    67   6e-10
UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n...    67   6e-10
UniRef50_Q239L8 Cluster: Papain family cysteine protease contain...    67   6e-10
UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; ...    67   6e-10
UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic...    67   6e-10
UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease ...    66   7e-10
UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryz...    66   7e-10
UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep...    66   7e-10
UniRef50_Q23H10 Cluster: Papain family cysteine protease contain...    66   7e-10
UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto...    66   7e-10
UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome s...    66   1e-09
UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr...    66   1e-09
UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v...    66   1e-09
UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca...    66   1e-09
UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa...    66   1e-09
UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty...    66   1e-09
UniRef50_Q24E33 Cluster: Papain family cysteine protease contain...    66   1e-09
UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ...    66   1e-09
UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ...    65   2e-09
UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1...    65   2e-09
UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi...    65   2e-09
UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheir...    65   2e-09
UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain...    65   2e-09
UniRef50_O16454 Cluster: Temporarily assigned gene name protein ...    64   3e-09
UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ...    64   4e-09
UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-...    64   4e-09
UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ...    64   4e-09
UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er...    64   4e-09
UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve...    64   4e-09
UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryo...    64   4e-09
UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadida...    64   5e-09
UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Re...    64   5e-09
UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|...    64   5e-09
UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ...    63   7e-09
UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum...    63   7e-09
UniRef50_Q22RC9 Cluster: Papain family cysteine protease contain...    63   7e-09
UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]...    63   7e-09
UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc...    63   7e-09
UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ...    63   9e-09
UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ...    63   9e-09
UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy...    63   9e-09
UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t...    62   1e-08
UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckei...    62   1e-08
UniRef50_Q23H06 Cluster: Papain family cysteine protease contain...    62   1e-08
UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteina...    62   1e-08
UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cy...    62   1e-08
UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ...    62   2e-08
UniRef50_UPI000155637A Cluster: PREDICTED: similar to ENSANGP000...    62   2e-08
UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ...    62   2e-08
UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ...    62   2e-08
UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa...    62   2e-08
UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Ac...    62   2e-08
UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve...    62   2e-08
UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ...    62   2e-08
UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ...    61   3e-08
UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ...    61   3e-08
UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain...    61   3e-08
UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; ...    61   3e-08
UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No...    61   4e-08
UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re...    61   4e-08
UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2; ...    61   4e-08
UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|...    61   4e-08
UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n...    61   4e-08
UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ...    61   4e-08
UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi...    61   4e-08
UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcopt...    60   5e-08
UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dv...    60   5e-08
UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi...    60   5e-08
UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Re...    60   5e-08
UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ...    60   6e-08
UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Pl...    60   6e-08
UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia...    60   6e-08
UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain...    60   6e-08
UniRef50_Q22A69 Cluster: Papain family cysteine protease contain...    60   6e-08
UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov...    60   6e-08
UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ...    60   8e-08
UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG...    60   8e-08
UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus ...    60   8e-08
UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foet...    60   8e-08
UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3...    60   8e-08
UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory...    60   8e-08
UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ...    60   8e-08
UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: ...    59   1e-07
UniRef50_Q235G6 Cluster: Papain family cysteine protease contain...    59   1e-07
UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18...    59   1e-07
UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ...    59   1e-07
UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; R...    59   1e-07
UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestiv...    59   1e-07
UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n...    59   1e-07
UniRef50_Q2QS15 Cluster: Papain family cysteine protease contain...    58   2e-07
UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ...    58   2e-07
UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus...    58   2e-07
UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy...    58   2e-07
UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;...    58   3e-07
UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin ...    58   3e-07
UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryz...    58   3e-07
UniRef50_Q0JP65 Cluster: Os01g0240900 protein; n=3; Oryza sativa...    58   3e-07
UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain...    58   3e-07
UniRef50_Q23TW3 Cluster: Papain family cysteine protease contain...    58   3e-07
UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;...    58   3e-07
UniRef50_Q23RT7 Cluster: Papain family cysteine protease contain...    58   3e-07
UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,...    57   4e-07
UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ...    57   4e-07
UniRef50_Q60EQ9 Cluster: Putative uncharacterized protein OJ1280...    57   4e-07
UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G...    57   4e-07
UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:...    57   4e-07
UniRef50_Q23H32 Cluster: Papain family cysteine protease contain...    57   4e-07
UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicate...    57   4e-07
UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine prot...    57   4e-07
UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; ...    57   4e-07
UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;...    56   8e-07
UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr...    56   8e-07
UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica...    56   1e-06
UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-...    56   1e-06
UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep: L...    56   1e-06
UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina...    56   1e-06
UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster...    55   2e-06
UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1; ...    55   2e-06
UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy...    55   2e-06
UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [C...    55   2e-06
UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2...    55   2e-06
UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh...    54   3e-06
UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease ...    54   4e-06
UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain...    54   4e-06
UniRef50_Q23894 Cluster: Cysteine proteinase 3; n=2; Dictyosteli...    54   4e-06
UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole...    54   5e-06
UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis...    54   5e-06
UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh...    54   5e-06
UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big...    53   7e-06
UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ...    53   7e-06
UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali...    53   7e-06
UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome s...    53   1e-05
UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza sativa...    53   1e-05
UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio...    53   1e-05
UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain...    53   1e-05
UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain...    53   1e-05
UniRef50_UPI0001509FF9 Cluster: Papain family cysteine protease ...    52   2e-05
UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain...    52   2e-05
UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precurs...    52   2e-05
UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1...    52   2e-05
UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa...    52   2e-05
UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin ...    51   3e-05
UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin ...    51   3e-05
UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba hi...    51   3e-05
UniRef50_A7APS9 Cluster: Papain family cysteine protease contain...    51   3e-05
UniRef50_Q5C1E8 Cluster: SJCHGC04937 protein; n=1; Schistosoma j...    51   4e-05
UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3; Theil...    51   4e-05
UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen...    51   4e-05
UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; The...    51   4e-05
UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia ...    50   5e-05
UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1; ...    50   5e-05
UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, wh...    50   5e-05
UniRef50_Q9BL26 Cluster: Putative uncharacterized protein; n=4; ...    50   7e-05
UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Pir...    50   9e-05
UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-li...    50   9e-05
UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease ...    49   1e-04
UniRef50_Q22NW9 Cluster: Papain family cysteine protease contain...    49   1e-04
UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Pl...    49   1e-04
UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:...    49   1e-04
UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n...    49   2e-04
UniRef50_Q7KWP5 Cluster: Similar to Dictyostelium discoideum (Sl...    49   2e-04
UniRef50_Q75JH0 Cluster: Similar to Dictyostelium discoideum (Sl...    49   2e-04
UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted...    49   2e-04
UniRef50_Q248G1 Cluster: Papain family cysteine protease contain...    49   2e-04
UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru...    49   2e-04
UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2; cel...    49   2e-04
UniRef50_UPI0000D566ED Cluster: PREDICTED: similar to CTLA-2-alp...    48   2e-04
UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; ...    48   2e-04
UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaste...    48   2e-04
UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcop...    48   2e-04
UniRef50_A1ZBK7 Cluster: CG10460-PA; n=1; Drosophila melanogaste...    48   3e-04
UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n...    48   4e-04
UniRef50_Q23FS6 Cluster: Papain family cysteine protease contain...    48   4e-04
UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|...    47   6e-04
UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lambli...    47   6e-04
UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w...    47   6e-04
UniRef50_Q5YER1 Cluster: Cysteine proteinase; n=1; Bigelowiella ...    46   8e-04
UniRef50_Q17IZ6 Cluster: Procathepsin L3, putative; n=2; Culicid...    46   8e-04
UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, wh...    46   8e-04
UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:...    46   8e-04
UniRef50_UPI0000D566EC Cluster: PREDICTED: similar to CG10460-PA...    46   0.001
UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome s...    46   0.001
UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8; Magnoliophyt...    46   0.001
UniRef50_Q3L7L7 Cluster: Sar s 1 allergen Yv5020C01; n=1; Sarcop...    46   0.001
UniRef50_Q231X3 Cluster: Papain family cysteine protease contain...    46   0.001
UniRef50_Q01E11 Cluster: Cysteine proteinase Cathepsin F; n=3; O...    46   0.001
UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, wh...    46   0.001
UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Re...    45   0.002
UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precu...    45   0.002
UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theil...    45   0.003
UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep...    45   0.003
UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; The...    45   0.003
UniRef50_Q2XWW8 Cluster: Cysteine protease Mir1; n=1; Zea diplop...    44   0.003
UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intest...    44   0.003
UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;...    44   0.003
UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Try...    44   0.003
UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcop...    44   0.003
UniRef50_A7LFV2 Cluster: Cathepsin L protease inhibitor 1; n=1; ...    44   0.003
UniRef50_A0BNM1 Cluster: Chromosome undetermined scaffold_119, w...    44   0.003
UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain ...    44   0.004
UniRef50_A3EXS2 Cluster: Cathepsin L-like cysteine proteinase-li...    44   0.004
UniRef50_Q24HI2 Cluster: Papain family cysteine protease contain...    44   0.006
UniRef50_Q5NE16 Cluster: Putative cathepsin L-like protein 3; n=...    44   0.006
UniRef50_UPI00015B5D85 Cluster: PREDICTED: similar to cathepsin ...    43   0.008
UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|...    43   0.008
UniRef50_Q23EG5 Cluster: Papain family cysteine protease contain...    43   0.008
UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:...    43   0.010
UniRef50_A0DKP6 Cluster: Chromosome undetermined scaffold_54, wh...    43   0.010
UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto...    43   0.010
UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ...    42   0.014
UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;...    42   0.014
UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryz...    42   0.014
UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi...    42   0.014
UniRef50_Q9XXQ7 Cluster: Putative uncharacterized protein; n=2; ...    42   0.018
UniRef50_Q237A1 Cluster: Papain family cysteine protease contain...    42   0.018
UniRef50_A6N590 Cluster: Extracellular cysteine protease 8; n=10...    42   0.018
UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endop...    42   0.018
UniRef50_Q94715 Cluster: Putative cathepsin L2 precursor; n=4; P...    42   0.018
UniRef50_A6GBK8 Cluster: Peptidase C1A, papain; n=1; Plesiocysti...    42   0.024
UniRef50_Q3L7L6 Cluster: Sar s 1 allergen Yv5032C08; n=1; Sarcop...    42   0.024
UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Pl...    42   0.024
UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; ...    41   0.031
UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein...    41   0.031
UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis...    41   0.031
UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcop...    41   0.031
UniRef50_A0DCA5 Cluster: Chromosome undetermined scaffold_45, wh...    41   0.031
UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, who...    41   0.031
UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; M...    41   0.031
UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium...    41   0.031
UniRef50_A2Q4E7 Cluster: Peptidase C1A, papain; n=1; Medicago tr...    41   0.041
UniRef50_Q54MB6 Cluster: Putative uncharacterized protein; n=1; ...    41   0.041
UniRef50_Q22DX2 Cluster: Papain family cysteine protease contain...    41   0.041
UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, w...    41   0.041
UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease ...    40   0.055
UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; ...    40   0.055
UniRef50_Q292E5 Cluster: GA10327-PA; n=1; Drosophila pseudoobscu...    40   0.055
UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like cy...    40   0.055
UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: C...    40   0.072
UniRef50_UPI00001CC928 Cluster: PREDICTED: similar to CTLA-2-bet...    40   0.096
UniRef50_Q650X9 Cluster: Putative cysteine proteinase; n=1; Oryz...    40   0.096
UniRef50_A1Z9I0 Cluster: CG6357-PA; n=3; Drosophila melanogaster...    40   0.096
UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase pr...    40   0.096
UniRef50_P12400 Cluster: Protein CTLA-2-beta; n=6; Mus musculus|...    39   0.13 
UniRef50_UPI0000DA404B Cluster: PREDICTED: similar to cathepsin ...    39   0.17 
UniRef50_Q962W3 Cluster: Cysteine protease; n=3; Giardia intesti...    39   0.17 
UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lambli...    39   0.17 
UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathep...    39   0.17 
UniRef50_A0EAG2 Cluster: Chromosome undetermined scaffold_86, wh...    39   0.17 
UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, w...    39   0.17 
UniRef50_Q93512 Cluster: Putative uncharacterized protein; n=2; ...    38   0.22 
UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella ve...    38   0.22 
UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like cy...    38   0.22 
UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, wh...    38   0.22 
UniRef50_Q9GU75 Cluster: Thiolproteinase; n=2; Babesia|Rep: Thio...    38   0.29 
UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cy...    38   0.29 
UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ...    37   0.51 
UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H...    37   0.51 
UniRef50_Q8TKH5 Cluster: Cell surface protein; n=3; Methanosarci...    37   0.51 
UniRef50_UPI0000ECBFDF Cluster: UPI0000ECBFDF related cluster; n...    37   0.67 
UniRef50_Q4S572 Cluster: Tyrosine-protein kinase receptor; n=2; ...    37   0.67 
UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome sh...    37   0.67 
UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia...    37   0.67 
UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-li...    37   0.67 
UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, wh...    37   0.67 
UniRef50_Q8TQ91 Cluster: Putative uncharacterized protein; n=1; ...    37   0.67 
UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase pr...    37   0.67 
UniRef50_P56202 Cluster: Cathepsin W precursor; n=15; Eutheria|R...    37   0.67 
UniRef50_A6EGZ3 Cluster: Aminopeptidase C; n=1; Pedobacter sp. B...    36   0.89 
UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyl...    36   0.89 
UniRef50_Q01FL5 Cluster: Cysteine protease-5; n=3; Ostreococcus|...    36   0.89 
UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1; O...    36   0.89 
UniRef50_A7LFV3 Cluster: Cathepsin L protease inhibitor 2; n=1; ...    36   0.89 
UniRef50_A3FQ13 Cluster: Cathepsin like thiol protease possibly ...    36   0.89 
UniRef50_Q2NG83 Cluster: Member of asn/thr-rich large protein fa...    36   0.89 
UniRef50_Q0AY53 Cluster: Putative uncharacterized protein; n=1; ...    36   1.2  
UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=...    36   1.2  
UniRef50_A0LES1 Cluster: Peptidase C1A, papain precursor; n=2; S...    36   1.6  
UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lambl...    36   1.6  
UniRef50_Q4N5Z7 Cluster: Cysteine proteinase, putative; n=2; The...    36   1.6  
UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus co...    36   1.6  
UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 ...    36   1.6  
UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3....    36   1.6  
UniRef50_Q7X6B4 Cluster: OSJNBa0079F16.1 protein; n=41; Euphyllo...    35   2.1  
UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba hi...    35   2.1  
UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomus...    35   2.1  
UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2...    35   2.1  
UniRef50_Q55FL7 Cluster: Putative uncharacterized protein; n=1; ...    35   2.1  
UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8; Leishma...    35   2.1  
UniRef50_UPI0000DA2FCA Cluster: PREDICTED: similar to alpha 3 ty...    35   2.7  
UniRef50_Q945E4 Cluster: Cysteine proteinase; n=1; Vasconcellea ...    35   2.7  
UniRef50_Q650W8 Cluster: Putative cysteine proteinase; n=2; Oryz...    35   2.7  
UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n...    35   2.7  
UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schist...    35   2.7  
UniRef50_Q7RPJ9 Cluster: Mature parasite-infected erythrocyte su...    35   2.7  
UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|...    35   2.7  
UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like cy...    35   2.7  
UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n...    35   2.7  
UniRef50_P84789 Cluster: Philibertain g 1; n=5; core eudicotyled...    35   2.7  
UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin ...    34   3.6  
UniRef50_Q207N1 Cluster: Cathepsin S; n=2; Clupeocephala|Rep: Ca...    34   3.6  
UniRef50_A0LQG9 Cluster: Peptidase C1A, papain; n=1; Syntrophoba...    34   3.6  
UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep...    34   3.6  
UniRef50_Q7QWN9 Cluster: GLP_26_50243_51811; n=2; Giardia lambli...    34   3.6  
UniRef50_Q23FP9 Cluster: Papain family cysteine protease contain...    34   3.6  
UniRef50_A5K8Y0 Cluster: Preprocathepsin c, putative; n=1; Plasm...    34   3.6  
UniRef50_A0BV23 Cluster: Chromosome undetermined scaffold_13, wh...    34   3.6  
UniRef50_A6H8W3 Cluster: GPR124 protein; n=4; Euteleostomi|Rep: ...    34   3.6  
UniRef50_A4YDW2 Cluster: Major facilitator superfamily MFS_1 pre...    34   3.6  
UniRef50_P21381 Cluster: Thaumatopain; n=10; Eukaryota|Rep: Thau...    34   3.6  
UniRef50_Q96PE1 Cluster: Probable G-protein coupled receptor 124...    34   3.6  
UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Ca...    34   3.6  
UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;...    34   4.8  
UniRef50_UPI00006CCC39 Cluster: hypothetical protein TTHERM_0033...    34   4.8  
UniRef50_Q4AI35 Cluster: Cysteine peptidase, putative precursor;...    34   4.8  
UniRef50_A1ZZ62 Cluster: Aminopeptidase C; n=1; Microscilla mari...    34   4.8  
UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoid...    34   4.8  
UniRef50_Q4Y2Z9 Cluster: Putative uncharacterized protein; n=3; ...    34   4.8  
UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|R...    34   4.8  
UniRef50_Q2H7E7 Cluster: Putative uncharacterized protein; n=1; ...    34   4.8  
UniRef50_A3LQQ7 Cluster: Putative uncharacterized protein ALS4; ...    34   4.8  
UniRef50_Q649T1 Cluster: Cathepsin C; n=1; uncultured archaeon G...    34   4.8  
UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Gi...    34   4.8  
UniRef50_Q70SU8 Cluster: Cystein proteinase inhibitor protein pr...    33   6.3  
UniRef50_Q9SIE8 Cluster: Putative cysteine proteinase; n=1; Arab...    33   6.3  
UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2; Arab...    33   6.3  
UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Catheps...    33   6.3  
UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpw...    33   6.3  
UniRef50_Q4YWX6 Cluster: Putative uncharacterized protein; n=1; ...    33   6.3  
UniRef50_A2F4T7 Cluster: Clan CA, family C1, cathepsin L-like cy...    33   6.3  
UniRef50_A4RJ84 Cluster: Putative uncharacterized protein; n=2; ...    33   6.3  
UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma...    33   6.3  
UniRef50_Q7MTY9 Cluster: Cysteine peptidase, putative; n=8; Bact...    33   8.3  
UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma j...    33   8.3  
UniRef50_Q4Q6W9 Cluster: Putative uncharacterized protein; n=3; ...    33   8.3  
UniRef50_Q22ST4 Cluster: Von Willebrand factor type A domain con...    33   8.3  
UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7; Rhabditida|...    33   8.3  
UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4 ...    33   8.3  

>UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15)
           [Contains: Cathepsin L heavy chain; Cathepsin L light
           chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC
           3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain] - Sarcophaga peregrina (Flesh fly)
           (Boettcherisca peregrina)
          Length = 339

 Score =  112 bits (270), Expect = 8e-24
 Identities = 70/182 (38%), Positives = 91/182 (50%), Gaps = 6/182 (3%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
           E+ FRMKI+ E++H IAKHNQ +  G VSYKLG+NKY DMLHHEF +TMNG+N T    +
Sbjct: 44  EERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADMLHHEFKETMNGYNHTL---R 100

Query: 436 NLYMKGGSVRGAKFISPANVKLPE----RWTGGSTAPSPTSRTKG--SVAHAGLQHDWSF 597
            L  +   + GA +I PA+V +P+    R  G  T            + +  G      F
Sbjct: 101 QLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHCGSCWAFSSTGALEGQHF 160

Query: 598 GKDSTSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGGIDTEQTYLTRGV 777
            K    VS                         G     F+YIKDNGGIDTE++Y   G+
Sbjct: 161 RKAGVLVS-----LSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGI 215

Query: 778 DD 783
           DD
Sbjct: 216 DD 217



 Score = 90.2 bits (214), Expect = 5e-17
 Identities = 37/54 (68%), Positives = 45/54 (83%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           +VDWR+HGAVT +KDQG CGSCW+ +     EGQHFR++G LVSLSEQNL+DCS
Sbjct: 125 SVDWREHGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCS 178



 Score = 40.3 bits (90), Expect = 0.055
 Identities = 24/49 (48%), Positives = 27/49 (55%), Gaps = 3/49 (6%)
 Frame = +2

Query: 578 FSTTGALGRTALPSVRLPGVALGAKPHRLLGA---YGNNGCNGGLMDNA 715
           FS+TGAL        R  GV +      L+     YGNNGCNGGLMDNA
Sbjct: 149 FSSTGALEGQHF---RKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNA 194



 Score = 38.3 bits (85), Expect = 0.22
 Identities = 14/23 (60%), Positives = 18/23 (78%)
 Frame = +2

Query: 191 DLVKEEWSAFKLQHRLNYESEAK 259
           DL+KEEW  +KLQHR NY +E +
Sbjct: 22  DLIKEEWHTYKLQHRKNYANEVE 44


>UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor;
           n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase
           precursor - Diabrotica virgifera virgifera (western corn
           rootworm)
          Length = 326

 Score = 85.8 bits (203), Expect = 1e-15
 Identities = 42/105 (40%), Positives = 57/105 (54%)
 Frame = +3

Query: 459 RPRG*VHIAGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVS 638
           RPR    +   ++     DWR+ GAVT++KDQG CGSCWS +     EG +F ++G LVS
Sbjct: 97  RPRVIHSLTPVKDLPSKFDWREKGAVTEVKDQGSCGSCWSFSTTGTVEGAYFLKTGKLVS 156

Query: 639 LSEQNLIDCSEHXXXXXXXXXXXXXLQVHQGQRGDRHRADLPYEG 773
           LSEQNL+DC++              L+  +   G     D PYEG
Sbjct: 157 LSEQNLVDCAKEDCYGCSGGYMDKALEYIETAGGIMSENDYPYEG 201



 Score = 42.3 bits (95), Expect = 0.014
 Identities = 20/60 (33%), Positives = 33/60 (55%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
           E+  R  I+      I  HN KY+ GL ++KLG+ K+ D+   EF   M G +++ K ++
Sbjct: 39  EEQKRFTIFQGSLRKIENHNDKYDHGLSTFKLGVTKFADLTEKEF-SDMLGISRSTKSSR 97


>UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3;
           Bilateria|Rep: Cathepsin L-like cysteine proteinase -
           Longidorus elongatus
          Length = 358

 Score = 83.8 bits (198), Expect = 4e-15
 Identities = 40/90 (44%), Positives = 52/90 (57%), Gaps = 1/90 (1%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH-XXX 683
           +VDWRK G VT +KDQG CGSCW+ +     EGQH++Q+G LVSLSEQNL+DC  +    
Sbjct: 142 SVDWRKEGYVTKVKDQGSCGSCWAFSATGSLEGQHYKQTGKLVSLSEQNLVDCDVNGDDE 201

Query: 684 XXXXXXXXXXLQVHQGQRGDRHRADLPYEG 773
                      Q  +  +G    A  PY+G
Sbjct: 202 GCNGGYMDGAFQYVETNKGIDTEASYPYKG 231



 Score = 59.3 bits (137), Expect = 1e-07
 Identities = 45/180 (25%), Positives = 73/180 (40%), Gaps = 1/180 (0%)
 Frame = +1

Query: 244 RKRGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTA 423
           + + E+  R +++A +  +I +HN +YE G  S+ L +NK+ DM + EF + MNGF   A
Sbjct: 55  KTKDEELLRFQVFASNHKVIEQHNIEYEAGQHSFALSLNKFADMTNAEFRQRMNGFKLPA 114

Query: 424 KHNKNLYMKGGSVRGAKFISPANVKLPERWTGGSTAPSPTSRTKGSVAHA-GLQHDWSFG 600
           K  K    +     G  F  P NV +P+             + +GS           S  
Sbjct: 115 K-RKLAKSQPLKEDGMIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCWAFSATGSLE 173

Query: 601 KDSTSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGGIDTEQTYLTRGVD 780
                 +                   ++    G     F+Y++ N GIDTE +Y  +G D
Sbjct: 174 GQHYKQTGKLVSLSEQNLVDCDVNGDDEGCNGGYMDGAFQYVETNKGIDTEASYPYKGRD 233


>UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC06231 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 372

 Score = 81.0 bits (191), Expect = 3e-14
 Identities = 35/72 (48%), Positives = 52/72 (72%), Gaps = 1/72 (1%)
 Frame = +3

Query: 459 RPRG*VHIAGQR-EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLV 635
           +P+G   I+ +  +    VDWR++GAVT +K+QG+CGSCW+ +     EGQH+R++  LV
Sbjct: 136 KPKGSTFISSEHAKLPDRVDWRRNGAVTPVKNQGQCGSCWAFSSTGAIEGQHYRKTNRLV 195

Query: 636 SLSEQNLIDCSE 671
           +LSEQ LIDCS+
Sbjct: 196 NLSEQQLIDCSK 207



 Score = 45.2 bits (102), Expect = 0.002
 Identities = 41/175 (23%), Positives = 73/175 (41%), Gaps = 6/175 (3%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
           E+  R  I+  +   + +HN+ Y+ G  +YK+G+N + D   +E ++ + G+    +  K
Sbjct: 78  EETKRFLIFGTNFIKMMEHNRAYQEGKATYKMGVNNFTDKTEYE-LRKLRGYRSACRIAK 136

Query: 436 NLYMKGGSVRGAKFISPANVKLPER--W-TGGSTAPSPTSRTKGS---VAHAGLQHDWSF 597
                    +G+ FIS  + KLP+R  W   G+  P       GS    +  G      +
Sbjct: 137 --------PKGSTFISSEHAKLPDRVDWRRNGAVTPVKNQGQCGSCWAFSSTGAIEGQHY 188

Query: 598 GKDSTSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGGIDTEQTY 762
            K +  V+                       + G     F+Y++DN GID+E +Y
Sbjct: 189 RKTNRLVN-----LSEQQLIDCSKSYGNNGCEGGLMDLAFQYVRDNKGIDSEISY 238


>UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15)
           (Major excreted protein) (MEP) [Contains: Cathepsin L
           heavy chain; Cathepsin L light chain]; n=19;
           Eukaryota|Rep: Cathepsin L precursor (EC 3.4.22.15)
           (Major excreted protein) (MEP) [Contains: Cathepsin L
           heavy chain; Cathepsin L light chain] - Homo sapiens
           (Human)
          Length = 333

 Score = 80.6 bits (190), Expect = 4e-14
 Identities = 34/58 (58%), Positives = 45/58 (77%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           EA  +VDWR+ G VT +K+QG+CGSCW+ +     EGQ FR++G L+SLSEQNL+DCS
Sbjct: 113 EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCS 170



 Score = 53.6 bits (123), Expect = 5e-06
 Identities = 43/182 (23%), Positives = 74/182 (40%), Gaps = 3/182 (1%)
 Frame = +1

Query: 226 AAPSQLRKRGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMN 405
           A  ++L    E+ +R  ++ ++  +I  HNQ+Y  G  S+ + MN +GDM   EF + MN
Sbjct: 34  AMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMN 93

Query: 406 GFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPERWTGGSTAPSPTSRTKGS---VAHAG 576
           GF         ++ +        + +P +V   E+   G   P       GS    +  G
Sbjct: 94  GFQNRKPRKGKVFQE-----PLFYEAPRSVDWREK---GYVTPVKNQGQCGSCWAFSATG 145

Query: 577 LQHDWSFGKDSTSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGGIDTEQ 756
                 F K    +S                    +    G     F+Y++DNGG+D+E+
Sbjct: 146 ALEGQMFRKTGRLIS-----LSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEE 200

Query: 757 TY 762
           +Y
Sbjct: 201 SY 202


>UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin
           L - Misgurnus mizolepis (Mud loach)
          Length = 337

 Score = 80.2 bits (189), Expect = 6e-14
 Identities = 34/58 (58%), Positives = 42/58 (72%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           E    +DWR+ G VT +KDQG+CGSCW+ +     EGQ FR+ G LVSLSEQNL+DCS
Sbjct: 115 EVPSKLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKQGKLVSLSEQNLVDCS 172



 Score = 72.1 bits (169), Expect = 1e-11
 Identities = 54/181 (29%), Positives = 77/181 (42%), Gaps = 4/181 (2%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
           E+ +R  I+ ++   I  HN ++ MG+ +Y+LGMN +GDM H EF + MNG+    KH  
Sbjct: 44  EEGWRRMIWEKNLRKIQFHNLEHSMGIHTYRLGMNHFGDMNHEEFRQVMNGY----KHKT 99

Query: 436 NLYMKGGSVRGAKFIS-PANVKLPERWTGGSTAPSPTSRTKGS---VAHAGLQHDWSFGK 603
               KG       F+  P+ +   E+   G   P       GS    +  G      F K
Sbjct: 100 ERKFKGSLFMEPNFLEVPSKLDWREK---GYVTPVKDQGECGSCWAFSTTGAMEGQMFRK 156

Query: 604 DSTSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGGIDTEQTYLTRGVDD 783
               VS                    +    G   Q F+YIKDN G+D+E+ Y   G DD
Sbjct: 157 QGKLVS-----LSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNNGLDSEEAYPYLGTDD 211

Query: 784 Q 786
           Q
Sbjct: 212 Q 212


>UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;
           Brugia malayi|Rep: Cahepsin L-like cysteine protease -
           Brugia malayi (Filarial nematode worm)
          Length = 371

 Score = 80.2 bits (189), Expect = 6e-14
 Identities = 33/55 (60%), Positives = 42/55 (76%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           ++DWR  GAVT +KDQG CGSCW+ + +   EGQHF Q+G LV LS QNL+DCS+
Sbjct: 146 SIDWRTSGAVTKVKDQGYCGSCWTFSAVGALEGQHFLQTGKLVELSMQNLLDCSD 200



 Score = 40.3 bits (90), Expect = 0.055
 Identities = 21/57 (36%), Positives = 31/57 (54%)
 Frame = +1

Query: 268 RMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKN 438
           R   Y ++   I KHN++YE    +Y+L +N   DML  EF K ++GF      +KN
Sbjct: 74  RFMTYLKNVKEIEKHNERYERNEETYELAINHLADMLPEEFRK-LHGFQSRKITSKN 129


>UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823
           protein, partial; n=1; Ornithorhynchus anatinus|Rep:
           PREDICTED: similar to MGC81823 protein, partial -
           Ornithorhynchus anatinus
          Length = 361

 Score = 79.4 bits (187), Expect = 1e-13
 Identities = 33/58 (56%), Positives = 43/58 (74%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           E   A+DWR HG VT +KDQG+CGSCW+     + EGQ FR++G L ++SEQNL+DCS
Sbjct: 189 EPPEALDWRDHGYVTPVKDQGRCGSCWAFGSTGVLEGQLFRRTGRLAAVSEQNLMDCS 246


>UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsin L
           - Suberites domuncula (Sponge)
          Length = 324

 Score = 79.4 bits (187), Expect = 1e-13
 Identities = 34/58 (58%), Positives = 44/58 (75%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           E A +VDWR+ G V+++K+QG+CGSCWS +     EGQH  + G LVSLSEQNL+DCS
Sbjct: 107 EPAASVDWRQKGVVSEVKNQGQCGSCWSFSATGSLEGQHALKMGRLVSLSEQNLMDCS 164


>UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precursor;
           n=3; Metazoa|Rep: Digestive cysteine proteinase 2
           precursor - Homarus americanus (American lobster)
          Length = 323

 Score = 79.0 bits (186), Expect = 1e-13
 Identities = 33/56 (58%), Positives = 42/56 (75%)
 Frame = +3

Query: 501 AGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           A  VDWR  GAVT +KDQG+CGSCW+ +     EGQHF ++G L+SL+EQ L+DCS
Sbjct: 108 ATEVDWRTKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKTGSLISLAEQQLVDCS 163



 Score = 51.6 bits (118), Expect = 2e-05
 Identities = 52/179 (29%), Positives = 73/179 (40%), Gaps = 10/179 (5%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
           ED++R  I+ +++  I + N+KYE G V++ L MNK+GDM   EF               
Sbjct: 36  EDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEF--------------- 80

Query: 436 NLYMKGGSVRGAKFISPANVKLPERWTGGSTAPSPTSRTKGSVAHAGLQHD----WSFGK 603
           N  MKG   R +   +P +V  P++ T G  A     RTKG+V     Q      W+F  
Sbjct: 81  NAVMKGNIPRRS---APVSVFYPKKET-GPQATEVDWRTKGAVTPVKDQGQCGSCWAFST 136

Query: 604 DST------SVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGGIDTEQTY 762
             +        + +                  Q    G     F YIK N GIDTE  Y
Sbjct: 137 TGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAY 195


>UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase;
           n=21; Bilateria|Rep: Cathepsin L-like cysteine
           proteinase - Globodera pallida
          Length = 379

 Score = 78.6 bits (185), Expect = 2e-13
 Identities = 33/55 (60%), Positives = 42/55 (76%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           +VDWR  G VT++K+QG CGSCW+ +     E QH RQ+G L+SLSEQNLIDCS+
Sbjct: 164 SVDWRDKGWVTEVKNQGMCGSCWAFSSTGALEAQHARQTGQLISLSEQNLIDCSK 218



 Score = 35.9 bits (79), Expect = 1.2
 Identities = 21/49 (42%), Positives = 25/49 (51%), Gaps = 3/49 (6%)
 Frame = +2

Query: 578 FSTTGALGRTALPSVRLPGVALGAKPHRLLGA---YGNNGCNGGLMDNA 715
           FS+TGAL        R  G  +      L+     YGN GCNGG+MDNA
Sbjct: 188 FSSTGAL---EAQHARQTGQLISLSEQNLIDCSKKYGNMGCNGGIMDNA 233


>UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep:
           Cathepsin - Geodia cydonium (Sponge)
          Length = 322

 Score = 78.2 bits (184), Expect = 2e-13
 Identities = 33/53 (62%), Positives = 40/53 (75%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           VDWR  G VT +K+QG+CGSCW+ +     EGQHF  +G LVSLSEQNL+DCS
Sbjct: 107 VDWRTKGYVTGVKNQGQCGSCWAFSATGSLEGQHFNATGKLVSLSEQNLVDCS 159


>UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar to
           human SRY (sex determining region Y)-box 30
           (SOX30),transcript variant 1,; n=3; Mammalia|Rep: Testis
           cDNA clone: QtsA-12228, similar to human SRY (sex
           determining region Y)-box 30 (SOX30),transcript variant
           1, - Macaca fascicularis (Crab eating macaque)
           (Cynomolgus monkey)
          Length = 433

 Score = 77.0 bits (181), Expect = 5e-13
 Identities = 33/54 (61%), Positives = 42/54 (77%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           +VDWRK G VT +K+Q +CGSCW+ +     EGQ FR++G LVSLSEQNL+DCS
Sbjct: 117 SVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCS 170



 Score = 49.6 bits (113), Expect = 9e-05
 Identities = 46/183 (25%), Positives = 74/183 (40%), Gaps = 4/183 (2%)
 Frame = +1

Query: 226 AAPSQLRKRGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMN 405
           A   +L    E+ +R  ++ ++  +I  HN +Y  G   + + MN +GDM + EF + M 
Sbjct: 34  ATHRRLYGASEEGWRRAVWEKNMKMIELHNGEYSQGKHGFAMAMNAFGDMTNEEFRQVMG 93

Query: 406 GFNKTAKHNKNLYMKGGSVRGAKFIS-PANVKLPERWTGGSTAPSPTSRTKGS---VAHA 573
            F      N+ L  KG   R   F+  P +V   ++   G   P    +  GS    +  
Sbjct: 94  CF-----RNQKL-RKGKLFREPLFLDLPKSVDWRKK---GYVTPVKNQKQCGSCWAFSAT 144

Query: 574 GLQHDWSFGKDSTSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGGIDTE 753
           G      F K    VS                    Q    G     F+Y+K+NGG+D+E
Sbjct: 145 GALEGQMFRKTGKLVS-----LSEQNLVDCSHPQGNQGCNGGFMNSAFRYVKENGGLDSE 199

Query: 754 QTY 762
           ++Y
Sbjct: 200 ESY 202


>UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1;
           Rhipicephalus appendiculatus|Rep: Midgut cysteine
           proteinase 2 - Rhipicephalus appendiculatus (Brown ear
           tick)
          Length = 564

 Score = 77.0 bits (181), Expect = 5e-13
 Identities = 43/107 (40%), Positives = 58/107 (54%), Gaps = 1/107 (0%)
 Frame = +3

Query: 351 GHEQVRRHAPPRVREDYERLQQNCQTQQESVH-EGWERPRG*VHIAGQREAAGAVDWRKH 527
           G+     H   R RE+   L+   Q++  S   E + R R    +  Q      +DWR +
Sbjct: 301 GYNLAVNHLADRTREEISVLRGRLQSKDGSSRAEPFPRHRFTAKLPDQ------IDWRPY 354

Query: 528 GAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           GAVT +KDQ  CGSCWS   +   EG +FR++G LV LSEQ L+DCS
Sbjct: 355 GAVTPVKDQAVCGSCWSFGTVGELEGAYFRKTGRLVRLSEQQLVDCS 401


>UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42;
           Coelomata|Rep: Cathepsin L2 precursor - Homo sapiens
           (Human)
          Length = 334

 Score = 77.0 bits (181), Expect = 5e-13
 Identities = 33/54 (61%), Positives = 42/54 (77%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           +VDWRK G VT +K+Q +CGSCW+ +     EGQ FR++G LVSLSEQNL+DCS
Sbjct: 117 SVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCS 170



 Score = 50.8 bits (116), Expect = 4e-05
 Identities = 47/190 (24%), Positives = 79/190 (41%), Gaps = 4/190 (2%)
 Frame = +1

Query: 226 AAPSQLRKRGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMN 405
           A   +L    E+ +R  ++ ++  +I  HN +Y  G   + + MN +GDM + EF + M 
Sbjct: 34  ATHRRLYGANEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMG 93

Query: 406 GFNKTAKHNKNLYMKGGSVRGAKFIS-PANVKLPERWTGGSTAPSPTSRTKGS---VAHA 573
            F    ++ K  + KG   R   F+  P +V   ++   G   P    +  GS    +  
Sbjct: 94  CF----RNQK--FRKGKVFREPLFLDLPKSVDWRKK---GYVTPVKNQKQCGSCWAFSAT 144

Query: 574 GLQHDWSFGKDSTSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGGIDTE 753
           G      F K    VS                    Q    G   + F+Y+K+NGG+D+E
Sbjct: 145 GALEGQMFRKTGKLVS-----LSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSE 199

Query: 754 QTYLTRGVDD 783
           ++Y    VD+
Sbjct: 200 ESYPYVAVDE 209


>UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2;
           Acanthamoeba|Rep: Cysteine proteinase - Acanthamoeba
           healyi
          Length = 330

 Score = 76.2 bits (179), Expect = 9e-13
 Identities = 33/52 (63%), Positives = 41/52 (78%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           DWR+ GAVT +K+QG+CGSCWS +     EG +F ++G LVSLSEQNLIDCS
Sbjct: 119 DWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKTGRLVSLSEQNLIDCS 170



 Score = 35.1 bits (77), Expect = 2.1
 Identities = 25/50 (50%), Positives = 31/50 (62%), Gaps = 4/50 (8%)
 Frame = +2

Query: 578 FSTTGAL-GRTALPSVRLPGVALGAKPHRLLG---AYGNNGCNGGLMDNA 715
           FSTTG+  G   L + RL  V+L  +   L+    +YGNNGCNGGLMD A
Sbjct: 141 FSTTGSTEGANFLKTGRL--VSLSEQ--NLIDCSVSYGNNGCNGGLMDYA 186


>UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine
           protease; n=11; Callosobruchus maculatus|Rep: Putative
           gut cathepsin L-like cysteine protease - Callosobruchus
           maculatus (Southern cowpea weevil) (Pulse bruchid)
          Length = 326

 Score = 76.2 bits (179), Expect = 9e-13
 Identities = 32/58 (55%), Positives = 42/58 (72%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           E   AVDWR+ GAVT +KDQ  CGSCW+ + +   EGQ F+++G LVSLS Q L+DC+
Sbjct: 111 EEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCA 168



 Score = 36.7 bits (81), Expect = 0.67
 Identities = 15/46 (32%), Positives = 27/46 (58%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFV 393
           E+  R  ++ ++   I +HN+KYE G  S+   + ++ DM H EF+
Sbjct: 39  EEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEEFL 84


>UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3;
           Curculionidae|Rep: Cysteine proteinase - Hypera postica
           (alfalfa weevil)
          Length = 324

 Score = 75.8 bits (178), Expect = 1e-12
 Identities = 34/57 (59%), Positives = 40/57 (70%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           E   +VDWRK G VT +KDQG CGSCW+ +     EG + R+SG LVSLSEQ LIDC
Sbjct: 111 EIPSSVDWRKEGRVTGVKDQGDCGSCWAFSITGSTEGAYARKSGKLVSLSEQQLIDC 167



 Score = 47.6 bits (108), Expect = 4e-04
 Identities = 23/51 (45%), Positives = 31/51 (60%)
 Frame = +1

Query: 250 RGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTM 402
           + E++ R  I+ ++   I  HN  YE G VSYK G+NK+ DM   EF KTM
Sbjct: 40  QAEESKRFNIFTDNVRAIEAHNALYEQGKVSYKKGINKFTDMSQEEF-KTM 89


>UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2;
           Platyhelminthes|Rep: Cathepsin L-like proteinase -
           Echinococcus multilocularis
          Length = 338

 Score = 75.8 bits (178), Expect = 1e-12
 Identities = 36/87 (41%), Positives = 48/87 (55%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEHXXXX 686
           ++DWRK G VT IKDQG CGSCW+ +     EGQ  R++G L+SLSEQ L+DCS +    
Sbjct: 125 SIDWRKKGLVTPIKDQGDCGSCWAFSATGALEGQLKRKTGKLISLSEQQLVDCSTYTGNE 184

Query: 687 XXXXXXXXXLQVHQGQRGDRHRADLPY 767
                       +  + G    +D PY
Sbjct: 185 GCNGGDMNDAFRYWMRNGAESESDYPY 211



 Score = 39.1 bits (87), Expect = 0.13
 Identities = 14/47 (29%), Positives = 28/47 (59%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVK 396
           E++ RM+I+  +   +  HN++Y +GL +Y   +N + D+   EF +
Sbjct: 46  EEHLRMRIFINNYLFVRWHNERYYLGLETYSTALNAFADLTLEEFAE 92


>UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3;
           Dictyostelium discoideum|Rep: Cysteine proteinase 1
           precursor - Dictyostelium discoideum (Slime mold)
          Length = 343

 Score = 75.8 bits (178), Expect = 1e-12
 Identities = 33/53 (62%), Positives = 38/53 (71%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           A DWR  GAVT +K+QG+CGSCWS +     EGQHF     LVSLSEQNL+DC
Sbjct: 121 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDC 173


>UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine
           proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like
           midgut cysteine proteinase - Tenebrio molitor (Yellow
           mealworm)
          Length = 330

 Score = 74.5 bits (175), Expect = 3e-12
 Identities = 34/64 (53%), Positives = 46/64 (71%)
 Frame = +3

Query: 477 HIAGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNL 656
           +++ ++  A +VDWR + AV+++KDQG+CGSCWS +     EGQ   Q G L SLSEQNL
Sbjct: 110 YVSSKKPLAASVDWRSN-AVSEVKDQGQCGSCWSFSTTGAVEGQLALQRGRLTSLSEQNL 168

Query: 657 IDCS 668
           IDCS
Sbjct: 169 IDCS 172



 Score = 49.2 bits (112), Expect = 1e-04
 Identities = 26/65 (40%), Positives = 39/65 (60%), Gaps = 1/65 (1%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMN-GFNKTAKHN 432
           E+  R  I+ ++   IA+HN K+E G V+Y   MN++GDM   EF+  +N G  +  KH 
Sbjct: 44  EEIRRQLIFKDNVAKIAEHNAKFEKGEVTYSKAMNQFGDMSKEEFLAYVNRGKAQKPKHP 103

Query: 433 KNLYM 447
           +NL M
Sbjct: 104 ENLRM 108



 Score = 33.9 bits (74), Expect = 4.8
 Identities = 21/47 (44%), Positives = 28/47 (59%), Gaps = 1/47 (2%)
 Frame = +2

Query: 578 FSTTGAL-GRTALPSVRLPGVALGAKPHRLLGAYGNNGCNGGLMDNA 715
           FSTTGA+ G+ AL   RL  ++          +YGN GC+GG MD+A
Sbjct: 143 FSTTGAVEGQLALQRGRLTSLS-EQNLIDCSSSYGNAGCDGGWMDSA 188


>UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin L
           preproprotein; n=1; Monodelphis domestica|Rep:
           PREDICTED: similar to cathepsin L preproprotein -
           Monodelphis domestica
          Length = 356

 Score = 73.7 bits (173), Expect = 5e-12
 Identities = 31/54 (57%), Positives = 42/54 (77%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           +VDWR HG VT I++QG+CG+CW+ + +   EGQ FR++G LV LS+Q LIDCS
Sbjct: 118 SVDWRTHGYVTPIRNQGECGACWAFSTIGSLEGQLFRKTGRLVELSKQMLIDCS 171



 Score = 45.6 bits (103), Expect = 0.001
 Identities = 18/50 (36%), Positives = 33/50 (66%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMN 405
           E++FR +++ ++  +I  HN+ ++ G  SY +GMN++GDM   EF   +N
Sbjct: 44  EESFRRQVWEKNLKLINDHNRLFKEGKKSYFMGMNQFGDMTDKEFESRLN 93


>UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Rep:
           Cathepsin L - Stylonychia lemnae
          Length = 340

 Score = 73.7 bits (173), Expect = 5e-12
 Identities = 33/89 (37%), Positives = 49/89 (55%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEHXXXX 686
           ++DWR+ GAV  +KDQG+CGSCW+ + +   E ++F ++G L SLSEQ L+DCS++    
Sbjct: 128 SIDWREKGAVNAVKDQGQCGSCWAFSTIASLESRYFIETGKLQSLSEQQLVDCSKNGNEG 187

Query: 687 XXXXXXXXXLQVHQGQRGDRHRADLPYEG 773
                    +       G     D PY G
Sbjct: 188 CNGGDMGLAMDYIASAGGVETEKDYPYVG 216


>UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=15;
           Magnoliophyta|Rep: Cysteine proteinase RD19a precursor -
           Arabidopsis thaliana (Mouse-ear cress)
          Length = 368

 Score = 73.7 bits (173), Expect = 5e-12
 Identities = 31/51 (60%), Positives = 37/51 (72%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           DWR HGAVT +K+QG CGSCWS +     EG +F  +G LVSLSEQ L+DC
Sbjct: 140 DWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDC 190


>UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin
           L-like protease; n=1; Nasonia vitripennis|Rep:
           PREDICTED: similar to cathepsin L-like protease -
           Nasonia vitripennis
          Length = 353

 Score = 73.3 bits (172), Expect = 6e-12
 Identities = 38/96 (39%), Positives = 53/96 (55%), Gaps = 2/96 (2%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQG-KCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS-EHXXX 683
           VDWR+ GAVT ++DQG  CGSCW+ +     E Q+F+++G L +LS QNLIDC+ E+   
Sbjct: 136 VDWRQRGAVTPVRDQGLTCGSCWAFSAAGALEAQYFKKTGVLTALSAQNLIDCTMEYGNL 195

Query: 684 XXXXXXXXXXLQVHQGQRGDRHRADLPYEGS*RPIP 791
                      Q    Q+G    A+  YEG  +  P
Sbjct: 196 GCGGGSAALSFQFVVDQKGLEPEANYSYEGRTKECP 231



 Score = 63.7 bits (148), Expect = 5e-09
 Identities = 31/85 (36%), Positives = 53/85 (62%), Gaps = 1/85 (1%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
           E+NFR  ++ E++  IA+HNQK+++GL +YK+ +N++GDM+  E+   M+  N T    K
Sbjct: 56  EENFRRSVFHENQRKIAEHNQKHDLGLFTYKVRINQFGDMMFEEYKNYMHAANNTITQLK 115

Query: 436 NLYMKGGSVRGAKFISPANVK-LPE 507
            +       RG +FI P + + +PE
Sbjct: 116 RI------PRGDEFIKPKSAENVPE 134


>UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|Rep:
           CG4847-PD, isoform D - Drosophila melanogaster (Fruit
           fly)
          Length = 420

 Score = 73.3 bits (172), Expect = 6e-12
 Identities = 31/53 (58%), Positives = 38/53 (71%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           A DWR+HG VT +K QG CGSCW+ A     EG  FR++G L +LSEQNL+DC
Sbjct: 206 AFDWREHGGVTPVKFQGTCGSCWAFATTGAIEGHTFRKTGSLPNLSEQNLVDC 258


>UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35;
           Viridiplantae|Rep: Cysteine proteinase 15A precursor -
           Pisum sativum (Garden pea)
          Length = 363

 Score = 73.3 bits (172), Expect = 6e-12
 Identities = 30/51 (58%), Positives = 37/51 (72%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           DWR+ GAVT +KDQG CGSCW+ +     EG H+  +G LVSLSEQ L+DC
Sbjct: 137 DWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDC 187


>UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean
           endopeptidase) (Cysteine proteinase)
           (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
           Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor
           (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase)
           (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
           Vignain-2] - Vigna mungo (Rice bean) (Black gram)
          Length = 362

 Score = 73.3 bits (172), Expect = 6e-12
 Identities = 31/55 (56%), Positives = 42/55 (76%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           +VDWRK GAVTD+KDQG+CGSCW+ + +   EG +  ++  LVSLSEQ L+DC +
Sbjct: 131 SVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDK 185


>UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4;
           core eudicotyledons|Rep: Papain-like cysteine peptidase
           XBCP3 - Arabidopsis thaliana (Mouse-ear cress)
          Length = 437

 Score = 72.9 bits (171), Expect = 8e-12
 Identities = 32/64 (50%), Positives = 43/64 (67%)
 Frame = +3

Query: 480 IAGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLI 659
           + G  +   +VDWRK GAVT++KDQG CG+CWS +     EG +   +G L+SLSEQ LI
Sbjct: 112 LGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELI 171

Query: 660 DCSE 671
           DC +
Sbjct: 172 DCDK 175


>UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep:
           Cysteine proteinase - Paragonimus westermani
          Length = 272

 Score = 72.9 bits (171), Expect = 8e-12
 Identities = 39/101 (38%), Positives = 53/101 (52%), Gaps = 1/101 (0%)
 Frame = +3

Query: 474 VHIAGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQN 653
           V   G + A   +DWR  GAVT +++QG CGSCW+ +     EGQ F ++G LVSLS+Q 
Sbjct: 46  VRPTGLKAAPERIDWRAKGAVTAVENQGSCGSCWAFSTAGNVEGQWFIKTGQLVSLSKQQ 105

Query: 654 LIDCSEHXXXXXXXXXXXXXLQV-HQGQRGDRHRADLPYEG 773
           L+DC                L++ H G  G   + D PY G
Sbjct: 106 LVDCDRAADGCNGGWPASSYLEIMHMG--GLESQDDYPYAG 144


>UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep:
           Cysteine protease - Clonorchis sinensis
          Length = 328

 Score = 72.9 bits (171), Expect = 8e-12
 Identities = 30/51 (58%), Positives = 40/51 (78%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           DWR+HGAV  + DQGKCGSCW+ + +   EGQ FR++G L++LSEQ L+DC
Sbjct: 120 DWREHGAVGPVLDQGKCGSCWAFSVIGNVEGQWFRKTGDLLALSEQQLVDC 170


>UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2;
           Dictyostelium discoideum|Rep: Cysteine proteinase 2
           precursor - Dictyostelium discoideum (Slime mold)
          Length = 376

 Score = 72.9 bits (171), Expect = 8e-12
 Identities = 32/54 (59%), Positives = 39/54 (72%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           ++DWR   AVT IKDQG+CGSCWS +     EG H  ++  LVSLSEQNL+DCS
Sbjct: 126 SIDWRTKNAVTPIKDQGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCS 179


>UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes
           vastus|Rep: Cathepsin L - Aphrocallistes vastus
          Length = 329

 Score = 71.7 bits (168), Expect = 2e-11
 Identities = 31/53 (58%), Positives = 38/53 (71%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           VDWR  G VT +K+QG+CGSCWS +     EGQ+  +SG LVS SEQ L+DCS
Sbjct: 119 VDWRSKGVVTPVKNQGQCGSCWSFSATGSLEGQYAIKSGKLVSFSEQELVDCS 171


>UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase
           precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin
           L-like cysteine proteinase precursor - Acanthoscelides
           obtectus (Bean weevil)
          Length = 321

 Score = 71.3 bits (167), Expect = 3e-11
 Identities = 30/53 (56%), Positives = 40/53 (75%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           VDWR+ GAVT++K QG CGSCW+ + +   EGQ F ++G L SLS QNL+DC+
Sbjct: 114 VDWREKGAVTEVKKQGNCGSCWAFSAVGSIEGQVFLKNGSLESLSAQNLVDCA 166



 Score = 40.7 bits (91), Expect = 0.041
 Identities = 15/49 (30%), Positives = 31/49 (63%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTM 402
           E+  R +I+  +   I +HN++Y  G  ++++G+N++GDM   EF + +
Sbjct: 39  EEKRRFEIFKFNLRTIEEHNERYHNGEETFEMGINQFGDMTQEEFKRML 87


>UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10;
           Dictyostelium discoideum|Rep: Cysteine proteinase 7
           precursor - Dictyostelium discoideum (Slime mold)
          Length = 460

 Score = 71.3 bits (167), Expect = 3e-11
 Identities = 34/60 (56%), Positives = 41/60 (68%), Gaps = 2/60 (3%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSG--YLVSLSEQNLIDCS 668
           +A+  VDWR  GAVT IK+QG+CG CWS +     EG  +  +G   LVSLSEQNLIDCS
Sbjct: 109 DASAQVDWRTQGAVTPIKNQGQCGGCWSFSTTGATEGAQYLANGKKNLVSLSEQNLIDCS 168


>UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate
           cathepsin L; n=4; Danio rerio|Rep: Novel protein similar
           to vertebrate cathepsin L - Danio rerio (Zebrafish)
           (Brachydanio rerio)
          Length = 334

 Score = 70.9 bits (166), Expect = 3e-11
 Identities = 30/53 (56%), Positives = 39/53 (73%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           +D+R  G VT++KDQG CGSCWS +     EGQ ++ +G LVSLSEQ L+DCS
Sbjct: 122 IDYRAKGYVTEVKDQGYCGSCWSFSTTGAIEGQMYKHTGRLVSLSEQQLVDCS 174



 Score = 39.9 bits (89), Expect = 0.072
 Identities = 23/84 (27%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
 Frame = +1

Query: 247 KRGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAK 426
           +  ED  R  I+  +   I K+N  +  GL  +K+ MNKYGD+   E+ + +    K   
Sbjct: 39  EESEDVHRKTIWETNMQKIWKNNNDFSFGLSMFKMAMNKYGDLTSVEYKRLLGSKIKGTG 98

Query: 427 HNKNLYMKGGSVR-GAKFISPANV 495
           + K        +R  AK +   N+
Sbjct: 99  NRKGKITSAQMLRLNAKRLGVTNI 122


>UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep:
           Toxopain-2 - Toxoplasma gondii
          Length = 422

 Score = 70.9 bits (166), Expect = 3e-11
 Identities = 31/58 (53%), Positives = 37/58 (63%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           E    VDWR  G VT +KDQ  CGSCW+ +     EG H  ++G LVSLSEQ L+DCS
Sbjct: 204 ELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCS 261


>UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine
           protease; n=1; Strongylocentrotus purpuratus|Rep:
           PREDICTED: similar to cysteine protease -
           Strongylocentrotus purpuratus
          Length = 494

 Score = 70.5 bits (165), Expect = 4e-11
 Identities = 29/53 (54%), Positives = 38/53 (71%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           DWR HGAVT +K+QG CGSCW+ + +   EGQ   + G L+SLSEQ L+DC +
Sbjct: 245 DWRTHGAVTPVKNQGMCGSCWAFSAIGNMEGQWQIKKGELISLSEQELVDCDK 297


>UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|Rep:
           Cathepsin L precursor - Schistosoma mansoni (Blood
           fluke)
          Length = 319

 Score = 70.5 bits (165), Expect = 4e-11
 Identities = 29/51 (56%), Positives = 39/51 (76%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           DWR+ GAVT++K+QG CGSCW+ +     E Q FR++G L+SLSEQ L+DC
Sbjct: 110 DWREKGAVTEVKNQGMCGSCWAFSTTGNVESQWFRKTGKLLSLSEQQLVDC 160


>UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16;
           Bromeliaceae|Rep: Fruit bromelain precursor - Ananas
           comosus (Pineapple)
          Length = 351

 Score = 70.5 bits (165), Expect = 4e-11
 Identities = 28/54 (51%), Positives = 41/54 (75%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           ++DWR +GAV ++K+Q  CGSCWS A +   EG +  ++GYLVSLSEQ ++DC+
Sbjct: 126 SIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCA 179


>UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3;
           Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays
           (Maize)
          Length = 493

 Score = 70.1 bits (164), Expect = 6e-11
 Identities = 31/64 (48%), Positives = 45/64 (70%)
 Frame = +3

Query: 480 IAGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLI 659
           +AG+ +   AVDWR+ GAV ++KDQG+CG CW+ + +   EG +   +G L+SLSEQ LI
Sbjct: 159 LAGE-QLPDAVDWRERGAVAEVKDQGQCGGCWAFSAVAAVEGINKIVTGSLISLSEQELI 217

Query: 660 DCSE 671
           DC +
Sbjct: 218 DCDK 221


>UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1;
           Naegleria fowleri|Rep: Cysteine proteinase homolog -
           Naegleria fowleri
          Length = 347

 Score = 70.1 bits (164), Expect = 6e-11
 Identities = 31/59 (52%), Positives = 40/59 (67%)
 Frame = +3

Query: 498 AAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
           A  + DWR+HGAVT +K+QG CGSCW+ +     EGQ   + G LVSLSEQ L+DC  +
Sbjct: 122 APTSFDWRQHGAVTRVKNQGACGSCWTFSTTGNVEGQWAIKKGKLVSLSEQQLVDCDHN 180



 Score = 33.9 bits (74), Expect = 4.8
 Identities = 13/23 (56%), Positives = 17/23 (73%)
 Frame = +1

Query: 715 FKYIKDNGGIDTEQTYLTRGVDD 783
           F+Y+  NGG+DTE +Y   GVDD
Sbjct: 203 FQYVIKNGGLDTEDSYPYEGVDD 225


>UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor;
           n=35; Fasciola|Rep: Cathepsin L-like proteinase
           precursor - Fasciola hepatica (Liver fluke)
          Length = 326

 Score = 70.1 bits (164), Expect = 6e-11
 Identities = 28/62 (45%), Positives = 38/62 (61%)
 Frame = +3

Query: 483 AGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLID 662
           A  R     +DWR+ G VT++KDQG CGSCW+ +     EGQ+ +     +S SEQ L+D
Sbjct: 103 ANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVD 162

Query: 663 CS 668
           CS
Sbjct: 163 CS 164



 Score = 43.2 bits (97), Expect = 0.008
 Identities = 17/45 (37%), Positives = 30/45 (66%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEF 390
           +D  R  I+ ++   I +HN ++++GLV+Y LG+N++ DM   EF
Sbjct: 36  DDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEF 80



 Score = 34.7 bits (76), Expect = 2.7
 Identities = 19/49 (38%), Positives = 29/49 (59%), Gaps = 3/49 (6%)
 Frame = +2

Query: 578 FSTTGALGRTALPSVRLPGVALGAKPHRLL---GAYGNNGCNGGLMDNA 715
           FSTTG +    + + R    ++     +L+   G +GNNGC+GGLM+NA
Sbjct: 135 FSTTGTMEGQYMKNER---TSISFSEQQLVDCSGPWGNNGCSGGLMENA 180


>UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza
           sativa|Rep: Os09g0497500 protein - Oryza sativa subsp.
           japonica (Rice)
          Length = 349

 Score = 69.7 bits (163), Expect = 8e-11
 Identities = 29/55 (52%), Positives = 41/55 (74%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           +VDWRK GAV ++K+QG CGSCW+ + +   EG +  ++G LVSLSEQ L+DC +
Sbjct: 125 SVDWRKKGAVVEVKNQGDCGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDD 179


>UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1;
           Rhipicephalus appendiculatus|Rep: Midgut cysteine
           proteinase 4 - Rhipicephalus appendiculatus (Brown ear
           tick)
          Length = 345

 Score = 69.7 bits (163), Expect = 8e-11
 Identities = 27/53 (50%), Positives = 42/53 (79%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           ++WR++G VT +K+QG+CGSCW+ +     EGQ F+++  L+SLSEQNL+DC+
Sbjct: 130 IEWRENGFVTPVKNQGQCGSCWAFSSTGALEGQVFKRTRRLISLSEQNLMDCA 182


>UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326;
           n=2; Danio rerio|Rep: hypothetical protein LOC550326 -
           Danio rerio
          Length = 531

 Score = 69.3 bits (162), Expect = 1e-10
 Identities = 30/54 (55%), Positives = 38/54 (70%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           +VDWR +GAVT +KDQ  CGSCWS A     EG  F ++G L SLS+Q L+DC+
Sbjct: 315 SVDWRLYGAVTPVKDQAVCGSCWSFATTGTLEGALFLKTGQLTSLSQQMLVDCT 368


>UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis
           thaliana|Rep: F9P14.12 protein - Arabidopsis thaliana
           (Mouse-ear cress)
          Length = 343

 Score = 69.3 bits (162), Expect = 1e-10
 Identities = 31/53 (58%), Positives = 39/53 (73%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           AVDWR  GAVT I++QGKCG CW+ + +   EG +  ++G LVSLSEQ LIDC
Sbjct: 130 AVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDC 182


>UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24;
           Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa
           (Rice)
          Length = 339

 Score = 69.3 bits (162), Expect = 1e-10
 Identities = 30/55 (54%), Positives = 37/55 (67%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
           VDWR  GAVT IKDQG+CG CW+ + +   EG     +G L+SLSEQ L+DC  H
Sbjct: 127 VDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVH 181


>UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh
           fly) (Boettcherisca peregrina). Cathepsin L; n=2;
           Dictyostelium discoideum|Rep: Similar to Sarcophaga
           peregrina (Flesh fly) (Boettcherisca peregrina).
           Cathepsin L - Dictyostelium discoideum (Slime mold)
          Length = 265

 Score = 69.3 bits (162), Expect = 1e-10
 Identities = 28/52 (53%), Positives = 37/52 (71%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           DWR HGAV  +K+QG C SCWS + L   EG ++ + G L+ LSEQNL+DC+
Sbjct: 52  DWRDHGAVGKVKNQGSCASCWSFSALGALEGHYYIKYGELLDLSEQNLVDCA 103


>UniRef50_Q22LI1 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 987

 Score = 68.9 bits (161), Expect = 1e-10
 Identities = 28/52 (53%), Positives = 36/52 (69%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           +DWR  GAVT +K QGKCGSCWS +   L E   + ++G L+ LSEQ L+DC
Sbjct: 127 IDWRNKGAVTSVKRQGKCGSCWSFSAAGLMEAFQYFKTGNLIDLSEQQLVDC 178


>UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes
           abbreviatus|Rep: Cathepsin L 2 - Diaprepes abbreviatus
           (Sugarcane rootstalk borer weevil)
          Length = 348

 Score = 68.9 bits (161), Expect = 1e-10
 Identities = 28/53 (52%), Positives = 38/53 (71%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           +DWR+ GAVT +K+Q  CGSCWS +     E Q F+++  L+SLSEQ L+DCS
Sbjct: 139 IDWRQKGAVTPVKNQRNCGSCWSFSATGALEAQWFKKTNKLISLSEQQLVDCS 191



 Score = 55.6 bits (128), Expect = 1e-06
 Identities = 51/182 (28%), Positives = 73/182 (40%), Gaps = 13/182 (7%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
           E+ +R  ++ E+   I +HN+ YEMGL SY++ MN  GD+   EF++           ++
Sbjct: 44  ENEYRQSVFMENLFQINEHNKLYEMGLSSYQMAMNHLGDLTKDEFMRIYTVNMPQLPQSE 103

Query: 436 NLYMKGGSVRGAKFISP-ANVKLPERWTGGSTAPSPTSRTKGSVAHAGLQHD----WSFG 600
           NL      +   + +       LP              R KG+V     Q +    WSF 
Sbjct: 104 NLSDSEPWLDLPQDLQGFVTYALPTNLDEVDLPTDIDWRQKGAVTPVKNQRNCGSCWSF- 162

Query: 601 KDSTSVSPATWCXXXXXXXXXXXXXXEQRLQR----GAHG----QRFKYIKDNGGIDTEQ 756
             +T    A W                    R    G HG      F YIK+NGGIDTEQ
Sbjct: 163 -SATGALEAQWFKKTNKLISLSEQQLVDCSGRYGNHGCHGGWMHWAFGYIKENGGIDTEQ 221

Query: 757 TY 762
           +Y
Sbjct: 222 SY 223


>UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61;
           Leishmania|Rep: Cysteine proteinase 2 precursor -
           Leishmania pifanoi
          Length = 444

 Score = 68.9 bits (161), Expect = 1e-10
 Identities = 30/55 (54%), Positives = 38/55 (69%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           AVDWR+ GAVT +KDQG CGSCW+ + +   EGQ +     LVSLSEQ L+ C +
Sbjct: 129 AVDWREKGAVTPVKDQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDD 183


>UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19;
           Bilateria|Rep: Cathepsin F precursor - Homo sapiens
           (Human)
          Length = 484

 Score = 68.9 bits (161), Expect = 1e-10
 Identities = 30/53 (56%), Positives = 36/53 (67%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           DWR  GAVT +KDQG CGSCW+ +     EGQ F   G L+SLSEQ L+DC +
Sbjct: 276 DWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK 328


>UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6;
           Magnoliophyta|Rep: Cysteine protease Cp5 - Actinidia
           deliciosa (Kiwi)
          Length = 509

 Score = 68.5 bits (160), Expect = 2e-10
 Identities = 41/118 (34%), Positives = 58/118 (49%), Gaps = 4/118 (3%)
 Frame = +3

Query: 324 RNGPRFLQAGHEQVRRHAPPRVREDYERLQQNCQTQQESVHEGWERPRG*VHIAGQREAA 503
           +NG R    GH            E++  +  +   +  S     ER R     A +  AA
Sbjct: 85  KNGERGASGGHLVGLNKFADMSNEEFREVYVSKVKKPTSKRMAIERRRQGKAAAAKAVAA 144

Query: 504 ----GAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
                ++DWRK+G VT +KDQG CGSCW+ +     EG +   +G L+SLSEQ L+DC
Sbjct: 145 CDGPTSLDWRKYGIVTGVKDQGDCGSCWAFSSTGAIEGINALANGDLISLSEQELVDC 202


>UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gambiae
           str. PEST|Rep: ENSANGP00000018713 - Anopheles gambiae
           str. PEST
          Length = 559

 Score = 68.5 bits (160), Expect = 2e-10
 Identities = 30/64 (46%), Positives = 42/64 (65%)
 Frame = +3

Query: 480 IAGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLI 659
           +AG  +   + DWR HGAVT++K+QG CGSCW+ + +   EG H  ++  L S SEQ LI
Sbjct: 333 VAGVGDLPRSFDWRDHGAVTEVKNQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELI 392

Query: 660 DCSE 671
           DC +
Sbjct: 393 DCDK 396


>UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin
           heavy chain; n=3; Amniota|Rep: PREDICTED: similar to
           ferritin heavy chain - Ornithorhynchus anatinus
          Length = 338

 Score = 68.1 bits (159), Expect = 2e-10
 Identities = 30/58 (51%), Positives = 38/58 (65%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           E    VDWR  G VT +K+QG CGSCW+ +     E   F+ +G +VSLSEQNL+DCS
Sbjct: 119 EGPEEVDWRTKGYVTPVKNQGLCGSCWAFSATGALEALVFKTTGKMVSLSEQNLVDCS 176



 Score = 51.6 bits (118), Expect = 2e-05
 Identities = 47/183 (25%), Positives = 71/183 (38%), Gaps = 7/183 (3%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
           E+ FR   + ++  +I +HN++   G  SY+L MN +GD  + E  + +NGF        
Sbjct: 44  EEVFRRAAWEKNVRVIERHNEEMSQGKHSYRLAMNHFGDQTNEELHERLNGFRPDL---- 99

Query: 436 NLYMKGGSVRGAKFISPANVKLPERWTGGSTAPSPTSRTKGSVAHAGLQHD-WSF---GK 603
                GG++R  +    A  +    W G       T      V + GL    W+F   G 
Sbjct: 100 -----GGALRSGR--EQARFRSKTSWEGPEEVDWRTKGYVTPVKNQGLCGSCWAFSATGA 152

Query: 604 DSTSVSPATWCXXXXXXXXXXXXXXEQ---RLQRGAHGQRFKYIKDNGGIDTEQTYLTRG 774
               V   T                 Q     + G +   F+Y++ NGGID E  Y   G
Sbjct: 153 LEALVFKTTGKMVSLSEQNLVDCSWRQGNVGCRGGQYIGAFEYVRANGGIDAEDLYPYLG 212

Query: 775 VDD 783
            DD
Sbjct: 213 RDD 215


>UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome
           shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
           Chromosome 21 SCAF14577, whole genome shotgun sequence -
           Tetraodon nigroviridis (Green puffer)
          Length = 406

 Score = 68.1 bits (159), Expect = 2e-10
 Identities = 29/58 (50%), Positives = 41/58 (70%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           E   +VDWRK G V+ +++QG C SCW+ + L   EGQ  +++G+LV LS QNL+DCS
Sbjct: 154 ETPPSVDWRKAGLVSPVQNQGFCNSCWAFSSLGALEGQMKKRTGFLVPLSPQNLLDCS 211


>UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza
           sativa|Rep: OSJNBb0085F13.15 protein - Oryza sativa
           subsp. japonica (Rice)
          Length = 383

 Score = 68.1 bits (159), Expect = 2e-10
 Identities = 32/65 (49%), Positives = 42/65 (64%), Gaps = 2/65 (3%)
 Frame = +3

Query: 483 AGQREAA--GAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNL 656
           AG+R  A   +VDWRK GAVT  K QG+C +CW+ A +   E  H  + G L+SLSEQ L
Sbjct: 153 AGRRTVAVPESVDWRKEGAVTPAKHQGQCAACWAFAAVAAIESLHKIKGGDLISLSEQEL 212

Query: 657 IDCSE 671
           +DC +
Sbjct: 213 VDCDD 217


>UniRef50_Q23H15 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 370

 Score = 68.1 bits (159), Expect = 2e-10
 Identities = 29/55 (52%), Positives = 37/55 (67%)
 Frame = +3

Query: 501 AGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           A ++DWR  GAVT +K+QG CGSCWS +   L E  +F Q+  LV  SEQ L+DC
Sbjct: 163 AASIDWRTKGAVTSVKNQGNCGSCWSFSAAGLMESFNFIQNKALVDFSEQQLLDC 217


>UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:
           Cathepsin - Petromyzon marinus (Sea lamprey)
          Length = 333

 Score = 67.7 bits (158), Expect = 3e-10
 Identities = 29/54 (53%), Positives = 37/54 (68%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           VDWR  G VT +K+QG CGS W+ +     EGQHF  +G L SLSEQ L+DC++
Sbjct: 121 VDWRLKGYVTPVKEQGLCGSSWAFSATGSLEGQHFAATGNLTSLSEQQLVDCTK 174



 Score = 38.3 bits (85), Expect = 0.22
 Identities = 17/51 (33%), Positives = 30/51 (58%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNG 408
           ED  R  ++ ++   + +HN   + G VS+ LG+NKY D+  HE+ + + G
Sbjct: 43  EDAHRRDVFEQNLKRVLQHNLLADEGNVSFHLGINKYSDLELHEYHEKVVG 93


>UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3;
           Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber
           officinale (Ginger)
          Length = 475

 Score = 67.7 bits (158), Expect = 3e-10
 Identities = 28/54 (51%), Positives = 40/54 (74%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           ++DWR+ GAV  +K+QG+CGSCW+ A +   EG +   +G L+SLSEQ L+DCS
Sbjct: 146 SIDWREKGAVVAVKNQGRCGSCWAFAAIAAVEGINQIVTGDLISLSEQQLVDCS 199



 Score = 38.7 bits (86), Expect = 0.17
 Identities = 12/43 (27%), Positives = 29/43 (67%)
 Frame = +1

Query: 262 NFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEF 390
           ++R++++ E+   + +HN   + G  +Y+LGMN++ D+ + E+
Sbjct: 70  DYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNRFADLTNEEY 112


>UniRef50_Q22W19 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 332

 Score = 67.7 bits (158), Expect = 3e-10
 Identities = 34/93 (36%), Positives = 50/93 (53%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
           + A ++DWR+  AVT +K+QG+CGSCW+ + +   EG +   +G L S SEQ ++DCS+ 
Sbjct: 122 DVAPSIDWRQKNAVTPVKNQGQCGSCWAFSTVGGLEGAYAIATGNLTSFSEQQIVDCSKA 181

Query: 675 XXXXXXXXXXXXXLQVHQGQRGDRHRADLPYEG 773
                          V   Q G    AD PY+G
Sbjct: 182 NAGCNGGDLPPAYKYV--VQNGIETEADYPYKG 212


>UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, whole
           genome shotgun sequence; n=7; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_22,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 350

 Score = 67.7 bits (158), Expect = 3e-10
 Identities = 29/56 (51%), Positives = 38/56 (67%)
 Frame = +3

Query: 501 AGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           A  +DWR  GAV  +KDQG+CGSCW+ +   + EG +  Q+G L  LSEQ L+DCS
Sbjct: 143 ATPIDWRTRGAVNKVKDQGQCGSCWAFSTTGVLEGFYKVQTGELPDLSEQQLVDCS 198


>UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor;
           n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 355

 Score = 67.7 bits (158), Expect = 3e-10
 Identities = 30/53 (56%), Positives = 38/53 (71%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           +VDWRK GAV  +KDQG+CGSCW+ + +   EG +   +G L SLSEQ LIDC
Sbjct: 140 SVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDC 192


>UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease;
           n=23; Magnoliophyta|Rep: Senescence-specific cysteine
           protease - Arabidopsis thaliana (Mouse-ear cress)
          Length = 346

 Score = 67.3 bits (157), Expect = 4e-10
 Identities = 29/53 (54%), Positives = 37/53 (69%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           +VDWRK GAVT IK+QG CG CW+ + +   EG    + G L+SLSEQ L+DC
Sbjct: 133 SVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDC 185


>UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10;
           Liliopsida|Rep: Putative cysteine proteinase - Oryza
           sativa subsp. japonica (Rice)
          Length = 416

 Score = 67.3 bits (157), Expect = 4e-10
 Identities = 28/52 (53%), Positives = 39/52 (75%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           DWR +GAVTD+KDQG+CGSCW  + +   EG +   +G L++LSEQ ++DCS
Sbjct: 119 DWRLNGAVTDVKDQGQCGSCWVFSAVGAVEGINAIMTGNLLTLSEQQVLDCS 170


>UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor;
           n=176; Viridiplantae|Rep: Cysteine proteinase RD21a
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 462

 Score = 67.3 bits (157), Expect = 4e-10
 Identities = 28/57 (49%), Positives = 40/57 (70%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           E   ++DWRK GAV ++KDQG CGSCW+ + +   EG +   +G L++LSEQ L+DC
Sbjct: 136 ELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDC 192


>UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F
           like protease; n=1; Nasonia vitripennis|Rep: PREDICTED:
           similar to cathepsin F like protease - Nasonia
           vitripennis
          Length = 1036

 Score = 66.9 bits (156), Expect = 6e-10
 Identities = 28/53 (52%), Positives = 36/53 (67%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           DWR H  VT +KDQG CGSCW+ +     EGQ+  + G L+SLSEQ L+DC +
Sbjct: 822 DWRHHNVVTPVKDQGSCGSCWAFSVTGNIEGQYAIKHGELLSLSEQELVDCDK 874


>UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin S
           preproprotein; n=2; Tribolium castaneum|Rep: PREDICTED:
           similar to cathepsin S preproprotein - Tribolium
           castaneum
          Length = 525

 Score = 66.9 bits (156), Expect = 6e-10
 Identities = 30/59 (50%), Positives = 36/59 (61%)
 Frame = +3

Query: 489 QREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           Q +    VDWR  G VT +K QGKCGSCW+ A L   E  + +Q G  V LSEQ L+DC
Sbjct: 32  QSDLPDMVDWRLQGVVTPVKRQGKCGSCWAFAILGATEAHYRKQRGSFVILSEQQLVDC 90



 Score = 62.5 bits (145), Expect = 1e-08
 Identities = 27/52 (51%), Positives = 33/52 (63%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           VDWR  G VT +K QGKCG+CW+ A +   E Q+    G  V LSEQ L+DC
Sbjct: 315 VDWRLRGVVTPVKHQGKCGTCWAFAIIGATEAQYRIHRGSFVILSEQQLVDC 366



 Score = 34.7 bits (76), Expect = 2.7
 Identities = 16/44 (36%), Positives = 23/44 (52%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 387
           E+NFR  I+ +    I  HN++Y  GL +Y L +N   D    E
Sbjct: 241 EENFRRAIFEKTFQEIKHHNERYRKGLETYYLRINDLSDYTDEE 284


>UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2;
           Taeniidae|Rep: Cathepsin L-like cysteine proteinase -
           Taenia solium (Pork tapeworm)
          Length = 339

 Score = 66.9 bits (156), Expect = 6e-10
 Identities = 27/53 (50%), Positives = 37/53 (69%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           VDWR    VT++K+QG CGSCW+ +     EG   +++G L+SLSEQ L+DCS
Sbjct: 128 VDWRDKNLVTEVKNQGNCGSCWAFSSTGALEGAFAKKTGKLISLSEQQLVDCS 180


>UniRef50_Q239L8 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 323

 Score = 66.9 bits (156), Expect = 6e-10
 Identities = 30/59 (50%), Positives = 38/59 (64%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           +A   +DW   GAVT +KDQG+CGSCWS +     EG  F  +  L SLSEQ L+DCS+
Sbjct: 122 DAGVEIDWTTKGAVTPVKDQGQCGSCWSFSTTGAVEGALFLSTKKLTSLSEQYLVDCSK 180


>UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 383

 Score = 66.9 bits (156), Expect = 6e-10
 Identities = 27/53 (50%), Positives = 39/53 (73%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           ++DWR+ G +T IK+QG+CGSCW+ A +   E Q+  + G LVSLSEQ ++DC
Sbjct: 171 SIDWREQGKLTPIKNQGQCGSCWAFATVASVEAQNAIKKGKLVSLSEQEMVDC 223


>UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core
           eudicotyledons|Rep: Chymopapain precursor - Carica
           papaya (Papaya)
          Length = 352

 Score = 66.9 bits (156), Expect = 6e-10
 Identities = 27/56 (48%), Positives = 39/56 (69%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
           ++DWR  GAVT +K+QG CGSCW+ + +   EG +   +G L+ LSEQ L+DC +H
Sbjct: 138 SIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH 193


>UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 336

 Score = 66.5 bits (155), Expect = 7e-10
 Identities = 25/64 (39%), Positives = 41/64 (64%)
 Frame = +3

Query: 477 HIAGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNL 656
           H A   +   + DWR +G ++D+KDQG+CGSCW+ +   + E  +F ++   +S SEQ L
Sbjct: 118 HTAQDVQLPASFDWRDYGILSDVKDQGQCGSCWAFSTTGILEALYFMENRQKISFSEQQL 177

Query: 657 IDCS 668
           +DC+
Sbjct: 178 VDCA 181


>UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryza
           sativa|Rep: Putative cysteine proteinase - Oryza sativa
           subsp. japonica (Rice)
          Length = 352

 Score = 66.5 bits (155), Expect = 7e-10
 Identities = 28/55 (50%), Positives = 39/55 (70%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
           VDWR+ GAVT +K+Q  CG CW+ + +   EG H   +G LVSLSEQ L+DC+++
Sbjct: 133 VDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLSEQQLLDCADN 187


>UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep:
           Cysteine proteinase - Cryptobia salmositica
          Length = 443

 Score = 66.5 bits (155), Expect = 7e-10
 Identities = 28/52 (53%), Positives = 36/52 (69%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           +DWR  GAVT +K+QG CGSCWS +     EGQH   +G LV++SEQ L+ C
Sbjct: 118 IDWRLKGAVTPVKNQGACGSCWSFSTTGNIEGQHAIATGQLVAVSEQELVSC 169


>UniRef50_Q23H10 Cluster: Papain family cysteine protease containing
           protein; n=14; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 336

 Score = 66.5 bits (155), Expect = 7e-10
 Identities = 28/55 (50%), Positives = 37/55 (67%)
 Frame = +3

Query: 501 AGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           A ++DWR  GAVT +K+QG CGSCWS +   + E  +F Q+  LV  SEQ L+DC
Sbjct: 128 ADSIDWRTKGAVTSVKNQGGCGSCWSFSAAAVMESFNFIQNKALVDFSEQQLVDC 182


>UniRef50_P25774 Cluster: Cathepsin S precursor; n=78;
           Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens
           (Human)
          Length = 331

 Score = 66.5 bits (155), Expect = 7e-10
 Identities = 28/54 (51%), Positives = 39/54 (72%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           +VDWR+ G VT++K QG CG+CW+ + +   E Q   ++G LVSLS QNL+DCS
Sbjct: 118 SVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCS 171



 Score = 48.0 bits (109), Expect = 3e-04
 Identities = 48/192 (25%), Positives = 81/192 (42%), Gaps = 7/192 (3%)
 Frame = +1

Query: 238 QLRKRGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNK 417
           Q +++ E+  R  I+ ++   +  HN ++ MG+ SY LGMN  GDM   E +  M+    
Sbjct: 38  QYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRV 97

Query: 418 TAKHNKNLYMKGGSVRGAKFISPANVKLPERWTGGSTAPSPTSRTKGSVAHAGLQHDWSF 597
            ++  +N+  K    R    I P +V   E+  G  T      + +GS         W+F
Sbjct: 98  PSQWQRNITYKSNPNR----ILPDSVDWREK--GCVT----EVKYQGSCGAC-----WAF 142

Query: 598 ---GKDSTSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHG----QRFKYIKDNGGIDTEQ 756
              G     +   T                E+   +G +G      F+YI DN GID++ 
Sbjct: 143 SAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDA 202

Query: 757 TYLTRGVDDQFQ 792
           +Y  + +D + Q
Sbjct: 203 SYPYKAMDQKCQ 214


>UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome
           shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
           Chromosome 21 SCAF14577, whole genome shotgun sequence -
           Tetraodon nigroviridis (Green puffer)
          Length = 478

 Score = 66.1 bits (154), Expect = 1e-09
 Identities = 31/58 (53%), Positives = 38/58 (65%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           E   ++DWR +GAVT +KDQ  CGSCWS A     EG  F ++G L  LS+Q LIDCS
Sbjct: 204 EVPESLDWRLYGAVTPVKDQAICGSCWSFATTGTIEGALFLKTGSLQVLSQQMLIDCS 261


>UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase
           precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine
           proteinase precursor - Heterodera glycines (Soybean cyst
           nematode worm)
          Length = 353

 Score = 66.1 bits (154), Expect = 1e-09
 Identities = 28/54 (51%), Positives = 40/54 (74%), Gaps = 1/54 (1%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQ-HFRQSGYLVSLSEQNLIDCS 668
           +DWR+ GAVT++KDQG CGSCW+ +     EG    +++  ++SLSEQNL+DCS
Sbjct: 139 LDWREKGAVTEVKDQGDCGSCWAFSATGAIEGALAQKKASKIISLSEQNLVDCS 192


>UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 392

 Score = 66.1 bits (154), Expect = 1e-09
 Identities = 28/58 (48%), Positives = 36/58 (62%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           E    +DWR +GAV   K QG CGSCW+ A     E  HF Q G L++L+EQ L+DC+
Sbjct: 176 EVPDQLDWRNYGAVNPAKGQGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCT 233


>UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep:
           Cathepsin K - Danio rerio (Zebrafish) (Brachydanio
           rerio)
          Length = 333

 Score = 65.7 bits (153), Expect = 1e-09
 Identities = 27/53 (50%), Positives = 37/53 (69%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           ++D+RK G VT +K+QG CGSCW+ + +   EGQ  +  G LV LS QNL+DC
Sbjct: 121 SIDYRKLGYVTSVKNQGSCGSCWAFSSVGALEGQLMKTKGQLVDLSPQNLVDC 173



 Score = 45.6 bits (103), Expect = 0.001
 Identities = 19/51 (37%), Positives = 32/51 (62%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNG 408
           E++ R  I+ ++   I  HN++YE+G+ +Y LGMN +GDM   E  + + G
Sbjct: 46  EESIRRTIWEKNMLFIEAHNKEYELGIHTYDLGMNHFGDMTLEEVAEKVMG 96


>UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza
           sativa|Rep: Cysteine proteinase-like - Oryza sativa
           subsp. japonica (Rice)
          Length = 360

 Score = 65.7 bits (153), Expect = 1e-09
 Identities = 29/62 (46%), Positives = 40/62 (64%)
 Frame = +3

Query: 483 AGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLID 662
           A   +   +VDWR  GAVT++K+Q  CGSCW+ A +   EG     +G LVSLSEQ ++D
Sbjct: 132 ADDTDVPDSVDWRARGAVTEVKNQRSCGSCWAFAAVAATEGLVQLATGNLVSLSEQQVLD 191

Query: 663 CS 668
           C+
Sbjct: 192 CT 193


>UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core
           eudicotyledons|Rep: Cysteine proteinase -
           Mesembryanthemum crystallinum (Common ice plant)
          Length = 367

 Score = 65.7 bits (153), Expect = 1e-09
 Identities = 28/57 (49%), Positives = 38/57 (66%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           E   ++DWR  GAVT +K+QG+CG CW+ +     EG +   +G L+SLSEQ LIDC
Sbjct: 125 EVPRSIDWRVKGAVTPVKNQGRCGGCWAFSAAAAVEGINQITTGQLISLSEQQLIDC 181


>UniRef50_Q24E33 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 328

 Score = 65.7 bits (153), Expect = 1e-09
 Identities = 26/53 (49%), Positives = 36/53 (67%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           V+W   GAVT +K+QG CGSCW+ +     EG +F ++  L+S SEQ L+DCS
Sbjct: 131 VNWTAQGAVTPVKNQGSCGSCWAFSTTGALEGSYFLKNNQLISFSEQQLVDCS 183


>UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46;
           Eukaryota|Rep: Cysteine proteinase 1 precursor - Zea
           mays (Maize)
          Length = 371

 Score = 65.7 bits (153), Expect = 1e-09
 Identities = 27/51 (52%), Positives = 33/51 (64%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           DWR HGAV  +K+QG CGSCWS +     EG H+  +G L  LSEQ  +DC
Sbjct: 142 DWRDHGAVGPVKNQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDC 192


>UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza
           sativa|Rep: Os01g0347600 protein - Oryza sativa subsp.
           japonica (Rice)
          Length = 343

 Score = 65.3 bits (152), Expect = 2e-09
 Identities = 28/52 (53%), Positives = 35/52 (67%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           +DWR  GAVT +KDQG CGSCW+ A +   EG    ++G L  LSEQ L+DC
Sbjct: 129 IDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDC 180


>UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1;
           Dictyostelium discoideum AX4|Rep: Counting factor
           associated protein - Dictyostelium discoideum AX4
          Length = 531

 Score = 65.3 bits (152), Expect = 2e-09
 Identities = 29/59 (49%), Positives = 35/59 (59%)
 Frame = +3

Query: 492 REAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           R     VDWR    VT +KDQG CGSCW+       EG +   +G LVSLSEQ L+DC+
Sbjct: 307 RSIPSTVDWRNQNCVTPVKDQGICGSCWTFGSTGSLEGTNCVTNGELVSLSEQQLVDCA 365


>UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18;
           Schistosoma|Rep: Preprocathepsin cathepsin L -
           Schistosoma japonicum (Blood fluke)
          Length = 331

 Score = 65.3 bits (152), Expect = 2e-09
 Identities = 29/51 (56%), Positives = 34/51 (66%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           DWR HGAVT +K QG CGSCW+ +     EGQ  R+   LV LSEQ L+DC
Sbjct: 121 DWRDHGAVTAVKHQGLCGSCWAFSATGAIEGQLRRKHKKLVKLSEQQLVDC 171



 Score = 33.1 bits (72), Expect = 8.3
 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 1/50 (2%)
 Frame = +1

Query: 256 EDNFRMK-IYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTM 402
           +D  R K I+      I +HN ++++GL  Y +G+N++ DM   E  + M
Sbjct: 42  DDEMRRKMIFMRRIGKIQEHNLRHDLGLEGYTMGLNQFCDMEWEEVNRIM 91


>UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheirus
           salmonis|Rep: Putative cathepsin L - Lepeophtheirus
           salmonis (salmon louse)
          Length = 257

 Score = 65.3 bits (152), Expect = 2e-09
 Identities = 28/53 (52%), Positives = 38/53 (71%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           V+W K+GAVT +KDQ  CGSCW+ +     EGQ+F ++  L+S SEQ L+DCS
Sbjct: 42  VNWTKNGAVTAVKDQKDCGSCWAFSTTGSVEGQYFIKNKKLLSFSEQQLVDCS 94


>UniRef50_Q22RR8 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 332

 Score = 64.9 bits (151), Expect = 2e-09
 Identities = 28/53 (52%), Positives = 35/53 (66%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           +DW K GAVT +KDQ +CGSCW+ +     E   F  +G L SLSEQ L+DCS
Sbjct: 129 IDWTKKGAVTPVKDQEQCGSCWAFSATGALESATFISTGTLPSLSEQELVDCS 181


>UniRef50_O16454 Cluster: Temporarily assigned gene name protein
           196; n=4; Bilateria|Rep: Temporarily assigned gene name
           protein 196 - Caenorhabditis elegans
          Length = 477

 Score = 64.5 bits (150), Expect = 3e-09
 Identities = 28/51 (54%), Positives = 34/51 (66%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           DWR+ GAVT +K+QG CGSCW+ +     EG  F     LVSLSEQ L+DC
Sbjct: 269 DWREKGAVTQVKNQGNCGSCWAFSTTGNVEGAWFIAKNKLVSLSEQELVDC 319


>UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue of
           Sarcophaga 26,29kDa proteinase; n=1; Nasonia
           vitripennis|Rep: PREDICTED: similar to homologue of
           Sarcophaga 26,29kDa proteinase - Nasonia vitripennis
          Length = 553

 Score = 64.1 bits (149), Expect = 4e-09
 Identities = 29/52 (55%), Positives = 34/52 (65%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           DWR +GAVT +KDQ  CGSCWS       EG +F +   LV LS+Q LIDCS
Sbjct: 339 DWRLYGAVTPVKDQSVCGSCWSFGTTGAVEGAYFMKYKKLVRLSQQALIDCS 390


>UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-PA
           - Drosophila melanogaster (Fruit fly)
          Length = 549

 Score = 64.1 bits (149), Expect = 4e-09
 Identities = 30/53 (56%), Positives = 35/53 (66%), Gaps = 1/53 (1%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHF-RQSGYLVSLSEQNLIDCS 668
           DWR +GAVT +KDQ  CGSCWS   +   EG  F +  G LV LS+Q LIDCS
Sbjct: 335 DWRLYGAVTPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDCS 387


>UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           L-like proteinase precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 317

 Score = 64.1 bits (149), Expect = 4e-09
 Identities = 26/54 (48%), Positives = 36/54 (66%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           ++DWR+ GAV  ++DQ +CGSCW+ +     EGQ F + G L  LS Q L+DCS
Sbjct: 107 SIDWREKGAVNPVRDQEQCGSCWAFSAAGALEGQRFLKEGKLEVLSTQQLVDCS 160



 Score = 42.7 bits (96), Expect = 0.010
 Identities = 16/45 (35%), Positives = 30/45 (66%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEF 390
           E+  R ++++++   I +HN +Y+ G VS+ LG+N++ DM   EF
Sbjct: 32  EEQVRFQVFSQNLQKIEQHNARYQNGEVSFYLGVNQFADMTSEEF 76


>UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra
           erinaceieuropaei|Rep: Cysteine proteinase - Spirometra
           erinaceieuropaei (Tapeworm)
          Length = 336

 Score = 64.1 bits (149), Expect = 4e-09
 Identities = 29/54 (53%), Positives = 39/54 (72%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           +V+WR+ GAVT +K+QG+CGSCWS +     EG    ++G L SLSEQ L+DCS
Sbjct: 124 SVNWRERGAVTSVKNQGQCGSCWSFSANGAIEGAIQIKTGALRSLSEQQLMDCS 177



 Score = 33.1 bits (72), Expect = 8.3
 Identities = 14/47 (29%), Positives = 25/47 (53%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVK 396
           E+  R + +  +   I +HNQ+Y   L SY + +N + D+   EF +
Sbjct: 48  EELHRKRAFFNNLDFIIRHNQRYYQQLESYAVRLNDFSDLTPGEFAE 94


>UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 513

 Score = 64.1 bits (149), Expect = 4e-09
 Identities = 27/53 (50%), Positives = 36/53 (67%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           VDWRK GAV  +K QG CGSC++ A     EG HF ++G  + LSEQ ++DC+
Sbjct: 300 VDWRKAGAVNSVKSQGICGSCYAFAVAGALEGAHFIKTGLKLDLSEQQIVDCT 352


>UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27;
           Eukaryota|Rep: Cysteine proteinase GP-I - Zingiber
           officinale (Ginger)
          Length = 221

 Score = 64.1 bits (149), Expect = 4e-09
 Identities = 27/54 (50%), Positives = 38/54 (70%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           ++DWR+ GAV  +K+QG CGSCW+   +   EG +   +G L+SLSEQ L+DCS
Sbjct: 6   SIDWREKGAVVPVKNQGGCGSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCS 59


>UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11;
           Trichomonadidae|Rep: Cysteine protease - Tritrichomonas
           foetus (Trichomonas foetus)
          Length = 315

 Score = 63.7 bits (148), Expect = 5e-09
 Identities = 25/53 (47%), Positives = 36/53 (67%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           ++DWR+ G V +IKDQ  CGSCW+ + ++  E  +   +G L S SEQNL+DC
Sbjct: 103 SIDWREKGVVNEIKDQAACGSCWAFSAIQAAESAYAISTGTLESYSEQNLVDC 155


>UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Rep:
           Cysteine protease - Babesia equi
          Length = 438

 Score = 63.7 bits (148), Expect = 5e-09
 Identities = 31/89 (34%), Positives = 44/89 (49%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEHXXXXX 689
           +DWRK   VT +KDQG CGSCW+ A +   E  +  + G  + LSEQ L++C E+     
Sbjct: 228 LDWRKLNGVTPVKDQGNCGSCWAFAAVGSVESLYLIKKGQALDLSEQELVNCEENSNGCE 287

Query: 690 XXXXXXXXLQVHQGQRGDRHRADLPYEGS 776
                     +    +G  H  DLPY  +
Sbjct: 288 GDLPNKALEYIK--AKGISHSKDLPYHAA 314


>UniRef50_P25779 Cluster: Cruzipain precursor; n=54;
           Trypanosoma|Rep: Cruzipain precursor - Trypanosoma cruzi
          Length = 467

 Score = 63.7 bits (148), Expect = 5e-09
 Identities = 29/58 (50%), Positives = 37/58 (63%)
 Frame = +3

Query: 498 AAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           A  AVDWR  GAVT +KDQG+CGSCW+ + +   E Q F     L +LSEQ L+ C +
Sbjct: 123 APAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDK 180


>UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1;
           Phytophthora infestans|Rep: Cathepsin-like cysteine
           protease - Phytophthora infestans (Potato late blight
           fungus)
          Length = 376

 Score = 63.3 bits (147), Expect = 7e-09
 Identities = 25/52 (48%), Positives = 36/52 (69%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           DWR+H  VT +K+QG+CGSCW+ + +   E  +   +G L SLSEQ L+DC+
Sbjct: 138 DWREHSTVTPVKNQGQCGSCWAFSAVAAMECAYALSTGTLESLSEQELVDCT 189



 Score = 35.5 bits (78), Expect = 1.6
 Identities = 23/83 (27%), Positives = 38/83 (45%), Gaps = 1/83 (1%)
 Frame = +1

Query: 268 RMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYM 447
           R + +A +   I  HN+ YE G  S+ LG+N   D+   E+ + ++   + +K       
Sbjct: 64  RFRSFATNLERIQTHNEAYERGEHSFTLGLNDLADLADAEYKQLLSYRTRDSK------- 116

Query: 448 KGGSVRGAKFISPANVK-LPERW 513
              S     F+ P NV+ LP  W
Sbjct: 117 --SSSASETFVKPENVEDLPATW 137


>UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium
           falciparum|Rep: Falcipain 2 - Plasmodium falciparum
          Length = 484

 Score = 63.3 bits (147), Expect = 7e-09
 Identities = 26/54 (48%), Positives = 35/54 (64%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           A DWR H  VT +KDQ  CGSCW+ + +   E Q+  +   L++LSEQ L+DCS
Sbjct: 264 AYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCS 317


>UniRef50_Q22RC9 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 389

 Score = 63.3 bits (147), Expect = 7e-09
 Identities = 27/58 (46%), Positives = 38/58 (65%)
 Frame = +3

Query: 492 REAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           ++A  + DWR HGAVT +K+QG  G+CW+ +     EGQ F     LVSLSE+ ++DC
Sbjct: 123 QDAPTSYDWRDHGAVTPVKNQGTVGTCWTFSTTGNIEGQWFLAGNPLVSLSEEQIVDC 180


>UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1];
           n=11; Eutheria|Rep: Testin-2 precursor [Contains:
           Testin-1] - Mus musculus (Mouse)
          Length = 333

 Score = 63.3 bits (147), Expect = 7e-09
 Identities = 28/52 (53%), Positives = 36/52 (69%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           VDWR  G VT +K+QG C S W+ +     EGQ F+++G LV LSEQNL+DC
Sbjct: 118 VDWRMLGYVTPVKNQGYCASSWAFSATGSLEGQMFKKTGRLVPLSEQNLLDC 169



 Score = 42.3 bits (95), Expect = 0.014
 Identities = 18/54 (33%), Positives = 31/54 (57%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNK 417
           E+  R  ++ ++  +I  HN +Y  G   + + MN +GD+ + EFVK M GF +
Sbjct: 44  EERLRRAVWEKNFKMIELHNWEYLEGKHDFTMTMNAFGDLTNTEFVKMMTGFRR 97


>UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9;
           Onchocercidae|Rep: Cathepsin L-like precursor - Brugia
           pahangi (Filarial nematode worm)
          Length = 395

 Score = 63.3 bits (147), Expect = 7e-09
 Identities = 25/55 (45%), Positives = 38/55 (69%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
           VDWR  GAVT +++QG+CGSC++ A     E  H + +G L+ LS QN++DC+ +
Sbjct: 186 VDWRTKGAVTPVRNQGECGSCYAFATAAALEAYHKQMTGRLLDLSPQNIVDCTRN 240



 Score = 44.8 bits (101), Expect = 0.003
 Identities = 19/46 (41%), Positives = 29/46 (63%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFV 393
           E+NFRM I+  ++ +  + N+KYE GLVSY   +N   D+   EF+
Sbjct: 106 ENNFRMAIFESNELMTERINKKYEQGLVSYTTALNDLADLTDEEFM 151


>UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin l;
           n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
           similar to cathepsin l - Strongylocentrotus purpuratus
          Length = 489

 Score = 62.9 bits (146), Expect = 9e-09
 Identities = 28/53 (52%), Positives = 34/53 (64%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           +DW   GAV+ +KDQ  CGSCWS    E  EG  F QSG  V LS+Q L+DC+
Sbjct: 271 IDWNVLGAVSPVKDQAVCGSCWSFGSAETIEGAVFMQSGKRVRLSQQMLMDCT 323


>UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain;
           n=16; Chrysomelidae|Rep: Digestive cysteine protease
           intestain - Leptinotarsa decemlineata (Colorado potato
           beetle)
          Length = 326

 Score = 62.9 bits (146), Expect = 9e-09
 Identities = 26/58 (44%), Positives = 37/58 (63%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           E   ++DW + GAV ++KDQ  CGSCW+ +     EGQ+   +   +SLSEQ L+DCS
Sbjct: 109 EVPDSIDWTEKGAVLEVKDQNPCGSCWAFSATGALEGQNAILNNVKISLSEQQLLDCS 166



 Score = 39.1 bits (87), Expect = 0.13
 Identities = 16/51 (31%), Positives = 28/51 (54%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNG 408
           E+  R  I+  +   I +HN +Y+ G  +Y LG+ ++ D+ H EF   + G
Sbjct: 39  EEKTRFGIFQRNLIKIKEHNARYDKGEETYLLGVTRFADLTHEEFKDILKG 89


>UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan
           CA, family C1, cathepsin L-like cysteine peptidase -
           Trichomonas vaginalis G3
          Length = 306

 Score = 62.9 bits (146), Expect = 9e-09
 Identities = 25/52 (48%), Positives = 35/52 (67%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           +DWR+ G V  IK+QG CGSCW+ + +++ E Q  +    L  LSEQNL+DC
Sbjct: 92  IDWREQGIVNKIKNQGACGSCWAFSAIQVIESQVAKNQKQLYDLSEQNLLDC 143


>UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis
           thaliana|Rep: Cysteine proteinase - Arabidopsis thaliana
           (Mouse-ear cress)
          Length = 348

 Score = 62.5 bits (145), Expect = 1e-08
 Identities = 27/53 (50%), Positives = 36/53 (67%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           ++DWR+ GAVT +K QG+CG CW+ + +   EG      G LVSLSEQ L+DC
Sbjct: 131 SMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQLLDC 183


>UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium
           (Vinckeia)|Rep: Berghepain-2 - Plasmodium yoelii yoelii
          Length = 472

 Score = 62.5 bits (145), Expect = 1e-08
 Identities = 26/54 (48%), Positives = 35/54 (64%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
           DWR H A+ DIKDQ KC SCW+ A   +   Q+  +    VSLSEQ L+DC+++
Sbjct: 255 DWRDHNAIIDIKDQQKCASCWAFATAGVVAAQYAIRKNQKVSLSEQQLVDCAQN 308


>UniRef50_Q23H06 Cluster: Papain family cysteine protease containing
           protein; n=18; Tetrahymena thermophila|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 349

 Score = 62.5 bits (145), Expect = 1e-08
 Identities = 26/55 (47%), Positives = 36/55 (65%)
 Frame = +3

Query: 501 AGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           A ++DWR  GAVT +K QG CG+CW+ +   + E  +F Q+  LV  SEQ L+DC
Sbjct: 142 ATSIDWRSRGAVTQVKWQGNCGACWAFSATGVMESFNFIQNKALVEFSEQQLLDC 196


>UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteinase
           A; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like
           tick cysteine proteinase A - Haemaphysalis longicornis
           (Bush tick)
          Length = 312

 Score = 62.5 bits (145), Expect = 1e-08
 Identities = 26/54 (48%), Positives = 38/54 (70%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           VDW + G+   +K+QG+CGSCW+ +     EGQHFR++   V+  EQNL+DCS+
Sbjct: 97  VDWAQEGSRAPVKNQGQCGSCWAFSTTGSLEGQHFRKTESRVT-GEQNLVDCSD 149



 Score = 42.3 bits (95), Expect = 0.014
 Identities = 48/186 (25%), Positives = 73/186 (39%), Gaps = 5/186 (2%)
 Frame = +1

Query: 217 LQAAAPSQLRKRGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLG-MNKYGDMLHHEFV 393
           LQ AA S ++        +KI+ E+  ++AKHN KY  GL   ++G     GD     +V
Sbjct: 4   LQIAAQSGVQFPRRRTIEVKIFTENTLLVAKHNAKYAKGLGVLQVGPWTSLGDFA-AAWV 62

Query: 394 KTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPERWT-GGSTAPSPTSRTKGS--- 561
           +    ++  A   +N    G  +     ++ +++     W   GS AP       GS   
Sbjct: 63  RQNGQWDTAASRTRN---SGPHLFHQANLNDSSLPTTVDWAQEGSRAPVKNQGQCGSCWA 119

Query: 562 VAHAGLQHDWSFGKDSTSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGG 741
            +  G      F K  + V+                    Q    G     F+YIK NGG
Sbjct: 120 FSTTGSLEGQHFRKTESRVT------GEQNLVDCSDDFGNQGCNGGLMDNGFQYIKANGG 173

Query: 742 IDTEQT 759
           IDTE+T
Sbjct: 174 IDTEET 179


>UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=17; Trichomonas vaginalis|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 318

 Score = 62.5 bits (145), Expect = 1e-08
 Identities = 26/53 (49%), Positives = 36/53 (67%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           AVDWR    V  IKDQ +CGSCW+ + ++  E Q   + G L+SL+EQN++DC
Sbjct: 103 AVDWRNAKIVNPIKDQAQCGSCWAFSVVQAQESQWALKKGQLLSLAEQNMVDC 155


>UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14;
           Leishmania|Rep: Cysteine proteinase 1 precursor -
           Leishmania pifanoi
          Length = 354

 Score = 62.1 bits (144), Expect = 2e-08
 Identities = 28/53 (52%), Positives = 35/53 (66%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           +VDWR  GAVT +K+QG CGSCW+ + +   EGQ       LVSLSEQ L+ C
Sbjct: 132 SVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQMLVSC 184


>UniRef50_UPI000155637A Cluster: PREDICTED: similar to
           ENSANGP00000013730, partial; n=1; Ornithorhynchus
           anatinus|Rep: PREDICTED: similar to ENSANGP00000013730,
           partial - Ornithorhynchus anatinus
          Length = 229

 Score = 61.7 bits (143), Expect = 2e-08
 Identities = 30/55 (54%), Positives = 37/55 (67%), Gaps = 1/55 (1%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHF-RQSGYLVSLSEQNLIDCS 668
           ++DWR +GAVT +KDQ  CGSCWS A     EG  F + +  LV LS+Q LIDCS
Sbjct: 58  SLDWRLYGAVTPVKDQAVCGSCWSFATTGTLEGALFLKVTVQLVPLSQQMLIDCS 112


>UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin K
           precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2);
           n=2; Tribolium castaneum|Rep: PREDICTED: similar to
           Cathepsin K precursor (Cathepsin O) (Cathepsin X)
           (Cathepsin O2) - Tribolium castaneum
          Length = 332

 Score = 61.7 bits (143), Expect = 2e-08
 Identities = 25/58 (43%), Positives = 38/58 (65%)
 Frame = +3

Query: 492 REAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           R  + ++DWR+ G VT +K+QG+CGSCW+ A +   E  +  +    +SLSEQ L+DC
Sbjct: 116 RGISASLDWRQRGGVTPVKNQGQCGSCWAFATIGAIESHYKIRHKRAISLSEQQLVDC 173



 Score = 40.7 bits (91), Expect = 0.041
 Identities = 13/44 (29%), Positives = 28/44 (63%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 387
           E+ FR  ++ ++  I+ +HN+++  G  +Y++G+NK+ D    E
Sbjct: 43  EETFRKSLFTKNLEIVEEHNERFRNGSETYEMGVNKFSDFTDEE 86


>UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza
           sativa|Rep: Putative cysteine protease - Oryza sativa
           subsp. japonica (Rice)
          Length = 357

 Score = 61.7 bits (143), Expect = 2e-08
 Identities = 27/52 (51%), Positives = 34/52 (65%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           +DWR  GAVT +KDQG CGS W+ A +   EG    ++G L  LSEQ L+DC
Sbjct: 137 IDWRFKGAVTGVKDQGACGSSWAFAAVAAMEGLMKIRTGQLTPLSEQELVDC 188


>UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa
           (japonica cultivar-group)|Rep: Os09g0562700 protein -
           Oryza sativa subsp. japonica (Rice)
          Length = 235

 Score = 61.7 bits (143), Expect = 2e-08
 Identities = 26/46 (56%), Positives = 35/46 (76%)
 Frame = +3

Query: 528 GAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           GAVT++KDQG+CGSCW+ + + + EG    + G LVSLSEQ L+DC
Sbjct: 19  GAVTEVKDQGRCGSCWAFSTVAVVEGIQKIKKGKLVSLSEQELVDC 64


>UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep:
           Actinidin Act3a - Actinidia eriantha
          Length = 380

 Score = 61.7 bits (143), Expect = 2e-08
 Identities = 27/53 (50%), Positives = 36/53 (67%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           VDWR  GAV D+K+QG C SCW+ A +   E  +   +G L+SLSEQ L+DC+
Sbjct: 130 VDWRTTGAVVDVKNQGLCSSCWAFATIATVESINQIITGDLISLSEQELVDCN 182



 Score = 37.9 bits (84), Expect = 0.29
 Identities = 21/66 (31%), Positives = 33/66 (50%), Gaps = 1/66 (1%)
 Frame = +1

Query: 253 GEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHN 432
           GE   R++I+ E+   I +HN        SY +G+N++ D+   E+  T  GF  + K  
Sbjct: 57  GEREMRIEIFKENLRFIDEHNADPNR---SYTVGLNQFADLTDEEYRSTYLGFKSSLKSK 113

Query: 433 -KNLYM 447
             N YM
Sbjct: 114 VSNRYM 119


>UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 514

 Score = 61.7 bits (143), Expect = 2e-08
 Identities = 26/64 (40%), Positives = 41/64 (64%)
 Frame = +3

Query: 477 HIAGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNL 656
           H+  + +    +DWR +GAV+ ++ QG CGSC++ A +   EG +F ++G L  LS Q +
Sbjct: 296 HVLQRVDVPDELDWRDYGAVSPVRGQGICGSCYALAAVGAVEGAYFMKTGKLKELSAQQV 355

Query: 657 IDCS 668
           IDCS
Sbjct: 356 IDCS 359


>UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep:
           Cathepsin L - Kudoa thyrsites
          Length = 300

 Score = 61.7 bits (143), Expect = 2e-08
 Identities = 26/54 (48%), Positives = 36/54 (66%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           +VDW+  G VT +K+QG CGSCWS +     E  +  ++G LV+ SEQ L+DCS
Sbjct: 105 SVDWKALGKVTSVKNQGHCGSCWSFSAAGAIESAYAIKTGELVNFSEQQLVDCS 158


>UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep:
           Cysteine protease - Saprolegnia parasitica
          Length = 523

 Score = 61.3 bits (142), Expect = 3e-08
 Identities = 26/55 (47%), Positives = 35/55 (63%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
           +DW + G VT +K+QG CGSCW+ +     EG  F  S  LVS+SEQ L+DC  +
Sbjct: 120 MDWVEQGGVTPVKNQGMCGSCWAFSTTGAIEGAAFVSSKQLVSVSEQELVDCDHN 174


>UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1;
           Oryza sativa (japonica cultivar-group)|Rep: Putative
           uncharacterized protein - Oryza sativa subsp. japonica
           (Rice)
          Length = 326

 Score = 61.3 bits (142), Expect = 3e-08
 Identities = 27/52 (51%), Positives = 37/52 (71%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQ 650
           +A  A DWR+HGAVT +KDQG CGSCW+ + +E  EG +   +G  ++LSEQ
Sbjct: 116 DAPPAWDWREHGAVTRVKDQGPCGSCWAFSVVEAVEGINEIMTGNFLTLSEQ 167



 Score = 37.1 bits (82), Expect = 0.51
 Identities = 21/63 (33%), Positives = 31/63 (49%)
 Frame = +1

Query: 226 AAPSQLRKRGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMN 405
           AA S  R   +   R +++ ++   I   N+K  M   SYKLG+NK+ D+   EF     
Sbjct: 35  AASSSPRDLADKGSRFEVFKKNARYIHDFNRKKGM---SYKLGLNKFADLTLEEFTAKYT 91

Query: 406 GFN 414
           G N
Sbjct: 92  GAN 94


>UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain;
           n=9; Cucujiformia|Rep: Digestive cysteine proteinase
           intestain - Leptinotarsa decemlineata (Colorado potato
           beetle)
          Length = 326

 Score = 61.3 bits (142), Expect = 3e-08
 Identities = 26/59 (44%), Positives = 37/59 (62%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           E   ++DW + GAV D+K QG CGSCW+ +     EGQ+   +   + LSEQ L+DCS+
Sbjct: 109 EVPDSIDWTQKGAVLDVKYQGGCGSCWAFSATGALEGQNAIVNNVKIPLSEQQLLDCSK 167



 Score = 39.1 bits (87), Expect = 0.13
 Identities = 17/45 (37%), Positives = 25/45 (55%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEF 390
           E+  R  I+  +   I +HN KY+ G  SY LG+  + D+ H EF
Sbjct: 39  EERTRFGIFQSNLRKIEEHNAKYDKGEESYFLGVTPFADLTHDEF 83


>UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 664

 Score = 61.3 bits (142), Expect = 3e-08
 Identities = 22/54 (40%), Positives = 39/54 (72%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           ++DWR  G V+ +K+QG CGSC++ + +   E  ++R++  ++ LSEQNL+DC+
Sbjct: 473 SIDWRTWGMVSKVKNQGSCGSCYAFSTVGALESHYYRKNNRMLDLSEQNLVDCT 526


>UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel
           protein - Danio rerio (Zebrafish) (Brachydanio rerio)
          Length = 328

 Score = 60.9 bits (141), Expect = 4e-08
 Identities = 25/53 (47%), Positives = 37/53 (69%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           V+W +HG V+ +++QG CGSCW+ + +   E Q  R++  LV LS QNL+DCS
Sbjct: 117 VNWTEHGMVSPVQNQGPCGSCWAFSAVGSLEAQMKRRTAALVPLSAQNLLDCS 169



 Score = 38.3 bits (85), Expect = 0.22
 Identities = 20/55 (36%), Positives = 29/55 (52%)
 Frame = +1

Query: 244 RKRGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNG 408
           R   E+  R  ++ ++   I  HN+   +GL SY LG+N+  DM   E V  MNG
Sbjct: 39  RNTREERLRRSVWKQNLQDILLHNEAAAVGLHSYTLGLNQLSDMTADE-VNDMNG 92


>UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Rep:
           Cysteine protease - Solanum lycopersicum (Tomato)
           (Lycopersicon esculentum)
          Length = 345

 Score = 60.9 bits (141), Expect = 4e-08
 Identities = 24/53 (45%), Positives = 36/53 (67%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           +DWR+ GAVT +K QG+CG CW+ + +   EG +   +G L+  SEQ L+DC+
Sbjct: 135 LDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 187



 Score = 35.5 bits (78), Expect = 1.6
 Identities = 19/53 (35%), Positives = 27/53 (50%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFN 414
           E   R  I+ E+   I   N+    G +SYKLGMN++ D+   EF+    G N
Sbjct: 55  EKGERFMIFKENMKFIESVNKA---GNLSYKLGMNEFADITSQEFLAKFTGLN 104


>UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2;
           Oryza sativa|Rep: Putative uncharacterized protein -
           Oryza sativa subsp. indica (Rice)
          Length = 149

 Score = 60.9 bits (141), Expect = 4e-08
 Identities = 26/55 (47%), Positives = 38/55 (69%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           ++DWRK GAV ++K Q  CGSCW+ + +   EG    ++G LVSLS+Q L+DC +
Sbjct: 20  SIDWRKKGAVVEVKYQEDCGSCWAFSAVAAIEG--INKNGELVSLSKQELVDCDD 72


>UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila
           melanogaster|Rep: LD36817p - Drosophila melanogaster
           (Fruit fly)
          Length = 352

 Score = 60.9 bits (141), Expect = 4e-08
 Identities = 28/54 (51%), Positives = 36/54 (66%), Gaps = 1/54 (1%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGK-CGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           DWR+ G VT    QG  CG+CWS A     EG  FR++G L SLS+QNL+DC++
Sbjct: 135 DWREKGGVTPPGFQGVGCGACWSFATTGALEGHLFRRTGVLASLSQQNLVDCAD 188


>UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1;
           Brugia malayi|Rep: Cathepsin F-like cysteine proteinase
           - Brugia malayi (Filarial nematode worm)
          Length = 461

 Score = 60.9 bits (141), Expect = 4e-08
 Identities = 27/51 (52%), Positives = 33/51 (64%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           DWR  G VT +KDQG CGSCW+ +     E     ++G L+SLSEQ LIDC
Sbjct: 253 DWRTEGVVTPVKDQGSCGSCWAFSVTGNIESLWAIKTGKLISLSEQELIDC 303


>UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960
           precursor; n=2; Arabidopsis thaliana|Rep: Probable
           cysteine proteinase At3g43960 precursor - Arabidopsis
           thaliana (Mouse-ear cress)
          Length = 376

 Score = 60.9 bits (141), Expect = 4e-08
 Identities = 30/53 (56%), Positives = 36/53 (67%), Gaps = 1/53 (1%)
 Frame = +3

Query: 510 VDWRKHGAVTD-IKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           VDWR+ GAV   +K QG+CGSCW+ A     EG +   +G LVSLSEQ LIDC
Sbjct: 131 VDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDC 183


>UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis
           zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa
           zeasingle nucleocapsid nuclear polyhedrosis virus)
          Length = 367

 Score = 60.9 bits (141), Expect = 4e-08
 Identities = 34/88 (38%), Positives = 41/88 (46%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEHXXXXXX 692
           DWR    VT IKDQG CGSCW+   +   E Q+  +   L+ LSEQ L+DC E       
Sbjct: 161 DWRDTNKVTPIKDQGVCGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCDE-VDLGCN 219

Query: 693 XXXXXXXLQVHQGQRGDRHRADLPYEGS 776
                   Q      G    AD PY+GS
Sbjct: 220 GGLMHLAFQELLLMGGVETEADYPYQGS 247


>UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcoptes
           scabiei type hominis|Rep: Cathepsin L-like protease -
           Sarcoptes scabiei type hominis
          Length = 245

 Score = 60.5 bits (140), Expect = 5e-08
 Identities = 26/53 (49%), Positives = 34/53 (64%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           VDW     V  IKDQ +CGSCW+ + +   E Q+  ++G LV LSEQ L+DCS
Sbjct: 124 VDWTLKNVVAPIKDQKQCGSCWAFSAVASMESQNALKTGQLVELSEQELVDCS 176



 Score = 46.0 bits (104), Expect = 0.001
 Identities = 21/56 (37%), Positives = 34/56 (60%)
 Frame = +1

Query: 238 QLRKRGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMN 405
           Q R   ++  R  I+  +   I KHN+KYE GL +Y+LG+N++ D+ + E+   MN
Sbjct: 43  QFRTVYDELLRKLIFQRNYIYIRKHNEKYEAGLSTYELGVNQFTDLTNKEYNDQMN 98


>UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep:
           Dvir_CG5367 - Drosophila virilis (Fruit fly)
          Length = 298

 Score = 60.5 bits (140), Expect = 5e-08
 Identities = 24/52 (46%), Positives = 38/52 (73%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           DWRK G +T + +Q  CGSC++ +  +  EGQ F+++G +V+LSEQ ++DCS
Sbjct: 92  DWRKKGFITPLYNQQSCGSCYAFSIAQSIEGQVFKRTGKIVALSEQQIVDCS 143


>UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Spodoptera
           litura multicapsid nucleopolyhedrovirus (SpltMNPV)
          Length = 337

 Score = 60.5 bits (140), Expect = 5e-08
 Identities = 31/87 (35%), Positives = 42/87 (48%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEHXXXXXX 692
           DWRK   VT +K+QG CGSCW+ A +   E Q+      L+ LSEQ L+DC         
Sbjct: 131 DWRKLNKVTKVKEQGVCGSCWAFAAIGNIESQYAIMHDSLIDLSEQQLLDCDRVDQGCDG 190

Query: 693 XXXXXXXLQVHQGQRGDRHRADLPYEG 773
                   ++ +   G  H  D PY+G
Sbjct: 191 GLMHLAFQEIIR-IGGVEHEIDYPYQG 216


>UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Rep:
           Cathepsin R precursor - Mus musculus (Mouse)
          Length = 334

 Score = 60.5 bits (140), Expect = 5e-08
 Identities = 36/95 (37%), Positives = 45/95 (47%), Gaps = 3/95 (3%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE---HXX 680
           VDWRK G VT ++ QG C +CW+ A     E Q   Q+G L  LS QNL+DCS+   +  
Sbjct: 119 VDWRKKGYVTPVRRQGDCDACWAFAVTGAIEAQAIWQTGKLTPLSVQNLVDCSKPQGNNG 178

Query: 681 XXXXXXXXXXXLQVHQGQRGDRHRADLPYEGS*RP 785
                        +H G  G    A  PYEG   P
Sbjct: 179 CLGGDTYNAFQYVLHNG--GLESEATYPYEGKDGP 211



 Score = 34.7 bits (76), Expect = 2.7
 Identities = 42/179 (23%), Positives = 68/179 (37%), Gaps = 4/179 (2%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNK-TAKHN 432
           E+  +  ++ E   +I  HN++  +G   + + MN++GD    EF K M   +  T +  
Sbjct: 44  EEKLKRVVWEEKLKMIKLHNRENSLGKNGFTMKMNEFGDQTDEEFRKMMIEISVWTHREG 103

Query: 433 KNLYMKGGSVRGAKFIS--PANVKLPERWTGGSTAPSPTSRTKGSVAHAGLQHDWSFGK- 603
           K++  +       KF+         P R  G   A    + T G++     Q  W  GK 
Sbjct: 104 KSIMKREAGSILPKFVDWRKKGYVTPVRRQGDCDACWAFAVT-GAIE---AQAIWQTGKL 159

Query: 604 DSTSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGGIDTEQTYLTRGVD 780
              SV     C                    G     F+Y+  NGG+++E TY   G D
Sbjct: 160 TPLSVQNLVDCSKPQGNNGCLG---------GDTYNAFQYVLHNGGLESEATYPYEGKD 209


>UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin W,
           partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED:
           similar to Cathepsin W, partial - Ornithorhynchus
           anatinus
          Length = 229

 Score = 60.1 bits (139), Expect = 6e-08
 Identities = 26/52 (50%), Positives = 36/52 (69%), Gaps = 1/52 (1%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSG-YLVSLSEQNLIDC 665
           DWRK GA+T +K+QG CGSCW+ A +   E   + ++G  LVSLS Q ++DC
Sbjct: 73  DWRKRGAITSVKNQGSCGSCWAFAAVGNAESMWYLRAGKRLVSLSVQEVLDC 124


>UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13;
           Plasmodium|Rep: Cysteine protease falcipain-3 -
           Plasmodium falciparum
          Length = 492

 Score = 60.1 bits (139), Expect = 6e-08
 Identities = 26/54 (48%), Positives = 33/54 (61%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           A DWR HG VT +KDQ  CGSCW+ + +   E Q+  +   L   SEQ L+DCS
Sbjct: 272 AYDWRLHGGVTPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVDCS 325


>UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia
           circumcincta|Rep: Secreted cathepsin F - Teladorsagia
           circumcincta
          Length = 364

 Score = 60.1 bits (139), Expect = 6e-08
 Identities = 26/51 (50%), Positives = 33/51 (64%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           DWR+HGAVT +K +G C +CW+ +     EGQ F     LVSLS Q L+DC
Sbjct: 158 DWREHGAVTKVKTEGHCAACWAFSVTGNIEGQWFLAKKKLVSLSAQQLLDC 208


>UniRef50_Q24FN2 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 367

 Score = 60.1 bits (139), Expect = 6e-08
 Identities = 23/54 (42%), Positives = 37/54 (68%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           ++DWR+ GAV+ +K+QG CGSCW+ + + L E  +  ++  L   SEQ L+DC+
Sbjct: 158 SIDWRQSGAVSPVKNQGSCGSCWAFSAVALAESVNLLRNNSLALYSEQELVDCT 211


>UniRef50_Q22A69 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 330

 Score = 60.1 bits (139), Expect = 6e-08
 Identities = 28/57 (49%), Positives = 35/57 (61%), Gaps = 1/57 (1%)
 Frame = +3

Query: 498 AAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQ-SGYLVSLSEQNLIDC 665
           A  A+DW   GAVT +K+QG CGSCW+ +     EGQ+  Q    L S SEQ L+DC
Sbjct: 112 APTAIDWTTKGAVTPVKNQGSCGSCWAFSTTGSIEGQYVLQLKQNLTSFSEQQLVDC 168


>UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30;
           Nucleopolyhedrovirus|Rep: Viral cathepsin - Lymantria
           dispar multicapsid nuclear polyhedrosis virus (LdMNPV)
          Length = 356

 Score = 60.1 bits (139), Expect = 6e-08
 Identities = 26/51 (50%), Positives = 32/51 (62%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           DWR+   VT IK+QG CG+CW+ A L   E Q   +   L+ LSEQ LIDC
Sbjct: 149 DWREQNKVTSIKNQGACGACWAFATLASVESQFAMRHNRLIDLSEQQLIDC 199


>UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease
           containing protein; n=2; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 332

 Score = 59.7 bits (138), Expect = 8e-08
 Identities = 25/56 (44%), Positives = 38/56 (67%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
           +VDWRK GAV+ ++DQG CGSC++ A     EG +  ++G L   S Q ++DC++H
Sbjct: 130 SVDWRKLGAVSPVRDQGNCGSCYAFASTGALEGLYQIKTGKLEVFSPQYIVDCAKH 185


>UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep:
           MGC107932 protein - Xenopus tropicalis (Western clawed
           frog) (Silurana tropicalis)
          Length = 333

 Score = 59.7 bits (138), Expect = 8e-08
 Identities = 26/55 (47%), Positives = 38/55 (69%), Gaps = 1/55 (1%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGK-CGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           VDWRK   VT +K+QG  CGSCW+ A + + E ++  ++  L++LSEQ L+DC E
Sbjct: 119 VDWRKSNCVTPVKNQGTFCGSCWAFATVGVMESRYCIRTKELLNLSEQQLVDCDE 173



 Score = 33.9 bits (74), Expect = 4.8
 Identities = 12/29 (41%), Positives = 20/29 (68%)
 Frame = +1

Query: 301 IAKHNQKYEMGLVSYKLGMNKYGDMLHHE 387
           + KHNQ  + GL SY++ MN++ D+  +E
Sbjct: 58  VQKHNQLADQGLKSYRMAMNQFADLTDNE 86


>UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus
           tauri|Rep: Cysteine protease-1 - Ostreococcus tauri
          Length = 430

 Score = 59.7 bits (138), Expect = 8e-08
 Identities = 27/55 (49%), Positives = 38/55 (69%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           A+DW + GAVT  K+QG+CGSCW+ +     EG    ++G LVSLSEQ ++ CS+
Sbjct: 204 AIDWVELGAVTPPKNQGQCGSCWAFSTTGAVEGITKIRTGRLVSLSEQEMVSCSK 258


>UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas
           foetus|Rep: TFCP2 protein - Tritrichomonas foetus
           (Trichomonas foetus)
          Length = 270

 Score = 59.7 bits (138), Expect = 8e-08
 Identities = 24/58 (41%), Positives = 35/58 (60%)
 Frame = +3

Query: 492 REAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           ++   + DWR  G V  IK+QG CGSCW+ + +   E  H   +G L+  SEQ+L+DC
Sbjct: 48  KDTPTSFDWRSEGKVNPIKNQGSCGSCWAFSAIAAQESCHAIATGELLRFSEQSLVDC 105


>UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3;
           Bilateria|Rep: Cathepsin L-like cysteine protease -
           Neobenedenia melleni
          Length = 335

 Score = 59.7 bits (138), Expect = 8e-08
 Identities = 25/53 (47%), Positives = 35/53 (66%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           +DW + G VT +K+Q +CGSCW+ +     EG   R +G L+S SEQ L+DCS
Sbjct: 122 IDWVRKGHVTAVKNQAQCGSCWAFSSTGSIEGAVKRATGKLISFSEQQLVDCS 174



 Score = 33.5 bits (73), Expect = 6.3
 Identities = 11/15 (73%), Positives = 15/15 (100%)
 Frame = +2

Query: 671 AYGNNGCNGGLMDNA 715
           A+GN+GCNGG+MDN+
Sbjct: 176 AFGNHGCNGGIMDNS 190


>UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Oryza
           sativa|Rep: Cysteine protease 1 precursor - Oryza sativa
           subsp. japonica (Rice)
          Length = 490

 Score = 59.7 bits (138), Expect = 8e-08
 Identities = 27/57 (47%), Positives = 40/57 (70%), Gaps = 1/57 (1%)
 Frame = +3

Query: 507 AVDWRKHGAVT-DIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
           +VDWR  GAV   +K+QG+CGSCW+ + +   EG +   +G LVSLSEQ L++C+ +
Sbjct: 158 SVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARN 214


>UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor;
           n=6; Chrysomelinae|Rep: Cathepsin L-like proteinase
           precursor - Phaedon cochleariae (Mustard beetle)
          Length = 324

 Score = 59.7 bits (138), Expect = 8e-08
 Identities = 31/92 (33%), Positives = 42/92 (45%)
 Frame = +3

Query: 498 AAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEHX 677
           A  ++DWR  G V  +++QG+CGSCW+ +     E Q   +SG  V LS Q L+DCS   
Sbjct: 110 APESIDWRSKGVVLPVRNQGECGSCWALSTAAAIESQSAIKSGSKVPLSPQQLVDCSTSY 169

Query: 678 XXXXXXXXXXXXLQVHQGQRGDRHRADLPYEG 773
                          +    G    AD PY G
Sbjct: 170 GNHGCNGGFAVNGFEYVKDNGLESDADYPYSG 201



 Score = 40.7 bits (91), Expect = 0.041
 Identities = 18/45 (40%), Positives = 26/45 (57%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEF 390
           E+  R  I+ +    IA+HN KYE G  +Y L +NK+ D+   EF
Sbjct: 39  EEKLRFNIFQDTLRQIAEHNVKYENGESTYYLAINKFSDITDEEF 83


>UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep:
           Vivapain-4 - Plasmodium vivax
          Length = 484

 Score = 59.3 bits (137), Expect = 1e-07
 Identities = 24/53 (45%), Positives = 35/53 (66%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           DWR+H AV++IK+Q  CGSCW+   +   E Q+  +    V +SEQ L+DCS+
Sbjct: 267 DWREHNAVSEIKNQNLCGSCWAFGAVGAVESQYAIRKNQHVLISEQELVDCSD 319


>UniRef50_Q235G6 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 325

 Score = 59.3 bits (137), Expect = 1e-07
 Identities = 26/53 (49%), Positives = 34/53 (64%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           +DW + GAVT +K+QG CG CWS A     EG +F     L +LS+Q LIDC+
Sbjct: 121 IDWVEKGAVTPVKNQGGCGGCWSFATTGGVEGANFVYKNVLPNLSQQQLIDCN 173


>UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18;
           Magnoliophyta|Rep: Thiol protease aleurain precursor -
           Arabidopsis thaliana (Mouse-ear cress)
          Length = 358

 Score = 59.3 bits (137), Expect = 1e-07
 Identities = 24/52 (46%), Positives = 34/52 (65%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           DWR+ G V+ +KDQG CGSCW+ +     E  + +  G  +SLSEQ L+DC+
Sbjct: 146 DWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCA 197



 Score = 44.0 bits (99), Expect = 0.004
 Identities = 52/179 (29%), Positives = 76/179 (42%), Gaps = 3/179 (1%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGF--NKTAKH 429
           E   R  I+ E+  +I   N+K   GL SYKLG+N++ D+   EF +T  G   N +A  
Sbjct: 75  EMKLRFSIFKENLDLIRSTNKK---GL-SYKLGVNQFADLTWQEFQRTKLGAAQNCSATL 130

Query: 430 NKNLYMKGGSVRGAKFISPANVKLPERWTGGSTAPSPTSRTKGSVAHAGLQHDWSFGKD- 606
             +  +   ++   K      +  P +  GG  +   T  T G++  A  Q   +FGK  
Sbjct: 131 KGSHKVTEAALPETKDWREDGIVSPVKDQGGCGS-CWTFSTTGALEAAYHQ---AFGKGI 186

Query: 607 STSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGGIDTEQTYLTRGVDD 783
           S S      C                    G   Q F+YIK NGG+DTE+ Y   G D+
Sbjct: 187 SLSEQQLVDCAGAFNNYGCNG---------GLPSQAFEYIKSNGGLDTEKAYPYTGKDE 236


>UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 360

 Score = 58.8 bits (136), Expect = 1e-07
 Identities = 24/51 (47%), Positives = 34/51 (66%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           DWR  GA+T +K Q  CG CW+ + ++  EG +F ++G L SLS Q +IDC
Sbjct: 136 DWRDKGAITPVKVQNGCGGCWAFSTVQSIEGLYFLKTGKLESLSTQQVIDC 186


>UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2;
           Roseiflexus|Rep: Peptidase C1A, papain precursor -
           Roseiflexus sp. RS-1
          Length = 1202

 Score = 58.8 bits (136), Expect = 1e-07
 Identities = 37/108 (34%), Positives = 46/108 (42%), Gaps = 3/108 (2%)
 Frame = +3

Query: 474 VHIAGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQN 653
           V +  Q     A +W   GA T +KDQG CGSCW+ A   + E    R  G    LSEQ 
Sbjct: 161 VVMGAQEGLPAAFNWCDQGACTPVKDQGVCGSCWAFATTGVVESALKRIDGVERDLSEQY 220

Query: 654 LIDCSEH---XXXXXXXXXXXXXLQVHQGQRGDRHRADLPYEGS*RPI 788
           LI    H                L  HQ + G  + +DLPY G   P+
Sbjct: 221 LISAGTHGTCNGGGPAYDLFIGDLPAHQTEAGAVYESDLPYLGQDVPL 268


>UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum
           aestivum|Rep: Cysteine protease - Triticum aestivum
           (Wheat)
          Length = 371

 Score = 58.8 bits (136), Expect = 1e-07
 Identities = 26/52 (50%), Positives = 30/52 (57%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           DWR+HG VT  K QG CG CW+ A     E  +    G LV LS Q L+DCS
Sbjct: 158 DWREHGVVTPAKQQGACGCCWAFAAAATVESLNKINGGELVDLSVQELVDCS 209


>UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n=1;
           Toxocara canis|Rep: Cathepsin L-like cysteine proteinase
           - Toxocara canis (Canine roundworm)
          Length = 360

 Score = 58.8 bits (136), Expect = 1e-07
 Identities = 27/63 (42%), Positives = 37/63 (58%)
 Frame = +3

Query: 480 IAGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLI 659
           +A + E     DWR +  VT +K Q KCGSCW+ A +   E  +   +G L SLSEQ L+
Sbjct: 139 LARREEIPDHFDWRPYNVVTPVKSQFKCGSCWAFATVGTVESAYALGTGELRSLSEQQLL 198

Query: 660 DCS 668
           DC+
Sbjct: 199 DCN 201


>UniRef50_Q2QS15 Cluster: Papain family cysteine protease containing
           protein; n=1; Oryza sativa (japonica
           cultivar-group)|Rep: Papain family cysteine protease
           containing protein - Oryza sativa subsp. japonica (Rice)
          Length = 351

 Score = 58.4 bits (135), Expect = 2e-07
 Identities = 26/55 (47%), Positives = 36/55 (65%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           +VDWRK GAV ++K    CGSCW+ + +   EG    ++G LVSL EQ L+DC +
Sbjct: 148 SVDWRKKGAVVEVKYHEDCGSCWAFSAVAAIEG--INKNGELVSLLEQELVDCDD 200


>UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba
           culbertsoni|Rep: Cysteine proteinase - Acanthamoeba
           culbertsoni
          Length = 482

 Score = 58.4 bits (135), Expect = 2e-07
 Identities = 26/52 (50%), Positives = 32/52 (61%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           DWR  GAVT +K+QG C SCW+       EG      G LVSLS+Q L+DC+
Sbjct: 161 DWRTKGAVTPVKNQGSCASCWAFVATGAVEGVRKIAGGSLVSLSDQMLLDCA 212


>UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella
           histriomuscorum|Rep: Cathepsin H - Oxytricha trifallax
           (Sterkiella histriomuscorum)
          Length = 366

 Score = 58.4 bits (135), Expect = 2e-07
 Identities = 23/52 (44%), Positives = 34/52 (65%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           DWR  G V+ +K+QGKCGSCW+ + +   E  +  + G   +LSEQ L+DC+
Sbjct: 140 DWRTFGVVSPVKNQGKCGSCWTFSTVGCVESHYLLKYGAFRNLSEQQLVDCA 191


>UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 234

 Score = 58.4 bits (135), Expect = 2e-07
 Identities = 26/52 (50%), Positives = 32/52 (61%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           +D+R  GAV +IKDQ  CGSCW+       E   F + G L SLSEQ L+DC
Sbjct: 22  IDYRTKGAVNEIKDQKHCGSCWAFGSCAAMESSWFLKHGTLYSLSEQCLVDC 73


>UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;
           n=1; Nasonia vitripennis|Rep: PREDICTED: similar to
           CG5367-PA - Nasonia vitripennis
          Length = 362

 Score = 58.0 bits (134), Expect = 3e-07
 Identities = 27/59 (45%), Positives = 38/59 (64%)
 Frame = +3

Query: 492 REAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           R    ++DWR+ G VT  ++Q  CGSC++ +      GQ FRQ+G +V LSEQ L+DCS
Sbjct: 149 RRIPKSLDWREKGFVTKPENQRDCGSCYAYSIAGSIAGQIFRQTGIVVPLSEQQLVDCS 207


>UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin O
           precursor; n=2; Apocrita|Rep: PREDICTED: similar to
           Cathepsin O precursor - Apis mellifera
          Length = 374

 Score = 58.0 bits (134), Expect = 3e-07
 Identities = 22/54 (40%), Positives = 36/54 (66%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
           DWR  G +T ++ QG CG+CW+ + +E+ E     ++G L SLS Q +IDC+++
Sbjct: 160 DWRDKGVITPVRSQGSCGACWAFSTIEVIESMFAIKNGTLHSLSVQEMIDCAKN 213


>UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryza
           sativa (japonica cultivar-group)|Rep: Putative cysteine
           proteinase - Oryza sativa subsp. japonica (Rice)
          Length = 357

 Score = 58.0 bits (134), Expect = 3e-07
 Identities = 25/53 (47%), Positives = 35/53 (66%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           ++WR  GAVT +K+Q  C SCW+ + +   EG H  +S  LV+LS Q L+DCS
Sbjct: 139 INWRDRGAVTQVKNQKDCASCWAFSAVAAVEGIHQIRSHNLVALSTQQLLDCS 191


>UniRef50_Q0JP65 Cluster: Os01g0240900 protein; n=3; Oryza
           sativa|Rep: Os01g0240900 protein - Oryza sativa subsp.
           japonica (Rice)
          Length = 166

 Score = 58.0 bits (134), Expect = 3e-07
 Identities = 28/57 (49%), Positives = 36/57 (63%), Gaps = 3/57 (5%)
 Frame = +3

Query: 504 GAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSG---YLVSLSEQNLIDC 665
           GA  WR  GAVTD+K QG C SCW+ +     EG +F  SG    L++LSEQ L++C
Sbjct: 100 GASIWRDRGAVTDVKMQGTCASCWAFSTTGAVEGDNFLASGNLRNLLNLSEQQLVNC 156


>UniRef50_Q23VA1 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 397

 Score = 58.0 bits (134), Expect = 3e-07
 Identities = 23/56 (41%), Positives = 35/56 (62%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
           +VDWR  G V+ +KDQG+CG CW+ +   L E  +  ++  L   SEQ L+DC+ +
Sbjct: 183 SVDWRIQGKVSPVKDQGRCGCCWAFSATALAESVNLMRNNTLQQYSEQELVDCTNN 238


>UniRef50_Q23TW3 Cluster: Papain family cysteine protease containing
           protein; n=5; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 437

 Score = 58.0 bits (134), Expect = 3e-07
 Identities = 32/91 (35%), Positives = 46/91 (50%), Gaps = 3/91 (3%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGK-CGSCWSSARLELWEGQHFRQSGYL-VSLSEQNLIDCS-EHXX 680
           VDWR+ G VT +K QGK CGSCW+ A +   E  +  ++G   +  SEQ L+DC+ +   
Sbjct: 209 VDWREKGVVTQVKSQGKDCGSCWAFAAVAALESHYALKTGKKPIQFSEQQLVDCARKFDT 268

Query: 681 XXXXXXXXXXXLQVHQGQRGDRHRADLPYEG 773
                       +      G ++ AD PYEG
Sbjct: 269 KGCSGGLPSKGFEYLAYAGGIQNEADYPYEG 299


>UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;
           n=17; Magnoliophyta|Rep: Thiol protease aleurain-like
           precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 358

 Score = 58.0 bits (134), Expect = 3e-07
 Identities = 23/52 (44%), Positives = 34/52 (65%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           DWR+ G V+ +K+QG CGSCW+ +     E  + +  G  +SLSEQ L+DC+
Sbjct: 146 DWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCA 197



 Score = 36.7 bits (81), Expect = 0.67
 Identities = 49/178 (27%), Positives = 71/178 (39%), Gaps = 3/178 (1%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGF--NKTAKH 429
           E   R  ++ E+  +I   N+K   GL SYKL +N++ D+   EF +   G   N +A  
Sbjct: 75  EMKLRFSVFKENLDLIRSTNKK---GL-SYKLSLNQFADLTWQEFQRYKLGAAQNCSATL 130

Query: 430 NKNLYMKGGSVRGAKFISPANVKLPERWTGGSTAPSPTSRTKGSVAHAGLQHDWSFGKD- 606
             +  +   +V   K      +  P +   G      T  T G++  A  Q   +FGK  
Sbjct: 131 KGSHKITEATVPDTKDWREDGIVSPVK-EQGHCGSCWTFSTTGALEAAYHQ---AFGKGI 186

Query: 607 STSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGGIDTEQTYLTRGVD 780
           S S      C                    G   Q F+YIK NGG+DTE+ Y   G D
Sbjct: 187 SLSEQQLVDCAGTFNNFGCHG---------GLPSQAFEYIKYNGGLDTEEAYPYTGKD 235


>UniRef50_Q23RT7 Cluster: Papain family cysteine protease containing
           protein; n=7; Hymenostomatida|Rep: Papain family
           cysteine protease containing protein - Tetrahymena
           thermophila SB210
          Length = 387

 Score = 57.6 bits (133), Expect = 3e-07
 Identities = 25/56 (44%), Positives = 34/56 (60%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
           +VDWR  G VT +KDQG CGSCW+ A   + E      +G L +LS Q L+ C ++
Sbjct: 136 SVDWRDAGVVTPVKDQGHCGSCWAFATTAVIESYAAIATGQLKTLSTQQLVSCVQN 191



 Score = 34.7 bits (76), Expect = 2.7
 Identities = 28/85 (32%), Positives = 39/85 (45%), Gaps = 1/85 (1%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
           E N R +I+ +    I   N   E G   YK G+N++ D    E  +T  G++KT K+  
Sbjct: 57  EYNQRKRIFEQKLKEIKAFNSNSENG---YKKGINQFTDRTAEELRETTLGYSKTVKNAA 113

Query: 436 NLYMKGGSVRGAKFISPANVK-LPE 507
           N   K    R  K     NVK LP+
Sbjct: 114 N---KQNMFRNLKTSDKINVKDLPK 135


>UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,
           partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED:
           hypothetical protein, partial - Ornithorhynchus anatinus
          Length = 224

 Score = 57.2 bits (132), Expect = 4e-07
 Identities = 28/52 (53%), Positives = 34/52 (65%), Gaps = 1/52 (1%)
 Frame = +3

Query: 501 AGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQ-HFRQSGYLVSLSEQN 653
           A   DWRK GAVT +K+QG CGSCW+ A +   E   + R S  LVSLSEQ+
Sbjct: 132 AETCDWRKEGAVTPVKNQGDCGSCWAFAAVGNVESMWYLRASNRLVSLSEQD 183


>UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 344

 Score = 57.2 bits (132), Expect = 4e-07
 Identities = 25/56 (44%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
 Frame = +3

Query: 501 AGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGY-LVSLSEQNLIDC 665
           A  +DWR   A+T +K QGKCGSCW+ A   + E   F ++G  L + SEQ ++DC
Sbjct: 136 APPMDWRNASAITPVKQQGKCGSCWTFASTAVLESFSFIKNGAPLTNFSEQQILDC 191


>UniRef50_Q60EQ9 Cluster: Putative uncharacterized protein
           OJ1280_A04.4; n=1; Oryza sativa (japonica
           cultivar-group)|Rep: Putative uncharacterized protein
           OJ1280_A04.4 - Oryza sativa subsp. japonica (Rice)
          Length = 340

 Score = 57.2 bits (132), Expect = 4e-07
 Identities = 33/95 (34%), Positives = 48/95 (50%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEHXXXX 686
           ++D RK GAV ++K Q  CGSCW+ + +   EG    ++G LVSLSEQ L+DC +     
Sbjct: 133 SIDRRKKGAVVEVKYQEDCGSCWAFSAVAAIEG--INKNGELVSLSEQELVDCDDEAVGC 190

Query: 687 XXXXXXXXXLQVHQGQRGDRHRADLPYEGS*RPIP 791
                       H+ +R     A+ P  G  R +P
Sbjct: 191 GGGHHGGELAVPHRERRVPGGEAE-PERGQHRGLP 224


>UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease
           Gip1p; n=4; Tetrahymena thermophila|Rep:
           Granule-biosynthesis induced protease Gip1p -
           Tetrahymena thermophila
          Length = 345

 Score = 57.2 bits (132), Expect = 4e-07
 Identities = 23/53 (43%), Positives = 34/53 (64%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           +VDWRK G +  +K+QG CGSCW+ A   + E  +  ++  L+  SEQ L+DC
Sbjct: 136 SVDWRKRGVLNPVKNQGTCGSCWTFATAGILESFNQIKNKQLLKFSEQQLVDC 188


>UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:
           Silicatein beta - Suberites domuncula (Sponge)
          Length = 383

 Score = 57.2 bits (132), Expect = 4e-07
 Identities = 25/53 (47%), Positives = 36/53 (67%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           +DWR  G VT +KDQ +CGS ++ + +   EG +    G LV+LSEQN++DCS
Sbjct: 166 MDWRTSGVVTKVKDQLRCGSSYAFSAMASLEGINALSYGSLVTLSEQNIVDCS 218


>UniRef50_Q23H32 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 365

 Score = 57.2 bits (132), Expect = 4e-07
 Identities = 24/53 (45%), Positives = 36/53 (67%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           +VDWR+   V  ++ QG CGSCW+ + +   EG + +Q+G ++  SEQNLIDC
Sbjct: 138 SVDWREK-LVAPVQKQGGCGSCWAFSTVIALEGAYAKQTGNVIKFSEQNLIDC 189



 Score = 41.1 bits (92), Expect = 0.031
 Identities = 19/59 (32%), Positives = 35/59 (59%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHN 432
           E ++R +I+AE+ + I  +NQ  E    + +L +N++ D+   EF +   G+N + KHN
Sbjct: 57  EGDYRFQIFAENYNYIHNYNQINENSQDNIQLEVNEFADLSLQEFRELYFGYNSSKKHN 115


>UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicatein
           a3 - Lubomirskia baicalensis
          Length = 344

 Score = 57.2 bits (132), Expect = 4e-07
 Identities = 26/56 (46%), Positives = 36/56 (64%)
 Frame = +3

Query: 501 AGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           A ++DWR  G VT ++ QG+CGS ++ A     EG     +  LV+LSEQN+IDCS
Sbjct: 129 ADSLDWRTRGVVTSVQSQGQCGSSYAFAAAGALEGATALAADKLVALSEQNIIDCS 184



 Score = 35.9 bits (79), Expect = 1.2
 Identities = 45/176 (25%), Positives = 70/176 (39%), Gaps = 7/176 (3%)
 Frame = +1

Query: 268 RMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYM 447
           R  I+  +K  I  HN   +  L  Y L MN +GD++  EF +       T KH++   +
Sbjct: 64  RHSIWVANKKYIEHHNANAD--LFGYTLAMNGFGDLMSAEFTERY----LTHKHSQRSGL 117

Query: 448 KGGSVRGAKFISPANVKLPERWTGGSTAPSPTSRTKGSVAHAGLQHDWSFGKDSTSVSPA 627
           +  +    K ++ A+  L  R  G  T+     +   S A A           + ++  A
Sbjct: 118 Q--TFESPKGVTYAD-SLDWRTRGVVTSVQSQGQCGSSYAFAA----------AGALEGA 164

Query: 628 TWCXXXXXXXXXXXXXXEQRLQRGAHG-------QRFKYIKDNGGIDTEQTYLTRG 774
           T                +  +  G HG         FKY+ DNGGIDTE +Y  +G
Sbjct: 165 TALAADKLVALSEQNIIDCSVPYGNHGCSGGDVYTAFKYVVDNGGIDTESSYPYKG 220


>UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine
           protease; n=1; Maconellicoccus hirsutus|Rep: Putative
           cathepsin L-like cysteine protease - Maconellicoccus
           hirsutus (hibiscus mealybug)
          Length = 339

 Score = 57.2 bits (132), Expect = 4e-07
 Identities = 39/115 (33%), Positives = 60/115 (52%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
           ED  RMKI+ ++K+ IA+HN+ +  GLV+++ G+N+Y DML  EF + M    + + + +
Sbjct: 45  EDRLRMKIFIDNKYRIAQHNKLFHKGLVTFEQGINEYSDMLQSEFNEKM---GQKSSNQR 101

Query: 436 NLYMKGGSVRGAKFISPANVKLPERWTGGSTAPSPTSRTKGSVAHAGLQHDWSFG 600
           N    G  +   +F    NV  P+         S   RTKG V   G Q + S G
Sbjct: 102 NTEANG--LPSIRFTPLHNVNPPD---------SVDWRTKGLVGPVGKQVNCSSG 145



 Score = 39.5 bits (88), Expect = 0.096
 Identities = 20/55 (36%), Positives = 28/55 (50%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           +VDWR  G V  +  Q  C S ++ + +   EGQ          +S QN+IDCSE
Sbjct: 124 SVDWRTKGLVGPVGKQVNCSSGYAWSAIGALEGQLASDKKKFQGISVQNVIDCSE 178


>UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11;
           Entamoeba|Rep: Cysteine proteinase 2 precursor -
           Entamoeba histolytica
          Length = 315

 Score = 57.2 bits (132), Expect = 4e-07
 Identities = 30/97 (30%), Positives = 48/97 (49%), Gaps = 3/97 (3%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSG---YLVSLSEQNLIDC 665
           +A  +VDWRK G VT I+DQ +CGSC++   L   EG+   + G     + LSE++++ C
Sbjct: 93  QAPESVDWRKEGKVTPIRDQAQCGSCYTFGSLAALEGRLLIEKGGDANTLDLSEEHMVQC 152

Query: 666 SEHXXXXXXXXXXXXXLQVHQGQRGDRHRADLPYEGS 776
           +               +  +  + G    +D PY GS
Sbjct: 153 TRDNGNNGCNGGLGSNVYDYIIEHGVAKESDYPYTGS 189


>UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase precursor;
           n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
           L-like proteinase precursor - Diabrotica virgifera
           virgifera (western corn rootworm)
          Length = 315

 Score = 56.4 bits (130), Expect = 8e-07
 Identities = 29/64 (45%), Positives = 36/64 (56%), Gaps = 1/64 (1%)
 Frame = +3

Query: 477 HIAGQR-EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQN 653
           H+A    +A   VDWR   AV  +KDQG+CGSCW+ +     EGQ        V LSEQ 
Sbjct: 103 HVADPNVQAVEEVDWRD-SAVLGVKDQGQCGSCWAFSTTGSLEGQLAIHKNQRVPLSEQE 161

Query: 654 LIDC 665
           L+DC
Sbjct: 162 LVDC 165



 Score = 39.1 bits (87), Expect = 0.13
 Identities = 17/45 (37%), Positives = 25/45 (55%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEF 390
           ED  R  ++ ++   I +HN KYE G  +Y L +NK+ D    EF
Sbjct: 39  EDKLRFAVFQDNLKKIEEHNAKYESGEETYYLAVNKFADWSSAEF 83


>UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163
           precursor; n=4; Schizophora|Rep: Putative cysteine
           proteinase CG12163 precursor - Drosophila melanogaster
           (Fruit fly)
          Length = 614

 Score = 56.4 bits (130), Expect = 8e-07
 Identities = 24/51 (47%), Positives = 33/51 (64%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           DWR+  AVT +K+QG CGSCW+ +     EG +  ++G L   SEQ L+DC
Sbjct: 399 DWRQKDAVTQVKNQGSCGSCWAFSVTGNIEGLYAVKTGELKEFSEQELLDC 449


>UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus
           tropicalis|Rep: LOC594890 protein - Xenopus tropicalis
           (Western clawed frog) (Silurana tropicalis)
          Length = 355

 Score = 56.0 bits (129), Expect = 1e-06
 Identities = 25/56 (44%), Positives = 37/56 (66%), Gaps = 1/56 (1%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHF-RQSGYLVSLSEQNLIDCSE 671
           ++DWR    VT +KDQG C + W+ + +   E Q+  R++G L SLS QNL+DCS+
Sbjct: 142 SIDWRNKNCVTSVKDQGSCIASWAFSSIGALECQNMKRRTGKLESLSVQNLLDCSQ 197



 Score = 43.2 bits (97), Expect = 0.008
 Identities = 22/55 (40%), Positives = 31/55 (56%), Gaps = 1/55 (1%)
 Frame = +1

Query: 244 RKRGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFV-KTMN 405
           +  GE+  R  I+ +    I  HN +Y MGL +Y++GMN  GDM+  E   K MN
Sbjct: 64  KNEGEELARRLIWEDTLKFIMLHNLEYSMGLHTYEVGMNHLGDMVAEEMTDKQMN 118


>UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-ear
           cress). SAG12 protein; n=2; Dictyostelium
           discoideum|Rep: Similar to Arabidopsis thaliana
           (Mouse-ear cress). SAG12 protein - Dictyostelium
           discoideum (Slime mold)
          Length = 358

 Score = 56.0 bits (129), Expect = 1e-06
 Identities = 23/56 (41%), Positives = 34/56 (60%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
           ++DWRK G VT +KDQG+CGSC+  + +E  E    +     + LSEQ  +DC  +
Sbjct: 148 SIDWRKKGLVTPVKDQGQCGSCYIFSAVEQIETAWIKAGNKPILLSEQQAVDCDPY 203


>UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep:
           LOC443661 protein - Xenopus laevis (African clawed frog)
          Length = 346

 Score = 55.6 bits (128), Expect = 1e-06
 Identities = 23/60 (38%), Positives = 37/60 (61%)
 Frame = +3

Query: 489 QREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           + +   ++DWR  G VT ++ Q KCGSC++ + +   E Q  ++ G LV+ S Q L+DCS
Sbjct: 137 EAQPPASIDWRTKGCVTSVRRQRKCGSCYAFSAVGALECQWKKKKGTLVTFSPQELVDCS 196



 Score = 46.0 bits (104), Expect = 0.001
 Identities = 21/55 (38%), Positives = 30/55 (54%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKT 420
           E+  R  I+ E    I  HN +Y +GL +Y++GMN  GDM   E   TM G+  +
Sbjct: 67  EERARRTIWEETLKFITVHNLEYSLGLHTYEVGMNHLGDMTGEEVEATMTGYTSS 121


>UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteinase
           B; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like
           tick cysteine proteinase B - Haemaphysalis longicornis
           (Bush tick)
          Length = 332

 Score = 55.6 bits (128), Expect = 1e-06
 Identities = 32/80 (40%), Positives = 42/80 (52%)
 Frame = +3

Query: 342 LQAGHEQVRRHAPPRVREDYERLQQNCQTQQESVHEGWERPRG*VHIAGQREAAGAVDWR 521
           +QA HE+V R   PRV E  +RLQ        +    +  P G             +DWR
Sbjct: 70  VQARHERVWRLVAPRVCEHPQRLQAQLPGPP-TWGSTYIEPEG----LEDEHLPKTMDWR 124

Query: 522 KHGAVTDIKDQGKCGSCWSS 581
           K GAVT +K+QG+CGSCW+S
Sbjct: 125 KKGAVTPVKNQGQCGSCWAS 144


>UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila
           melanogaster|Rep: CG5367-PA - Drosophila melanogaster
           (Fruit fly)
          Length = 338

 Score = 54.8 bits (126), Expect = 2e-06
 Identities = 29/88 (32%), Positives = 44/88 (50%), Gaps = 1/88 (1%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS-EHXXX 683
           ++DWR  G +T   +Q  CGSC++ +  E   GQ F+++G ++SLS+Q ++DCS  H   
Sbjct: 130 SLDWRSKGFITPPYNQLSCGSCYAFSIAESIMGQVFKRTGKILSLSKQQIVDCSVSHGNQ 189

Query: 684 XXXXXXXXXXLQVHQGQRGDRHRADLPY 767
                     L   Q   G     D PY
Sbjct: 190 GCVGGSLRNTLSYLQSTGGIMRDQDYPY 217



 Score = 34.7 bits (76), Expect = 2.7
 Identities = 18/53 (33%), Positives = 29/53 (54%)
 Frame = +1

Query: 274 KIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHN 432
           K + E+  +I +HNQ Y+ G  S++L  N + DM    ++K   GF +  K N
Sbjct: 58  KAFEENFKVIEEHNQNYKEGQTSFRLKPNIFADMSTDGYLK---GFLRLLKSN 107


>UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 339

 Score = 54.8 bits (126), Expect = 2e-06
 Identities = 24/55 (43%), Positives = 39/55 (70%), Gaps = 1/55 (1%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKC-GSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           ++DWR   AVT +K+QG C G+ +S + + + E  HF ++  L++LSEQN+IDC+
Sbjct: 117 SIDWRNFDAVTPVKNQGLCSGAGYSFSAIGVIESSHFIKNKELITLSEQNIIDCT 171


>UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like
           cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
           Clan CA, family C1, cathepsin L-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 291

 Score = 54.8 bits (126), Expect = 2e-06
 Identities = 23/59 (38%), Positives = 36/59 (61%)
 Frame = +3

Query: 492 REAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           +++ G +D+R+ G V  I+DQ +CGSCW+   +   E  +      L  LSEQN+IDC+
Sbjct: 76  KDSPGILDYREMGVVNPIRDQKQCGSCWAFGTVAACESNYALLYSNLPQLSEQNIIDCA 134


>UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16)
           [Contains: Cathepsin H mini chain; Cathepsin H heavy
           chain; Cathepsin H light chain]; n=37; Eukaryota|Rep:
           Cathepsin H precursor (EC 3.4.22.16) [Contains:
           Cathepsin H mini chain; Cathepsin H heavy chain;
           Cathepsin H light chain] - Homo sapiens (Human)
          Length = 335

 Score = 54.8 bits (126), Expect = 2e-06
 Identities = 24/56 (42%), Positives = 37/56 (66%), Gaps = 1/56 (1%)
 Frame = +3

Query: 507 AVDWRKHGA-VTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           +VDWRK G  V+ +K+QG CGSCW+ +     E      +G ++SL+EQ L+DC++
Sbjct: 119 SVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQ 174


>UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2;
           Entamoeba|Rep: Cysteine proteinase ACP1 precursor -
           Entamoeba histolytica
          Length = 308

 Score = 54.8 bits (126), Expect = 2e-06
 Identities = 26/61 (42%), Positives = 35/61 (57%)
 Frame = +3

Query: 483 AGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLID 662
           A  + A  +VDWR    +   KDQG+CGSCW+     + EG+  +  G L S SEQ L+D
Sbjct: 88  AAVKAAPESVDWRS--IMNPAKDQGQCGSCWTFCTTAVLEGRVNKDLGKLYSFSEQQLVD 145

Query: 663 C 665
           C
Sbjct: 146 C 146


>UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_23,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 321

 Score = 54.4 bits (125), Expect = 3e-06
 Identities = 25/53 (47%), Positives = 33/53 (62%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           VDW + G V  IKDQG CGSCW+ + +   E     Q   +V LSEQ+L+DC+
Sbjct: 123 VDWVQKGKVPAIKDQGDCGSCWAFSAVGALEINTKIQFNEIVDLSEQDLVDCA 175


>UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 280

 Score = 54.0 bits (124), Expect = 4e-06
 Identities = 23/54 (42%), Positives = 33/54 (61%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
           DWR  G VT +K+QG CGSCW+     L+E  +  ++  +   SEQ L+DCS +
Sbjct: 73  DWRNLGKVTQVKNQGNCGSCWAFTITGLFESINLIRNKTVELYSEQELLDCSSN 126


>UniRef50_Q23FR0 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 394

 Score = 54.0 bits (124), Expect = 4e-06
 Identities = 24/56 (42%), Positives = 33/56 (58%), Gaps = 1/56 (1%)
 Frame = +3

Query: 501 AGAVDWRK-HGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           A +VDWR     +  +KDQG+CGSCW+     + E  +   +G L S SEQ L+DC
Sbjct: 184 AASVDWRNVKNVLNPVKDQGQCGSCWTFGAAGVMESFNAITNGVLKSFSEQQLVDC 239


>UniRef50_Q23894 Cluster: Cysteine proteinase 3; n=2; Dictyostelium
           discoideum|Rep: Cysteine proteinase 3 - Dictyostelium
           discoideum (Slime mold)
          Length = 151

 Score = 54.0 bits (124), Expect = 4e-06
 Identities = 28/53 (52%), Positives = 36/53 (67%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           VDWR+  AVT +KDQG+CGSC  S    + EG    ++G LVSLSEQN++  S
Sbjct: 80  VDWREKDAVTPVKDQGQCGSCIISTTGSV-EGVTAIKTGKLVSLSEQNILRLS 131



 Score = 34.7 bits (76), Expect = 2.7
 Identities = 19/45 (42%), Positives = 29/45 (64%), Gaps = 1/45 (2%)
 Frame = +2

Query: 575 VFSTTGAL-GRTALPSVRLPGVALGAKPHRLLGAYGNNGCNGGLM 706
           + STTG++ G TA+ + +L  ++      RL  ++GN GCNGGLM
Sbjct: 101 IISTTGSVEGVTAIKTGKLVSLS-EQNILRLSSSFGNEGCNGGLM 144


>UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole
           genome shotgun sequence; n=1; Tetraodon
           nigroviridis|Rep: Chromosome undetermined SCAF6860,
           whole genome shotgun sequence - Tetraodon nigroviridis
           (Green puffer)
          Length = 251

 Score = 53.6 bits (123), Expect = 5e-06
 Identities = 22/39 (56%), Positives = 30/39 (76%)
 Frame = +3

Query: 555 GKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           G CGSCW+ +     EGQ ++++G LVSLSEQNL+DCS+
Sbjct: 1   GYCGSCWAFSTTGAIEGQIYKKTGQLVSLSEQNLVDCSK 39


>UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia
           bovis|Rep: Cysteine protease 2 - Babesia bovis
          Length = 445

 Score = 53.6 bits (123), Expect = 5e-06
 Identities = 26/52 (50%), Positives = 32/52 (61%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           +DWR+  AVT +KDQG CGSCW+ A +   E    RQ    V LSEQ L+ C
Sbjct: 240 IDWRRADAVTPVKDQGMCGSCWAFAAVGSVESLLKRQKTD-VRLSEQELVSC 290


>UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_21,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 349

 Score = 53.6 bits (123), Expect = 5e-06
 Identities = 26/54 (48%), Positives = 35/54 (64%), Gaps = 1/54 (1%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYL-VSLSEQNLIDCS 668
           VD RK G V+++K+QG CGSCW+ + +   E    RQ G   V LSEQ L+DC+
Sbjct: 129 VDLRKDGVVSEVKNQGSCGSCWAFSAVAALE-TALRQGGVKNVELSEQELVDCA 181



 Score = 39.9 bits (89), Expect = 0.072
 Identities = 19/68 (27%), Positives = 36/68 (52%), Gaps = 1/68 (1%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
           E++ R  I+ ++   I +H Q+ E GL +++LG+N + D+   EF      +  T +   
Sbjct: 55  ENSHRFGIFKKNYQYIQEHQQRVEAGLETFELGLNDFADLSVEEFEAKYLKYRSTPREQT 114

Query: 436 N-LYMKGG 456
           N +Y + G
Sbjct: 115 NQVYRRTG 122


>UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1;
           Bigelowiella natans|Rep: Digestive cysteine proteinase -
           Bigelowiella natans (Pedinomonas minutissima)
           (Chlorarachnion sp.(strain CCMP 621))
          Length = 360

 Score = 53.2 bits (122), Expect = 7e-06
 Identities = 22/58 (37%), Positives = 36/58 (62%), Gaps = 4/58 (6%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHF-RQSGYL---VSLSEQNLIDCSEH 674
           DWR   A+T +KDQG CGSCW+ +  +  E  H+ + +  L   ++LS + L++C +H
Sbjct: 114 DWRDFNALTPVKDQGGCGSCWAFSATQALESAHYIKHNDTLDSPIALSTEQLVECDQH 171


>UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1;
           Oryza sativa (japonica cultivar-group)|Rep: Putative
           uncharacterized protein - Oryza sativa subsp. japonica
           (Rice)
          Length = 289

 Score = 53.2 bits (122), Expect = 7e-06
 Identities = 23/46 (50%), Positives = 30/46 (65%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSE 647
           +DWR  GAVT +KDQG CGSCW+ A +   EG    ++G L  LS+
Sbjct: 128 IDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSD 173


>UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia
           tropicalis|Rep: Blo t 1 allergen - Blomia tropicalis
           (Mite)
          Length = 333

 Score = 53.2 bits (122), Expect = 7e-06
 Identities = 22/52 (42%), Positives = 31/52 (59%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           DWR+   +T I+ QG CGSCW+ A   + E  +  Q    + LSEQ L+DC+
Sbjct: 118 DWRQKARLTRIRQQGSCGSCWAFAAAGVAESLYSIQKQQSIELSEQELVDCT 169


>UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome
           shotgun sequence; n=3; Tetraodontidae|Rep: Chromosome 12
           SCAF14996, whole genome shotgun sequence - Tetraodon
           nigroviridis (Green puffer)
          Length = 362

 Score = 52.8 bits (121), Expect = 1e-05
 Identities = 22/52 (42%), Positives = 34/52 (65%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGF 411
           E+ +R  ++ ++   I  HN ++ MG  SY+LGMN +GDM H EF + MNG+
Sbjct: 43  EEGWRRMVWEKNLKKIELHNLEHSMGQHSYRLGMNHFGDMTHEEFRQIMNGY 94



 Score = 45.2 bits (102), Expect = 0.002
 Identities = 19/22 (86%), Positives = 21/22 (95%)
 Frame = +3

Query: 603 GQHFRQSGYLVSLSEQNLIDCS 668
           GQHFRQ+G LVSLSEQNL+DCS
Sbjct: 183 GQHFRQTGKLVSLSEQNLVDCS 204



 Score = 37.9 bits (84), Expect = 0.29
 Identities = 16/30 (53%), Positives = 20/30 (66%)
 Frame = +1

Query: 697 GAHGQRFKYIKDNGGIDTEQTYLTRGVDDQ 786
           G   Q F+YIKDNGG+D+E +Y     DDQ
Sbjct: 215 GLMDQAFQYIKDNGGLDSEASYPYLATDDQ 244


>UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza
           sativa|Rep: Os09g0381400 protein - Oryza sativa subsp.
           japonica (Rice)
          Length = 362

 Score = 52.8 bits (121), Expect = 1e-05
 Identities = 25/61 (40%), Positives = 33/61 (54%), Gaps = 1/61 (1%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQ-GKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           +   +VDWR  GAV   K Q   C SCW+       E  +  ++G LVSLSEQ L+DC  
Sbjct: 143 DVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCDS 202

Query: 672 H 674
           +
Sbjct: 203 Y 203


>UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2;
           Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio
           molitor (Yellow mealworm)
          Length = 336

 Score = 52.8 bits (121), Expect = 1e-05
 Identities = 24/53 (45%), Positives = 32/53 (60%), Gaps = 2/53 (3%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQH--FRQSGYLVSLSEQNLIDC 665
           DWR  G V+ +K+QG CGSCW+ +     E Q      +GY  S+SEQ L+DC
Sbjct: 126 DWRDQGMVSPVKNQGSCGSCWAFSSTGAIESQMKIANGAGYDSSVSEQQLVDC 178



 Score = 48.4 bits (110), Expect = 2e-04
 Identities = 23/61 (37%), Positives = 33/61 (54%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
           E+ FR +I+ +      +HN+KY  GLVSY LG+N + DM   E     +G    A  +K
Sbjct: 43  EETFRKQIFQKKLETFEEHNEKYRQGLVSYTLGVNLFTDMTPEEMKAYTHGLIMPADLHK 102

Query: 436 N 438
           N
Sbjct: 103 N 103


>UniRef50_Q23FQ5 Cluster: Papain family cysteine protease containing
           protein; n=4; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 332

 Score = 52.8 bits (121), Expect = 1e-05
 Identities = 22/54 (40%), Positives = 35/54 (64%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           +VDWR  GA+  I++QG+CGSC +     + E  ++ +S  L+  SEQ L+DC+
Sbjct: 128 SVDWRNSGALNPIQNQGQCGSCAAFGTAGVLESFYYLKSKQLLKFSEQQLLDCA 181


>UniRef50_Q22AB1 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 344

 Score = 52.8 bits (121), Expect = 1e-05
 Identities = 23/51 (45%), Positives = 30/51 (58%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           DWR  G +T  K Q  CGSCW+ A   + E Q+  + G L+  SEQ L+DC
Sbjct: 136 DWRDKGIITPAKFQNTCGSCWTFATTGVIESQYALKYGELLHFSEQMLLDC 186


>UniRef50_UPI0001509FF9 Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 348

 Score = 52.0 bits (119), Expect = 2e-05
 Identities = 22/59 (37%), Positives = 35/59 (59%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           E   ++DW    AV+++K QG C S W+ A +   E   F ++G +  +SEQNL+DC +
Sbjct: 139 EPVNSIDWISKNAVSNVKTQGMCQSSWAFAAVAGVESALFLKNGKIPDVSEQNLLDCDQ 197


>UniRef50_Q24FA8 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 335

 Score = 52.0 bits (119), Expect = 2e-05
 Identities = 24/59 (40%), Positives = 36/59 (61%), Gaps = 2/59 (3%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQH-FRQSGYLVSL-SEQNLIDC 665
           + A ++DWRK G V+ +K+QG+CG CW+ +   L E  +        VSL S+Q L+DC
Sbjct: 123 QIASSIDWRKKGGVSPVKNQGECGGCWTFSATGLMESFNLIHNKPQNVSLYSQQQLLDC 181


>UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precursor;
           n=3; Plasmodium (Laverania)|Rep: Trophozoite cysteine
           proteinase precursor - Plasmodium falciparum
          Length = 569

 Score = 52.0 bits (119), Expect = 2e-05
 Identities = 22/54 (40%), Positives = 36/54 (66%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           +D+R+ G V + KDQG CGSCW+ A +   E    +++  ++S SEQ ++DCS+
Sbjct: 337 LDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSK 390


>UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1;
           Uronema marinum|Rep: Cathepsin L-like cysteine protease
           - Uronema marinum
          Length = 333

 Score = 51.6 bits (118), Expect = 2e-05
 Identities = 22/55 (40%), Positives = 35/55 (63%)
 Frame = +3

Query: 501 AGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           +G+V+W   GAV  +++QG CGSCW+ + +   E  +   +G L+S SEQ L+ C
Sbjct: 121 SGSVNWVSKGAVQGVQNQGVCGSCWAFSAVCSLERLYKINTGKLLSFSEQQLVSC 175


>UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1;
           Trypanosoma cruzi|Rep: Cysteine protease, putative -
           Trypanosoma cruzi
          Length = 434

 Score = 51.6 bits (118), Expect = 2e-05
 Identities = 23/55 (41%), Positives = 34/55 (61%), Gaps = 2/55 (3%)
 Frame = +3

Query: 507 AVDWR--KHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           A++W+  K+  +T +KDQG CGSCW+ A  E  E  +   SG L++LS Q +  C
Sbjct: 128 ALNWQEAKNPVLTPVKDQGSCGSCWAHAATESVESMYAISSGKLLTLSTQQITSC 182


>UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin O;
           n=1; Danio rerio|Rep: PREDICTED: similar to cathepsin O
           - Danio rerio
          Length = 327

 Score = 51.2 bits (117), Expect = 3e-05
 Identities = 22/52 (42%), Positives = 29/52 (55%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           DWR HG V  + +QG CG CW+ + +E  E    +    L  LS Q +IDCS
Sbjct: 125 DWRDHGVVGPVHNQGSCGGCWAFSIVEAIESVSAKVGEKLQQLSVQQVIDCS 176


>UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin O
           precursor; n=1; Tribolium castaneum|Rep: PREDICTED:
           similar to Cathepsin O precursor - Tribolium castaneum
          Length = 326

 Score = 51.2 bits (117), Expect = 3e-05
 Identities = 22/53 (41%), Positives = 33/53 (62%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           VDWR+  AVT I +QG CG+CW+ + +E  E  +  ++     LS Q +IDC+
Sbjct: 125 VDWREKNAVTRIYNQGSCGACWAYSVIETVESMNAIKTNKSEELSVQEIIDCA 177


>UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba
           histolytica|Rep: Cysteine protease 10 - Entamoeba
           histolytica
          Length = 297

 Score = 51.2 bits (117), Expect = 3e-05
 Identities = 21/61 (34%), Positives = 36/61 (59%), Gaps = 1/61 (1%)
 Frame = +3

Query: 492 REAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYL-VSLSEQNLIDCS 668
           +E   ++DWR  G VT +K+Q KC SC++   +   E    +++    + LSEQ ++DCS
Sbjct: 106 KEVLDSIDWRSEGKVTPVKNQRKCASCYAFGSIATIESLIMQETSIKEIDLSEQQIVDCS 165

Query: 669 E 671
           +
Sbjct: 166 Q 166


>UniRef50_A7APS9 Cluster: Papain family cysteine protease containing
           protein; n=1; Babesia bovis|Rep: Papain family cysteine
           protease containing protein - Babesia bovis
          Length = 435

 Score = 51.2 bits (117), Expect = 3e-05
 Identities = 23/52 (44%), Positives = 32/52 (61%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           +D RK   +T +KDQG CGSCW+ + + + E     +    V LSEQNL+DC
Sbjct: 230 IDLRKDNYMTPVKDQGNCGSCWAFSLIGVAEPFFKHKRDIDVVLSEQNLVDC 281


>UniRef50_Q5C1E8 Cluster: SJCHGC04937 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC04937 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 235

 Score = 50.8 bits (116), Expect = 4e-05
 Identities = 24/53 (45%), Positives = 32/53 (60%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           DWR    VT++K+Q KCG  W+ A +   EGQ    S  L SLS Q L+DC++
Sbjct: 165 DWRTKNVVTNVKNQEKCGCGWAFASVGALEGQMKLHSIPLQSLSTQQLVDCTQ 217



 Score = 34.7 bits (76), Expect = 2.7
 Identities = 14/40 (35%), Positives = 25/40 (62%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDM 375
           E+ +R  I+  +   I  HN  Y++ LV+Y LG+N++ D+
Sbjct: 75  EEIYRRHIWNMYVSRIGLHNLHYDLNLVTYTLGINQFSDL 114


>UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3;
           Theileria|Rep: Cysteine protease, putative - Theileria
           annulata
          Length = 580

 Score = 50.8 bits (116), Expect = 4e-05
 Identities = 20/52 (38%), Positives = 31/52 (59%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           VDWR+ G V ++ +QG CGSCW+ A  +++      +   L+  S Q L+DC
Sbjct: 368 VDWRESGFVNEVVNQGSCGSCWAIASEDIFSTFKSIKKNKLMKFSSQQLVDC 419


>UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia
           medanensis|Rep: Sui m 1 allergen - Suidasia medanensis
          Length = 336

 Score = 50.8 bits (116), Expect = 4e-05
 Identities = 23/53 (43%), Positives = 33/53 (62%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           A DWR+    T +++QG+CGSCW+ A     E Q+  +    V+LSEQ L+DC
Sbjct: 118 AFDWRQQWN-TAVRNQGQCGSCWAFATAATVEAQYAIRKNVHVTLSEQQLVDC 169


>UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5;
           Theileria|Rep: Cysteine proteinase precursor - Theileria
           parva
          Length = 440

 Score = 50.8 bits (116), Expect = 4e-05
 Identities = 20/52 (38%), Positives = 29/52 (55%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           +DWR+  +VT +KDQ  CG CW+ + +   EG +         LS Q L+DC
Sbjct: 233 LDWRRSSSVTSVKDQSNCGGCWAFSTVGSVEGYYMSHFDKSYELSVQELLDC 284


>UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia
           theta|Rep: Cathepsin H precursor - Guillardia theta
           (Cryptomonas phi)
          Length = 353

 Score = 50.4 bits (115), Expect = 5e-05
 Identities = 24/61 (39%), Positives = 34/61 (55%), Gaps = 5/61 (8%)
 Frame = +3

Query: 501 AGAVDWRKH-----GAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           A   DWR         V+ +K+QG CGSCW+ +     E  H  ++G +V LSEQ L+DC
Sbjct: 119 ADEFDWRNQTCGETSCVSMVKNQGTCGSCWTFSTAAALESLHAIKTGEMVLLSEQQLVDC 178

Query: 666 S 668
           +
Sbjct: 179 A 179


>UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1;
           Caenorhabditis elegans|Rep: Putative uncharacterized
           protein - Caenorhabditis elegans
          Length = 299

 Score = 50.4 bits (115), Expect = 5e-05
 Identities = 22/54 (40%), Positives = 35/54 (64%), Gaps = 1/54 (1%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFR-QSGYLVSLSEQNLIDCS 668
           +DWR+ G V  +KDQGKC + ++ A +   E  + +  +G L+S SEQ +IDC+
Sbjct: 84  LDWREKGIVGPVKDQGKCNASYAFAAIAAIESMYAKANNGKLLSFSEQQIIDCA 137


>UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_46,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 336

 Score = 50.4 bits (115), Expect = 5e-05
 Identities = 23/53 (43%), Positives = 32/53 (60%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           +VDWRK   +T +KDQG+C  CW+   +   E   + ++   V LSEQ LIDC
Sbjct: 145 SVDWRK---ITQVKDQGQCSGCWAFGAVGAAEAWFYVKNKTTVLLSEQQLIDC 194


>UniRef50_Q9BL26 Cluster: Putative uncharacterized protein; n=4;
           Caenorhabditis elegans|Rep: Putative uncharacterized
           protein - Caenorhabditis elegans
          Length = 345

 Score = 50.0 bits (114), Expect = 7e-05
 Identities = 28/74 (37%), Positives = 41/74 (55%), Gaps = 1/74 (1%)
 Frame = +3

Query: 453 WERPRG*VHIAGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFR-QSGY 629
           WE P   +H+   R     +DWR+ G V  +KDQGKC +  + A     E  + +  +G 
Sbjct: 72  WETP---IHM--DRTTEEFLDWREKGIVGPVKDQGKCNASHAFAITSSIESMYAKATNGT 126

Query: 630 LVSLSEQNLIDCSE 671
           L+S SEQ LIDC++
Sbjct: 127 LLSFSEQQLIDCND 140


>UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5;
           Piroplasmida|Rep: Cysteine proteinase, putative -
           Theileria parva
          Length = 460

 Score = 49.6 bits (113), Expect = 9e-05
 Identities = 23/53 (43%), Positives = 32/53 (60%), Gaps = 1/53 (1%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQG-KCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           +DWRK   V+ IK+QG +CGSCW+ A +   E  +       + LSEQ L+DC
Sbjct: 253 LDWRKADGVSKIKNQGLECGSCWAFASVSSVESLYKIYRNVTLDLSEQELVDC 305


>UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-like
           cysteine peptidase; n=3; Trichomonas vaginalis|Rep: Clan
           CA, family C1, cathepsin L or K-like cysteine peptidase
           - Trichomonas vaginalis G3
          Length = 320

 Score = 49.6 bits (113), Expect = 9e-05
 Identities = 20/51 (39%), Positives = 31/51 (60%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           DWR  G +  I++QG+CG CW+ + +   E +  +    L+ LSEQ L+DC
Sbjct: 109 DWRTKGIINPIRNQGQCGLCWAFSTICCVEARWAQAYNTLLQLSEQMLVDC 159


>UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease
           containing protein; n=1; Tetrahymena thermophila
           SB210|Rep: Papain family cysteine protease containing
           protein - Tetrahymena thermophila SB210
          Length = 350

 Score = 49.2 bits (112), Expect = 1e-04
 Identities = 24/88 (27%), Positives = 45/88 (51%), Gaps = 1/88 (1%)
 Frame = +3

Query: 408 LQQNCQTQQESVHEGWERPRG*VHIAGQREAAGAVDWRKH-GAVTDIKDQGKCGSCWSSA 584
           L    +T   S  +  + P+    +     A+   DWR + G + ++K+QG+CGSCW+ A
Sbjct: 109 LNSQLKTSASSSSQPAQTPQLRGSVDASLNASQGFDWRNYQGVLGNVKNQGQCGSCWTFA 168

Query: 585 RLELWEGQHFRQSGYLVSLSEQNLIDCS 668
              + E  +  +    +  SEQ+++DC+
Sbjct: 169 TAGVLESYYALKYQQSLIFSEQDIVDCA 196


>UniRef50_Q22NW9 Cluster: Papain family cysteine protease containing
           protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 894

 Score = 49.2 bits (112), Expect = 1e-04
 Identities = 29/93 (31%), Positives = 38/93 (40%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
           E   ++DWR   AVT +K+QG CGS ++ +     EG H          SEQ +IDCS  
Sbjct: 682 EVPSSIDWRDLNAVTPVKNQGSCGSGYAFSTTGALEGIHKISGKDWKGFSEQQIIDCSRK 741

Query: 675 XXXXXXXXXXXXXLQVHQGQRGDRHRADLPYEG 773
                              + G     D PYEG
Sbjct: 742 QGNSGCHGGFMENAFDFVIENGILQENDYPYEG 774


>UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18;
           Plasmodium|Rep: Cysteine proteinase precursor -
           Plasmodium vivax (strain Salvador I)
          Length = 583

 Score = 49.2 bits (112), Expect = 1e-04
 Identities = 22/55 (40%), Positives = 38/55 (69%), Gaps = 1/55 (1%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQ-SGYLVSLSEQNLIDCSE 671
           +D+R+ G V + KDQG CGSCW+ A +   E  + ++ +  +++LSEQ ++DCS+
Sbjct: 343 LDYREKGIVHEPKDQGLCGSCWAFASVGNVECMYAKEHNKTILTLSEQEVVDCSK 397


>UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:
           Viral cathepsin - Xestia c-nigrum granulosis virus
           (XnGV) (Xestia c-nigrumgranulovirus)
          Length = 346

 Score = 49.2 bits (112), Expect = 1e-04
 Identities = 20/53 (37%), Positives = 31/53 (58%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           DWR   +VT +K Q +CGSCW+ + +   E  +  +    + LSEQ L+DC +
Sbjct: 138 DWRDRNSVTSVKMQKECGSCWAFSAVANIESLYHIKHNVSLDLSEQQLVDCDK 190


>UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n=1;
           Rattus norvegicus|Rep: UPI0000501FDB UniRef100 entry -
           Rattus norvegicus
          Length = 338

 Score = 48.8 bits (111), Expect = 2e-04
 Identities = 20/40 (50%), Positives = 28/40 (70%)
 Frame = +3

Query: 552 QGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           QG+C SCW+   +   EGQ F+++G L  LS QNL+DCS+
Sbjct: 139 QGRCNSCWAFPVVGAIEGQMFKKTGKLTPLSVQNLVDCSK 178


>UniRef50_Q7KWP5 Cluster: Similar to Dictyostelium discoideum (Slime
           mold). Cysteine proteinase 5; n=2; Dictyostelium
           discoideum|Rep: Similar to Dictyostelium discoideum
           (Slime mold). Cysteine proteinase 5 - Dictyostelium
           discoideum (Slime mold)
          Length = 345

 Score = 48.8 bits (111), Expect = 2e-04
 Identities = 27/59 (45%), Positives = 32/59 (54%), Gaps = 3/59 (5%)
 Frame = +3

Query: 501 AGAVDWRKHGAVTDIKDQ-GKCGSCWSSARLELWEGQHF--RQSGYLVSLSEQNLIDCS 668
           +  +DWRK GAV  +K Q G CGS W    +   E  HF        +SLS QNLIDCS
Sbjct: 121 SSGIDWRKKGAVPSVKSQIGGCGS-WPITAVGATESAHFLANPKDPFISLSMQNLIDCS 178


>UniRef50_Q75JH0 Cluster: Similar to Dictyostelium discoideum (Slime
           mold). Gamete and mating- type specific protein A; n=2;
           Dictyostelium discoideum|Rep: Similar to Dictyostelium
           discoideum (Slime mold). Gamete and mating- type
           specific protein A - Dictyostelium discoideum (Slime
           mold)
          Length = 415

 Score = 48.8 bits (111), Expect = 2e-04
 Identities = 23/59 (38%), Positives = 33/59 (55%), Gaps = 3/59 (5%)
 Frame = +3

Query: 498 AAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYL---VSLSEQNLIDC 665
           + G VDW+  G VT IK+QG+CG C+S A     E  +  ++      + LSEQN + C
Sbjct: 209 STGDVDWKSLGFVTSIKNQGQCGGCYSFATCAALESAYLIKNNLPNTDIDLSEQNFVSC 267


>UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted,
           possible transmembrane domain near N-terminus; n=4;
           Cryptosporidium|Rep: Cryptopain-cysteine proteinase
           secreted, possible transmembrane domain near N-terminus
           - Cryptosporidium parvum Iowa II
          Length = 401

 Score = 48.8 bits (111), Expect = 2e-04
 Identities = 22/56 (39%), Positives = 34/56 (60%), Gaps = 1/56 (1%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGY-LVSLSEQNLIDCSE 671
           +++W + G V  I++Q  CGSCW+ + +   EG    Q+   L SLSEQ  +DCS+
Sbjct: 179 SINWVEAGCVNPIRNQKNCGSCWAFSAVAALEGATCAQTNRGLPSLSEQQFVDCSK 234



 Score = 38.3 bits (85), Expect = 0.22
 Identities = 24/87 (27%), Positives = 44/87 (50%), Gaps = 3/87 (3%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
           E+N R +IY ++ + I   N +   G  SY L MN++GD+   EF+    G+ K +K ++
Sbjct: 102 EENQRFEIYKQNMNFIKTTNSQ---GF-SYVLEMNEFGDLSKEEFMARFTGYIKDSKDDE 157

Query: 436 NLYMK---GGSVRGAKFISPANVKLPE 507
            ++       S    +F+ P ++   E
Sbjct: 158 RVFKSSRVSASESEEEFVPPNSINWVE 184


>UniRef50_Q248G1 Cluster: Papain family cysteine protease containing
           protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 334

 Score = 48.8 bits (111), Expect = 2e-04
 Identities = 22/54 (40%), Positives = 32/54 (59%), Gaps = 1/54 (1%)
 Frame = +3

Query: 507 AVDWRK-HGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           +VDWR     V  IK+QG CGSCW+ +   + E  +  + G  VS +EQ ++DC
Sbjct: 124 SVDWRNVTNVVGPIKNQGHCGSCWTFSIAGIVESHYVLKHGSYVSYAEQEILDC 177


>UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheirus
           salmonis|Rep: Cysteine proteinase - Lepeophtheirus
           salmonis (salmon louse)
          Length = 372

 Score = 48.8 bits (111), Expect = 2e-04
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
 Frame = +3

Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVS--LSEQNLIDCSEH 674
           +VDWR+ G +TD+K+QG CGSCW  + +E  E     ++       LS Q +  CS +
Sbjct: 118 SVDWREKGVITDVKNQGSCGSCWVFSAVEQIESYVAIENNMTSPPLLSTQQITSCSSN 175


>UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2;
           cellular organisms|Rep: Cysteine proteinase, putative -
           Archaeoglobus fulgidus
          Length = 1088

 Score = 48.8 bits (111), Expect = 2e-04
 Identities = 20/55 (36%), Positives = 33/55 (60%), Gaps = 2/55 (3%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSG--YLVSLSEQNLIDCSE 671
           DWR +  ++ ++DQG CGSCW+ + +   E     +SG    + LSEQ+L+ C +
Sbjct: 599 DWRDYTGLSAVRDQGSCGSCWAHSAVAALESALIVESGASSSIDLSEQHLLSCEQ 653


>UniRef50_UPI0000D566ED Cluster: PREDICTED: similar to CTLA-2-alpha
           protein precursor; n=1; Tribolium castaneum|Rep:
           PREDICTED: similar to CTLA-2-alpha protein precursor -
           Tribolium castaneum
          Length = 101

 Score = 48.4 bits (110), Expect = 2e-04
 Identities = 18/44 (40%), Positives = 32/44 (72%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 387
           E+NFR +++A++   I +HN+KYE G V+Y +G+N++ D+   E
Sbjct: 45  EENFRKQLFAKNLEKIEEHNKKYEQGQVTYTMGVNQFSDLTPEE 88


>UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 343

 Score = 48.4 bits (110), Expect = 2e-04
 Identities = 27/63 (42%), Positives = 34/63 (53%), Gaps = 3/63 (4%)
 Frame = +3

Query: 489 QREAAGAVDWRK-HGA--VTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLI 659
           ++    +VDWR  +G   VT IK QG CGSCW+ A     E       G L SLS Q L+
Sbjct: 132 KKNLPNSVDWRNVNGTNHVTGIKYQGPCGSCWAFATAAAIESAVSISGGGLQSLSSQQLL 191

Query: 660 DCS 668
           DC+
Sbjct: 192 DCT 194


>UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila
           melanogaster|Rep: CG11459-PA - Drosophila melanogaster
           (Fruit fly)
          Length = 336

 Score = 48.4 bits (110), Expect = 2e-04
 Identities = 20/53 (37%), Positives = 35/53 (66%), Gaps = 1/53 (1%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQG-KCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           +DWR++G ++ + DQG +C SCW+ +   + E    ++ G LV LS ++L+DC
Sbjct: 122 IDWRQYGYISPVGDQGTECLSCWAFSTSGVLEAHMAKKYGNLVPLSPKHLVDC 174



 Score = 38.7 bits (86), Expect = 0.17
 Identities = 14/41 (34%), Positives = 23/41 (56%)
 Frame = +1

Query: 250 RGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGD 372
           R  D +   +Y +    +  HNQ Y  G V++K+G+NK+ D
Sbjct: 42  RNRDKYHRALYEQRVLAVESHNQLYLQGKVAFKMGLNKFSD 82


>UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcoptes
           scabiei type hominis|Rep: Sar s 1 allergen Yv4003H01 -
           Sarcoptes scabiei type hominis
          Length = 330

 Score = 48.4 bits (110), Expect = 2e-04
 Identities = 25/56 (44%), Positives = 35/56 (62%), Gaps = 3/56 (5%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHF--RQ-SGYLVSLSEQNLIDCS 668
           +D RK G VT +KDQ KCG+CW+ + +   E  +   RQ S +   LSEQ L+DC+
Sbjct: 117 IDLRKCGFVTPVKDQKKCGACWAFSTVCTTESLYLSSRQVSPWKFGLSEQELVDCA 172


>UniRef50_A1ZBK7 Cluster: CG10460-PA; n=1; Drosophila
           melanogaster|Rep: CG10460-PA - Drosophila melanogaster
           (Fruit fly)
          Length = 79

 Score = 48.0 bits (109), Expect = 3e-04
 Identities = 21/47 (44%), Positives = 31/47 (65%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVK 396
           ED  R +IYAE K  I +HN+K+E G V++K+G+N   D+   EF +
Sbjct: 24  EDLMRRRIYAESKARIEEHNRKFEKGEVTWKMGINHLADLTPEEFAQ 70


>UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n=1;
           Xenopus tropicalis|Rep: UPI00006A2275 UniRef100 entry -
           Xenopus tropicalis
          Length = 272

 Score = 47.6 bits (108), Expect = 4e-04
 Identities = 22/59 (37%), Positives = 36/59 (61%), Gaps = 1/59 (1%)
 Frame = +3

Query: 498 AAGAVDWRKHGAVTDIKDQGK-CGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           A  ++DWR    VT ++DQG  C SC++ + +   E Q  +++  LV+ S Q L+DCS+
Sbjct: 79  APPSIDWRTQNCVTPVRDQGSFCRSCYAFSAVGALECQWKKKTVRLVTFSPQELVDCSD 137



 Score = 47.2 bits (107), Expect = 5e-04
 Identities = 22/62 (35%), Positives = 33/62 (53%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
           E+  R  I+ E    I+ HN +Y +GL +Y++GMN  GDM   E   TM G+  +     
Sbjct: 6   EERARRTIWEETLKFISVHNLEYSLGLHTYEVGMNHLGDMTGEEVAATMTGYTGSGDSLA 65

Query: 436 NL 441
           N+
Sbjct: 66  NM 67


>UniRef50_Q23FS6 Cluster: Papain family cysteine protease containing
           protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
           family cysteine protease containing protein -
           Tetrahymena thermophila SB210
          Length = 330

 Score = 47.6 bits (108), Expect = 4e-04
 Identities = 24/56 (42%), Positives = 33/56 (58%), Gaps = 3/56 (5%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGY---LVSLSEQNLIDCS 668
           V+W   G V+ +KDQG+CGSCW+ +     E      +GY    + LSEQ L+DCS
Sbjct: 121 VNWVTRGKVSAVKDQGQCGSCWAFSTTGSVESA-LIIAGYANQTIDLSEQQLVDCS 175


>UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4;
           Hymenostomatida|Rep: Phospholipase A1 - Tetrahymena
           thermophila
          Length = 320

 Score = 46.8 bits (106), Expect = 6e-04
 Identities = 23/62 (37%), Positives = 35/62 (56%), Gaps = 5/62 (8%)
 Frame = +3

Query: 501 AGAVDWRKHGAVTDIKDQGKCGSCWSSARL-----ELWEGQHFRQSGYLVSLSEQNLIDC 665
           A  VDW   G VT +K+QG CGSCW+ + +      LW      Q+   ++L+EQ  +DC
Sbjct: 113 ATEVDWTAKGKVTPVKNQGSCGSCWAFSTIGAVESALWIAGQGEQN--TLNLAEQEQVDC 170

Query: 666 SE 671
           ++
Sbjct: 171 AK 172


>UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lamblia
           ATCC 50803|Rep: GLP_26_49243_47612 - Giardia lamblia
           ATCC 50803
          Length = 543

 Score = 46.8 bits (106), Expect = 6e-04
 Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 7/59 (11%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWS-------SARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           +DWR  G +T +KDQ  CGSCWS         RL   + +   +   L+ +SEQ++I C
Sbjct: 320 LDWRVRGVITPVKDQAACGSCWSFGAAGTIEGRLNALKWKRGERDTPLLRVSEQSIISC 378


>UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184,
           whole genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_184,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 331

 Score = 46.8 bits (106), Expect = 6e-04
 Identities = 19/67 (28%), Positives = 36/67 (53%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
           E+  R  ++A++  ++ +HN K+E+G  ++ LGMN+Y D+   EF  +        +  K
Sbjct: 49  EEVHRFSVFAQNLAVVMEHNSKFELGQETFTLGMNQYADLTPEEFQASFLTLKTKVQDRK 108

Query: 436 NLYMKGG 456
           N+    G
Sbjct: 109 NVKSYSG 115



 Score = 41.9 bits (94), Expect = 0.018
 Identities = 21/53 (39%), Positives = 28/53 (52%)
 Frame = +3

Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           VDW K G    +K+QG CGSCW+ A     E          V++SEQ  +DC+
Sbjct: 122 VDW-KDGLT--VKNQGSCGSCWAFAAAAAIEAGFQHHKKNKVNISEQEFVDCT 171


>UniRef50_Q5YER1 Cluster: Cysteine proteinase; n=1; Bigelowiella
           natans|Rep: Cysteine proteinase - Bigelowiella natans
           (Pedinomonas minutissima) (Chlorarachnion sp.(strain
           CCMP 621))
          Length = 140

 Score = 46.4 bits (105), Expect = 8e-04
 Identities = 17/28 (60%), Positives = 23/28 (82%)
 Frame = +3

Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWS 578
           ++A +VDW   GAVT +K+QG+CGSCWS
Sbjct: 107 KSADSVDWVSKGAVTPVKNQGQCGSCWS 134


>UniRef50_Q17IZ6 Cluster: Procathepsin L3, putative; n=2;
           Culicidae|Rep: Procathepsin L3, putative - Aedes aegypti
           (Yellowfever mosquito)
          Length = 313

 Score = 46.4 bits (105), Expect = 8e-04
 Identities = 28/99 (28%), Positives = 42/99 (42%), Gaps = 1/99 (1%)
 Frame = +3

Query: 483 AGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLID 662
           A Q     ++DWR  G  T   +Q  CGSC++ +      GQ  R+ G +  +S Q ++D
Sbjct: 129 ATQNSMPDSLDWRDKGFTTMAVNQKTCGSCYAFSIGHALNGQIMRRIGRVEYVSTQQMVD 188

Query: 663 CS-EHXXXXXXXXXXXXXLQVHQGQRGDRHRADLPYEGS 776
           CS                +Q  Q  +G    +D PY  S
Sbjct: 189 CSTSAGNKGCAGGSLRFTMQYLQNSQGIMRSSDYPYTSS 227



 Score = 36.7 bits (81), Expect = 0.67
 Identities = 25/115 (21%), Positives = 49/115 (42%), Gaps = 9/115 (7%)
 Frame = +1

Query: 268 RMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK---- 435
           R + + ++   I +HN  YE G  ++++G+N+  DM    ++K M        H K    
Sbjct: 51  RKRAFKKNMQEIEEHNANYEQGKSTFQMGVNELADMDKSSYLKKMVRMTDAIDHRKLDVD 110

Query: 436 --NLYMKGGSVRGAKFISPANVKLPER--WTG-GSTAPSPTSRTKGSVAHAGLQH 585
             +  ++  +  G +F+      +P+   W   G T  +   +T GS     + H
Sbjct: 111 FNDEMLQATNAFGEEFVQATQNSMPDSLDWRDKGFTTMAVNQKTCGSCYAFSIGH 165


>UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_36,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 307

 Score = 46.4 bits (105), Expect = 8e-04
 Identities = 22/55 (40%), Positives = 31/55 (56%)
 Frame = +3

Query: 504 GAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           G  DW     +  IK+QG CGSCW+ + +   EG    + G+   LSEQ L+DC+
Sbjct: 110 GDADWASK--MNPIKNQGNCGSCWTFSAIGAVEGFLAIRKGFKGVLSEQQLVDCA 162


>UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:
           Viral cathepsin - Cydia pomonella granulosis virus
           (CpGV) (Cydia pomonellagranulovirus)
          Length = 333

 Score = 46.4 bits (105), Expect = 8e-04
 Identities = 22/53 (41%), Positives = 36/53 (67%), Gaps = 1/53 (1%)
 Frame = +3

Query: 510 VDWR-KHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           +DWR KHG VT +K+Q +CGSCW+ + +   E  +  +    ++LSEQ+L++C
Sbjct: 128 LDWRDKHG-VTPVKNQMECGSCWAFSTIANIESLYNIKYDKALNLSEQHLVNC 179


>UniRef50_UPI0000D566EC Cluster: PREDICTED: similar to CG10460-PA;
           n=1; Tribolium castaneum|Rep: PREDICTED: similar to
           CG10460-PA - Tribolium castaneum
          Length = 80

 Score = 46.0 bits (104), Expect = 0.001
 Identities = 16/44 (36%), Positives = 30/44 (68%)
 Frame = +1

Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 387
           E+++R  ++  +  ++  HN+KYE GLV+YK+G+N++ D    E
Sbjct: 30  EESYRKSLFVANLQMVESHNEKYEDGLVNYKMGINQFADYSKEE 73


>UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome
           shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
           Chromosome 20 SCAF14744, whole genome shotgun sequence -
           Tetraodon nigroviridis (Green puffer)
          Length = 175

 Score = 46.0 bits (104), Expect = 0.001
 Identities = 20/52 (38%), Positives = 30/52 (57%)
 Frame = +3

Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           DWR +  V  +++Q  CGSCW+ + +   +  H   S  LV LS Q ++DCS
Sbjct: 64  DWRDNAVVGPVQNQQACGSCWAFSVVGAVQSVHAIGSSPLVELSVQQVLDCS 115


  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 776,152,799
Number of Sequences: 1657284
Number of extensions: 16417678
Number of successful extensions: 56151
Number of sequences better than 10.0: 405
Number of HSP's better than 10.0 without gapping: 52437
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 55977
length of database: 575,637,011
effective HSP length: 99
effective length of database: 411,565,895
effective search space used: 68319938570
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
