BLASTX 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= pg--0742.Seq
         (603 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

UniRef50_UPI000155637A Cluster: PREDICTED: similar to ENSANGP000...   104   2e-21
UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ...   103   2e-21
UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-...   103   2e-21
UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhip...    94   3e-18
UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome s...    90   3e-17
UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ...    82   9e-15
UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; ...    80   5e-14
UniRef50_Q6DGW1 Cluster: 26-29kD-proteinase protein; n=23; Danio...    69   6e-11
UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt...    61   2e-08
UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve...    60   3e-08
UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j...    60   4e-08
UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1...    57   3e-07
UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox...    56   5e-07
UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep...    56   5e-07
UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n...    56   6e-07
UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ...    56   6e-07
UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re...    55   1e-06
UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel...    55   1e-06
UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil...
55 1e-06 UniRef50_Q23RT7 Cluster: Papain family cysteine protease contain... 55 1e-06 UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa... 54 2e-06 UniRef50_Q5YER1 Cluster: Cysteine proteinase; n=1; Bigelowiella ... 54 2e-06 UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|... 54 2e-06 UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy... 54 2e-06 UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Try... 54 3e-06 UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc... 53 6e-06 UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata... 53 6e-06 UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryz... 52 1e-05 UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ... 51 2e-05 UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ... 51 2e-05 UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v... 51 2e-05 UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve... 51 2e-05 UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ... 51 2e-05 UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;... 51 2e-05 UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ... 51 2e-05 UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3... 51 2e-05 UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryz... 50 3e-05 UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ... 50 4e-05 UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar ... 50 4e-05 UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lambli... 50 4e-05 UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio... 50 4e-05 UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cy... 50 4e-05 UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ... 50 4e-05 UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG... 
50 6e-05 UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Ac... 50 6e-05 UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; ... 50 6e-05 UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; D... 50 6e-05 UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=... 49 7e-05 UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ... 49 1e-04 UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ... 49 1e-04 UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs... 49 1e-04 UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h... 48 1e-04 UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin ... 48 2e-04 UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ... 48 2e-04 UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ... 48 2e-04 UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ... 48 2e-04 UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|... 48 2e-04 UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi... 48 2e-04 UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus... 48 2e-04 UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R... 48 2e-04 UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ... 48 2e-04 UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic... 48 2e-04 UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35... 48 2e-04 UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine p... 47 3e-04 UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ... 47 3e-04 UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip... 47 3e-04 UniRef50_Q22RC9 Cluster: Papain family cysteine protease contain... 47 3e-04 UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=... 47 3e-04 UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|... 
47 4e-04 UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi... 47 4e-04 UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy... 47 4e-04 UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph... 46 5e-04 UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Pl... 46 5e-04 UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foet... 46 5e-04 UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep... 46 7e-04 UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D... 46 7e-04 UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ... 46 7e-04 UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|... 46 7e-04 UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ... 46 0.001 UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ... 46 0.001 UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal... 46 0.001 UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb... 46 0.001 UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ... 46 0.001 UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat... 46 0.001 UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [C... 46 0.001 UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ... 45 0.001 UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ... 45 0.001 UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:... 45 0.001 UniRef50_Q24E33 Cluster: Papain family cysteine protease contain... 45 0.001 UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ... 45 0.001 UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n... 45 0.002 UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re... 45 0.002 UniRef50_Q248G1 Cluster: Papain family cysteine protease contain... 45 0.002 UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory... 
45 0.002 UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e... 45 0.002 UniRef50_Q650X9 Cluster: Putative cysteine proteinase; n=1; Oryz... 44 0.002 UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei... 44 0.002 UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theil... 44 0.002 UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M... 44 0.002 UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ... 44 0.003 UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n... 44 0.003 UniRef50_A6N590 Cluster: Extracellular cysteine protease 8; n=10... 44 0.003 UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; ... 44 0.003 UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto... 44 0.003 UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 p... 44 0.004 UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum... 44 0.004 UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n... 44 0.004 UniRef50_Q22A69 Cluster: Papain family cysteine protease contain... 44 0.004 UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18... 44 0.004 UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2; Arab... 43 0.005 UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big... 43 0.005 UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n... 43 0.005 UniRef50_Q239L8 Cluster: Papain family cysteine protease contain... 43 0.005 UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin ... 43 0.006 UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty... 43 0.006 UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadida... 43 0.006 UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru... 43 0.006 UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease ... 42 0.008 UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa... 
42 0.008 UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr... 42 0.008 UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy... 42 0.008 UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;... 42 0.008 UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lambli... 42 0.011 UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ... 42 0.011 UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ... 42 0.015 UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain... 42 0.015 UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus... 42 0.015 UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr... 42 0.015 UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Re... 42 0.015 UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R... 42 0.015 UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl... 41 0.020 UniRef50_Q24HI2 Cluster: Papain family cysteine protease contain... 41 0.020 UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ... 41 0.020 UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; ... 41 0.020 UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ... 41 0.020 UniRef50_P56202 Cluster: Cathepsin W precursor; n=15; Eutheria|R... 41 0.020 UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:... 41 0.020 UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome s... 41 0.026 UniRef50_Q0AY53 Cluster: Putative uncharacterized protein; n=1; ... 41 0.026 UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; R... 41 0.026 UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8; Magnoliophyt... 41 0.026 UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca... 41 0.026 UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lambl... 41 0.026 UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;... 
41 0.026 UniRef50_Q22W19 Cluster: Papain family cysteine protease contain... 41 0.026 UniRef50_O16454 Cluster: Temporarily assigned gene name protein ... 41 0.026 UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ... 41 0.026 UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C... 41 0.026 UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza sativa... 40 0.034 UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus ... 40 0.034 UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcopt... 40 0.034 UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa... 40 0.034 UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain... 40 0.034 UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-li... 40 0.034 UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like cy... 40 0.034 UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain... 40 0.045 UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis... 40 0.045 UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy... 40 0.045 UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae... 40 0.060 UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n... 40 0.060 UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain... 40 0.060 UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain... 40 0.060 UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina... 40 0.060 UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ... 40 0.060 UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi... 40 0.060 UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin ... 39 0.079 UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio... 39 0.079 UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy... 39 0.079 UniRef50_A5HC50 Cluster: Cathepsin W; n=2; Theria|Rep: Cathepsin... 
39 0.079 UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G... 39 0.079 UniRef50_Q23H10 Cluster: Papain family cysteine protease contain... 39 0.079 UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, wh... 39 0.079 UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr... 39 0.079 UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryo... 39 0.079 UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,... 39 0.10 UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease ... 39 0.10 UniRef50_Q23H15 Cluster: Papain family cysteine protease contain... 39 0.10 UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca... 38 0.14 UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat... 38 0.14 UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No... 38 0.14 UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er... 38 0.14 UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ... 38 0.18 UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease ... 38 0.18 UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|... 38 0.18 UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ... 38 0.18 UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep... 38 0.18 UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain... 38 0.18 UniRef50_Q23TW3 Cluster: Papain family cysteine protease contain... 38 0.18 UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Re... 38 0.18 UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheir... 38 0.18 UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; M... 38 0.18 UniRef50_Q2FLD5 Cluster: PKD precursor; n=1; Methanospirillum hu... 38 0.18 UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep: L... 38 0.24 UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestiv... 37 0.32 UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: ... 
37 0.32 UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C... 37 0.32 UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen... 37 0.32 UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, wh... 37 0.32 UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckei... 37 0.42 UniRef50_Q235G6 Cluster: Papain family cysteine protease contain... 37 0.42 UniRef50_Q8TMY7 Cluster: Cell surface protein; n=2; Methanosarci... 37 0.42 UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sa... 36 0.56 UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intest... 36 0.56 UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcop... 36 0.56 UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ... 36 0.74 UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dv... 36 0.74 UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain ... 36 0.74 UniRef50_A0B934 Cluster: GHMP kinase; n=1; Methanosaeta thermoph... 36 0.74 UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; The... 36 0.74 UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ... 36 0.97 UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t... 36 0.97 UniRef50_Q42312 Cluster: Cysteine protease; n=1; Arabidopsis tha... 36 0.97 UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi... 36 0.97 UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh... 36 0.97 UniRef50_Q8TQM7 Cluster: Putative uncharacterized protein; n=1; ... 36 0.97 UniRef50_Q8PS79 Cluster: Putative uncharacterized protein; n=1; ... 36 0.97 UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:... 36 0.97 UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome s... 35 1.3 UniRef50_Q1CXI7 Cluster: Putative uncharacterized protein; n=1; ... 35 1.3 UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; ... 35 1.3 UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteina... 
35 1.3 UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin ... 35 1.7 UniRef50_Q53K53 Cluster: Cysteine protease 1, putative; n=5; Ory... 35 1.7 UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba hi... 35 1.7 UniRef50_Q3L7L6 Cluster: Sar s 1 allergen Yv5032C08; n=1; Sarcop... 35 1.7 UniRef50_Q23EG5 Cluster: Papain family cysteine protease contain... 35 1.7 UniRef50_Q22NW9 Cluster: Papain family cysteine protease contain... 35 1.7 UniRef50_A7LFV3 Cluster: Cathepsin L protease inhibitor 2; n=1; ... 35 1.7 UniRef50_A0C1I6 Cluster: Chromosome undetermined scaffold_142, w... 35 1.7 UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov... 35 1.7 UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Gi... 35 1.7 UniRef50_Q70SU8 Cluster: Cystein proteinase inhibitor protein pr... 34 2.3 UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2; ... 34 2.3 UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-... 34 2.3 UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:... 34 2.3 UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcop... 34 2.3 UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia... 34 2.3 UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain... 34 2.3 UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H... 34 2.3 UniRef50_A2DKX2 Cluster: Clan CA, family C1, cathepsin L-like cy... 34 2.3 UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]... 34 2.3 UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precu... 34 2.3 UniRef50_Q23894 Cluster: Cysteine proteinase 3; n=2; Dictyosteli... 34 2.3 UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto... 34 2.3 UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome s... 34 3.0 UniRef50_A0LES1 Cluster: Peptidase C1A, papain precursor; n=2; S... 34 3.0 UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain... 
34 3.0 UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi... 34 3.0 UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryz... 33 3.9 UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa... 33 3.9 UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster... 33 3.9 UniRef50_Q23H32 Cluster: Papain family cysteine protease contain... 33 3.9 UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, w... 33 3.9 UniRef50_Q8TGH8 Cluster: Bck1-like MAP kinase kinase kinase; n=1... 33 3.9 UniRef50_O95905 Cluster: SGT1 protein; n=27; Euteleostomi|Rep: S... 33 3.9 UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2... 33 3.9 UniRef50_UPI00015B5D85 Cluster: PREDICTED: similar to cathepsin ... 33 5.2 UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease ... 33 5.2 UniRef50_Q2RPV6 Cluster: Putative uncharacterized protein; n=1; ... 33 5.2 UniRef50_Q022Z7 Cluster: Putative uncharacterized protein; n=1; ... 33 5.2 UniRef50_A4CGF7 Cluster: Chitinase; n=1; Robiginitalea biformata... 33 5.2 UniRef50_Q5Z4X7 Cluster: Putative uncharacterized protein B1061F... 33 5.2 UniRef50_Q5JJI1 Cluster: Putative uncharacterized protein B1793G... 33 5.2 UniRef50_Q2QS15 Cluster: Papain family cysteine protease contain... 33 5.2 UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n... 33 5.2 UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma j... 33 5.2 UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain... 33 5.2 UniRef50_A0C797 Cluster: Chromosome undetermined scaffold_154, w... 33 5.2 UniRef50_Q8TQ91 Cluster: Putative uncharacterized protein; n=1; ... 33 5.2 UniRef50_Q2NG83 Cluster: Member of asn/thr-rich large protein fa... 33 5.2 UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3.... 33 5.2 UniRef50_UPI00006CA492 Cluster: hypothetical protein TTHERM_0049... 33 6.9 UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Re... 
33 6.9 UniRef50_A6LML6 Cluster: Peptidase C1A, papain precursor; n=1; T... 33 6.9 UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; ... 33 6.9 UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;... 33 6.9 UniRef50_Q8TKH5 Cluster: Cell surface protein; n=3; Methanosarci... 33 6.9 UniRef50_Q01E11 Cluster: Cysteine proteinase Cathepsin F; n=3; O... 32 9.1 UniRef50_Q93512 Cluster: Putative uncharacterized protein; n=2; ... 32 9.1 UniRef50_Q61CG7 Cluster: Putative uncharacterized protein CBG129... 32 9.1 UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted... 32 9.1 UniRef50_Q23H06 Cluster: Papain family cysteine protease contain... 32 9.1 UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali... 32 9.1 UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, wh... 32 9.1 >UniRef50_UPI000155637A Cluster: PREDICTED: similar to ENSANGP00000013730, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to ENSANGP00000013730, partial - Ornithorhynchus anatinus Length = 229 Score = 104 bits (249), Expect = 2e-21 Identities = 51/84 (60%), Positives = 57/84 (67%) Frame = +1 Query: 253 HNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHD 432 HNRANR F ++ NHL DRT ELAALRGR S HG PFP+ + +V LP D Sbjct: 5 HNRANRPFRLAPNHLTDRTPGELAALRGRLRSSRPNHGQPFPHE----QLANVALPESLD 60 Query: 433 WRLFGAVTPVKDQLVFGSCWSFGT 504 WRL+GAVTPVKDQ V GSCWSF T Sbjct: 61 WRLYGAVTPVKDQAVCGSCWSFAT 84 Score = 38.7 bits (86), Expect = 0.10 Identities = 19/30 (63%), Positives = 20/30 (66%) Frame = +2 Query: 512 VEGALFLHKGGHLXWLSQQALIDCSWGFGN 601 +EGALFL L LSQQ LIDCSW GN Sbjct: 88 LEGALFLKVTVQLVPLSQQMLIDCSWDVGN 117 >UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue of Sarcophaga 26,29kDa proteinase; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to homologue of Sarcophaga 26,29kDa proteinase - Nasonia vitripennis Length = 553 Score = 103 bits (248), Expect = 2e-21 Identities = 53/102 (51%), Positives = 69/102 (67%), Gaps = 2/102 (1%) 
Frame = +1 Query: 205 REEAEHLQAVAQ-IHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHG-LPFP 378 ++ EH + + IH+ NRAN GFT+ VNHLADR + EL LRG++Y+ +G +PFP Sbjct: 266 KQRKEHFRHNLRFIHSI-NRANLGFTLDVNHLADRNEAELKVLRGKQYTQHGYNGGMPFP 324 Query: 379 YSKSRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 + VE+ +P DWRL+GAVTPVKDQ V GSCWSFGT Sbjct: 325 HD---VEKEKADVPDSFDWRLYGAVTPVKDQSVCGSCWSFGT 363 Score = 92.3 bits (219), Expect = 8e-18 Identities = 42/78 (53%), Positives = 51/78 (65%) Frame = +2 Query: 23 DVFKVDSNMQCTGFPGPGSRHFATFNPMKEFVRPVHDAHVHDEFERFKVKLQKQYASDLE 202 +VF+V+ N C FPGPG TFNPMKEF+ H AHV F+RFK K YA DLE Sbjct: 206 EVFQVEQNASCVSFPGPGEHRIYTFNPMKEFIHN-HQAHVDMAFDRFKKTHNKNYAHDLE 264 Query: 203 HEKRLNIFRQSLRYIHSI 256 H++R FR +LR+IHSI Sbjct: 265 HKQRKEHFRHNLRFIHSI 282 Score = 43.6 bits (98), Expect = 0.004 Identities = 22/30 (73%), Positives = 23/30 (76%) Frame = +2 Query: 512 VEGALFLHKGGHLXWLSQQALIDCSWGFGN 601 VEGA F+ K L LSQQALIDCSWGFGN Sbjct: 367 VEGAYFM-KYKKLVRLSQQALIDCSWGFGN 395 >UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-PA - Drosophila melanogaster (Fruit fly) Length = 549 Score = 103 bits (248), Expect = 2e-21 Identities = 47/82 (57%), Positives = 58/82 (70%) Frame = +2 Query: 8 DEIDPDVFKVDSNMQCTGFPGPGSRHFATFNPMKEFVRPVHDAHVHDEFERFKVKLQKQY 187 D+I +VF++D ++QC GFPGPG+ H+ATFNPM+EF+ D HV F FK K Y Sbjct: 198 DDIPNEVFEIDDSLQCVGFPGPGTGHYATFNPMQEFISGT-DEHVDKAFHHFKRKHGVAY 256 Query: 188 ASDLEHEKRLNIFRQSLRYIHS 253 SD EHE R NIFRQ+LRYIHS Sbjct: 257 HSDTEHEHRKNIFRQNLRYIHS 278 Score = 102 bits (244), Expect = 7e-21 Identities = 49/93 (52%), Positives = 66/93 (70%) Frame = +1 Query: 226 QAVAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEEL 405 Q + IH+ NRA +T++VNHLAD+T++EL A RG + SG G PFPY + ++ Sbjct: 271 QNLRYIHS-KNRAKLTYTLAVNHLADKTEEELKARRGYKSSGIYNTGKPFPYDVPKYKD- 328 Query: 406 SVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 ++P ++DWRL+GAVTPVKDQ V GSCWSFGT Sbjct: 329 --EIPDQYDWRLYGAVTPVKDQSVCGSCWSFGT 359 Score = 49.6 bits (113), Expect = 6e-05 Identities = 21/30 (70%), Positives 
= 24/30 (80%) Frame = +2 Query: 512 VEGALFLHKGGHLXWLSQQALIDCSWGFGN 601 +EGA FL GG+L LSQQALIDCSW +GN Sbjct: 363 LEGAFFLKNGGNLVRLSQQALIDCSWAYGN 392 >UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 2 - Rhipicephalus appendiculatus (Brown ear tick) Length = 564 Score = 93.9 bits (223), Expect = 3e-18 Identities = 47/84 (55%), Positives = 57/84 (67%), Gaps = 1/84 (1%) Frame = +1 Query: 256 NRANRGFTMSVNHLADRTDDELAALRGRRYSGP-SPHGLPFPYSKSRVEELSVKLPPEHD 432 NRAN G+ ++VNHLADRT +E++ LRGR S S PFP + + KLP + D Sbjct: 296 NRANLGYNLAVNHLADRTREEISVLRGRLQSKDGSSRAEPFPRHR-----FTAKLPDQID 350 Query: 433 WRLFGAVTPVKDQLVFGSCWSFGT 504 WR +GAVTPVKDQ V GSCWSFGT Sbjct: 351 WRPYGAVTPVKDQAVCGSCWSFGT 374 Score = 84.6 bits (200), Expect = 2e-15 Identities = 67/209 (32%), Positives = 94/209 (44%), Gaps = 11/209 (5%) Frame = +2 Query: 8 DEIDPDVFKVDS--NMQCTGFPGPGSRHFATFNPMKEFVRPVHDAHVHDEFERFKVKLQK 181 D + P VF V + N C FPGPG+ A NPM EF+ HD H FE FK ++ Sbjct: 212 DPVPPSVFDVTTLFNGTCRSFPGPGAERLALHNPMAEFLGN-HDGHTKHSFEDFKETHKR 270 Query: 182 QYASDLEHEKRLNIFRQSLRYIHS-----IIERTAVSP-CP*TILPIALTTSSLPSEGGG 343 Y D EH++R +IFRQ+LR+I S + AV+ T I++ L S+ G Sbjct: 271 TYELDTEHDRRRDIFRQNLRFIDSKNRANLGYNLAVNHLADRTREEISVLRGRLQSKDGS 330 Query: 344 TRG---PALTVSRSRTANLEWRS*A*SCLRNTTGDCSERSLPLKISWCXXXXXXXXXXXV 514 +R P + ++WR ++++ C + Sbjct: 331 SRAEPFPRHRFTAKLPDQIDWRP------YGAVTPVKDQAV------CGSCWSFGTVGEL 378 Query: 515 EGALFLHKGGHLXWLSQQALIDCSWGFGN 601 EGA F K G L LS+Q L+DCSW GN Sbjct: 379 EGAYF-RKTGRLVRLSEQQLVDCSWNNGN 406 >UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 21 SCAF14577, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 478 Score = 90.2 bits (214), Expect = 3e-17 Identities = 48/105 (45%), Positives = 64/105 (60%), Gaps = 6/105 (5%) Frame = +1 Query: 208 EEAEH-LQAVAQIHTFH-----NRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGL 369 ++ EH L+ A IH NRA 
+T+ +N L+DRT ELA +RGR+ + GL Sbjct: 134 DDKEHELRQQAFIHNLRYVHSKNRAGLSYTLGLNSLSDRTMSELATMRGRKQRKTTNAGL 193 Query: 370 PFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 PFP+ + V++P DWRL+GAVTPVKDQ + GSCWSF T Sbjct: 194 PFPFKLYQ----HVEVPESLDWRLYGAVTPVKDQAICGSCWSFAT 234 Score = 81.4 bits (192), Expect = 1e-14 Identities = 35/80 (43%), Positives = 43/80 (53%) Frame = +2 Query: 14 IDPDVFKVDSNMQCTGFPGPGSRHFATFNPMKEFVRPVHDAHVHDEFERFKVKLQKQYAS 193 +DP +F + M C GFPGPG H NPMK+ + H F FK K Q+QY Sbjct: 75 VDPKIFTLPEGMTCEGFPGPGVEHHMLANPMKDLIHTSASGHSQRVFGHFKEKFQRQYED 134 Query: 194 DLEHEKRLNIFRQSLRYIHS 253 D EHE R F +LRY+HS Sbjct: 135 DKEHELRQQAFIHNLRYVHS 154 Score = 47.6 bits (108), Expect = 2e-04 Identities = 23/30 (76%), Positives = 24/30 (80%) Frame = +2 Query: 512 VEGALFLHKGGHLXWLSQQALIDCSWGFGN 601 +EGALFL K G L LSQQ LIDCSWGFGN Sbjct: 238 IEGALFL-KTGSLQVLSQQMLIDCSWGFGN 266 Score = 38.7 bits (86), Expect = 0.10 Identities = 17/25 (68%), Positives = 18/25 (72%) Frame = +2 Query: 527 FLHKGGHLXWLSQQALIDCSWGFGN 601 +L G L LSQQ LIDCSWGFGN Sbjct: 296 YLGMTGSLQVLSQQMLIDCSWGFGN 320 >UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin l; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin l - Strongylocentrotus purpuratus Length = 489 Score = 82.2 bits (194), Expect = 9e-15 Identities = 43/89 (48%), Positives = 58/89 (65%), Gaps = 1/89 (1%) Frame = +1 Query: 241 IHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLP 420 IH+ NRAN G+ + +NH+AD++ EL +RGR +GLP Y S V + +V Sbjct: 214 IHSI-NRANLGYVLDINHMADQSHQELKRMRGRLRQTRPNNGLP--YDGSDVSDDAV--- 267 Query: 421 PEH-DWRLFGAVTPVKDQLVFGSCWSFGT 504 P+H DW + GAV+PVKDQ V GSCWSFG+ Sbjct: 268 PDHIDWNVLGAVSPVKDQAVCGSCWSFGS 296 Score = 35.9 bits (79), Expect = 0.74 Identities = 15/30 (50%), Positives = 21/30 (70%) Frame = +2 Query: 512 VEGALFLHKGGHLXWLSQQALIDCSWGFGN 601 +EGA+F+ G + LSQQ L+DC+W GN Sbjct: 300 IEGAVFMQSGKRVR-LSQQMLMDCTWAAGN 328 >UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; n=2; Danio 
rerio|Rep: hypothetical protein LOC550326 - Danio rerio Length = 531 Score = 79.8 bits (188), Expect = 5e-14 Identities = 43/93 (46%), Positives = 55/93 (59%), Gaps = 5/93 (5%) Frame = +1 Query: 241 IHTF-----HNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEEL 405 +HTF +NRA +++ +NH AD+T +ELA + G PFP S+ R Sbjct: 254 LHTFRFVHSNNRAGLTYSVGINHFADKTKEELARMTGGLLPKKEEKAQPFP-SEIR---- 308 Query: 406 SVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 S+ P DWRL+GAVTPVKDQ V GSCWSF T Sbjct: 309 SIATPNSVDWRLYGAVTPVKDQAVCGSCWSFAT 341 Score = 69.3 bits (162), Expect = 6e-11 Identities = 30/79 (37%), Positives = 43/79 (54%) Frame = +2 Query: 17 DPDVFKVDSNMQCTGFPGPGSRHFATFNPMKEFVRPVHDAHVHDEFERFKVKLQKQYASD 196 +PDVF + C FP P H NP +++V +H H F FK K +QY S+ Sbjct: 184 EPDVFTPPAGFTCEEFPDPPEEHQILANPFQDYVNTHPVSHAHRMFGPFKEKFNRQYESE 243 Query: 197 LEHEKRLNIFRQSLRYIHS 253 EHE+R N+F + R++HS Sbjct: 244 KEHEERENLFLHTFRFVHS 262 Score = 45.6 bits (103), Expect = 0.001 Identities = 21/30 (70%), Positives = 24/30 (80%) Frame = +2 Query: 512 VEGALFLHKGGHLXWLSQQALIDCSWGFGN 601 +EGALFL K G L LSQQ L+DC+WGFGN Sbjct: 345 LEGALFL-KTGQLTSLSQQMLVDCTWGFGN 373 >UniRef50_Q6DGW1 Cluster: 26-29kD-proteinase protein; n=23; Danio rerio|Rep: 26-29kD-proteinase protein - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 327 Score = 69.3 bits (162), Expect = 6e-11 Identities = 30/79 (37%), Positives = 43/79 (54%) Frame = +2 Query: 17 DPDVFKVDSNMQCTGFPGPGSRHFATFNPMKEFVRPVHDAHVHDEFERFKVKLQKQYASD 196 +PDVF + C FP P H NP +++V +H H F FK K +QY S+ Sbjct: 210 EPDVFTPPAGFTCEEFPDPPEEHQILANPFQDYVNTHPVSHAHRMFGPFKEKFNRQYESE 269 Query: 197 LEHEKRLNIFRQSLRYIHS 253 EHE+R N+F + R++HS Sbjct: 270 KEHEERENLFLHTFRFVHS 288 >UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa (Rice) Length = 339 Score = 60.9 bits (141), Expect = 2e-08 Identities = 38/90 (42%), Positives = 49/90 (54%), Gaps = 1/90 (1%) Frame = +1 Query: 232 VAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSV 411 VA I +F N N F 
+SVN AD T+ E A + + PS +P + R E +S+ Sbjct: 65 VAFIESF-NAGNHKFWLSVNQFADLTNYEFRATKTNKGFIPSTVRVPTTF---RYENVSI 120 Query: 412 K-LPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 LP DWR GAVTP+KDQ G CW+F Sbjct: 121 DTLPATVDWRTKGAVTPIKDQGQCGCCWAF 150 >UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 513 Score = 60.5 bits (140), Expect = 3e-08 Identities = 32/81 (39%), Positives = 47/81 (58%) Frame = +1 Query: 256 NRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDW 435 NR + G+++ NH+AD TD E+ ++G + P G P+S ++ V LPP DW Sbjct: 245 NRQHLGYSLKPNHMADMTDAEVNRMKGLLHEEPPLIG-DSPFSIPD-KDRGVPLPPHVDW 302 Query: 436 RLFGAVTPVKDQLVFGSCWSF 498 R GAV VK Q + GSC++F Sbjct: 303 RKAGAVNSVKSQGICGSCYAF 323 Score = 49.2 bits (112), Expect = 7e-05 Identities = 28/77 (36%), Positives = 43/77 (55%), Gaps = 1/77 (1%) Frame = +2 Query: 26 VFKVDSNMQCTGFPGPGS-RHFATFNPMKEFVRPVHDAHVHDEFERFKVKLQKQYASDLE 202 VF++ ++++C F + NPM EF+ H A H F FK +K+Y S E Sbjct: 169 VFEIPTDIKCFEFSHEKNVGAVGEINPMFEFMP--HTAVQHHLFNAFKASYRKRYPSAHE 226 Query: 203 HEKRLNIFRQSLRYIHS 253 HEKR +I+R ++R+I S Sbjct: 227 HEKRKDIYRHNMRFIKS 243 Score = 37.1 bits (82), Expect = 0.32 Identities = 16/30 (53%), Positives = 22/30 (73%) Frame = +2 Query: 512 VEGALFLHKGGHLXWLSQQALIDCSWGFGN 601 +EGA F+ G L LS+Q ++DC+WGFGN Sbjct: 329 LEGAHFIKTGLKLD-LSEQQIVDCTWGFGN 357 >UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma japonicum|Rep: SJCHGC06231 protein - Schistosoma japonicum (Blood fluke) Length = 372 Score = 60.1 bits (139), Expect = 4e-08 Identities = 36/88 (40%), Positives = 47/88 (53%), Gaps = 4/88 (4%) Frame = +1 Query: 253 HNRANRG----FTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLP 420 HNRA + + M VN+ D+T+ EL LRG R S + P + + KLP Sbjct: 96 HNRAYQEGKATYKMGVNNFTDKTEYELRKLRGYR----SACRIAKPKGSTFISSEHAKLP 151 Query: 421 PEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 DWR GAVTPVK+Q GSCW+F + Sbjct: 152 DRVDWRRNGAVTPVKNQGQCGSCWAFSS 179 >UniRef50_Q54TR1 Cluster: Counting factor associated 
protein; n=1; Dictyostelium discoideum AX4|Rep: Counting factor associated protein - Dictyostelium discoideum AX4 Length = 531 Score = 57.2 bits (132), Expect = 3e-07 Identities = 31/95 (32%), Positives = 47/95 (49%) Frame = +1 Query: 220 HLQAVAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVE 399 + +A +I HN + + +NH AD ++ E L + + PS G + + Sbjct: 248 NFKAARKIIATHNAKESSYKLGMNHYADLSNKEFNTLVKPKVARPSVTGADSVHD----D 303 Query: 400 ELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 E +P DWR VTPVKDQ + GSCW+FG+ Sbjct: 304 ESLRSIPSTVDWRNQNCVTPVKDQGICGSCWTFGS 338 >UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Toxopain-2 - Toxoplasma gondii Length = 422 Score = 56.4 bits (130), Expect = 5e-07 Identities = 36/92 (39%), Positives = 47/92 (51%), Gaps = 4/92 (4%) Frame = +1 Query: 241 IHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSG-PSPHGLPFPYSKSRVEELSV-- 411 IHT HN+ +++ +NH D + DE R+Y G L + E L+V Sbjct: 148 IHT-HNQQGYSYSLKMNHFGDLSRDEFR----RKYLGFKKSRNLKSHHLGVATELLNVLP 202 Query: 412 -KLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 +LP DWR G VTPVKDQ GSCW+F T Sbjct: 203 SELPAGVDWRSRGCVTPVKDQRDCGSCWAFST 234 Score = 39.1 bits (87), Expect = 0.079 Identities = 16/41 (39%), Positives = 25/41 (60%) Frame = +2 Query: 131 DAHVHDEFERFKVKLQKQYASDLEHEKRLNIFRQSLRYIHS 253 +AH D F F+ K YA++ E ++R IF+ +L YIH+ Sbjct: 110 EAHFQDAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHT 150 >UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep: Cysteine proteinase - Cryptobia salmositica Length = 443 Score = 56.4 bits (130), Expect = 5e-07 Identities = 32/84 (38%), Positives = 40/84 (47%), Gaps = 1/84 (1%) Frame = +1 Query: 256 NRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKS-RVEELSVKLPPEHD 432 NR N T N AD T +E + P +K+ EE+ + + D Sbjct: 60 NRKNPMATFGPNEFADMTSEEFQTRHNAARHYAAAKARPPKNTKTFTAEEIKAAVGQQID 119 Query: 433 WRLFGAVTPVKDQLVFGSCWSFGT 504 WRL GAVTPVK+Q GSCWSF T Sbjct: 120 WRLKGAVTPVKNQGACGSCWSFST 143 >UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4; core eudicotyledons|Rep: Papain-like cysteine peptidase 
XBCP3 - Arabidopsis thaliana (Mouse-ear cress) Length = 437 Score = 56.0 bits (129), Expect = 6e-07 Identities = 43/106 (40%), Positives = 53/106 (50%), Gaps = 6/106 (5%) Frame = +1 Query: 199 GAREEAEH-LQAVAQIHTF---HNR-ANRGFTMSVNHLADRTDDELAALR-GRRYSGPSP 360 G+ EE + +Q H F HN N +++S+N AD T E A R G S PS Sbjct: 44 GSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKASRLGLSVSAPSV 103 Query: 361 HGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 SK + SVK+P DWR GAVT VKDQ G+CWSF Sbjct: 104 ----IMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSF 145 >UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative uncharacterized protein - Oryza sativa subsp. japonica (Rice) Length = 326 Score = 56.0 bits (129), Expect = 6e-07 Identities = 38/114 (33%), Positives = 56/114 (49%), Gaps = 3/114 (2%) Frame = +1 Query: 166 SQTPEAVRERPGAREEAEHLQAVAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRY 345 S +P + ++ G+R E A IH F+ + + + +N AD T +E A +Y Sbjct: 37 SSSPRDLADK-GSRFEVFKKNA-RYIHDFNRKKGMSYKLGLNKFADLTLEEFTA----KY 90 Query: 346 SGPSPH---GLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 +G +P GL + ++ PP DWR GAVT VKDQ GSCW+F Sbjct: 91 TGANPGPITGLKNGTGSPPLAAVAGDAPPAWDWREHGAVTRVKDQGPCGSCWAF 144 >UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Rep: Cathepsin L - Stylonychia lemnae Length = 340 Score = 55.2 bits (127), Expect = 1e-06 Identities = 36/86 (41%), Positives = 44/86 (51%), Gaps = 2/86 (2%) Frame = +1 Query: 253 HNRANRG--FTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPE 426 HN N G FT+ NHLAD T DE + G Y + G YS ++++ P Sbjct: 76 HNSQNDGTSFTLGPNHLADYTHDEYKKMLG--YKPRNKTGKEV-YSTPNLKDI----PES 128 Query: 427 HDWRLFGAVTPVKDQLVFGSCWSFGT 504 DWR GAV VKDQ GSCW+F T Sbjct: 129 IDWREKGAVNAVKDQGQCGSCWAFST 154 >UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromeliaceae|Rep: Fruit bromelain precursor - Ananas comosus (Pineapple) Length = 351 Score = 55.2 bits (127), Expect = 1e-06 Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 
4/93 (4%) Frame = +1 Query: 232 VAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRV---EE 402 V I TF++R +T+ +N D T E A +Y+G S LP + V ++ Sbjct: 65 VKHIETFNSRNENSYTLGINQFTDMTKSEFVA----QYTGVS---LPLNIEREPVVSFDD 117 Query: 403 LSVKLPPEH-DWRLFGAVTPVKDQLVFGSCWSF 498 +++ P+ DWR +GAV VK+Q GSCWSF Sbjct: 118 VNISAVPQSIDWRDYGAVNEVKNQNPCGSCWSF 150 >UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Liliopsida|Rep: Putative cysteine proteinase - Oryza sativa subsp. japonica (Rice) Length = 416 Score = 54.8 bits (126), Expect = 1e-06 Identities = 36/90 (40%), Positives = 48/90 (53%), Gaps = 4/90 (4%) Frame = +1 Query: 241 IHTFHNRAN-RGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYS--KSRVEELSV 411 IH F+ ++ + + +N +D T +E AA +Y+G F + S EEL V Sbjct: 56 IHEFNQKSKGMSYVLGLNKFSDLTYEEFAA----KYTGVKVDASAFATATTSSPDEELPV 111 Query: 412 KLPPEH-DWRLFGAVTPVKDQLVFGSCWSF 498 +PP DWRL GAVT VKDQ GSCW F Sbjct: 112 GVPPATWDWRLNGAVTDVKDQGQCGSCWVF 141 >UniRef50_Q23RT7 Cluster: Papain family cysteine protease containing protein; n=7; Hymenostomatida|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 387 Score = 54.8 bits (126), Expect = 1e-06 Identities = 33/97 (34%), Positives = 49/97 (50%), Gaps = 4/97 (4%) Frame = +1 Query: 226 QAVAQIHTFHNRANRGFTMSVNHLADRTDDELAALR---GRRYSGPSPHGLPFPYSKSRV 396 Q + +I F++ + G+ +N DRT +EL + + F K+ Sbjct: 67 QKLKEIKAFNSNSENGYKKGINQFTDRTAEELRETTLGYSKTVKNAANKQNMFRNLKTS- 125 Query: 397 EELSVK-LPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 ++++VK LP DWR G VTPVKDQ GSCW+F T Sbjct: 126 DKINVKDLPKSVDWRDAGVVTPVKDQGHCGSCWAFAT 162 >UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sativa|Rep: Cysteine proteinase-like - Oryza sativa subsp. 
japonica (Rice) Length = 360 Score = 54.4 bits (125), Expect = 2e-06 Identities = 33/87 (37%), Positives = 44/87 (50%), Gaps = 6/87 (6%) Frame = +1 Query: 256 NRA--NRGFTMSVNHLADRTDDELAALR-GRRYSGPSP---HGLPFPYSKSRVEELSVKL 417 NRA +R +T+ +N +D TDDE A G ++ P P HG + + Sbjct: 78 NRAGGDRTYTLGLNQFSDLTDDEFAQTHLGYSWAPPPPSHRHGHRAENGTAAAAADDTDV 137 Query: 418 PPEHDWRLFGAVTPVKDQLVFGSCWSF 498 P DWR GAVT VK+Q GSCW+F Sbjct: 138 PDSVDWRARGAVTEVKNQRSCGSCWAF 164 >UniRef50_Q5YER1 Cluster: Cysteine proteinase; n=1; Bigelowiella natans|Rep: Cysteine proteinase - Bigelowiella natans (Pedinomonas minutissima) (Chlorarachnion sp.(strain CCMP 621)) Length = 140 Score = 54.4 bits (125), Expect = 2e-06 Identities = 34/86 (39%), Positives = 41/86 (47%) Frame = +1 Query: 247 TFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPE 426 T HN +T+ +N AD T+ E +L Y G P+ R LS K Sbjct: 60 TRHNVGGYSYTVELNEFADLTNAEFRSL----YHGLKPNA----QGPRRTANLSTKSADS 111 Query: 427 HDWRLFGAVTPVKDQLVFGSCWSFGT 504 DW GAVTPVK+Q GSCWSF T Sbjct: 112 VDWVSKGAVTPVKNQGQCGSCWSFST 137 >UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|Rep: Cysteine proteinase - Acanthamoeba healyi Length = 330 Score = 54.4 bits (125), Expect = 2e-06 Identities = 35/90 (38%), Positives = 45/90 (50%), Gaps = 6/90 (6%) Frame = +1 Query: 253 HNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSK------SRVEELSVK 414 HNR N+ + +++N D T+ E L GL F YSK + E + Sbjct: 63 HNRQNKSYFLAMNQFGDLTNAEFNRLF---------KGLAFDYSKHAKIHTAAPEAPATG 113 Query: 415 LPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 +P E DWR GAVT VK+Q GSCWSF T Sbjct: 114 IPSEFDWRQKGAVTHVKNQGQCGSCWSFST 143 Score = 33.1 bits (72), Expect = 5.2 Identities = 18/29 (62%), Positives = 20/29 (68%) Frame = +2 Query: 515 EGALFLHKGGHLXWLSQQALIDCSWGFGN 601 EGA FL K G L LS+Q LIDCS +GN Sbjct: 148 EGANFL-KTGRLVSLSEQNLIDCSVSYGN 175 >UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas 
vaginalis G3 Length = 306 Score = 54.4 bits (125), Expect = 2e-06 Identities = 29/81 (35%), Positives = 45/81 (55%) Frame = +1 Query: 256 NRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDW 435 NR N GFT+++N A T++E ++ G +Y S +P +K+ + +P E DW Sbjct: 44 NRVNLGFTLALNRFAHLTENEYRSMLGYKYGHKS-----YPITKN----IKNDVPTEIDW 94 Query: 436 RLFGAVTPVKDQLVFGSCWSF 498 R G V +K+Q GSCW+F Sbjct: 95 REQGIVNKIKNQGACGSCWAF 115 >UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Trypanosoma cruzi|Rep: Cysteine proteinase, putative - Trypanosoma cruzi Length = 392 Score = 53.6 bits (123), Expect = 3e-06 Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 2/105 (1%) Frame = +1 Query: 193 RPGAREEAEHLQAVAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLP 372 R R A Q +A++ T + N + M +NH++D T +ELA+L G R S H L Sbjct: 69 REYVRRRALFEQTLARVRTHNEAGNHLYVMGINHMSDWTPEELASLNGARPRMMS-H-LA 126 Query: 373 FPYSKSRVEELSVKLPPEHDWRLF--GAVTPVKDQLVFGSCWSFG 501 + R + ++P E D+R +T VKDQ GSCW+ G Sbjct: 127 QKSLQRRYQSSGGRIPDEVDYRNSSPAILTAVKDQGRCGSCWAHG 171 >UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchocercidae|Rep: Cathepsin L-like precursor - Brugia pahangi (Filarial nematode worm) Length = 395 Score = 52.8 bits (121), Expect = 6e-06 Identities = 29/77 (37%), Positives = 42/77 (54%) Frame = +1 Query: 274 FTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAV 453 +T ++N LAD TD+E G R + S+ + S +LP + DWR GAV Sbjct: 135 YTTALNDLADLTDEEFMVRNGLRLPNQTDLRGKRQTSEFYRYDKSERLPDQVDWRTKGAV 194 Query: 454 TPVKDQLVFGSCWSFGT 504 TPV++Q GSC++F T Sbjct: 195 TPVRNQGECGSCYAFAT 211 >UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata|Rep: Cathepsin L2 precursor - Homo sapiens (Human) Length = 334 Score = 52.8 bits (121), Expect = 6e-06 Identities = 30/82 (36%), Positives = 42/82 (51%) Frame = +1 Query: 253 HNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHD 432 +++ GFTM++N D T++E + G + G F E L + LP D Sbjct: 66 YSQGKHGFTMAMNAFGDMTNEEFRQMMGCFRNQKFRKGKVFR------EPLFLDLPKSVD 119 
Query: 433 WRLFGAVTPVKDQLVFGSCWSF 498 WR G VTPVK+Q GSCW+F Sbjct: 120 WRKKGYVTPVKNQKQCGSCWAF 141 >UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryza sativa|Rep: Putative cysteine proteinase - Oryza sativa subsp. japonica (Rice) Length = 352 Score = 51.6 bits (118), Expect = 1e-05 Identities = 29/82 (35%), Positives = 43/82 (52%), Gaps = 2/82 (2%) Frame = +1 Query: 265 NRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKS--RVEELSVKLPPEHDWR 438 N+ + ++ N D TD E AA+ Y+G +P + + + R+ + P E DWR Sbjct: 81 NKRYRLATNRFTDLTDAEFAAM----YTGYNPANTMYAAANATTRLSSEDDQQPAEVDWR 136 Query: 439 LFGAVTPVKDQLVFGSCWSFGT 504 GAVT VK+Q G CW+F T Sbjct: 137 QQGAVTGVKNQRSCGCCWAFST 158 >UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; Phytophthora infestans|Rep: Cathepsin-like cysteine protease - Phytophthora infestans (Potato late blight fungus) Length = 376 Score = 51.2 bits (117), Expect = 2e-05 Identities = 32/98 (32%), Positives = 42/98 (42%) Frame = +1 Query: 205 REEAEHLQAVAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYS 384 R A +L+ + + + R FT+ +N LAD D E L R + Sbjct: 66 RSFATNLERIQTHNEAYERGEHSFTLGLNDLADLADAEYKQLLSYRTRDSKSSSASETFV 125 Query: 385 KSRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 K E LP DWR VTPVK+Q GSCW+F Sbjct: 126 KPENVE---DLPATWDWREHSTVTPVKNQGQCGSCWAF 160 >UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba culbertsoni|Rep: Cysteine proteinase - Acanthamoeba culbertsoni Length = 482 Score = 51.2 bits (117), Expect = 2e-05 Identities = 32/86 (37%), Positives = 44/86 (51%), Gaps = 5/86 (5%) Frame = +1 Query: 256 NRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYS-KSRVEE----LSVKLP 420 NR N FT+++N D T +E A L + S S L + +S +E+ +P Sbjct: 98 NRGNHTFTVAMNEHGDLTPEEFARLYMGQVSPASEQELQERIAAESAMEDEHHHTRASIP 157 Query: 421 PEHDWRLFGAVTPVKDQLVFGSCWSF 498 DWR GAVTPVK+Q SCW+F Sbjct: 158 ANWDWRTKGAVTPVKNQGSCASCWAF 183 >UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 392 Score = 51.2 bits 
(117), Expect = 2e-05 Identities = 29/85 (34%), Positives = 42/85 (49%), Gaps = 2/85 (2%) Frame = +1 Query: 256 NRANRGFTMSVNHLADRTDDELAALRGRR--YSGPSPHGLPFPYSKSRVEELSVKLPPEH 429 NR + + + NH AD TDDE + +G S + R + + ++P + Sbjct: 123 NRRSLPYKLEPNHFADLTDDEFKSYKGALDDESKDVMNDHDDVIDDDRSKRM-FEVPDQL 181 Query: 430 DWRLFGAVTPVKDQLVFGSCWSFGT 504 DWR +GAV P K Q GSCW+F T Sbjct: 182 DWRNYGAVNPAKGQGTCGSCWAFAT 206 Score = 49.6 bits (113), Expect = 6e-05 Identities = 24/60 (40%), Positives = 35/60 (58%), Gaps = 1/60 (1%) Frame = +2 Query: 92 TFNPMKEFVRPVHDAH-VHDEFERFKVKLQKQYASDLEHEKRLNIFRQSLRYIHSIIERT 268 + NPM EF H V D+F+ F+ + K Y D EH +R +IFR ++RYI S+ R+ Sbjct: 67 SINPMAEFTSLGHSRDLVDDDFDEFRQQHDKVYEDDSEHRRRKHIFRHNVRYIRSMNRRS 126 >UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 514 Score = 51.2 bits (117), Expect = 2e-05 Identities = 30/80 (37%), Positives = 42/80 (52%) Frame = +1 Query: 256 NRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDW 435 NR N + ++ NH D TD E ++ G S L PYS V +P E DW Sbjct: 255 NRKNLKYKLAPNHFVDLTDGEYD-----QHKGDSIITLYGPYSNMSHVLQRVDVPDELDW 309 Query: 436 RLFGAVTPVKDQLVFGSCWS 495 R +GAV+PV+ Q + GSC++ Sbjct: 310 RDYGAVSPVRGQGICGSCYA 329 Score = 48.8 bits (111), Expect = 1e-04 Identities = 28/81 (34%), Positives = 44/81 (54%), Gaps = 1/81 (1%) Frame = +2 Query: 17 DPDVFKVDSNMQCTGFPGPGSRHFATFNPMKEFVRPVH-DAHVHDEFERFKVKLQKQYAS 193 D D F++ +C RHF + NPM+EF+ D + + +++ + KQY S Sbjct: 175 DLDRFELPKGSECYNLSHSFDRHFVS-NPMQEFMSYGKVDFAIERMYRKYQGQHNKQYDS 233 Query: 194 DLEHEKRLNIFRQSLRYIHSI 256 + E KR +IFR ++RYI SI Sbjct: 234 EHEVSKRKHIFRHNMRYIRSI 254 Score = 38.3 bits (85), Expect = 0.14 Identities = 19/30 (63%), Positives = 21/30 (70%) Frame = +2 Query: 512 VEGALFLHKGGHLXWLSQQALIDCSWGFGN 601 VEGA F+ K G L LS Q +IDCSWG GN Sbjct: 336 VEGAYFM-KTGKLKELSAQQVIDCSWGSGN 364 >UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: Cysteine protease - Saprolegnia parasitica Length = 523 Score 
= 50.8 bits (116), Expect = 2e-05 Identities = 32/85 (37%), Positives = 40/85 (47%), Gaps = 1/85 (1%) Frame = +1 Query: 253 HNR-ANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEH 429 HN+ A+ FTM N + T DE LR PS Y+ +P E Sbjct: 61 HNKDASSSFTMGHNEYSHLTFDEFKKLRTGLRVSPSYIQSRAKYALMAPAVNMTDVPNEM 120 Query: 430 DWRLFGAVTPVKDQLVFGSCWSFGT 504 DW G VTPVK+Q + GSCW+F T Sbjct: 121 DWVEQGGVTPVKNQGMCGSCWAFST 145 >UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2; Brugia malayi|Rep: Cahepsin L-like cysteine protease - Brugia malayi (Filarial nematode worm) Length = 371 Score = 50.8 bits (116), Expect = 2e-05 Identities = 31/96 (32%), Positives = 48/96 (50%), Gaps = 3/96 (3%) Frame = +1 Query: 220 HLQAVAQIHTFHNRANRG---FTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKS 390 +L+ V +I + R R + +++NHLAD +E L G + + + + Sbjct: 78 YLKNVKEIEKHNERYERNEETYELAINHLADMLPEEFRKLHGFQSRKITSKN---NFKNT 134 Query: 391 RVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 +++ LP DWR GAVT VKDQ GSCW+F Sbjct: 135 IRMKINGPLPKSIDWRTSGAVTKVKDQGYCGSCWTF 170 >UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 317 Score = 50.8 bits (116), Expect = 2e-05 Identities = 28/95 (29%), Positives = 45/95 (47%) Frame = +1 Query: 214 AEHLQAVAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSR 393 +++LQ + Q + + F + VN AD T +E A+ + + + Sbjct: 41 SQNLQKIEQHNARYQNGEVSFYLGVNQFADMTSEEFKAMLDSQLIHKPKRDITSRF---- 96 Query: 394 VEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 V + + +P DWR GAV PV+DQ GSCW+F Sbjct: 97 VADPQLTVPESIDWREKGAVNPVRDQEQCGSCWAF 131 >UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3; Bilateria|Rep: Cathepsin L-like cysteine protease - Neobenedenia melleni Length = 335 Score = 50.8 bits (116), Expect = 2e-05 Identities = 31/100 (31%), Positives = 52/100 (52%), Gaps = 3/100 (3%) Frame = +1 Query: 214 
AEHLQAVAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYS-KS 390 +++L+ V + + + + + +T+++NH+AD + +E AL Y P P K+ Sbjct: 52 SKNLETVRKHNELYAQGKKSYTLAMNHMADLSSEEFKAL----YLVPKFDATKVPRKGKA 107 Query: 391 RVEELSVKLPP--EHDWRLFGAVTPVKDQLVFGSCWSFGT 504 E +K P E DW G VT VK+Q GSCW+F + Sbjct: 108 AGEHRQIKNDPPSEIDWVRKGHVTAVKNQAQCGSCWAFSS 147 >UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative cysteine proteinase - Oryza sativa subsp. japonica (Rice) Length = 357 Score = 50.4 bits (115), Expect = 3e-05 Identities = 28/73 (38%), Positives = 38/73 (52%) Frame = +1 Query: 280 MSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTP 459 ++ N AD T++E A GR +S P G F Y R ++ P +WR GAVT Sbjct: 94 LTTNKFADLTNEEFAEYYGRPFSTPVIGGSGFMYGNVRTSDV----PANINWRDRGAVTQ 149 Query: 460 VKDQLVFGSCWSF 498 VK+Q SCW+F Sbjct: 150 VKNQKDCASCWAF 162 >UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2); n=2; Tribolium castaneum|Rep: PREDICTED: similar to Cathepsin K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2) - Tribolium castaneum Length = 332 Score = 50.0 bits (114), Expect = 4e-05 Identities = 31/97 (31%), Positives = 48/97 (49%), Gaps = 1/97 (1%) Frame = +1 Query: 217 EHLQAVAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRV 396 ++L+ V + + + + M VN +D TD+EL+ L G + P P ++ + Sbjct: 53 KNLEIVEEHNERFRNGSETYEMGVNKFSDFTDEELSNLTGLQV--PLEFEQPLNETEDPL 110 Query: 397 -EELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 L + DWR G VTPVK+Q GSCW+F T Sbjct: 111 LPSLGRGISASLDWRQRGGVTPVKNQGQCGSCWAFAT 147 >UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar to human SRY (sex determining region Y)-box 30 (SOX30),transcript variant 1,; n=3; Mammalia|Rep: Testis cDNA clone: QtsA-12228, similar to human SRY (sex determining region Y)-box 30 (SOX30),transcript variant 1, - Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey) Length = 433 Score = 50.0 bits (114), Expect = 4e-05 
Identities = 29/82 (35%), Positives = 41/82 (50%) Frame = +1 Query: 253 HNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHD 432 +++ GF M++N D T++E + G + G F E L + LP D Sbjct: 66 YSQGKHGFAMAMNAFGDMTNEEFRQVMGCFRNQKLRKGKLFR------EPLFLDLPKSVD 119 Query: 433 WRLFGAVTPVKDQLVFGSCWSF 498 WR G VTPVK+Q GSCW+F Sbjct: 120 WRKKGYVTPVKNQKQCGSCWAF 141 >UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lamblia ATCC 50803|Rep: GLP_26_49243_47612 - Giardia lamblia ATCC 50803 Length = 543 Score = 50.0 bits (114), Expect = 4e-05 Identities = 20/38 (52%), Positives = 25/38 (65%) Frame = +1 Query: 388 SRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFG 501 S + V+ P + DWR+ G +TPVKDQ GSCWSFG Sbjct: 307 SEENQKRVQFPRQLDWRVRGVITPVKDQAACGSCWSFG 344 >UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio molitor (Yellow mealworm) Length = 336 Score = 50.0 bits (114), Expect = 4e-05 Identities = 37/115 (32%), Positives = 51/115 (44%), Gaps = 9/115 (7%) Frame = +1 Query: 187 RERPGAREEAEHLQAVAQ-IHTF--HNRANR----GFTMSVNHLADRTDDELAALRGRRY 345 R A+EE Q + + TF HN R +T+ VN D T +E+ A Sbjct: 36 RSYVNAKEETFRKQIFQKKLETFEEHNEKYRQGLVSYTLGVNLFTDMTPEEMKAYTHGLI 95 Query: 346 SGPSPH--GLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 H G+P + SV+ P DWR G V+PVK+Q GSCW+F + Sbjct: 96 MPADLHKNGIPIKTREDLGLNASVRYPASFDWRDQGMVSPVKNQGSCGSCWAFSS 150 >UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=17; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 318 Score = 50.0 bits (114), Expect = 4e-05 Identities = 27/82 (32%), Positives = 39/82 (47%) Frame = +1 Query: 253 HNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHD 432 HNRAN G+ +++NHL+ T E L G + + + + +P D Sbjct: 55 HNRANSGYQLTMNHLSCMTPSEYKVLLGHKQTKKI---------EGEAKIFKGDVPDAVD 105 Query: 433 WRLFGAVTPVKDQLVFGSCWSF 498 WR V P+KDQ GSCW+F Sbjct: 106 WRNAKIVNPIKDQAQCGSCWAF 127 >UniRef50_Q05094 Cluster: 
Cysteine proteinase 2 precursor; n=61; Leishmania|Rep: Cysteine proteinase 2 precursor - Leishmania pifanoi Length = 444 Score = 50.0 bits (114), Expect = 4e-05 Identities = 31/84 (36%), Positives = 39/84 (46%), Gaps = 2/84 (2%) Frame = +1 Query: 253 HNRANRGFTMSVNHLADRTDDELAA--LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPE 426 H N + D ++ E AA L G Y + Y K+R + +V P Sbjct: 72 HQARNPHAQFGITKFFDLSEAEFAARYLNGAAYFAAAKRHAAQHYRKARADLSAV--PDA 129 Query: 427 HDWRLFGAVTPVKDQLVFGSCWSF 498 DWR GAVTPVKDQ GSCW+F Sbjct: 130 VDWREKGAVTPVKDQGACGSCWAF 153 >UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MGC107932 protein - Xenopus tropicalis (Western clawed frog) (Silurana tropicalis) Length = 333 Score = 49.6 bits (113), Expect = 6e-05 Identities = 29/80 (36%), Positives = 41/80 (51%), Gaps = 1/80 (1%) Frame = +1 Query: 268 RGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFG 447 + + M++N AD TD+E ++ + P L P S+ +P E DWR Sbjct: 70 KSYRMAMNQFADLTDNERSS---KSCLLPREKSLN-PVKAESYSYTSITIPKEVDWRKSN 125 Query: 448 AVTPVKDQLVF-GSCWSFGT 504 VTPVK+Q F GSCW+F T Sbjct: 126 CVTPVKNQGTFCGSCWAFAT 145 >UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Actinidin Act3a - Actinidia eriantha Length = 380 Score = 49.6 bits (113), Expect = 6e-05 Identities = 31/85 (36%), Positives = 42/85 (49%), Gaps = 1/85 (1%) Frame = +1 Query: 253 HNR-ANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEH 429 HN NR +T+ +N AD TD+E + Y G L S + ++ LP Sbjct: 76 HNADPNRSYTVGLNQFADLTDEEYRST----YLG-FKSSLKSKVSNRYMPQVGEVLPDYV 130 Query: 430 DWRLFGAVTPVKDQLVFGSCWSFGT 504 DWR GAV VK+Q + SCW+F T Sbjct: 131 DWRTTGAVVDVKNQGLCSSCWAFAT 155 >UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 383 Score = 49.6 bits (113), Expect = 6e-05 Identities = 28/81 (34%), Positives = 40/81 (49%), Gaps = 1/81 (1%) Frame = +1 Query: 265 NRGFTMSVNHLADRTDDELAAL-RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRL 441 N G + VN D TD+EL + + +Y+ + P + E 
V P DWR Sbjct: 120 NLGLDLDVNEFTDWTDEELQKMVQENKYT---KYDFDTPKFEGSYLETGVIRPASIDWRE 176 Query: 442 FGAVTPVKDQLVFGSCWSFGT 504 G +TP+K+Q GSCW+F T Sbjct: 177 QGKLTPIKNQGQCGSCWAFAT 197 >UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; Dictyostelium discoideum|Rep: Cysteine proteinase 2 precursor - Dictyostelium discoideum (Slime mold) Length = 376 Score = 49.6 bits (113), Expect = 6e-05 Identities = 30/89 (33%), Positives = 45/89 (50%), Gaps = 1/89 (1%) Frame = +1 Query: 241 IHTFHNRANRGFTMSVNHLADRTDDELA-ALRGRRYSGPSPHGLPFPYSKSRVEELSVKL 417 + ++++ + + +N+ AD T++E G R + S +G VE+L Sbjct: 66 VDNWNSKGDSQTVLGLNNFADITNEEYRKTYLGTRVNAHSYNGYD-GREVLNVEDLQTN- 123 Query: 418 PPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 P DWR AVTP+KDQ GSCWSF T Sbjct: 124 PKSIDWRTKNAVTPIKDQGQCGSCWSFST 152 >UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=176; Viridiplantae|Rep: Cysteine proteinase RD21a precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 462 Score = 49.2 bits (112), Expect = 7e-05 Identities = 30/85 (35%), Positives = 40/85 (47%), Gaps = 1/85 (1%) Frame = +1 Query: 253 HNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVE-ELSVKLPPEH 429 HN N + + + AD T+DE + +Y G + R E + +LP Sbjct: 86 HNEKNLSYRLGLTRFADLTNDEYRS----KYLGAKMEKKGERRTSLRYEARVGDELPESI 141 Query: 430 DWRLFGAVTPVKDQLVFGSCWSFGT 504 DWR GAV VKDQ GSCW+F T Sbjct: 142 DWRKKGAVAEVKDQGGCGSCWAFST 166 >UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; n=23; Magnoliophyta|Rep: Senescence-specific cysteine protease - Arabidopsis thaliana (Mouse-ear cress) Length = 346 Score = 48.8 bits (111), Expect = 1e-04 Identities = 30/81 (37%), Positives = 41/81 (50%), Gaps = 2/81 (2%) Frame = +1 Query: 262 ANRGFTMSVNHLADRTDDELAAL-RGRRYSGPSPHGLPFPYSKSRVEELSV-KLPPEHDW 435 A R F ++VN AD T+DE ++ G + S R + +S LP DW Sbjct: 77 AGRTFKLAVNQFADLTNDEFRSMYTGFKGVSALSSQSQTKMSPFRYQNVSSGALPVSVDW 136 Query: 436 RLFGAVTPVKDQLVFGSCWSF 498 R GAVTP+K+Q G CW+F Sbjct: 137 RKKGAVTPIKNQGSCGCCWAF 157 >UniRef50_Q9ASM1 Cluster: Putative cysteine 
protease; n=6; Oryza sativa|Rep: Putative cysteine protease - Oryza sativa subsp. japonica (Rice) Length = 357 Score = 48.8 bits (111), Expect = 1e-04 Identities = 30/87 (34%), Positives = 42/87 (48%), Gaps = 1/87 (1%) Frame = +1 Query: 241 IHTFHNRANRGFTMSVNHLADRTDDE-LAALRGRRYSGPSPHGLPFPYSKSRVEELSVKL 417 I ++ A + +N AD T+ E +A G + P+ H P P R + + + Sbjct: 75 IRSYRPEATYDSAVRINQFADLTNGEFVATYTGVKQPPPATHPHPHPEEAPRPVD-PIWM 133 Query: 418 PPEHDWRLFGAVTPVKDQLVFGSCWSF 498 P DWR GAVT VKDQ GS W+F Sbjct: 134 PCCIDWRFKGAVTGVKDQGACGSSWAF 160 >UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precursor; n=3; Metazoa|Rep: Digestive cysteine proteinase 2 precursor - Homarus americanus (American lobster) Length = 323 Score = 48.8 bits (111), Expect = 1e-04 Identities = 31/92 (33%), Positives = 46/92 (50%), Gaps = 4/92 (4%) Frame = +1 Query: 241 IHTFHNRANRG---FTMSVNHLADRTDDEL-AALRGRRYSGPSPHGLPFPYSKSRVEELS 408 I F+ + G F +++N D T +E A ++G +P + +P ++ + Sbjct: 51 IEEFNKKYENGEVTFNLAMNKFGDMTLEEFNAVMKGNIPRRSAPVSVFYPKKETGPQATE 110 Query: 409 VKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 V DWR GAVTPVKDQ GSCW+F T Sbjct: 111 V------DWRTKGAVTPVKDQGQCGSCWAFST 136 >UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin heavy chain; n=3; Amniota|Rep: PREDICTED: similar to ferritin heavy chain - Ornithorhynchus anatinus Length = 338 Score = 48.4 bits (110), Expect = 1e-04 Identities = 30/84 (35%), Positives = 45/84 (53%), Gaps = 3/84 (3%) Frame = +1 Query: 256 NRANRGFTMSVNHLADRTDDEL-AALRGRR--YSGPSPHGLPFPYSKSRVEELSVKLPPE 426 ++ + +++NH D+T++EL L G R G G +S+ S + P E Sbjct: 67 SQGKHSYRLAMNHFGDQTNEELHERLNGFRPDLGGALRSGREQARFRSKT---SWEGPEE 123 Query: 427 HDWRLFGAVTPVKDQLVFGSCWSF 498 DWR G VTPVK+Q + GSCW+F Sbjct: 124 VDWRTKGYVTPVKNQGLCGSCWAF 147 >UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin S preproprotein; n=2; Tribolium castaneum|Rep: PREDICTED: similar to cathepsin S preproprotein - Tribolium castaneum Length = 525 Score = 48.0 bits (109), Expect = 2e-04 Identities = 29/75 
(38%), Positives = 40/75 (53%) Frame = +1 Query: 274 FTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAV 453 + + +N L+D TD+E++ + PS LP + SR LP DWRL G V Sbjct: 270 YYLRINDLSDYTDEEMSCC-SEKAPKPSITILPNVSTSSRQN-----LPKMVDWRLRGVV 323 Query: 454 TPVKDQLVFGSCWSF 498 TPVK Q G+CW+F Sbjct: 324 TPVKHQGKCGTCWAF 338 Score = 44.0 bits (99), Expect = 0.003 Identities = 23/49 (46%), Positives = 28/49 (57%) Frame = +1 Query: 352 PSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 P+P + FP +R + LP DWRL G VTPVK Q GSCW+F Sbjct: 17 PNPSIVIFPNMSARPQS---DLPDMVDWRLQGVVTPVKRQGKCGSCWAF 62 >UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativa|Rep: Os01g0347600 protein - Oryza sativa subsp. japonica (Rice) Length = 343 Score = 48.0 bits (109), Expect = 2e-04 Identities = 30/73 (41%), Positives = 37/73 (50%) Frame = +1 Query: 280 MSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTP 459 + +N AD T+DE A Y+G P P P R + + P DWR GAVT Sbjct: 88 VGINQFADLTNDEFVAT----YTGAKP---PHPKEAPRPVD-PIWTPCCIDWRFRGAVTG 139 Query: 460 VKDQLVFGSCWSF 498 VKDQ GSCW+F Sbjct: 140 VKDQGACGSCWAF 152 >UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative uncharacterized protein - Oryza sativa subsp. 
japonica (Rice) Length = 289 Score = 48.0 bits (109), Expect = 2e-04 Identities = 30/73 (41%), Positives = 37/73 (50%) Frame = +1 Query: 280 MSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTP 459 + +N AD T+DE A Y+G P P P R + + P DWR GAVT Sbjct: 87 VGINQFADLTNDEFVAT----YTGAKP---PHPKEAPRPVD-PIWTPCCIDWRFRGAVTG 138 Query: 460 VKDQLVFGSCWSF 498 VKDQ GSCW+F Sbjct: 139 VKDQGACGSCWAF 151 >UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 326 Score = 48.0 bits (109), Expect = 2e-04 Identities = 31/77 (40%), Positives = 40/77 (51%) Frame = +1 Query: 274 FTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAV 453 F + V AD T+ E + + G S S +S + V++L P + DWR GAV Sbjct: 68 FKLGVTKFADLTEKEFSDMLGISRSTKSSRPRVI-HSLTPVKDL----PSKFDWREKGAV 122 Query: 454 TPVKDQLVFGSCWSFGT 504 T VKDQ GSCWSF T Sbjct: 123 TEVKDQGSCGSCWSFST 139 Score = 35.5 bits (78), Expect = 0.97 Identities = 17/43 (39%), Positives = 26/43 (60%) Frame = +2 Query: 125 VHDAHVHDEFERFKVKLQKQYASDLEHEKRLNIFRQSLRYIHS 253 VH +E+ +FKV+ K Y + +E +KR IF+ SLR I + Sbjct: 14 VHALSDKEEWVQFKVRNNKSYRNYIEEQKRFTIFQGSLRKIEN 56 >UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|Rep: Cruzipain precursor - Trypanosoma cruzi Length = 467 Score = 48.0 bits (109), Expect = 2e-04 Identities = 35/102 (34%), Positives = 46/102 (45%), Gaps = 3/102 (2%) Frame = +1 Query: 202 AREEAEHLQAVAQ---IHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLP 372 A EEA L + + H AN T V +D T +E R R ++G + Sbjct: 52 AAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEF---RSRYHNGAAHFAAA 108 Query: 373 FPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 ++ V+ V P DWR GAVT VKDQ GSCW+F Sbjct: 109 QERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAF 150 >UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa zeasingle nucleocapsid nuclear 
polyhedrosis virus) Length = 367 Score = 48.0 bits (109), Expect = 2e-04 Identities = 26/73 (35%), Positives = 41/73 (56%), Gaps = 2/73 (2%) Frame = +1 Query: 286 VNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELS--VKLPPEHDWRLFGAVTP 459 VN +D+T DE+ + S H + ++R+ + + ++LP +DWR VTP Sbjct: 114 VNKFSDKTPDEVLHSNTGFFLNLSQH---YTLCENRIVKGAPDIRLPDYYDWRDTNKVTP 170 Query: 460 VKDQLVFGSCWSF 498 +KDQ V GSCW+F Sbjct: 171 IKDQGVCGSCWAF 183 >UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomuscorum|Rep: Cathepsin H - Oxytricha trifallax (Sterkiella histriomuscorum) Length = 366 Score = 47.6 bits (108), Expect = 2e-04 Identities = 30/102 (29%), Positives = 45/102 (44%) Frame = +1 Query: 199 GAREEAEHLQAVAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFP 378 G +A + QI ++ + +N +D TD+E Y+ + Sbjct: 68 GIDRKATFANKLQQIIKHNSDGTNTYKKGLNAFSDMTDEEFFDY----YNIKAEQNCSAT 123 Query: 379 YSKSRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 KS + +P E DWR FG V+PVK+Q GSCW+F T Sbjct: 124 NRKS-FGNSNANIPTEWDWRTFGVVSPVKNQGKCGSCWTFST 164 >UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|Rep: CG4847-PD, isoform D - Drosophila melanogaster (Fruit fly) Length = 420 Score = 47.6 bits (108), Expect = 2e-04 Identities = 31/79 (39%), Positives = 39/79 (49%), Gaps = 2/79 (2%) Frame = +1 Query: 274 FTMSVNHLADRTDDE-LAALRGRRYSGPSPHGLPFPYSKSRVEELSVK-LPPEHDWRLFG 447 F +VN AD T E L+ L G + S P + ++ L K +P DWR G Sbjct: 157 FKQAVNAFADLTHSEFLSQLTGLKRS---PEAKARAAASLKLVNLPAKPIPDAFDWREHG 213 Query: 448 AVTPVKDQLVFGSCWSFGT 504 VTPVK Q GSCW+F T Sbjct: 214 GVTPVKFQGTCGSCWAFAT 232 >UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1 precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 355 Score = 47.6 bits (108), Expect = 2e-04 Identities = 29/83 (34%), Positives = 39/83 (46%) Frame = +1 Query: 256 NRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDW 435 N + + +N AD T +E R + P P + R +++ LP DW Sbjct: 86 
NNEINSYWLGLNEFADLTHEEFKG-RYLGLAKPQFSRKRQPSANFRYRDIT-DLPKSVDW 143 Query: 436 RLFGAVTPVKDQLVFGSCWSFGT 504 R GAV PVKDQ GSCW+F T Sbjct: 144 RKKGAVAPVKDQGQCGSCWAFST 166 >UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudicotyledons|Rep: Chymopapain precursor - Carica papaya (Papaya) Length = 352 Score = 47.6 bits (108), Expect = 2e-04 Identities = 27/83 (32%), Positives = 40/83 (48%) Frame = +1 Query: 256 NRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDW 435 N+ N + + +N AD ++DE + + GL ++ + P DW Sbjct: 83 NKKNNSYWLGLNGFADLSNDEFKK-KYVGFVAEDFTGLEHFDNEDFTYKHVTNYPQSIDW 141 Query: 436 RLFGAVTPVKDQLVFGSCWSFGT 504 R GAVTPVK+Q GSCW+F T Sbjct: 142 RAKGAVTPVKNQGACGSCWAFST 164 >UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35; Viridiplantae|Rep: Cysteine proteinase 15A precursor - Pisum sativum (Garden pea) Length = 363 Score = 47.6 bits (108), Expect = 2e-04 Identities = 20/33 (60%), Positives = 23/33 (69%) Frame = +1 Query: 406 SVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 + LP + DWR GAVTPVKDQ GSCW+F T Sbjct: 129 TTNLPEDFDWREKGAVTPVKDQGSCGSCWAFST 161 >UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine protease; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cysteine protease - Strongylocentrotus purpuratus Length = 494 Score = 47.2 bits (107), Expect = 3e-04 Identities = 18/28 (64%), Positives = 23/28 (82%) Frame = +1 Query: 415 LPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 +P E+DWR GAVTPVK+Q + GSCW+F Sbjct: 240 VPEEYDWRTHGAVTPVKNQGMCGSCWAF 267 >UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 47.2 bits (107), Expect = 3e-04 Identities = 28/89 (31%), Positives = 43/89 (48%) Frame = +1 Query: 238 QIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKL 417 QI + ++ GF +N + T +E A R P+ S+ ++ KL Sbjct: 69 
QIELDNMNSDNGFISGINKFSHLTKEEFKAKYLNRPQRPASEMKTNSILSSQ-QKTDEKL 127 Query: 418 PPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 P DWR GAV+PV+DQ GSC++F + Sbjct: 128 PESVDWRKLGAVSPVRDQGNCGSCYAFAS 156 >UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 4 - Rhipicephalus appendiculatus (Brown ear tick) Length = 345 Score = 47.2 bits (107), Expect = 3e-04 Identities = 28/77 (36%), Positives = 38/77 (49%) Frame = +1 Query: 274 FTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAV 453 ++++VNH AD T DE+ A Y+G P L P +WR G V Sbjct: 83 YSVAVNHFADMTPDEVVA----NYTGYKPPSAQQLAEIPLYAPLFGDTPEFIEWRENGFV 138 Query: 454 TPVKDQLVFGSCWSFGT 504 TPVK+Q GSCW+F + Sbjct: 139 TPVKNQGQCGSCWAFSS 155 >UniRef50_Q22RC9 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 389 Score = 47.2 bits (107), Expect = 3e-04 Identities = 22/52 (42%), Positives = 31/52 (59%), Gaps = 1/52 (1%) Frame = +1 Query: 352 PSPHGLPFPYSKSR-VEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 PS + F S+ +++S P +DWR GAVTPVK+Q G+CW+F T Sbjct: 103 PSTYARNFTGSRYHGFQKISQDAPTSYDWRDHGAVTPVKNQGTVGTCWTFST 154 >UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=15; Magnoliophyta|Rep: Cysteine proteinase RD19a precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 368 Score = 47.2 bits (107), Expect = 3e-04 Identities = 30/82 (36%), Positives = 39/82 (47%) Frame = +1 Query: 253 HNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHD 432 H + + T V +D T E R + S LP +K+ + LP + D Sbjct: 85 HQKLDPSATHGVTQFSDLTRSEF---RKKHLGVRSGFKLPKDANKAPILPTE-NLPEDFD 140 Query: 433 WRLFGAVTPVKDQLVFGSCWSF 498 WR GAVTPVK+Q GSCWSF Sbjct: 141 WRDHGAVTPVKNQGSCGSCWSF 162 Score = 34.7 bits (76), Expect = 1.7 Identities = 15/32 (46%), Positives = 21/32 (65%) Frame = +2 Query: 146 DEFERFKVKLQKQYASDLEHEKRLNIFRQSLR 241 D F FK K K YAS+ EH+ R ++F+ +LR Sbjct: 49 
DHFSLFKRKFGKVYASNEEHDYRFSVFKANLR 80 >UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|Rep: LD36817p - Drosophila melanogaster (Fruit fly) Length = 352 Score = 46.8 bits (106), Expect = 4e-04 Identities = 33/81 (40%), Positives = 39/81 (48%), Gaps = 3/81 (3%) Frame = +1 Query: 271 GFTMSVNHLADRTDDELAALRGRRYS--GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLF 444 GF + VN LAD T E+A L G + S G + +R S LP DWR Sbjct: 81 GFRLGVNTLADMTRKEIATLLGSKISEFGERYTNGHINFVTAR-NPASANLPEMFDWREK 139 Query: 445 GAVTPVKDQLV-FGSCWSFGT 504 G VTP Q V G+CWSF T Sbjct: 140 GGVTPPGFQGVGCGACWSFAT 160 >UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schistosoma|Rep: Preprocathepsin cathepsin L - Schistosoma japonicum (Blood fluke) Length = 331 Score = 46.8 bits (106), Expect = 4e-04 Identities = 27/82 (32%), Positives = 41/82 (50%) Frame = +1 Query: 253 HNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHD 432 H+ G+TM +N D +E+ + + G SP + + +E + +P D Sbjct: 65 HDLGLEGYTMGLNQFCDMEWEEVNRIMFPKVFGNSPL---WNDDGNELELTNKPVPSTWD 121 Query: 433 WRLFGAVTPVKDQLVFGSCWSF 498 WR GAVT VK Q + GSCW+F Sbjct: 122 WRDHGAVTAVKHQGLCGSCWAF 143 >UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 291 Score = 46.8 bits (106), Expect = 4e-04 Identities = 29/84 (34%), Positives = 41/84 (48%) Frame = +1 Query: 253 HNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHD 432 HN+AN + +S+N L+ T E +L G + L K R + P D Sbjct: 30 HNKANANYKLSLNSLSHLTPTEYQSLLGTKID----KNLVSQGKKVRPQIKDS--PGILD 83 Query: 433 WRLFGAVTPVKDQLVFGSCWSFGT 504 +R G V P++DQ GSCW+FGT Sbjct: 84 YREMGVVNPIRDQKQCGSCWAFGT 107 >UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber officinale (Ginger) Length = 475 Score = 46.4 bits (105), Expect = 5e-04 Identities = 34/96 (35%), Positives = 49/96 (51%), Gaps = 2/96 (2%) Frame = +1 Query: 217 
EHLQAVAQIHTFHNRANRGFTMSVNHLADRTDDELAA--LRGRRYSGPSPHGLPFPYSKS 390 E+L+ V + + +R + + +N AD T++E A LR G S G ++ Sbjct: 78 ENLRFVDEHNAAADRGEHAYRLGMNRFADLTNEEYRARFLRDLSRLGRSTSGEIS--NQY 135 Query: 391 RVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 R+ E V LP DWR GAV VK+Q GSCW+F Sbjct: 136 RLREGDV-LPDSIDWREKGAVVAVKNQGRCGSCWAF 170 >UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Plasmodium|Rep: Cysteine protease falcipain-3 - Plasmodium falciparum Length = 492 Score = 46.4 bits (105), Expect = 5e-04 Identities = 17/26 (65%), Positives = 21/26 (80%) Frame = +1 Query: 427 HDWRLFGAVTPVKDQLVFGSCWSFGT 504 +DWRL G VTPVKDQ + GSCW+F + Sbjct: 273 YDWRLHGGVTPVKDQALCGSCWAFSS 298 >UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foetus|Rep: TFCP2 protein - Tritrichomonas foetus (Trichomonas foetus) Length = 270 Score = 46.4 bits (105), Expect = 5e-04 Identities = 29/82 (35%), Positives = 37/82 (45%), Gaps = 1/82 (1%) Frame = +1 Query: 256 NRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVK-LPPEHD 432 N G+T+S+ H A T E A+L S H S E + K P D Sbjct: 3 NSKGHGYTLSLYHFATYTSSEYASLLNVPSGRMSSH-------HSHHERIQYKDTPTSFD 55 Query: 433 WRLFGAVTPVKDQLVFGSCWSF 498 WR G V P+K+Q GSCW+F Sbjct: 56 WRSEGKVNPIKNQGSCGSCWAF 77 >UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin L - Misgurnus mizolepis (Mud loach) Length = 337 Score = 46.0 bits (104), Expect = 7e-04 Identities = 29/91 (31%), Positives = 42/91 (46%), Gaps = 2/91 (2%) Frame = +1 Query: 238 QIHTF-HNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELS-V 411 Q H H+ + + +NH D +E R+ H + S E + + Sbjct: 60 QFHNLEHSMGIHTYRLGMNHFGDMNHEEF-----RQVMNGYKHKTERKFKGSLFMEPNFL 114 Query: 412 KLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 ++P + DWR G VTPVKDQ GSCW+F T Sbjct: 115 EVPSKLDWREKGYVTPVKDQGECGSCWAFST 145 >UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; Dictyostelium discoideum|Rep: Cysteine proteinase 1 precursor - Dictyostelium discoideum (Slime mold) Length = 343 Score = 46.0 bits (104), Expect = 7e-04 Identities = 
20/36 (55%), Positives = 23/36 (63%) Frame = +1 Query: 397 EELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 +E +P DWR GAVTPVK+Q GSCWSF T Sbjct: 112 DEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFST 147 >UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; n=6; Chrysomelinae|Rep: Cathepsin L-like proteinase precursor - Phaedon cochleariae (Mustard beetle) Length = 324 Score = 46.0 bits (104), Expect = 7e-04 Identities = 30/96 (31%), Positives = 47/96 (48%), Gaps = 2/96 (2%) Frame = +1 Query: 223 LQAVAQIHTFHNRANRGFTMSVNHLADRTDDELA-ALRGRRYSGPSPHGLPFPYSKSRVE 399 L+ +A+ + + + +++N +D TD+E L S P+ GL V Sbjct: 51 LRQIAEHNVKYENGESTYYLAINKFSDITDEEFRDMLMKNEASRPNLEGL-------EVA 103 Query: 400 ELSVKLPPEH-DWRLFGAVTPVKDQLVFGSCWSFGT 504 +L+V PE DWR G V PV++Q GSCW+ T Sbjct: 104 DLTVGAAPESIDWRSKGVVLPVRNQGECGSCWALST 139 >UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|Rep: Cathepsin F precursor - Homo sapiens (Human) Length = 484 Score = 46.0 bits (104), Expect = 7e-04 Identities = 19/27 (70%), Positives = 21/27 (77%) Frame = +1 Query: 418 PPEHDWRLFGAVTPVKDQLVFGSCWSF 498 PPE DWR GAVT VKDQ + GSCW+F Sbjct: 272 PPEWDWRSKGAVTKVKDQGMCGSCWAF 298 >UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin L preproprotein; n=1; Monodelphis domestica|Rep: PREDICTED: similar to cathepsin L preproprotein - Monodelphis domestica Length = 356 Score = 45.6 bits (103), Expect = 0.001 Identities = 26/96 (27%), Positives = 44/96 (45%) Frame = +1 Query: 217 EHLQAVAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRV 396 ++L+ + + + + M +N D TD E + R + P Y+ R Sbjct: 54 KNLKLINDHNRLFKEGKKSYFMGMNQFGDMTDKEFESRLNLRIA---PVRTRRNYTFKR- 109 Query: 397 EELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 + +LP DWR G VTP+++Q G+CW+F T Sbjct: 110 -RIYYRLPKSVDWRTHGYVTPIRNQGECGACWAFST 144 >UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 360 Score = 45.6 
bits (103), Expect = 0.001 Identities = 31/86 (36%), Positives = 41/86 (47%), Gaps = 2/86 (2%) Frame = +1 Query: 253 HNRANRGFTMSVNHLADRTDDELAAL-RGRRYSGPSPHGLPFPYSKSRVEELSV-KLPPE 426 HN+ + VN AD T +E AL G ++S +K++ L LP Sbjct: 79 HNKFLVFSKVGVNQFADLTHEEFKALYTGHKHSKDDDDD----DNKNKQPHLPTDNLPAS 134 Query: 427 HDWRLFGAVTPVKDQLVFGSCWSFGT 504 DWR GA+TPVK Q G CW+F T Sbjct: 135 FDWRDKGAITPVKVQNGCGGCWAFST 160 >UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thaliana|Rep: F9P14.12 protein - Arabidopsis thaliana (Mouse-ear cress) Length = 343 Score = 45.6 bits (103), Expect = 0.001 Identities = 27/91 (29%), Positives = 44/91 (48%) Frame = +1 Query: 226 QAVAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEEL 405 Q+ Q+ + N + F ++ N AD T+ E A + G + L + V + Sbjct: 68 QSNVQLIDYINSLHLPFKLTDNRFADMTNSEFKA----HFLGLNTSSLRLHKKQRPVCDP 123 Query: 406 SVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 + +P DWR GAVTP+++Q G CW+F Sbjct: 124 AGNVPDAVDWRTQGAVTPIRNQGKCGGCWAF 154 >UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gambiae str. PEST|Rep: ENSANGP00000018713 - Anopheles gambiae str. 
PEST Length = 559 Score = 45.6 bits (103), Expect = 0.001 Identities = 20/42 (47%), Positives = 28/42 (66%) Frame = +2 Query: 131 DAHVHDEFERFKVKLQKQYASDLEHEKRLNIFRQSLRYIHSI 256 DAHV F++F+ ++QYAS +EHE R NIFR +L I + Sbjct: 242 DAHVRRMFDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIEQL 283 Score = 39.1 bits (87), Expect = 0.079 Identities = 17/28 (60%), Positives = 19/28 (67%) Frame = +1 Query: 415 LPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 LP DWR GAVT VK+Q GSCW+F Sbjct: 339 LPRSFDWRDHGAVTEVKNQGSCGSCWAF 366 >UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine protease; n=11; Callosobruchus maculatus|Rep: Putative gut cathepsin L-like cysteine protease - Callosobruchus maculatus (Southern cowpea weevil) (Pulse bruchid) Length = 326 Score = 45.6 bits (103), Expect = 0.001 Identities = 28/83 (33%), Positives = 39/83 (46%), Gaps = 1/83 (1%) Frame = +1 Query: 253 HNRANRGFTMSVNHLADRTDDE-LAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEH 429 + R F V AD T +E L L+ + + + F E++ ++ Sbjct: 61 YERGEESFAKKVTQFADMTHEEFLDLLKLQGVPALPSNAVHF----DNFEDIDMEEKDAV 116 Query: 430 DWRLFGAVTPVKDQLVFGSCWSF 498 DWR GAVTPVKDQ GSCW+F Sbjct: 117 DWREEGAVTPVKDQANCGSCWAF 139 >UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviatus|Rep: Cathepsin L 2 - Diaprepes abbreviatus (Sugarcane rootstalk borer weevil) Length = 348 Score = 45.6 bits (103), Expect = 0.001 Identities = 20/30 (66%), Positives = 22/30 (73%) Frame = +1 Query: 409 VKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 V LP + DWR GAVTPVK+Q GSCWSF Sbjct: 133 VDLPTDIDWRQKGAVTPVKNQRNCGSCWSF 162 >UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [Contains: Cathepsin H mini chain; Cathepsin H heavy chain; Cathepsin H light chain]; n=37; Eukaryota|Rep: Cathepsin H precursor (EC 3.4.22.16) [Contains: Cathepsin H mini chain; Cathepsin H heavy chain; Cathepsin H light chain] - Homo sapiens (Human) Length = 335 Score = 45.6 bits (103), Expect = 0.001 Identities = 29/85 (34%), Positives = 40/85 (47%), Gaps = 1/85 (1%) Frame = +1 Query: 253 
HNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHD 432 HN N F M++N +D + E+ +Y P +KS + PP D Sbjct: 68 HNNGNHTFKMALNQFSDMSFAEIK----HKYLWSEPQNCSA--TKSNYLRGTGPYPPSVD 121 Query: 433 WRLFGA-VTPVKDQLVFGSCWSFGT 504 WR G V+PVK+Q GSCW+F T Sbjct: 122 WRKKGNFVSPVKNQGACGSCWTFST 146 >UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F like protease; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to cathepsin F like protease - Nasonia vitripennis Length = 1036 Score = 45.2 bits (102), Expect = 0.001 Identities = 26/72 (36%), Positives = 36/72 (50%), Gaps = 1/72 (1%) Frame = +1 Query: 286 VNHLADRTDDELAALR-GRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPV 462 V D T E A G + + S + +P P + ++LP ++DWR VTPV Sbjct: 777 VTQFTDLTKAEFKARHLGLKPTLKSENDIPMPMATIP----DIELPSDYDWRHHNVVTPV 832 Query: 463 KDQLVFGSCWSF 498 KDQ GSCW+F Sbjct: 833 KDQGSCGSCWAF 844 >UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 344 Score = 45.2 bits (102), Expect = 0.001 Identities = 30/93 (32%), Positives = 43/93 (46%), Gaps = 9/93 (9%) Frame = +1 Query: 253 HNR-ANRGFTMSVNHLADRTDDEL-------AALRGRRYSGPSPHGLPFPYSKSRVEE-L 405 HN + +T+ NHL+D T +E A + G + G S V+ + Sbjct: 72 HNSDPSHSYTLGHNHLSDMTHEEFSLYQLNPARTASKSSKGGNNSGNSSGSSNPYVDPPI 131 Query: 406 SVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 + K P DWR A+TPVK Q GSCW+F + Sbjct: 132 TTKNAPPMDWRNASAITPVKQQGKCGSCWTFAS 164 >UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep: Cathepsin - Petromyzon marinus (Sea lamprey) Length = 333 Score = 45.2 bits (102), Expect = 0.001 Identities = 32/98 (32%), Positives = 47/98 (47%), Gaps = 4/98 (4%) Frame = +1 Query: 217 EHLQAVAQIHTFHNRANRGFTMSVNHLADRTDDEL-AALRGRRYS---GPSPHGLPFPYS 384 ++L+ V Q + + N F + +N +D E + GR ++ G G PFP Sbjct: 53 QNLKRVLQHNLLADEGNVSFHLGINKYSDLELHEYHEKVVGRFWNLRNGTRRRGAPFPLR 112 Query: 385 KSRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 
498 LP + DWRL G VTPVK+Q + GS W+F Sbjct: 113 SMD------NLPEQVDWRLKGYVTPVKEQGLCGSSWAF 144 >UniRef50_Q24E33 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 328 Score = 45.2 bits (102), Expect = 0.001 Identities = 28/83 (33%), Positives = 41/83 (49%) Frame = +1 Query: 256 NRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDW 435 N N F +++N +A TD+E ++L + + S E +P E +W Sbjct: 77 NAKNNTFKLAINIMAILTDEEYSSLY---LNLDQQESIDIFDSLVDDNETVGDIPSEVNW 133 Query: 436 RLFGAVTPVKDQLVFGSCWSFGT 504 GAVTPVK+Q GSCW+F T Sbjct: 134 TAQGAVTPVKNQGSCGSCWAFST 156 >UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; n=35; Fasciola|Rep: Cathepsin L-like proteinase precursor - Fasciola hepatica (Liver fluke) Length = 326 Score = 45.2 bits (102), Expect = 0.001 Identities = 29/80 (36%), Positives = 38/80 (47%), Gaps = 3/80 (3%) Frame = +1 Query: 274 FTMSVNHLADRTDDELAA---LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLF 444 +T+ +N D T +E A R S HG+P+ + V P + DWR Sbjct: 65 YTLGLNQFTDMTFEEFKAKYLTEMSRASDILSHGVPYEANNRAV-------PDKIDWRES 117 Query: 445 GAVTPVKDQLVFGSCWSFGT 504 G VT VKDQ GSCW+F T Sbjct: 118 GYVTEVKDQGNCGSCWAFST 137 >UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n=1; Xenopus tropicalis|Rep: UPI00006A2275 UniRef100 entry - Xenopus tropicalis Length = 272 Score = 44.8 bits (101), Expect = 0.002 Identities = 30/77 (38%), Positives = 40/77 (51%), Gaps = 2/77 (2%) Frame = +1 Query: 274 FTMSVNHLADRTDDELAA-LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGA 450 + + +NHL D T +E+AA + G SG S + S E L PP DWR Sbjct: 35 YEVGMNHLGDMTGEEVAATMTGYTGSGDSLANM----SHVPKEILEALAPPSIDWRTQNC 90 Query: 451 VTPVKDQLVF-GSCWSF 498 VTPV+DQ F SC++F Sbjct: 91 VTPVRDQGSFCRSCYAF 107 >UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Rep: Cysteine protease - Solanum lycopersicum (Tomato) (Lycopersicon esculentum) Length = 345 Score = 44.8 bits (101), Expect = 
0.002 Identities = 31/91 (34%), Positives = 44/91 (48%), Gaps = 5/91 (5%) Frame = +1 Query: 241 IHTFHNRANRGFTMSVNHLADRTDDE-LAALRGRRYSGPSPHGLPFPYSKS---RVEELS 408 I + + N + + +N AD T E LA G P+ + P P S + ++ +LS Sbjct: 70 IESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNI--PNSYLSPSPMSSTEFKKINDLS 127 Query: 409 VKLPPEH-DWRLFGAVTPVKDQLVFGSCWSF 498 P + DWR GAVT VK Q G CW+F Sbjct: 128 DDYMPSNLDWRESGAVTQVKHQGRCGCCWAF 158 >UniRef50_Q248G1 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 334 Score = 44.8 bits (101), Expect = 0.002 Identities = 28/83 (33%), Positives = 39/83 (46%), Gaps = 1/83 (1%) Frame = +1 Query: 253 HNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHD 432 HN N FT S+N AD TD+E + R + P + L ++P D Sbjct: 70 HNSQNPTFTQSLNQFADFTDEE---FKYRVLNTKVSQTRPKKGRRLESRVLDQQIPESVD 126 Query: 433 WR-LFGAVTPVKDQLVFGSCWSF 498 WR + V P+K+Q GSCW+F Sbjct: 127 WRNVTNVVGPIKNQGHCGSCWTF 149 >UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Oryza sativa|Rep: Cysteine protease 1 precursor - Oryza sativa subsp. 
japonica (Rice) Length = 490 Score = 44.8 bits (101), Expect = 0.002 Identities = 29/77 (37%), Positives = 38/77 (49%), Gaps = 1/77 (1%) Frame = +1 Query: 271 GFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGA 450 GF + +N AD T+ E A Y G +P G ++ + LP DWR GA Sbjct: 111 GFRLGMNRFADLTNGEFRAT----YLGTTPAGRGRRVGEAYRHDGVEALPDSVDWRDKGA 166 Query: 451 VT-PVKDQLVFGSCWSF 498 V PVK+Q GSCW+F Sbjct: 167 VVAPVKNQGQCGSCWAF 183 >UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase) (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1; Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase) (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1; Vignain-2] - Vigna mungo (Rice bean) (Black gram) Length = 362 Score = 44.8 bits (101), Expect = 0.002 Identities = 29/92 (31%), Positives = 44/92 (47%), Gaps = 1/92 (1%) Frame = +1 Query: 232 VAQIHTFHNRANRGFTMSVNHLADRTDDEL-AALRGRRYSGPSPHGLPFPYSKSRVEELS 408 V +H N+ ++ + + +N AD T+ E + G + + S + + E Sbjct: 67 VMHVHNT-NKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKV 125 Query: 409 VKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 +P DWR GAVT VKDQ GSCW+F T Sbjct: 126 GSVPASVDWRKKGAVTDVKDQGQCGSCWAFST 157 >UniRef50_Q650X9 Cluster: Putative cysteine proteinase; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative cysteine proteinase - Oryza sativa subsp. 
japonica (Rice) Length = 361 Score = 44.4 bits (100), Expect = 0.002 Identities = 30/85 (35%), Positives = 42/85 (49%), Gaps = 2/85 (2%) Frame = +1 Query: 241 IHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEE--LSVK 414 IH F+ + + + +N +D T +E AA +Y+G + + E+ L Sbjct: 69 IHEFNKKEGMSYKLGLNKFSDMTVEEFAA----KYTGVQVDAGAAVVTSAPDEQPVLVGD 124 Query: 415 LPPEHDWRLFGAVTPVKDQLVFGSC 489 PP DWR GAVTPVKDQ GSC Sbjct: 125 APPVWDWRDHGAVTPVKDQ---GSC 146 >UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like midgut cysteine proteinase - Tenebrio molitor (Yellow mealworm) Length = 330 Score = 44.4 bits (100), Expect = 0.002 Identities = 29/98 (29%), Positives = 48/98 (48%), Gaps = 2/98 (2%) Frame = +1 Query: 217 EHLQAVAQIHTFHNRANRGFTMSVNHLADRTDDELAAL--RGRRYSGPSPHGLPFPYSKS 390 +++ +A+ + + ++ ++N D + +E A RG+ P L PY S Sbjct: 54 DNVAKIAEHNAKFEKGEVTYSKAMNQFGDMSKEEFLAYVNRGKAQKPKHPENLRMPYVSS 113 Query: 391 RVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 + + L+ + DWR AV+ VKDQ GSCWSF T Sbjct: 114 K-KPLAASV----DWRS-NAVSEVKDQGQCGSCWSFST 145 >UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theileria|Rep: Cysteine protease, putative - Theileria parva Length = 612 Score = 44.4 bits (100), Expect = 0.002 Identities = 29/87 (33%), Positives = 41/87 (47%), Gaps = 2/87 (2%) Frame = +1 Query: 241 IHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLP 420 I T ++ N+ FTM D +D+EL P+ + YS++ E S K Sbjct: 211 IETHNSNHNKIFTMGYTSSTDSSDEELGRAVSSISYKPTQDEI---YSRASEEMSSSKKY 267 Query: 421 PE--HDWRLFGAVTPVKDQLVFGSCWS 495 P DWR G + PV+DQ GSCW+ Sbjct: 268 PGVIFDWREKGVILPVQDQKECGSCWA 294 >UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (Major excreted protein) (MEP) [Contains: Cathepsin L heavy chain; Cathepsin L light chain]; n=19; Eukaryota|Rep: Cathepsin L precursor (EC 3.4.22.15) (Major excreted protein) (MEP) [Contains: Cathepsin L heavy chain; Cathepsin L light chain] - Homo sapiens (Human) Length = 333 Score = 44.4 bits 
(100), Expect = 0.002 Identities = 27/82 (32%), Positives = 36/82 (43%) Frame = +1 Query: 253 HNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHD 432 + FTM++N D T +E + + G F E L + P D Sbjct: 66 YREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQ------EPLFYEAPRSVD 119 Query: 433 WRLFGAVTPVKDQLVFGSCWSF 498 WR G VTPVK+Q GSCW+F Sbjct: 120 WREKGYVTPVKNQGQCGSCWAF 141 >UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin L-like protease; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to cathepsin L-like protease - Nasonia vitripennis Length = 353 Score = 44.0 bits (99), Expect = 0.003 Identities = 22/43 (51%), Positives = 27/43 (62%), Gaps = 2/43 (4%) Frame = +1 Query: 376 PYSKSRVEELSVKLPPEH-DWRLFGAVTPVKDQ-LVFGSCWSF 498 P ++ S + PEH DWR GAVTPV+DQ L GSCW+F Sbjct: 118 PRGDEFIKPKSAENVPEHVDWRQRGAVTPVRDQGLTCGSCWAF 160 >UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1; Brugia malayi|Rep: Cathepsin F-like cysteine proteinase - Brugia malayi (Filarial nematode worm) Length = 461 Score = 44.0 bits (99), Expect = 0.003 Identities = 18/28 (64%), Positives = 20/28 (71%) Frame = +1 Query: 415 LPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 LP + DWR G VTPVKDQ GSCW+F Sbjct: 248 LPSKFDWRTEGVVTPVKDQGSCGSCWAF 275 >UniRef50_A6N590 Cluster: Extracellular cysteine protease 8; n=10; Eukaryota|Rep: Extracellular cysteine protease 8 - Tritrichomonas foetus (Trichomonas foetus) Length = 315 Score = 44.0 bits (99), Expect = 0.003 Identities = 32/88 (36%), Positives = 38/88 (43%) Frame = +1 Query: 229 AVAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELS 408 A A++ HN A FT +N A T E AL G R + K+ VE L Sbjct: 47 ANARLVKEHNAAKGKFTTGLNKFAAMTPSEYKALLGFRMDLAQRKAVKST-KKASVESL- 104 Query: 409 VKLPPEHDWRLFGAVTPVKDQLVFGSCW 492 DWR G V P+KDQ GSCW Sbjct: 105 -------DWREKGVVNPIKDQAQCGSCW 125 >UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; Entamoeba|Rep: Cysteine proteinase 2 precursor - Entamoeba histolytica Length = 315 Score = 44.0 bits (99), Expect = 0.003 Identities = 
17/38 (44%), Positives = 27/38 (71%) Frame = +1 Query: 391 RVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 +V+ L+++ P DWR G VTP++DQ GSC++FG+ Sbjct: 86 QVKYLNIQAPESVDWRKEGKVTPIRDQAQCGSCYTFGS 123 >UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens (Human) Length = 331 Score = 44.0 bits (99), Expect = 0.003 Identities = 25/83 (30%), Positives = 39/83 (46%), Gaps = 1/83 (1%) Frame = +1 Query: 253 HNRANRGFTMSVNHLADRTDDELAALRGR-RYSGPSPHGLPFPYSKSRVEELSVKLPPEH 429 H+ + + +NHL D T +E+ +L R + + + +R+ LP Sbjct: 66 HSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNITYKSNPNRI------LPDSV 119 Query: 430 DWRLFGAVTPVKDQLVFGSCWSF 498 DWR G VT VK Q G+CW+F Sbjct: 120 DWREKGCVTEVKYQGSCGACWAF 142 >UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 protein, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to MGC81823 protein, partial - Ornithorhynchus anatinus Length = 361 Score = 43.6 bits (98), Expect = 0.004 Identities = 20/30 (66%), Positives = 22/30 (73%), Gaps = 1/30 (3%) Frame = +1 Query: 418 PPEH-DWRLFGAVTPVKDQLVFGSCWSFGT 504 PPE DWR G VTPVKDQ GSCW+FG+ Sbjct: 190 PPEALDWRDHGYVTPVKDQGRCGSCWAFGS 219 >UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum|Rep: Falcipain 2 - Plasmodium falciparum Length = 484 Score = 43.6 bits (98), Expect = 0.004 Identities = 32/102 (31%), Positives = 45/102 (44%), Gaps = 8/102 (7%) Frame = +1 Query: 223 LQAVAQIHTFHNRANRGFTMSVNHLADRTDDELA-ALRGRRYSGPSPHGLPFPYSKSRVE 399 LQ +++ +N N + +N AD T E R S P + + + E Sbjct: 190 LQNAHKVNMHNNNKNSLYKKELNRFADLTYHEFKNKYLSLRSSKPLKNS-KYLLDQMNYE 248 Query: 400 ELSVKLPPE-------HDWRLFGAVTPVKDQLVFGSCWSFGT 504 E+ K E +DWRL VTPVKDQ GSCW+F + Sbjct: 249 EVIKKYRGEENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSS 290 >UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n=1; Toxocara canis|Rep: Cathepsin L-like cysteine proteinase - Toxocara canis (Canine roundworm) Length = 360 Score = 43.6 bits (98), Expect = 0.004 Identities = 16/31 (51%), Positives = 
20/31 (64%) Frame = +1 Query: 412 KLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 ++P DWR + VTPVK Q GSCW+F T Sbjct: 144 EIPDHFDWRPYNVVTPVKSQFKCGSCWAFAT 174 >UniRef50_Q22A69 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 330 Score = 43.6 bits (98), Expect = 0.004 Identities = 27/73 (36%), Positives = 34/73 (46%) Frame = +1 Query: 286 VNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVK 465 + AD T +E A + Y G P L +K + P DW GAVTPVK Sbjct: 74 ITQFADLTHEEFADM----YLGYKPQ-LRNSQAKVSLSSTPFTAPTAIDWTTKGAVTPVK 128 Query: 466 DQLVFGSCWSFGT 504 +Q GSCW+F T Sbjct: 129 NQGSCGSCWAFST 141 >UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18; Magnoliophyta|Rep: Thiol protease aleurain precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 358 Score = 43.6 bits (98), Expect = 0.004 Identities = 29/83 (34%), Positives = 38/83 (45%) Frame = +1 Query: 256 NRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDW 435 N+ + + VN AD T E R G + + +V E + LP DW Sbjct: 94 NKKGLSYKLGVNQFADLTWQEFQ----RTKLGAAQNCSATLKGSHKVTEAA--LPETKDW 147 Query: 436 RLFGAVTPVKDQLVFGSCWSFGT 504 R G V+PVKDQ GSCW+F T Sbjct: 148 REDGIVSPVKDQGGCGSCWTFST 170 >UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2; Arabidopsis thaliana|Rep: Putative cysteine proteinase - Arabidopsis thaliana (Mouse-ear cress) Length = 365 Score = 43.2 bits (97), Expect = 0.005 Identities = 34/91 (37%), Positives = 46/91 (50%), Gaps = 3/91 (3%) Frame = +1 Query: 241 IHTFHNRANRGFTMSVNHLAD-RTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELS-VK 414 I F+N N+ +T+ VN D +T++ LA G R + S L SR +S + Sbjct: 69 IENFNNMGNQSYTLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSRNWNMSDID 128 Query: 415 LPPEH-DWRLFGAVTPVKDQLVFGSCWSFGT 504 + E DWR GAVTPVK Q G+C F T Sbjct: 129 MEDESKDWRDEGAVTPVKYQ---GACPEFPT 156 >UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Bigelowiella natans|Rep: Digestive cysteine 
proteinase - Bigelowiella natans (Pedinomonas minutissima) (Chlorarachnion sp.(strain CCMP 621)) Length = 360 Score = 43.2 bits (97), Expect = 0.005 Identities = 18/31 (58%), Positives = 22/31 (70%) Frame = +1 Query: 406 SVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 +VK+ DWR F A+TPVKDQ GSCW+F Sbjct: 106 AVKVTDSFDWRDFNALTPVKDQGGCGSCWAF 136 >UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Longidorus elongatus Length = 358 Score = 43.2 bits (97), Expect = 0.005 Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 4/99 (4%) Frame = +1 Query: 214 AEHLQAVAQIHTFHNRANRGFTMSVNHLADRTDDELAA-LRGRRYSGPSPHGLPFPYSKS 390 A + + + Q + + F +S+N AD T+ E + G + P + Sbjct: 68 ASNHKVIEQHNIEYEAGQHSFALSLNKFADMTNAEFRQRMNGFKLPAKRKLAKSQPLKED 127 Query: 391 -RVEEL--SVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 + E+ +V +P DWR G VT VKDQ GSCW+F Sbjct: 128 GMIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCWAF 166 >UniRef50_Q239L8 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 323 Score = 43.2 bits (97), Expect = 0.005 Identities = 19/27 (70%), Positives = 19/27 (70%) Frame = +1 Query: 424 EHDWRLFGAVTPVKDQLVFGSCWSFGT 504 E DW GAVTPVKDQ GSCWSF T Sbjct: 126 EIDWTTKGAVTPVKDQGQCGSCWSFST 152 >UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin O precursor; n=2; Apocrita|Rep: PREDICTED: similar to Cathepsin O precursor - Apis mellifera Length = 374 Score = 42.7 bits (96), Expect = 0.006 Identities = 29/100 (29%), Positives = 46/100 (46%), Gaps = 4/100 (4%) Frame = +1 Query: 217 EHLQAVAQIHTFHNRANRGFT----MSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYS 384 +H++ + + + A G T MS N T +RG ++ S H S Sbjct: 87 QHIERMNGLRSSQESAYYGLTEFSDMSENEFLLHTLLPDLPIRGEKHMNASYHR-KHQIS 145 Query: 385 KSRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 R++ S+ +P DWR G +TPV+ Q G+CW+F T Sbjct: 146 IDRMKR-SISIPLRFDWRDKGVITPVRSQGSCGACWAFST 184 
>UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicotyledons|Rep: Cysteine proteinase - Mesembryanthemum crystallinum (Common ice plant) Length = 367 Score = 42.7 bits (96), Expect = 0.006 Identities = 26/83 (31%), Positives = 39/83 (46%), Gaps = 2/83 (2%) Frame = +1 Query: 256 NRANRGFTMSVNHLADRTDDELAAL--RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEH 429 N+ ++ + + +N D T E A + G F Y +V++P Sbjct: 78 NKMDKPYKLRLNQFGDLTPSEFARTYANSKIIEGTRNESGGFMYE-------NVEVPRSI 130 Query: 430 DWRLFGAVTPVKDQLVFGSCWSF 498 DWR+ GAVTPVK+Q G CW+F Sbjct: 131 DWRVKGAVTPVKNQGRCGGCWAF 153 >UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadidae|Rep: Cysteine protease - Tritrichomonas foetus (Trichomonas foetus) Length = 315 Score = 42.7 bits (96), Expect = 0.006 Identities = 26/82 (31%), Positives = 37/82 (45%) Frame = +1 Query: 253 HNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHD 432 HN + FT+S+N A T E + G + +G + K V+ + D Sbjct: 55 HNAGDSKFTVSLNKFAALTPSEYKVMLGYK-TGMKAEKVSRGMKKPNVDSI--------D 105 Query: 433 WRLFGAVTPVKDQLVFGSCWSF 498 WR G V +KDQ GSCW+F Sbjct: 106 WREKGVVNEIKDQAACGSCWAF 127 >UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheirus salmonis|Rep: Cysteine proteinase - Lepeophtheirus salmonis (salmon louse) Length = 372 Score = 42.7 bits (96), Expect = 0.006 Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 4/86 (4%) Frame = +1 Query: 253 HN-RANRGFTMSVNHLADRTDDELAALRGRRYSGPSP--HGLPFPYSKSRVEELSVK-LP 420 HN R + M +N +D TD+E + +Y G SP + ++ ++K LP Sbjct: 61 HNANPKRTWDMGINEFSDLTDEEFES----KYMGYSPMSSSAGLVTRTAAPKQGNIKDLP 116 Query: 421 PEHDWRLFGAVTPVKDQLVFGSCWSF 498 DWR G +T VK+Q GSCW F Sbjct: 117 ESVDWREKGVITDVKNQGSCGSCWVF 142 >UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 336 Score = 42.3 bits (95), Expect = 0.008 Identities = 17/32 (53%), Positives = 22/32 (68%) Frame = +1 Query: 409 
VKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 V+LP DWR +G ++ VKDQ GSCW+F T Sbjct: 123 VQLPASFDWRDYGILSDVKDQGQCGSCWAFST 154 >UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa|Rep: Os09g0497500 protein - Oryza sativa subsp. japonica (Rice) Length = 349 Score = 42.3 bits (95), Expect = 0.008 Identities = 31/96 (32%), Positives = 48/96 (50%), Gaps = 7/96 (7%) Frame = +1 Query: 232 VAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSV 411 V + TF++ +N G+ ++ N AD T++E A + G PH S + ++++ Sbjct: 59 VELVETFNSMSN-GYKLADNKFADLTNEEFRA----KMLGFRPHVTIPQISNTCSADIAM 113 Query: 412 K-------LPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 LP DWR GAV VK+Q GSCW+F Sbjct: 114 PGESSDDILPKSVDWRKKGAVVEVKNQGDCGSCWAF 149 >UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin L-like cysteine proteinase precursor - Acanthoscelides obtectus (Bean weevil) Length = 321 Score = 42.3 bits (95), Expect = 0.008 Identities = 27/93 (29%), Positives = 41/93 (44%) Frame = +1 Query: 220 HLQAVAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVE 399 +L+ + + + ++ F M +N D T +E R + P +P P Sbjct: 50 NLRTIEEHNERYHNGEETFEMGINQFGDMTQEEFK----RMLALQKPQ-MPLPRGDEVSF 104 Query: 400 ELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 + +P DWR GAVT VK Q GSCW+F Sbjct: 105 DNVNDIPKTVDWREKGAVTEVKKQGNCGSCWAF 137 >UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platyhelminthes|Rep: Cathepsin L-like proteinase - Echinococcus multilocularis Length = 338 Score = 42.3 bits (95), Expect = 0.008 Identities = 28/76 (36%), Positives = 38/76 (50%), Gaps = 1/76 (1%) Frame = +1 Query: 274 FTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEH-DWRLFGA 450 ++ ++N AD T +E A P G+ S VE + L P+ DWR G Sbjct: 75 YSTALNAFADLTLEEFAEKYLTLKQTPM-EGIWQDMSTQYVERPTRMLVPDSIDWRKKGL 133 Query: 451 VTPVKDQLVFGSCWSF 498 VTP+KDQ GSCW+F Sbjct: 134 VTPIKDQGDCGSCWAF 149 >UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor; n=17; Magnoliophyta|Rep: Thiol protease aleurain-like precursor - 
Arabidopsis thaliana (Mouse-ear cress) Length = 358 Score = 42.3 bits (95), Expect = 0.008 Identities = 28/84 (33%), Positives = 41/84 (48%), Gaps = 1/84 (1%) Frame = +1 Query: 256 NRANRGFTMSVNHLADRTDDELAALRGRRYS-GPSPHGLPFPYSKSRVEELSVKLPPEHD 432 N+ + +S+N AD T E +RY G + + ++ E +V P D Sbjct: 94 NKKGLSYKLSLNQFADLTWQEF-----QRYKLGAAQNCSATLKGSHKITEATV--PDTKD 146 Query: 433 WRLFGAVTPVKDQLVFGSCWSFGT 504 WR G V+PVK+Q GSCW+F T Sbjct: 147 WREDGIVSPVKEQGHCGSCWTFST 170 >UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lamblia ATCC 50803|Rep: GLP_26_47548_45815 - Giardia lamblia ATCC 50803 Length = 577 Score = 41.9 bits (94), Expect = 0.011 Identities = 16/29 (55%), Positives = 20/29 (68%) Frame = +1 Query: 415 LPPEHDWRLFGAVTPVKDQLVFGSCWSFG 501 LP E DWR+ G + KDQ+ GSCW+FG Sbjct: 344 LPQELDWRVRGIMNMAKDQVACGSCWTFG 372 >UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; Eukaryota|Rep: Cysteine proteinase 1 precursor - Zea mays (Maize) Length = 371 Score = 41.9 bits (94), Expect = 0.011 Identities = 18/28 (64%), Positives = 20/28 (71%) Frame = +1 Query: 415 LPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 LP + DWR GAV PVK+Q GSCWSF Sbjct: 137 LPDDFDWRDHGAVGPVKNQGSCGSCWSF 164 >UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: Cysteine proteinase - Paragonimus westermani Length = 272 Score = 41.5 bits (93), Expect = 0.015 Identities = 20/39 (51%), Positives = 24/39 (61%), Gaps = 1/39 (2%) Frame = +1 Query: 391 RVEELSVKLPPEH-DWRLFGAVTPVKDQLVFGSCWSFGT 504 RV +K PE DWR GAVT V++Q GSCW+F T Sbjct: 45 RVRPTGLKAAPERIDWRAKGAVTAVENQGSCGSCWAFST 83 >UniRef50_Q22LI1 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 987 Score = 41.5 bits (93), Expect = 0.015 Identities = 30/89 (33%), Positives = 41/89 (46%), Gaps = 7/89 (7%) Frame = +1 Query: 253 HN-RANRGFTMSVNHLADRTDDELA------ALRGRRYSGPSPHGLPFPYSKSRVEELSV 411 HN ++ F + +N A T E A ++ + P P 
P P+ + +V Sbjct: 64 HNYNSSNTFQLGLNEYAHMTSQEFAEVFLTPSISKSQQKQPKPKPQPQPHPNNSTNT-TV 122 Query: 412 KLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 + P DWR GAVT VK Q GSCWSF Sbjct: 123 TITPI-DWRNKGAVTSVKRQGKCGSCWSF 150 >UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus|Rep: Cathepsin L - Aphrocallistes vastus Length = 329 Score = 41.5 bits (93), Expect = 0.015 Identities = 28/84 (33%), Positives = 38/84 (45%), Gaps = 3/84 (3%) Frame = +1 Query: 256 NRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVK---LPPE 426 N + ++ N AD T+ E + Y G + +V + +K LP Sbjct: 63 NAEGHSYKLAANQFADLTNLEYRQI----YLGYDNEARLSRKREGKVFQRKMKDEDLPTT 118 Query: 427 HDWRLFGAVTPVKDQLVFGSCWSF 498 DWR G VTPVK+Q GSCWSF Sbjct: 119 VDWRSKGVVTPVKNQGQCGSCWSF 142 >UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine proteinase precursor - Heterodera glycines (Soybean cyst nematode worm) Length = 353 Score = 41.5 bits (93), Expect = 0.015 Identities = 26/75 (34%), Positives = 37/75 (49%) Frame = +1 Query: 274 FTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAV 453 F ++ NHL T + +RG + ++ + + S LP + DWR GAV Sbjct: 93 FKVAPNHLMHFTPAQYNRIRGLQMRSNRQR-----HNMATLAGNSSTLPEKLDWREKGAV 147 Query: 454 TPVKDQLVFGSCWSF 498 T VKDQ GSCW+F Sbjct: 148 TEVKDQGDCGSCWAF 162 >UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Rep: Cathepsin R precursor - Mus musculus (Mouse) Length = 334 Score = 41.5 bits (93), Expect = 0.015 Identities = 31/86 (36%), Positives = 40/86 (46%), Gaps = 4/86 (4%) Frame = +1 Query: 253 HNRAN----RGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLP 420 HNR N GFTM +N D+TD+E + G S + E S+ LP Sbjct: 62 HNRENSLGKNGFTMKMNEFGDQTDEEFRKMMIEISVWTHREGK----SIMKREAGSI-LP 116 Query: 421 PEHDWRLFGAVTPVKDQLVFGSCWSF 498 DWR G VTPV+ Q +CW+F Sbjct: 117 KFVDWRKKGYVTPVRRQGDCDACWAF 142 >UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|Rep: Cathepsin L precursor - Schistosoma mansoni (Blood fluke) Length = 319 Score = 41.5 bits (93), 
Expect = 0.015 Identities = 17/30 (56%), Positives = 21/30 (70%) Frame = +1 Query: 415 LPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 +P DWR GAVT VK+Q + GSCW+F T Sbjct: 105 IPKNFDWREKGAVTEVKNQGMCGSCWAFST 134 >UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegleria fowleri|Rep: Cysteine proteinase homolog - Naegleria fowleri Length = 347 Score = 41.1 bits (92), Expect = 0.020 Identities = 17/29 (58%), Positives = 19/29 (65%) Frame = +1 Query: 418 PPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 P DWR GAVT VK+Q GSCW+F T Sbjct: 123 PTSFDWRQHGAVTRVKNQGACGSCWTFST 151 >UniRef50_Q24HI2 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 356 Score = 41.1 bits (92), Expect = 0.020 Identities = 27/102 (26%), Positives = 51/102 (50%), Gaps = 4/102 (3%) Frame = +1 Query: 211 EAEHL-QAVAQIHTFHNRANRGFTMSVNH-LADRTDDELAA--LRGRRYSGPSPHGLPFP 378 E +H ++V ++ + + N +T+S++ A +D++ L + S + L P Sbjct: 57 EFQHFKESVRRVREHNKKVNATYTLSIDSPFAFMSDEQFVTEYLGSQDCSATAELTLKKP 116 Query: 379 YSKSRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 + +V++P +W+ V+PVKDQ GSCW+F T Sbjct: 117 MKIQNKK--NVQVPESINWKDLNKVSPVKDQQNCGSCWTFST 156 >UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; n=16; Chrysomelidae|Rep: Digestive cysteine protease intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 326 Score = 41.1 bits (92), Expect = 0.020 Identities = 27/83 (32%), Positives = 40/83 (48%), Gaps = 1/83 (1%) Frame = +1 Query: 253 HNRANRGFTMSVNHLADRTDDELA-ALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEH 429 +++ + + V AD T +E L+G+ + P + P + E+L V P Sbjct: 61 YDKGEETYLLGVTRFADLTHEEFKDILKGQIKNKPRLNATPTVFP----EDLEV--PDSI 114 Query: 430 DWRLFGAVTPVKDQLVFGSCWSF 498 DW GAV VKDQ GSCW+F Sbjct: 115 DWTEKGAVLEVKDQNPCGSCWAF 137 >UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; Dictyostelium discoideum|Rep: Cysteine proteinase 7 precursor - Dictyostelium discoideum (Slime mold) 
Length = 460 Score = 41.1 bits (92), Expect = 0.020 Identities = 21/49 (42%), Positives = 25/49 (51%), Gaps = 2/49 (4%) Frame = +1 Query: 364 GLPFPYSKSRVEELS--VKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 G PF S + E + DWR GAVTP+K+Q G CWSF T Sbjct: 91 GTPFDASSLEMTESDKIFDASAQVDWRTQGAVTPIKNQGQCGGCWSFST 139 >UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 precursor; n=2; Arabidopsis thaliana|Rep: Probable cysteine proteinase At3g43960 precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 376 Score = 41.1 bits (92), Expect = 0.020 Identities = 32/89 (35%), Positives = 44/89 (49%), Gaps = 2/89 (2%) Frame = +1 Query: 238 QIHTFHNRANRGFTMSVNHLADRTDDEL-AALRGRRYSGPSPHGLPFPYSKSRVEELSVK 414 +I ++ NR + +N +D T DE A+ G + S + Y + +E V Sbjct: 71 RIEEHNSDPNRSYERGLNKFSDLTADEFQASYLGGKMEKKSLSDVAERY---QYKEGDV- 126 Query: 415 LPPEHDWRLFGAVTP-VKDQLVFGSCWSF 498 LP E DWR GAV P VK Q GSCW+F Sbjct: 127 LPDEVDWRERGAVVPRVKRQGECGSCWAF 155 >UniRef50_P56202 Cluster: Cathepsin W precursor; n=15; Eutheria|Rep: Cathepsin W precursor - Homo sapiens (Human) Length = 376 Score = 41.1 bits (92), Expect = 0.020 Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 2/72 (2%) Frame = +1 Query: 286 VNHLADRTDDELAALRG-RRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWR-LFGAVTP 459 V +D T++E L G RR +G G+P + R EE +P DWR + GA++P Sbjct: 88 VTPFSDLTEEEFGQLYGYRRAAG----GVPSMGREIRSEEPEESVPFSCDWRKVAGAISP 143 Query: 460 VKDQLVFGSCWS 495 +KDQ CW+ Sbjct: 144 IKDQKNCNCCWA 155 >UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep: Viral cathepsin - Cydia pomonella granulosis virus (CpGV) (Cydia pomonellagranulovirus) Length = 333 Score = 41.1 bits (92), Expect = 0.020 Identities = 18/36 (50%), Positives = 22/36 (61%) Frame = +1 Query: 397 EELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 +E LP DWR VTPVK+Q+ GSCW+F T Sbjct: 118 DEPQALLPETLDWRDKHGVTPVKNQMECGSCWAFST 153 >UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 21 SCAF14577, whole genome 
shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 406 Score = 40.7 bits (91), Expect = 0.026 Identities = 16/36 (44%), Positives = 23/36 (63%) Frame = +1 Query: 397 EELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 E+L + PP DWR G V+PV++Q SCW+F + Sbjct: 149 EKLGFETPPSVDWRKAGLVSPVQNQGFCNSCWAFSS 184 >UniRef50_Q0AY53 Cluster: Putative uncharacterized protein; n=1; Syntrophomonas wolfei subsp. wolfei str. Goettingen|Rep: Putative uncharacterized protein - Syntrophomonas wolfei subsp. wolfei (strain Goettingen) Length = 475 Score = 40.7 bits (91), Expect = 0.026 Identities = 23/63 (36%), Positives = 33/63 (52%), Gaps = 3/63 (4%) Frame = +1 Query: 328 LRGRRYSGPSPHGLPFPYSKSRV---EELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 L G+ G PH L + K + E ++L +D R G +TPVKDQ G+CW+F Sbjct: 37 LAGQYQPGFIPHPLNLSHLKGQKIFSETKLLRLSSSYDLRKEGRLTPVKDQGPAGTCWAF 96 Query: 499 GTW 507 T+ Sbjct: 97 ATY 99 >UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; Roseiflexus|Rep: Peptidase C1A, papain precursor - Roseiflexus sp. RS-1 Length = 1202 Score = 40.7 bits (91), Expect = 0.026 Identities = 18/30 (60%), Positives = 20/30 (66%) Frame = +1 Query: 415 LPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 LP +W GA TPVKDQ V GSCW+F T Sbjct: 169 LPAAFNWCDQGACTPVKDQGVCGSCWAFAT 198 >UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8; Magnoliophyta|Rep: Os12g0273800 protein - Oryza sativa subsp. 
japonica (Rice) Length = 504 Score = 40.7 bits (91), Expect = 0.026 Identities = 30/95 (31%), Positives = 43/95 (45%), Gaps = 5/95 (5%) Frame = +1 Query: 202 AREEAEHLQA----VAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGL 369 A E+A L+ VA I +F+ + + VN AD T +E A +P+ Sbjct: 58 AAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSKGFSTPNNG 117 Query: 370 PFPYSKSRVEELSVK-LPPEHDWRLFGAVTPVKDQ 471 + + E +S LP DWR GAVT +KDQ Sbjct: 118 VRVSTGFKYENVSADALPASVDWRTKGAVTRIKDQ 152 >UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Cathepsin - Geodia cydonium (Sponge) Length = 322 Score = 40.7 bits (91), Expect = 0.026 Identities = 28/77 (36%), Positives = 38/77 (49%), Gaps = 1/77 (1%) Frame = +1 Query: 271 GFTMSVNHLADRTDDELAA-LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFG 447 G+T+++N AD E + G R + G P E++S LP DWR G Sbjct: 59 GYTVAMNEFADLDPREFVSHYNGLRRRPHTSSGEPCTLG----EDVSA-LPTTVDWRTKG 113 Query: 448 AVTPVKDQLVFGSCWSF 498 VT VK+Q GSCW+F Sbjct: 114 YVTGVKNQGQCGSCWAF 130 >UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lamblia ATCC 50803|Rep: GLP_163_69918_68548 - Giardia lamblia ATCC 50803 Length = 456 Score = 40.7 bits (91), Expect = 0.026 Identities = 17/31 (54%), Positives = 21/31 (67%) Frame = +1 Query: 412 KLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 ++P +D R G PVKDQ V GSCW+FGT Sbjct: 76 EIPTSYDLREAGLQVPVKDQGVCGSCWAFGT 106 >UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin L-like proteinase" precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 315 Score = 40.7 bits (91), Expect = 0.026 Identities = 26/96 (27%), Positives = 45/96 (46%) Frame = +1 Query: 217 EHLQAVAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRV 396 ++L+ + + + + + ++VN AD + E A+ R+ + + V Sbjct: 49 DNLKKIEEHNAKYESGEETYYLAVNKFADWSSAEFQAMLARQMANKPKQS----FIAKHV 104 Query: 397 EELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 + +V+ E DWR AV VKDQ GSCW+F T Sbjct: 105 ADPNVQAVEEVDWR-DSAVLGVKDQGQCGSCWAFST 139 >UniRef50_Q22W19 Cluster: 
Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 40.7 bits (91), Expect = 0.026 Identities = 18/37 (48%), Positives = 22/37 (59%) Frame = +1 Query: 394 VEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 + L + P DWR AVTPVK+Q GSCW+F T Sbjct: 116 IYSLKGDVAPSIDWRQKNAVTPVKNQGQCGSCWAFST 152 >UniRef50_O16454 Cluster: Temporarily assigned gene name protein 196; n=4; Bilateria|Rep: Temporarily assigned gene name protein 196 - Caenorhabditis elegans Length = 477 Score = 40.7 bits (91), Expect = 0.026 Identities = 18/30 (60%), Positives = 20/30 (66%) Frame = +1 Query: 415 LPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 LP DWR GAVT VK+Q GSCW+F T Sbjct: 264 LPESFDWREKGAVTQVKNQGNCGSCWAFST 293 >UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: Cathepsin L - Kudoa thyrsites Length = 300 Score = 40.7 bits (91), Expect = 0.026 Identities = 29/90 (32%), Positives = 44/90 (48%), Gaps = 4/90 (4%) Frame = +1 Query: 241 IHTFHNRANRGFTMSVNHLADRTDDELAA---LRGRRYSGPSP-HGLPFPYSKSRVEELS 408 IH F+ NHL+ + +E A L+ + +P HG+ P ++ +++ Sbjct: 42 IHNFNLHNTHYHYCRHNHLSHWSHEEYMAWLTLKPKLPVVSTPTHGIT-P-KETATKDIK 99 Query: 409 VKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 LP DW+ G VT VK+Q GSCWSF Sbjct: 100 STLPSSVDWKALGKVTSVKNQGHCGSCWSF 129 >UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain] - Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina) Length = 339 Score = 40.7 bits (91), Expect = 0.026 Identities = 18/32 (56%), Positives = 21/32 (65%) Frame = +1 Query: 409 VKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 V +P DWR GAVT VKDQ GSCW+F + Sbjct: 120 VTVPKSVDWREHGAVTGVKDQGHCGSCWAFSS 151 >UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza sativa|Rep: Os09g0381400 protein - Oryza sativa subsp. 
japonica (Rice) Length = 362 Score = 40.3 bits (90), Expect = 0.034 Identities = 34/115 (29%), Positives = 48/115 (41%), Gaps = 9/115 (7%) Frame = +1 Query: 187 RERPGAREEAEHLQAVAQIHTFHNRAN-RG---FTMSVNHLADRTDDELAALRGRRYSGP 354 R P A E + + F + N RG + ++ N AD T++E A Y+G Sbjct: 60 RSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEEEFLATYTGYYAGD 119 Query: 355 SPHGLPFPYSKSRVEELS----VKLPPEHDWRLFGAVTPVKDQL-VFGSCWSFGT 504 P + + + S V +P DWR GAV P K Q SCW+F T Sbjct: 120 GPVDDSVITTGAGDVDASFSYRVDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVT 174 >UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus tauri|Rep: Cysteine protease-1 - Ostreococcus tauri Length = 430 Score = 40.3 bits (90), Expect = 0.034 Identities = 20/40 (50%), Positives = 23/40 (57%) Frame = +1 Query: 385 KSRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 K+ E SV P DW GAVTP K+Q GSCW+F T Sbjct: 191 KASWEYASVDPPEAIDWVELGAVTPPKNQGQCGSCWAFST 230 >UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcoptes scabiei type hominis|Rep: Cathepsin L-like protease - Sarcoptes scabiei type hominis Length = 245 Score = 40.3 bits (90), Expect = 0.034 Identities = 16/28 (57%), Positives = 18/28 (64%) Frame = +1 Query: 415 LPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 LP E DW L V P+KDQ GSCW+F Sbjct: 120 LPDEVDWTLKNVVAPIKDQKQCGSCWAF 147 >UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypanosoma cruzi|Rep: Cysteine protease, putative - Trypanosoma cruzi Length = 434 Score = 40.3 bits (90), Expect = 0.034 Identities = 23/90 (25%), Positives = 40/90 (44%), Gaps = 2/90 (2%) Frame = +1 Query: 232 VAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSV 411 +A++ F+ R + + +N +D T +E A R + P P ++ + Sbjct: 67 LAKVRAFNGALGRSYRLGINKFSDMTKEEFNAKFNGRVAAPQSTQSP---QRAPYKRTKA 123 Query: 412 KLPPEHDWRLFG--AVTPVKDQLVFGSCWS 495 P +W+ +TPVKDQ GSCW+ Sbjct: 124 TFPEALNWQEAKNPVLTPVKDQGSCGSCWA 153 >UniRef50_Q23FQ5 Cluster: Papain family cysteine protease containing protein; n=4; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease 
containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 40.3 bits (90), Expect = 0.034 Identities = 22/89 (24%), Positives = 40/89 (44%) Frame = +1 Query: 238 QIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKL 417 ++ T +T+S+N +D + +E ++ S + + +V Sbjct: 66 KLKTHEKNTEATYTVSLNQFSDYSQEEFVQRILNKHISRSDADIQKEQEPNGNLRKAVNY 125 Query: 418 PPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 P DWR GA+ P+++Q GSC +FGT Sbjct: 126 PTSVDWRNSGALNPIQNQGQCGSCAAFGT 154 >UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-like cysteine peptidase; n=3; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L or K-like cysteine peptidase - Trichomonas vaginalis G3 Length = 320 Score = 40.3 bits (90), Expect = 0.034 Identities = 26/85 (30%), Positives = 39/85 (45%), Gaps = 2/85 (2%) Frame = +1 Query: 256 NRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHG--LPFPYSKSRVEELSVKLPPEH 429 N NR + +S+N + T+ E +L G + S + L P SK E Sbjct: 56 NSINRNYRLSLNQFSFLTNSEYKSLLGGKVSSKNNDDSHLFSPQSKKSSEVT-------F 108 Query: 430 DWRLFGAVTPVKDQLVFGSCWSFGT 504 DWR G + P+++Q G CW+F T Sbjct: 109 DWRTKGIINPIRNQGQCGLCWAFST 133 >UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 293 Score = 40.3 bits (90), Expect = 0.034 Identities = 23/84 (27%), Positives = 38/84 (45%) Frame = +1 Query: 253 HNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHD 432 HN+A + + N A T E ++ + P L + + ++ +P E D Sbjct: 30 HNKAGSSYKLEGNRFAAFTPAEYRSMLSK------PKSLAKKFESAPLKHKEGAIPAEFD 83 Query: 433 WRLFGAVTPVKDQLVFGSCWSFGT 504 WR G VTPV+ Q G+ W+F + Sbjct: 84 WRTKGVVTPVRYQEGCGAGWAFAS 107 >UniRef50_Q23VA1 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 397 Score = 39.9 bits (89), Expect = 0.045 Identities = 19/41 (46%), Positives = 24/41 
(58%) Frame = +1 Query: 376 PYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 P V +L V +P DWR+ G V+PVKDQ G CW+F Sbjct: 168 PNPNPPVNQLKV-VPQSVDWRIQGKVSPVKDQGRCGCCWAF 207 >UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis|Rep: Cysteine protease 2 - Babesia bovis Length = 445 Score = 39.9 bits (89), Expect = 0.045 Identities = 16/23 (69%), Positives = 18/23 (78%) Frame = +1 Query: 430 DWRLFGAVTPVKDQLVFGSCWSF 498 DWR AVTPVKDQ + GSCW+F Sbjct: 241 DWRRADAVTPVKDQGMCGSCWAF 263 >UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 234 Score = 39.9 bits (89), Expect = 0.045 Identities = 17/37 (45%), Positives = 24/37 (64%) Frame = +1 Query: 394 VEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 +E + +P E D+R GAV +KDQ GSCW+FG+ Sbjct: 11 LETIVGDIPDEIDYRTKGAVNEIKDQKHCGSCWAFGS 47 >UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae|Rep: Cysteine proteinase - Hypera postica (alfalfa weevil) Length = 324 Score = 39.5 bits (88), Expect = 0.060 Identities = 26/94 (27%), Positives = 42/94 (44%) Frame = +1 Query: 217 EHLQAVAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRV 396 ++++A+ + + + + +N D + +E + S P Y K+ V Sbjct: 52 DNVRAIEAHNALYEQGKVSYKKGINKFTDMSQEEFKTMLTLSASR-KPTLETTSYVKTGV 110 Query: 397 EELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 E +P DWR G VT VKDQ GSCW+F Sbjct: 111 E-----IPSSVDWRKEGRVTGVKDQGDCGSCWAF 139 >UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n=21; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Globodera pallida Length = 379 Score = 39.5 bits (88), Expect = 0.060 Identities = 28/79 (35%), Positives = 40/79 (50%), Gaps = 2/79 (2%) Frame = +1 Query: 274 FTMSVNHLADRTDDELAALRG-RRYSGPSPHGLPFPYSKSRVEELSV-KLPPEHDWRLFG 447 F + NH+AD E L G RR G + + + + ++V LP DWR G Sbjct: 116 FRVGENHIADLPFSEYKKLNGYRRLLGDNLRR----NASTFLAPMNVGDLPESVDWRDKG 171 Query: 448 AVTPVKDQLVFGSCWSFGT 504 VT VK+Q + 
GSCW+F + Sbjct: 172 WVTEVKNQGMCGSCWAFSS 190 >UniRef50_Q22RR8 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 39.5 bits (88), Expect = 0.060 Identities = 20/41 (48%), Positives = 26/41 (63%), Gaps = 2/41 (4%) Frame = +1 Query: 382 SKSRVEELSVKLPPEH--DWRLFGAVTPVKDQLVFGSCWSF 498 S + +L++KL + DW GAVTPVKDQ GSCW+F Sbjct: 112 SNPKNAQLNMKLGDDIIIDWTKKGAVTPVKDQEQCGSCWAF 152 >UniRef50_Q22AB1 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 344 Score = 39.5 bits (88), Expect = 0.060 Identities = 16/30 (53%), Positives = 18/30 (60%) Frame = +1 Query: 415 LPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 LP DWR G +TP K Q GSCW+F T Sbjct: 131 LPESFDWRDKGIITPAKFQNTCGSCWTFAT 160 >UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteinase B; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like tick cysteine proteinase B - Haemaphysalis longicornis (Bush tick) Length = 332 Score = 39.5 bits (88), Expect = 0.060 Identities = 17/27 (62%), Positives = 19/27 (70%) Frame = +1 Query: 415 LPPEHDWRLFGAVTPVKDQLVFGSCWS 495 LP DWR GAVTPVK+Q GSCW+ Sbjct: 117 LPKTMDWRKKGAVTPVKNQGQCGSCWA 143 >UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; Leishmania|Rep: Cysteine proteinase 1 precursor - Leishmania pifanoi Length = 354 Score = 39.5 bits (88), Expect = 0.060 Identities = 16/23 (69%), Positives = 19/23 (82%) Frame = +1 Query: 430 DWRLFGAVTPVKDQLVFGSCWSF 498 DWR GAVTPVK+Q + GSCW+F Sbjct: 134 DWRDKGAVTPVKNQGLCGSCWAF 156 >UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovirus|Rep: Viral cathepsin - Spodoptera litura multicapsid nucleopolyhedrovirus (SpltMNPV) Length = 337 Score = 39.5 bits (88), Expect = 0.060 Identities = 16/31 (51%), Positives = 19/31 (61%) Frame = +1 Query: 406 
SVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 S + P DWR VT VK+Q V GSCW+F Sbjct: 123 SARTPESFDWRKLNKVTKVKEQGVCGSCWAF 153 >UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin O; n=1; Danio rerio|Rep: PREDICTED: similar to cathepsin O - Danio rerio Length = 327 Score = 39.1 bits (87), Expect = 0.079 Identities = 18/42 (42%), Positives = 23/42 (54%) Frame = +1 Query: 373 FPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 F SKS ++ + PP DWR G V PV +Q G CW+F Sbjct: 107 FDQSKSEIK-VKANNPPRFDWRDHGVVGPVHNQGSCGGCWAF 147 >UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays (Maize) Length = 493 Score = 39.1 bits (87), Expect = 0.079 Identities = 29/79 (36%), Positives = 38/79 (48%), Gaps = 3/79 (3%) Frame = +1 Query: 271 GFTMSVNHLADRTDDELAA--LRGRRYSGPSPHGLPFPYSKSRVEELS-VKLPPEHDWRL 441 GF + + AD T +E A L G R + G+ + R L+ +LP DWR Sbjct: 116 GFRLGLTRFADLTLEEYRARLLLGSRGRNGTAVGV---VGRRRYLPLAGEQLPDAVDWRE 172 Query: 442 FGAVTPVKDQLVFGSCWSF 498 GAV VKDQ G CW+F Sbjct: 173 RGAVAEVKDQGQCGGCWAF 191 >UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophyta|Rep: Cysteine protease Cp5 - Actinidia deliciosa (Kiwi) Length = 509 Score = 39.1 bits (87), Expect = 0.079 Identities = 16/29 (55%), Positives = 19/29 (65%) Frame = +1 Query: 418 PPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 P DWR +G VT VKDQ GSCW+F + Sbjct: 148 PTSLDWRKYGIVTGVKDQGDCGSCWAFSS 176 >UniRef50_A5HC50 Cluster: Cathepsin W; n=2; Theria|Rep: Cathepsin W - Oryctolagus cuniculus (Rabbit) Length = 242 Score = 39.1 bits (87), Expect = 0.079 Identities = 22/71 (30%), Positives = 34/71 (47%), Gaps = 1/71 (1%) Frame = +1 Query: 286 VNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWR-LFGAVTPV 462 V +D T++E L G + + G+P + EE LPP DWR G ++P+ Sbjct: 29 VTRFSDLTEEEFGQLYGHQRAAG---GVPSVGREVGSEERGTPLPPTCDWRKAAGVISPI 85 Query: 463 KDQLVFGSCWS 495 +DQ CW+ Sbjct: 86 RDQRDCQCCWA 96 >UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease Gip1p; n=4; Tetrahymena thermophila|Rep: 
Granule-biosynthesis induced protease Gip1p - Tetrahymena thermophila Length = 345 Score = 39.1 bits (87), Expect = 0.079 Identities = 16/30 (53%), Positives = 19/30 (63%) Frame = +1 Query: 415 LPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 LP DWR G + PVK+Q GSCW+F T Sbjct: 133 LPLSVDWRKRGVLNPVKNQGTCGSCWTFAT 162 >UniRef50_Q23H10 Cluster: Papain family cysteine protease containing protein; n=14; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 336 Score = 39.1 bits (87), Expect = 0.079 Identities = 18/39 (46%), Positives = 25/39 (64%) Frame = +1 Query: 382 SKSRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 +++++ S+ L DWR GAVT VK+Q GSCWSF Sbjct: 116 NETQLSSNSLTLADSIDWRTKGAVTSVKNQGGCGSCWSF 154 >UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, whole genome shotgun sequence; n=7; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_22, whole genome shotgun sequence - Paramecium tetraurelia Length = 350 Score = 39.1 bits (87), Expect = 0.079 Identities = 28/68 (41%), Positives = 34/68 (50%) Frame = +1 Query: 301 DRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQLVF 480 D TD+E AA P +P K++ E +V P DWR GAV VKDQ Sbjct: 111 DLTDEEFAATYLTLKVNPDDLEVP----KAQFE--NVNATPI-DWRTRGAVNKVKDQGQC 163 Query: 481 GSCWSFGT 504 GSCW+F T Sbjct: 164 GSCWAFST 171 >UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 precursor; n=4; Schizophora|Rep: Putative cysteine proteinase CG12163 precursor - Drosophila melanogaster (Fruit fly) Length = 614 Score = 39.1 bits (87), Expect = 0.079 Identities = 17/29 (58%), Positives = 20/29 (68%) Frame = +1 Query: 412 KLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 +LP E DWR AVT VK+Q GSCW+F Sbjct: 393 ELPKEFDWRQKDAVTQVKNQGSCGSCWAF 421 >UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryota|Rep: Cysteine proteinase GP-I - Zingiber officinale (Ginger) Length = 221 Score = 39.1 bits (87), Expect = 0.079 Identities = 17/28 (60%), Positives = 19/28 (67%) Frame = +1 Query: 
415 LPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 LP DWR GAV PVK+Q GSCW+F Sbjct: 3 LPDSIDWREKGAVVPVKNQGGCGSCWAF 30 >UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: hypothetical protein, partial - Ornithorhynchus anatinus Length = 224 Score = 38.7 bits (86), Expect = 0.10 Identities = 16/23 (69%), Positives = 18/23 (78%) Frame = +1 Query: 430 DWRLFGAVTPVKDQLVFGSCWSF 498 DWR GAVTPVK+Q GSCW+F Sbjct: 136 DWRKEGAVTPVKNQGDCGSCWAF 158 Score = 32.7 bits (71), Expect = 6.9 Identities = 12/31 (38%), Positives = 19/31 (61%) Frame = +2 Query: 146 DEFERFKVKLQKQYASDLEHEKRLNIFRQSL 238 D+F+ F+++ K Y EH +R IF Q+L Sbjct: 45 DKFKEFQIRYNKSYEDQAEHARRFEIFVQNL 75 >UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 280 Score = 38.7 bits (86), Expect = 0.10 Identities = 16/28 (57%), Positives = 19/28 (67%) Frame = +1 Query: 415 LPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 LP + DWR G VT VK+Q GSCW+F Sbjct: 68 LPQQFDWRNLGKVTQVKNQGNCGSCWAF 95 >UniRef50_Q23H15 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 370 Score = 38.7 bits (86), Expect = 0.10 Identities = 20/38 (52%), Positives = 22/38 (57%) Frame = +1 Query: 385 KSRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 +S E S L DWR GAVT VK+Q GSCWSF Sbjct: 152 RSLTEFKSPTLAASIDWRTKGAVTSVKNQGNCGSCWSF 189 >UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Cathepsin K - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 333 Score = 38.3 bits (85), Expect = 0.14 Identities = 24/77 (31%), Positives = 37/77 (48%) Frame = +1 Query: 274 FTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAV 453 + + +NH D T +E+A + G P + ++ KLP D+R G V Sbjct: 75 
YDLGMNHFGDMTLEEVA----EKVMGLQMPMYRDPANTFVPDDRVGKLPKSIDYRKLGYV 130 Query: 454 TPVKDQLVFGSCWSFGT 504 T VK+Q GSCW+F + Sbjct: 131 TSVKNQGSCGSCWAFSS 147 >UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cathepsin L; n=4; Danio rerio|Rep: Novel protein similar to vertebrate cathepsin L - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 334 Score = 38.3 bits (85), Expect = 0.14 Identities = 27/79 (34%), Positives = 36/79 (45%), Gaps = 2/79 (2%) Frame = +1 Query: 274 FTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKL--PPEHDWRLFG 447 F M++N D T E L G + G + +++ L+ K D+R G Sbjct: 71 FKMAMNKYGDLTSVEYKRLLGSKIKGTGNR--KGKITSAQMLRLNAKRLGVTNIDYRAKG 128 Query: 448 AVTPVKDQLVFGSCWSFGT 504 VT VKDQ GSCWSF T Sbjct: 129 YVTEVKDQGYCGSCWSFST 147 >UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel protein - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 328 Score = 38.3 bits (85), Expect = 0.14 Identities = 24/76 (31%), Positives = 38/76 (50%), Gaps = 1/76 (1%) Frame = +1 Query: 274 FTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVK-LPPEHDWRLFGA 450 +T+ +N L+D T DE+ + G FP + S++ LP +W G Sbjct: 72 YTLGLNQLSDMTADEVNDMNGLLEED-------FPDVNATFSPPSLQTLPQRVNWTEHGM 124 Query: 451 VTPVKDQLVFGSCWSF 498 V+PV++Q GSCW+F Sbjct: 125 VSPVQNQGPCGSCWAF 140 >UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra erinaceieuropaei|Rep: Cysteine proteinase - Spirometra erinaceieuropaei (Tapeworm) Length = 336 Score = 38.3 bits (85), Expect = 0.14 Identities = 18/32 (56%), Positives = 20/32 (62%) Frame = +1 Query: 403 LSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 L LP +WR GAVT VK+Q GSCWSF Sbjct: 117 LKENLPDSVNWRERGAVTSVKNQGQCGSCWSF 148 Score = 35.5 bits (78), Expect = 0.97 Identities = 16/30 (53%), Positives = 22/30 (73%) Frame = +2 Query: 512 VEGALFLHKGGHLXWLSQQALIDCSWGFGN 601 +EGA+ + K G L LS+Q L+DCSW +GN Sbjct: 154 IEGAIQI-KTGALRSLSEQQLMDCSWDYGN 182 >UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin O; n=1; Monodelphis domestica|Rep: PREDICTED: similar to cathepsin O 
- Monodelphis domestica Length = 414 Score = 37.9 bits (84), Expect = 0.18 Identities = 31/113 (27%), Positives = 47/113 (41%), Gaps = 2/113 (1%) Frame = +1 Query: 166 SQTPEAVRERPGAREEA--EHLQAVAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGR 339 SQTP ER R A E L+ +++F + N +N + +E + Sbjct: 123 SQTPP---ERSENRSTAFRESLKRHHYLNSFSSSDNTSAIYGINQFSYLFPEEFKDI--- 176 Query: 340 RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 Y P LP ++ + LP DWR VT V++Q + G CW+F Sbjct: 177 -YLRSKPSVLPLYSEALKMPTTHMPLPVRFDWRDKHVVTKVRNQQMCGGCWAF 228 >UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 350 Score = 37.9 bits (84), Expect = 0.18 Identities = 28/98 (28%), Positives = 43/98 (43%), Gaps = 4/98 (4%) Frame = +1 Query: 223 LQAVAQIHTFHNRANRGFTMSVNHLADRTDDELA--ALRGRRYSGPSPHGLPFPYSKSRV 396 LQ I ++ +N + + N +D T DE A L + + S P + R Sbjct: 72 LQNDQNIQKHNSDSNNTYKLQHNQFSDMTKDEFAHRVLNSQLKTSASSSSQPAQTPQLRG 131 Query: 397 E-ELSVKLPPEHDWRLF-GAVTPVKDQLVFGSCWSFGT 504 + S+ DWR + G + VK+Q GSCW+F T Sbjct: 132 SVDASLNASQGFDWRNYQGVLGNVKNQGQCGSCWTFAT 169 >UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|Rep: Phospholipase A1 - Tetrahymena thermophila Length = 320 Score = 37.9 bits (84), Expect = 0.18 Identities = 16/27 (59%), Positives = 18/27 (66%) Frame = +1 Query: 424 EHDWRLFGAVTPVKDQLVFGSCWSFGT 504 E DW G VTPVK+Q GSCW+F T Sbjct: 115 EVDWTAKGKVTPVKNQGSCGSCWAFST 141 >UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina). Cathepsin L; n=2; Dictyostelium discoideum|Rep: Similar to Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina). 
Cathepsin L - Dictyostelium discoideum (Slime mold) Length = 265 Score = 37.9 bits (84), Expect = 0.18 Identities = 25/75 (33%), Positives = 33/75 (44%), Gaps = 2/75 (2%) Frame = +1 Query: 280 MSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRV--EELSVKLPPEHDWRLFGAV 453 M +N +D T E A + P P P K+ ++ +P DWR GAV Sbjct: 1 MDLNEYSDLTQKEFADKFFEKLV-PEPRSGPINDIKATPFKHNVNATIPKSFDWRDHGAV 59 Query: 454 TPVKDQLVFGSCWSF 498 VK+Q SCWSF Sbjct: 60 GKVKNQGSCASCWSF 74 >UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep: Cysteine proteinase - Entamoeba histolytica Length = 320 Score = 37.9 bits (84), Expect = 0.18 Identities = 14/30 (46%), Positives = 20/30 (66%) Frame = +1 Query: 415 LPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 +P DWR G +TP++D GSC+SFG+ Sbjct: 97 IPTAIDWRAEGKLTPIRDHTQCGSCYSFGS 126 >UniRef50_Q24FN2 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 367 Score = 37.9 bits (84), Expect = 0.18 Identities = 15/23 (65%), Positives = 18/23 (78%) Frame = +1 Query: 430 DWRLFGAVTPVKDQLVFGSCWSF 498 DWR GAV+PVK+Q GSCW+F Sbjct: 160 DWRQSGAVSPVKNQGSCGSCWAF 182 >UniRef50_Q23TW3 Cluster: Papain family cysteine protease containing protein; n=5; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 437 Score = 37.9 bits (84), Expect = 0.18 Identities = 28/90 (31%), Positives = 44/90 (48%), Gaps = 1/90 (1%) Frame = +1 Query: 232 VAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSV 411 +A+I ++ ++ ++ +N L +TD EL R + + + K +LS Sbjct: 148 LAKIIEHNSNPDKKYSQIINKLTFQTDLELKKFRASQNCSATAQANTRSFRKY---DLS- 203 Query: 412 KLPPEHDWRLFGAVTPVKDQ-LVFGSCWSF 498 +LP DWR G VT VK Q GSCW+F Sbjct: 204 QLPQYVDWREKGVVTQVKSQGKDCGSCWAF 233 >UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Rep: Cysteine protease - Babesia equi Length = 438 Score = 37.9 bits (84), Expect = 0.18 Identities = 
15/23 (65%), Positives = 16/23 (69%) Frame = +1 Query: 430 DWRLFGAVTPVKDQLVFGSCWSF 498 DWR VTPVKDQ GSCW+F Sbjct: 229 DWRKLNGVTPVKDQGNCGSCWAF 251 >UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheirus salmonis|Rep: Putative cathepsin L - Lepeophtheirus salmonis (salmon louse) Length = 257 Score = 37.9 bits (84), Expect = 0.18 Identities = 17/33 (51%), Positives = 20/33 (60%) Frame = +1 Query: 406 SVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 S +P +W GAVT VKDQ GSCW+F T Sbjct: 35 SAPVPSYVNWTKNGAVTAVKDQKDCGSCWAFST 67 >UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; Methanospirillum hungatei JF-1|Rep: Peptidase C1A, papain precursor - Methanospirillum hungatei (strain JF-1 / DSM 864) Length = 1096 Score = 37.9 bits (84), Expect = 0.18 Identities = 34/109 (31%), Positives = 52/109 (47%), Gaps = 4/109 (3%) Frame = +1 Query: 190 ERPGAREEAEHL-QA-VAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPH 363 E P + EE + QA V I+ + N +T +VN + + +E L+G R+ S Sbjct: 249 EDPLSEEERYNAAQAEVDDINAYVKEHNLSWTAAVNPIMLMSPEEREHLKGLRHDLKSST 308 Query: 364 GLPFPYSKSRVEELSVKLPPEHDWRLFGA--VTPVKDQLVFGSCWSFGT 504 + S + + + LP DWR G TP+K+Q GSCW+F T Sbjct: 309 IV----SGAGITPME-GLPTSFDWRNNGGDYTTPIKNQGSCGSCWAFAT 352 >UniRef50_Q2FLD5 Cluster: PKD precursor; n=1; Methanospirillum hungatei JF-1|Rep: PKD precursor - Methanospirillum hungatei (strain JF-1 / DSM 864) Length = 1236 Score = 37.9 bits (84), Expect = 0.18 Identities = 13/29 (44%), Positives = 21/29 (72%) Frame = +1 Query: 418 PPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 P +D R F +TP+K+Q +G+CW+FG+ Sbjct: 91 PSSYDLRTFDKLTPIKNQNPWGTCWAFGS 119 >UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep: LOC443661 protein - Xenopus laevis (African clawed frog) Length = 346 Score = 37.5 bits (83), Expect = 0.24 Identities = 24/76 (31%), Positives = 38/76 (50%), Gaps = 1/76 (1%) Frame = +1 Query: 274 FTMSVNHLADRTDDEL-AALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGA 450 + + +NHL D T +E+ A + G S S + ++ + L + P DWR G Sbjct: 96 
YEVGMNHLGDMTGEEVEATMTGYTSSDDSLANM----TRVPKKLLEAQPPASIDWRTKGC 151 Query: 451 VTPVKDQLVFGSCWSF 498 VT V+ Q GSC++F Sbjct: 152 VTSVRRQRKCGSCYAF 167 >UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestivum|Rep: Cysteine protease - Triticum aestivum (Wheat) Length = 371 Score = 37.1 bits (82), Expect = 0.32 Identities = 14/27 (51%), Positives = 16/27 (59%) Frame = +1 Query: 418 PPEHDWRLFGAVTPVKDQLVFGSCWSF 498 P + DWR G VTP K Q G CW+F Sbjct: 154 PRQFDWREHGVVTPAKQQGACGCCWAF 180 >UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: Vivapain-4 - Plasmodium vivax Length = 484 Score = 37.1 bits (82), Expect = 0.32 Identities = 13/26 (50%), Positives = 20/26 (76%) Frame = +1 Query: 424 EHDWRLFGAVTPVKDQLVFGSCWSFG 501 ++DWR AV+ +K+Q + GSCW+FG Sbjct: 265 KYDWREHNAVSEIKNQNLCGSCWAFG 290 >UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: Cysteine protease - Clonorchis sinensis Length = 328 Score = 37.1 bits (82), Expect = 0.32 Identities = 25/72 (34%), Positives = 32/72 (44%), Gaps = 1/72 (1%) Frame = +1 Query: 286 VNHLADRTDDELAALRGR-RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPV 462 V +D T +E R R+ GP P P ++ + DWR GAV PV Sbjct: 77 VTQFSDLTSEEFKTRYLRMRFDGPIVSEDPSPEEDVTMDN------EKFDWREHGAVGPV 130 Query: 463 KDQLVFGSCWSF 498 DQ GSCW+F Sbjct: 131 LDQGKCGSCWAF 142 >UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanensis|Rep: Sui m 1 allergen - Suidasia medanensis Length = 336 Score = 37.1 bits (82), Expect = 0.32 Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 5/88 (5%) Frame = +1 Query: 256 NRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHG-LPFPYSKSRVE----ELSVKLP 420 N G + VN AD T +E +++ Y+ + L P K V+ ++SV LP Sbjct: 61 NHGKHGAGLEVNEHADLTAEEFSSM----YATLNQEAFLKSPLHKEFVQVPESDISVALP 116 Query: 421 PEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 DWR T V++Q GSCW+F T Sbjct: 117 AAFDWRQQWN-TAVRNQGQCGSCWAFAT 143 >UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_36, 
whole genome shotgun sequence - Paramecium tetraurelia Length = 307 Score = 37.1 bits (82), Expect = 0.32 Identities = 26/91 (28%), Positives = 42/91 (46%) Frame = +1 Query: 226 QAVAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEEL 405 Q V I+ ++ N+ ++M+VN AD TD+E ++ Y G P +E Sbjct: 54 QNVELINKHNSNPNKSYSMAVNQFADLTDEEFQSM----YLGK-----PTYVKIDNIELS 104 Query: 406 SVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 + DW + P+K+Q GSCW+F Sbjct: 105 KGNTLGDADWA--SKMNPIKNQGNCGSCWTF 133 >UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckeia)|Rep: Berghepain-2 - Plasmodium yoelii yoelii Length = 472 Score = 36.7 bits (81), Expect = 0.42 Identities = 22/90 (24%), Positives = 42/90 (46%), Gaps = 6/90 (6%) Frame = +1 Query: 253 HNRANRGFTMSVNHLADRTDDE--LAALRGR---RYSGPSPHGLPFPYSKSRVEELSVKL 417 HN+ N +T +N +D +E + L + + H +P+ + ++ + + ++ Sbjct: 190 HNKENHLYTKGINAFSDMRHEEFKMKYLNNKLKENHQIDLRHLIPYTIAINKYKSPTDQI 249 Query: 418 P-PEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 DWR A+ +KDQ SCW+F T Sbjct: 250 NYTSFDWRDHNAIIDIKDQQKCASCWAFAT 279 >UniRef50_Q235G6 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 325 Score = 36.7 bits (81), Expect = 0.42 Identities = 16/25 (64%), Positives = 17/25 (68%) Frame = +1 Query: 430 DWRLFGAVTPVKDQLVFGSCWSFGT 504 DW GAVTPVK+Q G CWSF T Sbjct: 122 DWVEKGAVTPVKNQGGCGGCWSFAT 146 >UniRef50_Q8TMY7 Cluster: Cell surface protein; n=2; Methanosarcina|Rep: Cell surface protein - Methanosarcina acetivorans Length = 1515 Score = 36.7 bits (81), Expect = 0.42 Identities = 17/36 (47%), Positives = 21/36 (58%) Frame = +1 Query: 400 ELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGTW 507 E S P +D R G V+PVKDQ GSCW+ G + Sbjct: 124 EDSGSFEPFYDLRELGKVSPVKDQKDSGSCWAHGAY 159 >UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sativa|Rep: OSJNBb0085F13.15 protein - Oryza sativa subsp. 
japonica (Rice) Length = 383 Score = 36.3 bits (80), Expect = 0.56 Identities = 15/31 (48%), Positives = 19/31 (61%) Frame = +1 Query: 406 SVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 +V +P DWR GAVTP K Q +CW+F Sbjct: 157 TVAVPESVDWRKEGAVTPAKHQGQCAACWAF 187 >UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intestinalis|Rep: GLP_90_15278_13989 - Giardia lamblia ATCC 50803 Length = 429 Score = 36.3 bits (80), Expect = 0.56 Identities = 15/30 (50%), Positives = 20/30 (66%) Frame = +1 Query: 415 LPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 LP D R +G +TPV++Q GSCW+F T Sbjct: 60 LPQSVDLREYGLMTPVRNQGKCGSCWAFAT 89 >UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcoptes scabiei type hominis|Rep: Sar s 1 allergen Yv4003H01 - Sarcoptes scabiei type hominis Length = 330 Score = 36.3 bits (80), Expect = 0.56 Identities = 16/27 (59%), Positives = 18/27 (66%) Frame = +1 Query: 424 EHDWRLFGAVTPVKDQLVFGSCWSFGT 504 E D R G VTPVKDQ G+CW+F T Sbjct: 116 EIDLRKCGFVTPVKDQKKCGACWAFST 142 >UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin C; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin C - Strongylocentrotus purpuratus Length = 482 Score = 35.9 bits (79), Expect = 0.74 Identities = 16/41 (39%), Positives = 26/41 (63%), Gaps = 3/41 (7%) Frame = +1 Query: 391 RVEELSVKLPPEHDWRLFGA---VTPVKDQLVFGSCWSFGT 504 R ++ + LP + DWR G V+PV+DQ + GSC++F + Sbjct: 241 RTKQAASNLPEKFDWRDVGGIDYVSPVRDQGICGSCYAFAS 281 >UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dvir_CG5367 - Drosophila virilis (Fruit fly) Length = 298 Score = 35.9 bits (79), Expect = 0.74 Identities = 28/99 (28%), Positives = 41/99 (41%), Gaps = 2/99 (2%) Frame = +1 Query: 208 EEAEHLQAVAQIH-TFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYS 384 E E Q + H T++ F ++ N +AD D L+G SP Sbjct: 18 EAYEENQIIVNEHNTYYETGKSSFRLATNTMADMNTDSY--LKGYLRLLRSPEISDSDNI 75 Query: 385 KSRV-EELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 V L +P DWR G +TP+ +Q GSC++F Sbjct: 76 ADIVGSPLMNNVPESFDWRKKGFITPLYNQQSCGSCYAF 114 
>UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain - Tetrahymena pyriformis Length = 330 Score = 35.9 bits (79), Expect = 0.74 Identities = 24/73 (32%), Positives = 32/73 (43%) Frame = +1 Query: 286 VNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVK 465 VN D T++E AA R P P +E ++ P DW + PVK Sbjct: 82 VNSFTDLTEEEFAA-RYLMKDLPQQMNKDLPI----LEMETLAAPQVIDWTAKNVLPPVK 136 Query: 466 DQLVFGSCWSFGT 504 +Q GSCW+F T Sbjct: 137 NQQQCGSCWAFST 149 >UniRef50_A0B934 Cluster: GHMP kinase; n=1; Methanosaeta thermophila PT|Rep: GHMP kinase - Methanosaeta thermophila (strain DSM 6194 / PT) (Methanothrixthermophila (strain DSM 6194 / PT)) Length = 305 Score = 35.9 bits (79), Expect = 0.74 Identities = 25/88 (28%), Positives = 41/88 (46%) Frame = -1 Query: 339 PPSEGSELVVSAIGKMVHGHGETAVRSIMECMYLSDCLKMFSLFSCSRSLAYCFWSLTLK 160 P S G L ++I K V+ GE AVR I+ + + +++ F+C LA + ++ Sbjct: 197 PISTGDVLRDASIMKRVNAAGERAVREILRRPTMGEFMRLSKRFTCETELASSWAMDAIE 256 Query: 159 RSNSSWTWASCTGRTNSFIGLKVAKCRE 76 SS AS +S + +CRE Sbjct: 257 AVESSGGMASMIMLGDSVFAVGGEECRE 284 >UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; Theileria|Rep: Cysteine proteinase precursor - Theileria parva Length = 440 Score = 35.9 bits (79), Expect = 0.74 Identities = 17/32 (53%), Positives = 20/32 (62%), Gaps = 1/32 (3%) Frame = +1 Query: 412 KLPPEH-DWRLFGAVTPVKDQLVFGSCWSFGT 504 KL E+ DWR +VT VKDQ G CW+F T Sbjct: 227 KLTGENLDWRRSSSVTSVKDQSNCGGCWAFST 258 >UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin W, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to Cathepsin W, partial - Ornithorhynchus anatinus Length = 229 Score = 35.5 bits (78), Expect = 0.97 Identities = 14/23 (60%), Positives = 17/23 (73%) Frame = +1 Query: 430 DWRLFGAVTPVKDQLVFGSCWSF 498 DWR GA+T VK+Q GSCW+F Sbjct: 73 DWRKRGAITSVKNQGSCGSCWAF 95 >UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis thaliana|Rep: Cysteine proteinase - Arabidopsis thaliana (Mouse-ear cress) Length = 348 Score = 35.5 
bits (78), Expect = 0.97 Identities = 15/23 (65%), Positives = 16/23 (69%) Frame = +1 Query: 430 DWRLFGAVTPVKDQLVFGSCWSF 498 DWR GAVTPVK Q G CW+F Sbjct: 133 DWRQEGAVTPVKYQGRCGGCWAF 155 >UniRef50_Q42312 Cluster: Cysteine protease; n=1; Arabidopsis thaliana|Rep: Cysteine protease - Arabidopsis thaliana (Mouse-ear cress) Length = 105 Score = 35.5 bits (78), Expect = 0.97 Identities = 19/43 (44%), Positives = 25/43 (58%) Frame = +2 Query: 125 VHDAHVHDEFERFKVKLQKQYASDLEHEKRLNIFRQSLRYIHS 253 V DA FE + VK K Y S E E+RL IF +LR+I++ Sbjct: 40 VFDAEASLIFESWMVKHGKVYGSVAEKERRLTIFEDNLRFINN 82 >UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba histolytica|Rep: Cysteine protease 17 - Entamoeba histolytica Length = 420 Score = 35.5 bits (78), Expect = 0.97 Identities = 15/56 (26%), Positives = 31/56 (55%) Frame = +1 Query: 337 RRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 R+ P + + + ++ +++ +LP D+R FG +T +++Q G CWSF + Sbjct: 141 RKVHVPKKYRIGRKWQFNKKKDIVKELPEGIDFRKFGKLTYIREQTGCGGCWSFAS 196 >UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_23, whole genome shotgun sequence - Paramecium tetraurelia Length = 321 Score = 35.5 bits (78), Expect = 0.97 Identities = 26/91 (28%), Positives = 36/91 (39%) Frame = +1 Query: 226 QAVAQIHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEEL 405 Q + +I F N N + +N D TD E + Y +P + E Sbjct: 64 QNIMKIEDF-NSQNNSYKQKINKFGDLTDQEFLTI----YLNLQ---MPARVKNIQKNEE 115 Query: 406 SVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 + E DW G V +KDQ GSCW+F Sbjct: 116 PFLVQEEVDWVQKGKVPAIKDQGDCGSCWAF 146 >UniRef50_Q8TQM7 Cluster: Putative uncharacterized protein; n=1; Methanosarcina acetivorans|Rep: Putative uncharacterized protein - Methanosarcina acetivorans Length = 584 Score = 35.5 bits (78), Expect = 0.97 Identities = 23/71 (32%), Positives = 32/71 (45%), Gaps = 2/71 (2%) Frame = +1 Query: 298 
ADRTDDELAALRGRRYSG--PSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQ 471 AD+ + A R G P+P L S++ L + P +D R VT VK Q Sbjct: 77 ADKVYTQSLASSNRHKKGFVPAPVDLS---DLSKISTLEISAPAYYDLRTLNRVTSVKYQ 133 Query: 472 LVFGSCWSFGT 504 G+CW+F T Sbjct: 134 GESGACWTFAT 144 >UniRef50_Q8PS79 Cluster: Putative uncharacterized protein; n=1; Methanosarcina mazei|Rep: Putative uncharacterized protein - Methanosarcina mazei (Methanosarcina frisia) Length = 626 Score = 35.5 bits (78), Expect = 0.97 Identities = 17/51 (33%), Positives = 27/51 (52%), Gaps = 4/51 (7%) Frame = +1 Query: 367 LPFPYSKSRVEELSV----KLPPEHDWRLFGAVTPVKDQLVFGSCWSFGTW 507 +P P S + ++SV P +D R +T VKDQ G+CW+F ++ Sbjct: 44 VPSPIELSYISDISVPKAASAPAYYDLRALNRLTSVKDQGTAGTCWAFASY 94 >UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep: Viral cathepsin - Xestia c-nigrum granulosis virus (XnGV) (Xestia c-nigrumgranulovirus) Length = 346 Score = 35.5 bits (78), Expect = 0.97 Identities = 16/31 (51%), Positives = 19/31 (61%) Frame = +1 Query: 406 SVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 S K+P DWR +VT VK Q GSCW+F Sbjct: 130 SGKVPDSFDWRDRNSVTSVKMQKECGSCWAF 160 >UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 20 SCAF14744, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 175 Score = 35.1 bits (77), Expect = 1.3 Identities = 14/28 (50%), Positives = 17/28 (60%) Frame = +1 Query: 415 LPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 LP DWR V PV++Q GSCW+F Sbjct: 59 LPARFDWRDNAVVGPVQNQQACGSCWAF 86 >UniRef50_Q1CXI7 Cluster: Putative uncharacterized protein; n=1; Myxococcus xanthus DK 1622|Rep: Putative uncharacterized protein - Myxococcus xanthus (strain DK 1622) Length = 294 Score = 35.1 bits (77), Expect = 1.3 Identities = 21/58 (36%), Positives = 28/58 (48%) Frame = +1 Query: 271 GFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLF 444 G S+ H D T D L A +GPSP LP P RV +++P ++ RLF Sbjct: 26 GLAYSLRH--DGTLDSLMAASEGVITGPSPDWLPLPEGGLRVNSALLEMPEDYGPRLF 81 
>UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 664 Score = 35.1 bits (77), Expect = 1.3 Identities = 19/41 (46%), Positives = 25/41 (60%) Frame = +1 Query: 382 SKSRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 SKSR+ L P DWR +G V+ VK+Q GSC++F T Sbjct: 461 SKSRL--LKWSRPISIDWRTWGMVSKVKNQGSCGSCYAFST 499 >UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteinase A; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like tick cysteine proteinase A - Haemaphysalis longicornis (Bush tick) Length = 312 Score = 35.1 bits (77), Expect = 1.3 Identities = 15/30 (50%), Positives = 18/30 (60%) Frame = +1 Query: 415 LPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 LP DW G+ PVK+Q GSCW+F T Sbjct: 93 LPTTVDWAQEGSRAPVKNQGQCGSCWAFST 122 >UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin B; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin B - Strongylocentrotus purpuratus Length = 346 Score = 34.7 bits (76), Expect = 1.7 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 1/50 (2%) Frame = +1 Query: 355 SPHG-LPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFG 501 +P+G LP +++R+++L +W + V+DQ GSCW+FG Sbjct: 61 NPNGRLPKLENQTRIKDLPENFDARENWPNCPTIKEVRDQGSCGSCWAFG 110 >UniRef50_Q53K53 Cluster: Cysteine protease 1, putative; n=5; Oryza sativa|Rep: Cysteine protease 1, putative - Oryza sativa subsp. 
japonica (Rice) Length = 472 Score = 34.7 bits (76), Expect = 1.7 Identities = 30/103 (29%), Positives = 40/103 (38%), Gaps = 8/103 (7%) Frame = +1 Query: 187 RERPGAREEAEHLQAVAQIHTFHNRAN-RG---FTMSVNHLADRTDDELAALRGRRYSGP 354 R P A E + + F + N RG + ++ N AD T++E A Y G Sbjct: 60 RSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEEEFLATYTGYYIGD 119 Query: 355 SP-HGLPFPYSKSRVE---ELSVKLPPEHDWRLFGAVTPVKDQ 471 P F V+ V +P DWR GAV P K Q Sbjct: 120 GPVDDFVFTTGAGDVDASFSYRVDVPASVDWRAQGAVVPPKSQ 162 >UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba histolytica|Rep: Cysteine protease 10 - Entamoeba histolytica Length = 297 Score = 34.7 bits (76), Expect = 1.7 Identities = 14/25 (56%), Positives = 18/25 (72%) Frame = +1 Query: 430 DWRLFGAVTPVKDQLVFGSCWSFGT 504 DWR G VTPVK+Q SC++FG+ Sbjct: 113 DWRSEGKVTPVKNQRKCASCYAFGS 137 >UniRef50_Q3L7L6 Cluster: Sar s 1 allergen Yv5032C08; n=1; Sarcoptes scabiei type hominis|Rep: Sar s 1 allergen Yv5032C08 - Sarcoptes scabiei type hominis Length = 340 Score = 34.7 bits (76), Expect = 1.7 Identities = 27/79 (34%), Positives = 39/79 (49%), Gaps = 1/79 (1%) Frame = +1 Query: 265 NRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELS-VKLPPEHDWRL 441 N+ +S+N AD T +E +A + P L Y ++ VKL E D R Sbjct: 66 NKHRGVSINAHADLTVNEFSAKYLSK--APKTEDLLDEYKLFSCDKFEGVKLG-ELDLRK 122 Query: 442 FGAVTPVKDQLVFGSCWSF 498 G VT +++QL GSCW+F Sbjct: 123 EGRVTKIREQLACGSCWAF 141 >UniRef50_Q23EG5 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 429 Score = 34.7 bits (76), Expect = 1.7 Identities = 27/94 (28%), Positives = 46/94 (48%), Gaps = 5/94 (5%) Frame = +1 Query: 232 VAQIHTFHNRANRGFTMSVNHLADRTDDELAALRG-RRYSGPSPHGLPFPYSKSRVEELS 408 VA+I + N+ +T ++ T++E++ L+G + S + + +LS Sbjct: 65 VAKIAEHNLNPNKKYTQKISKFTFYTNEEISKLKGSQNCSATAKENTRI----LQTYDLS 120 Query: 409 VKLPPEHDWRLFGAVTPVKDQLVF----GSCWSF 498 ++P DWR G V+ VKDQ GSCW+F Sbjct: 121 
-EIPDYVDWREKGIVSSVKDQDAVGDDCGSCWTF 153 >UniRef50_Q22NW9 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 894 Score = 34.7 bits (76), Expect = 1.7 Identities = 21/48 (43%), Positives = 28/48 (58%), Gaps = 2/48 (4%) Frame = +1 Query: 367 LPFPYSKS-RVEE-LSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 L P SK + +E L ++P DWR AVTPVK+Q GS ++F T Sbjct: 665 LQIPASKQYKTQEFLGDEVPSSIDWRDLNAVTPVKNQGSCGSGYAFST 712 >UniRef50_A7LFV3 Cluster: Cathepsin L protease inhibitor 2; n=1; Diaprepes abbreviatus|Rep: Cathepsin L protease inhibitor 2 - Diaprepes abbreviatus (Sugarcane rootstalk borer weevil) Length = 91 Score = 34.7 bits (76), Expect = 1.7 Identities = 16/40 (40%), Positives = 24/40 (60%) Frame = +2 Query: 146 DEFERFKVKLQKQYASDLEHEKRLNIFRQSLRYIHSIIER 265 +E+E+FK + Y S E KR NIF+Q+L+ I E+ Sbjct: 15 EEWEKFKTGFNRNYDSSDEEAKRFNIFQQNLQSIREHNEK 54 >UniRef50_A0C1I6 Cluster: Chromosome undetermined scaffold_142, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_142, whole genome shotgun sequence - Paramecium tetraurelia Length = 338 Score = 34.7 bits (76), Expect = 1.7 Identities = 15/45 (33%), Positives = 26/45 (57%) Frame = +2 Query: 104 MKEFVRPVHDAHVHDEFERFKVKLQKQYASDLEHEKRLNIFRQSL 238 M++FV P H +HD+F KLQK + + ++ + + +I Q L Sbjct: 1 MEDFVSPTHSIQMHDKFVTRLEKLQKLFDNSVQRKDQQHIILQQL 45 >UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrovirus|Rep: Viral cathepsin - Lymantria dispar multicapsid nuclear polyhedrosis virus (LdMNPV) Length = 356 Score = 34.7 bits (76), Expect = 1.7 Identities = 23/77 (29%), Positives = 36/77 (46%), Gaps = 1/77 (1%) Frame = +1 Query: 277 TMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEH-DWRLFGAV 453 T +N +D + EL A +++G S + K+ + P H DWR V Sbjct: 101 TYKINKFSDLSKSELIA----KFTGLSIPERVSNFCKTIILNQPPDKGPLHFDWREQNKV 156 Query: 454 TPVKDQLVFGSCWSFGT 504 T +K+Q G+CW+F T Sbjct: 
157 TSIKNQGACGACWAFAT 173 Score = 32.3 bits (70), Expect = 9.1 Identities = 15/36 (41%), Positives = 20/36 (55%) Frame = +2 Query: 146 DEFERFKVKLQKQYASDLEHEKRLNIFRQSLRYIHS 253 D FE F K Y SD E KR +IF+ +L I++ Sbjct: 54 DYFESFVENYNKNYTSDWEKNKRYSIFKDNLHEINA 89 >UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Giardia intestinalis|Rep: Cathepsin B-like CP1 precursor - Giardia lamblia (Giardia intestinalis) Length = 303 Score = 34.7 bits (76), Expect = 1.7 Identities = 23/66 (34%), Positives = 33/66 (50%), Gaps = 2/66 (3%) Frame = +1 Query: 307 TDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWR--LFGAVTPVKDQLVF 480 T+DE ++ R + G P S + V+EL +PP+ D+R V P DQ Sbjct: 43 TEDEFRSMLIRPDRLRARSGSLPPISITEVQELVDPIPPQFDFRDEYPQCVKPALDQGSC 102 Query: 481 GSCWSF 498 GSCW+F Sbjct: 103 GSCWAF 108 >UniRef50_Q70SU8 Cluster: Cystein proteinase inhibitor protein precursor; n=4; Salmonidae|Rep: Cystein proteinase inhibitor protein precursor - Salmo salar (Atlantic salmon) Length = 342 Score = 34.3 bits (75), Expect = 2.3 Identities = 16/32 (50%), Positives = 20/32 (62%) Frame = +2 Query: 131 DAHVHDEFERFKVKLQKQYASDLEHEKRLNIF 226 +A VH EFE +KVK K Y S +E KR I+ Sbjct: 267 EAEVHKEFETWKVKYGKTYPSTVEEAKRKEIW 298 >UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2; Oryza sativa|Rep: Putative uncharacterized protein - Oryza sativa subsp. indica (Rice) Length = 149 Score = 34.3 bits (75), Expect = 2.3 Identities = 15/28 (53%), Positives = 17/28 (60%) Frame = +1 Query: 415 LPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 +P DWR GAV VK Q GSCW+F Sbjct: 17 MPKSIDWRKKGAVVEVKYQEDCGSCWAF 44 >UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-ear cress). SAG12 protein; n=2; Dictyostelium discoideum|Rep: Similar to Arabidopsis thaliana (Mouse-ear cress). 
SAG12 protein - Dictyostelium discoideum (Slime mold) Length = 358 Score = 34.3 bits (75), Expect = 2.3 Identities = 15/23 (65%), Positives = 16/23 (69%) Frame = +1 Query: 430 DWRLFGAVTPVKDQLVFGSCWSF 498 DWR G VTPVKDQ GSC+ F Sbjct: 150 DWRKKGLVTPVKDQGQCGSCYIF 172 >UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep: Silicatein beta - Suberites domuncula (Sponge) Length = 383 Score = 34.3 bits (75), Expect = 2.3 Identities = 15/28 (53%), Positives = 18/28 (64%) Frame = +1 Query: 415 LPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 +P DWR G VT VKDQL GS ++F Sbjct: 162 MPETMDWRTSGVVTKVKDQLRCGSSYAF 189 >UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcoptes scabiei type hominis|Rep: Sar s 1 allergen Yv6030H07 - Sarcoptes scabiei type hominis Length = 322 Score = 34.3 bits (75), Expect = 2.3 Identities = 17/42 (40%), Positives = 23/42 (54%) Frame = +1 Query: 379 YSKSRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 Y V E S+ PE D R +TP+++Q GSCW+F T Sbjct: 95 YGFCNVTETSIF--PEIDLRKDNVLTPIREQGACGSCWAFST 134 >UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia circumcincta|Rep: Secreted cathepsin F - Teladorsagia circumcincta Length = 364 Score = 34.3 bits (75), Expect = 2.3 Identities = 19/54 (35%), Positives = 27/54 (50%), Gaps = 7/54 (12%) Frame = +1 Query: 358 PHGLPFPYSKSRVEELSVK-------LPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 PH P +R+ +L+ + LP DWR GAVT VK + +CW+F Sbjct: 127 PHTWKQPDHPNRIVDLAAEGVDPKEPLPESFDWREHGAVTKVKTEGHCAACWAF 180 >UniRef50_Q24FA8 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 335 Score = 34.3 bits (75), Expect = 2.3 Identities = 13/23 (56%), Positives = 16/23 (69%) Frame = +1 Query: 430 DWRLFGAVTPVKDQLVFGSCWSF 498 DWR G V+PVK+Q G CW+F Sbjct: 129 DWRKKGGVSPVKNQGECGGCWTF 151 >UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H-like cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan 
CA, family C1, cathepsin L, S or H-like cysteine peptidase - Trichomonas vaginalis G3 Length = 473 Score = 34.3 bits (75), Expect = 2.3 Identities = 15/31 (48%), Positives = 19/31 (61%), Gaps = 1/31 (3%) Frame = +1 Query: 415 LPPEHDWR-LFGAVTPVKDQLVFGSCWSFGT 504 LP E WR + V +DQ+ GSCW+FGT Sbjct: 251 LPAEFSWRDVPNVVGKPRDQVACGSCWAFGT 281 >UniRef50_A2DKX2 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 317 Score = 34.3 bits (75), Expect = 2.3 Identities = 27/83 (32%), Positives = 39/83 (46%) Frame = +1 Query: 256 NRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDW 435 NR R T++ N + T E AL S P H P KS++ V++P DW Sbjct: 55 NRGKRSHTLAHNKFSAYTHAEYKALLN---SKPI-H--PRNVQKSQITTQKVQVPDTWDW 108 Query: 436 RLFGAVTPVKDQLVFGSCWSFGT 504 R A PV+DQ+ S ++F + Sbjct: 109 RDRVAFNPVRDQMECASGFAFAS 131 >UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]; n=11; Eutheria|Rep: Testin-2 precursor [Contains: Testin-1] - Mus musculus (Mouse) Length = 333 Score = 34.3 bits (75), Expect = 2.3 Identities = 14/28 (50%), Positives = 18/28 (64%) Frame = +1 Query: 415 LPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 +P DWR+ G VTPVK+Q S W+F Sbjct: 114 VPKYVDWRMLGYVTPVKNQGYCASSWAF 141 >UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precursor; n=20; Psoroptidia|Rep: Major mite fecal allergen Der f 1 precursor - Dermatophagoides farinae (House-dust mite) Length = 321 Score = 34.3 bits (75), Expect = 2.3 Identities = 27/83 (32%), Positives = 40/83 (48%), Gaps = 4/83 (4%) Frame = +1 Query: 262 ANRGFTMSVNHLADRTDDELA---ALRGRRYSG-PSPHGLPFPYSKSRVEELSVKLPPEH 429 AN+G ++NHL+D + DE + + + L S R+ SV +P E Sbjct: 59 ANKG---AINHLSDLSLDEFKNRYLMSAEAFEQLKTQFDLNAETSACRIN--SVNVPSEL 113 Query: 430 DWRLFGAVTPVKDQLVFGSCWSF 498 D R VTP++ Q GSCW+F Sbjct: 114 DLRSLRTVTPIRMQGGCGSCWAF 136 >UniRef50_Q23894 Cluster: Cysteine proteinase 3; n=2; Dictyostelium discoideum|Rep: 
Cysteine proteinase 3 - Dictyostelium discoideum (Slime mold) Length = 151 Score = 34.3 bits (75), Expect = 2.3 Identities = 27/87 (31%), Positives = 40/87 (45%), Gaps = 4/87 (4%) Frame = +1 Query: 241 IHTFHNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKS----RVEELS 408 +H ++++ ++ + +N AD +++E Y G H Y K R+ Sbjct: 19 VHNWNSKGSKT-VLGLNQHADLSNEEYRL----NYLGTRAHIKLNGYHKRNLGLRLNRPH 73 Query: 409 VKLPPEHDWRLFGAVTPVKDQLVFGSC 489 K P DWR AVTPVKDQ GSC Sbjct: 74 FKQPLNVDWREKDAVTPVKDQGQCGSC 100 >UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleostomi|Rep: Cathepsin O precursor - Homo sapiens (Human) Length = 321 Score = 34.3 bits (75), Expect = 2.3 Identities = 14/31 (45%), Positives = 19/31 (61%) Frame = +1 Query: 406 SVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 +V LP DWR VT V++Q + G CW+F Sbjct: 105 NVSLPLRFDWRDKQVVTQVRNQQMCGGCWAF 135 >UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome shotgun sequence; n=3; Tetraodontidae|Rep: Chromosome 12 SCAF14996, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 362 Score = 33.9 bits (74), Expect = 3.0 Identities = 24/80 (30%), Positives = 35/80 (43%), Gaps = 1/80 (1%) Frame = +1 Query: 238 QIHTF-HNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVK 414 ++H H+ + + +NH D T +E + P F S +E ++ Sbjct: 59 ELHNLEHSMGQHSYRLGMNHFGDMTHEEFRQIMNGYKHKPQRK---FRGSLF-MEPNFLE 114 Query: 415 LPPEHDWRLFGAVTPVKDQL 474 P DWR G VTPVKDQL Sbjct: 115 APRAVDWRDKGYVTPVKDQL 134 >UniRef50_A0LES1 Cluster: Peptidase C1A, papain precursor; n=2; Syntrophobacter fumaroxidans MPOB|Rep: Peptidase C1A, papain precursor - Syntrophobacter fumaroxidans (strain DSM 10017 / MPOB) Length = 497 Score = 33.9 bits (74), Expect = 3.0 Identities = 19/47 (40%), Positives = 26/47 (55%) Frame = +1 Query: 367 LPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGTW 507 LP S++ E +V P +D R VT V+DQ GSCW+F T+ Sbjct: 85 LPDADSRAAAVE-AVTYPATYDLRTQHRVTSVRDQGDCGSCWAFATY 130 >UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain; n=9; Cucujiformia|Rep: 
Digestive cysteine proteinase intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 326 Score = 33.9 bits (74), Expect = 3.0 Identities = 24/94 (25%), Positives = 43/94 (45%), Gaps = 1/94 (1%) Frame = +1 Query: 220 HLQAVAQIHTFHNRANRGFTMSVNHLADRTDDELA-ALRGRRYSGPSPHGLPFPYSKSRV 396 +L+ + + + +++ + + V AD T DE LR + + P+ + + Sbjct: 50 NLRKIEEHNAKYDKGEESYFLGVTPFADLTHDEFKDELRRQIKTKPNVEATLAVFPEG-- 107 Query: 397 EELSVKLPPEHDWRLFGAVTPVKDQLVFGSCWSF 498 +++P DW GAV VK Q GSCW+F Sbjct: 108 ----LEVPDSIDWTQKGAVLDVKYQGGCGSCWAF 137 >UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsin L - Suberites domuncula (Sponge) Length = 324 Score = 33.9 bits (74), Expect = 3.0 Identities = 14/23 (60%), Positives = 16/23 (69%) Frame = +1 Query: 430 DWRLFGAVTPVKDQLVFGSCWSF 498 DWR G V+ VK+Q GSCWSF Sbjct: 113 DWRQKGVVSEVKNQGQCGSCWSF 135 >UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative cysteine proteinase - Oryza sativa subsp. japonica (Rice) Length = 385 Score = 33.5 bits (73), Expect = 3.9 Identities = 24/97 (24%), Positives = 42/97 (43%), Gaps = 11/97 (11%) Frame = +1 Query: 241 IHTFHNRANRGFTMSVNHLADRTDDELAA-LRGRRYSGPSPHGLPFPYSKSRVEELSVKL 417 ++ F+ + + + +N +D T +E A G R + + + + + Sbjct: 78 VNEFNKKEGMTYRLGLNQFSDMTFEEFAGKFTGGRTGSIAGDLRDGAVTYCKPPAVGY-V 136 Query: 418 PPEHDWRLFGAVTPVKDQLVF----------GSCWSF 498 PP +W +G VTPVK+QL GSCW+F Sbjct: 137 PPSWNWTKYGVVTPVKNQLTCVNTIKMSMYEGSCWAF 173 >UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa (japonica cultivar-group)|Rep: Os09g0562700 protein - Oryza sativa subsp. 
japonica (Rice) Length = 235 Score = 33.5 bits (73), Expect = 3.9 Identities = 18/33 (54%), Positives = 20/33 (60%) Frame = +1 Query: 406 SVKLPPEHDWRLFGAVTPVKDQLVFGSCWSFGT 504 S LP +H GAVT VKDQ GSCW+F T Sbjct: 10 SCLLPVDHG----GAVTEVKDQGRCGSCWAFST 38 Database: uniref50 Posted date: Oct 5, 2007 11:19 AM Number of letters in database: 575,637,011 Number of sequences in database: 1,657,284 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.279 0.0580 0.190 Matrix: BLOSUM62 Gap Penalties: Existence: 9, Extension: 2 Number of Hits to DB: 589,725,011 Number of Sequences: 1657284 Number of extensions: 11785611 Number of successful extensions: 49837 Number of sequences better than 10.0: 284 Number of HSP's better than 10.0 without gapping: 46550 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 49741 length of database: 575,637,011 effective HSP length: 97 effective length of database: 414,880,463 effective search space used: 42732687689 frameshift window, decay const: 40, 0.1 T: 12 A: 40 X1: 16 ( 7.3 bits) X2: 37 (14.9 bits) X3: 62 (25.0 bits) S1: 41 (21.7 bits)