BLASTX 2.2.12 [Aug-07-2005] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= heS00028 (846 letters) Database: uniref50 1,657,284 sequences; 575,637,011 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpw... 131 2e-29 UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Ca... 127 4e-28 UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin ... 118 2e-25 UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n... 111 2e-23 UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome sh... 108 2e-22 UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=... 107 3e-22 UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep: ... 105 2e-21 UniRef50_A1XG92 Cluster: Putative cathepsin B-like like proteina... 104 3e-21 UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|... 103 4e-21 UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 ... 103 4e-21 UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep: Ca... 100 5e-20 UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase pr... 100 5e-20 UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA... 99 2e-19 UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n... 97 4e-19 UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3 ... 97 4e-19 UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase pr... 97 5e-19 UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=... 96 9e-19 UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor; ... 96 9e-19 UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Catheps... 94 3e-18 UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gamb... 93 1e-17 UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7; Rhabditida|... 93 1e-17 UniRef50_Q23FP9 Cluster: Papain family cysteine protease contain... 92 1e-17 UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4 ... 92 1e-17 UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep... 92 2e-17 UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1; Rhip... 91 3e-17 UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep: Ca... 90 6e-17 UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7... 90 6e-17 UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115, w... 90 7e-17 UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8.... 89 2e-16 UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep... 88 2e-16 UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4; Ancylostomati... 88 3e-16 UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8... 87 4e-16 UniRef50_Q625X3 Cluster: Putative uncharacterized protein CBG011... 87 7e-16 UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2; ... 86 9e-16 UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8; Leishma... 86 1e-15 UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2 ... 85 2e-15 UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator ame... 84 4e-15 UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8.... 83 6e-15 UniRef50_Q5DEC9 Cluster: SJCHGC06356 protein; n=1; Schistosoma j... 83 9e-15 UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1 ... 83 9e-15 UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyl... 81 3e-14 UniRef50_Q00W63 Cluster: Cysteine proteinase; n=2; Ostreococcus|... 81 3e-14 UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma j... 81 5e-14 UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2... 80 6e-14 UniRef50_P91991 Cluster: Putative uncharacterized protein; n=2; ... 80 6e-14 UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin ... 79 1e-13 UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: C... 78 3e-13 UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|R... 77 4e-13 UniRef50_Q237A1 Cluster: Papain family cysteine protease contain... 77 6e-13 UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus co... 77 7e-13 UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep... 73 7e-12 UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoid... 72 2e-11 UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1; O... 72 2e-11 UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep... 71 4e-11 UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|R... 69 1e-10 UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella ve... 68 3e-10 UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomus... 65 2e-09 UniRef50_A4RYN2 Cluster: Predicted protein; n=1; Ostreococcus lu... 64 3e-09 UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1; ... 63 7e-09 UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|R... 62 1e-08 UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2; cel... 62 2e-08 UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Gi... 62 2e-08 UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Gi... 60 7e-08 UniRef50_UPI0000E4866E Cluster: PREDICTED: similar to oxidized-L... 59 2e-07 UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia... 59 2e-07 UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like prote... 59 2e-07 UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;... 58 2e-07 UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA,... 56 8e-07 UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1; ... 56 8e-07 UniRef50_O02470 Cluster: Cysteine proteinase; n=2; Chromadorea|R... 56 1e-06 UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcop... 54 6e-06 UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35... 54 6e-06 UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lambl... 53 1e-05 UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - ... 52 1e-05 UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin ... 52 2e-05 UniRef50_UPI0000E815AE Cluster: PREDICTED: similar to glucocorti... 52 2e-05 UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p; ... 51 3e-05 UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=... 51 3e-05 UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru... 51 4e-05 UniRef50_Q9GZM7-2 Cluster: Isoform 2 of Q9GZM7 ; n=1; Homo sapie... 50 1e-04 UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadida... 50 1e-04 UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-li... 50 1e-04 UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio... 49 1e-04 UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy... 49 1e-04 UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr... 48 2e-04 UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy... 48 2e-04 UniRef50_Q58HK5 Cluster: Hc58; n=1; Haemonchus contortus|Rep: Hc... 48 4e-04 UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr... 48 4e-04 UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh... 48 4e-04 UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ... 47 5e-04 UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestin... 47 5e-04 UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Try... 47 5e-04 UniRef50_Q6XM39 Cluster: FirrV-1-A48 precursor; n=1; Feldmannia ... 47 7e-04 UniRef50_P81494 Cluster: Cathepsin B; n=2; Phasianidae|Rep: Cath... 47 7e-04 UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ... 46 0.001 UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb... 46 0.001 UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease ... 46 0.002 UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re... 46 0.002 UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big... 46 0.002 UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: ... 46 0.002 UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory... 46 0.002 UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease ... 45 0.002 UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted... 45 0.002 UniRef50_Q23H32 Cluster: Papain family cysteine protease contain... 45 0.002 UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cy... 45 0.002 UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali... 45 0.002 UniRef50_Q9UJW2 Cluster: Tubulointerstitial nephritis antigen; n... 45 0.002 UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin ... 45 0.003 UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag... 45 0.003 UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ... 44 0.004 UniRef50_Q7QVE9 Cluster: GLP_542_3431_1206; n=1; Giardia lamblia... 44 0.004 UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ... 44 0.005 UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ... 44 0.005 UniRef50_Q23TW3 Cluster: Papain family cysteine protease contain... 44 0.005 UniRef50_Q23RT7 Cluster: Papain family cysteine protease contain... 44 0.005 UniRef50_A0CYS7 Cluster: Chromosome undetermined scaffold_31, wh... 44 0.005 UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ... 44 0.005 UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto... 44 0.005 UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt... 44 0.006 UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re... 44 0.006 UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa... 44 0.006 UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain... 44 0.006 UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ... 43 0.008 UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1... 43 0.008 UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; ... 43 0.008 UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n... 43 0.008 UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen... 43 0.008 UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ... 43 0.008 UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ... 43 0.011 UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n... 43 0.011 UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1... 43 0.011 UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n... 43 0.011 UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foet... 43 0.011 UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain... 43 0.011 UniRef50_Q231X3 Cluster: Papain family cysteine protease contain... 43 0.011 UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like cy... 43 0.011 UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, wh... 43 0.011 UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh... 43 0.011 UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, w... 43 0.011 UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ... 43 0.011 UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr... 43 0.011 UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;... 42 0.015 UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy... 42 0.015 UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi... 42 0.015 UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium... 42 0.015 UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n... 42 0.019 UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis... 42 0.019 UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel... 42 0.019 UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ... 42 0.026 UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precurs... 42 0.026 UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e... 42 0.026 UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ... 41 0.034 UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;... 41 0.034 UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;... 41 0.034 UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain... 41 0.034 UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-li... 41 0.034 UniRef50_A2E6N1 Cluster: Clan CA, family C1, cathepsin B-like cy... 41 0.034 UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ... 41 0.034 UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ... 41 0.034 UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; The... 41 0.034 UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; The... 41 0.034 UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Pl... 41 0.034 UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa... 41 0.045 UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; ... 41 0.045 UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba hi... 41 0.045 UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ... 41 0.045 UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=... 41 0.045 UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R... 41 0.045 UniRef50_UPI00006CB77B Cluster: Papain family cysteine protease ... 40 0.059 UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio... 40 0.059 UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3; Theil... 40 0.059 UniRef50_Q22W19 Cluster: Papain family cysteine protease contain... 40 0.059 UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:... 40 0.059 UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precu... 40 0.059 UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi... 40 0.059 UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG... 40 0.079 UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa... 40 0.079 UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae... 40 0.079 UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox... 40 0.079 UniRef50_O16454 Cluster: Temporarily assigned gene name protein ... 40 0.079 UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cy... 40 0.079 UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ... 40 0.10 UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine p... 40 0.10 UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep... 40 0.10 UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi... 40 0.10 UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n... 40 0.10 UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4; Caenorhabdi... 40 0.10 UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H... 40 0.10 UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like cy... 40 0.10 UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, wh... 40 0.10 UniRef50_Q2FLC7 Cluster: Periplasmic copper-binding precursor; n... 40 0.10 UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryo... 40 0.10 UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ... 39 0.14 UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ... 39 0.14 UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G... 39 0.14 UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lambl... 39 0.14 UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain... 39 0.14 UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain... 39 0.14 UniRef50_Q22DX2 Cluster: Papain family cysteine protease contain... 39 0.14 UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like cy... 39 0.14 UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R... 39 0.14 UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; M... 39 0.14 UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ... 39 0.18 UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 p... 39 0.18 UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca... 39 0.18 UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No... 39 0.18 UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|... 39 0.18 UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhip... 39 0.18 UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip... 39 0.18 UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; ... 39 0.18 UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus... 39 0.18 UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Re... 39 0.18 UniRef50_A5KBM2 Cluster: Serine-repeat antigen; n=3; Plasmodium|... 39 0.18 UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D... 39 0.18 UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18... 39 0.18 UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin ... 38 0.24 UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome s... 38 0.24 UniRef50_A6GBK8 Cluster: Peptidase C1A, papain; n=1; Plesiocysti... 38 0.24 UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa... 38 0.24 UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n... 38 0.24 UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ... 38 0.24 UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy... 38 0.24 UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ... 38 0.24 UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia... 38 0.24 UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j... 38 0.24 UniRef50_Q24E33 Cluster: Papain family cysteine protease contain... 38 0.24 UniRef50_Q23FS6 Cluster: Papain family cysteine protease contain... 38 0.24 UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain... 38 0.24 UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathep... 38 0.24 UniRef50_O96163 Cluster: Cysteine protease, putative; n=5; Plasm... 38 0.24 UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er... 38 0.24 UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Pl... 38 0.24 UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto... 38 0.24 UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|... 38 0.24 UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2... 38 0.24 UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ... 38 0.32 UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; ... 38 0.32 UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum... 38 0.32 UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein... 38 0.32 UniRef50_Q24HI2 Cluster: Papain family cysteine protease contain... 38 0.32 UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy... 38 0.32 UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, who... 38 0.32 UniRef50_A0CGA3 Cluster: Chromosome undetermined scaffold_179, w... 38 0.32 UniRef50_Q649T1 Cluster: Cathepsin C; n=1; uncultured archaeon G... 38 0.32 UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ... 38 0.42 UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat... 38 0.42 UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; R... 38 0.42 UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestiv... 38 0.42 UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph... 38 0.42 UniRef50_Q0JP65 Cluster: Os01g0240900 protein; n=3; Oryza sativa... 38 0.42 UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Pir... 38 0.42 UniRef50_Q26015 Cluster: Serine rich protein homologue; n=4; Pla... 38 0.42 UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, wh... 38 0.42 UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; ... 38 0.42 UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ... 38 0.42 UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov... 38 0.42 UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:... 38 0.42 UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ... 38 0.42 UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h... 37 0.55 UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica... 37 0.55 UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil... 37 0.55 UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lambli... 37 0.55 UniRef50_Q7QSU1 Cluster: GLP_127_20145_14275; n=1; Giardia lambl... 37 0.55 UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcop... 37 0.55 UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain... 37 0.55 UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain ... 37 0.55 UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v... 37 0.55 UniRef50_A0D5R4 Cluster: Chromosome undetermined scaffold_39, wh... 37 0.55 UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, wh... 37 0.55 UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; ... 37 0.55 UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M... 37 0.55 UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ... 37 0.73 UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ... 37 0.73 UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ... 37 0.73 UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3; ... 37 0.73 UniRef50_Q9XZI2 Cluster: Cathepsin Z1 preproprotein; n=3; Bilate... 37 0.73 UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Pl... 37 0.73 UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcopt... 37 0.73 UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis... 37 0.73 UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain... 37 0.73 UniRef50_Q23H15 Cluster: Papain family cysteine protease contain... 37 0.73 UniRef50_Q23H06 Cluster: Papain family cysteine protease contain... 37 0.73 UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1; ... 37 0.73 UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w... 37 0.73 UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|... 37 0.73 UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C... 37 0.73 UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata... 37 0.73 UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3.... 37 0.73 UniRef50_A0LES1 Cluster: Peptidase C1A, papain precursor; n=2; S... 36 0.97 UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal... 36 0.97 UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster... 36 0.97 UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-... 36 0.97 UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca... 36 0.97 UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei... 36 0.97 UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intest... 36 0.97 UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi... 36 0.97 UniRef50_Q4YCM9 Cluster: Cysteine protease, putative; n=5; Plasm... 36 0.97 UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n... 36 0.97 UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi... 36 0.97 UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat... 36 0.97 UniRef50_A5KBM1 Cluster: Serine-repeat antigen; n=1; Plasmodium ... 36 0.97 UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ... 36 0.97 UniRef50_Q3A8I3 Cluster: Putative serine protease; n=1; Pelobact... 36 1.3 UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t... 36 1.3 UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ... 36 1.3 UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus ... 36 1.3 UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar ... 36 1.3 UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-... 36 1.3 UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lambli... 36 1.3 UniRef50_Q23H10 Cluster: Papain family cysteine protease contain... 36 1.3 UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; ... 36 1.3 UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheir... 36 1.3 UniRef50_A4VE98 Cluster: Cathepsin z; n=3; Tetrahymena thermophi... 36 1.3 UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic... 36 1.3 UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:... 36 1.3 UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma... 36 1.3 UniRef50_UPI0001509FF9 Cluster: Papain family cysteine protease ... 36 1.7 UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease ... 36 1.7 UniRef50_Q8ZRX7 Cluster: Putative viral protein; n=1; Salmonella... 36 1.7 UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryz... 36 1.7 UniRef50_Q7R0G3 Cluster: GLP_29_33036_32140; n=1; Giardia lambli... 36 1.7 UniRef50_Q3L7L7 Cluster: Sar s 1 allergen Yv5020C01; n=1; Sarcop... 36 1.7 UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-li... 36 1.7 UniRef50_A5UP12 Cluster: Adhesin-like protein; n=1; Methanobrevi... 36 1.7 UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,... 35 2.2 UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ... 35 2.2 UniRef50_Q8QNJ8 Cluster: EsV-1-75; n=1; Ectocarpus siliculosus v... 35 2.2 UniRef50_Q60EQ9 Cluster: Putative uncharacterized protein OJ1280... 35 2.2 UniRef50_Q01E11 Cluster: Cysteine proteinase Cathepsin F; n=3; O... 35 2.2 UniRef50_Q239L8 Cluster: Papain family cysteine protease contain... 35 2.2 UniRef50_Q235G6 Cluster: Papain family cysteine protease contain... 35 2.2 UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C... 35 2.2 UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve... 35 2.2 UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, w... 35 2.2 UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc... 35 2.2 UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba hi... 35 3.0 UniRef50_Q6A1H9 Cluster: Cathepsin X/O; n=1; Suberites domuncula... 35 3.0 UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcop... 35 3.0 UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia... 35 3.0 UniRef50_Q26153 Cluster: V-SERA 4; n=1; Plasmodium vivax|Rep: V-... 35 3.0 UniRef50_Q22A69 Cluster: Papain family cysteine protease contain... 35 3.0 UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina... 35 3.0 UniRef50_A7APS9 Cluster: Papain family cysteine protease contain... 35 3.0 UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs... 35 3.0 UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;... 35 3.0 UniRef50_UPI000155D183 Cluster: PREDICTED: similar to Cathepsin ... 34 3.9 UniRef50_A0IYD1 Cluster: Putative outer membrane adhesin like pr... 34 3.9 UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ... 34 3.9 UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus... 34 3.9 UniRef50_Q7QQ83 Cluster: GLP_42_16392_14707; n=1; Giardia lambli... 34 3.9 UniRef50_Q23EG5 Cluster: Papain family cysteine protease contain... 34 3.9 UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve... 34 3.9 UniRef50_A2DPZ6 Cluster: Clan CA, family C1, cathepsin B-like cy... 34 3.9 UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, wh... 34 3.9 UniRef50_A0LQG9 Cluster: Peptidase C1A, papain; n=1; Syntrophoba... 34 5.2 UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|... 34 5.2 UniRef50_Q01FL5 Cluster: Cysteine protease-5; n=3; Ostreococcus|... 34 5.2 UniRef50_Q015J8 Cluster: Cathepsin; n=2; Ostreococcus|Rep: Cathe... 34 5.2 UniRef50_Q93512 Cluster: Putative uncharacterized protein; n=2; ... 34 5.2 UniRef50_Q8N0R5 Cluster: Cycle like factor BmCyc b; n=4; Obtecto... 34 5.2 UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schist... 34 5.2 UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dv... 34 5.2 UniRef50_Q3YJ15 Cluster: Putative galactosyl transferase; n=1; H... 33 6.8 UniRef50_Q5YER6 Cluster: Cathepsin Z; n=1; Bigelowiella natans|R... 33 6.8 UniRef50_Q01FU9 Cluster: Cathepsin Z; n=2; Ostreococcus|Rep: Cat... 33 6.8 UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2; ... 33 6.8 UniRef50_Q9XXQ7 Cluster: Putative uncharacterized protein; n=2; ... 33 6.8 UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ... 33 6.8 UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|... 33 6.8 UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep... 33 6.8 UniRef50_Q54MB6 Cluster: Putative uncharacterized protein; n=1; ... 33 6.8 UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;... 33 6.8 UniRef50_Q3L7L6 Cluster: Sar s 1 allergen Yv5032C08; n=1; Sarcop... 33 6.8 UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl... 33 6.8 UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain... 33 6.8 UniRef50_A2ERV3 Cluster: Putative uncharacterized protein; n=1; ... 33 6.8 UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like cy... 33 6.8 UniRef50_A0EAG2 Cluster: Chromosome undetermined scaffold_86, wh... 33 6.8 UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [C... 33 6.8 UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ... 33 9.0 UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome s... 33 9.0 UniRef50_Q89Z69 Cluster: Putative uncharacterized protein; n=1; ... 33 9.0 UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty... 33 9.0 UniRef50_A3B2E1 Cluster: Putative uncharacterized protein; n=1; ... 33 9.0 UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n... 33 9.0 UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckei... 33 9.0 UniRef50_Q54JE9 Cluster: Putative uncharacterized protein; n=1; ... 33 9.0 UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theil... 33 9.0 UniRef50_Q248G1 Cluster: Papain family cysteine protease contain... 33 9.0 UniRef50_Q1AMF3 Cluster: Cathepsin C1; n=1; Toxoplasma gondii|Re... 33 9.0 UniRef50_A7S9N1 Cluster: Predicted protein; n=1; Nematostella ve... 33 9.0 >UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpwnx02 - Periplaneta americana (American cockroach) Length = 343 Score = 131 bits (316), Expect = 2e-29 Identities = 54/79 (68%), Positives = 64/79 (81%) Frame = +2 Query: 272 LPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 451 LP K+ D+ +PE FDPR++WP+CPTL E+RDQGSCGSCWAFGAVEAM+DRVC +S Sbjct: 81 LPEKSME-DIDIEIPEEFDPREQWPECPTLKEIRDQGSCGSCWAFGAVEAMSDRVCIHSK 139 Query: 452 GTKHFHFSAEDLLSCCPIC 508 G HFHFSAEDLL+CC C Sbjct: 140 GKTHFHFSAEDLLTCCSSC 158 Score = 128 bits (309), Expect = 2e-28 Identities = 51/85 (60%), Positives = 61/85 (71%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693 GC+GG P AW+YW G+VSGGSYNS QGC+PY I PCEHHV G R PC G+ TP+C Sbjct: 161 GCNGGEPGAAWDYWVSTGIVSGGSYNSHQGCQPYAIEPCEHHVNGTRKPC-GEGDTPRCV 219 Query: 694 KKCESGYDVNYKQDKQYGKHVYTCP 768 K+CE GYDV Y +D+ +GK Y P Sbjct: 220 KRCEEGYDVPYGKDRHFGKSAYAVP 244 Score = 46.4 bits (105), Expect = 0.001 Identities = 21/41 (51%), Positives = 26/41 (63%) Frame = +3 Query: 126 LPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGV 248 L PLSD+FI+ IN +WKA RNF D +KK+MGV Sbjct: 32 LVDPLSDDFIDHINSLNTTWKAHRNFGNDIPLREIKKLMGV 72 >UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Cathepsin B1) (APP secretase) (APPS) [Contains: Cathepsin B light chain; Cathepsin B heavy chain]; n=85; Eukaryota|Rep: Cathepsin B precursor (EC 3.4.22.1) (Cathepsin B1) (APP secretase) (APPS) [Contains: Cathepsin B light chain; Cathepsin B heavy chain] - Homo sapiens (Human) Length = 339 Score = 127 bits (306), Expect = 4e-28 Identities = 51/83 (61%), Positives = 59/83 (71%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693 GC+GG P AW +W GLVSGG Y S GCRPY IPPCEHHV G+R PC+G+ TPKC+ Sbjct: 149 GCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCS 208 Query: 694 KKCESGYDVNYKQDKQYGKHVYT 762 K CE GY YKQDK YG + Y+ Sbjct: 209 KICEPGYSPTYKQDKHYGYNSYS 231 Score = 104 bits (249), Expect = 3e-21 Identities = 40/63 (63%), Positives = 51/63 (80%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 LP +FD R++WP CPT+ E+RDQGSCGSCWAFGAVEA++DR+C ++N SAEDLL Sbjct: 80 LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139 Query: 491 SCC 499 +CC Sbjct: 140 TCC 142 Score = 46.0 bits (104), Expect = 0.001 Identities = 23/57 (40%), Positives = 36/57 (63%), Gaps = 4/57 (7%) Frame = +3 Query: 87 YVTLVC--VLAAAKDLP--HPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMG 245 + +L C VLA A+ P HPLSDE +N +N + +W+AG NF + ++LK++ G Sbjct: 5 WASLCCLLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLCG 60 Score = 42.7 bits (96), Expect = 0.011 Identities = 18/27 (66%), Positives = 22/27 (81%) Frame = +3 Query: 762 LSGDEDHIRAELFKNGPVEGAFTVYSD 842 +S E I AE++KNGPVEGAF+VYSD Sbjct: 232 VSNSEKDIMAEIYKNGPVEGAFSVYSD 258 >UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin B; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin B - Strongylocentrotus purpuratus Length = 346 Score = 118 bits (284), Expect = 2e-25 Identities = 48/76 (63%), Positives = 56/76 (73%) Frame = +2 Query: 281 KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460 K N I LPENFD R+ WP+CPT+ EVRDQGSCGSCWAFGAVEA++DR+C S G Sbjct: 68 KLENQTRIKDLPENFDARENWPNCPTIKEVRDQGSCGSCWAFGAVEAISDRICIKSKGQT 127 Query: 461 HFHFSAEDLLSCCPIC 508 H SAEDL++CC C Sbjct: 128 QVHISAEDLMTCCKTC 143 Score = 114 bits (274), Expect = 3e-24 Identities = 44/81 (54%), Positives = 57/81 (70%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693 GC+GG P AWEY+K G+V+GG +NSSQGC+PY+I C+HHV G + PC G+ TP+C Sbjct: 146 GCNGGFPGSAWEYYKDTGIVTGGQWNSSQGCQPYQIKSCDHHVNGTKGPCQGEGPTPECK 205 Query: 694 KKCESGYDVNYKQDKQYGKHV 756 KCE+ Y Y+QDK Y V Sbjct: 206 HKCEASYSTPYEQDKHYALSV 226 >UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n=4; Tenebrionidae|Rep: Putative cathepsin B-like proteinase - Tenebrio molitor (Yellow mealworm) Length = 321 Score = 111 bits (268), Expect = 2e-23 Identities = 43/78 (55%), Positives = 59/78 (75%) Frame = +2 Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454 P+ H F+ +PE+FD R KWP+C +LN +RDQG+CGSCWAF ++E+M+DR+C +S+G Sbjct: 72 PVLVHTFNA-RDVPESFDARTKWPNCDSLNRIRDQGACGSCWAFASIESMSDRICIHSSG 130 Query: 455 TKHFHFSAEDLLSCCPIC 508 + F FS EDLLSCC C Sbjct: 131 SAQFMFSPEDLLSCCTSC 148 Score = 64.5 bits (150), Expect = 3e-09 Identities = 32/81 (39%), Positives = 44/81 (54%) Frame = +1 Query: 517 CSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCTK 696 C GG A +++ + G+VSGG NS++GCRPY + H G +TP CTK Sbjct: 151 CGGGYMMSALDFYINEGIVSGGDVNSNEGCRPY---TADAHDQG---------QTPACTK 198 Query: 697 KCESGYDVNYKQDKQYGKHVY 759 C +GY +Y DK YG + Y Sbjct: 199 SCRNGYSTSYSADKHYGSNDY 219 Score = 54.0 bits (124), Expect = 5e-06 Identities = 30/66 (45%), Positives = 44/66 (66%) Frame = +3 Query: 63 KMFISRAAYVTLVCVLAAAKDLPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIM 242 K+F+S +V LV VL+A+ LS EFI++IN Q+SW AGRNFP +T+ +L K+ Sbjct: 2 KIFLS---FVVLVAVLSASLAEIDVLSSEFIDSINRIQSSWVAGRNFPENTTNEYLYKLN 58 Query: 243 GVIEMN 260 G I ++ Sbjct: 59 GFIGLH 64 >UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome 5 SCAF15026, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 351 Score = 108 bits (259), Expect = 2e-22 Identities = 43/66 (65%), Positives = 52/66 (78%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 LP+ FD R++WP+CPTL E+RDQGSCGSCWAFGA EAM+DRVC +SN SA+DLL Sbjct: 79 LPKEFDSREQWPNCPTLKEIRDQGSCGSCWAFGASEAMSDRVCIHSNAKVSVELSAQDLL 138 Query: 491 SCCPIC 508 +CC C Sbjct: 139 TCCNSC 144 Score = 100 bits (240), Expect = 4e-20 Identities = 50/106 (47%), Positives = 62/106 (58%), Gaps = 22/106 (20%) Frame = +1 Query: 511 LGCSGGMPRLAWEYWKHFGLVSGGSYNS---------------------SQGCRPYEIPP 627 +GC+GG P AW +W GLVSGG Y+S S GCRPY IPP Sbjct: 146 MGCNGGYPSSAWNFWVSDGLVSGGLYDSHIGRIQVSLCVLLLAVDRDFVSPGCRPYTIPP 205 Query: 628 CEHHVPGNRMPCSGD-TKTPKCTKKCESGYDVNYKQDKQYGKHVYT 762 CEHHV G+R CSG+ TP+C +CE+GY +YKQDK +GK Y+ Sbjct: 206 CEHHVNGSRPSCSGEGGDTPECIFRCEAGYSPSYKQDKHFGKTSYS 251 Score = 48.4 bits (110), Expect = 2e-04 Identities = 20/32 (62%), Positives = 25/32 (78%) Frame = +3 Query: 747 KTCIYLSGDEDHIRAELFKNGPVEGAFTVYSD 842 KT +S +ED I+ E++KNGPVEGAFTVY D Sbjct: 247 KTSYSVSSEEDEIKQEIYKNGPVEGAFTVYED 278 Score = 41.9 bits (94), Expect = 0.019 Identities = 20/59 (33%), Positives = 35/59 (59%), Gaps = 2/59 (3%) Frame = +3 Query: 81 AAYVTLVCVLAAAKDLPH--PLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVI 251 AA++ L +++ PH PLS E +N IN ++W AG NF + ++++KK+ G + Sbjct: 4 AAFLFLAAAWSSSLARPHLKPLSSEMVNYINKLNSTWTAGHNF-HNVDYSYVKKLCGTL 61 >UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=1; Biomphalaria glabrata|Rep: Cathepsin B preproprotein precursor - Biomphalaria glabrata (Bloodfluke planorb) Length = 333 Score = 107 bits (257), Expect = 3e-22 Identities = 48/101 (47%), Positives = 62/101 (61%) Frame = +2 Query: 206 ARHIVRAS*ENNGSYRDEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGS 385 AR ++ + N +Y H ++ N LP+NFDPR KWPDC +LNE+RDQ + Sbjct: 57 ARALLGVNMAENKAYNRIHLKYKQVQPRN-----DLPDNFDPRTKWPDCASLNEIRDQAN 111 Query: 386 CGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPIC 508 CGSCWAFG+ EAMTDR+C G + H SAED+ CC C Sbjct: 112 CGSCWAFGSAEAMTDRICIAGKG--NIHISAEDINDCCKSC 150 Score = 104 bits (250), Expect = 2e-21 Identities = 40/83 (48%), Positives = 52/83 (62%) Frame = +1 Query: 511 LGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKC 690 +GC+GG P AWE++ G+VSGG Y +++GC PY +P C+HH G PC TPKC Sbjct: 152 MGCNGGYPAAAWEWYVDTGVVSGGQYGTNEGCMPYSLPHCDHHTTGKYQPCPAVVPTPKC 211 Query: 691 TKKCESGYDVNYKQDKQYGKHVY 759 KKC +GY +Y DK GK Y Sbjct: 212 EKKCLTGYPKSYSNDKTRGKKSY 234 >UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep: Cathepsin B - Apriona germari Length = 324 Score = 105 bits (251), Expect = 2e-21 Identities = 44/89 (49%), Positives = 63/89 (70%) Frame = +2 Query: 242 GSYRDEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEA 421 G RD + TLP+ H + I+ +P++FD R++WP C ++ +RD+G+CGSCWAF AVE Sbjct: 64 GINRDPN-VTLPVVFH--EAISGIPDSFDAREQWPFCESIRTIRDEGACGSCWAFAAVEV 120 Query: 422 MTDRVCTYSNGTKHFHFSAEDLLSCCPIC 508 M+DR+C S G K F FSAE+++SCC C Sbjct: 121 MSDRLCLASEGRKKFIFSAEEVVSCCTAC 149 Score = 53.6 bits (123), Expect = 6e-06 Identities = 28/82 (34%), Positives = 41/82 (50%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693 GC GG ++YW G+ SGG Y S GC+PY SG+ TP+C Sbjct: 152 GCRGGFLNEPYKYWVTNGIPSGGDYGSKLGCKPY------------TAAVSGE--TPQCQ 197 Query: 694 KKCESGYDVNYKQDKQYGKHVY 759 K C SGY+ ++++D ++ Y Sbjct: 198 KACVSGYEKSWEKDLRHATSAY 219 >UniRef50_A1XG92 Cluster: Putative cathepsin B-like like proteinase; n=1; Tenebrio molitor|Rep: Putative cathepsin B-like like proteinase - Tenebrio molitor (Yellow mealworm) Length = 301 Score = 104 bits (249), Expect = 3e-21 Identities = 45/80 (56%), Positives = 59/80 (73%), Gaps = 1/80 (1%) Frame = +2 Query: 272 LPIKTHNFDLIASLPENFDPRDKWPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRVCTYS 448 LP+KTH +L A +PE+FD R+ WP+C ++ E+RDQ SCGSCWAFGAVEAM+DR+C +S Sbjct: 72 LPVKTHAVNLDA-IPESFDAREAWPECTSIIGEIRDQASCGSCWAFGAVEAMSDRICIHS 130 Query: 449 NGTKHFHFSAEDLLSCCPIC 508 + + SAEDL CC C Sbjct: 131 DASVKVRISAEDLNDCCYDC 150 Score = 104 bits (249), Expect = 3e-21 Identities = 41/89 (46%), Positives = 56/89 (62%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693 GC+GG P LAW YW G+V+GG Y +GC+ Y I PC+HHV GN PC +TP C Sbjct: 153 GCNGGWPDLAWSYWSSTGIVTGGLYGVDEGCKAYSIKPCDHHVDGNLGPCGDIQRTPACK 212 Query: 694 KKCESGYDVNYKQDKQYGKHVYTCPETKT 780 K C+S D+ YK D + G Y+ P++++ Sbjct: 213 KSCDSTSDLEYKSDLRRGS-AYSIPKSES 240 Score = 61.3 bits (142), Expect = 3e-08 Identities = 24/40 (60%), Positives = 33/40 (82%) Frame = +3 Query: 132 HPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVI 251 HPLSDEFIN IN KQ +WKAGRNF +T +H+++++GV+ Sbjct: 24 HPLSDEFINEINSKQTTWKAGRNFDVNTPISHVRRLLGVL 63 >UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|Rep: Cathepsin B5 - Clonorchis sinensis Length = 343 Score = 103 bits (248), Expect = 4e-21 Identities = 48/85 (56%), Positives = 59/85 (69%), Gaps = 1/85 (1%) Frame = +2 Query: 257 EHFATLPIKTHN-FDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 433 E A P H+ FD + LP+NFD R WP C +++E+RDQ SCGSCWAFGAVEAM+DR Sbjct: 68 EQKAQRPTLRHDGFDNMR-LPKNFDARKTWPHCSSISEIRDQSSCGSCWAFGAVEAMSDR 126 Query: 434 VCTYSNGTKHFHFSAEDLLSCCPIC 508 +C +SNG + SA DLLSCC C Sbjct: 127 LCIHSNGAFNKSLSAVDLLSCCKDC 151 Score = 90.6 bits (215), Expect = 4e-17 Identities = 37/76 (48%), Positives = 49/76 (64%), Gaps = 1/76 (1%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDT-KTPKC 690 GC GG P +AW+YWK G+V+GGS GCR Y P CEHHV G+ PC + TP+C Sbjct: 154 GCRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEHHVQGHYPPCPRELYPTPEC 213 Query: 691 TKKCESGYDVNYKQDK 738 ++C++ DV Y +DK Sbjct: 214 VQQCDTP-DVGYLEDK 228 >UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 precursor; n=11; Bilateria|Rep: Cathepsin B-like cysteine proteinase 6 precursor - Caenorhabditis elegans Length = 379 Score = 103 bits (248), Expect = 4e-21 Identities = 44/76 (57%), Positives = 54/76 (71%) Frame = +2 Query: 281 KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460 KT + DL +PE+FD RD WP C ++ +RDQ SCGSCWAFGAVEAM+DR+C S+G Sbjct: 97 KTKDLDL--DIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGEL 154 Query: 461 HFHFSAEDLLSCCPIC 508 SA+DLLSCC C Sbjct: 155 QVTLSADDLLSCCKSC 170 Score = 94.7 bits (225), Expect = 3e-18 Identities = 41/85 (48%), Positives = 50/85 (58%), Gaps = 3/85 (3%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRM-PCSGDT-KTPK 687 GC+GG P AW YW G+V+G +Y ++ GC+PY PPCEHH PC D TPK Sbjct: 173 GCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPK 232 Query: 688 CTKKCESGY-DVNYKQDKQYGKHVY 759 C KKC S Y D Y +DK +G Y Sbjct: 233 CEKKCVSDYTDKTYSEDKFFGASAY 257 >UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep: Cathepsin b - Aedes aegypti (Yellowfever mosquito) Length = 386 Score = 100 bits (239), Expect = 5e-20 Identities = 40/66 (60%), Positives = 47/66 (71%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 LP+ FD R+KWP+CP+L E+RDQG CGSCWA A AMTDR C S G + F F + DLL Sbjct: 125 LPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLL 184 Query: 491 SCCPIC 508 SCC C Sbjct: 185 SCCHSC 190 Score = 81.0 bits (191), Expect = 3e-14 Identities = 40/86 (46%), Positives = 49/86 (56%), Gaps = 1/86 (1%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693 GC GG AW++W GL SGG NS QGC PY I C +PG D TPKC+ Sbjct: 193 GCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGEC--RIPGE------DEDTPKCS 244 Query: 694 KKCESGYDV-NYKQDKQYGKHVYTCP 768 KC SGY+V + QD+ YG+ Y+ P Sbjct: 245 NKCRSGYNVTDVWQDRHYGRVAYSLP 270 >UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase precursor; n=28; Bilateria|Rep: Cathepsin B-like cysteine proteinase precursor - Schistosoma japonicum (Blood fluke) Length = 342 Score = 100 bits (239), Expect = 5e-20 Identities = 42/78 (53%), Positives = 52/78 (66%) Frame = +2 Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454 P H+ DL +P FD R KWP C +++++RDQ CGSCWAFGAVEAMTDR+C S G Sbjct: 79 PTVDHH-DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGG 137 Query: 455 TKHFHFSAEDLLSCCPIC 508 + SA DL+SCC C Sbjct: 138 GQSAELSALDLISCCKDC 155 Score = 99.1 bits (236), Expect = 1e-19 Identities = 40/95 (42%), Positives = 52/95 (54%), Gaps = 1/95 (1%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDT-KTPKC 690 GC GG P +AW+YW G+V+GGS + GC+PY P CEHH G C KTP+C Sbjct: 158 GCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQC 217 Query: 691 TKKCESGYDVNYKQDKQYGKHVYTCPETKTTSARN 795 + C+ GY Y+QDK YG Y + R+ Sbjct: 218 KQTCQKGYKTPYEQDKHYGDESYNVQNNEKVIQRD 252 >UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA; n=1; Tribolium castaneum|Rep: PREDICTED: similar to CG10992-PA - Tribolium castaneum Length = 325 Score = 98.7 bits (235), Expect = 2e-19 Identities = 42/90 (46%), Positives = 59/90 (65%), Gaps = 1/90 (1%) Frame = +2 Query: 242 GSYRDEHFATLPIKTHNFDLIASLPENFDPRDKWPDCP-TLNEVRDQGSCGSCWAFGAVE 418 G + D ++ + K H I S+PE+FD R+KWP+C + ++R+QG+CGSCWAF + E Sbjct: 54 GLHPDPNYK-IQTKQHKISRIISIPESFDAREKWPECKDVIGKIRNQGNCGSCWAFASTE 112 Query: 419 AMTDRVCTYSNGTKHFHFSAEDLLSCCPIC 508 MTDR+C S G F FS E+LL+CC C Sbjct: 113 VMTDRLCISSKGKIKFVFSPENLLTCCKDC 142 Score = 52.4 bits (120), Expect = 1e-05 Identities = 19/34 (55%), Positives = 26/34 (76%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPY 615 GC GG + AW+Y+ + G+ SGG YNSS+GC+PY Sbjct: 145 GCKGGYIKNAWDYYINEGIASGGDYNSSEGCQPY 178 >UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n=8; Strongylida|Rep: Cathepsin B-like cysteine protease 2 - Parelaphostrongylus tenuis Length = 344 Score = 97.5 bits (232), Expect = 4e-19 Identities = 37/66 (56%), Positives = 50/66 (75%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 +P++FD R +WP CP+++ +RDQ CGSCWAFG+ EAM+DRVC S+G K SA+D+L Sbjct: 94 IPDSFDARVQWPHCPSISYIRDQSQCGSCWAFGSAEAMSDRVCIASHGNKTVELSADDIL 153 Query: 491 SCCPIC 508 SCC C Sbjct: 154 SCCYDC 159 Score = 93.9 bits (223), Expect = 5e-18 Identities = 40/90 (44%), Positives = 51/90 (56%), Gaps = 1/90 (1%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRM-PCSGDTKTPKC 690 GC GG P AWEY+ G+V+GG Y + CRPYEIPPC HH C+ TP C Sbjct: 162 GCDGGYPISAWEYFVETGVVTGGLYGTKDSCRPYEIPPCGHHRNETFYGNCTQIADTPDC 221 Query: 691 TKKCESGYDVNYKQDKQYGKHVYTCPETKT 780 C++GY ++Y DK +GK YT + T Sbjct: 222 VTTCQAGYPISYDDDKTFGKDSYTIESSVT 251 >UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3 precursor; n=4; Caenorhabditis|Rep: Cathepsin B-like cysteine proteinase 3 precursor - Caenorhabditis elegans Length = 370 Score = 97.5 bits (232), Expect = 4e-19 Identities = 38/63 (60%), Positives = 48/63 (76%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 LP+ FD R+KWPDC T+ +R+Q +CGSCWAFGA E ++DRVC SNGT+ S ED+L Sbjct: 92 LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 151 Query: 491 SCC 499 SCC Sbjct: 152 SCC 154 Score = 65.3 bits (152), Expect = 2e-09 Identities = 33/92 (35%), Positives = 42/92 (45%), Gaps = 1/92 (1%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693 GC GG A +W G V+GG Y GC PY PC + P ++ TP C Sbjct: 161 GCKGGYSIEALRFWASSGAVTGGDY-GGHGCMPYSFAPCTKNCP--------ESTTPSCK 211 Query: 694 KKCESGYDV-NYKQDKQYGKHVYTCPETKTTS 786 C+S Y YK+DK YG Y TK+ + Sbjct: 212 TTCQSSYKTEEYKKDKHYGASAYKVTTTKSVT 243 >UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase precursor; n=29; Schistosomatidae|Rep: Cathepsin B-like cysteine proteinase precursor - Schistosoma mansoni (Blood fluke) Length = 340 Score = 97.1 bits (231), Expect = 5e-19 Identities = 41/78 (52%), Positives = 50/78 (64%) Frame = +2 Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454 P HN D +P NFD R KWP C ++ +RDQ CGSCW+FGAVEAM+DR C S G Sbjct: 78 PTVDHN-DWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCWSFGAVEAMSDRSCIQSGG 136 Query: 455 TKHFHFSAEDLLSCCPIC 508 ++ SA DLL+CC C Sbjct: 137 KQNVELSAVDLLTCCESC 154 Score = 86.2 bits (204), Expect = 9e-16 Identities = 36/84 (42%), Positives = 44/84 (52%), Gaps = 1/84 (1%) Frame = +1 Query: 511 LGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDT-KTPK 687 LGC GG+ AW+YW G+V+ S + GC PY P CEHH G PC TP+ Sbjct: 156 LGCEGGILGPAWDYWVKEGIVTASSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYNTPR 215 Query: 688 CTKKCESGYDVNYKQDKQYGKHVY 759 C + C+ Y Y QDK GK Y Sbjct: 216 CKQTCQRKYKTPYTQDKHRGKSSY 239 Score = 34.7 bits (76), Expect = 3.0 Identities = 15/32 (46%), Positives = 20/32 (62%) Frame = +3 Query: 747 KTCIYLSGDEDHIRAELFKNGPVEGAFTVYSD 842 K+ + DE I+ E+ K GPVE +FTVY D Sbjct: 236 KSSYNVKNDEKAIQKEIMKYGPVEASFTVYED 267 >UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=1; Nilaparvata lugens|Rep: Cathepsin B-like protease precursor - Nilaparvata lugens (Brown planthopper) Length = 347 Score = 96.3 bits (229), Expect = 9e-19 Identities = 41/91 (45%), Positives = 52/91 (57%), Gaps = 2/91 (2%) Frame = +1 Query: 502 YL*LGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGD--T 675 Y GC GG P AW + K GLV+GG Y+S GC+PY I PCEHH+ G++ CS Sbjct: 156 YCGFGCEGGFPDAAWVFIKRHGLVTGGDYHSHDGCQPYPIAPCEHHMEGSKPNCSASPTE 215 Query: 676 KTPKCTKKCESGYDVNYKQDKQYGKHVYTCP 768 TP C C G + Y++D+Q GK Y P Sbjct: 216 PTPACETTCTHGSSLAYQKDRQKGKSAYLVP 246 Score = 83.4 bits (197), Expect = 6e-15 Identities = 32/66 (48%), Positives = 42/66 (63%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 +P+ FD R KW C +L E+RDQG+CGSCWA A DR+C SN + H S+ +L+ Sbjct: 92 VPKYFDARKKWKKCKSLREIRDQGNCGSCWAVSVAAAFADRLCIASNAKWNGHISSRELM 151 Query: 491 SCCPIC 508 SCC C Sbjct: 152 SCCSYC 157 Score = 38.7 bits (86), Expect = 0.18 Identities = 19/59 (32%), Positives = 35/59 (59%), Gaps = 1/59 (1%) Frame = +3 Query: 84 AYVTLVCVLAAAKDLPHPLSDEFINTINLKQNS-WKAGRNFPRDTSFAHLKKIMGVIEM 257 A V+ + L ++ +++++I+ IN S WKAG NF DT ++L+ ++GV E+ Sbjct: 10 AVVSAISALPDQENTVREIANKWIDAINNNPKSTWKAGHNFHPDTPMSYLQGLLGVSEL 68 >UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin B-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 331 Score = 96.3 bits (229), Expect = 9e-19 Identities = 38/79 (48%), Positives = 49/79 (62%), Gaps = 1/79 (1%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSG-DTKTPKC 690 GC GG P +AW YW G+ +GG Y S QGC+PY + PCEHH GN++ CS D TP C Sbjct: 148 GCEGGYPTMAWSYWIDTGITTGGLYGSKQGCQPYSLQPCEHHTEGNKVQCSTLDYDTPSC 207 Query: 691 TKKCESGYDVNYKQDKQYG 747 KC+ +NYK + +G Sbjct: 208 KHKCDDS-ALNYKSELTFG 225 Score = 76.2 bits (179), Expect = 1e-12 Identities = 35/78 (44%), Positives = 46/78 (58%), Gaps = 1/78 (1%) Frame = +2 Query: 284 THNFDLIASLPENFDPRDKWPDCP-TLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460 TH+ D+ +P +FD R+ W +C ++ V DQ CGSCWA A AM+DR C S G Sbjct: 72 THSEDI--QVPNSFDARENWKECSDVISTVVDQSDCGSCWAVAAASAMSDRRCIASQGKL 129 Query: 461 HFHFSAEDLLSCCPICDW 514 SAE+LLSCC C + Sbjct: 130 KVPVSAENLLSCCDSCGY 147 Score = 52.4 bits (120), Expect = 1e-05 Identities = 23/58 (39%), Positives = 40/58 (68%), Gaps = 2/58 (3%) Frame = +3 Query: 78 RAAYVT--LVCVLAAAKDLPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMG 245 +AA++ L+ ++ + K P+PLS++FIN IN KQ++W AG+NF + S +K ++G Sbjct: 2 KAAFIITLLLPIVLSYKGSPNPLSNDFINYINSKQSTWVAGKNFDENLSIQEIKNLLG 59 >UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Cathepsin B - Fasciola gigantica (Giant liver fluke) Length = 339 Score = 94.3 bits (224), Expect = 3e-18 Identities = 42/85 (49%), Positives = 52/85 (61%) Frame = +2 Query: 254 DEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 433 +E A P H+ LPE+FD R +WP C T++E+RDQ SCGSCWA A AM+DR Sbjct: 68 EERNALRPTIKHDISK-NDLPESFDARSQWPQCWTISEIRDQASCGSCWATAAASAMSDR 126 Query: 434 VCTYSNGTKHFHFSAEDLLSCCPIC 508 VC +SNG +A D LSCC C Sbjct: 127 VCIHSNGQMRPRLAAADPLSCCTYC 151 Score = 80.6 bits (190), Expect = 5e-14 Identities = 36/105 (34%), Positives = 56/105 (53%), Gaps = 2/105 (1%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMP-CSGDT-KTPK 687 GC GG P AW+YW G+V+GG++ + GC+P+ C+H + C T TP Sbjct: 154 GCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYTYPTPP 213 Query: 688 CTKKCESGYDVNYKQDKQYGKHVYTCPETKTTSARNCSRMVPSKV 822 C + C++GY+ Y+QDK YG Y E ++ + + P +V Sbjct: 214 CARACQTGYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEV 258 >UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gambiae str. PEST|Rep: ENSANGP00000012227 - Anopheles gambiae str. PEST Length = 218 Score = 92.7 bits (220), Expect = 1e-17 Identities = 35/66 (53%), Positives = 48/66 (72%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 +PE+FD R+ WP+C +L +R+QG+CGSCWA A M+DRVC +SNGT + +AEDL+ Sbjct: 1 IPESFDARNHWPNCESLRAIRNQGTCGSCWAVAAASVMSDRVCIHSNGTINVALAAEDLM 60 Query: 491 SCCPIC 508 CC C Sbjct: 61 GCCVDC 66 Score = 36.7 bits (81), Expect(2) = 0.010 Identities = 15/29 (51%), Positives = 22/29 (75%), Gaps = 1/29 (3%) Frame = +1 Query: 514 GCSGG-MPRLAWEYWKHFGLVSGGSYNSS 597 GC+GG + +++YW GLVSGG+YNS+ Sbjct: 69 GCNGGFLDGTSFQYWVDAGLVSGGAYNST 97 Score = 25.4 bits (53), Expect(2) = 0.010 Identities = 9/22 (40%), Positives = 14/22 (63%) Frame = +1 Query: 703 ESGYDVNYKQDKQYGKHVYTCP 768 + G D +Y +DK +GK Y+ P Sbjct: 98 DDGVDRHYSKDKLFGKVAYSVP 119 >UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7; Rhabditida|Rep: Cysteine proteinase 3 - Necator americanus (Human hookworm) Length = 360 Score = 92.7 bits (220), Expect = 1e-17 Identities = 36/77 (46%), Positives = 48/77 (62%) Frame = +2 Query: 278 IKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT 457 +K + D +P +FD RDKWP C ++ +RDQ CGSCWA + E M+DR+C SNGT Sbjct: 79 LKEEDMDFSEEIPVSFDARDKWPKCTSIGFIRDQSHCGSCWAVSSAETMSDRLCVQSNGT 138 Query: 458 KHFHFSAEDLLSCCPIC 508 S D+L+CCP C Sbjct: 139 IKVLLSDTDILACCPNC 155 Score = 77.0 bits (181), Expect = 6e-13 Identities = 35/90 (38%), Positives = 46/90 (51%), Gaps = 1/90 (1%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDT-KTPKC 690 GC GG AWEY+K+ G+ +GG Y + C+PY PC+ G C D+ TPKC Sbjct: 158 GCGGGHTIRAWEYFKNTGVCTGGLYGTKDSCKPYAFYPCKDESYGK---CPKDSFPTPKC 214 Query: 691 TKKCESGYDVNYKQDKQYGKHVYTCPETKT 780 K C+ Y Y DK Y Y P+ +T Sbjct: 215 RKICQYKYSKKYADDKYYANSAYRIPQNET 244 >UniRef50_Q23FP9 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 340 Score = 92.3 bits (219), Expect = 1e-17 Identities = 37/88 (42%), Positives = 48/88 (54%), Gaps = 1/88 (1%) Frame = +1 Query: 502 YL*LGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKT 681 Y +GC GG P AW Y K G+ +GG Y C+PY PPC+HHV G PC T Sbjct: 153 YCGMGCKGGYPSAAWGYMKRQGVSTGGLYGDDTSCKPYIFPPCDHHVTGQYQPCGPIQPT 212 Query: 682 PKCTKKCESGYDVN-YKQDKQYGKHVYT 762 P+C K+C S Y N Y++D + Y+ Sbjct: 213 PQCVKECNSEYTQNTYEKDLHFASQTYS 240 Score = 89.8 bits (213), Expect = 7e-17 Identities = 39/87 (44%), Positives = 54/87 (62%), Gaps = 1/87 (1%) Frame = +2 Query: 242 GSYRDEHFATLPIKTHNFDLIAS-LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVE 418 GS + + LP K + + A +PE FD R++WP+C ++ +RDQ +CGSCWAF A E Sbjct: 64 GSLDEPDWVKLPTKEFDPNANADPIPEFFDAREQWPNCQSIKLIRDQSTCGSCWAFAATE 123 Query: 419 AMTDRVCTYSNGTKHFHFSAEDLLSCC 499 +DR+C SN T S+EDLL CC Sbjct: 124 TFSDRICIASNQTLQTSISSEDLLECC 150 >UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4 precursor; n=5; Caenorhabditis|Rep: Cathepsin B-like cysteine proteinase 4 precursor - Caenorhabditis elegans Length = 335 Score = 92.3 bits (219), Expect = 1e-17 Identities = 36/69 (52%), Positives = 47/69 (68%) Frame = +2 Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487 ++P FD R +WP+C ++N +RDQ CGSCWAF A EA +DR C SNG + SAED+ Sbjct: 80 TIPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDV 139 Query: 488 LSCCPICDW 514 LSCC C + Sbjct: 140 LSCCSNCGY 148 Score = 72.1 bits (169), Expect = 2e-11 Identities = 35/85 (41%), Positives = 42/85 (49%), Gaps = 3/85 (3%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMP-CSGD-TKTPK 687 GC GG P AW+Y G +GGSY + GC+PY + PC V P C D TP Sbjct: 149 GCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPA 208 Query: 688 CTKKC-ESGYDVNYKQDKQYGKHVY 759 C KC Y+V Y DK +G Y Sbjct: 209 CVNKCTNKNYNVAYTADKHFGSTAY 233 >UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep: Cathepsin B - Pandalus borealis (Northern red shrimp) Length = 328 Score = 91.9 bits (218), Expect = 2e-17 Identities = 37/79 (46%), Positives = 50/79 (63%) Frame = +2 Query: 272 LPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 451 LP+K N +P FD R++WP CP ++E+RDQG+CGSCWA A MTDR C + Sbjct: 65 LPLK--NVTPTKEIPVEFDAREQWPHCPCIDEIRDQGNCGSCWAVSAASVMTDRTCIDTE 122 Query: 452 GTKHFHFSAEDLLSCCPIC 508 G F FS+E++ +CC C Sbjct: 123 GLVDFRFSSENVAACCTEC 141 Score = 91.5 bits (217), Expect = 2e-17 Identities = 36/88 (40%), Positives = 50/88 (56%) Frame = +1 Query: 517 CSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCTK 696 C GG A+ +W G VSGG +NS++GC+PY + CEHH+ G R PC GD C++ Sbjct: 145 CYGGDEDTAFTHWVTKGFVSGGRHNSNEGCQPYSVEECEHHIEGPRPPCEGDMPELVCSE 204 Query: 697 KCESGYDVNYKQDKQYGKHVYTCPETKT 780 C Y Y++D +YG Y P+ T Sbjct: 205 TCHEEYGKTYEEDLEYGLEAYVLPQDVT 232 Score = 48.4 bits (110), Expect = 2e-04 Identities = 23/48 (47%), Positives = 31/48 (64%) Frame = +3 Query: 96 LVCVLAAAKDLPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKI 239 L+ ++AAA PLSDEF+ + KQ +WKAGRNF +D S LK + Sbjct: 6 LLALVAAASAELDPLSDEFLELLQSKQMTWKAGRNFAKDISKDFLKSL 53 >UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 1 - Rhipicephalus appendiculatus (Brown ear tick) Length = 332 Score = 91.1 bits (216), Expect = 3e-17 Identities = 34/73 (46%), Positives = 48/73 (65%) Frame = +2 Query: 314 PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLS 493 PE+F PR+ W C ++ +RDQ +CGSCWAF A E+++DR+C ++NG + SAEDLL+ Sbjct: 88 PESFTPREYWSHCSSIRVIRDQSACGSCWAFAAAESISDRICIHTNGKVQVNISAEDLLA 147 Query: 494 CCPICDWDAAEEC 532 CC C C Sbjct: 148 CCHTCGHGCDGRC 160 Score = 55.2 bits (127), Expect = 2e-06 Identities = 27/72 (37%), Positives = 38/72 (52%), Gaps = 4/72 (5%) Frame = +1 Query: 592 SSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCTKKCESGYDVNYKQDKQYGKHVY---- 759 + GC+PY +PPC VP C+ TPKC C GY+ +Y++DK + K+VY Sbjct: 180 TEDGCQPYSLPPC---VPN----CTHPEPTPKCQHVCRKGYEKSYEEDKHFAKNVYRLLK 232 Query: 760 TCPETKTTSARN 795 C KT +N Sbjct: 233 KCDAIKTDIYKN 244 Score = 36.3 bits (80), Expect = 0.97 Identities = 16/28 (57%), Positives = 18/28 (64%) Frame = +3 Query: 135 PLSDEFINTINLKQNSWKAGRNFPRDTS 218 PLS+E IN IN +WKAGRNF S Sbjct: 26 PLSEEMINFINSINTTWKAGRNFDEKRS 53 Score = 34.7 bits (76), Expect = 3.0 Identities = 13/22 (59%), Positives = 18/22 (81%) Frame = +3 Query: 777 DHIRAELFKNGPVEGAFTVYSD 842 D I+ +++KNGPVE AF VY+D Sbjct: 235 DAIKTDIYKNGPVESAFFVYAD 256 >UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep: Cathepsin b - Aedes aegypti (Yellowfever mosquito) Length = 332 Score = 90.2 bits (214), Expect = 6e-17 Identities = 35/79 (44%), Positives = 50/79 (63%) Frame = +2 Query: 272 LPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 451 LP K H+ +PE FD R+KWP C +++ +++QG CG+CWA AV M+DR+C +S Sbjct: 72 LPTKRHDVAYNMDIPEFFDAREKWPYCKSISTIKNQGLCGACWAVAAVSVMSDRLCIHSE 131 Query: 452 GTKHFHFSAEDLLSCCPIC 508 G +AEDL+ CC C Sbjct: 132 GKFDVELAAEDLMGCCKDC 150 Score = 83.4 bits (197), Expect = 6e-15 Identities = 38/86 (44%), Positives = 50/86 (58%), Gaps = 1/86 (1%) Frame = +1 Query: 514 GCSGG-MPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKC 690 GC+GG + +++YW GLVSG +YNS+ GC+PY PC + G C + KTP C Sbjct: 153 GCNGGFLDGTSFQYWVDVGLVSGAAYNSTDGCKPYPFKPCLYPFVG----CHPE-KTPSC 207 Query: 691 TKKCESGYDVNYKQDKQYGKHVYTCP 768 T C GYD Y++DK YG Y P Sbjct: 208 THHCTEGYDGTYRRDKYYGSAAYKLP 233 Score = 34.7 bits (76), Expect = 3.0 Identities = 15/28 (53%), Positives = 18/28 (64%) Frame = +3 Query: 762 LSGDEDHIRAELFKNGPVEGAFTVYSDL 845 L DE I+ E+ NGPVE F+VY DL Sbjct: 232 LPNDERMIQLEIMTNGPVESGFSVYQDL 259 >UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7; n=2; Haemonchidae|Rep: Cathepsin B-like cysteine protease GCP7 - Haemonchus contortus (Barber pole worm) Length = 348 Score = 90.2 bits (214), Expect = 6e-17 Identities = 39/85 (45%), Positives = 53/85 (62%) Frame = +2 Query: 245 SYRDEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424 SY E+ + T N D+ PE+FD R+KW DCP+L + DQ +CGSCWA A + M Sbjct: 78 SYNQENVLPIANITSNDDI----PESFDSREKWKDCPSLRVIPDQSNCGSCWAVSAAQCM 133 Query: 425 TDRVCTYSNGTKHFHFSAEDLLSCC 499 +DR+C +S G K SA D+L+CC Sbjct: 134 SDRLCIHSQGRKKVLLSATDILACC 158 Score = 62.1 bits (144), Expect = 2e-08 Identities = 31/91 (34%), Positives = 41/91 (45%), Gaps = 1/91 (1%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPC-SGDTKTPKC 690 GC GG AW++ G+V+GG+Y C+PY P C H C S TP C Sbjct: 165 GCDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCGAHKGKAFNNCPSHPYATPAC 224 Query: 691 TKKCESGYDVNYKQDKQYGKHVYTCPETKTT 783 C+ GY Y+ DK + Y P + T Sbjct: 225 KPYCQYGYGKRYENDKIKARTWYWLPNDERT 255 >UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_115, whole genome shotgun sequence - Paramecium tetraurelia Length = 332 Score = 89.8 bits (213), Expect = 7e-17 Identities = 38/87 (43%), Positives = 53/87 (60%), Gaps = 1/87 (1%) Frame = +2 Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454 P++ + + +LP +F ++KWP CP++ + DQG+CGSCWA A M+DR+C S Sbjct: 59 PVEYKYHEKLENLPPSFSAQEKWPGCPSIELIPDQGNCGSCWAVSAASTMSDRLCIASGQ 118 Query: 455 TKHFHFSAEDLLSCCPI-CDWDAAEEC 532 T SAEDLLSCC I C+ D C Sbjct: 119 TDKRQISAEDLLSCCGINCELDGNGGC 145 Score = 72.9 bits (171), Expect = 9e-12 Identities = 34/81 (41%), Positives = 42/81 (51%), Gaps = 6/81 (7%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEH-HVPGNRMPCSGD-----T 675 GC GG P AW+Y + G+V+GG+YN C+PY PPC H + G C D Sbjct: 144 GCDGGYPYGAWKYLRVDGIVTGGTYNDFSLCKPYSFPPCSHGNDSGKYSKCENDFFMLTE 203 Query: 676 KTPKCTKKCESGYDVNYKQDK 738 TP CTKKC + Y DK Sbjct: 204 VTPSCTKKCHPQFSRTYDVDK 224 >UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8.4; n=1; Caenorhabditis elegans|Rep: Putative uncharacterized protein W07B8.4 - Caenorhabditis elegans Length = 335 Score = 88.6 bits (210), Expect = 2e-16 Identities = 34/64 (53%), Positives = 46/64 (71%) Frame = +2 Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487 S+P+++D RD WP C ++N +RDQ CGSCWA A EA++DR C SNG + SAED+ Sbjct: 72 SIPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGDVNTLLSAEDI 131 Query: 488 LSCC 499 L+CC Sbjct: 132 LTCC 135 Score = 81.8 bits (193), Expect = 2e-14 Identities = 38/86 (44%), Positives = 46/86 (53%), Gaps = 4/86 (4%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMP-CSGD-TKTPK 687 GC GG P AW YW GLV+GGS+ S GC+PY I PC + G P C + TPK Sbjct: 144 GCEGGYPIQAWRYWVKNGLVTGGSFESQYGCKPYSIAPCGETIDGVTWPECPMKISDTPK 203 Query: 688 CTKKC--ESGYDVNYKQDKQYGKHVY 759 C C + Y + Y QDK +G Y Sbjct: 204 CEHHCTGNNSYPIPYDQDKHFGASAY 229 >UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep: Cathepsin B - Uronema marinum Length = 350 Score = 88.2 bits (209), Expect = 2e-16 Identities = 36/64 (56%), Positives = 47/64 (73%) Frame = +2 Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487 SLPE+FD R+ +P C +L +VRDQ +CGSCWAFG VEA++DR+C S S+E+L Sbjct: 85 SLPESFDLREAYPKCESLQQVRDQSNCGSCWAFGTVEAISDRICIASGQKDQTRISSENL 144 Query: 488 LSCC 499 LSCC Sbjct: 145 LSCC 148 Score = 80.2 bits (189), Expect = 6e-14 Identities = 40/97 (41%), Positives = 51/97 (52%), Gaps = 8/97 (8%) Frame = +1 Query: 511 LGCSGGMPRLAWEYWKHFGLVSGGSY-----NSSQGCRPYEIPPCEHHVPGNRMPCSG-- 669 +GC+GG AW Y+ GLVSG Y NS C+PY PPC HHV G C+ Sbjct: 156 MGCNGGYTAGAWNYYVKTGLVSGNLYTDDNQNSKTECQPYSFPPCSHHVQGEYQACTDLP 215 Query: 670 DTKTPKCTKKCESGYDVN-YKQDKQYGKHVYTCPETK 777 TPKC +C S Y N Y+QD G Y+ P+++ Sbjct: 216 QFNTPKCYTECNSQYTQNSYEQDLHKGVSSYSVPKSE 252 >UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4; Ancylostomatidae|Rep: Cysteine proteinase - Ancylostoma ceylanicum Length = 348 Score = 87.8 bits (208), Expect = 3e-16 Identities = 39/88 (44%), Positives = 53/88 (60%), Gaps = 6/88 (6%) Frame = +2 Query: 254 DEHFATLPIKTH------NFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 415 D FA P KT N ++ +P+ FD RD+WP+C ++ +RDQ SCGSCWA A Sbjct: 69 DVKFAVDPEKTEPNYVLANTEMKVDIPDTFDARDRWPNCTSMKHIRDQSSCGSCWAVAAA 128 Query: 416 EAMTDRVCTYSNGTKHFHFSAEDLLSCC 499 AM+DRVC +NG + S ++LSCC Sbjct: 129 SAMSDRVCALTNGRINRILSDTEVLSCC 156 Score = 66.1 bits (154), Expect = 1e-09 Identities = 29/84 (34%), Positives = 42/84 (50%), Gaps = 2/84 (2%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRM-PCSGDT-KTPK 687 GC GG P A+ Y +GL +GG Y C+PY PC +H PC + TP Sbjct: 163 GCKGGYPARAFGYAWRYGLSTGGPYGEKDACQPYAFYPCGNHAHEPYYGPCPDELWPTPT 222 Query: 688 CTKKCESGYDVNYKQDKQYGKHVY 759 C + C+ GY + +++DK + Y Sbjct: 223 CRRTCQLGYPIPFEKDKIFNDQTY 246 >UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8; Trypanosoma|Rep: Cathepsin B-like cysteine protease - Trypanosoma brucei Length = 340 Score = 87.4 bits (207), Expect = 4e-16 Identities = 35/68 (51%), Positives = 45/68 (66%) Frame = +2 Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484 A LP +FD + WP+CPT+ ++ DQ +CGSCWA A AM+DR CT G + H SA D Sbjct: 92 APLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCT-MGGVQDVHISAGD 150 Query: 485 LLSCCPIC 508 LL+CC C Sbjct: 151 LLACCSDC 158 Score = 50.4 bits (115), Expect = 6e-05 Identities = 32/82 (39%), Positives = 38/82 (46%), Gaps = 5/82 (6%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNR--MPCSG-DTKTP 684 GC+GG P AW Y+ GLVS Y C+PY P C HH PCS + TP Sbjct: 161 GCNGGDPDRAWAYFSSTGLVS--DY-----CQPYPFPHCSHHSKSKNGYPPCSQFNFDTP 213 Query: 685 KCTKKCESGY--DVNYKQDKQY 744 KC C+ VNY+ Y Sbjct: 214 KCNYTCDDPTIPVVNYRSWTSY 235 >UniRef50_Q625X3 Cluster: Putative uncharacterized protein CBG01102; n=1; Caenorhabditis briggsae|Rep: Putative uncharacterized protein CBG01102 - Caenorhabditis briggsae Length = 374 Score = 86.6 bits (205), Expect = 7e-16 Identities = 39/86 (45%), Positives = 50/86 (58%), Gaps = 2/86 (2%) Frame = +1 Query: 517 CSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMP-CSGDT-KTPKC 690 C+GG AW+YW+ GL +GGSY S GC+PY I PC+ + P C T +TP C Sbjct: 189 CAGGNVFKAWQYWQKHGLPTGGSYESQFGCKPYSISPCDTVIGNITFPGCLNSTVQTPSC 248 Query: 691 TKKCESGYDVNYKQDKQYGKHVYTCP 768 KKC+SGY V +D+ YG V P Sbjct: 249 EKKCKSGYPVELDKDRHYGVSVDQLP 274 Score = 68.9 bits (161), Expect = 1e-10 Identities = 27/59 (45%), Positives = 39/59 (66%) Frame = +2 Query: 323 FDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC 499 FD R++WP+C ++ + D C S WAF A E+M+DR+C S G + SA++LLSCC Sbjct: 85 FDARERWPECSSIPIINDISDCKSSWAFSAAESMSDRLCINSGGMINTVLSAQELLSCC 143 >UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 421 Score = 86.2 bits (204), Expect = 9e-16 Identities = 32/68 (47%), Positives = 46/68 (67%) Frame = +2 Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484 + +P+NFD R KWP+CP+++ V +QG CGSC+A A +DR C +SNGT S ED Sbjct: 136 SDVPKNFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSNGTFKSLLSEED 195 Query: 485 LLSCCPIC 508 ++ CC +C Sbjct: 196 IIGCCSVC 203 Score = 46.8 bits (106), Expect = 7e-04 Identities = 31/108 (28%), Positives = 47/108 (43%), Gaps = 2/108 (1%) Frame = +1 Query: 517 CSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCTK 696 C GG P A YW + GLV+GG GCRPY VP + + C K Sbjct: 206 CYGGDPLKALTYWVNQGLVTGG----RDGCRPYSF-DLSCGVPCSPATFFEAEEKRTCMK 260 Query: 697 KCES-GYDVNYKQDKQYGKHVYTC-PETKTTSARNCSRMVPSKVLSQY 834 +C++ Y Y++DK + Y+ P + T S R+ ++ + Sbjct: 261 RCQNIYYQQKYEEDKHFATFAYSMYPRSMTVSPDGKERVKVPTIIGHF 308 >UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8; Leishmania|Rep: Cathepsin B-like protease - Leishmania major Length = 340 Score = 85.8 bits (203), Expect = 1e-15 Identities = 37/71 (52%), Positives = 47/71 (66%) Frame = +2 Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFS 475 +L LPE FD + WP C T++E+RDQ +CGSCWA AVEA++DR CT+ G S Sbjct: 93 ELQQDLPEFFDAAEHWPMCLTISEIRDQSNCGSCWAIAAVEAISDRYCTF-GGVPDRRMS 151 Query: 476 AEDLLSCCPIC 508 +LLSCC IC Sbjct: 152 TSNLLSCCFIC 162 Score = 55.6 bits (128), Expect = 1e-06 Identities = 30/82 (36%), Positives = 40/82 (48%), Gaps = 4/82 (4%) Frame = +1 Query: 511 LGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDT--KTP 684 LGC GG+P +AW +W G+ +++ C+PY PC HH + P T TP Sbjct: 164 LGCHGGIPTVAWLWWVWVGI-------ATEDCQPYPFDPCSHHGNSEKYPPCPSTIYDTP 216 Query: 685 KCTKKCE-SGYD-VNYKQDKQY 744 KC CE + D V YK Y Sbjct: 217 KCNTTCERNEMDLVKYKGSTSY 238 >UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2 precursor; n=8; Haemonchus contortus|Rep: Cathepsin B-like cysteine proteinase 2 precursor - Haemonchus contortus (Barber pole worm) Length = 342 Score = 85.4 bits (202), Expect = 2e-15 Identities = 40/85 (47%), Positives = 48/85 (56%), Gaps = 3/85 (3%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRM---PCSGDTKTP 684 GC GG P AW+Y+ + G+VSGG Y + CRPY I PC HH GN C G TP Sbjct: 155 GCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHH--GNDTYYGECRGTAPTP 212 Query: 685 KCTKKCESGYDVNYKQDKQYGKHVY 759 C +KC G Y+ DK+YGK Y Sbjct: 213 PCKRKCRPGVRKMYRIDKRYGKDAY 237 Score = 75.8 bits (178), Expect = 1e-12 Identities = 30/67 (44%), Positives = 43/67 (64%), Gaps = 1/67 (1%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 +P ++DPRD W +C T +RDQ +CGSCWA A++DR+C S K + SA D++ Sbjct: 87 IPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIM 145 Query: 491 SCC-PIC 508 +CC P C Sbjct: 146 TCCRPQC 152 >UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator americanus|Rep: Cysteine proteinase 4 - Necator americanus (Human hookworm) Length = 339 Score = 84.2 bits (199), Expect = 4e-15 Identities = 32/68 (47%), Positives = 44/68 (64%) Frame = +2 Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFS 475 +L LPE FD R+KWP C ++ +RD +CGSCWA A M+DR+C +NGT S Sbjct: 83 NLNVELPERFDAREKWPHCASIGLIRDHSACGSCWAVSAASVMSDRLCIQTNGTNQKILS 142 Query: 476 AEDLLSCC 499 + D+L+CC Sbjct: 143 SADILACC 150 Score = 72.9 bits (171), Expect = 9e-12 Identities = 35/82 (42%), Positives = 46/82 (56%), Gaps = 2/82 (2%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPC--SGDTKTPK 687 GC GG P A+ Y ++ G+ SGG Y C+PY PC+ GN PC G TPK Sbjct: 157 GCEGGYPIQAYFYLENTGVCSGGEYREKNVCKPYPFYPCD----GNYGPCPKEGAFDTPK 212 Query: 688 CTKKCESGYDVNYKQDKQYGKH 753 C K C+ Y V Y++DK +GK+ Sbjct: 213 CRKICQFRYPVPYEEDKVFGKN 234 >UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8.1; n=1; Caenorhabditis elegans|Rep: Putative uncharacterized protein W07B8.1 - Caenorhabditis elegans Length = 335 Score = 83.4 bits (197), Expect = 6e-15 Identities = 40/92 (43%), Positives = 52/92 (56%), Gaps = 4/92 (4%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMP-CSGDTK-TPK 687 GC GG P AW+Y + G+ +GGSY S GC+PY IPPC V P C+ T TP Sbjct: 147 GCEGGNPFKAWQYIQKHGIPTGGSYESQFGCKPYSIPPCGKTVGNVTYPACTNTTSPTPS 206 Query: 688 CTKKCES--GYDVNYKQDKQYGKHVYTCPETK 777 C KKC S GY ++ +D+ YG V P ++ Sbjct: 207 CEKKCTSRIGYPIDIDKDRHYGVSVDQLPNSQ 238 Score = 78.2 bits (184), Expect = 2e-13 Identities = 34/81 (41%), Positives = 51/81 (62%), Gaps = 3/81 (3%) Frame = +2 Query: 266 ATLPIKTHNFDLI---ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 436 AT+ K NF + + L +FD R++WP+C ++ ++ D C + WAF A E+M+DR+ Sbjct: 58 ATIGFKIQNFGVSQANSDLSPSFDARERWPECMSIPQINDISECKTSWAFAAAESMSDRL 117 Query: 437 CTYSNGTKHFHFSAEDLLSCC 499 C S G K+ SAE+LLSCC Sbjct: 118 CINSGGFKNTILSAEELLSCC 138 >UniRef50_Q5DEC9 Cluster: SJCHGC06356 protein; n=1; Schistosoma japonicum|Rep: SJCHGC06356 protein - Schistosoma japonicum (Blood fluke) Length = 279 Score = 83.0 bits (196), Expect = 9e-15 Identities = 32/83 (38%), Positives = 48/83 (57%), Gaps = 1/83 (1%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDT-KTPKC 690 GC G YW +G+V+GGSY GC+PY +P C +H + C+ +T + P+C Sbjct: 94 GCFHGSEVEVLVYWITYGIVTGGSYEDQSGCQPYPLPKCSYHPESRFLDCNNNTFEFPQC 153 Query: 691 TKKCESGYDVNYKQDKQYGKHVY 759 T +C+ GY+ Y DK YG+ +Y Sbjct: 154 TNECQDGYNKTYDDDKFYGERIY 176 Score = 65.3 bits (152), Expect = 2e-09 Identities = 29/81 (35%), Positives = 46/81 (56%), Gaps = 1/81 (1%) Frame = +2 Query: 257 EHFATLPIKTHNFDLI-ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 433 E+ T IKT + + I +P +FD R W +C T+ ++ D+ C + WA V++++DR Sbjct: 9 ENIQTKHIKTISHNSINMEIPRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSISDR 68 Query: 434 VCTYSNGTKHFHFSAEDLLSC 496 +C SNG SA D +SC Sbjct: 69 ICIRSNGRISVQLSARDAISC 89 >UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1 precursor; n=3; Haemonchidae|Rep: Cathepsin B-like cysteine proteinase 1 precursor - Ostertagia ostertagi Length = 341 Score = 83.0 bits (196), Expect = 9e-15 Identities = 31/66 (46%), Positives = 45/66 (68%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 +PE++DPR +W +C +L + DQ +CGSCWA + AM+DR+C S G K SA+D++ Sbjct: 91 IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 150 Query: 491 SCCPIC 508 SCC C Sbjct: 151 SCCTWC 156 Score = 78.6 bits (185), Expect = 2e-13 Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 3/82 (3%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRM---PCSGDTKTP 684 GC GG P A+ + G+V+GG YN+ CRPYEI PC HH GN C G TP Sbjct: 159 GCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPYEIHPCGHH--GNETYYGECVGMADTP 216 Query: 685 KCTKKCESGYDVNYKQDKQYGK 750 +C ++C GY +Y D+ Y K Sbjct: 217 RCKRRCLLGYPKSYPSDRYYKK 238 >UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyledons|Rep: At1g02300/T6A9_10 - Arabidopsis thaliana (Mouse-ear cress) Length = 362 Score = 81.4 bits (192), Expect = 3e-14 Identities = 36/79 (45%), Positives = 48/79 (60%) Frame = +2 Query: 263 FATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCT 442 F +PI +H+ L LP+ FD R W C ++ + DQG CGSCWAFGAVE+++DR C Sbjct: 92 FLGVPIVSHDISL--KLPKEFDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCI 149 Query: 443 YSNGTKHFHFSAEDLLSCC 499 N + S DLL+CC Sbjct: 150 KYN--MNVSLSVNDLLACC 166 Score = 55.6 bits (128), Expect = 1e-06 Identities = 32/83 (38%), Positives = 43/83 (51%), Gaps = 1/83 (1%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVPGNRMPCSGDTKTPKC 690 GC+GG P AW Y+KH G+V ++ C PY + C H PG C TPKC Sbjct: 173 GCNGGYPIAAWRYFKHHGVV-------TEECDPYFDNTGCSH--PG----CEPAYPTPKC 219 Query: 691 TKKCESGYDVNYKQDKQYGKHVY 759 +KC SG + +++ K YG Y Sbjct: 220 ARKCVSGNQL-WRESKHYGVSAY 241 Score = 37.5 bits (83), Expect = 0.42 Identities = 16/22 (72%), Positives = 18/22 (81%) Frame = +3 Query: 777 DHIRAELFKNGPVEGAFTVYSD 842 D I AE++KNGPVE AFTVY D Sbjct: 248 DDIMAEVYKNGPVEVAFTVYED 269 >UniRef50_Q00W63 Cluster: Cysteine proteinase; n=2; Ostreococcus|Rep: Cysteine proteinase - Ostreococcus tauri Length = 362 Score = 81.0 bits (191), Expect = 3e-14 Identities = 35/63 (55%), Positives = 43/63 (68%), Gaps = 1/63 (1%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487 LP+ FD R+KWP C L +E DQG+CGSCWA +AMTDR+C +NG + H SA L Sbjct: 88 LPDTFDVREKWPKCAALVSEAVDQGACGSCWAVAPAKAMTDRLCIATNGAVNTHVSAIQL 147 Query: 488 LSC 496 LSC Sbjct: 148 LSC 150 Score = 43.2 bits (97), Expect = 0.008 Identities = 18/41 (43%), Positives = 20/41 (48%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEH 636 GC GG P A+E G+VSGG C PY PC H Sbjct: 170 GCMGGYPTEAYETAHRVGVVSGGLNGDQDTCMPYPFAPCHH 210 >UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma japonicum|Rep: SJCHGC02853 protein - Schistosoma japonicum (Blood fluke) Length = 181 Score = 80.6 bits (190), Expect = 5e-14 Identities = 34/65 (52%), Positives = 45/65 (69%) Frame = +2 Query: 254 DEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 433 D+H PI HN D+ LP+ FD R W +C ++ +RDQ SCGSCWAFGAVE+M+DR Sbjct: 64 DQHKLHHPIIHHN-DINIKLPKYFDSRKYWKNCSSIRTIRDQSSCGSCWAFGAVESMSDR 122 Query: 434 VCTYS 448 +C +S Sbjct: 123 ICIHS 127 Score = 37.1 bits (82), Expect = 0.55 Identities = 21/40 (52%), Positives = 24/40 (60%), Gaps = 1/40 (2%) Frame = +3 Query: 135 PLSDEFINTINLKQN-SWKAGRNFPRDTSFAHLKKIMGVI 251 PLSDE I IN + N WKA R R TS H K +MGV+ Sbjct: 21 PLSDELITFINKQPNIEWKADRT-KRFTSIHHAKSMMGVL 59 >UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2; Arthropoda|Rep: Cathepsin B-like cysteine protease - Callosobruchus maculatus (Southern cowpea weevil) (Pulse bruchid) Length = 330 Score = 80.2 bits (189), Expect = 6e-14 Identities = 29/66 (43%), Positives = 39/66 (59%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 LPE FD R +W C ++ E+RDQ CGSCWA + M+DR+C S+ SA D++ Sbjct: 81 LPEEFDARKQWSKCESIKEIRDQSGCGSCWAVSSASVMSDRICIQSDQKNQLRISAADMI 140 Query: 491 SCCPIC 508 CC C Sbjct: 141 ECCESC 146 Score = 73.7 bits (173), Expect = 5e-12 Identities = 33/82 (40%), Positives = 42/82 (51%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693 GC GG+P + WK G VSGG YNS+ GC Y +P C P C P C Sbjct: 152 GCHGGIPSFTFTEWKDSGFVSGGEYNSTNGCMSYPLPRCN---PS----CKTLYDAPTCK 204 Query: 694 KKCESGYDVNYKQDKQYGKHVY 759 K+C+ G + Y++DK Y K Y Sbjct: 205 KECDKGSPLKYEEDKHYAKQAY 226 Score = 49.6 bits (113), Expect = 1e-04 Identities = 24/63 (38%), Positives = 38/63 (60%), Gaps = 2/63 (3%) Frame = +3 Query: 78 RAAYVTLVCVLAAAKDLPHP--LSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVI 251 + A++ L V++ P LSDE+I +N K WKAGRNF RDTS ++++++ V Sbjct: 2 KLAFIALAAVVSCTFAQPELDFLSDEYIEQLNSKNLPWKAGRNFERDTSLYNIQRLLSVG 61 Query: 252 EMN 260 +N Sbjct: 62 TIN 64 >UniRef50_P91991 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 356 Score = 80.2 bits (189), Expect = 6e-14 Identities = 35/73 (47%), Positives = 45/73 (61%) Frame = +2 Query: 281 KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460 KT N +++ +P +FD R KWP C + VRDQ CGS AVE +DR C SNGT Sbjct: 82 KTGNDNVLVDIPSSFDSRQKWPSCSQIGAVRDQSDCGSAAHLVAVEIASDRTCIASNGTF 141 Query: 461 HFHFSAEDLLSCC 499 ++ SA+D LSCC Sbjct: 142 NWPLSAQDPLSCC 154 Score = 73.3 bits (172), Expect = 7e-12 Identities = 35/93 (37%), Positives = 49/93 (52%), Gaps = 4/93 (4%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPG--NRMPCSGDTKTPK 687 GC G P+ ++W+ GL +GG+YN GC+PY I PC+ +PC G TP Sbjct: 166 GCDGSWPKDILKWWQTHGLCTGGNYNDQFGCKPYSIYPCDKKYANGTTSVPCPG-YHTPT 224 Query: 688 CTKKCESG--YDVNYKQDKQYGKHVYTCPETKT 780 C + C S + + YKQDK +GK Y + T Sbjct: 225 CEEHCTSNITWPIAYKQDKHFGKAHYNVGKKMT 257 >UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin B-like cysteine proteinase 4 precursor (Cysteine protease-related 4); n=2; Tribolium castaneum|Rep: PREDICTED: similar to Cathepsin B-like cysteine proteinase 4 precursor (Cysteine protease-related 4) - Tribolium castaneum Length = 360 Score = 79.4 bits (187), Expect = 1e-13 Identities = 31/67 (46%), Positives = 41/67 (61%), Gaps = 1/67 (1%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487 +PE FD R+ WP+C + +R+QG C S WAF A E M+DR+C +NG S EDL Sbjct: 72 IPETFDAREYWPECADIIGNIRNQGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPEDL 131 Query: 488 LSCCPIC 508 + CC C Sbjct: 132 IDCCHYC 138 Score = 58.8 bits (136), Expect = 2e-07 Identities = 32/89 (35%), Positives = 44/89 (49%), Gaps = 1/89 (1%) Frame = +1 Query: 517 CSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCTK 696 C GG AW Y+ GLVSGG YN+S GC+PY + R+ TP C Sbjct: 142 CKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQPYS------ELNYYRI-------TPPCNT 188 Query: 697 KCESG-YDVNYKQDKQYGKHVYTCPETKT 780 C++ Y + Y DK +G +Y P+ +T Sbjct: 189 TCQNDKYPIPYVSDKHFGDSIYYIPQNET 217 >UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: Cathepsin B - Triticum aestivum (Wheat) Length = 353 Score = 77.8 bits (183), Expect = 3e-13 Identities = 36/78 (46%), Positives = 45/78 (57%) Frame = +2 Query: 266 ATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTY 445 A +PIK H LP+ FD R +W C T+ + DQG CG+CWAF AVEA+ DR C + Sbjct: 85 AGVPIKIHPE---MDLPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVEALQDRFCIH 141 Query: 446 SNGTKHFHFSAEDLLSCC 499 N S DLL+CC Sbjct: 142 LN--MSVSLSVNDLLACC 157 Score = 45.6 bits (103), Expect = 0.002 Identities = 32/111 (28%), Positives = 49/111 (44%), Gaps = 1/111 (0%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVPGNRMPCSGDTKTPKC 690 GC+GG P AW Y++ G+V ++ C PY + C+H PG C TPKC Sbjct: 164 GCNGGYPISAWRYFRRSGVV-------TEECDPYFDQTGCQH--PG----CEPAYPTPKC 210 Query: 691 TKKCESGYDVNYKQDKQYGKHVYTCPETKTTSARNCSRMVPSKVLSQYIQI 843 +KC+ +K++K + + Y + P +V Y QI Sbjct: 211 QRKCKVENQA-WKENKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTYCQI 260 >UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|Rep: Cathepsin B - Streblomastix strix Length = 312 Score = 77.4 bits (182), Expect = 4e-13 Identities = 30/69 (43%), Positives = 41/69 (59%) Frame = +2 Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481 +A+LP+ FD R WP+C + ++ DQG CGSCWA + E + DR C S G + S + Sbjct: 73 VANLPDEFDSRTNWPNCQLIGKIYDQGHCGSCWAMSSFEVLQDRFCIKSEGKQTPELSPQ 132 Query: 482 DLLSCCPIC 508 L SC P C Sbjct: 133 HLTSCTPGC 141 >UniRef50_Q237A1 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 346 Score = 77.0 bits (181), Expect = 6e-13 Identities = 34/67 (50%), Positives = 45/67 (67%), Gaps = 1/67 (1%) Frame = +2 Query: 311 LPENFDPRDKWPD-CPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487 LPE FD R +W D C +L EVRDQ +CGSCWAFGA E+++DR C + + S ++L Sbjct: 93 LPEEFDARVQWGDKCSSLWEVRDQSTCGSCWAFGAAESLSDRHCIHLG--QDIRLSTQNL 150 Query: 488 LSCCPIC 508 L+CC C Sbjct: 151 LTCCAAC 157 Score = 72.1 bits (169), Expect = 2e-11 Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 3/85 (3%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGN-RMPCSGDTKTPKC 690 GC GG P A +Y+ + GLV+G Y ++ C+ Y PC HHV + PC+G+ TP C Sbjct: 160 GCDGGWPEAAMDYYVNTGLVTGDLYGNNSWCQAYTFAPCAHHVTSDIYPPCTGELPTPPC 219 Query: 691 TKKCESG--YDVNYKQDKQYGKHVY 759 C+S + + Y +D G Y Sbjct: 220 INSCDSNSTHTIPYSKDIHRGSKAY 244 Score = 36.7 bits (81), Expect = 0.73 Identities = 15/27 (55%), Positives = 20/27 (74%) Frame = +3 Query: 762 LSGDEDHIRAELFKNGPVEGAFTVYSD 842 ++ DE I AE++KNGP+E A TVY D Sbjct: 246 IAKDEKAIMAEIYKNGPIEVALTVYED 272 >UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus contortus|Rep: Cysteine proteinase - Haemonchus contortus (Barber pole worm) Length = 350 Score = 76.6 bits (180), Expect = 7e-13 Identities = 36/84 (42%), Positives = 44/84 (52%), Gaps = 2/84 (2%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGD--TKTPK 687 GC GG LAWE+ + FG+V+GG Y CRPY PC H G R C D TP Sbjct: 163 GCEGGYDHLAWEWVQRFGVVTGGPYQQKGVCRPYAFHPCGLH-HGRRYDCPWDHSFSTPA 221 Query: 688 CTKKCESGYDVNYKQDKQYGKHVY 759 C C+ GY Y++DK + K Y Sbjct: 222 CKPYCQFGYGKRYEKDKFFVKSTY 245 Score = 73.7 bits (173), Expect = 5e-12 Identities = 29/63 (46%), Positives = 38/63 (60%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 +PE+FD R W +C ++ VRDQ CGSCWA A M+DR+C + G S D+L Sbjct: 94 IPESFDSRIVWKNCSSITYVRDQSRCGSCWAVSAASTMSDRICVQTKGKLQTILSDTDIL 153 Query: 491 SCC 499 SCC Sbjct: 154 SCC 156 Score = 35.1 bits (77), Expect = 2.2 Identities = 15/32 (46%), Positives = 19/32 (59%) Frame = +3 Query: 747 KTCIYLSGDEDHIRAELFKNGPVEGAFTVYSD 842 K+ L DE I+ E+ KNGPV+ AF Y D Sbjct: 242 KSTYILDNDEKVIQREMMKNGPVQAAFITYED 273 >UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep: Thiol protease - Trichuris suis Length = 348 Score = 73.3 bits (172), Expect = 7e-12 Identities = 33/67 (49%), Positives = 41/67 (61%) Frame = +2 Query: 299 LIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSA 478 L S+P +FD R W C +LN +RDQ CGSCWA A E M+DR+C SN + S Sbjct: 80 LALSIPPSFDVRSLWHVC-SLNLIRDQAKCGSCWAVSAAETMSDRICVQSNCSIKACISD 138 Query: 479 EDLLSCC 499 D+LSCC Sbjct: 139 TDILSCC 145 Score = 63.7 bits (148), Expect = 6e-09 Identities = 36/115 (31%), Positives = 51/115 (44%), Gaps = 11/115 (9%) Frame = +1 Query: 502 YL*LGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYE-IPPCEHHVPGNRM-PCSGDT 675 Y GC+GG P AW ++ G +GG GC+PY+ P H+ N PC DT Sbjct: 148 YCGYGCNGGFPIEAWRHFTVAGNCTGGKTIDKYGCKPYKPTGPIGRHLKRNDYAPCPNDT 207 Query: 676 ---------KTPKCTKKCESGYDVNYKQDKQYGKHVYTCPETKTTSARNCSRMVP 813 TP+C ++C GY +Y D+ YGK Y ++ R + P Sbjct: 208 YYGECVGMADTPRCKRRCLLGYPKSYPSDRYYGKSAYIVKQSVKAIQREIMKNGP 262 >UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoidea|Rep: Cathepsin B-S precursor - Tuberaphis styraci Length = 349 Score = 72.1 bits (169), Expect = 2e-11 Identities = 27/65 (41%), Positives = 37/65 (56%) Frame = +2 Query: 314 PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLS 493 P+ FD R+ W C + +RDQG+CGSCW+F A DR+C + G + S E+L Sbjct: 86 PKQFDSRENWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEELAF 145 Query: 494 CCPIC 508 CC C Sbjct: 146 CCMDC 150 Score = 63.3 bits (147), Expect = 7e-09 Identities = 29/89 (32%), Positives = 44/89 (49%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693 GC GG P AW+Y++ G+ +GG Y++ +GC PY++PPC N + +C Sbjct: 153 GCGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCYDEQGKNTCGGKPMERNHQCP 212 Query: 694 KKCESGYDVNYKQDKQYGKHVYTCPETKT 780 K C Y QD+ K+ Y +T Sbjct: 213 KTC---YGKTTVQDRYKTKNEYVINSIET 238 Score = 39.5 bits (88), Expect = 0.10 Identities = 23/59 (38%), Positives = 32/59 (54%), Gaps = 4/59 (6%) Frame = +3 Query: 81 AAYVTLVCVLAAAKDLPHP----LSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMG 245 A +VT+VC + + L P LSDE I IN +WKA R FP +TS + ++G Sbjct: 2 AKFVTIVCAIFVSVYLAEPTLQFLSDERIKYINEVAKTWKAERYFPANTSEEYFIGLLG 60 >UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1; Ostreococcus tauri|Rep: Cysteine proteinase Cathepsin F - Ostreococcus tauri Length = 498 Score = 71.7 bits (168), Expect = 2e-11 Identities = 33/64 (51%), Positives = 39/64 (60%), Gaps = 1/64 (1%) Frame = +2 Query: 308 SLPENFDPRDKWPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484 SLP +FD RD++P C L VRDQG CGSCWA A E M DR+C S G + S + Sbjct: 256 SLPRHFDARDEYPKCARLIGTVRDQGKCGSCWAVAATEIMNDRLCISSGGKEVAELSPQF 315 Query: 485 LLSC 496 LSC Sbjct: 316 ALSC 319 >UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep: Cysteine proteinase - Toxoplasma gondii Length = 569 Score = 70.9 bits (166), Expect = 4e-11 Identities = 32/78 (41%), Positives = 43/78 (55%), Gaps = 2/78 (2%) Frame = +2 Query: 272 LPIKTHNFDLIAS-LPENFDPRDKWPDCP-TLNEVRDQGSCGSCWAFGAVEAMTDRVCTY 445 +P+ F+ +P +FD R +P C + VRDQG CGSCWAF + EA DR+C Sbjct: 260 MPLPAKEFENATEPVPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIR 319 Query: 446 SNGTKHFHFSAEDLLSCC 499 S G + SA+ SCC Sbjct: 320 SQGKRLMPLSAQHTTSCC 337 Score = 64.1 bits (149), Expect = 4e-09 Identities = 29/70 (41%), Positives = 41/70 (58%), Gaps = 6/70 (8%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNS-SQG--CRPYEIPPCEHHVPGNRMPCSG---DT 675 GC+GG P +AW +++ G+V+GG +++ +G C PYE+P C HH C Sbjct: 346 GCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAHHAKAPFPDCDATLVPR 405 Query: 676 KTPKCTKKCE 705 KTPKC K CE Sbjct: 406 KTPKCRKDCE 415 >UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|Rep: Cysteine protease - Giardia muris Length = 301 Score = 69.3 bits (162), Expect = 1e-10 Identities = 35/84 (41%), Positives = 47/84 (55%), Gaps = 4/84 (4%) Frame = +2 Query: 257 EHFATLPIKTH----NFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424 E+ +L +TH N LP+++DPR + C L EV DQ SCGSCWAF AV Sbjct: 55 ENLRSLRTETHVSQLNLGKTKELPKDYDPRVERAHC--LPEVADQASCGSCWAFSAVATF 112 Query: 425 TDRVCTYSNGTKHFHFSAEDLLSC 496 DR C Y +K H+S + ++SC Sbjct: 113 ADRRCAYGLDSKQVHYSEQYVVSC 136 >UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 311 Score = 68.1 bits (159), Expect = 3e-10 Identities = 28/63 (44%), Positives = 41/63 (65%) Frame = +2 Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487 ++PENFD R +WP +++ +R+QG CGSCWAFGA E ++DR S + SA+ L Sbjct: 82 NIPENFDARKQWPG--SIHPIRNQGQCGSCWAFGASEVLSDRFAIASKNQIYVTLSAQQL 139 Query: 488 LSC 496 + C Sbjct: 140 VDC 142 >UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomuscorum|Rep: Cathepsin B - Oxytricha trifallax (Sterkiella histriomuscorum) Length = 294 Score = 65.3 bits (152), Expect = 2e-09 Identities = 33/65 (50%), Positives = 41/65 (63%) Frame = +2 Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481 I ++PENFD R +W ++ +RDQ CGSCWAFGA EA +DR NG K S E Sbjct: 73 IMTVPENFDARQQWGS--KIHAIRDQQQCGSCWAFGATEAFSDRFAI--NG-KDVILSPE 127 Query: 482 DLLSC 496 DL+SC Sbjct: 128 DLVSC 132 Score = 33.9 bits (74), Expect = 5.2 Identities = 13/20 (65%), Positives = 18/20 (90%) Frame = +3 Query: 783 IRAELFKNGPVEGAFTVYSD 842 I++E+ +GPVEGAFTVY+D Sbjct: 203 IQSEIVSHGPVEGAFTVYTD 222 >UniRef50_A4RYN2 Cluster: Predicted protein; n=1; Ostreococcus lucimarinus CCE9901|Rep: Predicted protein - Ostreococcus lucimarinus CCE9901 Length = 330 Score = 64.5 bits (150), Expect = 3e-09 Identities = 30/63 (47%), Positives = 36/63 (57%), Gaps = 1/63 (1%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487 LP +FD R +P C L VRDQG CGSCWA A E M DR+C ++G S + Sbjct: 112 LPTSFDARVAYPKCSRLLGAVRDQGRCGSCWAVAATEVMNDRLCVATDGENADELSPQYA 171 Query: 488 LSC 496 LSC Sbjct: 172 LSC 174 Score = 37.5 bits (83), Expect = 0.42 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 3/104 (2%) Frame = +1 Query: 514 GCSGG--MPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPK 687 GC GG + L + K G+ GG +S+ C PYE C+H PC TP+ Sbjct: 180 GCDGGDVLDTLRIAFTK--GIPYGGMLDSN-ACLPYEFEACDH-------PCMVAGTTPQ 229 Query: 688 -CTKKCESGYDVNYKQDKQYGKHVYTCPETKTTSARNCSRMVPS 816 C KC G +++ YTCP+ T + VP+ Sbjct: 230 SCPAKCADGSALSFVHPT---SEPYTCPKGDVTHTGSGVYTVPN 270 >UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 314 Score = 63.3 bits (147), Expect = 7e-09 Identities = 28/68 (41%), Positives = 43/68 (63%), Gaps = 1/68 (1%) Frame = +2 Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG-TKHFHF 472 +L S+P +FD R +WPDC ++ + +Q CGSCWAF + E ++DR+C SN T Sbjct: 83 ELKGSIPTSFDSRVQWPDC--IHPILNQEQCGSCWAFSSSEVLSDRLCIASNNKTNPGAL 140 Query: 473 SAEDLLSC 496 S + L++C Sbjct: 141 SPQTLVAC 148 Score = 34.3 bits (75), Expect = 3.9 Identities = 13/19 (68%), Positives = 16/19 (84%) Frame = +1 Query: 514 GCSGGMPRLAWEYWKHFGL 570 GCSGG+P+LAWEY + GL Sbjct: 155 GCSGGIPQLAWEYMELKGL 173 >UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|Rep: Cathepsin B - Streblomastix strix Length = 283 Score = 62.5 bits (145), Expect = 1e-08 Identities = 29/62 (46%), Positives = 37/62 (59%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 +P+ FD R+KWPD + VRDQG CGSCWAF E + DR+ G + EDL+ Sbjct: 63 VPDTFDAREKWPDA--ILPVRDQGECGSCWAFSIAETIGDRLGVL--GCSRGDIAPEDLV 118 Query: 491 SC 496 SC Sbjct: 119 SC 120 >UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2; cellular organisms|Rep: Cysteine proteinase, putative - Archaeoglobus fulgidus Length = 1088 Score = 62.1 bits (144), Expect = 2e-08 Identities = 32/77 (41%), Positives = 40/77 (51%) Frame = +2 Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481 +ASLP FD W D L+ VRDQGSCGSCWA AV A+ + S + S + Sbjct: 591 MASLPSRFD----WRDYTGLSAVRDQGSCGSCWAHSAVAALESALIVESGASSSIDLSEQ 646 Query: 482 DLLSCCPICDWDAAEEC 532 LLSC C+ + C Sbjct: 647 HLLSCEQDCEVGIGDWC 663 >UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Giardia intestinalis|Rep: Cathepsin B-like CP1 precursor - Giardia lamblia (Giardia intestinalis) Length = 303 Score = 62.1 bits (144), Expect = 2e-08 Identities = 27/67 (40%), Positives = 38/67 (56%) Frame = +2 Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFS 475 +L+ +P FD RD++P C + DQGSCGSCWAF A+ DR C + +S Sbjct: 74 ELVDPIPPQFDFRDEYPQC--VKPALDQGSCGSCWAFSAIGVFGDRRCAMGIDKEAVSYS 131 Query: 476 AEDLLSC 496 + L+SC Sbjct: 132 QQHLISC 138 >UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Giardia intestinalis|Rep: Cathepsin B-like CP2 precursor - Giardia lamblia (Giardia intestinalis) Length = 300 Score = 60.1 bits (139), Expect = 7e-08 Identities = 27/62 (43%), Positives = 38/62 (61%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 +PE+FD R+++P C + EV DQG CGSCWAF +V DR C K +S + ++ Sbjct: 75 VPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVAGLDKKPVKYSPQYVV 132 Query: 491 SC 496 SC Sbjct: 133 SC 134 >UniRef50_UPI0000E4866E Cluster: PREDICTED: similar to oxidized-LDL responsive gene 2, partial; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to oxidized-LDL responsive gene 2, partial - Strongylocentrotus purpuratus Length = 363 Score = 58.8 bits (136), Expect = 2e-07 Identities = 27/64 (42%), Positives = 38/64 (59%), Gaps = 1/64 (1%) Frame = +2 Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT-KHFHFSAED 484 ++PE FD R +WP + V++QG+C S WA +DR+ SNGT K+ H S + Sbjct: 221 AIPEEFDARAQWPGL--VEGVQNQGNCASSWAMSTAATASDRLAIQSNGTFKYMHLSPQH 278 Query: 485 LLSC 496 LLSC Sbjct: 279 LLSC 282 >UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia ATCC 50803|Rep: GLP_113_4299_5381 - Giardia lamblia ATCC 50803 Length = 360 Score = 58.8 bits (136), Expect = 2e-07 Identities = 26/61 (42%), Positives = 36/61 (59%) Frame = +2 Query: 314 PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLS 493 PE++D RD++P C T EV DQG+CGSCWAF +V+ D C +S + +L Sbjct: 141 PESYDFRDEYPHCIT--EVVDQGNCGSCWAFSSVQTFADHRCRSGLDATGVSYSVQYVLD 198 Query: 494 C 496 C Sbjct: 199 C 199 >UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like protein F26E4.3; n=2; Caenorhabditis|Rep: Uncharacterized peptidase C1-like protein F26E4.3 - Caenorhabditis elegans Length = 491 Score = 58.8 bits (136), Expect = 2e-07 Identities = 27/62 (43%), Positives = 36/62 (58%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 LPE+FD RDKW P ++ V DQG CGS W+ +DR+ S G + S++ LL Sbjct: 223 LPEHFDARDKWG--PLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQLL 280 Query: 491 SC 496 SC Sbjct: 281 SC 282 >UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: hypothetical protein - Strongylocentrotus purpuratus Length = 450 Score = 58.4 bits (135), Expect = 2e-07 Identities = 28/64 (43%), Positives = 35/64 (54%) Frame = +2 Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484 A LPE FD R+ WP ++EV DQG CGS WA +DR+ S G + S + Sbjct: 195 ARLPETFDARENWPGL--IDEVIDQGKCGSSWAISTASVASDRLAIQSMGEINPRLSEQH 252 Query: 485 LLSC 496 LLSC Sbjct: 253 LLSC 256 >UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA, isoform A; n=2; Endopterygota|Rep: PREDICTED: similar to CG3074-PA, isoform A - Tribolium castaneum Length = 445 Score = 56.4 bits (130), Expect = 8e-07 Identities = 27/63 (42%), Positives = 34/63 (53%) Frame = +2 Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487 SLP FD KWP ++E++DQG CGS WA +DR S G + SA+ L Sbjct: 196 SLPREFDSEFKWPGW--MSEIQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQHL 253 Query: 488 LSC 496 LSC Sbjct: 254 LSC 256 >UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 323 Score = 56.4 bits (130), Expect = 8e-07 Identities = 27/77 (35%), Positives = 39/77 (50%) Frame = +2 Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487 ++P +FD R W DC ++ VR+Q SCGSCWA + DR+C S+ S + L Sbjct: 45 TIPASFDVRTNWGDC--MSPVREQQSCGSCWAQVTSGILADRMCIESDKNIKMLLSPQYL 102 Query: 488 LSCCPICDWDAAEECRD 538 + C C D C + Sbjct: 103 MDCDGSCVSDGVSGCNN 119 >UniRef50_O02470 Cluster: Cysteine proteinase; n=2; Chromadorea|Rep: Cysteine proteinase - Globodera pallida Length = 53 Score = 56.0 bits (129), Expect = 1e-06 Identities = 22/41 (53%), Positives = 26/41 (63%) Frame = +2 Query: 377 QGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC 499 QG CG CWAF E ++DR C SNGT+ S DLL+CC Sbjct: 1 QGQCGRCWAFSTAEVISDRTCIASNGTQQPIISPTDLLTCC 41 >UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcoptes scabiei type hominis|Rep: Sar s 1 allergen Yv9053H09 - Sarcoptes scabiei type hominis Length = 253 Score = 53.6 bits (123), Expect = 6e-06 Identities = 28/68 (41%), Positives = 38/68 (55%), Gaps = 4/68 (5%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV----EAMTDRVCTYSNGTKHFHFSA 478 LPE FD RD L+++R+QG CG+CWAF A+ A R N T+ HFS Sbjct: 37 LPEKFDLRD----LGYLSKIRNQGRCGACWAFAALASVESAYNRRTRIVHNRTRKHHFSE 92 Query: 479 EDLLSCCP 502 ++L+ C P Sbjct: 93 QELVDCSP 100 >UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35; Viridiplantae|Rep: Cysteine proteinase 15A precursor - Pisum sativum (Garden pea) Length = 363 Score = 53.6 bits (123), Expect = 6e-06 Identities = 30/75 (40%), Positives = 39/75 (52%) Frame = +2 Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487 +LPE+FD R+K P V+DQGSCGSCWAF A+ Y K S + L Sbjct: 131 NLPEDFDWREKGAVTP----VKDQGSCGSCWAFSTTGALEG--AHYLATGKLVSLSEQQL 184 Query: 488 LSCCPICDWDAAEEC 532 + C +CD + A C Sbjct: 185 VDCDHVCDPEQAGSC 199 >UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lamblia ATCC 50803|Rep: GLP_217_11853_10927 - Giardia lamblia ATCC 50803 Length = 308 Score = 52.8 bits (121), Expect = 1e-05 Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 1/86 (1%) Frame = +2 Query: 242 GSYRDEHFATLPIKT-HNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVE 418 GS R + P++ N D + P++FD R+++P C T EV D G C S WA+ AV+ Sbjct: 54 GSPRTQSSIVRPVRVPENEDPV---PDHFDFREEYPQCIT--EVIDIGLCSSSWAYSAVD 108 Query: 419 AMTDRVCTYSNGTKHFHFSAEDLLSC 496 A + R C + +SA+ +LSC Sbjct: 109 AFSHRRCLTGLDQEATRYSAQYILSC 134 >UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - Drosophila melanogaster (Fruit fly) Length = 431 Score = 52.4 bits (120), Expect = 1e-05 Identities = 23/62 (37%), Positives = 34/62 (54%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 LP +F+ DKW ++EV DQG CG+ W +DR S G ++ SA+++L Sbjct: 187 LPSSFNALDKWSSY--ISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNIL 244 Query: 491 SC 496 SC Sbjct: 245 SC 246 >UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin O precursor; n=2; Apocrita|Rep: PREDICTED: similar to Cathepsin O precursor - Apis mellifera Length = 374 Score = 52.0 bits (119), Expect = 2e-05 Identities = 32/93 (34%), Positives = 44/93 (47%) Frame = +2 Query: 218 VRAS*ENNGSYRDEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSC 397 +R N SY +H I S+P FD RDK P VR QGSCG+C Sbjct: 128 IRGEKHMNASYHRKH----QISIDRMKRSISIPLRFDWRDKGVITP----VRSQGSCGAC 179 Query: 398 WAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 WAF +E + + + NGT H S ++++ C Sbjct: 180 WAFSTIEVI-ESMFAIKNGTLH-SLSVQEMIDC 210 >UniRef50_UPI0000E815AE Cluster: PREDICTED: similar to glucocorticoid-inducible protein; n=1; Gallus gallus|Rep: PREDICTED: similar to glucocorticoid-inducible protein - Gallus gallus Length = 307 Score = 51.6 bits (118), Expect = 2e-05 Identities = 24/62 (38%), Positives = 33/62 (53%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 LP +FD KWP ++E DQG+C WAF +DR+ +S G S ++LL Sbjct: 153 LPRHFDAATKWPGM--IHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMTPSLSPQNLL 210 Query: 491 SC 496 SC Sbjct: 211 SC 212 >UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to GM06507p - Nasonia vitripennis Length = 483 Score = 51.2 bits (117), Expect = 3e-05 Identities = 23/62 (37%), Positives = 33/62 (53%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 LP FD R +W + + V+DQG CG+ WA V+ +DR S G + S + L+ Sbjct: 236 LPREFDSRIQWGN--DITPVQDQGWCGASWAISTVDVASDRFAIMSKGIEKVQLSGQHLI 293 Query: 491 SC 496 SC Sbjct: 294 SC 295 >UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=15; Magnoliophyta|Rep: Cysteine proteinase RD19a precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 368 Score = 51.2 bits (117), Expect = 3e-05 Identities = 27/75 (36%), Positives = 39/75 (52%) Frame = +2 Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487 +LPE+FD W D + V++QGSCGSCW+F A A+ + K S + L Sbjct: 134 NLPEDFD----WRDHGAVTPVKNQGSCGSCWSFSATGALEG--ANFLATGKLVSLSEQQL 187 Query: 488 LSCCPICDWDAAEEC 532 + C CD + A+ C Sbjct: 188 VDCDHECDPEEADSC 202 >UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheirus salmonis|Rep: Cysteine proteinase - Lepeophtheirus salmonis (salmon louse) Length = 372 Score = 50.8 bits (116), Expect = 4e-05 Identities = 26/65 (40%), Positives = 36/65 (55%) Frame = +2 Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481 I LPE+ D R+K + +V++QGSCGSCW F AVE + V +N T S + Sbjct: 112 IKDLPESVDWREKG----VITDVKNQGSCGSCWVFSAVEQIESYVAIENNMTSPPLLSTQ 167 Query: 482 DLLSC 496 + SC Sbjct: 168 QITSC 172 >UniRef50_Q9GZM7-2 Cluster: Isoform 2 of Q9GZM7 ; n=1; Homo sapiens|Rep: Isoform 2 of Q9GZM7 - Homo sapiens (Human) Length = 283 Score = 49.6 bits (113), Expect = 1e-04 Identities = 24/62 (38%), Positives = 34/62 (54%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 LP F+ +KWP+ ++E DQG+C WAF +DRV +S G S ++LL Sbjct: 69 LPTAFEASEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLL 126 Query: 491 SC 496 SC Sbjct: 127 SC 128 >UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadidae|Rep: Cysteine protease - Tritrichomonas foetus (Trichomonas foetus) Length = 315 Score = 49.6 bits (113), Expect = 1e-04 Identities = 24/63 (38%), Positives = 36/63 (57%) Frame = +2 Query: 320 NFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC 499 N D D W + +NE++DQ +CGSCWAF A++A + S GT +S ++L+ C Sbjct: 100 NVDSID-WREKGVVNEIKDQAACGSCWAFSAIQA-AESAYAISTGTLE-SYSEQNLVDCV 156 Query: 500 PIC 508 C Sbjct: 157 QGC 159 >UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-like precursor; n=26; Euteleostomi|Rep: Tubulointerstitial nephritis antigen-like precursor - Homo sapiens (Human) Length = 467 Score = 49.6 bits (113), Expect = 1e-04 Identities = 24/62 (38%), Positives = 34/62 (54%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 LP F+ +KWP+ ++E DQG+C WAF +DRV +S G S ++LL Sbjct: 203 LPTAFEASEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLL 260 Query: 491 SC 496 SC Sbjct: 261 SC 262 >UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio molitor (Yellow mealworm) Length = 336 Score = 49.2 bits (112), Expect = 1e-04 Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%) Frame = +2 Query: 254 DEHFATLPIKTH-NFDLIASL--PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424 D H +PIKT + L AS+ P +FD W D ++ V++QGSCGSCWAF + A+ Sbjct: 99 DLHKNGIPIKTREDLGLNASVRYPASFD----WRDQGMVSPVKNQGSCGSCWAFSSTGAI 154 Query: 425 TDRVCTYSNGTKHFHFSAEDLLSCCP 502 ++ + S + L+ C P Sbjct: 155 ESQMKIANGAGYDSSVSEQQLVDCVP 180 >UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 306 Score = 49.2 bits (112), Expect = 1e-04 Identities = 19/56 (33%), Positives = 34/56 (60%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPIC 508 W + +N++++QG+CGSCWAF A++ + +V N + + S ++LL C C Sbjct: 94 WREQGIVNKIKNQGACGSCWAFSAIQVIESQVA--KNQKQLYDLSEQNLLDCVTSC 147 >UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin L-like cysteine proteinase precursor - Acanthoscelides obtectus (Bean weevil) Length = 321 Score = 48.4 bits (110), Expect = 2e-04 Identities = 27/72 (37%), Positives = 42/72 (58%) Frame = +2 Query: 290 NFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFH 469 +FD + +P+ D R+K + EV+ QG+CGSCWAF AV ++ +V NG+ Sbjct: 103 SFDNVNDIPKTVDWREKG----AVTEVKKQGNCGSCWAFSAVGSIEGQV-FLKNGSLE-S 156 Query: 470 FSAEDLLSCCPI 505 SA++L+ C I Sbjct: 157 LSAQNLVDCAGI 168 >UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 234 Score = 48.4 bits (110), Expect = 2e-04 Identities = 25/70 (35%), Positives = 38/70 (54%) Frame = +2 Query: 299 LIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSA 478 ++ +P+ D R K +NE++DQ CGSCWAFG+ AM + +GT + S Sbjct: 14 IVGDIPDEIDYRTKG----AVNEIKDQKHCGSCWAFGSCAAM-ESSWFLKHGTL-YSLSE 67 Query: 479 EDLLSCCPIC 508 + L+ CC C Sbjct: 68 QCLVDCCHDC 77 >UniRef50_Q58HK5 Cluster: Hc58; n=1; Haemonchus contortus|Rep: Hc58 - Haemonchus contortus (Barber pole worm) Length = 241 Score = 47.6 bits (108), Expect = 4e-04 Identities = 17/29 (58%), Positives = 21/29 (72%) Frame = +2 Query: 368 VRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454 +RDQ +CGSCWA A E M+DR C +S G Sbjct: 108 IRDQSNCGSCWAVSAAETMSDRACIHSKG 136 >UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine proteinase precursor - Heterodera glycines (Soybean cyst nematode worm) Length = 353 Score = 47.6 bits (108), Expect = 4e-04 Identities = 22/70 (31%), Positives = 33/70 (47%) Frame = +2 Query: 287 HNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHF 466 HN +A + W + + EV+DQG CGSCWAF A A+ + +K Sbjct: 123 HNMATLAGNSSTLPEKLDWREKGAVTEVKDQGDCGSCWAFSATGAI-EGALAQKKASKII 181 Query: 467 HFSAEDLLSC 496 S ++L+ C Sbjct: 182 SLSEQNLVDC 191 >UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_21, whole genome shotgun sequence - Paramecium tetraurelia Length = 349 Score = 47.6 bits (108), Expect = 4e-04 Identities = 23/58 (39%), Positives = 34/58 (58%) Frame = +2 Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICDWDAAEEC 532 ++EV++QGSCGSCWAF AV A+ G K+ S ++L+ C + D +E C Sbjct: 137 VSEVKNQGSCGSCWAFSAVAAL--ETALRQGGVKNVELSEQELVDCA-VKDEFESEGC 191 >UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; Phytophthora infestans|Rep: Cathepsin-like cysteine protease - Phytophthora infestans (Potato late blight fungus) Length = 376 Score = 47.2 bits (107), Expect = 5e-04 Identities = 31/96 (32%), Positives = 46/96 (47%) Frame = +2 Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481 + LP +D W + T+ V++QG CGSCWAF AV AM C Y+ T +E Sbjct: 130 VEDLPATWD----WREHSTVTPVKNQGQCGSCWAFSAVAAME---CAYALSTGTLESLSE 182 Query: 482 DLLSCCPICDWDAAEECRD*LGNIGSTSV*YQEVVT 589 L C + + + C + G S Y+E++T Sbjct: 183 QELVDCTL---NGIDTC----NHGGEMSEGYEEIIT 211 >UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestinalis|Rep: GLP_41_8294_9919 - Giardia lamblia ATCC 50803 Length = 541 Score = 47.2 bits (107), Expect = 5e-04 Identities = 29/68 (42%), Positives = 38/68 (55%), Gaps = 5/68 (7%) Frame = +2 Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN-----GTKHFHF 472 +LP++FD RD + V DQG+CGSC+ FGAV+AM R+ +N GTK Sbjct: 240 TLPDDFDWRDV-NGVSYIPGVLDQGACGSCFTFGAVQAMNSRIMIATNRTDPVGTKTI-L 297 Query: 473 SAEDLLSC 496 S E L C Sbjct: 298 STEHALDC 305 >UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Trypanosoma cruzi|Rep: Cysteine proteinase, putative - Trypanosoma cruzi Length = 392 Score = 47.2 bits (107), Expect = 5e-04 Identities = 26/64 (40%), Positives = 32/64 (50%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 +P+ D R+ P L V+DQG CGSCWA GA E M + G H S + L Sbjct: 141 IPDEVDYRNSSP--AILTAVKDQGRCGSCWAHGAAEEMESHFAILT-GRLHV-LSQQQLT 196 Query: 491 SCCP 502 SC P Sbjct: 197 SCAP 200 >UniRef50_Q6XM39 Cluster: FirrV-1-A48 precursor; n=1; Feldmannia irregularis virus a|Rep: FirrV-1-A48 precursor - Feldmannia irregularis virus a Length = 373 Score = 46.8 bits (106), Expect = 7e-04 Identities = 17/41 (41%), Positives = 25/41 (60%) Frame = +2 Query: 374 DQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 DQGSC SCW+ V+ + DRV +NG S ++++SC Sbjct: 80 DQGSCASCWSISVVQMLADRVSVSTNGKIKLKLSVQEMISC 120 >UniRef50_P81494 Cluster: Cathepsin B; n=2; Phasianidae|Rep: Cathepsin B - Coturnix coturnix japonica (Japanese quail) Length = 48 Score = 46.8 bits (106), Expect = 7e-04 Identities = 16/25 (64%), Positives = 22/25 (88%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGS 385 LP+ FD R +WP+CPT++E+RDQGS Sbjct: 1 LPDTFDSRKQWPNCPTISEIRDQGS 25 >UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin C; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin C - Strongylocentrotus purpuratus Length = 482 Score = 46.4 bits (105), Expect = 0.001 Identities = 23/64 (35%), Positives = 35/64 (54%) Frame = +2 Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484 ++LPE FD RD ++ VRDQG CGSC+AF + R+ +N S ++ Sbjct: 247 SNLPEKFDWRDVG-GIDYVSPVRDQGICGSCYAFASTATQESRLRVMTNNNVKVVMSPQE 305 Query: 485 LLSC 496 ++SC Sbjct: 306 VVSC 309 >UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gambiae str. PEST|Rep: ENSANGP00000018713 - Anopheles gambiae str. PEST Length = 559 Score = 46.4 bits (105), Expect = 0.001 Identities = 20/38 (52%), Positives = 25/38 (65%) Frame = +2 Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 415 + LP +FD W D + EV++QGSCGSCWAF AV Sbjct: 336 VGDLPRSFD----WRDHGAVTEVKNQGSCGSCWAFSAV 369 >UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 336 Score = 45.6 bits (103), Expect = 0.002 Identities = 22/62 (35%), Positives = 31/62 (50%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 LP +FD W D L++V+DQG CGSCWAF + + + FS + L+ Sbjct: 125 LPASFD----WRDYGILSDVKDQGQCGSCWAFSTTGIL--EALYFMENRQKISFSEQQLV 178 Query: 491 SC 496 C Sbjct: 179 DC 180 >UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Rep: Cysteine protease - Solanum lycopersicum (Tomato) (Lycopersicon esculentum) Length = 345 Score = 45.6 bits (103), Expect = 0.002 Identities = 24/83 (28%), Positives = 42/83 (50%), Gaps = 2/83 (2%) Frame = +2 Query: 254 DEHFATLPIKTHNFDLIASLPENFDPRD-KWPDCPTLNEVRDQGSCGSCWAFGAVEAMTD 430 + + + P+ + F I L +++ P + W + + +V+ QG CG CWAF AV ++ Sbjct: 107 NSYLSPSPMSSTEFKKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEG 166 Query: 431 RVCTYSNGTKH-FHFSAEDLLSC 496 Y T + FS ++LL C Sbjct: 167 ---AYKIATGNLMEFSEQELLDC 186 >UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Bigelowiella natans|Rep: Digestive cysteine proteinase - Bigelowiella natans (Pedinomonas minutissima) (Chlorarachnion sp.(strain CCMP 621)) Length = 360 Score = 45.6 bits (103), Expect = 0.002 Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 2/54 (3%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT--KHFHFSAEDLLSC 496 W D L V+DQG CGSCWAF A +A+ N T S E L+ C Sbjct: 115 WRDFNALTPVKDQGGCGSCWAFSATQALESAHYIKHNDTLDSPIALSTEQLVEC 168 >UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: Vivapain-4 - Plasmodium vivax Length = 484 Score = 45.6 bits (103), Expect = 0.002 Identities = 19/52 (36%), Positives = 31/52 (59%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 W + ++E+++Q CGSCWAFGAV A+ + N +H S ++L+ C Sbjct: 268 WREHNAVSEIKNQNLCGSCWAFGAVGAVESQYAIRKN--QHVLISEQELVDC 317 >UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Oryza sativa|Rep: Cysteine protease 1 precursor - Oryza sativa subsp. japonica (Rice) Length = 490 Score = 45.6 bits (103), Expect = 0.002 Identities = 25/69 (36%), Positives = 37/69 (53%), Gaps = 7/69 (10%) Frame = +2 Query: 239 NGSYRDEHFATLPI-------KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSC 397 NG +R + T P + + D + +LP++ D RDK + V++QG CGSC Sbjct: 124 NGEFRATYLGTTPAGRGRRVGEAYRHDGVEALPDSVDWRDKGA---VVAPVKNQGQCGSC 180 Query: 398 WAFGAVEAM 424 WAF AV A+ Sbjct: 181 WAFSAVAAV 189 >UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 280 Score = 45.2 bits (102), Expect = 0.002 Identities = 22/64 (34%), Positives = 37/64 (57%) Frame = +2 Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484 +SLP+ FD W + + +V++QG+CGSCWAF + + + + N T +S ++ Sbjct: 66 SSLPQQFD----WRNLGKVTQVKNQGNCGSCWAF-TITGLFESINLIRNKTVEL-YSEQE 119 Query: 485 LLSC 496 LL C Sbjct: 120 LLDC 123 >UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted, possible transmembrane domain near N-terminus; n=4; Cryptosporidium|Rep: Cryptopain-cysteine proteinase secreted, possible transmembrane domain near N-terminus - Cryptosporidium parvum Iowa II Length = 401 Score = 45.2 bits (102), Expect = 0.002 Identities = 19/47 (40%), Positives = 27/47 (57%), Gaps = 2/47 (4%) Frame = +2 Query: 317 ENFDPRDK--WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 451 E F P + W + +N +R+Q +CGSCWAF AV A+ C +N Sbjct: 172 EEFVPPNSINWVEAGCVNPIRNQKNCGSCWAFSAVAALEGATCAQTN 218 >UniRef50_Q23H32 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 365 Score = 45.2 bits (102), Expect = 0.002 Identities = 26/72 (36%), Positives = 37/72 (51%) Frame = +2 Query: 290 NFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFH 469 +F L S+PE+ D R+K + V+ QG CGSCWAF V A+ + Sbjct: 128 SFLLSDSVPESVDWREK-----LVAPVQKQGGCGSCWAFSTVIALEGAYAKQTGNV--IK 180 Query: 470 FSAEDLLSCCPI 505 FS ++L+ CC I Sbjct: 181 FSEQNLIDCCRI 192 >UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin B-like cysteine peptidase - Trichomonas vaginalis G3 Length = 288 Score = 45.2 bits (102), Expect = 0.002 Identities = 22/63 (34%), Positives = 35/63 (55%) Frame = +2 Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487 S+P +++ +++P C V DQG CGSCW+F ++ + R C N K FS L Sbjct: 67 SIPMSYNFTERFPQCDF--GVLDQGKCGSCWSFAVSKSFSHRYCRKYN--KPVLFSQSHL 122 Query: 488 LSC 496 ++C Sbjct: 123 VAC 125 >UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicalis|Rep: Blo t 1 allergen - Blomia tropicalis (Mite) Length = 333 Score = 45.2 bits (102), Expect = 0.002 Identities = 21/35 (60%), Positives = 23/35 (65%) Frame = +2 Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGA 412 SLP+NFD R K L +R QGSCGSCWAF A Sbjct: 112 SLPQNFDWRQK----ARLTRIRQQGSCGSCWAFAA 142 >UniRef50_Q9UJW2 Cluster: Tubulointerstitial nephritis antigen; n=20; Amniota|Rep: Tubulointerstitial nephritis antigen - Homo sapiens (Human) Length = 476 Score = 45.2 bits (102), Expect = 0.002 Identities = 23/72 (31%), Positives = 32/72 (44%) Frame = +2 Query: 284 THNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKH 463 T + LPE F KWP + DQ +C + WAF DR+ S G Sbjct: 208 TASLPATTDLPEFFVASYKWPGWT--HGPLDQKNCAASWAFSTASVAADRIAIQSKGRYT 265 Query: 464 FHFSAEDLLSCC 499 + S ++L+SCC Sbjct: 266 ANLSPQNLISCC 277 >UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin O; n=1; Danio rerio|Rep: PREDICTED: similar to cathepsin O - Danio rerio Length = 327 Score = 44.8 bits (101), Expect = 0.003 Identities = 22/59 (37%), Positives = 29/59 (49%) Frame = +2 Query: 320 NFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 N PR W D + V +QGSCG CWAF VEA+ + G K S + ++ C Sbjct: 119 NNPPRFDWRDHGVVGPVHNQGSCGGCWAFSIVEAIES--VSAKVGEKLQQLSVQQVIDC 175 >UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag-RP - Bombyx mori (Silk moth) Length = 404 Score = 44.8 bits (101), Expect = 0.003 Identities = 22/61 (36%), Positives = 32/61 (52%) Frame = +2 Query: 314 PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLS 493 P+ FD R +W ++ + DQ CGS WA + DR S GT++ S++ LLS Sbjct: 186 PDEFDARREWYGY--ISPIADQDWCGSDWAVSIASIVGDRFSIQSFGTENVRMSSQTLLS 243 Query: 494 C 496 C Sbjct: 244 C 244 >UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 344 Score = 44.4 bits (100), Expect = 0.004 Identities = 21/60 (35%), Positives = 30/60 (50%) Frame = +2 Query: 317 ENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 +N P D W + + V+ QG CGSCW F A A+ + NG +FS + +L C Sbjct: 134 KNAPPMD-WRNASAITPVKQQGKCGSCWTF-ASTAVLESFSFIKNGAPLTNFSEQQILDC 191 >UniRef50_Q7QVE9 Cluster: GLP_542_3431_1206; n=1; Giardia lamblia ATCC 50803|Rep: GLP_542_3431_1206 - Giardia lamblia ATCC 50803 Length = 741 Score = 44.4 bits (100), Expect = 0.004 Identities = 30/82 (36%), Positives = 43/82 (52%), Gaps = 1/82 (1%) Frame = +2 Query: 254 DEHFATLPIKTHNFDLI-ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTD 430 ++ + LP N DL A+LP NF R ++ +QGSCG C+A AVE +T Sbjct: 40 EDEYNELPDGPDNADLTRAALPTNFTYRGH-----RCIQIINQGSCGCCYAAAAVEMVTA 94 Query: 431 RVCTYSNGTKHFHFSAEDLLSC 496 R C N ++ S EDL++C Sbjct: 95 RRCLQLNDSR--LVSLEDLVTC 114 >UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine protease; n=11; Callosobruchus maculatus|Rep: Putative gut cathepsin L-like cysteine protease - Callosobruchus maculatus (Southern cowpea weevil) (Pulse bruchid) Length = 326 Score = 44.0 bits (99), Expect = 0.005 Identities = 25/75 (33%), Positives = 36/75 (48%) Frame = +2 Query: 272 LPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 451 LP +FD + W + + V+DQ +CGSCWAF AV A+ + N Sbjct: 95 LPSNAVHFDNFEDIDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAIEGQFFK-KN 153 Query: 452 GTKHFHFSAEDLLSC 496 GT SA++L+ C Sbjct: 154 GTL-VSLSAQELVDC 167 >UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 326 Score = 44.0 bits (99), Expect = 0.005 Identities = 20/44 (45%), Positives = 26/44 (59%) Frame = +2 Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 406 P H+ + LP FD R+K + EV+DQGSCGSCW+F Sbjct: 98 PRVIHSLTPVKDLPSKFDWREKG----AVTEVKDQGSCGSCWSF 137 >UniRef50_Q23TW3 Cluster: Papain family cysteine protease containing protein; n=5; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 437 Score = 44.0 bits (99), Expect = 0.005 Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 3/80 (3%) Frame = +2 Query: 266 ATLPIKTHNFDL--IASLPENFDPRDKWPDCPTLNEVRDQGS-CGSCWAFGAVEAMTDRV 436 AT T +F ++ LP+ D R+K + +V+ QG CGSCWAF AV A+ Sbjct: 188 ATAQANTRSFRKYDLSQLPQYVDWREKG----VVTQVKSQGKDCGSCWAFAAVAALESHY 243 Query: 437 CTYSNGTKHFHFSAEDLLSC 496 G K FS + L+ C Sbjct: 244 -ALKTGKKPIQFSEQQLVDC 262 >UniRef50_Q23RT7 Cluster: Papain family cysteine protease containing protein; n=7; Hymenostomatida|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 387 Score = 44.0 bits (99), Expect = 0.005 Identities = 25/73 (34%), Positives = 37/73 (50%) Frame = +2 Query: 278 IKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT 457 +KT + + LP++ D W D + V+DQG CGSCWAF A A+ + + G Sbjct: 122 LKTSDKINVKDLPKSVD----WRDAGVVTPVKDQGHCGSCWAF-ATTAVIESYAAIATGQ 176 Query: 458 KHFHFSAEDLLSC 496 S + L+SC Sbjct: 177 LK-TLSTQQLVSC 188 >UniRef50_A0CYS7 Cluster: Chromosome undetermined scaffold_31, whole genome shotgun sequence; n=3; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_31, whole genome shotgun sequence - Paramecium tetraurelia Length = 358 Score = 44.0 bits (99), Expect = 0.005 Identities = 20/62 (32%), Positives = 34/62 (54%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 +PE+++ R+ P+C + QG+C S ++ AV A +DR+C NG S + + Sbjct: 131 IPESYNFREAQPECA--QPIYFQGNCSSSYSIAAVSATSDRLCKSKNGEFQDQLSPQSPI 188 Query: 491 SC 496 SC Sbjct: 189 SC 190 >UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; Eukaryota|Rep: Cysteine proteinase 1 precursor - Zea mays (Maize) Length = 371 Score = 44.0 bits (99), Expect = 0.005 Identities = 25/74 (33%), Positives = 35/74 (47%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 LP++FD W D + V++QGSCGSCW+F A A+ Y K S + + Sbjct: 137 LPDDFD----WRDHGAVGPVKNQGSCGSCWSFSASGALEG--AHYLATGKLEVLSEQQFV 190 Query: 491 SCCPICDWDAAEEC 532 C CD + C Sbjct: 191 DCDHECDSSEPDSC 204 >UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens (Human) Length = 331 Score = 44.0 bits (99), Expect = 0.005 Identities = 26/62 (41%), Positives = 38/62 (61%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 LP++ D R+K C T EV+ QGSCG+CWAF AV A+ ++ + K SA++L+ Sbjct: 115 LPDSVDWREK--GCVT--EVKYQGSCGACWAFSAVGALEAQLKLKTG--KLVSLSAQNLV 168 Query: 491 SC 496 C Sbjct: 169 DC 170 >UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa (Rice) Length = 339 Score = 43.6 bits (98), Expect = 0.006 Identities = 26/65 (40%), Positives = 34/65 (52%) Frame = +2 Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481 I +LP D R K P ++DQG CG CWAF AV AM + + S G K S + Sbjct: 120 IDTLPATVDWRTKGAVTP----IKDQGQCGCCWAFSAVAAM-EGIVKLSTG-KLISLSEQ 173 Query: 482 DLLSC 496 +L+ C Sbjct: 174 ELVDC 178 >UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Rep: Cathepsin L - Stylonychia lemnae Length = 340 Score = 43.6 bits (98), Expect = 0.006 Identities = 18/44 (40%), Positives = 27/44 (61%) Frame = +2 Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 433 + +PE+ D R+K +N V+DQG CGSCWAF + ++ R Sbjct: 122 LKDIPESIDWREKG----AVNAVKDQGQCGSCWAFSTIASLESR 161 >UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypanosoma cruzi|Rep: Cysteine protease, putative - Trypanosoma cruzi Length = 434 Score = 43.6 bits (98), Expect = 0.006 Identities = 21/48 (43%), Positives = 29/48 (60%) Frame = +2 Query: 353 PTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 P L V+DQGSCGSCWA A E++ + + S+G K S + + SC Sbjct: 137 PVLTPVKDQGSCGSCWAHAATESV-ESMYAISSG-KLLTLSTQQITSC 182 >UniRef50_Q23FR0 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 394 Score = 43.6 bits (98), Expect = 0.006 Identities = 21/46 (45%), Positives = 26/46 (56%) Frame = +2 Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 LN V+DQG CGSCW FGA M + +NG FS + L+ C Sbjct: 196 LNPVKDQGQCGSCWTFGAAGVM-ESFNAITNGVLK-SFSEQQLVDC 239 >UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina). Cathepsin L; n=2; Dictyostelium discoideum|Rep: Similar to Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina). Cathepsin L - Dictyostelium discoideum (Slime mold) Length = 265 Score = 43.2 bits (97), Expect = 0.008 Identities = 25/74 (33%), Positives = 41/74 (55%) Frame = +2 Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454 P K HN + A++P++FD W D + +V++QGSC SCW+F A+ A+ Y Sbjct: 38 PFK-HNVN--ATIPKSFD----WRDHGAVGKVKNQGSCASCWSFSALGALEGHY--YIKY 88 Query: 455 TKHFHFSAEDLLSC 496 + S ++L+ C Sbjct: 89 GELLDLSEQNLVDC 102 >UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1; Dictyostelium discoideum AX4|Rep: Counting factor associated protein - Dictyostelium discoideum AX4 Length = 531 Score = 43.2 bits (97), Expect = 0.008 Identities = 24/70 (34%), Positives = 39/70 (55%) Frame = +2 Query: 287 HNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHF 466 H+ + + S+P D R++ +C T V+DQG CGSCW FG+ ++ C +NG + Sbjct: 301 HDDESLRSIPSTVDWRNQ--NCVT--PVKDQGICGSCWTFGSTGSLEGTNCV-TNG-ELV 354 Query: 467 HFSAEDLLSC 496 S + L+ C Sbjct: 355 SLSEQQLVDC 364 >UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 395 Score = 43.2 bits (97), Expect = 0.008 Identities = 22/55 (40%), Positives = 29/55 (52%), Gaps = 3/55 (5%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKH---FHFSAEDLLSC 496 W D T VRDQG C SCW FG++ A+ R NG H SA++ ++C Sbjct: 194 WSDYQT--PVRDQGECKSCWVFGSLAALESRY-LIKNGVSEKSTLHLSAQNAMNC 245 >UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Longidorus elongatus Length = 358 Score = 43.2 bits (97), Expect = 0.008 Identities = 23/68 (33%), Positives = 35/68 (51%), Gaps = 2/68 (2%) Frame = +2 Query: 299 LIASLPENFDPRDK--WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHF 472 +I +P+N D W + +V+DQGSCGSCWAF A ++ + Y K Sbjct: 129 MIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCWAFSATGSLEGQ--HYKQTGKLVSL 186 Query: 473 SAEDLLSC 496 S ++L+ C Sbjct: 187 SEQNLVDC 194 >UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanensis|Rep: Sui m 1 allergen - Suidasia medanensis Length = 336 Score = 43.2 bits (97), Expect = 0.008 Identities = 23/67 (34%), Positives = 31/67 (46%) Frame = +2 Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFS 475 D+ +LP FD R +W VR+QG CGSCWAF + + N H S Sbjct: 110 DISVALPAAFDWRQQWNTA-----VRNQGQCGSCWAFATAATVEAQYAIRKN--VHVTLS 162 Query: 476 AEDLLSC 496 + L+ C Sbjct: 163 EQQLVDC 169 >UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; n=35; Fasciola|Rep: Cathepsin L-like proteinase precursor - Fasciola hepatica (Liver fluke) Length = 326 Score = 43.2 bits (97), Expect = 0.008 Identities = 19/52 (36%), Positives = 26/52 (50%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 W + + EV+DQG+CGSCWAF M + N FS + L+ C Sbjct: 114 WRESGYVTEVKDQGNCGSCWAFSTTGTMEGQY--MKNERTSISFSEQQLVDC 163 >UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative uncharacterized protein - Oryza sativa subsp. japonica (Rice) Length = 326 Score = 42.7 bits (96), Expect = 0.011 Identities = 17/41 (41%), Positives = 25/41 (60%) Frame = +2 Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424 +A++ + P W + + V+DQG CGSCWAF VEA+ Sbjct: 110 LAAVAGDAPPAWDWREHGAVTRVKDQGPCGSCWAFSVVEAV 150 >UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1; Brugia malayi|Rep: Cathepsin F-like cysteine proteinase - Brugia malayi (Filarial nematode worm) Length = 461 Score = 42.7 bits (96), Expect = 0.011 Identities = 26/71 (36%), Positives = 38/71 (53%), Gaps = 1/71 (1%) Frame = +2 Query: 287 HNFDL-IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKH 463 ++F+L I +LP FD W + V+DQGSCGSCWAF +V + + G K Sbjct: 239 NDFNLSIYNLPSKFD----WRTEGVVTPVKDQGSCGSCWAF-SVTGNIESLWAIKTG-KL 292 Query: 464 FHFSAEDLLSC 496 S ++L+ C Sbjct: 293 ISLSEQELIDC 303 >UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1; Uronema marinum|Rep: Cathepsin L-like cysteine protease - Uronema marinum Length = 333 Score = 42.7 bits (96), Expect = 0.011 Identities = 22/54 (40%), Positives = 32/54 (59%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCP 502 W + V++QG CGSCWAF AV ++ +R+ + G K FS + L+SC P Sbjct: 126 WVSKGAVQGVQNQGVCGSCWAFSAVCSL-ERLYKINTG-KLLSFSEQQLVSCEP 177 >UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n=21; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Globodera pallida Length = 379 Score = 42.7 bits (96), Expect = 0.011 Identities = 24/48 (50%), Positives = 30/48 (62%), Gaps = 4/48 (8%) Frame = +2 Query: 302 IASLPENFDPRDK-WPDCPTLNEVRDQGSCGSCWAF---GAVEAMTDR 433 + LPE+ D RDK W + EV++QG CGSCWAF GA+EA R Sbjct: 158 VGDLPESVDWRDKGW-----VTEVKNQGMCGSCWAFSSTGALEAQHAR 200 >UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foetus|Rep: TFCP2 protein - Tritrichomonas foetus (Trichomonas foetus) Length = 270 Score = 42.7 bits (96), Expect = 0.011 Identities = 22/61 (36%), Positives = 31/61 (50%) Frame = +2 Query: 314 PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLS 493 P +FD W +N +++QGSCGSCWAF A+ A C + FS + L+ Sbjct: 51 PTSFD----WRSEGKVNPIKNQGSCGSCWAFSAIAAQES--CHAIATGELLRFSEQSLVD 104 Query: 494 C 496 C Sbjct: 105 C 105 >UniRef50_Q24FN2 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 367 Score = 42.7 bits (96), Expect = 0.011 Identities = 20/52 (38%), Positives = 31/52 (59%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 W ++ V++QGSCGSCWAF AV A+ + V N + +S ++L+ C Sbjct: 161 WRQSGAVSPVKNQGSCGSCWAFSAV-ALAESVNLLRNNSLAL-YSEQELVDC 210 >UniRef50_Q231X3 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 323 Score = 42.7 bits (96), Expect = 0.011 Identities = 16/52 (30%), Positives = 26/52 (50%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 W + ++ V+ QG+CGSCWAF A ++ + K S + L+ C Sbjct: 121 WVEAGKVSNVKSQGNCGSCWAFSATASVESALIIAGKVDKSISLSEQQLIDC 172 >UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 462 Score = 42.7 bits (96), Expect = 0.011 Identities = 18/43 (41%), Positives = 28/43 (65%) Frame = +2 Query: 368 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 VRDQ +CGSCWA A EA++ ++ +S G +F S + ++ C Sbjct: 242 VRDQANCGSCWAQSAGEAISSQISLHSKG--NFTVSIQQIMDC 282 >UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_56, whole genome shotgun sequence - Paramecium tetraurelia Length = 314 Score = 42.7 bits (96), Expect = 0.011 Identities = 27/83 (32%), Positives = 39/83 (46%), Gaps = 2/83 (2%) Frame = +2 Query: 254 DEHFATLPI--KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMT 427 +E FA L + K +L A L P D + V++QG+CGSCWAF AV A+ Sbjct: 85 NEEFAALLLTRKESPMNLDAELYVPQGPLKASADWSKITSVKNQGNCGSCWAFSAVGAVE 144 Query: 428 DRVCTYSNGTKHFHFSAEDLLSC 496 + +K S + L+ C Sbjct: 145 TLLTIKGVISKDLWLSEQQLVDC 167 >UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_23, whole genome shotgun sequence - Paramecium tetraurelia Length = 321 Score = 42.7 bits (96), Expect = 0.011 Identities = 25/90 (27%), Positives = 41/90 (45%), Gaps = 5/90 (5%) Frame = +2 Query: 242 GSYRDEHFATLPIKTHNFDLIASLPENFDP-----RDKWPDCPTLNEVRDQGSCGSCWAF 406 G D+ F T+ + + ++ +N +P W + ++DQG CGSCWAF Sbjct: 87 GDLTDQEFLTIYLNLQMPARVKNIQKNEEPFLVQEEVDWVQKGKVPAIKDQGDCGSCWAF 146 Query: 407 GAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 AV A+ + T + S +DL+ C Sbjct: 147 SAVGAL--EINTKIQFNEIVDLSEQDLVDC 174 >UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_101, whole genome shotgun sequence - Paramecium tetraurelia Length = 306 Score = 42.7 bits (96), Expect = 0.011 Identities = 24/63 (38%), Positives = 37/63 (58%) Frame = +2 Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487 +LPE+ D K +N V++QG+CGS W+F AV A + + GT HF +S ++L Sbjct: 109 NLPESVDWSSK------MNPVKNQGTCGSGWSFSAVGAF-EAFFIFVKGT-HFQYSEQNL 160 Query: 488 LSC 496 + C Sbjct: 161 VDC 163 >UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; Leishmania|Rep: Cysteine proteinase 2 precursor - Leishmania pifanoi Length = 444 Score = 42.7 bits (96), Expect = 0.011 Identities = 24/65 (36%), Positives = 37/65 (56%) Frame = +2 Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481 ++++P+ D R+K P V+DQG+CGSCWAF AV + + Y G + S + Sbjct: 123 LSAVPDAVDWREKGAVTP----VKDQGACGSCWAFSAVGNIEGQ--WYLAGHELVSLSEQ 176 Query: 482 DLLSC 496 L+SC Sbjct: 177 QLVSC 181 >UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 precursor; n=4; Schizophora|Rep: Putative cysteine proteinase CG12163 precursor - Drosophila melanogaster (Fruit fly) Length = 614 Score = 42.7 bits (96), Expect = 0.011 Identities = 23/62 (37%), Positives = 32/62 (51%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 LP+ FD R K + +V++QGSCGSCWAF + + K FS ++LL Sbjct: 394 LPKEFDWRQK----DAVTQVKNQGSCGSCWAFSVTGNIEGLYAVKTGELK--EFSEQELL 447 Query: 491 SC 496 C Sbjct: 448 DC 449 >UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: hypothetical protein - Strongylocentrotus purpuratus Length = 331 Score = 42.3 bits (95), Expect = 0.015 Identities = 21/65 (32%), Positives = 36/65 (55%) Frame = +2 Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481 + ++P +D R P P + V++Q SCG+CWAF VE M ++ + + SA+ Sbjct: 124 LKTMPLVYDLRSIKP--PVVTPVKNQKSCGACWAFSVVETMETQIALKTK--RLTQLSAQ 179 Query: 482 DLLSC 496 +L+ C Sbjct: 180 ELVDC 184 >UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 291 Score = 42.3 bits (95), Expect = 0.015 Identities = 20/51 (39%), Positives = 29/51 (56%), Gaps = 1/51 (1%) Frame = +2 Query: 359 LNEVRDQGSCGSCWAFGAVEAM-TDRVCTYSNGTKHFHFSAEDLLSCCPIC 508 +N +RDQ CGSCWAFG V A ++ YSN + S ++++ C C Sbjct: 90 VNPIRDQKQCGSCWAFGTVAACESNYALLYSNLPQ---LSEQNIIDCATTC 137 >UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa zeasingle nucleocapsid nuclear polyhedrosis virus) Length = 367 Score = 42.3 bits (95), Expect = 0.015 Identities = 22/62 (35%), Positives = 31/62 (50%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 LP+ +D W D + ++DQG CGSCWAF A+ + + N K S + LL Sbjct: 156 LPDYYD----WRDTNKVTPIKDQGVCGSCWAFVAIGNIESQYAIRHN--KLIDLSEQQLL 209 Query: 491 SC 496 C Sbjct: 210 DC 211 >UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium tetraurelia|Rep: Cathepsin L1 precursor - Paramecium tetraurelia Length = 314 Score = 42.3 bits (95), Expect = 0.015 Identities = 19/43 (44%), Positives = 27/43 (62%) Frame = +2 Query: 368 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 V++QGSCGSCWAF AV A+ + T + + S +DL+ C Sbjct: 126 VKNQGSCGSCWAFSAVGAL--EINTDIELNRKYELSEQDLVDC 166 >UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2; Taeniidae|Rep: Cathepsin L-like cysteine proteinase - Taenia solium (Pork tapeworm) Length = 339 Score = 41.9 bits (94), Expect = 0.019 Identities = 23/64 (35%), Positives = 33/64 (51%) Frame = +2 Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484 A LP+ D RDK + EV++QG+CGSCWAF + A+ + K S + Sbjct: 122 AGLPDTVDWRDK----NLVTEVKNQGNCGSCWAFSSTGALEGAFAKKTG--KLISLSEQQ 175 Query: 485 LLSC 496 L+ C Sbjct: 176 LVDC 179 >UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis|Rep: Cysteine protease 2 - Babesia bovis Length = 445 Score = 41.9 bits (94), Expect = 0.019 Identities = 21/59 (35%), Positives = 31/59 (52%) Frame = +2 Query: 320 NFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 NF+ D W + V+DQG CGSCWAF AV ++ + + S ++L+SC Sbjct: 236 NFEDID-WRRADAVTPVKDQGMCGSCWAFAAVGSVESLLKRQKTDVR---LSEQELVSC 290 >UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromeliaceae|Rep: Fruit bromelain precursor - Ananas comosus (Pineapple) Length = 351 Score = 41.9 bits (94), Expect = 0.019 Identities = 21/65 (32%), Positives = 36/65 (55%) Frame = +2 Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481 I+++P++ D W D +NEV++Q CGSCW+F A+ A + + G S + Sbjct: 120 ISAVPQSID----WRDYGAVNEVKNQNPCGSCWSFAAI-ATVEGIYKIKTGYL-VSLSEQ 173 Query: 482 DLLSC 496 ++L C Sbjct: 174 EVLDC 178 >UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; n=16; Chrysomelidae|Rep: Digestive cysteine protease intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 326 Score = 41.5 bits (93), Expect = 0.026 Identities = 22/63 (34%), Positives = 31/63 (49%), Gaps = 2/63 (3%) Frame = +2 Query: 314 PENFDPRDK--WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487 PE+ + D W + + EV+DQ CGSCWAF A A+ + +N S + L Sbjct: 105 PEDLEVPDSIDWTEKGAVLEVKDQNPCGSCWAFSATGALEGQNAILNN--VKISLSEQQL 162 Query: 488 LSC 496 L C Sbjct: 163 LDC 165 >UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precursor; n=3; Plasmodium (Laverania)|Rep: Trophozoite cysteine proteinase precursor - Plasmodium falciparum Length = 569 Score = 41.5 bits (93), Expect = 0.026 Identities = 24/72 (33%), Positives = 38/72 (52%) Frame = +2 Query: 281 KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460 K + D+ + +PE D R+K ++E +DQG CGSCWAF +V + V N Sbjct: 323 KRNEKDIFSKVPEILDYREKG----IVHEPKDQGLCGSCWAFASV-GNIESVFAKKN-KN 376 Query: 461 HFHFSAEDLLSC 496 FS ++++ C Sbjct: 377 ILSFSEQEVVDC 388 >UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase) (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1; Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase) (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1; Vignain-2] - Vigna mungo (Rice bean) (Black gram) Length = 362 Score = 41.5 bits (93), Expect = 0.026 Identities = 23/71 (32%), Positives = 37/71 (52%) Frame = +2 Query: 284 THNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKH 463 T ++ + S+P + D R K + +V+DQG CGSCWAF + A+ +N K Sbjct: 119 TFMYEKVGSVPASVDWRKKG----AVTDVKDQGQCGSCWAFSTIVAVEGINQIKTN--KL 172 Query: 464 FHFSAEDLLSC 496 S ++L+ C Sbjct: 173 VSLSEQELVDC 183 >UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin W, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to Cathepsin W, partial - Ornithorhynchus anatinus Length = 229 Score = 41.1 bits (92), Expect = 0.034 Identities = 22/67 (32%), Positives = 35/67 (52%), Gaps = 2/67 (2%) Frame = +2 Query: 302 IASLPENFDPRDK--WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFS 475 +AS+PE ++ W + V++QGSCGSCWAF AV + + G + S Sbjct: 59 MASIPEGPLRKETCDWRKRGAITSVKNQGSCGSCWAFAAV-GNAESMWYLRAGKRLVSLS 117 Query: 476 AEDLLSC 496 +++L C Sbjct: 118 VQEVLDC 124 >UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2; Brugia malayi|Rep: Cahepsin L-like cysteine protease - Brugia malayi (Filarial nematode worm) Length = 371 Score = 41.1 bits (92), Expect = 0.034 Identities = 22/62 (35%), Positives = 32/62 (51%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 LP++ D W + +V+DQG CGSCW F AV A+ + + K S ++LL Sbjct: 143 LPKSID----WRTSGAVTKVKDQGYCGSCWTFSAVGALEGQ--HFLQTGKLVELSMQNLL 196 Query: 491 SC 496 C Sbjct: 197 DC 198 >UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin L-like proteinase" precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 315 Score = 41.1 bits (92), Expect = 0.034 Identities = 18/52 (34%), Positives = 28/52 (53%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 W D L V+DQG CGSCWAF ++ ++ + N + S ++L+ C Sbjct: 117 WRDSAVLG-VKDQGQCGSCWAFSTTGSLEGQLAIHKN--QRVPLSEQELVDC 165 >UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain; n=9; Cucujiformia|Rep: Digestive cysteine proteinase intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 326 Score = 41.1 bits (92), Expect = 0.034 Identities = 25/85 (29%), Positives = 38/85 (44%), Gaps = 2/85 (2%) Frame = +2 Query: 248 YRDEHFATLPIKTHNFDLIASLPENFDPRDK--WPDCPTLNEVRDQGSCGSCWAFGAVEA 421 ++DE + K + +A PE + D W + +V+ QG CGSCWAF A A Sbjct: 83 FKDELRRQIKTKPNVEATLAVFPEGLEVPDSIDWTQKGAVLDVKYQGGCGSCWAFSATGA 142 Query: 422 MTDRVCTYSNGTKHFHFSAEDLLSC 496 + + +N S + LL C Sbjct: 143 LEGQNAIVNN--VKIPLSEQQLLDC 165 >UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-like cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L or H-like cysteine peptidase - Trichomonas vaginalis G3 Length = 435 Score = 41.1 bits (92), Expect = 0.034 Identities = 22/72 (30%), Positives = 37/72 (51%), Gaps = 1/72 (1%) Frame = +2 Query: 284 THNFDLIASLPENFDPRDKWPDCPTLNEV-RDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460 T + D LPE+F W + P + + RDQ +CGSCWA A +++ ++ +N T Sbjct: 204 TKHIDFKGDLPESFS----WRNLPNVVAMPRDQANCGSCWAQAAATSISSQISMRTNKTT 259 Query: 461 HFHFSAEDLLSC 496 S + ++ C Sbjct: 260 --KVSVQQIVDC 269 >UniRef50_A2E6N1 Cluster: Clan CA, family C1, cathepsin B-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin B-like cysteine peptidase - Trichomonas vaginalis G3 Length = 255 Score = 41.1 bits (92), Expect = 0.034 Identities = 21/78 (26%), Positives = 42/78 (53%) Frame = +2 Query: 263 FATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCT 442 F I++ D+ +P+ ++ ++P C L + + CG C+A+G ++AM+ R+C Sbjct: 15 FVDESIRSFPEDISIDIPDEYNFLQEYPHCD-LGPLTQE--CGCCYAYGPIKAMSHRICK 71 Query: 443 YSNGTKHFHFSAEDLLSC 496 N K SA+ +++C Sbjct: 72 AKN--KKTFLSAQFIVAC 87 >UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: Cathepsin L - Kudoa thyrsites Length = 300 Score = 41.1 bits (92), Expect = 0.034 Identities = 22/74 (29%), Positives = 37/74 (50%) Frame = +2 Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454 P +T D+ ++LP + D W + V++QG CGSCW+F A A+ + Sbjct: 90 PKETATKDIKSTLPSSVD----WKALGKVTSVKNQGHCGSCWSFSAAGAIESAYAIKTG- 144 Query: 455 TKHFHFSAEDLLSC 496 + +FS + L+ C Sbjct: 145 -ELVNFSEQQLVDC 157 >UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1 precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 355 Score = 41.1 bits (92), Expect = 0.034 Identities = 24/65 (36%), Positives = 34/65 (52%) Frame = +2 Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481 I LP++ D R K P V+DQG CGSCWAF V A+ + + + G S + Sbjct: 134 ITDLPKSVDWRKKGAVAP----VKDQGQCGSCWAFSTVAAV-EGINQITTGNLS-SLSEQ 187 Query: 482 DLLSC 496 +L+ C Sbjct: 188 ELIDC 192 >UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; Theileria|Rep: Cysteine proteinase precursor - Theileria parva Length = 440 Score = 41.1 bits (92), Expect = 0.034 Identities = 21/67 (31%), Positives = 33/67 (49%) Frame = +2 Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFS 475 DL EN D W ++ V+DQ +CG CWAF V ++ ++ + K + S Sbjct: 224 DLAKLTGENLD----WRRSSSVTSVKDQSNCGGCWAFSTVGSVEGYYMSHFD--KSYELS 277 Query: 476 AEDLLSC 496 ++LL C Sbjct: 278 VQELLDC 284 >UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; Theileria|Rep: Cysteine proteinase precursor - Theileria annulata Length = 441 Score = 41.1 bits (92), Expect = 0.034 Identities = 17/53 (32%), Positives = 30/53 (56%), Gaps = 1/53 (1%) Frame = +2 Query: 341 WPDCPTLNEVRDQGS-CGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 W ++ ++DQG CGSCWAF ++ ++ Y N K + S ++L++C Sbjct: 233 WARTDAVSPIKDQGDHCGSCWAFSSIASVESLYRLYKN--KSYFLSEQELVNC 283 >UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Plasmodium|Rep: Cysteine proteinase precursor - Plasmodium vivax (strain Salvador I) Length = 583 Score = 41.1 bits (92), Expect = 0.034 Identities = 19/40 (47%), Positives = 27/40 (67%) Frame = +2 Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 415 +L+A +PE D R+K ++E +DQG CGSCWAF +V Sbjct: 334 NLLADVPEILDYREKG----IVHEPKDQGLCGSCWAFASV 369 >UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa|Rep: Os09g0497500 protein - Oryza sativa subsp. japonica (Rice) Length = 349 Score = 40.7 bits (91), Expect = 0.045 Identities = 24/62 (38%), Positives = 36/62 (58%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 LP++ D R K + EV++QG CGSCWAF AV A+ + + NG + S ++L+ Sbjct: 122 LPKSVDWRKKG----AVVEVKNQGDCGSCWAFSAVAAI-EGINQIKNG-ELVSLSEQELV 175 Query: 491 SC 496 C Sbjct: 176 DC 177 >UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 328 Score = 40.7 bits (91), Expect = 0.045 Identities = 20/45 (44%), Positives = 27/45 (60%), Gaps = 1/45 (2%) Frame = +2 Query: 311 LPENFDPRDKWPD-CPTLNEVRDQGSCGSCWAFGAVEAMTDRVCT 442 +P+ FD RD + D P + V+DQ CG CWAF A A+T+ T Sbjct: 97 IPDYFDLRDIYVDGSPVVGPVKDQEQCGCCWAF-ATTAITEAANT 140 >UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba histolytica|Rep: Cysteine protease 19 - Entamoeba histolytica Length = 324 Score = 40.7 bits (91), Expect = 0.045 Identities = 18/49 (36%), Positives = 31/49 (63%), Gaps = 2/49 (4%) Frame = +2 Query: 359 LNEVRDQGSCGSCWAFGAVEAM-TDRVCTYSN-GTKHFHFSAEDLLSCC 499 + V+DQG+CGSC+AF +V M T + +Y + ++ S +++SCC Sbjct: 112 MTPVKDQGNCGSCYAFSSVALMETAVLLSYDDLSPSNYALSTAEIVSCC 160 >UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 317 Score = 40.7 bits (91), Expect = 0.045 Identities = 19/39 (48%), Positives = 25/39 (64%) Frame = +2 Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424 ++PE+ D R+K +N VRDQ CGSCWAF A A+ Sbjct: 103 TVPESIDWREKG----AVNPVRDQEQCGSCWAFSAAGAL 137 >UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=176; Viridiplantae|Rep: Cysteine proteinase RD21a precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 462 Score = 40.7 bits (91), Expect = 0.045 Identities = 19/38 (50%), Positives = 24/38 (63%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424 LPE+ D R K + EV+DQG CGSCWAF + A+ Sbjct: 137 LPESIDWRKKG----AVAEVKDQGGCGSCWAFSTIGAV 170 >UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|Rep: Cathepsin L precursor - Schistosoma mansoni (Blood fluke) Length = 319 Score = 40.7 bits (91), Expect = 0.045 Identities = 17/35 (48%), Positives = 25/35 (71%) Frame = +2 Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 406 + ++P+NFD R+K + EV++QG CGSCWAF Sbjct: 102 VNNIPKNFDWREKG----AVTEVKNQGMCGSCWAF 132 >UniRef50_UPI00006CB77B Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 382 Score = 40.3 bits (90), Expect = 0.059 Identities = 19/60 (31%), Positives = 34/60 (56%), Gaps = 1/60 (1%) Frame = +2 Query: 320 NFDPRDKWPDCPTLNEVRDQGS-CGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 +F+ K+P C + + +QG C + ++ AV ++ DR+C S G +F SA+ +SC Sbjct: 128 SFNFHTKYPQC--VRPIANQGKDCSASYSIAAVSSVADRLCMASEGDFNFGLSAQPTISC 185 >UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays (Maize) Length = 493 Score = 40.3 bits (90), Expect = 0.059 Identities = 15/28 (53%), Positives = 19/28 (67%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 424 W + + EV+DQG CG CWAF AV A+ Sbjct: 170 WRERGAVAEVKDQGQCGGCWAFSAVAAV 197 >UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3; Theileria|Rep: Cysteine protease, putative - Theileria annulata Length = 580 Score = 40.3 bits (90), Expect = 0.059 Identities = 19/52 (36%), Positives = 28/52 (53%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 W + +NEV +QGSCGSCWA + + + N K FS++ L+ C Sbjct: 370 WRESGFVNEVVNQGSCGSCWAIASEDIFSTFKSIKKN--KLMKFSSQQLVDC 419 >UniRef50_Q22W19 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 40.3 bits (90), Expect = 0.059 Identities = 23/73 (31%), Positives = 33/73 (45%), Gaps = 1/73 (1%) Frame = +2 Query: 281 KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460 K LI SL + P W + V++QG CGSCWAF V + Y+ T Sbjct: 109 KRQKSHLIYSLKGDVAPSIDWRQKNAVTPVKNQGQCGSCWAFSTVGGLEG---AYAIATG 165 Query: 461 HF-HFSAEDLLSC 496 + FS + ++ C Sbjct: 166 NLTSFSEQQIVDC 178 >UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep: Aca s 1 allergen - Acarus siro (Dust mite) Length = 331 Score = 40.3 bits (90), Expect = 0.059 Identities = 21/63 (33%), Positives = 31/63 (49%) Frame = +2 Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487 +LPE FD R K L + +QG CG+CWAF ++ + N H S ++L Sbjct: 108 NLPETFDWRSK------LGPIENQGRCGACWAFASLATVEAAFAIKYN--THIRLSKQEL 159 Query: 488 LSC 496 + C Sbjct: 160 VEC 162 >UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precursor; n=20; Psoroptidia|Rep: Major mite fecal allergen Der f 1 precursor - Dermatophagoides farinae (House-dust mite) Length = 321 Score = 40.3 bits (90), Expect = 0.059 Identities = 18/47 (38%), Positives = 24/47 (51%) Frame = +2 Query: 356 TLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 T+ +R QG CGSCWAF V A Y N + S ++L+ C Sbjct: 120 TVTPIRMQGGCGSCWAFSGVAATESAYLAYRNTS--LDLSEQELVDC 164 >UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovirus|Rep: Viral cathepsin - Spodoptera litura multicapsid nucleopolyhedrovirus (SpltMNPV) Length = 337 Score = 40.3 bits (90), Expect = 0.059 Identities = 21/64 (32%), Positives = 31/64 (48%) Frame = +2 Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484 A PE+FD W + +V++QG CGSCWAF A+ + + + S + Sbjct: 124 ARTPESFD----WRKLNKVTKVKEQGVCGSCWAFAAIGNIESQYAIMHDSL--IDLSEQQ 177 Query: 485 LLSC 496 LL C Sbjct: 178 LLDC 181 >UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MGC107932 protein - Xenopus tropicalis (Western clawed frog) (Silurana tropicalis) Length = 333 Score = 39.9 bits (89), Expect = 0.079 Identities = 22/76 (28%), Positives = 38/76 (50%), Gaps = 2/76 (2%) Frame = +2 Query: 275 PIKTHNFDLIA-SLPENFDPRDKWPDCPTLNEVRDQGS-CGSCWAFGAVEAMTDRVCTYS 448 P+K ++ + ++P+ D W + V++QG+ CGSCWAF V M R C + Sbjct: 102 PVKAESYSYTSITIPKEVD----WRKSNCVTPVKNQGTFCGSCWAFATVGVMESRYCIRT 157 Query: 449 NGTKHFHFSAEDLLSC 496 + + S + L+ C Sbjct: 158 K--ELLNLSEQQLVDC 171 >UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa (japonica cultivar-group)|Rep: Os09g0562700 protein - Oryza sativa subsp. japonica (Rice) Length = 235 Score = 39.9 bits (89), Expect = 0.079 Identities = 19/46 (41%), Positives = 27/46 (58%) Frame = +2 Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 + EV+DQG CGSCWAF V A+ + + G K S ++L+ C Sbjct: 21 VTEVKDQGRCGSCWAFSTV-AVVEGIQKIKKG-KLVSLSEQELVDC 64 >UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae|Rep: Cysteine proteinase - Hypera postica (alfalfa weevil) Length = 324 Score = 39.9 bits (89), Expect = 0.079 Identities = 18/44 (40%), Positives = 25/44 (56%) Frame = +2 Query: 368 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC 499 V+DQG CGSCWAF ++ T+ +G K S + L+ CC Sbjct: 127 VKDQGDCGSCWAF-SITGSTEGAYARKSG-KLVSLSEQQLIDCC 168 >UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Toxopain-2 - Toxoplasma gondii Length = 422 Score = 39.9 bits (89), Expect = 0.079 Identities = 20/67 (29%), Positives = 29/67 (43%) Frame = +2 Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFS 475 +L+ LP W + V+DQ CGSCWAF A+ C + K S Sbjct: 196 ELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEGAHCAKTG--KLVSLS 253 Query: 476 AEDLLSC 496 ++L+ C Sbjct: 254 EQELMDC 260 >UniRef50_O16454 Cluster: Temporarily assigned gene name protein 196; n=4; Bilateria|Rep: Temporarily assigned gene name protein 196 - Caenorhabditis elegans Length = 477 Score = 39.9 bits (89), Expect = 0.079 Identities = 17/32 (53%), Positives = 24/32 (75%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 406 LPE+FD R+K + +V++QG+CGSCWAF Sbjct: 264 LPESFDWREKG----AVTQVKNQGNCGSCWAF 291 >UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=17; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 318 Score = 39.9 bits (89), Expect = 0.079 Identities = 13/27 (48%), Positives = 18/27 (66%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEA 421 W + +N ++DQ CGSCWAF V+A Sbjct: 106 WRNAKIVNPIKDQAQCGSCWAFSVVQA 132 >UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin l; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin l - Strongylocentrotus purpuratus Length = 489 Score = 39.5 bits (88), Expect = 0.10 Identities = 19/63 (30%), Positives = 32/63 (50%) Frame = +2 Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487 ++P++ D W ++ V+DQ CGSCW+FG+ E + V + K S + L Sbjct: 266 AVPDHID----WNVLGAVSPVKDQAVCGSCWSFGSAETIEGAV--FMQSGKRVRLSQQML 319 Query: 488 LSC 496 + C Sbjct: 320 MDC 322 >UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine protease; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cysteine protease - Strongylocentrotus purpuratus Length = 494 Score = 39.5 bits (88), Expect = 0.10 Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 1/51 (1%) Frame = +2 Query: 275 PIKTHNFDLIASLPENFDPRD-KWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424 P+K A++P+ P + W + V++QG CGSCWAF A+ M Sbjct: 223 PLKKTGIKKQAAIPQGPVPEEYDWRTHGAVTPVKNQGMCGSCWAFSAIGNM 273 >UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin L - Misgurnus mizolepis (Mud loach) Length = 337 Score = 39.5 bits (88), Expect = 0.10 Identities = 18/52 (34%), Positives = 27/52 (51%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 W + + V+DQG CGSCWAF AM ++ + K S ++L+ C Sbjct: 122 WREKGYVTPVKDQGECGSCWAFSTTGAMEGQM--FRKQGKLVSLSEQNLVDC 171 >UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba histolytica|Rep: Cysteine protease 17 - Entamoeba histolytica Length = 420 Score = 39.5 bits (88), Expect = 0.10 Identities = 24/73 (32%), Positives = 34/73 (46%), Gaps = 5/73 (6%) Frame = +2 Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT-----K 460 D++ LPE D R L +R+Q CG CW+F +V A+ R N T + Sbjct: 162 DIVKELPEGIDFRK----FGKLTYIREQTGCGGCWSFASVCALESRYLIDYNLTVDDVGR 217 Query: 461 HFHFSAEDLLSCC 499 + S + LL CC Sbjct: 218 TWALSEQQLLDCC 230 >UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n=1; Myxobolus cerebralis|Rep: Cathepsin Z-like cysteine proteinase - Myxobolus cerebralis Length = 297 Score = 39.5 bits (88), Expect = 0.10 Identities = 21/68 (30%), Positives = 37/68 (54%), Gaps = 5/68 (7%) Frame = +2 Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGS---CGSCWAFGAVEAMTDRVCTYSNGT--KHFHF 472 ++P++FD W + L+ V++Q CGSCWAF + + DR+ N + HF Sbjct: 49 NMPKSFD----WRENAYLSSVKNQHLPTYCGSCWAFASTSTIADRIYIAKNLSHFDHFSL 104 Query: 473 SAEDLLSC 496 S + +++C Sbjct: 105 SVQVVIAC 112 >UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4; Caenorhabditis|Rep: Cathepsin z protein 1 - Caenorhabditis elegans Length = 306 Score = 39.5 bits (88), Expect = 0.10 Identities = 26/79 (32%), Positives = 39/79 (49%), Gaps = 7/79 (8%) Frame = +2 Query: 281 KTHNFDLIASLPENFDPRDKWPDCPTLNEV---RDQGS---CGSCWAFGAVEAMTDRV-C 439 +T +FD LP+ +D W D +N R+Q CGSCWAFGA A+ DR+ Sbjct: 56 ETEDFDS-EDLPKTWD----WRDANGINYASADRNQHIPQYCGSCWAFGATSALADRINI 110 Query: 440 TYSNGTKHFHFSAEDLLSC 496 N + S ++++ C Sbjct: 111 KRKNAWPQAYLSVQEVIDC 129 >UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H-like cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L, S or H-like cysteine peptidase - Trichomonas vaginalis G3 Length = 473 Score = 39.5 bits (88), Expect = 0.10 Identities = 15/33 (45%), Positives = 22/33 (66%), Gaps = 1/33 (3%) Frame = +2 Query: 341 WPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRV 436 W D P + + RDQ +CGSCWAFG E++ ++ Sbjct: 257 WRDVPNVVGKPRDQVACGSCWAFGTAESLESQL 289 >UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 452 Score = 39.5 bits (88), Expect = 0.10 Identities = 24/72 (33%), Positives = 39/72 (54%), Gaps = 1/72 (1%) Frame = +2 Query: 284 THNFDLIASLPENFDPRDKWPDCPTLNEV-RDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460 T++ +I +LPE+F W + P + E DQ CG+C+AFGA EA+ + +N + Sbjct: 216 TYDQKVIQNLPESFS----WRNVPYVLEYPHDQAVCGTCFAFGASEAINGQFSLRAN--R 269 Query: 461 HFHFSAEDLLSC 496 S + L+ C Sbjct: 270 SIITSVQQLVDC 281 >UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, whole genome shotgun sequence; n=7; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_22, whole genome shotgun sequence - Paramecium tetraurelia Length = 350 Score = 39.5 bits (88), Expect = 0.10 Identities = 23/57 (40%), Positives = 32/57 (56%), Gaps = 6/57 (10%) Frame = +2 Query: 254 DEHFA----TLPIKTHNFDLIASLPENFD--PRDKWPDCPTLNEVRDQGSCGSCWAF 406 DE FA TL + + ++ + EN + P D W +N+V+DQG CGSCWAF Sbjct: 114 DEEFAATYLTLKVNPDDLEVPKAQFENVNATPID-WRTRGAVNKVKDQGQCGSCWAF 169 >UniRef50_Q2FLC7 Cluster: Periplasmic copper-binding precursor; n=1; Methanospirillum hungatei JF-1|Rep: Periplasmic copper-binding precursor - Methanospirillum hungatei (strain JF-1 / DSM 864) Length = 1092 Score = 39.5 bits (88), Expect = 0.10 Identities = 18/48 (37%), Positives = 26/48 (54%) Frame = +2 Query: 281 KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424 K + ++A P FD RD + +RDQG GSCW F AV+++ Sbjct: 77 KIRSLSILADYPSKFDLRDS----KRVPAIRDQGQSGSCWDFAAVKSL 120 >UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryota|Rep: Cysteine proteinase GP-I - Zingiber officinale (Ginger) Length = 221 Score = 39.5 bits (88), Expect = 0.10 Identities = 18/38 (47%), Positives = 25/38 (65%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424 LP++ D R+K P V++QG CGSCWAF A+ A+ Sbjct: 3 LPDSIDWREKGAVVP----VKNQGGCGSCWAFDAIAAV 36 >UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 360 Score = 39.1 bits (87), Expect = 0.14 Identities = 23/66 (34%), Positives = 34/66 (51%) Frame = +2 Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487 +LP +FD RDK P V+ Q CG CWAF V+++ + + G K S + + Sbjct: 130 NLPASFDWRDKGAITP----VKVQNGCGGCWAFSTVQSI-EGLYFLKTG-KLESLSTQQV 183 Query: 488 LSCCPI 505 + CC I Sbjct: 184 IDCCRI 189 >UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba culbertsoni|Rep: Cysteine proteinase - Acanthamoeba culbertsoni Length = 482 Score = 39.1 bits (87), Expect = 0.14 Identities = 22/43 (51%), Positives = 27/43 (62%), Gaps = 3/43 (6%) Frame = +2 Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF---GAVEAM 424 AS+P N+D R K P V++QGSC SCWAF GAVE + Sbjct: 154 ASIPANWDWRTKGAVTP----VKNQGSCASCWAFVATGAVEGV 192 >UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease Gip1p; n=4; Tetrahymena thermophila|Rep: Granule-biosynthesis induced protease Gip1p - Tetrahymena thermophila Length = 345 Score = 39.1 bits (87), Expect = 0.14 Identities = 19/60 (31%), Positives = 30/60 (50%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICDWDA 520 W LN V++QG+CGSCW F A + + N + FS + L+ C + +D+ Sbjct: 139 WRKRGVLNPVKNQGTCGSCWTF-ATAGILESFNQIKN-KQLLKFSEQQLVDCVSLAGYDS 196 >UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lamblia ATCC 50803|Rep: GLP_163_69918_68548 - Giardia lamblia ATCC 50803 Length = 456 Score = 39.1 bits (87), Expect = 0.14 Identities = 21/59 (35%), Positives = 32/59 (54%) Frame = +2 Query: 239 NGSYRDEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 415 +G+ R + T P+ T + +P ++D R+ P V+DQG CGSCWAFG + Sbjct: 58 SGTCRQVYTLTDPLST-----LPEIPTSYDLREAGLQVP----VKDQGVCGSCWAFGTM 107 >UniRef50_Q23VA1 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 397 Score = 39.1 bits (87), Expect = 0.14 Identities = 18/46 (39%), Positives = 27/46 (58%) Frame = +2 Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 ++ V+DQG CG CWAF A A+ + V N T +S ++L+ C Sbjct: 192 VSPVKDQGRCGCCWAFSAT-ALAESVNLMRNNTLQ-QYSEQELVDC 235 >UniRef50_Q23FQ5 Cluster: Papain family cysteine protease containing protein; n=4; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 39.1 bits (87), Expect = 0.14 Identities = 20/66 (30%), Positives = 29/66 (43%) Frame = +2 Query: 320 NFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC 499 N+ W + LN +++QG CGSC AFG + Y + FS + LL C Sbjct: 124 NYPTSVDWRNSGALNPIQNQGQCGSCAAFGTAGVLES--FYYLKSKQLLKFSEQQLLDCA 181 Query: 500 PICDWD 517 +D Sbjct: 182 RQAGFD 187 >UniRef50_Q22DX2 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 358 Score = 39.1 bits (87), Expect = 0.14 Identities = 22/63 (34%), Positives = 30/63 (47%) Frame = +2 Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487 S+P ++D R P L V +QG CGSCWAF A+ N T + S + L Sbjct: 146 SIPSSWDIRTDGPGL--LQPVENQGQCGSCWAFSTSGAVESYYSAKKNIT--LNLSKQQL 201 Query: 488 LSC 496 + C Sbjct: 202 VDC 204 >UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 493 Score = 39.1 bits (87), Expect = 0.14 Identities = 21/63 (33%), Positives = 31/63 (49%), Gaps = 1/63 (1%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFH-FSAEDL 487 LP F R+ + + + RDQ +CGSCWAFG E + + +K FH S + Sbjct: 266 LPRTFSWRN---NTQVVGKPRDQVACGSCWAFGTAEVLEG---AFGIASKEFHEVSTNQI 319 Query: 488 LSC 496 + C Sbjct: 320 MDC 322 >UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|Rep: CG4847-PD, isoform D - Drosophila melanogaster (Fruit fly) Length = 420 Score = 39.1 bits (87), Expect = 0.14 Identities = 21/68 (30%), Positives = 35/68 (51%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 +P+ FD W + + V+ QG+CGSCWAF A+ T+ + S ++L+ Sbjct: 203 IPDAFD----WREHGGVTPVKFQGTCGSCWAFATTGAIEGH--TFRKTGSLPNLSEQNLV 256 Query: 491 SCCPICDW 514 C P+ D+ Sbjct: 257 DCGPVEDF 264 >UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; Methanospirillum hungatei JF-1|Rep: Peptidase C1A, papain precursor - Methanospirillum hungatei (strain JF-1 / DSM 864) Length = 1096 Score = 39.1 bits (87), Expect = 0.14 Identities = 18/37 (48%), Positives = 23/37 (62%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEA 421 LP +FD R+ D T +++QGSCGSCWAF A Sbjct: 321 LPTSFDWRNNGGDYTT--PIKNQGSCGSCWAFATTGA 355 >UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F like protease; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to cathepsin F like protease - Nasonia vitripennis Length = 1036 Score = 38.7 bits (86), Expect = 0.18 Identities = 17/36 (47%), Positives = 23/36 (63%), Gaps = 1/36 (2%) Frame = +2 Query: 302 IASLPENFDPRD-KWPDCPTLNEVRDQGSCGSCWAF 406 +A++P+ P D W + V+DQGSCGSCWAF Sbjct: 809 MATIPDIELPSDYDWRHHNVVTPVKDQGSCGSCWAF 844 >UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 protein, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to MGC81823 protein, partial - Ornithorhynchus anatinus Length = 361 Score = 38.7 bits (86), Expect = 0.18 Identities = 14/24 (58%), Positives = 17/24 (70%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAFGA 412 W D + V+DQG CGSCWAFG+ Sbjct: 196 WRDHGYVTPVKDQGRCGSCWAFGS 219 >UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Cathepsin K - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 333 Score = 38.7 bits (86), Expect = 0.18 Identities = 18/43 (41%), Positives = 26/43 (60%) Frame = +2 Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424 D + LP++ D R + V++QGSCGSCWAF +V A+ Sbjct: 113 DRVGKLPKSIDYRK----LGYVTSVKNQGSCGSCWAFSSVGAL 151 >UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel protein - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 328 Score = 38.7 bits (86), Expect = 0.18 Identities = 19/55 (34%), Positives = 31/55 (56%) Frame = +2 Query: 332 RDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 R W + ++ V++QG CGSCWAF AV ++ ++ + SA++LL C Sbjct: 116 RVNWTEHGMVSPVQNQGPCGSCWAFSAVGSLEAQMKRRTAAL--VPLSAQNLLDC 168 >UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|Rep: Phospholipase A1 - Tetrahymena thermophila Length = 320 Score = 38.7 bits (86), Expect = 0.18 Identities = 23/93 (24%), Positives = 45/93 (48%), Gaps = 1/93 (1%) Frame = +2 Query: 257 EHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 436 + F TL K ++ ++ + E + W + V++QGSCGSCWAF + A+ + Sbjct: 92 QQFLTLHEKVNSTEVYRAQGEATEV--DWTAKGKVTPVKNQGSCGSCWAFSTIGAVESAL 149 Query: 437 CTYSNGTKH-FHFSAEDLLSCCPICDWDAAEEC 532 G ++ + + ++ + C +D +E C Sbjct: 150 WIAGQGEQNTLNLAEQEQVDCAKSPKYD-SEGC 181 >UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 2 - Rhipicephalus appendiculatus (Brown ear tick) Length = 564 Score = 38.7 bits (86), Expect = 0.18 Identities = 19/47 (40%), Positives = 23/47 (48%) Frame = +2 Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 415 P H F A LP+ D W + V+DQ CGSCW+FG V Sbjct: 335 PFPRHRFT--AKLPDQID----WRPYGAVTPVKDQAVCGSCWSFGTV 375 >UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 4 - Rhipicephalus appendiculatus (Brown ear tick) Length = 345 Score = 38.7 bits (86), Expect = 0.18 Identities = 16/53 (30%), Positives = 29/53 (54%) Frame = +2 Query: 338 KWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 +W + + V++QG CGSCWAF + A+ +V + + S ++L+ C Sbjct: 131 EWRENGFVTPVKNQGQCGSCWAFSSTGALEGQV--FKRTRRLISLSEQNLMDC 181 >UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 664 Score = 38.7 bits (86), Expect = 0.18 Identities = 17/52 (32%), Positives = 28/52 (53%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 W +++V++QGSCGSC+AF V A+ Y + S ++L+ C Sbjct: 476 WRTWGMVSKVKNQGSCGSCYAFSTVGALESHY--YRKNNRMLDLSEQNLVDC 525 >UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus|Rep: Cathepsin L - Aphrocallistes vastus Length = 329 Score = 38.7 bits (86), Expect = 0.18 Identities = 17/52 (32%), Positives = 27/52 (51%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 W + V++QG CGSCW+F A ++ + S K FS ++L+ C Sbjct: 121 WRSKGVVTPVKNQGQCGSCWSFSATGSLEGQYAIKSG--KLVSFSEQELVDC 170 >UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Rep: Cysteine protease - Babesia equi Length = 438 Score = 38.7 bits (86), Expect = 0.18 Identities = 18/52 (34%), Positives = 30/52 (57%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 W + V+DQG+CGSCWAF AV ++ + + G + S ++L++C Sbjct: 230 WRKLNGVTPVKDQGNCGSCWAFAAVGSV-ESLYLIKKG-QALDLSEQELVNC 279 >UniRef50_A5KBM2 Cluster: Serine-repeat antigen; n=3; Plasmodium|Rep: Serine-repeat antigen - Plasmodium vivax Length = 1014 Score = 38.7 bits (86), Expect = 0.18 Identities = 21/62 (33%), Positives = 31/62 (50%), Gaps = 3/62 (4%) Frame = +2 Query: 320 NFDPRDKWPD---CPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 N++ D+W D C + EV +QG+CG CW F + + C G HF SA + Sbjct: 555 NYEYCDRWKDKTSCISNIEVEEQGNCGLCWVFASKLHLETIRC--MRGYGHFRSSALYVA 612 Query: 491 SC 496 +C Sbjct: 613 NC 614 >UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; Dictyostelium discoideum|Rep: Cysteine proteinase 1 precursor - Dictyostelium discoideum (Slime mold) Length = 343 Score = 38.7 bits (86), Expect = 0.18 Identities = 25/89 (28%), Positives = 42/89 (47%), Gaps = 2/89 (2%) Frame = +2 Query: 272 LPIKTHNFD-LIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYS 448 LP+ + D I S+P FD W + V++QG CGSCW+F + + + Sbjct: 104 LPVADYLDDEFINSIPTAFD----WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQ--HFI 157 Query: 449 NGTKHFHFSAEDLLSCCPIC-DWDAAEEC 532 + K S ++L+ C C +++ E C Sbjct: 158 SQNKLVSLSEQNLVDCDHECMEYEGEEAC 186 >UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18; Magnoliophyta|Rep: Thiol protease aleurain precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 358 Score = 38.7 bits (86), Expect = 0.18 Identities = 20/42 (47%), Positives = 26/42 (61%), Gaps = 3/42 (7%) Frame = +2 Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF---GAVEA 421 A+LPE D W + ++ V+DQG CGSCW F GA+EA Sbjct: 139 AALPETKD----WREDGIVSPVKDQGGCGSCWTFSTTGALEA 176 >UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin O precursor; n=1; Tribolium castaneum|Rep: PREDICTED: similar to Cathepsin O precursor - Tribolium castaneum Length = 326 Score = 38.3 bits (85), Expect = 0.24 Identities = 18/64 (28%), Positives = 33/64 (51%) Frame = +2 Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484 A++P D R+K + + +QGSCG+CWA+ +E + +N K S ++ Sbjct: 119 ATVPNKVDWREK----NAVTRIYNQGSCGACWAYSVIETVESMNAIKTN--KSEELSVQE 172 Query: 485 LLSC 496 ++ C Sbjct: 173 IIDC 176 >UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 20 SCAF14744, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 175 Score = 38.3 bits (85), Expect = 0.24 Identities = 18/41 (43%), Positives = 23/41 (56%) Frame = +2 Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424 I LP FD W D + V++Q +CGSCWAF V A+ Sbjct: 56 IKGLPARFD----WRDNAVVGPVQNQQACGSCWAFSVVGAV 92 >UniRef50_A6GBK8 Cluster: Peptidase C1A, papain; n=1; Plesiocystis pacifica SIR-1|Rep: Peptidase C1A, papain - Plesiocystis pacifica SIR-1 Length = 650 Score = 38.3 bits (85), Expect = 0.24 Identities = 18/46 (39%), Positives = 24/46 (52%) Frame = +2 Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 L +R+QG+CGSCWAF AV + + G S + LSC Sbjct: 176 LGAIRNQGACGSCWAFAAVSTIEASNAIVNGGRS--DLSEQHALSC 219 >UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sativa|Rep: Cysteine proteinase-like - Oryza sativa subsp. japonica (Rice) Length = 360 Score = 38.3 bits (85), Expect = 0.24 Identities = 20/52 (38%), Positives = 28/52 (53%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 W + EV++Q SCGSCWAF AV A T+ + + G S + +L C Sbjct: 143 WRARGAVTEVKNQRSCGSCWAFAAV-AATEGLVQLATGNL-VSLSEQQVLDC 192 >UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4; core eudicotyledons|Rep: Papain-like cysteine peptidase XBCP3 - Arabidopsis thaliana (Mouse-ear cress) Length = 437 Score = 38.3 bits (85), Expect = 0.24 Identities = 14/28 (50%), Positives = 18/28 (64%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 424 W + V+DQGSCG+CW+F A AM Sbjct: 124 WRKKGAVTNVKDQGSCGACWSFSATGAM 151 >UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativa|Rep: Os01g0347600 protein - Oryza sativa subsp. japonica (Rice) Length = 343 Score = 38.3 bits (85), Expect = 0.24 Identities = 14/19 (73%), Positives = 17/19 (89%) Frame = +2 Query: 368 VRDQGSCGSCWAFGAVEAM 424 V+DQG+CGSCWAF AV A+ Sbjct: 140 VKDQGACGSCWAFAAVAAI 158 >UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophyta|Rep: Cysteine protease Cp5 - Actinidia deliciosa (Kiwi) Length = 509 Score = 38.3 bits (85), Expect = 0.24 Identities = 18/52 (34%), Positives = 28/52 (53%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 W + V+DQG CGSCWAF + A+ + + +NG S ++L+ C Sbjct: 153 WRKYGIVTGVKDQGDCGSCWAFSSTGAI-EGINALANGDL-ISLSEQELVDC 202 >UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative uncharacterized protein - Oryza sativa subsp. japonica (Rice) Length = 289 Score = 38.3 bits (85), Expect = 0.24 Identities = 14/19 (73%), Positives = 17/19 (89%) Frame = +2 Query: 368 VRDQGSCGSCWAFGAVEAM 424 V+DQG+CGSCWAF AV A+ Sbjct: 139 VKDQGACGSCWAFAAVAAI 157 >UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia ATCC 50803|Rep: GLP_567_6496_7413 - Giardia lamblia ATCC 50803 Length = 305 Score = 38.3 bits (85), Expect = 0.24 Identities = 19/64 (29%), Positives = 29/64 (45%) Frame = +2 Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484 A P+ D R P+C E DQ C C+AF + A++ R C + S + Sbjct: 79 AGSPDRLDYRQTHPEC--FFEPEDQKECSCCYAFATLGALSTRRCIAKLDPQAVSLSVQH 136 Query: 485 LLSC 496 ++SC Sbjct: 137 MVSC 140 >UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma japonicum|Rep: SJCHGC06231 protein - Schistosoma japonicum (Blood fluke) Length = 372 Score = 38.3 bits (85), Expect = 0.24 Identities = 20/64 (31%), Positives = 31/64 (48%) Frame = +2 Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484 A LP+ D W + V++QG CGSCWAF + A+ + Y + + S + Sbjct: 148 AKLPDRVD----WRRNGAVTPVKNQGQCGSCWAFSSTGAIEGQ--HYRKTNRLVNLSEQQ 201 Query: 485 LLSC 496 L+ C Sbjct: 202 LIDC 205 >UniRef50_Q24E33 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 328 Score = 38.3 bits (85), Expect = 0.24 Identities = 18/52 (34%), Positives = 26/52 (50%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 W + V++QGSCGSCWAF A+ +N + FS + L+ C Sbjct: 133 WTAQGAVTPVKNQGSCGSCWAFSTTGALEGSYFLKNN--QLISFSEQQLVDC 182 >UniRef50_Q23FS6 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 330 Score = 38.3 bits (85), Expect = 0.24 Identities = 15/52 (28%), Positives = 24/52 (46%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 W ++ V+DQG CGSCWAF ++ + + S + L+ C Sbjct: 123 WVTRGKVSAVKDQGQCGSCWAFSTTGSVESALIIAGYANQTIDLSEQQLVDC 174 >UniRef50_Q22AB1 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 344 Score = 38.3 bits (85), Expect = 0.24 Identities = 24/64 (37%), Positives = 32/64 (50%) Frame = +2 Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484 + LPE+FD RDK P + Q +CGSCW F A + + G + HFS + Sbjct: 129 SDLPESFDWRDKGIITPA----KFQNTCGSCWTF-ATTGVIESQYALKYG-ELLHFSEQM 182 Query: 485 LLSC 496 LL C Sbjct: 183 LLDC 186 >UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathepsin o - Aedes aegypti (Yellowfever mosquito) Length = 375 Score = 38.3 bits (85), Expect = 0.24 Identities = 19/39 (48%), Positives = 23/39 (58%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMT 427 LP+ D RDK P VR QGSCG+CWA V+ +T Sbjct: 153 LPKVVDWRDKGVVAP----VRSQGSCGACWAISVVDTIT 187 >UniRef50_O96163 Cluster: Cysteine protease, putative; n=5; Plasmodium|Rep: Cysteine protease, putative - Plasmodium falciparum (isolate 3D7) Length = 946 Score = 38.3 bits (85), Expect = 0.24 Identities = 29/87 (33%), Positives = 40/87 (45%), Gaps = 3/87 (3%) Frame = +2 Query: 335 DKWPD---CPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPI 505 D+W D C + EV +QG+CG CW F + C G HF SA + +C Sbjct: 529 DRWKDKTGCISKIEVEEQGNCGLCWIFASKLHFETIRC--MRGYGHFRSSALYVANC--- 583 Query: 506 CDWDAAEECRD*LGNIGSTSV*YQEVV 586 D D+ E C +GS V + E+V Sbjct: 584 SDRDSDEIC-----FVGSNPVEFLEIV 605 >UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra erinaceieuropaei|Rep: Cysteine proteinase - Spirometra erinaceieuropaei (Tapeworm) Length = 336 Score = 38.3 bits (85), Expect = 0.24 Identities = 18/62 (29%), Positives = 29/62 (46%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 L EN W + + V++QG CGSCW+F A A+ + + + S + L+ Sbjct: 117 LKENLPDSVNWRERGAVTSVKNQGQCGSCWSFSANGAIEGAIQIKTGALR--SLSEQQLM 174 Query: 491 SC 496 C Sbjct: 175 DC 176 >UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Plasmodium (Vinckeia)|Rep: Cysteine proteinase precursor - Plasmodium vinckei Length = 506 Score = 38.3 bits (85), Expect = 0.24 Identities = 25/83 (30%), Positives = 42/83 (50%), Gaps = 8/83 (9%) Frame = +2 Query: 272 LPIKTH--NFDLIA------SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMT 427 +P+K H N +LI+ P++ D R K+ P +DQG+CGSCWAF A+ Sbjct: 242 VPLKKHLANTNLISVDNKSKDFPDSRDYRSKFNFLPP----KDQGNCGSCWAFAAI-GNF 296 Query: 428 DRVCTYSNGTKHFHFSAEDLLSC 496 + + ++ FS + ++ C Sbjct: 297 EYLYVHTRHEMPISFSEQQMVDC 319 >UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleostomi|Rep: Cathepsin O precursor - Homo sapiens (Human) Length = 321 Score = 38.3 bits (85), Expect = 0.24 Identities = 19/39 (48%), Positives = 23/39 (58%) Frame = +2 Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424 SLP FD RDK + +VR+Q CG CWAF V A+ Sbjct: 107 SLPLRFDWRDK----QVVTQVRNQQMCGGCWAFSVVGAV 141 >UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|Rep: Cathepsin F precursor - Homo sapiens (Human) Length = 484 Score = 38.3 bits (85), Expect = 0.24 Identities = 20/56 (35%), Positives = 28/56 (50%) Frame = +2 Query: 329 PRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 P W + +V+DQG CGSCWAF +V + + GT S ++LL C Sbjct: 273 PEWDWRSKGAVTKVKDQGMCGSCWAF-SVTGNVEGQWFLNQGTL-LSLSEQELLDC 326 >UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2; Entamoeba|Rep: Cysteine proteinase ACP1 precursor - Entamoeba histolytica Length = 308 Score = 38.3 bits (85), Expect = 0.24 Identities = 17/46 (36%), Positives = 24/46 (52%) Frame = +2 Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 +N +DQG CGSCW F + RV + K + FS + L+ C Sbjct: 103 MNPAKDQGQCGSCWTFCTTAVLEGRV--NKDLGKLYSFSEQQLVDC 146 >UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; n=23; Magnoliophyta|Rep: Senescence-specific cysteine protease - Arabidopsis thaliana (Mouse-ear cress) Length = 346 Score = 37.9 bits (84), Expect = 0.32 Identities = 23/63 (36%), Positives = 31/63 (49%) Frame = +2 Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487 +LP + D R K P +++QGSCG CWAF AV A+ T K S + L Sbjct: 129 ALPVSVDWRKKGAVTP----IKNQGSCGCCWAFSAVAAIEG--ATQIKKGKLISLSEQQL 182 Query: 488 LSC 496 + C Sbjct: 183 VDC 185 >UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 343 Score = 37.9 bits (84), Expect = 0.32 Identities = 17/48 (35%), Positives = 24/48 (50%) Frame = +2 Query: 368 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICD 511 ++ QG CGSCWAF A+ V G + S++ LL C + D Sbjct: 153 IKYQGPCGSCWAFATAAAIESAVSISGGGLQ--SLSSQQLLDCTVVSD 198 >UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum|Rep: Falcipain 2 - Plasmodium falciparum Length = 484 Score = 37.9 bits (84), Expect = 0.32 Identities = 20/61 (32%), Positives = 31/61 (50%), Gaps = 1/61 (1%) Frame = +2 Query: 317 ENFDPRD-KWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLS 493 ENFD W + V+DQ +CGSCWAF ++ ++ + N K S ++L+ Sbjct: 258 ENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKN--KLITLSEQELVD 315 Query: 494 C 496 C Sbjct: 316 C 316 >UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein A; n=2; Dictyostelium discoideum|Rep: Gamete and mating-type specific protein A - Dictyostelium discoideum (Slime mold) Length = 448 Score = 37.9 bits (84), Expect = 0.32 Identities = 17/45 (37%), Positives = 25/45 (55%), Gaps = 2/45 (4%) Frame = +2 Query: 368 VRDQGSCGSCWAFGAVEAMTDR-VCTYSNGTKH-FHFSAEDLLSC 496 +RDQG CGSCWAF + A+ R + Y K S ++ ++C Sbjct: 253 IRDQGQCGSCWAFASSAALESRYLIKYGTAQKSTLQLSNQNAVNC 297 >UniRef50_Q24HI2 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 356 Score = 37.9 bits (84), Expect = 0.32 Identities = 20/74 (27%), Positives = 35/74 (47%) Frame = +2 Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454 P+K N + +PE+ + W D ++ V+DQ +CGSCW F A+ + + Sbjct: 116 PMKIQNKKNV-QVPESIN----WKDLNKVSPVKDQQNCGSCWTFSTTGAIESHYAIFED- 169 Query: 455 TKHFHFSAEDLLSC 496 + S + L+ C Sbjct: 170 VEPTSLSEQQLIDC 183 >UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platyhelminthes|Rep: Cathepsin L-like proteinase - Echinococcus multilocularis Length = 338 Score = 37.9 bits (84), Expect = 0.32 Identities = 21/62 (33%), Positives = 32/62 (51%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490 +P++ D R K P ++DQG CGSCWAF A A+ ++ + K S + L+ Sbjct: 122 VPDSIDWRKKGLVTP----IKDQGDCGSCWAFSATGALEGQLKRKTG--KLISLSEQQLV 175 Query: 491 SC 496 C Sbjct: 176 DC 177 >UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, whole genome shotgun sequence; n=3; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_2, whole genome shotgun sequence - Paramecium tetraurelia Length = 376 Score = 37.9 bits (84), Expect = 0.32 Identities = 17/46 (36%), Positives = 24/46 (52%) Frame = +2 Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 + EV+ QG CGSCWAF + + R+ +N K S L+ C Sbjct: 175 VTEVQQQGRCGSCWAFAVQDVVISRL-AIANKNKLDQLSKTHLIDC 219 >UniRef50_A0CGA3 Cluster: Chromosome undetermined scaffold_179, whole genome shotgun sequence; n=3; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_179, whole genome shotgun sequence - Paramecium tetraurelia Length = 339 Score = 37.9 bits (84), Expect = 0.32 Identities = 20/83 (24%), Positives = 44/83 (53%) Frame = +2 Query: 248 YRDEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMT 427 ++++ + ++ + P ++ ++ +P C ++V +QG+C S ++ + + Sbjct: 104 FKNDFTQQINVEKCKLSFMDETPVYYNFKEAYPQCN--HQVYNQGNCSSSYSIAVSSSFS 161 Query: 428 DRVCTYSNGTKHFHFSAEDLLSC 496 DRVC N T+ SA++LLSC Sbjct: 162 DRVCK-QNQTQ--QLSAQNLLSC 181 >UniRef50_Q649T1 Cluster: Cathepsin C; n=1; uncultured archaeon GZfos34G5|Rep: Cathepsin C - uncultured archaeon GZfos34G5 Length = 760 Score = 37.9 bits (84), Expect = 0.32 Identities = 21/43 (48%), Positives = 27/43 (62%), Gaps = 1/43 (2%) Frame = +2 Query: 299 LIASLP-ENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424 L AS+P FD RDK + V++QGSCGSC AFG + A+ Sbjct: 301 LDASVPIGTFDWRDK-DGANWITSVKEQGSCGSCVAFGTIGAL 342 >UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 37.5 bits (83), Expect = 0.42 Identities = 17/38 (44%), Positives = 24/38 (63%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424 LPE+ D W ++ VRDQG+CGSC+AF + A+ Sbjct: 127 LPESVD----WRKLGAVSPVRDQGNCGSCYAFASTGAL 160 >UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cathepsin L; n=4; Danio rerio|Rep: Novel protein similar to vertebrate cathepsin L - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 334 Score = 37.5 bits (83), Expect = 0.42 Identities = 16/46 (34%), Positives = 26/46 (56%) Frame = +2 Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 + EV+DQG CGSCW+F A+ ++ Y + + S + L+ C Sbjct: 130 VTEVKDQGYCGSCWSFSTTGAIEGQM--YKHTGRLVSLSEQQLVDC 173 >UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; Roseiflexus|Rep: Peptidase C1A, papain precursor - Roseiflexus sp. RS-1 Length = 1202 Score = 37.5 bits (83), Expect = 0.42 Identities = 17/35 (48%), Positives = 20/35 (57%), Gaps = 3/35 (8%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAF---GAVEAMTDRV 436 W D V+DQG CGSCWAF G VE+ R+ Sbjct: 175 WCDQGACTPVKDQGVCGSCWAFATTGVVESALKRI 209 >UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestivum|Rep: Cysteine protease - Triticum aestivum (Wheat) Length = 371 Score = 37.5 bits (83), Expect = 0.42 Identities = 20/61 (32%), Positives = 28/61 (45%) Frame = +2 Query: 314 PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLS 493 P FD W + + + QG+CG CWAF A A T NG + S ++L+ Sbjct: 154 PRQFD----WREHGVVTPAKQQGACGCCWAFAA--AATVESLNKINGGELVDLSVQELVD 207 Query: 494 C 496 C Sbjct: 208 C 208 >UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber officinale (Ginger) Length = 475 Score = 37.5 bits (83), Expect = 0.42 Identities = 17/38 (44%), Positives = 25/38 (65%) Frame = +2 Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424 LP++ D R+K + V++QG CGSCWAF A+ A+ Sbjct: 143 LPDSIDWREKG----AVVAVKNQGRCGSCWAFAAIAAV 176 >UniRef50_Q0JP65 Cluster: Os01g0240900 protein; n=3; Oryza sativa|Rep: Os01g0240900 protein - Oryza sativa subsp. japonica (Rice) Length = 166 Score = 37.5 bits (83), Expect = 0.42 Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 3/55 (5%) Frame = +2 Query: 341 WPDCPTLNEVRDQGSCGSCWAF---GAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496 W D + +V+ QG+C SCWAF GAVE D N + S + L++C Sbjct: 104 WRDRGAVTDVKMQGTCASCWAFSTTGAVEG--DNFLASGNLRNLLNLSEQQLVNC 156 Database: uniref50 Posted date: Oct 5, 2007 11:19 AM Number of letters in database: 575,637,011 Number of sequences in database: 1,657,284 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.279 0.0580 0.190 Matrix: BLOSUM62 Gap Penalties: Existence: 9, Extension: 2 Number of Hits to DB: 878,143,735 Number of Sequences: 1657284 Number of extensions: 18445955 Number of successful extensions: 54072 Number of sequences better than 10.0: 391 Number of HSP's better than 10.0 without gapping: 50872 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 53967 length of database: 575,637,011 effective HSP length: 100 effective length of database: 409,908,611 effective search space used: 74193458591 frameshift window, decay const: 40, 0.1 T: 12 A: 40 X1: 16 ( 7.3 bits) X2: 37 (14.9 bits) X3: 62 (25.0 bits) S1: 41 (21.7 bits)
- SilkBase 1999-2023 -