BLASTX 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= fbS20091
         (714 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpw...   136   7e-31
UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin ...   120   5e-26
UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Ca...   119   6e-26
UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|...   108   1e-22
UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor; ...   107   2e-22
UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=...   107   3e-22
UniRef50_A1XG92 Cluster: Putative cathepsin B-like like proteina...   107   3e-22
UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 ...   103   6e-21
UniRef50_Q23FP9 Cluster: Papain family cysteine protease contain...   102   1e-20
UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=...   101   2e-20
UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome sh...    98   2e-19
UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n...    96   9e-19
UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase pr...    95   2e-18
UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase pr...    93   6e-18
UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n...    92   1e-17
UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep...    91   2e-17
UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1 ...    86   7e-16
UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma j...    85   2e-15
UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8....
85 2e-15 UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8.... 85 2e-15 UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA... 83 5e-15 UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep: ... 83 5e-15 UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2 ... 83 7e-15 UniRef50_Q237A1 Cluster: Papain family cysteine protease contain... 82 1e-14 UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7; Rhabditida|... 82 2e-14 UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep: Ca... 81 2e-14 UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4 ... 81 2e-14 UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115, w... 81 3e-14 UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep... 81 4e-14 UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep: Ca... 79 1e-13 UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Catheps... 77 3e-13 UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8; Leishma... 77 3e-13 UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2... 77 4e-13 UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyl... 77 6e-13 UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoid... 76 8e-13 UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4; Ancylostomati... 75 2e-12 UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3 ... 75 2e-12 UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8... 74 3e-12 UniRef50_P91991 Cluster: Putative uncharacterized protein; n=2; ... 74 3e-12 UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gamb... 73 5e-12 UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus co... 73 7e-12 UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator ame... 73 9e-12 UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep... 72 2e-11 UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1; Rhip... 
71 2e-11 UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin ... 71 3e-11 UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7... 69 1e-10 UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: C... 69 2e-10 UniRef50_Q00W63 Cluster: Cysteine proteinase; n=2; Ostreococcus|... 68 3e-10 UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep... 66 6e-10 UniRef50_Q625X3 Cluster: Putative uncharacterized protein CBG011... 63 1e-09 UniRef50_Q5DEC9 Cluster: SJCHGC06356 protein; n=1; Schistosoma j... 65 2e-09 UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|R... 64 2e-09 UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1; O... 64 4e-09 UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2; ... 62 1e-08 UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella ve... 62 2e-08 UniRef50_A4RYN2 Cluster: Predicted protein; n=1; Ostreococcus lu... 60 7e-08 UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1; ... 59 9e-08 UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomus... 58 2e-07 UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|R... 57 4e-07 UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia... 56 9e-07 UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Gi... 56 9e-07 UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Gi... 54 3e-06 UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|R... 54 3e-06 UniRef50_Q7Q9Y5 Cluster: ENSANGP00000012222; n=1; Anopheles gamb... 53 6e-06 UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2; cel... 50 6e-05 UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1; ... 50 8e-05 UniRef50_Q0PWU8 Cluster: Cathepsin B preproprotein-like protein;... 49 1e-04 UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;... 49 1e-04 UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like prote... 
48 2e-04 UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted... 48 3e-04 UniRef50_A0CYS7 Cluster: Chromosome undetermined scaffold_31, wh... 48 3e-04 UniRef50_P81494 Cluster: Cathepsin B; n=2; Phasianidae|Rep: Cath... 47 5e-04 UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb... 46 7e-04 UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali... 46 7e-04 UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35... 46 0.001 UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio... 46 0.001 UniRef50_UPI0000E4622C Cluster: PREDICTED: hypothetical protein;... 45 0.002 UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lambl... 45 0.002 UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA,... 45 0.002 UniRef50_UPI00006CB77B Cluster: Papain family cysteine protease ... 44 0.003 UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ... 44 0.003 UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestin... 44 0.003 UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ... 44 0.003 UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadida... 44 0.003 UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=... 44 0.003 UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Re... 44 0.004 UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru... 44 0.004 UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cy... 44 0.004 UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-li... 44 0.004 UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin ... 44 0.005 UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re... 44 0.005 UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big... 44 0.005 UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re... 44 0.005 UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Try... 
44 0.005 UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr... 44 0.005 UniRef50_Q5VUI9 Cluster: Tubulointerstitial nephritis antigen; n... 44 0.005 UniRef50_Q9UJW2 Cluster: Tubulointerstitial nephritis antigen; n... 44 0.005 UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease ... 43 0.007 UniRef50_Q58HK5 Cluster: Hc58; n=1; Haemonchus contortus|Rep: Hc... 43 0.007 UniRef50_Q23RT7 Cluster: Papain family cysteine protease contain... 43 0.007 UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy... 43 0.007 UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory... 43 0.007 UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ... 43 0.009 UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n... 43 0.009 UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy... 43 0.009 UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin ... 42 0.011 UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ... 42 0.011 UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto... 42 0.011 UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; ... 42 0.015 UniRef50_A2GCC2 Cluster: Clan CA, family C1, cathepsin B-like cy... 42 0.015 UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag... 42 0.015 UniRef50_UPI0000E815AE Cluster: PREDICTED: similar to glucocorti... 42 0.020 UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n... 42 0.020 UniRef50_Q24E33 Cluster: Papain family cysteine protease contain... 42 0.020 UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precurs... 42 0.020 UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ... 42 0.020 UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel... 42 0.020 UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ... 41 0.026 UniRef50_Q5CKC2 Cluster: Preprocathepsin c; n=2; Cryptosporidium... 
41 0.026 UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1... 41 0.026 UniRef50_Q22W19 Cluster: Papain family cysteine protease contain... 41 0.026 UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh... 41 0.026 UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=... 41 0.026 UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Pl... 41 0.026 UniRef50_Q9GZM7-2 Cluster: Isoform 2 of Q9GZM7 ; n=1; Homo sapie... 41 0.035 UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ... 41 0.035 UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr... 41 0.035 UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcop... 41 0.035 UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ... 41 0.035 UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, wh... 41 0.035 UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh... 41 0.035 UniRef50_Q649T1 Cluster: Cathepsin C; n=1; uncultured archaeon G... 41 0.035 UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ... 41 0.035 UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R... 41 0.035 UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ... 40 0.046 UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine p... 40 0.046 UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease ... 40 0.046 UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio... 40 0.046 UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa... 40 0.046 UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n... 40 0.046 UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foet... 40 0.046 UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain... 40 0.046 UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis... 40 0.046 UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3.... 
40 0.046 UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;... 40 0.061 UniRef50_O16454 Cluster: Temporarily assigned gene name protein ... 40 0.061 UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cy... 40 0.061 UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen... 40 0.061 UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, wh... 40 0.061 UniRef50_Q2FLC7 Cluster: Periplasmic copper-binding precursor; n... 40 0.061 UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ... 40 0.061 UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e... 40 0.061 UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi... 40 0.061 UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ... 40 0.061 UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p; ... 40 0.080 UniRef50_UPI0000E4866E Cluster: PREDICTED: similar to oxidized-L... 40 0.080 UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt... 40 0.080 UniRef50_Q7QRX3 Cluster: GLP_549_24108_24914; n=1; Giardia lambl... 40 0.080 UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain... 40 0.080 UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: ... 40 0.080 UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H... 40 0.080 UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryo... 40 0.080 UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18... 40 0.080 UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ... 39 0.11 UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;... 39 0.11 UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ... 39 0.11 UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lambl... 39 0.11 UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ... 39 0.11 UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n... 39 0.11 UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ... 
39 0.11 UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy... 39 0.11 UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; M... 39 0.11 UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi... 39 0.11 UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 p... 39 0.14 UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca... 39 0.14 UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa... 39 0.14 UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain... 39 0.14 UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathep... 39 0.14 UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-li... 39 0.14 UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ... 39 0.14 UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr... 39 0.14 UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto... 39 0.14 UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma... 39 0.14 UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome s... 38 0.19 UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n... 38 0.19 UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ... 38 0.19 UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ... 38 0.19 UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - ... 38 0.19 UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1... 38 0.19 UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, wh... 38 0.19 UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; The... 38 0.19 UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C... 38 0.19 UniRef50_UPI00006CBAC7 Cluster: Papain family cysteine protease ... 38 0.25 UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG... 38 0.25 UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; R... 38 0.25 UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhip... 
38 0.25 UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; ... 38 0.25 UniRef50_Q22DX2 Cluster: Papain family cysteine protease contain... 38 0.25 UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Pl... 38 0.25 UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ... 38 0.32 UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No... 38 0.32 UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep... 38 0.32 UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph... 38 0.32 UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er... 38 0.32 UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:... 38 0.32 UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata... 38 0.32 UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium... 38 0.32 UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ... 37 0.43 UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ... 37 0.43 UniRef50_A6GBK8 Cluster: Peptidase C1A, papain; n=1; Plesiocysti... 37 0.43 UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa... 37 0.43 UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil... 37 0.43 UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein... 37 0.43 UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lambli... 37 0.43 UniRef50_Q231X3 Cluster: Papain family cysteine protease contain... 37 0.43 UniRef50_Q18740 Cluster: Putative uncharacterized protein tag-32... 37 0.43 UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy... 37 0.43 UniRef50_O02470 Cluster: Cysteine proteinase; n=2; Chromadorea|R... 37 0.43 UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v... 37 0.43 UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like cy... 37 0.43 UniRef50_A2E6N1 Cluster: Clan CA, family C1, cathepsin B-like cy... 37 0.43 UniRef50_A0D5R4 Cluster: Chromosome undetermined scaffold_39, wh... 
37 0.43 UniRef50_A0CGA3 Cluster: Chromosome undetermined scaffold_179, w... 37 0.43 UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ... 37 0.57 UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ... 37 0.57 UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa... 37 0.57 UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei... 37 0.57 UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intest... 37 0.57 UniRef50_Q7QVE9 Cluster: GLP_542_3431_1206; n=1; Giardia lamblia... 37 0.57 UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox... 37 0.57 UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j... 37 0.57 UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R... 37 0.57 UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|... 37 0.57 UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|... 37 0.57 UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ... 36 0.75 UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal... 36 0.75 UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|... 36 0.75 UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcopt... 36 0.75 UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi... 36 0.75 UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip... 36 0.75 UniRef50_Q7R0G3 Cluster: GLP_29_33036_32140; n=1; Giardia lambli... 36 0.75 UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi... 36 0.75 UniRef50_Q24HI2 Cluster: Papain family cysteine protease contain... 36 0.75 UniRef50_Q23TW3 Cluster: Papain family cysteine protease contain... 36 0.75 UniRef50_Q23H32 Cluster: Papain family cysteine protease contain... 36 0.75 UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi... 36 0.75 UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, wh... 36 0.75 UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w... 
36 0.75 UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ... 36 0.99 UniRef50_Q3A8I3 Cluster: Putative serine protease; n=1; Pelobact... 36 0.99 UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ... 36 0.99 UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus ... 36 0.99 UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lambli... 36 0.99 UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;... 36 0.99 UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3; Theil... 36 0.99 UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; ... 36 0.99 UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4; Caenorhabdi... 36 0.99 UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:... 36 0.99 UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve... 36 0.99 UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like cy... 36 0.99 UniRef50_Q6ZRQ1 Cluster: NOL1/NOP2/Sun domain family member 4; n... 36 0.99 UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ... 36 0.99 UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:... 36 0.99 UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M... 36 0.99 UniRef50_A0LES1 Cluster: Peptidase C1A, papain precursor; n=2; S... 36 1.3 UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryz... 36 1.3 UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia ... 36 1.3 UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy... 36 1.3 UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar ... 36 1.3 UniRef50_Q1AMF3 Cluster: Cathepsin C1; n=1; Toxoplasma gondii|Re... 36 1.3 UniRef50_A5KBM2 Cluster: Serine-repeat antigen; n=3; Plasmodium|... 36 1.3 UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic... 36 1.3 UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D... 36 1.3 UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ... 
36 1.3 UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;... 36 1.3 UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2... 36 1.3 UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,... 35 1.7 UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ... 35 1.7 UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin ... 35 1.7 UniRef50_Q6XM39 Cluster: FirrV-1-A48 precursor; n=1; Feldmannia ... 35 1.7 UniRef50_Q60EQ9 Cluster: Putative uncharacterized protein OJ1280... 35 1.7 UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcop... 35 1.7 UniRef50_Q23FS6 Cluster: Papain family cysteine protease contain... 35 1.7 UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain... 35 1.7 UniRef50_Q239L8 Cluster: Papain family cysteine protease contain... 35 1.7 UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C... 35 1.7 UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like cy... 35 1.7 UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precu... 35 1.7 UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h... 35 2.3 UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica... 35 2.3 UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat... 35 2.3 UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster... 35 2.3 UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G... 35 2.3 UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus... 35 2.3 UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; ... 35 2.3 UniRef50_Q23H10 Cluster: Papain family cysteine protease contain... 35 2.3 UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain... 35 2.3 UniRef50_Q22A69 Cluster: Papain family cysteine protease contain... 35 2.3 UniRef50_A7APS9 Cluster: Papain family cysteine protease contain... 35 2.3 UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, who... 
35 2.3 UniRef50_Q0W7Z1 Cluster: Chemotaxis MCP methylation-inhibitor; n... 35 2.3 UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs... 35 2.3 UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ... 34 3.0 UniRef50_Q0JP65 Cluster: Os01g0240900 protein; n=3; Oryza sativa... 34 3.0 UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca... 34 3.0 UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba hi... 34 3.0 UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n... 34 3.0 UniRef50_Q4YCM9 Cluster: Cysteine protease, putative; n=5; Plasm... 34 3.0 UniRef50_Q3L7L7 Cluster: Sar s 1 allergen Yv5020C01; n=1; Sarcop... 34 3.0 UniRef50_Q23H15 Cluster: Papain family cysteine protease contain... 34 3.0 UniRef50_Q235G6 Cluster: Papain family cysteine protease contain... 34 3.0 UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat... 34 3.0 UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-li... 34 3.0 UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, wh... 34 3.0 UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc... 34 3.0 UniRef50_A0LQG9 Cluster: Peptidase C1A, papain; n=1; Syntrophoba... 34 4.0 UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t... 34 4.0 UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3; ... 34 4.0 UniRef50_Q01E11 Cluster: Cysteine proteinase Cathepsin F; n=3; O... 34 4.0 UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-... 34 4.0 UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum... 34 4.0 UniRef50_Q7QQ83 Cluster: GLP_42_16392_14707; n=1; Giardia lambli... 34 4.0 UniRef50_Q23H06 Cluster: Papain family cysteine protease contain... 34 4.0 UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve... 34 4.0 UniRef50_A2ERV3 Cluster: Putative uncharacterized protein; n=1; ... 34 4.0 UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; ... 
34 4.0 UniRef50_Q5FQP2 Cluster: Putative uncharacterized protein; n=1; ... 33 5.3 UniRef50_Q5YER6 Cluster: Cathepsin Z; n=1; Bigelowiella natans|R... 33 5.3 UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ... 33 5.3 UniRef50_Q01FU9 Cluster: Cathepsin Z; n=2; Ostreococcus|Rep: Cat... 33 5.3 UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2; ... 33 5.3 UniRef50_Q9XZI2 Cluster: Cathepsin Z1 preproprotein; n=3; Bilate... 33 5.3 UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ... 33 5.3 UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|... 33 5.3 UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-... 33 5.3 UniRef50_Q6A1H9 Cluster: Cathepsin X/O; n=1; Suberites domuncula... 33 5.3 UniRef50_Q61CG7 Cluster: Putative uncharacterized protein CBG129... 33 5.3 UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;... 33 5.3 UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia... 33 5.3 UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl... 33 5.3 UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain... 33 5.3 UniRef50_Q23EG5 Cluster: Papain family cysteine protease contain... 33 5.3 UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain... 33 5.3 UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain... 33 5.3 UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus... 33 5.3 UniRef50_O96163 Cluster: Cysteine protease, putative; n=5; Plasm... 33 5.3 UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina... 33 5.3 UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, w... 33 5.3 UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome s... 33 7.0 UniRef50_Q8D2E4 Cluster: YbgF protein; n=1; Wigglesworthia gloss... 33 7.0 UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty... 33 7.0 UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae... 
33 7.0 UniRef50_Q7QWN9 Cluster: GLP_26_50243_51811; n=2; Giardia lambli... 33 7.0 UniRef50_Q7QQ92 Cluster: GLP_243_18349_20043; n=1; Giardia lambl... 33 7.0 UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n... 33 7.0 UniRef50_Q26015 Cluster: Serine rich protein homologue; n=4; Pla... 33 7.0 UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dv... 33 7.0 UniRef50_A5KBM1 Cluster: Serine-repeat antigen; n=1; Plasmodium ... 33 7.0 UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, w... 33 7.0 UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; The... 33 7.0 UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov... 33 7.0 UniRef50_Q9QME4 Cluster: Gag polyprotein; n=78; root|Rep: Gag po... 33 9.2 UniRef50_Q2SMU4 Cluster: Putative uncharacterized protein; n=1; ... 33 9.2 UniRef50_Q0FGC4 Cluster: Putative uncharacterized protein; n=1; ... 33 9.2 UniRef50_A7PJI4 Cluster: Chromosome chr12 scaffold_18, whole gen... 33 9.2 UniRef50_Q4N7X2 Cluster: Putative uncharacterized protein; n=1; ... 33 9.2 UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Pir... 33 9.2 UniRef50_Q3SDA0 Cluster: Mini antigen; n=1; Paramecium tetraurel... 33 9.2 UniRef50_Q248G1 Cluster: Papain family cysteine protease contain... 33 9.2 UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteina... 33 9.2 UniRef50_A4VE98 Cluster: Cathepsin z; n=3; Tetrahymena thermophi... 33 9.2 UniRef50_A2DCQ3 Cluster: Beige/BEACH domain containing protein; ... 33 9.2 UniRef50_A0BZE1 Cluster: Chromosome undetermined scaffold_139, w... 33 9.2 UniRef50_Q1DTN0 Cluster: Predicted protein; n=1; Coccidioides im... 33 9.2 UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; D... 
33 9.2 >UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpwnx02 - Periplaneta americana (American cockroach) Length = 343 Score = 136 bits (328), Expect = 7e-31 Identities = 55/85 (64%), Positives = 63/85 (74%) Frame = +3 Query: 255 YSNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEI 434 +S G HFHFSAEDLL+CC CG GC+GG P AW+YW G+VSGGSYNS QGC+PY I Sbjct: 137 HSKGKTHFHFSAEDLLTCCSSCGFGCNGGEPGAAWDYWVSTGIVSGGSYNSHQGCQPYAI 196 Query: 435 PPCEHHVPGNRMPCSGDTKTPKCTK 509 PCEHHV G R PC G+ TP+C K Sbjct: 197 EPCEHHVNGTRKPC-GEGDTPRCVK 220 Score = 111 bits (267), Expect = 2e-23 Identities = 49/84 (58%), Positives = 61/84 (72%) Frame = +1 Query: 1 AGRNFPRDTSFAHLKKIMGVIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNE 180 A RNF D +KK+MGV + LP K+ + D+ +PE FDPR++WP+CPTL E Sbjct: 53 AHRNFGNDIPLREIKKLMGVRRSLENFRLPEKSME-DIDIEIPEEFDPREQWPECPTLKE 111 Query: 181 VRDQGSCGSCWAFGAVEAMTDRVC 252 +RDQGSCGSCWAFGAVEAM+DRVC Sbjct: 112 IRDQGSCGSCWAFGAVEAMSDRVC 135 Score = 80.2 bits (189), Expect = 5e-14 Identities = 34/69 (49%), Positives = 42/69 (60%) Frame = +2 Query: 506 KKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYK 685 K+CE GYDV Y +D+ +GK Y V G I+ EL NGP E A TVY D L Y++GVY+ Sbjct: 220 KRCEEGYDVPYGKDRHFGKSAYAVPGSVKAIQKELLLNGPAEAALTVYDDFLHYRTGVYQ 279 Query: 686 HTQGDVSAG 712 H G G Sbjct: 280 HVSGGALGG 288 >UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin B; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin B - Strongylocentrotus purpuratus Length = 346 Score = 120 bits (288), Expect = 5e-26 Identities = 46/82 (56%), Positives = 59/82 (71%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIP 437 S G H SAEDL++CC CG GC+GG P AWEY+K G+V+GG +NSSQGC+PY+I Sbjct: 123 SKGQTQVHISAEDLMTCCKTCGNGCNGGFPGSAWEYYKDTGIVTGGQWNSSQGCQPYQIK 182 Query: 438 PCEHHVPGNRMPCSGDTKTPKC 503 C+HHV G + PC G+ TP+C Sbjct: 183 SCDHHVNGTKGPCQGEGPTPEC 204 Score = 95.1 bits (226), Expect = 2e-18 Identities = 44/84 (52%), Positives = 57/84 (67%) 
Frame = +1 Query: 1 AGRNFPRDTSFAHLKKIMGVIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNE 180 AG NF ++++G +K+ + LP K I LPENFD R+ WP+CPT+ E Sbjct: 40 AGINF-EGWQLDDFRRMLGALKNPN-GRLP-KLENQTRIKDLPENFDARENWPNCPTIKE 96 Query: 181 VRDQGSCGSCWAFGAVEAMTDRVC 252 VRDQGSCGSCWAFGAVEA++DR+C Sbjct: 97 VRDQGSCGSCWAFGAVEAISDRIC 120 Score = 61.7 bits (143), Expect = 2e-08 Identities = 26/56 (46%), Positives = 35/56 (62%) Frame = +2 Query: 509 KCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSG 676 KCE+ Y Y+QDK Y V ++S + + + E+ NGPVE FTVY D +YKSG Sbjct: 207 KCEASYSTPYEQDKHYALSVNSISNNPEATQTEIMTNGPVEADFTVYEDFPTYKSG 262 >UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Cathepsin B1) (APP secretase) (APPS) [Contains: Cathepsin B light chain; Cathepsin B heavy chain]; n=85; Eukaryota|Rep: Cathepsin B precursor (EC 3.4.22.1) (Cathepsin B1) (APP secretase) (APPS) [Contains: Cathepsin B light chain; Cathepsin B heavy chain] - Homo sapiens (Human) Length = 339 Score = 119 bits (287), Expect = 6e-26 Identities = 50/86 (58%), Positives = 60/86 (69%), Gaps = 1/86 (1%) Frame = +3 Query: 255 YSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYE 431 ++N SAEDLL+CC +CG GC+GG P AW +W GLVSGG Y S GCRPY Sbjct: 124 HTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYS 183 Query: 432 IPPCEHHVPGNRMPCSGDTKTPKCTK 509 IPPCEHHV G+R PC+G+ TPKC+K Sbjct: 184 IPPCEHHVNGSRPPCTGEGDTPKCSK 209 Score = 97.1 bits (231), Expect = 4e-19 Identities = 43/72 (59%), Positives = 51/72 (70%) Frame = +2 Query: 497 KMHKKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSG 676 K K CE GY YKQDK YG + Y+VS E I AE++KNGPVEGAF+VYSD L YKSG Sbjct: 206 KCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSG 265 Query: 677 VYKHTQGDVSAG 712 VY+H G++ G Sbjct: 266 VYQHVTGEMMGG 277 Score = 86.2 bits (204), Expect = 7e-16 Identities = 39/84 (46%), Positives = 53/84 (63%) Frame = +1 Query: 1 AGRNFPRDTSFAHLKKIMGVIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNE 180 AG NF + ++LK++ G P + LP +FD R++WP CPT+ E Sbjct: 43 
AGHNF-YNVDMSYLKRLCGTFLG---GPKPPQRVMFTEDLKLPASFDAREQWPQCPTIKE 98 Query: 181 VRDQGSCGSCWAFGAVEAMTDRVC 252 +RDQGSCGSCWAFGAVEA++DR+C Sbjct: 99 IRDQGSCGSCWAFGAVEAISDRIC 122 >UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|Rep: Cathepsin B5 - Clonorchis sinensis Length = 343 Score = 108 bits (260), Expect = 1e-22 Identities = 50/109 (45%), Positives = 65/109 (59%), Gaps = 4/109 (3%) Frame = +3 Query: 255 YSNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEI 434 +SNG + SA DLLSCC CG GC GG P +AW+YWK G+V+GGS GCR Y Sbjct: 130 HSNGAFNKSLSAVDLLSCCKDCGFGCRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPF 189 Query: 435 PPCEHHVPGNRMPCSGDT-KTPKCTKNANL-DTTLITNKT--NNTENMY 569 P CEHHV G+ PC + TP+C + + D + +KT N + N+Y Sbjct: 190 PKCEHHVQGHYPPCPRELYPTPECVQQCDTPDVGYLEDKTRANMSYNIY 238 Score = 79.8 bits (188), Expect = 6e-14 Identities = 30/43 (69%), Positives = 37/43 (86%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 LP+NFD R WP C +++E+RDQ SCGSCWAFGAVEAM+DR+C Sbjct: 86 LPKNFDARKTWPHCSSISEIRDQSSCGSCWAFGAVEAMSDRLC 128 Score = 54.8 bits (126), Expect = 2e-06 Identities = 27/69 (39%), Positives = 36/69 (52%) Frame = +2 Query: 506 KKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYK 685 ++C++ DV Y +DK Y + E I E+ GPVE FT+Y D L Y SGVY Sbjct: 215 QQCDTP-DVGYLEDKTRANMSYNIYASEISIMKEIMLRGPVEAIFTMYEDFLRYSSGVYF 273 Query: 686 HTQGDVSAG 712 H G +G Sbjct: 274 HALGAPMSG 282 >UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin B-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 331 Score = 107 bits (258), Expect = 2e-22 Identities = 44/83 (53%), Positives = 52/83 (62%), Gaps = 1/83 (1%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIP 437 S G SAE+LLSCC CG GC GG P +AW YW G+ +GG Y S QGC+PY + Sbjct: 125 SQGKLKVPVSAENLLSCCDSCGYGCEGGYPTMAWSYWIDTGITTGGLYGSKQGCQPYSLQ 184 Query: 438 PCEHHVPGNRMPCSG-DTKTPKC 503 PCEHH GN++ CS D TP C 
Sbjct: 185 PCEHHTEGNKVQCSTLDYDTPSC 207 Score = 66.1 bits (154), Expect = 8e-10 Identities = 32/85 (37%), Positives = 46/85 (54%), Gaps = 1/85 (1%) Frame = +1 Query: 1 AGRNFPRDTSFAHLKKIMGVIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCP-TLN 177 AG+NF + S +K ++G K + TH D+ +P +FD R+ W +C ++ Sbjct: 41 AGKNFDENLSIQEIKNLLGAKKGK-LGVAKEFTHSEDI--QVPNSFDARENWKECSDVIS 97 Query: 178 EVRDQGSCGSCWAFGAVEAMTDRVC 252 V DQ CGSCWA A AM+DR C Sbjct: 98 TVVDQSDCGSCWAVAAASAMSDRRC 122 Score = 58.4 bits (135), Expect = 2e-07 Identities = 28/68 (41%), Positives = 39/68 (57%) Frame = +2 Query: 509 KCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKH 688 KC+ +NYK + +G +I+ E+ NGPVE AF VYSD ++YKSGVY+H Sbjct: 210 KCDDSA-LNYKSELTFGSGSVRNFYSVANIQKEILTNGPVEAAFDVYSDFVNYKSGVYQH 268 Query: 689 TQGDVSAG 712 G+ G Sbjct: 269 VAGEYLGG 276 >UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=1; Biomphalaria glabrata|Rep: Cathepsin B preproprotein precursor - Biomphalaria glabrata (Bloodfluke planorb) Length = 333 Score = 107 bits (257), Expect = 3e-22 Identities = 40/82 (48%), Positives = 52/82 (63%) Frame = +3 Query: 264 GTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPC 443 G + H SAED+ CC CG+GC+GG P AWE++ G+VSGG Y +++GC PY +P C Sbjct: 132 GKGNIHISAEDINDCCKSCGMGCNGGYPAAAWEWYVDTGVVSGGQYGTNEGCMPYSLPHC 191 Query: 444 EHHVPGNRMPCSGDTKTPKCTK 509 +HH G PC TPKC K Sbjct: 192 DHHTTGKYQPCPAVVPTPKCEK 213 Score = 96.7 bits (230), Expect = 5e-19 Identities = 41/85 (48%), Positives = 58/85 (68%), Gaps = 1/85 (1%) Frame = +1 Query: 1 AGRNF-PRDTSFAHLKKIMGVIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLN 177 AGRNF P + A + + +++ + + +K ++ LP+NFDPR KWPDC +LN Sbjct: 45 AGRNFHPAEIKRARALLGVNMAENKAYNRIHLKYKQVQPRNDLPDNFDPRTKWPDCASLN 104 Query: 178 EVRDQGSCGSCWAFGAVEAMTDRVC 252 E+RDQ +CGSCWAFG+ EAMTDR+C Sbjct: 105 EIRDQANCGSCWAFGSAEAMTDRIC 129 Score = 77.0 bits (181), Expect = 4e-13 Identities = 38/72 (52%), Positives = 43/72 (59%) Frame = +2 Query: 497 KMHKKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSG 676 K 
KKC +GY +Y DK GK Y V G + I EL NGPV AF VYSD LSYK+G Sbjct: 210 KCEKKCLTGYPKSYSNDKTRGKKSYGVRGVQS-IMQELVDNGPVTAAFDVYSDFLSYKTG 268 Query: 677 VYKHTQGDVSAG 712 VY+HT G G Sbjct: 269 VYRHTTGSYEGG 280 >UniRef50_A1XG92 Cluster: Putative cathepsin B-like like proteinase; n=1; Tenebrio molitor|Rep: Putative cathepsin B-like like proteinase - Tenebrio molitor (Yellow mealworm) Length = 301 Score = 107 bits (257), Expect = 3e-22 Identities = 46/86 (53%), Positives = 67/86 (77%), Gaps = 2/86 (2%) Frame = +1 Query: 1 AGRNFPRDTSFAHLKKIMGVI-KDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTL- 174 AGRNF +T +H+++++GV+ K + LP+KTH ++L A +PE+FD R+ WP+C ++ Sbjct: 43 AGRNFDVNTPISHVRRLLGVLPKKANAPKLPVKTHAVNLDA-IPESFDAREAWPECTSII 101 Query: 175 NEVRDQGSCGSCWAFGAVEAMTDRVC 252 E+RDQ SCGSCWAFGAVEAM+DR+C Sbjct: 102 GEIRDQASCGSCWAFGAVEAMSDRIC 127 Score = 105 bits (253), Expect = 8e-22 Identities = 43/93 (46%), Positives = 56/93 (60%) Frame = +3 Query: 255 YSNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEI 434 +S+ + SAEDL CC CG GC+GG P LAW YW G+V+GG Y +GC+ Y I Sbjct: 129 HSDASVKVRISAEDLNDCCYDCGDGCNGGWPDLAWSYWSSTGIVTGGLYGVDEGCKAYSI 188 Query: 435 PPCEHHVPGNRMPCSGDTKTPKCTKNANLDTTL 533 PC+HHV GN PC +TP C K+ + + L Sbjct: 189 KPCDHHVDGNLGPCGDIQRTPACKKSCDSTSDL 221 Score = 54.0 bits (124), Expect = 3e-06 Identities = 24/56 (42%), Positives = 34/56 (60%) Frame = +2 Query: 506 KKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKS 673 K C+S D+ YK D + G Y++ E I+ E+ NGPVE + VYSD L+YK+ Sbjct: 213 KSCDSTSDLEYKSDLRRGS-AYSIPKSESQIQTEIMTNGPVEADYDVYSDFLTYKA 267 >UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 precursor; n=11; Bilateria|Rep: Cathepsin B-like cysteine proteinase 6 precursor - Caenorhabditis elegans Length = 379 Score = 103 bits (246), Expect = 6e-21 Identities = 45/93 (48%), Positives = 54/93 (58%), Gaps = 2/93 (2%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIP 437 S+G SA+DLLSCC CG GC+GG P AW YW G+V+G +Y ++ GC+PY P Sbjct: 150 
SHGELQVTLSADDLLSCCKSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFP 209 Query: 438 PCEHHVPGNRM-PCSGDT-KTPKCTKNANLDTT 530 PCEHH PC D TPKC K D T Sbjct: 210 PCEHHSKKTHFDPCPHDLYPTPKCEKKCVSDYT 242 Score = 82.2 bits (194), Expect = 1e-14 Identities = 33/53 (62%), Positives = 41/53 (77%) Frame = +1 Query: 94 KTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 KT +DL +PE+FD RD WP C ++ +RDQ SCGSCWAFGAVEAM+DR+C Sbjct: 97 KTKDLDL--DIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRIC 147 Score = 69.7 bits (163), Expect = 7e-11 Identities = 34/73 (46%), Positives = 42/73 (57%), Gaps = 1/73 (1%) Frame = +2 Query: 497 KMHKKCESGY-DVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKS 673 K KKC S Y D Y +DK +G Y V D + I+ EL +GP+E AF VY D L+Y Sbjct: 232 KCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDG 291 Query: 674 GVYKHTQGDVSAG 712 GVY HT G + G Sbjct: 292 GVYVHTGGKLGGG 304 >UniRef50_Q23FP9 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 340 Score = 102 bits (244), Expect = 1e-20 Identities = 44/106 (41%), Positives = 55/106 (51%), Gaps = 1/106 (0%) Frame = +3 Query: 216 FRCRRSYDRQSMYYSNGTKHFHFSAEDLLSCCP-ICGLGCSGGMPRLAWEYWKHFGLVSG 392 F ++ + SN T S+EDLL CC CG+GC GG P AW Y K G+ +G Sbjct: 119 FAATETFSDRICIASNQTLQTSISSEDLLECCADYCGMGCKGGYPSAAWGYMKRQGVSTG 178 Query: 393 GSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCTKNANLDTT 530 G Y C+PY PPC+HHV G PC TP+C K N + T Sbjct: 179 GLYGDDTSCKPYIFPPCDHHVTGQYQPCGPIQPTPQCVKECNSEYT 224 Score = 70.5 bits (165), Expect = 4e-11 Identities = 28/68 (41%), Positives = 43/68 (63%), Gaps = 1/68 (1%) Frame = +1 Query: 52 MGVIKDEHFATLPIKTHKIDLIAS-LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 228 +G + + + LP K + A +PE FD R++WP+C ++ +RDQ +CGSCWAF A Sbjct: 63 LGSLDEPDWVKLPTKEFDPNANADPIPEFFDAREQWPNCQSIKLIRDQSTCGSCWAFAAT 122 Query: 229 EAMTDRVC 252 E +DR+C Sbjct: 123 ETFSDRIC 130 Score = 52.8 bits (121), Expect = 8e-06 Identities = 23/60 
(38%), Positives = 37/60 (61%), Gaps = 1/60 (1%) Frame = +2 Query: 506 KKCESGYDVN-YKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVY 682 K+C S Y N Y++D + Y++ + I+ E+ +GPV+ +F V +D L+YKSGVY Sbjct: 217 KECNSEYTQNTYEKDLHFASQTYSIKQNVQAIQREIMAHGPVQASFKVAADFLTYKSGVY 276 >UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=1; Nilaparvata lugens|Rep: Cathepsin B-like protease precursor - Nilaparvata lugens (Brown planthopper) Length = 347 Score = 101 bits (242), Expect = 2e-20 Identities = 45/110 (40%), Positives = 58/110 (52%), Gaps = 2/110 (1%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIP 437 SN + H S+ +L+SCC CG GC GG P AW + K GLV+GG Y+S GC+PY I Sbjct: 137 SNAKWNGHISSRELMSCCSYCGFGCEGGFPDAAWVFIKRHGLVTGGDYHSHDGCQPYPIA 196 Query: 438 PCEHHVPGNRMPCSGD--TKTPKCTKNANLDTTLITNKTNNTENMYILCP 581 PCEHH+ G++ CS TP C ++L K L P Sbjct: 197 PCEHHMEGSKPNCSASPTEPTPACETTCTHGSSLAYQKDRQKGKSAYLVP 246 Score = 72.1 bits (169), Expect = 1e-11 Identities = 34/89 (38%), Positives = 53/89 (59%), Gaps = 5/89 (5%) Frame = +1 Query: 1 AGRNFPRDTSFAHLKKIMGVIK-DEHFATLP----IKTHKIDLIASLPENFDPRDKWPDC 165 AG NF DT ++L+ ++GV + + + A L ++ ++ + +P+ FD R KW C Sbjct: 46 AGHNFHPDTPMSYLQGLLGVSELESNLADLDKYEEMEENEENKKIKVPKYFDARKKWKKC 105 Query: 166 PTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 +L E+RDQG+CGSCWA A DR+C Sbjct: 106 KSLREIRDQGNCGSCWAVSVAAAFADRLC 134 Score = 64.1 bits (149), Expect = 3e-09 Identities = 28/58 (48%), Positives = 35/58 (60%) Frame = +2 Query: 512 CESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYK 685 C G + Y++D+Q GK Y V E + E+FKNGP+ AF VY D YKSGVYK Sbjct: 224 CTHGSSLAYQKDRQKGKSAYLVPVGEKQTQLEIFKNGPIVAAFKVYEDFFMYKSGVYK 281 >UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome 5 SCAF15026, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 351 Score = 97.9 bits (233), Expect = 2e-19 Identities = 40/68 (58%), Positives = 51/68 (75%) Frame = +2 Query: 509 
KCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKH 688 +CE+GY +YKQDK +GK Y+VS +ED I+ E++KNGPVEGAFTVY D + YKSGVY+H Sbjct: 230 RCEAGYSPSYKQDKHFGKTSYSVSSEEDEIKQEIYKNGPVEGAFTVYEDFVLYKSGVYQH 289 Query: 689 TQGDVSAG 712 G G Sbjct: 290 VSGSALGG 297 Score = 96.7 bits (230), Expect = 5e-19 Identities = 49/105 (46%), Positives = 59/105 (56%), Gaps = 22/105 (20%) Frame = +3 Query: 255 YSNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNS--------- 407 +SN SA+DLL+CC CG+GC+GG P AW +W GLVSGG Y+S Sbjct: 123 HSNAKVSVELSAQDLLTCCNSCGMGCNGGYPSSAWNFWVSDGLVSGGLYDSHIGRIQVSL 182 Query: 408 ------------SQGCRPYEIPPCEHHVPGNRMPCSGD-TKTPKC 503 S GCRPY IPPCEHHV G+R CSG+ TP+C Sbjct: 183 CVLLLAVDRDFVSPGCRPYTIPPCEHHVNGSRPSCSGEGGDTPEC 227 Score = 92.7 bits (220), Expect = 8e-18 Identities = 43/84 (51%), Positives = 59/84 (70%) Frame = +1 Query: 1 AGRNFPRDTSFAHLKKIMGVIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNE 180 AG NF + ++++KK+ G + L I+ + D+ LP+ FD R++WP+CPTL E Sbjct: 42 AGHNF-HNVDYSYVKKLCGTLLKGPKLPLMIR-YAGDI--KLPKEFDSREQWPNCPTLKE 97 Query: 181 VRDQGSCGSCWAFGAVEAMTDRVC 252 +RDQGSCGSCWAFGA EAM+DRVC Sbjct: 98 IRDQGSCGSCWAFGASEAMSDRVC 121 >UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n=4; Tenebrionidae|Rep: Putative cathepsin B-like proteinase - Tenebrio molitor (Yellow mealworm) Length = 321 Score = 95.9 bits (228), Expect = 9e-19 Identities = 44/100 (44%), Positives = 65/100 (65%), Gaps = 3/100 (3%) Frame = +1 Query: 1 AGRNFPRDTSFAHLKKIMGVI---KDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPT 171 AGRNFP +T+ +L K+ G I D ++ P+ H + +PE+FD R KWP+C + Sbjct: 41 AGRNFPENTTNEYLYKLNGFIGLHPDPNYKP-PVLVHTFNA-RDVPESFDARTKWPNCDS 98 Query: 172 LNEVRDQGSCGSCWAFGAVEAMTDRVCTILTELNIFIFLP 291 LN +RDQG+CGSCWAF ++E+M+DR+C + F+F P Sbjct: 99 LNRIRDQGACGSCWAFASIESMSDRICIHSSGSAQFMFSP 138 Score = 68.5 bits (160), Expect = 2e-10 Identities = 30/58 (51%), Positives = 40/58 (68%) Frame = +3 Query: 255 YSNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPY 428 +S+G+ F FS EDLLSCC CG C GG A +++ + G+VSGG NS++GCRPY 
Sbjct: 127 HSSGSAQFMFSPEDLLSCCTSCG-DCGGGYMMSALDFYINEGIVSGGDVNSNEGCRPY 183 Score = 66.9 bits (156), Expect = 5e-10 Identities = 28/65 (43%), Positives = 38/65 (58%) Frame = +2 Query: 506 KKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYK 685 K C +GY +Y DK YG + Y VS D I+ E+ NGP+ F V+ D +Y SGVY+ Sbjct: 198 KSCRNGYSTSYSADKHYGSNDYVVSSVIDQIQYEVMTNGPIIVNFEVFQDFYNYVSGVYR 257 Query: 686 HTQGD 700 H G+ Sbjct: 258 HVSGE 262 >UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase precursor; n=28; Bilateria|Rep: Cathepsin B-like cysteine proteinase precursor - Schistosoma japonicum (Blood fluke) Length = 342 Score = 95.1 bits (226), Expect = 2e-18 Identities = 40/83 (48%), Positives = 49/83 (59%), Gaps = 1/83 (1%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIP 437 S G + SA DL+SCC CG GC GG P +AW+YW G+V+GGS + GC+PY P Sbjct: 135 SGGGQSAELSALDLISCCKDCGDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFP 194 Query: 438 PCEHHVPGNRMPCSGDT-KTPKC 503 CEHH G C KTP+C Sbjct: 195 KCEHHTKGKYPACGTKIYKTPQC 217 Score = 80.6 bits (190), Expect = 4e-14 Identities = 30/48 (62%), Positives = 37/48 (77%) Frame = +1 Query: 109 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 DL +P FD R KWP C +++++RDQ CGSCWAFGAVEAMTDR+C Sbjct: 85 DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRIC 132 Score = 76.6 bits (180), Expect = 6e-13 Identities = 31/67 (46%), Positives = 41/67 (61%) Frame = +2 Query: 512 CESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHT 691 C+ GY Y+QDK YG Y V +E I+ ++ GPVE AF VY D L+YKSG+Y+H Sbjct: 221 CQKGYKTPYEQDKHYGDESYNVQNNEKVIQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHV 280 Query: 692 QGDVSAG 712 G + G Sbjct: 281 TGSIVGG 287 >UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase precursor; n=29; Schistosomatidae|Rep: Cathepsin B-like cysteine proteinase precursor - Schistosoma mansoni (Blood fluke) Length = 340 Score = 93.1 bits (221), Expect = 6e-18 Identities = 39/88 (44%), Positives = 49/88 (55%), Gaps = 1/88 (1%) Frame = +3 Query: 243 
QSMYYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCR 422 +S S G ++ SA DLL+CC CGLGC GG+ AW+YW G+V+ S + GC Sbjct: 129 RSCIQSGGKQNVELSAVDLLTCCESCGLGCEGGILGPAWDYWVKEGIVTASSKENHTGCE 188 Query: 423 PYEIPPCEHHVPGNRMPCSGDT-KTPKC 503 PY P CEHH G PC TP+C Sbjct: 189 PYPFPKCEHHTKGKYPPCGSKIYNTPRC 216 Score = 78.2 bits (184), Expect = 2e-13 Identities = 34/67 (50%), Positives = 41/67 (61%) Frame = +2 Query: 512 CESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHT 691 C+ Y Y QDK GK Y V DE I+ E+ K GPVE +FTVY D L+YKSG+YKH Sbjct: 220 CQRKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHI 279 Query: 692 QGDVSAG 712 G+ G Sbjct: 280 TGEALGG 286 Score = 73.3 bits (172), Expect = 5e-12 Identities = 28/48 (58%), Positives = 34/48 (70%) Frame = +1 Query: 109 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 D +P NFD R KWP C ++ +RDQ CGSCW+FGAVEAM+DR C Sbjct: 84 DWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCWSFGAVEAMSDRSC 131 >UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n=8; Strongylida|Rep: Cathepsin B-like cysteine protease 2 - Parelaphostrongylus tenuis Length = 344 Score = 91.9 bits (218), Expect = 1e-17 Identities = 37/65 (56%), Positives = 44/65 (67%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIP 437 S+G K SA+D+LSCC CG GC GG P AWEY+ G+V+GG Y + CRPYEIP Sbjct: 139 SHGNKTVELSADDILSCCYDCGDGCDGGYPISAWEYFVETGVVTGGLYGTKDSCRPYEIP 198 Query: 438 PCEHH 452 PC HH Sbjct: 199 PCGHH 203 Score = 74.5 bits (175), Expect = 2e-12 Identities = 26/43 (60%), Positives = 36/43 (83%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 +P++FD R +WP CP+++ +RDQ CGSCWAFG+ EAM+DRVC Sbjct: 94 IPDSFDARVQWPHCPSISYIRDQSQCGSCWAFGSAEAMSDRVC 136 Score = 67.3 bits (157), Expect = 4e-10 Identities = 27/67 (40%), Positives = 36/67 (53%) Frame = +2 Query: 512 CESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHT 691 C++GY ++Y DK +GK YT+ I+ E+ GPV AF VY D Y G+YKH Sbjct: 225 
CQAGYPISYDDDKTFGKDSYTIESSVTAIQKEIMTYGPVTAAFIVYEDFFHYHRGIYKHV 284 Query: 692 QGDVSAG 712 G G Sbjct: 285 SGGEEGG 291 >UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep: Cathepsin B - Pandalus borealis (Northern red shrimp) Length = 328 Score = 91.5 bits (217), Expect = 2e-17 Identities = 36/87 (41%), Positives = 52/87 (59%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIP 437 + G F FS+E++ +CC CG C GG A+ +W G VSGG +NS++GC+PY + Sbjct: 121 TEGLVDFRFSSENVAACCTECGNACYGGDEDTAFTHWVTKGFVSGGRHNSNEGCQPYSVE 180 Query: 438 PCEHHVPGNRMPCSGDTKTPKCTKNAN 518 CEHH+ G R PC GD C++ + Sbjct: 181 ECEHHIEGPRPPCEGDMPELVCSETCH 207 Score = 88.6 bits (210), Expect = 1e-16 Identities = 39/84 (46%), Positives = 51/84 (60%) Frame = +1 Query: 1 AGRNFPRDTSFAHLKKIMGVIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNE 180 AGRNF +D S LK + V K+ LP+K + +P FD R++WP CP ++E Sbjct: 37 AGRNFAKDISKDFLKSLNCVRKNPDIPKLPLKN--VTPTKEIPVEFDAREQWPHCPCIDE 94 Query: 181 VRDQGSCGSCWAFGAVEAMTDRVC 252 +RDQG+CGSCWA A MTDR C Sbjct: 95 IRDQGNCGSCWAVSAASVMTDRTC 118 Score = 68.1 bits (159), Expect = 2e-10 Identities = 29/62 (46%), Positives = 36/62 (58%) Frame = +2 Query: 512 CESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHT 691 C Y Y++D +YG Y + D I+ E+ NGPV AF VY D LSYKSGVY+H Sbjct: 206 CHEEYGKTYEEDLEYGLEAYVLPQDVTQIQEEIMTNGPVTAAFAVYDDFLSYKSGVYQHE 265 Query: 692 QG 697 G Sbjct: 266 TG 267 >UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1 precursor; n=3; Haemonchidae|Rep: Cathepsin B-like cysteine proteinase 1 precursor - Ostertagia ostertagi Length = 341 Score = 86.2 bits (204), Expect = 7e-16 Identities = 41/91 (45%), Positives = 51/91 (56%), Gaps = 3/91 (3%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIP 437 S G K SA+D++SCC CG GC GG P A+ + G+V+GG YN+ CRPYEI Sbjct: 136 SKGAKQVLISAQDVVSCCTWCGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPYEIH 195 Query: 438 PCEHHVPGNRM---PCSGDTKTPKCTKNANL 521 PC HH GN C G TP+C + L Sbjct: 196 
PCGHH--GNETYYGECVGMADTPRCKRRCLL 224 Score = 61.7 bits (143), Expect = 2e-08 Identities = 21/43 (48%), Positives = 32/43 (74%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 +PE++DPR +W +C +L + DQ +CGSCWA + AM+DR+C Sbjct: 91 IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRIC 133 Score = 59.3 bits (137), Expect = 9e-08 Identities = 25/64 (39%), Positives = 36/64 (56%) Frame = +2 Query: 506 KKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYK 685 ++C GY +Y D+ Y K Y + I+ ++ KNGPV +TVY D Y+SG+YK Sbjct: 220 RRCLLGYPKSYPSDRYY-KKAYQLKNSVKAIQKDIMKNGPVVATYTVYEDFAHYRSGIYK 278 Query: 686 HTQG 697 H G Sbjct: 279 HKAG 282 >UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma japonicum|Rep: SJCHGC02853 protein - Schistosoma japonicum (Blood fluke) Length = 181 Score = 85.0 bits (201), Expect = 2e-15 Identities = 40/81 (49%), Positives = 52/81 (64%), Gaps = 3/81 (3%) Frame = +1 Query: 19 RDTSFAHLKKIMGVIK---DEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRD 189 R TS H K +MGV+ D+H PI H D+ LP+ FD R W +C ++ +RD Sbjct: 45 RFTSIHHAKSMMGVLLNSVDQHKLHHPIIHHN-DINIKLPKYFDSRKYWKNCSSIRTIRD 103 Query: 190 QGSCGSCWAFGAVEAMTDRVC 252 Q SCGSCWAFGAVE+M+DR+C Sbjct: 104 QSSCGSCWAFGAVESMSDRIC 124 >UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8.4; n=1; Caenorhabditis elegans|Rep: Putative uncharacterized protein W07B8.4 - Caenorhabditis elegans Length = 335 Score = 85.0 bits (201), Expect = 2e-15 Identities = 42/87 (48%), Positives = 50/87 (57%), Gaps = 5/87 (5%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCP---ICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPY 428 SNG + SAED+L+CC CG GC GG P AW YW GLV+GGS+ S GC+PY Sbjct: 118 SNGDVNTLLSAEDILTCCTGKFNCGDGCEGGYPIQAWRYWVKNGLVTGGSFESQYGCKPY 177 Query: 429 EIPPCEHHVPGNRMP-CSGD-TKTPKC 503 I PC + G P C + TPKC Sbjct: 178 SIAPCGETIDGVTWPECPMKISDTPKC 204 Score = 72.1 bits (169), Expect = 1e-11 Identities = 31/70 (44%), Positives = 45/70 (64%), Gaps = 1/70 (1%) Frame = +1 Query: 46 KIMGVIKDEHFATLPIKTHKIDLIA-SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFG 
222 ++ ++K EH A K K+ A S+P+++D RD WP C ++N +RDQ CGSCWA Sbjct: 46 EVKNLMKVEHVAAHLDKDIKLAETADSIPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVA 105 Query: 223 AVEAMTDRVC 252 A EA++DR C Sbjct: 106 AAEAISDRTC 115 Score = 58.8 bits (136), Expect = 1e-07 Identities = 25/70 (35%), Positives = 34/70 (48%) Frame = +2 Query: 503 HKKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVY 682 H + Y + Y QDK +G Y + I+ E+ +GPVE F VY D YK+G+Y Sbjct: 207 HCTGNNSYPIPYDQDKHFGASAYAIGRSAKQIQTEILAHGPVEVGFIVYEDFYLYKTGIY 266 Query: 683 KHTQGDVSAG 712 H G G Sbjct: 267 THVAGGELGG 276 >UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8.1; n=1; Caenorhabditis elegans|Rep: Putative uncharacterized protein W07B8.1 - Caenorhabditis elegans Length = 335 Score = 84.6 bits (200), Expect = 2e-15 Identities = 43/89 (48%), Positives = 51/89 (57%), Gaps = 5/89 (5%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCP---ICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPY 428 S G K+ SAE+LLSCC CG GC GG P AW+Y + G+ +GGSY S GC+PY Sbjct: 121 SGGFKNTILSAEELLSCCTGMFSCGEGCEGGNPFKAWQYIQKHGIPTGGSYESQFGCKPY 180 Query: 429 EIPPCEHHVPGNRMP-CSGDTK-TPKCTK 509 IPPC V P C+ T TP C K Sbjct: 181 SIPPCGKTVGNVTYPACTNTTSPTPSCEK 209 Score = 57.2 bits (132), Expect = 4e-07 Identities = 24/67 (35%), Positives = 39/67 (58%), Gaps = 2/67 (2%) Frame = +2 Query: 506 KKCES--GYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGV 679 KKC S GY ++ +D+ YG V + + I++++ NGP++ F VY D L Y +G+ Sbjct: 209 KKCTSRIGYPIDIDKDRHYGVSVDQLPNSQIEIQSDVMLNGPIQATFEVYDDFLQYTTGI 268 Query: 680 YKHTQGD 700 Y H G+ Sbjct: 269 YVHLTGN 275 Score = 54.4 bits (125), Expect = 3e-06 Identities = 18/45 (40%), Positives = 31/45 (68%) Frame = +1 Query: 118 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 + L +FD R++WP+C ++ ++ D C + WAF A E+M+DR+C Sbjct: 74 SDLSPSFDARERWPECMSIPQINDISECKTSWAFAAAESMSDRLC 118 >UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA; n=1; Tribolium castaneum|Rep: PREDICTED: similar to CG10992-PA - Tribolium castaneum Length = 325 Score = 83.4 bits (197), 
Expect = 5e-15 Identities = 36/81 (44%), Positives = 53/81 (65%), Gaps = 1/81 (1%) Frame = +1 Query: 52 MGVIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCP-TLNEVRDQGSCGSCWAFGAV 228 +G+ D ++ + K HKI I S+PE+FD R+KWP+C + ++R+QG+CGSCWAF + Sbjct: 53 LGLHPDPNYK-IQTKQHKISRIISIPESFDAREKWPECKDVIGKIRNQGNCGSCWAFAST 111 Query: 229 EAMTDRVCTILTELNIFIFLP 291 E MTDR+C F+F P Sbjct: 112 EVMTDRLCISSKGKIKFVFSP 132 Score = 76.6 bits (180), Expect = 6e-13 Identities = 31/57 (54%), Positives = 40/57 (70%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPY 428 S G F FS E+LL+CC CG GC GG + AW+Y+ + G+ SGG YNSS+GC+PY Sbjct: 122 SKGKIKFVFSPENLLTCCKDCGCGCKGGYIKNAWDYYINEGIASGGDYNSSEGCQPY 178 Score = 37.9 bits (84), Expect = 0.25 Identities = 16/43 (37%), Positives = 24/43 (55%) Frame = +2 Query: 569 YTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQG 697 YT+ + I+ E+ NGPV + V+ D +KSGVY + G Sbjct: 195 YTLETNVAQIQMEILTNGPVMAYYNVFEDFACHKSGVYYYKSG 237 >UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep: Cathepsin B - Apriona germari Length = 324 Score = 83.4 bits (197), Expect = 5e-15 Identities = 35/82 (42%), Positives = 55/82 (67%) Frame = +1 Query: 40 LKKIMGVIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 219 L ++G+ +D + TLP+ H + I+ +P++FD R++WP C ++ +RD+G+CGSCWAF Sbjct: 59 LADVIGINRDPN-VTLPVVFH--EAISGIPDSFDAREQWPFCESIRTIRDEGACGSCWAF 115 Query: 220 GAVEAMTDRVCTILTELNIFIF 285 AVE M+DR+C FIF Sbjct: 116 AAVEVMSDRLCLASEGRKKFIF 137 Score = 70.5 bits (165), Expect = 4e-11 Identities = 29/57 (50%), Positives = 36/57 (63%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPY 428 S G K F FSAE+++SCC CG GC GG ++YW G+ SGG Y S GC+PY Sbjct: 129 SEGRKKFIFSAEEVVSCCTACGGGCRGGFLNEPYKYWVTNGIPSGGDYGSKLGCKPY 185 Score = 62.5 bits (145), Expect = 1e-08 Identities = 27/75 (36%), Positives = 41/75 (54%) Frame = +2 Query: 488 EDSKMHKKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSY 667 E + K C SGY+ ++++D ++ Y V+G I+ E+ NGPV VY D SY Sbjct: 192 
ETPQCQKACVSGYEKSWEKDLRHATSAYQVNGGVLQIQREILDNGPVTAYMEVYEDFYSY 251 Query: 668 KSGVYKHTQGDVSAG 712 +G+Y+HT G G Sbjct: 252 GTGIYQHTSGSFVGG 266 >UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2 precursor; n=8; Haemonchus contortus|Rep: Cathepsin B-like cysteine proteinase 2 precursor - Haemonchus contortus (Barber pole worm) Length = 342 Score = 83.0 bits (196), Expect = 7e-15 Identities = 40/88 (45%), Positives = 50/88 (56%), Gaps = 4/88 (4%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCC-PICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEI 434 S K + SA D+++CC P CG GC GG P AW+Y+ + G+VSGG Y + CRPY I Sbjct: 131 SKAEKQVNISATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPI 190 Query: 435 PPCEHHVPGNRM---PCSGDTKTPKCTK 509 PC HH GN C G TP C + Sbjct: 191 HPCGHH--GNDTYYGECRGTAPTPPCKR 216 Score = 72.5 bits (170), Expect = 9e-12 Identities = 31/66 (46%), Positives = 41/66 (62%) Frame = +2 Query: 506 KKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYK 685 +KC G Y+ DK+YGK Y V I++E+ KNGPV +F VY D YKSG+YK Sbjct: 216 RKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKNGPVVASFAVYEDFRHYKSGIYK 275 Query: 686 HTQGDV 703 HT G++ Sbjct: 276 HTAGEL 281 Score = 60.1 bits (139), Expect = 5e-08 Identities = 27/70 (38%), Positives = 39/70 (55%) Frame = +1 Query: 43 KKIMGVIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFG 222 +KIM + L +K D +P ++DPRD W +C T +RDQ +CGSCWA Sbjct: 61 QKIMSIKYKHQKLNLMVKEDP-DPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVS 118 Query: 223 AVEAMTDRVC 252 A++DR+C Sbjct: 119 TAAAISDRIC 128 >UniRef50_Q237A1 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 346 Score = 82.2 bits (194), Expect = 1e-14 Identities = 33/83 (39%), Positives = 50/83 (60%), Gaps = 1/83 (1%) Frame = +3 Query: 285 SAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGN 464 S ++LL+CC CG GC GG P A +Y+ + GLV+G Y ++ C+ Y PC HHV + Sbjct: 146 
STQNLLTCCAACGDGCDGGWPEAAMDYYVNTGLVTGDLYGNNSWCQAYTFAPCAHHVTSD 205 Query: 465 -RMPCSGDTKTPKCTKNANLDTT 530 PC+G+ TP C + + ++T Sbjct: 206 IYPPCTGELPTPPCINSCDSNST 228 Score = 69.3 bits (162), Expect = 9e-11 Identities = 30/65 (46%), Positives = 41/65 (63%) Frame = +2 Query: 518 SGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQG 697 S + + Y +D G Y ++ DE I AE++KNGP+E A TVY D L+YK+GVY+H G Sbjct: 227 STHTIPYSKDIHRGSKAYGIAKDEKAIMAEIYKNGPIEVALTVYEDFLTYKTGVYQHVTG 286 Query: 698 DVSAG 712 D G Sbjct: 287 DELGG 291 Score = 66.1 bits (154), Expect = 8e-10 Identities = 28/44 (63%), Positives = 34/44 (77%), Gaps = 1/44 (2%) Frame = +1 Query: 124 LPENFDPRDKWPD-CPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 LPE FD R +W D C +L EVRDQ +CGSCWAFGA E+++DR C Sbjct: 93 LPEEFDARVQWGDKCSSLWEVRDQSTCGSCWAFGAAESLSDRHC 136 >UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7; Rhabditida|Rep: Cysteine proteinase 3 - Necator americanus (Human hookworm) Length = 360 Score = 81.8 bits (193), Expect = 2e-14 Identities = 38/85 (44%), Positives = 48/85 (56%), Gaps = 1/85 (1%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIP 437 SNGT S D+L+CCP CG GC GG AWEY+K+ G+ +GG Y + C+PY Sbjct: 135 SNGTIKVLLSDTDILACCPNCGAGCGGGHTIRAWEYFKNTGVCTGGLYGTKDSCKPYAFY 194 Query: 438 PCEHHVPGNRMPCSGDT-KTPKCTK 509 PC+ G C D+ TPKC K Sbjct: 195 PCKDESYGK---CPKDSFPTPKCRK 216 Score = 69.3 bits (162), Expect = 9e-11 Identities = 25/54 (46%), Positives = 35/54 (64%) Frame = +1 Query: 91 IKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 +K +D +P +FD RDKWP C ++ +RDQ CGSCWA + E M+DR+C Sbjct: 79 LKEEDMDFSEEIPVSFDARDKWPKCTSIGFIRDQSHCGSCWAVSSAETMSDRLC 132 Score = 54.8 bits (126), Expect = 2e-06 Identities = 24/67 (35%), Positives = 34/67 (50%) Frame = +2 Query: 497 KMHKKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSG 676 K K C+ Y Y DK Y Y + +E I+ E+ +NGPV +F +Y D Y+ G Sbjct: 213 KCRKICQYKYSKKYADDKYYANSAYRIPQNETWIKLEIMRNGPVTASFRIYPDFGFYEKG 272 Query: 677 VYKHTQG 697 VY + G Sbjct: 273 VYVTSGG 
279 >UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep: Cathepsin b - Aedes aegypti (Yellowfever mosquito) Length = 386 Score = 81.4 bits (192), Expect = 2e-14 Identities = 40/83 (48%), Positives = 46/83 (55%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIP 437 S G + F F + DLLSCC CG GC GG AW++W GL SGG NS QGC PY I Sbjct: 170 SKGKEQFIFGSLDLLSCCHSCGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIG 229 Query: 438 PCEHHVPGNRMPCSGDTKTPKCT 506 C +PG D TPKC+ Sbjct: 230 EC--RIPGE------DEDTPKCS 244 Score = 79.0 bits (186), Expect = 1e-13 Identities = 32/54 (59%), Positives = 37/54 (68%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTILTELNIFIF 285 LP+ FD R+KWP+CP+L E+RDQG CGSCWA A AMTDR C FIF Sbjct: 125 LPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIF 178 Score = 77.0 bits (181), Expect = 4e-13 Identities = 37/77 (48%), Positives = 50/77 (64%), Gaps = 2/77 (2%) Frame = +2 Query: 488 EDS-KMHKKCESGYDV-NYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLL 661 ED+ K KC SGY+V + QD+ YG+ Y++ DE I E+F NGPV+ AF Y DL Sbjct: 238 EDTPKCSNKCRSGYNVTDVWQDRHYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLH 297 Query: 662 SYKSGVYKHTQGDVSAG 712 +YKSG+Y+H G +S G Sbjct: 298 AYKSGIYRHVWGPLSGG 314 >UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4 precursor; n=5; Caenorhabditis|Rep: Cathepsin B-like cysteine proteinase 4 precursor - Caenorhabditis elegans Length = 335 Score = 81.4 bits (192), Expect = 2e-14 Identities = 39/84 (46%), Positives = 46/84 (54%), Gaps = 2/84 (2%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIP 437 SNG + SAED+LSCC CG GC GG P AW+Y G +GGSY + GC+PY + Sbjct: 126 SNGAVNTLLSAEDVLSCCSNCGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLA 185 Query: 438 PCEHHVPGNRMP-CSGD-TKTPKC 503 PC V P C D TP C Sbjct: 186 PCGETVGNVTWPSCPDDGYDTPAC 209 Score = 70.1 bits (164), Expect = 5e-11 Identities = 33/82 (40%), Positives = 50/82 (60%), Gaps = 3/82 (3%) Frame = +1 Query: 16 
PRDTSFAHLKKIMGVIKDEHFA--TLPIKTHKIDLIA-SLPENFDPRDKWPDCPTLNEVR 186 P+D + +KK + ++ E A T ++ K D+ ++P FD R +WP+C ++N +R Sbjct: 44 PKDITIEQVKKRL--MRTEFVAPHTPDVEVVKHDINEDTIPATFDARTQWPNCMSINNIR 101 Query: 187 DQGSCGSCWAFGAVEAMTDRVC 252 DQ CGSCWAF A EA +DR C Sbjct: 102 DQSDCGSCWAFAAAEAASDRFC 123 Score = 68.1 bits (159), Expect = 2e-10 Identities = 32/69 (46%), Positives = 39/69 (56%), Gaps = 1/69 (1%) Frame = +2 Query: 509 KCES-GYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYK 685 KC + Y+V Y DK +G Y V I+AE+ +GPVE AFTVY D YK+GVY Sbjct: 212 KCTNKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEAAFTVYEDFYQYKTGVYV 271 Query: 686 HTQGDVSAG 712 HT G G Sbjct: 272 HTTGQELGG 280 >UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_115, whole genome shotgun sequence - Paramecium tetraurelia Length = 332 Score = 81.0 bits (191), Expect = 3e-14 Identities = 49/119 (41%), Positives = 59/119 (49%), Gaps = 13/119 (10%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCPI-CGL----GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCR 422 S T SAEDLLSCC I C L GC GG P AW+Y + G+V+GG+YN C+ Sbjct: 116 SGQTDKRQISAEDLLSCCGINCELDGNGGCDGGYPYGAWKYLRVDGIVTGGTYNDFSLCK 175 Query: 423 PYEIPPCEH-HVPGNRMPCSGD-----TKTPKCTKNAN--LDTTLITNKTNNTENMYIL 575 PY PPC H + G C D TP CTK + T +K + EN Y L Sbjct: 176 PYSFPPCSHGNDSGKYSKCENDFFMLTEVTPSCTKKCHPQFSRTYDVDKIRSRENPYKL 234 Score = 67.3 bits (157), Expect = 4e-10 Identities = 29/69 (42%), Positives = 42/69 (60%) Frame = +1 Query: 46 KIMGVIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGA 225 K VI D H + K H + + +LP +F ++KWP CP++ + DQG+CGSCWA A Sbjct: 48 KYFNVIVD-HSEPVEYKYH--EKLENLPPSFSAQEKWPGCPSIELIPDQGNCGSCWAVSA 104 Query: 226 VEAMTDRVC 252 M+DR+C Sbjct: 105 ASTMSDRLC 113 Score = 60.5 bits (140), Expect = 4e-08 Identities = 27/65 (41%), Positives = 41/65 (63%), Gaps = 1/65 (1%) Frame = +2 Query: 506 KKCESGYDVNYKQDK-QYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVY 682 KKC + Y DK + ++ Y + D++ I+ E++ 
NGPV+ FTV+ D L+YKSGVY Sbjct: 210 KKCHPQFSRTYDVDKIRSRENPYKLIKDQEQIKNEIYLNGPVQAVFTVFDDFLNYKSGVY 269 Query: 683 KHTQG 697 + T G Sbjct: 270 QQTTG 274 >UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep: Cathepsin B - Uronema marinum Length = 350 Score = 80.6 bits (190), Expect = 4e-14 Identities = 41/92 (44%), Positives = 48/92 (52%), Gaps = 10/92 (10%) Frame = +3 Query: 285 SAEDLLSCCP---ICGLGCSGGMPRLAWEYWKHFGLVSGGSY-----NSSQGCRPYEIPP 440 S+E+LLSCC CG+GC+GG AW Y+ GLVSG Y NS C+PY PP Sbjct: 140 SSENLLSCCRGTFACGMGCNGGYTAGAWNYYVKTGLVSGNLYTDDNQNSKTECQPYSFPP 199 Query: 441 CEHHVPGNRMPCSG--DTKTPKCTKNANLDTT 530 C HHV G C+ TPKC N T Sbjct: 200 CSHHVQGEYQACTDLPQFNTPKCYTECNSQYT 231 Score = 73.7 bits (173), Expect = 4e-12 Identities = 28/44 (63%), Positives = 37/44 (84%) Frame = +1 Query: 121 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 SLPE+FD R+ +P C +L +VRDQ +CGSCWAFG VEA++DR+C Sbjct: 85 SLPESFDLREAYPKCESLQQVRDQSNCGSCWAFGTVEAISDRIC 128 Score = 64.1 bits (149), Expect = 3e-09 Identities = 30/73 (41%), Positives = 43/73 (58%), Gaps = 1/73 (1%) Frame = +2 Query: 497 KMHKKCESGYDVN-YKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKS 673 K + +C S Y N Y+QD G Y+V E+ I+AE+++ G +F VYSD L+Y S Sbjct: 221 KCYTECNSQYTQNSYEQDLHKGVSSYSVPKSEEQIKAEIYQYGSTTASFNVYSDFLTYSS 280 Query: 674 GVYKHTQGDVSAG 712 GVY++T G G Sbjct: 281 GVYQNTSGSYMGG 293 >UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep: Cathepsin b - Aedes aegypti (Yellowfever mosquito) Length = 332 Score = 79.0 bits (186), Expect = 1e-13 Identities = 37/85 (43%), Positives = 50/85 (58%), Gaps = 1/85 (1%) Frame = +3 Query: 255 YSNGTKHFHFSAEDLLSCCPICGLGCSGG-MPRLAWEYWKHFGLVSGGSYNSSQGCRPYE 431 +S G +AEDL+ CC CG GC+GG + +++YW GLVSG +YNS+ GC+PY Sbjct: 129 HSEGKFDVELAAEDLMGCCKDCGNGCNGGFLDGTSFQYWVDVGLVSGAAYNSTDGCKPYP 188 Query: 432 IPPCEHHVPGNRMPCSGDTKTPKCT 506 PC + G C + KTP CT Sbjct: 189 FKPCLYPFVG----CHPE-KTPSCT 208 Score = 74.5 bits (175), Expect = 2e-12 Identities = 28/74 (37%), Positives = 46/74 (62%) Frame = +1 
Query: 31 FAHLKKIMGVIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSC 210 F + + + G+ + + LP K H + +PE FD R+KWP C +++ +++QG CG+C Sbjct: 54 FENFQNMKGIFESKIGFRLPTKRHDVAYNMDIPEFFDAREKWPYCKSISTIKNQGLCGAC 113 Query: 211 WAFGAVEAMTDRVC 252 WA AV M+DR+C Sbjct: 114 WAVAAVSVMSDRLC 127 Score = 74.1 bits (174), Expect = 3e-12 Identities = 31/62 (50%), Positives = 39/62 (62%) Frame = +2 Query: 512 CESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHT 691 C GYD Y++DK YG Y + DE I+ E+ NGPVE F+VY DL YK+GVY+H Sbjct: 211 CTEGYDGTYRRDKYYGSAAYKLPNDERMIQLEIMTNGPVESGFSVYQDLYLYKTGVYQHV 270 Query: 692 QG 697 G Sbjct: 271 VG 272 >UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Cathepsin B - Fasciola gigantica (Giant liver fluke) Length = 339 Score = 77.4 bits (182), Expect = 3e-13 Identities = 29/65 (44%), Positives = 40/65 (61%) Frame = +3 Query: 255 YSNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEI 434 +SNG +A D LSCC CG GC GG P AW+YW G+V+GG++ + GC+P+ Sbjct: 130 HSNGQMRPRLAAADPLSCCTYCGQGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMF 189 Query: 435 PPCEH 449 C+H Sbjct: 190 TKCDH 194 Score = 76.2 bits (179), Expect = 8e-13 Identities = 37/81 (45%), Positives = 48/81 (59%), Gaps = 3/81 (3%) Frame = +1 Query: 19 RDTSFAHLKKIMGVIKD---EHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRD 189 R ++ H K +G + + E A P H I LPE+FD R +WP C T++E+RD Sbjct: 49 RFSNVDHFKLHLGALSETPEERNALRPTIKHDISK-NDLPESFDARSQWPQCWTISEIRD 107 Query: 190 QGSCGSCWAFGAVEAMTDRVC 252 Q SCGSCWA A AM+DRVC Sbjct: 108 QASCGSCWATAAASAMSDRVC 128 Score = 69.7 bits (163), Expect = 7e-11 Identities = 28/64 (43%), Positives = 39/64 (60%) Frame = +2 Query: 506 KKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYK 685 + C++GY+ Y+QDK YG Y V E +I E+ KNGPVE F ++ D Y+SG+Y Sbjct: 216 RACQTGYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEVTFAIFQDFGVYRSGIYH 275 Query: 686 HTQG 697 H G Sbjct: 276 HVAG 279 >UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8; Leishmania|Rep: Cathepsin B-like protease - Leishmania major Length = 340 Score = 77.4 
bits (182), Expect = 3e-13 Identities = 35/76 (46%), Positives = 46/76 (60%) Frame = +1 Query: 28 SFAHLKKIMGVIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGS 207 S ++K+MGV A P +L LPE FD + WP C T++E+RDQ +CGS Sbjct: 66 SLGEVRKLMGVTDMSTEAVPPRNFSVEELQQDLPEFFDAAEHWPMCLTISEIRDQSNCGS 125 Query: 208 CWAFGAVEAMTDRVCT 255 CWA AVEA++DR CT Sbjct: 126 CWAIAAVEAISDRYCT 141 Score = 70.5 bits (165), Expect = 4e-11 Identities = 33/82 (40%), Positives = 43/82 (52%), Gaps = 2/82 (2%) Frame = +3 Query: 264 GTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPC 443 G S +LLSCC ICGLGC GG+P +AW +W G+ +++ C+PY PC Sbjct: 144 GVPDRRMSTSNLLSCCFICGLGCHGGIPTVAWLWWVWVGI-------ATEDCQPYPFDPC 196 Query: 444 EHHVPGNRMPCSGDT--KTPKC 503 HH + P T TPKC Sbjct: 197 SHHGNSEKYPPCPSTIYDTPKC 218 Score = 55.2 bits (127), Expect = 2e-06 Identities = 29/69 (42%), Positives = 39/69 (56%), Gaps = 1/69 (1%) Frame = +2 Query: 509 KCESGYDVNYKQDKQY-GKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYK 685 KC + + N +Y G Y+V G+++ + EL NGP+E VYSD + YKSGVYK Sbjct: 217 KCNTTCERNEMDLVKYKGSTSYSVKGEKE-LMIELMTNGPLELTMQVYSDFVGYKSGVYK 275 Query: 686 HTQGDVSAG 712 H GD G Sbjct: 276 HVLGDFLGG 284 >UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2; Arthropoda|Rep: Cathepsin B-like cysteine protease - Callosobruchus maculatus (Southern cowpea weevil) (Pulse bruchid) Length = 330 Score = 77.0 bits (181), Expect = 4e-13 Identities = 34/84 (40%), Positives = 49/84 (58%) Frame = +1 Query: 1 AGRNFPRDTSFAHLKKIMGVIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNE 180 AGRNF RDTS ++++++ V + H+ D LPE FD R +W C ++ E Sbjct: 41 AGRNFERDTSLYNIQRLLSVGTINPPSEFETIFHEDDG-KDLPEEFDARKQWSKCESIKE 99 Query: 181 VRDQGSCGSCWAFGAVEAMTDRVC 252 +RDQ CGSCWA + M+DR+C Sbjct: 100 IRDQSGCGSCWAVSSASVMSDRIC 123 Score = 65.7 bits (153), Expect = 1e-09 Identities = 29/61 (47%), Positives = 40/61 (65%), Gaps = 1/61 (1%) Frame = +2 Query: 506 KKCESGYDVNYKQDKQYGKHVYTV-SGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVY 682 K+C+ G + Y++DK Y K Y + S E I+ E+ KNGPV +FTVY+D + Y SGVY Sbjct: 
205 KECDKGSPLKYEEDKHYAKQAYRIMSKVERQIQLEIIKNGPVVASFTVYADFIHYLSGVY 264 Query: 683 K 685 K Sbjct: 265 K 265 Score = 64.5 bits (150), Expect = 2e-09 Identities = 32/91 (35%), Positives = 42/91 (46%), Gaps = 3/91 (3%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCPICGL---GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPY 428 S+ SA D++ CC C GC GG+P + WK G VSGG YNS+ GC Y Sbjct: 126 SDQKNQLRISAADMIECCESCTFSVDGCHGGIPSFTFTEWKDSGFVSGGEYNSTNGCMSY 185 Query: 429 EIPPCEHHVPGNRMPCSGDTKTPKCTKNANL 521 +P C P + T +C K + L Sbjct: 186 PLPRCN---PSCKTLYDAPTCKKECDKGSPL 213 >UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyledons|Rep: At1g02300/T6A9_10 - Arabidopsis thaliana (Mouse-ear cress) Length = 362 Score = 76.6 bits (180), Expect = 6e-13 Identities = 34/79 (43%), Positives = 48/79 (60%), Gaps = 2/79 (2%) Frame = +1 Query: 22 DTSFAHLKKIMGV--IKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQG 195 + + A K+++GV F +PI +H I L LP+ FD R W C ++ + DQG Sbjct: 72 NATVAEFKRLLGVKPTPKTEFLGVPIVSHDISL--KLPKEFDARTAWSQCTSIGRILDQG 129 Query: 196 SCGSCWAFGAVEAMTDRVC 252 CGSCWAFGAVE+++DR C Sbjct: 130 HCGSCWAFGAVESLSDRFC 148 Score = 73.7 bits (173), Expect = 4e-12 Identities = 35/67 (52%), Positives = 42/67 (62%) Frame = +2 Query: 497 KMHKKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSG 676 K +KC SG + +++ K YG Y V D I AE++KNGPVE AFTVY D YKSG Sbjct: 218 KCARKCVSGNQL-WRESKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSG 276 Query: 677 VYKHTQG 697 VYKH G Sbjct: 277 VYKHITG 283 Score = 56.0 bits (129), Expect = 9e-07 Identities = 32/77 (41%), Positives = 41/77 (53%), Gaps = 2/77 (2%) Frame = +3 Query: 285 SAEDLLSCCP-ICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVP 458 S DLL+CC +CG GC+GG P AW Y+KH G+V ++ C PY + C H P Sbjct: 158 SVNDLLACCGFLCGQGCNGGYPIAAWRYFKHHGVV-------TEECDPYFDNTGCSH--P 208 Query: 459 GNRMPCSGDTKTPKCTK 509 G C TPKC + Sbjct: 209 G----CEPAYPTPKCAR 221 >UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoidea|Rep: Cathepsin B-S precursor - Tuberaphis styraci Length = 349 Score = 76.2 bits (179), 
Expect = 8e-13 Identities = 34/97 (35%), Positives = 51/97 (52%) Frame = +3 Query: 285 SAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGN 464 S E+L CC CG GC GG P AW+Y++ G+ +GG Y++ +GC PY++PPC N Sbjct: 139 SPEELAFCCMDCGKGCGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCYDEQGKN 198 Query: 465 RMPCSGDTKTPKCTKNANLDTTLITNKTNNTENMYIL 575 + +C K TT+ T+N Y++ Sbjct: 199 TCGGKPMERNHQCPKTCYGKTTV--QDRYKTKNEYVI 233 Score = 63.7 bits (148), Expect = 4e-09 Identities = 30/87 (34%), Positives = 48/87 (55%), Gaps = 3/87 (3%) Frame = +1 Query: 1 AGRNFPRDTSFAHLKKIMGVIKDEHFATLPIKTHKIDLIA---SLPENFDPRDKWPDCPT 171 A R FP +TS + ++G +++ T ++ K D + + P+ FD R+ W C Sbjct: 42 AERYFPANTSEEYFIGLLGSRGYKNY-TNEVEIKKYDPLYVENNSPKQFDSRENWKSCKQ 100 Query: 172 LNEVRDQGSCGSCWAFGAVEAMTDRVC 252 + +RDQG+CGSCW+F A DR+C Sbjct: 101 IGHIRDQGNCGSCWSFSTTGAFADRLC 127 Score = 44.4 bits (100), Expect = 0.003 Identities = 23/63 (36%), Positives = 34/63 (53%) Frame = +2 Query: 503 HKKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVY 682 H+ ++ Y QD+ K+ Y ++ E I +L GPVE +F VY D YKSG+Y Sbjct: 209 HQCPKTCYGKTTVQDRYKTKNEYVINSIET-IEQDLMTYGPVEASFDVYDDFSVYKSGIY 267 Query: 683 KHT 691 + T Sbjct: 268 RKT 270 >UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4; Ancylostomatidae|Rep: Cysteine proteinase - Ancylostoma ceylanicum Length = 348 Score = 74.9 bits (176), Expect = 2e-12 Identities = 33/72 (45%), Positives = 45/72 (62%), Gaps = 6/72 (8%) Frame = +1 Query: 61 IKDEHFATLPIKTHKIDLIAS------LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFG 222 I D FA P KT ++A+ +P+ FD RD+WP+C ++ +RDQ SCGSCWA Sbjct: 67 IMDVKFAVDPEKTEPNYVLANTEMKVDIPDTFDARDRWPNCTSMKHIRDQSSCGSCWAVA 126 Query: 223 AVEAMTDRVCTI 258 A AM+DRVC + Sbjct: 127 AASAMSDRVCAL 138 Score = 66.9 bits (156), Expect = 5e-10 Identities = 39/116 (33%), Positives = 53/116 (45%), Gaps = 5/116 (4%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCC-PICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEI 434 +NG + S ++LSCC CG GC GG P A+ Y +GL +GG Y C+PY Sbjct: 139 
TNGRINRILSDTEVLSCCFGSCGFGCKGGYPARAFGYAWRYGLSTGGPYGEKDACQPYAF 198 Query: 435 PPCEHHVPGNRM-PCSGDT-KTPKCTKNANLDTTLITNKTN--NTENMYILCPETK 590 PC +H PC + TP C + L + K N + YI ET+ Sbjct: 199 YPCGNHAHEPYYGPCPDELWPTPTCRRTCQLGYPIPFEKDKIFNDQTYYIFGNETE 254 Score = 60.5 bits (140), Expect = 4e-08 Identities = 24/67 (35%), Positives = 39/67 (58%) Frame = +2 Query: 506 KKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYK 685 + C+ GY + +++DK + Y + G+E I+ E+ GPV + VY D YK GVY Sbjct: 225 RTCQLGYPIPFEKDKIFNDQTYYIFGNETEIKYEIMTRGPVVATYKVYRDFDYYKKGVYI 284 Query: 686 HTQGDVS 706 H +G+V+ Sbjct: 285 HREGEVT 291 >UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3 precursor; n=4; Caenorhabditis|Rep: Cathepsin B-like cysteine proteinase 3 precursor - Caenorhabditis elegans Length = 370 Score = 74.9 bits (176), Expect = 2e-12 Identities = 27/43 (62%), Positives = 35/43 (81%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 LP+ FD R+KWPDC T+ +R+Q +CGSCWAFGA E ++DRVC Sbjct: 92 LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVC 134 Score = 68.9 bits (161), Expect = 1e-10 Identities = 36/91 (39%), Positives = 44/91 (48%), Gaps = 6/91 (6%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCC-PICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEI 434 SNGT+ S ED+LSCC CG GC GG A +W G V+GG Y GC PY Sbjct: 137 SNGTQQPVISVEDILSCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDY-GGHGCMPYSF 195 Query: 435 PPCEHHVPGNRMP-----CSGDTKTPKCTKN 512 PC + P + P C KT + K+ Sbjct: 196 APCTKNCPESTTPSCKTTCQSSYKTEEYKKD 226 Score = 61.3 bits (142), Expect = 2e-08 Identities = 29/70 (41%), Positives = 40/70 (57%), Gaps = 3/70 (4%) Frame = +2 Query: 512 CESGYDVN-YKQDKQYGKHVYTVSGDED--HIRAELFKNGPVEGAFTVYSDLLSYKSGVY 682 C+S Y YK+DK YG Y V+ + I+ E++ GPVE ++ VY D YKSGVY Sbjct: 214 CQSSYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHYKSGVY 273 Query: 683 KHTQGDVSAG 712 +T G + G Sbjct: 274 HYTSGKLVGG 283 >UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8; Trypanosoma|Rep: Cathepsin B-like cysteine protease - Trypanosoma 
brucei Length = 340 Score = 74.1 bits (174), Expect = 3e-12 Identities = 32/74 (43%), Positives = 48/74 (64%), Gaps = 2/74 (2%) Frame = +1 Query: 43 KKIMGVIKDEHFATLPIKTH--KIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWA 216 K++ GVIK + A++ K + + A LP +FD + WP+CPT+ ++ DQ +CGSCWA Sbjct: 65 KRLNGVIKKNNNASILPKRRFTEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWA 124 Query: 217 FGAVEAMTDRVCTI 258 A AM+DR CT+ Sbjct: 125 VAAASAMSDRFCTM 138 Score = 72.1 bits (169), Expect = 1e-11 Identities = 40/96 (41%), Positives = 48/96 (50%), Gaps = 3/96 (3%) Frame = +3 Query: 264 GTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPC 443 G + H SA DLL+CC CG GC+GG P AW Y+ GLVS Y C+PY P C Sbjct: 140 GVQDVHISAGDLLACCSDCGDGCNGGDPDRAWAYFSSTGLVS--DY-----CQPYPFPHC 192 Query: 444 EHHVPGNR--MPCSG-DTKTPKCTKNANLDTTLITN 542 HH PCS + TPKC + T + N Sbjct: 193 SHHSKSKNGYPPCSQFNFDTPKCNYTCDDPTIPVVN 228 Score = 52.4 bits (120), Expect = 1e-05 Identities = 23/48 (47%), Positives = 30/48 (62%) Frame = +2 Query: 569 YTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQGDVSAG 712 Y + G++D++R ELF GP E AF VY D ++Y SGVY H G G Sbjct: 235 YALQGEDDYMR-ELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGG 281 >UniRef50_P91991 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 356 Score = 74.1 bits (174), Expect = 3e-12 Identities = 39/99 (39%), Positives = 54/99 (54%), Gaps = 8/99 (8%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCC----PICG--LGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGC 419 SNGT ++ SA+D LSCC ICG GC G P+ ++W+ GL +GG+YN GC Sbjct: 137 SNGTFNWPLSAQDPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYNDQFGC 196 Query: 420 RPYEIPPCEHHVPG--NRMPCSGDTKTPKCTKNANLDTT 530 +PY I PC+ +PC G TP C ++ + T Sbjct: 197 KPYSIYPCDKKYANGTTSVPCPG-YHTPTCEEHCTSNIT 234 Score = 63.7 bits (148), Expect = 4e-09 Identities = 27/63 (42%), Positives = 36/63 (57%) Frame = +2 Query: 524 YDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQGDV 703 + + YKQDK +GK Y V I+ E+ NGPV +F +Y D YK+G+Y HT GD Sbjct: 235 
WPIAYKQDKHFGKAHYNVGKKMTDIQIEIMTNGPVIASFIIYDDFWDYKTGIYVHTAGDQ 294 Query: 704 SAG 712 G Sbjct: 295 EGG 297 Score = 56.0 bits (129), Expect = 9e-07 Identities = 23/53 (43%), Positives = 30/53 (56%) Frame = +1 Query: 94 KTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 KT +++ +P +FD R KWP C + VRDQ CGS AVE +DR C Sbjct: 82 KTGNDNVLVDIPSSFDSRQKWPSCSQIGAVRDQSDCGSAAHLVAVEIASDRTC 134 >UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gambiae str. PEST|Rep: ENSANGP00000012227 - Anopheles gambiae str. PEST Length = 218 Score = 73.3 bits (172), Expect = 5e-12 Identities = 32/66 (48%), Positives = 42/66 (63%) Frame = +2 Query: 503 HKKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVY 682 + + G D +Y +DK +GK Y+V DE IR E+ NGPVE F VY D+L YKSGVY Sbjct: 94 YNSTDDGVDRHYSKDKLFGKVAYSVPRDERAIRYEIMTNGPVEAGFDVYEDVLLYKSGVY 153 Query: 683 KHTQGD 700 +H G+ Sbjct: 154 RHVYGE 159 Score = 68.1 bits (159), Expect = 2e-10 Identities = 24/43 (55%), Positives = 33/43 (76%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 +PE+FD R+ WP+C +L +R+QG+CGSCWA A M+DRVC Sbjct: 1 IPESFDARNHWPNCESLRAIRNQGTCGSCWAVAAASVMSDRVC 43 Score = 62.9 bits (146), Expect = 8e-09 Identities = 27/53 (50%), Positives = 38/53 (71%), Gaps = 1/53 (1%) Frame = +3 Query: 255 YSNGTKHFHFSAEDLLSCCPICGLGCSGG-MPRLAWEYWKHFGLVSGGSYNSS 410 +SNGT + +AEDL+ CC CG GC+GG + +++YW GLVSGG+YNS+ Sbjct: 45 HSNGTINVALAAEDLMGCCVDCGNGCNGGFLDGTSFQYWVDAGLVSGGAYNST 97 >UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus contortus|Rep: Cysteine proteinase - Haemonchus contortus (Barber pole worm) Length = 350 Score = 72.9 bits (171), Expect = 7e-12 Identities = 36/76 (47%), Positives = 42/76 (55%), Gaps = 3/76 (3%) Frame = +3 Query: 285 SAEDLLSCCP-ICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPG 461 S D+LSCC +CG GC GG LAWE+ + FG+V+GG Y CRPY PC H G Sbjct: 148 SDTDILSCCGRMCGDGCEGGYDHLAWEWVQRFGVVTGGPYQQKGVCRPYAFHPCGLH-HG 206 Query: 462 NRMPCSGD--TKTPKC 503 R C D TP C Sbjct: 207 RRYDCPWDHSFSTPAC 222 Score 
= 66.5 bits (155), Expect = 6e-10 Identities = 27/62 (43%), Positives = 37/62 (59%) Frame = +2 Query: 512 CESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHT 691 C+ GY Y++DK + K Y + DE I+ E+ KNGPV+ AF Y D YK G+Y H Sbjct: 226 CQFGYGKRYEKDKFFVKSTYILDNDEKVIQREMMKNGPVQAAFITYEDFSPYKGGIYVHV 285 Query: 692 QG 697 +G Sbjct: 286 KG 287 Score = 61.7 bits (143), Expect = 2e-08 Identities = 28/67 (41%), Positives = 39/67 (58%), Gaps = 8/67 (11%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC--------TILTELNIF 279 +PE+FD R W +C ++ VRDQ CGSCWA A M+DR+C TIL++ +I Sbjct: 94 IPESFDSRIVWKNCSSITYVRDQSRCGSCWAVSAASTMSDRICVQTKGKLQTILSDTDIL 153 Query: 280 IFLPRIC 300 R+C Sbjct: 154 SCCGRMC 160 >UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator americanus|Rep: Cysteine proteinase 4 - Necator americanus (Human hookworm) Length = 339 Score = 72.5 bits (170), Expect = 9e-12 Identities = 37/87 (42%), Positives = 47/87 (54%), Gaps = 3/87 (3%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCC-PICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEI 434 +NGT S+ D+L+CC CG GC GG P A+ Y ++ G+ SGG Y C+PY Sbjct: 133 TNGTNQKILSSADILACCGEDCGSGCEGGYPIQAYFYLENTGVCSGGEYREKNVCKPYPF 192 Query: 435 PPCEHHVPGNRMPC--SGDTKTPKCTK 509 PC+ GN PC G TPKC K Sbjct: 193 YPCD----GNYGPCPKEGAFDTPKCRK 215 Score = 67.7 bits (158), Expect = 3e-10 Identities = 25/49 (51%), Positives = 33/49 (67%) Frame = +1 Query: 106 IDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 I+L LPE FD R+KWP C ++ +RD +CGSCWA A M+DR+C Sbjct: 82 INLNVELPERFDAREKWPHCASIGLIRDHSACGSCWAVSAASVMSDRLC 130 Score = 63.3 bits (147), Expect = 6e-09 Identities = 30/68 (44%), Positives = 41/68 (60%), Gaps = 1/68 (1%) Frame = +2 Query: 497 KMHKKCESGYDVNYKQDKQYGKHVYTVSGD-EDHIRAELFKNGPVEGAFTVYSDLLSYKS 673 K K C+ Y V Y++DK +GK+ + + D E IR E+F NGPV F V+ D + YK Sbjct: 212 KCRKICQFRYPVPYEEDKVFGKNSHILLQDNEARIRQEIFINGPVGANFYVFEDFIHYKE 271 Query: 674 GVYKHTQG 697 G+YK T G Sbjct: 272 GIYKQTYG 279 >UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris 
suis|Rep: Thiol protease - Trichuris suis Length = 348 Score = 71.7 bits (168), Expect = 2e-11 Identities = 30/66 (45%), Positives = 40/66 (60%) Frame = +2 Query: 506 KKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYK 685 ++C GY +Y D+ YGK Y V I+ E+ KNGPV +F VY D YKSG+YK Sbjct: 223 RRCLLGYPKSYPSDRYYGKSAYIVKQSVKAIQREIMKNGPVVASFAVYEDFRHYKSGIYK 282 Query: 686 HTQGDV 703 HT G++ Sbjct: 283 HTAGEL 288 Score = 60.1 bits (139), Expect = 5e-08 Identities = 25/47 (53%), Positives = 31/47 (65%) Frame = +1 Query: 112 LIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 L S+P +FD R W C +LN +RDQ CGSCWA A E M+DR+C Sbjct: 80 LALSIPPSFDVRSLWHVC-SLNLIRDQAKCGSCWAVSAAETMSDRIC 125 Score = 60.1 bits (139), Expect = 5e-08 Identities = 31/81 (38%), Positives = 42/81 (51%), Gaps = 3/81 (3%) Frame = +3 Query: 285 SAEDLLSCCPI-CGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYE-IPPCEHHVP 458 S D+LSCC + CG GC+GG P AW ++ G +GG GC+PY+ P H+ Sbjct: 137 SDTDILSCCGLYCGYGCNGGFPIEAWRHFTVAGNCTGGKTIDKYGCKPYKPTGPIGRHLK 196 Query: 459 GN-RMPCSGDTKTPKCTKNAN 518 N PC DT +C A+ Sbjct: 197 RNDYAPCPNDTYYGECVGMAD 217 >UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 1 - Rhipicephalus appendiculatus (Brown ear tick) Length = 332 Score = 71.3 bits (167), Expect = 2e-11 Identities = 30/63 (47%), Positives = 43/63 (68%) Frame = +2 Query: 497 KMHKKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSG 676 K C GY+ +Y++DK + K+VY + D I+ +++KNGPVE AF VY+D SYKSG Sbjct: 204 KCQHVCRKGYEKSYEEDKHFAKNVYRLLKKCDAIKTDIYKNGPVESAFFVYADFPSYKSG 263 Query: 677 VYK 685 VY+ Sbjct: 264 VYQ 266 Score = 65.3 bits (152), Expect = 1e-09 Identities = 22/42 (52%), Positives = 32/42 (76%) Frame = +1 Query: 127 PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 PE+F PR+ W C ++ +RDQ +CGSCWAF A E+++DR+C Sbjct: 88 PESFTPREYWSHCSSIRVIRDQSACGSCWAFAAAESISDRIC 129 Score = 54.8 bits (126), Expect = 2e-06 Identities = 33/96 (34%), Positives = 44/96 (45%) Frame = +3 Query: 216 
FRCRRSYDRQSMYYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGG 395 F S + ++NG + SAEDLL+CC CG GC G + + LV Sbjct: 118 FAAAESISDRICIHTNGKVQVNISAEDLLACCHTCGHGCDGRCHCSSVAILQGRRLVP-E 176 Query: 396 SYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKC 503 + GC+PY +PPC VP C+ TPKC Sbjct: 177 PVRTEDGCQPYSLPPC---VPN----CTHPEPTPKC 205 >UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin B-like cysteine proteinase 4 precursor (Cysteine protease-related 4); n=2; Tribolium castaneum|Rep: PREDICTED: similar to Cathepsin B-like cysteine proteinase 4 precursor (Cysteine protease-related 4) - Tribolium castaneum Length = 360 Score = 70.9 bits (166), Expect = 3e-11 Identities = 33/79 (41%), Positives = 40/79 (50%), Gaps = 7/79 (8%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYE-- 431 +NG S EDL+ CC CG C GG AW Y+ GLVSGG YN+S GC+PY Sbjct: 118 TNGKVKIQLSPEDLIDCCHYCGNQCKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQPYSEL 177 Query: 432 -----IPPCEHHVPGNRMP 473 PPC ++ P Sbjct: 178 NYYRITPPCNTTCQNDKYP 196 Score = 59.3 bits (137), Expect = 9e-08 Identities = 22/44 (50%), Positives = 30/44 (68%), Gaps = 1/44 (2%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRVC 252 +PE FD R+ WP+C + +R+QG C S WAF A E M+DR+C Sbjct: 72 IPETFDAREYWPECADIIGNIRNQGKCSSSWAFAAAEVMSDRLC 115 Score = 49.6 bits (113), Expect = 8e-05 Identities = 23/59 (38%), Positives = 32/59 (54%), Gaps = 1/59 (1%) Frame = +2 Query: 524 YDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNG-PVEGAFTVYSDLLSYKSGVYKHTQG 697 Y + Y DK +G +Y + +E I+ E+ G PV AF VY D Y+ GVY +T G Sbjct: 195 YPIPYVSDKHFGDSIYYIPQNETAIQNEILSGGGPVVAAFDVYGDFKIYRDGVYIYTSG 253 >UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7; n=2; Haemonchidae|Rep: Cathepsin B-like cysteine protease GCP7 - Haemonchus contortus (Barber pole worm) Length = 348 Score = 68.9 bits (161), Expect = 1e-10 Identities = 24/43 (55%), Positives = 33/43 (76%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 +PE+FD R+KW DCP+L + DQ +CGSCWA A + M+DR+C Sbjct: 
96 IPESFDSREKWKDCPSLRVIPDQSNCGSCWAVSAAQCMSDRLC 138 Score = 66.1 bits (154), Expect = 8e-10 Identities = 33/85 (38%), Positives = 42/85 (49%), Gaps = 2/85 (2%) Frame = +3 Query: 255 YSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYE 431 +S G K SA D+L+CC CG GC GG AW++ G+V+GG+Y C+PY Sbjct: 140 HSQGRKKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYV 199 Query: 432 IPPCEHHVPGNRMPC-SGDTKTPKC 503 P C H C S TP C Sbjct: 200 FPQCGAHKGKAFNNCPSHPYATPAC 224 Score = 58.0 bits (134), Expect = 2e-07 Identities = 25/67 (37%), Positives = 35/67 (52%) Frame = +2 Query: 512 CESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHT 691 C+ GY Y+ DK + Y + DE I+ E+ + GPV F +Y D Y+ GVY HT Sbjct: 228 CQYGYGKRYENDKIKARTWYWLPNDERTIQLEIMQKGPVHATFNIYEDFEHYEGGVYIHT 287 Query: 692 QGDVSAG 712 G + G Sbjct: 288 AGAMEGG 294 >UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: Cathepsin B - Triticum aestivum (Wheat) Length = 353 Score = 68.5 bits (160), Expect = 2e-10 Identities = 36/87 (41%), Positives = 47/87 (54%), Gaps = 3/87 (3%) Frame = +1 Query: 1 AGRN-FPRDTSFAHLKKIMGVIKDEH--FATLPIKTHKIDLIASLPENFDPRDKWPDCPT 171 AG N + + + K I+GV A +PIK H LP+ FD R +W C T Sbjct: 56 AGHNPYFANYTIEQFKHILGVKPTPPGLLAGVPIKIHPE---MDLPKEFDARTQWSSCST 112 Query: 172 LNEVRDQGSCGSCWAFGAVEAMTDRVC 252 + + DQG CG+CWAF AVEA+ DR C Sbjct: 113 IGNILDQGHCGACWAFAAVEALQDRFC 139 Score = 58.4 bits (135), Expect = 2e-07 Identities = 31/74 (41%), Positives = 41/74 (55%), Gaps = 2/74 (2%) Frame = +2 Query: 497 KMHKKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYS--DLLSYK 670 K +KC+ +K++K + + Y V + I AE++KNGPVE AFT D YK Sbjct: 209 KCQRKCKVENQA-WKENKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTYCQILDFAHYK 267 Query: 671 SGVYKHTQGDVSAG 712 SGVYKH G V G Sbjct: 268 SGVYKHITGGVMGG 281 Score = 53.2 bits (122), Expect = 6e-06 Identities = 32/97 (32%), Positives = 47/97 (48%), Gaps = 2/97 (2%) Frame = +3 Query: 285 SAEDLLSCCP-ICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVP 458 S DLL+CC +CG GC+GG P AW Y++ G+V ++ C PY + C+H P Sbjct: 
149 SVNDLLACCGFLCGSGCNGGYPISAWRYFRRSGVV-------TEECDPYFDQTGCQH--P 199 Query: 459 GNRMPCSGDTKTPKCTKNANLDTTLITNKTNNTENMY 569 G C TPKC + ++ + + N Y Sbjct: 200 G----CEPAYPTPKCQRKCKVENQAWKENKHFSVNAY 232 >UniRef50_Q00W63 Cluster: Cysteine proteinase; n=2; Ostreococcus|Rep: Cysteine proteinase - Ostreococcus tauri Length = 362 Score = 67.7 bits (158), Expect = 3e-10 Identities = 34/77 (44%), Positives = 43/77 (55%), Gaps = 2/77 (2%) Frame = +1 Query: 28 SFAHLKKI-MGVIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTL-NEVRDQGSC 201 SF K MG ++D T K+ LP+ FD R+KWP C L +E DQG+C Sbjct: 55 SFGRRKSARMGSLEDRLAKTWDPTKIKLHAGGRLPDTFDVREKWPKCAALVSEAVDQGAC 114 Query: 202 GSCWAFGAVEAMTDRVC 252 GSCWA +AMTDR+C Sbjct: 115 GSCWAVAPAKAMTDRLC 131 Score = 45.6 bits (103), Expect = 0.001 Identities = 23/64 (35%), Positives = 26/64 (40%) Frame = +3 Query: 327 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 506 GC GG P A+E G+VSGG C PY PC H N T T Sbjct: 170 GCMGGYPTEAYETAHRVGVVSGGLNGDQDTCMPYPFAPCHHPCEPNHNAVCPRTCQRSAT 229 Query: 507 KNAN 518 + AN Sbjct: 230 QTAN 233 >UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep: Cysteine proteinase - Toxoplasma gondii Length = 569 Score = 66.5 bits (155), Expect = 6e-10 Identities = 35/108 (32%), Positives = 54/108 (50%), Gaps = 9/108 (8%) Frame = +3 Query: 216 FRCRRSYDRQSMYYSNGTKHFHFSAEDLLSCCPI---CGLGCSGGMPRLAWEYWKHFGLV 386 F +++ + S G + SA+ SCC GC+GG P +AW +++ G+V Sbjct: 306 FASTEAFNDRLCIRSQGKRLMPLSAQHTTSCCNAIHCASFGCNGGQPGMAWRWFERKGVV 365 Query: 387 SGGSYNS-SQG--CRPYEIPPCEHHVPGNRMPCSG---DTKTPKCTKN 512 +GG +++ +G C PYE+P C HH C KTPKC K+ Sbjct: 366 TGGDFDALGKGTTCWPYEVPFCAHHAKAPFPDCDATLVPRKTPKCRKD 413 Score = 59.3 bits (137), Expect = 9e-08 Identities = 34/87 (39%), Positives = 45/87 (51%), Gaps = 9/87 (10%) Frame = +1 Query: 19 RDTSFAHLKKIMGVI----KDEHFAT---LPIKTHKIDLIAS-LPENFDPRDKWPDCP-T 171 R S KK+MG K E F T +P+ + + +P +FD R +P C Sbjct: 231 RYLSLKDAKKLMGTFLVNTKVEGFPTPKGMPLPAKEFENATEPVPAHFDARTAFPACKDV 290 Query: 172 LNEVRDQGSCGSCWAFGAVEAMTDRVC 252 
+ VRDQG CGSCWAF + EA DR+C Sbjct: 291 VGHVRDQGDCGSCWAFASTEAFNDRLC 317 Score = 57.2 bits (132), Expect = 4e-07 Identities = 30/71 (42%), Positives = 39/71 (54%), Gaps = 4/71 (5%) Frame = +2 Query: 497 KMHKKCES-GYDVN---YKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLS 664 K K CE Y N + QD Y++ +D ++ ++ +GPV GAF VY D LS Sbjct: 409 KCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSRDD-VKRDMMTHGPVSGAFMVYEDFLS 467 Query: 665 YKSGVYKHTQG 697 YKSGVYKH G Sbjct: 468 YKSGVYKHVSG 478 >UniRef50_Q625X3 Cluster: Putative uncharacterized protein CBG01102; n=1; Caenorhabditis briggsae|Rep: Putative uncharacterized protein CBG01102 - Caenorhabditis briggsae Length = 374 Score = 63.3 bits (147), Expect(2) = 1e-09 Identities = 28/62 (45%), Positives = 36/62 (58%), Gaps = 2/62 (3%) Frame = +3 Query: 330 CSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMP-CSGDT-KTPKC 503 C+GG AW+YW+ GL +GGSY S GC+PY I PC+ + P C T +TP C Sbjct: 189 CAGGNVFKAWQYWQKHGLPTGGSYESQFGCKPYSISPCDTVIGNITFPGCLNSTVQTPSC 248 Query: 504 TK 509 K Sbjct: 249 EK 250 Score = 59.3 bits (137), Expect = 9e-08 Identities = 24/65 (36%), Positives = 37/65 (56%) Frame = +2 Query: 506 KKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYK 685 KKC+SGY V +D+ YG V + + I++++ NGP+ VY D L Y +G+Y Sbjct: 250 KKCKSGYPVELDKDRHYGVSVDQLPNRQIEIQSDVMLNGPISATMEVYDDFLQYTTGIYV 309 Query: 686 HTQGD 700 H G+ Sbjct: 310 HLTGN 314 Score = 52.4 bits (120), Expect = 1e-05 Identities = 18/39 (46%), Positives = 27/39 (69%) Frame = +1 Query: 136 FDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 FD R++WP+C ++ + D C S WAF A E+M+DR+C Sbjct: 85 FDARERWPECSSIPIINDISDCKSSWAFSAAESMSDRLC 123 Score = 22.2 bits (45), Expect(2) = 1e-09 Identities = 13/29 (44%), Positives = 16/29 (55%), Gaps = 3/29 (10%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCP---ICGLGCS 335 S G + SA++LLSCC CG G S Sbjct: 126 SGGMINTVLSAQELLSCCTGVFSCGEGDS 154 >UniRef50_Q5DEC9 Cluster: SJCHGC06356 protein; n=1; Schistosoma japonicum|Rep: SJCHGC06356 protein - Schistosoma japonicum (Blood fluke) Length = 279 Score = 64.9 bits 
(151), Expect = 2e-09 Identities = 31/84 (36%), Positives = 43/84 (51%), Gaps = 1/84 (1%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIP 437 SNG SA D +SC GC G YW +G+V+GGSY GC+PY +P Sbjct: 73 SNGRISVQLSARDAISCG--FSPGCFHGSEVEVLVYWITYGIVTGGSYEDQSGCQPYPLP 130 Query: 438 PCEHHVPGNRMPCSGDT-KTPKCT 506 C +H + C+ +T + P+CT Sbjct: 131 KCSYHPESRFLDCNNNTFEFPQCT 154 Score = 64.1 bits (149), Expect = 3e-09 Identities = 27/61 (44%), Positives = 39/61 (63%) Frame = +2 Query: 509 KCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKH 688 +C+ GY+ Y DK YG+ +Y V G ++ I+ E+ NGPV + +V +D L YKSGVY Sbjct: 156 ECQDGYNKTYDDDKFYGERIYNVYGTQEDIQKEILMNGPVIASISVNTDFLVYKSGVYLP 215 Query: 689 T 691 T Sbjct: 216 T 216 Score = 51.6 bits (118), Expect = 2e-05 Identities = 22/65 (33%), Positives = 38/65 (58%), Gaps = 1/65 (1%) Frame = +1 Query: 61 IKDEHFATLPIKTHKIDLI-ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 I+ E+ T IKT + I +P +FD R W +C T+ ++ D+ C + WA V+++ Sbjct: 6 IETENIQTKHIKTISHNSINMEIPRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSI 65 Query: 238 TDRVC 252 +DR+C Sbjct: 66 SDRIC 70 >UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|Rep: Cathepsin B - Streblomastix strix Length = 312 Score = 64.5 bits (150), Expect = 2e-09 Identities = 22/46 (47%), Positives = 31/46 (67%) Frame = +1 Query: 115 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 +A+LP+ FD R WP+C + ++ DQG CGSCWA + E + DR C Sbjct: 73 VANLPDEFDSRTNWPNCQLIGKIYDQGHCGSCWAMSSFEVLQDRFC 118 Score = 51.6 bits (118), Expect = 2e-05 Identities = 21/43 (48%), Positives = 30/43 (69%) Frame = +2 Query: 569 YTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQG 697 Y+V +E I+ E+++NGPV +F VY DL Y+SGVY+H G Sbjct: 209 YSVRSNEADIQKEIYENGPVTASFAVYEDLSVYQSGVYQHVTG 251 >UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1; Ostreococcus tauri|Rep: Cysteine proteinase Cathepsin F - Ostreococcus tauri Length = 498 Score = 63.7 bits (148), Expect = 4e-09 Identities = 27/45 (60%), Positives = 31/45 (68%), Gaps = 1/45 
(2%) Frame = +1 Query: 121 SLPENFDPRDKWPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRVC 252 SLP +FD RD++P C L VRDQG CGSCWA A E M DR+C Sbjct: 256 SLPRHFDARDEYPKCARLIGTVRDQGKCGSCWAVAATEIMNDRLC 300 >UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 421 Score = 62.1 bits (144), Expect = 1e-08 Identities = 22/45 (48%), Positives = 32/45 (71%) Frame = +1 Query: 118 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 + +P+NFD R KWP+CP+++ V +QG CGSC+A A +DR C Sbjct: 136 SDVPKNFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRAC 180 Score = 57.6 bits (133), Expect = 3e-07 Identities = 28/58 (48%), Positives = 34/58 (58%) Frame = +3 Query: 255 YSNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPY 428 +SNGT S ED++ CC +CG C GG P A YW + GLV+GG GCRPY Sbjct: 182 HSNGTFKSLLSEEDIIGCCSVCG-NCYGGDPLKALTYWVNQGLVTGG----RDGCRPY 234 >UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 311 Score = 61.7 bits (143), Expect = 2e-08 Identities = 26/62 (41%), Positives = 43/62 (69%) Frame = +1 Query: 103 KIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTILTELNIFI 282 ++ + ++PENFD R +WP +++ +R+QG CGSCWAFGA E ++DR I ++ I++ Sbjct: 76 EVRVAENIPENFDARKQWPG--SIHPIRNQGQCGSCWAFGASEVLSDRF-AIASKNQIYV 132 Query: 283 FL 288 L Sbjct: 133 TL 134 Score = 44.0 bits (99), Expect = 0.004 Identities = 16/39 (41%), Positives = 24/39 (61%) Frame = +2 Query: 596 IRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQGDVSAG 712 I+ ++ NGPVE FT++ D +Y+SG+Y H G G Sbjct: 218 IQTDIMNNGPVEADFTIFQDFYAYRSGIYVHATGKQLGG 256 >UniRef50_A4RYN2 Cluster: Predicted protein; n=1; Ostreococcus lucimarinus CCE9901|Rep: Predicted protein - Ostreococcus lucimarinus CCE9901 Length = 330 Score = 59.7 bits (138), Expect = 7e-08 Identities = 31/64 (48%), Positives = 36/64 (56%), Gaps = 4/64 (6%) Frame = +1 Query: 73 HFATLPIKTHKIDLIAS---LPENFDPRDKWPDCPTL-NEVRDQGSCGSCWAFGAVEAMT 240 HF T K++L A LP +FD R +P C L 
VRDQG CGSCWA A E M Sbjct: 92 HFLTRLPALGKVELRAKDNRLPTSFDARVAYPKCSRLLGAVRDQGRCGSCWAVAATEVMN 151 Query: 241 DRVC 252 DR+C Sbjct: 152 DRLC 155 >UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 314 Score = 59.3 bits (137), Expect = 9e-08 Identities = 30/82 (36%), Positives = 47/82 (57%) Frame = +1 Query: 7 RNFPRDTSFAHLKKIMGVIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVR 186 +NF T F + +MG K A + + +L S+P +FD R +WPDC ++ + Sbjct: 52 KNFEGKT-FGDIIGMMGTKKTA--APFKLTENGEELKGSIPTSFDSRVQWPDC--IHPIL 106 Query: 187 DQGSCGSCWAFGAVEAMTDRVC 252 +Q CGSCWAF + E ++DR+C Sbjct: 107 NQEQCGSCWAFSSSEVLSDRLC 128 Score = 39.9 bits (89), Expect = 0.061 Identities = 20/49 (40%), Positives = 28/49 (57%) Frame = +3 Query: 237 DRQSMYYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGL 383 DR + +N T S + L++C GCSGG+P+LAWEY + GL Sbjct: 125 DRLCIASNNKTNPGALSPQTLVACDVYGNDGCSGGIPQLAWEYMELKGL 173 Score = 36.3 bits (80), Expect = 0.75 Identities = 16/39 (41%), Positives = 20/39 (51%) Frame = +2 Query: 596 IRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQGDVSAG 712 I+ + GP+ G VY D +SY SGVY T G G Sbjct: 220 IQENILAYGPIVGTMEVYEDFMSYSSGVYVMTPGSSLLG 258 >UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomuscorum|Rep: Cathepsin B - Oxytricha trifallax (Sterkiella histriomuscorum) Length = 294 Score = 58.0 bits (134), Expect = 2e-07 Identities = 24/44 (54%), Positives = 31/44 (70%) Frame = +1 Query: 115 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 246 I ++PENFD R +W ++ +RDQ CGSCWAFGA EA +DR Sbjct: 73 IMTVPENFDARQQWGS--KIHAIRDQQQCGSCWAFGATEAFSDR 114 Score = 52.4 bits (120), Expect = 1e-05 Identities = 22/39 (56%), Positives = 30/39 (76%) Frame = +2 Query: 596 IRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQGDVSAG 712 I++E+ +GPVEGAFTVY+D +Y+SGVY T DV+ G Sbjct: 203 IQSEIVSHGPVEGAFTVYTDFFNYQSGVYTPTTTDVAGG 241 Score = 33.9 bits (74), Expect = 4.0 Identities = 24/75 (32%), Positives = 31/75 (41%), Gaps = 2/75 (2%) Frame = +3 Query: 
261 NGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGG--SYNSSQGCRPYEI 434 NG K S EDL+SC GC+GG +AWEY G + Y++ G P Sbjct: 118 NG-KDVILSPEDLVSC-DTNDYGCNGGYMDVAWEYLADHGAATDSCFPYSAGSGFAPACS 175 Query: 435 PPCEHHVPGNRMPCS 479 C R C+ Sbjct: 176 DKCADGSAMQRFKCA 190 >UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|Rep: Cathepsin B - Streblomastix strix Length = 283 Score = 57.2 bits (132), Expect = 4e-07 Identities = 23/42 (54%), Positives = 29/42 (69%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 249 +P+ FD R+KWPD + VRDQG CGSCWAF E + DR+ Sbjct: 63 VPDTFDAREKWPDA--ILPVRDQGECGSCWAFSIAETIGDRL 102 Score = 52.0 bits (119), Expect = 1e-05 Identities = 22/43 (51%), Positives = 28/43 (65%) Frame = +2 Query: 584 DEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQGDVSAG 712 D D I+ E+++ GPV F VYSD +SYKSGVY H G + G Sbjct: 184 DADDIQGEIYEYGPVSMGFIVYSDFMSYKSGVYVHQAGYIEGG 226 >UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia ATCC 50803|Rep: GLP_113_4299_5381 - Giardia lamblia ATCC 50803 Length = 360 Score = 56.0 bits (129), Expect = 9e-07 Identities = 27/75 (36%), Positives = 42/75 (56%) Frame = +1 Query: 28 SFAHLKKIMGVIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGS 207 S +K + G + D + ++ + + PE++D RD++P C T EV DQG+CGS Sbjct: 109 SLDEVKAMFGPLVDTSRPAITMRRSTTPPVGA-PESYDFRDEYPHCIT--EVVDQGNCGS 165 Query: 208 CWAFGAVEAMTDRVC 252 CWAF +V+ D C Sbjct: 166 CWAFSSVQTFADHRC 180 Score = 35.9 bits (79), Expect = 0.99 Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 1/47 (2%) Frame = +2 Query: 560 KHVYTVSGDEDHIRAE-LFKNGPVEGAFTVYSDLLSYKSGVYKHTQG 697 ++V SG + + L +GPV F V D + YKSGVY+H G Sbjct: 253 ENVVATSGSKSGSAIDVLLAHGPVVATFNVAQDFMYYKSGVYQHRWG 299 >UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Giardia intestinalis|Rep: Cathepsin B-like CP1 precursor - Giardia lamblia (Giardia intestinalis) Length = 303 Score = 56.0 bits (129), Expect = 9e-07 Identities = 25/58 (43%), Positives = 35/58 (60%), Gaps = 1/58 (1%) Frame = +1 Query: 88 
PIKTHKI-DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTI 258 PI ++ +L+ +P FD RD++P C + DQGSCGSCWAF A+ DR C + Sbjct: 66 PISITEVQELVDPIPPQFDFRDEYPQC--VKPALDQGSCGSCWAFSAIGVFGDRRCAM 121 Score = 45.2 bits (102), Expect = 0.002 Identities = 24/67 (35%), Positives = 34/67 (50%) Frame = +2 Query: 512 CESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHT 691 C+ G + + YG+ VS I L GP++ VY+DL Y+SGVYKHT Sbjct: 185 CDDGSPIQLYKAHGYGQ----VSKSVPAIMGMLVAGGPLQTMIVVYADLSYYESGVYKHT 240 Query: 692 QGDVSAG 712 G ++ G Sbjct: 241 YGTINLG 247 >UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Giardia intestinalis|Rep: Cathepsin B-like CP2 precursor - Giardia lamblia (Giardia intestinalis) Length = 300 Score = 54.4 bits (125), Expect = 3e-06 Identities = 23/43 (53%), Positives = 30/43 (69%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 +PE+FD R+++P C + EV DQG CGSCWAF +V DR C Sbjct: 75 VPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRC 115 Score = 41.1 bits (92), Expect = 0.026 Identities = 17/35 (48%), Positives = 25/35 (71%) Frame = +2 Query: 608 LFKNGPVEGAFTVYSDLLSYKSGVYKHTQGDVSAG 712 L +GP++ AF V+SD + Y+SGVY+HT G + G Sbjct: 210 LSTSGPLQVAFLVHSDFMYYESGVYQHTYGYMEGG 244 >UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|Rep: Cysteine protease - Giardia muris Length = 301 Score = 54.0 bits (124), Expect = 3e-06 Identities = 30/69 (43%), Positives = 38/69 (55%), Gaps = 4/69 (5%) Frame = +1 Query: 58 VIKDEHFATLPIKTHKIDL----IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGA 225 +I E+ +L +TH L LP+++DPR + C L EV DQ SCGSCWAF A Sbjct: 51 LIPVENLRSLRTETHVSQLNLGKTKELPKDYDPRVERAHC--LPEVADQASCGSCWAFSA 108 Query: 226 VEAMTDRVC 252 V DR C Sbjct: 109 VATFADRRC 117 Score = 44.8 bits (101), Expect = 0.002 Identities = 21/50 (42%), Positives = 27/50 (54%) Frame = +2 Query: 563 HVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQGDVSAG 712 HV D D + L +GP++ AF VYSD Y SGVY+H G + G Sbjct: 196 HVINYGMDLDRMMEALVYDGPLQVAFVVYSDFGYYSSGVYQHVNGMMEGG 245 >UniRef50_Q7Q9Y5 Cluster: 
ENSANGP00000012222; n=1; Anopheles gambiae str. PEST|Rep: ENSANGP00000012222 - Anopheles gambiae str. PEST Length = 101 Score = 53.2 bits (122), Expect = 6e-06 Identities = 21/39 (53%), Positives = 28/39 (71%) Frame = +2 Query: 581 GDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQG 697 GDE+ I E+F GP + FT+Y+D + YKSGVY+HT G Sbjct: 21 GDEERIMYEVFNFGPAQATFTMYTDFVQYKSGVYRHTFG 59 >UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2; cellular organisms|Rep: Cysteine proteinase, putative - Archaeoglobus fulgidus Length = 1088 Score = 50.0 bits (114), Expect = 6e-05 Identities = 24/41 (58%), Positives = 27/41 (65%) Frame = +1 Query: 115 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 +ASLP FD W D L+ VRDQGSCGSCWA AV A+ Sbjct: 591 MASLPSRFD----WRDYTGLSAVRDQGSCGSCWAHSAVAAL 627 >UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 323 Score = 49.6 bits (113), Expect = 8e-05 Identities = 24/56 (42%), Positives = 35/56 (62%) Frame = +1 Query: 121 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTILTELNIFIFL 288 ++P +FD R W DC ++ VR+Q SCGSCWA + DR+C I ++ NI + L Sbjct: 45 TIPASFDVRTNWGDC--MSPVREQQSCGSCWAQVTSGILADRMC-IESDKNIKMLL 97 >UniRef50_Q0PWU8 Cluster: Cathepsin B preproprotein-like protein; n=1; Diaphorina citri|Rep: Cathepsin B preproprotein-like protein - Diaphorina citri (Asian citrus psyllid) Length = 125 Score = 49.2 bits (112), Expect = 1e-04 Identities = 22/59 (37%), Positives = 36/59 (61%) Frame = +2 Query: 524 YDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQGD 700 Y+ Y+ D + GK + V + +++++GP+ F+VY+D L YKSGVY+H GD Sbjct: 7 YESTYRFDLKKGKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD 63 >UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: hypothetical protein - Strongylocentrotus purpuratus Length = 450 Score = 48.8 bits (111), Expect = 1e-04 Identities = 21/44 (47%), Positives = 26/44 (59%) 
Frame = +1 Query: 118 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 249 A LPE FD R+ WP ++EV DQG CGS WA +DR+ Sbjct: 195 ARLPETFDARENWPGL--IDEVIDQGKCGSSWAISTASVASDRL 236 Score = 41.1 bits (92), Expect = 0.026 Identities = 16/47 (34%), Positives = 28/47 (59%) Frame = +2 Query: 569 YTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQGDVSA 709 Y ++ E I E+++NGPV+ F V +D Y GVY++ + + +A Sbjct: 329 YRIAAREVDIMTEIYQNGPVQATFNVKNDFFVYNRGVYRNVKQEFTA 375 >UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like protein F26E4.3; n=2; Caenorhabditis|Rep: Uncharacterized peptidase C1-like protein F26E4.3 - Caenorhabditis elegans Length = 491 Score = 48.4 bits (110), Expect = 2e-04 Identities = 21/45 (46%), Positives = 27/45 (60%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTI 258 LPE+FD RDKW P ++ V DQG CGS W+ +DR+ I Sbjct: 223 LPEHFDARDKWG--PLIHPVADQGDCGSSWSVSTTAISSDRLAII 265 Score = 46.8 bits (106), Expect = 5e-04 Identities = 19/41 (46%), Positives = 25/41 (60%) Frame = +2 Query: 569 YTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHT 691 Y VS E+ I+ EL NGPV+ F V+ D Y GVY+H+ Sbjct: 357 YKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVYQHS 397 >UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted, possible transmembrane domain near N-terminus; n=4; Cryptosporidium|Rep: Cryptopain-cysteine proteinase secreted, possible transmembrane domain near N-terminus - Cryptosporidium parvum Iowa II Length = 401 Score = 47.6 bits (108), Expect = 3e-04 Identities = 23/72 (31%), Positives = 34/72 (47%), Gaps = 1/72 (1%) Frame = +1 Query: 40 LKKIMGVIKDEHFATLPIKTHKIDLIASLPENFDPRD-KWPDCPTLNEVRDQGSCGSCWA 216 + + G IKD K+ ++ S E P W + +N +R+Q +CGSCWA Sbjct: 143 MARFTGYIKDSKDDERVFKSSRVSASESEEEFVPPNSINWVEAGCVNPIRNQKNCGSCWA 202 Query: 217 FGAVEAMTDRVC 252 F AV A+ C Sbjct: 203 FSAVAALEGATC 214 >UniRef50_A0CYS7 Cluster: Chromosome undetermined scaffold_31, whole genome shotgun sequence; n=3; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_31, whole genome shotgun sequence - Paramecium 
tetraurelia Length = 358 Score = 47.6 bits (108), Expect = 3e-04 Identities = 20/57 (35%), Positives = 31/57 (54%) Frame = +2 Query: 527 DVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQG 697 D + ++Y H Y V E++I+ E+ NGP+ V+ D L YK GVY+ +G Sbjct: 233 DALFSNCEKYKIHDYCVVSGEENIKREILNNGPIVAVIQVFKDFLVYKGGVYEVVEG 289 Score = 36.3 bits (80), Expect = 0.75 Identities = 15/43 (34%), Positives = 27/43 (62%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 +PE+++ R+ P+C + QG+C S ++ AV A +DR+C Sbjct: 131 IPESYNFREAQPECA--QPIYFQGNCSSSYSIAAVSATSDRLC 171 >UniRef50_P81494 Cluster: Cathepsin B; n=2; Phasianidae|Rep: Cathepsin B - Coturnix coturnix japonica (Japanese quail) Length = 48 Score = 46.8 bits (106), Expect = 5e-04 Identities = 16/25 (64%), Positives = 22/25 (88%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGS 198 LP+ FD R +WP+CPT++E+RDQGS Sbjct: 1 LPDTFDSRKQWPNCPTISEIRDQGS 25 Score = 32.7 bits (71), Expect = 9.2 Identities = 14/25 (56%), Positives = 17/25 (68%), Gaps = 1/25 (4%) Frame = +3 Query: 264 GTKHFHFSAEDLLSCCPI-CGLGCS 335 G+ SAEDLLSCC CG+GC+ Sbjct: 24 GSVSVEVSAEDLLSCCGFECGMGCN 48 >UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gambiae str. PEST|Rep: ENSANGP00000018713 - Anopheles gambiae str. 
PEST Length = 559 Score = 46.4 bits (105), Expect = 7e-04 Identities = 20/38 (52%), Positives = 25/38 (65%) Frame = +1 Query: 115 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 228 + LP +FD W D + EV++QGSCGSCWAF AV Sbjct: 336 VGDLPRSFD----WRDHGAVTEVKNQGSCGSCWAFSAV 369 >UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicalis|Rep: Blo t 1 allergen - Blomia tropicalis (Mite) Length = 333 Score = 46.4 bits (105), Expect = 7e-04 Identities = 22/40 (55%), Positives = 25/40 (62%) Frame = +1 Query: 106 IDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGA 225 I+ SLP+NFD R K L +R QGSCGSCWAF A Sbjct: 107 INTYGSLPQNFDWRQK----ARLTRIRQQGSCGSCWAFAA 142 >UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35; Viridiplantae|Rep: Cysteine proteinase 15A precursor - Pisum sativum (Garden pea) Length = 363 Score = 46.0 bits (104), Expect = 0.001 Identities = 24/53 (45%), Positives = 31/53 (58%), Gaps = 2/53 (3%) Frame = +1 Query: 85 LPIKTHKIDLI--ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 LP K ++ +LPE+FD R+K P V+DQGSCGSCWAF A+ Sbjct: 117 LPAHAQKAPILPTTNLPEDFDWREKGAVTP----VKDQGSCGSCWAFSTTGAL 165 >UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio molitor (Yellow mealworm) Length = 336 Score = 45.6 bits (103), Expect = 0.001 Identities = 26/64 (40%), Positives = 39/64 (60%), Gaps = 3/64 (4%) Frame = +1 Query: 67 DEHFATLPIKTHK-IDLIASL--PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 D H +PIKT + + L AS+ P +FD W D ++ V++QGSCGSCWAF + A+ Sbjct: 99 DLHKNGIPIKTREDLGLNASVRYPASFD----WRDQGMVSPVKNQGSCGSCWAFSSTGAI 154 Query: 238 TDRV 249 ++ Sbjct: 155 ESQM 158 >UniRef50_UPI0000E4622C Cluster: PREDICTED: hypothetical protein; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: hypothetical protein - Strongylocentrotus purpuratus Length = 145 Score = 45.2 bits (102), Expect = 0.002 Identities = 18/34 (52%), Positives = 22/34 (64%) Frame = +2 Query: 587 EDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKH 688 E I+AE+F NGPV+ F V SD Y GVY+H Sbjct: 4 
EQQIQAEIFTNGPVQAVFNVKSDFFMYNGGVYRH 37 >UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lamblia ATCC 50803|Rep: GLP_217_11853_10927 - Giardia lamblia ATCC 50803 Length = 308 Score = 45.2 bits (102), Expect = 0.002 Identities = 20/43 (46%), Positives = 29/43 (67%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 +P++FD R+++P C T EV D G C S WA+ AV+A + R C Sbjct: 75 VPDHFDFREEYPQCIT--EVIDIGLCSSSWAYSAVDAFSHRRC 115 Score = 34.3 bits (75), Expect = 3.0 Identities = 12/37 (32%), Positives = 21/37 (56%) Frame = +2 Query: 590 DHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQGD 700 + ++ + GP++ FTVY D Y G+Y +T G+ Sbjct: 204 ERLKRAVALRGPMQAMFTVYEDFTYYLEGIYSYTYGN 240 >UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA, isoform A; n=2; Endopterygota|Rep: PREDICTED: similar to CG3074-PA, isoform A - Tribolium castaneum Length = 445 Score = 44.8 bits (101), Expect = 0.002 Identities = 23/69 (33%), Positives = 34/69 (49%) Frame = +1 Query: 40 LKKIMGVIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 219 +K +G ++ + F +I SLP FD KWP ++E++DQG CGS WA Sbjct: 169 IKLRLGTLQPQRFVMHMNPVRRIYDPNSLPREFDSEFKWPGW--MSEIQDQGWCGSSWAI 226 Query: 220 GAVEAMTDR 246 +DR Sbjct: 227 TTAAVASDR 235 Score = 40.7 bits (91), Expect = 0.035 Identities = 15/37 (40%), Positives = 23/37 (62%) Frame = +2 Query: 581 GDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHT 691 G+E I E+ +GPV+ VY D +YK G+Y+H+ Sbjct: 330 GNETDIMYEILHSGPVQATMKVYHDFFTYKRGIYRHS 366 Score = 34.7 bits (76), Expect = 2.3 Identities = 28/89 (31%), Positives = 35/89 (39%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIP 437 S G + SA+ LLSC C+GG AW Y + GLV + C PY Sbjct: 240 SKGREKVTLSAQHLLSCDRRGQQSCNGGYLDRAWSYIRKIGLV-------DEQCFPYSAT 292 Query: 438 PCEHHVPGNRMPCSGDTKTPKCTKNANLD 524 R+P GD T C N+D Sbjct: 293 N-----EKCRIPRRGDLVTANCQLPTNVD 316 >UniRef50_UPI00006CB77B Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease 
containing protein - Tetrahymena thermophila SB210 Length = 382 Score = 44.4 bits (100), Expect = 0.003 Identities = 19/39 (48%), Positives = 25/39 (64%) Frame = +2 Query: 569 YTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYK 685 Y VS ++ I+ E+ NGPV V+SD L YKSGVY+ Sbjct: 241 YCVSAGQESIKREIMLNGPVVSLMNVFSDFLVYKSGVYR 279 >UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; Phytophthora infestans|Rep: Cathepsin-like cysteine protease - Phytophthora infestans (Potato late blight fungus) Length = 376 Score = 44.4 bits (100), Expect = 0.003 Identities = 26/79 (32%), Positives = 40/79 (50%), Gaps = 1/79 (1%) Frame = +1 Query: 4 GRNFPRDTSFAHLKKIMGV-IKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNE 180 G N D + A K+++ +D ++ K + + LP +D W + T+ Sbjct: 92 GLNDLADLADAEYKQLLSYRTRDSKSSSASETFVKPENVEDLPATWD----WREHSTVTP 147 Query: 181 VRDQGSCGSCWAFGAVEAM 237 V++QG CGSCWAF AV AM Sbjct: 148 VKNQGQCGSCWAFSAVAAM 166 >UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestinalis|Rep: GLP_41_8294_9919 - Giardia lamblia ATCC 50803 Length = 541 Score = 44.4 bits (100), Expect = 0.003 Identities = 21/43 (48%), Positives = 29/43 (67%) Frame = +1 Query: 121 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 249 +LP++FD RD + V DQG+CGSC+ FGAV+AM R+ Sbjct: 240 TLPDDFDWRDV-NGVSYIPGVLDQGACGSCFTFGAVQAMNSRI 281 >UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 326 Score = 44.4 bits (100), Expect = 0.003 Identities = 20/44 (45%), Positives = 26/44 (59%) Frame = +1 Query: 88 PIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 219 P H + + LP FD R+K + EV+DQGSCGSCW+F Sbjct: 98 PRVIHSLTPVKDLPSKFDWREKG----AVTEVKDQGSCGSCWSF 137 >UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadidae|Rep: Cysteine protease - Tritrichomonas foetus (Trichomonas foetus) Length = 315 Score = 44.4 bits (100), Expect = 0.003 Identities = 20/48 (41%), Positives = 29/48 (60%) Frame = +1 Query: 91 
IKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEA 234 +K K+ P N D D W + +NE++DQ +CGSCWAF A++A Sbjct: 87 MKAEKVSRGMKKP-NVDSID-WREKGVVNEIKDQAACGSCWAFSAIQA 132 >UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=15; Magnoliophyta|Rep: Cysteine proteinase RD19a precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 368 Score = 44.4 bits (100), Expect = 0.003 Identities = 22/53 (41%), Positives = 32/53 (60%), Gaps = 2/53 (3%) Frame = +1 Query: 85 LPIKTHKIDLIAS--LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 LP +K ++ + LPE+FD W D + V++QGSCGSCW+F A A+ Sbjct: 120 LPKDANKAPILPTENLPEDFD----WRDHGAVTPVKNQGSCGSCWSFSATGAL 168 >UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Rep: Cysteine protease - Babesia equi Length = 438 Score = 44.0 bits (99), Expect = 0.004 Identities = 27/70 (38%), Positives = 35/70 (50%), Gaps = 3/70 (4%) Frame = +1 Query: 28 SFAHLKKIMGVIKDEHFATLPIKTHKIDLIASL--PENFDPRD-KWPDCPTLNEVRDQGS 198 S LKK + V E F T P K+ + L ++ D D W + V+DQG+ Sbjct: 186 SVEELKKSLEVSASEEF-TSPEHLDKVRIAKGLGVEDSVDGEDLDWRKLNGVTPVKDQGN 244 Query: 199 CGSCWAFGAV 228 CGSCWAF AV Sbjct: 245 CGSCWAFAAV 254 >UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheirus salmonis|Rep: Cysteine proteinase - Lepeophtheirus salmonis (salmon louse) Length = 372 Score = 44.0 bits (99), Expect = 0.004 Identities = 21/45 (46%), Positives = 28/45 (62%) Frame = +1 Query: 115 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 249 I LPE+ D R+K + +V++QGSCGSCW F AVE + V Sbjct: 112 IKDLPESVDWREKG----VITDVKNQGSCGSCWVFSAVEQIESYV 152 >UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin B-like cysteine peptidase - Trichomonas vaginalis G3 Length = 288 Score = 44.0 bits (99), Expect = 0.004 Identities = 24/65 (36%), Positives = 34/65 (52%) Frame = +2 Query: 506 KKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYK 685 KKC + + Q +Y S +E I + GPV + VYSDL+ YKSG+Y Sbjct: 168 
KKCTNESETYEAQFTEYWSVARYASIEEMQIG--IMTEGPVTTSLKVYSDLMYYKSGIYT 225 Query: 686 HTQGD 700 HT+G+ Sbjct: 226 HTKGE 230 Score = 42.3 bits (95), Expect = 0.011 Identities = 20/58 (34%), Positives = 36/58 (62%), Gaps = 1/58 (1%) Frame = +1 Query: 82 TLPI-KTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 T+P+ + KI++ S+P +++ +++P C V DQG CGSCW+F ++ + R C Sbjct: 55 TIPLARPPKINI--SIPMSYNFTERFPQCDF--GVLDQGKCGSCWSFAVSKSFSHRYC 108 >UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-like precursor; n=26; Euteleostomi|Rep: Tubulointerstitial nephritis antigen-like precursor - Homo sapiens (Human) Length = 467 Score = 44.0 bits (99), Expect = 0.004 Identities = 17/42 (40%), Positives = 24/42 (57%) Frame = +2 Query: 566 VYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHT 691 VY + ++ I EL +NGPV+ V+ D YK G+Y HT Sbjct: 343 VYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHT 384 Score = 40.7 bits (91), Expect = 0.035 Identities = 17/42 (40%), Positives = 24/42 (57%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 249 LP F+ +KWP+ ++E DQG+C WAF +DRV Sbjct: 203 LPTAFEASEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRV 242 >UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin O; n=1; Danio rerio|Rep: PREDICTED: similar to cathepsin O - Danio rerio Length = 327 Score = 43.6 bits (98), Expect = 0.005 Identities = 18/35 (51%), Positives = 21/35 (60%) Frame = +1 Query: 133 NFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 N PR W D + V +QGSCG CWAF VEA+ Sbjct: 119 NNPPRFDWRDHGVVGPVHNQGSCGGCWAFSIVEAI 153 >UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Rep: Cysteine protease - Solanum lycopersicum (Tomato) (Lycopersicon esculentum) Length = 345 Score = 43.6 bits (98), Expect = 0.005 Identities = 22/73 (30%), Positives = 39/73 (53%), Gaps = 2/73 (2%) Frame = +1 Query: 25 TSFAHLKKIMGV-IKDEHFATLPIKTHKIDLIASLPENFDPRD-KWPDCPTLNEVRDQGS 198 TS L K G+ I + + + P+ + + I L +++ P + W + + +V+ QG Sbjct: 92 TSQEFLAKFTGLNIPNSYLSPSPMSSTEFKKINDLSDDYMPSNLDWRESGAVTQVKHQGR 151 Query: 199 CGSCWAFGAVEAM 
237 CG CWAF AV ++ Sbjct: 152 CGCCWAFSAVGSL 164 >UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Bigelowiella natans|Rep: Digestive cysteine proteinase - Bigelowiella natans (Pedinomonas minutissima) (Chlorarachnion sp.(strain CCMP 621)) Length = 360 Score = 43.6 bits (98), Expect = 0.005 Identities = 16/28 (57%), Positives = 19/28 (67%) Frame = +1 Query: 154 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 W D L V+DQG CGSCWAF A +A+ Sbjct: 115 WRDFNALTPVKDQGGCGSCWAFSATQAL 142 >UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Rep: Cathepsin L - Stylonychia lemnae Length = 340 Score = 43.6 bits (98), Expect = 0.005 Identities = 18/44 (40%), Positives = 27/44 (61%) Frame = +1 Query: 115 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 246 + +PE+ D R+K +N V+DQG CGSCWAF + ++ R Sbjct: 122 LKDIPESIDWREKG----AVNAVKDQGQCGSCWAFSTIASLESR 161 >UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Trypanosoma cruzi|Rep: Cysteine proteinase, putative - Trypanosoma cruzi Length = 392 Score = 43.6 bits (98), Expect = 0.005 Identities = 20/51 (39%), Positives = 27/51 (52%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTILTELNI 276 +P+ D R+ P L V+DQG CGSCWA GA E M + L++ Sbjct: 141 IPDEVDYRNSSP--AILTAVKDQGRCGSCWAHGAAEEMESHFAILTGRLHV 189 >UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine proteinase precursor - Heterodera glycines (Soybean cyst nematode worm) Length = 353 Score = 43.6 bits (98), Expect = 0.005 Identities = 20/40 (50%), Positives = 26/40 (65%) Frame = +1 Query: 118 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 ++LPE D R+K + EV+DQG CGSCWAF A A+ Sbjct: 133 STLPEKLDWREKG----AVTEVKDQGDCGSCWAFSATGAI 168 >UniRef50_Q5VUI9 Cluster: Tubulointerstitial nephritis antigen; n=3; Homo sapiens|Rep: Tubulointerstitial nephritis antigen - Homo sapiens (Human) Length = 155 Score = 43.6 bits (98), Expect = 0.005 Identities = 17/40 (42%), Positives = 24/40 (60%) Frame = +2 Query: 569 
YTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKH 688 Y VS +E I E+ +NGPV+ V D YK+G+Y+H Sbjct: 34 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRH 73 >UniRef50_Q9UJW2 Cluster: Tubulointerstitial nephritis antigen; n=20; Amniota|Rep: Tubulointerstitial nephritis antigen - Homo sapiens (Human) Length = 476 Score = 43.6 bits (98), Expect = 0.005 Identities = 17/40 (42%), Positives = 24/40 (60%) Frame = +2 Query: 569 YTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKH 688 Y VS +E I E+ +NGPV+ V D YK+G+Y+H Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRH 394 Score = 35.1 bits (77), Expect = 1.7 Identities = 18/48 (37%), Positives = 24/48 (50%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSY 401 S G + S ++L+SCC GC+ G AW Y + GLVS Y Sbjct: 260 SKGRYTANLSPQNLISCCAKNRHGCNSGSIDRAWWYLRKRGLVSHACY 307 >UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 336 Score = 43.2 bits (97), Expect = 0.007 Identities = 21/41 (51%), Positives = 27/41 (65%), Gaps = 3/41 (7%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF---GAVEAM 237 LP +FD W D L++V+DQG CGSCWAF G +EA+ Sbjct: 125 LPASFD----WRDYGILSDVKDQGQCGSCWAFSTTGILEAL 161 Score = 33.1 bits (72), Expect = 7.0 Identities = 16/57 (28%), Positives = 26/57 (45%), Gaps = 4/57 (7%) Frame = +3 Query: 243 QSMYYSNGTKHFHFSAEDLLSCCP----ICGLGCSGGMPRLAWEYWKHFGLVSGGSY 401 +++Y+ + FS + L+ C GCSGG P A +Y FG++ Y Sbjct: 159 EALYFMENRQKISFSEQQLVDCATNSNGFNSYGCSGGWPEEALKYVAKFGILKEEQY 215 >UniRef50_Q58HK5 Cluster: Hc58; n=1; Haemonchus contortus|Rep: Hc58 - Haemonchus contortus (Barber pole worm) Length = 241 Score = 43.2 bits (97), Expect = 0.007 Identities = 15/24 (62%), Positives = 18/24 (75%) Frame = +1 Query: 181 VRDQGSCGSCWAFGAVEAMTDRVC 252 +RDQ +CGSCWA A E M+DR C Sbjct: 108 IRDQSNCGSCWAVSAAETMSDRAC 131 Score = 34.3 bits (75), Expect = 3.0 Identities = 19/57 (33%), Positives = 27/57 (47%), 
Gaps = 2/57 (3%) Frame = +3 Query: 237 DRQSMYYSNGTKHFHFSAEDLLSCC--PICGLGCSGGMPRLAWEYWKHFGLVSGGSY 401 DR ++ S D+LSCC C +G GG+ AW Y +G+ +GG Y Sbjct: 5 DRACIHSKGKAFKARLSDTDILSCCGKDPCQIG-EGGISARAWLYAMQYGVCTGGYY 60 >UniRef50_Q23RT7 Cluster: Papain family cysteine protease containing protein; n=7; Hymenostomatida|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 387 Score = 43.2 bits (97), Expect = 0.007 Identities = 25/82 (30%), Positives = 40/82 (48%) Frame = +1 Query: 19 RDTSFAHLKKIMGVIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGS 198 R+T+ + K + ++ + KI+ + LP++ D W D + V+DQG Sbjct: 99 RETTLGYSKTVKNAANKQNMFRNLKTSDKIN-VKDLPKSVD----WRDAGVVTPVKDQGH 153 Query: 199 CGSCWAFGAVEAMTDRVCTILT 264 CGSCWAF A A+ + I T Sbjct: 154 CGSCWAF-ATTAVIESYAAIAT 174 >UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 234 Score = 43.2 bits (97), Expect = 0.007 Identities = 18/42 (42%), Positives = 26/42 (61%) Frame = +1 Query: 112 LIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 ++ +P+ D R K +NE++DQ CGSCWAFG+ AM Sbjct: 14 IVGDIPDEIDYRTKG----AVNEIKDQKHCGSCWAFGSCAAM 51 Score = 35.9 bits (79), Expect = 0.99 Identities = 19/44 (43%), Positives = 26/44 (59%) Frame = +3 Query: 246 SMYYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHF 377 S + +GT + S + L+ CC C LGC G +P LA+EY K F Sbjct: 54 SWFLKHGTL-YSLSEQCLVDCCHDC-LGCHGCLPSLAFEYVKIF 95 >UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Oryza sativa|Rep: Cysteine protease 1 precursor - Oryza sativa subsp. 
japonica (Rice) Length = 490 Score = 43.2 bits (97), Expect = 0.007 Identities = 20/48 (41%), Positives = 31/48 (64%) Frame = +1 Query: 94 KTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 + ++ D + +LP++ D RDK + V++QG CGSCWAF AV A+ Sbjct: 145 EAYRHDGVEALPDSVDWRDKGA---VVAPVKNQGQCGSCWAFSAVAAV 189 >UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative uncharacterized protein - Oryza sativa subsp. japonica (Rice) Length = 326 Score = 42.7 bits (96), Expect = 0.009 Identities = 17/41 (41%), Positives = 25/41 (60%) Frame = +1 Query: 115 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 +A++ + P W + + V+DQG CGSCWAF VEA+ Sbjct: 110 LAAVAGDAPPAWDWREHGAVTRVKDQGPCGSCWAFSVVEAV 150 >UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n=21; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Globodera pallida Length = 379 Score = 42.7 bits (96), Expect = 0.009 Identities = 24/48 (50%), Positives = 30/48 (62%), Gaps = 4/48 (8%) Frame = +1 Query: 115 IASLPENFDPRDK-WPDCPTLNEVRDQGSCGSCWAF---GAVEAMTDR 246 + LPE+ D RDK W + EV++QG CGSCWAF GA+EA R Sbjct: 158 VGDLPESVDWRDKGW-----VTEVKNQGMCGSCWAFSSTGALEAQHAR 200 >UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 306 Score = 42.7 bits (96), Expect = 0.009 Identities = 22/72 (30%), Positives = 41/72 (56%), Gaps = 2/72 (2%) Frame = +1 Query: 40 LKKIMGVIKDEHFATLPIKT-HKIDLIASLPENFDPRD-KWPDCPTLNEVRDQGSCGSCW 213 L + + ++E+ + L K HK I +N P + W + +N++++QG+CGSCW Sbjct: 54 LNRFAHLTENEYRSMLGYKYGHKSYPITKNIKNDVPTEIDWREQGIVNKIKNQGACGSCW 113 Query: 214 AFGAVEAMTDRV 249 AF A++ + +V Sbjct: 114 AFSAIQVIESQV 125 >UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin O precursor; n=2; Apocrita|Rep: PREDICTED: similar to Cathepsin O precursor - Apis mellifera Length = 374 Score = 42.3 bits (95), Expect = 0.011 Identities = 
20/39 (51%), Positives = 24/39 (61%) Frame = +1 Query: 121 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 S+P FD RDK P VR QGSCG+CWAF +E + Sbjct: 154 SIPLRFDWRDKGVITP----VRSQGSCGACWAFSTIEVI 188 >UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina). Cathepsin L; n=2; Dictyostelium discoideum|Rep: Similar to Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina). Cathepsin L - Dictyostelium discoideum (Slime mold) Length = 265 Score = 42.3 bits (95), Expect = 0.011 Identities = 18/45 (40%), Positives = 31/45 (68%) Frame = +1 Query: 103 KIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 K ++ A++P++FD W D + +V++QGSC SCW+F A+ A+ Sbjct: 40 KHNVNATIPKSFD----WRDHGAVGKVKNQGSCASCWSFSALGAL 80 >UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens (Human) Length = 331 Score = 42.3 bits (95), Expect = 0.011 Identities = 23/47 (48%), Positives = 31/47 (65%) Frame = +1 Query: 97 THKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 T+K + LP++ D R+K C T EV+ QGSCG+CWAF AV A+ Sbjct: 106 TYKSNPNRILPDSVDWREK--GCVT--EVKYQGSCGACWAFSAVGAL 148 >UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 328 Score = 41.9 bits (94), Expect = 0.015 Identities = 20/49 (40%), Positives = 30/49 (61%), Gaps = 1/49 (2%) Frame = +1 Query: 124 LPENFDPRDKWPD-CPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTILTE 267 +P+ FD RD + D P + V+DQ CG CWAF A A+T+ T+ ++ Sbjct: 97 IPDYFDLRDIYVDGSPVVGPVKDQEQCGCCWAF-ATTAITEAANTLYSK 144 >UniRef50_A2GCC2 Cluster: Clan CA, family C1, cathepsin B-like cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin B-like cysteine peptidase - Trichomonas vaginalis G3 Length = 135 Score = 41.9 bits (94), Expect = 0.015 Identities = 21/49 (42%), Positives = 27/49 (55%) Frame = +2 Query: 539 KQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYK 685 K Q+ H + ED I+ E+ +NGPV F V DL YKSGVY+ 
Sbjct: 25 KYKTQHNSHKFFYG--EDEIKNEILQNGPVTAVFDVRPDLAYYKSGVYQ 71 >UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag-RP - Bombyx mori (Silk moth) Length = 404 Score = 41.9 bits (94), Expect = 0.015 Identities = 18/45 (40%), Positives = 29/45 (64%), Gaps = 1/45 (2%) Frame = +2 Query: 569 YTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQ-GD 700 +++S +ED I ++ +GP G TVY D Y+ G+Y+HT+ GD Sbjct: 299 FSISKEED-IMYDIMTSGPALGIMTVYQDFFHYREGIYRHTRHGD 342 Score = 35.1 bits (77), Expect = 1.7 Identities = 25/65 (38%), Positives = 36/65 (55%) Frame = +3 Query: 237 DRQSMYYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQG 416 DR S+ S GT++ S++ LLSC GC+GG +A+++ K GLV S+ Sbjct: 222 DRFSIQ-SFGTENVRMSSQTLLSCHLKGQRGCNGGNLDIAFDFVKTHGLV-------SEQ 273 Query: 417 CRPYE 431 C PYE Sbjct: 274 CFPYE 278 >UniRef50_UPI0000E815AE Cluster: PREDICTED: similar to glucocorticoid-inducible protein; n=1; Gallus gallus|Rep: PREDICTED: similar to glucocorticoid-inducible protein - Gallus gallus Length = 307 Score = 41.5 bits (93), Expect = 0.020 Identities = 17/42 (40%), Positives = 23/42 (54%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 249 LP +FD KWP ++E DQG+C WAF +DR+ Sbjct: 153 LPRHFDAATKWPGM--IHEPLDQGNCAGSWAFSTAAVASDRI 192 Score = 35.1 bits (77), Expect = 1.7 Identities = 30/95 (31%), Positives = 41/95 (43%), Gaps = 1/95 (1%) Frame = +3 Query: 237 DRQSMYYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYN-SSQ 413 DR S++ S G S ++LLSC GCSGG AW Y + G+V+ Y +SQ Sbjct: 190 DRISIH-SMGHMTPSLSPQNLLSCDTRNQRGCSGGRLDGAWWYLRRRGVVTDECYPFTSQ 248 Query: 414 GCRPYEIPPCEHHVPGNRMPCSGDTKTPKCTKNAN 518 +P P H R + P +AN Sbjct: 249 DSQPAAQPCMMHSRSTGRGKRQATARCPNPQTHAN 283 >UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2; Taeniidae|Rep: Cathepsin L-like cysteine proteinase - Taenia solium (Pork tapeworm) Length = 339 Score = 41.5 bits (93), Expect = 0.020 Identities = 19/40 (47%), Positives = 26/40 (65%) Frame = +1 Query: 118 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 A LP+ D RDK 
+ EV++QG+CGSCWAF + A+ Sbjct: 122 AGLPDTVDWRDK----NLVTEVKNQGNCGSCWAFSSTGAL 157 >UniRef50_Q24E33 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 328 Score = 41.5 bits (93), Expect = 0.020 Identities = 24/82 (29%), Positives = 43/82 (52%), Gaps = 9/82 (10%) Frame = +1 Query: 19 RDTSFAHLKKIMGVIKDEHFATLPI---KTHKIDLIASLPENFD-----PRD-KWPDCPT 171 ++ +F IM ++ DE +++L + + ID+ SL ++ + P + W Sbjct: 79 KNNTFKLAINIMAILTDEEYSSLYLNLDQQESIDIFDSLVDDNETVGDIPSEVNWTAQGA 138 Query: 172 LNEVRDQGSCGSCWAFGAVEAM 237 + V++QGSCGSCWAF A+ Sbjct: 139 VTPVKNQGSCGSCWAFSTTGAL 160 >UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precursor; n=3; Plasmodium (Laverania)|Rep: Trophozoite cysteine proteinase precursor - Plasmodium falciparum Length = 569 Score = 41.5 bits (93), Expect = 0.020 Identities = 19/45 (42%), Positives = 29/45 (64%) Frame = +1 Query: 94 KTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 228 K ++ D+ + +PE D R+K ++E +DQG CGSCWAF +V Sbjct: 323 KRNEKDIFSKVPEILDYREKG----IVHEPKDQGLCGSCWAFASV 363 >UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; Eukaryota|Rep: Cysteine proteinase 1 precursor - Zea mays (Maize) Length = 371 Score = 41.5 bits (93), Expect = 0.020 Identities = 18/38 (47%), Positives = 25/38 (65%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 LP++FD W D + V++QGSCGSCW+F A A+ Sbjct: 137 LPDDFD----WRDHGAVGPVKNQGSCGSCWSFSASGAL 170 >UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromeliaceae|Rep: Fruit bromelain precursor - Ananas comosus (Pineapple) Length = 351 Score = 41.5 bits (93), Expect = 0.020 Identities = 16/38 (42%), Positives = 26/38 (68%) Frame = +1 Query: 115 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 228 I+++P++ D W D +NEV++Q CGSCW+F A+ Sbjct: 120 ISAVPQSID----WRDYGAVNEVKNQNPCGSCWSFAAI 153 >UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin C; n=1; 
Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin C - Strongylocentrotus purpuratus Length = 482 Score = 41.1 bits (92), Expect = 0.026 Identities = 21/57 (36%), Positives = 33/57 (57%) Frame = +1 Query: 118 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTILTELNIFIFL 288 ++LPE FD RD ++ VRDQG CGSC+AF + R+ ++T N+ + + Sbjct: 247 SNLPEKFDWRDVG-GIDYVSPVRDQGICGSCYAFASTATQESRL-RVMTNNNVKVVM 301 Score = 41.1 bits (92), Expect = 0.026 Identities = 16/35 (45%), Positives = 24/35 (68%) Frame = +2 Query: 584 DEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKH 688 +ED +R EL ++GP+ +F VY D L Y+ G+Y H Sbjct: 373 NEDLMRLELLRSGPLAISFEVYDDFLFYRGGIYHH 407 >UniRef50_Q5CKC2 Cluster: Preprocathepsin c; n=2; Cryptosporidium|Rep: Preprocathepsin c - Cryptosporidium hominis Length = 635 Score = 41.1 bits (92), Expect = 0.026 Identities = 17/39 (43%), Positives = 25/39 (64%) Frame = +2 Query: 584 DEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQGD 700 DED ++ E+FKNGP+ A + + LL Y++GVY D Sbjct: 477 DEDRMKEEIFKNGPIAVAMHIDTSLLVYENGVYDSIPND 515 >UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1; Dictyostelium discoideum AX4|Rep: Counting factor associated protein - Dictyostelium discoideum AX4 Length = 531 Score = 41.1 bits (92), Expect = 0.026 Identities = 21/57 (36%), Positives = 31/57 (54%) Frame = +1 Query: 100 HKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTILTEL 270 H + + S+P D R++ +C T V+DQG CGSCW FG+ ++ C EL Sbjct: 301 HDDESLRSIPSTVDWRNQ--NCVT--PVKDQGICGSCWTFGSTGSLEGTNCVTNGEL 353 >UniRef50_Q22W19 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 41.1 bits (92), Expect = 0.026 Identities = 19/45 (42%), Positives = 23/45 (51%) Frame = +1 Query: 94 KTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 228 K K LI SL + P W + V++QG CGSCWAF V Sbjct: 109 KRQKSHLIYSLKGDVAPSIDWRQKNAVTPVKNQGQCGSCWAFSTV 153 >UniRef50_A0CPL8 Cluster: Chromosome 
undetermined scaffold_23, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_23, whole genome shotgun sequence - Paramecium tetraurelia Length = 321 Score = 41.1 bits (92), Expect = 0.026 Identities = 20/66 (30%), Positives = 33/66 (50%), Gaps = 5/66 (7%) Frame = +1 Query: 55 GVIKDEHFATLPIKTHKIDLIASLPENFDP-----RDKWPDCPTLNEVRDQGSCGSCWAF 219 G + D+ F T+ + + ++ +N +P W + ++DQG CGSCWAF Sbjct: 87 GDLTDQEFLTIYLNLQMPARVKNIQKNEEPFLVQEEVDWVQKGKVPAIKDQGDCGSCWAF 146 Query: 220 GAVEAM 237 AV A+ Sbjct: 147 SAVGAL 152 >UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=176; Viridiplantae|Rep: Cysteine proteinase RD21a precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 462 Score = 41.1 bits (92), Expect = 0.026 Identities = 25/53 (47%), Positives = 30/53 (56%), Gaps = 3/53 (5%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF---GAVEAMTDRVCTILTELN 273 LPE+ D R K + EV+DQG CGSCWAF GAVE + V L L+ Sbjct: 137 LPESIDWRKKG----AVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLS 185 >UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Plasmodium|Rep: Cysteine proteinase precursor - Plasmodium vivax (strain Salvador I) Length = 583 Score = 41.1 bits (92), Expect = 0.026 Identities = 19/40 (47%), Positives = 27/40 (67%) Frame = +1 Query: 109 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 228 +L+A +PE D R+K ++E +DQG CGSCWAF +V Sbjct: 334 NLLADVPEILDYREKG----IVHEPKDQGLCGSCWAFASV 369 >UniRef50_Q9GZM7-2 Cluster: Isoform 2 of Q9GZM7 ; n=1; Homo sapiens|Rep: Isoform 2 of Q9GZM7 - Homo sapiens (Human) Length = 283 Score = 40.7 bits (91), Expect = 0.035 Identities = 17/42 (40%), Positives = 24/42 (57%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 249 LP F+ +KWP+ ++E DQG+C WAF +DRV Sbjct: 69 LPTAFEASEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRV 108 >UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera 
(western corn rootworm) Length = 317 Score = 40.7 bits (91), Expect = 0.035 Identities = 19/39 (48%), Positives = 25/39 (64%) Frame = +1 Query: 121 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 ++PE+ D R+K +N VRDQ CGSCWAF A A+ Sbjct: 103 TVPESIDWREKG----AVNPVRDQEQCGSCWAFSAAGAL 137 >UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin L-like cysteine proteinase precursor - Acanthoscelides obtectus (Bean weevil) Length = 321 Score = 40.7 bits (91), Expect = 0.035 Identities = 19/47 (40%), Positives = 29/47 (61%) Frame = +1 Query: 109 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 249 D + +P+ D R+K + EV+ QG+CGSCWAF AV ++ +V Sbjct: 105 DNVNDIPKTVDWREKG----AVTEVKKQGNCGSCWAFSAVGSIEGQV 147 >UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcoptes scabiei type hominis|Rep: Sar s 1 allergen Yv9053H09 - Sarcoptes scabiei type hominis Length = 253 Score = 40.7 bits (91), Expect = 0.035 Identities = 18/38 (47%), Positives = 26/38 (68%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 LPE FD RD L+++R+QG CG+CWAF A+ ++ Sbjct: 37 LPEKFDLRD----LGYLSKIRNQGRCGACWAFAALASV 70 Score = 38.7 bits (86), Expect = 0.14 Identities = 24/86 (27%), Positives = 38/86 (44%), Gaps = 1/86 (1%) Frame = +3 Query: 147 RQMA*LSNVE*SQRSRVLWQLLGFRCRRS-YDRQSMYYSNGTKHFHFSAEDLLSCCPICG 323 R + LS + R W S Y+R++ N T+ HFS ++L+ C P Sbjct: 44 RDLGYLSKIRNQGRCGACWAFAALASVESAYNRRTRIVHNRTRKHHFSEQELVDCSPNTE 103 Query: 324 LGCSGGMPRLAWEYWKHFGLVSGGSY 401 GCSG + +Y + G+V +Y Sbjct: 104 -GCSGNIISNGLKYVQLRGVVKSANY 128 >UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: Cathepsin L - Kudoa thyrsites Length = 300 Score = 40.7 bits (91), Expect = 0.035 Identities = 21/66 (31%), Positives = 33/66 (50%) Frame = +1 Query: 40 LKKIMGVIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 219 LK + V+ P +T D+ ++LP + D W + V++QG CGSCW+F Sbjct: 74 LKPKLPVVSTPTHGITPKETATKDIKSTLPSSVD----WKALGKVTSVKNQGHCGSCWSF 129 Query: 220 GAVEAM 237 A 
A+ Sbjct: 130 SAAGAI 135 >UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_56, whole genome shotgun sequence - Paramecium tetraurelia Length = 314 Score = 40.7 bits (91), Expect = 0.035 Identities = 23/61 (37%), Positives = 33/61 (54%), Gaps = 2/61 (3%) Frame = +1 Query: 61 IKDEHFATLPI--KTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEA 234 + +E FA L + K ++L A L P D + V++QG+CGSCWAF AV A Sbjct: 83 LTNEEFAALLLTRKESPMNLDAELYVPQGPLKASADWSKITSVKNQGNCGSCWAFSAVGA 142 Query: 235 M 237 + Sbjct: 143 V 143 >UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_21, whole genome shotgun sequence - Paramecium tetraurelia Length = 349 Score = 40.7 bits (91), Expect = 0.035 Identities = 15/22 (68%), Positives = 20/22 (90%) Frame = +1 Query: 172 LNEVRDQGSCGSCWAFGAVEAM 237 ++EV++QGSCGSCWAF AV A+ Sbjct: 137 VSEVKNQGSCGSCWAFSAVAAL 158 >UniRef50_Q649T1 Cluster: Cathepsin C; n=1; uncultured archaeon GZfos34G5|Rep: Cathepsin C - uncultured archaeon GZfos34G5 Length = 760 Score = 40.7 bits (91), Expect = 0.035 Identities = 27/82 (32%), Positives = 40/82 (48%), Gaps = 3/82 (3%) Frame = +1 Query: 1 AGRNFPRDTSFAHLKKIMGV--IKDEHFATLPIKTHKIDLIASLP-ENFDPRDKWPDCPT 171 AG D +F K + G+ + + + + L AS+P FD RDK Sbjct: 262 AGETSVSDLTFEEKKMLCGIKSLYGLRILSTEERVRVVALDASVPIGTFDWRDK-DGANW 320 Query: 172 LNEVRDQGSCGSCWAFGAVEAM 237 + V++QGSCGSC AFG + A+ Sbjct: 321 ITSVKEQGSCGSCVAFGTIGAL 342 >UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1 precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 355 Score = 40.7 bits (91), Expect = 0.035 Identities = 20/41 (48%), Positives = 24/41 (58%) Frame = +1 Query: 115 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 I LP++ D R K P V+DQG CGSCWAF V A+ Sbjct: 134 
ITDLPKSVDWRKKGAVAP----VKDQGQCGSCWAFSTVAAV 170 >UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|Rep: Cathepsin L precursor - Schistosoma mansoni (Blood fluke) Length = 319 Score = 40.7 bits (91), Expect = 0.035 Identities = 17/35 (48%), Positives = 25/35 (71%) Frame = +1 Query: 115 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 219 + ++P+NFD R+K + EV++QG CGSCWAF Sbjct: 102 VNNIPKNFDWREKG----AVTEVKNQGMCGSCWAF 132 >UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F like protease; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to cathepsin F like protease - Nasonia vitripennis Length = 1036 Score = 40.3 bits (90), Expect = 0.046 Identities = 25/69 (36%), Positives = 33/69 (47%), Gaps = 4/69 (5%) Frame = +1 Query: 25 TSFAHLKKIMGVIKDEHFATLPIKTHKIDL---IASLPENFDPRD-KWPDCPTLNEVRDQ 192 T F L K K H P + D+ +A++P+ P D W + V+DQ Sbjct: 778 TQFTDLTK--AEFKARHLGLKPTLKSENDIPMPMATIPDIELPSDYDWRHHNVVTPVKDQ 835 Query: 193 GSCGSCWAF 219 GSCGSCWAF Sbjct: 836 GSCGSCWAF 844 >UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine protease; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cysteine protease - Strongylocentrotus purpuratus Length = 494 Score = 40.3 bits (90), Expect = 0.046 Identities = 19/51 (37%), Positives = 28/51 (54%), Gaps = 1/51 (1%) Frame = +1 Query: 88 PIKTHKIDLIASLPENFDPRD-KWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 P+K I A++P+ P + W + V++QG CGSCWAF A+ M Sbjct: 223 PLKKTGIKKQAAIPQGPVPEEYDWRTHGAVTPVKNQGMCGSCWAFSAIGNM 273 >UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 280 Score = 40.3 bits (90), Expect = 0.046 Identities = 16/34 (47%), Positives = 24/34 (70%) Frame = +1 Query: 118 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 219 +SLP+ FD W + + +V++QG+CGSCWAF Sbjct: 66 SSLPQQFD----WRNLGKVTQVKNQGNCGSCWAF 95 >UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnoliophyta|Rep: 
Cysteine proteinase Mir2 - Zea mays (Maize) Length = 493 Score = 40.3 bits (90), Expect = 0.046 Identities = 15/28 (53%), Positives = 19/28 (67%) Frame = +1 Query: 154 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 W + + EV+DQG CG CWAF AV A+ Sbjct: 170 WRERGAVAEVKDQGQCGGCWAFSAVAAV 197 >UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypanosoma cruzi|Rep: Cysteine protease, putative - Trypanosoma cruzi Length = 434 Score = 40.3 bits (90), Expect = 0.046 Identities = 15/24 (62%), Positives = 18/24 (75%) Frame = +1 Query: 166 PTLNEVRDQGSCGSCWAFGAVEAM 237 P L V+DQGSCGSCWA A E++ Sbjct: 137 PVLTPVKDQGSCGSCWAHAATESV 160 >UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Longidorus elongatus Length = 358 Score = 40.3 bits (90), Expect = 0.046 Identities = 18/44 (40%), Positives = 26/44 (59%), Gaps = 2/44 (4%) Frame = +1 Query: 112 LIASLPENFDPRDK--WPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 +I +P+N D W + +V+DQGSCGSCWAF A ++ Sbjct: 129 MIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCWAFSATGSL 172 >UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foetus|Rep: TFCP2 protein - Tritrichomonas foetus (Trichomonas foetus) Length = 270 Score = 40.3 bits (90), Expect = 0.046 Identities = 17/36 (47%), Positives = 23/36 (63%) Frame = +1 Query: 127 PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEA 234 P +FD W +N +++QGSCGSCWAF A+ A Sbjct: 51 PTSFD----WRSEGKVNPIKNQGSCGSCWAFSAIAA 82 >UniRef50_Q23FR0 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 394 Score = 40.3 bits (90), Expect = 0.046 Identities = 15/22 (68%), Positives = 16/22 (72%) Frame = +1 Query: 172 LNEVRDQGSCGSCWAFGAVEAM 237 LN V+DQG CGSCW FGA M Sbjct: 196 LNPVKDQGQCGSCWTFGAAGVM 217 >UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis|Rep: Cysteine protease 2 - Babesia bovis Length = 445 Score = 40.3 bits (90), Expect = 0.046 
Identities = 17/32 (53%), Positives = 20/32 (62%) Frame = +1 Query: 133 NFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 228 NF+ D W + V+DQG CGSCWAF AV Sbjct: 236 NFEDID-WRRADAVTPVKDQGMCGSCWAFAAV 266 >UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3.4.14.1) (Dipeptidyl-peptidase I) (DPP-I) (DPPI) (Cathepsin C) (Cathepsin J) (Dipeptidyl transferase) [Contains: Dipeptidyl-peptidase 1 exclusion domain chain (Dipeptidyl- peptidase I exclusion domain chain); Dipeptidyl-peptidase 1 heavy chain (Dipeptidyl-peptidase I heavy chain); Dipeptidyl-peptidase 1 light chain (Dipeptidyl-peptidase I light chain)]; n=50; Coelomata|Rep: Dipeptidyl-peptidase 1 precursor (EC 3.4.14.1) (Dipeptidyl-peptidase I) (DPP-I) (DPPI) (Cathepsin C) (Cathepsin J) (Dipeptidyl transferase) [Contains: Dipeptidyl-peptidase 1 exclusion domain chain (Dipeptidyl- peptidase I exclusion domain chain); Dipeptidyl-peptidase 1 heavy chain (Dipeptidyl-peptidase I heavy chain); Dipeptidyl-peptidase 1 light chain (Dipeptidyl-peptidase I light chain)] - Homo sapiens (Human) Length = 463 Score = 40.3 bits (90), Expect = 0.046 Identities = 17/36 (47%), Positives = 23/36 (63%) Frame = +2 Query: 584 DEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHT 691 +E ++ EL +GP+ AF VY D L YK G+Y HT Sbjct: 356 NEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHT 391 >UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2; Brugia malayi|Rep: Cahepsin L-like cysteine protease - Brugia malayi (Filarial nematode worm) Length = 371 Score = 39.9 bits (89), Expect = 0.061 Identities = 18/47 (38%), Positives = 27/47 (57%) Frame = +1 Query: 97 THKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 T ++ + LP++ D W + +V+DQG CGSCW F AV A+ Sbjct: 134 TIRMKINGPLPKSID----WRTSGAVTKVKDQGYCGSCWTFSAVGAL 176 >UniRef50_O16454 Cluster: Temporarily assigned gene name protein 196; n=4; Bilateria|Rep: Temporarily assigned gene name protein 196 - Caenorhabditis elegans Length = 477 Score = 39.9 bits (89), Expect = 0.061 Identities = 17/32 (53%), Positives = 24/32 (75%) Frame = +1 
Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 219 LPE+FD R+K + +V++QG+CGSCWAF Sbjct: 264 LPESFDWREKG----AVTQVKNQGNCGSCWAF 291 >UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=17; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 318 Score = 39.9 bits (89), Expect = 0.061 Identities = 13/27 (48%), Positives = 18/27 (66%) Frame = +1 Query: 154 WPDCPTLNEVRDQGSCGSCWAFGAVEA 234 W + +N ++DQ CGSCWAF V+A Sbjct: 106 WRNAKIVNPIKDQAQCGSCWAFSVVQA 132 >UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanensis|Rep: Sui m 1 allergen - Suidasia medanensis Length = 336 Score = 39.9 bits (89), Expect = 0.061 Identities = 18/43 (41%), Positives = 25/43 (58%) Frame = +1 Query: 91 IKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 219 ++ + D+ +LP FD R +W VR+QG CGSCWAF Sbjct: 104 VQVPESDISVALPAAFDWRQQWNTA-----VRNQGQCGSCWAF 141 >UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, whole genome shotgun sequence; n=7; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_22, whole genome shotgun sequence - Paramecium tetraurelia Length = 350 Score = 39.9 bits (89), Expect = 0.061 Identities = 24/59 (40%), Positives = 31/59 (52%), Gaps = 6/59 (10%) Frame = +1 Query: 61 IKDEHFAT--LPIKTHKIDLIASLPE----NFDPRDKWPDCPTLNEVRDQGSCGSCWAF 219 + DE FA L +K + DL + N P D W +N+V+DQG CGSCWAF Sbjct: 112 LTDEEFAATYLTLKVNPDDLEVPKAQFENVNATPID-WRTRGAVNKVKDQGQCGSCWAF 169 >UniRef50_Q2FLC7 Cluster: Periplasmic copper-binding precursor; n=1; Methanospirillum hungatei JF-1|Rep: Periplasmic copper-binding precursor - Methanospirillum hungatei (strain JF-1 / DSM 864) Length = 1092 Score = 39.9 bits (89), Expect = 0.061 Identities = 18/48 (37%), Positives = 26/48 (54%) Frame = +1 Query: 94 KTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 K + ++A P FD RD + +RDQG GSCW F AV+++ Sbjct: 77 KIRSLSILADYPSKFDLRDS----KRVPAIRDQGQSGSCWDFAAVKSL 120 >UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; 
n=61; Leishmania|Rep: Cysteine proteinase 2 precursor - Leishmania pifanoi Length = 444 Score = 39.9 bits (89), Expect = 0.061 Identities = 18/38 (47%), Positives = 26/38 (68%) Frame = +1 Query: 115 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 228 ++++P+ D R+K P V+DQG+CGSCWAF AV Sbjct: 123 LSAVPDAVDWREKGAVTP----VKDQGACGSCWAFSAV 156 >UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase) (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1; Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase) (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1; Vignain-2] - Vigna mungo (Rice bean) (Black gram) Length = 362 Score = 39.9 bits (89), Expect = 0.061 Identities = 18/47 (38%), Positives = 27/47 (57%) Frame = +1 Query: 97 THKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 T + + S+P + D R K + +V+DQG CGSCWAF + A+ Sbjct: 119 TFMYEKVGSVPASVDWRKKG----AVTDVKDQGQCGSCWAFSTIVAV 161 >UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovirus|Rep: Viral cathepsin - Spodoptera litura multicapsid nucleopolyhedrovirus (SpltMNPV) Length = 337 Score = 39.9 bits (89), Expect = 0.061 Identities = 17/37 (45%), Positives = 23/37 (62%) Frame = +1 Query: 118 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 228 A PE+FD W + +V++QG CGSCWAF A+ Sbjct: 124 ARTPESFD----WRKLNKVTKVKEQGVCGSCWAFAAI 156 >UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; n=35; Fasciola|Rep: Cathepsin L-like proteinase precursor - Fasciola hepatica (Liver fluke) Length = 326 Score = 39.9 bits (89), Expect = 0.061 Identities = 20/54 (37%), Positives = 27/54 (50%), Gaps = 1/54 (1%) Frame = +3 Query: 243 QSMYYSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPRLAWEYWKHFGLVSGGSY 401 + Y N FS + L+ C P GCSGG+ A++Y K FGL + SY Sbjct: 142 EGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQFGLETESSY 195 Score = 39.5 bits (88), Expect = 0.080 Identities = 14/28 (50%), Positives = 18/28 (64%) Frame = +1 Query: 154 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 
W + + EV+DQG+CGSCWAF M Sbjct: 114 WRESGYVTEVKDQGNCGSCWAFSTTGTM 141 >UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to GM06507p - Nasonia vitripennis Length = 483 Score = 39.5 bits (88), Expect = 0.080 Identities = 17/41 (41%), Positives = 24/41 (58%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 246 LP FD R +W + + V+DQG CG+ WA V+ +DR Sbjct: 236 LPREFDSRIQWGN--DITPVQDQGWCGASWAISTVDVASDR 274 Score = 37.5 bits (83), Expect = 0.32 Identities = 14/38 (36%), Positives = 23/38 (60%) Frame = +2 Query: 581 GDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQ 694 G+E I E+ +GPV+ V+ D Y+SG+Y H++ Sbjct: 371 GNETDIMQEILTSGPVQATMRVHRDFFHYESGIYVHSR 408 Score = 32.7 bits (71), Expect = 9.2 Identities = 16/48 (33%), Positives = 22/48 (45%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSY 401 S G + S + L+SC GC GG AW + + FG+V Y Sbjct: 279 SKGIEKVQLSGQHLISCNNRGQRGCKGGYLDRAWLFMRKFGVVDEDCY 326 >UniRef50_UPI0000E4866E Cluster: PREDICTED: similar to oxidized-LDL responsive gene 2, partial; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to oxidized-LDL responsive gene 2, partial - Strongylocentrotus purpuratus Length = 363 Score = 39.5 bits (88), Expect = 0.080 Identities = 16/43 (37%), Positives = 25/43 (58%) Frame = +1 Query: 121 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 249 ++PE FD R +WP + V++QG+C S WA +DR+ Sbjct: 221 AIPEEFDARAQWPGL--VEGVQNQGNCASSWAMSTAATASDRL 261 Score = 38.7 bits (86), Expect = 0.14 Identities = 21/49 (42%), Positives = 27/49 (55%), Gaps = 1/49 (2%) Frame = +3 Query: 258 SNGT-KHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSY 401 SNGT K+ H S + LLSC GC+GG AW Y + G+V+ Y Sbjct: 265 SNGTFKYMHLSPQHLLSCNVKRQQGCAGGHLDRAWWYMRKRGIVTEDCY 313 >UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa (Rice) Length = 339 Score = 39.5 bits (88), Expect = 0.080 Identities = 20/41 (48%), Positives = 23/41 (56%) Frame = +1 Query: 115 
IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 I +LP D R K P ++DQG CG CWAF AV AM Sbjct: 120 IDTLPATVDWRTKGAVTP----IKDQGQCGCCWAFSAVAAM 156 >UniRef50_Q7QRX3 Cluster: GLP_549_24108_24914; n=1; Giardia lamblia ATCC 50803|Rep: GLP_549_24108_24914 - Giardia lamblia ATCC 50803 Length = 268 Score = 39.5 bits (88), Expect = 0.080 Identities = 22/71 (30%), Positives = 30/71 (42%), Gaps = 1/71 (1%) Frame = +2 Query: 488 EDSKMHKKCESGYDVNYKQDKQYGKHVYTVSGDEDH-IRAELFKNGPVEGAFTVYSDLLS 664 +D+ C GY + K + Y + H I+ L GPV F +Y D L Sbjct: 154 DDTSCPLACSDGYALRKTSIKAF----YNIGHRNPHRIKEALVTEGPVATEFALYEDFLY 209 Query: 665 YKSGVYKHTQG 697 Y SG+Y H G Sbjct: 210 YGSGIYHHVAG 220 >UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain; n=9; Cucujiformia|Rep: Digestive cysteine proteinase intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 326 Score = 39.5 bits (88), Expect = 0.080 Identities = 21/60 (35%), Positives = 29/60 (48%), Gaps = 2/60 (3%) Frame = +1 Query: 64 KDEHFATLPIKTHKIDLIASLPENFDPRDK--WPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 KDE + K + +A PE + D W + +V+ QG CGSCWAF A A+ Sbjct: 84 KDELRRQIKTKPNVEATLAVFPEGLEVPDSIDWTQKGAVLDVKYQGGCGSCWAFSATGAL 143 >UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: Vivapain-4 - Plasmodium vivax Length = 484 Score = 39.5 bits (88), Expect = 0.080 Identities = 14/28 (50%), Positives = 21/28 (75%) Frame = +1 Query: 154 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 W + ++E+++Q CGSCWAFGAV A+ Sbjct: 268 WREHNAVSEIKNQNLCGSCWAFGAVGAV 295 >UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H-like cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L, S or H-like cysteine peptidase - Trichomonas vaginalis G3 Length = 473 Score = 39.5 bits (88), Expect = 0.080 Identities = 15/33 (45%), Positives = 22/33 (66%), Gaps = 1/33 (3%) Frame = +1 Query: 154 WPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRV 249 W D P + + RDQ +CGSCWAFG E++ ++ Sbjct: 257 WRDVPNVVGKPRDQVACGSCWAFGTAESLESQL 289 >UniRef50_P82473 Cluster: Cysteine 
proteinase GP-I; n=27; Eukaryota|Rep: Cysteine proteinase GP-I - Zingiber officinale (Ginger) Length = 221 Score = 39.5 bits (88), Expect = 0.080 Identities = 18/38 (47%), Positives = 25/38 (65%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 LP++ D R+K P V++QG CGSCWAF A+ A+ Sbjct: 3 LPDSIDWREKGAVVP----VKNQGGCGSCWAFDAIAAV 36 >UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18; Magnoliophyta|Rep: Thiol protease aleurain precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 358 Score = 39.5 bits (88), Expect = 0.080 Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 3/49 (6%) Frame = +1 Query: 97 THKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF---GAVEA 234 +HK+ A+LPE D W + ++ V+DQG CGSCW F GA+EA Sbjct: 133 SHKVTE-AALPETKD----WREDGIVSPVKDQGGCGSCWTFSTTGALEA 176 >UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin W, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to Cathepsin W, partial - Ornithorhynchus anatinus Length = 229 Score = 39.1 bits (87), Expect = 0.11 Identities = 18/40 (45%), Positives = 25/40 (62%), Gaps = 2/40 (5%) Frame = +1 Query: 115 IASLPENFDPRDK--WPDCPTLNEVRDQGSCGSCWAFGAV 228 +AS+PE ++ W + V++QGSCGSCWAF AV Sbjct: 59 MASIPEGPLRKETCDWRKRGAITSVKNQGSCGSCWAFAAV 98 >UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: hypothetical protein - Strongylocentrotus purpuratus Length = 331 Score = 39.1 bits (87), Expect = 0.11 Identities = 17/45 (37%), Positives = 27/45 (60%) Frame = +1 Query: 115 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 249 + ++P +D R P P + V++Q SCG+CWAF VE M ++ Sbjct: 124 LKTMPLVYDLRSIKP--PVVTPVKNQKSCGACWAFSVVETMETQI 166 >UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba culbertsoni|Rep: Cysteine proteinase - Acanthamoeba culbertsoni Length = 482 Score = 39.1 bits (87), Expect = 0.11 Identities = 22/43 (51%), Positives = 27/43 (62%), Gaps = 3/43 (6%) Frame = +1 Query: 118 
ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF---GAVEAM 237 AS+P N+D R K P V++QGSC SCWAF GAVE + Sbjct: 154 ASIPANWDWRTKGAVTP----VKNQGSCASCWAFVATGAVEGV 192 >UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lamblia ATCC 50803|Rep: GLP_163_69918_68548 - Giardia lamblia ATCC 50803 Length = 456 Score = 39.1 bits (87), Expect = 0.11 Identities = 17/44 (38%), Positives = 25/44 (56%) Frame = +1 Query: 97 THKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 228 T + + +P ++D R+ P V+DQG CGSCWAFG + Sbjct: 68 TDPLSTLPEIPTSYDLREAGLQVP----VKDQGVCGSCWAFGTM 107 >UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine protease; n=11; Callosobruchus maculatus|Rep: Putative gut cathepsin L-like cysteine protease - Callosobruchus maculatus (Southern cowpea weevil) (Pulse bruchid) Length = 326 Score = 39.1 bits (87), Expect = 0.11 Identities = 21/73 (28%), Positives = 34/73 (46%), Gaps = 2/73 (2%) Frame = +1 Query: 25 TSFAHL--KKIMGVIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGS 198 T FA + ++ + ++K + LP D + W + + V+DQ + Sbjct: 73 TQFADMTHEEFLDLLKLQGVPALPSNAVHFDNFEDIDMEEKDAVDWREEGAVTPVKDQAN 132 Query: 199 CGSCWAFGAVEAM 237 CGSCWAF AV A+ Sbjct: 133 CGSCWAFSAVGAI 145 >UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1; Brugia malayi|Rep: Cathepsin F-like cysteine proteinase - Brugia malayi (Filarial nematode worm) Length = 461 Score = 39.1 bits (87), Expect = 0.11 Identities = 18/35 (51%), Positives = 21/35 (60%) Frame = +1 Query: 115 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 219 I +LP FD W + V+DQGSCGSCWAF Sbjct: 245 IYNLPSKFD----WRTEGVVTPVKDQGSCGSCWAF 275 >UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; n=16; Chrysomelidae|Rep: Digestive cysteine protease intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 326 Score = 39.1 bits (87), Expect = 0.11 Identities = 17/39 (43%), Positives = 23/39 (58%), Gaps = 2/39 (5%) Frame = +1 Query: 127 PENFDPRDK--WPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 PE+ + D W + + EV+DQ CGSCWAF A A+ Sbjct: 105 
PEDLEVPDSIDWTEKGAVLEVKDQNPCGSCWAFSATGAL 143 >UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 291 Score = 39.1 bits (87), Expect = 0.11 Identities = 15/33 (45%), Positives = 19/33 (57%) Frame = +1 Query: 172 LNEVRDQGSCGSCWAFGAVEAMTDRVCTILTEL 270 +N +RDQ CGSCWAFG V A + + L Sbjct: 90 VNPIRDQKQCGSCWAFGTVAACESNYALLYSNL 122 >UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; Methanospirillum hungatei JF-1|Rep: Peptidase C1A, papain precursor - Methanospirillum hungatei (strain JF-1 / DSM 864) Length = 1096 Score = 39.1 bits (87), Expect = 0.11 Identities = 18/37 (48%), Positives = 23/37 (62%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEA 234 LP +FD R+ D T +++QGSCGSCWAF A Sbjct: 321 LPTSFDWRNNGGDYTT--PIKNQGSCGSCWAFATTGA 355 >UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa zeasingle nucleocapsid nuclear polyhedrosis virus) Length = 367 Score = 39.1 bits (87), Expect = 0.11 Identities = 16/35 (45%), Positives = 22/35 (62%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 228 LP+ +D W D + ++DQG CGSCWAF A+ Sbjct: 156 LPDYYD----WRDTNKVTPIKDQGVCGSCWAFVAI 186 >UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 protein, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to MGC81823 protein, partial - Ornithorhynchus anatinus Length = 361 Score = 38.7 bits (86), Expect = 0.14 Identities = 14/24 (58%), Positives = 17/24 (70%) Frame = +1 Query: 154 WPDCPTLNEVRDQGSCGSCWAFGA 225 W D + V+DQG CGSCWAFG+ Sbjct: 196 WRDHGYVTPVKDQGRCGSCWAFGS 219 >UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Cathepsin K - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 333 Score = 38.7 bits (86), Expect = 0.14 Identities = 18/43 (41%), Positives = 26/43 (60%) Frame = 
+1 Query: 109 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 D + LP++ D R + V++QGSCGSCWAF +V A+ Sbjct: 113 DRVGKLPKSIDYRK----LGYVTSVKNQGSCGSCWAFSSVGAL 151 >UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa|Rep: Os09g0497500 protein - Oryza sativa subsp. japonica (Rice) Length = 349 Score = 38.7 bits (86), Expect = 0.14 Identities = 19/38 (50%), Positives = 25/38 (65%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 LP++ D R K + EV++QG CGSCWAF AV A+ Sbjct: 122 LPKSVDWRKKG----AVVEVKNQGDCGSCWAFSAVAAI 155 >UniRef50_Q24FN2 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 367 Score = 38.7 bits (86), Expect = 0.14 Identities = 14/25 (56%), Positives = 18/25 (72%) Frame = +1 Query: 154 WPDCPTLNEVRDQGSCGSCWAFGAV 228 W ++ V++QGSCGSCWAF AV Sbjct: 161 WRQSGAVSPVKNQGSCGSCWAFSAV 185 >UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathepsin o - Aedes aegypti (Yellowfever mosquito) Length = 375 Score = 38.7 bits (86), Expect = 0.14 Identities = 19/45 (42%), Positives = 26/45 (57%) Frame = +1 Query: 106 IDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMT 240 + ++ LP+ D RDK P VR QGSCG+CWA V+ +T Sbjct: 147 LKILDYLPKVVDWRDKGVVAP----VRSQGSCGACWAISVVDTIT 187 >UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-like cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L or H-like cysteine peptidase - Trichomonas vaginalis G3 Length = 435 Score = 38.7 bits (86), Expect = 0.14 Identities = 19/52 (36%), Positives = 29/52 (55%), Gaps = 1/52 (1%) Frame = +1 Query: 97 THKIDLIASLPENFDPRDKWPDCPTLNEV-RDQGSCGSCWAFGAVEAMTDRV 249 T ID LPE+F W + P + + RDQ +CGSCWA A +++ ++ Sbjct: 204 TKHIDFKGDLPESFS----WRNLPNVVAMPRDQANCGSCWAQAAATSISSQI 251 >UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; Leishmania|Rep: Cysteine proteinase 1 precursor - Leishmania pifanoi Length = 
354 Score = 38.7 bits (86), Expect = 0.14 Identities = 20/48 (41%), Positives = 26/48 (54%), Gaps = 2/48 (4%) Frame = +1 Query: 91 IKTHKIDLIA--SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 228 +K HK D+ S P D W D + V++QG CGSCWAF A+ Sbjct: 113 LKDHKEDVHVDDSAPSGVMSVD-WRDKGAVTPVKNQGLCGSCWAFSAI 159 >UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 precursor; n=4; Schizophora|Rep: Putative cysteine proteinase CG12163 precursor - Drosophila melanogaster (Fruit fly) Length = 614 Score = 38.7 bits (86), Expect = 0.14 Identities = 17/32 (53%), Positives = 22/32 (68%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 219 LP+ FD R K + +V++QGSCGSCWAF Sbjct: 394 LPKEFDWRQK----DAVTQVKNQGSCGSCWAF 421 >UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleostomi|Rep: Cathepsin O precursor - Homo sapiens (Human) Length = 321 Score = 38.7 bits (86), Expect = 0.14 Identities = 22/58 (37%), Positives = 27/58 (46%) Frame = +1 Query: 64 KDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 K F + H SLP FD RDK + +VR+Q CG CWAF V A+ Sbjct: 88 KPSKFPRYSAEVHMSIPNVSLPLRFDWRDK----QVVTQVRNQQMCGGCWAFSVVGAV 141 >UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma|Rep: Cathepsin C precursor - Schistosoma mansoni (Blood fluke) Length = 454 Score = 38.7 bits (86), Expect = 0.14 Identities = 20/52 (38%), Positives = 25/52 (48%) Frame = +2 Query: 536 YKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHT 691 Y D Y Y + +E ++ EL NGP F VY D YK G+Y HT Sbjct: 331 YTTDYSYIGGYYGAT-NEKLMQLELISNGPFPVGFEVYEDFQFYKEGIYHHT 381 >UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 20 SCAF14744, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 175 Score = 38.3 bits (85), Expect = 0.19 Identities = 18/41 (43%), Positives = 23/41 (56%) Frame = +1 Query: 115 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 I LP FD W D + V++Q +CGSCWAF V A+ Sbjct: 56 IKGLPARFD----WRDNAVVGPVQNQQACGSCWAFSVVGAV 
92 >UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4; core eudicotyledons|Rep: Papain-like cysteine peptidase XBCP3 - Arabidopsis thaliana (Mouse-ear cress) Length = 437 Score = 38.3 bits (85), Expect = 0.19 Identities = 14/28 (50%), Positives = 18/28 (64%) Frame = +1 Query: 154 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 W + V+DQGSCG+CW+F A AM Sbjct: 124 WRKKGAVTNVKDQGSCGACWSFSATGAM 151 >UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativa|Rep: Os01g0347600 protein - Oryza sativa subsp. japonica (Rice) Length = 343 Score = 38.3 bits (85), Expect = 0.19 Identities = 14/19 (73%), Positives = 17/19 (89%) Frame = +1 Query: 181 VRDQGSCGSCWAFGAVEAM 237 V+DQG+CGSCWAF AV A+ Sbjct: 140 VKDQGACGSCWAFAAVAAI 158 >UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative uncharacterized protein - Oryza sativa subsp. japonica (Rice) Length = 289 Score = 38.3 bits (85), Expect = 0.19 Identities = 14/19 (73%), Positives = 17/19 (89%) Frame = +1 Query: 181 VRDQGSCGSCWAFGAVEAM 237 V+DQG+CGSCWAF AV A+ Sbjct: 139 VKDQGACGSCWAFAAVAAI 157 >UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - Drosophila melanogaster (Fruit fly) Length = 431 Score = 38.3 bits (85), Expect = 0.19 Identities = 21/61 (34%), Positives = 32/61 (52%), Gaps = 1/61 (1%) Frame = +2 Query: 530 VNYKQDKQYGKH-VYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQGDVS 706 VN +D Y Y+++ + D I AE+F +GPV+ V D +Y GVY+ T + Sbjct: 303 VNVDRDSLYTVGPAYSLNREAD-IMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRK 361 Query: 707 A 709 A Sbjct: 362 A 362 Score = 37.5 bits (83), Expect = 0.32 Identities = 16/41 (39%), Positives = 22/41 (53%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 246 LP +F+ DKW ++EV DQG CG+ W +DR Sbjct: 187 LPSSFNALDKWSSY--ISEVPDQGWCGASWVLSTTSVASDR 225 >UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1; Uronema marinum|Rep: Cathepsin L-like cysteine protease - Uronema marinum Length = 333 Score = 38.3 bits (85), 
Expect = 0.19 Identities = 21/53 (39%), Positives = 26/53 (49%) Frame = +3 Query: 243 QSMYYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSY 401 + +Y N K FS + L+SC P GC GG P A+ Y GL S SY Sbjct: 154 ERLYKINTGKLLSFSEQQLVSCEPK-SYGCDGGWPEAAFAYSATHGLESSASY 205 Score = 34.3 bits (75), Expect = 3.0 Identities = 13/25 (52%), Positives = 16/25 (64%) Frame = +1 Query: 154 WPDCPTLNEVRDQGSCGSCWAFGAV 228 W + V++QG CGSCWAF AV Sbjct: 126 WVSKGAVQGVQNQGVCGSCWAFSAV 150 >UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_46, whole genome shotgun sequence - Paramecium tetraurelia Length = 336 Score = 38.3 bits (85), Expect = 0.19 Identities = 19/43 (44%), Positives = 23/43 (53%) Frame = +1 Query: 106 IDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEA 234 ID + EN D D + +V+DQG C CWAFGAV A Sbjct: 130 IDELQKTQEN-DKTINSVDWRKITQVKDQGQCSGCWAFGAVGA 171 >UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; Theileria|Rep: Cysteine proteinase precursor - Theileria parva Length = 440 Score = 38.3 bits (85), Expect = 0.19 Identities = 18/49 (36%), Positives = 27/49 (55%) Frame = +3 Query: 243 QSMYYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVS 389 + Y S+ K + S ++LL C GC GG+ A+EY + +GLVS Sbjct: 263 EGYYMSHFDKSYELSVQELLDCDSFSN-GCQGGLLESAYEYVRKYGLVS 310 Score = 36.3 bits (80), Expect = 0.75 Identities = 16/41 (39%), Positives = 21/41 (51%) Frame = +1 Query: 106 IDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 228 +DL EN D W ++ V+DQ +CG CWAF V Sbjct: 223 VDLAKLTGENLD----WRRSSSVTSVKDQSNCGGCWAFSTV 259 >UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain] - Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina) Length = 339 Score = 38.3 bits (85), Expect = 0.19 Identities = 18/43 (41%), Positives = 24/43 (55%), Gaps 
= 3/43 (6%) Frame = +1 Query: 154 WPDCPTLNEVRDQGSCGSCWAF---GAVEAMTDRVCTILTELN 273 W + + V+DQG CGSCWAF GA+E R +L L+ Sbjct: 128 WREHGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLS 170 >UniRef50_UPI00006CBAC7 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 497 Score = 37.9 bits (84), Expect = 0.25 Identities = 19/45 (42%), Positives = 26/45 (57%) Frame = +2 Query: 548 KQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVY 682 +QYGK G+E + E+ KNGP+ F +D + YKSGVY Sbjct: 367 QQYGK------GNEREMMLEIMKNGPIVANFKTSADFVYYKSGVY 405 >UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MGC107932 protein - Xenopus tropicalis (Western clawed frog) (Silurana tropicalis) Length = 333 Score = 37.9 bits (84), Expect = 0.25 Identities = 21/63 (33%), Positives = 30/63 (47%), Gaps = 2/63 (3%) Frame = +1 Query: 88 PIKTHKIDLIA-SLPENFDPRDKWPDCPTLNEVRDQGS-CGSCWAFGAVEAMTDRVCTIL 261 P+K + ++P+ D W + V++QG+ CGSCWAF V M R C Sbjct: 102 PVKAESYSYTSITIPKEVD----WRKSNCVTPVKNQGTFCGSCWAFATVGVMESRYCIRT 157 Query: 262 TEL 270 EL Sbjct: 158 KEL 160 >UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; Roseiflexus|Rep: Peptidase C1A, papain precursor - Roseiflexus sp. 
RS-1 Length = 1202 Score = 37.9 bits (84), Expect = 0.25 Identities = 18/43 (41%), Positives = 24/43 (55%), Gaps = 3/43 (6%) Frame = +1 Query: 154 WPDCPTLNEVRDQGSCGSCWAF---GAVEAMTDRVCTILTELN 273 W D V+DQG CGSCWAF G VE+ R+ + +L+ Sbjct: 175 WCDQGACTPVKDQGVCGSCWAFATTGVVESALKRIDGVERDLS 217 >UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 2 - Rhipicephalus appendiculatus (Brown ear tick) Length = 564 Score = 37.9 bits (84), Expect = 0.25 Identities = 20/55 (36%), Positives = 25/55 (45%) Frame = +1 Query: 64 KDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 228 KD P H+ A LP+ D W + V+DQ CGSCW+FG V Sbjct: 327 KDGSSRAEPFPRHRFT--AKLPDQID----WRPYGAVTPVKDQAVCGSCWSFGTV 375 >UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 395 Score = 37.9 bits (84), Expect = 0.25 Identities = 16/31 (51%), Positives = 19/31 (61%) Frame = +1 Query: 154 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 246 W D T VRDQG C SCW FG++ A+ R Sbjct: 194 WSDYQT--PVRDQGECKSCWVFGSLAALESR 222 >UniRef50_Q22DX2 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 358 Score = 37.9 bits (84), Expect = 0.25 Identities = 20/41 (48%), Positives = 25/41 (60%), Gaps = 3/41 (7%) Frame = +1 Query: 121 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF---GAVEA 234 S+P ++D R P L V +QG CGSCWAF GAVE+ Sbjct: 146 SIPSSWDIRTDGPGL--LQPVENQGQCGSCWAFSTSGAVES 184 >UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Plasmodium (Vinckeia)|Rep: Cysteine proteinase precursor - Plasmodium vinckei Length = 506 Score = 37.9 bits (84), Expect = 0.25 Identities = 26/82 (31%), Positives = 43/82 (52%), Gaps = 9/82 (10%) Frame = +1 Query: 10 NFPRDTSFAHLKKIMGVIKD-EHFATLPIKTH--KIDLIA------SLPENFDPRDKWPD 162 +F ++ + KK++ V D + +P+K H 
+LI+ P++ D R K+ Sbjct: 216 DFSKEEFDNYFKKLLSVPMDLKSKYIVPLKKHLANTNLISVDNKSKDFPDSRDYRSKFNF 275 Query: 163 CPTLNEVRDQGSCGSCWAFGAV 228 P +DQG+CGSCWAF A+ Sbjct: 276 LPP----KDQGNCGSCWAFAAI 293 >UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 37.5 bits (83), Expect = 0.32 Identities = 17/38 (44%), Positives = 24/38 (63%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 LPE+ D W ++ VRDQG+CGSC+AF + A+ Sbjct: 127 LPESVD----WRKLGAVSPVRDQGNCGSCYAFASTGAL 160 >UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel protein - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 328 Score = 37.5 bits (83), Expect = 0.32 Identities = 14/31 (45%), Positives = 21/31 (67%) Frame = +1 Query: 145 RDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 R W + ++ V++QG CGSCWAF AV ++ Sbjct: 116 RVNWTEHGMVSPVQNQGPCGSCWAFSAVGSL 146 >UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin L - Misgurnus mizolepis (Mud loach) Length = 337 Score = 37.5 bits (83), Expect = 0.32 Identities = 14/28 (50%), Positives = 17/28 (60%) Frame = +1 Query: 154 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 W + + V+DQG CGSCWAF AM Sbjct: 122 WREKGYVTPVKDQGECGSCWAFSTTGAM 149 >UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber officinale (Ginger) Length = 475 Score = 37.5 bits (83), Expect = 0.32 Identities = 17/38 (44%), Positives = 25/38 (65%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 LP++ D R+K + V++QG CGSCWAF A+ A+ Sbjct: 143 LPDSIDWREKG----AVVAVKNQGRCGSCWAFAAIAAV 176 >UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra erinaceieuropaei|Rep: Cysteine proteinase - Spirometra erinaceieuropaei (Tapeworm) Length = 336 Score = 37.5 bits (83), Expect = 0.32 Identities = 14/34 (41%), Positives = 19/34 (55%) Frame = +1 Query: 124 
LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGA 225 L EN W + + V++QG CGSCW+F A Sbjct: 117 LKENLPDSVNWRERGAVTSVKNQGQCGSCWSFSA 150 >UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep: Viral cathepsin - Xestia c-nigrum granulosis virus (XnGV) (Xestia c-nigrumgranulovirus) Length = 346 Score = 37.5 bits (83), Expect = 0.32 Identities = 17/40 (42%), Positives = 23/40 (57%) Frame = +1 Query: 109 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 228 D +P++FD W D ++ V+ Q CGSCWAF AV Sbjct: 128 DSSGKVPDSFD----WRDRNSVTSVKMQKECGSCWAFSAV 163 >UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata|Rep: Cathepsin L2 precursor - Homo sapiens (Human) Length = 334 Score = 37.5 bits (83), Expect = 0.32 Identities = 23/72 (31%), Positives = 36/72 (50%) Frame = +1 Query: 22 DTSFAHLKKIMGVIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSC 201 D + +++MG +++ F K + L LP++ D R K P V++Q C Sbjct: 82 DMTNEEFRQMMGCFRNQKFRKG--KVFREPLFLDLPKSVDWRKKGYVTP----VKNQKQC 135 Query: 202 GSCWAFGAVEAM 237 GSCWAF A A+ Sbjct: 136 GSCWAFSATGAL 147 >UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium tetraurelia|Rep: Cathepsin L1 precursor - Paramecium tetraurelia Length = 314 Score = 37.5 bits (83), Expect = 0.32 Identities = 14/19 (73%), Positives = 17/19 (89%) Frame = +1 Query: 181 VRDQGSCGSCWAFGAVEAM 237 V++QGSCGSCWAF AV A+ Sbjct: 126 VKNQGSCGSCWAFSAVGAL 144 >UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin l; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin l - Strongylocentrotus purpuratus Length = 489 Score = 37.1 bits (82), Expect = 0.43 Identities = 24/81 (29%), Positives = 38/81 (46%), Gaps = 1/81 (1%) Frame = +1 Query: 10 NFPRDTSFAHLKKIMGVIKDEHFAT-LPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVR 186 N D S LK++ G ++ LP + A +P++ D W ++ V+ Sbjct: 229 NHMADQSHQELKRMRGRLRQTRPNNGLPYDGSDVSDDA-VPDHID----WNVLGAVSPVK 283 Query: 187 DQGSCGSCWAFGAVEAMTDRV 249 DQ CGSCW+FG+ E + V Sbjct: 284 DQAVCGSCWSFGSAETIEGAV 304 >UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin 
K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2); n=2; Tribolium castaneum|Rep: PREDICTED: similar to Cathepsin K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2) - Tribolium castaneum Length = 332 Score = 37.1 bits (82), Expect = 0.43 Identities = 16/50 (32%), Positives = 24/50 (48%) Frame = +1 Query: 88 PIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 P+ + L+ SL W + V++QG CGSCWAF + A+ Sbjct: 102 PLNETEDPLLPSLGRGISASLDWRQRGGVTPVKNQGQCGSCWAFATIGAI 151 >UniRef50_A6GBK8 Cluster: Peptidase C1A, papain; n=1; Plesiocystis pacifica SIR-1|Rep: Peptidase C1A, papain - Plesiocystis pacifica SIR-1 Length = 650 Score = 37.1 bits (82), Expect = 0.43 Identities = 13/22 (59%), Positives = 17/22 (77%) Frame = +1 Query: 172 LNEVRDQGSCGSCWAFGAVEAM 237 L +R+QG+CGSCWAF AV + Sbjct: 176 LGAIRNQGACGSCWAFAAVSTI 197 >UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sativa|Rep: Cysteine proteinase-like - Oryza sativa subsp. japonica (Rice) Length = 360 Score = 37.1 bits (82), Expect = 0.43 Identities = 15/27 (55%), Positives = 18/27 (66%) Frame = +1 Query: 154 WPDCPTLNEVRDQGSCGSCWAFGAVEA 234 W + EV++Q SCGSCWAF AV A Sbjct: 143 WRARGAVTEVKNQRSCGSCWAFAAVAA 169 >UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Liliopsida|Rep: Putative cysteine proteinase - Oryza sativa subsp. 
japonica (Rice) Length = 416 Score = 37.1 bits (82), Expect = 0.43 Identities = 13/22 (59%), Positives = 17/22 (77%) Frame = +1 Query: 172 LNEVRDQGSCGSCWAFGAVEAM 237 + +V+DQG CGSCW F AV A+ Sbjct: 126 VTDVKDQGQCGSCWVFSAVGAV 147 >UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein A; n=2; Dictyostelium discoideum|Rep: Gamete and mating-type specific protein A - Dictyostelium discoideum (Slime mold) Length = 448 Score = 37.1 bits (82), Expect = 0.43 Identities = 13/22 (59%), Positives = 16/22 (72%) Frame = +1 Query: 181 VRDQGSCGSCWAFGAVEAMTDR 246 +RDQG CGSCWAF + A+ R Sbjct: 253 IRDQGQCGSCWAFASSAALESR 274 >UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lamblia ATCC 50803|Rep: GLP_26_47548_45815 - Giardia lamblia ATCC 50803 Length = 577 Score = 37.1 bits (82), Expect = 0.43 Identities = 16/42 (38%), Positives = 23/42 (54%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 249 LP+ D W +N +DQ +CGSCW FGA+ + R+ Sbjct: 344 LPQELD----WRVRGIMNMAKDQVACGSCWTFGAIGTIEGRI 381 >UniRef50_Q231X3 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 323 Score = 37.1 bits (82), Expect = 0.43 Identities = 12/28 (42%), Positives = 19/28 (67%) Frame = +1 Query: 154 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 W + ++ V+ QG+CGSCWAF A ++ Sbjct: 121 WVEAGKVSNVKSQGNCGSCWAFSATASV 148 >UniRef50_Q18740 Cluster: Putative uncharacterized protein tag-329; n=2; Caenorhabditis|Rep: Putative uncharacterized protein tag-329 - Caenorhabditis elegans Length = 374 Score = 37.1 bits (82), Expect = 0.43 Identities = 17/44 (38%), Positives = 22/44 (50%) Frame = +3 Query: 270 KHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSY 401 K + S +++ C P G GC+GG P EY K GL G Y Sbjct: 188 KAMNLSEQEVCDCAPKHGPGCNGGDPVDGLEYIKEMGLTGGKEY 231 >UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platyhelminthes|Rep: Cathepsin L-like proteinase - Echinococcus 
multilocularis Length = 338 Score = 37.1 bits (82), Expect = 0.43 Identities = 17/38 (44%), Positives = 23/38 (60%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 +P++ D R K P ++DQG CGSCWAF A A+ Sbjct: 122 VPDSIDWRKKGLVTP----IKDQGDCGSCWAFSATGAL 155 >UniRef50_O02470 Cluster: Cysteine proteinase; n=2; Chromadorea|Rep: Cysteine proteinase - Globodera pallida Length = 53 Score = 37.1 bits (82), Expect = 0.43 Identities = 12/21 (57%), Positives = 14/21 (66%) Frame = +1 Query: 190 QGSCGSCWAFGAVEAMTDRVC 252 QG CG CWAF E ++DR C Sbjct: 1 QGQCGRCWAFSTAEVISDRTC 21 Score = 35.5 bits (78), Expect = 1.3 Identities = 16/29 (55%), Positives = 20/29 (68%), Gaps = 1/29 (3%) Frame = +3 Query: 258 SNGTKHFHFSAEDLLSCCPI-CGLGCSGG 341 SNGT+ S DLL+CC + CG GC+GG Sbjct: 24 SNGTQQPIISPTDLLTCCGMSCGEGCNGG 52 >UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 392 Score = 37.1 bits (82), Expect = 0.43 Identities = 16/30 (53%), Positives = 20/30 (66%), Gaps = 3/30 (10%) Frame = +1 Query: 154 WPDCPTLNEVRDQGSCGSCWAF---GAVEA 234 W + +N + QG+CGSCWAF GAVEA Sbjct: 183 WRNYGAVNPAKGQGTCGSCWAFATAGAVEA 212 >UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 452 Score = 37.1 bits (82), Expect = 0.43 Identities = 23/59 (38%), Positives = 32/59 (54%), Gaps = 1/59 (1%) Frame = +1 Query: 64 KDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEV-RDQGSCGSCWAFGAVEAM 237 K + T P K+ I +LPE+F W + P + E DQ CG+C+AFGA EA+ Sbjct: 207 KGSNAETCPTYDQKV--IQNLPESFS----WRNVPYVLEYPHDQAVCGTCFAFGASEAI 259 >UniRef50_A2E6N1 Cluster: Clan CA, family C1, cathepsin B-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin B-like cysteine peptidase - Trichomonas vaginalis G3 Length = 255 Score = 37.1 bits (82), Expect = 0.43 Identities = 16/59 (27%), Positives = 
33/59 (55%) Frame = +1 Query: 76 FATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 F I++ D+ +P+ ++ ++P C L + + CG C+A+G ++AM+ R+C Sbjct: 15 FVDESIRSFPEDISIDIPDEYNFLQEYPHCD-LGPLTQE--CGCCYAYGPIKAMSHRIC 70 >UniRef50_A0D5R4 Cluster: Chromosome undetermined scaffold_39, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_39, whole genome shotgun sequence - Paramecium tetraurelia Length = 133 Score = 37.1 bits (82), Expect = 0.43 Identities = 18/39 (46%), Positives = 24/39 (61%) Frame = +1 Query: 121 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 SLP++ D +D V++QGSCGSCWAF A A+ Sbjct: 92 SLPDSVDSKDGLT-------VKNQGSCGSCWAFAAAAAL 123 >UniRef50_A0CGA3 Cluster: Chromosome undetermined scaffold_179, whole genome shotgun sequence; n=3; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_179, whole genome shotgun sequence - Paramecium tetraurelia Length = 339 Score = 37.1 bits (82), Expect = 0.43 Identities = 15/50 (30%), Positives = 26/50 (52%) Frame = +2 Query: 548 KQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQG 697 ++Y Y ++D I+ ++ GPV VY D L Y+ G+Y+ +G Sbjct: 231 QRYKAESYCQLQNKDDIKRDILNKGPVVAIIPVYKDFLIYRDGIYQVLEG 280 Score = 34.3 bits (75), Expect = 3.0 Identities = 14/65 (21%), Positives = 35/65 (53%) Frame = +1 Query: 58 VIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 + K++ + ++ K+ + P ++ ++ +P C ++V +QG+C S ++ + Sbjct: 103 LFKNDFTQQINVEKCKLSFMDETPVYYNFKEAYPQCN--HQVYNQGNCSSSYSIAVSSSF 160 Query: 238 TDRVC 252 +DRVC Sbjct: 161 SDRVC 165 >UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue of Sarcophaga 26,29kDa proteinase; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to homologue of Sarcophaga 26,29kDa proteinase - Nasonia vitripennis Length = 553 Score = 36.7 bits (81), Expect = 0.57 Identities = 16/40 (40%), Positives = 23/40 (57%) Frame = +1 Query: 118 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 A +P++FD W + V+DQ CGSCW+FG A+ Sbjct: 332 
ADVPDSFD----WRLYGAVTPVKDQSVCGSCWSFGTTGAV 367 >UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; n=23; Magnoliophyta|Rep: Senescence-specific cysteine protease - Arabidopsis thaliana (Mouse-ear cress) Length = 346 Score = 36.7 bits (81), Expect = 0.57 Identities = 18/39 (46%), Positives = 24/39 (61%) Frame = +1 Query: 121 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 +LP + D R K P +++QGSCG CWAF AV A+ Sbjct: 129 ALPVSVDWRKKGAVTP----IKNQGSCGCCWAFSAVAAI 163 >UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa (japonica cultivar-group)|Rep: Os09g0562700 protein - Oryza sativa subsp. japonica (Rice) Length = 235 Score = 36.7 bits (81), Expect = 0.57 Identities = 13/19 (68%), Positives = 15/19 (78%) Frame = +1 Query: 172 LNEVRDQGSCGSCWAFGAV 228 + EV+DQG CGSCWAF V Sbjct: 21 VTEVKDQGRCGSCWAFSTV 39 >UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like midgut cysteine proteinase - Tenebrio molitor (Yellow mealworm) Length = 330 Score = 36.7 bits (81), Expect = 0.57 Identities = 20/41 (48%), Positives = 27/41 (65%), Gaps = 6/41 (14%) Frame = +1 Query: 172 LNEVRDQGSCGSCWAF---GAVE---AMTDRVCTILTELNI 276 ++EV+DQG CGSCW+F GAVE A+ T L+E N+ Sbjct: 128 VSEVKDQGQCGSCWSFSTTGAVEGQLALQRGRLTSLSEQNL 168 >UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intestinalis|Rep: GLP_90_15278_13989 - Giardia lamblia ATCC 50803 Length = 429 Score = 36.7 bits (81), Expect = 0.57 Identities = 22/49 (44%), Positives = 27/49 (55%) Frame = +1 Query: 88 PIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEA 234 PIK D +LP++ D R+ P VR+QG CGSCWAF V A Sbjct: 51 PIKVAAED---NLPQSVDLREYGLMTP----VRNQGKCGSCWAFATVAA 92 >UniRef50_Q7QVE9 Cluster: GLP_542_3431_1206; n=1; Giardia lamblia ATCC 50803|Rep: GLP_542_3431_1206 - Giardia lamblia ATCC 50803 Length = 741 Score = 36.7 bits (81), Expect = 0.57 Identities = 24/68 (35%), Positives = 34/68 (50%), Gaps = 1/68 (1%) Frame = +1 Query: 67 
DEHFATLPIKTHKIDLI-ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTD 243 ++ + LP DL A+LP NF R ++ +QGSCG C+A AVE +T Sbjct: 40 EDEYNELPDGPDNADLTRAALPTNFTYRGH-----RCIQIINQGSCGCCYAAAAVEMVTA 94 Query: 244 RVCTILTE 267 R C L + Sbjct: 95 RRCLQLND 102 >UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Toxopain-2 - Toxoplasma gondii Length = 422 Score = 36.7 bits (81), Expect = 0.57 Identities = 16/48 (33%), Positives = 21/48 (43%) Frame = +1 Query: 109 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVC 252 +L+ LP W + V+DQ CGSCWAF A+ C Sbjct: 196 ELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEGAHC 243 >UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma japonicum|Rep: SJCHGC06231 protein - Schistosoma japonicum (Blood fluke) Length = 372 Score = 36.7 bits (81), Expect = 0.57 Identities = 29/91 (31%), Positives = 38/91 (41%), Gaps = 3/91 (3%) Frame = +1 Query: 10 NFPRDTSFAHLKKIMGVIKDEHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRD 189 NF T + L+K+ G A T A LP+ D W + V++ Sbjct: 113 NFTDKTEY-ELRKLRGYRSACRIAKPKGSTFISSEHAKLPDRVD----WRRNGAVTPVKN 167 Query: 190 QGSCGSCWAF---GAVEAMTDRVCTILTELN 273 QG CGSCWAF GA+E R L L+ Sbjct: 168 QGQCGSCWAFSSTGAIEGQHYRKTNRLVNLS 198 >UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|Rep: CG4847-PD, isoform D - Drosophila melanogaster (Fruit fly) Length = 420 Score = 36.7 bits (81), Expect = 0.57 Identities = 21/53 (39%), Positives = 29/53 (54%), Gaps = 3/53 (5%) Frame = +1 Query: 124 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF---GAVEAMTDRVCTILTELN 273 +P+ FD W + + V+ QG+CGSCWAF GA+E T R L L+ Sbjct: 203 IPDAFD----WREHGGVTPVKFQGTCGSCWAFATTGAIEGHTFRKTGSLPNLS 251 >UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|Rep: Cruzipain precursor - Trypanosoma cruzi Length = 467 Score = 36.7 bits (81), Expect = 0.57 Identities = 13/25 (52%), Positives = 16/25 (64%) Frame = +1 Query: 154 WPDCPTLNEVRDQGSCGSCWAFGAV 228 W + V+DQG CGSCWAF A+ Sbjct: 129 WRARGAVTAVKDQGQCGSCWAFSAI 153 >UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|Rep: 
Cathepsin F precursor - Homo sapiens (Human) Length = 484 Score = 36.7 bits (81), Expect = 0.57 Identities = 19/60 (31%), Positives = 30/60 (50%), Gaps = 7/60 (11%) Frame = +1 Query: 61 IKDEHFATLPIKT-------HKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 219 + +E F T+ + T +K+ S+ + P W + +V+DQG CGSCWAF Sbjct: 239 LTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAF 298 >UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 360 Score = 36.3 bits (80), Expect = 0.75 Identities = 17/39 (43%), Positives = 23/39 (58%) Frame = +1 Query: 121 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 +LP +FD RDK P V+ Q CG CWAF V+++ Sbjct: 130 NLPASFDWRDKGAITP----VKVQNGCGGCWAFSTVQSI 164 >UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thaliana|Rep: F9P14.12 protein - Arabidopsis thaliana (Mouse-ear cress) Length = 343 Score = 36.3 bits (80), Expect = 0.75 Identities = 13/28 (46%), Positives = 17/28 (60%) Frame = +1 Query: 154 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 W + +R+QG CG CWAF AV A+ Sbjct: 133 WRTQGAVTPIRNQGKCGGCWAFSAVAAI 160 >UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|Rep: Phospholipase A1 - Tetrahymena thermophila Length = 320 Score = 36.3 bits (80), Expect = 0.75 Identities = 18/56 (32%), Positives = 29/56 (51%) Frame = +1 Query: 70 EHFATLPIKTHKIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 + F TL K + ++ + E + W + V++QGSCGSCWAF + A+ Sbjct: 92 QQFLTLHEKVNSTEVYRAQGEATEV--DWTAKGKVTPVKNQGSCGSCWAFSTIGAV 145 >UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcoptes scabiei type hominis|Rep: Cathepsin L-like protease - Sarcoptes scabiei type hominis Length = 245 Score = 36.3 bits (80), Expect = 0.75 Identities = 16/41 (39%), Positives = 23/41 (56%) Frame = +1 Query: 115 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 ++ LP+ D W + ++DQ CGSCWAF AV +M Sbjct: 117 
VSDLPDEVD----WTLKNVVAPIKDQKQCGSCWAFSAVASM 153 >UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba histolytica|Rep: Cysteine protease 17 - Entamoeba histolytica Length = 420 Score = 36.3 bits (80), Expect = 0.75 Identities = 18/48 (37%), Positives = 25/48 (52%) Frame = +1 Query: 103 KIDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 246 K D++ LPE D R L +R+Q CG CW+F +V A+ R Sbjct: 160 KKDIVKELPEGIDFRK----FGKLTYIREQTGCGGCWSFASVCALESR 203 >UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 4 - Rhipicephalus appendiculatus (Brown ear tick) Length = 345 Score = 36.3 bits (80), Expect = 0.75 Identities = 13/33 (39%), Positives = 21/33 (63%) Frame = +1 Query: 151 KWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 249 +W + + V++QG CGSCWAF + A+ +V Sbjct: 131 EWRENGFVTPVKNQGQCGSCWAFSSTGALEGQV 163 >UniRef50_Q7R0G3 Cluster: GLP_29_33036_32140; n=1; Giardia lamblia ATCC 50803|Rep: GLP_29_33036_32140 - Giardia lamblia ATCC 50803 Length = 298 Score = 36.3 bits (80), Expect = 0.75 Identities = 17/46 (36%), Positives = 23/46 (50%) Frame = +2 Query: 563 HVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQGD 700 H+Y G+ I L + GP+ VY DLL+Y G+Y T D Sbjct: 190 HIY--GGNATRIAELLMQKGPLYAELFVYKDLLTYHGGIYNRTSTD 233 >UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsin L - Suberites domuncula (Sponge) Length = 324 Score = 36.3 bits (80), Expect = 0.75 Identities = 12/28 (42%), Positives = 19/28 (67%) Frame = +1 Query: 154 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 W ++EV++QG CGSCW+F A ++ Sbjct: 114 WRQKGVVSEVKNQGQCGSCWSFSATGSL 141 >UniRef50_Q24HI2 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 356 Score = 36.3 bits (80), Expect = 0.75 Identities = 17/36 (47%), Positives = 21/36 (58%) Frame = +2 Query: 581 GDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKH 688 GDED ++ + GPV AF V D YKSGVY + Sbjct: 246 
GDEDQLKQAVGTVGPVSIAFQVMGDFKLYKSGVYSN 281 Score = 35.5 bits (78), Expect = 1.3 Identities = 14/30 (46%), Positives = 20/30 (66%), Gaps = 3/30 (10%) Frame = +1 Query: 154 WPDCPTLNEVRDQGSCGSCWAF---GAVEA 234 W D ++ V+DQ +CGSCW F GA+E+ Sbjct: 133 WKDLNKVSPVKDQQNCGSCWTFSTTGAIES 162 >UniRef50_Q23TW3 Cluster: Papain family cysteine protease containing protein; n=5; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 437 Score = 36.3 bits (80), Expect = 0.75 Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 1/46 (2%) Frame = +1 Query: 103 KIDLIASLPENFDPRDKWPDCPTLNEVRDQG-SCGSCWAFGAVEAM 237 K DL + LP+ D W + + +V+ QG CGSCWAF AV A+ Sbjct: 199 KYDL-SQLPQYVD----WREKGVVTQVKSQGKDCGSCWAFAAVAAL 239 >UniRef50_Q23H32 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 365 Score = 36.3 bits (80), Expect = 0.75 Identities = 18/39 (46%), Positives = 24/39 (61%) Frame = +1 Query: 121 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 S+PE+ D R+K + V+ QG CGSCWAF V A+ Sbjct: 134 SVPESVDWREK-----LVAPVQKQGGCGSCWAFSTVIAL 167 >UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schistosoma|Rep: Preprocathepsin cathepsin L - Schistosoma japonicum (Blood fluke) Length = 331 Score = 36.3 bits (80), Expect = 0.75 Identities = 14/28 (50%), Positives = 17/28 (60%) Frame = +1 Query: 154 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 W D + V+ QG CGSCWAF A A+ Sbjct: 122 WRDHGAVTAVKHQGLCGSCWAFSATGAI 149 >UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_36, whole genome shotgun sequence - Paramecium tetraurelia Length = 307 Score = 36.3 bits (80), Expect = 0.75 Identities = 11/22 (50%), Positives = 18/22 (81%) Frame = +1 Query: 172 LNEVRDQGSCGSCWAFGAVEAM 237 +N +++QG+CGSCW F A+ A+ Sbjct: 118 
MNPIKNQGNCGSCWTFSAIGAV 139 >UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_184, whole genome shotgun sequence - Paramecium tetraurelia Length = 331 Score = 36.3 bits (80), Expect = 0.75 Identities = 19/39 (48%), Positives = 23/39 (58%) Frame = +1 Query: 121 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 S P+ D W D T V++QGSCGSCWAF A A+ Sbjct: 117 SFPDTVD----WKDGLT---VKNQGSCGSCWAFAAAAAI 148 >UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 344 Score = 35.9 bits (79), Expect = 0.99 Identities = 13/36 (36%), Positives = 19/36 (52%) Frame = +1 Query: 130 ENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 237 +N P D W + + V+ QG CGSCW F + + Sbjct: 134 KNAPPMD-WRNASAITPVKQQGKCGSCWTFASTAVL 168 >UniRef50_Q3A8I3 Cluster: Putative serine protease; n=1; Pelobacter carbinolicus DSM 2380|Rep: Putative serine protease - Pelobacter carbinolicus (strain DSM 2380 / Gra Bd 1) Length = 1066 Score = 35.9 bits (79), Expect = 0.99 Identities = 17/39 (43%), Positives = 23/39 (58%) Frame = +1 Query: 118 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEA 234 A LP +FD R+ + VR+Q CGSCW+FG + A Sbjct: 22 ADLPSSFDLRNI-DGRSYIGPVRNQKKCGSCWSFGTLAA 59 Database: uniref50 Posted date: Oct 5, 2007 11:19 AM Number of letters in database: 575,637,011 Number of sequences in database: 1,657,284 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.279 0.0580 0.190 Matrix: BLOSUM62 Gap Penalties: Existence: 9, Extension: 2 Number of Hits to DB: 763,628,465 Number of Sequences: 1657284 Number of extensions: 16434455 Number of successful extensions: 50579 Number of sequences better than 10.0: 375 Number of HSP's better than 10.0 without gapping: 47609 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number 
of HSP's gapped (non-prelim): 50473
length of database: 575,637,011
effective HSP length: 98
effective length of database: 413,223,179
effective search space used: 57438021881
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)