BLASTX 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= I10A02NGRL0001_F06
         (589 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value

UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpw...   271   7e-72
UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin ...   244   1e-63
UniRef50_A1XG92 Cluster: Putative cathepsin B-like like proteina...   242   5e-63
UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Ca...   231   9e-60
UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome sh...   212   5e-54
UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=...   208   7e-53
UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase pr...   202   6e-51
UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=...   193   3e-48
UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep...   191   1e-47
UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 ...   189   4e-47
UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor; ...   187   1e-46
UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|...   187   1e-46
UniRef50_Q23FP9 Cluster: Papain family cysteine protease contain...   184   2e-45
UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase pr...   180   2e-44
UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n...   177   1e-43
UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n...   172   4e-42
UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Catheps...   172   6e-42
UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep: Ca...   169   4e-41
UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep: ...   167   2e-40
UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep: Ca...   165   7e-40
UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep...   163   2e-39
UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8....   161   1e-38
UniRef50_Q237A1 Cluster: Papain family cysteine protease contain...   157   2e-37
UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2 ...   157   2e-37
UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1 ...   155   5e-37
UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4 ...   153   3e-36
UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator ame...   153   4e-36
UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7; Rhabditida|...   153   4e-36
UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3 ...   151   9e-36
UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2...   151   2e-35
UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA...   146   3e-34
UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8; Leishma...   144   1e-33
UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoid...   144   2e-33
UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8....   143   2e-33
UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115, w...   143   3e-33
UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4; Ancylostomati...   142   4e-33
UniRef50_P91991 Cluster: Putative uncharacterized protein; n=2; ...   138   9e-32
UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7...   136   5e-31
UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8...   134   1e-30
UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin ...   134   2e-30
UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus co...   133   3e-30
UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyl...   133   3e-30
UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1; Rhip...   132   8e-30
UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep...   129   5e-29
UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep...   127   2e-28
UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gamb...   127   2e-28
UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: C...   120   3e-26
UniRef50_Q5DEC9 Cluster: SJCHGC06356 protein; n=1; Schistosoma j...   120   3e-26
UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2; ...   118   1e-25
UniRef50_Q00W63 Cluster: Cysteine proteinase; n=2; Ostreococcus|...   105   1e-21
UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|R...   101   1e-20
UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1; ...    96   6e-19
UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1; O...    92   1e-17
UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella ve...    92   1e-17
UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma j...    86   5e-16
UniRef50_A4RYN2 Cluster: Predicted protein; n=1; Ostreococcus lu...    85   2e-15
UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomus...    84   3e-15
UniRef50_Q625X3 Cluster: Putative uncharacterized protein CBG011...    79   6e-14
UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|R...    78   2e-13
UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|R...    75   9e-13
UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA,...    73   7e-12
UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Gi...    72   9e-12
UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like prote...    71   2e-11
UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;...    71   2e-11
UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Gi...    70   4e-11
UniRef50_UPI0000E4866E Cluster: PREDICTED: similar to oxidized-L...    69   6e-11
UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia...    68   2e-10
UniRef50_O02470 Cluster: Cysteine proteinase; n=2; Chromadorea|R...    68   2e-10
UniRef50_UPI0000E815AE Cluster: PREDICTED: similar to glucocorti...    67   2e-10
UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcop...    67   2e-10
UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy...    67   3e-10
UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p; ...    66   6e-10
UniRef50_Q9UJW2 Cluster: Tubulointerstitial nephritis antigen; n...    65   1e-09
UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n...    64   2e-09
UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;...    63   4e-09
UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru...    63   5e-09
UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ...    62   7e-09
UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - ...    62   7e-09
UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ...    62   7e-09
UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadida...    62   7e-09
UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-li...    62   7e-09
UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ...    62   7e-09
UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain...    62   9e-09
UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re...    62   1e-08
UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35...    62   1e-08
UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ...    61   2e-08
UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio...    61   2e-08
UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1...    61   2e-08
UniRef50_Q23RT7 Cluster: Papain family cysteine protease contain...    61   2e-08
UniRef50_Q24E33 Cluster: Papain family cysteine protease contain...    61   2e-08
UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr...    61   2e-08
UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=...    61   2e-08
UniRef50_Q9GZM7-2 Cluster: Isoform 2 of Q9GZM7 ; n=1; Homo sapie...    60   3e-08
UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen...    60   3e-08
UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2; cel...    60   3e-08
UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag...    60   4e-08
UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R...    60   4e-08
UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium...    60   4e-08
UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: ...    60   5e-08
UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy...    60   5e-08
UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ...    60   5e-08
UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali...    60   5e-08
UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; The...    60   5e-08
UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa...    59   7e-08
UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, wh...    59   7e-08
UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease ...    59   9e-08
UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e...    59   9e-08
UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18...    59   9e-08
UniRef50_Q23TW3 Cluster: Papain family cysteine protease contain...    58   1e-07
UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain...    58   1e-07
UniRef50_Q18740 Cluster: Putative uncharacterized protein tag-32...    58   1e-07
UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1...    58   2e-07
UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1; ...    58   2e-07
UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy...    58   2e-07
UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n...    58   2e-07
UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; The...    58   2e-07
UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG...    57   3e-07
UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted...    57   3e-07
UniRef50_Q235G6 Cluster: Papain family cysteine protease contain...    57   3e-07
UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, wh...    57   3e-07
UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lambl...    57   4e-07
UniRef50_Q23FS6 Cluster: Papain family cysteine protease contain...    57   4e-07
UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus...    57   4e-07
UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain ...    57   4e-07
UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er...    57   4e-07
UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Pl...    57   4e-07
UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto...    57   4e-07
UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin ...    56   5e-07
UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Try...    56   5e-07
UniRef50_Q24HI2 Cluster: Papain family cysteine protease contain...    56   5e-07
UniRef50_O16454 Cluster: Temporarily assigned gene name protein ...    56   5e-07
UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy...    56   5e-07
UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ...    56   5e-07
UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ...    56   6e-07
UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestin...    56   6e-07
UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=...    56   6e-07
UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease ...    56   8e-07
UniRef50_Q231X3 Cluster: Papain family cysteine protease contain...    56   8e-07
UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis...    56   8e-07
UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ...    56   8e-07
UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cy...    56   8e-07
UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, wh...    56   8e-07
UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;...    56   8e-07
UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;...    55   1e-06
UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big...    55   1e-06
UniRef50_Q23H32 Cluster: Papain family cysteine protease contain...    55   1e-06
UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, w...    55   1e-06
UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi...    55   1e-06
UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel...    55   1e-06
UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep...    55   1e-06
UniRef50_Q6XM39 Cluster: FirrV-1-A48 precursor; n=1; Feldmannia ...    55   1e-06
UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j...    55   1e-06
UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ...    55   1e-06
UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ...    54   2e-06
UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus...    54   2e-06
UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ...    54   2e-06
UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma...    54   2e-06
UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re...    54   3e-06
UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcop...    54   3e-06
UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foet...    54   3e-06
UniRef50_Q23H10 Cluster: Papain family cysteine protease contain...    54   3e-06
UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cy...    54   3e-06
UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh...    54   3e-06
UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; M...    54   3e-06
UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Pl...    54   3e-06
UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata...    54   3e-06
UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ...    53   4e-06
UniRef50_Q22W19 Cluster: Papain family cysteine protease contain...    53   4e-06
UniRef50_Q22DX2 Cluster: Papain family cysteine protease contain...    53   4e-06
UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:...    53   4e-06
UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease ...    53   6e-06
UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n...    53   6e-06
UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; ...    53   6e-06
UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Pir...    53   6e-06
UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ...    52   8e-06
UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3; ...    52   8e-06
UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n...    52   8e-06
UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhip...    52   8e-06
UniRef50_Q54VR1 Cluster: Putative uncharacterized protein; n=1; ...    52   8e-06
UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M...    52   8e-06
UniRef50_P81494 Cluster: Cathepsin B; n=2; Phasianidae|Rep: Cath...    52   8e-06
UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ...    52   1e-05
UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar ...    52   1e-05
UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca...    52   1e-05
UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip...    52   1e-05
UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb...    52   1e-05
UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n...    52   1e-05
UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Re...    52   1e-05
UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precurs...    52   1e-05
UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr...    52   1e-05
UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ...    52   1e-05
UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ...    52   1e-05
UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ...    52   1e-05
UniRef50_Q3L7L6 Cluster: Sar s 1 allergen Yv5032C08; n=1; Sarcop...    52   1e-05
UniRef50_Q23EG5 Cluster: Papain family cysteine protease contain...    52   1e-05
UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, wh...    52   1e-05
UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov...    52   1e-05
UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin ...    51   2e-05
UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio...    51   2e-05
UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum...    51   2e-05
UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathep...    51   2e-05
UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh...    51   2e-05
UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi...    51   2e-05
UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3....    51   2e-05
UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t...    51   2e-05
UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-...    51   2e-05
UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, wh...    51   2e-05
UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; ...    51   2e-05
UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C...    51   2e-05
UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia ...    50   3e-05
UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein...    50   3e-05
UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei...    50   3e-05
UniRef50_Q7QVE9 Cluster: GLP_542_3431_1206; n=1; Giardia lamblia...    50   3e-05
UniRef50_Q23H06 Cluster: Papain family cysteine protease contain...    50   3e-05
UniRef50_Q239L8 Cluster: Papain family cysteine protease contain...    50   3e-05
UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, who...    50   3e-05
UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcopt...    50   4e-05
UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia...    50   4e-05
UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4; Caenorhabdi...    50   4e-05
UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; ...    50   4e-05
UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryo...    50   4e-05
UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [C...    50   4e-05
UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph...    50   5e-05
UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt...    50   5e-05
UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ...    50   5e-05
UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain...    50   5e-05
UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;...    50   5e-05
UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain...    50   5e-05
UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic...    50   5e-05
UniRef50_UPI00006CB77B Cluster: Papain family cysteine protease ...    49   7e-05
UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae...    49   7e-05
UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain...    49   7e-05
UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like cy...    49   7e-05
UniRef50_A2DPZ6 Cluster: Clan CA, family C1, cathepsin B-like cy...    49   7e-05
UniRef50_A0CYS7 Cluster: Chromosome undetermined scaffold_31, wh...    49   7e-05
UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h...    49   9e-05
UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n...    49   9e-05
UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No...    49   9e-05
UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba hi...    49   9e-05
UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n...    49   9e-05
UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;...    49   9e-05
UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis...    49   9e-05
UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain...    49   9e-05
UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; ...    49   9e-05
UniRef50_A4VE98 Cluster: Cathepsin z; n=3; Tetrahymena thermophi...    49   9e-05
UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precu...    49   9e-05
UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:...    49   9e-05
UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine p...    48   1e-04
UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca...    48   1e-04
UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; ...    48   1e-04
UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G...    48   1e-04
UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3; Theil...    48   1e-04
UniRef50_Q22A69 Cluster: Papain family cysteine protease contain...    48   1e-04
UniRef50_A0DIY3 Cluster: Chromosome undetermined scaffold_52, wh...    48   1e-04
UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin ...    48   2e-04
UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat...    48   2e-04
UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schist...    48   2e-04
UniRef50_Q248G1 Cluster: Papain family cysteine protease contain...    48   2e-04
UniRef50_A0CGA3 Cluster: Chromosome undetermined scaffold_179, w...    48   2e-04
UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D...    48   2e-04
UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|...    48   2e-04
UniRef50_Q58HK5 Cluster: Hc58; n=1; Haemonchus contortus|Rep: Hc...    48   2e-04
UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep...    48   2e-04
UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3...    48   2e-04
UniRef50_Q649T1 Cluster: Cathepsin C; n=1; uncultured archaeon G...    48   2e-04
UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ...    48   2e-04
UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ...    47   3e-04
UniRef50_Q01FU9 Cluster: Cathepsin Z; n=2; Ostreococcus|Rep: Cat...    47   3e-04
UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Pl...    47   3e-04
UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ...    47   3e-04
UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr...    47   3e-04
UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, wh...    47   3e-04
UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto...    47   3e-04
UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ...    47   4e-04
UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa...    47   4e-04
UniRef50_Q9XXQ7 Cluster: Putative uncharacterized protein; n=2; ...    47   4e-04
UniRef50_Q5CM16 Cluster: P3ECSL-related; n=2; Cryptosporidium|Re...    47   4e-04
UniRef50_Q5C1E8 Cluster: SJCHGC04937 protein; n=1; Schistosoma j...    47   4e-04
UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcop...    47   4e-04
UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ...    47   4e-04
UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2...    47   4e-04
UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole...    46   5e-04
UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|...    46   5e-04
UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi...    46   5e-04
UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat...    46   5e-04
UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve...    46   5e-04
UniRef50_A2E6N1 Cluster: Clan CA, family C1, cathepsin B-like cy...    46   5e-04
UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ...    46   7e-04
UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus ...    46   7e-04
UniRef50_Q9XZI2 Cluster: Cathepsin Z1 preproprotein; n=3; Bilate...    46   7e-04
UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia...    46   7e-04
UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain...    46   7e-04
UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheir...    46   7e-04
UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs...    46   7e-04
UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; D...    46   7e-04
UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep: L...    46   9e-04
UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain...    46   9e-04
UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteina...    46   9e-04
UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory...    46   9e-04
UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|...    46   9e-04
UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal...    45   0.001
UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox...    45   0.001
UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc...    45   0.001
UniRef50_Q94715 Cluster: Putative cathepsin L2 precursor; n=4; P...    45   0.001
UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster...    45   0.002
UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep...    45   0.002
UniRef50_Q3L7L2 Cluster: Sar s 1 allergen SMIPP-C Yv6008G08; n=2...    45   0.002
UniRef50_Q23H15 Cluster: Papain family cysteine protease contain...    45   0.002
UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi...    45   0.002
UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|...    45   0.002
UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 p...    44   0.002
UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ...    44   0.002
UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome s...    44   0.002
UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty...    44   0.002
UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa...    44   0.002
UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaste...    44   0.002
UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R...    44   0.002
UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Re...    44   0.003
UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ...    44   0.003
UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; ...    44   0.003
UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ...    44   0.003
UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl...    44   0.003
UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain...    44   0.003
UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C...    44   0.003
UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:...    44   0.003
UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ...    44   0.003
UniRef50_A0LQG9 Cluster: Peptidase C1A, papain; n=1; Syntrophoba...    44   0.003
UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lambli...    44   0.003
UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi...    44   0.003
UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w...    44   0.003
UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Re...    44   0.003
UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy...    43   0.005
UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|...    43   0.005
UniRef50_UPI00006CBB5F Cluster: Papain family cysteine protease ...    43   0.006
UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckei...    43   0.006
UniRef50_Q6A1H9 Cluster: Cathepsin X/O; n=1; Suberites domuncula...    43   0.006
UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like cy...    43   0.006
UniRef50_UPI000155D183 Cluster: PREDICTED: similar to Cathepsin ...    42   0.008
UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ...    42   0.008
UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ...    42   0.008
UniRef50_Q8I1Y2 Cluster: Protease, putative; n=1; Plasmodium fal...    42   0.008
UniRef50_A5KAP8 Cluster: Protease, putative; n=1; Plasmodium viv...    42   0.008
UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-li...    42   0.008
UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, w...    42   0.008
UniRef50_Q7QQ83 Cluster: GLP_42_16392_14707; n=1; Giardia lambli...    42   0.011
UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve...    42   0.011
UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica...    42   0.014
UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dv...    42   0.014
UniRef50_A7APS9 Cluster: Papain family cysteine protease contain...    42   0.014
UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ...    42   0.014
UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; ...    41   0.019
UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n...    41   0.019
UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:...    41   0.019
UniRef50_Q8I0V1 Cluster: Preprocathepsin c, putative; n=1; Plasm...    41   0.019
UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H...    41   0.019
UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa...    41   0.025
UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil...    41   0.025
UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; ...    41   0.025
UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like cy...    41   0.025
UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-li...    41   0.025
UniRef50_UPI00006CBCFB Cluster: Papain family cysteine protease ...    40   0.033
UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ...    40   0.033
UniRef50_Q01E11 Cluster: Cysteine proteinase Cathepsin F; n=3; O...    40   0.033
UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ...    40   0.033
UniRef50_Q5CKC2 Cluster: Preprocathepsin c; n=2; Cryptosporidium...    40   0.033
UniRef50_Q3L7L7 Cluster: Sar s 1 allergen Yv5020C01; n=1; Sarcop...    40   0.033
UniRef50_A0CS14 Cluster: Chromosome undetermined scaffold_26, wh...    40   0.033
UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome s...    40   0.043
UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome s...    40   0.043
UniRef50_A6GBK8 Cluster: Peptidase C1A, papain; n=1; Plesiocysti...    40   0.043
UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestiv...    40   0.043
UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-...    40   0.043
UniRef50_Q54R55 Cluster: Putative uncharacterized protein; n=1; ...    40   0.043
UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lambli...    40   0.057
UniRef50_Q75JH0 Cluster: Similar to Dictyostelium discoideum (Sl...    40   0.057
UniRef50_Q5CYC8 Cluster: Cathepsin like thiol protease possibly ...    40   0.057
UniRef50_A5K8Y0 Cluster: Preprocathepsin c, putative; n=1; Plasm...    40   0.057
UniRef50_Q8IIJ9 Cluster: Probable cathepsin C precursor; n=11; P...    40   0.057
UniRef50_A0LES1 Cluster: Peptidase C1A, papain precursor; n=2; S...    39   0.075
UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryz...    39   0.075
UniRef50_Q7RQM7 Cluster: Dipeptidyl-peptidase i; n=6; Plasmodium...    39   0.075
UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lambl...    39   0.075
UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intest...    39   0.075
UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n...    39   0.075
UniRef50_Q60EQ9 Cluster: Putative uncharacterized protein OJ1280...    39   0.100
UniRef50_Q01FL5 Cluster: Cysteine protease-5; n=3; Ostreococcus|...    39   0.100
UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Ac...    39   0.100
UniRef50_Q93512 Cluster: Putative uncharacterized protein; n=2; ...    39   0.100
UniRef50_Q8I8D3 Cluster: Cysteine protease 15; n=2; Entamoeba hi...    39   0.100
UniRef50_Q22NW9 Cluster: Papain family cysteine protease contain...    39   0.100
UniRef50_A7AX75 Cluster: Preprocathepsin c, putative; n=1; Babes...    39   0.100
UniRef50_Q2FLC7 Cluster: Periplasmic copper-binding precursor; n...    39   0.100
UniRef50_UPI000155637A Cluster: PREDICTED: similar to ENSANGP000...    38   0.13
UniRef50_Q8QNJ8 Cluster: EsV-1-75; n=1; Ectocarpus siliculosus v...    38   0.13
UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v...    38   0.13
UniRef50_UPI00006CB653 Cluster: Papain family cysteine protease ...    38   0.17
UniRef50_Q015J8 Cluster: Cathepsin; n=2; Ostreococcus|Rep: Cathe...    38   0.17
UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ...    38   0.17
UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicate...    38   0.17
UniRef50_Q9UBR2 Cluster: Cathepsin Z precursor; n=40; Bilateria|...    38   0.17
UniRef50_Q0JP65 Cluster: Os01g0240900 protein; n=3; Oryza sativa...    38   0.23
UniRef50_Q5YER6 Cluster: Cathepsin Z; n=1; Bigelowiella natans|R...    37   0.30
UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endop...    37   0.30
UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|...    37   0.40
UniRef50_Q17IZ6 Cluster: Procathepsin L3, putative; n=2; Culicid...    37   0.40
UniRef50_Q3A8I3 Cluster: Putative serine protease; n=1; Pelobact...    36   0.53
UniRef50_A4S004 Cluster: Predicted protein; n=2; Ostreococcus|Re...    36   0.53
UniRef50_Q22RC9 Cluster: Papain family cysteine protease contain...    36   0.53
UniRef50_A5KBM2 Cluster: Serine-repeat antigen; n=3; Plasmodium|...    36   0.53
UniRef50_A0DCA5 Cluster: Chromosome undetermined scaffold_45, wh...    36   0.53
UniRef50_A5UP12 Cluster: Adhesin-like protein; n=1; Methanobrevi...    36   0.53
UniRef50_A0EAG2 Cluster: Chromosome undetermined scaffold_86, wh...    36   0.70
UniRef50_A0D5R4 Cluster: Chromosome undetermined scaffold_39, wh...    36   0.70
UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,...    36   0.93
UniRef50_UPI00006CBAC7 Cluster: Papain family cysteine protease ...    36   0.93
UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; R...    36   0.93
UniRef50_Q7R0G3 Cluster: GLP_29_33036_32140; n=1; Giardia lambli...    36   0.93
UniRef50_Q4N3V5 Cluster: Cathepsin C, putative; n=1; Theileria p...    36   0.93
UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina...    36   0.93
UniRef50_A0BZE1 Cluster: Chromosome undetermined scaffold_139, w...    36   0.93
UniRef50_UPI0001509FF9 Cluster: Papain family cysteine protease ...    35   1.2
UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin ...    35   1.2
UniRef50_Q9NHY1 Cluster: Cysteine protease cp2; n=1; Theileria c...    35   1.2
UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theil...    35   1.2
UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1; ...    35   1.2
UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryz...    35   1.6
UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba hi...    35   1.6
UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1; ...    35   1.6
UniRef50_Q4YCM9 Cluster: Cysteine protease, putative; n=5; Plasm...    35   1.6
UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;...    34   2.1
UniRef50_Q02CM0 Cluster: 4Fe-4S ferredoxin, iron-sulfur binding ...    34   2.1
UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like cy...    34   2.1
UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2; ...    34   2.8
UniRef50_A5HC50 Cluster: Cathepsin W; n=2; Theria|Rep: Cathepsin...    34   2.8
UniRef50_A2F4G0 Cluster: Clan CA, family C1, cathepsin L-like cy...    34   2.8
UniRef50_Q9NHY2 Cluster: Cysteine protease cp1; n=2; Theileria c...    33   3.7
UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n...    33   3.7
UniRef50_Q0GBZ7 Cluster: Membrane-associated protein 29; n=4; Sc...    33   3.7
UniRef50_Q7M4N9 Cluster: Dipeptidyl-peptidase I; n=1; Homo sapie...    33   3.7
UniRef50_UPI0000DB78AE Cluster: PREDICTED: similar to C25E10.7; ...    33   4.9
UniRef50_O96163 Cluster: Cysteine protease, putative; n=5; Plasm...    33   4.9
UniRef50_A7ASR7 Cluster: Cathepsin C, putative; n=1; Babesia bov...    33   4.9
UniRef50_A5KBM1 Cluster: Serine-repeat antigen; n=1; Plasmodium ...    33   4.9
UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease ...    33   6.5
UniRef50_Q0E4Y7 Cluster: 50 kDa Cathepsin B; n=2; Ascovirus|Rep:...    33   6.5
UniRef50_A7M7G2 Cluster: ParC; n=1; Serratia entomophila|Rep: Pa...    33   6.5
UniRef50_A3J1C4 Cluster: B-glycosyltransferase-related protein, ...    33   6.5
UniRef50_Q0E4N0 Cluster: Os02g0109400 protein; n=3; Oryza sativa...    33   6.5
UniRef50_Q54MB6 Cluster: Putative uncharacterized protein; n=1; ...    33   6.5
UniRef50_A6N590 Cluster: Extracellular cysteine protease 8; n=10...    33   6.5
UniRef50_A5KBM0 Cluster: Serine-repeat antigen (SERA), putative;...    33   6.5
UniRef50_A0DZB1 Cluster: Chromosome undetermined scaffold_7, who...    33   6.5
UniRef50_Q2H7E7 Cluster: Putative uncharacterized protein; n=1; ...    33   6.5
UniRef50_UPI0000F2D780 Cluster: PREDICTED: similar to WW domain ...    32   8.6
UniRef50_A1SVF0 Cluster: Peptidase C1A, papain; n=1; Psychromona...    32   8.6
UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sa...    32   8.6
UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryz...    32   8.6
UniRef50_Q2QS15 Cluster: Papain family cysteine protease contain...    32   8.6
UniRef50_Q8I8D7 Cluster: Cysteine protease 11; n=4; Entamoeba hi...    32   8.6
UniRef50_Q7QWN9 Cluster: GLP_26_50243_51811; n=2; Giardia lambli...    32   8.6
UniRef50_Q26153 Cluster: V-SERA 4; n=1; Plasmodium vivax|Rep: V-...    32   8.6
UniRef50_Q26015 Cluster: Serine rich protein homologue; n=4; Pla...    32   8.6
UniRef50_Q1AMF3 Cluster: Cathepsin C1; n=1; Toxoplasma gondii|Re...    32   8.6
UniRef50_Q1AMF2 Cluster: Cathepsin C2; n=1; Toxoplasma gondii|Re...    32   8.6
UniRef50_A0DKP6 Cluster: Chromosome undetermined scaffold_54, wh...    32   8.6
UniRef50_Q1DTN0 Cluster: Predicted protein; n=1; Coccidioides im...    32   8.6
UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]...
32 8.6 >UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpwnx02 - Periplaneta americana (American cockroach) Length = 343 Score = 271 bits (665), Expect = 7e-72 Identities = 114/191 (59%), Positives = 139/191 (72%) Frame = +2 Query: 14 TWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPT 193 TWKA RNF P IK LMG + +LP+ + + ++ +PE FDPR++WPECPT Sbjct: 50 TWKAHRNFGNDIPLREIKKLMGVRRSLENFRLPEKSME-DIDIEIPEEFDPREQWPECPT 108 Query: 194 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPT 373 L EIRDQGSCGSCWAFGAVEAM+DRVCI+S HFHFSAEDL++CC CG GCNGG P Sbjct: 109 LKEIRDQGSCGSCWAFGAVEAMSDRVCIHSKGKTHFHFSAEDLLTCCSSCGFGCNGGEPG 168 Query: 374 LAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNCESSYN 553 AW+YW G+VSGG+YNS QGC+PY I PCEHHV G R PC G+ TP+C K CE Y+ Sbjct: 169 AAWDYWVSTGIVSGGSYNSHQGCQPYAIEPCEHHVNGTRKPC-GEGDTPRCVKRCEEGYD 227 Query: 554 VPFKKEQRYGK 586 VP+ K++ +GK Sbjct: 228 VPYGKDRHFGK 238 >UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin B; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin B - Strongylocentrotus purpuratus Length = 346 Score = 244 bits (597), Expect = 1e-63 Identities = 102/191 (53%), Positives = 137/191 (71%) Frame = +2 Query: 8 QNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPEC 187 + TWKAG NF + ++GALK+ N +LPK+ + I +LPENFD R+ WP C Sbjct: 35 KTTWKAGINFEGWQ-LDDFRRMLGALKNPNG-RLPKLENQTR-IKDLPENFDARENWPNC 91 Query: 188 PTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGM 367 PT+ E+RDQGSCGSCWAFGAVEA++DR+CI S H SAEDL++CC CG GCNGG Sbjct: 92 PTIKEVRDQGSCGSCWAFGAVEAISDRICIKSKGQTQVHISAEDLMTCCKTCGNGCNGGF 151 Query: 368 PTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNCESS 547 P AWEY+K G+V+GG +NSSQGC+PY+I C+HHV G + PC G+ TP+C+ CE+S Sbjct: 152 PGSAWEYYKDTGIVTGGQWNSSQGCQPYQIKSCDHHVNGTKGPCQGEGPTPECKHKCEAS 211 Query: 548 YNVPFKKEQRY 580 Y+ P+++++ Y Sbjct: 212 YSTPYEQDKHY 222 >UniRef50_A1XG92 Cluster: Putative cathepsin B-like like proteinase; n=1; Tenebrio molitor|Rep: Putative 
cathepsin B-like like proteinase - Tenebrio molitor (Yellow mealworm) Length = 301 Score = 242 bits (592), Expect = 5e-63 Identities = 106/195 (54%), Positives = 135/195 (69%), Gaps = 2/195 (1%) Frame = +2 Query: 5 KQNTWKAGRNFPTHTPFAHIKILMGAL-KDDNILKLPKVTHDAELIANLPENFDPRDKWP 181 KQ TWKAGRNF +TP +H++ L+G L K N KLP TH L A +PE+FD R+ WP Sbjct: 37 KQTTWKAGRNFDVNTPISHVRRLLGVLPKKANAPKLPVKTHAVNLDA-IPESFDAREAWP 95 Query: 182 ECPTL-NEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCN 358 EC ++ EIRDQ SCGSCWAFGAVEAM+DR+CI+S+A+ SAEDL CC CG GCN Sbjct: 96 ECTSIIGEIRDQASCGSCWAFGAVEAMSDRICIHSDASVKVRISAEDLNDCCYDCGDGCN 155 Query: 359 GGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNC 538 GG P LAW YW G+V+GG Y +GC+ Y I PC+HHV GN PC +TP C+K+C Sbjct: 156 GGWPDLAWSYWSSTGIVTGGLYGVDEGCKAYSIKPCDHHVDGNLGPCGDIQRTPACKKSC 215 Query: 539 ESSYNVPFKKEQRYG 583 +S+ ++ +K + R G Sbjct: 216 DSTSDLEYKSDLRRG 230 >UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Cathepsin B1) (APP secretase) (APPS) [Contains: Cathepsin B light chain; Cathepsin B heavy chain]; n=85; Eukaryota|Rep: Cathepsin B precursor (EC 3.4.22.1) (Cathepsin B1) (APP secretase) (APPS) [Contains: Cathepsin B light chain; Cathepsin B heavy chain] - Homo sapiens (Human) Length = 339 Score = 231 bits (565), Expect = 9e-60 Identities = 103/195 (52%), Positives = 132/195 (67%), Gaps = 1/195 (0%) Frame = +2 Query: 2 KKQNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWP 181 K+ TW+AG NF + +++K L G K P+ E + LP +FD R++WP Sbjct: 36 KRNTTWQAGHNF-YNVDMSYLKRLCGTFLGGP--KPPQRVMFTEDL-KLPASFDAREQWP 91 Query: 182 ECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCN 358 +CPT+ EIRDQGSCGSCWAFGAVEA++DR+CI++NA SAEDL++CC +CG GCN Sbjct: 92 QCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCN 151 Query: 359 GGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNC 538 GG P AW +W GLVSGG Y S GCRPY IPPCEHHV G+R PC G+ TPKC K C Sbjct: 152 GGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKIC 211 Query: 539 
ESSYNVPFKKEQRYG 583 E Y+ +K+++ YG Sbjct: 212 EPGYSPTYKQDKHYG 226 >UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome 5 SCAF15026, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 351 Score = 212 bits (518), Expect = 5e-54 Identities = 106/217 (48%), Positives = 136/217 (62%), Gaps = 22/217 (10%) Frame = +2 Query: 2 KKQNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWP 181 K +TW AG NF + ++++K L G L KLP + A I LP+ FD R++WP Sbjct: 35 KLNSTWTAGHNFH-NVDYSYVKKLCGTLLKGP--KLPLMIRYAGDI-KLPKEFDSREQWP 90 Query: 182 ECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNG 361 CPTL EIRDQGSCGSCWAFGA EAM+DRVCI+SNA SA+DL++CC CG+GCNG Sbjct: 91 NCPTLKEIRDQGSCGSCWAFGASEAMSDRVCIHSNAKVSVELSAQDLLTCCNSCGMGCNG 150 Query: 362 GMPTLAWEYWKHVGLVSGGNYNS---------------------SQGCRPYEIPPCEHHV 478 G P+ AW +W GLVSGG Y+S S GCRPY IPPCEHHV Sbjct: 151 GYPSSAWNFWVSDGLVSGGLYDSHIGRIQVSLCVLLLAVDRDFVSPGCRPYTIPPCEHHV 210 Query: 479 PGNRMPCNGD-TKTPKCQKNCESSYNVPFKKEQRYGK 586 G+R C+G+ TP+C CE+ Y+ +K+++ +GK Sbjct: 211 NGSRPSCSGEGGDTPECIFRCEAGYSPSYKQDKHFGK 247 >UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=1; Biomphalaria glabrata|Rep: Cathepsin B preproprotein precursor - Biomphalaria glabrata (Bloodfluke planorb) Length = 333 Score = 208 bits (508), Expect = 7e-53 Identities = 88/192 (45%), Positives = 118/192 (61%), Gaps = 1/192 (0%) Frame = +2 Query: 14 TWKAGRNF-PTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECP 190 TWKAGRNF P A + + ++ ++ + +LP+NFDPR KWP+C Sbjct: 42 TWKAGRNFHPAEIKRARALLGVNMAENKAYNRIHLKYKQVQPRNDLPDNFDPRTKWPDCA 101 Query: 191 TLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMP 370 +LNEIRDQ +CGSCWAFG+ EAMTDR+CI + H SAED+ CC CG+GCNGG P Sbjct: 102 SLNEIRDQANCGSCWAFGSAEAMTDRICIAGKG--NIHISAEDINDCCKSCGMGCNGGYP 159 Query: 371 TLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNCESSY 550 AWE++ G+VSGG Y +++GC PY +P C+HH G PC TPKC+K C + Y Sbjct: 160 
AAAWEWYVDTGVVSGGQYGTNEGCMPYSLPHCDHHTTGKYQPCPAVVPTPKCEKKCLTGY 219 Query: 551 NVPFKKEQRYGK 586 + ++ GK Sbjct: 220 PKSYSNDKTRGK 231 >UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase precursor; n=28; Bilateria|Rep: Cathepsin B-like cysteine proteinase precursor - Schistosoma japonicum (Blood fluke) Length = 342 Score = 202 bits (492), Expect = 6e-51 Identities = 89/193 (46%), Positives = 120/193 (62%), Gaps = 4/193 (2%) Frame = +2 Query: 17 WKAGRNFPTHTPFAHIKILMGALKDDNILKL---PKVTHDAELIANLPENFDPRDKWPEC 187 WKA ++ H+ +ILMGA K+D +K P V H +L +P FD R KWP C Sbjct: 46 WKADKSDRFHS-LDDARILMGARKEDAEMKRNRRPTVDHH-DLNVEIPSQFDSRKKWPHC 103 Query: 188 PTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGM 367 ++++IRDQ CGSCWAFGAVEAMTDR+CI S + SA DL+SCC CG GC GG Sbjct: 104 KSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLISCCKDCGDGCQGGF 163 Query: 368 PTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDT-KTPKCQKNCES 544 P +AW+YW G+V+GG+ + GC+PY P CEHH G C KTP+C++ C+ Sbjct: 164 PGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQK 223 Query: 545 SYNVPFKKEQRYG 583 Y P+++++ YG Sbjct: 224 GYKTPYEQDKHYG 236 >UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=1; Nilaparvata lugens|Rep: Cathepsin B-like protease precursor - Nilaparvata lugens (Brown planthopper) Length = 347 Score = 193 bits (470), Expect = 3e-48 Identities = 86/200 (43%), Positives = 123/200 (61%), Gaps = 7/200 (3%) Frame = +2 Query: 8 QNTWKAGRNFPTHTPFAHIKILMGALK-DDNILKLPKVTHDAELIAN----LPENFDPRD 172 ++TWKAG NF TP ++++ L+G + + N+ L K E N +P+ FD R Sbjct: 41 KSTWKAGHNFHPDTPMSYLQGLLGVSELESNLADLDKYEEMEENEENKKIKVPKYFDARK 100 Query: 173 KWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLG 352 KW +C +L EIRDQG+CGSCWA A DR+CI SNA + H S+ +L+SCC CG G Sbjct: 101 KWKKCKSLREIRDQGNCGSCWAVSVAAAFADRLCIASNAKWNGHISSRELMSCCSYCGFG 160 Query: 353 CNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGD--TKTPKC 526 C GG P AW + K GLV+GG+Y+S GC+PY I PCEHH+ G++ C+ TP C Sbjct: 161 
CEGGFPDAAWVFIKRHGLVTGGDYHSHDGCQPYPIAPCEHHMEGSKPNCSASPTEPTPAC 220 Query: 527 QKNCESSYNVPFKKEQRYGK 586 + C ++ ++K+++ GK Sbjct: 221 ETTCTHGSSLAYQKDRQKGK 240 >UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep: Cathepsin B - Pandalus borealis (Northern red shrimp) Length = 328 Score = 191 bits (465), Expect = 1e-47 Identities = 83/193 (43%), Positives = 111/193 (57%) Frame = +2 Query: 5 KQNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPE 184 KQ TWKAGRNF +K L K+ +I KLP + +P FD R++WP Sbjct: 31 KQMTWKAGRNFAKDISKDFLKSLNCVRKNPDIPKLP--LKNVTPTKEIPVEFDAREQWPH 88 Query: 185 CPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGG 364 CP ++EIRDQG+CGSCWA A MTDR CI + F FS+E++ +CC CG C GG Sbjct: 89 CPCIDEIRDQGNCGSCWAVSAASVMTDRTCIDTEGLVDFRFSSENVAACCTECGNACYGG 148 Query: 365 MPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNCES 544 A+ +W G VSGG +NS++GC+PY + CEHH+ G R PC GD C + C Sbjct: 149 DEDTAFTHWVTKGFVSGGRHNSNEGCQPYSVEECEHHIEGPRPPCEGDMPELVCSETCHE 208 Query: 545 SYNVPFKKEQRYG 583 Y ++++ YG Sbjct: 209 EYGKTYEEDLEYG 221 >UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 precursor; n=11; Bilateria|Rep: Cathepsin B-like cysteine proteinase 6 precursor - Caenorhabditis elegans Length = 379 Score = 189 bits (461), Expect = 4e-47 Identities = 79/152 (51%), Positives = 99/152 (65%), Gaps = 2/152 (1%) Frame = +2 Query: 131 ELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFS 310 +L ++PE+FD RD WP+C ++ IRDQ SCGSCWAFGAVEAM+DR+CI S+ S Sbjct: 100 DLDLDIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLS 159 Query: 311 AEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNR 490 A+DL+SCC CG GCNGG P AW YW G+V+G NY ++ GC+PY PPCEHH Sbjct: 160 ADDLLSCCKSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTH 219 Query: 491 M-PCNGDT-KTPKCQKNCESSYNVPFKKEQRY 580 PC D TPKC+K C S Y E ++ Sbjct: 220 FDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKF 251 >UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin 
B-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 331 Score = 187 bits (456), Expect = 1e-46 Identities = 85/195 (43%), Positives = 115/195 (58%), Gaps = 2/195 (1%) Frame = +2 Query: 5 KQNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPE 184 KQ+TW AG+NF + IK L+GA K + + TH ++ +P +FD R+ W E Sbjct: 35 KQSTWVAGKNFDENLSIQEIKNLLGA-KKGKLGVAKEFTHSEDI--QVPNSFDARENWKE 91 Query: 185 CP-TLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNG 361 C ++ + DQ CGSCWA A AM+DR CI S SAE+L+SCC CG GC G Sbjct: 92 CSDVISTVVDQSDCGSCWAVAAASAMSDRRCIASQGKLKVPVSAENLLSCCDSCGYGCEG 151 Query: 362 GMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNG-DTKTPKCQKNC 538 G PT+AW YW G+ +GG Y S QGC+PY + PCEHH GN++ C+ D TP C+ C Sbjct: 152 GYPTMAWSYWIDTGITTGGLYGSKQGCQPYSLQPCEHHTEGNKVQCSTLDYDTPSCKHKC 211 Query: 539 ESSYNVPFKKEQRYG 583 + S + +K E +G Sbjct: 212 DDS-ALNYKSELTFG 225 >UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|Rep: Cathepsin B5 - Clonorchis sinensis Length = 343 Score = 187 bits (456), Expect = 1e-46 Identities = 84/179 (46%), Positives = 110/179 (61%), Gaps = 3/179 (1%) Frame = +2 Query: 17 WKAGRNFPTHTPFAHIKILMGALKDDNILKL--PKVTHDAELIANLPENFDPRDKWPECP 190 W +GR P + + GA ++ K P + HD LP+NFD R WP C Sbjct: 42 WISGR-LPKRFESGDLIHMFGAKRETREQKAQRPTLRHDGFDNMRLPKNFDARKTWPHCS 100 Query: 191 TLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMP 370 +++EIRDQ SCGSCWAFGAVEAM+DR+CI+SN + SA DL+SCC CG GC GG P Sbjct: 101 SISEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLLSCCKDCGFGCRGGYP 160 Query: 371 TLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDT-KTPKCQKNCES 544 +AW+YWK G+V+GG+ GCR Y P CEHHV G+ PC + TP+C + C++ Sbjct: 161 AVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEHHVQGHYPPCPRELYPTPECVQQCDT 219 >UniRef50_Q23FP9 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 340 Score = 184 bits (447), Expect = 2e-45 Identities 
= 82/184 (44%), Positives = 109/184 (59%), Gaps = 4/184 (2%) Frame = +2 Query: 11 NTWKAGRNFPTHTPFAHIKIL--MGALKDDNILKLPKVTHDAELIAN-LPENFDPRDKWP 181 +TWKA R +P ++L +G+L + + +KLP D A+ +PE FD R++WP Sbjct: 41 STWKAAR-YPHFEKMTREQLLGHLGSLDEPDWVKLPTKEFDPNANADPIPEFFDAREQWP 99 Query: 182 ECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP-ICGLGCN 358 C ++ IRDQ +CGSCWAF A E +DR+CI SN T S+EDL+ CC CG+GC Sbjct: 100 NCQSIKLIRDQSTCGSCWAFAATETFSDRICIASNQTLQTSISSEDLLECCADYCGMGCK 159 Query: 359 GGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNC 538 GG P+ AW Y K G+ +GG Y C+PY PPC+HHV G PC TP+C K C Sbjct: 160 GGYPSAAWGYMKRQGVSTGGLYGDDTSCKPYIFPPCDHHVTGQYQPCGPIQPTPQCVKEC 219 Query: 539 ESSY 550 S Y Sbjct: 220 NSEY 223 >UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase precursor; n=29; Schistosomatidae|Rep: Cathepsin B-like cysteine proteinase precursor - Schistosoma mansoni (Blood fluke) Length = 340 Score = 180 bits (439), Expect = 2e-44 Identities = 80/194 (41%), Positives = 114/194 (58%), Gaps = 4/194 (2%) Frame = +2 Query: 17 WKAGRNFPTHTPFAHIKILMGALKDDNILKL---PKVTHDAELIANLPENFDPRDKWPEC 187 W+A ++ H+ +I MGA +++ L+ P V H+ + +P NFD R KWP C Sbjct: 45 WRAEKSNRFHS-LDDARIQMGARREEPDLRRKRRPTVDHN-DWNVEIPSNFDSRKKWPGC 102 Query: 188 PTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGM 367 ++ IRDQ CGSCW+FGAVEAM+DR CI S ++ SA DL++CC CGLGC GG+ Sbjct: 103 KSIATIRDQSRCGSCWSFGAVEAMSDRSCIQSGGKQNVELSAVDLLTCCESCGLGCEGGI 162 Query: 368 PTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDT-KTPKCQKNCES 544 AW+YW G+V+ + + GC PY P CEHH G PC TP+C++ C+ Sbjct: 163 LGPAWDYWVKEGIVTASSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYNTPRCKQTCQR 222 Query: 545 SYNVPFKKEQRYGK 586 Y P+ +++ GK Sbjct: 223 KYKTPYTQDKHRGK 236 >UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n=8; Strongylida|Rep: Cathepsin B-like cysteine protease 2 - Parelaphostrongylus tenuis Length = 344 Score = 177 bits (432), Expect = 1e-43 Identities = 75/162 (46%), Positives = 101/162 (62%), Gaps = 1/162 (0%) Frame = +2 Query: 
104 KLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYS 283 K P+V E +P++FD R +WP CP+++ IRDQ CGSCWAFG+ EAM+DRVCI S Sbjct: 80 KKPRVDEIGEEGFKIPDSFDARVQWPHCPSISYIRDQSQCGSCWAFGSAEAMSDRVCIAS 139 Query: 284 NATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPP 463 + K SA+D++SCC CG GC+GG P AWEY+ G+V+GG Y + CRPYEIPP Sbjct: 140 HGNKTVELSADDILSCCYDCGDGCDGGYPISAWEYFVETGVVTGGLYGTKDSCRPYEIPP 199 Query: 464 CEHHVPGNRM-PCNGDTKTPKCQKNCESSYNVPFKKEQRYGK 586 C HH C TP C C++ Y + + ++ +GK Sbjct: 200 CGHHRNETFYGNCTQIADTPDCVTTCQAGYPISYDDDKTFGK 241 >UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n=4; Tenebrionidae|Rep: Putative cathepsin B-like proteinase - Tenebrio molitor (Yellow mealworm) Length = 321 Score = 172 bits (419), Expect = 4e-42 Identities = 83/196 (42%), Positives = 120/196 (61%), Gaps = 2/196 (1%) Frame = +2 Query: 8 QNTWKAGRNFPTHTPFAHIKILMG--ALKDDNILKLPKVTHDAELIANLPENFDPRDKWP 181 Q++W AGRNFP +T ++ L G L D K P + H ++PE+FD R KWP Sbjct: 36 QSSWVAGRNFPENTTNEYLYKLNGFIGLHPDPNYKPPVLVHTFNA-RDVPESFDARTKWP 94 Query: 182 ECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNG 361 C +LN IRDQG+CGSCWAF ++E+M+DR+CI+S+ + F FS EDL+SCC CG C G Sbjct: 95 NCDSLNRIRDQGACGSCWAFASIESMSDRICIHSSGSAQFMFSPEDLLSCCTSCG-DCGG 153 Query: 362 GMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNCE 541 G A +++ + G+VSGG+ NS++GCRPY + H G +TP C K+C Sbjct: 154 GYMMSALDFYINEGIVSGGDVNSNEGCRPY---TADAHDQG---------QTPACTKSCR 201 Query: 542 SSYNVPFKKEQRYGKH 589 + Y+ + ++ YG + Sbjct: 202 NGYSTSYSADKHYGSN 217 >UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Cathepsin B - Fasciola gigantica (Giant liver fluke) Length = 339 Score = 172 bits (418), Expect = 6e-42 Identities = 81/196 (41%), Positives = 116/196 (59%), Gaps = 6/196 (3%) Frame = +2 Query: 14 TWKAGRNFPTHTPFAHIKILMGALKDD----NILKLPKVTHDAELIANLPENFDPRDKWP 181 +WKA R+ + H K+ +GAL + N L+ P + HD +LPE+FD R +WP Sbjct: 41 SWKAARS-TRFSNVDHFKLHLGALSETPEERNALR-PTIKHDISK-NDLPESFDARSQWP 97 Query: 182 
ECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNG 361 +C T++EIRDQ SCGSCWA A AM+DRVCI+SN +A D +SCC CG GC G Sbjct: 98 QCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAADPLSCCTYCGQGCRG 157 Query: 362 GMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMP-CNGDT-KTPKCQKN 535 G P AW+YW G+V+GG + + GC+P+ C+H + C T TP C + Sbjct: 158 GYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYTYPTPPCARA 217 Query: 536 CESSYNVPFKKEQRYG 583 C++ YN +++++ YG Sbjct: 218 CQTGYNKTYEQDKFYG 233 >UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep: Cathepsin b - Aedes aegypti (Yellowfever mosquito) Length = 386 Score = 169 bits (411), Expect = 4e-41 Identities = 89/193 (46%), Positives = 109/193 (56%), Gaps = 2/193 (1%) Frame = +2 Query: 14 TWKAGRNFPTHTPFAHIKILMGALKDDNILKLPK-VTHDAELIANLPENFDPRDKWPECP 190 TW+AG N P + M L+ KLP + D E + +LP+ FD R+KWPECP Sbjct: 85 TWRAGSN-PKPPAGYRSGVNMADLERT---KLPLGIMADVEDL-DLPDTFDAREKWPECP 139 Query: 191 TLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMP 370 +L EIRDQG CGSCWA A AMTDR C+ S + F F + DL+SCC CG GC GG Sbjct: 140 SLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLLSCCHSCGQGCRGGTL 199 Query: 371 TLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNCESSY 550 AW++W GL SGG NS QGC PY I C +PG D TPKC C S Y Sbjct: 200 GPAWQFWVEKGLSSGGPLNSRQGCHPYPIGEC--RIPGE------DEDTPKCSNKCRSGY 251 Query: 551 NV-PFKKEQRYGK 586 NV +++ YG+ Sbjct: 252 NVTDVWQDRHYGR 264 >UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep: Cathepsin B - Apriona germari Length = 324 Score = 167 bits (405), Expect = 2e-40 Identities = 85/196 (43%), Positives = 118/196 (60%), Gaps = 3/196 (1%) Frame = +2 Query: 2 KKQNTWKAGRNFPTHTPFAHIKIL---MGALKDDNILKLPKVTHDAELIANLPENFDPRD 172 +K TW A +NF TP +K L +G +D N+ LP V H+A I+ +P++FD R+ Sbjct: 37 EKATTWTARKNFEGRTP-EQLKALADVIGINRDPNVT-LPVVFHEA--ISGIPDSFDARE 92 Query: 173 KWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLG 352 +WP C ++ IRD+G+CGSCWAF AVE M+DR+C+ S K F FSAE++VSCC CG G Sbjct: 93 
QWPFCESIRTIRDEGACGSCWAFAAVEVMSDRLCLASEGRKKFIFSAEEVVSCCTACGGG 152 Query: 353 CNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQK 532 C GG ++YW G+ SGG+Y S GC+PY V G +TP+CQK Sbjct: 153 CRGGFLNEPYKYWVTNGIPSGGDYGSKLGCKPYTAA-----VSG---------ETPQCQK 198 Query: 533 NCESSYNVPFKKEQRY 580 C S Y ++K+ R+ Sbjct: 199 ACVSGYEKSWEKDLRH 214 >UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep: Cathepsin b - Aedes aegypti (Yellowfever mosquito) Length = 332 Score = 165 bits (401), Expect = 7e-40 Identities = 74/191 (38%), Positives = 109/191 (57%), Gaps = 1/191 (0%) Frame = +2 Query: 14 TWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPT 193 TW F F + + + G + +LP HD ++PE FD R+KWP C + Sbjct: 41 TWTPDATFRDGIRFENFQNMKGIFESKIGFRLPTKRHDVAYNMDIPEFFDAREKWPYCKS 100 Query: 194 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGG-MP 370 ++ I++QG CG+CWA AV M+DR+CI+S +AEDL+ CC CG GCNGG + Sbjct: 101 ISTIKNQGLCGACWAVAAVSVMSDRLCIHSEGKFDVELAAEDLMGCCKDCGNGCNGGFLD 160 Query: 371 TLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNCESSY 550 +++YW VGLVSG YNS+ GC+PY PC + G C+ + KTP C +C Y Sbjct: 161 GTSFQYWVDVGLVSGAAYNSTDGCKPYPFKPCLYPFVG----CHPE-KTPSCTHHCTEGY 215 Query: 551 NVPFKKEQRYG 583 + +++++ YG Sbjct: 216 DGTYRRDKYYG 226 >UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep: Cathepsin B - Uronema marinum Length = 350 Score = 163 bits (397), Expect = 2e-39 Identities = 84/194 (43%), Positives = 108/194 (55%), Gaps = 14/194 (7%) Frame = +2 Query: 11 NTWKAGRNFPTH-TPFAHIKILMGALKDDNILKLPKVTHDA-ELIANL--PENFDPRDKW 178 +TWKAG N F I+ +MG + + +P + E I NL PE+FD R+ + Sbjct: 38 STWKAGYNKRFEGMSFDQIQAMMGTIATP-VHMIPDERYTPFETIQNLSLPESFDLREAY 96 Query: 179 PECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP---ICGL 349 P+C +L ++RDQ +CGSCWAFG VEA++DR+CI S S+E+L+SCC CG+ Sbjct: 97 PKCESLQQVRDQSNCGSCWAFGTVEAISDRICIASGQKDQTRISSENLLSCCRGTFACGM 156 Query: 350 GCNGGMPTLAWEYWKHVGLVSGG-----NYNSSQGCRPYEIPPCEHHVPGNRMPCNG--D 508 GCNGG AW Y+ GLVSG N NS C+PY PPC HHV G C Sbjct: 157 
GCNGGYTAGAWNYYVKTGLVSGNLYTDDNQNSKTECQPYSFPPCSHHVQGEYQACTDLPQ 216 Query: 509 TKTPKCQKNCESSY 550 TPKC C S Y Sbjct: 217 FNTPKCYTECNSQY 230 >UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8.4; n=1; Caenorhabditis elegans|Rep: Putative uncharacterized protein W07B8.4 - Caenorhabditis elegans Length = 335 Score = 161 bits (390), Expect = 1e-38 Identities = 71/159 (44%), Positives = 100/159 (62%), Gaps = 7/159 (4%) Frame = +2 Query: 128 AELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHF 307 AE ++P+++D RD WP+C ++N IRDQ CGSCWA A EA++DR CI SN + Sbjct: 67 AETADSIPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGDVNTLL 126 Query: 308 SAEDLVSCCP---ICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHV 478 SAED+++CC CG GC GG P AW YW GLV+GG++ S GC+PY I PC + Sbjct: 127 SAEDILTCCTGKFNCGDGCEGGYPIQAWRYWVKNGLVTGGSFESQYGCKPYSIAPCGETI 186 Query: 479 PGNRMP-CNGD-TKTPKCQKNC--ESSYNVPFKKEQRYG 583 G P C + TPKC+ +C +SY +P+ +++ +G Sbjct: 187 DGVTWPECPMKISDTPKCEHHCTGNNSYPIPYDQDKHFG 225 >UniRef50_Q237A1 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 346 Score = 157 bits (381), Expect = 2e-37 Identities = 78/197 (39%), Positives = 117/197 (59%), Gaps = 6/197 (3%) Frame = +2 Query: 11 NTWKAGRNFP-THTPFAHIKILMGA-LKDDNILKLPKVTHDAELIANLPENFDPRDKWPE 184 +TWKAG N ++ A +K MG L ++ +KL V+ A LPE FD R +W + Sbjct: 49 STWKAGENTKWINSDIAGVKAHMGVKLGQESGIKLETVSAQAN---GLPEEFDARVQWGD 105 Query: 185 -CPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNG 361 C +L E+RDQ +CGSCWAFGA E+++DR CI+ + S ++L++CC CG GC+G Sbjct: 106 KCSSLWEVRDQSTCGSCWAFGAAESLSDRHCIHLG--QDIRLSTQNLLTCCAACGDGCDG 163 Query: 362 GMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGN-RMPCNGDTKTPKCQKNC 538 G P A +Y+ + GLV+G Y ++ C+ Y PC HHV + PC G+ TP C +C Sbjct: 164 GWPEAAMDYYVNTGLVTGDLYGNNSWCQAYTFAPCAHHVTSDIYPPCTGELPTPPCINSC 223 Query: 539 E--SSYNVPFKKEQRYG 583 + S++ +P+ K+ G Sbjct: 224 
DSNSTHTIPYSKDIHRG 240 >UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2 precursor; n=8; Haemonchus contortus|Rep: Cathepsin B-like cysteine proteinase 2 precursor - Haemonchus contortus (Barber pole worm) Length = 342 Score = 157 bits (380), Expect = 2e-37 Identities = 77/184 (41%), Positives = 104/184 (56%), Gaps = 4/184 (2%) Frame = +2 Query: 47 TPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCG 226 TP KI+ K + + K D E+ ++P ++DPRD W C T IRDQ +CG Sbjct: 56 TPDFEQKIMSIKYKHQKLNLMVKEDPDPEV--DIPPSYDPRDVWKNCTTFY-IRDQANCG 112 Query: 227 SCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAWEYWKHVG 403 SCWA A++DR+CI S A K + SA D+++CC P CG GC GG P AW+Y+ + G Sbjct: 113 SCWAVSTAAAISDRICIASKAEKQVNISATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDG 172 Query: 404 LVSGGNYNSSQGCRPYEIPPCEHHVPGNRM---PCNGDTKTPKCQKNCESSYNVPFKKEQ 574 +VSGG Y + CRPY I PC HH GN C G TP C++ C ++ ++ Sbjct: 173 VVSGGEYLTKDVCRPYPIHPCGHH--GNDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDK 230 Query: 575 RYGK 586 RYGK Sbjct: 231 RYGK 234 >UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1 precursor; n=3; Haemonchidae|Rep: Cathepsin B-like cysteine proteinase 1 precursor - Ostertagia ostertagi Length = 341 Score = 155 bits (377), Expect = 5e-37 Identities = 78/189 (41%), Positives = 106/189 (56%), Gaps = 7/189 (3%) Frame = +2 Query: 41 THTPFAHIKILMGALKDDNILKLP-KVTHDAELIAN---LPENFDPRDKWPECPTLNEIR 208 T TP + K + LK + +P + D EL N +PE++DPR +W C +L I Sbjct: 52 TATPVPYFKQRLMDLKYIDQNNIPDEEVEDEELEENNDDIPESYDPRIQWANCSSLFHIP 111 Query: 209 DQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEY 388 DQ +CGSCWA + AM+DR+CI S K SA+D+VSCC CG GC GG P A+ + Sbjct: 112 DQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVVSCCTWCGDGCEGGWPISAFRF 171 Query: 389 WKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRM---PCNGDTKTPKCQKNCESSYNVP 559 G+V+GG+YN+ CRPYEI PC HH GN C G TP+C++ C Y Sbjct: 172 HADEGVVTGGDYNTKGSCRPYEIHPCGHH--GNETYYGECVGMADTPRCKRRCLLGYPKS 229 Query: 560 FKKEQRYGK 586 + ++ Y K Sbjct: 230 YPSDRYYKK 238 >UniRef50_P43508 Cluster: Cathepsin 
B-like cysteine proteinase 4 precursor; n=5; Caenorhabditis|Rep: Cathepsin B-like cysteine proteinase 4 precursor - Caenorhabditis elegans Length = 335 Score = 153 bits (371), Expect = 3e-36 Identities = 77/198 (38%), Positives = 102/198 (51%), Gaps = 5/198 (2%) Frame = +2 Query: 5 KQNTWKAGRNFPTHTPFAHIK--ILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKW 178 KQ+ WKA P +K ++ + + V HD +P FD R +W Sbjct: 35 KQSLWKA--EIPKDITIEQVKKRLMRTEFVAPHTPDVEVVKHDINE-DTIPATFDARTQW 91 Query: 179 PECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCN 358 P C ++N IRDQ CGSCWAF A EA +DR CI SN + SAED++SCC CG GC Sbjct: 92 PNCMSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSCCSNCGYGCE 151 Query: 359 GGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMP-CNGD-TKTPKCQK 532 GG P AW+Y G +GG+Y + GC+PY + PC V P C D TP C Sbjct: 152 GGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPACVN 211 Query: 533 NC-ESSYNVPFKKEQRYG 583 C +YNV + ++ +G Sbjct: 212 KCTNKNYNVAYTADKHFG 229 >UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator americanus|Rep: Cysteine proteinase 4 - Necator americanus (Human hookworm) Length = 339 Score = 153 bits (370), Expect = 4e-36 Identities = 73/187 (39%), Positives = 104/187 (55%), Gaps = 3/187 (1%) Frame = +2 Query: 38 PTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQG 217 PT+ F +I+ + K P+ L LPE FD R+KWP C ++ IRD Sbjct: 54 PTNEQFVKARIMDIKYMTEASHKYPR--KGINLNVELPERFDAREKWPHCASIGLIRDHS 111 Query: 218 SCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAWEYWK 394 +CGSCWA A M+DR+CI +N T S+ D+++CC CG GC GG P A+ Y + Sbjct: 112 ACGSCWAVSAASVMSDRLCIQTNGTNQKILSSADILACCGEDCGSGCEGGYPIQAYFYLE 171 Query: 395 HVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPC--NGDTKTPKCQKNCESSYNVPFKK 568 + G+ SGG Y C+PY PC+ GN PC G TPKC+K C+ Y VP+++ Sbjct: 172 NTGVCSGGEYREKNVCKPYPFYPCD----GNYGPCPKEGAFDTPKCRKICQFRYPVPYEE 227 Query: 569 EQRYGKH 589 ++ +GK+ Sbjct: 228 DKVFGKN 234 >UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7; Rhabditida|Rep: Cysteine proteinase 3 - Necator americanus (Human hookworm) Length = 
360 Score = 153 bits (370), Expect = 4e-36 Identities = 64/153 (41%), Positives = 91/153 (59%), Gaps = 1/153 (0%) Frame = +2 Query: 125 DAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH 304 D + +P +FD RDKWP+C ++ IRDQ CGSCWA + E M+DR+C+ SN T Sbjct: 83 DMDFSEEIPVSFDARDKWPKCTSIGFIRDQSHCGSCWAVSSAETMSDRLCVQSNGTIKVL 142 Query: 305 FSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPG 484 S D+++CCP CG GC GG AWEY+K+ G+ +GG Y + C+PY PC+ G Sbjct: 143 LSDTDILACCPNCGAGCGGGHTIRAWEYFKNTGVCTGGLYGTKDSCKPYAFYPCKDESYG 202 Query: 485 NRMPCNGDT-KTPKCQKNCESSYNVPFKKEQRY 580 C D+ TPKC+K C+ Y+ + ++ Y Sbjct: 203 K---CPKDSFPTPKCRKICQYKYSKKYADDKYY 232 >UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3 precursor; n=4; Caenorhabditis|Rep: Cathepsin B-like cysteine proteinase 3 precursor - Caenorhabditis elegans Length = 370 Score = 151 bits (367), Expect = 9e-36 Identities = 68/148 (45%), Positives = 90/148 (60%), Gaps = 2/148 (1%) Frame = +2 Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325 LP+ FD R+KWP+C T+ IR+Q +CGSCWAFGA E ++DRVCI SN T+ S ED++ Sbjct: 92 LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 151 Query: 326 SCC-PICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCN 502 SCC CG GC GG A +W G V+GG+Y GC PY PC + P Sbjct: 152 SCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDY-GGHGCMPYSFAPCTKNCP------- 203 Query: 503 GDTKTPKCQKNCESSYNV-PFKKEQRYG 583 ++ TP C+ C+SSY +KK++ YG Sbjct: 204 -ESTTPSCKTTCQSSYKTEEYKKDKHYG 230 >UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2; Arthropoda|Rep: Cathepsin B-like cysteine protease - Callosobruchus maculatus (Southern cowpea weevil) (Pulse bruchid) Length = 330 Score = 151 bits (365), Expect = 2e-35 Identities = 71/197 (36%), Positives = 104/197 (52%), Gaps = 3/197 (1%) Frame = +2 Query: 5 KQNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPE 184 K WKAGRNF T +I+ L+ + + + H+ + +LPE FD R +W + Sbjct: 35 KNLPWKAGRNFERDTSLYNIQRLLSVGTINPPSEFETIFHEDDG-KDLPEEFDARKQWSK 93 Query: 185 
CPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGL---GC 355 C ++ EIRDQ CGSCWA + M+DR+CI S+ SA D++ CC C GC Sbjct: 94 CESIKEIRDQSGCGSCWAVSSASVMSDRICIQSDQKNQLRISAADMIECCESCTFSVDGC 153 Query: 356 NGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKN 535 +GG+P+ + WK G VSGG YNS+ GC Y +P C P C P C+K Sbjct: 154 HGGIPSFTFTEWKDSGFVSGGEYNSTNGCMSYPLPRCN---PS----CKTLYDAPTCKKE 206 Query: 536 CESSYNVPFKKEQRYGK 586 C+ + +++++ Y K Sbjct: 207 CDKGSPLKYEEDKHYAK 223 >UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA; n=1; Tribolium castaneum|Rep: PREDICTED: similar to CG10992-PA - Tribolium castaneum Length = 325 Score = 147 bits (355), Expect = 3e-34 Identities = 70/152 (46%), Positives = 94/152 (61%), Gaps = 3/152 (1%) Frame = +2 Query: 5 KQNTWKAGRNFPTHTPFAHIKILMG--ALKDDNILKLPKVTHDAELIANLPENFDPRDKW 178 +Q +WKA N IK +G L D K+ H I ++PE+FD R+KW Sbjct: 33 EQISWKAETNC------LDIKSRLGFLGLHPDPNYKIQTKQHKISRIISIPESFDAREKW 86 Query: 179 PECP-TLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355 PEC + +IR+QG+CGSCWAF + E MTDR+CI S F FS E+L++CC CG GC Sbjct: 87 PECKDVIGKIRNQGNCGSCWAFASTEVMTDRLCISSKGKIKFVFSPENLLTCCKDCGCGC 146 Query: 356 NGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPY 451 GG AW+Y+ + G+ SGG+YNSS+GC+PY Sbjct: 147 KGGYIKNAWDYYINEGIASGGDYNSSEGCQPY 178 >UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8; Leishmania|Rep: Cathepsin B-like protease - Leishmania major Length = 340 Score = 144 bits (350), Expect = 1e-33 Identities = 73/185 (39%), Positives = 97/185 (52%), Gaps = 5/185 (2%) Frame = +2 Query: 2 KKQNTWKAGRN---FPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRD 172 K + W A N T ++ LMG P+ EL +LPE FD + Sbjct: 47 KAKGQWTASANNGYLVTGKSLGEVRKLMGVTDMSTEAVPPRNFSVEELQQDLPEFFDAAE 106 Query: 173 KWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLG 352 WP C T++EIRDQ +CGSCWA AVEA++DR C + S +L+SCC ICGLG Sbjct: 107 HWPMCLTISEIRDQSNCGSCWAIAAVEAISDRYCTFGGVPDR-RMSTSNLLSCCFICGLG 165 Query: 353 CNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDT--KTPKC 526 C+GG+PT+AW +W VG+ 
+++ C+PY PC HH + P T TPKC Sbjct: 166 CHGGIPTVAWLWWVWVGI-------ATEDCQPYPFDPCSHHGNSEKYPPCPSTIYDTPKC 218 Query: 527 QKNCE 541 CE Sbjct: 219 NTTCE 223 >UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoidea|Rep: Cathepsin B-S precursor - Tuberaphis styraci Length = 349 Score = 144 bits (348), Expect = 2e-33 Identities = 67/193 (34%), Positives = 98/193 (50%), Gaps = 4/193 (2%) Frame = +2 Query: 14 TWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIA--NLPENFDPRDKWPEC 187 TWKA R FP +T + L+G+ N ++ L N P+ FD R+ W C Sbjct: 39 TWKAERYFPANTSEEYFIGLLGSRGYKNYTNEVEIKKYDPLYVENNSPKQFDSRENWKSC 98 Query: 188 PTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGM 367 + IRDQG+CGSCW+F A DR+C+ + + S E+L CC CG GC GG Sbjct: 99 KQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEELAFCCMDCGKGCGGGY 158 Query: 368 PTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNCESS 547 P AW+Y++ G+ +GG+Y++ +GC PY++PPC N + +C K C Sbjct: 159 PIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCYDEQGKNTCGGKPMERNHQCPKTCYGK 218 Query: 548 YNVP--FKKEQRY 580 V +K + Y Sbjct: 219 TTVQDRYKTKNEY 231 >UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8.1; n=1; Caenorhabditis elegans|Rep: Putative uncharacterized protein W07B8.1 - Caenorhabditis elegans Length = 335 Score = 143 bits (347), Expect = 2e-33 Identities = 67/155 (43%), Positives = 92/155 (59%), Gaps = 7/155 (4%) Frame = +2 Query: 140 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319 ++L +FD R++WPEC ++ +I D C + WAF A E+M+DR+CI S K+ SAE+ Sbjct: 74 SDLSPSFDARERWPECMSIPQINDISECKTSWAFAAAESMSDRLCINSGGFKNTILSAEE 133 Query: 320 LVSCCP---ICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNR 490 L+SCC CG GC GG P AW+Y + G+ +GG+Y S GC+PY IPPC V Sbjct: 134 LLSCCTGMFSCGEGCEGGNPFKAWQYIQKHGIPTGGSYESQFGCKPYSIPPCGKTVGNVT 193 Query: 491 MP-CNGDTK-TPKCQKNCES--SYNVPFKKEQRYG 583 P C T TP C+K C S Y + K++ YG Sbjct: 194 YPACTNTTSPTPSCEKKCTSRIGYPIDIDKDRHYG 228 >UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115, whole genome shotgun sequence; n=1; Paramecium 
tetraurelia|Rep: Chromosome undetermined scaffold_115, whole genome shotgun sequence - Paramecium tetraurelia Length = 332 Score = 143 bits (346), Expect = 3e-33 Identities = 68/152 (44%), Positives = 87/152 (57%), Gaps = 11/152 (7%) Frame = +2 Query: 131 ELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFS 310 E + NLP +F ++KWP CP++ I DQG+CGSCWA A M+DR+CI S T S Sbjct: 66 EKLENLPPSFSAQEKWPGCPSIELIPDQGNCGSCWAVSAASTMSDRLCIASGQTDKRQIS 125 Query: 311 AEDLVSCCPI-CGL----GCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEH- 472 AEDL+SCC I C L GC+GG P AW+Y + G+V+GG YN C+PY PPC H Sbjct: 126 AEDLLSCCGINCELDGNGGCDGGYPYGAWKYLRVDGIVTGGTYNDFSLCKPYSFPPCSHG 185 Query: 473 HVPGNRMPCNGD-----TKTPKCQKNCESSYN 553 + G C D TP C K C ++ Sbjct: 186 NDSGKYSKCENDFFMLTEVTPSCTKKCHPQFS 217 >UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4; Ancylostomatidae|Rep: Cysteine proteinase - Ancylostoma ceylanicum Length = 348 Score = 142 bits (345), Expect = 4e-33 Identities = 63/156 (40%), Positives = 89/156 (57%), Gaps = 3/156 (1%) Frame = +2 Query: 116 VTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATK 295 V + E+ ++P+ FD RD+WP C ++ IRDQ SCGSCWA A AM+DRVC +N Sbjct: 84 VLANTEMKVDIPDTFDARDRWPNCTSMKHIRDQSSCGSCWAVAAASAMSDRVCALTNGRI 143 Query: 296 HFHFSAEDLVSCC-PICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEH 472 + S +++SCC CG GC GG P A+ Y GL +GG Y C+PY PC + Sbjct: 144 NRILSDTEVLSCCFGSCGFGCKGGYPARAFGYAWRYGLSTGGPYGEKDACQPYAFYPCGN 203 Query: 473 HVPGNRM-PCNGDT-KTPKCQKNCESSYNVPFKKEQ 574 H PC + TP C++ C+ Y +PF+K++ Sbjct: 204 HAHEPYYGPCPDELWPTPTCRRTCQLGYPIPFEKDK 239 >UniRef50_P91991 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 356 Score = 138 bits (334), Expect = 9e-32 Identities = 77/206 (37%), Positives = 111/206 (53%), Gaps = 11/206 (5%) Frame = +2 Query: 2 KKQNTWKAGRNFPT-HTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKW 178 KKQ WKA + T A K + +D + + K +D L+ ++P +FD R KW Sbjct: 46 
KKQKLWKAETSRMTFQEKMARAKSIKFIKSNDEVSE--KTGNDNVLV-DIPSSFDSRQKW 102 Query: 179 PECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC----PIC- 343 P C + +RDQ CGS AVE +DR CI SN T ++ SA+D +SCC IC Sbjct: 103 PSCSQIGAVRDQSDCGSAAHLVAVEIASDRTCIASNGTFNWPLSAQDPLSCCVGLMSICG 162 Query: 344 -GLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPG--NRMPCNGDTK 514 G GC+G P ++W+ GL +GGNYN GC+PY I PC+ +PC G Sbjct: 163 DGWGCDGSWPKDILKWWQTHGLCTGGNYNDQFGCKPYSIYPCDKKYANGTTSVPCPG-YH 221 Query: 515 TPKCQKNCESSYNVP--FKKEQRYGK 586 TP C+++C S+ P +K+++ +GK Sbjct: 222 TPTCEEHCTSNITWPIAYKQDKHFGK 247 >UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7; n=2; Haemonchidae|Rep: Cathepsin B-like cysteine protease GCP7 - Haemonchus contortus (Barber pole worm) Length = 348 Score = 136 bits (328), Expect = 5e-31 Identities = 60/163 (36%), Positives = 92/163 (56%), Gaps = 2/163 (1%) Frame = +2 Query: 92 DNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRV 271 +N+L + +T + ++ PE+FD R+KW +CP+L I DQ +CGSCWA A + M+DR+ Sbjct: 82 ENVLPIANITSNDDI----PESFDSREKWKDCPSLRVIPDQSNCGSCWAVSAAQCMSDRL 137 Query: 272 CIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRP 448 CI+S K SA D+++CC CG GC+GG AW++ G+V+GG Y C+P Sbjct: 138 CIHSQGRKKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGGAYKEKGNCKP 197 Query: 449 YEIPPCEHHVPGNRMPC-NGDTKTPKCQKNCESSYNVPFKKEQ 574 Y P C H C + TP C+ C+ Y ++ ++ Sbjct: 198 YVFPQCGAHKGKAFNNCPSHPYATPACKPYCQYGYGKRYENDK 240 >UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8; Trypanosoma|Rep: Cathepsin B-like cysteine protease - Trypanosoma brucei Length = 340 Score = 134 bits (324), Expect = 1e-30 Identities = 71/164 (43%), Positives = 89/164 (54%), Gaps = 5/164 (3%) Frame = +2 Query: 65 KILMGALKDDNILK-LPKVTH-DAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWA 238 K L G +K +N LPK + E A LP +FD + WP CPT+ +I DQ +CGSCWA Sbjct: 65 KRLNGVIKKNNNASILPKRRFTEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWA 124 Query: 239 FGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGG 418 A AM+DR C + H SA DL++CC CG 
GCNGG P AW Y+ GLVS Sbjct: 125 VAAASAMSDRFCT-MGGVQDVHISAGDLLACCSDCGDGCNGGDPDRAWAYFSSTGLVS-- 181 Query: 419 NYNSSQGCRPYEIPPCEHHVPGNR--MPCNG-DTKTPKCQKNCE 541 +Y C+PY P C HH PC+ + TPKC C+ Sbjct: 182 DY-----CQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCNYTCD 220 >UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin B-like cysteine proteinase 4 precursor (Cysteine protease-related 4); n=2; Tribolium castaneum|Rep: PREDICTED: similar to Cathepsin B-like cysteine proteinase 4 precursor (Cysteine protease-related 4) - Tribolium castaneum Length = 360 Score = 134 bits (323), Expect = 2e-30 Identities = 76/201 (37%), Positives = 102/201 (50%), Gaps = 8/201 (3%) Frame = +2 Query: 5 KQNTWKAGRNFPTHTPFAHIKILMGAL---KDDNI---LKLPKVTHDAELIANLPENFDP 166 +Q+ W AG N PF I+ +G L D N +K P+ T + +PE FD Sbjct: 29 QQSAWTAGIN-----PFDDIESRLGFLGIHPDPNFKPEIKEPQATQNV-----IPETFDA 78 Query: 167 RDKWPECPTL-NEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPIC 343 R+ WPEC + IR+QG C S WAF A E M+DR+CI +N S EDL+ CC C Sbjct: 79 REYWPECADIIGNIRNQGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPEDLIDCCHYC 138 Query: 344 GLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPK 523 G C GG AW Y+ GLVSGG+YN+S GC+PY N TP Sbjct: 139 GNQCKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQPYS-------------ELNYYRITPP 185 Query: 524 CQKNCES-SYNVPFKKEQRYG 583 C C++ Y +P+ ++ +G Sbjct: 186 CNTTCQNDKYPIPYVSDKHFG 206 >UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus contortus|Rep: Cysteine proteinase - Haemonchus contortus (Barber pole worm) Length = 350 Score = 133 bits (322), Expect = 3e-30 Identities = 66/169 (39%), Positives = 89/169 (52%), Gaps = 5/169 (2%) Frame = +2 Query: 95 NILKLPKVTHDAELIAN--LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDR 268 N KL KV E N +PE+FD R W C ++ +RDQ CGSCWA A M+DR Sbjct: 75 NARKLYKVKKAEEQTTNEDIPESFDSRIVWKNCSSITYVRDQSRCGSCWAVSAASTMSDR 134 Query: 269 VCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCR 445 +C+ + S D++SCC +CG GC GG LAWE+ + G+V+GG Y CR Sbjct: 135 
ICVQTKGKLQTILSDTDILSCCGRMCGDGCEGGYDHLAWEWVQRFGVVTGGPYQQKGVCR 194 Query: 446 PYEIPPCEHHVPGNRMPCNGD--TKTPKCQKNCESSYNVPFKKEQRYGK 586 PY PC H G R C D TP C+ C+ Y ++K++ + K Sbjct: 195 PYAFHPCGLH-HGRRYDCPWDHSFSTPACKPYCQFGYGKRYEKDKFFVK 242 >UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyledons|Rep: At1g02300/T6A9_10 - Arabidopsis thaliana (Mouse-ear cress) Length = 362 Score = 133 bits (321), Expect = 3e-30 Identities = 78/194 (40%), Positives = 103/194 (53%), Gaps = 5/194 (2%) Frame = +2 Query: 17 WKAGRNFP-THTPFAHIKILMGA--LKDDNILKLPKVTHDAELIANLPENFDPRDKWPEC 187 WKA N + A K L+G L +P V+HD L LP+ FD R W +C Sbjct: 62 WKASFNDRFANATVAEFKRLLGVKPTPKTEFLGVPIVSHDISL--KLPKEFDARTAWSQC 119 Query: 188 PTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP-ICGLGCNGG 364 ++ I DQG CGSCWAFGAVE+++DR CI N + S DL++CC +CG GCNGG Sbjct: 120 TSIGRILDQGHCGSCWAFGAVESLSDRFCIKYN--MNVSLSVNDLLACCGFLCGQGCNGG 177 Query: 365 MPTLAWEYWKHVGLVSGGNYNSSQGCRPY-EIPPCEHHVPGNRMPCNGDTKTPKCQKNCE 541 P AW Y+KH G+V ++ C PY + C H PG C TPKC + C Sbjct: 178 YPIAAWRYFKHHGVV-------TEECDPYFDNTGCSH--PG----CEPAYPTPKCARKCV 224 Query: 542 SSYNVPFKKEQRYG 583 S N +++ + YG Sbjct: 225 SG-NQLWRESKHYG 237 >UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 1 - Rhipicephalus appendiculatus (Brown ear tick) Length = 332 Score = 132 bits (318), Expect = 8e-30 Identities = 68/196 (34%), Positives = 101/196 (51%), Gaps = 4/196 (2%) Frame = +2 Query: 14 TWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTH----DAELIANLPENFDPRDKWP 181 TWKAGRNF +H + G + +H + + PE+F PR+ W Sbjct: 41 TWKAGRNFDEKR--SHSDCVQGGDGASVLTATSTSSHFTSYEEDSRWTCPESFTPREYWS 98 Query: 182 ECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNG 361 C ++ IRDQ +CGSCWAF A E+++DR+CI++N + SAEDL++CC CG GC+G Sbjct: 99 HCSSIRVIRDQSACGSCWAFAAAESISDRICIHTNGKVQVNISAEDLLACCHTCGHGCDG 158 Query: 362 GMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNCE 541 + + LV + GC+PY +PPC VP C TPKCQ C Sbjct: 159 
RCHCSSVAILQGRRLVP-EPVRTEDGCQPYSLPPC---VPN----CTHPEPTPKCQHVCR 210 Query: 542 SSYNVPFKKEQRYGKH 589 Y +++++ + K+ Sbjct: 211 KGYEKSYEEDKHFAKN 226 >UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep: Thiol protease - Trichuris suis Length = 348 Score = 129 bits (311), Expect = 5e-29 Identities = 65/166 (39%), Positives = 88/166 (53%), Gaps = 12/166 (7%) Frame = +2 Query: 125 DAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH 304 D L ++P +FD R W C +LN IRDQ CGSCWA A E M+DR+C+ SN + Sbjct: 77 DRSLALSIPPSFDVRSLWHVC-SLNLIRDQAKCGSCWAVSAAETMSDRICVQSNCSIKAC 135 Query: 305 FSAEDLVSCCPI-CGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYE-IPPCEHHV 478 S D++SCC + CG GCNGG P AW ++ G +GG GC+PY+ P H+ Sbjct: 136 ISDTDILSCCGLYCGYGCNGGFPIEAWRHFTVAGNCTGGKTIDKYGCKPYKPTGPIGRHL 195 Query: 479 PGN-RMPCNGDT---------KTPKCQKNCESSYNVPFKKEQRYGK 586 N PC DT TP+C++ C Y + ++ YGK Sbjct: 196 KRNDYAPCPNDTYYGECVGMADTPRCKRRCLLGYPKSYPSDRYYGK 241 >UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep: Cysteine proteinase - Toxoplasma gondii Length = 569 Score = 127 bits (307), Expect = 2e-28 Identities = 59/142 (41%), Positives = 83/142 (58%), Gaps = 10/142 (7%) Frame = +2 Query: 146 LPENFDPRDKWPECP-TLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322 +P +FD R +P C + +RDQG CGSCWAF + EA DR+CI S + SA+ Sbjct: 274 VPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKRLMPLSAQHT 333 Query: 323 VSCC---PICGLGCNGGMPTLAWEYWKHVGLVSGGNYNS-SQG--CRPYEIPPCEHHVPG 484 SCC GCNGG P +AW +++ G+V+GG++++ +G C PYE+P C HH Sbjct: 334 TSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAHHAKA 393 Query: 485 NRMPCNG---DTKTPKCQKNCE 541 C+ KTPKC+K+CE Sbjct: 394 PFPDCDATLVPRKTPKCRKDCE 415 >UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gambiae str. PEST|Rep: ENSANGP00000012227 - Anopheles gambiae str. 
PEST Length = 218 Score = 127 bits (306), Expect = 2e-28 Identities = 53/97 (54%), Positives = 69/97 (71%), Gaps = 1/97 (1%) Frame = +2 Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325 +PE+FD R+ WP C +L IR+QG+CGSCWA A M+DRVCI+SN T + +AEDL+ Sbjct: 1 IPESFDARNHWPNCESLRAIRNQGTCGSCWAVAAASVMSDRVCIHSNGTINVALAAEDLM 60 Query: 326 SCCPICGLGCNGG-MPTLAWEYWKHVGLVSGGNYNSS 433 CC CG GCNGG + +++YW GLVSGG YNS+ Sbjct: 61 GCCVDCGNGCNGGFLDGTSFQYWVDAGLVSGGAYNST 97 >UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: Cathepsin B - Triticum aestivum (Wheat) Length = 353 Score = 120 bits (288), Expect = 3e-26 Identities = 61/148 (41%), Positives = 85/148 (57%), Gaps = 2/148 (1%) Frame = +2 Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322 +LP+ FD R +W C T+ I DQG CG+CWAF AVEA+ DR CI+ N + S DL Sbjct: 96 DLPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVEALQDRFCIHLNMS--VSLSVNDL 153 Query: 323 VSCCP-ICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPY-EIPPCEHHVPGNRMP 496 ++CC +CG GCNGG P AW Y++ G+V ++ C PY + C+H PG Sbjct: 154 LACCGFLCGSGCNGGYPISAWRYFRRSGVV-------TEECDPYFDQTGCQH--PG---- 200 Query: 497 CNGDTKTPKCQKNCESSYNVPFKKEQRY 580 C TPKCQ+ C+ N +K+ + + Sbjct: 201 CEPAYPTPKCQRKCKVE-NQAWKENKHF 227 >UniRef50_Q5DEC9 Cluster: SJCHGC06356 protein; n=1; Schistosoma japonicum|Rep: SJCHGC06356 protein - Schistosoma japonicum (Blood fluke) Length = 279 Score = 120 bits (288), Expect = 3e-26 Identities = 52/158 (32%), Positives = 84/158 (53%), Gaps = 1/158 (0%) Frame = +2 Query: 116 VTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATK 295 ++H++ + +P +FD R W C T+ +I D+ C + WA V++++DR+CI SN Sbjct: 19 ISHNS-INMEIPRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSISDRICIRSNGRI 77 Query: 296 HFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHH 475 SA D +SC GC G YW G+V+GG+Y GC+PY +P C +H Sbjct: 78 SVQLSARDAISCG--FSPGCFHGSEVEVLVYWITYGIVTGGSYEDQSGCQPYPLPKCSYH 135 Query: 476 VPGNRMPCNGDT-KTPKCQKNCESSYNVPFKKEQRYGK 586 + CN +T + P+C C+ YN + ++ YG+ Sbjct: 136 
PESRFLDCNNNTFEFPQCTNECQDGYNKTYDDDKFYGE 173 >UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 421 Score = 118 bits (283), Expect = 1e-25 Identities = 55/137 (40%), Positives = 76/137 (55%) Frame = +2 Query: 140 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319 +++P+NFD R KWP CP+++ + +QG CGSC+A A +DR CI+SN T S ED Sbjct: 136 SDVPKNFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSNGTFKSLLSEED 195 Query: 320 LVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPC 499 ++ CC +CG C GG P A YW + GLV+GG GCRPY VP + Sbjct: 196 IIGCCSVCG-NCYGGDPLKALTYWVNQGLVTGGR----DGCRPYSF-DLSCGVPCSPATF 249 Query: 500 NGDTKTPKCQKNCESSY 550 + C K C++ Y Sbjct: 250 FEAEEKRTCMKRCQNIY 266 >UniRef50_Q00W63 Cluster: Cysteine proteinase; n=2; Ostreococcus|Rep: Cysteine proteinase - Ostreococcus tauri Length = 362 Score = 105 bits (251), Expect = 1e-21 Identities = 60/148 (40%), Positives = 75/148 (50%), Gaps = 14/148 (9%) Frame = +2 Query: 146 LPENFDPRDKWPECPTL-NEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322 LP+ FD R+KWP+C L +E DQG+CGSCWA +AMTDR+CI +N + H SA L Sbjct: 88 LPDTFDVREKWPKCAALVSEAVDQGACGSCWAVAPAKAMTDRLCIATNGAVNTHVSAIQL 147 Query: 323 VSCCP-----------ICG--LGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPP 463 +SC + G GC GG PT A+E VG+VSGG C PY P Sbjct: 148 LSCNSHSNSAYTYDENLAGGSGGCMGGYPTEAYETAHRVGVVSGGLNGDQDTCMPYPFAP 207 Query: 464 CEHHVPGNRMPCNGDTKTPKCQKNCESS 547 C H PC C + C+ S Sbjct: 208 CHH-------PCE-PNHNAVCPRTCQRS 227 >UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|Rep: Cathepsin B - Streblomastix strix Length = 312 Score = 101 bits (242), Expect = 1e-20 Identities = 52/132 (39%), Positives = 71/132 (53%) Frame = +2 Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 316 +ANLP+ FD R WP C + +I DQG CGSCWA + E + DR CI S + S + Sbjct: 73 VANLPDEFDSRTNWPNCQLIGKIYDQGHCGSCWAMSSFEVLQDRFCIKSEGKQTPELSPQ 132 Query: 317 
DLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMP 496 L SC P C GCNGG + A+ + + G++ + C PY++ C+H PG Sbjct: 133 HLTSCTPGCS-GCNGGWMSTAFGFMQSNGIL-------GEDCIPYQMGKCKH--PG---- 178 Query: 497 CNGDTKTPKCQK 532 C+ TPKC K Sbjct: 179 CS-TWPTPKCNK 189 >UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 314 Score = 95.9 bits (228), Expect = 6e-19 Identities = 56/137 (40%), Positives = 79/137 (57%), Gaps = 3/137 (2%) Frame = +2 Query: 5 KQNTWKAGRN--FPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKW 178 K+++W A RN F T F I +MG K KL + + EL ++P +FD R +W Sbjct: 42 KKSSWTAHRNKNFEGKT-FGDIIGMMGTKKTAAPFKLTE--NGEELKGSIPTSFDSRVQW 98 Query: 179 PECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYS-NATKHFHFSAEDLVSCCPICGLGC 355 P+C ++ I +Q CGSCWAF + E ++DR+CI S N T S + LV+C GC Sbjct: 99 PDC--IHPILNQEQCGSCWAFSSSEVLSDRLCIASNNKTNPGALSPQTLVACDVYGNDGC 156 Query: 356 NGGMPTLAWEYWKHVGL 406 +GG+P LAWEY + GL Sbjct: 157 SGGIPQLAWEYMELKGL 173 >UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1; Ostreococcus tauri|Rep: Cysteine proteinase Cathepsin F - Ostreococcus tauri Length = 498 Score = 91.9 bits (218), Expect = 1e-17 Identities = 60/154 (38%), Positives = 80/154 (51%), Gaps = 6/154 (3%) Frame = +2 Query: 41 THTPFAHIKILMGALK-DDNILKLPKVTHDAELIANLPENFDPRDKWPECPTL-NEIRDQ 214 T +P+A GA D + L +V DA L +LP +FD RD++P+C L +RDQ Sbjct: 222 TLSPYASSDETHGAHPFDRKAVGLGRVKWDA-LKHSLPRHFDARDEYPKCARLIGTVRDQ 280 Query: 215 GSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGG--MPTLAWEY 388 G CGSCWA A E M DR+CI S + S + +SC G GC GG + TL Sbjct: 281 GKCGSCWAVAATEIMNDRLCISSGGKEVAELSPQFALSCYN-SGAGCEGGDVVDTLTLAL 339 Query: 389 WKHVGLVSGGNYNSSQGCRPYEIPPCEH--HVPG 484 K G+ GG + C PY+ PC+H +PG Sbjct: 340 AK--GVPHGGMLDKG-ACLPYQFEPCDHPCMIPG 370 >UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 311 Score = 91.9 bits (218), Expect = 1e-17 
Identities = 51/147 (34%), Positives = 72/147 (48%) Frame = +2 Query: 125 DAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH 304 + + N+PENFD R +WP +++ IR+QG CGSCWAFGA E ++DR I S + Sbjct: 76 EVRVAENIPENFDARKQWPG--SIHPIRNQGQCGSCWAFGASEVLSDRFAIASKNQIYVT 133 Query: 305 FSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPG 484 SA+ LV C + GC+GG P AW Y GL++ Y PY + Sbjct: 134 LSAQQLVD-CDLDNSGCSGGWPINAWNYMVKTGLLTEQCYG------PYYAKQYTCRLTA 186 Query: 485 NRMPCNGDTKTPKCQKNCESSYNVPFK 565 N C + +S+Y +P K Sbjct: 187 NTTDCPWQPGVKARFYHAKSAYKLPAK 213 >UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma japonicum|Rep: SJCHGC02853 protein - Schistosoma japonicum (Blood fluke) Length = 181 Score = 86.2 bits (204), Expect = 5e-16 Identities = 43/92 (46%), Positives = 56/92 (60%), Gaps = 3/92 (3%) Frame = +2 Query: 17 WKAGRNFPTHTPFAHIKILMGALK---DDNILKLPKVTHDAELIANLPENFDPRDKWPEC 187 WKA R T H K +MG L D + L P + H+ ++ LP+ FD R W C Sbjct: 38 WKADRT-KRFTSIHHAKSMMGVLLNSVDQHKLHHPIIHHN-DINIKLPKYFDSRKYWKNC 95 Query: 188 PTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYS 283 ++ IRDQ SCGSCWAFGAVE+M+DR+CI+S Sbjct: 96 SSIRTIRDQSSCGSCWAFGAVESMSDRICIHS 127 >UniRef50_A4RYN2 Cluster: Predicted protein; n=1; Ostreococcus lucimarinus CCE9901|Rep: Predicted protein - Ostreococcus lucimarinus CCE9901 Length = 330 Score = 84.6 bits (200), Expect = 2e-15 Identities = 52/143 (36%), Positives = 71/143 (49%), Gaps = 4/143 (2%) Frame = +2 Query: 146 LPENFDPRDKWPECPTL-NEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322 LP +FD R +P+C L +RDQG CGSCWA A E M DR+C+ ++ S + Sbjct: 112 LPTSFDARVAYPKCSRLLGAVRDQGRCGSCWAVAATEVMNDRLCVATDGENADELSPQYA 171 Query: 323 VSCCPICGLGCNGG--MPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMP 496 +SC G GC+GG + TL + K G+ GG +S+ C PYE C+H P Sbjct: 172 LSCFD-SGSGCDGGDVLDTLRIAFTK--GIPYGGMLDSN-ACLPYEFEACDH-------P 220 Query: 497 CNGDTKTPK-CQKNCESSYNVPF 562 C TP+ C C + F Sbjct: 221 CMVAGTTPQSCPAKCADGSALSF 243 >UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomuscorum|Rep: Cathepsin B - 
Oxytricha trifallax (Sterkiella histriomuscorum) Length = 294 Score = 83.8 bits (198), Expect = 3e-15 Identities = 50/123 (40%), Positives = 59/123 (47%), Gaps = 2/123 (1%) Frame = +2 Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 316 I +PENFD R +W ++ IRDQ CGSCWAFGA EA +DR I K S E Sbjct: 73 IMTVPENFDARQQWGS--KIHAIRDQQQCGSCWAFGATEAFSDRFAING---KDVILSPE 127 Query: 317 DLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGG--NYNSSQGCRPYEIPPCEHHVPGNR 490 DLVS C GCNGG +AWEY G + Y++ G P C R Sbjct: 128 DLVS-CDTNDYGCNGGYMDVAWEYLADHGAATDSCFPYSAGSGFAPACSDKCADGSAMQR 186 Query: 491 MPC 499 C Sbjct: 187 FKC 189 >UniRef50_Q625X3 Cluster: Putative uncharacterized protein CBG01102; n=1; Caenorhabditis briggsae|Rep: Putative uncharacterized protein CBG01102 - Caenorhabditis briggsae Length = 374 Score = 79.4 bits (187), Expect = 6e-14 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 2/79 (2%) Frame = +2 Query: 353 CNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMP-C-NGDTKTPKC 526 C GG AW+YW+ GL +GG+Y S GC+PY I PC+ + P C N +TP C Sbjct: 189 CAGGNVFKAWQYWQKHGLPTGGSYESQFGCKPYSISPCDTVIGNITFPGCLNSTVQTPSC 248 Query: 527 QKNCESSYNVPFKKEQRYG 583 +K C+S Y V K++ YG Sbjct: 249 EKKCKSGYPVELDKDRHYG 267 Score = 68.9 bits (161), Expect = 8e-11 Identities = 28/59 (47%), Positives = 39/59 (66%) Frame = +2 Query: 158 FDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC 334 FD R++WPEC ++ I D C S WAF A E+M+DR+CI S + SA++L+SCC Sbjct: 85 FDARERWPECSSIPIINDISDCKSSWAFSAAESMSDRLCINSGGMINTVLSAQELLSCC 143 >UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|Rep: Cysteine protease - Giardia muris Length = 301 Score = 77.8 bits (183), Expect = 2e-13 Identities = 42/109 (38%), Positives = 59/109 (54%), Gaps = 4/109 (3%) Frame = +2 Query: 92 DNILKLPKVTHDAEL----IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAM 259 +N+ L TH ++L LP+++DPR + C L E+ DQ SCGSCWAF AV Sbjct: 55 ENLRSLRTETHVSQLNLGKTKELPKDYDPRVERAHC--LPEVADQASCGSCWAFSAVATF 112 Query: 260 TDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGL 
406 DR C Y +K H+S + +VSC G CNGG + W++ G+ Sbjct: 113 ADRRCAYGLDSKQVHYSEQYVVSCDFGDG-ACNGGWLSNVWKFLTKTGV 160 >UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|Rep: Cathepsin B - Streblomastix strix Length = 283 Score = 75.4 bits (177), Expect = 9e-13 Identities = 36/89 (40%), Positives = 52/89 (58%) Frame = +2 Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325 +P+ FD R+KWP+ + +RDQG CGSCWAF E + DR+ + + EDLV Sbjct: 63 VPDTFDAREKWPDA--ILPVRDQGECGSCWAFSIAETIGDRLGVL--GCSRGDIAPEDLV 118 Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVGLVS 412 S C I GC+GG +AW++ + GL + Sbjct: 119 S-CDIFDDGCDGGFIDMAWDWCQENGLTT 146 >UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA, isoform A; n=2; Endopterygota|Rep: PREDICTED: similar to CG3074-PA, isoform A - Tribolium castaneum Length = 445 Score = 72.5 bits (170), Expect = 7e-12 Identities = 44/126 (34%), Positives = 59/126 (46%), Gaps = 1/126 (0%) Frame = +2 Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322 +LP FD KWP ++EI+DQG CGS WA +DR I S + SA+ L Sbjct: 196 SLPREFDSEFKWPGW--MSEIQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQHL 253 Query: 323 VSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGN-RMPC 499 +SC CNGG AW Y + +GLV + S IP V N ++P Sbjct: 254 LSCDRRGQQSCNGGYLDRAWSYIRKIGLVDEQCFPYSATNEKCRIPRRGDLVTANCQLPT 313 Query: 500 NGDTKT 517 N D ++ Sbjct: 314 NVDRRS 319 >UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Giardia intestinalis|Rep: Cathepsin B-like CP1 precursor - Giardia lamblia (Giardia intestinalis) Length = 303 Score = 72.1 bits (169), Expect = 9e-12 Identities = 35/96 (36%), Positives = 50/96 (52%) Frame = +2 Query: 116 VTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATK 295 +T EL+ +P FD RD++P+C + DQGSCGSCWAF A+ DR C + Sbjct: 69 ITEVQELVDPIPPQFDFRDEYPQC--VKPALDQGSCGSCWAFSAIGVFGDRRCAMGIDKE 126 Query: 296 HFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVG 403 +S + L+S C + GC+GG W + G Sbjct: 127 AVSYSQQHLIS-CSLENFGCDGGDFQPTWSFLTFTG 161 >UniRef50_P90850 Cluster: 
Uncharacterized peptidase C1-like protein F26E4.3; n=2; Caenorhabditis|Rep: Uncharacterized peptidase C1-like protein F26E4.3 - Caenorhabditis elegans Length = 491 Score = 71.3 bits (167), Expect = 2e-11 Identities = 35/93 (37%), Positives = 49/93 (52%) Frame = +2 Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325 LPE+FD RDKW P ++ + DQG CGS W+ +DR+ I S + S++ L+ Sbjct: 223 LPEHFDARDKWG--PLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQLL 280 Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424 SC GC GG AW Y + +G+V Y Sbjct: 281 SCNQHRQKGCEGGYLDRAWWYIRKLGVVGDHCY 313 >UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: hypothetical protein - Strongylocentrotus purpuratus Length = 450 Score = 70.9 bits (166), Expect = 2e-11 Identities = 37/100 (37%), Positives = 49/100 (49%) Frame = +2 Query: 140 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319 A LPE FD R+ WP ++E+ DQG CGS WA +DR+ I S + S + Sbjct: 195 ARLPETFDARENWPGL--IDEVIDQGKCGSSWAISTASVASDRLAIQSMGEINPRLSEQH 252 Query: 320 LVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQG 439 L+SC GC+GG AW + + G VS Y G Sbjct: 253 LLSCNIRGQRGCSGGYLDRAWYHLRRAGAVSRACYPYHSG 292 >UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Giardia intestinalis|Rep: Cathepsin B-like CP2 precursor - Giardia lamblia (Giardia intestinalis) Length = 300 Score = 70.1 bits (164), Expect = 4e-11 Identities = 33/87 (37%), Positives = 49/87 (56%) Frame = +2 Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322 ++PE+FD R+++P C + E+ DQG CGSCWAF +V DR C+ K +S + + Sbjct: 74 DVPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVAGLDKKPVKYSPQYV 131 Query: 323 VSCCPICGLGCNGGMPTLAWEYWKHVG 403 VS C + CNGG W++ G Sbjct: 132 VS-CDHGDMACNGGWLPNVWKFLTKTG 157 >UniRef50_UPI0000E4866E Cluster: PREDICTED: similar to oxidized-LDL responsive gene 2, partial; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to oxidized-LDL responsive gene 2, partial - 
Strongylocentrotus purpuratus Length = 363 Score = 69.3 bits (162), Expect = 6e-11 Identities = 38/110 (34%), Positives = 58/110 (52%), Gaps = 1/110 (0%) Frame = +2 Query: 98 ILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCI 277 +L + ++ +D A +PE FD R +WP + +++QG+C S WA +DR+ I Sbjct: 207 VLTMHQIQNDMPPEA-IPEEFDARAQWPGL--VEGVQNQGNCASSWAMSTAATASDRLAI 263 Query: 278 YSNAT-KHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424 SN T K+ H S + L+SC GC GG AW Y + G+V+ Y Sbjct: 264 QSNGTFKYMHLSPQHLLSCNVKRQQGCAGGHLDRAWWYMRKRGIVTEDCY 313 >UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia ATCC 50803|Rep: GLP_113_4299_5381 - Giardia lamblia ATCC 50803 Length = 360 Score = 67.7 bits (158), Expect = 2e-10 Identities = 33/87 (37%), Positives = 48/87 (55%) Frame = +2 Query: 149 PENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVS 328 PE++D RD++P C T E+ DQG+CGSCWAF +V+ D C +S + ++ Sbjct: 141 PESYDFRDEYPHCIT--EVVDQGNCGSCWAFSSVQTFADHRCRSGLDATGVSYSVQYVLD 198 Query: 329 CCPICGLGCNGGMPTLAWEYWKHVGLV 409 C GCNGG P A+ + + G V Sbjct: 199 -CDRKDHGCNGGEPVNAFNFLHNTGTV 224 >UniRef50_O02470 Cluster: Cysteine proteinase; n=2; Chromadorea|Rep: Cysteine proteinase - Globodera pallida Length = 53 Score = 67.7 bits (158), Expect = 2e-10 Identities = 28/52 (53%), Positives = 34/52 (65%), Gaps = 1/52 (1%) Frame = +2 Query: 212 QGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPI-CGLGCNGG 364 QG CG CWAF E ++DR CI SN T+ S DL++CC + CG GCNGG Sbjct: 1 QGQCGRCWAFSTAEVISDRTCIASNGTQQPIISPTDLLTCCGMSCGEGCNGG 52 >UniRef50_UPI0000E815AE Cluster: PREDICTED: similar to glucocorticoid-inducible protein; n=1; Gallus gallus|Rep: PREDICTED: similar to glucocorticoid-inducible protein - Gallus gallus Length = 307 Score = 67.3 bits (157), Expect = 2e-10 Identities = 42/136 (30%), Positives = 60/136 (44%), Gaps = 1/136 (0%) Frame = +2 Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325 LP +FD KWP ++E DQG+C WAF +DR+ I+S S ++L+ Sbjct: 153 
LPRHFDAATKWPGM--IHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMTPSLSPQNLL 210 Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYN-SSQGCRPYEIPPCEHHVPGNRMPCN 502 SC GC+GG AW Y + G+V+ Y +SQ +P P H R Sbjct: 211 SCDTRNQRGCSGGRLDGAWWYLRRRGVVTDECYPFTSQDSQPAAQPCMMHSRSTGRGKRQ 270 Query: 503 GDTKTPKCQKNCESSY 550 + P Q + Y Sbjct: 271 ATARCPNPQTHANDIY 286 >UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcoptes scabiei type hominis|Rep: Sar s 1 allergen Yv9053H09 - Sarcoptes scabiei type hominis Length = 253 Score = 67.3 bits (157), Expect = 2e-10 Identities = 39/98 (39%), Positives = 54/98 (55%), Gaps = 4/98 (4%) Frame = +2 Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAV----EAMTDRVCIYSNATKHFHFS 310 +LPE FD RD L++IR+QG CG+CWAF A+ A R I N T+ HFS Sbjct: 36 DLPEKFDLRD----LGYLSKIRNQGRCGACWAFAALASVESAYNRRTRIVHNRTRKHHFS 91 Query: 311 AEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424 ++LV C P GC+G + + +Y + G+V NY Sbjct: 92 EQELVDCSPNTE-GCSGNIISNGLKYVQLRGVVKSANY 128 >UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 234 Score = 66.9 bits (156), Expect = 3e-10 Identities = 35/87 (40%), Positives = 50/87 (57%) Frame = +2 Query: 134 LIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSA 313 ++ ++P+ D R K +NEI+DQ CGSCWAFG+ AM + + S Sbjct: 14 IVGDIPDEIDYRTKG----AVNEIKDQKHCGSCWAFGSCAAMESSWFLKHGTL--YSLSE 67 Query: 314 EDLVSCCPICGLGCNGGMPTLAWEYWK 394 + LV CC C LGC+G +P+LA+EY K Sbjct: 68 QCLVDCCHDC-LGCHGCLPSLAFEYVK 93 >UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to GM06507p - Nasonia vitripennis Length = 483 Score = 66.1 bits (154), Expect = 6e-10 Identities = 36/115 (31%), Positives = 53/115 (46%), Gaps = 1/115 (0%) Frame = +2 Query: 83 LKDDNILKLPKVTHDAELIAN-LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAM 259 L +I ++PK + N LP FD R +W + ++DQG CG+ WA V+ Sbjct: 214 
LHSTDIFQIPKQNKQQWINPNDLPREFDSRIQWGN--DITPVQDQGWCGASWAISTVDVA 271 Query: 260 TDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424 +DR I S + S + L+SC GC GG AW + + G+V Y Sbjct: 272 SDRFAIMSKGIEKVQLSGQHLISCNNRGQRGCKGGYLDRAWLFMRKFGVVDEDCY 326 >UniRef50_Q9UJW2 Cluster: Tubulointerstitial nephritis antigen; n=20; Amniota|Rep: Tubulointerstitial nephritis antigen - Homo sapiens (Human) Length = 476 Score = 64.9 bits (151), Expect = 1e-09 Identities = 37/109 (33%), Positives = 51/109 (46%) Frame = +2 Query: 98 ILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCI 277 +L + ++T +LPE F KWP T + DQ +C + WAF DR+ I Sbjct: 201 LLSMNEMTASLPATTDLPEFFVASYKWPGW-THGPL-DQKNCAASWAFSTASVAADRIAI 258 Query: 278 YSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424 S + S ++L+SCC GCN G AW Y + GLVS Y Sbjct: 259 QSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRAWWYLRKRGLVSHACY 307 >UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n=21; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Globodera pallida Length = 379 Score = 64.1 bits (149), Expect = 2e-09 Identities = 46/134 (34%), Positives = 67/134 (50%), Gaps = 7/134 (5%) Frame = +2 Query: 14 TWKAGRNFPTHTPFAHIKILMG--ALKDDNILKLPKVTHDAELIANLPENFDPRDK-WPE 184 T++ G N PF+ K L G L DN+ + + +LPE+ D RDK W Sbjct: 115 TFRVGENHIADLPFSEYKKLNGYRRLLGDNLRRNASTFLAPMNVGDLPESVDWRDKGW-- 172 Query: 185 CPTLNEIRDQGSCGSCWAF---GAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG-LG 352 + E+++QG CGSCWAF GA+EA R + S ++L+ C G +G Sbjct: 173 ---VTEVKNQGMCGSCWAFSSTGALEAQHAR-----QTGQLISLSEQNLIDCSKKYGNMG 224 Query: 353 CNGGMPTLAWEYWK 394 CNGG+ A++Y K Sbjct: 225 CNGGIMDNAFQYIK 238 >UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin L-like proteinase" precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 315 Score = 63.3 bits (147), Expect = 4e-09 Identities = 31/83 (37%), Positives = 44/83 (53%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355 W + L 
++DQG CGSCWAF ++ ++ I+ N + S ++LV C GC Sbjct: 117 WRDSAVLG-VKDQGQCGSCWAFSTTGSLEGQLAIHKN--QRVPLSEQELVDCDTSRNAGC 173 Query: 356 NGGMPTLAWEYWKHVGLVSGGNY 424 NGG+ T A+ Y K GL S Y Sbjct: 174 NGGLMTDAFNYVKRHGLSSESQY 196 >UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheirus salmonis|Rep: Cysteine proteinase - Lepeophtheirus salmonis (salmon louse) Length = 372 Score = 62.9 bits (146), Expect = 5e-09 Identities = 36/106 (33%), Positives = 54/106 (50%), Gaps = 5/106 (4%) Frame = +2 Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 316 I +LPE+ D W E + ++++QGSCGSCW F AVE + V I +N T S + Sbjct: 112 IKDLPESVD----WREKGVITDVKNQGSCGSCWVFSAVEQIESYVAIENNMTSPPLLSTQ 167 Query: 317 DLVSCCP---ICG--LGCNGGMPTLAWEYWKHVGLVSGGNYNSSQG 439 + SC CG GC G + +A+ Y + G+ + Y + G Sbjct: 168 QITSCSSNPYSCGGSGGCKGAINEIAYMYTQLYGIETEKEYPYTSG 213 >UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin C; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin C - Strongylocentrotus purpuratus Length = 482 Score = 62.5 bits (145), Expect = 7e-09 Identities = 43/131 (32%), Positives = 60/131 (45%), Gaps = 8/131 (6%) Frame = +2 Query: 140 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319 +NLPE FD RD ++ +RDQG CGSC+AF + R+ + +N S ++ Sbjct: 247 SNLPEKFDWRDVGG-IDYVSPVRDQGICGSCYAFASTATQESRLRVMTNNNVKVVMSPQE 305 Query: 320 LVSCCPICGLGCNGGMPTL-AWEYWKHVGLVSGGNYNSSQGCRPYEIPPCE-------HH 475 +VSC GC GG P L A +Y + GLV Y + P C H+ Sbjct: 306 VVSCSEY-AQGCEGGFPYLIAGKYGQDFGLVDETCYPYRERDAPCRQVSCRRFRTSEYHY 364 Query: 476 VPGNRMPCNGD 508 + G CN D Sbjct: 365 IGGFYGACNED 375 >UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - Drosophila melanogaster (Fruit fly) Length = 431 Score = 62.5 bits (145), Expect = 7e-09 Identities = 32/97 (32%), Positives = 47/97 (48%) Frame = +2 Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325 LP +F+ DKW ++E+ DQG CG+ W +DR I S ++ SA++++ Sbjct: 187 
LPSSFNALDKWSSY--ISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNIL 244 Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQ 436 SC GC GG AW Y G+V Y +Q Sbjct: 245 SCTR-RQQGCEGGHLDAAWRYLHKKGVVDENCYPYTQ 280 >UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 326 Score = 62.5 bits (145), Expect = 7e-09 Identities = 35/104 (33%), Positives = 48/104 (46%) Frame = +2 Query: 110 PKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNA 289 P+V H + +LP FD W E + E++DQGSCGSCW+F T + Sbjct: 98 PRVIHSLTPVKDLPSKFD----WREKGAVTEVKDQGSCGSCWSFSTTG--TVEGAYFLKT 151 Query: 290 TKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGN 421 K S ++LV C GC+GG A EY + G + N Sbjct: 152 GKLVSLSEQNLVDCAKEDCYGCSGGYMDKALEYIETAGGIMSEN 195 >UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadidae|Rep: Cysteine protease - Tritrichomonas foetus (Trichomonas foetus) Length = 315 Score = 62.5 bits (145), Expect = 7e-09 Identities = 31/78 (39%), Positives = 45/78 (57%) Frame = +2 Query: 155 NFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC 334 N D D W E +NEI+DQ +CGSCWAF A++A I + + +S ++LV C Sbjct: 100 NVDSID-WREKGVVNEIKDQAACGSCWAFSAIQAAESAYAISTGTLE--SYSEQNLVDCV 156 Query: 335 PICGLGCNGGMPTLAWEY 388 C GC+GG+ A++Y Sbjct: 157 QGC-YGCSGGLMDYAYKY 173 >UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-like precursor; n=26; Euteleostomi|Rep: Tubulointerstitial nephritis antigen-like precursor - Homo sapiens (Human) Length = 467 Score = 62.5 bits (145), Expect = 7e-09 Identities = 39/112 (34%), Positives = 52/112 (46%), Gaps = 2/112 (1%) Frame = +2 Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325 LP F+ +KWP ++E DQG+C WAF +DRV I+S S ++L+ Sbjct: 203 LPTAFEASEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLL 260 Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPY--EIPPCEHH 475 SC GC GG AW + + G+VS Y S R PPC H Sbjct: 261 
SCDTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMH 312 >UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; n=35; Fasciola|Rep: Cathepsin L-like proteinase precursor - Fasciola hepatica (Liver fluke) Length = 326 Score = 62.5 bits (145), Expect = 7e-09 Identities = 31/84 (36%), Positives = 43/84 (51%), Gaps = 1/84 (1%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLG 352 W E + E++DQG+CGSCWAF M + N FS + LV C P G Sbjct: 114 WRESGYVTEVKDQGNCGSCWAFSTTGTMEGQ--YMKNERTSISFSEQQLVDCSGPWGNNG 171 Query: 353 CNGGMPTLAWEYWKHVGLVSGGNY 424 C+GG+ A++Y K GL + +Y Sbjct: 172 CSGGLMENAYQYLKQFGLETESSY 195 >UniRef50_Q24FN2 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 367 Score = 62.1 bits (144), Expect = 9e-09 Identities = 32/87 (36%), Positives = 49/87 (56%), Gaps = 4/87 (4%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC----PIC 343 W + ++ +++QGSCGSCWAF AV A+ + V + N + +S ++LV C Sbjct: 161 WRQSGAVSPVKNQGSCGSCWAFSAV-ALAESVNLLRNNSLAL-YSEQELVDCTYKNPQYY 218 Query: 344 GLGCNGGMPTLAWEYWKHVGLVSGGNY 424 GC GG P++A+ Y K G+ S NY Sbjct: 219 NYGCQGGWPSVAYRYIKDQGISSQQNY 245 >UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Rep: Cathepsin L - Stylonychia lemnae Length = 340 Score = 61.7 bits (143), Expect = 1e-08 Identities = 34/100 (34%), Positives = 48/100 (48%) Frame = +2 Query: 104 KLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYS 283 K K + + ++PE+ D W E +N ++DQG CGSCWAF + ++ R I + Sbjct: 111 KTGKEVYSTPNLKDIPESID----WREKGAVNAVKDQGQCGSCWAFSTIASLESRYFIET 166 Query: 284 NATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVG 403 K S + LV C GCNGG LA +Y G Sbjct: 167 G--KLQSLSEQQLVDCSKNGNEGCNGGDMGLAMDYIASAG 204 >UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35; Viridiplantae|Rep: Cysteine proteinase 15A precursor - Pisum sativum (Garden pea) Length = 363 
Score = 61.7 bits (143), Expect = 1e-08 Identities = 40/106 (37%), Positives = 52/106 (49%), Gaps = 10/106 (9%) Frame = +2 Query: 101 LKLPKVTHDAELI--ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVC 274 L+LP A ++ NLPE+FD R+K P ++DQGSCGSCWAF A+ Sbjct: 115 LRLPAHAQKAPILPTTNLPEDFDWREKGAVTP----VKDQGSCGSCWAFSTTGALEG--A 168 Query: 275 IYSNATKHFHFSAEDLVSCCPI--------CGLGCNGGMPTLAWEY 388 Y K S + LV C + C GCNGG+ A+EY Sbjct: 169 HYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEY 214 >UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 317 Score = 61.3 bits (142), Expect = 2e-08 Identities = 44/134 (32%), Positives = 60/134 (44%), Gaps = 5/134 (3%) Frame = +2 Query: 80 ALKDDNILKLPKVTHDAELIAN----LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGA 247 A+ D ++ PK + +A+ +PE+ D W E +N +RDQ CGSCWAF A Sbjct: 78 AMLDSQLIHKPKRDITSRFVADPQLTVPESID----WREKGAVNPVRDQEQCGSCWAFSA 133 Query: 248 VEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424 A+ + + K S + LV C GCNGG P A++Y K GL Y Sbjct: 134 AGALEGQ--RFLKEGKLEVLSTQQLVDCSRDYKNEGCNGGWPHWAYDYIKDNGLCLESKY 191 Query: 425 NSSQGCRPYEIPPC 466 QG Y C Sbjct: 192 -KYQGYDGYYCKEC 204 >UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio molitor (Yellow mealworm) Length = 336 Score = 61.3 bits (142), Expect = 2e-08 Identities = 43/129 (33%), Positives = 63/129 (48%), Gaps = 2/129 (1%) Frame = +2 Query: 113 KVTHDAELIANL--PENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSN 286 K D L A++ P +FD RD+ P +++QGSCGSCWAF + A+ ++ I + Sbjct: 108 KTREDLGLNASVRYPASFDWRDQGMVSP----VKNQGSCGSCWAFSSTGAIESQMKIANG 163 Query: 287 ATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPC 466 A S + LV C P LGC+GG A+ Y G + S+G PYE+ Sbjct: 164 AGYDSSVSEQQLVDCVP-NALGCSGGWMNDAFTYVAQNGGI------DSEGAYPYEMADG 216 Query: 467 EHHVPGNRM 493 H N++ Sbjct: 217 NCHYDPNQV 225 >UniRef50_Q54TR1 
Cluster: Counting factor associated protein; n=1; Dictyostelium discoideum AX4|Rep: Counting factor associated protein - Dictyostelium discoideum AX4 Length = 531 Score = 61.3 bits (142), Expect = 2e-08 Identities = 34/103 (33%), Positives = 54/103 (52%), Gaps = 2/103 (1%) Frame = +2 Query: 122 HDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHF 301 HD E + ++P D R++ C T ++DQG CGSCW FG+ ++ C+ + + Sbjct: 301 HDDESLRSIPSTVDWRNQ--NCVT--PVKDQGICGSCWTFGSTGSLEGTNCVTNG--ELV 354 Query: 302 HFSAEDLVSCCPICG-LGCNGGMPTLAWEYWKHVG-LVSGGNY 424 S + LV C + G GC GG + A++Y +G L + NY Sbjct: 355 SLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGSLATESNY 397 >UniRef50_Q23RT7 Cluster: Papain family cysteine protease containing protein; n=7; Hymenostomatida|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 387 Score = 61.3 bits (142), Expect = 2e-08 Identities = 42/134 (31%), Positives = 61/134 (45%), Gaps = 5/134 (3%) Frame = +2 Query: 47 TPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCG 226 T + K + A N+ + K T D + +LP++ D W + + ++DQG CG Sbjct: 101 TTLGYSKTVKNAANKQNMFRNLK-TSDKINVKDLPKSVD----WRDAGVVTPVKDQGHCG 155 Query: 227 SCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP---ICG--LGCNGGMPTLAWEYW 391 SCWAF + I + K S + LVSC CG GCNG + LA+ Y Sbjct: 156 SCWAFATTAVIESYAAIATGQLK--TLSTQQLVSCVQNSYQCGGQGGCNGAVSELAYNYV 213 Query: 392 KHVGLVSGGNYNSS 433 + GL S Y+ S Sbjct: 214 QLFGLTSEYKYSYS 227 >UniRef50_Q24E33 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 328 Score = 60.9 bits (141), Expect = 2e-08 Identities = 38/142 (26%), Positives = 65/142 (45%), Gaps = 2/142 (1%) Frame = +2 Query: 5 KQNTWKAGRNFPTH-TPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWP 181 K NT+K N T + + + + ++I + D E + ++P + W Sbjct: 79 KNNTFKLAINIMAILTDEEYSSLYLNLDQQESIDIFDSLVDDNETVGDIPSEVN----WT 134 Query: 182 
ECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPI-CGLGCN 358 + +++QGSCGSCWAF A+ + +N + FS + LV C + +GCN Sbjct: 135 AQGAVTPVKNQGSCGSCWAFSTTGALEGSYFLKNN--QLISFSEQQLVDCSRLYLNMGCN 192 Query: 359 GGMPTLAWEYWKHVGLVSGGNY 424 GG+ A+ Y K G+ + Y Sbjct: 193 GGLMPRAFRYVKAHGITTEEEY 214 >UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine proteinase precursor - Heterodera glycines (Soybean cyst nematode worm) Length = 353 Score = 60.9 bits (141), Expect = 2e-08 Identities = 33/84 (39%), Positives = 45/84 (53%), Gaps = 1/84 (1%) Frame = +2 Query: 140 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319 + LPE D W E + E++DQG CGSCWAF A A+ + A+K S ++ Sbjct: 133 STLPEKLD----WREKGAVTEVKDQGDCGSCWAFSATGAI-EGALAQKKASKIISLSEQN 187 Query: 320 LVSCCPICG-LGCNGGMPTLAWEY 388 LV C G GC+GG+ A+EY Sbjct: 188 LVDCSSKYGNEGCDGGLMDSAFEY 211 >UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=15; Magnoliophyta|Rep: Cysteine proteinase RD19a precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 368 Score = 60.9 bits (141), Expect = 2e-08 Identities = 43/118 (36%), Positives = 59/118 (50%), Gaps = 11/118 (9%) Frame = +2 Query: 104 KLPKVTHDAELIA--NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCI 277 KLPK + A ++ NLPE+FD RD P +++QGSCGSCW+F A A+ + Sbjct: 119 KLPKDANKAPILPTENLPEDFDWRDHGAVTP----VKNQGSCGSCWSFSATGALEGANFL 174 Query: 278 YSNATKHFHFSAEDLVSC--------CPICGLGCNGGMPTLAWEY-WKHVGLVSGGNY 424 + K S + LV C C GCNGG+ A+EY K GL+ +Y Sbjct: 175 ATG--KLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDY 230 >UniRef50_Q9GZM7-2 Cluster: Isoform 2 of Q9GZM7 ; n=1; Homo sapiens|Rep: Isoform 2 of Q9GZM7 - Homo sapiens (Human) Length = 283 Score = 60.5 bits (140), Expect = 3e-08 Identities = 32/98 (32%), Positives = 47/98 (47%) Frame = +2 Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325 LP F+ +KWP ++E DQG+C WAF +DRV I+S S ++L+ Sbjct: 69 LPTAFEASEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLL 
126 Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQG 439 SC GC GG AW + + G + G+ +G Sbjct: 127 SCDTHQQQGCRGGRLDGAWWFLRRRGYAATGDVGREEG 164 >UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanensis|Rep: Sui m 1 allergen - Suidasia medanensis Length = 336 Score = 60.5 bits (140), Expect = 3e-08 Identities = 45/142 (31%), Positives = 65/142 (45%), Gaps = 11/142 (7%) Frame = +2 Query: 125 DAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH 304 ++++ LP FD R +W +R+QG CGSCWAF + + I N H Sbjct: 108 ESDISVALPAAFDWRQQWNTA-----VRNQGQCGSCWAFATAATVEAQYAIRKNV--HVT 160 Query: 305 FSAEDLVSC--CPICGL----GCNGGMPTLAWEYWKHVGLV--SGGNYNSSQG-CRPYEI 457 S + LV C P G GC GG P +A+ Y + GLV S Y + G C+ + Sbjct: 161 LSEQQLVDCDHRPFQGQYEDHGCQGGNPIIAYAYVQQTGLVEESAYPYQARDGQCQSSTV 220 Query: 458 PPCE-HHV-PGNRMPCNGDTKT 517 + +HV G +P N +T Sbjct: 221 NGHQRYHVSAGRELPFNATDET 242 >UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2; cellular organisms|Rep: Cysteine proteinase, putative - Archaeoglobus fulgidus Length = 1088 Score = 60.5 bits (140), Expect = 3e-08 Identities = 29/72 (40%), Positives = 41/72 (56%) Frame = +2 Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 316 +A+LP FD W + L+ +RDQGSCGSCWA AV A+ + + S A+ S + Sbjct: 591 MASLPSRFD----WRDYTGLSAVRDQGSCGSCWAHSAVAALESALIVESGASSSIDLSEQ 646 Query: 317 DLVSCCPICGLG 352 L+SC C +G Sbjct: 647 HLLSCEQDCEVG 658 >UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag-RP - Bombyx mori (Silk moth) Length = 404 Score = 60.1 bits (139), Expect = 4e-08 Identities = 44/129 (34%), Positives = 62/129 (48%), Gaps = 5/129 (3%) Frame = +2 Query: 83 LKDDNILKLPKVTHDAELIA-----NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGA 247 LKD I KL + +I+ P+ FD R +W ++ I DQ CGS WA Sbjct: 159 LKDGLIYKLGTFPLNVTVISYSKDGQYPDEFDARREW--YGYISPIADQDWCGSDWAVSI 216 Query: 248 VEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYN 427 + DR I S T++ S++ L+SC GCNGG +A+++ K GLV Sbjct: 217 
ASIVGDRFSIQSFGTENVRMSSQTLLSCHLKGQRGCNGGNLDIAFDFVKTHGLV------ 270 Query: 428 SSQGCRPYE 454 S+ C PYE Sbjct: 271 -SEQCFPYE 278 >UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|Rep: Cathepsin L precursor - Schistosoma mansoni (Blood fluke) Length = 319 Score = 60.1 bits (139), Expect = 4e-08 Identities = 35/97 (36%), Positives = 51/97 (52%), Gaps = 1/97 (1%) Frame = +2 Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 316 + N+P+NFD W E + E+++QG CGSCWAF + + + K S + Sbjct: 102 VNNIPKNFD----WREKGAVTEVKNQGMCGSCWAFSTTGNVESQ--WFRKTGKLLSLSEQ 155 Query: 317 DLVSCCPICGLGCNGGMPTLAWE-YWKHVGLVSGGNY 424 LV C + GCNGG+P+ A+E K GL+ NY Sbjct: 156 QLVDCDGLDD-GCNGGLPSNAYESIIKMGGLMLEDNY 191 >UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium tetraurelia|Rep: Cathepsin L1 precursor - Paramecium tetraurelia Length = 314 Score = 60.1 bits (139), Expect = 4e-08 Identities = 32/75 (42%), Positives = 41/75 (54%), Gaps = 1/75 (1%) Frame = +2 Query: 203 IRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLA 379 +++QGSCGSCWAF AV A+ I N + + S +DLV C P GCNGG A Sbjct: 126 VKNQGSCGSCWAFSAVGALEINTDIELN--RKYELSEQDLVDCSGPYDNDGCNGGWMDSA 183 Query: 380 WEYWKHVGLVSGGNY 424 +EY GL +Y Sbjct: 184 FEYVADNGLAEAKDY 198 >UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: Vivapain-4 - Plasmodium vivax Length = 484 Score = 59.7 bits (138), Expect = 5e-08 Identities = 35/100 (35%), Positives = 53/100 (53%), Gaps = 2/100 (2%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355 W E ++EI++Q CGSCWAFGAV A+ + I N +H S ++LV C GC Sbjct: 268 WREHNAVSEIKNQNLCGSCWAFGAVGAVESQYAIRKN--QHVLISEQELVDCSD-KNFGC 324 Query: 356 NGGMPTLAWEYWKHVGLVSGGNYNSSQGCRP--YEIPPCE 469 GG+ +LA++ +G + + G +P EI C+ Sbjct: 325 FGGLASLAFDDMIDLGYLCSESDYPYVGFKPRKCEIKKCK 364 >UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - 
Trichomonas vaginalis G3 Length = 306 Score = 59.7 bits (138), Expect = 5e-08 Identities = 28/82 (34%), Positives = 44/82 (53%) Frame = +2 Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322 N+ + W E +N+I++QG+CGSCWAF A++ + +V N + + S ++L Sbjct: 83 NIKNDVPTEIDWREQGIVNKIKNQGACGSCWAFSAIQVIESQVA--KNQKQLYDLSEQNL 140 Query: 323 VSCCPICGLGCNGGMPTLAWEY 388 + C C GC GG A EY Sbjct: 141 LDCVTSC-FGCGGGWSPGALEY 161 >UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: Cathepsin L - Kudoa thyrsites Length = 300 Score = 59.7 bits (138), Expect = 5e-08 Identities = 36/116 (31%), Positives = 58/116 (50%), Gaps = 2/116 (1%) Frame = +2 Query: 110 PKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNA 289 PK T ++ + LP + D W + +++QG CGSCW+F A A+ I + Sbjct: 90 PKETATKDIKSTLPSSVD----WKALGKVTSVKNQGHCGSCWSFSAAGAIESAYAIKTG- 144 Query: 290 TKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGN--YNSSQGCRPY 451 + +FS + LV C GCNGG+P +A+ Y + G++ + Y + QG Y Sbjct: 145 -ELVNFSEQQLVDCSTE-NHGCNGGLPEIAFLYVINNGIMKLKDYPYTAKQGTCQY 198 >UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicalis|Rep: Blo t 1 allergen - Blomia tropicalis (Mite) Length = 333 Score = 59.7 bits (138), Expect = 5e-08 Identities = 43/124 (34%), Positives = 55/124 (44%), Gaps = 10/124 (8%) Frame = +2 Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322 +LP+NFD W + L IR QGSCGSCWAF A I + S ++L Sbjct: 112 SLPQNFD----WRQKARLTRIRQQGSCGSCWAFAAAGVAESLYSIQKQ--QSIELSEQEL 165 Query: 323 VSC-------CPICGLGCNGGMPTLAWEYWKHVGLVSGGNY---NSSQGCRPYEIPPCEH 472 V C C GC G T A++Y GLV NY +Q C P ++ + Sbjct: 166 VDCTYNRYDSSYQCN-GCGSGYSTEAFKYMIRTGLVEEENYPYNMRTQWCNP-DVEGQRY 223 Query: 473 HVPG 484 HV G Sbjct: 224 HVSG 227 >UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; Theileria|Rep: Cysteine proteinase precursor - Theileria parva Length = 440 Score = 59.7 bits (138), Expect = 5e-08 Identities = 35/122 (28%), Positives = 54/122 (44%) Frame = +2 Query: 95 
NILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVC 274 N+ K D +L EN D W ++ ++DQ +CG CWAF V ++ Sbjct: 212 NLKKALNTDEDVDLAKLTGENLD----WRRSSSVTSVKDQSNCGGCWAFSTVGSVEG--Y 265 Query: 275 IYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYE 454 S+ K + S ++L+ C GC GG+ A+EY + GLVS + R Sbjct: 266 YMSHFDKSYELSVQELLDCDSFSN-GCQGGLLESAYEYVRKYGLVSAKDLPFVDKARRCS 324 Query: 455 IP 460 +P Sbjct: 325 VP 326 >UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypanosoma cruzi|Rep: Cysteine protease, putative - Trypanosoma cruzi Length = 434 Score = 59.3 bits (137), Expect = 7e-08 Identities = 34/83 (40%), Positives = 41/83 (49%), Gaps = 7/83 (8%) Frame = +2 Query: 176 WPEC--PTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSC---CPI 340 W E P L ++DQGSCGSCWA A E++ I S K S + + SC Sbjct: 131 WQEAKNPVLTPVKDQGSCGSCWAHAATESVESMYAISSG--KLLTLSTQQITSCVNNTRK 188 Query: 341 CG--LGCNGGMPTLAWEYWKHVG 403 CG GC GG LAWEY + G Sbjct: 189 CGGSGGCGGGTAQLAWEYIMNTG 211 >UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_56, whole genome shotgun sequence - Paramecium tetraurelia Length = 314 Score = 59.3 bits (137), Expect = 7e-08 Identities = 28/77 (36%), Positives = 40/77 (51%) Frame = +2 Query: 194 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPT 373 + +++QG+CGSCWAF AV A+ + I +K S + LV C GCNGG Sbjct: 122 ITSVKNQGNCGSCWAFSAVGAVETLLTIKGVISKDLWLSEQQLVDCDKGTNNGCNGGFEN 181 Query: 374 LAWEYWKHVGLVSGGNY 424 L ++ K GL + Y Sbjct: 182 LGIQWAKKNGLTTDKQY 198 >UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 280 Score = 58.8 bits (136), Expect = 9e-08 Identities = 31/98 (31%), Positives = 51/98 (52%), Gaps = 3/98 (3%) Frame = +2 Query: 140 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319 ++LP+ FD W 
+ ++++QG+CGSCWAF + + + + + N T +S ++ Sbjct: 66 SSLPQQFD----WRNLGKVTQVKNQGNCGSCWAF-TITGLFESINLIRNKTVEL-YSEQE 119 Query: 320 LVSCCP---ICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424 L+ C GC GG P LA+EY K G+ Y Sbjct: 120 LLDCSSNGIYRNSGCQGGWPHLAFEYSKKNGISLSSQY 157 >UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase) (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1; Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase) (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1; Vignain-2] - Vigna mungo (Rice bean) (Black gram) Length = 362 Score = 58.8 bits (136), Expect = 9e-08 Identities = 33/99 (33%), Positives = 52/99 (52%), Gaps = 1/99 (1%) Frame = +2 Query: 131 ELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFS 310 E + ++P + D W + + +++DQG CGSCWAF + A+ I +N K S Sbjct: 123 EKVGSVPASVD----WRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTN--KLVSLS 176 Query: 311 AEDLVSCCPICGLGCNGGMPTLAWEYWKHV-GLVSGGNY 424 ++LV C GCNGG+ A+E+ K G+ + NY Sbjct: 177 EQELVDCDKEENQGCNGGLMESAFEFIKQKGGITTESNY 215 >UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18; Magnoliophyta|Rep: Thiol protease aleurain precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 358 Score = 58.8 bits (136), Expect = 9e-08 Identities = 33/89 (37%), Positives = 43/89 (48%), Gaps = 1/89 (1%) Frame = +2 Query: 140 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319 A LPE D W E ++ ++DQG CGSCW F A+ + K S + Sbjct: 139 AALPETKD----WREDGIVSPVKDQGGCGSCWTFSTTGAL--EAAYHQAFGKGISLSEQQ 192 Query: 320 LVSCC-PICGLGCNGGMPTLAWEYWKHVG 403 LV C GCNGG+P+ A+EY K G Sbjct: 193 LVDCAGAFNNYGCNGGLPSQAFEYIKSNG 221 >UniRef50_Q23TW3 Cluster: Papain family cysteine protease containing protein; n=5; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 437 Score = 58.4 bits (135), Expect = 1e-07 Identities = 31/91 (34%), Positives = 46/91 (50%), 
Gaps = 2/91 (2%) Frame = +2 Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGS-CGSCWAFGAVEAMTDRVCIYSNATKHFHFSA 313 ++ LP+ D W E + +++ QG CGSCWAF AV A+ + K FS Sbjct: 202 LSQLPQYVD----WREKGVVTQVKSQGKDCGSCWAFAAVAALESHYAL-KTGKKPIQFSE 256 Query: 314 EDLVSCC-PICGLGCNGGMPTLAWEYWKHVG 403 + LV C GC+GG+P+ +EY + G Sbjct: 257 QQLVDCARKFDTKGCSGGLPSKGFEYLAYAG 287 >UniRef50_Q23FR0 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 394 Score = 58.4 bits (135), Expect = 1e-07 Identities = 32/80 (40%), Positives = 39/80 (48%), Gaps = 3/80 (3%) Frame = +2 Query: 194 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGL---GCNGG 364 LN ++DQG CGSCW FGA M I + K FS + LV C G GCNGG Sbjct: 196 LNPVKDQGQCGSCWTFGAAGVMESFNAITNGVLK--SFSEQQLVDCVHQAGFSSDGCNGG 253 Query: 365 MPTLAWEYWKHVGLVSGGNY 424 + EY G+V+ Y Sbjct: 254 FQSDGVEYAIKFGIVTEDKY 273 >UniRef50_Q18740 Cluster: Putative uncharacterized protein tag-329; n=2; Caenorhabditis|Rep: Putative uncharacterized protein tag-329 - Caenorhabditis elegans Length = 374 Score = 58.4 bits (135), Expect = 1e-07 Identities = 32/94 (34%), Positives = 44/94 (46%), Gaps = 1/94 (1%) Frame = +2 Query: 146 LPENFDPRDKWPECP-TLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322 LP+ FD R+K + I+ Q SC CW F A + ++ K + S +++ Sbjct: 140 LPKTFDLRNKKVGGHYIIGPIKTQDSCACCWGFAATAVAEAALTVHLK--KAMNLSEQEV 197 Query: 323 VSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424 C P G GCNGG P EY K +GL G Y Sbjct: 198 CDCAPKHGPGCNGGDPVDGLEYIKEMGLTGGKEY 231 >UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1; Uronema marinum|Rep: Cathepsin L-like cysteine protease - Uronema marinum Length = 333 Score = 58.0 bits (134), Expect = 2e-07 Identities = 33/83 (39%), Positives = 43/83 (51%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355 W + +++QG CGSCWAF AV ++ I N K FS + LVSC P GC Sbjct: 126 
WVSKGAVQGVQNQGVCGSCWAFSAVCSLERLYKI--NTGKLLSFSEQQLVSCEP-KSYGC 182 Query: 356 NGGMPTLAWEYWKHVGLVSGGNY 424 +GG P A+ Y GL S +Y Sbjct: 183 DGGWPEAAFAYSATHGLESSASY 205 >UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 323 Score = 58.0 bits (134), Expect = 2e-07 Identities = 34/107 (31%), Positives = 52/107 (48%), Gaps = 8/107 (7%) Frame = +2 Query: 116 VTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATK 295 +++ + +P +FD R W +C ++ +R+Q SCGSCWA + DR+CI S+ Sbjct: 36 ISYSQNELDTIPASFDVRTNWGDC--MSPVREQQSCGSCWAQVTSGILADRMCIESDKNI 93 Query: 296 HFHFSAEDLVSC---CPI-----CGLGCNGGMPTLAWEYWKHVGLVS 412 S + L+ C C C GC GG LA + G+VS Sbjct: 94 KMLLSPQYLMDCDGSCVSDGVSGCNNGCKGGFVGLALTRLINEGIVS 140 >UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platyhelminthes|Rep: Cathepsin L-like proteinase - Echinococcus multilocularis Length = 338 Score = 58.0 bits (134), Expect = 2e-07 Identities = 31/84 (36%), Positives = 40/84 (47%), Gaps = 1/84 (1%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG-LG 352 W + + I+DQG CGSCWAF A A+ + + K S + LV C G G Sbjct: 128 WRKKGLVTPIKDQGDCGSCWAFSATGALEGQ--LKRKTGKLISLSEQQLVDCSTYTGNEG 185 Query: 353 CNGGMPTLAWEYWKHVGLVSGGNY 424 CNGG A+ YW G S +Y Sbjct: 186 CNGGDMNDAFRYWMRNGAESESDY 209 >UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1; Brugia malayi|Rep: Cathepsin F-like cysteine proteinase - Brugia malayi (Filarial nematode worm) Length = 461 Score = 57.6 bits (133), Expect = 2e-07 Identities = 32/89 (35%), Positives = 44/89 (49%) Frame = +2 Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 316 I NLP FD W + ++DQGSCGSCWAF + I + K S + Sbjct: 245 IYNLPSKFD----WRTEGVVTPVKDQGSCGSCWAFSVTGNIESLWAIKTG--KLISLSEQ 298 Query: 317 DLVSCCPICGLGCNGGMPTLAWEYWKHVG 403 +L+ C + GCNGG+P A+ K +G Sbjct: 299 ELID-CDVIDKGCNGGLPINAFREIKRMG 326 >UniRef50_P25781 Cluster: Cysteine 
proteinase precursor; n=3; Theileria|Rep: Cysteine proteinase precursor - Theileria annulata Length = 441 Score = 57.6 bits (133), Expect = 2e-07 Identities = 27/72 (37%), Positives = 41/72 (56%), Gaps = 1/72 (1%) Frame = +2 Query: 176 WPECPTLNEIRDQGS-CGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLG 352 W ++ I+DQG CGSCWAF ++ ++ +Y N K + S ++LV+ C +G Sbjct: 233 WARTDAVSPIKDQGDHCGSCWAFSSIASVESLYRLYKN--KSYFLSEQELVN-CDKSSMG 289 Query: 353 CNGGMPTLAWEY 388 C GG+P A EY Sbjct: 290 CAGGLPITALEY 301 >UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MGC107932 protein - Xenopus tropicalis (Western clawed frog) (Silurana tropicalis) Length = 333 Score = 57.2 bits (132), Expect = 3e-07 Identities = 32/88 (36%), Positives = 43/88 (48%), Gaps = 1/88 (1%) Frame = +2 Query: 176 WPECPTLNEIRDQGS-CGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLG 352 W + + +++QG+ CGSCWAF V M R CI + + S + LV C I G Sbjct: 121 WRKSNCVTPVKNQGTFCGSCWAFATVGVMESRYCI--RTKELLNLSEQQLVDCDEI-NEG 177 Query: 353 CNGGMPTLAWEYWKHVGLVSGGNYNSSQ 436 C GG P A EY G++ Y SQ Sbjct: 178 CCGGFPIKALEYVAQHGVMRNKEYEYSQ 205 >UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted, possible transmembrane domain near N-terminus; n=4; Cryptosporidium|Rep: Cryptopain-cysteine proteinase secreted, possible transmembrane domain near N-terminus - Cryptosporidium parvum Iowa II Length = 401 Score = 57.2 bits (132), Expect = 3e-07 Identities = 32/82 (39%), Positives = 42/82 (51%), Gaps = 3/82 (3%) Frame = +2 Query: 152 ENFDPRDK--WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325 E F P + W E +N IR+Q +CGSCWAF AV A+ C +N S + V Sbjct: 172 EEFVPPNSINWVEAGCVNPIRNQKNCGSCWAFSAVAALEGATCAQTNRGLP-SLSEQQFV 230 Query: 326 SCCPICG-LGCNGGMPTLAWEY 388 C G GC+GG LA++Y Sbjct: 231 DCSKQNGNFGCDGGTMGLAFQY 252 >UniRef50_Q235G6 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 325 Score = 57.2 bits (132), 
Expect = 3e-07 Identities = 33/129 (25%), Positives = 56/129 (43%) Frame = +2 Query: 41 THTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGS 220 THT FA + + D+ I L + H+ +++ + W E + +++QG Sbjct: 88 THTEFAELYLNPAENIDEEIDSLQPIQHNEDIVID----------WVEKGAVTPVKNQGG 137 Query: 221 CGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHV 400 CG CW+F + +Y N + S + L+ C GC GG+ +A Y K Sbjct: 138 CGGCWSFATTGGVEGANFVYKNVLP--NLSQQQLID-CNTQNKGCGGGLRDIALNYVKET 194 Query: 401 GLVSGGNYN 427 GL + Y+ Sbjct: 195 GLTTEEEYS 203 >UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_36, whole genome shotgun sequence - Paramecium tetraurelia Length = 307 Score = 57.2 bits (132), Expect = 3e-07 Identities = 30/77 (38%), Positives = 40/77 (51%) Frame = +2 Query: 194 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPT 373 +N I++QG+CGSCW F A+ A+ + I S + LV C G GCNGG Sbjct: 118 MNPIKNQGNCGSCWTFSAIGAVEGFLAIRKGFKG--VLSEQQLVDCAVDAGEGCNGGNSD 175 Query: 374 LAWEYWKHVGLVSGGNY 424 LA +Y VG V +Y Sbjct: 176 LALDYIAEVGSVYERDY 192 >UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lamblia ATCC 50803|Rep: GLP_217_11853_10927 - Giardia lamblia ATCC 50803 Length = 308 Score = 56.8 bits (131), Expect = 4e-07 Identities = 27/87 (31%), Positives = 49/87 (56%) Frame = +2 Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325 +P++FD R+++P+C T E+ D G C S WA+ AV+A + R C+ + +SA+ ++ Sbjct: 75 VPDHFDFREEYPQCIT--EVIDIGLCSSSWAYSAVDAFSHRRCLTGLDQEATRYSAQYIL 132 Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVGL 406 SC G ++AW++ G+ Sbjct: 133 SCSSTNGCFGFSTRESIAWDFIATTGI 159 >UniRef50_Q23FS6 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 330 Score = 56.8 bits (131), Expect = 4e-07 Identities = 31/97 (31%), Positives = 43/97 (44%), Gaps = 3/97 
(3%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355 W ++ ++DQG CGSCWAF ++ + I A + S + LV C GC Sbjct: 123 WVTRGKVSAVKDQGQCGSCWAFSTTGSVESALIIAGYANQTIDLSEQQLVD-CSATNYGC 181 Query: 356 NGGMPTLAWEYWKHVGLVSGGNY---NSSQGCRPYEI 457 GG A+EY + L + NY Q C EI Sbjct: 182 GGGWMDNAFEYIEESPLTTNSNYPYVAVDQACNSTEI 218 >UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus|Rep: Cathepsin L - Aphrocallistes vastus Length = 329 Score = 56.8 bits (131), Expect = 4e-07 Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 1/85 (1%) Frame = +2 Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322 +LP D R K P +++QG CGSCW+F A ++ + I S K FS ++L Sbjct: 114 DLPTTVDWRSKGVVTP----VKNQGQCGSCWSFSATGSLEGQYAIKSG--KLVSFSEQEL 167 Query: 323 VSCCPICG-LGCNGGMPTLAWEYWK 394 V C G GC GG+ A++YW+ Sbjct: 168 VDCSTSLGNHGCQGGLMDYAFKYWE 192 >UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain - Tetrahymena pyriformis Length = 330 Score = 56.8 bits (131), Expect = 4e-07 Identities = 29/86 (33%), Positives = 38/86 (44%), Gaps = 3/86 (3%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGL-- 349 W L +++Q CGSCWAF + I+ + FS + LV CC G Sbjct: 126 WTAKNVLPPVKNQQQCGSCWAFSTAGMLEGVYNIHESPQTPISFSEQQLVDCCGAQGFGC 185 Query: 350 -GCNGGMPTLAWEYWKHVGLVSGGNY 424 GCNG PT A Y + G+V Y Sbjct: 186 EGCNGAWPTDAVAYTQKFGIVQESQY 211 >UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra erinaceieuropaei|Rep: Cysteine proteinase - Spirometra erinaceieuropaei (Tapeworm) Length = 336 Score = 56.8 bits (131), Expect = 4e-07 Identities = 32/111 (28%), Positives = 54/111 (48%), Gaps = 1/111 (0%) Frame = +2 Query: 107 LPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSN 286 L K+ + L EN W E + +++QG CGSCW+F A A+ + I + Sbjct: 104 LTKLRRKEAVSVPLKENLPDSVNWRERGAVTSVKNQGQCGSCWSFSANGAIEGAIQIKTG 163 Query: 287 ATKHFHFSAEDLVSCCPICG-LGCNGGMPTLAWEYWKHVGLVSGGNYNSSQ 436 A + S + L+ C G GCNGG+ A++Y + G+ + +Y ++ Sbjct: 164 
ALR--SLSEQQLMDCSWDYGNQGCNGGLMPQAFQYAQRYGVEAEVDYRYTE 212 >UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Plasmodium|Rep: Cysteine proteinase precursor - Plasmodium vivax (strain Salvador I) Length = 583 Score = 56.8 bits (131), Expect = 4e-07 Identities = 34/101 (33%), Positives = 54/101 (53%), Gaps = 2/101 (1%) Frame = +2 Query: 128 AELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKH--F 301 A L+A++PE D R+K ++E +DQG CGSCWAF +V + C+Y+ Sbjct: 333 ANLLADVPEILDYREKG----IVHEPKDQGLCGSCWAFASVGNVE---CMYAKEHNKTIL 385 Query: 302 HFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424 S +++V C + GC+GG P ++ Y G+ G +Y Sbjct: 386 TLSEQEVVDCSKL-NFGCDGGHPFYSFIYAIENGICMGDDY 425 >UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens (Human) Length = 331 Score = 56.8 bits (131), Expect = 4e-07 Identities = 34/83 (40%), Positives = 49/83 (59%), Gaps = 2/83 (2%) Frame = +2 Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325 LP++ D R+K C T E++ QGSCG+CWAF AV A+ ++ + + K SA++LV Sbjct: 115 LPDSVDWREKG--CVT--EVKYQGSCGACWAFSAVGALEAQLKLKTG--KLVSLSAQNLV 168 Query: 326 SCC--PICGLGCNGGMPTLAWEY 388 C GCNGG T A++Y Sbjct: 169 DCSTEKYGNKGCNGGFMTTAFQY 191 >UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin O precursor; n=2; Apocrita|Rep: PREDICTED: similar to Cathepsin O precursor - Apis mellifera Length = 374 Score = 56.4 bits (130), Expect = 5e-07 Identities = 28/74 (37%), Positives = 38/74 (51%) Frame = +2 Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322 ++P FD RDK P +R QGSCG+CWAF +E + I N T H S +++ Sbjct: 154 SIPLRFDWRDKGVITP----VRSQGSCGACWAFSTIEVIESMFAI-KNGTLH-SLSVQEM 207 Query: 323 VSCCPICGLGCNGG 364 + C GC GG Sbjct: 208 IDCAKNSNFGCEGG 221 >UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Trypanosoma cruzi|Rep: Cysteine proteinase, putative - Trypanosoma cruzi Length = 392 Score = 56.4 bits (130), Expect = 5e-07 Identities = 39/100 (39%), Positives = 
48/100 (48%), Gaps = 6/100 (6%) Frame = +2 Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH-FSAEDL 322 +P+ D R+ P L ++DQG CGSCWA GA E M I T H S + L Sbjct: 141 IPDEVDYRNSSPAI--LTAVKDQGRCGSCWAHGAAEEMESHFAI---LTGRLHVLSQQQL 195 Query: 323 VSCCP---ICG--LGCNGGMPTLAWEYWKHVGLVSGGNYN 427 SC P CG GC G LA+EY K G+ S Y+ Sbjct: 196 TSCAPNPKKCGGTGGCYGSTADLAYEYAKQ-GITSEWVYS 234 >UniRef50_Q24HI2 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 356 Score = 56.4 bits (130), Expect = 5e-07 Identities = 26/83 (31%), Positives = 44/83 (53%), Gaps = 1/83 (1%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLG 352 W + ++ ++DQ +CGSCW F A+ I+ + + S + L+ C G Sbjct: 133 WKDLNKVSPVKDQQNCGSCWTFSTTGAIESHYAIFED-VEPTSLSEQQLIDCAGAFNNNG 191 Query: 353 CNGGMPTLAWEYWKHVGLVSGGN 421 C+GG+P+ A+EY K+ G +S N Sbjct: 192 CSGGLPSQAFEYIKYNGGISYEN 214 >UniRef50_O16454 Cluster: Temporarily assigned gene name protein 196; n=4; Bilateria|Rep: Temporarily assigned gene name protein 196 - Caenorhabditis elegans Length = 477 Score = 56.4 bits (130), Expect = 5e-07 Identities = 30/81 (37%), Positives = 46/81 (56%) Frame = +2 Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322 +LPE+FD W E + ++++QG+CGSCWAF + I N K S ++L Sbjct: 263 DLPESFD----WREKGAVTQVKNQGNCGSCWAFSTTGNVEGAWFIAKN--KLVSLSEQEL 316 Query: 323 VSCCPICGLGCNGGMPTLAWE 385 V C + GCNGG+P+ A++ Sbjct: 317 VDCDSM-DQGCNGGLPSNAYK 336 >UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 291 Score = 56.4 bits (130), Expect = 5e-07 Identities = 40/130 (30%), Positives = 58/130 (44%), Gaps = 1/130 (0%) Frame = +2 Query: 2 KKQNTWKAGRNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWP 181 K +K N +H + L+G D 
N++ K I + P D R Sbjct: 32 KANANYKLSLNSLSHLTPTEYQSLLGTKIDKNLVSQGKKVRPQ--IKDSPGILDYR---- 85 Query: 182 ECPTLNEIRDQGSCGSCWAFGAVEAM-TDRVCIYSNATKHFHFSAEDLVSCCPICGLGCN 358 E +N IRDQ CGSCWAFG V A ++ +YSN + S ++++ C C GC Sbjct: 86 EMGVVNPIRDQKQCGSCWAFGTVAACESNYALLYSNLPQ---LSEQNIIDCATTC-YGCG 141 Query: 359 GGMPTLAWEY 388 GG+ A + Sbjct: 142 GGIIQAAMSF 151 >UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; n=6; Chrysomelinae|Rep: Cathepsin L-like proteinase precursor - Phaedon cochleariae (Mustard beetle) Length = 324 Score = 56.4 bits (130), Expect = 5e-07 Identities = 35/93 (37%), Positives = 44/93 (47%), Gaps = 1/93 (1%) Frame = +2 Query: 149 PENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVS 328 PE+ D R K P +R+QG CGSCWA A+ + I S + S + LV Sbjct: 111 PESIDWRSKGVVLP----VRNQGECGSCWALSTAAAIESQSAIKSGS--KVPLSPQQLVD 164 Query: 329 CCPICG-LGCNGGMPTLAWEYWKHVGLVSGGNY 424 C G GCNGG +EY K GL S +Y Sbjct: 165 CSTSYGNHGCNGGFAVNGFEYVKDNGLESDADY 197 >UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F like protease; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to cathepsin F like protease - Nasonia vitripennis Length = 1036 Score = 56.0 bits (129), Expect = 6e-07 Identities = 36/112 (32%), Positives = 54/112 (48%), Gaps = 1/112 (0%) Frame = +2 Query: 71 LMGALKDDNILKLPKVT-HDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGA 247 L LK +N + +P T D EL P ++D W + ++DQGSCGSCWAF Sbjct: 795 LKPTLKSENDIPMPMATIPDIEL----PSDYD----WRHHNVVTPVKDQGSCGSCWAFSV 846 Query: 248 VEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVG 403 + + I + S ++LV C + GCNGG+P A+ + +G Sbjct: 847 TGNIEGQYAIKHG--ELLSLSEQELVDCDKL-DSGCNGGLPDTAYRAIEELG 895 >UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestinalis|Rep: GLP_41_8294_9919 - Giardia lamblia ATCC 50803 Length = 541 Score = 56.0 bits (129), Expect = 6e-07 Identities = 33/97 (34%), Positives = 51/97 (52%), Gaps = 4/97 (4%) Frame = +2 Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH----FSA 313 LP++FD RD + + DQG+CGSC+ 
FGAV+AM R+ I +N T S Sbjct: 241 LPDDFDWRDV-NGVSYIPGVLDQGACGSCFTFGAVQAMNSRIMIATNRTDPVGTKTILST 299 Query: 314 EDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424 E + C + GC+GG P + + G+++ +Y Sbjct: 300 EHALD-CNVYSQGCDGGFPEHVLRFAETNGIMTEDDY 335 >UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=176; Viridiplantae|Rep: Cysteine proteinase RD21a precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 462 Score = 56.0 bits (129), Expect = 6e-07 Identities = 32/105 (30%), Positives = 52/105 (49%) Frame = +2 Query: 74 MGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVE 253 +GA + + + ++A + LPE+ D W + + E++DQG CGSCWAF + Sbjct: 113 LGAKMEKKGERRTSLRYEARVGDELPESID----WRKKGAVAEVKDQGGCGSCWAFSTIG 168 Query: 254 AMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEY 388 A+ I + S ++LV C GCNGG+ A+E+ Sbjct: 169 AVEGINQIVTGDL--ITLSEQELVDCDTSYNEGCNGGLMDYAFEF 211 >UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 336 Score = 55.6 bits (128), Expect = 8e-07 Identities = 34/105 (32%), Positives = 51/105 (48%), Gaps = 4/105 (3%) Frame = +2 Query: 122 HDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHF 301 H A+ + LP +FD W + L++++DQG CGSCWAF + + + + N K Sbjct: 118 HTAQDV-QLPASFD----WRDYGILSDVKDQGQCGSCWAF-STTGILEALYFMENRQK-I 170 Query: 302 HFSAEDLVSCCP----ICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424 FS + LV C GC+GG P A +Y G++ Y Sbjct: 171 SFSEQQLVDCATNSNGFNSYGCSGGWPEEALKYVAKFGILKEEQY 215 >UniRef50_Q231X3 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 323 Score = 55.6 bits (128), Expect = 8e-07 Identities = 26/84 (30%), Positives = 38/84 (45%), Gaps = 1/84 (1%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG-LG 352 W E ++ ++ QG+CGSCWAF A ++ + I K S + L+ 
C G G Sbjct: 121 WVEAGKVSNVKSQGNCGSCWAFSATASVESALIIAGKVDKSISLSEQQLIDCSGDYGNYG 180 Query: 353 CNGGMPTLAWEYWKHVGLVSGGNY 424 C G A Y K + + NY Sbjct: 181 CAAGQKEQALVYIKRYSITTEQNY 204 >UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis|Rep: Cysteine protease 2 - Babesia bovis Length = 445 Score = 55.6 bits (128), Expect = 8e-07 Identities = 30/84 (35%), Positives = 41/84 (48%) Frame = +2 Query: 155 NFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC 334 NF+ D W + ++DQG CGSCWAF AV ++ + S ++LVS C Sbjct: 236 NFEDID-WRRADAVTPVKDQGMCGSCWAFAAVGSVES---LLKRQKTDVRLSEQELVS-C 290 Query: 335 PICGLGCNGGMPTLAWEYWKHVGL 406 + GCNGG A Y K G+ Sbjct: 291 QLGNQGCNGGYSDYALNYIKFNGI 314 >UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; n=16; Chrysomelidae|Rep: Digestive cysteine protease intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 326 Score = 55.6 bits (128), Expect = 8e-07 Identities = 41/130 (31%), Positives = 60/130 (46%), Gaps = 2/130 (1%) Frame = +2 Query: 41 THTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGS 220 TH F I L G +K+ L +L +P++ D W E + E++DQ Sbjct: 79 THEEFKDI--LKGQIKNKPRLNATPTVFPEDL--EVPDSID----WTEKGAVLEVKDQNP 130 Query: 221 CGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLG-C-NGGMPTLAWEYWK 394 CGSCWAF A A+ + I +N S + L+ C G G C GG + A+EY + Sbjct: 131 CGSCWAFSATGALEGQNAILNNV--KISLSEQQLLDCSAAYGNGNCKEGGDMSAAFEYVR 188 Query: 395 HVGLVSGGNY 424 G+ S +Y Sbjct: 189 DYGIQSEKSY 198 >UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin B-like cysteine peptidase - Trichomonas vaginalis G3 Length = 288 Score = 55.6 bits (128), Expect = 8e-07 Identities = 38/131 (29%), Positives = 61/131 (46%), Gaps = 2/131 (1%) Frame = +2 Query: 2 KKQNTWKAGRN--FPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDK 175 +K W AG N F T F ++ G +P + ++ ++P +++ ++ Sbjct: 20 EKDLPWVAGENERFKGMT-FKDASVISGNAHKLRPDTIP-LARPPKINISIPMSYNFTER 77 Query: 176 
WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355 +P+C + DQG CGSCW+F ++ + R C N K FS LV+ C GC Sbjct: 78 FPQCDF--GVLDQGKCGSCWSFAVSKSFSHRYCRKYN--KPVLFSQSHLVA-CDRRNSGC 132 Query: 356 NGGMPTLAWEY 388 GG+ AW Y Sbjct: 133 GGGIEVNAWRY 143 >UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, whole genome shotgun sequence; n=7; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_22, whole genome shotgun sequence - Paramecium tetraurelia Length = 350 Score = 55.6 bits (128), Expect = 8e-07 Identities = 31/92 (33%), Positives = 43/92 (46%), Gaps = 2/92 (2%) Frame = +2 Query: 155 NFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC 334 N P D W +N+++DQG CGSCWAF + + + S + LV C Sbjct: 142 NATPID-WRTRGAVNKVKDQGQCGSCWAFSTTGVLEGFYKVQTGELP--DLSEQQLVDCS 198 Query: 335 PICGL--GCNGGMPTLAWEYWKHVGLVSGGNY 424 + GC+GGMP+ A Y K GL + Y Sbjct: 199 TLIDFNQGCDGGMPSRALNYVKRNGLTTQDAY 230 >UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor; n=17; Magnoliophyta|Rep: Thiol protease aleurain-like precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 358 Score = 55.6 bits (128), Expect = 8e-07 Identities = 28/82 (34%), Positives = 42/82 (51%), Gaps = 1/82 (1%) Frame = +2 Query: 161 DPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-P 337 D +D W E ++ +++QG CGSCW F A+ + K S + LV C Sbjct: 143 DTKD-WREDGIVSPVKEQGHCGSCWTFSTTGAL--EAAYHQAFGKGISLSEQQLVDCAGT 199 Query: 338 ICGLGCNGGMPTLAWEYWKHVG 403 GC+GG+P+ A+EY K+ G Sbjct: 200 FNNFGCHGGLPSQAFEYIKYNG 221 >UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: hypothetical protein - Strongylocentrotus purpuratus Length = 331 Score = 55.2 bits (127), Expect = 1e-06 Identities = 33/99 (33%), Positives = 48/99 (48%), Gaps = 3/99 (3%) Frame = +2 Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 316 + +P +D R P P + +++Q SCG+CWAF VE M ++ + + SA+ Sbjct: 124 LKTMPLVYDLRSIKP--PVVTPVKNQKSCGACWAFSVVETMETQIAL--KTKRLTQLSAQ 
179 Query: 317 DLVSCCPICG-LGCNGGMP--TLAWEYWKHVGLVSGGNY 424 +LV C G GC GG+P TL W LV Y Sbjct: 180 ELVDCGTAAGDGGCRGGIPCKTLDWLNRTKTSLVPESTY 218 >UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Bigelowiella natans|Rep: Digestive cysteine proteinase - Bigelowiella natans (Pedinomonas minutissima) (Chlorarachnion sp.(strain CCMP 621)) Length = 360 Score = 55.2 bits (127), Expect = 1e-06 Identities = 33/86 (38%), Positives = 41/86 (47%), Gaps = 3/86 (3%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNAT--KHFHFSAEDLVSCCPICGL 349 W + L ++DQG CGSCWAF A +A+ I N T S E LV C Sbjct: 115 WRDFNALTPVKDQGGCGSCWAFSATQALESAHYIKHNDTLDSPIALSTEQLVE-CDQHDY 173 Query: 350 GCNGGMPTLAWEYWKHV-GLVSGGNY 424 C GG P A +Y K GLV+ +Y Sbjct: 174 ACYGGFPRDAMKYIKESGGLVAEADY 199 >UniRef50_Q23H32 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 365 Score = 55.2 bits (127), Expect = 1e-06 Identities = 34/97 (35%), Positives = 52/97 (53%), Gaps = 3/97 (3%) Frame = +2 Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKH-FHFSAED 319 ++PE+ D R+K + ++ QG CGSCWAF V A+ Y+ T + FS ++ Sbjct: 134 SVPESVDWREK-----LVAPVQKQGGCGSCWAFSTVIALEG---AYAKQTGNVIKFSEQN 185 Query: 320 LVSCCPICGLGCNGGMPTLAWEYWKHV--GLVSGGNY 424 L+ CC I GCNGG P A + +V G++ +Y Sbjct: 186 LIDCCRIENNGCNGGDPEPALDCVMNVLKGIMKNQDY 222 >UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_101, whole genome shotgun sequence - Paramecium tetraurelia Length = 306 Score = 55.2 bits (127), Expect = 1e-06 Identities = 37/104 (35%), Positives = 52/104 (50%), Gaps = 3/104 (2%) Frame = +2 Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322 NLPE+ D K +N +++QG+CGS W+F AV A + I+ T HF +S ++L Sbjct: 109 NLPESVDWSSK------MNPVKNQGTCGSGWSFSAVGAF-EAFFIFVKGT-HFQYSEQNL 160 
Query: 323 VSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY---NSSQGCR 445 V C GC+GG P A +Y G Y S + CR Sbjct: 161 VD-CDTNSHGCDGGYPAKAIDYLNKNGAFLESEYPYVASKEKCR 203 >UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa zeasingle nucleocapsid nuclear polyhedrosis virus) Length = 367 Score = 55.2 bits (127), Expect = 1e-06 Identities = 30/80 (37%), Positives = 44/80 (55%) Frame = +2 Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325 LP+ +D W + + I+DQG CGSCWAF A+ + + I N K S + L+ Sbjct: 156 LPDYYD----WRDTNKVTPIKDQGVCGSCWAFVAIGNIESQYAIRHN--KLIDLSEQQLL 209 Query: 326 SCCPICGLGCNGGMPTLAWE 385 C + LGCNGG+ LA++ Sbjct: 210 DCDEV-DLGCNGGLMHLAFQ 228 >UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromeliaceae|Rep: Fruit bromelain precursor - Ananas comosus (Pineapple) Length = 351 Score = 55.2 bits (127), Expect = 1e-06 Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 2/143 (1%) Frame = +2 Query: 2 KKQNTWKAGRN-FPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKW 178 + +N++ G N F T + G NI + P V+ D I+ +P++ D W Sbjct: 74 RNENSYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVSFDDVNISAVPQSID----W 129 Query: 179 PECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCN 358 + +NE+++Q CGSCW+F A+ + IY T + +E V C + GC Sbjct: 130 RDYGAVNEVKNQNPCGSCWSFAAIATVEG---IYKIKTGYLVSLSEQEVLDCAV-SYGCK 185 Query: 359 GGMPTLAWEY-WKHVGLVSGGNY 424 GG A+++ + G+ + NY Sbjct: 186 GGWVNKAYDFIISNNGVTTEENY 208 >UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin L - Misgurnus mizolepis (Mud loach) Length = 337 Score = 54.8 bits (126), Expect = 1e-06 Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 1/74 (1%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLG 352 W E + ++DQG CGSCWAF AM + ++ K S ++LV C P G Sbjct: 122 WREKGYVTPVKDQGECGSCWAFSTTGAMEGQ--MFRKQGKLVSLSEQNLVDCSRPEGNEG 179 Query: 353 CNGGMPTLAWEYWK 394 CNGG+ A++Y K Sbjct: 180 
CNGGLMDQAFQYIK 193 >UniRef50_Q6XM39 Cluster: FirrV-1-A48 precursor; n=1; Feldmannia irregularis virus a|Rep: FirrV-1-A48 precursor - Feldmannia irregularis virus a Length = 373 Score = 54.8 bits (126), Expect = 1e-06 Identities = 27/78 (34%), Positives = 42/78 (53%), Gaps = 2/78 (2%) Frame = +2 Query: 209 DQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP-ICGLGCN-GGMPTLAW 382 DQGSC SCW+ V+ + DRV + +N S ++++SC GL C+ GG+P A+ Sbjct: 80 DQGSCASCWSISVVQMLADRVSVSTNGKIKLKLSVQEMISCWDGHDGLACSKGGVPEKAY 139 Query: 383 EYWKHVGLVSGGNYNSSQ 436 +Y G+ +Y Q Sbjct: 140 QYIIENGIGLAEDYPYEQ 157 >UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma japonicum|Rep: SJCHGC06231 protein - Schistosoma japonicum (Blood fluke) Length = 372 Score = 54.8 bits (126), Expect = 1e-06 Identities = 38/127 (29%), Positives = 58/127 (45%), Gaps = 2/127 (1%) Frame = +2 Query: 14 TWKAG-RNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECP 190 T+K G NF T + ++ L G I K T + A LP+ D W Sbjct: 106 TYKMGVNNFTDKTEY-ELRKLRGYRSACRIAKPKGSTFISSEHAKLPDRVD----WRRNG 160 Query: 191 TLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG-LGCNGGM 367 + +++QG CGSCWAF + A+ + Y + + S + L+ C G GC GG+ Sbjct: 161 AVTPVKNQGQCGSCWAFSSTGAIEGQ--HYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGL 218 Query: 368 PTLAWEY 388 LA++Y Sbjct: 219 MDLAFQY 225 >UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1 precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 355 Score = 54.8 bits (126), Expect = 1e-06 Identities = 30/84 (35%), Positives = 43/84 (51%) Frame = +2 Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 316 I +LP++ D R K P ++DQG CGSCWAF V A+ I + S + Sbjct: 134 ITDLPKSVDWRKKGAVAP----VKDQGQCGSCWAFSTVAAVEGINQITTGNLS--SLSEQ 187 Query: 317 DLVSCCPICGLGCNGGMPTLAWEY 388 +L+ C GCNGG+ A++Y Sbjct: 188 ELIDCDTTFNSGCNGGLMDYAFQY 211 >UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: Cysteine protease - Saprolegnia parasitica Length = 523 Score = 54.4 bits 
(125), Expect = 2e-06 Identities = 35/113 (30%), Positives = 54/113 (47%), Gaps = 9/113 (7%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355 W E + +++QG CGSCWAF A+ + S + S ++LV C +GC Sbjct: 122 WVEQGGVTPVKNQGMCGSCWAFSTTGAIEGAAFVSSK--QLVSVSEQELVDCDHNGDMGC 179 Query: 356 NGGMPTLAWEYWK-HVGLVSGGN--YNSSQG------CRPYEIPPCEHHVPGN 487 NGG+ A+++ K H GL + Y++ +G C+P H VP N Sbjct: 180 NGGLMDNAFKWVKTHKGLCKEEDYPYHAKEGTCALKKCKPVTKVTAFHDVPAN 232 >UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomuscorum|Rep: Cathepsin H - Oxytricha trifallax (Sterkiella histriomuscorum) Length = 366 Score = 54.0 bits (124), Expect = 2e-06 Identities = 29/89 (32%), Positives = 45/89 (50%), Gaps = 1/89 (1%) Frame = +2 Query: 140 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319 AN+P +D W ++ +++QG CGSCW F V + + A + + S + Sbjct: 133 ANIPTEWD----WRTFGVVSPVKNQGKCGSCWTFSTVGCVESHYLLKYGAFR--NLSEQQ 186 Query: 320 LVSCC-PICGLGCNGGMPTLAWEYWKHVG 403 LV C GC+GG+P+ A+EY K G Sbjct: 187 LVDCAGDYDNHGCSGGLPSHAFEYIKDNG 215 >UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; Eukaryota|Rep: Cysteine proteinase 1 precursor - Zea mays (Maize) Length = 371 Score = 54.0 bits (124), Expect = 2e-06 Identities = 41/135 (30%), Positives = 61/135 (45%), Gaps = 12/135 (8%) Frame = +2 Query: 35 FPTHTPFAHIKILMGALKDDNIL--KLPKVTHDAELIAN--LPENFDPRDKWPECPTLNE 202 F TP + +G K L +L + H+A ++ LP++FD W + + Sbjct: 96 FSDLTPAEFRRTYLGLRKSRRALLRELGESAHEAPVLPTDGLPDDFD----WRDHGAVGP 151 Query: 203 IRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSC---C-----PICGLGCN 358 +++QGSCGSCW+F A A+ Y K S + V C C C GCN Sbjct: 152 VKNQGSCGSCWSFSASGALEG--AHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCN 209 Query: 359 GGMPTLAWEYWKHVG 403 GG+ T A+ Y + G Sbjct: 210 GGLMTTAFSYLQKAG 224 >UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma|Rep: Cathepsin C precursor - Schistosoma mansoni (Blood fluke) Length = 454 Score = 54.0 bits (124), Expect = 2e-06 Identities = 36/97 (37%), Positives = 
49/97 (50%), Gaps = 6/97 (6%) Frame = +2 Query: 134 LIANLPENFDPRDKWPECPT-----LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKH 298 L NLP FD W P + IR+QG CGSC+A + A+ R+ + SN ++ Sbjct: 214 LTGNLPLEFD----WTSPPDGSRSPVTPIRNQGICGSCYASPSAAALEARIRLVSNFSEQ 269 Query: 299 FHFSAEDLVSCCPICGLGCNGGMPTL-AWEYWKHVGL 406 S + +V C P GCNGG P L A +Y + GL Sbjct: 270 PILSPQTVVDCSPY-SEGCNGGFPFLIAGKYGEDFGL 305 >UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Rep: Cysteine protease - Solanum lycopersicum (Tomato) (Lycopersicon esculentum) Length = 345 Score = 53.6 bits (123), Expect = 3e-06 Identities = 30/95 (31%), Positives = 51/95 (53%), Gaps = 2/95 (2%) Frame = +2 Query: 110 PKVTHDAELIANLPENFDPRD-KWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSN 286 P + + + I +L +++ P + W E + +++ QG CG CWAF AV ++ Y Sbjct: 114 PMSSTEFKKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEG---AYKI 170 Query: 287 ATKH-FHFSAEDLVSCCPICGLGCNGGMPTLAWEY 388 AT + FS ++L+ C GCNGG T A+++ Sbjct: 171 ATGNLMEFSEQELLD-CTTNNYGCNGGFMTNAFDF 204 >UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcoptes scabiei type hominis|Rep: Sar s 1 allergen Yv6030H07 - Sarcoptes scabiei type hominis Length = 322 Score = 53.6 bits (123), Expect = 3e-06 Identities = 30/89 (33%), Positives = 43/89 (48%), Gaps = 4/89 (4%) Frame = +2 Query: 194 LNEIRDQGSCGSCWAFGAV-EAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMP 370 L IR+QG+CGSCWAF + A ++ + + S + LV C GC+G P Sbjct: 117 LTPIREQGACGSCWAFSTICTAESNYLTTRQAPLNKWTLSEQQLVDCA--SPKGCDGEKP 174 Query: 371 TLAWEYWKHVGLVSGGNY---NSSQGCRP 448 T ++Y G+ +G Y Q CRP Sbjct: 175 TTGFKYLLEKGVTTGDRYPYVGKVQPCRP 203 >UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foetus|Rep: TFCP2 protein - Tritrichomonas foetus (Trichomonas foetus) Length = 270 Score = 53.6 bits (123), Expect = 3e-06 Identities = 32/91 (35%), Positives = 43/91 (47%), Gaps = 2/91 (2%) Frame = +2 Query: 122 HDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHF 301 H+ + P +FD W +N I++QGSCGSCWAF A+ A C + Sbjct: 42 
HERIQYKDTPTSFD----WRSEGKVNPIKNQGSCGSCWAFSAIAAQES--CHAIATGELL 95 Query: 302 HFSAEDLVSC--CPICGLGCNGGMPTLAWEY 388 FS + LV C GC+GG P A +Y Sbjct: 96 RFSEQSLVDCVTSDYSCQGCSGGWPDQAMKY 126 >UniRef50_Q23H10 Cluster: Papain family cysteine protease containing protein; n=14; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 336 Score = 53.6 bits (123), Expect = 3e-06 Identities = 45/140 (32%), Positives = 63/140 (45%), Gaps = 12/140 (8%) Frame = +2 Query: 41 THTPFAHIKILMGALKDDNILK--LPKVTH------DAELIANLPENFDPRDKWPECPTL 196 T FA KILM + D+++K + TH + +L +N D D W + Sbjct: 82 TKEEFAE-KILMKSDLVDHLMKGISQEATHNDTNNNETQLSSNSLTLADSID-WRTKGAV 139 Query: 197 NEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSC-CPICG---LGCNGG 364 +++QG CGSCW+F A M I + A FS + LV C P G GCNGG Sbjct: 140 TSVKNQGGCGSCWSFSAAAVMESFNFIQNKAL--VDFSEQQLVDCVIPANGYNSYGCNGG 197 Query: 365 MPTLAWEYWKHVGLVSGGNY 424 P +Y VG+ + Y Sbjct: 198 WPVQCLDYASKVGITTLDKY 217 >UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=17; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 318 Score = 53.6 bits (123), Expect = 3e-06 Identities = 24/71 (33%), Positives = 37/71 (52%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355 W +N I+DQ CGSCWAF V+A + + + + +++V C C GC Sbjct: 106 WRNAKIVNPIKDQAQCGSCWAFSVVQAQESQWALKKG--QLLSLAEQNMVDCVDTC-YGC 162 Query: 356 NGGMPTLAWEY 388 +GG LA++Y Sbjct: 163 DGGDEYLAYDY 173 >UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_23, whole genome shotgun sequence - Paramecium tetraurelia Length = 321 Score = 53.6 bits (123), Expect = 3e-06 Identities = 36/110 (32%), Positives = 49/110 (44%), Gaps = 6/110 (5%) Frame = +2 Query: 77 GALKDDNILKLPKVTHDAELIANLPENFDP-----RDKWPECPTLNEIRDQGSCGSCWAF 241 G L D L + + 
N+ +N +P W + + I+DQG CGSCWAF Sbjct: 87 GDLTDQEFLTIYLNLQMPARVKNIQKNEEPFLVQEEVDWVQKGKVPAIKDQGDCGSCWAF 146 Query: 242 GAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAWEY 388 AV A+ I N + S +DLV C P GC+GG A +Y Sbjct: 147 SAVGALEINTKIQFN--EIVDLSEQDLVDCAGPYGNAGCDGGWMESALDY 194 >UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; Methanospirillum hungatei JF-1|Rep: Peptidase C1A, papain precursor - Methanospirillum hungatei (strain JF-1 / DSM 864) Length = 1096 Score = 53.6 bits (123), Expect = 3e-06 Identities = 44/132 (33%), Positives = 64/132 (48%), Gaps = 1/132 (0%) Frame = +2 Query: 59 HIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWA 238 H+K L LK I+ +T + LP +FD R+ + T I++QGSCGSCWA Sbjct: 296 HLKGLRHDLKSSTIVSGAGITP----MEGLPTSFDWRNNGGDYTT--PIKNQGSCGSCWA 349 Query: 239 FGAVEAMTDRVCIYS-NATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSG 415 F A I S N + ++ + LV+C GCNGG+ T A Y+ + +SG Sbjct: 350 FATTGAFESYKEIKSGNPGMNPDYAEQYLVNCAG-DQRGCNGGLFT-AMAYFVNKAGLSG 407 Query: 416 GNYNSSQGCRPY 451 G ++ PY Sbjct: 408 GVGTVTEANYPY 419 >UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Plasmodium (Vinckeia)|Rep: Cysteine proteinase precursor - Plasmodium vinckei Length = 506 Score = 53.6 bits (123), Expect = 3e-06 Identities = 37/120 (30%), Positives = 58/120 (48%), Gaps = 6/120 (5%) Frame = +2 Query: 83 LKDDNILKLPKVTHDAELIA------NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFG 244 LK I+ L K + LI+ + P++ D R K+ P +DQG+CGSCWAF Sbjct: 236 LKSKYIVPLKKHLANTNLISVDNKSKDFPDSRDYRSKFNFLPP----KDQGNCGSCWAFA 291 Query: 245 AVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424 A+ + + +++ FS + +V C GC+GG P A+ Y + G+ G Y Sbjct: 292 AI-GNFEYLYVHTRHEMPISFSEQQMVDCSTE-NYGCDGGNPFYAFLYMINNGVCLGDEY 349 >UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata|Rep: Cathepsin L2 precursor - Homo sapiens (Human) Length = 334 Score = 53.6 bits (123), Expect = 3e-06 Identities = 37/112 (33%), Positives = 55/112 (49%), Gaps = 1/112 (0%) Frame = +2 Query: 71 
LMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAV 250 +MG ++ K KV + L +LP++ D R K P +++Q CGSCWAF A Sbjct: 91 MMGCFRNQKFRK-GKVFREP-LFLDLPKSVDWRKKGYVTP----VKNQKQCGSCWAFSAT 144 Query: 251 EAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAWEYWKHVG 403 A+ + ++ K S ++LV C P GCNGG A++Y K G Sbjct: 145 GALEGQ--MFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENG 194 >UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 360 Score = 53.2 bits (122), Expect = 4e-06 Identities = 34/98 (34%), Positives = 46/98 (46%) Frame = +2 Query: 89 DDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDR 268 DDN K P + D NLP +FD RDK P ++ Q CG CWAF V+++ Sbjct: 117 DDNKNKQPHLPTD-----NLPASFDWRDKGAITP----VKVQNGCGGCWAFSTVQSIEG- 166 Query: 269 VCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAW 382 + K S + ++ CC I GC GG P A+ Sbjct: 167 -LYFLKTGKLESLSTQQVIDCCRIDESGCLGGDPEPAF 203 >UniRef50_Q22W19 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 53.2 bits (122), Expect = 4e-06 Identities = 29/99 (29%), Positives = 47/99 (47%) Frame = +2 Query: 128 AELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHF 307 + LI +L + P W + + +++QG CGSCWAF V + Y+ AT + Sbjct: 113 SHLIYSLKGDVAPSIDWRQKNAVTPVKNQGQCGSCWAFSTVGGLEG---AYAIATGNLTS 169 Query: 308 SAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424 +E + C GCNGG A++Y G+ + +Y Sbjct: 170 FSEQQIVDCSKANAGCNGGDLPPAYKYVVQNGIETEADY 208 >UniRef50_Q22DX2 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 358 Score = 53.2 bits (122), Expect = 4e-06 Identities = 32/94 (34%), Positives = 46/94 (48%) Frame = +2 Query: 143 
NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322 ++P ++D R P L + +QG CGSCWAF A+ N T + S + L Sbjct: 146 SIPSSWDIRTDGPGL--LQPVENQGQCGSCWAFSTSGAVESYYSAKKNIT--LNLSKQQL 201 Query: 323 VSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424 V C G GC+GG A++Y + VG+V Y Sbjct: 202 VDCVYDHG-GCDGGWFNDAFKYIQSVGIVLNATY 234 >UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep: Aca s 1 allergen - Acarus siro (Dust mite) Length = 331 Score = 53.2 bits (122), Expect = 4e-06 Identities = 35/100 (35%), Positives = 45/100 (45%), Gaps = 6/100 (6%) Frame = +2 Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322 NLPE FD R K L I +QG CG+CWAF ++ + I N H S ++L Sbjct: 108 NLPETFDWRSK------LGPIENQGRCGACWAFASLATVEAAFAIKYNT--HIRLSKQEL 159 Query: 323 VSC------CPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424 V C P GC GG A +Y + G+V Y Sbjct: 160 VECTRESDHTPYENSGCQGGYSWEALKYVQVTGVVEEAAY 199 >UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 350 Score = 52.8 bits (121), Expect = 6e-06 Identities = 48/165 (29%), Positives = 68/165 (41%), Gaps = 15/165 (9%) Frame = +2 Query: 11 NTWKAGRN-FP--THTPFAHIKILMGALK-----DDNILKLPKVTHDAELIANLPENFDP 166 NT+K N F T FAH ++L LK + P++ + N + FD Sbjct: 87 NTYKLQHNQFSDMTKDEFAH-RVLNSQLKTSASSSSQPAQTPQLRGSVDASLNASQGFDW 145 Query: 167 RDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP--- 337 R+ L +++QG CGSCW F + + + FS +D+V C Sbjct: 146 RNYQG---VLGNVKNQGQCGSCWTFATAGVLESYYAL--KYQQSLIFSEQDIVDCASRSY 200 Query: 338 -ICGLGCNGGMPTLAWEYWKHVGLVSGGNYN--SSQG-CRPYEIP 460 GCNGG P+ +Y VGLV Y + QG CR P Sbjct: 201 GYQSDGCNGGFPSEGLQYASTVGLVQSDYYPYVAVQGTCRQVNAP 245 >UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2; Taeniidae|Rep: Cathepsin L-like cysteine proteinase - Taenia solium (Pork tapeworm) Length = 339 Score = 52.8 bits (121), Expect = 6e-06 Identities = 31/84 (36%), Positives 
= 43/84 (51%), Gaps = 1/84 (1%) Frame = +2 Query: 140 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319 A LP+ D RDK + E+++QG+CGSCWAF + A+ K S + Sbjct: 122 AGLPDTVDWRDK----NLVTEVKNQGNCGSCWAFSSTGALEG--AFAKKTGKLISLSEQQ 175 Query: 320 LVSCCPICGL-GCNGGMPTLAWEY 388 LV C G GCNGG + A++Y Sbjct: 176 LVDCSLKNGNDGCNGGYMSYAFKY 199 >UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 395 Score = 52.8 bits (121), Expect = 6e-06 Identities = 24/76 (31%), Positives = 41/76 (53%), Gaps = 2/76 (2%) Frame = +2 Query: 203 IRDQGSCGSCWAFGAVEAMTDRVCIYSNATKH--FHFSAEDLVSCCPICGLGCNGGMPTL 376 +RDQG C SCW FG++ A+ R I + ++ H SA++ ++C GC G P Sbjct: 201 VRDQGECKSCWVFGSLAALESRYLIKNGVSEKSTLHLSAQNAMNCIT---SGCESGWPAN 257 Query: 377 AWEYWKHVGLVSGGNY 424 ++Y++ G+ +Y Sbjct: 258 VFDYFESSGIAFEKDY 273 >UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Piroplasmida|Rep: Cysteine proteinase, putative - Theileria parva Length = 460 Score = 52.8 bits (121), Expect = 6e-06 Identities = 36/120 (30%), Positives = 59/120 (49%), Gaps = 3/120 (2%) Frame = +2 Query: 56 AHIKILMGAL-KDDNILKLPKVTHDAELIANLPENFDPRD-KWPECPTLNEIRDQG-SCG 226 +H+ LM + D+ LK K + + + P+N W + +++I++QG CG Sbjct: 214 SHVDRLMARMVSDETYLKNLKKALNTDKDVD-PKNITGEGLDWRKADGVSKIKNQGLECG 272 Query: 227 SCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGL 406 SCWAF +V ++ IY N T S ++LV C GC GG A +Y ++ G+ Sbjct: 273 SCWAFASVSSVESLYKIYRNVT--LDLSEQELVD-CETSSKGCEGGFGDTALKYIQNKGV 329 >UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; n=23; Magnoliophyta|Rep: Senescence-specific cysteine protease - Arabidopsis thaliana (Mouse-ear cress) Length = 346 Score = 52.4 bits (120), Expect = 8e-06 Identities = 31/84 (36%), Positives = 41/84 (48%), Gaps = 1/84 (1%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355 W + + I++QGSCG CWAF AV A+ I K S + LV C GC Sbjct: 136 
WRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKG--KLISLSEQQLVD-CDTNDFGC 192 Query: 356 NGGMPTLAWEYWKHV-GLVSGGNY 424 GG+ A+E+ K GL + NY Sbjct: 193 EGGLMDTAFEHIKATGGLTTESNY 216 >UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3; Eukaryota|Rep: Cathepsin-like cysteine protease - Phytophthora infestans (Potato late blight fungus) Length = 635 Score = 52.4 bits (120), Expect = 8e-06 Identities = 34/97 (35%), Positives = 52/97 (53%), Gaps = 4/97 (4%) Frame = +2 Query: 122 HDAELIANLPENFDPRD-KWPECPTLNEIRDQGS-CGSCWAFGAVEAMTDRVCIYSNAT- 292 H+ + +LP+++D RD T ++ + CGSCWA G A++DR+ I NA+ Sbjct: 354 HETMDVTDLPKSWDWRDVNGKNYVTWDKNQHIPKYCGSCWAQGTTSALSDRISILRNASW 413 Query: 293 KHFHFSAEDLVSCCPICGLGCNGGMPTLAWEY-WKHV 400 S + L++C G CNGG P L +EY +HV Sbjct: 414 PEIALSPQVLINC--HAGGTCNGGNPGLVYEYAHRHV 448 Score = 41.1 bits (92), Expect = 0.019 Identities = 33/111 (29%), Positives = 49/111 (44%), Gaps = 12/111 (10%) Frame = +2 Query: 122 HDAELIANLPENFDPRDKWPECPTLNEIRDQGS---CGSCWAFGAVEAMTDRVCIYSNAT 292 HD ++ LP+NFD R+ ++ R+Q CGSCW+F A A+ DR+ I+ Sbjct: 48 HDYIDVSKLPKNFDWRNV-NGTRYVSISRNQHIPHYCGSCWSFAATSALADRILIFKERN 106 Query: 293 KHFHFSAE---------DLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGG 418 S E ++ C GC+GG A+ Y K G+ G Sbjct: 107 PGNKPSVEVHRGVVLSPQVILNCDKKDNGCHGGDQLEAYRYIKEHGVPEEG 157 >UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4; core eudicotyledons|Rep: Papain-like cysteine peptidase XBCP3 - Arabidopsis thaliana (Mouse-ear cress) Length = 437 Score = 52.4 bits (120), Expect = 8e-06 Identities = 24/71 (33%), Positives = 37/71 (52%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355 W + + ++DQGSCG+CW+F A AM I + S ++L+ C GC Sbjct: 124 WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDL--ISLSEQELIDCDKSYNAGC 181 Query: 356 NGGMPTLAWEY 388 NGG+ A+E+ Sbjct: 182 NGGLMDYAFEF 192 >UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 2 - Rhipicephalus appendiculatus (Brown ear tick) Length = 564 
Score = 52.4 bits (120), Expect = 8e-06 Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 3/129 (2%) Frame = +2 Query: 62 IKILMGAL--KDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCW 235 I +L G L KD + P H A LP+ D W + ++DQ CGSCW Sbjct: 317 ISVLRGRLQSKDGSSRAEPFPRH--RFTAKLPDQID----WRPYGAVTPVKDQAVCGSCW 370 Query: 236 AFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG-LGCNGGMPTLAWEYWKHVGLVS 412 +FG V + + + S + LV C G GC+GG A+EY GL S Sbjct: 371 SFGTVGELEG--AYFRKTGRLVRLSEQQLVDCSWNNGNNGCDGGEDFRAYEYIADHGLAS 428 Query: 413 GGNYNSSQG 439 +Y + G Sbjct: 429 DEDYGAYIG 437 >UniRef50_Q54VR1 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 291 Score = 52.4 bits (120), Expect = 8e-06 Identities = 35/117 (29%), Positives = 48/117 (41%) Frame = +2 Query: 221 CGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHV 400 CGSCWA G A+ DR+ I T A ++ C C+GG PT A+ Y Sbjct: 76 CGSCWAHGTTSALGDRIKIGRKGTFPEVVLAPQVLLNCAGPDNTCDGGDPTEAYAYMAAK 135 Query: 401 GLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTPKCQKNCESSYNVPFKKE 571 G+ + + C PYE E + G CN D P + +Y F +E Sbjct: 136 GI-------TDETCAPYEAIDNECNAEGICKNCNFDLSNPTADCFAQPTYTTYFVEE 185 >UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (Major excreted protein) (MEP) [Contains: Cathepsin L heavy chain; Cathepsin L light chain]; n=19; Eukaryota|Rep: Cathepsin L precursor (EC 3.4.22.15) (Major excreted protein) (MEP) [Contains: Cathepsin L heavy chain; Cathepsin L light chain] - Homo sapiens (Human) Length = 333 Score = 52.4 bits (120), Expect = 8e-06 Identities = 25/72 (34%), Positives = 39/72 (54%), Gaps = 1/72 (1%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLG 352 W E + +++QG CGSCWAF A A+ + ++ + S ++LV C P G Sbjct: 120 WREKGYVTPVKNQGQCGSCWAFSATGALEGQ--MFRKTGRLISLSEQNLVDCSGPQGNEG 177 Query: 353 CNGGMPTLAWEY 388 CNGG+ A++Y Sbjct: 178 CNGGLMDYAFQY 189 >UniRef50_P81494 Cluster: Cathepsin B; n=2; Phasianidae|Rep: 
Cathepsin B - Coturnix coturnix japonica (Japanese quail) Length = 48 Score = 52.4 bits (120), Expect = 8e-06 Identities = 32/72 (44%), Positives = 39/72 (54%), Gaps = 1/72 (1%) Frame = +2 Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325 LP+ FD R +WP CPT++EIRDQGS +VE SAEDL+ Sbjct: 1 LPDTFDSRKQWPNCPTISEIRDQGSV-------SVEV-----------------SAEDLL 36 Query: 326 SCCPI-CGLGCN 358 SCC CG+GCN Sbjct: 37 SCCGFECGMGCN 48 >UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin W, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to Cathepsin W, partial - Ornithorhynchus anatinus Length = 229 Score = 52.0 bits (119), Expect = 1e-05 Identities = 36/110 (32%), Positives = 50/110 (45%), Gaps = 3/110 (2%) Frame = +2 Query: 128 AELIANLPENFDPRDK--WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHF 301 A +A++PE ++ W + + +++QGSCGSCWAF AV Y A K Sbjct: 56 ANQMASIPEGPLRKETCDWRKRGAITSVKNQGSCGSCWAFAAVG--NAESMWYLRAGKRL 113 Query: 302 HFSAEDLVSCCPICGLGCNGGMPTLAW-EYWKHVGLVSGGNYNSSQGCRP 448 + V C C GC GG P A+ W + GL S +Y RP Sbjct: 114 VSLSVQEVLDCGRCRDGCQGGYPEDAFVTMWFNRGLASEKDYPYKVRARP 163 >UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar to human SRY (sex determining region Y)-box 30 (SOX30),transcript variant 1,; n=3; Mammalia|Rep: Testis cDNA clone: QtsA-12228, similar to human SRY (sex determining region Y)-box 30 (SOX30),transcript variant 1, - Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey) Length = 433 Score = 52.0 bits (119), Expect = 1e-05 Identities = 36/112 (32%), Positives = 55/112 (49%), Gaps = 1/112 (0%) Frame = +2 Query: 71 LMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAV 250 +MG ++ + K K+ + L +LP++ D R K P +++Q CGSCWAF A Sbjct: 91 VMGCFRNQKLRK-GKLFREP-LFLDLPKSVDWRKKGYVTP----VKNQKQCGSCWAFSAT 144 Query: 251 EAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMPTLAWEYWKHVG 403 A+ + ++ K S ++LV C P GCNGG A+ Y K G Sbjct: 145 GALEGQ--MFRKTGKLVSLSEQNLVDCSHPQGNQGCNGGFMNSAFRYVKENG 194 >UniRef50_Q967D5 Cluster: Cathepsin; 
n=1; Geodia cydonium|Rep: Cathepsin - Geodia cydonium (Sponge) Length = 322 Score = 52.0 bits (119), Expect = 1e-05 Identities = 36/101 (35%), Positives = 55/101 (54%), Gaps = 3/101 (2%) Frame = +2 Query: 131 ELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNAT-KHFHF 307 E ++ LP D R K + +++QG CGSCWAF A ++ + + NAT K Sbjct: 98 EDVSALPTTVDWRTKG----YVTGVKNQGQCGSCWAFSATGSLEGQ---HFNATGKLVSL 150 Query: 308 SAEDLVSCCPICG-LGCNGGMPTLAWEY-WKHVGLVSGGNY 424 S ++LV C G GCNGG+P A++Y K+ G+ + +Y Sbjct: 151 SEQNLVDCSSAEGNEGCNGGLPDDAFKYVIKNGGIDTEASY 191 >UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 4 - Rhipicephalus appendiculatus (Brown ear tick) Length = 345 Score = 52.0 bits (119), Expect = 1e-05 Identities = 39/148 (26%), Positives = 66/148 (44%), Gaps = 3/148 (2%) Frame = +2 Query: 86 KDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTD 265 K + +L ++ A L + PE + W E + +++QG CGSCWAF + A+ Sbjct: 106 KPPSAQQLAEIPLYAPLFGDTPEFIE----WRENGFVTPVKNQGQCGSCWAFSSTGALEG 161 Query: 266 RVCIYSNATKHFHFSAEDLVSCC--PICGLGCNGGMPTLAWEYWKHV-GLVSGGNYNSSQ 436 +V + + S ++L+ C GCNGG A++Y + GL + Y Q Sbjct: 162 QV--FKRTRRLISLSEQNLMDCAGQRYGNNGCNGGQMPGAFQYVQDAGGLDTEARYPYRQ 219 Query: 437 GCRPYEIPPCEHHVPGNRMPCNGDTKTP 520 G ++ + R+ NG T+ P Sbjct: 220 GTN-FQC-QFSNSFEARRVSVNGHTRVP 245 >UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gambiae str. PEST|Rep: ENSANGP00000018713 - Anopheles gambiae str. 
PEST Length = 559 Score = 52.0 bits (119), Expect = 1e-05 Identities = 30/94 (31%), Positives = 50/94 (53%), Gaps = 1/94 (1%) Frame = +2 Query: 125 DAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH 304 D + +LP +FD W + + E+++QGSCGSCWAF AV + ++ TK Sbjct: 332 DVAGVGDLPRSFD----WRDHGAVTEVKNQGSCGSCWAFSAVGNVEG---LHQIKTKKLE 384 Query: 305 -FSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVG 403 +S ++L+ C + GC GG A++ + +G Sbjct: 385 SYSEQELIDCDKVDN-GCGGGYMDDAFKAIEQLG 417 >UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Longidorus elongatus Length = 358 Score = 52.0 bits (119), Expect = 1e-05 Identities = 31/89 (34%), Positives = 47/89 (52%), Gaps = 4/89 (4%) Frame = +2 Query: 134 LIANLPENFDPRDK--WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHF 307 +I +P+N D W + + +++DQGSCGSCWAF A ++ + Y K Sbjct: 129 MIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCWAFSATGSLEGQ--HYKQTGKLVSL 186 Query: 308 SAEDLVSCCPICG--LGCNGGMPTLAWEY 388 S ++LV C + G GCNGG A++Y Sbjct: 187 SEQNLVD-CDVNGDDEGCNGGYMDGAFQY 214 >UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Rep: Cysteine protease - Babesia equi Length = 438 Score = 52.0 bits (119), Expect = 1e-05 Identities = 27/77 (35%), Positives = 40/77 (51%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355 W + + ++DQG+CGSCWAF AV ++ I + S ++LV+C GC Sbjct: 230 WRKLNGVTPVKDQGNCGSCWAFAAVGSVESLYLIKKG--QALDLSEQELVNCEENSN-GC 286 Query: 356 NGGMPTLAWEYWKHVGL 406 G +P A EY K G+ Sbjct: 287 EGDLPNKALEYIKAKGI 303 >UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precursor; n=3; Plasmodium (Laverania)|Rep: Trophozoite cysteine proteinase precursor - Plasmodium falciparum Length = 569 Score = 52.0 bits (119), Expect = 1e-05 Identities = 41/134 (30%), Positives = 64/134 (47%), Gaps = 5/134 (3%) Frame = +2 Query: 38 PTHTPFAHIKILMGALKDDNILKLPKVTH----DAELIANLPENFDPRDKWPECPTLNEI 205 P H + K LKD NIL T+ + ++ + +PE D R+K ++E Sbjct: 294 
PNHMIEKYSKPFENHLKD-NILISEFYTNGKRNEKDIFSKVPEILDYREKG----IVHEP 348 Query: 206 RDQGSCGSCWAFGAVEAMTDRVCIYSNATKH-FHFSAEDLVSCCPICGLGCNGGMPTLAW 382 +DQG CGSCWAF +V + +++ K+ FS +++V C GC+GG P ++ Sbjct: 349 KDQGLCGSCWAFASVGNIES---VFAKKNKNILSFSEQEVVDCSK-DNFGCDGGHPFYSF 404 Query: 383 EYWKHVGLVSGGNY 424 Y L G Y Sbjct: 405 LYVLQNELCLGDEY 418 >UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 precursor; n=4; Schizophora|Rep: Putative cysteine proteinase CG12163 precursor - Drosophila melanogaster (Fruit fly) Length = 614 Score = 52.0 bits (119), Expect = 1e-05 Identities = 27/86 (31%), Positives = 44/86 (51%) Frame = +2 Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325 LP+ FD W + + ++++QGSCGSCWAF + + + K FS ++L+ Sbjct: 394 LPKEFD----WRQKDAVTQVKNQGSCGSCWAFSVTGNIEGLYAVKTGELK--EFSEQELL 447 Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVG 403 C CNGG+ A++ K +G Sbjct: 448 D-CDTTDSACNGGLMDNAYKAIKDIG 472 >UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin L-like protease; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to cathepsin L-like protease - Nasonia vitripennis Length = 353 Score = 51.6 bits (118), Expect = 1e-05 Identities = 33/98 (33%), Positives = 50/98 (51%), Gaps = 3/98 (3%) Frame = +2 Query: 143 NLPENFDPRDKWPECPTLNEIRDQG-SCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319 N+PE+ D W + + +RDQG +CGSCWAF A A+ + + SA++ Sbjct: 131 NVPEHVD----WRQRGAVTPVRDQGLTCGSCWAFSAAGALEAQ--YFKKTGVLTALSAQN 184 Query: 320 LVSCCPICG-LGCNGGMPTLAWEY-WKHVGLVSGGNYN 427 L+ C G LGC GG L++++ GL NY+ Sbjct: 185 LIDCTMEYGNLGCGGGSAALSFQFVVDQKGLEPEANYS 222 >UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 344 Score = 51.6 bits (118), Expect = 1e-05 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 6/104 (5%) Frame = +2 Query: 152 ENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSC 331 +N P D W + ++ 
QG CGSCW F A A+ + N +FS + ++ C Sbjct: 134 KNAPPMD-WRNASAITPVKQQGKCGSCWTF-ASTAVLESFSFIKNGAPLTNFSEQQILDC 191 Query: 332 CPICGL---GCNGGMPTLAWEYWKHVGLVSGGNY---NSSQGCR 445 G GCNGG + A Y G+ Y QGC+ Sbjct: 192 VYGSGYYSNGCNGGFGSEALNYAIQNGIAPLSQYPYVGKQQGCK 235 >UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; Phytophthora infestans|Rep: Cathepsin-like cysteine protease - Phytophthora infestans (Potato late blight fungus) Length = 376 Score = 51.6 bits (118), Expect = 1e-05 Identities = 29/79 (36%), Positives = 42/79 (53%), Gaps = 1/79 (1%) Frame = +2 Query: 131 ELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFS 310 E + +LP +D W E T+ +++QG CGSCWAF AV AM C Y+ +T Sbjct: 128 ENVEDLPATWD----WREHSTVTPVKNQGQCGSCWAFSAVAAME---CAYALSTGTLESL 180 Query: 311 AEDLVSCCPICGLG-CNGG 364 +E + C + G+ CN G Sbjct: 181 SEQELVDCTLNGIDTCNHG 199 >UniRef50_Q3L7L6 Cluster: Sar s 1 allergen Yv5032C08; n=1; Sarcoptes scabiei type hominis|Rep: Sar s 1 allergen Yv5032C08 - Sarcoptes scabiei type hominis Length = 340 Score = 51.6 bits (118), Expect = 1e-05 Identities = 30/80 (37%), Positives = 44/80 (55%), Gaps = 3/80 (3%) Frame = +2 Query: 194 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATK---HFHFSAEDLVSCCPICGLGCNGG 364 + +IR+Q +CGSCWAF +V A + + + SN T+ + S + LV C GCNG Sbjct: 126 VTKIREQLACGSCWAF-SVTANVESLLLGSNCTRWSTNDWLSPQQLVDCA--SDHGCNGE 182 Query: 365 MPTLAWEYWKHVGLVSGGNY 424 + EY +H G+V G Y Sbjct: 183 KTSTGLEYVQHKGIVKEGVY 202 >UniRef50_Q23EG5 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 429 Score = 51.6 bits (118), Expect = 1e-05 Identities = 26/81 (32%), Positives = 42/81 (51%), Gaps = 5/81 (6%) Frame = +2 Query: 176 WPECPTLNEIRDQGS----CGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PI 340 W E ++ ++DQ + CGSCW F A A+ + + + F+ S + LV C Sbjct: 128 WREKGIVSSVKDQDAVGDDCGSCWTFSATGAIESHLALKTGKAP-FNLSQQQLVDCAGKF 186 Query: 341 CGLGCNGGMPTLAWEYWKHVG 403 GC+GG+P+ 
A+EY + G Sbjct: 187 DNQGCDGGLPSRAFEYIAYAG 207 >UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_79, whole genome shotgun sequence - Paramecium tetraurelia Length = 324 Score = 51.6 bits (118), Expect = 1e-05 Identities = 42/119 (35%), Positives = 57/119 (47%), Gaps = 7/119 (5%) Frame = +2 Query: 110 PKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGS-CGSCWAFGAVEAMTDRVCIYSN 286 PK T+ ++ +P N RD W E + I+DQGS CGS WAF AV + I SN Sbjct: 104 PKSTNKSKSTDYVP-NGQARD-WVEEGKVPPIKDQGSSCGSSWAFSAVGVLE----INSN 157 Query: 287 ATKHFH--FSAEDLVSCC-PICGLGCNGGMPTLAWEYWKHVGLVSGGNY---NSSQGCR 445 S +D++ C P GC+GG +EY + G+ +G Y S Q CR Sbjct: 158 IEFGLETTLSEQDMLDCSGPYGNQGCSGGWMDSGFEYVRDHGIANGSVYPYVGSDQTCR 216 >UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrovirus|Rep: Viral cathepsin - Lymantria dispar multicapsid nuclear polyhedrosis virus (LdMNPV) Length = 356 Score = 51.6 bits (118), Expect = 1e-05 Identities = 27/91 (29%), Positives = 47/91 (51%), Gaps = 1/91 (1%) Frame = +2 Query: 134 LIANLPENFDPRD-KWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFS 310 +I N P + P W E + I++QG+CG+CWAF + ++ + + N + S Sbjct: 135 IILNQPPDKGPLHFDWREQNKVTSIKNQGACGACWAFATLASVESQFAMRHN--RLIDLS 192 Query: 311 AEDLVSCCPICGLGCNGGMPTLAWEYWKHVG 403 + L+ C + +GCNGG+ A+E +G Sbjct: 193 EQQLIDCDSV-DMGCNGGLLHTAFEEIMRMG 222 >UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin O; n=1; Danio rerio|Rep: PREDICTED: similar to cathepsin O - Danio rerio Length = 327 Score = 51.2 bits (117), Expect = 2e-05 Identities = 30/92 (32%), Positives = 40/92 (43%), Gaps = 2/92 (2%) Frame = +2 Query: 155 NFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC 334 N PR W + + + +QGSCG CWAF VEA+ + + + + V C Sbjct: 119 NNPPRFDWRDHGVVGPVHNQGSCGGCWAFSIVEAIES---VSAKVGEKLQQLSVQQVIDC 175 Query: 335 PICGLGCNGGMP--TLAWEYWKHVGLVSGGNY 424 GCNGG P L W + LVS Y Sbjct: 176 SYQNQGCNGGSPVEALYWLTQSKLKLVSEAEY 207 >UniRef50_O22499 Cluster: 
Cysteine proteinase Mir2; n=3; Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays (Maize) Length = 493 Score = 51.2 bits (117), Expect = 2e-05 Identities = 27/84 (32%), Positives = 44/84 (52%), Gaps = 1/84 (1%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355 W E + E++DQG CG CWAF AV A+ I + + S ++L+ C GC Sbjct: 170 WRERGAVAEVKDQGQCGGCWAFSAVAAVEGINKIVTGSL--ISLSEQELIDCDKFQDQGC 227 Query: 356 NGGMPTLAWEYW-KHVGLVSGGNY 424 +GG+ A+ + K+ G+ + +Y Sbjct: 228 DGGLMDNAFVFMIKNGGIDTEADY 251 >UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum|Rep: Falcipain 2 - Plasmodium falciparum Length = 484 Score = 51.2 bits (117), Expect = 2e-05 Identities = 31/93 (33%), Positives = 47/93 (50%), Gaps = 2/93 (2%) Frame = +2 Query: 152 ENFDPRD-KWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVS 328 ENFD W + ++DQ +CGSCWAF ++ ++ + I N K S ++LV Sbjct: 258 ENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKN--KLITLSEQELVD 315 Query: 329 CCPICGLGCNGGMPTLAWEYWKHV-GLVSGGNY 424 C GCNGG+ A+E + G+ G+Y Sbjct: 316 -CSFKNYGCNGGLINNAFEDMIELGGICPDGDY 347 >UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathepsin o - Aedes aegypti (Yellowfever mosquito) Length = 375 Score = 51.2 bits (117), Expect = 2e-05 Identities = 29/94 (30%), Positives = 43/94 (45%) Frame = +2 Query: 83 LKDDNILKLPKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMT 262 +KDD I K D +++ LP+ D RDK P +R QGSCG+CWA V+ +T Sbjct: 134 MKDDIIFSRAK--RDLKILDYLPKVVDWRDKGVVAP----VRSQGSCGACWAISVVDTIT 187 Query: 263 DRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGG 364 + + +++C GC GG Sbjct: 188 S-ISAIKRQQNFSELCLDQVINCAGNGNFGCEGG 220 >UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_21, whole genome shotgun sequence - Paramecium tetraurelia Length = 349 Score = 51.2 bits (117), Expect = 2e-05 Identities = 26/80 (32%), Positives = 40/80 (50%), Gaps = 3/80 (3%) Frame = +2 Query: 194 
LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC---PICGLGCNGG 364 ++E+++QGSCGSCWAF AV A+ + K+ S ++LV C GC+GG Sbjct: 137 VSEVKNQGSCGSCWAFSAVAAL--ETALRQGGVKNVELSEQELVDCAVKDEFESEGCDGG 194 Query: 365 MPTLAWEYWKHVGLVSGGNY 424 ++Y G+ Y Sbjct: 195 EMYDGFQYASKYGIAIRSEY 214 >UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovirus|Rep: Viral cathepsin - Spodoptera litura multicapsid nucleopolyhedrovirus (SpltMNPV) Length = 337 Score = 51.2 bits (117), Expect = 2e-05 Identities = 27/88 (30%), Positives = 47/88 (53%) Frame = +2 Query: 140 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319 A PE+FD W + + ++++QG CGSCWAF A+ + + I ++ S + Sbjct: 124 ARTPESFD----WRKLNKVTKVKEQGVCGSCWAFAAIGNIESQYAIMHDSL--IDLSEQQ 177 Query: 320 LVSCCPICGLGCNGGMPTLAWEYWKHVG 403 L+ C + GC+GG+ LA++ +G Sbjct: 178 LLDCDRV-DQGCDGGLMHLAFQEIIRIG 204 >UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3.4.14.1) (Dipeptidyl-peptidase I) (DPP-I) (DPPI) (Cathepsin C) (Cathepsin J) (Dipeptidyl transferase) [Contains: Dipeptidyl-peptidase 1 exclusion domain chain (Dipeptidyl- peptidase I exclusion domain chain); Dipeptidyl-peptidase 1 heavy chain (Dipeptidyl-peptidase I heavy chain); Dipeptidyl-peptidase 1 light chain (Dipeptidyl-peptidase I light chain)]; n=50; Coelomata|Rep: Dipeptidyl-peptidase 1 precursor (EC 3.4.14.1) (Dipeptidyl-peptidase I) (DPP-I) (DPPI) (Cathepsin C) (Cathepsin J) (Dipeptidyl transferase) [Contains: Dipeptidyl-peptidase 1 exclusion domain chain (Dipeptidyl- peptidase I exclusion domain chain); Dipeptidyl-peptidase 1 heavy chain (Dipeptidyl-peptidase I heavy chain); Dipeptidyl-peptidase 1 light chain (Dipeptidyl-peptidase I light chain)] - Homo sapiens (Human) Length = 463 Score = 51.2 bits (117), Expect = 2e-05 Identities = 32/101 (31%), Positives = 55/101 (54%), Gaps = 1/101 (0%) Frame = +2 Query: 110 PKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNA 289 P + I +LP ++D R+ ++ +R+Q SCGSC++F ++ + R+ I +N Sbjct: 219 
PLTAEIQQKILHLPTSWDWRNVHG-INFVSPVRNQASCGSCYSFASMGMLEARIRILTNN 277 Query: 290 TKHFHFSAEDLVSCCPICGLGCNGGMPTL-AWEYWKHVGLV 409 ++ S +++VSC GC GG P L A +Y + GLV Sbjct: 278 SQTPILSPQEVVSCSQY-AQGCEGGFPYLIAGKYAQDFGLV 317 >UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis thaliana|Rep: Cysteine proteinase - Arabidopsis thaliana (Mouse-ear cress) Length = 348 Score = 50.8 bits (116), Expect = 2e-05 Identities = 33/104 (31%), Positives = 50/104 (48%), Gaps = 4/104 (3%) Frame = +2 Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322 N+ +N + D W + + ++ QG CG CWAF AV A+ I + S + L Sbjct: 124 NVSDNGESMD-WRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKG--ELVSLSEQQL 180 Query: 323 VSCCPICGLGCNGGMPTLAWEY-WKHVGLVSGGNY---NSSQGC 442 + C GC GG+ + A+EY K+ G+ + NY S Q C Sbjct: 181 LDCDRDYNQGCRGGIMSKAFEYIIKNQGITTEDNYPYQESQQTC 224 >UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-ear cress). SAG12 protein; n=2; Dictyostelium discoideum|Rep: Similar to Arabidopsis thaliana (Mouse-ear cress). 
SAG12 protein - Dictyostelium discoideum (Slime mold) Length = 358 Score = 50.8 bits (116), Expect = 2e-05 Identities = 28/79 (35%), Positives = 37/79 (46%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355 W + + ++DQG CGSC+ F AVE + K S + V C P G C Sbjct: 151 WRKKGLVTPVKDQGQCGSCYIFSAVEQI--ETAWIKAGNKPILLSEQQAVDCDPYDG-QC 207 Query: 356 NGGMPTLAWEYWKHVGLVS 412 GG P +EY+ VG VS Sbjct: 208 GGGDPYTVYEYFSQVGGVS 226 >UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_46, whole genome shotgun sequence - Paramecium tetraurelia Length = 336 Score = 50.8 bits (116), Expect = 2e-05 Identities = 28/81 (34%), Positives = 39/81 (48%) Frame = +2 Query: 194 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPT 373 + +++DQG C CWAFGAV A + + T S + L+ C GCNGG Sbjct: 151 ITQVKDQGQCSGCWAFGAVGAAEAWFYVKNKTT--VLLSEQQLID-CDTQSFGCNGGYQN 207 Query: 374 LAWEYWKHVGLVSGGNYNSSQ 436 LA +Y + GL Y +Q Sbjct: 208 LALKYIANHGLNDARVYPYTQ 228 >UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; Entamoeba|Rep: Cysteine proteinase 2 precursor - Entamoeba histolytica Length = 315 Score = 50.8 bits (116), Expect = 2e-05 Identities = 30/94 (31%), Positives = 46/94 (48%), Gaps = 2/94 (2%) Frame = +2 Query: 149 PENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKH-FHFSAEDLV 325 PE+ D W + + IRDQ CGSC+ FG++ A+ R+ I + S E +V Sbjct: 95 PESVD----WRKEGKVTPIRDQAQCGSCYTFGSLAALEGRLLIEKGGDANTLDLSEEHMV 150 Query: 326 SCCPICG-LGCNGGMPTLAWEYWKHVGLVSGGNY 424 C G GCNGG+ + ++Y G+ +Y Sbjct: 151 QCTRDNGNNGCNGGLGSNVYDYIIEHGVAKESDY 184 >UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain] - Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina) Length = 339 Score = 50.8 bits (116), 
Expect = 2e-05 Identities = 28/77 (36%), Positives = 39/77 (50%), Gaps = 1/77 (1%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG-LG 352 W E + ++DQG CGSCWAF + A+ + + A S ++LV C G G Sbjct: 128 WREHGAVTGVKDQGHCGSCWAFSSTGALEGQ--HFRKAGVLVSLSEQNLVDCSTKYGNNG 185 Query: 353 CNGGMPTLAWEYWKHVG 403 CNGG+ A+ Y K G Sbjct: 186 CNGGLMDNAFRYIKDNG 202 >UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia theta|Rep: Cathepsin H precursor - Guillardia theta (Cryptomonas phi) Length = 353 Score = 50.4 bits (115), Expect = 3e-05 Identities = 29/89 (32%), Positives = 47/89 (52%), Gaps = 2/89 (2%) Frame = +2 Query: 152 ENFDPRDKW-PECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVS 328 + FD R++ E ++ +++QG+CGSCW F A+ I + + S + LV Sbjct: 120 DEFDWRNQTCGETSCVSMVKNQGTCGSCWTFSTAAALESLHAIKTG--EMVLLSEQQLVD 177 Query: 329 C-CPICGLGCNGGMPTLAWEYWKHVGLVS 412 C GCNGG+P+ A+EY + G +S Sbjct: 178 CAADFKNNGCNGGLPSQAFEYIMYNGGLS 206 >UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein A; n=2; Dictyostelium discoideum|Rep: Gamete and mating-type specific protein A - Dictyostelium discoideum (Slime mold) Length = 448 Score = 50.4 bits (115), Expect = 3e-05 Identities = 28/70 (40%), Positives = 37/70 (52%), Gaps = 2/70 (2%) Frame = +2 Query: 203 IRDQGSCGSCWAFGAVEAMTDRVCI-YSNATKH-FHFSAEDLVSCCPICGLGCNGGMPTL 376 IRDQG CGSCWAF + A+ R I Y A K S ++ V+C GCNGG Sbjct: 253 IRDQGQCGSCWAFASSAALESRYLIKYGTAQKSTLQLSNQNAVNC---IASGCNGGWSGN 309 Query: 377 AWEYWKHVGL 406 + ++K G+ Sbjct: 310 YFNFFKTPGI 319 >UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like midgut cysteine proteinase - Tenebrio molitor (Yellow mealworm) Length = 330 Score = 50.4 bits (115), Expect = 3e-05 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 1/78 (1%) Frame = +2 Query: 194 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG-LGCNGGMP 370 ++E++DQG CGSCW+F A+ ++ + + S ++L+ C G GC+GG Sbjct: 128 
VSEVKDQGQCGSCWSFSTTGAVEGQLALQRG--RLTSLSEQNLIDCSSSYGNAGCDGGWM 185 Query: 371 TLAWEYWKHVGLVSGGNY 424 A+ Y G++S Y Sbjct: 186 DSAFSYIHDYGIMSESAY 203 >UniRef50_Q7QVE9 Cluster: GLP_542_3431_1206; n=1; Giardia lamblia ATCC 50803|Rep: GLP_542_3431_1206 - Giardia lamblia ATCC 50803 Length = 741 Score = 50.4 bits (115), Expect = 3e-05 Identities = 43/133 (32%), Positives = 62/133 (46%), Gaps = 6/133 (4%) Frame = +2 Query: 89 DDNILKLPKVTHDAELI-ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTD 265 +D +LP +A+L A LP NF R C +I +QGSCG C+A AVE +T Sbjct: 40 EDEYNELPDGPDNADLTRAALPTNFTYRGH--RCI---QIINQGSCGCCYAAAAVEMVTA 94 Query: 266 RVCIYSNATKHFHFSAEDLVSC-----CPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNS 430 R C+ N ++ S EDLV+C I GC GG + ++ + G+V + Sbjct: 95 RRCLQLNDSR--LVSLEDLVTCDHTKYLNIQNNGCRGGNSLASLKFGETTGMVYDTCEDY 152 Query: 431 SQGCRPYEIPPCE 469 PY C+ Sbjct: 153 WNRTYPYPTETCK 165 >UniRef50_Q23H06 Cluster: Papain family cysteine protease containing protein; n=18; Tetrahymena thermophila|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 349 Score = 50.4 bits (115), Expect = 3e-05 Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 4/87 (4%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSC-CPICGL- 349 W + +++ QG+CG+CWAF A M I + A FS + L+ C P G Sbjct: 147 WRSRGAVTQVKWQGNCGACWAFSATGVMESFNFIQNKAL--VEFSEQQLLDCVIPANGYP 204 Query: 350 --GCNGGMPTLAWEYWKHVGLVSGGNY 424 GC+GG P +Y VG+++ Y Sbjct: 205 SSGCHGGWPVQCIDYASKVGILNQDRY 231 >UniRef50_Q239L8 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 323 Score = 50.4 bits (115), Expect = 3e-05 Identities = 24/83 (28%), Positives = 39/83 (46%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355 W + ++DQG CGSCW+F A+ ++ + K S + LV C GC Sbjct: 129 WTTKGAVTPVKDQGQCGSCWSFSTTGAVEG--ALFLSTKKLTSLSEQYLVDCSKDGNEGC 186 Query: 
356 NGGMPTLAWEYWKHVGLVSGGNY 424 NGG+ A+++ G+ + Y Sbjct: 187 NGGLMDTAFDFISQHGIPTEAAY 209 >UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, whole genome shotgun sequence; n=3; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_2, whole genome shotgun sequence - Paramecium tetraurelia Length = 376 Score = 50.4 bits (115), Expect = 3e-05 Identities = 25/77 (32%), Positives = 39/77 (50%) Frame = +2 Query: 194 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPT 373 + E++ QG CGSCWAF + + R+ I +N K S L+ C GC+GG + Sbjct: 175 VTEVQQQGRCGSCWAFAVQDVVISRLAI-ANKNKLDQLSKTHLIDCADGNTEGCDGGSVS 233 Query: 374 LAWEYWKHVGLVSGGNY 424 A+++ G V +Y Sbjct: 234 DAFDFINKYGTVYEKDY 250 >UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcoptes scabiei type hominis|Rep: Cathepsin L-like protease - Sarcoptes scabiei type hominis Length = 245 Score = 50.0 bits (114), Expect = 4e-05 Identities = 37/115 (32%), Positives = 58/115 (50%), Gaps = 5/115 (4%) Frame = +2 Query: 125 DAELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH 304 D E +++LP+ D W + I+DQ CGSCWAF AV +M + + + + Sbjct: 113 DNEDVSDLPDEVD----WTLKNVVAPIKDQKQCGSCWAFSAVASMESQNALKTG--QLVE 166 Query: 305 FSAEDLVSCCPICG-LGCNGGMPTLAWEY-WKHVGLVSGGNY---NSSQGCRPYE 454 S ++LV C G GC+GG A+E+ K G+ + +Y +Q CR Y+ Sbjct: 167 LSEQELVDCSVGEGNEGCDGGWMDSAFEFVIKADGIDTEKSYPYHGVNQVCRSYQ 221 >UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia ATCC 50803|Rep: GLP_567_6496_7413 - Giardia lamblia ATCC 50803 Length = 305 Score = 50.0 bits (114), Expect = 4e-05 Identities = 35/117 (29%), Positives = 49/117 (41%), Gaps = 3/117 (2%) Frame = +2 Query: 140 ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319 A P+ D R PEC E DQ C C+AF + A++ R CI + S + Sbjct: 79 AGSPDRLDYRQTHPEC--FFEPEDQKECSCCYAFATLGALSTRRCIAKLDPQAVSLSVQH 136 Query: 320 LVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGG--NYNSSQGCRPYEIP-PCEHHVP 481 +VS C GC GG +W + + G V Y S + + E P C+ P Sbjct: 137 
MVS-CDSGEAGCQGGEFESSWAFLETEGAVKSDCLPYTSGETGKSGECPTTCQDGTP 192 >UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4; Caenorhabditis|Rep: Cathepsin z protein 1 - Caenorhabditis elegans Length = 306 Score = 50.0 bits (114), Expect = 4e-05 Identities = 29/81 (35%), Positives = 41/81 (50%), Gaps = 4/81 (4%) Frame = +2 Query: 221 CGSCWAFGAVEAMTDRVCI-YSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKH 397 CGSCWAFGA A+ DR+ I NA + S ++++ C G GG P ++Y Sbjct: 92 CGSCWAFGATSALADRINIKRKNAWPQAYLSVQEVIDCSG-AGTCVMGGEPGGVYKYAHE 150 Query: 398 VGL--VSGGNYNSSQG-CRPY 451 G+ + NY + G C PY Sbjct: 151 HGIPHETCNNYQARDGKCDPY 171 >UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; Dictyostelium discoideum|Rep: Cysteine proteinase 7 precursor - Dictyostelium discoideum (Slime mold) Length = 460 Score = 50.0 bits (114), Expect = 4e-05 Identities = 26/73 (35%), Positives = 38/73 (52%), Gaps = 2/73 (2%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHF-HFSAEDLVSCCPICG-L 349 W + I++QG CG CW+F A T+ +N K+ S ++L+ C G Sbjct: 116 WRTQGAVTPIKNQGQCGGCWSFSTTGA-TEGAQYLANGKKNLVSLSEQNLIDCSGSYGNN 174 Query: 350 GCNGGMPTLAWEY 388 GC GG+ TLA+EY Sbjct: 175 GCEGGLMTLAFEY 187 >UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryota|Rep: Cysteine proteinase GP-I - Zingiber officinale (Ginger) Length = 221 Score = 50.0 bits (114), Expect = 4e-05 Identities = 32/97 (32%), Positives = 48/97 (49%) Frame = +2 Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325 LP++ D R+K P +++QG CGSCWAF A+ A+ I + S + LV Sbjct: 3 LPDSIDWREKGAVVP----VKNQGGCGSCWAFDAIAAVEGINQIVTGDL--ISLSEQQLV 56 Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQ 436 C GC GG P A++Y +++ G NS + Sbjct: 57 D-CSTRNHGCEGGWPYRAFQY-----IINNGGINSEE 87 >UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [Contains: Cathepsin H mini chain; Cathepsin H heavy chain; Cathepsin H light chain]; n=37; Eukaryota|Rep: Cathepsin H precursor (EC 3.4.22.16) [Contains: Cathepsin H mini chain; Cathepsin H heavy chain; 
Cathepsin H light chain] - Homo sapiens (Human) Length = 335 Score = 50.0 bits (114), Expect = 4e-05 Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 1/66 (1%) Frame = +2 Query: 194 LNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGCNGGMP 370 ++ +++QG+CGSCW F A+ + I + K + + LV C GC GG+P Sbjct: 129 VSPVKNQGACGSCWTFSTTGALESAIAIATG--KMLSLAEQQLVDCAQDFNNHGCQGGLP 186 Query: 371 TLAWEY 388 + A+EY Sbjct: 187 SQAFEY 192 >UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber officinale (Ginger) Length = 475 Score = 49.6 bits (113), Expect = 5e-05 Identities = 31/97 (31%), Positives = 47/97 (48%) Frame = +2 Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325 LP++ D W E + +++QG CGSCWAF A+ A+ I + S + LV Sbjct: 143 LPDSID----WREKGAVVAVKNQGRCGSCWAFAAIAAVEGINQIVTGDL--ISLSEQQLV 196 Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQ 436 C GC GG P A++Y +++ G NS + Sbjct: 197 D-CSTRNYGCEGGWPYRAFQY-----IINNGGVNSEE 227 >UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa (Rice) Length = 339 Score = 49.6 bits (113), Expect = 5e-05 Identities = 37/99 (37%), Positives = 49/99 (49%), Gaps = 3/99 (3%) Frame = +2 Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 316 I LP D R K P I+DQG CG CWAF AV AM V + + K S + Sbjct: 120 IDTLPATVDWRTKGAVTP----IKDQGQCGCCWAFSAVAAMEGIVKL--STGKLISLSEQ 173 Query: 317 DLVSCCPICG--LGCNGGMPTLAWEY-WKHVGLVSGGNY 424 +LV C + G GC GG+ A+++ K+ GL + Y Sbjct: 174 ELVD-CDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKY 211 >UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine protease; n=11; Callosobruchus maculatus|Rep: Putative gut cathepsin L-like cysteine protease - Callosobruchus maculatus (Southern cowpea weevil) (Pulse bruchid) Length = 326 Score = 49.6 bits (113), Expect = 5e-05 Identities = 28/85 (32%), Positives = 44/85 (51%), Gaps = 2/85 (2%) Frame = +2 Query: 176 
WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC--PICGL 349 W E + ++DQ +CGSCWAF AV A+ + N T SA++LV C Sbjct: 118 WREEGAVTPVKDQANCGSCWAFSAVGAIEGQF-FKKNGTL-VSLSAQELVDCATEDYGNN 175 Query: 350 GCNGGMPTLAWEYWKHVGLVSGGNY 424 GC GG+ A+++ + G+ + +Y Sbjct: 176 GCKGGLMGQAFDFVQDEGIQTEESY 200 >UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain; n=9; Cucujiformia|Rep: Digestive cysteine proteinase intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 326 Score = 49.6 bits (113), Expect = 5e-05 Identities = 35/130 (26%), Positives = 60/130 (46%), Gaps = 4/130 (3%) Frame = +2 Query: 47 TPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDK--WPECPTLNEIRDQGS 220 TPFA + KD+ ++ + +A PE + D W + + +++ QG Sbjct: 73 TPFADLT--HDEFKDELRRQIKTKPNVEATLAVFPEGLEVPDSIDWTQKGAVLDVKYQGG 130 Query: 221 CGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC-PICGLGC-NGGMPTLAWEYWK 394 CGSCWAF A A+ + I +N S + L+ C P C +GG+ + A++Y Sbjct: 131 CGSCWAFSATGALEGQNAIVNNV--KIPLSEQQLLDCSKPYGNDDCEHGGLMSFAFDYVL 188 Query: 395 HVGLVSGGNY 424 G+ + +Y Sbjct: 189 DKGIEADSSY 198 >UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2; Theileria|Rep: Cysteine protease, tacP, putative - Theileria annulata Length = 461 Score = 49.6 bits (113), Expect = 5e-05 Identities = 29/87 (33%), Positives = 44/87 (50%), Gaps = 1/87 (1%) Frame = +2 Query: 149 PENFDPRDKWPECPTLNEIRDQG-SCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325 PE+ D W + +++DQG C SCWAF +V A+ + + S + L+ Sbjct: 237 PEDLD----WRRPDVVTKVKDQGLDCSSCWAFASVAAVESIFQLLQDV--DLDLSEQHLI 290 Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVGL 406 +C C GC+GG LA +Y K+ GL Sbjct: 291 NCETRCS-GCSGGYADLALDYVKNKGL 316 >UniRef50_Q22AB1 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 344 Score = 49.6 bits (113), Expect = 5e-05 Identities = 27/88 (30%), Positives = 43/88 (48%) Frame = +2 Query: 140 
ANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAED 319 ++LPE+FD RDK P + Q +CGSCW F + + + HF +E Sbjct: 129 SDLPESFDWRDKGIITPA----KFQNTCGSCWTFATTGVIESQYALKYGELLHF---SEQ 181 Query: 320 LVSCCPICGLGCNGGMPTLAWEYWKHVG 403 ++ C GC GG+ T A+++ + G Sbjct: 182 MLLDCDNINQGCRGGLMTDAYQFLQQSG 209 >UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudicotyledons|Rep: Chymopapain precursor - Carica papaya (Papaya) Length = 352 Score = 49.6 bits (113), Expect = 5e-05 Identities = 37/138 (26%), Positives = 62/138 (44%) Frame = +2 Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 316 + N P++ D R K P +++QG+CGSCWAF + + I + S + Sbjct: 132 VTNYPQSIDWRAKGAVTP----VKNQGACGSCWAFSTIATVEGINKIVTG--NLLELSEQ 185 Query: 317 DLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMP 496 +LV C GC GG T + +Y + G+ + Y + Y+ + PG ++ Sbjct: 186 ELVD-CDKHSYGCKGGYQTTSLQYVANNGVHTSKVY--PYQAKQYKCRATDK--PGPKVK 240 Query: 497 CNGDTKTPKCQKNCESSY 550 G + P NCE+S+ Sbjct: 241 ITGYKRVP---SNCETSF 255 >UniRef50_UPI00006CB77B Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 382 Score = 49.2 bits (112), Expect = 7e-05 Identities = 32/112 (28%), Positives = 54/112 (48%), Gaps = 6/112 (5%) Frame = +2 Query: 155 NFDPRDKWPECPTLNEIRDQGS-CGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSC 331 +F+ K+P+C + I +QG C + ++ AV ++ DR+C+ S +F SA+ +SC Sbjct: 128 SFNFHTKYPQC--VRPIANQGKDCSASYSIAAVSSVADRLCMASEGDFNFGLSAQPTISC 185 Query: 332 CPICGLGCNGGMPTLAWEYWKHVGLVSG-----GNYNSSQGCRPYEIPPCEH 472 C GG + ++ K G V +S++GC I CEH Sbjct: 186 YENQSYKCEGGYVSKTFQKGKTTGFVKEECLPYHGTDSNEGCS--LIDKCEH 235 >UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae|Rep: Cysteine proteinase - Hypera postica (alfalfa weevil) Length = 324 Score = 49.2 bits (112), Expect = 7e-05 Identities = 26/83 (31%), Positives = 40/83 (48%) Frame = +2 Query: 176 
WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355 W + + ++DQG CGSCWAF ++ T+ + K S + L+ CC GC Sbjct: 118 WRKEGRVTGVKDQGDCGSCWAF-SITGSTEGAYARKSG-KLVSLSEQQLIDCCTDTSAGC 175 Query: 356 NGGMPTLAWEYWKHVGLVSGGNY 424 +GG ++Y GL S +Y Sbjct: 176 DGGSLDDNFKYVMKDGLQSEESY 198 >UniRef50_Q23FQ5 Cluster: Papain family cysteine protease containing protein; n=4; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 49.2 bits (112), Expect = 7e-05 Identities = 28/93 (30%), Positives = 41/93 (44%), Gaps = 3/93 (3%) Frame = +2 Query: 155 NFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCC 334 N+ W LN I++QG CGSC AFG + Y + + FS + L+ C Sbjct: 124 NYPTSVDWRNSGALNPIQNQGQCGSCAAFGTAGVLES--FYYLKSKQLLKFSEQQLLDCA 181 Query: 335 PICGL---GCNGGMPTLAWEYWKHVGLVSGGNY 424 G GC+G ++Y G+V G +Y Sbjct: 182 RQAGFDTYGCDGAWQQEYFKYAIKYGIVQGSSY 214 >UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 452 Score = 49.2 bits (112), Expect = 7e-05 Identities = 30/84 (35%), Positives = 44/84 (52%), Gaps = 2/84 (2%) Frame = +2 Query: 119 THDAELIANLPENFDPRDKWPECPTLNEI-RDQGSCGSCWAFGAVEAMTDRVCIYSNATK 295 T+D ++I NLPE+F W P + E DQ CG+C+AFGA EA+ + + +N + Sbjct: 216 TYDQKVIQNLPESFS----WRNVPYVLEYPHDQAVCGTCFAFGASEAINGQFSLRAN--R 269 Query: 296 HFHFSAEDLVSCC-PICGLGCNGG 364 S + LV C C+GG Sbjct: 270 SIITSVQQLVDCTWGTINYACDGG 293 >UniRef50_A2DPZ6 Cluster: Clan CA, family C1, cathepsin B-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin B-like cysteine peptidase - Trichomonas vaginalis G3 Length = 253 Score = 49.2 bits (112), Expect = 7e-05 Identities = 33/119 (27%), Positives = 56/119 (47%) Frame = +2 Query: 128 AELIANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHF 307 + + LPE ++ +++PEC I+ CG C+ + A++++ R C + F Sbjct: 22 
SNISVELPEYYNFLEEYPECDFGPLIQH---CGCCYVYSALKSLAHRYC--RALRRRIQF 76 Query: 308 SAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPG 484 SA+ ++S C + LGCNGG + Y + G V +G R Y C+ V G Sbjct: 77 SAQYIIS-CDLFNLGCNGGNEKAVFYYLEQHG-VPELECQPWRGIRGYNQEVCKKCVNG 133 >UniRef50_A0CYS7 Cluster: Chromosome undetermined scaffold_31, whole genome shotgun sequence; n=3; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_31, whole genome shotgun sequence - Paramecium tetraurelia Length = 358 Score = 49.2 bits (112), Expect = 7e-05 Identities = 30/89 (33%), Positives = 42/89 (47%) Frame = +2 Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325 +PE+++ R+ PEC I QG+C S ++ AV A +DR+C N S + + Sbjct: 131 IPESYNFREAQPECA--QPIYFQGNCSSSYSIAAVSATSDRLCKSKNGEFQDQLSPQSPI 188 Query: 326 SCCPICGLGCNGGMPTLAWEYWKHVGLVS 412 S C C GG T E K G VS Sbjct: 189 S-CDDKNYKCGGGSVTRVLEVGKKQGFVS 216 >UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin heavy chain; n=3; Amniota|Rep: PREDICTED: similar to ferritin heavy chain - Ornithorhynchus anatinus Length = 338 Score = 48.8 bits (111), Expect = 9e-05 Identities = 30/81 (37%), Positives = 40/81 (49%), Gaps = 1/81 (1%) Frame = +2 Query: 149 PENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVS 328 PE D R K P +++QG CGSCWAF A A+ ++ K S ++LV Sbjct: 121 PEEVDWRTKGYVTP----VKNQGLCGSCWAFSATGAL--EALVFKTTGKMVSLSEQNLVD 174 Query: 329 CCPICG-LGCNGGMPTLAWEY 388 C G +GC GG A+EY Sbjct: 175 CSWRQGNVGCRGGQYIGAFEY 195 >UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n=1; Xenopus tropicalis|Rep: UPI00006A2275 UniRef100 entry - Xenopus tropicalis Length = 272 Score = 48.8 bits (111), Expect = 9e-05 Identities = 33/90 (36%), Positives = 44/90 (48%), Gaps = 3/90 (3%) Frame = +2 Query: 164 PRDKWPECPTLNEIRDQGS-CGSCWAFGAVEAMTDRVCIYSNATKHF-HFSAEDLVSCCP 337 P W + +RDQGS C SC+AF AV A+ C + T FS ++LV C Sbjct: 81 PSIDWRTQNCVTPVRDQGSFCRSCYAFSAVGALE---CQWKKKTVRLVTFSPQELVDCSD 137 Query: 338 ICGL-GCNGGMPTLAWEYWKHVGLVSGGNY 424 G 
GCNGG A++Y K G++ Y Sbjct: 138 GEGNHGCNGGKIEKAFKYMKKYGVMEESAY 167 >UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel protein - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 328 Score = 48.8 bits (111), Expect = 9e-05 Identities = 37/122 (30%), Positives = 57/122 (46%), Gaps = 3/122 (2%) Frame = +2 Query: 167 RDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICG 346 R W E ++ +++QG CGSCWAF AV ++ ++ + A SA++L+ C G Sbjct: 116 RVNWTEHGMVSPVQNQGPCGSCWAFSAVGSLEAQMKRRTAAL--VPLSAQNLLDCSVSLG 173 Query: 347 -LGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPP--CEHHVPGNRMPCNGDTKT 517 GC GG + A+ Y ++ +SS PYE C + V G C G Sbjct: 174 NRGCKGGFLSRAFLY-----VIQNRGIDSST-FYPYEHKEGVCRYSVSGRAGYCTGFRIV 227 Query: 518 PK 523 P+ Sbjct: 228 PR 229 >UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba histolytica|Rep: Cysteine protease 19 - Entamoeba histolytica Length = 324 Score = 48.8 bits (111), Expect = 9e-05 Identities = 27/81 (33%), Positives = 45/81 (55%), Gaps = 4/81 (4%) Frame = +2 Query: 194 LNEIRDQGSCGSCWAFGAVEAMTDRVCI-YSN-ATKHFHFSAEDLVSCC--PICGLGCNG 361 + ++DQG+CGSC+AF +V M V + Y + + ++ S ++VSCC P GC G Sbjct: 112 MTPVKDQGNCGSCYAFSSVALMETAVLLSYDDLSPSNYALSTAEIVSCCYDPSECRGCEG 171 Query: 362 GMPTLAWEYWKHVGLVSGGNY 424 G A +Y + G+ S ++ Sbjct: 172 GSIGGALKYAQDNGMQSESSF 192 >UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n=1; Myxobolus cerebralis|Rep: Cathepsin Z-like cysteine proteinase - Myxobolus cerebralis Length = 297 Score = 48.8 bits (111), Expect = 9e-05 Identities = 29/86 (33%), Positives = 44/86 (51%), Gaps = 4/86 (4%) Frame = +2 Query: 143 NLPENFDPRDKWPECPTLNEIRDQGS---CGSCWAFGAVEAMTDRVCIYSNATKHFHFS- 310 N+P++FD W E L+ +++Q CGSCWAF + + DR+ I N + HFS Sbjct: 49 NMPKSFD----WRENAYLSSVKNQHLPTYCGSCWAFASTSTIADRIYIAKNLSHFDHFSL 104 Query: 311 AEDLVSCCPICGLGCNGGMPTLAWEY 388 + +V C G GG + +EY Sbjct: 105 SVQVVIACAQSGDCKLGGFASGVYEY 130 >UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2; Brugia malayi|Rep: Cahepsin L-like cysteine 
protease - Brugia malayi (Filarial nematode worm) Length = 371 Score = 48.8 bits (111), Expect = 9e-05 Identities = 27/83 (32%), Positives = 43/83 (51%), Gaps = 2/83 (2%) Frame = +2 Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325 LP++ D W + +++DQG CGSCW F AV A+ + + + K S ++L+ Sbjct: 143 LPKSID----WRTSGAVTKVKDQGYCGSCWTFSAVGALEGQHFLQTG--KLVELSMQNLL 196 Query: 326 SCC--PICGLGCNGGMPTLAWEY 388 C GC+GG+ A+EY Sbjct: 197 DCSDDTYGNYGCDGGLMMEAFEY 219 >UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis|Rep: Cathepsin L - Culicoides sonorensis Length = 331 Score = 48.8 bits (111), Expect = 9e-05 Identities = 24/68 (35%), Positives = 38/68 (55%) Frame = +2 Query: 203 IRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAW 382 +++Q CGSCWAF +V ++ R + N K + + ++LV C GC+GG LA Sbjct: 131 VKNQAQCGSCWAFASVASVEMRYKRFHN--KSYTLAEQELVD-CETTSHGCSGGWSDLAL 187 Query: 383 EYWKHVGL 406 +Y + GL Sbjct: 188 QYMRDNGL 195 >UniRef50_Q24FA8 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 335 Score = 48.8 bits (111), Expect = 9e-05 Identities = 24/87 (27%), Positives = 41/87 (47%), Gaps = 4/87 (4%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPI----C 343 W + ++ +++QG CG CW F A M I++ +S + L+ C + Sbjct: 130 WRKKGGVSPVKNQGECGGCWTFSATGLMESFNLIHNKPQNVSLYSQQQLLDCVTLENGYF 189 Query: 344 GLGCNGGMPTLAWEYWKHVGLVSGGNY 424 GC GG+P+ A +Y G++S Y Sbjct: 190 SEGCEGGVPSDAVQYAADFGVLSDNEY 216 >UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 383 Score = 48.8 bits (111), Expect = 9e-05 Identities = 29/83 (34%), Positives = 40/83 (48%) Frame = +2 Query: 176 WPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGC 355 W E L I++QG CGSCWAF V ++ + I K S +++V C GC Sbjct: 174 
WREQGKLTPIKNQGQCGSCWAFATVASVEAQNAIKKG--KLVSLSEQEMVD-CDGRNNGC 230 Query: 356 NGGMPTLAWEYWKHVGLVSGGNY 424 +GG A ++ K GL S Y Sbjct: 231 SGGYRPYAMKFVKENGLESEKEY 253 >UniRef50_A4VE98 Cluster: Cathepsin z; n=3; Tetrahymena thermophila SB210|Rep: Cathepsin z - Tetrahymena thermophila SB210 Length = 585 Score = 48.8 bits (111), Expect = 9e-05 Identities = 40/138 (28%), Positives = 62/138 (44%), Gaps = 5/138 (3%) Frame = +2 Query: 8 QNTWKAGRNFPTHTP-FAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWPE 184 +NT K HT F H + + K+ L + H+ A+LP N+D R+ Sbjct: 290 RNTTKVTEVSNNHTNNFRHTTCIRESNKNSTQLITGPLPHEYINAASLPANWDWRNI-NG 348 Query: 185 CPTLNEIRDQGS---CGSCWAFGAVEAMTDRVCIYSNAT-KHFHFSAEDLVSCCPICGLG 352 L+ R+Q CGSCWA G ++ DR+ I N T S + +++C G Sbjct: 349 VNYLSFTRNQHIPQYCGSCWAHGTTSSLADRINIARNRTWPDIALSVQVVLNC--QAGGS 406 Query: 353 CNGGMPTLAWEYWKHVGL 406 CNGG P +++ G+ Sbjct: 407 CNGGQPMGVYQFANKQGI 424 Score = 40.7 bits (91), Expect = 0.025 Identities = 41/163 (25%), Positives = 63/163 (38%), Gaps = 5/163 (3%) Frame = +2 Query: 110 PKVTHDAELIANLPENFDPRDKWPECPTLNEIRDQGS---CGSCWAFGAVEAMTDRVCIY 280 P V +AE + LP NF ++ L +R+Q CGSCWA A + DR+ I Sbjct: 31 PYVISNAEFNSVLPSNFTWQNV-NGTDYLTLVRNQHIPQYCGSCWAQAASSTLADRIKIA 89 Query: 281 SNATKHFHFSAEDLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIP 460 A A ++ C GC+GG A+++ K + + + C PY+ Sbjct: 90 RKAQWPDVVIAPQVLVSCDEYSNGCHGGNSGTAFQWIKEHNI-------TDETCSPYQA- 141 Query: 461 PCEHHVPGNRMPCNGDTKTPKC--QKNCESSYNVPFKKEQRYG 583 + N + C+ C K C + N YG Sbjct: 142 ----YGHDNGLGCSAQIMCKNCMPNKGCWAQENAKVYTVAEYG 180 >UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precursor; n=20; Psoroptidia|Rep: Major mite fecal allergen Der f 1 precursor - Dermatophagoides farinae (House-dust mite) Length = 321 Score = 48.8 bits (111), Expect = 9e-05 Identities = 32/94 (34%), Positives = 41/94 (43%) Frame = +2 Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322 N+P D R T+ IR QG CGSCWAF V A Y N + S ++L Sbjct: 108 
NVPSELDLRS----LRTVTPIRMQGGCGSCWAFSGVAATESAYLAYRNTS--LDLSEQEL 161 Query: 323 VSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424 V C GC+G EY + G+V +Y Sbjct: 162 VDCA--SQHGCHGDTIPRGIEYIQQNGVVEERSY 193 >UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep: Viral cathepsin - Xestia c-nigrum granulosis virus (XnGV) (Xestia c-nigrumgranulovirus) Length = 346 Score = 48.8 bits (111), Expect = 9e-05 Identities = 28/80 (35%), Positives = 43/80 (53%) Frame = +2 Query: 146 LPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 325 +P++FD RD+ ++ ++ Q CGSCWAF AV + I N + S + LV Sbjct: 133 VPDSFDWRDR----NSVTSVKMQKECGSCWAFSAVANIESLYHIKHNVS--LDLSEQQLV 186 Query: 326 SCCPICGLGCNGGMPTLAWE 385 C + GCNGG+ + A+E Sbjct: 187 DCDKV-NNGCNGGLMSWAFE 205 >UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine protease; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cysteine protease - Strongylocentrotus purpuratus Length = 494 Score = 48.4 bits (110), Expect = 1e-04 Identities = 39/128 (30%), Positives = 56/128 (43%), Gaps = 1/128 (0%) Frame = +2 Query: 5 KQNTWKAG-RNFPTHTPFAHIKILMGALKDDNILKLPKVTHDAELIANLPENFDPRDKWP 181 +Q T K G F T K+ G LK I K + +PE +D W Sbjct: 197 EQGTAKYGPTKFADMTEAEFRKLQSGPLKKTGIKKQAAIPQGP-----VPEEYD----WR 247 Query: 182 ECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGLGCNG 361 + +++QG CGSCWAF A+ M + I + S ++LV C + G GC G Sbjct: 248 THGAVTPVKNQGMCGSCWAFSAIGNMEGQWQIKKG--ELISLSEQELVDCDKVDG-GCEG 304 Query: 362 GMPTLAWE 385 G + A+E Sbjct: 305 GEMSDAYE 312 >UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Cathepsin K - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 333 Score = 48.4 bits (110), Expect = 1e-04 Identities = 37/131 (28%), Positives = 58/131 (44%), Gaps = 2/131 (1%) Frame = +2 Query: 137 IANLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAE 316 + LP++ D R + + +++QGSCGSCWAF +V A+ + + + S + Sbjct: 115 VGKLPKSIDYR----KLGYVTSVKNQGSCGSCWAFSSVGALEGQ--LMKTKGQLVDLSPQ 168 Query: 317 
DLVSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPY--EIPPCEHHVPGNR 490 +LV C GC GG T A+ Y VS S+ PY C ++ G Sbjct: 169 NLVDCVTE-NDGCGGGYMTNAFRY------VSNNQGIDSEESYPYVGTDQQCAYNTSGVA 221 Query: 491 MPCNGDTKTPK 523 C G + P+ Sbjct: 222 ASCRGYKEIPQ 232 >UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 343 Score = 48.4 bits (110), Expect = 1e-04 Identities = 30/94 (31%), Positives = 44/94 (46%) Frame = +2 Query: 143 NLPENFDPRDKWPECPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDL 322 NLP + D R+ + I+ QG CGSCWAF A+ V I + S++ L Sbjct: 134 NLPNSVDWRNV-NGTNHVTGIKYQGPCGSCWAFATAAAIESAVSISGGGLQ--SLSSQQL 190 Query: 323 VSCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNY 424 + C + C GG P A +Y + G+ + NY Sbjct: 191 LDCTVVSD-KCGGGEPVEALKYAQSHGITTAHNY 223 Database: uniref50 Posted date: Oct 5, 2007 11:19 AM Number of letters in database: 575,637,011 Number of sequences in database: 1,657,284 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.279 0.0580 0.190 Matrix: BLOSUM62 Gap Penalties: Existence: 9, Extension: 2 Number of Hits to DB: 603,931,243 Number of Sequences: 1657284 Number of extensions: 12407851 Number of successful extensions: 34860 Number of sequences better than 10.0: 468 Number of HSP's better than 10.0 without gapping: 33339 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 34539 length of database: 575,637,011 effective HSP length: 97 effective length of database: 414,880,463 effective search space used: 40658285374 frameshift window, decay const: 40, 0.1 T: 12 A: 40 X1: 16 ( 7.3 bits) X2: 37 (14.9 bits) X3: 62 (25.0 bits) S1: 41 (21.7 bits)