BLASTX 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= MFBP04_F_K21
         (862 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpw...   150   4e-35
UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin ...   134   4e-30
UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Ca...   131   2e-29
UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|...   128   2e-28
UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 ...   128   2e-28
UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n...   124   2e-27
UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase pr...   122   1e-26
UniRef50_A1XG92 Cluster: Putative cathepsin B-like like proteina...   121   3e-26
UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=...   118   2e-25
UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome sh...   116   8e-25
UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4 ...   115   1e-24
UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=...   115   2e-24
UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep: ...   113   4e-24
UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase pr...   112   1e-23
UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8....   111   2e-23
UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA...   111   2e-23
UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor; ...   110   4e-23
UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Catheps...   110   5e-23
UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n...
108 2e-22 UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep: Ca... 107 4e-22 UniRef50_Q23FP9 Cluster: Papain family cysteine protease contain... 107 5e-22 UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7; Rhabditida|... 105 1e-21 UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep: Ca... 104 3e-21 UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoid... 103 4e-21 UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin ... 102 1e-20 UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1 ... 102 1e-20 UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8.... 101 2e-20 UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3 ... 101 2e-20 UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2 ... 101 3e-20 UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus co... 100 4e-20 UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep... 99 7e-20 UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gamb... 99 7e-20 UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep... 99 2e-19 UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115, w... 98 2e-19 UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2... 97 7e-19 UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7... 96 1e-18 UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4; Ancylostomati... 92 1e-17 UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8; Leishma... 92 2e-17 UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep... 91 2e-17 UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator ame... 91 3e-17 UniRef50_Q237A1 Cluster: Papain family cysteine protease contain... 90 6e-17 UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep... 89 1e-16 UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1; Rhip... 89 2e-16 UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8... 
88 3e-16 UniRef50_P91991 Cluster: Putative uncharacterized protein; n=2; ... 86 9e-16 UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyl... 86 1e-15 UniRef50_Q625X3 Cluster: Putative uncharacterized protein CBG011... 55 2e-15 UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2; ... 82 2e-14 UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: C... 81 3e-14 UniRef50_Q5DEC9 Cluster: SJCHGC06356 protein; n=1; Schistosoma j... 73 7e-12 UniRef50_Q00W63 Cluster: Cysteine proteinase; n=2; Ostreococcus|... 70 9e-11 UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella ve... 68 3e-10 UniRef50_O02470 Cluster: Cysteine proteinase; n=2; Chromadorea|R... 67 5e-10 UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1; ... 66 1e-09 UniRef50_A4RYN2 Cluster: Predicted protein; n=1; Ostreococcus lu... 65 2e-09 UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomus... 65 2e-09 UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1; O... 65 2e-09 UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|R... 60 9e-08 UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|R... 60 9e-08 UniRef50_UPI0000E4866E Cluster: PREDICTED: similar to oxidized-L... 59 1e-07 UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ... 58 2e-07 UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|R... 58 3e-07 UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1... 58 4e-07 UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p; ... 57 7e-07 UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy... 57 7e-07 UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain... 56 1e-06 UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Gi... 56 2e-06 UniRef50_Q6XM39 Cluster: FirrV-1-A48 precursor; n=1; Feldmannia ... 55 3e-06 UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma j... 
55 3e-06 UniRef50_Q9UJW2 Cluster: Tubulointerstitial nephritis antigen; n... 54 4e-06 UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;... 54 5e-06 UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium... 54 5e-06 UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ... 54 6e-06 UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA,... 53 8e-06 UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestin... 53 8e-06 UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcop... 53 8e-06 UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; The... 53 8e-06 UniRef50_UPI0000E815AE Cluster: PREDICTED: similar to glucocorti... 53 1e-05 UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin ... 53 1e-05 UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae... 53 1e-05 UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like prote... 53 1e-05 UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;... 52 1e-05 UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadida... 52 1e-05 UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain... 52 1e-05 UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; ... 52 2e-05 UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG... 52 2e-05 UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru... 52 2e-05 UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy... 52 2e-05 UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, wh... 51 3e-05 UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa... 51 4e-05 UniRef50_Q24E33 Cluster: Papain family cysteine protease contain... 51 4e-05 UniRef50_Q23TW3 Cluster: Papain family cysteine protease contain... 51 4e-05 UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; The... 51 4e-05 UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease ... 
50 6e-05 UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - ... 50 6e-05 UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: ... 50 6e-05 UniRef50_Q23H32 Cluster: Papain family cysteine protease contain... 50 6e-05 UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus... 50 6e-05 UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain ... 50 6e-05 UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag... 50 6e-05 UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2; cel... 50 6e-05 UniRef50_Q18740 Cluster: Putative uncharacterized protein tag-32... 50 8e-05 UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ... 50 1e-04 UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei... 50 1e-04 UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Try... 50 1e-04 UniRef50_Q23RT7 Cluster: Papain family cysteine protease contain... 50 1e-04 UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy... 50 1e-04 UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh... 50 1e-04 UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3.... 50 1e-04 UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-... 49 1e-04 UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Gi... 49 1e-04 UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio... 49 2e-04 UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1... 49 2e-04 UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cy... 49 2e-04 UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-li... 49 2e-04 UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease ... 48 2e-04 UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis... 48 2e-04 UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr... 48 2e-04 UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen... 
48 2e-04 UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia... 48 3e-04 UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n... 48 3e-04 UniRef50_Q24HI2 Cluster: Papain family cysteine protease contain... 48 3e-04 UniRef50_Q248G1 Cluster: Papain family cysteine protease contain... 48 3e-04 UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ... 48 3e-04 UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ... 48 3e-04 UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma... 48 3e-04 UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;... 48 4e-04 UniRef50_Q58HK5 Cluster: Hc58; n=1; Haemonchus contortus|Rep: Hc... 48 4e-04 UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, wh... 48 4e-04 UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh... 48 4e-04 UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18... 48 4e-04 UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep... 47 5e-04 UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ... 47 5e-04 UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcop... 47 5e-04 UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi... 47 5e-04 UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big... 47 7e-04 UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba hi... 47 7e-04 UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted... 47 7e-04 UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain... 47 7e-04 UniRef50_Q23FS6 Cluster: Papain family cysteine protease contain... 47 7e-04 UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ... 47 7e-04 UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n... 46 0.001 UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n... 46 0.001 UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ... 
46 0.001 UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foet... 46 0.001 UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Re... 46 0.001 UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy... 46 0.001 UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, wh... 46 0.001 UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, w... 46 0.001 UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Pl... 46 0.001 UniRef50_Q9GZM7-2 Cluster: Isoform 2 of Q9GZM7 ; n=1; Homo sapie... 46 0.001 UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re... 46 0.001 UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca... 46 0.001 UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j... 46 0.001 UniRef50_Q231X3 Cluster: Papain family cysteine protease contain... 46 0.001 UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; ... 46 0.001 UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e... 46 0.001 UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ... 46 0.002 UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin ... 46 0.002 UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; ... 46 0.002 UniRef50_Q7QQ83 Cluster: GLP_42_16392_14707; n=1; Giardia lambli... 46 0.002 UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4; Caenorhabdi... 46 0.002 UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis... 46 0.002 UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, wh... 46 0.002 UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3; ... 45 0.002 UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Pl... 45 0.002 UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; ... 45 0.002 UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;... 45 0.002 UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ... 
45 0.003 UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa... 45 0.003 UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt... 45 0.003 UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain... 45 0.003 UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M... 45 0.003 UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [C... 45 0.003 UniRef50_Q9XXQ7 Cluster: Putative uncharacterized protein; n=2; ... 44 0.004 UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein... 44 0.004 UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcopt... 44 0.004 UniRef50_Q239L8 Cluster: Papain family cysteine protease contain... 44 0.004 UniRef50_Q22DX2 Cluster: Papain family cysteine protease contain... 44 0.004 UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, wh... 44 0.004 UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ... 44 0.005 UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ... 44 0.005 UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio... 44 0.005 UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus... 44 0.005 UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;... 44 0.005 UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;... 44 0.005 UniRef50_Q23H10 Cluster: Papain family cysteine protease contain... 44 0.005 UniRef50_Q23H06 Cluster: Papain family cysteine protease contain... 44 0.005 UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cy... 44 0.005 UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ... 44 0.005 UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, who... 44 0.005 UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ... 44 0.005 UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal... 44 0.007 UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhip... 
44 0.007 UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ... 44 0.007 UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=... 44 0.007 UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|... 44 0.007 UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 p... 43 0.009 UniRef50_UPI00006CB77B Cluster: Papain family cysteine protease ... 43 0.009 UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat... 43 0.009 UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t... 43 0.009 UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ... 43 0.009 UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schist... 43 0.009 UniRef50_Q23EG5 Cluster: Papain family cysteine protease contain... 43 0.009 UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precu... 43 0.009 UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35... 43 0.009 UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata... 43 0.009 UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h... 43 0.011 UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole... 43 0.011 UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome s... 43 0.011 UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph... 43 0.011 UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa... 43 0.011 UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n... 43 0.011 UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er... 43 0.011 UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like cy... 43 0.011 UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar ... 42 0.015 UniRef50_Q8I0V1 Cluster: Preprocathepsin c, putative; n=1; Plasm... 42 0.015 UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n... 42 0.015 UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain... 
42 0.015 UniRef50_Q22W19 Cluster: Papain family cysteine protease contain... 42 0.015 UniRef50_A0CGA3 Cluster: Chromosome undetermined scaffold_179, w... 42 0.015 UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto... 42 0.015 UniRef50_UPI000155D183 Cluster: PREDICTED: similar to Cathepsin ... 42 0.020 UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ... 42 0.020 UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia ... 42 0.020 UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy... 42 0.020 UniRef50_Q7QVE9 Cluster: GLP_542_3431_1206; n=1; Giardia lamblia... 42 0.020 UniRef50_Q235G6 Cluster: Papain family cysteine protease contain... 42 0.020 UniRef50_A7APS9 Cluster: Papain family cysteine protease contain... 42 0.020 UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precurs... 42 0.020 UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C... 42 0.020 UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum... 42 0.026 UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3... 42 0.026 UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, wh... 42 0.026 UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=... 42 0.026 UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryo... 42 0.026 UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G... 41 0.035 UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathep... 41 0.035 UniRef50_A2E6N1 Cluster: Clan CA, family C1, cathepsin B-like cy... 41 0.035 UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov... 41 0.035 UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|... 41 0.035 UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2... 41 0.035 UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No... 41 0.046 UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ... 
41 0.046 UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ... 41 0.046 UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaste... 41 0.046 UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Pl... 41 0.046 UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip... 41 0.046 UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n... 41 0.046 UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; ... 41 0.046 UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain... 41 0.046 UniRef50_Q22A69 Cluster: Papain family cysteine protease contain... 41 0.046 UniRef50_A7AX75 Cluster: Preprocathepsin c, putative; n=1; Babes... 41 0.046 UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi... 41 0.046 UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc... 41 0.046 UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin ... 40 0.061 UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ... 40 0.061 UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease ... 40 0.061 UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re... 40 0.061 UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty... 40 0.061 UniRef50_Q01FU9 Cluster: Cathepsin Z; n=2; Ostreococcus|Rep: Cat... 40 0.061 UniRef50_Q9XZI2 Cluster: Cathepsin Z1 preproprotein; n=3; Bilate... 40 0.061 UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb... 40 0.061 UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox... 40 0.061 UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1; ... 40 0.061 UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C... 40 0.061 UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:... 40 0.061 UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like cy... 40 0.061 UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; D... 
40 0.061 UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ... 40 0.061 UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr... 40 0.061 UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine p... 40 0.081 UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep: L... 40 0.081 UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ... 40 0.081 UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi... 40 0.081 UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr... 40 0.081 UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain... 40 0.081 UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; ... 40 0.081 UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Pir... 40 0.081 UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi... 40 0.081 UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ... 40 0.11 UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|... 40 0.11 UniRef50_Q3L7L6 Cluster: Sar s 1 allergen Yv5032C08; n=1; Sarcop... 40 0.11 UniRef50_O16454 Cluster: Temporarily assigned gene name protein ... 40 0.11 UniRef50_A4VE98 Cluster: Cathepsin z; n=3; Tetrahymena thermophi... 40 0.11 UniRef50_A2DPZ6 Cluster: Clan CA, family C1, cathepsin B-like cy... 40 0.11 UniRef50_A0CYS7 Cluster: Chromosome undetermined scaffold_31, wh... 40 0.11 UniRef50_A0CS14 Cluster: Chromosome undetermined scaffold_26, wh... 40 0.11 UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R... 40 0.11 UniRef50_Q94715 Cluster: Putative cathepsin L2 precursor; n=4; P... 40 0.11 UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca... 39 0.14 UniRef50_A6GBK8 Cluster: Peptidase C1A, papain; n=1; Plesiocysti... 39 0.14 UniRef50_A0LES1 Cluster: Peptidase C1A, papain precursor; n=2; S... 39 0.14 UniRef50_Q015J8 Cluster: Cathepsin; n=2; Ostreococcus|Rep: Cathe... 39 0.14 UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-... 
39 0.14 UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep... 39 0.14 UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R... 39 0.14 UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w... 39 0.14 UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus ... 39 0.19 UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lambl... 39 0.19 UniRef50_Q54VR1 Cluster: Putative uncharacterized protein; n=1; ... 39 0.19 UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n... 39 0.19 UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain... 39 0.19 UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheir... 39 0.19 UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic... 39 0.19 UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica... 38 0.25 UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ... 38 0.25 UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ... 38 0.25 UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|... 38 0.25 UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep... 38 0.25 UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat... 38 0.25 UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteina... 38 0.25 UniRef50_A5K8Y0 Cluster: Preprocathepsin c, putative; n=1; Plasm... 38 0.25 UniRef50_A0DIY3 Cluster: Chromosome undetermined scaffold_52, wh... 38 0.25 UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs... 38 0.25 UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ... 38 0.25 UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lambli... 38 0.33 UniRef50_Q5CKC2 Cluster: Preprocathepsin c; n=2; Cryptosporidium... 38 0.33 UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory... 38 0.33 UniRef50_UPI00006CBB5F Cluster: Papain family cysteine protease ... 38 0.43 UniRef50_Q8QNJ8 Cluster: EsV-1-75; n=1; Ectocarpus siliculosus v... 
38 0.43 UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa... 38 0.43 UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryz... 38 0.43 UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lambli... 38 0.43 UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve... 38 0.43 UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H... 38 0.43 UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-li... 38 0.43 UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; M... 38 0.43 UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ... 37 0.57 UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n... 37 0.57 UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ... 37 0.57 UniRef50_Q8I1Y2 Cluster: Protease, putative; n=1; Plasmodium fal... 37 0.57 UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n... 37 0.57 UniRef50_Q5CM16 Cluster: P3ECSL-related; n=2; Cryptosporidium|Re... 37 0.57 UniRef50_Q23H15 Cluster: Papain family cysteine protease contain... 37 0.57 UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v... 37 0.57 UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-li... 37 0.57 UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, w... 37 0.57 UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ... 37 0.57 UniRef50_Q9UBR2 Cluster: Cathepsin Z precursor; n=40; Bilateria|... 37 0.57 UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil... 37 0.75 UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ... 37 0.75 UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ... 37 0.75 UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestiv... 36 1.00 UniRef50_Q7RQM7 Cluster: Dipeptidyl-peptidase i; n=6; Plasmodium... 36 1.00 UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia... 36 1.00 UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi... 
36 1.00 UniRef50_A0D5R4 Cluster: Chromosome undetermined scaffold_39, wh... 36 1.00 UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ... 36 1.3 UniRef50_UPI00006CBCFB Cluster: Papain family cysteine protease ... 36 1.3 UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome s... 36 1.3 UniRef50_Q8I8D3 Cluster: Cysteine protease 15; n=2; Entamoeba hi... 36 1.3 UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lambl... 36 1.3 UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain... 36 1.3 UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve... 36 1.3 UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,... 36 1.7 UniRef50_Q8ZRX7 Cluster: Putative viral protein; n=1; Salmonella... 36 1.7 UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; ... 36 1.7 UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|... 36 1.7 UniRef50_Q5C1E8 Cluster: SJCHGC04937 protein; n=1; Schistosoma j... 36 1.7 UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali... 36 1.7 UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D... 36 1.7 UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Re... 36 1.7 UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel... 36 1.7 UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ... 35 2.3 UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intest... 35 2.3 UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia... 35 2.3 UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:... 35 2.3 UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ... 35 3.0 UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; R... 35 3.0 UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Ac... 35 3.0 UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckei... 35 3.0 UniRef50_Q7R0G3 Cluster: GLP_29_33036_32140; n=1; Giardia lambli... 
35 3.0 UniRef50_Q6A1H9 Cluster: Cathepsin X/O; n=1; Suberites domuncula... 35 3.0 UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicate... 35 3.0 UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1; ... 35 3.0 UniRef50_A0LQG9 Cluster: Peptidase C1A, papain; n=1; Syntrophoba... 34 4.0 UniRef50_Q60EQ9 Cluster: Putative uncharacterized protein OJ1280... 34 4.0 UniRef50_Q5YER6 Cluster: Cathepsin Z; n=1; Bigelowiella natans|R... 34 4.0 UniRef50_A5HC50 Cluster: Cathepsin W; n=2; Theria|Rep: Cathepsin... 34 4.0 UniRef50_Q7R6L4 Cluster: GLP_170_114230_115951; n=1; Giardia lam... 34 4.0 UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theil... 34 4.0 UniRef50_Q4N3V5 Cluster: Cathepsin C, putative; n=1; Theileria p... 34 4.0 UniRef50_Q3L7L7 Cluster: Sar s 1 allergen Yv5020C01; n=1; Sarcop... 34 4.0 UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl... 34 4.0 UniRef50_A5KAP8 Cluster: Protease, putative; n=1; Plasmodium viv... 34 4.0 UniRef50_A0BZE1 Cluster: Chromosome undetermined scaffold_139, w... 34 4.0 UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto... 34 4.0 UniRef50_Q5CYC8 Cluster: Cathepsin like thiol protease possibly ... 34 5.3 UniRef50_Q17IZ6 Cluster: Procathepsin L3, putative; n=2; Culicid... 34 5.3 UniRef50_Q649T1 Cluster: Cathepsin C; n=1; uncultured archaeon G... 34 5.3 UniRef50_A5UP12 Cluster: Adhesin-like protein; n=1; Methanobrevi... 34 5.3 UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:... 34 5.3 UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin ... 33 7.0 UniRef50_UPI00006CB653 Cluster: Papain family cysteine protease ... 33 7.0 UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Re... 33 7.0 UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster... 33 7.0 UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcop... 33 7.0 UniRef50_A5KBM2 Cluster: Serine-repeat antigen; n=3; Plasmodium|... 
33 7.0 UniRef50_A0EAG2 Cluster: Chromosome undetermined scaffold_86, wh... 33 7.0 UniRef50_Q4T8I7 Cluster: Chromosome undetermined SCAF7784, whole... 33 9.3 UniRef50_Q8XSB3 Cluster: Hypothetical signal peptide protein; n=... 33 9.3 UniRef50_Q9NHY1 Cluster: Cysteine protease cp2; n=1; Theileria c... 33 9.3 UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba hi... 33 9.3 UniRef50_Q75JH0 Cluster: Similar to Dictyostelium discoideum (Sl... 33 9.3 UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3; Theil... 33 9.3 UniRef50_Q22NW9 Cluster: Papain family cysteine protease contain... 33 9.3 UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like cy... 33 9.3 >UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpwnx02 - Periplaneta americana (American cockroach) Length = 343 Score = 150 bits (364), Expect = 4e-35 Identities = 60/87 (68%), Positives = 69/87 (79%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +RDQGSCGSCWAFGAVEAM+DRVC +S G HFHFSAEDLL+CC CG GC+GG P W Sbjct: 112 IRDQGSCGSCWAFGAVEAMSDRVCIHSKGKTHFHFSAEDLLTCCSSCGFGCNGGEPGAAW 171 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEIPP 764 +YW G+VSGGSY+S GC+PY I P Sbjct: 172 DYWVSTGIVSGGSYNSHQGCQPYAIEP 198 Score = 78.6 bits (185), Expect = 2e-13 Identities = 37/79 (46%), Positives = 49/79 (62%) Frame = +1 Query: 259 LPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIKDEHFATLPIKTHKIDLI 438 L PLSD+FI+ IN +WKA RNF D +KK+MGV + LP K+ + D+ Sbjct: 32 LVDPLSDDFIDHINSLNTTWKAHRNFGNDIPLREIKKLMGVRRSLENFRLPEKSME-DID 90 Query: 439 ASLPENFDPRDXWPDCPTL 495 +PE FDPR+ WP+CPTL Sbjct: 91 IEIPEEFDPREQWPECPTL 109 >UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin B; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin B - Strongylocentrotus purpuratus Length = 346 Score = 134 bits (323), Expect = 4e-30 Identities = 53/85 (62%), Positives = 66/85 (77%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 
VRDQGSCGSCWAFGAVEA++DR+C S G H SAEDL++CC CG GC+GG P W Sbjct: 97 VRDQGSCGSCWAFGAVEAISDRICIKSKGQTQVHISAEDLMTCCKTCGNGCNGGFPGSAW 156 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEI 758 EY+K G+V+GG ++SS GC+PY+I Sbjct: 157 EYYKDTGIVTGGQWNSSQGCQPYQI 181 Score = 51.6 bits (118), Expect = 2e-05 Identities = 25/70 (35%), Positives = 39/70 (55%) Frame = +1 Query: 286 INTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIKDEHFATLPIKTHKIDLIASLPENFDP 465 + +N + +WKAG NF ++++G +K+ + LP K I LPENFD Sbjct: 28 VQKVNSLKTTWKAGINF-EGWQLDDFRRMLGALKNPN-GRLP-KLENQTRIKDLPENFDA 84 Query: 466 RDXWPDCPTL 495 R+ WP+CPT+ Sbjct: 85 RENWPNCPTI 94 >UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Cathepsin B1) (APP secretase) (APPS) [Contains: Cathepsin B light chain; Cathepsin B heavy chain]; n=85; Eukaryota|Rep: Cathepsin B precursor (EC 3.4.22.1) (Cathepsin B1) (APP secretase) (APPS) [Contains: Cathepsin B light chain; Cathepsin B heavy chain] - Homo sapiens (Human) Length = 339 Score = 131 bits (317), Expect = 2e-29 Identities = 59/106 (55%), Positives = 69/106 (65%), Gaps = 2/106 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPXLT 680 +RDQGSCGSCWAFGAVEA++DR+C ++N SAEDLL+CC +CG GC+GG P Sbjct: 99 IRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEA 158 Query: 681 WEYWKXFGLVSGGSYHSSXGCRPYEIPPX*TS-RTRAPGCPXXGDT 815 W +W GLVSGG Y S GCRPY IPP P C GDT Sbjct: 159 WNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDT 204 Score = 62.1 bits (144), Expect = 2e-08 Identities = 34/96 (35%), Positives = 51/96 (53%), Gaps = 4/96 (4%) Frame = +1 Query: 220 YVTLVC--VLAAAKDLP--HPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIK 387 + +L C VLA A+ P HPLSDE +N +N + +W+AG NF + ++LK++ G Sbjct: 5 WASLCCLLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLCGTFL 63 Query: 388 DEHFATLPIKTHKIDLIASLPENFDPRDXWPDCPTL 495 P + LP +FD R+ WP CPT+ Sbjct: 64 G---GPKPPQRVMFTEDLKLPASFDAREQWPQCPTI 96 >UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|Rep: Cathepsin B5 - Clonorchis sinensis Length = 343 Score = 128 
bits (309), Expect = 2e-28 Identities = 52/86 (60%), Positives = 61/86 (70%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +RDQ SCGSCWAFGAVEAM+DR+C +SNG + SA DLLSCC CG GC GG P + W Sbjct: 105 IRDQSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLLSCCKDCGFGCRGGYPAVAW 164 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEIP 761 +YWK G+V+GGS GCR Y P Sbjct: 165 DYWKTHGIVTGGSKEDPSGCRSYPFP 190 >UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 precursor; n=11; Bilateria|Rep: Cathepsin B-like cysteine proteinase 6 precursor - Caenorhabditis elegans Length = 379 Score = 128 bits (308), Expect = 2e-28 Identities = 53/101 (52%), Positives = 67/101 (66%), Gaps = 2/101 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +RDQ SCGSCWAFGAVEAM+DR+C S+G SA+DLLSCC CG GC+GG P W Sbjct: 124 IRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCCKSCGFGCNGGDPLAAW 183 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEIPP--X*TSRTRAPGCP 800 YW G+V+G +Y ++ GC+PY PP + +T CP Sbjct: 184 RYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCP 224 Score = 37.5 bits (83), Expect = 0.43 Identities = 21/78 (26%), Positives = 35/78 (44%), Gaps = 5/78 (6%) Frame = +1 Query: 277 DEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIKDEHFATLPIK-----THKIDLIA 441 D+ I+ +N QN W A + + + K + + L +K + DL Sbjct: 44 DDLIDYVNENQNLWTAKKQRRFSSVYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDL 103 Query: 442 SLPENFDPRDXWPDCPTL 495 +PE+FD RD WP C ++ Sbjct: 104 DIPESFDSRDNWPKCDSI 121 >UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n=8; Strongylida|Rep: Cathepsin B-like cysteine protease 2 - Parelaphostrongylus tenuis Length = 344 Score = 124 bits (300), Expect = 2e-27 Identities = 51/87 (58%), Positives = 61/87 (70%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +RDQ CGSCWAFG+ EAM+DRVC S+G K SA+D+LSCC CG GC GG P W Sbjct: 113 IRDQSQCGSCWAFGSAEAMSDRVCIASHGNKTVELSADDILSCCYDCGDGCDGGYPISAW 172 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEIPP 764 EY+ G+V+GG Y + CRPYEIPP Sbjct: 173 
EYFVETGVVTGGLYGTKDSCRPYEIPP 199 >UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase precursor; n=28; Bilateria|Rep: Cathepsin B-like cysteine proteinase precursor - Schistosoma japonicum (Blood fluke) Length = 342 Score = 122 bits (294), Expect = 1e-26 Identities = 49/86 (56%), Positives = 59/86 (68%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +RDQ CGSCWAFGAVEAMTDR+C S G + SA DL+SCC CG GC GG P + W Sbjct: 109 IRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLISCCKDCGDGCQGGFPGVAW 168 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEIP 761 +YW G+V+GGS + GC+PY P Sbjct: 169 DYWVKRGIVTGGSKENHTGCQPYPFP 194 Score = 38.7 bits (86), Expect = 0.19 Identities = 27/80 (33%), Positives = 39/80 (48%), Gaps = 4/80 (5%) Frame = +1 Query: 268 PLSDEFINTINLKQNS-WKAGRNFPRDTSFAHLKKIMGVIKDEHFATL---PIKTHKIDL 435 PLSDE I+ IN ++ WKA ++ R S + +MG K++ P H DL Sbjct: 29 PLSDEMISFINEHPDAGWKADKS-DRFHSLDDARILMGARKEDAEMKRNRRPTVDHH-DL 86 Query: 436 IASLPENFDPRDXWPDCPTL 495 +P FD R WP C ++ Sbjct: 87 NVEIPSQFDSRKKWPHCKSI 106 >UniRef50_A1XG92 Cluster: Putative cathepsin B-like like proteinase; n=1; Tenebrio molitor|Rep: Putative cathepsin B-like like proteinase - Tenebrio molitor (Yellow mealworm) Length = 301 Score = 121 bits (291), Expect = 3e-26 Identities = 49/87 (56%), Positives = 59/87 (67%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +RDQ SCGSCWAFGAVEAM+DR+C +S+ + SAEDL CC CG GC+GG P L W Sbjct: 104 IRDQASCGSCWAFGAVEAMSDRICIHSDASVKVRISAEDLNDCCYDCGDGCNGGWPDLAW 163 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEIPP 764 YW G+V+GG Y GC+ Y I P Sbjct: 164 SYWSSTGIVTGGLYGVDEGCKAYSIKP 190 Score = 93.9 bits (223), Expect = 5e-18 Identities = 40/78 (51%), Positives = 59/78 (75%), Gaps = 1/78 (1%) Frame = +1 Query: 265 HPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVI-KDEHFATLPIKTHKIDLIA 441 HPLSDEFIN IN KQ +WKAGRNF +T +H+++++GV+ K + LP+KTH ++L A Sbjct: 24 HPLSDEFINEINSKQTTWKAGRNFDVNTPISHVRRLLGVLPKKANAPKLPVKTHAVNLDA 83 Query: 442 SLPENFDPRDXWPDCPTL 495 
+PE+FD R+ WP+C ++ Sbjct: 84 -IPESFDAREAWPECTSI 100 >UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=1; Biomphalaria glabrata|Rep: Cathepsin B preproprotein precursor - Biomphalaria glabrata (Bloodfluke planorb) Length = 333 Score = 118 bits (284), Expect = 2e-25 Identities = 46/86 (53%), Positives = 60/86 (69%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +RDQ +CGSCWAFG+ EAMTDR+C G + H SAED+ CC CG+GC+GG P W Sbjct: 106 IRDQANCGSCWAFGSAEAMTDRICIAGKG--NIHISAEDINDCCKSCGMGCNGGYPAAAW 163 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEIP 761 E++ G+VSGG Y ++ GC PY +P Sbjct: 164 EWYVDTGVVSGGQYGTNEGCMPYSLP 189 Score = 56.0 bits (129), Expect = 1e-06 Identities = 35/96 (36%), Positives = 51/96 (53%), Gaps = 5/96 (5%) Frame = +1 Query: 223 VTLVCVLAAAKDLP---HPLSDEFINTINLKQNS-WKAGRNF-PRDTSFAHLKKIMGVIK 387 V + +LA A P PLSD I IN N+ WKAGRNF P + A + + + Sbjct: 8 VAICGLLAVALATPFHIEPLSDAEIFYINHVANTTWKAGRNFHPAEIKRARALLGVNMAE 67 Query: 388 DEHFATLPIKTHKIDLIASLPENFDPRDXWPDCPTL 495 ++ + + +K ++ LP+NFDPR WPDC +L Sbjct: 68 NKAYNRIHLKYKQVQPRNDLPDNFDPRTKWPDCASL 103 >UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome 5 SCAF15026, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 351 Score = 116 bits (279), Expect = 8e-25 Identities = 48/79 (60%), Positives = 56/79 (70%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +RDQGSCGSCWAFGA EAM+DRVC +SN SA+DLL+CC CG+GC+GG P W Sbjct: 98 IRDQGSCGSCWAFGASEAMSDRVCIHSNAKVSVELSAQDLLTCCNSCGMGCNGGYPSSAW 157 Query: 684 EYWKXFGLVSGGSYHSSXG 740 +W GLVSGG Y S G Sbjct: 158 NFWVSDGLVSGGLYDSHIG 176 Score = 62.1 bits (144), Expect = 2e-08 Identities = 34/96 (35%), Positives = 55/96 (57%), Gaps = 2/96 (2%) Frame = +1 Query: 214 AAYVTLVCVLAAAKDLPH--PLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIK 387 AA++ L +++ PH PLS E +N IN ++W AG NF + ++++KK+ G + Sbjct: 4 
AAFLFLAAAWSSSLARPHLKPLSSEMVNYINKLNSTWTAGHNF-HNVDYSYVKKLCGTLL 62 Query: 388 DEHFATLPIKTHKIDLIASLPENFDPRDXWPDCPTL 495 L I+ + D+ LP+ FD R+ WP+CPTL Sbjct: 63 KGPKLPLMIR-YAGDI--KLPKEFDSREQWPNCPTL 95 >UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4 precursor; n=5; Caenorhabditis|Rep: Cathepsin B-like cysteine proteinase 4 precursor - Caenorhabditis elegans Length = 335 Score = 115 bits (277), Expect = 1e-24 Identities = 50/104 (48%), Positives = 59/104 (56%), Gaps = 2/104 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +RDQ CGSCWAF A EA +DR C SNG + SAED+LSCC CG GC GG P W Sbjct: 100 IRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSCCSNCGYGCEGGYPINAW 159 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEIPP--X*TSRTRAPGCPXXG 809 +Y G +GGSY + GC+PY + P P CP G Sbjct: 160 KYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSCPDDG 203 Score = 38.3 bits (85), Expect = 0.25 Identities = 28/99 (28%), Positives = 48/99 (48%), Gaps = 7/99 (7%) Frame = +1 Query: 220 YVTLVCVLAAAKDLPHPL----SDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIK 387 Y+ L ++A L PL + +N KQ+ WKA P+D + +KK + ++ Sbjct: 3 YLILAALVAVTAGLVIPLVPKTQEAITEYVNSKQSLWKA--EIPKDITIEQVKKRL--MR 58 Query: 388 DEHFA--TLPIKTHKIDLIA-SLPENFDPRDXWPDCPTL 495 E A T ++ K D+ ++P FD R WP+C ++ Sbjct: 59 TEFVAPHTPDVEVVKHDINEDTIPATFDARTQWPNCMSI 97 >UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=1; Nilaparvata lugens|Rep: Cathepsin B-like protease precursor - Nilaparvata lugens (Brown planthopper) Length = 347 Score = 115 bits (276), Expect = 2e-24 Identities = 45/87 (51%), Positives = 55/87 (63%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +RDQG+CGSCWA A DR+C SN + H S+ +L+SCC CG GC GG P W Sbjct: 111 IRDQGNCGSCWAVSVAAAFADRLCIASNAKWNGHISSRELMSCCSYCGFGCEGGFPDAAW 170 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEIPP 764 + K GLV+GG YHS GC+PY I P Sbjct: 171 VFIKRHGLVTGGDYHSHDGCQPYPIAP 197 Score = 44.0 bits (99), Expect = 0.005 Identities = 27/99 (27%), Positives = 53/99 (53%), Gaps = 
6/99 (6%) Frame = +1 Query: 217 AYVTLVCVLAAAKDLPHPLSDEFINTINLKQNS-WKAGRNFPRDTSFAHLKKIMGVIK-D 390 A V+ + L ++ +++++I+ IN S WKAG NF DT ++L+ ++GV + + Sbjct: 10 AVVSAISALPDQENTVREIANKWIDAINNNPKSTWKAGHNFHPDTPMSYLQGLLGVSELE 69 Query: 391 EHFATL----PIKTHKIDLIASLPENFDPRDXWPDCPTL 495 + A L ++ ++ + +P+ FD R W C +L Sbjct: 70 SNLADLDKYEEMEENEENKKIKVPKYFDARKKWKKCKSL 108 >UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep: Cathepsin B - Apriona germari Length = 324 Score = 113 bits (273), Expect = 4e-24 Identities = 46/83 (55%), Positives = 58/83 (69%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +RD+G+CGSCWAF AVE M+DR+C S G K F FSAE+++SCC CG GC GG + Sbjct: 103 IRDEGACGSCWAFAAVEVMSDRLCLASEGRKKFIFSAEEVVSCCTACGGGCRGGFLNEPY 162 Query: 684 EYWKXFGLVSGGSYHSSXGCRPY 752 +YW G+ SGG Y S GC+PY Sbjct: 163 KYWVTNGIPSGGDYGSKLGCKPY 185 Score = 50.4 bits (115), Expect = 6e-05 Identities = 25/76 (32%), Positives = 44/76 (57%), Gaps = 2/76 (2%) Frame = +1 Query: 274 SDEFINTINLKQNSWKAGRNFPRDT--SFAHLKKIMGVIKDEHFATLPIKTHKIDLIASL 447 ++ FI +IN K +W A +NF T L ++G+ +D + TLP+ H + I+ + Sbjct: 28 TEAFIQSINEKATTWTARKNFEGRTPEQLKALADVIGINRDPN-VTLPVVFH--EAISGI 84 Query: 448 PENFDPRDXWPDCPTL 495 P++FD R+ WP C ++ Sbjct: 85 PDSFDAREQWPFCESI 100 >UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase precursor; n=29; Schistosomatidae|Rep: Cathepsin B-like cysteine proteinase precursor - Schistosoma mansoni (Blood fluke) Length = 340 Score = 112 bits (269), Expect = 1e-23 Identities = 45/86 (52%), Positives = 56/86 (65%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +RDQ CGSCW+FGAVEAM+DR C S G ++ SA DLL+CC CGLGC GG+ W Sbjct: 108 IRDQSRCGSCWSFGAVEAMSDRSCIQSGGKQNVELSAVDLLTCCESCGLGCEGGILGPAW 167 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEIP 761 +YW G+V+ S + GC PY P Sbjct: 168 DYWVKEGIVTASSKENHTGCEPYPFP 193 Score = 36.7 bits (81), Expect = 0.75 Identities = 26/80 (32%), Positives = 38/80 (47%), Gaps = 4/80 (5%) Frame = +1 
Query: 268 PLSDEFINTINLKQNS-WKAGRNFPRDTSFAHLKKIMGVIKDE---HFATLPIKTHKIDL 435 PLSD+ I+ IN N+ W+A ++ R S + MG ++E P H D Sbjct: 28 PLSDDIISYINEHPNAGWRAEKS-NRFHSLDDARIQMGARREEPDLRRKRRPTVDHN-DW 85 Query: 436 IASLPENFDPRDXWPDCPTL 495 +P NFD R WP C ++ Sbjct: 86 NVEIPSNFDSRKKWPGCKSI 105 >UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8.4; n=1; Caenorhabditis elegans|Rep: Putative uncharacterized protein W07B8.4 - Caenorhabditis elegans Length = 335 Score = 111 bits (268), Expect = 2e-23 Identities = 48/90 (53%), Positives = 57/90 (63%), Gaps = 3/90 (3%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCP---ICGLGCSGGMPX 674 +RDQ CGSCWA A EA++DR C SNG + SAED+L+CC CG GC GG P Sbjct: 92 IRDQSHCGSCWAVAAAEAISDRTCIASNGDVNTLLSAEDILTCCTGKFNCGDGCEGGYPI 151 Query: 675 LTWEYWKXFGLVSGGSYHSSXGCRPYEIPP 764 W YW GLV+GGS+ S GC+PY I P Sbjct: 152 QAWRYWVKNGLVTGGSFESQYGCKPYSIAP 181 Score = 38.7 bits (86), Expect = 0.19 Identities = 30/88 (34%), Positives = 45/88 (51%), Gaps = 1/88 (1%) Frame = +1 Query: 226 TLVCVLAAAKDLPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIKDEHFAT 405 +L+ +LAA+ + P + FIN IN Q W A T F ++ ++K EH A Sbjct: 7 SLLFILAASA-VVLPRNKLFINHINSAQKLWTAEHY---TTPF----EVKNLMKVEHVAA 58 Query: 406 LPIKTHKIDLIA-SLPENFDPRDXWPDC 486 K K+ A S+P+++D RD WP C Sbjct: 59 HLDKDIKLAETADSIPDSYDVRDHWPQC 86 >UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA; n=1; Tribolium castaneum|Rep: PREDICTED: similar to CG10992-PA - Tribolium castaneum Length = 325 Score = 111 bits (267), Expect = 2e-23 Identities = 45/83 (54%), Positives = 57/83 (68%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +R+QG+CGSCWAF + E MTDR+C S G F FS E+LL+CC CG GC GG W Sbjct: 96 IRNQGNCGSCWAFASTEVMTDRLCISSKGKIKFVFSPENLLTCCKDCGCGCKGGYIKNAW 155 Query: 684 EYWKXFGLVSGGSYHSSXGCRPY 752 +Y+ G+ SGG Y+SS GC+PY Sbjct: 156 DYYINEGIASGGDYNSSEGCQPY 178 Score = 50.0 bits (114), Expect = 8e-05 Identities = 30/90 (33%), Positives = 45/90 (50%), Gaps = 2/90 (2%) Frame 
= +1 Query: 223 VTLVCVLAA--AKDLPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIKDEH 396 +T +C L + P+ S + I IN +Q SWKA N +G+ D + Sbjct: 4 ITFLCALTLPLSWSKPNTSSLQVIQEINSEQISWKAETNC---LDIKSRLGFLGLHPDPN 60 Query: 397 FATLPIKTHKIDLIASLPENFDPRDXWPDC 486 + + K HKI I S+PE+FD R+ WP+C Sbjct: 61 YK-IQTKQHKISRIISIPESFDAREKWPEC 89 >UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin B-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 331 Score = 110 bits (265), Expect = 4e-23 Identities = 47/93 (50%), Positives = 55/93 (59%) Frame = +3 Query: 486 SNVEXXVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGG 665 S+V V DQ CGSCWA A AM+DR C S G SAE+LLSCC CG GC GG Sbjct: 93 SDVISTVVDQSDCGSCWAVAAASAMSDRRCIASQGKLKVPVSAENLLSCCDSCGYGCEGG 152 Query: 666 MPXLTWEYWKXFGLVSGGSYHSSXGCRPYEIPP 764 P + W YW G+ +GG Y S GC+PY + P Sbjct: 153 YPTMAWSYWIDTGITTGGLYGSKQGCQPYSLQP 185 Score = 66.5 bits (155), Expect = 8e-10 Identities = 33/94 (35%), Positives = 56/94 (59%), Gaps = 2/94 (2%) Frame = +1 Query: 211 RAAYVT--LVCVLAAAKDLPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVI 384 +AA++ L+ ++ + K P+PLS++FIN IN KQ++W AG+NF + S +K ++G Sbjct: 2 KAAFIITLLLPIVLSYKGSPNPLSNDFINYINSKQSTWVAGKNFDENLSIQEIKNLLGAK 61 Query: 385 KDEHFATLPIKTHKIDLIASLPENFDPRDXWPDC 486 K + TH D+ +P +FD R+ W +C Sbjct: 62 KGK-LGVAKEFTHSEDI--QVPNSFDARENWKEC 92 >UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Cathepsin B - Fasciola gigantica (Giant liver fluke) Length = 339 Score = 110 bits (264), Expect = 5e-23 Identities = 43/83 (51%), Positives = 55/83 (66%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +RDQ SCGSCWA A AM+DRVC +SNG +A D LSCC CG GC GG P W Sbjct: 105 IRDQASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAADPLSCCTYCGQGCRGGYPPKAW 164 Query: 684 EYWKXFGLVSGGSYHSSXGCRPY 752 +YW G+V+GG++ + GC+P+ Sbjct: 165 DYWMREGIVTGGTWENRTGCQPW 187 Score = 44.4 bits (100), Expect = 0.004 Identities = 29/78 (37%), Positives 
= 39/78 (50%), Gaps = 4/78 (5%) Frame = +1 Query: 274 SDEFINTINLKQN-SWKAGRNFPRDTSFAHLKKIMGVIKD---EHFATLPIKTHKIDLIA 441 SDE I +N + SWKA R+ R ++ H K +G + + E A P H I Sbjct: 27 SDELIRFVNEESGASWKAARS-TRFSNVDHFKLHLGALSETPEERNALRPTIKHDISK-N 84 Query: 442 SLPENFDPRDXWPDCPTL 495 LPE+FD R WP C T+ Sbjct: 85 DLPESFDARSQWPQCWTI 102 >UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n=4; Tenebrionidae|Rep: Putative cathepsin B-like proteinase - Tenebrio molitor (Yellow mealworm) Length = 321 Score = 108 bits (259), Expect = 2e-22 Identities = 44/83 (53%), Positives = 60/83 (72%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +RDQG+CGSCWAF ++E+M+DR+C +S+G+ F FS EDLLSCC CG C GG Sbjct: 102 IRDQGACGSCWAFASIESMSDRICIHSSGSAQFMFSPEDLLSCCTSCG-DCGGGYMMSAL 160 Query: 684 EYWKXFGLVSGGSYHSSXGCRPY 752 +++ G+VSGG +S+ GCRPY Sbjct: 161 DFYINEGIVSGGDVNSNEGCRPY 183 Score = 70.5 bits (165), Expect = 5e-11 Identities = 42/103 (40%), Positives = 62/103 (60%), Gaps = 3/103 (2%) Frame = +1 Query: 196 KMFVSRAAYVTLVCVLAAAKDLPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIM 375 K+F+S +V LV VL+A+ LS EFI++IN Q+SW AGRNFP +T+ +L K+ Sbjct: 2 KIFLS---FVVLVAVLSASLAEIDVLSSEFIDSINRIQSSWVAGRNFPENTTNEYLYKLN 58 Query: 376 GVI---KDEHFATLPIKTHKIDLIASLPENFDPRDXWPDCPTL 495 G I D ++ P+ H + +PE+FD R WP+C +L Sbjct: 59 GFIGLHPDPNYKP-PVLVHTFN-ARDVPESFDARTKWPNCDSL 99 >UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep: Cathepsin b - Aedes aegypti (Yellowfever mosquito) Length = 386 Score = 107 bits (257), Expect = 4e-22 Identities = 46/85 (54%), Positives = 52/85 (61%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +RDQG CGSCWA A AMTDR C S G + F F + DLLSCC CG GC GG W Sbjct: 144 IRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLLSCCHSCGQGCRGGTLGPAW 203 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEI 758 ++W GL SGG +S GC PY I Sbjct: 204 QFWVEKGLSSGGPLNSRQGCHPYPI 228 >UniRef50_Q23FP9 Cluster: Papain family cysteine protease containing protein; n=1; 
Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 340 Score = 107 bits (256), Expect = 5e-22 Identities = 44/88 (50%), Positives = 53/88 (60%), Gaps = 1/88 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPXLT 680 +RDQ +CGSCWAF A E +DR+C SN T S+EDLL CC CG+GC GG P Sbjct: 107 IRDQSTCGSCWAFAATETFSDRICIASNQTLQTSISSEDLLECCADYCGMGCKGGYPSAA 166 Query: 681 WEYWKXFGLVSGGSYHSSXGCRPYEIPP 764 W Y K G+ +GG Y C+PY PP Sbjct: 167 WGYMKRQGVSTGGLYGDDTSCKPYIFPP 194 Score = 34.7 bits (76), Expect = 3.0 Identities = 24/79 (30%), Positives = 39/79 (49%), Gaps = 4/79 (5%) Frame = +1 Query: 271 LSDEFINTINLKQNS-WKAGR--NFPRDTSFAHLKKIMGVIKDEHFATLPIKTHKIDLIA 441 +S + +N NS WKA R +F + T L +G + + + LP K + A Sbjct: 27 MSPFIVFEVNSNPNSTWKAARYPHFEKMTR-EQLLGHLGSLDEPDWVKLPTKEFDPNANA 85 Query: 442 S-LPENFDPRDXWPDCPTL 495 +PE FD R+ WP+C ++ Sbjct: 86 DPIPEFFDAREQWPNCQSI 104 >UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7; Rhabditida|Rep: Cysteine proteinase 3 - Necator americanus (Human hookworm) Length = 360 Score = 105 bits (252), Expect = 1e-21 Identities = 42/87 (48%), Positives = 53/87 (60%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +RDQ CGSCWA + E M+DR+C SNGT S D+L+CCP CG GC GG W Sbjct: 109 IRDQSHCGSCWAVSSAETMSDRLCVQSNGTIKVLLSDTDILACCPNCGAGCGGGHTIRAW 168 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEIPP 764 EY+K G+ +GG Y + C+PY P Sbjct: 169 EYFKNTGVCTGGLYGTKDSCKPYAFYP 195 >UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep: Cathepsin b - Aedes aegypti (Yellowfever mosquito) Length = 332 Score = 104 bits (249), Expect = 3e-21 Identities = 41/88 (46%), Positives = 59/88 (67%), Gaps = 1/88 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGG-MPXLT 680 +++QG CG+CWA AV M+DR+C +S G +AEDL+ CC CG GC+GG + + Sbjct: 104 IKNQGLCGACWAVAAVSVMSDRLCIHSEGKFDVELAAEDLMGCCKDCGNGCNGGFLDGTS 163 Query: 681 WEYWKXFGLVSGGSYHSSXGCRPYEIPP 
764 ++YW GLVSG +Y+S+ GC+PY P Sbjct: 164 FQYWVDVGLVSGAAYNSTDGCKPYPFKP 191 Score = 46.8 bits (106), Expect = 7e-04 Identities = 23/89 (25%), Positives = 41/89 (46%), Gaps = 1/89 (1%) Frame = +1 Query: 232 VCVLAAAKDL-PHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIKDEHFATL 408 V V+A ++ L P +D F+ + +W F F + + + G+ + + L Sbjct: 13 VVVIARSERLGDDPFNDGFLAQVQRHAKTWTPDATFRDGIRFENFQNMKGIFESKIGFRL 72 Query: 409 PIKTHKIDLIASLPENFDPRDXWPDCPTL 495 P K H + +PE FD R+ WP C ++ Sbjct: 73 PTKRHDVAYNMDIPEFFDAREKWPYCKSI 101 >UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoidea|Rep: Cathepsin B-S precursor - Tuberaphis styraci Length = 349 Score = 103 bits (248), Expect = 4e-21 Identities = 39/87 (44%), Positives = 54/87 (62%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +RDQG+CGSCW+F A DR+C + G + S E+L CC CG GC GG P W Sbjct: 104 IRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEELAFCCMDCGKGCGGGYPIKAW 163 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEIPP 764 +Y++ G+ +GG Y + GC PY++PP Sbjct: 164 KYFRTQGVTTGGDYDTKEGCMPYKVPP 190 Score = 48.4 bits (110), Expect = 2e-04 Identities = 32/98 (32%), Positives = 50/98 (51%), Gaps = 7/98 (7%) Frame = +1 Query: 214 AAYVTLVCVLAAAKDLPHP----LSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGV 381 A +VT+VC + + L P LSDE I IN +WKA R FP +TS + ++G Sbjct: 2 AKFVTIVCAIFVSVYLAEPTLQFLSDERIKYINEVAKTWKAERYFPANTSEEYFIGLLGS 61 Query: 382 IKDEHFATLPIKTHKIDLI---ASLPENFDPRDXWPDC 486 +++ T ++ K D + + P+ FD R+ W C Sbjct: 62 RGYKNY-TNEVEIKKYDPLYVENNSPKQFDSRENWKSC 98 >UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin B-like cysteine proteinase 4 precursor (Cysteine protease-related 4); n=2; Tribolium castaneum|Rep: PREDICTED: similar to Cathepsin B-like cysteine proteinase 4 precursor (Cysteine protease-related 4) - Tribolium castaneum Length = 360 Score = 102 bits (244), Expect = 1e-20 Identities = 41/83 (49%), Positives = 51/83 (61%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +R+QG C S WAF A E M+DR+C 
+NG S EDL+ CC CG C GG W Sbjct: 92 IRNQGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPEDLIDCCHYCGNQCKGGYTYYAW 151 Query: 684 EYWKXFGLVSGGSYHSSXGCRPY 752 Y+ GLVSGG Y++S GC+PY Sbjct: 152 NYFMLTGLVSGGDYNTSTGCQPY 174 Score = 39.1 bits (87), Expect = 0.14 Identities = 25/67 (37%), Positives = 36/67 (53%) Frame = +1 Query: 286 INTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIKDEHFATLPIKTHKIDLIASLPENFDP 465 IN IN +Q++W AG N P D + L +G+ D +F IK + +PE FD Sbjct: 23 INQINSQQSAWTAGIN-PFDDIESRLG-FLGIHPDPNFKP-EIKEPQATQNV-IPETFDA 78 Query: 466 RDXWPDC 486 R+ WP+C Sbjct: 79 REYWPEC 85 >UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1 precursor; n=3; Haemonchidae|Rep: Cathepsin B-like cysteine proteinase 1 precursor - Ostertagia ostertagi Length = 341 Score = 102 bits (244), Expect = 1e-20 Identities = 41/87 (47%), Positives = 55/87 (63%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 + DQ +CGSCWA + AM+DR+C S G K SA+D++SCC CG GC GG P + Sbjct: 110 IPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVVSCCTWCGDGCEGGWPISAF 169 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEIPP 764 + G+V+GG Y++ CRPYEI P Sbjct: 170 RFHADEGVVTGGDYNTKGSCRPYEIHP 196 >UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8.1; n=1; Caenorhabditis elegans|Rep: Putative uncharacterized protein W07B8.1 - Caenorhabditis elegans Length = 335 Score = 101 bits (243), Expect = 2e-20 Identities = 44/90 (48%), Positives = 56/90 (62%), Gaps = 3/90 (3%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCP---ICGLGCSGGMPX 674 + D C + WAF A E+M+DR+C S G K+ SAE+LLSCC CG GC GG P Sbjct: 95 INDISECKTSWAFAAAESMSDRLCINSGGFKNTILSAEELLSCCTGMFSCGEGCEGGNPF 154 Query: 675 LTWEYWKXFGLVSGGSYHSSXGCRPYEIPP 764 W+Y + G+ +GGSY S GC+PY IPP Sbjct: 155 KAWQYIQKHGIPTGGSYESQFGCKPYSIPP 184 Score = 35.1 bits (77), Expect = 2.3 Identities = 26/102 (25%), Positives = 47/102 (46%), Gaps = 1/102 (0%) Frame = +1 Query: 211 RAAYVTLVCVLAAAKDLPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIKD 390 R + L+ VL A +P D I+ +N ++ +W AG P + + LK + + 
D Sbjct: 2 RKILICLIGVLFQADGVPPSEIDRIIHYVNSQKTTWTAG--IPALSRNSMLKTL---VTD 56 Query: 391 EHFATLPIKTHKIDLIAS-LPENFDPRDXWPDCPTLX*XSEI 513 I+ + S L +FD R+ WP+C ++ ++I Sbjct: 57 AATIGFKIQNFGVSQANSDLSPSFDARERWPECMSIPQINDI 98 >UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3 precursor; n=4; Caenorhabditis|Rep: Cathepsin B-like cysteine proteinase 3 precursor - Caenorhabditis elegans Length = 370 Score = 101 bits (243), Expect = 2e-20 Identities = 45/93 (48%), Positives = 54/93 (58%), Gaps = 1/93 (1%) Frame = +3 Query: 489 NVEXXVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC-PICGLGCSGG 665 N +R+Q +CGSCWAFGA E ++DRVC SNGT+ S ED+LSCC CG GC GG Sbjct: 106 NTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDILSCCGTTCGYGCKGG 165 Query: 666 MPXLTWEYWKXFGLVSGGSYHSSXGCRPYEIPP 764 +W G V+GG Y GC PY P Sbjct: 166 YSIEALRFWASSGAVTGGDY-GGHGCMPYSFAP 197 Score = 33.1 bits (72), Expect = 9.3 Identities = 22/74 (29%), Positives = 33/74 (44%), Gaps = 4/74 (5%) Frame = +1 Query: 286 INTINLKQNSWKAGRN----FPRDTSFAHLKKIMGVIKDEHFATLPIKTHKIDLIASLPE 453 ++ +N Q SW A N F +K + KD A+ +I + LP+ Sbjct: 36 VDHVNTVQTSWVAEHNEISEFEMKFKVMDVKFAEPLEKDSDVASELFVRGEI-VPEPLPD 94 Query: 454 NFDPRDXWPDCPTL 495 FD R+ WPDC T+ Sbjct: 95 TFDAREKWPDCNTI 108 >UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2 precursor; n=8; Haemonchus contortus|Rep: Cathepsin B-like cysteine proteinase 2 precursor - Haemonchus contortus (Barber pole worm) Length = 342 Score = 101 bits (241), Expect = 3e-20 Identities = 42/88 (47%), Positives = 55/88 (62%), Gaps = 1/88 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPXLT 680 +RDQ +CGSCWA A++DR+C S K + SA D+++CC P CG GC GG P Sbjct: 105 IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIMTCCRPQCGDGCEGGWPIEA 164 Query: 681 WEYWKXFGLVSGGSYHSSXGCRPYEIPP 764 W+Y+ G+VSGG Y + CRPY I P Sbjct: 165 WKYFIYDGVVSGGEYLTKDVCRPYPIHP 192 >UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus contortus|Rep: Cysteine proteinase - Haemonchus contortus (Barber pole 
worm) Length = 350 Score = 100 bits (240), Expect = 4e-20 Identities = 45/100 (45%), Positives = 54/100 (54%), Gaps = 1/100 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCP-ICGLGCSGGMPXLT 680 VRDQ CGSCWA A M+DR+C + G S D+LSCC +CG GC GG L Sbjct: 113 VRDQSRCGSCWAVSAASTMSDRICVQTKGKLQTILSDTDILSCCGRMCGDGCEGGYDHLA 172 Query: 681 WEYWKXFGLVSGGSYHSSXGCRPYEIPPX*TSRTRAPGCP 800 WE+ + FG+V+GG Y CRPY P R CP Sbjct: 173 WEWVQRFGVVTGGPYQQKGVCRPYAFHPCGLHHGRRYDCP 212 >UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep: Cathepsin B - Pandalus borealis (Northern red shrimp) Length = 328 Score = 99 bits (238), Expect = 7e-20 Identities = 43/103 (41%), Positives = 57/103 (55%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +RDQG+CGSCWA A MTDR C + G F FS+E++ +CC CG C GG + Sbjct: 95 IRDQGNCGSCWAVSAASVMTDRTCIDTEGLVDFRFSSENVAACCTECGNACYGGDEDTAF 154 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEIPPX*TSRTRAPGCPXXGD 812 +W G VSGG ++S+ GC+PY + P P GD Sbjct: 155 THWVTKGFVSGGRHNSNEGCQPYSVEEC-EHHIEGPRPPCEGD 196 Score = 71.7 bits (168), Expect = 2e-11 Identities = 36/89 (40%), Positives = 50/89 (56%) Frame = +1 Query: 229 LVCVLAAAKDLPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIKDEHFATL 408 L+ ++AAA PLSDEF+ + KQ +WKAGRNF +D S LK + V K+ L Sbjct: 6 LLALVAAASAELDPLSDEFLELLQSKQMTWKAGRNFAKDISKDFLKSLNCVRKNPDIPKL 65 Query: 409 PIKTHKIDLIASLPENFDPRDXWPDCPTL 495 P+K + +P FD R+ WP CP + Sbjct: 66 PLK--NVTPTKEIPVEFDAREQWPHCPCI 92 >UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gambiae str. PEST|Rep: ENSANGP00000012227 - Anopheles gambiae str. 
PEST Length = 218 Score = 99 bits (238), Expect = 7e-20 Identities = 41/78 (52%), Positives = 57/78 (73%), Gaps = 1/78 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGG-MPXLT 680 +R+QG+CGSCWA A M+DRVC +SNGT + +AEDL+ CC CG GC+GG + + Sbjct: 20 IRNQGTCGSCWAVAAASVMSDRVCIHSNGTINVALAAEDLMGCCVDCGNGCNGGFLDGTS 79 Query: 681 WEYWKXFGLVSGGSYHSS 734 ++YW GLVSGG+Y+S+ Sbjct: 80 FQYWVDAGLVSGGAYNST 97 >UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep: Cathepsin B - Uronema marinum Length = 350 Score = 98.7 bits (235), Expect = 2e-19 Identities = 46/95 (48%), Positives = 57/95 (60%), Gaps = 8/95 (8%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCP---ICGLGCSGGMPX 674 VRDQ +CGSCWAFG VEA++DR+C S S+E+LLSCC CG+GC+GG Sbjct: 105 VRDQSNCGSCWAFGTVEAISDRICIASGQKDQTRISSENLLSCCRGTFACGMGCNGGYTA 164 Query: 675 LTWEYWKXFGLVSGGSY-----HSSXGCRPYEIPP 764 W Y+ GLVSG Y +S C+PY PP Sbjct: 165 GAWNYYVKTGLVSGNLYTDDNQNSKTECQPYSFPP 199 >UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_115, whole genome shotgun sequence - Paramecium tetraurelia Length = 332 Score = 98.3 bits (234), Expect = 2e-19 Identities = 45/92 (48%), Positives = 56/92 (60%), Gaps = 5/92 (5%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPI-CGL----GCSGGM 668 + DQG+CGSCWA A M+DR+C S T SAEDLLSCC I C L GC GG Sbjct: 90 IPDQGNCGSCWAVSAASTMSDRLCIASGQTDKRQISAEDLLSCCGINCELDGNGGCDGGY 149 Query: 669 PXLTWEYWKXFGLVSGGSYHSSXGCRPYEIPP 764 P W+Y + G+V+GG+Y+ C+PY PP Sbjct: 150 PYGAWKYLRVDGIVTGGTYNDFSLCKPYSFPP 181 Score = 37.9 bits (84), Expect = 0.33 Identities = 24/90 (26%), Positives = 44/90 (48%), Gaps = 2/90 (2%) Frame = +1 Query: 232 VCVLAAAKDLPHPLSDEFINTINLKQNSWKAGRNFPR--DTSFAHLKKIMGVIKDEHFAT 405 +C++ + +P F+N+I + +W A N+ R + S K VI D H Sbjct: 5 ICLIISLVSARNPFITAFVNSI---KTTWTA-TNYERWNEKSDGFYSKYFNVIVD-HSEP 59 Query: 406 LPIKTHKIDLIASLPENFDPRDXWPDCPTL 
495 + K H + + +LP +F ++ WP CP++ Sbjct: 60 VEYKYH--EKLENLPPSFSAQEKWPGCPSI 87 >UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2; Arthropoda|Rep: Cathepsin B-like cysteine protease - Callosobruchus maculatus (Southern cowpea weevil) (Pulse bruchid) Length = 330 Score = 96.7 bits (230), Expect = 7e-19 Identities = 39/89 (43%), Positives = 51/89 (57%), Gaps = 3/89 (3%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGL---GCSGGMPX 674 +RDQ CGSCWA + M+DR+C S+ SA D++ CC C GC GG+P Sbjct: 100 IRDQSGCGSCWAVSSASVMSDRICIQSDQKNQLRISAADMIECCESCTFSVDGCHGGIPS 159 Query: 675 LTWEYWKXFGLVSGGSYHSSXGCRPYEIP 761 T+ WK G VSGG Y+S+ GC Y +P Sbjct: 160 FTFTEWKDSGFVSGGEYNSTNGCMSYPLP 188 Score = 58.8 bits (136), Expect = 2e-07 Identities = 33/97 (34%), Positives = 50/97 (51%), Gaps = 2/97 (2%) Frame = +1 Query: 211 RAAYVTLVCVLAAAKDLPHP--LSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVI 384 + A++ L V++ P LSDE+I +N K WKAGRNF RDTS ++++++ V Sbjct: 2 KLAFIALAAVVSCTFAQPELDFLSDEYIEQLNSKNLPWKAGRNFERDTSLYNIQRLLSVG 61 Query: 385 KDEHFATLPIKTHKIDLIASLPENFDPRDXWPDCPTL 495 + H+ D LPE FD R W C ++ Sbjct: 62 TINPPSEFETIFHEDD-GKDLPEEFDARKQWSKCESI 97 >UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7; n=2; Haemonchidae|Rep: Cathepsin B-like cysteine protease GCP7 - Haemonchus contortus (Barber pole worm) Length = 348 Score = 95.9 bits (228), Expect = 1e-18 Identities = 41/101 (40%), Positives = 57/101 (56%), Gaps = 2/101 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPXLT 680 + DQ +CGSCWA A + M+DR+C +S G K SA D+L+CC CG GC GG Sbjct: 115 IPDQSNCGSCWAVSAAQCMSDRLCIHSQGRKKVLLSATDILACCGKFCGYGCDGGYNARA 174 Query: 681 WEYWKXFGLVSGGSYHSSXGCRPYEIPPX*TSRTRA-PGCP 800 W++ G+V+GG+Y C+PY P + +A CP Sbjct: 175 WKWATIAGVVTGGAYKEKGNCKPYVFPQCGAHKGKAFNNCP 215 >UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4; Ancylostomatidae|Rep: Cysteine proteinase - Ancylostoma ceylanicum Length = 348 Score = 92.3 bits (219), Expect = 1e-17 Identities = 41/88 (46%), 
Positives = 51/88 (57%), Gaps = 1/88 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPXLT 680 +RDQ SCGSCWA A AM+DRVC +NG + S ++LSCC CG GC GG P Sbjct: 113 IRDQSSCGSCWAVAAASAMSDRVCALTNGRINRILSDTEVLSCCFGSCGFGCKGGYPARA 172 Query: 681 WEYWKXFGLVSGGSYHSSXGCRPYEIPP 764 + Y +GL +GG Y C+PY P Sbjct: 173 FGYAWRYGLSTGGPYGEKDACQPYAFYP 200 >UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8; Leishmania|Rep: Cathepsin B-like protease - Leishmania major Length = 340 Score = 91.9 bits (218), Expect = 2e-17 Identities = 40/87 (45%), Positives = 53/87 (60%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +RDQ +CGSCWA AVEA++DR CT+ G S +LLSCC ICGLGC GG+P + W Sbjct: 117 IRDQSNCGSCWAIAAVEAISDRYCTFG-GVPDRRMSTSNLLSCCFICGLGCHGGIPTVAW 175 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEIPP 764 +W G+ ++ C+PY P Sbjct: 176 LWWVWVGI-------ATEDCQPYPFDP 195 Score = 37.1 bits (82), Expect = 0.57 Identities = 28/94 (29%), Positives = 39/94 (41%), Gaps = 4/94 (4%) Frame = +1 Query: 226 TLVCVLAAAKDLPHPLSDEFINTINLK-QNSWKAGRN---FPRDTSFAHLKKIMGVIKDE 393 T+ + A D P L F+ +N K + W A N S ++K+MGV Sbjct: 22 TVSGLYAKPSDFPL-LGKSFVAEVNSKAKGQWTASANNGYLVTGKSLGEVRKLMGVTDMS 80 Query: 394 HFATLPIKTHKIDLIASLPENFDPRDXWPDCPTL 495 A P +L LPE FD + WP C T+ Sbjct: 81 TEAVPPRNFSVEELQQDLPEFFDAAEHWPMCLTI 114 >UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep: Thiol protease - Trichuris suis Length = 348 Score = 91.5 bits (217), Expect = 2e-17 Identities = 38/85 (44%), Positives = 50/85 (58%), Gaps = 1/85 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPI-CGLGCSGGMPXLT 680 +RDQ CGSCWA A E M+DR+C SN + S D+LSCC + CG GC+GG P Sbjct: 102 IRDQAKCGSCWAVSAAETMSDRICVQSNCSIKACISDTDILSCCGLYCGYGCNGGFPIEA 161 Query: 681 WEYWKXFGLVSGGSYHSSXGCRPYE 755 W ++ G +GG GC+PY+ Sbjct: 162 WRHFTVAGNCTGGKTIDKYGCKPYK 186 >UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator americanus|Rep: Cysteine proteinase 4 - Necator americanus (Human 
hookworm) Length = 339 Score = 91.1 bits (216), Expect = 3e-17 Identities = 38/88 (43%), Positives = 50/88 (56%), Gaps = 1/88 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPXLT 680 +RD +CGSCWA A M+DR+C +NGT S+ D+L+CC CG GC GG P Sbjct: 107 IRDHSACGSCWAVSAASVMSDRLCIQTNGTNQKILSSADILACCGEDCGSGCEGGYPIQA 166 Query: 681 WEYWKXFGLVSGGSYHSSXGCRPYEIPP 764 + Y + G+ SGG Y C+PY P Sbjct: 167 YFYLENTGVCSGGEYREKNVCKPYPFYP 194 >UniRef50_Q237A1 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 346 Score = 90.2 bits (214), Expect = 6e-17 Identities = 38/87 (43%), Positives = 53/87 (60%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 VRDQ +CGSCWAFGA E+++DR C + + S ++LL+CC CG GC GG P Sbjct: 113 VRDQSTCGSCWAFGAAESLSDRHCIHLG--QDIRLSTQNLLTCCAACGDGCDGGWPEAAM 170 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEIPP 764 +Y+ GLV+G Y ++ C+ Y P Sbjct: 171 DYYVNTGLVTGDLYGNNSWCQAYTFAP 197 Score = 34.7 bits (76), Expect = 3.0 Identities = 26/80 (32%), Positives = 37/80 (46%), Gaps = 3/80 (3%) Frame = +1 Query: 265 HPLSDEFINTINLKQNSWKAGRNFP-RDTSFAHLKKIMGVIKDEHFATLPIKTHKIDLIA 441 H + I +N ++WKAG N ++ A +K MGV + IK + A Sbjct: 34 HDKLKQIIQKVNSSNSTWKAGENTKWINSDIAGVKAHMGVKLGQESG---IKLETVSAQA 90 Query: 442 S-LPENFDPRDXWPD-CPTL 495 + LPE FD R W D C +L Sbjct: 91 NGLPEEFDARVQWGDKCSSL 110 >UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep: Cysteine proteinase - Toxoplasma gondii Length = 569 Score = 89.4 bits (212), Expect = 1e-16 Identities = 40/97 (41%), Positives = 55/97 (56%), Gaps = 6/97 (6%) Frame = +3 Query: 489 NVEXXVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC---PICGLGCS 659 +V VRDQG CGSCWAF + EA DR+C S G + SA+ SCC GC+ Sbjct: 289 DVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKRLMPLSAQHTTSCCNAIHCASFGCN 348 Query: 660 GGMPXLTWEYWKXFGLVSGGSYHS---SXGCRPYEIP 761 GG P + W +++ G+V+GG + + C PYE+P Sbjct: 
349 GGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVP 385 >UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 1 - Rhipicephalus appendiculatus (Brown ear tick) Length = 332 Score = 88.6 bits (210), Expect = 2e-16 Identities = 38/87 (43%), Positives = 53/87 (60%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +RDQ +CGSCWAF A E+++DR+C ++NG + SAEDLL+CC CG GC G + Sbjct: 106 IRDQSACGSCWAFAAAESISDRICIHTNGKVQVNISAEDLLACCHTCGHGCDGRCHCSSV 165 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEIPP 764 + LV + GC+PY +PP Sbjct: 166 AILQGRRLVP-EPVRTEDGCQPYSLPP 191 Score = 45.6 bits (103), Expect = 0.002 Identities = 27/80 (33%), Positives = 38/80 (47%), Gaps = 4/80 (5%) Frame = +1 Query: 268 PLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIKDEHFATLPIKTH----KIDL 435 PLS+E IN IN +WKAGRNF D +H + G +H + D Sbjct: 26 PLSEEMINFINSINTTWKAGRNF--DEKRSHSDCVQGGDGASVLTATSTSSHFTSYEEDS 83 Query: 436 IASLPENFDPRDXWPDCPTL 495 + PE+F PR+ W C ++ Sbjct: 84 RWTCPESFTPREYWSHCSSI 103 >UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8; Trypanosoma|Rep: Cathepsin B-like cysteine protease - Trypanosoma brucei Length = 340 Score = 87.8 bits (208), Expect = 3e-16 Identities = 42/86 (48%), Positives = 50/86 (58%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 + DQ +CGSCWA A AM+DR CT G + H SA DLL+CC CG GC+GG P W Sbjct: 113 IADQSACGSCWAVAAASAMSDRFCTMG-GVQDVHISAGDLLACCSDCGDGCNGGDPDRAW 171 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEIP 761 Y+ GLVS Y C+PY P Sbjct: 172 AYFSSTGLVS--DY-----CQPYPFP 190 Score = 46.4 bits (105), Expect = 0.001 Identities = 33/99 (33%), Positives = 54/99 (54%), Gaps = 6/99 (6%) Frame = +1 Query: 217 AYVTLVCVLAA--AKDLPHPLSDEFINTIN-LKQNSWKAGRN-FPRDTSFAHLKKIMGVI 384 A +V V AA A+D P LS F++ +N L + WKA + ++ + K++ GVI Sbjct: 13 ASTAVVAVNAALVAEDAP-VLSKAFVDRVNRLNRGIWKAKYDGVMQNITLREAKRLNGVI 71 Query: 385 KDEHFATLPIKTH--KIDLIASLPENFDPRDXWPDCPTL 495 K + A++ K + + A LP +FD + WP+CPT+ 
Sbjct: 72 KKNNNASILPKRRFTEEEARAPLPSSFDSAEAWPNCPTI 110 >UniRef50_P91991 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 356 Score = 86.2 bits (204), Expect = 9e-16 Identities = 43/93 (46%), Positives = 54/93 (58%), Gaps = 6/93 (6%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC----PIC--GLGCSGG 665 VRDQ CGS AVE +DR C SNGT ++ SA+D LSCC IC G GC G Sbjct: 111 VRDQSDCGSAAHLVAVEIASDRTCIASNGTFNWPLSAQDPLSCCVGLMSICGDGWGCDGS 170 Query: 666 MPXLTWEYWKXFGLVSGGSYHSSXGCRPYEIPP 764 P ++W+ GL +GG+Y+ GC+PY I P Sbjct: 171 WPKDILKWWQTHGLCTGGNYNDQFGCKPYSIYP 203 Score = 33.5 bits (73), Expect = 7.0 Identities = 18/65 (27%), Positives = 29/65 (44%), Gaps = 1/65 (1%) Frame = +1 Query: 295 INLKQNSWKAGRN-FPRDTSFAHLKKIMGVIKDEHFATLPIKTHKIDLIASLPENFDPRD 471 +N KQ WKA + A K I + ++ + KT +++ +P +FD R Sbjct: 44 VNKKQKLWKAETSRMTFQEKMARAKSIKFIKSNDEVSE---KTGNDNVLVDIPSSFDSRQ 100 Query: 472 XWPDC 486 WP C Sbjct: 101 KWPSC 105 >UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyledons|Rep: At1g02300/T6A9_10 - Arabidopsis thaliana (Mouse-ear cress) Length = 362 Score = 85.8 bits (203), Expect = 1e-15 Identities = 39/83 (46%), Positives = 51/83 (61%), Gaps = 3/83 (3%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCP-ICGLGCSGGMPXLT 680 + DQG CGSCWAFGAVE+++DR C N + S DLL+CC +CG GC+GG P Sbjct: 125 ILDQGHCGSCWAFGAVESLSDRFCIKYN--MNVSLSVNDLLACCGFLCGQGCNGGYPIAA 182 Query: 681 WEYWKXFGLVSG--GSYHSSXGC 743 W Y+K G+V+ Y + GC Sbjct: 183 WRYFKHHGVVTEECDPYFDNTGC 205 Score = 40.7 bits (91), Expect = 0.046 Identities = 25/79 (31%), Positives = 39/79 (49%), Gaps = 4/79 (5%) Frame = +1 Query: 271 LSDEFINTINLKQNS-WKAGRNFP-RDTSFAHLKKIMGV--IKDEHFATLPIKTHKIDLI 438 L +E + +N N+ WKA N + + A K+++GV F +PI +H I L Sbjct: 46 LQNEIVKEVNENPNAGWKASFNDRFANATVAEFKRLLGVKPTPKTEFLGVPIVSHDISL- 104 Query: 439 ASLPENFDPRDXWPDCPTL 495 LP+ FD R W C ++ Sbjct: 105 -KLPKEFDARTAWSQCTSI 122 >UniRef50_Q625X3 Cluster: 
Putative uncharacterized protein CBG01102; n=1; Caenorhabditis briggsae|Rep: Putative uncharacterized protein CBG01102 - Caenorhabditis briggsae Length = 374 Score = 54.8 bits (126), Expect(2) = 2e-15 Identities = 23/50 (46%), Positives = 28/50 (56%), Gaps = 2/50 (4%) Frame = +3 Query: 654 CSGGMPXLTWEYWKXFGLVSGGSYHSSXGCRPYEIPPX*T--SRTRAPGC 797 C+GG W+YW+ GL +GGSY S GC+PY I P T PGC Sbjct: 189 CAGGNVFKAWQYWQKHGLPTGGSYESQFGCKPYSISPCDTVIGNITFPGC 238 Score = 50.8 bits (116), Expect(2) = 2e-15 Identities = 25/55 (45%), Positives = 32/55 (58%), Gaps = 3/55 (5%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCP---ICGLGCS 659 + D C S WAF A E+M+DR+C S G + SA++LLSCC CG G S Sbjct: 100 INDISDCKSSWAFSAAESMSDRLCINSGGMINTVLSAQELLSCCTGVFSCGEGDS 154 >UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 421 Score = 82.2 bits (194), Expect = 2e-14 Identities = 39/83 (46%), Positives = 47/83 (56%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V +QG CGSC+A A +DR C +SNGT S ED++ CC +CG C GG P Sbjct: 157 VPNQGGCGSCFAVAAAGVASDRACIHSNGTFKSLLSEEDIIGCCSVCG-NCYGGDPLKAL 215 Query: 684 EYWKXFGLVSGGSYHSSXGCRPY 752 YW GLV+GG GCRPY Sbjct: 216 TYWVNQGLVTGG----RDGCRPY 234 >UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: Cathepsin B - Triticum aestivum (Wheat) Length = 353 Score = 81.4 bits (192), Expect = 3e-14 Identities = 37/84 (44%), Positives = 49/84 (58%), Gaps = 3/84 (3%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCP-ICGLGCSGGMPXLT 680 + DQG CG+CWAF AVEA+ DR C + N S DLL+CC +CG GC+GG P Sbjct: 116 ILDQGHCGACWAFAAVEALQDRFCIHLN--MSVSLSVNDLLACCGFLCGSGCNGGYPISA 173 Query: 681 WEYWKXFGLVSG--GSYHSSXGCR 746 W Y++ G+V+ Y GC+ Sbjct: 174 WRYFRRSGVVTEECDPYFDQTGCQ 197 Score = 36.7 bits (81), Expect = 0.75 Identities = 25/79 (31%), Positives = 36/79 (45%), Gaps = 4/79 (5%) Frame = +1 Query: 271 
LSDEFINTINLKQNS-WKAGRN-FPRDTSFAHLKKIMGVIKDEH--FATLPIKTHKIDLI 438 + + I T+N N+ W AG N + + + K I+GV A +PIK H Sbjct: 38 IQKDIIQTVNKHPNAGWTAGHNPYFANYTIEQFKHILGVKPTPPGLLAGVPIKIHPE--- 94 Query: 439 ASLPENFDPRDXWPDCPTL 495 LP+ FD R W C T+ Sbjct: 95 MDLPKEFDARTQWSSCSTI 113 >UniRef50_Q5DEC9 Cluster: SJCHGC06356 protein; n=1; Schistosoma japonicum|Rep: SJCHGC06356 protein - Schistosoma japonicum (Blood fluke) Length = 279 Score = 73.3 bits (172), Expect = 7e-12 Identities = 32/86 (37%), Positives = 46/86 (53%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 + D+ C + WA V++++DR+C SNG SA D +SC GC G Sbjct: 47 IHDESLCRADWAIATVDSISDRICIRSNGRISVQLSARDAISCG--FSPGCFHGSEVEVL 104 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEIP 761 YW +G+V+GGSY GC+PY +P Sbjct: 105 VYWITYGIVTGGSYEDQSGCQPYPLP 130 >UniRef50_Q00W63 Cluster: Cysteine proteinase; n=2; Ostreococcus|Rep: Cysteine proteinase - Ostreococcus tauri Length = 362 Score = 69.7 bits (163), Expect = 9e-11 Identities = 40/98 (40%), Positives = 48/98 (48%), Gaps = 13/98 (13%) Frame = +3 Query: 510 DQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCP-----------ICG--L 650 DQG+CGSCWA +AMTDR+C +NG + H SA LLSC + G Sbjct: 110 DQGACGSCWAVAPAKAMTDRLCIATNGAVNTHVSAIQLLSCNSHSNSAYTYDENLAGGSG 169 Query: 651 GCSGGMPXLTWEYWKXFGLVSGGSYHSSXGCRPYEIPP 764 GC GG P +E G+VSGG C PY P Sbjct: 170 GCMGGYPTEAYETAHRVGVVSGGLNGDQDTCMPYPFAP 207 >UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 311 Score = 68.1 bits (159), Expect = 3e-10 Identities = 31/74 (41%), Positives = 41/74 (55%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +R+QG CGSCWAFGA E ++DR S + SA+ L+ C + GCSGG P W Sbjct: 100 IRNQGQCGSCWAFGASEVLSDRFAIASKNQIYVTLSAQQLVD-CDLDNSGCSGGWPINAW 158 Query: 684 EYWKXFGLVSGGSY 725 Y GL++ Y Sbjct: 159 NYMVKTGLLTEQCY 172 >UniRef50_O02470 Cluster: Cysteine proteinase; n=2; Chromadorea|Rep: Cysteine proteinase - Globodera pallida 
Length = 53 Score = 67.3 bits (157), Expect = 5e-10 Identities = 28/52 (53%), Positives = 34/52 (65%), Gaps = 1/52 (1%) Frame = +3 Query: 513 QGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPI-CGLGCSGG 665 QG CG CWAF E ++DR C SNGT+ S DLL+CC + CG GC+GG Sbjct: 1 QGQCGRCWAFSTAEVISDRTCIASNGTQQPIISPTDLLTCCGMSCGEGCNGG 52 >UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 314 Score = 65.7 bits (153), Expect = 1e-09 Identities = 30/69 (43%), Positives = 41/69 (59%), Gaps = 1/69 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNG-TKHFHFSAEDLLSCCPICGLGCSGGMPXLT 680 + +Q CGSCWAF + E ++DR+C SN T S + L++C GCSGG+P L Sbjct: 105 ILNQEQCGSCWAFSSSEVLSDRLCIASNNKTNPGALSPQTLVACDVYGNDGCSGGIPQLA 164 Query: 681 WEYWKXFGL 707 WEY + GL Sbjct: 165 WEYMELKGL 173 Score = 44.0 bits (99), Expect = 0.005 Identities = 33/91 (36%), Positives = 46/91 (50%), Gaps = 2/91 (2%) Frame = +1 Query: 220 YVTLVCVLAAAKDLPHPLSDEFINTINL-KQNSWKAGRNFPRD-TSFAHLKKIMGVIKDE 393 Y VC L + D P L D IN+IN K++SW A RN + +F + +MG K Sbjct: 15 YFASVC-LGSFLDKP-VLDDNLINSINNNKKSSWTAHRNKNFEGKTFGDIIGMMGTKKTA 72 Query: 394 HFATLPIKTHKIDLIASLPENFDPRDXWPDC 486 A + + +L S+P +FD R WPDC Sbjct: 73 --APFKLTENGEELKGSIPTSFDSRVQWPDC 101 >UniRef50_A4RYN2 Cluster: Predicted protein; n=1; Ostreococcus lucimarinus CCE9901|Rep: Predicted protein - Ostreococcus lucimarinus CCE9901 Length = 330 Score = 65.3 bits (152), Expect = 2e-09 Identities = 40/103 (38%), Positives = 47/103 (45%), Gaps = 4/103 (3%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 VRDQG CGSCWA A E M DR+C ++G S + LSC G GC GG T Sbjct: 132 VRDQGRCGSCWAVAATEVMNDRLCVATDGENADELSPQYALSCFD-SGSGCDGGDVLDTL 190 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEIP----PX*TSRTRAPGCP 800 G+ GG S+ C PYE P + T CP Sbjct: 191 RIAFTKGIPYGGMLDSN-ACLPYEFEACDHPCMVAGTTPQSCP 232 >UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomuscorum|Rep: Cathepsin B 
- Oxytricha trifallax (Sterkiella histriomuscorum) Length = 294 Score = 65.3 bits (152), Expect = 2e-09 Identities = 35/79 (44%), Positives = 42/79 (53%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +RDQ CGSCWAFGA EA +DR NG K S EDL+S C GC+GG + W Sbjct: 93 IRDQQQCGSCWAFGATEAFSDRFAI--NG-KDVILSPEDLVS-CDTNDYGCNGGYMDVAW 148 Query: 684 EYWKXFGLVSGGSYHSSXG 740 EY G + + S G Sbjct: 149 EYLADHGAATDSCFPYSAG 167 >UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1; Ostreococcus tauri|Rep: Cysteine proteinase Cathepsin F - Ostreococcus tauri Length = 498 Score = 64.9 bits (151), Expect = 2e-09 Identities = 36/87 (41%), Positives = 41/87 (47%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 VRDQG CGSCWA A E M DR+C S G + S + LSC G GC GG T Sbjct: 277 VRDQGKCGSCWAVAATEIMNDRLCISSGGKEVAELSPQFALSCYN-SGAGCEGGDVVDTL 335 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEIPP 764 G+ GG C PY+ P Sbjct: 336 TLALAKGVPHGGML-DKGACLPYQFEP 361 >UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|Rep: Cathepsin B - Streblomastix strix Length = 312 Score = 59.7 bits (138), Expect = 9e-08 Identities = 32/98 (32%), Positives = 47/98 (47%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 + DQG CGSCWA + E + DR C S G + S + L SC P C GC+GG + Sbjct: 95 IYDQGHCGSCWAMSSFEVLQDRFCIKSEGKQTPELSPQHLTSCTPGCS-GCNGGWMSTAF 153 Query: 684 EYWKXFGLVSGGSYHSSXGCRPYEIPPX*TSRTRAPGC 797 + + G++ C PY++ + + PGC Sbjct: 154 GFMQSNGIL-------GEDCIPYQM-----GKCKHPGC 179 >UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|Rep: Cysteine protease - Giardia muris Length = 301 Score = 59.7 bits (138), Expect = 9e-08 Identities = 28/68 (41%), Positives = 37/68 (54%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V DQ SCGSCWAF AV DR C Y +K H+S + ++SC G C+GG W Sbjct: 94 VADQASCGSCWAFSAVATFADRRCAYGLDSKQVHYSEQYVVSCDFGDG-ACNGGWLSNVW 152 Query: 684 EYWKXFGL 707 ++ G+ 
Sbjct: 153 KFLTKTGV 160 >UniRef50_UPI0000E4866E Cluster: PREDICTED: similar to oxidized-LDL responsive gene 2, partial; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to oxidized-LDL responsive gene 2, partial - Strongylocentrotus purpuratus Length = 363 Score = 59.3 bits (137), Expect = 1e-07 Identities = 29/75 (38%), Positives = 40/75 (53%), Gaps = 1/75 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGT-KHFHFSAEDLLSCCPICGLGCSGGMPXLT 680 V++QG+C S WA +DR+ SNGT K+ H S + LLSC GC+GG Sbjct: 239 VQNQGNCASSWAMSTAATASDRLAIQSNGTFKYMHLSPQHLLSCNVKRQQGCAGGHLDRA 298 Query: 681 WEYWKXFGLVSGGSY 725 W Y + G+V+ Y Sbjct: 299 WWYMRKRGIVTEDCY 313 >UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; n=35; Fasciola|Rep: Cathepsin L-like proteinase precursor - Fasciola hepatica (Liver fluke) Length = 326 Score = 58.4 bits (135), Expect = 2e-07 Identities = 30/75 (40%), Positives = 39/75 (52%), Gaps = 1/75 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPXLT 680 V+DQG+CGSCWAF M + N FS + L+ C P GCSGG+ Sbjct: 123 VKDQGNCGSCWAFSTTGTMEGQY--MKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENA 180 Query: 681 WEYWKXFGLVSGGSY 725 ++Y K FGL + SY Sbjct: 181 YQYLKQFGLETESSY 195 >UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|Rep: Cathepsin B - Streblomastix strix Length = 283 Score = 58.0 bits (134), Expect = 3e-07 Identities = 29/70 (41%), Positives = 38/70 (54%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 VRDQG CGSCWAF E + DR+ G + EDL+S C I GC GG + W Sbjct: 80 VRDQGECGSCWAFSIAETIGDRLGVL--GCSRGDIAPEDLVS-CDIFDDGCDGGFIDMAW 136 Query: 684 EYWKXFGLVS 713 ++ + GL + Sbjct: 137 DWCQENGLTT 146 >UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1; Uronema marinum|Rep: Cathepsin L-like cysteine protease - Uronema marinum Length = 333 Score = 57.6 bits (133), Expect = 4e-07 Identities = 32/74 (43%), Positives = 42/74 (56%) Frame = +3 Query: 504 
VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V++QG CGSCWAF AV ++ +R+ + G K FS + L+SC P GC GG P + Sbjct: 135 VQNQGVCGSCWAFSAVCSL-ERLYKINTG-KLLSFSEQQLVSCEP-KSYGCDGGWPEAAF 191 Query: 684 EYWKXFGLVSGGSY 725 Y GL S SY Sbjct: 192 AYSATHGLESSASY 205 >UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to GM06507p - Nasonia vitripennis Length = 483 Score = 56.8 bits (131), Expect = 7e-07 Identities = 26/74 (35%), Positives = 36/74 (48%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V+DQG CG+ WA V+ +DR S G + S + L+SC GC GG W Sbjct: 253 VQDQGWCGASWAISTVDVASDRFAIMSKGIEKVQLSGQHLISCNNRGQRGCKGGYLDRAW 312 Query: 684 EYWKXFGLVSGGSY 725 + + FG+V Y Sbjct: 313 LFMRKFGVVDEDCY 326 >UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 234 Score = 56.8 bits (131), Expect = 7e-07 Identities = 29/66 (43%), Positives = 39/66 (59%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 ++DQ CGSCWAFG+ AM + +GT + S + L+ CC C LGC G +P L + Sbjct: 33 IKDQKHCGSCWAFGSCAAM-ESSWFLKHGTL-YSLSEQCLVDCCHDC-LGCHGCLPSLAF 89 Query: 684 EYWKXF 701 EY K F Sbjct: 90 EYVKIF 95 >UniRef50_Q23FR0 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 394 Score = 56.0 bits (129), Expect = 1e-06 Identities = 39/123 (31%), Positives = 55/123 (44%), Gaps = 3/123 (2%) Frame = +3 Query: 366 ENNGSYKGRTFCDPANKDS*NRFNRQSTGKLRSQRXMA*LSNVEXXVRDQGSCGSCWAFG 545 +NN D N ++ N+ N +T + + + NV V+DQG CGSCW FG Sbjct: 155 DNNNDDNNNNNNDNNNNNNNNQNNTNTT--VAASVDWRNVKNVLNPVKDQGQCGSCWTFG 212 Query: 546 AVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGL---GCSGGMPXLTWEYWKXFGLVSG 716 A M + +NG FS + L+ C G GC+GG EY FG+V+ 
Sbjct: 213 AAGVM-ESFNAITNGVLK-SFSEQQLVDCVHQAGFSSDGCNGGFQSDGVEYAIKFGIVTE 270 Query: 717 GSY 725 Y Sbjct: 271 DKY 273 >UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Giardia intestinalis|Rep: Cathepsin B-like CP1 precursor - Giardia lamblia (Giardia intestinalis) Length = 303 Score = 55.6 bits (128), Expect = 2e-06 Identities = 25/60 (41%), Positives = 32/60 (53%) Frame = +3 Query: 510 DQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTWEY 689 DQGSCGSCWAF A+ DR C + +S + L+S C + GC GG TW + Sbjct: 98 DQGSCGSCWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLIS-CSLENFGCDGGDFQPTWSF 156 >UniRef50_Q6XM39 Cluster: FirrV-1-A48 precursor; n=1; Feldmannia irregularis virus a|Rep: FirrV-1-A48 precursor - Feldmannia irregularis virus a Length = 373 Score = 54.8 bits (126), Expect = 3e-06 Identities = 25/62 (40%), Positives = 36/62 (58%), Gaps = 2/62 (3%) Frame = +3 Query: 510 DQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCP-ICGLGCS-GGMPXLTW 683 DQGSC SCW+ V+ + DRV +NG S ++++SC GL CS GG+P + Sbjct: 80 DQGSCASCWSISVVQMLADRVSVSTNGKIKLKLSVQEMISCWDGHDGLACSKGGVPEKAY 139 Query: 684 EY 689 +Y Sbjct: 140 QY 141 >UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma japonicum|Rep: SJCHGC02853 protein - Schistosoma japonicum (Blood fluke) Length = 181 Score = 54.8 bits (126), Expect = 3e-06 Identities = 20/27 (74%), Positives = 25/27 (92%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYS 584 +RDQ SCGSCWAFGAVE+M+DR+C +S Sbjct: 101 IRDQSSCGSCWAFGAVESMSDRICIHS 127 Score = 54.4 bits (125), Expect = 4e-06 Identities = 34/80 (42%), Positives = 43/80 (53%), Gaps = 4/80 (5%) Frame = +1 Query: 268 PLSDEFINTINLKQN-SWKAGRNFPRDTSFAHLKKIMGVI---KDEHFATLPIKTHKIDL 435 PLSDE I IN + N WKA R R TS H K +MGV+ D+H PI H D+ Sbjct: 21 PLSDELITFINKQPNIEWKADRT-KRFTSIHHAKSMMGVLLNSVDQHKLHHPI-IHHNDI 78 Query: 436 IASLPENFDPRDXWPDCPTL 495 LP+ FD R W +C ++ Sbjct: 79 NIKLPKYFDSRKYWKNCSSI 98 >UniRef50_Q9UJW2 Cluster: Tubulointerstitial nephritis antigen; n=20; Amniota|Rep: Tubulointerstitial nephritis antigen - Homo 
sapiens (Human) Length = 476 Score = 54.4 bits (125), Expect = 4e-06 Identities = 25/72 (34%), Positives = 34/72 (47%) Frame = +3 Query: 510 DQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTWEY 689 DQ +C + WAF DR+ S G + S ++L+SCC GC+ G W Y Sbjct: 236 DQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRAWWY 295 Query: 690 WKXFGLVSGGSY 725 + GLVS Y Sbjct: 296 LRKRGLVSHACY 307 >UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin L-like proteinase" precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 315 Score = 54.0 bits (124), Expect = 5e-06 Identities = 25/74 (33%), Positives = 38/74 (51%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V+DQG CGSCWAF ++ ++ + N + S ++L+ C GC+GG+ + Sbjct: 125 VKDQGQCGSCWAFSTTGSLEGQLAIHKN--QRVPLSEQELVDCDTSRNAGCNGGLMTDAF 182 Query: 684 EYWKXFGLVSGGSY 725 Y K GL S Y Sbjct: 183 NYVKRHGLSSESQY 196 >UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium tetraurelia|Rep: Cathepsin L1 precursor - Paramecium tetraurelia Length = 314 Score = 54.0 bits (124), Expect = 5e-06 Identities = 29/75 (38%), Positives = 39/75 (52%), Gaps = 1/75 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPXLT 680 V++QGSCGSCWAF AV A+ + T + + S +DL+ C P GC+GG Sbjct: 126 VKNQGSCGSCWAFSAVGAL--EINTDIELNRKYELSEQDLVDCSGPYDNDGCNGGWMDSA 183 Query: 681 WEYWKXFGLVSGGSY 725 +EY GL Y Sbjct: 184 FEYVADNGLAEAKDY 198 >UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin C; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin C - Strongylocentrotus purpuratus Length = 482 Score = 53.6 bits (123), Expect = 6e-06 Identities = 28/75 (37%), Positives = 38/75 (50%), Gaps = 1/75 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXL-T 680 VRDQG CGSC+AF + R+ +N S ++++SC GC GG P L Sbjct: 267 VRDQGICGSCYAFASTATQESRLRVMTNNNVKVVMSPQEVVSCSEY-AQGCEGGFPYLIA 
325 Query: 681 WEYWKXFGLVSGGSY 725 +Y + FGLV Y Sbjct: 326 GKYGQDFGLVDETCY 340 >UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA, isoform A; n=2; Endopterygota|Rep: PREDICTED: similar to CG3074-PA, isoform A - Tribolium castaneum Length = 445 Score = 53.2 bits (122), Expect = 8e-06 Identities = 26/69 (37%), Positives = 33/69 (47%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 ++DQG CGS WA +DR S G + SA+ LLSC C+GG W Sbjct: 214 IQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQHLLSCDRRGQQSCNGGYLDRAW 273 Query: 684 EYWKXFGLV 710 Y + GLV Sbjct: 274 SYIRKIGLV 282 >UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestinalis|Rep: GLP_41_8294_9919 - Giardia lamblia ATCC 50803 Length = 541 Score = 53.2 bits (122), Expect = 8e-06 Identities = 30/80 (37%), Positives = 42/80 (52%), Gaps = 5/80 (6%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSN-----GTKHFHFSAEDLLSCCPICGLGCSGGM 668 V DQG+CGSC+ FGAV+AM R+ +N GTK S E L C + GC GG Sbjct: 259 VLDQGACGSCFTFGAVQAMNSRIMIATNRTDPVGTKTI-LSTEHALD-CNVYSQGCDGGF 316 Query: 669 PXLTWEYWKXFGLVSGGSYH 728 P + + G+++ Y+ Sbjct: 317 PEHVLRFAETNGIMTEDDYY 336 >UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcoptes scabiei type hominis|Rep: Sar s 1 allergen Yv9053H09 - Sarcoptes scabiei type hominis Length = 253 Score = 53.2 bits (122), Expect = 8e-06 Identities = 28/78 (35%), Positives = 41/78 (52%), Gaps = 4/78 (5%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAV----EAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMP 671 +R+QG CG+CWAF A+ A R N T+ HFS ++L+ C P GCSG + Sbjct: 52 IRNQGRCGACWAFAALASVESAYNRRTRIVHNRTRKHHFSEQELVDCSPNTE-GCSGNII 110 Query: 672 XLTWEYWKXFGLVSGGSY 725 +Y + G+V +Y Sbjct: 111 SNGLKYVQLRGVVKSANY 128 >UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; Theileria|Rep: Cysteine proteinase precursor - Theileria parva Length = 440 Score = 53.2 bits (122), Expect = 8e-06 Identities = 25/70 (35%), Positives = 39/70 (55%) Frame = +3 Query: 504 
VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V+DQ +CG CWAF V ++ ++ + K + S ++LL C GC GG+ + Sbjct: 244 VKDQSNCGGCWAFSTVGSVEGYYMSHFD--KSYELSVQELLDCDSFSN-GCQGGLLESAY 300 Query: 684 EYWKXFGLVS 713 EY + +GLVS Sbjct: 301 EYVRKYGLVS 310 >UniRef50_UPI0000E815AE Cluster: PREDICTED: similar to glucocorticoid-inducible protein; n=1; Gallus gallus|Rep: PREDICTED: similar to glucocorticoid-inducible protein - Gallus gallus Length = 307 Score = 52.8 bits (121), Expect = 1e-05 Identities = 32/96 (33%), Positives = 44/96 (45%), Gaps = 1/96 (1%) Frame = +3 Query: 510 DQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTWEY 689 DQG+C WAF +DR+ +S G S ++LLSC GCSGG W Y Sbjct: 172 DQGNCAGSWAFSTAAVASDRISIHSMGHMTPSLSPQNLLSCDTRNQRGCSGGRLDGAWWY 231 Query: 690 WKXFGLVSGGSY-HSSXGCRPYEIPPX*TSRTRAPG 794 + G+V+ Y +S +P P SR+ G Sbjct: 232 LRRRGVVTDECYPFTSQDSQPAAQPCMMHSRSTGRG 267 >UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin O precursor; n=2; Apocrita|Rep: PREDICTED: similar to Cathepsin O precursor - Apis mellifera Length = 374 Score = 52.8 bits (121), Expect = 1e-05 Identities = 22/54 (40%), Positives = 31/54 (57%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGG 665 VR QGSCG+CWAF +E + + + NGT H S ++++ C GC GG Sbjct: 170 VRSQGSCGACWAFSTIEVI-ESMFAIKNGTLH-SLSVQEMIDCAKNSNFGCEGG 221 >UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae|Rep: Cysteine proteinase - Hypera postica (alfalfa weevil) Length = 324 Score = 52.8 bits (121), Expect = 1e-05 Identities = 28/74 (37%), Positives = 37/74 (50%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V+DQG CGSCWAF ++ T+ +G K S + L+ CC GC GG + Sbjct: 127 VKDQGDCGSCWAF-SITGSTEGAYARKSG-KLVSLSEQQLIDCCTDTSAGCDGGSLDDNF 184 Query: 684 EYWKXFGLVSGGSY 725 +Y GL S SY Sbjct: 185 KYVMKDGLQSEESY 198 >UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like protein F26E4.3; n=2; Caenorhabditis|Rep: Uncharacterized peptidase C1-like protein F26E4.3 - 
Caenorhabditis elegans Length = 491 Score = 52.8 bits (121), Expect = 1e-05 Identities = 26/74 (35%), Positives = 34/74 (45%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V DQG CGS W+ +DR+ S G + S++ LLSC GC GG W Sbjct: 240 VADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQLLSCNQHRQKGCEGGYLDRAW 299 Query: 684 EYWKXFGLVSGGSY 725 Y + G+V Y Sbjct: 300 WYIRKLGVVGDHCY 313 >UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: hypothetical protein - Strongylocentrotus purpuratus Length = 450 Score = 52.4 bits (120), Expect = 1e-05 Identities = 28/74 (37%), Positives = 34/74 (45%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V DQG CGS WA +DR+ S G + S + LLSC GCSGG W Sbjct: 214 VIDQGKCGSSWAISTASVASDRLAIQSMGEINPRLSEQHLLSCNIRGQRGCSGGYLDRAW 273 Query: 684 EYWKXFGLVSGGSY 725 + + G VS Y Sbjct: 274 YHLRRAGAVSRACY 287 >UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadidae|Rep: Cysteine protease - Tritrichomonas foetus (Trichomonas foetus) Length = 315 Score = 52.4 bits (120), Expect = 1e-05 Identities = 24/62 (38%), Positives = 37/62 (59%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 ++DQ +CGSCWAF A++A + S GT +S ++L+ C C GCSGG+ + Sbjct: 115 IKDQAACGSCWAFSAIQA-AESAYAISTGTLE-SYSEQNLVDCVQGC-YGCSGGLMDYAY 171 Query: 684 EY 689 +Y Sbjct: 172 KY 173 >UniRef50_Q24FN2 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 367 Score = 52.4 bits (120), Expect = 1e-05 Identities = 29/78 (37%), Positives = 42/78 (53%), Gaps = 4/78 (5%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC----PICGLGCSGGMP 671 V++QGSCGSCWAF AV A+ + V N + +S ++L+ C GC GG P Sbjct: 170 VKNQGSCGSCWAFSAV-ALAESVNLLRNNSLAL-YSEQELVDCTYKNPQYYNYGCQGGWP 227 Query: 672 XLTWEYWKXFGLVSGGSY 725 
+ + Y K G+ S +Y Sbjct: 228 SVAYRYIKDQGISSQQNY 245 >UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 395 Score = 52.0 bits (119), Expect = 2e-05 Identities = 27/83 (32%), Positives = 41/83 (49%), Gaps = 3/83 (3%) Frame = +3 Query: 486 SNVEXXVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKH---FHFSAEDLLSCCPICGLGC 656 S+ + VRDQG C SCW FG++ A+ R NG H SA++ ++C GC Sbjct: 195 SDYQTPVRDQGECKSCWVFGSLAALESRY-LIKNGVSEKSTLHLSAQNAMNCIT---SGC 250 Query: 657 SGGMPXLTWEYWKXFGLVSGGSY 725 G P ++Y++ G+ Y Sbjct: 251 ESGWPANVFDYFESSGIAFEKDY 273 >UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MGC107932 protein - Xenopus tropicalis (Western clawed frog) (Silurana tropicalis) Length = 333 Score = 51.6 bits (118), Expect = 2e-05 Identities = 30/84 (35%), Positives = 40/84 (47%), Gaps = 1/84 (1%) Frame = +3 Query: 486 SNVEXXVRDQGS-CGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSG 662 SN V++QG+ CGSCWAF V M R C + + + S + L+ C I GC G Sbjct: 124 SNCVTPVKNQGTFCGSCWAFATVGVMESRYCIRTK--ELLNLSEQQLVDCDEI-NEGCCG 180 Query: 663 GMPXLTWEYWKXFGLVSGGSYHSS 734 G P EY G++ Y S Sbjct: 181 GFPIKALEYVAQHGVMRNKEYEYS 204 >UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheirus salmonis|Rep: Cysteine proteinase - Lepeophtheirus salmonis (salmon louse) Length = 372 Score = 51.6 bits (118), Expect = 2e-05 Identities = 28/84 (33%), Positives = 42/84 (50%), Gaps = 5/84 (5%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCP---ICG--LGCSGGM 668 V++QGSCGSCW F AVE + V +N T S + + SC CG GC G + Sbjct: 130 VKNQGSCGSCWVFSAVEQIESYVAIENNMTSPPLLSTQQITSCSSNPYSCGGSGGCKGAI 189 Query: 669 PXLTWEYWKXFGLVSGGSYHSSXG 740 + + Y + +G+ + Y + G Sbjct: 190 NEIAYMYTQLYGIETEKEYPYTSG 213 >UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 306 Score = 
51.6 bits (118), Expect = 2e-05 Identities = 23/62 (37%), Positives = 35/62 (56%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +++QG+CGSCWAF A++ + +V N + + S ++LL C C GC GG Sbjct: 103 IKNQGACGSCWAFSAIQVIESQVA--KNQKQLYDLSEQNLLDCVTSC-FGCGGGWSPGAL 159 Query: 684 EY 689 EY Sbjct: 160 EY 161 >UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_56, whole genome shotgun sequence - Paramecium tetraurelia Length = 314 Score = 51.2 bits (117), Expect = 3e-05 Identities = 26/74 (35%), Positives = 38/74 (51%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V++QG+CGSCWAF AV A+ + +K S + L+ C GC+GG L Sbjct: 125 VKNQGNCGSCWAFSAVGAVETLLTIKGVISKDLWLSEQQLVDCDKGTNNGCNGGFENLGI 184 Query: 684 EYWKXFGLVSGGSY 725 ++ K GL + Y Sbjct: 185 QWAKKNGLTTDKQY 198 >UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypanosoma cruzi|Rep: Cysteine protease, putative - Trypanosoma cruzi Length = 434 Score = 50.8 bits (116), Expect = 4e-05 Identities = 29/67 (43%), Positives = 37/67 (55%), Gaps = 5/67 (7%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC---CPICG--LGCSGGM 668 V+DQGSCGSCWA A E++ + + S+G K S + + SC CG GC GG Sbjct: 142 VKDQGSCGSCWAHAATESV-ESMYAISSG-KLLTLSTQQITSCVNNTRKCGGSGGCGGGT 199 Query: 669 PXLTWEY 689 L WEY Sbjct: 200 AQLAWEY 206 >UniRef50_Q24E33 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 328 Score = 50.8 bits (116), Expect = 4e-05 Identities = 25/75 (33%), Positives = 39/75 (52%), Gaps = 1/75 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPI-CGLGCSGGMPXLT 680 V++QGSCGSCWAF A+ +N + FS + L+ C + +GC+GG+ Sbjct: 142 VKNQGSCGSCWAFSTTGALEGSYFLKNN--QLISFSEQQLVDCSRLYLNMGCNGGLMPRA 199 Query: 681 WEYWKXFGLVSGGSY 
725 + Y K G+ + Y Sbjct: 200 FRYVKAHGITTEEEY 214 >UniRef50_Q23TW3 Cluster: Papain family cysteine protease containing protein; n=5; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 437 Score = 50.8 bits (116), Expect = 4e-05 Identities = 27/64 (42%), Positives = 33/64 (51%), Gaps = 2/64 (3%) Frame = +3 Query: 504 VRDQGS-CGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPXL 677 V+ QG CGSCWAF AV A+ G K FS + L+ C GCSGG+P Sbjct: 220 VKSQGKDCGSCWAFAAVAALESHY-ALKTGKKPIQFSEQQLVDCARKFDTKGCSGGLPSK 278 Query: 678 TWEY 689 +EY Sbjct: 279 GFEY 282 >UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; Theileria|Rep: Cysteine proteinase precursor - Theileria annulata Length = 441 Score = 50.8 bits (116), Expect = 4e-05 Identities = 23/63 (36%), Positives = 37/63 (58%), Gaps = 1/63 (1%) Frame = +3 Query: 504 VRDQGS-CGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLT 680 ++DQG CGSCWAF ++ ++ Y N K + S ++L++ C +GC+GG+P Sbjct: 242 IKDQGDHCGSCWAFSSIASVESLYRLYKN--KSYFLSEQELVN-CDKSSMGCAGGLPITA 298 Query: 681 WEY 689 EY Sbjct: 299 LEY 301 >UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 280 Score = 50.4 bits (115), Expect = 6e-05 Identities = 27/77 (35%), Positives = 39/77 (50%), Gaps = 3/77 (3%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCP---ICGLGCSGGMPX 674 V++QG+CGSCWAF + + + + N T +S ++LL C GC GG P Sbjct: 83 VKNQGNCGSCWAF-TITGLFESINLIRNKTVEL-YSEQELLDCSSNGIYRNSGCQGGWPH 140 Query: 675 LTWEYWKXFGLVSGGSY 725 L +EY K G+ Y Sbjct: 141 LAFEYSKKNGISLSSQY 157 >UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - Drosophila melanogaster (Fruit fly) Length = 431 Score = 50.4 bits (115), Expect = 6e-05 Identities = 25/74 (33%), Positives = 33/74 (44%) Frame = +3 Query: 504 
VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V DQG CG+ W +DR S G ++ SA+++LSC GC GG W Sbjct: 204 VPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNILSCTR-RQQGCEGGHLDAAW 262 Query: 684 EYWKXFGLVSGGSY 725 Y G+V Y Sbjct: 263 RYLHKKGVVDENCY 276 >UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: Vivapain-4 - Plasmodium vivax Length = 484 Score = 50.4 bits (115), Expect = 6e-05 Identities = 27/87 (31%), Positives = 42/87 (48%) Frame = +3 Query: 489 NVEXXVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGM 668 N +++Q CGSCWAFGAV A+ + N +H S ++L+ C GC GG+ Sbjct: 272 NAVSEIKNQNLCGSCWAFGAVGAVESQYAIRKN--QHVLISEQELVDCSD-KNFGCFGGL 328 Query: 669 PXLTWEYWKXFGLVSGGSYHSSXGCRP 749 L ++ G + S + G +P Sbjct: 329 ASLAFDDMIDLGYLCSESDYPYVGFKP 355 >UniRef50_Q23H32 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 365 Score = 50.4 bits (115), Expect = 6e-05 Identities = 23/56 (41%), Positives = 30/56 (53%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMP 671 V+ QG CGSCWAF V A+ + FS ++L+ CC I GC+GG P Sbjct: 149 VQKQGGCGSCWAFSTVIALEGAYAKQTGNV--IKFSEQNLIDCCRIENNGCNGGDP 202 >UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus|Rep: Cathepsin L - Aphrocallistes vastus Length = 329 Score = 50.4 bits (115), Expect = 6e-05 Identities = 23/65 (35%), Positives = 36/65 (55%), Gaps = 1/65 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGGMPXLT 680 V++QG CGSCW+F A ++ + S K FS ++L+ C G GC GG+ Sbjct: 130 VKNQGQCGSCWSFSATGSLEGQYAIKSG--KLVSFSEQELVDCSTSLGNHGCQGGLMDYA 187 Query: 681 WEYWK 695 ++YW+ Sbjct: 188 FKYWE 192 >UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain - Tetrahymena pyriformis Length = 330 Score = 50.4 bits (115), Expect = 6e-05 Identities = 26/82 (31%), Positives = 36/82 (43%), Gaps = 3/82 (3%) Frame = +3 Query: 489 
NVEXXVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGL---GCS 659 NV V++Q CGSCWAF + + + FS + L+ CC G GC+ Sbjct: 130 NVLPPVKNQQQCGSCWAFSTAGMLEGVYNIHESPQTPISFSEQQLVDCCGAQGFGCEGCN 189 Query: 660 GGMPXLTWEYWKXFGLVSGGSY 725 G P Y + FG+V Y Sbjct: 190 GAWPTDAVAYTQKFGIVQESQY 211 >UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag-RP - Bombyx mori (Silk moth) Length = 404 Score = 50.4 bits (115), Expect = 6e-05 Identities = 26/70 (37%), Positives = 37/70 (52%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 + DQ CGS WA + DR S GT++ S++ LLSC GC+GG + + Sbjct: 202 IADQDWCGSDWAVSIASIVGDRFSIQSFGTENVRMSSQTLLSCHLKGQRGCNGGNLDIAF 261 Query: 684 EYWKXFGLVS 713 ++ K GLVS Sbjct: 262 DFVKTHGLVS 271 >UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2; cellular organisms|Rep: Cysteine proteinase, putative - Archaeoglobus fulgidus Length = 1088 Score = 50.4 bits (115), Expect = 6e-05 Identities = 23/50 (46%), Positives = 28/50 (56%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLG 653 VRDQGSCGSCWA AV A+ + S + S + LLSC C +G Sbjct: 609 VRDQGSCGSCWAHSAVAALESALIVESGASSSIDLSEQHLLSCEQDCEVG 658 >UniRef50_Q18740 Cluster: Putative uncharacterized protein tag-329; n=2; Caenorhabditis|Rep: Putative uncharacterized protein tag-329 - Caenorhabditis elegans Length = 374 Score = 50.0 bits (114), Expect = 8e-05 Identities = 26/74 (35%), Positives = 36/74 (48%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 ++ Q SC CW F A A+ + T + K + S +++ C P G GC+GG P Sbjct: 160 IKTQDSCACCWGFAAT-AVAEAALTV-HLKKAMNLSEQEVCDCAPKHGPGCNGGDPVDGL 217 Query: 684 EYWKXFGLVSGGSY 725 EY K GL G Y Sbjct: 218 EYIKEMGLTGGKEY 231 >UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin W, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to Cathepsin W, partial - Ornithorhynchus anatinus Length = 229 Score = 49.6 bits (113), Expect = 1e-04 Identities = 30/83 (36%), Positives = 39/83 (46%), Gaps 
= 1/83 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V++QGSCGSCWAF AV + + G + S +++L C C GC GG P + Sbjct: 83 VKNQGSCGSCWAFAAV-GNAESMWYLRAGKRLVSLSVQEVLD-CGRCRDGCQGGYPEDAF 140 Query: 684 -EYWKXFGLVSGGSYHSSXGCRP 749 W GL S Y RP Sbjct: 141 VTMWFNRGLASEKDYPYKVRARP 163 >UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like midgut cysteine proteinase - Tenebrio molitor (Yellow mealworm) Length = 330 Score = 49.6 bits (113), Expect = 1e-04 Identities = 25/81 (30%), Positives = 39/81 (48%), Gaps = 1/81 (1%) Frame = +3 Query: 486 SNVEXXVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSG 662 SN V+DQG CGSCW+F A+ ++ + S ++L+ C G GC G Sbjct: 125 SNAVSEVKDQGQCGSCWSFSTTGAVEGQLALQRG--RLTSLSEQNLIDCSSSYGNAGCDG 182 Query: 663 GMPXLTWEYWKXFGLVSGGSY 725 G + Y +G++S +Y Sbjct: 183 GWMDSAFSYIHDYGIMSESAY 203 >UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Trypanosoma cruzi|Rep: Cysteine proteinase, putative - Trypanosoma cruzi Length = 392 Score = 49.6 bits (113), Expect = 1e-04 Identities = 33/79 (41%), Positives = 38/79 (48%), Gaps = 5/79 (6%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCP---ICG--LGCSGGM 668 V+DQG CGSCWA GA E M + G H S + L SC P CG GC G Sbjct: 158 VKDQGRCGSCWAHGAAEEMESHFAILT-GRLHV-LSQQQLTSCAPNPKKCGGTGGCYGST 215 Query: 669 PXLTWEYWKXFGLVSGGSY 725 L +EY K G+ S Y Sbjct: 216 ADLAYEYAKQ-GITSEWVY 233 >UniRef50_Q23RT7 Cluster: Papain family cysteine protease containing protein; n=7; Hymenostomatida|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 387 Score = 49.6 bits (113), Expect = 1e-04 Identities = 32/88 (36%), Positives = 43/88 (48%), Gaps = 5/88 (5%) Frame = +3 Query: 486 SNVEXXVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCP---ICG--L 650 + V V+DQG CGSCWAF A A+ + + G S + L+SC CG Sbjct: 142 AGVVTPVKDQGHCGSCWAF-ATTAVIESYAAIATGQLK-TLSTQQLVSCVQNSYQCGGQG 199 Query: 651 
GCSGGMPXLTWEYWKXFGLVSGGSYHSS 734 GC+G + L + Y + FGL S Y S Sbjct: 200 GCNGAVSELAYNYVQLFGLTSEYKYSYS 227 >UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platyhelminthes|Rep: Cathepsin L-like proteinase - Echinococcus multilocularis Length = 338 Score = 49.6 bits (113), Expect = 1e-04 Identities = 26/75 (34%), Positives = 36/75 (48%), Gaps = 1/75 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGGMPXLT 680 ++DQG CGSCWAF A A+ ++ + K S + L+ C G GC+GG Sbjct: 137 IKDQGDCGSCWAFSATGALEGQLKRKTG--KLISLSEQQLVDCSTYTGNEGCNGGDMNDA 194 Query: 681 WEYWKXFGLVSGGSY 725 + YW G S Y Sbjct: 195 FRYWMRNGAESESDY 209 >UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_21, whole genome shotgun sequence - Paramecium tetraurelia Length = 349 Score = 49.6 bits (113), Expect = 1e-04 Identities = 26/77 (33%), Positives = 37/77 (48%), Gaps = 3/77 (3%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC---PICGLGCSGGMPX 674 V++QGSCGSCWAF AV A+ G K+ S ++L+ C GC GG Sbjct: 140 VKNQGSCGSCWAFSAVAAL--ETALRQGGVKNVELSEQELVDCAVKDEFESEGCDGGEMY 197 Query: 675 LTWEYWKXFGLVSGGSY 725 ++Y +G+ Y Sbjct: 198 DGFQYASKYGIAIRSEY 214 >UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3.4.14.1) (Dipeptidyl-peptidase I) (DPP-I) (DPPI) (Cathepsin C) (Cathepsin J) (Dipeptidyl transferase) [Contains: Dipeptidyl-peptidase 1 exclusion domain chain (Dipeptidyl- peptidase I exclusion domain chain); Dipeptidyl-peptidase 1 heavy chain (Dipeptidyl-peptidase I heavy chain); Dipeptidyl-peptidase 1 light chain (Dipeptidyl-peptidase I light chain)]; n=50; Coelomata|Rep: Dipeptidyl-peptidase 1 precursor (EC 3.4.14.1) (Dipeptidyl-peptidase I) (DPP-I) (DPPI) (Cathepsin C) (Cathepsin J) (Dipeptidyl transferase) [Contains: Dipeptidyl-peptidase 1 exclusion domain chain (Dipeptidyl- peptidase I exclusion domain chain); Dipeptidyl-peptidase 1 heavy chain 
(Dipeptidyl-peptidase I heavy chain); Dipeptidyl-peptidase 1 light chain (Dipeptidyl-peptidase I light chain)] - Homo sapiens (Human) Length = 463 Score = 49.6 bits (113), Expect = 1e-04 Identities = 26/75 (34%), Positives = 42/75 (56%), Gaps = 1/75 (1%) Frame = +3 Query: 489 NVEXXVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGM 668 N VR+Q SCGSC++F ++ + R+ +N ++ S ++++SC GC GG Sbjct: 244 NFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY-AQGCEGGF 302 Query: 669 PXL-TWEYWKXFGLV 710 P L +Y + FGLV Sbjct: 303 PYLIAGKYAQDFGLV 317 >UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-ear cress). SAG12 protein; n=2; Dictyostelium discoideum|Rep: Similar to Arabidopsis thaliana (Mouse-ear cress). SAG12 protein - Dictyostelium discoideum (Slime mold) Length = 358 Score = 49.2 bits (112), Expect = 1e-04 Identities = 27/70 (38%), Positives = 34/70 (48%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V+DQG CGSC+ F AVE + G K S + + C P G C GG P + Sbjct: 160 VKDQGQCGSCYIFSAVEQI--ETAWIKAGNKPILLSEQQAVDCDPYDG-QCGGGDPYTVY 216 Query: 684 EYWKXFGLVS 713 EY+ G VS Sbjct: 217 EYFSQVGGVS 226 >UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Giardia intestinalis|Rep: Cathepsin B-like CP2 precursor - Giardia lamblia (Giardia intestinalis) Length = 300 Score = 49.2 bits (112), Expect = 1e-04 Identities = 23/62 (37%), Positives = 32/62 (51%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V DQG CGSCWAF +V DR C K +S + ++S C + C+GG W Sbjct: 92 VVDQGGCGSCWAFSSVATFGDRRCVAGLDKKPVKYSPQYVVS-CDHGDMACNGGWLPNVW 150 Query: 684 EY 689 ++ Sbjct: 151 KF 152 >UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio molitor (Yellow mealworm) Length = 336 Score = 48.8 bits (111), Expect = 2e-04 Identities = 27/75 (36%), Positives = 40/75 (53%), Gaps = 1/75 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 
V++QGSCGSCWAF + A+ ++ + S + L+ C P LGCSGG + Sbjct: 136 VKNQGSCGSCWAFSSTGAIESQMKIANGAGYDSSVSEQQLVDCVP-NALGCSGGWMNDAF 194 Query: 684 EY-WKXFGLVSGGSY 725 Y + G+ S G+Y Sbjct: 195 TYVAQNGGIDSEGAY 209 >UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1; Dictyostelium discoideum AX4|Rep: Counting factor associated protein - Dictyostelium discoideum AX4 Length = 531 Score = 48.8 bits (111), Expect = 2e-04 Identities = 33/103 (32%), Positives = 47/103 (45%), Gaps = 5/103 (4%) Frame = +3 Query: 489 NVEXXVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGG 665 N V+DQG CGSCW FG+ ++ C +NG + S + L+ C + G GC GG Sbjct: 319 NCVTPVKDQGICGSCWTFGSTGSLEGTNCV-TNG-ELVSLSEQQLVDCAILTGSQGCGGG 376 Query: 666 MPXLTWEYWKXFGLVSGGS---YHSSXG-CRPYEIPPX*TSRT 782 ++Y G ++ S Y G CR + P S T Sbjct: 377 FASSAFQYVMEIGSLATESNYPYLMQNGLCRDRTVTPSGVSIT 419 >UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin B-like cysteine peptidase - Trichomonas vaginalis G3 Length = 288 Score = 48.8 bits (111), Expect = 2e-04 Identities = 26/68 (38%), Positives = 33/68 (48%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V DQG CGSCW+F ++ + R C N K FS L++ C GC GG+ W Sbjct: 85 VLDQGKCGSCWSFAVSKSFSHRYCRKYN--KPVLFSQSHLVA-CDRRNSGCGGGIEVNAW 141 Query: 684 EYWKXFGL 707 Y GL Sbjct: 142 RYIDLRGL 149 >UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-like precursor; n=26; Euteleostomi|Rep: Tubulointerstitial nephritis antigen-like precursor - Homo sapiens (Human) Length = 467 Score = 48.8 bits (111), Expect = 2e-04 Identities = 26/72 (36%), Positives = 34/72 (47%) Frame = +3 Query: 510 DQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTWEY 689 DQG+C WAF +DRV +S G S ++LLSC GC GG W + Sbjct: 222 DQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQQQGCRGGRLDGAWWF 281 Query: 690 WKXFGLVSGGSY 725 + G+VS Y Sbjct: 282 LRRRGVVSDHCY 293 >UniRef50_UPI00006CFE70 Cluster: Papain family 
cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 336 Score = 48.4 bits (110), Expect = 2e-04 Identities = 25/78 (32%), Positives = 34/78 (43%), Gaps = 4/78 (5%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCP----ICGLGCSGGMP 671 V+DQG CGSCWAF + + + FS + L+ C GCSGG P Sbjct: 140 VKDQGQCGSCWAFSTTGIL--EALYFMENRQKISFSEQQLVDCATNSNGFNSYGCSGGWP 197 Query: 672 XLTWEYWKXFGLVSGGSY 725 +Y FG++ Y Sbjct: 198 EEALKYVAKFGILKEEQY 215 >UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis|Rep: Cathepsin L - Culicoides sonorensis Length = 331 Score = 48.4 bits (110), Expect = 2e-04 Identities = 24/68 (35%), Positives = 37/68 (54%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V++Q CGSCWAF +V ++ R + N K + + ++L+ C GCSGG L Sbjct: 131 VKNQAQCGSCWAFASVASVEMRYKRFHN--KSYTLAEQELVD-CETTSHGCSGGWSDLAL 187 Query: 684 EYWKXFGL 707 +Y + GL Sbjct: 188 QYMRDNGL 195 >UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine proteinase precursor - Heterodera glycines (Soybean cyst nematode worm) Length = 353 Score = 48.4 bits (110), Expect = 2e-04 Identities = 24/63 (38%), Positives = 33/63 (52%), Gaps = 1/63 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGGMPXLT 680 V+DQG CGSCWAF A A+ + +K S ++L+ C G GC GG+ Sbjct: 150 VKDQGDCGSCWAFSATGAI-EGALAQKKASKIISLSEQNLVDCSSKYGNEGCDGGLMDSA 208 Query: 681 WEY 689 +EY Sbjct: 209 FEY 211 >UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanensis|Rep: Sui m 1 allergen - Suidasia medanensis Length = 336 Score = 48.4 bits (110), Expect = 2e-04 Identities = 28/80 (35%), Positives = 37/80 (46%), Gaps = 6/80 (7%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC--CPICGL----GCSGG 665 VR+QG CGSCWAF + + N H S + L+ C P G GC GG Sbjct: 129 
VRNQGQCGSCWAFATAATVEAQYAIRKN--VHVTLSEQQLVDCDHRPFQGQYEDHGCQGG 186 Query: 666 MPXLTWEYWKXFGLVSGGSY 725 P + + Y + GLV +Y Sbjct: 187 NPIIAYAYVQQTGLVEESAY 206 >UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia ATCC 50803|Rep: GLP_113_4299_5381 - Giardia lamblia ATCC 50803 Length = 360 Score = 48.0 bits (109), Expect = 3e-04 Identities = 24/69 (34%), Positives = 33/69 (47%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V DQG+CGSCWAF +V+ D C +S + +L C GC+GG P + Sbjct: 157 VVDQGNCGSCWAFSSVQTFADHRCRSGLDATGVSYSVQYVLD-CDRKDHGCNGGEPVNAF 215 Query: 684 EYWKXFGLV 710 + G V Sbjct: 216 NFLHNTGTV 224 >UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1; Brugia malayi|Rep: Cathepsin F-like cysteine proteinase - Brugia malayi (Filarial nematode worm) Length = 461 Score = 48.0 bits (109), Expect = 3e-04 Identities = 25/67 (37%), Positives = 36/67 (53%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V+DQGSCGSCWAF +V + + G K S ++L+ C + GC+GG+P + Sbjct: 263 VKDQGSCGSCWAF-SVTGNIESLWAIKTG-KLISLSEQELID-CDVIDKGCNGGLPINAF 319 Query: 684 EYWKXFG 704 K G Sbjct: 320 REIKRMG 326 >UniRef50_Q24HI2 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 356 Score = 48.0 bits (109), Expect = 3e-04 Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 1/71 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPXLT 680 V+DQ +CGSCW F A+ + + + S + L+ C GCSGG+P Sbjct: 142 VKDQQNCGSCWTFSTTGAIESHYAIFED-VEPTSLSEQQLIDCAGAFNNNGCSGGLPSQA 200 Query: 681 WEYWKXFGLVS 713 +EY K G +S Sbjct: 201 FEYIKYNGGIS 211 >UniRef50_Q248G1 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 334 Score = 48.0 bits 
(109), Expect = 3e-04 Identities = 28/102 (27%), Positives = 52/102 (50%), Gaps = 9/102 (8%) Frame = +3 Query: 483 LSNVEXXVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPI-CGL--- 650 ++NV +++QG CGSCW F ++ + + +G+ + ++ +++L C + G Sbjct: 130 VTNVVGPIKNQGHCGSCWTF-SIAGIVESHYVLKHGS-YVSYAEQEILDCVSVSAGYQSD 187 Query: 651 GCSGGMPXLTWEYWKXFGLVSGGSY---HSSXGCR--PYEIP 761 GC+GG P +Y +G+V Y CR PY++P Sbjct: 188 GCNGGWPEEALQYVIEYGIVKSEVYPYVAVQGKCRDIPYDVP 229 >UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1 precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 355 Score = 48.0 bits (109), Expect = 3e-04 Identities = 25/75 (33%), Positives = 39/75 (52%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V+DQG CGSCWAF V A+ + + + G S ++L+ C GC+GG+ + Sbjct: 152 VKDQGQCGSCWAFSTVAAV-EGINQITTGNLS-SLSEQELIDCDTTFNSGCNGGLMDYAF 209 Query: 684 EYWKXFGLVSGGSYH 728 +Y ++S G H Sbjct: 210 QY-----IISTGGLH 219 >UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; n=6; Chrysomelinae|Rep: Cathepsin L-like proteinase precursor - Phaedon cochleariae (Mustard beetle) Length = 324 Score = 48.0 bits (109), Expect = 3e-04 Identities = 28/75 (37%), Positives = 37/75 (49%), Gaps = 1/75 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGGMPXLT 680 VR+QG CGSCWA A+ + +G+K S + L+ C G GC+GG Sbjct: 125 VRNQGECGSCWALSTAAAIESQ-SAIKSGSK-VPLSPQQLVDCSTSYGNHGCNGGFAVNG 182 Query: 681 WEYWKXFGLVSGGSY 725 +EY K GL S Y Sbjct: 183 FEYVKDNGLESDADY 197 >UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma|Rep: Cathepsin C precursor - Schistosoma mansoni (Blood fluke) Length = 454 Score = 48.0 bits (109), Expect = 3e-04 Identities = 25/69 (36%), Positives = 39/69 (56%), Gaps = 1/69 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXL-T 680 +R+QG CGSC+A + A+ R+ SN ++ S + ++ C P GC+GG P L Sbjct: 238 
IRNQGICGSCYASPSAAALEARIRLVSNFSEQPILSPQTVVDCSPY-SEGCNGGFPFLIA 296 Query: 681 WEYWKXFGL 707 +Y + FGL Sbjct: 297 GKYGEDFGL 305 >UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: hypothetical protein - Strongylocentrotus purpuratus Length = 331 Score = 47.6 bits (108), Expect = 4e-04 Identities = 23/63 (36%), Positives = 36/63 (57%), Gaps = 1/63 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGGMPXLT 680 V++Q SCG+CWAF VE M ++ + + SA++L+ C G GC GG+P T Sbjct: 144 VKNQKSCGACWAFSVVETMETQIALKTK--RLTQLSAQELVDCGTAAGDGGCRGGIPCKT 201 Query: 681 WEY 689 ++ Sbjct: 202 LDW 204 >UniRef50_Q58HK5 Cluster: Hc58; n=1; Haemonchus contortus|Rep: Hc58 - Haemonchus contortus (Barber pole worm) Length = 241 Score = 47.6 bits (108), Expect = 4e-04 Identities = 17/29 (58%), Positives = 21/29 (72%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNG 590 +RDQ +CGSCWA A E M+DR C +S G Sbjct: 108 IRDQSNCGSCWAVSAAETMSDRACIHSKG 136 Score = 40.3 bits (90), Expect = 0.061 Identities = 24/60 (40%), Positives = 32/60 (53%), Gaps = 4/60 (6%) Frame = +3 Query: 558 MTDRVCTYSNGTKHF--HFSAEDLLSCC--PICGLGCSGGMPXLTWEYWKXFGLVSGGSY 725 M+DR C +S G K F S D+LSCC C +G GG+ W Y +G+ +GG Y Sbjct: 3 MSDRACIHSKG-KAFKARLSDTDILSCCGKDPCQIG-EGGISARAWLYAMQYGVCTGGYY 60 >UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_36, whole genome shotgun sequence - Paramecium tetraurelia Length = 307 Score = 47.6 bits (108), Expect = 4e-04 Identities = 25/74 (33%), Positives = 36/74 (48%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +++QG+CGSCW F A+ A+ + G K S + L+ C G GC+GG L Sbjct: 121 IKNQGNCGSCWTFSAIGAV-EGFLAIRKGFKGV-LSEQQLVDCAVDAGEGCNGGNSDLAL 178 Query: 684 EYWKXFGLVSGGSY 725 +Y G V Y Sbjct: 179 DYIAEVGSVYERDY 192 >UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, whole genome 
shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_23, whole genome shotgun sequence - Paramecium tetraurelia Length = 321 Score = 47.6 bits (108), Expect = 4e-04 Identities = 24/63 (38%), Positives = 32/63 (50%), Gaps = 1/63 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPXLT 680 ++DQG CGSCWAF AV A+ + T + S +DL+ C P GC GG Sbjct: 134 IKDQGDCGSCWAFSAVGAL--EINTKIQFNEIVDLSEQDLVDCAGPYGNAGCDGGWMESA 191 Query: 681 WEY 689 +Y Sbjct: 192 LDY 194 >UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18; Magnoliophyta|Rep: Thiol protease aleurain precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 358 Score = 47.6 bits (108), Expect = 4e-04 Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 1/68 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPXLT 680 V+DQG CGSCW F A+ + K S + L+ C GC+GG+P Sbjct: 156 VKDQGGCGSCWTFSTTGAL--EAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQA 213 Query: 681 WEYWKXFG 704 +EY K G Sbjct: 214 FEYIKSNG 221 >UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin L - Misgurnus mizolepis (Mud loach) Length = 337 Score = 47.2 bits (107), Expect = 5e-04 Identities = 24/65 (36%), Positives = 35/65 (53%), Gaps = 1/65 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPXLT 680 V+DQG CGSCWAF AM ++ + K S ++L+ C P GC+GG+ Sbjct: 131 VKDQGECGSCWAFSTTGAMEGQM--FRKQGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQA 188 Query: 681 WEYWK 695 ++Y K Sbjct: 189 FQYIK 193 >UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine protease; n=11; Callosobruchus maculatus|Rep: Putative gut cathepsin L-like cysteine protease - Callosobruchus maculatus (Southern cowpea weevil) (Pulse bruchid) Length = 326 Score = 47.2 bits (107), Expect = 5e-04 Identities = 27/76 (35%), Positives = 41/76 (53%), Gaps = 2/76 (2%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC--PICGLGCSGGMPXL 677 V+DQ +CGSCWAF AV A+ + NGT SA++L+ C 
GC GG+ Sbjct: 127 VKDQANCGSCWAFSAVGAIEGQFFK-KNGTL-VSLSAQELVDCATEDYGNNGCKGGLMGQ 184 Query: 678 TWEYWKXFGLVSGGSY 725 +++ + G+ + SY Sbjct: 185 AFDFVQDEGIQTEESY 200 >UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcoptes scabiei type hominis|Rep: Sar s 1 allergen Yv6030H07 - Sarcoptes scabiei type hominis Length = 322 Score = 47.2 bits (107), Expect = 5e-04 Identities = 25/80 (31%), Positives = 39/80 (48%), Gaps = 1/80 (1%) Frame = +3 Query: 489 NVEXXVRDQGSCGSCWAFGAV-EAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGG 665 NV +R+QG+CGSCWAF + A ++ + T + S + L+ C GC G Sbjct: 115 NVLTPIREQGACGSCWAFSTICTAESNYLTTRQAPLNKWTLSEQQLVDCA--SPKGCDGE 172 Query: 666 MPXLTWEYWKXFGLVSGGSY 725 P ++Y G+ +G Y Sbjct: 173 KPTTGFKYLLEKGVTTGDRY 192 >UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa zeasingle nucleocapsid nuclear polyhedrosis virus) Length = 367 Score = 47.2 bits (107), Expect = 5e-04 Identities = 24/67 (35%), Positives = 36/67 (53%) Frame = +3 Query: 486 SNVEXXVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGG 665 +N ++DQG CGSCWAF A+ + + N K S + LL C + LGC+GG Sbjct: 165 TNKVTPIKDQGVCGSCWAFVAIGNIESQYAIRHN--KLIDLSEQQLLDCDEV-DLGCNGG 221 Query: 666 MPXLTWE 686 + L ++ Sbjct: 222 LMHLAFQ 228 >UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Bigelowiella natans|Rep: Digestive cysteine proteinase - Bigelowiella natans (Pedinomonas minutissima) (Chlorarachnion sp.(strain CCMP 621)) Length = 360 Score = 46.8 bits (106), Expect = 7e-04 Identities = 30/82 (36%), Positives = 36/82 (43%), Gaps = 3/82 (3%) Frame = +3 Query: 489 NVEXXVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT--KHFHFSAEDLLSCCPICGLGCSG 662 N V+DQG CGSCWAF A +A+ N T S E L+ C C G Sbjct: 119 NALTPVKDQGGCGSCWAFSATQALESAHYIKHNDTLDSPIALSTEQLVE-CDQHDYACYG 177 Query: 663 GMPXLTWEYWKXF-GLVSGGSY 725 G P +Y K GLV+ Y Sbjct: 178 GFPRDAMKYIKESGGLVAEADY 199 >UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba histolytica|Rep: 
Cysteine protease 19 - Entamoeba histolytica Length = 324 Score = 46.8 bits (106), Expect = 7e-04 Identities = 27/78 (34%), Positives = 43/78 (55%), Gaps = 4/78 (5%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAM-TDRVCTYSN-GTKHFHFSAEDLLSCC--PICGLGCSGGMP 671 V+DQG+CGSC+AF +V M T + +Y + ++ S +++SCC P GC GG Sbjct: 115 VKDQGNCGSCYAFSSVALMETAVLLSYDDLSPSNYALSTAEIVSCCYDPSECRGCEGGSI 174 Query: 672 XLTWEYWKXFGLVSGGSY 725 +Y + G+ S S+ Sbjct: 175 GGALKYAQDNGMQSESSF 192 >UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted, possible transmembrane domain near N-terminus; n=4; Cryptosporidium|Rep: Cryptopain-cysteine proteinase secreted, possible transmembrane domain near N-terminus - Cryptosporidium parvum Iowa II Length = 401 Score = 46.8 bits (106), Expect = 7e-04 Identities = 23/63 (36%), Positives = 32/63 (50%), Gaps = 1/63 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGGMPXLT 680 +R+Q +CGSCWAF AV A+ C +N S + + C G GC GG L Sbjct: 191 IRNQKNCGSCWAFSAVAALEGATCAQTNRGLP-SLSEQQFVDCSKQNGNFGCDGGTMGLA 249 Query: 681 WEY 689 ++Y Sbjct: 250 FQY 252 >UniRef50_Q24FA8 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 335 Score = 46.8 bits (106), Expect = 7e-04 Identities = 24/78 (30%), Positives = 35/78 (44%), Gaps = 4/78 (5%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPI----CGLGCSGGMP 671 V++QG CG CW F A M ++ +S + LL C + GC GG+P Sbjct: 139 VKNQGECGGCWTFSATGLMESFNLIHNKPQNVSLYSQQQLLDCVTLENGYFSEGCEGGVP 198 Query: 672 XLTWEYWKXFGLVSGGSY 725 +Y FG++S Y Sbjct: 199 SDAVQYAADFGVLSDNEY 216 >UniRef50_Q23FS6 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 330 Score = 46.8 bits (106), Expect = 7e-04 Identities = 22/74 (29%), Positives = 33/74 (44%) 
Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V+DQG CGSCWAF ++ + + S + L+ C GC GG + Sbjct: 132 VKDQGQCGSCWAFSTTGSVESALIIAGYANQTIDLSEQQLVD-CSATNYGCGGGWMDNAF 190 Query: 684 EYWKXFGLVSGGSY 725 EY + L + +Y Sbjct: 191 EYIEESPLTTNSNY 204 >UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; n=16; Chrysomelidae|Rep: Digestive cysteine protease intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 326 Score = 46.8 bits (106), Expect = 7e-04 Identities = 28/76 (36%), Positives = 37/76 (48%), Gaps = 2/76 (2%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLG-C-SGGMPXL 677 V+DQ CGSCWAF A A+ + +N S + LL C G G C GG Sbjct: 125 VKDQNPCGSCWAFSATGALEGQNAILNN--VKISLSEQQLLDCSAAYGNGNCKEGGDMSA 182 Query: 678 TWEYWKXFGLVSGGSY 725 +EY + +G+ S SY Sbjct: 183 AFEYVRDYGIQSEKSY 198 >UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n=1; Xenopus tropicalis|Rep: UPI00006A2275 UniRef100 entry - Xenopus tropicalis Length = 272 Score = 46.4 bits (105), Expect = 0.001 Identities = 30/82 (36%), Positives = 43/82 (52%), Gaps = 3/82 (3%) Frame = +3 Query: 489 NVEXXVRDQGS-CGSCWAFGAVEAMTDRVCTYSNGTKHF-HFSAEDLLSCCPICGL-GCS 659 N VRDQGS C SC+AF AV A+ C + T FS ++L+ C G GC+ Sbjct: 89 NCVTPVRDQGSFCRSCYAFSAVGALE---CQWKKKTVRLVTFSPQELVDCSDGEGNHGCN 145 Query: 660 GGMPXLTWEYWKXFGLVSGGSY 725 GG ++Y K +G++ +Y Sbjct: 146 GGKIEKAFKYMKKYGVMEESAY 167 >UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4; core eudicotyledons|Rep: Papain-like cysteine peptidase XBCP3 - Arabidopsis thaliana (Mouse-ear cress) Length = 437 Score = 46.4 bits (105), Expect = 0.001 Identities = 22/62 (35%), Positives = 34/62 (54%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V+DQGSCG+CW+F A AM + + G S ++L+ C GC+GG+ + Sbjct: 133 VKDQGSCGACWSFSATGAM-EGINQIVTGDL-ISLSEQELIDCDKSYNAGCNGGLMDYAF 190 Query: 684 EY 689 E+ Sbjct: 191 EF 192 >UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase 
precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 317 Score = 46.4 bits (105), Expect = 0.001 Identities = 27/75 (36%), Positives = 35/75 (46%), Gaps = 1/75 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPXLT 680 VRDQ CGSCWAF A A+ + + K S + L+ C GC+GG P Sbjct: 119 VRDQEQCGSCWAFSAAGALEGQ--RFLKEGKLEVLSTQQLVDCSRDYKNEGCNGGWPHWA 176 Query: 681 WEYWKXFGLVSGGSY 725 ++Y K GL Y Sbjct: 177 YDYIKDNGLCLESKY 191 >UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foetus|Rep: TFCP2 protein - Tritrichomonas foetus (Trichomonas foetus) Length = 270 Score = 46.4 bits (105), Expect = 0.001 Identities = 24/64 (37%), Positives = 32/64 (50%), Gaps = 2/64 (3%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC--CPICGLGCSGGMPXL 677 +++QGSCGSCWAF A+ A C + FS + L+ C GCSGG P Sbjct: 65 IKNQGSCGSCWAFSAIAAQES--CHAIATGELLRFSEQSLVDCVTSDYSCQGCSGGWPDQ 122 Query: 678 TWEY 689 +Y Sbjct: 123 AMKY 126 >UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Rep: Cysteine protease - Babesia equi Length = 438 Score = 46.4 bits (105), Expect = 0.001 Identities = 25/68 (36%), Positives = 38/68 (55%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V+DQG+CGSCWAF AV ++ + + G + S ++L++C GC G +P Sbjct: 239 VKDQGNCGSCWAFAAVGSV-ESLYLIKKG-QALDLSEQELVNCEENSN-GCEGDLPNKAL 295 Query: 684 EYWKXFGL 707 EY K G+ Sbjct: 296 EYIKAKGI 303 >UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 291 Score = 46.4 bits (105), Expect = 0.001 Identities = 23/56 (41%), Positives = 32/56 (57%), Gaps = 1/56 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAM-TDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGM 668 +RDQ CGSCWAFG V A ++ YSN + S ++++ C C GC GG+ Sbjct: 93 
IRDQKQCGSCWAFGTVAACESNYALLYSNLPQ---LSEQNIIDCATTC-YGCGGGI 144 >UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, whole genome shotgun sequence; n=7; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_22, whole genome shotgun sequence - Paramecium tetraurelia Length = 350 Score = 46.4 bits (105), Expect = 0.001 Identities = 27/77 (35%), Positives = 34/77 (44%), Gaps = 3/77 (3%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHF-HFSAEDLLSCCPICGL--GCSGGMPX 674 V+DQG CGSCWAF + Y T S + L+ C + GC GGMP Sbjct: 157 VKDQGQCGSCWAFSTTGVLEG---FYKVQTGELPDLSEQQLVDCSTLIDFNQGCDGGMPS 213 Query: 675 LTWEYWKXFGLVSGGSY 725 Y K GL + +Y Sbjct: 214 RALNYVKRNGLTTQDAY 230 >UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_101, whole genome shotgun sequence - Paramecium tetraurelia Length = 306 Score = 46.4 bits (105), Expect = 0.001 Identities = 26/74 (35%), Positives = 37/74 (50%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V++QG+CGS W+F AV A + + GT HF +S ++L+ C GC GG P Sbjct: 123 VKNQGTCGSGWSFSAVGAF-EAFFIFVKGT-HFQYSEQNLVD-CDTNSHGCDGGYPAKAI 179 Query: 684 EYWKXFGLVSGGSY 725 +Y G Y Sbjct: 180 DYLNKNGAFLESEY 193 >UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Plasmodium (Vinckeia)|Rep: Cysteine proteinase precursor - Plasmodium vinckei Length = 506 Score = 46.4 bits (105), Expect = 0.001 Identities = 23/73 (31%), Positives = 35/73 (47%) Frame = +3 Query: 507 RDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTWE 686 +DQG+CGSCWAF A+ + + ++ FS + ++ C GC GG P + Sbjct: 279 KDQGNCGSCWAFAAI-GNFEYLYVHTRHEMPISFSEQQMVDCSTE-NYGCDGGNPFYAFL 336 Query: 687 YWKXFGLVSGGSY 725 Y G+ G Y Sbjct: 337 YMINNGVCLGDEY 349 >UniRef50_Q9GZM7-2 Cluster: Isoform 2 of Q9GZM7 ; n=1; Homo sapiens|Rep: Isoform 2 of Q9GZM7 - Homo sapiens (Human) Length = 283 Score = 46.0 bits (104), Expect = 0.001 Identities = 25/77 (32%), 
Positives = 33/77 (42%) Frame = +3 Query: 510 DQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTWEY 689 DQG+C WAF +DRV +S G S ++LLSC GC GG W + Sbjct: 88 DQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQQQGCRGGRLDGAWWF 147 Query: 690 WKXFGLVSGGSYHSSXG 740 + G + G G Sbjct: 148 LRRRGYAATGDVGREEG 164 >UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Rep: Cathepsin L - Stylonychia lemnae Length = 340 Score = 46.0 bits (104), Expect = 0.001 Identities = 22/62 (35%), Positives = 31/62 (50%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V+DQG CGSCWAF + ++ R + K S + L+ C GC+GG L Sbjct: 140 VKDQGQCGSCWAFSTIASLESRY--FIETGKLQSLSEQQLVDCSKNGNEGCNGGDMGLAM 197 Query: 684 EY 689 +Y Sbjct: 198 DY 199 >UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Cathepsin - Geodia cydonium (Sponge) Length = 322 Score = 46.0 bits (104), Expect = 0.001 Identities = 28/77 (36%), Positives = 43/77 (55%), Gaps = 3/77 (3%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGT-KHFHFSAEDLLSCCPICG-LGCSGGMPXL 677 V++QG CGSCWAF A ++ + + N T K S ++L+ C G GC+GG+P Sbjct: 118 VKNQGQCGSCWAFSATGSLEGQ---HFNATGKLVSLSEQNLVDCSSAEGNEGCNGGLPDD 174 Query: 678 TWEY-WKXFGLVSGGSY 725 ++Y K G+ + SY Sbjct: 175 AFKYVIKNGGIDTEASY 191 >UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma japonicum|Rep: SJCHGC06231 protein - Schistosoma japonicum (Blood fluke) Length = 372 Score = 46.0 bits (104), Expect = 0.001 Identities = 22/63 (34%), Positives = 34/63 (53%), Gaps = 1/63 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGGMPXLT 680 V++QG CGSCWAF + A+ + Y + + S + L+ C G GC GG+ L Sbjct: 165 VKNQGQCGSCWAFSSTGAIEGQ--HYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDLA 222 Query: 681 WEY 689 ++Y Sbjct: 223 FQY 225 >UniRef50_Q231X3 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 323 Score = 46.0 
bits (104), Expect = 0.001 Identities = 22/75 (29%), Positives = 34/75 (45%), Gaps = 1/75 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGGMPXLT 680 V+ QG+CGSCWAF A ++ + K S + L+ C G GC+ G Sbjct: 130 VKSQGNCGSCWAFSATASVESALIIAGKVDKSISLSEQQLIDCSGDYGNYGCAAGQKEQA 189 Query: 681 WEYWKXFGLVSGGSY 725 Y K + + + +Y Sbjct: 190 LVYIKRYSITTEQNY 204 >UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; Dictyostelium discoideum|Rep: Cysteine proteinase 7 precursor - Dictyostelium discoideum (Slime mold) Length = 460 Score = 46.0 bits (104), Expect = 0.001 Identities = 23/64 (35%), Positives = 35/64 (54%), Gaps = 2/64 (3%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHF-HFSAEDLLSCCPICG-LGCSGGMPXL 677 +++QG CG CW+F A T+ +NG K+ S ++L+ C G GC GG+ L Sbjct: 125 IKNQGQCGGCWSFSTTGA-TEGAQYLANGKKNLVSLSEQNLIDCSGSYGNNGCEGGLMTL 183 Query: 678 TWEY 689 +EY Sbjct: 184 AFEY 187 >UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase) (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1; Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase) (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1; Vignain-2] - Vigna mungo (Rice bean) (Black gram) Length = 362 Score = 46.0 bits (104), Expect = 0.001 Identities = 24/67 (35%), Positives = 35/67 (52%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V+DQG CGSCWAF + A+ +N K S ++L+ C GC+GG+ + Sbjct: 143 VKDQGQCGSCWAFSTIVAVEGINQIKTN--KLVSLSEQELVDCDKEENQGCNGGLMESAF 200 Query: 684 EYWKXFG 704 E+ K G Sbjct: 201 EFIKQKG 207 >UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F like protease; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to cathepsin F like protease - Nasonia vitripennis Length = 1036 Score = 45.6 bits (103), Expect = 0.002 Identities = 25/72 (34%), Positives = 38/72 (52%) Frame = +3 Query: 489 
NVEXXVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGM 668 NV V+DQGSCGSCWAF +V + +G + S ++L+ C + GC+GG+ Sbjct: 827 NVVTPVKDQGSCGSCWAF-SVTGNIEGQYAIKHG-ELLSLSEQELVDCDKL-DSGCNGGL 883 Query: 669 PXLTWEYWKXFG 704 P + + G Sbjct: 884 PDTAYRAIEELG 895 >UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin O; n=1; Danio rerio|Rep: PREDICTED: similar to cathepsin O - Danio rerio Length = 327 Score = 45.6 bits (103), Expect = 0.002 Identities = 28/76 (36%), Positives = 35/76 (46%), Gaps = 2/76 (2%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMP--XL 677 V +QGSCG CWAF VEA+ + G K S + ++ C GC+GG P L Sbjct: 135 VHNQGSCGGCWAFSIVEAIES--VSAKVGEKLQQLSVQQVID-CSYQNQGCNGGSPVEAL 191 Query: 678 TWEYWKXFGLVSGGSY 725 W LVS Y Sbjct: 192 YWLTQSKLKLVSEAEY 207 >UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 343 Score = 45.6 bits (103), Expect = 0.002 Identities = 23/74 (31%), Positives = 35/74 (47%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 ++ QG CGSCWAF A+ V G + S++ LL C + C GG P Sbjct: 153 IKYQGPCGSCWAFATAAAIESAVSISGGGLQ--SLSSQQLLDCTVVSD-KCGGGEPVEAL 209 Query: 684 EYWKXFGLVSGGSY 725 +Y + G+ + +Y Sbjct: 210 KYAQSHGITTAHNY 223 >UniRef50_Q7QQ83 Cluster: GLP_42_16392_14707; n=1; Giardia lamblia ATCC 50803|Rep: GLP_42_16392_14707 - Giardia lamblia ATCC 50803 Length = 561 Score = 45.6 bits (103), Expect = 0.002 Identities = 27/79 (34%), Positives = 37/79 (46%), Gaps = 4/79 (5%) Frame = +3 Query: 510 DQGSCGSCWAFGAVEAMTDRVCTYSNGTKHF----HFSAEDLLSCCPICGLGCSGGMPXL 677 DQG CGSC+ V AMT RV S S + L C GCSGG + Sbjct: 245 DQGHCGSCYTAATVWAMTARVMVASEDEDKLGATRRLSVQHALDCNQY-AQGCSGGFAEM 303 Query: 678 TWEYWKXFGLVSGGSYHSS 734 ++ + FG+++ SY+ S Sbjct: 304 VVKFAEEFGILTENSYYIS 322 >UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4; Caenorhabditis|Rep: Cathepsin z protein 1 - Caenorhabditis elegans Length = 306 Score = 45.6 
bits (103), Expect = 0.002 Identities = 26/81 (32%), Positives = 39/81 (48%), Gaps = 4/81 (4%) Frame = +3 Query: 522 CGSCWAFGAVEAMTDRV-CTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTWEYWKX 698 CGSCWAFGA A+ DR+ N + S ++++ C G GG P ++Y Sbjct: 92 CGSCWAFGATSALADRINIKRKNAWPQAYLSVQEVIDCSG-AGTCVMGGEPGGVYKYAHE 150 Query: 699 FGL--VSGGSYHSSXG-CRPY 752 G+ + +Y + G C PY Sbjct: 151 HGIPHETCNNYQARDGKCDPY 171 >UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis|Rep: Cysteine protease 2 - Babesia bovis Length = 445 Score = 45.6 bits (103), Expect = 0.002 Identities = 24/68 (35%), Positives = 35/68 (51%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V+DQG CGSCWAF AV ++ + + S ++L+S C + GC+GG Sbjct: 251 VKDQGMCGSCWAFAAVGSVESLLKRQKTDVR---LSEQELVS-CQLGNQGCNGGYSDYAL 306 Query: 684 EYWKXFGL 707 Y K G+ Sbjct: 307 NYIKFNGI 314 >UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_79, whole genome shotgun sequence - Paramecium tetraurelia Length = 324 Score = 45.6 bits (103), Expect = 0.002 Identities = 27/76 (35%), Positives = 38/76 (50%), Gaps = 2/76 (2%) Frame = +3 Query: 504 VRDQGS-CGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPXL 677 ++DQGS CGS WAF AV + + + S +D+L C P GCSGG Sbjct: 133 IKDQGSSCGSSWAFSAVGVL--EINSNIEFGLETTLSEQDMLDCSGPYGNQGCSGGWMDS 190 Query: 678 TWEYWKXFGLVSGGSY 725 +EY + G+ +G Y Sbjct: 191 GFEYVRDHGIANGSVY 206 >UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3; Eukaryota|Rep: Cathepsin-like cysteine protease - Phytophthora infestans (Potato late blight fungus) Length = 635 Score = 45.2 bits (102), Expect = 0.002 Identities = 22/57 (38%), Positives = 31/57 (54%), Gaps = 1/57 (1%) Frame = +3 Query: 522 CGSCWAFGAVEAMTDRVCTYSNGT-KHFHFSAEDLLSCCPICGLGCSGGMPXLTWEY 689 CGSCWA G A++DR+ N + S + L++C G C+GG P L +EY Sbjct: 389 CGSCWAQGTTSALSDRISILRNASWPEIALSPQVLINC--HAGGTCNGGNPGLVYEY 443 Score = 34.3 bits (75), Expect 
= 4.0 Identities = 24/76 (31%), Positives = 33/76 (43%), Gaps = 10/76 (13%) Frame = +3 Query: 522 CGSCWAFGAVEAMTDRVCTYSN---GTK---HFH----FSAEDLLSCCPICGLGCSGGMP 671 CGSCW+F A A+ DR+ + G K H S + +L+C GC GG Sbjct: 83 CGSCWSFAATSALADRILIFKERNPGNKPSVEVHRGVVLSPQVILNCDKKDN-GCHGGDQ 141 Query: 672 XLTWEYWKXFGLVSGG 719 + Y K G+ G Sbjct: 142 LEAYRYIKEHGVPEEG 157 >UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Plasmodium|Rep: Cysteine proteinase precursor - Plasmodium vivax (strain Salvador I) Length = 583 Score = 45.2 bits (102), Expect = 0.002 Identities = 24/73 (32%), Positives = 35/73 (47%) Frame = +3 Query: 507 RDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTWE 686 +DQG CGSCWAF +V + N T S ++++ C + GC GG P ++ Sbjct: 355 KDQGLCGSCWAFASVGNVECMYAKEHNKT-ILTLSEQEVVDCSKL-NFGCDGGHPFYSFI 412 Query: 687 YWKXFGLVSGGSY 725 Y G+ G Y Sbjct: 413 YAIENGICMGDDY 425 >UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; Entamoeba|Rep: Cysteine proteinase 2 precursor - Entamoeba histolytica Length = 315 Score = 45.2 bits (102), Expect = 0.002 Identities = 23/76 (30%), Positives = 37/76 (48%), Gaps = 2/76 (2%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKH-FHFSAEDLLSCCPICG-LGCSGGMPXL 677 +RDQ CGSC+ FG++ A+ R+ G + S E ++ C G GC+GG+ Sbjct: 109 IRDQAQCGSCYTFGSLAALEGRLLIEKGGDANTLDLSEEHMVQCTRDNGNNGCNGGLGSN 168 Query: 678 TWEYWKXFGLVSGGSY 725 ++Y G+ Y Sbjct: 169 VYDYIIEHGVAKESDY 184 >UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor; n=17; Magnoliophyta|Rep: Thiol protease aleurain-like precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 358 Score = 45.2 bits (102), Expect = 0.002 Identities = 23/68 (33%), Positives = 31/68 (45%), Gaps = 1/68 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPXLT 680 V++QG CGSCW F A+ + K S + L+ C GC GG+P Sbjct: 156 VKEQGHCGSCWTFSTTGAL--EAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQA 213 Query: 681 WEYWKXFG 704 +EY K G Sbjct: 214 FEYIKYNG 221 >UniRef50_Q9FJ47 Cluster: 
Senescence-specific cysteine protease; n=23; Magnoliophyta|Rep: Senescence-specific cysteine protease - Arabidopsis thaliana (Mouse-ear cress) Length = 346 Score = 44.8 bits (101), Expect = 0.003 Identities = 24/67 (35%), Positives = 33/67 (49%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +++QGSCG CWAF AV A+ T K S + L+ C GC GG+ + Sbjct: 145 IKNQGSCGCCWAFSAVAAIEG--ATQIKKGKLISLSEQQLVD-CDTNDFGCEGGLMDTAF 201 Query: 684 EYWKXFG 704 E+ K G Sbjct: 202 EHIKATG 208 >UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa|Rep: Os09g0497500 protein - Oryza sativa subsp. japonica (Rice) Length = 349 Score = 44.8 bits (101), Expect = 0.003 Identities = 29/82 (35%), Positives = 44/82 (53%), Gaps = 3/82 (3%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V++QG CGSCWAF AV A+ + + NG + S ++L+ C +GC GG + Sbjct: 137 VKNQGDCGSCWAFSAVAAI-EGINQIKNG-ELVSLSEQELVDCDDE-AVGCGGGYMSWAF 193 Query: 684 EY-WKXFGLVSGGS--YHSSXG 740 E+ GL + S YH++ G Sbjct: 194 EFVVGNHGLTTEASYPYHAANG 215 >UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa (Rice) Length = 339 Score = 44.8 bits (101), Expect = 0.003 Identities = 28/77 (36%), Positives = 41/77 (53%), Gaps = 3/77 (3%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG--LGCSGGMPXL 677 ++DQG CG CWAF AV AM + + S G K S ++L+ C + G GC GG+ Sbjct: 138 IKDQGQCGCCWAFSAVAAM-EGIVKLSTG-KLISLSEQELVD-CDVHGEDQGCEGGLMDD 194 Query: 678 TWEY-WKXFGLVSGGSY 725 +++ K GL + Y Sbjct: 195 AFKFIIKNGGLTTESKY 211 >UniRef50_Q23FQ5 Cluster: Papain family cysteine protease containing protein; n=4; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 44.8 bits (101), Expect = 0.003 Identities = 25/77 (32%), Positives = 35/77 (45%), Gaps = 3/77 (3%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGL---GCSGGMPX 674 +++QG 
CGSC AFG + Y + FS + LL C G GC G Sbjct: 140 IQNQGQCGSCAAFGTAGVLES--FYYLKSKQLLKFSEQQLLDCARQAGFDTYGCDGAWQQ 197 Query: 675 LTWEYWKXFGLVSGGSY 725 ++Y +G+V G SY Sbjct: 198 EYFKYAIKYGIVQGSSY 214 >UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (Major excreted protein) (MEP) [Contains: Cathepsin L heavy chain; Cathepsin L light chain]; n=19; Eukaryota|Rep: Cathepsin L precursor (EC 3.4.22.15) (Major excreted protein) (MEP) [Contains: Cathepsin L heavy chain; Cathepsin L light chain] - Homo sapiens (Human) Length = 333 Score = 44.8 bits (101), Expect = 0.003 Identities = 26/76 (34%), Positives = 41/76 (53%), Gaps = 2/76 (2%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPXLT 680 V++QG CGSCWAF A A+ ++ + + S ++L+ C P GC+GG+ Sbjct: 129 VKNQGQCGSCWAFSATGALEGQM--FRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYA 186 Query: 681 WEYWK-XFGLVSGGSY 725 ++Y + GL S SY Sbjct: 187 FQYVQDNGGLDSEESY 202 >UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [Contains: Cathepsin H mini chain; Cathepsin H heavy chain; Cathepsin H light chain]; n=37; Eukaryota|Rep: Cathepsin H precursor (EC 3.4.22.16) [Contains: Cathepsin H mini chain; Cathepsin H heavy chain; Cathepsin H light chain] - Homo sapiens (Human) Length = 335 Score = 44.8 bits (101), Expect = 0.003 Identities = 21/68 (30%), Positives = 32/68 (47%), Gaps = 1/68 (1%) Frame = +3 Query: 489 NVEXXVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC-PICGLGCSGG 665 N V++QG+CGSCW F A+ + + K + + L+ C GC GG Sbjct: 127 NFVSPVKNQGACGSCWTFSTTGALESAIAIATG--KMLSLAEQQLVDCAQDFNNHGCQGG 184 Query: 666 MPXLTWEY 689 +P +EY Sbjct: 185 LPSQAFEY 192 >UniRef50_Q9XXQ7 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 386 Score = 44.4 bits (100), Expect = 0.004 Identities = 25/74 (33%), Positives = 35/74 (47%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 ++DQG C CW F AV A+ + V +G K S +++ C GC GG L 
Sbjct: 167 IKDQGQCACCWGF-AVTALVETVYAAHSG-KFKSLSDQEVCDCGTEGTPGCKGGSLTLGV 224 Query: 684 EYWKXFGLVSGGSY 725 +Y K +GL Y Sbjct: 225 QYVKKYGLSGDEDY 238 >UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein A; n=2; Dictyostelium discoideum|Rep: Gamete and mating-type specific protein A - Dictyostelium discoideum (Slime mold) Length = 448 Score = 44.4 bits (100), Expect = 0.004 Identities = 23/76 (30%), Positives = 39/76 (51%), Gaps = 2/76 (2%) Frame = +3 Query: 486 SNVEXXVRDQGSCGSCWAFGAVEAMTDR-VCTYSNGTKH-FHFSAEDLLSCCPICGLGCS 659 ++ + +RDQG CGSCWAF + A+ R + Y K S ++ ++C GC+ Sbjct: 247 TSYQTPIRDQGQCGSCWAFASSAALESRYLIKYGTAQKSTLQLSNQNAVNC---IASGCN 303 Query: 660 GGMPXLTWEYWKXFGL 707 GG + ++K G+ Sbjct: 304 GGWSGNYFNFFKTPGI 319 >UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcoptes scabiei type hominis|Rep: Cathepsin L-like protease - Sarcoptes scabiei type hominis Length = 245 Score = 44.4 bits (100), Expect = 0.004 Identities = 32/96 (33%), Positives = 47/96 (48%), Gaps = 5/96 (5%) Frame = +3 Query: 483 LSNVEXXVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCS 659 L NV ++DQ CGSCWAF AV +M + + + S ++L+ C G GC Sbjct: 128 LKNVVAPIKDQKQCGSCWAFSAVASMESQNALKTG--QLVELSEQELVDCSVGEGNEGCD 185 Query: 660 GGMPXLTWEY-WKXFGLVSGGS--YHS-SXGCRPYE 755 GG +E+ K G+ + S YH + CR Y+ Sbjct: 186 GGWMDSAFEFVIKADGIDTEKSYPYHGVNQVCRSYQ 221 >UniRef50_Q239L8 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 323 Score = 44.4 bits (100), Expect = 0.004 Identities = 21/74 (28%), Positives = 37/74 (50%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V+DQG CGSCW+F A+ + + + K S + L+ C GC+GG+ + Sbjct: 138 VKDQGQCGSCWSFSTTGAVEGAL--FLSTKKLTSLSEQYLVDCSKDGNEGCNGGLMDTAF 195 Query: 684 EYWKXFGLVSGGSY 725 ++ G+ + +Y Sbjct: 196 DFISQHGIPTEAAY 209 >UniRef50_Q22DX2 Cluster: Papain family cysteine protease containing 
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 358 Score = 44.4 bits (100), Expect = 0.004 Identities = 25/75 (33%), Positives = 36/75 (48%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V +QG CGSCWAF A+ N T + S + L+ C G GC GG + Sbjct: 164 VENQGQCGSCWAFSTSGAVESYYSAKKNIT--LNLSKQQLVDCVYDHG-GCDGGWFNDAF 220 Query: 684 EYWKXFGLVSGGSYH 728 +Y + G+V +Y+ Sbjct: 221 KYIQSVGIVLNATYY 235 >UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_46, whole genome shotgun sequence - Paramecium tetraurelia Length = 336 Score = 44.4 bits (100), Expect = 0.004 Identities = 25/68 (36%), Positives = 30/68 (44%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V+DQG C CWAFGAV A Y S + L+ C GC+GG L Sbjct: 154 VKDQGQCSGCWAFGAVGAA--EAWFYVKNKTTVLLSEQQLID-CDTQSFGCNGGYQNLAL 210 Query: 684 EYWKXFGL 707 +Y GL Sbjct: 211 KYIANHGL 218 >UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin L-like protease; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to cathepsin L-like protease - Nasonia vitripennis Length = 353 Score = 44.0 bits (99), Expect = 0.005 Identities = 25/64 (39%), Positives = 36/64 (56%), Gaps = 2/64 (3%) Frame = +3 Query: 504 VRDQG-SCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGGMPXL 677 VRDQG +CGSCWAF A A+ + + SA++L+ C G LGC GG L Sbjct: 147 VRDQGLTCGSCWAFSAAGALEAQY--FKKTGVLTALSAQNLIDCTMEYGNLGCGGGSAAL 204 Query: 678 TWEY 689 ++++ Sbjct: 205 SFQF 208 >UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin l; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin l - Strongylocentrotus purpuratus Length = 489 Score = 44.0 bits (99), Expect = 0.005 Identities = 22/63 (34%), Positives = 31/63 (49%), Gaps = 1/63 (1%) Frame = +3 Query: 504 
VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGGMPXLT 680 V+DQ CGSCW+FG+ E + V + K S + L+ C G GC GG Sbjct: 282 VKDQAVCGSCWSFGSAETIEGAV--FMQSGKRVRLSQQMLMDCTWAAGNNGCDGGEEWRV 339 Query: 681 WEY 689 +E+ Sbjct: 340 YEW 342 >UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays (Maize) Length = 493 Score = 44.0 bits (99), Expect = 0.005 Identities = 21/55 (38%), Positives = 30/55 (54%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGM 668 V+DQG CG CWAF AV A+ + + G+ S ++L+ C GC GG+ Sbjct: 179 VKDQGQCGGCWAFSAVAAV-EGINKIVTGSL-ISLSEQELIDCDKFQDQGCDGGL 231 >UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomuscorum|Rep: Cathepsin H - Oxytricha trifallax (Sterkiella histriomuscorum) Length = 366 Score = 44.0 bits (99), Expect = 0.005 Identities = 23/68 (33%), Positives = 32/68 (47%), Gaps = 1/68 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPXLT 680 V++QG CGSCW F V + + + S + L+ C GCSGG+P Sbjct: 150 VKNQGKCGSCWTFSTVGCVESHYLLKYGAFR--NLSEQQLVDCAGDYDNHGCSGGLPSHA 207 Query: 681 WEYWKXFG 704 +EY K G Sbjct: 208 FEYIKDNG 215 >UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2; Brugia malayi|Rep: Cahepsin L-like cysteine protease - Brugia malayi (Filarial nematode worm) Length = 371 Score = 44.0 bits (99), Expect = 0.005 Identities = 24/64 (37%), Positives = 32/64 (50%), Gaps = 2/64 (3%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC--PICGLGCSGGMPXL 677 V+DQG CGSCW F AV A+ + + K S ++LL C GC GG+ Sbjct: 158 VKDQGYCGSCWTFSAVGALEGQ--HFLQTGKLVELSMQNLLDCSDDTYGNYGCDGGLMME 215 Query: 678 TWEY 689 +EY Sbjct: 216 AFEY 219 >UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2; Theileria|Rep: Cysteine protease, tacP, putative - Theileria annulata Length = 461 Score = 44.0 bits (99), Expect = 0.005 Identities = 30/84 (35%), Positives = 39/84 (46%), Gaps = 3/84 (3%) Frame = +3 Query: 489 
NVEXXVRDQG-SCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGG 665 +V V+DQG C SCWAF +V A+ + S + L++C C GCSGG Sbjct: 246 DVVTKVKDQGLDCSSCWAFASVAAVESIFQLLQD--VDLDLSEQHLINCETRCS-GCSGG 302 Query: 666 MPXLTWEYWKXFGLVSGG--SYHS 731 L +Y K GL YHS Sbjct: 303 YADLALDYVKNKGLPKSSVVPYHS 326 >UniRef50_Q23H10 Cluster: Papain family cysteine protease containing protein; n=14; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 336 Score = 44.0 bits (99), Expect = 0.005 Identities = 25/78 (32%), Positives = 35/78 (44%), Gaps = 4/78 (5%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC-CPICG---LGCSGGMP 671 V++QG CGSCW+F A M + FS + L+ C P G GC+GG P Sbjct: 142 VKNQGGCGSCWSFSAAAVMES--FNFIQNKALVDFSEQQLVDCVIPANGYNSYGCNGGWP 199 Query: 672 XLTWEYWKXFGLVSGGSY 725 +Y G+ + Y Sbjct: 200 VQCLDYASKVGITTLDKY 217 >UniRef50_Q23H06 Cluster: Papain family cysteine protease containing protein; n=18; Tetrahymena thermophila|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 349 Score = 44.0 bits (99), Expect = 0.005 Identities = 26/79 (32%), Positives = 36/79 (45%), Gaps = 4/79 (5%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC-CPICGL---GCSGGMP 671 V+ QG+CG+CWAF A M + FS + LL C P G GC GG P Sbjct: 156 VKWQGNCGACWAFSATGVMES--FNFIQNKALVEFSEQQLLDCVIPANGYPSSGCHGGWP 213 Query: 672 XLTWEYWKXFGLVSGGSYH 728 +Y G+++ Y+ Sbjct: 214 VQCIDYASKVGILNQDRYY 232 >UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=17; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 318 Score = 44.0 bits (99), Expect = 0.005 Identities = 19/62 (30%), Positives = 31/62 (50%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 ++DQ CGSCWAF V+A + + + ++++ C C GC GG L + Sbjct: 115 IKDQAQCGSCWAFSVVQAQESQWALKKG--QLLSLAEQNMVDCVDTC-YGCDGGDEYLAY 171 Query: 
684 EY 689 +Y Sbjct: 172 DY 173 >UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: Cathepsin L - Kudoa thyrsites Length = 300 Score = 44.0 bits (99), Expect = 0.005 Identities = 21/62 (33%), Positives = 34/62 (54%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V++QG CGSCW+F A A+ + + +FS + L+ C GC+GG+P + + Sbjct: 117 VKNQGHCGSCWSFSAAGAIESAYAIKTG--ELVNFSEQQLVDCSTE-NHGCNGGLPEIAF 173 Query: 684 EY 689 Y Sbjct: 174 LY 175 >UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, whole genome shotgun sequence; n=3; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_2, whole genome shotgun sequence - Paramecium tetraurelia Length = 376 Score = 44.0 bits (99), Expect = 0.005 Identities = 23/74 (31%), Positives = 33/74 (44%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V+ QG CGSCWAF + + R+ +N K S L+ C GC GG + Sbjct: 178 VQQQGRCGSCWAFAVQDVVISRL-AIANKNKLDQLSKTHLIDCADGNTEGCDGGSVSDAF 236 Query: 684 EYWKXFGLVSGGSY 725 ++ +G V Y Sbjct: 237 DFINKYGTVYEKDY 250 >UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; Leishmania|Rep: Cysteine proteinase 2 precursor - Leishmania pifanoi Length = 444 Score = 44.0 bits (99), Expect = 0.005 Identities = 23/55 (41%), Positives = 32/55 (58%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGM 668 V+DQG+CGSCWAF AV + + Y G + S + L+SC + GC GG+ Sbjct: 141 VKDQGACGSCWAFSAVGNIEGQ--WYLAGHELVSLSEQQLVSCDDM-NDGCDGGL 192 >UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thaliana|Rep: F9P14.12 protein - Arabidopsis thaliana (Mouse-ear cress) Length = 343 Score = 43.6 bits (98), Expect = 0.007 Identities = 24/68 (35%), Positives = 34/68 (50%), Gaps = 1/68 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC-CPICGLGCSGGMPXLT 680 +R+QG CG CWAF AV A+ + + G S + L+ C GCSGG+ Sbjct: 142 IRNQGKCGGCWAFSAVAAI-EGINKIKTGNL-VSLSEQQLIDCDVGTYNKGCSGGLMETA 199 Query: 681 WEYWKXFG 704 +E+ K G Sbjct: 
200 FEFIKTNG 207 >UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 2 - Rhipicephalus appendiculatus (Brown ear tick) Length = 564 Score = 43.6 bits (98), Expect = 0.007 Identities = 26/80 (32%), Positives = 35/80 (43%), Gaps = 1/80 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGGMPXLT 680 V+DQ CGSCW+FG V + + + S + L+ C G GC GG Sbjct: 360 VKDQAVCGSCWSFGTVGELEG--AYFRKTGRLVRLSEQQLVDCSWNNGNNGCDGGEDFRA 417 Query: 681 WEYWKXFGLVSGGSYHSSXG 740 +EY GL S Y + G Sbjct: 418 YEYIADHGLASDEDYGAYIG 437 >UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 326 Score = 43.6 bits (98), Expect = 0.007 Identities = 26/75 (34%), Positives = 35/75 (46%), Gaps = 1/75 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V+DQGSCGSCW+F T + K S ++L+ C GCSGG Sbjct: 125 VKDQGSCGSCWSFSTTG--TVEGAYFLKTGKLVSLSEQNLVDCAKEDCYGCSGGYMDKAL 182 Query: 684 EYWKXF-GLVSGGSY 725 EY + G++S Y Sbjct: 183 EYIETAGGIMSENDY 197 >UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=176; Viridiplantae|Rep: Cysteine proteinase RD21a precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 462 Score = 43.6 bits (98), Expect = 0.007 Identities = 21/62 (33%), Positives = 33/62 (53%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V+DQG CGSCWAF + A+ + + G S ++L+ C GC+GG+ + Sbjct: 152 VKDQGGCGSCWAFSTIGAV-EGINQIVTGDL-ITLSEQELVDCDTSYNEGCNGGLMDYAF 209 Query: 684 EY 689 E+ Sbjct: 210 EF 211 >UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|Rep: Cruzipain precursor - Trypanosoma cruzi Length = 467 Score = 43.6 bits (98), Expect = 0.007 Identities = 22/62 (35%), Positives = 31/62 (50%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V+DQG CGSCWAF 
A+ + C + +E +L C GCSGG+ + Sbjct: 138 VKDQGQCGSCWAFSAIGNVE---CQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAF 194 Query: 684 EY 689 E+ Sbjct: 195 EW 196 >UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 protein, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to MGC81823 protein, partial - Ornithorhynchus anatinus Length = 361 Score = 43.2 bits (97), Expect = 0.009 Identities = 30/97 (30%), Positives = 48/97 (49%), Gaps = 3/97 (3%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGGMPXLT 680 V+DQG CGSCWAFG+ + ++ + + S ++L+ C G GC GG+ + Sbjct: 205 VKDQGRCGSCWAFGSTGVLEGQL--FRRTGRLAAVSEQNLMDCSRKQGNRGCDGGLMQQS 262 Query: 681 WEYWKXFGLVSGGSYHSSXGCRPYE--IPPX*TSRTR 785 + Y + G V S PY+ +PP ++ TR Sbjct: 263 FLYVRDNGGV------DSEEAYPYDAKVPPPPSTSTR 293 >UniRef50_UPI00006CB77B Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 382 Score = 43.2 bits (97), Expect = 0.009 Identities = 27/86 (31%), Positives = 41/86 (47%), Gaps = 6/86 (6%) Frame = +3 Query: 504 VRDQGS-CGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLT 680 + +QG C + ++ AV ++ DR+C S G +F SA+ +SC C GG T Sbjct: 142 IANQGKDCSASYSIAAVSSVADRLCMASEGDFNFGLSAQPTISCYENQSYKCEGGYVSKT 201 Query: 681 WEYWKXFGLVSGG--SYH---SSXGC 743 ++ K G V YH S+ GC Sbjct: 202 FQKGKTTGFVKEECLPYHGTDSNEGC 227 >UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cathepsin L; n=4; Danio rerio|Rep: Novel protein similar to vertebrate cathepsin L - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 334 Score = 43.2 bits (97), Expect = 0.009 Identities = 26/84 (30%), Positives = 41/84 (48%), Gaps = 2/84 (2%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGGMPXLT 680 V+DQG CGSCW+F A+ ++ Y + + S + L+ C G GCSG Sbjct: 133 VKDQGYCGSCWSFSTTGAIEGQM--YKHTGRLVSLSEQQLVDCSRSYGTYGCSGAWMANA 190 Query: 681 WEYWKXFGLVSGGSY-HSSXGCRP 749 ++Y L S +Y 
++S +P Sbjct: 191 YDYVINNALESSDTYPYTSVDTQP 214 >UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis thaliana|Rep: Cysteine proteinase - Arabidopsis thaliana (Mouse-ear cress) Length = 348 Score = 43.2 bits (97), Expect = 0.009 Identities = 26/75 (34%), Positives = 38/75 (50%), Gaps = 1/75 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V+ QG CG CWAF AV A+ + + + G + S + LL C GC GG+ + Sbjct: 143 VKYQGRCGGCWAFSAVAAV-EGITKITKG-ELVSLSEQQLLDCDRDYNQGCRGGIMSKAF 200 Query: 684 EY-WKXFGLVSGGSY 725 EY K G+ + +Y Sbjct: 201 EYIIKNQGITTEDNY 215 >UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; Phytophthora infestans|Rep: Cathepsin-like cysteine protease - Phytophthora infestans (Potato late blight fungus) Length = 376 Score = 43.2 bits (97), Expect = 0.009 Identities = 23/55 (41%), Positives = 30/55 (54%), Gaps = 1/55 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLG-CSGG 665 V++QG CGSCWAF AV AM C Y+ T +E L C + G+ C+ G Sbjct: 148 VKNQGQCGSCWAFSAVAAME---CAYALSTGTLESLSEQELVDCTLNGIDTCNHG 199 >UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schistosoma japonicum|Rep: Clone ZZD209 mRNA sequence - Schistosoma japonicum (Blood fluke) Length = 339 Score = 43.2 bits (97), Expect = 0.009 Identities = 27/80 (33%), Positives = 39/80 (48%), Gaps = 1/80 (1%) Frame = +3 Query: 489 NVEXXVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGG 665 N+ RDQGSC +AF AV A T+ + + H + S + + C I G +GC GG Sbjct: 131 NIVNEPRDQGSCIGSYAF-AVTASTESQYAL-HTSNHMNLSVQQFIDCTRIYGNMGCHGG 188 Query: 666 MPXLTWEYWKXFGLVSGGSY 725 + Y + FGL + Y Sbjct: 189 YTFTLFIYLQSFGLETEQMY 208 >UniRef50_Q23EG5 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 429 Score = 43.2 bits (97), Expect = 0.009 Identities = 20/57 (35%), Positives = 27/57 (47%), Gaps = 1/57 (1%) Frame = +3 Query: 522 
CGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPXLTWEY 689 CGSCW F A A+ + G F+ S + L+ C GC GG+P +EY Sbjct: 147 CGSCWTFSATGAIESHL-ALKTGKAPFNLSQQQLVDCAGKFDNQGCDGGLPSRAFEY 202 >UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precursor; n=20; Psoroptidia|Rep: Major mite fecal allergen Der f 1 precursor - Dermatophagoides farinae (House-dust mite) Length = 321 Score = 43.2 bits (97), Expect = 0.009 Identities = 26/74 (35%), Positives = 33/74 (44%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +R QG CGSCWAF V A Y N + S ++L+ C GC G Sbjct: 124 IRMQGGCGSCWAFSGVAATESAYLAYRNTS--LDLSEQELVDCA--SQHGCHGDTIPRGI 179 Query: 684 EYWKXFGLVSGGSY 725 EY + G+V SY Sbjct: 180 EYIQQNGVVEERSY 193 >UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35; Viridiplantae|Rep: Cysteine proteinase 15A precursor - Pisum sativum (Garden pea) Length = 363 Score = 43.2 bits (97), Expect = 0.009 Identities = 25/70 (35%), Positives = 33/70 (47%), Gaps = 8/70 (11%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPI--------CGLGCS 659 V+DQGSCGSCWAF A+ Y K S + L+ C + C GC+ Sbjct: 147 VKDQGSCGSCWAFSTTGALEG--AHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCN 204 Query: 660 GGMPXLTWEY 689 GG+ +EY Sbjct: 205 GGLMNNAFEY 214 >UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata|Rep: Cathepsin L2 precursor - Homo sapiens (Human) Length = 334 Score = 43.2 bits (97), Expect = 0.009 Identities = 27/76 (35%), Positives = 39/76 (51%), Gaps = 2/76 (2%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPXLT 680 V++Q CGSCWAF A A+ ++ + K S ++L+ C P GC+GG Sbjct: 129 VKNQKQCGSCWAFSATGALEGQM--FRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARA 186 Query: 681 WEYWK-XFGLVSGGSY 725 ++Y K GL S SY Sbjct: 187 FQYVKENGGLDSEESY 202 >UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin heavy chain; n=3; Amniota|Rep: PREDICTED: similar to ferritin heavy chain - Ornithorhynchus anatinus Length = 338 Score = 42.7 bits (96), Expect = 
0.011 Identities = 24/68 (35%), Positives = 34/68 (50%), Gaps = 1/68 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGGMPXLT 680 V++QG CGSCWAF A A+ + K S ++L+ C G +GC GG Sbjct: 135 VKNQGLCGSCWAFSATGAL--EALVFKTTGKMVSLSEQNLVDCSWRQGNVGCRGGQYIGA 192 Query: 681 WEYWKXFG 704 +EY + G Sbjct: 193 FEYVRANG 200 >UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF6860, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 251 Score = 42.7 bits (96), Expect = 0.011 Identities = 24/71 (33%), Positives = 34/71 (47%), Gaps = 1/71 (1%) Frame = +3 Query: 516 GSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGGMPXLTWEYW 692 G CGSCWAF A+ ++ Y + S ++L+ C G GCSG ++Y Sbjct: 1 GYCGSCWAFSTTGAIEGQI--YKKTGQLVSLSEQNLVDCSKSYGTYGCSGAWMANAYDYV 58 Query: 693 KXFGLVSGGSY 725 GL S G+Y Sbjct: 59 VNNGLESTGTY 69 >UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 21 SCAF14577, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 406 Score = 42.7 bits (96), Expect = 0.011 Identities = 30/98 (30%), Positives = 45/98 (45%), Gaps = 1/98 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGGMPXLT 680 V++QG C SCWAF ++ A+ ++ + S ++LL C G LGC GG + Sbjct: 170 VQNQGFCNSCWAFSSLGALEGQMKKRTGFL--VPLSPQNLLDCSISDGNLGCRGGYISKS 227 Query: 681 WEYWKXFGLVSGGSYHSSXGCRPYEIPPX*TSRTRAPG 794 + Y G V S++ + P TS APG Sbjct: 228 YSYIIRNGGVDSDSFYPYEHQVSASLQPRLTSSAPAPG 265 >UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber officinale (Ginger) Length = 475 Score = 42.7 bits (96), Expect = 0.011 Identities = 25/79 (31%), Positives = 38/79 (48%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V++QG CGSCWAF A+ A+ + + G S + L+ C GC GG P + Sbjct: 158 
VKNQGRCGSCWAFAAIAAV-EGINQIVTGDL-ISLSEQQLVD-CSTRNYGCEGGWPYRAF 214 Query: 684 EYWKXFGLVSGGSYHSSXG 740 +Y G V+ ++ G Sbjct: 215 QYIINNGGVNSEEHYPYTG 233 >UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa (japonica cultivar-group)|Rep: Os09g0562700 protein - Oryza sativa subsp. japonica (Rice) Length = 235 Score = 42.7 bits (96), Expect = 0.011 Identities = 23/62 (37%), Positives = 33/62 (53%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V+DQG CGSCWAF V A+ + + G K S ++L+ C + GC GG+ Sbjct: 24 VKDQGRCGSCWAFSTV-AVVEGIQKIKKG-KLVSLSEQELVDCDTL-DSGCDGGVSYRAL 80 Query: 684 EY 689 E+ Sbjct: 81 EW 82 >UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Longidorus elongatus Length = 358 Score = 42.7 bits (96), Expect = 0.011 Identities = 24/64 (37%), Positives = 35/64 (54%), Gaps = 2/64 (3%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG--LGCSGGMPXL 677 V+DQGSCGSCWAF A ++ + Y K S ++L+ C + G GC+GG Sbjct: 154 VKDQGSCGSCWAFSATGSLEGQ--HYKQTGKLVSLSEQNLVD-CDVNGDDEGCNGGYMDG 210 Query: 678 TWEY 689 ++Y Sbjct: 211 AFQY 214 >UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra erinaceieuropaei|Rep: Cysteine proteinase - Spirometra erinaceieuropaei (Tapeworm) Length = 336 Score = 42.7 bits (96), Expect = 0.011 Identities = 22/75 (29%), Positives = 39/75 (52%), Gaps = 1/75 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGGMPXLT 680 V++QG CGSCW+F A A+ + + + S + L+ C G GC+GG+ Sbjct: 136 VKNQGQCGSCWSFSANGAIEGAIQIKTGALR--SLSEQQLMDCSWDYGNQGCNGGLMPQA 193 Query: 681 WEYWKXFGLVSGGSY 725 ++Y + +G+ + Y Sbjct: 194 FQYAQRYGVEAEVDY 208 >UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 462 Score = 42.7 bits (96), Expect = 0.011 Identities = 18/43 (41%), 
Positives = 28/43 (65%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 632 VRDQ +CGSCWA A EA++ ++ +S G +F S + ++ C Sbjct: 242 VRDQANCGSCWAQSAGEAISSQISLHSKG--NFTVSIQQIMDC 282 >UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar to human SRY (sex determining region Y)-box 30 (SOX30),transcript variant 1,; n=3; Mammalia|Rep: Testis cDNA clone: QtsA-12228, similar to human SRY (sex determining region Y)-box 30 (SOX30),transcript variant 1, - Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey) Length = 433 Score = 42.3 bits (95), Expect = 0.015 Identities = 27/76 (35%), Positives = 38/76 (50%), Gaps = 2/76 (2%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC-PICGLGCSGGMPXLT 680 V++Q CGSCWAF A A+ ++ + K S ++L+ C P GC+GG Sbjct: 129 VKNQKQCGSCWAFSATGALEGQM--FRKTGKLVSLSEQNLVDCSHPQGNQGCNGGFMNSA 186 Query: 681 WEYWK-XFGLVSGGSY 725 + Y K GL S SY Sbjct: 187 FRYVKENGGLDSEESY 202 >UniRef50_Q8I0V1 Cluster: Preprocathepsin c, putative; n=1; Plasmodium falciparum 3D7|Rep: Preprocathepsin c, putative - Plasmodium falciparum (isolate 3D7) Length = 504 Score = 42.3 bits (95), Expect = 0.015 Identities = 24/79 (30%), Positives = 36/79 (45%), Gaps = 5/79 (6%) Frame = +3 Query: 489 NVEXXVRDQGSCGSCWAFGAVEAMTDRV-----CTYSNGTKHFHFSAEDLLSCCPICGLG 653 N E V DQ CGSC++ +V ++ R Y S + +LSC P G Sbjct: 222 NFEENVDDQKDCGSCYSISSVYSLERRFEILFWKKYKKKVNMPRLSHQSILSCSPY-NQG 280 Query: 654 CSGGMPXLTWEYWKXFGLV 710 C GG P L ++ +G++ Sbjct: 281 CDGGYPFLVGKHMYEYGII 299 >UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n=21; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Globodera pallida Length = 379 Score = 42.3 bits (95), Expect = 0.015 Identities = 21/65 (32%), Positives = 36/65 (55%), Gaps = 1/65 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGGMPXLT 680 V++QG CGSCWAF + A+ + + + S ++L+ C G +GC+GG+ Sbjct: 176 VKNQGMCGSCWAFSSTGALEAQHARQTG--QLISLSEQNLIDCSKKYGNMGCNGGIMDNA 233 Query: 681 WEYWK 695 ++Y K 
Sbjct: 234 FQYIK 238 >UniRef50_Q23VA1 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 397 Score = 42.3 bits (95), Expect = 0.015 Identities = 26/80 (32%), Positives = 35/80 (43%), Gaps = 6/80 (7%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC------PICGLGCSGG 665 V+DQG CG CWAF A A+ + V N T +S ++L+ C LGC GG Sbjct: 195 VKDQGRCGCCWAFSAT-ALAESVNLMRNNTLQ-QYSEQELVDCTNNQYQEDYSSLGCGGG 252 Query: 666 MPXLTWEYWKXFGLVSGGSY 725 Y + G+ Y Sbjct: 253 WAYNALVYMQRKGIFLESQY 272 >UniRef50_Q22W19 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 42.3 bits (95), Expect = 0.015 Identities = 23/79 (29%), Positives = 35/79 (44%) Frame = +3 Query: 489 NVEXXVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGM 668 N V++QG CGSCWAF V + Y+ T + +E + C GC+GG Sbjct: 133 NAVTPVKNQGQCGSCWAFSTVGGLEG---AYAIATGNLTSFSEQQIVDCSKANAGCNGGD 189 Query: 669 PXLTWEYWKXFGLVSGGSY 725 ++Y G+ + Y Sbjct: 190 LPPAYKYVVQNGIETEADY 208 >UniRef50_A0CGA3 Cluster: Chromosome undetermined scaffold_179, whole genome shotgun sequence; n=3; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_179, whole genome shotgun sequence - Paramecium tetraurelia Length = 339 Score = 42.3 bits (95), Expect = 0.015 Identities = 25/70 (35%), Positives = 37/70 (52%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V +QG+C S ++ + +DRVC N T+ SA++LLSC LGC GG + Sbjct: 142 VYNQGNCSSSYSIAVSSSFSDRVCK-QNQTQQL--SAQNLLSCDGKLNLGCKGGHLTKSA 198 Query: 684 EYWKXFGLVS 713 +Y GL + Sbjct: 199 DYIIKHGLTT 208 >UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens (Human) Length = 331 Score = 42.3 bits (95), Expect = 0.015 Identities = 23/64 
(35%), Positives = 35/64 (54%), Gaps = 2/64 (3%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC--PICGLGCSGGMPXL 677 V+ QGSCG+CWAF AV A+ ++ + K SA++L+ C GC+GG Sbjct: 130 VKYQGSCGACWAFSAVGALEAQLKLKTG--KLVSLSAQNLVDCSTEKYGNKGCNGGFMTT 187 Query: 678 TWEY 689 ++Y Sbjct: 188 AFQY 191 >UniRef50_UPI000155D183 Cluster: PREDICTED: similar to Cathepsin Z; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to Cathepsin Z - Ornithorhynchus anatinus Length = 294 Score = 41.9 bits (94), Expect = 0.020 Identities = 21/62 (33%), Positives = 27/62 (43%) Frame = +3 Query: 522 CGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTWEYWKXF 701 CGSCWA G+ A+ DR+ G F + + C G C GG WEY Sbjct: 170 CGSCWAHGSTSALADRINIKRKGAWPSAFLSVQHVIDCGNAG-SCEGGDDMAVWEYAHQH 228 Query: 702 GL 707 G+ Sbjct: 229 GI 230 >UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 344 Score = 41.9 bits (94), Expect = 0.020 Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 3/57 (5%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGL---GCSGG 665 V+ QG CGSCW F A A+ + NG +FS + +L C G GC+GG Sbjct: 150 VKQQGKCGSCWTF-ASTAVLESFSFIKNGAPLTNFSEQQILDCVYGSGYYSNGCNGG 205 >UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia theta|Rep: Cathepsin H precursor - Guillardia theta (Cryptomonas phi) Length = 353 Score = 41.9 bits (94), Expect = 0.020 Identities = 21/63 (33%), Positives = 33/63 (52%), Gaps = 1/63 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC-CPICGLGCSGGMPXLT 680 V++QG+CGSCW F A+ + + G + S + L+ C GC+GG+P Sbjct: 138 VKNQGTCGSCWTFSTAAAL-ESLHAIKTG-EMVLLSEQQLVDCAADFKNNGCNGGLPSQA 195 Query: 681 WEY 689 +EY Sbjct: 196 FEY 198 >UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophyta|Rep: Cysteine protease Cp5 - Actinidia deliciosa (Kiwi) Length = 509 Score = 41.9 bits (94), 
Expect = 0.020 Identities = 22/62 (35%), Positives = 33/62 (53%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V+DQG CGSCWAF + A+ + + +NG S ++L+ C GC GG + Sbjct: 162 VKDQGDCGSCWAFSSTGAI-EGINALANGDL-ISLSEQELVD-CDSTNDGCEGGYMDYAF 218 Query: 684 EY 689 E+ Sbjct: 219 EW 220 >UniRef50_Q7QVE9 Cluster: GLP_542_3431_1206; n=1; Giardia lamblia ATCC 50803|Rep: GLP_542_3431_1206 - Giardia lamblia ATCC 50803 Length = 741 Score = 41.9 bits (94), Expect = 0.020 Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 5/72 (6%) Frame = +3 Query: 510 DQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC-----CPICGLGCSGGMPX 674 +QGSCG C+A AVE +T R C N ++ S EDL++C I GC GG Sbjct: 76 NQGSCGCCYAAAAVEMVTARRCLQLNDSR--LVSLEDLVTCDHTKYLNIQNNGCRGGNSL 133 Query: 675 LTWEYWKXFGLV 710 + ++ + G+V Sbjct: 134 ASLKFGETTGMV 145 >UniRef50_Q235G6 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 325 Score = 41.9 bits (94), Expect = 0.020 Identities = 22/74 (29%), Positives = 32/74 (43%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V++QG CG CW+F + Y N + S + L+ C GC GG+ + Sbjct: 132 VKNQGGCGGCWSFATTGGVEGANFVYKNVLP--NLSQQQLID-CNTQNKGCGGGLRDIAL 188 Query: 684 EYWKXFGLVSGGSY 725 Y K GL + Y Sbjct: 189 NYVKETGLTTEEEY 202 >UniRef50_A7APS9 Cluster: Papain family cysteine protease containing protein; n=1; Babesia bovis|Rep: Papain family cysteine protease containing protein - Babesia bovis Length = 435 Score = 41.9 bits (94), Expect = 0.020 Identities = 24/79 (30%), Positives = 36/79 (45%) Frame = +3 Query: 489 NVEXXVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGM 668 N V+DQG+CGSCWAF + + + + S ++L+ C C GC G Sbjct: 236 NYMTPVKDQGNCGSCWAFSLI-GVAEPFFKHKRDI-DVVLSEQNLVDCVKECH-GCDYGN 292 Query: 669 PXLTWEYWKXFGLVSGGSY 725 +EY + G+ SY Sbjct: 293 SYFAYEYIRDHGVYRLASY 311 >UniRef50_P25805 Cluster: 
Trophozoite cysteine proteinase precursor; n=3; Plasmodium (Laverania)|Rep: Trophozoite cysteine proteinase precursor - Plasmodium falciparum Length = 569 Score = 41.9 bits (94), Expect = 0.020 Identities = 25/73 (34%), Positives = 34/73 (46%) Frame = +3 Query: 507 RDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTWE 686 +DQG CGSCWAF +V + V N FS ++++ C GC GG P ++ Sbjct: 349 KDQGLCGSCWAFASV-GNIESVFAKKN-KNILSFSEQEVVDCSK-DNFGCDGGHPFYSFL 405 Query: 687 YWKXFGLVSGGSY 725 Y L G Y Sbjct: 406 YVLQNELCLGDEY 418 >UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain] - Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina) Length = 339 Score = 41.9 bits (94), Expect = 0.020 Identities = 23/68 (33%), Positives = 34/68 (50%), Gaps = 1/68 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGGMPXLT 680 V+DQG CGSCWAF + A+ + + S ++L+ C G GC+GG+ Sbjct: 137 VKDQGHCGSCWAFSSTGALEGQ--HFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNA 194 Query: 681 WEYWKXFG 704 + Y K G Sbjct: 195 FRYIKDNG 202 >UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum|Rep: Falcipain 2 - Plasmodium falciparum Length = 484 Score = 41.5 bits (93), Expect = 0.026 Identities = 19/55 (34%), Positives = 31/55 (56%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGM 668 V+DQ +CGSCWAF ++ ++ + N K S ++L+ C GC+GG+ Sbjct: 276 VKDQKNCGSCWAFSSIGSVESQYAIRKN--KLITLSEQELVD-CSFKNYGCNGGL 327 >UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3; Bilateria|Rep: Cathepsin L-like cysteine protease - Neobenedenia melleni Length = 335 Score = 41.5 bits (93), Expect = 0.026 Identities = 26/76 (34%), Positives = 38/76 (50%), Gaps = 2/76 (2%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGGMPXLT 680 V++Q CGSCWAF + ++ V + K FS + L+ C G GC+GG+ + 
Sbjct: 133 VKNQAQCGSCWAFSSTGSIEGAVKRATG--KLISFSEQQLVDCSTAFGNHGCNGGIMDNS 190 Query: 681 WEYW-KXFGLVSGGSY 725 + Y GL S SY Sbjct: 191 FNYLIHNKGLESEASY 206 >UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_26, whole genome shotgun sequence - Paramecium tetraurelia Length = 358 Score = 41.5 bits (93), Expect = 0.026 Identities = 23/75 (30%), Positives = 34/75 (45%), Gaps = 2/75 (2%) Frame = +3 Query: 507 RDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHF-HFSAEDLLSCC-PICGLGCSGGMPXLT 680 R QG+CGSCWAF + + R+ G + S L+ CC GC+GG P Sbjct: 159 RPQGTCGSCWAFSSSDVAISRLAL--KGKEDLTQLSKTHLIDCCVGDKNKGCNGGSPIGA 216 Query: 681 WEYWKXFGLVSGGSY 725 +++ G + Y Sbjct: 217 YKFINENGALKENEY 231 >UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=15; Magnoliophyta|Rep: Cysteine proteinase RD19a precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 368 Score = 41.5 bits (93), Expect = 0.026 Identities = 27/83 (32%), Positives = 38/83 (45%), Gaps = 9/83 (10%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC--------CPICGLGCS 659 V++QGSCGSCW+F A A+ + K S + L+ C C GC+ Sbjct: 150 VKNQGSCGSCWSFSATGALEG--ANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCN 207 Query: 660 GGMPXLTWEY-WKXFGLVSGGSY 725 GG+ +EY K GL+ Y Sbjct: 208 GGLMNSAFEYTLKTGGLMKEEDY 230 >UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryota|Rep: Cysteine proteinase GP-I - Zingiber officinale (Ginger) Length = 221 Score = 41.5 bits (93), Expect = 0.026 Identities = 22/62 (35%), Positives = 32/62 (51%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V++QG CGSCWAF A+ A+ + + G S + L+ C GC GG P + Sbjct: 18 VKNQGGCGSCWAFDAIAAV-EGINQIVTGDL-ISLSEQQLVD-CSTRNHGCEGGWPYRAF 74 Query: 684 EY 689 +Y Sbjct: 75 QY 76 >UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease Gip1p; n=4; Tetrahymena thermophila|Rep: Granule-biosynthesis induced protease Gip1p - Tetrahymena thermophila 
Length = 345 Score = 41.1 bits (92), Expect = 0.035 Identities = 24/77 (31%), Positives = 35/77 (45%), Gaps = 3/77 (3%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGL---GCSGGMPX 674 V++QG+CGSCW F A + + N + FS + L+ C + G GC GG Sbjct: 148 VKNQGTCGSCWTF-ATAGILESFNQIKN-KQLLKFSEQQLVDCVSLAGYDSDGCDGGFQE 205 Query: 675 LTWEYWKXFGLVSGGSY 725 Y +G+V Y Sbjct: 206 DGVRYAIEYGIVQSYKY 222 >UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathepsin o - Aedes aegypti (Yellowfever mosquito) Length = 375 Score = 41.1 bits (92), Expect = 0.035 Identities = 17/54 (31%), Positives = 25/54 (46%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGG 665 VR QGSCG+CWA V+ +T + + +++C GC GG Sbjct: 168 VRSQGSCGACWAISVVDTITS-ISAIKRQQNFSELCLDQVINCAGNGNFGCEGG 220 >UniRef50_A2E6N1 Cluster: Clan CA, family C1, cathepsin B-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin B-like cysteine peptidase - Trichomonas vaginalis G3 Length = 255 Score = 41.1 bits (92), Expect = 0.035 Identities = 19/62 (30%), Positives = 34/62 (54%) Frame = +3 Query: 522 CGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTWEYWKXF 701 CG C+A+G ++AM+ R+C N K SA+ +++ C + GC GG + + + Sbjct: 53 CGCCYAYGPIKAMSHRICKAKN--KKTFLSAQFIVA-CDLLESGCEGGCSRSVYYFLEQH 109 Query: 702 GL 707 G+ Sbjct: 110 GV 111 >UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrovirus|Rep: Viral cathepsin - Lymantria dispar multicapsid nuclear polyhedrosis virus (LdMNPV) Length = 356 Score = 41.1 bits (92), Expect = 0.035 Identities = 17/61 (27%), Positives = 34/61 (55%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +++QG+CG+CWAF + ++ + N + S + L+ C + +GC+GG+ + Sbjct: 159 IKNQGACGACWAFATLASVESQFAMRHN--RLIDLSEQQLIDCDSV-DMGCNGGLLHTAF 215 Query: 684 E 686 E Sbjct: 216 E 216 >UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|Rep: Cathepsin F precursor - Homo sapiens (Human) Length = 484 Score = 41.1 
bits (92), Expect = 0.035 Identities = 24/67 (35%), Positives = 33/67 (49%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 V+DQG CGSCWAF +V + + GT S ++LL C + C GG+P + Sbjct: 286 VKDQGMCGSCWAF-SVTGNVEGQWFLNQGTL-LSLSEQELLDCDKM-DKACMGGLPSNAY 342 Query: 684 EYWKXFG 704 K G Sbjct: 343 SAIKNLG 349 >UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2; Entamoeba|Rep: Cysteine proteinase ACP1 precursor - Entamoeba histolytica Length = 308 Score = 41.1 bits (92), Expect = 0.035 Identities = 21/55 (38%), Positives = 27/55 (49%) Frame = +3 Query: 507 RDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMP 671 +DQG CGSCW F + RV + K + FS + L+ C GC GG P Sbjct: 107 KDQGQCGSCWTFCTTAVLEGRV--NKDLGKLYSFSEQQLVD-CDASDNGCEGGHP 158 >UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel protein - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 328 Score = 40.7 bits (91), Expect = 0.046 Identities = 22/55 (40%), Positives = 31/55 (56%), Gaps = 1/55 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGG 665 V++QG CGSCWAF AV ++ ++ + SA++LL C G GC GG Sbjct: 128 VQNQGPCGSCWAFSAVGSLEAQMKRRTAAL--VPLSAQNLLDCSVSLGNRGCKGG 180 >UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: Cysteine protease - Saprolegnia parasitica Length = 523 Score = 40.7 bits (91), Expect = 0.046 Identities = 18/55 (32%), Positives = 30/55 (54%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGM 668 V++QG CGSCWAF A+ + + + S ++L+ C +GC+GG+ Sbjct: 131 VKNQGMCGSCWAFSTTGAIEG--AAFVSSKQLVSVSEQELVDCDHNGDMGCNGGL 183 >UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba culbertsoni|Rep: Cysteine proteinase - Acanthamoeba culbertsoni Length = 482 Score = 40.7 bits (91), Expect = 0.046 Identities = 25/63 (39%), Positives = 35/63 (55%), Gaps = 1/63 (1%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICG-LGCSGGMPXLT 680 V++QGSC SCWAF A A+ + V + G+ S + LL C G GCSGG +T 
Sbjct: 171 VKNQGSCASCWAFVATGAV-EGVRKIAGGSL-VSLSDQMLLDCAVGTGNQGCSGGNVEIT 228 Query: 681 WEY 689 + + Sbjct: 229 YRW 231 >UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaster|Rep: CG11459-PA - Drosophila melanogaster (Fruit fly) Length = 336 Score = 40.7 bits (91), Expect = 0.046 Identities = 25/76 (32%), Positives = 35/76 (46%), Gaps = 2/76 (2%) Frame = +3 Query: 504 VRDQGS-CGSCWAFGAVEAMTDRVCT-YSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXL 677 V DQG+ C SCWAF + + Y N S + L+ C P GCSGG + Sbjct: 133 VGDQGTECLSCWAFSTSGVLEAHMAKKYGNLVP---LSPKHLVDCVPYPNNGCSGGWVSV 189 Query: 678 TWEYWKXFGLVSGGSY 725 + Y + G+ + SY Sbjct: 190 AFNYTRDHGIATKESY 205 >UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Plasmodium|Rep: Cysteine protease falcipain-3 - Plasmodium falciparum Length = 492 Score = 40.7 bits (91), Expect = 0.046 Identities = 20/54 (37%), Positives = 29/54 (53%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGG 665 V+DQ CGSCWAF +V ++ + F FS ++L+ C + GC GG Sbjct: 284 VKDQALCGSCWAFSSVGSVESQYAIRKKAL--FLFSEQELVD-CSVKNNGCYGG 334 >UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 4 - Rhipicephalus appendiculatus (Brown ear tick) Length = 345 Score = 40.7 bits (91), Expect = 0.046 Identities = 21/69 (30%), Positives = 35/69 (50%), Gaps = 2/69 (2%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC--PICGLGCSGGMPXL 677 V++QG CGSCWAF + A+ +V + + S ++L+ C GC+GG Sbjct: 141 VKNQGQCGSCWAFSSTGALEGQV--FKRTRRLISLSEQNLMDCAGQRYGNNGCNGGQMPG 198 Query: 678 TWEYWKXFG 704 ++Y + G Sbjct: 199 AFQYVQDAG 207 >UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2; Taeniidae|Rep: Cathepsin L-like cysteine proteinase - Taenia solium (Pork tapeworm) Length = 339 Score = 40.7 bits (91), Expect = 0.046 Identities = 22/68 (32%), Positives = 34/68 (50%), Gaps = 1/68 (1%) Frame = +3 Query: 489 NVEXXVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGL-GCSGG 665 N+ V++QG+CGSCWAF 
+ A+ + K S + L+ C G GC+GG Sbjct: 134 NLVTEVKNQGNCGSCWAFSSTGALEGAFAKKTG--KLISLSEQQLVDCSLKNGNDGCNGG 191 Query: 666 MPXLTWEY 689 ++Y Sbjct: 192 YMSYAFKY 199 >UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 383 Score = 40.7 bits (91), Expect = 0.046 Identities = 25/74 (33%), Positives = 35/74 (47%) Frame = +3 Query: 504 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPXLTW 683 +++QG CGSCWAF V A + G K S ++++ C GCSGG Sbjct: 183 IKNQGQCGSCWAFATV-ASVEAQNAIKKG-KLVSLSEQEMVD-CDGRNNGCSGGYRPYAM 239 Query: 684 EYWKXFGLVSGGSY 725 ++ K GL S Y Sbjct: 240 KFVKENGLESEKEY 253 Database: uniref50 Posted date: Oct 5, 2007 11:19 AM Number of letters in database: 575,637,011 Number of sequences in database: 1,657,284 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.279 0.0580 0.190 Matrix: BLOSUM62 Gap Penalties: Existence: 9, Extension: 2 Number of Hits to DB: 809,000,594 Number of Sequences: 1657284 Number of extensions: 16095078 Number of successful extensions: 37352 Number of sequences better than 10.0: 409 Number of HSP's better than 10.0 without gapping: 35842 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 37161 length of database: 575,637,011 effective HSP length: 100 effective length of database: 409,908,611 effective search space used: 76243001646 frameshift window, decay const: 40, 0.1 T: 12 A: 40 X1: 16 ( 7.3 bits) X2: 37 (14.9 bits) X3: 62 (25.0 bits) S1: 41 (21.7 bits)