BLASTX 2.2.12 [Aug-07-2005]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= heS00028
(846 letters)
Database: uniref50
1,657,284 sequences; 575,637,011 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpw... 131 2e-29
UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Ca... 127 4e-28
UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin ... 118 2e-25
UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n... 111 2e-23
UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome sh... 108 2e-22
UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=... 107 3e-22
UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep: ... 105 2e-21
UniRef50_A1XG92 Cluster: Putative cathepsin B-like like proteina... 104 3e-21
UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|... 103 4e-21
UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 ... 103 4e-21
UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep: Ca... 100 5e-20
UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase pr... 100 5e-20
UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA... 99 2e-19
UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n... 97 4e-19
UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3 ... 97 4e-19
UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase pr... 97 5e-19
UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=... 96 9e-19
UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor; ... 96 9e-19
UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Catheps... 94 3e-18
UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gamb... 93 1e-17
UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7; Rhabditida|... 93 1e-17
UniRef50_Q23FP9 Cluster: Papain family cysteine protease contain... 92 1e-17
UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4 ... 92 1e-17
UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep... 92 2e-17
UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1; Rhip... 91 3e-17
UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep: Ca... 90 6e-17
UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7... 90 6e-17
UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115, w... 90 7e-17
UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8.... 89 2e-16
UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep... 88 2e-16
UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4; Ancylostomati... 88 3e-16
UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8... 87 4e-16
UniRef50_Q625X3 Cluster: Putative uncharacterized protein CBG011... 87 7e-16
UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2; ... 86 9e-16
UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8; Leishma... 86 1e-15
UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2 ... 85 2e-15
UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator ame... 84 4e-15
UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8.... 83 6e-15
UniRef50_Q5DEC9 Cluster: SJCHGC06356 protein; n=1; Schistosoma j... 83 9e-15
UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1 ... 83 9e-15
UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyl... 81 3e-14
UniRef50_Q00W63 Cluster: Cysteine proteinase; n=2; Ostreococcus|... 81 3e-14
UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma j... 81 5e-14
UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2... 80 6e-14
UniRef50_P91991 Cluster: Putative uncharacterized protein; n=2; ... 80 6e-14
UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin ... 79 1e-13
UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: C... 78 3e-13
UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|R... 77 4e-13
UniRef50_Q237A1 Cluster: Papain family cysteine protease contain... 77 6e-13
UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus co... 77 7e-13
UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep... 73 7e-12
UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoid... 72 2e-11
UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1; O... 72 2e-11
UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep... 71 4e-11
UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|R... 69 1e-10
UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella ve... 68 3e-10
UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomus... 65 2e-09
UniRef50_A4RYN2 Cluster: Predicted protein; n=1; Ostreococcus lu... 64 3e-09
UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1; ... 63 7e-09
UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|R... 62 1e-08
UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2; cel... 62 2e-08
UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Gi... 62 2e-08
UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Gi... 60 7e-08
UniRef50_UPI0000E4866E Cluster: PREDICTED: similar to oxidized-L... 59 2e-07
UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia... 59 2e-07
UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like prote... 59 2e-07
UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;... 58 2e-07
UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA,... 56 8e-07
UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1; ... 56 8e-07
UniRef50_O02470 Cluster: Cysteine proteinase; n=2; Chromadorea|R... 56 1e-06
UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcop... 54 6e-06
UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35... 54 6e-06
UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lambl... 53 1e-05
UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - ... 52 1e-05
UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin ... 52 2e-05
UniRef50_UPI0000E815AE Cluster: PREDICTED: similar to glucocorti... 52 2e-05
UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p; ... 51 3e-05
UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=... 51 3e-05
UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru... 51 4e-05
UniRef50_Q9GZM7-2 Cluster: Isoform 2 of Q9GZM7 ; n=1; Homo sapie... 50 1e-04
UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadida... 50 1e-04
UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-li... 50 1e-04
UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio... 49 1e-04
UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy... 49 1e-04
UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr... 48 2e-04
UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy... 48 2e-04
UniRef50_Q58HK5 Cluster: Hc58; n=1; Haemonchus contortus|Rep: Hc... 48 4e-04
UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr... 48 4e-04
UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh... 48 4e-04
UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ... 47 5e-04
UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestin... 47 5e-04
UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Try... 47 5e-04
UniRef50_Q6XM39 Cluster: FirrV-1-A48 precursor; n=1; Feldmannia ... 47 7e-04
UniRef50_P81494 Cluster: Cathepsin B; n=2; Phasianidae|Rep: Cath... 47 7e-04
UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ... 46 0.001
UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb... 46 0.001
UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease ... 46 0.002
UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re... 46 0.002
UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big... 46 0.002
UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: ... 46 0.002
UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory... 46 0.002
UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease ... 45 0.002
UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted... 45 0.002
UniRef50_Q23H32 Cluster: Papain family cysteine protease contain... 45 0.002
UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cy... 45 0.002
UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali... 45 0.002
UniRef50_Q9UJW2 Cluster: Tubulointerstitial nephritis antigen; n... 45 0.002
UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin ... 45 0.003
UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag... 45 0.003
UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ... 44 0.004
UniRef50_Q7QVE9 Cluster: GLP_542_3431_1206; n=1; Giardia lamblia... 44 0.004
UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ... 44 0.005
UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ... 44 0.005
UniRef50_Q23TW3 Cluster: Papain family cysteine protease contain... 44 0.005
UniRef50_Q23RT7 Cluster: Papain family cysteine protease contain... 44 0.005
UniRef50_A0CYS7 Cluster: Chromosome undetermined scaffold_31, wh... 44 0.005
UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ... 44 0.005
UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto... 44 0.005
UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt... 44 0.006
UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re... 44 0.006
UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa... 44 0.006
UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain... 44 0.006
UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ... 43 0.008
UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1... 43 0.008
UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; ... 43 0.008
UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n... 43 0.008
UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen... 43 0.008
UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ... 43 0.008
UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ... 43 0.011
UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n... 43 0.011
UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1... 43 0.011
UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n... 43 0.011
UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foet... 43 0.011
UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain... 43 0.011
UniRef50_Q231X3 Cluster: Papain family cysteine protease contain... 43 0.011
UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like cy... 43 0.011
UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, wh... 43 0.011
UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh... 43 0.011
UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, w... 43 0.011
UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ... 43 0.011
UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr... 43 0.011
UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;... 42 0.015
UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy... 42 0.015
UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi... 42 0.015
UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium... 42 0.015
UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n... 42 0.019
UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis... 42 0.019
UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel... 42 0.019
UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ... 42 0.026
UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precurs... 42 0.026
UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e... 42 0.026
UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ... 41 0.034
UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;... 41 0.034
UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;... 41 0.034
UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain... 41 0.034
UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-li... 41 0.034
UniRef50_A2E6N1 Cluster: Clan CA, family C1, cathepsin B-like cy... 41 0.034
UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ... 41 0.034
UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ... 41 0.034
UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; The... 41 0.034
UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; The... 41 0.034
UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Pl... 41 0.034
UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa... 41 0.045
UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; ... 41 0.045
UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba hi... 41 0.045
UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ... 41 0.045
UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=... 41 0.045
UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R... 41 0.045
UniRef50_UPI00006CB77B Cluster: Papain family cysteine protease ... 40 0.059
UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio... 40 0.059
UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3; Theil... 40 0.059
UniRef50_Q22W19 Cluster: Papain family cysteine protease contain... 40 0.059
UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:... 40 0.059
UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precu... 40 0.059
UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi... 40 0.059
UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG... 40 0.079
UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa... 40 0.079
UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae... 40 0.079
UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox... 40 0.079
UniRef50_O16454 Cluster: Temporarily assigned gene name protein ... 40 0.079
UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cy... 40 0.079
UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ... 40 0.10
UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine p... 40 0.10
UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep... 40 0.10
UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi... 40 0.10
UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n... 40 0.10
UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4; Caenorhabdi... 40 0.10
UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H... 40 0.10
UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like cy... 40 0.10
UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, wh... 40 0.10
UniRef50_Q2FLC7 Cluster: Periplasmic copper-binding precursor; n... 40 0.10
UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryo... 40 0.10
UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ... 39 0.14
UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ... 39 0.14
UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G... 39 0.14
UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lambl... 39 0.14
UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain... 39 0.14
UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain... 39 0.14
UniRef50_Q22DX2 Cluster: Papain family cysteine protease contain... 39 0.14
UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like cy... 39 0.14
UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R... 39 0.14
UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; M... 39 0.14
UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ... 39 0.18
UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 p... 39 0.18
UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca... 39 0.18
UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No... 39 0.18
UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|... 39 0.18
UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhip... 39 0.18
UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip... 39 0.18
UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; ... 39 0.18
UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus... 39 0.18
UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Re... 39 0.18
UniRef50_A5KBM2 Cluster: Serine-repeat antigen; n=3; Plasmodium|... 39 0.18
UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D... 39 0.18
UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18... 39 0.18
UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin ... 38 0.24
UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome s... 38 0.24
UniRef50_A6GBK8 Cluster: Peptidase C1A, papain; n=1; Plesiocysti... 38 0.24
UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa... 38 0.24
UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n... 38 0.24
UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ... 38 0.24
UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy... 38 0.24
UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ... 38 0.24
UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia... 38 0.24
UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j... 38 0.24
UniRef50_Q24E33 Cluster: Papain family cysteine protease contain... 38 0.24
UniRef50_Q23FS6 Cluster: Papain family cysteine protease contain... 38 0.24
UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain... 38 0.24
UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathep... 38 0.24
UniRef50_O96163 Cluster: Cysteine protease, putative; n=5; Plasm... 38 0.24
UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er... 38 0.24
UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Pl... 38 0.24
UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto... 38 0.24
UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|... 38 0.24
UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2... 38 0.24
UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ... 38 0.32
UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; ... 38 0.32
UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum... 38 0.32
UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein... 38 0.32
UniRef50_Q24HI2 Cluster: Papain family cysteine protease contain... 38 0.32
UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy... 38 0.32
UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, who... 38 0.32
UniRef50_A0CGA3 Cluster: Chromosome undetermined scaffold_179, w... 38 0.32
UniRef50_Q649T1 Cluster: Cathepsin C; n=1; uncultured archaeon G... 38 0.32
UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ... 38 0.42
UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat... 38 0.42
UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; R... 38 0.42
UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestiv... 38 0.42
UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph... 38 0.42
UniRef50_Q0JP65 Cluster: Os01g0240900 protein; n=3; Oryza sativa... 38 0.42
UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Pir... 38 0.42
UniRef50_Q26015 Cluster: Serine rich protein homologue; n=4; Pla... 38 0.42
UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, wh... 38 0.42
UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; ... 38 0.42
UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ... 38 0.42
UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov... 38 0.42
UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:... 38 0.42
UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ... 38 0.42
UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h... 37 0.55
UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica... 37 0.55
UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil... 37 0.55
UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lambli... 37 0.55
UniRef50_Q7QSU1 Cluster: GLP_127_20145_14275; n=1; Giardia lambl... 37 0.55
UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcop... 37 0.55
UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain... 37 0.55
UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain ... 37 0.55
UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v... 37 0.55
UniRef50_A0D5R4 Cluster: Chromosome undetermined scaffold_39, wh... 37 0.55
UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, wh... 37 0.55
UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; ... 37 0.55
UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M... 37 0.55
UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ... 37 0.73
UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ... 37 0.73
UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ... 37 0.73
UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3; ... 37 0.73
UniRef50_Q9XZI2 Cluster: Cathepsin Z1 preproprotein; n=3; Bilate... 37 0.73
UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Pl... 37 0.73
UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcopt... 37 0.73
UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis... 37 0.73
UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain... 37 0.73
UniRef50_Q23H15 Cluster: Papain family cysteine protease contain... 37 0.73
UniRef50_Q23H06 Cluster: Papain family cysteine protease contain... 37 0.73
UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1; ... 37 0.73
UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w... 37 0.73
UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|... 37 0.73
UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C... 37 0.73
UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata... 37 0.73
UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3.... 37 0.73
UniRef50_A0LES1 Cluster: Peptidase C1A, papain precursor; n=2; S... 36 0.97
UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal... 36 0.97
UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster... 36 0.97
UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-... 36 0.97
UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca... 36 0.97
UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei... 36 0.97
UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intest... 36 0.97
UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi... 36 0.97
UniRef50_Q4YCM9 Cluster: Cysteine protease, putative; n=5; Plasm... 36 0.97
UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n... 36 0.97
UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi... 36 0.97
UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat... 36 0.97
UniRef50_A5KBM1 Cluster: Serine-repeat antigen; n=1; Plasmodium ... 36 0.97
UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ... 36 0.97
UniRef50_Q3A8I3 Cluster: Putative serine protease; n=1; Pelobact... 36 1.3
UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t... 36 1.3
UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ... 36 1.3
UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus ... 36 1.3
UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar ... 36 1.3
UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-... 36 1.3
UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lambli... 36 1.3
UniRef50_Q23H10 Cluster: Papain family cysteine protease contain... 36 1.3
UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; ... 36 1.3
UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheir... 36 1.3
UniRef50_A4VE98 Cluster: Cathepsin z; n=3; Tetrahymena thermophi... 36 1.3
UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic... 36 1.3
UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:... 36 1.3
UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma... 36 1.3
UniRef50_UPI0001509FF9 Cluster: Papain family cysteine protease ... 36 1.7
UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease ... 36 1.7
UniRef50_Q8ZRX7 Cluster: Putative viral protein; n=1; Salmonella... 36 1.7
UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryz... 36 1.7
UniRef50_Q7R0G3 Cluster: GLP_29_33036_32140; n=1; Giardia lambli... 36 1.7
UniRef50_Q3L7L7 Cluster: Sar s 1 allergen Yv5020C01; n=1; Sarcop... 36 1.7
UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-li... 36 1.7
UniRef50_A5UP12 Cluster: Adhesin-like protein; n=1; Methanobrevi... 36 1.7
UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,... 35 2.2
UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ... 35 2.2
UniRef50_Q8QNJ8 Cluster: EsV-1-75; n=1; Ectocarpus siliculosus v... 35 2.2
UniRef50_Q60EQ9 Cluster: Putative uncharacterized protein OJ1280... 35 2.2
UniRef50_Q01E11 Cluster: Cysteine proteinase Cathepsin F; n=3; O... 35 2.2
UniRef50_Q239L8 Cluster: Papain family cysteine protease contain... 35 2.2
UniRef50_Q235G6 Cluster: Papain family cysteine protease contain... 35 2.2
UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C... 35 2.2
UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve... 35 2.2
UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, w... 35 2.2
UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc... 35 2.2
UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba hi... 35 3.0
UniRef50_Q6A1H9 Cluster: Cathepsin X/O; n=1; Suberites domuncula... 35 3.0
UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcop... 35 3.0
UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia... 35 3.0
UniRef50_Q26153 Cluster: V-SERA 4; n=1; Plasmodium vivax|Rep: V-... 35 3.0
UniRef50_Q22A69 Cluster: Papain family cysteine protease contain... 35 3.0
UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina... 35 3.0
UniRef50_A7APS9 Cluster: Papain family cysteine protease contain... 35 3.0
UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs... 35 3.0
UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;... 35 3.0
UniRef50_UPI000155D183 Cluster: PREDICTED: similar to Cathepsin ... 34 3.9
UniRef50_A0IYD1 Cluster: Putative outer membrane adhesin like pr... 34 3.9
UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ... 34 3.9
UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus... 34 3.9
UniRef50_Q7QQ83 Cluster: GLP_42_16392_14707; n=1; Giardia lambli... 34 3.9
UniRef50_Q23EG5 Cluster: Papain family cysteine protease contain... 34 3.9
UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve... 34 3.9
UniRef50_A2DPZ6 Cluster: Clan CA, family C1, cathepsin B-like cy... 34 3.9
UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, wh... 34 3.9
UniRef50_A0LQG9 Cluster: Peptidase C1A, papain; n=1; Syntrophoba... 34 5.2
UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|... 34 5.2
UniRef50_Q01FL5 Cluster: Cysteine protease-5; n=3; Ostreococcus|... 34 5.2
UniRef50_Q015J8 Cluster: Cathepsin; n=2; Ostreococcus|Rep: Cathe... 34 5.2
UniRef50_Q93512 Cluster: Putative uncharacterized protein; n=2; ... 34 5.2
UniRef50_Q8N0R5 Cluster: Cycle like factor BmCyc b; n=4; Obtecto... 34 5.2
UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schist... 34 5.2
UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dv... 34 5.2
UniRef50_Q3YJ15 Cluster: Putative galactosyl transferase; n=1; H... 33 6.8
UniRef50_Q5YER6 Cluster: Cathepsin Z; n=1; Bigelowiella natans|R... 33 6.8
UniRef50_Q01FU9 Cluster: Cathepsin Z; n=2; Ostreococcus|Rep: Cat... 33 6.8
UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2; ... 33 6.8
UniRef50_Q9XXQ7 Cluster: Putative uncharacterized protein; n=2; ... 33 6.8
UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ... 33 6.8
UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|... 33 6.8
UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep... 33 6.8
UniRef50_Q54MB6 Cluster: Putative uncharacterized protein; n=1; ... 33 6.8
UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;... 33 6.8
UniRef50_Q3L7L6 Cluster: Sar s 1 allergen Yv5032C08; n=1; Sarcop... 33 6.8
UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl... 33 6.8
UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain... 33 6.8
UniRef50_A2ERV3 Cluster: Putative uncharacterized protein; n=1; ... 33 6.8
UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like cy... 33 6.8
UniRef50_A0EAG2 Cluster: Chromosome undetermined scaffold_86, wh... 33 6.8
UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [C... 33 6.8
UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ... 33 9.0
UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome s... 33 9.0
UniRef50_Q89Z69 Cluster: Putative uncharacterized protein; n=1; ... 33 9.0
UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty... 33 9.0
UniRef50_A3B2E1 Cluster: Putative uncharacterized protein; n=1; ... 33 9.0
UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n... 33 9.0
UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckei... 33 9.0
UniRef50_Q54JE9 Cluster: Putative uncharacterized protein; n=1; ... 33 9.0
UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theil... 33 9.0
UniRef50_Q248G1 Cluster: Papain family cysteine protease contain... 33 9.0
UniRef50_Q1AMF3 Cluster: Cathepsin C1; n=1; Toxoplasma gondii|Re... 33 9.0
UniRef50_A7S9N1 Cluster: Predicted protein; n=1; Nematostella ve... 33 9.0
>UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep:
Parcxpwnx02 - Periplaneta americana (American cockroach)
Length = 343
Score = 131 bits (316), Expect = 2e-29
Identities = 54/79 (68%), Positives = 64/79 (81%)
Frame = +2
Query: 272 LPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 451
LP K+ D+ +PE FDPR++WP+CPTL E+RDQGSCGSCWAFGAVEAM+DRVC +S
Sbjct: 81 LPEKSME-DIDIEIPEEFDPREQWPECPTLKEIRDQGSCGSCWAFGAVEAMSDRVCIHSK 139
Query: 452 GTKHFHFSAEDLLSCCPIC 508
G HFHFSAEDLL+CC C
Sbjct: 140 GKTHFHFSAEDLLTCCSSC 158
Score = 128 bits (309), Expect = 2e-28
Identities = 51/85 (60%), Positives = 61/85 (71%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693
GC+GG P AW+YW G+VSGGSYNS QGC+PY I PCEHHV G R PC G+ TP+C
Sbjct: 161 GCNGGEPGAAWDYWVSTGIVSGGSYNSHQGCQPYAIEPCEHHVNGTRKPC-GEGDTPRCV 219
Query: 694 KKCESGYDVNYKQDKQYGKHVYTCP 768
K+CE GYDV Y +D+ +GK Y P
Sbjct: 220 KRCEEGYDVPYGKDRHFGKSAYAVP 244
Score = 46.4 bits (105), Expect = 0.001
Identities = 21/41 (51%), Positives = 26/41 (63%)
Frame = +3
Query: 126 LPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGV 248
L PLSD+FI+ IN +WKA RNF D +KK+MGV
Sbjct: 32 LVDPLSDDFIDHINSLNTTWKAHRNFGNDIPLREIKKLMGV 72
>UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1)
(Cathepsin B1) (APP secretase) (APPS) [Contains:
Cathepsin B light chain; Cathepsin B heavy chain]; n=85;
Eukaryota|Rep: Cathepsin B precursor (EC 3.4.22.1)
(Cathepsin B1) (APP secretase) (APPS) [Contains:
Cathepsin B light chain; Cathepsin B heavy chain] - Homo
sapiens (Human)
Length = 339
Score = 127 bits (306), Expect = 4e-28
Identities = 51/83 (61%), Positives = 59/83 (71%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693
GC+GG P AW +W GLVSGG Y S GCRPY IPPCEHHV G+R PC+G+ TPKC+
Sbjct: 149 GCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCS 208
Query: 694 KKCESGYDVNYKQDKQYGKHVYT 762
K CE GY YKQDK YG + Y+
Sbjct: 209 KICEPGYSPTYKQDKHYGYNSYS 231
Score = 104 bits (249), Expect = 3e-21
Identities = 40/63 (63%), Positives = 51/63 (80%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
LP +FD R++WP CPT+ E+RDQGSCGSCWAFGAVEA++DR+C ++N SAEDLL
Sbjct: 80 LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139
Query: 491 SCC 499
+CC
Sbjct: 140 TCC 142
Score = 46.0 bits (104), Expect = 0.001
Identities = 23/57 (40%), Positives = 36/57 (63%), Gaps = 4/57 (7%)
Frame = +3
Query: 87 YVTLVC--VLAAAKDLP--HPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMG 245
+ +L C VLA A+ P HPLSDE +N +N + +W+AG NF + ++LK++ G
Sbjct: 5 WASLCCLLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLCG 60
Score = 42.7 bits (96), Expect = 0.011
Identities = 18/27 (66%), Positives = 22/27 (81%)
Frame = +3
Query: 762 LSGDEDHIRAELFKNGPVEGAFTVYSD 842
+S E I AE++KNGPVEGAF+VYSD
Sbjct: 232 VSNSEKDIMAEIYKNGPVEGAFSVYSD 258
>UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin B;
n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
similar to cathepsin B - Strongylocentrotus purpuratus
Length = 346
Score = 118 bits (284), Expect = 2e-25
Identities = 48/76 (63%), Positives = 56/76 (73%)
Frame = +2
Query: 281 KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460
K N I LPENFD R+ WP+CPT+ EVRDQGSCGSCWAFGAVEA++DR+C S G
Sbjct: 68 KLENQTRIKDLPENFDARENWPNCPTIKEVRDQGSCGSCWAFGAVEAISDRICIKSKGQT 127
Query: 461 HFHFSAEDLLSCCPIC 508
H SAEDL++CC C
Sbjct: 128 QVHISAEDLMTCCKTC 143
Score = 114 bits (274), Expect = 3e-24
Identities = 44/81 (54%), Positives = 57/81 (70%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693
GC+GG P AWEY+K G+V+GG +NSSQGC+PY+I C+HHV G + PC G+ TP+C
Sbjct: 146 GCNGGFPGSAWEYYKDTGIVTGGQWNSSQGCQPYQIKSCDHHVNGTKGPCQGEGPTPECK 205
Query: 694 KKCESGYDVNYKQDKQYGKHV 756
KCE+ Y Y+QDK Y V
Sbjct: 206 HKCEASYSTPYEQDKHYALSV 226
>UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n=4;
Tenebrionidae|Rep: Putative cathepsin B-like proteinase
- Tenebrio molitor (Yellow mealworm)
Length = 321
Score = 111 bits (268), Expect = 2e-23
Identities = 43/78 (55%), Positives = 59/78 (75%)
Frame = +2
Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454
P+ H F+ +PE+FD R KWP+C +LN +RDQG+CGSCWAF ++E+M+DR+C +S+G
Sbjct: 72 PVLVHTFNA-RDVPESFDARTKWPNCDSLNRIRDQGACGSCWAFASIESMSDRICIHSSG 130
Query: 455 TKHFHFSAEDLLSCCPIC 508
+ F FS EDLLSCC C
Sbjct: 131 SAQFMFSPEDLLSCCTSC 148
Score = 64.5 bits (150), Expect = 3e-09
Identities = 32/81 (39%), Positives = 44/81 (54%)
Frame = +1
Query: 517 CSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCTK 696
C GG A +++ + G+VSGG NS++GCRPY + H G +TP CTK
Sbjct: 151 CGGGYMMSALDFYINEGIVSGGDVNSNEGCRPY---TADAHDQG---------QTPACTK 198
Query: 697 KCESGYDVNYKQDKQYGKHVY 759
C +GY +Y DK YG + Y
Sbjct: 199 SCRNGYSTSYSADKHYGSNDY 219
Score = 54.0 bits (124), Expect = 5e-06
Identities = 30/66 (45%), Positives = 44/66 (66%)
Frame = +3
Query: 63 KMFISRAAYVTLVCVLAAAKDLPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIM 242
K+F+S +V LV VL+A+ LS EFI++IN Q+SW AGRNFP +T+ +L K+
Sbjct: 2 KIFLS---FVVLVAVLSASLAEIDVLSSEFIDSINRIQSSWVAGRNFPENTTNEYLYKLN 58
Query: 243 GVIEMN 260
G I ++
Sbjct: 59 GFIGLH 64
>UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome
shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome 5
SCAF15026, whole genome shotgun sequence - Tetraodon
nigroviridis (Green puffer)
Length = 351
Score = 108 bits (259), Expect = 2e-22
Identities = 43/66 (65%), Positives = 52/66 (78%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
LP+ FD R++WP+CPTL E+RDQGSCGSCWAFGA EAM+DRVC +SN SA+DLL
Sbjct: 79 LPKEFDSREQWPNCPTLKEIRDQGSCGSCWAFGASEAMSDRVCIHSNAKVSVELSAQDLL 138
Query: 491 SCCPIC 508
+CC C
Sbjct: 139 TCCNSC 144
Score = 100 bits (240), Expect = 4e-20
Identities = 50/106 (47%), Positives = 62/106 (58%), Gaps = 22/106 (20%)
Frame = +1
Query: 511 LGCSGGMPRLAWEYWKHFGLVSGGSYNS---------------------SQGCRPYEIPP 627
+GC+GG P AW +W GLVSGG Y+S S GCRPY IPP
Sbjct: 146 MGCNGGYPSSAWNFWVSDGLVSGGLYDSHIGRIQVSLCVLLLAVDRDFVSPGCRPYTIPP 205
Query: 628 CEHHVPGNRMPCSGD-TKTPKCTKKCESGYDVNYKQDKQYGKHVYT 762
CEHHV G+R CSG+ TP+C +CE+GY +YKQDK +GK Y+
Sbjct: 206 CEHHVNGSRPSCSGEGGDTPECIFRCEAGYSPSYKQDKHFGKTSYS 251
Score = 48.4 bits (110), Expect = 2e-04
Identities = 20/32 (62%), Positives = 25/32 (78%)
Frame = +3
Query: 747 KTCIYLSGDEDHIRAELFKNGPVEGAFTVYSD 842
KT +S +ED I+ E++KNGPVEGAFTVY D
Sbjct: 247 KTSYSVSSEEDEIKQEIYKNGPVEGAFTVYED 278
Score = 41.9 bits (94), Expect = 0.019
Identities = 20/59 (33%), Positives = 35/59 (59%), Gaps = 2/59 (3%)
Frame = +3
Query: 81 AAYVTLVCVLAAAKDLPH--PLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVI 251
AA++ L +++ PH PLS E +N IN ++W AG NF + ++++KK+ G +
Sbjct: 4 AAFLFLAAAWSSSLARPHLKPLSSEMVNYINKLNSTWTAGHNF-HNVDYSYVKKLCGTL 61
>UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=1;
Biomphalaria glabrata|Rep: Cathepsin B preproprotein
precursor - Biomphalaria glabrata (Bloodfluke planorb)
Length = 333
Score = 107 bits (257), Expect = 3e-22
Identities = 48/101 (47%), Positives = 62/101 (61%)
Frame = +2
Query: 206 ARHIVRAS*ENNGSYRDEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGS 385
AR ++ + N +Y H ++ N LP+NFDPR KWPDC +LNE+RDQ +
Sbjct: 57 ARALLGVNMAENKAYNRIHLKYKQVQPRN-----DLPDNFDPRTKWPDCASLNEIRDQAN 111
Query: 386 CGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPIC 508
CGSCWAFG+ EAMTDR+C G + H SAED+ CC C
Sbjct: 112 CGSCWAFGSAEAMTDRICIAGKG--NIHISAEDINDCCKSC 150
Score = 104 bits (250), Expect = 2e-21
Identities = 40/83 (48%), Positives = 52/83 (62%)
Frame = +1
Query: 511 LGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKC 690
+GC+GG P AWE++ G+VSGG Y +++GC PY +P C+HH G PC TPKC
Sbjct: 152 MGCNGGYPAAAWEWYVDTGVVSGGQYGTNEGCMPYSLPHCDHHTTGKYQPCPAVVPTPKC 211
Query: 691 TKKCESGYDVNYKQDKQYGKHVY 759
KKC +GY +Y DK GK Y
Sbjct: 212 EKKCLTGYPKSYSNDKTRGKKSY 234
>UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep:
Cathepsin B - Apriona germari
Length = 324
Score = 105 bits (251), Expect = 2e-21
Identities = 44/89 (49%), Positives = 63/89 (70%)
Frame = +2
Query: 242 GSYRDEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEA 421
G RD + TLP+ H + I+ +P++FD R++WP C ++ +RD+G+CGSCWAF AVE
Sbjct: 64 GINRDPN-VTLPVVFH--EAISGIPDSFDAREQWPFCESIRTIRDEGACGSCWAFAAVEV 120
Query: 422 MTDRVCTYSNGTKHFHFSAEDLLSCCPIC 508
M+DR+C S G K F FSAE+++SCC C
Sbjct: 121 MSDRLCLASEGRKKFIFSAEEVVSCCTAC 149
Score = 53.6 bits (123), Expect = 6e-06
Identities = 28/82 (34%), Positives = 41/82 (50%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693
GC GG ++YW G+ SGG Y S GC+PY SG+ TP+C
Sbjct: 152 GCRGGFLNEPYKYWVTNGIPSGGDYGSKLGCKPY------------TAAVSGE--TPQCQ 197
Query: 694 KKCESGYDVNYKQDKQYGKHVY 759
K C SGY+ ++++D ++ Y
Sbjct: 198 KACVSGYEKSWEKDLRHATSAY 219
>UniRef50_A1XG92 Cluster: Putative cathepsin B-like like proteinase;
n=1; Tenebrio molitor|Rep: Putative cathepsin B-like
like proteinase - Tenebrio molitor (Yellow mealworm)
Length = 301
Score = 104 bits (249), Expect = 3e-21
Identities = 45/80 (56%), Positives = 59/80 (73%), Gaps = 1/80 (1%)
Frame = +2
Query: 272 LPIKTHNFDLIASLPENFDPRDKWPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRVCTYS 448
LP+KTH +L A +PE+FD R+ WP+C ++ E+RDQ SCGSCWAFGAVEAM+DR+C +S
Sbjct: 72 LPVKTHAVNLDA-IPESFDAREAWPECTSIIGEIRDQASCGSCWAFGAVEAMSDRICIHS 130
Query: 449 NGTKHFHFSAEDLLSCCPIC 508
+ + SAEDL CC C
Sbjct: 131 DASVKVRISAEDLNDCCYDC 150
Score = 104 bits (249), Expect = 3e-21
Identities = 41/89 (46%), Positives = 56/89 (62%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693
GC+GG P LAW YW G+V+GG Y +GC+ Y I PC+HHV GN PC +TP C
Sbjct: 153 GCNGGWPDLAWSYWSSTGIVTGGLYGVDEGCKAYSIKPCDHHVDGNLGPCGDIQRTPACK 212
Query: 694 KKCESGYDVNYKQDKQYGKHVYTCPETKT 780
K C+S D+ YK D + G Y+ P++++
Sbjct: 213 KSCDSTSDLEYKSDLRRGS-AYSIPKSES 240
Score = 61.3 bits (142), Expect = 3e-08
Identities = 24/40 (60%), Positives = 33/40 (82%)
Frame = +3
Query: 132 HPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVI 251
HPLSDEFIN IN KQ +WKAGRNF +T +H+++++GV+
Sbjct: 24 HPLSDEFINEINSKQTTWKAGRNFDVNTPISHVRRLLGVL 63
>UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis
sinensis|Rep: Cathepsin B5 - Clonorchis sinensis
Length = 343
Score = 103 bits (248), Expect = 4e-21
Identities = 48/85 (56%), Positives = 59/85 (69%), Gaps = 1/85 (1%)
Frame = +2
Query: 257 EHFATLPIKTHN-FDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 433
E A P H+ FD + LP+NFD R WP C +++E+RDQ SCGSCWAFGAVEAM+DR
Sbjct: 68 EQKAQRPTLRHDGFDNMR-LPKNFDARKTWPHCSSISEIRDQSSCGSCWAFGAVEAMSDR 126
Query: 434 VCTYSNGTKHFHFSAEDLLSCCPIC 508
+C +SNG + SA DLLSCC C
Sbjct: 127 LCIHSNGAFNKSLSAVDLLSCCKDC 151
Score = 90.6 bits (215), Expect = 4e-17
Identities = 37/76 (48%), Positives = 49/76 (64%), Gaps = 1/76 (1%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDT-KTPKC 690
GC GG P +AW+YWK G+V+GGS GCR Y P CEHHV G+ PC + TP+C
Sbjct: 154 GCRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEHHVQGHYPPCPRELYPTPEC 213
Query: 691 TKKCESGYDVNYKQDK 738
++C++ DV Y +DK
Sbjct: 214 VQQCDTP-DVGYLEDK 228
>UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6
precursor; n=11; Bilateria|Rep: Cathepsin B-like
cysteine proteinase 6 precursor - Caenorhabditis elegans
Length = 379
Score = 103 bits (248), Expect = 4e-21
Identities = 44/76 (57%), Positives = 54/76 (71%)
Frame = +2
Query: 281 KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460
KT + DL +PE+FD RD WP C ++ +RDQ SCGSCWAFGAVEAM+DR+C S+G
Sbjct: 97 KTKDLDL--DIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGEL 154
Query: 461 HFHFSAEDLLSCCPIC 508
SA+DLLSCC C
Sbjct: 155 QVTLSADDLLSCCKSC 170
Score = 94.7 bits (225), Expect = 3e-18
Identities = 41/85 (48%), Positives = 50/85 (58%), Gaps = 3/85 (3%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRM-PCSGDT-KTPK 687
GC+GG P AW YW G+V+G +Y ++ GC+PY PPCEHH PC D TPK
Sbjct: 173 GCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPK 232
Query: 688 CTKKCESGY-DVNYKQDKQYGKHVY 759
C KKC S Y D Y +DK +G Y
Sbjct: 233 CEKKCVSDYTDKTYSEDKFFGASAY 257
>UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep:
Cathepsin b - Aedes aegypti (Yellowfever mosquito)
Length = 386
Score = 100 bits (239), Expect = 5e-20
Identities = 40/66 (60%), Positives = 47/66 (71%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
LP+ FD R+KWP+CP+L E+RDQG CGSCWA A AMTDR C S G + F F + DLL
Sbjct: 125 LPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLL 184
Query: 491 SCCPIC 508
SCC C
Sbjct: 185 SCCHSC 190
Score = 81.0 bits (191), Expect = 3e-14
Identities = 40/86 (46%), Positives = 49/86 (56%), Gaps = 1/86 (1%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693
GC GG AW++W GL SGG NS QGC PY I C +PG D TPKC+
Sbjct: 193 GCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGEC--RIPGE------DEDTPKCS 244
Query: 694 KKCESGYDV-NYKQDKQYGKHVYTCP 768
KC SGY+V + QD+ YG+ Y+ P
Sbjct: 245 NKCRSGYNVTDVWQDRHYGRVAYSLP 270
>UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase
precursor; n=28; Bilateria|Rep: Cathepsin B-like
cysteine proteinase precursor - Schistosoma japonicum
(Blood fluke)
Length = 342
Score = 100 bits (239), Expect = 5e-20
Identities = 42/78 (53%), Positives = 52/78 (66%)
Frame = +2
Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454
P H+ DL +P FD R KWP C +++++RDQ CGSCWAFGAVEAMTDR+C S G
Sbjct: 79 PTVDHH-DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGG 137
Query: 455 TKHFHFSAEDLLSCCPIC 508
+ SA DL+SCC C
Sbjct: 138 GQSAELSALDLISCCKDC 155
Score = 99.1 bits (236), Expect = 1e-19
Identities = 40/95 (42%), Positives = 52/95 (54%), Gaps = 1/95 (1%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDT-KTPKC 690
GC GG P +AW+YW G+V+GGS + GC+PY P CEHH G C KTP+C
Sbjct: 158 GCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQC 217
Query: 691 TKKCESGYDVNYKQDKQYGKHVYTCPETKTTSARN 795
+ C+ GY Y+QDK YG Y + R+
Sbjct: 218 KQTCQKGYKTPYEQDKHYGDESYNVQNNEKVIQRD 252
>UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA;
n=1; Tribolium castaneum|Rep: PREDICTED: similar to
CG10992-PA - Tribolium castaneum
Length = 325
Score = 98.7 bits (235), Expect = 2e-19
Identities = 42/90 (46%), Positives = 59/90 (65%), Gaps = 1/90 (1%)
Frame = +2
Query: 242 GSYRDEHFATLPIKTHNFDLIASLPENFDPRDKWPDCP-TLNEVRDQGSCGSCWAFGAVE 418
G + D ++ + K H I S+PE+FD R+KWP+C + ++R+QG+CGSCWAF + E
Sbjct: 54 GLHPDPNYK-IQTKQHKISRIISIPESFDAREKWPECKDVIGKIRNQGNCGSCWAFASTE 112
Query: 419 AMTDRVCTYSNGTKHFHFSAEDLLSCCPIC 508
MTDR+C S G F FS E+LL+CC C
Sbjct: 113 VMTDRLCISSKGKIKFVFSPENLLTCCKDC 142
Score = 52.4 bits (120), Expect = 1e-05
Identities = 19/34 (55%), Positives = 26/34 (76%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPY 615
GC GG + AW+Y+ + G+ SGG YNSS+GC+PY
Sbjct: 145 GCKGGYIKNAWDYYINEGIASGGDYNSSEGCQPY 178
>UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n=8;
Strongylida|Rep: Cathepsin B-like cysteine protease 2 -
Parelaphostrongylus tenuis
Length = 344
Score = 97.5 bits (232), Expect = 4e-19
Identities = 37/66 (56%), Positives = 50/66 (75%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
+P++FD R +WP CP+++ +RDQ CGSCWAFG+ EAM+DRVC S+G K SA+D+L
Sbjct: 94 IPDSFDARVQWPHCPSISYIRDQSQCGSCWAFGSAEAMSDRVCIASHGNKTVELSADDIL 153
Query: 491 SCCPIC 508
SCC C
Sbjct: 154 SCCYDC 159
Score = 93.9 bits (223), Expect = 5e-18
Identities = 40/90 (44%), Positives = 51/90 (56%), Gaps = 1/90 (1%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRM-PCSGDTKTPKC 690
GC GG P AWEY+ G+V+GG Y + CRPYEIPPC HH C+ TP C
Sbjct: 162 GCDGGYPISAWEYFVETGVVTGGLYGTKDSCRPYEIPPCGHHRNETFYGNCTQIADTPDC 221
Query: 691 TKKCESGYDVNYKQDKQYGKHVYTCPETKT 780
C++GY ++Y DK +GK YT + T
Sbjct: 222 VTTCQAGYPISYDDDKTFGKDSYTIESSVT 251
>UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3
precursor; n=4; Caenorhabditis|Rep: Cathepsin B-like
cysteine proteinase 3 precursor - Caenorhabditis elegans
Length = 370
Score = 97.5 bits (232), Expect = 4e-19
Identities = 38/63 (60%), Positives = 48/63 (76%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
LP+ FD R+KWPDC T+ +R+Q +CGSCWAFGA E ++DRVC SNGT+ S ED+L
Sbjct: 92 LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 151
Query: 491 SCC 499
SCC
Sbjct: 152 SCC 154
Score = 65.3 bits (152), Expect = 2e-09
Identities = 33/92 (35%), Positives = 42/92 (45%), Gaps = 1/92 (1%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693
GC GG A +W G V+GG Y GC PY PC + P ++ TP C
Sbjct: 161 GCKGGYSIEALRFWASSGAVTGGDY-GGHGCMPYSFAPCTKNCP--------ESTTPSCK 211
Query: 694 KKCESGYDV-NYKQDKQYGKHVYTCPETKTTS 786
C+S Y YK+DK YG Y TK+ +
Sbjct: 212 TTCQSSYKTEEYKKDKHYGASAYKVTTTKSVT 243
>UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase
precursor; n=29; Schistosomatidae|Rep: Cathepsin B-like
cysteine proteinase precursor - Schistosoma mansoni
(Blood fluke)
Length = 340
Score = 97.1 bits (231), Expect = 5e-19
Identities = 41/78 (52%), Positives = 50/78 (64%)
Frame = +2
Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454
P HN D +P NFD R KWP C ++ +RDQ CGSCW+FGAVEAM+DR C S G
Sbjct: 78 PTVDHN-DWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCWSFGAVEAMSDRSCIQSGG 136
Query: 455 TKHFHFSAEDLLSCCPIC 508
++ SA DLL+CC C
Sbjct: 137 KQNVELSAVDLLTCCESC 154
Score = 86.2 bits (204), Expect = 9e-16
Identities = 36/84 (42%), Positives = 44/84 (52%), Gaps = 1/84 (1%)
Frame = +1
Query: 511 LGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDT-KTPK 687
LGC GG+ AW+YW G+V+ S + GC PY P CEHH G PC TP+
Sbjct: 156 LGCEGGILGPAWDYWVKEGIVTASSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYNTPR 215
Query: 688 CTKKCESGYDVNYKQDKQYGKHVY 759
C + C+ Y Y QDK GK Y
Sbjct: 216 CKQTCQRKYKTPYTQDKHRGKSSY 239
Score = 34.7 bits (76), Expect = 3.0
Identities = 15/32 (46%), Positives = 20/32 (62%)
Frame = +3
Query: 747 KTCIYLSGDEDHIRAELFKNGPVEGAFTVYSD 842
K+ + DE I+ E+ K GPVE +FTVY D
Sbjct: 236 KSSYNVKNDEKAIQKEIMKYGPVEASFTVYED 267
>UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=1;
Nilaparvata lugens|Rep: Cathepsin B-like protease
precursor - Nilaparvata lugens (Brown planthopper)
Length = 347
Score = 96.3 bits (229), Expect = 9e-19
Identities = 41/91 (45%), Positives = 52/91 (57%), Gaps = 2/91 (2%)
Frame = +1
Query: 502 YL*LGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGD--T 675
Y GC GG P AW + K GLV+GG Y+S GC+PY I PCEHH+ G++ CS
Sbjct: 156 YCGFGCEGGFPDAAWVFIKRHGLVTGGDYHSHDGCQPYPIAPCEHHMEGSKPNCSASPTE 215
Query: 676 KTPKCTKKCESGYDVNYKQDKQYGKHVYTCP 768
TP C C G + Y++D+Q GK Y P
Sbjct: 216 PTPACETTCTHGSSLAYQKDRQKGKSAYLVP 246
Score = 83.4 bits (197), Expect = 6e-15
Identities = 32/66 (48%), Positives = 42/66 (63%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
+P+ FD R KW C +L E+RDQG+CGSCWA A DR+C SN + H S+ +L+
Sbjct: 92 VPKYFDARKKWKKCKSLREIRDQGNCGSCWAVSVAAAFADRLCIASNAKWNGHISSRELM 151
Query: 491 SCCPIC 508
SCC C
Sbjct: 152 SCCSYC 157
Score = 38.7 bits (86), Expect = 0.18
Identities = 19/59 (32%), Positives = 35/59 (59%), Gaps = 1/59 (1%)
Frame = +3
Query: 84 AYVTLVCVLAAAKDLPHPLSDEFINTINLKQNS-WKAGRNFPRDTSFAHLKKIMGVIEM 257
A V+ + L ++ +++++I+ IN S WKAG NF DT ++L+ ++GV E+
Sbjct: 10 AVVSAISALPDQENTVREIANKWIDAINNNPKSTWKAGHNFHPDTPMSYLQGLLGVSEL 68
>UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor;
n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
B-like proteinase precursor - Diabrotica virgifera
virgifera (western corn rootworm)
Length = 331
Score = 96.3 bits (229), Expect = 9e-19
Identities = 38/79 (48%), Positives = 49/79 (62%), Gaps = 1/79 (1%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSG-DTKTPKC 690
GC GG P +AW YW G+ +GG Y S QGC+PY + PCEHH GN++ CS D TP C
Sbjct: 148 GCEGGYPTMAWSYWIDTGITTGGLYGSKQGCQPYSLQPCEHHTEGNKVQCSTLDYDTPSC 207
Query: 691 TKKCESGYDVNYKQDKQYG 747
KC+ +NYK + +G
Sbjct: 208 KHKCDDS-ALNYKSELTFG 225
Score = 76.2 bits (179), Expect = 1e-12
Identities = 35/78 (44%), Positives = 46/78 (58%), Gaps = 1/78 (1%)
Frame = +2
Query: 284 THNFDLIASLPENFDPRDKWPDCP-TLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460
TH+ D+ +P +FD R+ W +C ++ V DQ CGSCWA A AM+DR C S G
Sbjct: 72 THSEDI--QVPNSFDARENWKECSDVISTVVDQSDCGSCWAVAAASAMSDRRCIASQGKL 129
Query: 461 HFHFSAEDLLSCCPICDW 514
SAE+LLSCC C +
Sbjct: 130 KVPVSAENLLSCCDSCGY 147
Score = 52.4 bits (120), Expect = 1e-05
Identities = 23/58 (39%), Positives = 40/58 (68%), Gaps = 2/58 (3%)
Frame = +3
Query: 78 RAAYVT--LVCVLAAAKDLPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMG 245
+AA++ L+ ++ + K P+PLS++FIN IN KQ++W AG+NF + S +K ++G
Sbjct: 2 KAAFIITLLLPIVLSYKGSPNPLSNDFINYINSKQSTWVAGKNFDENLSIQEIKNLLG 59
>UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Cathepsin
B - Fasciola gigantica (Giant liver fluke)
Length = 339
Score = 94.3 bits (224), Expect = 3e-18
Identities = 42/85 (49%), Positives = 52/85 (61%)
Frame = +2
Query: 254 DEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 433
+E A P H+ LPE+FD R +WP C T++E+RDQ SCGSCWA A AM+DR
Sbjct: 68 EERNALRPTIKHDISK-NDLPESFDARSQWPQCWTISEIRDQASCGSCWATAAASAMSDR 126
Query: 434 VCTYSNGTKHFHFSAEDLLSCCPIC 508
VC +SNG +A D LSCC C
Sbjct: 127 VCIHSNGQMRPRLAAADPLSCCTYC 151
Score = 80.6 bits (190), Expect = 5e-14
Identities = 36/105 (34%), Positives = 56/105 (53%), Gaps = 2/105 (1%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMP-CSGDT-KTPK 687
GC GG P AW+YW G+V+GG++ + GC+P+ C+H + C T TP
Sbjct: 154 GCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYTYPTPP 213
Query: 688 CTKKCESGYDVNYKQDKQYGKHVYTCPETKTTSARNCSRMVPSKV 822
C + C++GY+ Y+QDK YG Y E ++ + + P +V
Sbjct: 214 CARACQTGYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEV 258
>UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gambiae
str. PEST|Rep: ENSANGP00000012227 - Anopheles gambiae
str. PEST
Length = 218
Score = 92.7 bits (220), Expect = 1e-17
Identities = 35/66 (53%), Positives = 48/66 (72%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
+PE+FD R+ WP+C +L +R+QG+CGSCWA A M+DRVC +SNGT + +AEDL+
Sbjct: 1 IPESFDARNHWPNCESLRAIRNQGTCGSCWAVAAASVMSDRVCIHSNGTINVALAAEDLM 60
Query: 491 SCCPIC 508
CC C
Sbjct: 61 GCCVDC 66
Score = 36.7 bits (81), Expect(2) = 0.010
Identities = 15/29 (51%), Positives = 22/29 (75%), Gaps = 1/29 (3%)
Frame = +1
Query: 514 GCSGG-MPRLAWEYWKHFGLVSGGSYNSS 597
GC+GG + +++YW GLVSGG+YNS+
Sbjct: 69 GCNGGFLDGTSFQYWVDAGLVSGGAYNST 97
Score = 25.4 bits (53), Expect(2) = 0.010
Identities = 9/22 (40%), Positives = 14/22 (63%)
Frame = +1
Query: 703 ESGYDVNYKQDKQYGKHVYTCP 768
+ G D +Y +DK +GK Y+ P
Sbjct: 98 DDGVDRHYSKDKLFGKVAYSVP 119
>UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7;
Rhabditida|Rep: Cysteine proteinase 3 - Necator
americanus (Human hookworm)
Length = 360
Score = 92.7 bits (220), Expect = 1e-17
Identities = 36/77 (46%), Positives = 48/77 (62%)
Frame = +2
Query: 278 IKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT 457
+K + D +P +FD RDKWP C ++ +RDQ CGSCWA + E M+DR+C SNGT
Sbjct: 79 LKEEDMDFSEEIPVSFDARDKWPKCTSIGFIRDQSHCGSCWAVSSAETMSDRLCVQSNGT 138
Query: 458 KHFHFSAEDLLSCCPIC 508
S D+L+CCP C
Sbjct: 139 IKVLLSDTDILACCPNC 155
Score = 77.0 bits (181), Expect = 6e-13
Identities = 35/90 (38%), Positives = 46/90 (51%), Gaps = 1/90 (1%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDT-KTPKC 690
GC GG AWEY+K+ G+ +GG Y + C+PY PC+ G C D+ TPKC
Sbjct: 158 GCGGGHTIRAWEYFKNTGVCTGGLYGTKDSCKPYAFYPCKDESYGK---CPKDSFPTPKC 214
Query: 691 TKKCESGYDVNYKQDKQYGKHVYTCPETKT 780
K C+ Y Y DK Y Y P+ +T
Sbjct: 215 RKICQYKYSKKYADDKYYANSAYRIPQNET 244
>UniRef50_Q23FP9 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 340
Score = 92.3 bits (219), Expect = 1e-17
Identities = 37/88 (42%), Positives = 48/88 (54%), Gaps = 1/88 (1%)
Frame = +1
Query: 502 YL*LGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKT 681
Y +GC GG P AW Y K G+ +GG Y C+PY PPC+HHV G PC T
Sbjct: 153 YCGMGCKGGYPSAAWGYMKRQGVSTGGLYGDDTSCKPYIFPPCDHHVTGQYQPCGPIQPT 212
Query: 682 PKCTKKCESGYDVN-YKQDKQYGKHVYT 762
P+C K+C S Y N Y++D + Y+
Sbjct: 213 PQCVKECNSEYTQNTYEKDLHFASQTYS 240
Score = 89.8 bits (213), Expect = 7e-17
Identities = 39/87 (44%), Positives = 54/87 (62%), Gaps = 1/87 (1%)
Frame = +2
Query: 242 GSYRDEHFATLPIKTHNFDLIAS-LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVE 418
GS + + LP K + + A +PE FD R++WP+C ++ +RDQ +CGSCWAF A E
Sbjct: 64 GSLDEPDWVKLPTKEFDPNANADPIPEFFDAREQWPNCQSIKLIRDQSTCGSCWAFAATE 123
Query: 419 AMTDRVCTYSNGTKHFHFSAEDLLSCC 499
+DR+C SN T S+EDLL CC
Sbjct: 124 TFSDRICIASNQTLQTSISSEDLLECC 150
>UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4
precursor; n=5; Caenorhabditis|Rep: Cathepsin B-like
cysteine proteinase 4 precursor - Caenorhabditis elegans
Length = 335
Score = 92.3 bits (219), Expect = 1e-17
Identities = 36/69 (52%), Positives = 47/69 (68%)
Frame = +2
Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
++P FD R +WP+C ++N +RDQ CGSCWAF A EA +DR C SNG + SAED+
Sbjct: 80 TIPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDV 139
Query: 488 LSCCPICDW 514
LSCC C +
Sbjct: 140 LSCCSNCGY 148
Score = 72.1 bits (169), Expect = 2e-11
Identities = 35/85 (41%), Positives = 42/85 (49%), Gaps = 3/85 (3%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMP-CSGD-TKTPK 687
GC GG P AW+Y G +GGSY + GC+PY + PC V P C D TP
Sbjct: 149 GCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPA 208
Query: 688 CTKKC-ESGYDVNYKQDKQYGKHVY 759
C KC Y+V Y DK +G Y
Sbjct: 209 CVNKCTNKNYNVAYTADKHFGSTAY 233
>UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep:
Cathepsin B - Pandalus borealis (Northern red shrimp)
Length = 328
Score = 91.9 bits (218), Expect = 2e-17
Identities = 37/79 (46%), Positives = 50/79 (63%)
Frame = +2
Query: 272 LPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 451
LP+K N +P FD R++WP CP ++E+RDQG+CGSCWA A MTDR C +
Sbjct: 65 LPLK--NVTPTKEIPVEFDAREQWPHCPCIDEIRDQGNCGSCWAVSAASVMTDRTCIDTE 122
Query: 452 GTKHFHFSAEDLLSCCPIC 508
G F FS+E++ +CC C
Sbjct: 123 GLVDFRFSSENVAACCTEC 141
Score = 91.5 bits (217), Expect = 2e-17
Identities = 36/88 (40%), Positives = 50/88 (56%)
Frame = +1
Query: 517 CSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCTK 696
C GG A+ +W G VSGG +NS++GC+PY + CEHH+ G R PC GD C++
Sbjct: 145 CYGGDEDTAFTHWVTKGFVSGGRHNSNEGCQPYSVEECEHHIEGPRPPCEGDMPELVCSE 204
Query: 697 KCESGYDVNYKQDKQYGKHVYTCPETKT 780
C Y Y++D +YG Y P+ T
Sbjct: 205 TCHEEYGKTYEEDLEYGLEAYVLPQDVT 232
Score = 48.4 bits (110), Expect = 2e-04
Identities = 23/48 (47%), Positives = 31/48 (64%)
Frame = +3
Query: 96 LVCVLAAAKDLPHPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKI 239
L+ ++AAA PLSDEF+ + KQ +WKAGRNF +D S LK +
Sbjct: 6 LLALVAAASAELDPLSDEFLELLQSKQMTWKAGRNFAKDISKDFLKSL 53
>UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1;
Rhipicephalus appendiculatus|Rep: Midgut cysteine
proteinase 1 - Rhipicephalus appendiculatus (Brown ear
tick)
Length = 332
Score = 91.1 bits (216), Expect = 3e-17
Identities = 34/73 (46%), Positives = 48/73 (65%)
Frame = +2
Query: 314 PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLS 493
PE+F PR+ W C ++ +RDQ +CGSCWAF A E+++DR+C ++NG + SAEDLL+
Sbjct: 88 PESFTPREYWSHCSSIRVIRDQSACGSCWAFAAAESISDRICIHTNGKVQVNISAEDLLA 147
Query: 494 CCPICDWDAAEEC 532
CC C C
Sbjct: 148 CCHTCGHGCDGRC 160
Score = 55.2 bits (127), Expect = 2e-06
Identities = 27/72 (37%), Positives = 38/72 (52%), Gaps = 4/72 (5%)
Frame = +1
Query: 592 SSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCTKKCESGYDVNYKQDKQYGKHVY---- 759
+ GC+PY +PPC VP C+ TPKC C GY+ +Y++DK + K+VY
Sbjct: 180 TEDGCQPYSLPPC---VPN----CTHPEPTPKCQHVCRKGYEKSYEEDKHFAKNVYRLLK 232
Query: 760 TCPETKTTSARN 795
C KT +N
Sbjct: 233 KCDAIKTDIYKN 244
Score = 36.3 bits (80), Expect = 0.97
Identities = 16/28 (57%), Positives = 18/28 (64%)
Frame = +3
Query: 135 PLSDEFINTINLKQNSWKAGRNFPRDTS 218
PLS+E IN IN +WKAGRNF S
Sbjct: 26 PLSEEMINFINSINTTWKAGRNFDEKRS 53
Score = 34.7 bits (76), Expect = 3.0
Identities = 13/22 (59%), Positives = 18/22 (81%)
Frame = +3
Query: 777 DHIRAELFKNGPVEGAFTVYSD 842
D I+ +++KNGPVE AF VY+D
Sbjct: 235 DAIKTDIYKNGPVESAFFVYAD 256
>UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep:
Cathepsin b - Aedes aegypti (Yellowfever mosquito)
Length = 332
Score = 90.2 bits (214), Expect = 6e-17
Identities = 35/79 (44%), Positives = 50/79 (63%)
Frame = +2
Query: 272 LPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 451
LP K H+ +PE FD R+KWP C +++ +++QG CG+CWA AV M+DR+C +S
Sbjct: 72 LPTKRHDVAYNMDIPEFFDAREKWPYCKSISTIKNQGLCGACWAVAAVSVMSDRLCIHSE 131
Query: 452 GTKHFHFSAEDLLSCCPIC 508
G +AEDL+ CC C
Sbjct: 132 GKFDVELAAEDLMGCCKDC 150
Score = 83.4 bits (197), Expect = 6e-15
Identities = 38/86 (44%), Positives = 50/86 (58%), Gaps = 1/86 (1%)
Frame = +1
Query: 514 GCSGG-MPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKC 690
GC+GG + +++YW GLVSG +YNS+ GC+PY PC + G C + KTP C
Sbjct: 153 GCNGGFLDGTSFQYWVDVGLVSGAAYNSTDGCKPYPFKPCLYPFVG----CHPE-KTPSC 207
Query: 691 TKKCESGYDVNYKQDKQYGKHVYTCP 768
T C GYD Y++DK YG Y P
Sbjct: 208 THHCTEGYDGTYRRDKYYGSAAYKLP 233
Score = 34.7 bits (76), Expect = 3.0
Identities = 15/28 (53%), Positives = 18/28 (64%)
Frame = +3
Query: 762 LSGDEDHIRAELFKNGPVEGAFTVYSDL 845
L DE I+ E+ NGPVE F+VY DL
Sbjct: 232 LPNDERMIQLEIMTNGPVESGFSVYQDL 259
>UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7;
n=2; Haemonchidae|Rep: Cathepsin B-like cysteine
protease GCP7 - Haemonchus contortus (Barber pole worm)
Length = 348
Score = 90.2 bits (214), Expect = 6e-17
Identities = 39/85 (45%), Positives = 53/85 (62%)
Frame = +2
Query: 245 SYRDEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
SY E+ + T N D+ PE+FD R+KW DCP+L + DQ +CGSCWA A + M
Sbjct: 78 SYNQENVLPIANITSNDDI----PESFDSREKWKDCPSLRVIPDQSNCGSCWAVSAAQCM 133
Query: 425 TDRVCTYSNGTKHFHFSAEDLLSCC 499
+DR+C +S G K SA D+L+CC
Sbjct: 134 SDRLCIHSQGRKKVLLSATDILACC 158
Score = 62.1 bits (144), Expect = 2e-08
Identities = 31/91 (34%), Positives = 41/91 (45%), Gaps = 1/91 (1%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPC-SGDTKTPKC 690
GC GG AW++ G+V+GG+Y C+PY P C H C S TP C
Sbjct: 165 GCDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCGAHKGKAFNNCPSHPYATPAC 224
Query: 691 TKKCESGYDVNYKQDKQYGKHVYTCPETKTT 783
C+ GY Y+ DK + Y P + T
Sbjct: 225 KPYCQYGYGKRYENDKIKARTWYWLPNDERT 255
>UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115,
whole genome shotgun sequence; n=1; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_115,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 332
Score = 89.8 bits (213), Expect = 7e-17
Identities = 38/87 (43%), Positives = 53/87 (60%), Gaps = 1/87 (1%)
Frame = +2
Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454
P++ + + +LP +F ++KWP CP++ + DQG+CGSCWA A M+DR+C S
Sbjct: 59 PVEYKYHEKLENLPPSFSAQEKWPGCPSIELIPDQGNCGSCWAVSAASTMSDRLCIASGQ 118
Query: 455 TKHFHFSAEDLLSCCPI-CDWDAAEEC 532
T SAEDLLSCC I C+ D C
Sbjct: 119 TDKRQISAEDLLSCCGINCELDGNGGC 145
Score = 72.9 bits (171), Expect = 9e-12
Identities = 34/81 (41%), Positives = 42/81 (51%), Gaps = 6/81 (7%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEH-HVPGNRMPCSGD-----T 675
GC GG P AW+Y + G+V+GG+YN C+PY PPC H + G C D
Sbjct: 144 GCDGGYPYGAWKYLRVDGIVTGGTYNDFSLCKPYSFPPCSHGNDSGKYSKCENDFFMLTE 203
Query: 676 KTPKCTKKCESGYDVNYKQDK 738
TP CTKKC + Y DK
Sbjct: 204 VTPSCTKKCHPQFSRTYDVDK 224
>UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8.4;
n=1; Caenorhabditis elegans|Rep: Putative
uncharacterized protein W07B8.4 - Caenorhabditis elegans
Length = 335
Score = 88.6 bits (210), Expect = 2e-16
Identities = 34/64 (53%), Positives = 46/64 (71%)
Frame = +2
Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
S+P+++D RD WP C ++N +RDQ CGSCWA A EA++DR C SNG + SAED+
Sbjct: 72 SIPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGDVNTLLSAEDI 131
Query: 488 LSCC 499
L+CC
Sbjct: 132 LTCC 135
Score = 81.8 bits (193), Expect = 2e-14
Identities = 38/86 (44%), Positives = 46/86 (53%), Gaps = 4/86 (4%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMP-CSGD-TKTPK 687
GC GG P AW YW GLV+GGS+ S GC+PY I PC + G P C + TPK
Sbjct: 144 GCEGGYPIQAWRYWVKNGLVTGGSFESQYGCKPYSIAPCGETIDGVTWPECPMKISDTPK 203
Query: 688 CTKKC--ESGYDVNYKQDKQYGKHVY 759
C C + Y + Y QDK +G Y
Sbjct: 204 CEHHCTGNNSYPIPYDQDKHFGASAY 229
>UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep:
Cathepsin B - Uronema marinum
Length = 350
Score = 88.2 bits (209), Expect = 2e-16
Identities = 36/64 (56%), Positives = 47/64 (73%)
Frame = +2
Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
SLPE+FD R+ +P C +L +VRDQ +CGSCWAFG VEA++DR+C S S+E+L
Sbjct: 85 SLPESFDLREAYPKCESLQQVRDQSNCGSCWAFGTVEAISDRICIASGQKDQTRISSENL 144
Query: 488 LSCC 499
LSCC
Sbjct: 145 LSCC 148
Score = 80.2 bits (189), Expect = 6e-14
Identities = 40/97 (41%), Positives = 51/97 (52%), Gaps = 8/97 (8%)
Frame = +1
Query: 511 LGCSGGMPRLAWEYWKHFGLVSGGSY-----NSSQGCRPYEIPPCEHHVPGNRMPCSG-- 669
+GC+GG AW Y+ GLVSG Y NS C+PY PPC HHV G C+
Sbjct: 156 MGCNGGYTAGAWNYYVKTGLVSGNLYTDDNQNSKTECQPYSFPPCSHHVQGEYQACTDLP 215
Query: 670 DTKTPKCTKKCESGYDVN-YKQDKQYGKHVYTCPETK 777
TPKC +C S Y N Y+QD G Y+ P+++
Sbjct: 216 QFNTPKCYTECNSQYTQNSYEQDLHKGVSSYSVPKSE 252
>UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4;
Ancylostomatidae|Rep: Cysteine proteinase - Ancylostoma
ceylanicum
Length = 348
Score = 87.8 bits (208), Expect = 3e-16
Identities = 39/88 (44%), Positives = 53/88 (60%), Gaps = 6/88 (6%)
Frame = +2
Query: 254 DEHFATLPIKTH------NFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 415
D FA P KT N ++ +P+ FD RD+WP+C ++ +RDQ SCGSCWA A
Sbjct: 69 DVKFAVDPEKTEPNYVLANTEMKVDIPDTFDARDRWPNCTSMKHIRDQSSCGSCWAVAAA 128
Query: 416 EAMTDRVCTYSNGTKHFHFSAEDLLSCC 499
AM+DRVC +NG + S ++LSCC
Sbjct: 129 SAMSDRVCALTNGRINRILSDTEVLSCC 156
Score = 66.1 bits (154), Expect = 1e-09
Identities = 29/84 (34%), Positives = 42/84 (50%), Gaps = 2/84 (2%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRM-PCSGDT-KTPK 687
GC GG P A+ Y +GL +GG Y C+PY PC +H PC + TP
Sbjct: 163 GCKGGYPARAFGYAWRYGLSTGGPYGEKDACQPYAFYPCGNHAHEPYYGPCPDELWPTPT 222
Query: 688 CTKKCESGYDVNYKQDKQYGKHVY 759
C + C+ GY + +++DK + Y
Sbjct: 223 CRRTCQLGYPIPFEKDKIFNDQTY 246
>UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8;
Trypanosoma|Rep: Cathepsin B-like cysteine protease -
Trypanosoma brucei
Length = 340
Score = 87.4 bits (207), Expect = 4e-16
Identities = 35/68 (51%), Positives = 45/68 (66%)
Frame = +2
Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
A LP +FD + WP+CPT+ ++ DQ +CGSCWA A AM+DR CT G + H SA D
Sbjct: 92 APLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCT-MGGVQDVHISAGD 150
Query: 485 LLSCCPIC 508
LL+CC C
Sbjct: 151 LLACCSDC 158
Score = 50.4 bits (115), Expect = 6e-05
Identities = 32/82 (39%), Positives = 38/82 (46%), Gaps = 5/82 (6%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNR--MPCSG-DTKTP 684
GC+GG P AW Y+ GLVS Y C+PY P C HH PCS + TP
Sbjct: 161 GCNGGDPDRAWAYFSSTGLVS--DY-----CQPYPFPHCSHHSKSKNGYPPCSQFNFDTP 213
Query: 685 KCTKKCESGY--DVNYKQDKQY 744
KC C+ VNY+ Y
Sbjct: 214 KCNYTCDDPTIPVVNYRSWTSY 235
>UniRef50_Q625X3 Cluster: Putative uncharacterized protein CBG01102;
n=1; Caenorhabditis briggsae|Rep: Putative
uncharacterized protein CBG01102 - Caenorhabditis
briggsae
Length = 374
Score = 86.6 bits (205), Expect = 7e-16
Identities = 39/86 (45%), Positives = 50/86 (58%), Gaps = 2/86 (2%)
Frame = +1
Query: 517 CSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMP-CSGDT-KTPKC 690
C+GG AW+YW+ GL +GGSY S GC+PY I PC+ + P C T +TP C
Sbjct: 189 CAGGNVFKAWQYWQKHGLPTGGSYESQFGCKPYSISPCDTVIGNITFPGCLNSTVQTPSC 248
Query: 691 TKKCESGYDVNYKQDKQYGKHVYTCP 768
KKC+SGY V +D+ YG V P
Sbjct: 249 EKKCKSGYPVELDKDRHYGVSVDQLP 274
Score = 68.9 bits (161), Expect = 1e-10
Identities = 27/59 (45%), Positives = 39/59 (66%)
Frame = +2
Query: 323 FDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC 499
FD R++WP+C ++ + D C S WAF A E+M+DR+C S G + SA++LLSCC
Sbjct: 85 FDARERWPECSSIPIINDISDCKSSWAFSAAESMSDRLCINSGGMINTVLSAQELLSCC 143
>UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2;
Caenorhabditis|Rep: Putative uncharacterized protein -
Caenorhabditis elegans
Length = 421
Score = 86.2 bits (204), Expect = 9e-16
Identities = 32/68 (47%), Positives = 46/68 (67%)
Frame = +2
Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
+ +P+NFD R KWP+CP+++ V +QG CGSC+A A +DR C +SNGT S ED
Sbjct: 136 SDVPKNFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSNGTFKSLLSEED 195
Query: 485 LLSCCPIC 508
++ CC +C
Sbjct: 196 IIGCCSVC 203
Score = 46.8 bits (106), Expect = 7e-04
Identities = 31/108 (28%), Positives = 47/108 (43%), Gaps = 2/108 (1%)
Frame = +1
Query: 517 CSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCTK 696
C GG P A YW + GLV+GG GCRPY VP + + C K
Sbjct: 206 CYGGDPLKALTYWVNQGLVTGG----RDGCRPYSF-DLSCGVPCSPATFFEAEEKRTCMK 260
Query: 697 KCES-GYDVNYKQDKQYGKHVYTC-PETKTTSARNCSRMVPSKVLSQY 834
+C++ Y Y++DK + Y+ P + T S R+ ++ +
Sbjct: 261 RCQNIYYQQKYEEDKHFATFAYSMYPRSMTVSPDGKERVKVPTIIGHF 308
>UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8;
Leishmania|Rep: Cathepsin B-like protease - Leishmania
major
Length = 340
Score = 85.8 bits (203), Expect = 1e-15
Identities = 37/71 (52%), Positives = 47/71 (66%)
Frame = +2
Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFS 475
+L LPE FD + WP C T++E+RDQ +CGSCWA AVEA++DR CT+ G S
Sbjct: 93 ELQQDLPEFFDAAEHWPMCLTISEIRDQSNCGSCWAIAAVEAISDRYCTF-GGVPDRRMS 151
Query: 476 AEDLLSCCPIC 508
+LLSCC IC
Sbjct: 152 TSNLLSCCFIC 162
Score = 55.6 bits (128), Expect = 1e-06
Identities = 30/82 (36%), Positives = 40/82 (48%), Gaps = 4/82 (4%)
Frame = +1
Query: 511 LGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDT--KTP 684
LGC GG+P +AW +W G+ +++ C+PY PC HH + P T TP
Sbjct: 164 LGCHGGIPTVAWLWWVWVGI-------ATEDCQPYPFDPCSHHGNSEKYPPCPSTIYDTP 216
Query: 685 KCTKKCE-SGYD-VNYKQDKQY 744
KC CE + D V YK Y
Sbjct: 217 KCNTTCERNEMDLVKYKGSTSY 238
>UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2
precursor; n=8; Haemonchus contortus|Rep: Cathepsin
B-like cysteine proteinase 2 precursor - Haemonchus
contortus (Barber pole worm)
Length = 342
Score = 85.4 bits (202), Expect = 2e-15
Identities = 40/85 (47%), Positives = 48/85 (56%), Gaps = 3/85 (3%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRM---PCSGDTKTP 684
GC GG P AW+Y+ + G+VSGG Y + CRPY I PC HH GN C G TP
Sbjct: 155 GCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHH--GNDTYYGECRGTAPTP 212
Query: 685 KCTKKCESGYDVNYKQDKQYGKHVY 759
C +KC G Y+ DK+YGK Y
Sbjct: 213 PCKRKCRPGVRKMYRIDKRYGKDAY 237
Score = 75.8 bits (178), Expect = 1e-12
Identities = 30/67 (44%), Positives = 43/67 (64%), Gaps = 1/67 (1%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
+P ++DPRD W +C T +RDQ +CGSCWA A++DR+C S K + SA D++
Sbjct: 87 IPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIM 145
Query: 491 SCC-PIC 508
+CC P C
Sbjct: 146 TCCRPQC 152
>UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator
americanus|Rep: Cysteine proteinase 4 - Necator
americanus (Human hookworm)
Length = 339
Score = 84.2 bits (199), Expect = 4e-15
Identities = 32/68 (47%), Positives = 44/68 (64%)
Frame = +2
Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFS 475
+L LPE FD R+KWP C ++ +RD +CGSCWA A M+DR+C +NGT S
Sbjct: 83 NLNVELPERFDAREKWPHCASIGLIRDHSACGSCWAVSAASVMSDRLCIQTNGTNQKILS 142
Query: 476 AEDLLSCC 499
+ D+L+CC
Sbjct: 143 SADILACC 150
Score = 72.9 bits (171), Expect = 9e-12
Identities = 35/82 (42%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPC--SGDTKTPK 687
GC GG P A+ Y ++ G+ SGG Y C+PY PC+ GN PC G TPK
Sbjct: 157 GCEGGYPIQAYFYLENTGVCSGGEYREKNVCKPYPFYPCD----GNYGPCPKEGAFDTPK 212
Query: 688 CTKKCESGYDVNYKQDKQYGKH 753
C K C+ Y V Y++DK +GK+
Sbjct: 213 CRKICQFRYPVPYEEDKVFGKN 234
>UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8.1;
n=1; Caenorhabditis elegans|Rep: Putative
uncharacterized protein W07B8.1 - Caenorhabditis elegans
Length = 335
Score = 83.4 bits (197), Expect = 6e-15
Identities = 40/92 (43%), Positives = 52/92 (56%), Gaps = 4/92 (4%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMP-CSGDTK-TPK 687
GC GG P AW+Y + G+ +GGSY S GC+PY IPPC V P C+ T TP
Sbjct: 147 GCEGGNPFKAWQYIQKHGIPTGGSYESQFGCKPYSIPPCGKTVGNVTYPACTNTTSPTPS 206
Query: 688 CTKKCES--GYDVNYKQDKQYGKHVYTCPETK 777
C KKC S GY ++ +D+ YG V P ++
Sbjct: 207 CEKKCTSRIGYPIDIDKDRHYGVSVDQLPNSQ 238
Score = 78.2 bits (184), Expect = 2e-13
Identities = 34/81 (41%), Positives = 51/81 (62%), Gaps = 3/81 (3%)
Frame = +2
Query: 266 ATLPIKTHNFDLI---ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 436
AT+ K NF + + L +FD R++WP+C ++ ++ D C + WAF A E+M+DR+
Sbjct: 58 ATIGFKIQNFGVSQANSDLSPSFDARERWPECMSIPQINDISECKTSWAFAAAESMSDRL 117
Query: 437 CTYSNGTKHFHFSAEDLLSCC 499
C S G K+ SAE+LLSCC
Sbjct: 118 CINSGGFKNTILSAEELLSCC 138
>UniRef50_Q5DEC9 Cluster: SJCHGC06356 protein; n=1; Schistosoma
japonicum|Rep: SJCHGC06356 protein - Schistosoma
japonicum (Blood fluke)
Length = 279
Score = 83.0 bits (196), Expect = 9e-15
Identities = 32/83 (38%), Positives = 48/83 (57%), Gaps = 1/83 (1%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDT-KTPKC 690
GC G YW +G+V+GGSY GC+PY +P C +H + C+ +T + P+C
Sbjct: 94 GCFHGSEVEVLVYWITYGIVTGGSYEDQSGCQPYPLPKCSYHPESRFLDCNNNTFEFPQC 153
Query: 691 TKKCESGYDVNYKQDKQYGKHVY 759
T +C+ GY+ Y DK YG+ +Y
Sbjct: 154 TNECQDGYNKTYDDDKFYGERIY 176
Score = 65.3 bits (152), Expect = 2e-09
Identities = 29/81 (35%), Positives = 46/81 (56%), Gaps = 1/81 (1%)
Frame = +2
Query: 257 EHFATLPIKTHNFDLI-ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 433
E+ T IKT + + I +P +FD R W +C T+ ++ D+ C + WA V++++DR
Sbjct: 9 ENIQTKHIKTISHNSINMEIPRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSISDR 68
Query: 434 VCTYSNGTKHFHFSAEDLLSC 496
+C SNG SA D +SC
Sbjct: 69 ICIRSNGRISVQLSARDAISC 89
>UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1
precursor; n=3; Haemonchidae|Rep: Cathepsin B-like
cysteine proteinase 1 precursor - Ostertagia ostertagi
Length = 341
Score = 83.0 bits (196), Expect = 9e-15
Identities = 31/66 (46%), Positives = 45/66 (68%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
+PE++DPR +W +C +L + DQ +CGSCWA + AM+DR+C S G K SA+D++
Sbjct: 91 IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 150
Query: 491 SCCPIC 508
SCC C
Sbjct: 151 SCCTWC 156
Score = 78.6 bits (185), Expect = 2e-13
Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 3/82 (3%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRM---PCSGDTKTP 684
GC GG P A+ + G+V+GG YN+ CRPYEI PC HH GN C G TP
Sbjct: 159 GCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPYEIHPCGHH--GNETYYGECVGMADTP 216
Query: 685 KCTKKCESGYDVNYKQDKQYGK 750
+C ++C GY +Y D+ Y K
Sbjct: 217 RCKRRCLLGYPKSYPSDRYYKK 238
>UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core
eudicotyledons|Rep: At1g02300/T6A9_10 - Arabidopsis
thaliana (Mouse-ear cress)
Length = 362
Score = 81.4 bits (192), Expect = 3e-14
Identities = 36/79 (45%), Positives = 48/79 (60%)
Frame = +2
Query: 263 FATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCT 442
F +PI +H+ L LP+ FD R W C ++ + DQG CGSCWAFGAVE+++DR C
Sbjct: 92 FLGVPIVSHDISL--KLPKEFDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCI 149
Query: 443 YSNGTKHFHFSAEDLLSCC 499
N + S DLL+CC
Sbjct: 150 KYN--MNVSLSVNDLLACC 166
Score = 55.6 bits (128), Expect = 1e-06
Identities = 32/83 (38%), Positives = 43/83 (51%), Gaps = 1/83 (1%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVPGNRMPCSGDTKTPKC 690
GC+GG P AW Y+KH G+V ++ C PY + C H PG C TPKC
Sbjct: 173 GCNGGYPIAAWRYFKHHGVV-------TEECDPYFDNTGCSH--PG----CEPAYPTPKC 219
Query: 691 TKKCESGYDVNYKQDKQYGKHVY 759
+KC SG + +++ K YG Y
Sbjct: 220 ARKCVSGNQL-WRESKHYGVSAY 241
Score = 37.5 bits (83), Expect = 0.42
Identities = 16/22 (72%), Positives = 18/22 (81%)
Frame = +3
Query: 777 DHIRAELFKNGPVEGAFTVYSD 842
D I AE++KNGPVE AFTVY D
Sbjct: 248 DDIMAEVYKNGPVEVAFTVYED 269
>UniRef50_Q00W63 Cluster: Cysteine proteinase; n=2;
Ostreococcus|Rep: Cysteine proteinase - Ostreococcus
tauri
Length = 362
Score = 81.0 bits (191), Expect = 3e-14
Identities = 35/63 (55%), Positives = 43/63 (68%), Gaps = 1/63 (1%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
LP+ FD R+KWP C L +E DQG+CGSCWA +AMTDR+C +NG + H SA L
Sbjct: 88 LPDTFDVREKWPKCAALVSEAVDQGACGSCWAVAPAKAMTDRLCIATNGAVNTHVSAIQL 147
Query: 488 LSC 496
LSC
Sbjct: 148 LSC 150
Score = 43.2 bits (97), Expect = 0.008
Identities = 18/41 (43%), Positives = 20/41 (48%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEH 636
GC GG P A+E G+VSGG C PY PC H
Sbjct: 170 GCMGGYPTEAYETAHRVGVVSGGLNGDQDTCMPYPFAPCHH 210
>UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma
japonicum|Rep: SJCHGC02853 protein - Schistosoma
japonicum (Blood fluke)
Length = 181
Score = 80.6 bits (190), Expect = 5e-14
Identities = 34/65 (52%), Positives = 45/65 (69%)
Frame = +2
Query: 254 DEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 433
D+H PI HN D+ LP+ FD R W +C ++ +RDQ SCGSCWAFGAVE+M+DR
Sbjct: 64 DQHKLHHPIIHHN-DINIKLPKYFDSRKYWKNCSSIRTIRDQSSCGSCWAFGAVESMSDR 122
Query: 434 VCTYS 448
+C +S
Sbjct: 123 ICIHS 127
Score = 37.1 bits (82), Expect = 0.55
Identities = 21/40 (52%), Positives = 24/40 (60%), Gaps = 1/40 (2%)
Frame = +3
Query: 135 PLSDEFINTINLKQN-SWKAGRNFPRDTSFAHLKKIMGVI 251
PLSDE I IN + N WKA R R TS H K +MGV+
Sbjct: 21 PLSDELITFINKQPNIEWKADRT-KRFTSIHHAKSMMGVL 59
>UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2;
Arthropoda|Rep: Cathepsin B-like cysteine protease -
Callosobruchus maculatus (Southern cowpea weevil) (Pulse
bruchid)
Length = 330
Score = 80.2 bits (189), Expect = 6e-14
Identities = 29/66 (43%), Positives = 39/66 (59%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
LPE FD R +W C ++ E+RDQ CGSCWA + M+DR+C S+ SA D++
Sbjct: 81 LPEEFDARKQWSKCESIKEIRDQSGCGSCWAVSSASVMSDRICIQSDQKNQLRISAADMI 140
Query: 491 SCCPIC 508
CC C
Sbjct: 141 ECCESC 146
Score = 73.7 bits (173), Expect = 5e-12
Identities = 33/82 (40%), Positives = 42/82 (51%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693
GC GG+P + WK G VSGG YNS+ GC Y +P C P C P C
Sbjct: 152 GCHGGIPSFTFTEWKDSGFVSGGEYNSTNGCMSYPLPRCN---PS----CKTLYDAPTCK 204
Query: 694 KKCESGYDVNYKQDKQYGKHVY 759
K+C+ G + Y++DK Y K Y
Sbjct: 205 KECDKGSPLKYEEDKHYAKQAY 226
Score = 49.6 bits (113), Expect = 1e-04
Identities = 24/63 (38%), Positives = 38/63 (60%), Gaps = 2/63 (3%)
Frame = +3
Query: 78 RAAYVTLVCVLAAAKDLPHP--LSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVI 251
+ A++ L V++ P LSDE+I +N K WKAGRNF RDTS ++++++ V
Sbjct: 2 KLAFIALAAVVSCTFAQPELDFLSDEYIEQLNSKNLPWKAGRNFERDTSLYNIQRLLSVG 61
Query: 252 EMN 260
+N
Sbjct: 62 TIN 64
>UniRef50_P91991 Cluster: Putative uncharacterized protein; n=2;
Caenorhabditis|Rep: Putative uncharacterized protein -
Caenorhabditis elegans
Length = 356
Score = 80.2 bits (189), Expect = 6e-14
Identities = 35/73 (47%), Positives = 45/73 (61%)
Frame = +2
Query: 281 KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460
KT N +++ +P +FD R KWP C + VRDQ CGS AVE +DR C SNGT
Sbjct: 82 KTGNDNVLVDIPSSFDSRQKWPSCSQIGAVRDQSDCGSAAHLVAVEIASDRTCIASNGTF 141
Query: 461 HFHFSAEDLLSCC 499
++ SA+D LSCC
Sbjct: 142 NWPLSAQDPLSCC 154
Score = 73.3 bits (172), Expect = 7e-12
Identities = 35/93 (37%), Positives = 49/93 (52%), Gaps = 4/93 (4%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPG--NRMPCSGDTKTPK 687
GC G P+ ++W+ GL +GG+YN GC+PY I PC+ +PC G TP
Sbjct: 166 GCDGSWPKDILKWWQTHGLCTGGNYNDQFGCKPYSIYPCDKKYANGTTSVPCPG-YHTPT 224
Query: 688 CTKKCESG--YDVNYKQDKQYGKHVYTCPETKT 780
C + C S + + YKQDK +GK Y + T
Sbjct: 225 CEEHCTSNITWPIAYKQDKHFGKAHYNVGKKMT 257
>UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin
B-like cysteine proteinase 4 precursor (Cysteine
protease-related 4); n=2; Tribolium castaneum|Rep:
PREDICTED: similar to Cathepsin B-like cysteine
proteinase 4 precursor (Cysteine protease-related 4) -
Tribolium castaneum
Length = 360
Score = 79.4 bits (187), Expect = 1e-13
Identities = 31/67 (46%), Positives = 41/67 (61%), Gaps = 1/67 (1%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
+PE FD R+ WP+C + +R+QG C S WAF A E M+DR+C +NG S EDL
Sbjct: 72 IPETFDAREYWPECADIIGNIRNQGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPEDL 131
Query: 488 LSCCPIC 508
+ CC C
Sbjct: 132 IDCCHYC 138
Score = 58.8 bits (136), Expect = 2e-07
Identities = 32/89 (35%), Positives = 44/89 (49%), Gaps = 1/89 (1%)
Frame = +1
Query: 517 CSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCTK 696
C GG AW Y+ GLVSGG YN+S GC+PY + R+ TP C
Sbjct: 142 CKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQPYS------ELNYYRI-------TPPCNT 188
Query: 697 KCESG-YDVNYKQDKQYGKHVYTCPETKT 780
C++ Y + Y DK +G +Y P+ +T
Sbjct: 189 TCQNDKYPIPYVSDKHFGDSIYYIPQNET 217
>UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep:
Cathepsin B - Triticum aestivum (Wheat)
Length = 353
Score = 77.8 bits (183), Expect = 3e-13
Identities = 36/78 (46%), Positives = 45/78 (57%)
Frame = +2
Query: 266 ATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTY 445
A +PIK H LP+ FD R +W C T+ + DQG CG+CWAF AVEA+ DR C +
Sbjct: 85 AGVPIKIHPE---MDLPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVEALQDRFCIH 141
Query: 446 SNGTKHFHFSAEDLLSCC 499
N S DLL+CC
Sbjct: 142 LN--MSVSLSVNDLLACC 157
Score = 45.6 bits (103), Expect = 0.002
Identities = 32/111 (28%), Positives = 49/111 (44%), Gaps = 1/111 (0%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVPGNRMPCSGDTKTPKC 690
GC+GG P AW Y++ G+V ++ C PY + C+H PG C TPKC
Sbjct: 164 GCNGGYPISAWRYFRRSGVV-------TEECDPYFDQTGCQH--PG----CEPAYPTPKC 210
Query: 691 TKKCESGYDVNYKQDKQYGKHVYTCPETKTTSARNCSRMVPSKVLSQYIQI 843
+KC+ +K++K + + Y + P +V Y QI
Sbjct: 211 QRKCKVENQA-WKENKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTYCQI 260
>UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|Rep:
Cathepsin B - Streblomastix strix
Length = 312
Score = 77.4 bits (182), Expect = 4e-13
Identities = 30/69 (43%), Positives = 41/69 (59%)
Frame = +2
Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481
+A+LP+ FD R WP+C + ++ DQG CGSCWA + E + DR C S G + S +
Sbjct: 73 VANLPDEFDSRTNWPNCQLIGKIYDQGHCGSCWAMSSFEVLQDRFCIKSEGKQTPELSPQ 132
Query: 482 DLLSCCPIC 508
L SC P C
Sbjct: 133 HLTSCTPGC 141
>UniRef50_Q237A1 Cluster: Papain family cysteine protease containing
protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 346
Score = 77.0 bits (181), Expect = 6e-13
Identities = 34/67 (50%), Positives = 45/67 (67%), Gaps = 1/67 (1%)
Frame = +2
Query: 311 LPENFDPRDKWPD-CPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
LPE FD R +W D C +L EVRDQ +CGSCWAFGA E+++DR C + + S ++L
Sbjct: 93 LPEEFDARVQWGDKCSSLWEVRDQSTCGSCWAFGAAESLSDRHCIHLG--QDIRLSTQNL 150
Query: 488 LSCCPIC 508
L+CC C
Sbjct: 151 LTCCAAC 157
Score = 72.1 bits (169), Expect = 2e-11
Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 3/85 (3%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGN-RMPCSGDTKTPKC 690
GC GG P A +Y+ + GLV+G Y ++ C+ Y PC HHV + PC+G+ TP C
Sbjct: 160 GCDGGWPEAAMDYYVNTGLVTGDLYGNNSWCQAYTFAPCAHHVTSDIYPPCTGELPTPPC 219
Query: 691 TKKCESG--YDVNYKQDKQYGKHVY 759
C+S + + Y +D G Y
Sbjct: 220 INSCDSNSTHTIPYSKDIHRGSKAY 244
Score = 36.7 bits (81), Expect = 0.73
Identities = 15/27 (55%), Positives = 20/27 (74%)
Frame = +3
Query: 762 LSGDEDHIRAELFKNGPVEGAFTVYSD 842
++ DE I AE++KNGP+E A TVY D
Sbjct: 246 IAKDEKAIMAEIYKNGPIEVALTVYED 272
>UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus
contortus|Rep: Cysteine proteinase - Haemonchus
contortus (Barber pole worm)
Length = 350
Score = 76.6 bits (180), Expect = 7e-13
Identities = 36/84 (42%), Positives = 44/84 (52%), Gaps = 2/84 (2%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGD--TKTPK 687
GC GG LAWE+ + FG+V+GG Y CRPY PC H G R C D TP
Sbjct: 163 GCEGGYDHLAWEWVQRFGVVTGGPYQQKGVCRPYAFHPCGLH-HGRRYDCPWDHSFSTPA 221
Query: 688 CTKKCESGYDVNYKQDKQYGKHVY 759
C C+ GY Y++DK + K Y
Sbjct: 222 CKPYCQFGYGKRYEKDKFFVKSTY 245
Score = 73.7 bits (173), Expect = 5e-12
Identities = 29/63 (46%), Positives = 38/63 (60%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
+PE+FD R W +C ++ VRDQ CGSCWA A M+DR+C + G S D+L
Sbjct: 94 IPESFDSRIVWKNCSSITYVRDQSRCGSCWAVSAASTMSDRICVQTKGKLQTILSDTDIL 153
Query: 491 SCC 499
SCC
Sbjct: 154 SCC 156
Score = 35.1 bits (77), Expect = 2.2
Identities = 15/32 (46%), Positives = 19/32 (59%)
Frame = +3
Query: 747 KTCIYLSGDEDHIRAELFKNGPVEGAFTVYSD 842
K+ L DE I+ E+ KNGPV+ AF Y D
Sbjct: 242 KSTYILDNDEKVIQREMMKNGPVQAAFITYED 273
>UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep:
Thiol protease - Trichuris suis
Length = 348
Score = 73.3 bits (172), Expect = 7e-12
Identities = 33/67 (49%), Positives = 41/67 (61%)
Frame = +2
Query: 299 LIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSA 478
L S+P +FD R W C +LN +RDQ CGSCWA A E M+DR+C SN + S
Sbjct: 80 LALSIPPSFDVRSLWHVC-SLNLIRDQAKCGSCWAVSAAETMSDRICVQSNCSIKACISD 138
Query: 479 EDLLSCC 499
D+LSCC
Sbjct: 139 TDILSCC 145
Score = 63.7 bits (148), Expect = 6e-09
Identities = 36/115 (31%), Positives = 51/115 (44%), Gaps = 11/115 (9%)
Frame = +1
Query: 502 YL*LGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYE-IPPCEHHVPGNRM-PCSGDT 675
Y GC+GG P AW ++ G +GG GC+PY+ P H+ N PC DT
Sbjct: 148 YCGYGCNGGFPIEAWRHFTVAGNCTGGKTIDKYGCKPYKPTGPIGRHLKRNDYAPCPNDT 207
Query: 676 ---------KTPKCTKKCESGYDVNYKQDKQYGKHVYTCPETKTTSARNCSRMVP 813
TP+C ++C GY +Y D+ YGK Y ++ R + P
Sbjct: 208 YYGECVGMADTPRCKRRCLLGYPKSYPSDRYYGKSAYIVKQSVKAIQREIMKNGP 262
>UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15;
Aphidoidea|Rep: Cathepsin B-S precursor - Tuberaphis
styraci
Length = 349
Score = 72.1 bits (169), Expect = 2e-11
Identities = 27/65 (41%), Positives = 37/65 (56%)
Frame = +2
Query: 314 PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLS 493
P+ FD R+ W C + +RDQG+CGSCW+F A DR+C + G + S E+L
Sbjct: 86 PKQFDSRENWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEELAF 145
Query: 494 CCPIC 508
CC C
Sbjct: 146 CCMDC 150
Score = 63.3 bits (147), Expect = 7e-09
Identities = 29/89 (32%), Positives = 44/89 (49%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPKCT 693
GC GG P AW+Y++ G+ +GG Y++ +GC PY++PPC N + +C
Sbjct: 153 GCGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCYDEQGKNTCGGKPMERNHQCP 212
Query: 694 KKCESGYDVNYKQDKQYGKHVYTCPETKT 780
K C Y QD+ K+ Y +T
Sbjct: 213 KTC---YGKTTVQDRYKTKNEYVINSIET 238
Score = 39.5 bits (88), Expect = 0.10
Identities = 23/59 (38%), Positives = 32/59 (54%), Gaps = 4/59 (6%)
Frame = +3
Query: 81 AAYVTLVCVLAAAKDLPHP----LSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMG 245
A +VT+VC + + L P LSDE I IN +WKA R FP +TS + ++G
Sbjct: 2 AKFVTIVCAIFVSVYLAEPTLQFLSDERIKYINEVAKTWKAERYFPANTSEEYFIGLLG 60
>UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1;
Ostreococcus tauri|Rep: Cysteine proteinase Cathepsin F
- Ostreococcus tauri
Length = 498
Score = 71.7 bits (168), Expect = 2e-11
Identities = 33/64 (51%), Positives = 39/64 (60%), Gaps = 1/64 (1%)
Frame = +2
Query: 308 SLPENFDPRDKWPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
SLP +FD RD++P C L VRDQG CGSCWA A E M DR+C S G + S +
Sbjct: 256 SLPRHFDARDEYPKCARLIGTVRDQGKCGSCWAVAATEIMNDRLCISSGGKEVAELSPQF 315
Query: 485 LLSC 496
LSC
Sbjct: 316 ALSC 319
>UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep:
Cysteine proteinase - Toxoplasma gondii
Length = 569
Score = 70.9 bits (166), Expect = 4e-11
Identities = 32/78 (41%), Positives = 43/78 (55%), Gaps = 2/78 (2%)
Frame = +2
Query: 272 LPIKTHNFDLIAS-LPENFDPRDKWPDCP-TLNEVRDQGSCGSCWAFGAVEAMTDRVCTY 445
+P+ F+ +P +FD R +P C + VRDQG CGSCWAF + EA DR+C
Sbjct: 260 MPLPAKEFENATEPVPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIR 319
Query: 446 SNGTKHFHFSAEDLLSCC 499
S G + SA+ SCC
Sbjct: 320 SQGKRLMPLSAQHTTSCC 337
Score = 64.1 bits (149), Expect = 4e-09
Identities = 29/70 (41%), Positives = 41/70 (58%), Gaps = 6/70 (8%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGLVSGGSYNS-SQG--CRPYEIPPCEHHVPGNRMPCSG---DT 675
GC+GG P +AW +++ G+V+GG +++ +G C PYE+P C HH C
Sbjct: 346 GCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAHHAKAPFPDCDATLVPR 405
Query: 676 KTPKCTKKCE 705
KTPKC K CE
Sbjct: 406 KTPKCRKDCE 415
>UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|Rep:
Cysteine protease - Giardia muris
Length = 301
Score = 69.3 bits (162), Expect = 1e-10
Identities = 35/84 (41%), Positives = 47/84 (55%), Gaps = 4/84 (4%)
Frame = +2
Query: 257 EHFATLPIKTH----NFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
E+ +L +TH N LP+++DPR + C L EV DQ SCGSCWAF AV
Sbjct: 55 ENLRSLRTETHVSQLNLGKTKELPKDYDPRVERAHC--LPEVADQASCGSCWAFSAVATF 112
Query: 425 TDRVCTYSNGTKHFHFSAEDLLSC 496
DR C Y +K H+S + ++SC
Sbjct: 113 ADRRCAYGLDSKQVHYSEQYVVSC 136
>UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 311
Score = 68.1 bits (159), Expect = 3e-10
Identities = 28/63 (44%), Positives = 41/63 (65%)
Frame = +2
Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
++PENFD R +WP +++ +R+QG CGSCWAFGA E ++DR S + SA+ L
Sbjct: 82 NIPENFDARKQWPG--SIHPIRNQGQCGSCWAFGASEVLSDRFAIASKNQIYVTLSAQQL 139
Query: 488 LSC 496
+ C
Sbjct: 140 VDC 142
>UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella
histriomuscorum|Rep: Cathepsin B - Oxytricha trifallax
(Sterkiella histriomuscorum)
Length = 294
Score = 65.3 bits (152), Expect = 2e-09
Identities = 33/65 (50%), Positives = 41/65 (63%)
Frame = +2
Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481
I ++PENFD R +W ++ +RDQ CGSCWAFGA EA +DR NG K S E
Sbjct: 73 IMTVPENFDARQQWGS--KIHAIRDQQQCGSCWAFGATEAFSDRFAI--NG-KDVILSPE 127
Query: 482 DLLSC 496
DL+SC
Sbjct: 128 DLVSC 132
Score = 33.9 bits (74), Expect = 5.2
Identities = 13/20 (65%), Positives = 18/20 (90%)
Frame = +3
Query: 783 IRAELFKNGPVEGAFTVYSD 842
I++E+ +GPVEGAFTVY+D
Sbjct: 203 IQSEIVSHGPVEGAFTVYTD 222
>UniRef50_A4RYN2 Cluster: Predicted protein; n=1; Ostreococcus
lucimarinus CCE9901|Rep: Predicted protein -
Ostreococcus lucimarinus CCE9901
Length = 330
Score = 64.5 bits (150), Expect = 3e-09
Identities = 30/63 (47%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
LP +FD R +P C L VRDQG CGSCWA A E M DR+C ++G S +
Sbjct: 112 LPTSFDARVAYPKCSRLLGAVRDQGRCGSCWAVAATEVMNDRLCVATDGENADELSPQYA 171
Query: 488 LSC 496
LSC
Sbjct: 172 LSC 174
Score = 37.5 bits (83), Expect = 0.42
Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 3/104 (2%)
Frame = +1
Query: 514 GCSGG--MPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDTKTPK 687
GC GG + L + K G+ GG +S+ C PYE C+H PC TP+
Sbjct: 180 GCDGGDVLDTLRIAFTK--GIPYGGMLDSN-ACLPYEFEACDH-------PCMVAGTTPQ 229
Query: 688 -CTKKCESGYDVNYKQDKQYGKHVYTCPETKTTSARNCSRMVPS 816
C KC G +++ YTCP+ T + VP+
Sbjct: 230 SCPAKCADGSALSFVHPT---SEPYTCPKGDVTHTGSGVYTVPN 270
>UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1;
Dictyostelium discoideum AX4|Rep: Putative
uncharacterized protein - Dictyostelium discoideum AX4
Length = 314
Score = 63.3 bits (147), Expect = 7e-09
Identities = 28/68 (41%), Positives = 43/68 (63%), Gaps = 1/68 (1%)
Frame = +2
Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG-TKHFHF 472
+L S+P +FD R +WPDC ++ + +Q CGSCWAF + E ++DR+C SN T
Sbjct: 83 ELKGSIPTSFDSRVQWPDC--IHPILNQEQCGSCWAFSSSEVLSDRLCIASNNKTNPGAL 140
Query: 473 SAEDLLSC 496
S + L++C
Sbjct: 141 SPQTLVAC 148
Score = 34.3 bits (75), Expect = 3.9
Identities = 13/19 (68%), Positives = 16/19 (84%)
Frame = +1
Query: 514 GCSGGMPRLAWEYWKHFGL 570
GCSGG+P+LAWEY + GL
Sbjct: 155 GCSGGIPQLAWEYMELKGL 173
>UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|Rep:
Cathepsin B - Streblomastix strix
Length = 283
Score = 62.5 bits (145), Expect = 1e-08
Identities = 29/62 (46%), Positives = 37/62 (59%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
+P+ FD R+KWPD + VRDQG CGSCWAF E + DR+ G + EDL+
Sbjct: 63 VPDTFDAREKWPDA--ILPVRDQGECGSCWAFSIAETIGDRLGVL--GCSRGDIAPEDLV 118
Query: 491 SC 496
SC
Sbjct: 119 SC 120
>UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2;
cellular organisms|Rep: Cysteine proteinase, putative -
Archaeoglobus fulgidus
Length = 1088
Score = 62.1 bits (144), Expect = 2e-08
Identities = 32/77 (41%), Positives = 40/77 (51%)
Frame = +2
Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481
+ASLP FD W D L+ VRDQGSCGSCWA AV A+ + S + S +
Sbjct: 591 MASLPSRFD----WRDYTGLSAVRDQGSCGSCWAHSAVAALESALIVESGASSSIDLSEQ 646
Query: 482 DLLSCCPICDWDAAEEC 532
LLSC C+ + C
Sbjct: 647 HLLSCEQDCEVGIGDWC 663
>UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3;
Giardia intestinalis|Rep: Cathepsin B-like CP1 precursor
- Giardia lamblia (Giardia intestinalis)
Length = 303
Score = 62.1 bits (144), Expect = 2e-08
Identities = 27/67 (40%), Positives = 38/67 (56%)
Frame = +2
Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFS 475
+L+ +P FD RD++P C + DQGSCGSCWAF A+ DR C + +S
Sbjct: 74 ELVDPIPPQFDFRDEYPQC--VKPALDQGSCGSCWAFSAIGVFGDRRCAMGIDKEAVSYS 131
Query: 476 AEDLLSC 496
+ L+SC
Sbjct: 132 QQHLISC 138
>UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4;
Giardia intestinalis|Rep: Cathepsin B-like CP2 precursor
- Giardia lamblia (Giardia intestinalis)
Length = 300
Score = 60.1 bits (139), Expect = 7e-08
Identities = 27/62 (43%), Positives = 38/62 (61%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
+PE+FD R+++P C + EV DQG CGSCWAF +V DR C K +S + ++
Sbjct: 75 VPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVAGLDKKPVKYSPQYVV 132
Query: 491 SC 496
SC
Sbjct: 133 SC 134
>UniRef50_UPI0000E4866E Cluster: PREDICTED: similar to oxidized-LDL
responsive gene 2, partial; n=1; Strongylocentrotus
purpuratus|Rep: PREDICTED: similar to oxidized-LDL
responsive gene 2, partial - Strongylocentrotus
purpuratus
Length = 363
Score = 58.8 bits (136), Expect = 2e-07
Identities = 27/64 (42%), Positives = 38/64 (59%), Gaps = 1/64 (1%)
Frame = +2
Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT-KHFHFSAED 484
++PE FD R +WP + V++QG+C S WA +DR+ SNGT K+ H S +
Sbjct: 221 AIPEEFDARAQWPGL--VEGVQNQGNCASSWAMSTAATASDRLAIQSNGTFKYMHLSPQH 278
Query: 485 LLSC 496
LLSC
Sbjct: 279 LLSC 282
>UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia
ATCC 50803|Rep: GLP_113_4299_5381 - Giardia lamblia ATCC
50803
Length = 360
Score = 58.8 bits (136), Expect = 2e-07
Identities = 26/61 (42%), Positives = 36/61 (59%)
Frame = +2
Query: 314 PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLS 493
PE++D RD++P C T EV DQG+CGSCWAF +V+ D C +S + +L
Sbjct: 141 PESYDFRDEYPHCIT--EVVDQGNCGSCWAFSSVQTFADHRCRSGLDATGVSYSVQYVLD 198
Query: 494 C 496
C
Sbjct: 199 C 199
>UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like protein
F26E4.3; n=2; Caenorhabditis|Rep: Uncharacterized
peptidase C1-like protein F26E4.3 - Caenorhabditis
elegans
Length = 491
Score = 58.8 bits (136), Expect = 2e-07
Identities = 27/62 (43%), Positives = 36/62 (58%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
LPE+FD RDKW P ++ V DQG CGS W+ +DR+ S G + S++ LL
Sbjct: 223 LPEHFDARDKWG--PLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQLL 280
Query: 491 SC 496
SC
Sbjct: 281 SC 282
>UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;
n=1; Strongylocentrotus purpuratus|Rep: PREDICTED:
hypothetical protein - Strongylocentrotus purpuratus
Length = 450
Score = 58.4 bits (135), Expect = 2e-07
Identities = 28/64 (43%), Positives = 35/64 (54%)
Frame = +2
Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
A LPE FD R+ WP ++EV DQG CGS WA +DR+ S G + S +
Sbjct: 195 ARLPETFDARENWPGL--IDEVIDQGKCGSSWAISTASVASDRLAIQSMGEINPRLSEQH 252
Query: 485 LLSC 496
LLSC
Sbjct: 253 LLSC 256
>UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA,
isoform A; n=2; Endopterygota|Rep: PREDICTED: similar to
CG3074-PA, isoform A - Tribolium castaneum
Length = 445
Score = 56.4 bits (130), Expect = 8e-07
Identities = 27/63 (42%), Positives = 34/63 (53%)
Frame = +2
Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
SLP FD KWP ++E++DQG CGS WA +DR S G + SA+ L
Sbjct: 196 SLPREFDSEFKWPGW--MSEIQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQHL 253
Query: 488 LSC 496
LSC
Sbjct: 254 LSC 256
>UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1;
Dictyostelium discoideum AX4|Rep: Putative
uncharacterized protein - Dictyostelium discoideum AX4
Length = 323
Score = 56.4 bits (130), Expect = 8e-07
Identities = 27/77 (35%), Positives = 39/77 (50%)
Frame = +2
Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
++P +FD R W DC ++ VR+Q SCGSCWA + DR+C S+ S + L
Sbjct: 45 TIPASFDVRTNWGDC--MSPVREQQSCGSCWAQVTSGILADRMCIESDKNIKMLLSPQYL 102
Query: 488 LSCCPICDWDAAEECRD 538
+ C C D C +
Sbjct: 103 MDCDGSCVSDGVSGCNN 119
>UniRef50_O02470 Cluster: Cysteine proteinase; n=2; Chromadorea|Rep:
Cysteine proteinase - Globodera pallida
Length = 53
Score = 56.0 bits (129), Expect = 1e-06
Identities = 22/41 (53%), Positives = 26/41 (63%)
Frame = +2
Query: 377 QGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC 499
QG CG CWAF E ++DR C SNGT+ S DLL+CC
Sbjct: 1 QGQCGRCWAFSTAEVISDRTCIASNGTQQPIISPTDLLTCC 41
>UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcoptes
scabiei type hominis|Rep: Sar s 1 allergen Yv9053H09 -
Sarcoptes scabiei type hominis
Length = 253
Score = 53.6 bits (123), Expect = 6e-06
Identities = 28/68 (41%), Positives = 38/68 (55%), Gaps = 4/68 (5%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV----EAMTDRVCTYSNGTKHFHFSA 478
LPE FD RD L+++R+QG CG+CWAF A+ A R N T+ HFS
Sbjct: 37 LPEKFDLRD----LGYLSKIRNQGRCGACWAFAALASVESAYNRRTRIVHNRTRKHHFSE 92
Query: 479 EDLLSCCP 502
++L+ C P
Sbjct: 93 QELVDCSP 100
>UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35;
Viridiplantae|Rep: Cysteine proteinase 15A precursor -
Pisum sativum (Garden pea)
Length = 363
Score = 53.6 bits (123), Expect = 6e-06
Identities = 30/75 (40%), Positives = 39/75 (52%)
Frame = +2
Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
+LPE+FD R+K P V+DQGSCGSCWAF A+ Y K S + L
Sbjct: 131 NLPEDFDWREKGAVTP----VKDQGSCGSCWAFSTTGALEG--AHYLATGKLVSLSEQQL 184
Query: 488 LSCCPICDWDAAEEC 532
+ C +CD + A C
Sbjct: 185 VDCDHVCDPEQAGSC 199
>UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lamblia
ATCC 50803|Rep: GLP_217_11853_10927 - Giardia lamblia
ATCC 50803
Length = 308
Score = 52.8 bits (121), Expect = 1e-05
Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 1/86 (1%)
Frame = +2
Query: 242 GSYRDEHFATLPIKT-HNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVE 418
GS R + P++ N D + P++FD R+++P C T EV D G C S WA+ AV+
Sbjct: 54 GSPRTQSSIVRPVRVPENEDPV---PDHFDFREEYPQCIT--EVIDIGLCSSSWAYSAVD 108
Query: 419 AMTDRVCTYSNGTKHFHFSAEDLLSC 496
A + R C + +SA+ +LSC
Sbjct: 109 AFSHRRCLTGLDQEATRYSAQYILSC 134
>UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p -
Drosophila melanogaster (Fruit fly)
Length = 431
Score = 52.4 bits (120), Expect = 1e-05
Identities = 23/62 (37%), Positives = 34/62 (54%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
LP +F+ DKW ++EV DQG CG+ W +DR S G ++ SA+++L
Sbjct: 187 LPSSFNALDKWSSY--ISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNIL 244
Query: 491 SC 496
SC
Sbjct: 245 SC 246
>UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin O
precursor; n=2; Apocrita|Rep: PREDICTED: similar to
Cathepsin O precursor - Apis mellifera
Length = 374
Score = 52.0 bits (119), Expect = 2e-05
Identities = 32/93 (34%), Positives = 44/93 (47%)
Frame = +2
Query: 218 VRAS*ENNGSYRDEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSC 397
+R N SY +H I S+P FD RDK P VR QGSCG+C
Sbjct: 128 IRGEKHMNASYHRKH----QISIDRMKRSISIPLRFDWRDKGVITP----VRSQGSCGAC 179
Query: 398 WAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
WAF +E + + + NGT H S ++++ C
Sbjct: 180 WAFSTIEVI-ESMFAIKNGTLH-SLSVQEMIDC 210
>UniRef50_UPI0000E815AE Cluster: PREDICTED: similar to
glucocorticoid-inducible protein; n=1; Gallus
gallus|Rep: PREDICTED: similar to
glucocorticoid-inducible protein - Gallus gallus
Length = 307
Score = 51.6 bits (118), Expect = 2e-05
Identities = 24/62 (38%), Positives = 33/62 (53%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
LP +FD KWP ++E DQG+C WAF +DR+ +S G S ++LL
Sbjct: 153 LPRHFDAATKWPGM--IHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMTPSLSPQNLL 210
Query: 491 SC 496
SC
Sbjct: 211 SC 212
>UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p;
n=1; Nasonia vitripennis|Rep: PREDICTED: similar to
GM06507p - Nasonia vitripennis
Length = 483
Score = 51.2 bits (117), Expect = 3e-05
Identities = 23/62 (37%), Positives = 33/62 (53%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
LP FD R +W + + V+DQG CG+ WA V+ +DR S G + S + L+
Sbjct: 236 LPREFDSRIQWGN--DITPVQDQGWCGASWAISTVDVASDRFAIMSKGIEKVQLSGQHLI 293
Query: 491 SC 496
SC
Sbjct: 294 SC 295
>UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=15;
Magnoliophyta|Rep: Cysteine proteinase RD19a precursor -
Arabidopsis thaliana (Mouse-ear cress)
Length = 368
Score = 51.2 bits (117), Expect = 3e-05
Identities = 27/75 (36%), Positives = 39/75 (52%)
Frame = +2
Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
+LPE+FD W D + V++QGSCGSCW+F A A+ + K S + L
Sbjct: 134 NLPEDFD----WRDHGAVTPVKNQGSCGSCWSFSATGALEG--ANFLATGKLVSLSEQQL 187
Query: 488 LSCCPICDWDAAEEC 532
+ C CD + A+ C
Sbjct: 188 VDCDHECDPEEADSC 202
>UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheirus
salmonis|Rep: Cysteine proteinase - Lepeophtheirus
salmonis (salmon louse)
Length = 372
Score = 50.8 bits (116), Expect = 4e-05
Identities = 26/65 (40%), Positives = 36/65 (55%)
Frame = +2
Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481
I LPE+ D R+K + +V++QGSCGSCW F AVE + V +N T S +
Sbjct: 112 IKDLPESVDWREKG----VITDVKNQGSCGSCWVFSAVEQIESYVAIENNMTSPPLLSTQ 167
Query: 482 DLLSC 496
+ SC
Sbjct: 168 QITSC 172
>UniRef50_Q9GZM7-2 Cluster: Isoform 2 of Q9GZM7 ; n=1; Homo
sapiens|Rep: Isoform 2 of Q9GZM7 - Homo sapiens (Human)
Length = 283
Score = 49.6 bits (113), Expect = 1e-04
Identities = 24/62 (38%), Positives = 34/62 (54%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
LP F+ +KWP+ ++E DQG+C WAF +DRV +S G S ++LL
Sbjct: 69 LPTAFEASEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLL 126
Query: 491 SC 496
SC
Sbjct: 127 SC 128
>UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11;
Trichomonadidae|Rep: Cysteine protease - Tritrichomonas
foetus (Trichomonas foetus)
Length = 315
Score = 49.6 bits (113), Expect = 1e-04
Identities = 24/63 (38%), Positives = 36/63 (57%)
Frame = +2
Query: 320 NFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC 499
N D D W + +NE++DQ +CGSCWAF A++A + S GT +S ++L+ C
Sbjct: 100 NVDSID-WREKGVVNEIKDQAACGSCWAFSAIQA-AESAYAISTGTLE-SYSEQNLVDCV 156
Query: 500 PIC 508
C
Sbjct: 157 QGC 159
>UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-like
precursor; n=26; Euteleostomi|Rep: Tubulointerstitial
nephritis antigen-like precursor - Homo sapiens (Human)
Length = 467
Score = 49.6 bits (113), Expect = 1e-04
Identities = 24/62 (38%), Positives = 34/62 (54%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
LP F+ +KWP+ ++E DQG+C WAF +DRV +S G S ++LL
Sbjct: 203 LPTAFEASEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLL 260
Query: 491 SC 496
SC
Sbjct: 261 SC 262
>UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2;
Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio
molitor (Yellow mealworm)
Length = 336
Score = 49.2 bits (112), Expect = 1e-04
Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Frame = +2
Query: 254 DEHFATLPIKTH-NFDLIASL--PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
D H +PIKT + L AS+ P +FD W D ++ V++QGSCGSCWAF + A+
Sbjct: 99 DLHKNGIPIKTREDLGLNASVRYPASFD----WRDQGMVSPVKNQGSCGSCWAFSSTGAI 154
Query: 425 TDRVCTYSNGTKHFHFSAEDLLSCCP 502
++ + S + L+ C P
Sbjct: 155 ESQMKIANGAGYDSSVSEQQLVDCVP 180
>UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like
cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan
CA, family C1, cathepsin L-like cysteine peptidase -
Trichomonas vaginalis G3
Length = 306
Score = 49.2 bits (112), Expect = 1e-04
Identities = 19/56 (33%), Positives = 34/56 (60%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPIC 508
W + +N++++QG+CGSCWAF A++ + +V N + + S ++LL C C
Sbjct: 94 WREQGIVNKIKNQGACGSCWAFSAIQVIESQVA--KNQKQLYDLSEQNLLDCVTSC 147
>UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase
precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin
L-like cysteine proteinase precursor - Acanthoscelides
obtectus (Bean weevil)
Length = 321
Score = 48.4 bits (110), Expect = 2e-04
Identities = 27/72 (37%), Positives = 42/72 (58%)
Frame = +2
Query: 290 NFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFH 469
+FD + +P+ D R+K + EV+ QG+CGSCWAF AV ++ +V NG+
Sbjct: 103 SFDNVNDIPKTVDWREKG----AVTEVKKQGNCGSCWAFSAVGSIEGQV-FLKNGSLE-S 156
Query: 470 FSAEDLLSCCPI 505
SA++L+ C I
Sbjct: 157 LSAQNLVDCAGI 168
>UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like
cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
Clan CA, family C1, cathepsin L-like cysteine peptidase
- Trichomonas vaginalis G3
Length = 234
Score = 48.4 bits (110), Expect = 2e-04
Identities = 25/70 (35%), Positives = 38/70 (54%)
Frame = +2
Query: 299 LIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSA 478
++ +P+ D R K +NE++DQ CGSCWAFG+ AM + +GT + S
Sbjct: 14 IVGDIPDEIDYRTKG----AVNEIKDQKHCGSCWAFGSCAAM-ESSWFLKHGTL-YSLSE 67
Query: 479 EDLLSCCPIC 508
+ L+ CC C
Sbjct: 68 QCLVDCCHDC 77
>UniRef50_Q58HK5 Cluster: Hc58; n=1; Haemonchus contortus|Rep: Hc58
- Haemonchus contortus (Barber pole worm)
Length = 241
Score = 47.6 bits (108), Expect = 4e-04
Identities = 17/29 (58%), Positives = 21/29 (72%)
Frame = +2
Query: 368 VRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454
+RDQ +CGSCWA A E M+DR C +S G
Sbjct: 108 IRDQSNCGSCWAVSAAETMSDRACIHSKG 136
>UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase
precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine
proteinase precursor - Heterodera glycines (Soybean cyst
nematode worm)
Length = 353
Score = 47.6 bits (108), Expect = 4e-04
Identities = 22/70 (31%), Positives = 33/70 (47%)
Frame = +2
Query: 287 HNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHF 466
HN +A + W + + EV+DQG CGSCWAF A A+ + +K
Sbjct: 123 HNMATLAGNSSTLPEKLDWREKGAVTEVKDQGDCGSCWAFSATGAI-EGALAQKKASKII 181
Query: 467 HFSAEDLLSC 496
S ++L+ C
Sbjct: 182 SLSEQNLVDC 191
>UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole
genome shotgun sequence; n=2; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_21,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 349
Score = 47.6 bits (108), Expect = 4e-04
Identities = 23/58 (39%), Positives = 34/58 (58%)
Frame = +2
Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICDWDAAEEC 532
++EV++QGSCGSCWAF AV A+ G K+ S ++L+ C + D +E C
Sbjct: 137 VSEVKNQGSCGSCWAFSAVAAL--ETALRQGGVKNVELSEQELVDCA-VKDEFESEGC 191
>UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1;
Phytophthora infestans|Rep: Cathepsin-like cysteine
protease - Phytophthora infestans (Potato late blight
fungus)
Length = 376
Score = 47.2 bits (107), Expect = 5e-04
Identities = 31/96 (32%), Positives = 46/96 (47%)
Frame = +2
Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481
+ LP +D W + T+ V++QG CGSCWAF AV AM C Y+ T +E
Sbjct: 130 VEDLPATWD----WREHSTVTPVKNQGQCGSCWAFSAVAAME---CAYALSTGTLESLSE 182
Query: 482 DLLSCCPICDWDAAEECRD*LGNIGSTSV*YQEVVT 589
L C + + + C + G S Y+E++T
Sbjct: 183 QELVDCTL---NGIDTC----NHGGEMSEGYEEIIT 211
>UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia
intestinalis|Rep: GLP_41_8294_9919 - Giardia lamblia
ATCC 50803
Length = 541
Score = 47.2 bits (107), Expect = 5e-04
Identities = 29/68 (42%), Positives = 38/68 (55%), Gaps = 5/68 (7%)
Frame = +2
Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN-----GTKHFHF 472
+LP++FD RD + V DQG+CGSC+ FGAV+AM R+ +N GTK
Sbjct: 240 TLPDDFDWRDV-NGVSYIPGVLDQGACGSCFTFGAVQAMNSRIMIATNRTDPVGTKTI-L 297
Query: 473 SAEDLLSC 496
S E L C
Sbjct: 298 STEHALDC 305
>UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2;
Trypanosoma cruzi|Rep: Cysteine proteinase, putative -
Trypanosoma cruzi
Length = 392
Score = 47.2 bits (107), Expect = 5e-04
Identities = 26/64 (40%), Positives = 32/64 (50%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
+P+ D R+ P L V+DQG CGSCWA GA E M + G H S + L
Sbjct: 141 IPDEVDYRNSSP--AILTAVKDQGRCGSCWAHGAAEEMESHFAILT-GRLHV-LSQQQLT 196
Query: 491 SCCP 502
SC P
Sbjct: 197 SCAP 200
>UniRef50_Q6XM39 Cluster: FirrV-1-A48 precursor; n=1; Feldmannia
irregularis virus a|Rep: FirrV-1-A48 precursor -
Feldmannia irregularis virus a
Length = 373
Score = 46.8 bits (106), Expect = 7e-04
Identities = 17/41 (41%), Positives = 25/41 (60%)
Frame = +2
Query: 374 DQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
DQGSC SCW+ V+ + DRV +NG S ++++SC
Sbjct: 80 DQGSCASCWSISVVQMLADRVSVSTNGKIKLKLSVQEMISC 120
>UniRef50_P81494 Cluster: Cathepsin B; n=2; Phasianidae|Rep:
Cathepsin B - Coturnix coturnix japonica (Japanese
quail)
Length = 48
Score = 46.8 bits (106), Expect = 7e-04
Identities = 16/25 (64%), Positives = 22/25 (88%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGS 385
LP+ FD R +WP+CPT++E+RDQGS
Sbjct: 1 LPDTFDSRKQWPNCPTISEIRDQGS 25
>UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin C;
n=1; Strongylocentrotus purpuratus|Rep: PREDICTED:
similar to cathepsin C - Strongylocentrotus purpuratus
Length = 482
Score = 46.4 bits (105), Expect = 0.001
Identities = 23/64 (35%), Positives = 35/64 (54%)
Frame = +2
Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
++LPE FD RD ++ VRDQG CGSC+AF + R+ +N S ++
Sbjct: 247 SNLPEKFDWRDVG-GIDYVSPVRDQGICGSCYAFASTATQESRLRVMTNNNVKVVMSPQE 305
Query: 485 LLSC 496
++SC
Sbjct: 306 VVSC 309
>UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gambiae
str. PEST|Rep: ENSANGP00000018713 - Anopheles gambiae
str. PEST
Length = 559
Score = 46.4 bits (105), Expect = 0.001
Identities = 20/38 (52%), Positives = 25/38 (65%)
Frame = +2
Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 415
+ LP +FD W D + EV++QGSCGSCWAF AV
Sbjct: 336 VGDLPRSFD----WRDHGAVTEVKNQGSCGSCWAFSAV 369
>UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease
containing protein; n=1; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 336
Score = 45.6 bits (103), Expect = 0.002
Identities = 22/62 (35%), Positives = 31/62 (50%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
LP +FD W D L++V+DQG CGSCWAF + + + FS + L+
Sbjct: 125 LPASFD----WRDYGILSDVKDQGQCGSCWAFSTTGIL--EALYFMENRQKISFSEQQLV 178
Query: 491 SC 496
C
Sbjct: 179 DC 180
>UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Rep:
Cysteine protease - Solanum lycopersicum (Tomato)
(Lycopersicon esculentum)
Length = 345
Score = 45.6 bits (103), Expect = 0.002
Identities = 24/83 (28%), Positives = 42/83 (50%), Gaps = 2/83 (2%)
Frame = +2
Query: 254 DEHFATLPIKTHNFDLIASLPENFDPRD-KWPDCPTLNEVRDQGSCGSCWAFGAVEAMTD 430
+ + + P+ + F I L +++ P + W + + +V+ QG CG CWAF AV ++
Sbjct: 107 NSYLSPSPMSSTEFKKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEG 166
Query: 431 RVCTYSNGTKH-FHFSAEDLLSC 496
Y T + FS ++LL C
Sbjct: 167 ---AYKIATGNLMEFSEQELLDC 186
>UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1;
Bigelowiella natans|Rep: Digestive cysteine proteinase -
Bigelowiella natans (Pedinomonas minutissima)
(Chlorarachnion sp.(strain CCMP 621))
Length = 360
Score = 45.6 bits (103), Expect = 0.002
Identities = 22/54 (40%), Positives = 26/54 (48%), Gaps = 2/54 (3%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT--KHFHFSAEDLLSC 496
W D L V+DQG CGSCWAF A +A+ N T S E L+ C
Sbjct: 115 WRDFNALTPVKDQGGCGSCWAFSATQALESAHYIKHNDTLDSPIALSTEQLVEC 168
>UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep:
Vivapain-4 - Plasmodium vivax
Length = 484
Score = 45.6 bits (103), Expect = 0.002
Identities = 19/52 (36%), Positives = 31/52 (59%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
W + ++E+++Q CGSCWAFGAV A+ + N +H S ++L+ C
Sbjct: 268 WREHNAVSEIKNQNLCGSCWAFGAVGAVESQYAIRKN--QHVLISEQELVDC 317
>UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Oryza
sativa|Rep: Cysteine protease 1 precursor - Oryza sativa
subsp. japonica (Rice)
Length = 490
Score = 45.6 bits (103), Expect = 0.002
Identities = 25/69 (36%), Positives = 37/69 (53%), Gaps = 7/69 (10%)
Frame = +2
Query: 239 NGSYRDEHFATLPI-------KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSC 397
NG +R + T P + + D + +LP++ D RDK + V++QG CGSC
Sbjct: 124 NGEFRATYLGTTPAGRGRRVGEAYRHDGVEALPDSVDWRDKGA---VVAPVKNQGQCGSC 180
Query: 398 WAFGAVEAM 424
WAF AV A+
Sbjct: 181 WAFSAVAAV 189
>UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease
containing protein; n=1; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 280
Score = 45.2 bits (102), Expect = 0.002
Identities = 22/64 (34%), Positives = 37/64 (57%)
Frame = +2
Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
+SLP+ FD W + + +V++QG+CGSCWAF + + + + N T +S ++
Sbjct: 66 SSLPQQFD----WRNLGKVTQVKNQGNCGSCWAF-TITGLFESINLIRNKTVEL-YSEQE 119
Query: 485 LLSC 496
LL C
Sbjct: 120 LLDC 123
>UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted,
possible transmembrane domain near N-terminus; n=4;
Cryptosporidium|Rep: Cryptopain-cysteine proteinase
secreted, possible transmembrane domain near N-terminus
- Cryptosporidium parvum Iowa II
Length = 401
Score = 45.2 bits (102), Expect = 0.002
Identities = 19/47 (40%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Frame = +2
Query: 317 ENFDPRDK--WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 451
E F P + W + +N +R+Q +CGSCWAF AV A+ C +N
Sbjct: 172 EEFVPPNSINWVEAGCVNPIRNQKNCGSCWAFSAVAALEGATCAQTN 218
>UniRef50_Q23H32 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 365
Score = 45.2 bits (102), Expect = 0.002
Identities = 26/72 (36%), Positives = 37/72 (51%)
Frame = +2
Query: 290 NFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFH 469
+F L S+PE+ D R+K + V+ QG CGSCWAF V A+ +
Sbjct: 128 SFLLSDSVPESVDWREK-----LVAPVQKQGGCGSCWAFSTVIALEGAYAKQTGNV--IK 180
Query: 470 FSAEDLLSCCPI 505
FS ++L+ CC I
Sbjct: 181 FSEQNLIDCCRI 192
>UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like
cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep:
Clan CA, family C1, cathepsin B-like cysteine peptidase
- Trichomonas vaginalis G3
Length = 288
Score = 45.2 bits (102), Expect = 0.002
Identities = 22/63 (34%), Positives = 35/63 (55%)
Frame = +2
Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
S+P +++ +++P C V DQG CGSCW+F ++ + R C N K FS L
Sbjct: 67 SIPMSYNFTERFPQCDF--GVLDQGKCGSCWSFAVSKSFSHRYCRKYN--KPVLFSQSHL 122
Query: 488 LSC 496
++C
Sbjct: 123 VAC 125
>UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia
tropicalis|Rep: Blo t 1 allergen - Blomia tropicalis
(Mite)
Length = 333
Score = 45.2 bits (102), Expect = 0.002
Identities = 21/35 (60%), Positives = 23/35 (65%)
Frame = +2
Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGA 412
SLP+NFD R K L +R QGSCGSCWAF A
Sbjct: 112 SLPQNFDWRQK----ARLTRIRQQGSCGSCWAFAA 142
>UniRef50_Q9UJW2 Cluster: Tubulointerstitial nephritis antigen;
n=20; Amniota|Rep: Tubulointerstitial nephritis antigen
- Homo sapiens (Human)
Length = 476
Score = 45.2 bits (102), Expect = 0.002
Identities = 23/72 (31%), Positives = 32/72 (44%)
Frame = +2
Query: 284 THNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKH 463
T + LPE F KWP + DQ +C + WAF DR+ S G
Sbjct: 208 TASLPATTDLPEFFVASYKWPGWT--HGPLDQKNCAASWAFSTASVAADRIAIQSKGRYT 265
Query: 464 FHFSAEDLLSCC 499
+ S ++L+SCC
Sbjct: 266 ANLSPQNLISCC 277
>UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin O;
n=1; Danio rerio|Rep: PREDICTED: similar to cathepsin O
- Danio rerio
Length = 327
Score = 44.8 bits (101), Expect = 0.003
Identities = 22/59 (37%), Positives = 29/59 (49%)
Frame = +2
Query: 320 NFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
N PR W D + V +QGSCG CWAF VEA+ + G K S + ++ C
Sbjct: 119 NNPPRFDWRDHGVVGPVHNQGSCGGCWAFSIVEAIES--VSAKVGEKLQQLSVQQVIDC 175
>UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag-RP
- Bombyx mori (Silk moth)
Length = 404
Score = 44.8 bits (101), Expect = 0.003
Identities = 22/61 (36%), Positives = 32/61 (52%)
Frame = +2
Query: 314 PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLS 493
P+ FD R +W ++ + DQ CGS WA + DR S GT++ S++ LLS
Sbjct: 186 PDEFDARREWYGY--ISPIADQDWCGSDWAVSIASIVGDRFSIQSFGTENVRMSSQTLLS 243
Query: 494 C 496
C
Sbjct: 244 C 244
>UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease
containing protein; n=1; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 344
Score = 44.4 bits (100), Expect = 0.004
Identities = 21/60 (35%), Positives = 30/60 (50%)
Frame = +2
Query: 317 ENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
+N P D W + + V+ QG CGSCW F A A+ + NG +FS + +L C
Sbjct: 134 KNAPPMD-WRNASAITPVKQQGKCGSCWTF-ASTAVLESFSFIKNGAPLTNFSEQQILDC 191
>UniRef50_Q7QVE9 Cluster: GLP_542_3431_1206; n=1; Giardia lamblia
ATCC 50803|Rep: GLP_542_3431_1206 - Giardia lamblia ATCC
50803
Length = 741
Score = 44.4 bits (100), Expect = 0.004
Identities = 30/82 (36%), Positives = 43/82 (52%), Gaps = 1/82 (1%)
Frame = +2
Query: 254 DEHFATLPIKTHNFDLI-ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTD 430
++ + LP N DL A+LP NF R ++ +QGSCG C+A AVE +T
Sbjct: 40 EDEYNELPDGPDNADLTRAALPTNFTYRGH-----RCIQIINQGSCGCCYAAAAVEMVTA 94
Query: 431 RVCTYSNGTKHFHFSAEDLLSC 496
R C N ++ S EDL++C
Sbjct: 95 RRCLQLNDSR--LVSLEDLVTC 114
>UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine
protease; n=11; Callosobruchus maculatus|Rep: Putative
gut cathepsin L-like cysteine protease - Callosobruchus
maculatus (Southern cowpea weevil) (Pulse bruchid)
Length = 326
Score = 44.0 bits (99), Expect = 0.005
Identities = 25/75 (33%), Positives = 36/75 (48%)
Frame = +2
Query: 272 LPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSN 451
LP +FD + W + + V+DQ +CGSCWAF AV A+ + N
Sbjct: 95 LPSNAVHFDNFEDIDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAIEGQFFK-KN 153
Query: 452 GTKHFHFSAEDLLSC 496
GT SA++L+ C
Sbjct: 154 GTL-VSLSAQELVDC 167
>UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor;
n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase
precursor - Diabrotica virgifera virgifera (western corn
rootworm)
Length = 326
Score = 44.0 bits (99), Expect = 0.005
Identities = 20/44 (45%), Positives = 26/44 (59%)
Frame = +2
Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 406
P H+ + LP FD R+K + EV+DQGSCGSCW+F
Sbjct: 98 PRVIHSLTPVKDLPSKFDWREKG----AVTEVKDQGSCGSCWSF 137
>UniRef50_Q23TW3 Cluster: Papain family cysteine protease containing
protein; n=5; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 437
Score = 44.0 bits (99), Expect = 0.005
Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 3/80 (3%)
Frame = +2
Query: 266 ATLPIKTHNFDL--IASLPENFDPRDKWPDCPTLNEVRDQGS-CGSCWAFGAVEAMTDRV 436
AT T +F ++ LP+ D R+K + +V+ QG CGSCWAF AV A+
Sbjct: 188 ATAQANTRSFRKYDLSQLPQYVDWREKG----VVTQVKSQGKDCGSCWAFAAVAALESHY 243
Query: 437 CTYSNGTKHFHFSAEDLLSC 496
G K FS + L+ C
Sbjct: 244 -ALKTGKKPIQFSEQQLVDC 262
>UniRef50_Q23RT7 Cluster: Papain family cysteine protease containing
protein; n=7; Hymenostomatida|Rep: Papain family
cysteine protease containing protein - Tetrahymena
thermophila SB210
Length = 387
Score = 44.0 bits (99), Expect = 0.005
Identities = 25/73 (34%), Positives = 37/73 (50%)
Frame = +2
Query: 278 IKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT 457
+KT + + LP++ D W D + V+DQG CGSCWAF A A+ + + G
Sbjct: 122 LKTSDKINVKDLPKSVD----WRDAGVVTPVKDQGHCGSCWAF-ATTAVIESYAAIATGQ 176
Query: 458 KHFHFSAEDLLSC 496
S + L+SC
Sbjct: 177 LK-TLSTQQLVSC 188
>UniRef50_A0CYS7 Cluster: Chromosome undetermined scaffold_31, whole
genome shotgun sequence; n=3; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_31,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 358
Score = 44.0 bits (99), Expect = 0.005
Identities = 20/62 (32%), Positives = 34/62 (54%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
+PE+++ R+ P+C + QG+C S ++ AV A +DR+C NG S + +
Sbjct: 131 IPESYNFREAQPECA--QPIYFQGNCSSSYSIAAVSATSDRLCKSKNGEFQDQLSPQSPI 188
Query: 491 SC 496
SC
Sbjct: 189 SC 190
>UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46;
Eukaryota|Rep: Cysteine proteinase 1 precursor - Zea
mays (Maize)
Length = 371
Score = 44.0 bits (99), Expect = 0.005
Identities = 25/74 (33%), Positives = 35/74 (47%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
LP++FD W D + V++QGSCGSCW+F A A+ Y K S + +
Sbjct: 137 LPDDFD----WRDHGAVGPVKNQGSCGSCWSFSASGALEG--AHYLATGKLEVLSEQQFV 190
Query: 491 SCCPICDWDAAEEC 532
C CD + C
Sbjct: 191 DCDHECDSSEPDSC 204
>UniRef50_P25774 Cluster: Cathepsin S precursor; n=78;
Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens
(Human)
Length = 331
Score = 44.0 bits (99), Expect = 0.005
Identities = 26/62 (41%), Positives = 38/62 (61%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
LP++ D R+K C T EV+ QGSCG+CWAF AV A+ ++ + K SA++L+
Sbjct: 115 LPDSVDWREK--GCVT--EVKYQGSCGACWAFSAVGALEAQLKLKTG--KLVSLSAQNLV 168
Query: 491 SC 496
C
Sbjct: 169 DC 170
>UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24;
Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa
(Rice)
Length = 339
Score = 43.6 bits (98), Expect = 0.006
Identities = 26/65 (40%), Positives = 34/65 (52%)
Frame = +2
Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481
I +LP D R K P ++DQG CG CWAF AV AM + + S G K S +
Sbjct: 120 IDTLPATVDWRTKGAVTP----IKDQGQCGCCWAFSAVAAM-EGIVKLSTG-KLISLSEQ 173
Query: 482 DLLSC 496
+L+ C
Sbjct: 174 ELVDC 178
>UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Rep:
Cathepsin L - Stylonychia lemnae
Length = 340
Score = 43.6 bits (98), Expect = 0.006
Identities = 18/44 (40%), Positives = 27/44 (61%)
Frame = +2
Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDR 433
+ +PE+ D R+K +N V+DQG CGSCWAF + ++ R
Sbjct: 122 LKDIPESIDWREKG----AVNAVKDQGQCGSCWAFSTIASLESR 161
>UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1;
Trypanosoma cruzi|Rep: Cysteine protease, putative -
Trypanosoma cruzi
Length = 434
Score = 43.6 bits (98), Expect = 0.006
Identities = 21/48 (43%), Positives = 29/48 (60%)
Frame = +2
Query: 353 PTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
P L V+DQGSCGSCWA A E++ + + S+G K S + + SC
Sbjct: 137 PVLTPVKDQGSCGSCWAHAATESV-ESMYAISSG-KLLTLSTQQITSC 182
>UniRef50_Q23FR0 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 394
Score = 43.6 bits (98), Expect = 0.006
Identities = 21/46 (45%), Positives = 26/46 (56%)
Frame = +2
Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
LN V+DQG CGSCW FGA M + +NG FS + L+ C
Sbjct: 196 LNPVKDQGQCGSCWTFGAAGVM-ESFNAITNGVLK-SFSEQQLVDC 239
>UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh
fly) (Boettcherisca peregrina). Cathepsin L; n=2;
Dictyostelium discoideum|Rep: Similar to Sarcophaga
peregrina (Flesh fly) (Boettcherisca peregrina).
Cathepsin L - Dictyostelium discoideum (Slime mold)
Length = 265
Score = 43.2 bits (97), Expect = 0.008
Identities = 25/74 (33%), Positives = 41/74 (55%)
Frame = +2
Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454
P K HN + A++P++FD W D + +V++QGSC SCW+F A+ A+ Y
Sbjct: 38 PFK-HNVN--ATIPKSFD----WRDHGAVGKVKNQGSCASCWSFSALGALEGHY--YIKY 88
Query: 455 TKHFHFSAEDLLSC 496
+ S ++L+ C
Sbjct: 89 GELLDLSEQNLVDC 102
>UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1;
Dictyostelium discoideum AX4|Rep: Counting factor
associated protein - Dictyostelium discoideum AX4
Length = 531
Score = 43.2 bits (97), Expect = 0.008
Identities = 24/70 (34%), Positives = 39/70 (55%)
Frame = +2
Query: 287 HNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHF 466
H+ + + S+P D R++ +C T V+DQG CGSCW FG+ ++ C +NG +
Sbjct: 301 HDDESLRSIPSTVDWRNQ--NCVT--PVKDQGICGSCWTFGSTGSLEGTNCV-TNG-ELV 354
Query: 467 HFSAEDLLSC 496
S + L+ C
Sbjct: 355 SLSEQQLVDC 364
>UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1;
Dictyostelium discoideum AX4|Rep: Putative
uncharacterized protein - Dictyostelium discoideum AX4
Length = 395
Score = 43.2 bits (97), Expect = 0.008
Identities = 22/55 (40%), Positives = 29/55 (52%), Gaps = 3/55 (5%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKH---FHFSAEDLLSC 496
W D T VRDQG C SCW FG++ A+ R NG H SA++ ++C
Sbjct: 194 WSDYQT--PVRDQGECKSCWVFGSLAALESRY-LIKNGVSEKSTLHLSAQNAMNC 245
>UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3;
Bilateria|Rep: Cathepsin L-like cysteine proteinase -
Longidorus elongatus
Length = 358
Score = 43.2 bits (97), Expect = 0.008
Identities = 23/68 (33%), Positives = 35/68 (51%), Gaps = 2/68 (2%)
Frame = +2
Query: 299 LIASLPENFDPRDK--WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHF 472
+I +P+N D W + +V+DQGSCGSCWAF A ++ + Y K
Sbjct: 129 MIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCWAFSATGSLEGQ--HYKQTGKLVSL 186
Query: 473 SAEDLLSC 496
S ++L+ C
Sbjct: 187 SEQNLVDC 194
>UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia
medanensis|Rep: Sui m 1 allergen - Suidasia medanensis
Length = 336
Score = 43.2 bits (97), Expect = 0.008
Identities = 23/67 (34%), Positives = 31/67 (46%)
Frame = +2
Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFS 475
D+ +LP FD R +W VR+QG CGSCWAF + + N H S
Sbjct: 110 DISVALPAAFDWRQQWNTA-----VRNQGQCGSCWAFATAATVEAQYAIRKN--VHVTLS 162
Query: 476 AEDLLSC 496
+ L+ C
Sbjct: 163 EQQLVDC 169
>UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor;
n=35; Fasciola|Rep: Cathepsin L-like proteinase
precursor - Fasciola hepatica (Liver fluke)
Length = 326
Score = 43.2 bits (97), Expect = 0.008
Identities = 19/52 (36%), Positives = 26/52 (50%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
W + + EV+DQG+CGSCWAF M + N FS + L+ C
Sbjct: 114 WRESGYVTEVKDQGNCGSCWAFSTTGTMEGQY--MKNERTSISFSEQQLVDC 163
>UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1;
Oryza sativa (japonica cultivar-group)|Rep: Putative
uncharacterized protein - Oryza sativa subsp. japonica
(Rice)
Length = 326
Score = 42.7 bits (96), Expect = 0.011
Identities = 17/41 (41%), Positives = 25/41 (60%)
Frame = +2
Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
+A++ + P W + + V+DQG CGSCWAF VEA+
Sbjct: 110 LAAVAGDAPPAWDWREHGAVTRVKDQGPCGSCWAFSVVEAV 150
>UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1;
Brugia malayi|Rep: Cathepsin F-like cysteine proteinase
- Brugia malayi (Filarial nematode worm)
Length = 461
Score = 42.7 bits (96), Expect = 0.011
Identities = 26/71 (36%), Positives = 38/71 (53%), Gaps = 1/71 (1%)
Frame = +2
Query: 287 HNFDL-IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKH 463
++F+L I +LP FD W + V+DQGSCGSCWAF +V + + G K
Sbjct: 239 NDFNLSIYNLPSKFD----WRTEGVVTPVKDQGSCGSCWAF-SVTGNIESLWAIKTG-KL 292
Query: 464 FHFSAEDLLSC 496
S ++L+ C
Sbjct: 293 ISLSEQELIDC 303
>UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1;
Uronema marinum|Rep: Cathepsin L-like cysteine protease
- Uronema marinum
Length = 333
Score = 42.7 bits (96), Expect = 0.011
Identities = 22/54 (40%), Positives = 32/54 (59%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCP 502
W + V++QG CGSCWAF AV ++ +R+ + G K FS + L+SC P
Sbjct: 126 WVSKGAVQGVQNQGVCGSCWAFSAVCSL-ERLYKINTG-KLLSFSEQQLVSCEP 177
>UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase;
n=21; Bilateria|Rep: Cathepsin L-like cysteine
proteinase - Globodera pallida
Length = 379
Score = 42.7 bits (96), Expect = 0.011
Identities = 24/48 (50%), Positives = 30/48 (62%), Gaps = 4/48 (8%)
Frame = +2
Query: 302 IASLPENFDPRDK-WPDCPTLNEVRDQGSCGSCWAF---GAVEAMTDR 433
+ LPE+ D RDK W + EV++QG CGSCWAF GA+EA R
Sbjct: 158 VGDLPESVDWRDKGW-----VTEVKNQGMCGSCWAFSSTGALEAQHAR 200
>UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas
foetus|Rep: TFCP2 protein - Tritrichomonas foetus
(Trichomonas foetus)
Length = 270
Score = 42.7 bits (96), Expect = 0.011
Identities = 22/61 (36%), Positives = 31/61 (50%)
Frame = +2
Query: 314 PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLS 493
P +FD W +N +++QGSCGSCWAF A+ A C + FS + L+
Sbjct: 51 PTSFD----WRSEGKVNPIKNQGSCGSCWAFSAIAAQES--CHAIATGELLRFSEQSLVD 104
Query: 494 C 496
C
Sbjct: 105 C 105
>UniRef50_Q24FN2 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 367
Score = 42.7 bits (96), Expect = 0.011
Identities = 20/52 (38%), Positives = 31/52 (59%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
W ++ V++QGSCGSCWAF AV A+ + V N + +S ++L+ C
Sbjct: 161 WRQSGAVSPVKNQGSCGSCWAFSAV-ALAESVNLLRNNSLAL-YSEQELVDC 210
>UniRef50_Q231X3 Cluster: Papain family cysteine protease containing
protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 323
Score = 42.7 bits (96), Expect = 0.011
Identities = 16/52 (30%), Positives = 26/52 (50%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
W + ++ V+ QG+CGSCWAF A ++ + K S + L+ C
Sbjct: 121 WVEAGKVSNVKSQGNCGSCWAFSATASVESALIIAGKVDKSISLSEQQLIDC 172
>UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like
cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep:
Clan CA, family C1, cathepsin L-like cysteine peptidase
- Trichomonas vaginalis G3
Length = 462
Score = 42.7 bits (96), Expect = 0.011
Identities = 18/43 (41%), Positives = 28/43 (65%)
Frame = +2
Query: 368 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
VRDQ +CGSCWA A EA++ ++ +S G +F S + ++ C
Sbjct: 242 VRDQANCGSCWAQSAGEAISSQISLHSKG--NFTVSIQQIMDC 282
>UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, whole
genome shotgun sequence; n=1; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_56,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 314
Score = 42.7 bits (96), Expect = 0.011
Identities = 27/83 (32%), Positives = 39/83 (46%), Gaps = 2/83 (2%)
Frame = +2
Query: 254 DEHFATLPI--KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMT 427
+E FA L + K +L A L P D + V++QG+CGSCWAF AV A+
Sbjct: 85 NEEFAALLLTRKESPMNLDAELYVPQGPLKASADWSKITSVKNQGNCGSCWAFSAVGAVE 144
Query: 428 DRVCTYSNGTKHFHFSAEDLLSC 496
+ +K S + L+ C
Sbjct: 145 TLLTIKGVISKDLWLSEQQLVDC 167
>UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, whole
genome shotgun sequence; n=1; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_23,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 321
Score = 42.7 bits (96), Expect = 0.011
Identities = 25/90 (27%), Positives = 41/90 (45%), Gaps = 5/90 (5%)
Frame = +2
Query: 242 GSYRDEHFATLPIKTHNFDLIASLPENFDP-----RDKWPDCPTLNEVRDQGSCGSCWAF 406
G D+ F T+ + + ++ +N +P W + ++DQG CGSCWAF
Sbjct: 87 GDLTDQEFLTIYLNLQMPARVKNIQKNEEPFLVQEEVDWVQKGKVPAIKDQGDCGSCWAF 146
Query: 407 GAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
AV A+ + T + S +DL+ C
Sbjct: 147 SAVGAL--EINTKIQFNEIVDLSEQDLVDC 174
>UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101,
whole genome shotgun sequence; n=2; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_101,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 306
Score = 42.7 bits (96), Expect = 0.011
Identities = 24/63 (38%), Positives = 37/63 (58%)
Frame = +2
Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
+LPE+ D K +N V++QG+CGS W+F AV A + + GT HF +S ++L
Sbjct: 109 NLPESVDWSSK------MNPVKNQGTCGSGWSFSAVGAF-EAFFIFVKGT-HFQYSEQNL 160
Query: 488 LSC 496
+ C
Sbjct: 161 VDC 163
>UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61;
Leishmania|Rep: Cysteine proteinase 2 precursor -
Leishmania pifanoi
Length = 444
Score = 42.7 bits (96), Expect = 0.011
Identities = 24/65 (36%), Positives = 37/65 (56%)
Frame = +2
Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481
++++P+ D R+K P V+DQG+CGSCWAF AV + + Y G + S +
Sbjct: 123 LSAVPDAVDWREKGAVTP----VKDQGACGSCWAFSAVGNIEGQ--WYLAGHELVSLSEQ 176
Query: 482 DLLSC 496
L+SC
Sbjct: 177 QLVSC 181
>UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163
precursor; n=4; Schizophora|Rep: Putative cysteine
proteinase CG12163 precursor - Drosophila melanogaster
(Fruit fly)
Length = 614
Score = 42.7 bits (96), Expect = 0.011
Identities = 23/62 (37%), Positives = 32/62 (51%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
LP+ FD R K + +V++QGSCGSCWAF + + K FS ++LL
Sbjct: 394 LPKEFDWRQK----DAVTQVKNQGSCGSCWAFSVTGNIEGLYAVKTGELK--EFSEQELL 447
Query: 491 SC 496
C
Sbjct: 448 DC 449
>UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;
n=1; Strongylocentrotus purpuratus|Rep: PREDICTED:
hypothetical protein - Strongylocentrotus purpuratus
Length = 331
Score = 42.3 bits (95), Expect = 0.015
Identities = 21/65 (32%), Positives = 36/65 (55%)
Frame = +2
Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481
+ ++P +D R P P + V++Q SCG+CWAF VE M ++ + + SA+
Sbjct: 124 LKTMPLVYDLRSIKP--PVVTPVKNQKSCGACWAFSVVETMETQIALKTK--RLTQLSAQ 179
Query: 482 DLLSC 496
+L+ C
Sbjct: 180 ELVDC 184
>UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like
cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
Clan CA, family C1, cathepsin L-like cysteine peptidase
- Trichomonas vaginalis G3
Length = 291
Score = 42.3 bits (95), Expect = 0.015
Identities = 20/51 (39%), Positives = 29/51 (56%), Gaps = 1/51 (1%)
Frame = +2
Query: 359 LNEVRDQGSCGSCWAFGAVEAM-TDRVCTYSNGTKHFHFSAEDLLSCCPIC 508
+N +RDQ CGSCWAFG V A ++ YSN + S ++++ C C
Sbjct: 90 VNPIRDQKQCGSCWAFGTVAACESNYALLYSNLPQ---LSEQNIIDCATTC 137
>UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4;
Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis
zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa
zea single nucleocapsid nuclear polyhedrosis virus)
Length = 367
Score = 42.3 bits (95), Expect = 0.015
Identities = 22/62 (35%), Positives = 31/62 (50%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
LP+ +D W D + ++DQG CGSCWAF A+ + + N K S + LL
Sbjct: 156 LPDYYD----WRDTNKVTPIKDQGVCGSCWAFVAIGNIESQYAIRHN--KLIDLSEQQLL 209
Query: 491 SC 496
C
Sbjct: 210 DC 211
>UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium
tetraurelia|Rep: Cathepsin L1 precursor - Paramecium
tetraurelia
Length = 314
Score = 42.3 bits (95), Expect = 0.015
Identities = 19/43 (44%), Positives = 27/43 (62%)
Frame = +2
Query: 368 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
V++QGSCGSCWAF AV A+ + T + + S +DL+ C
Sbjct: 126 VKNQGSCGSCWAFSAVGAL--EINTDIELNRKYELSEQDLVDC 166
>UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2;
Taeniidae|Rep: Cathepsin L-like cysteine proteinase -
Taenia solium (Pork tapeworm)
Length = 339
Score = 41.9 bits (94), Expect = 0.019
Identities = 23/64 (35%), Positives = 33/64 (51%)
Frame = +2
Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
A LP+ D RDK + EV++QG+CGSCWAF + A+ + K S +
Sbjct: 122 AGLPDTVDWRDK----NLVTEVKNQGNCGSCWAFSSTGALEGAFAKKTG--KLISLSEQQ 175
Query: 485 LLSC 496
L+ C
Sbjct: 176 LVDC 179
>UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia
bovis|Rep: Cysteine protease 2 - Babesia bovis
Length = 445
Score = 41.9 bits (94), Expect = 0.019
Identities = 21/59 (35%), Positives = 31/59 (52%)
Frame = +2
Query: 320 NFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
NF+ D W + V+DQG CGSCWAF AV ++ + + S ++L+SC
Sbjct: 236 NFEDID-WRRADAVTPVKDQGMCGSCWAFAAVGSVESLLKRQKTDVR---LSEQELVSC 290
>UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16;
Bromeliaceae|Rep: Fruit bromelain precursor - Ananas
comosus (Pineapple)
Length = 351
Score = 41.9 bits (94), Expect = 0.019
Identities = 21/65 (32%), Positives = 36/65 (55%)
Frame = +2
Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481
I+++P++ D W D +NEV++Q CGSCW+F A+ A + + G S +
Sbjct: 120 ISAVPQSID----WRDYGAVNEVKNQNPCGSCWSFAAI-ATVEGIYKIKTGYL-VSLSEQ 173
Query: 482 DLLSC 496
++L C
Sbjct: 174 EVLDC 178
>UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain;
n=16; Chrysomelidae|Rep: Digestive cysteine protease
intestain - Leptinotarsa decemlineata (Colorado potato
beetle)
Length = 326
Score = 41.5 bits (93), Expect = 0.026
Identities = 22/63 (34%), Positives = 31/63 (49%), Gaps = 2/63 (3%)
Frame = +2
Query: 314 PENFDPRDK--WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
PE+ + D W + + EV+DQ CGSCWAF A A+ + +N S + L
Sbjct: 105 PEDLEVPDSIDWTEKGAVLEVKDQNPCGSCWAFSATGALEGQNAILNN--VKISLSEQQL 162
Query: 488 LSC 496
L C
Sbjct: 163 LDC 165
>UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precursor;
n=3; Plasmodium (Laverania)|Rep: Trophozoite cysteine
proteinase precursor - Plasmodium falciparum
Length = 569
Score = 41.5 bits (93), Expect = 0.026
Identities = 24/72 (33%), Positives = 38/72 (52%)
Frame = +2
Query: 281 KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460
K + D+ + +PE D R+K ++E +DQG CGSCWAF +V + V N
Sbjct: 323 KRNEKDIFSKVPEILDYREKG----IVHEPKDQGLCGSCWAFASV-GNIESVFAKKN-KN 376
Query: 461 HFHFSAEDLLSC 496
FS ++++ C
Sbjct: 377 ILSFSEQEVVDC 388
>UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean
endopeptidase) (Cysteine proteinase)
(Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor
(EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase)
(Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
Vignain-2] - Vigna mungo (Rice bean) (Black gram)
Length = 362
Score = 41.5 bits (93), Expect = 0.026
Identities = 23/71 (32%), Positives = 37/71 (52%)
Frame = +2
Query: 284 THNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKH 463
T ++ + S+P + D R K + +V+DQG CGSCWAF + A+ +N K
Sbjct: 119 TFMYEKVGSVPASVDWRKKG----AVTDVKDQGQCGSCWAFSTIVAVEGINQIKTN--KL 172
Query: 464 FHFSAEDLLSC 496
S ++L+ C
Sbjct: 173 VSLSEQELVDC 183
>UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin W,
partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED:
similar to Cathepsin W, partial - Ornithorhynchus
anatinus
Length = 229
Score = 41.1 bits (92), Expect = 0.034
Identities = 22/67 (32%), Positives = 35/67 (52%), Gaps = 2/67 (2%)
Frame = +2
Query: 302 IASLPENFDPRDK--WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFS 475
+AS+PE ++ W + V++QGSCGSCWAF AV + + G + S
Sbjct: 59 MASIPEGPLRKETCDWRKRGAITSVKNQGSCGSCWAFAAV-GNAESMWYLRAGKRLVSLS 117
Query: 476 AEDLLSC 496
+++L C
Sbjct: 118 VQEVLDC 124
>UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;
Brugia malayi|Rep: Cahepsin L-like cysteine protease -
Brugia malayi (Filarial nematode worm)
Length = 371
Score = 41.1 bits (92), Expect = 0.034
Identities = 22/62 (35%), Positives = 32/62 (51%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
LP++ D W + +V+DQG CGSCW F AV A+ + + K S ++LL
Sbjct: 143 LPKSID----WRTSGAVTKVKDQGYCGSCWTFSAVGALEGQ--HFLQTGKLVELSMQNLL 196
Query: 491 SC 496
C
Sbjct: 197 DC 198
>UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;
n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
L-like proteinase" precursor - Diabrotica virgifera
virgifera (western corn rootworm)
Length = 315
Score = 41.1 bits (92), Expect = 0.034
Identities = 18/52 (34%), Positives = 28/52 (53%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
W D L V+DQG CGSCWAF ++ ++ + N + S ++L+ C
Sbjct: 117 WRDSAVLG-VKDQGQCGSCWAFSTTGSLEGQLAIHKN--QRVPLSEQELVDC 165
>UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain;
n=9; Cucujiformia|Rep: Digestive cysteine proteinase
intestain - Leptinotarsa decemlineata (Colorado potato
beetle)
Length = 326
Score = 41.1 bits (92), Expect = 0.034
Identities = 25/85 (29%), Positives = 38/85 (44%), Gaps = 2/85 (2%)
Frame = +2
Query: 248 YRDEHFATLPIKTHNFDLIASLPENFDPRDK--WPDCPTLNEVRDQGSCGSCWAFGAVEA 421
++DE + K + +A PE + D W + +V+ QG CGSCWAF A A
Sbjct: 83 FKDELRRQIKTKPNVEATLAVFPEGLEVPDSIDWTQKGAVLDVKYQGGCGSCWAFSATGA 142
Query: 422 MTDRVCTYSNGTKHFHFSAEDLLSC 496
+ + +N S + LL C
Sbjct: 143 LEGQNAIVNN--VKIPLSEQQLLDC 165
>UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-like
cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep:
Clan CA, family C1, cathepsin L or H-like cysteine
peptidase - Trichomonas vaginalis G3
Length = 435
Score = 41.1 bits (92), Expect = 0.034
Identities = 22/72 (30%), Positives = 37/72 (51%), Gaps = 1/72 (1%)
Frame = +2
Query: 284 THNFDLIASLPENFDPRDKWPDCPTLNEV-RDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460
T + D LPE+F W + P + + RDQ +CGSCWA A +++ ++ +N T
Sbjct: 204 TKHIDFKGDLPESFS----WRNLPNVVAMPRDQANCGSCWAQAAATSISSQISMRTNKTT 259
Query: 461 HFHFSAEDLLSC 496
S + ++ C
Sbjct: 260 --KVSVQQIVDC 269
>UniRef50_A2E6N1 Cluster: Clan CA, family C1, cathepsin B-like
cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
Clan CA, family C1, cathepsin B-like cysteine peptidase
- Trichomonas vaginalis G3
Length = 255
Score = 41.1 bits (92), Expect = 0.034
Identities = 21/78 (26%), Positives = 42/78 (53%)
Frame = +2
Query: 263 FATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCT 442
F I++ D+ +P+ ++ ++P C L + + CG C+A+G ++AM+ R+C
Sbjct: 15 FVDESIRSFPEDISIDIPDEYNFLQEYPHCD-LGPLTQE--CGCCYAYGPIKAMSHRICK 71
Query: 443 YSNGTKHFHFSAEDLLSC 496
N K SA+ +++C
Sbjct: 72 AKN--KKTFLSAQFIVAC 87
>UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep:
Cathepsin L - Kudoa thyrsites
Length = 300
Score = 41.1 bits (92), Expect = 0.034
Identities = 22/74 (29%), Positives = 37/74 (50%)
Frame = +2
Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454
P +T D+ ++LP + D W + V++QG CGSCW+F A A+ +
Sbjct: 90 PKETATKDIKSTLPSSVD----WKALGKVTSVKNQGHCGSCWSFSAAGAIESAYAIKTG- 144
Query: 455 TKHFHFSAEDLLSC 496
+ +FS + L+ C
Sbjct: 145 -ELVNFSEQQLVDC 157
>UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor;
n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1
precursor - Arabidopsis thaliana (Mouse-ear cress)
Length = 355
Score = 41.1 bits (92), Expect = 0.034
Identities = 24/65 (36%), Positives = 34/65 (52%)
Frame = +2
Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 481
I LP++ D R K P V+DQG CGSCWAF V A+ + + + G S +
Sbjct: 134 ITDLPKSVDWRKKGAVAP----VKDQGQCGSCWAFSTVAAV-EGINQITTGNLS-SLSEQ 187
Query: 482 DLLSC 496
+L+ C
Sbjct: 188 ELIDC 192
>UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5;
Theileria|Rep: Cysteine proteinase precursor - Theileria
parva
Length = 440
Score = 41.1 bits (92), Expect = 0.034
Identities = 21/67 (31%), Positives = 33/67 (49%)
Frame = +2
Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFS 475
DL EN D W ++ V+DQ +CG CWAF V ++ ++ + K + S
Sbjct: 224 DLAKLTGENLD----WRRSSSVTSVKDQSNCGGCWAFSTVGSVEGYYMSHFD--KSYELS 277
Query: 476 AEDLLSC 496
++LL C
Sbjct: 278 VQELLDC 284
>UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3;
Theileria|Rep: Cysteine proteinase precursor - Theileria
annulata
Length = 441
Score = 41.1 bits (92), Expect = 0.034
Identities = 17/53 (32%), Positives = 30/53 (56%), Gaps = 1/53 (1%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGS-CGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
W ++ ++DQG CGSCWAF ++ ++ Y N K + S ++L++C
Sbjct: 233 WARTDAVSPIKDQGDHCGSCWAFSSIASVESLYRLYKN--KSYFLSEQELVNC 283
>UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18;
Plasmodium|Rep: Cysteine proteinase precursor -
Plasmodium vivax (strain Salvador I)
Length = 583
Score = 41.1 bits (92), Expect = 0.034
Identities = 19/40 (47%), Positives = 27/40 (67%)
Frame = +2
Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 415
+L+A +PE D R+K ++E +DQG CGSCWAF +V
Sbjct: 334 NLLADVPEILDYREKG----IVHEPKDQGLCGSCWAFASV 369
>UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza
sativa|Rep: Os09g0497500 protein - Oryza sativa subsp.
japonica (Rice)
Length = 349
Score = 40.7 bits (91), Expect = 0.045
Identities = 24/62 (38%), Positives = 36/62 (58%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
LP++ D R K + EV++QG CGSCWAF AV A+ + + NG + S ++L+
Sbjct: 122 LPKSVDWRKKG----AVVEVKNQGDCGSCWAFSAVAAI-EGINQIKNG-ELVSLSEQELV 175
Query: 491 SC 496
C
Sbjct: 176 DC 177
>UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2;
Caenorhabditis|Rep: Putative uncharacterized protein -
Caenorhabditis elegans
Length = 328
Score = 40.7 bits (91), Expect = 0.045
Identities = 20/45 (44%), Positives = 27/45 (60%), Gaps = 1/45 (2%)
Frame = +2
Query: 311 LPENFDPRDKWPD-CPTLNEVRDQGSCGSCWAFGAVEAMTDRVCT 442
+P+ FD RD + D P + V+DQ CG CWAF A A+T+ T
Sbjct: 97 IPDYFDLRDIYVDGSPVVGPVKDQEQCGCCWAF-ATTAITEAANT 140
>UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba
histolytica|Rep: Cysteine protease 19 - Entamoeba
histolytica
Length = 324
Score = 40.7 bits (91), Expect = 0.045
Identities = 18/49 (36%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Frame = +2
Query: 359 LNEVRDQGSCGSCWAFGAVEAM-TDRVCTYSN-GTKHFHFSAEDLLSCC 499
+ V+DQG+CGSC+AF +V M T + +Y + ++ S +++SCC
Sbjct: 112 MTPVKDQGNCGSCYAFSSVALMETAVLLSYDDLSPSNYALSTAEIVSCC 160
>UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor;
n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
L-like proteinase precursor - Diabrotica virgifera
virgifera (western corn rootworm)
Length = 317
Score = 40.7 bits (91), Expect = 0.045
Identities = 19/39 (48%), Positives = 25/39 (64%)
Frame = +2
Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
++PE+ D R+K +N VRDQ CGSCWAF A A+
Sbjct: 103 TVPESIDWREKG----AVNPVRDQEQCGSCWAFSAAGAL 137
>UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor;
n=176; Viridiplantae|Rep: Cysteine proteinase RD21a
precursor - Arabidopsis thaliana (Mouse-ear cress)
Length = 462
Score = 40.7 bits (91), Expect = 0.045
Identities = 19/38 (50%), Positives = 24/38 (63%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
LPE+ D R K + EV+DQG CGSCWAF + A+
Sbjct: 137 LPESIDWRKKG----AVAEVKDQGGCGSCWAFSTIGAV 170
>UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|Rep:
Cathepsin L precursor - Schistosoma mansoni (Blood
fluke)
Length = 319
Score = 40.7 bits (91), Expect = 0.045
Identities = 17/35 (48%), Positives = 25/35 (71%)
Frame = +2
Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 406
+ ++P+NFD R+K + EV++QG CGSCWAF
Sbjct: 102 VNNIPKNFDWREKG----AVTEVKNQGMCGSCWAF 132
>UniRef50_UPI00006CB77B Cluster: Papain family cysteine protease
containing protein; n=1; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 382
Score = 40.3 bits (90), Expect = 0.059
Identities = 19/60 (31%), Positives = 34/60 (56%), Gaps = 1/60 (1%)
Frame = +2
Query: 320 NFDPRDKWPDCPTLNEVRDQGS-CGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
+F+ K+P C + + +QG C + ++ AV ++ DR+C S G +F SA+ +SC
Sbjct: 128 SFNFHTKYPQC--VRPIANQGKDCSASYSIAAVSSVADRLCMASEGDFNFGLSAQPTISC 185
>UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3;
Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays
(Maize)
Length = 493
Score = 40.3 bits (90), Expect = 0.059
Identities = 15/28 (53%), Positives = 19/28 (67%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
W + + EV+DQG CG CWAF AV A+
Sbjct: 170 WRERGAVAEVKDQGQCGGCWAFSAVAAV 197
>UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3;
Theileria|Rep: Cysteine protease, putative - Theileria
annulata
Length = 580
Score = 40.3 bits (90), Expect = 0.059
Identities = 19/52 (36%), Positives = 28/52 (53%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
W + +NEV +QGSCGSCWA + + + N K FS++ L+ C
Sbjct: 370 WRESGFVNEVVNQGSCGSCWAIASEDIFSTFKSIKKN--KLMKFSSQQLVDC 419
>UniRef50_Q22W19 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 332
Score = 40.3 bits (90), Expect = 0.059
Identities = 23/73 (31%), Positives = 33/73 (45%), Gaps = 1/73 (1%)
Frame = +2
Query: 281 KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460
K LI SL + P W + V++QG CGSCWAF V + Y+ T
Sbjct: 109 KRQKSHLIYSLKGDVAPSIDWRQKNAVTPVKNQGQCGSCWAFSTVGGLEG---AYAIATG 165
Query: 461 HF-HFSAEDLLSC 496
+ FS + ++ C
Sbjct: 166 NLTSFSEQQIVDC 178
>UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:
Aca s 1 allergen - Acarus siro (Dust mite)
Length = 331
Score = 40.3 bits (90), Expect = 0.059
Identities = 21/63 (33%), Positives = 31/63 (49%)
Frame = +2
Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
+LPE FD R K L + +QG CG+CWAF ++ + N H S ++L
Sbjct: 108 NLPETFDWRSK------LGPIENQGRCGACWAFASLATVEAAFAIKYN--THIRLSKQEL 159
Query: 488 LSC 496
+ C
Sbjct: 160 VEC 162
>UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1
precursor; n=20; Psoroptidia|Rep: Major mite fecal
allergen Der f 1 precursor - Dermatophagoides farinae
(House-dust mite)
Length = 321
Score = 40.3 bits (90), Expect = 0.059
Identities = 18/47 (38%), Positives = 24/47 (51%)
Frame = +2
Query: 356 TLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
T+ +R QG CGSCWAF V A Y N + S ++L+ C
Sbjct: 120 TVTPIRMQGGCGSCWAFSGVAATESAYLAYRNTS--LDLSEQELVDC 164
>UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2;
Nucleopolyhedrovirus|Rep: Viral cathepsin - Spodoptera
litura multicapsid nucleopolyhedrovirus (SpltMNPV)
Length = 337
Score = 40.3 bits (90), Expect = 0.059
Identities = 21/64 (32%), Positives = 31/64 (48%)
Frame = +2
Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
A PE+FD W + +V++QG CGSCWAF A+ + + + S +
Sbjct: 124 ARTPESFD----WRKLNKVTKVKEQGVCGSCWAFAAIGNIESQYAIMHDSL--IDLSEQQ 177
Query: 485 LLSC 496
LL C
Sbjct: 178 LLDC 181
>UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep:
MGC107932 protein - Xenopus tropicalis (Western clawed
frog) (Silurana tropicalis)
Length = 333
Score = 39.9 bits (89), Expect = 0.079
Identities = 22/76 (28%), Positives = 38/76 (50%), Gaps = 2/76 (2%)
Frame = +2
Query: 275 PIKTHNFDLIA-SLPENFDPRDKWPDCPTLNEVRDQGS-CGSCWAFGAVEAMTDRVCTYS 448
P+K ++ + ++P+ D W + V++QG+ CGSCWAF V M R C +
Sbjct: 102 PVKAESYSYTSITIPKEVD----WRKSNCVTPVKNQGTFCGSCWAFATVGVMESRYCIRT 157
Query: 449 NGTKHFHFSAEDLLSC 496
+ + S + L+ C
Sbjct: 158 K--ELLNLSEQQLVDC 171
>UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa
(japonica cultivar-group)|Rep: Os09g0562700 protein -
Oryza sativa subsp. japonica (Rice)
Length = 235
Score = 39.9 bits (89), Expect = 0.079
Identities = 19/46 (41%), Positives = 27/46 (58%)
Frame = +2
Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
+ EV+DQG CGSCWAF V A+ + + G K S ++L+ C
Sbjct: 21 VTEVKDQGRCGSCWAFSTV-AVVEGIQKIKKG-KLVSLSEQELVDC 64
>UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3;
Curculionidae|Rep: Cysteine proteinase - Hypera postica
(alfalfa weevil)
Length = 324
Score = 39.9 bits (89), Expect = 0.079
Identities = 18/44 (40%), Positives = 25/44 (56%)
Frame = +2
Query: 368 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC 499
V+DQG CGSCWAF ++ T+ +G K S + L+ CC
Sbjct: 127 VKDQGDCGSCWAF-SITGSTEGAYARKSG-KLVSLSEQQLIDCC 168
>UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep:
Toxopain-2 - Toxoplasma gondii
Length = 422
Score = 39.9 bits (89), Expect = 0.079
Identities = 20/67 (29%), Positives = 29/67 (43%)
Frame = +2
Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFS 475
+L+ LP W + V+DQ CGSCWAF A+ C + K S
Sbjct: 196 ELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEGAHCAKTG--KLVSLS 253
Query: 476 AEDLLSC 496
++L+ C
Sbjct: 254 EQELMDC 260
>UniRef50_O16454 Cluster: Temporarily assigned gene name protein
196; n=4; Bilateria|Rep: Temporarily assigned gene name
protein 196 - Caenorhabditis elegans
Length = 477
Score = 39.9 bits (89), Expect = 0.079
Identities = 17/32 (53%), Positives = 24/32 (75%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF 406
LPE+FD R+K + +V++QG+CGSCWAF
Sbjct: 264 LPESFDWREKG----AVTQVKNQGNCGSCWAF 291
>UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like
cysteine peptidase; n=17; Trichomonas vaginalis|Rep:
Clan CA, family C1, cathepsin L-like cysteine peptidase
- Trichomonas vaginalis G3
Length = 318
Score = 39.9 bits (89), Expect = 0.079
Identities = 13/27 (48%), Positives = 18/27 (66%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEA 421
W + +N ++DQ CGSCWAF V+A
Sbjct: 106 WRNAKIVNPIKDQAQCGSCWAFSVVQA 132
>UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin l;
n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
similar to cathepsin l - Strongylocentrotus purpuratus
Length = 489
Score = 39.5 bits (88), Expect = 0.10
Identities = 19/63 (30%), Positives = 32/63 (50%)
Frame = +2
Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
++P++ D W ++ V+DQ CGSCW+FG+ E + V + K S + L
Sbjct: 266 AVPDHID----WNVLGAVSPVKDQAVCGSCWSFGSAETIEGAV--FMQSGKRVRLSQQML 319
Query: 488 LSC 496
+ C
Sbjct: 320 MDC 322
>UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine
protease; n=1; Strongylocentrotus purpuratus|Rep:
PREDICTED: similar to cysteine protease -
Strongylocentrotus purpuratus
Length = 494
Score = 39.5 bits (88), Expect = 0.10
Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 1/51 (1%)
Frame = +2
Query: 275 PIKTHNFDLIASLPENFDPRD-KWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
P+K A++P+ P + W + V++QG CGSCWAF A+ M
Sbjct: 223 PLKKTGIKKQAAIPQGPVPEEYDWRTHGAVTPVKNQGMCGSCWAFSAIGNM 273
>UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin
L - Misgurnus mizolepis (Mud loach)
Length = 337
Score = 39.5 bits (88), Expect = 0.10
Identities = 18/52 (34%), Positives = 27/52 (51%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
W + + V+DQG CGSCWAF AM ++ + K S ++L+ C
Sbjct: 122 WREKGYVTPVKDQGECGSCWAFSTTGAMEGQM--FRKQGKLVSLSEQNLVDC 171
>UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba
histolytica|Rep: Cysteine protease 17 - Entamoeba
histolytica
Length = 420
Score = 39.5 bits (88), Expect = 0.10
Identities = 24/73 (32%), Positives = 34/73 (46%), Gaps = 5/73 (6%)
Frame = +2
Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGT-----K 460
D++ LPE D R L +R+Q CG CW+F +V A+ R N T +
Sbjct: 162 DIVKELPEGIDFRK----FGKLTYIREQTGCGGCWSFASVCALESRYLIDYNLTVDDVGR 217
Query: 461 HFHFSAEDLLSCC 499
+ S + LL CC
Sbjct: 218 TWALSEQQLLDCC 230
>UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n=1;
Myxobolus cerebralis|Rep: Cathepsin Z-like cysteine
proteinase - Myxobolus cerebralis
Length = 297
Score = 39.5 bits (88), Expect = 0.10
Identities = 21/68 (30%), Positives = 37/68 (54%), Gaps = 5/68 (7%)
Frame = +2
Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGS---CGSCWAFGAVEAMTDRVCTYSNGT--KHFHF 472
++P++FD W + L+ V++Q CGSCWAF + + DR+ N + HF
Sbjct: 49 NMPKSFD----WRENAYLSSVKNQHLPTYCGSCWAFASTSTIADRIYIAKNLSHFDHFSL 104
Query: 473 SAEDLLSC 496
S + +++C
Sbjct: 105 SVQVVIAC 112
>UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4;
Caenorhabditis|Rep: Cathepsin z protein 1 -
Caenorhabditis elegans
Length = 306
Score = 39.5 bits (88), Expect = 0.10
Identities = 26/79 (32%), Positives = 39/79 (49%), Gaps = 7/79 (8%)
Frame = +2
Query: 281 KTHNFDLIASLPENFDPRDKWPDCPTLNEV---RDQGS---CGSCWAFGAVEAMTDRV-C 439
+T +FD LP+ +D W D +N R+Q CGSCWAFGA A+ DR+
Sbjct: 56 ETEDFDS-EDLPKTWD----WRDANGINYASADRNQHIPQYCGSCWAFGATSALADRINI 110
Query: 440 TYSNGTKHFHFSAEDLLSC 496
N + S ++++ C
Sbjct: 111 KRKNAWPQAYLSVQEVIDC 129
>UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or
H-like cysteine peptidase; n=1; Trichomonas vaginalis
G3|Rep: Clan CA, family C1, cathepsin L, S or H-like
cysteine peptidase - Trichomonas vaginalis G3
Length = 473
Score = 39.5 bits (88), Expect = 0.10
Identities = 15/33 (45%), Positives = 22/33 (66%), Gaps = 1/33 (3%)
Frame = +2
Query: 341 WPDCPTL-NEVRDQGSCGSCWAFGAVEAMTDRV 436
W D P + + RDQ +CGSCWAFG E++ ++
Sbjct: 257 WRDVPNVVGKPRDQVACGSCWAFGTAESLESQL 289
>UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like
cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
Clan CA, family C1, cathepsin L-like cysteine peptidase
- Trichomonas vaginalis G3
Length = 452
Score = 39.5 bits (88), Expect = 0.10
Identities = 24/72 (33%), Positives = 39/72 (54%), Gaps = 1/72 (1%)
Frame = +2
Query: 284 THNFDLIASLPENFDPRDKWPDCPTLNEV-RDQGSCGSCWAFGAVEAMTDRVCTYSNGTK 460
T++ +I +LPE+F W + P + E DQ CG+C+AFGA EA+ + +N +
Sbjct: 216 TYDQKVIQNLPESFS----WRNVPYVLEYPHDQAVCGTCFAFGASEAINGQFSLRAN--R 269
Query: 461 HFHFSAEDLLSC 496
S + L+ C
Sbjct: 270 SIITSVQQLVDC 281
>UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, whole
genome shotgun sequence; n=7; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_22,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 350
Score = 39.5 bits (88), Expect = 0.10
Identities = 23/57 (40%), Positives = 32/57 (56%), Gaps = 6/57 (10%)
Frame = +2
Query: 254 DEHFA----TLPIKTHNFDLIASLPENFD--PRDKWPDCPTLNEVRDQGSCGSCWAF 406
DE FA TL + + ++ + EN + P D W +N+V+DQG CGSCWAF
Sbjct: 114 DEEFAATYLTLKVNPDDLEVPKAQFENVNATPID-WRTRGAVNKVKDQGQCGSCWAF 169
>UniRef50_Q2FLC7 Cluster: Periplasmic copper-binding precursor; n=1;
Methanospirillum hungatei JF-1|Rep: Periplasmic
copper-binding precursor - Methanospirillum hungatei
(strain JF-1 / DSM 864)
Length = 1092
Score = 39.5 bits (88), Expect = 0.10
Identities = 18/48 (37%), Positives = 26/48 (54%)
Frame = +2
Query: 281 KTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
K + ++A P FD RD + +RDQG GSCW F AV+++
Sbjct: 77 KIRSLSILADYPSKFDLRDS----KRVPAIRDQGQSGSCWDFAAVKSL 120
>UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27;
Eukaryota|Rep: Cysteine proteinase GP-I - Zingiber
officinale (Ginger)
Length = 221
Score = 39.5 bits (88), Expect = 0.10
Identities = 18/38 (47%), Positives = 25/38 (65%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
LP++ D R+K P V++QG CGSCWAF A+ A+
Sbjct: 3 LPDSIDWREKGAVVP----VKNQGGCGSCWAFDAIAAV 36
>UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease
containing protein; n=1; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 360
Score = 39.1 bits (87), Expect = 0.14
Identities = 23/66 (34%), Positives = 34/66 (51%)
Frame = +2
Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
+LP +FD RDK P V+ Q CG CWAF V+++ + + G K S + +
Sbjct: 130 NLPASFDWRDKGAITP----VKVQNGCGGCWAFSTVQSI-EGLYFLKTG-KLESLSTQQV 183
Query: 488 LSCCPI 505
+ CC I
Sbjct: 184 IDCCRI 189
>UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba
culbertsoni|Rep: Cysteine proteinase - Acanthamoeba
culbertsoni
Length = 482
Score = 39.1 bits (87), Expect = 0.14
Identities = 22/43 (51%), Positives = 27/43 (62%), Gaps = 3/43 (6%)
Frame = +2
Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF---GAVEAM 424
AS+P N+D R K P V++QGSC SCWAF GAVE +
Sbjct: 154 ASIPANWDWRTKGAVTP----VKNQGSCASCWAFVATGAVEGV 192
>UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease
Gip1p; n=4; Tetrahymena thermophila|Rep:
Granule-biosynthesis induced protease Gip1p -
Tetrahymena thermophila
Length = 345
Score = 39.1 bits (87), Expect = 0.14
Identities = 19/60 (31%), Positives = 30/60 (50%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICDWDA 520
W LN V++QG+CGSCW F A + + N + FS + L+ C + +D+
Sbjct: 139 WRKRGVLNPVKNQGTCGSCWTF-ATAGILESFNQIKN-KQLLKFSEQQLVDCVSLAGYDS 196
>UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lamblia
ATCC 50803|Rep: GLP_163_69918_68548 - Giardia lamblia
ATCC 50803
Length = 456
Score = 39.1 bits (87), Expect = 0.14
Identities = 21/59 (35%), Positives = 32/59 (54%)
Frame = +2
Query: 239 NGSYRDEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 415
+G+ R + T P+ T + +P ++D R+ P V+DQG CGSCWAFG +
Sbjct: 58 SGTCRQVYTLTDPLST-----LPEIPTSYDLREAGLQVP----VKDQGVCGSCWAFGTM 107
>UniRef50_Q23VA1 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 397
Score = 39.1 bits (87), Expect = 0.14
Identities = 18/46 (39%), Positives = 27/46 (58%)
Frame = +2
Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
++ V+DQG CG CWAF A A+ + V N T +S ++L+ C
Sbjct: 192 VSPVKDQGRCGCCWAFSAT-ALAESVNLMRNNTLQ-QYSEQELVDC 235
>UniRef50_Q23FQ5 Cluster: Papain family cysteine protease containing
protein; n=4; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 332
Score = 39.1 bits (87), Expect = 0.14
Identities = 20/66 (30%), Positives = 29/66 (43%)
Frame = +2
Query: 320 NFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCC 499
N+ W + LN +++QG CGSC AFG + Y + FS + LL C
Sbjct: 124 NYPTSVDWRNSGALNPIQNQGQCGSCAAFGTAGVLES--FYYLKSKQLLKFSEQQLLDCA 181
Query: 500 PICDWD 517
+D
Sbjct: 182 RQAGFD 187
>UniRef50_Q22DX2 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 358
Score = 39.1 bits (87), Expect = 0.14
Identities = 22/63 (34%), Positives = 30/63 (47%)
Frame = +2
Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
S+P ++D R P L V +QG CGSCWAF A+ N T + S + L
Sbjct: 146 SIPSSWDIRTDGPGL--LQPVENQGQCGSCWAFSTSGAVESYYSAKKNIT--LNLSKQQL 201
Query: 488 LSC 496
+ C
Sbjct: 202 VDC 204
>UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like
cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
Clan CA, family C1, cathepsin L-like cysteine peptidase
- Trichomonas vaginalis G3
Length = 493
Score = 39.1 bits (87), Expect = 0.14
Identities = 21/63 (33%), Positives = 31/63 (49%), Gaps = 1/63 (1%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFH-FSAEDL 487
LP F R+ + + + RDQ +CGSCWAFG E + + +K FH S +
Sbjct: 266 LPRTFSWRN---NTQVVGKPRDQVACGSCWAFGTAEVLEG---AFGIASKEFHEVSTNQI 319
Query: 488 LSC 496
+ C
Sbjct: 320 MDC 322
>UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|Rep:
CG4847-PD, isoform D - Drosophila melanogaster (Fruit
fly)
Length = 420
Score = 39.1 bits (87), Expect = 0.14
Identities = 21/68 (30%), Positives = 35/68 (51%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
+P+ FD W + + V+ QG+CGSCWAF A+ T+ + S ++L+
Sbjct: 203 IPDAFD----WREHGGVTPVKFQGTCGSCWAFATTGAIEGH--TFRKTGSLPNLSEQNLV 256
Query: 491 SCCPICDW 514
C P+ D+
Sbjct: 257 DCGPVEDF 264
>UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1;
Methanospirillum hungatei JF-1|Rep: Peptidase C1A,
papain precursor - Methanospirillum hungatei (strain
JF-1 / DSM 864)
Length = 1096
Score = 39.1 bits (87), Expect = 0.14
Identities = 18/37 (48%), Positives = 23/37 (62%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEA 421
LP +FD R+ D T +++QGSCGSCWAF A
Sbjct: 321 LPTSFDWRNNGGDYTT--PIKNQGSCGSCWAFATTGA 355
>UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F
like protease; n=1; Nasonia vitripennis|Rep: PREDICTED:
similar to cathepsin F like protease - Nasonia
vitripennis
Length = 1036
Score = 38.7 bits (86), Expect = 0.18
Identities = 17/36 (47%), Positives = 23/36 (63%), Gaps = 1/36 (2%)
Frame = +2
Query: 302 IASLPENFDPRD-KWPDCPTLNEVRDQGSCGSCWAF 406
+A++P+ P D W + V+DQGSCGSCWAF
Sbjct: 809 MATIPDIELPSDYDWRHHNVVTPVKDQGSCGSCWAF 844
>UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823
protein, partial; n=1; Ornithorhynchus anatinus|Rep:
PREDICTED: similar to MGC81823 protein, partial -
Ornithorhynchus anatinus
Length = 361
Score = 38.7 bits (86), Expect = 0.18
Identities = 14/24 (58%), Positives = 17/24 (70%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAFGA 412
W D + V+DQG CGSCWAFG+
Sbjct: 196 WRDHGYVTPVKDQGRCGSCWAFGS 219
>UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep:
Cathepsin K - Danio rerio (Zebrafish) (Brachydanio
rerio)
Length = 333
Score = 38.7 bits (86), Expect = 0.18
Identities = 18/43 (41%), Positives = 26/43 (60%)
Frame = +2
Query: 296 DLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
D + LP++ D R + V++QGSCGSCWAF +V A+
Sbjct: 113 DRVGKLPKSIDYRK----LGYVTSVKNQGSCGSCWAFSSVGAL 151
>UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel
protein - Danio rerio (Zebrafish) (Brachydanio rerio)
Length = 328
Score = 38.7 bits (86), Expect = 0.18
Identities = 19/55 (34%), Positives = 31/55 (56%)
Frame = +2
Query: 332 RDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
R W + ++ V++QG CGSCWAF AV ++ ++ + SA++LL C
Sbjct: 116 RVNWTEHGMVSPVQNQGPCGSCWAFSAVGSLEAQMKRRTAAL--VPLSAQNLLDC 168
>UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4;
Hymenostomatida|Rep: Phospholipase A1 - Tetrahymena
thermophila
Length = 320
Score = 38.7 bits (86), Expect = 0.18
Identities = 23/93 (24%), Positives = 45/93 (48%), Gaps = 1/93 (1%)
Frame = +2
Query: 257 EHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRV 436
+ F TL K ++ ++ + E + W + V++QGSCGSCWAF + A+ +
Sbjct: 92 QQFLTLHEKVNSTEVYRAQGEATEV--DWTAKGKVTPVKNQGSCGSCWAFSTIGAVESAL 149
Query: 437 CTYSNGTKH-FHFSAEDLLSCCPICDWDAAEEC 532
G ++ + + ++ + C +D +E C
Sbjct: 150 WIAGQGEQNTLNLAEQEQVDCAKSPKYD-SEGC 181
>UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1;
Rhipicephalus appendiculatus|Rep: Midgut cysteine
proteinase 2 - Rhipicephalus appendiculatus (Brown ear
tick)
Length = 564
Score = 38.7 bits (86), Expect = 0.18
Identities = 19/47 (40%), Positives = 23/47 (48%)
Frame = +2
Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAV 415
P H F A LP+ D W + V+DQ CGSCW+FG V
Sbjct: 335 PFPRHRFT--AKLPDQID----WRPYGAVTPVKDQAVCGSCWSFGTV 375
>UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1;
Rhipicephalus appendiculatus|Rep: Midgut cysteine
proteinase 4 - Rhipicephalus appendiculatus (Brown ear
tick)
Length = 345
Score = 38.7 bits (86), Expect = 0.18
Identities = 16/53 (30%), Positives = 29/53 (54%)
Frame = +2
Query: 338 KWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
+W + + V++QG CGSCWAF + A+ +V + + S ++L+ C
Sbjct: 131 EWRENGFVTPVKNQGQCGSCWAFSSTGALEGQV--FKRTRRLISLSEQNLMDC 181
>UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3;
Dictyostelium discoideum AX4|Rep: Putative
uncharacterized protein - Dictyostelium discoideum AX4
Length = 664
Score = 38.7 bits (86), Expect = 0.18
Identities = 17/52 (32%), Positives = 28/52 (53%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
W +++V++QGSCGSC+AF V A+ Y + S ++L+ C
Sbjct: 476 WRTWGMVSKVKNQGSCGSCYAFSTVGALESHY--YRKNNRMLDLSEQNLVDC 525
>UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes
vastus|Rep: Cathepsin L - Aphrocallistes vastus
Length = 329
Score = 38.7 bits (86), Expect = 0.18
Identities = 17/52 (32%), Positives = 27/52 (51%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
W + V++QG CGSCW+F A ++ + S K FS ++L+ C
Sbjct: 121 WRSKGVVTPVKNQGQCGSCWSFSATGSLEGQYAIKSG--KLVSFSEQELVDC 170
>UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Rep:
Cysteine protease - Babesia equi
Length = 438
Score = 38.7 bits (86), Expect = 0.18
Identities = 18/52 (34%), Positives = 30/52 (57%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
W + V+DQG+CGSCWAF AV ++ + + G + S ++L++C
Sbjct: 230 WRKLNGVTPVKDQGNCGSCWAFAAVGSV-ESLYLIKKG-QALDLSEQELVNC 279
>UniRef50_A5KBM2 Cluster: Serine-repeat antigen; n=3;
Plasmodium|Rep: Serine-repeat antigen - Plasmodium vivax
Length = 1014
Score = 38.7 bits (86), Expect = 0.18
Identities = 21/62 (33%), Positives = 31/62 (50%), Gaps = 3/62 (4%)
Frame = +2
Query: 320 NFDPRDKWPD---CPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
N++ D+W D C + EV +QG+CG CW F + + C G HF SA +
Sbjct: 555 NYEYCDRWKDKTSCISNIEVEEQGNCGLCWVFASKLHLETIRC--MRGYGHFRSSALYVA 612
Query: 491 SC 496
+C
Sbjct: 613 NC 614
>UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3;
Dictyostelium discoideum|Rep: Cysteine proteinase 1
precursor - Dictyostelium discoideum (Slime mold)
Length = 343
Score = 38.7 bits (86), Expect = 0.18
Identities = 25/89 (28%), Positives = 42/89 (47%), Gaps = 2/89 (2%)
Frame = +2
Query: 272 LPIKTHNFD-LIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYS 448
LP+ + D I S+P FD W + V++QG CGSCW+F + + +
Sbjct: 104 LPVADYLDDEFINSIPTAFD----WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQ--HFI 157
Query: 449 NGTKHFHFSAEDLLSCCPIC-DWDAAEEC 532
+ K S ++L+ C C +++ E C
Sbjct: 158 SQNKLVSLSEQNLVDCDHECMEYEGEEAC 186
>UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18;
Magnoliophyta|Rep: Thiol protease aleurain precursor -
Arabidopsis thaliana (Mouse-ear cress)
Length = 358
Score = 38.7 bits (86), Expect = 0.18
Identities = 20/42 (47%), Positives = 26/42 (61%), Gaps = 3/42 (7%)
Frame = +2
Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAF---GAVEA 421
A+LPE D W + ++ V+DQG CGSCW F GA+EA
Sbjct: 139 AALPETKD----WREDGIVSPVKDQGGCGSCWTFSTTGALEA 176
>UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin O
precursor; n=1; Tribolium castaneum|Rep: PREDICTED:
similar to Cathepsin O precursor - Tribolium castaneum
Length = 326
Score = 38.3 bits (85), Expect = 0.24
Identities = 18/64 (28%), Positives = 33/64 (51%)
Frame = +2
Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
A++P D R+K + + +QGSCG+CWA+ +E + +N K S ++
Sbjct: 119 ATVPNKVDWREK----NAVTRIYNQGSCGACWAYSVIETVESMNAIKTN--KSEELSVQE 172
Query: 485 LLSC 496
++ C
Sbjct: 173 IIDC 176
>UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome
shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome 20 SCAF14744, whole genome shotgun sequence -
Tetraodon nigroviridis (Green puffer)
Length = 175
Score = 38.3 bits (85), Expect = 0.24
Identities = 18/41 (43%), Positives = 23/41 (56%)
Frame = +2
Query: 302 IASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
I LP FD W D + V++Q +CGSCWAF V A+
Sbjct: 56 IKGLPARFD----WRDNAVVGPVQNQQACGSCWAFSVVGAV 92
>UniRef50_A6GBK8 Cluster: Peptidase C1A, papain; n=1; Plesiocystis
pacifica SIR-1|Rep: Peptidase C1A, papain - Plesiocystis
pacifica SIR-1
Length = 650
Score = 38.3 bits (85), Expect = 0.24
Identities = 18/46 (39%), Positives = 24/46 (52%)
Frame = +2
Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
L +R+QG+CGSCWAF AV + + G S + LSC
Sbjct: 176 LGAIRNQGACGSCWAFAAVSTIEASNAIVNGGRS--DLSEQHALSC 219
>UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza
sativa|Rep: Cysteine proteinase-like - Oryza sativa
subsp. japonica (Rice)
Length = 360
Score = 38.3 bits (85), Expect = 0.24
Identities = 20/52 (38%), Positives = 28/52 (53%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
W + EV++Q SCGSCWAF AV A T+ + + G S + +L C
Sbjct: 143 WRARGAVTEVKNQRSCGSCWAFAAV-AATEGLVQLATGNL-VSLSEQQVLDC 192
>UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4;
core eudicotyledons|Rep: Papain-like cysteine peptidase
XBCP3 - Arabidopsis thaliana (Mouse-ear cress)
Length = 437
Score = 38.3 bits (85), Expect = 0.24
Identities = 14/28 (50%), Positives = 18/28 (64%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
W + V+DQGSCG+CW+F A AM
Sbjct: 124 WRKKGAVTNVKDQGSCGACWSFSATGAM 151
>UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza
sativa|Rep: Os01g0347600 protein - Oryza sativa subsp.
japonica (Rice)
Length = 343
Score = 38.3 bits (85), Expect = 0.24
Identities = 14/19 (73%), Positives = 17/19 (89%)
Frame = +2
Query: 368 VRDQGSCGSCWAFGAVEAM 424
V+DQG+CGSCWAF AV A+
Sbjct: 140 VKDQGACGSCWAFAAVAAI 158
>UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6;
Magnoliophyta|Rep: Cysteine protease Cp5 - Actinidia
deliciosa (Kiwi)
Length = 509
Score = 38.3 bits (85), Expect = 0.24
Identities = 18/52 (34%), Positives = 28/52 (53%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
W + V+DQG CGSCWAF + A+ + + +NG S ++L+ C
Sbjct: 153 WRKYGIVTGVKDQGDCGSCWAFSSTGAI-EGINALANGDL-ISLSEQELVDC 202
>UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1;
Oryza sativa (japonica cultivar-group)|Rep: Putative
uncharacterized protein - Oryza sativa subsp. japonica
(Rice)
Length = 289
Score = 38.3 bits (85), Expect = 0.24
Identities = 14/19 (73%), Positives = 17/19 (89%)
Frame = +2
Query: 368 VRDQGSCGSCWAFGAVEAM 424
V+DQG+CGSCWAF AV A+
Sbjct: 139 VKDQGACGSCWAFAAVAAI 157
>UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia
ATCC 50803|Rep: GLP_567_6496_7413 - Giardia lamblia ATCC
50803
Length = 305
Score = 38.3 bits (85), Expect = 0.24
Identities = 19/64 (29%), Positives = 29/64 (45%)
Frame = +2
Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
A P+ D R P+C E DQ C C+AF + A++ R C + S +
Sbjct: 79 AGSPDRLDYRQTHPEC--FFEPEDQKECSCCYAFATLGALSTRRCIAKLDPQAVSLSVQH 136
Query: 485 LLSC 496
++SC
Sbjct: 137 MVSC 140
>UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma
japonicum|Rep: SJCHGC06231 protein - Schistosoma
japonicum (Blood fluke)
Length = 372
Score = 38.3 bits (85), Expect = 0.24
Identities = 20/64 (31%), Positives = 31/64 (48%)
Frame = +2
Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
A LP+ D W + V++QG CGSCWAF + A+ + Y + + S +
Sbjct: 148 AKLPDRVD----WRRNGAVTPVKNQGQCGSCWAFSSTGAIEGQ--HYRKTNRLVNLSEQQ 201
Query: 485 LLSC 496
L+ C
Sbjct: 202 LIDC 205
>UniRef50_Q24E33 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 328
Score = 38.3 bits (85), Expect = 0.24
Identities = 18/52 (34%), Positives = 26/52 (50%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
W + V++QGSCGSCWAF A+ +N + FS + L+ C
Sbjct: 133 WTAQGAVTPVKNQGSCGSCWAFSTTGALEGSYFLKNN--QLISFSEQQLVDC 182
>UniRef50_Q23FS6 Cluster: Papain family cysteine protease containing
protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 330
Score = 38.3 bits (85), Expect = 0.24
Identities = 15/52 (28%), Positives = 24/52 (46%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
W ++ V+DQG CGSCWAF ++ + + S + L+ C
Sbjct: 123 WVTRGKVSAVKDQGQCGSCWAFSTTGSVESALIIAGYANQTIDLSEQQLVDC 174
>UniRef50_Q22AB1 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 344
Score = 38.3 bits (85), Expect = 0.24
Identities = 24/64 (37%), Positives = 32/64 (50%)
Frame = +2
Query: 305 ASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAED 484
+ LPE+FD RDK P + Q +CGSCW F A + + G + HFS +
Sbjct: 129 SDLPESFDWRDKGIITPA----KFQNTCGSCWTF-ATTGVIESQYALKYG-ELLHFSEQM 182
Query: 485 LLSC 496
LL C
Sbjct: 183 LLDC 186
>UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathepsin
o - Aedes aegypti (Yellowfever mosquito)
Length = 375
Score = 38.3 bits (85), Expect = 0.24
Identities = 19/39 (48%), Positives = 23/39 (58%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMT 427
LP+ D RDK P VR QGSCG+CWA V+ +T
Sbjct: 153 LPKVVDWRDKGVVAP----VRSQGSCGACWAISVVDTIT 187
>UniRef50_O96163 Cluster: Cysteine protease, putative; n=5;
Plasmodium|Rep: Cysteine protease, putative - Plasmodium
falciparum (isolate 3D7)
Length = 946
Score = 38.3 bits (85), Expect = 0.24
Identities = 29/87 (33%), Positives = 40/87 (45%), Gaps = 3/87 (3%)
Frame = +2
Query: 335 DKWPD---CPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPI 505
D+W D C + EV +QG+CG CW F + C G HF SA + +C
Sbjct: 529 DRWKDKTGCISKIEVEEQGNCGLCWIFASKLHFETIRC--MRGYGHFRSSALYVANC--- 583
Query: 506 CDWDAAEECRD*LGNIGSTSV*YQEVV 586
D D+ E C +GS V + E+V
Sbjct: 584 SDRDSDEIC-----FVGSNPVEFLEIV 605
>UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra
erinaceieuropaei|Rep: Cysteine proteinase - Spirometra
erinaceieuropaei (Tapeworm)
Length = 336
Score = 38.3 bits (85), Expect = 0.24
Identities = 18/62 (29%), Positives = 29/62 (46%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
L EN W + + V++QG CGSCW+F A A+ + + + S + L+
Sbjct: 117 LKENLPDSVNWRERGAVTSVKNQGQCGSCWSFSANGAIEGAIQIKTGALR--SLSEQQLM 174
Query: 491 SC 496
C
Sbjct: 175 DC 176
>UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16;
Plasmodium (Vinckeia)|Rep: Cysteine proteinase precursor
- Plasmodium vinckei
Length = 506
Score = 38.3 bits (85), Expect = 0.24
Identities = 25/83 (30%), Positives = 42/83 (50%), Gaps = 8/83 (9%)
Frame = +2
Query: 272 LPIKTH--NFDLIA------SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMT 427
+P+K H N +LI+ P++ D R K+ P +DQG+CGSCWAF A+
Sbjct: 242 VPLKKHLANTNLISVDNKSKDFPDSRDYRSKFNFLPP----KDQGNCGSCWAFAAI-GNF 296
Query: 428 DRVCTYSNGTKHFHFSAEDLLSC 496
+ + ++ FS + ++ C
Sbjct: 297 EYLYVHTRHEMPISFSEQQMVDC 319
>UniRef50_P43234 Cluster: Cathepsin O precursor; n=22;
Euteleostomi|Rep: Cathepsin O precursor - Homo sapiens
(Human)
Length = 321
Score = 38.3 bits (85), Expect = 0.24
Identities = 19/39 (48%), Positives = 23/39 (58%)
Frame = +2
Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
SLP FD RDK + +VR+Q CG CWAF V A+
Sbjct: 107 SLPLRFDWRDK----QVVTQVRNQQMCGGCWAFSVVGAV 141
>UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19;
Bilateria|Rep: Cathepsin F precursor - Homo sapiens
(Human)
Length = 484
Score = 38.3 bits (85), Expect = 0.24
Identities = 20/56 (35%), Positives = 28/56 (50%)
Frame = +2
Query: 329 PRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
P W + +V+DQG CGSCWAF +V + + GT S ++LL C
Sbjct: 273 PEWDWRSKGAVTKVKDQGMCGSCWAF-SVTGNVEGQWFLNQGTL-LSLSEQELLDC 326
>UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2;
Entamoeba|Rep: Cysteine proteinase ACP1 precursor -
Entamoeba histolytica
Length = 308
Score = 38.3 bits (85), Expect = 0.24
Identities = 17/46 (36%), Positives = 24/46 (52%)
Frame = +2
Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
+N +DQG CGSCW F + RV + K + FS + L+ C
Sbjct: 103 MNPAKDQGQCGSCWTFCTTAVLEGRV--NKDLGKLYSFSEQQLVDC 146
>UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease;
n=23; Magnoliophyta|Rep: Senescence-specific cysteine
protease - Arabidopsis thaliana (Mouse-ear cress)
Length = 346
Score = 37.9 bits (84), Expect = 0.32
Identities = 23/63 (36%), Positives = 31/63 (49%)
Frame = +2
Query: 308 SLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDL 487
+LP + D R K P +++QGSCG CWAF AV A+ T K S + L
Sbjct: 129 ALPVSVDWRKKGAVTP----IKNQGSCGCCWAFSAVAAIEG--ATQIKKGKLISLSEQQL 182
Query: 488 LSC 496
+ C
Sbjct: 183 VDC 185
>UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2;
Caenorhabditis|Rep: Putative uncharacterized protein -
Caenorhabditis elegans
Length = 343
Score = 37.9 bits (84), Expect = 0.32
Identities = 17/48 (35%), Positives = 24/48 (50%)
Frame = +2
Query: 368 VRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCPICD 511
++ QG CGSCWAF A+ V G + S++ LL C + D
Sbjct: 153 IKYQGPCGSCWAFATAAAIESAVSISGGGLQ--SLSSQQLLDCTVVSD 198
>UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium
falciparum|Rep: Falcipain 2 - Plasmodium falciparum
Length = 484
Score = 37.9 bits (84), Expect = 0.32
Identities = 20/61 (32%), Positives = 31/61 (50%), Gaps = 1/61 (1%)
Frame = +2
Query: 317 ENFDPRD-KWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLS 493
ENFD W + V+DQ +CGSCWAF ++ ++ + N K S ++L+
Sbjct: 258 ENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKN--KLITLSEQELVD 315
Query: 494 C 496
C
Sbjct: 316 C 316
>UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein A;
n=2; Dictyostelium discoideum|Rep: Gamete and
mating-type specific protein A - Dictyostelium
discoideum (Slime mold)
Length = 448
Score = 37.9 bits (84), Expect = 0.32
Identities = 17/45 (37%), Positives = 25/45 (55%), Gaps = 2/45 (4%)
Frame = +2
Query: 368 VRDQGSCGSCWAFGAVEAMTDR-VCTYSNGTKH-FHFSAEDLLSC 496
+RDQG CGSCWAF + A+ R + Y K S ++ ++C
Sbjct: 253 IRDQGQCGSCWAFASSAALESRYLIKYGTAQKSTLQLSNQNAVNC 297
>UniRef50_Q24HI2 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 356
Score = 37.9 bits (84), Expect = 0.32
Identities = 20/74 (27%), Positives = 35/74 (47%)
Frame = +2
Query: 275 PIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNG 454
P+K N + +PE+ + W D ++ V+DQ +CGSCW F A+ + +
Sbjct: 116 PMKIQNKKNV-QVPESIN----WKDLNKVSPVKDQQNCGSCWTFSTTGAIESHYAIFED- 169
Query: 455 TKHFHFSAEDLLSC 496
+ S + L+ C
Sbjct: 170 VEPTSLSEQQLIDC 183
>UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2;
Platyhelminthes|Rep: Cathepsin L-like proteinase -
Echinococcus multilocularis
Length = 338
Score = 37.9 bits (84), Expect = 0.32
Identities = 21/62 (33%), Positives = 32/62 (51%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 490
+P++ D R K P ++DQG CGSCWAF A A+ ++ + K S + L+
Sbjct: 122 VPDSIDWRKKGLVTP----IKDQGDCGSCWAFSATGALEGQLKRKTG--KLISLSEQQLV 175
Query: 491 SC 496
C
Sbjct: 176 DC 177
>UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, whole
genome shotgun sequence; n=3; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_2,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 376
Score = 37.9 bits (84), Expect = 0.32
Identities = 17/46 (36%), Positives = 24/46 (52%)
Frame = +2
Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
+ EV+ QG CGSCWAF + + R+ +N K S L+ C
Sbjct: 175 VTEVQQQGRCGSCWAFAVQDVVISRL-AIANKNKLDQLSKTHLIDC 219
>UniRef50_A0CGA3 Cluster: Chromosome undetermined scaffold_179,
whole genome shotgun sequence; n=3; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_179,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 339
Score = 37.9 bits (84), Expect = 0.32
Identities = 20/83 (24%), Positives = 44/83 (53%)
Frame = +2
Query: 248 YRDEHFATLPIKTHNFDLIASLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMT 427
++++ + ++ + P ++ ++ +P C ++V +QG+C S ++ + +
Sbjct: 104 FKNDFTQQINVEKCKLSFMDETPVYYNFKEAYPQCN--HQVYNQGNCSSSYSIAVSSSFS 161
Query: 428 DRVCTYSNGTKHFHFSAEDLLSC 496
DRVC N T+ SA++LLSC
Sbjct: 162 DRVCK-QNQTQ--QLSAQNLLSC 181
>UniRef50_Q649T1 Cluster: Cathepsin C; n=1; uncultured archaeon
GZfos34G5|Rep: Cathepsin C - uncultured archaeon
GZfos34G5
Length = 760
Score = 37.9 bits (84), Expect = 0.32
Identities = 21/43 (48%), Positives = 27/43 (62%), Gaps = 1/43 (2%)
Frame = +2
Query: 299 LIASLP-ENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
L AS+P FD RDK + V++QGSCGSC AFG + A+
Sbjct: 301 LDASVPIGTFDWRDK-DGANWITSVKEQGSCGSCVAFGTIGAL 342
>UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease
containing protein; n=2; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 332
Score = 37.5 bits (83), Expect = 0.42
Identities = 17/38 (44%), Positives = 24/38 (63%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
LPE+ D W ++ VRDQG+CGSC+AF + A+
Sbjct: 127 LPESVD----WRKLGAVSPVRDQGNCGSCYAFASTGAL 160
>UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate
cathepsin L; n=4; Danio rerio|Rep: Novel protein similar
to vertebrate cathepsin L - Danio rerio (Zebrafish)
(Brachydanio rerio)
Length = 334
Score = 37.5 bits (83), Expect = 0.42
Identities = 16/46 (34%), Positives = 26/46 (56%)
Frame = +2
Query: 359 LNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
+ EV+DQG CGSCW+F A+ ++ Y + + S + L+ C
Sbjct: 130 VTEVKDQGYCGSCWSFSTTGAIEGQM--YKHTGRLVSLSEQQLVDC 173
>UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2;
Roseiflexus|Rep: Peptidase C1A, papain precursor -
Roseiflexus sp. RS-1
Length = 1202
Score = 37.5 bits (83), Expect = 0.42
Identities = 17/35 (48%), Positives = 20/35 (57%), Gaps = 3/35 (8%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAF---GAVEAMTDRV 436
W D V+DQG CGSCWAF G VE+ R+
Sbjct: 175 WCDQGACTPVKDQGVCGSCWAFATTGVVESALKRI 209
>UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum
aestivum|Rep: Cysteine protease - Triticum aestivum
(Wheat)
Length = 371
Score = 37.5 bits (83), Expect = 0.42
Identities = 20/61 (32%), Positives = 28/61 (45%)
Frame = +2
Query: 314 PENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLS 493
P FD W + + + QG+CG CWAF A A T NG + S ++L+
Sbjct: 154 PRQFD----WREHGVVTPAKQQGACGCCWAFAA--AATVESLNKINGGELVDLSVQELVD 207
Query: 494 C 496
C
Sbjct: 208 C 208
>UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3;
Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber
officinale (Ginger)
Length = 475
Score = 37.5 bits (83), Expect = 0.42
Identities = 17/38 (44%), Positives = 25/38 (65%)
Frame = +2
Query: 311 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAM 424
LP++ D R+K + V++QG CGSCWAF A+ A+
Sbjct: 143 LPDSIDWREKG----AVVAVKNQGRCGSCWAFAAIAAV 176
>UniRef50_Q0JP65 Cluster: Os01g0240900 protein; n=3; Oryza
sativa|Rep: Os01g0240900 protein - Oryza sativa subsp.
japonica (Rice)
Length = 166
Score = 37.5 bits (83), Expect = 0.42
Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Frame = +2
Query: 341 WPDCPTLNEVRDQGSCGSCWAF---GAVEAMTDRVCTYSNGTKHFHFSAEDLLSC 496
W D + +V+ QG+C SCWAF GAVE D N + S + L++C
Sbjct: 104 WRDRGAVTDVKMQGTCASCWAFSTTGAVEG--DNFLASGNLRNLLNLSEQQLVNC 156
Database: uniref50
Posted date: Oct 5, 2007 11:19 AM
Number of letters in database: 575,637,011
Number of sequences in database: 1,657,284
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.279 0.0580 0.190
Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 878,143,735
Number of Sequences: 1657284
Number of extensions: 18445955
Number of successful extensions: 54072
Number of sequences better than 10.0: 391
Number of HSP's better than 10.0 without gapping: 50872
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 53967
length of database: 575,637,011
effective HSP length: 100
effective length of database: 409,908,611
effective search space used: 74193458591
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)