BLASTX 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= ps4M0472.Seq
         (657 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits)  Value

UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C...   120   2e-26
UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n...   109   6e-23
UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|...   101   1e-20
UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr...   101   2e-20
UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina...   100   4e-20
UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3...    99   1e-19
UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs...    97   4e-19
UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata...    97   4e-19
UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi...    95   1e-18
UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep...    94   2e-18
UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cy...    93   4e-18
UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ...    92   1e-17
UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ...    92   1e-17
UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus...    92   1e-17
UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat...    90   4e-17
UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy...    90   4e-17
UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine prot...    90   5e-17
UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M...    89   7e-17
UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j...
88 2e-16 UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadida... 88 2e-16 UniRef50_UPI0000E492F4 Cluster: PREDICTED: similar to cathepsin ... 87 3e-16 UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei... 87 3e-16 UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er... 87 3e-16 UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca... 86 6e-16 UniRef50_Q4TI44 Cluster: Chromosome undetermined SCAF2412, whole... 86 8e-16 UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicate... 86 8e-16 UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:... 85 1e-15 UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ... 85 1e-15 UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No... 85 2e-15 UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ... 84 2e-15 UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ... 84 3e-15 UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome s... 83 4e-15 UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto... 83 4e-15 UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|... 83 6e-15 UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Re... 83 8e-15 UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep... 82 1e-14 UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi... 82 1e-14 UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy... 82 1e-14 UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca... 82 1e-14 UniRef50_Q0MYX5 Cluster: Putative cysteine protease; n=1; Emilia... 82 1e-14 UniRef50_UPI0000E224BB Cluster: PREDICTED: hypothetical protein;... 81 2e-14 UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy... 81 2e-14 UniRef50_A2F4G0 Cluster: Clan CA, family C1, cathepsin L-like cy... 81 2e-14 UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]... 
81 2e-14 UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica... 81 2e-14 UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n... 81 2e-14 UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e... 79 7e-14 UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18... 79 9e-14 UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1; ... 79 1e-13 UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schist... 78 2e-13 UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ... 77 3e-13 UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip... 77 3e-13 UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaste... 77 4e-13 UniRef50_Q7KWP5 Cluster: Similar to Dictyostelium discoideum (Sl... 77 4e-13 UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat... 77 5e-13 UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy... 77 5e-13 UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-li... 77 5e-13 UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory... 76 9e-13 UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:... 75 1e-12 UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae... 75 1e-12 UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=... 75 2e-12 UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ... 74 3e-12 UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc... 74 3e-12 UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n... 73 5e-12 UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1... 73 5e-12 UniRef50_Q24HI2 Cluster: Papain family cysteine protease contain... 73 5e-12 UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio... 73 6e-12 UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ... 73 8e-12 UniRef50_A3EXS2 Cluster: Cathepsin L-like cysteine proteinase-li... 
72 1e-11 UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus... 71 2e-11 UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lambli... 71 2e-11 UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R... 71 2e-11 UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome s... 71 3e-11 UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n... 71 3e-11 UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph... 70 4e-11 UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster... 70 4e-11 UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ... 70 4e-11 UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ... 70 4e-11 UniRef50_Q9LUX8 Cluster: Cysteine protease; n=1; Pyrus pyrifolia... 70 6e-11 UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like cy... 70 6e-11 UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin ... 69 8e-11 UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt... 69 8e-11 UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Ac... 69 8e-11 UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h... 69 1e-10 UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n... 69 1e-10 UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve... 69 1e-10 UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Gi... 69 1e-10 UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ... 68 2e-10 UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;... 68 2e-10 UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; ... 68 2e-10 UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryz... 68 2e-10 UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio... 68 2e-10 UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2... 68 2e-10 UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ... 
67 3e-10 UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ... 67 4e-10 UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lambli... 67 4e-10 UniRef50_Q22A69 Cluster: Papain family cysteine protease contain... 66 5e-10 UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dv... 66 7e-10 UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba hi... 66 9e-10 UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ... 66 9e-10 UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [C... 66 9e-10 UniRef50_A2DKX2 Cluster: Clan CA, family C1, cathepsin L-like cy... 65 1e-09 UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;... 65 1e-09 UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ... 65 2e-09 UniRef50_P25773 Cluster: Cathepsin L; n=38; Eukaryota|Rep: Cathe... 65 2e-09 UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus ... 64 2e-09 UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t... 64 3e-09 UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal... 64 3e-09 UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia ... 64 3e-09 UniRef50_UPI000155637A Cluster: PREDICTED: similar to ENSANGP000... 64 4e-09 UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox... 64 4e-09 UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ... 63 7e-09 UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty... 63 7e-09 UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|R... 63 7e-09 UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re... 62 9e-09 UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ... 62 1e-08 UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-... 62 1e-08 UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG... 62 2e-08 UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ... 
62 2e-08 UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ... 62 2e-08 UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl... 61 2e-08 UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2; Arab... 61 3e-08 UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big... 61 3e-08 UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi... 61 3e-08 UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain... 60 3e-08 UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endop... 60 3e-08 UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; D... 60 3e-08 UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryz... 60 5e-08 UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;... 60 5e-08 UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy... 60 6e-08 UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel... 60 6e-08 UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ... 59 8e-08 UniRef50_Q5BTK3 Cluster: SJCHGC00358 protein; n=1; Schistosoma j... 59 8e-08 UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain... 59 8e-08 UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr... 59 8e-08 UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryz... 59 1e-07 UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8; Magnoliophyt... 59 1e-07 UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic... 59 1e-07 UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3; ... 58 1e-07 UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1; ... 58 1e-07 UniRef50_UPI00006CBB5F Cluster: Papain family cysteine protease ... 58 2e-07 UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted... 58 2e-07 UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia... 58 2e-07 UniRef50_O16454 Cluster: Temporarily assigned gene name protein ... 
58 2e-07 UniRef50_UPI0000498E2F Cluster: cysteine proteinase; n=1; Entamo... 58 2e-07 UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole... 58 2e-07 UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil... 57 3e-07 UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb... 57 4e-07 UniRef50_Q22RC9 Cluster: Papain family cysteine protease contain... 57 4e-07 UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; ... 57 4e-07 UniRef50_Q650X9 Cluster: Putative cysteine proteinase; n=1; Oryz... 56 6e-07 UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza sativa... 56 7e-07 UniRef50_Q9VNK7 Cluster: CG1075-PA; n=1; Drosophila melanogaster... 56 7e-07 UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re... 56 7e-07 UniRef50_Q23RT7 Cluster: Papain family cysteine protease contain... 56 7e-07 UniRef50_Q23H32 Cluster: Papain family cysteine protease contain... 56 7e-07 UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; ... 56 7e-07 UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lambl... 56 1e-06 UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:... 56 1e-06 UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve... 56 1e-06 UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen... 56 1e-06 UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; ... 55 1e-06 UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n... 55 1e-06 UniRef50_Q24E33 Cluster: Papain family cysteine protease contain... 55 1e-06 UniRef50_Q239L8 Cluster: Papain family cysteine protease contain... 55 1e-06 UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Gi... 55 1e-06 UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa... 55 2e-06 UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia... 55 2e-06 UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr... 
55 2e-06 UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Pir... 55 2e-06 UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D... 55 2e-06 UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v... 54 2e-06 UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foet... 54 3e-06 UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|... 54 3e-06 UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sa... 54 4e-06 UniRef50_Q9XZI2 Cluster: Cathepsin Z1 preproprotein; n=3; Bilate... 54 4e-06 UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;... 54 4e-06 UniRef50_Q23FS6 Cluster: Papain family cysteine protease contain... 54 4e-06 UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4; Caenorhabdi... 54 4e-06 UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa... 53 5e-06 UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n... 53 5e-06 UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis... 53 5e-06 UniRef50_A4VE98 Cluster: Cathepsin z; n=3; Tetrahymena thermophi... 53 5e-06 UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ... 53 7e-06 UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ... 53 7e-06 UniRef50_Q54VR1 Cluster: Putative uncharacterized protein; n=1; ... 53 7e-06 UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Try... 53 7e-06 UniRef50_A7APS9 Cluster: Papain family cysteine protease contain... 53 7e-06 UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w... 53 7e-06 UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia... 52 9e-06 UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov... 52 9e-06 UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R... 52 9e-06 UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyl... 52 1e-05 UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa... 
52 1e-05 UniRef50_Q015J8 Cluster: Cathepsin; n=2; Ostreococcus|Rep: Cathe... 52 1e-05 UniRef50_Q01FU9 Cluster: Cathepsin Z; n=2; Ostreococcus|Rep: Cat... 52 2e-05 UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35... 52 2e-05 UniRef50_Q9BL26 Cluster: Putative uncharacterized protein; n=4; ... 51 2e-05 UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa... 51 2e-05 UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=... 51 2e-05 UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; The... 51 2e-05 UniRef50_A2WPN4 Cluster: Putative uncharacterized protein; n=2; ... 51 3e-05 UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba hi... 51 3e-05 UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - ... 51 3e-05 UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep... 51 3e-05 UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; ... 50 4e-05 UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3.... 50 4e-05 UniRef50_UPI00006CBCFB Cluster: Papain family cysteine protease ... 50 5e-05 UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Catheps... 50 5e-05 UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n... 50 5e-05 UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi... 50 5e-05 UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:... 50 5e-05 UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: C... 50 6e-05 UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoid... 50 6e-05 UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain... 50 6e-05 UniRef50_Q22NW9 Cluster: Papain family cysteine protease contain... 50 6e-05 UniRef50_A0DIY3 Cluster: Chromosome undetermined scaffold_52, wh... 50 6e-05 UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Pl... 36 7e-05 UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1; Rhip... 
49 9e-05 UniRef50_Q7JYA0 Cluster: RE20049p; n=2; Sophophora|Rep: RE20049p... 49 9e-05 UniRef50_Q4UCF5 Cluster: Cysteine proteinase, tacP, putative; n=... 49 9e-05 UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ... 49 1e-04 UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma... 49 1e-04 UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ... 48 1e-04 UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep... 48 1e-04 UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|R... 48 1e-04 UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ... 48 1e-04 UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ... 48 2e-04 UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ... 48 2e-04 UniRef50_Q26988 Cluster: Cysteine proteinase 4; n=1; Tritrichomo... 48 2e-04 UniRef50_Q23H15 Cluster: Papain family cysteine protease contain... 48 2e-04 UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathep... 48 2e-04 UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestiv... 48 3e-04 UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; ... 48 3e-04 UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|... 48 3e-04 UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8.... 48 3e-04 UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ... 48 3e-04 UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryo... 48 3e-04 UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:... 48 3e-04 UniRef50_Q84SA7 Cluster: Thiol protease; n=1; Aster tripolium|Re... 47 3e-04 UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomus... 47 3e-04 UniRef50_Q7QQ92 Cluster: GLP_243_18349_20043; n=1; Giardia lambl... 47 3e-04 UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;... 47 3e-04 UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; ... 
47 3e-04 UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh... 47 3e-04 UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi... 47 3e-04 UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome sh... 47 5e-04 UniRef50_O48605 Cluster: Putative thiol protease; n=1; Hordeum v... 47 5e-04 UniRef50_A7QDM1 Cluster: Chromosome chr10 scaffold_81, whole gen... 47 5e-04 UniRef50_Q23EG5 Cluster: Papain family cysteine protease contain... 47 5e-04 UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C... 47 5e-04 UniRef50_Q962W3 Cluster: Cysteine protease; n=3; Giardia intesti... 46 6e-04 UniRef50_Q7R0G3 Cluster: GLP_29_33036_32140; n=1; Giardia lambli... 46 6e-04 UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1; ... 46 6e-04 UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru... 46 6e-04 UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ... 46 8e-04 UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein... 46 8e-04 UniRef50_Q5CYC8 Cluster: Cathepsin like thiol protease possibly ... 46 8e-04 UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1... 46 8e-04 UniRef50_Q23TW3 Cluster: Papain family cysteine protease contain... 46 8e-04 UniRef50_Q6A1H9 Cluster: Cathepsin X/O; n=1; Suberites domuncula... 46 0.001 UniRef50_Q237A1 Cluster: Papain family cysteine protease contain... 46 0.001 UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep: Ca... 46 0.001 UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA,... 45 0.001 UniRef50_Q7QWN9 Cluster: GLP_26_50243_51811; n=2; Giardia lambli... 45 0.001 UniRef50_Q7QVE9 Cluster: GLP_542_3431_1206; n=1; Giardia lamblia... 45 0.001 UniRef50_Q7QRX3 Cluster: GLP_549_24108_24914; n=1; Giardia lambl... 45 0.001 UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gamb... 45 0.001 UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain... 
45 0.001 UniRef50_Q23H06 Cluster: Papain family cysteine protease contain... 45 0.001 UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain... 45 0.001 UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep: Ca... 45 0.001 UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA... 45 0.002 UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n... 45 0.002 UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis... 45 0.002 UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theil... 45 0.002 UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain... 45 0.002 UniRef50_A3FQ13 Cluster: Cathepsin like thiol protease possibly ... 45 0.002 UniRef50_P05993 Cluster: Cysteine proteinase; n=7; Eukaryota|Rep... 45 0.002 UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto... 45 0.002 UniRef50_Q23H10 Cluster: Papain family cysteine protease contain... 44 0.002 UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep... 44 0.002 UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella ve... 44 0.002 UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, wh... 44 0.002 UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4 ... 44 0.002 UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|... 44 0.002 UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease ... 44 0.003 UniRef50_Q22W19 Cluster: Papain family cysteine protease contain... 44 0.003 UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like prote... 44 0.003 UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; The... 44 0.003 UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3 ... 44 0.003 UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Ca... 44 0.003 UniRef50_A1SVF0 Cluster: Peptidase C1A, papain; n=1; Psychromona... 44 0.004 UniRef50_Q9XXQ7 Cluster: Putative uncharacterized protein; n=2; ... 
44 0.004 UniRef50_A2DPZ6 Cluster: Clan CA, family C1, cathepsin B-like cy... 44 0.004 UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali... 44 0.004 UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh... 44 0.004 UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ... 43 0.006 UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; ... 43 0.006 UniRef50_Q26987 Cluster: Cysteine proteinase 3; n=1; Tritrichomo... 43 0.006 UniRef50_Q84YE7 Cluster: Cysteine proteinase-like protein; n=1; ... 43 0.007 UniRef50_Q24F16 Cluster: Papain family cysteine protease contain... 43 0.007 UniRef50_O62484 Cluster: Putative uncharacterized protein; n=1; ... 43 0.007 UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2 ... 43 0.007 UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Re... 42 0.010 UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|... 42 0.010 UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intest... 42 0.010 UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2... 42 0.010 UniRef50_Q4UC83 Cluster: Cysteine proteinase, putative; n=2; The... 42 0.010 UniRef50_Q23FP9 Cluster: Papain family cysteine protease contain... 42 0.010 UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;... 42 0.013 UniRef50_Q5YER6 Cluster: Cathepsin Z; n=1; Bigelowiella natans|R... 42 0.013 UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep... 42 0.013 UniRef50_Q4N5Z7 Cluster: Cysteine proteinase, putative; n=2; The... 42 0.013 UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n... 42 0.013 UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8; Leishma... 42 0.013 UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, wh... 42 0.013 UniRef50_UPI0000E46171 Cluster: PREDICTED: hypothetical protein;... 42 0.017 UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4; Ancylostomati... 
42 0.017 UniRef50_Q5CKC2 Cluster: Preprocathepsin c; n=2; Cryptosporidium... 42 0.017 UniRef50_Q22DX2 Cluster: Papain family cysteine protease contain... 42 0.017 UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Re... 42 0.017 UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, wh... 42 0.017 UniRef50_A0CHZ5 Cluster: Chromosome undetermined scaffold_186, w... 42 0.017 UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ... 41 0.023 UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1; ... 41 0.023 UniRef50_Q4UFL9 Cluster: Cathepsin-like cysteine protease, putat... 41 0.023 UniRef50_Q231X3 Cluster: Papain family cysteine protease contain... 41 0.023 UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, w... 41 0.023 UniRef50_A0BZE1 Cluster: Chromosome undetermined scaffold_139, w... 41 0.023 UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=... 41 0.023 UniRef50_Q9UBR2 Cluster: Cathepsin Z precursor; n=40; Bilateria|... 41 0.023 UniRef50_Q93512 Cluster: Putative uncharacterized protein; n=2; ... 41 0.030 UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=... 41 0.030 UniRef50_Q7R5X2 Cluster: GLP_81_104117_102504; n=1; Giardia lamb... 41 0.030 UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lambl... 41 0.030 UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3; Theil... 41 0.030 UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain ... 41 0.030 UniRef50_A0DZB1 Cluster: Chromosome undetermined scaffold_7, who... 41 0.030 UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; M... 41 0.030 UniRef50_Q01E11 Cluster: Cysteine proteinase Cathepsin F; n=3; O... 40 0.040 UniRef50_Q26993 Cluster: Cysteine proteinase 9; n=1; Tritrichomo... 40 0.040 UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain... 40 0.040 UniRef50_Q1AMF3 Cluster: Cathepsin C1; n=1; Toxoplasma gondii|Re... 
40 0.040 UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7... 40 0.040 UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|... 40 0.040 UniRef50_A2GCC2 Cluster: Clan CA, family C1, cathepsin B-like cy... 40 0.040 UniRef50_A0DCA5 Cluster: Chromosome undetermined scaffold_45, wh... 40 0.040 UniRef50_A0CYS7 Cluster: Chromosome undetermined scaffold_31, wh... 40 0.040 UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-li... 40 0.040 UniRef50_UPI0000E49DA9 Cluster: PREDICTED: similar to cathepsin ... 40 0.053 UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ... 40 0.053 UniRef50_Q7M1Q7 Cluster: Actinidain; n=1; Actinidia chinensis|Re... 40 0.053 UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor; ... 40 0.053 UniRef50_Q4N3V5 Cluster: Cathepsin C, putative; n=1; Theileria p... 40 0.053 UniRef50_Q235G6 Cluster: Papain family cysteine protease contain... 40 0.053 UniRef50_Q0PWU8 Cluster: Cathepsin B preproprotein-like protein;... 40 0.053 UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p; ... 40 0.069 UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin ... 40 0.069 UniRef50_Q01FL5 Cluster: Cysteine protease-5; n=3; Ostreococcus|... 40 0.069 UniRef50_Q9XW98 Cluster: Putative uncharacterized protein; n=1; ... 40 0.069 UniRef50_Q9NHY2 Cluster: Cysteine protease cp1; n=2; Theileria c... 40 0.069 UniRef50_Q8I8D3 Cluster: Cysteine protease 15; n=2; Entamoeba hi... 40 0.069 UniRef50_Q7Q9Y5 Cluster: ENSANGP00000012222; n=1; Anopheles gamb... 40 0.069 UniRef50_UPI00006CB653 Cluster: Papain family cysteine protease ... 39 0.092 UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G... 39 0.092 UniRef50_Q8I8D7 Cluster: Cysteine protease 11; n=4; Entamoeba hi... 39 0.092 UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpw... 39 0.092 UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8.... 
39 0.092 UniRef50_A7AX75 Cluster: Preprocathepsin c, putative; n=1; Babes... 39 0.092 UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like cy... 39 0.092 UniRef50_Q9TY95 Cluster: Serine-repeat antigen protein precursor... 39 0.092 UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease ... 39 0.12 UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease ... 39 0.12 UniRef50_A6N590 Cluster: Extracellular cysteine protease 8; n=10... 39 0.12 UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-li... 39 0.12 UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precurs... 39 0.12 UniRef50_Q75JH0 Cluster: Similar to Dictyostelium discoideum (Sl... 38 0.16 UniRef50_Q248G1 Cluster: Papain family cysteine protease contain... 38 0.16 UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator ame... 38 0.16 UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115, w... 38 0.16 UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium... 38 0.16 UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep: ... 38 0.21 UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcop... 38 0.21 UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase pr... 38 0.21 UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 ... 38 0.21 UniRef50_A4S004 Cluster: Predicted protein; n=2; Ostreococcus|Re... 38 0.28 UniRef50_Q8I8D5 Cluster: Cysteine protease 13; n=2; Entamoeba hi... 38 0.28 UniRef50_Q5CM16 Cluster: P3ECSL-related; n=2; Cryptosporidium|Re... 38 0.28 UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase pr... 38 0.28 UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;... 37 0.37 UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; R... 37 0.37 UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cy... 37 0.37 UniRef50_Q625X3 Cluster: Putative uncharacterized protein CBG011... 37 0.49 UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag... 
37 0.49 UniRef50_A0EI50 Cluster: Chromosome undetermined scaffold_98, wh... 37 0.49 UniRef50_P56202 Cluster: Cathepsin W precursor; n=15; Eutheria|R... 37 0.49 UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,... 36 0.65 UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep... 36 0.65 UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestin... 36 0.65 UniRef50_Q1AMF2 Cluster: Cathepsin C2; n=1; Toxoplasma gondii|Re... 36 0.65 UniRef50_A7SNM3 Cluster: Predicted protein; n=1; Nematostella ve... 36 0.65 UniRef50_A2E6N1 Cluster: Clan CA, family C1, cathepsin B-like cy... 36 0.86 UniRef50_Q2H7E7 Cluster: Putative uncharacterized protein; n=1; ... 36 0.86 UniRef50_A2XHS0 Cluster: Putative uncharacterized protein; n=2; ... 36 1.1 UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2; ... 36 1.1 UniRef50_Q4XZE6 Cluster: Preprocathepsin c, putative; n=6; Plasm... 36 1.1 UniRef50_O96167 Cluster: Cysteine protease, putative; n=1; Plasm... 36 1.1 UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteina... 36 1.1 UniRef50_A7ASR7 Cluster: Cathepsin C, putative; n=1; Babesia bov... 36 1.1 UniRef50_Q06VH9 Cluster: Putative uncharacterized protein; n=1; ... 35 1.5 UniRef50_A5K8Y0 Cluster: Preprocathepsin c, putative; n=1; Plasm... 35 1.5 UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like cy... 35 1.5 UniRef50_A0LQG9 Cluster: Peptidase C1A, papain; n=1; Syntrophoba... 35 2.0 UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1; O... 35 2.0 UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain... 35 2.0 UniRef50_A7TZ14 Cluster: Cysteine proteinase; n=1; Lepeophtheiru... 35 2.0 UniRef50_Q8IIJ9 Cluster: Probable cathepsin C precursor; n=11; P... 35 2.0 UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ... 34 2.6 UniRef50_Q8I8D6 Cluster: Cysteine protease 12; n=1; Entamoeba hi... 34 2.6 UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckei... 
34 2.6 UniRef50_Q7R6M0 Cluster: GLP_170_106076_104580; n=1; Giardia lam... 34 2.6 UniRef50_Q7QQ83 Cluster: GLP_42_16392_14707; n=1; Giardia lambli... 34 2.6 UniRef50_Q1AMF1 Cluster: Cathepsin C3; n=1; Toxoplasma gondii|Re... 34 2.6 UniRef50_O96163 Cluster: Cysteine protease, putative; n=5; Plasm... 34 2.6 UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precu... 34 2.6 UniRef50_Q8I8D2 Cluster: Cysteine protease 16; n=2; Entamoeba hi... 34 3.5 UniRef50_Q8I8D0 Cluster: Cysteine protease 18; n=2; Entamoeba hi... 34 3.5 UniRef50_Q7R6L4 Cluster: GLP_170_114230_115951; n=1; Giardia lam... 34 3.5 UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, wh... 34 3.5 UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ... 33 4.6 UniRef50_UPI0000E4622C Cluster: PREDICTED: hypothetical protein;... 33 4.6 UniRef50_A0BNM1 Cluster: Chromosome undetermined scaffold_119, w... 33 4.6 UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum... 33 6.0 UniRef50_Q3L7L6 Cluster: Sar s 1 allergen Yv5032C08; n=1; Sarcop... 33 6.0 UniRef50_O96166 Cluster: Cysteine protease, putative; n=1; Plasm... 33 6.0 UniRef50_O96165 Cluster: Cysteine protease, putative; n=1; Plasm... 33 6.0 UniRef50_A1SAN0 Cluster: Putative uncharacterized protein; n=1; ... 33 8.0 UniRef50_Q8WQ50 Cluster: Zerknuellt protein; n=1; Haematopota pl... 33 8.0 UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n... 
33 8.0 >UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain] - Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina) Length = 339 Score = 120 bits (290), Expect = 2e-26 Identities = 52/78 (66%), Positives = 60/78 (76%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 H SFQLYS GVYNE EC +LDHGVLVVGYGTDE G+DYW + G WGE GYIKM Sbjct: 262 HESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMA 321 Query: 477 RNKNNRCGIASSASYXXV 424 RN+NN+CGIA+++SY V Sbjct: 322 RNQNNQCGIATASSYPTV 339 >UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n=21; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Globodera pallida Length = 379 Score = 109 bits (262), Expect = 6e-23 Identities = 50/78 (64%), Positives = 56/78 (71%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 H SFQLY+ GVY E+ECS +LDHGVLVVGYGTD Q DYW + G WGE GYI+M Sbjct: 302 HRSFQLYTHGVYFEKECSPENLDHGVLVVGYGTDAQQGDYWIVKNSWGAHWGEQGYIRMA 361 Query: 477 RNKNNRCGIASSASYXXV 424 RN+ N CGIAS ASY V Sbjct: 362 RNRKNNCGIASHASYPLV 379 >UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|Rep: Cysteine proteinase - Acanthamoeba healyi Length = 330 Score = 101 bits (243), Expect = 1e-20 Identities = 47/75 (62%), Positives = 54/75 (72%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 H SFQ YS GVY E CSST LDHGVLVVG+G+ E G D+W + G WG GYIKM Sbjct: 254 HNSFQFYSGGVYYESACSSTQLDHGVLVVGWGS-ENGQDFWWVKNSWGASWGLNGYIKMS 312 Query: 477 RNKNNRCGIASSASY 433 RN+NN CGIA++ASY Sbjct: 313 RNQNNNCGIATAASY 327 >UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine proteinase precursor - Heterodera glycines (Soybean cyst nematode worm) Length = 353 Score = 101 bits 
(242), Expect = 2e-20 Identities = 46/76 (60%), Positives = 53/76 (69%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 SFQ Y +GVY E CS+ LDHGVL+VGYGTDE DYW + GP WGE GYI++ RN Sbjct: 278 SFQFYKTGVYYERWCSNRYLDHGVLLVGYGTDETHGDYWLVKNSWGPHWGENGYIRIARN 337 Query: 471 KNNRCGIASSASYXXV 424 K N CGIA+ ASY V Sbjct: 338 KQNHCGIATMASYPVV 353 >UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteinase B; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like tick cysteine proteinase B - Haemaphysalis longicornis (Bush tick) Length = 332 Score = 100 bits (239), Expect = 4e-20 Identities = 46/74 (62%), Positives = 54/74 (72%) Frame = -1 Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466 Q YS G+Y+E ECSS LDHGVLVVGYGT + G DYW + G WG+ GYI M RN++ Sbjct: 260 QFYSEGIYDEPECSSEQLDHGVLVVGYGTKD-GKDYWLVKNSWGTTWGDEGYIYMTRNQD 318 Query: 465 NRCGIASSASYXXV 424 N+CGIASSASY V Sbjct: 319 NQCGIASSASYPLV 332 >UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3; Bilateria|Rep: Cathepsin L-like cysteine protease - Neobenedenia melleni Length = 335 Score = 98.7 bits (235), Expect = 1e-19 Identities = 44/76 (57%), Positives = 54/76 (71%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 SF LY SGVY+EE+CS T L+HGVL VGYGT +G+DYW + WG GYI M RN Sbjct: 260 SFHLYDSGVYDEEDCSQTMLNHGVLAVGYGTTPEGLDYWKVKNSWTNTWGMEGYILMSRN 319 Query: 471 KNNRCGIASSASYXXV 424 K+N+CG+A+ ASY V Sbjct: 320 KDNQCGVATVASYPIV 335 >UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precursor; n=3; Metazoa|Rep: Digestive cysteine proteinase 2 precursor - Homarus americanus (American lobster) Length = 323 Score = 96.7 bits (230), Expect = 4e-19 Identities = 45/78 (57%), Positives = 53/78 (67%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 H+SFQ YSSGVY E CS + LDH VL VGYG+ E G D+W + WG+ GYIKM Sbjct: 247 HSSFQFYSSGVYYEPSCSPSYLDHAVLAVGYGS-EGGQDFWLVKNSWATSWGDAGYIKMS 305 Query: 477 
RNKNNRCGIASSASYXXV 424 RN+NN CGIA+ ASY V Sbjct: 306 RNRNNNCGIATVASYPLV 323 >UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata|Rep: Cathepsin L2 precursor - Homo sapiens (Human) Length = 334 Score = 96.7 bits (230), Expect = 4e-19 Identities = 43/81 (53%), Positives = 55/81 (67%), Gaps = 3/81 (3%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGY---GTDEQGVDYWXREELVGPLWGELGYI 487 H+SFQ Y SG+Y E +CSS +LDHGVLVVGY G + YW + GP WG GY+ Sbjct: 254 HSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYV 313 Query: 486 KMIRNKNNRCGIASSASYXXV 424 K+ ++KNN CGIA++ASY V Sbjct: 314 KIAKDKNNHCGIATAASYPNV 334 >UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsin L - Suberites domuncula (Sponge) Length = 324 Score = 95.5 bits (227), Expect = 1e-18 Identities = 47/78 (60%), Positives = 52/78 (66%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 H SFQ Y +GVY E CSS+ LDHGVLVVGYGT E G DY+ + G WG GYI M Sbjct: 248 HRSFQFYKNGVYYEPSCSSSRLDHGVLVVGYGT-EGGQDYFIVKNSWGTRWGMDGYIMMS 306 Query: 477 RNKNNRCGIASSASYXXV 424 RN+ N CGIAS ASY V Sbjct: 307 RNRRNNCGIASQASYPIV 324 >UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin L - Misgurnus mizolepis (Mud loach) Length = 337 Score = 94.3 bits (224), Expect = 2e-18 Identities = 43/81 (53%), Positives = 55/81 (67%), Gaps = 3/81 (3%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVD---YWXREELVGPLWGELGYI 487 H SFQ Y SG+Y E+ECSS +LDHGVLVVGYG + + VD YW + WG+ GYI Sbjct: 257 HESFQFYQSGIYFEKECSSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSESWGDKGYI 316 Query: 486 KMIRNKNNRCGIASSASYXXV 424 M +++ N CGIA++ASY V Sbjct: 317 YMAKDRKNHCGIATAASYPLV 337 >UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=17; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 318 Score = 93.5 bits (222), Expect = 4e-18 Identities = 42/68 (61%), Positives = 49/68 (72%) Frame = -1 Query: 648 
FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469 FQLYSSG+YN + CSST LDH V +VGYGT E VDYW G WGE GYI+MIRN Sbjct: 244 FQLYSSGIYNPKSCSSTFLDHAVGLVGYGT-ENKVDYWIVRNSWGTSWGEKGYIRMIRNN 302 Query: 468 NNRCGIAS 445 N+CG+A+ Sbjct: 303 GNKCGVAT 310 >UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin L-like protease; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to cathepsin L-like protease - Nasonia vitripennis Length = 353 Score = 91.9 bits (218), Expect = 1e-17 Identities = 39/79 (49%), Positives = 52/79 (65%), Gaps = 1/79 (1%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQ-GVDYWXREELVGPLWGELGYIKM 481 H +F+ YS GVY + EC+ DLDH VL+VGYGTD + D+W + G WGE GY K+ Sbjct: 275 HDTFRFYSEGVYYQPECNEDDLDHAVLIVGYGTDNRTDQDFWLVKNSWGETWGEGGYFKV 334 Query: 480 IRNKNNRCGIASSASYXXV 424 RN+ N CGIA++A Y + Sbjct: 335 ARNRRNHCGIAAAAVYPVI 353 >UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba culbertsoni|Rep: Cysteine proteinase - Acanthamoeba culbertsoni Length = 482 Score = 91.9 bits (218), Expect = 1e-17 Identities = 41/71 (57%), Positives = 48/71 (67%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 SF YS G Y + CSST+L+H VLVVG+GTD Q DYW + G WG+ GY+ M RN Sbjct: 298 SFMFYSGGYYYDPTCSSTNLNHAVLVVGWGTDPQRGDYWIAKNEWGTAWGDDGYVYMARN 357 Query: 471 KNNRCGIASSA 439 KNN CGIAS A Sbjct: 358 KNNNCGIASLA 368 >UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus|Rep: Cathepsin L - Aphrocallistes vastus Length = 329 Score = 91.9 bits (218), Expect = 1e-17 Identities = 44/75 (58%), Positives = 51/75 (68%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 HTSFQ+Y SG+Y CS T LDHGVLVVGYGTD GVDYW + G WG GY K I Sbjct: 254 HTSFQMYHSGIYTPFLCSKTKLDHGVLVVGYGTD-NGVDYWLIKNSWGMAWGMDGYFK-I 311 Query: 477 RNKNNRCGIASSASY 433 K+++CGI + ASY Sbjct: 312 EMKSDKCGICTQASY 326 >UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cathepsin L; n=4; Danio rerio|Rep: Novel protein similar to 
vertebrate cathepsin L - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 334 Score = 90.2 bits (214), Expect = 4e-17 Identities = 41/76 (53%), Positives = 51/76 (67%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 SF YSSG+Y E C+ +L+H VLVVGYG++E G DYW + G WGE GY++MIRN Sbjct: 260 SFLFYSSGIYKESNCNPNNLNHAVLVVGYGSEE-GTDYWIIKNSWGTGWGEGGYMRMIRN 318 Query: 471 KNNRCGIASSASYXXV 424 N CGIAS A Y + Sbjct: 319 GKNTCGIASYALYPII 334 >UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 306 Score = 90.2 bits (214), Expect = 4e-17 Identities = 38/71 (53%), Positives = 49/71 (69%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 SF LY G+Y+E +CS DLDH V VGYG + + DYW G +WGE GY++MIRN Sbjct: 231 SFMLYKEGIYDEPKCSEEDLDHAVGCVGYGVEGEK-DYWIVRNSWGEVWGEKGYVRMIRN 289 Query: 471 KNNRCGIASSA 439 KNN+CG+A+ A Sbjct: 290 KNNQCGVATEA 300 >UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine protease; n=1; Maconellicoccus hirsutus|Rep: Putative cathepsin L-like cysteine protease - Maconellicoccus hirsutus (hibiscus mealybug) Length = 339 Score = 89.8 bits (213), Expect = 5e-17 Identities = 42/76 (55%), Positives = 52/76 (68%), Gaps = 2/76 (2%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIK 484 H SF Y G+Y E +C + +++HGVLVVGYG+ E G DYW + G WGE GYI+ Sbjct: 261 HQSFHSYKGGIYFEPDCGNKKDEVNHGVLVVGYGS-ENGQDYWIVKNSYGTDWGEDGYIR 319 Query: 483 MIRNKNNRCGIASSAS 436 M RNKNN CGIA+SAS Sbjct: 320 MARNKNNHCGIATSAS 335 >UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (Major excreted protein) (MEP) [Contains: Cathepsin L heavy chain; Cathepsin L light chain]; n=19; Eukaryota|Rep: Cathepsin L precursor (EC 3.4.22.15) (Major excreted protein) (MEP) [Contains: Cathepsin L heavy chain; Cathepsin L light chain] - Homo sapiens (Human) Length 
= 333 Score = 89.4 bits (212), Expect = 7e-17 Identities = 41/81 (50%), Positives = 51/81 (62%), Gaps = 3/81 (3%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYG---TDEQGVDYWXREELVGPLWGELGYI 487 H SF Y G+Y E +CSS D+DHGVLVVGYG T+ YW + G WG GY+ Sbjct: 253 HESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYV 312 Query: 486 KMIRNKNNRCGIASSASYXXV 424 KM +++ N CGIAS+ASY V Sbjct: 313 KMAKDRRNHCGIASAASYPTV 333 >UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma japonicum|Rep: SJCHGC06231 protein - Schistosoma japonicum (Blood fluke) Length = 372 Score = 88.2 bits (209), Expect = 2e-16 Identities = 39/78 (50%), Positives = 54/78 (69%), Gaps = 2/78 (2%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 SF +Y SG+Y++ EC+S DLDHGVL+VGYG E G YW + G WG+ GY+K++ Sbjct: 296 SFSMYKSGIYSDPECASASEDLDHGVLLVGYGI-EDGKPYWLIKNSWGEDWGDKGYVKIL 354 Query: 477 RNKNNRCGIASSASYXXV 424 ++ N CG+AS+ASY V Sbjct: 355 KDSKNMCGVASAASYPLV 372 >UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadidae|Rep: Cysteine protease - Tritrichomonas foetus (Trichomonas foetus) Length = 315 Score = 87.8 bits (208), Expect = 2e-16 Identities = 41/75 (54%), Positives = 53/75 (70%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 H SFQLY SG+Y+E ECS+T L+HGV +G+G+D YW G WGE GYI++I Sbjct: 240 HQSFQLYKSGIYDEPECSATFLNHGVGCIGFGSDND-TKYWIVPNSWGLTWGEEGYIRII 298 Query: 477 RNKNNRCGIASSASY 433 R K+NRCGIA+SA + Sbjct: 299 R-KDNRCGIAASACF 312 >UniRef50_UPI0000E492F4 Cluster: PREDICTED: similar to cathepsin L-like proteinase; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin L-like proteinase - Strongylocentrotus purpuratus Length = 329 Score = 87.0 bits (206), Expect = 3e-16 Identities = 39/73 (53%), Positives = 47/73 (64%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 SFQLY SGVY++ CSST LD +L+VGYG G +YW G WG+ GYI + RN Sbjct: 253 
SFQLYVSGVYSDPNCSSTLLDLSLLLVGYGVSSVGTEYWICRNTWGEEWGDNGYINIARN 312 Query: 471 KNNRCGIASSASY 433 NN CGIA+ A Y Sbjct: 313 HNNMCGIATDAIY 325 >UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like midgut cysteine proteinase - Tenebrio molitor (Yellow mealworm) Length = 330 Score = 87.0 bits (206), Expect = 3e-16 Identities = 37/71 (52%), Positives = 50/71 (70%) Frame = -1 Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466 Q YS G++ ++ C+ +DL+HGVLVVGYG+D G DYW + G WGE GY + +RN Sbjct: 258 QFYSGGLFYDQTCNQSDLNHGVLVVGYGSDN-GQDYWILKNSWGSGWGESGYWRQVRNYG 316 Query: 465 NRCGIASSASY 433 N CGIA++ASY Sbjct: 317 NNCGIATAASY 327 >UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra erinaceieuropaei|Rep: Cysteine proteinase - Spirometra erinaceieuropaei (Tapeworm) Length = 336 Score = 87.0 bits (206), Expect = 3e-16 Identities = 42/75 (56%), Positives = 48/75 (64%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469 F YS GV+ + CS +DHGVLVVGYG E G YW + G WGE GY+KM RN+ Sbjct: 263 FMSYSHGVFVSKTCSPYAIDHGVLVVGYGA-ENGDAYWLVKNSWGSSWGEDGYLKMARNR 321 Query: 468 NNRCGIASSASYXXV 424 NN CGIAS ASY V Sbjct: 322 NNMCGIASMASYPTV 336 >UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Cathepsin - Geodia cydonium (Sponge) Length = 322 Score = 86.2 bits (204), Expect = 6e-16 Identities = 41/78 (52%), Positives = 50/78 (64%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 H FQLY GVY+ + CS T LDHGVLVVGYG ++ DYW + G WG G + M Sbjct: 243 HLGFQLYDGGVYHSDLCSQTRLDHGVLVVGYGVYKE-KDYWMVKNSWGTNWGISGDMMMS 301 Query: 477 RNKNNRCGIASSASYXXV 424 RN++N CGIA+ ASY V Sbjct: 302 RNRDNNCGIATMASYPVV 319 >UniRef50_Q4TI44 Cluster: Chromosome undetermined SCAF2412, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF2412, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 123 Score = 85.8 
bits (203), Expect = 8e-16 Identities = 37/74 (50%), Positives = 48/74 (64%) Frame = -1 Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 T+F LYS GVY + +C+ D++H VL+VGYG +G YW + G WG GYI M R Sbjct: 47 TTFHLYSKGVYYDPDCNPEDINHAVLLVGYGVTRRGQQYWIVKNSWGTGWGTEGYILMAR 106 Query: 474 NKNNRCGIASSASY 433 N+ N CGIA+ ASY Sbjct: 107 NRGNLCGIANLASY 120 >UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicatein a3 - Lubomirskia baicalensis Length = 344 Score = 85.8 bits (203), Expect = 8e-16 Identities = 38/73 (52%), Positives = 49/73 (67%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 +F Y SGV++ CS++ L+H +LV GYG+ G DYW + G WGE GYIKM+RN Sbjct: 270 AFMFYQSGVFDSSTCSTSKLNHAMLVTGYGSTN-GKDYWLVKNSWGTGWGESGYIKMVRN 328 Query: 471 KNNRCGIASSASY 433 K N+CGIAS A Y Sbjct: 329 KYNQCGIASDALY 341 >UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep: Silicatein beta - Suberites domuncula (Sponge) Length = 383 Score = 85.4 bits (202), Expect = 1e-15 Identities = 39/74 (52%), Positives = 49/74 (66%) Frame = -1 Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 TSFQ YS GV N CSS+ L H ++V+GYG G DYW + GP WG GY K+ R Sbjct: 308 TSFQFYSDGVLNVPYCSSSTLSHALVVIGYGK-YSGQDYWLVKNSWGPNWGVRGYGKLAR 366 Query: 474 NKNNRCGIASSASY 433 NK N+CGIA++AS+ Sbjct: 367 NKGNKCGIATAASF 380 >UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; n=35; Fasciola|Rep: Cathepsin L-like proteinase precursor - Fasciola hepatica (Liver fluke) Length = 326 Score = 85.0 bits (201), Expect = 1e-15 Identities = 38/77 (49%), Positives = 48/77 (62%) Frame = -1 Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 + F +Y SG+Y + CS ++H VL VGYGT + G DYW + G WGE GYI+M R Sbjct: 247 SDFMMYRSGIYQSQTCSPLRVNHAVLAVGYGT-QGGTDYWIVKNSWGTYWGERGYIRMAR 305 Query: 474 NKNNRCGIASSASYXXV 424 N+ N CGIAS AS V Sbjct: 306 NRGNMCGIASLASLPMV 322 >UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel protein - 
Danio rerio (Zebrafish) (Brachydanio rerio) Length = 328 Score = 84.6 bits (200), Expect = 2e-15 Identities = 40/76 (52%), Positives = 50/76 (65%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 SF Y SG+YN+ +CSS ++H VLVVGYG+ E G DYW + G WGE GYI+M RN Sbjct: 255 SFHRYRSGIYNDPKCSSALINHAVLVVGYGS-ENGQDYWLVKNSWGTAWGENGYIRMARN 313 Query: 471 KNNRCGIASSASYXXV 424 K N CGI+S Y + Sbjct: 314 K-NMCGISSFGIYPTI 328 >UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2); n=2; Tribolium castaneum|Rep: PREDICTED: similar to Cathepsin K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2) - Tribolium castaneum Length = 332 Score = 84.2 bits (199), Expect = 2e-15 Identities = 38/73 (52%), Positives = 49/73 (67%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 +F Y SGVYN C L+H V++VGYG E+GVDYW + G WG+ GY+KM RN Sbjct: 259 TFHKYKSGVYNNPSCRG-GLNHAVVIVGYGR-ERGVDYWLVKNSWGAGWGQKGYVKMARN 316 Query: 471 KNNRCGIASSASY 433 + N+CGIA+ ASY Sbjct: 317 RRNQCGIATHASY 329 >UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 326 Score = 83.8 bits (198), Expect = 3e-15 Identities = 40/78 (51%), Positives = 53/78 (67%), Gaps = 2/78 (2%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSS--TDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 +FQLY SG+ ++ C S L+HGVLVVGYGT+++ DYW + G WG GYI M Sbjct: 250 NFQLYDSGILDDSSCYSDFNSLNHGVLVVGYGTEKEQ-DYWIVKNSWGADWGMDGYIWMS 308 Query: 477 RNKNNRCGIASSASYXXV 424 RNKNN+CGIA+ A+Y + Sbjct: 309 RNKNNQCGIATDATYPTI 326 >UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome shotgun sequence; n=3; Tetraodontidae|Rep: Chromosome 12 SCAF14996, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 362 Score = 83.4 bits (197), Expect = 4e-15 Identities = 38/73 (52%), Positives = 47/73 
(64%), Gaps = 3/73 (4%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVD---YWXREELVGPLWGELGYI 487 H SFQ Y SG+Y E+ECSS +LDHGVLVVGYG + VD +W + WG GYI Sbjct: 289 HESFQFYQSGIYYEKECSSEELDHGVLVVGYGFQGEDVDGKKFWIVKNSWSENWGNKGYI 348 Query: 486 KMIRNKNNRCGIA 448 M +++ N CGIA Sbjct: 349 YMAKDRKNHCGIA 361 >UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens (Human) Length = 331 Score = 83.4 bits (197), Expect = 4e-15 Identities = 42/78 (53%), Positives = 51/78 (65%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 H SF LY SGVY E C+ +++HGVLVVGYG D G +YW + G +GE GYI+M Sbjct: 256 HPSFFLYRSGVYYEPSCTQ-NVNHGVLVVGYG-DLNGKEYWLVKNSWGHNFGEEGYIRMA 313 Query: 477 RNKNNRCGIASSASYXXV 424 RNK N CGIAS SY + Sbjct: 314 RNKGNHCGIASFPSYPEI 331 >UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|Rep: LD36817p - Drosophila melanogaster (Fruit fly) Length = 352 Score = 83.0 bits (196), Expect = 6e-15 Identities = 36/73 (49%), Positives = 48/73 (65%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 SF+ YS G+Y +EEC+ +L+H V VVGYGT E G DYW + WGE G+++++RN Sbjct: 278 SFEQYSGGIYEDEECNQGELNHSVTVVGYGT-ENGRDYWIIKNSYSQNWGEGGFMRILRN 336 Query: 471 KNNRCGIASSASY 433 CGIAS SY Sbjct: 337 AGGFCGIASECSY 349 >UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Rep: Cathepsin R precursor - Mus musculus (Mouse) Length = 334 Score = 82.6 bits (195), Expect = 8e-15 Identities = 38/81 (46%), Positives = 49/81 (60%), Gaps = 3/81 (3%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYG---TDEQGVDYWXREELVGPLWGELGYI 487 H SF+ Y G+Y+E CSS + HGVLVVGYG + G YW + G WG GY+ Sbjct: 254 HESFKNYKGGIYHEPNCSSDTVTHGVLVVGYGFKGIETDGNHYWLIKNSWGKRWGIRGYM 313 Query: 486 KMIRNKNNRCGIASSASYXXV 424 K+ ++KNN CGIAS A Y + Sbjct: 314 KLAKDKNNHCGIASYAHYPTI 334 >UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep: Cysteine proteinase - Entamoeba histolytica Length = 320 
Score = 82.2 bits (194), Expect = 1e-14 Identities = 39/73 (53%), Positives = 47/73 (64%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 SFQLY SGVY+E +C L+H V VGYG+ + G DY+ G WG GYI M RN Sbjct: 240 SFQLYKSGVYDEPKCKKVILNHAVCAVGYGSQD-GQDYYIVRNSWGTSWGMDGYILMSRN 298 Query: 471 KNNRCGIASSASY 433 KNN+CGIA+ A Y Sbjct: 299 KNNQCGIANDAIY 311 >UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schistosoma|Rep: Preprocathepsin cathepsin L - Schistosoma japonicum (Blood fluke) Length = 331 Score = 82.2 bits (194), Expect = 1e-14 Identities = 36/73 (49%), Positives = 48/73 (65%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 S LY SG+Y ++C D++HGVL VGYG E G DYW + G LWG GY K+ RN Sbjct: 256 SLILYKSGIYESKDCKYADINHGVLAVGYGR-ENGKDYWLIKNSWGDLWGMNGYFKLRRN 314 Query: 471 KNNRCGIASSASY 433 K + CGI+S++S+ Sbjct: 315 KPHMCGISSNSSF 327 >UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 234 Score = 82.2 bits (194), Expect = 1e-14 Identities = 38/70 (54%), Positives = 46/70 (65%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469 F+LYSSGV++ +C LDH V V+GYG E G DYW G WG GYIKM RNK Sbjct: 160 FRLYSSGVFDNPKCGKIILDHVVTVIGYGV-EDGKDYWLVRNSWGKYWGLEGYIKMSRNK 218 Query: 468 NNRCGIASSA 439 +N+CGIA+ A Sbjct: 219 DNQCGIATEA 228 >UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Cathepsin K - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 333 Score = 81.8 bits (193), Expect = 1e-14 Identities = 34/74 (45%), Positives = 47/74 (63%) Frame = -1 Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 ++F Y SGVY + C+ D++H VL VGYG +G YW + G WG+ GY+ M R Sbjct: 257 STFLYYKSGVYYDPNCNKEDVNHAVLAVGYGATPRGKKYWIVKNSWGEEWGKKGYVLMAR 316 Query: 474 NKNNRCGIASSASY 433 N+NN CGIA+ AS+ Sbjct: 317 NRNNACGIANLASF 
330 >UniRef50_Q0MYX5 Cluster: Putative cysteine protease; n=1; Emiliania huxleyi|Rep: Putative cysteine protease - Emiliania huxleyi Length = 276 Score = 81.8 bits (193), Expect = 1e-14 Identities = 39/75 (52%), Positives = 51/75 (68%), Gaps = 1/75 (1%) Frame = -1 Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTD-EQGVDYWXREELVGPLWGELGYIKMI 478 ++FQLY SGV + C +LDHGVLVVGYGTD G DYW + G WGE G+++++ Sbjct: 74 SAFQLYQSGVIDSASCGK-ELDHGVLVVGYGTDTATGKDYWKIKNSWGGTWGEEGFVRVV 132 Query: 477 RNKNNRCGIASSASY 433 + K N CGI+S ASY Sbjct: 133 QGK-NMCGISSQASY 146 >UniRef50_UPI0000E224BB Cluster: PREDICTED: hypothetical protein; n=1; Pan troglodytes|Rep: PREDICTED: hypothetical protein - Pan troglodytes Length = 143 Score = 81.4 bits (192), Expect = 2e-14 Identities = 38/81 (46%), Positives = 46/81 (56%), Gaps = 3/81 (3%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGY---GTDEQGVDYWXREELVGPLWGELGYI 487 H SFQ Y G+Y E C LDH +LVVGY G D YW + G WG GYI Sbjct: 63 HVSFQFYKKGIYFEPRCDPEGLDHAMLVVGYSYEGADSDNNKYWLVKNSWGKNWGMDGYI 122 Query: 486 KMIRNKNNRCGIASSASYXXV 424 KM +++ N CGIA++ASY V Sbjct: 123 KMAKDRRNNCGIATAASYPTV 143 >UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platyhelminthes|Rep: Cathepsin L-like proteinase - Echinococcus multilocularis Length = 338 Score = 81.4 bits (192), Expect = 2e-14 Identities = 36/77 (46%), Positives = 45/77 (58%) Frame = -1 Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 + F LY G+Y + CS LDH VLVVGY D+ YW + G WG+ GYI M R Sbjct: 262 SGFMLYKKGIYQDNTCSQQYLDHAVLVVGYDADKTRQKYWIVKNSWGEDWGQRGYIWMAR 321 Query: 474 NKNNRCGIASSASYXXV 424 +K N CGIA+ ASY + Sbjct: 322 DKGNMCGIATMASYPLI 338 >UniRef50_A2F4G0 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 203 Score = 81.4 bits (192), Expect = 2e-14 Identities = 36/70 (51%), Positives = 47/70 (67%) Frame = -1 Query: 654 
TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 TSFQLY SG+Y E +CS+ +D + VGYGT E +YW + G WGE GYI+MI+ Sbjct: 127 TSFQLYQSGIYYEPDCSTETMDLSMACVGYGT-EGTTNYWIVKNCFGDKWGEQGYIRMIK 185 Query: 474 NKNNRCGIAS 445 +KNN C IA+ Sbjct: 186 DKNNNCAIAT 195 >UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]; n=11; Eutheria|Rep: Testin-2 precursor [Contains: Testin-1] - Mus musculus (Mouse) Length = 333 Score = 81.4 bits (192), Expect = 2e-14 Identities = 38/81 (46%), Positives = 48/81 (59%), Gaps = 3/81 (3%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGY---GTDEQGVDYWXREELVGPLWGELGYI 487 H SFQ Y SG+Y E +C L+H VLVVGY G + G YW + G WG GYI Sbjct: 253 HDSFQFYDSGIYYEPQCKRVHLNHAVLVVGYGFEGEESDGNSYWLVKNSWGEEWGMKGYI 312 Query: 486 KMIRNKNNRCGIASSASYXXV 424 K+ ++ NN CGIA+ A+Y V Sbjct: 313 KIAKDWNNHCGIATLATYPIV 333 >UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropicalis|Rep: LOC594890 protein - Xenopus tropicalis (Western clawed frog) (Silurana tropicalis) Length = 355 Score = 81.0 bits (191), Expect = 2e-14 Identities = 37/76 (48%), Positives = 50/76 (65%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 +F++Y +GVY + CSS+ DH VLVVGYG E GV+YW + G +G+ GYIKM RN Sbjct: 281 TFRMYKNGVYYDPNCSSSTPDHSVLVVGYGA-EDGVEYWLVKNSWGTSFGDEGYIKMARN 339 Query: 471 KNNRCGIASSASYXXV 424 +N CGIA+ + V Sbjct: 340 HHNNCGIANFGCFPVV 355 >UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Longidorus elongatus Length = 358 Score = 81.0 bits (191), Expect = 2e-14 Identities = 38/75 (50%), Positives = 45/75 (60%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469 FQ YS GVY + CS LDHGVL VGY + + G Y+ + WG+ GYI M R K Sbjct: 282 FQFYSHGVYYDRSCSPEYLDHGVLAVGYNSTKDGKQYYIVKNSWSEDWGDDGYILMSRRK 341 Query: 468 NNRCGIASSASYXXV 424 NN CGIA+ ASY V Sbjct: 342 NNNCGIATMASYPFV 356 >UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean 
endopeptidase) (Cysteine proteinase) (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1; Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase) (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1; Vignain-2] - Vigna mungo (Rice bean) (Black gram) Length = 362 Score = 79.4 bits (187), Expect = 7e-14 Identities = 40/77 (51%), Positives = 48/77 (62%), Gaps = 3/77 (3%) Frame = -1 Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 + FQ YS GV+ + C+ TDL+HGV +VGYGT G +YW GP WGE GYI+M R Sbjct: 268 SDFQFYSEGVFTGD-CN-TDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQR 325 Query: 474 N---KNNRCGIASSASY 433 N K CGIA ASY Sbjct: 326 NISKKEGLCGIAMMASY 342 >UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18; Magnoliophyta|Rep: Thiol protease aleurain precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 358 Score = 79.0 bits (186), Expect = 9e-14 Identities = 41/78 (52%), Positives = 48/78 (61%), Gaps = 2/78 (2%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 SF+LY SGVY + C ST D++H VL VGYG E GV YW + G WG+ GY KM Sbjct: 282 SFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGV-EDGVPYWLIKNSWGADWGDKGYFKME 340 Query: 477 RNKNNRCGIASSASYXXV 424 K N CGIA+ ASY V Sbjct: 341 MGK-NMCGIATCASYPVV 357 >UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 339 Score = 78.6 bits (185), Expect = 1e-13 Identities = 36/70 (51%), Positives = 46/70 (65%), Gaps = 1/70 (1%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYG-TDEQGVDYWXREELVGPLWGELGYIKMIR 475 SF LY SGVY + CSST L+HG+L +G+G T E G +Y+ + G WG GYI + R Sbjct: 263 SFMLYKSGVYKDPSCSSTILNHGILNIGFGVTPENGNEYYILKNSFGSKWGMKGYIYLSR 322 Query: 474 NKNNRCGIAS 445 N NN CGI+S Sbjct: 323 NFNNHCGISS 332 >UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schistosoma japonicum|Rep: Clone ZZD209 mRNA sequence - 
Schistosoma japonicum (Blood fluke) Length = 339 Score = 78.2 bits (184), Expect = 2e-13 Identities = 33/70 (47%), Positives = 44/70 (62%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469 F Y SG+Y + C+ +L+ +L+VGYG D G+DYW + G WGE GY+K+ RN Sbjct: 262 FLHYKSGIYQSDTCTHYNLNQSMLLVGYGYDNDGIDYWIVQNSWGKKWGESGYVKVRRNN 321 Query: 468 NNRCGIASSA 439 N CGIAS A Sbjct: 322 WNMCGIASLA 331 >UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin l; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin l - Strongylocentrotus purpuratus Length = 489 Score = 77.4 bits (182), Expect = 3e-13 Identities = 36/75 (48%), Positives = 45/75 (60%), Gaps = 2/75 (2%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 SF YS G Y + C +T DLDH VL VGYGTD G DYW + WG GY+ I Sbjct: 410 SFSFYSYGTYYDASCGNTVDDLDHAVLAVGYGTDSSGQDYWLIKNSWSTHWGNNGYV-AI 468 Query: 477 RNKNNRCGIASSASY 433 K+N CG+A++A+Y Sbjct: 469 SMKDNNCGVATAATY 483 >UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 4 - Rhipicephalus appendiculatus (Brown ear tick) Length = 345 Score = 77.4 bits (182), Expect = 3e-13 Identities = 34/73 (46%), Positives = 48/73 (65%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 +F Y +G+Y E C L+H VL+VGYG +E+GV YW + GP WGE GYIK++RN Sbjct: 272 TFMFYKNGIYGEPNCDPRGLNHAVLLVGYG-EERGVPYWIVKNSWGPGWGEGGYIKILRN 330 Query: 471 KNNRCGIASSASY 433 + N CG++ S+ Sbjct: 331 R-NVCGMSQDPSF 342 >UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaster|Rep: CG11459-PA - Drosophila melanogaster (Fruit fly) Length = 336 Score = 77.0 bits (181), Expect = 4e-13 Identities = 36/77 (46%), Positives = 44/77 (57%), Gaps = 2/77 (2%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIK 484 H F YS GV + C S DL H VL+VG+GT + DYW + G WGE GY+K Sbjct: 257 
HEEFDQYSGGVLSIPACRSKRQDLTHSVLLVGFGTHRKWGDYWIIKNSYGTDWGESGYLK 316 Query: 483 MIRNKNNRCGIASSASY 433 + RN NN CG+AS Y Sbjct: 317 LARNANNMCGVASLPQY 333 >UniRef50_Q7KWP5 Cluster: Similar to Dictyostelium discoideum (Slime mold). Cysteine proteinase 5; n=2; Dictyostelium discoideum|Rep: Similar to Dictyostelium discoideum (Slime mold). Cysteine proteinase 5 - Dictyostelium discoideum (Slime mold) Length = 345 Score = 77.0 bits (181), Expect = 4e-13 Identities = 37/85 (43%), Positives = 53/85 (62%), Gaps = 8/85 (9%) Frame = -1 Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYG------TD--EQGVDYWXREELVGPLWGE 499 +SFQ YSSG+Y E C+STDL+H +L+VG+ TD + +YW + G WGE Sbjct: 261 SSFQFYSSGIYYEPSCNSTDLNHSILIVGFSDFSTTPTDSLKHSSNYWIVQNSFGKNWGE 320 Query: 498 LGYIKMIRNKNNRCGIASSASYXXV 424 GYI M +++++ CGI+ ASY V Sbjct: 321 NGYIFMSKDRDDNCGISKMASYVIV 345 >UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviatus|Rep: Cathepsin L 2 - Diaprepes abbreviatus (Sugarcane rootstalk borer weevil) Length = 348 Score = 76.6 bits (180), Expect = 5e-13 Identities = 36/75 (48%), Positives = 50/75 (66%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469 FQ Y SGVY+E +C + L+H +L VGYG+ G ++W + G WG+ GYI+M ++K Sbjct: 276 FQFYHSGVYDEPQCGHS-LNHAMLAVGYGS-MGGKNFWLVKNSWGTGWGDQGYIRMAKDK 333 Query: 468 NNRCGIASSASYXXV 424 NN+CGIA ASY V Sbjct: 334 NNQCGIALMASYPGV 348 >UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 291 Score = 76.6 bits (180), Expect = 5e-13 Identities = 35/72 (48%), Positives = 47/72 (65%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 SFQLYSSG+Y++ CSS +LDH + VVGY YW G WGE GY+++ ++ Sbjct: 220 SFQLYSSGIYSDPCCSSQNLDHAMNVVGYSD-----SYWIIRNSWGTSWGESGYMRLAKD 274 Query: 471 KNNRCGIASSAS 436 KNN CG+A+ AS Sbjct: 275 KNNMCGVATMAS 286 >UniRef50_A2DLD4 Cluster: Clan CA, family C1, 
cathepsin L or K-like cysteine peptidase; n=3; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L or K-like cysteine peptidase - Trichomonas vaginalis G3 Length = 320 Score = 76.6 bits (180), Expect = 5e-13 Identities = 33/69 (47%), Positives = 44/69 (63%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 SF Y SG+Y++ +C T LDH V +VGYG+ E G++YW G WGE GYI++I N Sbjct: 245 SFMQYKSGIYDDTKCDPTQLDHYVNLVGYGS-ESGINYWIIRNSWGEAWGESGYIRIINN 303 Query: 471 KNNRCGIAS 445 N CG+ S Sbjct: 304 AANVCGVLS 312 >UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Oryza sativa|Rep: Cysteine protease 1 precursor - Oryza sativa subsp. japonica (Rice) Length = 490 Score = 75.8 bits (178), Expect = 9e-13 Identities = 41/76 (53%), Positives = 47/76 (61%), Gaps = 4/76 (5%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQ-GVDYWXREELVGPLWGELGYIKMIRN 472 FQLY SGV+ C T+LDHGV+ VGYGTD G YW GP WGE GYI+M RN Sbjct: 299 FQLYDSGVFTGR-CG-TNLDHGVVAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERN 356 Query: 471 ---KNNRCGIASSASY 433 + +CGIA ASY Sbjct: 357 VTARTGKCGIAMMASY 372 >UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep: Cathepsin - Petromyzon marinus (Sea lamprey) Length = 333 Score = 75.4 bits (177), Expect = 1e-12 Identities = 36/76 (47%), Positives = 49/76 (64%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 +F+ Y SG++NE C + +H +LVVGYG+ G D+W + G WGE GYI MIRN Sbjct: 260 TFKHYKSGLFNEPSCDKSP-NHAMLVVGYGS-LSGNDFWIVKNSWGEDWGEKGYIYMIRN 317 Query: 471 KNNRCGIASSASYXXV 424 K+N+CGIAS Y + Sbjct: 318 KDNQCGIASIGIYPII 333 >UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae|Rep: Cysteine proteinase - Hypera postica (alfalfa weevil) Length = 324 Score = 75.4 bits (177), Expect = 1e-12 Identities = 32/72 (44%), Positives = 44/72 (61%) Frame = -1 Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNR 460 Y SG+Y +++CS L+H +L VGYGT E G DYW + G WGE GY ++ R K N+ Sbjct: 254 
YDSGIYEDQDCSPAGLNHAILAVGYGT-ENGKDYWIIKNSWGASWGEQGYFRLARGK-NQ 311 Query: 459 CGIASSASYXXV 424 CGI+ Y + Sbjct: 312 CGISEDTVYPTI 323 >UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=176; Viridiplantae|Rep: Cysteine proteinase RD21a precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 462 Score = 74.5 bits (175), Expect = 2e-12 Identities = 38/76 (50%), Positives = 47/76 (61%), Gaps = 3/76 (3%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 +FQLY SG++ + C T LDHGV+ VGYGT E G DYW G WGE GY++M RN Sbjct: 278 AFQLYDSGIF-DGSCG-TQLDHGVVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLRMARN 334 Query: 471 ---KNNRCGIASSASY 433 + +CGIA SY Sbjct: 335 IASSSGKCGIAIEPSY 350 >UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina). Cathepsin L; n=2; Dictyostelium discoideum|Rep: Similar to Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina). Cathepsin L - Dictyostelium discoideum (Slime mold) Length = 265 Score = 73.7 bits (173), Expect = 3e-12 Identities = 32/75 (42%), Positives = 43/75 (57%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469 FQ S G+Y + C + H VL +GYGTDE GVDY+ + G WG G+ K+ R Sbjct: 190 FQHLSGGIYYSDSCDPWNTIHAVLAIGYGTDENGVDYFLMKNSWGKSWGTNGFFKVKRGV 249 Query: 468 NNRCGIASSASYXXV 424 +CGI ++ASY V Sbjct: 250 KGKCGIVTAASYPIV 264 >UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchocercidae|Rep: Cathepsin L-like precursor - Brugia pahangi (Filarial nematode worm) Length = 395 Score = 73.7 bits (173), Expect = 3e-12 Identities = 35/73 (47%), Positives = 43/73 (58%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 SF+ Y GVY+E C D H VL VGYGT DYW + G WG+ GY+ M RN Sbjct: 323 SFRFYKDGVYSEGNCGRPD--HAVLAVGYGTHPSYGDYWIVKNSWGTDWGKDGYVYMARN 380 Query: 471 KNNRCGIASSASY 433 + N C IAS+AS+ Sbjct: 381 RGNMCHIASAASF 393 >UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n=1; Rattus norvegicus|Rep: 
UPI0000501FDB UniRef100 entry - Rattus norvegicus Length = 338 Score = 73.3 bits (172), Expect = 5e-12 Identities = 33/81 (40%), Positives = 51/81 (62%), Gaps = 3/81 (3%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYG---TDEQGVDYWXREELVGPLWGELGYI 487 H+S + Y G+Y+E +C++ ++H VLVVGYG + G +YW + G WG GY+ Sbjct: 259 HSSLRFYKKGIYHEPKCNNY-VNHAVLVVGYGFEGNETDGNNYWLIQNSWGERWGLNGYM 317 Query: 486 KMIRNKNNRCGIASSASYXXV 424 K+ +++NN CGIA+ A Y V Sbjct: 318 KIAKDRNNHCGIATFAQYPIV 338 >UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1; Dictyostelium discoideum AX4|Rep: Counting factor associated protein - Dictyostelium discoideum AX4 Length = 531 Score = 73.3 bits (172), Expect = 5e-12 Identities = 36/74 (48%), Positives = 45/74 (60%), Gaps = 2/74 (2%) Frame = -1 Query: 648 FQLYSSGVYNEEECSS--TDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 F+ Y SGVYN C + DLDH VL +GYGT QG DY+ + WG GY+ M R Sbjct: 453 FRYYMSGVYNNPACKNGLDDLDHEVLAIGYGT-YQGQDYFLVKNSWSTNWGMDGYVYMAR 511 Query: 474 NKNNRCGIASSASY 433 N NN CG++S A+Y Sbjct: 512 NDNNLCGVSSQATY 525 >UniRef50_Q24HI2 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 356 Score = 73.3 bits (172), Expect = 5e-12 Identities = 36/74 (48%), Positives = 48/74 (64%), Gaps = 2/74 (2%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTD--LDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 F+LY SGVY+ +CSS+ ++H VL VGYG+ E GVDYW + WG+ GY K+ R Sbjct: 271 FKLYKSGVYSNPDCSSSPQTVNHAVLAVGYGS-ENGVDYWYVKNSWSEFWGDEGYFKIQR 329 Query: 474 NKNNRCGIASSASY 433 N CG+A+ ASY Sbjct: 330 GV-NMCGVATCASY 342 >UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio molitor (Yellow mealworm) Length = 336 Score = 72.9 bits (171), Expect = 6e-12 Identities = 35/71 (49%), Positives = 40/71 (56%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469 F YS GVY 
C + H VL+VGYG +E G DYW + G WG GY K+ RN Sbjct: 263 FGSYSGGVYYNPTCETNKFTHAVLIVGYG-NENGQDYWLVKNSWGDGWGLDGYFKIARNA 321 Query: 468 NNRCGIASSAS 436 NN CGIA AS Sbjct: 322 NNHCGIAGVAS 332 >UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; Phytophthora infestans|Rep: Cathepsin-like cysteine protease - Phytophthora infestans (Potato late blight fungus) Length = 376 Score = 72.5 bits (170), Expect = 8e-12 Identities = 36/75 (48%), Positives = 46/75 (61%), Gaps = 2/75 (2%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTD--LDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 +FQLY GVY+ C + LDHGV GYG ++ DYW + G WG GYI M Sbjct: 278 TFQLYRHGVYSWPLCGNAPDALDHGVAAAGYGVYKKK-DYWLVKNSWGNSWGMKGYIMMS 336 Query: 477 RNKNNRCGIASSASY 433 RNK+N+CGIA+ A+Y Sbjct: 337 RNKDNQCGIATDATY 351 >UniRef50_A3EXS2 Cluster: Cathepsin L-like cysteine proteinase-like protein; n=1; Maconellicoccus hirsutus|Rep: Cathepsin L-like cysteine proteinase-like protein - Maconellicoccus hirsutus (hibiscus mealybug) Length = 253 Score = 72.1 bits (169), Expect = 1e-11 Identities = 31/70 (44%), Positives = 46/70 (65%), Gaps = 2/70 (2%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 SF+ Y +Y++ +C ++ + + VLVVGYGTD DYW + +G WGE GY+++ Sbjct: 176 SFKHYKGDIYDDPQCDNSRHESSYAVLVVGYGTDNN-TDYWLIKNSLGTSWGEKGYMRLA 234 Query: 477 RNKNNRCGIA 448 RN+NN CGIA Sbjct: 235 RNRNNLCGIA 244 >UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomuscorum|Rep: Cathepsin H - Oxytricha trifallax (Sterkiella histriomuscorum) Length = 366 Score = 70.9 bits (166), Expect = 2e-11 Identities = 35/74 (47%), Positives = 45/74 (60%), Gaps = 2/74 (2%) Frame = -1 Query: 648 FQLYSSGVYNEEECSS--TDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 F+ Y SGVY E C++ D++H VL VG+GTDE VDYW + G WG+ G+ KM R Sbjct: 277 FRDYKSGVYAVEGCANGPNDVNHAVLAVGFGTDENKVDYWIIKNSWGAAWGDQGFFKMKR 336 Query: 474 NKNNRCGIASSASY 433 N CGI + SY Sbjct: 337 GV-NMCGIQNCNSY 349 >UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lamblia ATCC 
50803|Rep: GLP_26_47548_45815 - Giardia lamblia ATCC 50803 Length = 577 Score = 70.9 bits (166), Expect = 2e-11 Identities = 37/78 (47%), Positives = 42/78 (53%), Gaps = 2/78 (2%) Frame = -1 Query: 651 SFQLYSSGVYNEEEC--SSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 S YS GVYN+ C DL H VL VGYGTD+ DYW PLWG GY + Sbjct: 496 SLLFYSGGVYNDPACPYKYDDLSHAVLAVGYGTDDTYGDYWIVRNSWSPLWGMDGYF-YL 554 Query: 477 RNKNNRCGIASSASYXXV 424 K+N CGI + ASY V Sbjct: 555 SMKDNICGILTDASYAVV 572 >UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|Rep: CG4847-PD, isoform D - Drosophila melanogaster (Fruit fly) Length = 420 Score = 70.9 bits (166), Expect = 2e-11 Identities = 32/72 (44%), Positives = 45/72 (62%) Frame = -1 Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNR 460 Y+ G+YN++EC+ + +H +LVVGYG+ E+G DYW + WGE GY ++ R K N Sbjct: 351 YAGGIYNDDECNKGEPNHSILVVGYGS-EKGQDYWIVKNSWDDTWGEKGYFRLPRGK-NY 408 Query: 459 CGIASSASYXXV 424 C IA SY V Sbjct: 409 CFIAEECSYPVV 420 >UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 21 SCAF14577, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 478 Score = 70.5 bits (165), Expect = 3e-11 Identities = 36/77 (46%), Positives = 45/77 (58%), Gaps = 2/77 (2%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIK 484 H SF YS+GVY E C ST DLDH VL VGYG + G YW + WG GYI Sbjct: 400 HRSFVFYSNGVYYEPACGSTVEDLDHAVLAVGYG-NLNGEPYWLIKNSWSTYWGNDGYI- 457 Query: 483 MIRNKNNRCGIASSASY 433 ++ K+N CG+ + A+Y Sbjct: 458 LMSMKDNNCGVTTDATY 474 >UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2; Taeniidae|Rep: Cathepsin L-like cysteine proteinase - Taenia solium (Pork tapeworm) Length = 339 Score = 70.5 bits (165), Expect = 3e-11 Identities = 32/75 (42%), Positives = 42/75 (56%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469 F Y G+Y CSS L+HGVL +GYG + G YW + G WG GYI M ++ 
Sbjct: 266 FMFYRHGIYKSHWCSSKFLNHGVLAIGYGKQD-GKPYWLVKNSWGTRWGMKGYIMMAKDY 324 Query: 468 NNRCGIASSASYXXV 424 +N CG+AS A + V Sbjct: 325 HNMCGVASLADFPYV 339 >UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber officinale (Ginger) Length = 475 Score = 70.1 bits (164), Expect = 4e-11 Identities = 39/76 (51%), Positives = 47/76 (61%), Gaps = 3/76 (3%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 +FQLY SG++ C+ T L+HGV VVGYGT E G DYW + G WG GYI M RN Sbjct: 283 NFQLYHSGIFTGS-CN-TSLNHGVTVVGYGT-ENGNDYWIVKNSWGENWGNSGYILMERN 339 Query: 471 ---KNNRCGIASSASY 433 + +CGIA S SY Sbjct: 340 IAESSGKCGIAISPSY 355 >UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster|Rep: CG5367-PA - Drosophila melanogaster (Fruit fly) Length = 338 Score = 70.1 bits (164), Expect = 4e-11 Identities = 34/76 (44%), Positives = 48/76 (63%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 +FQLYS G+Y++ CSS ++H ++V+G+G DYW + G WGE GYI+ IR Sbjct: 269 TFQLYSDGIYDDPLCSSASVNHAMVVIGFGK-----DYWILKNWWGQNWGENGYIR-IRK 322 Query: 471 KNNRCGIASSASYXXV 424 N CGIA+ A+Y V Sbjct: 323 GVNMCGIANYAAYAIV 338 >UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; n=16; Chrysomelidae|Rep: Digestive cysteine protease intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 326 Score = 70.1 bits (164), Expect = 4e-11 Identities = 35/74 (47%), Positives = 45/74 (60%), Gaps = 3/74 (4%) Frame = -1 Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQG---VDYWXREELVGPLWGELGYIKMIR 475 QLY SG+ + + CS DLDHGVLVVGYG Q +W + G +WGE GY ++ R Sbjct: 251 QLYYSGIISGKGCSH-DLDHGVLVVGYGKASQWSGETKFWRVKNSWGKIWGENGYFRIKR 309 Query: 474 NKNNRCGIASSASY 433 + NN CGIA +Y Sbjct: 310 DANNLCGIADDPTY 323 >UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; n=6; Chrysomelinae|Rep: Cathepsin L-like proteinase precursor - Phaedon cochleariae (Mustard beetle) Length = 324 Score = 70.1 bits (164), 
Expect = 4e-11 Identities = 30/69 (43%), Positives = 42/69 (60%) Frame = -1 Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNR 460 Y G++++ C +L HGV VVGYG E G YW + G WGE GYI++IR+ ++ Sbjct: 253 YGGGIFDDSSCLGDNLHHGVNVVGYGI-ENGQKYWIIKNTWGADWGESGYIRLIRDTDHS 311 Query: 459 CGIASSASY 433 CG+ ASY Sbjct: 312 CGVEKMASY 320 >UniRef50_Q9LUX8 Cluster: Cysteine protease; n=1; Pyrus pyrifolia|Rep: Cysteine protease - Pyrus pyrifolia (Japanese pear) (Pyrus serotina) Length = 147 Score = 69.7 bits (163), Expect = 6e-11 Identities = 39/71 (54%), Positives = 43/71 (60%), Gaps = 4/71 (5%) Frame = -1 Query: 633 SGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNR-- 460 SGV+ C TDLDHGV VVGYGTD+ G+DYW G WGE GYI+M RN N Sbjct: 1 SGVFTGR-CG-TDLDHGVTVVGYGTDK-GLDYWIVRNSWGESWGEKGYIRMQRNLGNTAN 57 Query: 459 --CGIASSASY 433 CGIA SY Sbjct: 58 GICGIAMEPSY 68 >UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 293 Score = 69.7 bits (163), Expect = 6e-11 Identities = 28/66 (42%), Positives = 42/66 (63%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469 F+ YSS VY+ +C + H +++ GYGTD G DYW + G WG GYI+++RNK Sbjct: 219 FEWYSSCVYDNPDCDPWGICHWMMICGYGTDA-GKDYWLAKNSFGSTWGMEGYIELVRNK 277 Query: 468 NNRCGI 451 + +CG+ Sbjct: 278 DGQCGV 283 >UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin S preproprotein; n=2; Tribolium castaneum|Rep: PREDICTED: similar to cathepsin S preproprotein - Tribolium castaneum Length = 525 Score = 69.3 bits (162), Expect = 8e-11 Identities = 31/72 (43%), Positives = 44/72 (61%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469 F+ YS GV+ + C+ H ++VGYGT E G D+W + GP WG GY+K+ RN+ Sbjct: 452 FKSYSGGVFYNKTCTRMKT-HVAVLVGYGT-ENGEDFWLVKNSYGPQWGLDGYVKIARNR 509 Query: 468 NNRCGIASSASY 433 NN CGI + +Y Sbjct: 510 NNHCGITNRITY 521 
>UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa (Rice) Length = 339 Score = 69.3 bits (162), Expect = 8e-11 Identities = 33/76 (43%), Positives = 43/76 (56%), Gaps = 3/76 (3%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKM--- 481 +FQ YS GV C TDLDHG++ +GYG D G YW + G WGE G+++M Sbjct: 263 TFQFYSGGVMTGS-CG-TDLDHGIVAIGYGKDGDGTQYWLLKNSWGTTWGENGFLRMEKD 320 Query: 480 IRNKNNRCGIASSASY 433 I +K CG+A SY Sbjct: 321 ISDKRGMCGLAMEPSY 336 >UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Actinidin Act3a - Actinidia eriantha Length = 380 Score = 69.3 bits (162), Expect = 8e-11 Identities = 34/74 (45%), Positives = 44/74 (59%), Gaps = 2/74 (2%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN- 472 F+ Y SG++ C +T L+H V ++GYGT E G+DYW + G WGE GY K+ RN Sbjct: 269 FRFYQSGIFTGGSCGTT-LNHAVTIIGYGT-ENGIDYWIVKNSYGTQWGESGYGKVQRNV 326 Query: 471 -KNNRCGIASSASY 433 RCGIAS Y Sbjct: 327 GGEGRCGIASYPFY 340 >UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin heavy chain; n=3; Amniota|Rep: PREDICTED: similar to ferritin heavy chain - Ornithorhynchus anatinus Length = 338 Score = 68.9 bits (161), Expect = 1e-10 Identities = 29/75 (38%), Positives = 46/75 (61%), Gaps = 3/75 (4%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQ---GVDYWXREELVGPLWGELGYIKMI 478 F Y SG+++ C+ ++H +L VGYGT ++ G DYW + WGE GY++++ Sbjct: 262 FFFYHSGIFSSHSCTQK-VNHAMLAVGYGTSKEPGGGQDYWILKNSWSERWGEQGYMRLL 320 Query: 477 RNKNNRCGIASSASY 433 + NN CG+AS AS+ Sbjct: 321 KGANNHCGVASVASF 335 >UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4; core eudicotyledons|Rep: Papain-like cysteine peptidase XBCP3 - Arabidopsis thaliana (Mouse-ear cress) Length = 437 Score = 68.9 bits (161), Expect = 1e-10 Identities = 38/76 (50%), Positives = 48/76 (63%), Gaps = 3/76 (3%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 +FQLYSSG+++ CS T 
LDH VL+VGYG+ + GVDYW + G WG G++ M RN Sbjct: 259 AFQLYSSGIFSGP-CS-TSLDHAVLIVGYGS-QNGVDYWIVKNSWGKSWGMDGFMHMQRN 315 Query: 471 KNNR---CGIASSASY 433 N CGI ASY Sbjct: 316 TENSDGVCGINMLASY 331 >UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 513 Score = 68.9 bits (161), Expect = 1e-10 Identities = 32/71 (45%), Positives = 45/71 (63%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 +F+ Y SG+Y + +C+ LDH L VGYG +E+GV YW + +WGE GYIK I Sbjct: 439 TFKFYGSGIYYDTQCTHA-LDHAALAVGYG-EEKGVSYWIVKNSWSAMWGEEGYIK-IAM 495 Query: 471 KNNRCGIASSA 439 K++ CG+A A Sbjct: 496 KDDNCGVAQKA 506 >UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Giardia intestinalis|Rep: Cathepsin B-like CP2 precursor - Giardia lamblia (Giardia intestinalis) Length = 300 Score = 68.9 bits (161), Expect = 1e-10 Identities = 34/73 (46%), Positives = 42/73 (57%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 H+ F Y SGVY + + H V +VGYGTD+ GVDYW + GP WGE GY +MI Sbjct: 223 HSDFMYYESGVY-QHTYGYMEGGHAVEMVGYGTDDDGVDYWIIKNSWGPDWGEDGYFRMI 281 Query: 477 RNKNNRCGIASSA 439 R N+ C I A Sbjct: 282 RGIND-CSIEEQA 293 >UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue of Sarcophaga 26,29kDa proteinase; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to homologue of Sarcophaga 26,29kDa proteinase - Nasonia vitripennis Length = 553 Score = 68.1 bits (159), Expect = 2e-10 Identities = 34/77 (44%), Positives = 45/77 (58%), Gaps = 2/77 (2%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTD--LDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIK 484 H +F YS+GVY E C +T+ LDH VL VGYGT G +W + WG GYI Sbjct: 475 HKTFSFYSNGVYYEPACGNTENSLDHAVLAVGYGT-INGKGFWLIKNSWSNYWGNDGYIL 533 Query: 483 MIRNKNNRCGIASSASY 433 M + KNN CG+ ++ +Y Sbjct: 534 MAQ-KNNNCGVMTAPTY 549 >UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to CG5367-PA - 
Nasonia vitripennis Length = 362 Score = 68.1 bits (159), Expect = 2e-10 Identities = 32/76 (42%), Positives = 49/76 (64%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 +FQLY SG+Y++ CSS ++H +L+VGY +YW + G WGE GY+++ + Sbjct: 293 TFQLYHSGIYDDPTCSSDLVNHAMLIVGYTP-----NYWILKNWWGASWGENGYMRLRKG 347 Query: 471 KNNRCGIASSASYXXV 424 K NRCG+A+ A+Y V Sbjct: 348 K-NRCGVANYAAYAKV 362 >UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; n=2; Danio rerio|Rep: hypothetical protein LOC550326 - Danio rerio Length = 531 Score = 68.1 bits (159), Expect = 2e-10 Identities = 35/77 (45%), Positives = 43/77 (55%), Gaps = 2/77 (2%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSS--TDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIK 484 H SF YS+GVY E EC + DLDH VL VGYG YW + WG GYI Sbjct: 453 HRSFAFYSNGVYYEPECKNGINDLDHAVLAVGYGI-MNNESYWLVKNSWSSYWGNDGYI- 510 Query: 483 MIRNKNNRCGIASSASY 433 ++ K+N CG+A+ A Y Sbjct: 511 LMSMKDNNCGVATDAIY 527 >UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative cysteine proteinase - Oryza sativa subsp. 
japonica (Rice) Length = 357 Score = 68.1 bits (159), Expect = 2e-10 Identities = 33/76 (43%), Positives = 44/76 (57%), Gaps = 5/76 (6%) Frame = -1 Query: 645 QLYSSGVYN--EEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR- 475 Q +SSGV+ + E +TDL+H + VGYGTDE G YW + G WGE GY+K+ R Sbjct: 279 QFFSSGVFGAMQNETCTTDLNHAMTAVGYGTDEHGTKYWLMKNSWGTDWGEGGYMKIARD 338 Query: 474 --NKNNRCGIASSASY 433 + CG+A SY Sbjct: 339 VASNTGLCGLAMQPSY 354 >UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays (Maize) Length = 493 Score = 67.7 bits (158), Expect = 2e-10 Identities = 38/76 (50%), Positives = 46/76 (60%), Gaps = 3/76 (3%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 +FQLYSSG++ + C T LDHGV VVGYG+ E G DYW + G WGE GY++M RN Sbjct: 305 AFQLYSSGIF-DGRCG-TYLDHGVTVVGYGS-EGGKDYWIVKNSWGTQWGEAGYVRMARN 361 Query: 471 KNNR---CGIASSASY 433 R GIA Y Sbjct: 362 VRVRPPSAGIAMEPLY 377 >UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2; Entamoeba|Rep: Cysteine proteinase ACP1 precursor - Entamoeba histolytica Length = 308 Score = 67.7 bits (158), Expect = 2e-10 Identities = 30/74 (40%), Positives = 43/74 (58%), Gaps = 1/74 (1%) Frame = -1 Query: 651 SFQLYSSG-VYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 SFQLY G +Y++ +C S ++H V VGYG++ G YW G WG+ GY + R Sbjct: 229 SFQLYKKGTIYSDTKCRSRMMNHCVTAVGYGSNSNG-KYWIIRNSWGTSWGDAGYFLLAR 287 Query: 474 NKNNRCGIASSASY 433 + NN CGI ++Y Sbjct: 288 DSNNMCGIGRDSNY 301 >UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: Cysteine protease - Saprolegnia parasitica Length = 523 Score = 67.3 bits (157), Expect = 3e-10 Identities = 36/75 (48%), Positives = 46/75 (61%), Gaps = 3/75 (4%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN- 472 FQ Y SGV+ ++ C T LDHGVLVVGYG +E G YW + G WG+ GYIK+ R Sbjct: 257 FQFYKSGVF-DKSCG-TKLDHGVLVVGYG-EEGGKKYWKVKNSWGADWGDKGYIKLAREF 313 Query: 471 --KNNRCGIASSASY 433 + +CG+A SY Sbjct: 
314 GPETGQCGVAMVPSY 328 >UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; n=23; Magnoliophyta|Rep: Senescence-specific cysteine protease - Arabidopsis thaliana (Mouse-ear cress) Length = 346 Score = 66.9 bits (156), Expect = 4e-10 Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 3/78 (3%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKM---I 478 FQ YSSGV+ E C+ T LDH V +GYG G YW + G WGE GY+++ + Sbjct: 271 FQFYSSGVFTGE-CT-TYLDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGYMRIQKDV 328 Query: 477 RNKNNRCGIASSASYXXV 424 ++K CG+A ASY + Sbjct: 329 KDKQGLCGLAMKASYPTI 346 >UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lamblia ATCC 50803|Rep: GLP_26_49243_47612 - Giardia lamblia ATCC 50803 Length = 543 Score = 66.9 bits (156), Expect = 4e-10 Identities = 34/78 (43%), Positives = 44/78 (56%), Gaps = 2/78 (2%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSS--TDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 +F YS GV+N+ C+S DL H VL+VG+GTDE DYW WG GY+ + Sbjct: 466 TFSWYSGGVFNDPACASGVDDLAHAVLLVGWGTDEVAGDYWIVRNSWSNAWGIDGYM-YL 524 Query: 477 RNKNNRCGIASSASYXXV 424 KNN CG+ + A Y V Sbjct: 525 SMKNNICGVLTCADYVMV 542 >UniRef50_Q22A69 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 330 Score = 66.5 bits (155), Expect = 5e-10 Identities = 30/73 (41%), Positives = 44/73 (60%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 + Q Y+ G+ N C+ L+HGVL+VG G+ E G D+W + G WGE GY +++R Sbjct: 256 NLQFYAGGISNPLICNPNGLNHGVLIVGLGS-ENGKDFWKVKNSWGASWGEKGYFRIVRG 314 Query: 471 KNNRCGIASSASY 433 K +CGI + SY Sbjct: 315 K-GKCGINRAVSY 326 >UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dvir_CG5367 - Drosophila virilis (Fruit fly) Length = 298 Score = 66.1 bits (154), Expect = 7e-10 Identities = 30/76 (39%), Positives = 49/76 (64%) Frame = -1 Query: 651 
SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 +FQLYS G+Y++ C+ST ++H +L++G+ ++W + G LWGE G+++M R Sbjct: 229 TFQLYSEGIYDDVSCTSTSVNHAMLLIGFDK-----NFWILKNWWGELWGEAGFMRM-RK 282 Query: 471 KNNRCGIASSASYXXV 424 N CGIA+ A+Y V Sbjct: 283 GINLCGIANYAAYAIV 298 >UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba histolytica|Rep: Cysteine protease 19 - Entamoeba histolytica Length = 324 Score = 65.7 bits (153), Expect = 9e-10 Identities = 29/74 (39%), Positives = 43/74 (58%) Frame = -1 Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 +SF LY G+YN+++C S V++VGYG D+ Y+ GP WGE GY + I Sbjct: 244 SSFLLYHGGIYNDKKCRSDKSTIAVVIVGYGIDKNNGKYFIVRNSWGPYWGEQGYFR-IS 302 Query: 474 NKNNRCGIASSASY 433 + NN CG+++ Y Sbjct: 303 SDNNLCGLSNDIYY 316 >UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1 precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 355 Score = 65.7 bits (153), Expect = 9e-10 Identities = 37/75 (49%), Positives = 44/75 (58%), Gaps = 3/75 (4%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN- 472 FQ Y GV+N + C TDLDHGV VGYG+ + G DY + GP WGE G+I+M RN Sbjct: 279 FQFYKGGVFNGK-CG-TDLDHGVAAVGYGSSK-GSDYVIVKNSWGPRWGEKGFIRMKRNT 335 Query: 471 --KNNRCGIASSASY 433 CGI ASY Sbjct: 336 GKPEGLCGINKMASY 350 >UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [Contains: Cathepsin H mini chain; Cathepsin H heavy chain; Cathepsin H light chain]; n=37; Eukaryota|Rep: Cathepsin H precursor (EC 3.4.22.16) [Contains: Cathepsin H mini chain; Cathepsin H heavy chain; Cathepsin H light chain] - Homo sapiens (Human) Length = 335 Score = 65.7 bits (153), Expect = 9e-10 Identities = 31/74 (41%), Positives = 44/74 (59%), Gaps = 2/74 (2%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTD--LDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 F +Y +G+Y+ C T ++H VL VGYG ++ G+ YW + GP WG GY + R Sbjct: 259 FMMYRTGIYSSTSCHKTPDKVNHAVLAVGYG-EKNGIPYWIVKNSWGPQWGMNGYFLIER 
317 Query: 474 NKNNRCGIASSASY 433 K N CG+A+ ASY Sbjct: 318 GK-NMCGLAACASY 330 >UniRef50_A2DKX2 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 317 Score = 65.3 bits (152), Expect = 1e-09 Identities = 27/66 (40%), Positives = 40/66 (60%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469 F+ Y GVY ++CS+ +DH + +VGYGT G DYW + G WG+ GY + RN+ Sbjct: 244 FEYYYQGVYYSDDCSAWGIDHWMTIVGYGT-YNGDDYWLVKNSFGKGWGQQGYGMVARNR 302 Query: 468 NNRCGI 451 + CG+ Sbjct: 303 DGACGV 308 >UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor; n=17; Magnoliophyta|Rep: Thiol protease aleurain-like precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 358 Score = 65.3 bits (152), Expect = 1e-09 Identities = 32/77 (41%), Positives = 44/77 (57%), Gaps = 2/77 (2%) Frame = -1 Query: 648 FQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 F+ Y GV+ C +T D++H VL VGYG ++ V YW + G WG+ GY KM Sbjct: 283 FRFYKKGVFTSNTCGNTPMDVNHAVLAVGYGVEDD-VPYWLIKNSWGGEWGDNGYFKMEM 341 Query: 474 NKNNRCGIASSASYXXV 424 K N CG+A+ +SY V Sbjct: 342 GK-NMCGVATCSSYPVV 357 >UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 317 Score = 64.9 bits (151), Expect = 2e-09 Identities = 30/73 (41%), Positives = 44/73 (60%), Gaps = 1/73 (1%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTD-LDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 +QLYS G+ + C + ++H VL VGYG+ E G D+W + WGE GY++++R Sbjct: 244 WQLYSGGILESQSCPGGESINHAVLAVGYGS-ENGKDFWLIKNSWNTYWGEEGYLRIVRG 302 Query: 471 KNNRCGIASSASY 433 K N+CGI A Y Sbjct: 303 K-NQCGINEVADY 314 >UniRef50_P25773 Cluster: Cathepsin L; n=38; Eukaryota|Rep: Cathepsin L - Felis silvestris catus (Cat) Length = 139 Score = 64.9 bits (151), Expect = 2e-09 
Identities = 29/64 (45%), Positives = 40/64 (62%), Gaps = 3/64 (4%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGY---GTDEQGVDYWXREELVGPLWGELGYIKM 481 +F+ Y G+Y + CSS D+DHGVLVVGY GT+ + YW + G WG GYIKM Sbjct: 76 TFRFYKEGIYYDPSCSSEDVDHGVLVVGYGADGTETENKKYWIIKNSWGTDWGMDGYIKM 135 Query: 480 IRNK 469 +++ Sbjct: 136 AKDR 139 >UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus tauri|Rep: Cysteine protease-1 - Ostreococcus tauri Length = 430 Score = 64.5 bits (150), Expect = 2e-09 Identities = 36/86 (41%), Positives = 50/86 (58%), Gaps = 13/86 (15%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDE----------QGVDYWXREELVGPLWG 502 SFQLY GVY+ +EC S +DHGVLVVGYG D+ + +W + G WG Sbjct: 341 SFQLYDGGVYDSKECGS-QVDHGVLVVGYGFDDTHHNATKHHKRHRHFWKVKNSWGGTWG 399 Query: 501 ELGYIKMIR---NKNNRCGIASSASY 433 E G+I+M R ++ +CGI ++ SY Sbjct: 400 EGGFIRMARRISDETGQCGITTAPSY 425 >UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis thaliana|Rep: Cysteine proteinase - Arabidopsis thaliana (Mouse-ear cress) Length = 348 Score = 64.1 bits (149), Expect = 3e-09 Identities = 33/76 (43%), Positives = 44/76 (57%), Gaps = 3/76 (3%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 +F+ YS GV+N E C TDL H V +VGYG E+G YW + G WGE GY+++ R+ Sbjct: 272 AFRHYSGGVFNGE-CG-TDLHHAVTIVGYGMSEEGTKYWVVKNSWGETWGENGYMRIKRD 329 Query: 471 ---KNNRCGIASSASY 433 CG+A A Y Sbjct: 330 VDAPQGMCGLAILAFY 345 >UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thaliana|Rep: F9P14.12 protein - Arabidopsis thaliana (Mouse-ear cress) Length = 343 Score = 64.1 bits (149), Expect = 3e-09 Identities = 37/75 (49%), Positives = 44/75 (58%), Gaps = 3/75 (4%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKM---I 478 FQLYSSGV+ C T+L+HGV VVGYG E YW + G WGE GYI+M + Sbjct: 269 FQLYSSGVFTNY-CG-TNLNHGVTVVGYGV-EGDQKYWIVKNSWGTGWGEEGYIRMERGV 325 Query: 477 RNKNNRCGIASSASY 433 +CGIA ASY Sbjct: 326 SEDTGKCGIAMMASY 340 >UniRef50_Q1ENA8 Cluster: 
Cathepsin H precursor; n=1; Guillardia theta|Rep: Cathepsin H precursor - Guillardia theta (Cryptomonas phi) Length = 353 Score = 64.1 bits (149), Expect = 3e-09 Identities = 33/71 (46%), Positives = 44/71 (61%), Gaps = 2/71 (2%) Frame = -1 Query: 639 YSSGVYNEEECSSTD--LDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466 YSSGVY+ C T ++H VL VGYGT E G+ YW + G WG+ GY K I+ + Sbjct: 279 YSSGVYSSPTCVGTPDKVNHAVLAVGYGT-EGGIPYWTIKNSWGFAWGDNGYFK-IQRGS 336 Query: 465 NRCGIASSASY 433 N+CGI+ AS+ Sbjct: 337 NKCGISVCASF 347 >UniRef50_UPI000155637A Cluster: PREDICTED: similar to ENSANGP00000013730, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to ENSANGP00000013730, partial - Ornithorhynchus anatinus Length = 229 Score = 63.7 bits (148), Expect = 4e-09 Identities = 30/75 (40%), Positives = 46/75 (61%), Gaps = 2/75 (2%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 SF Y++G+Y E +C L+H VL+VGYG QG +W + PLWG GY+ ++ Sbjct: 153 SFAFYANGIYYEPQCRHKLEQLNHAVLLVGYGV-LQGQAFWLLKNSWSPLWGNSGYM-LL 210 Query: 477 RNKNNRCGIASSASY 433 K+N CG+ ++A+Y Sbjct: 211 AMKDNDCGVTTAATY 225 >UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Toxopain-2 - Toxoplasma gondii Length = 422 Score = 63.7 bits (148), Expect = 4e-09 Identities = 34/75 (45%), Positives = 44/75 (58%), Gaps = 3/75 (4%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTD-EQGVDYWXREELVGPLWGELGYIKMIRN 472 FQ Y GV+ + C TDLDHGVL+VGYGTD E D+W + G WG GY+ M + Sbjct: 347 FQFYHEGVF-DASCG-TDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMH 404 Query: 471 K--NNRCGIASSASY 433 K +CG+ AS+ Sbjct: 405 KGEEGQCGLLLDASF 419 >UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 360 Score = 62.9 bits (146), Expect = 7e-09 Identities = 33/69 (47%), Positives = 43/69 (62%), Gaps = 2/69 (2%) Frame = -1 Query: 654 
TSFQLYSSGVYNEEECSSTD-LDHGVLVVGYGTDEQ-GVDYWXREELVGPLWGELGYIKM 481 TSF+ Y SGV E E D DH +L+VGYG DE+ VDYW + G WGE GY+++ Sbjct: 269 TSFKYYKSGVITECEDGPYDGPDHCLLLVGYGHDEELKVDYWLIKNQWGTTWGEEGYVRI 328 Query: 480 IRNKNNRCG 454 IR+ N+ G Sbjct: 329 IRDDNDHKG 337 >UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicotyledons|Rep: Cysteine proteinase - Mesembryanthemum crystallinum (Common ice plant) Length = 367 Score = 62.9 bits (146), Expect = 7e-09 Identities = 32/74 (43%), Positives = 41/74 (55%), Gaps = 2/74 (2%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469 + Y GV+ C T L+HGV VGYGT G DYW + G WGE GY++M+R Sbjct: 269 WMFYFQGVFTGP-CG-TKLNHGVTAVGYGTTNDGYDYWIIKNSWGETWGERGYMRMLRGV 326 Query: 468 N--NRCGIASSASY 433 + CGIA AS+ Sbjct: 327 SPYGLCGIAMQASF 340 >UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|Rep: Cysteine protease - Giardia muris Length = 301 Score = 62.9 bits (146), Expect = 7e-09 Identities = 32/73 (43%), Positives = 40/73 (54%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 ++ F YSSGVY + H V +VGYG DE G+ YW GP WGE GY ++I Sbjct: 224 YSDFGYYSSGVYQHVN-GMMEGGHAVEMVGYGIDESGLKYWIIRNSWGPDWGEGGYFRII 282 Query: 477 RNKNNRCGIASSA 439 R + N CGI A Sbjct: 283 R-RVNECGIEEQA 294 >UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Rep: Cysteine protease - Solanum lycopersicum (Tomato) (Lycopersicon esculentum) Length = 345 Score = 62.5 bits (145), Expect = 9e-09 Identities = 30/77 (38%), Positives = 44/77 (57%), Gaps = 3/77 (3%) Frame = -1 Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK- 469 Q Y+ G Y + C+ ++H V +GYGTDE+G YW + G WGE GY+K+IR+ Sbjct: 270 QFYAGGTY-DGNCADR-INHAVTAIGYGTDEEGQKYWLLKNSWGTSWGENGYMKIIRDSG 327 Query: 468 --NNRCGIASSASYXXV 424 + C IA +SY + Sbjct: 328 DPSGLCDIAKMSSYPNI 344 >UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative uncharacterized protein - Oryza sativa 
subsp. japonica (Rice) Length = 326 Score = 62.1 bits (144), Expect = 1e-08 Identities = 33/72 (45%), Positives = 40/72 (55%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469 F +Y GV++ C T+L+H VLVVGY E G YW + G WGE GYI+MIRN Sbjct: 238 FMIYQGGVFSGP-CG-TELNHAVLVVGYDETEDGTPYWIVKNSWGAGWGESGYIRMIRNI 295 Query: 468 NNRCGIASSASY 433 GI A Y Sbjct: 296 PAPEGICGIAMY 307 >UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-PA - Drosophila melanogaster (Fruit fly) Length = 549 Score = 62.1 bits (144), Expect = 1e-08 Identities = 33/75 (44%), Positives = 40/75 (53%), Gaps = 2/75 (2%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 +F YS GVY E C + LDH VL VGYG+ G DYW + WG GYI M Sbjct: 474 TFSFYSHGVYYEPTCKNDVDGLDHAVLAVGYGSIN-GEDYWLVKNSWSTYWGNDGYILMS 532 Query: 477 RNKNNRCGIASSASY 433 KNN CG+ + +Y Sbjct: 533 AKKNN-CGVMTMPTY 546 >UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MGC107932 protein - Xenopus tropicalis (Western clawed frog) (Silurana tropicalis) Length = 333 Score = 61.7 bits (143), Expect = 2e-08 Identities = 33/79 (41%), Positives = 48/79 (60%), Gaps = 6/79 (7%) Frame = -1 Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTD------EQGVDYWXREELVGPLWGELG 493 + FQLYS G++ E +C+ + +H V++VGYGT+ E+ DYW + G WGE G Sbjct: 252 SDFQLYSEGIF-EGDCAESP-NHAVIIVGYGTEHANDKEEEDKDYWIIKNSWGKEWGEDG 309 Query: 492 YIKMIRNKNNRCGIASSAS 436 Y+KM RN N+C I A+ Sbjct: 310 YVKMKRN-INQCSITEMAA 327 >UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: Cathepsin L - Kudoa thyrsites Length = 300 Score = 61.7 bits (143), Expect = 2e-08 Identities = 29/63 (46%), Positives = 38/63 (60%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 SFQ Y G+Y++ SS LDH VL+VGYG + +YW + GP WGE GYI + R+ Sbjct: 239 SFQFYGGGIYSDPWASSYPLDHAVLLVGYGY-KNTENYWHVKNSWGPWWGEQGYINIKRD 297 Query: 471 KNN 463 N Sbjct: 298 GKN 300 >UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 
precursor; n=2; Arabidopsis thaliana|Rep: Probable cysteine proteinase At3g43960 precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 376 Score = 61.7 bits (143), Expect = 2e-08 Identities = 31/72 (43%), Positives = 40/72 (55%), Gaps = 3/72 (4%) Frame = -1 Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN---K 469 Y SGVY + CS+ DH VL+VGYGT DYW GP WGE GY+++ RN Sbjct: 274 YKSGVY-KGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEP 332 Query: 468 NNRCGIASSASY 433 +C +A + Y Sbjct: 333 TGKCAVAVAPVY 344 >UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegleria fowleri|Rep: Cysteine proteinase homolog - Naegleria fowleri Length = 347 Score = 61.3 bits (142), Expect = 2e-08 Identities = 31/74 (41%), Positives = 44/74 (59%), Gaps = 4/74 (5%) Frame = -1 Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGV----DYWXREELVGPLWGELGYIKMI 478 Q Y+SG+ + C+ DLDHGVL+VGYG + + +YW + G WGE GY ++I Sbjct: 271 QYYTSGISDPWFCNPQDLDHGVLIVGYGVGKSWLGSEENYWIVKNSWGSDWGEDGYFRII 330 Query: 477 RNKNNRCGIASSAS 436 R K +CG+ S S Sbjct: 331 RGK-GKCGLNSVPS 343 >UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2; Arabidopsis thaliana|Rep: Putative cysteine proteinase - Arabidopsis thaliana (Mouse-ear cress) Length = 365 Score = 60.9 bits (141), Expect = 3e-08 Identities = 32/76 (42%), Positives = 44/76 (57%), Gaps = 3/76 (3%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 SF Y GVY +C TD++H V +VGYGT G++YW + G WGE GY+++ R+ Sbjct: 289 SFGHYKGGVYAGLDCG-TDVNHAVTIVGYGT-MSGLNYWVLKNSWGESWGENGYMRIRRD 346 Query: 471 ---KNNRCGIASSASY 433 CGIA A+Y Sbjct: 347 VEWPQGMCGIAQVAAY 362 >UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Bigelowiella natans|Rep: Digestive cysteine proteinase - Bigelowiella natans (Pedinomonas minutissima) (Chlorarachnion sp.(strain CCMP 621)) Length = 360 Score = 60.9 bits (141), Expect = 3e-08 Identities = 30/65 (46%), Positives = 37/65 (56%) Frame = -1 Query: 645 
QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466 Q Y GV N CS T L+H VL+VG+G D G +W + G WGE GY ++IR K Sbjct: 288 QFYKHGVANPRFCSKTSLNHAVLLVGFGVD-GGKAFWIVKNSWGEKWGENGYFRLIRGK- 345 Query: 465 NRCGI 451 CGI Sbjct: 346 GACGI 350 >UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba histolytica|Rep: Cysteine protease 17 - Entamoeba histolytica Length = 420 Score = 60.9 bits (141), Expect = 3e-08 Identities = 28/73 (38%), Positives = 43/73 (58%), Gaps = 1/73 (1%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGP-LWGELGYIKMIRN 472 F+ Y G+Y EEC+ L H + +VGYGT ++G Y+ G WGE GY+++ R Sbjct: 316 FKHYRGGIYYNEECTRRGLSHAMNLVGYGTTKEGQKYYIIRNSWGDWKWGEDGYMRLYRG 375 Query: 471 KNNRCGIASSASY 433 N CG+A++A + Sbjct: 376 -GNHCGVATNAFF 387 >UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain; n=9; Cucujiformia|Rep: Digestive cysteine proteinase intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 326 Score = 60.5 bits (140), Expect = 3e-08 Identities = 31/74 (41%), Positives = 43/74 (58%), Gaps = 3/74 (4%) Frame = -1 Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQ---GVDYWXREELVGPLWGELGYIKMIR 475 QLY G+ + C+ +L+HGVL VGYG ++ +W + G WGE GY ++ R Sbjct: 251 QLYFGGILDGLFCTH-NLNHGVLAVGYGEEDHLFGKKKFWKVKNSWGKDWGEQGYFRIKR 309 Query: 474 NKNNRCGIASSASY 433 + NN CGIA ASY Sbjct: 310 DANNLCGIADKASY 323 >UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endopterygota|Rep: Cathepsin L-like proteinase - Bombyx mori (Silk moth) Length = 402 Score = 60.5 bits (140), Expect = 3e-08 Identities = 34/73 (46%), Positives = 45/73 (61%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 +FQLYS GVY++ C S L+H +L+VGY DYW G WGE GY++ IR Sbjct: 334 TFQLYS-GVYDDPFCVSWHLNHAMLLVGYTQ-----DYWILLNWWGRNWGEDGYMR-IRR 386 Query: 471 KNNRCGIASSASY 433 NRCG+A+ A+Y Sbjct: 387 GLNRCGVANMATY 399 >UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; Dictyostelium discoideum|Rep: Cysteine proteinase 2 precursor - 
Dictyostelium discoideum (Slime mold) Length = 376 Score = 60.5 bits (140), Expect = 3e-08 Identities = 34/74 (45%), Positives = 47/74 (63%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 H SFQLY+SG+Y E +CS T+LDHGVLVVGYG QG D +E GP+ I + Sbjct: 263 HNSFQLYTSGIYYEPKCSPTELDHGVLVVGYGV--QGKD----DE--GPVLNRKQTIVIH 314 Query: 477 RNKNNRCGIASSAS 436 +N++N+ + +S Sbjct: 315 KNEDNKVESSDDSS 328 Score = 41.1 bits (92), Expect = 0.023 Identities = 17/37 (45%), Positives = 23/37 (62%) Frame = -1 Query: 543 DYWXREELVGPLWGELGYIKMIRNKNNRCGIASSASY 433 +YW + G WG GYI M +++ N CGIAS +SY Sbjct: 337 NYWIVKNSWGTSWGIKGYILMSKDRKNNCGIASVSSY 373 >UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryza sativa|Rep: Putative cysteine proteinase - Oryza sativa subsp. japonica (Rice) Length = 352 Score = 60.1 bits (139), Expect = 5e-08 Identities = 31/80 (38%), Positives = 45/80 (56%), Gaps = 5/80 (6%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGV---DYWXREELVGPLWGELGYIKMI 478 F+ Y SGV+ + C T LDH V VVGYG + G YW + G WG+ GY+K+ Sbjct: 272 FRHYGSGVFTADSCG-TKLDHAVAVVGYGAEADGSGGGGYWIIKNSWGTTWGDGGYMKLE 330 Query: 477 RNKNNR--CGIASSASYXXV 424 ++ ++ CG+A + SY V Sbjct: 331 KDVGSQGACGVAMAPSYPVV 350 >UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2; Brugia malayi|Rep: Cahepsin L-like cysteine protease - Brugia malayi (Filarial nematode worm) Length = 371 Score = 60.1 bits (139), Expect = 5e-08 Identities = 30/82 (36%), Positives = 44/82 (53%), Gaps = 8/82 (9%) Frame = -1 Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDE--------QGVDYWXREELVGPLWGELGY 490 + Y G+++ +C+ T + H +L VGYGT+E + VDYW + WG GY Sbjct: 286 KFYRRGIFSTSKCT-TRMGHALLAVGYGTEEVKLQNGTKKSVDYWLLKNSWSKRWGIGGY 344 Query: 489 IKMIRNKNNRCGIASSASYXXV 424 +K+ RN+ N CGI A Y V Sbjct: 345 LKLARNQENMCGIGFYACYPLV 366 >UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophyta|Rep: Cysteine protease Cp5 - Actinidia deliciosa (Kiwi) Length = 509 Score = 59.7 bits (138), Expect = 6e-08 
Identities = 34/77 (44%), Positives = 44/77 (57%), Gaps = 5/77 (6%) Frame = -1 Query: 648 FQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 FQLY+ G+Y + +CS D+DH VLVVGYG E G +YW + G WG GY + R Sbjct: 287 FQLYTGGIY-DGDCSDDPDDIDHAVLVVGYGA-ESGEEYWIIKNSWGTDWGMKGYAYIKR 344 Query: 474 NKNNR---CGIASSASY 433 N + C I + ASY Sbjct: 345 NTSKDYGVCAINAMASY 361 >UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromeliaceae|Rep: Fruit bromelain precursor - Ananas comosus (Pineapple) Length = 351 Score = 59.7 bits (138), Expect = 6e-08 Identities = 26/71 (36%), Positives = 39/71 (54%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 +FQ Y+ GV++ C T L+H + ++GYG D G YW G WGE GY++M R Sbjct: 260 NFQYYNGGVFSGP-CG-TSLNHAITIIGYGQDSSGTKYWIVRNSWGSSWGEGGYVRMARG 317 Query: 471 KNNRCGIASSA 439 ++ G+ A Sbjct: 318 VSSSSGVCGIA 328 >UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine protease; n=11; Callosobruchus maculatus|Rep: Putative gut cathepsin L-like cysteine protease - Callosobruchus maculatus (Southern cowpea weevil) (Pulse bruchid) Length = 326 Score = 59.3 bits (137), Expect = 8e-08 Identities = 32/77 (41%), Positives = 43/77 (55%), Gaps = 3/77 (3%) Frame = -1 Query: 654 TSFQLYSSGVYNEE-ECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIK 484 + Y G+ +E CS+ DL+HGVLVVGYG+ E GVDYW + G WGE GY + Sbjct: 248 SQLSFYDKGIVDERCRCSNKREDLNHGVLVVGYGS-ENGVDYWIVKNSWGADWGEKGYFR 306 Query: 483 MIRNKNNRCGIASSASY 433 ++ CGI +Y Sbjct: 307 -LKKDVKACGIGYYNTY 322 >UniRef50_Q5BTK3 Cluster: SJCHGC00358 protein; n=1; Schistosoma japonicum|Rep: SJCHGC00358 protein - Schistosoma japonicum (Blood fluke) Length = 78 Score = 59.3 bits (137), Expect = 8e-08 Identities = 29/75 (38%), Positives = 42/75 (56%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469 F Y SGV +C + VL+VGYG +++ YW + +G +G+ GYIK+ RN Sbjct: 5 FLAYESGVLIPTDCQDKEAFESVLLVGYGIEDE-TPYWLIKFSLGTEFGDQGYIKLARNH 63 Query: 468 NNRCGIASSASYXXV 424 +N C IAS A Y + 
Sbjct: 64 SNMCHIASYAYYPVI 78 >UniRef50_Q22AB1 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 344 Score = 59.3 bits (137), Expect = 8e-08 Identities = 29/76 (38%), Positives = 43/76 (56%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 + Q Y G+ + + C ++H VL+VGYG +E G+ YW + G WG G+ K+IR Sbjct: 269 TLQFYEGGIVDPKNCDDK-INHAVLIVGYGVEE-GIPYWLIKNQWGAEWGIKGFFKLIRG 326 Query: 471 KNNRCGIASSASYXXV 424 K +CGI + AS V Sbjct: 327 K-KQCGIHTYASIAYV 341 >UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 precursor; n=4; Schizophora|Rep: Putative cysteine proteinase CG12163 precursor - Drosophila melanogaster (Fruit fly) Length = 614 Score = 59.3 bits (137), Expect = 8e-08 Identities = 31/79 (39%), Positives = 45/79 (56%), Gaps = 7/79 (8%) Frame = -1 Query: 651 SFQLYSSGVYNEEE--CSSTDLDHGVLVVGYGTDE-----QGVDYWXREELVGPLWGELG 493 + Q Y GV + + CS +LDHGVLVVGYG + + + YW + GP WGE G Sbjct: 532 AMQFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQG 591 Query: 492 YIKMIRNKNNRCGIASSAS 436 Y ++ R +N CG++ A+ Sbjct: 592 YYRVYRG-DNTCGVSEMAT 609 >UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative cysteine proteinase - Oryza sativa subsp. japonica (Rice) Length = 385 Score = 58.8 bits (136), Expect = 1e-07 Identities = 32/74 (43%), Positives = 41/74 (55%), Gaps = 4/74 (5%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLD-HGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKM--- 481 F+ Y GV+ S+ ++D H VLVVGYG + YW + G WGE GYI+M Sbjct: 290 FRSYRGGVFRGPCGSNPNVDNHVVLVVGYGVTTDNIKYWIIKNSWGKTWGEYGYIRMERD 349 Query: 480 IRNKNNRCGIASSA 439 I NKN CGI + A Sbjct: 350 ILNKNGICGITTWA 363 >UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8; Magnoliophyta|Rep: Os12g0273800 protein - Oryza sativa subsp. 
japonica (Rice) Length = 504 Score = 58.8 bits (136), Expect = 1e-07 Identities = 27/65 (41%), Positives = 37/65 (56%) Frame = -1 Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 + FQ Y GV EC T LDHGV V+GYG G YW + G WGE GY++M + Sbjct: 263 SKFQFYGGGVM-AGECG-TSLDHGVTVIGYGAASDGTKYWLVKNSWGTTWGEAGYLRMEK 320 Query: 474 NKNNR 460 + +++ Sbjct: 321 DIDDK 325 >UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudicotyledons|Rep: Chymopapain precursor - Carica papaya (Papaya) Length = 352 Score = 58.8 bits (136), Expect = 1e-07 Identities = 33/75 (44%), Positives = 43/75 (57%), Gaps = 3/75 (4%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR-- 475 FQLY SGV+ + C T LDH V VGYGT + G +Y + GP WGE GY+++ R Sbjct: 275 FQLYKSGVF-DGPCG-TKLDHAVTAVGYGTSD-GKNYIIIKNSWGPNWGEKGYMRLKRQS 331 Query: 474 -NKNNRCGIASSASY 433 N CG+ S+ Y Sbjct: 332 GNSQGTCGVYKSSYY 346 >UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3; Eukaryota|Rep: Cathepsin-like cysteine protease - Phytophthora infestans (Potato late blight fungus) Length = 635 Score = 58.4 bits (135), Expect = 1e-07 Identities = 23/62 (37%), Positives = 41/62 (66%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469 F YS G++ +++ ++TD+DH + +VG+G +E GV +W G WGE G+++++R Sbjct: 230 FLKYSGGIF-DDKTNATDVDHAISIVGWG-EENGVPFWVLRNSWGSFWGESGWMRLVRGV 287 Query: 468 NN 463 NN Sbjct: 288 NN 289 Score = 44.4 bits (100), Expect = 0.002 Identities = 20/65 (30%), Positives = 35/65 (53%), Gaps = 1/65 (1%) Frame = -1 Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQ-GVDYWXREELVGPLWGELGYIKMI 478 + F+ Y+ G+Y+E ++H + V G+G DE+ +YW G WGE G+ ++ Sbjct: 526 SKFESYTGGIYSEHVMFPL-INHEISVAGWGYDEETDTEYWIGRNSWGTYWGENGWFRIQ 584 Query: 477 RNKNN 463 + NN Sbjct: 585 MHHNN 589 >UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1; Caenorhabditis elegans|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 299 Score = 58.4 bits (135), Expect = 
1e-07 Identities = 31/74 (41%), Positives = 42/74 (56%), Gaps = 2/74 (2%) Frame = -1 Query: 651 SFQLYSSGVYN--EEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 SF Y +G+YN +EEC + + + +VGYG D YW + G WGE GY+K+ Sbjct: 220 SFFHYKTGIYNPTKEECGNANEARSLAIVGYGKDG-AEKYWIVKGSFGTSWGEHGYMKLA 278 Query: 477 RNKNNRCGIASSAS 436 RN N CG+A S S Sbjct: 279 RNV-NACGMAESIS 291 >UniRef50_UPI00006CBB5F Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 1367 Score = 58.0 bits (134), Expect = 2e-07 Identities = 27/75 (36%), Positives = 41/75 (54%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469 F+ Y+ G+ N + S + H + +VG+G DE+ YW +G WGE G+I++IR K Sbjct: 956 FRNYTGGILNPPD-SPVQITHSLSIVGWGEDEKQTKYWIARNSLGTFWGENGFIRIIRGK 1014 Query: 468 NNRCGIASSASYXXV 424 N I S SY + Sbjct: 1015 -NALKIESDCSYGRI 1028 Score = 44.4 bits (100), Expect = 0.002 Identities = 21/59 (35%), Positives = 33/59 (55%) Frame = -1 Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNN 463 Y+ G+Y+E+ +H V VVG+G +G +YW G WGE G+ K+ +K+N Sbjct: 1294 YTGGIYSEKVKLPIP-NHYVSVVGWGQTLEGEEYWIVRNSWGTYWGEEGFFKLKMHKDN 1351 >UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted, possible transmembrane domain near N-terminus; n=4; Cryptosporidium|Rep: Cryptopain-cysteine proteinase secreted, possible transmembrane domain near N-terminus - Cryptosporidium parvum Iowa II Length = 401 Score = 58.0 bits (134), Expect = 2e-07 Identities = 32/80 (40%), Positives = 41/80 (51%), Gaps = 3/80 (3%) Frame = -1 Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQ-GVDYWXREELVGPLWGELGYIKMI 478 T FQ Y SGV+ + C T ++HGV++VGY DE +YW G WGE GYIK+ Sbjct: 320 TPFQFYKSGVF-DAPCG-TKVNHGVVLVGYDMDEDTNKEYWLVRNSWGEAWGEKGYIKLA 377 Query: 477 --RNKNNRCGIASSASYXXV 424 K CGI Y + Sbjct: 378 LHSGKKGTCGILVEPVYPVI 397 >UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia 
circumcincta|Rep: Secreted cathepsin F - Teladorsagia circumcincta Length = 364 Score = 58.0 bits (134), Expect = 2e-07 Identities = 25/60 (41%), Positives = 34/60 (56%) Frame = -1 Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466 Q Y GV C + + HG L+VGYG E+ + YW + GP WGE GY +M+R +N Sbjct: 292 QFYKGGVSRPTTCRLSSMIHGALLVGYGV-EKNIPYWIIKNSWGPNWGEDGYYRMVRGEN 350 >UniRef50_O16454 Cluster: Temporarily assigned gene name protein 196; n=4; Bilateria|Rep: Temporarily assigned gene name protein 196 - Caenorhabditis elegans Length = 477 Score = 58.0 bits (134), Expect = 2e-07 Identities = 31/74 (41%), Positives = 41/74 (55%), Gaps = 2/74 (2%) Frame = -1 Query: 651 SFQLYSSGVYNEEE--CSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 + Q Y GV + + C L+HGVL+VGYG D + YW + GP WGE GY K+ Sbjct: 401 TLQFYRHGVVHPFKIFCEPFMLNHGVLIVGYGKDGRK-PYWIVKNSWGPNWGEAGYFKLY 459 Query: 477 RNKNNRCGIASSAS 436 R K N CG+ A+ Sbjct: 460 RGK-NVCGVQEMAT 472 >UniRef50_UPI0000498E2F Cluster: cysteine proteinase; n=1; Entamoeba histolytica HM-1:IMSS|Rep: cysteine proteinase - Entamoeba histolytica HM-1:IMSS Length = 317 Score = 57.6 bits (133), Expect = 2e-07 Identities = 28/67 (41%), Positives = 39/67 (58%), Gaps = 1/67 (1%) Frame = -1 Query: 630 GVY-NEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNRCG 454 G++ N EECS + G+L++GYG G+ YW + G WG GY+ + RNK N CG Sbjct: 245 GIFENIEECSQSSPRIGLLLIGYGKTINGIPYWILKNCWGSSWGSNGYLYLKRNK-NVCG 303 Query: 453 IASSASY 433 I S +Y Sbjct: 304 IYSYGTY 310 >UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF6860, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 251 Score = 57.6 bits (133), Expect = 2e-07 Identities = 39/103 (37%), Positives = 50/103 (48%), Gaps = 28/103 (27%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPL---------- 508 H+SF YSSG+Y E C+ +L H VL+VGYG+ E G DYW + G Sbjct: 148 
HSSFLFYSSGIYEESNCNPNNLSHAVLLVGYGS-EGGQDYWLIKNRWGTTRQTAPAVAND 206 Query: 507 --------------WG----ELGYIKMIRNKNNRCGIASSASY 433 WG E GY+++IR+ N CGIAS A Y Sbjct: 207 HFLIKTLCLFCFFSWGSSWGEGGYMRLIRDGKNSCGIASYALY 249 >UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Liliopsida|Rep: Putative cysteine proteinase - Oryza sativa subsp. japonica (Rice) Length = 416 Score = 57.2 bits (132), Expect = 3e-07 Identities = 29/69 (42%), Positives = 37/69 (53%), Gaps = 3/69 (4%) Frame = -1 Query: 630 GVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN---KNNR 460 GVYN C T ++H V VGYG + ++YW GP WGE GYI+M R+ K Sbjct: 332 GVYNGP-CG-TSVNHAVTTVGYGVTQDNINYWIARNSWGPRWGESGYIRMKRDIAAKEGL 389 Query: 459 CGIASSASY 433 CGI+ Y Sbjct: 390 CGISMYGVY 398 Score = 50.0 bits (114), Expect = 5e-05 Identities = 29/72 (40%), Positives = 37/72 (51%), Gaps = 5/72 (6%) Frame = -1 Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYG--TDEQGVDYWXREELVGPLWGELGYIKMIRN 472 Q Y GV+ C + L+HGV+VVGYG T YW + G WGE GYI+M R+ Sbjct: 259 QHYKKGVFTGR-CKTAPLNHGVVVVGYGVNTTPDKTKYWIVKNSWGKGWGEGGYIRMKRD 317 Query: 471 ---KNNRCGIAS 445 CGI + Sbjct: 318 VGTPGGLCGITT 329 >UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gambiae str. PEST|Rep: ENSANGP00000018713 - Anopheles gambiae str. 
PEST Length = 559 Score = 56.8 bits (131), Expect = 4e-07 Identities = 29/79 (36%), Positives = 43/79 (54%), Gaps = 7/79 (8%) Frame = -1 Query: 651 SFQLYSSGVYNEEE--CSSTDLDHGVLVVGYGTDE-----QGVDYWXREELVGPLWGELG 493 + Q Y G+ + C+ +DHGVL+VGYG E + + YW + GP WGE G Sbjct: 477 AMQFYRGGISHPWHPLCNHKSIDHGVLIVGYGIKEYPMFNKTLPYWIIKNSWGPRWGEQG 536 Query: 492 YIKMIRNKNNRCGIASSAS 436 Y ++ R +N CG++ AS Sbjct: 537 YYRIYRG-DNSCGVSEMAS 554 >UniRef50_Q22RC9 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 389 Score = 56.8 bits (131), Expect = 4e-07 Identities = 27/65 (41%), Positives = 36/65 (55%) Frame = -1 Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466 Q Y G+ + CS T L+H VL+ GYG D GV++W + G WGE GY ++ R Sbjct: 299 QFYKKGISAPKFCSKTTLNHAVLLTGYGID-NGVEFWNVKNSWGAKWGEQGYFRLKRGV- 356 Query: 465 NRCGI 451 CGI Sbjct: 357 GMCGI 361 >UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; Dictyostelium discoideum|Rep: Cysteine proteinase 7 precursor - Dictyostelium discoideum (Slime mold) Length = 460 Score = 56.8 bits (131), Expect = 4e-07 Identities = 24/31 (77%), Positives = 26/31 (83%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGT 559 SFQLY SG+YNE CSST LDHGVL VG+GT Sbjct: 254 SFQLYVSGIYNEPACSSTQLDHGVLAVGFGT 284 Score = 42.3 bits (95), Expect = 0.010 Identities = 18/36 (50%), Positives = 22/36 (61%) Frame = -1 Query: 543 DYWXREELVGPLWGELGYIKMIRNKNNRCGIASSAS 436 DYW + G WG GYI M + NN+CGIA+ AS Sbjct: 417 DYWIVKNSWGTSWGMDGYILMTKGNNNQCGIATMAS 452 >UniRef50_Q650X9 Cluster: Putative cysteine proteinase; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative cysteine proteinase - Oryza sativa subsp. 
japonica (Rice) Length = 361 Score = 56.4 bits (130), Expect = 6e-07 Identities = 32/73 (43%), Positives = 41/73 (56%), Gaps = 3/73 (4%) Frame = -1 Query: 642 LYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN--- 472 + GV+ + CSST ++H VLVVGYG DYW + G WGE GYI++ RN Sbjct: 274 ILKGGVF-DGYCSSTKVNHNVLVVGYGE-----DYWIIKNSWGIYWGENGYIRLKRNVPA 327 Query: 471 KNNRCGIASSASY 433 K +CGI A Y Sbjct: 328 KQGKCGITLQAWY 340 >UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza sativa|Rep: Os09g0381400 protein - Oryza sativa subsp. japonica (Rice) Length = 362 Score = 56.0 bits (129), Expect = 7e-07 Identities = 31/77 (40%), Positives = 39/77 (50%), Gaps = 3/77 (3%) Frame = -1 Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTD-EQGVDYWXREELVGPLWGELGYIKMI 478 + Q Y GVY C T L H V VVGYGTD G YW + G WGE GYI+++ Sbjct: 283 SGMQFYKGGVYTGP-CG-TRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRIL 340 Query: 477 RNKN--NRCGIASSASY 433 R+ CG+ +Y Sbjct: 341 RDVGGPGLCGVTLDIAY 357 >UniRef50_Q9VNK7 Cluster: CG1075-PA; n=1; Drosophila melanogaster|Rep: CG1075-PA - Drosophila melanogaster (Fruit fly) Length = 274 Score = 56.0 bits (129), Expect = 7e-07 Identities = 26/64 (40%), Positives = 33/64 (51%), Gaps = 2/64 (3%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIK 484 H F Y G+ C +T DL H VL+VG+ T + DYW + G WGE GY K Sbjct: 187 HEEFDQYFGGILRTPSCRNTNYDLKHSVLLVGFETHPKWGDYWIIKNSYGTEWGESGYFK 246 Query: 483 MIRN 472 + RN Sbjct: 247 LARN 250 >UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Rep: Cathepsin L - Stylonychia lemnae Length = 340 Score = 56.0 bits (129), Expect = 7e-07 Identities = 30/68 (44%), Positives = 36/68 (52%), Gaps = 2/68 (2%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN- 472 FQ Y SG+++ C T+LDHGV VGYG D G Y+ WG GYI +I N Sbjct: 266 FQFYRSGIFDSSWCG-TNLDHGVAAVGYGVD-NGKQYYIVRNSWSDSWGLKGYINIIANG 323 Query: 471 -KNNRCGI 451 N CGI Sbjct: 324 DGNGMCGI 331 >UniRef50_Q23RT7 Cluster: Papain family cysteine protease 
containing protein; n=7; Hymenostomatida|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 387 Score = 56.0 bits (129), Expect = 7e-07 Identities = 26/67 (38%), Positives = 40/67 (59%), Gaps = 1/67 (1%) Frame = -1 Query: 654 TSFQLYSSGVYNE-EECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 ++F Y SGV++ + + D++H V++VGYGTDE+ DYW G +GE GYI++ Sbjct: 280 SNFHDYESGVFHGCDGADNVDINHAVVLVGYGTDEKEGDYWIVRNSWGTRFGENGYIRVK 339 Query: 477 RNKNNRC 457 R C Sbjct: 340 REATPTC 346 >UniRef50_Q23H32 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 365 Score = 56.0 bits (129), Expect = 7e-07 Identities = 29/75 (38%), Positives = 42/75 (56%), Gaps = 6/75 (8%) Frame = -1 Query: 651 SFQLYSSGVYNEE--ECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 +F+ Y G+ +E+ EC DH + +VGYG+ E G YW + G WGE GYI+++ Sbjct: 276 NFKFYKGGIADEKLLECDPQYTDHCLGIVGYGS-ENGKQYWILKNSWGENWGEKGYIRLL 334 Query: 477 R----NKNNRCGIAS 445 R N CGIA+ Sbjct: 335 RSDSSNTQGTCGIAT 349 >UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; Entamoeba|Rep: Cysteine proteinase 2 precursor - Entamoeba histolytica Length = 315 Score = 56.0 bits (129), Expect = 7e-07 Identities = 29/74 (39%), Positives = 40/74 (54%), Gaps = 2/74 (2%) Frame = -1 Query: 648 FQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 FQLY SG Y + +C + L+H V VGYG + G + W G WG+ GYI M+ Sbjct: 237 FQLYKSGAYTDTKCKNNYFALNHEVCAVGYGVVD-GKECWIVRNSWGTGWGDKGYINMV- 294 Query: 474 NKNNRCGIASSASY 433 + N CG+A+ Y Sbjct: 295 IEGNTCGVATDPLY 308 >UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lamblia ATCC 50803|Rep: GLP_217_11853_10927 - Giardia lamblia ATCC 50803 Length = 308 Score = 55.6 bits (128), Expect = 1e-06 Identities = 29/81 (35%), Positives = 44/81 (54%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 + F Y G+Y+ + V 
+VGYGT ++G DYW + GP WGE GY +++ Sbjct: 223 YEDFTYYLEGIYSYTYGNRVGF-LSVEIVGYGTSDEGQDYWIVKNYWGPGWGEDGYFRIV 281 Query: 477 RNKNNRCGIASSASYXXV*TP 415 R + N C I +SA Y + +P Sbjct: 282 RGQ-NECQIENSA-YGAIISP 300 >UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep: Aca s 1 allergen - Acarus siro (Dust mite) Length = 331 Score = 55.6 bits (128), Expect = 1e-06 Identities = 26/65 (40%), Positives = 37/65 (56%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 H F+ Y SGV +T+++H + +VG+G E G+DYW G WGE GY K+ Sbjct: 255 HNGFKHYKSGVIRLTRGGTTEVNHVINIVGWGR-ENGLDYWLIRNSWGTHWGEAGYGKVE 313 Query: 477 RNKNN 463 R+ NN Sbjct: 314 RHHNN 318 >UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 514 Score = 55.6 bits (128), Expect = 1e-06 Identities = 31/68 (45%), Positives = 37/68 (54%), Gaps = 1/68 (1%) Frame = -1 Query: 651 SFQLYSSGVYNEEECS-STDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 S + YS G+Y++ EC T H VLVVGYG E G YW + WG GYIK I Sbjct: 445 SLKFYSWGLYDDPECGRDTAAVHSVLVVGYGV-EDGEPYWLVKNSWSTTWGMDGYIK-IA 502 Query: 474 NKNNRCGI 451 K N CG+ Sbjct: 503 WKRNTCGV 510 >UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanensis|Rep: Sui m 1 allergen - Suidasia medanensis Length = 336 Score = 55.6 bits (128), Expect = 1e-06 Identities = 25/61 (40%), Positives = 36/61 (59%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469 F+ Y +GV +S ++H V +VG+GT E G DYW + GP WGE GY ++ R+ Sbjct: 264 FRFYRNGVIQNLRPNSRQINHAVTLVGWGT-EDGQDYWIVKNSWGPSWGESGYFRLGRHH 322 Query: 468 N 466 N Sbjct: 323 N 323 >UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 328 Score = 55.2 bits (127), Expect = 1e-06 Identities = 31/74 (41%), Positives = 40/74 (54%), Gaps = 2/74 (2%) Frame = -1 Query: 651 SFQLYSSGVYNEEEC-SSTDLD-HGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 +F+ 
Y+SGV E+C T + H V +VGYGT + GV YW WG GY+K I Sbjct: 250 NFEWYTSGVLQSEDCYQMTPAEWHSVAIVGYGTSDDGVPYWLVRNSWNSDWGLHGYVK-I 308 Query: 477 RNKNNRCGIASSAS 436 R N C I S A+ Sbjct: 309 RRGVNWCLIESHAA 322 >UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1; Brugia malayi|Rep: Cathepsin F-like cysteine proteinase - Brugia malayi (Filarial nematode worm) Length = 461 Score = 55.2 bits (127), Expect = 1e-06 Identities = 26/70 (37%), Positives = 40/70 (57%), Gaps = 2/70 (2%) Frame = -1 Query: 639 YSSGVYN--EEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466 Y SG+ + + C + ++HGVL+ GYG E + YW + G WGE GY +++R K Sbjct: 389 YKSGILHPSKSRCPPSKINHGVLITGYGI-ENNLPYWTIKNSWGEQWGENGYFQLMRGK- 446 Query: 465 NRCGIASSAS 436 N CG++ S Sbjct: 447 NICGVSDLVS 456 >UniRef50_Q24E33 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 328 Score = 55.2 bits (127), Expect = 1e-06 Identities = 29/74 (39%), Positives = 42/74 (56%) Frame = -1 Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 T+F+ Y+SGV+ + C L+HGVL GY D YW + G WG+ GYI + Sbjct: 262 TNFKFYTSGVF--DNCKKK-LNHGVLATGYTAD-----YWIIKNSWGTAWGQNGYINL-- 311 Query: 474 NKNNRCGIASSASY 433 + N CG+ ++ASY Sbjct: 312 KRGNTCGVCNTASY 325 >UniRef50_Q239L8 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 323 Score = 55.2 bits (127), Expect = 1e-06 Identities = 28/73 (38%), Positives = 43/73 (58%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 +FQ Y ++++ C T+LDHGVL+VGY + YW + GP WGE G+I++ Sbjct: 256 NFQYYQKDIFSD--CG-TELDHGVLLVGYSASGK---YWKVKNSWGPNWGESGFIRLA-- 307 Query: 471 KNNRCGIASSASY 433 N CG+ + AS+ Sbjct: 308 AGNTCGLCNMASF 320 >UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Giardia 
intestinalis|Rep: Cathepsin B-like CP1 precursor - Giardia lamblia (Giardia intestinalis) Length = 303 Score = 55.2 bits (127), Expect = 1e-06 Identities = 24/64 (37%), Positives = 33/64 (51%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 + Y SGVY + H + +VGYGT + G DYW + GP WGE GY +++ Sbjct: 226 YADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIKNSWGPDWGENGYFRIV 285 Query: 477 RNKN 466 R N Sbjct: 286 RGVN 289 >UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sativa|Rep: Cysteine proteinase-like - Oryza sativa subsp. japonica (Rice) Length = 360 Score = 54.8 bits (126), Expect = 2e-06 Identities = 30/75 (40%), Positives = 39/75 (52%), Gaps = 3/75 (4%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYG-TDEQGVDYWXREELVGPLWGELGYIKMIRN 472 F+ Y SGVY L+H V VVGYG + G +YW + G WGE GY+++ R Sbjct: 281 FRHYRSGVYAGSAACGRRLNHAVTVVGYGAAADGGGEYWLVKNQWGTWWGEGGYMRVARG 340 Query: 471 --KNNRCGIASSASY 433 CGIA+ A Y Sbjct: 341 GAAGGNCGIATYAFY 355 >UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia ATCC 50803|Rep: GLP_113_4299_5381 - Giardia lamblia ATCC 50803 Length = 360 Score = 54.8 bits (126), Expect = 2e-06 Identities = 26/66 (39%), Positives = 35/66 (53%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469 F Y SGVY + H V ++GYG + G+DYW GP WGE GY +++R Sbjct: 286 FMYYKSGVY-QHRWGLWLGGHAVEIIGYGVTDSGLDYWTVRNSWGPDWGEDGYFRIVRG- 343 Query: 468 NNRCGI 451 + CGI Sbjct: 344 GDECGI 349 >UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin L-like cysteine proteinase precursor - Acanthoscelides obtectus (Bean weevil) Length = 321 Score = 54.8 bits (126), Expect = 2e-06 Identities = 25/58 (43%), Positives = 35/58 (60%) Frame = -1 Query: 606 SSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNRCGIASSASY 433 S DL+HGVL+VGYG YW + G +WGE GY ++ ++ N CG+A+ SY Sbjct: 266 SEKDLNHGVLLVGYGDG-----YWIVKNSWGRIWGEQGYFRLKKDAGNTCGVATWPSY 318 >UniRef50_Q4N071 
Cluster: Cysteine proteinase, putative; n=5; Piroplasmida|Rep: Cysteine proteinase, putative - Theileria parva Length = 460 Score = 54.8 bits (126), Expect = 2e-06 Identities = 33/69 (47%), Positives = 43/69 (62%), Gaps = 3/69 (4%) Frame = -1 Query: 642 LYSSGVYNEEECSSTDLDHGVLVVGYGTDEQ-GVDYWXREELVGPLWGELGYIKMIR-NK 469 +Y +GVYN E C S L+H VL+VG G DE YW + GP WGE GY+++ R NK Sbjct: 387 MYQAGVYNGE-CGSA-LNHAVLLVGEGYDEVLDKRYWVIKNSWGPDWGEDGYLRLERTNK 444 Query: 468 -NNRCGIAS 445 ++CGI S Sbjct: 445 GEDKCGILS 453 >UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; Dictyostelium discoideum|Rep: Cysteine proteinase 1 precursor - Dictyostelium discoideum (Slime mold) Length = 343 Score = 54.8 bits (126), Expect = 2e-06 Identities = 28/75 (37%), Positives = 41/75 (54%), Gaps = 4/75 (5%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDE----QGVDYWXREELVGPLWGELGYIKM 481 +Q Y GV+ + C+ LDHG+L+VGY + + YW + G WGE GYI + Sbjct: 267 WQFYIGGVF-DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYL 325 Query: 480 IRNKNNRCGIASSAS 436 R KN CG+++ S Sbjct: 326 RRGKNT-CGVSNFVS 339 >UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 392 Score = 54.4 bits (125), Expect = 2e-06 Identities = 27/67 (40%), Positives = 39/67 (58%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN 472 S + YS G+ +++ CS+ DH VL++GYG+D GV YW + WG G+IK+ Sbjct: 324 SLKFYSDGIMSDKHCSNKT-DHAVLLIGYGSD-NGVPYWLIKNSWSHKWGNNGFIKI--- 378 Query: 471 KNNRCGI 451 K CGI Sbjct: 379 KQGLCGI 385 >UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foetus|Rep: TFCP2 protein - Tritrichomonas foetus (Trichomonas foetus) Length = 270 Score = 54.0 bits (124), Expect = 3e-06 Identities = 29/76 (38%), Positives = 40/76 (52%), Gaps = 1/76 (1%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDL-DHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKM 481 H SFQLY G+Y C + + +H + +VGYG E +YW G WGE GYI+ Sbjct: 193 
HYSFQLYQGGIYWSWFCRTQYIYNHAMGIVGYGV-EGSEEYWIVRNSWGESWGEQGYIRY 251 Query: 480 IRNKNNRCGIASSASY 433 + +N C IA +Y Sbjct: 252 LLG-SNVCNIADYVTY 266 >UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|Rep: Cruzipain precursor - Trypanosoma cruzi Length = 467 Score = 54.0 bits (124), Expect = 3e-06 Identities = 31/80 (38%), Positives = 42/80 (52%) Frame = -1 Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 +S+ Y+ GV C S LDHGVL+VGY D V YW + WGE GYI++ + Sbjct: 264 SSWMTYTGGVMTS--CVSEQLDHGVLLVGY-NDSAAVPYWIIKNSWTTQWGEEGYIRIAK 320 Query: 474 NKNNRCGIASSASYXXV*TP 415 +N+C + AS V P Sbjct: 321 G-SNQCLVKEEASSAVVGGP 339 >UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sativa|Rep: OSJNBb0085F13.15 protein - Oryza sativa subsp. japonica (Rice) Length = 383 Score = 53.6 bits (123), Expect = 4e-06 Identities = 25/73 (34%), Positives = 43/73 (58%), Gaps = 3/73 (4%) Frame = -1 Query: 648 FQLY-SSGVYNEEECSSTDLDHGVLVVGYGTD--EQGVDYWXREELVGPLWGELGYIKMI 478 FQ Y +GVY ST+++H + +VGYGT+ + G +YW + G LWG+ G++ + Sbjct: 302 FQNYRGNGVYKGGTGCSTNVNHALTIVGYGTNHPDTGENYWIAKNSYGNLWGDNGFVYLA 361 Query: 477 RNKNNRCGIASSA 439 ++ +R G+ A Sbjct: 362 KDTADRTGVCGLA 374 >UniRef50_Q9XZI2 Cluster: Cathepsin Z1 preproprotein; n=3; Bilateria|Rep: Cathepsin Z1 preproprotein - Toxocara canis (Canine roundworm) Length = 307 Score = 53.6 bits (123), Expect = 4e-06 Identities = 23/67 (34%), Positives = 39/67 (58%), Gaps = 1/67 (1%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTD-EQGVDYWXREELVGPLWGELGYIKMIR 475 +F++YS G+Y EE +S ++DH + V G+G D + V YW G WGE G+ +++ Sbjct: 225 AFEMYSGGIYTEE--TSEEIDHIIAVYGWGVDHDSSVPYWIGRNSWGTPWGESGWFRVVT 282 Query: 474 NKNNRCG 454 ++ G Sbjct: 283 SEYKHAG 289 >UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin L-like proteinase" precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 315 Score = 53.6 bits (123), Expect = 4e-06 Identities = 32/74 (43%), 
Positives = 43/74 (58%), Gaps = 1/74 (1%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGV-DYWXREELVGPLWGELGYIKMIR 475 ++QLY G++N + C T+L+HGVL VGY D V + W G WGE GYI++ R Sbjct: 247 TWQLYGGGLFNNKNCR-TNLNHGVLAVGYTKDAFIVKNSW------GTSWGEQGYIRVAR 299 Query: 474 NKNNRCGIASSASY 433 + N CGI SY Sbjct: 300 GE-NLCGINLMNSY 312 >UniRef50_Q23FS6 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 330 Score = 53.6 bits (123), Expect = 4e-06 Identities = 30/76 (39%), Positives = 45/76 (59%) Frame = -1 Query: 642 LYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNN 463 LY+SG+++ C +L+HGVL+VG+ + E W + G WGE GYI++ N Sbjct: 259 LYNSGIFSN--CGQ-NLNHGVLLVGFNSTEGS---WLVKNSWGTSWGEQGYIRLA--DGN 310 Query: 462 RCGIASSASYXXV*TP 415 CG+A++ASY V P Sbjct: 311 TCGLANAASYPTVVPP 326 >UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4; Caenorhabditis|Rep: Cathepsin z protein 1 - Caenorhabditis elegans Length = 306 Score = 53.6 bits (123), Expect = 4e-06 Identities = 24/67 (35%), Positives = 39/67 (58%), Gaps = 1/67 (1%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTD-EQGVDYWXREELVGPLWGELGYIKMIR 475 +F+ Y+ G+Y +E + D+DH + V G+G D E GV+YW G WGE G+ K++ Sbjct: 224 AFETYAGGIY--KEVTDEDIDHIISVHGWGVDHESGVEYWIGRNSWGEPWGEHGWFKIVT 281 Query: 474 NKNNRCG 454 ++ G Sbjct: 282 SQYKNAG 288 >UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa|Rep: Os09g0497500 protein - Oryza sativa subsp. 
japonica (Rice) Length = 349 Score = 53.2 bits (122), Expect = 5e-06 Identities = 36/86 (41%), Positives = 44/86 (51%), Gaps = 14/86 (16%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVD----------YWXREELVGPLWGE 499 FQLY SGVY C++ D++HGV VVGYG E D YW + G WG+ Sbjct: 263 FQLYGSGVYTGP-CTA-DVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGD 320 Query: 498 LGYIKMIRN----KNNRCGIASSASY 433 GYI M R+ + CGIA SY Sbjct: 321 AGYILMQRDVAGLASGLCGIALLPSY 346 >UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n=3; Brugia malayi|Rep: Cathepsin L-like cysteine proteinase - Brugia malayi (Filarial nematode worm) Length = 353 Score = 53.2 bits (122), Expect = 5e-06 Identities = 27/69 (39%), Positives = 42/69 (60%), Gaps = 2/69 (2%) Frame = -1 Query: 651 SFQLYSSGVYNEEEC--SSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 SF Y SG+YN+ +C ++ ++H V+ VGYG + G++Y+ + GP WG+ GY + I Sbjct: 277 SFVAYRSGIYNDPKCPTNAEKVNHAVIAVGYGV-QNGMEYFIIKNSWGPTWGQKGYGR-I 334 Query: 477 RNKNNRCGI 451 R CGI Sbjct: 335 RAGVFMCGI 343 >UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis|Rep: Cysteine protease 2 - Babesia bovis Length = 445 Score = 53.2 bits (122), Expect = 5e-06 Identities = 28/68 (41%), Positives = 39/68 (57%), Gaps = 3/68 (4%) Frame = -1 Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQ-GVDYWXREELVGPLWGELGYIKMIRNK-- 469 YS GV+N E CS ++L+H VL+VG G D YW + G WGE GY ++ R Sbjct: 373 YSGGVFNGE-CSDSELNHAVLLVGEGYDSALKKRYWLLKNSWGTSWGEDGYFRLERTNTP 431 Query: 468 NNRCGIAS 445 ++CG+ S Sbjct: 432 TDKCGVLS 439 >UniRef50_A4VE98 Cluster: Cathepsin z; n=3; Tetrahymena thermophila SB210|Rep: Cathepsin z - Tetrahymena thermophila SB210 Length = 585 Score = 53.2 bits (122), Expect = 5e-06 Identities = 24/63 (38%), Positives = 36/63 (57%), Gaps = 1/63 (1%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQ-GVDYWXREELVGPLWGELGYIKMIRN 472 F+ Y+ G+Y E ++H + VVG+GTD Q GV+YW G WGE G+ ++ + Sbjct: 509 FEAYTGGIYKESTAFPM-INHEIAVVGWGTDPQTGVEYWIGRNSWGTYWGENGFFRIQMH 567 Query: 471 KNN 463 K N Sbjct: 568 KQN 570 
Score = 41.9 bits (94), Expect = 0.013 Identities = 20/58 (34%), Positives = 30/58 (51%) Frame = -1 Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466 Y+ G+YN+ S +H + VVG+G +E YW G WGE G+ + +R N Sbjct: 212 YTGGIYNDTS-SYPGTNHVIEVVGWG-EENNEKYWIIRNSWGSYWGEKGFYRQLRGVN 267 >UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F like protease; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to cathepsin F like protease - Nasonia vitripennis Length = 1036 Score = 52.8 bits (121), Expect = 7e-06 Identities = 28/74 (37%), Positives = 40/74 (54%), Gaps = 7/74 (9%) Frame = -1 Query: 651 SFQLYSSGVYNEEE--CSSTDLDHGVLVVGYGTD-----EQGVDYWXREELVGPLWGELG 493 + Q Y GV + + CS LDHGVL+VGYG ++ + YW + GP WGE G Sbjct: 954 AMQFYMGGVSHPFKFLCSPDSLDHGVLIVGYGVKFYPIFKKTMPYWIIKNSWGPRWGEQG 1013 Query: 492 YIKMIRNKNNRCGI 451 Y ++ R + CG+ Sbjct: 1014 YYRVYRG-DGTCGV 1026 >UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin L preproprotein; n=1; Monodelphis domestica|Rep: PREDICTED: similar to cathepsin L preproprotein - Monodelphis domestica Length = 356 Score = 52.8 bits (121), Expect = 7e-06 Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 17/90 (18%) Frame = -1 Query: 651 SFQLYSSGVYNEEECS---STDLDHGVLVVGYGT------DEQGVD--------YWXREE 523 SF+ Y G Y E C ++++H +LVVGYG +E G+ +W + Sbjct: 264 SFRYYQGGPYIEPRCRLSYMSNMNHALLVVGYGPLERSKYEEFGLQAYMHKDNKFWIAKN 323 Query: 522 LVGPLWGELGYIKMIRNKNNRCGIASSASY 433 G WG+ GYI + +++ N+CGIAS+A+Y Sbjct: 324 SWGEQWGDRGYIYIPKDRYNQCGIASNANY 353 >UniRef50_Q54VR1 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 291 Score = 52.8 bits (121), Expect = 7e-06 Identities = 22/59 (37%), Positives = 38/59 (64%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 +F+ Y+SGV+ S+ +++H + ++G+GT E GVDYW G +GELG+ ++ R Sbjct: 214 AFESYTSGVFTSSVGSTGEINHEISIIGWGT-ENGVDYWIGRNSWGTYFGELGFFRIQR 271 
>UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Trypanosoma cruzi|Rep: Cysteine proteinase, putative - Trypanosoma cruzi Length = 392 Score = 52.8 bits (121), Expect = 7e-06 Identities = 24/69 (34%), Positives = 38/69 (55%), Gaps = 2/69 (2%) Frame = -1 Query: 654 TSFQLYSSGVYNEEECSST-DLDHGVLVVGYGTDEQ-GVDYWXREELVGPLWGELGYIKM 481 T + Y+ G++N + S ++H V +VGYG D + +DYW P WGE GY+++ Sbjct: 288 TYWSAYAGGIFNGCDYSKNITINHVVQLVGYGHDNKLNLDYWILRNSWSPSWGENGYMRL 347 Query: 480 IRNKNNRCG 454 +R CG Sbjct: 348 LRTDKAECG 356 >UniRef50_A7APS9 Cluster: Papain family cysteine protease containing protein; n=1; Babesia bovis|Rep: Papain family cysteine protease containing protein - Babesia bovis Length = 435 Score = 52.8 bits (121), Expect = 7e-06 Identities = 26/78 (33%), Positives = 44/78 (56%), Gaps = 3/78 (3%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 + +Q YSSG+ + C+ +++H V++ G G D+ G +W + G WGE GY+++ Sbjct: 359 NVDWQFYSSGIL--DSCAD-EINHAVVLAGVGQDDDG-PFWLIKNSWGTSWGEEGYVRLA 414 Query: 477 RNK---NNRCGIASSASY 433 R +N CG+A A Y Sbjct: 415 RGSSAFDNECGLAHMALY 432 >UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_184, whole genome shotgun sequence - Paramecium tetraurelia Length = 331 Score = 52.8 bits (121), Expect = 7e-06 Identities = 32/77 (41%), Positives = 42/77 (54%), Gaps = 5/77 (6%) Frame = -1 Query: 648 FQLYSSGVYNEEECSST--DLDHGVLVVGYGTDEQGV-DYWXREELVGPLWGELGYIK-- 484 FQLYS GVY+ + T DL+HGVL VGY D + + W G WGE GY++ Sbjct: 258 FQLYSGGVYSRSCTAKTIDDLNHGVLAVGYAKDSYTIKNSW------GASWGEKGYMRLG 311 Query: 483 MIRNKNNRCGIASSASY 433 ++ K +CGI SY Sbjct: 312 LVAAKEGQCGIHWVPSY 328 >UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia ATCC 50803|Rep: GLP_567_6496_7413 - Giardia lamblia ATCC 50803 Length = 305 Score = 52.4 bits (120), Expect = 9e-06 Identities = 28/73 (38%), Positives = 38/73 (52%) Frame = -1 Query: 657 
HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 H F Y G+Y++ +S H VL+VGYG+ DYW G WGE GY +++ Sbjct: 230 HEDFLYYVGGIYHKVYGTSLG-GHAVLIVGYGSMNNH-DYWIVRNSWGSDWGENGYFRIL 287 Query: 477 RNKNNRCGIASSA 439 R N CGI +A Sbjct: 288 RG-TNECGIEKNA 299 >UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrovirus|Rep: Viral cathepsin - Lymantria dispar multicapsid nuclear polyhedrosis virus (LdMNPV) Length = 356 Score = 52.4 bits (120), Expect = 9e-06 Identities = 27/65 (41%), Positives = 36/65 (55%) Frame = -1 Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNR 460 Y GV + C + L+H VL+VGYG E GV YW + G WGE GY + +R N Sbjct: 287 YYRGVISS--CENNGLNHAVLLVGYGV-ENGVPYWVFKNTWGDDWGENGYFR-VRQNVNA 342 Query: 459 CGIAS 445 CG+ + Sbjct: 343 CGMVN 347 >UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|Rep: Cathepsin L precursor - Schistosoma mansoni (Blood fluke) Length = 319 Score = 52.4 bits (120), Expect = 9e-06 Identities = 25/58 (43%), Positives = 33/58 (56%) Frame = -1 Query: 609 CSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNRCGIASSAS 436 CS LDH VL+VGYG E+ +W + G WGE GY +M R + CGI + A+ Sbjct: 258 CSKYLLDHAVLLVGYGVSEKNEPFWIVKNSWGVEWGENGYFRMYRG-DGSCGINTVAT 314 >UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyledons|Rep: At1g02300/T6A9_10 - Arabidopsis thaliana (Mouse-ear cress) Length = 362 Score = 52.0 bits (119), Expect = 1e-05 Identities = 27/70 (38%), Positives = 37/70 (52%), Gaps = 1/70 (1%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLD-HGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKM 481 + F Y SGVY + + T++ H V ++G+GT + G DYW WG+ GY K Sbjct: 267 YEDFAHYKSGVY--KHITGTNIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFK- 323 Query: 480 IRNKNNRCGI 451 IR N CGI Sbjct: 324 IRRGTNECGI 333 >UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa (japonica cultivar-group)|Rep: Os09g0562700 protein - Oryza sativa subsp. 
japonica (Rice) Length = 235 Score = 52.0 bits (119), Expect = 1e-05 Identities = 34/85 (40%), Positives = 44/85 (51%), Gaps = 12/85 (14%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVD--------YWXREELVGPLWGEL 496 +FQ Y GVY + C T L+HGV VVGYG +E D YW + G WG+ Sbjct: 150 NFQHYRKGVY-DGPCG-TRLNHGVTVVGYGQEEAAADGGAAGGDKYWIIKNSWGKNWGDQ 207 Query: 495 GYIKMIRNKNNR----CGIASSASY 433 GYIKM ++ + CGIA S+ Sbjct: 208 GYIKMKKDVAGKPEGLCGIAIRPSF 232 >UniRef50_Q015J8 Cluster: Cathepsin; n=2; Ostreococcus|Rep: Cathepsin - Ostreococcus tauri Length = 556 Score = 52.0 bits (119), Expect = 1e-05 Identities = 22/64 (34%), Positives = 35/64 (54%), Gaps = 5/64 (7%) Frame = -1 Query: 645 QLYSSGVYNEEECSS-----TDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKM 481 Q Y GV ++C + ++H VLVVG+G + G+ YW + GP WG+ G+ K+ Sbjct: 315 QAYDDGVIMMDDCHPLGRGISSINHAVLVVGWGVTKDGIKYWELKNSYGPKWGDQGFFKL 374 Query: 480 IRNK 469 R + Sbjct: 375 ERGR 378 >UniRef50_Q01FU9 Cluster: Cathepsin Z; n=2; Ostreococcus|Rep: Cathepsin Z - Ostreococcus tauri Length = 387 Score = 51.6 bits (118), Expect = 2e-05 Identities = 22/58 (37%), Positives = 34/58 (58%) Frame = -1 Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466 Y G+Y ++ S +++H V +VG+GT + G YW G WGE+GY ++IR N Sbjct: 278 YVGGIY--KDTPSFEINHIVSIVGWGTAKDGTKYWIVRNSWGQYWGEMGYFRIIRGVN 333 >UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35; Viridiplantae|Rep: Cysteine proteinase 15A precursor - Pisum sativum (Garden pea) Length = 363 Score = 51.6 bits (118), Expect = 2e-05 Identities = 30/76 (39%), Positives = 39/76 (51%), Gaps = 6/76 (7%) Frame = -1 Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDE------QGVDYWXREELVGPLWGELGYIK 484 Q Y SGV C+ + LDHGVL+VG+G + YW + G WGE GY K Sbjct: 280 QTYMSGVSCPYVCAKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIIKNSWGQNWGEQGYYK 339 Query: 483 MIRNKNNRCGIASSAS 436 + R + N CG+ S S Sbjct: 340 ICRGR-NVCGVDSMVS 354 >UniRef50_Q9BL26 Cluster: Putative uncharacterized protein; n=4; Caenorhabditis elegans|Rep: Putative uncharacterized 
protein - Caenorhabditis elegans Length = 345 Score = 51.2 bits (117), Expect = 2e-05 Identities = 26/68 (38%), Positives = 40/68 (58%), Gaps = 2/68 (2%) Frame = -1 Query: 639 YSSGVYNE--EECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466 Y G+YN EEC+ST +++VGYG + + YW + G WGE GY+K+ R+ Sbjct: 226 YKIGIYNPSIEECTSTHEIRSMVIVGYGIEGEQ-KYWIVKGSFGTSWGEQGYMKLARDV- 283 Query: 465 NRCGIASS 442 N C +A++ Sbjct: 284 NACAMATT 291 >UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypanosoma cruzi|Rep: Cysteine protease, putative - Trypanosoma cruzi Length = 434 Score = 51.2 bits (117), Expect = 2e-05 Identities = 24/75 (32%), Positives = 43/75 (57%), Gaps = 3/75 (4%) Frame = -1 Query: 654 TSFQLYSSGVYNE--EECSSTDLDHGVLVVGYGTDEQ-GVDYWXREELVGPLWGELGYIK 484 + + Y+ GV++ ++ + + H V +VGYGTD + DYW G WGE G+I+ Sbjct: 275 SDWMFYTGGVFDGCGKDGENITISHAVQLVGYGTDNKTNQDYWVVRNSWGEGWGENGFIR 334 Query: 483 MIRNKNNRCGIASSA 439 ++R K+N + ++A Sbjct: 335 LLRKKHNELCVFNNA 349 >UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=1; Biomphalaria glabrata|Rep: Cathepsin B preproprotein precursor - Biomphalaria glabrata (Bloodfluke planorb) Length = 333 Score = 51.2 bits (117), Expect = 2e-05 Identities = 27/72 (37%), Positives = 38/72 (52%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 ++ F Y +GVY S + H V ++GYGT E G DYW WG+ G+ K+ Sbjct: 259 YSDFLSYKTGVYRHTT-GSYEGGHAVKIIGYGT-ESGQDYWLVANSWNEDWGDKGFFKIA 316 Query: 477 RNKNNRCGIASS 442 + K + CGI SS Sbjct: 317 KGK-DECGIESS 327 >UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; Theileria|Rep: Cysteine proteinase precursor - Theileria annulata Length = 441 Score = 51.2 bits (117), Expect = 2e-05 Identities = 27/68 (39%), Positives = 42/68 (61%), Gaps = 3/68 (4%) Frame = -1 Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTD-EQGVDYWXREELVGPLWGELGYIKMIRNK 469 +LYS G++ + C +L+H VL+VG G D E G+ YW + G WGE G++++ R K Sbjct: 364 KLYSGGIFTGK-CGG-ELNHAVLLVGEGVDHETGMRYWIIKNSWGEDWGENGFLRLQRTK 421 Query: 468 
N--NRCGI 451 ++CGI Sbjct: 422 KGLDKCGI 429 >UniRef50_A2WPN4 Cluster: Putative uncharacterized protein; n=2; Oryza sativa (indica cultivar-group)|Rep: Putative uncharacterized protein - Oryza sativa subsp. indica (Rice) Length = 325 Score = 50.8 bits (116), Expect = 3e-05 Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 3/78 (3%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN- 472 FQ Y GVY + C+ ++H V +VGY + G YW + WGE GY+ + ++ Sbjct: 249 FQFYKGGVY-KGPCNPGSVNHAVTIVGYCENFGGEKYWIAKNSWSNDWGEQGYVYLAKDV 307 Query: 471 --KNNRCGIASSASYXXV 424 CG+A+S Y V Sbjct: 308 WWPQGTCGLATSPFYPTV 325 >UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba histolytica|Rep: Cysteine protease 10 - Entamoeba histolytica Length = 297 Score = 50.8 bits (116), Expect = 3e-05 Identities = 23/50 (46%), Positives = 29/50 (58%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWG 502 SFQ Y G+Y+E C +DH V VVGYGT E+ D+W + G WG Sbjct: 250 SFQFYEGGIYDEPNCKW--VDHIVTVVGYGTTEEHQDFWVVKNSYGNEWG 297 >UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - Drosophila melanogaster (Fruit fly) Length = 431 Score = 50.8 bits (116), Expect = 3e-05 Identities = 26/68 (38%), Positives = 34/68 (50%), Gaps = 2/68 (2%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLD--HGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 F YS GVY E + H V +VG+G + G YW G WGE GY +++R Sbjct: 345 FFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGYFRILR 404 Query: 474 NKNNRCGI 451 +N CGI Sbjct: 405 G-SNECGI 411 >UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep: Cysteine proteinase - Cryptobia salmositica Length = 443 Score = 50.8 bits (116), Expect = 3e-05 Identities = 25/73 (34%), Positives = 41/73 (56%) Frame = -1 Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 +++Q Y+ G+ + C +DHGVL+VG+ D YW + WGE GYI++ + Sbjct: 257 STWQSYAGGIMSY--CPQDQIDHGVLIVGF-DDTASTPYWIIKNSWTANWGEEGYIRVAK 313 Query: 474 NKNNRCGIASSAS 436 +N+CG+ S S Sbjct: 314 
G-SNQCGLTSHPS 325 >UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 395 Score = 50.4 bits (115), Expect = 4e-05 Identities = 27/68 (39%), Positives = 41/68 (60%) Frame = -1 Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 T+FQ Y+ G+Y+ E D++H VL+VGY ++ D W + +G WGELGY + I Sbjct: 322 TAFQSYAGGIYDSVE-EYKDVNHIVLLVGY---DKPTDSWKIKNSLGTKWGELGYAR-IT 376 Query: 474 NKNNRCGI 451 N++ GI Sbjct: 377 ASNDKLGI 384 >UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3.4.14.1) (Dipeptidyl-peptidase I) (DPP-I) (DPPI) (Cathepsin C) (Cathepsin J) (Dipeptidyl transferase) [Contains: Dipeptidyl-peptidase 1 exclusion domain chain (Dipeptidyl- peptidase I exclusion domain chain); Dipeptidyl-peptidase 1 heavy chain (Dipeptidyl-peptidase I heavy chain); Dipeptidyl-peptidase 1 light chain (Dipeptidyl-peptidase I light chain)]; n=50; Coelomata|Rep: Dipeptidyl-peptidase 1 precursor (EC 3.4.14.1) (Dipeptidyl-peptidase I) (DPP-I) (DPPI) (Cathepsin C) (Cathepsin J) (Dipeptidyl transferase) [Contains: Dipeptidyl-peptidase 1 exclusion domain chain (Dipeptidyl- peptidase I exclusion domain chain); Dipeptidyl-peptidase 1 heavy chain (Dipeptidyl-peptidase I heavy chain); Dipeptidyl-peptidase 1 light chain (Dipeptidyl-peptidase I light chain)] - Homo sapiens (Human) Length = 463 Score = 50.4 bits (115), Expect = 4e-05 Identities = 30/79 (37%), Positives = 42/79 (53%), Gaps = 6/79 (7%) Frame = -1 Query: 657 HTSFQLYSSGVYNE----EECSSTDL-DHGVLVVGYGTDE-QGVDYWXREELVGPLWGEL 496 + F Y G+Y+ + + +L +H VL+VGYGTD G+DYW + G WGE Sbjct: 377 YDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEN 436 Query: 495 GYIKMIRNKNNRCGIASSA 439 GY + IR + C I S A Sbjct: 437 GYFR-IRRGTDECAIESIA 454 >UniRef50_UPI00006CBCFB Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing 
protein - Tetrahymena thermophila SB210 Length = 393 Score = 50.0 bits (114), Expect = 5e-05 Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 4/80 (5%) Frame = -1 Query: 651 SFQLYSSGVYNEEECS--STDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 S Q+Y SG+Y + CS D +H V++VGY ++ Y+ GP WGE G+ K+ Sbjct: 318 SLQMYGSGIY-DFPCSIDRNDANHAVVIVGYTSE-----YFLIRNSWGPHWGEEGHFKVR 371 Query: 477 RNKNNR--CGIASSASYXXV 424 + NN+ CG+ + SY + Sbjct: 372 KESNNKGTCGLYNDMSYPYI 391 >UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Cathepsin B - Fasciola gigantica (Giant liver fluke) Length = 339 Score = 50.0 bits (114), Expect = 5e-05 Identities = 26/68 (38%), Positives = 36/68 (52%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469 F +Y SG+Y+ H V ++G+G E GV+YW WGE GY +M+R + Sbjct: 266 FGVYRSGIYHHVAGKFIGR-HAVRMIGWGV-ENGVNYWLMANSWNEEWGENGYFRMVRGR 323 Query: 468 NNRCGIAS 445 N CGI S Sbjct: 324 -NECGIES 330 >UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n=4; Tenebrionidae|Rep: Putative cathepsin B-like proteinase - Tenebrio molitor (Yellow mealworm) Length = 321 Score = 50.0 bits (114), Expect = 5e-05 Identities = 28/66 (42%), Positives = 34/66 (51%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469 F Y SGVY S H V +VG+G E GV YW G WG+ G+ KM+R + Sbjct: 248 FYNYVSGVYRHVSGESVGF-HVVKIVGWGV-ENGVPYWLIANSWGSSWGDHGFFKMLRGQ 305 Query: 468 NNRCGI 451 N CGI Sbjct: 306 -NECGI 310 >UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa zeasingle nucleocapsid nuclear polyhedrosis virus) Length = 367 Score = 50.0 bits (114), Expect = 5e-05 Identities = 25/63 (39%), Positives = 37/63 (58%) Frame = -1 Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNR 460 Y G+ N+ C DL+H VL++G+G E V YW + G WGE G++++ RN N Sbjct: 298 YRRGILNQ--CHIYDLNHAVLLIGWGI-ENNVPYWIIKNSWGEDWGENGFLRVRRNV-NA 353 Query: 459 CGI 451 
CG+ Sbjct: 354 CGL 356 >UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep: Viral cathepsin - Xestia c-nigrum granulosis virus (XnGV) (Xestia c-nigrumgranulovirus) Length = 346 Score = 50.0 bits (114), Expect = 5e-05 Identities = 29/64 (45%), Positives = 38/64 (59%), Gaps = 1/64 (1%) Frame = -1 Query: 639 YSSGVYNEEECS-STDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNN 463 Y SGV + CS L+HGVL+VGYG E V YW + G WGE G+ ++ R+ N+ Sbjct: 273 YKSGV--AKHCSVDHGLNHGVLLVGYG-QENDVKYWTLKNSWGSDWGEQGFFRIKRDVNS 329 Query: 462 RCGI 451 CGI Sbjct: 330 -CGI 332 >UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: Cathepsin B - Triticum aestivum (Wheat) Length = 353 Score = 49.6 bits (113), Expect = 6e-05 Identities = 26/66 (39%), Positives = 34/66 (51%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469 F Y SGVY + H V ++G+GT + G DYW WG+ GY K+IR + Sbjct: 263 FAHYKSGVY-KHITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGE 321 Query: 468 NNRCGI 451 N CGI Sbjct: 322 -NECGI 326 >UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoidea|Rep: Cathepsin B-S precursor - Tuberaphis styraci Length = 349 Score = 49.6 bits (113), Expect = 6e-05 Identities = 21/69 (30%), Positives = 36/69 (52%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 + F +Y SG+Y + + + H + ++G+G +E G YW WG+ G K+I Sbjct: 256 YDDFSVYKSGIYRKTPKAKYEGGHSIKIIGWG-EENGTPYWLAVNSWSKFWGDHGTFKII 314 Query: 477 RNKNNRCGI 451 + + N CGI Sbjct: 315 KGR-NECGI 322 >UniRef50_Q22RR8 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 49.6 bits (113), Expect = 6e-05 Identities = 30/79 (37%), Positives = 42/79 (53%), Gaps = 2/79 (2%) Frame = -1 Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 T++Q Y G +N+ C +L+HGVL+VGY + W + G WGE GYI++ Sbjct: 260 
TNWQYYEFGTFND--CFD-NLNHGVLLVGYNSK---THQWKVKNSWGTSWGEDGYIRLGA 313 Query: 474 NKN--NRCGIASSASYXXV 424 + N CGI ASY V Sbjct: 314 STKYLNTCGICEQASYPIV 332 >UniRef50_Q22NW9 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 894 Score = 49.6 bits (113), Expect = 6e-05 Identities = 31/72 (43%), Positives = 43/72 (59%), Gaps = 1/72 (1%) Frame = -1 Query: 645 QLYSSGVYNEEEC-SSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469 Q Y SG+ + C SS +L+HGVL+VGY T+ D++ + G WGE GY ++ K Sbjct: 825 QRYHSGIIGD--CGSSVNLNHGVLIVGY-TE----DFFIVKNSWGTNWGEDGYFRI--TK 875 Query: 468 NNRCGIASSASY 433 N CGI +ASY Sbjct: 876 TNTCGICEAASY 887 >UniRef50_A0DIY3 Cluster: Chromosome undetermined scaffold_52, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_52, whole genome shotgun sequence - Paramecium tetraurelia Length = 512 Score = 49.6 bits (113), Expect = 6e-05 Identities = 23/59 (38%), Positives = 33/59 (55%) Frame = -1 Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNN 463 Y G ++ + T L+H V VVG+G E GV+YW G WG++GY KM + +N Sbjct: 441 YEGGYIFSQKTNKTILNHYVSVVGWGV-EDGVEYWIVRNSWGSYWGDMGYAKMKMHSDN 498 >UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Plasmodium (Vinckeia)|Rep: Cysteine proteinase precursor - Plasmodium vinckei Length = 506 Score = 36.3 bits (80), Expect(2) = 7e-05 Identities = 17/45 (37%), Positives = 24/45 (53%), Gaps = 3/45 (6%) Frame = -1 Query: 558 DEQGVDYWXREELVGPLWGELGYIKMIRNK---NNRCGIASSASY 433 D+ + YW GP WGE GYI++ RNK + CG+ S + Sbjct: 459 DDDIIYYWIVRNSWGPNWGEGGYIRIKRNKAGDDGFCGVGSDVFF 503 Score = 32.7 bits (71), Expect(2) = 7e-05 Identities = 16/29 (55%), Positives = 22/29 (75%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYG 562 F LYS GV+ + EC+ +L+H VL+VGYG Sbjct: 401 FVLYSGGVF-DGECNP-ELNHSVLLVGYG 427 >UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1; 
Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 1 - Rhipicephalus appendiculatus (Brown ear tick) Length = 332 Score = 49.2 bits (112), Expect = 9e-05 Identities = 25/69 (36%), Positives = 36/69 (52%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 + F Y SGVY + + H + ++G+GT E GV YW WG+ GY K++ Sbjct: 254 YADFPSYKSGVYQQHMIKFMGV-HAIKILGWGT-EDGVPYWLVANSWNVGWGDKGYFKIL 311 Query: 477 RNKNNRCGI 451 R K + CGI Sbjct: 312 RGK-DECGI 319 >UniRef50_Q7JYA0 Cluster: RE20049p; n=2; Sophophora|Rep: RE20049p - Drosophila melanogaster (Fruit fly) Length = 340 Score = 49.2 bits (112), Expect = 9e-05 Identities = 26/64 (40%), Positives = 36/64 (56%), Gaps = 3/64 (4%) Frame = -1 Query: 648 FQLYSSGVYNEEECSSTDLDHG--VLVVGYGTD-EQGVDYWXREELVGPLWGELGYIKMI 478 F YSSGVY +E + T+ ++VVGY D + +DYW G WGE GYI+++ Sbjct: 263 FMQYSSGVYVQETRALTNPKSSQFLVVVGYDHDVDSNLDYWRCLNSFGDTWGEEGYIRIV 322 Query: 477 RNKN 466 R N Sbjct: 323 RRSN 326 >UniRef50_Q4UCF5 Cluster: Cysteine proteinase, tacP, putative; n=8; Theileria|Rep: Cysteine proteinase, tacP, putative - Theileria annulata Length = 498 Score = 49.2 bits (112), Expect = 9e-05 Identities = 28/74 (37%), Positives = 41/74 (55%), Gaps = 3/74 (4%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVD-YWXREELVGPLWGELGYIKM 481 H F Y G+Y + C+ +L+H VL+VG G DE+ YW + G WGE GY ++ Sbjct: 367 HREFLSYKGGLY-DGPCAK-NLNHYVLLVGEGYDEETKSRYWIIKNTFGQSWGENGYARI 424 Query: 480 IR--NKNNRCGIAS 445 +R K ++C I S Sbjct: 425 VRTDEKFDKCDILS 438 >UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; Eukaryota|Rep: Cysteine proteinase 1 precursor - Zea mays (Maize) Length = 371 Score = 48.8 bits (111), Expect = 1e-04 Identities = 31/78 (39%), Positives = 37/78 (47%), Gaps = 8/78 (10%) Frame = -1 Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDE------QGVDYWXREELVGPLWGELGYIK 484 Q Y GV C LDHGVL+VGYG + YW + G WGE GY K Sbjct: 285 QTYIGGVSCPYICGR-HLDHGVLLVGYGASGFAPIRLKDKPYWIIKNSWGENWGENGYYK 343 Query: 483 MIRNKN--NRCGIASSAS 436 + R 
N N+CG+ S S Sbjct: 344 ICRGSNVRNKCGVDSMVS 361 >UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma|Rep: Cathepsin C precursor - Schistosoma mansoni (Blood fluke) Length = 454 Score = 48.8 bits (111), Expect = 1e-04 Identities = 26/80 (32%), Positives = 39/80 (48%), Gaps = 9/80 (11%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLD--------HGVLVVGYGTDE-QGVDYWXREELVGPLW 505 + FQ Y G+Y+ + + H VL+VGYG D+ G YW + G W Sbjct: 367 YEDFQFYKEGIYHHTTVQTDHYNFNPFELTNHAVLLVGYGVDKLSGEPYWKVKNSWGVEW 426 Query: 504 GELGYIKMIRNKNNRCGIAS 445 GE GY +++R + CG+ S Sbjct: 427 GEQGYFRILRG-TDECGVES 445 >UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativa|Rep: Os01g0347600 protein - Oryza sativa subsp. japonica (Rice) Length = 343 Score = 48.4 bits (110), Expect = 1e-04 Identities = 29/80 (36%), Positives = 40/80 (50%), Gaps = 4/80 (5%) Frame = -1 Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTD-EQGVDYWXREELVGPLWGELGYI---K 484 +FQ Y SGV+ C ++ +H V +VGY D G YW + G WG+ GYI K Sbjct: 266 AFQFYKSGVF-PGPCGASS-NHAVTLVGYCQDGASGKKYWVAKNSWGKTWGQQGYILLEK 323 Query: 483 MIRNKNNRCGIASSASYXXV 424 + + CG+A S Y V Sbjct: 324 DVLQPHGTCGLAVSPFYPTV 343 >UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep: Cathepsin B - Uronema marinum Length = 350 Score = 48.4 bits (110), Expect = 1e-04 Identities = 26/71 (36%), Positives = 36/71 (50%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 ++ F YSSGVY S H + ++G+G E G YW WGE G+ K++ Sbjct: 272 YSDFLTYSSGVYQNTSGSYMG-GHAIKMLGWGV-ENGTPYWLCANSWNSSWGENGFFKIL 329 Query: 477 RNKNNRCGIAS 445 R +N CGI S Sbjct: 330 RG-SNECGIES 339 >UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|Rep: Cathepsin B - Streblomastix strix Length = 283 Score = 48.4 bits (110), Expect = 1e-04 Identities = 25/76 (32%), Positives = 41/76 (53%), Gaps = 1/76 (1%) Frame = -1 Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478 ++ F Y SGVY + + H VL+VG+G +++ V YW + G WGE G+ K++ Sbjct: 205 
YSDFMSYKSGVY-VHQAGYIEGGHAVLIVGWGVEDE-VPYWLVQNSWGTDWGENGFFKIL 262 Query: 477 RNKNN-RCGIASSASY 433 R ++ C +A Y Sbjct: 263 RGSDHCECESNVTAGY 278 >UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; Leishmania|Rep: Cysteine proteinase 2 precursor - Leishmania pifanoi Length = 444 Score = 48.4 bits (110), Expect = 1e-04 Identities = 25/63 (39%), Positives = 34/63 (53%) Frame = -1 Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475 +SF Y SGV C L+HGVL+VGY + V YW + G WGE GY++++ Sbjct: 269 SSFMSYKSGVLTA--CIGKQLNHGVLLVGYDMTGE-VPYWVIKNSWGGDWGEQGYVRVVM 325 Query: 474 NKN 466 N Sbjct: 326 GVN 328 >UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza sativa|Rep: Putative cysteine protease - Oryza sativa subsp. japonica (Rice) Length = 357 Score = 48.0 bits (109), Expect = 2e-04 Identities = 28/81 (34%), Positives = 40/81 (49%), Gaps = 5/81 (6%) Frame = -1 Query: 651 SFQLYSSGVY-NEEECSSTDLDHGVLVVGYGTD-EQGVDYWXREELVGPLWGELGYI--- 487 +FQ Y SGV+ ++ +H V +VGY D G YW + G WG+ GYI Sbjct: 277 AFQFYGSGVFPGPRGTAAPKPNHAVTLVGYCQDGASGKKYWIAKNSWGKTWGQQGYILLE 336 Query: 486 KMIRNKNNRCGIASSASYXXV 424 K + + + CG+A S Y V Sbjct: 337 KDVASPHGTCGLAVSPFYPTV 357 >UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative uncharacterized protein - Oryza sativa subsp. 
japonica (Rice)
Length = 289
Score = 48.0 bits (109), Expect = 2e-04
Identities = 28/82 (34%), Positives = 40/82 (48%), Gaps = 4/82 (4%)
Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTD-EQGVDYWXREELVGPLWGELGYI-- 487
           H ++ Y SGV+ C ++ +H V +VGY D G YW + G WG+ GYI
Sbjct: 210 HATYPFYKSGVF-PGPCGASS-NHAVTLVGYCQDGASGKKYWVAKNSWGKTWGQQGYILL 267

Query: 486 -KMIRNKNNRCGIASSASYXXV 424
           K + + CG+A S Y V
Sbjct: 268 EKDVLQPHGTCGLAVSPFYPTV 289

>UniRef50_Q26988 Cluster: Cysteine proteinase 4; n=1; Tritrichomonas foetus|Rep: Cysteine proteinase 4 - Tritrichomonas foetus (Trichomonas foetus)
Length = 152
Score = 48.0 bits (109), Expect = 2e-04
Identities = 19/32 (59%), Positives = 24/32 (75%)
Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTD 556
           SF YSSG+YN+ +CSST LDH V +GYG +
Sbjct: 118 SFNSYSSGIYNDRQCSSTVLDHAVGCIGYGAE 149

>UniRef50_Q23H15 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210
Length = 370
Score = 48.0 bits (109), Expect = 2e-04
Identities = 27/63 (42%), Positives = 36/63 (57%)
Frame = -1

Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNR 460
           Y SG++N + S L+H VL VGY D+QG W + GP WGE GY+++ NN
Sbjct: 305 YQSGIFNGCDQSLIILNHAVLAVGY--DKQG--NWIVKNSWGPYWGENGYMRLA--PNNT 358

Query: 459 CGI 451
           C I
Sbjct: 359 CSI 361

>UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathepsin o - Aedes aegypti (Yellowfever mosquito)
Length = 375
Score = 48.0 bits (109), Expect = 2e-04
Identities = 30/75 (40%), Positives = 44/75 (58%), Gaps = 2/75 (2%)
Frame = -1

Query: 651 SFQLYSSGV--YNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           S++ Y GV Y+ EE DL+H V +VGY + Q + Y+ + GP +G+ GYIK I
Sbjct: 300 SWKYYLGGVIQYHCEEAYE-DLNHAVEIVGYNLESQ-IPYYLVKNSWGPRFGDRGYIK-I 356

Query: 477 RNKNNRCGIASSASY 433
           + N CGIA+ S+
Sbjct: 357 QVGKNLCGIANRVSF 371

>UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestivum|Rep: Cysteine protease - Triticum aestivum (Wheat)
Length = 371
Score = 47.6 bits (108), Expect = 3e-04
Identities = 26/71 (36%), Positives = 35/71 (49%)
Frame = -1

Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466
           Q Y SGVY C+ T +H V VVGYG G +YW + G WG+ G+ + R +
Sbjct: 297 QDYKSGVYRGP-CT-TSQNHVVTVVGYGVTGAGEEYWIAKNSWGQTWGQKGFFFVRRGAD 354

Query: 465 NRCGIASSASY 433
           G+ A Y
Sbjct: 355 GPRGLCGIAMY 365

>UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans
Length = 343
Score = 47.6 bits (108), Expect = 3e-04
Identities = 23/65 (35%), Positives = 37/65 (56%)
Frame = -1

Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN 466
           + Y SG+ + +C T+ H ++V+GYG D YW + +WGE GY+++ R+
Sbjct: 276 RFYHSGIAEDPDCG-TEPTHALIVIGYGPD-----YWILKNTYSKVWGEKGYMRVKRDV- 328

Query: 465 NRCGI 451
           N CGI
Sbjct: 329 NWCGI 333

>UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|Rep: Phospholipase A1 - Tetrahymena thermophila
Length = 320
Score = 47.6 bits (108), Expect = 3e-04
Identities = 29/78 (37%), Positives = 45/78 (57%), Gaps = 1/78 (1%)
Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGV-DYWXREELVGPLWGELGYIKMI 478
           T+FQ Y+SGV+ + C + +L+HGVL+V + + W GP WGE G+I++
Sbjct: 254 TNFQFYTSGVF--KNCKA-NLNHGVLLVANVDSSLKIKNSW------GPSWGEKGFIRLA 304

Query: 477 RNKNNRCGIASSASYXXV 424
           N CG+ ++ASY V
Sbjct: 305 --AGNTCGVCNAASYPIV 320

>UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8.4; n=1; Caenorhabditis elegans|Rep: Putative uncharacterized protein W07B8.4 - Caenorhabditis elegans
Length = 335
Score = 47.6 bits (108), Expect = 3e-04
Identities = 24/73 (32%), Positives = 36/73 (49%)
Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           + F LY +G+Y H V ++G+G D G YW +WGE GY +++
Sbjct: 255 YEDFYLYKTGIYTHVAGGELG-GHAVKMLGWGVDN-GTPYWLAANSWNTVWGEKGYFRIL 312

Query: 477 RNKNNRCGIASSA 439
           R + CGI S+A
Sbjct: 313 RGV-DECGIESAA 324

>UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; Leishmania|Rep: Cysteine proteinase 1 precursor - Leishmania pifanoi
Length = 354
Score = 47.6 bits (108), Expect = 3e-04
Identities = 24/63 (38%), Positives = 36/63 (57%)
Frame = -1

Query: 654 TSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           T++QLY GV + C + L+HGVL+VG+ + + YW + G WGE GYI++
Sbjct: 269 TTWQLYFGGVVSL--CLAWSLNHGVLIVGFNKNAKP-PYWIVKNSWGSSWGEKGYIRLAM 325

Query: 474 NKN 466
           N
Sbjct: 326 GSN 328

>UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryota|Rep: Cysteine proteinase GP-I - Zingiber officinale (Ginger)
Length = 221
Score = 47.6 bits (108), Expect = 3e-04
Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 3/75 (4%)
Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRN- 472
           FQLY +G++ C+ + +H V G T E DYW + G WGE GYI++ RN
Sbjct: 143 FQLYRNGIFTGS-CNIS-ANHYRTVGGRET-ENDKDYWTVKNSWGKNWGESGYIRVERNI 199

Query: 471 --KNNRCGIASSASY 433
           + +CGIA S SY
Sbjct: 200 AESSGKCGIAISPSY 214

>UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep: Viral cathepsin - Cydia pomonella granulosis virus (CpGV) (Cydia pomonellagranulovirus)
Length = 333
Score = 47.6 bits (108), Expect = 3e-04
Identities = 25/63 (39%), Positives = 39/63 (61%)
Frame = -1

Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNR 460
           Y +G+ + E ++ L+H VL+VGYG + V YW + G WGE GY ++ R+KN+
Sbjct: 264 YKAGIADICE-NNEGLNHAVLLVGYGV-KNDVPYWILKNSWGAEWGEEGYFRVQRDKNS- 320

Query: 459 CGI 451
           CG+
Sbjct: 321 CGM 323

>UniRef50_Q84SA7 Cluster: Thiol protease; n=1; Aster tripolium|Rep: Thiol protease - Aster tripolium (Sea aster)
Length = 188
Score = 47.2 bits (107), Expect = 3e-04
Identities = 30/76 (39%), Positives = 37/76 (48%), Gaps = 6/76 (7%)
Frame = -1

Query: 645 QLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVD------YWXREELVGPLWGELGYIK 484
           Q Y V CS LDHGVL+VGYG+ YW + GP WGE GY K
Sbjct: 107 QTYIGKVSCPYVCSKKPLDHGVLLVGYGSAGYAPSRLKEKPYWIIKNSWGPDWGEDGYYK 166

Query: 483 MIRNKNNRCGIASSAS 436
           I + +N CG+ + S
Sbjct: 167 -ICSGHNLCGMDTMVS 181

>UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomuscorum|Rep: Cathepsin B - Oxytricha trifallax (Sterkiella histriomuscorum)
Length = 294
Score = 47.2 bits (107), Expect = 3e-04
Identities = 27/71 (38%), Positives = 36/71 (50%), Gaps = 2/71 (2%)
Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDL--DHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIK 484
           +T F Y SGVY ++TD+ H + ++GYG E G YW GP WG G+ K
Sbjct: 220 YTDFFNYQSGVYTP---TTTDVAGGHAIKILGYGV-ENGTPYWLCANSWGPAWGMSGFFK 275

Query: 483 MIRNKNNRCGI 451
           + K CGI
Sbjct: 276 I---KQGECGI 283

>UniRef50_Q7QQ92 Cluster: GLP_243_18349_20043; n=1; Giardia lamblia ATCC 50803|Rep: GLP_243_18349_20043 - Giardia lamblia ATCC 50803
Length = 564
Score = 47.2 bits (107), Expect = 3e-04
Identities = 24/64 (37%), Positives = 30/64 (46%)
Frame = -1

Query: 648 FQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           F Y G++N+ CS T LDH V+ GYG QGV+ W G WG G+
Sbjct: 377 FSNYKGGIFNKP-CSKTGLDHQVMFAGYGY-YQGVEVWVMRNSWGEQWGSYGHFYTPIGN 434

Query: 468 NNRC 457
           N C
Sbjct: 435 NVLC 438

>UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2; Theileria|Rep: Cysteine protease, tacP, putative - Theileria annulata
Length = 461
Score = 47.2 bits (107), Expect = 3e-04
Identities = 28/70 (40%), Positives = 42/70 (60%), Gaps = 3/70 (4%)
Frame = -1

Query: 651 SFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVD-YWXREELVGPLWGELGYIKMIR 475
           SF Y SG+Y + +CS +L+H VL+VG G D + YW + G WGE G++++ R
Sbjct: 371 SFFDYKSGIY-DGDCS-VNLNHAVLLVGEGYDPKTKKRYWIIKNSWGRDWGEDGFMRLER 428

Query: 474 NK--NNRCGI 451
           N++CGI
Sbjct: 429 TNEGNDKCGI 438

>UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans
Length = 383
Score = 47.2 bits (107), Expect = 3e-04
Identities = 22/69 (31%), Positives = 39/69 (56%), Gaps = 3/69 (4%)
Frame = -1

Query: 639 YSSGVYNE--EECSSTDLD-HGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNK 469
           Y SG++N E+C+ + H + ++GYG + + YW + G WG GY ++ R
Sbjct: 310 YRSGIFNPSVEDCTEKSMGAHALTIIGYGGEGESA-YWIVKNSWGTSWGASGYFRLARGV 368

Query: 468 NNRCGIASS 442
           N+ CG+A++
Sbjct: 369 NS-CGLANT 376

>UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_21, whole genome shotgun sequence - Paramecium tetraurelia
Length = 349
Score = 47.2 bits (107), Expect = 3e-04
Identities = 27/76 (35%), Positives = 40/76 (52%), Gaps = 3/76 (3%)
Frame = -1

Query: 651 SFQLYSSGVYNEE-ECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           +FQLY G+Y+ + + S L+HGV VGY D Y+ + G WGE GYI+ R
Sbjct: 267 NFQLYKKGIYSAKCDGSKPALNHGVTNVGYAPD-----YYLIKNSWGQSWGESGYIRFAR 321

Query: 474 --NKNNRCGIASSASY 433
           +K +CG ++
Sbjct: 322 IADKAGQCGAQQEVNF 337

>UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovirus|Rep: Viral cathepsin - Spodoptera litura multicapsid nucleopolyhedrovirus (SpltMNPV)
Length = 337
Score = 47.2 bits (107), Expect = 3e-04
Identities = 26/63 (41%), Positives = 33/63 (52%)
Frame = -1

Query: 639 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKNNR 460
           Y SG+ C+ L+H VL+VGYG E YW + G WGE GY + RN N
Sbjct: 268 YRSGIATV--CNDNGLNHAVLLVGYGI-ENDTPYWIFKNSWGSNWGENGYFRARRN-INA 323

Query: 459 CGI 451
           CG+
Sbjct: 324 CGM 326

>UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome 5 SCAF15026, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer)
Length = 351
Score = 46.8 bits (106), Expect = 5e-04
Identities = 25/71 (35%), Positives = 38/71 (53%)
Frame = -1

Query: 657 HTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMI 478
           + F LY SGVY S+ H + ++G+G +E GV YW WG+ G+ K++
Sbjct: 276 YEDFVLYKSGVYQHVSGSALG-GHAIKMLGWG-EENGVPYWLCANSWNTDWGDNGFFKIL 333

Query: 477 RNKNNRCGIAS 445
           R ++ CGI S
Sbjct: 334 RGADH-CGIES 343

>UniRef50_O48605 Cluster: Putative thiol protease; n=1; Hordeum vulgare|Rep: Putative thiol protease - Hordeum vulgare (Barley)
Length = 91
Score = 46.8 bits (106), Expect = 5e-04
Identities = 23/69 (33%), Positives = 35/69 (50%), Gaps = 2/69 (2%)
Frame = -1

Query: 633 SGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIRNKN--NR 460
           SGVY + C + +H + +VGYGT G YW + WG+ G+I ++R+
Sbjct: 22  SGVYIKGACKTAQ-NHAMALVGYGTKPDGTKYWIGKNSWTAKWGDKGFIYLLRDSPPLGL 80

Query: 459 CGIASSASY 433
           CG+A Y
Sbjct: 81  CGLAKLPVY 89

>UniRef50_A7QDM1 Cluster: Chromosome chr10 scaffold_81, whole genome shotgun sequence; n=1; Vitis vinifera|Rep: Chromosome chr10 scaffold_81, whole genome shotgun sequence - Vitis vinifera (Grape)
Length = 98
Score = 46.8 bits (106), Expect = 5e-04
Identities = 23/61 (37%), Positives = 29/61 (47%), Gaps = 3/61 (4%)
Frame = -1

Query: 606 SSTDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKM---IRNKNNRCGIASSAS 436
           S DLD+GV GYG G +W + G WGE GY +M ++ CG AS
Sbjct: 35  SGNDLDYGVTTDGYGRSADGKKHWLVKNSWGTDWGENGYTRMERGVKATTGLCGFTMQAS 94

Query: 435 Y 433
           Y
Sbjct: 95  Y 95

>UniRef50_Q23EG5 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210
Length = 429
Score = 46.8 bits (106), Expect = 5e-04
Identities = 27/74 (36%), Positives = 39/74 (52%), Gaps = 2/74 (2%)
Frame = -1

Query: 648 FQLYSSGVYNEEECSS--TDLDHGVLVVGYGTDEQGVDYWXREELVGPLWGELGYIKMIR 475
           F+ Y G+Y+ ECS+ +++H VL VGY + Y+ + G WG GY I
Sbjct: 269 FENYEGGIYSNPECSTDPQEVNHAVLAVGYNLTGR---YYIVKNSWGKDWGMDGYF-YIE 324

Query: 474 NKNNRCGIASSASY 433
           +N CG+A ASY
Sbjct: 325 LGSNMCGLADCASY 338

Database: uniref50
Posted date: Oct 5, 2007 11:19 AM
Number of letters in database: 575,637,011
Number of sequences in database: 1,657,284

Lambda   K       H
0.318    0.134   0.401

Gapped
Lambda   K       H
0.279    0.0580  0.190

Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 519,672,949
Number of Sequences: 1657284
Number of extensions: 8876468
Number of successful extensions: 22674
Number of sequences better than 10.0: 440
Number of HSP's better than 10.0 without gapping: 21812
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 22384
length of database: 575,637,011
effective HSP length: 98
effective length of database: 413,223,179
effective search space used: 49586781480
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)