BLASTX 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= fbpv0107
         (804 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                  Score    E
Sequences producing significant alignments:                       (bits)  Value

UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ...  105   1e-21
UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil...  77   7e-13
UniRef50_Q22A69 Cluster: Papain family cysteine protease contain...  75   2e-12
UniRef50_O16454 Cluster: Temporarily assigned gene name protein ...  70   6e-11
UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C...  69   1e-10
UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|...  69   1e-10
UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Ac...  68   2e-10
UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb...  66   7e-10
UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, wh...  66   7e-10
UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=...  66   1e-09
UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ...  65   2e-09
UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ...  65   2e-09
UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr...  64   3e-09
UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine p...  64   4e-09
UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov...  63   7e-09
UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckei...  62   1e-08
UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain...  61   4e-08
UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=...  60   6e-08
UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ...
59   1e-07
UniRef50_Q22RC9 Cluster: Papain family cysteine protease contain...  59   1e-07
UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D...  59   1e-07
UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa...  59   1e-07
UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei...  59   1e-07
UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum...  58   2e-07
UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R...  58   3e-07
UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e...  58   3e-07
UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35...  57   5e-07
UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n...  57   6e-07
UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh...  57   6e-07
UniRef50_Q650X9 Cluster: Putative cysteine proteinase; n=1; Oryz...  56   8e-07
UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; D...  56   8e-07
UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel...  56   8e-07
UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;...  56   1e-06
UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Pl...  56   1e-06
UniRef50_Q239L8 Cluster: Papain family cysteine protease contain...  56   1e-06
UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia...  56   1e-06
UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru...  56   1e-06
UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt...  55   2e-06
UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,...  54   3e-06
UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ...  54   3e-06
UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ...  54   3e-06
UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n...  54   4e-06
UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain...  54   4e-06
UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium...
54   4e-06
UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ...  54   6e-06
UniRef50_Q231X3 Cluster: Papain family cysteine protease contain...  54   6e-06
UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain...  54   6e-06
UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, wh...  54   6e-06
UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;...  53   1e-05
UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain ...  53   1e-05
UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t...  52   1e-05
UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph...  52   1e-05
UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic...  52   1e-05
UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi...  52   1e-05
UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ...  52   2e-05
UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae...  52   2e-05
UniRef50_Q23FS6 Cluster: Papain family cysteine protease contain...  52   2e-05
UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, w...  52   2e-05
UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2...  52   2e-05
UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryz...  52   2e-05
UniRef50_Q650W8 Cluster: Putative cysteine proteinase; n=2; Oryz...  52   2e-05
UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ...  52   2e-05
UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis...  52   2e-05
UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; ...  51   3e-05
UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ...  51   3e-05
UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory...  51   3e-05
UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin ...  51   4e-05
UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|...  51   4e-05
UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep...
51   4e-05
UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathep...  51   4e-05
UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh...  51   4e-05
UniRef50_Q23894 Cluster: Cysteine proteinase 3; n=2; Dictyosteli...  51   4e-05
UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re...  50   5e-05
UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re...  50   5e-05
UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ...  50   5e-05
UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ...  50   7e-05
UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ...  50   9e-05
UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18...  50   9e-05
UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio...  49   2e-04
UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy...  49   2e-04
UniRef50_Q23H32 Cluster: Papain family cysteine protease contain...  49   2e-04
UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease ...  48   3e-04
UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty...  48   3e-04
UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8; Magnoliophyt...  48   3e-04
UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr...  48   3e-04
UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ...  48   3e-04
UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:...  48   3e-04
UniRef50_Q945E4 Cluster: Cysteine proteinase; n=1; Vasconcellea ...  48   4e-04
UniRef50_Q94715 Cluster: Putative cathepsin L2 precursor; n=4; P...  48   4e-04
UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Re...  47   5e-04
UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big...  47   5e-04
UniRef50_Q5Y801 Cluster: Cysteine proteinase; n=1; Petunia x hyb...  47   5e-04
UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, wh...  47   5e-04
UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; ...
47   5e-04
UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox...  47   6e-04
UniRef50_Q23H10 Cluster: Papain family cysteine protease contain...  46   8e-04
UniRef50_Q7KWP5 Cluster: Similar to Dictyostelium discoideum (Sl...  46   0.001
UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa...  46   0.001
UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ...  46   0.001
UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sa...  45   0.002
UniRef50_Q8I8D5 Cluster: Cysteine protease 13; n=2; Entamoeba hi...  45   0.003
UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: ...  45   0.003
UniRef50_Q18740 Cluster: Putative uncharacterized protein tag-32...  45   0.003
UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen...  45   0.003
UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, w...  45   0.003
UniRef50_Q3LFN3 Cluster: Cysteine proteinase; n=1; Dianthus cary...  44   0.003
UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-...  44   0.003
UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl...  44   0.003
UniRef50_A7TC64 Cluster: Predicted protein; n=1; Nematostella ve...  44   0.003
UniRef50_A7APS9 Cluster: Papain family cysteine protease contain...  44   0.003
UniRef50_A0DKP6 Cluster: Chromosome undetermined scaffold_54, wh...  44   0.003
UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;...  44   0.003
UniRef50_A7LFV3 Cluster: Cathepsin L protease inhibitor 2; n=1; ...  44   0.005
UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:...  44   0.005
UniRef50_UPI0001509FF9 Cluster: Papain family cysteine protease ...  44   0.006
UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ...  44   0.006
UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted...  44   0.006
UniRef50_P56202 Cluster: Cathepsin W precursor; n=15; Eutheria|R...  44   0.006
UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [C...
44   0.006
UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w...  43   0.008
UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ...  43   0.010
UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, wh...  43   0.010
UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|...  43   0.010
UniRef50_Q9JM84 Cluster: DD72 protein; n=4; Murinae|Rep: DD72 pr...  42   0.014
UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ...  42   0.018
UniRef50_UPI0000ECC98C Cluster: Cystatin-F precursor (Leukocysta...  42   0.018
UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal...  42   0.018
UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa...  42   0.018
UniRef50_A2YHE2 Cluster: Putative uncharacterized protein; n=2; ...  42   0.018
UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhip...  42   0.018
UniRef50_A0CS14 Cluster: Chromosome undetermined scaffold_26, wh...  42   0.018
UniRef50_P01034 Cluster: Cystatin-C precursor; n=28; Eutheria|Re...  42   0.018
UniRef50_P01035 Cluster: Cystatin-C precursor; n=3; Cetartiodact...  42   0.018
UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs...  42   0.018
UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ...  42   0.024
UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:...  42   0.024
UniRef50_Q1LYJ7 Cluster: Novel protein; n=3; Danio rerio|Rep: No...  42   0.024
UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryz...  42   0.024
UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G...  42   0.024
UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|...  42   0.024
UniRef50_Q22NW9 Cluster: Papain family cysteine protease contain...  42   0.024
UniRef50_UPI0000F2B877 Cluster: PREDICTED: hypothetical protein;...  41   0.032
UniRef50_Q5YER1 Cluster: Cysteine proteinase; n=1; Bigelowiella ...  41   0.032
UniRef50_A5HC50 Cluster: Cathepsin W; n=2; Theria|Rep: Cathepsin...
41   0.032
UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, wh...  41   0.032
UniRef50_Q24F16 Cluster: Papain family cysteine protease contain...  41   0.042
UniRef50_Q248G1 Cluster: Papain family cysteine protease contain...  41   0.042
UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve...  41   0.042
UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi...  41   0.042
UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca...  40   0.055
UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto...  40   0.055
UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ...  40   0.073
UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v...  40   0.073
UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|...  40   0.097
UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcopt...  40   0.097
UniRef50_Q61CG7 Cluster: Putative uncharacterized protein CBG129...  40   0.097
UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n...  40   0.097
UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain...  40   0.097
UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ...  39   0.13
UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2; Arab...  39   0.13
UniRef50_O48608 Cluster: Putative thiol protease; n=1; Hordeum v...  39   0.13
UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ...  39   0.13
UniRef50_A0CHZ5 Cluster: Chromosome undetermined scaffold_186, w...  39   0.13
UniRef50_UPI00006A00FD Cluster: Cystatin-M precursor (Cystatin-6...  39   0.17
UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca...  39   0.17
UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease ...  38   0.22
UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep...  38   0.22
UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain...  38   0.22
UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Pl...  38   0.22
UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ...
38   0.22
UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ...  38   0.30
UniRef50_A1Z9I0 Cluster: CG6357-PA; n=3; Drosophila melanogaster...  38   0.30
UniRef50_P22085 Cluster: Onchocystatin precursor; n=6; Onchocerc...  38   0.30
UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto...  38   0.30
UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome s...  38   0.39
UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ...  38   0.39
UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1...  38   0.39
UniRef50_A7LFV2 Cluster: Cathepsin L protease inhibitor 1; n=1; ...  38   0.39
UniRef50_Q7M429 Cluster: L-cystatin precursor; n=1; Tachypleus t...  38   0.39
UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain...  37   0.52
UniRef50_P01038 Cluster: Cystatin precursor; n=2; Phasianidae|Re...  37   0.52
UniRef50_O76096 Cluster: Cystatin-F precursor; n=13; Eutheria|Re...  37   0.52
UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin ...  37   0.68
UniRef50_A3EYB2 Cluster: Vap1; n=2; Mammalia|Rep: Vap1 - Trichos...  37   0.68
UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n...  37   0.68
UniRef50_Q23H15 Cluster: Papain family cysteine protease contain...  37   0.68
UniRef50_Q22DX2 Cluster: Papain family cysteine protease contain...  37   0.68
UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi...  37   0.68
UniRef50_A2FLT7 Cluster: Putative uncharacterized protein; n=1; ...  37   0.68
UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG...  36   0.90
UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-...  36   0.90
UniRef50_Q1WDN1 Cluster: Cystatin-2; n=1; Haemaphysalis longicor...  36   0.90
UniRef50_O08677 Cluster: Kininogen-1 precursor [Contains: Kinino...  36   0.90
UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h...  36   1.2
UniRef50_UPI0000E255D2 Cluster: PREDICTED: similar to Cystatin C...  36   1.2
UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi...
36   1.2
UniRef50_Q711N7 Cluster: Putative cys1 protein; n=1; Fasciola he...  36   1.2
UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n...  36   1.2
UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio...  36   1.2
UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain...  36   1.2
UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy...  36   1.2
UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3...  36   1.2
UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R...  36   1.2
UniRef50_Q4RQ21 Cluster: Chromosome 17 SCAF15006, whole genome s...  36   1.6
UniRef50_Q70AR5 Cluster: Putative cytochrome P450; n=1; Streptom...  36   1.6
UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ...  36   1.6
UniRef50_Q5DB58 Cluster: SJCHGC06844 protein; n=1; Schistosoma j...  36   1.6
UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus...  36   1.6
UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali...  36   1.6
UniRef50_A0BLR4 Cluster: Chromosome undetermined scaffold_115, w...  36   1.6
UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;...  35   2.1
UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j...  35   2.1
UniRef50_Q4U3Y4 Cluster: CYP325C2; n=4; Anopheles gambiae|Rep: C...  35   2.1
UniRef50_Q23H06 Cluster: Papain family cysteine protease contain...  35   2.1
UniRef50_Q22W19 Cluster: Papain family cysteine protease contain...  35   2.1
UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C...  35   2.1
UniRef50_UPI00015B5E04 Cluster: PREDICTED: similar to CG8302-PA;...  35   2.8
UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat...  35   2.8
UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theil...  35   2.8
UniRef50_P35481 Cluster: Cystatin precursor; n=1; Cyprinus carpi...  35   2.8
UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ...  34   3.6
UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ...
34   3.6
UniRef50_Q2XXN5 Cluster: Cystatin-POGU1; n=1; Pogona barbata|Rep...  34   3.6
UniRef50_Q9U7F7 Cluster: Cysteine protease; n=2; Entamoeba histo...  34   3.6
UniRef50_Q24E33 Cluster: Papain family cysteine protease contain...  34   3.6
UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain...  34   3.6
UniRef50_O61973 Cluster: Cystatin-like protease inhibitor protei...  34   3.6
UniRef50_O45120 Cluster: Family 4 cytochrome P450; n=2; Coptoter...  34   3.6
UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]...  34   3.6
UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ...  34   3.6
UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; ...  34   4.8
UniRef50_Q84YE7 Cluster: Cysteine proteinase-like protein; n=1; ...  34   4.8
UniRef50_Q9Y1T8 Cluster: Cytochrome P450 4W1; n=3; Arthropoda|Re...  34   4.8
UniRef50_Q967Y5 Cluster: Cytochrome P450 CYP4G13v2; n=4; Neopter...  34   4.8
UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n...  34   4.8
UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ...  33   6.4
UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa...  33   6.4
UniRef50_Q015J8 Cluster: Cathepsin; n=2; Ostreococcus|Rep: Cathe...  33   6.4
UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba hi...  33   6.4
UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1...  33   6.4
UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicate...  33   6.4
UniRef50_O15725 Cluster: Pol; n=20; Dictyostelium discoideum|Rep...  33   6.4
UniRef50_Q91195 Cluster: Cystatin precursor; n=4; Actinopteri|Re...  33   6.4
UniRef50_Q9V7G5 Cluster: Probable cytochrome P450 4aa1; n=5; Dip...  33   6.4
UniRef50_Q45RG8 Cluster: Cystatin; n=4; Danio rerio|Rep: Cystati...  33   8.4
UniRef50_A7HJT1 Cluster: MutS2 family protein; n=1; Fervidobacte...  33   8.4
UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryz...  33   8.4
UniRef50_Q9U9A1 Cluster: Cystatin-type cysteine proteinase inhib...
33 8.4 UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus... 33 8.4 UniRef50_Q6QZV5 Cluster: Cystatin precursor; n=1; Ornithodoros m... 33 8.4 UniRef50_Q9VYY4 Cluster: Cytochrome P450 4g15; n=8; Neoptera|Rep... 33 8.4 >UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F like protease; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to cathepsin F like protease - Nasonia vitripennis Length = 1036 Score = 105 bits (252), Expect = 1e-21 Identities = 64/158 (40%), Positives = 85/158 (53%), Gaps = 1/158 (0%) Frame = +2 Query: 257 YDFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGK 436 ++F+ +K Y + E RF+IFK N+ I EL +E GT YG+TQF DL+ EF Sbjct: 732 HEFMGKYKKMY-HNKEEKEMRFQIFKDNLNLIEELQRNEMGTGRYGVTQFTDLTKAEFKA 790 Query: 437 KYLGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLV 613 ++LGLKP+L+ N IPM A IP ++ P + VKDQG CGS W + + Sbjct: 791 RHLGLKPTLKSENDIPMPMATIPDIELPSDYDWRHHNVVTPVKDQGSCGSCWAFSVTGNI 850 Query: 614 MSRVNIS*RLDSCCHFSEQELVDCDKP*RRM*RGGLPD 727 + I + SEQELVDCDK GGLPD Sbjct: 851 EGQYAI--KHGELLSLSEQELVDCDKLDSGC-NGGLPD 885 Score = 44.4 bits (100), Expect = 0.003 Identities = 18/66 (27%), Positives = 35/66 (53%), Gaps = 1/66 (1%) Frame = +3 Query: 18 QVIAGIHYRMKVEVGLTNCTALTNRSDCKHISDESLNKFCRVNVWMRPWTNH-PPNFRVT 194 QV++G+ Y+++ ++G++ C+ T DC+ D + + C + W +PW + P V Sbjct: 641 QVVSGLLYKIQTDIGVSTCSKGTVTGDCQLSKDHGVEE-CVIEAWSQPWLDKGNPKITVK 699 Query: 195 CDYQES 212 C S Sbjct: 700 CGQNRS 705 >UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Liliopsida|Rep: Putative cysteine proteinase - Oryza sativa subsp. 
japonica (Rice) Length = 416 Score = 76.6 bits (180), Expect = 7e-13 Identities = 48/133 (36%), Positives = 68/133 (51%), Gaps = 3/133 (2%) Frame = +2 Query: 296 DAAEMRRRFEIFKGNVRKIHELNTHERGTA-VYGITQFADLSYEEFGKKYLGLK--PSLR 466 D ++M RFE+FK N R IHE N +G + V G+ +F+DL+YEEF KY G+K S Sbjct: 38 DLSDMESRFEVFKANARYIHEFNQKSKGMSYVLGLNKFSDLTYEEFAAKYTGVKVDASAF 97 Query: 467 DTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLGPLALLVMSRVNIS*RLD 646 T E+P P + DVKDQG CGS A+ + +N + Sbjct: 98 ATATTSSPDEELPVGVPPATWDWRLNGAVTDVKDQGQCGSCWVFSAVGAVEGIN-AIMTG 156 Query: 647 SCCHFSEQELVDC 685 + SEQ+++DC Sbjct: 157 NLLTLSEQQVLDC 169 >UniRef50_Q22A69 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 330 Score = 74.9 bits (176), Expect = 2e-12 Identities = 57/160 (35%), Positives = 81/160 (50%), Gaps = 5/160 (3%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKY 442 F T+ Y + R IFK N+R+I N ++ A +GITQFADL++EEF Y Sbjct: 33 FTQTYNKKYSSEE-HYNARLSIFKENLRRIELFNKNDE--AQHGITQFADLTHEEFADMY 89 Query: 443 LGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAI--MTQS--PDVKDQGMCGS-WLGPLAL 607 LG KP LR++ QA++ +P + AI T+ VK+QG CGS W Sbjct: 90 LGYKPQLRNS------QAKVSLSSTPFTAPTAIDWTTKGAVTPVKNQGSCGSCWAFSTTG 143 Query: 608 LVMSRVNIS*RLDSCCHFSEQELVDCDKP*RRM*RGGLPD 727 + + + + + FSEQ+LVDCD + GGL D Sbjct: 144 SIEGQYVLQLK-QNLTSFSEQQLVDCDTKEDQGCNGGLMD 182 >UniRef50_O16454 Cluster: Temporarily assigned gene name protein 196; n=4; Bilateria|Rep: Temporarily assigned gene name protein 196 - Caenorhabditis elegans Length = 477 Score = 70.1 bits (164), Expect = 6e-11 Identities = 60/165 (36%), Positives = 80/165 (48%), Gaps = 10/165 (6%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKK 439 DF+ H+ Y + E+ +RF +FK N + I EL +E+GTAVYG T+F+D++ EF K Sbjct: 176 DFVDRHEKKYTNKR-EVLKRFRVFKKNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFKKI 234 Query: 440 
YLGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPD---------VKDQGMCGS-W 589 L P + PM QA K IN + +S D VK+QG CGS W Sbjct: 235 ML---PYQWEQPVYPMEQANFEKHDVTINE--EDLPESFDWREKGAVTQVKNQGNCGSCW 289 Query: 590 LGPLALLVMSRVNIS*RLDSCCHFSEQELVDCDKP*RRM*RGGLP 724 V I+ + SEQELVDCD + GGLP Sbjct: 290 AFSTTGNVEGAWFIA--KNKLVSLSEQELVDCDSMDQGC-NGGLP 331 >UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: Cysteine protease - Clonorchis sinensis Length = 328 Score = 69.3 bits (162), Expect = 1e-10 Identities = 55/159 (34%), Positives = 76/159 (47%), Gaps = 5/159 (3%) Frame = +2 Query: 227 VPPHTS*TFVYDFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQF 406 V P + +F +K Y +D E+R FEIFK N+ + L E+GTA YG+TQF Sbjct: 23 VEPDNARALYEEFKLKYKKTYSNDDDELR--FEIFKDNLLRAKRLQEMEQGTAQYGVTQF 80 Query: 407 ADLSYEEFGKKYLGLK--PSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMC 580 +DL+ EEF +YL ++ + + P + K GA+ P V DQG C Sbjct: 81 SDLTSEEFKTRYLRMRFDGPIVSEDPSPEEDVTMDNEKFDWREHGAV---GP-VLDQGKC 136 Query: 581 GS-WLGPLALLVMSRVNIS--*RLDSCCHFSEQELVDCD 688 GS W A V+ V + SEQ+LVDCD Sbjct: 137 GSCW----AFSVIGNVEGQWFRKTGDLLALSEQQLVDCD 171 >UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|Rep: Cathepsin F precursor - Homo sapiens (Human) Length = 484 Score = 68.9 bits (161), Expect = 1e-10 Identities = 57/160 (35%), Positives = 82/160 (51%), Gaps = 5/160 (3%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKK 439 +F+ T+ Y + E R R +F N+ + ++ +RGTA YG+T+F+DL+ EEF Sbjct: 189 NFVITYNRTY-ESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTI 247 Query: 440 YLGLKPSLRDTNQIPMRQAE-IPKLKSP---INSIGAIMTQSPDVKDQGMCGS-WLGPLA 604 Y L LR M+QA+ + L P S GA+ VKDQGMCGS W + Sbjct: 248 Y--LNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAV----TKVKDQGMCGSCWAFSVT 301 Query: 605 LLVMSRVNIS*RLDSCCHFSEQELVDCDKP*RRM*RGGLP 724 V + ++ + SEQEL+DCDK + GGLP Sbjct: 302 GNVEGQWFLN--QGTLLSLSEQELLDCDKMDKAC-MGGLP 338 >UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Actinidin Act3a - Actinidia eriantha Length = 380 Score = 68.1 bits 
(159), Expect = 2e-10 Identities = 50/144 (34%), Positives = 71/144 (49%), Gaps = 3/144 (2%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRD--TNQ 478 E R EIFK N+R I E N + G+ QFADL+ EE+ YLG K SL+ +N+ Sbjct: 58 EREMRIEIFKENLRFIDEHNADPNRSYTVGLNQFADLTDEEYRSTYLGFKSSLKSKVSNR 117 Query: 479 IPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLGPLALLVMSRVNIS*RLDSCCH 658 + E+ + GA++ DVK+QG+C S + + +N D Sbjct: 118 YMPQVGEVLPDYVDWRTTGAVV----DVKNQGLCSSCWAFATIATVESINQIITGD-LIS 172 Query: 659 FSEQELVDCDK-P*RRM*RGGLPD 727 SEQELVDC++ P +GG D Sbjct: 173 LSEQELVDCNRTPINEGCKGGFMD 196 >UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gambiae str. PEST|Rep: ENSANGP00000018713 - Anopheles gambiae str. PEST Length = 559 Score = 66.5 bits (155), Expect = 7e-10 Identities = 52/153 (33%), Positives = 75/153 (49%), Gaps = 7/153 (4%) Frame = +2 Query: 254 VYDFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFG 433 ++D H + E RF IF+ N+ KI +LN ERGTA YG+T+FAD++ E+ Sbjct: 248 MFDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIEQLNKFERGTAKYGVTKFADMTVAEY- 306 Query: 434 KKYLGLKPSLRD-TNQIPMRQAEIPKLKS----PINSIGAIMTQSPDVKDQGMCGS-WLG 595 + + GL D N + R A + P + +VK+QG CGS W Sbjct: 307 RAHTGLVVPKHDRANHVGNRVASEEDVAGVGDLPRSFDWRDHGAVTEVKNQGSCGSCWAF 366 Query: 596 PLALLVMSRVNI-S*RLDSCCHFSEQELVDCDK 691 V I + +L+S +SEQEL+DCDK Sbjct: 367 SAVGNVEGLHQIKTKKLES---YSEQELIDCDK 396 >UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_56, whole genome shotgun sequence - Paramecium tetraurelia Length = 314 Score = 66.5 bits (155), Expect = 7e-10 Identities = 47/132 (35%), Positives = 68/132 (51%), Gaps = 2/132 (1%) Frame = +2 Query: 302 AEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQI 481 AE RF++++ +++I LN+ E T V+G TQF DL+ EEF L K S Sbjct: 46 AERAYRFQVYQDAMKQIQILNSEENSTTVFGETQFTDLTNEEFAALLLTRKES------- 98 Query: 482 PMR-QAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSCC 655 PM AE+ + P+ 
+ A ++ VK+QG CGS W V + + I + Sbjct: 99 PMNLDAELYVPQGPLKA-SADWSKITSVKNQGNCGSCWAFSAVGAVETLLTIKGVISKDL 157 Query: 656 HFSEQELVDCDK 691 SEQ+LVDCDK Sbjct: 158 WLSEQQLVDCDK 169 >UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=176; Viridiplantae|Rep: Cysteine proteinase RD21a precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 462 Score = 66.1 bits (154), Expect = 1e-09 Identities = 57/164 (34%), Positives = 81/164 (49%), Gaps = 6/164 (3%) Frame = +2 Query: 254 VYD-FLATH-KPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEE 427 +Y+ +L H K + E RRFEIFK N+R + E N + G+T+FADL+ +E Sbjct: 49 IYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRL-GLTRFADLTNDE 107 Query: 428 FGKKYLGLKPSLRDTNQIPMR-QAEI-PKLKSPIN--SIGAIMTQSPDVKDQGMCGSWLG 595 + KYLG K + + +R +A + +L I+ GA+ +VKDQG CGS Sbjct: 108 YRSKYLGAKMEKKGERRTSLRYEARVGDELPESIDWRKKGAV----AEVKDQGGCGSCWA 163 Query: 596 PLALLVMSRVNIS*RLDSCCHFSEQELVDCDKP*RRM*RGGLPD 727 + + +N D SEQELVDCD GGL D Sbjct: 164 FSTIGAVEGINQIVTGD-LITLSEQELVDCDTSYNEGCNGGLMD 206 >UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1 precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 355 Score = 64.9 bits (151), Expect = 2e-09 Identities = 53/160 (33%), Positives = 77/160 (48%), Gaps = 5/160 (3%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKY 442 +++ H Y E RFE+F+ N+ I + N +E + G+ +FADL++EEF +Y Sbjct: 54 WMSEHSKAY-KSVEEKVHRFEVFRENLMHIDQRN-NEINSYWLGLNEFADLTHEEFKGRY 111 Query: 443 LGL-KPSLRDTNQ--IPMRQAEIPKLKSPIN--SIGAIMTQSPDVKDQGMCGSWLGPLAL 607 LGL KP Q R +I L ++ GA+ +P VKDQG CGS + Sbjct: 112 LGLAKPQFSRKRQPSANFRYRDITDLPKSVDWRKKGAV---AP-VKDQGQCGSCWAFSTV 167 Query: 608 LVMSRVNIS*RLDSCCHFSEQELVDCDKP*RRM*RGGLPD 727 + +N + SEQEL+DCD GGL D Sbjct: 168 AAVEGIN-QITTGNLSSLSEQELIDCDTTFNSGCNGGLMD 206 >UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 precursor; n=2; Arabidopsis thaliana|Rep: Probable cysteine proteinase At3g43960 precursor - 
Arabidopsis thaliana (Mouse-ear cress) Length = 376 Score = 64.9 bits (151), Expect = 2e-09 Identities = 48/152 (31%), Positives = 74/152 (48%), Gaps = 4/152 (2%) Frame = +2 Query: 248 TFVYDFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEE 427 T +L + NY + E RRF+IFK N+++I E N+ + G+ +F+DL+ +E Sbjct: 39 TMYEQWLVENGKNY-NGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADE 97 Query: 428 FGKKYLG---LKPSLRD-TNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLG 595 F YLG K SL D + ++ ++ + GA++ P VK QG CGS Sbjct: 98 FQASYLGGKMEKKSLSDVAERYQYKEGDVLPDEVDWRERGAVV---PRVKRQGECGSCWA 154 Query: 596 PLALLVMSRVNIS*RLDSCCHFSEQELVDCDK 691 A + +N SEQEL+DCD+ Sbjct: 155 FAATGAVEGIN-QITTGELVSLSEQELIDCDR 185 >UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 precursor; n=4; Schizophora|Rep: Putative cysteine proteinase CG12163 precursor - Drosophila melanogaster (Fruit fly) Length = 614 Score = 64.5 bits (150), Expect = 3e-09 Identities = 52/151 (34%), Positives = 75/151 (49%), Gaps = 7/151 (4%) Frame = +2 Query: 257 YDFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGK 436 Y F Y+ AE + R IF+ N++ I ELN +E G+A YGIT+FAD++ E+ K Sbjct: 309 YKFQVRFGRRYVS-TAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSEY-K 366 Query: 437 KYLGL------KPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLG 595 + GL K + +P E+PK + A+ TQ VK+QG CGS W Sbjct: 367 ERTGLWQRDEAKATGGSAAVVPAYHGELPK-EFDWRQKDAV-TQ---VKNQGSCGSCWAF 421 Query: 596 PLALLVMSRVNIS*RLDSCCHFSEQELVDCD 688 + + + + FSEQEL+DCD Sbjct: 422 SVTGNIEGLYAV--KTGELKEFSEQELLDCD 450 >UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine protease; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cysteine protease - Strongylocentrotus purpuratus Length = 494 Score = 64.1 bits (149), Expect = 4e-09 Identities = 51/145 (35%), Positives = 69/145 (47%), Gaps = 2/145 (1%) Frame = +2 Query: 263 FLATHKPNYI--DDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGK 436 FL T K Y D E R+ +F N+ + N E+GTA YG T+FAD++ EF K Sbjct: 159 
FLMTFKREYRQNDGTNEYEYRYSVFVQNMLTVEMFNQFEQGTAKYGPTKFADMTEAEFRK 218 Query: 437 KYLGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLGPLALLVM 616 G Q + Q +P+ + + GA+ +P VK+QGMCGS A+ M Sbjct: 219 LQSGPLKKTGIKKQAAIPQGPVPE-EYDWRTHGAV---TP-VKNQGMCGSCWAFSAIGNM 273 Query: 617 SRVNIS*RLDSCCHFSEQELVDCDK 691 + SEQELVDCDK Sbjct: 274 EG-QWQIKKGELISLSEQELVDCDK 297 >UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrovirus|Rep: Viral cathepsin - Lymantria dispar multicapsid nuclear polyhedrosis virus (LdMNPV) Length = 356 Score = 63.3 bits (147), Expect = 7e-09 Identities = 45/145 (31%), Positives = 68/145 (46%), Gaps = 3/145 (2%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTH--ERGTAVYGITQFADLSYEEFGK 436 F+ + NY D E +R+ IFK N+ +I+ N + + TA Y I +F+DLS E Sbjct: 59 FVENYNKNYTSDW-EKNKRYSIFKDNLHEINAKNGNATDGPTATYKINKFSDLSKSELIA 117 Query: 437 KYLGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLV 613 K+ GL R +N P K P++ + +K+QG CG+ W A L Sbjct: 118 KFTGLSIPERVSNFCKTIILNQPPDKGPLHFDWREQNKVTSIKNQGACGACWA--FATLA 175 Query: 614 MSRVNIS*RLDSCCHFSEQELVDCD 688 + R + SEQ+L+DCD Sbjct: 176 SVESQFAMRHNRLIDLSEQQLIDCD 200 >UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckeia)|Rep: Berghepain-2 - Plasmodium yoelii yoelii Length = 472 Score = 62.5 bits (145), Expect = 1e-08 Identities = 53/155 (34%), Positives = 73/155 (47%), Gaps = 12/155 (7%) Frame = +2 Query: 257 YDFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGK 436 Y F+ + Y A EM+ RF IF ++KI + N E GI F+D+ +EEF Sbjct: 157 YSFMKKYNKEY-SSAEEMQERFYIFSEKLKKIEKHNK-ENHLYTKGINAFSDMRHEEFKM 214 Query: 437 KYLGLKPSLRDTNQIPMRQ-----AEIPKLKSPINSIGAIM------TQSPDVKDQGMCG 583 KYL K L++ +QI +R I K KSP + I D+KDQ C Sbjct: 215 KYLNNK--LKENHQIDLRHLIPYTIAINKYKSPTDQINYTSFDWRDHNAIIDIKDQQKCA 272 Query: 584 S-WLGPLALLVMSRVNIS*RLDSCCHFSEQELVDC 685 S W A +V ++ I R + SEQ+LVDC Sbjct: 273 SCWAFATAGVVAAQYAI--RKNQKVSLSEQQLVDC 305 >UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain; n=9; Cucujiformia|Rep: Digestive 
cysteine proteinase intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 326 Score = 60.9 bits (141), Expect = 4e-08 Identities = 51/150 (34%), Positives = 73/150 (48%), Gaps = 6/150 (4%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNT-HERGTAVY--GITQFADLSYEEFG 433 F TH Y E R RF IF+ N+RKI E N +++G Y G+T FADL+++EF Sbjct: 26 FKQTHGKTY-KSLLEERTRFGIFQSNLRKIEEHNAKYDKGEESYFLGVTPFADLTHDEFK 84 Query: 434 ---KKYLGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLGPLA 604 ++ + KP++ T + E+P GA++ DVK QG CGS A Sbjct: 85 DELRRQIKTKPNVEATLAVFPEGLEVPD-SIDWTQKGAVL----DVKYQGGCGSCWAFSA 139 Query: 605 LLVMSRVNIS*RLDSCCHFSEQELVDCDKP 694 + N + SEQ+L+DC KP Sbjct: 140 TGALEGQNAIVN-NVKIPLSEQQLLDCSKP 168 >UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=15; Magnoliophyta|Rep: Cysteine proteinase RD19a precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 368 Score = 60.1 bits (139), Expect = 6e-08 Identities = 43/127 (33%), Positives = 63/127 (49%), Gaps = 3/127 (2%) Frame = +2 Query: 317 RFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSL---RDTNQIPM 487 RF +FK N+R+ + +A +G+TQF+DL+ EF KK+LG++ +D N+ P+ Sbjct: 71 RFSVFKANLRRARRHQKLDP-SATHGVTQFSDLTRSEFRKKHLGVRSGFKLPKDANKAPI 129 Query: 488 RQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLGPLALLVMSRVNIS*RLDSCCHFSE 667 E GA+ +P VK+QG CGS A + N SE Sbjct: 130 LPTENLPEDFDWRDHGAV---TP-VKNQGSCGSCWSFSATGALEGANFL-ATGKLVSLSE 184 Query: 668 QELVDCD 688 Q+LVDCD Sbjct: 185 QQLVDCD 191 >UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; n=23; Magnoliophyta|Rep: Senescence-specific cysteine protease - Arabidopsis thaliana (Mouse-ear cress) Length = 346 Score = 59.3 bits (137), Expect = 1e-07 Identities = 42/138 (30%), Positives = 57/138 (41%), Gaps = 7/138 (5%) Frame = +2 Query: 296 DAAEMRRRFEIFKGNVRKIHELNTHERG-TAVYGITQFADLSYEEFGKKYLGLK-----P 457 D E R+ +FK NV +I LN+ G T + QFADL+ +EF Y G K Sbjct: 51 DVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSALS 110 Query: 458 
SLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCG-SWLGPLALLVMSRVNIS 634 S T P R + P++ +K+QG CG W + I Sbjct: 111 SQSQTKMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQI- 169 Query: 635 *RLDSCCHFSEQELVDCD 688 + SEQ+LVDCD Sbjct: 170 -KKGKLISLSEQQLVDCD 186 >UniRef50_Q22RC9 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 389 Score = 59.3 bits (137), Expect = 1e-07 Identities = 50/145 (34%), Positives = 71/145 (48%), Gaps = 3/145 (2%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKY 442 F A HK Y + E +RRFEIF+ N+ I ELN E GTA YGITQF+D++ EEF + Sbjct: 43 FKAEHKKFY--NFLEEQRRFEIFRQNLDIISELNQVEEGTAEYGITQFSDMTTEEFKSQI 100 Query: 443 LGLKPSLRDTNQIPMRQAEIPKLK--SPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLV 613 L PS N R K+ +P + VK+QG G+ W + Sbjct: 101 --LIPSTYARNFTGSRYHGFQKISQDAPTSYDWRDHGAVTPVKNQGTVGTCWTFSTTGNI 158 Query: 614 MSRVNIS*RLDSCCHFSEQELVDCD 688 + ++ + SE+++VDCD Sbjct: 159 EGQWFLA--GNPLVSLSEEQIVDCD 181 Score = 38.3 bits (85), Expect = 0.22 Identities = 19/42 (45%), Positives = 24/42 (57%) Frame = +1 Query: 511 EIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKL 636 + P +DWRD+ AVT G V FS TGN+EGQ+ L Sbjct: 124 DAPTSYDWRDHGAVTPVKNQGTV-GTCWTFSTTGNIEGQWFL 164 >UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; Dictyostelium discoideum|Rep: Cysteine proteinase 1 precursor - Dictyostelium discoideum (Slime mold) Length = 343 Score = 59.3 bits (137), Expect = 1e-07 Identities = 52/160 (32%), Positives = 75/160 (46%), Gaps = 6/160 (3%) Frame = +2 Query: 227 VPPHTS*TFVYDFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNT---HERGTAVYGI 397 +PP F+ +F Y + E RFEIFK N+ KI ELN + + +G+ Sbjct: 21 IPPEEQSQFL-EFQDKFNKKYSHE--EYLERFEIFKSNLGKIEELNLIAINHKADTKFGV 77 Query: 398 TQFADLSYEEFGKKYLGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQS--PDVKDQ 571 +FADLS +EF YL K ++ T+ +P+ + + I + T+ VK+Q Sbjct: 78 
NKFADLSSDEFKNYYLNNKEAI-FTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQ 136 Query: 572 GMCGS-WLGPLALLVMSRVNIS*RLDSCCHFSEQELVDCD 688 G CGS W V + IS + SEQ LVDCD Sbjct: 137 GQCGSCWSFSTTGNVEGQHFIS--QNKLVSLSEQNLVDCD 174 >UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa|Rep: Os09g0497500 protein - Oryza sativa subsp. japonica (Rice) Length = 349 Score = 58.8 bits (136), Expect = 1e-07 Identities = 47/140 (33%), Positives = 66/140 (47%), Gaps = 9/140 (6%) Frame = +2 Query: 296 DAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTN 475 DA E +RRFE+++ NV + N+ G + +FADL+ EEF K LG +P + Sbjct: 44 DAGEKQRRFEVYRRNVELVETFNSMSNGYKLAD-NKFADLTNEEFRAKMLGFRPHVTIPQ 102 Query: 476 QIPMRQAEIPKLKSPINSIGAIMTQSPD---------VKDQGMCGSWLGPLALLVMSRVN 628 A+I P S I+ +S D VK+QG CGS A+ + +N Sbjct: 103 ISNTCSADI---AMPGESSDDILPKSVDWRKKGAVVEVKNQGDCGSCWAFSAVAAIEGIN 159 Query: 629 IS*RLDSCCHFSEQELVDCD 688 + SEQELVDCD Sbjct: 160 -QIKNGELVSLSEQELVDCD 178 >UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like midgut cysteine proteinase - Tenebrio molitor (Yellow mealworm) Length = 330 Score = 58.8 bits (136), Expect = 1e-07 Identities = 50/146 (34%), Positives = 68/146 (46%), Gaps = 5/146 (3%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTH-ERGTAVYG--ITQFADLSYEEFG 433 F THK +Y E+RR+ IFK NV KI E N E+G Y + QF D+S EEF Sbjct: 31 FKLTHKKSYSSPIEEIRRQL-IFKDNVAKIAEHNAKFEKGEVTYSKAMNQFGDMSKEEF- 88 Query: 434 KKYLGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALL 610 Y+ + + + +R + K S+ +VKDQG CGS W Sbjct: 89 LAYVNRGKAQKPKHPENLRMPYVSSKKPLAASVDWRSNAVSEVKDQGQCGSCWSFSTTGA 148 Query: 611 VMSRVNIS-*RLDSCCHFSEQELVDC 685 V ++ + RL S SEQ L+DC Sbjct: 149 VEGQLALQRGRLTS---LSEQNLIDC 171 >UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum|Rep: Falcipain 2 - Plasmodium falciparum Length = 484 Score = 58.4 bits (135), Expect = 2e-07 Identities = 51/154 (33%), Positives = 74/154 (48%), Gaps = 11/154 (7%) Frame = +2 Query: 257 
YDFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGK 436 Y F+ T+ Y + EM+ RF++F N K++ N ++ + +FADL+Y EF Sbjct: 166 YMFIKTNNKQY-NSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKELNRFADLTYHEFKN 224 Query: 437 KYLGLKPS--LRDTNQI--PMRQAE-IPKLKSPINSIGA-----IMTQSPDVKDQGMCGS 586 KYL L+ S L+++ + M E I K + N A + + VKDQ CGS Sbjct: 225 KYLSLRSSKPLKNSKYLLDQMNYEEVIKKYRGEENFDHAAYDWRLHSGVTPVKDQKNCGS 284 Query: 587 -WLGPLALLVMSRVNIS*RLDSCCHFSEQELVDC 685 W V S+ I R + SEQELVDC Sbjct: 285 CWAFSSIGSVESQYAI--RKNKLITLSEQELVDC 316 >UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|Rep: Cathepsin L precursor - Schistosoma mansoni (Blood fluke) Length = 319 Score = 58.0 bits (134), Expect = 3e-07 Identities = 45/130 (34%), Positives = 64/130 (49%), Gaps = 6/130 (4%) Frame = +2 Query: 317 RFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYL---GLKPSLRDTNQIPM 487 RF IFK N+ K RG+A+YG+T ++DL+ +EF + +L + PS R + Sbjct: 39 RFNIFKSNILKAQLYQVFVRGSAIYGVTPYSDLTTDEFARTHLTASWVVPSSRSNTPTSL 98 Query: 488 RQA--EIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSCCH 658 + IPK GA+ +VK+QGMCGS W V S+ + Sbjct: 99 GKEVNNIPK-NFDWREKGAV----TEVKNQGMCGSCWAFSTTGNVESQ--WFRKTGKLLS 151 Query: 659 FSEQELVDCD 688 SEQ+LVDCD Sbjct: 152 LSEQQLVDCD 161 >UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase) (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1; Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase) (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1; Vignain-2] - Vigna mungo (Rice bean) (Black gram) Length = 362 Score = 57.6 bits (133), Expect = 3e-07 Identities = 42/144 (29%), Positives = 66/144 (45%), Gaps = 5/144 (3%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIP 484 E +RF +FK NV +H N ++ + + +FAD++ EF Y G K + + Sbjct: 55 EKHKRFNVFKANVMHVHNTNKMDKPYKLK-LNKFADMTNHEFRSTYAGSKVNHHKMFR-G 112 Query: 485 MRQAEIPKLKSPINSIGAIMTQSP-----DVKDQGMCGSWLGPLALLVMSRVNIS*RLDS 649 + + + S+ A 
+ DVKDQG CGS ++ + +N + + Sbjct: 113 SQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGIN-QIKTNK 171 Query: 650 CCHFSEQELVDCDKP*RRM*RGGL 721 SEQELVDCDK + GGL Sbjct: 172 LVSLSEQELVDCDKEENQGCNGGL 195 >UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35; Viridiplantae|Rep: Cysteine proteinase 15A precursor - Pisum sativum (Garden pea) Length = 363 Score = 57.2 bits (132), Expect = 5e-07 Identities = 45/126 (35%), Positives = 64/126 (50%), Gaps = 2/126 (1%) Frame = +2 Query: 317 RFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIPMRQA 496 RF +FK N+ K +L+ + TA +GIT+F+DL+ EF +++LGLK LR + Sbjct: 68 RFGVFKSNLIKA-KLHQNRDPTAEHGITKFSDLTASEFRRQFLGLKKRLR-LPAHAQKAP 125 Query: 497 EIPKLKSPINSIGAIMTQSPDVKDQGMCGS-W-LGPLALLVMSRVNIS*RLDSCCHFSEQ 670 +P P + VKDQG CGS W L + + +L S SEQ Sbjct: 126 ILPTTNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVS---LSEQ 182 Query: 671 ELVDCD 688 +LVDCD Sbjct: 183 QLVDCD 188 >UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4; core eudicotyledons|Rep: Papain-like cysteine peptidase XBCP3 - Arabidopsis thaliana (Mouse-ear cress) Length = 437 Score = 56.8 bits (131), Expect = 6e-07 Identities = 48/156 (30%), Positives = 65/156 (41%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKK 439 D+ H Y + E ++R +IFK N + + N T + FADL++ EF Sbjct: 34 DWCQKHGKTYGSEE-ERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKAS 92 Query: 440 YLGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLGPLALLVMS 619 LGL S Q+ +K P + +VKDQG CG+ A M Sbjct: 93 RLGLSVSAPSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAME 152 Query: 620 RVNIS*RLDSCCHFSEQELVDCDKP*RRM*RGGLPD 727 +N D SEQEL+DCDK GGL D Sbjct: 153 GINQIVTGD-LISLSEQELIDCDKSYNAGCNGGLMD 187 >UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_21, whole genome shotgun sequence - Paramecium tetraurelia Length = 349 Score = 56.8 bits (131), Expect = 6e-07 Identities = 45/131 
(34%), Positives = 63/131 (48%), Gaps = 4/131 (3%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNTH-ERGTAVY--GITQFADLSYEEFGKKYLGLKPSLRD-T 472 E RF IFK N + I E E G + G+ FADLS EEF KYL + + R+ T Sbjct: 55 ENSHRFGIFKKNYQYIQEHQQRVEAGLETFELGLNDFADLSVEEFEAKYLKYRSTPREQT 114 Query: 473 NQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLGPLALLVMSRVNIS*RLDSC 652 NQ+ R + ++ + G + +VK+QG CGS A+ + + + Sbjct: 115 NQVYRRTGKQVPIEVDLRKDGVV----SEVKNQGSCGSCWAFSAVAALETALRQGGVKN- 169 Query: 653 CHFSEQELVDC 685 SEQELVDC Sbjct: 170 VELSEQELVDC 180 >UniRef50_Q650X9 Cluster: Putative cysteine proteinase; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative cysteine proteinase - Oryza sativa subsp. japonica (Rice) Length = 361 Score = 56.4 bits (130), Expect = 8e-07 Identities = 35/102 (34%), Positives = 50/102 (49%), Gaps = 2/102 (1%) Frame = +2 Query: 293 DDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLK--PSLR 466 D A + + RFE+FK N R IHE N E + G+ +F+D++ EEF KY G++ Sbjct: 50 DLAEDKKSRFEVFKANARHIHEFNKKEGMSYKLGLNKFSDMTVEEFAAKYTGVQVDAGAA 109 Query: 467 DTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWL 592 P Q + P+ +P VKDQG CG+ L Sbjct: 110 VVTSAPDEQPVLVGDAPPVWDWRDHGAVTP-VKDQGSCGTEL 150 >UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; Dictyostelium discoideum|Rep: Cysteine proteinase 2 precursor - Dictyostelium discoideum (Slime mold) Length = 376 Score = 56.4 bits (130), Expect = 8e-07 Identities = 42/144 (29%), Positives = 63/144 (43%), Gaps = 3/144 (2%) Frame = +2 Query: 272 THKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGL 451 T K N ++E R+ IFK N+ + N+ V G+ FAD++ EE+ K YLG Sbjct: 40 TLKFNRQYSSSEFSNRYSIFKSNMDYVDNWNSKGDSQTVLGLNNFADITNEEYRKTYLGT 99 Query: 452 KPSLRDTNQIPMRQA-EIPKLKSPINSIG-AIMTQSPDVKDQGMCGS-WLGPLALLVMSR 622 + + N R+ + L++ SI +KDQG CGS W + + Sbjct: 100 RVNAHSYNGYDGREVLNVEDLQTNPKSIDWRTKNAVTPIKDQGQCGSCW--SFSTTGSTE 157 Query: 623 VNIS*RLDSCCHFSEQELVDCDKP 694 + + SEQ LVDC P Sbjct: 158 GAHALKTKKLVSLSEQNLVDCSGP 181 >UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; 
Bromeliaceae|Rep: Fruit bromelain precursor - Ananas comosus (Pineapple) Length = 351 Score = 56.4 bits (130), Expect = 8e-07 Identities = 42/146 (28%), Positives = 68/146 (46%), Gaps = 4/146 (2%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKK 439 +++A + Y DD +MRR F+IFK NV+ I N+ + GI QF D++ EF + Sbjct: 39 EWMAEYGRVYKDDDEKMRR-FQIFKNNVKHIETFNSRNENSYTLGINQFTDMTKSEFVAQ 97 Query: 440 YLGLKPSLRDTNQ--IPMRQAEIPKLKSPIN--SIGAIMTQSPDVKDQGMCGSWLGPLAL 607 Y G+ L + + I + I+ GA+ +VK+Q CGS A+ Sbjct: 98 YTGVSLPLNIEREPVVSFDDVNISAVPQSIDWRDYGAV----NEVKNQNPCGSCWSFAAI 153 Query: 608 LVMSRVNIS*RLDSCCHFSEQELVDC 685 + + + SEQE++DC Sbjct: 154 ATVEGI-YKIKTGYLVSLSEQEVLDC 178 >UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: hypothetical protein - Strongylocentrotus purpuratus Length = 331 Score = 56.0 bits (129), Expect = 1e-06 Identities = 53/168 (31%), Positives = 78/168 (46%), Gaps = 8/168 (4%) Frame = +2 Query: 206 RKRNN*SVPPHTS*TFVYDFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELN---THER 376 RKR + + H +F F+ Y + E +R+ IFK ++ K LN TH R Sbjct: 22 RKRADGPLHYHLEESFFQIFIQKFNKTYTRGSQEYFKRYRIFKESLLKHEMLNAIATH-R 80 Query: 377 GTAVYGITQFADLSYEEFGKKYLGL----KPSLRDTNQIPMRQAEIPKLKSPINSIGAIM 544 A YGIT+F+DL+ EEF +YLG S+R R + L + SI + Sbjct: 81 DHATYGITKFSDLTSEEFQFQYLGTASIPDQSVRSVPGPVRRPLKTMPLVYDLRSIKPPV 140 Query: 545 TQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSCCHFSEQELVDC 685 +P VK+Q CG+ W +++ I+ + S QELVDC Sbjct: 141 V-TP-VKNQKSCGACW--AFSVVETMETQIALKTKRLTQLSAQELVDC 184 >UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Plasmodium|Rep: Cysteine protease falcipain-3 - Plasmodium falciparum Length = 492 Score = 56.0 bits (129), Expect = 1e-06 Identities = 53/156 (33%), Positives = 69/156 (44%), Gaps = 13/156 (8%) Frame = +2 Query: 257 YDFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGK 436 Y FL + Y + + EM++RF IF N RKI N G+ +F DLS EEF Sbjct: 172 
YIFLKENNKKY-ETSEEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKFGDLSPEEFRS 230 Query: 437 KYLGLK-----PSLRDTNQIPMRQAEIPKLKSPINS-IGAIMTQ------SPDVKDQGMC 580 KYL LK +L ++ K P ++ + I VKDQ +C Sbjct: 231 KYLNLKTHGPFKTLSPPVSYEANYEDVIKKYKPADAKLDRIAYDWRLHGGVTPVKDQALC 290 Query: 581 GS-WLGPLALLVMSRVNIS*RLDSCCHFSEQELVDC 685 GS W V S+ I R + FSEQELVDC Sbjct: 291 GSCWAFSSVGSVESQYAI--RKKALFLFSEQELVDC 324 >UniRef50_Q239L8 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 323 Score = 56.0 bits (129), Expect = 1e-06 Identities = 55/156 (35%), Positives = 69/156 (44%), Gaps = 1/156 (0%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKY 442 F + Y D E R R EIF N+ K+ E NT YGITQF D++ EEF + Y Sbjct: 51 FKTKYNKKYADPDFE-RYRIEIFTENL-KVVESNTKN-----YGITQFMDITREEFKQTY 103 Query: 443 LGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMS 619 L LK P + ++ + GA+ +P VKDQG CGS W V Sbjct: 104 LTLKMK-NGLKASPFAKFNDAGVEIDWTTKGAV---TP-VKDQGQCGSCWSFSTTGAVEG 158 Query: 620 RVNIS*RLDSCCHFSEQELVDCDKP*RRM*RGGLPD 727 + +S SEQ LVDC K GGL D Sbjct: 159 ALFLS--TKKLTSLSEQYLVDCSKDGNEGCNGGLMD 192 >UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia circumcincta|Rep: Secreted cathepsin F - Teladorsagia circumcincta Length = 364 Score = 55.6 bits (128), Expect = 1e-06 Identities = 28/61 (45%), Positives = 39/61 (63%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKY 442 F+ H Y +++ E +RF IFK N+ I +++GTA+YGI QFADLS EEF K + Sbjct: 67 FIERHDKVYRNES-EALKRFGIFKRNLEIIRSAQENDKGTAIYGINQFADLSPEEFKKTH 125 Query: 443 L 445 L Sbjct: 126 L 126 Score = 47.6 bits (108), Expect = 4e-04 Identities = 22/48 (45%), Positives = 32/48 (66%) Frame = +1 Query: 493 GRNPEVEIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKL 636 G +P+ +P+ FDWR++ AVT+ G+ AFSVTGN+EGQ+ L Sbjct: 146 GVDPKEPLPESFDWREHGAVTKVKTEGHC-AACWAFSVTGNIEGQWFL 192 >UniRef50_A7TZ36 
Cluster: Cysteine proteinase; n=1; Lepeophtheirus salmonis|Rep: Cysteine proteinase - Lepeophtheirus salmonis (salmon louse) Length = 372 Score = 55.6 bits (128), Expect = 1e-06 Identities = 40/129 (31%), Positives = 62/129 (48%), Gaps = 6/129 (4%) Frame = +2 Query: 317 RFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRD----TNQIP 484 + ++F N+R+I E N + + T GI +F+DL+ EEF KY+G P T Sbjct: 47 KLKVFVDNLREIEEHNANPKRTWDMGINEFSDLTDEEFESKYMGYSPMSSSAGLVTRTAA 106 Query: 485 MRQAEIPKLKSPIN-SIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSCCH 658 +Q I L ++ ++T DVK+QG CGS W+ + S V I + S Sbjct: 107 PKQGNIKDLPESVDWREKGVIT---DVKNQGSCGSCWVFSAVEQIESYVAIENNMTSPPL 163 Query: 659 FSEQELVDC 685 S Q++ C Sbjct: 164 LSTQQITSC 172 >UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa (Rice) Length = 339 Score = 54.8 bits (126), Expect = 2e-06 Identities = 48/136 (35%), Positives = 62/136 (45%), Gaps = 5/136 (3%) Frame = +2 Query: 296 DAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEF--GKKYLGLKPS-LR 466 DA E RRFEIFK NV I N + + QFADL+ EF K G PS +R Sbjct: 50 DATEKARRFEIFKANVAFIESFNAGNHKFWL-SVNQFADLTNYEFRATKTNKGFIPSTVR 108 Query: 467 DTNQIPMRQAEIPKLKSPIN--SIGAIMTQSPDVKDQGMCGSWLGPLALLVMSRVNIS*R 640 I L + ++ + GA+ +P +KDQG CG A+ M + + Sbjct: 109 VPTTFRYENVSIDTLPATVDWRTKGAV---TP-IKDQGQCGCCWAFSAVAAMEGI-VKLS 163 Query: 641 LDSCCHFSEQELVDCD 688 SEQELVDCD Sbjct: 164 TGKLISLSEQELVDCD 179 >UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: hypothetical protein, partial - Ornithorhynchus anatinus Length = 224 Score = 54.4 bits (125), Expect = 3e-06 Identities = 39/111 (35%), Positives = 57/111 (51%), Gaps = 2/111 (1%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKK 439 +F + +Y +D AE RRFEIF N+ + +L ++GTA +G+T F+DLS +EF Sbjct: 49 EFQIRYNKSY-EDQAEHARRFEIFVQNLARARKLQEEDQGTAEFGVTPFSDLSEDEFLSL 107 Query: 440 YLGLKPSLRDTNQIPMRQAEIP--KLKSPINSIGAIMTQSPDVKDQGMCGS 586 Y P R 
+ A IP L++ +P VK+QG CGS Sbjct: 108 Y---APRFRMPTSWVNQTARIPAGPLRAETCDWRKEGAVTP-VKNQGDCGS 154 >UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativa|Rep: Os01g0347600 protein - Oryza sativa subsp. japonica (Rice) Length = 343 Score = 54.4 bits (125), Expect = 3e-06 Identities = 43/129 (33%), Positives = 52/129 (40%), Gaps = 1/129 (0%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIP 484 E RF IF+ NV I + GI QFADL+ +EF Y G KP + P Sbjct: 60 EKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPP--HPKEAP 117 Query: 485 MRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSCCHF 661 + + +P VKDQG CGS W + I R Sbjct: 118 ---RPVDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKI--RTGQLTPL 172 Query: 662 SEQELVDCD 688 SEQELVDCD Sbjct: 173 SEQELVDCD 181 >UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; n=16; Chrysomelidae|Rep: Digestive cysteine protease intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 326 Score = 54.4 bits (125), Expect = 3e-06 Identities = 49/147 (33%), Positives = 70/147 (47%), Gaps = 6/147 (4%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNT-HERGTAVY--GITQFADLSYEEFG 433 F TH Y + E + RF IF+ N+ KI E N +++G Y G+T+FADL++EEF Sbjct: 26 FKQTHGKTY-KNLLEEKTRFGIFQRNLIKIKEHNARYDKGEETYLLGVTRFADLTHEEFK 84 Query: 434 ---KKYLGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLGPLA 604 K + KP L T + E+P GA++ +VKDQ CGS A Sbjct: 85 DILKGQIKNKPRLNATPTVFPEDLEVPD-SIDWTEKGAVL----EVKDQNPCGSCWAFSA 139 Query: 605 LLVMSRVNIS*RLDSCCHFSEQELVDC 685 + N + SEQ+L+DC Sbjct: 140 TGALEGQNAILN-NVKISLSEQQLLDC 165 >UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1; Brugia malayi|Rep: Cathepsin F-like cysteine proteinase - Brugia malayi (Filarial nematode worm) Length = 461 Score = 54.0 bits (124), Expect = 4e-06 Identities = 50/158 (31%), Positives = 71/158 (44%), Gaps = 4/158 (2%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKY 442 F+ K Y E RF I+ N+ +L E+GTA+YG 
T+F+D++ EEF K Sbjct: 162 FIKKFKREY-SSIEEQLDRFRIYLQNMNFAKKLQFEEKGTAIYGATKFSDMTAEEFQKIM 220 Query: 443 L-GLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQS--PDVKDQGMCGS-WLGPLALL 610 L + ++N I + + S T+ VKDQG CGS W + Sbjct: 221 LPSIWWDRVESNGITFNLNDFNLSIYNLPSKFDWRTEGVVTPVKDQGSCGSCWAFSVTGN 280 Query: 611 VMSRVNIS*RLDSCCHFSEQELVDCDKP*RRM*RGGLP 724 + S I + SEQEL+DCD + GGLP Sbjct: 281 IESLWAI--KTGKLISLSEQELIDCDVIDKGC-NGGLP 315 Score = 38.7 bits (86), Expect = 0.17 Identities = 20/45 (44%), Positives = 25/45 (55%) Frame = +1 Query: 514 IPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTGQ 648 +P KFDWR VT G AFSVTGN+E + +KTG+ Sbjct: 248 LPSKFDWRTEGVVTPVKDQGSCGS-CWAFSVTGNIESLWAIKTGK 291 >UniRef50_Q22LI1 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 987 Score = 54.0 bits (124), Expect = 4e-06 Identities = 43/147 (29%), Positives = 67/147 (45%), Gaps = 10/147 (6%) Frame = +2 Query: 278 KPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKP 457 K N + D +++ R IF N +KI E N + T G+ ++A ++ +EF + +L P Sbjct: 37 KHNKVFDPEQLKYRLSIFAENYKKIKEHNYNSSNTFQLGLNEYAHMTSQEFAEVFL--TP 94 Query: 458 SLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSP----------DVKDQGMCGSWLGPLAL 607 S+ + Q + P+ P NS +T +P VK QG CGS A Sbjct: 95 SISKSQQKQPKPKPQPQ-PHPNNSTNTTVTITPIDWRNKGAVTSVKRQGKCGSCWSFSAA 153 Query: 608 LVMSRVNIS*RLDSCCHFSEQELVDCD 688 +M + + SEQ+LVDCD Sbjct: 154 GLMEAFQYF-KTGNLIDLSEQQLVDCD 179 >UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium tetraurelia|Rep: Cathepsin L1 precursor - Paramecium tetraurelia Length = 314 Score = 54.0 bits (124), Expect = 4e-06 Identities = 40/137 (29%), Positives = 64/137 (46%), Gaps = 1/137 (0%) Frame = +2 Query: 287 YIDDAAEMRRRFEIFKGNVRKIHEL-NTHERGTAVYGITQFADLSYEEFGKKYLGLKPSL 463 Y + EM R +++F N+ I + E T + QFAD+S +EF + YL LK + Sbjct: 37 YTNQRDEMYR-YKVFTDNLNYIRAFYESPEEATFTLELNQFADMSQQEFAQTYLSLK--V 93 Query: 464 
RDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLGPLALLVMSRVNIS*RL 643 T ++ + + ++ + P VK+QG CGS A+ + +N L Sbjct: 94 PRTAKLNAANSNFQYKGAEVDWTDNKKVKYPAVKNQGSCGSCWAFSAVGAL-EINTDIEL 152 Query: 644 DSCCHFSEQELVDCDKP 694 + SEQ+LVDC P Sbjct: 153 NRKYELSEQDLVDCSGP 169 >UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza sativa|Rep: Putative cysteine protease - Oryza sativa subsp. japonica (Rice) Length = 357 Score = 53.6 bits (123), Expect = 6e-06 Identities = 45/131 (34%), Positives = 55/131 (41%), Gaps = 4/131 (3%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIP 484 E RF +F+ NVR I + I QFADL+ EF Y G+K T+ P Sbjct: 60 EKEHRFAVFRDNVRFIRSYRPEATYDSAVRINQFADLTNGEFVATYTGVKQPPPATHPHP 119 Query: 485 MRQAEIPKLKSPINSIGAI----MTQSPDVKDQGMCGSWLGPLALLVMSRVNIS*RLDSC 652 E P+ PI I VKDQG CGS A+ M + + R Sbjct: 120 -HPEEAPRPVDPIWMPCCIDWRFKGAVTGVKDQGACGSSWAFAAVAAMEGL-MKIRTGQL 177 Query: 653 CHFSEQELVDC 685 SEQELVDC Sbjct: 178 TPLSEQELVDC 188 >UniRef50_Q231X3 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 323 Score = 53.6 bits (123), Expect = 6e-06 Identities = 42/124 (33%), Positives = 63/124 (50%), Gaps = 1/124 (0%) Frame = +2 Query: 317 RFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIPMRQA 496 R E+F N+ + T GT YGIT+F DL+ +EF +L LK + + + Sbjct: 59 RLEVFAENLEVVKNDQT---GT--YGITKFLDLTDDEFAGNFLNLKAQYPEDSIAEDIEV 113 Query: 497 EIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSCCHFSEQE 673 + PK+ IN + A + +VK QG CGS W V S + I+ ++D SEQ+ Sbjct: 114 D-PKIN--INWVEA--GKVSNVKSQGNCGSCWAFSATASVESALIIAGKVDKSISLSEQQ 168 Query: 674 LVDC 685 L+DC Sbjct: 169 LIDC 172 >UniRef50_Q22RR8 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 53.6 bits (123), Expect = 6e-06 
Identities = 50/146 (34%), Positives = 67/146 (45%), Gaps = 4/146 (2%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKK 439 +FL H Y E RF +F+ N++KI G + YGIT+F DL+ EEF ++ Sbjct: 45 EFLKKHSITY-KTIEEKLHRFAVFRDNLKKIE-------GHSNYGITKFMDLTSEEFQQR 96 Query: 440 YLGLKPSL--RDTNQIPMRQAEI-PKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLAL 607 YL LK + R + + A++ KL I VKDQ CGS W Sbjct: 97 YLRLKTNTIKRQNFKSNPKNAQLNMKLGDDIIIDWTKKGAVTPVKDQEQCGSCWAFSATG 156 Query: 608 LVMSRVNIS*RLDSCCHFSEQELVDC 685 + S IS + SEQELVDC Sbjct: 157 ALESATFIS--TGTLPSLSEQELVDC 180 >UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, whole genome shotgun sequence; n=7; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_22, whole genome shotgun sequence - Paramecium tetraurelia Length = 350 Score = 53.6 bits (123), Expect = 6e-06 Identities = 46/145 (31%), Positives = 71/145 (48%), Gaps = 3/145 (2%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKK 439 ++ AT Y +E+ R ++++ N+ I N + G ++G TQF DL+ EEF Sbjct: 64 NYQATFNKQY--SGSELLYRLQVYEANLADIKARN-QKLGREIFGETQFTDLTDEEFAAT 120 Query: 440 YLGLKPSLRDTNQIPMRQAEIPKLKSPIN--SIGAIMTQSPDVKDQGMCGS-WLGPLALL 610 YL LK + D ++P Q E +PI+ + GA+ VKDQG CGS W + Sbjct: 121 YLTLKVN-PDDLEVPKAQFENVN-ATPIDWRTRGAV----NKVKDQGQCGSCWAFSTTGV 174 Query: 611 VMSRVNIS*RLDSCCHFSEQELVDC 685 + + + SEQ+LVDC Sbjct: 175 LEGFYKV--QTGELPDLSEQQLVDC 197 >UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin L-like proteinase" precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 315 Score = 52.8 bits (121), Expect = 1e-05 Identities = 47/157 (29%), Positives = 72/157 (45%), Gaps = 4/157 (2%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNT-HERGTAVY--GITQFADLSYEEFG 433 F ATH +Y + E + RF +F+ N++KI E N +E G Y + +FAD S EF Sbjct: 27 FKATHNKSY--NVIEDKLRFAVFQDNLKKIEEHNAKYESGEETYYLAVNKFADWSSAEF- 83 Query: 434 
KKYLGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALL 610 + L + + + + P +++ + + + VKDQG CGS W Sbjct: 84 QAMLARQMANKPKQSFIAKHVADPNVQA-VEEVDWRDSAVLGVKDQGQCGSCWAFSTTGS 142 Query: 611 VMSRVNIS*RLDSCCHFSEQELVDCDKP*RRM*RGGL 721 + ++ I + SEQELVDCD GGL Sbjct: 143 LEGQLAI--HKNQRVPLSEQELVDCDTSRNAGCNGGL 177 >UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain - Tetrahymena pyriformis Length = 330 Score = 52.8 bits (121), Expect = 1e-05 Identities = 38/124 (30%), Positives = 55/124 (44%), Gaps = 1/124 (0%) Frame = +2 Query: 317 RFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIPMRQA 496 R +F N++ I N + T V + F DL+ EEF +YL + +P+ Sbjct: 56 RLSVFLENLKSIEANNANPLSTHVEEVNSFTDLTEEEFAARYLMKDLPQQMNKDLPI--L 113 Query: 497 EIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSCCHFSEQE 673 E+ L +P P VK+Q CGS W A ++ NI + FSEQ+ Sbjct: 114 EMETLAAPQVIDWTAKNVLPPVKNQQQCGSCWAFSTAGMLEGVYNIHESPQTPISFSEQQ 173 Query: 674 LVDC 685 LVDC Sbjct: 174 LVDC 177 >UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis thaliana|Rep: Cysteine proteinase - Arabidopsis thaliana (Mouse-ear cress) Length = 348 Score = 52.4 bits (120), Expect = 1e-05 Identities = 42/150 (28%), Positives = 65/150 (43%), Gaps = 8/150 (5%) Frame = +2 Query: 296 DAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGL--KPSLRD 469 D E R RF IFK N+ + N + + T I +F+DL+ EEF + GL ++ Sbjct: 48 DETEKRNRFNIFKKNLEFVQNFNMNNKITYKVDINEFSDLTDEEFRATHTGLVVPEAITR 107 Query: 470 TNQIPMRQAEIPKLKSPINSIGAIMTQSPD-----VKDQGMCGS-WLGPLALLVMSRVNI 631 + + + +P ++ G M + VK QG CG W V I Sbjct: 108 ISTLSSGKNTVPFRYGNVSDNGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKI 167 Query: 632 S*RLDSCCHFSEQELVDCDKP*RRM*RGGL 721 + SEQ+L+DCD+ + RGG+ Sbjct: 168 T--KGELVSLSEQQLLDCDRDYNQGCRGGI 195 >UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber officinale (Ginger) Length = 475 Score = 52.4 bits (120), Expect = 1e-05 Identities = 43/132 (32%), Positives = 65/132 (49%), Gaps = 9/132 (6%) Frame = +2 Query: 317 
RFEIFKGNVRKIHELNTH-ERGTAVY--GITQFADLSYEEFGKKY------LGLKPSLRD 469 R E+FK N+R + E N +RG Y G+ +FADL+ EE+ ++ LG S Sbjct: 72 RLEVFKENLRFVDEHNAAADRGEHAYRLGMNRFADLTNEEYRARFLRDLSRLGRSTSGEI 131 Query: 470 TNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLGPLALLVMSRVNIS*RLDS 649 +NQ +R+ ++ GA++ VK+QG CGS A+ + +N D Sbjct: 132 SNQYRLREGDVLPDSIDWREKGAVVA----VKNQGRCGSCWAFAAIAAVEGINQIVTGD- 186 Query: 650 CCHFSEQELVDC 685 SEQ+LVDC Sbjct: 187 LISLSEQQLVDC 198 >UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudicotyledons|Rep: Chymopapain precursor - Carica papaya (Papaya) Length = 352 Score = 52.4 bits (120), Expect = 1e-05 Identities = 44/129 (34%), Positives = 60/129 (46%), Gaps = 4/129 (3%) Frame = +2 Query: 317 RFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIPMRQA 496 RFEIF+ N+ I E N + + G+ FADLS +EF KKY+G D + Sbjct: 68 RFEIFRDNLMYIDETNK-KNNSYWLGLNGFADLSNDEFKKKYVGF--VAEDFTGLEHFDN 124 Query: 497 EIPKLKSPINSIGAIMTQS----PDVKDQGMCGSWLGPLALLVMSRVNIS*RLDSCCHFS 664 E K N +I ++ VK+QG CGS + + +N + S Sbjct: 125 EDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGIN-KIVTGNLLELS 183 Query: 665 EQELVDCDK 691 EQELVDCDK Sbjct: 184 EQELVDCDK 192 >UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovirus|Rep: Viral cathepsin - Spodoptera litura multicapsid nucleopolyhedrovirus (SpltMNPV) Length = 337 Score = 52.4 bits (120), Expect = 1e-05 Identities = 45/153 (29%), Positives = 72/153 (47%), Gaps = 9/153 (5%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKK 439 +F+ H Y + F FK N+ ++ +N + AVYGI +F+D+ F + Sbjct: 35 NFIKQHNKEYTTPD-QRDAAFVNFKRNLADMNAMN-NVSNQAVYGINKFSDIDKITFVNE 92 Query: 440 YLGLKPSL---RDTNQIPMRQAEI-----PKLKSPINSIGAIMTQSPDVKDQGMCGS-WL 592 + GL +L D+N P R E P ++P + + + VK+QG+CGS W Sbjct: 93 HAGLVSNLINSTDSNFDPYRLCEYVTVAGPSARTPESFDWRKLNKVTKVKEQGVCGSCWA 152 Query: 593 GPLALLVMSRVNIS*RLDSCCHFSEQELVDCDK 691 + S+ I DS SEQ+L+DCD+ Sbjct: 153 FAAIGNIESQYAI--MHDSLIDLSEQQLLDCDR 183 >UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; 
Oryza sativa (japonica cultivar-group)|Rep: Putative uncharacterized protein - Oryza sativa subsp. japonica (Rice) Length = 326 Score = 52.0 bits (119), Expect = 2e-05 Identities = 24/54 (44%), Positives = 32/54 (59%) Frame = +2 Query: 296 DAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKP 457 D A+ RFE+FK N R IH+ N + + G+ +FADL+ EEF KY G P Sbjct: 42 DLADKGSRFEVFKKNARYIHDFNRKKGMSYKLGLNKFADLTLEEFTAKYTGANP 95 >UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae|Rep: Cysteine proteinase - Hypera postica (alfalfa weevil) Length = 324 Score = 52.0 bits (119), Expect = 2e-05 Identities = 51/149 (34%), Positives = 69/149 (46%), Gaps = 8/149 (5%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNT-HERGTAVY--GITQFADLSYEEFG 433 F H Y++ A E +R F IF NVR I N +E+G Y GI +F D+S EEF Sbjct: 29 FKLEHGKTYLNQAEESKR-FNIFTDNVRAIEAHNALYEQGKVSYKKGINKFTDMSQEEF- 86 Query: 434 KKYLGL----KPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGP 598 K L L KP+L T+ + EIP G + VKDQG CGS W Sbjct: 87 KTMLTLSASRKPTLETTSYV-KTGVEIPS-SVDWRKEGRV----TGVKDQGDCGSCW--A 138 Query: 599 LALLVMSRVNIS*RLDSCCHFSEQELVDC 685 ++ + + + SEQ+L+DC Sbjct: 139 FSITGSTEGAYARKSGKLVSLSEQQLIDC 167 Score = 35.5 bits (78), Expect = 1.6 Identities = 20/47 (42%), Positives = 24/47 (51%) Frame = +1 Query: 508 VEIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTGQ 648 VEIP DWR VT G AFS+TG+ EG Y K+G+ Sbjct: 110 VEIPSSVDWRKEGRVTGVKDQGDCGS-CWAFSITGSTEGAYARKSGK 155 >UniRef50_Q23FS6 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 330 Score = 52.0 bits (119), Expect = 2e-05 Identities = 37/103 (35%), Positives = 51/103 (49%), Gaps = 1/103 (0%) Frame = +2 Query: 380 TAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPD 559 T +G+TQF DL+ EEF YL L+ R+ N + PK + +N + + Sbjct: 76 TGTFGVTQFFDLTEEEFAATYLTLRVQ-RNVN-ATVSSPSTPKGQYDVNWVTRGKVSA-- 131 Query: 560 
VKDQGMCGS-WLGPLALLVMSRVNIS*RLDSCCHFSEQELVDC 685 VKDQG CGS W V S + I+ + SEQ+LVDC Sbjct: 132 VKDQGQCGSCWAFSTTGSVESALIIAGYANQTIDLSEQQLVDC 174 >UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_101, whole genome shotgun sequence - Paramecium tetraurelia Length = 306 Score = 52.0 bits (119), Expect = 2e-05 Identities = 44/124 (35%), Positives = 63/124 (50%) Frame = +2 Query: 317 RFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIPMRQA 496 RFEIFK N E+N+ + + GI QFA L+ EEF + YLG D++ I + ++ Sbjct: 50 RFEIFKQNYNYYQEVNSRQSSYTL-GINQFATLTDEEFEQIYLG----RADSSPIEIDES 104 Query: 497 EIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLGPLALLVMSRVNIS*RLDSCCHFSEQEL 676 I + P S+ +P VK+QG CGS A+ I + + +SEQ L Sbjct: 105 -IDSINLP-ESVDWSSKMNP-VKNQGTCGSGWSFSAVGAFEAFFIFVK-GTHFQYSEQNL 160 Query: 677 VDCD 688 VDCD Sbjct: 161 VDCD 164 >UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2; Entamoeba|Rep: Cysteine proteinase ACP1 precursor - Entamoeba histolytica Length = 308 Score = 52.0 bits (119), Expect = 2e-05 Identities = 46/142 (32%), Positives = 69/142 (48%), Gaps = 2/142 (1%) Frame = +2 Query: 269 ATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLG 448 ATH + + AE RF +F N +K E N + + FAD+++EEF + +LG Sbjct: 23 ATHNKVFAN-RAEYLYRFAVFLDN-KKFVEANANTE------LNVFADMTHEEFIQTHLG 74 Query: 449 LKPSLRDTNQIPMRQAEIPK-LKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSR 622 + T ++P + + +K+ S+ +P KDQG CGS W ++ R Sbjct: 75 M------TYEVPETTSNVKAAVKAAPESVDWRSIMNP-AKDQGQCGSCWTFCTTAVLEGR 127 Query: 623 VNIS*RLDSCCHFSEQELVDCD 688 VN L FSEQ+LVDCD Sbjct: 128 VNKD--LGKLYSFSEQQLVDCD 147 >UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative cysteine proteinase - Oryza sativa subsp. 
japonica (Rice) Length = 385 Score = 51.6 bits (118), Expect = 2e-05 Identities = 22/51 (43%), Positives = 32/51 (62%) Frame = +2 Query: 296 DAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLG 448 D A++ RFE FK N R ++E N E T G+ QF+D+++EEF K+ G Sbjct: 60 DLADVESRFEAFKANARHVNEFNKKEGMTYRLGLNQFSDMTFEEFAGKFTG 110 >UniRef50_Q650W8 Cluster: Putative cysteine proteinase; n=2; Oryza sativa (japonica cultivar-group)|Rep: Putative cysteine proteinase - Oryza sativa subsp. japonica (Rice) Length = 319 Score = 51.6 bits (118), Expect = 2e-05 Identities = 25/53 (47%), Positives = 32/53 (60%) Frame = +2 Query: 296 DAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLK 454 D AE RFE+FK N R IHE N + + G+ +FAD++ EEF KY G K Sbjct: 49 DLAEKVSRFEVFKKNARYIHEFNKRKGMSYWLGLNKFADMTSEEFMAKYTGAK 101 >UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 326 Score = 51.6 bits (118), Expect = 2e-05 Identities = 47/143 (32%), Positives = 69/143 (48%), Gaps = 7/143 (4%) Frame = +2 Query: 284 NYIDDAAEMRRRFEIFKGNVRKIHELN---THERGTAVYGITQFADLSYEEFGKKYLGLK 454 NYI++ ++RF IF+G++RKI N H T G+T+FADL+ +EF LG+ Sbjct: 36 NYIEE----QKRFTIFQGSLRKIENHNDKYDHGLSTFKLGVTKFADLTEKEF-SDMLGIS 90 Query: 455 PSLRDTN-QIPMRQAEIPKLKSPIN--SIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSR 622 S + + ++ + L S + GA+ +VKDQG CGS W V Sbjct: 91 RSTKSSRPRVIHSLTPVKDLPSKFDWREKGAV----TEVKDQGSCGSCWSFSTTGTVEGA 146 Query: 623 VNIS*RLDSCCHFSEQELVDCDK 691 + + SEQ LVDC K Sbjct: 147 YFL--KTGKLVSLSEQNLVDCAK 167 Score = 44.4 bits (100), Expect = 0.003 Identities = 23/49 (46%), Positives = 28/49 (57%) Frame = +1 Query: 502 PEVEIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTGQ 648 P ++P KFDWR+ AVT G +FS TG VEG Y LKTG+ Sbjct: 106 PVKDLPSKFDWREKGAVTEVKDQGSCGS-CWSFSTTGTVEGAYFLKTGK 153 >UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis|Rep: Cathepsin L - Culicoides sonorensis Length = 331 Score = 51.6 bits (118), Expect = 
2e-05 Identities = 39/127 (30%), Positives = 58/127 (45%), Gaps = 1/127 (0%) Frame = +2 Query: 311 RRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIPMR 490 R ++ + N R + + T+E+G + QF+DL+YEEF K YLG K S + Sbjct: 53 RNLADVMEHNARYLSGMETYEKG-----VNQFSDLTYEEFAKLYLGEKISFNELMTNADG 107 Query: 491 QAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSCCHFSE 667 E P + A T+ VK+Q CGS W A + + + +E Sbjct: 108 WIEKPLRRQLAPESYAWDTKDVPVKNQAQCGSCW--AFASVASVEMRYKRFHNKSYTLAE 165 Query: 668 QELVDCD 688 QELVDC+ Sbjct: 166 QELVDCE 172 >UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 343 Score = 51.2 bits (117), Expect = 3e-05 Identities = 40/144 (27%), Positives = 65/144 (45%), Gaps = 2/144 (1%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKK 439 +FL + Y ++ E+ +RF IF N+ + N + G Y + F+DL+ EE+ K Sbjct: 53 NFLVKYLREYPNEY-EIVKRFTIFSRNLDLVERYNKEDAGKVTYELNDFSDLTEEEWKKY 111 Query: 440 YLGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQS-PDVKDQGMCGS-WLGPLALLV 613 + KP + + P + L + ++ T +K QG CGS W A + Sbjct: 112 LMTPKPDHSEKSLKPKTLIDKKNLPNSVDWRNVNGTNHVTGIKYQGPCGSCWAFATAAAI 171 Query: 614 MSRVNIS*RLDSCCHFSEQELVDC 685 S V+IS S Q+L+DC Sbjct: 172 ESAVSIS--GGGLQSLSSQQLLDC 193 >UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: Cysteine proteinase - Paragonimus westermani Length = 272 Score = 51.2 bits (117), Expect = 3e-05 Identities = 36/116 (31%), Positives = 57/116 (49%), Gaps = 1/116 (0%) Frame = +2 Query: 347 KIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIPMRQAEIPKLKSPIN 526 + +L ++GTA YG+TQF+DL+ EEF KYL + ++ + + Sbjct: 2 RAQKLQLKDQGTARYGVTQFSDLTPEEFAAKYLSAPVNNDQVKRVRPTGLKAAPERIDWR 61 Query: 527 SIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSCCHFSEQELVDCDK 691 + GA+ V++QG CGS W A V + I + S+Q+LVDCD+ Sbjct: 62 AKGAVTA----VENQGSCGSCWAFSTAGNVEGQWFI--KTGQLVSLSKQQLVDCDR 111 >UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Oryza sativa|Rep: Cysteine protease 
1 precursor - Oryza sativa subsp. japonica (Rice) Length = 490 Score = 51.2 bits (117), Expect = 3e-05 Identities = 42/132 (31%), Positives = 60/132 (45%), Gaps = 5/132 (3%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKI--HELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTN- 475 E RRF +F N++ + H ERG G+ +FADL+ EF YLG P+ R Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLGTTPAGRGRRV 143 Query: 476 QIPMRQAEIPKLKSPIN--SIGAIMTQSPDVKDQGMCGSWLGPLALLVMSRVNIS*RLDS 649 R + L ++ GA++ VK+QG CGS A+ + +N Sbjct: 144 GEAYRHDGVEALPDSVDWRDKGAVVA---PVKNQGQCGSCWAFSAVAAVEGIN-KIVTGE 199 Query: 650 CCHFSEQELVDC 685 SEQELV+C Sbjct: 200 LVSLSEQELVEC 211 >UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin O precursor; n=1; Tribolium castaneum|Rep: PREDICTED: similar to Cathepsin O precursor - Tribolium castaneum Length = 326 Score = 50.8 bits (116), Expect = 4e-05 Identities = 38/143 (26%), Positives = 67/143 (46%), Gaps = 1/143 (0%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHER-GTAVYGITQFADLSYEEFGK 436 ++L Y DD + + R FK +++ I LN+ +R G+A+YG+T+F+DL EEF + Sbjct: 37 EYLKRFNKTY-DDPSVYQNRLHAFKQSLQTIETLNSKKRNGSALYGLTKFSDLLPEEFFQ 95 Query: 437 KYLGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLGPLALLVM 616 YL S + + P R + P + +QG CG+ + + Sbjct: 96 TYLQSNLSQKTHSNEPKRHHH-KRATVPNKVDWREKNAVTRIYNQGSCGACWAYSVIETV 154 Query: 617 SRVNIS*RLDSCCHFSEQELVDC 685 +N + + + S QE++DC Sbjct: 155 ESMN-AIKTNKSEELSVQEIIDC 176 >UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|Rep: Thiol protease - Triticum aestivum (Wheat) Length = 374 Score = 50.8 bits (116), Expect = 4e-05 Identities = 46/154 (29%), Positives = 71/154 (46%), Gaps = 10/154 (6%) Frame = +2 Query: 257 YDFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGK 436 + ++A H +Y E RRF+IF+ NV I N R + G+ QFADL++EEF Sbjct: 51 HGWMAKHGKSYAG-VEEKLRRFDIFRRNVEFIEAANRDGRLSYTLGVNQFADLTHEEFLA 109 Query: 437 KYLGLKPSLRDTNQIPMRQAEI-------PKLKSPINSIGAI-MTQSPDVKDQG-MCGS- 586 + + + I R + P + SI + ++ VK+QG +CG+ Sbjct: 110 
THTSRRVVPSEEMVITTRAGVVVEGANCQPAPNAVPRSINWVNQSKVTPVKNQGKVCGAC 169 Query: 587 WLGPLALLVMSRVNIS*RLDSCCHFSEQELVDCD 688 W + S I+ R + SEQEL+DCD Sbjct: 170 WAFSAVATIESAYAIAKRGEPPV-LSEQELIDCD 202 >UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep: Cysteine proteinase - Cryptobia salmositica Length = 443 Score = 50.8 bits (116), Expect = 4e-05 Identities = 51/161 (31%), Positives = 68/161 (42%), Gaps = 5/161 (3%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKK 439 +F A H NY E R+RFEIF GN++K LN + A +G +FAD++ EEF + Sbjct: 27 NFKAAHARNYASPDEE-RKRFEIFAGNMKKAAVLN-RKNPMATFGPNEFADMTSEEFQTR 84 Query: 440 YLGLK----PSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLA 604 + + R AE K + VK+QG CGS W Sbjct: 85 HNAARHYAAAKARPPKNTKTFTAEEIKAAVGQQIDWRLKGAVTPVKNQGACGSCWSFSTT 144 Query: 605 LLVMSRVNIS*RLDSCCHFSEQELVDCDKP*RRM*RGGLPD 727 + + I+ SEQELV CD P GGL D Sbjct: 145 GNIEGQHAIA--TGQLVAVSEQELVSCD-PIDDGCNGGLMD 182 Score = 34.3 bits (75), Expect = 3.6 Identities = 18/45 (40%), Positives = 24/45 (53%) Frame = +1 Query: 514 IPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTGQ 648 + + DWR AVT G +FS TGN+EGQ+ + TGQ Sbjct: 114 VGQQIDWRLKGAVTPVKNQGACGS-CWSFSTTGNIEGQHAIATGQ 157 >UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathepsin o - Aedes aegypti (Yellowfever mosquito) Length = 375 Score = 50.8 bits (116), Expect = 4e-05 Identities = 26/63 (41%), Positives = 39/63 (61%), Gaps = 2/63 (3%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTH--ERGTAVYGITQFADLSYEEFGK 436 F+ + Y + E RF+IF+ ++ KI LN H E TA+YGITQ+ADL+ +EF + Sbjct: 40 FIKLYDKPYRYNVREYDHRFQIFRVSLNKIASLNAHRVENDTAIYGITQYADLTDQEFLR 99 Query: 437 KYL 445 +L Sbjct: 100 LHL 102 >UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_23, whole genome shotgun sequence - Paramecium tetraurelia Length = 321 Score = 50.8 bits (116), Expect = 4e-05 Identities = 40/130 (30%), Positives = 
62/130 (47%), Gaps = 4/130 (3%) Frame = +2 Query: 317 RFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIPMRQA 496 RF I++ N+ KI + N+ + + I +F DL+ +EF YL L Q+P R Sbjct: 58 RFSIYQQNIMKIEDFNS-QNNSYKQKINKFGDLTDQEFLTIYLNL--------QMPARVK 108 Query: 497 EIPKLKSPI---NSIGAIMT-QSPDVKDQGMCGSWLGPLALLVMSRVNIS*RLDSCCHFS 664 I K + P + + + P +KDQG CGS A+ + +N + + S Sbjct: 109 NIQKNEEPFLVQEEVDWVQKGKVPAIKDQGDCGSCWAFSAVGAL-EINTKIQFNEIVDLS 167 Query: 665 EQELVDCDKP 694 EQ+LVDC P Sbjct: 168 EQDLVDCAGP 177 >UniRef50_Q23894 Cluster: Cysteine proteinase 3; n=2; Dictyostelium discoideum|Rep: Cysteine proteinase 3 - Dictyostelium discoideum (Slime mold) Length = 151 Score = 50.8 bits (116), Expect = 4e-05 Identities = 35/98 (35%), Positives = 46/98 (46%), Gaps = 4/98 (4%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIP 484 E R+E FK N+ +H N+ T V G+ Q ADLS EE+ YLG + ++ N Sbjct: 4 EFMPRYEEFKKNMDYVHNWNSKGSKT-VLGLNQHADLSNEEYRLNYLGTRAHIK-LNGYH 61 Query: 485 MRQAEI----PKLKSPINSIGAIMTQSPDVKDQGMCGS 586 R + P K P+N VKDQG CGS Sbjct: 62 KRNLGLRLNRPHFKQPLNVDWREKDAVTPVKDQGQCGS 99 >UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Rep: Cysteine protease - Solanum lycopersicum (Tomato) (Lycopersicon esculentum) Length = 345 Score = 50.4 bits (115), Expect = 5e-05 Identities = 41/137 (29%), Positives = 57/137 (41%), Gaps = 7/137 (5%) Frame = +2 Query: 296 DAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTN 475 D E RF IFK N++ I +N + G+ +FAD++ +EF K+ GL + Sbjct: 52 DEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS 111 Query: 476 QIPMRQAEIPKLKS------PINSIGAIMTQSPDVKDQGMCG-SWLGPLALLVMSRVNIS 634 PM E K+ P N VK QG CG W + I+ Sbjct: 112 PSPMSSTEFKKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 171 Query: 635 *RLDSCCHFSEQELVDC 685 + FSEQEL+DC Sbjct: 172 --TGNLMEFSEQELLDC 186 >UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Rep: Cathepsin L - Stylonychia lemnae Length = 340 Score = 50.4 bits (115), Expect = 5e-05 Identities = 
45/133 (33%), Positives = 64/133 (48%), Gaps = 4/133 (3%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNTHERGTA-VYGITQFADLSYEEFGKKYLGLKPSLRDTNQI 481 E R + +K N+ I+ N+ GT+ G AD +++E+ KK LG KP + ++ Sbjct: 58 EFEMRLQQYKSNIAFINNHNSQNDGTSFTLGPNHLADYTHDEY-KKMLGYKPRNKTGKEV 116 Query: 482 PMRQAEIPKLKSPINSIGAIMTQSPD-VKDQGMCGS-WLGPLALLVMSRVNI-S*RLDSC 652 P LK SI + + VKDQG CGS W + SR I + +L S Sbjct: 117 ----YSTPNLKDIPESIDWREKGAVNAVKDQGQCGSCWAFSTIASLESRYFIETGKLQS- 171 Query: 653 CHFSEQELVDCDK 691 SEQ+LVDC K Sbjct: 172 --LSEQQLVDCSK 182 >UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; n=35; Fasciola|Rep: Cathepsin L-like proteinase precursor - Fasciola hepatica (Liver fluke) Length = 326 Score = 50.4 bits (115), Expect = 5e-05 Identities = 45/141 (31%), Positives = 67/141 (47%), Gaps = 8/141 (5%) Frame = +2 Query: 296 DAAEMRRRFEIFKGNVRKIHELNT-HERGTAVY--GITQFADLSYEEFGKKYLGLKPSLR 466 + A+ + R I++ NV+ I E N H+ G Y G+ QF D+++EEF KYL Sbjct: 33 NGADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAKYLTEMSRAS 92 Query: 467 D--TNQIP--MRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNI 631 D ++ +P +P K G + +VKDQG CGS W + + Sbjct: 93 DILSHGVPYEANNRAVPD-KIDWRESGYV----TEVKDQGNCGSCWAFSTTGTMEGQYMK 147 Query: 632 S*RLDSCCHFSEQELVDCDKP 694 + R + FSEQ+LVDC P Sbjct: 148 NER--TSISFSEQQLVDCSGP 166 >UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin W, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to Cathepsin W, partial - Ornithorhynchus anatinus Length = 229 Score = 50.0 bits (114), Expect = 7e-05 Identities = 35/94 (37%), Positives = 48/94 (51%), Gaps = 3/94 (3%) Frame = +2 Query: 314 RRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKY---LGLKPSLRDTNQIP 484 RRF+IF N+ + +L + GTA YG+T F+DLS EEF Y G+ PS Sbjct: 3 RRFKIFVQNLARARKLQEEDLGTAEYGVTPFSDLSEEEFLSLYAPRFGM-PSGWANQMAS 61 Query: 485 MRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS 586 + + + K GAI + VK+QG CGS Sbjct: 62 IPEGPLRKETCDWRKRGAITS----VKNQGSCGS 91 >UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: Cathepsin L 
- Kudoa thyrsites Length = 300 Score = 49.6 bits (113), Expect = 9e-05 Identities = 48/152 (31%), Positives = 66/152 (43%), Gaps = 7/152 (4%) Frame = +2 Query: 293 DDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSL--- 463 D E RRR FK N + IH N H + S+EE+ +L LKP L Sbjct: 23 DSIEEERRRLCNFKENHQFIHNFNLHNTHYHYCRHNHLSHWSHEEY-MAWLTLKPKLPVV 81 Query: 464 -RDTNQIPMRQAEIPKLKS--PINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNI 631 T+ I ++ +KS P + + + VK+QG CGS W A + S I Sbjct: 82 STPTHGITPKETATKDIKSTLPSSVDWKALGKVTSVKNQGHCGSCWSFSAAGAIESAYAI 141 Query: 632 S*RLDSCCHFSEQELVDCDKP*RRM*RGGLPD 727 + +FSEQ+LVDC GGLP+ Sbjct: 142 --KTGELVNFSEQQLVDCSTE-NHGCNGGLPE 170 >UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18; Magnoliophyta|Rep: Thiol protease aleurain precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 358 Score = 49.6 bits (113), Expect = 9e-05 Identities = 44/131 (33%), Positives = 67/131 (51%), Gaps = 4/131 (3%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNTHERGTAV-YGITQFADLSYEEFGKKYLGLKPSLRDT--N 475 EM+ RF IFK N+ I +T+++G + G+ QFADL+++EF + LG + T Sbjct: 75 EMKLRFSIFKENLDLIR--STNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNCSATLKG 132 Query: 476 QIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSC 652 + +A +P+ K G + SP VKDQG CGS W + + + + Sbjct: 133 SHKVTEAALPETKD-WREDGIV---SP-VKDQGGCGSCWTFSTTGALEAAYHQA--FGKG 185 Query: 653 CHFSEQELVDC 685 SEQ+LVDC Sbjct: 186 ISLSEQQLVDC 196 >UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays (Maize) Length = 493 Score = 48.8 bits (111), Expect = 2e-04 Identities = 44/144 (30%), Positives = 66/144 (45%), Gaps = 6/144 (4%) Frame = +2 Query: 314 RRFEIFKGNVRKIHELNTH-ERGTAVY--GITQFADLSYEEF-GKKYLGLKPSLRDTNQI 481 RR E+F+ N+R I N + G + G+T+FADL+ EE+ + LG + + Sbjct: 91 RRLEVFRDNLRYIDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSRGRNGTAVGV 150 Query: 482 PMRQAEIPKLKSPINSIGAIMTQSP--DVKDQGMCGSWLGPLALLVMSRVNIS*RLDSCC 655 R+ +P + + +VKDQG CG A+ + +N S Sbjct: 151 
VGRRRYLPLAGEQLPDAVDWRERGAVAEVKDQGQCGGCWAFSAVAAVEGIN-KIVTGSLI 209 Query: 656 HFSEQELVDCDKP*RRM*RGGLPD 727 SEQEL+DCDK + GGL D Sbjct: 210 SLSEQELIDCDKFQDQGCDGGLMD 233 >UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophyta|Rep: Cysteine protease Cp5 - Actinidia deliciosa (Kiwi) Length = 509 Score = 48.8 bits (111), Expect = 2e-04 Identities = 44/139 (31%), Positives = 65/139 (46%), Gaps = 11/139 (7%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNTHERGTA---VYGITQFADLSYEEFGKKYLG--LKPSLRD 469 E+ ++F+ F+ N+R + E N ERG + + G+ +FAD+S EEF + Y+ KP+ + Sbjct: 67 EVEKKFQNFRDNLRYVMEKNG-ERGASGGHLVGLNKFADMSNEEFREVYVSKVKKPTSKR 125 Query: 470 TNQIPMRQAEIPKLKSPINSIGAIMTQ------SPDVKDQGMCGSWLGPLALLVMSRVNI 631 RQ + K+ G VKDQG CGS + + +N Sbjct: 126 MAIERRRQGKAAAAKAVAACDGPTSLDWRKYGIVTGVKDQGDCGSCWAFSSTGAIEGINA 185 Query: 632 S*RLDSCCHFSEQELVDCD 688 D SEQELVDCD Sbjct: 186 LANGD-LISLSEQELVDCD 203 >UniRef50_Q23H32 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 365 Score = 48.8 bits (111), Expect = 2e-04 Identities = 44/163 (26%), Positives = 69/163 (42%), Gaps = 8/163 (4%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIH---ELNTHERGTAVYGITQFADLSYEEFG 433 F T + Y D + R F+IF N IH ++N + + + +FADLS +EF Sbjct: 45 FKKTFRKRYADSEGDYR--FQIFAENYNYIHNYNQINENSQDNIQLEVNEFADLSLQEFR 102 Query: 434 KKYLGLKPSLRDTNQ-----IPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLGP 598 + Y G S + NQ +RQ+ + P S+ V+ QG CGS Sbjct: 103 ELYFGYNSSKKHNNQQNGSTKNLRQSFLLSDSVP-ESVDWREKLVAPVQKQGGCGSCWAF 161 Query: 599 LALLVMSRVNIS*RLDSCCHFSEQELVDCDKP*RRM*RGGLPD 727 ++ + + + FSEQ L+DC + GG P+ Sbjct: 162 STVIALEGAYAK-QTGNVIKFSEQNLIDCCRIENNGCNGGDPE 203 >UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 325 Score = 48.0 bits (109), 
Expect = 3e-04 Identities = 50/163 (30%), Positives = 76/163 (46%), Gaps = 3/163 (1%) Frame = +2 Query: 248 TFVYDFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEE 427 T F + Y D E R F +F N+ I +T +GITQF DL+ E Sbjct: 38 TLFKQFKMKYNKRYADPDFESYR-FGVFSENLEVIKTDST-------FGITQFMDLTSAE 89 Query: 428 FGKKYLGLKPSL-RDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPL 601 F ++YL LK + +D ++I + ++ + ++G + +P VKDQG CGS + Sbjct: 90 FSEQYLTLKVNKNQDNSKIYKPKDDVEIKEIDFTTLGKV---TP-VKDQGRCGSCYAFST 145 Query: 602 ALLVMSRVNIS*RLD-SCCHFSEQELVDCDKP*RRM*RGGLPD 727 + S + IS + + SEQE+VDC K GG D Sbjct: 146 TGAIESALLISGVGEANTLSLSEQEIVDCVKEPEYNQLGGCQD 188 >UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicotyledons|Rep: Cysteine proteinase - Mesembryanthemum crystallinum (Common ice plant) Length = 367 Score = 48.0 bits (109), Expect = 3e-04 Identities = 37/131 (28%), Positives = 62/131 (47%), Gaps = 3/131 (2%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLG---LKPSLRDTN 475 E + RF +FK NV+ I+E+N ++ + + QF DL+ EF + Y ++ + ++ Sbjct: 59 EKQNRFHVFKENVKYINEVNKMDKPYKL-RLNQFGDLTPSEFARTYANSKIIEGTRNESG 117 Query: 476 QIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLGPLALLVMSRVNIS*RLDSCC 655 E+P+ GA+ +P VK+QG CG A + +N Sbjct: 118 GFMYENVEVPR-SIDWRVKGAV---TP-VKNQGRCGGCWAFSAAAAVEGIN-QITTGQLI 171 Query: 656 HFSEQELVDCD 688 SEQ+L+DCD Sbjct: 172 SLSEQQLIDCD 182 >UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8; Magnoliophyta|Rep: Os12g0273800 protein - Oryza sativa subsp. 
japonica (Rice) Length = 504 Score = 48.0 bits (109), Expect = 3e-04 Identities = 44/146 (30%), Positives = 61/146 (41%), Gaps = 4/146 (2%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKY 442 ++A H Y DAAE RR E+FK NV I N + G+ QFADL+ EEF Sbjct: 47 WMAQHGRVY-KDAAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATM 105 Query: 443 LGLKPSLRDTNQIPM----RQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLGPLALL 610 K N + + + + P + +KDQG C + G + L Sbjct: 106 TNSKGFSTPNNGVRVSTGFKYENVSADALPASVDWRTKGAVTRIKDQGQC-AMEGFVKLS 164 Query: 611 VMSRVNIS*RLDSCCHFSEQELVDCD 688 +++ SEQELVDCD Sbjct: 165 TGKLISL----------SEQELVDCD 180 >UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin L-like cysteine proteinase precursor - Acanthoscelides obtectus (Bean weevil) Length = 321 Score = 48.0 bits (109), Expect = 3e-04 Identities = 49/135 (36%), Positives = 66/135 (48%), Gaps = 8/135 (5%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELN-THERGTAVY--GITQFADLSYEEFGKKYLGL-KPS--LR 466 E +RRFEIFK N+R I E N + G + GI QF D++ EEF K+ L L KP L Sbjct: 39 EEKRRFEIFKFNLRTIEEHNERYHNGEETFEMGINQFGDMTQEEF-KRMLALQKPQMPLP 97 Query: 467 DTNQIPMRQA-EIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*R 640 +++ +IPK GA+ +VK QG CGS W + +V + + Sbjct: 98 RGDEVSFDNVNDIPKTVD-WREKGAV----TEVKKQGNCGSCWAFSAVGSIEGQVFL--K 150 Query: 641 LDSCCHFSEQELVDC 685 S S Q LVDC Sbjct: 151 NGSLESLSAQNLVDC 165 >UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; Eukaryota|Rep: Cysteine proteinase 1 precursor - Zea mays (Maize) Length = 371 Score = 48.0 bits (109), Expect = 3e-04 Identities = 42/137 (30%), Positives = 64/137 (46%), Gaps = 6/137 (4%) Frame = +2 Query: 296 DAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPS----L 463 DA E R +FK N+R+ + +A +G+T+F+DL+ EF + YLGL+ S L Sbjct: 61 DADEHAYRLSVFKDNLRRARRHQLLDP-SAEHGVTKFSDLTPAEFRRTYLGLRKSRRALL 119 Query: 464 RDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-W-LGPLALLVMSRVNIS* 637 R+ + +P P + VK+QG CGS W L + + Sbjct: 
120 RELGESAHEAPVLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEGAHYLATG 179 Query: 638 RLDSCCHFSEQELVDCD 688 +L+ SEQ+ VDCD Sbjct: 180 KLEV---LSEQQFVDCD 193 >UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep: Viral cathepsin - Xestia c-nigrum granulosis virus (XnGV) (Xestia c-nigrumgranulovirus) Length = 346 Score = 48.0 bits (109), Expect = 3e-04 Identities = 48/150 (32%), Positives = 68/150 (45%), Gaps = 6/150 (4%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKK 439 +F+ + Y DD E RFEIFK N+ I+ N E +A++ I AD+S E +K Sbjct: 45 EFVVKYNKVYKDDQ-EKEARFEIFKQNLADINARNALE-DSAMFEINSRADISSNELLQK 102 Query: 440 YLGLKPSL-----RDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPL 601 GLK SL +++ P + K P + VK Q CGS W Sbjct: 103 LTGLKLSLMRGEKKNSFCTPTVISGDSSGKVPDSFDWRDRNSVTSVKMQKECGSCWAFSA 162 Query: 602 ALLVMSRVNIS*RLDSCCHFSEQELVDCDK 691 + S +I + + SEQ+LVDCDK Sbjct: 163 VANIESLYHI--KHNVSLDLSEQQLVDCDK 190 Score = 33.5 bits (73), Expect = 6.4 Identities = 18/46 (39%), Positives = 26/46 (56%), Gaps = 3/46 (6%) Frame = +1 Query: 511 EIPDKFDWRDYDAVTRC*RPGYVRKLAG---AFSVTGNVEGQYKLK 639 ++PD FDWRD ++VT ++K G AFS N+E Y +K Sbjct: 132 KVPDSFDWRDRNSVTSV----KMQKECGSCWAFSAVANIESLYHIK 173 >UniRef50_Q945E4 Cluster: Cysteine proteinase; n=1; Vasconcellea cundinamarcensis|Rep: Cysteine proteinase - Carica candamarcensis Length = 179 Score = 47.6 bits (108), Expect = 4e-04 Identities = 24/51 (47%), Positives = 33/51 (64%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKP 457 E +RF+IFK N+R I E N+ T G+ +FADL+ EE+ YLG+KP Sbjct: 93 EKEKRFDIFKDNLRFIDEHNSQNL-TYRLGLNRFADLTNEEYRSTYLGVKP 142 >UniRef50_Q94715 Cluster: Putative cathepsin L2 precursor; n=4; Paramecium tetraurelia|Rep: Putative cathepsin L2 precursor - Paramecium tetraurelia Length = 294 Score = 47.6 bits (108), Expect = 4e-04 Identities = 42/139 (30%), Positives = 66/139 (47%), Gaps = 2/139 (1%) Frame = +2 Query: 278 KPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKP 457 K N +E R 
EI+ N R I E N E T G QF LS+EEF YL Sbjct: 21 KNNKFYTESEKLYRMEIYNSNKRMIEEHNQREDVTYQMGENQFMTLSHEEFVDLYL---- 76 Query: 458 SLRDTNQIPMRQAEIPKLKSPINSIGAIMTQS-PDVKDQGMCGS-WLGPLALLVMSRVNI 631 + + + + A +P+++ + +GA+ ++ VK+QG C S W ++ + + I Sbjct: 77 -QKSDSSVNIMGASLPEVQ--LEGLGAVDWRNYTTVKEQGQCASGWAFSVSNSLEAWYAI 133 Query: 632 S*RLDSCCHFSEQELVDCD 688 R + S Q++VDCD Sbjct: 134 --RGFQKINASTQQIVDCD 150 >UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Rep: Cathepsin W - Xenopus tropicalis (Western clawed frog) (Silurana tropicalis) Length = 303 Score = 47.2 bits (107), Expect = 5e-04 Identities = 37/129 (28%), Positives = 59/129 (45%), Gaps = 1/129 (0%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIP 484 E + R IF N+++ L E GTA YG+T+F+DL+ EEF + L ++ T I Sbjct: 13 EFKYRLRIFSENLKEASRLQREELGTAQYGVTKFSDLTDEEFSIYH--LPTNILPTPPIL 70 Query: 485 MRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSCCHF 661 + E+ L P + K+Q C S W + ++ I L Sbjct: 71 KQSEEV--LPFPTSCDWRTQNVISKAKNQRTCHSCWAFAAVANIEAQWAI---LGQTISL 125 Query: 662 SEQELVDCD 688 SEQ+++DC+ Sbjct: 126 SEQQVIDCN 134 >UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Bigelowiella natans|Rep: Digestive cysteine proteinase - Bigelowiella natans (Pedinomonas minutissima) (Chlorarachnion sp.(strain CCMP 621)) Length = 360 Score = 47.2 bits (107), Expect = 5e-04 Identities = 43/139 (30%), Positives = 62/139 (44%), Gaps = 3/139 (2%) Frame = +2 Query: 284 NYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSL 463 +Y + E + R F N R I LN +E G+AVYG T+F+D+S E+F K Sbjct: 34 SYEEAGKEDKARLN-FVENERIIQGLNENELGSAVYGHTRFSDMSPEQFRAMMTPFKYHT 92 Query: 464 RDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WL--GPLALLVMSRVNIS 634 + Q + +K + VKDQG CGS W AL + + Sbjct: 93 DEAENAAYDQNK-NAVKVTDSFDWRDFNALTPVKDQGGCGSCWAFSATQALESAHYIKHN 151 Query: 635 *RLDSCCHFSEQELVDCDK 691 LDS S ++LV+CD+ Sbjct: 152 DTLDSPIALSTEQLVECDQ 170 >UniRef50_Q5Y801 Cluster: Cysteine proteinase; n=1; Petunia x hybrida|Rep: Cysteine 
proteinase - Petunia hybrida (Petunia) Length = 167 Score = 47.2 bits (107), Expect = 5e-04 Identities = 25/64 (39%), Positives = 37/64 (57%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKY 442 +L H +Y + E +RF+IFK N+ I E N+ + G+T+FADL+ EE+ Y Sbjct: 83 WLVQHGKSY-NGLQEKDKRFQIFKDNLNYIDEQNSVPNKSYKLGLTKFADLTNEEYKSTY 141 Query: 443 LGLK 454 LG K Sbjct: 142 LGTK 145 >UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_79, whole genome shotgun sequence - Paramecium tetraurelia Length = 324 Score = 47.2 bits (107), Expect = 5e-04 Identities = 43/153 (28%), Positives = 70/153 (45%), Gaps = 8/153 (5%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGI--TQFADLSYEEFG 433 D+ + + + EM R + +F+ N + I N + G Y + QFADL+ +EF Sbjct: 38 DWKIQYNKKFSSEKEEMYR-YLVFQQNAQLIEAHNNDKSGKYTYTMETNQFADLTEQEFA 96 Query: 434 KKYLGLKP----SLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQG-MCG-SWLG 595 +KYL +P + T+ +P QA + + P +KDQG CG SW Sbjct: 97 QKYLTFRPKSTNKSKSTDYVPNGQARDWVEEGKV----------PPIKDQGSSCGSSWAF 146 Query: 596 PLALLVMSRVNIS*RLDSCCHFSEQELVDCDKP 694 ++ NI L++ SEQ+++DC P Sbjct: 147 SAVGVLEINSNIEFGLETT--LSEQDMLDCSGP 177 >UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; Dictyostelium discoideum|Rep: Cysteine proteinase 7 precursor - Dictyostelium discoideum (Slime mold) Length = 460 Score = 47.2 bits (107), Expect = 5e-04 Identities = 41/146 (28%), Positives = 72/146 (49%), Gaps = 4/146 (2%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKK 439 +++ H+ +Y + E R+ IFK N+ ++E NT T V G+ FAD+S EE+ Sbjct: 32 NWMIAHQRHYSSE--EFNGRYNIFKANMDYVNEWNTKGSET-VLGLNVFADISNEEYRAT 88 Query: 440 YLGLKPSLRDTNQIPMRQAE-IPKLKSPIN--SIGAIMTQSPDVKDQGMCGS-WLGPLAL 607 YLG + D + + M +++ I + ++ + GA+ +P +K+QG CG W Sbjct: 89 YLG---TPFDASSLEMTESDKIFDASAQVDWRTQGAV---TP-IKNQGQCGGCWSFSTTG 141 Query: 608 LVMSRVNIS*RLDSCCHFSEQELVDC 685 ++ + 
SEQ L+DC Sbjct: 142 ATEGAQYLANGKKNLVSLSEQNLIDC 167 >UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Toxopain-2 - Toxoplasma gondii Length = 422 Score = 46.8 bits (106), Expect = 6e-04 Identities = 43/148 (29%), Positives = 68/148 (45%), Gaps = 5/148 (3%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGI--TQFADLSYEEFGK 436 F A + +Y + E +RR+ IFK N+ IH TH + Y + F DLS +EF + Sbjct: 120 FQAMYAKSYATEE-EKQRRYAIFKNNLVYIH---THNQQGYSYSLKMNHFGDLSRDEFRR 175 Query: 437 KYLGLKPSLR-DTNQIPMRQAEIPKLKSPINSIGAIMTQS--PDVKDQGMCGSWLGPLAL 607 KYLG K S ++ + + + L S + + ++ VKDQ CGS Sbjct: 176 KYLGFKKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTT 235 Query: 608 LVMSRVNIS*RLDSCCHFSEQELVDCDK 691 + + + + SEQEL+DC + Sbjct: 236 GALEGAHCA-KTGKLVSLSEQELMDCSR 262 >UniRef50_Q23H10 Cluster: Papain family cysteine protease containing protein; n=14; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 336 Score = 46.4 bits (105), Expect = 8e-04 Identities = 44/149 (29%), Positives = 70/149 (46%), Gaps = 13/149 (8%) Frame = +2 Query: 287 YIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLR 466 Y+++ ++ R+ F+ N +KI E N+ T + QF+D++ EEF +K L +K L Sbjct: 40 YLNEHEKLFRQMVFFE-NFQKIQEHNSDPNNTYSVHLNQFSDMTKEEFAEKIL-MKSDLV 97 Query: 467 DTNQIPMRQA----EIPKLKSPINSIGAIMTQSPD---------VKDQGMCGSWLGPLAL 607 D + Q + ++ ++S + S D VK+QG CGS A Sbjct: 98 DHLMKGISQEATHNDTNNNETQLSSNSLTLADSIDWRTKGAVTSVKNQGGCGSCWSFSAA 157 Query: 608 LVMSRVNIS*RLDSCCHFSEQELVDCDKP 694 VM N + + FSEQ+LVDC P Sbjct: 158 AVMESFNFI-QNKALVDFSEQQLVDCVIP 185 >UniRef50_Q7KWP5 Cluster: Similar to Dictyostelium discoideum (Slime mold). Cysteine proteinase 5; n=2; Dictyostelium discoideum|Rep: Similar to Dictyostelium discoideum (Slime mold). 
Cysteine proteinase 5 - Dictyostelium discoideum (Slime mold) Length = 345 Score = 46.0 bits (104), Expect = 0.001 Identities = 39/137 (28%), Positives = 62/137 (45%), Gaps = 8/137 (5%) Frame = +2 Query: 299 AAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQ 478 ++E R+ FK N+ I++ N+ T V + +FAD+S EE+ K YL ++ + Sbjct: 42 SSEFTNRYNTFKSNLDFINQWNSKGSKT-VLALNEFADISNEEYRKNYLRNDNNINKLSS 100 Query: 479 IPMRQAEIPKLKSPIN----SIGAIMTQS---PDVKDQ-GMCGSWLGPLALLVMSRVNIS 634 + + E ++KS + S G + P VK Q G CGSW S ++ Sbjct: 101 LLINDKEDKEIKSSSSSGSGSSGIDWRKKGAVPSVKSQIGGCGSWPITAVGATESAHFLA 160 Query: 635 *RLDSCCHFSEQELVDC 685 D S Q L+DC Sbjct: 161 NPKDPFISLSMQNLIDC 177 >UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypanosoma cruzi|Rep: Cysteine protease, putative - Trypanosoma cruzi Length = 434 Score = 46.0 bits (104), Expect = 0.001 Identities = 41/137 (29%), Positives = 61/137 (44%), Gaps = 7/137 (5%) Frame = +2 Query: 296 DAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTN 475 D E R+R IFK N+ K+ N + GI +F+D++ EEF K+ G + + + Sbjct: 52 DPEEHRKRAAIFKENLAKVRAFNGALGRSYRLGINKFSDMTKEEFNAKFNG-RVAAPQST 110 Query: 476 QIPMR------QAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS 634 Q P R +A P+ + + ++T VKDQG CGS W V S IS Sbjct: 111 QSPQRAPYKRTKATFPEALNWQEAKNPVLT---PVKDQGSCGSCWAHAATESVESMYAIS 167 Query: 635 *RLDSCCHFSEQELVDC 685 S Q++ C Sbjct: 168 --SGKLLTLSTQQITSC 182 >UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: Cysteine protease - Saprolegnia parasitica Length = 523 Score = 45.6 bits (103), Expect = 0.001 Identities = 42/145 (28%), Positives = 61/145 (42%), Gaps = 4/145 (2%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLK--PSLRDTNQ 478 E RFE+F N ++I N + G +++ L+++EF K GL+ PS + Sbjct: 43 EWVHRFEVFILNDQRIEAHNKDASSSFTMGHNEYSHLTFDEFKKLRTGLRVSPSYIQSRA 102 Query: 479 IPMRQAEIPKLKSPINSIGAIMTQS-PDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSC 652 A + N + + VK+QGMCGS W + +S + Sbjct: 103 
KYALMAPAVNMTDVPNEMDWVEQGGVTPVKNQGMCGSCWAFSTTGAIEGAAFVSSK--QL 160 Query: 653 CHFSEQELVDCDKP*RRM*RGGLPD 727 SEQELVDCD GGL D Sbjct: 161 VSVSEQELVDCDHNGDMGCNGGLMD 185 >UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sativa|Rep: OSJNBb0085F13.15 protein - Oryza sativa subsp. japonica (Rice) Length = 383 Score = 45.2 bits (102), Expect = 0.002 Identities = 49/161 (30%), Positives = 66/161 (40%), Gaps = 19/161 (11%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKY 442 ++ATH +Y A E RRFE+++ N+ I N + T G T F DL++EEF Y Sbjct: 59 WMATHNRSYAS-ADEKLRRFEVYRSNMEFIEATNRNGSLTFKLGETPFTDLTHEEFLATY 117 Query: 443 LG---LKPSLRD-TNQIPMRQAEIPKLKSPINSIGA-----IMTQSPD---------VKD 568 G L P R + A I + GA + +S D K Sbjct: 118 TGDVRLPPERRGMQDDSDEEDAVITTSAGYVAGAGAGRRTVAVPESVDWRKEGAVTPAKH 177 Query: 569 QGMCGS-WLGPLALLVMSRVNIS*RLDSCCHFSEQELVDCD 688 QG C + W + S I + SEQELVDCD Sbjct: 178 QGQCAACWAFAAVAAIESLHKI--KGGDLISLSEQELVDCD 216 >UniRef50_Q8I8D5 Cluster: Cysteine protease 13; n=2; Entamoeba histolytica|Rep: Cysteine protease 13 - Entamoeba histolytica Length = 379 Score = 44.8 bits (101), Expect = 0.003 Identities = 38/128 (29%), Positives = 61/128 (47%), Gaps = 15/128 (11%) Frame = +2 Query: 248 TFVYDFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGT--AVYGITQFADLSY 421 T+ + + +K Y + E+ R+ IF N+++I++LN+ T AV+GI F+DL Sbjct: 32 TYWSKWKSDNKKVYNSISEELTRK-AIFLSNLKRINQLNSQRIDTDDAVFGINAFSDLKP 90 Query: 422 EEFGKKY-----LGLKPSLRDTNQIPMRQAEIPKLKSPI--------NSIGAIMTQSPDV 562 EEF +++ LKP ++P+ E+P S NS I V Sbjct: 91 EEFARRFNKINLKSLKPKQTTHYKLPVPSGEVPTQYSACLQNKLLGQNSSNNIDLCGGIV 150 Query: 563 KDQGMCGS 586 DQG CG+ Sbjct: 151 MDQGDCGN 158 >UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: Vivapain-4 - Plasmodium vivax Length = 484 Score = 44.8 bits (101), Expect = 0.003 Identities = 42/154 (27%), Positives = 69/154 (44%), Gaps = 11/154 (7%) Frame = +2 Query: 257 YDFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGK 436 Y F+ H Y + EM++R+ F N+ +I+ N+ G 
Q++D+S+EEF K Sbjct: 167 YLFMKEHGKKYKTEE-EMQQRYLAFTENLARINSHNSKANILYKKGTNQYSDISFEEFRK 225 Query: 437 KYLGLKPSLRD---TNQIPMRQAEIPKLKSPINSI-------GAIMTQSPDVKDQGMCGS 586 L L+ L+ + ++ K P +++ ++K+Q +CGS Sbjct: 226 TMLTLRFDLKKKLANSPYVSNYDDVLKKYKPADAVVDNEKYDWREHNAVSEIKNQNLCGS 285 Query: 587 -WLGPLALLVMSRVNIS*RLDSCCHFSEQELVDC 685 W V S+ I R + SEQELVDC Sbjct: 286 CWAFGAVGAVESQYAI--RKNQHVLISEQELVDC 317 >UniRef50_Q18740 Cluster: Putative uncharacterized protein tag-329; n=2; Caenorhabditis|Rep: Putative uncharacterized protein tag-329 - Caenorhabditis elegans Length = 374 Score = 44.8 bits (101), Expect = 0.003 Identities = 44/156 (28%), Positives = 68/156 (43%), Gaps = 14/156 (8%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTA---VYGITQFADLSYEEF 430 DF+ +K NY D+ E + RF+ F ++ ++N + YGI +F+DLS +E Sbjct: 49 DFIVKYKRNYKDEI-EKKFRFQQFVATHNRVGKMNKAAKKAGHDTKYGINKFSDLSKKEI 107 Query: 431 GKKYLGLKPSLRDTNQIP---------MRQAE-IPKLKSPIN-SIGAIMTQSPDVKDQGM 577 Y P +TN +P RQ E +PK N +G P +K Q Sbjct: 108 HGMYSKFGPPKNNTN-VPKFNLKNLRVKRQMEGLPKTFDLRNKKVGGHYIIGP-IKTQDS 165 Query: 578 CGSWLGPLALLVMSRVNIS*RLDSCCHFSEQELVDC 685 C G A ++ ++ L + SEQE+ DC Sbjct: 166 CACCWG-FAATAVAEAALTVHLKKAMNLSEQEVCDC 200 >UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanensis|Rep: Sui m 1 allergen - Suidasia medanensis Length = 336 Score = 44.8 bits (101), Expect = 0.003 Identities = 40/134 (29%), Positives = 60/134 (44%), Gaps = 4/134 (2%) Frame = +2 Query: 299 AAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQ 478 A E +R IF+ N+R I E H + A + + ADL+ EEF Y L + Sbjct: 41 AEEEPQRRAIFEENLRWIQE--NHGKHGAGLEVNEHADLTAEEFSSMYATLNQEAFLKSP 98 Query: 479 IPMRQAEIPKLKSPINSIGAI---MTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLD 646 + ++P+ + A + V++QG CGS W A V ++ I R + Sbjct: 99 LHKEFVQVPESDISVALPAAFDWRQQWNTAVRNQGQCGSCWAFATAATVEAQYAI--RKN 156 Query: 647 SCCHFSEQELVDCD 688 SEQ+LVDCD Sbjct: 157 VHVTLSEQQLVDCD 170 >UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, whole genome shotgun sequence; 
n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_158, whole genome shotgun sequence - Paramecium tetraurelia Length = 308 Score = 44.8 bits (101), Expect = 0.003 Identities = 41/127 (32%), Positives = 58/127 (45%), Gaps = 3/127 (2%) Frame = +2 Query: 317 RFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIPMRQA 496 R +IF+ ++ N + T G QF DL+ EEF YL R + Q + + Sbjct: 51 RAKIFEERIKLFEAHNADKTQTFTMGENQFTDLTQEEFKAIYL-----RRRSPQKLVNEK 105 Query: 497 EIPKLKSPINSIGAIMTQSPDVKDQGMCG-SWLGPLALLVMS--RVNIS*RLDSCCHFSE 667 +P ++ + S A VKDQG CG +W V S R+N LD SE Sbjct: 106 YVPTNEANLTS--ANWAGLTSVKDQGYCGAAWAFAAIGAVESVLRINSVTNLD----LSE 159 Query: 668 QELVDCD 688 Q+L+DCD Sbjct: 160 QQLIDCD 166 >UniRef50_Q3LFN3 Cluster: Cysteine proteinase; n=1; Dianthus caryophyllus|Rep: Cysteine proteinase - Dianthus caryophyllus (Carnation) (Clove pink) Length = 140 Score = 44.4 bits (100), Expect = 0.003 Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 7/84 (8%) Frame = +2 Query: 224 SVPPHTS*TF--VYD-FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTA--- 385 S PP T+ +Y+ +L H+ NY + E +RF IF+ N+ I + N + G Sbjct: 52 STPPRTTAEVMQIYESWLVKHRKNY-NALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGE 110 Query: 386 -VYGITQFADLSYEEFGKKYLGLK 454 G+ +FADL+ +EF + Y G+K Sbjct: 111 FELGLNKFADLTNDEFRRIYFGVK 134 >UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-ear cress). SAG12 protein; n=2; Dictyostelium discoideum|Rep: Similar to Arabidopsis thaliana (Mouse-ear cress). 
SAG12 protein - Dictyostelium discoideum (Slime mold) Length = 358 Score = 44.4 bits (100), Expect = 0.003 Identities = 49/146 (33%), Positives = 67/146 (45%), Gaps = 15/146 (10%) Frame = +2 Query: 296 DAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGL----KPS- 460 D+ EM RF FK N++K ELN+ G A + F+DLS EEF +L KPS Sbjct: 57 DSIEMENRFSNFKENMKKNIELNSMHAGKAKFESNGFSDLSEEEFSNFHLNKAFKGKPSH 116 Query: 461 LRDT--NQIPMRQAEIPKLK----SPINSIGAIMTQS----PDVKDQGMCGSWLGPLALL 610 LR++ Q + I K +N + +I + VKDQG CGS A+ Sbjct: 117 LRNSIKPQPTPHHSLINGYKEMENGDLNELYSIDWRKKGLVTPVKDQGQCGSCYIFSAVE 176 Query: 611 VMSRVNIS*RLDSCCHFSEQELVDCD 688 + I + SEQ+ VDCD Sbjct: 177 QIETAWIK-AGNKPILLSEQQAVDCD 201 >UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegleria fowleri|Rep: Cysteine proteinase homolog - Naegleria fowleri Length = 347 Score = 44.4 bits (100), Expect = 0.003 Identities = 42/134 (31%), Positives = 60/134 (44%), Gaps = 6/134 (4%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQI- 481 E R++IFK NV K N H +GIT+F+DL+ EEF + +L + + +I Sbjct: 48 EHNNRYQIFKANVEKSRYYN-HVGKRENFGITKFSDLTPEEFKRMFLMKTYTPEEAKKIL 106 Query: 482 --PMRQ--AEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLD 646 P +E +P + VK+QG CGS W V + I + Sbjct: 107 AAPQHAVLSEKEVQTAPTSFDWRQHGAVTRVKNQGACGSCWTFSTTGNVEGQWAI--KKG 164 Query: 647 SCCHFSEQELVDCD 688 SEQ+LVDCD Sbjct: 165 KLVSLSEQQLVDCD 178 Score = 42.7 bits (96), Expect = 0.010 Identities = 21/44 (47%), Positives = 25/44 (56%) Frame = +1 Query: 517 PDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTGQ 648 P FDWR + AVTR G FS TGNVEGQ+ +K G+ Sbjct: 123 PTSFDWRQHGAVTRVKNQGACGS-CWTFSTTGNVEGQWAIKKGK 165 >UniRef50_A7TC64 Cluster: Predicted protein; n=1; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 218 Score = 44.4 bits (100), Expect = 0.003 Identities = 23/60 (38%), Positives = 37/60 (61%), Gaps = 3/60 (5%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHER---GTAVYGITQFADLSYEEF 430 +F++ +Y+DD E R EIF ++ + +LN 
E+ G+A YG+ QF+DL+ EEF Sbjct: 35 EFVSAFNKSYVDDVYEYGIRKEIFLQSLIRHDKLNREEKELGGSARYGVNQFSDLTPEEF 94 >UniRef50_A7APS9 Cluster: Papain family cysteine protease containing protein; n=1; Babesia bovis|Rep: Papain family cysteine protease containing protein - Babesia bovis Length = 435 Score = 44.4 bits (100), Expect = 0.003 Identities = 47/151 (31%), Positives = 62/151 (41%), Gaps = 21/151 (13%) Frame = +2 Query: 302 AEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQI 481 +E RF F NV +I E N + T I QFAD++ E+F +R + I Sbjct: 135 SEKIERFATFYRNVTRIREFNMNVHKTYTMKINQFADMTPEQFMSLQGTRASKIRVSKGI 194 Query: 482 PMRQAEI------PKLKSPINSIGAIMTQ-SPD-------------VKDQGMCGS-WLGP 598 P Q P LKS + G SP+ VKDQG CGS W Sbjct: 195 PDSQVAAVGNQKGPNLKSEVRQTGNRFADISPEDFIDLRKDNYMTPVKDQGNCGSCW--A 252 Query: 599 LALLVMSRVNIS*RLDSCCHFSEQELVDCDK 691 +L+ ++ + D SEQ LVDC K Sbjct: 253 FSLIGVAEPFFKHKRDIDVVLSEQNLVDCVK 283 >UniRef50_A0DKP6 Cluster: Chromosome undetermined scaffold_54, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_54, whole genome shotgun sequence - Paramecium tetraurelia Length = 312 Score = 44.4 bits (100), Expect = 0.003 Identities = 40/147 (27%), Positives = 67/147 (45%), Gaps = 3/147 (2%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKK 439 D+ H ++++ E + RF+IF+ N++KI + N+ E T G+ +F L+ E+F Sbjct: 35 DWKLKHGMQFLNE--ENQYRFQIFQTNLQKIEQHNSDESQTYTMGMNKFMHLTQEQFQSL 92 Query: 440 YLGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVM 616 +L + Q EI +L + + VKDQG C S W A V Sbjct: 93 HL-----MNIQEHYVGDQPEILQLGNIQLNASIDYRNHTIVKDQGQCNSGW----AFSVT 143 Query: 617 SRVNIS*RL--DSCCHFSEQELVDCDK 691 + + ++ SEQ L+DCD+ Sbjct: 144 GTLEVYQKIYQKKNVSLSEQHLIDCDQ 170 >UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor; n=17; Magnoliophyta|Rep: Thiol protease aleurain-like precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 358 Score = 44.4 bits (100), Expect = 0.003 Identities = 41/131 (31%), Positives = 
65/131 (49%), Gaps = 4/131 (3%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNTHERGTAV-YGITQFADLSYEEFGKKYLGLKPSLRDT--N 475 EM+ RF +FK N+ I +T+++G + + QFADL+++EF + LG + T Sbjct: 75 EMKLRFSVFKENLDLIR--STNKKGLSYKLSLNQFADLTWQEFQRYKLGAAQNCSATLKG 132 Query: 476 QIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSC 652 + +A +P K G + SP VK+QG CGS W + + + + Sbjct: 133 SHKITEATVPDTKD-WREDGIV---SP-VKEQGHCGSCWTFSTTGALEAAYHQA--FGKG 185 Query: 653 CHFSEQELVDC 685 SEQ+LVDC Sbjct: 186 ISLSEQQLVDC 196 >UniRef50_A7LFV3 Cluster: Cathepsin L protease inhibitor 2; n=1; Diaprepes abbreviatus|Rep: Cathepsin L protease inhibitor 2 - Diaprepes abbreviatus (Sugarcane rootstalk borer weevil) Length = 91 Score = 44.0 bits (99), Expect = 0.005 Identities = 26/59 (44%), Positives = 35/59 (59%), Gaps = 3/59 (5%) Frame = +2 Query: 284 NYIDDAAEMRRRFEIFKGNVRKIHELNTH-ERGTAVY--GITQFADLSYEEFGKKYLGL 451 NY D + E +RF IF+ N++ I E N ERG + GI QF DL+ EEF ++ GL Sbjct: 27 NY-DSSDEEAKRFNIFQQNLQSIREHNEKFERGETTFTQGINQFTDLTKEEFKARHTGL 84 >UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep: Viral cathepsin - Cydia pomonella granulosis virus (CpGV) (Cydia pomonellagranulovirus) Length = 333 Score = 44.0 bits (99), Expect = 0.005 Identities = 39/151 (25%), Positives = 68/151 (45%), Gaps = 8/151 (5%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKK 439 +F + Y+ D E + E FK N++ I+E N + AV+ I +++DL+ ++ Sbjct: 34 NFAIKYNKTYVSDE-ERAIKLENFKNNLKMINEKNMASK-YAVFDINEYSDLNKNALLRR 91 Query: 440 YLGLKPSLR-DTNQIPMRQAEIPKLKSPINSIGAIMTQSPD------VKDQGMCGS-WLG 595 G + L+ + + M + + +K ++ D VK+Q CGS W Sbjct: 92 TTGFRLGLKKNPSAFTMTECSVVVIKDEPQALLPETLDWRDKHGVTPVKNQMECGSCWAF 151 Query: 596 PLALLVMSRVNIS*RLDSCCHFSEQELVDCD 688 + S NI + D + SEQ LV+CD Sbjct: 152 STIANIESLYNI--KYDKALNLSEQHLVNCD 180 >UniRef50_UPI0001509FF9 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila 
SB210 Length = 348 Score = 43.6 bits (98), Expect = 0.006 Identities = 44/143 (30%), Positives = 70/143 (48%), Gaps = 3/143 (2%) Frame = +2 Query: 272 THKPNYIDDAAEM-RRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLG 448 T+K ++ D E +RRF+IF N+ ++ L+ G + ITQ+ L+ EEF + G Sbjct: 70 TYKIHFDDSGEEEEKRRFQIFTKNL--VYILS--RPGLS---ITQYTHLTKEEFAQMSFG 122 Query: 449 LKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQS-PDVKDQGMC-GSWLGPLALLVMSR 622 + D Q+ + ++ P+NSI I + +VK QGMC SW V S Sbjct: 123 VVEQEPDNFQL------LQQVNEPVNSIDWISKNAVSNVKTQGMCQSSWAFAAVAGVESA 176 Query: 623 VNIS*RLDSCCHFSEQELVDCDK 691 + + + SEQ L+DCD+ Sbjct: 177 LFL--KNGKIPDVSEQNLLDCDQ 197 >UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 317 Score = 43.6 bits (98), Expect = 0.006 Identities = 43/148 (29%), Positives = 64/148 (43%), Gaps = 5/148 (3%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNT-HERGTAVY--GITQFADLSYEEFG 433 F H Y E + RF++F N++KI + N ++ G + G+ QFAD++ EEF Sbjct: 19 FKVNHSKKY-GHLKEEQVRFQVFSQNLQKIEQHNARYQNGEVSFYLGVNQFADMTSEEF- 76 Query: 434 KKYLGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-W-LGPLAL 607 K L + + I R P+L P + V+DQ CGS W Sbjct: 77 KAMLDSQLIHKPKRDITSRFVADPQLTVPESIDWREKGAVNPVRDQEQCGSCWAFSAAGA 136 Query: 608 LVMSRVNIS*RLDSCCHFSEQELVDCDK 691 L R +L+ S Q+LVDC + Sbjct: 137 LEGQRFLKEGKLEV---LSTQQLVDCSR 161 >UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted, possible transmembrane domain near N-terminus; n=4; Cryptosporidium|Rep: Cryptopain-cysteine proteinase secreted, possible transmembrane domain near N-terminus - Cryptosporidium parvum Iowa II Length = 401 Score = 43.6 bits (98), Expect = 0.006 Identities = 36/135 (26%), Positives = 62/135 (45%), Gaps = 6/135 (4%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIP 484 E +RFEI+K N+ I N+ + + V + +F DLS EEF ++ G +D ++ Sbjct: 102 
EENQRFEIYKQNMNFIKTTNS-QGFSYVLEMNEFGDLSKEEFMARFTGYIKDSKDDERV- 159 Query: 485 MRQAEIPKLKS-----PINSIGAIMTQSPD-VKDQGMCGSWLGPLALLVMSRVNIS*RLD 646 + + + +S P NSI + + +++Q CGS A+ + + Sbjct: 160 FKSSRVSASESEEEFVPPNSINWVEAGCVNPIRNQKNCGSCWAFSAVAALEGATCAQTNR 219 Query: 647 SCCHFSEQELVDCDK 691 SEQ+ VDC K Sbjct: 220 GLPSLSEQQFVDCSK 234 >UniRef50_P56202 Cluster: Cathepsin W precursor; n=15; Eutheria|Rep: Cathepsin W precursor - Homo sapiens (Human) Length = 376 Score = 43.6 bits (98), Expect = 0.006 Identities = 19/46 (41%), Positives = 28/46 (60%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKY 442 E R +IF N+ + L + GTA +G+T F+DL+ EEFG+ Y Sbjct: 58 EHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQLY 103 >UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [Contains: Cathepsin H mini chain; Cathepsin H heavy chain; Cathepsin H light chain]; n=37; Eukaryota|Rep: Cathepsin H precursor (EC 3.4.22.16) [Contains: Cathepsin H mini chain; Cathepsin H heavy chain; Cathepsin H light chain] - Homo sapiens (Human) Length = 335 Score = 43.6 bits (98), Expect = 0.006 Identities = 45/158 (28%), Positives = 69/158 (43%), Gaps = 4/158 (2%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVY--GITQFADLSYEEFGK 436 +++ H+ Y + E R + F N RKI N H G + + QF+D+S+ E Sbjct: 38 WMSKHRKTYSTE--EYHHRLQTFASNWRKI---NAHNNGNHTFKMALNQFSDMSFAEIKH 92 Query: 437 KYLGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLV 613 KYL +P + + P S ++ SP VK+QG CGS W + Sbjct: 93 KYLWSEPQNCSATKSNYLRGTGPYPPS-VDWRKKGNFVSP-VKNQGACGSCWTFSTTGAL 150 Query: 614 MSRVNIS*RLDSCCHFSEQELVDCDKP*RRM-*RGGLP 724 S + I+ +EQ+LVDC + +GGLP Sbjct: 151 ESAIAIA--TGKMLSLAEQQLVDCAQDFNNHGCQGGLP 186 >UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_184, whole genome shotgun sequence - Paramecium tetraurelia Length = 331 Score = 43.2 bits (97), Expect = 0.008 Identities = 43/141 (30%), Positives = 61/141 
(43%), Gaps = 4/141 (2%) Frame = +2 Query: 275 HKPNYIDDAAEMRRRFEIFKGNVRKIHELNTH-ERG--TAVYGITQFADLSYEEFGKKYL 445 H Y D E RF +F N+ + E N+ E G T G+ Q+ADL+ EEF +L Sbjct: 41 HGKRYSD--FEEVHRFSVFAQNLAVVMEHNSKFELGQETFTLGMNQYADLTPEEFQASFL 98 Query: 446 GLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSR 622 LK ++D + L P +++ VK+QG CGS W A + Sbjct: 99 TLKTKVQDRKNVKSYSG----LSFP-DTVD--WKDGLTVKNQGSCGSCWAFAAAAAI--E 149 Query: 623 VNIS*RLDSCCHFSEQELVDC 685 + + SEQE VDC Sbjct: 150 AGFQHHKKNKVNISEQEFVDC 170 >UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative uncharacterized protein - Oryza sativa subsp. japonica (Rice) Length = 289 Score = 42.7 bits (96), Expect = 0.010 Identities = 31/94 (32%), Positives = 39/94 (41%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIP 484 E RF IF+ NV I + GI QFADL+ +EF Y G KP + P Sbjct: 59 EKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPP--HPKEAP 116 Query: 485 MRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS 586 + + +P VKDQG CGS Sbjct: 117 ---RPVDPIWTPCCIDWRFRGAVTGVKDQGACGS 147 >UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_46, whole genome shotgun sequence - Paramecium tetraurelia Length = 336 Score = 42.7 bits (96), Expect = 0.010 Identities = 35/126 (27%), Positives = 58/126 (46%), Gaps = 2/126 (1%) Frame = +2 Query: 317 RFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIPMRQA 496 RF F+ N K+++ N+ T + QF+DLS EEF YL + + + + Sbjct: 73 RFFNFQINRNKVNKHNSDPNKTYFMKMNQFSDLSQEEFSLIYL-THDNAEEVMEQNLIID 131 Query: 497 EIPKLKSPINSIGAI-MTQSPDVKDQGMC-GSWLGPLALLVMSRVNIS*RLDSCCHFSEQ 670 E+ K + +I ++ + VKDQG C G W + + + + SEQ Sbjct: 132 ELQKTQENDKTINSVDWRKITQVKDQGQCSGCW--AFGAVGAAEAWFYVKNKTTVLLSEQ 189 Query: 671 ELVDCD 688 +L+DCD Sbjct: 190 QLIDCD 195 >UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|Rep: Cruzipain precursor - Trypanosoma 
cruzi Length = 467 Score = 42.7 bits (96), Expect = 0.010 Identities = 41/147 (27%), Positives = 60/147 (40%), Gaps = 3/147 (2%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKK 439 +F H Y + AAE R +F+ N+ + L+ A +G+T F+DL+ EEF + Sbjct: 40 EFKQKHGRVY-ESAAEEAFRLSVFRENLF-LARLHAAANPHATFGVTPFSDLTREEFRSR 97 Query: 440 YLGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVM 616 Y + ++ + +P VKDQG CGS W A + Sbjct: 98 YHNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCW----AFSAI 153 Query: 617 SRVNIS*RL--DSCCHFSEQELVDCDK 691 V L + SEQ LV CDK Sbjct: 154 GNVECQWFLAGHPLTNLSEQMLVSCDK 180 >UniRef50_Q9JM84 Cluster: DD72 protein; n=4; Murinae|Rep: DD72 protein - Mus musculus (Mouse) Length = 148 Score = 42.3 bits (95), Expect = 0.014 Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 3/58 (5%) Frame = +3 Query: 6 SAREQVIAGIHYRMKVEVGLTNCTAL-TNRSDC--KHISDESLNKFCRVNVWMRPWTN 170 SA +QV+AG +Y +K+E+G T CT +N DC D+ C + + PW N Sbjct: 79 SASQQVVAGKNYYLKIELGRTTCTKTESNLVDCPFNEQPDQQKRVICNFQINVAPWLN 136 >UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 41.9 bits (94), Expect = 0.018 Identities = 46/148 (31%), Positives = 69/148 (46%), Gaps = 6/148 (4%) Frame = +2 Query: 293 DDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLG--LKP-SL 463 +D E + R +F N ++I N + + GI +F+ L+ EEF KYL +P S Sbjct: 51 NDIQEEQYRLFVFHENFKQIELDNMNSDNGFISGINKFSHLTKEEFKAKYLNRPQRPASE 110 Query: 464 RDTNQI-PMRQAEIPKLKSPIN--SIGAIMTQSPDVKDQGMCGSWLGPLALLVMSRVNIS 634 TN I +Q KL ++ +GA+ SP V+DQG CGS + + + Sbjct: 111 MKTNSILSSQQKTDEKLPESVDWRKLGAV---SP-VRDQGNCGSCYAFASTGALEGL-YQ 165 Query: 635 *RLDSCCHFSEQELVDCDKP*RRM*RGG 718 + FS Q +VDC K + RGG Sbjct: 166 IKTGKLEVFSPQYIVDCAK--HQFSRGG 191 >UniRef50_UPI0000ECC98C Cluster: Cystatin-F precursor (Leukocystatin) (Cystatin-7) (Cystatin-like metastasis-associated protein) 
(CMAP).; n=2; Gallus gallus|Rep: Cystatin-F precursor (Leukocystatin) (Cystatin-7) (Cystatin-like metastasis-associated protein) (CMAP). - Gallus gallus Length = 137 Score = 41.9 bits (94), Expect = 0.018 Identities = 20/58 (34%), Positives = 29/58 (50%), Gaps = 4/58 (6%) Frame = +3 Query: 3 NSAREQVIAGIHYRMKVEVGLTNCT--ALTNRSDCKHISDESLNKF--CRVNVWMRPW 164 N A Q++ G+ Y + VE+G T C +N DC ++L + C VWM PW Sbjct: 68 NKAMVQIVRGLKYMLHVEIGRTVCEKRGYSNLDDCHFQKKKNLQQILKCYFEVWMTPW 125 >UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thaliana|Rep: F9P14.12 protein - Arabidopsis thaliana (Mouse-ear cress) Length = 343 Score = 41.9 bits (94), Expect = 0.018 Identities = 39/146 (26%), Positives = 67/146 (45%), Gaps = 4/146 (2%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKY 442 +L TH Y E RF I++ NV+ I +N+ + +FAD++ EF + Sbjct: 46 WLKTHSKLY-GGRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTD-NRFADMTNSEFKAHF 103 Query: 443 LGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAI--MTQS--PDVKDQGMCGSWLGPLALL 610 LGL +T+ + + + + P N A+ TQ +++QG CG A+ Sbjct: 104 LGL-----NTSSLRLHKKQRPVCDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVA 158 Query: 611 VMSRVNIS*RLDSCCHFSEQELVDCD 688 + +N + + SEQ+L+DCD Sbjct: 159 AIEGIN-KIKTGNLVSLSEQQLIDCD 183 Score = 33.9 bits (74), Expect = 4.8 Identities = 21/51 (41%), Positives = 25/51 (49%), Gaps = 2/51 (3%) Frame = +1 Query: 499 NPEVEIPDKFDWRDYDAVTRC*RPGYVRKLAG--AFSVTGNVEGQYKLKTG 645 +P +PD DWR AVT G K G AFS +EG K+KTG Sbjct: 122 DPAGNVPDAVDWRTQGAVTPIRNQG---KCGGCWAFSAVAAIEGINKIKTG 169 >UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sativa|Rep: Cysteine proteinase-like - Oryza sativa subsp. 
japonica (Rice) Length = 360 Score = 41.9 bits (94), Expect = 0.018 Identities = 39/140 (27%), Positives = 60/140 (42%), Gaps = 10/140 (7%) Frame = +2 Query: 296 DAAEMRRRFEIFKGNVRKIHELNTHERGTAVY--GITQFADLSYEEFGKKYLGLK----- 454 DAAE RR E+F N ++ N G Y G+ QF+DL+ +EF + +LG Sbjct: 56 DAAEKARRMEVFAANAERVDAAN-RAGGDRTYTLGLNQFSDLTDDEFAQTHLGYSWAPPP 114 Query: 455 PSLRDTNQIP--MRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRV 625 PS R ++ A P + +VK+Q CGS W A + + Sbjct: 115 PSHRHGHRAENGTAAAAADDTDVPDSVDWRARGAVTEVKNQRSCGSCW--AFAAVAATEG 172 Query: 626 NIS*RLDSCCHFSEQELVDC 685 + + SEQ+++DC Sbjct: 173 LVQLATGNLVSLSEQQVLDC 192 >UniRef50_A2YHE2 Cluster: Putative uncharacterized protein; n=2; Oryza sativa|Rep: Putative uncharacterized protein - Oryza sativa subsp. indica (Rice) Length = 167 Score = 41.9 bits (94), Expect = 0.018 Identities = 25/51 (49%), Positives = 28/51 (54%), Gaps = 5/51 (9%) Frame = +2 Query: 296 DAAEMRRRFEIFKGNVRKIHELNTH---ERG--TAVYGITQFADLSYEEFG 433 D AE RFEIFK VR + N E G + G TQFADL+ EEFG Sbjct: 98 DEAEKAYRFEIFKSTVRFAEKFNAEQVKEHGYCKCILGTTQFADLTLEEFG 148 >UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 2 - Rhipicephalus appendiculatus (Brown ear tick) Length = 564 Score = 41.9 bits (94), Expect = 0.018 Identities = 53/156 (33%), Positives = 69/156 (44%), Gaps = 6/156 (3%) Frame = +2 Query: 236 HTS*TFVYDFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADL 415 HT +F DF THK Y D RRR +IF+ N+R I N G + + AD Sbjct: 256 HTKHSFE-DFKETHKRTYELDTEHDRRR-DIFRQNLRFIDSKNRANLGYNL-AVNHLADR 312 Query: 416 SYEEFG--KKYLGLKPSLRDTNQIPMRQAEIPKLKSPIN--SIGAIMTQSPDVKDQGMCG 583 + EE + L K P R KL I+ GA+ +P VKDQ +CG Sbjct: 313 TREEISVLRGRLQSKDGSSRAEPFP-RHRFTAKLPDQIDWRPYGAV---TP-VKDQAVCG 367 Query: 584 S-W-LGPLALLVMSRVNIS*RLDSCCHFSEQELVDC 685 S W G + L + + RL SEQ+LVDC Sbjct: 368 SCWSFGTVGELEGAYFRKTGRL---VRLSEQQLVDC 400 Score = 35.5 bits (78), Expect = 1.6 Identities = 18/46 (39%), Positives = 25/46 (54%) Frame = +1 Query: 511 
EIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTGQ 648 ++PD+ DWR Y AVT + V +F G +EG Y KTG+ Sbjct: 344 KLPDQIDWRPYGAVTPV-KDQAVCGSCWSFGTVGELEGAYFRKTGR 388 >UniRef50_A0CS14 Cluster: Chromosome undetermined scaffold_26, whole genome shotgun sequence; n=3; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_26, whole genome shotgun sequence - Paramecium tetraurelia Length = 312 Score = 41.9 bits (94), Expect = 0.018 Identities = 43/142 (30%), Positives = 59/142 (41%), Gaps = 1/142 (0%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIP 484 E+ RR IF+ N KI N+ + T + QF D S +EF L P + P Sbjct: 48 EIFRRV-IFRSNYEKIQAHNSDKTQTYSVDVNQFTDFSQDEFVAIQLSFIP---PSGWKP 103 Query: 485 MRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSCCHF 661 + I P +S+ VK+Q CG+ W V + I LD Sbjct: 104 SDEEVIQVGVEPNDSVD--WRSKVRVKNQQWCGAGWAFSAVGAVEAFFKIKKNLD--YSL 159 Query: 662 SEQELVDCDKP*RRM*RGGLPD 727 SEQ L+DCD+ + GG PD Sbjct: 160 SEQYLIDCDRTKNKGCLGGHPD 181 >UniRef50_P01034 Cluster: Cystatin-C precursor; n=28; Eutheria|Rep: Cystatin-C precursor - Homo sapiens (Human) Length = 146 Score = 41.9 bits (94), Expect = 0.018 Identities = 21/66 (31%), Positives = 33/66 (50%), Gaps = 3/66 (4%) Frame = +3 Query: 9 AREQVIAGIHYRMKVEVGLTNCT-ALTNRSDCKHISDESLNK--FCRVNVWMRPWTNHPP 179 AR+Q++AG++Y + VE+G T CT N +C L + FC ++ PW Sbjct: 78 ARKQIVAGVNYFLDVELGRTTCTKTQPNLDNCPFHDQPHLKRKAFCSFQIYAVPWQGTMT 137 Query: 180 NFRVTC 197 + TC Sbjct: 138 LSKSTC 143 >UniRef50_P01035 Cluster: Cystatin-C precursor; n=3; Cetartiodactyla|Rep: Cystatin-C precursor - Bos taurus (Bovine) Length = 148 Score = 41.9 bits (94), Expect = 0.018 Identities = 21/57 (36%), Positives = 32/57 (56%), Gaps = 3/57 (5%) Frame = +3 Query: 9 AREQVIAGIHYRMKVEVGLTNCT-ALTNRSDCKHISDESL--NKFCRVNVWMRPWTN 170 AR+QV++G++Y + VE+G T CT + N C + L K C V++ PW N Sbjct: 81 ARKQVVSGMNYFLDVELGRTTCTKSQANLDSCPFHNQPHLKREKLCSFQVYVVPWMN 137 >UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precursor; n=3; Metazoa|Rep: Digestive cysteine 
proteinase 2 precursor - Homarus americanus (American lobster) Length = 323 Score = 41.9 bits (94), Expect = 0.018 Identities = 41/150 (27%), Positives = 66/150 (44%), Gaps = 6/150 (4%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNT-HERGTAVYGIT--QFADLSYEEFG 433 F + Y+D + RR IF+ N + I E N +E G + + +F D++ EEF Sbjct: 23 FKGKYGRQYVDAEEDSYRRV-IFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEFN 81 Query: 434 KKYLGLKPSLRDTNQI--PMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLA 604 G P + P ++ + + GA+ +P VKDQG CGS W Sbjct: 82 AVMKGNIPRRSAPVSVFYPKKETGPQATEVDWRTKGAV---TP-VKDQGQCGSCWAFSTT 137 Query: 605 LLVMSRVNIS*RLDSCCHFSEQELVDCDKP 694 + + + + S +EQ+LVDC +P Sbjct: 138 GSLEGQHFL--KTGSLISLAEQQLVDCSRP 165 Score = 33.1 bits (72), Expect = 8.4 Identities = 19/39 (48%), Positives = 22/39 (56%) Frame = +1 Query: 529 DWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTG 645 DWR AVT G AFS TG++EGQ+ LKTG Sbjct: 112 DWRTKGAVTPVKDQGQCGS-CWAFSTTGSLEGQHFLKTG 149 >UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 360 Score = 41.5 bits (93), Expect = 0.024 Identities = 34/109 (31%), Positives = 48/109 (44%), Gaps = 1/109 (0%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKK 439 +F + Y DD E + RF +F N +I+ N + V G+ QFADL++EEF Sbjct: 47 NFKVKYAKTYKDDTEE-QYRFSVFTNNYVEIYRHNKFLVFSKV-GVNQFADLTHEEFKAL 104 Query: 440 YLGLKPSL-RDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCG 583 Y G K S D + +Q +P P + VK Q CG Sbjct: 105 YTGHKHSKDDDDDDNKNKQPHLPTDNLPASFDWRDKGAITPVKVQNGCG 153 Score = 37.1 bits (82), Expect = 0.52 Identities = 23/60 (38%), Positives = 31/60 (51%), Gaps = 3/60 (5%) Frame = +1 Query: 478 DSNEAGRNPEVEIPDKFDWRDYDAVTRC*RPGYVRKLAG---AFSVTGNVEGQYKLKTGQ 648 + N+ P +P FDWRD A+T P V+ G AFS ++EG Y LKTG+ Sbjct: 119 NKNKQPHLPTDNLPASFDWRDKGAIT----PVKVQNGCGGCWAFSTVQSIEGLYFLKTGK 174 >UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon 
marinus|Rep: Cathepsin - Petromyzon marinus (Sea lamprey) Length = 333 Score = 41.5 bits (93), Expect = 0.024 Identities = 41/148 (27%), Positives = 71/148 (47%), Gaps = 7/148 (4%) Frame = +2 Query: 269 ATHKPNYIDDAAEMRRRFEIFKGNVRKI--HELNTHERGTAVY-GITQFADLSYEEFGKK 439 +T+ +Y + + RR ++F+ N++++ H L E + + GI +++DL E+ +K Sbjct: 32 STYGKHYGSEQEDAHRR-DVFEQNLKRVLQHNLLADEGNVSFHLGINKYSDLELHEYHEK 90 Query: 440 YLGLKPSLRDTNQIPMRQAEIPKLKS----PINSIGAIMTQSPDVKDQGMCGSWLGPLAL 607 +G +LR N R A P L+S P + VK+QG+CGS A Sbjct: 91 VVGRFWNLR--NGTRRRGAPFP-LRSMDNLPEQVDWRLKGYVTPVKEQGLCGSSWAFSAT 147 Query: 608 LVMSRVNIS*RLDSCCHFSEQELVDCDK 691 + + + + SEQ+LVDC K Sbjct: 148 GSLEGQHFA-ATGNLTSLSEQQLVDCTK 174 >UniRef50_Q1LYJ7 Cluster: Novel protein; n=3; Danio rerio|Rep: Novel protein - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 331 Score = 41.5 bits (93), Expect = 0.024 Identities = 21/74 (28%), Positives = 38/74 (51%), Gaps = 3/74 (4%) Frame = +3 Query: 6 SAREQVIAGIHYRMKVEVGLTNCTALTNR---SDCKHISDESLNKFCRVNVWMRPWTNHP 176 SA +QV+AG Y+++ E+ +NCT + +C + +++ C +V + PW + Sbjct: 182 SATKQVVAGFRYKLQFEIEKSNCTRPEFKIVTEECHPLLEKTEVLKCNSSVDVAPWRHEV 241 Query: 177 PNFRVTCDYQESAT 218 P V C+ S T Sbjct: 242 PEVHVVCEAGVSKT 255 >UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryza sativa|Rep: Putative cysteine proteinase - Oryza sativa subsp. 
japonica (Rice) Length = 352 Score = 41.5 bits (93), Expect = 0.024 Identities = 24/66 (36%), Positives = 30/66 (45%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKY 442 ++A H Y DAAE RRF +FK NV I N +F DL+ EF Y Sbjct: 45 WMAEHGRTY-KDAAEKARRFRVFKANVDLIDRSNAAGNKRYRLATNRFTDLTDAEFAAMY 103 Query: 443 LGLKPS 460 G P+ Sbjct: 104 TGYNPA 109 >UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease Gip1p; n=4; Tetrahymena thermophila|Rep: Granule-biosynthesis induced protease Gip1p - Tetrahymena thermophila Length = 345 Score = 41.5 bits (93), Expect = 0.024 Identities = 40/147 (27%), Positives = 74/147 (50%), Gaps = 10/147 (6%) Frame = +2 Query: 275 HKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLK 454 +K Y+++ ++ R+ F+ N+ +++ +H+ + G+ QF+D++ EEF ++ L K Sbjct: 47 YKRVYLNEEEQIYRQIVFFE-NLASVNKHPSHKSYSK--GLNQFSDMTKEEFKQRVLNKK 103 Query: 455 PSLR-DTNQIPMRQAEIPKLKS---PINSIGAIMTQSP-----DVKDQGMCGS-WLGPLA 604 S + +N+ A P + + P N++ + VK+QG CGS W A Sbjct: 104 ISKKASSNKGGRNLAADPAVSNLVFPTNNLPLSVDWRKRGVLNPVKNQGTCGSCWTFATA 163 Query: 605 LLVMSRVNIS*RLDSCCHFSEQELVDC 685 ++ S I + FSEQ+LVDC Sbjct: 164 GILESFNQI--KNKQLLKFSEQQLVDC 188 >UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|Rep: Phospholipase A1 - Tetrahymena thermophila Length = 320 Score = 41.5 bits (93), Expect = 0.024 Identities = 44/150 (29%), Positives = 71/150 (47%), Gaps = 2/150 (1%) Frame = +2 Query: 248 TFVYDFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEE 427 T F T+ Y D E R F +F N+ + +T +G+TQF DL+ E Sbjct: 38 TLFKQFKQTYNKKYADATFETYR-FGVFTQNLEIVKTDST-------FGVTQFMDLTPAE 89 Query: 428 FGKKYLGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLA 604 F +++L L + T ++ Q E ++ + G + +P VK+QG CGS W Sbjct: 90 FAQQFLTLHEKVNST-EVYRAQGEATEV--DWTAKGKV---TP-VKNQGSCGSCWAFSTI 142 Query: 605 LLVMSRVNIS*RLD-SCCHFSEQELVDCDK 691 V S + I+ + + + + +EQE VDC K Sbjct: 143 GAVESALWIAGQGEQNTLNLAEQEQVDCAK 172 >UniRef50_Q22NW9 Cluster: Papain family cysteine protease containing 
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 894 Score = 41.5 bits (93), Expect = 0.024 Identities = 38/136 (27%), Positives = 60/136 (44%), Gaps = 1/136 (0%) Frame = +2 Query: 287 YIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLK-PSL 463 +I + E R IF N++ I N + GI QF L+ EEF + YL L+ P+ Sbjct: 611 HIINPKEYMYRLNIFAKNLQNIKNHNQISNKPYIEGINQFTHLTEEEFEQTYLTLQIPAS 670 Query: 464 RDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLGPLALLVMSRVNIS*RL 643 + E+P + A+ +P VK+QG CGS + ++ Sbjct: 671 KQYKTQEFLGDEVPS-SIDWRDLNAV---TP-VKNQGSCGSGYAFSTTGALEGIHKISGK 725 Query: 644 DSCCHFSEQELVDCDK 691 D FSEQ+++DC + Sbjct: 726 D-WKGFSEQQIIDCSR 740 Score = 34.7 bits (76), Expect = 2.8 Identities = 18/42 (42%), Positives = 23/42 (54%) Frame = +1 Query: 511 EIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKL 636 E+P DWRD +AVT G AFS TG +EG +K+ Sbjct: 682 EVPSSIDWRDLNAVTPVKNQGSCGS-GYAFSTTGALEGIHKI 722 >UniRef50_UPI0000F2B877 Cluster: PREDICTED: hypothetical protein; n=1; Monodelphis domestica|Rep: PREDICTED: hypothetical protein - Monodelphis domestica Length = 141 Score = 41.1 bits (92), Expect = 0.032 Identities = 21/55 (38%), Positives = 33/55 (60%), Gaps = 3/55 (5%) Frame = +3 Query: 9 AREQVIAGIHYRMKVEVGLTNCT-ALTNRSDCKHISDESLNK--FCRVNVWMRPW 164 A++Q++AGI Y ++VE+ T CT ++T+ S C D +L K C V+ PW Sbjct: 73 AQKQLVAGIKYILEVEISRTTCTKSVTDFSSCPLHEDPTLKKHSICNFVVYFVPW 127 >UniRef50_Q5YER1 Cluster: Cysteine proteinase; n=1; Bigelowiella natans|Rep: Cysteine proteinase - Bigelowiella natans (Pedinomonas minutissima) (Chlorarachnion sp.(strain CCMP 621)) Length = 140 Score = 41.1 bits (92), Expect = 0.032 Identities = 32/97 (32%), Positives = 45/97 (46%) Frame = +2 Query: 296 DAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTN 475 + A+ +R+ FKGN+ + N V + +FADL+ EF Y GLKP+ Sbjct: 41 EVADFFKRYNAFKGNMDFVTRHNVGGYSYTVE-LNEFADLTNAEFRSLYHGLKPNA---- 95 Query: 476 QIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS 586 Q P R A + KS + VK+QG 
CGS Sbjct: 96 QGPRRTANL-STKSADSVDWVSKGAVTPVKNQGQCGS 131 >UniRef50_A5HC50 Cluster: Cathepsin W; n=2; Theria|Rep: Cathepsin W - Oryctolagus cuniculus (Rabbit) Length = 242 Score = 41.1 bits (92), Expect = 0.032 Identities = 17/42 (40%), Positives = 28/42 (66%) Frame = +2 Query: 317 RFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKY 442 R +IF ++ + L + GTA +G+T+F+DL+ EEFG+ Y Sbjct: 3 RLDIFAHHLARAQRLPEEDLGTAEFGVTRFSDLTEEEFGQLY 44 >UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_36, whole genome shotgun sequence - Paramecium tetraurelia Length = 307 Score = 41.1 bits (92), Expect = 0.032 Identities = 38/121 (31%), Positives = 56/121 (46%), Gaps = 1/121 (0%) Frame = +2 Query: 326 IFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIPMRQAEIP 505 IF NV I++ N++ + + QFADL+ EEF YLG KP+ + I + + Sbjct: 51 IFNQNVELINKHNSNPNKSYSMAVNQFADLTDEEFQSMYLG-KPTYVKIDNIELSKG--- 106 Query: 506 KLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSCCHFSEQELVD 682 + + +P +K+QG CGS W V + I R SEQ+LVD Sbjct: 107 ---NTLGDADWASKMNP-IKNQGNCGSCWTFSAIGAVEGFLAI--RKGFKGVLSEQQLVD 160 Query: 683 C 685 C Sbjct: 161 C 161 >UniRef50_Q24F16 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 336 Score = 40.7 bits (91), Expect = 0.042 Identities = 41/148 (27%), Positives = 65/148 (43%), Gaps = 6/148 (4%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKK 439 DF + Y E+ R F +F N+++I LN E TA + +TQF+D + EEF K Sbjct: 42 DFKKSFAKKYNSQEHELFR-FNVFLENLKEIERLNK-EITTAKFDVTQFSDYTKEEFLKL 99 Query: 440 YLG-LKPSLRDTNQIPMRQAEIPKLK-SPINSIGAIMTQSPD----VKDQGMCGSWLGPL 601 + G + P +T+ ++ + K + I P VK+QG C Sbjct: 100 HTGVIIPQEVETSSSSQSNSDQDERKLQSLPLDWDIRVNGPGKLQAVKNQGNCACDTAFS 159 Query: 602 ALLVMSRVNIS*RLDSCCHFSEQELVDC 685 + + S + + FSEQ+ VDC Sbjct: 160 
TSATVENL-YSIKTGTNVSFSEQQFVDC 186 >UniRef50_Q248G1 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 334 Score = 40.7 bits (91), Expect = 0.042 Identities = 38/124 (30%), Positives = 58/124 (46%), Gaps = 4/124 (3%) Frame = +2 Query: 326 IFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIPMRQAEIP 505 IF N R++ N+ + T + QFAD + EEF KY L + T R+ E Sbjct: 59 IFVENKRQVDSHNS-QNPTFTQSLNQFADFTDEEF--KYRVLNTKVSQTRPKKGRRLESR 115 Query: 506 KLKSPI-NSIG--AIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSCCHFSEQE 673 L I S+ + +K+QG CGS W +A +V S + + S ++EQE Sbjct: 116 VLDQQIPESVDWRNVTNVVGPIKNQGHCGSCWTFSIAGIVESHYVL--KHGSYVSYAEQE 173 Query: 674 LVDC 685 ++DC Sbjct: 174 ILDC 177 >UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 514 Score = 40.7 bits (91), Expect = 0.042 Identities = 23/60 (38%), Positives = 32/60 (53%) Frame = +1 Query: 469 YQSDSNEAGRNPEVEIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTGQ 648 Y SN + V++PD+ DWRDY AV+ G + A + G VEG Y +KTG+ Sbjct: 288 YGPYSNMSHVLQRVDVPDELDWRDYGAVSPVRGQG-ICGSCYALAAVGAVEGAYFMKTGK 346 >UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa zeasingle nucleocapsid nuclear polyhedrosis virus) Length = 367 Score = 40.7 bits (91), Expect = 0.042 Identities = 40/157 (25%), Positives = 70/157 (44%), Gaps = 14/157 (8%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHE-----------RGTAVYGITQFA 409 FL + +Y DD E + R+ +FK N+ KI+ N +A +G+ +F+ Sbjct: 60 FLQQYNKSY-DDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNKFS 118 Query: 410 DLSYEEFGKKYLGLKPSLRDTNQIPMRQAE--IPKLKSPINSIGAIMTQSPDVKDQGMCG 583 D + +E G +L + + P ++ P + +KDQG+CG Sbjct: 119 DKTPDEVLHSNTGFFLNLSQHYTLCENRIVKGAPDIRLPDYYDWRDTNKVTPIKDQGVCG 178 Query: 584 
SWLGPLAL-LVMSRVNIS*RLDSCCHFSEQELVDCDK 691 S +A+ + S+ I R + SEQ+L+DCD+ Sbjct: 179 SCWAFVAIGNIESQYAI--RHNKLIDLSEQQLLDCDE 213 Score = 38.3 bits (85), Expect = 0.22 Identities = 18/46 (39%), Positives = 26/46 (56%) Frame = +1 Query: 502 PEVEIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLK 639 P++ +PD +DWRD + VT G V AF GN+E QY ++ Sbjct: 152 PDIRLPDYYDWRDTNKVTPIKDQG-VCGSCWAFVAIGNIESQYAIR 196 >UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Cathepsin - Geodia cydonium (Sponge) Length = 322 Score = 40.3 bits (90), Expect = 0.055 Identities = 41/141 (29%), Positives = 60/141 (42%), Gaps = 4/141 (2%) Frame = +2 Query: 317 RFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIPMRQA 496 R ++ N++ + E ++ G V + +FADL EF Y GL+ ++ P Sbjct: 39 RQRVWLSNLKFVEEFDSEREGYTV-AMNEFADLDPREFVSHYNGLRRRPHTSSGEPCTLG 97 Query: 497 E-IPKLKSPINSIGAIMTQSPDVKDQGMCGS-W-LGPLALLVMSRVNIS*RLDSCCHFSE 667 E + L + ++ VK+QG CGS W L N + +L S SE Sbjct: 98 EDVSALPTTVD--WRTKGYVTGVKNQGQCGSCWAFSATGSLEGQHFNATGKLVS---LSE 152 Query: 668 QELVDCDK-P*RRM*RGGLPD 727 Q LVDC GGLPD Sbjct: 153 QNLVDCSSAEGNEGCNGGLPD 173 >UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleostomi|Rep: Cathepsin O precursor - Homo sapiens (Human) Length = 321 Score = 40.3 bits (90), Expect = 0.055 Identities = 35/115 (30%), Positives = 49/115 (42%), Gaps = 1/115 (0%) Frame = +2 Query: 344 RKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIPMRQAEIPKLKSPI 523 R ++ L E TA YGI QF+ L EEF YL KPS + + IP + P+ Sbjct: 52 RYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVHMS-IPNVSLPL 110 Query: 524 NSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSCCHFSEQELVDC 685 V++Q MCG W + V S I + S Q+++DC Sbjct: 111 RFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGK--PLEDLSVQQVIDC 163 >UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue of Sarcophaga 26,29kDa proteinase; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to homologue of Sarcophaga 26,29kDa proteinase - Nasonia vitripennis Length = 553 Score = 39.9 bits (89), Expect = 0.073 Identities = 42/148 
(28%), Positives = 64/148 (43%), Gaps = 7/148 (4%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEF---- 430 F TH NY D E ++R E F+ N+R IH +N G + + AD + E Sbjct: 251 FKKTHNKNYAHDL-EHKQRKEHFRHNLRFIHSINRANLGFTL-DVNHLADRNEAELKVLR 308 Query: 431 GKKYL--GLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPL 601 GK+Y G + + + +A++P GA+ +P VKDQ +CGS W Sbjct: 309 GKQYTQHGYNGGMPFPHDVEKEKADVPD-SFDWRLYGAV---TP-VKDQSVCGSCWSFGT 363 Query: 602 ALLVMSRVNIS*RLDSCCHFSEQELVDC 685 V + + S+Q L+DC Sbjct: 364 TGAVEGAYFM--KYKKLVRLSQQALIDC 389 Score = 36.7 bits (81), Expect = 0.68 Identities = 19/45 (42%), Positives = 25/45 (55%) Frame = +1 Query: 505 EVEIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLK 639 + ++PD FDWR Y AVT + V +F TG VEG Y +K Sbjct: 331 KADVPDSFDWRLYGAVTPV-KDQSVCGSCWSFGTTGAVEGAYFMK 374 >UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 392 Score = 39.9 bits (89), Expect = 0.073 Identities = 50/155 (32%), Positives = 72/155 (46%), Gaps = 13/155 (8%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGI--TQFADLSYEEFG 433 +F H Y DD+ E RRR IF+ NVR I +N R + Y + FADL+ +EF Sbjct: 90 EFRQQHDKVYEDDS-EHRRRKHIFRHNVRYIRSMN---RRSLPYKLEPNHFADLTDDEF- 144 Query: 434 KKYLGL-----KPSLRDTNQI-----PMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCG 583 K Y G K + D + + R E+P + + GA+ +P K QG CG Sbjct: 145 KSYKGALDDESKDVMNDHDDVIDDDRSKRMFEVPD-QLDWRNYGAV---NP-AKGQGTCG 199 Query: 584 S-WLGPLALLVMSRVNIS*RLDSCCHFSEQELVDC 685 S W A V + I + + +EQ+L+DC Sbjct: 200 SCWAFATAGAVEAAHFI--QKGELLNLAEQQLLDC 232 >UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|Rep: Cysteine proteinase - Acanthamoeba healyi Length = 330 Score = 39.5 bits (88), Expect = 0.097 Identities = 22/63 (34%), Positives = 30/63 (47%) Frame = +1 Query: 460 FARYQSDSNEAGRNPEVEIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLK 639 ++++ A P IP +FDWR AVT G +FS TG+ EG LK Sbjct: 96 YSKHAKIHTAAPEAPATGIPSEFDWRQKGAVTHVKNQGQCGS-CWSFSTTGSTEGANFLK 154 
Query: 640 TGQ 648 TG+ Sbjct: 155 TGR 157 >UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcoptes scabiei type hominis|Rep: Cathepsin L-like protease - Sarcoptes scabiei type hominis Length = 245 Score = 39.5 bits (88), Expect = 0.097 Identities = 36/130 (27%), Positives = 58/130 (44%), Gaps = 3/130 (2%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNT-HERGTAVY--GITQFADLSYEEFGKKYLGLKPSLRDTN 475 E+ R+ IF+ N I + N +E G + Y G+ QF DL+ +E+ + LK + Sbjct: 50 ELLRKL-IFQRNYIYIRKHNEKYEAGLSTYELGVNQFTDLTNKEYNDQMNRLKVKHDVQS 108 Query: 476 QIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLGPLALLVMSRVNIS*RLDSCC 655 + ++ L ++ + +KDQ CGS A+ M N + + Sbjct: 109 EHVFDNEDVSDLPDEVD--WTLKNVVAPIKDQKQCGSCWAFSAVASMESQN-ALKTGQLV 165 Query: 656 HFSEQELVDC 685 SEQELVDC Sbjct: 166 ELSEQELVDC 175 >UniRef50_Q61CG7 Cluster: Putative uncharacterized protein CBG12922; n=1; Caenorhabditis briggsae|Rep: Putative uncharacterized protein CBG12922 - Caenorhabditis briggsae Length = 371 Score = 39.5 bits (88), Expect = 0.097 Identities = 24/82 (29%), Positives = 45/82 (54%), Gaps = 3/82 (3%) Frame = +2 Query: 296 DAAEMRRRFEIFKGNVRKIHELN--THERG-TAVYGITQFADLSYEEFGKKYLGLKPSLR 466 DAAE +RR + F + I LN ++E G T+ +GI +F+DLS +EF ++ + PS + Sbjct: 54 DAAETQRRMQNFIKSYNTIGILNLKSNESGYTSTFGINKFSDLSSKEFQQRLSNIAPSQK 113 Query: 467 DTNQIPMRQAEIPKLKSPINSI 532 + + + + K ++ + Sbjct: 114 SRSTMKKASPFLKRHKRQVDEL 135 >UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n=1; Toxocara canis|Rep: Cathepsin L-like cysteine proteinase - Toxocara canis (Canine roundworm) Length = 360 Score = 39.5 bits (88), Expect = 0.097 Identities = 20/46 (43%), Positives = 25/46 (54%) Frame = +1 Query: 511 EIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTGQ 648 EIPD FDWR Y+ VT + + AF+ G VE Y L TG+ Sbjct: 144 EIPDHFDWRPYNVVTPV-KSQFKCGSCWAFATVGTVESAYALGTGE 188 >UniRef50_Q23FQ5 Cluster: Papain family cysteine protease containing protein; n=4; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena 
thermophila SB210 Length = 332 Score = 39.5 bits (88), Expect = 0.097 Identities = 39/146 (26%), Positives = 71/146 (48%), Gaps = 7/146 (4%) Frame = +2 Query: 269 ATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGT-AVYGIT--QFADLSYEEFGKK 439 ++++ ++++ E R+ F+ N++K L THE+ T A Y ++ QF+D S EEF ++ Sbjct: 41 SSYRRVFLNEDEETYRQLVFFE-NLQK---LKTHEKNTEATYTVSLNQFSDYSQEEFVQR 96 Query: 440 YLGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSP----DVKDQGMCGSWLGPLAL 607 L S D + I Q L+ +N ++ ++ +++QG CGS Sbjct: 97 ILNKHISRSDAD-IQKEQEPNGNLRKAVNYPTSVDWRNSGALNPIQNQGQCGSCAAFGTA 155 Query: 608 LVMSRVNIS*RLDSCCHFSEQELVDC 685 V+ + FSEQ+L+DC Sbjct: 156 GVLESFYYL-KSKQLLKFSEQQLLDC 180 >UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 344 Score = 39.1 bits (87), Expect = 0.13 Identities = 40/149 (26%), Positives = 59/149 (39%), Gaps = 11/149 (7%) Frame = +2 Query: 272 THKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLG- 448 TH Y D + E R+ IF N KI E N+ + G +D+++EEF L Sbjct: 44 THNVKYEDSSIEAYRK-AIFLDNHNKIIEHNSDPSHSYTLGHNHLSDMTHEEFSLYQLNP 102 Query: 449 ----LKPSLRDTNQIPMRQAEIPKLKSPINSIGAI------MTQSPDVKDQGMCGSWLGP 598 K S N + P + PI + A + VK QG CGS Sbjct: 103 ARTASKSSKGGNNSGNSSGSSNPYVDPPITTKNAPPMDWRNASAITPVKQQGKCGSCWTF 162 Query: 599 LALLVMSRVNIS*RLDSCCHFSEQELVDC 685 + V+ + +FSEQ+++DC Sbjct: 163 ASTAVLESFSFIKNGAPLTNFSEQQILDC 191 >UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2; Arabidopsis thaliana|Rep: Putative cysteine proteinase - Arabidopsis thaliana (Mouse-ear cress) Length = 365 Score = 39.1 bits (87), Expect = 0.13 Identities = 23/102 (22%), Positives = 49/102 (48%) Frame = +2 Query: 176 T*LQSDMRLSRKRNN*SVPPHTS*TFVYDFLATHKPNYIDDAAEMRRRFEIFKGNVRKIH 355 T L D+R+S+ R + ++ + + ++ Y D++ E R ++FK N++ I Sbjct: 12 TILSMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYKDES-EKEMRLKVFKKNLKFIE 70 Query: 356 ELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQI 481 N + G+ +F 
D EEF + GL+ ++ +++ Sbjct: 71 NFNNMGNQSYTLGVNEFTDWKTEEFLATHTGLRVNVTSLSEL 112 >UniRef50_O48608 Cluster: Putative thiol protease; n=1; Hordeum vulgare|Rep: Putative thiol protease - Hordeum vulgare (Barley) Length = 111 Score = 39.1 bits (87), Expect = 0.13 Identities = 20/58 (34%), Positives = 31/58 (53%) Frame = +2 Query: 257 YDFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEF 430 + ++A H +Y E RRFE+++ N+ I N R + G T F DL++EEF Sbjct: 50 HGWMAAHGRSY-PTVEEKLRRFEVYRSNMEFIEAANRDSRMSYSLGETPFTDLTHEEF 106 >UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine protease; n=11; Callosobruchus maculatus|Rep: Putative gut cathepsin L-like cysteine protease - Callosobruchus maculatus (Southern cowpea weevil) (Pulse bruchid) Length = 326 Score = 39.1 bits (87), Expect = 0.13 Identities = 41/132 (31%), Positives = 61/132 (46%), Gaps = 5/132 (3%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNT-HERGTAVYG--ITQFADLSYEEFGK--KYLGLKPSLRD 469 E +RRF +F+ N+ I E N +ERG + +TQFAD+++EEF K G+ P+L Sbjct: 39 EEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEEFLDLLKLQGV-PAL-P 96 Query: 470 TNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLGPLALLVMSRVNIS*RLDS 649 +N + E ++ VKDQ CGS A+ + + + Sbjct: 97 SNAVHFDNFEDIDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAIEGQFFK-KNGT 155 Query: 650 CCHFSEQELVDC 685 S QELVDC Sbjct: 156 LVSLSAQELVDC 167 >UniRef50_A0CHZ5 Cluster: Chromosome undetermined scaffold_186, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_186, whole genome shotgun sequence - Paramecium tetraurelia Length = 311 Score = 39.1 bits (87), Expect = 0.13 Identities = 34/127 (26%), Positives = 60/127 (47%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIP 484 E R ++F+ NV+ + E N + V I +FADL+ EEF KYL + +NQ Sbjct: 48 EQEYRRQVFERNVKLVEETNKKQTDF-VLEINEFADLTQEEFSIKYLQYDHQI--SNQ-- 102 Query: 485 MRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLGPLALLVMSRVNIS*RLDSCCHFS 664 + ++ K +N + V++QG C ++ +L + N ++ S Sbjct: 103 
-QTQQLFKDGQDLNQEIDWSKYAGSVRNQGQCAGYI--FNVLDLLDANNKIIKNNQNPLS 159 Query: 665 EQELVDC 685 +Q+L+DC Sbjct: 160 QQDLIDC 166 >UniRef50_UPI00006A00FD Cluster: Cystatin-M precursor (Cystatin-6) (Cystatin-E).; n=3; Xenopus tropicalis|Rep: Cystatin-M precursor (Cystatin-6) (Cystatin-E). - Xenopus tropicalis Length = 149 Score = 38.7 bits (86), Expect = 0.17 Identities = 18/59 (30%), Positives = 33/59 (55%), Gaps = 4/59 (6%) Frame = +3 Query: 6 SAREQVIAGIHYRMKVEVGLTNCTALTNRSDCKHI--SDES--LNKFCRVNVWMRPWTN 170 SA+ QV+AG++Y + +++G TNC + + + +DE+ + C V+ PW N Sbjct: 79 SAKSQVVAGVNYYLTMKIGATNCRKNSENLEACELAQNDEAQLQTRICTFQVYSIPWKN 137 >UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Cathepsin K - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 333 Score = 38.7 bits (86), Expect = 0.17 Identities = 36/111 (32%), Positives = 53/111 (47%), Gaps = 6/111 (5%) Frame = +2 Query: 272 THKPNYIDDAAEMRRRFEIFKGNVRKIHELNT-HERGTAVY--GITQFADLSYEEFGKKY 442 THK Y E RR I++ N+ I N +E G Y G+ F D++ EE +K Sbjct: 36 THKREYNGLNEESIRR-TIWEKNMLFIEAHNKEYELGIHTYDLGMNHFGDMTLEEVAEKV 94 Query: 443 LGLK-PSLRDTNQIPMRQAEIPKLKSPIN--SIGAIMTQSPDVKDQGMCGS 586 +GL+ P RD + + KL I+ +G + + VK+QG CGS Sbjct: 95 MGLQMPMYRDPANTFVPDDRVGKLPKSIDYRKLGYVTS----VKNQGSCGS 141 >UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 350 Score = 38.3 bits (85), Expect = 0.22 Identities = 37/151 (24%), Positives = 67/151 (44%), Gaps = 10/151 (6%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKY 442 F + Y+ + R+ +F N + I + N+ T QF+D++ +EF + Sbjct: 50 FSSGRSRTYLSEEERTYRQI-VFLQNDQNIQKHNSDSNNTYKLQHNQFSDMTKDEFAHRV 108 Query: 443 LGLKPSLRDTNQIPMRQAEIPKLKSPIN-SIGA--------IMTQSPDVKDQGMCGS-WL 592 L L+ + + A+ P+L+ ++ S+ A +VK+QG CGS W Sbjct: 109 --LNSQLKTSASSSSQPAQTPQLRGSVDASLNASQGFDWRNYQGVLGNVKNQGQCGSCWT 166 Query: 593 
GPLALLVMSRVNIS*RLDSCCHFSEQELVDC 685 A ++ S + + FSEQ++VDC Sbjct: 167 FATAGVLESYYAL--KYQQSLIFSEQDIVDC 195 >UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin L - Misgurnus mizolepis (Mud loach) Length = 337 Score = 38.3 bits (85), Expect = 0.22 Identities = 42/144 (29%), Positives = 58/144 (40%), Gaps = 4/144 (2%) Frame = +2 Query: 275 HKPNYIDDAAEMRRRFEIFKGNVRKIHELNT-HERGTAVY--GITQFADLSYEEFGKKYL 445 H NY + RR I++ N+RKI N H G Y G+ F D+++EEF + Sbjct: 36 HGKNYHEKEEGWRRM--IWEKNLRKIQFHNLEHSMGIHTYRLGMNHFGDMNHEEFRQVMN 93 Query: 446 GLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSR 622 G K + + E L+ P VKDQG CGS W + Sbjct: 94 GYKHKTERKFKGSLFM-EPNFLEVPSKLDWREKGYVTPVKDQGECGSCW--AFSTTGAME 150 Query: 623 VNIS*RLDSCCHFSEQELVDCDKP 694 + + SEQ LVDC +P Sbjct: 151 GQMFRKQGKLVSLSEQNLVDCSRP 174 Score = 33.1 bits (72), Expect = 8.4 Identities = 19/47 (40%), Positives = 24/47 (51%) Frame = +1 Query: 508 VEIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTGQ 648 +E+P K DWR+ VT G AFS TG +EGQ K G+ Sbjct: 114 LEVPSKLDWREKGYVTPVKDQGECGS-CWAFSTTGAMEGQMFRKQGK 159 >UniRef50_Q22AB1 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 344 Score = 38.3 bits (85), Expect = 0.22 Identities = 42/131 (32%), Positives = 63/131 (48%), Gaps = 14/131 (10%) Frame = +2 Query: 371 ERGTAVYGITQFADLSYEEFGKKYLGLKPSL--RDTNQIPMRQAEIPK--LKSPINSIGA 538 E A +G T+F+D+S EEF K L SL + +Q +AE K L+ N + Sbjct: 70 ENPNAKFGHTKFSDMSPEEFENKMLNFDFSLFKKAKSQGIKLKAEPMKGYLRQGENVDNS 129 Query: 539 IMTQSPDVKDQGM---------CGS-WLGPLALLVMSRVNIS*RLDSCCHFSEQELVDCD 688 + +S D +D+G+ CGS W ++ S+ + + HFSEQ L+DCD Sbjct: 130 DLPESFDWRDKGIITPAKFQNTCGSCWTFATTGVIESQYAL--KYGELLHFSEQMLLDCD 187 Query: 689 KP*RRM*RGGL 721 + RGGL Sbjct: 188 NI-NQGCRGGL 197 >UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Plasmodium|Rep: Cysteine proteinase precursor - Plasmodium vivax (strain Salvador 
I) Length = 583 Score = 38.3 bits (85), Expect = 0.22 Identities = 40/162 (24%), Positives = 74/162 (45%), Gaps = 17/162 (10%) Frame = +2 Query: 257 YDFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLS---YEE 427 ++F+ +K +Y D E +++ FK N KI + N + + + QF+D S +E Sbjct: 238 FNFMNKYKRSY-KDINEQMEKYKNFKMNYLKIKKHNETNQMYKMK-VNQFSDYSKKDFES 295 Query: 428 FGKKYLGLKPSLRDTNQIPMRQAEIPKLKSPI-NSIGA-IMTQSPDV------------K 565 + +K + + L+ +P K K+ + +S GA ++ P++ K Sbjct: 296 YFRKLVPIPDHLKKKYVVPFSSMNNGKGKNVVTSSSGANLLADVPEILDYREKGIVHEPK 355 Query: 566 DQGMCGSWLGPLALLVMSRVNIS*RLDSCCHFSEQELVDCDK 691 DQG+CGS ++ + + + SEQE+VDC K Sbjct: 356 DQGLCGSCWAFASVGNVECMYAKEHNKTILTLSEQEVVDCSK 397 >UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; Leishmania|Rep: Cysteine proteinase 1 precursor - Leishmania pifanoi Length = 354 Score = 38.3 bits (85), Expect = 0.22 Identities = 44/136 (32%), Positives = 62/136 (45%), Gaps = 7/136 (5%) Frame = +2 Query: 302 AEMRRRFEIFKGNVRKIHELNTHERGTAVYGIT-QFADLSYEEFGKKYLG---LKPSLRD 469 AE RF FK N++ + LNT + A Y ++ +FADL+ +EF K YL L+D Sbjct: 57 AEEGHRFNAFKQNMQTAYFLNT-QNPHAHYDVSGKFADLTPQEFAKLYLNPDYYARHLKD 115 Query: 470 TNQIPMRQAEIPK--LKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*R 640 + P + GA+ +P VK+QG+CGS W + + S Sbjct: 116 HKEDVHVDDSAPSGVMSVDWRDKGAV---TP-VKNQGLCGSCWAFSAIGNIEGQWAASGH 171 Query: 641 LDSCCHFSEQELVDCD 688 S SEQ LV CD Sbjct: 172 --SLVSLSEQMLVSCD 185 >UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin O; n=1; Monodelphis domestica|Rep: PREDICTED: similar to cathepsin O - Monodelphis domestica Length = 414 Score = 37.9 bits (84), Expect = 0.30 Identities = 34/127 (26%), Positives = 58/127 (45%), Gaps = 4/127 (3%) Frame = +2 Query: 317 RFEIFKGNVRKIHELNTH---ERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIPM 487 R F+ ++++ H LN+ + +A+YGI QF+ L EEF YL KPS+ + Sbjct: 133 RSTAFRESLKRHHYLNSFSSSDNTSAIYGINQFSYLFPEEFKDIYLRSKPSVLPLYSEAL 192 Query: 488 RQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSCCHFS 664 + + P+ V++Q MCG W + + S I + +S S Sbjct: 
193 KM-PTTHMPLPVRFDWRDKHVVTKVRNQQMCGGCWAFSVVGSIESAYAI--KGESLEDLS 249 Query: 665 EQELVDC 685 Q+++DC Sbjct: 250 VQQVIDC 256 >UniRef50_A1Z9I0 Cluster: CG6357-PA; n=3; Drosophila melanogaster|Rep: CG6357-PA - Drosophila melanogaster (Fruit fly) Length = 439 Score = 37.9 bits (84), Expect = 0.30 Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 3/62 (4%) Frame = +2 Query: 263 FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTH-ERGTAVY--GITQFADLSYEEFG 433 FL KP+Y DD E +R +F N + IH+ N + G + GI Q++DL+ EE+ Sbjct: 255 FLIDFKPSYQDD-TETEKRRNVFCDNFKSIHKHNVQFDLGNISFKKGINQWSDLTVEEWK 313 Query: 434 KK 439 K Sbjct: 314 NK 315 >UniRef50_P22085 Cluster: Onchocystatin precursor; n=6; Onchocercidae|Rep: Onchocystatin precursor - Onchocerca volvulus Length = 162 Score = 37.9 bits (84), Expect = 0.30 Identities = 18/55 (32%), Positives = 28/55 (50%), Gaps = 4/55 (7%) Frame = +3 Query: 18 QVIAGIHYRMKVEVGLTNCTALTNR----SDCKHISDESLNKFCRVNVWMRPWTN 170 QV+AG+ Y+M V+V + C +N + CK + K + VW +PW N Sbjct: 97 QVVAGVKYKMDVQVARSQCKKSSNEKVDLTKCKKLEGHP-EKVMTLEVWEKPWEN 150 >UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens (Human) Length = 331 Score = 37.9 bits (84), Expect = 0.30 Identities = 21/50 (42%), Positives = 25/50 (50%) Frame = +1 Query: 499 NPEVEIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTGQ 648 NP +PD DWR+ VT G AFS G +E Q KLKTG+ Sbjct: 110 NPNRILPDSVDWREKGCVTEVKYQGSCGA-CWAFSAVGALEAQLKLKTGK 158 >UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 21 SCAF14577, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 478 Score = 37.5 bits (83), Expect = 0.39 Identities = 19/46 (41%), Positives = 26/46 (56%) Frame = +1 Query: 508 VEIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTG 645 VE+P+ DWR Y AVT + + +F+ TG +EG LKTG Sbjct: 203 VEVPESLDWRLYGAVTPV-KDQAICGSCWSFATTGTIEGALFLKTG 247 >UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh fly) 
(Boettcherisca peregrina). Cathepsin L; n=2; Dictyostelium discoideum|Rep: Similar to Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina). Cathepsin L - Dictyostelium discoideum (Slime mold) Length = 265 Score = 37.5 bits (83), Expect = 0.39 Identities = 19/50 (38%), Positives = 25/50 (50%) Frame = +1 Query: 499 NPEVEIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTGQ 648 N IP FDWRD+ AV + G +FS G +EG Y +K G+ Sbjct: 42 NVNATIPKSFDWRDHGAVGKVKNQGSCAS-CWSFSALGALEGHYYIKYGE 90 >UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1; Uronema marinum|Rep: Cathepsin L-like cysteine protease - Uronema marinum Length = 333 Score = 37.5 bits (83), Expect = 0.39 Identities = 34/137 (24%), Positives = 63/137 (45%), Gaps = 2/137 (1%) Frame = +2 Query: 284 NYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGI-TQFADLSYEEFGKKYLGLKPS 460 N + ++E RF+++ N + + E N + T G+ QFA ++ EEF ++ S Sbjct: 44 NLVYSSSEDAYRFQVYFENFQFVEEFNANNSFTL--GVENQFAAMTNEEFKAQFTSEIIS 101 Query: 461 LRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPD-VKDQGMCGSWLGPLALLVMSRVNIS* 637 N + + + +P S+ + + V++QG+CGS A+ + R+ Sbjct: 102 -EGYNYQQVDRNVYEAVNAPSGSVNWVSKGAVQGVQNQGVCGSCWAFSAVCSLERL-YKI 159 Query: 638 RLDSCCHFSEQELVDCD 688 FSEQ+LV C+ Sbjct: 160 NTGKLLSFSEQQLVSCE 176 >UniRef50_A7LFV2 Cluster: Cathepsin L protease inhibitor 1; n=1; Diaprepes abbreviatus|Rep: Cathepsin L protease inhibitor 1 - Diaprepes abbreviatus (Sugarcane rootstalk borer weevil) Length = 109 Score = 37.5 bits (83), Expect = 0.39 Identities = 22/60 (36%), Positives = 31/60 (51%), Gaps = 3/60 (5%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIH-ELNTHERGTAVY--GITQFADLSYEEF 430 +F NY + E +RFEIFK N++ I +E G Y G+ F DL++EEF Sbjct: 37 NFKTKFNRNY-ESPEEESKRFEIFKNNLKDIQAHQKKYEAGEVSYQQGVNDFTDLTHEEF 95 >UniRef50_Q7M429 Cluster: L-cystatin precursor; n=1; Tachypleus tridentatus|Rep: L-cystatin precursor - Tachypleus tridentatus (Japanese horseshoe crab) Length = 133 Score = 37.5 bits (83), Expect = 0.39 Identities = 18/69 (26%), Positives = 33/69 (47%), Gaps = 4/69 (5%) Frame 
= +3 Query: 3 NSAREQVIAGIHYRMKVEVGLTNC----TALTNRSDCKHISDESLNKFCRVNVWMRPWTN 170 + AR QV++GI+Y + +E G T C L + C + + + C+ VW++ W Sbjct: 62 HKARTQVVSGINYEVFIETGTTTCKKSEVPLEDLKRCA-VPENGVKHLCQAIVWVQAWIP 120 Query: 171 HPPNFRVTC 197 ++ C Sbjct: 121 RTKVTKLEC 129 >UniRef50_Q23FR0 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 394 Score = 37.1 bits (82), Expect = 0.52 Identities = 19/64 (29%), Positives = 34/64 (53%) Frame = +2 Query: 275 HKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLK 454 H+ Y+++ ++ R+ IF N+ K++E N T G+ +F+D + EEF + L K Sbjct: 43 HQRVYLNEHEQLFRQL-IFLENLAKVNEHNQKSNATYTIGLNKFSDFTQEEFKHRILNKK 101 Query: 455 PSLR 466 R Sbjct: 102 LGTR 105 >UniRef50_P01038 Cluster: Cystatin precursor; n=2; Phasianidae|Rep: Cystatin precursor - Gallus gallus (Chicken) Length = 139 Score = 37.1 bits (82), Expect = 0.52 Identities = 18/58 (31%), Positives = 32/58 (55%), Gaps = 3/58 (5%) Frame = +3 Query: 6 SAREQVIAGIHYRMKVEVGLTNCTALT-NRSDCKHISDESLNKF--CRVNVWMRPWTN 170 SA+ Q+++GI Y ++VE+G T C + + C+ + + K+ C V+ PW N Sbjct: 72 SAKRQLVSGIKYILQVEIGRTTCPKSSGDLQSCEFHDEPEMAKYTTCTFVVYSIPWLN 129 >UniRef50_O76096 Cluster: Cystatin-F precursor; n=13; Eutheria|Rep: Cystatin-F precursor - Homo sapiens (Human) Length = 145 Score = 37.1 bits (82), Expect = 0.52 Identities = 17/56 (30%), Positives = 28/56 (50%), Gaps = 4/56 (7%) Frame = +3 Query: 18 QVIAGIHYRMKVEVGLTNC--TALTNRSDCKHISDESLNK--FCRVNVWMRPWTNH 173 Q++ G+ Y ++VE+G T C DC ++ +L + C VW+ PW H Sbjct: 81 QIVKGLKYMLEVEIGRTTCKKNQHLRLDDCDFQTNHTLKQTLSCYSEVWVVPWLQH 136 >UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin O precursor; n=2; Apocrita|Rep: PREDICTED: similar to Cathepsin O precursor - Apis mellifera Length = 374 Score = 36.7 bits (81), Expect = 0.68 Identities = 16/59 (27%), Positives = 36/59 (61%), Gaps = 2/59 (3%) Frame = +2 Query: 260 
DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELN--THERGTAVYGITQFADLSYEEF 430 +++ + +Y ++ +E RF+ F+ +++ I +N + +A YG+T+F+D+S EF Sbjct: 59 NYVIRYNKSYRNNPSEYEERFKRFQRSLQHIERMNGLRSSQESAYYGLTEFSDMSENEF 117 >UniRef50_A3EYB2 Cluster: Vap1; n=2; Mammalia|Rep: Vap1 - Trichosurus vulpecula (Brush-tailed possum) Length = 172 Score = 36.7 bits (81), Expect = 0.68 Identities = 20/70 (28%), Positives = 34/70 (48%), Gaps = 3/70 (4%) Frame = +3 Query: 12 REQVIAGIHYRMKVEVGLTNCT-ALTNRSDCKHISDESLNK--FCRVNVWMRPWTNHPPN 182 R+Q++AG+ Y + EV T CT ++ + + C + D +L K C V+ PW Sbjct: 49 RKQLVAGVKYYIDAEVRRTTCTKSVADLASCPYHEDPALKKHSVCVFEVYTIPWLGKTTL 108 Query: 183 FRVTCDYQES 212 + C E+ Sbjct: 109 LKNECKDAEA 118 >UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Longidorus elongatus Length = 358 Score = 36.7 bits (81), Expect = 0.68 Identities = 42/153 (27%), Positives = 69/153 (45%), Gaps = 10/153 (6%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNT-HERGTAVYGIT--QFADLSYEEF 430 +F H +Y E+ R F++F N + I + N +E G + ++ +FAD++ EF Sbjct: 45 NFKLKHAKSYKTKDEELLR-FQVFASNHKVIEQHNIEYEAGQHSFALSLNKFADMTNAEF 103 Query: 431 GKKYLGLK-PSLRD-TNQIPMRQA----EIPKLKSPINSIGAIMT-QSPDVKDQGMCGSW 589 ++ G K P+ R P+++ E+P + +S+ VKDQG CGS Sbjct: 104 RQRMNGFKLPAKRKLAKSQPLKEDGMIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSC 163 Query: 590 LGPLALLVMSRVNIS*RLDSCCHFSEQELVDCD 688 A + + + SEQ LVDCD Sbjct: 164 WAFSATGSLEGQHYK-QTGKLVSLSEQNLVDCD 195 Score = 36.3 bits (80), Expect = 0.90 Identities = 20/47 (42%), Positives = 26/47 (55%) Frame = +1 Query: 508 VEIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTGQ 648 V IPD DWR VT+ G AFS TG++EGQ+ +TG+ Sbjct: 137 VTIPDSVDWRKEGYVTKVKDQGSCGS-CWAFSATGSLEGQHYKQTGK 182 >UniRef50_Q23H15 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 370 Score = 36.7 bits (81), Expect = 0.68 Identities = 35/103 (33%), 
Positives = 51/103 (49%), Gaps = 4/103 (3%) Frame = +2 Query: 398 TQFADLSYEEFGKKYLGLKPSLRDTNQIPMRQAEI--PKLKSPIN--SIGAIMTQSPDVK 565 TQ D++ EEF +K + +K L D I E P L + I+ + GA+ + VK Sbjct: 124 TQLPDMTKEEFTEK-IDMKQDLVDHLMIRRSLTEFKSPTLAASIDWRTKGAVTS----VK 178 Query: 566 DQGMCGSWLGPLALLVMSRVNIS*RLDSCCHFSEQELVDCDKP 694 +QG CGS A +M N + + FSEQ+L+DC P Sbjct: 179 NQGNCGSCWSFSAAGLMESFNFI-QNKALVDFSEQQLLDCVIP 220 >UniRef50_Q22DX2 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 358 Score = 36.7 bits (81), Expect = 0.68 Identities = 44/149 (29%), Positives = 67/149 (44%), Gaps = 26/149 (17%) Frame = +2 Query: 317 RFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGL-KPSLRDTNQI---- 481 RF F N+++I LN E TA + I+ F+D + EEF + G KP++ D +Q+ Sbjct: 59 RFATFVENLKEIDRLNA-EVTTAQFDISFFSDFTKEEFLNLFTGAHKPAMSDQDQLQNNN 117 Query: 482 ----------------PMRQAEIPKLKSPINSIGAIMTQSP----DVKDQGMCGS-WLGP 598 Q E +++ I S I T P V++QG CGS W Sbjct: 118 NSNNQNDQSNNQKSSDKSNQNEQKQIEESIPSSWDIRTDGPGLLQPVENQGQCGSCWAFS 177 Query: 599 LALLVMSRVNIS*RLDSCCHFSEQELVDC 685 + V S S + + + S+Q+LVDC Sbjct: 178 TSGAVES--YYSAKKNITLNLSKQQLVDC 204 >UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schistosoma|Rep: Preprocathepsin cathepsin L - Schistosoma japonicum (Blood fluke) Length = 331 Score = 36.7 bits (81), Expect = 0.68 Identities = 21/57 (36%), Positives = 28/57 (49%) Frame = +1 Query: 469 YQSDSNEAGRNPEVEIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLK 639 + D NE + +P +DWRD+ AVT G AFS TG +EGQ + K Sbjct: 102 WNDDGNELELTNK-PVPSTWDWRDHGAVTAVKHQGLCGS-CWAFSATGAIEGQLRRK 156 >UniRef50_A2FLT7 Cluster: Putative uncharacterized protein; n=1; Trichomonas vaginalis G3|Rep: Putative uncharacterized protein - Trichomonas vaginalis G3 Length = 229 Score = 36.7 bits (81), Expect = 0.68 Identities = 21/56 (37%), Positives = 35/56 (62%) Frame = +2 Query: 302 
AEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRD 469 A+M+ ++FK +V+ I ELN + IT F D+S+EE +KY+ L+ +L+D Sbjct: 126 AKMKSYNDMFKQDVKSISELNVSGSQDEI-AIT-FPDMSHEEMEQKYMKLEHNLKD 179 >UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MGC107932 protein - Xenopus tropicalis (Western clawed frog) (Silurana tropicalis) Length = 333 Score = 36.3 bits (80), Expect = 0.90 Identities = 37/136 (27%), Positives = 55/136 (40%), Gaps = 2/136 (1%) Frame = +2 Query: 290 IDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRD 469 +D R+ +E V+K ++L + + QFADL+ E K L P + Sbjct: 41 LDKELNRRKAWEATWEKVQKHNQLADQGLKSYRMAMNQFADLTDNERSSKSC-LLPREKS 99 Query: 470 TNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQG-MCGS-WLGPLALLVMSRVNIS*RL 643 N + + P VK+QG CGS W ++ SR I R Sbjct: 100 LNPVKAESYSYTSITIPKEVDWRKSNCVTPVKNQGTFCGSCWAFATVGVMESRYCI--RT 157 Query: 644 DSCCHFSEQELVDCDK 691 + SEQ+LVDCD+ Sbjct: 158 KELLNLSEQQLVDCDE 173 >UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-PA - Drosophila melanogaster (Fruit fly) Length = 549 Score = 36.3 bits (80), Expect = 0.90 Identities = 19/45 (42%), Positives = 26/45 (57%) Frame = +1 Query: 511 EIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTG 645 EIPD++DWR Y AVT + V +F G++EG + LK G Sbjct: 329 EIPDQYDWRLYGAVTPV-KDQSVCGSCWSFGTIGHLEGAFFLKNG 372 >UniRef50_Q1WDN1 Cluster: Cystatin-2; n=1; Haemaphysalis longicornis|Rep: Cystatin-2 - Haemaphysalis longicornis (Bush tick) Length = 131 Score = 36.3 bits (80), Expect = 0.90 Identities = 20/54 (37%), Positives = 29/54 (53%), Gaps = 2/54 (3%) Frame = +3 Query: 18 QVIAGIHYRMKVEVGLTNCTALTNRS--DCKHISDESLNKFCRVNVWMRPWTNH 173 QV+AGI+YR+ E TNC S +CK ++ + C V+ RPW N+ Sbjct: 69 QVVAGINYRVIFETAPTNCPVNEKYSIENCKPTTNMP-SATCIATVYERPWENY 121 >UniRef50_O08677 Cluster: Kininogen-1 precursor [Contains: Kininogen-1 heavy chain; Bradykinin; Kininogen-1 light chain]; n=43; Coelomata|Rep: Kininogen-1 precursor [Contains: Kininogen-1 heavy chain; Bradykinin; Kininogen-1 light chain] - Mus musculus (Mouse) Length = 661 Score = 
36.3 bits (80), Expect = 0.90 Identities = 23/59 (38%), Positives = 32/59 (54%), Gaps = 5/59 (8%) Frame = +3 Query: 9 AREQVIAGIHYRMKVEVGLTNCTALTNRS---DC--KHISDESLNKFCRVNVWMRPWTN 170 A QV+AG Y ++ T C+ +N DC KH+ +SL+ C NV+MRPW N Sbjct: 306 ATSQVVAGTKYVIEFIARETKCSKESNTELAEDCEIKHLG-QSLD--CNANVYMRPWEN 361 >UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin heavy chain; n=3; Amniota|Rep: PREDICTED: similar to ferritin heavy chain - Ornithorhynchus anatinus Length = 338 Score = 35.9 bits (79), Expect = 1.2 Identities = 44/148 (29%), Positives = 57/148 (38%), Gaps = 11/148 (7%) Frame = +2 Query: 275 HKPNYIDDAAEMRRRFEIFKGNVRKIHELNTH-ERGTAVY--GITQFADLSYEEFGKKYL 445 H NY +A E+ RR ++ NVR I N +G Y + F D + EE ++ Sbjct: 35 HGKNYSVEAEEVFRR-AAWEKNVRVIERHNEEMSQGKHSYRLAMNHFGDQTNEELHERLN 93 Query: 446 GLKPSLRDTNQIPMRQAEIPKLKS---PINSIGAIMTQSPDVKDQGMCGS-W----LGPL 601 G +P L + QA S P VK+QG+CGS W G L Sbjct: 94 GFRPDLGGALRSGREQARFRSKTSWEGPEEVDWRTKGYVTPVKNQGLCGSCWAFSATGAL 153 Query: 602 ALLVMSRVNIS*RLDSCCHFSEQELVDC 685 LV SEQ LVDC Sbjct: 154 EALVFKTTG------KMVSLSEQNLVDC 175 >UniRef50_UPI0000E255D2 Cluster: PREDICTED: similar to Cystatin C precursor (Neuroendocrine basic polypeptide) (Gamma-trace) (Post-gamma-globulin); n=1; Pan troglodytes|Rep: PREDICTED: similar to Cystatin C precursor (Neuroendocrine basic polypeptide) (Gamma-trace) (Post-gamma-globulin) - Pan troglodytes Length = 242 Score = 35.9 bits (79), Expect = 1.2 Identities = 18/62 (29%), Positives = 29/62 (46%), Gaps = 3/62 (4%) Frame = +3 Query: 21 VIAGIHYRMKVEVGLTNCT-ALTNRSDCKHISDESLNK--FCRVNVWMRPWTNHPPNFRV 191 ++AG++Y + VE+G T CT N +C L + FC ++ PW + Sbjct: 178 IVAGVNYFLDVELGRTTCTKTQPNLDNCPFHDQPHLKRKAFCSFQIYAVPWQGTMTLSKS 237 Query: 192 TC 197 TC Sbjct: 238 TC 239 >UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba histolytica|Rep: Cysteine protease 17 - Entamoeba histolytica Length = 420 Score = 35.9 bits (79), Expect = 1.2 Identities = 23/61 (37%), Positives = 35/61 (57%) Frame = +2 Query: 260 
DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKK 439 +FL ++ Y +EM RR IF+ + ++I E N E + GITQFAD + EEF + Sbjct: 23 EFLKANQIVY-STPSEMLRRRAIFEQSKKEIEEFNK-EPHSFFLGITQFADKTDEEFNQM 80 Query: 440 Y 442 + Sbjct: 81 F 81 >UniRef50_Q711N7 Cluster: Putative cys1 protein; n=1; Fasciola hepatica|Rep: Putative cys1 protein - Fasciola hepatica (Liver fluke) Length = 690 Score = 35.9 bits (79), Expect = 1.2 Identities = 17/54 (31%), Positives = 25/54 (46%) Frame = +3 Query: 3 NSAREQVIAGIHYRMKVEVGLTNCTALTNRSDCKHISDESLNKFCRVNVWMRPW 164 + A EQV+AG+ R K+ + C C ++ L C+V W RPW Sbjct: 396 SDAEEQVVAGLITRFKLRMEPVACKRTARNRQCNPLNSR-LRVECQVVFWERPW 448 >UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2; Taeniidae|Rep: Cathepsin L-like cysteine proteinase - Taenia solium (Pork tapeworm) Length = 339 Score = 35.9 bits (79), Expect = 1.2 Identities = 29/101 (28%), Positives = 45/101 (44%), Gaps = 3/101 (2%) Frame = +2 Query: 392 GITQFADLSYEEFGKKYLGLKPSLR---DTNQIPMRQAEIPKLKSPINSIGAIMTQSPDV 562 G+ QFADL EF +++LG +P R +I A L ++ + +V Sbjct: 82 GLNQFADLESSEFSERFLGTRPESRVAGRRGRIWKALASAAGLPDTVDWRDKNLV--TEV 139 Query: 563 KDQGMCGSWLGPLALLVMSRVNIS*RLDSCCHFSEQELVDC 685 K+QG CGS + + + + SEQ+LVDC Sbjct: 140 KNQGNCGSCWAFSSTGALEGA-FAKKTGKLISLSEQQLVDC 179 >UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio molitor (Yellow mealworm) Length = 336 Score = 35.9 bits (79), Expect = 1.2 Identities = 20/53 (37%), Positives = 24/53 (45%) Frame = +1 Query: 487 EAGRNPEVEIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTG 645 + G N V P FDWRD V+ G AFS TG +E Q K+ G Sbjct: 112 DLGLNASVRYPASFDWRDQGMVSPVKNQGSCGS-CWAFSSTGAIESQMKIANG 163 >UniRef50_Q24FN2 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 367 Score = 35.9 bits (79), Expect = 1.2 Identities = 20/42 (47%), Positives = 26/42 (61%) Frame = +2 
Query: 560 VKDQGMCGSWLGPLALLVMSRVNIS*RLDSCCHFSEQELVDC 685 VK+QG CGS A+ + VN+ R +S +SEQELVDC Sbjct: 170 VKNQGSCGSCWAFSAVALAESVNLL-RNNSLALYSEQELVDC 210 >UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platyhelminthes|Rep: Cathepsin L-like proteinase - Echinococcus multilocularis Length = 338 Score = 35.9 bits (79), Expect = 1.2 Identities = 31/100 (31%), Positives = 44/100 (44%), Gaps = 3/100 (3%) Frame = +2 Query: 395 ITQFADLSYEEFGKKYLGLK--PSLRDTNQIPMRQAEIP-KLKSPINSIGAIMTQSPDVK 565 + FADL+ EEF +KYL LK P + + E P ++ P + +K Sbjct: 79 LNAFADLTLEEFAEKYLTLKQTPMEGIWQDMSTQYVERPTRMLVPDSIDWRKKGLVTPIK 138 Query: 566 DQGMCGSWLGPLALLVMSRVNIS*RLDSCCHFSEQELVDC 685 DQG CGS A + + + SEQ+LVDC Sbjct: 139 DQGDCGSCWAFSATGALEG-QLKRKTGKLISLSEQQLVDC 177 >UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3; Bilateria|Rep: Cathepsin L-like cysteine protease - Neobenedenia melleni Length = 335 Score = 35.9 bits (79), Expect = 1.2 Identities = 41/145 (28%), Positives = 63/145 (43%), Gaps = 8/145 (5%) Frame = +2 Query: 275 HKPNYIDDAAEMRRRFEIFKG--NVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLG 448 ++ +Y+ E+ + K VRK +EL + + + ADLS EEF K L Sbjct: 34 YQKDYLSSEDELNKLLTWSKNLETVRKHNELYAQGKKSYTLAMNHMADLSSEEF--KALY 91 Query: 449 LKPSLRDTNQIPMR---QAEIPKLKS-PINSIGAIMT-QSPDVKDQGMCGS-WLGPLALL 610 L P D ++P + E ++K+ P + I + VK+Q CGS W Sbjct: 92 LVPKF-DATKVPRKGKAAGEHRQIKNDPPSEIDWVRKGHVTAVKNQAQCGSCWAFSSTGS 150 Query: 611 VMSRVNIS*RLDSCCHFSEQELVDC 685 + V + FSEQ+LVDC Sbjct: 151 IEGAVKRA--TGKLISFSEQQLVDC 173 >UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|Rep: CG4847-PD, isoform D - Drosophila melanogaster (Fruit fly) Length = 420 Score = 35.9 bits (79), Expect = 1.2 Identities = 20/48 (41%), Positives = 24/48 (50%) Frame = +1 Query: 502 PEVEIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTG 645 P IPD FDWR++ VT G AF+ TG +EG KTG Sbjct: 199 PAKPIPDAFDWREHGGVTPVKFQGTCGS-CWAFATTGAIEGHTFRKTG 245 >UniRef50_Q4RQ21 Cluster: Chromosome 17 SCAF15006, whole genome shotgun sequence; n=2; 
Tetraodontidae|Rep: Chromosome 17 SCAF15006, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 130 Score = 35.5 bits (78), Expect = 1.6 Identities = 17/62 (27%), Positives = 28/62 (45%) Frame = +3 Query: 12 REQVIAGIHYRMKVEVGLTNCTALTNRSDCKHISDESLNKFCRVNVWMRPWTNHPPNFRV 191 + QV++G+ Y + V + + C + + C+ I + C VW RPW N Sbjct: 64 QRQVVSGLKYVITVNMARSLCRKSSPQEVCE-IPQSAQPYQCTFTVWTRPWVNEVKLLNE 122 Query: 192 TC 197 TC Sbjct: 123 TC 124 >UniRef50_Q70AR5 Cluster: Putative cytochrome P450; n=1; Streptomyces peucetius|Rep: Putative cytochrome P450 - Streptomyces peucetius Length = 477 Score = 35.5 bits (78), Expect = 1.6 Identities = 15/45 (33%), Positives = 26/45 (57%) Frame = -1 Query: 456 GFKPRYFFPNSS*DKSANCVIPYTAVPRSCVFNS*IFLTLPLKIS 322 GF+P F P +S ++ +P+ A PR C+ +S L +PL ++ Sbjct: 392 GFEPERFTPENSANRHRMAYLPFGAGPRKCIGDSFAMLQMPLVVA 436 >UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; Phytophthora infestans|Rep: Cathepsin-like cysteine protease - Phytophthora infestans (Potato late blight fungus) Length = 376 Score = 35.5 bits (78), Expect = 1.6 Identities = 43/153 (28%), Positives = 72/153 (47%), Gaps = 11/153 (7%) Frame = +2 Query: 260 DFLATHKPNYIDDAAE---MRRRFEIFKGNVRKIHELN-THERGTAVY--GITQFADLSY 421 D+ ++ +Y +DA + ++ RF F N+ +I N +ERG + G+ ADL+ Sbjct: 42 DYALDYEKSYRNDANDHDVVQLRFRSFATNLERIQTHNEAYERGEHSFTLGLNDLADLAD 101 Query: 422 EEFGKKYLGLKPSLRDTNQIPMRQAEI-PKLKSPINSIGAIMTQSP--DVKDQGMCGS-W 589 E+ K+ L + RD+ + + P+ + + S VK+QG CGS W Sbjct: 102 AEY-KQLLSYRT--RDSKSSSASETFVKPENVEDLPATWDWREHSTVTPVKNQGQCGSCW 158 Query: 590 -LGPLALLVMSRVNIS*RLDSCCHFSEQELVDC 685 +A + + + L+S SEQELVDC Sbjct: 159 AFSAVAAMECAYALSTGTLES---LSEQELVDC 188 >UniRef50_Q5DB58 Cluster: SJCHGC06844 protein; n=1; Schistosoma japonicum|Rep: SJCHGC06844 protein - Schistosoma japonicum (Blood fluke) Length = 145 Score = 35.5 bits (78), Expect = 1.6 Identities = 22/64 (34%), Positives = 33/64 (51%), Gaps = 11/64 (17%) Frame = +3 Query: 6 
SAREQVIAGIHYRMKVEVGLTNCT-------ALTN----RSDCKHISDESLNKFCRVNVW 152 +A QV+AGI Y++ V+ +CT +L N R C S + +K C+V +W Sbjct: 65 NATSQVVAGIIYKLFVKFTPASCTDFAEDKVSLDNIVFSRDSCD--SGNNKSKICKVTIW 122 Query: 153 MRPW 164 RPW Sbjct: 123 KRPW 126 >UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus|Rep: Cathepsin L - Aphrocallistes vastus Length = 329 Score = 35.5 bits (78), Expect = 1.6 Identities = 39/130 (30%), Positives = 62/130 (47%), Gaps = 7/130 (5%) Frame = +2 Query: 317 RFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGL----KPSLRDTNQIP 484 R +I+ N+ + E N E + QFADL+ E+ + YLG + S + ++ Sbjct: 48 RKKIWANNMLYVKEFNA-EGHSYKLAANQFADLTNLEYRQIYLGYDNEARLSRKREGKVF 106 Query: 485 MRQAEIPKLKSPIN--SIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSCC 655 R+ + L + ++ S G + +P VK+QG CGS W + + I + Sbjct: 107 QRKMKDEDLPTTVDWRSKGVV---TP-VKNQGQCGSCWSFSATGSLEGQYAI--KSGKLV 160 Query: 656 HFSEQELVDC 685 FSEQELVDC Sbjct: 161 SFSEQELVDC 170 Score = 35.5 bits (78), Expect = 1.6 Identities = 17/46 (36%), Positives = 25/46 (54%) Frame = +1 Query: 511 EIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTGQ 648 ++P DWR VT G +FS TG++EGQY +K+G+ Sbjct: 114 DLPTTVDWRSKGVVTPVKNQGQCGS-CWSFSATGSLEGQYAIKSGK 158 >UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicalis|Rep: Blo t 1 allergen - Blomia tropicalis (Mite) Length = 333 Score = 35.5 bits (78), Expect = 1.6 Identities = 39/135 (28%), Positives = 59/135 (43%), Gaps = 5/135 (3%) Frame = +2 Query: 296 DAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTN 475 +A E RR FK ++ + E N + Y I +++D+S +EF G L T Sbjct: 41 NAEEEARREHHFKEQLKWVEEHNGIDG--VEYAINEYSDMSEQEFSFHLSG--GGLNFT- 95 Query: 476 QIPMRQAEIPKLKS----PINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*R 640 + M A+ P + + P N + ++ QG CGS W A + S +I + Sbjct: 96 YMKMEAAKEPLINTYGSLPQNFDWRQKARLTRIRQQGSCGSCWAFAAAGVAESLYSI--Q 153 Query: 641 LDSCCHFSEQELVDC 685 SEQELVDC Sbjct: 154 KQQSIELSEQELVDC 168 >UniRef50_A0BLR4 Cluster: Chromosome undetermined scaffold_115, whole genome shotgun sequence; n=1; Paramecium 
tetraurelia|Rep: Chromosome undetermined scaffold_115, whole genome shotgun sequence - Paramecium tetraurelia Length = 304 Score = 35.5 bits (78), Expect = 1.6 Identities = 39/130 (30%), Positives = 53/130 (40%), Gaps = 2/130 (1%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLG--LKPSLRDTNQ 478 E RR IF+ N + I E N + T + QFADL+ EEF YL L L+ + Sbjct: 47 EQYRRM-IFEQNKKMIDEHNANPENTYTMALNQFADLTTEEFVATYLDSQLSAGLKKRSV 105 Query: 479 IPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLGPLALLVMSRVNIS*RLDSCCH 658 P Q+ IP + T D+K G SW S + + Sbjct: 106 KPKSQS-IPNEAYDWRN----TTSVRDMK-SGCISSWAFSTVGAAESYLTVV--KSQKLS 157 Query: 659 FSEQELVDCD 688 S Q+L+DCD Sbjct: 158 LSPQQLLDCD 167 >UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2; Brugia malayi|Rep: Cahepsin L-like cysteine protease - Brugia malayi (Filarial nematode worm) Length = 371 Score = 35.1 bits (77), Expect = 2.1 Identities = 18/45 (40%), Positives = 24/45 (53%) Frame = +1 Query: 514 IPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTGQ 648 +P DWR AVT+ GY FS G +EGQ+ L+TG+ Sbjct: 143 LPKSIDWRTSGAVTKVKDQGYCGS-CWTFSAVGALEGQHFLQTGK 186 >UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma japonicum|Rep: SJCHGC06231 protein - Schistosoma japonicum (Blood fluke) Length = 372 Score = 35.1 bits (77), Expect = 2.1 Identities = 19/46 (41%), Positives = 25/46 (54%) Frame = +1 Query: 511 EIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTGQ 648 ++PD+ DWR AVT G AFS TG +EGQ+ KT + Sbjct: 149 KLPDRVDWRRNGAVTPVKNQGQCGS-CWAFSSTGAIEGQHYRKTNR 193 Score = 34.7 bits (76), Expect = 2.8 Identities = 39/137 (28%), Positives = 63/137 (45%), Gaps = 8/137 (5%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELN-THERGTAVY--GITQFADLSYEEFGKKYLGLKPSLR--- 466 E +RF IF N K+ E N ++ G A Y G+ F D + E +K G + + R Sbjct: 78 EETKRFLIFGTNFIKMMEHNRAYQEGKATYKMGVNNFTDKTEYEL-RKLRGYRSACRIAK 136 Query: 467 --DTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLGPLALLVMSRVNIS*R 640 + I A++P + GA+ +P VK+QG CGS + + + + Sbjct: 137 
PKGSTFISSEHAKLPD-RVDWRRNGAV---TP-VKNQGQCGSCWAFSSTGAIEGQHYR-K 190 Query: 641 LDSCCHFSEQELVDCDK 691 + + SEQ+L+DC K Sbjct: 191 TNRLVNLSEQQLIDCSK 207 >UniRef50_Q4U3Y4 Cluster: CYP325C2; n=4; Anopheles gambiae|Rep: CYP325C2 - Anopheles gambiae (African malaria mosquito) Length = 264 Score = 35.1 bits (77), Expect = 2.1 Identities = 16/48 (33%), Positives = 27/48 (56%) Frame = -1 Query: 453 FKPRYFFPNSS*DKSANCVIPYTAVPRSCVFNS*IFLTLPLKISNLLR 310 F P F P S +S N IP++A R+C+ L++ + +S++LR Sbjct: 182 FDPDRFLPERSEGRSTNVFIPFSAGSRNCIGGRYAMLSMKVMLSSILR 229 >UniRef50_Q23H06 Cluster: Papain family cysteine protease containing protein; n=18; Tetrahymena thermophila|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 349 Score = 35.1 bits (77), Expect = 2.1 Identities = 16/57 (28%), Positives = 35/57 (61%) Frame = +2 Query: 275 HKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYL 445 H+ Y+++ ++ R+ F+ N++KI + N++ T + QF+D++ +EF +K L Sbjct: 36 HQRVYLNEHEKLFRQMVFFE-NLQKIQDHNSNPNNTYSIHLNQFSDMTKQEFAEKIL 91 >UniRef50_Q22W19 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 35.1 bits (77), Expect = 2.1 Identities = 37/144 (25%), Positives = 62/144 (43%), Gaps = 1/144 (0%) Frame = +2 Query: 296 DAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTN 475 D +++ R IF N + +LN+ GT + + FA + +EF + + G + + Sbjct: 57 DEVQLQYRRSIFYQNKDLVEQLNSENNGT-FHTLNAFAIYTKDEFNQLFKGYQKRQKSHL 115 Query: 476 QIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSC 652 ++ P + A+ +P VK+QG CGS W + I+ + Sbjct: 116 IYSLKGDVAPSIDW--RQKNAV---TP-VKNQGQCGSCWAFSTVGGLEGAYAIA--TGNL 167 Query: 653 CHFSEQELVDCDKP*RRM*RGGLP 724 FSEQ++VDC K G LP Sbjct: 168 TSFSEQQIVDCSKANAGCNGGDLP 191 >UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain]; n=71; 
Coelomata|Rep: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain] - Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina) Length = 339 Score = 35.1 bits (77), Expect = 2.1 Identities = 19/46 (41%), Positives = 25/46 (54%) Frame = +1 Query: 508 VEIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTG 645 V +P DWR++ AVT G+ AFS TG +EGQ+ K G Sbjct: 120 VTVPKSVDWREHGAVTGVKDQGHCGS-CWAFSSTGALEGQHFRKAG 164 >UniRef50_UPI00015B5E04 Cluster: PREDICTED: similar to CG8302-PA; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to CG8302-PA - Nasonia vitripennis Length = 508 Score = 34.7 bits (76), Expect = 2.8 Identities = 16/48 (33%), Positives = 26/48 (54%) Frame = -1 Query: 453 FKPRYFFPNSS*DKSANCVIPYTAVPRSCVFNS*IFLTLPLKISNLLR 310 F+P F P +S + +P++A PR+C+ N L + IS +LR Sbjct: 423 FRPERFSPENSEKRHPYAYLPFSAGPRNCIGNKFAILEMKAVISAILR 470 >UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cathepsin L; n=4; Danio rerio|Rep: Novel protein similar to vertebrate cathepsin L - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 334 Score = 34.7 bits (76), Expect = 2.8 Identities = 38/148 (25%), Positives = 74/148 (50%), Gaps = 9/148 (6%) Frame = +2 Query: 275 HKPNYIDDAAEMRRRFEIFKGNVRKIHELNTH-ERGTAVY--GITQFADLSYEEFGKKYL 445 H+ +Y +++ ++ R+ I++ N++KI + N G +++ + ++ DL+ E+ K+ L Sbjct: 33 HEISYDEESEDVHRK-TIWETNMQKIWKNNNDFSFGLSMFKMAMNKYGDLTSVEY-KRLL 90 Query: 446 GLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSP----DVKDQGMCGS-W-LGPLAL 607 G K + + A++ +L + + I ++ +VKDQG CGS W Sbjct: 91 GSKIKGTGNRKGKITSAQMLRLNAKRLGVTNIDYRAKGYVTEVKDQGYCGSCWSFSTTGA 150 Query: 608 LVMSRVNIS*RLDSCCHFSEQELVDCDK 691 + + RL S SEQ+LVDC + Sbjct: 151 IEGQMYKHTGRLVS---LSEQQLVDCSR 175 >UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theileria|Rep: Cysteine protease, putative - Theileria parva Length = 612 Score = 34.7 bits (76), Expect = 2.8 Identities = 43/161 (26%), Positives = 72/161 (44%), Gaps = 7/161 (4%) Frame = +2 Query: 263 
FLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKY 442 F++ ++ Y D+ E + R+ F+ N I N++ G T D S EE G+ Sbjct: 183 FISRYEKKYKDED-EYKTRYLNFRDNRIFIETHNSNHNKIFTMGYTSSTDSSDEELGRAV 241 Query: 443 LGLKPSLRDT-NQIPMRQAEIPKLKSPINSIGAIMTQSPD-----VKDQGMCGS-WLGPL 601 + S + T ++I R +E ++ S G I V+DQ CGS W + Sbjct: 242 SSI--SYKPTQDEIYSRASE--EMSSSKKYPGVIFDWREKGVILPVQDQKECGSCWAVSM 297 Query: 602 ALLVMSRVNIS*RLDSCCHFSEQELVDCDKP*RRM*RGGLP 724 + L+ + + IS +S+Q+L+DC P +GG P Sbjct: 298 SDLLSTMMAISGH--KLQDYSKQQLMDCIDPMFNCTKGGDP 336 >UniRef50_P35481 Cluster: Cystatin precursor; n=1; Cyprinus carpio|Rep: Cystatin precursor - Cyprinus carpio (Common carp) Length = 129 Score = 34.7 bits (76), Expect = 2.8 Identities = 17/64 (26%), Positives = 32/64 (50%), Gaps = 2/64 (3%) Frame = +3 Query: 12 REQVIAGIHYRMKVEVGLTNCTALTNRSDCKHISDESLNKF--CRVNVWMRPWTNHPPNF 185 ++QV AG+ Y V++ + +C ++ C + S+ + C++ VW +PW N Sbjct: 65 QQQVAAGMKYIFTVKMEVASCKKGGVKTMCAVPKNPSIEQVIQCKITVWSQPWLNSLKVT 124 Query: 186 RVTC 197 TC Sbjct: 125 ENTC 128 >UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin L-like protease; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to cathepsin L-like protease - Nasonia vitripennis Length = 353 Score = 34.3 bits (75), Expect = 3.6 Identities = 18/44 (40%), Positives = 21/44 (47%) Frame = +1 Query: 514 IPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTG 645 +P+ DWR AVT G AFS G +E QY KTG Sbjct: 132 VPEHVDWRQRGAVTPVRDQGLTCGSCWAFSAAGALEAQYFKKTG 175 >UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin C; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin C - Strongylocentrotus purpuratus Length = 482 Score = 34.3 bits (75), Expect = 3.6 Identities = 34/144 (23%), Positives = 61/144 (42%), Gaps = 6/144 (4%) Frame = +2 Query: 278 KPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEF-----GKKY 442 K N +D++A I + N + I +N H+ ++ +L+ + GK + Sbjct: 169 KVNNLDESATQFDENAIHRRNDKFIEGINKHQDSWKATYYDRYVNLTLGDMRRRAGGKLW 228 Query: 443 
LGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLGPLALLVM-S 619 + P + T++ + A K +G I SP V+DQG+CGS + S Sbjct: 229 KRVWPDVSPTDERTKQAASNLPEKFDWRDVGGIDYVSP-VRDQGICGSCYAFASTATQES 287 Query: 620 RVNIS*RLDSCCHFSEQELVDCDK 691 R+ + + S QE+V C + Sbjct: 288 RLRVMTNNNVKVVMSPQEVVSCSE 311 >UniRef50_Q2XXN5 Cluster: Cystatin-POGU1; n=1; Pogona barbata|Rep: Cystatin-POGU1 - Pogona barbata (Bearded dragon) Length = 144 Score = 34.3 bits (75), Expect = 3.6 Identities = 20/70 (28%), Positives = 30/70 (42%), Gaps = 7/70 (10%) Frame = +3 Query: 9 AREQVIAGIHYRMKVEVGLTNCT-----ALTNRS--DCKHISDESLNKFCRVNVWMRPWT 167 A QV++G+ Y + VE+ T C L N +C S+ + C VW RPW Sbjct: 70 AETQVVSGMQYYLTVEIVNTRCEKKVGCGLKNMGSENCAVPSEAEQKQICEFVVWSRPWM 129 Query: 168 NHPPNFRVTC 197 ++C Sbjct: 130 QDTRLSSISC 139 >UniRef50_Q9U7F7 Cluster: Cysteine protease; n=2; Entamoeba histolytica|Rep: Cysteine protease - Entamoeba histolytica Length = 446 Score = 34.3 bits (75), Expect = 3.6 Identities = 25/99 (25%), Positives = 44/99 (44%), Gaps = 3/99 (3%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNTH--ERGTAVYGITQFADLSYEEFG-KKYLGLKPSLRDTN 475 E + RF+IFK N++ I LN + A + I + DL EE K + + S D Sbjct: 47 EEQFRFQIFKNNLKNIKTLNEKRTQPSDAFHDINMYTDLIDEELPISKGMAIPVSSYDNE 106 Query: 476 QIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWL 592 E+ K++ P N + + + ++ CG ++ Sbjct: 107 H--FNSKELKKVEKPWNEVPPLPSGDNLPQNYAFCGEYV 143 >UniRef50_Q24E33 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 328 Score = 34.3 bits (75), Expect = 3.6 Identities = 39/130 (30%), Positives = 59/130 (45%), Gaps = 5/130 (3%) Frame = +2 Query: 317 RFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGL--KPSLRDTNQIPMR 490 R IF NVR I N + T I A L+ EE+ YL L + S+ + + Sbjct: 62 RQNIFFQNVRYIQSENA-KNNTFKLAINIMAILTDEEYSSLYLNLDQQESIDIFDSLVDD 120 Query: 491 QAEIPKLKSPIN--SIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSCCHF 661 + + S +N + GA+ +P VK+QG CGS W + + + + F Sbjct: 
121 NETVGDIPSEVNWTAQGAV---TP-VKNQGSCGSCWAFSTTGALEGSYFL--KNNQLISF 174 Query: 662 SEQELVDCDK 691 SEQ+LVDC + Sbjct: 175 SEQQLVDCSR 184 >UniRef50_Q23VA1 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 397 Score = 34.3 bits (75), Expect = 3.6 Identities = 19/42 (45%), Positives = 24/42 (57%) Frame = +2 Query: 560 VKDQGMCGSWLGPLALLVMSRVNIS*RLDSCCHFSEQELVDC 685 VKDQG CG A + VN+ R ++ +SEQELVDC Sbjct: 195 VKDQGRCGCCWAFSATALAESVNLM-RNNTLQQYSEQELVDC 235 >UniRef50_O61973 Cluster: Cystatin-like protease inhibitor protein 1, isoform a; n=3; Rhabditida|Rep: Cystatin-like protease inhibitor protein 1, isoform a - Caenorhabditis elegans Length = 143 Score = 34.3 bits (75), Expect = 3.6 Identities = 20/60 (33%), Positives = 30/60 (50%), Gaps = 6/60 (10%) Frame = +3 Query: 9 AREQVIAGIHYRMKVEVGLTNC------TALTNRSDCKHISDESLNKFCRVNVWMRPWTN 170 A QV+AGI +++V VG +NC S+C+ I D +V +W +PW N Sbjct: 67 ASTQVVAGISTKLEVLVGESNCKKGELQAHEITSSNCQ-IKDGGSRALYQVTIWEKPWEN 125 >UniRef50_O45120 Cluster: Family 4 cytochrome P450; n=2; Coptotermes acinaciformis|Rep: Family 4 cytochrome P450 - Coptotermes acinaciformis Length = 501 Score = 34.3 bits (75), Expect = 3.6 Identities = 16/48 (33%), Positives = 24/48 (50%) Frame = -1 Query: 453 FKPRYFFPNSS*DKSANCVIPYTAVPRSCVFNS*IFLTLPLKISNLLR 310 F P F P + + C +P++A PR+C+ L L IS +LR Sbjct: 420 FDPDRFLPENCVGRHPYCYVPFSAGPRNCIGQKFAILELKSTISQVLR 467 >UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]; n=11; Eutheria|Rep: Testin-2 precursor [Contains: Testin-1] - Mus musculus (Mouse) Length = 333 Score = 34.3 bits (75), Expect = 3.6 Identities = 19/45 (42%), Positives = 24/45 (53%) Frame = +1 Query: 514 IPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTGQ 648 +P DWR VT GY + AFS TG++EGQ KTG+ Sbjct: 114 VPKYVDWRMLGYVTPVKNQGYCAS-SWAFSATGSLEGQMFKKTGR 157 >UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; 
Leishmania|Rep: Cysteine proteinase 2 precursor - Leishmania pifanoi Length = 444 Score = 34.3 bits (75), Expect = 3.6 Identities = 18/41 (43%), Positives = 22/41 (53%) Frame = +1 Query: 514 IPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKL 636 +PD DWR+ AVT G AFS GN+EGQ+ L Sbjct: 126 VPDAVDWREKGAVTPVKDQGACGS-CWAFSAVGNIEGQWYL 165 >UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; n=2; Danio rerio|Rep: hypothetical protein LOC550326 - Danio rerio Length = 531 Score = 33.9 bits (74), Expect = 4.8 Identities = 19/47 (40%), Positives = 25/47 (53%) Frame = +1 Query: 508 VEIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTGQ 648 + P+ DWR Y AVT + V +F+ TG +EG LKTGQ Sbjct: 310 IATPNSVDWRLYGAVTPV-KDQAVCGSCWSFATTGTLEGALFLKTGQ 355 >UniRef50_Q84YE7 Cluster: Cysteine proteinase-like protein; n=1; Sorghum bicolor|Rep: Cysteine proteinase-like protein - Sorghum bicolor (Sorghum) (Sorghum vulgare) Length = 358 Score = 33.9 bits (74), Expect = 4.8 Identities = 19/57 (33%), Positives = 31/57 (54%) Frame = +2 Query: 299 AAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRD 469 A+E R RF+++ GN+R I N + + G T++ DL+ +EF Y +L D Sbjct: 80 ASEERHRFQVYAGNMRYILARNGEDPSYEL-GETEYTDLTTDEFMAMYTTATLALDD 135 >UniRef50_Q9Y1T8 Cluster: Cytochrome P450 4W1; n=3; Arthropoda|Rep: Cytochrome P450 4W1 - Boophilus microplus (Cattle tick) Length = 549 Score = 33.9 bits (74), Expect = 4.8 Identities = 15/48 (31%), Positives = 27/48 (56%) Frame = -1 Query: 453 FKPRYFFPNSS*DKSANCVIPYTAVPRSCVFNS*IFLTLPLKISNLLR 310 F+P FFP + + A +P++A PR+C+ + + I+N+LR Sbjct: 462 FRPDRFFPENVRGRHAFAFVPFSAGPRNCIGQRFAMMEEKVVIANILR 509 >UniRef50_Q967Y5 Cluster: Cytochrome P450 CYP4G13v2; n=4; Neoptera|Rep: Cytochrome P450 CYP4G13v2 - Musca domestica (House fly) Length = 552 Score = 33.9 bits (74), Expect = 4.8 Identities = 18/54 (33%), Positives = 27/54 (50%) Frame = -1 Query: 453 FKPRYFFPNSS*DKSANCVIPYTAVPRSCVFNS*IFLTLPLKISNLLRISAASS 292 F P F P + ++ IP++A PRSCV L L + +S ++R SS Sbjct: 466 
FNPDNFLPERTANRHYYAYIPFSAGPRSCVGRKFAMLQLKVLLSTIIRNYRVSS 519 >UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n=21; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Globodera pallida Length = 379 Score = 33.9 bits (74), Expect = 4.8 Identities = 18/46 (39%), Positives = 24/46 (52%) Frame = +1 Query: 511 EIPDKFDWRDYDAVTRC*RPGYVRKLAGAFSVTGNVEGQYKLKTGQ 648 ++P+ DWRD VT G AFS TG +E Q+ +TGQ Sbjct: 160 DLPESVDWRDKGWVTEVKNQGMCGS-CWAFSSTGALEAQHARQTGQ 204 >UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin l; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin l - Strongylocentrotus purpuratus Length = 489 Score = 33.5 bits (73), Expect = 6.4 Identities = 31/113 (27%), Positives = 48/113 (42%), Gaps = 1/113 (0%) Frame = +2 Query: 350 IHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKPSLRDTNQIPMRQAEIPKLKSPINS 529 IH +N G V I AD S++E K+ G R N +P +++ P + Sbjct: 214 IHSINRANLGY-VLDINHMADQSHQEL-KRMRGRLRQTRPNNGLPYDGSDVSDDAVPDHI 271 Query: 530 IGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSCCHFSEQELVDC 685 ++ VKDQ +CGS W A + V + + S+Q L+DC Sbjct: 272 DWNVLGAVSPVKDQAVCGSCWSFGSAETIEGAVFM--QSGKRVRLSQQMLMDC 322 >UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa (japonica cultivar-group)|Rep: Os09g0562700 protein - Oryza sativa subsp. 
japonica (Rice) Length = 235 Score = 33.5 bits (73), Expect = 6.4 Identities = 23/58 (39%), Positives = 29/58 (50%), Gaps = 1/58 (1%) Frame = +2 Query: 518 PINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSCCHFSEQELVDCD 688 P++ GA+ +VKDQG CGS W +V I + SEQELVDCD Sbjct: 14 PVDHGGAVT----EVKDQGRCGSCWAFSTVAVVEGIQKI--KKGKLVSLSEQELVDCD 65 >UniRef50_Q015J8 Cluster: Cathepsin; n=2; Ostreococcus|Rep: Cathepsin - Ostreococcus tauri Length = 556 Score = 33.5 bits (73), Expect = 6.4 Identities = 16/48 (33%), Positives = 29/48 (60%) Frame = +2 Query: 314 RRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKYLGLKP 457 RR E++ N+ + N++ER + T+F+DL+ EEF +++L P Sbjct: 23 RRQEVYFANMVMYEKHNSNERASYRVRETKFSDLTEEEFAQRWLTYTP 70 >UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba histolytica|Rep: Cysteine protease 19 - Entamoeba histolytica Length = 324 Score = 33.5 bits (73), Expect = 6.4 Identities = 29/109 (26%), Positives = 46/109 (42%) Frame = +2 Query: 260 DFLATHKPNYIDDAAEMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKK 439 ++++ HK + E RR +F N + ++E+N G + FA L+ EE Sbjct: 19 EWISLHKKAF--SPIEYLRRRAVFIENTKYVNEMNKQNLGFTLSNEGPFAILTREESVAI 76 Query: 440 YLGLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS 586 G+ D Q + E+ + N G + VKDQG CGS Sbjct: 77 AQGIHIDKSDLEQYKPSKREMVEAIDYRNIQGK--SYMTPVKDQGNCGS 123 >UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1; Dictyostelium discoideum AX4|Rep: Counting factor associated protein - Dictyostelium discoideum AX4 Length = 531 Score = 33.5 bits (73), Expect = 6.4 Identities = 40/131 (30%), Positives = 61/131 (46%), Gaps = 4/131 (3%) Frame = +2 Query: 305 EMRRRFEIFKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGK--KYLGLKPSLRDTNQ 478 E RF FK RKI + + + G+ +ADLS +EF K +PS+ + Sbjct: 241 EHDERFINFKA-ARKIIATHNAKESSYKLGMNHYADLSNKEFNTLVKPKVARPSVTGADS 299 Query: 479 IPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGS-W-LGPLALLVMSRVNIS*RLDSC 652 + ++ + + S ++ +P VKDQG+CGS W G L + + L S Sbjct: 300 VHDDES-LRSIPSTVDWRNQNCV-TP-VKDQGICGSCWTFGSTGSLEGTNCVTNGELVS- 355 Query: 653 CHFSEQELVDC 685 
SEQ+LVDC Sbjct: 356 --LSEQQLVDC 364 >UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicatein a3 - Lubomirskia baicalensis Length = 344 Score = 33.5 bits (73), Expect = 6.4 Identities = 39/143 (27%), Positives = 56/143 (39%), Gaps = 3/143 (2%) Frame = +2 Query: 275 HKPNYIDDAAEMRRRFEIFKGNVRKI--HELNTHERGTAVYGITQFADLSYEEFGKKYLG 448 H+ +Y EM R I+ N + I H N G + + F DL EF ++YL Sbjct: 51 HQRSYESQLQEMERH-SIWVANKKYIEHHNANADLFGYTL-AMNGFGDLMSAEFTERYLT 108 Query: 449 LKPSLRDTNQIPMRQAEIPKLKSPINSIG-AIMTQSPDVKDQGMCGSWLGPLALLVMSRV 625 K S R ++ E PK + +S+ V+ QG CGS A + Sbjct: 109 HKHSQRSG----LQTFESPKGVTYADSLDWRTRGVVTSVQSQGQCGSSYAFAAAGALEGA 164 Query: 626 NIS*RLDSCCHFSEQELVDCDKP 694 D SEQ ++DC P Sbjct: 165 TAL-AADKLVALSEQNIIDCSVP 186 >UniRef50_O15725 Cluster: Pol; n=20; Dictyostelium discoideum|Rep: Pol - Dictyostelium discoideum (Slime mold) Length = 1116 Score = 33.5 bits (73), Expect = 6.4 Identities = 15/51 (29%), Positives = 25/51 (49%) Frame = +3 Query: 180 NFRVTCDYQESATIDLYHHIQAEHLFMIFWRHTNRIT*TMPPKCVGDSKFL 332 +F + CD + A + + IQ F + W H ++T T +GD +FL Sbjct: 425 SFHLYCDVSDKALSGVLYQIQGNK-FKVIWFHCRKLTDTQKRYSIGDREFL 474 >UniRef50_Q91195 Cluster: Cystatin precursor; n=4; Actinopteri|Rep: Cystatin precursor - Oncorhynchus mykiss (Rainbow trout) (Salmo gairdneri) Length = 130 Score = 33.5 bits (73), Expect = 6.4 Identities = 18/67 (26%), Positives = 31/67 (46%), Gaps = 2/67 (2%) Frame = +3 Query: 6 SAREQVIAGIHYRMKVEVGLTNCTALTNRSDCKHISDE--SLNKFCRVNVWMRPWTNHPP 179 +A++QV++G+ Y V++G T C C D ++ C VW RPW + Sbjct: 63 NAQKQVVSGMKYIFTVQMGRTPCRKGGVEKVCSVHKDPQMAVPYKCTFEVWSRPWMSDIQ 122 Query: 180 NFRVTCD 200 + C+ Sbjct: 123 MVKNQCE 129 >UniRef50_Q9V7G5 Cluster: Probable cytochrome P450 4aa1; n=5; Diptera|Rep: Probable cytochrome P450 4aa1 - Drosophila melanogaster (Fruit fly) Length = 514 Score = 33.5 bits (73), Expect = 6.4 Identities = 15/48 (31%), Positives = 26/48 (54%) Frame = -1 Query: 453 FKPRYFFPNSS*DKSANCVIPYTAVPRSCVFNS*IFLTLPLKISNLLR 310 F+P F P +S ++ +P++A PR C+ N + + +S 
LLR Sbjct: 426 FQPERFSPENSENRHPYAFLPFSAGPRYCIGNRFAIMEIKTIVSRLLR 473 >UniRef50_Q45RG8 Cluster: Cystatin; n=4; Danio rerio|Rep: Cystatin - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 128 Score = 33.1 bits (72), Expect = 8.4 Identities = 19/55 (34%), Positives = 27/55 (49%), Gaps = 2/55 (3%) Frame = +3 Query: 12 REQVIAGIHYRMKVEVGLTNCTALTNRSDCK-HISDESLN-KFCRVNVWMRPWTN 170 ++QV+AGI Y V+V T C C H + E K C++ VW + W N Sbjct: 64 QKQVVAGIKYIFTVDVARTTCRKGGVEELCAIHENPEIAQVKECKIVVWTKLWEN 118 >UniRef50_A7HJT1 Cluster: MutS2 family protein; n=1; Fervidobacterium nodosum Rt17-B1|Rep: MutS2 family protein - Fervidobacterium nodosum Rt17-B1 Length = 803 Score = 33.1 bits (72), Expect = 8.4 Identities = 13/29 (44%), Positives = 20/29 (68%) Frame = -3 Query: 706 HPSSRFITVHQLLLREVTAAVQSSTYIDP 620 H + RFIT HQ +L+EVT +++ Y+ P Sbjct: 180 HKAERFITHHQNILQEVTYTIRNDRYVFP 208 >UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative cysteine proteinase - Oryza sativa subsp. 
japonica (Rice) Length = 357 Score = 33.1 bits (72), Expect = 8.4 Identities = 37/140 (26%), Positives = 54/140 (38%), Gaps = 1/140 (0%) Frame = +2 Query: 269 ATHKPNYIDDAAEMRRRFEIFKGNVRKIHELN-THERGTAVYGITQFADLSYEEFGKKYL 445 A H Y D+ E RRFE+F+ N I N + + +FADL+ EEF +Y Sbjct: 54 ADHGRTY-KDSLEKARRFEVFRTNALFIDSFNAAGGKKSPRLTTNKFADLTNEEFA-EYY 111 Query: 446 GLKPSLRDTNQIPMRQAEIPKLKSPINSIGAIMTQSPDVKDQGMCGSWLGPLALLVMSRV 625 G S + P N VK+Q C S A+ + + Sbjct: 112 GRPFSTPVIGGSGFMYGNVRTSDVPANINWRDRGAVTQVKNQKDCASCWAFSAVAAVEGI 171 Query: 626 NIS*RLDSCCHFSEQELVDC 685 + R + S Q+L+DC Sbjct: 172 H-QIRSHNLVALSTQQLLDC 190 >UniRef50_Q9U9A1 Cluster: Cystatin-type cysteine proteinase inhibitor CPI-1; n=2; Onchocercidae|Rep: Cystatin-type cysteine proteinase inhibitor CPI-1 - Onchocerca volvulus Length = 127 Score = 33.1 bits (72), Expect = 8.4 Identities = 18/56 (32%), Positives = 25/56 (44%), Gaps = 1/56 (1%) Frame = +3 Query: 6 SAREQVIAGIHYRMKVEVGLTNCTALTNRSDCKHISDESL-NKFCRVNVWMRPWTN 170 +AR QV+AG+ Y + + T C + S D S K + VW PW N Sbjct: 65 NARTQVVAGMKYYLTILTAPTTCRKNSGMSPANCAIDHSKPKKKVILEVWSAPWQN 120 >UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomuscorum|Rep: Cathepsin H - Oxytricha trifallax (Sterkiella histriomuscorum) Length = 366 Score = 33.1 bits (72), Expect = 8.4 Identities = 36/123 (29%), Positives = 58/123 (47%), Gaps = 4/123 (3%) Frame = +2 Query: 329 FKGNVRKIHELNTHERGTAVYGITQFADLSYEEFGKKY-LGLKPSLRDTNQ--IPMRQAE 499 F +++I + N+ T G+ F+D++ EEF Y + + + TN+ A Sbjct: 75 FANKLQQIIKHNSDGTNTYKKGLNAFSDMTDEEFFDYYNIKAEQNCSATNRKSFGNSNAN 134 Query: 500 IPKLKSPINSIGAIMTQSPDVKDQGMCGS-WLGPLALLVMSRVNIS*RLDSCCHFSEQEL 676 IP + + G + SP VK+QG CGS W V S + + + + SEQ+L Sbjct: 135 IP-TEWDWRTFGVV---SP-VKNQGKCGSCWTFSTVGCVESHYLL--KYGAFRNLSEQQL 187 Query: 677 VDC 685 VDC Sbjct: 188 VDC 190 >UniRef50_Q6QZV5 Cluster: Cystatin precursor; n=1; Ornithodoros moubata|Rep: Cystatin precursor - Ornithodoros moubata (Soft tick) Length = 128 Score = 33.1 bits (72), Expect = 8.4 Identities = 16/57 (28%), 
Positives = 31/57 (54%), Gaps = 3/57 (5%) Frame = +3 Query: 9 AREQVIAGIHYRMKVEVGLTNCTALTNRSDCKHISDESLN---KFCRVNVWMRPWTN 170 A +QV+AG++Y++ ++V + C ++ K + LN K C +++ PW N Sbjct: 63 ASQQVVAGVNYKLTLKVAPSKC-KVSETVYSKELCQPQLNAAPKDCEAQLYVVPWRN 118

  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284

Lambda     K      H
   0.318    0.134    0.401

Gapped
Lambda     K      H
   0.279   0.0580    0.190

Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 794,063,612
Number of Sequences: 1657284
Number of extensions: 16086112
Number of successful extensions: 39659
Number of sequences better than 10.0: 251
Number of HSP's better than 10.0 without gapping: 37853
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 39357
length of database: 575,637,011
effective HSP length: 99
effective length of database: 411,565,895
effective search space used: 69143070360
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)