BLASTX 2.2.12 [Aug-07-2005] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= S06A01NCLL0001_H21 (515 letters) Database: uniref50 1,657,284 sequences; 575,637,011 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ... 268 7e-71 UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-... 252 3e-66 UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; ... 225 5e-58 UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome s... 220 1e-56 UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ... 197 1e-49 UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve... 164 9e-40 UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1... 155 7e-37 UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v... 143 2e-33 UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep... 142 3e-33 UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi... 142 4e-33 UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs... 140 2e-32 UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; ... 136 2e-31 UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve... 134 1e-30 UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C... 132 3e-30 UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome s... 132 5e-30 UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata... 131 8e-30 UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n... 130 2e-29 UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca... 129 4e-29 UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhip... 
127 1e-28 UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j... 127 2e-28 UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er... 126 4e-28 UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine prot... 126 4e-28 UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ... 125 5e-28 UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|... 125 5e-28 UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep... 124 9e-28 UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M... 124 9e-28 UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica... 123 2e-27 UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; D... 89 3e-27 UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]... 122 5e-27 UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy... 121 9e-27 UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae... 121 1e-26 UniRef50_UPI000155637A Cluster: PREDICTED: similar to ENSANGP000... 120 2e-26 UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ... 120 2e-26 UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto... 120 3e-26 UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;... 120 3e-26 UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr... 118 8e-26 UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ... 117 2e-25 UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Re... 117 2e-25 UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n... 116 2e-25 UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus... 116 3e-25 UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca... 116 4e-25 UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei... 116 4e-25 UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18... 115 6e-25 UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ... 
114 1e-24 UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph... 114 1e-24 UniRef50_Q24HI2 Cluster: Papain family cysteine protease contain... 114 1e-24 UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [C... 114 1e-24 UniRef50_Q4TI44 Cluster: Chromosome undetermined SCAF2412, whole... 113 2e-24 UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip... 113 2e-24 UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lambli... 113 3e-24 UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy... 113 3e-24 UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ... 112 5e-24 UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadida... 112 5e-24 UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicate... 112 5e-24 UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No... 111 7e-24 UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi... 111 7e-24 UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cy... 111 7e-24 UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lambli... 111 9e-24 UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n... 111 9e-24 UniRef50_Q23EG5 Cluster: Papain family cysteine protease contain... 111 1e-23 UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=... 109 4e-23 UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:... 109 5e-23 UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3... 108 6e-23 UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc... 108 6e-23 UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy... 108 8e-23 UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio... 108 8e-23 UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat... 107 1e-22 UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ... 107 1e-22 UniRef50_Q7KWP5 Cluster: Similar to Dictyostelium discoideum (Sl... 
107 1e-22 UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat... 107 1e-22 UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n... 106 3e-22 UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ... 105 5e-22 UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n... 105 5e-22 UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox... 105 6e-22 UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina... 104 1e-21 UniRef50_P25773 Cluster: Cathepsin L; n=38; Eukaryota|Rep: Cathe... 104 1e-21 UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h... 103 2e-21 UniRef50_UPI0000E224BB Cluster: PREDICTED: hypothetical protein;... 103 2e-21 UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain... 103 2e-21 UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H... 102 4e-21 UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaste... 102 6e-21 UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n... 102 6e-21 UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia ... 101 1e-20 UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2... 101 1e-20 UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endop... 101 1e-20 UniRef50_Q23TW3 Cluster: Papain family cysteine protease contain... 100 2e-20 UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e... 100 2e-20 UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy... 100 2e-20 UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt... 99 3e-20 UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio... 99 3e-20 UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl... 99 3e-20 UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ... 99 3e-20 UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1; ... 100 4e-20 UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ... 
99 5e-20 UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster... 99 5e-20 UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus... 99 5e-20 UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ... 99 7e-20 UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re... 98 9e-20 UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory... 98 9e-20 UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;... 98 1e-19 UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Ac... 98 1e-19 UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ... 98 1e-19 UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ... 97 2e-19 UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n... 97 2e-19 UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ... 96 4e-19 UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba hi... 96 5e-19 UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:... 96 5e-19 UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr... 96 5e-19 UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryz... 95 6e-19 UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ... 95 1e-18 UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel... 95 1e-18 UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryz... 94 1e-18 UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;... 94 2e-18 UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;... 94 2e-18 UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dv... 94 2e-18 UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted... 93 3e-18 UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foet... 93 3e-18 UniRef50_Q24E33 Cluster: Papain family cysteine protease contain... 93 3e-18 UniRef50_A2F4G0 Cluster: Clan CA, family C1, cathepsin L-like cy... 
93 3e-18 UniRef50_Q22W19 Cluster: Papain family cysteine protease contain... 93 5e-18 UniRef50_A2WPN4 Cluster: Putative uncharacterized protein; n=2; ... 92 6e-18 UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ... 92 6e-18 UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy... 91 1e-17 UniRef50_Q0MYX5 Cluster: Putative cysteine protease; n=1; Emilia... 91 1e-17 UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal... 90 2e-17 UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ... 90 2e-17 UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryo... 90 3e-17 UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza sativa... 89 4e-17 UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ... 89 4e-17 UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain... 89 4e-17 UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2; Arab... 89 6e-17 UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like cy... 89 6e-17 UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like cy... 89 6e-17 UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-li... 89 6e-17 UniRef50_UPI0000E492F4 Cluster: PREDICTED: similar to cathepsin ... 88 1e-16 UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8; Magnoliophyt... 88 1e-16 UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic... 88 1e-16 UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole... 88 1e-16 UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty... 88 1e-16 UniRef50_A2DKX2 Cluster: Clan CA, family C1, cathepsin L-like cy... 88 1e-16 UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35... 88 1e-16 UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ... 87 2e-16 UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin ... 87 2e-16 UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi... 
87 2e-16 UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re... 87 2e-16 UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ... 87 3e-16 UniRef50_O16454 Cluster: Temporarily assigned gene name protein ... 87 3e-16 UniRef50_Q9VNK7 Cluster: CG1075-PA; n=1; Drosophila melanogaster... 86 4e-16 UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr... 86 4e-16 UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov... 86 4e-16 UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|... 86 5e-16 UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:... 86 5e-16 UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa... 85 7e-16 UniRef50_Q22RC9 Cluster: Papain family cysteine protease contain... 85 7e-16 UniRef50_A3EXS2 Cluster: Cathepsin L-like cysteine proteinase-li... 85 7e-16 UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus ... 85 9e-16 UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ... 85 9e-16 UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schist... 85 1e-15 UniRef50_Q23H32 Cluster: Papain family cysteine protease contain... 85 1e-15 UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R... 85 1e-15 UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen... 84 2e-15 UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n... 84 2e-15 UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia... 83 3e-15 UniRef50_Q22A69 Cluster: Papain family cysteine protease contain... 83 3e-15 UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:... 83 3e-15 UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryz... 83 4e-15 UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-li... 83 4e-15 UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w... 83 4e-15 UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D... 
83 4e-15 UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t... 83 5e-15 UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Try... 83 5e-15 UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|... 83 5e-15 UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; ... 82 6e-15 UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=... 82 6e-15 UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi... 82 8e-15 UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain... 81 1e-14 UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R... 81 1e-14 UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba hi... 81 1e-14 UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ... 81 2e-14 UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb... 81 2e-14 UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like cy... 81 2e-14 UniRef50_Q248G1 Cluster: Papain family cysteine protease contain... 80 3e-14 UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ... 79 4e-14 UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big... 79 6e-14 UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa... 79 6e-14 UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep... 79 6e-14 UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n... 79 6e-14 UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali... 79 6e-14 UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG... 79 8e-14 UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh... 78 1e-13 UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|... 78 1e-13 UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ... 78 1e-13 UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ... 77 2e-13 UniRef50_Q23RT7 Cluster: Papain family cysteine protease contain... 
77 2e-13 UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma... 77 2e-13 UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease ... 77 2e-13 UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; ... 77 2e-13 UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; ... 77 2e-13 UniRef50_Q23FS6 Cluster: Papain family cysteine protease contain... 77 3e-13 UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain... 77 3e-13 UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil... 76 4e-13 UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi... 76 4e-13 UniRef50_Q239L8 Cluster: Papain family cysteine protease contain... 75 7e-13 UniRef50_UPI0000498E2F Cluster: cysteine proteinase; n=1; Entamo... 75 1e-12 UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis... 75 1e-12 UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia... 75 1e-12 UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3.... 75 1e-12 UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:... 74 2e-12 UniRef50_Q23H10 Cluster: Papain family cysteine protease contain... 74 2e-12 UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto... 74 2e-12 UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ... 73 3e-12 UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease ... 73 3e-12 UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8... 73 3e-12 UniRef50_Q23H06 Cluster: Papain family cysteine protease contain... 73 3e-12 UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like cy... 73 3e-12 UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa... 73 4e-12 UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-... 73 4e-12 UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ... 73 4e-12 UniRef50_UPI00006CBB5F Cluster: Papain family cysteine protease ... 
73 5e-12 UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, wh... 73 5e-12 UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa... 72 7e-12 UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain... 72 7e-12 UniRef50_UPI00006CBCFB Cluster: Papain family cysteine protease ... 72 9e-12 UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain... 72 9e-12 UniRef50_Q237A1 Cluster: Papain family cysteine protease contain... 71 1e-11 UniRef50_Q22NW9 Cluster: Papain family cysteine protease contain... 71 2e-11 UniRef50_Q5CYC8 Cluster: Cathepsin like thiol protease possibly ... 71 2e-11 UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain ... 71 2e-11 UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|R... 71 2e-11 UniRef50_A3FQ13 Cluster: Cathepsin like thiol protease possibly ... 71 2e-11 UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease ... 70 3e-11 UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; M... 70 3e-11 UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Catheps... 70 4e-11 UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|R... 70 4e-11 UniRef50_A7APS9 Cluster: Papain family cysteine protease contain... 70 4e-11 UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - ... 69 5e-11 UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoid... 69 6e-11 UniRef50_Q26988 Cluster: Cysteine proteinase 4; n=1; Tritrichomo... 69 6e-11 UniRef50_Q231X3 Cluster: Papain family cysteine protease contain... 69 6e-11 UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathep... 69 6e-11 UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, wh... 69 6e-11 UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Ca... 69 6e-11 UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3; ... 68 1e-10 UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|... 
68 1e-10 UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain... 68 1e-10 UniRef50_A0DZB1 Cluster: Chromosome undetermined scaffold_7, who... 68 1e-10 UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain... 68 1e-10 UniRef50_Q235G6 Cluster: Papain family cysteine protease contain... 68 1e-10 UniRef50_A0DIY3 Cluster: Chromosome undetermined scaffold_52, wh... 67 2e-10 UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sa... 67 3e-10 UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|... 67 3e-10 UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum... 67 3e-10 UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru... 67 3e-10 UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ... 66 3e-10 UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G... 66 3e-10 UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C... 66 3e-10 UniRef50_Q7QRX3 Cluster: GLP_549_24108_24914; n=1; Giardia lambl... 66 4e-10 UniRef50_Q22DX2 Cluster: Papain family cysteine protease contain... 66 4e-10 UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag... 66 4e-10 UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, wh... 66 4e-10 UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4; Ancylostomati... 66 6e-10 UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7... 66 6e-10 UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=... 66 6e-10 UniRef50_A0BZE1 Cluster: Chromosome undetermined scaffold_139, w... 66 6e-10 UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2 ... 66 6e-10 UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1 ... 66 6e-10 UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Re... 65 8e-10 UniRef50_Q5CKC2 Cluster: Preprocathepsin c; n=2; Cryptosporidium... 65 8e-10 UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cy... 
65 8e-10 UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine p... 65 1e-09 UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;... 65 1e-09 UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1... 65 1e-09 UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; ... 65 1e-09 UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8; Leishma... 65 1e-09 UniRef50_UPI0000D566F0 Cluster: PREDICTED: similar to CathePsin ... 64 1e-09 UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2... 64 1e-09 UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis... 64 1e-09 UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Gi... 64 1e-09 UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin ... 64 2e-09 UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyl... 64 2e-09 UniRef50_Q9XZM9 Cluster: Cysteine proteinase CPW2; n=1; Acantham... 64 2e-09 UniRef50_Q23H15 Cluster: Papain family cysteine protease contain... 64 2e-09 UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep: Ca... 64 2e-09 UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep... 64 2e-09 UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, wh... 64 2e-09 UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ... 64 2e-09 UniRef50_Q84SA7 Cluster: Thiol protease; n=1; Aster tripolium|Re... 64 2e-09 UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lambl... 64 2e-09 UniRef50_Q1AMF3 Cluster: Cathepsin C1; n=1; Toxoplasma gondii|Re... 64 2e-09 UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8.... 64 2e-09 UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, w... 64 2e-09 UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase pr... 64 2e-09 UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 ... 64 2e-09 UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ... 
63 3e-09 UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4 ... 63 3e-09 UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium... 63 3e-09 UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep... 63 4e-09 UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n... 63 4e-09 UniRef50_Q23FP9 Cluster: Papain family cysteine protease contain... 63 4e-09 UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus co... 62 6e-09 UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|R... 62 6e-09 UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome sh... 62 7e-09 UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestiv... 62 7e-09 UniRef50_Q01E11 Cluster: Cysteine proteinase Cathepsin F; n=3; O... 62 7e-09 UniRef50_Q9XZI2 Cluster: Cathepsin Z1 preproprotein; n=3; Bilate... 62 7e-09 UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=... 62 7e-09 UniRef50_Q8I8D3 Cluster: Cysteine protease 15; n=2; Entamoeba hi... 62 7e-09 UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheir... 62 7e-09 UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n... 62 7e-09 UniRef50_A0CYS7 Cluster: Chromosome undetermined scaffold_31, wh... 62 7e-09 UniRef50_A0CGA3 Cluster: Chromosome undetermined scaffold_179, w... 62 7e-09 UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: C... 62 1e-08 UniRef50_Q015J8 Cluster: Cathepsin; n=2; Ostreococcus|Rep: Cathe... 62 1e-08 UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|... 62 1e-08 UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like prote... 62 1e-08 UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia... 61 1e-08 UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1; Rhip... 61 2e-08 UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theil... 61 2e-08 UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4; Caenorhabdi... 
61 2e-08 UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precurs... 61 2e-08 UniRef50_Q9UBR2 Cluster: Cathepsin Z precursor; n=40; Bilateria|... 61 2e-08 UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestin... 60 2e-08 UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Pir... 60 2e-08 UniRef50_UPI00006CB653 Cluster: Papain family cysteine protease ... 60 3e-08 UniRef50_Q84YE7 Cluster: Cysteine proteinase-like protein; n=1; ... 60 3e-08 UniRef50_P05993 Cluster: Cysteine proteinase; n=7; Eukaryota|Rep... 60 3e-08 UniRef50_Q9LUX8 Cluster: Cysteine protease; n=1; Pyrus pyrifolia... 60 4e-08 UniRef50_Q8I8D5 Cluster: Cysteine protease 13; n=2; Entamoeba hi... 60 4e-08 UniRef50_Q0PWU8 Cluster: Cathepsin B preproprotein-like protein;... 60 4e-08 UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1; ... 60 4e-08 UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh... 60 4e-08 UniRef50_Q9BL26 Cluster: Putative uncharacterized protein; n=4; ... 59 5e-08 UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella ve... 59 5e-08 UniRef50_A4VE98 Cluster: Cathepsin z; n=3; Tetrahymena thermophi... 59 5e-08 UniRef50_UPI0000E49DA9 Cluster: PREDICTED: similar to cathepsin ... 59 7e-08 UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease ... 59 7e-08 UniRef50_A7AX75 Cluster: Preprocathepsin c, putative; n=1; Babes... 59 7e-08 UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin ... 58 9e-08 UniRef50_Q962W3 Cluster: Cysteine protease; n=3; Giardia intesti... 58 9e-08 UniRef50_Q58HK5 Cluster: Hc58; n=1; Haemonchus contortus|Rep: Hc... 58 9e-08 UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase pr... 58 9e-08 UniRef50_Q8IIJ9 Cluster: Probable cathepsin C precursor; n=11; P... 58 9e-08 UniRef50_A6GBK8 Cluster: Peptidase C1A, papain; n=1; Plesiocysti... 58 1e-07 UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; R... 
58 1e-07 UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ... 58 1e-07 UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain... 58 1e-07 UniRef50_A2E6N1 Cluster: Clan CA, family C1, cathepsin B-like cy... 58 1e-07 UniRef50_UPI0000498719 Cluster: cysteine protease 18-related; n=... 58 2e-07 UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; ... 58 2e-07 UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomus... 58 2e-07 UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpw... 58 2e-07 UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7; Rhabditida|... 58 2e-07 UniRef50_P56202 Cluster: Cathepsin W precursor; n=15; Eutheria|R... 58 2e-07 UniRef50_A0LQG9 Cluster: Peptidase C1A, papain; n=1; Syntrophoba... 57 2e-07 UniRef50_Q01FU9 Cluster: Cathepsin Z; n=2; Ostreococcus|Rep: Cat... 57 2e-07 UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Pl... 57 2e-07 UniRef50_Q8I8D2 Cluster: Cysteine protease 16; n=2; Entamoeba hi... 57 2e-07 UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3 ... 57 2e-07 UniRef50_Q9XXQ7 Cluster: Putative uncharacterized protein; n=2; ... 57 3e-07 UniRef50_Q4UFL9 Cluster: Cathepsin-like cysteine protease, putat... 57 3e-07 UniRef50_Q24F16 Cluster: Papain family cysteine protease contain... 57 3e-07 UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precu... 57 3e-07 UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep... 56 4e-07 UniRef50_Q8I8D0 Cluster: Cysteine protease 18; n=2; Entamoeba hi... 56 4e-07 UniRef50_Q6A1H9 Cluster: Cathepsin X/O; n=1; Suberites domuncula... 56 4e-07 UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, w... 56 4e-07 UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; The... 56 4e-07 UniRef50_Q94715 Cluster: Putative cathepsin L2 precursor; n=4; P... 56 4e-07 UniRef50_Q9GU75 Cluster: Thiolproteinase; n=2; Babesia|Rep: Thio... 
56 5e-07 UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gamb... 56 5e-07 UniRef50_Q26987 Cluster: Cysteine proteinase 3; n=1; Tritrichomo... 56 5e-07 UniRef50_A0DKP6 Cluster: Chromosome undetermined scaffold_54, wh... 56 5e-07 UniRef50_A0CS14 Cluster: Chromosome undetermined scaffold_26, wh... 56 5e-07 UniRef50_Q1AMF1 Cluster: Cathepsin C3; n=1; Toxoplasma gondii|Re... 56 6e-07 UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p; ... 55 8e-07 UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lambl... 55 8e-07 UniRef50_Q54R55 Cluster: Putative uncharacterized protein; n=1; ... 55 8e-07 UniRef50_Q4N3V5 Cluster: Cathepsin C, putative; n=1; Theileria p... 55 8e-07 UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Re... 55 8e-07 UniRef50_A7SNM3 Cluster: Predicted protein; n=1; Nematostella ve... 55 8e-07 UniRef50_Q7R0G3 Cluster: GLP_29_33036_32140; n=1; Giardia lambli... 55 1e-06 UniRef50_Q26989 Cluster: Cysteine proteinase 5; n=1; Tritrichomo... 55 1e-06 UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8.... 55 1e-06 UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, wh... 55 1e-06 UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA... 54 1e-06 UniRef50_Q26993 Cluster: Cysteine proteinase 9; n=1; Tritrichomo... 54 1e-06 UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep: Ca... 54 1e-06 UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Pl... 54 1e-06 UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep... 54 2e-06 UniRef50_Q7R5X2 Cluster: GLP_81_104117_102504; n=1; Giardia lamb... 54 2e-06 UniRef50_Q625X3 Cluster: Putative uncharacterized protein CBG011... 54 2e-06 UniRef50_Q54MB6 Cluster: Putative uncharacterized protein; n=1; ... 54 2e-06 UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteina... 54 2e-06 UniRef50_O48605 Cluster: Putative thiol protease; n=1; Hordeum v... 
54 3e-06 UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ... 54 3e-06 UniRef50_Q5VUI9 Cluster: Tubulointerstitial nephritis antigen; n... 54 3e-06 UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein... 53 3e-06 UniRef50_Q7QWN9 Cluster: GLP_26_50243_51811; n=2; Giardia lambli... 53 3e-06 UniRef50_Q75JH0 Cluster: Similar to Dictyostelium discoideum (Sl... 53 3e-06 UniRef50_A2DPZ6 Cluster: Clan CA, family C1, cathepsin B-like cy... 53 3e-06 UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Gi... 53 3e-06 UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intest... 53 4e-06 UniRef50_A0BLR4 Cluster: Chromosome undetermined scaffold_115, w... 52 8e-06 UniRef50_A5KAP8 Cluster: Protease, putative; n=1; Plasmodium viv... 38 9e-06 UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA,... 52 1e-05 UniRef50_Q53K53 Cluster: Cysteine protease 1, putative; n=5; Ory... 52 1e-05 UniRef50_Q7QQ92 Cluster: GLP_243_18349_20043; n=1; Giardia lambl... 52 1e-05 UniRef50_Q54VR1 Cluster: Putative uncharacterized protein; n=1; ... 52 1e-05 UniRef50_Q8I8D7 Cluster: Cysteine protease 11; n=4; Entamoeba hi... 51 1e-05 UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;... 51 1e-05 UniRef50_Q3L7L7 Cluster: Sar s 1 allergen Yv5020C01; n=1; Sarcop... 51 1e-05 UniRef50_Q9UJW2 Cluster: Tubulointerstitial nephritis antigen; n... 51 1e-05 UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckei... 51 2e-05 UniRef50_A5K8Y0 Cluster: Preprocathepsin c, putative; n=1; Plasm... 51 2e-05 UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator ame... 51 2e-05 UniRef50_Q5YER6 Cluster: Cathepsin Z; n=1; Bigelowiella natans|R... 50 2e-05 UniRef50_O96166 Cluster: Cysteine protease, putative; n=1; Plasm... 50 2e-05 UniRef50_A2GCC2 Cluster: Clan CA, family C1, cathepsin B-like cy... 50 2e-05 UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, wh... 
50 2e-05
UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, who... 50 2e-05
UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcop... 50 3e-05
UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; The... 50 3e-05
UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor; ... 50 4e-05
UniRef50_Q6E7B6 Cluster: Cathepsin L-like cysteine proteinase; n... 50 4e-05
UniRef50_O96164 Cluster: Cysteine protease, putative; n=1; Plasm... 50 4e-05
UniRef50_A0CHZ5 Cluster: Chromosome undetermined scaffold_186, w... 50 4e-05
UniRef50_Q00W63 Cluster: Cysteine proteinase; n=2; Ostreococcus|... 49 6e-05
UniRef50_Q8I8D6 Cluster: Cysteine protease 12; n=1; Entamoeba hi... 49 6e-05
UniRef50_Q7QVE9 Cluster: GLP_542_3431_1206; n=1; Giardia lamblia... 49 6e-05
UniRef50_Q5BTK3 Cluster: SJCHGC00358 protein; n=1; Schistosoma j... 49 6e-05
UniRef50_O96167 Cluster: Cysteine protease, putative; n=1; Plasm... 49 6e-05
UniRef50_Q61CG7 Cluster: Putative uncharacterized protein CBG129... 49 7e-05
UniRef50_Q1AMF2 Cluster: Cathepsin C2; n=1; Toxoplasma gondii|Re... 49 7e-05
UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Pl... 49 7e-05
UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;... 48 1e-04
UniRef50_Q9LFI9 Cluster: Putative uncharacterized protein F2K13_... 48 1e-04
UniRef50_Q7RMW5 Cluster: Papain family cysteine protease, putati... 48 1e-04
UniRef50_Q7JYA0 Cluster: RE20049p; n=2; Sophophora|Rep: RE20049p... 48 1e-04
UniRef50_Q3L7L2 Cluster: Sar s 1 allergen SMIPP-C Yv6008G08; n=2... 48 1e-04
UniRef50_A1SVF0 Cluster: Peptidase C1A, papain; n=1; Psychromona... 48 1e-04
UniRef50_A7QDM1 Cluster: Chromosome chr10 scaffold_81, whole gen... 48 1e-04
UniRef50_Q7Q9Y5 Cluster: ENSANGP00000012222; n=1; Anopheles gamb... 48 1e-04
UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1; ... 48 1e-04
UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3; Theil...
48 1e-04
UniRef50_Q7QQ83 Cluster: GLP_42_16392_14707; n=1; Giardia lambli... 48 2e-04
UniRef50_Q4UCF5 Cluster: Cysteine proteinase, tacP, putative; n=... 48 2e-04
UniRef50_Q4N5Z7 Cluster: Cysteine proteinase, putative; n=2; The... 48 2e-04
UniRef50_Q3L7L6 Cluster: Sar s 1 allergen Yv5032C08; n=1; Sarcop... 48 2e-04
UniRef50_Q18740 Cluster: Putative uncharacterized protein tag-32... 48 2e-04
UniRef50_Q650X9 Cluster: Putative cysteine proteinase; n=1; Oryz... 47 2e-04
UniRef50_A6N590 Cluster: Extracellular cysteine protease 8; n=10... 47 2e-04
UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115, w... 47 2e-04
UniRef50_Q9TY95 Cluster: Serine-repeat antigen protein precursor... 47 2e-04
UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin ... 47 3e-04
UniRef50_O96165 Cluster: Cysteine protease, putative; n=1; Plasm... 47 3e-04
UniRef50_O62484 Cluster: Putative uncharacterized protein; n=1; ... 47 3e-04
UniRef50_Q649T1 Cluster: Cathepsin C; n=1; uncultured archaeon G... 47 3e-04
UniRef50_Q3A8I3 Cluster: Putative serine protease; n=1; Pelobact... 46 4e-04
UniRef50_Q9U7F7 Cluster: Cysteine protease; n=2; Entamoeba histo... 46 4e-04
UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep: ... 46 4e-04
UniRef50_Q4XZE6 Cluster: Preprocathepsin c, putative; n=6; Plasm... 46 4e-04
UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-li... 46 4e-04
UniRef50_O65214 Cluster: Cysteine protease; n=2; Volvox carteri ... 46 5e-04
UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: ... 46 5e-04
UniRef50_Q5CM16 Cluster: P3ECSL-related; n=2; Cryptosporidium|Re... 46 5e-04
UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1; O... 46 7e-04
UniRef50_Q9XW98 Cluster: Putative uncharacterized protein; n=1; ... 46 7e-04
UniRef50_Q4UC83 Cluster: Cysteine proteinase, putative; n=2; The... 46 7e-04
UniRef50_Q01FL5 Cluster: Cysteine protease-5; n=3; Ostreococcus|...
45 9e-04
UniRef50_Q93512 Cluster: Putative uncharacterized protein; n=2; ... 45 9e-04
UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; ... 45 9e-04
UniRef50_Q4YCM9 Cluster: Cysteine protease, putative; n=5; Plasm... 45 9e-04
UniRef50_A5KBM0 Cluster: Serine-repeat antigen (SERA), putative;... 45 9e-04
UniRef50_UPI0000E4622C Cluster: PREDICTED: hypothetical protein;... 45 0.001
UniRef50_A4S004 Cluster: Predicted protein; n=2; Ostreococcus|Re... 45 0.001
UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n... 44 0.002
UniRef50_Q7RSR1 Cluster: Papain family cysteine protease, putati... 44 0.002
UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcop... 44 0.002
UniRef50_Q8QNJ8 Cluster: EsV-1-75; n=1; Ectocarpus siliculosus v... 44 0.002
UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2; ... 44 0.002
UniRef50_Q5DEC9 Cluster: SJCHGC06356 protein; n=1; Schistosoma j... 44 0.002
UniRef50_Q9NHY1 Cluster: Cysteine protease cp2; n=1; Theileria c... 44 0.003
UniRef50_UPI00006CB77B Cluster: Papain family cysteine protease ... 43 0.005
UniRef50_Q3L7L0 Cluster: Sar s 1 allergen SMIPP-C Yv5009F04; n=3... 43 0.005
UniRef50_A0DCA5 Cluster: Chromosome undetermined scaffold_45, wh... 43 0.005
UniRef50_O96163 Cluster: Cysteine protease, putative; n=5; Plasm... 42 0.006
UniRef50_A5KBM7 Cluster: Serine-repeat antigen 4; n=1; Plasmodiu... 42 0.006
UniRef50_A0LES1 Cluster: Peptidase C1A, papain precursor; n=2; S... 42 0.008
UniRef50_Q8I3C0 Cluster: Papain family cysteine protease, putati... 42 0.008
UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1; ... 42 0.008
UniRef50_Q26015 Cluster: Serine rich protein homologue; n=4; Pla... 42 0.008
UniRef50_A5KBM1 Cluster: Serine-repeat antigen; n=1; Plasmodium ... 42 0.008
UniRef50_Q8TQM7 Cluster: Putative uncharacterized protein; n=1; ... 42 0.008
UniRef50_UPI000155C322 Cluster: PREDICTED: similar to cathepsin ...
42 0.011 UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,... 42 0.011 UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ... 42 0.011 UniRef50_A5VDP2 Cluster: Peptidase C1A, papain; n=1; Sphingomona... 42 0.011 UniRef50_A2XHS0 Cluster: Putative uncharacterized protein; n=2; ... 42 0.011 UniRef50_A5KBM2 Cluster: Serine-repeat antigen; n=3; Plasmodium|... 42 0.011 UniRef50_UPI00006CBAC7 Cluster: Papain family cysteine protease ... 41 0.015 >UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue of Sarcophaga 26,29kDa proteinase; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to homologue of Sarcophaga 26,29kDa proteinase - Nasonia vitripennis Length = 553 Score = 268 bits (656), Expect = 7e-71 Identities = 114/152 (75%), Positives = 132/152 (86%), Gaps = 1/152 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GGEDFR+YQWI+KHG LPTEE+YGGYLGQDGYCHI NVT I K+ G+VNV TNN +A+KL Sbjct: 400 GGEDFRSYQWIIKHGGLPTEEEYGGYLGQDGYCHIKNVTQIAKLKGFVNVDTNNVDAMKL 459 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 ALFKHGPISVAIDA+HKTFSFYSNGVY+EP C N + LDHAVLAVGYG +NG +WL+K Sbjct: 460 ALFKHGPISVAIDASHKTFSFYSNGVYYEPACGNTENSLDHAVLAVGYGTINGKGFWLIK 519 Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYVL 462 NSWSN WGNDGY+LM+ + NNCGV +APTY + Sbjct: 520 NSWSNYWGNDGYILMAQKNNNCGVMTAPTYAI 551 >UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-PA - Drosophila melanogaster (Fruit fly) Length = 549 Score = 252 bits (618), Expect = 3e-66 Identities = 108/151 (71%), Positives = 128/151 (84%), Gaps = 1/151 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GGEDFR YQW+++ G +PTEE+YG YLGQDGYCH++NVT + I G+VNVT+N+ NA KL Sbjct: 397 GGEDFRVYQWMLQSGGVPTEEEYGPYLGQDGYCHVNNVTLVAPIKGFVNVTSNDPNAFKL 456 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 AL KHGP+SVAIDA+ KTFSFYS+GVY+EP CKN VD LDHAVLAVGYG +NG YWLVK Sbjct: 457 
ALLKHGPLSVAIDASPKTFSFYSHGVYYEPTCKNDVDGLDHAVLAVGYGSINGEDYWLVK 516 Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYV 459 NSWS WGNDGY+LMS ++NNCGV + PTYV Sbjct: 517 NSWSTYWGNDGYILMSAKKNNCGVMTMPTYV 547 >UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; n=2; Danio rerio|Rep: hypothetical protein LOC550326 - Danio rerio Length = 531 Score = 225 bits (550), Expect = 5e-58 Identities = 90/153 (58%), Positives = 124/153 (81%), Gaps = 1/153 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GGE++RA++WIMKHG + T E YG Y+G +G CH D + + ++TG+ NVT+ + ALK Sbjct: 378 GGEEWRAFEWIMKHGGISTAESYGAYMGMNGLCHYDKTSMVAQLTGYTNVTSGDILALKA 437 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 A+FK GP++V+IDAAH++F+FYSNGVY+EP+CKN +++LDHAVLAVGYG++N YWLVK Sbjct: 438 AIFKFGPVAVSIDAAHRSFAFYSNGVYYEPECKNGINDLDHAVLAVGYGIMNNESYWLVK 497 Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 NSWS+ WGNDGY+LMSM++NNCGV + Y + Sbjct: 498 NSWSSYWGNDGYILMSMKDNNCGVATDAIYATL 530 >UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 21 SCAF14577, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 478 Score = 220 bits (538), Expect = 1e-56 Identities = 92/153 (60%), Positives = 123/153 (80%), Gaps = 1/153 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GGE++RAY+WIMKHG + + E YG YLG +G+CH+++ +I + NVT+ + ALKL Sbjct: 325 GGEEWRAYEWIMKHGGIASAETYGPYLGMNGFCHVNSSELTAQIQSYTNVTSGDALALKL 384 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 ALFK+GP++V+IDA+H++F FYSNGVY+EP C + V++LDHAVLAVGYG LNG YWL+K Sbjct: 385 ALFKNGPVAVSIDASHRSFVFYSNGVYYEPACGSTVEDLDHAVLAVGYGNLNGEPYWLIK 444 Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 NSWS WGNDGY+LMSM++NNCGV + TYV + Sbjct: 445 NSWSTYWGNDGYILMSMKDNNCGVTTDATYVTL 477 Score = 46.0 bits (104), Expect = 5e-04 Identities = 19/31 (61%), Positives = 24/31 (77%), Gaps = 
1/31 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDG 99 GGE++RAY+WIMKH G+ + E YG YLG G Sbjct: 271 GGEEWRAYEWIMKHGGIASAETYGPYLGMTG 301 >UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin l; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin l - Strongylocentrotus purpuratus Length = 489 Score = 197 bits (480), Expect = 1e-49 Identities = 88/152 (57%), Positives = 111/152 (73%), Gaps = 2/152 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GGE++R Y+W+MK+G +P EE YG YLGQ+G CH D A+ I + NVT+ N+ LK Sbjct: 333 GGEEWRVYEWLMKNGGIPLEETYGPYLGQNGMCHYDKSKAVASIKKYYNVTSGNQKDLKK 392 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLV 363 AL GPI+V IDAA +FSFYS G Y++ C N VD+LDHAVLAVGYG +G YWL+ Sbjct: 393 ALATKGPIAVGIDAAVPSFSFYSYGTYYDASCGNTVDDLDHAVLAVGYGTDSSGQDYWLI 452 Query: 364 KNSWSNMWGNDGYVLMSMRENNCGVQSAPTYV 459 KNSWS WGN+GYV +SM++NNCGV +A TYV Sbjct: 453 KNSWSTHWGNNGYVAISMKDNNCGVATAATYV 484 >UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 513 Score = 164 bits (399), Expect = 9e-40 Identities = 69/144 (47%), Positives = 100/144 (69%), Gaps = 1/144 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG +RA QWI+KHG L TEE YG YL Q+GYCH N + ++ ++++ N + LKL Sbjct: 362 GGYPYRAMQWILKHGGLATEESYGRYLAQEGYCHFKNTSIGARLDKYMSIRQGNTSQLKL 421 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 A+ +GP+S+ ++ KTF FY +G+Y++ +C + LDHA LAVGYG G YW+VK Sbjct: 422 AVAFYGPVSILVNTQPKTFKFYGSGIYYDTQCTH---ALDHAALAVGYGEEKGVSYWIVK 478 Query: 367 NSWSNMWGNDGYVLMSMRENNCGV 438 NSWS MWG +GY+ ++M+++NCGV Sbjct: 479 NSWSAMWGEEGYIKIAMKDDNCGV 502 >UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1; Dictyostelium discoideum AX4|Rep: Counting factor associated protein - Dictyostelium discoideum AX4 Length = 531 Score = 155 bits (375), Expect = 7e-37 Identities = 
78/153 (50%), Positives = 102/153 (66%), Gaps = 3/153 (1%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAI-TKITGWVNVTTNNENAL 180 GGG A+Q++M+ G L TE +Y YL Q+G C VT ITG+VNVT+ +E+AL Sbjct: 374 GGGFASSAFQYVMEIGSLATESNYP-YLMQNGLCRDRTVTPSGVSITGYVNVTSGSESAL 432 Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360 + A+ GP+++AIDA+ F +Y +GVY P CKN +D+LDH VLA+GYG G Y+L Sbjct: 433 QNAIATTGPVAIAIDASVDDFRYYMSGVYNNPACKNGLDDLDHEVLAIGYGTYQGQDYFL 492 Query: 361 VKNSWSNMWGNDGYVLMSMRENN-CGVQSAPTY 456 VKNSWS WG DGYV M+ +NN CGV S TY Sbjct: 493 VKNSWSTNWGMDGYVYMARNDNNLCGVSSQATY 525 >UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 392 Score = 143 bits (347), Expect = 2e-33 Identities = 64/150 (42%), Positives = 94/150 (62%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG ++A+ W+ K G+ T + YG Y GQ+G+C N+T +IT + V N ALK A Sbjct: 248 GGWTWKAFSWVKKFGIATTKSYGHYRGQEGFCKTSNLTVGARITSYRRVKRFNPIALKKA 307 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 L HGP +++I+A K+ FYS+G+ + C NK DHAVL +GYG NG YWL+KN Sbjct: 308 LSYHGPATISINANPKSLKFYSDGIMSDKHCSNKT---DHAVLLIGYGSDNGVPYWLIKN 364 Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTYV 459 SWS+ WGN+G++ +++ CG++ P V Sbjct: 365 SWSHKWGNNGFI--KIKQGLCGIEKRPFVV 392 >UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin L - Misgurnus mizolepis (Mud loach) Length = 337 Score = 142 bits (345), Expect = 3e-33 Identities = 73/159 (45%), Positives = 104/159 (65%), Gaps = 7/159 (4%) Frame = +1 Query: 10 GGEDFRAYQWIM-KHGLPTEEDYGGYLGQDGY-CHIDNVTAITKITGWVNVTTNNENALK 183 GG +A+Q+I +GL +EE Y YLG D CH D TG+V++ + E+AL Sbjct: 182 GGLMDQAFQYIKDNNGLDSEEAYP-YLGTDDQPCHYDPKYNAANDTGFVDIPSGKEHALM 240 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV----LNGHK 351 A+ GP+SVAIDA H++F FY +G+YFE +C + +ELDH VL VGYG ++G K Sbjct: 241 
KAVASVGPVSVAIDAGHESFQFYQSGIYFEKECSS--EELDHGVLVVGYGFEGEDVDGKK 298 Query: 352 YWLVKNSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465 YW+VKNSWS WG+ GY+ M+ R+N+CG+ +A +Y L+ Sbjct: 299 YWIVKNSWSESWGDKGYIYMAKDRKNHCGIATAASYPLV 337 >UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsin L - Suberites domuncula (Sponge) Length = 324 Score = 142 bits (344), Expect = 4e-33 Identities = 68/155 (43%), Positives = 92/155 (59%), Gaps = 1/155 (0%) Frame = +1 Query: 4 RGGGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALK 183 +GG D I HG+ TE Y Y +DGYC + T + ++ +E++L Sbjct: 173 KGGIMDDAFRYVISNHGVDTESSYP-YTAKDGYCRFNQNNVGATETSYRDIARGSESSLT 231 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363 A + GPISVAIDA+H++F FY NGVY+EP C + LDH VL VGYG G Y++V Sbjct: 232 QASAQIGPISVAIDASHRSFQFYKNGVYYEPSCSS--SRLDHGVLVVGYGTEGGQDYFIV 289 Query: 364 KNSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465 KNSW WG DGY++MS R NNCG+ S +Y ++ Sbjct: 290 KNSWGTRWGMDGYIMMSRNRRNNCGIASQASYPIV 324 >UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precursor; n=3; Metazoa|Rep: Digestive cysteine proteinase 2 precursor - Homarus americanus (American lobster) Length = 323 Score = 140 bits (339), Expect = 2e-32 Identities = 70/154 (45%), Positives = 94/154 (61%), Gaps = 2/154 (1%) Frame = +1 Query: 10 GGEDFRAYQWIM-KHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG A+ +I +G+ TE Y Y +DG C D+ + +G N+ + +E L+ Sbjct: 173 GGWMNDAFDYIKANNGIDTEAAYP-YEARDGSCRFDSNSVAATCSGHTNIASGSETGLQQ 231 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 A+ GPISV IDAAH +F FYS+GVY+EP C LDHAVLAVGYG G +WLVK Sbjct: 232 AVRDIGPISVTIDAAHSSFQFYSSGVYYEPSCSPSY--LDHAVLAVGYGSEGGQDFWLVK 289 Query: 367 NSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465 NSW+ WG+ GY+ MS R NNCG+ + +Y L+ Sbjct: 290 NSWATSWGDAGYIKMSRNRNNNCGIATVASYPLV 323 >UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; Entamoeba|Rep: Cysteine proteinase 2 precursor - Entamoeba histolytica Length = 315 Score = 136 bits (330), Expect = 2e-31 
Identities = 66/149 (44%), Positives = 87/149 (58%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG Y +I++HG+ E DY Y G D C NV + KITG+ V NNE LK A Sbjct: 163 GGLGSNVYDYIIEHGVAKESDYP-YTGSDSTCKT-NVKSFAKITGYTKVPRNNEAELKAA 220 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 L G + V+IDA+ F Y +G Y + KCKN L+H V AVGYGV++G + W+V+N Sbjct: 221 L-SQGLVDVSIDASSAKFQLYKSGAYTDTKCKNNYFALNHEVCAVGYGVVDGKECWIVRN 279 Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTY 456 SW WG+ GY+ M + N CGV + P Y Sbjct: 280 SWGTGWGDKGYINMVIEGNTCGVATDPLY 308 >UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 514 Score = 134 bits (324), Expect = 1e-30 Identities = 62/147 (42%), Positives = 86/147 (58%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG +A WI HG+ + E YG YLGQ+G C I+ + I + V N ALK++ Sbjct: 369 GGYYNKAMSWIYLHGIASAESYGPYLGQEGTCRIEGLRRAAAIDAFAFVPKYNNTALKIS 428 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 + + GP V+I+ + FYS G+Y +P+C + H+VL VGYGV +G YWLVKN Sbjct: 429 VARFGPAVVSINENPLSLKFYSWGLYDDPECGRDTAAV-HSVLVVGYGVEDGEPYWLVKN 487 Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAP 450 SWS WG DGY+ ++ + N CGV P Sbjct: 488 SWSTTWGMDGYIKIAWKRNTCGVTRNP 514 >UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain] - Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina) Length = 339 Score = 132 bits (320), Expect = 3e-30 Identities = 67/152 (44%), Positives = 93/152 (61%), Gaps = 3/152 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG A+++I +G + TE+ Y Y G D CH + T TG+V++ +E +K Sbjct: 188 GGLMDNAFRYIKDNGGIDTEKSYP-YEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKK 246 Query: 187 
ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLV 363 A+ GP+SVAIDA+H++F YS GVY EP+C + LDH VL VGYG +G YWLV Sbjct: 247 AVATMGPVSVAIDASHESFQLYSEGVYNEPECDEQ--NLDHGVLVVGYGTDESGMDYWLV 304 Query: 364 KNSWSNMWGNDGYVLMSMRENN-CGVQSAPTY 456 KNSW WG GY+ M+ +NN CG+ +A +Y Sbjct: 305 KNSWGTTWGEQGYIKMARNQNNQCGIATASSY 336 >UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome shotgun sequence; n=3; Tetraodontidae|Rep: Chromosome 12 SCAF14996, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 362 Score = 132 bits (319), Expect = 5e-30 Identities = 66/149 (44%), Positives = 94/149 (63%), Gaps = 6/149 (4%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG +A+Q+I +G L +E Y D CH D TG+V+V + +E AL Sbjct: 214 GGLMDQAFQYIKDNGGLDSEASYPYLATDDQPCHYDPSNNSANETGFVDVPSGSERALMK 273 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV----LNGHKY 354 A+ GP+SVAIDA H++F FY +G+Y+E +C + +ELDH VL VGYG ++G K+ Sbjct: 274 AVASVGPVSVAIDAGHESFQFYQSGIYYEKECSS--EELDHGVLVVGYGFQGEDVDGKKF 331 Query: 355 WLVKNSWSNMWGNDGYVLMSM-RENNCGV 438 W+VKNSWS WGN GY+ M+ R+N+CG+ Sbjct: 332 WIVKNSWSENWGNKGYIYMAKDRKNHCGI 360 >UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata|Rep: Cathepsin L2 precursor - Homo sapiens (Human) Length = 334 Score = 131 bits (317), Expect = 8e-30 Identities = 69/155 (44%), Positives = 93/155 (60%), Gaps = 6/155 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG RA+Q++ ++G L +EE Y Y+ D C ++ TG+ V E AL Sbjct: 180 GGFMARAFQYVKENGGLDSEESYP-YVAVDEICKYRPENSVANDTGFTVVAPGKEKALMK 238 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV----LNGHKY 354 A+ GPISVA+DA H +F FY +G+YFEP C +K LDH VL VGYG N KY Sbjct: 239 AVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSK--NLDHGVLVVGYGFEGANSNNSKY 296 Query: 355 WLVKNSWSNMWGNDGYVLMSMRENN-CGVQSAPTY 456 WLVKNSW WG++GYV ++ +NN CG+ +A +Y Sbjct: 297 WLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 331 >UniRef50_Q3ZCX6 Cluster: Cathepsin L-like 
cysteine proteinase; n=3; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Longidorus elongatus Length = 358 Score = 130 bits (314), Expect = 2e-29 Identities = 68/155 (43%), Positives = 96/155 (61%), Gaps = 3/155 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG A+Q++ + G+ TE Y Y G+DG C + TG+V++ NE L+ Sbjct: 205 GGYMDGAFQYVETNKGIDTEASYP-YKGRDGRCRFKSEDVGATDTGFVDIPEGNETLLEA 263 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGY-GVLNGHKYWLV 363 A+ GP+SVAIDAA F FYS+GVY++ C + LDH VLAVGY +G +Y++V Sbjct: 264 AIATVGPVSVAIDAASFKFQFYSHGVYYDRSC--SPEYLDHGVLAVGYNSTKDGKQYYIV 321 Query: 364 KNSWSNMWGNDGYVLMSMRE-NNCGVQSAPTYVLI 465 KNSWS WG+DGY+LMS R+ NNCG+ + +Y + Sbjct: 322 KNSWSEDWGDDGYILMSRRKNNNCGIATMASYPFV 356 >UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Cathepsin K - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 333 Score = 129 bits (311), Expect = 4e-29 Identities = 64/153 (41%), Positives = 91/153 (59%), Gaps = 3/153 (1%) Frame = +1 Query: 7 GGGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALK 183 GGG A++++ + G+ +EE Y Y+G D C + G+ + NE AL Sbjct: 181 GGGYMTNAFRYVSNNQGIDSEESYP-YVGTDQQCAYNTSGVAASCRGYKEIPQGNERALT 239 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL-NGHKYWL 360 A+ GP+SV IDA TF +Y +GVY++P C NK ++++HAVLAVGYG G KYW+ Sbjct: 240 AAVANVGPVSVGIDAMQSTFLYYKSGVYYDPNC-NK-EDVNHAVLAVGYGATPRGKKYWI 297 Query: 361 VKNSWSNMWGNDGYVLMSMRENN-CGVQSAPTY 456 VKNSW WG GYVLM+ NN CG+ + ++ Sbjct: 298 VKNSWGEEWGKKGYVLMARNRNNACGIANLASF 330 >UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 2 - Rhipicephalus appendiculatus (Brown ear tick) Length = 564 Score = 127 bits (307), Expect = 1e-28 Identities = 72/154 (46%), Positives = 91/154 (59%), Gaps = 4/154 (2%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNV-TAITKITGWVNVTTNNENALKL 186 GGEDFRAY++I HGL ++EDYG Y+GQDG CH V + I+ I +VN+T N + L Sbjct: 411 
GGEDFRAYEYIADHGLASDEDYGAYIGQDGVCHDSKVNSTISSIKSYVNIT--NRDDLPT 468 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVL-AVGYGVLNGHKYWLV 363 AL GP+SV+IDAA ++FSFY P D LDH+VL L G YW V Sbjct: 469 ALANVGPVSVSIDAALRSFSFYPTVSSMIPTAAMDTDSLDHSVLRQSATRTLQGEPYWGV 528 Query: 364 KNSWSNMWGN-DGYVLMSMRENNC-GVQSAPTYV 459 KNSW + G GYVL+S + GV + TYV Sbjct: 529 KNSWVYLLGEMMGYVLISPKGTTTGGVATQGTYV 562 >UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma japonicum|Rep: SJCHGC06231 protein - Schistosoma japonicum (Blood fluke) Length = 372 Score = 127 bits (306), Expect = 2e-28 Identities = 62/158 (39%), Positives = 98/158 (62%), Gaps = 6/158 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDG----YCHIDNVTAITKITGWVNVTTNNEN 174 GG A+Q++ + G+ +E Y Y+ DG C ++ + ++TG++N+ +E Sbjct: 216 GGLMDLAFQYVRDNKGIDSEISYP-YISGDGDENVRCLFNSTNIMAQVTGYINIHEGDER 274 Query: 175 ALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKY 354 AL A+ GP+SVAI+A +FS Y +G+Y +P+C + ++LDH VL VGYG+ +G Y Sbjct: 275 ALMNAVATIGPVSVAINAGLPSFSMYKSGIYSDPECASASEDLDHGVLLVGYGIEDGKPY 334 Query: 355 WLVKNSWSNMWGNDGYV-LMSMRENNCGVQSAPTYVLI 465 WL+KNSW WG+ GYV ++ +N CGV SA +Y L+ Sbjct: 335 WLIKNSWGEDWGDKGYVKILKDSKNMCGVASAASYPLV 372 >UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra erinaceieuropaei|Rep: Cysteine proteinase - Spirometra erinaceieuropaei (Tapeworm) Length = 336 Score = 126 bits (303), Expect = 4e-28 Identities = 62/150 (41%), Positives = 84/150 (56%), Gaps = 1/150 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG +A+Q+ ++G+ E DY Y +DG C + +TG+ + +E L+ A Sbjct: 187 GGLMPQAFQYAQRYGVEAEVDYR-YTERDGVCRYRQDLVVANVTGYAELPEGDEGGLQRA 245 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 + GPISV IDAA F YS+GV+ C +DH VL VGYG NG YWLVKN Sbjct: 246 VATIGPISVGIDAADPGFMSYSHGVFVSKTCSPYA--IDHGVLVVGYGAENGDAYWLVKN 303 Query: 370 SWSNMWGNDGYVLMSMRENN-CGVQSAPTY 456 SW + WG DGY+ M+ NN CG+ S +Y Sbjct: 304 SWGSSWGEDGYLKMARNRNNMCGIASMASY 333 
>UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine protease; n=1; Maconellicoccus hirsutus|Rep: Putative cathepsin L-like cysteine protease - Maconellicoccus hirsutus (hibiscus mealybug) Length = 339 Score = 126 bits (303), Expect = 4e-28 Identities = 55/153 (35%), Positives = 87/153 (56%), Gaps = 1/153 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG +Y +I K G ++ Y + C +T+++G + + E L + Sbjct: 187 GGNQHHSYFYIYKQGGVDDDVSYPYKDAEEPCAFKKENVVTRVSGEITLPDGYETNLHES 246 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 + +GP++ IDA H++F Y G+YFEP C NK DE++H VL VGYG NG YW+VKN Sbjct: 247 VAVYGPVAATIDATHQSFHSYKGGIYFEPDCGNKKDEVNHGVLVVGYGSENGQDYWIVKN 306 Query: 370 SWSNMWGNDGYVLMSMRENN-CGVQSAPTYVLI 465 S+ WG DGY+ M+ +NN CG+ ++ + ++ Sbjct: 307 SYGTDWGEDGYIRMARNKNNHCGIATSASVPML 339 >UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; Phytophthora infestans|Rep: Cathepsin-like cysteine protease - Phytophthora infestans (Potato late blight fungus) Length = 376 Score = 125 bits (302), Expect = 5e-28 Identities = 66/156 (42%), Positives = 87/156 (55%), Gaps = 4/156 (2%) Frame = +1 Query: 10 GGEDFRAYQWIM---KHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENAL 180 GGE Y+ I+ K + EE Y G C+ + AI T + NVT+ +E AL Sbjct: 199 GGEMSEGYEEIITNHKGKIDREEVYRYTAESKGVCNAKDDKAIGHFTSYANVTSGDEAAL 258 Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360 + A+ G +VAIDA+ TF Y +GVY P C N D LDH V A GYGV YWL Sbjct: 259 QAAIATKGVQAVAIDASSFTFQLYRHGVYSWPLCGNAPDALDHGVAAAGYGVYKKKDYWL 318 Query: 361 VKNSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465 VKNSW N WG GY++MS ++N CG+ + TY ++ Sbjct: 319 VKNSWGNSWGMKGYIMMSRNKDNQCGIATDATYPIM 354 >UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|Rep: Cysteine proteinase - Acanthamoeba healyi Length = 330 Score = 125 bits (302), Expect = 5e-28 Identities = 65/152 (42%), Positives = 92/152 (60%), Gaps = 2/152 (1%) Frame = +1 Query: 7 
GGGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALK 183 GG D+ A+++I+ + G+ TE Y C + +TG+ +VT+ +ENAL Sbjct: 180 GGLMDY-AFEYIINNRGIDTEASYPYQTAGPLTCQYNAANKGGSLTGYTDVTSGDENALL 238 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363 A K P+SVAIDA+H +F FYS GVY+E C + +LDH VL VG+G NG +W V Sbjct: 239 NAAVKE-PVSVAIDASHNSFQFYSGGVYYESACSST--QLDHGVLVVGWGSENGQDFWWV 295 Query: 364 KNSWSNMWGNDGYVLMSMRE-NNCGVQSAPTY 456 KNSW WG +GY+ MS + NNCG+ +A +Y Sbjct: 296 KNSWGASWGLNGYIKMSRNQNNNCGIATAASY 327 >UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep: Cysteine proteinase - Entamoeba histolytica Length = 320 Score = 124 bits (300), Expect = 9e-28 Identities = 65/150 (43%), Positives = 88/150 (58%), Gaps = 1/150 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG + + ++G+ E+DY Y +G C D I K G V V NE AL A Sbjct: 166 GGSILYVFAYTKRNGVIEEKDYP-YTATNGTCQYDADKIIVKNAGQVIVEQRNEVALVEA 224 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 + + GP++VAIDA +F Y +GVY EPKCK + L+HAV AVGYG +G Y++V+N Sbjct: 225 IAE-GPVAVAIDAGQASFQLYKSGVYDEPKCKKVI--LNHAVCAVGYGSQDGQDYYIVRN 281 Query: 370 SWSNMWGNDGYVLMSMRENN-CGVQSAPTY 456 SW WG DGY+LMS +NN CG+ + Y Sbjct: 282 SWGTSWGMDGYILMSRNKNNQCGIANDAIY 311 >UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (Major excreted protein) (MEP) [Contains: Cathepsin L heavy chain; Cathepsin L light chain]; n=19; Eukaryota|Rep: Cathepsin L precursor (EC 3.4.22.15) (Major excreted protein) (MEP) [Contains: Cathepsin L heavy chain; Cathepsin L light chain] - Homo sapiens (Human) Length = 333 Score = 124 bits (300), Expect = 9e-28 Identities = 68/156 (43%), Positives = 95/156 (60%), Gaps = 6/156 (3%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALK 183 GG D+ A+Q++ +G L +EE Y Y + C + ++ TG+V++ E AL Sbjct: 180 GGLMDY-AFQYVQDNGGLDSEESYP-YEATEESCKYNPKYSVANDTGFVDIP-KQEKALM 236 Query: 184 
LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV----LNGHK 351 A+ GPISVAIDA H++F FY G+YFEP C + +++DH VL VGYG + +K Sbjct: 237 KAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSS--EDMDHGVLVVGYGFESTESDNNK 294 Query: 352 YWLVKNSWSNMWGNDGYVLMSM-RENNCGVQSAPTY 456 YWLVKNSW WG GYV M+ R N+CG+ SA +Y Sbjct: 295 YWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASY 330 >UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropicalis|Rep: LOC594890 protein - Xenopus tropicalis (Western clawed frog) (Silurana tropicalis) Length = 355 Score = 123 bits (297), Expect = 2e-27 Identities = 59/144 (40%), Positives = 87/144 (60%), Gaps = 1/144 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG ++++I+ +G+ E +Y Y G+DG C V + T + + +E LK Sbjct: 206 GGWVVSSFRYIIDNGIELESNYP-YQGKDGKCSYTPVKKASVCTSYRQLPYGDEATLKQV 264 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 + GP+SVAIDA+ KTF Y NGVY++P C + DH+VL VGYG +G +YWLVKN Sbjct: 265 VGLMGPVSVAIDASRKTFRMYKNGVYYDPNCSSSTP--DHSVLVVGYGAEDGVEYWLVKN 322 Query: 370 SWSNMWGNDGYVLMSM-RENNCGV 438 SW +G++GY+ M+ NNCG+ Sbjct: 323 SWGTSFGDEGYIKMARNHHNNCGI 346 >UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; Dictyostelium discoideum|Rep: Cysteine proteinase 2 precursor - Dictyostelium discoideum (Slime mold) Length = 376 Score = 88.6 bits (210), Expect(2) = 3e-27 Identities = 49/111 (44%), Positives = 70/111 (63%), Gaps = 2/111 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAI-TKITGWVNVTTNNENALK 183 GG A+ +I+K+ G+ TE Y Y + G + N + I I G+VN+T +E +L+ Sbjct: 189 GGLMNNAFDYIIKNKGIDTESSYP-YTAETGSTCLFNKSDIGATIKGYVNITAGSEISLE 247 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV 336 +HGP+SVAIDA+H +F Y++G+Y+EPKC ELDH VL VGYGV Sbjct: 248 NGA-QHGPVSVAIDASHNSFQLYTSGIYYEPKC--SPTELDHGVLVVGYGV 295 Score = 55.6 bits (128), Expect(2) = 3e-27 Identities = 22/40 (55%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Frame = +1 Query: 346 HKYWLVKNSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVL 462 + YW+VKNSW WG 
GY+LMS R+NNCG+ S +Y L Sbjct: 336 NNYWIVKNSWGTSWGIKGYILMSKDRKNNCGIASVSSYPL 375 >UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]; n=11; Eutheria|Rep: Testin-2 precursor [Contains: Testin-1] - Mus musculus (Mouse) Length = 333 Score = 122 bits (294), Expect = 5e-27 Identities = 65/158 (41%), Positives = 92/158 (58%), Gaps = 6/158 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG A+Q++ +G L TEE Y Y+G C + + +V + E AL Sbjct: 180 GGFMQNAFQYVKDNGGLATEESYP-YIGPGRKCRYHAENSAANVRDFVQIP-GREEALMK 237 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV----LNGHKY 354 A+ K GPISVA+DA+H +F FY +G+Y+EP+CK L+HAVL VGYG +G+ Y Sbjct: 238 AVAKVGPISVAVDASHDSFQFYDSGIYYEPQCKRV--HLNHAVLVVGYGFEGEESDGNSY 295 Query: 355 WLVKNSWSNMWGNDGYVLMSMRENN-CGVQSAPTYVLI 465 WLVKNSW WG GY+ ++ NN CG+ + TY ++ Sbjct: 296 WLVKNSWGEEWGMKGYIKIAKDWNNHCGIATLATYPIV 333 >UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platyhelminthes|Rep: Cathepsin L-like proteinase - Echinococcus multilocularis Length = 338 Score = 121 bits (292), Expect = 9e-27 Identities = 61/154 (39%), Positives = 89/154 (57%), Gaps = 2/154 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG+ A+++ M++G +E DY Y DG C ++ +TK++ +V V E+ LKL+ Sbjct: 188 GGDMNDAFRYWMRNGAESESDYP-YTAMDGKCKFNSSKVVTKVSKFVKVPKKREDQLKLS 246 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLVK 366 + + GP+SVAIDA F Y G+Y + C + LDHAVL VGY KYW+VK Sbjct: 247 VAQVGPVSVAIDATSSGFMLYKKGIYQDNTCSQQY--LDHAVLVVGYDADKTRQKYWIVK 304 Query: 367 NSWSNMWGNDGYVLMSMRENN-CGVQSAPTYVLI 465 NSW WG GY+ M+ + N CG+ + +Y LI Sbjct: 305 NSWGEDWGQRGYIWMARDKGNMCGIATMASYPLI 338 >UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae|Rep: Cysteine proteinase - Hypera postica (alfalfa weevil) Length = 324 Score = 121 bits (291), Expect = 1e-26 Identities = 57/152 (37%), Positives = 89/152 (58%) Frame = +1 Query: 10 
GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG ++++MK GL +EE Y Y G+DG C + + +TK++ + ++ +E+AL A Sbjct: 177 GGSLDDNFKYVMKDGLQSEESYT-YKGEDGACKYNVASVVTKVSKYTSIPAEDEDALLEA 235 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 + GP+SV +DA++ S Y +G+Y + C L+HA+LAVGYG NG YW++KN Sbjct: 236 VATVGPVSVGMDASY--LSSYDSGIYEDQDCSPA--GLNHAILAVGYGTENGKDYWIIKN 291 Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 SW WG GY ++ +N CG+ Y I Sbjct: 292 SWGASWGEQGYFRLARGKNQCGISEDTVYPTI 323 >UniRef50_UPI000155637A Cluster: PREDICTED: similar to ENSANGP00000013730, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to ENSANGP00000013730, partial - Ornithorhynchus anatinus Length = 229 Score = 120 bits (289), Expect = 2e-26 Identities = 43/77 (55%), Positives = 68/77 (88%) Frame = +1 Query: 235 KTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYVLMS 414 ++F+FY+NG+Y+EP+C++K+++L+HAVL VGYGVL G +WL+KNSWS +WGN GY+L++ Sbjct: 152 RSFAFYANGIYYEPQCRHKLEQLNHAVLLVGYGVLQGQAFWLLKNSWSPLWGNSGYMLLA 211 Query: 415 MRENNCGVQSAPTYVLI 465 M++N+CGV +A TY ++ Sbjct: 212 MKDNDCGVTTAATYPIL 228 >UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: Cysteine protease - Saprolegnia parasitica Length = 523 Score = 120 bits (289), Expect = 2e-26 Identities = 66/154 (42%), Positives = 87/154 (56%), Gaps = 5/154 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG A++W+ H GL EEDY Y ++G C + +TK+T + +V N+E ALK Sbjct: 181 GGLMDNAFKWVKTHKGLCKEEDYP-YHAKEGTCALKKCKPVTKVTAFHDVPANDEQALKA 239 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 A+ K P+SVAI+A F FY +GV F+ C K LDH VL VGYG G KYW VK Sbjct: 240 AVAKQ-PVSVAIEADQPEFQFYKSGV-FDKSCGTK---LDHGVLVVGYGEEGGKKYWKVK 294 Query: 367 NSWSNMWGNDGYVLMSM----RENNCGVQSAPTY 456 NSW WG+ GY+ ++ CGV P+Y Sbjct: 295 NSWGADWGDKGYIKLAREFGPETGQCGVAMVPSY 328 >UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens 
(Human) Length = 331 Score = 120 bits (288), Expect = 3e-26 Identities = 59/151 (39%), Positives = 88/151 (58%), Gaps = 2/151 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG A+Q+I+ + G+ ++ Y Y D C D+ + + + E+ LK Sbjct: 182 GGFMTTAFQYIIDNKGIDSDASYP-YKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKE 240 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 A+ GP+SV +DA H +F Y +GVY+EP C V+ H VL VGYG LNG +YWLVK Sbjct: 241 AVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVN---HGVLVVGYGDLNGKEYWLVK 297 Query: 367 NSWSNMWGNDGYVLMSMRE-NNCGVQSAPTY 456 NSW + +G +GY+ M+ + N+CG+ S P+Y Sbjct: 298 NSWGHNFGEEGYIRMARNKGNHCGIASFPSY 328 >UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor; n=17; Magnoliophyta|Rep: Thiol protease aleurain-like precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 358 Score = 120 bits (288), Expect = 3e-26 Identities = 62/153 (40%), Positives = 89/153 (58%), Gaps = 1/153 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG +A+++I +G L TEE Y Y G+DG C ++ VN+T E+ LK Sbjct: 207 GGLPSQAFEYIKYNGGLDTEEAYP-YTGKDGGCKFSAKNIGVQVRDSVNITLGAEDELKH 265 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 A+ P+SVA + H+ F FY GV+ C N +++HAVLAVGYGV + YWL+K Sbjct: 266 AVGLVRPVSVAFEVVHE-FRFYKKGVFTSNTCGNTPMDVNHAVLAVGYGVEDDVPYWLIK 324 Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 NSW WG++GY M M +N CGV + +Y ++ Sbjct: 325 NSWGGEWGDNGYFKMEMGKNMCGVATCSSYPVV 357 >UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine proteinase precursor - Heterodera glycines (Soybean cyst nematode worm) Length = 353 Score = 118 bits (284), Expect = 8e-26 Identities = 61/155 (39%), Positives = 91/155 (58%), Gaps = 3/155 (1%) Frame = +1 Query: 10 GGEDFRAYQWIM-KHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG A++++ +GL TEE Y Y G C N T + + ++ +E LK+ Sbjct: 202 
GGLMDSAFEYVRDNNGLDTEESYP-YEAVTGKCQFKNETVGGTVVSFKDLKKGDEEQLKI 260 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGH-KYWLV 363 A+ GPISVA+DA++ +F FY GVY+E C N+ LDH VL VGYG H YWLV Sbjct: 261 AVATIGPISVALDASNLSFQFYKTGVYYERWCSNRY--LDHGVLLVGYGTDETHGDYWLV 318 Query: 364 KNSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465 KNSW WG +GY+ ++ ++N+CG+ + +Y ++ Sbjct: 319 KNSWGPHWGENGYIRIARNKQNHCGIATMASYPVV 353 >UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 326 Score = 117 bits (281), Expect = 2e-25 Identities = 62/154 (40%), Positives = 86/154 (55%), Gaps = 2/154 (1%) Frame = +1 Query: 10 GGEDFRAYQWI-MKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG +A ++I G+ +E DY Y G D C D+ KI+ + + N+E+ LK Sbjct: 175 GGYMDKALEYIETAGGIMSENDYP-YEGIDDKCRFDSSKVAAKISNFTYIKKNDEDDLKN 233 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 A+ GPISVAIDA+ F Y +G+ + C + + L+H VL VGYG YW+VK Sbjct: 234 AVIAKGPISVAIDASFN-FQLYDSGILDDSSCYSDFNSLNHGVLVVGYGTEKEQDYWIVK 292 Query: 367 NSWSNMWGNDGYVLMSMRENN-CGVQSAPTYVLI 465 NSW WG DGY+ MS +NN CG+ + TY I Sbjct: 293 NSWGADWGMDGYIWMSRNKNNQCGIATDATYPTI 326 >UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Rep: Cathepsin R precursor - Mus musculus (Mouse) Length = 334 Score = 117 bits (281), Expect = 2e-25 Identities = 61/158 (38%), Positives = 94/158 (59%), Gaps = 6/158 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG+ + A+Q+++ +G L +E Y Y G+DG C + + +ITG+V++ +E+ L Sbjct: 181 GGDTYNAFQYVLHNGGLESEATYP-YEGKDGPCRYNPKNSKAEITGFVSLP-QSEDILMA 238 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV----LNGHKY 354 A+ GPI+ IDA+H++F Y G+Y EP C + D + H VL VGYG +G+ Y Sbjct: 239 AVATIGPITAGIDASHESFKNYKGGIYHEPNCSS--DTVTHGVLVVGYGFKGIETDGNHY 296 Query: 355 WLVKNSWSNMWGNDGYVLMSMRENN-CGVQSAPTYVLI 465 WL+KNSW WG GY+ ++ +NN CG+ S Y I Sbjct: 297 
WLIKNSWGKRWGIRGYMKLAKDKNNHCGIASYAHYPTI 334 >UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n=21; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Globodera pallida Length = 379 Score = 116 bits (280), Expect = 2e-25 Identities = 64/156 (41%), Positives = 88/156 (56%), Gaps = 4/156 (2%) Frame = +1 Query: 10 GGEDFRAYQWIM-KHGLPTEEDYGGYLGQDGY-CHIDNVTAITKITGWVNVTTNNENALK 183 GG A+Q+I +G+ E DY Y + G C TG+ ++ +E LK Sbjct: 227 GGIMDNAFQYIKDNNGVDKELDYP-YKAKTGKKCLFKRNDVGATDTGFFDIAEGDEEKLK 285 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWL 360 +A+ GP SVAIDA H++F Y++GVYFE +C + LDH VL VGYG YW+ Sbjct: 286 IAVATQGPASVAIDAGHRSFQLYTHGVYFEKEC--SPENLDHGVLVVGYGTDAQQGDYWI 343 Query: 361 VKNSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465 VKNSW WG GY+ M+ R+NNCG+ S +Y L+ Sbjct: 344 VKNSWGAHWGEQGYIRMARNRKNNCGIASHASYPLV 379 >UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus|Rep: Cathepsin L - Aphrocallistes vastus Length = 329 Score = 116 bits (279), Expect = 3e-25 Identities = 57/151 (37%), Positives = 88/151 (58%) Frame = +1 Query: 4 RGGGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALK 183 +GG D+ A+++ + E DY Y ++G C + +TK + + ++ + N +ALK Sbjct: 180 QGGLMDY-AFKYWETNLAEKESDYT-YTAKNGKCKYNAQLGVTKDSSFTDIPSENCDALK 237 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363 A+ GPI+VA+DA+H +F Y +G+Y C +LDH VL VGYG NG YWL+ Sbjct: 238 EAVANKGPIAVAMDASHTSFQMYHSGIYTPFLCSKT--KLDHGVLVVGYGTDNGVDYWLI 295 Query: 364 KNSWSNMWGNDGYVLMSMRENNCGVQSAPTY 456 KNSW WG DGY + M+ + CG+ + +Y Sbjct: 296 KNSWGMAWGMDGYFKIEMKSDKCGICTQASY 326 >UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Cathepsin - Geodia cydonium (Sponge) Length = 322 Score = 116 bits (278), Expect = 4e-25 Identities = 58/154 (37%), Positives = 87/154 (56%), Gaps = 2/154 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG A+++++K+G + TE Y Y+ +D CH + + + +V++ + +E L++ Sbjct: 169 
GGLPDDAFKYVIKNGGIDTEASYP-YVARDEKCHYSSANIGSTCSSYVDIESKSEAQLQV 227 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 A GPI V IDA+H F Y GVY C LDH VL VGYGV YW+VK Sbjct: 228 ASATVGPIPVGIDASHLGFQLYDGGVYHSDLCSQT--RLDHGVLVVGYGVYKEKDYWMVK 285 Query: 367 NSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465 NSW WG G ++MS R+NNCG+ + +Y ++ Sbjct: 286 NSWGTNWGISGDMMMSRNRDNNCGIATMASYPVV 319 >UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like midgut cysteine proteinase - Tenebrio molitor (Yellow mealworm) Length = 330 Score = 116 bits (278), Expect = 4e-25 Identities = 56/150 (37%), Positives = 89/150 (59%), Gaps = 1/150 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG A+ +I +G+ +E Y Y Q YC D+ ++T ++G+ ++ + +EN+L A Sbjct: 182 GGWMDSAFSYIHDYGIMSESAYP-YEAQGDYCRFDSSQSVTTLSGYYDLPSGDENSLADA 240 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 + + GP++VAIDA + FYS G++++ C +L+H VL VGYG NG YW++KN Sbjct: 241 VGQAGPVAVAIDATDE-LQFYSGGLFYDQTCNQS--DLNHGVLVVGYGSDNGQDYWILKN 297 Query: 370 SWSNMWGNDGYVLMSMR-ENNCGVQSAPTY 456 SW + WG GY NNCG+ +A +Y Sbjct: 298 SWGSGWGESGYWRQVRNYGNNCGIATAASY 327 >UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18; Magnoliophyta|Rep: Thiol protease aleurain precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 358 Score = 115 bits (277), Expect = 6e-25 Identities = 58/153 (37%), Positives = 89/153 (58%), Gaps = 1/153 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG +A+++I +G L TE+ Y Y G+D C ++ VN+T E+ LK Sbjct: 207 GGLPSQAFEYIKSNGGLDTEKAYP-YTGKDETCKFSAENVGVQVLNSVNITLGAEDELKH 265 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 A+ P+S+A + H +F Y +GVY + C + +++HAVLAVGYGV +G YWL+K Sbjct: 266 AVGLVRPVSIAFEVIH-SFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIK 324 Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 NSW WG+ GY M M +N CG+ + +Y ++ 
Sbjct: 325 NSWGADWGDKGYFKMEMGKNMCGIATCASYPVV 357 >UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin L-like protease; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to cathepsin L-like protease - Nasonia vitripennis Length = 353 Score = 114 bits (275), Expect = 1e-24 Identities = 63/159 (39%), Positives = 90/159 (56%), Gaps = 6/159 (3%) Frame = +1 Query: 7 GGGEDFRAYQWIM-KHGLPTEEDYGGYLGQDGYCHIDNVTAITKI--TGWVNVTTNNENA 177 GGG ++Q+++ + GL E +Y Y G+ C + + ++ V +E Sbjct: 198 GGGSAALSFQFVVDQKGLEPEANYS-YEGRTKECPYNTSDDEDEELDASFIYVNGGDEAT 256 Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN--GHK 351 LK+A+ GP S AID +H TF FYS GVY++P+C D+LDHAVL VGYG N Sbjct: 257 LKVAVATVGPFSAAIDGSHDTFRFYSEGVYYQPECNE--DDLDHAVLIVGYGTDNRTDQD 314 Query: 352 YWLVKNSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465 +WLVKNSW WG GY ++ R N+CG+ +A Y +I Sbjct: 315 FWLVKNSWGETWGEGGYFKVARNRRNHCGIAAAAVYPVI 353 >UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber officinale (Ginger) Length = 475 Score = 114 bits (275), Expect = 1e-24 Identities = 59/154 (38%), Positives = 85/154 (55%), Gaps = 5/154 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTA-ITKITGWVNVTTNNENALKL 186 GG +RA+Q+I+ +G E++ Y G +G C+ A + I + NV +N+E +L+ Sbjct: 207 GGWPYRAFQYIINNGGVNSEEHYPYTGTNGTCNTTKENAHVVSIDSYRNVPSNDEKSLQK 266 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 A PISV IDA+ + F Y +G+ F C L+H V VGYG NG+ YW+VK Sbjct: 267 AAANQ-PISVGIDASGRNFQLYHSGI-FTGSCNTS---LNHGVTVVGYGTENGNDYWIVK 321 Query: 367 NSWSNMWGNDGYVLMSMR----ENNCGVQSAPTY 456 NSW WGN GY+LM CG+ +P+Y Sbjct: 322 NSWGENWGNSGYILMERNIAESSGKCGIAISPSY 355 >UniRef50_Q24HI2 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 356 Score = 114 bits (275), Expect = 1e-24 Identities = 58/153 (37%), Positives = 84/153 (54%), Gaps = 
1/153 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGW-VNVTTNNENALKL 186 GG +A+++I +G + E+ Y+ QD C T ++ G N+T +E+ LK Sbjct: 194 GGLPSQAFEYIKYNGGISYENSYYYIAQDQECQFSPETVGARVRGGSFNITQGDEDQLKQ 253 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 A+ GP+S+A F Y +GVY P C + ++HAVLAVGYG NG YW VK Sbjct: 254 AVGTVGPVSIAFQVMGD-FKLYKSGVYSNPDCSSSPQTVNHAVLAVGYGSENGVDYWYVK 312 Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 NSWS WG++GY + N CGV + +Y L+ Sbjct: 313 NSWSEFWGDEGYFKIQRGVNMCGVATCASYPLL 345 >UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [Contains: Cathepsin H mini chain; Cathepsin H heavy chain; Cathepsin H light chain]; n=37; Eukaryota|Rep: Cathepsin H precursor (EC 3.4.22.16) [Contains: Cathepsin H mini chain; Cathepsin H heavy chain; Cathepsin H light chain] - Homo sapiens (Human) Length = 335 Score = 114 bits (274), Expect = 1e-24 Identities = 56/150 (37%), Positives = 85/150 (56%), Gaps = 1/150 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG +A+++I+ + G+ E+ Y Y G+DGYC AI + N+T +E A+ Sbjct: 183 GGLPSQAFEYILYNKGIMGEDTYP-YQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVE 241 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 A+ + P+S A + + F Y G+Y C D+++HAVLAVGYG NG YW+VK Sbjct: 242 AVALYNPVSFAFEVT-QDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVK 300 Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTY 456 NSW WG +GY L+ +N CG+ + +Y Sbjct: 301 NSWGPQWGMNGYFLIERGKNMCGLAACASY 330 >UniRef50_Q4TI44 Cluster: Chromosome undetermined SCAF2412, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF2412, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 123 Score = 113 bits (272), Expect = 2e-24 Identities = 50/105 (47%), Positives = 70/105 (66%), Gaps = 2/105 (1%) Frame = +1 Query: 157 TTNNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV 336 + NE L ALFKHGP+++ IDA TF YS GVY++P C ++++HAVL VGYGV 
Sbjct: 21 SAGNEKLLAYALFKHGPVAIGIDATLTTFHLYSKGVYYDPDC--NPEDINHAVLLVGYGV 78 Query: 337 L-NGHKYWLVKNSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465 G +YW+VKNSW WG +GY+LM+ R N CG+ + +Y ++ Sbjct: 79 TRRGQQYWIVKNSWGTGWGTEGYILMARNRGNLCGIANLASYPIM 123 >UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 4 - Rhipicephalus appendiculatus (Brown ear tick) Length = 345 Score = 113 bits (272), Expect = 2e-24 Identities = 60/152 (39%), Positives = 83/152 (54%), Gaps = 3/152 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKIT--GWVNVTTNNENAL 180 GG+ A+Q++ G L TE Y G + C N +++ G V NE L Sbjct: 193 GGQMPGAFQYVQDAGGLDTEARYPYRQGTNFQCQFSNSFEARRVSVNGHTRVPPRNERVL 252 Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360 + A+ GPIS+AI+A+ +TF FY NG+Y EP C + L+HAVL VGYG G YW+ Sbjct: 253 QDAVANVGPISIAINASPQTFMFYKNGIYGEPNCDPR--GLNHAVLLVGYGEERGVPYWI 310 Query: 361 VKNSWSNMWGNDGYVLMSMRENNCGVQSAPTY 456 VKNSW WG GY+ + N CG+ P++ Sbjct: 311 VKNSWGPGWGEGGYIKILRNRNVCGMSQDPSF 342 >UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lamblia ATCC 50803|Rep: GLP_26_47548_45815 - Giardia lamblia ATCC 50803 Length = 577 Score = 113 bits (271), Expect = 3e-24 Identities = 60/158 (37%), Positives = 97/158 (61%), Gaps = 6/158 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG---LPTEEDYGGYLGQDGYCH--IDNVTAITKITGWVNVTTNNEN 174 GG+ A +W++++ + E +Y YLGQ+ C + + + +TG+ V + Sbjct: 418 GGDTLAALKWLVENNGGRVAFESEYP-YLGQNDLCKEALFDHESFYFVTGYSAVKQYSIP 476 Query: 175 ALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGH-K 351 +LK AL + GP++V+I ++ FYS GVY +P C K D+L HAVLAVGYG + + Sbjct: 477 SLKAAL-QDGPVAVSIGIT-ESLLFYSGGVYNDPACPYKYDDLSHAVLAVGYGTDDTYGD 534 Query: 352 YWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 YW+V+NSWS +WG DGY +SM++N CG+ + +Y ++ Sbjct: 535 YWIVRNSWSPLWGMDGYFYLSMKDNICGILTDASYAVV 572 >UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family 
C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 234 Score = 113 bits (271), Expect = 3e-24 Identities = 57/146 (39%), Positives = 81/146 (55%), Gaps = 1/146 (0%) Frame = +1 Query: 31 YQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGPI 210 Y I HGL ED Y + C D + K+TG+ + +NE+ LK + +GP Sbjct: 91 YVKIFMHGLFETEDNYPYQAEHHSCKFDKTRGVGKLTGY-HKCKSNEDQLKTEVAANGPY 149 Query: 211 SVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWG 390 +V I+A + F YS+GV+ PKC + LDH V +GYGV +G YWLV+NSW WG Sbjct: 150 AVMINADSEQFRLYSSGVFDNPKCGKII--LDHVVTVIGYGVEDGKDYWLVRNSWGKYWG 207 Query: 391 NDGYVLMSM-RENNCGVQSAPTYVLI 465 +GY+ MS ++N CG+ + LI Sbjct: 208 LEGYIKMSRNKDNQCGIATEAVIPLI 233 >UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 317 Score = 112 bits (269), Expect = 5e-24 Identities = 59/152 (38%), Positives = 85/152 (55%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG AY +I +GL E Y Y G DGY + + AI KI G+ ++ E ALK A Sbjct: 170 GGWPHWAYDYIKDNGLCLESKYK-YQGYDGYYCKECIPAIKKINGYSSIN-QTEEALKEA 227 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 + GPI+V ++A + + YS G+ C + ++HAVLAVGYG NG +WL+KN Sbjct: 228 VGTAGPIAVCVNA-NDDWQLYSGGILESQSCPGG-ESINHAVLAVGYGSENGKDFWLIKN 285 Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 SW+ WG +GY+ + +N CG+ Y L+ Sbjct: 286 SWNTYWGEEGYLRIVRGKNQCGINEVADYPLL 317 >UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadidae|Rep: Cysteine protease - Tritrichomonas foetus (Trichomonas foetus) Length = 315 Score = 112 bits (269), Expect = 5e-24 Identities = 54/155 (34%), Positives = 87/155 (56%), Gaps = 3/155 (1%) Frame = +1 Query: 7 GGGEDFRAYQWIM---KHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENA 177 GG D+ AY++I+ K + E DY Y DG C + + ++ + N+E Sbjct: 164 GGLMDY-AYKYIIDRQKGKMILESDYV-YTALDGVCKFAQFQTVGNVASFLYIAENDEED 
221 Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357 L + HGP++VAIDA+H++F Y +G+Y EP+C L+H V +G+G N KYW Sbjct: 222 LAANVETHGPVAVAIDASHQSFQLYKSGIYDEPEC--SATFLNHGVGCIGFGSDNDTKYW 279 Query: 358 LVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVL 462 +V NSW WG +GY+ + ++N CG+ ++ + L Sbjct: 280 IVPNSWGLTWGEEGYIRIIRKDNRCGIAASACFPL 314 >UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicatein a3 - Lubomirskia baicalensis Length = 344 Score = 112 bits (269), Expect = 5e-24 Identities = 60/156 (38%), Positives = 89/156 (57%), Gaps = 4/156 (2%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHID--NVTAITKITGWVNVTTNNENAL 180 GG+ + A+++++ +G + TE Y Y G+ C + NV AI+ TG V + + +E L Sbjct: 194 GGDVYTAFKYVVDNGGIDTESSYP-YKGKKSSCQYNSKNVGAIS--TGVVKIASGSETDL 250 Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360 A+ GPI+VA+DA+ F FY +GV+ C +L+HA+L GYG NG YWL Sbjct: 251 LSAVASVGPIAVAVDASVNAFMFYQSGVFDSSTCSTS--KLNHAMLVTGYGSTNGKDYWL 308 Query: 361 VKNSWSNMWGNDGYVLMSMRE-NNCGVQSAPTYVLI 465 VKNSW WG GY+ M + N CG+ S Y ++ Sbjct: 309 VKNSWGTGWGESGYIKMVRNKYNQCGIASDALYPML 344 >UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel protein - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 328 Score = 111 bits (268), Expect = 7e-24 Identities = 57/152 (37%), Positives = 85/152 (55%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG RA+ +++++ + Y ++G C TG+ V +NE AL+ A Sbjct: 179 GGFLSRAFLYVIQNRGIDSSTFYPYEHKEGVCRYSVSGRAGYCTGFRIVPRHNEAALQSA 238 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 + GP+SV I+A +F Y +G+Y +PKC + + ++HAVL VGYG NG YWLVKN Sbjct: 239 VANIGPVSVGINAKLLSFHRYRSGIYNDPKCSSAL--INHAVLVVGYGSENGQDYWLVKN 296 Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 SW WG +GY+ M+ +N CG+ S Y I Sbjct: 297 SWGTAWGENGYIRMARNKNMCGISSFGIYPTI 328 >UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schistosoma|Rep: Preprocathepsin cathepsin L - Schistosoma japonicum (Blood 
fluke) Length = 331 Score = 111 bits (268), Expect = 7e-24 Identities = 55/150 (36%), Positives = 86/150 (57%), Gaps = 1/150 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG A+ ++ KH + +E DY YLG D CH + K+ + ++ +E L+ A Sbjct: 182 GGTMDLAFNYLEKHYIESENDYK-YLGHDANCHYRKSKGVVKVKKFGDLPARDEKTLEKA 240 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 ++++GPISV I A + Y +G+Y CK +++H VLAVGYG NG YWL+KN Sbjct: 241 VYQYGPISVGI-VALDSLILYKSGIYESKDCKYA--DINHGVLAVGYGRENGKDYWLIKN 297 Query: 370 SWSNMWGNDGYV-LMSMRENNCGVQSAPTY 456 SW ++WG +GY L + + CG+ S ++ Sbjct: 298 SWGDLWGMNGYFKLRRNKPHMCGISSNSSF 327 >UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=17; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 318 Score = 111 bits (268), Expect = 7e-24 Identities = 61/148 (41%), Positives = 80/148 (54%), Gaps = 5/148 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKH--GL-PTEEDYGGYLGQDGYCHIDNVTAITKITGWVN-VTTNNENA 177 GG+++ AY +++KH GL E DY Y +DG C +T +V TT NE+ Sbjct: 164 GGDEYLAYDYVIKHQKGLWMLETDYP-YTARDGSCKFKAAKGVTLTKSYVRPTTTQNEDE 222 Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357 LK K G +S+AIDA+ F YS+G+Y C + LDHAV VGYG N YW Sbjct: 223 LKAGCAKGGVVSIAIDASGYDFQLYSSGIYNPKSCSSTF--LDHAVGLVGYGTENKVDYW 280 Query: 358 LVKNSWSNMWGNDGYVLMSMRE-NNCGV 438 +V+NSW WG GY+ M N CGV Sbjct: 281 IVRNSWGTSWGEKGYIRMIRNNGNKCGV 308 >UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lamblia ATCC 50803|Rep: GLP_26_49243_47612 - Giardia lamblia ATCC 50803 Length = 543 Score = 111 bits (267), Expect = 9e-24 Identities = 57/130 (43%), Positives = 82/130 (63%), Gaps = 2/130 (1%) Frame = +1 Query: 82 YLGQDGYCHIDNVTAIT-KITGWVNVTTNNENALKLALFKHGPISVAIDAAHKTFSFYSN 258 YLG + C+ T+ +I G +V + A+K AL GP+S+A+ A +TFS+YS Sbjct: 415 YLGVESLCNESIFTSDHGRIRGVAHVKEYDIGAMKYALLS-GPVSIAV-AVTETFSWYSG 472 Query: 259 
GVYFEPKCKNKVDELDHAVLAVGYGVLN-GHKYWLVKNSWSNMWGNDGYVLMSMRENNCG 435 GV+ +P C + VD+L HAVL VG+G YW+V+NSWSN WG DGY+ +SM+ N CG Sbjct: 473 GVFNDPACASGVDDLAHAVLLVGWGTDEVAGDYWIVRNSWSNAWGIDGYMYLSMKNNICG 532 Query: 436 VQSAPTYVLI 465 V + YV++ Sbjct: 533 VLTCADYVMV 542 >UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2; Taeniidae|Rep: Cathepsin L-like cysteine proteinase - Taenia solium (Pork tapeworm) Length = 339 Score = 111 bits (267), Expect = 9e-24 Identities = 58/146 (39%), Positives = 82/146 (56%), Gaps = 1/146 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG A++++ +H + E Y Y DG C + + +T ++ NE AL A Sbjct: 190 GGYMSYAFKYLEEHFIEPESAYP-YRATDGPCRYNESLGVGTVTDIGDIPEGNETALMEA 248 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 + GPIS+AIDA+ F FY +G+Y C +K L+H VLA+GYG +G YWLVKN Sbjct: 249 VATVGPISIAIDASSLGFMFYRHGIYKSHWCSSKF--LNHGVLAIGYGKQDGKPYWLVKN 306 Query: 370 SWSNMWGNDGYVLMSMRENN-CGVQS 444 SW WG GY++M+ +N CGV S Sbjct: 307 SWGTRWGMKGYIMMAKDYHNMCGVAS 332 >UniRef50_Q23EG5 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 429 Score = 111 bits (266), Expect = 1e-23 Identities = 57/153 (37%), Positives = 83/153 (54%), Gaps = 1/153 (0%) Frame = +1 Query: 10 GGEDFRAYQWIM-KHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG RA+++I G+ + DY Y G+DG C + K+ N+T +EN L Sbjct: 193 GGLPSRAFEYIAYAGGIESSRDYP-YKGKDGKCKFKPQKVVAKVQSSFNITFQDENELIY 251 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 L K+GP+S+A F Y G+Y P+C E++HAVLAVGY L G +Y++VK Sbjct: 252 HLAKNGPVSIAYQVTDD-FENYEGGIYSNPECSTDPQEVNHAVLAVGYN-LTG-RYYIVK 308 Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 NSW WG DGY + + N CG+ +Y ++ Sbjct: 309 NSWGKDWGMDGYFYIELGSNMCGLADCASYPIL 341 >UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=176; Viridiplantae|Rep: Cysteine 
proteinase RD21a precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 462 Score = 109 bits (262), Expect = 4e-23 Identities = 61/156 (39%), Positives = 90/156 (57%), Gaps = 6/156 (3%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYC-HIDNVTAITKITGWVNVTTNNENAL 180 GG D+ A+++I+K+G + T++DY Y G DG C I + I + +V T +E +L Sbjct: 202 GGLMDY-AFEFIIKNGGIDTDKDYP-YKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESL 259 Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360 K A+ H PIS+AI+A + F Y +G+ F+ C +LDH V+AVGYG NG YW+ Sbjct: 260 KKAV-AHQPISIAIEAGGRAFQLYDSGI-FDGSCGT---QLDHGVVAVGYGTENGKDYWI 314 Query: 361 VKNSWSNMWGNDGYVLMSMR----ENNCGVQSAPTY 456 V+NSW WG GY+ M+ CG+ P+Y Sbjct: 315 VRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSY 350 >UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep: Cathepsin - Petromyzon marinus (Sea lamprey) Length = 333 Score = 109 bits (261), Expect = 5e-23 Identities = 58/155 (37%), Positives = 87/155 (56%), Gaps = 3/155 (1%) Frame = +1 Query: 10 GGEDFRAYQWIM-KHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVT-TNNENALK 183 GG RA Q+I+ +G+ +E Y Y DG C TK + + V ++NE L+ Sbjct: 183 GGRSERALQYIIDNNGIDSELSYP-YEHADGKCRFKPANVATKCSSYQFVEPSSNEEVLR 241 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363 A+ GPI++A++A TF Y +G++ EP C + HA+L VGYG L+G+ +W+V Sbjct: 242 QAVASVGPIAIAMNADLDTFKHYKSGLFNEPSCDKSPN---HAMLVVGYGSLSGNDFWIV 298 Query: 364 KNSWSNMWGNDGYVLM-SMRENNCGVQSAPTYVLI 465 KNSW WG GY+ M ++N CG+ S Y +I Sbjct: 299 KNSWGEDWGEKGYIYMIRNKDNQCGIASIGIYPII 333 >UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3; Bilateria|Rep: Cathepsin L-like cysteine protease - Neobenedenia melleni Length = 335 Score = 108 bits (260), Expect = 6e-23 Identities = 61/155 (39%), Positives = 85/155 (54%), Gaps = 2/155 (1%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG D I GL +E Y Y Q C + I+ + +V+ +E LK Sbjct: 184 GGIMDNSFNYLIHNKGLESEASYP-YEAQKKECRYKKALSKGTISSFTDVSQFDEKDLKR 242 Query: 187 
ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL-NGHKYWLV 363 A+ GP+S+AIDA+ +F Y +GVY E C + L+H VLAVGYG G YW V Sbjct: 243 AVGLVGPVSIAIDASQFSFHLYDSGVYDEEDCSQTM--LNHGVLAVGYGTTPEGLDYWKV 300 Query: 364 KNSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465 KNSW+N WG +GY+LMS ++N CGV + +Y ++ Sbjct: 301 KNSWTNTWGMEGYILMSRNKDNQCGVATVASYPIV 335 >UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchocercidae|Rep: Cathepsin L-like precursor - Brugia pahangi (Filarial nematode worm) Length = 395 Score = 108 bits (260), Expect = 6e-23 Identities = 59/151 (39%), Positives = 82/151 (54%), Gaps = 2/151 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG A+Q+ ++G+ E Y Y+G + C A+ G+ + +E ALK A Sbjct: 248 GGYMPTAFQYASRYGIAMESRYP-YVGTEQRCRWQQSIAVVTDNGFNEIQPGDELALKHA 306 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGH-KYWLVK 366 + K GP+ V I + ++F FY +GVY E C DHAVLAVGYG + YW+VK Sbjct: 307 VAKRGPVVVGISGSKRSFRFYKDGVYSEGNCGRP----DHAVLAVGYGTHPSYGDYWIVK 362 Query: 367 NSWSNMWGNDGYVLMSM-RENNCGVQSAPTY 456 NSW WG DGYV M+ R N C + SA ++ Sbjct: 363 NSWGTDWGKDGYVYMARNRGNMCHIASAASF 393 >UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophyta|Rep: Cysteine protease Cp5 - Actinidia deliciosa (Kiwi) Length = 509 Score = 108 bits (259), Expect = 8e-23 Identities = 59/156 (37%), Positives = 86/156 (55%), Gaps = 6/156 (3%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNV-TAITKITGWVNVTTNNENAL 180 GG D+ A++W+M +G + TE DY Y G+DG C+ T I G+ +V E+AL Sbjct: 211 GGYMDY-AFEWVMSNGGIDTETDYP-YTGEDGTCNTTKEETKAVSIDGYEDVA-EEESAL 267 Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360 A+ K PISV ID F Y+ G+Y + C + D++DHAVL VGYG +G +YW+ Sbjct: 268 FCAVLKQ-PISVGIDGGAIDFQLYTGGIY-DGDCSDDPDDIDHAVLVVGYGAESGEEYWI 325 Query: 361 VKNSWSNMWGNDGYVLMSMRENN----CGVQSAPTY 456 +KNSW WG GY + + C + + +Y Sbjct: 326 IKNSWGTDWGMKGYAYIKRNTSKDYGVCAINAMASY 361 >UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrionidae|Rep: Cathepsin 
L-like protein - Tenebrio molitor (Yellow mealworm) Length = 336 Score = 108 bits (259), Expect = 8e-23 Identities = 60/152 (39%), Positives = 82/152 (53%), Gaps = 4/152 (2%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG A+ ++ ++G E Y DG CH D +++G+V ++ +EN L Sbjct: 187 GGWMNDAFTYVAQNGGIDSEGAYPYEMADGNCHYDPNQVAARLSGYVYLSGPDENMLADM 246 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 + GP++VA DA F YS GVY+ P C+ ++ HAVL VGYG NG YWLVKN Sbjct: 247 VATKGPVAVAFDA-DDPFGSYSGGVYYNPTCET--NKFTHAVLIVGYGNENGQDYWLVKN 303 Query: 370 SWSNMWGNDGYVLMSMRENN----CGVQSAPT 453 SW + WG DGY ++ NN GV S PT Sbjct: 304 SWGDGWGLDGYFKIARNANNHCGIAGVASVPT 335 >UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cathepsin L; n=4; Danio rerio|Rep: Novel protein similar to vertebrate cathepsin L - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 334 Score = 107 bits (258), Expect = 1e-22 Identities = 58/148 (39%), Positives = 82/148 (55%), Gaps = 2/148 (1%) Frame = +1 Query: 28 AYQWIMKHGLPTEEDYGGYLGQDGY-CHIDNVTAITKITGWVNVTTNNENALKLALFKHG 204 AY +++ + L + + Y Y D C + A+ I+ + V NE AL A+ G Sbjct: 190 AYDYVINNALESSDTYP-YTSVDTQPCFYEKNLAMAGISDYRFVPAGNEQALADAVATVG 248 Query: 205 PISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNM 384 P+SVAIDA + +F FYS+G+Y E C + L+HAVL VGYG G YW++KNSW Sbjct: 249 PVSVAIDADNPSFLFYSSGIYKESNCNP--NNLNHAVLVVGYGSEEGTDYWIIKNSWGTG 306 Query: 385 WGNDGYVLMSMR-ENNCGVQSAPTYVLI 465 WG GY+ M +N CG+ S Y +I Sbjct: 307 WGEGGYMRMIRNGKNTCGIASYALYPII 334 >UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; n=35; Fasciola|Rep: Cathepsin L-like proteinase precursor - Fasciola hepatica (Liver fluke) Length = 326 Score = 107 bits (258), Expect = 1e-22 Identities = 56/153 (36%), Positives = 80/153 (52%), Gaps = 1/153 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG AYQ++ + GL TE Y Y +G C + + K+TG+ V + +E LK Sbjct: 174 
GGLMENAYQYLKQFGLETESSYP-YTAVEGQCRYNKQLGVAKVTGYYTVHSGSEVELKNL 232 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 + P +VA+D F Y +G+Y C ++HAVLAVGYG G YW+VKN Sbjct: 233 VGARRPAAVAVDV-ESDFMMYRSGIYQSQTCSPL--RVNHAVLAVGYGTQGGTDYWIVKN 289 Query: 370 SWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465 SW WG GY+ M+ R N CG+ S + ++ Sbjct: 290 SWGTYWGERGYIRMARNRGNMCGIASLASLPMV 322 >UniRef50_Q7KWP5 Cluster: Similar to Dictyostelium discoideum (Slime mold). Cysteine proteinase 5; n=2; Dictyostelium discoideum|Rep: Similar to Dictyostelium discoideum (Slime mold). Cysteine proteinase 5 - Dictyostelium discoideum (Slime mold) Length = 345 Score = 107 bits (257), Expect = 1e-22 Identities = 54/157 (34%), Positives = 97/157 (61%), Gaps = 11/157 (7%) Frame = +1 Query: 28 AYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHG 204 A+Q+I+++G + +EE Y G+ G C ++ ++ KIT + V + +E++L+ A+ Sbjct: 192 AFQYIIENGGIDSEESYKFSGGEPGKCKYNSSNSVAKITSYEKVKSGSESSLESAVSLK- 250 Query: 205 PISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGY---------GVLNGHKYW 357 P++ IDA+ +F FYS+G+Y+EP C N D L+H++L VG+ + + YW Sbjct: 251 PVAAYIDASLSSFQFYSSGIYYEPSC-NSTD-LNHSILIVGFSDFSTTPTDSLKHSSNYW 308 Query: 358 LVKNSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465 +V+NS+ WG +GY+ MS R++NCG+ +YV++ Sbjct: 309 IVQNSFGKNWGENGYIFMSKDRDDNCGISKMASYVIV 345 >UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviatus|Rep: Cathepsin L 2 - Diaprepes abbreviatus (Sugarcane rootstalk borer weevil) Length = 348 Score = 107 bits (257), Expect = 1e-22 Identities = 59/151 (39%), Positives = 86/151 (56%), Gaps = 2/151 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG A+ +I ++G + TE+ Y Y +DG C ++ + V EN L Sbjct: 201 GGWMHWAFGYIKENGGIDTEQSYP-YTAKDGRCAYKPGNKAATVSQVIMVP-RGENQLAA 258 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 + GPIS+A + +HK F FY +GVY EP+C + L+HA+LAVGYG + G +WLVK Sbjct: 259 KVSSVGPISIAAEVSHK-FQFYHSGVYDEPQCGHS---LNHAMLAVGYGSMGGKNFWLVK 314 Query: 367 
NSWSNMWGNDGYVLMSMRENN-CGVQSAPTY 456 NSW WG+ GY+ M+ +NN CG+ +Y Sbjct: 315 NSWGTGWGDQGYIRMAKDKNNQCGIALMASY 345 >UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4; core eudicotyledons|Rep: Papain-like cysteine peptidase XBCP3 - Arabidopsis thaliana (Mouse-ear cress) Length = 437 Score = 106 bits (255), Expect = 3e-22 Identities = 61/156 (39%), Positives = 87/156 (55%), Gaps = 6/156 (3%) Frame = +1 Query: 7 GGGEDFRAYQWIMK-HGLPTEEDYGGYLGQDGYCHIDNVTA-ITKITGWVNVTTNNENAL 180 GG D+ A+++++K HG+ TE+DY Y +DG C D + + I + V +N+E AL Sbjct: 183 GGLMDY-AFEFVIKNHGIDTEKDYP-YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKAL 240 Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360 A+ P+SV I + + F YS+G++ P C LDHAVL VGYG NG YW+ Sbjct: 241 MEAVAAQ-PVSVGICGSERAFQLYSSGIFSGP-CSTS---LDHAVLIVGYGSQNGVDYWI 295 Query: 361 VKNSWSNMWGNDGYVLMSMRENN----CGVQSAPTY 456 VKNSW WG DG++ M N CG+ +Y Sbjct: 296 VKNSWGKSWGMDGFMHMQRNTENSDGVCGINMLASY 331 >UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2); n=2; Tribolium castaneum|Rep: PREDICTED: similar to Cathepsin K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2) - Tribolium castaneum Length = 332 Score = 105 bits (253), Expect = 5e-22 Identities = 60/153 (39%), Positives = 82/153 (53%), Gaps = 3/153 (1%) Frame = +1 Query: 7 GGGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNN-ENAL 180 GGG AY +I ++ G+ DY YLG++G C + I + + NN E + Sbjct: 181 GGGWIPTAYSYIARNKGVNYNRDYP-YLGRNGKCRYRSSKPHIAIRSYAAINNNNNEERV 239 Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360 + + GP+SVAI +TF Y +GVY P C+ L+HAV+ VGYG G YWL Sbjct: 240 RRLVATKGPVSVAIHVDSRTFHKYKSGVYNNPSCRGG---LNHAVVIVGYGRERGVDYWL 296 Query: 361 VKNSWSNMWGNDGYVLMSM-RENNCGVQSAPTY 456 VKNSW WG GYV M+ R N CG+ + +Y Sbjct: 297 VKNSWGAGWGQKGYVKMARNRRNQCGIATHASY 329 >UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n=1; Rattus norvegicus|Rep: UPI0000501FDB UniRef100 entry - Rattus norvegicus 
Length = 338 Score = 105 bits (253), Expect = 5e-22 Identities = 57/158 (36%), Positives = 88/158 (55%), Gaps = 6/158 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG + A+Q+++++G L +E Y Y G++G C N + KIT NE+ L Sbjct: 187 GGTTYNAFQYVLQNGGLESEATYP-YEGKEGLCRY-NPNSSAKITXICAPPQKNEDVLMD 244 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV----LNGHKY 354 A+ P++ I H + FY G+Y EPKC N V+ HAVL VGYG +G+ Y Sbjct: 245 AVATK-PVAAGIHVVHSSLRFYKKGIYHEPKCNNYVN---HAVLVVGYGFEGNETDGNNY 300 Query: 355 WLVKNSWSNMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465 WL++NSW WG +GY+ ++ R N+CG+ + Y ++ Sbjct: 301 WLIQNSWGERWGLNGYMKIAKDRNNHCGIATFAQYPIV 338 >UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Toxopain-2 - Toxoplasma gondii Length = 422 Score = 105 bits (252), Expect = 6e-22 Identities = 59/148 (39%), Positives = 81/148 (54%), Gaps = 5/148 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GGE A+Q+++ G ED YL +D C + + KI G+ +V +E A+K A Sbjct: 271 GGEMNDAFQYVLDSGGICSEDAYPYLARDEECRAQSCEKVVKILGFKDVPRRSEAAMKAA 330 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHK--YWLV 363 L K P+S+AI+A F FY GV F+ C +LDH VL VGYG K +W++ Sbjct: 331 LAK-SPVSIAIEADQMPFQFYHEGV-FDASCGT---DLDHGVLLVGYGTDKESKKDFWIM 385 Query: 364 KNSWSNMWGNDGYVLMSM---RENNCGV 438 KNSW WG DGY+ M+M E CG+ Sbjct: 386 KNSWGTGWGRDGYMYMAMHKGEEGQCGL 413 >UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteinase B; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like tick cysteine proteinase B - Haemaphysalis longicornis (Bush tick) Length = 332 Score = 104 bits (250), Expect = 1e-21 Identities = 46/90 (51%), Positives = 64/90 (71%), Gaps = 2/90 (2%) Frame = +1 Query: 202 GPISVAIDAAHKTFS-FYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWS 378 GP+SVAIDA + S FYS G+Y EP+C + ++LDH VL VGYG +G YWLVKNSW Sbjct: 245 GPVSVAIDAQPTSHSQFYSEGIYDEPECSS--EQLDHGVLVVGYGTKDGKDYWLVKNSWG 302 Query: 379 NMWGNDGYVLMSM-RENNCGVQSAPTYVLI 465 WG++GY+ M+ ++N CG+ S+ 
+Y L+ Sbjct: 303 TTWGDEGYIYMTRNQDNQCGIASSASYPLV 332 >UniRef50_P25773 Cluster: Cathepsin L; n=38; Eukaryota|Rep: Cathepsin L - Felis silvestris catus (Cat) Length = 139 Score = 104 bits (249), Expect = 1e-21 Identities = 54/140 (38%), Positives = 79/140 (56%), Gaps = 5/140 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG A+Q++ +G L +EE Y Y Q C ++ +T + ++ + EN L + Sbjct: 1 GGLIDDAFQYVKDNGGLDSEESYP-YHAQGDSCKYRPENSVANVTDYWDIPSK-ENELMI 58 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV----LNGHKY 354 L GPIS AIDA+ TF FY G+Y++P C + +++DH VL VGYG KY Sbjct: 59 TLAAVGPISAAIDASLDTFRFYKEGIYYDPSCSS--EDVDHGVLVVGYGADGTETENKKY 116 Query: 355 WLVKNSWSNMWGNDGYVLMS 414 W++KNSW WG DGY+ M+ Sbjct: 117 WIIKNSWGTDWGMDGYIKMA 136 >UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin heavy chain; n=3; Amniota|Rep: PREDICTED: similar to ferritin heavy chain - Ornithorhynchus anatinus Length = 338 Score = 103 bits (248), Expect = 2e-21 Identities = 58/158 (36%), Positives = 87/158 (55%), Gaps = 6/158 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGY-CHIDNVTAITKITGWVNVTTNNENALKL 186 GG+ A++++ +G ED YLG+D C T ++ V +NE AL+ Sbjct: 186 GGQYIGAFEYVRANGGIDAEDLYPYLGRDDISCRYSLQGKAGNCTSYMVVDQDNEQALEQ 245 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN----GHKY 354 A+ GP+SVA+DA + F FY +G++ C KV+ HA+LAVGYG G Y Sbjct: 246 AVATVGPVSVAVDA--RPFFFYHSGIFSSHSCTQKVN---HAMLAVGYGTSKEPGGGQDY 300 Query: 355 WLVKNSWSNMWGNDGYV-LMSMRENNCGVQSAPTYVLI 465 W++KNSWS WG GY+ L+ N+CGV S ++ ++ Sbjct: 301 WILKNSWSERWGEQGYMRLLKGANNHCGVASVASFPVL 338 >UniRef50_UPI0000E224BB Cluster: PREDICTED: hypothetical protein; n=1; Pan troglodytes|Rep: PREDICTED: hypothetical protein - Pan troglodytes Length = 143 Score = 103 bits (247), Expect = 2e-21 Identities = 47/90 (52%), Positives = 61/90 (67%), Gaps = 5/90 (5%) Frame = +1 Query: 202 GPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV----LNGHKYWLVKN 369 GPISVA+ A+H +F FY G+YFEP+C + LDHA+L VGY + +KYWLVKN 
Sbjct: 53 GPISVAVGASHVSFQFYKKGIYFEPRC--DPEGLDHAMLVVGYSYEGADSDNNKYWLVKN 110 Query: 370 SWSNMWGNDGYVLMSM-RENNCGVQSAPTY 456 SW WG DGY+ M+ R NNCG+ +A +Y Sbjct: 111 SWGKNWGMDGYIKMAKDRRNNCGIATAASY 140 >UniRef50_Q22AB1 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 344 Score = 103 bits (247), Expect = 2e-21 Identities = 52/146 (35%), Positives = 79/146 (54%), Gaps = 1/146 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG AYQ++ + G + T + YG Y + C+ D K+ W + NE ++ Sbjct: 195 GGLMTDAYQFLQQSGGIQTADTYGDYKNKKDICNFDKAKVKAKVVDWYQIP-ENEETIRR 253 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 L K+GP++V I+A +T FY G+ +PK N D+++HAVL VGYGV G YWL+K Sbjct: 254 ELVKNGPVAVGINA--RTLQFYEGGIV-DPK--NCDDKINHAVLIVGYGVEEGIPYWLIK 308 Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQS 444 N W WG G+ + + CG+ + Sbjct: 309 NQWGAEWGIKGFFKLIRGKKQCGIHT 334 >UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H-like cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L, S or H-like cysteine peptidase - Trichomonas vaginalis G3 Length = 473 Score = 102 bits (245), Expect = 4e-21 Identities = 53/150 (35%), Positives = 88/150 (58%), Gaps = 4/150 (2%) Frame = +1 Query: 1 ARGGGEDFRAYQWIMKHG--LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNEN 174 A GGGE A++ ++ L E+DY Y+G GYC+ + + ++ + + + + Sbjct: 315 ACGGGEAGPAFRSLINQNFKLFLEKDYP-YIGVAGYCNRNPEHPVARVVDCIAIDKSTQ- 372 Query: 175 ALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKY 354 ALK AL+++GP S+ I+ ++ SFY+ G +P C D+L H VL G+ +++G + Sbjct: 373 ALKEALYQYGPASIGINVI-ESMSFYTKGAVNDPTCTGAADDLVHEVLLTGWKIVDGIEC 431 Query: 355 WLVKNSWSNMWGNDGYVLMSM--RENNCGV 438 W +KNSWS WGN+GY+ + +E NCGV Sbjct: 432 WEIKNSWSTHWGNEGYIYIQAENQEYNCGV 461 >UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaster|Rep: CG11459-PA - Drosophila 
melanogaster (Fruit fly) Length = 336 Score = 102 bits (244), Expect = 6e-21 Identities = 53/151 (35%), Positives = 81/151 (53%), Gaps = 2/151 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG A+ + HG+ T+E Y Y G C + + ++G+V + +E L Sbjct: 184 GGWVSVAFNYTRDHGIATKESYP-YEPVSGECLWKSDRSAGTLSGYVTLGNYDERELAEV 242 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLVK 366 ++ GP++V+ID H+ F YS GV P C++K +L H+VL VG+G YW++K Sbjct: 243 VYNIGPVAVSIDHLHEEFDQYSGGVLSIPACRSKRQDLTHSVLLVGFGTHRKWGDYWIIK 302 Query: 367 NSWSNMWGNDGYVLMSMRENN-CGVQSAPTY 456 NS+ WG GY+ ++ NN CGV S P Y Sbjct: 303 NSYGTDWGESGYLKLARNANNMCGVASLPQY 333 >UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n=3; Brugia malayi|Rep: Cathepsin L-like cysteine proteinase - Brugia malayi (Filarial nematode worm) Length = 353 Score = 102 bits (244), Expect = 6e-21 Identities = 48/145 (33%), Positives = 81/145 (55%), Gaps = 2/145 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITK--ITGWVNVTTNNENALK 183 GG + ++W+ +HG+ T++ Y Y D N K + + +NE LK Sbjct: 200 GGNEPAVFRWVAEHGVKTDKSYP-YKENDSVSCPRNTPQRRKYGLADAFYLPPSNEQILK 258 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363 L +GP+ V++ ++ ++F Y +G+Y +PKC ++++HAV+AVGYGV NG +Y+++ Sbjct: 259 KILALYGPVCVSLHSSLQSFVAYRSGIYNDPKCPTNAEKVNHAVIAVGYGVQNGMEYFII 318 Query: 364 KNSWSNMWGNDGYVLMSMRENNCGV 438 KNSW WG GY + CG+ Sbjct: 319 KNSWGPTWGQKGYGRIRAGVFMCGI 343 >UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia theta|Rep: Cathepsin H precursor - Guillardia theta (Cryptomonas phi) Length = 353 Score = 101 bits (242), Expect = 1e-20 Identities = 57/161 (35%), Positives = 86/161 (53%), Gaps = 12/161 (7%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHI-------DNV----TAITKITGWVN 153 GG +A+++IM +G L E+Y Y+ DG+C++ D V + K++ N Sbjct: 189 GGLPSQAFEYIMYNGGLSKMEEYP-YVCGDGHCNVTGGPCAFDPVGKPWSVGAKVSKVAN 247 Query: 154 
VTTNNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYG 333 T +E ++K + H PISVA + YS+GVY P C D+++HAVLAVGYG Sbjct: 248 FTPGDEISMKTVVGSHNPISVAFEVV-ADLRHYSSGVYSSPTCVGTPDKVNHAVLAVGYG 306 Query: 334 VLNGHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTY 456 G YW +KNSW WG++GY + N CG+ ++ Sbjct: 307 TEGGIPYWTIKNSWGFAWGDNGYFKIQRGSNKCGISVCASF 347 >UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2; Entamoeba|Rep: Cysteine proteinase ACP1 precursor - Entamoeba histolytica Length = 308 Score = 101 bits (242), Expect = 1e-20 Identities = 52/152 (34%), Positives = 85/152 (55%), Gaps = 3/152 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG + ++I ++ GL E DY Y G C V + +TG VT +E L+ Sbjct: 155 GGHPSNSLKFIQENNGLGLESDYP-YKAVAGTCK--KVKNVATVTGSRRVTDGSETGLQT 211 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNG-VYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363 + ++GP++V +DA+ +F Y G +Y + KC++++ ++H V AVGYG + KYW++ Sbjct: 212 IIAENGPVAVGMDASRPSFQLYKKGTIYSDTKCRSRM--MNHCVTAVGYGSNSNGKYWII 269 Query: 364 KNSWSNMWGNDGYVLMSMRENN-CGVQSAPTY 456 +NSW WG+ GY L++ NN CG+ Y Sbjct: 270 RNSWGTSWGDAGYFLLARDSNNMCGIGRDSNY 301 >UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endopterygota|Rep: Cathepsin L-like proteinase - Bombyx mori (Silk moth) Length = 402 Score = 101 bits (241), Expect = 1e-20 Identities = 56/150 (37%), Positives = 81/150 (54%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG A ++ + GL E Y Y+G+ GYC D+ + W + + +E A++ A Sbjct: 259 GGSLRGALRYAAREGLVMESHYP-YVGKKGYCRYDSNLVRARPRRWATLPSGDEEAMEKA 317 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 L GP++VA++AA TF YS GVY +P C + L+HA+L VGY YW++ N Sbjct: 318 LATVGPLAVAVNAAPFTFQLYS-GVYDDPFCVSW--HLNHAMLLVGY----TQDYWILLN 370 Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTYV 459 W WG DGY+ + N CGV + TYV Sbjct: 371 WWGRNWGEDGYMRIRRGLNRCGVANMATYV 400 >UniRef50_Q23TW3 Cluster: Papain family cysteine protease containing protein; n=5; Tetrahymena thermophila 
SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 437 Score = 100 bits (240), Expect = 2e-20 Identities = 49/153 (32%), Positives = 82/153 (53%), Gaps = 1/153 (0%) Frame = +1 Query: 10 GGEDFRAYQWIM-KHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG + ++++ G+ E DY Y G+D C ++ + ++ N+T +EN L Sbjct: 273 GGLPSKGFEYLAYAGGIQNEADYP-YEGEDKNCRFNSSKTVVQVQKSYNITFQDENELIY 331 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 L +GP+++A + F Y NGV+ C ++++HAVLAVGY + KY++ K Sbjct: 332 HLANYGPVTIAYQV-NSDFDNYKNGVFTSSNCSKDPEDVNHAVLAVGYNMTG--KYFIAK 388 Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 NSW N WG +GY + + N CG+ +Y +I Sbjct: 389 NSWGNDWGMNGYFYIELGSNMCGLADCASYPII 421 >UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase) (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1; Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase) (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1; Vignain-2] - Vigna mungo (Rice bean) (Black gram) Length = 362 Score = 100 bits (240), Expect = 2e-20 Identities = 63/156 (40%), Positives = 83/156 (53%), Gaps = 7/156 (4%) Frame = +1 Query: 10 GGEDFRAYQWI-MKHGLPTEEDYGGYLGQDGYCHIDNVTAIT-KITGWVNVTTNNENALK 183 GG A+++I K G+ TE +Y Y Q+G C V + I G NV N+ENAL Sbjct: 193 GGLMESAFEFIKQKGGITTESNYP-YTAQEGTCDESKVNDLAVSIDGHENVPVNDENALL 251 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWL 360 A+ P+SVAIDA F FYS GV F C +L+H V VGYG ++G YW+ Sbjct: 252 KAVANQ-PVSVAIDAGGSDFQFYSEGV-FTGDCNT---DLNHGVAIVGYGTTVDGTNYWI 306 Query: 361 VKNSWSNMWGNDGYVLM----SMRENNCGVQSAPTY 456 V+NSW WG GY+ M S +E CG+ +Y Sbjct: 307 VRNSWGPEWGEQGYIRMQRNISKKEGLCGIAMMASY 342 >UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 306 Score 
= 100 bits (239), Expect = 2e-20 Identities = 53/132 (40%), Positives = 71/132 (53%), Gaps = 4/132 (3%) Frame = +1 Query: 82 YLGQDGYCHIDNVTAITKITGWVN---VTTNNENALKLALFKHGPISVAIDAAHKTFSFY 252 Y G C DN A K G + V+ +E L A+ +GP ++IDA+ +F Y Sbjct: 178 YTAVQGTCKYDNKKA--KYFGMLELAGVSRKSETELAKAVATYGPAMISIDASQHSFMLY 235 Query: 253 SNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYVLMSMRENN- 429 G+Y EPKC ++LDHAV VGYGV YW+V+NSW +WG GYV M +NN Sbjct: 236 KEGIYDEPKCSE--EDLDHAVGCVGYGVEGEKDYWIVRNSWGEVWGEKGYVRMIRNKNNQ 293 Query: 430 CGVQSAPTYVLI 465 CGV + V + Sbjct: 294 CGVATEAYNVFV 305 >UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa (Rice) Length = 339 Score = 99 bits (238), Expect = 3e-20 Identities = 60/154 (38%), Positives = 86/154 (55%), Gaps = 5/154 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG A+++I+K+G T E Y DG C+ + +A T I G+ +V NNE AL A Sbjct: 189 GGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSNSAAT-IKGYEDVPANNEAALMKA 247 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL-NGHKYWLVK 366 + P+SVA+D TF FYS GV C +LDH ++A+GYG +G +YWL+K Sbjct: 248 VANQ-PVSVAVDGGDMTFQFYSGGV-MTGSCGT---DLDHGIVAIGYGKDGDGTQYWLLK 302 Query: 367 NSWSNMWGNDGYVLM----SMRENNCGVQSAPTY 456 NSW WG +G++ M S + CG+ P+Y Sbjct: 303 NSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSY 336 >UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays (Maize) Length = 493 Score = 99 bits (238), Expect = 3e-20 Identities = 58/155 (37%), Positives = 84/155 (54%), Gaps = 6/155 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHID-NVTAITKITGWVNVTTNNENALK 183 GG A+ +++K+G + TE DY + G DG C + T + I + V N E AL+ Sbjct: 229 GGLMDNAFVFMIKNGGIDTEADYP-FTGHDGTCDLKLKNTRVVSIDSFERVPINYERALQ 287 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363 A+ H P+S +I+A+ + F YS+G+ F+ +C LDH V VGYG G YW+V Sbjct: 288 KAV-AHQPVSASIEASRRAFQLYSSGI-FDGRCGTY---LDHGVTVVGYGSEGGKDYWIV 342 
Query: 364 KNSWSNMWGNDGYVLMS----MRENNCGVQSAPTY 456 KNSW WG GYV M+ +R + G+ P Y Sbjct: 343 KNSWGTQWGEAGYVRMARNVRVRPPSAGIAMEPLY 377 >UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegleria fowleri|Rep: Cysteine proteinase homolog - Naegleria fowleri Length = 347 Score = 99 bits (238), Expect = 3e-20 Identities = 57/158 (36%), Positives = 89/158 (56%), Gaps = 6/158 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG + A+Q+++K+G L TE+ Y Y G D C + I+ W +++++ EN + Sbjct: 196 GGLMWSAFQYVIKNGGLDTEDSYP-YEGVDDTCRFNKSNVAATISSWTSISSD-ENQMAA 253 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNG-----HK 351 L +GPIS+AI+A + +Y++G+ +P N D LDH VL VGYGV Sbjct: 254 WLAANGPISIAINA--EWLQYYTSGIS-DPWFCNPQD-LDHGVLIVGYGVGKSWLGSEEN 309 Query: 352 YWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 YW+VKNSW + WG DGY + + CG+ S P+ ++ Sbjct: 310 YWIVKNSWGSDWGEDGYFRIIRGKGKCGLNSVPSSSIV 347 >UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; n=6; Chrysomelinae|Rep: Cathepsin L-like proteinase precursor - Phaedon cochleariae (Mustard beetle) Length = 324 Score = 99 bits (238), Expect = 3e-20 Identities = 51/154 (33%), Positives = 85/154 (55%), Gaps = 2/154 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHI-DNVTAITKITGWVNVTTNNENALKL 186 GG ++++ +GL ++ DY Y G++ C D ++ ++TG+ VT + E +LK Sbjct: 176 GGFAVNGFEYVKDNGLESDADYP-YSGKEDKCKANDKSRSVVELTGYKKVTAS-ETSLKE 233 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 A+ GPIS + K Y G++ + C D L H V VGYG+ NG KYW++K Sbjct: 234 AVGTIGPISAVVFG--KPMKSYGGGIFDDSSCLG--DNLHHGVNVVGYGIENGQKYWIIK 289 Query: 367 NSWSNMWGNDGYV-LMSMRENNCGVQSAPTYVLI 465 N+W WG GY+ L+ +++CGV+ +Y ++ Sbjct: 290 NTWGADWGESGYIRLIRDTDHSCGVEKMASYPIL 323 >UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 339 Score = 99.5 bits (237), Expect = 4e-20 Identities = 57/162 (35%), 
Positives = 94/162 (58%), Gaps = 10/162 (6%) Frame = +1 Query: 10 GGEDFRAYQWIMKH-GLPTEEDYG--GYLGQD----GYCHIDNVTAITKITGWVNVTTNN 168 GG A+ +I+K G+ +E +Y GYL + G C ++ + I+ ++ + N Sbjct: 181 GGLALIAFDYIIKQKGIDSEFNYPYEGYLIEPYEGRGRCRYNSFYSKASISSYIEIERFN 240 Query: 169 ENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL--N 342 EN L +L K P+SV IDA+ +F Y +GVY +P C + + L+H +L +G+GV N Sbjct: 241 ENELTQSLIK-SPVSVMIDASQLSFMLYKSGVYKDPSCSSTI--LNHGILNIGFGVTPEN 297 Query: 343 GHKYWLVKNSWSNMWGNDGYVLMSMRENN-CGVQSAPTYVLI 465 G++Y+++KNS+ + WG GY+ +S NN CG+ S V+I Sbjct: 298 GNEYYILKNSFGSKWGMKGYIYLSRNFNNHCGISSVGISVVI 339 >UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; n=23; Magnoliophyta|Rep: Senescence-specific cysteine protease - Arabidopsis thaliana (Mouse-ear cress) Length = 346 Score = 99.1 bits (236), Expect = 5e-20 Identities = 58/144 (40%), Positives = 78/144 (54%), Gaps = 6/144 (4%) Frame = +1 Query: 52 GLPTEEDYGGYLGQDGYCHIDNVTA-ITKITGWVNVTTNNENALKLALFKHGPISVAIDA 228 GL TE +Y Y G+D C+ T ITG+ +V N+E AL A+ H P+SV I+ Sbjct: 209 GLTTESNYP-YKGEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAV-AHQPVSVGIEG 266 Query: 229 AHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLVKNSWSNMWGNDGYV 405 F FYS+GV F +C LDHAV A+GYG NG KYW++KNSW WG GY+ Sbjct: 267 GGFDFQFYSSGV-FTGECTTY---LDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGYM 322 Query: 406 LMSM----RENNCGVQSAPTYVLI 465 + ++ CG+ +Y I Sbjct: 323 RIQKDVKDKQGLCGLAMKASYPTI 346 >UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster|Rep: CG5367-PA - Drosophila melanogaster (Fruit fly) Length = 338 Score = 99.1 bits (236), Expect = 5e-20 Identities = 41/138 (29%), Positives = 79/138 (57%) Frame = +1 Query: 52 GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAA 231 G+ ++DY Y+ + G C ++ +T W + +E A++ A+ GP++++I+A+ Sbjct: 208 GIMRDQDYP-YVARKGKCQFVPDLSVVNVTSWAILPVRDEQAIQAAVTHIGPVAISINAS 266 Query: 232 HKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYVLM 411 KTF YS+G+Y +P C + ++HA++ +G+ G YW++KN W WG +GY+ + Sbjct: 267 
PKTFQLYSDGIYDDPLCSSA--SVNHAMVVIGF----GKDYWILKNWWGQNWGENGYIRI 320 Query: 412 SMRENNCGVQSAPTYVLI 465 N CG+ + Y ++ Sbjct: 321 RKGVNMCGIANYAAYAIV 338 >UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomuscorum|Rep: Cathepsin H - Oxytricha trifallax (Sterkiella histriomuscorum) Length = 366 Score = 99.1 bits (236), Expect = 5e-20 Identities = 56/151 (37%), Positives = 79/151 (52%), Gaps = 2/151 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG A+++I +G L E Y Y +G C I I G + NE+ LK Sbjct: 201 GGLPSHAFEYIKDNGGLALETTYP-YKAANGQCSIQKGQQSVGIRGGAVNISLNEDDLKQ 259 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLV 363 A++ HGP+SVA F Y +GVY C N ++++HAVLAVG+G N YW++ Sbjct: 260 AIYLHGPVSVAFRVIDG-FRDYKSGVYAVEGCANGPNDVNHAVLAVGFGTDENKVDYWII 318 Query: 364 KNSWSNMWGNDGYVLMSMRENNCGVQSAPTY 456 KNSW WG+ G+ M N CG+Q+ +Y Sbjct: 319 KNSWGAAWGDQGFFKMKRGVNMCGIQNCNSY 349 >UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1 precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 355 Score = 98.7 bits (235), Expect = 7e-20 Identities = 63/157 (40%), Positives = 89/157 (56%), Gaps = 7/157 (4%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHI--DNVTAITKITGWVNVTTNNENA 177 GG D+ A+Q+I+ G L E+DY YL ++G C ++V +T I+G+ +V N++ + Sbjct: 202 GGLMDY-AFQYIISTGGLHKEDDYP-YLMEEGICQEQKEDVERVT-ISGYEDVPENDDES 258 Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357 L AL H P+SVAI+A+ + F FY GV F KC +LDH V AVGYG G Y Sbjct: 259 LVKAL-AHQPVSVAIEASGRDFQFYKGGV-FNGKCGT---DLDHGVAAVGYGSSKGSDYV 313 Query: 358 LVKNSWSNMWGNDGYVLMSMR----ENNCGVQSAPTY 456 +VKNSW WG G++ M E CG+ +Y Sbjct: 314 IVKNSWGPRWGEKGFIRMKRNTGKPEGLCGINKMASY 350 >UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Rep: Cathepsin L - Stylonychia lemnae Length = 340 Score = 98.3 bits (234), Expect = 9e-20 Identities = 56/151 (37%), Positives = 85/151 (56%), Gaps = 4/151 (2%) Frame = 
+1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG+ A +I G + TE+DY Y+G+D C + + G +N+ L+ Sbjct: 190 GGDMGLAMDYIASAGGVETEKDYP-YVGKDQTCAFEASKEVATDKGHINIVPGKFATLQA 248 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 A+ + GP+SVAI+A F FY +G++ C LDH V AVGYGV NG +Y++V+ Sbjct: 249 AIAE-GPVSVAIEADSLFFQFYRSGIFDSSWCGTN---LDHGVAAVGYGVDNGKQYYIVR 304 Query: 367 NSWSNMWGNDGYV-LMSMRENN--CGVQSAP 450 NSWS+ WG GY+ +++ + N CG+Q P Sbjct: 305 NSWSDSWGLKGYINIIANGDGNGMCGIQMEP 335 >UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Oryza sativa|Rep: Cysteine protease 1 precursor - Oryza sativa subsp. japonica (Rice) Length = 490 Score = 98.3 bits (234), Expect = 9e-20 Identities = 61/157 (38%), Positives = 86/157 (54%), Gaps = 8/157 (5%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTA-ITKITGWVNVTTNNENALK 183 GG A+ +I ++G L TEEDY Y DG C++ + + I G+ +V N+E +L+ Sbjct: 222 GGIMDDAFAFIARNGGLDTEEDYP-YTAMDGKCNLAKRSRKVVSIDGFEDVPENDELSLQ 280 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV--LNGHKYW 357 A+ H P+SVAIDA + F Y +GV F +C LDH V+AVGYG G YW Sbjct: 281 KAV-AHQPVSVAIDAGGREFQLYDSGV-FTGRCGTN---LDHGVVAVGYGTDAATGAAYW 335 Query: 358 LVKNSWSNMWGNDGYVLM----SMRENNCGVQSAPTY 456 V+NSW WG +GY+ M + R CG+ +Y Sbjct: 336 TVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASY 372 >UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to CG5367-PA - Nasonia vitripennis Length = 362 Score = 97.9 bits (233), Expect = 1e-19 Identities = 46/138 (33%), Positives = 73/138 (52%) Frame = +1 Query: 52 GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAA 231 GL T+ Y Y G C ++ +T W + +E AL+ A+ GPI+ +I+A Sbjct: 232 GLMTDATYP-YTAHQGVCKFQRKLSVVNVTSWAILPARDERALEAAVATIGPIAASINAG 290 Query: 232 HKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYVLM 411 +TF Y +G+Y +P C + D ++HA+L VGY YW++KN W WG +GY+ + Sbjct: 291 PRTFQLYHSGIYDDPTCSS--DLVNHAMLIVGY----TPNYWILKNWWGASWGENGYMRL 344 
Query: 412 SMRENNCGVQSAPTYVLI 465 +N CGV + Y + Sbjct: 345 RKGKNRCGVANYAAYAKV 362 >UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Actinidin Act3a - Actinidia eriantha Length = 380 Score = 97.9 bits (233), Expect = 1e-19 Identities = 62/157 (39%), Positives = 86/157 (54%), Gaps = 6/157 (3%) Frame = +1 Query: 4 RGGGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHID--NVTAITKITGWVNVTTNNEN 174 +GG D AY++I+ +G + TEE+Y Y+GQD C N +T I + V N+E Sbjct: 191 KGGFMD-DAYEFIINNGGINTEENYP-YIGQDDQCDEPKKNQNYVT-IDSYEQVPPNDEL 247 Query: 175 ALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKY 354 A+K A+ + P+SVAIDA F FY +G++ C L+HAV +GYG NG Y Sbjct: 248 AMKRAV-AYQPVSVAIDAYCLGFRFYQSGIFTGGSCGTT---LNHAVTIIGYGTENGIDY 303 Query: 355 WLVKNSWSNMWGNDGYVLMSMR---ENNCGVQSAPTY 456 W+VKNS+ WG GY + E CG+ S P Y Sbjct: 304 WIVKNSYGTQWGESGYGKVQRNVGGEGRCGIASYPFY 340 >UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; n=16; Chrysomelidae|Rep: Digestive cysteine protease intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 326 Score = 97.9 bits (233), Expect = 1e-19 Identities = 56/157 (35%), Positives = 87/157 (55%), Gaps = 5/157 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG+ A++++ +G+ +E+ Y Y+ + C D I KI G+ NVTT+ E L+ A Sbjct: 177 GGDMSAAFEYVRDYGIQSEKSYP-YIRKQTECQYDASKTILKIKGYKNVTTSEEG-LRKA 234 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN---GH-KYW 357 + GPIS+A+++ Y +G+ C + +LDH VL VGYG + G K+W Sbjct: 235 VGAIGPISIAMNS--DPLQLYYSGIISGKGCSH---DLDHGVLVVGYGKASQWSGETKFW 289 Query: 358 LVKNSWSNMWGNDGYVLMSMRENN-CGVQSAPTYVLI 465 VKNSW +WG +GY + NN CG+ PTY ++ Sbjct: 290 RVKNSWGKIWGENGYFRIKRDANNLCGIADDPTYPVL 326 >UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba culbertsoni|Rep: Cysteine proteinase - Acanthamoeba culbertsoni Length = 482 Score = 97.1 bits (231), Expect = 2e-19 Identities = 53/149 (35%), Positives = 79/149 (53%), Gaps = 4/149 (2%) Frame = +1 Query: 10 
GGEDFRAYQWIMKHG--LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALK 183 GG Y+W++ + L T+ Y Y+ + C + I + V +E+ L Sbjct: 222 GGNVEITYRWMISNNARLMTQASYP-YIARQSTCRYVPSQGVQGIRNIMRVRAGSESDL- 279 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWL 360 LA P++VAID + ++F FYS G Y++P C + L+HAVL VG+G YW+ Sbjct: 280 LAKAAIAPVTVAIDGSKRSFMFYSGGYYYDPTCSST--NLNHAVLVVGWGTDPQRGDYWI 337 Query: 361 VKNSWSNMWGNDGYVLMSM-RENNCGVQS 444 KN W WG+DGYV M+ + NNCG+ S Sbjct: 338 AKNEWGTAWGDDGYVYMARNKNNNCGIAS 366 >UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1; Brugia malayi|Rep: Cathepsin F-like cysteine proteinase - Brugia malayi (Filarial nematode worm) Length = 461 Score = 97.1 bits (231), Expect = 2e-19 Identities = 49/152 (32%), Positives = 75/152 (49%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG A++ I + G ED Y ++G CH+ I V + NE +K Sbjct: 312 GGLPINAFREIKRMGGLEPEDQYPYEAKNGTCHLVRAQIAVSIDDAVEIP-RNETVMKAW 370 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 + + GP+SV IDA + S+Y +G+ K + +++H VL GYG+ N YW +KN Sbjct: 371 IAQRGPLSVGIDA--ELLSYYKSGILHPSKSRCPPSKINHGVLITGYGIENNLPYWTIKN 428 Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 SW WG +GY + +N CGV + +I Sbjct: 429 SWGEQWGENGYFQLMRGKNICGVSDLVSSAII 460 >UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine protease; n=11; Callosobruchus maculatus|Rep: Putative gut cathepsin L-like cysteine protease - Callosobruchus maculatus (Southern cowpea weevil) (Pulse bruchid) Length = 326 Score = 96.3 bits (229), Expect = 4e-19 Identities = 56/150 (37%), Positives = 78/150 (52%), Gaps = 1/150 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG +A+ ++ G+ TEE Y Y G+ C +TK+ +V E A +A Sbjct: 179 GGLMGQAFDFVQDEGIQTEESYP-YEGRRSSCKKSG-EYVTKVKTYVFPLDEQEMARTVA 236 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEP-KCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 GP++VAI+A+ SFY G+ E +C NK ++L+H VL VGYG NG YW+VK Sbjct: 237 
A--KGPVAVAIEASQ--LSFYDKGIVDERCRCSNKREDLNHGVLVVGYGSENGVDYWIVK 292 Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTY 456 NSW WG GY + CG+ TY Sbjct: 293 NSWGADWGEKGYFRLKKDVKACGIGYYNTY 322 >UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba histolytica|Rep: Cysteine protease 10 - Entamoeba histolytica Length = 297 Score = 95.9 bits (228), Expect = 5e-19 Identities = 47/122 (38%), Positives = 70/122 (57%), Gaps = 1/122 (0%) Frame = +1 Query: 28 AYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGP 207 ++ ++ HG+ E DY Y G+ C ID + KI + V E LK+A++ H P Sbjct: 182 SFNYVRDHGILLERDYP-YTGKANNCSIDGKKPVIKIKDYSFVFPQTEENLKIAVY-HQP 239 Query: 208 ISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHK-YWLVKNSWSNM 384 ++V+ID++ +F FY G+Y EP CK +DH V VGYG H+ +W+VKNS+ N Sbjct: 240 VAVSIDSSQLSFQFYEGGIYDEPNCK----WVDHIVTVVGYGTTEEHQDFWVVKNSYGNE 295 Query: 385 WG 390 WG Sbjct: 296 WG 297 >UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep: Silicatein beta - Suberites domuncula (Sponge) Length = 383 Score = 95.9 bits (228), Expect = 5e-19 Identities = 54/154 (35%), Positives = 83/154 (53%), Gaps = 6/154 (3%) Frame = +1 Query: 13 GEDFRAYQWIMKH-GLPTEEDY--GG--YLGQDGYCHIDNVTAITKITGWVNVTTNNENA 177 G+ RA +++++ G+ T + Y GG Y + C + G V++ + +EN Sbjct: 229 GDVNRALLYVIENDGVDTWKGYPSGGDPYRSKQYSCKYERQYRGASARGIVSLASGDENT 288 Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357 L A+ GP+SV +DA +F FYS+GV P C + L HA++ +GYG +G YW Sbjct: 289 LLTAVANSGPVSVYVDATSTSFQFYSDGVLNVPYCSSST--LSHALVVIGYGKYSGQDYW 346 Query: 358 LVKNSWSNMWGNDGY-VLMSMRENNCGVQSAPTY 456 LVKNSW WG GY L + N CG+ +A ++ Sbjct: 347 LVKNSWGPNWGVRGYGKLARNKGNKCGIATAASF 380 >UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 precursor; n=4; Schizophora|Rep: Putative cysteine proteinase CG12163 precursor - Drosophila melanogaster (Fruit fly) Length = 614 Score = 95.9 bits (228), Expect = 5e-19 Identities = 59/162 (36%), Positives = 79/162 (48%), Gaps = 7/162 (4%) Frame = +1 Query: 1 
ARGGGEDFRAYQWIMK-HGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENA 177 A GG AY+ I GL E +Y Y + CH + + ++ G+V++ NE A Sbjct: 455 ACNGGLMDNAYKAIKDIGGLEYEAEYP-YKAKKNQCHFNRTLSHVQVAGFVDLPKGNETA 513 Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL---NGH 348 ++ L +GPIS+ I+A FY GV K LDH VL VGYGV N H Sbjct: 514 MQEWLLANGPISIGINA--NAMQFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSDYPNFH 571 Query: 349 K---YWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 K YW+VKNSW WG GY + +N CGV T ++ Sbjct: 572 KTLPYWIVKNSWGPRWGEQGYYRVYRGDNTCGVSEMATSAVL 613 >UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative cysteine proteinase - Oryza sativa subsp. japonica (Rice) Length = 357 Score = 95.5 bits (227), Expect = 6e-19 Identities = 56/154 (36%), Positives = 80/154 (51%), Gaps = 6/154 (3%) Frame = +1 Query: 13 GEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 G+ A+++I +G + E DY G C I G+ V NNE AL LA Sbjct: 202 GDMDEAFRYITSNGGIAAESDYPYEDRALGTCRASGKPVAASIRGFQYVPPNNETALLLA 261 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLVK 366 + H P+SVA+D K F+S+GV+ + + +L+HA+ AVGYG +G KYWL+K Sbjct: 262 V-AHQPVSVALDGVGKVSQFFSSGVFGAMQNETCTTDLNHAMTAVGYGTDEHGTKYWLMK 320 Query: 367 NSWSNMWGNDGY--VLMSMRENN--CGVQSAPTY 456 NSW WG GY + + N CG+ P+Y Sbjct: 321 NSWGTDWGEGGYMKIARDVASNTGLCGLAMQPSY 354 >UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina). Cathepsin L; n=2; Dictyostelium discoideum|Rep: Similar to Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina). 
Cathepsin L - Dictyostelium discoideum (Slime mold) Length = 265 Score = 94.7 bits (225), Expect = 1e-18 Identities = 49/148 (33%), Positives = 80/148 (54%), Gaps = 2/148 (1%) Frame = +1 Query: 28 AYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGP 207 A+++I+ G E Y G+D C + K++G+V + +E+AL A+ +GP Sbjct: 119 AFKYIISSGGVNLESQYPYTGKDEVCKFNQSEKEAKVSGFVMIPKFDESALMEAIALYGP 178 Query: 208 ISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLVKNSWSNM 384 ++V ID + K F S G+Y+ C HAVLA+GYG NG Y+L+KNSW Sbjct: 179 VAVPIDTSTKEFQHLSGGIYYSDSCDPW--NTIHAVLAIGYGTDENGVDYFLMKNSWGKS 236 Query: 385 WGNDGYVLMSMR-ENNCGVQSAPTYVLI 465 WG +G+ + + CG+ +A +Y ++ Sbjct: 237 WGTNGFFKVKRGVKGKCGIVTAASYPIV 264 >UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromeliaceae|Rep: Fruit bromelain precursor - Ananas comosus (Pineapple) Length = 351 Score = 94.7 bits (225), Expect = 1e-18 Identities = 57/155 (36%), Positives = 87/155 (56%), Gaps = 6/155 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG +AY +I+ + G+ TEE+Y YL G C+ ++ ITG+ V N+E ++ Sbjct: 186 GGWVNKAYDFIISNNGVTTEENYP-YLAYQGTCNANSFPNSAYITGYSYVRRNDERSMMY 244 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLV 363 A+ + PI+ IDA+ + F +Y+ GV+ P C L+HA+ +GYG +G KYW+V Sbjct: 245 AV-SNQPIAALIDAS-ENFQYYNGGVFSGP-CGTS---LNHAITIIGYGQDSSGTKYWIV 298 Query: 364 KNSWSNMWGNDGYVLM----SMRENNCGVQSAPTY 456 +NSW + WG GYV M S CG+ AP + Sbjct: 299 RNSWGSSWGEGGYVRMARGVSSSSGVCGIAMAPLF 333 >UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryza sativa|Rep: Putative cysteine proteinase - Oryza sativa subsp. 
japonica (Rice) Length = 352 Score = 94.3 bits (224), Expect = 1e-18 Identities = 56/163 (34%), Positives = 81/163 (49%), Gaps = 11/163 (6%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAIT----KITGWVNVTTNNENA 177 GG A+Q++ G T E Y G G C D ++ + I+G+ V N+E + Sbjct: 192 GGSLDNAFQYMANSGGVTTEAAYAYQGAQGACQFDASSSASGVAATISGYQRVNPNDEGS 251 Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV----LNG 345 L A+ P+SVAI+ + F Y +GV+ C K LDHAV VGYG G Sbjct: 252 LAAAVASQ-PVSVAIEGSGAMFRHYGSGVFTADSCGTK---LDHAVAVVGYGAEADGSGG 307 Query: 346 HKYWLVKNSWSNMWGNDGYVLMSM---RENNCGVQSAPTYVLI 465 YW++KNSW WG+ GY+ + + CGV AP+Y ++ Sbjct: 308 GGYWIIKNSWGTTWGDGGYMKLEKDVGSQGACGVAMAPSYPVV 350 >UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2; Brugia malayi|Rep: Cahepsin L-like cysteine protease - Brugia malayi (Filarial nematode worm) Length = 371 Score = 93.9 bits (223), Expect = 2e-18 Identities = 61/163 (37%), Positives = 85/163 (52%), Gaps = 11/163 (6%) Frame = +1 Query: 10 GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG A+++++K+ G+ TE+ Y Y G C N T T + +E L+ Sbjct: 210 GGLMMEAFEYVVKNDGIDTEKSYP-YQGYQNTCRYSNSTRGTTAYAGKLLPEGDELQLQA 268 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYG-----VLNGHK 351 A+ GPISVA+DA K FY G++ KC + + HA+LAVGYG + NG K Sbjct: 269 AIATIGPISVAVDA--KLMKFYRRGIFSTSKCTTR---MGHALLAVGYGTEEVKLQNGTK 323 Query: 352 ----YWLVKNSWSNMWGNDGYV-LMSMRENNCGVQSAPTYVLI 465 YWL+KNSWS WG GY+ L +EN CG+ Y L+ Sbjct: 324 KSVDYWLLKNSWSKRWGIGGYLKLARNQENMCGIGFYACYPLV 366 >UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin L-like proteinase" precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 315 Score = 93.9 bits (223), Expect = 2e-18 Identities = 52/149 (34%), Positives = 80/149 (53%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG A+ ++ +HGL +E Y Y G+D C ++ I+G+V + T E+AL A Sbjct: 175 
GGLMTDAFNYVKRHGLSSESQYA-YTGRDDRCKNVENKPLSSISGYVELETT-EDALASA 232 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 + GP+S+A+DA T+ Y G++ C+ L+H VLAVGY ++VKN Sbjct: 233 VASVGPVSIAVDA--DTWQLYGGGLFNNKNCRTN---LNHGVLAVGYT----KDAFIVKN 283 Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTY 456 SW WG GY+ ++ EN CG+ +Y Sbjct: 284 SWGTSWGEQGYIRVARGENLCGINLMNSY 312 >UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dvir_CG5367 - Drosophila virilis (Fruit fly) Length = 298 Score = 93.9 bits (223), Expect = 2e-18 Identities = 43/138 (31%), Positives = 75/138 (54%) Frame = +1 Query: 52 GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAA 231 GL DY Y + G C + A+ +T W + +ENA++ A+ GP++V+I+A+ Sbjct: 168 GLMRSLDYK-YASKKGECQFVSELAVVNVTSWAILPAKDENAIQAAVAHIGPVAVSINAS 226 Query: 232 HKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYVLM 411 KTF YS G+Y + C + ++HA+L +G+ +W++KN W +WG G++ M Sbjct: 227 PKTFQLYSEGIYDDVSCTS--TSVNHAMLLIGF----DKNFWILKNWWGELWGEAGFMRM 280 Query: 412 SMRENNCGVQSAPTYVLI 465 N CG+ + Y ++ Sbjct: 281 RKGINLCGIANYAAYAIV 298 >UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted, possible transmembrane domain near N-terminus; n=4; Cryptosporidium|Rep: Cryptopain-cysteine proteinase secreted, possible transmembrane domain near N-terminus - Cryptosporidium parvum Iowa II Length = 401 Score = 93.5 bits (222), Expect = 3e-18 Identities = 62/161 (38%), Positives = 85/161 (52%), Gaps = 9/161 (5%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYC---HIDNVTAITKITGWVNVTTNNENA 177 GG A+Q+ +K+ L T +DY Y ++ C +N I + + V N NA Sbjct: 243 GGTMGLAFQYAIKNKYLCTNDDYP-YFAEEKTCMDSFCENYIEIP-VKAYKYVFPRNINA 300 Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV--LNGHK 351 LK AL K+GPISVAI A F FY +GV F+ C KV +H V+ VGY + + Sbjct: 301 LKTALAKYGPISVAIQADQTPFQFYKSGV-FDAPCGTKV---NHGVVLVGYDMDEDTNKE 356 Query: 352 YWLVKNSWSNMWGNDGYV---LMSMRENNCGVQSAPTYVLI 465 YWLV+NSW WG GY+ L S ++ CG+ P Y +I Sbjct: 357 
YWLVRNSWGEAWGEKGYIKLALHSGKKGTCGILVEPVYPVI 397 >UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foetus|Rep: TFCP2 protein - Tritrichomonas foetus (Trichomonas foetus) Length = 270 Score = 93.5 bits (222), Expect = 3e-18 Identities = 44/131 (33%), Positives = 67/131 (51%) Frame = +1 Query: 64 EEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAAHKTF 243 EE+Y Y G G C D + ++ I ++E LK + +GP+S +DA H +F Sbjct: 138 EENYQ-YSGHKGACLYDEKSKVSNIVAVTMFPQSDEQNLKGHIAANGPVSCNVDAGHYSF 196 Query: 244 SFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYVLMSMRE 423 Y G+Y+ C+ + +HA+ VGYGV +YW+V+NSW WG GY+ + Sbjct: 197 QLYQGGIYWSWFCRTQYI-YNHAMGIVGYGVEGSEEYWIVRNSWGESWGEQGYIRYLLGS 255 Query: 424 NNCGVQSAPTY 456 N C + TY Sbjct: 256 NVCNIADYVTY 266 >UniRef50_Q24E33 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 328 Score = 93.1 bits (221), Expect = 3e-18 Identities = 56/149 (37%), Positives = 81/149 (54%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG RA++++ HG+ TEE+Y Y +DG C KI + V N + L A Sbjct: 193 GGLMPRAFRYVKAHGITTEEEYP-YTAKDGKCQTKQ--GQYKIKSFSTVPRGNCDKLAAA 249 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 + + P+SV +DA + F FY++GV+ CK K L+H VLA GY YW++KN Sbjct: 250 IAQQ-PVSVGVDATN--FKFYTSGVF--DNCKKK---LNHGVLATGYTA----DYWIIKN 297 Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTY 456 SW WG +GY+ + R N CGV + +Y Sbjct: 298 SWGTAWGQNGYINLK-RGNTCGVCNTASY 325 >UniRef50_A2F4G0 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 203 Score = 93.1 bits (221), Expect = 3e-18 Identities = 51/149 (34%), Positives = 77/149 (51%), Gaps = 5/149 (3%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHGLPT---EEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTN-NEN 174 GGG Y+ IMK T + 
DY Y + G C D++ I TT NE Sbjct: 48 GGGWPSGTYKSIMKQFNGTFILDSDYP-YTAKRGVCKFDSMPKAAPIMTTYGTTTKYNET 106 Query: 175 ALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKY 354 AL LA+ G +V++DA+ +F Y +G+Y+EP C + +D ++ VGYG Y Sbjct: 107 ALALAVSLVGVATVSVDASRTSFQLYQSGIYYEPDC--STETMDLSMACVGYGTEGTTNY 164 Query: 355 WLVKNSWSNMWGNDGYV-LMSMRENNCGV 438 W+VKN + + WG GY+ ++ + NNC + Sbjct: 165 WIVKNCFGDKWGEQGYIRMIKDKNNNCAI 193 >UniRef50_Q22W19 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 92.7 bits (220), Expect = 5e-18 Identities = 55/153 (35%), Positives = 81/153 (52%), Gaps = 4/153 (2%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG+ AY++++++G+ TE DY Y G + C D + K +V VT N+ + L +A Sbjct: 187 GGDLPPAYKYVVQNGIETEADYP-YKGVNQKCAYDASKVVFKPKSFVQVTPNSPDQLAIA 245 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 L K P+ + I+A K F FY++G+ C LDH VLAVGY + W+VKN Sbjct: 246 LNKE-PVPICIEADQKAFQFYTSGI-ISSGCGTN---LDHCVLAVGYDADS----WIVKN 296 Query: 370 SWSNMWGNDGYVLMSMRENN----CGVQSAPTY 456 SW WG +GYV ++ CG+ P Y Sbjct: 297 SWGASWGENGYVRIARTTAKGPGVCGIYEEPVY 329 >UniRef50_A2WPN4 Cluster: Putative uncharacterized protein; n=2; Oryza sativa (indica cultivar-group)|Rep: Putative uncharacterized protein - Oryza sativa subsp. 
indica (Rice) Length = 325 Score = 92.3 bits (219), Expect = 6e-18 Identities = 55/156 (35%), Positives = 78/156 (50%), Gaps = 7/156 (4%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTA--ITKITGWVNVTTNNENALK 183 GG A + G T E+ Y G G C + + ++G+ V N+E L Sbjct: 171 GGHSDTALNLVASRGGITSEEKYPYTGVQGSCDVGKLLFDHSASVSGFAAVPPNDERQLA 230 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWL 360 LA+ + P++V IDA+ + F FY GVY P C ++HAV VGY G KYW+ Sbjct: 231 LAVARQ-PVTVYIDASAQEFQFYKGGVYKGP-CNP--GSVNHAVTIVGYCENFGGEKYWI 286 Query: 361 VKNSWSNMWGNDGYVLMS----MRENNCGVQSAPTY 456 KNSWSN WG GYV ++ + CG+ ++P Y Sbjct: 287 AKNSWSNDWGEQGYVYLAKDVWWPQGTCGLATSPFY 322 >UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: Cathepsin L - Kudoa thyrsites Length = 300 Score = 92.3 bits (219), Expect = 6e-18 Identities = 47/132 (35%), Positives = 74/132 (56%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG A+ +++ +G+ +DY Y + G C + + +I+ + V N E+ ++ + Sbjct: 166 GGLPEIAFLYVINNGIMKLKDYP-YTAKQGTCQY-SPEDVVRISSFKCVENNEESVME-S 222 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 + +GP S+ I+AA ++F FY G+Y +P + LDHAVL VGYG N YW VKN Sbjct: 223 VANNGPNSIGINAASRSFQFYGGGIYSDPWASSY--PLDHAVLLVGYGYKNTENYWHVKN 280 Query: 370 SWSNMWGNDGYV 405 SW WG GY+ Sbjct: 281 SWGPWWGEQGYI 292 >UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 291 Score = 91.5 bits (217), Expect = 1e-17 Identities = 43/120 (35%), Positives = 69/120 (57%), Gaps = 1/120 (0%) Frame = +1 Query: 82 YLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAAHKTFSFYSNG 261 Y G DG C D TA+ + +V+V + +E L ++++G V +D + +F YS+G Sbjct: 168 YQGVDGACKFDAKTAMPVTSNFVSVPSGSERDLANYVYQYGVAVVVLDCSRISFQLYSSG 227 Query: 262 VYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYVLMSMRENN-CGV 438 +Y +P C ++ LDHA+ VGY YW+++NSW WG 
GY+ ++ +NN CGV Sbjct: 228 IYSDPCCSSQ--NLDHAMNVVGY----SDSYWIIRNSWGTSWGESGYMRLAKDKNNMCGV 281 >UniRef50_Q0MYX5 Cluster: Putative cysteine protease; n=1; Emiliania huxleyi|Rep: Putative cysteine protease - Emiliania huxleyi Length = 276 Score = 91.1 bits (216), Expect = 1e-17 Identities = 54/148 (36%), Positives = 75/148 (50%), Gaps = 5/148 (3%) Frame = +1 Query: 28 AYQWIMK-HGLPTEEDYG--GYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFK 198 A++WI + + L TE Y G G C +T +V + +E+AL+ A+ K Sbjct: 4 AFEWIAEGNPLCTESTYPYTSGAGLTGTCK-KACNGEVSLTSHKDVPSGDEDALRAAVAK 62 Query: 199 HGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV--LNGHKYWLVKNS 372 P+SVAI+A F Y +GV C ELDH VL VGYG G YW +KNS Sbjct: 63 Q-PVSVAIEADKSAFQLYQSGVIDSASCGK---ELDHGVLVVGYGTDTATGKDYWKIKNS 118 Query: 373 WSNMWGNDGYVLMSMRENNCGVQSAPTY 456 W WG +G+V + +N CG+ S +Y Sbjct: 119 WGGTWGEEGFVRVVQGKNMCGISSQASY 146 >UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thaliana|Rep: F9P14.12 protein - Arabidopsis thaliana (Mouse-ear cress) Length = 343 Score = 90.2 bits (214), Expect = 2e-17 Identities = 60/157 (38%), Positives = 81/157 (51%), Gaps = 6/157 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHID-NVTAITKITGWVNVTTNNENALK 183 GG A+++I +G L TE DY Y G +G C + + + I G+ V NE +L+ Sbjct: 193 GGLMETAFEFIKTNGGLATETDYP-YTGIEGTCDQEKSKNKVVTIQGYQKVA-QNEASLQ 250 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363 +A + P+SV IDA F YS+GV F C L+H V VGYGV KYW+V Sbjct: 251 IAAAQQ-PVSVGIDAGGFIFQLYSSGV-FTNYCGTN---LNHGVTVVGYGVEGDQKYWIV 305 Query: 364 KNSWSNMWGNDGYVLM----SMRENNCGVQSAPTYVL 462 KNSW WG +GY+ M S CG+ +Y L Sbjct: 306 KNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASYPL 342 >UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativa|Rep: Os01g0347600 protein - Oryza sativa subsp. 
japonica (Rice) Length = 343 Score = 90.2 bits (214), Expect = 2e-17 Identities = 56/159 (35%), Positives = 81/159 (50%), Gaps = 9/159 (5%) Frame = +1 Query: 7 GGGEDFRAYQWIM-KHGLPTEEDYGGYLGQDGYCHIDNV--TAITKITGWVNVTTNNENA 177 GGG RA++ + K G+ E DY Y G G C +D++ +I G+ V N+E Sbjct: 188 GGGHTDRAFELVASKGGITAESDYR-YEGFQGKCRVDDMLFNHAARIGGYRAVPPNDERQ 246 Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGY--GVLNGHK 351 L A+ + P++V IDA+ F FY +GV+ P + +HAV VGY +G K Sbjct: 247 LATAVARQ-PVTVYIDASGPAFQFYKSGVFPGPCGASS----NHAVTLVGYCQDGASGKK 301 Query: 352 YWLVKNSWSNMWGNDGYVLMS----MRENNCGVQSAPTY 456 YW+ KNSW WG GY+L+ CG+ +P Y Sbjct: 302 YWVAKNSWGKTWGQQGYILLEKDVLQPHGTCGLAVSPFY 340 >UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryota|Rep: Cysteine proteinase GP-I - Zingiber officinale (Ginger) Length = 221 Score = 89.8 bits (213), Expect = 3e-17 Identities = 49/153 (32%), Positives = 75/153 (49%), Gaps = 4/153 (2%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG +RA+Q+I+ +G E++ Y G +G C + I + NV +N+E +L+ A Sbjct: 67 GGWPYRAFQYIINNGGINSEEHYPYTGTNGTCDTKENAHVVSIDSYRNVPSNDEKSLQKA 126 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 + + P+SV +DAA + F Y NG+ F C +H G N YW VKN Sbjct: 127 V-ANQPVSVTMDAAGRDFQLYRNGI-FTGSCNISA---NHYRTVGGRETENDKDYWTVKN 181 Query: 370 SWSNMWGNDGYVLMSMR----ENNCGVQSAPTY 456 SW WG GY+ + CG+ +P+Y Sbjct: 182 SWGKNWGESGYIRVERNIAESSGKCGIAISPSY 214 >UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza sativa|Rep: Os09g0381400 protein - Oryza sativa subsp. 
japonica (Rice) Length = 362 Score = 89.4 bits (212), Expect = 4e-17 Identities = 54/131 (41%), Positives = 73/131 (55%), Gaps = 4/131 (3%) Frame = +1 Query: 25 RAYQWIMKHG-LPTEEDYGGYLGQDGYCH-IDNVTAITKITGWVNVTTNNENALKLALFK 198 RAY+W++++G L TE DY Y + G C+ + KITG+ V NE AL+ A+ + Sbjct: 214 RAYKWVVENGGLTTEADYP-YTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVAR 272 Query: 199 HGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV--LNGHKYWLVKNS 372 P++VAI+ FY GVY P C + L HAV VGYG +G KYW +KNS Sbjct: 273 Q-PVAVAIEVG-SGMQFYKGGVYTGP-CGTR---LAHAVTVVGYGTDASSGAKYWTIKNS 326 Query: 373 WSNMWGNDGYV 405 W WG GY+ Sbjct: 327 WGQSWGERGYI 337 >UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative uncharacterized protein - Oryza sativa subsp. japonica (Rice) Length = 326 Score = 89.4 bits (212), Expect = 4e-17 Identities = 54/134 (40%), Positives = 69/134 (51%), Gaps = 6/134 (4%) Frame = +1 Query: 73 YGGYLGQDGYCHID-NVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAAHKTFSF 249 Y Y C D N I KI + V N+E ALK A++ GP+SV I+A+++ F Sbjct: 182 YPAYEAVQEPCRFDPNKAPIVKIDSYSFVDPNDEEALKQAVYSQGPVSVLIEASYE-FMI 240 Query: 250 YSNGVYFEPKCKNKVDELDHAVLAVGYGVL-NGHKYWLVKNSWSNMWGNDGYVLMSMR-- 420 Y GV+ P C EL+HAVL VGY +G YW+VKNSW WG GY+ M Sbjct: 241 YQGGVFSGP-CGT---ELNHAVLVVGYDETEDGTPYWIVKNSWGAGWGESGYIRMIRNIP 296 Query: 421 --ENNCGVQSAPTY 456 E CG+ P Y Sbjct: 297 APEGICGIAMYPIY 310 >UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain; n=9; Cucujiformia|Rep: Digestive cysteine proteinase intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 326 Score = 89.4 bits (212), Expect = 4e-17 Identities = 56/157 (35%), Positives = 78/157 (49%), Gaps = 5/157 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG A+ +++ G+ + Y Y G D C D + KI G+ NV+ N+E LK A Sbjct: 177 GGLMSFAFDYVLDKGIEADSSYP-YKGIDTPCQYDAKKTVLKIKGYKNVS-NSEEELKKA 234 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYG----VLNGHKYW 357 + GP+SVAIDA Y G+ C 
+ L+H VLAVGYG + K+W Sbjct: 235 VGTVGPVSVAIDA--DPIQLYFGGILDGLFCTHN---LNHGVLAVGYGEEDHLFGKKKFW 289 Query: 358 LVKNSWSNMWGNDGYVLMSMRENN-CGVQSAPTYVLI 465 VKNSW WG GY + NN CG+ +Y ++ Sbjct: 290 KVKNSWGKDWGEQGYFRIKRDANNLCGIADKASYPIL 326 >UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2; Arabidopsis thaliana|Rep: Putative cysteine proteinase - Arabidopsis thaliana (Mouse-ear cress) Length = 365 Score = 89.0 bits (211), Expect = 6e-17 Identities = 54/154 (35%), Positives = 79/154 (51%), Gaps = 5/154 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAI-TKITGWVNVTTNNENALKL 186 GGE A+++I+K+G + E Y + C + A T+I G+ V ++NE AL L Sbjct: 213 GGEFEEAFKYIIKNGGVSLETEYPYQVKKESCRANARRAPHTQIRGFQMVPSHNERAL-L 271 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 + P+SV IDA +F Y GVY C V+ HAV VGYG ++G YW++K Sbjct: 272 EAVRRQPVSVLIDARADSFGHYKGGVYAGLDCGTDVN---HAVTIVGYGTMSGLNYWVLK 328 Query: 367 NSWSNMWGNDGYVL----MSMRENNCGVQSAPTY 456 NSW WG +GY+ + + CG+ Y Sbjct: 329 NSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAY 362 >UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 462 Score = 89.0 bits (211), Expect = 6e-17 Identities = 54/166 (32%), Positives = 87/166 (52%), Gaps = 8/166 (4%) Frame = +1 Query: 1 ARGGGEDFRAYQWI--MKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNEN 174 A GGE + AY + ++ L TEE+Y YLG G+C + I K+TG + ++ N Sbjct: 296 ACAGGEGYDAYGKLAELQLNLTTEEEYP-YLGVSGHCQKNFGKTIGKVTGCYQIMRDSSN 354 Query: 175 A---LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDE---LDHAVLAVGYGV 336 + AL+K+GP+ + I A F Y+ G + + +D+ DH VL G+ Sbjct: 355 KDINVLRALYKYGPLMIYIRAGTAPFVAYTGGSFNNHEVCGGIDDHDKTDHGVLLTGWKT 414 Query: 337 LNGHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI*IS 474 ++G ++ + NSWS WG +G+ +S EN+CGV P L+ I+ Sbjct: 415 IDGVIHYEIMNSWSTFWGEEGFAYIS-SENDCGVPVMPLLPLVEIN 459 >UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like cysteine 
peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 452 Score = 89.0 bits (211), Expect = 6e-17 Identities = 47/146 (32%), Positives = 79/146 (54%), Gaps = 3/146 (2%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLP-TEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GGE Y+ + + + T ED YLG YC + + + G + ++ LK Sbjct: 292 GGEHDEIYRILRESKMELTLEDEYPYLGVGSYCGKNFKHTVGYVKGCYKIPEHDNEKLKS 351 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKC--KNKVDELDHAVLAVGYGVLNGHKYWL 360 ALF+HGP++V I A F ++ +Y C +KV ++DH+VL G+ +NG W Sbjct: 352 ALFEHGPLAVGIIADQDGFGTLTDNIYDNANCYVHDKV-KIDHSVLLTGWKRINGVDAWE 410 Query: 361 VKNSWSNMWGNDGYVLMSMRENNCGV 438 + NSWS++WG+ G+ + M +++CG+ Sbjct: 411 IMNSWSDVWGDHGFGYIVMGDHDCGI 436 >UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-like cysteine peptidase; n=3; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L or K-like cysteine peptidase - Trichomonas vaginalis G3 Length = 320 Score = 89.0 bits (211), Expect = 6e-17 Identities = 47/131 (35%), Positives = 71/131 (54%), Gaps = 1/131 (0%) Frame = +1 Query: 61 TEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAAHKT 240 T DY Y+ + C D ++ K TG+ V + +AL A+ + S+ IDA+ + Sbjct: 188 TAADYP-YIARASICKFDKTKSVAKTTGFERVKPGSSDALIEAV-QTSVCSLLIDASINS 245 Query: 241 FSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYV-LMSM 417 F Y +G+Y + KC +LDH V VGYG +G YW+++NSW WG GY+ +++ Sbjct: 246 FMQYKSGIYDDTKCDPT--QLDHYVNLVGYGSESGINYWIIRNSWGEAWGESGYIRIINN 303 Query: 418 RENNCGVQSAP 450 N CGV S P Sbjct: 304 AANVCGVLSHP 314 >UniRef50_UPI0000E492F4 Cluster: PREDICTED: similar to cathepsin L-like proteinase; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin L-like proteinase - Strongylocentrotus purpuratus Length = 329 Score = 88.2 bits (209), Expect = 1e-16 Identities = 44/120 (36%), Positives = 72/120 (60%), Gaps = 2/120 (1%) Frame = +1 Query: 103 CHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKC 282 C+ + A+ +VT 
NE+AL A++ P+ VAIDA+ +F Y +GVY +P C Sbjct: 209 CNNASCKAVASSNVGKSVTQGNESALAEAVY-FTPVVVAIDASQPSFQLYVSGVYSDPNC 267 Query: 283 KNKVDELDHAVLAVGYGVLN-GHKYWLVKNSWSNMWGNDGYVLMSMRENN-CGVQSAPTY 456 + + LD ++L VGYGV + G +YW+ +N+W WG++GY+ ++ NN CG+ + Y Sbjct: 268 SSTL--LDLSLLLVGYGVSSVGTEYWICRNTWGEEWGDNGYINIARNHNNMCGIATDAIY 325 >UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8; Magnoliophyta|Rep: Os12g0273800 protein - Oryza sativa subsp. japonica (Rice) Length = 504 Score = 88.2 bits (209), Expect = 1e-16 Identities = 54/136 (39%), Positives = 72/136 (52%), Gaps = 2/136 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAIT-KITGWVNVTTNNENALKL 186 GGE A+Q+I+ +G T E Y +DG C + I G+ +V N+E +L Sbjct: 190 GGEIDGAFQFILSNGGLTAEANYPYTAEDGRCKTTAAADVAASIRGYEDVPANDEPSLMK 249 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN-GHKYWLV 363 A+ P+SVA+DA+ F FY GV +C LDH V +GYG + G KYWLV Sbjct: 250 AVAGQ-PVSVAVDASK--FQFYGGGV-MAGECGTS---LDHGVTVIGYGAASDGTKYWLV 302 Query: 364 KNSWSNMWGNDGYVLM 411 KNSW WG GY+ M Sbjct: 303 KNSWGTTWGEAGYLRM 318 >UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudicotyledons|Rep: Chymopapain precursor - Carica papaya (Papaya) Length = 352 Score = 88.2 bits (209), Expect = 1e-16 Identities = 54/154 (35%), Positives = 77/154 (50%), Gaps = 5/154 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHI-DNVTAITKITGWVNVTTNNENALKL 186 GG + Q++ +G+ T + Y Y + C D KITG+ V +N E + L Sbjct: 199 GGYQTTSLQYVANNGVHTSKVYP-YQAKQYKCRATDKPGPKVKITGYKRVPSNCETSF-L 256 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 + P+SV ++A K F Y +GV+ P C K LDHAV AVGYG +G Y ++K Sbjct: 257 GALANQPLSVLVEAGGKPFQLYKSGVFDGP-CGTK---LDHAVTAVGYGTSDGKNYIIIK 312 Query: 367 NSWSNMWGNDGYVLMSMRENN----CGVQSAPTY 456 NSW WG GY+ + + N CGV + Y Sbjct: 313 NSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYY 346 >UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF6860, whole 
genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 251 Score = 87.8 bits (208), Expect = 1e-16 Identities = 45/99 (45%), Positives = 59/99 (59%), Gaps = 1/99 (1%) Frame = +1 Query: 82 YLGQDGY-CHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAAHKTFSFYSN 258 +L QD C+ DN A+ I + + +E AL A+ GPI+VAIDA+H +F FYS+ Sbjct: 97 FLQQDTQPCYYDNKRAVGTIRDYRFIPKGDEQALADAVATIGPITVAIDASHSSFLFYSS 156 Query: 259 GVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSW 375 G+Y E C + L HAVL VGYG G YWL+KN W Sbjct: 157 GIYEESNC--NPNNLSHAVLLVGYGSEGGQDYWLIKNRW 193 >UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicotyledons|Rep: Cysteine proteinase - Mesembryanthemum crystallinum (Common ice plant) Length = 367 Score = 87.8 bits (208), Expect = 1e-16 Identities = 53/139 (38%), Positives = 74/139 (53%), Gaps = 5/139 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAIT-KITGWVNVTTNNENALKL 186 GG RA+++I + G T E Y Q G C + + T I G+ N+ + + LK+ Sbjct: 190 GGTMGRAFEYIKQRGGITSEANYPYKAQAGMCKNNLIQRPTVSIDGYYNIRRSEDAVLKI 249 Query: 187 ALFKHGPISVAIDA---AHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN-GHKY 354 H P+SVA+DA + + FY GV+ P C K L+H V AVGYG N G+ Y Sbjct: 250 --LAHQPVSVAVDATTWSSLDWMFYFQGVFTGP-CGTK---LNHGVTAVGYGTTNDGYDY 303 Query: 355 WLVKNSWSNMWGNDGYVLM 411 W++KNSW WG GY+ M Sbjct: 304 WIIKNSWGETWGERGYMRM 322 >UniRef50_A2DKX2 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 317 Score = 87.8 bits (208), Expect = 1e-16 Identities = 55/149 (36%), Positives = 75/149 (50%), Gaps = 6/149 (4%) Frame = +1 Query: 10 GGEDFRAYQWIM-----KHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNEN 174 GGE RA +I+ K GL E DY GYC D +TK VN T +E Sbjct: 166 GGEADRAVGYIVTDQDGKFGL--ESDYPYKSESMGYCEFDPSKGVTKALA-VNYT-RDEA 221 Query: 175 ALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKY 354 +K+ + GP+ D++ + F +Y GVY+ C +DH + VGYG NG Y Sbjct: 222 
DMKVRVATTGPLICGYDSSSEDFEYYYQGVYYSDDCS--AWGIDHWMTIVGYGTYNGDDY 279 Query: 355 WLVKNSWSNMWGNDGYVLMSM-RENNCGV 438 WLVKNS+ WG GY +++ R+ CGV Sbjct: 280 WLVKNSFGKGWGQQGYGMVARNRDGACGV 308 >UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35; Viridiplantae|Rep: Cysteine proteinase 15A precursor - Pisum sativum (Garden pea) Length = 363 Score = 87.8 bits (208), Expect = 1e-16 Identities = 53/153 (34%), Positives = 79/153 (51%), Gaps = 8/153 (5%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG A++++++ G + E+DY Y G+DG C D + ++ + +V T +E+ + Sbjct: 205 GGLMNNAFEYLLESGGVVQEKDYA-YTGRDGSCKFDKSKVVASVSNF-SVVTLDEDQIAA 262 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-------LNG 345 L K+GP++VAI+AA Y +GV C LDH VL VG+G L Sbjct: 263 NLVKNGPLAVAINAAW--MQTYMSGVSCPYVCAKS--RLDHGVLLVGFGKGAYAPIRLKE 318 Query: 346 HKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQS 444 YW++KNSW WG GY + N CGV S Sbjct: 319 KPYWIIKNSWGQNWGEQGYYKICRGRNVCGVDS 351 >UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F like protease; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to cathepsin F like protease - Nasonia vitripennis Length = 1036 Score = 87.4 bits (207), Expect = 2e-16 Identities = 56/159 (35%), Positives = 74/159 (46%), Gaps = 7/159 (4%) Frame = +1 Query: 10 GGEDFRAYQWIMK-HGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG AY+ I + GL E DY Y +D CH + I +N+T+N E + Sbjct: 881 GGLPDTAYRAIEELGGLELESDYP-YDAEDEKCHFNKNKVKVNIVSGLNITSN-ETQMAQ 938 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL------NGH 348 L K+GP+S+ I+A FY GV K D LDH VL VGYGV Sbjct: 939 WLVKNGPMSIGINA--NAMQFYMGGVSHPFKFLCSPDSLDHGVLIVGYGVKFYPIFKKTM 996 Query: 349 KYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 YW++KNSW WG GY + + CGV T ++ Sbjct: 997 PYWIIKNSWGPRWGEQGYYRVYRGDGTCGVNKMVTSAVV 1035 >UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin S preproprotein; n=2; Tribolium castaneum|Rep: PREDICTED: similar to cathepsin S preproprotein - Tribolium castaneum 
Length = 525 Score = 87.4 bits (207), Expect = 2e-16 Identities = 48/144 (33%), Positives = 72/144 (50%), Gaps = 2/144 (1%) Frame = +1 Query: 31 YQWIMK-HGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGP 207 Y++I+K G+ ++DY Y G C + +T +E L+ + GP Sbjct: 382 YKYIVKSEGINYDQDYR-YQSAPGTCRFRADKPKITFRKYAYLTAISEEDLQWIVANVGP 440 Query: 208 ISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMW 387 ++V+ D K F YS GV++ C H + VGYG NG +WLVKNS+ W Sbjct: 441 VTVSFDGRGKQFKSYSGGVFYNKTCTRMKT---HVAVLVGYGTENGEDFWLVKNSYGPQW 497 Query: 388 GNDGYVLMSM-RENNCGVQSAPTY 456 G DGYV ++ R N+CG+ + TY Sbjct: 498 GLDGYVKIARNRNNHCGITNRITY 521 Score = 40.3 bits (90), Expect = 0.025 Identities = 26/101 (25%), Positives = 43/101 (42%), Gaps = 1/101 (0%) Frame = +1 Query: 31 YQWIMK-HGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGP 207 Y++I+ +G+ ++DY Y G C + + +E L+ + K GP Sbjct: 106 YEYIINSNGINYDQDYR-YESAPGSCRFKPNKPTVTFKKYAYLAEISEEDLQWIVAKIGP 164 Query: 208 ISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGY 330 +V+ DA YS G+Y+ C L H + VGY Sbjct: 165 ATVSFDARGSQLKSYSGGIYYNRTC---TKTLTHVAVVVGY 202 >UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa zeasingle nucleocapsid nuclear polyhedrosis virus) Length = 367 Score = 87.4 bits (207), Expect = 2e-16 Identities = 47/144 (32%), Positives = 76/144 (52%), Gaps = 1/144 (0%) Frame = +1 Query: 10 GGEDFRAYQ-WIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG A+Q ++ G+ TE DY Y G + C +DN K+ +EN LK Sbjct: 220 GGLMHLAFQELLLMGGVETEADYP-YQGSEQMCTLDNRKIAVKLNSCFKYDIRDENKLKE 278 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 ++ GP+++A+DA Y G+ + C + +L+HAVL +G+G+ N YW++K Sbjct: 279 LVYTTGPVAIAVDAMD--IINYRRGILNQ--CH--IYDLNHAVLLIGWGIENNVPYWIIK 332 Query: 367 NSWSNMWGNDGYVLMSMRENNCGV 438 NSW WG +G++ + N CG+ Sbjct: 333 NSWGEDWGENGFLRVRRNVNACGL 356 >UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Rep: Cysteine protease - Solanum 
lycopersicum (Tomato) (Lycopersicon esculentum) Length = 345 Score = 87.0 bits (206), Expect = 2e-16 Identities = 52/134 (38%), Positives = 74/134 (55%), Gaps = 2/134 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG A+ +I+++G + E DY YLGQ C TA +I+ + V E +L Sbjct: 195 GGFMTNAFDFIIENGGISRESDYE-YLGQQYTCRSQEKTAAVQISSY-QVVPEGETSLLQ 252 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLV 363 A+ K P+S+ I AA + FY+ G Y + C D ++HAV A+GYG G KYWL+ Sbjct: 253 AVTKQ-PVSIGI-AASQDLQFYAGGTY-DGNC---ADRINHAVTAIGYGTDEEGQKYWLL 306 Query: 364 KNSWSNMWGNDGYV 405 KNSW WG +GY+ Sbjct: 307 KNSWGTSWGENGYM 320 >UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza sativa|Rep: Putative cysteine protease - Oryza sativa subsp. japonica (Rice) Length = 357 Score = 86.6 bits (205), Expect = 3e-16 Identities = 51/159 (32%), Positives = 81/159 (50%), Gaps = 9/159 (5%) Frame = +1 Query: 7 GGGEDFRAYQWIM-KHGLPTEEDYGGYLGQDGYCHIDNV--TAITKITGWVNVTTNNENA 177 GGG A+Q ++ K G+ E +Y Y G G C +D++ ++ G+ V +E Sbjct: 199 GGGHTDAAFQLVVDKGGITAESEYR-YEGYKGRCRVDDMLFNHAARVGGYRAVPPADERQ 257 Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGY--GVLNGHK 351 L A+ + P++ +DA+ F FY +GV+ P+ + +HAV VGY +G K Sbjct: 258 LATAVARQ-PVTAYVDASGPAFQFYGSGVFPGPR-GTAAPKPNHAVTLVGYCQDGASGKK 315 Query: 352 YWLVKNSWSNMWGNDGYVLM----SMRENNCGVQSAPTY 456 YW+ KNSW WG GY+L+ + CG+ +P Y Sbjct: 316 YWIAKNSWGKTWGQQGYILLEKDVASPHGTCGLAVSPFY 354 >UniRef50_O16454 Cluster: Temporarily assigned gene name protein 196; n=4; Bilateria|Rep: Temporarily assigned gene name protein 196 - Caenorhabditis elegans Length = 477 Score = 86.6 bits (205), Expect = 3e-16 Identities = 54/152 (35%), Positives = 74/152 (48%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG AY+ I++ G ED Y G+ CH+ I G V + ++E ++ Sbjct: 328 GGLPSNAYKEIIRMGGLEPEDAYPYDGRGETCHLVRKDIAVYINGSVELP-HDEVEMQKW 386 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 
369 L GPIS+ ++A T FY +GV K + L+H VL VGYG YW+VKN Sbjct: 387 LVTKGPISIGLNA--NTLQFYRHGVVHPFKIFCEPFMLNHGVLIVGYGKDGRKPYWIVKN 444 Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 SW WG GY + +N CGVQ T L+ Sbjct: 445 SWGPNWGEAGYFKLYRGKNVCGVQEMATSALV 476 >UniRef50_Q9VNK7 Cluster: CG1075-PA; n=1; Drosophila melanogaster|Rep: CG1075-PA - Drosophila melanogaster (Fruit fly) Length = 274 Score = 86.2 bits (204), Expect = 4e-16 Identities = 41/126 (32%), Positives = 68/126 (53%), Gaps = 1/126 (0%) Frame = +1 Query: 28 AYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGP 207 A+ + +G+ ++E Y Y ++G C D + + +V +T+N+E L ++K GP Sbjct: 120 AFNFKRDYGIASKESYP-YKPENGECRWDRRKSTGTLREYVTLTSNDERELAKVVYKIGP 178 Query: 208 ISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLVKNSWSNM 384 + V+ID H+ F Y G+ P C+N +L H+VL VG+ YW++KNS+ Sbjct: 179 VEVSIDHLHEEFDQYFGGILRTPSCRNTNYDLKHSVLLVGFETHPKWGDYWIIKNSYGTE 238 Query: 385 WGNDGY 402 WG GY Sbjct: 239 WGESGY 244 >UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin L-like cysteine proteinase precursor - Acanthoscelides obtectus (Bean weevil) Length = 321 Score = 86.2 bits (204), Expect = 4e-16 Identities = 48/113 (42%), Positives = 65/113 (57%), Gaps = 2/113 (1%) Frame = +1 Query: 133 KITGWVNVTTNNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFE-PKCKNKVDELDH 309 KITG+ V+ +E L A+ GPIS+A+D H FY G+ + CKN +L+H Sbjct: 215 KITGYQAVSKGDEVVLAQAVATIGPISIALDGNH--IMFYRRGIVSKWCGCKNSEKDLNH 272 Query: 310 AVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYV-LMSMRENNCGVQSAPTYVLI 465 VL VGYG +G YW+VKNSW +WG GY L N CGV + P+Y ++ Sbjct: 273 GVLLVGYG--DG--YWIVKNSWGRIWGEQGYFRLKKDAGNTCGVATWPSYPIL 321 >UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrovirus|Rep: Viral cathepsin - Lymantria dispar multicapsid nuclear polyhedrosis virus (LdMNPV) Length = 356 Score = 86.2 bits (204), Expect = 4e-16 Identities = 56/145 (38%), Positives = 77/145 (53%), Gaps = 2/145 (1%) Frame = +1 Query: 10 
GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTA-ITKITGWVNVTTNNENALK 183 GG A++ IM+ G + TE DY ++G++ C +D + + G NE LK Sbjct: 208 GGLLHTAFEEIMRMGGVQTELDYP-FVGRNRRCGLDRHRPYVVSLVGCYRYVMVNEEKLK 266 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363 L GPI +AIDAA Y GV C+N + L+HAVL VGYGV NG YW+ Sbjct: 267 DLLRAVGPIPMAIDAAD--IVNYYRGVI--SSCEN--NGLNHAVLLVGYGVENGVPYWVF 320 Query: 364 KNSWSNMWGNDGYVLMSMRENNCGV 438 KN+W + WG +GY + N CG+ Sbjct: 321 KNTWGDDWGENGYFRVRQNVNACGM 345 >UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|Rep: LD36817p - Drosophila melanogaster (Fruit fly) Length = 352 Score = 85.8 bits (203), Expect = 5e-16 Identities = 48/161 (29%), Positives = 84/161 (52%), Gaps = 8/161 (4%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTA-------ITKITGWVNVTTN 165 GG +++ +++I HG+ Y Y + C N TA + KI + +T Sbjct: 197 GGFQEY-GFEYIRDHGVTLANKYP-YTQTEMQCR-QNETAGRPPRESLVKIRDYATITPG 253 Query: 166 NENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNG 345 +E +K + GP++ +++A +F YS G+Y + +C EL+H+V VGYG NG Sbjct: 254 DEEKMKEVIATLGPLACSMNADTISFEQYSGGIYEDEECNQ--GELNHSVTVVGYGTENG 311 Query: 346 HKYWLVKNSWSNMWGNDGYV-LMSMRENNCGVQSAPTYVLI 465 YW++KNS+S WG G++ ++ CG+ S +Y ++ Sbjct: 312 RDYWIIKNSYSQNWGEGGFMRILRNAGGFCGIASECSYPIL 352 >UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep: Viral cathepsin - Cydia pomonella granulosis virus (CpGV) (Cydia pomonellagranulovirus) Length = 333 Score = 85.8 bits (203), Expect = 5e-16 Identities = 53/145 (36%), Positives = 73/145 (50%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG A + I++ G + Y G DG C I+G NEN L+ Sbjct: 188 GGLMHWALESILQEGGVVSAENEPYYGFDGVCKKSPFEL--SISGSRRYVLQNENKLREL 245 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 L +GPISVAID + Y G+ C+N + L+HAVL VGYGV N YW++KN Sbjct: 246 LVVNGPISVAIDVSD--LINYKAGI--ADICENN-EGLNHAVLLVGYGVKNDVPYWILKN 300 Query: 370 SWSNMWGNDGYVLMSMRENNCGVQS 444 SW WG +GY + +N+CG+ + Sbjct: 301 
SWGAEWGEEGYFRVQRDKNSCGMMN 325 >UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa|Rep: Os09g0497500 protein - Oryza sativa subsp. japonica (Rice) Length = 349 Score = 85.4 bits (202), Expect = 7e-16 Identities = 58/168 (34%), Positives = 81/168 (48%), Gaps = 18/168 (10%) Frame = +1 Query: 7 GGGEDFRAYQWIM-KHGLPTEEDYGGYLGQDGYCHIDNVT-AITKITGWVNVTTNNENAL 180 GGG A+++++ HGL TE Y Y +G C + + I G+ NVT ++E L Sbjct: 185 GGGYMSWAFEFVVGNHGLTTEASYP-YHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDL 243 Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYG--------- 333 A P+SVA+D F Y +GVY P C +++H V VGYG Sbjct: 244 ARAAAAQ-PVSVAVDGGSFMFQLYGSGVYTGP-C---TADVNHGVTVVGYGESEPKTDGG 298 Query: 334 --VLNGHKYWLVKNSWSNMWGNDGYVLM-----SMRENNCGVQSAPTY 456 G KYW+VKNSW WG+ GY+LM + CG+ P+Y Sbjct: 299 GAAKGGEKYWIVKNSWGAEWGDAGYILMQRDVAGLASGLCGIALLPSY 346 >UniRef50_Q22RC9 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 389 Score = 85.4 bits (202), Expect = 7e-16 Identities = 44/114 (38%), Positives = 63/114 (55%) Frame = +1 Query: 103 CHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKC 282 C V KI W + +E+++K LF+ GP+SVA+DA++ FY G+ PK Sbjct: 255 CRQGQVPIAAKIEDW-KALSKDEDSIKQQLFEIGPLSVALDASY--LQFYKKGIS-APKF 310 Query: 283 KNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQS 444 +K L+HAVL GYG+ NG ++W VKNSW WG GY + CG+ + Sbjct: 311 CSKTT-LNHAVLLTGYGIDNGVEFWNVKNSWGAKWGEQGYFRLKRGVGMCGINT 363 >UniRef50_A3EXS2 Cluster: Cathepsin L-like cysteine proteinase-like protein; n=1; Maconellicoccus hirsutus|Rep: Cathepsin L-like cysteine proteinase-like protein - Maconellicoccus hirsutus (hibiscus mealybug) Length = 253 Score = 85.4 bits (202), Expect = 7e-16 Identities = 47/146 (32%), Positives = 72/146 (49%), Gaps = 3/146 (2%) Frame = +1 Query: 10 GGEDFRAYQWIM-KHGLPTEEDY-GGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALK 183 GG Y+++ G+ E+ Y + + C D+ + I + TN E LK Sbjct: 
100 GGNLENTYKYVNHSRGIEKEDSYRDNFRHINSRCQYDSTKSAVSIKNFSRCQTN-EAHLK 158 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363 + + P+SV I+ ++F Y +Y +P+C N E +AVL VGYG N YWL+ Sbjct: 159 MQVVGR-PVSVYINPTLESFKHYKGDIYDDPQCDNSRHESSYAVLVVGYGTDNNTDYWLI 217 Query: 364 KNSWSNMWGNDGYVLMSMRENN-CGV 438 KNS WG GY+ ++ NN CG+ Sbjct: 218 KNSLGTSWGEKGYMRLARNRNNLCGI 243 >UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus tauri|Rep: Cysteine protease-1 - Ostreococcus tauri Length = 430 Score = 85.0 bits (201), Expect = 9e-16 Identities = 55/166 (33%), Positives = 85/166 (51%), Gaps = 16/166 (9%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTA-ITKITGWVNVTTNNENALK 183 GG D+ A++WI+K+G E Y + C+ + + I G+ +V +E L+ Sbjct: 265 GGLMDY-AFRWIVKNGGIDSEFQYPYSAEALACNRWKLQLHVATIDGFKDVPPGDEKELE 323 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGH----- 348 A+ + P+S+AI+A K+F Y GVY +C ++V DH VL VGYG + H Sbjct: 324 KAVSQQ-PVSIAIEADTKSFQLYDGGVYDSKECGSQV---DHGVLVVGYGFDDTHHNATK 379 Query: 349 ------KYWLVKNSWSNMWGNDGYVLMSMR----ENNCGVQSAPTY 456 +W VKNSW WG G++ M+ R CG+ +AP+Y Sbjct: 380 HHKRHRHFWKVKNSWGGTWGEGGFIRMARRISDETGQCGITTAPSY 425 >UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; Eukaryota|Rep: Cysteine proteinase 1 precursor - Zea mays (Maize) Length = 371 Score = 85.0 bits (201), Expect = 9e-16 Identities = 55/156 (35%), Positives = 76/156 (48%), Gaps = 11/156 (7%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG A+ ++ K G L +E+DY Y G DG C D + + + +V + +E + Sbjct: 210 GGLMTTAFSYLQKAGGLESEKDYP-YTGSDGKCKFDKSKIVASVQNF-SVVSVDEAQISA 267 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-------LNG 345 L KHGP+++ I+AA+ Y GV C LDH VL VGYG L Sbjct: 268 NLIKHGPLAIGINAAY--MQTYIGGVSCPYICGR---HLDHGVLLVGYGASGFAPIRLKD 322 Query: 346 HKYWLVKNSWSNMWGNDGYVLM---SMRENNCGVQS 444 YW++KNSW WG +GY + S N CGV S Sbjct: 323 KPYWIIKNSWGENWGENGYYKICRGSNVRNKCGVDS 358 >UniRef50_Q86FI9 Cluster: Clone ZZD209 
mRNA sequence; n=3; Schistosoma japonicum|Rep: Clone ZZD209 mRNA sequence - Schistosoma japonicum (Blood fluke) Length = 339 Score = 84.6 bits (200), Expect = 1e-15 Identities = 48/147 (32%), Positives = 74/147 (50%), Gaps = 2/147 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG F + ++ GL TE+ Y + G+D C ++ + + G+ E LK A Sbjct: 187 GGYTFTLFIYLQSFGLETEQMYP-FTGEDQDCMANSSDVVVQSIGYKFHRHGYETILKWA 245 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN-GHKYWLVK 366 L+ GP ++++ K F Y +G+Y C + L+ ++L VGYG N G YW+V+ Sbjct: 246 LYNEGPYVISMNIDEK-FLHYKSGIYQSDTCTHY--NLNQSMLLVGYGYDNDGIDYWIVQ 302 Query: 367 NSWSNMWGNDGYVLMSMRE-NNCGVQS 444 NSW WG GYV + N CG+ S Sbjct: 303 NSWGKKWGESGYVKVRRNNWNMCGIAS 329 >UniRef50_Q23H32 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 365 Score = 84.6 bits (200), Expect = 1e-15 Identities = 49/157 (31%), Positives = 72/157 (45%), Gaps = 7/157 (4%) Frame = +1 Query: 10 GGEDFRAYQWIMK--HGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALK 183 GG+ A +M G+ +DY C D + G+ N+ NNE A+K Sbjct: 199 GGDPEPALDCVMNVLKGIMKNQDYPYQAITRKECDHDQSKNVFSPDGYENIPINNELAIK 258 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363 A+ + PIS I + + F FY G+ E + DH + VGYG NG +YW++ Sbjct: 259 EAVSRQ-PISACISGSSQNFKFYKGGIADEKLLECDPQYTDHCLGIVGYGSENGKQYWIL 317 Query: 364 KNSWSNMWGNDGYVLM-----SMRENNCGVQSAPTYV 459 KNSW WG GY+ + S + CG+ + P V Sbjct: 318 KNSWGENWGEKGYIRLLRSDSSNTQGTCGIATEPRIV 354 >UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|Rep: CG4847-PD, isoform D - Drosophila melanogaster (Fruit fly) Length = 420 Score = 84.6 bits (200), Expect = 1e-15 Identities = 46/154 (29%), Positives = 79/154 (51%), Gaps = 2/154 (1%) Frame = +1 Query: 10 GGEDFRAYQWI--MKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALK 183 GG A+ +I ++ G+ E Y Y+ G C D + + G+ + +E LK Sbjct: 271 
GGFQEAAFCFIDEVQKGVSQEGAYP-YIDNKGTCKYDGSKSGATLQGFAAIPPKDEEQLK 329 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363 + GP++ +++ +T Y+ G+Y + +C NK E +H++L VGYG G YW+V Sbjct: 330 KVVATLGPVACSVNGL-ETLKNYAGGIYNDDEC-NK-GEPNHSILVVGYGSEKGQDYWIV 386 Query: 364 KNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 KNSW + WG GY + +N C + +Y ++ Sbjct: 387 KNSWDDTWGEKGYFRLPRGKNYCFIAEECSYPVV 420 >UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanensis|Rep: Sui m 1 allergen - Suidasia medanensis Length = 336 Score = 84.2 bits (199), Expect = 2e-15 Identities = 47/155 (30%), Positives = 76/155 (49%), Gaps = 3/155 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITK--ITGWVNVTTN-NENAL 180 GG AY ++ + GL E Y Y +DG C V + ++ + N + + Sbjct: 185 GGNPIIAYAYVQQTGLVEESAYP-YQARDGQCQSSTVNGHQRYHVSAGRELPFNATDETI 243 Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360 +L + GP++V I A+ F FY NGV + ++ +++HAV VG+G +G YW+ Sbjct: 244 MNSLHQIGPMAVLIFASDNEFRFYRNGVIQNLRPNSR--QINHAVTLVGWGTEDGQDYWI 301 Query: 361 VKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 VKNSW WG GY + N G+ + Y ++ Sbjct: 302 VKNSWGPSWGESGYFRLGRHHNLIGINNYVFYPVL 336 >UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n=1; Xenopus tropicalis|Rep: UPI00006A2275 UniRef100 entry - Xenopus tropicalis Length = 272 Score = 83.8 bits (198), Expect = 2e-15 Identities = 49/129 (37%), Positives = 72/129 (55%), Gaps = 1/129 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG+ +A++++ K+G+ E Y Y GQ G C I + ++ + NE L Sbjct: 146 GGKIEKAFKYMKKYGVMEESAYP-YTGQKGLCRKKQPGNIGVVKAIHDLPSGNETLLMNT 204 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKC-KNKVDELDHAVLAVGYGVLNGHKYWLVK 366 + GP+SV+I+A+ + F + +GVY+ P C NKV +HAVL VGYG NG YWLVK Sbjct: 205 VGTIGPVSVSINASSEKFHQFKSGVYYNPDCLPNKV---NHAVLVVGYGKENGMDYWLVK 261 Query: 367 NSWSNMWGN 393 N WG+ Sbjct: 262 NR-RVAWGS 269 >UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia circumcincta|Rep: Secreted cathepsin 
F - Teladorsagia circumcincta Length = 364 Score = 83.4 bits (197), Expect = 3e-15 Identities = 50/152 (32%), Positives = 69/152 (45%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG AY+ I++ G ED Y + C + I G V + ++E ++ Sbjct: 217 GGFPLDAYKEIVRMGGLEPEDKYPYEAKAEQCRLVPSDIAVYINGSVELP-HDEEKMRAW 275 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 L K GPIS+ I FY GV C+ + + H L VGYGV YW++KN Sbjct: 276 LVKKGPISIGITV--DDIQFYKGGVSRPTTCR--LSSMIHGALLVGYGVEKNIPYWIIKN 331 Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 SW WG DGY M EN C + PT ++ Sbjct: 332 SWGPNWGEDGYYRMVRGENACRINRFPTSAVV 363 >UniRef50_Q22A69 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 330 Score = 83.4 bits (197), Expect = 3e-15 Identities = 50/157 (31%), Positives = 75/157 (47%), Gaps = 5/157 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNV-----TTNNEN 174 GG A+ ++ L TE Y Y DG C + + + +V++ + EN Sbjct: 178 GGLMDNAFTYLESAKLETESAYP-YTAVDGSCKYNQSLGVVGVASFVDIEQGKTVADTEN 236 Query: 175 ALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKY 354 + +AL GP+SVAI+A FY+ G+ C + L+H VL VG G NG + Sbjct: 237 TMGVALDNIGPLSVAINA--NNLQFYAGGISNPLICNP--NGLNHGVLIVGLGSENGKDF 292 Query: 355 WLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 W VKNSW WG GY + + CG+ A +Y ++ Sbjct: 293 WKVKNSWGASWGEKGYFRIVRGKGKCGINRAVSYPVL 329 >UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep: Viral cathepsin - Xestia c-nigrum granulosis virus (XnGV) (Xestia c-nigrumgranulovirus) Length = 346 Score = 83.4 bits (197), Expect = 3e-15 Identities = 53/144 (36%), Positives = 73/144 (50%), Gaps = 1/144 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG A++ I++ G + E Y G DG C N T +++G +E L+ Sbjct: 197 GGLMSWAFEGIIRAGGISYEAPYPYTGVDGVCK--NTTRYVQLSGCYAYDLRSEKKLRQV 254 Query: 
190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDE-LDHAVLAVGYGVLNGHKYWLVK 366 L + GP+SVAID T Y +GV C VD L+H VL VGYG N KYW +K Sbjct: 255 LHEKGPVSVAIDVVDLTN--YKSGV--AKHCS--VDHGLNHGVLLVGYGQENDVKYWTLK 308 Query: 367 NSWSNMWGNDGYVLMSMRENNCGV 438 NSW + WG G+ + N+CG+ Sbjct: 309 NSWGSDWGEQGFFRIKRDVNSCGI 332 >UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative cysteine proteinase - Oryza sativa subsp. japonica (Rice) Length = 385 Score = 83.0 bits (196), Expect = 4e-15 Identities = 54/154 (35%), Positives = 77/154 (50%), Gaps = 11/154 (7%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEED-----YGGYLGQDGYCHID-NVTAITKITGWVNVTTNNE 171 GG + A+ +++K G+ + Y Y Q C D KI G V + NE Sbjct: 209 GGYPYDAFDYVIKTGISLDNRGNPPYYPPYENQKQKCRFDPRKPPFVKIDGECLVPSGNE 268 Query: 172 NALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGH- 348 ALKLA+ P+SV I + + F Y GV+ P C + + +H VL VGYGV + Sbjct: 269 TALKLAVLSQ-PVSVVITISDE-FRSYRGGVFRGP-CGSNPNVDNHVVLVVGYGVTTDNI 325 Query: 349 KYWLVKNSWSNMWGNDGYVLMS---MRENN-CGV 438 KYW++KNSW WG GY+ M + +N CG+ Sbjct: 326 KYWIIKNSWGKTWGEYGYIRMERDILNKNGICGI 359 >UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-like cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L or H-like cysteine peptidase - Trichomonas vaginalis G3 Length = 435 Score = 83.0 bits (196), Expect = 4e-15 Identities = 47/121 (38%), Positives = 68/121 (56%), Gaps = 1/121 (0%) Frame = +1 Query: 55 LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAAH 234 L E+DY Y+G GYC +N + + V + LK AL+ +GP++VAI A Sbjct: 297 LVLEDDYP-YIGLGGYCPTNNHSMNVIVKDCWQVEPKDVEQLKRALYLYGPVAVAI-ATD 354 Query: 235 KTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL-NGHKYWLVKNSWSNMWGNDGYVLM 411 +F+ Y F K +D+L HAV G+GV +G KYW ++NSWS+ WG DGY L+ Sbjct: 355 SSFAKYQGPGVFPGKSAT-LDDLTHAVTLTGWGVAKDGTKYWEIQNSWSDFWGIDGYGLI 413 Query: 412 S 414 + Sbjct: 414 N 414 >UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, whole genome shotgun sequence; n=1; 
Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_184, whole genome shotgun sequence - Paramecium tetraurelia Length = 331 Score = 83.0 bits (196), Expect = 4e-15 Identities = 53/155 (34%), Positives = 80/155 (51%), Gaps = 3/155 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG A+ + + +G+ TEE+Y Y G D C I+ +V+V + +AL A Sbjct: 184 GGWMDDAFDYTVNYGVTTEEEYP-YKGVDQPCP-SGFKKKHFISSFVDVEPLSSDALHEA 241 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 + K P++VAI A F YS GVY +D+L+H VLAVGY + +KN Sbjct: 242 IAKT-PVAVAIKADGILFQLYSGGVYSRSCTAKTIDDLNHGVLAVGY----AKDSYTIKN 296 Query: 370 SWSNMWGNDGYV---LMSMRENNCGVQSAPTYVLI 465 SW WG GY+ L++ +E CG+ P+Y ++ Sbjct: 297 SWGASWGEKGYMRLGLVAAKEGQCGIHWVPSYPVL 331 >UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; Dictyostelium discoideum|Rep: Cysteine proteinase 1 precursor - Dictyostelium discoideum (Slime mold) Length = 343 Score = 83.0 bits (196), Expect = 4e-15 Identities = 51/152 (33%), Positives = 75/152 (49%), Gaps = 7/152 (4%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDG-YCHIDNVTAITKITGWVNVTTNNENALK 183 GG AY +I+K+G + TE Y Y + G C+ ++ KI+ + + NE + Sbjct: 192 GGLQPNAYNYIIKNGGIQTESSYP-YTAETGTQCNFNSANIGAKISNFTMIP-KNETVMA 249 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN-----GH 348 + GP+++A DA + FY GV+ P N LDH +L VGY N Sbjct: 250 GYIVSTGPLAIAADAVE--WQFYIGGVFDIPCNPNS---LDHGILIVGYSAKNTIFRKNM 304 Query: 349 KYWLVKNSWSNMWGNDGYVLMSMRENNCGVQS 444 YW+VKNSW WG GY+ + +N CGV + Sbjct: 305 PYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN 336 >UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis thaliana|Rep: Cysteine proteinase - Arabidopsis thaliana (Mouse-ear cress) Length = 348 Score = 82.6 bits (195), Expect = 5e-15 Identities = 51/137 (37%), Positives = 76/137 (55%), Gaps = 5/137 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITK---ITGWVNVTTNNENA 177 GG +A+++I+K+ G+ TE++Y Q +++ + I+G+ V NNE A Sbjct: 193 
GGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATISGYETVPMNNEEA 252 Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN-GHKY 354 L A+ + P+SV I+ F YS GV F +C +L HAV VGYG+ G KY Sbjct: 253 LLQAVSQQ-PVSVGIEGTGAAFRHYSGGV-FNGECGT---DLHHAVTIVGYGMSEEGTKY 307 Query: 355 WLVKNSWSNMWGNDGYV 405 W+VKNSW WG +GY+ Sbjct: 308 WVVKNSWGETWGENGYM 324 >UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Trypanosoma cruzi|Rep: Cysteine proteinase, putative - Trypanosoma cruzi Length = 392 Score = 82.6 bits (195), Expect = 5e-15 Identities = 49/142 (34%), Positives = 83/142 (58%), Gaps = 6/142 (4%) Frame = +1 Query: 28 AYQWIMKHGLPTE--EDYGGYLGQDGYCHID-NVTAITKITGWVNVTTNNENALKLALFK 198 AY++ K G+ +E Y Y G+ G C + +V A+ ++ +V + +N+++A+ AL K Sbjct: 219 AYEYA-KQGITSEWVYSYTSYRGETGDCRNELDVIAVAQVQSYVKIPSNDQDAVMEALAK 277 Query: 199 HGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN--GHKYWLVKNS 372 +GP+SV +DA + +S Y+ G+ F +K ++H V VGYG N YW+++NS Sbjct: 278 NGPLSVNVDATY--WSAYAGGI-FNGCDYSKNITINHVVQLVGYGHDNKLNLDYWILRNS 334 Query: 373 WSNMWGNDGYV-LMSMRENNCG 435 WS WG +GY+ L+ + CG Sbjct: 335 WSPSWGENGYMRLLRTDKAECG 356 >UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|Rep: Cathepsin F precursor - Homo sapiens (Human) Length = 484 Score = 82.6 bits (195), Expect = 5e-15 Identities = 56/155 (36%), Positives = 73/155 (47%), Gaps = 3/155 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMK-HGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG AY I GL TE+DY Y G C+ A I V ++ NE L Sbjct: 335 GGLPSNAYSAIKNLGGLETEDDYS-YQGHMQSCNFSAEKAKVYINDSVELS-QNEQKLAA 392 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVY--FEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360 L K GPISVAI+A FY +G+ P C + +DHAVL VGYG + +W Sbjct: 393 WLAKRGPISVAINAFG--MQFYRHGISRPLRPLCSPWL--IDHAVLLVGYGNRSDVPFWA 448 Query: 361 VKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 +KNSW WG GY + CGV + + ++ Sbjct: 449 IKNSWGTDWGEKGYYYLHRGSGACGVNTMASSAVV 483 >UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - 
Caenorhabditis elegans Length = 343 Score = 82.2 bits (194), Expect = 6e-15 Identities = 45/146 (30%), Positives = 75/146 (51%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GGGE A ++ HG+ T +Y Y C + V + +I+ W+ + +E A + Sbjct: 201 GGGEPVEALKYAQSHGITTAHNYPYYFWTTK-CR-ETVPTVARISSWMKAESEDEMAQIV 258 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 AL +GP+ V + A FY +G+ +P C E HA++ +GYG YW++K Sbjct: 259 AL--NGPMIVCANFATNKNRFYHSGIAEDPDCGT---EPTHALIVIGYGP----DYWILK 309 Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQS 444 N++S +WG GY+ + N CG+ + Sbjct: 310 NTYSKVWGEKGYMRVKRDVNWCGINT 335 >UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=15; Magnoliophyta|Rep: Cysteine proteinase RD19a precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 368 Score = 82.2 bits (194), Expect = 6e-15 Identities = 55/165 (33%), Positives = 83/165 (50%), Gaps = 9/165 (5%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGY-CHIDNVTAITKITGWVNVTTNNENALK 183 GG A+++ +K G L EEDY Y G+DG C +D + ++ + +V + +E + Sbjct: 208 GGLMNSAFEYTLKTGGLMKEEDYP-YTGKDGKTCKLDKSKIVASVSNF-SVISIDEEQIA 265 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-------LN 342 L K+GP++VAI+A + Y GV C + L+H VL VGYG Sbjct: 266 ANLVKNGPLAVAINAGY--MQTYIGGVSCPYICTRR---LNHGVLLVGYGAAGYAPARFK 320 Query: 343 GHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI*IST 477 YW++KNSW WG +G+ + N CGV S + V +ST Sbjct: 321 EKPYWIIKNSWGETWGENGFYKICKGRNICGVDSMVSTVAATVST 365 >UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba histolytica|Rep: Cysteine protease 17 - Entamoeba histolytica Length = 420 Score = 81.8 bits (193), Expect = 8e-15 Identities = 38/101 (37%), Positives = 56/101 (55%), Gaps = 2/101 (1%) Frame = +1 Query: 142 GWVNVTTNNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLA 321 G+ V NE AL A+ K G + + +D K F Y G+Y+ +C + L HA+ Sbjct: 283 GYALVLRGNERALMSAIHKFGVLGIGLDTRSKLFKHYRGGIYYNEECTRR--GLSHAMNL 340 Query: 322 VGYGVLN-GHKYWLVKNSWSNM-WGNDGYVLMSMRENNCGV 438 
VGYG G KY++++NSW + WG DGY+ + N+CGV Sbjct: 341 VGYGTTKEGQKYYIIRNSWGDWKWGEDGYMRLYRGGNHCGV 381 >UniRef50_Q24FN2 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 367 Score = 81.4 bits (192), Expect = 1e-14 Identities = 55/151 (36%), Positives = 81/151 (53%), Gaps = 4/151 (2%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGW--VNVTTNNENALK 183 GG AY++I G+ ++++Y Y+GQ+ C I++ + + TNN N Sbjct: 224 GGWPSVAYRYIKDQGISSQQNYP-YIGQNRNCSINSASPPKAFYAKDPIYYYTNNGNQTN 282 Query: 184 LALF--KHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357 L + PISV +DA + +S YS GV+ C N ++HAVL VGY +G+ W Sbjct: 283 LVQYAVNQAPISVLVDATN--WSSYSQGVF--NNCGNVT--INHAVLLVGYDT-SGN--W 333 Query: 358 LVKNSWSNMWGNDGYVLMSMRENNCGVQSAP 450 LVKNSW WG GY+ ++ N C VQS+P Sbjct: 334 LVKNSWGTNWGQKGYITLA-PGNTCNVQSSP 363 >UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|Rep: Cathepsin L precursor - Schistosoma mansoni (Blood fluke) Length = 319 Score = 81.4 bits (192), Expect = 1e-14 Identities = 52/153 (33%), Positives = 75/153 (49%), Gaps = 1/153 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG AY+ I+K G ED Y ++ CH+ I VN+T +E L Sbjct: 169 GGLPSNAYESIIKMGGLMLEDNYPYDAKNEKCHLKTDGVAVYINSSVNLT-QDETELAAW 227 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHK-YWLVK 366 L+ + ISV ++A FY +G+ LDHAVL VGYGV ++ +W+VK Sbjct: 228 LYHNSTISVGMNAL--LLQFYQHGISHPWWIFCSKYLLDHAVLLVGYGVSEKNEPFWIVK 285 Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 NSW WG +GY M + +CG+ + T +I Sbjct: 286 NSWGVEWGENGYFRMYRGDGSCGINTVATSAMI 318 >UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba histolytica|Rep: Cysteine protease 19 - Entamoeba histolytica Length = 324 Score = 81.0 bits (191), Expect = 1e-14 Identities = 45/152 (29%), Positives = 79/152 (51%), Gaps = 2/152 (1%) Frame = +1 Query: 10 
GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYC-HIDNVTAITKITGWVNVTTNNENALKL 186 GG A ++ +G+ +E + Y + +C V + K T + T ++ ++ Sbjct: 171 GGSIGGALKYAQDNGMQSESSFP-YKPFEQHCLQNQKVMKVKKYTH--SDTKGDDEKVRS 227 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLV 363 + +GP+ A+DA+ +F Y G+Y + KC++ D+ AV+ VGYG+ N KY++V Sbjct: 228 EILSYGPVGSAMDASRSSFLLYHGGIYNDKKCRS--DKSTIAVVIVGYGIDKNNGKYFIV 285 Query: 364 KNSWSNMWGNDGYVLMSMRENNCGVQSAPTYV 459 +NSW WG GY +S N CG+ + Y+ Sbjct: 286 RNSWGPYWGEQGYFRISSDNNLCGLSNDIYYI 317 >UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin L preproprotein; n=1; Monodelphis domestica|Rep: PREDICTED: similar to cathepsin L preproprotein - Monodelphis domestica Length = 356 Score = 80.6 bits (190), Expect = 2e-14 Identities = 44/128 (34%), Positives = 68/128 (53%), Gaps = 17/128 (13%) Frame = +1 Query: 133 KITGWVNVTTNNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCK-NKVDELDH 309 +I +V + + +E AL A+ GP++VAI A +F +Y G Y EP+C+ + + ++H Sbjct: 230 RIRDYVTLPSGDERALMQAVATVGPVAVAIHAP-PSFRYYQGGPYIEPRCRLSYMSNMNH 288 Query: 310 AVLAVGYGVLNGHKY---------------WLVKNSWSNMWGNDGYVLMSM-RENNCGVQ 441 A+L VGYG L KY W+ KNSW WG+ GY+ + R N CG+ Sbjct: 289 ALLVVGYGPLERSKYEEFGLQAYMHKDNKFWIAKNSWGEQWGDRGYIYIPKDRYNQCGIA 348 Query: 442 SAPTYVLI 465 S Y ++ Sbjct: 349 SNANYPIL 356 >UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gambiae str. PEST|Rep: ENSANGP00000018713 - Anopheles gambiae str. 
PEST Length = 559 Score = 80.6 bits (190), Expect = 2e-14 Identities = 50/162 (30%), Positives = 78/162 (48%), Gaps = 9/162 (5%) Frame = +1 Query: 7 GGGEDFRAYQWIMK-HGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALK 183 GGG A++ I + GL E DY CH + + ++ G V++ NE + Sbjct: 402 GGGYMDDAFKAIEQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAVDMP-KNETYIA 460 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVY--FEPKCKNKVDELDHAVLAVGYGVLNGHK-- 351 L K+GPI++ ++A FY G+ + P C +K +DH VL VGYG+ Sbjct: 461 KYLIKNGPIAIGLNA--NAMQFYRGGISHPWHPLCNHK--SIDHGVLIVGYGIKEYPMFN 516 Query: 352 ----YWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 YW++KNSW WG GY + +N+CGV + ++ Sbjct: 517 KTLPYWIIKNSWGPRWGEQGYYRIYRGDNSCGVSEMASSAIL 558 >UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 493 Score = 80.6 bits (190), Expect = 2e-14 Identities = 43/128 (33%), Positives = 66/128 (51%), Gaps = 3/128 (2%) Frame = +1 Query: 64 EEDYGGYLGQDG-YCHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAIDAAHKT 240 E DY YLG +C + + +TG + + ++ A++ GP+ +AI+ + Sbjct: 355 ESDYP-YLGASSQFCDNNKDDYLGTVTGCYKIEQRTRSVME-AIYTFGPLGIAINVI-EP 411 Query: 241 FSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYVLMSMR 420 Y+NGV + C +L HAVL G+ ++G W VKNSWS WG DGY+ + M Sbjct: 412 MMLYTNGVIDDETCTGAQSDLVHAVLLTGWAEIDGKLAWEVKNSWSTYWGWDGYIYIQME 471 Query: 421 E--NNCGV 438 + NCGV Sbjct: 472 DQTKNCGV 479 >UniRef50_Q248G1 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 334 Score = 80.2 bits (189), Expect = 3e-14 Identities = 53/146 (36%), Positives = 84/146 (57%), Gaps = 3/146 (2%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKI--TGW-VNVTTNNENAL 180 GG A Q+++++G+ E Y Y+ G C D + K GW VN+ +E AL Sbjct: 191 GGWPEEALQYVIEYGIVKSEVYP-YVAVQGKCR-DIPYDVPKYYPEGWYVNLDQTSE-AL 247 Query: 181 
KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360 K A+ K P+SV +DA+ T+ FY +G+ F D+L+HA++AVGY +G+ W+ Sbjct: 248 KAAIAK-APVSVCVDAS--TWKFYKSGI-FSGCGPTTEDDLNHAIVAVGYDA-DGN--WI 300 Query: 361 VKNSWSNMWGNDGYVLMSMRENNCGV 438 ++NSW+ WG +GY+ ++ N CGV Sbjct: 301 IRNSWATKWGENGYIRLA-AGNTCGV 325 >UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; Leishmania|Rep: Cysteine proteinase 2 precursor - Leishmania pifanoi Length = 444 Score = 79.4 bits (187), Expect = 4e-14 Identities = 53/155 (34%), Positives = 83/155 (53%), Gaps = 8/155 (5%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG---LPTEEDYGGYLGQDGY---CHIDNVTAIT--KITGWVNVTTN 165 GG +A+ W++++ L TE+ Y Y+ +GY C + + +I G V + ++ Sbjct: 190 GGLMLQAFDWLLQNTNGHLHTEDSYP-YVSGNGYVPECSNSSEELVVGAQIDGHVLIGSS 248 Query: 166 NENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNG 345 E A+ L K+GPI++A+DA+ +F Y +GV C K +L+H VL VGY + Sbjct: 249 -EKAMAAWLAKNGPIAIALDAS--SFMSYKSGVL--TACIGK--QLNHGVLLVGYDMTGE 301 Query: 346 HKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAP 450 YW++KNSW WG GYV + M N C + P Sbjct: 302 VPYWVIKNSWGGDWGEQGYVRVVMGVNACLLSEYP 336 >UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Bigelowiella natans|Rep: Digestive cysteine proteinase - Bigelowiella natans (Pedinomonas minutissima) (Chlorarachnion sp.(strain CCMP 621)) Length = 360 Score = 79.0 bits (186), Expect = 6e-14 Identities = 40/108 (37%), Positives = 58/108 (53%), Gaps = 2/108 (1%) Frame = +1 Query: 127 ITKITGWVNVTTNNENALKLALFKHGPISVAIDAAH--KTFSFYSNGVYFEPKCKNKVDE 300 + IT W V ++ + KH P+SV+IDA FY +GV P+ +K Sbjct: 248 VASITDWEQVPSDEDKIASYLALKH-PLSVSIDAGEGLSWMQFYKHGVA-NPRFCSKTS- 304 Query: 301 LDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQS 444 L+HAVL VG+GV G +W+VKNSW WG +GY + + CG+ + Sbjct: 305 LNHAVLLVGFGVDGGKAFWIVKNSWGEKWGENGYFRLIRGKGACGINT 352 >UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa (japonica cultivar-group)|Rep: Os09g0562700 protein - Oryza sativa subsp. 
japonica (Rice) Length = 235 Score = 79.0 bits (186), Expect = 6e-14 Identities = 54/168 (32%), Positives = 76/168 (45%), Gaps = 16/168 (9%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTA-ITKITGWVNVTTNNENALK 183 GG +RA +WI +G + T +DY C + I G V T +E +L Sbjct: 73 GGVSYRALEWITANGGITTRDDYPYTAAASAACDRAKLGHHAATIAGLRRVATRSEASLA 132 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYG---------V 336 A P++V+I+A F Y GVY P C + L+H V VGYG Sbjct: 133 NAAAAQ-PVAVSIEAGGDNFQHYRKGVYDGP-CGTR---LNHGVTVVGYGQEEAAADGGA 187 Query: 337 LNGHKYWLVKNSWSNMWGNDGYVLM-----SMRENNCGVQSAPTYVLI 465 G KYW++KNSW WG+ GY+ M E CG+ P++ L+ Sbjct: 188 AGGDKYWIIKNSWGKNWGDQGYIKMKKDVAGKPEGLCGIAIRPSFPLM 235 >UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep: Cysteine proteinase - Cryptobia salmositica Length = 443 Score = 79.0 bits (186), Expect = 6e-14 Identities = 35/95 (36%), Positives = 56/95 (58%) Frame = +1 Query: 169 ENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGH 348 E + +FKHGP+S+ +DA+ T+ Y+ G+ C D++DH VL VG+ Sbjct: 237 EEDMAAFVFKHGPLSIGVDAS--TWQSYAGGIM--SYCPQ--DQIDHGVLIVGFDDTAST 290 Query: 349 KYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPT 453 YW++KNSW+ WG +GY+ ++ N CG+ S P+ Sbjct: 291 PYWIIKNSWTANWGEEGYIRVAKGSNQCGLTSHPS 325 >UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n=1; Toxocara canis|Rep: Cathepsin L-like cysteine proteinase - Toxocara canis (Canine roundworm) Length = 360 Score = 79.0 bits (186), Expect = 6e-14 Identities = 50/156 (32%), Positives = 77/156 (49%), Gaps = 5/156 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG+ +A +++ GL E DY + C + T K +++ +E ++ Sbjct: 209 GGDVDKALRYVYDEGLMREYDYPYVAHRQDTCQLRGETTRIKAAVFLH---QDEASIIDW 265 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPK--CKNKVDELDHAVLAVGYGVLNG--HKYW 357 L +GP++V I+ Y GVY K C+NK+ H++ VGYG N KYW Sbjct: 266 LLHYGPVNVGINVT-ADMKAYKGGVYTPDKWECENKIIGT-HSINIVGYGTWNATNQKYW 323 Query: 358 LVKNSWSNMWG-NDGYVLMSMRENNCGVQSAPTYVL 462 +VKNSW +G DGYV + 
N+CG++ P VL Sbjct: 324 IVKNSWGQSYGIEDGYVYFARGINSCGIEDEPVGVL 359 >UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicalis|Rep: Blo t 1 allergen - Blomia tropicalis (Mite) Length = 333 Score = 79.0 bits (186), Expect = 6e-14 Identities = 41/154 (26%), Positives = 78/154 (50%), Gaps = 1/154 (0%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVT-TNNENALK 183 G G A++++++ GL EE+Y Y + +C+ D ++G+ + +++ + Sbjct: 183 GSGYSTEAFKYMIRTGLVEEENYP-YNMRTQWCNPDVEGQRYHVSGYQQLRYQSSDEDVM 241 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363 + +HGP+ + + ++ F NGV + DHAV+ VG+G + G YW++ Sbjct: 242 YTIQQHGPVVIYMHGSNNYFRNLGNGVLRGVAYNDAYT--DHAVILVGWGTVQGVDYWII 299 Query: 364 KNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 +NSW WGN GY + N+ G+ + TY + Sbjct: 300 RNSWGTGWGNGGYGYVERGHNSLGINNFVTYATL 333 >UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MGC107932 protein - Xenopus tropicalis (Western clawed frog) (Silurana tropicalis) Length = 333 Score = 78.6 bits (185), Expect = 8e-14 Identities = 47/150 (31%), Positives = 72/150 (48%), Gaps = 7/150 (4%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG +A +++ +HG+ ++Y Y + C D+ AI + EN + + Sbjct: 180 GGFPIKALEYVAQHGVMRNKEYE-YSQKKATCEYDSDKAIHMNVSKFYILPGEEN-MATS 237 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHK------ 351 + GPI+V I + F YS G+ FE C + +HAV+ VGYG + + Sbjct: 238 VAIEGPITVGIGVS-SDFQLYSEGI-FEGDC---AESPNHAVIIVGYGTEHANDKEEEDK 292 Query: 352 -YWLVKNSWSNMWGNDGYVLMSMRENNCGV 438 YW++KNSW WG DGYV M N C + Sbjct: 293 DYWIIKNSWGKEWGEDGYVKMKRNINQCSI 322 >UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_21, whole genome shotgun sequence - Paramecium tetraurelia Length = 349 Score = 78.2 bits (184), Expect = 1e-13 Identities = 46/154 (29%), Positives = 69/154 (44%), Gaps = 3/154 (1%) Frame = +1 Query: 10 
GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GGE + +Q+ K+G+ +Y Y G D C + G+V+V + A A Sbjct: 193 GGEMYDGFQYASKYGIAIRSEYP-YAGVDQKCAAKQTKTRYQFAGYVDVEPLSAQAYVEA 251 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 +H +S+ I+A+ F Y G+Y KC L+H V VGY Y+L+KN Sbjct: 252 ASEHA-LSIGINASGINFQLYKKGIY-SAKCDGSKPALNHGVTNVGYAP----DYYLIKN 305 Query: 370 SWSNMWGNDGYVLMSM---RENNCGVQSAPTYVL 462 SW WG GY+ + + CG Q + L Sbjct: 306 SWGQSWGESGYIRFARIADKAGQCGAQQEVNFPL 339 >UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|Rep: Cruzipain precursor - Trypanosoma cruzi Length = 467 Score = 78.2 bits (184), Expect = 1e-13 Identities = 49/157 (31%), Positives = 80/157 (50%), Gaps = 5/157 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMK--HGLPTEEDYGGYLGQDGY---CHIDNVTAITKITGWVNVTTNNEN 174 GG A++WI++ +G ED Y +G C T ITG V + +E Sbjct: 187 GGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELP-QDEA 245 Query: 175 ALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKY 354 + L +GP++VA+DA+ ++ Y+ GV C + ++LDH VL VGY Y Sbjct: 246 QIAAWLAVNGPVAVAVDAS--SWMTYTGGVM--TSCVS--EQLDHGVLLVGYNDSAAVPY 299 Query: 355 WLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 W++KNSW+ WG +GY+ ++ N C V+ + ++ Sbjct: 300 WIIKNSWTTQWGEEGYIRIAKGSNQCLVKEEASSAVV 336 >UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 precursor; n=2; Arabidopsis thaliana|Rep: Probable cysteine proteinase At3g43960 precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 376 Score = 78.2 bits (184), Expect = 1e-13 Identities = 56/157 (35%), Positives = 78/157 (49%), Gaps = 8/157 (5%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQD-GYCHIDNV--TAITKITGWVNVTTNNENAL 180 GG A+++I ++G ++ GY G+D C + T + I G V N+E +L Sbjct: 194 GGGAVWAFEFIKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSL 253 Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGH-KYW 357 K A+ + PISV I AA+ S Y +GVY + C N DH VL VGYG + YW Sbjct: 254 KKAV-AYQPISVMISAAN--MSDYKSGVY-KGACSNLWG--DHNVLIVGYGTSSDEGDYW 307 Query: 
358 LVKNSWSNMWGNDGYVLMSMR----ENNCGVQSAPTY 456 L++NSW WG GY+ + C V AP Y Sbjct: 308 LIRNSWGPEWGEGGYLRLQRNFHEPTGKCAVAVAPVY 344 >UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 360 Score = 77.4 bits (182), Expect = 2e-13 Identities = 47/144 (32%), Positives = 82/144 (56%), Gaps = 4/144 (2%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG+ A++ I +G + TE +Y Y+ + C D +I G+++V ++ ++ +K Sbjct: 196 GGDPEPAFRCIQNNGGIMTETEYP-YIAKQQSCKFDEDKPTFQIGGYIDVPSD-QSQVKA 253 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKN-KVDELDHAVLAVGYGVLNGHK--YW 357 AL P+S+ ++++ +F +Y +GV E C++ D DH +L VGYG K YW Sbjct: 254 ALLIQ-PLSICLNSSDTSFKYYKSGVITE--CEDGPYDGPDHCLLLVGYGHDEELKVDYW 310 Query: 358 LVKNSWSNMWGNDGYVLMSMRENN 429 L+KN W WG +GYV + +R++N Sbjct: 311 LIKNQWGTTWGEEGYVRI-IRDDN 333 >UniRef50_Q23RT7 Cluster: Papain family cysteine protease containing protein; n=7; Hymenostomatida|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 387 Score = 77.4 bits (182), Expect = 2e-13 Identities = 44/131 (33%), Positives = 71/131 (54%), Gaps = 5/131 (3%) Frame = +1 Query: 28 AYQWIMKHGLPTEE--DYGGYLGQDGYCHIDNVTAITKIT--GWVNVTTNNENALKLALF 195 AY ++ GL +E Y Y GQ G C D ++T G++ V N+ +L A+ Sbjct: 209 AYNYVQLFGLTSEYKYSYSSYQGQTGNCTFDPTQQPIEVTIDGYLKVPENDYASLMNAVA 268 Query: 196 KHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGH-KYWLVKNS 372 GP+ +++DA++ F Y +GV+ + VD ++HAV+ VGYG YW+V+NS Sbjct: 269 TQGPLVISVDASN--FHDYESGVFHGCDGADNVD-INHAVVLVGYGTDEKEGDYWIVRNS 325 Query: 373 WSNMWGNDGYV 405 W +G +GY+ Sbjct: 326 WGTRFGENGYI 336 >UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma|Rep: Cathepsin C precursor - Schistosoma mansoni (Blood fluke) Length = 454 Score = 77.4 bits (182), Expect = 2e-13 Identities = 52/152 (34%), Positives = 71/152 (46%), Gaps = 8/152 (5%) 
Frame = +1 Query: 13 GEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLAL 192 GEDF Q I+ +P + G C T + I G+ T NE ++L L Sbjct: 300 GEDFGLPQKIV---IPYTGEDTGKCTVSKNCTRYYTTDYSYIGGYYGAT--NEKLMQLEL 354 Query: 193 FKHGPISVAIDAAHKTFSFYSNGVYFEPKCK------NKVDELDHAVLAVGYGV--LNGH 348 +GP V + ++ F FY G+Y + N + +HAVL VGYGV L+G Sbjct: 355 ISNGPFPVGFEV-YEDFQFYKEGIYHHTTVQTDHYNFNPFELTNHAVLLVGYGVDKLSGE 413 Query: 349 KYWLVKNSWSNMWGNDGYVLMSMRENNCGVQS 444 YW VKNSW WG GY + + CGV+S Sbjct: 414 PYWKVKNSWGVEWGEQGYFRILRGTDECGVES 445 >UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 350 Score = 77.0 bits (181), Expect = 2e-13 Identities = 48/144 (33%), Positives = 79/144 (54%), Gaps = 1/144 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG Q+ GL + DY Y+ G C N + + +V ++ +AL+ A Sbjct: 209 GGFPSEGLQYASTVGL-VQSDYYPYVAVQGTCRQVNAPRYQLLDQYYSVQQSS-SALQYA 266 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKC-KNKVDELDHAVLAVGYGVLNGHKYWLVK 366 + + P +V +DA+ T+ FY++GVY C K + ++L+HAV+AVGY + + W+++ Sbjct: 267 ITR-APTAVGVDAS--TWQFYNSGVY--NGCGKTQRNQLNHAVIAVGY---DAYGNWIIR 318 Query: 367 NSWSNMWGNDGYVLMSMRENNCGV 438 NSW WG GY+ ++ R N CGV Sbjct: 319 NSWGTSWGQSGYITLA-RGNTCGV 341 >UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 664 Score = 77.0 bits (181), Expect = 2e-13 Identities = 41/119 (34%), Positives = 65/119 (54%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG Y +I ++G +E Y G+ G C ++ A ++I+ +V + ++E L Sbjct: 538 GGWMHNCYSYIQENGGINQESTYPYEGKFGQCRYNSGDAQSRISKFVMIKQHDEEDLADT 597 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 + GP+SVA DA+ + F +YS G+Y+ C NK HAV+ VGY NG YW++K Sbjct: 598 
VASVGPVSVAYDASTREFMYYSRGIYYSDNC-NKY-RTTHAVVVVGYDNENGVDYWIIK 654 >UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; Dictyostelium discoideum|Rep: Cysteine proteinase 7 precursor - Dictyostelium discoideum (Slime mold) Length = 460 Score = 77.0 bits (181), Expect = 2e-13 Identities = 44/114 (38%), Positives = 69/114 (60%), Gaps = 2/114 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGY-CHIDNVTAITKITGWVNVTTNNENALK 183 GG A+++I+ + G+ TE Y Y +DG C + +++ +VNVT+ +E+ L Sbjct: 178 GGLMTLAFEYIINNKGIDTESSYP-YTAEDGKKCKFNPKNVAAQLSSYVNVTSGSESDLA 236 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNG 345 A GP SVAIDA++++F Y +G+Y EP C + +LDH VLAVG+G +G Sbjct: 237 -AKVTQGPTSVAIDASNQSFQLYVSGIYNEPACSS--TQLDHGVLAVGFGTGSG 287 Score = 49.6 bits (113), Expect = 4e-05 Identities = 22/40 (55%), Positives = 26/40 (65%), Gaps = 4/40 (10%) Frame = +1 Query: 352 YWLVKNSWSNMWGNDGYVLMSMRENN-CGV---QSAPTYV 459 YW+VKNSW WG DGY+LM+ NN CG+ S PT V Sbjct: 418 YWIVKNSWGTSWGMDGYILMTKGNNNQCGIATMASRPTAV 457 >UniRef50_Q23FS6 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 330 Score = 76.6 bits (180), Expect = 3e-13 Identities = 47/151 (31%), Positives = 80/151 (52%), Gaps = 1/151 (0%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAIT-KITGWVNVTTNNENALK 183 GGG A+++I + L T +Y Y+ D C+ + + ++ + +V + N LK Sbjct: 182 GGGWMDNAFEYIEESPLTTNSNYP-YVAVDQACNSTEIYGVLYSLSNYTDVESGNTVQLK 240 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363 L + P+S+A+DA++ + Y++G++ N L+H VL VG+ G WLV Sbjct: 241 QYL-QQQPLSIAVDASY--WYLYNSGIF-----SNCGQNLNHGVLLVGFNSTEGS--WLV 290 Query: 364 KNSWSNMWGNDGYVLMSMRENNCGVQSAPTY 456 KNSW WG GY+ ++ N CG+ +A +Y Sbjct: 291 KNSWGTSWGEQGYIRLA-DGNTCGLANAASY 320 >UniRef50_Q23FR0 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family 
cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 394 Score = 76.6 bits (180), Expect = 3e-13 Identities = 47/143 (32%), Positives = 77/143 (53%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG ++ +K G+ TE+ Y Y G C I N T + LK + Sbjct: 252 GGFQSDGVEYAIKFGIVTEDKYP-YTAVGGDCQISNPTTDGFYPKTYRKLQQTVDDLKAS 310 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 L P++V++DA++ ++ Y +G+ F+ + D+L+HAV+AVGY +G+ W+++N Sbjct: 311 L-NFSPVTVSVDASN--WNSYESGI-FDNCGETTQDQLNHAVIAVGYDT-DGN--WIIRN 363 Query: 370 SWSNMWGNDGYVLMSMRENNCGV 438 SWS WG DGY+ ++ N CGV Sbjct: 364 SWSTSWGEDGYIRLA-AGNTCGV 385 >UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Liliopsida|Rep: Putative cysteine proteinase - Oryza sativa subsp. japonica (Rice) Length = 416 Score = 76.2 bits (179), Expect = 4e-13 Identities = 54/143 (37%), Positives = 73/143 (51%), Gaps = 9/143 (6%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEED-----YGGYLGQDGYCH-IDNVTAITKITGWVNVTTNNE 171 GG+ A Q+I+K+G+ ++ Y GY + C + I K+ V N E Sbjct: 178 GGDPRAALQYIVKNGVTLDQCGKLPYYPGYEAKKLACRTVAGKPPIVKVDA-VKPVANTE 236 Query: 172 NALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL---N 342 AL L +F+ PISV IDA+ Y GV F +CK L+H V+ VGYGV + Sbjct: 237 AALLLKVFQQ-PISVGIDAS-ADLQHYKKGV-FTGRCKTA--PLNHGVVVVGYGVNTTPD 291 Query: 343 GHKYWLVKNSWSNMWGNDGYVLM 411 KYW+VKNSW WG GY+ M Sbjct: 292 KTKYWIVKNSWGKGWGEGGYIRM 314 Score = 51.6 bits (118), Expect = 1e-05 Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 5/71 (7%) Frame = +1 Query: 259 GVYFEPKCKNKVDELDHAVLAVGYGVLNGH-KYWLVKNSWSNMWGNDGYVLM----SMRE 423 GVY P C V+ HAV VGYGV + YW+ +NSW WG GY+ M + +E Sbjct: 332 GVYNGP-CGTSVN---HAVTTVGYGVTQDNINYWIARNSWGPRWGESGYIRMKRDIAAKE 387 Query: 424 NNCGVQSAPTY 456 CG+ Y Sbjct: 388 GLCGISMYGVY 398 >UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovirus|Rep: Viral cathepsin - Spodoptera litura multicapsid nucleopolyhedrovirus (SpltMNPV) Length = 337 Score = 76.2 
bits (179), Expect = 4e-13 Identities = 49/144 (34%), Positives = 72/144 (50%), Gaps = 1/144 (0%) Frame = +1 Query: 10 GGEDFRAYQWIMK-HGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG A+Q I++ G+ E DY Y G + C + +++ +E L Sbjct: 190 GGLMHLAFQEIIRIGGVEHEIDYP-YQGIEYACRLAPSKLAVRLSHCYQYDLRDERKLLE 248 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 L+K+GPI+VAID Y +G+ C + + L+HAVL VGYG+ N YW+ K Sbjct: 249 LLYKNGPIAVAIDCVD--IIDYRSGI--ATVCND--NGLNHAVLLVGYGIENDTPYWIFK 302 Query: 367 NSWSNMWGNDGYVLMSMRENNCGV 438 NSW + WG +GY N CG+ Sbjct: 303 NSWGSNWGENGYFRARRNINACGM 326 >UniRef50_Q239L8 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 323 Score = 75.4 bits (177), Expect = 7e-13 Identities = 49/143 (34%), Positives = 73/143 (51%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG A+ +I +HG+PTE Y Y DG C + + KI+ ++ N+ K+ Sbjct: 188 GGLMDTAFDFISQHGIPTEAAYP-YKAVDGTCKM--TSGPYKISSHTDIQDCNDLLNKI- 243 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 + PI++A+DA F +Y ++ + C ELDH VL VGY KYW VKN Sbjct: 244 --QKQPIAIAVDA--NNFQYYQKDIFSD--CGT---ELDHGVLLVGYSASG--KYWKVKN 292 Query: 370 SWSNMWGNDGYVLMSMRENNCGV 438 SW WG G++ ++ N CG+ Sbjct: 293 SWGPNWGESGFIRLA-AGNTCGL 314 >UniRef50_UPI0000498E2F Cluster: cysteine proteinase; n=1; Entamoeba histolytica HM-1:IMSS|Rep: cysteine proteinase - Entamoeba histolytica HM-1:IMSS Length = 317 Score = 74.9 bits (176), Expect = 1e-12 Identities = 43/144 (29%), Positives = 72/144 (50%), Gaps = 3/144 (2%) Frame = +1 Query: 37 WIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISV 216 ++ + G EEDY + G C ++ K+ ++ N++ L + + K+ PI V Sbjct: 173 YLQRFGFMKEEDYPE-TSEKGICQYNSTRIFGKVNKRRYLSVFNDDEL-IEVIKNTPIIV 230 Query: 217 AIDAAHKTFSFYSNGVYFE--PKCKNKVDELDHAVLAVGYG-VLNGHKYWLVKNSWSNMW 387 ID T +Y FE +C + +L +GYG +NG YW++KN W + W 
Sbjct: 231 NIDMP-PTMPYYDGEGIFENIEECSQSSPRI--GLLLIGYGKTINGIPYWILKNCWGSSW 287 Query: 388 GNDGYVLMSMRENNCGVQSAPTYV 459 G++GY+ + +N CG+ S TYV Sbjct: 288 GSNGYLYLKRNKNVCGIYSYGTYV 311 >UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis|Rep: Cathepsin L - Culicoides sonorensis Length = 331 Score = 74.9 bits (176), Expect = 1e-12 Identities = 47/149 (31%), Positives = 70/149 (46%), Gaps = 2/149 (1%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG D A Q++ +GL E+DY Y G+D CH N V T +E + K Sbjct: 180 GGWSDL-ALQYMRDNGLSFEKDYP-YKGKDEKCHASNENKSPVKVVNVCSTPKDEVSYKD 237 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 +++GP+ V F Y G++ C + ++HAV+ +GYG KYWLV+ Sbjct: 238 HFYQYGPL-VVYYFVDNNFKQYKGGIFSSKTCNVENAGINHAVVLMGYGSEKDVKYWLVR 296 Query: 367 NSWSNMWGNDGY--VLMSMRENNCGVQSA 447 NSW +G G+ +L N G +A Sbjct: 297 NSWGKSFGESGHFRILRDAHMCNLGYHNA 325 >UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia ATCC 50803|Rep: GLP_567_6496_7413 - Giardia lamblia ATCC 50803 Length = 305 Score = 74.5 bits (175), Expect = 1e-12 Identities = 33/93 (35%), Positives = 51/93 (54%) Frame = +1 Query: 163 NNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN 342 +N N + ++L GP+ H+ F +Y G+Y + + HAVL VGYG +N Sbjct: 208 SNYNEIMVSLLADGPVQTGF-YVHEDFLYYVGGIYHKVYGTSLGG---HAVLIVGYGSMN 263 Query: 343 GHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQ 441 H YW+V+NSW + WG +GY + N CG++ Sbjct: 264 NHDYWIVRNSWGSDWGENGYFRILRGTNECGIE 296 >UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3.4.14.1) (Dipeptidyl-peptidase I) (DPP-I) (DPPI) (Cathepsin C) (Cathepsin J) (Dipeptidyl transferase) [Contains: Dipeptidyl-peptidase 1 exclusion domain chain (Dipeptidyl- peptidase I exclusion domain chain); Dipeptidyl-peptidase 1 heavy chain (Dipeptidyl-peptidase I heavy chain); Dipeptidyl-peptidase 1 light chain (Dipeptidyl-peptidase I light chain)]; n=50; Coelomata|Rep: Dipeptidyl-peptidase 1 precursor (EC 3.4.14.1) (Dipeptidyl-peptidase I) 
(DPP-I) (DPPI) (Cathepsin C) (Cathepsin J) (Dipeptidyl transferase) [Contains: Dipeptidyl-peptidase 1 exclusion domain chain (Dipeptidyl- peptidase I exclusion domain chain); Dipeptidyl-peptidase 1 heavy chain (Dipeptidyl-peptidase I heavy chain); Dipeptidyl-peptidase 1 light chain (Dipeptidyl-peptidase I light chain)] - Homo sapiens (Human) Length = 463 Score = 74.5 bits (175), Expect = 1e-12 Identities = 37/98 (37%), Positives = 53/98 (54%), Gaps = 5/98 (5%) Frame = +1 Query: 166 NENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCK---NKVDELDHAVLAVGYGV 336 NE +KL L HGP++VA + + F Y G+Y + N + +HAVL VGYG Sbjct: 356 NEALMKLELVHHGPMAVAFEV-YDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGT 414 Query: 337 --LNGHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQS 444 +G YW+VKNSW WG +GY + + C ++S Sbjct: 415 DSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIES 452 >UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep: Aca s 1 allergen - Acarus siro (Dust mite) Length = 331 Score = 74.1 bits (174), Expect = 2e-12 Identities = 33/90 (36%), Positives = 48/90 (53%) Frame = +1 Query: 169 ENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGH 348 + ++ L HGP++V IDA H F Y +GV + E++H + VG+G NG Sbjct: 234 DESIMTVLKTHGPVAVDIDADHNGFKHYKSGVI--RLTRGGTTEVNHVINIVGWGRENGL 291 Query: 349 KYWLVKNSWSNMWGNDGYVLMSMRENNCGV 438 YWL++NSW WG GY + NN G+ Sbjct: 292 DYWLIRNSWGTHWGEAGYGKVERHHNNMGI 321 >UniRef50_Q23H10 Cluster: Papain family cysteine protease containing protein; n=14; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 336 Score = 73.7 bits (173), Expect = 2e-12 Identities = 47/145 (32%), Positives = 72/145 (49%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG + + K G+ T + Y Y+ C++ K W+ + N N LK A Sbjct: 196 GGWPVQCLDYASKVGITTLDKYP-YVAVQKNCNVTGTDNGFKPKSWIQIP-NTSNDLKSA 253 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 L P+SV +DA+ T+ Y +G++ C L+HAVLAVGY W++KN Sbjct: 254 
L-NFSPVSVLVDAS--TWGNYYSGIF--NGCDQTHISLNHAVLAVGYDQQGN---WIIKN 305 Query: 370 SWSNMWGNDGYVLMSMRENNCGVQS 444 SWS WG +G++ ++ N CG+ S Sbjct: 306 SWSTYWGENGFMRLA-PNNTCGILS 329 >UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleostomi|Rep: Cathepsin O precursor - Homo sapiens (Human) Length = 321 Score = 73.7 bits (173), Expect = 2e-12 Identities = 45/156 (28%), Positives = 75/156 (48%), Gaps = 4/156 (2%) Frame = +1 Query: 10 GGEDFRAYQWI--MKHGLPTEEDYGGYLGQDGYCH-IDNVTAITKITGWVNVT-TNNENA 177 GG A W+ M+ L + +Y + Q+G CH + I G+ ++ E+ Sbjct: 172 GGSTLNALNWLNKMQVKLVKDSEYP-FKAQNGLCHYFSGSHSGFSIKGYSAYDFSDQEDE 230 Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357 + AL GP+ V +DA ++ Y G+ + C + E +HAVL G+ YW Sbjct: 231 MAKALLTFGPLVVIVDAV--SWQDYLGGI-IQHHCSS--GEANHAVLITGFDKTGSTPYW 285 Query: 358 LVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 +V+NSW + WG DGY + M N CG+ + + + + Sbjct: 286 IVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV 321 >UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin O; n=1; Monodelphis domestica|Rep: PREDICTED: similar to cathepsin O - Monodelphis domestica Length = 414 Score = 73.3 bits (172), Expect = 3e-12 Identities = 45/155 (29%), Positives = 69/155 (44%), Gaps = 3/155 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYG-GYLGQDGYCH-IDNVTAITKITGWVNVT-TNNENAL 180 GG A W+ K + +D + Q G CH A I + + + EN + Sbjct: 265 GGSTVNALNWLNKTQVRLVKDSEYSFKAQTGLCHYFSGSHAGVSIKDYSSYDFSGKENEM 324 Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360 L GP++V +DA ++ Y G+ + C + E +HAVL G+ YW+ Sbjct: 325 ANVLLAFGPLAVIVDAV--SWQDYLGGI-IQHHCSS--GEANHAVLITGFDRTGNTPYWI 379 Query: 361 VKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 V+NSW WG DGY + M N CG+ + V + Sbjct: 380 VRNSWGTSWGVDGYAFVKMGANVCGIADLVSAVFV 414 >UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 336 Score = 73.3 bits (172), 
Expect = 3e-12 Identities = 50/149 (33%), Positives = 81/149 (54%), Gaps = 2/149 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAIT-KITGWVNVTTNNENALKL 186 GG A +++ K G+ EE Y YL D C + + T+ K+ + + +ALK Sbjct: 194 GGWPEEALKYVAKFGILKEEQYP-YLAVDSKCKVSSPTSDGFKVQSFYFID-KTADALKN 251 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKN-KVDELDHAVLAVGYGVLNGHKYWLV 363 + + P+SV +DA+ T+ YS+GVY C N + L+HAV+A+GY W++ Sbjct: 252 TVARI-PVSVLVDAS--TWGSYSSGVY--NGCGNTQTYNLNHAVVAIGYDEQGN---WII 303 Query: 364 KNSWSNMWGNDGYVLMSMRENNCGVQSAP 450 +NSWS WG DG++ ++ N CG+ +P Sbjct: 304 RNSWSTSWGMDGHMKLA-PGNTCGILLSP 331 >UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8; Trypanosoma|Rep: Cathepsin B-like cysteine protease - Trypanosoma brucei Length = 340 Score = 73.3 bits (172), Expect = 3e-12 Identities = 40/132 (30%), Positives = 58/132 (43%) Frame = +1 Query: 46 KHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGPISVAID 225 K+G P + + Y D + W + E+ LF GP VA D Sbjct: 199 KNGYPPCSQFNFDTPKCNYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAFD 258 Query: 226 AAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMWGNDGYV 405 ++ F Y++GVY + HAV VG+G NG YW + NSW+ WG DGY Sbjct: 259 V-YEDFIAYNSGVYHHVSGQYLGG---HAVRLVGWGTSNGVPYWKIANSWNTEWGMDGYF 314 Query: 406 LMSMRENNCGVQ 441 L+ + CG++ Sbjct: 315 LIRRGSSECGIE 326 >UniRef50_Q23H06 Cluster: Papain family cysteine protease containing protein; n=18; Tetrahymena thermophila|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 349 Score = 73.3 bits (172), Expect = 3e-12 Identities = 50/140 (35%), Positives = 69/140 (49%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG + + K G+ ++ Y Y G C + K WV + NN +ALK A Sbjct: 210 GGWPVQCIDYASKVGILNQDRYY-YFGVQMQCRVTGTNNGFKPKSWVQIP-NNSDALKTA 267 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 L P+SVA+D + T Y +GV+ C + V L+HAVL VGY W++KN Sbjct: 268 
L-NFSPVSVAVDGTNWTD--YKSGVF--NGCDSHVS-LNHAVLVVGYDEQGN---WIIKN 318 Query: 370 SWSNMWGNDGYVLMSMRENN 429 SWS +WG GY M + NN Sbjct: 319 SWSTLWGEGGY--MRLAPNN 336 >UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 293 Score = 73.3 bits (172), Expect = 3e-12 Identities = 47/148 (31%), Positives = 78/148 (52%), Gaps = 4/148 (2%) Frame = +1 Query: 7 GGGEDFRAYQWIM-KHGL-PTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVT-TNNENA 177 GG D +Y ++ ++G+ ++ DY + G C D+ A +K +V +T T NE Sbjct: 142 GGSSDGASYFVLLNQYGMWMSDSDYP-FKPYVGECKFDSSMAQSK---FVQLTYTKNETD 197 Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357 + + + HG ++ DA+ F +YS+ VY P C + H ++ GYG G YW Sbjct: 198 MAVTVATHGVLACGYDASAADFEWYSSCVYDNPDCDPW--GICHWMMICGYGTDAGKDYW 255 Query: 358 LVKNSWSNMWGNDGYV-LMSMRENNCGV 438 L KNS+ + WG +GY+ L+ ++ CGV Sbjct: 256 LAKNSFGSTWGMEGYIELVRNKDGQCGV 283 >UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sativa|Rep: Cysteine proteinase-like - Oryza sativa subsp. japonica (Rice) Length = 360 Score = 72.9 bits (171), Expect = 4e-12 Identities = 53/160 (33%), Positives = 72/160 (45%), Gaps = 11/160 (6%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITG-----WVNVTTNNE 171 GG+ A ++I G L TE Y Y GQ G C A W + +E Sbjct: 201 GGDVSAALRYIAASGGLQTEAAYA-YGGQQGACRAGGFAAPNSAAAVGGARWARLY-GDE 258 Query: 172 NALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL--NG 345 AL+ AL P+ V ++A+ F Y +GVY + L+HAV VGYG G Sbjct: 259 GALQ-ALAAGQPVVVVVEASEPDFRHYRSGVYAGSAACGR--RLNHAVTVVGYGAAADGG 315 Query: 346 HKYWLVKNSWSNMWGNDGYVLMS---MRENNCGVQSAPTY 456 +YWLVKN W WG GY+ ++ NCG+ + Y Sbjct: 316 GEYWLVKNQWGTWWGEGGYMRVARGGAAGGNCGIATYAFY 355 >UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-ear cress). SAG12 protein; n=2; Dictyostelium discoideum|Rep: Similar to Arabidopsis thaliana (Mouse-ear cress). 
SAG12 protein - Dictyostelium discoideum (Slime mold) Length = 358 Score = 72.9 bits (171), Expect = 4e-12 Identities = 48/158 (30%), Positives = 77/158 (48%), Gaps = 7/158 (4%) Frame = +1 Query: 7 GGGEDFRAYQWIMK-HGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTN-NENAL 180 GGG+ + Y++ + G+ T Y Y DG C N++ + + VT +EN L Sbjct: 208 GGGDPYTVYEYFSQVGGVSTNAQYP-YTATDGTCV--NMSRAVPVVSYHYVTQGGDENTL 264 Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-----LNG 345 + GP+S+ +DA+ T+ YS G+ KN +DH V VG V N Sbjct: 265 IKTIVNDGPVSICVDAS--TWQSYSGGIITTGCGKN----IDHCVQVVGLEVDKTDPSNP 318 Query: 346 HKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYV 459 +Y++++NSW WG DGY+ ++ + CG+ T V Sbjct: 319 VQYYIIRNSWGTDWGIDGYIYVATGSDLCGITYESTMV 356 >UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; Leishmania|Rep: Cysteine proteinase 1 precursor - Leishmania pifanoi Length = 354 Score = 72.9 bits (171), Expect = 4e-12 Identities = 53/154 (34%), Positives = 77/154 (50%), Gaps = 7/154 (4%) Frame = +1 Query: 10 GGEDFRAYQWIMKH---GLPTEEDY----GGYLGQDGYCHIDNVTAITKITGWVNVTTNN 168 GG +A WIM+ + TE Y GG G CH D KITG++++ + Sbjct: 193 GGLMDQAMNWIMQSHNGSVFTEASYPYTSGG--GTRPPCH-DEGEVGAKITGFLSLPHDE 249 Query: 169 ENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGH 348 E + + K GP++VA+DA T+ Y GV C L+H VL VG+ Sbjct: 250 ERIAEW-VEKRGPVAVAVDAT--TWQLYFGGVV--SLCL--AWSLNHGVLIVGFNKNAKP 302 Query: 349 KYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAP 450 YW+VKNSW + WG GY+ ++M N C +++ P Sbjct: 303 PYWIVKNSWGSSWGEKGYIRLAMGSNQCMLKNYP 336 >UniRef50_UPI00006CBB5F Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 1367 Score = 72.5 bits (170), Expect = 5e-12 Identities = 33/89 (37%), Positives = 52/89 (58%), Gaps = 1/89 (1%) Frame = +1 Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYG-VLNGHKY 354 +K ++ GPIS IDA + Y+ G+Y E K K+ +H V VG+G L G +Y Sbjct: 1270 
MKSEIYSRGPISCTIDATDNLENNYTGGIYSE---KVKLPIPNHYVSVVGWGQTLEGEEY 1326 Query: 355 WLVKNSWSNMWGNDGYVLMSMRENNCGVQ 441 W+V+NSW WG +G+ + M ++N G++ Sbjct: 1327 WIVRNSWGTYWGEEGFFKLKMHKDNLGLE 1355 Score = 55.6 bits (128), Expect = 6e-07 Identities = 44/169 (26%), Positives = 77/169 (45%), Gaps = 20/169 (11%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQD---GY----------------CHIDNVTAIT 132 GG AY++I+++ + T+E Y G+D GY C + I Sbjct: 864 GGSPQTAYEYILRNNI-TDETCSPYTGRDFRDGYQCSSLTVCMECWPKVGCKARDDAYIY 922 Query: 133 KITGWVNVTTNNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHA 312 I W V E ++ +F HGPIS I++ + F Y+ G+ P + ++ H+ Sbjct: 923 SIESWDQV--KGEEDMQQEIFNHGPISCVINST-EDFRNYTGGILNPP---DSPVQITHS 976 Query: 313 VLAVGYGVLNGH-KYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTY 456 + VG+G KYW+ +NS WG +G++ + +N ++S +Y Sbjct: 977 LSIVGWGEDEKQTKYWIARNSLGTFWGENGFIRIIRGKNALKIESDCSY 1025 >UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_56, whole genome shotgun sequence - Paramecium tetraurelia Length = 314 Score = 72.5 bits (170), Expect = 5e-12 Identities = 51/144 (35%), Positives = 73/144 (50%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG + QW K+GL T++ Y Q+ C T K +G+ V +N + A Sbjct: 177 GGFENLGIQWAKKNGLTTDKQYPYDGVQNKQCKYS--TGQYKPSGYQVVAADN---MYTA 231 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 L + PI+VA+DA ++ Y +GV+ KC K L+HAVLA G+ W++KN Sbjct: 232 L-SYQPITVAVDA--NSWQNYKSGVF--TKCTYK--SLNHAVLATGF---QEDGVWIIKN 281 Query: 370 SWSNMWGNDGYVLMSMRENNCGVQ 441 SW WG GY+ + N CGVQ Sbjct: 282 SWGTSWGEAGYIRLPATGNPCGVQ 305 >UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypanosoma cruzi|Rep: Cysteine protease, putative - Trypanosoma cruzi Length = 434 Score = 72.1 bits (169), Expect = 7e-12 Identities = 44/148 (29%), Positives = 78/148 (52%), Gaps = 7/148 (4%) Frame = +1 Query: 7 
GGGEDFRAYQWIMKHG---LPTEEDY-GGYLGQDGYCHID-NVTAITKITGWVNVTTNNE 171 GGG A+++IM G L E Y G G C ++ ++ + + G+ ++ N+ Sbjct: 196 GGGTAQLAWEYIMNTGGITLDAEYPYVSGETSVTGRCVLNRSMPRVVNVYGYASLPHNDY 255 Query: 172 NALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN--G 345 A+ AL + GP++V++ A+ + FY+ GV+ + + HAV VGYG N Sbjct: 256 EAVIEALVQKGPLAVSVAASD--WMFYTGGVFDGCGKDGENITISHAVQLVGYGTDNKTN 313 Query: 346 HKYWLVKNSWSNMWGNDGYVLMSMRENN 429 YW+V+NSW WG +G++ + +++N Sbjct: 314 QDYWVVRNSWGEGWGENGFIRLLRKKHN 341 >UniRef50_Q23VA1 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 397 Score = 72.1 bits (169), Expect = 7e-12 Identities = 50/160 (31%), Positives = 82/160 (51%), Gaps = 7/160 (4%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGW-------VNVTTN 165 GGG + A ++ + G+ E Y Y Q+G C+ N T+ ++ + ++ + N Sbjct: 250 GGGWAYNALVYMQRKGIFLESQYP-YKAQNGVCN--NATSASRQKAFFAKDQIIIDTSVN 306 Query: 166 NENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNG 345 N+L+ AL K P+SV +D+ + ++ YS+GV+ C + +DH VL VGY G Sbjct: 307 ITNSLQYALSKQ-PVSVKVDSRY--WNSYSSGVF--SNCLSDGWYVDHVVLLVGY-TKEG 360 Query: 346 HKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 + W+VKNSW WG GY+ ++ N C + P I Sbjct: 361 N--WIVKNSWGTNWGQSGYIYLA-PGNTCNLSVTPVITSI 397 >UniRef50_UPI00006CBCFB Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 393 Score = 71.7 bits (168), Expect = 9e-12 Identities = 49/157 (31%), Positives = 83/157 (52%), Gaps = 5/157 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHID--NVTAITKITGWVNVTTNNENALK 183 GG + Y + G+ EE+Y Q C ++ + + KI+ + +V +N E+ L+ Sbjct: 244 GGIPQKVYSYAAYLGITYEEEYPYIQRQRTGCGVNYNDTSKRVKISTYYDVQSNAES-LE 302 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363 AL K+ P++ AIDA 
K+ Y +G+Y P C ++ +HAV+ VGY +Y+L+ Sbjct: 303 TAL-KYAPVTAAIDA--KSLQMYGSGIYDFP-CSIDRNDANHAVVIVGYT----SEYFLI 354 Query: 364 KNSWSNMWGNDGYVLMSMRENN---CGVQSAPTYVLI 465 +NSW WG +G+ + NN CG+ + +Y I Sbjct: 355 RNSWGPHWGEEGHFKVRKESNNKGTCGLYNDMSYPYI 391 >UniRef50_Q22RR8 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 71.7 bits (168), Expect = 9e-12 Identities = 48/155 (30%), Positives = 79/155 (50%), Gaps = 3/155 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG+ A+++I + + TE++Y Y G D C ++ +V+V + +E +A Sbjct: 191 GGDMDAAFKFIHDNNIATEKEYT-YRGFDQKCKGTQYPTTYGLSSFVDVQSCDE---LVA 246 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 + P+SVA+DA + + +Y G + + C D L+H VL VGY H+ W VKN Sbjct: 247 AIQQQPVSVAVDATN--WQYYEFGTFND--C---FDNLNHGVLLVGYNSKT-HQ-WKVKN 297 Query: 370 SWSNMWGNDGYVLMSMRE---NNCGVQSAPTYVLI 465 SW WG DGY+ + N CG+ +Y ++ Sbjct: 298 SWGTSWGEDGYIRLGASTKYLNTCGICEQASYPIV 332 >UniRef50_Q237A1 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 346 Score = 71.3 bits (167), Expect = 1e-11 Identities = 37/95 (38%), Positives = 54/95 (56%), Gaps = 1/95 (1%) Frame = +1 Query: 166 NENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELD-HAVLAVGYGVLN 342 +E A+ ++K+GPI VA+ ++ F Y GVY DEL HAV VG+GV N Sbjct: 249 DEKAIMAEIYKNGPIEVAL-TVYEDFLTYKTGVYQHVTG----DELGGHAVKMVGWGVEN 303 Query: 343 GHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSA 447 G YW + NSW+ WG+ G + +N CG++S+ Sbjct: 304 GTPYWTIVNSWNESWGDKGTFKILRGKNECGIESS 338 >UniRef50_Q22NW9 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 894 Score = 70.9 bits 
(166), Expect = 2e-11 Identities = 50/151 (33%), Positives = 81/151 (53%), Gaps = 2/151 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGY-CHIDNVTAIT-KITGWVNVTTNNENALK 183 GG A+ +++++G+ E DY Y G + C +N + KI G+ N+ + L+ Sbjct: 749 GGFMENAFDFVIENGILQENDYP-YEGHANFKCKKNNSNQQSYKIQGYYNINKYDCRGLQ 807 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363 A+ + P+SVAID K Y +G+ + C + V+ L+H VL VGY +++V Sbjct: 808 QAVAQQ-PVSVAIDG--KFLQRYHSGIIGD--CGSSVN-LNHGVLIVGYT----EDFFIV 857 Query: 364 KNSWSNMWGNDGYVLMSMRENNCGVQSAPTY 456 KNSW WG DGY ++ + N CG+ A +Y Sbjct: 858 KNSWGTNWGEDGYFRIT-KTNTCGICEAASY 887 >UniRef50_Q5CYC8 Cluster: Cathepsin like thiol protease possibly membrane associated; n=2; Cryptosporidium|Rep: Cathepsin like thiol protease possibly membrane associated - Cryptosporidium parvum Iowa II Length = 673 Score = 70.5 bits (165), Expect = 2e-11 Identities = 33/74 (44%), Positives = 50/74 (67%), Gaps = 1/74 (1%) Frame = +1 Query: 196 KHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL-NGHKYWLVKNS 372 K G IS++I++ FS YS+G+Y PKC EL+HAV+ +GYG+ NG KY++++NS Sbjct: 531 KVGSISLSINSNLPGFSSYSDGIYKAPKCTTH-SELNHAVIMIGYGINDNGDKYYVIQNS 589 Query: 373 WSNMWGNDGYVLMS 414 W WG G++ +S Sbjct: 590 WGVSWGIGGFMNVS 603 >UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain - Tetrahymena pyriformis Length = 330 Score = 70.5 bits (165), Expect = 2e-11 Identities = 48/143 (33%), Positives = 68/143 (47%) Frame = +1 Query: 28 AYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGP 207 A + K G+ E Y Y +DG C K + V + AL+ AL + P Sbjct: 196 AVAYTQKFGIVQESQYA-YTAKDGSCKTALQGTGYKPSAQFQVAATDA-ALQAAL-QVQP 252 Query: 208 ISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMW 387 IS+ +DA+ +S YS G++ C K DHAVL VG LN W V+NSW W Sbjct: 253 ISICVDASK--WSSYSKGIF--SNCSAKPSAADHAVLLVG---LNADNTWKVRNSWGTSW 305 Query: 388 GNDGYVLMSMRENNCGVQSAPTY 456 G GY+ ++ N CG+++ Y Sbjct: 306 GQSGYITLA-AGNTCGLENYAIY 327 >UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|Rep: 
Cysteine protease - Giardia muris Length = 301 Score = 70.5 bits (165), Expect = 2e-11 Identities = 36/86 (41%), Positives = 50/86 (58%), Gaps = 1/86 (1%) Frame = +1 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLV 363 AL GP+ VA + F +YS+GVY + N + E HAV VGYG+ +G KYW++ Sbjct: 210 ALVYDGPLQVAF-VVYSDFGYYSSGVY---QHVNGMMEGGHAVEMVGYGIDESGLKYWII 265 Query: 364 KNSWSNMWGNDGYVLMSMRENNCGVQ 441 +NSW WG GY + R N CG++ Sbjct: 266 RNSWGPDWGEGGYFRIIRRVNECGIE 291 >UniRef50_A3FQ13 Cluster: Cathepsin like thiol protease possibly membrane associated, putative; n=1; Cryptosporidium parvum Iowa II|Rep: Cathepsin like thiol protease possibly membrane associated, putative - Cryptosporidium parvum Iowa II Length = 298 Score = 70.5 bits (165), Expect = 2e-11 Identities = 46/166 (27%), Positives = 82/166 (49%), Gaps = 13/166 (7%) Frame = +1 Query: 1 ARGGGEDFRAYQWIMKHGLPTEEDYGGYL---GQDGYC--HIDNVTAITKI----TGWVN 153 A GG+ F + + +K + T + Y G+ G C + + I TG Sbjct: 107 ACSGGQTFEVFNYAIKSKVCTRDSYPSTTHKTGKLGECKSNCNECVGIKNFKWSYTGSSI 166 Query: 154 VTTNNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKN---KVDELDHAVLAV 324 + + + + A++ +GP++V++ + F+ YS G Y P C + ++DHAV + Sbjct: 167 LYEDPWDVITDAIYNYGPVTVSVCSLMPGFNLYSGGYYEPPTCGSIWCGTRQVDHAVTLI 226 Query: 325 GYGVL-NGHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYV 459 GYGV +G +Y+++KNSW WGN G+ M++ + C P +V Sbjct: 227 GYGVSESGKRYYIMKNSWGLSWGNKGF--MNISADMCSTFFNPGWV 270 >UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 325 Score = 70.1 bits (164), Expect = 3e-11 Identities = 49/146 (33%), Positives = 78/146 (53%) Frame = +1 Query: 28 AYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALFKHGP 207 ++++I+K+ + DY Y +G C + I+ +V+V + + AL AL H P Sbjct: 194 SFKYIIKNKISKAADYP-YTAVEGKCKDTSSFEKYAISSYVDVPSGDCKALLTALQDH-P 251 Query: 208 ISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMW 387 
+SVAIDA K +Y++GVY N D L HAVL VGY +KNSW + Sbjct: 252 VSVAIDA--KNLQYYTSGVY-----SNCSDNLTHAVLLVGYS----SSALKLKNSWGTQF 300 Query: 388 GNDGYVLMSMRENNCGVQSAPTYVLI 465 G +GY +++ N CGV +A ++ ++ Sbjct: 301 GENGYFRLAV-GNTCGVCNAASFPVL 325 >UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; Methanospirillum hungatei JF-1|Rep: Peptidase C1A, papain precursor - Methanospirillum hungatei (strain JF-1 / DSM 864) Length = 1096 Score = 70.1 bits (164), Expect = 3e-11 Identities = 42/128 (32%), Positives = 62/128 (48%), Gaps = 11/128 (8%) Frame = +1 Query: 52 GLPTEEDYGGYLGQDGYCH-IDNVTAITKITG----WVNVTTNNE------NALKLALFK 198 G TE +Y Y G DG C + T + T W V NE +A+K A++ Sbjct: 410 GTVTEANYP-YTGSDGTCKSLSGYTRYSVDTAAGETWGYVGGGNEWSIPSDDAIKTAIYL 468 Query: 199 HGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWS 378 +GP++ + A TF Y +G+ + +HA++ VG+G LNG YW+ KNSW Sbjct: 469 YGPVAAGV-YAESTFDSYRSGIL---DSTSSASYANHAIIIVGWGTLNGRTYWICKNSWG 524 Query: 379 NMWGNDGY 402 WG G+ Sbjct: 525 TSWGESGW 532 >UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Cathepsin B - Fasciola gigantica (Giant liver fluke) Length = 339 Score = 69.7 bits (163), Expect = 4e-11 Identities = 34/93 (36%), Positives = 51/93 (54%) Frame = +1 Query: 166 NENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNG 345 +E+ + + K+GP+ V A + F Y +G+Y K + HAV +G+GV NG Sbjct: 242 HESYIMQEIMKNGPVEVTF-AIFQDFGVYRSGIYHHVAGKF-IGR--HAVRMIGWGVENG 297 Query: 346 HKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQS 444 YWL+ NSW+ WG +GY M N CG++S Sbjct: 298 VNYWLMANSWNEEWGENGYFRMVRGRNECGIES 330 >UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|Rep: Cathepsin B - Streblomastix strix Length = 312 Score = 69.7 bits (163), Expect = 4e-11 Identities = 32/94 (34%), Positives = 57/94 (60%) Frame = +1 Query: 163 NNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN 342 +NE ++ ++++GP++ + A ++ S Y +GVY + L HA+ VG+G+L+ Sbjct: 213 SNEADIQKEIYENGPVTASF-AVYEDLSVYQSGVY--QHVTGGFEGL-HAIKVVGWGILD 268 Query: 343 
GHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQS 444 G KYW + NSW+ WG DG +L+ + CG++S Sbjct: 269 GVKYWTIVNSWAEDWGFDGLLLIRRGVDECGIES 302 >UniRef50_A7APS9 Cluster: Papain family cysteine protease containing protein; n=1; Babesia bovis|Rep: Papain family cysteine protease containing protein - Babesia bovis Length = 435 Score = 69.7 bits (163), Expect = 4e-11 Identities = 45/152 (29%), Positives = 75/152 (49%), Gaps = 4/152 (2%) Frame = +1 Query: 13 GEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLAL 192 G + AY++I HG+ Y Y + G C ++ + ++ N + L L Sbjct: 291 GNSYFAYEYIRDHGVYRLASYP-YTAKSGPC-VEPLNEPRLTISRFGLSENPD--LPQLL 346 Query: 193 FKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNS 372 ++GP++V + A + + FYS+G+ C DE++HAV+ G G + +WL+KNS Sbjct: 347 KQYGPLTVYV-AVNVDWQFYSSGIL--DSC---ADEINHAVVLAGVGQDDDGPFWLIKNS 400 Query: 373 WSNMWGNDGYVLM----SMRENNCGVQSAPTY 456 W WG +GYV + S +N CG+ Y Sbjct: 401 WGTSWGEEGYVRLARGSSAFDNECGLAHMALY 432 >UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - Drosophila melanogaster (Fruit fly) Length = 431 Score = 69.3 bits (162), Expect = 5e-11 Identities = 33/94 (35%), Positives = 46/94 (48%), Gaps = 1/94 (1%) Frame = +1 Query: 163 NNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL- 339 N E + +F GP+ + ++ F YS GVY E K H+V VG+G Sbjct: 320 NREADIMAEIFHSGPVQATM-RVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEH 378 Query: 340 NGHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQ 441 NG KYW+ NSW + WG GY + N CG++ Sbjct: 379 NGEKYWIAANSWGSWWGEHGYFRILRGSNECGIE 412 >UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoidea|Rep: Cathepsin B-S precursor - Tuberaphis styraci Length = 349 Score = 68.9 bits (161), Expect = 6e-11 Identities = 34/98 (34%), Positives = 52/98 (53%), Gaps = 1/98 (1%) Frame = +1 Query: 163 NNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFE-PKCKNKVDELDHAVLAVGYGVL 339 N+ ++ L +GP+ + D + FS Y +G+Y + PK K E H++ +G+G Sbjct: 234 NSIETIEQDLMTYGPVEASFDV-YDDFSVYKSGIYRKTPKAKY---EGGHSIKIIGWGEE 289 Query: 340 NGHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPT 453 NG YWL 
NSWS WG+ G + N CG++ A T Sbjct: 290 NGTPYWLAVNSWSKFWGDHGTFKIIKGRNECGIERAVT 327 >UniRef50_Q26988 Cluster: Cysteine proteinase 4; n=1; Tritrichomonas foetus|Rep: Cysteine proteinase 4 - Tritrichomonas foetus (Trichomonas foetus) Length = 152 Score = 68.9 bits (161), Expect = 6e-11 Identities = 42/115 (36%), Positives = 62/115 (53%), Gaps = 3/115 (2%) Frame = +1 Query: 10 GGEDFRAYQWIMK--HGLPTEEDYGGYLGQD-GYCHIDNVTAITKITGWVNVTTNNENAL 180 GG F A+ +I + +G ED Y G D C D +ITG+++V +E L Sbjct: 39 GGSPFSAFMFISRTQNGQINLEDDYPYTGTDTNDCKFDPSKGYGRITGFMSVQAQSEEDL 98 Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNG 345 + GPI+V IDA+ +F+ YS+G+Y + +C + V LDHAV +GYG G Sbjct: 99 FKCVASVGPIAVCIDASLASFNSYSSGIYNDRQCSSTV--LDHAVGCIGYGAEGG 151 >UniRef50_Q231X3 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 323 Score = 68.9 bits (161), Expect = 6e-11 Identities = 51/153 (33%), Positives = 81/153 (52%), Gaps = 2/153 (1%) Frame = +1 Query: 13 GEDFRAYQWIMKHGLPTEEDYGGYLGQDGY-CHIDNVTAITKIT-GWVNVTTNNENALKL 186 G+ +A +I ++ + TE++Y Y +D C+ DN I T + + + N L Sbjct: 184 GQKEQALVYIKRYSITTEQNYP-YTEKDVQKCYFDNTKHIPNYTISDIKIVKASTNDLVE 242 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 AL K P++V++DA + + +Y GV+ + CK +HAVL VG+ NG WLVK Sbjct: 243 AL-KIQPVAVSVDATN--WKYYKGGVFSD--CKTYYH--NHAVLLVGFQ--NGT--WLVK 291 Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 NS+ WG +GY+ + N CGV + P +I Sbjct: 292 NSYGTNWGENGYIRLK-NGNTCGVANQPYQPII 323 >UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathepsin o - Aedes aegypti (Yellowfever mosquito) Length = 375 Score = 68.9 bits (161), Expect = 6e-11 Identities = 33/99 (33%), Positives = 60/99 (60%) Frame = +1 Query: 163 NNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN 342 + E+ + L HGPI A++AA ++ +Y GV + C+ ++L+HAV VGY + + Sbjct: 277 
DREHLMLRYLATHGPIVAAVNAA--SWKYYLGGV-IQYHCEEAYEDLNHAVEIVGYNLES 333 Query: 343 GHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYV 459 Y+LVKNSW +G+ GY+ + + +N CG+ + +++ Sbjct: 334 QIPYYLVKNSWGPRFGDRGYIKIQVGKNLCGIANRVSFI 372 >UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_36, whole genome shotgun sequence - Paramecium tetraurelia Length = 307 Score = 68.9 bits (161), Expect = 6e-11 Identities = 47/145 (32%), Positives = 75/145 (51%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG A +I + G E DY Y +DG C + +I G N N E+A+K Sbjct: 171 GGNSDLALDYIAEVGSVYERDYE-YTAKDGVCKVKQ--GKVRIAGRENYGPN-EDAIKKG 226 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 + + P+SV++DA + + FY+ GVY + C++ D +HAV+AVG+ W ++N Sbjct: 227 IQNY-PLSVSVDATY--WKFYNQGVY-DGACRD--DFHNHAVVAVGFDYAGN---WKIRN 277 Query: 370 SWSNMWGNDGYVLMSMRENNCGVQS 444 SW WG G++ + N+C V + Sbjct: 278 SWGEGWGEQGHIWLK-PGNSCAVMT 301 >UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Cathepsin B1) (APP secretase) (APPS) [Contains: Cathepsin B light chain; Cathepsin B heavy chain]; n=85; Eukaryota|Rep: Cathepsin B precursor (EC 3.4.22.1) (Cathepsin B1) (APP secretase) (APPS) [Contains: Cathepsin B light chain; Cathepsin B heavy chain] - Homo sapiens (Human) Length = 339 Score = 68.9 bits (161), Expect = 6e-11 Identities = 32/97 (32%), Positives = 58/97 (59%), Gaps = 2/97 (2%) Frame = +1 Query: 160 TNNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDEL--DHAVLAVGYG 333 +N+E + ++K+GP+ A + + F Y +GVY ++ E+ HA+ +G+G Sbjct: 233 SNSEKDIMAEIYKNGPVEGAF-SVYSDFLLYKSGVY-----QHVTGEMMGGHAIRILGWG 286 Query: 334 VLNGHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQS 444 V NG YWLV NSW+ WG++G+ + +++CG++S Sbjct: 287 VENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIES 323 >UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3; Eukaryota|Rep: Cathepsin-like cysteine protease - Phytophthora infestans (Potato late 
blight fungus) Length = 635 Score = 68.1 bits (159), Expect = 1e-10 Identities = 31/95 (32%), Positives = 53/95 (55%) Frame = +1 Query: 157 TTNNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV 336 TT E + ++ GPI+ ++ A F YS G++ + K ++DHA+ VG+G Sbjct: 203 TTLGEQQMMAEIYARGPIACSV-AVTDGFLKYSGGIFDD---KTNATDVDHAISIVGWGE 258 Query: 337 LNGHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQ 441 NG +W+++NSW + WG G++ + NN GV+ Sbjct: 259 ENGVPFWVLRNSWGSFWGESGWMRLVRGVNNVGVE 293 Score = 59.3 bits (137), Expect = 5e-08 Identities = 27/88 (30%), Positives = 45/88 (51%) Frame = +1 Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357 +K ++K GPI + A K F Y+ G+Y E ++ + +V GY +YW Sbjct: 508 MKAEIYKRGPIGCGVHATSK-FESYTGGIYSEHVMFPLINH-EISVAGWGYDEETDTEYW 565 Query: 358 LVKNSWSNMWGNDGYVLMSMRENNCGVQ 441 + +NSW WG +G+ + M NN G++ Sbjct: 566 IGRNSWGTYWGENGWFRIQMHHNNLGIE 593 >UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|Rep: Phospholipase A1 - Tetrahymena thermophila Length = 320 Score = 68.1 bits (159), Expect = 1e-10 Identities = 47/152 (30%), Positives = 77/152 (50%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 GG +++I+ + + +Y Y +DG C + I+ + + + N+L A Sbjct: 183 GGWMVEGFKYIIDNKISQTANYP-YTAKDGKCKDTSSFKKFSISKYAEIPQGDCNSLNSA 241 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 L + GPISVA+DA + F FY++GV+ KN L+H VL V N +KN Sbjct: 242 L-EQGPISVAVDATN--FQFYTSGVF-----KNCKANLNHGVLLVA----NVDSSLKIKN 289 Query: 370 SWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465 SW WG G++ ++ N CGV +A +Y ++ Sbjct: 290 SWGPSWGEKGFIRLA-AGNTCGVCNAASYPIV 320 >UniRef50_Q24FA8 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 335 Score = 68.1 bits (159), Expect = 1e-10 Identities = 50/143 (34%), Positives = 78/143 (54%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLA 189 
GG A Q+ G+ ++ +Y Y G G C+I + T + + + E L+ A Sbjct: 195 GGVPSDAVQYAADFGVLSDNEYP-YTGIQGQCNITSKTNGFQPVQFSYLDGTAEG-LRKA 252 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKN 369 L +GP+SVA+DA++ Y++GV+ C +K L+HAVLAVGY G+ W++KN Sbjct: 253 L-NYGPVSVAMDASN--MKEYTSGVF--NNCTSKQFNLNHAVLAVGYDE-EGN--WIIKN 304 Query: 370 SWSNMWGNDGYVLMSMRENNCGV 438 S WG +GY L++ N CG+ Sbjct: 305 SKGPNWGMEGYFLLA-PGNTCGI 326 >UniRef50_A0DZB1 Cluster: Chromosome undetermined scaffold_7, whole genome shotgun sequence; n=4; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_7, whole genome shotgun sequence - Paramecium tetraurelia Length = 500 Score = 68.1 bits (159), Expect = 1e-10 Identities = 41/159 (25%), Positives = 80/159 (50%), Gaps = 14/159 (8%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVT-------TNN 168 GG F ++ + L TE+ Y Y G G C + + +K+ G N +N Sbjct: 315 GGYPFLVEKFASEQYLVTEQQYP-YKGDVGTCKKIDFSQSSKVYGAKNYKYIGGGYGLSN 373 Query: 169 ENALKLALFKHGPISVAIDAAHKTFSFYSNGVY-------FEPKCKNKVDELDHAVLAVG 327 E + + L+ +GP+ + + ++ F +Y +G+Y + + + + +++DH+VL G Sbjct: 374 ERDIMMELYTNGPVIMNFEPSYD-FMYYESGIYHSVAEHDWSTQERPEWEKVDHSVLCYG 432 Query: 328 YGVLNGHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQS 444 +G +G K+WL++NSW + WG +G M + ++S Sbjct: 433 WGEEDGVKFWLLQNSWGSQWGENGSFRMKRGVDESAIES 471 >UniRef50_Q23FQ5 Cluster: Papain family cysteine protease containing protein; n=4; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 67.7 bits (158), Expect = 1e-10 Identities = 47/145 (32%), Positives = 78/145 (53%), Gaps = 1/145 (0%) Frame = +1 Query: 31 YQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTN-NENALKLALFKHGP 207 +++ +K+G+ Y Y+G C N + ++K N N + +K A+ GP Sbjct: 200 FKYAIKYGIVQGSSYP-YVGYQTTCK--NTSNLSKYFPQSFKFINPNASDVKAAI-SQGP 255 Query: 208 ISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNMW 387 ISV +DA+ T+S YS G++ C + + +L+HAV+AVGY + +++N W W Sbjct: 256 
ISVTVDAS--TWSSYSGGIF--NGCNSNI-QLNHAVIAVGYDTQGNY---IIRNHWGTGW 307 Query: 388 GNDGYVLMSMRENNCGVQSAPTYVL 462 G GY+ +S NNCGV ++ VL Sbjct: 308 GEKGYMRLS-ANNNCGVLTSVIQVL 331 >UniRef50_Q235G6 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 325 Score = 67.7 bits (158), Expect = 1e-10 Identities = 45/141 (31%), Positives = 75/141 (53%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GGG A ++ + GL TEE+Y Y ++G C + + I+G+ + ++ L Sbjct: 180 GGGLRDIALNYVKETGLTTEEEYS-YEAKNGKCRLQGKSNPYTISGFTAIKQCSD--LVN 236 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 A+ K P++V ID+++ FY+NG++ C K++ H VL VGY + + W VK Sbjct: 237 AIQK-APVTVGIDSSN--LQFYTNGIF--SNCGTKIN---HGVLLVGYDSVK--EAWKVK 286 Query: 367 NSWSNMWGNDGYVLMSMRENN 429 NSW +G GY+ +S + N Sbjct: 287 NSWGPKFGEGGYIYLSAKITN 307 >UniRef50_A0DIY3 Cluster: Chromosome undetermined scaffold_52, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_52, whole genome shotgun sequence - Paramecium tetraurelia Length = 512 Score = 67.3 bits (157), Expect = 2e-10 Identities = 31/84 (36%), Positives = 46/84 (54%) Frame = +1 Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357 +K+ +F GPI + A + Y G F K + L+H V VG+GV +G +YW Sbjct: 418 MKIEIFNRGPIVCGVYATQE-LDDYEGGYIFSQKTNKTI--LNHYVSVVGWGVEDGVEYW 474 Query: 358 LVKNSWSNMWGNDGYVLMSMRENN 429 +V+NSW + WG+ GY M M +N Sbjct: 475 IVRNSWGSYWGDMGYAKMKMHSDN 498 >UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sativa|Rep: OSJNBb0085F13.15 protein - Oryza sativa subsp. 
japonica (Rice)
          Length = 383

 Score = 66.9 bits (156), Expect = 3e-10
 Identities = 52/167 (31%), Positives = 84/167 (50%), Gaps = 12/167 (7%)
 Frame = +1

Query: 1   ARGGGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYC--HIDNVTAITKITGWVNVTTNNE 171
           ++G  +D  A+ W+ K+ G+ ++  Y  Y+G    C   +  V   T + G V +  N E
Sbjct: 224 SKGYSDD--AFLWVSKNKGIASDLIYP-YVGHKESCKKQLLGVHNAT-VRGVVTLPENRE 279

Query: 172 NALKLALFKHGPISVAIDAAHKTFSFY-SNGVYFEPK-CKNKVDELDHAVLAVGYGVLN- 342
           + +  A+ +  P++V  DA    F  Y  NGVY     C   V+   HA+  VGYG  +
Sbjct: 280 DLIMAAVARQ-PVAVVFDAGDPLFQNYRGNGVYKGGTGCSTNVN---HALTIVGYGTNHP 335

Query: 343 --GHKYWLVKNSWSNMWGNDGYVLMSM----RENNCGVQSAPTYVLI 465
             G  YW+ KNS+ N+WG++G+V ++     R   CG+   PT+  I
Sbjct: 336 DTGENYWIAKNSYGNLWGDNGFVYLAKDTADRTGVCGLAIWPTFPTI 382

  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284

Lambda     K      H
   0.318    0.134    0.401

Gapped
Lambda     K      H
   0.279   0.0580    0.190

Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 530,075,216
Number of Sequences: 1657284
Number of extensions: 10171785
Number of successful extensions: 28752
Number of sequences better than 10.0: 500
Number of HSP's better than 10.0 without gapping: 27642
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 28309
length of database: 575,637,011
effective HSP length: 95
effective length of database: 418,195,031
effective search space used: 31782822356
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)