BLASTX 2.2.12 [Aug-07-2005] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= bmte11c10 (725 letters) Database: uniref50 1,657,284 sequences; 575,637,011 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C... 262 6e-69 UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ... 181 1e-44 UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n... 180 4e-44 UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep... 177 3e-43 UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n... 168 1e-40 UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei... 166 6e-40 UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j... 165 7e-40 UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat... 162 7e-39 UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M... 159 9e-38 UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs... 151 2e-35 UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar ... 150 4e-35 UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ... 150 4e-35 UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata... 149 7e-35 UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h... 148 1e-34 UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:... 147 3e-34 UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ... 147 3e-34 UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi... 145 9e-34 UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat... 145 1e-33 UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ... 143 3e-33 UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae... 142 8e-33 UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;... 142 8e-33 UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy... 141 1e-32 UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca... 140 4e-32 UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n... 139 7e-32 UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto... 138 1e-31 UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain... 138 1e-31 UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ... 137 3e-31 UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n... 136 4e-31 UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|... 135 1e-30 UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine prot... 134 2e-30 UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca... 134 3e-30 UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr... 133 4e-30 UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er... 133 5e-30 UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ... 132 6e-30 UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ... 132 6e-30 UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc... 132 6e-30 UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi... 132 8e-30 UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus... 130 3e-29 UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox... 129 6e-29 UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No... 129 8e-29 UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcopt... 129 8e-29 UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ... 129 8e-29 UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;... 128 1e-28 UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3... 127 3e-28 UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R... 127 3e-28 UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip... 126 6e-28 UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhip... 126 7e-28 UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicate... 124 2e-27 UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=... 124 2e-27 UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|... 123 4e-27 UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio... 123 4e-27 UniRef50_Q23H32 Cluster: Papain family cysteine protease contain... 122 7e-27 UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 p... 122 1e-26 UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteina... 121 2e-26 UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e... 121 2e-26 UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr... 121 2e-26 UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica... 120 4e-26 UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18... 120 4e-26 UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ... 119 6e-26 UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep: L... 119 8e-26 UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re... 118 1e-25 UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re... 117 3e-25 UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ... 116 6e-25 UniRef50_Q24E33 Cluster: Papain family cysteine protease contain... 116 6e-25 UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]... 116 6e-25 UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; D... 116 8e-25 UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Re... 116 8e-25 UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep... 115 1e-24 UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio... 115 1e-24 UniRef50_Q239L8 Cluster: Papain family cysteine protease contain... 115 1e-24 UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35... 113 3e-24 UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ... 113 4e-24 UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb... 113 6e-24 UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1... 113 6e-24 UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-... 112 7e-24 UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheir... 112 7e-24 UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;... 112 7e-24 UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph... 112 1e-23 UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; ... 112 1e-23 UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; ... 111 1e-23 UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n... 111 1e-23 UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ... 111 2e-23 UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted... 111 2e-23 UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=... 111 2e-23 UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome s... 110 3e-23 UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus ... 110 4e-23 UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:... 109 5e-23 UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v... 109 5e-23 UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R... 109 7e-23 UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease ... 109 9e-23 UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy... 109 9e-23 UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster... 108 1e-22 UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain... 107 2e-22 UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Ac... 107 3e-22 UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ... 107 4e-22 UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve... 107 4e-22 UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine p... 106 5e-22 UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus... 106 5e-22 UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa... 105 8e-22 UniRef50_O16454 Cluster: Temporarily assigned gene name protein ... 105 8e-22 UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG... 105 1e-21 UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryz... 105 1e-21 UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ... 105 1e-21 UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckei... 104 2e-21 UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ... 104 3e-21 UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome s... 104 3e-21 UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ... 104 3e-21 UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ... 104 3e-21 UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal... 103 3e-21 UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr... 103 3e-21 UniRef50_Q5C1E8 Cluster: SJCHGC04937 protein; n=1; Schistosoma j... 103 4e-21 UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D... 103 4e-21 UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory... 103 6e-21 UniRef50_Q23RT7 Cluster: Papain family cysteine protease contain... 102 8e-21 UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dv... 102 8e-21 UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty... 102 1e-20 UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ... 101 1e-20 UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: ... 101 1e-20 UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C... 101 1e-20 UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w... 101 1e-20 UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain... 101 2e-20 UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh... 101 2e-20 UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ... 101 2e-20 UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Pl... 101 2e-20 UniRef50_Q22A69 Cluster: Papain family cysteine protease contain... 101 2e-20 UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|... 101 2e-20 UniRef50_UPI000155637A Cluster: PREDICTED: similar to ENSANGP000... 100 3e-20 UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh... 100 3e-20 UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ... 100 3e-20 UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [C... 100 3e-20 UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin ... 100 4e-20 UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ... 100 4e-20 UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ... 99 6e-20 UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sa... 100 7e-20 UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t... 99 1e-19 UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaste... 99 1e-19 UniRef50_Q22RC9 Cluster: Papain family cysteine protease contain... 99 1e-19 UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|... 99 1e-19 UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G... 98 2e-19 UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic... 98 2e-19 UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel... 98 2e-19 UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum... 98 2e-19 UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain... 98 2e-19 UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ... 98 2e-19 UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia... 97 3e-19 UniRef50_Q23H10 Cluster: Papain family cysteine protease contain... 97 3e-19 UniRef50_Q17IZ6 Cluster: Procathepsin L3, putative; n=2; Culicid... 97 3e-19 UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, wh... 97 4e-19 UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome s... 97 5e-19 UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt... 97 5e-19 UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl... 97 5e-19 UniRef50_Q23894 Cluster: Cysteine proteinase 3; n=2; Dictyosteli... 97 5e-19 UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n... 96 7e-19 UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endop... 96 9e-19 UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;... 95 1e-18 UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa... 95 2e-18 UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ... 95 2e-18 UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil... 95 2e-18 UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ... 95 2e-18 UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ... 94 3e-18 UniRef50_Q22W19 Cluster: Papain family cysteine protease contain... 94 3e-18 UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi... 93 5e-18 UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease ... 93 6e-18 UniRef50_Q23H06 Cluster: Papain family cysteine protease contain... 93 8e-18 UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryz... 92 1e-17 UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis... 92 1e-17 UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; ... 92 1e-17 UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve... 92 1e-17 UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease ... 90 4e-17 UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; ... 90 6e-17 UniRef50_Q22NW9 Cluster: Papain family cysteine protease contain... 90 6e-17 UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, wh... 90 6e-17 UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ... 89 1e-16 UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain... 89 1e-16 UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov... 89 1e-16 UniRef50_Q24HI2 Cluster: Papain family cysteine protease contain... 89 1e-16 UniRef50_Q248G1 Cluster: Papain family cysteine protease contain... 88 2e-16 UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; ... 88 2e-16 UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ... 88 2e-16 UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium... 87 3e-16 UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain... 87 4e-16 UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza sativa... 86 7e-16 UniRef50_Q235G6 Cluster: Papain family cysteine protease contain... 86 7e-16 UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|... 85 1e-15 UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadida... 85 1e-15 UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole... 85 2e-15 UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ... 85 2e-15 UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Re... 85 2e-15 UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryo... 85 2e-15 UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n... 84 3e-15 UniRef50_Q23FS6 Cluster: Papain family cysteine protease contain... 84 3e-15 UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:... 83 5e-15 UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia ... 83 7e-15 UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schist... 83 7e-15 UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1; ... 83 7e-15 UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:... 83 9e-15 UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8; Magnoliophyt... 82 1e-14 UniRef50_Q231X3 Cluster: Papain family cysteine protease contain... 82 1e-14 UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru... 82 1e-14 UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cy... 82 1e-14 UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome s... 82 2e-14 UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ... 82 2e-14 UniRef50_Q23TW3 Cluster: Papain family cysteine protease contain... 82 2e-14 UniRef50_A0BNM1 Cluster: Chromosome undetermined scaffold_119, w... 82 2e-14 UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi... 81 2e-14 UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis... 81 4e-14 UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2... 80 6e-14 UniRef50_Q94715 Cluster: Putative cathepsin L2 precursor; n=4; P... 79 8e-14 UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, wh... 79 1e-13 UniRef50_Q23H15 Cluster: Papain family cysteine protease contain... 79 1e-13 UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lambli... 78 2e-13 UniRef50_Q23EG5 Cluster: Papain family cysteine protease contain... 78 3e-13 UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy... 78 3e-13 UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, wh... 78 3e-13 UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; The... 78 3e-13 UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryz... 77 3e-13 UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain... 77 3e-13 UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen... 77 3e-13 UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foet... 77 4e-13 UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Pl... 77 4e-13 UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin ... 77 6e-13 UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy... 76 8e-13 UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin ... 76 1e-12 UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa... 76 1e-12 UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain ... 75 1e-12 UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n... 75 2e-12 UniRef50_Q2QS15 Cluster: Papain family cysteine protease contain... 75 2e-12 UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; ... 75 2e-12 UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain... 75 2e-12 UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, w... 75 2e-12 UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precurs... 75 2e-12 UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto... 75 2e-12 UniRef50_Q60EQ9 Cluster: Putative uncharacterized protein OJ1280... 75 2e-12 UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina... 75 2e-12 UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2; ... 74 3e-12 UniRef50_A3EXS2 Cluster: Cathepsin L-like cysteine proteinase-li... 74 3e-12 UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy... 74 4e-12 UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali... 74 4e-12 UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestiv... 73 5e-12 UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain... 73 5e-12 UniRef50_A0DKP6 Cluster: Chromosome undetermined scaffold_54, wh... 73 5e-12 UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big... 73 7e-12 UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa... 73 7e-12 UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-li... 73 7e-12 UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,... 73 1e-11 UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ... 73 1e-11 UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin ... 73 1e-11 UniRef50_Q0JP65 Cluster: Os01g0240900 protein; n=3; Oryza sativa... 72 1e-11 UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ... 71 2e-11 UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1... 71 2e-11 UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|... 71 3e-11 UniRef50_A0EAG2 Cluster: Chromosome undetermined scaffold_86, wh... 71 4e-11 UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n... 70 5e-11 UniRef50_Q22DX2 Cluster: Papain family cysteine protease contain... 70 7e-11 UniRef50_UPI0000E492F4 Cluster: PREDICTED: similar to cathepsin ... 69 9e-11 UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Re... 69 9e-11 UniRef50_Q9XXQ7 Cluster: Putative uncharacterized protein; n=2; ... 69 1e-10 UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ... 68 2e-10 UniRef50_Q18740 Cluster: Putative uncharacterized protein tag-32... 68 2e-10 UniRef50_A0EI50 Cluster: Chromosome undetermined scaffold_98, wh... 68 2e-10 UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease ... 66 6e-10 UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep... 66 6e-10 UniRef50_A0BLR4 Cluster: Chromosome undetermined scaffold_115, w... 66 6e-10 UniRef50_P56202 Cluster: Cathepsin W precursor; n=15; Eutheria|R... 66 6e-10 UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi... 66 8e-10 UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Pir... 66 8e-10 UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; M... 66 8e-10 UniRef50_Q5YER1 Cluster: Cysteine proteinase; n=1; Bigelowiella ... 66 1e-09 UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; The... 66 1e-09 UniRef50_UPI0000D566EC Cluster: PREDICTED: similar to CG10460-PA... 65 1e-09 UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-... 65 1e-09 UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lambli... 65 1e-09 UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; R... 65 2e-09 UniRef50_Q93512 Cluster: Putative uncharacterized protein; n=2; ... 65 2e-09 UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precu... 64 3e-09 UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba hi... 64 4e-09 UniRef50_UPI0001509FF9 Cluster: Papain family cysteine protease ... 63 6e-09 UniRef50_A7LFV2 Cluster: Cathepsin L protease inhibitor 1; n=1; ... 63 6e-09 UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Pl... 63 6e-09 UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;... 63 8e-09 UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H... 63 8e-09 UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2; Arab... 62 1e-08 UniRef50_Q9BL26 Cluster: Putative uncharacterized protein; n=4; ... 62 1e-08 UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, wh... 62 1e-08 UniRef50_A7APS9 Cluster: Papain family cysteine protease contain... 62 1e-08 UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like cy... 62 1e-08 UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; ... 62 2e-08 UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, wh... 62 2e-08 UniRef50_Q75JH0 Cluster: Similar to Dictyostelium discoideum (Sl... 61 2e-08 UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcop... 61 2e-08 UniRef50_A1ZBK7 Cluster: CG10460-PA; n=1; Drosophila melanogaste... 61 2e-08 UniRef50_A0DCA5 Cluster: Chromosome undetermined scaffold_45, wh... 61 3e-08 UniRef50_A0CHZ5 Cluster: Chromosome undetermined scaffold_186, w... 61 3e-08 UniRef50_Q01E11 Cluster: Cysteine proteinase Cathepsin F; n=3; O... 60 4e-08 UniRef50_Q3L7L7 Cluster: Sar s 1 allergen Yv5020C01; n=1; Sarcop... 60 5e-08 UniRef50_A2Q4E7 Cluster: Peptidase C1A, papain; n=1; Medicago tr... 60 7e-08 UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;... 59 1e-07 UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:... 59 1e-07 UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2; cel... 59 1e-07 UniRef50_Q61CG7 Cluster: Putative uncharacterized protein CBG129... 51 1e-07 UniRef50_Q7KWP5 Cluster: Similar to Dictyostelium discoideum (Sl... 59 1e-07 UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, who... 59 1e-07 UniRef50_Q24F16 Cluster: Papain family cysteine protease contain... 57 4e-07 UniRef50_A0CS14 Cluster: Chromosome undetermined scaffold_26, wh... 57 4e-07 UniRef50_O62484 Cluster: Putative uncharacterized protein; n=1; ... 52 5e-07 UniRef50_Q5NE16 Cluster: Putative cathepsin L-like protein 3; n=... 57 5e-07 UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;... 56 7e-07 UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, w... 56 7e-07 UniRef50_UPI00015B5D85 Cluster: PREDICTED: similar to cathepsin ... 56 9e-07 UniRef50_Q237A1 Cluster: Papain family cysteine protease contain... 56 9e-07 UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1; ... 56 9e-07 UniRef50_UPI0000D566ED Cluster: PREDICTED: similar to CTLA-2-alp... 56 1e-06 UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome sh... 56 1e-06 UniRef50_Q650X9 Cluster: Putative cysteine proteinase; n=1; Oryz... 56 1e-06 UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein... 56 1e-06 UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella ve... 56 1e-06 UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase pr... 56 1e-06 UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcop... 55 2e-06 UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcop... 55 2e-06 UniRef50_A2DKX2 Cluster: Clan CA, family C1, cathepsin L-like cy... 55 2e-06 UniRef50_Q3L7L6 Cluster: Sar s 1 allergen Yv5032C08; n=1; Sarcop... 55 2e-06 UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like cy... 54 3e-06 UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase pr... 54 3e-06 UniRef50_Q650W8 Cluster: Putative cysteine proteinase; n=2; Oryz... 54 4e-06 UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomus... 54 4e-06 UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Try... 54 4e-06 UniRef50_Q3L7L2 Cluster: Sar s 1 allergen SMIPP-C Yv6008G08; n=2... 54 4e-06 UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ... 54 5e-06 UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: C... 54 5e-06 UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intest... 54 5e-06 UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathep... 54 5e-06 UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin ... 53 6e-06 UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyl... 53 6e-06 UniRef50_A7LFV3 Cluster: Cathepsin L protease inhibitor 2; n=1; ... 53 6e-06 UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cy... 53 8e-06 UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia... 52 1e-05 UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoid... 52 1e-05 UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Ca... 52 1e-05 UniRef50_Q8I8D4 Cluster: Cysteine protease 14; n=1; Entamoeba hi... 52 2e-05 UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 ... 51 3e-05 UniRef50_Q5CYC8 Cluster: Cathepsin like thiol protease possibly ... 51 3e-05 UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theil... 51 3e-05 UniRef50_UPI00006CBCFB Cluster: Papain family cysteine protease ... 50 4e-05 UniRef50_Q2NG83 Cluster: Member of asn/thr-rich large protein fa... 50 4e-05 UniRef50_A5HC50 Cluster: Cathepsin W; n=2; Theria|Rep: Cathepsin... 50 6e-05 UniRef50_Q4N5Z7 Cluster: Cysteine proteinase, putative; n=2; The... 50 6e-05 UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep: Ca... 50 6e-05 UniRef50_UPI0000DA404B Cluster: PREDICTED: similar to cathepsin ... 50 8e-05 UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep... 50 8e-05 UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=... 50 8e-05 UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like cy... 50 8e-05 UniRef50_A1Z9I0 Cluster: CG6357-PA; n=3; Drosophila melanogaster... 50 8e-05 UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3.... 50 8e-05 UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Gi... 50 8e-05 UniRef50_Q0AY53 Cluster: Putative uncharacterized protein; n=1; ... 49 1e-04 UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gamb... 49 1e-04 UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpw... 49 1e-04 UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3; Theil... 49 1e-04 UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus co... 49 1e-04 UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|... 49 1e-04 UniRef50_A0DZB1 Cluster: Chromosome undetermined scaffold_7, who... 49 1e-04 UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA,... 48 2e-04 UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2... 48 2e-04 UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|R... 48 2e-04 UniRef50_Q207N1 Cluster: Cathepsin S; n=2; Clupeocephala|Rep: Ca... 48 3e-04 UniRef50_Q01FL5 Cluster: Cysteine protease-5; n=3; Ostreococcus|... 48 3e-04 UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like prote... 48 3e-04 UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p; ... 47 4e-04 UniRef50_A0LES1 Cluster: Peptidase C1A, papain precursor; n=2; S... 47 4e-04 UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=... 47 4e-04 UniRef50_Q23FP9 Cluster: Papain family cysteine protease contain... 47 4e-04 UniRef50_A0BZE1 Cluster: Chromosome undetermined scaffold_139, w... 47 4e-04 UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lambl... 47 5e-04 UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-li... 47 5e-04 UniRef50_A0LQG9 Cluster: Peptidase C1A, papain; n=1; Syntrophoba... 46 7e-04 UniRef50_Q945E4 Cluster: Cysteine proteinase; n=1; Vasconcellea ... 46 7e-04 UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor; ... 46 7e-04 UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like cy... 46 7e-04 UniRef50_Q8TKH5 Cluster: Cell surface protein; n=3; Methanosarci... 46 7e-04 UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma... 46 7e-04 UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep... 46 0.001 UniRef50_Q54MB6 Cluster: Putative uncharacterized protein; n=1; ... 46 0.001 UniRef50_A6N590 Cluster: Extracellular cysteine protease 8; n=10... 46 0.001 UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator ame... 46 0.001 UniRef50_UPI00006CB653 Cluster: Papain family cysteine protease ... 46 0.001 UniRef50_Q962W3 Cluster: Cysteine protease; n=3; Giardia intesti... 46 0.001 UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1; ... 46 0.001 UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep... 46 0.001 UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|R... 46 0.001 UniRef50_A1XG92 Cluster: Putative cathepsin B-like like proteina... 46 0.001 UniRef50_Q70SU8 Cluster: Cystein proteinase inhibitor protein pr... 45 0.002 UniRef50_A6GBK8 Cluster: Peptidase C1A, papain; n=1; Plesiocysti... 45 0.002 UniRef50_Q7R5X2 Cluster: GLP_81_104117_102504; n=1; Giardia lamb... 45 0.002 UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n... 45 0.002 UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|R... 45 0.002 UniRef50_A3FQ13 Cluster: Cathepsin like thiol protease possibly ... 45 0.002 UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Gi... 45 0.002 UniRef50_UPI0000E815AE Cluster: PREDICTED: similar to glucocorti... 45 0.002 UniRef50_Q9SIE8 Cluster: Putative cysteine proteinase; n=1; Arab... 45 0.002 UniRef50_Q7QQ83 Cluster: GLP_42_16392_14707; n=1; Giardia lambli... 45 0.002 UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8... 45 0.002 UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; ... 45 0.002 UniRef50_UPI00001CC928 Cluster: PREDICTED: similar to CTLA-2-bet... 44 0.003 UniRef50_A6EGZ3 Cluster: Aminopeptidase C; n=1; Pedobacter sp. B... 44 0.003 UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep... 44 0.003 UniRef50_Q649T1 Cluster: Cathepsin C; n=1; uncultured archaeon G... 44 0.003 UniRef50_A5UP12 Cluster: Adhesin-like protein; n=1; Methanobrevi... 44 0.003 UniRef50_Q53K53 Cluster: Cysteine protease 1, putative; n=5; Ory... 44 0.004 UniRef50_Q8I8D7 Cluster: Cysteine protease 11; n=4; Entamoeba hi... 44 0.004 UniRef50_Q8TQM7 Cluster: Putative uncharacterized protein; n=1; ... 44 0.004 UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - ... 44 0.005 UniRef50_Q3L7L0 Cluster: Sar s 1 allergen SMIPP-C Yv5009F04; n=3... 44 0.005 UniRef50_Q26989 Cluster: Cysteine proteinase 5; n=1; Tritrichomo... 44 0.005 UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag... 44 0.005 UniRef50_Q9XZI2 Cluster: Cathepsin Z1 preproprotein; n=3; Bilate... 43 0.007 UniRef50_Q8TQ91 Cluster: Putative uncharacterized protein; n=1; ... 43 0.007 UniRef50_Q9UJW2 Cluster: Tubulointerstitial nephritis antigen; n... 43 0.007 UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4 ... 43 0.007 UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115, w... 43 0.009 UniRef50_P12400 Cluster: Protein CTLA-2-beta; n=6; Mus musculus|... 43 0.009 UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA... 42 0.012 UniRef50_Q8I0V1 Cluster: Preprocathepsin c, putative; n=1; Plasm... 42 0.012 UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia... 42 0.012 UniRef50_P21381 Cluster: Thaumatopain; n=10; Eukaryota|Rep: Thau... 42 0.012 UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1 ... 42 0.012 UniRef50_Q3A8I3 Cluster: Putative serine protease; n=1; Pelobact... 42 0.016 UniRef50_A5Z488 Cluster: Putative uncharacterized protein; n=1; ... 42 0.016 UniRef50_Q26993 Cluster: Cysteine proteinase 9; n=1; Tritrichomo... 42 0.016 UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8; Leishma... 42 0.016 UniRef50_A0D5R4 Cluster: Chromosome undetermined scaffold_39, wh... 42 0.016 UniRef50_A2SQ75 Cluster: Cysteine protease-like protein; n=1; Me... 42 0.016 UniRef50_Q9UBR2 Cluster: Cathepsin Z precursor; n=40; Bilateria|... 42 0.016 UniRef50_Q292E5 Cluster: GA10327-PA; n=1; Drosophila pseudoobscu... 42 0.021 UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n... 42 0.021 UniRef50_UPI0000E4866E Cluster: PREDICTED: similar to oxidized-L... 41 0.027 UniRef50_Q9GZM7-2 Cluster: Isoform 2 of Q9GZM7 ; n=1; Homo sapie... 41 0.027 UniRef50_Q7MTY9 Cluster: Cysteine peptidase, putative; n=8; Bact... 41 0.027 UniRef50_Q4UCF5 Cluster: Cysteine proteinase, tacP, putative; n=... 41 0.027 UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7... 41 0.027 UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7; Rhabditida|... 41 0.027 UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-li... 41 0.027 UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin ... 41 0.036 UniRef50_A0GDF5 Cluster: Putative uncharacterized protein; n=1; ... 41 0.036 UniRef50_Q8I8D3 Cluster: Cysteine protease 15; n=2; Entamoeba hi... 41 0.036 UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba hi... 41 0.036 UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2 ... 41 0.036 UniRef50_Q9GU75 Cluster: Thiolproteinase; n=2; Babesia|Rep: Thio... 40 0.047 UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1; Rhip... 40 0.047 UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestin... 40 0.047 UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma j... 40 0.047 UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1; ... 40 0.047 UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep: Ca... 40 0.047 UniRef50_Q3LFN3 Cluster: Cysteine proteinase; n=1; Dianthus cary... 40 0.063 UniRef50_UPI00006CBAC7 Cluster: Papain family cysteine protease ... 40 0.083 UniRef50_Q2XWW8 Cluster: Cysteine protease Mir1; n=1; Zea diplop... 40 0.083 UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1; O... 40 0.083 UniRef50_A2F4G0 Cluster: Clan CA, family C1, cathepsin L-like cy... 40 0.083 UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Catheps... 39 0.11 UniRef50_Q54R55 Cluster: Putative uncharacterized protein; n=1; ... 39 0.11 UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8.... 39 0.11 UniRef50_Q8PS79 Cluster: Putative uncharacterized protein; n=1; ... 39 0.11 UniRef50_Q2H7E7 Cluster: Putative uncharacterized protein; n=1; ... 39 0.14 UniRef50_UPI000155D183 Cluster: PREDICTED: similar to Cathepsin ... 38 0.19 UniRef50_A5FKT5 Cluster: Peptidase C1B, bleomycin hydrolase prec... 38 0.19 UniRef50_Q7JYA0 Cluster: RE20049p; n=2; Sophophora|Rep: RE20049p... 38 0.19 UniRef50_A2F4T7 Cluster: Clan CA, family C1, cathepsin L-like cy... 38 0.19 UniRef50_P84789 Cluster: Philibertain g 1; n=5; core eudicotyled... 38 0.19 UniRef50_A5ZM51 Cluster: Putative uncharacterized protein; n=1; ... 38 0.25 UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2; ... 38 0.25 UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n... 38 0.25 UniRef50_Q26988 Cluster: Cysteine proteinase 4; n=1; Tritrichomo... 38 0.25 UniRef50_Q4AI35 Cluster: Cysteine peptidase, putative precursor;... 38 0.33 UniRef50_A1SVF0 Cluster: Peptidase C1A, papain; n=1; Psychromona... 38 0.33 UniRef50_Q26987 Cluster: Cysteine proteinase 3; n=1; Tritrichomo... 38 0.33 UniRef50_O02470 Cluster: Cysteine proteinase; n=2; Chromadorea|R... 38 0.33 UniRef50_Q2FUI9 Cluster: Peptidase S8 and S53, subtilisin, kexin... 38 0.33 UniRef50_A5Z7Z2 Cluster: Putative uncharacterized protein; n=1; ... 37 0.44 UniRef50_A3J6N5 Cluster: Aminopeptidase C; n=4; Bacteroidetes|Re... 37 0.44 UniRef50_A1ZZ62 Cluster: Aminopeptidase C; n=1; Microscilla mari... 37 0.44 UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3; ... 37 0.44 UniRef50_Q9VNK7 Cluster: CG1075-PA; n=1; Drosophila melanogaster... 37 0.44 UniRef50_Q9NHY1 Cluster: Cysteine protease cp2; n=1; Theileria c... 37 0.44 UniRef50_Q7QQ92 Cluster: GLP_243_18349_20043; n=1; Giardia lambl... 37 0.44 UniRef50_A0DTZ2 Cluster: Chromosome undetermined scaffold_63, wh... 37 0.44 UniRef50_A6LML6 Cluster: Peptidase C1A, papain precursor; n=1; T... 37 0.58 UniRef50_A2WPN4 Cluster: Putative uncharacterized protein; n=2; ... 37 0.58 UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep: ... 37 0.58 UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4; Ancylostomati... 37 0.58 UniRef50_Q7QWN9 Cluster: GLP_26_50243_51811; n=2; Giardia lambli... 37 0.58 UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3 ... 37 0.58 UniRef50_UPI00006CFA59 Cluster: Papain family cysteine protease ... 32 0.76 UniRef50_Q5Y801 Cluster: Cysteine proteinase; n=1; Petunia x hyb... 36 0.77 UniRef50_Q54VR1 Cluster: Putative uncharacterized protein; n=1; ... 36 0.77 UniRef50_Q2FLC7 Cluster: Periplasmic copper-binding precursor; n... 36 0.77 UniRef50_A2ZKU2 Cluster: Putative uncharacterized protein; n=1; ... 36 1.0 UniRef50_Q6XM39 Cluster: FirrV-1-A48 precursor; n=1; Feldmannia ... 36 1.3 UniRef50_O65214 Cluster: Cysteine protease; n=2; Volvox carteri ... 36 1.3 UniRef50_A4RYN2 Cluster: Predicted protein; n=1; Ostreococcus lu... 36 1.3 UniRef50_Q6A1H9 Cluster: Cathepsin X/O; n=1; Suberites domuncula... 36 1.3 UniRef50_Q26015 Cluster: Serine rich protein homologue; n=4; Pla... 36 1.3 >UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain] - Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina) Length = 339 Score = 262 bits (642), Expect = 6e-69 Identities = 115/174 (66%), Positives = 141/174 (81%) Frame = +1 Query: 202 DLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYG 381 DL+KEEW +KLQHR NY +EVE+ FRMKI+ E++H IAKHNQ + G VSYKLG+NKY Sbjct: 22 DLIKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYA 81 Query: 382 DMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIK 561 DMLHHEF +TMNG+N T + L + + GA +I PA+V +P+ VDWR+HGAVT +K Sbjct: 82 DMLHHEFKETMNGYNHTLRQ---LMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVK 138 Query: 562 DQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 DQG CGSCW+FS+TGALEGQHFR++G LVSLSEQNL+DCS +YGNNGCNGGLMD Sbjct: 139 DQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMD 192 >UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin L-like protease; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to cathepsin L-like protease - Nasonia vitripennis Length = 353 Score = 181 bits (441), Expect = 1e-44 Identities = 80/169 (47%), Positives = 120/169 (71%), Gaps = 2/169 (1%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 ++W+AFKL+++ NY +VE+NFR ++ E++ IA+HNQK+++GL +YK+ +N++GDM+ Sbjct: 38 DDWAAFKLRYKKNYNGDVEENFRRSVFHENQRKIAEHNQKHDLGLFTYKVRINQFGDMMF 97 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK-LPEQVDWRKHGAVTDIKDQG 570 E+ M+ N T K + RG +FI P + + +PE VDWR+ GAVT ++DQG Sbjct: 98 EEYKNYMHAANNTITQLKRI------PRGDEFIKPKSAENVPEHVDWRQRGAVTPVRDQG 151 Query: 571 -KCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 CGSCW+FS GALE Q+F+++G L +LS QNLIDC+ +YGN GC GG Sbjct: 152 LTCGSCWAFSAAGALEAQYFKKTGVLTALSAQNLIDCTMEYGNLGCGGG 200 >UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Longidorus elongatus Length = 358 Score = 180 bits (437), Expect = 4e-44 Identities = 82/168 (48%), Positives = 112/168 (66%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 W+ FKL+H +Y+++ E+ R +++A + +I +HN +YE G S+ L +NK+ DM + E Sbjct: 43 WTNFKLKHAKSYKTKDEELLRFQVFASNHKVIEQHNIEYEAGQHSFALSLNKFADMTNAE 102 Query: 400 FVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCG 579 F + MNGF AK K + G F P NV +P+ VDWRK G VT +KDQG CG Sbjct: 103 FRQRMNGFKLPAKR-KLAKSQPLKEDGMIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCG 161 Query: 580 SCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 SCW+FS TG+LEGQH++Q+G LVSLSEQNL+DC + GCNGG MD Sbjct: 162 SCWAFSATGSLEGQHYKQTGKLVSLSEQNLVDCDVNGDDEGCNGGYMD 209 >UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin L - Misgurnus mizolepis (Mud loach) Length = 337 Score = 177 bits (430), Expect = 3e-43 Identities = 84/170 (49%), Positives = 113/170 (66%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 + W +K H NY E E+ +R I+ ++ I HN ++ MG+ +Y+LGMN +GDM H Sbjct: 27 DHWEQWKTWHGKNYH-EKEEGWRRMIWEKNLRKIQFHNLEHSMGIHTYRLGMNHFGDMNH 85 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK 573 EF + MNG+ KH KG + F+ P +++P ++DWR+ G VT +KDQG+ Sbjct: 86 EEFRQVMNGY----KHKTERKFKG-----SLFMEPNFLEVPSKLDWREKGYVTPVKDQGE 136 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 CGSCW+FSTTGA+EGQ FR+ G LVSLSEQNL+DCS GN GCNGGLMD Sbjct: 137 CGSCWAFSTTGAMEGQMFRKQGKLVSLSEQNLVDCSRPEGNEGCNGGLMD 186 >UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n=21; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Globodera pallida Length = 379 Score = 168 bits (409), Expect = 1e-40 Identities = 83/171 (48%), Positives = 111/171 (64%), Gaps = 2/171 (1%) Frame = +1 Query: 217 EWSAFKLQH-RLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 +W+A+K +H R Y + +N RM Y K I KHNQ Y G V++++G N D+ Sbjct: 69 DWNAYKQKHGRKAYADQDVENERMLTYLSAKQFIDKHNQAYIEGKVTFRVGENHIADLPF 128 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANV-KLPEQVDWRKHGAVTDIKDQG 570 E+ K +NG+ + N + F++P NV LPE VDWR G VT++K+QG Sbjct: 129 SEY-KKLNGYRRLLGDNLRR-------NASTFLAPMNVGDLPESVDWRDKGWVTEVKNQG 180 Query: 571 KCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 CGSCW+FS+TGALE QH RQ+G L+SLSEQNLIDCS++YGN GCNGG+MD Sbjct: 181 MCGSCWAFSSTGALEAQHARQTGQLISLSEQNLIDCSKKYGNMGCNGGIMD 231 >UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like midgut cysteine proteinase - Tenebrio molitor (Yellow mealworm) Length = 330 Score = 166 bits (403), Expect = 6e-40 Identities = 86/174 (49%), Positives = 114/174 (65%), Gaps = 1/174 (0%) Frame = +1 Query: 205 LVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGD 384 L +E+WS FKL H+ +Y S +E+ R I+ ++ IA+HN K+E G V+Y MN++GD Sbjct: 23 LFQEQWSQFKLTHKKSYSSPIEEIRRQLIFKDNVAKIAEHNAKFEKGEVTYSKAMNQFGD 82 Query: 385 MLHHEFVKTMN-GFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIK 561 M EF+ +N G + KH +NL M ++S + L VDWR + AV+++K Sbjct: 83 MSKEEFLAYVNRGKAQKPKHPENLRMP--------YVS-SKKPLAASVDWRSN-AVSEVK 132 Query: 562 DQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 DQG+CGSCWSFSTTGA+EGQ Q G L SLSEQNLIDCS YGN GC+GG MD Sbjct: 133 DQGQCGSCWSFSTTGAVEGQLALQRGRLTSLSEQNLIDCSSSYGNAGCDGGWMD 186 >UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma japonicum|Rep: SJCHGC06231 protein - Schistosoma japonicum (Blood fluke) Length = 372 Score = 165 bits (402), Expect = 7e-40 Identities = 75/168 (44%), Positives = 112/168 (66%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 W FK+ + Y + +E+ R I+ + + +HN+ Y+ G +YK+G+N + D +E Sbjct: 62 WKFFKINFKRAYGNVMEETKRFLIFGTNFIKMMEHNRAYQEGKATYKMGVNNFTDKTEYE 121 Query: 400 FVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCG 579 K + G+ + K +G+ FIS + KLP++VDWR++GAVT +K+QG+CG Sbjct: 122 LRK-LRGYRSACRIAKP--------KGSTFISSEHAKLPDRVDWRRNGAVTPVKNQGQCG 172 Query: 580 SCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 SCW+FS+TGA+EGQH+R++ LV+LSEQ LIDCS+ YGNNGC GGLMD Sbjct: 173 SCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMD 220 >UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviatus|Rep: Cathepsin L 2 - Diaprepes abbreviatus (Sugarcane rootstalk borer weevil) Length = 348 Score = 162 bits (394), Expect = 7e-39 Identities = 81/182 (44%), Positives = 113/182 (62%), Gaps = 10/182 (5%) Frame = +1 Query: 205 LVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGD 384 LV+E+W FKL+H YESE E+ +R ++ E+ I +HN+ YEMGL SY++ MN GD Sbjct: 23 LVQEQWEQFKLEHGKVYESESENEYRQSVFMENLFQINEHNKLYEMGLSSYQMAMNHLGD 82 Query: 385 MLHHEFVKTMNGFNKTAKHNKNLYMKGG------SVRG-AKFISPAN---VKLPEQVDWR 534 + EF++ ++NL ++G + P N V LP +DWR Sbjct: 83 LTKDEFMRIYTVNMPQLPQSENLSDSEPWLDLPQDLQGFVTYALPTNLDEVDLPTDIDWR 142 Query: 535 KHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 + GAVT +K+Q CGSCWSFS TGALE Q F+++ L+SLSEQ L+DCS +YGN+GC+GG Sbjct: 143 QKGAVTPVKNQRNCGSCWSFSATGALEAQWFKKTNKLISLSEQQLVDCSGRYGNHGCHGG 202 Query: 715 LM 720 M Sbjct: 203 WM 204 >UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (Major excreted protein) (MEP) [Contains: Cathepsin L heavy chain; Cathepsin L light chain]; n=19; Eukaryota|Rep: Cathepsin L precursor (EC 3.4.22.15) (Major excreted protein) (MEP) [Contains: Cathepsin L heavy chain; Cathepsin L light chain] - Homo sapiens (Human) Length = 333 Score = 159 bits (385), Expect = 9e-38 Identities = 76/172 (44%), Positives = 104/172 (60%) Frame = +1 Query: 208 VKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDM 387 ++ +W+ +K H Y E+ +R ++ ++ +I HNQ+Y G S+ + MN +GDM Sbjct: 25 LEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDM 83 Query: 388 LHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQ 567 EF + MNGF +G F P + P VDWR+ G VT +K+Q Sbjct: 84 TSEEFRQVMNGFQNRKPR-----------KGKVFQEPLFYEAPRSVDWREKGYVTPVKNQ 132 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 G+CGSCW+FS TGALEGQ FR++G L+SLSEQNL+DCS GN GCNGGLMD Sbjct: 133 GQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMD 184 >UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precursor; n=3; Metazoa|Rep: Digestive cysteine proteinase 2 precursor - Homarus americanus (American lobster) Length = 323 Score = 151 bits (365), Expect = 2e-35 Identities = 76/170 (44%), Positives = 103/170 (60%), Gaps = 2/170 (1%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 W FK ++ Y ED++R I+ +++ I + N+KYE G V++ L MNK+GDM E Sbjct: 20 WEHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEE 79 Query: 400 FVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPE--QVDWRKHGAVTDIKDQGK 573 F M G N+ + V P P+ +VDWR GAVT +KDQG+ Sbjct: 80 FNAVMKG---------NIPRRSAPV---SVFYPKKETGPQATEVDWRTKGAVTPVKDQGQ 127 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 CGSCW+FSTTG+LEGQHF ++G L+SL+EQ L+DCS YG GCNGG M+ Sbjct: 128 CGSCWAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMN 177 >UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar to human SRY (sex determining region Y)-box 30 (SOX30),transcript variant 1,; n=3; Mammalia|Rep: Testis cDNA clone: QtsA-12228, similar to human SRY (sex determining region Y)-box 30 (SOX30),transcript variant 1, - Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey) Length = 433 Score = 150 bits (363), Expect = 4e-35 Identities = 75/169 (44%), Positives = 103/169 (60%) Frame = +1 Query: 217 EWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHH 396 +W +K HR Y + E+ +R ++ ++ +I HN +Y G + + MN +GDM + Sbjct: 28 KWYQWKATHRRLYGAS-EEGWRRAVWEKNMKMIELHNGEYSQGKHGFAMAMNAFGDMTNE 86 Query: 397 EFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKC 576 EF + M F N+ L +G F P + LP+ VDWRK G VT +K+Q +C Sbjct: 87 EFRQVMGCFR-----NQKLR------KGKLFREPLFLDLPKSVDWRKKGYVTPVKNQKQC 135 Query: 577 GSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 GSCW+FS TGALEGQ FR++G LVSLSEQNL+DCS GN GCNGG M+ Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQGCNGGFMN 184 >UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 326 Score = 150 bits (363), Expect = 4e-35 Identities = 74/171 (43%), Positives = 108/171 (63%) Frame = +1 Query: 211 KEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDML 390 KEEW FK+++ +Y + +E+ R I+ I HN KY+ GL ++KLG+ K+ D+ Sbjct: 20 KEEWVQFKVRNNKSYRNYIEEQKRFTIFQGSLRKIENHNDKYDHGLSTFKLGVTKFADLT 79 Query: 391 HHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQG 570 EF M G +++ K ++ R ++P LP + DWR+ GAVT++KDQG Sbjct: 80 EKEF-SDMLGISRSTKSSRP--------RVIHSLTPVK-DLPSKFDWREKGAVTEVKDQG 129 Query: 571 KCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 CGSCWSFSTTG +EG +F ++G LVSLSEQNL+DC+++ GC+GG MD Sbjct: 130 SCGSCWSFSTTGTVEGAYFLKTGKLVSLSEQNLVDCAKE-DCYGCSGGYMD 179 >UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata|Rep: Cathepsin L2 precursor - Homo sapiens (Human) Length = 334 Score = 149 bits (361), Expect = 7e-35 Identities = 75/168 (44%), Positives = 102/168 (60%) Frame = +1 Query: 217 EWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHH 396 +W +K HR Y + E+ +R ++ ++ +I HN +Y G + + MN +GDM + Sbjct: 28 KWYQWKATHRRLYGAN-EEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNE 86 Query: 397 EFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKC 576 EF + M F +N + G V F P + LP+ VDWRK G VT +K+Q +C Sbjct: 87 EFRQMMGCF-------RNQKFRKGKV----FREPLFLDLPKSVDWRKKGYVTPVKNQKQC 135 Query: 577 GSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 GSCW+FS TGALEGQ FR++G LVSLSEQNL+DCS GN GCNGG M Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFM 183 >UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin heavy chain; n=3; Amniota|Rep: PREDICTED: similar to ferritin heavy chain - Ornithorhynchus anatinus Length = 338 Score = 148 bits (359), Expect = 1e-34 Identities = 72/167 (43%), Positives = 104/167 (62%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 E W +K+ H NY E E+ FR + ++ +I +HN++ G SY+L MN +GD + Sbjct: 26 EGWWRWKVLHGKNYSVEAEEVFRRAAWEKNVRVIERHNEEMSQGKHSYRLAMNHFGDQTN 85 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK 573 E + +NGF + + ++ G + A+F S + + PE+VDWR G VT +K+QG Sbjct: 86 EELHERLNGF----RPDLGGALRSGREQ-ARFRSKTSWEGPEEVDWRTKGYVTPVKNQGL 140 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 CGSCW+FS TGALE F+ +G +VSLSEQNL+DCS + GN GC GG Sbjct: 141 CGSCWAFSATGALEALVFKTTGKMVSLSEQNLVDCSWRQGNVGCRGG 187 >UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep: Cathepsin - Petromyzon marinus (Sea lamprey) Length = 333 Score = 147 bits (356), Expect = 3e-34 Identities = 75/166 (45%), Positives = 99/166 (59%) Frame = +1 Query: 217 EWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHH 396 +W +K + +Y SE ED R ++ ++ + +HN + G VS+ LG+NKY D+ H Sbjct: 26 QWDTWKSTYGKHYGSEQEDAHRRDVFEQNLKRVLQHNLLADEGNVSFHLGINKYSDLELH 85 Query: 397 EFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKC 576 E+ K NL G RGA F + LPEQVDWR G VT +K+QG C Sbjct: 86 EY------HEKVVGRFWNL-RNGTRRRGAPFPLRSMDNLPEQVDWRLKGYVTPVKEQGLC 138 Query: 577 GSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 GS W+FS TG+LEGQHF +G L SLSEQ L+DC++ Y NNGCNGG Sbjct: 139 GSSWAFSATGSLEGQHFAATGNLTSLSEQQLVDCTKSYYNNGCNGG 184 >UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine protease; n=11; Callosobruchus maculatus|Rep: Putative gut cathepsin L-like cysteine protease - Callosobruchus maculatus (Southern cowpea weevil) (Pulse bruchid) Length = 326 Score = 147 bits (356), Expect = 3e-34 Identities = 74/172 (43%), Positives = 102/172 (59%), Gaps = 1/172 (0%) Frame = +1 Query: 208 VKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDM 387 V EEW FKL H Y S VE+ R ++ ++ I +HN+KYE G S+ + ++ DM Sbjct: 19 VYEEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADM 78 Query: 388 LHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQ 567 H EF+ + A + +V F +++ + VDWR+ GAVT +KDQ Sbjct: 79 THEEFLDLLKLQGVPA-------LPSNAVHFDNF-EDIDMEEKDAVDWREEGAVTPVKDQ 130 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC-SEQYGNNGCNGGLM 720 CGSCW+FS GA+EGQ F+++G LVSLS Q L+DC +E YGNNGC GGLM Sbjct: 131 ANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLM 182 >UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsin L - Suberites domuncula (Sponge) Length = 324 Score = 145 bits (352), Expect = 9e-34 Identities = 73/170 (42%), Positives = 102/170 (60%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 EEW A+K +H Y E+E+ R I+ +K I HN + Y L MN++GD+ Sbjct: 21 EEWVAWKQEHSKEYTEELEELRRHTIWQSNKKFIDSHNSVSDK--FGYTLEMNEFGDLSG 78 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK 573 EF + NG+ + N + ++ PA VDWR+ G V+++K+QG+ Sbjct: 79 VEFKQIYNGYIMQERANDTKLFTA-----SPYMEPA-----ASVDWRQKGVVSEVKNQGQ 128 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 CGSCWSFS TG+LEGQH + G LVSLSEQNL+DCS ++GN+GC GG+MD Sbjct: 129 CGSCWSFSATGSLEGQHALKMGRLVSLSEQNLMDCSSRFGNHGCKGGIMD 178 >UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cathepsin L; n=4; Danio rerio|Rep: Novel protein similar to vertebrate cathepsin L - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 334 Score = 145 bits (351), Expect = 1e-33 Identities = 71/169 (42%), Positives = 100/169 (59%), Gaps = 1/169 (0%) Frame = +1 Query: 217 EWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHH 396 EW+ +K +H ++Y+ E ED R I+ + I K+N + GL +K+ MNKYGD+ Sbjct: 25 EWNLWKKKHEISYDEESEDVHRKTIWETNMQKIWKNNNDFSFGLSMFKMAMNKYGDLTSV 84 Query: 397 EFVKTMNGFNKTAKHNKNLYMKGGSVR-GAKFISPANVKLPEQVDWRKHGAVTDIKDQGK 573 E+ + + K + K +R AK + N+ D+R G VT++KDQG Sbjct: 85 EYKRLLGSKIKGTGNRKGKITSAQMLRLNAKRLGVTNI------DYRAKGYVTEVKDQGY 138 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 CGSCWSFSTTGA+EGQ ++ +G LVSLSEQ L+DCS YG GC+G M Sbjct: 139 CGSCWSFSTTGAIEGQMYKHTGRLVSLSEQQLVDCSRSYGTYGCSGAWM 187 >UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 317 Score = 143 bits (347), Expect = 3e-33 Identities = 66/169 (39%), Positives = 101/169 (59%) Frame = +1 Query: 208 VKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDM 387 V ++W+ FK+ H Y E+ R ++++++ I +HN +Y+ G VS+ LG+N++ DM Sbjct: 12 VHQQWAQFKVNHSKKYGHLKEEQVRFQVFSQNLQKIEQHNARYQNGEVSFYLGVNQFADM 71 Query: 388 LHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQ 567 EF K M K +++ ++F++ + +PE +DWR+ GAV ++DQ Sbjct: 72 TSEEF-KAMLDSQLIHKPKRDIT--------SRFVADPQLTVPESIDWREKGAVNPVRDQ 122 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 +CGSCW+FS GALEGQ F + G L LS Q L+DCS Y N GCNGG Sbjct: 123 EQCGSCWAFSAAGALEGQRFLKEGKLEVLSTQQLVDCSRDYKNEGCNGG 171 >UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae|Rep: Cysteine proteinase - Hypera postica (alfalfa weevil) Length = 324 Score = 142 bits (344), Expect = 8e-33 Identities = 77/171 (45%), Positives = 102/171 (59%), Gaps = 2/171 (1%) Frame = +1 Query: 217 EWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHH 396 ++ AFKL+H Y ++ E++ R I+ ++ I HN YE G VSYK G+NK+ DM Sbjct: 25 KFQAFKLEHGKTYLNQAEESKRFNIFTDNVRAIEAHNALYEQGKVSYKKGINKFTDMSQE 84 Query: 397 EFVKTMNGFNKTAKHNKNL--YMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQG 570 EF KTM + + K Y+K G V++P VDWRK G VT +KDQG Sbjct: 85 EF-KTMLTLSASRKPTLETTSYVKTG------------VEIPSSVDWRKEGRVTGVKDQG 131 Query: 571 KCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 CGSCW+FS TG+ EG + R+SG LVSLSEQ LIDC + GC+GG +D Sbjct: 132 DCGSCWAFSITGSTEGAYARKSGKLVSLSEQQLIDCCTD-TSAGCDGGSLD 181 >UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2; Brugia malayi|Rep: Cahepsin L-like cysteine protease - Brugia malayi (Filarial nematode worm) Length = 371 Score = 142 bits (344), Expect = 8e-33 Identities = 76/173 (43%), Positives = 106/173 (61%), Gaps = 4/173 (2%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNF---RMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGD 384 +++++++L R + + E N R Y ++ I KHN++YE +Y+L +N D Sbjct: 49 KQYASYRLYKRKYNKRDEEINLEHRRFMTYLKNVKEIEKHNERYERNEETYELAINHLAD 108 Query: 385 MLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKD 564 ML EF K ++GF +KN + ++R N LP+ +DWR GAVT +KD Sbjct: 109 MLPEEFRK-LHGFQSRKITSKNNFKN--TIR-----MKINGPLPKSIDWRTSGAVTKVKD 160 Query: 565 QGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQ-YGNNGCNGGLM 720 QG CGSCW+FS GALEGQHF Q+G LV LS QNL+DCS+ YGN GC+GGLM Sbjct: 161 QGYCGSCWTFSAVGALEGQHFLQTGKLVELSMQNLLDCSDDTYGNYGCDGGLM 213 >UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platyhelminthes|Rep: Cathepsin L-like proteinase - Echinococcus multilocularis Length = 338 Score = 141 bits (342), Expect = 1e-32 Identities = 68/168 (40%), Positives = 97/168 (57%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 W +K+ + Y + E++ RM+I+ + + HN++Y +GL +Y +N + D+ E Sbjct: 30 WRGWKVANNKTYATLREEHLRMRIFINNYLFVRWHNERYYLGLETYSTALNAFADLTLEE 89 Query: 400 FVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCG 579 F + +T M V P + +P+ +DWRK G VT IKDQG CG Sbjct: 90 FAEKYLTLKQTPMEGIWQDMSTQYVE-----RPTRMLVPDSIDWRKKGLVTPIKDQGDCG 144 Query: 580 SCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 SCW+FS TGALEGQ R++G L+SLSEQ L+DCS GN GCNGG M+ Sbjct: 145 SCWAFSATGALEGQLKRKTGKLISLSEQQLVDCSTYTGNEGCNGGDMN 192 >UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Cathepsin - Geodia cydonium (Sponge) Length = 322 Score = 140 bits (338), Expect = 4e-32 Identities = 71/170 (41%), Positives = 99/170 (58%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 +EW +KL++ Y S+ ED R +++ + + + + + E Y + MN++ D+ Sbjct: 17 DEWEQWKLKYNKQYSSQEEDYLRQRVWLSNLKFVEEFDSERE----GYTVAMNEFADLDP 72 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK 573 EFV NG + H + G + +S LP VDWR G VT +K+QG+ Sbjct: 73 REFVSHYNGLRRRP-HTSS----GEPCTLGEDVSA----LPTTVDWRTKGYVTGVKNQGQ 123 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 CGSCW+FS TG+LEGQHF +G LVSLSEQNL+DCS GN GCNGGL D Sbjct: 124 CGSCWAFSATGSLEGQHFNATGKLVSLSEQNLVDCSSAEGNEGCNGGLPD 173 >UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4; core eudicotyledons|Rep: Papain-like cysteine peptidase XBCP3 - Arabidopsis thaliana (Mouse-ear cress) Length = 437 Score = 139 bits (336), Expect = 7e-32 Identities = 74/174 (42%), Positives = 107/174 (61%) Frame = +1 Query: 202 DLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYG 381 D + E + + +H Y SE E R++I+ ++ + +HN + +Y L +N + Sbjct: 26 DDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNL---ITNATYSLSLNAFA 82 Query: 382 DMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIK 561 D+ HHEF + G + +A + + KG S+ G+ VK+P+ VDWRK GAVT++K Sbjct: 83 DLTHHEFKASRLGLSVSAP-SVIMASKGQSLGGS-------VKVPDSVDWRKKGAVTNVK 134 Query: 562 DQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 DQG CG+CWSFS TGA+EG + +G L+SLSEQ LIDC + Y N GCNGGLMD Sbjct: 135 DQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSY-NAGCNGGLMD 187 >UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens (Human) Length = 331 Score = 138 bits (335), Expect = 1e-31 Identities = 70/168 (41%), Positives = 98/168 (58%), Gaps = 1/168 (0%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 W +K + Y+ + E+ R I+ ++ + HN ++ MG+ SY LGMN GDM E Sbjct: 28 WHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEE 87 Query: 400 FVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCG 579 + M+ ++ +N+ K S N LP+ VDWR+ G VT++K QG CG Sbjct: 88 VMSLMSSLRVPSQWQRNITYK----------SNPNRILPDSVDWREKGCVTEVKYQGSCG 137 Query: 580 SCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCS-EQYGNNGCNGGLM 720 +CW+FS GALE Q ++G LVSLS QNL+DCS E+YGN GCNGG M Sbjct: 138 ACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFM 185 >UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain; n=9; Cucujiformia|Rep: Digestive cysteine proteinase intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 326 Score = 138 bits (334), Expect = 1e-31 Identities = 71/171 (41%), Positives = 100/171 (58%), Gaps = 1/171 (0%) Frame = +1 Query: 211 KEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDML 390 K++W AFK H Y+S +E+ R I+ + I +HN KY+ G SY LG+ + D+ Sbjct: 20 KDQWVAFKQTHGKTYKSLLEERTRFGIFQSNLRKIEEHNAKYDKGEESYFLGVTPFADLT 79 Query: 391 HHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQG 570 H EF + KT K N V + P +++P+ +DW + GAV D+K QG Sbjct: 80 HDEFKDELRRQIKT-KPN---------VEATLAVFPEGLEVPDSIDWTQKGAVLDVKYQG 129 Query: 571 KCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGC-NGGLM 720 CGSCW+FS TGALEGQ+ + + LSEQ L+DCS+ YGN+ C +GGLM Sbjct: 130 GCGSCWAFSATGALEGQNAIVNNVKIPLSEQQLLDCSKPYGNDDCEHGGLM 180 >UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; n=35; Fasciola|Rep: Cathepsin L-like proteinase precursor - Fasciola hepatica (Liver fluke) Length = 326 Score = 137 bits (331), Expect = 3e-31 Identities = 64/168 (38%), Positives = 98/168 (58%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 W +K + Y +D R I+ ++ I +HN ++++GLV+Y LG+N++ DM E Sbjct: 21 WHQWKRMYNKEYNG-ADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEE 79 Query: 400 FVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCG 579 F AK+ + + N +P+++DWR+ G VT++KDQG CG Sbjct: 80 F---------KAKYLTEMSRASDILSHGVPYEANNRAVPDKIDWRESGYVTEVKDQGNCG 130 Query: 580 SCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 SCW+FSTTG +EGQ+ + +S SEQ L+DCS +GNNGC+GGLM+ Sbjct: 131 SCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLME 178 >UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2; Taeniidae|Rep: Cathepsin L-like cysteine proteinase - Taenia solium (Pork tapeworm) Length = 339 Score = 136 bits (330), Expect = 4e-31 Identities = 68/171 (39%), Positives = 104/171 (60%) Frame = +1 Query: 208 VKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDM 387 + +W+ +KLQH Y + E+ +R ++A + I N+++ GL SY G+N++ D+ Sbjct: 31 LSRQWAGWKLQHGRVYSGK-EEAYRRGVFARNLLYIKGQNRRFNAGLESYSTGLNQFADL 89 Query: 388 LHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQ 567 EF + G ++ + G R K ++ A LP+ VDWR VT++K+Q Sbjct: 90 ESSEFSERFLGTRPESR------VAGRRGRIWKALASA-AGLPDTVDWRDKNLVTEVKNQ 142 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 G CGSCW+FS+TGALEG +++G L+SLSEQ L+DCS + GN+GCNGG M Sbjct: 143 GNCGSCWAFSSTGALEGAFAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYM 193 >UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|Rep: Cysteine proteinase - Acanthamoeba healyi Length = 330 Score = 135 bits (326), Expect = 1e-30 Identities = 74/164 (45%), Positives = 101/164 (61%) Frame = +1 Query: 232 KLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKT 411 K +R Y +E E +R ++ + +H N++ + SY L MN++GD+ + EF + Sbjct: 39 KSNYRFVYSNE-EFIYRWNVWRDEEH-----NRQNK----SYFLAMNQFGDLTNAEFNRL 88 Query: 412 MNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWS 591 G Y K + A +PA +P + DWR+ GAVT +K+QG+CGSCWS Sbjct: 89 FKGLAFD-------YSKHAKIHTAAPEAPAT-GIPSEFDWRQKGAVTHVKNQGQCGSCWS 140 Query: 592 FSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 FSTTG+ EG +F ++G LVSLSEQNLIDCS YGNNGCNGGLMD Sbjct: 141 FSTTGSTEGANFLKTGRLVSLSEQNLIDCSVSYGNNGCNGGLMD 184 >UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine protease; n=1; Maconellicoccus hirsutus|Rep: Putative cathepsin L-like cysteine protease - Maconellicoccus hirsutus (hibiscus mealybug) Length = 339 Score = 134 bits (325), Expect = 2e-30 Identities = 67/171 (39%), Positives = 99/171 (57%) Frame = +1 Query: 202 DLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYG 381 +L EEW FK Q+ Y +++ED RMKI+ ++K+ IA+HN+ + GLV+++ G+N+Y Sbjct: 23 NLFHEEWQLFKTQYSKKYTTDIEDRLRMKIFIDNKYRIAQHNKLFHKGLVTFEQGINEYS 82 Query: 382 DMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIK 561 DML EF + M + + + +N G + +F NV P+ VDWR G V + Sbjct: 83 DMLQSEFNEKM---GQKSSNQRNTEANG--LPSIRFTPLHNVNPPDSVDWRTKGLVGPVG 137 Query: 562 DQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 Q C S +++S GALEGQ +S QN+IDCSE GN GC+GG Sbjct: 138 KQVNCSSGYAWSAIGALEGQLASDKKKFQGISVQNVIDCSESTGNKGCSGG 188 >UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Cathepsin K - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 333 Score = 134 bits (323), Expect = 3e-30 Identities = 68/170 (40%), Positives = 99/170 (58%), Gaps = 1/170 (0%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 E W ++K+ H+ Y E++ R I+ ++ I HN++YE+G+ +Y LGMN +GDM Sbjct: 28 EAWESWKITHKREYNGLNEESIRRTIWEKNMLFIEAHNKEYELGIHTYDLGMNHFGDMTL 87 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANV-KLPEQVDWRKHGAVTDIKDQG 570 E + + G +Y + F+ V KLP+ +D+RK G VT +K+QG Sbjct: 88 EEVAEKVMGLQMP------MYRDPANT----FVPDDRVGKLPKSIDYRKLGYVTSVKNQG 137 Query: 571 KCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 CGSCW+FS+ GALEGQ + G LV LS QNL+DC + N+GC GG M Sbjct: 138 SCGSCWAFSSVGALEGQLMKTKGQLVDLSPQNLVDCVTE--NDGCGGGYM 185 >UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin L-like cysteine proteinase precursor - Acanthoscelides obtectus (Bean weevil) Length = 321 Score = 133 bits (322), Expect = 4e-30 Identities = 66/167 (39%), Positives = 103/167 (61%), Gaps = 2/167 (1%) Frame = +1 Query: 211 KEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDML 390 +E+W FK+QH Y + +E+ R +I+ + I +HN++Y G ++++G+N++GDM Sbjct: 20 QEKWQQFKIQHGRTYRTLLEEKRRFEIFKFNLRTIEEHNERYHNGEETFEMGINQFGDMT 79 Query: 391 HHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK-LPEQVDWRKHGAVTDIKDQ 567 EF + + A + + G +S NV +P+ VDWR+ GAVT++K Q Sbjct: 80 QEEFKRML------ALQKPQMPLPRGDE-----VSFDNVNDIPKTVDWREKGAVTEVKKQ 128 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSE-QYGNNGC 705 G CGSCW+FS G++EGQ F ++G L SLS QNL+DC+ +YGN GC Sbjct: 129 GNCGSCWAFSAVGSIEGQVFLKNGSLESLSAQNLVDCAGIEYGNFGC 175 >UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra erinaceieuropaei|Rep: Cysteine proteinase - Spirometra erinaceieuropaei (Tapeworm) Length = 336 Score = 133 bits (321), Expect = 5e-30 Identities = 72/173 (41%), Positives = 97/173 (56%), Gaps = 3/173 (1%) Frame = +1 Query: 211 KEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDML 390 +E W A+KL + Y S E+ R + + + I +HNQ+Y L SY + +N + D+ Sbjct: 29 RELWKAWKLAFKKEYFSSEEELHRKRAFFNNLDFIIRHNQRYYQQLESYAVRLNDFSDLT 88 Query: 391 HHEFVKT---MNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIK 561 EF + + G T K SV P LP+ V+WR+ GAVT +K Sbjct: 89 PGEFAERYLCLRGIVLTKLRRKEAV----SV-------PLKENLPDSVNWRERGAVTSVK 137 Query: 562 DQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 +QG+CGSCWSFS GA+EG ++G L SLSEQ L+DCS YGN GCNGGLM Sbjct: 138 NQGQCGSCWSFSANGAIEGAIQIKTGALRSLSEQQLMDCSWDYGNQGCNGGLM 190 >UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin L preproprotein; n=1; Monodelphis domestica|Rep: PREDICTED: similar to cathepsin L preproprotein - Monodelphis domestica Length = 356 Score = 132 bits (320), Expect = 6e-30 Identities = 71/166 (42%), Positives = 102/166 (61%) Frame = +1 Query: 217 EWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHH 396 EW A+K + NY SE E++FR +++ ++ +I HN+ ++ G SY +GMN++GDM Sbjct: 28 EWEAWKTTYGKNY-SEKEESFRRQVWEKNLKLINDHNRLFKEGKKSYFMGMNQFGDMTDK 86 Query: 397 EFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKC 576 EF +N + +N K R + +LP+ VDWR HG VT I++QG+C Sbjct: 87 EFESRLNLRIAPVRTRRNYTFK----RRIYY------RLPKSVDWRTHGYVTPIRNQGEC 136 Query: 577 GSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 G+CW+FST G+LEGQ FR++G LV LS+Q LIDCS Y C GG Sbjct: 137 GACWAFSTIGSLEGQLFRKTGRLVELSKQMLIDCSGYY---TCMGG 179 >UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; n=16; Chrysomelidae|Rep: Digestive cysteine protease intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 326 Score = 132 bits (320), Expect = 6e-30 Identities = 63/168 (37%), Positives = 97/168 (57%) Frame = +1 Query: 211 KEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDML 390 +++W AFK H Y++ +E+ R I+ + I +HN +Y+ G +Y LG+ ++ D+ Sbjct: 20 EDQWIAFKQTHGKTYKNLLEEKTRFGIFQRNLIKIKEHNARYDKGEETYLLGVTRFADLT 79 Query: 391 HHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQG 570 H EF + G K NK + + P ++++P+ +DW + GAV ++KDQ Sbjct: 80 HEEFKDILKGQIK----NKP------RLNATPTVFPEDLEVPDSIDWTEKGAVLEVKDQN 129 Query: 571 KCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 CGSCW+FS TGALEGQ+ + +SLSEQ L+DCS YGN C G Sbjct: 130 PCGSCWAFSATGALEGQNAILNNVKISLSEQQLLDCSAAYGNGNCKEG 177 >UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchocercidae|Rep: Cathepsin L-like precursor - Brugia pahangi (Filarial nematode worm) Length = 395 Score = 132 bits (320), Expect = 6e-30 Identities = 64/171 (37%), Positives = 102/171 (59%) Frame = +1 Query: 208 VKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDM 387 ++ EW + +Y+ + E+NFRM I+ ++ + + N+KYE GLVSY +N D+ Sbjct: 87 LETEWKDYVTALGKHYDQK-ENNFRMAIFESNELMTERINKKYEQGLVSYTTALNDLADL 145 Query: 388 LHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQ 567 EF+ NG + + ++G + + +LP+QVDWR GAVT +++Q Sbjct: 146 TDEEFM-VRNGLRLPNQTD----LRGKRQTSEFYRYDKSERLPDQVDWRTKGAVTPVRNQ 200 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 G+CGSC++F+T ALE H + +G L+ LS QN++DC+ GNNGC+GG M Sbjct: 201 GECGSCYAFATAAALEAYHKQMTGRLLDLSPQNIVDCTRNLGNNGCSGGYM 251 >UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schistosoma|Rep: Preprocathepsin cathepsin L - Schistosoma japonicum (Blood fluke) Length = 331 Score = 132 bits (319), Expect = 8e-30 Identities = 72/178 (40%), Positives = 100/178 (56%), Gaps = 1/178 (0%) Frame = +1 Query: 193 QFFDLVKEE-WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGM 369 Q +D +E W +KL++ Y S ++ R I+ I +HN ++++GL Y +G+ Sbjct: 17 QHYDKQYDEIWRQWKLKYNKTYTSNDDEMRRKMIFMRRIGKIQEHNLRHDLGLEGYTMGL 76 Query: 370 NKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAV 549 N++ DM E + M F K N L+ G+ + N +P DWR HGAV Sbjct: 77 NQFCDMEWEEVNRIM--FPKVFG-NSPLWNDDGNE-----LELTNKPVPSTWDWRDHGAV 128 Query: 550 TDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 T +K QG CGSCW+FS TGA+EGQ R+ LV LSEQ L+DC YGN+GC GG MD Sbjct: 129 TAVKHQGLCGSCWAFSATGAIEGQLRRKHKKLVKLSEQQLVDCRYNYGNDGCEGGTMD 186 >UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus|Rep: Cathepsin L - Aphrocallistes vastus Length = 329 Score = 130 bits (314), Expect = 3e-29 Identities = 68/168 (40%), Positives = 101/168 (60%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 W +KL++ +Y +++ R KI+A + + + N + SYKL N++ D+ + E Sbjct: 30 WEGWKLKYNRSYG--LDEELRKKIWANNMLYVKEFNAEGH----SYKLAANQFADLTNLE 83 Query: 400 FVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCG 579 + + G++ A+ ++ + G V K + LP VDWR G VT +K+QG+CG Sbjct: 84 YRQIYLGYDNEARLSRK---REGKVFQRKM---KDEDLPTTVDWRSKGVVTPVKNQGQCG 137 Query: 580 SCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 SCWSFS TG+LEGQ+ +SG LVS SEQ L+DCS GN+GC GGLMD Sbjct: 138 SCWSFSATGSLEGQYAIKSGKLVSFSEQELVDCSTSLGNHGCQGGLMD 185 >UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Toxopain-2 - Toxoplasma gondii Length = 422 Score = 129 bits (312), Expect = 6e-29 Identities = 71/171 (41%), Positives = 98/171 (57%) Frame = +1 Query: 211 KEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDML 390 ++ +S+F+ + +Y +E E R I+ + I HNQ+ SY L MN +GD+ Sbjct: 114 QDAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQG----YSYSLKMNHFGDLS 169 Query: 391 HHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQG 570 EF + GF K+ +NL V + ++ +LP VDWR G VT +KDQ Sbjct: 170 RDEFRRKYLGFKKS----RNLKSHHLGV-ATELLNVLPSELPAGVDWRSRGCVTPVKDQR 224 Query: 571 KCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 CGSCW+FSTTGALEG H ++G LVSLSEQ L+DCS GN C+GG M+ Sbjct: 225 DCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMN 275 >UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel protein - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 328 Score = 129 bits (311), Expect = 8e-29 Identities = 65/168 (38%), Positives = 95/168 (56%) Frame = +1 Query: 217 EWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHH 396 +W+ +K QH Y + E+ R ++ ++ I HN+ +GL SY LG+N+ DM Sbjct: 26 QWTTWKSQHNKTYRNTREERLRRSVWKQNLQDILLHNEAAAVGLHSYTLGLNQLSDMTAD 85 Query: 397 EFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKC 576 E V MNG + + N A F P+ LP++V+W +HG V+ +++QG C Sbjct: 86 E-VNDMNGLLEEDFPDVN----------ATFSPPSLQTLPQRVNWTEHGMVSPVQNQGPC 134 Query: 577 GSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 GSCW+FS G+LE Q R++ LV LS QNL+DCS GN GC GG + Sbjct: 135 GSCWAFSAVGSLEAQMKRRTAALVPLSAQNLLDCSVSLGNRGCKGGFL 182 >UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcoptes scabiei type hominis|Rep: Cathepsin L-like protease - Sarcoptes scabiei type hominis Length = 245 Score = 129 bits (311), Expect = 8e-29 Identities = 64/172 (37%), Positives = 100/172 (58%) Frame = +1 Query: 208 VKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDM 387 + +W+ FK ++ + + ++ R I+ + I KHN+KYE GL +Y+LG+N++ D+ Sbjct: 29 IDHQWTVFKAKYNRQFRTVYDELLRKLIFQRNYIYIRKHNEKYEAGLSTYELGVNQFTDL 88 Query: 388 LHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQ 567 + E+ MN KH+ ++ V + +S LP++VDW V IKDQ Sbjct: 89 TNKEYNDQMNRLK--VKHD----VQSEHVFDNEDVSD----LPDEVDWTLKNVVAPIKDQ 138 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 +CGSCW+FS ++E Q+ ++G LV LSEQ L+DCS GN GC+GG MD Sbjct: 139 KQCGSCWAFSAVASMESQNALKTGQLVELSEQELVDCSVGEGNEGCDGGWMD 190 >UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; n=6; Chrysomelinae|Rep: Cathepsin L-like proteinase precursor - Phaedon cochleariae (Mustard beetle) Length = 324 Score = 129 bits (311), Expect = 8e-29 Identities = 66/168 (39%), Positives = 93/168 (55%) Frame = +1 Query: 211 KEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDML 390 +E W+ FK H Y+S E+ R I+ + IA+HN KYE G +Y L +NK+ D+ Sbjct: 20 QELWADFKKTHARTYKSLREEKLRFNIFQDTLRQIAEHNVKYENGESTYYLAINKFSDIT 79 Query: 391 HHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQG 570 EF + M N+ ++ N + G + PE +DWR G V +++QG Sbjct: 80 DEEF-RDMLMKNEASRPN---------LEGLEVADLTVGAAPESIDWRSKGVVLPVRNQG 129 Query: 571 KCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 +CGSCW+ ST A+E Q +SG V LS Q L+DCS YGN+GCNGG Sbjct: 130 ECGSCWALSTAAAIESQSAIKSGSKVPLSPQQLVDCSTSYGNHGCNGG 177 >UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin L-like proteinase" precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 315 Score = 128 bits (310), Expect = 1e-28 Identities = 70/169 (41%), Positives = 98/169 (57%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 E+W++FK H +Y + +ED R ++ ++ I +HN KYE G +Y L +NK+ D Sbjct: 22 EKWTSFKATHNKSY-NVIEDKLRFAVFQDNLKKIEEHNAKYESGEETYYLAVNKFADWSS 80 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK 573 EF + + A K ++ AK ++ NV+ E+VDWR AV +KDQG+ Sbjct: 81 AEFQAMLA--RQMANKPKQSFI-------AKHVADPNVQAVEEVDWRD-SAVLGVKDQGQ 130 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 CGSCW+FSTTG+LEGQ V LSEQ L+DC + N GCNGGLM Sbjct: 131 CGSCWAFSTTGSLEGQLAIHKNQRVPLSEQELVDC-DTSRNAGCNGGLM 178 >UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3; Bilateria|Rep: Cathepsin L-like cysteine protease - Neobenedenia melleni Length = 335 Score = 127 bits (306), Expect = 3e-28 Identities = 60/169 (35%), Positives = 96/169 (56%) Frame = +1 Query: 217 EWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHH 396 +WS +K++++ +Y S ++ ++ ++++ + KHN+ Y G SY L MN D+ Sbjct: 26 QWSQWKVKYQKDYLSSEDELNKLLTWSKNLETVRKHNELYAQGKKSYTLAMNHMADLSSE 85 Query: 397 EFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKC 576 EF K + + G G P ++DW + G VT +K+Q +C Sbjct: 86 EF----KALYLVPKFDATKVPRKGKAAGEH--RQIKNDPPSEIDWVRKGHVTAVKNQAQC 139 Query: 577 GSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 GSCW+FS+TG++EG R +G L+S SEQ L+DCS +GN+GCNGG+MD Sbjct: 140 GSCWAFSSTGSIEGAVKRATGKLISFSEQQLVDCSTAFGNHGCNGGIMD 188 >UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|Rep: CG4847-PD, isoform D - Drosophila melanogaster (Fruit fly) Length = 420 Score = 127 bits (306), Expect = 3e-28 Identities = 59/172 (34%), Positives = 96/172 (55%), Gaps = 2/172 (1%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 +++ F Q Y S + +A K+++ N + G+ ++K +N + D+ H Sbjct: 110 QDFGDFLSQSGKTYLSAADRALHEGAFASTKNLVEAGNAAFAQGVHTFKQAVNAFADLTH 169 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK 573 EF+ + G ++ + K + K ++ +P+ DWR+HG VT +K QG Sbjct: 170 SEFLSQLTGLKRSPE------AKARAAASLKLVNLPAKPIPDAFDWREHGGVTPVKFQGT 223 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCS--EQYGNNGCNGGLMD 723 CGSCW+F+TTGA+EG FR++G L +LSEQNL+DC E +G NGC+GG + Sbjct: 224 CGSCWAFATTGAIEGHTFRKTGSLPNLSEQNLVDCGPVEDFGLNGCDGGFQE 275 >UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 4 - Rhipicephalus appendiculatus (Brown ear tick) Length = 345 Score = 126 bits (304), Expect = 6e-28 Identities = 60/168 (35%), Positives = 98/168 (58%), Gaps = 1/168 (0%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 W F+ + Y + E +R +++ + + ++K++ G + Y + +N + DM E Sbjct: 38 WDKFRKIYNKTYGTSEETVYREQVFRRTFNFLRTVDEKFKNGTLLYSVAVNHFADMTPDE 97 Query: 400 FVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCG 579 V G+ + + +P PE ++WR++G VT +K+QG+CG Sbjct: 98 VVANYTGYKPPSAQQ---------LAEIPLYAPLFGDTPEFIEWRENGFVTPVKNQGQCG 148 Query: 580 SCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCS-EQYGNNGCNGGLM 720 SCW+FS+TGALEGQ F+++ L+SLSEQNL+DC+ ++YGNNGCNGG M Sbjct: 149 SCWAFSSTGALEGQVFKRTRRLISLSEQNLMDCAGQRYGNNGCNGGQM 196 >UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 2 - Rhipicephalus appendiculatus (Brown ear tick) Length = 564 Score = 126 bits (303), Expect = 7e-28 Identities = 69/169 (40%), Positives = 90/169 (53%), Gaps = 1/169 (0%) Frame = +1 Query: 211 KEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDML 390 K + FK H+ YE + E + R I+ ++ I N+ + Y L +N D Sbjct: 258 KHSFEDFKETHKRTYELDTEHDRRRDIFRQNLRFIDSKNRAN----LGYNLAVNHLADRT 313 Query: 391 HHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPA-NVKLPEQVDWRKHGAVTDIKDQ 567 E + + G L K GS R F KLP+Q+DWR +GAVT +KDQ Sbjct: 314 REE-ISVLRG---------RLQSKDGSSRAEPFPRHRFTAKLPDQIDWRPYGAVTPVKDQ 363 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 CGSCWSF T G LEG +FR++G LV LSEQ L+DCS GNNGC+GG Sbjct: 364 AVCGSCWSFGTVGELEGAYFRKTGRLVRLSEQQLVDCSWNNGNNGCDGG 412 >UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicatein a3 - Lubomirskia baicalensis Length = 344 Score = 124 bits (300), Expect = 2e-27 Identities = 65/167 (38%), Positives = 97/167 (58%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 +EWS +K H+ +YES++++ R I+ +K I HN + L Y L MN +GD++ Sbjct: 42 QEWSVWKGHHQRSYESQLQEMERHSIWVANKKYIEHHNANAD--LFGYTLAMNGFGDLMS 99 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK 573 EF + T KH++ ++ F SP V + +DWR G VT ++ QG+ Sbjct: 100 AEFTERY----LTHKHSQRSGLQ-------TFESPKGVTYADSLDWRTRGVVTSVQSQGQ 148 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 CGS ++F+ GALEG + LV+LSEQN+IDCS YGN+GC+GG Sbjct: 149 CGSSYAFAAAGALEGATALAADKLVALSEQNIIDCSVPYGNHGCSGG 195 >UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=176; Viridiplantae|Rep: Cysteine proteinase RD21a precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 462 Score = 124 bits (299), Expect = 2e-27 Identities = 65/155 (41%), Positives = 95/155 (61%) Frame = +1 Query: 259 SEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAK 438 S VE + R +I+ ++ + +HN+K +SY+LG+ ++ D+ + E+ G AK Sbjct: 65 SLVEKDRRFEIFKDNLRFVDEHNEKN----LSYRLGLTRFADLTNDEYRSKYLG----AK 116 Query: 439 HNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEG 618 K KG ++ + +LPE +DWRK GAV ++KDQG CGSCW+FST GA+EG Sbjct: 117 MEK----KGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEG 172 Query: 619 QHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 + +G L++LSEQ L+DC Y N GCNGGLMD Sbjct: 173 INQIVTGDLITLSEQELVDCDTSY-NEGCNGGLMD 206 >UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|Rep: LD36817p - Drosophila melanogaster (Fruit fly) Length = 352 Score = 123 bits (297), Expect = 4e-27 Identities = 63/156 (40%), Positives = 93/156 (59%), Gaps = 1/156 (0%) Frame = +1 Query: 259 SEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAK 438 S+ E +R I+A +I N+ + G+ ++LG+N DM E + T+ G +K ++ Sbjct: 50 SDEERVYRESIFAAKMSLITLSNKNADNGVSGFRLGVNTLADMTRKE-IATLLG-SKISE 107 Query: 439 HNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK-CGSCWSFSTTGALE 615 + G + +PA+ LPE DWR+ G VT QG CG+CWSF+TTGALE Sbjct: 108 FGERY--TNGHINFVTARNPASANLPEMFDWREKGGVTPPGFQGVGCGACWSFATTGALE 165 Query: 616 GQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 G FR++G L SLS+QNL+DC++ YGN GC+GG + Sbjct: 166 GHLFRRTGVLASLSQQNLVDCADDYGNMGCDGGFQE 201 >UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio molitor (Yellow mealworm) Length = 336 Score = 123 bits (297), Expect = 4e-27 Identities = 67/175 (38%), Positives = 98/175 (56%), Gaps = 3/175 (1%) Frame = +1 Query: 208 VKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDM 387 V E+W FK + +Y + E+ FR +I+ + +HN+KY GLVSY LG+N + DM Sbjct: 23 VAEKWENFKTTYARSYVNAKEETFRKQIFQKKLETFEEHNEKYRQGLVSYTLGVNLFTDM 82 Query: 388 LHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFIS-PANVKLPEQVDWRKHGAVTDIKD 564 E +G A +KN G ++ + + A+V+ P DWR G V+ +K+ Sbjct: 83 TPEEMKAYTHGLIMPADLHKN----GIPIKTREDLGLNASVRYPASFDWRDQGMVSPVKN 138 Query: 565 QGKCGSCWSFSTTGALEGQH--FRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 QG CGSCW+FS+TGA+E Q +GY S+SEQ L+DC GC+GG M+ Sbjct: 139 QGSCGSCWAFSSTGAIESQMKIANGAGYDSSVSEQQLVDCVP--NALGCSGGWMN 191 >UniRef50_Q23H32 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 365 Score = 122 bits (295), Expect = 7e-27 Identities = 64/171 (37%), Positives = 101/171 (59%) Frame = +1 Query: 202 DLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYG 381 + + +E+ FK R Y ++ E ++R +I+AE+ + I +NQ E + +L +N++ Sbjct: 36 ETIMKEFQKFKKTFRKRY-ADSEGDYRFQIFAENYNYIHNYNQINENSQDNIQLEVNEFA 94 Query: 382 DMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIK 561 D+ EF + G+N + KHN + GS + + + +PE VDWR+ V ++ Sbjct: 95 DLSLQEFRELYFGYNSSKKHNNQ---QNGSTKNLRQSFLLSDSVPESVDWREK-LVAPVQ 150 Query: 562 DQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 QG CGSCW+FST ALEG + +Q+G ++ SEQNLIDC + NNGCNGG Sbjct: 151 KQGGCGSCWAFSTVIALEGAYAKQTGNVIKFSEQNLIDCC-RIENNGCNGG 200 >UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 protein, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to MGC81823 protein, partial - Ornithorhynchus anatinus Length = 361 Score = 122 bits (293), Expect = 1e-26 Identities = 55/108 (50%), Positives = 73/108 (67%) Frame = +1 Query: 397 EFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKC 576 EF MNG+ K A+ + S + F+ P + PE +DWR HG VT +KDQG+C Sbjct: 157 EFAAAMNGY-KAARGVE----ASASASASAFLGPNGTEPPEALDWRDHGYVTPVKDQGRC 211 Query: 577 GSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 GSCW+F +TG LEGQ FR++G L ++SEQNL+DCS + GN GC+GGLM Sbjct: 212 GSCWAFGSTGVLEGQLFRRTGRLAAVSEQNLMDCSRKQGNRGCDGGLM 259 >UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteinase A; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like tick cysteine proteinase A - Haemaphysalis longicornis (Bush tick) Length = 312 Score = 121 bits (292), Expect = 2e-26 Identities = 64/151 (42%), Positives = 90/151 (59%), Gaps = 4/151 (2%) Frame = +1 Query: 283 MKIYAEHKHIIAKHNQKYEMGLVSYKLG-MNKYGDMLHHEFVKTMNGFNKTAKHNKNLYM 459 +KI+ E+ ++AKHN KY GL ++G GD +V+ ++ A +N Sbjct: 22 VKIFTENTLLVAKHNAKYAKGLGVLQVGPWTSLGDFAA-AWVRQNGQWDTAASRTRN--- 77 Query: 460 KGGSVRGAKFISPANVK---LPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFR 630 G AN+ LP VDW + G+ +K+QG+CGSCW+FSTTG+LEGQHFR Sbjct: 78 -----SGPHLFHQANLNDSSLPTTVDWAQEGSRAPVKNQGQCGSCWAFSTTGSLEGQHFR 132 Query: 631 QSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 ++ V+ EQNL+DCS+ +GN GCNGGLMD Sbjct: 133 KTESRVT-GEQNLVDCSDDFGNQGCNGGLMD 162 >UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase) (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1; Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase) (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1; Vignain-2] - Vigna mungo (Rice bean) (Black gram) Length = 362 Score = 121 bits (292), Expect = 2e-26 Identities = 60/123 (48%), Positives = 79/123 (64%) Frame = +1 Query: 355 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 534 YKL +NK+ DM +HEF T G +K N + +G F+ +P VDWR Sbjct: 80 YKLKLNKFADMTNHEFRSTYAG----SKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWR 135 Query: 535 KHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 K GAVTD+KDQG+CGSCW+FST A+EG + ++ LVSLSEQ L+DC ++ N GCNGG Sbjct: 136 KKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKE-ENQGCNGG 194 Query: 715 LMD 723 LM+ Sbjct: 195 LME 197 >UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine proteinase precursor - Heterodera glycines (Soybean cyst nematode worm) Length = 353 Score = 121 bits (291), Expect = 2e-26 Identities = 65/150 (43%), Positives = 91/150 (60%), Gaps = 2/150 (1%) Frame = +1 Query: 280 RMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYM 459 RM + + K I HN +E G VS+K+ N ++H T +N+ + L M Sbjct: 68 RMNEFIKAKKFIDAHNLAFEKGEVSFKVAPNH---LMHF----TPAQYNRI----RGLQM 116 Query: 460 KGGSVRGAKFISPANVK-LPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQ-HFRQ 633 + R N LPE++DWR+ GAVT++KDQG CGSCW+FS TGA+EG ++ Sbjct: 117 RSNRQRHNMATLAGNSSTLPEKLDWREKGAVTEVKDQGDCGSCWAFSATGAIEGALAQKK 176 Query: 634 SGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 + ++SLSEQNL+DCS +YGN GC+GGLMD Sbjct: 177 ASKIISLSEQNLVDCSSKYGNEGCDGGLMD 206 >UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropicalis|Rep: LOC594890 protein - Xenopus tropicalis (Western clawed frog) (Silurana tropicalis) Length = 355 Score = 120 bits (289), Expect = 4e-26 Identities = 67/167 (40%), Positives = 97/167 (58%), Gaps = 2/167 (1%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 W + H+ Y++E E+ R I+ + I HN +Y MGL +Y++GMN GDM+ E Sbjct: 52 WRLWVQTHKKIYKNEGEELARRLIWEDTLKFIMLHNLEYSMGLHTYEVGMNHLGDMVAEE 111 Query: 400 FV-KTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKC 576 K MN + + ++ ++ IS ++ PE +DWR VT +KDQG C Sbjct: 112 MTDKQMNFIPQVIANITDVPVE---------ISKSSP--PESIDWRNKNCVTSVKDQGSC 160 Query: 577 GSCWSFSTTGALEGQHF-RQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 + W+FS+ GALE Q+ R++G L SLS QNL+DCS+ YGNNGC GG Sbjct: 161 IASWAFSSIGALECQNMKRRTGKLESLSVQNLLDCSQTYGNNGCKGG 207 >UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18; Magnoliophyta|Rep: Thiol protease aleurain precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 358 Score = 120 bits (289), Expect = 4e-26 Identities = 67/166 (40%), Positives = 94/166 (56%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 ++ F ++ Y++ E R I+ E+ +I N+K GL SYKLG+N++ D+ E Sbjct: 59 FARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKK---GL-SYKLGVNQFADLTWQE 114 Query: 400 FVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCG 579 F +T G A N + +KG LPE DWR+ G V+ +KDQG CG Sbjct: 115 FQRTKLG----AAQNCSATLKGSH-------KVTEAALPETKDWREDGIVSPVKDQGGCG 163 Query: 580 SCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGL 717 SCW+FSTTGALE + + G +SLSEQ L+DC+ + N GCNGGL Sbjct: 164 SCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGL 209 >UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1 precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 355 Score = 119 bits (287), Expect = 6e-26 Identities = 70/175 (40%), Positives = 97/175 (55%), Gaps = 1/175 (0%) Frame = +1 Query: 202 DLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEH-KHIIAKHNQKYEMGLVSYKLGMNKY 378 D + E + ++ +H Y+S E R +++ E+ HI ++N+ + SY LG+N++ Sbjct: 45 DKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNE-----INSYWLGLNEF 99 Query: 379 GDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDI 558 D+ H EF G K K A F LP+ VDWRK GAV + Sbjct: 100 ADLTHEEFKGRYLGLAKPQFSRKRQ-------PSANFRYRDITDLPKSVDWRKKGAVAPV 152 Query: 559 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 KDQG+CGSCW+FST A+EG + +G L SLSEQ LIDC + N+GCNGGLMD Sbjct: 153 KDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTF-NSGCNGGLMD 206 >UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep: LOC443661 protein - Xenopus laevis (African clawed frog) Length = 346 Score = 119 bits (286), Expect = 8e-26 Identities = 63/165 (38%), Positives = 86/165 (52%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 W + H+ Y+ E+ R I+ E I HN +Y +GL +Y++GMN GDM E Sbjct: 51 WQLWVKTHQKIYKDAEEERARRTIWEETLKFITVHNLEYSLGLHTYEVGMNHLGDMTGEE 110 Query: 400 FVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCG 579 TM G+ + N+ R K + A P +DWR G VT ++ Q KCG Sbjct: 111 VEATMTGYTSSDDSLANM------TRVPKKLLEAQP--PASIDWRTKGCVTSVRRQRKCG 162 Query: 580 SCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 SC++FS GALE Q ++ G LV+ S Q L+DCS GN GC GG Sbjct: 163 SCYAFSAVGALECQWKKKKGTLVTFSPQELVDCSYSEGNKGCKGG 207 >UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Rep: Cathepsin L - Stylonychia lemnae Length = 340 Score = 118 bits (284), Expect = 1e-25 Identities = 66/157 (42%), Positives = 91/157 (57%), Gaps = 1/157 (0%) Frame = +1 Query: 253 YESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKT 432 Y+S+ E R++ Y + I HN + + S+ LG N D H E+ K M G+ Sbjct: 53 YKSKEEFEMRLQQYKSNIAFINNHNSQNDG--TSFTLGPNHLADYTHDEY-KKMLGYKPR 109 Query: 433 AKHNKNLYMKGGSVRGAKFISPANVK-LPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGA 609 K K +Y S N+K +PE +DWR+ GAV +KDQG+CGSCW+FST + Sbjct: 110 NKTGKEVY------------STPNLKDIPESIDWREKGAVNAVKDQGQCGSCWAFSTIAS 157 Query: 610 LEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 LE ++F ++G L SLSEQ L+DCS+ GN GCNGG M Sbjct: 158 LESRYFIETGKLQSLSEQQLVDCSKN-GNEGCNGGDM 193 >UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Rep: Cysteine protease - Solanum lycopersicum (Tomato) (Lycopersicon esculentum) Length = 345 Score = 117 bits (281), Expect = 3e-25 Identities = 64/171 (37%), Positives = 90/171 (52%) Frame = +1 Query: 208 VKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDM 387 V E + +H Y+ EVE R I+ E+ I N+ G +SYKLGMN++ D+ Sbjct: 35 VSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKA---GNLSYKLGMNEFADI 91 Query: 388 LHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQ 567 EF+ G N + M S K ++ +P +DWR+ GAVT +K Q Sbjct: 92 TSQEFLAKFTGLNIPNSYLSPSPMS--STEFKKINDLSDDYMPSNLDWRESGAVTQVKHQ 149 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 G+CG CW+FS G+LEG + +G L+ SEQ L+DC+ N GCNGG M Sbjct: 150 GRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT--NNYGCNGGFM 198 >UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2); n=2; Tribolium castaneum|Rep: PREDICTED: similar to Cathepsin K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2) - Tribolium castaneum Length = 332 Score = 116 bits (279), Expect = 6e-25 Identities = 55/171 (32%), Positives = 95/171 (55%) Frame = +1 Query: 202 DLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYG 381 +LV+EEW+ FK H + +E+ FR ++ ++ I+ +HN+++ G +Y++G+NK+ Sbjct: 21 NLVEEEWNKFKAMHARAFFDPLEETFRKSLFTKNLEIVEEHNERFRNGSETYEMGVNKFS 80 Query: 382 DMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIK 561 D E + + G + + L + + + +DWR+ G VT +K Sbjct: 81 DFTDEE-LSNLTGLQVPLEFEQPL-----NETEDPLLPSLGRGISASLDWRQRGGVTPVK 134 Query: 562 DQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 +QG+CGSCW+F+T GA+E + + +SLSEQ L+DC + G GC GG Sbjct: 135 NQGQCGSCWAFATIGAIESHYKIRHKRAISLSEQQLVDCVGRGG--GCGGG 183 >UniRef50_Q24E33 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 328 Score = 116 bits (279), Expect = 6e-25 Identities = 60/169 (35%), Positives = 95/169 (56%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 E ++ FKL+H + +++ ED +R I+ ++ I N K ++KL +N + Sbjct: 40 EMYAEFKLEHNIVFQNSEEDLYRQNIFFQNVRYIQSENAKNN----TFKLAINIMAILTD 95 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK 573 E+ ++ +++ + V + + +P +V+W GAVT +K+QG Sbjct: 96 EEYSSLYLNLDQ----QESIDIFDSLVDDNETVGD----IPSEVNWTAQGAVTPVKNQGS 147 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 CGSCW+FSTTGALEG +F ++ L+S SEQ L+DCS Y N GCNGGLM Sbjct: 148 CGSCWAFSTTGALEGSYFLKNNQLISFSEQQLVDCSRLYLNMGCNGGLM 196 >UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]; n=11; Eutheria|Rep: Testin-2 precursor [Contains: Testin-1] - Mus musculus (Mouse) Length = 333 Score = 116 bits (279), Expect = 6e-25 Identities = 60/168 (35%), Positives = 96/168 (57%) Frame = +1 Query: 217 EWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHH 396 +W+ ++ +H Y E+ R ++ ++ +I HN +Y G + + MN +GD+ + Sbjct: 28 QWNEWRTKHGKAYNVN-EERLRRAVWEKNFKMIELHNWEYLEGKHDFTMTMNAFGDLTNT 86 Query: 397 EFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKC 576 EFVK M GF + +++ + +F+ +P+ VDWR G VT +K+QG C Sbjct: 87 EFVKMMTGFRRQKIKRMHVF------QDHQFLY-----VPKYVDWRMLGYVTPVKNQGYC 135 Query: 577 GSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 S W+FS TG+LEGQ F+++G LV LSEQNL+DC + C+GG M Sbjct: 136 ASSWAFSATGSLEGQMFKKTGRLVPLSEQNLLDCMGSNVTHDCSGGFM 183 >UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; Dictyostelium discoideum|Rep: Cysteine proteinase 2 precursor - Dictyostelium discoideum (Slime mold) Length = 376 Score = 116 bits (278), Expect = 8e-25 Identities = 70/169 (41%), Positives = 93/169 (55%) Frame = +1 Query: 217 EWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHH 396 EW+ L+ Y S N R I+ + + N K + V LG+N + D+ + Sbjct: 38 EWT---LKFNRQYSSSEFSN-RYSIFKSNMDYVDNWNSKGDSQTV---LGLNNFADITNE 90 Query: 397 EFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKC 576 E+ KT G A H+ N Y G V + + P+ +DWR AVT IKDQG+C Sbjct: 91 EYRKTYLGTRVNA-HSYNGY-DGREVLNVEDLQTN----PKSIDWRTKNAVTPIKDQGQC 144 Query: 577 GSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 GSCWSFSTTG+ EG H ++ LVSLSEQNL+DCS N GC+GGLM+ Sbjct: 145 GSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMN 193 >UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Rep: Cathepsin R precursor - Mus musculus (Mouse) Length = 334 Score = 116 bits (278), Expect = 8e-25 Identities = 62/166 (37%), Positives = 91/166 (54%) Frame = +1 Query: 217 EWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHH 396 EW +K+++ +Y + E+ + ++ E +I HN++ +G + + MN++GD Sbjct: 28 EWQDWKIKYNKSYSLK-EEKLKRVVWEEKLKMIKLHNRENSLGKNGFTMKMNEFGDQTDE 86 Query: 397 EFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKC 576 EF K M + MK R A I LP+ VDWRK G VT ++ QG C Sbjct: 87 EFRKMMIEISVWTHREGKSIMK----REAGSI------LPKFVDWRKKGYVTPVRRQGDC 136 Query: 577 GSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 +CW+F+ TGA+E Q Q+G L LS QNL+DCS+ GNNGC GG Sbjct: 137 DACWAFAVTGAIEAQAIWQTGKLTPLSVQNLVDCSKPQGNNGCLGG 182 >UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep: Cysteine proteinase - Cryptobia salmositica Length = 443 Score = 115 bits (277), Expect = 1e-24 Identities = 68/167 (40%), Positives = 91/167 (54%), Gaps = 2/167 (1%) Frame = +1 Query: 229 FKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVK 408 FK H NY S E+ R +I+A + A N+K M G N++ DM EF Sbjct: 28 FKAAHARNYASPDEERKRFEIFAGNMKKAAVLNRKNPMAT----FGPNEFADMTSEEFQT 83 Query: 409 TMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK--LPEQVDWRKHGAVTDIKDQGKCGS 582 N A+H K + K + +K + +Q+DWR GAVT +K+QG CGS Sbjct: 84 RHNA----ARHYAAA--KARPPKNTKTFTAEEIKAAVGQQIDWRLKGAVTPVKNQGACGS 137 Query: 583 CWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 CWSFSTTG +EGQH +G LV++SEQ L+ C ++GCNGGLMD Sbjct: 138 CWSFSTTGNIEGQHAIATGQLVAVSEQELVSCDPI--DDGCNGGLMD 182 >UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays (Maize) Length = 493 Score = 115 bits (276), Expect = 1e-24 Identities = 59/158 (37%), Positives = 95/158 (60%), Gaps = 4/158 (2%) Frame = +1 Query: 262 EVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTM----NGFNK 429 E +D R++++ ++ I HN + + GL ++LG+ ++ D+ E+ + G N Sbjct: 86 EDDDARRLEVFRDNLRYIDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSRGRNG 145 Query: 430 TAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGA 609 TA G V +++ A +LP+ VDWR+ GAV ++KDQG+CG CW+FS A Sbjct: 146 TAV---------GVVGRRRYLPLAGEQLPDAVDWRERGAVAEVKDQGQCGGCWAFSAVAA 196 Query: 610 LEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 +EG + +G L+SLSEQ LIDC +++ + GC+GGLMD Sbjct: 197 VEGINKIVTGSLISLSEQELIDC-DKFQDQGCDGGLMD 233 >UniRef50_Q239L8 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 323 Score = 115 bits (276), Expect = 1e-24 Identities = 65/168 (38%), Positives = 95/168 (56%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 WSAFK ++ Y + +R++I+ E+ ++ + + Y G+ ++ D+ E Sbjct: 48 WSAFKTKYNKKYADPDFERYRIEIFTENLKVVESNTKNY---------GITQFMDITREE 98 Query: 400 FVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCG 579 F +T L MK G ++ + F + + ++DW GAVT +KDQG+CG Sbjct: 99 FKQTY----------LTLKMKNG-LKASPFAKFNDAGV--EIDWTTKGAVTPVKDQGQCG 145 Query: 580 SCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 SCWSFSTTGA+EG F + L SLSEQ L+DCS+ GN GCNGGLMD Sbjct: 146 SCWSFSTTGAVEGALFLSTKKLTSLSEQYLVDCSKD-GNEGCNGGLMD 192 >UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35; Viridiplantae|Rep: Cysteine proteinase 15A precursor - Pisum sativum (Garden pea) Length = 363 Score = 113 bits (273), Expect = 3e-24 Identities = 68/178 (38%), Positives = 102/178 (57%), Gaps = 7/178 (3%) Frame = +1 Query: 211 KEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDML 390 + +++FK + +Y ++ E ++R ++ + I AK +Q + + + G+ K+ D+ Sbjct: 45 EHHFTSFKSKFSKSYATKEEHDYRFGVFKSNL-IKAKLHQNRDP---TAEHGITKFSDLT 100 Query: 391 HHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQG 570 EF + G K + + + A + N LPE DWR+ GAVT +KDQG Sbjct: 101 ASEFRRQFLGLKKRLRLPAH-------AQKAPILPTTN--LPEDFDWREKGAVTPVKDQG 151 Query: 571 KCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCS-----EQYG--NNGCNGGLMD 723 CGSCW+FSTTGALEG H+ +G LVSLSEQ L+DC EQ G ++GCNGGLM+ Sbjct: 152 SCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMN 209 >UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F like protease; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to cathepsin F like protease - Nasonia vitripennis Length = 1036 Score = 113 bits (272), Expect = 4e-24 Identities = 63/174 (36%), Positives = 99/174 (56%), Gaps = 2/174 (1%) Frame = +1 Query: 208 VKEE--WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYG 381 +KEE + F +++ Y ++ E R +I+ ++ ++I + Q+ EMG Y G+ ++ Sbjct: 725 LKEEILFHEFMGKYKKMYHNKEEKEMRFQIFKDNLNLI-EELQRNEMGTGRY--GVTQFT 781 Query: 382 DMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIK 561 D+ EF G T K ++ M ++ +++LP DWR H VT +K Sbjct: 782 DLTKAEFKARHLGLKPTLKSENDIPMPMATI--------PDIELPSDYDWRHHNVVTPVK 833 Query: 562 DQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 DQG CGSCW+FS TG +EGQ+ + G L+SLSEQ L+DC + ++GCNGGL D Sbjct: 834 DQGSCGSCWAFSVTGNIEGQYAIKHGELLSLSEQELVDCDKL--DSGCNGGLPD 885 >UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gambiae str. PEST|Rep: ENSANGP00000018713 - Anopheles gambiae str. PEST Length = 559 Score = 113 bits (271), Expect = 6e-24 Identities = 65/172 (37%), Positives = 93/172 (54%) Frame = +1 Query: 208 VKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDM 387 V+ + F+ HR Y S +E R I+ + I + N K+E G Y G+ K+ DM Sbjct: 245 VRRMFDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIEQLN-KFERGTAKY--GVTKFADM 301 Query: 388 LHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQ 567 E+ + G KH++ ++ G V + ++ LP DWR HGAVT++K+Q Sbjct: 302 TVAEY-RAHTGL-VVPKHDRANHV-GNRVASEEDVAGVG-DLPRSFDWRDHGAVTEVKNQ 357 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 G CGSCW+FS G +EG H ++ L S SEQ LIDC + +NGC GG MD Sbjct: 358 GSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCDKV--DNGCGGGYMD 407 >UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1; Dictyostelium discoideum AX4|Rep: Counting factor associated protein - Dictyostelium discoideum AX4 Length = 531 Score = 113 bits (271), Expect = 6e-24 Identities = 67/164 (40%), Positives = 88/164 (53%), Gaps = 2/164 (1%) Frame = +1 Query: 229 FKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVK 408 +K Q+ Y S+ E + R + + IIA HN K SYKLGMN Y D+ + EF Sbjct: 228 YKAQYNKEYSSQDEHDERFINFKAARKIIATHNAKES----SYKLGMNHYADLSNKEFNT 283 Query: 409 TMNGFNKTAKHNKNLYMKGGSVRGAKFISPANV--KLPEQVDWRKHGAVTDIKDQGKCGS 582 + K A+ SV GA + +P VDWR VT +KDQG CGS Sbjct: 284 LVKP--KVARP---------SVTGADSVHDDESLRSIPSTVDWRNQNCVTPVKDQGICGS 332 Query: 583 CWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 CW+F +TG+LEG + +G LVSLSEQ L+DC+ G+ GC GG Sbjct: 333 CWTFGSTGSLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGG 376 >UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-PA - Drosophila melanogaster (Fruit fly) Length = 549 Score = 112 bits (270), Expect = 7e-24 Identities = 65/170 (38%), Positives = 90/170 (52%), Gaps = 1/170 (0%) Frame = +1 Query: 208 VKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDM 387 V + + FK +H + Y S+ E R I+ ++ I N+ ++Y L +N D Sbjct: 241 VDKAFHHFKRKHGVAYHSDTEHEHRKNIFRQNLRYIHSKNR----AKLTYTLAVNHLADK 296 Query: 388 LHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQ 567 E +K G+ + +N K K+ ++P+Q DWR +GAVT +KDQ Sbjct: 297 TEEE-LKARRGYKSSGIYNTG---KPFPYDVPKYKD----EIPDQYDWRLYGAVTPVKDQ 348 Query: 568 GKCGSCWSFSTTGALEGQHF-RQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 CGSCWSF T G LEG F + G LV LS+Q LIDCS YGNNGC+GG Sbjct: 349 SVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDCSWAYGNNGCDGG 398 >UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheirus salmonis|Rep: Putative cathepsin L - Lepeophtheirus salmonis (salmon louse) Length = 257 Score = 112 bits (270), Expect = 7e-24 Identities = 54/119 (45%), Positives = 73/119 (61%) Frame = +1 Query: 367 MNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGA 546 MN+YGD+L EF++ G K + N + S +P V+W K+GA Sbjct: 1 MNQYGDLLQSEFLQGYTGLAKGSYSGDNTVILDNSA-----------PVPSYVNWTKNGA 49 Query: 547 VTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 VT +KDQ CGSCW+FSTTG++EGQ+F ++ L+S SEQ L+DCS + N GCNGG MD Sbjct: 50 VTAVKDQKDCGSCWAFSTTGSVEGQYFIKNKKLLSFSEQQLVDCSSDFRNEGCNGGWMD 108 >UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor; n=17; Magnoliophyta|Rep: Thiol protease aleurain-like precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 358 Score = 112 bits (270), Expect = 7e-24 Identities = 65/166 (39%), Positives = 95/166 (57%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 +S F ++ Y+S E R ++ E+ +I N+K GL SYKL +N++ D+ E Sbjct: 59 FSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKK---GL-SYKLSLNQFADLTWQE 114 Query: 400 FVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCG 579 F + G A N + +KG I+ A V P+ DWR+ G V+ +K+QG CG Sbjct: 115 FQRYKLG----AAQNCSATLKGSHK-----ITEATV--PDTKDWREDGIVSPVKEQGHCG 163 Query: 580 SCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGL 717 SCW+FSTTGALE + + G +SLSEQ L+DC+ + N GC+GGL Sbjct: 164 SCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGL 209 >UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber officinale (Ginger) Length = 475 Score = 112 bits (269), Expect = 1e-23 Identities = 54/171 (31%), Positives = 100/171 (58%), Gaps = 1/171 (0%) Frame = +1 Query: 205 LVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGD 384 ++ +EW +++HR + ++R++++ E+ + +HN + G +Y+LGMN++ D Sbjct: 50 IIYQEW---RVKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNRFAD 106 Query: 385 MLHHEF-VKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIK 561 + + E+ + + ++ + G + + +V LP+ +DWR+ GAV +K Sbjct: 107 LTNEEYRARFLRDLSRLGRSTS------GEISNQYRLREGDV-LPDSIDWREKGAVVAVK 159 Query: 562 DQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 +QG+CGSCW+F+ A+EG + +G L+SLSEQ L+DCS + N GC GG Sbjct: 160 NQGRCGSCWAFAAIAAVEGINQIVTGDLISLSEQQLVDCSTR--NYGCEGG 208 >UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; Dictyostelium discoideum|Rep: Cysteine proteinase 7 precursor - Dictyostelium discoideum (Slime mold) Length = 460 Score = 112 bits (269), Expect = 1e-23 Identities = 71/172 (41%), Positives = 94/172 (54%), Gaps = 2/172 (1%) Frame = +1 Query: 211 KEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDML 390 + ++ + + H+ +Y SE E N R I+ + + + N K + LG+N + D+ Sbjct: 27 RNAFTNWMIAHQRHYSSE-EFNGRYNIFKANMDYVNEWNTKGSETV----LGLNVFADIS 81 Query: 391 HHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQG 570 + E+ T G T +L M K + QVDWR GAVT IK+QG Sbjct: 82 NEEYRATYLG---TPFDASSLEM----TESDKIFDAS-----AQVDWRTQGAVTPIKNQG 129 Query: 571 KCGSCWSFSTTGALEGQHFRQSG--YLVSLSEQNLIDCSEQYGNNGCNGGLM 720 +CG CWSFSTTGA EG + +G LVSLSEQNLIDCS YGNNGC GGLM Sbjct: 130 QCGGCWSFSTTGATEGAQYLANGKKNLVSLSEQNLIDCSGSYGNNGCEGGLM 181 >UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; n=2; Danio rerio|Rep: hypothetical protein LOC550326 - Danio rerio Length = 531 Score = 111 bits (268), Expect = 1e-23 Identities = 61/162 (37%), Positives = 86/162 (53%) Frame = +1 Query: 229 FKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVK 408 FK + YESE E R ++ + +N+ GL +Y +G+N + D E + Sbjct: 232 FKEKFNRQYESEKEHEERENLFLHTFRFVHSNNRA---GL-TYSVGINHFADKTKEELAR 287 Query: 409 TMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCW 588 G K + +R ++ P VDWR +GAVT +KDQ CGSCW Sbjct: 288 MTGGL--LPKKEEKAQPFPSEIR--------SIATPNSVDWRLYGAVTPVKDQAVCGSCW 337 Query: 589 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 SF+TTG LEG F ++G L SLS+Q L+DC+ +GNNGC+GG Sbjct: 338 SFATTGTLEGALFLKTGQLTSLSQQMLVDCTWGFGNNGCDGG 379 >UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n=1; Xenopus tropicalis|Rep: UPI00006A2275 UniRef100 entry - Xenopus tropicalis Length = 272 Score = 111 bits (268), Expect = 1e-23 Identities = 59/158 (37%), Positives = 86/158 (54%), Gaps = 1/158 (0%) Frame = +1 Query: 253 YESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKT 432 Y S+ E+ R I+ E I+ HN +Y +GL +Y++GMN GDM E TM G+ + Sbjct: 1 YNSQEEERARRTIWEETLKFISVHNLEYSLGLHTYEVGMNHLGDMTGEEVAATMTGYTGS 60 Query: 433 AKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK-CGSCWSFSTTGA 609 N+ + A P +DWR VT ++DQG C SC++FS GA Sbjct: 61 GDSLANMSHVPKEILEAL--------APPSIDWRTQNCVTPVRDQGSFCRSCYAFSAVGA 112 Query: 610 LEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 LE Q +++ LV+ S Q L+DCS+ GN+GCNGG ++ Sbjct: 113 LECQWKKKTVRLVTFSPQELVDCSDGEGNHGCNGGKIE 150 >UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 111 bits (267), Expect = 2e-23 Identities = 61/174 (35%), Positives = 100/174 (57%), Gaps = 2/174 (1%) Frame = +1 Query: 199 FDLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKY 378 F+ +K E+ FK ++ L + E+ +R+ ++ E+ I N + G +S G+NK+ Sbjct: 32 FNKIKSEFENFKNRYNLEFNDIQEEQYRLFVFHENFKQIELDNMNSDNGFIS---GINKF 88 Query: 379 GDMLHHEF-VKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTD 555 + EF K +N + A MK S+ ++ + KLPE VDWRK GAV+ Sbjct: 89 SHLTKEEFKAKYLNRPQRPASE-----MKTNSILSSQ--QKTDEKLPESVDWRKLGAVSP 141 Query: 556 IKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSE-QYGNNGCNGG 714 ++DQG CGSC++F++TGALEG + ++G L S Q ++DC++ Q+ GC+GG Sbjct: 142 VRDQGNCGSCYAFASTGALEGLYQIKTGKLEVFSPQYIVDCAKHQFSRGGCHGG 195 >UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted, possible transmembrane domain near N-terminus; n=4; Cryptosporidium|Rep: Cryptopain-cysteine proteinase secreted, possible transmembrane domain near N-terminus - Cryptosporidium parvum Iowa II Length = 401 Score = 111 bits (266), Expect = 2e-23 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 1/171 (0%) Frame = +1 Query: 211 KEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDML 390 ++ + FK ++ Y S E+N R +IY ++ + I N + G SY L MN++GD+ Sbjct: 83 RKSFEEFKKKYHKVYSSMEEENQRFEIYKQNMNFIKTTNSQ---GF-SYVLEMNEFGDLS 138 Query: 391 HHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQG 570 EF+ G+ K +K ++ ++ K V ++ S P ++W + G V I++Q Sbjct: 139 KEEFMARFTGYIKDSKDDERVF-KSSRVSASE--SEEEFVPPNSINWVEAGCVNPIRNQK 195 Query: 571 KCGSCWSFSTTGALEGQHFRQSGY-LVSLSEQNLIDCSEQYGNNGCNGGLM 720 CGSCW+FS ALEG Q+ L SLSEQ +DCS+Q GN GC+GG M Sbjct: 196 NCGSCWAFSAVAALEGATCAQTNRGLPSLSEQQFVDCSKQNGNFGCDGGTM 246 >UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=15; Magnoliophyta|Rep: Cysteine proteinase RD19a precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 368 Score = 111 bits (266), Expect = 2e-23 Identities = 67/178 (37%), Positives = 93/178 (52%), Gaps = 7/178 (3%) Frame = +1 Query: 211 KEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDML 390 ++ +S FK + Y S E ++R ++ + +H QK + G+ ++ D+ Sbjct: 48 EDHFSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRH-QKLDPSATH---GVTQFSDLT 103 Query: 391 HHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQG 570 EF K G K K+ A + N LPE DWR HGAVT +K+QG Sbjct: 104 RSEFRKKHLGVRSGFKLPKD-------ANKAPILPTEN--LPEDFDWRDHGAVTPVKNQG 154 Query: 571 KCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYG-------NNGCNGGLMD 723 CGSCWSFS TGALEG +F +G LVSLSEQ L+DC + ++GCNGGLM+ Sbjct: 155 SCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMN 212 >UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 21 SCAF14577, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 478 Score = 110 bits (265), Expect = 3e-23 Identities = 64/162 (39%), Positives = 87/162 (53%) Frame = +1 Query: 229 FKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVK 408 FK + + YE + E R + + + + N+ GL SY LG+N D E Sbjct: 124 FKEKFQRQYEDDKEHELRQQAFIHNLRYVHSKNRA---GL-SYTLGLNSLSDRTMSELA- 178 Query: 409 TMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCW 588 TM G + N L F +V++PE +DWR +GAVT +KDQ CGSCW Sbjct: 179 TMRGRKQRKTTNAGLPFP--------FKLYQHVEVPESLDWRLYGAVTPVKDQAICGSCW 230 Query: 589 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 SF+TTG +EG F ++G L LS+Q LIDCS +GNN C+GG Sbjct: 231 SFATTGTIEGALFLKTGSLQVLSQQMLIDCSWGFGNNACDGG 272 Score = 39.5 bits (88), Expect = 0.083 Identities = 17/33 (51%), Positives = 22/33 (66%) Frame = +1 Query: 616 GQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 G + +G L LS+Q LIDCS +GNN C+GG Sbjct: 294 GPYLGMTGSLQVLSQQMLIDCSWGFGNNACDGG 326 >UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus tauri|Rep: Cysteine protease-1 - Ostreococcus tauri Length = 430 Score = 110 bits (264), Expect = 4e-23 Identities = 62/153 (40%), Positives = 91/153 (59%), Gaps = 5/153 (3%) Frame = +1 Query: 280 RMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYM 459 R+ +AE+ + +HN Y +G VS+ +G+N E+ + + G+ + + + M Sbjct: 120 RLATFAENAAYVVEHNALYAIGEVSHWVGLNSLAATTREEY-RALLGYKPELRSSGDAEM 178 Query: 460 KGGS----VRGAKFI-SPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQH 624 + V K A+V PE +DW + GAVT K+QG+CGSCW+FSTTGA+EG Sbjct: 179 LEATSTDKVEQYKASWEYASVDPPEAIDWVELGAVTPPKNQGQCGSCWAFSTTGAVEGIT 238 Query: 625 FRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 ++G LVSLSEQ ++ CS+Q N GCNGGLMD Sbjct: 239 KIRTGRLVSLSEQEMVSCSKQ--NMGCNGGLMD 269 >UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep: Silicatein beta - Suberites domuncula (Sponge) Length = 383 Score = 109 bits (263), Expect = 5e-23 Identities = 63/179 (35%), Positives = 93/179 (51%), Gaps = 11/179 (6%) Frame = +1 Query: 211 KEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDML 390 +E+W + H Y E + ++ +K I +HNQ + + Y L MNK+GD+ Sbjct: 53 EEDWKQWTTDHHKVYSDVRERVDKYTVWRANKEYIDQHNQNAQR--LGYTLKMNKFGDLT 110 Query: 391 HHEFVK---------TMNGFNKTAKHNKNLYMKGGS-VRGAKFISPANV-KLPEQVDWRK 537 EF++ N + KH + ++ G VRG V +PE +DWR Sbjct: 111 TKEFIEGYHCVQDYQPTNASHLNKKHKTHAFVDYGDFVRGGTGEGVRGVGNMPETMDWRT 170 Query: 538 HGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 G VT +KDQ +CGS ++FS +LEG + G LV+LSEQN++DCS YGN+GC G Sbjct: 171 SGVVTKVKDQLRCGSSYAFSAMASLEGINALSYGSLVTLSEQNIVDCSVTYGNHGCACG 229 >UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 392 Score = 109 bits (263), Expect = 5e-23 Identities = 62/176 (35%), Positives = 93/176 (52%), Gaps = 5/176 (2%) Frame = +1 Query: 202 DLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYG 381 DLV +++ F+ QH YE + E R I+ + I N++ + YKL N + Sbjct: 82 DLVDDDFDEFRQQHDKVYEDDSEHRRRKHIFRHNVRYIRSMNRRS----LPYKLEPNHFA 137 Query: 382 DMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIK 561 D+ EF + +K N + + + S ++P+Q+DWR +GAV K Sbjct: 138 DLTDDEFKSYKGALDDESKDVMNDH--DDVIDDDR--SKRMFEVPDQLDWRNYGAVNPAK 193 Query: 562 DQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCS-----EQYGNNGCNGG 714 QG CGSCW+F+T GA+E HF Q G L++L+EQ L+DC+ +GNNGC GG Sbjct: 194 GQGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCTWSTPGVYHGNNGCLGG 249 >UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|Rep: Cathepsin L precursor - Schistosoma mansoni (Blood fluke) Length = 319 Score = 109 bits (262), Expect = 7e-23 Identities = 66/170 (38%), Positives = 95/170 (55%) Frame = +1 Query: 208 VKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDM 387 V E++ FKL++R Y E ED R I+ + + A+ Q + G Y G+ Y D+ Sbjct: 16 VDEKYVQFKLKYRKQYH-ETEDEIRFNIFKSNI-LKAQLYQVFVRGSAIY--GVTPYSDL 71 Query: 388 LHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQ 567 EF +T + TA K ++ +P+ DWR+ GAVT++K+Q Sbjct: 72 TTDEFART----HLTASWVVPSSRSNTPTSLGKEVN----NIPKNFDWREKGAVTEVKNQ 123 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGL 717 G CGSCW+FSTTG +E Q FR++G L+SLSEQ L+DC ++GCNGGL Sbjct: 124 GMCGSCWAFSTTGNVESQWFRKTGKLLSLSEQQLVDCDGL--DDGCNGGL 171 >UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 336 Score = 109 bits (261), Expect = 9e-23 Identities = 57/168 (33%), Positives = 90/168 (53%), Gaps = 3/168 (1%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 ++ ++ ++ Y S E FR +I+ E I HN E +YKL N++ DM E Sbjct: 32 YNKWRYANKRTYFSLEEQQFRQQIFFETHERIQNHNSNPE---ATYKLAHNQFSDMPQEE 88 Query: 400 FVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCG 579 F + + +N + + + +V+LP DWR +G ++D+KDQG+CG Sbjct: 89 FASRVL-MKSSQLIPRNAVQAQNNNSTTQQHTAQDVQLPASFDWRDYGILSDVKDQGQCG 147 Query: 580 SCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC---SEQYGNNGCNGG 714 SCW+FSTTG LE +F ++ +S SEQ L+DC S + + GC+GG Sbjct: 148 SCWAFSTTGILEALYFMENRQKISFSEQQLVDCATNSNGFNSYGCSGG 195 >UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophyta|Rep: Cysteine protease Cp5 - Actinidia deliciosa (Kiwi) Length = 509 Score = 109 bits (261), Expect = 9e-23 Identities = 60/156 (38%), Positives = 89/156 (57%), Gaps = 2/156 (1%) Frame = +1 Query: 262 EVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEF--VKTMNGFNKTA 435 EVE F+ ++++ K+ ++ G + +G+NK+ DM + EF V T+ Sbjct: 67 EVEKKFQ-NFRDNLRYVMEKNGERGASG--GHLVGLNKFADMSNEEFREVYVSKVKKPTS 123 Query: 436 KHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALE 615 K + G AK ++ + P +DWRK+G VT +KDQG CGSCW+FS+TGA+E Sbjct: 124 KRMAIERRRQGKAAAAKAVAACDG--PTSLDWRKYGIVTGVKDQGDCGSCWAFSSTGAIE 181 Query: 616 GQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 G + +G L+SLSEQ L+DC N+GC GG MD Sbjct: 182 GINALANGDLISLSEQELVDCDST--NDGCEGGYMD 215 >UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster|Rep: CG5367-PA - Drosophila melanogaster (Fruit fly) Length = 338 Score = 108 bits (260), Expect = 1e-22 Identities = 57/169 (33%), Positives = 94/169 (55%), Gaps = 1/169 (0%) Frame = +1 Query: 211 KEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDML 390 K E+ FK + Y ++ K + E+ +I +HNQ Y+ G S++L N + DM Sbjct: 33 KSEFEKFKNNNNRKYLRTYDEMRSYKAFEENFKVIEEHNQNYKEGQTSFRLKPNIFADMS 92 Query: 391 HHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI-SPANVKLPEQVDWRKHGAVTDIKDQ 567 ++K GF + K N ++ + A+ + SP +PE +DWR G +T +Q Sbjct: 93 TDGYLK---GFLRLLKSN----IEDSADNMAEIVGSPLMANVPESLDWRSKGFITPPYNQ 145 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 CGSC++FS ++ GQ F+++G ++SLS+Q ++DCS +GN GC GG Sbjct: 146 LSCGSCYAFSIAESIMGQVFKRTGKILSLSKQQIVDCSVSHGNQGCVGG 194 >UniRef50_Q22RR8 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 107 bits (258), Expect = 2e-22 Identities = 63/175 (36%), Positives = 93/175 (53%), Gaps = 2/175 (1%) Frame = +1 Query: 205 LVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGD 384 + ++W F +H + Y++ +E+ H+ + + N K G +Y G+ K+ D Sbjct: 38 IAAQKWQEFLKKHSITYKT-IEEKL-------HRFAVFRDNLKKIEGHSNY--GITKFMD 87 Query: 385 MLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQV--DWRKHGAVTDI 558 + EF + +N + + A+ N+KL + + DW K GAVT + Sbjct: 88 LTSEEFQQRYLRLKTNTIKRQNFK---SNPKNAQL----NMKLGDDIIIDWTKKGAVTPV 140 Query: 559 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 KDQ +CGSCW+FS TGALE F +G L SLSEQ L+DCS YGN GC+GG MD Sbjct: 141 KDQEQCGSCWAFSATGALESATFISTGTLPSLSEQELVDCSTSYGNEGCDGGDMD 195 >UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Actinidin Act3a - Actinidia eriantha Length = 380 Score = 107 bits (257), Expect = 3e-22 Identities = 63/176 (35%), Positives = 94/176 (53%), Gaps = 2/176 (1%) Frame = +1 Query: 202 DLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYG 381 D V + ++ +++ +Y S E R++I+ E+ I +HN SY +G+N++ Sbjct: 36 DEVMALYESWLVKYGKSYNSLGEREMRIEIFKENLRFIDEHNADPNR---SYTVGLNQFA 92 Query: 382 DMLHHEFVKTMNGFNKTAKHN-KNLYM-KGGSVRGAKFISPANVKLPEQVDWRKHGAVTD 555 D+ E+ T GF + K N YM + G V LP+ VDWR GAV D Sbjct: 93 DLTDEEYRSTYLGFKSSLKSKVSNRYMPQVGEV------------LPDYVDWRTTGAVVD 140 Query: 556 IKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 +K+QG C SCW+F+T +E + +G L+SLSEQ L+DC+ N GC GG MD Sbjct: 141 VKNQGLCSSCWAFATIATVESINQIITGDLISLSEQELVDCNRTPINEGCKGGFMD 196 >UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue of Sarcophaga 26,29kDa proteinase; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to homologue of Sarcophaga 26,29kDa proteinase - Nasonia vitripennis Length = 553 Score = 107 bits (256), Expect = 4e-22 Identities = 61/162 (37%), Positives = 85/162 (52%) Frame = +1 Query: 229 FKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVK 408 FK H NY ++E R + + + I N+ + + L +N D E +K Sbjct: 251 FKKTHNKNYAHDLEHKQRKEHFRHNLRFIHSINRAN----LGFTLDVNHLADRNEAE-LK 305 Query: 409 TMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCW 588 + G T +H N G + + +P+ DWR +GAVT +KDQ CGSCW Sbjct: 306 VLRGKQYT-QHGYN-----GGMPFPHDVEKEKADVPDSFDWRLYGAVTPVKDQSVCGSCW 359 Query: 589 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 SF TTGA+EG +F + LV LS+Q LIDCS +GNNGC+GG Sbjct: 360 SFGTTGAVEGAYFMKYKKLVRLSQQALIDCSWGFGNNGCDGG 401 >UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 513 Score = 107 bits (256), Expect = 4e-22 Identities = 63/168 (37%), Positives = 86/168 (51%), Gaps = 3/168 (1%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 ++AFK +R Y S E R IY + I N+++ + Y L N DM E Sbjct: 210 FNAFKASYRKRYPSAHEHEKRKDIYRHNMRFIKSRNRQH----LGYSLKPNHMADMTDAE 265 Query: 400 FVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISP---ANVKLPEQVDWRKHGAVTDIKDQG 570 V M G L+ + + + F P V LP VDWRK GAV +K QG Sbjct: 266 -VNRMKGL---------LHEEPPLIGDSPFSIPDKDRGVPLPPHVDWRKAGAVNSVKSQG 315 Query: 571 KCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 CGSC++F+ GALEG HF ++G + LSEQ ++DC+ +GN GC GG Sbjct: 316 ICGSCYAFAVAGALEGAHFIKTGLKLDLSEQQIVDCTWGFGNRGCKGG 363 >UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine protease; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cysteine protease - Strongylocentrotus purpuratus Length = 494 Score = 106 bits (255), Expect = 5e-22 Identities = 65/176 (36%), Positives = 90/176 (51%) Frame = +1 Query: 193 QFFDLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMN 372 ++ DL + FK ++R N + E +R ++ ++ + NQ +E G Y G Sbjct: 151 EYRDLFDKFLMTFKREYRQN-DGTNEYEYRYSVFVQNMLTVEMFNQ-FEQGTAKY--GPT 206 Query: 373 KYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVT 552 K+ DM EF K +G K K + G V PE+ DWR HGAVT Sbjct: 207 KFADMTEAEFRKLQSGPLKKTGIKKQAAIPQGPV-------------PEEYDWRTHGAVT 253 Query: 553 DIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 +K+QG CGSCW+FS G +EGQ + G L+SLSEQ L+DC + G GC GG M Sbjct: 254 PVKNQGMCGSCWAFSAIGNMEGQWQIKKGELISLSEQELVDCDKVDG--GCEGGEM 307 >UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomuscorum|Rep: Cathepsin H - Oxytricha trifallax (Sterkiella histriomuscorum) Length = 366 Score = 106 bits (255), Expect = 5e-22 Identities = 56/146 (38%), Positives = 78/146 (53%) Frame = +1 Query: 280 RMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYM 459 R +A I KHN G +YK G+N + DM EF + +N A+ N Sbjct: 71 RKATFANKLQQIIKHNSD---GTNTYKKGLNAFSDMTDEEF---FDYYNIKAEQNC---- 120 Query: 460 KGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSG 639 S K +N +P + DWR G V+ +K+QGKCGSCW+FST G +E + + G Sbjct: 121 ---SATNRKSFGNSNANIPTEWDWRTFGVVSPVKNQGKCGSCWTFSTVGCVESHYLLKYG 177 Query: 640 YLVSLSEQNLIDCSEQYGNNGCNGGL 717 +LSEQ L+DC+ Y N+GC+GGL Sbjct: 178 AFRNLSEQQLVDCAGDYDNHGCSGGL 203 >UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa|Rep: Os09g0497500 protein - Oryza sativa subsp. japonica (Rice) Length = 349 Score = 105 bits (253), Expect = 8e-22 Identities = 61/175 (34%), Positives = 91/175 (52%), Gaps = 2/175 (1%) Frame = +1 Query: 202 DLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYG 381 DL+ + + + ++H Y E R ++Y + ++ N YKL NK+ Sbjct: 25 DLMLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSN----GYKLADNKFA 80 Query: 382 DMLHHEFVKTMNGFNK--TAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTD 555 D+ + EF M GF T N ++ G ++ LP+ VDWRK GAV + Sbjct: 81 DLTNEEFRAKMLGFRPHVTIPQISNTCSADIAMPGES----SDDILPKSVDWRKKGAVVE 136 Query: 556 IKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 +K+QG CGSCW+FS A+EG + ++G LVSLSEQ L+DC ++ GC GG M Sbjct: 137 VKNQGDCGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDE--AVGCGGGYM 189 >UniRef50_O16454 Cluster: Temporarily assigned gene name protein 196; n=4; Bilateria|Rep: Temporarily assigned gene name protein 196 - Caenorhabditis elegans Length = 477 Score = 105 bits (253), Expect = 8e-22 Identities = 61/160 (38%), Positives = 86/160 (53%) Frame = +1 Query: 238 QHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMN 417 +H Y ++ E R +++ ++ +I + QK E G Y G K+ DM EF K M Sbjct: 180 RHEKKYTNKREVLKRFRVFKKNAKVI-RELQKNEQGTAVY--GFTKFSDMTTMEFKKIML 236 Query: 418 GFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFS 597 + + + +Y + ++ LPE DWR+ GAVT +K+QG CGSCW+FS Sbjct: 237 PY----QWEQPVYPMEQANFEKHDVTINEEDLPESFDWREKGAVTQVKNQGNCGSCWAFS 292 Query: 598 TTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGL 717 TTG +EG F LVSLSEQ L+DC + GCNGGL Sbjct: 293 TTGNVEGAWFIAKNKLVSLSEQELVDCDSM--DQGCNGGL 330 >UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MGC107932 protein - Xenopus tropicalis (Western clawed frog) (Silurana tropicalis) Length = 333 Score = 105 bits (251), Expect = 1e-21 Identities = 57/168 (33%), Positives = 96/168 (57%), Gaps = 1/168 (0%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 +EW+A+K ++ Y + ++ R K + + KHNQ + GL SY++ MN++ D+ Sbjct: 25 QEWNAWKSKYEKKYVTLDKELNRRKAWEATWEKVQKHNQLADQGLKSYRMAMNQFADLTD 84 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK 573 +E + + K+L V+ + S ++ +P++VDWRK VT +K+QG Sbjct: 85 NE----RSSKSCLLPREKSL----NPVKAESY-SYTSITIPKEVDWRKSNCVTPVKNQGT 135 Query: 574 -CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 CGSCW+F+T G +E ++ ++ L++LSEQ L+DC E N GC GG Sbjct: 136 FCGSCWAFATVGVMESRYCIRTKELLNLSEQQLVDCDEI--NEGCCGG 181 >UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryza sativa|Rep: Putative cysteine proteinase - Oryza sativa subsp. japonica (Rice) Length = 352 Score = 105 bits (251), Expect = 1e-21 Identities = 58/170 (34%), Positives = 89/170 (52%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 ++W A +H Y+ E R +++ + +I + N G Y+L N++ D+ Sbjct: 43 DKWMA---EHGRTYKDAAEKARRFRVFKANVDLIDRSNAA---GNKRYRLATNRFTDLTD 96 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK 573 EF G+N +Y + +S + + P +VDWR+ GAVT +K+Q Sbjct: 97 AEFAAMYTGYNPA----NTMY---AAANATTRLSSEDDQQPAEVDWRQQGAVTGVKNQRS 149 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 CG CW+FST A+EG H +G LVSLSEQ L+DC++ N GC GG +D Sbjct: 150 CGCCWAFSTVAAVEGIHQITTGELVSLSEQQLLDCAD---NGGCTGGSLD 196 >UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: Cysteine protease - Saprolegnia parasitica Length = 523 Score = 105 bits (251), Expect = 1e-21 Identities = 56/148 (37%), Positives = 83/148 (56%) Frame = +1 Query: 280 RMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYM 459 R +++ + I HN+ S+ +G N+Y + EF K G + + + Sbjct: 47 RFEVFILNDQRIEAHNKDASS---SFTMGHNEYSHLTFDEFKKLRTGLRVSPSY---IQS 100 Query: 460 KGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSG 639 + A ++ +V P ++DW + G VT +K+QG CGSCW+FSTTGA+EG F S Sbjct: 101 RAKYALMAPAVNMTDV--PNEMDWVEQGGVTPVKNQGMCGSCWAFSTTGAIEGAAFVSSK 158 Query: 640 YLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 LVS+SEQ L+DC + G+ GCNGGLMD Sbjct: 159 QLVSVSEQELVDC-DHNGDMGCNGGLMD 185 >UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckeia)|Rep: Berghepain-2 - Plasmodium yoelii yoelii Length = 472 Score = 104 bits (250), Expect = 2e-21 Identities = 59/168 (35%), Positives = 88/168 (52%), Gaps = 3/168 (1%) Frame = +1 Query: 226 AFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFV 405 +F ++ Y S E R I++E I KHN++ + Y G+N + DM H EF Sbjct: 158 SFMKKYNKEYSSAEEMQERFYIFSEKLKKIEKHNKENHL----YTKGINAFSDMRHEEF- 212 Query: 406 KTMNGFNKTAKHNKNLYMKG---GSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKC 576 M N K N + ++ ++ K+ SP + DWR H A+ DIKDQ KC Sbjct: 213 -KMKYLNNKLKENHQIDLRHLIPYTIAINKYKSPTDQINYTSFDWRDHNAIIDIKDQQKC 271 Query: 577 GSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 SCW+F+T G + Q+ + VSLSEQ L+DC++ N GC+GG++ Sbjct: 272 ASCWAFATAGVVAAQYAIRKNQKVSLSEQQLVDCAQ--NNFGCDGGIL 317 >UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 344 Score = 104 bits (249), Expect = 3e-21 Identities = 59/171 (34%), Positives = 89/171 (52%), Gaps = 6/171 (3%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 ++ +K H + YE + +R I+ ++ + I +HN SY LG N DM H E Sbjct: 38 YNLWKKTHNVKYEDSSIEAYRKAIFLDNHNKIIEHNSDPSH---SYTLGHNHLSDMTHEE 94 Query: 400 F-VKTMNGFNKTAKHNKNLYMKGGSVRGAK-FISPA-NVKLPEQVDWRKHGAVTDIKDQG 570 F + +N +K +K G S + ++ P K +DWR A+T +K QG Sbjct: 95 FSLYQLNPARTASKSSKGGNNSGNSSGSSNPYVDPPITTKNAPPMDWRNASAITPVKQQG 154 Query: 571 KCGSCWSFSTTGALEGQHFRQSGY-LVSLSEQNLIDC--SEQYGNNGCNGG 714 KCGSCW+F++T LE F ++G L + SEQ ++DC Y +NGCNGG Sbjct: 155 KCGSCWTFASTAVLESFSFIKNGAPLTNFSEQQILDCVYGSGYYSNGCNGG 205 >UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome shotgun sequence; n=3; Tetraodontidae|Rep: Chromosome 12 SCAF14996, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 362 Score = 104 bits (249), Expect = 3e-21 Identities = 67/194 (34%), Positives = 97/194 (50%), Gaps = 24/194 (12%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 + W +K H Y E E+ +R ++ ++ I HN ++ MG SY+LGMN +GDM H Sbjct: 26 QHWELWKGWHSKQYH-EKEEGWRRMVWEKNLKKIELHNLEHSMGQHSYRLGMNHFGDMTH 84 Query: 394 HEFVKTMNGFNKTA--KHNKNLYMKGGSVRGAK--------FISPANVKL----PEQVDW 531 EF + MNG+ K +L+M+ + + +++P +L P + Sbjct: 85 EEFRQIMNGYKHKPQRKFRGSLFMEPNFLEAPRAVDWRDKGYVTPVKDQLKPVRPAEKGL 144 Query: 532 RKHGAVTDIKD-------QGKCGSCW---SFSTTGALEGQHFRQSGYLVSLSEQNLIDCS 681 +G T + + + GS W GQHFRQ+G LVSLSEQNL+DCS Sbjct: 145 PLYGVNTAVPELLLSGFASARPGSVWLLLGLQHHRGPGGQHFRQTGKLVSLSEQNLVDCS 204 Query: 682 EQYGNNGCNGGLMD 723 GN GCNGGLMD Sbjct: 205 RPEGNEGCNGGLMD 218 >UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; n=23; Magnoliophyta|Rep: Senescence-specific cysteine protease - Arabidopsis thaliana (Mouse-ear cress) Length = 346 Score = 104 bits (249), Expect = 3e-21 Identities = 61/162 (37%), Positives = 83/162 (51%) Frame = +1 Query: 238 QHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMN 417 +H Y E+N R ++ + I +H G ++KL +N++ D+ + EF Sbjct: 44 KHGRVYADVKEENNRYVVFKNNVERI-EHLNSIPAGR-TFKLAVNQFADLTNDEFRSMYT 101 Query: 418 GFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFS 597 GF + + K R S A LP VDWRK GAVT IK+QG CG CW+FS Sbjct: 102 GFKGVSALSSQSQTKMSPFRYQNVSSGA---LPVSVDWRKKGAVTPIKNQGSCGCCWAFS 158 Query: 598 TTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 A+EG + G L+SLSEQ L+DC + GC GGLMD Sbjct: 159 AVAAIEGATQIKKGKLISLSEQQLVDCDT--NDFGCEGGLMD 198 >UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; Phytophthora infestans|Rep: Cathepsin-like cysteine protease - Phytophthora infestans (Potato late blight fungus) Length = 376 Score = 104 bits (249), Expect = 3e-21 Identities = 60/176 (34%), Positives = 93/176 (52%), Gaps = 8/176 (4%) Frame = +1 Query: 211 KEEWSAF---KLQHRLNYESEVEDN----FRMKIYAEHKHIIAKHNQKYEMGLVSYKLGM 369 ++ W AF L + +Y ++ D+ R + +A + I HN+ YE G S+ LG+ Sbjct: 34 QKTWEAFVDYALDYEKSYRNDANDHDVVQLRFRSFATNLERIQTHNEAYERGEHSFTLGL 93 Query: 370 NKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK-LPEQVDWRKHGA 546 N D+ E+ + ++ + +K S F+ P NV+ LP DWR+H Sbjct: 94 NDLADLADAEYKQLLSYRTRDSK---------SSSASETFVKPENVEDLPATWDWREHST 144 Query: 547 VTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 VT +K+QG+CGSCW+FS A+E + +G L SLSEQ L+DC+ G + CN G Sbjct: 145 VTPVKNQGQCGSCWAFSAVAAMECAYALSTGTLESLSEQELVDCTLN-GIDTCNHG 199 >UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thaliana|Rep: F9P14.12 protein - Arabidopsis thaliana (Mouse-ear cress) Length = 343 Score = 103 bits (248), Expect = 3e-21 Identities = 64/172 (37%), Positives = 90/172 (52%) Frame = +1 Query: 208 VKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDM 387 +K+ + + H Y E R IY + +I N + + +KL N++ DM Sbjct: 39 LKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSLH----LPFKLTDNRFADM 94 Query: 388 LHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQ 567 + EF G N ++ L+ K V PA +P+ VDWR GAVT I++Q Sbjct: 95 TNSEFKAHFLGLNTSSLR---LHKKQRPV-----CDPAG-NVPDAVDWRTQGAVTPIRNQ 145 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 GKCG CW+FS A+EG + ++G LVSLSEQ LIDC N GC+GGLM+ Sbjct: 146 GKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLME 197 >UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 precursor; n=4; Schizophora|Rep: Putative cysteine proteinase CG12163 precursor - Drosophila melanogaster (Fruit fly) Length = 614 Score = 103 bits (248), Expect = 3e-21 Identities = 63/175 (36%), Positives = 95/175 (54%) Frame = +1 Query: 199 FDLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKY 378 FD V + F+++ Y S E R++I+ ++ I + N EMG S K G+ ++ Sbjct: 301 FDKVDHLFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNAN-EMG--SAKYGITEF 357 Query: 379 GDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDI 558 DM E+ K G + + GGS A + + +LP++ DWR+ AVT + Sbjct: 358 ADMTSSEY-KERTGLWQRDEAKAT----GGS---AAVVPAYHGELPKEFDWRQKDAVTQV 409 Query: 559 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 K+QG CGSCW+FS TG +EG + ++G L SEQ L+DC ++ CNGGLMD Sbjct: 410 KNQGSCGSCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTT--DSACNGGLMD 462 >UniRef50_Q5C1E8 Cluster: SJCHGC04937 protein; n=1; Schistosoma japonicum|Rep: SJCHGC04937 protein - Schistosoma japonicum (Blood fluke) Length = 235 Score = 103 bits (247), Expect = 4e-21 Identities = 56/156 (35%), Positives = 87/156 (55%), Gaps = 5/156 (3%) Frame = +1 Query: 268 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 447 E+ +R I+ + I HN Y++ LV+Y LG+N++ D+ E + T + NK Sbjct: 75 EEIYRRHIWNMYVSRIGLHNLHYDLNLVTYTLGINQFSDLTWIE-LSTFYLHELSVNLNK 133 Query: 448 NLYMKGGSV---RGAKFISP--ANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGAL 612 N + ++ + F + + + +P+ DWR VT++K+Q KCG W+F++ GAL Sbjct: 134 NKLLNSLNMFKLQSYNFTTTLLSTLNIPDNFDWRTKNVVTNVKNQEKCGCGWAFASVGAL 193 Query: 613 EGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 EGQ S L SLS Q L+DC++ YGN GC GLM Sbjct: 194 EGQMKLHSIPLQSLSTQQLVDCTQDYGNYGCASGLM 229 >UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; Dictyostelium discoideum|Rep: Cysteine proteinase 1 precursor - Dictyostelium discoideum (Slime mold) Length = 343 Score = 103 bits (247), Expect = 4e-21 Identities = 70/178 (39%), Positives = 93/178 (52%), Gaps = 10/178 (5%) Frame = +1 Query: 214 EEWSAF-KLQHRLNYESEVEDNF-RMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDM 387 EE S F + Q + N + E+ R +I+ + I + N K G+NK+ D+ Sbjct: 24 EEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADL 83 Query: 388 LHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQ 567 EF K NK A +L + +FI+ +P DWR GAVT +K+Q Sbjct: 84 SSDEF-KNYYLNNKEAIFTDDLPV--ADYLDDEFIN----SIPTAFDWRTRGAVTPVKNQ 136 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC--------SEQYGNNGCNGGL 717 G+CGSCWSFSTTG +EGQHF LVSLSEQNL+DC E+ + GCNGGL Sbjct: 137 GQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGL 194 >UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Oryza sativa|Rep: Cysteine protease 1 precursor - Oryza sativa subsp. japonica (Rice) Length = 490 Score = 103 bits (246), Expect = 6e-21 Identities = 57/154 (37%), Positives = 85/154 (55%), Gaps = 2/154 (1%) Frame = +1 Query: 268 EDNFRMKIYAEHKHIIAKHNQKY-EMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHN 444 E R +++ ++ + HN + E G ++LGMN++ D+ + EF T G + Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERG--GFRLGMNRFADLTNGEFRATYLGTTPAGR-- 139 Query: 445 KNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVT-DIKDQGKCGSCWSFSTTGALEGQ 621 G G + LP+ VDWR GAV +K+QG+CGSCW+FS A+EG Sbjct: 140 -------GRRVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGI 192 Query: 622 HFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 + +G LVSLSEQ L++C+ N+GCNGG+MD Sbjct: 193 NKIVTGELVSLSEQELVECARNGQNSGCNGGIMD 226 >UniRef50_Q23RT7 Cluster: Papain family cysteine protease containing protein; n=7; Hymenostomatida|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 387 Score = 102 bits (245), Expect = 8e-21 Identities = 60/157 (38%), Positives = 81/157 (51%), Gaps = 5/157 (3%) Frame = +1 Query: 268 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 447 E N R +I+ + I N E G YK G+N++ D E +T G++KT K+ Sbjct: 57 EYNQRKRIFEQKLKEIKAFNSNSENG---YKKGINQFTDRTAEELRETTLGYSKTVKNAA 113 Query: 448 NLYMKGGSVRGAKFISPANVK-LPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQH 624 N K R K NVK LP+ VDWR G VT +KDQG CGSCW+F+TT +E Sbjct: 114 N---KQNMFRNLKTSDKINVKDLPKSVDWRDAGVVTPVKDQGHCGSCWAFATTAVIESYA 170 Query: 625 FRQSGYLVSLSEQNLIDCSEQY----GNNGCNGGLMD 723 +G L +LS Q L+ C + G GCNG + + Sbjct: 171 AIATGQLKTLSTQQLVSCVQNSYQCGGQGGCNGAVSE 207 >UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dvir_CG5367 - Drosophila virilis (Fruit fly) Length = 298 Score = 102 bits (245), Expect = 8e-21 Identities = 56/162 (34%), Positives = 91/162 (56%), Gaps = 1/162 (0%) Frame = +1 Query: 232 KLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKT 411 K+ +R +Y ++ + Y E++ I+ +HN YE G S++L N DM ++K Sbjct: 1 KINNR-SYARSHDEMRSYEAYEENQIIVNEHNTYYETGKSSFRLATNTMADMNTDSYLK- 58 Query: 412 MNGFNKTAKHNKNLYMKGGSVRGAKFI-SPANVKLPEQVDWRKHGAVTDIKDQGKCGSCW 588 G+ + + + S A + SP +PE DWRK G +T + +Q CGSC+ Sbjct: 59 --GYLRLLRSPEI----SDSDNIADIVGSPLMNNVPESFDWRKKGFITPLYNQQSCGSCY 112 Query: 589 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 +FS ++EGQ F+++G +V+LSEQ ++DCS +GN GC GG Sbjct: 113 AFSIAQSIEGQVFKRTGKIVALSEQQIVDCSVSHGNQGCIGG 154 >UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicotyledons|Rep: Cysteine proteinase - Mesembryanthemum crystallinum (Common ice plant) Length = 367 Score = 102 bits (244), Expect = 1e-20 Identities = 58/145 (40%), Positives = 85/145 (58%), Gaps = 4/145 (2%) Frame = +1 Query: 298 EHKHIIAKHNQKY--EMGLVS--YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKG 465 +++ + K N KY E+ + YKL +N++GD+ EF +T +K + +N G Sbjct: 61 QNRFHVFKENVKYINEVNKMDKPYKLRLNQFGDLTPSEFARTYAN-SKIIEGTRN--ESG 117 Query: 466 GSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYL 645 G + NV++P +DWR GAVT +K+QG+CG CW+FS A+EG + +G L Sbjct: 118 GFMY-------ENVEVPRSIDWRVKGAVTPVKNQGRCGGCWAFSAAAAVEGINQITTGQL 170 Query: 646 VSLSEQNLIDCSEQYGNNGCNGGLM 720 +SLSEQ LIDC Q N+GC GG M Sbjct: 171 ISLSEQQLIDCDTQ--NSGCRGGTM 193 >UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina). Cathepsin L; n=2; Dictyostelium discoideum|Rep: Similar to Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina). Cathepsin L - Dictyostelium discoideum (Slime mold) Length = 265 Score = 101 bits (243), Expect = 1e-20 Identities = 47/120 (39%), Positives = 67/120 (55%) Frame = +1 Query: 361 LGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKH 540 + +N+Y D+ EF F K ++ + ++ F N +P+ DWR H Sbjct: 1 MDLNEYSDLTQKEFADKF--FEKLVPEPRSGPIN--DIKATPFKHNVNATIPKSFDWRDH 56 Query: 541 GAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 GAV +K+QG C SCWSFS GALEG ++ + G L+ LSEQNL+DC+ +G GC G M Sbjct: 57 GAVGKVKNQGSCASCWSFSALGALEGHYYIKYGELLDLSEQNLVDCATPFGPKGCKTGWM 116 >UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: Vivapain-4 - Plasmodium vivax Length = 484 Score = 101 bits (243), Expect = 1e-20 Identities = 62/166 (37%), Positives = 89/166 (53%), Gaps = 3/166 (1%) Frame = +1 Query: 229 FKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVK 408 F +H Y++E E R + E+ I HN K + YK G N+Y D+ EF K Sbjct: 169 FMKEHGKKYKTEEEMQQRYLAFTENLARINSHNSKAN---ILYKKGTNQYSDISFEEFRK 225 Query: 409 TMNG--FNKTAKHNKNLYMKGGSVRGAKFISPANVKLP-EQVDWRKHGAVTDIKDQGKCG 579 TM F+ K + Y+ K+ PA+ + E+ DWR+H AV++IK+Q CG Sbjct: 226 TMLTLRFDLKKKLANSPYVSNYDDVLKKY-KPADAVVDNEKYDWREHNAVSEIKNQNLCG 284 Query: 580 SCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGL 717 SCW+F GA+E Q+ + V +SEQ L+DCS++ N GC GGL Sbjct: 285 SCWAFGAVGAVESQYAIRKNQHVLISEQELVDCSDK--NFGCFGGL 328 >UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: Cysteine protease - Clonorchis sinensis Length = 328 Score = 101 bits (243), Expect = 1e-20 Identities = 64/173 (36%), Positives = 99/173 (57%), Gaps = 2/173 (1%) Frame = +1 Query: 202 DLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYG 381 D + + FKL+++ Y ++ +D R +I+ ++ + AK Q+ E G Y G+ ++ Sbjct: 26 DNARALYEEFKLKYKKTYSND-DDELRFEIFKDNL-LRAKRLQEMEQGTAQY--GVTQFS 81 Query: 382 DMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPA-NVKLP-EQVDWRKHGAVTD 555 D+ EF KT + L M+ ++ SP +V + E+ DWR+HGAV Sbjct: 82 DLTSEEF--------KT----RYLRMRFDGPIVSEDPSPEEDVTMDNEKFDWREHGAVGP 129 Query: 556 IKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 + DQGKCGSCW+FS G +EGQ FR++G L++LSEQ L+DC + GCNGG Sbjct: 130 VLDQGKCGSCWAFSVIGNVEGQWFRKTGDLLALSEQQLVDC--DHLEKGCNGG 180 >UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_184, whole genome shotgun sequence - Paramecium tetraurelia Length = 331 Score = 101 bits (243), Expect = 1e-20 Identities = 55/172 (31%), Positives = 90/172 (52%), Gaps = 3/172 (1%) Frame = +1 Query: 217 EWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHH 396 ++S++K H Y S+ E+ R ++A++ ++ +HN K+E+G ++ LGMN+Y D+ Sbjct: 33 QFSSWKQLHGKRY-SDFEEVHRFSVFAQNLAVVMEHNSKFELGQETFTLGMNQYADLTPE 91 Query: 397 EFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKC 576 EF + + KN+ G + P+ VDW K G +K+QG C Sbjct: 92 EFQASFLTLKTKVQDRKNVKSYSG------------LSFPDTVDW-KDGLT--VKNQGSC 136 Query: 577 GSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQ---YGNNGCNGGLMD 723 GSCW+F+ A+E V++SEQ +DC+ + Y + GCNGG MD Sbjct: 137 GSCWAFAAAAAIEAGFQHHKKNKVNISEQEFVDCTTEKLGYESQGCNGGWMD 188 >UniRef50_Q22LI1 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 987 Score = 101 bits (242), Expect = 2e-20 Identities = 57/171 (33%), Positives = 89/171 (52%), Gaps = 5/171 (2%) Frame = +1 Query: 217 EWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHH 396 E++ + +H ++ E + +R+ I+AE+ I +HN +++LG+N+Y M Sbjct: 30 EFNKWSAKHNKVFDPE-QLKYRLSIFAENYKKIKEHNYNSSN---TFQLGLNEYAHMTSQ 85 Query: 397 EFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKC 576 EF + + + K K + V + +DWR GAVT +K QGKC Sbjct: 86 EFAEVFLTPSISKSQQKQPKPKPQPQPHPNNSTNTTVTITP-IDWRNKGAVTSVKRQGKC 144 Query: 577 GSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC-----SEQYGNNGCNGG 714 GSCWSFS G +E + ++G L+ LSEQ L+DC + Y +NGCNGG Sbjct: 145 GSCWSFSAAGLMEAFQYFKTGNLIDLSEQQLVDCDNSSFDKSYYSNGCNGG 195 >UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_21, whole genome shotgun sequence - Paramecium tetraurelia Length = 349 Score = 101 bits (242), Expect = 2e-20 Identities = 59/177 (33%), Positives = 98/177 (55%), Gaps = 4/177 (2%) Frame = +1 Query: 202 DLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYG 381 D V + + ++ +H Y ++ E++ R I+ ++ I +H Q+ E GL +++LG+N + Sbjct: 34 DEVMKVYQNWQKEHGKRY-TQFENSHRFGIFKKNYQYIQEHQQRVEAGLETFELGLNDFA 92 Query: 382 DMLHHEFVKTMNGFNKTAKHNKN-LYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDI 558 D+ EF + T + N +Y + G ++P +VD RK G V+++ Sbjct: 93 DLSVEEFEAKYLKYRSTPREQTNQVYRRTGK------------QVPIEVDLRKDGVVSEV 140 Query: 559 KDQGKCGSCWSFSTTGALEGQHFRQSGYL-VSLSEQNLIDCS--EQYGNNGCNGGLM 720 K+QG CGSCW+FS ALE RQ G V LSEQ L+DC+ +++ + GC+GG M Sbjct: 141 KNQGSCGSCWAFSAVAALE-TALRQGGVKNVELSEQELVDCAVKDEFESEGCDGGEM 196 >UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; Eukaryota|Rep: Cysteine proteinase 1 precursor - Zea mays (Maize) Length = 371 Score = 101 bits (242), Expect = 2e-20 Identities = 58/164 (35%), Positives = 91/164 (55%), Gaps = 7/164 (4%) Frame = +1 Query: 250 NYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNK 429 +Y+ E +R+ ++ ++ + +++++ S + G+ K+ D+ EF +T G K Sbjct: 58 SYKDADEHAYRLSVFKDN----LRRARRHQLLDPSAEHGVTKFSDLTPAEFRRTYLGLRK 113 Query: 430 TAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGA 609 + + L G S A + P + LP+ DWR HGAV +K+QG CGSCWSFS +GA Sbjct: 114 SRR--ALLRELGESAHEAPVL-PTD-GLPDDFDWRDHGAVGPVKNQGSCGSCWSFSASGA 169 Query: 610 LEGQHFRQSGYLVSLSEQNLIDCSEQYG-------NNGCNGGLM 720 LEG H+ +G L LSEQ +DC + ++GCNGGLM Sbjct: 170 LEGAHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLM 213 >UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Plasmodium|Rep: Cysteine protease falcipain-3 - Plasmodium falciparum Length = 492 Score = 101 bits (241), Expect = 2e-20 Identities = 66/169 (39%), Positives = 87/169 (51%), Gaps = 7/169 (4%) Frame = +1 Query: 229 FKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEF-- 402 F ++ YE+ E R I++E+ I HN+K YK GMNK+GD+ EF Sbjct: 174 FLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKTNS---LYKRGMNKFGDLSPEEFRS 230 Query: 403 ----VKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPE-QVDWRKHGAVTDIKDQ 567 +KT F KT + V K PA+ KL DWR HG VT +KDQ Sbjct: 231 KYLNLKTHGPF-KTLSPPVSYEANYEDV--IKKYKPADAKLDRIAYDWRLHGGVTPVKDQ 287 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 CGSCW+FS+ G++E Q+ + L SEQ L+DCS + NNGC GG Sbjct: 288 ALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVDCSVK--NNGCYGG 334 >UniRef50_Q22A69 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 330 Score = 101 bits (241), Expect = 2e-20 Identities = 59/166 (35%), Positives = 86/166 (51%), Gaps = 1/166 (0%) Frame = +1 Query: 229 FKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVK 408 F + Y SE N R+ I+ E+ I N+ E + G+ ++ D+ H EF Sbjct: 33 FTQTYNKKYSSEEHYNARLSIFKENLRRIELFNKNDEA-----QHGITQFADLTHEEFAD 87 Query: 409 TMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCW 588 G+ ++++ S+ F +P +DW GAVT +K+QG CGSCW Sbjct: 88 MYLGYKPQLRNSQAKV----SLSSTPFTAPT------AIDWTTKGAVTPVKNQGSCGSCW 137 Query: 589 SFSTTGALEGQHFRQ-SGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 +FSTTG++EGQ+ Q L S SEQ L+DC + + GCNGGLMD Sbjct: 138 AFSTTGSIEGQYVLQLKQNLTSFSEQQLVDCDTK-EDQGCNGGLMD 182 >UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|Rep: Cathepsin F precursor - Homo sapiens (Human) Length = 484 Score = 101 bits (241), Expect = 2e-20 Identities = 60/164 (36%), Positives = 87/164 (53%), Gaps = 1/164 (0%) Frame = +1 Query: 229 FKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVK 408 F + + YES+ E +R+ ++ + + A+ Q + G Y G+ K+ D+ EF Sbjct: 190 FVITYNRTYESKEEARWRLSVFVNNM-VRAQKIQALDRGTAQY--GVTKFSDLTEEEF-- 244 Query: 409 TMNGFNKTAKHNKNLYMK-GGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSC 585 +T N L + G ++ AK + P + DWR GAVT +KDQG CGSC Sbjct: 245 ------RTIYLNTLLRKEPGNKMKQAKSVGDL---APPEWDWRSKGAVTKVKDQGMCGSC 295 Query: 586 WSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGL 717 W+FS TG +EGQ F G L+SLSEQ L+DC + + C GGL Sbjct: 296 WAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKM--DKACMGGL 337 >UniRef50_UPI000155637A Cluster: PREDICTED: similar to ENSANGP00000013730, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to ENSANGP00000013730, partial - Ornithorhynchus anatinus Length = 229 Score = 100 bits (240), Expect = 3e-20 Identities = 47/74 (63%), Positives = 55/74 (74%), Gaps = 1/74 (1%) Frame = +1 Query: 499 ANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHF-RQSGYLVSLSEQNLID 675 ANV LPE +DWR +GAVT +KDQ CGSCWSF+TTG LEG F + + LV LS+Q LID Sbjct: 51 ANVALPESLDWRLYGAVTPVKDQAVCGSCWSFATTGTLEGALFLKVTVQLVPLSQQMLID 110 Query: 676 CSEQYGNNGCNGGL 717 CS GN GC+GGL Sbjct: 111 CSWDVGNFGCDGGL 124 >UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_23, whole genome shotgun sequence - Paramecium tetraurelia Length = 321 Score = 100 bits (240), Expect = 3e-20 Identities = 63/175 (36%), Positives = 93/175 (53%) Frame = +1 Query: 199 FDLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKY 378 F ++K+ + ++ ++ Y ++ E +R IY ++ I N + SYK +NK+ Sbjct: 32 FKIIKQ-YQEWQQKYNKRYPTQNEQIYRFSIYQQNIMKIEDFNSQNN----SYKQKINKF 86 Query: 379 GDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDI 558 GD+ EF+ A+ KN+ K P V+ E+VDW + G V I Sbjct: 87 GDLTDQEFLTIYLNLQMPARV-KNIQ---------KNEEPFLVQ--EEVDWVQKGKVPAI 134 Query: 559 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 KDQG CGSCW+FS GALE Q +V LSEQ+L+DC+ YGN GC+GG M+ Sbjct: 135 KDQGDCGSCWAFSAVGALEINTKIQFNEIVDLSEQDLVDCAGPYGNAGCDGGWME 189 >UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 precursor; n=2; Arabidopsis thaliana|Rep: Probable cysteine proteinase At3g43960 precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 376 Score = 100 bits (240), Expect = 3e-20 Identities = 64/168 (38%), Positives = 89/168 (52%), Gaps = 1/168 (0%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 E+W +++ NY E R KI+ ++ I +HN SY+ G+NK+ D+ Sbjct: 42 EQWL---VENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNR---SYERGLNKFSDLTA 95 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTD-IKDQG 570 EF + G K K K S ++ LP++VDWR+ GAV +K QG Sbjct: 96 DEFQASYLG----GKMEK----KSLSDVAERYQYKEGDVLPDEVDWRERGAVVPRVKRQG 147 Query: 571 KCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 +CGSCW+F+ TGA+EG + +G LVSLSEQ LIDC N GC GG Sbjct: 148 ECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGG 195 >UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [Contains: Cathepsin H mini chain; Cathepsin H heavy chain; Cathepsin H light chain]; n=37; Eukaryota|Rep: Cathepsin H precursor (EC 3.4.22.16) [Contains: Cathepsin H mini chain; Cathepsin H heavy chain; Cathepsin H light chain] - Homo sapiens (Human) Length = 335 Score = 100 bits (240), Expect = 3e-20 Identities = 60/173 (34%), Positives = 94/173 (54%), Gaps = 2/173 (1%) Frame = +1 Query: 205 LVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGD 384 L K + ++ +HR Y +E E + R++ +A + I HN G ++K+ +N++ D Sbjct: 30 LEKFHFKSWMSKHRKTYSTE-EYHHRLQTFASNWRKINAHNN----GNHTFKMALNQFSD 84 Query: 385 MLHHEFV-KTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGA-VTDI 558 M E K + + K+ Y++G P VDWRK G V+ + Sbjct: 85 MSFAEIKHKYLWSEPQNCSATKSNYLRGTG------------PYPPSVDWRKKGNFVSPV 132 Query: 559 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGL 717 K+QG CGSCW+FSTTGALE +G ++SL+EQ L+DC++ + N+GC GGL Sbjct: 133 KNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGL 185 >UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin S preproprotein; n=2; Tribolium castaneum|Rep: PREDICTED: similar to cathepsin S preproprotein - Tribolium castaneum Length = 525 Score = 100 bits (239), Expect = 4e-20 Identities = 57/168 (33%), Positives = 83/168 (49%) Frame = +1 Query: 208 VKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDM 387 + +EW FK ++ Y + E+NFR I+ + I HN++Y GL +Y L +N D Sbjct: 221 LNKEWENFKRKYERRYPNLEEENFRRAIFEKTFQEIKHHNERYRKGLETYYLRINDLSDY 280 Query: 388 LHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQ 567 E M+ ++ A + S + LP+ VDWR G VT +K Q Sbjct: 281 TDEE----MSCCSEKAPKPSITILPNVSTSSRQ-------NLPKMVDWRLRGVVTPVKHQ 329 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNG 711 GKCG+CW+F+ GA E Q+ G V LSEQ L+DC + + C G Sbjct: 330 GKCGTCWAFAIIGATEAQYRIHRGSFVILSEQQLVDCVREV--SSCRG 375 Score = 81.8 bits (193), Expect = 2e-14 Identities = 36/71 (50%), Positives = 45/71 (63%) Frame = +1 Query: 511 LPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQY 690 LP+ VDWR G VT +K QGKCGSCW+F+ GA E + +Q G V LSEQ L+DC + Sbjct: 35 LPDMVDWRLQGVVTPVKRQGKCGSCWAFAILGATEAHYRKQRGSFVILSEQQLVDCVREV 94 Query: 691 GNNGCNGGLMD 723 G C G +D Sbjct: 95 GT--CKGVWLD 103 >UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 360 Score = 100 bits (239), Expect = 4e-20 Identities = 56/169 (33%), Positives = 92/169 (54%) Frame = +1 Query: 208 VKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDM 387 ++ + FK+++ Y+ + E+ +R ++ + I +HN K+ LV K+G+N++ D+ Sbjct: 41 IERAFKNFKVKYAKTYKDDTEEQYRFSVFTNNYVEIYRHN-KF---LVFSKVGVNQFADL 96 Query: 388 LHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQ 567 H EF G KH+K+ + + P + LP DWR GA+T +K Q Sbjct: 97 THEEFKALYTGH----KHSKD--DDDDDNKNKQPHLPTD-NLPASFDWRDKGAITPVKVQ 149 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 CG CW+FST ++EG +F ++G L SLS Q +IDC + +GC GG Sbjct: 150 NGCGGCWAFSTVQSIEGLYFLKTGKLESLSTQQVIDCC-RIDESGCLGG 197 >UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; Leishmania|Rep: Cysteine proteinase 2 precursor - Leishmania pifanoi Length = 444 Score = 99 bits (238), Expect = 6e-20 Identities = 59/168 (35%), Positives = 88/168 (52%), Gaps = 4/168 (2%) Frame = +1 Query: 229 FKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFV- 405 FK + YE+ E+ R+ + + ++ +H + + G+ K+ D+ EF Sbjct: 41 FKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHA----QFGITKFFDLSEAEFAA 96 Query: 406 KTMNG---FNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKC 576 + +NG F +H Y K + A +P+ VDWR+ GAVT +KDQG C Sbjct: 97 RYLNGAAYFAAAKRHAAQHYRKARADLSA---------VPDAVDWREKGAVTPVKDQGAC 147 Query: 577 GSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 GSCW+FS G +EGQ + LVSLSEQ L+ C + N+GC+GGLM Sbjct: 148 GSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDDM--NDGCDGGLM 193 >UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sativa|Rep: OSJNBb0085F13.15 protein - Oryza sativa subsp. japonica (Rice) Length = 383 Score = 99.5 bits (237), Expect = 7e-20 Identities = 59/188 (31%), Positives = 89/188 (47%), Gaps = 11/188 (5%) Frame = +1 Query: 193 QFFDLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMN 372 Q ++ + + + H +Y S E R ++Y + I N+ G +++KLG Sbjct: 47 QLMMMMMDRFHRWMATHNRSYASADEKLRRFEVYRSNMEFIEATNRN---GSLTFKLGET 103 Query: 373 KYGDMLHHEFVKTMNGFNKTAKHNKNLY-----------MKGGSVRGAKFISPANVKLPE 519 + D+ H EF+ T G + + + G V GA V +PE Sbjct: 104 PFTDLTHEEFLATYTGDVRLPPERRGMQDDSDEEDAVITTSAGYVAGAG-AGRRTVAVPE 162 Query: 520 QVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNN 699 VDWRK GAVT K QG+C +CW+F+ A+E H + G L+SLSEQ L+DC + G Sbjct: 163 SVDWRKEGAVTPAKHQGQCAACWAFAAVAAIESLHKIKGGDLISLSEQELVDCDDT-GEA 221 Query: 700 GCNGGLMD 723 C+ G D Sbjct: 222 TCSKGYSD 229 >UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis thaliana|Rep: Cysteine proteinase - Arabidopsis thaliana (Mouse-ear cress) Length = 348 Score = 99.1 bits (236), Expect = 1e-19 Identities = 58/170 (34%), Positives = 83/170 (48%), Gaps = 1/170 (0%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 E+W A + Y E E R I+ ++ + N + ++YK+ +N++ D+ Sbjct: 36 EQWMA---RFNRVYSDETEKRNRFNIFKKNLEFVQNFNMNNK---ITYKVDINEFSDLTD 89 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK-LPEQVDWRKHGAVTDIKDQG 570 EF T G + + G + NV E +DWR+ GAVT +K QG Sbjct: 90 EEFRATHTGLVVPEAITRISTLSSG--KNTVPFRYGNVSDNGESMDWRQEGAVTPVKYQG 147 Query: 571 KCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 +CG CW+FS A+EG G LVSLSEQ L+DC Y N GC GG+M Sbjct: 148 RCGGCWAFSAVAAVEGITKITKGELVSLSEQQLLDCDRDY-NQGCRGGIM 196 >UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaster|Rep: CG11459-PA - Drosophila melanogaster (Fruit fly) Length = 336 Score = 99.1 bits (236), Expect = 1e-19 Identities = 52/167 (31%), Positives = 89/167 (53%), Gaps = 1/167 (0%) Frame = +1 Query: 217 EWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHH 396 EW +K ++ Y + D + +Y + + HNQ Y G V++K+G+NK+ D Sbjct: 29 EWDQYKAKYNKQYRNR--DKYHRALYEQRVLAVESHNQLYLQGKVAFKMGLNKFSDTDQR 86 Query: 397 EFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQG-K 573 + + + N + +V ++ ++ E +DWR++G ++ + DQG + Sbjct: 87 ILFNYRSSIPAPLETSTNALTE--TVNYKRYD-----QITEGIDWRQYGYISPVGDQGTE 139 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 C SCW+FST+G LE ++ G LV LS ++L+DC Y NNGC+GG Sbjct: 140 CLSCWAFSTSGVLEAHMAKKYGNLVPLSPKHLVDC-VPYPNNGCSGG 185 >UniRef50_Q22RC9 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 389 Score = 98.7 bits (235), Expect = 1e-19 Identities = 61/157 (38%), Positives = 86/157 (54%) Frame = +1 Query: 208 VKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDM 387 VK+ +S FK +H+ Y +E+ R +I+ ++ II++ NQ E G Y G+ ++ DM Sbjct: 36 VKQLFSKFKAEHKKFYNF-LEEQRRFEIFRQNLDIISELNQ-VEEGTAEY--GITQFSDM 91 Query: 388 LHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQ 567 EF K+ T N G G + IS P DWR HGAVT +K+Q Sbjct: 92 TTEEF-KSQILIPSTYARN----FTGSRYHGFQKISQ---DAPTSYDWRDHGAVTPVKNQ 143 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC 678 G G+CW+FSTTG +EGQ F LVSLSE+ ++DC Sbjct: 144 GTVGTCWTFSTTGNIEGQWFLAGNPLVSLSEEQIVDC 180 >UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|Rep: Cruzipain precursor - Trypanosoma cruzi Length = 467 Score = 98.7 bits (235), Expect = 1e-19 Identities = 56/174 (32%), Positives = 86/174 (49%) Frame = +1 Query: 202 DLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYG 381 + + +++ FK +H YES E+ FR+ ++ E+ + H G+ + Sbjct: 32 ETLTSQFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHAT----FGVTPFS 87 Query: 382 DMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIK 561 D+ EF ++ HN + R + V P VDWR GAVT +K Sbjct: 88 DLTREEF--------RSRYHNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVK 139 Query: 562 DQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 DQG+CGSCW+FS G +E Q F L +LSEQ L+ C + ++GC+GGLM+ Sbjct: 140 DQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKT--DSGCSGGLMN 191 >UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease Gip1p; n=4; Tetrahymena thermophila|Rep: Granule-biosynthesis induced protease Gip1p - Tetrahymena thermophila Length = 345 Score = 98.3 bits (234), Expect = 2e-19 Identities = 54/172 (31%), Positives = 88/172 (51%), Gaps = 4/172 (2%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 ++ ++ ++ Y +E E +R ++ E+ + KH SY G+N++ DM E Sbjct: 40 YNKWRFNYKRVYLNEEEQIYRQIVFFENLASVNKHPSHK-----SYSKGLNQFSDMTKEE 94 Query: 400 FVKTM--NGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK 573 F + + +K A NK + + P N LP VDWRK G + +K+QG Sbjct: 95 FKQRVLNKKISKKASSNKGGRNLAADPAVSNLVFPTN-NLPLSVDWRKRGVLNPVKNQGT 153 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSE--QYGNNGCNGGLMD 723 CGSCW+F+T G LE + ++ L+ SEQ L+DC Y ++GC+GG + Sbjct: 154 CGSCWTFATAGILESFNQIKNKQLLKFSEQQLVDCVSLAGYDSDGCDGGFQE 205 >UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudicotyledons|Rep: Chymopapain precursor - Carica papaya (Papaya) Length = 352 Score = 98.3 bits (234), Expect = 2e-19 Identities = 59/165 (35%), Positives = 89/165 (53%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 + ++ L+H YES E +R +I+ ++ I + N+K SY LG+N + D+ + E Sbjct: 48 FDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNN----SYWLGLNGFADLSNDE 103 Query: 400 FVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCG 579 F K GF A+ L K ++ P+ +DWR GAVT +K+QG CG Sbjct: 104 FKKKYVGF--VAEDFTGLEHFDNEDFTYKHVT----NYPQSIDWRAKGAVTPVKNQGACG 157 Query: 580 SCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 SCW+FST +EG + +G L+ LSEQ L+DC + + GC GG Sbjct: 158 SCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH--SYGCKGG 200 >UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromeliaceae|Rep: Fruit bromelain precursor - Ananas comosus (Pineapple) Length = 351 Score = 98.3 bits (234), Expect = 2e-19 Identities = 59/167 (35%), Positives = 88/167 (52%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 EEW A ++ Y+ + E R +I+ + I N + E SY LG+N++ DM Sbjct: 38 EEWMA---EYGRVYKDDDEKMRRFQIFKNNVKHIETFNSRNEN---SYTLGINQFTDMTK 91 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK 573 EFV G + + + V IS +P+ +DWR +GAV ++K+Q Sbjct: 92 SEFVAQYTGVSLPLNIEREPVVSFDDVN----ISA----VPQSIDWRDYGAVNEVKNQNP 143 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 CGSCWSF+ +EG + ++GYLVSLSEQ ++DC+ Y GC GG Sbjct: 144 CGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVSY---GCKGG 187 >UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum|Rep: Falcipain 2 - Plasmodium falciparum Length = 484 Score = 97.9 bits (233), Expect = 2e-19 Identities = 54/159 (33%), Positives = 82/159 (51%), Gaps = 2/159 (1%) Frame = +1 Query: 253 YESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGF--N 426 Y S E R +++ ++ H + HN YK +N++ D+ +HEF + Sbjct: 176 YNSPNEMKERFQVFLQNAHKVNMHNNNKNS---LYKKELNRFADLTYHEFKNKYLSLRSS 232 Query: 427 KTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTG 606 K K++K L + K DWR H VT +KDQ CGSCW+FS+ G Sbjct: 233 KPLKNSKYLLDQMNYEEVIKKYRGEENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIG 292 Query: 607 ALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 ++E Q+ + L++LSEQ L+DCS + N GCNGGL++ Sbjct: 293 SVESQYAIRKNKLITLSEQELVDCS--FKNYGCNGGLIN 329 >UniRef50_Q22AB1 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 344 Score = 97.9 bits (233), Expect = 2e-19 Identities = 62/174 (35%), Positives = 87/174 (50%), Gaps = 6/174 (3%) Frame = +1 Query: 217 EWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHH 396 E+ FK + Y +E E + Y + I KH +M + K G K+ DM Sbjct: 32 EFEEFKSKFNKYYHNEHEHHSSFHNYKTSREHIVKH----QMENPNAKFGHTKFSDMSPE 87 Query: 397 EFVKTMNGFN----KTAKHNKNLYMKGGSVRG--AKFISPANVKLPEQVDWRKHGAVTDI 558 EF M F+ K AK ++ + +K ++G + + N LPE DWR G +T Sbjct: 88 EFENKMLNFDFSLFKKAK-SQGIKLKAEPMKGYLRQGENVDNSDLPESFDWRDKGIITPA 146 Query: 559 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 K Q CGSCW+F+TTG +E Q+ + G L+ SEQ L+DC N GC GGLM Sbjct: 147 KFQNTCGSCWTFATTGVIESQYALKYGELLHFSEQMLLDCDNI--NQGCRGGLM 198 >UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: Cathepsin L - Kudoa thyrsites Length = 300 Score = 97.9 bits (233), Expect = 2e-19 Identities = 60/172 (34%), Positives = 85/172 (49%) Frame = +1 Query: 202 DLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYG 381 D+ WSA KL+H + ++S E+ R+ + E+ I HN Y N Sbjct: 5 DVAIRLWSAHKLEHNIIFDSIEEERRRLCNFKENHQFI--HNFNLHNTHYHY-CRHNHLS 61 Query: 382 DMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIK 561 H E++ + K + + K I LP VDW+ G VT +K Sbjct: 62 HWSHEEYMAWLTLKPKLPVVSTPTHGITPKETATKDIKST---LPSSVDWKALGKVTSVK 118 Query: 562 DQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGL 717 +QG CGSCWSFS GA+E + ++G LV+ SEQ L+DCS + N+GCNGGL Sbjct: 119 NQGHCGSCWSFSAAGAIESAYAIKTGELVNFSEQQLVDCSTE--NHGCNGGL 168 >UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia circumcincta|Rep: Secreted cathepsin F - Teladorsagia circumcincta Length = 364 Score = 97.5 bits (232), Expect = 3e-19 Identities = 60/165 (36%), Positives = 86/165 (52%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 +++F +H Y +E E R I+ + II + Q+ + G Y G+N++ D+ E Sbjct: 64 FTSFIERHDKVYRNESEALKRFGIFKRNLEII-RSAQENDKGTAIY--GINQFADLSPEE 120 Query: 400 FVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCG 579 F KT + N + A+ + P LPE DWR+HGAVT +K +G C Sbjct: 121 FKKTHLPHTWKQPDHPNRIVD----LAAEGVDPKE-PLPESFDWREHGAVTKVKTEGHCA 175 Query: 580 SCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 +CW+FS TG +EGQ F LVSLS Q L+DC + GCNGG Sbjct: 176 ACWAFSVTGNIEGQWFLAKKKLVSLSAQQLLDC--DVVDEGCNGG 218 >UniRef50_Q23H10 Cluster: Papain family cysteine protease containing protein; n=14; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 336 Score = 97.5 bits (232), Expect = 3e-19 Identities = 58/177 (32%), Positives = 91/177 (51%), Gaps = 7/177 (3%) Frame = +1 Query: 205 LVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGD 384 L +WS+ Q++ Y +E E FR ++ E+ I +HN +Y + +N++ D Sbjct: 27 LAYNQWSS---QNQRVYLNEHEKLFRQMVFFENFQKIQEHNSDPNN---TYSVHLNQFSD 80 Query: 385 MLHHEFVKTMNGFNKTAKH-NKNLYMKG---GSVRGAKFISPANVKLPEQVDWRKHGAVT 552 M EF + + + H K + + + +S ++ L + +DWR GAVT Sbjct: 81 MTKEEFAEKILMKSDLVDHLMKGISQEATHNDTNNNETQLSSNSLTLADSIDWRTKGAVT 140 Query: 553 DIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC---SEQYGNNGCNGG 714 +K+QG CGSCWSFS +E +F Q+ LV SEQ L+DC + Y + GCNGG Sbjct: 141 SVKNQGGCGSCWSFSAAAVMESFNFIQNKALVDFSEQQLVDCVIPANGYNSYGCNGG 197 >UniRef50_Q17IZ6 Cluster: Procathepsin L3, putative; n=2; Culicidae|Rep: Procathepsin L3, putative - Aedes aegypti (Yellowfever mosquito) Length = 313 Score = 97.5 bits (232), Expect = 3e-19 Identities = 48/164 (29%), Positives = 84/164 (51%), Gaps = 6/164 (3%) Frame = +1 Query: 241 HRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNG 420 ++ Y+++ + R + + ++ I +HN YE G ++++G+N+ DM ++K M Sbjct: 38 YQKKYKAKYRMDRRKRAFKKNMQEIEEHNANYEQGKSTFQMGVNELADMDKSSYLKKMVR 97 Query: 421 FNKTAKHNK------NLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGS 582 H K + ++ + G +F+ +P+ +DWR G T +Q CGS Sbjct: 98 MTDAIDHRKLDVDFNDEMLQATNAFGEEFVQATQNSMPDSLDWRDKGFTTMAVNQKTCGS 157 Query: 583 CWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 C++FS AL GQ R+ G + +S Q ++DCS GN GC GG Sbjct: 158 CYAFSIGHALNGQIMRRIGRVEYVSTQQMVDCSTSAGNKGCAGG 201 >UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_79, whole genome shotgun sequence - Paramecium tetraurelia Length = 324 Score = 97.1 bits (231), Expect = 4e-19 Identities = 56/172 (32%), Positives = 86/172 (50%), Gaps = 3/172 (1%) Frame = +1 Query: 217 EWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHH 396 +++ +K+Q+ + SE E+ +R ++ ++ +I HN + G +Y + N++ D+ Sbjct: 35 QFNDWKIQYNKKFSSEKEEMYRYLVFQQNAQLIEAHNND-KSGKYTYTMETNQFADLTEQ 93 Query: 397 EFVKTMNGFN--KTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQG 570 EF + F T K Y+ G R DW + G V IKDQG Sbjct: 94 EFAQKYLTFRPKSTNKSKSTDYVPNGQAR----------------DWVEEGKVPPIKDQG 137 Query: 571 K-CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 CGS W+FS G LE + G +LSEQ+++DCS YGN GC+GG MD Sbjct: 138 SSCGSSWAFSAVGVLEINSNIEFGLETTLSEQDMLDCSGPYGNQGCSGGWMD 189 >UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 21 SCAF14577, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 406 Score = 96.7 bits (230), Expect = 5e-19 Identities = 52/153 (33%), Positives = 82/153 (53%), Gaps = 8/153 (5%) Frame = +1 Query: 280 RMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAK------- 438 R + + ++A+HN + G S+ L +N D++ + + ++ + Sbjct: 70 RRAAWERNARLVARHNLEASAGKHSFTLELNHLADLVRRVLLLQPSLASERVRLTAEEIN 129 Query: 439 HNKNLYMKGGS-VRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALE 615 NL ++ + VR + P VDWRK G V+ +++QG C SCW+FS+ GALE Sbjct: 130 EMNNLKVEERAPVRNGTSEEKLGFETPPSVDWRKAGLVSPVQNQGFCNSCWAFSSLGALE 189 Query: 616 GQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 GQ +++G+LV LS QNL+DCS GN GC GG Sbjct: 190 GQMKKRTGFLVPLSPQNLLDCSISDGNLGCRGG 222 >UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa (Rice) Length = 339 Score = 96.7 bits (230), Expect = 5e-19 Identities = 59/160 (36%), Positives = 81/160 (50%), Gaps = 3/160 (1%) Frame = +1 Query: 253 YESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKT 432 Y+ E R +I+ + I + + G + L +N++ D+ ++EF + Sbjct: 48 YKDATEKARRFEIFKANVAFI----ESFNAGNHKFWLSVNQFADLTNYEF--------RA 95 Query: 433 AKHNKNLYMKGGSVRGAKFISPANVK---LPEQVDWRKHGAVTDIKDQGKCGSCWSFSTT 603 K NK +VR NV LP VDWR GAVT IKDQG+CG CW+FS Sbjct: 96 TKTNKGFIPS--TVRVPTTFRYENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCWAFSAV 153 Query: 604 GALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 A+EG +G L+SLSEQ L+DC + GC GGLMD Sbjct: 154 AAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMD 193 >UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegleria fowleri|Rep: Cysteine proteinase homolog - Naegleria fowleri Length = 347 Score = 96.7 bits (230), Expect = 5e-19 Identities = 55/128 (42%), Positives = 73/128 (57%), Gaps = 9/128 (7%) Frame = +1 Query: 364 GMNKYGDMLHHEFVKTMNGFNKTAKHNKN-LYMKGGSVRGAKFISPANVKLPEQVDWRKH 540 G+ K+ D+ EF + T + K L +V K + A P DWR+H Sbjct: 76 GITKFSDLTPEEFKRMFLMKTYTPEEAKKILAAPQHAVLSEKEVQTA----PTSFDWRQH 131 Query: 541 GAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC--------SEQYGN 696 GAVT +K+QG CGSCW+FSTTG +EGQ + G LVSLSEQ L+DC ++Q + Sbjct: 132 GAVTRVKNQGACGSCWTFSTTGNVEGQWAIKKGKLVSLSEQQLVDCDHNCVTYQNQQACD 191 Query: 697 NGCNGGLM 720 +GCNGGLM Sbjct: 192 SGCNGGLM 199 >UniRef50_Q23894 Cluster: Cysteine proteinase 3; n=2; Dictyostelium discoideum|Rep: Cysteine proteinase 3 - Dictyostelium discoideum (Slime mold) Length = 151 Score = 96.7 bits (230), Expect = 5e-19 Identities = 54/120 (45%), Positives = 73/120 (60%) Frame = +1 Query: 361 LGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKH 540 LG+N++ D+ + E+ +N A N Y K G + P + K P VDWR+ Sbjct: 31 LGLNQHADLSNEEY--RLNYLGTRAHIKLNGYHKRNL--GLRLNRP-HFKQPLNVDWREK 85 Query: 541 GAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 AVT +KDQG+CGSC STTG++EG ++G LVSLSEQN++ S +GN GCNGGLM Sbjct: 86 DAVTPVKDQGQCGSC-IISTTGSVEGVTAIKTGKLVSLSEQNILRLSSSFGNEGCNGGLM 144 >UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1; Brugia malayi|Rep: Cathepsin F-like cysteine proteinase - Brugia malayi (Filarial nematode worm) Length = 461 Score = 96.3 bits (229), Expect = 7e-19 Identities = 62/167 (37%), Positives = 82/167 (49%) Frame = +1 Query: 217 EWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHH 396 ++ F + + Y S E R +IY ++ + AK Q E G Y G K+ DM Sbjct: 158 DFMTFIKKFKREYSSIEEQLDRFRIYLQNMNF-AKKLQFEEKGTAIY--GATKFSDMTAE 214 Query: 397 EFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKC 576 EF K M + N G + + LP + DWR G VT +KDQG C Sbjct: 215 EFQKIMLPSIWWDRVESN-----GITFNLNDFNLSIYNLPSKFDWRTEGVVTPVKDQGSC 269 Query: 577 GSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGL 717 GSCW+FS TG +E ++G L+SLSEQ LIDC + GCNGGL Sbjct: 270 GSCWAFSVTGNIESLWAIKTGKLISLSEQELIDC--DVIDKGCNGGL 314 >UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endopterygota|Rep: Cathepsin L-like proteinase - Bombyx mori (Silk moth) Length = 402 Score = 95.9 bits (228), Expect = 9e-19 Identities = 52/172 (30%), Positives = 88/172 (51%), Gaps = 2/172 (1%) Frame = +1 Query: 205 LVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGD 384 L + W +K H Y S + + + ++ +A+HN++Y G+ SY L +N +GD Sbjct: 95 LPRRHWHEYKAIHNKLYSSTHHEMAALMKWRQNLRRVARHNREYLAGIQSYSLHLNHFGD 154 Query: 385 MLHHEFVKTMNGFNKTAKHNKN--LYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDI 558 M E+ F K K K L+ + K+P+++DWR G Sbjct: 155 MHVTEY------FGKVLKLIKAFPLFDPAEDHHKTAYRHNRRCKVPKRIDWRDQGFKPRR 208 Query: 559 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 ++Q +CG+C++F+ T AL+ Q +++ G LS Q ++DCS + GN GC+GG Sbjct: 209 EEQWQCGACYAFAVTHALQAQLYKRHGEWNELSPQQIVDCSIKDGNMGCDGG 260 >UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to CG5367-PA - Nasonia vitripennis Length = 362 Score = 95.5 bits (227), Expect = 1e-18 Identities = 53/167 (31%), Positives = 87/167 (52%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 E W +K++H Y +E R + + ++ I +HN G Y L N D+ Sbjct: 59 EYWHLYKMRHNKTYTGTLEA-VRREAWEDNLLKIYEHNLLAAAGHHEYILRDNHIADLST 117 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK 573 +++ + + + + + A P ++P+ +DWR+ G VT ++Q Sbjct: 118 SSYMRELVKLVPSRRRR----LDDDEMVAAVLHDPR--RIPKSLDWREKGFVTKPENQRD 171 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 CGSC+++S G++ GQ FRQ+G +V LSEQ L+DCS Q GN GC+GG Sbjct: 172 CGSCYAYSIAGSIAGQIFRQTGIVVPLSEQQLVDCSTQTGNLGCSGG 218 >UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sativa|Rep: Cysteine proteinase-like - Oryza sativa subsp. japonica (Rice) Length = 360 Score = 95.1 bits (226), Expect = 2e-18 Identities = 52/154 (33%), Positives = 82/154 (53%) Frame = +1 Query: 253 YESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKT 432 Y E RM+++A + + N+ G +Y LG+N++ D+ EF +T G++ Sbjct: 54 YADAAEKARRMEVFAANAERVDAANRAG--GDRTYTLGLNQFSDLTDDEFAQTHLGYSWA 111 Query: 433 AKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGAL 612 + + G + + +P+ VDWR GAVT++K+Q CGSCW+F+ A Sbjct: 112 PPPPSHRHGHRAE-NGTAAAAADDTDVPDSVDWRARGAVTEVKNQRSCGSCWAFAAVAAT 170 Query: 613 EGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 EG +G LVSLSEQ ++DC+ G N C+GG Sbjct: 171 EGLVQLATGNLVSLSEQQVLDCTG--GANTCSGG 202 >UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin l; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin l - Strongylocentrotus purpuratus Length = 489 Score = 94.7 bits (225), Expect = 2e-18 Identities = 49/122 (40%), Positives = 67/122 (54%) Frame = +1 Query: 349 VSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVD 528 + Y L +N D H E +K M G + + N L G V ++ +P+ +D Sbjct: 222 LGYVLDINHMADQSHQE-LKRMRGRLRQTRPNNGLPYDGSDV--------SDDAVPDHID 272 Query: 529 WRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCN 708 W GAV+ +KDQ CGSCWSF + +EG F QSG V LS+Q L+DC+ GNNGC+ Sbjct: 273 WNVLGAVSPVKDQAVCGSCWSFGSAETIEGAVFMQSGKRVRLSQQMLMDCTWAAGNNGCD 332 Query: 709 GG 714 GG Sbjct: 333 GG 334 >UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Liliopsida|Rep: Putative cysteine proteinase - Oryza sativa subsp. japonica (Rice) Length = 416 Score = 94.7 bits (225), Expect = 2e-18 Identities = 60/164 (36%), Positives = 92/164 (56%), Gaps = 4/164 (2%) Frame = +1 Query: 202 DLVKEE--WSAFKLQHRLNYES-EVED-NFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGM 369 DL EE WS ++ + S ++ D R +++ + I + NQK + G+ SY LG+ Sbjct: 15 DLETEESMWSLYERWRAVYAPSRDLSDMESRFEVFKANARYIHEFNQKSK-GM-SYVLGL 72 Query: 370 NKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAV 549 NK+ D+ + EF G K + + + + + + P V P DWR +GAV Sbjct: 73 NKFSDLTYEEFAAKYTG----VKVDASAFATATTSSPDEEL-PVGVP-PATWDWRLNGAV 126 Query: 550 TDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCS 681 TD+KDQG+CGSCW FS GA+EG + +G L++LSEQ ++DCS Sbjct: 127 TDVKDQGQCGSCWVFSAVGAVEGINAIMTGNLLTLSEQQVLDCS 170 >UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: Cysteine proteinase - Paragonimus westermani Length = 272 Score = 94.7 bits (225), Expect = 2e-18 Identities = 42/78 (53%), Positives = 56/78 (71%), Gaps = 1/78 (1%) Frame = +1 Query: 484 KFISPANVKL-PEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSE 660 K + P +K PE++DWR GAVT +++QG CGSCW+FST G +EGQ F ++G LVSLS+ Sbjct: 44 KRVRPTGLKAAPERIDWRAKGAVTAVENQGSCGSCWAFSTAGNVEGQWFIKTGQLVSLSK 103 Query: 661 QNLIDCSEQYGNNGCNGG 714 Q L+DC +GCNGG Sbjct: 104 QQLVDCDR--AADGCNGG 119 >UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba culbertsoni|Rep: Cysteine proteinase - Acanthamoeba culbertsoni Length = 482 Score = 94.3 bits (224), Expect = 3e-18 Identities = 53/166 (31%), Positives = 83/166 (50%) Frame = +1 Query: 217 EWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHH 396 +++++ +H +Y ++ E R + E+ I + N+ G ++ + MN++GD+ Sbjct: 63 QFNSWMRRHARSYSND-EFLERYNTWRENMDFIEEFNR----GNHTFTVAMNEHGDLTPE 117 Query: 397 EFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKC 576 EF + G A + +P DWR GAVT +K+QG C Sbjct: 118 EFARLYMGQVSPASEQELQERIAAESAMEDEHHHTRASIPANWDWRTKGAVTPVKNQGSC 177 Query: 577 GSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 SCW+F TGA+EG G LVSLS+Q L+DC+ GN GC+GG Sbjct: 178 ASCWAFVATGAVEGVRKIAGGSLVSLSDQMLLDCAVGTGNQGCSGG 223 >UniRef50_Q22W19 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 94.3 bits (224), Expect = 3e-18 Identities = 53/167 (31%), Positives = 89/167 (53%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 +E+ A+ ++ + EV+ +R I+ ++K ++ + N + + +N + Sbjct: 42 DEFQAWMHKYGFKFADEVQLQYRRSIFYQNKDLVEQLNSENNGTFHT----LNAFAIYTK 97 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK 573 EF + G+ K K + +KG ++P+ +DWR+ AVT +K+QG+ Sbjct: 98 DEFNQLFKGYQKRQKSHLIYSLKGD-------VAPS-------IDWRQKNAVTPVKNQGQ 143 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 CGSCW+FST G LEG + +G L S SEQ ++DCS+ N GCNGG Sbjct: 144 CGSCWAFSTVGGLEGAYAIATGNLTSFSEQQIVDCSK--ANAGCNGG 188 >UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa zeasingle nucleocapsid nuclear polyhedrosis virus) Length = 367 Score = 93.5 bits (222), Expect = 5e-18 Identities = 53/151 (35%), Positives = 80/151 (52%) Frame = +1 Query: 268 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 447 +DN KI ++++ + + + S + G+NK+ D E + + GF + Sbjct: 82 KDNLN-KINSQNRENLLNNKNNNDSLSTSAQFGVNKFSDKTPDEVLHSNTGFFLNLSQHY 140 Query: 448 NLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHF 627 L + V+GA +++LP+ DWR VT IKDQG CGSCW+F G +E Q+ Sbjct: 141 TL-CENRIVKGAP-----DIRLPDYYDWRDTNKVTPIKDQGVCGSCWAFVAIGNIESQYA 194 Query: 628 RQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 + L+ LSEQ L+DC E + GCNGGLM Sbjct: 195 IRHNKLIDLSEQQLLDCDEV--DLGCNGGLM 223 >UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 350 Score = 93.1 bits (221), Expect = 6e-18 Identities = 55/172 (31%), Positives = 92/172 (53%), Gaps = 6/172 (3%) Frame = +1 Query: 217 EWS-AFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 +W +F Y SE E +R ++ ++ I KHN +YKL N++ DM Sbjct: 45 KWERSFSSGRSRTYLSEEERTYRQIVFLQNDQNIQKHNSDSNN---TYKLQHNQFSDMTK 101 Query: 394 HEFV-KTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKH-GAVTDIKDQ 567 EF + +N KT+ + + + +RG+ A++ + DWR + G + ++K+Q Sbjct: 102 DEFAHRVLNSQLKTSASSSSQPAQTPQLRGSV---DASLNASQGFDWRNYQGVLGNVKNQ 158 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC-SEQYG--NNGCNGG 714 G+CGSCW+F+T G LE + + + SEQ+++DC S YG ++GCNGG Sbjct: 159 GQCGSCWTFATAGVLESYYALKYQQSLIFSEQDIVDCASRSYGYQSDGCNGG 210 >UniRef50_Q23H06 Cluster: Papain family cysteine protease containing protein; n=18; Tetrahymena thermophila|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 349 Score = 92.7 bits (220), Expect = 8e-18 Identities = 60/191 (31%), Positives = 91/191 (47%), Gaps = 21/191 (10%) Frame = +1 Query: 205 LVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGD 384 L +WS+ +H+ Y +E E FR ++ E+ I HN +Y + +N++ D Sbjct: 27 LAYNKWSS---EHQRVYLNEHEKLFRQMVFFENLQKIQDHNSNPNN---TYSIHLNQFSD 80 Query: 385 MLHHEFVKT-----------MNGF-----NKTAKHNKNLYMKGG--SVRGAKFISPANVK 510 M EF + M G N A HN+ + ++ N Sbjct: 81 MTKQEFAEKILMKQSFVENFMKGASQQDNNTNANHNEANHNDANHNDANHEMQLNSKNFT 140 Query: 511 LPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC---S 681 + +DWR GAVT +K QG CG+CW+FS TG +E +F Q+ LV SEQ L+DC + Sbjct: 141 IATSIDWRSRGAVTQVKWQGNCGACWAFSATGVMESFNFIQNKALVEFSEQQLLDCVIPA 200 Query: 682 EQYGNNGCNGG 714 Y ++GC+GG Sbjct: 201 NGYPSSGCHGG 211 >UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative cysteine proteinase - Oryza sativa subsp. japonica (Rice) Length = 357 Score = 92.3 bits (219), Expect = 1e-17 Identities = 56/172 (32%), Positives = 85/172 (49%) Frame = +1 Query: 208 VKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDM 387 ++E + + H Y+ +E R +++ + I N G S +L NK+ D+ Sbjct: 45 MRERYEKWAADHGRTYKDSLEKARRFEVFRTNALFIDSFNAAG--GKKSPRLTTNKFADL 102 Query: 388 LHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQ 567 + EF + T + GGS G + + +P ++WR GAVT +K+Q Sbjct: 103 TNEEFAEYYGRPFSTP-------VIGGS--GFMYGNVRTSDVPANINWRDRGAVTQVKNQ 153 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 C SCW+FS A+EG H +S LV+LS Q L+DCS N+GCN G MD Sbjct: 154 KDCASCWAFSAVAAVEGIHQIRSHNLVALSTQQLLDCSTGRNNHGCNRGDMD 205 >UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis|Rep: Cathepsin L - Culicoides sonorensis Length = 331 Score = 92.3 bits (219), Expect = 1e-17 Identities = 51/171 (29%), Positives = 89/171 (52%), Gaps = 1/171 (0%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 EEW FKL++ Y E+N R I+ + + +HN +Y G+ +Y+ G+N++ D+ + Sbjct: 25 EEWKKFKLEYNKVYPLSTEENLRKGIFERNLADVMEHNARYLSGMETYEKGVNQFSDLTY 84 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL-PEQVDWRKHGAVTDIKDQG 570 EF K G + N+ + G + P +L PE W +K+Q Sbjct: 85 EEFAKLYLG--EKISFNELMTNADGWIE-----KPLRRQLAPESYAWDTKD--VPVKNQA 135 Query: 571 KCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 +CGSCW+F++ ++E ++ R +L+EQ L+DC + ++GC+GG D Sbjct: 136 QCGSCWAFASVASVEMRYKRFHNKSYTLAEQELVDC--ETTSHGCSGGWSD 184 >UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 383 Score = 92.3 bits (219), Expect = 1e-17 Identities = 57/168 (33%), Positives = 93/168 (55%) Frame = +1 Query: 211 KEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDML 390 ++ ++ F L+ Y S E +R +I+ + I + ++ +GL L +N++ D Sbjct: 79 EQMFNDFILKFDRKYTSVEEFEYRYQIFLRNV-IEFEAEEERNLGL---DLDVNEFTDWT 134 Query: 391 HHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQG 570 E K + NK K++ + GS I PA++ DWR+ G +T IK+QG Sbjct: 135 DEELQKMVQE-NKYTKYDFDTPKFEGSYLETGVIRPASI------DWREQGKLTPIKNQG 187 Query: 571 KCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 +CGSCW+F+T ++E Q+ + G LVSLSEQ ++DC + NNGC+GG Sbjct: 188 QCGSCWAFATVASVEAQNAIKKGKLVSLSEQEMVDCDGR--NNGCSGG 233 >UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 514 Score = 92.3 bits (219), Expect = 1e-17 Identities = 53/169 (31%), Positives = 89/169 (52%) Frame = +1 Query: 208 VKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDM 387 ++ + ++ QH Y+SE E + R I+ + I N+K + YKL N + D+ Sbjct: 216 IERMYRKYQGQHNKQYDSEHEVSKRKHIFRHNMRYIRSINRKN----LKYKLAPNHFVDL 271 Query: 388 LHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQ 567 E+ + K + + + G + + V +P+++DWR +GAV+ ++ Q Sbjct: 272 TDGEYDQH--------KGDSIITLYGPYSNMSHVLQ--RVDVPDELDWRDYGAVSPVRGQ 321 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 G CGSC++ + GA+EG +F ++G L LS Q +IDCS GN GC GG Sbjct: 322 GICGSCYALAAVGAVEGAYFMKTGKLKELSAQQVIDCSWGSGNRGCKGG 370 >UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 280 Score = 90.2 bits (214), Expect = 4e-17 Identities = 50/138 (36%), Positives = 76/138 (55%), Gaps = 6/138 (4%) Frame = +1 Query: 319 KHNQKYEMGLVSYKLGMNKYGDMLHHEFVK-TMNG--FNKTAKHNKNLYMKGGSVRGAKF 489 +HNQ+ SY++GMN++ D+ EF ++N FN ++ +N+ + Sbjct: 3 QHNQEKNN---SYQIGMNQFSDLTIEEFQSISLNQQLFNSESRKLENIKNENQQADFYLQ 59 Query: 490 ISPANVK-LPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQN 666 + N LP+Q DWR G VT +K+QG CGSCW+F+ TG E + ++ + SEQ Sbjct: 60 LLKTNASSLPQQFDWRNLGKVTQVKNQGNCGSCWAFTITGLFESINLIRNKTVELYSEQE 119 Query: 667 LIDCSEQ--YGNNGCNGG 714 L+DCS Y N+GC GG Sbjct: 120 LLDCSSNGIYRNSGCQGG 137 >UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 664 Score = 89.8 bits (213), Expect = 6e-17 Identities = 37/71 (52%), Positives = 52/71 (73%), Gaps = 2/71 (2%) Frame = +1 Query: 514 PEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC--SEQ 687 P +DWR G V+ +K+QG CGSC++FST GALE ++R++ ++ LSEQNL+DC S + Sbjct: 471 PISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALESHYYRKNNRMLDLSEQNLVDCTASNK 530 Query: 688 YGNNGCNGGLM 720 Y N GC+GG M Sbjct: 531 YRNGGCSGGWM 541 >UniRef50_Q22NW9 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 894 Score = 89.8 bits (213), Expect = 6e-17 Identities = 53/162 (32%), Positives = 85/162 (52%) Frame = +1 Query: 238 QHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMN 417 +++++ + E +R+ I+A++ I HNQ + Y G+N++ + EF +T Sbjct: 607 RYKMHIINPKEYMYRLNIFAKNLQNIKNHNQ---ISNKPYIEGINQFTHLTEEEFEQTYL 663 Query: 418 GFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFS 597 A + +F+ ++P +DWR AVT +K+QG CGS ++FS Sbjct: 664 TLQIPASKQ---------YKTQEFLGD---EVPSSIDWRDLNAVTPVKNQGSCGSGYAFS 711 Query: 598 TTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 TTGALEG H SEQ +IDCS + GN+GC+GG M+ Sbjct: 712 TTGALEGIHKISGKDWKGFSEQQIIDCSRKQGNSGCHGGFME 753 >UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, whole genome shotgun sequence; n=7; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_22, whole genome shotgun sequence - Paramecium tetraurelia Length = 350 Score = 89.8 bits (213), Expect = 6e-17 Identities = 39/66 (59%), Positives = 48/66 (72%), Gaps = 1/66 (1%) Frame = +1 Query: 523 VDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYG-NN 699 +DWR GAV +KDQG+CGSCW+FSTTG LEG + Q+G L LSEQ L+DCS N Sbjct: 146 IDWRTRGAVNKVKDQGQCGSCWAFSTTGVLEGFYKVQTGELPDLSEQQLVDCSTLIDFNQ 205 Query: 700 GCNGGL 717 GC+GG+ Sbjct: 206 GCDGGM 211 >UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativa|Rep: Os01g0347600 protein - Oryza sativa subsp. japonica (Rice) Length = 343 Score = 89.0 bits (211), Expect = 1e-16 Identities = 57/170 (33%), Positives = 82/170 (48%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 EEW A + Y+ E R I+ ++ H I + + +G+N++ D+ + Sbjct: 45 EEWMA---KFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSA---VGINQFADLTN 98 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK 573 EFV T G H K + + P + P +DWR GAVT +KDQG Sbjct: 99 DEFVATYTGAKPP--HPKE---------APRPVDP--IWTPCCIDWRFRGAVTGVKDQGA 145 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 CGSCW+F+ A+EG ++G L LSEQ L+DC +NGC GG D Sbjct: 146 CGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDT--NSNGCGGGHTD 193 >UniRef50_Q24FA8 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 335 Score = 89.0 bits (211), Expect = 1e-16 Identities = 50/164 (30%), Positives = 88/164 (53%), Gaps = 9/164 (5%) Frame = +1 Query: 253 YESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTM----NG 420 Y SE E +R ++ E+ + +HN+ +Y +G+N++ D+ E+ + + + Sbjct: 43 YSSEAEKIYRQSVFLENYQSVQEHNKNSNH---TYSVGINQFSDITLQEYQQRILMKNSP 99 Query: 421 FNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFST 600 N+ AK NKN ++ ++ + + ++ +DWRK G V+ +K+QG+CG CW+FS Sbjct: 100 LNELAK-NKNRLLQSSPIQNSN-----DTQIASSIDWRKKGGVSPVKNQGECGGCWTFSA 153 Query: 601 TGALEGQH-FRQSGYLVSL-SEQNLIDC---SEQYGNNGCNGGL 717 TG +E + VSL S+Q L+DC Y + GC GG+ Sbjct: 154 TGLMESFNLIHNKPQNVSLYSQQQLLDCVTLENGYFSEGCEGGV 197 >UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrovirus|Rep: Viral cathepsin - Lymantria dispar multicapsid nuclear polyhedrosis virus (LdMNPV) Length = 356 Score = 89.0 bits (211), Expect = 1e-16 Identities = 54/168 (32%), Positives = 85/168 (50%), Gaps = 1/168 (0%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHII-AKHNQKYEMGLVSYKLGMNKYGDMLHH 396 + +F + NY S+ E N R I+ ++ H I AK+ + +YK+ NK+ D+ Sbjct: 56 FESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKI--NKFSDLSKS 113 Query: 397 EFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKC 576 E + G + + + + K ++ K P DWR+ VT IK+QG C Sbjct: 114 ELIAKFTGLSIPERVSN--FCK------TIILNQPPDKGPLHFDWREQNKVTSIKNQGAC 165 Query: 577 GSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 G+CW+F+T ++E Q + L+ LSEQ LIDC + GCNGGL+ Sbjct: 166 GACWAFATLASVESQFAMRHNRLIDLSEQQLIDCDSV--DMGCNGGLL 211 >UniRef50_Q24HI2 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 356 Score = 88.6 bits (210), Expect = 1e-16 Identities = 50/151 (33%), Positives = 81/151 (53%), Gaps = 2/151 (1%) Frame = +1 Query: 271 DNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNK-YGDMLHHEFVKTMNGFNKTAKHNK 447 ++ + + E + +HN+K +Y L ++ + M +FV G ++ Sbjct: 54 EHLEFQHFKESVRRVREHNKKVN---ATYTLSIDSPFAFMSDEQFVTEYLG-SQDCSATA 109 Query: 448 NLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQH- 624 L +K + K + NV++PE ++W+ V+ +KDQ CGSCW+FSTTGA+E + Sbjct: 110 ELTLK----KPMKIQNKKNVQVPESINWKDLNKVSPVKDQQNCGSCWTFSTTGAIESHYA 165 Query: 625 FRQSGYLVSLSEQNLIDCSEQYGNNGCNGGL 717 + SLSEQ LIDC+ + NNGC+GGL Sbjct: 166 IFEDVEPTSLSEQQLIDCAGAFNNNGCSGGL 196 >UniRef50_Q248G1 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 334 Score = 88.2 bits (209), Expect = 2e-16 Identities = 59/178 (33%), Positives = 88/178 (49%), Gaps = 8/178 (4%) Frame = +1 Query: 205 LVKEEWSAFKLQHRLN---YESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNK 375 L EE A+ L + N Y SE E FR I+ E+K + HN + ++ +N+ Sbjct: 28 LTVEELIAYNLWRQNNGRVYNSEEEQFFRQLIFVENKRQVDSHNSQNP----TFTQSLNQ 83 Query: 376 YGDMLHHEF-VKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRK-HGAV 549 + D EF + +N K ++ KG + + ++PE VDWR V Sbjct: 84 FADFTDEEFKYRVLN-----TKVSQTRPKKGRRLESRVL----DQQIPESVDWRNVTNVV 134 Query: 550 TDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC---SEQYGNNGCNGG 714 IK+QG CGSCW+FS G +E + + G VS +EQ ++DC S Y ++GCNGG Sbjct: 135 GPIKNQGHCGSCWTFSIAGIVESHYVLKHGSYVSYAEQEILDCVSVSAGYQSDGCNGG 192 >UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; Entamoeba|Rep: Cysteine proteinase 2 precursor - Entamoeba histolytica Length = 315 Score = 88.2 bits (209), Expect = 2e-16 Identities = 37/75 (49%), Positives = 52/75 (69%), Gaps = 3/75 (4%) Frame = +1 Query: 502 NVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSG---YLVSLSEQNLI 672 N++ PE VDWRK G VT I+DQ +CGSC++F + ALEG+ + G + LSE++++ Sbjct: 91 NIQAPESVDWRKEGKVTPIRDQAQCGSCYTFGSLAALEGRLLIEKGGDANTLDLSEEHMV 150 Query: 673 DCSEQYGNNGCNGGL 717 C+ GNNGCNGGL Sbjct: 151 QCTRDNGNNGCNGGL 165 >UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; Leishmania|Rep: Cysteine proteinase 1 precursor - Leishmania pifanoi Length = 354 Score = 87.8 bits (208), Expect = 2e-16 Identities = 57/168 (33%), Positives = 84/168 (50%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 + +FK +H + + E+ R + ++ N + Y + K+ D+ E Sbjct: 42 YGSFKKRHGKAFGGDAEEGHRFNAFKQNMQTAYFLNTQNPHA--HYDVS-GKFADLTPQE 98 Query: 400 FVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCG 579 F K + A+H K+ + + V + +P+ V VDWR GAVT +K+QG CG Sbjct: 99 FAKLYLNPDYYARHLKD-HKEDVHVDDS---APSGVM---SVDWRDKGAVTPVKNQGLCG 151 Query: 580 SCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 SCW+FS G +EGQ LVSLSEQ L+ C + GCNGGLMD Sbjct: 152 SCWAFSAIGNIEGQWAASGHSLVSLSEQMLVSCDNI--DEGCNGGLMD 197 >UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium tetraurelia|Rep: Cathepsin L1 precursor - Paramecium tetraurelia Length = 314 Score = 87.4 bits (207), Expect = 3e-16 Identities = 51/170 (30%), Positives = 84/170 (49%), Gaps = 2/170 (1%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 ++ +K+++ Y ++ ++ +R K++ ++ + I + E ++ L +N++ DM E Sbjct: 26 YANWKMKYNRRYTNQRDEMYRYKVFTDNLNYIRAFYESPEEA--TFTLELNQFADMSQQE 83 Query: 400 FVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVT--DIKDQGK 573 F +T N +GA +VDW + V +K+QG Sbjct: 84 FAQTYLSLKVPRTAKLNAANSNFQYKGA------------EVDWTDNKKVKYPAVKNQGS 131 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 CGSCW+FS GALE + LSEQ+L+DCS Y N+GCNGG MD Sbjct: 132 CGSCWAFSAVGALEINTDIELNRKYELSEQDLVDCSGPYDNDGCNGGWMD 181 >UniRef50_Q23FQ5 Cluster: Papain family cysteine protease containing protein; n=4; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 87.0 bits (206), Expect = 4e-16 Identities = 51/166 (30%), Positives = 86/166 (51%), Gaps = 2/166 (1%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 ++ ++ +R + +E E+ +R ++ E+ + H + E +Y + +N++ D E Sbjct: 36 YNKWRSSYRRVFLNEDEETYRQLVFFENLQKLKTHEKNTE---ATYTVSLNQFSDYSQEE 92 Query: 400 FVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCG 579 FV+ + NK + K G + A V P VDWR GA+ I++QG+CG Sbjct: 93 FVQRI--LNKHISRSDADIQKEQEPNGN--LRKA-VNYPTSVDWRNSGALNPIQNQGQCG 147 Query: 580 SCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYG--NNGCNG 711 SC +F T G LE ++ +S L+ SEQ L+DC+ Q G GC+G Sbjct: 148 SCAAFGTAGVLESFYYLKSKQLLKFSEQQLLDCARQAGFDTYGCDG 193 >UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza sativa|Rep: Os09g0381400 protein - Oryza sativa subsp. japonica (Rice) Length = 362 Score = 86.2 bits (204), Expect = 7e-16 Identities = 55/172 (31%), Positives = 83/172 (48%), Gaps = 2/172 (1%) Frame = +1 Query: 205 LVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGD 384 ++ + + A++ H +Y S E R +Y + I N + G ++Y+L N++ D Sbjct: 46 VMMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLR---GDLTYQLAENEFAD 102 Query: 385 MLHHEFVKTMNGFNK-TAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIK 561 + EF+ T G+ + ++ G A F V +P VDWR GAV K Sbjct: 103 LTEEEFLATYTGYYAGDGPVDDSVITTGAGDVDASF--SYRVDVPASVDWRAQGAVVPPK 160 Query: 562 DQ-GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 Q C SCW+F T +E + ++G LVSLSEQ L+DC G GCN G Sbjct: 161 SQTSTCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDG--GCNLG 210 >UniRef50_Q235G6 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 325 Score = 86.2 bits (204), Expect = 7e-16 Identities = 53/170 (31%), Positives = 81/170 (47%), Gaps = 2/170 (1%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDN--FRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 W +FK + Y + +D +RM ++ ++ K +G+ K+ D+ H Sbjct: 40 WKSFKQTYNKKYADQDDDEEVYRMNVFFDNLEFTKKDPT----------MGVTKFMDLTH 89 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK 573 EF + N ++ + + P +DW + GAVT +K+QG Sbjct: 90 TEFAELY--LNPAENIDEEI----------DSLQPIQHNEDIVIDWVEKGAVTPVKNQGG 137 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 CG CWSF+TTG +EG +F L +LS+Q LIDC+ Q N GC GGL D Sbjct: 138 CGGCWSFATTGGVEGANFVYKNVLPNLSQQQLIDCNTQ--NKGCGGGLRD 185 >UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|Rep: Thiol protease - Triticum aestivum (Wheat) Length = 374 Score = 85.4 bits (202), Expect = 1e-15 Identities = 52/174 (29%), Positives = 79/174 (45%), Gaps = 2/174 (1%) Frame = +1 Query: 205 LVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGD 384 L+ E + + +H +Y E R I+ + I N+ G +SY LG+N++ D Sbjct: 45 LMMERFHGWMAKHGKSYAGVEEKLRRFDIFRRNVEFIEAANRD---GRLSYTLGVNQFAD 101 Query: 385 MLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKD 564 + H EF+ T + + G V PA +P ++W VT +K+ Sbjct: 102 LTHEEFLATHTSRRVVPSEEMVITTRAGVVVEGANCQPAPNAVPRSINWVNQSKVTPVKN 161 Query: 565 QGK-CGSCWSFSTTGALEGQH-FRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 QGK CG+CW+FS +E + + G LSEQ LIDC + GC G M Sbjct: 162 QGKVCGACWAFSAVATIESAYAIAKRGEPPVLSEQELIDCDT--FDRGCTSGEM 213 >UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadidae|Rep: Cysteine protease - Tritrichomonas foetus (Trichomonas foetus) Length = 315 Score = 85.4 bits (202), Expect = 1e-15 Identities = 52/136 (38%), Positives = 73/136 (53%) Frame = +1 Query: 316 AKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFIS 495 A++ Q++ G + + +NK+ + E+ K M G+ K K RG K Sbjct: 49 ARYVQEHNAGDSKFTVSLNKFAALTPSEY-KVMLGYKTGMKAEK-------VSRGMK--- 97 Query: 496 PANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLID 675 NV + +DWR+ G V +IKDQ CGSCW+FS A E + +G L S SEQNL+D Sbjct: 98 KPNV---DSIDWREKGVVNEIKDQAACGSCWAFSAIQAAESAYAISTGTLESYSEQNLVD 154 Query: 676 CSEQYGNNGCNGGLMD 723 C + G GC+GGLMD Sbjct: 155 CVQ--GCYGCSGGLMD 168 >UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF6860, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 251 Score = 84.6 bits (200), Expect = 2e-15 Identities = 34/51 (66%), Positives = 43/51 (84%) Frame = +1 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 G CGSCW+FSTTGA+EGQ ++++G LVSLSEQNL+DCS+ YG GC+G M Sbjct: 1 GYCGSCWAFSTTGAIEGQIYKKTGQLVSLSEQNLVDCSKSYGTYGCSGAWM 51 >UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative uncharacterized protein - Oryza sativa subsp. japonica (Rice) Length = 326 Score = 84.6 bits (200), Expect = 2e-15 Identities = 48/128 (37%), Positives = 70/128 (54%) Frame = +1 Query: 280 RMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYM 459 R +++ ++ I N+K M SYKLG+NK+ D+ EF G N + Sbjct: 49 RFEVFKKNARYIHDFNRKKGM---SYKLGLNKFADLTLEEFTAKYTGANPGPITG----L 101 Query: 460 KGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSG 639 K G+ G+ ++ P DWR+HGAVT +KDQG CGSCW+FS A+EG + +G Sbjct: 102 KNGT--GSPPLAAVAGDAPPAWDWREHGAVTRVKDQGPCGSCWAFSVVEAVEGINEIMTG 159 Query: 640 YLVSLSEQ 663 ++LSEQ Sbjct: 160 NFLTLSEQ 167 >UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Rep: Cysteine protease - Babesia equi Length = 438 Score = 84.6 bits (200), Expect = 2e-15 Identities = 49/139 (35%), Positives = 71/139 (51%), Gaps = 10/139 (7%) Frame = +1 Query: 331 KYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGS----------VRG 480 K + G SY+ G+NK+ DM EF + + K+L + VR Sbjct: 155 KAQTGEESYEKGINKFSDMTDEEFNLRFPALS-VEELKKSLEVSASEEFTSPEHLDKVRI 213 Query: 481 AKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSE 660 AK + + E +DWRK VT +KDQG CGSCW+F+ G++E + + G + LSE Sbjct: 214 AKGLGVEDSVDGEDLDWRKLNGVTPVKDQGNCGSCWAFAAVGSVESLYLIKKGQALDLSE 273 Query: 661 QNLIDCSEQYGNNGCNGGL 717 Q L++C E +NGC G L Sbjct: 274 QELVNCEE--NSNGCEGDL 290 >UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryota|Rep: Cysteine proteinase GP-I - Zingiber officinale (Ginger) Length = 221 Score = 84.6 bits (200), Expect = 2e-15 Identities = 36/68 (52%), Positives = 49/68 (72%) Frame = +1 Query: 511 LPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQY 690 LP+ +DWR+ GAV +K+QG CGSCW+F A+EG + +G L+SLSEQ L+DCS + Sbjct: 3 LPDSIDWREKGAVVPVKNQGGCGSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCSTR- 61 Query: 691 GNNGCNGG 714 N+GC GG Sbjct: 62 -NHGCEGG 68 >UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n=1; Toxocara canis|Rep: Cathepsin L-like cysteine proteinase - Toxocara canis (Canine roundworm) Length = 360 Score = 84.2 bits (199), Expect = 3e-15 Identities = 50/157 (31%), Positives = 78/157 (49%) Frame = +1 Query: 253 YESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKT 432 Y+S E R +IY + K NQ+ Y G N++ D +EF + + + Sbjct: 61 YDSNEEFAERFRIYVNNMLEAQKLNQRNRDYGTIY--GENEFADWNVNEFREILLPKDFF 118 Query: 433 AKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGAL 612 K + + + ++P+ DWR + VT +K Q KCGSCW+F+T G + Sbjct: 119 KNLRKKSTFIDSFIDPPETVLARREEIPDHFDWRPYNVVTPVKSQFKCGSCWAFATVGTV 178 Query: 613 EGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 E + +G L SLSEQ L+DC+ + NN C+GG +D Sbjct: 179 ESAYALGTGELRSLSEQQLLDCNLE--NNACDGGDVD 213 >UniRef50_Q23FS6 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 330 Score = 84.2 bits (199), Expect = 3e-15 Identities = 55/172 (31%), Positives = 85/172 (49%), Gaps = 3/172 (1%) Frame = +1 Query: 217 EWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHH 396 +W FK + Y + +++R++++A N + V+ G+ ++ D+ Sbjct: 39 QWKLFKSRFNKRYADPITESYRLQVFAS--------NYLRVLSDVTGTFGVTQFFDLTEE 90 Query: 397 EFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKC 576 EF T T + +N+ A SP+ K V+W G V+ +KDQG+C Sbjct: 91 EFAATY----LTLRVQRNV--------NATVSSPSTPKGQYDVNWVTRGKVSAVKDQGQC 138 Query: 577 GSCWSFSTTGALEGQHFRQSGY---LVSLSEQNLIDCSEQYGNNGCNGGLMD 723 GSCW+FSTTG++E +GY + LSEQ L+DCS N GC GG MD Sbjct: 139 GSCWAFSTTGSVESA-LIIAGYANQTIDLSEQQLVDCSAT--NYGCGGGWMD 187 >UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep: Viral cathepsin - Cydia pomonella granulosis virus (CpGV) (Cydia pomonellagranulovirus) Length = 333 Score = 83.4 bits (197), Expect = 5e-15 Identities = 54/171 (31%), Positives = 89/171 (52%), Gaps = 2/171 (1%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 E + F +++ Y S+ E +++ + + +I + N + + +N+Y D+ Sbjct: 30 ELFKNFAIKYNKTYVSDEERAIKLENFKNNLKMINEKNMASKYAVFD----INEYSDLNK 85 Query: 394 HEFVKTMNGFNKTAKHNKNLY-MKGGSVRGAKFISPANVKLPEQVDWR-KHGAVTDIKDQ 567 + ++ GF K N + + M SV K LPE +DWR KHG VT +K+Q Sbjct: 86 NALLRRTTGFRLGLKKNPSAFTMTECSVVVIK--DEPQALLPETLDWRDKHG-VTPVKNQ 142 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 +CGSCW+FST +E + + ++LSEQ+L++C NNGC GGLM Sbjct: 143 MECGSCWAFSTIANIESLYNIKYDKALNLSEQHLVNCDNI--NNGCAGGLM 191 >UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia theta|Rep: Cathepsin H precursor - Guillardia theta (Cryptomonas phi) Length = 353 Score = 83.0 bits (196), Expect = 7e-15 Identities = 36/72 (50%), Positives = 48/72 (66%), Gaps = 5/72 (6%) Frame = +1 Query: 517 EQVDWRKH-----GAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCS 681 ++ DWR V+ +K+QG CGSCW+FST ALE H ++G +V LSEQ L+DC+ Sbjct: 120 DEFDWRNQTCGETSCVSMVKNQGTCGSCWTFSTAAALESLHAIKTGEMVLLSEQQLVDCA 179 Query: 682 EQYGNNGCNGGL 717 + NNGCNGGL Sbjct: 180 ADFKNNGCNGGL 191 >UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schistosoma japonicum|Rep: Clone ZZD209 mRNA sequence - Schistosoma japonicum (Blood fluke) Length = 339 Score = 83.0 bits (196), Expect = 7e-15 Identities = 45/165 (27%), Positives = 83/165 (50%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 W +K H +Y + E+ R + + E+ I HN +Y++G+ +Y++G++++ D+ +E Sbjct: 31 WKIWKRLHDKHYTNRHEEVVRRRNWNENLVKIHLHNLRYDLGVETYEIGLSRFSDVDWNE 90 Query: 400 FVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCG 579 F + +K + + V + P+ DWR V + +DQG C Sbjct: 91 FRSWYSVGDKLDIPESSYIDEKYDVNNVGWT-------PDSYDWRHLNIVNEPRDQGSCI 143 Query: 580 SCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 ++F+ T + E Q+ + ++LS Q IDC+ YGN GC+GG Sbjct: 144 GSYAFAVTASTESQYALHTSNHMNLSVQQFIDCTRIYGNMGCHGG 188 >UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 339 Score = 83.0 bits (196), Expect = 7e-15 Identities = 50/154 (32%), Positives = 81/154 (52%), Gaps = 1/154 (0%) Frame = +1 Query: 259 SEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAK 438 S E R + ++K + + N+K + L +N + D+ +E++ N + + Sbjct: 39 SNKEFYMRFNNFKKNKEYVDQWNEKQ----LETILELNFFADLSRNEYI---NNYLASFI 91 Query: 439 HNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKC-GSCWSFSTTGALE 615 N+ K G + N + + +DWR AVT +K+QG C G+ +SFS G +E Sbjct: 92 DISNIEQKNTKYEG-NLKNNFNNSI-KSIDWRNFDAVTPVKNQGLCSGAGYSFSAIGVIE 149 Query: 616 GQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGL 717 HF ++ L++LSEQN+IDC+ GNNGC GGL Sbjct: 150 SSHFIKNKELITLSEQNIIDCTTDMGNNGCMGGL 183 >UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep: Viral cathepsin - Xestia c-nigrum granulosis virus (XnGV) (Xestia c-nigrumgranulovirus) Length = 346 Score = 82.6 bits (195), Expect = 9e-15 Identities = 46/170 (27%), Positives = 82/170 (48%) Frame = +1 Query: 211 KEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDML 390 +E ++ F +++ Y+ + E R +I+ ++ I N + + +N D+ Sbjct: 40 QELFNEFVVKYNKVYKDDQEKEARFEIFKQNLADINARNALEDSAMFE----INSRADIS 95 Query: 391 HHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQG 570 +E ++ + G + + K ++ K+P+ DWR +VT +K Q Sbjct: 96 SNELLQKLTGLKLSLMRGEK---KNSFCTPTVISGDSSGKVPDSFDWRDRNSVTSVKMQK 152 Query: 571 KCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 +CGSCW+FS +E + + + LSEQ L+DC + NNGCNGGLM Sbjct: 153 ECGSCWAFSAVANIESLYHIKHNVSLDLSEQQLVDCDKV--NNGCNGGLM 200 >UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8; Magnoliophyta|Rep: Os12g0273800 protein - Oryza sativa subsp. japonica (Rice) Length = 504 Score = 82.2 bits (194), Expect = 1e-14 Identities = 57/170 (33%), Positives = 80/170 (47%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 E W A QH Y+ E R++++ + I N G Y LG+N++ D+ Sbjct: 45 ERWMA---QHGRVYKDAAEKARRLEVFKANVAFIESFNAG---GKNRYWLGVNQFADLTS 98 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK 573 EF TM + N + + G K+ + + LP VDWR GAVT IKDQG+ Sbjct: 99 EEFKATMTNSKGFSTPNNGVRVS----TGFKYENVSADALPASVDWRTKGAVTRIKDQGQ 154 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 C A+EG +G L+SLSEQ L+DC + GC GG +D Sbjct: 155 C----------AMEGFVKLSTGKLISLSEQELVDCDVDGNDQGCEGGEID 194 >UniRef50_Q231X3 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 323 Score = 82.2 bits (194), Expect = 1e-14 Identities = 54/171 (31%), Positives = 92/171 (53%), Gaps = 2/171 (1%) Frame = +1 Query: 208 VKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDM 387 V + WS +K +H YE+ +++R++++AE+ ++ K++Q + G+ K+ D+ Sbjct: 35 VTKIWSQWKQKHNKRYENTDYESYRLEVFAENLEVV-KNDQ-------TGTYGITKFLDL 86 Query: 388 LHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQ 567 EF N N A++ ++ S+ + P K+ ++W + G V+++K Q Sbjct: 87 TDDEFAG--NFLNLKAQYPED------SIAEDIEVDP---KI--NINWVEAGKVSNVKSQ 133 Query: 568 GKCGSCWSFSTTGALEGQHF--RQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 G CGSCW+FS T ++E + +SLSEQ LIDCS YGN GC G Sbjct: 134 GNCGSCWAFSATASVESALIIAGKVDKSISLSEQQLIDCSGDYGNYGCAAG 184 >UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheirus salmonis|Rep: Cysteine proteinase - Lepeophtheirus salmonis (salmon louse) Length = 372 Score = 82.2 bits (194), Expect = 1e-14 Identities = 48/175 (27%), Positives = 85/175 (48%), Gaps = 7/175 (4%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 +E+ +F ++ +Y + + ++K++ ++ I +HN + ++ +G+N++ D+ Sbjct: 25 QEFESFVKEYSKSYHNRALRSLKLKVFVDNLREIEEHNANPKR---TWDMGINEFSDLTD 81 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK-LPEQVDWRKHGAVTDIKDQG 570 EF G++ + G V N+K LPE VDWR+ G +TD+K+QG Sbjct: 82 EEFESKYMGYSPMSS-------SAGLVTRTAAPKQGNIKDLPESVDWREKGVITDVKNQG 134 Query: 571 KCGSCWSFSTTGALEGQHFRQSGYLVS--LSEQNLIDCSEQ-Y---GNNGCNGGL 717 CGSCW FS +E ++ LS Q + CS Y G+ GC G + Sbjct: 135 SCGSCWVFSAVEQIESYVAIENNMTSPPLLSTQQITSCSSNPYSCGGSGGCKGAI 189 >UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=17; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 318 Score = 82.2 bits (194), Expect = 1e-14 Identities = 53/149 (35%), Positives = 76/149 (51%) Frame = +1 Query: 268 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 447 E +FR+ +Y +K + +HN+ Y+L MN M E+ K + G +T K Sbjct: 37 EYHFRLGVYNTNKRRVQEHNRANS----GYQLTMNHLSCMTPSEY-KVLLGHKQTKK--- 88 Query: 448 NLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHF 627 + G I +V P+ VDWR V IKDQ +CGSCW+FS A E Q Sbjct: 89 --------IEGEAKIFKGDV--PDAVDWRNAKIVNPIKDQAQCGSCWAFSVVQAQESQWA 138 Query: 628 RQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 + G L+SL+EQN++DC + GC+GG Sbjct: 139 LKKGQLLSLAEQNMVDCVDTC--YGCDGG 165 >UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 20 SCAF14744, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 175 Score = 81.8 bits (193), Expect = 2e-14 Identities = 45/121 (37%), Positives = 64/121 (52%) Frame = +1 Query: 352 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 531 S K G+N++ D+ EF K+LY++ + R F LP + DW Sbjct: 20 SAKYGINQFSDLSEREF--------------KDLYLRASADRAPVFTGQKIKGLPARFDW 65 Query: 532 RKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNG 711 R + V +++Q CGSCW+FS GA++ H S LV LS Q ++DCS Q NNGC+G Sbjct: 66 RDNAVVGPVQNQQACGSCWAFSVVGAVQSVHAIGSSPLVELSVQQVLDCSFQ--NNGCDG 123 Query: 712 G 714 G Sbjct: 124 G 124 >UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza sativa|Rep: Putative cysteine protease - Oryza sativa subsp. japonica (Rice) Length = 357 Score = 81.8 bits (193), Expect = 2e-14 Identities = 51/171 (29%), Positives = 82/171 (47%), Gaps = 1/171 (0%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 EEW A + Y+ E R ++ ++ I + + + +N++ D+ + Sbjct: 45 EEWMA---KFGKTYKCHGEKEHRFAVFRDNVRFIRSYRPE---ATYDSAVRINQFADLTN 98 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK 573 EFV T G + + + + P + +P +DWR GAVT +KDQG Sbjct: 99 GEFVATYTGVKQPPPAT---HPHPHPEEAPRPVDP--IWMPCCIDWRFKGAVTGVKDQGA 153 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYG-NNGCNGGLMD 723 CGS W+F+ A+EG ++G L LSEQ L+DC + G ++GC GG D Sbjct: 154 CGSSWAFAAVAAMEGLMKIRTGQLTPLSEQELVDCVDGGGDSDGCGGGHTD 204 >UniRef50_Q23TW3 Cluster: Papain family cysteine protease containing protein; n=5; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 437 Score = 81.8 bits (193), Expect = 2e-14 Identities = 34/72 (47%), Positives = 49/72 (68%), Gaps = 2/72 (2%) Frame = +1 Query: 508 KLPEQVDWRKHGAVTDIKDQGK-CGSCWSFSTTGALEGQHFRQSGYL-VSLSEQNLIDCS 681 +LP+ VDWR+ G VT +K QGK CGSCW+F+ ALE + ++G + SEQ L+DC+ Sbjct: 204 QLPQYVDWREKGVVTQVKSQGKDCGSCWAFAAVAALESHYALKTGKKPIQFSEQQLVDCA 263 Query: 682 EQYGNNGCNGGL 717 ++ GC+GGL Sbjct: 264 RKFDTKGCSGGL 275 >UniRef50_A0BNM1 Cluster: Chromosome undetermined scaffold_119, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_119, whole genome shotgun sequence - Paramecium tetraurelia Length = 341 Score = 81.8 bits (193), Expect = 2e-14 Identities = 47/170 (27%), Positives = 92/170 (54%), Gaps = 1/170 (0%) Frame = +1 Query: 217 EWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHH 396 ++ + L+H +Y + E +R IY ++K +I +HN++ E ++ +G N++ + + Sbjct: 28 DFERWALKHGKHYFGD-EKKYRQAIYFQNKQMIEEHNKRSEF---TFLMGENQFMAITNE 83 Query: 397 EFVKT-MNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK 573 EFV +N + ++ ++ ++ + + + I N+K + VDWR + V K+ G Sbjct: 84 EFVSLYLNPISPEKQNEQDQIIRKTNPKSPEPIREYNLK--DDVDWRGYAPV---KNSGN 138 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 CGS W+ + T +E + G V+LS QN++DC+ +G GC+ L D Sbjct: 139 CGSSWAMAATNVIEAAYAIDKGIKVTLSAQNVMDCANSWG--GCDASLAD 186 >UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovirus|Rep: Viral cathepsin - Spodoptera litura multicapsid nucleopolyhedrovirus (SpltMNPV) Length = 337 Score = 81.4 bits (192), Expect = 2e-14 Identities = 41/119 (34%), Positives = 62/119 (52%) Frame = +1 Query: 364 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHG 543 G+NK+ D+ FV G ++ + + ++ + + PE DWRK Sbjct: 77 GINKFSDIDKITFVNEHAGLVSNLINSTDSNFDPYRLCEYVTVAGPSARTPESFDWRKLN 136 Query: 544 AVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 VT +K+QG CGSCW+F+ G +E Q+ L+ LSEQ L+DC + GC+GGLM Sbjct: 137 KVTKVKEQGVCGSCWAFAAIGNIESQYAIMHDSLIDLSEQQLLDCDRV--DQGCDGGLM 193 >UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis|Rep: Cysteine protease 2 - Babesia bovis Length = 445 Score = 80.6 bits (190), Expect = 4e-14 Identities = 59/176 (33%), Positives = 87/176 (49%), Gaps = 1/176 (0%) Frame = +1 Query: 199 FDLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKY 378 +D V E +AF L R N++ V+ + K K + H ++ V+ KL ++K Sbjct: 137 YDTVAERHTAF-LNFRRNHDI-VKSHEHNKAATYTKDL--NHFFDKDIKAVAAKL-LHKI 191 Query: 379 GDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLP-EQVDWRKHGAVTD 555 D+ + + K N+ +Y + + P K+ E +DWR+ AVT Sbjct: 192 -DVYNESNISVTPTDTTATKENQPIYATLKNYSVSAGYPPIGSKVNFEDIDWRRADAVTP 250 Query: 556 IKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 +KDQG CGSCW+F+ G++E RQ V LSEQ L+ C Q GN GCNGG D Sbjct: 251 VKDQGMCGSCWAFAAVGSVESLLKRQKTD-VRLSEQELVSC--QLGNQGCNGGYSD 303 >UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2; Entamoeba|Rep: Cysteine proteinase ACP1 precursor - Entamoeba histolytica Length = 308 Score = 79.8 bits (188), Expect = 6e-14 Identities = 46/116 (39%), Positives = 61/116 (52%) Frame = +1 Query: 367 MNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGA 546 +N + DM H EF++T G + +V+ A + A PE VDWR Sbjct: 57 LNVFADMTHEEFIQTHLGMTYEVPETTS------NVKAA--VKAA----PESVDWR--SI 102 Query: 547 VTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 + KDQG+CGSCW+F TT LEG+ + G L S SEQ L+DC +NGC GG Sbjct: 103 MNPAKDQGQCGSCWTFCTTAVLEGRVNKDLGKLYSFSEQQLVDCDA--SDNGCEGG 156 >UniRef50_Q94715 Cluster: Putative cathepsin L2 precursor; n=4; Paramecium tetraurelia|Rep: Putative cathepsin L2 precursor - Paramecium tetraurelia Length = 294 Score = 79.4 bits (187), Expect = 8e-14 Identities = 47/154 (30%), Positives = 83/154 (53%) Frame = +1 Query: 253 YESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKT 432 + +E E +RM+IY +K +I +HNQ+ + V+Y++G N++ + H EFV Sbjct: 25 FYTESEKLYRMEIYNSNKRMIEEHNQRED---VTYQMGENQFMTLSHEEFVDLY-----L 76 Query: 433 AKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGAL 612 K + ++ + G S + ++ VDWR + T +K+QG+C S W+FS + +L Sbjct: 77 QKSDSSVNIMGAS------LPEVQLEGLGAVDWRNY---TTVKEQGQCASGWAFSVSNSL 127 Query: 613 EGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 E + + ++ S Q ++DC Y N GC+GG Sbjct: 128 EAWYAIRGFQKINASTQQIVDC--DYNNTGCSGG 159 >UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_36, whole genome shotgun sequence - Paramecium tetraurelia Length = 307 Score = 79.0 bits (186), Expect = 1e-13 Identities = 56/172 (32%), Positives = 82/172 (47%), Gaps = 2/172 (1%) Frame = +1 Query: 214 EEWSAFKL-QHRLN-YESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDM 387 EE +FK Q N + + E+ +R I+ ++ +I KHN SY + +N++ D+ Sbjct: 24 EEAHSFKTWQKNFNKFYTSNEETYRQVIFNQNVELINKHNSNPNK---SYSMAVNQFADL 80 Query: 388 LHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQ 567 EF G K + N+ + G+ G DW + IK+Q Sbjct: 81 TDEEFQSMYLGKPTYVKID-NIELSKGNTLG-------------DADWASK--MNPIKNQ 124 Query: 568 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 G CGSCW+FS GA+EG + G+ LSEQ L+DC+ G GCNGG D Sbjct: 125 GNCGSCWTFSAIGAVEGFLAIRKGFKGVLSEQQLVDCAVDAG-EGCNGGNSD 175 >UniRef50_Q23H15 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 370 Score = 78.6 bits (185), Expect = 1e-13 Identities = 35/70 (50%), Positives = 44/70 (62%), Gaps = 3/70 (4%) Frame = +1 Query: 511 LPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC---S 681 L +DWR GAVT +K+QG CGSCWSFS G +E +F Q+ LV SEQ L+DC + Sbjct: 162 LAASIDWRTKGAVTSVKNQGNCGSCWSFSAAGLMESFNFIQNKALVDFSEQQLLDCVIPA 221 Query: 682 EQYGNNGCNG 711 Y +GC G Sbjct: 222 NGYNIHGCEG 231 >UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lamblia ATCC 50803|Rep: GLP_26_49243_47612 - Giardia lamblia ATCC 50803 Length = 543 Score = 78.2 bits (184), Expect = 2e-13 Identities = 35/78 (44%), Positives = 46/78 (58%), Gaps = 7/78 (8%) Frame = +1 Query: 505 VKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEG-------QHFRQSGYLVSLSEQ 663 V+ P Q+DWR G +T +KDQ CGSCWSF G +EG + + L+ +SEQ Sbjct: 314 VQFPRQLDWRVRGVITPVKDQAACGSCWSFGAAGTIEGRLNALKWKRGERDTPLLRVSEQ 373 Query: 664 NLIDCSEQYGNNGCNGGL 717 ++I C NNGCNGGL Sbjct: 374 SIISCVWNEDNNGCNGGL 391 >UniRef50_Q23EG5 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 429 Score = 77.8 bits (183), Expect = 3e-13 Identities = 34/75 (45%), Positives = 51/75 (68%), Gaps = 5/75 (6%) Frame = +1 Query: 508 KLPEQVDWRKHGAVTDIKDQGK----CGSCWSFSTTGALEGQHFRQSGYL-VSLSEQNLI 672 ++P+ VDWR+ G V+ +KDQ CGSCW+FS TGA+E ++G +LS+Q L+ Sbjct: 121 EIPDYVDWREKGIVSSVKDQDAVGDDCGSCWTFSATGAIESHLALKTGKAPFNLSQQQLV 180 Query: 673 DCSEQYGNNGCNGGL 717 DC+ ++ N GC+GGL Sbjct: 181 DCAGKFDNQGCDGGL 195 >UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 291 Score = 77.8 bits (183), Expect = 3e-13 Identities = 50/151 (33%), Positives = 73/151 (48%) Frame = +1 Query: 268 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 447 E FR I+ +K+ + HN+ +YKL +N + E+ + K +K Sbjct: 12 EYKFRFGIWMANKNFVETHNKAN----ANYKLSLNSLSHLTPTEYQSLLG-----TKIDK 62 Query: 448 NLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHF 627 NL +G VR P P +D+R+ G V I+DQ +CGSCW+F T A E + Sbjct: 63 NLVSQGKKVR------PQIKDSPGILDYREMGVVNPIRDQKQCGSCWAFGTVAACESNYA 116 Query: 628 RQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 720 L LSEQN+IDC+ GC GG++ Sbjct: 117 LLYSNLPQLSEQNIIDCATTC--YGCGGGII 145 >UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_46, whole genome shotgun sequence - Paramecium tetraurelia Length = 336 Score = 77.8 bits (183), Expect = 3e-13 Identities = 50/166 (30%), Positives = 80/166 (48%) Frame = +1 Query: 217 EWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHH 396 E+ +K+++ +Y + ++ FR + +++ + KHN +Y + MN++ D+ Sbjct: 53 EFQRWKIEYGKSYSGQ-QEVFRFFNFQINRNKVNKHNSDPNK---TYFMKMNQFSDLSQE 108 Query: 397 EFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKC 576 EF N M+ + + N K VDWRK +T +KDQG+C Sbjct: 109 EF-----SLIYLTHDNAEEVMEQNLIIDELQKTQENDKTINSVDWRK---ITQVKDQGQC 160 Query: 577 GSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 CW+F GA E + ++ V LSEQ LIDC Q + GCNGG Sbjct: 161 SGCWAFGAVGAAEAWFYVKNKTTVLLSEQQLIDCDTQ--SFGCNGG 204 >UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; Theileria|Rep: Cysteine proteinase precursor - Theileria parva Length = 440 Score = 77.8 bits (183), Expect = 3e-13 Identities = 45/144 (31%), Positives = 70/144 (48%), Gaps = 13/144 (9%) Frame = +1 Query: 331 KYEMGLVSYKLGMNKYGDMLHHEFVKTM-----------NGFNKTAKHNKNLYMKG--GS 471 K + G Y G+N++ D+ EF K NG+ + Y+K + Sbjct: 157 KEQKGDEPYVKGINRFSDLTEREFYKLFPVMKPPKATYSNGYYLLSHMANKTYLKNLKKA 216 Query: 472 VRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVS 651 + + + A + E +DWR+ +VT +KDQ CG CW+FST G++EG + Sbjct: 217 LNTDEDVDLAKLT-GENLDWRRSSSVTSVKDQSNCGGCWAFSTVGSVEGYYMSHFDKSYE 275 Query: 652 LSEQNLIDCSEQYGNNGCNGGLMD 723 LS Q L+DC +NGC GGL++ Sbjct: 276 LSVQELLDCDS--FSNGCQGGLLE 297 >UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative cysteine proteinase - Oryza sativa subsp. japonica (Rice) Length = 385 Score = 77.4 bits (182), Expect = 3e-13 Identities = 56/163 (34%), Positives = 84/163 (51%), Gaps = 11/163 (6%) Frame = +1 Query: 259 SEVEDNFR-MKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTA 435 ++VE F K A H + + N+K M +Y+LG+N++ DM EF G +T Sbjct: 62 ADVESRFEAFKANARH---VNEFNKKEGM---TYRLGLNQFSDMTFEEFAGKFTG-GRTG 114 Query: 436 KHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKC----------GSC 585 +L + G+V K PA +P +W K+G VT +K+Q C GSC Sbjct: 115 SIAGDL--RDGAVTYCK--PPAVGYVPPSWNWTKYGVVTPVKNQLTCVNTIKMSMYEGSC 170 Query: 586 WSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 W+FS A+E + ++G L++LSEQ ++DCS G CNGG Sbjct: 171 WAFSVAAAVESINMIRTGNLLTLSEQQILDCS---GAGDCNGG 210 >UniRef50_Q24FN2 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 367 Score = 77.4 bits (182), Expect = 3e-13 Identities = 52/189 (27%), Positives = 92/189 (48%), Gaps = 24/189 (12%) Frame = +1 Query: 220 WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 399 ++ +K +++ +Y + E +FR ++ ++ I HN +YK+ +N++ D+ E Sbjct: 40 FNKWKFENKKSYFNHEEASFRQILFLKNLKNINFHNANKTH---TYKVAVNQFTDLTQEE 96 Query: 400 FVKT-MNGFNKTAKHNKNLYMKGGS----------------VRGAKFISPANVK----LP 516 F + +N A+ + L GG V+ + P ++ + Sbjct: 97 FEASYLNPILTQAEKLRFLQRDGGQNGGKDGGSNQTQNCTDVKNCQNPPPPVIQPLYNVS 156 Query: 517 EQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC---SEQ 687 + +DWR+ GAV+ +K+QG CGSCW+FS E + ++ L SEQ L+DC + Q Sbjct: 157 QSIDWRQSGAVSPVKNQGSCGSCWAFSAVALAESVNLLRNNSLALYSEQELVDCTYKNPQ 216 Query: 688 YGNNGCNGG 714 Y N GC GG Sbjct: 217 YYNYGCQGG 225 >UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanensis|Rep: Sui m 1 allergen - Suidasia medanensis Length = 336 Score = 77.4 bits (182), Expect = 3e-13 Identities = 53/167 (31%), Positives = 83/167 (49%), Gaps = 5/167 (2%) Frame = +1 Query: 229 FKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVK 408 FK + Y +E E R I+ E+ I +++ K+ GL +N++ D+ EF Sbjct: 31 FKELYGKQYTAEEEPQ-RRAIFEENLRWIQENHGKHGAGLE-----VNEHADLTAEEFSS 84 Query: 409 TMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCW 588 N+ A L+ + V S +V LP DWR+ T +++QG+CGSCW Sbjct: 85 MYATLNQEAFLKSPLHKEFVQVPE----SDISVALPAAFDWRQQWN-TAVRNQGQCGSCW 139 Query: 589 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSE-----QYGNNGCNGG 714 +F+T +E Q+ + V+LSEQ L+DC QY ++GC GG Sbjct: 140 AFATAATVEAQYAIRKNVHVTLSEQQLVDCDHRPFQGQYEDHGCQGG 186 >UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foetus|Rep: TFCP2 protein - Tritrichomonas foetus (Trichomonas foetus) Length = 270 Score = 77.0 bits (181), Expect = 4e-13 Identities = 34/71 (47%), Positives = 42/71 (59%), Gaps = 1/71 (1%) Frame = +1 Query: 514 PEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC-SEQY 690 P DWR G V IK+QG CGSCW+FS A E H +G L+ SEQ+L+DC + Y Sbjct: 51 PTSFDWRSEGKVNPIKNQGSCGSCWAFSAIAAQESCHAIATGELLRFSEQSLVDCVTSDY 110 Query: 691 GNNGCNGGLMD 723 GC+GG D Sbjct: 111 SCQGCSGGWPD 121 >UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Plasmodium|Rep: Cysteine proteinase precursor - Plasmodium vivax (strain Salvador I) Length = 583 Score = 77.0 bits (181), Expect = 4e-13 Identities = 57/184 (30%), Positives = 98/184 (53%), Gaps = 10/184 (5%) Frame = +1 Query: 193 QFFDLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMN 372 +FF+ + + ++K +N + E NF+M Y + I KHN+ +M YK+ +N Sbjct: 236 KFFNFMNKYKRSYK---DINEQMEKYKNFKMN-YLK----IKKHNETNQM----YKMKVN 283 Query: 373 KYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSV----RGAKFI---SPANV--KLPEQV 525 ++ D +F H K Y+ S +G + S AN+ +PE + Sbjct: 284 QFSDYSKKDFESYFRKLVPIPDHLKKKYVVPFSSMNNGKGKNVVTSSSGANLLADVPEIL 343 Query: 526 DWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQ-SGYLVSLSEQNLIDCSEQYGNNG 702 D+R+ G V + KDQG CGSCW+F++ G +E + ++ + +++LSEQ ++DCS+ N G Sbjct: 344 DYREKGIVHEPKDQGLCGSCWAFASVGNVECMYAKEHNKTILTLSEQEVVDCSKL--NFG 401 Query: 703 CNGG 714 C+GG Sbjct: 402 CDGG 405 >UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin O precursor; n=1; Tribolium castaneum|Rep: PREDICTED: similar to Cathepsin O precursor - Tribolium castaneum Length = 326 Score = 76.6 bits (180), Expect = 6e-13 Identities = 50/172 (29%), Positives = 79/172 (45%), Gaps = 1/172 (0%) Frame = +1 Query: 202 DLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYG 381 D + ++ + + Y+ R+ + + I N K G Y G+ K+ Sbjct: 29 DQAESQFQEYLKRFNKTYDDPSVYQNRLHAFKQSLQTIETLNSKKRNGSALY--GLTKFS 86 Query: 382 DMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIK 561 D+L EF +T N + K + N + R +P +VDWR+ AVT I Sbjct: 87 DLLPEEFFQTYLQSNLSQKTHSNEPKRHHHKRAT---------VPNKVDWREKNAVTRIY 137 Query: 562 DQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNN-GCNGG 714 +QG CG+CW++S +E + ++ LS Q +IDC+ GNN GCNGG Sbjct: 138 NQGSCGACWAYSVIETVESMNAIKTNKSEELSVQEIIDCA---GNNKGCNGG 186 >UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 306 Score = 76.2 bits (179), Expect = 8e-13 Identities = 46/149 (30%), Positives = 77/149 (51%) Frame = +1 Query: 268 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 447 E +FR+ I+ +K + + N + +G + L +N++ + +E+ ++M G+ K+ Sbjct: 25 EYHFRLGIWLSNKRYVQEKN-RVNLG---FTLALNRFAHLTENEY-RSMLGY----KYGH 75 Query: 448 NLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHF 627 Y +++ +P ++DWR+ G V IK+QG CGSCW+FS +E Q Sbjct: 76 KSYPITKNIKN---------DVPTEIDWREQGIVNKIKNQGACGSCWAFSAIQVIESQVA 126 Query: 628 RQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 + L LSEQNL+DC GC GG Sbjct: 127 KNQKQLYDLSEQNLLDCVTSC--FGCGGG 153 >UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin O precursor; n=2; Apocrita|Rep: PREDICTED: similar to Cathepsin O precursor - Apis mellifera Length = 374 Score = 75.8 bits (178), Expect = 1e-12 Identities = 47/158 (29%), Positives = 79/158 (50%), Gaps = 3/158 (1%) Frame = +1 Query: 250 NYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFV-KTM--NG 420 N SE E+ F+ + +HI + + Y G+ ++ DM +EF+ T+ + Sbjct: 70 NNPSEYEERFK-RFQRSLQHIERMNGLRSSQESAYY--GLTEFSDMSENEFLLHTLLPDL 126 Query: 421 FNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFST 600 + KH Y + + + ++ +P + DWR G +T ++ QG CG+CW+FST Sbjct: 127 PIRGEKHMNASYHRKHQISIDRM--KRSISIPLRFDWRDKGVITPVRSQGSCGACWAFST 184 Query: 601 TGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 +E ++G L SLS Q +IDC++ N GC GG Sbjct: 185 IEVIESMFAIKNGTLHSLSVQEMIDCAKN-SNFGCEGG 221 >UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa (japonica cultivar-group)|Rep: Os09g0562700 protein - Oryza sativa subsp. japonica (Rice) Length = 235 Score = 75.8 bits (178), Expect = 1e-12 Identities = 33/59 (55%), Positives = 44/59 (74%) Frame = +1 Query: 541 GAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGL 717 GAVT++KDQG+CGSCW+FST +EG + G LVSLSEQ L+DC ++GC+GG+ Sbjct: 19 GAVTEVKDQGRCGSCWAFSTVAVVEGIQKIKKGKLVSLSEQELVDCDTL--DSGCDGGV 75 >UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain - Tetrahymena pyriformis Length = 330 Score = 75.4 bits (177), Expect = 1e-12 Identities = 46/165 (27%), Positives = 82/165 (49%), Gaps = 4/165 (2%) Frame = +1 Query: 229 FKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVK 408 FK + Y+++ E+++R+ ++ E+ I +N L ++ +N + D+ EF Sbjct: 39 FKRNFGVTYKNQGEESYRLSVFLENLKSIEANNAN---PLSTHVEEVNSFTDLTEEEFAA 95 Query: 409 TMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCW 588 + + NK+L + + P+ +DW + +K+Q +CGSCW Sbjct: 96 RYLMKDLPQQMNKDL----------PILEMETLAAPQVIDWTAKNVLPPVKNQQQCGSCW 145 Query: 589 SFSTTGALEG-QHFRQSGYL-VSLSEQNLIDC--SEQYGNNGCNG 711 +FST G LEG + +S +S SEQ L+DC ++ +G GCNG Sbjct: 146 AFSTAGMLEGVYNIHESPQTPISFSEQQLVDCCGAQGFGCEGCNG 190 >UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n=1; Rattus norvegicus|Rep: UPI0000501FDB UniRef100 entry - Rattus norvegicus Length = 338 Score = 74.9 bits (176), Expect = 2e-12 Identities = 29/50 (58%), Positives = 37/50 (74%) Frame = +1 Query: 565 QGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 QG+C SCW+F GA+EGQ F+++G L LS QNL+DCS+ GN GC GG Sbjct: 139 QGRCNSCWAFPVVGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGG 188 >UniRef50_Q2QS15 Cluster: Papain family cysteine protease containing protein; n=1; Oryza sativa (japonica cultivar-group)|Rep: Papain family cysteine protease containing protein - Oryza sativa subsp. japonica (Rice) Length = 351 Score = 74.9 bits (176), Expect = 2e-12 Identities = 34/70 (48%), Positives = 45/70 (64%) Frame = +1 Query: 511 LPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQY 690 LP+ VDWRK GAV ++K CGSCW+FS A+EG ++G LVSL EQ L+DC ++ Sbjct: 145 LPKSVDWRKKGAVVEVKYHEDCGSCWAFSAVAAIEG--INKNGELVSLLEQELVDCDDE- 201 Query: 691 GNNGCNGGLM 720 GC G + Sbjct: 202 -AMGCGGSFL 210 >UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 343 Score = 74.9 bits (176), Expect = 2e-12 Identities = 54/165 (32%), Positives = 84/165 (50%), Gaps = 3/165 (1%) Frame = +1 Query: 229 FKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVK 408 F +++ Y +E E R I++ + ++ ++N K + G V+Y+L N + D+ E+ K Sbjct: 54 FLVKYLREYPNEYEIVKRFTIFSRNLDLVERYN-KEDAGKVTYEL--NDFSDLTEEEWKK 110 Query: 409 TMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRK-HGA--VTDIKDQGKCG 579 + H++ S++ I N LP VDWR +G VT IK QG CG Sbjct: 111 YL--MTPKPDHSEK------SLKPKTLIDKKN--LPNSVDWRNVNGTNHVTGIKYQGPCG 160 Query: 580 SCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 SCW+F+T A+E G L SLS Q L+DC+ ++ C GG Sbjct: 161 SCWAFATAAAIESAVSISGGGLQSLSSQQLLDCT--VVSDKCGGG 203 >UniRef50_Q23VA1 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 397 Score = 74.9 bits (176), Expect = 2e-12 Identities = 32/73 (43%), Positives = 44/73 (60%), Gaps = 5/73 (6%) Frame = +1 Query: 511 LPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCS--- 681 +P+ VDWR G V+ +KDQG+CG CW+FS T E + ++ L SEQ L+DC+ Sbjct: 180 VPQSVDWRIQGKVSPVKDQGRCGCCWAFSATALAESVNLMRNNTLQQYSEQELVDCTNNQ 239 Query: 682 --EQYGNNGCNGG 714 E Y + GC GG Sbjct: 240 YQEDYSSLGCGGG 252 Score = 33.5 bits (73), Expect = 5.4 Identities = 18/62 (29%), Positives = 30/62 (48%) Frame = +1 Query: 217 EWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHH 396 +++ +K QH Y + E NFR IY + +HN +YK+ N++ D+ Sbjct: 39 DFNKWKYQHGKKYFNADEANFRQLIYLMNLQKFNEHNSNPNN---TYKVATNQFSDLSQE 95 Query: 397 EF 402 EF Sbjct: 96 EF 97 >UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_101, whole genome shotgun sequence - Paramecium tetraurelia Length = 306 Score = 74.9 bits (176), Expect = 2e-12 Identities = 49/171 (28%), Positives = 82/171 (47%) Frame = +1 Query: 202 DLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYG 381 D ++ + +K +++ Y S+ ED +R +I+ ++ + + N + SY LG+N++ Sbjct: 24 DPLRRLYQEWKQKYQTRYTSQFEDEYRFEIFKQNYNYYQEVNSRQS----SYTLGINQFA 79 Query: 382 DMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIK 561 + EF + G ++ + S ++ LPE VDW + +K Sbjct: 80 TLTDEEFEQIYLGRADSSPIEIDE-------------SIDSINLPESVDWSSK--MNPVK 124 Query: 562 DQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 +QG CGS WSFS GA E G SEQNL+DC ++GC+GG Sbjct: 125 NQGTCGSGWSFSAVGAFEAFFIFVKGTHFQYSEQNLVDCDT--NSHGCDGG 173 >UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precursor; n=3; Plasmodium (Laverania)|Rep: Trophozoite cysteine proteinase precursor - Plasmodium falciparum Length = 569 Score = 74.9 bits (176), Expect = 2e-12 Identities = 52/175 (29%), Positives = 90/175 (51%), Gaps = 13/175 (7%) Frame = +1 Query: 229 FKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE--- 399 F +H Y++ E + +I+ + I HN+ + + YK +N++ D E Sbjct: 228 FMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAM--YKKKVNQFSDYSEEELKE 285 Query: 400 FVKTM-----NGFNKTAK----HNK-NLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAV 549 + KT+ + K +K H K N+ + G + K+PE +D+R+ G V Sbjct: 286 YFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIV 345 Query: 550 TDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 + KDQG CGSCW+F++ G +E +++ ++S SEQ ++DCS+ N GC+GG Sbjct: 346 HEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD--NFGCDGG 398 >UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleostomi|Rep: Cathepsin O precursor - Homo sapiens (Human) Length = 321 Score = 74.9 bits (176), Expect = 2e-12 Identities = 35/75 (46%), Positives = 45/75 (60%) Frame = +1 Query: 490 ISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNL 669 +S NV LP + DWR VT +++Q CG CW+FS GA+E + + L LS Q + Sbjct: 101 MSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQV 160 Query: 670 IDCSEQYGNNGCNGG 714 IDCS Y N GCNGG Sbjct: 161 IDCS--YNNYGCNGG 173 >UniRef50_Q60EQ9 Cluster: Putative uncharacterized protein OJ1280_A04.4; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative uncharacterized protein OJ1280_A04.4 - Oryza sativa subsp. japonica (Rice) Length = 340 Score = 74.5 bits (175), Expect = 2e-12 Identities = 35/68 (51%), Positives = 46/68 (67%) Frame = +1 Query: 511 LPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQY 690 LP+ +D RK GAV ++K Q CGSCW+FS A+EG ++G LVSLSEQ L+DC ++ Sbjct: 130 LPKSIDRRKKGAVVEVKYQEDCGSCWAFSAVAAIEG--INKNGELVSLSEQELVDCDDE- 186 Query: 691 GNNGCNGG 714 GC GG Sbjct: 187 -AVGCGGG 193 >UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteinase B; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like tick cysteine proteinase B - Haemaphysalis longicornis (Bush tick) Length = 332 Score = 74.5 bits (175), Expect = 2e-12 Identities = 54/145 (37%), Positives = 78/145 (53%), Gaps = 6/145 (4%) Frame = +1 Query: 202 DLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKY-EMGLVSYKLGMNKY 378 +LV EWSAFK H + S + + IY E++ IA+HN KY GLV + Sbjct: 21 ELVGAEWSAFKALHGKD-TSRKQKSTTGWIYMENRLKIARHNAKYANNGLVQAR------ 73 Query: 379 GDMLHHEFVKTMNGFNKTAKHNKNLY--MKGGSVRGAKFISPANVK---LPEQVDWRKHG 543 HE V + + +H + L + G G+ +I P ++ LP+ +DWRK G Sbjct: 74 -----HERVWRLVA-PRVCEHPQRLQAQLPGPPTWGSTYIEPEGLEDEHLPKTMDWRKKG 127 Query: 544 AVTDIKDQGKCGSCWSFSTTGALEG 618 AVT +K+QG+CGSCW+ S G+LEG Sbjct: 128 AVTPVKNQGQCGSCWA-SHYGSLEG 151 >UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2; Oryza sativa|Rep: Putative uncharacterized protein - Oryza sativa subsp. indica (Rice) Length = 149 Score = 74.1 bits (174), Expect = 3e-12 Identities = 30/59 (50%), Positives = 43/59 (72%) Frame = +1 Query: 511 LPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQ 687 +P+ +DWRK GAV ++K Q CGSCW+FS A+EG ++G LVSLS+Q L+DC ++ Sbjct: 17 MPKSIDWRKKGAVVEVKYQEDCGSCWAFSAVAAIEG--INKNGELVSLSKQELVDCDDE 73 >UniRef50_A3EXS2 Cluster: Cathepsin L-like cysteine proteinase-like protein; n=1; Maconellicoccus hirsutus|Rep: Cathepsin L-like cysteine proteinase-like protein - Maconellicoccus hirsutus (hibiscus mealybug) Length = 253 Score = 74.1 bits (174), Expect = 3e-12 Identities = 33/76 (43%), Positives = 48/76 (63%), Gaps = 1/76 (1%) Frame = +1 Query: 499 ANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQH-FRQSGYLVSLSEQNLID 675 A ++P +++W G VT + +QGKC W+FS TGALE + + V LSEQNLI+ Sbjct: 29 AQEEIPNEINWVAKGKVTPVGNQGKCNVGWAFSVTGALESEKAIKYEAAPVKLSEQNLIE 88 Query: 676 CSEQYGNNGCNGGLMD 723 CS +GN C+GG ++ Sbjct: 89 CSGGFGNKRCSGGNLE 104 >UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 234 Score = 73.7 bits (173), Expect = 4e-12 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 1/66 (1%) Frame = +1 Query: 511 LPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQ- 687 +P+++D+R GAV +IKDQ CGSCW+F + A+E F + G L SLSEQ L+DC Sbjct: 18 IPDEIDYRTKGAVNEIKDQKHCGSCWAFGSCAAMESSWFLKHGTLYSLSEQCLVDCCHDC 77 Query: 688 YGNNGC 705 G +GC Sbjct: 78 LGCHGC 83 >UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicalis|Rep: Blo t 1 allergen - Blomia tropicalis (Mite) Length = 333 Score = 73.7 bits (173), Expect = 4e-12 Identities = 47/159 (29%), Positives = 72/159 (45%), Gaps = 5/159 (3%) Frame = +1 Query: 253 YESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKT 432 Y + E+ R + E + +HN G+ + +N+Y DM EF F+ + Sbjct: 39 YRNAEEEARREHHFKEQLKWVEEHN-----GIDGVEYAINEYSDMSEQEF-----SFHLS 88 Query: 433 AKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGAL 612 YMK + + + + LP+ DWR+ +T I+ QG CGSCW+F+ G Sbjct: 89 GGGLNFTYMKMEAAKEPLINTYGS--LPQNFDWRQKARLTRIRQQGSCGSCWAFAAAGVA 146 Query: 613 EGQHFRQSGYLVSLSEQNLIDCS-----EQYGNNGCNGG 714 E + Q + LSEQ L+DC+ Y NGC G Sbjct: 147 ESLYSIQKQQSIELSEQELVDCTYNRYDSSYQCNGCGSG 185 >UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestivum|Rep: Cysteine protease - Triticum aestivum (Wheat) Length = 371 Score = 73.3 bits (172), Expect = 5e-12 Identities = 46/135 (34%), Positives = 66/135 (48%), Gaps = 13/135 (9%) Frame = +1 Query: 349 VSYKLGMNKYGDMLHHEFV-KTMNGFNKTAKHNKNLY--MKGGSVRGAKFISPA-----N 504 + Y+LG N++ D+ + EF+ + + G A L + G V GA A N Sbjct: 86 LGYELGENEFTDLTNEEFMARYVGGAYGGAGDGGGLITTLAGDVVEGAASSKNAIEEDRN 145 Query: 505 VKL-----PEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNL 669 + + P Q DWR+HG VT K QG CG CW+F+ +E + G LV LS Q L Sbjct: 146 LTMTASDPPRQFDWREHGVVTPAKQQGACGCCWAFAAAATVESLNKINGGELVDLSVQEL 205 Query: 670 IDCSEQYGNNGCNGG 714 +DCS ++ C G Sbjct: 206 VDCSTGVFSSPCGYG 220 >UniRef50_Q23FR0 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 394 Score = 73.3 bits (172), Expect = 5e-12 Identities = 33/74 (44%), Positives = 44/74 (59%), Gaps = 3/74 (4%) Frame = +1 Query: 502 NVKLPEQVDWRK-HGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC 678 N + VDWR + +KDQG+CGSCW+F G +E + +G L S SEQ L+DC Sbjct: 180 NTTVAASVDWRNVKNVLNPVKDQGQCGSCWTFGAAGVMESFNAITNGVLKSFSEQQLVDC 239 Query: 679 SEQYG--NNGCNGG 714 Q G ++GCNGG Sbjct: 240 VHQAGFSSDGCNGG 253 >UniRef50_A0DKP6 Cluster: Chromosome undetermined scaffold_54, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_54, whole genome shotgun sequence - Paramecium tetraurelia Length = 312 Score = 73.3 bits (172), Expect = 5e-12 Identities = 54/167 (32%), Positives = 88/167 (52%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 E+W KL+H + + +E E+ +R +I+ + I +HN +Y +GMNK+ + Sbjct: 34 EDW---KLKHGMQFLNE-ENQYRFQIFQTNLQKIEQHNSDESQ---TYTMGMNKFMHLTQ 86 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK 573 +F ++++ N +H Y+ G + + N++L +D+R H T +KDQG+ Sbjct: 87 EQF-QSLHLMN-IQEH----YV-GDQ---PEILQLGNIQLNASIDYRNH---TIVKDQGQ 133 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 C S W+FS TG LE VSLSEQ+LIDC + + GC G Sbjct: 134 CNSGWAFSVTGTLEVYQKIYQKKNVSLSEQHLIDCDQL--SRGCTDG 178 >UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Bigelowiella natans|Rep: Digestive cysteine proteinase - Bigelowiella natans (Pedinomonas minutissima) (Chlorarachnion sp.(strain CCMP 621)) Length = 360 Score = 72.9 bits (171), Expect = 7e-12 Identities = 54/172 (31%), Positives = 85/172 (49%), Gaps = 6/172 (3%) Frame = +1 Query: 217 EWSAFKLQHRLNYESE-VEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 ++ A+K + +YE ED R+ + E++ II N+ E+G Y G ++ DM Sbjct: 23 KFEAWKKEFGKSYEEAGKEDKARLN-FVENERIIQGLNEN-ELGSAVY--GHTRFSDMSP 78 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPAN-VKLPEQVDWRKHGAVTDIKDQG 570 +F M F +N A + N VK+ + DWR A+T +KDQG Sbjct: 79 EQFRAMMTPFKYHTDEAEN----------AAYDQNKNAVKVTDSFDWRDFNALTPVKDQG 128 Query: 571 KCGSCWSFSTTGALEGQHF-RQSGYL---VSLSEQNLIDCSEQYGNNGCNGG 714 CGSCW+FS T ALE H+ + + L ++LS + L++C + + C GG Sbjct: 129 GCGSCWAFSATQALESAHYIKHNDTLDSPIALSTEQLVECDQH--DYACYGG 178 >UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypanosoma cruzi|Rep: Cysteine protease, putative - Trypanosoma cruzi Length = 434 Score = 72.9 bits (171), Expect = 7e-12 Identities = 42/127 (33%), Positives = 63/127 (49%), Gaps = 6/127 (4%) Frame = +1 Query: 352 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 531 SY+LG+NK+ DM EF NG + A + + K PE ++W Sbjct: 80 SYRLGINKFSDMTKEEFNAKFNG--RVAAPQSTQSPQRAPYKRTK------ATFPEALNW 131 Query: 532 R--KHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQY----G 693 + K+ +T +KDQG CGSCW+ + T ++E + SG L++LS Q + C G Sbjct: 132 QEAKNPVLTPVKDQGSCGSCWAHAATESVESMYAISSGKLLTLSTQQITSCVNNTRKCGG 191 Query: 694 NNGCNGG 714 + GC GG Sbjct: 192 SGGCGGG 198 >UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-like cysteine peptidase; n=3; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L or K-like cysteine peptidase - Trichomonas vaginalis G3 Length = 320 Score = 72.9 bits (171), Expect = 7e-12 Identities = 46/153 (30%), Positives = 79/153 (51%), Gaps = 1/153 (0%) Frame = +1 Query: 268 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 447 E +FR I+ +K + + N +Y+L +N++ + + E+ K++ G ++K+N Sbjct: 37 EFHFRFGIFLANKRFVQEQNSINR----NYRLSLNQFSFLTNSEY-KSLLGGKVSSKNND 91 Query: 448 NLYMKGGSVRGAKFISPANVKLPEQV-DWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQH 624 + ++ SP + K E DWR G + I++QG+CG CW+FST +E + Sbjct: 92 DSHL----------FSPQSKKSSEVTFDWRTKGIINPIRNQGQCGLCWAFSTICCVEARW 141 Query: 625 FRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 723 + L+ LSEQ L+DC + GC GG D Sbjct: 142 AQAYNTLLQLSEQMLVDCVDTC--YGCMGGYAD 172 >UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: hypothetical protein, partial - Ornithorhynchus anatinus Length = 224 Score = 72.5 bits (170), Expect = 1e-11 Identities = 47/152 (30%), Positives = 76/152 (50%), Gaps = 1/152 (0%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 +++ F++++ +YE + E R +I+ ++ A+ Q+ + G + G+ + D+ Sbjct: 45 DKFKEFQIRYNKSYEDQAEHARRFEIFVQNL-ARARKLQEEDQGTAEF--GVTPFSDLSE 101 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK 573 EF+ + M V I PA E DWRK GAVT +K+QG Sbjct: 102 DEFLSL---------YAPRFRMPTSWVNQTARI-PAGPLRAETCDWRKEGAVTPVKNQGD 151 Query: 574 CGSCWSFSTTGALEGQ-HFRQSGYLVSLSEQN 666 CGSCW+F+ G +E + R S LVSLSEQ+ Sbjct: 152 CGSCWAFAAVGNVESMWYLRASNRLVSLSEQD 183 >UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin W, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to Cathepsin W, partial - Ornithorhynchus anatinus Length = 229 Score = 72.5 bits (170), Expect = 1e-11 Identities = 32/67 (47%), Positives = 43/67 (64%), Gaps = 1/67 (1%) Frame = +1 Query: 517 EQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSG-YLVSLSEQNLIDCSEQYG 693 E DWRK GA+T +K+QG CGSCW+F+ G E + ++G LVSLS Q ++DC Sbjct: 70 ETCDWRKRGAITSVKNQGSCGSCWAFAAVGNAESMWYLRAGKRLVSLSVQEVLDCGR--C 127 Query: 694 NNGCNGG 714 +GC GG Sbjct: 128 RDGCQGG 134 >UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin O; n=1; Danio rerio|Rep: PREDICTED: similar to cathepsin O - Danio rerio Length = 327 Score = 72.5 bits (170), Expect = 1e-11 Identities = 39/98 (39%), Positives = 51/98 (52%), Gaps = 5/98 (5%) Frame = +1 Query: 436 KHNKNLYMKGGSVRGAKFI-SPANVKL----PEQVDWRKHGAVTDIKDQGKCGSCWSFST 600 K K Y+ + KF S + +K+ P + DWR HG V + +QG CG CW+FS Sbjct: 90 KQFKEQYLTARAEAAPKFDQSKSEIKVKANNPPRFDWRDHGVVGPVHNQGSCGGCWAFSI 149 Query: 601 TGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 A+E + L LS Q +IDCS Y N GCNGG Sbjct: 150 VEAIESVSAKVGEKLQQLSVQQVIDCS--YQNQGCNGG 185 >UniRef50_Q0JP65 Cluster: Os01g0240900 protein; n=3; Oryza sativa|Rep: Os01g0240900 protein - Oryza sativa subsp. japonica (Rice) Length = 166 Score = 72.1 bits (169), Expect = 1e-11 Identities = 32/53 (60%), Positives = 40/53 (75%), Gaps = 3/53 (5%) Frame = +1 Query: 529 WRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSG---YLVSLSEQNLIDC 678 WR GAVTD+K QG C SCW+FSTTGA+EG +F SG L++LSEQ L++C Sbjct: 104 WRDRGAVTDVKMQGTCASCWAFSTTGAVEGDNFLASGNLRNLLNLSEQQLVNC 156 >UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative uncharacterized protein - Oryza sativa subsp. japonica (Rice) Length = 289 Score = 71.3 bits (167), Expect = 2e-11 Identities = 49/167 (29%), Positives = 77/167 (46%), Gaps = 1/167 (0%) Frame = +1 Query: 214 EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLH 393 EEW A + Y+ E R I+ ++ H I + + +G+N++ D+ + Sbjct: 44 EEWMA---KFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSA---VGINQFADLTN 97 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGK 573 EFV T G H K + + P + P +DWR GAVT +KDQG Sbjct: 98 DEFVATYTGAKPP--HPKE---------APRPVDP--IWTPCCIDWRFRGAVTGVKDQGA 144 Query: 574 CGSCWSFSTTGALEGQHFRQSGYLVSLSE-QNLIDCSEQYGNNGCNG 711 CGSCW+F+ A+EG ++G L LS+ + L++ Q+ G Sbjct: 145 CGSCWAFAAVAAIEGLTKIRTGQLTPLSDARTLVELRNQHATGAAAG 191 >UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1; Uronema marinum|Rep: Cathepsin L-like cysteine protease - Uronema marinum Length = 333 Score = 71.3 bits (167), Expect = 2e-11 Identities = 50/168 (29%), Positives = 82/168 (48%), Gaps = 2/168 (1%) Frame = +1 Query: 217 EWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGM-NKYGDMLH 393 ++ +K H L Y S ED +R ++Y E+ + + N S+ LG+ N++ M + Sbjct: 35 KFKEWKQNHNLVYSSS-EDAYRFQVYFENFQFVEEFNANN-----SFTLGVENQFAAMTN 88 Query: 394 HEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPE-QVDWRKHGAVTDIKDQG 570 EF A+ + +G + + V P V+W GAV +++QG Sbjct: 89 EEF---------KAQFTSEIISEGYNYQQVDRNVYEAVNAPSGSVNWVSKGAVQGVQNQG 139 Query: 571 KCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGG 714 CGSCW+FS +LE + +G L+S SEQ L+ C + + GC+GG Sbjct: 140 VCGSCWAFSAVCSLERLYKINTGKLLSFSEQQLVSCEPK--SYGCDGG 185 Database: uniref50 Posted date: Oct 5, 2007 11:19 AM Number of letters in database: 575,637,011 Number of sequences in database: 1,657,284 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.279 0.0580 0.190 Matrix: BLOSUM62 Gap Penalties: Existence: 9, Extension: 2 Number of Hits to DB: 688,744,072 Number of Sequences: 1657284 Number of extensions: 14240659 Number of successful extensions: 52571 Number of sequences better than 10.0: 500 Number of HSP's better than 10.0 without gapping: 48544 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 52131 length of database: 575,637,011 effective HSP length: 98 effective length of database: 413,223,179 effective search space used: 59090914597 frameshift window, decay const: 40, 0.1 T: 12 A: 40 X1: 16 ( 7.3 bits) X2: 37 (14.9 bits) X3: 62 (25.0 bits) S1: 41 (21.7 bits)
- SilkBase 1999-2023 -