BLASTX 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= wdS00165
         (584 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C...   167   1e-40
UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep...   146   4e-34
UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M...   145   8e-34
UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca...   138   7e-32
UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata...   138   9e-32
UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar ...   138   1e-31
UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome s...   136   3e-31
UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n...   136   3e-31
UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs...   135   6e-31
UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n...   134   2e-30
UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi...   131   1e-29
UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr...   130   2e-29
UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|...   130   2e-29
UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;...   130   2e-29
UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j...   129   5e-29
UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat...   129   5e-29
UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:...   126   4e-28
UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl...   125   9e-28
UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip...   124   2e-27
UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei...   124   2e-27
UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ...   124   2e-27
UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 p...   123   4e-27
UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; ...   122   6e-27
UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteina...   122   8e-27
UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18...   122   8e-27
UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ...   121   1e-26
UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto...   120   3e-26
UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheir...   119   6e-26
UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n...   118   1e-25
UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3...   118   1e-25
UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=...   117   2e-25
UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ...   116   3e-25
UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox...   116   3e-25
UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h...   116   4e-25
UniRef50_Q24E33 Cluster: Papain family cysteine protease contain...   116   4e-25
UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]...   116   5e-25
UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;...   116   5e-25
UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca...   115   7e-25
UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; ...   115   7e-25
UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy...   115   9e-25
UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35...   114   1e-24
UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Re...   114   1e-24
UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; D...   114   2e-24
UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n...   113   4e-24
UniRef50_Q239L8 Cluster: Papain family cysteine protease contain...   112   5e-24
UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory...   112   5e-24
UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n...   112   7e-24
UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain...   112   7e-24
UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus...   111   9e-24
UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat...   111   2e-23
UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy...   110   3e-23
UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus...   110   3e-23
UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicate...   109   4e-23
UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ...   109   5e-23
UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica...   109   6e-23
UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcopt...   108   8e-23
UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [C...   108   8e-23
UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re...   108   1e-22
UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ...   108   1e-22
UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ...   108   1e-22
UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D...   107   2e-22
UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal...   106   3e-22
UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ...   106   4e-22
UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain...   106   4e-22
UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt...   105   8e-22
UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus ...   105   8e-22
UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio...   105   1e-21
UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ...   104   1e-21
UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e...   104   1e-21
UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole...   104   2e-21
UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|...   104   2e-21
UniRef50_Q22W19 Cluster: Papain family cysteine protease contain...   103   2e-21
UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=...   103   2e-21
UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;...   103   3e-21
UniRef50_Q24HI2 Cluster: Papain family cysteine protease contain...   103   3e-21
UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No...   103   4e-21
UniRef50_Q22A69 Cluster: Papain family cysteine protease contain...   103   4e-21
UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia ...   102   5e-21
UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ...   101   9e-21
UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; ...   101   2e-20
UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae...   101   2e-20
UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er...   101   2e-20
UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Ac...   100   2e-20
UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi...   100   2e-20
UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R...    99   4e-20
UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryz...   100   5e-20
UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome s...    99   7e-20
UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio...    99   7e-20
UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ...    99   7e-20
UniRef50_O16454 Cluster: Temporarily assigned gene name protein ...    99   9e-20
UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ...    99   9e-20
UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh...    99   9e-20
UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1...    98   1e-19
UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr...    98   1e-19
UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ...    98   2e-19
UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhip...    98   2e-19
UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty...    97   2e-19
UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc...    97   2e-19
UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ...    97   3e-19
UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|...    97   3e-19
UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadida...    97   4e-19
UniRef50_Q23EG5 Cluster: Papain family cysteine protease contain...    97   4e-19
UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R...    97   4e-19
UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, wh...    97   4e-19
UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:...    96   6e-19
UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;...    95   8e-19
UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium...    95   8e-19
UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-...    95   1e-18
UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n...    95   1e-18
UniRef50_Q235G6 Cluster: Papain family cysteine protease contain...    95   1e-18
UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ...    94   2e-18
UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t...    94   2e-18
UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re...    94   3e-18
UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph...    94   3e-18
UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster...    94   3e-18
UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep...    93   3e-18
UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryo...    93   3e-18
UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2...    93   3e-18
UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C...    93   4e-18
UniRef50_P25773 Cluster: Cathepsin L; n=38; Eukaryota|Rep: Cathe...    93   4e-18
UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ...    93   6e-18
UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryz...    93   6e-18
UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina...    93   6e-18
UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine p...    92   8e-18
UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ...    92   8e-18
UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ...    92   8e-18
UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ...    92   1e-17
UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ...    92   1e-17
UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease ...    91   1e-17
UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb...    91   1e-17
UniRef50_Q23TW3 Cluster: Papain family cysteine protease contain...    91   1e-17
UniRef50_Q22NW9 Cluster: Papain family cysteine protease contain...    91   1e-17
UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, wh...    91   1e-17
UniRef50_Q23FS6 Cluster: Papain family cysteine protease contain...    91   2e-17
UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome s...    91   2e-17
UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ...    90   4e-17
UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; ...    90   4e-17
UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin ...    89   7e-17
UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa...    89   7e-17
UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ...    89   7e-17
UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi...    89   7e-17
UniRef50_Q0MYX5 Cluster: Putative cysteine protease; n=1; Emilia...    89   9e-17
UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1; ...    89   9e-17
UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dv...    88   1e-16
UniRef50_UPI0000E492F4 Cluster: PREDICTED: similar to cathepsin ...    88   2e-16
UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia...    88   2e-16
UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w...    88   2e-16
UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine prot...    87   2e-16
UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi...    87   2e-16
UniRef50_UPI0000E224BB Cluster: PREDICTED: hypothetical protein;...    87   3e-16
UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ...    87   3e-16
UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|...    87   3e-16
UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep: L...    87   4e-16
UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel...    87   4e-16
UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa...    86   5e-16
UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum...    86   5e-16
UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ...    86   5e-16
UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n...    86   7e-16
UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve...    86   7e-16
UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8; Magnoliophyt...    85   9e-16
UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaste...    85   9e-16
UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ...    85   2e-15
UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain...    85   2e-15
UniRef50_Q248G1 Cluster: Papain family cysteine protease contain...    84   2e-15
UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cy...    84   2e-15
UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov...    84   3e-15
UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:...    84   3e-15
UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ...    83   5e-15
UniRef50_Q4TI44 Cluster: Chromosome undetermined SCAF2412, whole...    83   5e-15
UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep...    83   5e-15
UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ...    83   6e-15
UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis...    83   6e-15
UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; ...    83   6e-15
UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v...    82   8e-15
UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ...    82   1e-14
UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic...    81   1e-14
UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted...    81   2e-14
UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foet...    81   2e-14
UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain...    81   2e-14
UniRef50_A3EXS2 Cluster: Cathepsin L-like cysteine proteinase-li...    81   2e-14
UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-li...    81   2e-14
UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease ...    80   3e-14
UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sa...    80   3e-14
UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh...    80   3e-14
UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa...    80   4e-14
UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis...    80   4e-14
UniRef50_Q23H32 Cluster: Papain family cysteine protease contain...    80   4e-14
UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain...    80   4e-14
UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy...    80   4e-14
UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Pl...    79   6e-14
UniRef50_Q23RT7 Cluster: Papain family cysteine protease contain...    79   6e-14
UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ...    79   8e-14
UniRef50_Q231X3 Cluster: Papain family cysteine protease contain...    79   8e-14
UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|...    79   1e-13
UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy...    79   1e-13
UniRef50_Q22DX2 Cluster: Papain family cysteine protease contain...    78   1e-13
UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Re...    78   1e-13
UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ...    78   2e-13
UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endop...    78   2e-13
UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, wh...    78   2e-13
UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n...    77   3e-13
UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, wh...    77   3e-13
UniRef50_A2F4G0 Cluster: Clan CA, family C1, cathepsin L-like cy...    77   4e-13
UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen...    77   4e-13
UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve...    76   5e-13
UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease ...    76   7e-13
UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza sativa...    76   7e-13
UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckei...    76   7e-13
UniRef50_Q26988 Cluster: Cysteine proteinase 4; n=1; Tritrichomo...    76   7e-13
UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain...    76   7e-13
UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy...    75   9e-13
UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba hi...    75   1e-12
UniRef50_Q22RC9 Cluster: Papain family cysteine protease contain...    75   1e-12
UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ...    75   2e-12
UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H...    75   2e-12
UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease ...    74   2e-12
UniRef50_UPI000155637A Cluster: PREDICTED: similar to ENSANGP000...    74   3e-12
UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, w...    74   3e-12
UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big...    73   4e-12
UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|...    73   5e-12
UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba hi...    73   5e-12
UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, wh...    73   5e-12
UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:...    73   5e-12
UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G...    73   7e-12
UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: ...    73   7e-12
UniRef50_Q23H10 Cluster: Papain family cysteine protease contain...    73   7e-12
UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precu...    73   7e-12
UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schist...    72   9e-12
UniRef50_A2WPN4 Cluster: Putative uncharacterized protein; n=2; ...    72   1e-11
UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2; Arab...    71   2e-11
UniRef50_A2Q4E7 Cluster: Peptidase C1A, papain; n=1; Medicago tr...    71   2e-11
UniRef50_Q7KWP5 Cluster: Similar to Dictyostelium discoideum (Sl...    71   2e-11
UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Re...    71   2e-11
UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n...    71   2e-11
UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Try...    71   2e-11
UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain...    71   2e-11
UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:...    71   2e-11
UniRef50_Q23894 Cluster: Cysteine proteinase 3; n=2; Dictyosteli...    71   2e-11
UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa...    71   3e-11
UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, w...    71   3e-11
UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr...    70   4e-11
UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA,...    70   5e-11
UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestiv...    70   5e-11
UniRef50_Q5C1E8 Cluster: SJCHGC04937 protein; n=1; Schistosoma j...    70   5e-11
UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Pir...    70   5e-11
UniRef50_Q23H06 Cluster: Papain family cysteine protease contain...    70   5e-11
UniRef50_UPI0000D566F0 Cluster: PREDICTED: similar to CathePsin ...    69   6e-11
UniRef50_Q17IZ6 Cluster: Procathepsin L3, putative; n=2; Culicid...    69   8e-11
UniRef50_A0DKP6 Cluster: Chromosome undetermined scaffold_54, wh...    69   8e-11
UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin ...    69   1e-10
UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; ...    69   1e-10
UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain...    69   1e-10
UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; The...    69   1e-10
UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryz...    68   1e-10
UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain...    68   1e-10
UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ...    68   2e-10
UniRef50_A2DKX2 Cluster: Clan CA, family C1, cathepsin L-like cy...    68   2e-10
UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali...    68   2e-10
UniRef50_Q9BL26 Cluster: Putative uncharacterized protein; n=4; ...    67   2e-10
UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lambli...    67   2e-10
UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Pl...    67   2e-10
UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Gi...    67   2e-10
UniRef50_UPI0001509FF9 Cluster: Papain family cysteine protease ...    67   3e-10
UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG...    67   3e-10
UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain ...    67   3e-10
UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, wh...    66   4e-10
UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cy...    66   6e-10
UniRef50_Q9XXQ7 Cluster: Putative uncharacterized protein; n=2; ...    66   8e-10
UniRef50_Q9VNK7 Cluster: CG1075-PA; n=1; Drosophila melanogaster...    66   8e-10
UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lambli...    66   8e-10
UniRef50_Q23H15 Cluster: Papain family cysteine protease contain...    66   8e-10
UniRef50_A7APS9 Cluster: Papain family cysteine protease contain...    66   8e-10
UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-...    65   1e-09
UniRef50_A0DCA5 Cluster: Chromosome undetermined scaffold_45, wh...    65   1e-09
UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; M...    65   1e-09
UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precurs...    65   1e-09
UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome s...    64   2e-09
UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi...    64   2e-09
UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1...    64   2e-09
UniRef50_A0BNM1 Cluster: Chromosome undetermined scaffold_119, w...    64   2e-09
UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto...    64   2e-09
UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil...    64   3e-09
UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomus...    64   3e-09
UniRef50_Q26987 Cluster: Cysteine proteinase 3; n=1; Tritrichomo...    63   4e-09
UniRef50_A3FQ13 Cluster: Cathepsin like thiol protease possibly ...    63   4e-09
UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;...    63   5e-09
UniRef50_Q650W8 Cluster: Putative cysteine proteinase; n=2; Oryz...    63   5e-09
UniRef50_Q75JH0 Cluster: Similar to Dictyostelium discoideum (Sl...    63   5e-09
UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Pl...    63   5e-09
UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|R...    62   7e-09
UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p; ...    62   1e-08
UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcop...    62   1e-08
UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; The...    62   1e-08
UniRef50_Q5CYC8 Cluster: Cathepsin like thiol protease possibly ...    61   2e-08
UniRef50_Q3L7L7 Cluster: Sar s 1 allergen Yv5020C01; n=1; Sarcop...    61   2e-08
UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3; ...    61   2e-08
UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein...    61   2e-08
UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcop...    61   2e-08
UniRef50_A0CHZ5 Cluster: Chromosome undetermined scaffold_186, w...    61   2e-08
UniRef50_Q7M1Q7 Cluster: Actinidain; n=1; Actinidia chinensis|Re...    60   3e-08
UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; ...    60   3e-08
UniRef50_A6GBK8 Cluster: Peptidase C1A, papain; n=1; Plesiocysti...    60   4e-08
UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1; ...    60   4e-08
UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag...    60   4e-08
UniRef50_UPI0000498E2F Cluster: cysteine proteinase; n=1; Entamo...    60   5e-08
UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;...    60   5e-08
UniRef50_Q26993 Cluster: Cysteine proteinase 9; n=1; Tritrichomo...    60   5e-08
UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella ve...    60   5e-08
UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: C...    59   7e-08
UniRef50_Q26989 Cluster: Cysteine proteinase 5; n=1; Tritrichomo...    59   7e-08
UniRef50_A0CS14 Cluster: Chromosome undetermined scaffold_26, wh...    59   7e-08
UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin ...    59   9e-08
UniRef50_Q01E11 Cluster: Cysteine proteinase Cathepsin F; n=3; O...    59   9e-08
UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - ...    59   9e-08
UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like cy...    59   9e-08
UniRef50_Q94715 Cluster: Putative cathepsin L2 precursor; n=4; P...    59   9e-08
UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ...    58   1e-07
UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcop...    58   1e-07
UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru...    58   1e-07
UniRef50_Q015J8 Cluster: Cathepsin; n=2; Ostreococcus|Rep: Cathe...    58   2e-07
UniRef50_Q24F16 Cluster: Papain family cysteine protease contain...    58   2e-07
UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;...    58   2e-07
UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyl...    58   2e-07
UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; ...    58   2e-07
UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like cy...    58   2e-07
UniRef50_UPI0000E815AE Cluster: PREDICTED: similar to glucocorti...    57   3e-07
UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin ...    57   3e-07
UniRef50_Q53K53 Cluster: Cysteine protease 1, putative; n=5; Ory...    57   4e-07
UniRef50_A0EAG2 Cluster: Chromosome undetermined scaffold_86, wh...    57   4e-07
UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2; cel...    57   4e-07
UniRef50_Q9GU75 Cluster: Thiolproteinase; n=2; Babesia|Rep: Thio...    56   5e-07
UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like cy...    56   5e-07
UniRef50_Q60EQ9 Cluster: Putative uncharacterized protein OJ1280...    56   6e-07
UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|R...    56   6e-07
UniRef50_A0BLR4 Cluster: Chromosome undetermined scaffold_115, w...    56   6e-07
UniRef50_Q7Q9Y5 Cluster: ENSANGP00000012222; n=1; Anopheles gamb...    56   8e-07
UniRef50_A0BZE1 Cluster: Chromosome undetermined scaffold_139, w...    56   8e-07
UniRef50_Q5CKC2 Cluster: Preprocathepsin c; n=2; Cryptosporidium...    55   1e-06
UniRef50_UPI00006CBCFB Cluster: Papain family cysteine protease ...    55   1e-06
UniRef50_Q0JP65 Cluster: Os01g0240900 protein; n=3; Oryza sativa...    55   1e-06
UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoid...    55   1e-06
UniRef50_Q237A1 Cluster: Papain family cysteine protease contain...    55   1e-06
UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|R...    55   1e-06
UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=...    55   1e-06
UniRef50_A0DZB1 Cluster: Chromosome undetermined scaffold_7, who...    55   1e-06
UniRef50_P56202 Cluster: Cathepsin W precursor; n=15; Eutheria|R...    54   2e-06
UniRef50_Q01FU9 Cluster: Cathepsin Z; n=2; Ostreococcus|Rep: Cat...    54   2e-06
UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep...    54   2e-06
UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ...    54   3e-06
UniRef50_Q9XZM9 Cluster: Cysteine proteinase CPW2; n=1; Acantham...    54   3e-06
UniRef50_Q9XZI2 Cluster: Cathepsin Z1 preproprotein; n=3; Bilate...    54   3e-06
UniRef50_Q7QQ92 Cluster: GLP_243_18349_20043; n=1; Giardia lambl...    54   3e-06
UniRef50_A4VE98 Cluster: Cathepsin z; n=3; Tetrahymena thermophi...    54   3e-06
UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like prote...    54   3e-06
UniRef50_Q54VR1 Cluster: Putative uncharacterized protein; n=1; ...    53   4e-06
UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1; ...    53   4e-06
UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, who...    53   4e-06
UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3....    53   4e-06
UniRef50_UPI00006CBB5F Cluster: Papain family cysteine protease ...    53   6e-06
UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; R...    53   6e-06
UniRef50_Q9NHY2 Cluster: Cysteine protease cp1; n=2; Theileria c...    53   6e-06
UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia...    53   6e-06
UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4; Caenorhabdi...    53   6e-06
UniRef50_Q6XM39 Cluster: FirrV-1-A48 precursor; n=1; Feldmannia ...    52   8e-06
UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-li...    52   8e-06
UniRef50_UPI0000E49DA9 Cluster: PREDICTED: similar to cathepsin ...    52   1e-05
UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-li...    52   1e-05
UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ...    52   1e-05
UniRef50_Q9NHY1 Cluster: Cysteine protease cp2; n=1; Theileria c...    52   1e-05
UniRef50_Q23FP9 Cluster: Papain family cysteine protease contain...    52   1e-05
UniRef50_Q18740 Cluster: Putative uncharacterized protein tag-32...    52   1e-05
UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase pr...    52   1e-05
UniRef50_Q9LUX8 Cluster: Cysteine protease; n=1; Pyrus pyrifolia...    51   2e-05
UniRef50_Q2QS15 Cluster: Papain family cysteine protease contain...    51   2e-05
UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=...    51   2e-05
UniRef50_A1XG92 Cluster: Putative cathepsin B-like like proteina...    51   2e-05
UniRef50_Q84YE7 Cluster: Cysteine proteinase-like protein; n=1; ...    51   2e-05
UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2; ...    51   2e-05
UniRef50_Q7QXA2 Cluster: GLP_217_11853_10927; n=1; Giardia lambl...    51   2e-05
UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8...    51   2e-05
UniRef50_Q54R55 Cluster: Putative uncharacterized protein; n=1; ...    51   2e-05
UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3; Theil...    51   2e-05
UniRef50_Q3L7L6 Cluster: Sar s 1 allergen Yv5032C08; n=1; Sarcop...    51   2e-05
UniRef50_Q3L7L0 Cluster: Sar s 1 allergen SMIPP-C Yv5009F04; n=3...    51   2e-05
UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep: Ca...    51   2e-05
UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome sh...    50   3e-05
UniRef50_UPI00006CB653 Cluster: Papain family cysteine protease ...    50   4e-05
UniRef50_A0CYS7 Cluster: Chromosome undetermined scaffold_31, wh...    50   4e-05
UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin ...    50   5e-05
UniRef50_Q7QWN9 Cluster: GLP_26_50243_51811; n=2; Giardia lambli...    50   5e-05
UniRef50_A7AX75 Cluster: Preprocathepsin c, putative; n=1; Babes...    50   5e-05
UniRef50_A5HC50 Cluster: Cathepsin W; n=2; Theria|Rep: Cathepsin...    49   7e-05
UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115, w...    49   7e-05
UniRef50_Q9UBR2 Cluster: Cathepsin Z precursor; n=40; Bilateria|...    49   7e-05
UniRef50_Q84SA7 Cluster: Thiol protease; n=1; Aster tripolium|Re...    49   9e-05
UniRef50_O48605 Cluster: Putative thiol protease; n=1; Hordeum v...    49   9e-05
UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like cy...    49   9e-05
UniRef50_Q9UJW2 Cluster: Tubulointerstitial nephritis antigen; n...    49   9e-05
UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Gi...    49   9e-05
UniRef50_UPI0000E4866E Cluster: PREDICTED: similar to oxidized-L...    48   1e-04
UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor; ...    48   1e-04
UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7...    48   1e-04
UniRef50_Q962W3 Cluster: Cysteine protease; n=3; Giardia intesti...    48   2e-04
UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4; Ancylostomati...    48   2e-04
UniRef50_A7SNM3 Cluster: Predicted protein; n=1; Nematostella ve...    48   2e-04
UniRef50_Q649T1 Cluster: Cathepsin C; n=1; uncultured archaeon G...    48   2e-04
UniRef50_UPI0000498719 Cluster: cysteine protease 18-related; n=...    48   2e-04
UniRef50_Q650X9 Cluster: Putative cysteine proteinase; n=1; Oryz...    48   2e-04
UniRef50_Q5YER6 Cluster: Cathepsin Z; n=1; Bigelowiella natans|R...    48   2e-04
UniRef50_Q93512 Cluster: Putative uncharacterized protein; n=2; ...    48   2e-04
UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gamb...    48   2e-04
UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theil...    48   2e-04
UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n...    48   2e-04
UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep: Ca...    48   2e-04
UniRef50_UPI000155C322 Cluster: PREDICTED: similar to cathepsin ...    47   3e-04
UniRef50_A0LQG9 Cluster: Peptidase C1A, papain; n=1; Syntrophoba...    47   3e-04
UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1; O...    47   3e-04
UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia...    47   3e-04
UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intest...    47   3e-04
UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4 ...    47   3e-04
UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA...    47   4e-04
UniRef50_Q9GZM7-2 Cluster: Isoform 2 of Q9GZM7 ; n=1; Homo sapie...    47   4e-04
UniRef50_A1SVF0 Cluster: Peptidase C1A, papain; n=1; Psychromona...    47   4e-04
UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2...    47   4e-04
UniRef50_A0DIY3 Cluster: Chromosome undetermined scaffold_52, wh...    47   4e-04
UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep...    46   5e-04
UniRef50_P05993 Cluster: Cysteine proteinase; n=7; Eukaryota|Rep...    46   5e-04
UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 ...    46   5e-04
UniRef50_Q9LFI9 Cluster: Putative uncharacterized protein F2K13_...    46   7e-04
UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lambl...    46   7e-04
UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathep...    46   7e-04
UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7; Rhabditida|...    46   7e-04
UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase pr...    46   7e-04
UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma...    46   7e-04
UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep...    46   9e-04
UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Catheps...    46   9e-04
UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1; ...    46   9e-04
UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Ca...    46   9e-04
UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n...    45   0.001
UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,...    45   0.002
UniRef50_Q3A8I3 Cluster: Putative serine protease; n=1; Pelobact...    45   0.002
UniRef50_Q01FL5 Cluster: Cysteine protease-5; n=3; Ostreococcus|...    45   0.002
UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestin...    45   0.002
UniRef50_Q7JYA0 Cluster: RE20049p; n=2; Sophophora|Rep: RE20049p...    45   0.002
UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep...    45   0.002
UniRef50_Q6A1H9 Cluster: Cathepsin X/O; n=1; Suberites domuncula...    45   0.002
UniRef50_O96163 Cluster: Cysteine protease, putative; n=5; Plasm...    45   0.002
UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1 ...    45   0.002
UniRef50_Q4N5Z7 Cluster: Cysteine proteinase, putative; n=2; The...    44   0.002
UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus co...    44   0.002
UniRef50_Q0PWU8 Cluster: Cathepsin B preproprotein-like protein;...    44   0.002
UniRef50_A2DPZ6 Cluster: Clan CA, family C1, cathepsin B-like cy...    44   0.002
UniRef50_UPI0000EBEFA5 Cluster: PREDICTED: similar to Cathepsin ...    44   0.003
UniRef50_Q61CG7 Cluster: Putative uncharacterized protein CBG129...    32   0.003
UniRef50_Q7QQ83 Cluster: GLP_42_16392_14707; n=1; Giardia lambli...    44   0.004
UniRef50_Q5CM16 Cluster: P3ECSL-related; n=2; Cryptosporidium|Re...    44   0.004
UniRef50_A0EI50 Cluster: Chromosome undetermined scaffold_98, wh...    44   0.004
UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2 ...    44   0.004
UniRef50_A2XHS0 Cluster: Putative uncharacterized protein; n=2; ...    43   0.005
UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1; Rhip...    43   0.005
UniRef50_Q1AMF1 Cluster: Cathepsin C3; n=1; Toxoplasma gondii|Re...    43   0.005
UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|...    43   0.005
UniRef50_A7QDM1 Cluster: Chromosome chr10 scaffold_81, whole gen...    42   0.008
UniRef50_A4S004 Cluster: Predicted protein; n=2; Ostreococcus|Re...    42   0.008
UniRef50_Q4UCF5 Cluster: Cysteine proteinase, tacP, putative; n=...    42   0.008
UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8; Leishma...    42   0.008
UniRef50_A0DTZ2 Cluster: Chromosome undetermined scaffold_63, wh...    42   0.008
UniRef50_Q9U7F7 Cluster: Cysteine protease; n=2; Entamoeba histo...    42   0.011
UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep: ...
42 0.011 UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n... 42 0.011 UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpw... 42 0.011 UniRef50_Q4UFL9 Cluster: Cathepsin-like cysteine protease, putat... 42 0.011 UniRef50_Q0AY53 Cluster: Putative uncharacterized protein; n=1; ... 42 0.014 UniRef50_UPI000155D183 Cluster: PREDICTED: similar to Cathepsin ... 41 0.019 UniRef50_A4RYN2 Cluster: Predicted protein; n=1; Ostreococcus lu... 41 0.019 UniRef50_Q8I8D5 Cluster: Cysteine protease 13; n=2; Entamoeba hi... 41 0.019 UniRef50_Q4N3V5 Cluster: Cathepsin C, putative; n=1; Theileria p... 41 0.019 UniRef50_UPI0000D9FBA6 Cluster: PREDICTED: similar to Cathepsin ... 41 0.025 UniRef50_A0LES1 Cluster: Peptidase C1A, papain precursor; n=2; S... 41 0.025 UniRef50_Q8I8D3 Cluster: Cysteine protease 15; n=2; Entamoeba hi... 41 0.025 UniRef50_O62484 Cluster: Putative uncharacterized protein; n=1; ... 31 0.028 UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin ... 40 0.043 UniRef50_UPI00006CBAC7 Cluster: Papain family cysteine protease ... 40 0.043 UniRef50_Q7RSR1 Cluster: Papain family cysteine protease, putati... 40 0.043 UniRef50_Q625X3 Cluster: Putative uncharacterized protein CBG011... 40 0.043 UniRef50_Q4UC83 Cluster: Cysteine proteinase, putative; n=2; The... 40 0.043 UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8.... 40 0.043 UniRef50_A5KBM1 Cluster: Serine-repeat antigen; n=1; Plasmodium ... 40 0.043 UniRef50_Q5YER1 Cluster: Cysteine proteinase; n=1; Bigelowiella ... 40 0.057 UniRef50_Q8I0V1 Cluster: Preprocathepsin c, putative; n=1; Plasm... 40 0.057 UniRef50_A7TZ14 Cluster: Cysteine proteinase; n=1; Lepeophtheiru... 40 0.057 UniRef50_A5KBM7 Cluster: Serine-repeat antigen 4; n=1; Plasmodiu... 40 0.057 UniRef50_A5KBM2 Cluster: Serine-repeat antigen; n=3; Plasmodium|... 40 0.057 UniRef50_Q2XWW8 Cluster: Cysteine protease Mir1; n=1; Zea diplop... 
39 0.076 UniRef50_Q8I8D0 Cluster: Cysteine protease 18; n=2; Entamoeba hi... 39 0.076 UniRef50_Q8I3C0 Cluster: Papain family cysteine protease, putati... 39 0.076 UniRef50_Q7QVE9 Cluster: GLP_542_3431_1206; n=1; Giardia lamblia... 39 0.076 UniRef50_Q5DEC9 Cluster: SJCHGC06356 protein; n=1; Schistosoma j... 39 0.076 UniRef50_A5KBN2 Cluster: Serine-repeat antigen 2; n=2; Plasmodiu... 39 0.076 UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3 ... 39 0.076 >UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain] - Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina) Length = 339 Score = 167 bits (407), Expect = 1e-40 Identities = 68/84 (80%), Positives = 79/84 (94%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG CGSCW+FS+TGALEGQHFR++G LVSLSEQNL+DCS +YGNNGCNGGLMDNAF+Y Sbjct: 138 KDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRY 197 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 IKDNGGIDTE++YPYEG+DD C + Sbjct: 198 IKDNGGIDTEKSYPYEGIDDSCHF 221 Score = 146 bits (354), Expect = 3e-34 Identities = 63/86 (73%), Positives = 70/86 (81%) Frame = +1 Query: 268 GAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHG 447 GA D GFVDIPEGDE+K+ +AVAT+GPVSVAIDASH SFQLYS GVYNE EC LDHG Sbjct: 227 GATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASHESFQLYSEGVYNEPECDEQNLDHG 286 Query: 448 VLVVGYGNDEQGVEYWLLKNCWAARW 525 VLVVGYG DE G++YWL+KN W W Sbjct: 287 VLVVGYGTDESGMDYWLVKNSWGTTW 312 >UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin L - Misgurnus mizolepis (Mud loach) Length = 337 Score = 146 bits (353), Expect = 4e-34 Identities = 62/88 (70%), Positives = 73/88 (82%), Gaps = 1/88 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG+CGSCW+FSTTGA+EGQ FR+ G LVSLSEQNL+DCS GN GCNGGLMD AF+Y Sbjct: 
132 KDQGECGSCWAFSTTGAMEGQMFRKQGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQY 191 Query: 182 IKDNGGIDTEQTYPYEGVDDK-CRYIPR 262 IKDN G+D+E+ YPY G DD+ C Y P+ Sbjct: 192 IKDNNGLDSEEAYPYLGTDDQPCHYDPK 219 Score = 130 bits (314), Expect = 2e-29 Identities = 60/95 (63%), Positives = 68/95 (71%), Gaps = 3/95 (3%) Frame = +1 Query: 256 PKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTE 435 PK A D GFVDIP G E LM+AVA+VGPVSVAIDA H SFQ Y SG+Y E ECSS E Sbjct: 218 PKYNAANDTGFVDIPSGKEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYFEKECSSEE 277 Query: 436 LDHGVLVVGY---GNDEQGVEYWLLKNCWAARWAN 531 LDHGVLVVGY G D G +YW++KN W+ W + Sbjct: 278 LDHGVLVVGYGFEGEDVDGKKYWIVKNSWSESWGD 312 >UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (Major excreted protein) (MEP) [Contains: Cathepsin L heavy chain; Cathepsin L light chain]; n=19; Eukaryota|Rep: Cathepsin L precursor (EC 3.4.22.15) (Major excreted protein) (MEP) [Contains: Cathepsin L heavy chain; Cathepsin L light chain] - Homo sapiens (Human) Length = 333 Score = 145 bits (351), Expect = 8e-34 Identities = 58/90 (64%), Positives = 76/90 (84%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG+CGSCW+FS TGALEGQ FR++G L+SLSEQNL+DCS GN GCNGGLMD AF+Y Sbjct: 130 KNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQY 189 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIPRTPV 271 ++DNGG+D+E++YPYE ++ C+Y P+ V Sbjct: 190 VQDNGGLDSEESYPYEATEESCKYNPKYSV 219 Score = 116 bits (278), Expect = 5e-25 Identities = 57/120 (47%), Positives = 72/120 (60%), Gaps = 3/120 (2%) Frame = +1 Query: 175 QVHQGQRGHRHRADLPLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVS 354 Q Q G P + + +PK + A D GFVDIP+ E+ LM+AVATVGP+S Sbjct: 188 QYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKALMKAVATVGPIS 246 Query: 355 VAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYG---NDEQGVEYWLLKNCWAARW 525 VAIDA H SF Y G+Y E +CSS ++DHGVLVVGYG + +YWL+KN W W Sbjct: 247 VAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEW 306 >UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia 
cydonium|Rep: Cathepsin - Geodia cydonium (Sponge) Length = 322 Score = 138 bits (335), Expect = 7e-32 Identities = 58/84 (69%), Positives = 68/84 (80%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG+CGSCW+FS TG+LEGQHF +G LVSLSEQNL+DCS GN GCNGGL D+AFKY Sbjct: 119 KNQGQCGSCWAFSATGSLEGQHFNATGKLVSLSEQNLVDCSSAEGNEGCNGGLPDDAFKY 178 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 + NGGIDTE +YPY D+KC Y Sbjct: 179 VIKNGGIDTEASYPYVARDEKCHY 202 Score = 102 bits (245), Expect = 5e-21 Identities = 47/88 (53%), Positives = 57/88 (64%) Frame = +1 Query: 262 NTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELD 441 N G+ +VDI E +L A ATVGP+ V IDASH FQLY GVY+ D CS T LD Sbjct: 206 NIGSTCSSYVDIESKSEAQLQVASATVGPIPVGIDASHLGFQLYDGGVYHSDLCSQTRLD 265 Query: 442 HGVLVVGYGNDEQGVEYWLLKNCWAARW 525 HGVLVVGYG ++ +YW++KN W W Sbjct: 266 HGVLVVGYGVYKE-KDYWMVKNSWGTNW 292 >UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata|Rep: Cathepsin L2 precursor - Homo sapiens (Human) Length = 334 Score = 138 bits (334), Expect = 9e-32 Identities = 57/90 (63%), Positives = 72/90 (80%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+Q +CGSCW+FS TGALEGQ FR++G LVSLSEQNL+DCS GN GCNGG M AF+Y Sbjct: 130 KNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQY 189 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIPRTPV 271 +K+NGG+D+E++YPY VD+ C+Y P V Sbjct: 190 VKENGGLDSEESYPYVAVDEICKYRPENSV 219 Score = 116 bits (279), Expect = 4e-25 Identities = 52/95 (54%), Positives = 66/95 (69%), Gaps = 3/95 (3%) Frame = +1 Query: 256 PKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTE 435 P+N+ A D GF + G E+ LM+AVATVGP+SVA+DA H+SFQ Y SG+Y E +CSS Sbjct: 215 PENSVANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKN 274 Query: 436 LDHGVLVVGY---GNDEQGVEYWLLKNCWAARWAN 531 LDHGVLVVGY G + +YWL+KN W W + Sbjct: 275 LDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGS 309 >UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar to human SRY (sex 
determining region Y)-box 30 (SOX30),transcript variant 1,; n=3; Mammalia|Rep: Testis cDNA clone: QtsA-12228, similar to human SRY (sex determining region Y)-box 30 (SOX30),transcript variant 1, - Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey) Length = 433 Score = 138 bits (333), Expect = 1e-31 Identities = 56/90 (62%), Positives = 73/90 (81%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+Q +CGSCW+FS TGALEGQ FR++G LVSLSEQNL+DCS GN GCNGG M++AF+Y Sbjct: 130 KNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQGCNGGFMNSAFRY 189 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIPRTPV 271 +K+NGG+D+E++YPY +D C+Y P V Sbjct: 190 VKENGGLDSEESYPYVAMDGICKYRPENSV 219 >UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome shotgun sequence; n=3; Tetraodontidae|Rep: Chromosome 12 SCAF14996, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 362 Score = 136 bits (330), Expect = 3e-31 Identities = 79/183 (43%), Positives = 97/183 (53%), Gaps = 8/183 (4%) Frame = +1 Query: 7 PREVWLMLVLQHDWSFGRTALPSVRLPGVALGAKPHRLLGAVREQRLQRGAHG----QRL 174 P VWL+L LQH G R G + L+ R + G +G Q Sbjct: 166 PGSVWLLLGLQHHRGPGGQHF---RQTGKLVSLSEQNLVDCSRPEG-NEGCNGGLMDQAF 221 Query: 175 QVHQGQRGHRHRADLPLRGS*RQ-VQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPV 351 Q + G A P + Q P N A + GFVD+P G E+ LM+AVA+VGPV Sbjct: 222 QYIKDNGGLDSEASYPYLATDDQPCHYDPSNNSANETGFVDVPSGSERALMKAVASVGPV 281 Query: 352 SVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGY---GNDEQGVEYWLLKNCWAAR 522 SVAIDA H SFQ Y SG+Y E ECSS ELDHGVLVVGY G D G ++W++KN W+ Sbjct: 282 SVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFQGEDVDGKKFWIVKNSWSEN 341 Query: 523 WAN 531 W N Sbjct: 342 WGN 344 >UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n=21; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Globodera pallida Length = 379 Score = 136 bits (330), Expect = 3e-31 Identities = 58/85 (68%), Positives = 70/85 (82%), Gaps = 1/85 (1%) Frame = +2 Query: 2 
KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGSCW+FS+TGALE QH RQ+G L+SLSEQNLIDCS++YGN GCNGG+MDNAF+Y Sbjct: 177 KNQGMCGSCWAFSSTGALEAQHARQTGQLISLSEQNLIDCSKKYGNMGCNGGIMDNAFQY 236 Query: 182 IKDNGGIDTEQTYPYEG-VDDKCRY 253 IKDN G+D E YPY+ KC + Sbjct: 237 IKDNNGVDKELDYPYKAKTGKKCLF 261 Score = 128 bits (309), Expect = 1e-28 Identities = 58/88 (65%), Positives = 64/88 (72%) Frame = +1 Query: 262 NTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELD 441 + GA D GF DI EGDE+KL AVAT GP SVAIDA H SFQLY+ GVY E ECS LD Sbjct: 265 DVGATDTGFFDIAEGDEEKLKIAVATQGPASVAIDAGHRSFQLYTHGVYFEKECSPENLD 324 Query: 442 HGVLVVGYGNDEQGVEYWLLKNCWAARW 525 HGVLVVGYG D Q +YW++KN W A W Sbjct: 325 HGVLVVGYGTDAQQGDYWIVKNSWGAHW 352 >UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precursor; n=3; Metazoa|Rep: Digestive cysteine proteinase 2 precursor - Homarus americanus (American lobster) Length = 323 Score = 135 bits (327), Expect = 6e-31 Identities = 56/84 (66%), Positives = 67/84 (79%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG+CGSCW+FSTTG+LEGQHF ++G L+SL+EQ L+DCS YG GCNGG M++AF Y Sbjct: 123 KDQGQCGSCWAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDY 182 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 IK N GIDTE YPYE D CR+ Sbjct: 183 IKANNGIDTEAAYPYEARDGSCRF 206 Score = 101 bits (243), Expect = 9e-21 Identities = 45/83 (54%), Positives = 57/83 (68%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVG 462 G +I G E L +AV +GP+SV IDA+H+SFQ YSSGVY E CS + LDH VL VG Sbjct: 217 GHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSCSPSYLDHAVLAVG 276 Query: 463 YGNDEQGVEYWLLKNCWAARWAN 531 YG+ E G ++WL+KN WA W + Sbjct: 277 YGS-EGGQDFWLVKNSWATSWGD 298 >UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Longidorus elongatus Length = 358 Score = 134 bits (323), Expect = 2e-30 Identities = 54/84 (64%), Positives = 68/84 (80%) Frame = +2 Query: 2 
KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG CGSCW+FS TG+LEGQH++Q+G LVSLSEQNL+DC + GCNGG MD AF+Y Sbjct: 155 KDQGSCGSCWAFSATGSLEGQHYKQTGKLVSLSEQNLVDCDVNGDDEGCNGGYMDGAFQY 214 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 ++ N GIDTE +YPY+G D +CR+ Sbjct: 215 VETNKGIDTEASYPYKGRDGRCRF 238 Score = 116 bits (280), Expect = 3e-25 Identities = 55/119 (46%), Positives = 74/119 (62%) Frame = +1 Query: 175 QVHQGQRGHRHRADLPLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVS 354 Q + +G A P +G + + ++ GA D GFVDIPEG+E L A+ATVGPVS Sbjct: 213 QYVETNKGIDTEASYPYKGRDGRCRFKSEDVGATDTGFVDIPEGNETLLEAAIATVGPVS 272 Query: 355 VAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARWAN 531 VAIDA+ FQ YS GVY + CS LDHGVL VGY + + G +Y+++KN W+ W + Sbjct: 273 VAIDAASFKFQFYSHGVYYDRSCSPEYLDHGVLAVGYNSTKDGKQYYIVKNSWSEDWGD 331 >UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsin L - Suberites domuncula (Sponge) Length = 324 Score = 131 bits (316), Expect = 1e-29 Identities = 53/84 (63%), Positives = 68/84 (80%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG+CGSCWSFS TG+LEGQH + G LVSLSEQNL+DCS ++GN+GC GG+MD+AF+Y Sbjct: 124 KNQGQCGSCWSFSATGSLEGQHALKMGRLVSLSEQNLMDCSSRFGNHGCKGGIMDDAFRY 183 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 + N G+DTE +YPY D CR+ Sbjct: 184 VISNHGVDTESSYPYTAKDGYCRF 207 Score = 111 bits (268), Expect = 9e-24 Identities = 50/88 (56%), Positives = 61/88 (69%) Frame = +1 Query: 262 NTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELD 441 N GA + + DI G E L +A A +GP+SVAIDASH SFQ Y +GVY E CSS+ LD Sbjct: 211 NVGATETSYRDIARGSESSLTQASAQIGPISVAIDASHRSFQFYKNGVYYEPSCSSSRLD 270 Query: 442 HGVLVVGYGNDEQGVEYWLLKNCWAARW 525 HGVLVVGYG E G +Y+++KN W RW Sbjct: 271 HGVLVVGYGT-EGGQDYFIVKNSWGTRW 297 >UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine proteinase precursor - Heterodera glycines (Soybean cyst nematode worm) Length = 353 Score = 130 bits (315), Expect = 
2e-29 Identities = 52/85 (61%), Positives = 71/85 (83%), Gaps = 1/85 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHF-RQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFK 178 KDQG CGSCW+FS TGA+EG +++ ++SLSEQNL+DCS +YGN GC+GGLMD+AF+ Sbjct: 151 KDQGDCGSCWAFSATGAIEGALAQKKASKIISLSEQNLVDCSSKYGNEGCDGGLMDSAFE 210 Query: 179 YIKDNGGIDTEQTYPYEGVDDKCRY 253 Y++DN G+DTE++YPYE V KC++ Sbjct: 211 YVRDNNGLDTEESYPYEAVTGKCQF 235 Score = 112 bits (270), Expect = 5e-24 Identities = 50/93 (53%), Positives = 64/93 (68%) Frame = +1 Query: 247 QVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECS 426 Q + G V F D+ +GDE++L AVAT+GP+SVA+DAS+ SFQ Y +GVY E CS Sbjct: 234 QFKNETVGGTVVSFKDLKKGDEEQLKIAVATIGPISVALDASNLSFQFYKTGVYYERWCS 293 Query: 427 STELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 + LDHGVL+VGYG DE +YWL+KN W W Sbjct: 294 NRYLDHGVLLVGYGTDETHGDYWLVKNSWGPHW 326 >UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|Rep: Cysteine proteinase - Acanthamoeba healyi Length = 330 Score = 130 bits (314), Expect = 2e-29 Identities = 58/85 (68%), Positives = 69/85 (81%), Gaps = 1/85 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG+CGSCWSFSTTG+ EG +F ++G LVSLSEQNLIDCS YGNNGCNGGLMD AF+Y Sbjct: 130 KNQGQCGSCWSFSTTGSTEGANFLKTGRLVSLSEQNLIDCSVSYGNNGCNGGLMDYAFEY 189 Query: 182 IKDNGGIDTEQTYPYEGVDD-KCRY 253 I +N GIDTE +YPY+ C+Y Sbjct: 190 IINNRGIDTEASYPYQTAGPLTCQY 214 Score = 109 bits (263), Expect = 4e-23 Identities = 52/93 (55%), Positives = 62/93 (66%) Frame = +1 Query: 247 QVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECS 426 Q + N G G+ D+ GDE L+ A A PVSVAIDASH SFQ YS GVY E CS Sbjct: 213 QYNAANKGGSLTGYTDVTSGDENALLNA-AVKEPVSVAIDASHNSFQFYSGGVYYESACS 271 Query: 427 STELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 ST+LDHGVLVVG+G+ E G ++W +KN W A W Sbjct: 272 STQLDHGVLVVGWGS-ENGQDFWWVKNSWGASW 303 >UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2; Brugia malayi|Rep: Cahepsin L-like cysteine protease - Brugia malayi (Filarial nematode worm) Length = 371 
Score = 130 bits (314), Expect = 2e-29 Identities = 56/85 (65%), Positives = 67/85 (78%), Gaps = 1/85 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQ-YGNNGCNGGLMDNAFK 178 KDQG CGSCW+FS GALEGQHF Q+G LV LS QNL+DCS+ YGN GC+GGLM AF+ Sbjct: 159 KDQGYCGSCWTFSAVGALEGQHFLQTGKLVELSMQNLLDCSDDTYGNYGCDGGLMMEAFE 218 Query: 179 YIKDNGGIDTEQTYPYEGVDDKCRY 253 Y+ N GIDTE++YPY+G + CRY Sbjct: 219 YVVKNDGIDTEKSYPYQGYQNTCRY 243 Score = 81.8 bits (193), Expect = 1e-14 Identities = 38/85 (44%), Positives = 55/85 (64%), Gaps = 8/85 (9%) Frame = +1 Query: 295 IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGND 474 +PEGDE +L A+AT+GP+SVA+DA F Y G+++ +C +T + H +L VGYG + Sbjct: 258 LPEGDELQLQAAIATIGPISVAVDAKLMKF--YRRGIFSTSKC-TTRMGHALLAVGYGTE 314 Query: 475 E--------QGVEYWLLKNCWAARW 525 E + V+YWLLKN W+ RW Sbjct: 315 EVKLQNGTKKSVDYWLLKNSWSKRW 339 >UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma japonicum|Rep: SJCHGC06231 protein - Schistosoma japonicum (Blood fluke) Length = 372 Score = 129 bits (311), Expect = 5e-29 Identities = 52/75 (69%), Positives = 67/75 (89%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG+CGSCW+FS+TGA+EGQH+R++ LV+LSEQ LIDCS+ YGNNGC GGLMD AF+Y Sbjct: 166 KNQGQCGSCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDLAFQY 225 Query: 182 IKDNGGIDTEQTYPY 226 ++DN GID+E +YPY Sbjct: 226 VRDNKGIDSEISYPY 240 Score = 107 bits (258), Expect = 1e-22 Identities = 50/92 (54%), Positives = 66/92 (71%), Gaps = 2/92 (2%) Frame = +1 Query: 262 NTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSST--E 435 N A+ G+++I EGDE+ LM AVAT+GPVSVAI+A SF +Y SG+Y++ EC+S + Sbjct: 257 NIMAQVTGYINIHEGDERALMNAVATIGPVSVAINAGLPSFSMYKSGIYSDPECASASED 316 Query: 436 LDHGVLVVGYGNDEQGVEYWLLKNCWAARWAN 531 LDHGVL+VGYG E G YWL+KN W W + Sbjct: 317 LDHGVLLVGYG-IEDGKPYWLIKNSWGEDWGD 347 >UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviatus|Rep: Cathepsin L 2 - Diaprepes abbreviatus (Sugarcane rootstalk borer weevil) Length 
= 348 Score = 129 bits (311), Expect = 5e-29 Identities = 55/86 (63%), Positives = 67/86 (77%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+Q CGSCWSFS TGALE Q F+++ L+SLSEQ L+DCS +YGN+GC+GG M AF Y Sbjct: 151 KNQRNCGSCWSFSATGALEAQWFKKTNKLISLSEQQLVDCSGRYGNHGCHGGWMHWAFGY 210 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIP 259 IK+NGGIDTEQ+YPY D +C Y P Sbjct: 211 IKENGGIDTEQSYPYTAKDGRCAYKP 236 Score = 77.0 bits (181), Expect = 3e-13 Identities = 38/92 (41%), Positives = 55/92 (59%) Frame = +1 Query: 256 PKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTE 435 P N A + +P G+ Q L V++VGP+S+A + SH FQ Y SGVY+E +C + Sbjct: 236 PGNKAATVSQVIMVPRGENQ-LAAKVSSVGPISIAAEVSH-KFQFYHSGVYDEPQCGHS- 292 Query: 436 LDHGVLVVGYGNDEQGVEYWLLKNCWAARWAN 531 L+H +L VGYG+ G +WL+KN W W + Sbjct: 293 LNHAMLAVGYGS-MGGKNFWLVKNSWGTGWGD 323 >UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep: Cathepsin - Petromyzon marinus (Sea lamprey) Length = 333 Score = 126 bits (304), Expect = 4e-28 Identities = 53/86 (61%), Positives = 65/86 (75%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGS W+FS TG+LEGQHF +G L SLSEQ L+DC++ Y NNGCNGG + A +Y Sbjct: 133 KEQGLCGSSWAFSATGSLEGQHFAATGNLTSLSEQQLVDCTKSYYNNGCNGGRSERALQY 192 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIP 259 I DN GID+E +YPYE D KCR+ P Sbjct: 193 IIDNNGIDSELSYPYEHADGKCRFKP 218 Score = 74.5 bits (175), Expect = 2e-12 Identities = 33/80 (41%), Positives = 55/80 (68%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGY 465 FV+ P +E+ L +AVA+VGP+++A++A +F+ Y SG++NE C + +H +LVVGY Sbjct: 230 FVE-PSSNEEVLRQAVASVGPIAIAMNADLDTFKHYKSGLFNEPSCDKSP-NHAMLVVGY 287 Query: 466 GNDEQGVEYWLLKNCWAARW 525 G+ G ++W++KN W W Sbjct: 288 GS-LSGNDFWIVKNSWGEDW 306 >UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegleria fowleri|Rep: Cysteine proteinase homolog - Naegleria fowleri Length = 347 Score = 125 bits (301), Expect = 9e-28 Identities = 60/107 
(56%), Positives = 78/107 (72%), Gaps = 8/107 (7%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC--------SEQYGNNGCNGG 157 K+QG CGSCW+FSTTG +EGQ + G LVSLSEQ L+DC ++Q ++GCNGG Sbjct: 138 KNQGACGSCWTFSTTGNVEGQWAIKKGKLVSLSEQQLVDCDHNCVTYQNQQACDSGCNGG 197 Query: 158 LMDNAFKYIKDNGGIDTEQTYPYEGVDDKCRYIPRTPVLRTWASWTS 298 LM +AF+Y+ NGG+DTE +YPYEGVDD CR+ ++ V T +SWTS Sbjct: 198 LMWSAFQYVIKNGGLDTEDSYPYEGVDDTCRF-NKSNVAATISSWTS 243 Score = 63.7 bits (148), Expect = 3e-09 Identities = 32/77 (41%), Positives = 48/77 (62%), Gaps = 4/77 (5%) Frame = +1 Query: 307 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQ-- 480 DE ++ +A GP+S+AI+A Q Y+SG+ + C+ +LDHGVL+VGYG + Sbjct: 247 DENQMAAWLAANGPISIAINAEW--LQYYTSGISDPWFCNPQDLDHGVLIVGYGVGKSWL 304 Query: 481 GVE--YWLLKNCWAARW 525 G E YW++KN W + W Sbjct: 305 GSEENYWIVKNSWGSDW 321 >UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 4 - Rhipicephalus appendiculatus (Brown ear tick) Length = 345 Score = 124 bits (299), Expect = 2e-27 Identities = 52/86 (60%), Positives = 73/86 (84%), Gaps = 2/86 (2%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCS-EQYGNNGCNGGLMDNAFK 178 K+QG+CGSCW+FS+TGALEGQ F+++ L+SLSEQNL+DC+ ++YGNNGCNGG M AF+ Sbjct: 142 KNQGQCGSCWAFSSTGALEGQVFKRTRRLISLSEQNLMDCAGQRYGNNGCNGGQMPGAFQ 201 Query: 179 YIKDNGGIDTEQTYPY-EGVDDKCRY 253 Y++D GG+DTE YPY +G + +C++ Sbjct: 202 YVQDAGGLDTEARYPYRQGTNFQCQF 227 Score = 90.6 bits (215), Expect = 2e-17 Identities = 38/81 (46%), Positives = 54/81 (66%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVG 462 G +P +E+ L +AVA VGP+S+AI+AS +F Y +G+Y E C L+H VL+VG Sbjct: 240 GHTRVPPRNERVLQDAVANVGPISIAINASPQTFMFYKNGIYGEPNCDPRGLNHAVLLVG 299 Query: 463 YGNDEQGVEYWLLKNCWAARW 525 YG +E+GV YW++KN W W Sbjct: 300 YG-EERGVPYWIVKNSWGPGW 319 >UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like midgut cysteine proteinase - 
Tenebrio molitor (Yellow mealworm) Length = 330 Score = 124 bits (298), Expect = 2e-27 Identities = 57/84 (67%), Positives = 63/84 (75%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG+CGSCWSFSTTGA+EGQ Q G L SLSEQNLIDCS YGN GC+GG MD+AF Y Sbjct: 132 KDQGQCGSCWSFSTTGAVEGQLALQRGRLTSLSEQNLIDCSSSYGNAGCDGGWMDSAFSY 191 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 I D GI +E YPYE D CR+ Sbjct: 192 IHDY-GIMSESAYPYEAQGDYCRF 214 Score = 96.7 bits (230), Expect = 4e-19 Identities = 41/81 (50%), Positives = 57/81 (70%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVG 462 G+ D+P GDE L +AV GPV+VAIDA+ Q YS G++ + C+ ++L+HGVLVVG Sbjct: 225 GYYDLPSGDENSLADAVGQAGPVAVAIDATD-ELQFYSGGLFYDQTCNQSDLNHGVLVVG 283 Query: 463 YGNDEQGVEYWLLKNCWAARW 525 YG+D G +YW+LKN W + W Sbjct: 284 YGSD-NGQDYWILKNSWGSGW 303 >UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 326 Score = 124 bits (298), Expect = 2e-27 Identities = 53/84 (63%), Positives = 67/84 (79%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG CGSCWSFSTTG +EG +F ++G LVSLSEQNL+DC+++ GC+GG MD A +Y Sbjct: 126 KDQGSCGSCWSFSTTGTVEGAYFLKTGKLVSLSEQNLVDCAKE-DCYGCSGGYMDKALEY 184 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 I+ GGI +E YPYEG+DDKCR+ Sbjct: 185 IETAGGIMSENDYPYEGIDDKCRF 208 Score = 86.6 bits (205), Expect = 4e-16 Identities = 45/106 (42%), Positives = 61/106 (57%), Gaps = 2/106 (1%) Frame = +1 Query: 214 DLPLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLY 393 D P G + + A+ F I + DE L AV GP+SVAIDAS +FQLY Sbjct: 196 DYPYEGIDDKCRFDSSKVAAKISNFTYIKKNDEDDLKNAVIAKGPISVAIDASF-NFQLY 254 Query: 394 SSGVYNEDECSS--TELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 SG+ ++ C S L+HGVLVVGYG +++ +YW++KN W A W Sbjct: 255 DSGILDDSSCYSDFNSLNHGVLVVGYGTEKE-QDYWIVKNSWGADW 299 >UniRef50_UPI00015559A2 Cluster: PREDICTED: 
similar to MGC81823 protein, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to MGC81823 protein, partial - Ornithorhynchus anatinus Length = 361 Score = 123 bits (296), Expect = 4e-27 Identities = 47/76 (61%), Positives = 64/76 (84%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG+CGSCW+F +TG LEGQ FR++G L ++SEQNL+DCS + GN GC+GGLM +F Y Sbjct: 206 KDQGRCGSCWAFGSTGVLEGQLFRRTGRLAAVSEQNLMDCSRKQGNRGCDGGLMQQSFLY 265 Query: 182 IKDNGGIDTEQTYPYE 229 ++DNGG+D+E+ YPY+ Sbjct: 266 VRDNGGVDSEEAYPYD 281 >UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; Dictyostelium discoideum|Rep: Cysteine proteinase 7 precursor - Dictyostelium discoideum (Slime mold) Length = 460 Score = 122 bits (294), Expect = 6e-27 Identities = 57/90 (63%), Positives = 67/90 (74%), Gaps = 3/90 (3%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGY--LVSLSEQNLIDCSEQYGNNGCNGGLMDNAF 175 K+QG+CG CWSFSTTGA EG + +G LVSLSEQNLIDCS YGNNGC GGLM AF Sbjct: 126 KNQGQCGGCWSFSTTGATEGAQYLANGKKNLVSLSEQNLIDCSGSYGNNGCEGGLMTLAF 185 Query: 176 KYIKDNGGIDTEQTYPYEGVD-DKCRYIPR 262 +YI +N GIDTE +YPY D KC++ P+ Sbjct: 186 EYIINNKGIDTESSYPYTAEDGKKCKFNPK 215 Score = 88.2 bits (209), Expect = 1e-16 Identities = 42/77 (54%), Positives = 54/77 (70%) Frame = +1 Query: 238 RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNED 417 ++ + +PKN A+ +V++ G E L V T GP SVAIDAS+ SFQLY SG+YNE Sbjct: 208 KKCKFNPKNVAAQLSSYVNVTSGSESDLAAKV-TQGPTSVAIDASNQSFQLYVSGIYNEP 266 Query: 418 ECSSTELDHGVLVVGYG 468 CSST+LDHGVL VG+G Sbjct: 267 ACSSTQLDHGVLAVGFG 283 >UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteinase A; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like tick cysteine proteinase A - Haemaphysalis longicornis (Bush tick) Length = 312 Score = 122 bits (293), Expect = 8e-27 Identities = 51/72 (70%), Positives = 63/72 (87%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG+CGSCW+FSTTG+LEGQHFR++ V+ EQNL+DCS+ +GN GCNGGLMDN F+Y 
Sbjct: 109 KNQGQCGSCWAFSTTGSLEGQHFRKTESRVT-GEQNLVDCSDDFGNQGCNGGLMDNGFQY 167 Query: 182 IKDNGGIDTEQT 217 IK NGGIDTE+T Sbjct: 168 IKANGGIDTEET 179 Score = 41.5 bits (93), Expect = 0.014 Identities = 17/27 (62%), Positives = 21/27 (77%) Frame = +1 Query: 433 ELDHGVLVVGYGNDEQGVEYWLLKNCW 513 +LDHGVL VGYG + G +YWL+KN W Sbjct: 252 QLDHGVLTVGYG-VKNGKKYWLVKNSW 277 >UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18; Magnoliophyta|Rep: Thiol protease aleurain precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 358 Score = 122 bits (293), Expect = 8e-27 Identities = 50/84 (59%), Positives = 63/84 (75%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG CGSCW+FSTTGALE + + G +SLSEQ L+DC+ + N GCNGGL AF+Y Sbjct: 157 KDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEY 216 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 IK NGG+DTE+ YPY G D+ C++ Sbjct: 217 IKSNGGLDTEKAYPYTGKDETCKF 240 Score = 91.5 bits (217), Expect = 1e-17 Identities = 52/138 (37%), Positives = 67/138 (48%), Gaps = 2/138 (1%) Frame = +1 Query: 124 GAVREQRLQRGAHGQRLQVHQGQRGHRHRADLPLRGS*RQVQVHPKNTGAEDVGFVDIPE 303 GA G Q + + G P G + +N G + + V+I Sbjct: 198 GAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKDETCKFSAENVGVQVLNSVNITL 257 Query: 304 GDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELD--HGVLVVGYGNDE 477 G E +L AV V PVS+A + H SF+LY SGVY + C ST +D H VL VGYG E Sbjct: 258 GAEDELKHAVGLVRPVSIAFEVIH-SFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYG-VE 315 Query: 478 QGVEYWLLKNCWAARWAN 531 GV YWL+KN W A W + Sbjct: 316 DGVPYWLIKNSWGADWGD 333 >UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; n=35; Fasciola|Rep: Cathepsin L-like proteinase precursor - Fasciola hepatica (Liver fluke) Length = 326 Score = 121 bits (291), Expect = 1e-26 Identities = 50/98 (51%), Positives = 70/98 (71%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG CGSCW+FSTTG +EGQ+ + +S SEQ L+DCS +GNNGC+GGLM+NA++Y Sbjct: 124 
KDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQY 183 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIPRTPVLRTWASWT 295 +K G++TE +YPY V+ +CRY + V + +T Sbjct: 184 LK-QFGLETESSYPYTAVEGQCRYNKQLGVAKVTGYYT 220 Score = 72.1 bits (169), Expect = 9e-12 Identities = 31/85 (36%), Positives = 46/85 (54%) Frame = +1 Query: 271 AEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGV 450 A+ G+ + G E +L V P +VA+D + F +Y SG+Y CS ++H V Sbjct: 213 AKVTGYYTVHSGSEVELKNLVGARRPAAVAVDVE-SDFMMYRSGIYQSQTCSPLRVNHAV 271 Query: 451 LVVGYGNDEQGVEYWLLKNCWAARW 525 L VGYG + G +YW++KN W W Sbjct: 272 LAVGYGT-QGGTDYWIVKNSWGTYW 295 >UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens (Human) Length = 331 Score = 120 bits (288), Expect = 3e-26 Identities = 52/85 (61%), Positives = 65/85 (76%), Gaps = 1/85 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCS-EQYGNNGCNGGLMDNAFK 178 K QG CG+CW+FS GALE Q ++G LVSLS QNL+DCS E+YGN GCNGG M AF+ Sbjct: 131 KYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQ 190 Query: 179 YIKDNGGIDTEQTYPYEGVDDKCRY 253 YI DN GID++ +YPY+ +D KC+Y Sbjct: 191 YIIDNKGIDSDASYPYKAMDQKCQY 215 Score = 93.9 bits (223), Expect = 3e-18 Identities = 49/101 (48%), Positives = 60/101 (59%) Frame = +1 Query: 211 ADLPLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQL 390 A P + ++ Q K A + ++P G E L EAVA GPVSV +DA H SF L Sbjct: 202 ASYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFL 261 Query: 391 YSSGVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLKNCW 513 Y SGVY E C+ ++HGVLVVGYG D G EYWL+KN W Sbjct: 262 YRSGVYYEPSCTQ-NVNHGVLVVGYG-DLNGKEYWLVKNSW 300 >UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheirus salmonis|Rep: Putative cathepsin L - Lepeophtheirus salmonis (salmon louse) Length = 257 Score = 119 bits (286), Expect = 6e-26 Identities = 51/84 (60%), Positives = 61/84 (72%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQ CGSCW+FSTTG++EGQ+F ++ L+S SEQ L+DCS 
+ N GCNGG MDNAFKY Sbjct: 54 KDQKDCGSCWAFSTTGSVEGQYFIKNKKLLSFSEQQLVDCSSDFRNEGCNGGWMDNAFKY 113 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 + N GI TE TYPY D C Y Sbjct: 114 LIANKGIATEDTYPYTATDGVCVY 137 Score = 51.6 bits (118), Expect = 1e-05 Identities = 27/59 (45%), Positives = 35/59 (59%), Gaps = 1/59 (1%) Frame = +1 Query: 244 VQVHPKNTGAEDVG-FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNED 417 V V+ K A + F D+ G E +L AVA +GP+SVAIDAS FQ Y GVY ++ Sbjct: 134 VCVYNKTMAAGRISSFKDVKHGSEDQLKLAVAQIGPISVAIDASSGDFQFYKKGVYVDE 192 >UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n=1; Rattus norvegicus|Rep: UPI0000501FDB UniRef100 entry - Rattus norvegicus Length = 338 Score = 118 bits (283), Expect = 1e-25 Identities = 47/86 (54%), Positives = 62/86 (72%) Frame = +2 Query: 8 QGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIK 187 QG+C SCW+F GA+EGQ F+++G L LS QNL+DCS+ GN GC GG NAF+Y+ Sbjct: 139 QGRCNSCWAFPVVGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVL 198 Query: 188 DNGGIDTEQTYPYEGVDDKCRYIPRT 265 NGG+++E TYPYEG + CRY P + Sbjct: 199 QNGGLESEATYPYEGKEGLCRYNPNS 224 Score = 77.4 bits (182), Expect = 2e-13 Identities = 42/113 (37%), Positives = 62/113 (54%), Gaps = 3/113 (2%) Frame = +1 Query: 196 GHRHRADLPLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASH 375 G A P G + +P N+ A+ P+ +E LM+AVAT PV+ I H Sbjct: 202 GLESEATYPYEGKEGLCRYNP-NSSAKITXICAPPQKNEDVLMDAVATK-PVAAGIHVVH 259 Query: 376 TSFQLYSSGVYNEDECSSTELDHGVLVVGYG---NDEQGVEYWLLKNCWAARW 525 +S + Y G+Y+E +C++ ++H VLVVGYG N+ G YWL++N W RW Sbjct: 260 SSLRFYKKGIYHEPKCNNY-VNHAVLVVGYGFEGNETDGNNYWLIQNSWGERW 311 >UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3; Bilateria|Rep: Cathepsin L-like cysteine protease - Neobenedenia melleni Length = 335 Score = 118 bits (283), Expect = 1e-25 Identities = 46/84 (54%), Positives = 65/84 (77%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+Q +CGSCW+FS+TG++EG R +G L+S SEQ L+DCS +GN+GCNGG+MDN+F Y Sbjct: 134 
KNQAQCGSCWAFSSTGSIEGAVKRATGKLISFSEQQLVDCSTAFGNHGCNGGIMDNSFNY 193 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 + N G+++E +YPYE +CRY Sbjct: 194 LIHNKGLESEASYPYEAQKKECRY 217 Score = 106 bits (255), Expect = 3e-22 Identities = 45/80 (56%), Positives = 57/80 (71%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGY 465 F D+ + DE+ L AV VGPVS+AIDAS SF LY SGVY+E++CS T L+HGVL VGY Sbjct: 229 FTDVSQFDEKDLKRAVGLVGPVSIAIDASQFSFHLYDSGVYDEEDCSQTMLNHGVLAVGY 288 Query: 466 GNDEQGVEYWLLKNCWAARW 525 G +G++YW +KN W W Sbjct: 289 GTTPEGLDYWKVKNSWTNTW 308 >UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=176; Viridiplantae|Rep: Cysteine proteinase RD21a precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 462 Score = 117 bits (281), Expect = 2e-25 Identities = 54/97 (55%), Positives = 69/97 (71%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG CGSCW+FST GA+EG + +G L++LSEQ L+DC Y N GCNGGLMD AF++ Sbjct: 153 KDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSY-NEGCNGGLMDYAFEF 211 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIPRTPVLRTWASW 292 I NGGIDT++ YPY+GVD C I + + T S+ Sbjct: 212 IIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSY 248 Score = 78.2 bits (184), Expect = 1e-13 Identities = 36/80 (45%), Positives = 52/80 (65%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGY 465 + D+P E+ L +AVA P+S+AI+A +FQLY SG++ D T+LDHGV+ VGY Sbjct: 248 YEDVPTYSEESLKKAVAHQ-PISIAIEAGGRAFQLYDSGIF--DGSCGTQLDHGVVAVGY 304 Query: 466 GNDEQGVEYWLLKNCWAARW 525 G E G +YW+++N W W Sbjct: 305 GT-ENGKDYWIVRNSWGKSW 323 >UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine protease; n=11; Callosobruchus maculatus|Rep: Putative gut cathepsin L-like cysteine protease - Callosobruchus maculatus (Southern cowpea weevil) (Pulse bruchid) Length = 326 Score = 116 bits (280), Expect = 3e-25 Identities = 50/84 (59%), Positives = 63/84 (75%), Gaps = 1/84 (1%) Frame = +2 Query: 2 
KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC-SEQYGNNGCNGGLMDNAFK 178 KDQ CGSCW+FS GA+EGQ F+++G LVSLS Q L+DC +E YGNNGC GGLM AF Sbjct: 128 KDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFD 187 Query: 179 YIKDNGGIDTEQTYPYEGVDDKCR 250 +++D GI TE++YPYEG C+ Sbjct: 188 FVQDE-GIQTEESYPYEGRRSSCK 210 Score = 79.0 bits (186), Expect = 8e-14 Identities = 40/76 (52%), Positives = 53/76 (69%), Gaps = 3/76 (3%) Frame = +1 Query: 307 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNED-ECSS--TELDHGVLVVGYGNDE 477 DEQ++ VA GPV+VAI+AS SF Y G+ +E CS+ +L+HGVLVVGYG+ E Sbjct: 227 DEQEMARTVAAKGPVAVAIEASQLSF--YDKGIVDERCRCSNKREDLNHGVLVVGYGS-E 283 Query: 478 QGVEYWLLKNCWAARW 525 GV+YW++KN W A W Sbjct: 284 NGVDYWIVKNSWGADW 299 >UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Toxopain-2 - Toxoplasma gondii Length = 422 Score = 116 bits (280), Expect = 3e-25 Identities = 50/83 (60%), Positives = 63/83 (75%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQ CGSCW+FSTTGALEG H ++G LVSLSEQ L+DCS GN C+GG M++AF+Y Sbjct: 221 KDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQY 280 Query: 182 IKDNGGIDTEQTYPYEGVDDKCR 250 + D+GGI +E YPY D++CR Sbjct: 281 VLDSGGICSEDAYPYLARDEECR 303 Score = 78.2 bits (184), Expect = 1e-13 Identities = 37/83 (44%), Positives = 50/83 (60%), Gaps = 1/83 (1%) Frame = +1 Query: 280 VGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVV 459 +GF D+P E + A+A PVS+AI+A FQ Y GV+ D T+LDHGVL+V Sbjct: 314 LGFKDVPRRSEAAMKAALAK-SPVSIAIEADQMPFQFYHEGVF--DASCGTDLDHGVLLV 370 Query: 460 GYGND-EQGVEYWLLKNCWAARW 525 GYG D E ++W++KN W W Sbjct: 371 GYGTDKESKKDFWIMKNSWGTGW 393 >UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin heavy chain; n=3; Amniota|Rep: PREDICTED: similar to ferritin heavy chain - Ornithorhynchus anatinus Length = 338 Score = 116 bits (279), Expect = 4e-25 Identities = 52/85 (61%), Positives = 62/85 (72%), Gaps = 1/85 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 
K+QG CGSCW+FS TGALE F+ +G +VSLSEQNL+DCS + GN GC GG AF+Y Sbjct: 136 KNQGLCGSCWAFSATGALEALVFKTTGKMVSLSEQNLVDCSWRQGNVGCRGGQYIGAFEY 195 Query: 182 IKDNGGIDTEQTYPYEGVDD-KCRY 253 ++ NGGID E YPY G DD CRY Sbjct: 196 VRANGGIDAEDLYPYLGRDDISCRY 220 Score = 78.2 bits (184), Expect = 1e-13 Identities = 36/83 (43%), Positives = 56/83 (67%), Gaps = 3/83 (3%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGY 465 ++ + + +EQ L +AVATVGPVSVA+DA F Y SG+++ C+ +++H +L VGY Sbjct: 232 YMVVDQDNEQALEQAVATVGPVSVAVDA--RPFFFYHSGIFSSHSCTQ-KVNHAMLAVGY 288 Query: 466 GNDEQ---GVEYWLLKNCWAARW 525 G ++ G +YW+LKN W+ RW Sbjct: 289 GTSKEPGGGQDYWILKNSWSERW 311 >UniRef50_Q24E33 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 328 Score = 116 bits (279), Expect = 4e-25 Identities = 50/83 (60%), Positives = 62/83 (74%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGSCW+FSTTGALEG +F ++ L+S SEQ L+DCS Y N GCNGGLM AF+Y Sbjct: 143 KNQGSCGSCWAFSTTGALEGSYFLKNNQLISFSEQQLVDCSRLYLNMGCNGGLMPRAFRY 202 Query: 182 IKDNGGIDTEQTYPYEGVDDKCR 250 +K + GI TE+ YPY D KC+ Sbjct: 203 VKAH-GITTEEEYPYTAKDGKCQ 224 Score = 62.5 bits (145), Expect = 7e-09 Identities = 35/80 (43%), Positives = 47/80 (58%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGY 465 F +P G+ KL A+A PVSV +DA T+F+ Y+SGV+ D C +L+HGVL GY Sbjct: 235 FSTVPRGNCDKLAAAIAQQ-PVSVGVDA--TNFKFYTSGVF--DNCKK-KLNHGVLATGY 288 Query: 466 GNDEQGVEYWLLKNCWAARW 525 D YW++KN W W Sbjct: 289 TAD-----YWIIKNSWGTAW 303 >UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]; n=11; Eutheria|Rep: Testin-2 precursor [Contains: Testin-1] - Mus musculus (Mouse) Length = 333 Score = 116 bits (278), Expect = 5e-25 Identities = 49/84 (58%), Positives = 63/84 (75%) Frame = +2 Query: 2 
KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG C S W+FS TG+LEGQ F+++G LV LSEQNL+DC + C+GG M NAF+Y Sbjct: 130 KNQGYCASSWAFSATGSLEGQMFKKTGRLVPLSEQNLLDCMGSNVTHDCSGGFMQNAFQY 189 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 +KDNGG+ TE++YPY G KCRY Sbjct: 190 VKDNGGLATEESYPYIGPGRKCRY 213 Score = 110 bits (264), Expect = 3e-23 Identities = 53/105 (50%), Positives = 66/105 (62%), Gaps = 3/105 (2%) Frame = +1 Query: 220 PLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSS 399 P G R+ + H +N+ A FV IP G E+ LM+AVA VGP+SVA+DASH SFQ Y S Sbjct: 203 PYIGPGRKCRYHAENSAANVRDFVQIP-GREEALMKAVAKVGPISVAVDASHDSFQFYDS 261 Query: 400 GVYNEDECSSTELDHGVLVVGY---GNDEQGVEYWLLKNCWAARW 525 G+Y E +C L+H VLVVGY G + G YWL+KN W W Sbjct: 262 GIYYEPQCKRVHLNHAVLVVGYGFEGEESDGNSYWLVKNSWGEEW 306 >UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor; n=17; Magnoliophyta|Rep: Thiol protease aleurain-like precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 358 Score = 116 bits (278), Expect = 5e-25 Identities = 48/84 (57%), Positives = 62/84 (73%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGSCW+FSTTGALE + + G +SLSEQ L+DC+ + N GC+GGL AF+Y Sbjct: 157 KEQGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEY 216 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 IK NGG+DTE+ YPY G D C++ Sbjct: 217 IKYNGGLDTEEAYPYTGKDGGCKF 240 Score = 77.0 bits (181), Expect = 3e-13 Identities = 39/93 (41%), Positives = 52/93 (55%), Gaps = 2/93 (2%) Frame = +1 Query: 259 KNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTEL 438 KN G + V+I G E +L AV V PVSVA + H F+ Y GV+ + C +T + Sbjct: 243 KNIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVVH-EFRFYKKGVFTSNTCGNTPM 301 Query: 439 D--HGVLVVGYGNDEQGVEYWLLKNCWAARWAN 531 D H VL VGYG ++ V YWL+KN W W + Sbjct: 302 DVNHAVLAVGYGVEDD-VPYWLIKNSWGGEWGD 333 >UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Cathepsin K - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 333 Score = 115 bits (277), Expect = 
7e-25 Identities = 48/84 (57%), Positives = 62/84 (73%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGSCW+FS+ GALEGQ + G LV LS QNL+DC + N+GC GG M NAF+Y Sbjct: 134 KNQGSCGSCWAFSSVGALEGQLMKTKGQLVDLSPQNLVDCVTE--NDGCGGGYMTNAFRY 191 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 + +N GID+E++YPY G D +C Y Sbjct: 192 VSNNQGIDSEESYPYVGTDQQCAY 215 Score = 99.1 bits (236), Expect = 7e-20 Identities = 50/151 (33%), Positives = 76/151 (50%), Gaps = 1/151 (0%) Frame = +1 Query: 76 VRLPGVALGAKPHRLLGAVREQRLQRGAH-GQRLQVHQGQRGHRHRADLPLRGS*RQVQV 252 ++ G + P L+ V E G + + +G P G+ +Q Sbjct: 156 MKTKGQLVDLSPQNLVDCVTENDGCGGGYMTNAFRYVSNNQGIDSEESYPYVGTDQQCAY 215 Query: 253 HPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSST 432 + A G+ +IP+G+E+ L AVA VGPVSV IDA ++F Y SGVY + C+ Sbjct: 216 NTSGVAASCRGYKEIPQGNERALTAAVANVGPVSVGIDAMQSTFLYYKSGVYYDPNCNKE 275 Query: 433 ELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 +++H VL VGYG +G +YW++KN W W Sbjct: 276 DVNHAVLAVGYGATPRGKKYWIVKNSWGEEW 306 >UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 664 Score = 115 bits (277), Expect = 7e-25 Identities = 48/86 (55%), Positives = 65/86 (75%), Gaps = 2/86 (2%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC--SEQYGNNGCNGGLMDNAF 175 K+QG CGSC++FST GALE ++R++ ++ LSEQNL+DC S +Y N GC+GG M N + Sbjct: 486 KNQGSCGSCYAFSTVGALESHYYRKNNRMLDLSEQNLVDCTASNKYRNGGCSGGWMHNCY 545 Query: 176 KYIKDNGGIDTEQTYPYEGVDDKCRY 253 YI++NGGI+ E TYPYEG +CRY Sbjct: 546 SYIQENGGINQESTYPYEGKFGQCRY 571 Score = 83.0 bits (196), Expect = 5e-15 Identities = 43/107 (40%), Positives = 58/107 (54%) Frame = +1 Query: 184 QGQRGHRHRADLPLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAI 363 Q G + P G Q + + + + FV I + DE+ L + VA+VGPVSVA Sbjct: 549 QENGGINQESTYPYEGKFGQCRYNSGDAQSRISKFVMIKQHDEEDLADTVASVGPVSVAY 608 Query: 364 DASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLK 504 DAS F YS G+Y D C+ 
H V+VVGY N E GV+YW++K Sbjct: 609 DASTREFMYYSRGIYYSDNCNKYRTTHAVVVVGYDN-ENGVDYWIIK 654 >UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platyhelminthes|Rep: Cathepsin L-like proteinase - Echinococcus multilocularis Length = 338 Score = 115 bits (276), Expect = 9e-25 Identities = 50/84 (59%), Positives = 63/84 (75%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG CGSCW+FS TGALEGQ R++G L+SLSEQ L+DCS GN GCNGG M++AF+Y Sbjct: 138 KDQGDCGSCWAFSATGALEGQLKRKTGKLISLSEQQLVDCSTYTGNEGCNGGDMNDAFRY 197 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 NG ++E YPY +D KC++ Sbjct: 198 WMRNGA-ESESDYPYTAMDGKCKF 220 Score = 92.7 bits (220), Expect = 6e-18 Identities = 40/80 (50%), Positives = 53/80 (66%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGY 465 FV +P+ E +L +VA VGPVSVAIDA+ + F LY G+Y ++ CS LDH VLVVGY Sbjct: 232 FVKVPKKREDQLKLSVAQVGPVSVAIDATSSGFMLYKKGIYQDNTCSQQYLDHAVLVVGY 291 Query: 466 GNDEQGVEYWLLKNCWAARW 525 D+ +YW++KN W W Sbjct: 292 DADKTRQKYWIVKNSWGEDW 311 >UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35; Viridiplantae|Rep: Cysteine proteinase 15A precursor - Pisum sativum (Garden pea) Length = 363 Score = 114 bits (275), Expect = 1e-24 Identities = 52/91 (57%), Positives = 67/91 (73%), Gaps = 7/91 (7%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCS-----EQYG--NNGCNGGL 160 KDQG CGSCW+FSTTGALEG H+ +G LVSLSEQ L+DC EQ G ++GCNGGL Sbjct: 148 KDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGL 207 Query: 161 MDNAFKYIKDNGGIDTEQTYPYEGVDDKCRY 253 M+NAF+Y+ ++GG+ E+ Y Y G D C++ Sbjct: 208 MNNAFEYLLESGGVVQEKDYAYTGRDGSCKF 238 Score = 55.2 bits (127), Expect = 1e-06 Identities = 29/79 (36%), Positives = 43/79 (54%), Gaps = 6/79 (7%) Frame = +1 Query: 307 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDE--- 477 DE ++ + GP++VAI+A+ Q Y SGV C+ + LDHGVL+VG+G Sbjct: 256 DEDQIAANLVKNGPLAVAINAAW--MQTYMSGVSCPYVCAKSRLDHGVLLVGFGKGAYAP 313 Query: 478 ---QGVEYWLLKNCWAARW 525 + 
YW++KN W W Sbjct: 314 IRLKEKPYWIIKNSWGQNW 332 >UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Rep: Cathepsin R precursor - Mus musculus (Mouse) Length = 334 Score = 114 bits (275), Expect = 1e-24 Identities = 48/85 (56%), Positives = 61/85 (71%) Frame = +2 Query: 8 QGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIK 187 QG C +CW+F+ TGA+E Q Q+G L LS QNL+DCS+ GNNGC GG NAF+Y+ Sbjct: 133 QGDCDACWAFAVTGAIEAQAIWQTGKLTPLSVQNLVDCSKPQGNNGCLGGDTYNAFQYVL 192 Query: 188 DNGGIDTEQTYPYEGVDDKCRYIPR 262 NGG+++E TYPYEG D CRY P+ Sbjct: 193 HNGGLESEATYPYEGKDGPCRYNPK 217 Score = 111 bits (268), Expect = 9e-24 Identities = 56/119 (47%), Positives = 71/119 (59%), Gaps = 3/119 (2%) Frame = +1 Query: 178 VHQGQRGHRHRADLPLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSV 357 +H G G A P G + +PKN+ AE GFV +P+ E LM AVAT+GP++ Sbjct: 192 LHNG--GLESEATYPYEGKDGPCRYNPKNSKAEITGFVSLPQS-EDILMAAVATIGPITA 248 Query: 358 AIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGY---GNDEQGVEYWLLKNCWAARW 525 IDASH SF+ Y G+Y+E CSS + HGVLVVGY G + G YWL+KN W RW Sbjct: 249 GIDASHESFKNYKGGIYHEPNCSSDTVTHGVLVVGYGFKGIETDGNHYWLIKNSWGKRW 307 >UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; Dictyostelium discoideum|Rep: Cysteine proteinase 2 precursor - Dictyostelium discoideum (Slime mold) Length = 376 Score = 114 bits (274), Expect = 2e-24 Identities = 51/75 (68%), Positives = 59/75 (78%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG+CGSCWSFSTTG+ EG H ++ LVSLSEQNL+DCS N GC+GGLM+NAF Y Sbjct: 139 KDQGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAFDY 198 Query: 182 IKDNGGIDTEQTYPY 226 I N GIDTE +YPY Sbjct: 199 IIKNKGIDTESSYPY 213 Score = 87.8 bits (208), Expect(2) = 7e-18 Identities = 47/75 (62%), Positives = 54/75 (72%), Gaps = 3/75 (4%) Frame = +1 Query: 268 GAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHG 447 GA G+V+I G E L E A GPVSVAIDASH SFQLY+SG+Y E +CS TELDHG Sbjct: 229 GATIKGYVNITAGSEISL-ENGAQHGPVSVAIDASHNSFQLYTSGIYYEPKCSPTELDHG 287 
Query: 448 VLVVGY---GNDEQG 483 VLVVGY G D++G Sbjct: 288 VLVVGYGVQGKDDEG 302 Score = 25.0 bits (52), Expect(2) = 7e-18 Identities = 6/12 (50%), Positives = 8/12 (66%) Frame = +1 Query: 490 YWLLKNCWAARW 525 YW++KN W W Sbjct: 338 YWIVKNSWGTSW 349 >UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2; Taeniidae|Rep: Cathepsin L-like cysteine proteinase - Taenia solium (Pork tapeworm) Length = 339 Score = 113 bits (271), Expect = 4e-24 Identities = 48/84 (57%), Positives = 63/84 (75%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGSCW+FS+TGALEG +++G L+SLSEQ L+DCS + GN+GCNGG M AFKY Sbjct: 140 KNQGNCGSCWAFSSTGALEGAFAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYMSYAFKY 199 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 ++++ I+ E YPY D CRY Sbjct: 200 LEEH-FIEPESAYPYRATDGPCRY 222 Score = 105 bits (251), Expect = 1e-21 Identities = 48/83 (57%), Positives = 57/83 (68%) Frame = +1 Query: 277 DVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLV 456 D+G DIPEG+E LMEAVATVGP+S+AIDAS F Y G+Y CSS L+HGVL Sbjct: 233 DIG--DIPEGNETALMEAVATVGPISIAIDASSLGFMFYRHGIYKSHWCSSKFLNHGVLA 290 Query: 457 VGYGNDEQGVEYWLLKNCWAARW 525 +GYG + G YWL+KN W RW Sbjct: 291 IGYGK-QDGKPYWLVKNSWGTRW 312 >UniRef50_Q239L8 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 323 Score = 112 bits (270), Expect = 5e-24 Identities = 52/83 (62%), Positives = 61/83 (73%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG+CGSCWSFSTTGA+EG F + L SLSEQ L+DCS+ GN GCNGGLMD AF + Sbjct: 139 KDQGQCGSCWSFSTTGAVEGALFLSTKKLTSLSEQYLVDCSKD-GNEGCNGGLMDTAFDF 197 Query: 182 IKDNGGIDTEQTYPYEGVDDKCR 250 I + GI TE YPY+ VD C+ Sbjct: 198 ISQH-GIPTEAAYPYKAVDGTCK 219 Score = 52.4 bits (120), Expect = 8e-06 Identities = 25/60 (41%), Positives = 38/60 (63%) Frame = +1 Query: 346 
PVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 P+++A+DA++ FQ Y ++++ C TELDHGVL+VGY +YW +KN W W Sbjct: 247 PIAIAVDANN--FQYYQKDIFSD--CG-TELDHGVLLVGY---SASGKYWKVKNSWGPNW 298 >UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Oryza sativa|Rep: Cysteine protease 1 precursor - Oryza sativa subsp. japonica (Rice) Length = 490 Score = 112 bits (270), Expect = 5e-24 Identities = 47/88 (53%), Positives = 64/88 (72%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG+CGSCW+FS A+EG + +G LVSLSEQ L++C+ N+GCNGG+MD+AF + Sbjct: 172 KNQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGQNSGCNGGIMDDAFAF 231 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIPRT 265 I NGG+DTE+ YPY +D KC R+ Sbjct: 232 IARNGGLDTEEDYPYTAMDGKCNLAKRS 259 Score = 84.6 bits (200), Expect = 2e-15 Identities = 44/82 (53%), Positives = 50/82 (60%), Gaps = 1/82 (1%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVG 462 GF D+PE DE L +AVA PVSVAIDA FQLY SGV+ C T LDHGV+ VG Sbjct: 267 GFEDVPENDELSLQKAVAHQ-PVSVAIDAGGREFQLYDSGVFT-GRC-GTNLDHGVVAVG 323 Query: 463 YGND-EQGVEYWLLKNCWAARW 525 YG D G YW ++N W W Sbjct: 324 YGTDAATGAAYWTVRNSWGPDW 345 >UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4; core eudicotyledons|Rep: Papain-like cysteine peptidase XBCP3 - Arabidopsis thaliana (Mouse-ear cress) Length = 437 Score = 112 bits (269), Expect = 7e-24 Identities = 50/83 (60%), Positives = 62/83 (74%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG CG+CWSFS TGA+EG + +G L+SLSEQ LIDC + Y N GCNGGLMD AF++ Sbjct: 134 KDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSY-NAGCNGGLMDYAFEF 192 Query: 182 IKDNGGIDTEQTYPYEGVDDKCR 250 + N GIDTE+ YPY+ D C+ Sbjct: 193 VIKNHGIDTEKDYPYQERDGTCK 215 Score = 83.0 bits (196), Expect = 5e-15 Identities = 41/80 (51%), Positives = 54/80 (67%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGY 465 + + DE+ LMEAVA PVSV I S +FQLYSSG+++ C ST LDH 
VL+VGY Sbjct: 229 YAGVKSNDEKALMEAVAAQ-PVSVGICGSERAFQLYSSGIFS-GPC-STSLDHAVLIVGY 285 Query: 466 GNDEQGVEYWLLKNCWAARW 525 G+ + GV+YW++KN W W Sbjct: 286 GS-QNGVDYWIVKNSWGKSW 304 >UniRef50_Q22RR8 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 112 bits (269), Expect = 7e-24 Identities = 51/83 (61%), Positives = 59/83 (71%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQ +CGSCW+FS TGALE F +G L SLSEQ L+DCS YGN GC+GG MD AFK+ Sbjct: 141 KDQEQCGSCWAFSATGALESATFISTGTLPSLSEQELVDCSTSYGNEGCDGGDMDAAFKF 200 Query: 182 IKDNGGIDTEQTYPYEGVDDKCR 250 I DN I TE+ Y Y G D KC+ Sbjct: 201 IHDN-NIATEKEYTYRGFDQKCK 222 Score = 51.6 bits (118), Expect = 1e-05 Identities = 32/80 (40%), Positives = 43/80 (53%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGY 465 FVD+ DE + A PVSVA+DA T++Q Y G +N+ C L+HGVL+VGY Sbjct: 235 FVDVQSCDE---LVAAIQQQPVSVAVDA--TNWQYYEFGTFND--CFDN-LNHGVLLVGY 286 Query: 466 GNDEQGVEYWLLKNCWAARW 525 + W +KN W W Sbjct: 287 NSK---THQWKVKNSWGTSW 303 >UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus|Rep: Cathepsin L - Aphrocallistes vastus Length = 329 Score = 111 bits (268), Expect = 9e-24 Identities = 50/84 (59%), Positives = 60/84 (71%) Frame = +1 Query: 274 EDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVL 453 +D F DIP + L EAVA GP++VA+DASHTSFQ+Y SG+Y CS T+LDHGVL Sbjct: 221 KDSSFTDIPSENCDALKEAVANKGPIAVAMDASHTSFQMYHSGIYTPFLCSKTKLDHGVL 280 Query: 454 VVGYGNDEQGVEYWLLKNCWAARW 525 VVGYG D GV+YWL+KN W W Sbjct: 281 VVGYGTD-NGVDYWLIKNSWGMAW 303 Score = 108 bits (260), Expect = 8e-23 Identities = 49/84 (58%), Positives = 60/84 (71%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG+CGSCWSFS TG+LEGQ+ +SG LVS SEQ L+DCS GN+GC GGLMD AFKY Sbjct: 131 
KNQGQCGSCWSFSATGSLEGQYAIKSGKLVSFSEQELVDCSTSLGNHGCQGGLMDYAFKY 190 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 + N + E Y Y + KC+Y Sbjct: 191 WETNLA-EKESDYTYTAKNGKCKY 213 >UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cathepsin L; n=4; Danio rerio|Rep: Novel protein similar to vertebrate cathepsin L - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 334 Score = 111 bits (266), Expect = 2e-23 Identities = 49/85 (57%), Positives = 62/85 (72%), Gaps = 1/85 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG CGSCWSFSTTGA+EGQ ++ +G LVSLSEQ L+DCS YG GC+G M NA+ Y Sbjct: 134 KDQGYCGSCWSFSTTGAIEGQMYKHTGRLVSLSEQQLVDCSRSYGTYGCSGAWMANAYDY 193 Query: 182 IKDNGGIDTEQTYPYEGVDDK-CRY 253 + +N +++ TYPY VD + C Y Sbjct: 194 VINN-ALESSDTYPYTSVDTQPCFY 217 Score = 106 bits (254), Expect = 4e-22 Identities = 49/86 (56%), Positives = 60/86 (69%) Frame = +1 Query: 268 GAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHG 447 G D FV P G+EQ L +AVATVGPVSVAIDA + SF YSSG+Y E C+ L+H Sbjct: 225 GISDYRFV--PAGNEQALADAVATVGPVSVAIDADNPSFLFYSSGIYKESNCNPNNLNHA 282 Query: 448 VLVVGYGNDEQGVEYWLLKNCWAARW 525 VLVVGYG+ E+G +YW++KN W W Sbjct: 283 VLVVGYGS-EEGTDYWIIKNSWGTGW 307 >UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophyta|Rep: Cysteine protease Cp5 - Actinidia deliciosa (Kiwi) Length = 509 Score = 110 bits (264), Expect = 3e-23 Identities = 48/82 (58%), Positives = 59/82 (71%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG CGSCW+FS+TGA+EG + +G L+SLSEQ L+DC N+GC GG MD AF++ Sbjct: 163 KDQGDCGSCWAFSSTGAIEGINALANGDLISLSEQELVDCDST--NDGCEGGYMDYAFEW 220 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 + NGGIDTE YPY G D C Sbjct: 221 VMSNGGIDTETDYPYTGEDGTC 242 Score = 74.1 bits (174), Expect = 2e-12 Identities = 44/107 (41%), Positives = 57/107 (53%), Gaps = 3/107 (2%) Frame = +1 Query: 214 DLPLRGS*RQVQVHPKNTGAEDV-GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQL 390 D P G + T A + G+ D+ E +E L AV P+SV ID FQL Sbjct: 
232 DYPYTGEDGTCNTTKEETKAVSIDGYEDVAE-EESALFCAVLKQ-PISVGIDGGAIDFQL 289 Query: 391 YSSGVYNEDECSS--TELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 Y+ G+Y+ D CS ++DH VLVVGYG E G EYW++KN W W Sbjct: 290 YTGGIYDGD-CSDDPDDIDHAVLVVGYG-AESGEEYWIIKNSWGTDW 334 >UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomuscorum|Rep: Cathepsin H - Oxytricha trifallax (Sterkiella histriomuscorum) Length = 366 Score = 110 bits (264), Expect = 3e-23 Identities = 44/82 (53%), Positives = 60/82 (73%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QGKCGSCW+FST G +E + + G +LSEQ L+DC+ Y N+GC+GGL +AF+Y Sbjct: 151 KNQGKCGSCWTFSTVGCVESHYLLKYGAFRNLSEQQLVDCAGDYDNHGCSGGLPSHAFEY 210 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 IKDNGG+ E TYPY+ + +C Sbjct: 211 IKDNGGLALETTYPYKAANGQC 232 Score = 68.9 bits (161), Expect = 8e-11 Identities = 32/77 (41%), Positives = 47/77 (61%), Gaps = 2/77 (2%) Frame = +1 Query: 307 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSS--TELDHGVLVVGYGNDEQ 480 +E L +A+ GPVSVA F+ Y SGVY + C++ +++H VL VG+G DE Sbjct: 253 NEDDLKQAIYLHGPVSVAFRVID-GFRDYKSGVYAVEGCANGPNDVNHAVLAVGFGTDEN 311 Query: 481 GVEYWLLKNCWAARWAN 531 V+YW++KN W A W + Sbjct: 312 KVDYWIIKNSWGAAWGD 328 >UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicatein a3 - Lubomirskia baicalensis Length = 344 Score = 109 bits (263), Expect = 4e-23 Identities = 47/84 (55%), Positives = 62/84 (73%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 + QG+CGS ++F+ GALEG + LV+LSEQN+IDCS YGN+GC+GG + AFKY Sbjct: 144 QSQGQCGSSYAFAAAGALEGATALAADKLVALSEQNIIDCSVPYGNHGCSGGDVYTAFKY 203 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 + DNGGIDTE +YPY+G C+Y Sbjct: 204 VVDNGGIDTESSYPYKGKKSSCQY 227 Score = 103 bits (248), Expect = 2e-21 Identities = 46/102 (45%), Positives = 64/102 (62%) Frame = +1 Query: 220 PLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSS 399 P +G Q + KN GA G V I G E L+ AVA+VGP++VA+DAS +F Y S Sbjct: 217 
PYKGKKSSCQYNSKNVGAISTGVVKIASGSETDLLSAVASVGPIAVAVDASVNAFMFYQS 276 Query: 400 GVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 GV++ CS+++L+H +LV GYG+ G +YWL+KN W W Sbjct: 277 GVFDSSTCSTSKLNHAMLVTGYGS-TNGKDYWLVKNSWGTGW 317 >UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: Cysteine protease - Saprolegnia parasitica Length = 523 Score = 109 bits (262), Expect = 5e-23 Identities = 50/95 (52%), Positives = 64/95 (67%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGSCW+FSTTGA+EG F S LVS+SEQ L+DC + G+ GCNGGLMDNAFK+ Sbjct: 132 KNQGMCGSCWAFSTTGAIEGAAFVSSKQLVSVSEQELVDC-DHNGDMGCNGGLMDNAFKW 190 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIPRTPVLRTWA 286 +K + G+ E+ YPY + C PV + A Sbjct: 191 VKTHKGLCKEEDYPYHAKEGTCALKKCKPVTKVTA 225 Score = 87.8 bits (208), Expect = 2e-16 Identities = 45/82 (54%), Positives = 54/82 (65%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGY 465 F D+P DEQ L AVA PVSVAI+A FQ Y SGV+ D+ T+LDHGVLVVGY Sbjct: 226 FHDVPANDEQALKAAVAKQ-PVSVAIEADQPEFQFYKSGVF--DKSCGTKLDHGVLVVGY 282 Query: 466 GNDEQGVEYWLLKNCWAARWAN 531 G +E G +YW +KN W A W + Sbjct: 283 G-EEGGKKYWKVKNSWGADWGD 303 >UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropicalis|Rep: LOC594890 protein - Xenopus tropicalis (Western clawed frog) (Silurana tropicalis) Length = 355 Score = 109 bits (261), Expect = 6e-23 Identities = 50/87 (57%), Positives = 64/87 (73%), Gaps = 1/87 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHF-RQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFK 178 KDQG C + W+FS+ GALE Q+ R++G L SLS QNL+DCS+ YGNNGC GG + ++F+ Sbjct: 155 KDQGSCIASWAFSSIGALECQNMKRRTGKLESLSVQNLLDCSQTYGNNGCKGGWVVSSFR 214 Query: 179 YIKDNGGIDTEQTYPYEGVDDKCRYIP 259 YI DN GI+ E YPY+G D KC Y P Sbjct: 215 YIIDN-GIELESNYPYQGKDGKCSYTP 240 Score = 97.1 bits (231), Expect = 3e-19 Identities = 46/107 (42%), Positives = 64/107 (59%) Frame = +1 Query: 211 ADLPLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQL 390 ++ P +G + P + + +P GDE L + V 
+GPVSVAIDAS +F++ Sbjct: 225 SNYPYQGKDGKCSYTPVKKASVCTSYRQLPYGDEATLKQVVGLMGPVSVAIDASRKTFRM 284 Query: 391 YSSGVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARWAN 531 Y +GVY + CSS+ DH VLVVGYG E GVEYWL+KN W + + Sbjct: 285 YKNGVYYDPNCSSSTPDHSVLVVGYG-AEDGVEYWLVKNSWGTSFGD 330 >UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcoptes scabiei type hominis|Rep: Cathepsin L-like protease - Sarcoptes scabiei type hominis Length = 245 Score = 108 bits (260), Expect = 8e-23 Identities = 45/83 (54%), Positives = 61/83 (73%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQ +CGSCW+FS ++E Q+ ++G LV LSEQ L+DCS GN GC+GG MD+AF++ Sbjct: 136 KDQKQCGSCWAFSAVASMESQNALKTGQLVELSEQELVDCSVGEGNEGCDGGWMDSAFEF 195 Query: 182 IKDNGGIDTEQTYPYEGVDDKCR 250 + GIDTE++YPY GV+ CR Sbjct: 196 VIKADGIDTEKSYPYHGVNQVCR 218 >UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [Contains: Cathepsin H mini chain; Cathepsin H heavy chain; Cathepsin H light chain]; n=37; Eukaryota|Rep: Cathepsin H precursor (EC 3.4.22.16) [Contains: Cathepsin H mini chain; Cathepsin H heavy chain; Cathepsin H light chain] - Homo sapiens (Human) Length = 335 Score = 108 bits (260), Expect = 8e-23 Identities = 46/86 (53%), Positives = 61/86 (70%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGSCW+FSTTGALE +G ++SL+EQ L+DC++ + N+GC GGL AF+Y Sbjct: 133 KNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEY 192 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIP 259 I N GI E TYPY+G D C++ P Sbjct: 193 ILYNKGIMGEDTYPYQGKDGYCKFQP 218 Score = 70.9 bits (166), Expect = 2e-11 Identities = 30/75 (40%), Positives = 48/75 (64%), Gaps = 2/75 (2%) Frame = +1 Query: 307 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSST--ELDHGVLVVGYGNDEQ 480 DE+ ++EAVA PVS A + + F +Y +G+Y+ C T +++H VL VGYG ++ Sbjct: 235 DEEAMVEAVALYNPVSFAFEVTQ-DFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYG-EKN 292 Query: 481 GVEYWLLKNCWAARW 525 G+ YW++KN W +W Sbjct: 293 GIPYWIVKNSWGPQW 307 >UniRef50_Q9NH99 Cluster: 
Cathepsin L; n=1; Stylonychia lemnae|Rep: Cathepsin L - Stylonychia lemnae Length = 340 Score = 108 bits (259), Expect = 1e-22 Identities = 47/84 (55%), Positives = 60/84 (71%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG+CGSCW+FST +LE ++F ++G L SLSEQ L+DCS+ GN GCNGG M A Y Sbjct: 141 KDQGQCGSCWAFSTIASLESRYFIETGKLQSLSEQQLVDCSKN-GNEGCNGGDMGLAMDY 199 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 I GG++TE+ YPY G D C + Sbjct: 200 IASAGGVETEKDYPYVGKDQTCAF 223 Score = 75.4 bits (177), Expect = 9e-13 Identities = 41/104 (39%), Positives = 55/104 (52%) Frame = +1 Query: 214 DLPLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLY 393 D P G + A D G ++I G L A+A GPVSVAI+A FQ Y Sbjct: 211 DYPYVGKDQTCAFEASKEVATDKGHINIVPGKFATLQAAIAE-GPVSVAIEADSLFFQFY 269 Query: 394 SSGVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 SG+++ C T LDHGV VGYG D G +Y++++N W+ W Sbjct: 270 RSGIFDSSWC-GTNLDHGVAAVGYGVD-NGKQYYIVRNSWSDSW 311 >UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina). Cathepsin L; n=2; Dictyostelium discoideum|Rep: Similar to Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina). 
Cathepsin L - Dictyostelium discoideum (Slime mold) Length = 265 Score = 108 bits (259), Expect = 1e-22 Identities = 44/84 (52%), Positives = 59/84 (70%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG C SCWSFS GALEG ++ + G L+ LSEQNL+DC+ +G GC G M +AFKY Sbjct: 63 KNQGSCASCWSFSALGALEGHYYIKYGELLDLSEQNLVDCATPFGPKGCKTGWMHDAFKY 122 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 I +GG++ E YPY G D+ C++ Sbjct: 123 IISSGGVNLESQYPYTGKDEVCKF 146 Score = 95.1 bits (226), Expect = 1e-18 Identities = 45/102 (44%), Positives = 56/102 (54%) Frame = +1 Query: 220 PLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSS 399 P G + + A+ GFV IP+ DE LMEA+A GPV+V ID S FQ S Sbjct: 136 PYTGKDEVCKFNQSEKEAKVSGFVMIPKFDESALMEAIALYGPVAVPIDTSTKEFQHLSG 195 Query: 400 GVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 G+Y D C H VL +GYG DE GV+Y+L+KN W W Sbjct: 196 GIYYSDSCDPWNTIHAVLAIGYGTDENGVDYFLMKNSWGKSW 237 >UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; n=6; Chrysomelinae|Rep: Cathepsin L-like proteinase precursor - Phaedon cochleariae (Mustard beetle) Length = 324 Score = 108 bits (259), Expect = 1e-22 Identities = 44/83 (53%), Positives = 60/83 (72%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 ++QG+CGSCW+ ST A+E Q +SG V LS Q L+DCS YGN+GCNGG N F+Y Sbjct: 126 RNQGECGSCWALSTAAAIESQSAIKSGSKVPLSPQQLVDCSTSYGNHGCNGGFAVNGFEY 185 Query: 182 IKDNGGIDTEQTYPYEGVDDKCR 250 +KDN G++++ YPY G +DKC+ Sbjct: 186 VKDN-GLESDADYPYSGKEDKCK 207 Score = 70.9 bits (166), Expect = 2e-11 Identities = 35/105 (33%), Positives = 52/105 (49%) Frame = +1 Query: 211 ADLPLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQL 390 AD P G + + + K+ ++ E L EAV T+GP+S + + Sbjct: 195 ADYPYSGKEDKCKANDKSRSVVELTGYKKVTASETSLKEAVGTIGPISAVVFGK--PMKS 252 Query: 391 YSSGVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 Y G++++ C L HGV VVGYG E G +YW++KN W A W Sbjct: 253 YGGGIFDDSSCLGDNLHHGVNVVGYG-IENGQKYWIIKNTWGADW 296 >UniRef50_P04988 Cluster: Cysteine 
proteinase 1 precursor; n=3; Dictyostelium discoideum|Rep: Cysteine proteinase 1 precursor - Dictyostelium discoideum (Slime mold) Length = 343 Score = 107 bits (257), Expect = 2e-22 Identities = 50/83 (60%), Positives = 58/83 (69%), Gaps = 8/83 (9%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC--------SEQYGNNGCNGG 157 K+QG+CGSCWSFSTTG +EGQHF LVSLSEQNL+DC E+ + GCNGG Sbjct: 134 KNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGG 193 Query: 158 LMDNAFKYIKDNGGIDTEQTYPY 226 L NA+ YI NGGI TE +YPY Sbjct: 194 LQPNAYNYIIKNGGIQTESSYPY 216 Score = 62.9 bits (146), Expect = 5e-09 Identities = 33/99 (33%), Positives = 53/99 (53%), Gaps = 4/99 (4%) Frame = +1 Query: 241 QVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDE 420 Q + N GA+ F IP+ +E + + + GP+++A DA +Q Y GV+ + Sbjct: 223 QCNFNSANIGAKISNFTMIPK-NETVMAGYIVSTGPLAIAADA--VEWQFYIGGVF-DIP 278 Query: 421 CSSTELDHGVLVVGYGND----EQGVEYWLLKNCWAARW 525 C+ LDHG+L+VGY + + YW++KN W A W Sbjct: 279 CNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADW 317 >UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thaliana|Rep: F9P14.12 protein - Arabidopsis thaliana (Mouse-ear cress) Length = 343 Score = 106 bits (255), Expect = 3e-22 Identities = 45/82 (54%), Positives = 59/82 (71%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 ++QGKCG CW+FS A+EG + ++G LVSLSEQ LIDC N GC+GGLM+ AF++ Sbjct: 143 RNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEF 202 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 IK NGG+ TE YPY G++ C Sbjct: 203 IKTNGGLATETDYPYTGIEGTC 224 Score = 64.9 bits (151), Expect = 1e-09 Identities = 34/68 (50%), Positives = 42/68 (61%) Frame = +1 Query: 322 MEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLL 501 ++ A PVSV IDA FQLYSSGV+ + C T L+HGV VVGYG E +YW++ Sbjct: 249 LQIAAAQQPVSVGIDAGGFIFQLYSSGVFT-NYC-GTNLNHGVTVVGYG-VEGDQKYWIV 305 Query: 502 KNCWAARW 525 KN W W Sbjct: 306 KNSWGTGW 313 >UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F like 
protease; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to cathepsin F like protease - Nasonia vitripennis Length = 1036 Score = 106 bits (254), Expect = 4e-22 Identities = 44/84 (52%), Positives = 62/84 (73%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG CGSCW+FS TG +EGQ+ + G L+SLSEQ L+DC + ++GCNGGL D A++ Sbjct: 833 KDQGSCGSCWAFSVTGNIEGQYAIKHGELLSLSEQELVDCDKL--DSGCNGGLPDTAYRA 890 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 I++ GG++ E YPY+ D+KC + Sbjct: 891 IEELGGLELESDYPYDAEDEKCHF 914 Score = 56.4 bits (130), Expect = 5e-07 Identities = 29/80 (36%), Positives = 47/80 (58%), Gaps = 7/80 (8%) Frame = +1 Query: 307 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDE--CSSTELDHGVLVVGYGND-- 474 +E ++ + + GP+S+ I+A+ + Q Y GV + + CS LDHGVL+VGYG Sbjct: 932 NETQMAQWLVKNGPMSIGINAN--AMQFYMGGVSHPFKFLCSPDSLDHGVLIVGYGVKFY 989 Query: 475 ---EQGVEYWLLKNCWAARW 525 ++ + YW++KN W RW Sbjct: 990 PIFKKTMPYWIIKNSWGPRW 1009 >UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain; n=9; Cucujiformia|Rep: Digestive cysteine proteinase intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 326 Score = 106 bits (254), Expect = 4e-22 Identities = 48/93 (51%), Positives = 65/93 (69%), Gaps = 1/93 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGC-NGGLMDNAFK 178 K QG CGSCW+FS TGALEGQ+ + + LSEQ L+DCS+ YGN+ C +GGLM AF Sbjct: 126 KYQGGCGSCWAFSATGALEGQNAIVNNVKIPLSEQQLLDCSKPYGNDDCEHGGLMSFAFD 185 Query: 179 YIKDNGGIDTEQTYPYEGVDDKCRYIPRTPVLR 277 Y+ D GI+ + +YPY+G+D C+Y + VL+ Sbjct: 186 YVLDK-GIEADSSYPYKGIDTPCQYDAKKTVLK 217 Score = 67.7 bits (158), Expect = 2e-10 Identities = 41/114 (35%), Positives = 60/114 (52%), Gaps = 3/114 (2%) Frame = +1 Query: 193 RGHRHRADLPLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDAS 372 +G + P +G Q K T + G+ ++ +E+ L +AV TVGPVSVAIDA Sbjct: 190 KGIEADSSYPYKGIDTPCQYDAKKTVLKIKGYKNVSNSEEE-LKKAVGTVGPVSVAIDAD 248 Query: 373 HTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQ---GVEYWLLKNCWAARW 525 QLY G+ + C+ L+HGVL 
VGYG ++ ++W +KN W W Sbjct: 249 --PIQLYFGGILDGLFCTH-NLNHGVLAVGYGEEDHLFGKKKFWKVKNSWGKDW 299 >UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa (Rice) Length = 339 Score = 105 bits (252), Expect = 8e-22 Identities = 45/82 (54%), Positives = 55/82 (67%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG+CG CW+FS A+EG +G L+SLSEQ L+DC + GC GGLMD+AFK+ Sbjct: 139 KDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKF 198 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 I NGG+ TE YPY D KC Sbjct: 199 IIKNGGLTTESKYPYTAADGKC 220 Score = 90.2 bits (214), Expect = 3e-17 Identities = 42/88 (47%), Positives = 54/88 (61%) Frame = +1 Query: 262 NTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELD 441 N+ A G+ D+P +E LM+AVA PVSVA+D +FQ YS GV C T+LD Sbjct: 225 NSAATIKGYEDVPANNEAALMKAVANQ-PVSVAVDGGDMTFQFYSGGVMT-GSCG-TDLD 281 Query: 442 HGVLVVGYGNDEQGVEYWLLKNCWAARW 525 HG++ +GYG D G +YWLLKN W W Sbjct: 282 HGIVAIGYGKDGDGTQYWLLKNSWGTTW 309 >UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus tauri|Rep: Cysteine protease-1 - Ostreococcus tauri Length = 430 Score = 105 bits (252), Expect = 8e-22 Identities = 48/75 (64%), Positives = 60/75 (80%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG+CGSCW+FSTTGA+EG ++G LVSLSEQ ++ CS+Q N GCNGGLMD AF++ Sbjct: 217 KNQGQCGSCWAFSTTGAVEGITKIRTGRLVSLSEQEMVSCSKQ--NMGCNGGLMDYAFRW 274 Query: 182 IKDNGGIDTEQTYPY 226 I NGGID+E YPY Sbjct: 275 IVKNGGIDSEFQYPY 289 Score = 82.6 bits (195), Expect = 6e-15 Identities = 43/91 (47%), Positives = 57/91 (62%), Gaps = 10/91 (10%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVG 462 GF D+P GDE++L +AV+ PVS+AI+A SFQLY GVY+ EC S ++DHGVLVVG Sbjct: 310 GFKDVPPGDEKELEKAVSQQ-PVSIAIEADTKSFQLYDGGVYDSKECGS-QVDHGVLVVG 367 Query: 463 YGNDE----------QGVEYWLLKNCWAARW 525 YG D+ + +W +KN W W Sbjct: 368 YGFDDTHHNATKHHKRHRHFWKVKNSWGGTW 398 >UniRef50_O22499 
Cluster: Cysteine proteinase Mir2; n=3; Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays (Maize) Length = 493 Score = 105 bits (251), Expect = 1e-21 Identities = 46/82 (56%), Positives = 60/82 (73%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG+CG CW+FS A+EG + +G L+SLSEQ LIDC +++ + GC+GGLMDNAF + Sbjct: 180 KDQGQCGGCWAFSAVAAVEGINKIVTGSLISLSEQELIDC-DKFQDQGCDGGLMDNAFVF 238 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 + NGGIDTE YP+ G D C Sbjct: 239 MIKNGGIDTEADYPFTGHDGTC 260 Score = 83.0 bits (196), Expect = 5e-15 Identities = 47/106 (44%), Positives = 63/106 (59%), Gaps = 1/106 (0%) Frame = +1 Query: 211 ADLPLRGS*RQVQVHPKNTGAEDV-GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQ 387 AD P G + KNT + F +P E+ L +AVA PVS +I+AS +FQ Sbjct: 249 ADYPFTGHDGTCDLKLKNTRVVSIDSFERVPINYERALQKAVAHQ-PVSASIEASRRAFQ 307 Query: 388 LYSSGVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 LYSSG++ + C T LDHGV VVGYG+ E G +YW++KN W +W Sbjct: 308 LYSSGIF-DGRC-GTYLDHGVTVVGYGS-EGGKDYWIVKNSWGTQW 350 >UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; Eukaryota|Rep: Cysteine proteinase 1 precursor - Zea mays (Maize) Length = 371 Score = 104 bits (250), Expect = 1e-21 Identities = 45/91 (49%), Positives = 61/91 (67%), Gaps = 7/91 (7%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYG-------NNGCNGGL 160 K+QG CGSCWSFS +GALEG H+ +G L LSEQ +DC + ++GCNGGL Sbjct: 153 KNQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGL 212 Query: 161 MDNAFKYIKDNGGIDTEQTYPYEGVDDKCRY 253 M AF Y++ GG+++E+ YPY G D KC++ Sbjct: 213 MTTAFSYLQKAGGLESEKDYPYTGSDGKCKF 243 Score = 47.6 bits (108), Expect = 2e-04 Identities = 34/116 (29%), Positives = 50/116 (43%), Gaps = 6/116 (5%) Frame = +1 Query: 196 GHRHRADLPLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASH 375 G D P GS + + A F + DE ++ + GP+++ I+A++ Sbjct: 225 GLESEKDYPYTGSDGKCKFDKSKIVASVQNF-SVVSVDEAQISANLIKHGPLAIGINAAY 283 Query: 376 TSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDE------QGVEYWLLKNCWAARW 525 Q Y GV C LDHGVL+VGYG + 
YW++KN W W Sbjct: 284 --MQTYIGGVSCPYICGR-HLDHGVLLVGYGASGFAPIRLKDKPYWIIKNSWGENW 336 >UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase) (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1; Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase) (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1; Vignain-2] - Vigna mungo (Rice bean) (Black gram) Length = 362 Score = 104 bits (250), Expect = 1e-21 Identities = 46/82 (56%), Positives = 60/82 (73%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG+CGSCW+FST A+EG + ++ LVSLSEQ L+DC ++ N GCNGGLM++AF++ Sbjct: 144 KDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKE-ENQGCNGGLMESAFEF 202 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 IK GGI TE YPY + C Sbjct: 203 IKQKGGITTESNYPYTAQEGTC 224 Score = 83.8 bits (198), Expect = 3e-15 Identities = 39/81 (48%), Positives = 52/81 (64%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVG 462 G ++P DE L++AVA PVSVAIDA + FQ YS GV+ D C+ T+L+HGV +VG Sbjct: 238 GHENVPVNDENALLKAVANQ-PVSVAIDAGGSDFQFYSEGVFTGD-CN-TDLNHGVAIVG 294 Query: 463 YGNDEQGVEYWLLKNCWAARW 525 YG G YW+++N W W Sbjct: 295 YGTTVDGTNYWIVRNSWGPEW 315 >UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF6860, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 251 Score = 104 bits (249), Expect = 2e-21 Identities = 45/73 (61%), Positives = 57/73 (78%) Frame = +1 Query: 295 IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGND 474 IP+GDEQ L +AVAT+GP++VAIDASH+SF YSSG+Y E C+ L H VL+VGYG+ Sbjct: 122 IPKGDEQALADAVATIGPITVAIDASHSSFLFYSSGIYEESNCNPNNLSHAVLLVGYGS- 180 Query: 475 EQGVEYWLLKNCW 513 E G +YWL+KN W Sbjct: 181 EGGQDYWLIKNRW 193 Score = 103 bits (248), Expect = 2e-21 Identities = 43/72 (59%), Positives = 58/72 (80%) Frame = +2 Query: 
11 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKD 190 G CGSCW+FSTTGA+EGQ ++++G LVSLSEQNL+DCS+ YG GC+G M NA+ Y+ + Sbjct: 1 GYCGSCWAFSTTGAIEGQIYKKTGQLVSLSEQNLVDCSKSYGTYGCSGAWMANAYDYVVN 60 Query: 191 NGGIDTEQTYPY 226 N G+++ TYPY Sbjct: 61 N-GLESTGTYPY 71 >UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|Rep: LD36817p - Drosophila melanogaster (Fruit fly) Length = 352 Score = 104 bits (249), Expect = 2e-21 Identities = 42/78 (53%), Positives = 58/78 (74%) Frame = +2 Query: 17 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNG 196 CG+CWSF+TTGALEG FR++G L SLS+QNL+DC++ YGN GC+GG + F+YI+D+ Sbjct: 152 CGACWSFATTGALEGHLFRRTGVLASLSQQNLVDCADDYGNMGCDGGFQEYGFEYIRDH- 210 Query: 197 GIDTEQTYPYEGVDDKCR 250 G+ YPY + +CR Sbjct: 211 GVTLANKYPYTQTEMQCR 228 Score = 89.0 bits (211), Expect = 7e-17 Identities = 35/80 (43%), Positives = 56/80 (70%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGY 465 + I GDE+K+ E +AT+GP++ +++A SF+ YS G+Y ++EC+ EL+H V VVGY Sbjct: 247 YATITPGDEEKMKEVIATLGPLACSMNADTISFEQYSGGIYEDEECNQGELNHSVTVVGY 306 Query: 466 GNDEQGVEYWLLKNCWAARW 525 G E G +YW++KN ++ W Sbjct: 307 GT-ENGRDYWIIKNSYSQNW 325 >UniRef50_Q22W19 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 103 bits (248), Expect = 2e-21 Identities = 46/84 (54%), Positives = 60/84 (71%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG+CGSCW+FST G LEG + +G L S SEQ ++DCS+ N GCNGG + A+KY Sbjct: 139 KNQGQCGSCWAFSTVGGLEGAYAIATGNLTSFSEQQIVDCSK--ANAGCNGGDLPPAYKY 196 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 + N GI+TE YPY+GV+ KC Y Sbjct: 197 VVQN-GIETEADYPYKGVNQKCAY 219 Score = 57.6 bits (133), Expect = 2e-07 Identities = 36/112 (32%), Positives = 51/112 (45%) Frame = +1 Query: 190 QRGHRHRADLPLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDA 369 Q G AD P +G 
++ + FV + +L A+ PV + I+A Sbjct: 199 QNGIETEADYPYKGVNQKCAYDASKVVFKPKSFVQVTPNSPDQLAIAL-NKEPVPICIEA 257 Query: 370 SHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 +FQ Y+SG+ + C T LDH VL VGY D W++KN W A W Sbjct: 258 DQKAFQFYTSGIISSG-C-GTNLDHCVLAVGYDADS-----WIVKNSWGASW 302 >UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=15; Magnoliophyta|Rep: Cysteine proteinase RD19a precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 368 Score = 103 bits (248), Expect = 2e-21 Identities = 48/105 (45%), Positives = 70/105 (66%), Gaps = 7/105 (6%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYG-------NNGCNGGL 160 K+QG CGSCWSFS TGALEG +F +G LVSLSEQ L+DC + ++GCNGGL Sbjct: 151 KNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGL 210 Query: 161 MDNAFKYIKDNGGIDTEQTYPYEGVDDKCRYIPRTPVLRTWASWT 295 M++AF+Y GG+ E+ YPY G D K + ++ ++ + ++++ Sbjct: 211 MNSAFEYTLKTGGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNFS 255 Score = 47.2 bits (107), Expect = 3e-04 Identities = 28/79 (35%), Positives = 41/79 (51%), Gaps = 6/79 (7%) Frame = +1 Query: 307 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGV 486 DE+++ + GP++VAI+A + Q Y GV C+ L+HGVL+VGYG Sbjct: 260 DEEQIAANLVKNGPLAVAINAGY--MQTYIGGVSCPYICTR-RLNHGVLLVGYGAAGYAP 316 Query: 487 E------YWLLKNCWAARW 525 YW++KN W W Sbjct: 317 ARFKEKPYWIIKNSWGETW 335 >UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin L-like proteinase" precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 315 Score = 103 bits (247), Expect = 3e-21 Identities = 47/90 (52%), Positives = 61/90 (67%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG+CGSCW+FSTTG+LEGQ V LSEQ L+DC + N GCNGGLM +AF Y Sbjct: 126 KDQGQCGSCWAFSTTGSLEGQLAIHKNQRVPLSEQELVDC-DTSRNAGCNGGLMTDAFNY 184 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIPRTPV 271 +K + G+ +E Y Y G DD+C+ + P+ Sbjct: 185 VKRH-GLSSESQYAYTGRDDRCKNVENKPL 213 Score = 67.7 bits (158), 
Expect = 2e-10 Identities = 36/81 (44%), Positives = 50/81 (61%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVG 462 G+V++ E E L AVA+VGPVS+A+DA ++QLY G++N C T L+HGVL VG Sbjct: 218 GYVEL-ETTEDALASAVASVGPVSIAVDAD--TWQLYGGGLFNNKNC-RTNLNHGVLAVG 273 Query: 463 YGNDEQGVEYWLLKNCWAARW 525 Y D +++KN W W Sbjct: 274 YTKDA-----FIVKNSWGTSW 289 >UniRef50_Q24HI2 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 356 Score = 103 bits (247), Expect = 3e-21 Identities = 47/89 (52%), Positives = 59/89 (66%), Gaps = 1/89 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQH-FRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFK 178 KDQ CGSCW+FSTTGA+E + + SLSEQ LIDC+ + NNGC+GGL AF+ Sbjct: 143 KDQQNCGSCWTFSTTGAIESHYAIFEDVEPTSLSEQQLIDCAGAFNNNGCSGGLPSQAFE 202 Query: 179 YIKDNGGIDTEQTYPYEGVDDKCRYIPRT 265 YIK NGGI E +Y Y D +C++ P T Sbjct: 203 YIKYNGGISYENSYYYIAQDQECQFSPET 231 Score = 89.0 bits (211), Expect = 7e-17 Identities = 45/101 (44%), Positives = 64/101 (63%), Gaps = 3/101 (2%) Frame = +1 Query: 238 RQVQVHPKNTGAE-DVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNE 414 ++ Q P+ GA G +I +GDE +L +AV TVGPVS+A F+LY SGVY+ Sbjct: 223 QECQFSPETVGARVRGGSFNITQGDEDQLKQAVGTVGPVSIAFQVM-GDFKLYKSGVYSN 281 Query: 415 DECSST--ELDHGVLVVGYGNDEQGVEYWLLKNCWAARWAN 531 +CSS+ ++H VL VGYG+ E GV+YW +KN W+ W + Sbjct: 282 PDCSSSPQTVNHAVLAVGYGS-ENGVDYWYVKNSWSEFWGD 321 >UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel protein - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 328 Score = 103 bits (246), Expect = 4e-21 Identities = 44/84 (52%), Positives = 55/84 (65%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 ++QG CGSCW+FS G+LE Q R++ LV LS QNL+DCS GN GC GG + AF Y Sbjct: 129 QNQGPCGSCWAFSAVGSLEAQMKRRTAALVPLSAQNLLDCSVSLGNRGCKGGFLSRAFLY 188 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 + N GID+ YPYE + CRY Sbjct: 
189 VIQNRGIDSSTFYPYEHKEGVCRY 212 Score = 94.3 bits (224), Expect = 2e-18 Identities = 42/81 (51%), Positives = 54/81 (66%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVG 462 GF +P +E L AVA +GPVSV I+A SF Y SG+YN+ +CSS ++H VLVVG Sbjct: 223 GFRIVPRHNEAALQSAVANIGPVSVGINAKLLSFHRYRSGIYNDPKCSSALINHAVLVVG 282 Query: 463 YGNDEQGVEYWLLKNCWAARW 525 YG+ E G +YWL+KN W W Sbjct: 283 YGS-ENGQDYWLVKNSWGTAW 302 >UniRef50_Q22A69 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 330 Score = 103 bits (246), Expect = 4e-21 Identities = 46/85 (54%), Positives = 60/85 (70%), Gaps = 1/85 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQ-SGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFK 178 K+QG CGSCW+FSTTG++EGQ+ Q L S SEQ L+DC + + GCNGGLMDNAF Sbjct: 128 KNQGSCGSCWAFSTTGSIEGQYVLQLKQNLTSFSEQQLVDCDTK-EDQGCNGGLMDNAFT 186 Query: 179 YIKDNGGIDTEQTYPYEGVDDKCRY 253 Y+ ++ ++TE YPY VD C+Y Sbjct: 187 YL-ESAKLETESAYPYTAVDGSCKY 210 Score = 67.3 bits (157), Expect = 2e-10 Identities = 35/85 (41%), Positives = 52/85 (61%), Gaps = 5/85 (5%) Frame = +1 Query: 286 FVDIPEGD-----EQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGV 450 FVDI +G E + A+ +GP+SVAI+A++ F Y+ G+ N C+ L+HGV Sbjct: 222 FVDIEQGKTVADTENTMGVALDNIGPLSVAINANNLQF--YAGGISNPLICNPNGLNHGV 279 Query: 451 LVVGYGNDEQGVEYWLLKNCWAARW 525 L+VG G+ E G ++W +KN W A W Sbjct: 280 LIVGLGS-ENGKDFWKVKNSWGASW 303 >UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia theta|Rep: Cathepsin H precursor - Guillardia theta (Cryptomonas phi) Length = 353 Score = 102 bits (245), Expect = 5e-21 Identities = 44/82 (53%), Positives = 55/82 (67%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGSCW+FST ALE H ++G +V LSEQ L+DC+ + NNGCNGGL AF+Y Sbjct: 139 KNQGTCGSCWTFSTAAALESLHAIKTGEMVLLSEQQLVDCAADFKNNGCNGGLPSQAFEY 198 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 I NGG+ + 
YPY D C Sbjct: 199 IMYNGGLSKMEEYPYVCGDGHC 220 Score = 65.7 bits (153), Expect = 8e-10 Identities = 34/94 (36%), Positives = 49/94 (52%), Gaps = 2/94 (2%) Frame = +1 Query: 256 PKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSST- 432 P + GA+ + GDE + V + P+SVA + + YSSGVY+ C T Sbjct: 235 PWSVGAKVSKVANFTPGDEISMKTVVGSHNPISVAFEVV-ADLRHYSSGVYSSPTCVGTP 293 Query: 433 -ELDHGVLVVGYGNDEQGVEYWLLKNCWAARWAN 531 +++H VL VGYG E G+ YW +KN W W + Sbjct: 294 DKVNHAVLAVGYGT-EGGIPYWTIKNSWGFAWGD 326 >UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin L-like protease; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to cathepsin L-like protease - Nasonia vitripennis Length = 353 Score = 101 bits (243), Expect = 9e-21 Identities = 43/85 (50%), Positives = 59/85 (69%), Gaps = 1/85 (1%) Frame = +2 Query: 2 KDQG-KCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFK 178 +DQG CGSCW+FS GALE Q+F+++G L +LS QNLIDC+ +YGN GC GG +F+ Sbjct: 148 RDQGLTCGSCWAFSAAGALEAQYFKKTGVLTALSAQNLIDCTMEYGNLGCGGGSAALSFQ 207 Query: 179 YIKDNGGIDTEQTYPYEGVDDKCRY 253 ++ D G++ E Y YEG +C Y Sbjct: 208 FVVDQKGLEPEANYSYEGRTKECPY 232 Score = 99.1 bits (236), Expect = 7e-20 Identities = 43/84 (51%), Positives = 55/84 (65%), Gaps = 1/84 (1%) Frame = +1 Query: 277 DVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLV 456 D F+ + GDE L AVATVGP S AID SH +F+ YS GVY + EC+ +LDH VL+ Sbjct: 243 DASFIYVNGGDEATLKVAVATVGPFSAAIDGSHDTFRFYSEGVYYQPECNEDDLDHAVLI 302 Query: 457 VGYGNDEQ-GVEYWLLKNCWAARW 525 VGYG D + ++WL+KN W W Sbjct: 303 VGYGTDNRTDQDFWLVKNSWGETW 326 >UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; n=2; Danio rerio|Rep: hypothetical protein LOC550326 - Danio rerio Length = 531 Score = 101 bits (241), Expect = 2e-20 Identities = 45/85 (52%), Positives = 60/85 (70%), Gaps = 1/85 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQ CGSCWSF+TTG LEG F ++G L SLS+Q L+DC+ +GNNGC+GG AF++ Sbjct: 328 
KDQAVCGSCWSFATTGTLEGALFLKTGQLTSLSQQMLVDCTWGFGNNGCDGGEEWRAFEW 387 Query: 182 IKDNGGIDTEQTY-PYEGVDDKCRY 253 I +GGI T ++Y Y G++ C Y Sbjct: 388 IMKHGGISTAESYGAYMGMNGLCHY 412 Score = 87.8 bits (208), Expect = 2e-16 Identities = 43/91 (47%), Positives = 58/91 (63%), Gaps = 4/91 (4%) Frame = +1 Query: 271 AEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSS--TELDH 444 A+ G+ ++ GD L A+ GPV+V+IDA+H SF YS+GVY E EC + +LDH Sbjct: 419 AQLTGYTNVTSGDILALKAAIFKFGPVAVSIDAAHRSFAFYSNGVYYEPECKNGINDLDH 478 Query: 445 GVLVVGYG--NDEQGVEYWLLKNCWAARWAN 531 VL VGYG N+E YWL+KN W++ W N Sbjct: 479 AVLAVGYGIMNNE---SYWLVKNSWSSYWGN 506 >UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae|Rep: Cysteine proteinase - Hypera postica (alfalfa weevil) Length = 324 Score = 101 bits (241), Expect = 2e-20 Identities = 45/84 (53%), Positives = 61/84 (72%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG CGSCW+FS TG+ EG + R+SG LVSLSEQ LIDC + GC+GG +D+ FKY Sbjct: 128 KDQGDCGSCWAFSITGSTEGAYARKSGKLVSLSEQQLIDCCTD-TSAGCDGGSLDDNFKY 186 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 + + G+ +E++Y Y+G D C+Y Sbjct: 187 VMKD-GLQSEESYTYKGEDGACKY 209 Score = 94.3 bits (224), Expect = 2e-18 Identities = 42/80 (52%), Positives = 54/80 (67%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGY 465 + IP DE L+EAVATVGPVSV +DAS+ S Y SG+Y + +CS L+H +L VGY Sbjct: 221 YTSIPAEDEDALLEAVATVGPVSVGMDASYLS--SYDSGIYEDQDCSPAGLNHAILAVGY 278 Query: 466 GNDEQGVEYWLLKNCWAARW 525 G E G +YW++KN W A W Sbjct: 279 GT-ENGKDYWIIKNSWGASW 297 >UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra erinaceieuropaei|Rep: Cysteine proteinase - Spirometra erinaceieuropaei (Tapeworm) Length = 336 Score = 101 bits (241), Expect = 2e-20 Identities = 47/84 (55%), Positives = 57/84 (67%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG+CGSCWSFS GA+EG ++G L SLSEQ L+DCS YGN GCNGGLM AF+Y Sbjct: 137 
KNQGQCGSCWSFSANGAIEGAIQIKTGALRSLSEQQLMDCSWDYGNQGCNGGLMPQAFQY 196 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 + G++ E Y Y D CRY Sbjct: 197 AQ-RYGVEAEVDYRYTERDGVCRY 219 Score = 99.5 bits (237), Expect = 5e-20 Identities = 45/85 (52%), Positives = 55/85 (64%) Frame = +1 Query: 271 AEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGV 450 A G+ ++PEGDE L AVAT+GP+SV IDA+ F YS GV+ CS +DHGV Sbjct: 226 ANVTGYAELPEGDEGGLQRAVATIGPISVGIDAADPGFMSYSHGVFVSKTCSPYAIDHGV 285 Query: 451 LVVGYGNDEQGVEYWLLKNCWAARW 525 LVVGYG E G YWL+KN W + W Sbjct: 286 LVVGYG-AENGDAYWLVKNSWGSSW 309 >UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Actinidin Act3a - Actinidia eriantha Length = 380 Score = 100 bits (240), Expect = 2e-20 Identities = 42/82 (51%), Positives = 59/82 (71%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG C SCW+F+T +E + +G L+SLSEQ L+DC+ N GC GG MD+A+++ Sbjct: 142 KNQGLCSSCWAFATIATVESINQIITGDLISLSEQELVDCNRTPINEGCKGGFMDDAYEF 201 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 I +NGGI+TE+ YPY G DD+C Sbjct: 202 IINNGGINTEENYPYIGQDDQC 223 Score = 72.5 bits (170), Expect = 7e-12 Identities = 33/80 (41%), Positives = 49/80 (61%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGY 465 + +P DE + AVA PVSVAIDA F+ Y SG++ C +T L+H V ++GY Sbjct: 238 YEQVPPNDELAMKRAVA-YQPVSVAIDAYCLGFRFYQSGIFTGGSCGTT-LNHAVTIIGY 295 Query: 466 GNDEQGVEYWLLKNCWAARW 525 G E G++YW++KN + +W Sbjct: 296 GT-ENGIDYWIVKNSYGTQW 314 >UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schistosoma|Rep: Preprocathepsin cathepsin L - Schistosoma japonicum (Blood fluke) Length = 331 Score = 100 bits (240), Expect = 2e-20 Identities = 46/84 (54%), Positives = 56/84 (66%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K QG CGSCW+FS TGA+EGQ R+ LV LSEQ L+DC YGN+GC GG MD AF Y Sbjct: 132 KHQGLCGSCWAFSATGAIEGQLRRKHKKLVKLSEQQLVDCRYNYGNDGCEGGTMDLAFNY 191 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 ++ + I++E Y Y G D 
C Y Sbjct: 192 LEKH-YIESENDYKYLGHDANCHY 214 Score = 80.6 bits (190), Expect = 3e-14 Identities = 38/80 (47%), Positives = 49/80 (61%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGY 465 F D+P DE+ L +AV GP+SV I A S LY SG+Y +C +++HGVL VGY Sbjct: 226 FGDLPARDEKTLEKAVYQYGPISVGIVALD-SLILYKSGIYESKDCKYADINHGVLAVGY 284 Query: 466 GNDEQGVEYWLLKNCWAARW 525 G E G +YWL+KN W W Sbjct: 285 GR-ENGKDYWLIKNSWGDLW 303 >UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|Rep: Cathepsin L precursor - Schistosoma mansoni (Blood fluke) Length = 319 Score = 99 bits (238), Expect = 4e-20 Identities = 44/82 (53%), Positives = 59/82 (71%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGSCW+FSTTG +E Q FR++G L+SLSEQ L+DC ++GCNGGL NA++ Sbjct: 121 KNQGMCGSCWAFSTTGNVESQWFRKTGKLLSLSEQQLVDCDGL--DDGCNGGLPSNAYES 178 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 I GG+ E YPY+ ++KC Sbjct: 179 IIKMGGLMLEDNYPYDAKNEKC 200 Score = 48.4 bits (110), Expect = 1e-04 Identities = 26/75 (34%), Positives = 38/75 (50%), Gaps = 2/75 (2%) Frame = +1 Query: 307 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNE--DECSSTELDHGVLVVGYGNDEQ 480 DE +L + +SV ++A Q Y G+ + CS LDH VL+VGYG E+ Sbjct: 220 DETELAAWLYHNSTISVGMNA--LLLQFYQHGISHPWWIFCSKYLLDHAVLLVGYGVSEK 277 Query: 481 GVEYWLLKNCWAARW 525 +W++KN W W Sbjct: 278 NEPFWIVKNSWGVEW 292 >UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryza sativa|Rep: Putative cysteine proteinase - Oryza sativa subsp. 
japonica (Rice) Length = 352 Score = 99.5 bits (237), Expect = 5e-20 Identities = 42/84 (50%), Positives = 58/84 (69%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+Q CG CW+FST A+EG H +G LVSLSEQ L+DC++ N GC GG +DNAF+Y Sbjct: 145 KNQRSCGCCWAFSTVAAVEGIHQITTGELVSLSEQQLLDCAD---NGGCTGGSLDNAFQY 201 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 + ++GG+ TE Y Y+G C++ Sbjct: 202 MANSGGVTTEAAYAYQGAQGACQF 225 Score = 72.9 bits (171), Expect = 5e-12 Identities = 38/86 (44%), Positives = 49/86 (56%), Gaps = 3/86 (3%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVG 462 G+ + DE L AVA+ PVSVAI+ S F+ Y SGV+ D C T+LDH V VVG Sbjct: 240 GYQRVNPNDEGSLAAAVASQ-PVSVAIEGSGAMFRHYGSGVFTADSC-GTKLDHAVAVVG 297 Query: 463 YGNDEQGV---EYWLLKNCWAARWAN 531 YG + G YW++KN W W + Sbjct: 298 YGAEADGSGGGGYWIIKNSWGTTWGD 323 >UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 21 SCAF14577, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 406 Score = 99.1 bits (236), Expect = 7e-20 Identities = 41/76 (53%), Positives = 56/76 (73%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 ++QG C SCW+FS+ GALEGQ +++G+LV LS QNL+DCS GN GC GG + ++ Y Sbjct: 171 QNQGFCNSCWAFSSLGALEGQMKKRTGFLVPLSPQNLLDCSISDGNLGCRGGYISKSYSY 230 Query: 182 IKDNGGIDTEQTYPYE 229 I NGG+D++ YPYE Sbjct: 231 IIRNGGVDSDSFYPYE 246 >UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio molitor (Yellow mealworm) Length = 336 Score = 99.1 bits (236), Expect = 7e-20 Identities = 47/94 (50%), Positives = 59/94 (62%), Gaps = 2/94 (2%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQH--FRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAF 175 K+QG CGSCW+FS+TGA+E Q +GY S+SEQ L+DC GC+GG M++AF Sbjct: 137 KNQGSCGSCWAFSSTGAIESQMKIANGAGYDSSVSEQQLVDCVPNA--LGCSGGWMNDAF 194 Query: 176 KYIKDNGGIDTEQTYPYEGVDDKCRYIPRTPVLR 277 Y+ 
NGGID+E YPYE D C Y P R Sbjct: 195 TYVAQNGGIDSEGAYPYEMADGNCHYDPNQVAAR 228 Score = 82.6 bits (195), Expect = 6e-15 Identities = 41/90 (45%), Positives = 50/90 (55%) Frame = +1 Query: 256 PKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTE 435 P A G+V + DE L + VAT GPV+VA DA F YS GVY C + + Sbjct: 222 PNQVAARLSGYVYLSGPDENMLADMVATKGPVAVAFDADDP-FGSYSGGVYYNPTCETNK 280 Query: 436 LDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 H VL+VGYGN E G +YWL+KN W W Sbjct: 281 FTHAVLIVGYGN-ENGQDYWLVKNSWGDGW 309 >UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1 precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 355 Score = 99.1 bits (236), Expect = 7e-20 Identities = 45/75 (60%), Positives = 54/75 (72%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG+CGSCW+FST A+EG + +G L SLSEQ LIDC + N+GCNGGLMD AF+Y Sbjct: 153 KDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTF-NSGCNGGLMDYAFQY 211 Query: 182 IKDNGGIDTEQTYPY 226 I GG+ E YPY Sbjct: 212 IISTGGLHKEDDYPY 226 Score = 81.8 bits (193), Expect = 1e-14 Identities = 40/81 (49%), Positives = 56/81 (69%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVG 462 G+ D+PE D++ L++A+A PVSVAI+AS FQ Y GV+N +C T+LDHGV VG Sbjct: 247 GYEDVPENDDESLVKALAHQ-PVSVAIEASGRDFQFYKGGVFN-GKC-GTDLDHGVAAVG 303 Query: 463 YGNDEQGVEYWLLKNCWAARW 525 YG+ +G +Y ++KN W RW Sbjct: 304 YGS-SKGSDYVIVKNSWGPRW 323 >UniRef50_O16454 Cluster: Temporarily assigned gene name protein 196; n=4; Bilateria|Rep: Temporarily assigned gene name protein 196 - Caenorhabditis elegans Length = 477 Score = 98.7 bits (235), Expect = 9e-20 Identities = 44/85 (51%), Positives = 55/85 (64%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGSCW+FSTTG +EG F LVSLSEQ L+DC + GCNGGL NA+K Sbjct: 280 KNQGNCGSCWAFSTTGNVEGAWFIAKNKLVSLSEQELVDCDSM--DQGCNGGLPSNAYKE 337 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYI 256 I GG++ E YPY+G + C + Sbjct: 338 
IIRMGGLEPEDAYPYDGRGETCHLV 362 Score = 62.5 bits (145), Expect = 7e-09 Identities = 31/83 (37%), Positives = 50/83 (60%), Gaps = 2/83 (2%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDE--CSSTELDHGVLV 456 G V++P DE ++ + + T GP+S+ ++A+ + Q Y GV + + C L+HGVL+ Sbjct: 372 GSVELPH-DEVEMQKWLVTKGPISIGLNAN--TLQFYRHGVVHPFKIFCEPFMLNHGVLI 428 Query: 457 VGYGNDEQGVEYWLLKNCWAARW 525 VGYG D + YW++KN W W Sbjct: 429 VGYGKDGR-KPYWIVKNSWGPNW 450 >UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; n=16; Chrysomelidae|Rep: Digestive cysteine protease intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 326 Score = 98.7 bits (235), Expect = 9e-20 Identities = 46/93 (49%), Positives = 61/93 (65%), Gaps = 1/93 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGC-NGGLMDNAFK 178 KDQ CGSCW+FS TGALEGQ+ + +SLSEQ L+DCS YGN C GG M AF+ Sbjct: 126 KDQNPCGSCWAFSATGALEGQNAILNNVKISLSEQQLLDCSAAYGNGNCKEGGDMSAAFE 185 Query: 179 YIKDNGGIDTEQTYPYEGVDDKCRYIPRTPVLR 277 Y++D GI +E++YPY +C+Y +L+ Sbjct: 186 YVRDY-GIQSEKSYPYIRKQTECQYDASKTILK 217 Score = 63.7 bits (148), Expect = 3e-09 Identities = 32/75 (42%), Positives = 46/75 (61%), Gaps = 3/75 (4%) Frame = +1 Query: 310 EQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQ--- 480 E+ L +AV +GP+S+A+++ QLY SG+ + CS +LDHGVLVVGYG Q Sbjct: 228 EEGLRKAVGAIGPISIAMNSD--PLQLYYSGIISGKGCSH-DLDHGVLVVGYGKASQWSG 284 Query: 481 GVEYWLLKNCWAARW 525 ++W +KN W W Sbjct: 285 ETKFWRVKNSWGKIW 299 >UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_23, whole genome shotgun sequence - Paramecium tetraurelia Length = 321 Score = 98.7 bits (235), Expect = 9e-20 Identities = 45/87 (51%), Positives = 59/87 (67%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG CGSCW+FS GALE Q +V LSEQ+L+DC+ YGN GC+GG M++A Y Sbjct: 135 
KDQGDCGSCWAFSAVGALEINTKIQFNEIVDLSEQDLVDCAGPYGNAGCDGGWMESALDY 194 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIPR 262 I D+G +T + YPY+G D C+ + R Sbjct: 195 IIDSGIAET-KVYPYKGEDGICKSVER 220 Score = 41.1 bits (92), Expect = 0.019 Identities = 30/82 (36%), Positives = 50/82 (60%) Frame = +1 Query: 280 VGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVV 459 +G+VD+ +G Q + A+ VSV +DA T+++ YSSGV++ +C L+HGV++V Sbjct: 226 IGYVDL-DGC-QDISNALIQQS-VSVGVDA--TNWRFYSSGVFS--DCKK-YLNHGVVLV 277 Query: 460 GYGNDEQGVEYWLLKNCWAARW 525 G ++ GV W ++N W W Sbjct: 278 GI--NKNGV--WKVRNSWGQDW 295 >UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1; Dictyostelium discoideum AX4|Rep: Counting factor associated protein - Dictyostelium discoideum AX4 Length = 531 Score = 98.3 bits (234), Expect = 1e-19 Identities = 44/89 (49%), Positives = 58/89 (65%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG CGSCW+F +TG+LEG + +G LVSLSEQ L+DC+ G+ GC GG +AF+Y Sbjct: 325 KDQGICGSCWTFGSTGSLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSAFQY 384 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIPRTP 268 + + G + TE YPY + CR TP Sbjct: 385 VMEIGSLATESNYPYLMQNGLCRDRTVTP 413 Score = 90.2 bits (214), Expect = 3e-17 Identities = 41/89 (46%), Positives = 56/89 (62%), Gaps = 2/89 (2%) Frame = +1 Query: 265 TGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSS--TEL 438 +G G+V++ G E L A+AT GPV++AIDAS F+ Y SGVYN C + +L Sbjct: 414 SGVSITGYVNVTSGSESALQNAIATTGPVAIAIDASVDDFRYYMSGVYNNPACKNGLDDL 473 Query: 439 DHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 DH VL +GYG QG +Y+L+KN W+ W Sbjct: 474 DHEVLAIGYGT-YQGQDYFLVKNSWSTNW 501 >UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 precursor; n=4; Schizophora|Rep: Putative cysteine proteinase CG12163 precursor - Drosophila melanogaster (Fruit fly) Length = 614 Score = 98.3 bits (234), Expect = 1e-19 Identities = 42/84 (50%), Positives = 58/84 (69%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGSCW+FS TG +EG 
+ ++G L SEQ L+DC ++ CNGGLMDNA+K Sbjct: 410 KNQGSCGSCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTT--DSACNGGLMDNAYKA 467 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 IKD GG++ E YPY+ ++C + Sbjct: 468 IKDIGGLEYEAEYPYKAKKNQCHF 491 Score = 79.8 bits (188), Expect = 4e-14 Identities = 42/117 (35%), Positives = 64/117 (54%), Gaps = 7/117 (5%) Frame = +1 Query: 196 GHRHRADLPLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASH 375 G + A+ P + Q + + + GFVD+P+G+E + E + GP+S+ I+A+ Sbjct: 473 GLEYEAEYPYKAKKNQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLANGPISIGINAN- 531 Query: 376 TSFQLYSSGVYN--EDECSSTELDHGVLVVGYG-----NDEQGVEYWLLKNCWAARW 525 + Q Y GV + + CS LDHGVLVVGYG N + + YW++KN W RW Sbjct: 532 -AMQFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRW 587 >UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; n=23; Magnoliophyta|Rep: Senescence-specific cysteine protease - Arabidopsis thaliana (Mouse-ear cress) Length = 346 Score = 97.9 bits (233), Expect = 2e-19 Identities = 43/82 (52%), Positives = 54/82 (65%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CG CW+FS A+EG + G L+SLSEQ L+DC + GC GGLMD AF++ Sbjct: 146 KNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDT--NDFGCEGGLMDTAFEH 203 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 IK GG+ TE YPY+G D C Sbjct: 204 IKATGGLTTESNYPYKGEDATC 225 Score = 86.2 bits (204), Expect = 5e-16 Identities = 43/93 (46%), Positives = 56/93 (60%) Frame = +1 Query: 247 QVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECS 426 + +PK T G+ D+P DEQ LM+AVA PVSV I+ FQ YSSGV+ EC+ Sbjct: 229 KTNPKATSI--TGYEDVPVNDEQALMKAVAHQ-PVSVGIEGGGFDFQFYSSGVFT-GECT 284 Query: 427 STELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 T LDH V +GYG G +YW++KN W +W Sbjct: 285 -TYLDHAVTAIGYGESTNGSKYWIIKNSWGTKW 316 >UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 2 - Rhipicephalus appendiculatus (Brown ear tick) Length = 564 Score = 97.9 bits (233), Expect = 2e-19 Identities = 45/82 (54%), Positives = 53/82 
(64%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQ CGSCWSF T G LEG +FR++G LV LSEQ L+DCS GNNGC+GG A++Y Sbjct: 361 KDQAVCGSCWSFGTVGELEGAYFRKTGRLVRLSEQQLVDCSWNNGNNGCDGGEDFRAYEY 420 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 I D+G E Y G D C Sbjct: 421 IADHGLASDEDYGAYIGQDGVC 442 Score = 49.2 bits (112), Expect = 7e-05 Identities = 31/78 (39%), Positives = 40/78 (51%), Gaps = 2/78 (2%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYS--SGVYNEDECSSTELDHGVLVV 459 +V+I D+ L A+A VGPVSV+IDA+ SF Y S + + LDH VL Sbjct: 457 YVNITNRDD--LPTALANVGPVSVSIDAALRSFSFYPTVSSMIPTAAMDTDSLDHSVLRQ 514 Query: 460 GYGNDEQGVEYWLLKNCW 513 QG YW +KN W Sbjct: 515 SATRTLQGEPYWGVKNSW 532 >UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicotyledons|Rep: Cysteine proteinase - Mesembryanthemum crystallinum (Common ice plant) Length = 367 Score = 97.5 bits (232), Expect = 2e-19 Identities = 43/83 (51%), Positives = 55/83 (66%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG+CG CW+FS A+EG + +G L+SLSEQ LIDC Q N+GC GG M AF+Y Sbjct: 142 KNQGRCGGCWAFSAAAAVEGINQITTGQLISLSEQQLIDCDTQ--NSGCRGGTMGRAFEY 199 Query: 182 IKDNGGIDTEQTYPYEGVDDKCR 250 IK GGI +E YPY+ C+ Sbjct: 200 IKQRGGITSEANYPYKAQAGMCK 222 Score = 58.8 bits (136), Expect = 9e-08 Identities = 28/63 (44%), Positives = 36/63 (57%), Gaps = 3/63 (4%) Frame = +1 Query: 346 PVSVAIDA---SHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLKNCWA 516 PVSVA+DA S + Y GV+ C T+L+HGV VGYG G +YW++KN W Sbjct: 254 PVSVAVDATTWSSLDWMFYFQGVFT-GPCG-TKLNHGVTAVGYGTTNDGYDYWIIKNSWG 311 Query: 517 ARW 525 W Sbjct: 312 ETW 314 >UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchocercidae|Rep: Cathepsin L-like precursor - Brugia pahangi (Filarial nematode worm) Length = 395 Score = 97.5 bits (232), Expect = 2e-19 Identities = 40/84 (47%), Positives = 57/84 (67%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 ++QG+CGSC++F+T ALE H + +G L+ LS 
QN++DC+ GNNGC+GG M AF+Y Sbjct: 198 RNQGECGSCYAFATAAALEAYHKQMTGRLLDLSPQNIVDCTRNLGNNGCSGGYMPTAFQY 257 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 GI E YPY G + +CR+ Sbjct: 258 -ASRYGIAMESRYPYVGTEQRCRW 280 Score = 80.6 bits (190), Expect = 3e-14 Identities = 39/83 (46%), Positives = 45/83 (54%) Frame = +1 Query: 277 DVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLV 456 D GF +I GDE L AVA GPV V I S SF+ Y GVY+E C DH VL Sbjct: 289 DNGFNEIQPGDELALKHAVAKRGPVVVGISGSKRSFRFYKDGVYSEGNCGRP--DHAVLA 346 Query: 457 VGYGNDEQGVEYWLLKNCWAARW 525 VGYG +YW++KN W W Sbjct: 347 VGYGTHPSYGDYWIVKNSWGTDW 369 >UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 317 Score = 97.1 bits (231), Expect = 3e-19 Identities = 44/79 (55%), Positives = 52/79 (65%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 +DQ +CGSCW+FS GALEGQ F + G L LS Q L+DCS Y N GCNGG A+ Y Sbjct: 120 RDQEQCGSCWAFSAAGALEGQRFLKEGKLEVLSTQQLVDCSRDYKNEGCNGGWPHWAYDY 179 Query: 182 IKDNGGIDTEQTYPYEGVD 238 IKDN G+ E Y Y+G D Sbjct: 180 IKDN-GLCLESKYKYQGYD 197 Score = 70.5 bits (165), Expect = 3e-11 Identities = 32/73 (43%), Positives = 46/73 (63%), Gaps = 1/73 (1%) Frame = +1 Query: 310 EQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTE-LDHGVLVVGYGNDEQGV 486 E+ L EAV T GP++V ++A+ +QLYS G+ C E ++H VL VGYG+ E G Sbjct: 221 EEALKEAVGTAGPIAVCVNAND-DWQLYSGGILESQSCPGGESINHAVLAVGYGS-ENGK 278 Query: 487 EYWLLKNCWAARW 525 ++WL+KN W W Sbjct: 279 DFWLIKNSWNTYW 291 >UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|Rep: Cathepsin F precursor - Homo sapiens (Human) Length = 484 Score = 97.1 bits (231), Expect = 3e-19 Identities = 43/84 (51%), Positives = 55/84 (65%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG CGSCW+FS TG +EGQ F G L+SLSEQ L+DC + + C GGL NA+ Sbjct: 287 
KDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKM--DKACMGGLPSNAYSA 344 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 IK+ GG++TE Y Y+G C + Sbjct: 345 IKNLGGLETEDDYSYQGHMQSCNF 368 Score = 58.4 bits (135), Expect = 1e-07 Identities = 30/73 (41%), Positives = 39/73 (53%) Frame = +1 Query: 307 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGV 486 +EQKL +A GP+SVAI+A F + CS +DH VL+VGYGN V Sbjct: 386 NEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGN-RSDV 444 Query: 487 EYWLLKNCWAARW 525 +W +KN W W Sbjct: 445 PFWAIKNSWGTDW 457 >UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadidae|Rep: Cysteine protease - Tritrichomonas foetus (Trichomonas foetus) Length = 315 Score = 96.7 bits (230), Expect = 4e-19 Identities = 43/80 (53%), Positives = 56/80 (70%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGY 465 F+ I E DE+ L V T GPV+VAIDASH SFQLY SG+Y+E ECS+T L+HGV +G+ Sbjct: 211 FLYIAENDEEDLAANVETHGPVAVAIDASHQSFQLYKSGIYDEPECSATFLNHGVGCIGF 270 Query: 466 GNDEQGVEYWLLKNCWAARW 525 G+D +YW++ N W W Sbjct: 271 GSDND-TKYWIVPNSWGLTW 289 Score = 82.2 bits (194), Expect = 8e-15 Identities = 41/86 (47%), Positives = 52/86 (60%), Gaps = 2/86 (2%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQ CGSCW+FS A E + +G L S SEQNL+DC + G GC+GGLMD A+KY Sbjct: 116 KDQAACGSCWAFSAIQAAESAYAISTGTLESYSEQNLVDCVQ--GCYGCSGGLMDYAYKY 173 Query: 182 IKD--NGGIDTEQTYPYEGVDDKCRY 253 I D G + E Y Y +D C++ Sbjct: 174 IIDRQKGKMILESDYVYTALDGVCKF 199 >UniRef50_Q23EG5 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 429 Score = 96.7 bits (230), Expect = 4e-19 Identities = 41/93 (44%), Positives = 63/93 (67%), Gaps = 1/93 (1%) Frame = +2 Query: 17 CGSCWSFSTTGALEGQHFRQSGYL-VSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDN 193 CGSCW+FS TGA+E ++G +LS+Q L+DC+ ++ N GC+GGL AF+YI Sbjct: 147 
CGSCWTFSATGAIESHLALKTGKAPFNLSQQQLVDCAGKFDNQGCDGGLPSRAFEYIAYA 206 Query: 194 GGIDTEQTYPYEGVDDKCRYIPRTPVLRTWASW 292 GGI++ + YPY+G D KC++ P+ V + +S+ Sbjct: 207 GGIESSRDYPYKGKDGKCKFKPQKVVAKVQSSF 239 Score = 59.3 bits (137), Expect = 7e-08 Identities = 34/106 (32%), Positives = 55/106 (51%), Gaps = 2/106 (1%) Frame = +1 Query: 214 DLPLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLY 393 D P +G + + P+ A+ +I DE +L+ +A GPVS+A + F+ Y Sbjct: 214 DYPYKGKDGKCKFKPQKVVAKVQSSFNITFQDENELIYHLAKNGPVSIAYQVTD-DFENY 272 Query: 394 SSGVYNEDECSS--TELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 G+Y+ ECS+ E++H VL VGY + Y+++KN W W Sbjct: 273 EGGIYSNPECSTDPQEVNHAVLAVGYNLTGR---YYIVKNSWGKDW 315 >UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|Rep: CG4847-PD, isoform D - Drosophila melanogaster (Fruit fly) Length = 420 Score = 96.7 bits (230), Expect = 4e-19 Identities = 44/87 (50%), Positives = 58/87 (66%), Gaps = 3/87 (3%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCS--EQYGNNGCNGGLMDNAF 175 K QG CGSCW+F+TTGA+EG FR++G L +LSEQNL+DC E +G NGC+GG + AF Sbjct: 219 KFQGTCGSCWAFATTGAIEGHTFRKTGSLPNLSEQNLVDCGPVEDFGLNGCDGGFQEAAF 278 Query: 176 KYIKD-NGGIDTEQTYPYEGVDDKCRY 253 +I + G+ E YPY C+Y Sbjct: 279 CFIDEVQKGVSQEGAYPYIDNKGTCKY 305 Score = 89.4 bits (212), Expect = 5e-17 Identities = 39/87 (44%), Positives = 60/87 (68%) Frame = +1 Query: 265 TGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDH 444 +GA GF IP DE++L + VAT+GPV+ +++ T + Y+ G+YN+DEC+ E +H Sbjct: 310 SGATLQGFAAIPPKDEEQLKKVVATLGPVACSVNGLET-LKNYAGGIYNDDECNKGEPNH 368 Query: 445 GVLVVGYGNDEQGVEYWLLKNCWAARW 525 +LVVGYG+ E+G +YW++KN W W Sbjct: 369 SILVVGYGS-EKGQDYWIVKNSWDDTW 394 >UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, whole genome shotgun sequence; n=7; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_22, whole genome shotgun sequence - Paramecium tetraurelia Length = 350 Score = 96.7 bits (230), Expect = 4e-19 Identities = 44/82 (53%), Positives = 56/82 (68%), Gaps = 1/82 (1%) 
Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYG-NNGCNGGLMDNAFK 178 KDQG+CGSCW+FSTTG LEG + Q+G L LSEQ L+DCS N GC+GG+ A Sbjct: 158 KDQGQCGSCWAFSTTGVLEGFYKVQTGELPDLSEQQLVDCSTLIDFNQGCDGGMPSRALN 217 Query: 179 YIKDNGGIDTEQTYPYEGVDDK 244 Y+K N G+ T+ YPYE + +K Sbjct: 218 YVKRN-GLTTQDAYPYEHIQNK 238 >UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep: Silicatein beta - Suberites domuncula (Sponge) Length = 383 Score = 95.9 bits (228), Expect = 6e-19 Identities = 44/86 (51%), Positives = 54/86 (62%) Frame = +1 Query: 268 GAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHG 447 GA G V + GDE L+ AVA GPVSV +DA+ TSFQ YS GV N CSS+ L H Sbjct: 272 GASARGIVSLASGDENTLLTAVANSGPVSVYVDATSTSFQFYSDGVLNVPYCSSSTLSHA 331 Query: 448 VLVVGYGNDEQGVEYWLLKNCWAARW 525 ++V+GYG G +YWL+KN W W Sbjct: 332 LVVIGYGK-YSGQDYWLVKNSWGPNW 356 Score = 81.4 bits (192), Expect = 1e-14 Identities = 37/77 (48%), Positives = 52/77 (67%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQ +CGS ++FS +LEG + G LV+LSEQN++DCS YGN+GC G ++ A Y Sbjct: 178 KDQLRCGSSYAFSAMASLEGINALSYGSLVTLSEQNIVDCSVTYGNHGCACGDVNRALLY 237 Query: 182 IKDNGGIDTEQTYPYEG 232 + +N G+DT + YP G Sbjct: 238 VIENDGVDTWKGYPSGG 254 >UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to CG5367-PA - Nasonia vitripennis Length = 362 Score = 95.5 bits (227), Expect = 8e-19 Identities = 41/97 (42%), Positives = 63/97 (64%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 ++Q CGSC+++S G++ GQ FRQ+G +V LSEQ L+DCS Q GN GC+GG + N +Y Sbjct: 167 ENQRDCGSCYAYSIAGSIAGQIFRQTGIVVPLSEQQLVDCSTQTGNLGCSGGSLRNTLRY 226 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIPRTPVLRTWASW 292 ++ + G+ T+ TYPY C++ + V+ SW Sbjct: 227 LERSKGLMTDATYPYTAHQGVCKFQRKLSVVNV-TSW 262 Score = 80.2 bits (189), Expect = 3e-14 Identities = 35/77 (45%), Positives = 52/77 (67%) Frame = +1 Query: 295 
IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGND 474 +P DE+ L AVAT+GP++ +I+A +FQLY SG+Y++ CSS ++H +L+VGY + Sbjct: 265 LPARDERALEAAVATIGPIAASINAGPRTFQLYHSGIYDDPTCSSDLVNHAMLIVGYTPN 324 Query: 475 EQGVEYWLLKNCWAARW 525 YW+LKN W A W Sbjct: 325 -----YWILKNWWGASW 336 >UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium tetraurelia|Rep: Cathepsin L1 precursor - Paramecium tetraurelia Length = 314 Score = 95.5 bits (227), Expect = 8e-19 Identities = 43/89 (48%), Positives = 56/89 (62%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGSCW+FS GALE + LSEQ+L+DCS Y N+GCNGG MD+AF+Y Sbjct: 127 KNQGSCGSCWAFSAVGALEINTDIELNRKYELSEQDLVDCSGPYDNDGCNGGWMDSAFEY 186 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIPRTP 268 + DN G+ + YPY D C+ + P Sbjct: 187 VADN-GLAEAKDYPYTAKDGTCKTSVKRP 214 >UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-PA - Drosophila melanogaster (Fruit fly) Length = 549 Score = 95.1 bits (226), Expect = 1e-18 Identities = 45/84 (53%), Positives = 55/84 (65%), Gaps = 2/84 (2%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHF-RQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFK 178 KDQ CGSCWSF T G LEG F + G LV LS+Q LIDCS YGNNGC+GG ++ Sbjct: 346 KDQSVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDCSWAYGNNGCDGGEDFRVYQ 405 Query: 179 YIKDNGGIDTEQTY-PYEGVDDKC 247 ++ +GG+ TE+ Y PY G D C Sbjct: 406 WMLQSGGVPTEEEYGPYLGQDGYC 429 Score = 81.0 bits (191), Expect = 2e-14 Identities = 40/85 (47%), Positives = 50/85 (58%), Gaps = 2/85 (2%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSS--TELDHGVLV 456 GFV++ D A+ GP+SVAIDAS +F YS GVY E C + LDH VL Sbjct: 442 GFVNVTSNDPNAFKLALLKHGPLSVAIDASPKTFSFYSHGVYYEPTCKNDVDGLDHAVLA 501 Query: 457 VGYGNDEQGVEYWLLKNCWAARWAN 531 VGYG+ G +YWL+KN W+ W N Sbjct: 502 VGYGS-INGEDYWLVKNSWSTYWGN 525 >UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1; Brugia malayi|Rep: Cathepsin F-like cysteine proteinase - Brugia malayi (Filarial nematode worm) Length = 461 Score = 95.1 bits 
(226), Expect = 1e-18 Identities = 44/85 (51%), Positives = 55/85 (64%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG CGSCW+FS TG +E ++G L+SLSEQ LIDC + GCNGGL NAF+ Sbjct: 264 KDQGSCGSCWAFSVTGNIESLWAIKTGKLISLSEQELIDC--DVIDKGCNGGLPINAFRE 321 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYI 256 IK GG++ E YPYE + C + Sbjct: 322 IKRMGGLEPEDQYPYEAKNGTCHLV 346 Score = 60.9 bits (141), Expect = 2e-08 Identities = 31/81 (38%), Positives = 48/81 (59%), Gaps = 2/81 (2%) Frame = +1 Query: 289 VDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYN--EDECSSTELDHGVLVVG 462 V+IP +E + +A GP+SV IDA S+ Y SG+ + + C ++++HGVL+ G Sbjct: 358 VEIPR-NETVMKAWIAQRGPLSVGIDAELLSY--YKSGILHPSKSRCPPSKINHGVLITG 414 Query: 463 YGNDEQGVEYWLLKNCWAARW 525 YG E + YW +KN W +W Sbjct: 415 YG-IENNLPYWTIKNSWGEQW 434 >UniRef50_Q235G6 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 325 Score = 94.7 bits (225), Expect = 1e-18 Identities = 43/83 (51%), Positives = 55/83 (66%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CG CWSF+TTG +EG +F L +LS+Q LIDC+ Q N GC GGL D A Y Sbjct: 133 KNQGGCGGCWSFATTGGVEGANFVYKNVLPNLSQQQLIDCNTQ--NKGCGGGLRDIALNY 190 Query: 182 IKDNGGIDTEQTYPYEGVDDKCR 250 +K+ G+ TE+ Y YE + KCR Sbjct: 191 VKET-GLTTEEEYSYEAKNGKCR 212 Score = 44.0 bits (99), Expect = 0.003 Identities = 23/60 (38%), Positives = 40/60 (66%) Frame = +1 Query: 346 PVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 PV+V ID+S+ F Y++G+++ C T+++HGVL+VGY + + E W +KN W ++ Sbjct: 242 PVTVGIDSSNLQF--YTNGIFSN--CG-TKINHGVLLVGYDSVK---EAWKVKNSWGPKF 293 >UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue of Sarcophaga 26,29kDa proteinase; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to homologue of Sarcophaga 26,29kDa proteinase - Nasonia vitripennis Length = 553 Score = 94.3 bits (224), Expect = 2e-18 
Identities = 44/83 (53%), Positives = 57/83 (68%), Gaps = 1/83 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQ CGSCWSF TTGA+EG +F + LV LS+Q LIDCS +GNNGC+GG ++++ Sbjct: 350 KDQSVCGSCWSFGTTGAVEGAYFMKYKKLVRLSQQALIDCSWGFGNNGCDGGEDFRSYQW 409 Query: 182 IKDNGGIDTEQTY-PYEGVDDKC 247 I +GG+ TE+ Y Y G D C Sbjct: 410 IIKHGGLPTEEEYGGYLGQDGYC 432 Score = 85.4 bits (202), Expect = 9e-16 Identities = 41/85 (48%), Positives = 53/85 (62%), Gaps = 2/85 (2%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTE--LDHGVLV 456 GFV++ + + A+ GP+SVAIDASH +F YS+GVY E C +TE LDH VL Sbjct: 445 GFVNVDTNNVDAMKLALFKHGPISVAIDASHKTFSFYSNGVYYEPACGNTENSLDHAVLA 504 Query: 457 VGYGNDEQGVEYWLLKNCWAARWAN 531 VGYG G +WL+KN W+ W N Sbjct: 505 VGYGT-INGKGFWLIKNSWSNYWGN 528 >UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis thaliana|Rep: Cysteine proteinase - Arabidopsis thaliana (Mouse-ear cress) Length = 348 Score = 94.3 bits (224), Expect = 2e-18 Identities = 43/82 (52%), Positives = 50/82 (60%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K QG+CG CW+FS A+EG G LVSLSEQ L+DC Y N GC GG+M AF+Y Sbjct: 144 KYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQLLDCDRDY-NQGCRGGIMSKAFEY 202 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 I N GI TE YPY+ C Sbjct: 203 IIKNQGITTEDNYPYQESQQTC 224 Score = 77.8 bits (183), Expect = 2e-13 Identities = 35/81 (43%), Positives = 53/81 (65%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVG 462 G+ +P +E+ L++AV+ PVSV I+ + +F+ YS GV+N EC T+L H V +VG Sbjct: 241 GYETVPMNNEEALLQAVSQQ-PVSVGIEGTGAAFRHYSGGVFN-GECG-TDLHHAVTIVG 297 Query: 463 YGNDEQGVEYWLLKNCWAARW 525 YG E+G +YW++KN W W Sbjct: 298 YGMSEEGTKYWVVKNSWGETW 318 >UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Rep: Cysteine protease - Solanum lycopersicum (Tomato) (Lycopersicon esculentum) Length = 345 Score = 93.9 bits (223), Expect = 3e-18 Identities = 43/92 (46%), Positives = 56/92 (60%) 
Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K QG+CG CW+FS G+LEG + +G L+ SEQ L+DC+ N GCNGG M NAF + Sbjct: 147 KHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT--NNYGCNGGFMTNAFDF 204 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIPRTPVLR 277 I +NGGI E Y Y G CR +T ++ Sbjct: 205 IIENGGISRESDYEYLGQQYTCRSQEKTAAVQ 236 Score = 69.7 bits (163), Expect = 5e-11 Identities = 35/77 (45%), Positives = 47/77 (61%) Frame = +1 Query: 295 IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGND 474 +PEG E L++AV T PVS+ I AS Q Y+ G Y + C+ ++H V +GYG D Sbjct: 243 VPEG-ETSLLQAV-TKQPVSIGIAASQ-DLQFYAGGTY-DGNCAD-RINHAVTAIGYGTD 297 Query: 475 EQGVEYWLLKNCWAARW 525 E+G +YWLLKN W W Sbjct: 298 EEGQKYWLLKNSWGTSW 314 >UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber officinale (Ginger) Length = 475 Score = 93.9 bits (223), Expect = 3e-18 Identities = 41/82 (50%), Positives = 58/82 (70%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG+CGSCW+F+ A+EG + +G L+SLSEQ L+DCS + N GC GG AF+Y Sbjct: 159 KNQGRCGSCWAFAAIAAVEGINQIVTGDLISLSEQQLVDCSTR--NYGCEGGWPYRAFQY 216 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 I +NGG+++E+ YPY G + C Sbjct: 217 IINNGGVNSEEHYPYTGTNGTC 238 Score = 81.4 bits (192), Expect = 1e-14 Identities = 39/80 (48%), Positives = 52/80 (65%) Frame = +1 Query: 292 DIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGN 471 ++P DE+ L +A A P+SV IDAS +FQLY SG++ C +T L+HGV VVGYG Sbjct: 255 NVPSNDEKSLQKAAANQ-PISVGIDASGRNFQLYHSGIFT-GSC-NTSLNHGVTVVGYGT 311 Query: 472 DEQGVEYWLLKNCWAARWAN 531 E G +YW++KN W W N Sbjct: 312 -ENGNDYWIVKNSWGENWGN 330 >UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster|Rep: CG5367-PA - Drosophila melanogaster (Fruit fly) Length = 338 Score = 93.9 bits (223), Expect = 3e-18 Identities = 39/96 (40%), Positives = 61/96 (63%) Frame = +2 Query: 5 DQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYI 184 +Q CGSC++FS ++ GQ F+++G 
++SLS+Q ++DCS +GN GC GG + N Y+ Sbjct: 144 NQLSCGSCYAFSIAESIMGQVFKRTGKILSLSKQQIVDCSVSHGNQGCVGGSLRNTLSYL 203 Query: 185 KDNGGIDTEQTYPYEGVDDKCRYIPRTPVLRTWASW 292 + GGI +Q YPY KC+++P V+ SW Sbjct: 204 QSTGGIMRDQDYPYVARKGKCQFVPDLSVVNV-TSW 238 Score = 83.0 bits (196), Expect = 5e-15 Identities = 34/77 (44%), Positives = 52/77 (67%) Frame = +1 Query: 295 IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGND 474 +P DEQ + AV +GPV+++I+AS +FQLYS G+Y++ CSS ++H ++V+G+G D Sbjct: 241 LPVRDEQAIQAAVTHIGPVAISINASPKTFQLYSDGIYDDPLCSSASVNHAMVVIGFGKD 300 Query: 475 EQGVEYWLLKNCWAARW 525 YW+LKN W W Sbjct: 301 -----YWILKNWWGQNW 312 >UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep: Cysteine proteinase - Cryptobia salmositica Length = 443 Score = 93.5 bits (222), Expect = 3e-18 Identities = 47/93 (50%), Positives = 59/93 (63%), Gaps = 5/93 (5%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGSCWSFSTTG +EGQH +G LV++SEQ L+ C ++GCNGGLMDNAF + Sbjct: 130 KNQGACGSCWSFSTTGNIEGQHAIATGQLVAVSEQELVSCDPI--DDGCNGGLMDNAFGW 187 Query: 182 I--KDNGGIDTEQTYPY---EGVDDKCRYIPRT 265 + G I TE YPY G+ C P + Sbjct: 188 LISAHKGQIATEANYPYVSGNGIVPACSSSPES 220 Score = 63.7 bits (148), Expect = 3e-09 Identities = 33/89 (37%), Positives = 50/89 (56%) Frame = +1 Query: 259 KNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTEL 438 K GA F DI +E + V GP+S+ +DAS ++Q Y+ G+ + C ++ Sbjct: 221 KPVGATISAFQDIARTEED-MAAFVFKHGPLSIGVDAS--TWQSYAGGIMSY--CPQDQI 275 Query: 439 DHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 DHGVL+VG+ +D YW++KN W A W Sbjct: 276 DHGVLIVGF-DDTASTPYWIIKNSWTANW 303 >UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryota|Rep: Cysteine proteinase GP-I - Zingiber officinale (Ginger) Length = 221 Score = 93.5 bits (222), Expect = 3e-18 Identities = 42/82 (51%), Positives = 57/82 (69%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGSCW+F A+EG + +G L+SLSEQ L+DCS + N+GC GG AF+Y Sbjct: 19 
KNQGGCGSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCSTR--NHGCEGGWPYRAFQY 76 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 I +NGGI++E+ YPY G + C Sbjct: 77 IINNGGINSEEHYPYTGTNGTC 98 Score = 54.8 bits (126), Expect = 1e-06 Identities = 29/78 (37%), Positives = 42/78 (53%) Frame = +1 Query: 292 DIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGN 471 ++P DE+ L +AVA PVSV +DA+ FQLY +G++ C + +H VG Sbjct: 114 NVPSNDEKSLQKAVANQ-PVSVTMDAAGRDFQLYRNGIFT-GSC-NISANH-YRTVGGRE 169 Query: 472 DEQGVEYWLLKNCWAARW 525 E +YW +KN W W Sbjct: 170 TENDKDYWTVKNSWGKNW 187 >UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2; Entamoeba|Rep: Cysteine proteinase ACP1 precursor - Entamoeba histolytica Length = 308 Score = 93.5 bits (222), Expect = 3e-18 Identities = 40/85 (47%), Positives = 54/85 (63%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG+CGSCW+F TT LEG+ + G L S SEQ L+DC +NGC GG N+ K+ Sbjct: 107 KDQGQCGSCWTFCTTAVLEGRVNKDLGKLYSFSEQQLVDCDA--SDNGCEGGHPSNSLKF 164 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYI 256 I++N G+ E YPY+ V C+ + Sbjct: 165 IQENNGLGLESDYPYKAVAGTCKKV 189 Score = 76.2 bits (179), Expect = 5e-13 Identities = 32/80 (40%), Positives = 50/80 (62%), Gaps = 1/80 (1%) Frame = +1 Query: 295 IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSG-VYNEDECSSTELDHGVLVVGYGN 471 + +G E L +A GPV+V +DAS SFQLY G +Y++ +C S ++H V VGYG+ Sbjct: 201 VTDGSETGLQTIIAENGPVAVGMDASRPSFQLYKKGTIYSDTKCRSRMMNHCVTAVGYGS 260 Query: 472 DEQGVEYWLLKNCWAARWAN 531 + G +YW+++N W W + Sbjct: 261 NSNG-KYWIIRNSWGTSWGD 279 >UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: Cysteine protease - Clonorchis sinensis Length = 328 Score = 93.1 bits (221), Expect = 4e-18 Identities = 41/81 (50%), Positives = 53/81 (65%) Frame = +2 Query: 5 DQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYI 184 DQGKCGSCW+FS G +EGQ FR++G L++LSEQ L+DC + GCNGG + I Sbjct: 132 DQGKCGSCWAFSVIGNVEGQWFRKTGDLLALSEQQLVDC--DHLEKGCNGGYPPKTYGEI 189 Query: 185 KDNGGIDTEQTYPYEGVDDKC 247 + GG++ YPY GVD C Sbjct: 190 
EKMGGLELASDYPYTGVDGIC 210 Score = 41.5 bits (93), Expect = 0.014 Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 2/67 (2%) Frame = +1 Query: 313 QKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDE--CSSTELDHGVLVVGYGNDEQGV 486 QKL E +GP+S A++A Q Y G+ C+ L+H VL VGYG E G+ Sbjct: 236 QKLKE----IGPLSSALNA--VLLQFYLGGIIFPIPFLCNPHGLNHAVLTVGYGT-EFGI 288 Query: 487 EYWLLKN 507 YW++KN Sbjct: 289 PYWIVKN 295 >UniRef50_P25773 Cluster: Cathepsin L; n=38; Eukaryota|Rep: Cathepsin L - Felis silvestris catus (Cat) Length = 139 Score = 93.1 bits (221), Expect = 4e-18 Identities = 43/93 (46%), Positives = 59/93 (63%), Gaps = 3/93 (3%) Frame = +1 Query: 256 PKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTE 435 P+N+ A + DIP E +LM +A VGP+S AIDAS +F+ Y G+Y + CSS + Sbjct: 36 PENSVANVTDYWDIPS-KENELMITLAAVGPISAAIDASLDTFRFYKEGIYYDPSCSSED 94 Query: 436 LDHGVLVVGYGND---EQGVEYWLLKNCWAARW 525 +DHGVLVVGYG D + +YW++KN W W Sbjct: 95 VDHGVLVVGYGADGTETENKKYWIIKNSWGTDW 127 Score = 64.9 bits (151), Expect = 1e-09 Identities = 25/54 (46%), Positives = 34/54 (62%) Frame = +2 Query: 152 GGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKCRYIPRTPVLRTWASWTSPRATN 313 GGL+D+AF+Y+KDNGG+D+E++YPY D C+Y P V W P N Sbjct: 1 GGLIDDAFQYVKDNGGLDSEESYPYHAQGDSCKYRPENSVANVTDYWDIPSKEN 54 >UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin l; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin l - Strongylocentrotus purpuratus Length = 489 Score = 92.7 bits (220), Expect = 6e-18 Identities = 38/84 (45%), Positives = 55/84 (65%), Gaps = 2/84 (2%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSST--ELDHGVLVV 459 + ++ G+++ L +A+AT GP++V IDA+ SF YS G Y + C +T +LDH VL V Sbjct: 379 YYNVTSGNQKDLKKALATKGPIAVGIDAAVPSFSFYSYGTYYDASCGNTVDDLDHAVLAV 438 Query: 460 GYGNDEQGVEYWLLKNCWAARWAN 531 GYG D G +YWL+KN W+ W N Sbjct: 439 GYGTDSSGQDYWLIKNSWSTHWGN 462 Score = 92.3 bits (219), Expect = 8e-18 Identities = 42/85 (49%), Positives = 54/85 (63%), Gaps = 1/85 (1%) Frame = +2 Query: 2 
KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQ CGSCWSF + +EG F QSG V LS+Q L+DC+ GNNGC+GG +++ Sbjct: 283 KDQAVCGSCWSFGSAETIEGAVFMQSGKRVRLSQQMLMDCTWAAGNNGCDGGEEWRVYEW 342 Query: 182 IKDNGGIDTEQTY-PYEGVDDKCRY 253 + NGGI E+TY PY G + C Y Sbjct: 343 LMKNGGIPLEETYGPYLGQNGMCHY 367 >UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative cysteine proteinase - Oryza sativa subsp. japonica (Rice) Length = 357 Score = 92.7 bits (220), Expect = 6e-18 Identities = 42/76 (55%), Positives = 50/76 (65%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+Q C SCW+FS A+EG H +S LV+LS Q L+DCS N+GCN G MD AF+Y Sbjct: 151 KNQKDCASCWAFSAVAAVEGIHQIRSHNLVALSTQQLLDCSTGRNNHGCNRGDMDEAFRY 210 Query: 182 IKDNGGIDTEQTYPYE 229 I NGGI E YPYE Sbjct: 211 ITSNGGIAAESDYPYE 226 Score = 77.0 bits (181), Expect = 3e-13 Identities = 39/91 (42%), Positives = 53/91 (58%), Gaps = 2/91 (2%) Frame = +1 Query: 259 KNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYN--EDECSST 432 K A GF +P +E L+ AVA PVSVA+D Q +SSGV+ ++E +T Sbjct: 238 KPVAASIRGFQYVPPNNETALLLAVAHQ-PVSVALDGVGKVSQFFSSGVFGAMQNETCTT 296 Query: 433 ELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 +L+H + VGYG DE G +YWL+KN W W Sbjct: 297 DLNHAMTAVGYGTDEHGTKYWLMKNSWGTDW 327 >UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteinase B; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like tick cysteine proteinase B - Haemaphysalis longicornis (Bush tick) Length = 332 Score = 92.7 bits (220), Expect = 6e-18 Identities = 45/89 (50%), Positives = 54/89 (60%), Gaps = 1/89 (1%) Frame = +1 Query: 268 GAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSF-QLYSSGVYNEDECSSTELDH 444 G G + P + TVGPVSVAIDA TS Q YS G+Y+E ECSS +LDH Sbjct: 220 GPPTAGTLTSPRETRRSCRRLWPTVGPVSVAIDAQPTSHSQFYSEGIYDEPECSSEQLDH 279 Query: 445 GVLVVGYGNDEQGVEYWLLKNCWAARWAN 531 GVLVVGYG + G +YWL+KN W W + Sbjct: 280 GVLVVGYGT-KDGKDYWLVKNSWGTTWGD 307 >UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to 
cysteine protease; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cysteine protease - Strongylocentrotus purpuratus Length = 494 Score = 92.3 bits (219), Expect = 8e-18 Identities = 41/84 (48%), Positives = 57/84 (67%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGSCW+FS G +EGQ + G L+SLSEQ L+DC + G GC GG M +A++ Sbjct: 256 KNQGMCGSCWAFSAIGNMEGQWQIKKGELISLSEQELVDCDKVDG--GCEGGEMSDAYEA 313 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 I GG +E+ YPY G ++KC++ Sbjct: 314 IIKLGGAMSEEKYPYRGENEKCKF 337 Score = 43.2 bits (97), Expect = 0.005 Identities = 27/84 (32%), Positives = 46/84 (54%), Gaps = 2/84 (2%) Frame = +1 Query: 220 PLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSS 399 P RG + + + + + G+V+I + +E ++ +A GP+S+ I+A Q Y Sbjct: 327 PYRGENEKCKFNMTDVRVKINGYVNISK-NETEMAGWLAAHGPISIGINA--LMMQFYFG 383 Query: 400 GVYNEDE--CSSTELDHGVLVVGY 465 G+ + + CS LDHGVL+VGY Sbjct: 384 GIAHPWKIFCSPDSLDHGVLIVGY 407 >UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: Cathepsin L - Kudoa thyrsites Length = 300 Score = 92.3 bits (219), Expect = 8e-18 Identities = 44/90 (48%), Positives = 58/90 (64%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGSCWSFS GA+E + ++G LV+ SEQ L+DCS + N+GCNGGL + AF Y Sbjct: 118 KNQGHCGSCWSFSAAGAIESAYAIKTGELVNFSEQQLVDCSTE--NHGCNGGLPEIAFLY 175 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIPRTPV 271 + +N GI + YPY C+Y P V Sbjct: 176 VINN-GIMKLKDYPYTAKQGTCQYSPEDVV 204 Score = 74.5 bits (175), Expect = 2e-12 Identities = 34/75 (45%), Positives = 47/75 (62%) Frame = +1 Query: 301 EGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQ 480 E +E+ +ME+VA GP S+ I+A+ SFQ Y G+Y++ SS LDH VL+VGYG + Sbjct: 213 ENNEESVMESVANNGPNSIGINAASRSFQFYGGGIYSDPWASSYPLDHAVLLVGYGY-KN 271 Query: 481 GVEYWLLKNCWAARW 525 YW +KN W W Sbjct: 272 TENYWHVKNSWGPWW 286 >UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 precursor; n=2; Arabidopsis thaliana|Rep: Probable 
cysteine proteinase At3g43960 precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 376 Score = 92.3 bits (219), Expect = 8e-18 Identities = 43/79 (54%), Positives = 55/79 (69%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K QG+CGSCW+F+ TGA+EG + +G LVSLSEQ LIDC N GC GG AF++ Sbjct: 144 KRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEF 203 Query: 182 IKDNGGIDTEQTYPYEGVD 238 IK+NGGI +++ Y Y G D Sbjct: 204 IKENGGIVSDEVYGYTGED 222 Score = 66.1 bits (154), Expect = 6e-10 Identities = 34/77 (44%), Positives = 45/77 (58%) Frame = +1 Query: 295 IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGND 474 +P DE L +AVA P+SV I A++ S Y SGVY + CS+ DH VL+VGYG Sbjct: 245 VPVNDEMSLKKAVA-YQPISVMISAANMSD--YKSGVY-KGACSNLWGDHNVLIVGYGTS 300 Query: 475 EQGVEYWLLKNCWAARW 525 +YWL++N W W Sbjct: 301 SDEGDYWLIRNSWGPEW 317 >UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativa|Rep: Os01g0347600 protein - Oryza sativa subsp. japonica (Rice) Length = 343 Score = 91.9 bits (218), Expect = 1e-17 Identities = 41/83 (49%), Positives = 50/83 (60%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG CGSCW+F+ A+EG ++G L LSEQ L+DC +NGC GG D AF+ Sbjct: 141 KDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDT--NSNGCGGGHTDRAFEL 198 Query: 182 IKDNGGIDTEQTYPYEGVDDKCR 250 + GGI E Y YEG KCR Sbjct: 199 VASKGGITAESDYRYEGFQGKCR 221 Score = 65.7 bits (153), Expect = 8e-10 Identities = 37/89 (41%), Positives = 50/89 (56%), Gaps = 1/89 (1%) Frame = +1 Query: 262 NTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELD 441 N A G+ +P DE++L AVA PV+V IDAS +FQ Y SGV+ C ++ + Sbjct: 228 NHAARIGGYRAVPPNDERQLATAVARQ-PVTVYIDASGPAFQFYKSGVF-PGPCGASS-N 284 Query: 442 HGVLVVGYGND-EQGVEYWLLKNCWAARW 525 H V +VGY D G +YW+ KN W W Sbjct: 285 HAVTLVGYCQDGASGKKYWVAKNSWGKTW 313 >UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: Cysteine proteinase - Paragonimus westermani Length = 272 Score = 91.9 bits (218), Expect = 
1e-17 Identities = 39/82 (47%), Positives = 57/82 (69%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 ++QG CGSCW+FST G +EGQ F ++G LVSLS+Q L+DC +GCNGG +++ Sbjct: 70 ENQGSCGSCWAFSTAGNVEGQWFIKTGQLVSLSKQQLVDCDR--AADGCNGGWPASSYLE 127 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 I GG++++ YPY GV ++C Sbjct: 128 IMHMGGLESQDDYPYAGVKEQC 149 Score = 40.7 bits (91), Expect = 0.025 Identities = 22/60 (36%), Positives = 35/60 (58%), Gaps = 2/60 (3%) Frame = +1 Query: 331 VATVGPVSVAIDASHTSFQLYSSGVYNED--ECSSTELDHGVLVVGYGNDEQGVEYWLLK 504 +A GP+S ++A + Q Y SG+ + CS +L+H VL VGY + E + YW++K Sbjct: 177 LAEHGPLSTLLNA--ITLQYYQSGIIHPSYXXCSPVDLNHAVLTVGY-DKEGDMPYWIIK 233 >UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 336 Score = 91.5 bits (217), Expect = 1e-17 Identities = 42/86 (48%), Positives = 56/86 (65%), Gaps = 3/86 (3%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC---SEQYGNNGCNGGLMDNA 172 KDQG+CGSCW+FSTTG LE +F ++ +S SEQ L+DC S + + GC+GG + A Sbjct: 141 KDQGQCGSCWAFSTTGILEALYFMENRQKISFSEQQLVDCATNSNGFNSYGCSGGWPEEA 200 Query: 173 FKYIKDNGGIDTEQTYPYEGVDDKCR 250 KY+ GI E+ YPY VD KC+ Sbjct: 201 LKYVA-KFGILKEEQYPYLAVDSKCK 225 Score = 52.0 bits (119), Expect = 1e-05 Identities = 32/72 (44%), Positives = 45/72 (62%), Gaps = 3/72 (4%) Frame = +1 Query: 319 LMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTE---LDHGVLVVGYGNDEQGVE 489 L VA + PVSV +DAS ++ YSSGVYN C +T+ L+H V+ +GY DEQG Sbjct: 249 LKNTVARI-PVSVLVDAS--TWGSYSSGVYN--GCGNTQTYNLNHAVVAIGY--DEQG-- 299 Query: 490 YWLLKNCWAARW 525 W+++N W+ W Sbjct: 300 NWIIRNSWSTSW 311 >UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gambiae str. PEST|Rep: ENSANGP00000018713 - Anopheles gambiae str. 
PEST Length = 559 Score = 91.5 bits (217), Expect = 1e-17 Identities = 43/88 (48%), Positives = 56/88 (63%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGSCW+FS G +EG H ++ L S SEQ LIDC + +NGC GG MD+AFK Sbjct: 355 KNQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCDKV--DNGCGGGYMDDAFKA 412 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIPRT 265 I+ GG++ E YPYE K + R+ Sbjct: 413 IEQLGGLELENDYPYEAKAQKSCHFNRS 440 Score = 57.2 bits (132), Expect = 3e-07 Identities = 29/88 (32%), Positives = 51/88 (57%), Gaps = 7/88 (7%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYN--EDECSSTELDHGVLV 456 G VD+P+ +E + + + GP+++ ++A+ + Q Y G+ + C+ +DHGVL+ Sbjct: 448 GAVDMPK-NETYIAKYLIKNGPIAIGLNAN--AMQFYRGGISHPWHPLCNHKSIDHGVLI 504 Query: 457 VGYGNDE-----QGVEYWLLKNCWAARW 525 VGYG E + + YW++KN W RW Sbjct: 505 VGYGIKEYPMFNKTLPYWIIKNSWGPRW 532 >UniRef50_Q23TW3 Cluster: Papain family cysteine protease containing protein; n=5; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 437 Score = 91.5 bits (217), Expect = 1e-17 Identities = 41/99 (41%), Positives = 58/99 (58%), Gaps = 2/99 (2%) Frame = +2 Query: 2 KDQGK-CGSCWSFSTTGALEGQHFRQSGYL-VSLSEQNLIDCSEQYGNNGCNGGLMDNAF 175 K QGK CGSCW+F+ ALE + ++G + SEQ L+DC+ ++ GC+GGL F Sbjct: 221 KSQGKDCGSCWAFAAVAALESHYALKTGKKPIQFSEQQLVDCARKFDTKGCSGGLPSKGF 280 Query: 176 KYIKDNGGIDTEQTYPYEGVDDKCRYIPRTPVLRTWASW 292 +Y+ GGI E YPYEG D CR+ V++ S+ Sbjct: 281 EYLAYAGGIQNEADYPYEGEDKNCRFNSSKTVVQVQKSY 319 Score = 54.8 bits (126), Expect = 1e-06 Identities = 32/112 (28%), Positives = 54/112 (48%), Gaps = 2/112 (1%) Frame = +1 Query: 196 GHRHRADLPLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASH 375 G ++ AD P G + + + T + +I DE +L+ +A GPV++A + Sbjct: 288 GIQNEADYPYEGEDKNCRFNSSKTVVQVQKSYNITFQDENELIYHLANYGPVTIAYQVN- 346 Query: 376 TSFQLYSSGVYNEDECSS--TELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 + F Y +GV+ CS +++H VL VGY +Y++ KN W W Sbjct: 347 
SDFDNYKNGVFTSSNCSKDPEDVNHAVLAVGY---NMTGKYFIAKNSWGNDW 395 >UniRef50_Q22NW9 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 894 Score = 91.5 bits (217), Expect = 1e-17 Identities = 44/84 (52%), Positives = 57/84 (67%), Gaps = 1/84 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGS ++FSTTGALEG H SEQ +IDCS + GN+GC+GG M+NAF + Sbjct: 699 KNQGSCGSGYAFSTTGALEGIHKISGKDWKGFSEQQIIDCSRKQGNSGCHGGFMENAFDF 758 Query: 182 IKDNGGIDTEQTYPYEG-VDDKCR 250 + +N GI E YPYEG + KC+ Sbjct: 759 VIEN-GILQENDYPYEGHANFKCK 781 Score = 52.8 bits (121), Expect = 6e-06 Identities = 34/81 (41%), Positives = 46/81 (56%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVG 462 G+ +I + D + L +AVA PVSVAID Q Y SG+ D SS L+HGVL+VG Sbjct: 794 GYYNINKYDCRGLQQAVAQQ-PVSVAIDGKF--LQRYHSGIIG-DCGSSVNLNHGVLIVG 849 Query: 463 YGNDEQGVEYWLLKNCWAARW 525 Y D ++++KN W W Sbjct: 850 YTED-----FFIVKNSWGTNW 865 >UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_79, whole genome shotgun sequence - Paramecium tetraurelia Length = 324 Score = 91.5 bits (217), Expect = 1e-17 Identities = 42/84 (50%), Positives = 54/84 (64%), Gaps = 1/84 (1%) Frame = +2 Query: 2 KDQGK-CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFK 178 KDQG CGS W+FS G LE + G +LSEQ+++DCS YGN GC+GG MD+ F+ Sbjct: 134 KDQGSSCGSSWAFSAVGVLEINSNIEFGLETTLSEQDMLDCSGPYGNQGCSGGWMDSGFE 193 Query: 179 YIKDNGGIDTEQTYPYEGVDDKCR 250 Y++D+ GI YPY G D CR Sbjct: 194 YVRDH-GIANGSVYPYVGSDQTCR 216 >UniRef50_Q23FS6 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 330 Score = 91.1 bits (216), Expect = 2e-17 
Identities = 48/107 (44%), Positives = 65/107 (60%), Gaps = 3/107 (2%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGY---LVSLSEQNLIDCSEQYGNNGCNGGLMDNA 172 KDQG+CGSCW+FSTTG++E +GY + LSEQ L+DCS N GC GG MDNA Sbjct: 133 KDQGQCGSCWAFSTTGSVESA-LIIAGYANQTIDLSEQQLVDCSAT--NYGCGGGWMDNA 189 Query: 173 FKYIKDNGGIDTEQTYPYEGVDDKCRYIPRTPVLRTWASWTSPRATN 313 F+YI+++ + T YPY VD C VL + +++T + N Sbjct: 190 FEYIEES-PLTTNSNYPYVAVDQACNSTEIYGVLYSLSNYTDVESGN 235 Score = 52.4 bits (120), Expect = 8e-06 Identities = 28/80 (35%), Positives = 48/80 (60%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGY 465 + D+ G+ +L + + P+S+A+DAS+ + LY+SG+++ C L+HGVL+VG+ Sbjct: 228 YTDVESGNTVQLKQYLQQQ-PLSIAVDASY--WYLYNSGIFSN--CGQN-LNHGVLLVGF 281 Query: 466 GNDEQGVEYWLLKNCWAARW 525 + E WL+KN W W Sbjct: 282 NSTEGS---WLVKNSWGTSW 298 >UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 21 SCAF14577, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 478 Score = 90.6 bits (215), Expect = 2e-17 Identities = 42/79 (53%), Positives = 56/79 (70%), Gaps = 1/79 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQ CGSCWSF+TTG +EG F ++G L LS+Q LIDCS +GNN C+GG A+++ Sbjct: 221 KDQAICGSCWSFATTGTIEGALFLKTGSLQVLSQQMLIDCSWGFGNNACDGGEEWRAYEW 280 Query: 182 IKDNGGIDTEQTY-PYEGV 235 I +GGI + +TY PY G+ Sbjct: 281 IMKHGGIASAETYGPYLGM 299 Score = 89.8 bits (213), Expect = 4e-17 Identities = 45/96 (46%), Positives = 57/96 (59%), Gaps = 2/96 (2%) Frame = +1 Query: 250 VHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSS 429 V+ A+ + ++ GD L A+ GPV+V+IDASH SF YS+GVY E C S Sbjct: 359 VNSSELTAQIQSYTNVTSGDALALKLALFKNGPVAVSIDASHRSFVFYSNGVYYEPACGS 418 Query: 430 T--ELDHGVLVVGYGNDEQGVEYWLLKNCWAARWAN 531 T +LDH VL VGYGN G YWL+KN W+ W N Sbjct: 419 TVEDLDHAVLAVGYGN-LNGEPYWLIKNSWSTYWGN 453 Score = 57.6 bits (133), Expect = 2e-07 Identities = 28/64 (43%), Positives = 
41/64 (64%), Gaps = 1/64 (1%) Frame = +2 Query: 59 GQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTY-PYEGV 235 G + +G L LS+Q LIDCS +GNN C+GG A+++I +GGI + +TY PY G+ Sbjct: 294 GPYLGMTGSLQVLSQQMLIDCSWGFGNNACDGGEEWRAYEWIMKHGGIASAETYGPYLGM 353 Query: 236 DDKC 247 + C Sbjct: 354 NGFC 357 >UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2); n=2; Tribolium castaneum|Rep: PREDICTED: similar to Cathepsin K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2) - Tribolium castaneum Length = 332 Score = 89.8 bits (213), Expect = 4e-17 Identities = 42/98 (42%), Positives = 62/98 (63%), Gaps = 2/98 (2%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG+CGSCW+F+T GA+E + + +SLSEQ L+DC + G GC GG + A+ Y Sbjct: 134 KNQGQCGSCWAFATIGAIESHYKIRHKRAISLSEQQLVDCVGRGG--GCGGGWIPTAYSY 191 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIPRTP--VLRTWAS 289 I N G++ + YPY G + KCRY P +R++A+ Sbjct: 192 IARNKGVNYNRDYPYLGRNGKCRYRSSKPHIAIRSYAA 229 Score = 81.0 bits (191), Expect = 2e-14 Identities = 37/73 (50%), Positives = 48/73 (65%) Frame = +1 Query: 307 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGV 486 +E+++ VAT GPVSVAI +F Y SGVYN C L+H V++VGYG E+GV Sbjct: 235 NEERVRRLVATKGPVSVAIHVDSRTFHKYKSGVYNNPSCRG-GLNHAVVIVGYGR-ERGV 292 Query: 487 EYWLLKNCWAARW 525 +YWL+KN W A W Sbjct: 293 DYWLVKNSWGAGW 305 >UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 383 Score = 89.8 bits (213), Expect = 4e-17 Identities = 39/83 (46%), Positives = 60/83 (72%), Gaps = 1/83 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG+CGSCW+F+T ++E Q+ + G LVSLSEQ ++DC + NNGC+GG A K+ Sbjct: 184 KNQGQCGSCWAFATVASVEAQNAIKKGKLVSLSEQEMVDCDGR--NNGCSGGYRPYAMKF 241 Query: 182 IKDNGGIDTEQTYPYEGV-DDKC 247 +K+N G+++E+ YPY + D+C Sbjct: 242 VKEN-GLESEKEYPYSALKHDQC 263 Score = 49.2 bits 
(112), Expect = 7e-05 Identities = 21/76 (27%), Positives = 41/76 (53%), Gaps = 3/76 (3%) Frame = +1 Query: 307 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNE--DECSSTELD-HGVLVVGYGNDE 477 +E+ + V T GPV+ ++ + Y SG++N ++C+ + H + ++GYG + Sbjct: 283 NEEDIANWVGTKGPVTFGMNVVKAMYS-YRSGIFNPSVEDCTEKSMGAHALTIIGYGGEG 341 Query: 478 QGVEYWLLKNCWAARW 525 + YW++KN W W Sbjct: 342 ESA-YWIVKNSWGTSW 356 >UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin S preproprotein; n=2; Tribolium castaneum|Rep: PREDICTED: similar to cathepsin S preproprotein - Tribolium castaneum Length = 525 Score = 89.0 bits (211), Expect = 7e-17 Identities = 39/91 (42%), Positives = 54/91 (59%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K QGKCGSCW+F+ GA E + +Q G V LSEQ L+DC + G C G +D ++Y Sbjct: 51 KRQGKCGSCWAFAILGATEAHYRKQRGSFVILSEQQLVDCVREVGT--CKGVWLDEVYEY 108 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIPRTPVL 274 I ++ GI+ +Q Y YE CR+ P P + Sbjct: 109 IINSNGINYDQDYRYESAPGSCRFKPNKPTV 139 Score = 76.2 bits (179), Expect = 5e-13 Identities = 34/84 (40%), Positives = 48/84 (57%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K QGKCG+CW+F+ GA E Q+ G V LSEQ L+DC + + C G + +KY Sbjct: 327 KHQGKCGTCWAFAIIGATEAQYRIHRGSFVILSEQQLVDCVREV--SSCRGVYLHETYKY 384 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 I + GI+ +Q Y Y+ CR+ Sbjct: 385 IVKSEGINYDQDYRYQSAPGTCRF 408 Score = 59.3 bits (137), Expect = 7e-08 Identities = 28/72 (38%), Positives = 42/72 (58%) Frame = +1 Query: 310 EQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGVE 489 E+ L VA VGPV+V+ D F+ YS GV+ C+ + H ++VGYG E G + Sbjct: 428 EEDLQWIVANVGPVTVSFDGRGKQFKSYSGGVFYNKTCTRMK-THVAVLVGYGT-ENGED 485 Query: 490 YWLLKNCWAARW 525 +WL+KN + +W Sbjct: 486 FWLVKNSYGPQW 497 Score = 44.4 bits (100), Expect = 0.002 Identities = 22/57 (38%), Positives = 32/57 (56%) Frame = +1 Query: 295 IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGY 465 + E E+ L VA +GP +V+ DA + + YS G+Y C+ T L H +VVGY Sbjct: 147 
LAEISEEDLQWIVAKIGPATVSFDARGSQLKSYSGGIYYNRTCTKT-LTHVAVVVGY 202 >UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa|Rep: Os09g0497500 protein - Oryza sativa subsp. japonica (Rice) Length = 349 Score = 89.0 bits (211), Expect = 7e-17 Identities = 39/83 (46%), Positives = 55/83 (66%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGSCW+FS A+EG + ++G LVSLSEQ L+DC ++ GC GG M AF++ Sbjct: 138 KNQGDCGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDE--AVGCGGGYMSWAFEF 195 Query: 182 IKDNGGIDTEQTYPYEGVDDKCR 250 + N G+ TE +YPY + C+ Sbjct: 196 VVGNHGLTTEASYPYHAANGACQ 218 Score = 67.7 bits (158), Expect = 2e-10 Identities = 44/126 (34%), Positives = 59/126 (46%), Gaps = 11/126 (8%) Frame = +1 Query: 187 GQRGHRHRADLPLRGS*RQVQVHPKNTGAEDV-GFVDIPEGDEQKLMEAVATVGPVSVAI 363 G G A P + Q N A + G+ ++ E L A A PVSVA+ Sbjct: 198 GNHGLTTEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQ-PVSVAV 256 Query: 364 DASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQ----------GVEYWLLKNCW 513 D FQLY SGVY C++ +++HGV VVGYG E G +YW++KN W Sbjct: 257 DGGSFMFQLYGSGVYT-GPCTA-DVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSW 314 Query: 514 AARWAN 531 A W + Sbjct: 315 GAEWGD 320 >UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba culbertsoni|Rep: Cysteine proteinase - Acanthamoeba culbertsoni Length = 482 Score = 89.0 bits (211), Expect = 7e-17 Identities = 39/87 (44%), Positives = 56/87 (64%), Gaps = 1/87 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG C SCW+F TGA+EG G LVSLS+Q L+DC+ GN GC+GG ++ +++ Sbjct: 172 KNQGSCASCWAFVATGAVEGVRKIAGGSLVSLSDQMLLDCAVGTGNQGCSGGNVEITYRW 231 Query: 182 -IKDNGGIDTEQTYPYEGVDDKCRYIP 259 I +N + T+ +YPY CRY+P Sbjct: 232 MISNNARLMTQASYPYIARQSTCRYVP 258 Score = 83.4 bits (197), Expect = 4e-15 Identities = 38/76 (50%), Positives = 47/76 (61%) Frame = +1 Query: 304 GDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQG 483 G E L+ A A + PV+VAID S SF YS G Y + CSST L+H VLVVG+G D Q Sbjct: 274 
GSESDLL-AKAAIAPVTVAIDGSKRSFMFYSGGYYYDPTCSSTNLNHAVLVVGWGTDPQR 332 Query: 484 VEYWLLKNCWAARWAN 531 +YW+ KN W W + Sbjct: 333 GDYWIAKNEWGTAWGD 348 >UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa zeasingle nucleocapsid nuclear polyhedrosis virus) Length = 367 Score = 89.0 bits (211), Expect = 7e-17 Identities = 40/82 (48%), Positives = 53/82 (64%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG CGSCW+F G +E Q+ + L+ LSEQ L+DC E + GCNGGLM AF+ Sbjct: 172 KDQGVCGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCDEV--DLGCNGGLMHLAFQE 229 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 + GG++TE YPY+G + C Sbjct: 230 LLLMGGVETEADYPYQGSEQMC 251 Score = 64.5 bits (150), Expect = 2e-09 Identities = 37/110 (33%), Positives = 54/110 (49%) Frame = +1 Query: 196 GHRHRADLPLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASH 375 G AD P +GS + + + + DE KL E V T GPV++A+DA Sbjct: 235 GVETEADYPYQGSEQMCTLDNRKIAVKLNSCFKYDIRDENKLKELVYTTGPVAIAVDA-- 292 Query: 376 TSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 Y G+ N +C +L+H VL++G+G E V YW++KN W W Sbjct: 293 MDIINYRRGILN--QCHIYDLNHAVLLIGWG-IENNVPYWIIKNSWGEDW 339 >UniRef50_Q0MYX5 Cluster: Putative cysteine protease; n=1; Emiliania huxleyi|Rep: Putative cysteine protease - Emiliania huxleyi Length = 276 Score = 88.6 bits (210), Expect = 9e-17 Identities = 44/79 (55%), Positives = 51/79 (64%), Gaps = 1/79 (1%) Frame = +1 Query: 292 DIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGN 471 D+P GDE L AVA PVSVAI+A ++FQLY SGV + C ELDHGVLVVGYG Sbjct: 47 DVPSGDEDALRAAVAKQ-PVSVAIEADKSAFQLYQSGVIDSASCGK-ELDHGVLVVGYGT 104 Query: 472 D-EQGVEYWLLKNCWAARW 525 D G +YW +KN W W Sbjct: 105 DTATGKDYWKIKNSWGGTW 123 >UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 339 Score = 88.6 bits (210), Expect = 9e-17 
Identities = 42/78 (53%), Positives = 53/78 (67%), Gaps = 1/78 (1%) Frame = +2 Query: 2 KDQGKC-GSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFK 178 K+QG C G+ +SFS G +E HF ++ L++LSEQN+IDC+ GNNGC GGL AF Sbjct: 130 KNQGLCSGAGYSFSAIGVIESSHFIKNKELITLSEQNIIDCTTDMGNNGCMGGLALIAFD 189 Query: 179 YIKDNGGIDTEQTYPYEG 232 YI GID+E YPYEG Sbjct: 190 YIIKQKGIDSEFNYPYEG 207 Score = 79.8 bits (188), Expect = 4e-14 Identities = 37/81 (45%), Positives = 55/81 (67%), Gaps = 1/81 (1%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGY 465 +++I +E +L +++ PVSV IDAS SF LY SGVY + CSST L+HG+L +G+ Sbjct: 233 YIEIERFNENELTQSLIK-SPVSVMIDASQLSFMLYKSGVYKDPSCSSTILNHGILNIGF 291 Query: 466 G-NDEQGVEYWLLKNCWAARW 525 G E G EY++LKN + ++W Sbjct: 292 GVTPENGNEYYILKNSFGSKW 312 >UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dvir_CG5367 - Drosophila virilis (Fruit fly) Length = 298 Score = 88.2 bits (209), Expect = 1e-16 Identities = 36/96 (37%), Positives = 59/96 (61%) Frame = +2 Query: 5 DQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYI 184 +Q CGSC++FS ++EGQ F+++G +V+LSEQ ++DCS +GN GC GG + N +Y+ Sbjct: 104 NQQSCGSCYAFSIAQSIEGQVFKRTGKIVALSEQQIVDCSVSHGNQGCIGGSLRNTLRYL 163 Query: 185 KDNGGIDTEQTYPYEGVDDKCRYIPRTPVLRTWASW 292 + GG+ Y Y +C+++ V+ SW Sbjct: 164 QATGGLMRSLDYKYASKKGECQFVSELAVVNV-TSW 198 Score = 77.4 bits (182), Expect = 2e-13 Identities = 32/77 (41%), Positives = 52/77 (67%) Frame = +1 Query: 295 IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGND 474 +P DE + AVA +GPV+V+I+AS +FQLYS G+Y++ C+ST ++H +L++G+ + Sbjct: 201 LPAKDENAIQAAVAHIGPVAVSINASPKTFQLYSEGIYDDVSCTSTSVNHAMLLIGFDKN 260 Query: 475 EQGVEYWLLKNCWAARW 525 +W+LKN W W Sbjct: 261 -----FWILKNWWGELW 272 >UniRef50_UPI0000E492F4 Cluster: PREDICTED: similar to cathepsin L-like proteinase; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin L-like proteinase - Strongylocentrotus purpuratus Length = 329 Score = 87.8 bits (208), Expect = 2e-16 Identities = 
44/91 (48%), Positives = 56/91 (61%) Frame = +1 Query: 259 KNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTEL 438 K + +VG + +G+E L EAV PV VAIDAS SFQLY SGVY++ CSST L Sbjct: 215 KAVASSNVG-KSVTQGNESALAEAVYFT-PVVVAIDASQPSFQLYVSGVYSDPNCSSTLL 272 Query: 439 DHGVLVVGYGNDEQGVEYWLLKNCWAARWAN 531 D +L+VGYG G EYW+ +N W W + Sbjct: 273 DLSLLLVGYGVSSVGTEYWICRNTWGEEWGD 303 >UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia circumcincta|Rep: Secreted cathepsin F - Teladorsagia circumcincta Length = 364 Score = 87.8 bits (208), Expect = 2e-16 Identities = 40/86 (46%), Positives = 53/86 (61%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K +G C +CW+FS TG +EGQ F LVSLS Q L+DC + GCNGG +A+K Sbjct: 169 KTEGHCAACWAFSVTGNIEGQWFLAKKKLVSLSAQQLLDC--DVVDEGCNGGFPLDAYKE 226 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIP 259 I GG++ E YPYE ++CR +P Sbjct: 227 IVRMGGLEPEDKYPYEAKAEQCRLVP 252 Score = 64.5 bits (150), Expect = 2e-09 Identities = 32/102 (31%), Positives = 49/102 (48%) Frame = +1 Query: 220 PLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSS 399 P Q ++ P + G V++P DE+K+ + GP+S+ I Q Y Sbjct: 240 PYEAKAEQCRLVPSDIAVYINGSVELPH-DEEKMRAWLVKKGPISIGITVD--DIQFYKG 296 Query: 400 GVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 GV C + + HG L+VGYG E+ + YW++KN W W Sbjct: 297 GVSRPTTCRLSSMIHGALLVGYG-VEKNIPYWIIKNSWGPNW 337 >UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_184, whole genome shotgun sequence - Paramecium tetraurelia Length = 331 Score = 87.8 bits (208), Expect = 2e-16 Identities = 39/85 (45%), Positives = 53/85 (62%), Gaps = 3/85 (3%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQ---YGNNGCNGGLMDNA 172 K+QG CGSCW+F+ A+E V++SEQ +DC+ + Y + GCNGG MD+A Sbjct: 131 KNQGSCGSCWAFAAAAAIEAGFQHHKKNKVNISEQEFVDCTTEKLGYESQGCNGGWMDDA 190 Query: 173 FKYIKDNGGIDTEQTYPYEGVDDKC 247 F Y N G+ TE+ YPY+GVD C Sbjct: 191 
FDYTV-NYGVTTEEEYPYKGVDQPC 214 Score = 61.7 bits (143), Expect = 1e-08 Identities = 36/82 (43%), Positives = 45/82 (54%), Gaps = 2/82 (2%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSST--ELDHGVLVV 459 FVD+ L EA+A PV+VAI A FQLYS GVY+ + T +L+HGVL V Sbjct: 227 FVDVEPLSSDALHEAIAKT-PVAVAIKADGILFQLYSGGVYSRSCTAKTIDDLNHGVLAV 285 Query: 460 GYGNDEQGVEYWLLKNCWAARW 525 GY D + +KN W A W Sbjct: 286 GYAKDS-----YTIKNSWGASW 302 >UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine protease; n=1; Maconellicoccus hirsutus|Rep: Putative cathepsin L-like cysteine protease - Maconellicoccus hirsutus (hibiscus mealybug) Length = 339 Score = 87.4 bits (207), Expect = 2e-16 Identities = 39/83 (46%), Positives = 55/83 (66%), Gaps = 2/83 (2%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSS--TELDHGVLV 456 G + +P+G E L E+VA GPV+ IDA+H SF Y G+Y E +C + E++HGVLV Sbjct: 231 GEITLPDGYETNLHESVAVYGPVAATIDATHQSFHSYKGGIYFEPDCGNKKDEVNHGVLV 290 Query: 457 VGYGNDEQGVEYWLLKNCWAARW 525 VGYG+ E G +YW++KN + W Sbjct: 291 VGYGS-ENGQDYWIVKNSYGTDW 312 Score = 79.4 bits (187), Expect = 6e-14 Identities = 37/98 (37%), Positives = 53/98 (54%) Frame = +2 Query: 8 QGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIK 187 Q C S +++S GALEGQ +S QN+IDCSE GN GC+GG +++ YI Sbjct: 139 QVNCSSGYAWSAIGALEGQLASDKKKFQGISVQNVIDCSESTGNKGCSGGNQHHSYFYIY 198 Query: 188 DNGGIDTEQTYPYEGVDDKCRYIPRTPVLRTWASWTSP 301 GG+D + +YPY+ ++ C + V R T P Sbjct: 199 KQGGVDDDVSYPYKDAEEPCAFKKENVVTRVSGEITLP 236 >UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovirus|Rep: Viral cathepsin - Spodoptera litura multicapsid nucleopolyhedrovirus (SpltMNPV) Length = 337 Score = 87.4 bits (207), Expect = 2e-16 Identities = 40/92 (43%), Positives = 56/92 (60%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGSCW+F+ G +E Q+ L+ LSEQ L+DC + GC+GGLM AF+ Sbjct: 142 KEQGVCGSCWAFAAIGNIESQYAIMHDSLIDLSEQQLLDCDRV--DQGCDGGLMHLAFQE 199 Query: 182 
IKDNGGIDTEQTYPYEGVDDKCRYIPRTPVLR 277 I GG++ E YPY+G++ CR P +R Sbjct: 200 IIRIGGVEHEIDYPYQGIEYACRLAPSKLAVR 231 Score = 60.5 bits (140), Expect = 3e-08 Identities = 36/110 (32%), Positives = 50/110 (45%) Frame = +1 Query: 196 GHRHRADLPLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASH 375 G H D P +G ++ P DE+KL+E + GP++VAID Sbjct: 205 GVEHEIDYPYQGIEYACRLAPSKLAVRLSHCYQYDLRDERKLLELLYKNGPIAVAIDC-- 262 Query: 376 TSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 Y SG+ C+ L+H VL+VGYG E YW+ KN W + W Sbjct: 263 VDIIDYRSGI--ATVCNDNGLNHAVLLVGYG-IENDTPYWIFKNSWGSNW 309 >UniRef50_UPI0000E224BB Cluster: PREDICTED: hypothetical protein; n=1; Pan troglodytes|Rep: PREDICTED: hypothetical protein - Pan troglodytes Length = 143 Score = 87.0 bits (206), Expect = 3e-16 Identities = 39/72 (54%), Positives = 46/72 (63%), Gaps = 3/72 (4%) Frame = +1 Query: 319 LMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGY---GNDEQGVE 489 L +AVATVGP+SVA+ ASH SFQ Y G+Y E C LDH +LVVGY G D + Sbjct: 45 LAKAVATVGPISVAVGASHVSFQFYKKGIYFEPRCDPEGLDHAMLVVGYSYEGADSDNNK 104 Query: 490 YWLLKNCWAARW 525 YWL+KN W W Sbjct: 105 YWLVKNSWGKNW 116 >UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza sativa|Rep: Putative cysteine protease - Oryza sativa subsp. 
japonica (Rice) Length = 357 Score = 87.0 bits (206), Expect = 3e-16 Identities = 40/84 (47%), Positives = 52/84 (61%), Gaps = 1/84 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYG-NNGCNGGLMDNAFK 178 KDQG CGS W+F+ A+EG ++G L LSEQ L+DC + G ++GC GG D AF+ Sbjct: 149 KDQGACGSSWAFAAVAAMEGLMKIRTGQLTPLSEQELVDCVDGGGDSDGCGGGHTDAAFQ 208 Query: 179 YIKDNGGIDTEQTYPYEGVDDKCR 250 + D GGI E Y YEG +CR Sbjct: 209 LVVDKGGITAESEYRYEGYKGRCR 232 Score = 62.5 bits (145), Expect = 7e-09 Identities = 34/90 (37%), Positives = 49/90 (54%), Gaps = 2/90 (2%) Frame = +1 Query: 262 NTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVY-NEDECSSTEL 438 N A G+ +P DE++L AVA PV+ +DAS +FQ Y SGV+ ++ + Sbjct: 239 NHAARVGGYRAVPPADERQLATAVARQ-PVTAYVDASGPAFQFYGSGVFPGPRGTAAPKP 297 Query: 439 DHGVLVVGYGND-EQGVEYWLLKNCWAARW 525 +H V +VGY D G +YW+ KN W W Sbjct: 298 NHAVTLVGYCQDGASGKKYWIAKNSWGKTW 327 >UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|Rep: Cruzipain precursor - Trypanosoma cruzi Length = 467 Score = 87.0 bits (206), Expect = 3e-16 Identities = 42/87 (48%), Positives = 59/87 (67%), Gaps = 5/87 (5%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG+CGSCW+FS G +E Q F L +LSEQ L+ C + ++GC+GGLM+NAF++ Sbjct: 139 KDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKT--DSGCSGGLMNNAFEW 196 Query: 182 I--KDNGGIDTEQTYPY---EGVDDKC 247 I ++NG + TE +YPY EG+ C Sbjct: 197 IVQENNGAVYTEDSYPYASGEGISPPC 223 Score = 78.6 bits (185), Expect = 1e-13 Identities = 40/86 (46%), Positives = 55/86 (63%) Frame = +1 Query: 268 GAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHG 447 GA G V++P+ DE ++ +A GPV+VA+DAS S+ Y+ GV C S +LDHG Sbjct: 231 GATITGHVELPQ-DEAQIAAWLAVNGPVAVAVDAS--SWMTYTGGVMTS--CVSEQLDHG 285 Query: 448 VLVVGYGNDEQGVEYWLLKNCWAARW 525 VL+VGY ND V YW++KN W +W Sbjct: 286 VLLVGY-NDSAAVPYWIIKNSWTTQW 310 >UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep: LOC443661 protein - Xenopus laevis (African clawed frog) Length = 346 Score = 86.6 
bits (205), Expect = 4e-16 Identities = 38/81 (46%), Positives = 55/81 (67%) Frame = +2 Query: 8 QGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIK 187 Q KCGSC++FS GALE Q ++ G LV+ S Q L+DCS GN GC GG + ++F Y+K Sbjct: 158 QRKCGSCYAFSAVGALECQWKKKKGTLVTFSPQELVDCSYSEGNKGCKGGSIRSSFTYMK 217 Query: 188 DNGGIDTEQTYPYEGVDDKCR 250 +G ++ + YPY G ++KC+ Sbjct: 218 KSGVME-DFNYPYTGKEEKCK 237 Score = 35.9 bits (79), Expect = 0.70 Identities = 20/39 (51%), Positives = 23/39 (58%) Frame = +1 Query: 256 PKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDAS 372 P TG F +P DE LM+ V TVGPVSVAI+ S Sbjct: 241 PSKTGVIK-DFHSVPARDEILLMKVVGTVGPVSVAINCS 278 >UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromeliaceae|Rep: Fruit bromelain precursor - Ananas comosus (Pineapple) Length = 351 Score = 86.6 bits (205), Expect = 4e-16 Identities = 37/82 (45%), Positives = 52/82 (63%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+Q CGSCWSF+ +EG + ++GYLVSLSEQ ++DC+ Y GC GG ++ A+ + Sbjct: 139 KNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVSY---GCKGGWVNKAYDF 195 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 I N G+ TE+ YPY C Sbjct: 196 IISNNGVTTEENYPYLAYQGTC 217 Score = 68.9 bits (161), Expect = 8e-11 Identities = 30/81 (37%), Positives = 50/81 (61%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVG 462 G+ + DE+ +M AV+ P++ IDAS +FQ Y+ GV++ C T L+H + ++G Sbjct: 230 GYSYVRRNDERSMMYAVSNQ-PIAALIDASE-NFQYYNGGVFS-GPCG-TSLNHAITIIG 285 Query: 463 YGNDEQGVEYWLLKNCWAARW 525 YG D G +YW+++N W + W Sbjct: 286 YGQDSSGTKYWIVRNSWGSSW 306 >UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa (japonica cultivar-group)|Rep: Os09g0562700 protein - Oryza sativa subsp. 
japonica (Rice) Length = 235 Score = 86.2 bits (204), Expect = 5e-16 Identities = 39/75 (52%), Positives = 50/75 (66%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG+CGSCW+FST +EG + G LVSLSEQ L+DC ++GC+GG+ A ++ Sbjct: 25 KDQGRCGSCWAFSTVAVVEGIQKIKKGKLVSLSEQELVDCDTL--DSGCDGGVSYRALEW 82 Query: 182 IKDNGGIDTEQTYPY 226 I NGGI T YPY Sbjct: 83 ITANGGITTRDDYPY 97 Score = 59.7 bits (138), Expect = 5e-08 Identities = 31/74 (41%), Positives = 41/74 (55%), Gaps = 8/74 (10%) Frame = +1 Query: 334 ATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQ--------GVE 489 A PV+V+I+A +FQ Y GVY D T L+HGV VVGYG +E G + Sbjct: 135 AAAQPVAVSIEAGGDNFQHYRKGVY--DGPCGTRLNHGVTVVGYGQEEAAADGGAAGGDK 192 Query: 490 YWLLKNCWAARWAN 531 YW++KN W W + Sbjct: 193 YWIIKNSWGKNWGD 206 >UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum|Rep: Falcipain 2 - Plasmodium falciparum Length = 484 Score = 86.2 bits (204), Expect = 5e-16 Identities = 38/75 (50%), Positives = 54/75 (72%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQ CGSCW+FS+ G++E Q+ + L++LSEQ L+DCS + N GCNGGL++NAF+ Sbjct: 277 KDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCS--FKNYGCNGGLINNAFED 334 Query: 182 IKDNGGIDTEQTYPY 226 + + GGI + YPY Sbjct: 335 MIELGGICPDGDYPY 349 Score = 48.4 bits (110), Expect = 1e-04 Identities = 26/81 (32%), Positives = 47/81 (58%), Gaps = 9/81 (11%) Frame = +1 Query: 310 EQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDE---- 477 + KL EA+ +GP+S+++ S F Y G++ + EC +L+H V++VG+G E Sbjct: 376 DNKLKEALRFLGPISISVAVSD-DFAFYKEGIF-DGECGD-QLNHAVMLVGFGMKEIVNP 432 Query: 478 ---QGVE--YWLLKNCWAARW 525 +G + Y+++KN W +W Sbjct: 433 LTKKGEKHYYYIIKNSWGQQW 453 >UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; Leishmania|Rep: Cysteine proteinase 2 precursor - Leishmania pifanoi Length = 444 Score = 86.2 bits (204), Expect = 5e-16 Identities = 40/77 (51%), Positives = 51/77 (66%), Gaps = 2/77 (2%) Frame = +2 Query: 2 
KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG CGSCW+FS G +EGQ + LVSLSEQ L+ C + N+GC+GGLM AF + Sbjct: 142 KDQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDDM--NDGCDGGLMLQAFDW 199 Query: 182 I--KDNGGIDTEQTYPY 226 + NG + TE +YPY Sbjct: 200 LLQNTNGHLHTEDSYPY 216 Score = 62.5 bits (145), Expect = 7e-09 Identities = 38/87 (43%), Positives = 51/87 (58%), Gaps = 1/87 (1%) Frame = +1 Query: 268 GAEDVGFVDIPEGDEQKLMEA-VATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDH 444 GA+ G V I G +K M A +A GP+++A+DAS SF Y SGV C +L+H Sbjct: 236 GAQIDGHVLI--GSSEKAMAAWLAKNGPIAIALDAS--SFMSYKSGVLTA--CIGKQLNH 289 Query: 445 GVLVVGYGNDEQGVEYWLLKNCWAARW 525 GVL+VGY + V YW++KN W W Sbjct: 290 GVLLVGYDMTGE-VPYWVIKNSWGGDW 315 >UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n=1; Xenopus tropicalis|Rep: UPI00006A2275 UniRef100 entry - Xenopus tropicalis Length = 272 Score = 85.8 bits (203), Expect = 7e-16 Identities = 41/84 (48%), Positives = 56/84 (66%), Gaps = 1/84 (1%) Frame = +2 Query: 2 KDQGK-CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFK 178 +DQG C SC++FS GALE Q +++ LV+ S Q L+DCS+ GN+GCNGG ++ AFK Sbjct: 95 RDQGSFCRSCYAFSAVGALECQWKKKTVRLVTFSPQELVDCSDGEGNHGCNGGKIEKAFK 154 Query: 179 YIKDNGGIDTEQTYPYEGVDDKCR 250 Y+K G ++ E YPY G CR Sbjct: 155 YMKKYGVME-ESAYPYTGQKGLCR 177 Score = 83.8 bits (198), Expect = 3e-15 Identities = 37/72 (51%), Positives = 50/72 (69%) Frame = +1 Query: 292 DIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGN 471 D+P G+E LM V T+GPVSV+I+AS F + SGVY +C +++H VLVVGYG Sbjct: 192 DLPSGNETLLMNTVGTIGPVSVSINASSEKFHQFKSGVYYNPDCLPNKVNHAVLVVGYGK 251 Query: 472 DEQGVEYWLLKN 507 E G++YWL+KN Sbjct: 252 -ENGMDYWLVKN 262 >UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 513 Score = 85.8 bits (203), Expect = 7e-16 Identities = 35/73 (47%), Positives = 51/73 (69%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K QG CGSC++F+ GALEG HF ++G + LSEQ 
++DC+ +GN GC GG A ++ Sbjct: 312 KSQGICGSCYAFAVAGALEGAHFIKTGLKLDLSEQQIVDCTWGFGNRGCKGGYPYRAMQW 371 Query: 182 IKDNGGIDTEQTY 220 I +GG+ TE++Y Sbjct: 372 ILKHGGLATEESY 384 Score = 84.2 bits (199), Expect = 2e-15 Identities = 40/93 (43%), Positives = 59/93 (63%), Gaps = 2/93 (2%) Frame = +1 Query: 253 HPKNT--GAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECS 426 H KNT GA ++ I +G+ +L AVA GPVS+ ++ +F+ Y SG+Y + +C+ Sbjct: 395 HFKNTSIGARLDKYMSIRQGNTSQLKLAVAFYGPVSILVNTQPKTFKFYGSGIYYDTQCT 454 Query: 427 STELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 LDH L VGYG +E+GV YW++KN W+A W Sbjct: 455 HA-LDHAALAVGYG-EEKGVSYWIVKNSWSAMW 485 >UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8; Magnoliophyta|Rep: Os12g0273800 protein - Oryza sativa subsp. japonica (Rice) Length = 504 Score = 85.4 bits (202), Expect = 9e-16 Identities = 48/94 (51%), Positives = 55/94 (58%), Gaps = 5/94 (5%) Frame = +1 Query: 259 KNTGAEDV-----GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDEC 423 K T A DV G+ D+P DE LM+AVA PVSVA+DAS FQ Y GV EC Sbjct: 222 KTTAAADVAASIRGYEDVPANDEPSLMKAVAGQ-PVSVAVDAS--KFQFYGGGVM-AGEC 277 Query: 424 SSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 T LDHGV V+GYG G +YWL+KN W W Sbjct: 278 G-TSLDHGVTVIGYGAASDGTKYWLVKNSWGTTW 310 Score = 71.3 bits (167), Expect = 2e-11 Identities = 35/83 (42%), Positives = 47/83 (56%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG+C A+EG +G L+SLSEQ L+DC + GC GG +D AF++ Sbjct: 150 KDQGQC----------AMEGFVKLSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQF 199 Query: 182 IKDNGGIDTEQTYPYEGVDDKCR 250 I NGG+ E YPY D +C+ Sbjct: 200 ILSNGGLTAEANYPYTAEDGRCK 222 >UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaster|Rep: CG11459-PA - Drosophila melanogaster (Fruit fly) Length = 336 Score = 85.4 bits (202), Expect = 9e-16 Identities = 41/82 (50%), Positives = 57/82 (69%), Gaps = 1/82 (1%) Frame = +2 Query: 5 DQG-KCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 DQG +C SCW+FST+G LE ++ G LV LS ++L+DC Y NNGC+GG + AF Y Sbjct: 135 
DQGTECLSCWAFSTSGVLEAHMAKKYGNLVPLSPKHLVDC-VPYPNNGCSGGWVSVAFNY 193 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 +D+ GI T+++YPYE V +C Sbjct: 194 TRDH-GIATKESYPYEPVSGEC 214 Score = 72.5 bits (170), Expect = 7e-12 Identities = 33/83 (39%), Positives = 49/83 (59%), Gaps = 2/83 (2%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSS--TELDHGVLV 456 G+V + DE++L E V +GPV+V+ID H F YS GV + C S +L H VL+ Sbjct: 227 GYVTLGNYDERELAEVVYNIGPVAVSIDHLHEEFDQYSGGVLSIPACRSKRQDLTHSVLL 286 Query: 457 VGYGNDEQGVEYWLLKNCWAARW 525 VG+G + +YW++KN + W Sbjct: 287 VGFGTHRKWGDYWIIKNSYGTDW 309 >UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 84.6 bits (200), Expect = 2e-15 Identities = 37/84 (44%), Positives = 58/84 (69%), Gaps = 2/84 (2%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSE-QYGNNGCNGGLMDNAFK 178 +DQG CGSC++F++TGALEG + ++G L S Q ++DC++ Q+ GC+GG F Sbjct: 143 RDQGNCGSCYAFASTGALEGLYQIKTGKLEVFSPQYIVDCAKHQFSRGGCHGGYSSGVFT 202 Query: 179 YIKDNGGIDTEQTYPYEGVD-DKC 247 ++K+N G++ E YPY+G + DKC Sbjct: 203 FVKEN-GMNLESRYPYKGEENDKC 225 Score = 48.0 bits (109), Expect = 2e-04 Identities = 31/78 (39%), Positives = 45/78 (57%), Gaps = 1/78 (1%) Frame = +1 Query: 295 IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSST-ELDHGVLVVGYGN 471 I +GD Q++ E V PVS+++DA Q Y SG+ + CS T ++H VL VGY + Sbjct: 240 INQGDCQEI-ERVLFKQPVSISLDAEKV--QHYQSGILKQ--CSDTININHEVLAVGYTS 294 Query: 472 DEQGVEYWLLKNCWAARW 525 D Y++LKN W + W Sbjct: 295 D-----YFILKNSWGSDW 307 >UniRef50_Q22AB1 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 344 Score = 84.6 bits (200), Expect = 2e-15 Identities = 38/85 (44%), Positives = 54/85 (63%), Gaps = 1/85 (1%) Frame = +2 Query: 2 
KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K Q CGSCW+F+TTG +E Q+ + G L+ SEQ L+DC N GC GGLM +A+++ Sbjct: 147 KFQNTCGSCWTFATTGVIESQYALKYGELLHFSEQMLLDCDNI--NQGCRGGLMTDAYQF 204 Query: 182 IKDNGGIDTEQTY-PYEGVDDKCRY 253 ++ +GGI T TY Y+ D C + Sbjct: 205 LQQSGGIQTADTYGDYKNKKDICNF 229 Score = 67.3 bits (157), Expect = 2e-10 Identities = 35/85 (41%), Positives = 50/85 (58%) Frame = +1 Query: 271 AEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGV 450 A+ V + IPE +E E V GPV+V I+A + Q Y G+ + C +++H V Sbjct: 236 AKVVDWYQIPENEETIRRELVKN-GPVAVGINAR--TLQFYEGGIVDPKNCDD-KINHAV 291 Query: 451 LVVGYGNDEQGVEYWLLKNCWAARW 525 L+VGYG +E G+ YWL+KN W A W Sbjct: 292 LIVGYGVEE-GIPYWLIKNQWGAEW 315 >UniRef50_Q248G1 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 334 Score = 84.2 bits (199), Expect = 2e-15 Identities = 39/89 (43%), Positives = 55/89 (61%), Gaps = 3/89 (3%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC---SEQYGNNGCNGGLMDNA 172 K+QG CGSCW+FS G +E + + G VS +EQ ++DC S Y ++GCNGG + A Sbjct: 138 KNQGHCGSCWTFSIAGIVESHYVLKHGSYVSYAEQEILDCVSVSAGYQSDGCNGGWPEEA 197 Query: 173 FKYIKDNGGIDTEQTYPYEGVDDKCRYIP 259 +Y+ + G + +E YPY V KCR IP Sbjct: 198 LQYVIEYGIVKSE-VYPYVAVQGKCRDIP 225 Score = 47.6 bits (108), Expect = 2e-04 Identities = 22/69 (31%), Positives = 40/69 (57%), Gaps = 1/69 (1%) Frame = +1 Query: 322 MEAVATVGPVSVAIDASHTSFQLYSSGVYNE-DECSSTELDHGVLVVGYGNDEQGVEYWL 498 ++A PVSV +DAS +++ Y SG+++ + +L+H ++ VGY D W+ Sbjct: 247 LKAAIAKAPVSVCVDAS--TWKFYKSGIFSGCGPTTEDDLNHAIVAVGYDADGN----WI 300 Query: 499 LKNCWAARW 525 ++N WA +W Sbjct: 301 IRNSWATKW 309 >UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=17; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 318 Score = 84.2 bits (199), Expect = 2e-15 Identities 
= 39/73 (53%), Positives = 48/73 (65%) Frame = +1 Query: 307 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGV 486 +E +L A G VS+AIDAS FQLYSSG+YN CSST LDH V +VGYG E V Sbjct: 219 NEDELKAGCAKGGVVSIAIDASGYDFQLYSSGIYNPKSCSSTFLDHAVGLVGYGT-ENKV 277 Query: 487 EYWLLKNCWAARW 525 +YW+++N W W Sbjct: 278 DYWIVRNSWGTSW 290 Score = 71.3 bits (167), Expect = 2e-11 Identities = 42/105 (40%), Positives = 56/105 (53%), Gaps = 2/105 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQ +CGSCW+FS A E Q + G L+SL+EQN++DC + GC+GG A+ Y Sbjct: 116 KDQAQCGSCWAFSVVQAQESQWALKKGQLLSLAEQNMVDCVDTC--YGCDGGDEYLAYDY 173 Query: 182 -IKDNGGI-DTEQTYPYEGVDDKCRYIPRTPVLRTWASWTSPRAT 310 IK G+ E YPY D C++ V T S+ P T Sbjct: 174 VIKHQKGLWMLETDYPYTARDGSCKFKAAKGVTLT-KSYVRPTTT 217 >UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrovirus|Rep: Viral cathepsin - Lymantria dispar multicapsid nuclear polyhedrosis virus (LdMNPV) Length = 356 Score = 83.8 bits (198), Expect = 3e-15 Identities = 37/82 (45%), Positives = 52/82 (63%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CG+CW+F+T ++E Q + L+ LSEQ LIDC + GCNGGL+ AF+ Sbjct: 160 KNQGACGACWAFATLASVESQFAMRHNRLIDLSEQQLIDCDSV--DMGCNGGLLHTAFEE 217 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 I GG+ TE YP+ G + +C Sbjct: 218 IMRMGGVQTELDYPFVGRNRRC 239 Score = 60.1 bits (139), Expect = 4e-08 Identities = 31/73 (42%), Positives = 43/73 (58%) Frame = +1 Query: 307 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGV 486 +E+KL + + VGP+ +AIDA+ Y GV + C + L+H VL+VGYG E GV Sbjct: 261 NEEKLKDLLRAVGPIPMAIDAA--DIVNYYRGVISS--CENNGLNHAVLLVGYGV-ENGV 315 Query: 487 EYWLLKNCWAARW 525 YW+ KN W W Sbjct: 316 PYWVFKNTWGDDW 328 >UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep: Viral cathepsin - Xestia c-nigrum granulosis virus (XnGV) (Xestia c-nigrumgranulovirus) Length = 346 Score = 83.8 bits (198), Expect = 3e-15 Identities = 41/87 (47%), Positives = 51/87 (58%) Frame 
= +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K Q +CGSCW+FS +E + + + LSEQ L+DC + NNGCNGGLM AF+ Sbjct: 149 KMQKECGSCWAFSAVANIESLYHIKHNVSLDLSEQQLVDCDKV--NNGCNGGLMSWAFEG 206 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIPR 262 I GGI E YPY GVD C+ R Sbjct: 207 IIRAGGISYEAPYPYTGVDGVCKNTTR 233 Score = 60.5 bits (140), Expect = 3e-08 Identities = 35/73 (47%), Positives = 42/73 (57%), Gaps = 1/73 (1%) Frame = +1 Query: 310 EQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTE-LDHGVLVVGYGNDEQGV 486 E+KL + + GPVSVAID Y SGV CS L+HGVL+VGYG E V Sbjct: 248 EKKLRQVLHEKGPVSVAIDV--VDLTNYKSGVAKH--CSVDHGLNHGVLLVGYGQ-ENDV 302 Query: 487 EYWLLKNCWAARW 525 +YW LKN W + W Sbjct: 303 KYWTLKNSWGSDW 315 >UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 360 Score = 83.0 bits (196), Expect = 5e-15 Identities = 39/92 (42%), Positives = 55/92 (59%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K Q CG CW+FST ++EG +F ++G L SLS Q +IDC + +GC GG + AF+ Sbjct: 147 KVQNGCGGCWAFSTVQSIEGLYFLKTGKLESLSTQQVIDCC-RIDESGCLGGDPEPAFRC 205 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIPRTPVLR 277 I++NGGI TE YPY C++ P + Sbjct: 206 IQNNGGIMTETEYPYIAKQQSCKFDEDKPTFQ 237 Score = 72.9 bits (171), Expect = 5e-12 Identities = 34/83 (40%), Positives = 54/83 (65%), Gaps = 2/83 (2%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTE-LDHGVLVV 459 G++D+P +Q ++A + P+S+ +++S TSF+ Y SGV E E + DH +L+V Sbjct: 240 GYIDVPS--DQSQVKAALLIQPLSICLNSSDTSFKYYKSGVITECEDGPYDGPDHCLLLV 297 Query: 460 GYGNDEQ-GVEYWLLKNCWAARW 525 GYG+DE+ V+YWL+KN W W Sbjct: 298 GYGHDEELKVDYWLIKNQWGTTW 320 >UniRef50_Q4TI44 Cluster: Chromosome undetermined SCAF2412, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF2412, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 
123 Score = 83.0 bits (196), Expect = 5e-15 Identities = 33/74 (44%), Positives = 51/74 (68%) Frame = +1 Query: 304 GDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQG 483 G+E+ L A+ GPV++ IDA+ T+F LYS GVY + +C+ +++H VL+VGYG +G Sbjct: 23 GNEKLLAYALFKHGPVAIGIDATLTTFHLYSKGVYYDPDCNPEDINHAVLLVGYGVTRRG 82 Query: 484 VEYWLLKNCWAARW 525 +YW++KN W W Sbjct: 83 QQYWIVKNSWGTGW 96 >UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep: Cysteine proteinase - Entamoeba histolytica Length = 320 Score = 83.0 bits (196), Expect = 5e-15 Identities = 38/84 (45%), Positives = 56/84 (66%) Frame = +1 Query: 274 EDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVL 453 ++ G V + + +E L+EA+A GPV+VAIDA SFQLY SGVY+E +C L+H V Sbjct: 206 KNAGQVIVEQRNEVALVEAIAE-GPVAVAIDAGQASFQLYKSGVYDEPKCKKVILNHAVC 264 Query: 454 VVGYGNDEQGVEYWLLKNCWAARW 525 VGYG+ + G +Y++++N W W Sbjct: 265 AVGYGS-QDGQDYYIVRNSWGTSW 287 Score = 68.5 bits (160), Expect = 1e-10 Identities = 36/89 (40%), Positives = 51/89 (57%), Gaps = 5/89 (5%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALE-----GQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMD 166 +D +CGSC+SF + A+E G + + LSEQ ++DCS + NNGCNGG + Sbjct: 113 RDHTQCGSCYSFGSLAAIESRLLIGGSQTYNADNLDLSEQQIVDCSNK--NNGCNGGSIL 170 Query: 167 NAFKYIKDNGGIDTEQTYPYEGVDDKCRY 253 F Y K NG I+ E+ YPY + C+Y Sbjct: 171 YVFAYTKRNGVIE-EKDYPYTATNGTCQY 198 >UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin L preproprotein; n=1; Monodelphis domestica|Rep: PREDICTED: similar to cathepsin L preproprotein - Monodelphis domestica Length = 356 Score = 82.6 bits (195), Expect = 6e-15 Identities = 42/95 (44%), Positives = 59/95 (62%), Gaps = 1/95 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 ++QG+CG+CW+FST G+LEGQ FR++G LV LS+Q LIDCS Y C GG + A + Sbjct: 131 RNQGECGACWAFSTIGSLEGQLFRKTGRLVELSKQMLIDCSGYY---TCMGGSLTGALDF 187 Query: 182 IKDNGGIDTEQTYPY-EGVDDKCRYIPRTPVLRTW 283 I+ G+ +E+ YPY GV+ I + W Sbjct: 188 IR-RYGVVSERCYPYMNGVNKDTSGIAMVKFAKAW 221 Score = 
69.3 bits (162), Expect = 6e-11 Identities = 40/99 (40%), Positives = 57/99 (57%), Gaps = 17/99 (17%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECS---STELDHGVLV 456 +V +P GDE+ LM+AVATVGPV+VAI A SF+ Y G Y E C + ++H +LV Sbjct: 234 YVTLPSGDERALMQAVATVGPVAVAIHAP-PSFRYYQGGPYIEPRCRLSYMSNMNHALLV 292 Query: 457 VGYG------NDEQGVE--------YWLLKNCWAARWAN 531 VGYG +E G++ +W+ KN W +W + Sbjct: 293 VGYGPLERSKYEEFGLQAYMHKDNKFWIAKNSWGEQWGD 331 >UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis|Rep: Cysteine protease 2 - Babesia bovis Length = 445 Score = 82.6 bits (195), Expect = 6e-15 Identities = 42/82 (51%), Positives = 49/82 (59%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG CGSCW+F+ G++E RQ V LSEQ L+ C Q GN GCNGG D A Y Sbjct: 252 KDQGMCGSCWAFAAVGSVESLLKRQKTD-VRLSEQELVSC--QLGNQGCNGGYSDYALNY 308 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 IK N GI + +PY D KC Sbjct: 309 IKFN-GIHRSEEWPYLAADGKC 329 Score = 56.0 bits (129), Expect = 6e-07 Identities = 30/63 (47%), Positives = 35/63 (55%), Gaps = 1/63 (1%) Frame = +1 Query: 340 VGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQ-GVEYWLLKNCWA 516 +GP V I S YS GV+N ECS +EL+H VL+VG G D YWLLKN W Sbjct: 357 MGPTVVYIAVSEDLMH-YSGGVFN-GECSDSELNHAVLLVGEGYDSALKKRYWLLKNSWG 414 Query: 517 ARW 525 W Sbjct: 415 TSW 417 >UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; Entamoeba|Rep: Cysteine proteinase 2 precursor - Entamoeba histolytica Length = 315 Score = 82.6 bits (195), Expect = 6e-15 Identities = 36/86 (41%), Positives = 54/86 (62%), Gaps = 3/86 (3%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSG---YLVSLSEQNLIDCSEQYGNNGCNGGLMDNA 172 +DQ +CGSC++F + ALEG+ + G + LSE++++ C+ GNNGCNGGL N Sbjct: 110 RDQAQCGSCYTFGSLAALEGRLLIEKGGDANTLDLSEEHMVQCTRDNGNNGCNGGLGSNV 169 Query: 173 FKYIKDNGGIDTEQTYPYEGVDDKCR 250 + YI ++ G+ E YPY G D C+ Sbjct: 170 YDYIIEH-GVAKESDYPYTGSDSTCK 194 Score = 72.5 bits (170), Expect = 7e-12 Identities = 42/128 (32%), Positives = 66/128 (51%), 
Gaps = 2/128 (1%) Frame = +1 Query: 154 GAHGQRLQVHQGQRGHRHRADLPLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAV 333 G G + + + G +D P GS + + K+ A+ G+ +P +E +L A+ Sbjct: 163 GGLGSNVYDYIIEHGVAKESDYPYTGSDSTCKTNVKSF-AKITGYTKVPRNNEAELKAAL 221 Query: 334 ATVGPVSVAIDASHTSFQLYSSGVYNEDECSST--ELDHGVLVVGYGNDEQGVEYWLLKN 507 + G V V+IDAS FQLY SG Y + +C + L+H V VGYG + G E W+++N Sbjct: 222 SQ-GLVDVSIDASSAKFQLYKSGAYTDTKCKNNYFALNHEVCAVGYGVVD-GKECWIVRN 279 Query: 508 CWAARWAN 531 W W + Sbjct: 280 SWGTGWGD 287 >UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 392 Score = 82.2 bits (194), Expect = 8e-15 Identities = 39/88 (44%), Positives = 53/88 (60%), Gaps = 5/88 (5%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCS-----EQYGNNGCNGGLMD 166 K QG CGSCW+F+T GA+E HF Q G L++L+EQ L+DC+ +GNNGC GG Sbjct: 193 KGQGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCTWSTPGVYHGNNGCLGGWTW 252 Query: 167 NAFKYIKDNGGIDTEQTYPYEGVDDKCR 250 AF ++K G T+ Y G + C+ Sbjct: 253 KAFSWVKKFGIATTKSYGHYRGQEGFCK 280 Score = 73.3 bits (172), Expect = 4e-12 Identities = 30/71 (42%), Positives = 50/71 (70%) Frame = +1 Query: 319 LMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGVEYWL 498 L +A++ GP +++I+A+ S + YS G+ ++ CS+ + DH VL++GYG+D GV YWL Sbjct: 304 LKKALSYHGPATISINANPKSLKFYSDGIMSDKHCSN-KTDHAVLLIGYGSDN-GVPYWL 361 Query: 499 LKNCWAARWAN 531 +KN W+ +W N Sbjct: 362 IKNSWSHKWGN 372 >UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; Leishmania|Rep: Cysteine proteinase 1 precursor - Leishmania pifanoi Length = 354 Score = 81.8 bits (193), Expect = 1e-14 Identities = 40/77 (51%), Positives = 48/77 (62%), Gaps = 2/77 (2%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGSCW+FS G +EGQ LVSLSEQ L+ C + GCNGGLMD A + Sbjct: 145 KNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQMLVSCDNI--DEGCNGGLMDQAMNW 202 Query: 182 I--KDNGGIDTEQTYPY 226 I NG + TE +YPY Sbjct: 203 IMQSHNGSVFTEASYPY 219 Score = 71.3 bits (167), 
Expect = 2e-11 Identities = 37/86 (43%), Positives = 55/86 (63%) Frame = +1 Query: 268 GAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHG 447 GA+ GF+ +P DE+++ E V GPV+VA+DA T++QLY GV + C + L+HG Sbjct: 236 GAKITGFLSLPH-DEERIAEWVEKRGPVAVAVDA--TTWQLYFGGVVS--LCLAWSLNHG 290 Query: 448 VLVVGYGNDEQGVEYWLLKNCWAARW 525 VL+VG+ N YW++KN W + W Sbjct: 291 VLIVGF-NKNAKPPYWIVKNSWGSSW 315 >UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudicotyledons|Rep: Chymopapain precursor - Carica papaya (Papaya) Length = 352 Score = 81.4 bits (192), Expect = 1e-14 Identities = 36/83 (43%), Positives = 52/83 (62%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGSCW+FST +EG + +G L+ LSEQ L+DC + + GC GG + +Y Sbjct: 151 KNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH--SYGCKGGYQTTSLQY 208 Query: 182 IKDNGGIDTEQTYPYEGVDDKCR 250 + +N G+ T + YPY+ KCR Sbjct: 209 VANN-GVHTSKVYPYQAKQYKCR 230 Score = 61.3 bits (142), Expect = 2e-08 Identities = 32/81 (39%), Positives = 44/81 (54%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVG 462 G+ +P E + A+A P+SV ++A FQLY SGV+ D T+LDH V VG Sbjct: 243 GYKRVPSNCETSFLGALANQ-PLSVLVEAGGKPFQLYKSGVF--DGPCGTKLDHAVTAVG 299 Query: 463 YGNDEQGVEYWLLKNCWAARW 525 YG + G Y ++KN W W Sbjct: 300 YGTSD-GKNYIIIKNSWGPNW 319 >UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted, possible transmembrane domain near N-terminus; n=4; Cryptosporidium|Rep: Cryptopain-cysteine proteinase secreted, possible transmembrane domain near N-terminus - Cryptosporidium parvum Iowa II Length = 401 Score = 81.0 bits (191), Expect = 2e-14 Identities = 39/83 (46%), Positives = 49/83 (59%), Gaps = 1/83 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGY-LVSLSEQNLIDCSEQYGNNGCNGGLMDNAFK 178 ++Q CGSCW+FS ALEG Q+ L SLSEQ +DCS+Q GN GC+GG M AF+ Sbjct: 192 RNQKNCGSCWAFSAVAALEGATCAQTNRGLPSLSEQQFVDCSKQNGNFGCDGGTMGLAFQ 251 Query: 179 YIKDNGGIDTEQTYPYEGVDDKC 247 Y N + T YPY + C Sbjct: 252 
YAIKNKYLCTNDDYPYFAEEKTC 274 Score = 71.3 bits (167), Expect = 2e-11 Identities = 34/70 (48%), Positives = 44/70 (62%), Gaps = 1/70 (1%) Frame = +1 Query: 319 LMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQ-GVEYW 495 L A+A GP+SVAI A T FQ Y SGV+ D T+++HGV++VGY DE EYW Sbjct: 301 LKTALAKYGPISVAIQADQTPFQFYKSGVF--DAPCGTKVNHGVVLVGYDMDEDTNKEYW 358 Query: 496 LLKNCWAARW 525 L++N W W Sbjct: 359 LVRNSWGEAW 368 >UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foetus|Rep: TFCP2 protein - Tritrichomonas foetus (Trichomonas foetus) Length = 270 Score = 81.0 bits (191), Expect = 2e-14 Identities = 39/93 (41%), Positives = 52/93 (55%), Gaps = 3/93 (3%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC-SEQYGNNGCNGGLMDNAFK 178 K+QG CGSCW+FS A E H +G L+ SEQ+L+DC + Y GC+GG D A K Sbjct: 66 KNQGSCGSCWAFSAIAAQESCHAIATGELLRFSEQSLVDCVTSDYSCQGCSGGWPDQAMK 125 Query: 179 YI--KDNGGIDTEQTYPYEGVDDKCRYIPRTPV 271 Y+ + NG E+ Y Y G C Y ++ V Sbjct: 126 YVIEQQNGKFILEENYQYSGHKGACLYDEKSKV 158 Score = 70.5 bits (165), Expect = 3e-11 Identities = 33/77 (42%), Positives = 45/77 (58%), Gaps = 1/77 (1%) Frame = +1 Query: 298 PEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTEL-DHGVLVVGYGND 474 P+ DEQ L +A GPVS +DA H SFQLY G+Y C + + +H + +VGYG Sbjct: 168 PQSDEQNLKGHIAANGPVSCNVDAGHYSFQLYQGGIYWSWFCRTQYIYNHAMGIVGYG-V 226 Query: 475 EQGVEYWLLKNCWAARW 525 E EYW+++N W W Sbjct: 227 EGSEEYWIVRNSWGESW 243 >UniRef50_Q24FN2 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 367 Score = 81.0 bits (191), Expect = 2e-14 Identities = 39/92 (42%), Positives = 52/92 (56%), Gaps = 3/92 (3%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC---SEQYGNNGCNGGLMDNA 172 K+QG CGSCW+FS E + ++ L SEQ L+DC + QY N GC GG A Sbjct: 171 KNQGSCGSCWAFSAVALAESVNLLRNNSLALYSEQELVDCTYKNPQYYNYGCQGGWPSVA 230 Query: 173 FKYIKDNGGIDTEQTYPYEGVDDKCRYIPRTP 268 ++YIKD GI ++Q YPY G + 
C +P Sbjct: 231 YRYIKDQ-GISSQQNYPYIGQNRNCSINSASP 261 Score = 57.6 bits (133), Expect = 2e-07 Identities = 32/90 (35%), Positives = 48/90 (53%) Frame = +1 Query: 256 PKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTE 435 PK A+D + G++ L++ P+SV +DA T++ YS GV+N C + Sbjct: 262 PKAFYAKDPIYYYTNNGNQTNLVQYAVNQAPISVLVDA--TNWSSYSQGVFN--NCGNVT 317 Query: 436 LDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 ++H VL+VGY D G WL+KN W W Sbjct: 318 INHAVLLVGY--DTSG--NWLVKNSWGTNW 343 >UniRef50_A3EXS2 Cluster: Cathepsin L-like cysteine proteinase-like protein; n=1; Maconellicoccus hirsutus|Rep: Cathepsin L-like cysteine proteinase-like protein - Maconellicoccus hirsutus (hibiscus mealybug) Length = 253 Score = 81.0 bits (191), Expect = 2e-14 Identities = 36/86 (41%), Positives = 55/86 (63%), Gaps = 3/86 (3%) Frame = +2 Query: 5 DQGKCGSCWSFSTTGALEGQH-FRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 +QGKC W+FS TGALE + + V LSEQNLI+CS +GN C+GG ++N +KY Sbjct: 50 NQGKCNVGWAFSVTGALESEKAIKYEAAPVKLSEQNLIECSGGFGNKRCSGGNLENTYKY 109 Query: 182 IKDNGGIDTEQTY--PYEGVDDKCRY 253 + + GI+ E +Y + ++ +C+Y Sbjct: 110 VNHSRGIEKEDSYRDNFRHINSRCQY 135 Score = 53.6 bits (123), Expect = 3e-06 Identities = 25/62 (40%), Positives = 37/62 (59%), Gaps = 2/62 (3%) Frame = +1 Query: 346 PVSVAIDASHTSFQLYSSGVYNEDECSST--ELDHGVLVVGYGNDEQGVEYWLLKNCWAA 519 PVSV I+ + SF+ Y +Y++ +C ++ E + VLVVGYG D +YWL+KN Sbjct: 165 PVSVYINPTLESFKHYKGDIYDDPQCDNSRHESSYAVLVVGYGTD-NNTDYWLIKNSLGT 223 Query: 520 RW 525 W Sbjct: 224 SW 225 >UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-like cysteine peptidase; n=3; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L or K-like cysteine peptidase - Trichomonas vaginalis G3 Length = 320 Score = 81.0 bits (191), Expect = 2e-14 Identities = 37/85 (43%), Positives = 52/85 (61%) Frame = +1 Query: 271 AEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGV 450 A+ GF + G L+EAV T S+ IDAS SF Y SG+Y++ +C T+LDH V Sbjct: 210 AKTTGFERVKPGSSDALIEAVQT-SVCSLLIDASINSFMQYKSGIYDDTKCDPTQLDHYV 268 
Query: 451 LVVGYGNDEQGVEYWLLKNCWAARW 525 +VGYG+ E G+ YW+++N W W Sbjct: 269 NLVGYGS-ESGINYWIIRNSWGEAW 292 Score = 61.7 bits (143), Expect = 1e-08 Identities = 33/95 (34%), Positives = 50/95 (52%), Gaps = 2/95 (2%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 ++QG+CG CW+FST +E + + L+ LSEQ L+DC + GC GG D+A + Sbjct: 120 RNQGQCGLCWAFSTICCVEARWAQAYNTLLQLSEQMLVDCVDTC--YGCMGGYADDAAAF 177 Query: 182 IKDN--GGIDTEQTYPYEGVDDKCRYIPRTPVLRT 280 + +N G T YPY C++ V +T Sbjct: 178 VIENYEGKFMTAADYPYIARASICKFDKTKSVAKT 212 >UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 280 Score = 80.2 bits (189), Expect = 3e-14 Identities = 37/84 (44%), Positives = 50/84 (59%), Gaps = 2/84 (2%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQ--YGNNGCNGGLMDNAF 175 K+QG CGSCW+F+ TG E + ++ + SEQ L+DCS Y N+GC GG AF Sbjct: 84 KNQGNCGSCWAFTITGLFESINLIRNKTVELYSEQELLDCSSNGIYRNSGCQGGWPHLAF 143 Query: 176 KYIKDNGGIDTEQTYPYEGVDDKC 247 +Y K N GI YPY+G+ + C Sbjct: 144 EYSKKN-GISLSSQYPYKGIQENC 166 Score = 43.6 bits (98), Expect = 0.004 Identities = 24/75 (32%), Positives = 45/75 (60%) Frame = +1 Query: 301 EGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQ 480 E ++ ++++ + P++V +DAS+ S Y SGV++ C+ T+ +H L+VGY N+ Sbjct: 189 ESNKIQIIKQLLLNSPLAVIVDASNWSN--YKSGVFSN--CT-TQQNHVALLVGYTNEGN 243 Query: 481 GVEYWLLKNCWAARW 525 W++KN W + W Sbjct: 244 ----WIIKNSWGSAW 254 >UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sativa|Rep: OSJNBb0085F13.15 protein - Oryza sativa subsp. 
japonica (Rice) Length = 383 Score = 80.2 bits (189), Expect = 3e-14 Identities = 34/83 (40%), Positives = 51/83 (61%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K QG+C +CW+F+ A+E H + G L+SLSEQ L+DC + G C+ G D+AF + Sbjct: 176 KHQGQCAACWAFAAVAAIESLHKIKGGDLISLSEQELVDCDDT-GEATCSKGYSDDAFLW 234 Query: 182 IKDNGGIDTEQTYPYEGVDDKCR 250 + N GI ++ YPY G + C+ Sbjct: 235 VSKNKGIASDLIYPYVGHKESCK 257 Score = 60.5 bits (140), Expect = 3e-08 Identities = 33/86 (38%), Positives = 46/86 (53%), Gaps = 3/86 (3%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLY-SSGVYNEDECSSTELDHGVLVV 459 G V +PE E +M AVA PV+V DA FQ Y +GVY ST ++H + +V Sbjct: 270 GVVTLPENREDLIMAAVARQ-PVAVVFDAGDPLFQNYRGNGVYKGGTGCSTNVNHALTIV 328 Query: 460 GYGND--EQGVEYWLLKNCWAARWAN 531 GYG + + G YW+ KN + W + Sbjct: 329 GYGTNHPDTGENYWIAKNSYGNLWGD 354 >UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_21, whole genome shotgun sequence - Paramecium tetraurelia Length = 349 Score = 80.2 bits (189), Expect = 3e-14 Identities = 41/85 (48%), Positives = 52/85 (61%), Gaps = 3/85 (3%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYL-VSLSEQNLIDCS--EQYGNNGCNGGLMDNA 172 K+QG CGSCW+FS ALE RQ G V LSEQ L+DC+ +++ + GC+GG M + Sbjct: 141 KNQGSCGSCWAFSAVAALE-TALRQGGVKNVELSEQELVDCAVKDEFESEGCDGGEMYDG 199 Query: 173 FKYIKDNGGIDTEQTYPYEGVDDKC 247 F+Y GI YPY GVD KC Sbjct: 200 FQY-ASKYGIAIRSEYPYAGVDQKC 223 Score = 59.7 bits (138), Expect = 5e-08 Identities = 36/107 (33%), Positives = 56/107 (52%), Gaps = 1/107 (0%) Frame = +1 Query: 208 RADLPLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQ 387 R++ P G ++ T + G+VD+ Q +EA A+ +S+ I+AS +FQ Sbjct: 211 RSEYPYAGVDQKCAAKQTKTRYQFAGYVDVEPLSAQAYVEA-ASEHALSIGINASGINFQ 269 Query: 388 LYSSGVYN-EDECSSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 LY G+Y+ + + S L+HGV VGY D Y+L+KN W W Sbjct: 270 LYKKGIYSAKCDGSKPALNHGVTNVGYAPD-----YYLIKNSWGQSW 311 
>UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sativa|Rep: Cysteine proteinase-like - Oryza sativa subsp. japonica (Rice) Length = 360 Score = 79.8 bits (188), Expect = 4e-14 Identities = 38/83 (45%), Positives = 50/83 (60%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+Q CGSCW+F+ A EG +G LVSLSEQ ++DC+ G N C+GG + A +Y Sbjct: 153 KNQRSCGSCWAFAAVAATEGLVQLATGNLVSLSEQQVLDCTG--GANTCSGGDVSAALRY 210 Query: 182 IKDNGGIDTEQTYPYEGVDDKCR 250 I +GG+ TE Y Y G CR Sbjct: 211 IAASGGLQTEAAYAYGGQQGACR 233 Score = 63.7 bits (148), Expect = 3e-09 Identities = 34/75 (45%), Positives = 42/75 (56%), Gaps = 1/75 (1%) Frame = +1 Query: 304 GDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYG-NDEQ 480 GDE L +A+A PV V ++AS F+ Y SGVY L+H V VVGYG + Sbjct: 256 GDEGAL-QALAAGQPVVVVVEASEPDFRHYRSGVYAGSAACGRRLNHAVTVVGYGAAADG 314 Query: 481 GVEYWLLKNCWAARW 525 G EYWL+KN W W Sbjct: 315 GGEYWLVKNQWGTWW 329 >UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis|Rep: Cathepsin L - Culicoides sonorensis Length = 331 Score = 79.8 bits (188), Expect = 4e-14 Identities = 34/82 (41%), Positives = 57/82 (69%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+Q +CGSCW+F++ ++E ++ R +L+EQ L+DC + ++GC+GG D A +Y Sbjct: 132 KNQAQCGSCWAFASVASVEMRYKRFHNKSYTLAEQELVDC--ETTSHGCSGGWSDLALQY 189 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 ++DN G+ E+ YPY+G D+KC Sbjct: 190 MRDN-GLSFEKDYPYKGKDEKC 210 Score = 47.6 bits (108), Expect = 2e-04 Identities = 28/104 (26%), Positives = 53/104 (50%), Gaps = 4/104 (3%) Frame = +1 Query: 214 DLPLRGS*RQVQVHPKNTGAEDVGFVDI--PEGDEQKLMEAVATVGPVSVAIDASHTSFQ 387 D P +G + + H N V V++ DE + GP+ V + +F+ Sbjct: 200 DYPYKG--KDEKCHASNENKSPVKVVNVCSTPKDEVSYKDHFYQYGPLVVYYFVDN-NFK 256 Query: 388 LYSSGVYNEDECS--STELDHGVLVVGYGNDEQGVEYWLLKNCW 513 Y G+++ C+ + ++H V+++GYG+ E+ V+YWL++N W Sbjct: 257 QYKGGIFSSKTCNVENAGINHAVVLMGYGS-EKDVKYWLVRNSW 299 >UniRef50_Q23H32 Cluster: Papain family cysteine protease 
containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 365 Score = 79.8 bits (188), Expect = 4e-14 Identities = 39/80 (48%), Positives = 51/80 (63%), Gaps = 1/80 (1%) Frame = +2 Query: 8 QGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIK 187 QG CGSCW+FST ALEG + +Q+G ++ SEQNLIDC + NNGCNGG + A + Sbjct: 152 QGGCGSCWAFSTVIALEGAYAKQTGNVIKFSEQNLIDCC-RIENNGCNGGDPEPALDCVM 210 Query: 188 D-NGGIDTEQTYPYEGVDDK 244 + GI Q YPY+ + K Sbjct: 211 NVLKGIMKNQDYPYQAITRK 230 Score = 66.9 bits (156), Expect = 3e-10 Identities = 35/91 (38%), Positives = 51/91 (56%), Gaps = 2/91 (2%) Frame = +1 Query: 259 KNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNED--ECSST 432 KN + D G+ +IP +E + EAV+ P+S I S +F+ Y G+ +E EC Sbjct: 238 KNVFSPD-GYENIPINNELAIKEAVSRQ-PISACISGSSQNFKFYKGGIADEKLLECDPQ 295 Query: 433 ELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 DH + +VGYG+ E G +YW+LKN W W Sbjct: 296 YTDHCLGIVGYGS-ENGKQYWILKNSWGENW 325 >UniRef50_Q23FR0 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 394 Score = 79.8 bits (188), Expect = 4e-14 Identities = 38/85 (44%), Positives = 49/85 (57%), Gaps = 2/85 (2%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYG--NNGCNGGLMDNAF 175 KDQG+CGSCW+F G +E + +G L S SEQ L+DC Q G ++GCNGG + Sbjct: 200 KDQGQCGSCWTFGAAGVMESFNAITNGVLKSFSEQQLVDCVHQAGFSSDGCNGGFQSDGV 259 Query: 176 KYIKDNGGIDTEQTYPYEGVDDKCR 250 +Y GI TE YPY V C+ Sbjct: 260 EY-AIKFGIVTEDKYPYTAVGGDCQ 283 Score = 48.0 bits (109), Expect = 2e-04 Identities = 23/69 (33%), Positives = 40/69 (57%), Gaps = 1/69 (1%) Frame = +1 Query: 322 MEAVATVGPVSVAIDASHTSFQLYSSGVY-NEDECSSTELDHGVLVVGYGNDEQGVEYWL 498 ++A PV+V++DAS+ + Y SG++ N E + +L+H V+ VGY D W+ Sbjct: 307 LKASLNFSPVTVSVDASN--WNSYESGIFDNCGETTQDQLNHAVIAVGYDTDGN----WI 360 Query: 499 LKNCWAARW 525 ++N W+ W Sbjct: 
361 IRNSWSTSW 369 >UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 306 Score = 79.8 bits (188), Expect = 4e-14 Identities = 34/72 (47%), Positives = 48/72 (66%) Frame = +1 Query: 310 EQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGVE 489 E +L +AVAT GP ++IDAS SF LY G+Y+E +CS +LDH V VGYG + + + Sbjct: 208 ETELAKAVATYGPAMISIDASQHSFMLYKEGIYDEPKCSEEDLDHAVGCVGYGVEGE-KD 266 Query: 490 YWLLKNCWAARW 525 YW+++N W W Sbjct: 267 YWIVRNSWGEVW 278 Score = 69.7 bits (163), Expect = 5e-11 Identities = 35/86 (40%), Positives = 43/86 (50%), Gaps = 2/86 (2%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGSCW+FS +E Q + L LSEQNL+DC GC GG A +Y Sbjct: 104 KNQGACGSCWAFSAIQVIESQVAKNQKQLYDLSEQNLLDCVTSC--FGCGGGWSPGALEY 161 Query: 182 I--KDNGGIDTEQTYPYEGVDDKCRY 253 + K N YPY V C+Y Sbjct: 162 VYEKQNSKFMLTTDYPYTAVQGTCKY 187 >UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Plasmodium|Rep: Cysteine protease falcipain-3 - Plasmodium falciparum Length = 492 Score = 79.4 bits (187), Expect = 6e-14 Identities = 36/75 (48%), Positives = 49/75 (65%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQ CGSCW+FS+ G++E Q+ + L SEQ L+DCS + NNGC GG + NAF Sbjct: 285 KDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVDCSVK--NNGCYGGYITNAFDD 342 Query: 182 IKDNGGIDTEQTYPY 226 + D GG+ ++ YPY Sbjct: 343 MIDLGGLCSQDDYPY 357 Score = 50.0 bits (114), Expect = 4e-05 Identities = 31/89 (34%), Positives = 49/89 (55%), Gaps = 9/89 (10%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGY 465 +V IP+ K EA+ +GP+S++I AS F Y G Y + EC + +H V++VGY Sbjct: 379 YVSIPD---DKFKEALRYLGPISISIAASD-DFAFYRGGFY-DGECGAAP-NHAVILVGY 432 Query: 466 G-----NDEQG----VEYWLLKNCWAARW 525 G N++ G Y+++KN W + W Sbjct: 433 GMKDIYNEDTGRMEKFYYYIIKNSWGSDW 461 >UniRef50_Q23RT7 
Cluster: Papain family cysteine protease containing protein; n=7; Hymenostomatida|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 387 Score = 79.4 bits (187), Expect = 6e-14 Identities = 34/82 (41%), Positives = 56/82 (68%), Gaps = 1/82 (1%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNE-DECSSTELDHGVLVV 459 G++ +PE D LM AVAT GP+ +++DAS+ F Y SGV++ D + +++H V++V Sbjct: 251 GYLKVPENDYASLMNAVATQGPLVISVDASN--FHDYESGVFHGCDGADNVDINHAVVLV 308 Query: 460 GYGNDEQGVEYWLLKNCWAARW 525 GYG DE+ +YW+++N W R+ Sbjct: 309 GYGTDEKEGDYWIVRNSWGTRF 330 Score = 66.5 bits (155), Expect = 4e-10 Identities = 34/93 (36%), Positives = 50/93 (53%), Gaps = 7/93 (7%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQY----GNNGCNGGLMDN 169 KDQG CGSCW+F+TT +E +G L +LS Q L+ C + G GCNG + + Sbjct: 149 KDQGHCGSCWAFATTAVIESYAAIATGQLKTLSTQQLVSCVQNSYQCGGQGGCNGAVSEL 208 Query: 170 AFKYIKDNGGIDTEQTY---PYEGVDDKCRYIP 259 A+ Y++ G+ +E Y Y+G C + P Sbjct: 209 AYNYVQ-LFGLTSEYKYSYSSYQGQTGNCTFDP 240 >UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 344 Score = 79.0 bits (186), Expect = 8e-14 Identities = 39/87 (44%), Positives = 49/87 (56%), Gaps = 3/87 (3%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGY-LVSLSEQNLIDC--SEQYGNNGCNGGLMDNA 172 K QGKCGSCW+F++T LE F ++G L + SEQ ++DC Y +NGCNGG A Sbjct: 151 KQQGKCGSCWTFASTAVLESFSFIKNGAPLTNFSEQQILDCVYGSGYYSNGCNGGFGSEA 210 Query: 173 FKYIKDNGGIDTEQTYPYEGVDDKCRY 253 Y NG Q YPY G C+Y Sbjct: 211 LNYAIQNGIAPLSQ-YPYVGKQQGCKY 236 Score = 50.4 bits (115), Expect = 3e-05 Identities = 23/60 (38%), Positives = 35/60 (58%) Frame = +1 Query: 346 PVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 P+ V +DA T +Q Y SGV+N + ++ L+H VL+VGY + W++KN W W Sbjct: 266 PIGVVVDA--TKWQFYRSGVFNSCDNNNVNLNHEVLLVGYDANHN----WIIKNSWGVGW 319 
>UniRef50_Q231X3 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 323 Score = 79.0 bits (186), Expect = 8e-14 Identities = 42/87 (48%), Positives = 49/87 (56%), Gaps = 3/87 (3%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHF--RQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAF 175 K QG CGSCW+FS T ++E + +SLSEQ LIDCS YGN GC G + A Sbjct: 131 KSQGNCGSCWAFSATASVESALIIAGKVDKSISLSEQQLIDCSGDYGNYGCAAGQKEQAL 190 Query: 176 KYIKDNGGIDTEQTYPYEGVD-DKCRY 253 YIK I TEQ YPY D KC + Sbjct: 191 VYIK-RYSITTEQNYPYTEKDVQKCYF 216 >UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|Rep: Phospholipase A1 - Tetrahymena thermophila Length = 320 Score = 78.6 bits (185), Expect = 1e-13 Identities = 40/88 (45%), Positives = 52/88 (59%), Gaps = 5/88 (5%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHF---RQSGYLVSLSEQNLIDC--SEQYGNNGCNGGLMD 166 K+QG CGSCW+FST GA+E + + ++L+EQ +DC S +Y + GCNGG M Sbjct: 128 KNQGSCGSCWAFSTIGAVESALWIAGQGEQNTLNLAEQEQVDCAKSPKYDSEGCNGGWMV 187 Query: 167 NAFKYIKDNGGIDTEQTYPYEGVDDKCR 250 FKYI DN I YPY D KC+ Sbjct: 188 EGFKYIIDN-KISQTANYPYTAKDGKCK 214 Score = 52.0 bits (119), Expect = 1e-05 Identities = 32/80 (40%), Positives = 48/80 (60%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGY 465 + +IP+GD L A+ GP+SVA+DA T+FQ Y+SGV+ C + L+HGVL+V Sbjct: 227 YAEIPQGDCNSLNSALEQ-GPISVAVDA--TNFQFYTSGVFK--NCKA-NLNHGVLLV-- 278 Query: 466 GNDEQGVEYWLLKNCWAARW 525 N + ++ +KN W W Sbjct: 279 ANVDSSLK---IKNSWGPSW 295 >UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 291 Score = 78.6 bits (185), Expect = 1e-13 Identities = 39/105 (37%), Positives = 53/105 (50%) Frame = +1 Query: 211 ADLPLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQL 390 +D P +G + K FV +P G E+ L V 
G V +D S SFQL Sbjct: 164 SDYPYQGVDGACKFDAKTAMPVTSNFVSVPSGSERDLANYVYQYGVAVVVLDCSRISFQL 223 Query: 391 YSSGVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 YSSG+Y++ CSS LDH + VVGY + YW+++N W W Sbjct: 224 YSSGIYSDPCCSSQNLDHAMNVVGYSD-----SYWIIRNSWGTSW 263 Score = 76.6 bits (180), Expect = 4e-13 Identities = 40/100 (40%), Positives = 54/100 (54%), Gaps = 4/100 (4%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 +DQ +CGSCW+F T A E + L LSEQN+IDC+ GC GG++ A + Sbjct: 94 RDQKQCGSCWAFGTVAACESNYALLYSNLPQLSEQNIIDCATTC--YGCGGGIIQAAMSF 151 Query: 182 I--KDNGGIDTEQTYPYEGVDDKCRYIPRT--PVLRTWAS 289 I K G I YPY+GVD C++ +T PV + S Sbjct: 152 IINKQGGAIMKLSDYPYQGVDGACKFDAKTAMPVTSNFVS 191 >UniRef50_Q22DX2 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 358 Score = 78.2 bits (184), Expect = 1e-13 Identities = 34/79 (43%), Positives = 52/79 (65%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 ++QG+CGSCW+FST+GA+E + + ++LS+Q L+DC Y + GC+GG ++AFKY Sbjct: 165 ENQGQCGSCWAFSTSGAVESYYSAKKNITLNLSKQQLVDC--VYDHGGCDGGWFNDAFKY 222 Query: 182 IKDNGGIDTEQTYPYEGVD 238 I+ G + YPY D Sbjct: 223 IQSVGIVLNATYYPYINKD 241 Score = 54.8 bits (126), Expect = 1e-06 Identities = 28/95 (29%), Positives = 51/95 (53%) Frame = +1 Query: 241 QVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDE 420 Q+ PK T + E D + +A+ G +S+A+DA++ + Y SG++ + E Sbjct: 247 QLSKLPKGTSFYQIQGYKKLENDTSVIKQAIMQNGALSIAVDATY--WANYKSGIFTQKE 304 Query: 421 CSSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 +++H V ++G+G+D YWLL+N W + W Sbjct: 305 --KPQINHAVTLIGWGSD-----YWLLRNSWGSSW 332 >UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Rep: Cysteine protease - Babesia equi Length = 438 Score = 78.2 bits (184), Expect = 1e-13 Identities = 34/82 (41%), Positives = 50/82 (60%) Frame = +2 Query: 2 
KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG CGSCW+F+ G++E + + G + LSEQ L++C E +NGC G L + A +Y Sbjct: 240 KDQGNCGSCWAFAAVGSVESLYLIKKGQALDLSEQELVNCEE--NSNGCEGDLPNKALEY 297 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 IK GI + PY +++C Sbjct: 298 IKAK-GISHSKDLPYHAANEEC 318 Score = 52.8 bits (121), Expect = 6e-06 Identities = 28/63 (44%), Positives = 37/63 (58%), Gaps = 1/63 (1%) Frame = +1 Query: 340 VGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDE-QGVEYWLLKNCWA 516 V P VAI AS F Y G++ EC+ EL+H VL+VG G+DE G +W++KN W Sbjct: 346 VSPTIVAIAASK-EFTAYKGGIFT-GECAP-ELNHAVLLVGEGHDEATGKRFWIVKNSWG 402 Query: 517 ARW 525 W Sbjct: 403 TDW 405 >UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative uncharacterized protein - Oryza sativa subsp. japonica (Rice) Length = 326 Score = 77.8 bits (183), Expect = 2e-13 Identities = 40/84 (47%), Positives = 52/84 (61%) Frame = +1 Query: 274 EDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVL 453 + FVD DE+ L +AV + GPVSV I+AS+ F +Y GV++ C TEL+H VL Sbjct: 205 DSYSFVD--PNDEEALKQAVYSQGPVSVLIEASY-EFMIYQGGVFS-GPCG-TELNHAVL 259 Query: 454 VVGYGNDEQGVEYWLLKNCWAARW 525 VVGY E G YW++KN W A W Sbjct: 260 VVGYDETEDGTPYWIVKNSWGAGW 283 Score = 45.2 bits (102), Expect = 0.001 Identities = 19/35 (54%), Positives = 25/35 (71%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQ 106 KDQG CGSCW+FS A+EG + +G ++LSEQ Sbjct: 133 KDQGPCGSCWAFSVVEAVEGINEIMTGNFLTLSEQ 167 >UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endopterygota|Rep: Cathepsin L-like proteinase - Bombyx mori (Silk moth) Length = 402 Score = 77.8 bits (183), Expect = 2e-13 Identities = 36/77 (46%), Positives = 53/77 (68%) Frame = +1 Query: 295 IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGND 474 +P GDE+ + +A+ATVGP++VA++A+ +FQLY SGVY++ C S L+H +L+VGY D Sbjct: 306 LPSGDEEAMEKALATVGPLAVAVNAAPFTFQLY-SGVYDDPFCVSWHLNHAMLLVGYTQD 364 Query: 475 EQGVEYWLLKNCWAARW 525 YW+L N W W Sbjct: 365 
-----YWILLNWWGRNW 376 Score = 73.7 bits (173), Expect = 3e-12 Identities = 33/84 (39%), Positives = 51/84 (60%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 ++Q +CG+C++F+ T AL+ Q +++ G LS Q ++DCS + GN GC+GG + A +Y Sbjct: 209 EEQWQCGACYAFAVTHALQAQLYKRHGEWNELSPQQIVDCSIKDGNMGCDGGSLRGALRY 268 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRY 253 G+ E YPY G CRY Sbjct: 269 AA-REGLVMESHYPYVGKKGYCRY 291 >UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_56, whole genome shotgun sequence - Paramecium tetraurelia Length = 314 Score = 77.8 bits (183), Expect = 2e-13 Identities = 40/88 (45%), Positives = 57/88 (64%), Gaps = 4/88 (4%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYL---VSLSEQNLIDCSEQYGNNGCNGGLMDNA 172 K+QG CGSCW+FS GA+E G + + LSEQ L+DC ++ NNGCNGG + Sbjct: 126 KNQGNCGSCWAFSAVGAVE-TLLTIKGVISKDLWLSEQQLVDC-DKGTNNGCNGGFENLG 183 Query: 173 FKYIKDNGGIDTEQTYPYEGVDDK-CRY 253 ++ K N G+ T++ YPY+GV +K C+Y Sbjct: 184 IQWAKKN-GLTTDKQYPYDGVQNKQCKY 210 Score = 48.8 bits (111), Expect = 9e-05 Identities = 25/60 (41%), Positives = 37/60 (61%) Frame = +1 Query: 346 PVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 P++VA+DA+ S+Q Y SGV+ + C+ L+H VL G+ E GV W++KN W W Sbjct: 236 PITVAVDAN--SWQNYKSGVFTK--CTYKSLNHAVLATGF--QEDGV--WIIKNSWGTSW 287 >UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n=1; Toxocara canis|Rep: Cathepsin L-like cysteine proteinase - Toxocara canis (Canine roundworm) Length = 360 Score = 77.0 bits (181), Expect = 3e-13 Identities = 36/75 (48%), Positives = 49/75 (65%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K Q KCGSCW+F+T G +E + +G L SLSEQ L+DC+ + NN C+GG +D A +Y Sbjct: 161 KSQFKCGSCWAFATVGTVESAYALGTGELRSLSEQQLLDCNLE--NNACDGGDVDKALRY 218 Query: 182 IKDNGGIDTEQTYPY 226 + D G+ E YPY Sbjct: 219 VYDE-GLMREYDYPY 232 Score = 46.8 bits (106), Expect = 4e-04 
Identities = 24/73 (32%), Positives = 40/73 (54%), Gaps = 4/73 (5%) Frame = +1 Query: 307 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNED--ECSSTELD-HGVLVVGYGN-D 474 DE +++ + GPV+V I+ + + Y GVY D EC + + H + +VGYG + Sbjct: 258 DEASIIDWLLHYGPVNVGINVT-ADMKAYKGGVYTPDKWECENKIIGTHSINIVGYGTWN 316 Query: 475 EQGVEYWLLKNCW 513 +YW++KN W Sbjct: 317 ATNQKYWIVKNSW 329 >UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_36, whole genome shotgun sequence - Paramecium tetraurelia Length = 307 Score = 77.0 bits (181), Expect = 3e-13 Identities = 38/83 (45%), Positives = 49/83 (59%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGSCW+FS GA+EG + G+ LSEQ L+DC+ G GCNGG D A Y Sbjct: 122 KNQGNCGSCWTFSAIGAVEGFLAIRKGFKGVLSEQQLVDCAVDAG-EGCNGGNSDLALDY 180 Query: 182 IKDNGGIDTEQTYPYEGVDDKCR 250 I + G + E+ Y Y D C+ Sbjct: 181 IAEVGSV-YERDYEYTAKDGVCK 202 >UniRef50_A2F4G0 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 203 Score = 76.6 bits (180), Expect = 4e-13 Identities = 34/73 (46%), Positives = 48/73 (65%) Frame = +1 Query: 307 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGV 486 +E L AV+ VG +V++DAS TSFQLY SG+Y E +CS+ +D + VGYG E Sbjct: 104 NETALALAVSLVGVATVSVDASRTSFQLYQSGIYYEPDCSTETMDLSMACVGYGT-EGTT 162 Query: 487 EYWLLKNCWAARW 525 YW++KNC+ +W Sbjct: 163 NYWIVKNCFGDKW 175 Score = 53.2 bits (122), Expect = 4e-06 Identities = 33/105 (31%), Positives = 50/105 (47%), Gaps = 5/105 (4%) Frame = +2 Query: 11 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYI-- 184 G C + W+F T A E Q Q L+ LS Q L+DC + + GC GG +K I Sbjct: 4 GACAASWAFGTIAAEESQWAIQKDQLLVLSSQCLVDCVQL--SFGCGGGWPSGTYKSIMK 61 Query: 185 KDNGGIDTEQTYPYEGVDDKCRY--IPR-TPVLRTWASWTSPRAT 310 + NG + YPY C++ +P+ P++ T+ + T T Sbjct: 62 
QFNGTFILDSDYPYTAKRGVCKFDSMPKAAPIMTTYGTTTKYNET 106 >UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanensis|Rep: Sui m 1 allergen - Suidasia medanensis Length = 336 Score = 76.6 bits (180), Expect = 4e-13 Identities = 34/88 (38%), Positives = 54/88 (61%), Gaps = 5/88 (5%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSE-----QYGNNGCNGGLMD 166 ++QG+CGSCW+F+T +E Q+ + V+LSEQ L+DC QY ++GC GG Sbjct: 130 RNQGQCGSCWAFATAATVEAQYAIRKNVHVTLSEQQLVDCDHRPFQGQYEDHGCQGGNPI 189 Query: 167 NAFKYIKDNGGIDTEQTYPYEGVDDKCR 250 A+ Y++ G ++ E YPY+ D +C+ Sbjct: 190 IAYAYVQQTGLVE-ESAYPYQARDGQCQ 216 Score = 62.1 bits (144), Expect = 9e-09 Identities = 25/72 (34%), Positives = 44/72 (61%) Frame = +1 Query: 310 EQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGVE 489 ++ +M ++ +GP++V I AS F+ Y +GV +S +++H V +VG+G E G + Sbjct: 240 DETIMNSLHQIGPMAVLIFASDNEFRFYRNGVIQNLRPNSRQINHAVTLVGWGT-EDGQD 298 Query: 490 YWLLKNCWAARW 525 YW++KN W W Sbjct: 299 YWIVKNSWGPSW 310 >UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 514 Score = 76.2 bits (179), Expect = 5e-13 Identities = 35/83 (42%), Positives = 48/83 (57%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 + QG CGSC++ + GA+EG +F ++G L LS Q +IDCS GN GC GG + A + Sbjct: 319 RGQGICGSCYALAAVGAVEGAYFMKTGKLKELSAQQVIDCSWGSGNRGCKGGYYNKAMSW 378 Query: 182 IKDNGGIDTEQTYPYEGVDDKCR 250 I +G E PY G + CR Sbjct: 379 IYLHGIASAESYGPYLGQEGTCR 401 Score = 68.1 bits (159), Expect = 1e-10 Identities = 34/81 (41%), Positives = 47/81 (58%), Gaps = 1/81 (1%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECS-STELDHGVLVVG 462 F +P+ + L +VA GP V+I+ + S + YS G+Y++ EC T H VLVVG Sbjct: 414 FAFVPKYNNTALKISVARFGPAVVSINENPLSLKFYSWGLYDDPECGRDTAAVHSVLVVG 473 Query: 463 YGNDEQGVEYWLLKNCWAARW 525 YG E G YWL+KN W+ W Sbjct: 474 YG-VEDGEPYWLVKNSWSTTW 493 >UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease 
containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 325 Score = 75.8 bits (178), Expect = 7e-13 Identities = 40/89 (44%), Positives = 52/89 (58%), Gaps = 6/89 (6%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQ---SGYLVSLSEQNLIDCSEQYGNN---GCNGGLM 163 KDQG+CGSC++FSTTGA+E +SLSEQ ++DC ++ N GC G M Sbjct: 132 KDQGRCGSCYAFSTTGAIESALLISGVGEANTLSLSEQEIVDCVKEPEYNQLGGCQDGYM 191 Query: 164 DNAFKYIKDNGGIDTEQTYPYEGVDDKCR 250 D +FKYI N I YPY V+ KC+ Sbjct: 192 DESFKYIIKN-KISKAADYPYTAVEGKCK 219 Score = 50.4 bits (115), Expect = 3e-05 Identities = 34/80 (42%), Positives = 46/80 (57%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGY 465 +VD+P GD + L+ A+ PVSVAIDA + Q Y+SGVY+ CS L H VL+VGY Sbjct: 232 YVDVPSGDCKALLTALQD-HPVSVAIDAK--NLQYYTSGVYS--NCSD-NLTHAVLLVGY 285 Query: 466 GNDEQGVEYWLLKNCWAARW 525 + LKN W ++ Sbjct: 286 SSSA-----LKLKNSWGTQF 300 >UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza sativa|Rep: Os09g0381400 protein - Oryza sativa subsp. 
japonica (Rice) Length = 362 Score = 75.8 bits (178), Expect = 7e-13 Identities = 34/77 (44%), Positives = 45/77 (58%) Frame = +2 Query: 17 CGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNG 196 C SCW+F T +E + ++G LVSLSEQ L+DC G GCN G A+K++ +NG Sbjct: 166 CSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDG--GCNLGSYGRAYKWVVENG 223 Query: 197 GIDTEQTYPYEGVDDKC 247 G+ TE YPY C Sbjct: 224 GLTTEADYPYTARRGPC 240 Score = 63.7 bits (148), Expect = 3e-09 Identities = 37/86 (43%), Positives = 45/86 (52%), Gaps = 1/86 (1%) Frame = +1 Query: 271 AEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGV 450 A+ GF +P +E L AVA PV+VAI+ + Q Y GVY C T L H V Sbjct: 250 AKITGFGKVPPRNEAALQAAVARQ-PVAVAIEVG-SGMQFYKGGVYT-GPCG-TRLAHAV 305 Query: 451 LVVGYGND-EQGVEYWLLKNCWAARW 525 VVGYG D G +YW +KN W W Sbjct: 306 TVVGYGTDASSGAKYWTIKNSWGQSW 331 >UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckeia)|Rep: Berghepain-2 - Plasmodium yoelii yoelii Length = 472 Score = 75.8 bits (178), Expect = 7e-13 Identities = 33/75 (44%), Positives = 49/75 (65%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQ KC SCW+F+T G + Q+ + VSLSEQ L+DC++ N GC+GG++ AF+ Sbjct: 266 KDQQKCASCWAFATAGVVAAQYAIRKNQKVSLSEQQLVDCAQ--NNFGCDGGILPYAFED 323 Query: 182 IKDNGGIDTEQTYPY 226 + D G+ ++ YPY Sbjct: 324 LIDMNGLCEDKYYPY 338 >UniRef50_Q26988 Cluster: Cysteine proteinase 4; n=1; Tritrichomonas foetus|Rep: Cysteine proteinase 4 - Tritrichomonas foetus (Trichomonas foetus) Length = 152 Score = 75.8 bits (178), Expect = 7e-13 Identities = 33/62 (53%), Positives = 44/62 (70%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVG 462 GF+ + E+ L + VA+VGP++V IDAS SF YSSG+YN+ +CSST LDH V +G Sbjct: 86 GFMSVQAQSEEDLFKCVASVGPIAVCIDASLASFNSYSSGIYNDRQCSSTVLDHAVGCIG 145 Query: 463 YG 468 YG Sbjct: 146 YG 147 Score = 59.7 bits (138), Expect = 5e-08 Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 3/79 (3%) Frame = +2 Query: 32 
SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIK--DNGGID 205 +F+TT +E + + L S SEQNL+DC Q +NGC GG +AF +I NG I+ Sbjct: 1 AFATTQCMESINALRFKSLFSFSEQNLVDCDPQ--SNGCAGGSPFSAFMFISRTQNGQIN 58 Query: 206 TEQTYPYEGVD-DKCRYIP 259 E YPY G D + C++ P Sbjct: 59 LEDDYPYTGTDTNDCKFDP 77 >UniRef50_Q22LI1 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 987 Score = 75.8 bits (178), Expect = 7e-13 Identities = 37/87 (42%), Positives = 47/87 (54%), Gaps = 5/87 (5%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC-----SEQYGNNGCNGGLMD 166 K QGKCGSCWSFS G +E + ++G L+ LSEQ L+DC + Y +NGCNGG Sbjct: 139 KRQGKCGSCWSFSAAGLMEAFQYFKTGNLIDLSEQQLVDCDNSSFDKSYYSNGCNGGYPQ 198 Query: 167 NAFKYIKDNGGIDTEQTYPYEGVDDKC 247 A +Y G + YPY C Sbjct: 199 EAVEYASKYGIVPLTD-YPYVKQQQPC 224 >UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 234 Score = 75.4 bits (177), Expect = 9e-13 Identities = 37/86 (43%), Positives = 51/86 (59%), Gaps = 2/86 (2%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQ CGSCW+F + A+E F + G L SLSEQ L+DC + GC+G L AF+Y Sbjct: 34 KDQKHCGSCWAFGSCAAMESSWFLKHGTLYSLSEQCLVDCC--HDCLGCHGCLPSLAFEY 91 Query: 182 IK--DNGGIDTEQTYPYEGVDDKCRY 253 +K +G +TE YPY+ C++ Sbjct: 92 VKIFMHGLFETEDNYPYQAEHHSCKF 117 Score = 73.3 bits (172), Expect = 4e-12 Identities = 33/75 (44%), Positives = 46/75 (61%) Frame = +1 Query: 301 EGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQ 480 + +E +L VA GP +V I+A F+LYSSGV++ +C LDH V V+GYG E Sbjct: 133 KSNEDQLKTEVAANGPYAVMINADSEQFRLYSSGVFDNPKCGKIILDHVVTVIGYG-VED 191 Query: 481 GVEYWLLKNCWAARW 525 G +YWL++N W W Sbjct: 192 GKDYWLVRNSWGKYW 206 >UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba histolytica|Rep: 
Cysteine protease 19 - Entamoeba histolytica Length = 324 Score = 74.9 bits (176), Expect = 1e-12 Identities = 29/75 (38%), Positives = 49/75 (65%) Frame = +1 Query: 301 EGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQ 480 +GD++K+ + + GPV A+DAS +SF LY G+YN+ +C S + V++VGYG D+ Sbjct: 219 KGDDEKVRSEILSYGPVGSAMDASRSSFLLYHGGIYNDKKCRSDKSTIAVVIVGYGIDKN 278 Query: 481 GVEYWLLKNCWAARW 525 +Y++++N W W Sbjct: 279 NGKYFIVRNSWGPYW 293 Score = 56.0 bits (129), Expect = 6e-07 Identities = 30/89 (33%), Positives = 46/89 (51%), Gaps = 7/89 (7%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEG------QHFRQSGYLVSLSEQNLIDCSEQYGN-NGCNGGL 160 KDQG CGSC++FS+ +E S Y +S +E ++ C GC GG Sbjct: 116 KDQGNCGSCYAFSSVALMETAVLLSYDDLSPSNYALSTAE--IVSCCYDPSECRGCEGGS 173 Query: 161 MDNAFKYIKDNGGIDTEQTYPYEGVDDKC 247 + A KY +DN G+ +E ++PY+ + C Sbjct: 174 IGGALKYAQDN-GMQSESSFPYKPFEQHC 201 >UniRef50_Q22RC9 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 389 Score = 74.9 bits (176), Expect = 1e-12 Identities = 37/81 (45%), Positives = 51/81 (62%), Gaps = 6/81 (7%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC------SEQYGNNGCNGGLM 163 K+QG G+CW+FSTTG +EGQ F LVSLSE+ ++DC S + + G GG Sbjct: 141 KNQGTVGTCWTFSTTGNIEGQWFLAGNPLVSLSEEQIVDCDGSQEPSTGHADCGVFGGWP 200 Query: 164 DNAFKYIKDNGGIDTEQTYPY 226 AF Y+ + GG+ +E+TYPY Sbjct: 201 YLAFDYVINAGGLPSEETYPY 221 Score = 73.3 bits (172), Expect = 4e-12 Identities = 33/73 (45%), Positives = 46/73 (63%) Frame = +1 Query: 307 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGV 486 DE + + + +GP+SVA+DAS+ F Y G+ CS T L+H VL+ GYG D GV Sbjct: 275 DEDSIKQQLFEIGPLSVALDASYLQF--YKKGISAPKFCSKTTLNHAVLLTGYGID-NGV 331 Query: 487 EYWLLKNCWAARW 525 E+W +KN W A+W Sbjct: 332 EFWNVKNSWGAKW 344 >UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; Phytophthora infestans|Rep: Cathepsin-like cysteine protease - 
Phytophthora infestans (Potato late blight fungus) Length = 376 Score = 74.5 bits (175), Expect = 2e-12 Identities = 37/82 (45%), Positives = 49/82 (59%), Gaps = 2/82 (2%) Frame = +1 Query: 286 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSST--ELDHGVLVV 459 + ++ GDE L A+AT G +VAIDAS +FQLY GVY+ C + LDHGV Sbjct: 247 YANVTSGDEAALQAAIATKGVQAVAIDASSFTFQLYRHGVYSWPLCGNAPDALDHGVAAA 306 Query: 460 GYGNDEQGVEYWLLKNCWAARW 525 GYG ++ +YWL+KN W W Sbjct: 307 GYGVYKK-KDYWLVKNSWGNSW 327 Score = 67.7 bits (158), Expect = 2e-10 Identities = 36/78 (46%), Positives = 48/78 (61%), Gaps = 3/78 (3%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCN-GGLMDNAFK 178 K+QG+CGSCW+FS A+E + +G L SLSEQ L+DC+ G + CN GG M ++ Sbjct: 149 KNQGQCGSCWAFSAVAAMECAYALSTGTLESLSEQELVDCTLN-GIDTCNHGGEMSEGYE 207 Query: 179 YIKDN--GGIDTEQTYPY 226 I N G ID E+ Y Y Sbjct: 208 EIITNHKGKIDREEVYRY 225 >UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H-like cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L, S or H-like cysteine peptidase - Trichomonas vaginalis G3 Length = 473 Score = 74.5 bits (175), Expect = 2e-12 Identities = 37/93 (39%), Positives = 49/93 (52%), Gaps = 1/93 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFK- 178 +DQ CGSCW+F T +LE Q ++G LS ++DC+ Y N+ C GG AF+ Sbjct: 268 RDQVACGSCWAFGTAESLESQLALKTGVFRELSVNQIMDCTWDYNNSACGGGEAGPAFRS 327 Query: 179 YIKDNGGIDTEQTYPYEGVDDKCRYIPRTPVLR 277 I N + E+ YPY GV C P PV R Sbjct: 328 LINQNFKLFLEKDYPYIGVAGYCNRNPEHPVAR 360 Score = 51.6 bits (118), Expect = 1e-05 Identities = 33/108 (30%), Positives = 50/108 (46%), Gaps = 2/108 (1%) Frame = +1 Query: 214 DLPLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLY 393 D P G +P++ A V + I + Q L EA+ GP S+ I+ S Y Sbjct: 340 DYPYIGVAGYCNRNPEHPVARVVDCIAIDKST-QALKEALYQYGPASIGINVIE-SMSFY 397 Query: 394 SSGVYNEDECSST--ELDHGVLVVGYGNDEQGVEYWLLKNCWAARWAN 531 + G N+ C+ +L H VL+ G+ G+E W +KN W+ W N Sbjct: 398 
TKGAVNDPTCTGAADDLVHEVLLTGW-KIVDGIECWEIKNSWSTHWGN 444 >UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 350 Score = 74.1 bits (174), Expect = 2e-12 Identities = 38/101 (37%), Positives = 57/101 (56%), Gaps = 5/101 (4%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC-SEQYG--NNGCNGGLMDNA 172 K+QG+CGSCW+F+T G LE + + + SEQ+++DC S YG ++GCNGG Sbjct: 156 KNQGQCGSCWTFATAGVLESYYALKYQQSLIFSEQDIVDCASRSYGYQSDGCNGGFPSEG 215 Query: 173 FKYIKDNGGIDTEQTYPYEGVDDKCRYI--PRTPVLRTWAS 289 +Y G + ++ YPY V CR + PR +L + S Sbjct: 216 LQYASTVGLVQSDY-YPYVAVQGTCRQVNAPRYQLLDQYYS 255 Score = 48.0 bits (109), Expect = 2e-04 Identities = 25/69 (36%), Positives = 39/69 (56%), Gaps = 1/69 (1%) Frame = +1 Query: 322 MEAVATVGPVSVAIDASHTSFQLYSSGVYNE-DECSSTELDHGVLVVGYGNDEQGVEYWL 498 ++ T P +V +DAS ++Q Y+SGVYN + +L+H V+ VGY D G W+ Sbjct: 263 LQYAITRAPTAVGVDAS--TWQFYNSGVYNGCGKTQRNQLNHAVIAVGY--DAYG--NWI 316 Query: 499 LKNCWAARW 525 ++N W W Sbjct: 317 IRNSWGTSW 325 >UniRef50_UPI000155637A Cluster: PREDICTED: similar to ENSANGP00000013730, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to ENSANGP00000013730, partial - Ornithorhynchus anatinus Length = 229 Score = 73.7 bits (173), Expect = 3e-12 Identities = 36/60 (60%), Positives = 42/60 (70%), Gaps = 1/60 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHF-RQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFK 178 KDQ CGSCWSF+TTG LEG F + + LV LS+Q LIDCS GN GC+GGL AF+ Sbjct: 71 KDQAVCGSCWSFATTGTLEGALFLKVTVQLVPLSQQMLIDCSWDVGNFGCDGGLEWQAFR 130 Score = 55.6 bits (128), Expect = 8e-07 Identities = 26/56 (46%), Positives = 35/56 (62%), Gaps = 2/56 (3%) Frame = +1 Query: 370 SHTSFQLYSSGVYNEDECSST--ELDHGVLVVGYGNDEQGVEYWLLKNCWAARWAN 531 S SF Y++G+Y E +C +L+H VL+VGYG QG +WLLKN W+ W N Sbjct: 150 SPRSFAFYANGIYYEPQCRHKLEQLNHAVLLVGYGV-LQGQAFWLLKNSWSPLWGN 204 >UniRef50_A0BEB2 Cluster: Chromosome 
undetermined scaffold_101, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_101, whole genome shotgun sequence - Paramecium tetraurelia Length = 306 Score = 73.7 bits (173), Expect = 3e-12 Identities = 38/99 (38%), Positives = 49/99 (49%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+QG CGS WSFS GA E G SEQNL+DC ++GC+GG A Y Sbjct: 124 KNQGTCGSGWSFSAVGAFEAFFIFVKGTHFQYSEQNLVDCDT--NSHGCDGGYPAKAIDY 181 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIPRTPVLRTWASWTS 298 + NG E YPY +KCR + + +WT+ Sbjct: 182 LNKNGAF-LESEYPYVASKEKCRKTQGSTKANSRKTWTT 219 Score = 40.7 bits (91), Expect = 0.025 Identities = 20/67 (29%), Positives = 40/67 (59%) Frame = +1 Query: 325 EAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLK 504 EA+A P+SV++ +S+ ++ Y+ G+++ C +T +H + VGY + + WL++ Sbjct: 225 EAIAQY-PISVSVQSSN--WKGYTGGIFSN--CINTSTNHAAVAVGYDSKKN----WLIR 275 Query: 505 NCWAARW 525 N W + W Sbjct: 276 NSWGSDW 282 >UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Bigelowiella natans|Rep: Digestive cysteine proteinase - Bigelowiella natans (Pedinomonas minutissima) (Chlorarachnion sp.(strain CCMP 621)) Length = 360 Score = 73.3 bits (172), Expect = 4e-12 Identities = 35/79 (44%), Positives = 50/79 (63%), Gaps = 4/79 (5%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHF-RQSGYL---VSLSEQNLIDCSEQYGNNGCNGGLMDN 169 KDQG CGSCW+FS T ALE H+ + + L ++LS + L++C + + C GG + Sbjct: 125 KDQGGCGSCWAFSATQALESAHYIKHNDTLDSPIALSTEQLVECDQH--DYACYGGFPRD 182 Query: 170 AFKYIKDNGGIDTEQTYPY 226 A KYIK++GG+ E YPY Sbjct: 183 AMKYIKESGGLVAEADYPY 201 Score = 65.3 bits (152), Expect = 1e-09 Identities = 32/75 (42%), Positives = 44/75 (58%), Gaps = 2/75 (2%) Frame = +1 Query: 307 DEQKLMEAVATVGPVSVAIDASH--TSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQ 480 DE K+ +A P+SV+IDA + Q Y GV N CS T L+H VL+VG+G D Sbjct: 260 DEDKIASYLALKHPLSVSIDAGEGLSWMQFYKHGVANPRFCSKTSLNHAVLLVGFGVD-G 318 Query: 481 GVEYWLLKNCWAARW 525 G +W++KN W +W Sbjct: 319 GKAFWIVKNSWGEKW 
333 >UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|Rep: Thiol protease - Triticum aestivum (Wheat) Length = 374 Score = 72.9 bits (171), Expect = 5e-12 Identities = 37/84 (44%), Positives = 48/84 (57%), Gaps = 2/84 (2%) Frame = +2 Query: 2 KDQGK-CGSCWSFSTTGALEGQH-FRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAF 175 K+QGK CG+CW+FS +E + + G LSEQ LIDC + GC G M NA+ Sbjct: 160 KNQGKVCGACWAFSAVATIESAYAIAKRGEPPVLSEQELIDCDTF--DRGCTSGEMYNAY 217 Query: 176 KYIKDNGGIDTEQTYPYEGVDDKC 247 ++ NGGI TYPY+ D KC Sbjct: 218 FWVLRNGGIANSSTYPYKETDGKC 241 Score = 55.6 bits (128), Expect = 8e-07 Identities = 30/82 (36%), Positives = 45/82 (54%), Gaps = 10/82 (12%) Frame = +1 Query: 310 EQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNE---------DECSSTELDHGVLVVG 462 E++LM AVA V PV+V D++ F+ Y +G+Y+ CSS + H + +VG Sbjct: 264 EEQLMAAVA-VRPVAVGFDSNDECFKFYQAGLYDGMCIKHGEYFGPCSSNDRIHSLAIVG 322 Query: 463 Y-GNDEQGVEYWLLKNCWAARW 525 Y G V+YW+ KN W +W Sbjct: 323 YAGKGGDRVKYWIAKNSWGEKW 344 >UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba histolytica|Rep: Cysteine protease 10 - Entamoeba histolytica Length = 297 Score = 72.9 bits (171), Expect = 5e-12 Identities = 34/94 (36%), Positives = 57/94 (60%), Gaps = 2/94 (2%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYL-VSLSEQNLIDCSE-QYGNNGCNGGLMDNAF 175 K+Q KC SC++F + +E +++ + LSEQ ++DCS+ +Y N GC G + N+F Sbjct: 124 KNQRKCASCYAFGSIATIESLIMQETSIKEIDLSEQQIVDCSQGEYSNWGCTCGNVGNSF 183 Query: 176 KYIKDNGGIDTEQTYPYEGVDDKCRYIPRTPVLR 277 Y++D+ GI E+ YPY G + C + PV++ Sbjct: 184 NYVRDH-GILLERDYPYTGKANNCSIDGKKPVIK 216 Score = 64.5 bits (150), Expect = 2e-09 Identities = 32/84 (38%), Positives = 49/84 (58%) Frame = +1 Query: 274 EDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVL 453 +D FV P+ +E ++ PV+V+ID+S SFQ Y G+Y+E C +DH V Sbjct: 218 KDYSFV-FPQTEEN--LKIAVYHQPVAVSIDSSQLSFQFYEGGIYDEPNCKW--VDHIVT 272 Query: 454 VVGYGNDEQGVEYWLLKNCWAARW 525 VVGYG E+ ++W++KN + W Sbjct: 273 VVGYGTTEEHQDFWVVKNSYGNEW 296 >UniRef50_A0DD55 Cluster: Chromosome undetermined 
scaffold_46, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_46, whole genome shotgun sequence - Paramecium tetraurelia Length = 336 Score = 72.9 bits (171), Expect = 5e-12 Identities = 37/85 (43%), Positives = 49/85 (57%), Gaps = 1/85 (1%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG+C CW+F GA E + ++ V LSEQ LIDC Q + GCNGG + A KY Sbjct: 155 KDQGQCSGCWAFGAVGAAEAWFYVKNKTTVLLSEQQLIDCDTQ--SFGCNGGYQNLALKY 212 Query: 182 IKDNGGIDTEQTYPY-EGVDDKCRY 253 I N G++ + YPY + C+Y Sbjct: 213 IA-NHGLNDARVYPYTQKQSAYCKY 236 >UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep: Viral cathepsin - Cydia pomonella granulosis virus (CpGV) (Cydia pomonellagranulovirus) Length = 333 Score = 72.9 bits (171), Expect = 5e-12 Identities = 35/86 (40%), Positives = 51/86 (59%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+Q +CGSCW+FST +E + + ++LSEQ+L++C NNGC GGLM A + Sbjct: 140 KNQMECGSCWAFSTIANIESLYNIKYDKALNLSEQHLVNCDNI--NNGCAGGLMHWALES 197 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIP 259 I GG+ + + PY G D C+ P Sbjct: 198 ILQEGGVVSAENEPYYGFDGVCKKSP 223 Score = 61.7 bits (143), Expect = 1e-08 Identities = 34/74 (45%), Positives = 44/74 (59%), Gaps = 1/74 (1%) Frame = +1 Query: 307 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTE-LDHGVLVVGYGNDEQG 483 +E KL E + GP+SVAID S Y +G+ D C + E L+H VL+VGYG + Sbjct: 238 NENKLRELLVVNGPISVAIDVS--DLINYKAGI--ADICENNEGLNHAVLLVGYGV-KND 292 Query: 484 VEYWLLKNCWAARW 525 V YW+LKN W A W Sbjct: 293 VPYWILKNSWGAEW 306 >UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease Gip1p; n=4; Tetrahymena thermophila|Rep: Granule-biosynthesis induced protease Gip1p - Tetrahymena thermophila Length = 345 Score = 72.5 bits (170), Expect = 7e-12 Identities = 32/85 (37%), Positives = 52/85 (61%), Gaps = 2/85 (2%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSE--QYGNNGCNGGLMDNAF 175 K+QG CGSCW+F+T G LE + ++ L+ SEQ L+DC Y ++GC+GG 
++ Sbjct: 149 KNQGTCGSCWTFATAGILESFNQIKNKQLLKFSEQQLVDCVSLAGYDSDGCDGGFQEDGV 208 Query: 176 KYIKDNGGIDTEQTYPYEGVDDKCR 250 +Y + G + + + YPY G +C+ Sbjct: 209 RYAIEYGIVQSYK-YPYVGYQGRCK 232 Score = 46.4 bits (105), Expect = 5e-04 Identities = 25/71 (35%), Positives = 44/71 (61%), Gaps = 3/71 (4%) Frame = +1 Query: 322 MEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSST---ELDHGVLVVGYGNDEQGVEY 492 ++A PVS++++A +++ Y GV+ DEC T +L+H V+ VGY D++G Sbjct: 258 LKAALVFSPVSISVNAD--TWKEYYGGVF--DECGYTTEEDLNHAVIAVGY--DQEG--N 309 Query: 493 WLLKNCWAARW 525 W+++N W+A W Sbjct: 310 WIVRNSWSAAW 320 >UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: Vivapain-4 - Plasmodium vivax Length = 484 Score = 72.5 bits (170), Expect = 7e-12 Identities = 35/77 (45%), Positives = 47/77 (61%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+Q CGSCW+F GA+E Q+ + V +SEQ L+DCS++ N GC GGL AF Sbjct: 278 KNQNLCGSCWAFGAVGAVESQYAIRKNQHVLISEQELVDCSDK--NFGCFGGLASLAFDD 335 Query: 182 IKDNGGIDTEQTYPYEG 232 + D G + +E YPY G Sbjct: 336 MIDLGYLCSESDYPYVG 352 >UniRef50_Q23H10 Cluster: Papain family cysteine protease containing protein; n=14; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 336 Score = 72.5 bits (170), Expect = 7e-12 Identities = 37/85 (43%), Positives = 44/85 (51%), Gaps = 3/85 (3%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC---SEQYGNNGCNGGLMDNA 172 K+QG CGSCWSFS +E +F Q+ LV SEQ L+DC + Y + GCNGG Sbjct: 143 KNQGGCGSCWSFSAAAVMESFNFIQNKALVDFSEQQLVDCVIPANGYNSYGCNGGWPVQC 202 Query: 173 FKYIKDNGGIDTEQTYPYEGVDDKC 247 Y GI T YPY V C Sbjct: 203 LDY-ASKVGITTLDKYPYVAVQKNC 226 Score = 52.0 bits (119), Expect = 1e-05 Identities = 29/88 (32%), Positives = 48/88 (54%) Frame = +1 Query: 262 NTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELD 441 + G + ++ IP +++ PVSV +DAS ++ Y SG++N + + L+ Sbjct: 232 DNGFKPKSWIQIPNTSND--LKSALNFSPVSVLVDAS--TWGNYYSGIFNGCDQTHISLN 287 Query: 442 
HGVLVVGYGNDEQGVEYWLLKNCWAARW 525 H VL VGY D+QG W++KN W+ W Sbjct: 288 HAVLAVGY--DQQG--NWIIKNSWSTYW 311 >UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precursor; n=20; Psoroptidia|Rep: Major mite fecal allergen Der f 1 precursor - Dermatophagoides farinae (House-dust mite) Length = 321 Score = 72.5 bits (170), Expect = 7e-12 Identities = 32/81 (39%), Positives = 48/81 (59%) Frame = +2 Query: 8 QGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIK 187 QG CGSCW+FS A E + + LSEQ L+DC+ Q+ GC+G + +YI+ Sbjct: 127 QGGCGSCWAFSGVAATESAYLAYRNTSLDLSEQELVDCASQH---GCHGDTIPRGIEYIQ 183 Query: 188 DNGGIDTEQTYPYEGVDDKCR 250 NG ++ E++YPY + +CR Sbjct: 184 QNGVVE-ERSYPYVAREQRCR 203 Score = 37.5 bits (83), Expect = 0.23 Identities = 22/77 (28%), Positives = 38/77 (49%), Gaps = 2/77 (2%) Frame = +1 Query: 307 DEQKLMEAVA-TVGPVSVAIDASHT-SFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQ 480 D +++ EA+ T ++V I +FQ Y + + H V +VGYG+ Q Sbjct: 222 DVKQIREALTQTHTAIAVIIGIKDLRAFQHYDGRTIIQHDNGYQPNYHAVNIVGYGST-Q 280 Query: 481 GVEYWLLKNCWAARWAN 531 G +YW+++N W W + Sbjct: 281 GDDYWIVRNSWDTTWGD 297 >UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schistosoma japonicum|Rep: Clone ZZD209 mRNA sequence - Schistosoma japonicum (Blood fluke) Length = 339 Score = 72.1 bits (169), Expect = 9e-12 Identities = 32/82 (39%), Positives = 49/82 (59%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 +DQG C ++F+ T + E Q+ + ++LS Q IDC+ YGN GC+GG F Y Sbjct: 137 RDQGSCIGSYAFAVTASTESQYALHTSNHMNLSVQQFIDCTRIYGNMGCHGGYTFTLFIY 196 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 ++ + G++TEQ YP+ G D C Sbjct: 197 LQ-SFGLETEQMYPFTGEDQDC 217 Score = 67.7 bits (158), Expect = 2e-10 Identities = 29/102 (28%), Positives = 51/102 (50%) Frame = +1 Query: 220 PLRGS*RQVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSS 399 P G + + + + +G+ G E L A+ GP ++++ F Y S Sbjct: 209 PFTGEDQDCMANSSDVVVQSIGYKFHRHGYETILKWALYNEGPYVISMNIDE-KFLHYKS 267 Query: 400 GVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 G+Y D 
C+ L+ +L+VGYG D G++YW+++N W +W Sbjct: 268 GIYQSDTCTHYNLNQSMLLVGYGYDNDGIDYWIVQNSWGKKW 309 >UniRef50_A2WPN4 Cluster: Putative uncharacterized protein; n=2; Oryza sativa (indica cultivar-group)|Rep: Putative uncharacterized protein - Oryza sativa subsp. indica (Rice) Length = 325 Score = 71.7 bits (168), Expect = 1e-11 Identities = 35/81 (43%), Positives = 48/81 (59%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVG 462 GF +P DE++L AVA PV+V IDAS FQ Y GVY + C+ ++H V +VG Sbjct: 217 GFAAVPPNDERQLALAVARQ-PVTVYIDASAQEFQFYKGGVY-KGPCNPGSVNHAVTIVG 274 Query: 463 YGNDEQGVEYWLLKNCWAARW 525 Y + G +YW+ KN W+ W Sbjct: 275 YCENFGGEKYWIAKNSWSNDW 295 Score = 46.4 bits (105), Expect = 5e-04 Identities = 19/46 (41%), Positives = 26/46 (56%) Frame = +2 Query: 110 LIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKC 247 ++DC G+ GC+GG D A + GGI +E+ YPY GV C Sbjct: 159 MVDCDT--GSFGCSGGHSDTALNLVASRGGITSEEKYPYTGVQGSC 202 >UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2; Arabidopsis thaliana|Rep: Putative cysteine proteinase - Arabidopsis thaliana (Mouse-ear cress) Length = 365 Score = 71.3 bits (167), Expect = 2e-11 Identities = 37/81 (45%), Positives = 48/81 (59%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVG 462 GF +P +E+ L+EAV PVSV IDA SF Y GVY +C T+++H V +VG Sbjct: 258 GFQMVPSHNERALLEAVRRQ-PVSVLIDARADSFGHYKGGVYAGLDC-GTDVNHAVTIVG 315 Query: 463 YGNDEQGVEYWLLKNCWAARW 525 YG G+ YW+LKN W W Sbjct: 316 YGT-MSGLNYWVLKNSWGESW 335 Score = 64.1 bits (149), Expect = 2e-09 Identities = 29/55 (52%), Positives = 36/55 (65%) Frame = +2 Query: 86 LVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKCR 250 L++LSEQ LIDC + N GCNGG + AFKYI NGG+ E YPY+ + CR Sbjct: 192 LLTLSEQQLIDCDIEK-NGGCNGGEFEEAFKYIIKNGGVSLETEYPYQVKKESCR 245 >UniRef50_A2Q4E7 Cluster: Peptidase C1A, papain; n=1; Medicago truncatula|Rep: Peptidase C1A, papain - Medicago truncatula (Barrel medic) Length = 263 Score = 71.3 bits (167), Expect = 2e-11 Identities = 34/61 
(55%), Positives = 41/61 (67%) Frame = +2 Query: 53 LEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEG 232 +EG SG LVS SEQ L+DC NGCNGG +AFK+I +NGGI TE +YPY+G Sbjct: 187 IEGIQQIISGNLVSFSEQQLVDCVTSNWTNGCNGGNKIDAFKFILENGGIATEASYPYKG 246 Query: 233 V 235 V Sbjct: 247 V 247 >UniRef50_Q7KWP5 Cluster: Similar to Dictyostelium discoideum (Slime mold). Cysteine proteinase 5; n=2; Dictyostelium discoideum|Rep: Similar to Dictyostelium discoideum (Slime mold). Cysteine proteinase 5 - Dictyostelium discoideum (Slime mold) Length = 345 Score = 71.3 bits (167), Expect = 2e-11 Identities = 38/84 (45%), Positives = 50/84 (59%), Gaps = 3/84 (3%) Frame = +2 Query: 11 GKCGSCWSFSTTGALEGQHF--RQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYI 184 G CGS W + GA E HF +SLS QNLIDCS N C G ++ AF+YI Sbjct: 140 GGCGS-WPITAVGATESAHFLANPKDPFISLSMQNLIDCSNL--NKQCYQGTVNEAFQYI 196 Query: 185 KDNGGIDTEQTYPYEGVD-DKCRY 253 +NGGID+E++Y + G + KC+Y Sbjct: 197 IENGGIDSEESYKFSGGEPGKCKY 220 Score = 68.1 bits (159), Expect = 1e-10 Identities = 34/96 (35%), Positives = 55/96 (57%), Gaps = 8/96 (8%) Frame = +1 Query: 262 NTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELD 441 N+ A+ + + G E L AV+ + PV+ IDAS +SFQ YSSG+Y E C+ST+L+ Sbjct: 224 NSVAKITSYEKVKSGSESSLESAVS-LKPVAAYIDASLSSFQFYSSGIYYEPSCNSTDLN 282 Query: 442 HGVLVVGYGND--------EQGVEYWLLKNCWAARW 525 H +L+VG+ + + YW+++N + W Sbjct: 283 HSILIVGFSDFSTTPTDSLKHSSNYWIVQNSFGKNW 318 >UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Rep: Cathepsin W - Xenopus tropicalis (Western clawed frog) (Silurana tropicalis) Length = 303 Score = 70.9 bits (166), Expect = 2e-11 Identities = 34/83 (40%), Positives = 49/83 (59%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+Q C SCW+F+ +E Q + G +SLSEQ +IDC+ NGC+GG +AF Sbjct: 95 KNQRTCHSCWAFAAVANIEAQ-WAILGQTISLSEQQVIDCNTC--RNGCSGGYAWDAFMT 151 Query: 182 IKDNGGIDTEQTYPYEGVDDKCR 250 + GG+ +E++YPY G CR Sbjct: 152 VLQQGGLTSEKSYPYTGHVSNCR 174 Score = 40.7 bits (91), Expect = 0.025 
Identities = 26/91 (28%), Positives = 45/91 (49%), Gaps = 5/91 (5%) Frame = +1 Query: 268 GAEDVGFV---DIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYN--EDECSST 432 G E VG++ ++ + +E + VA G ++V I+ + + Y G+ + C Sbjct: 176 GFEAVGWIHDFEMLKKNETAMASHVAHKGTLTVTINKA--PLKHYQKGIVDTLRSNCDPN 233 Query: 433 ELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 +DH VL+VGY + + W+LKN W W Sbjct: 234 YVDHVVLIVGYRGGGK-LPQWILKNSWGEDW 263 >UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n=3; Brugia malayi|Rep: Cathepsin L-like cysteine proteinase - Brugia malayi (Filarial nematode worm) Length = 353 Score = 70.9 bits (166), Expect = 2e-11 Identities = 36/96 (37%), Positives = 58/96 (60%), Gaps = 6/96 (6%) Frame = +1 Query: 256 PKNTGAE-DVGFVD---IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDEC 423 P+NT G D +P +EQ L + +A GPV V++ +S SF Y SG+YN+ +C Sbjct: 232 PRNTPQRRKYGLADAFYLPPSNEQILKKILALYGPVCVSLHSSLQSFVAYRSGIYNDPKC 291 Query: 424 --SSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 ++ +++H V+ VGYG + G+EY+++KN W W Sbjct: 292 PTNAEKVNHAVIAVGYG-VQNGMEYFIIKNSWGPTW 326 Score = 52.0 bits (119), Expect = 1e-05 Identities = 30/79 (37%), Positives = 45/79 (56%), Gaps = 1/79 (1%) Frame = +2 Query: 5 DQGKCGSCWSFSTTGALEGQ-HFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 DQG+CG C+ FS GALE R V LS Q+++DCS G GG F++ Sbjct: 151 DQGRCGVCFIFSALGALEMYVALRTKKRPVKLSVQDVMDCSGMEKCKG-RGGNEPAVFRW 209 Query: 182 IKDNGGIDTEQTYPYEGVD 238 + ++ G+ T+++YPY+ D Sbjct: 210 VAEH-GVKTDKSYPYKEND 227 >UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Trypanosoma cruzi|Rep: Cysteine proteinase, putative - Trypanosoma cruzi Length = 392 Score = 70.9 bits (166), Expect = 2e-11 Identities = 33/87 (37%), Positives = 56/87 (64%), Gaps = 2/87 (2%) Frame = +1 Query: 271 AEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNE-DECSSTELDHG 447 A+ +V IP D+ +MEA+A GP+SV +DA++ S Y+ G++N D + ++H Sbjct: 255 AQVQSYVKIPSNDQDAVMEALAKNGPLSVNVDATYWS--AYAGGIFNGCDYSKNITINHV 312 Query: 448 VLVVGYGNDEQ-GVEYWLLKNCWAARW 525 V +VGYG+D + ++YW+L+N W+ W Sbjct: 313 
VQLVGYGHDNKLNLDYWILRNSWSPSW 339 Score = 55.6 bits (128), Expect = 8e-07 Identities = 30/79 (37%), Positives = 39/79 (49%), Gaps = 4/79 (5%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQ----YGNNGCNGGLMDN 169 KDQG+CGSCW+ +E +G L LS+Q L C+ G GC G D Sbjct: 159 KDQGRCGSCWAHGAAEEMESHFAILTGRLHVLSQQQLTSCAPNPKKCGGTGGCYGSTADL 218 Query: 170 AFKYIKDNGGIDTEQTYPY 226 A++Y K GI +E Y Y Sbjct: 219 AYEYAKQ--GITSEWVYSY 235 >UniRef50_Q23VA1 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 397 Score = 70.9 bits (166), Expect = 2e-11 Identities = 35/87 (40%), Positives = 47/87 (54%), Gaps = 5/87 (5%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCS-----EQYGNNGCNGGLMD 166 KDQG+CG CW+FS T E + ++ L SEQ L+DC+ E Y + GC GG Sbjct: 196 KDQGRCGCCWAFSATALAESVNLMRNNTLQQYSEQELVDCTNNQYQEDYSSLGCGGGWAY 255 Query: 167 NAFKYIKDNGGIDTEQTYPYEGVDDKC 247 NA Y++ GI E YPY+ + C Sbjct: 256 NALVYMQ-RKGIFLESQYPYKAQNGVC 281 Score = 43.6 bits (98), Expect = 0.004 Identities = 22/60 (36%), Positives = 33/60 (55%) Frame = +1 Query: 346 PVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 PVSV +D+ + + YSSGV++ +DH VL+VGY + W++KN W W Sbjct: 319 PVSVKVDSRY--WNSYSSGVFSNCLSDGWYVDHVVLLVGYTKEGN----WIVKNSWGTNW 372 >UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep: Aca s 1 allergen - Acarus siro (Dust mite) Length = 331 Score = 70.9 bits (166), Expect = 2e-11 Identities = 29/72 (40%), Positives = 45/72 (62%) Frame = +1 Query: 310 EQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGVE 489 ++ +M + T GPV+V IDA H F+ Y SGV +TE++H + +VG+G E G++ Sbjct: 234 DESIMTVLKTHGPVAVDIDADHNGFKHYKSGVIRLTRGGTTEVNHVINIVGWGR-ENGLD 292 Query: 490 YWLLKNCWAARW 525 YWL++N W W Sbjct: 293 YWLIRNSWGTHW 304 Score = 67.3 bits (157), Expect = 2e-10 Identities = 30/89 (33%), Positives = 52/89 (58%), Gaps = 5/89 (5%) Frame = +2 Query: 2 
KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQ-----YGNNGCNGGLMD 166 ++QG+CG+CW+F++ +E + + LS+Q L++C+ + Y N+GC GG Sbjct: 123 ENQGRCGACWAFASLATVEAAFAIKYNTHIRLSKQELVECTRESDHTPYENSGCQGGYSW 182 Query: 167 NAFKYIKDNGGIDTEQTYPYEGVDDKCRY 253 A KY++ G ++ E YPYE D++ Y Sbjct: 183 EALKYVQVTGVVE-EAAYPYEAKDNQACY 210 >UniRef50_Q23894 Cluster: Cysteine proteinase 3; n=2; Dictyostelium discoideum|Rep: Cysteine proteinase 3 - Dictyostelium discoideum (Slime mold) Length = 151 Score = 70.9 bits (166), Expect = 2e-11 Identities = 33/54 (61%), Positives = 41/54 (75%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 163 KDQG+CGSC STTG++EG ++G LVSLSEQN++ S +GN GCNGGLM Sbjct: 92 KDQGQCGSC-IISTTGSVEGVTAIKTGKLVSLSEQNILRLSSSFGNEGCNGGLM 144 >UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypanosoma cruzi|Rep: Cysteine protease, putative - Trypanosoma cruzi Length = 434 Score = 70.5 bits (165), Expect = 3e-11 Identities = 32/79 (40%), Positives = 45/79 (56%), Gaps = 4/79 (5%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQY----GNNGCNGGLMDN 169 KDQG CGSCW+ + T ++E + SG L++LS Q + C G+ GC GG Sbjct: 143 KDQGSCGSCWAHAATESVESMYAISSGKLLTLSTQQITSCVNNTRKCGGSGGCGGGTAQL 202 Query: 170 AFKYIKDNGGIDTEQTYPY 226 A++YI + GGI + YPY Sbjct: 203 AWEYIMNTGGITLDAEYPY 221 Score = 57.2 bits (132), Expect = 3e-07 Identities = 26/84 (30%), Positives = 48/84 (57%), Gaps = 3/84 (3%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNE--DECSSTELDHGVLV 456 G+ +P D + ++EA+ GP++V++ AS F Y+ GV++ + + + H V + Sbjct: 246 GYASLPHNDYEAVIEALVQKGPLAVSVAASDWMF--YTGGVFDGCGKDGENITISHAVQL 303 Query: 457 VGYGNDEQ-GVEYWLLKNCWAARW 525 VGYG D + +YW+++N W W Sbjct: 304 VGYGTDNKTNQDYWVVRNSWGEGW 327 >UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_158, whole genome shotgun sequence - Paramecium tetraurelia Length = 308 Score = 70.5 bits (165), Expect = 3e-11 Identities 
= 34/83 (40%), Positives = 49/83 (59%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG CG+ W+F+ GA+E S + LSEQ LIDC + N GC G ++N+ + Sbjct: 126 KDQGYCGAAWAFAAIGAVESVLRINSVTNLDLSEQQLIDCDLE--NQGCEDGNLNNSLNW 183 Query: 182 IKDNGGIDTEQTYPYEGVDDKCR 250 ++N G+ T +YPY G D C+ Sbjct: 184 AQNN-GVTTSASYPYTGQTDGCK 205 Score = 49.2 bits (112), Expect = 7e-05 Identities = 24/72 (33%), Positives = 40/72 (55%) Frame = +1 Query: 310 EQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGVE 489 E M+A P++ +DA T++ Y SGV+N+ C+ EL+H L++G+ +D Sbjct: 220 EPDQMQAAIIKSPIAATVDA--TTWLFYKSGVFNK--CTFEELNHDALIIGFKDDGT--- 272 Query: 490 YWLLKNCWAARW 525 W++KN W W Sbjct: 273 -WIVKNSWGQWW 283 >UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin L-like cysteine proteinase precursor - Acanthoscelides obtectus (Bean weevil) Length = 321 Score = 70.1 bits (164), Expect = 4e-11 Identities = 30/50 (60%), Positives = 38/50 (76%), Gaps = 1/50 (2%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSE-QYGNNGC 148 K QG CGSCW+FS G++EGQ F ++G L SLS QNL+DC+ +YGN GC Sbjct: 126 KKQGNCGSCWAFSAVGSIEGQVFLKNGSLESLSAQNLVDCAGIEYGNFGC 175 Score = 70.1 bits (164), Expect = 4e-11 Identities = 35/84 (41%), Positives = 54/84 (64%), Gaps = 3/84 (3%) Frame = +1 Query: 283 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNE-DECSSTE--LDHGVL 453 G+ + +GDE L +AVAT+GP+S+A+D +H F Y G+ ++ C ++E L+HGVL Sbjct: 218 GYQAVSKGDEVVLAQAVATIGPISIALDGNHIMF--YRRGIVSKWCGCKNSEKDLNHGVL 275 Query: 454 VVGYGNDEQGVEYWLLKNCWAARW 525 +VGYG+ YW++KN W W Sbjct: 276 LVGYGDG-----YWIVKNSWGRIW 294 >UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA, isoform A; n=2; Endopterygota|Rep: PREDICTED: similar to CG3074-PA, isoform A - Tribolium castaneum Length = 445 Score = 69.7 bits (163), Expect = 5e-11 Identities = 38/90 (42%), Positives = 53/90 (58%), Gaps = 3/90 (3%) Frame = +2 Query: 2 
KDQGKCGSCWSFSTTGALEGQHFR---QSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNA 172 +DQG CGS W+ TT A+ F + V+LS Q+L+ C + G CNGG +D A Sbjct: 215 QDQGWCGSSWAI-TTAAVASDRFAILSKGREKVTLSAQHLLSCDRR-GQQSCNGGYLDRA 272 Query: 173 FKYIKDNGGIDTEQTYPYEGVDDKCRYIPR 262 + YI+ G +D EQ +PY ++KCR IPR Sbjct: 273 WSYIRKIGLVD-EQCFPYSATNEKCR-IPR 300 Score = 43.2 bits (97), Expect = 0.005 Identities = 23/79 (29%), Positives = 37/79 (46%), Gaps = 5/79 (6%) Frame = +1 Query: 304 GDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELD--HGVLVVGYGND- 474 G+E +M + GPV + H F Y G+Y S+ + H V +VG+G + Sbjct: 330 GNETDIMYEILHSGPVQATMKVYHDFFT-YKRGIYRHSPISTNDRTGYHSVRIVGWGEEY 388 Query: 475 -EQGVE-YWLLKNCWAARW 525 +G++ YW + N W W Sbjct: 389 SPEGLKKYWKVANSWGPEW 407 >UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestivum|Rep: Cysteine protease - Triticum aestivum (Wheat) Length = 371 Score = 69.7 bits (163), Expect = 5e-11 Identities = 32/82 (39%), Positives = 43/82 (52%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K QG CG CW+F+ +E + G LV LS Q L+DCS ++ C G +A + Sbjct: 169 KQQGACGCCWAFAAAATVESLNKINGGELVDLSVQELVDCSTGVFSSPCGYGWPKSALAW 228 Query: 182 IKDNGGIDTEQTYPYEGVDDKC 247 IK GG+ TE YPY +C Sbjct: 229 IKSKGGLLTEAEYPYMAKRGRC 250 Score = 56.4 bits (130), Expect = 5e-07 Identities = 28/60 (46%), Positives = 35/60 (58%) Frame = +1 Query: 346 PVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 PV+V ID S Q Y SGVY C++++ +H V VVGYG G EYW+ KN W W Sbjct: 284 PVTVQIDGSGPVLQDYKSGVYR-GPCTTSQ-NHVVTVVGYGVTGAGEEYWIAKNSWGQTW 341 >UniRef50_Q5C1E8 Cluster: SJCHGC04937 protein; n=1; Schistosoma japonicum|Rep: SJCHGC04937 protein - Schistosoma japonicum (Blood fluke) Length = 235 Score = 69.7 bits (163), Expect = 5e-11 Identities = 32/60 (53%), Positives = 40/60 (66%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 K+Q KCG W+F++ GALEGQ S L SLS Q L+DC++ YGN GC GLM A+ Y Sbjct: 176 KNQEKCGCGWAFASVGALEGQMKLHSIPLQSLSTQQLVDCTQDYGNYGCASGLMKYAYDY 235 
>UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Piroplasmida|Rep: Cysteine proteinase, putative - Theileria parva Length = 460 Score = 69.7 bits (163), Expect = 5e-11 Identities = 33/83 (39%), Positives = 49/83 (59%), Gaps = 1/83 (1%) Frame = +2 Query: 2 KDQG-KCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFK 178 K+QG +CGSCW+F++ ++E + + LSEQ L+DC + + GC GG D A K Sbjct: 265 KNQGLECGSCWAFASVSSVESLYKIYRNVTLDLSEQELVDC--ETSSKGCEGGFGDTALK 322 Query: 179 YIKDNGGIDTEQTYPYEGVDDKC 247 YI+ N G+ T+ PY G + C Sbjct: 323 YIQ-NKGVSTDSEIPYLGKKNNC 344 Score = 52.8 bits (121), Expect = 6e-06 Identities = 29/72 (40%), Positives = 40/72 (55%), Gaps = 1/72 (1%) Frame = +1 Query: 313 QKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQ-GVE 489 Q +++ + P V I AS+ +Y +GVYN EC S L+H VL+VG G DE Sbjct: 363 QDVLKKSLVISPTIVYIAASN-DLSMYQAGVYN-GECGSA-LNHAVLLVGEGYDEVLDKR 419 Query: 490 YWLLKNCWAARW 525 YW++KN W W Sbjct: 420 YWVIKNSWGPDW 431 >UniRef50_Q23H06 Cluster: Papain family cysteine protease containing protein; n=18; Tetrahymena thermophila|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 349 Score = 69.7 bits (163), Expect = 5e-11 Identities = 36/86 (41%), Positives = 48/86 (55%), Gaps = 3/86 (3%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC---SEQYGNNGCNGGLMDNA 172 K QG CG+CW+FS TG +E +F Q+ LV SEQ L+DC + Y ++GC+GG Sbjct: 157 KWQGNCGACWAFSATGVMESFNFIQNKALVEFSEQQLLDCVIPANGYPSSGCHGGWPVQC 216 Query: 173 FKYIKDNGGIDTEQTYPYEGVDDKCR 250 Y GI + Y Y GV +CR Sbjct: 217 IDY-ASKVGILNQDRYYYFGVQMQCR 241 Score = 60.1 bits (139), Expect = 4e-08 Identities = 37/95 (38%), Positives = 52/95 (54%) Frame = +1 Query: 241 QVQVHPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDE 420 Q +V N G + +V IP + ++ PVSVA+D T++ Y SGV+N + Sbjct: 239 QCRVTGTNNGFKPKSWVQIPNNSDA--LKTALNFSPVSVAVDG--TNWTDYKSGVFNGCD 294 Query: 421 CSSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 S L+H VLVVGY DEQG W++KN W+ W Sbjct: 295 -SHVSLNHAVLVVGY--DEQG--NWIIKNSWSTLW 
324 >UniRef50_UPI0000D566F0 Cluster: PREDICTED: similar to CathePsin L family member (cpl-1); n=1; Tribolium castaneum|Rep: PREDICTED: similar to CathePsin L family member (cpl-1) - Tribolium castaneum Length = 185 Score = 69.3 bits (162), Expect = 6e-11 Identities = 35/86 (40%), Positives = 52/86 (60%), Gaps = 2/86 (2%) Frame = +1 Query: 256 PKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDEC--SS 429 P+N GA G+ + EGDE++L V T+GPVSV + A F LY G+Y D +S Sbjct: 101 PENIGASIQGYGTVTEGDEEELKAVVGTLGPVSVIVTAD-LIFILYRKGIYFNDNWLNAS 159 Query: 430 TELDHGVLVVGYGNDEQGVEYWLLKN 507 +H + V+GYG+ E G +YW+++N Sbjct: 160 EPYNHALTVIGYGS-ENGQDYWIVRN 184 Score = 45.6 bits (103), Expect = 9e-04 Identities = 29/84 (34%), Positives = 44/84 (52%), Gaps = 7/84 (8%) Frame = +2 Query: 29 WSFSTTGALEGQ---HFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNA----FKYIK 187 W ALEG H Q +LS++NLIDC Y + C + +A ++Y+ Sbjct: 22 WENFKVAALEGHVGIHLGQKNQ--TLSQENLIDCV--YSDFQCKQEMKRSALVDCYQYMV 77 Query: 188 DNGGIDTEQTYPYEGVDDKCRYIP 259 ++GGIDT ++YPY+ CR+ P Sbjct: 78 NSGGIDTLESYPYDQKPPLCRFKP 101 >UniRef50_Q17IZ6 Cluster: Procathepsin L3, putative; n=2; Culicidae|Rep: Procathepsin L3, putative - Aedes aegypti (Yellowfever mosquito) Length = 313 Score = 68.9 bits (161), Expect = 8e-11 Identities = 31/80 (38%), Positives = 45/80 (56%) Frame = +2 Query: 5 DQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYI 184 +Q CGSC++FS AL GQ R+ G + +S Q ++DCS GN GC GG + +Y+ Sbjct: 151 NQKTCGSCYAFSIGHALNGQIMRRIGRVEYVSTQQMVDCSTSAGNKGCAGGSLRFTMQYL 210 Query: 185 KDNGGIDTEQTYPYEGVDDK 244 +++ GI YPY K Sbjct: 211 QNSQGIMRSSDYPYTSSSSK 230 >UniRef50_A0DKP6 Cluster: Chromosome undetermined scaffold_54, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_54, whole genome shotgun sequence - Paramecium tetraurelia Length = 312 Score = 68.9 bits (161), Expect = 8e-11 Identities = 38/88 (43%), Positives = 48/88 (54%) Frame = +2 Query: 2 KDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 181 KDQG+C 
S W+FS TG LE VSLSEQ+LIDC + + GC G N +K+ Sbjct: 129 KDQGQCNSGWAFSVTGTLEVYQKIYQKKNVSLSEQHLIDCDQL--SRGCTDGSNINGYKF 186 Query: 182 IKDNGGIDTEQTYPYEGVDDKCRYIPRT 265 N GI T YPY G + C+ + T Sbjct: 187 AISN-GIATNIEYPYVGYNQTCKRLNGT 213 Score = 43.2 bits (97), Expect = 0.005 Identities = 25/60 (41%), Positives = 36/60 (60%) Frame = +1 Query: 346 PVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQGVEYWLLKNCWAARW 525 PVS +DA + +Q YSSG+++ C T L+H +VVGY +E G W++KN W W Sbjct: 236 PVSAGLDAQN--WQFYSSGIFSN--CGIT-LNHYAVVVGY--EESG--NWIVKNSWGLGW 286 >UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin O; n=1; Danio rerio|Rep: PREDICTED: similar to cathepsin O - Danio rerio Length = 327 Score = 68.5 bits (160), Expect = 1e-10 Identities = 33/87 (37%), Positives = 48/87 (55%), Gaps = 1/87 (1%) Frame = +2 Query: 5 DQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYI 184 +QG CG CW+FS A+E + L LS Q +IDCS Y N GCNGG A ++ Sbjct: 137 NQGSCGGCWAFSIVEAIESVSAKVGEKLQQLSVQQVIDCS--YQNQGCNGGSPVEALYWL 194 Query: 185 KDNG-GIDTEQTYPYEGVDDKCRYIPR 262 + + +E YP++G D C++ P+ Sbjct: 195 TQSKLKLVSEAEYPFKGADGVCQFFPQ 221 Score = 43.2 bits (97), Expect = 0.005 Identities = 22/59 (37%), Positives = 34/59 (57%) Frame = +1 Query: 304 GDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEDECSSTELDHGVLVVGYGNDEQ 480 G E+ +M A+ GP+ V +DA S+Q Y G+ + CSS + +H VL+ GY E+ Sbjct: 238 GQEEVMMSALVDFGPLVVIVDA--ISWQDYLGGII-QHHCSSHKANHAVLITGYDTTEE 293 Database: uniref50 Posted date: Oct 5, 2007 11:19 AM Number of letters in database: 575,637,011 Number of sequences in database: 1,657,284 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.279 0.0580 0.190 Matrix: BLOSUM62 Gap Penalties: Existence: 9, Extension: 2 Number of Hits to DB: 515,027,964 Number of Sequences: 1657284 Number of extensions: 9758456 Number of successful extensions: 41939 Number of sequences better than 10.0: 500 Number of HSP's better than 10.0 without gapping: 38470 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in 
prelim test: 0
Number of HSP's gapped (non-prelim): 41135
length of database: 575,637,011
effective HSP length: 96
effective length of database: 416,537,747
effective search space used: 40820699206
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)