BLASTX 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= NV021685
         (664 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C...   151   2e-35
UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M...   129   5e-29
UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep...   127   3e-28
UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata...   124   1e-27
UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar ...   124   2e-27
UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome s...   124   3e-27
UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca...   120   3e-26
UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n...   120   3e-26
UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi...   117   2e-25
UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n...   116   7e-25
UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat...   115   1e-24
UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|...   113   4e-24
UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs...   112   6e-24
UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:...   111   1e-23
UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; ...   110   3e-23
UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Re...   109   6e-23
UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr...   109   8e-23
UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;...   108   1e-22
UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j...   108   1e-22
UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl...   108   1e-22
UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicate...   107   3e-22
UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]...   105   9e-22
UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18...   104   2e-21
UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n...   103   4e-21
UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto...   102   7e-21
UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip...   101   1e-20
UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ...   101   2e-20
UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheir...   101   2e-20
UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;...   100   3e-20
UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei...   100   6e-20
UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ...   100   6e-20
UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; ...    99   8e-20
UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteina...    99   8e-20
UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 p...    98   2e-19
UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3...    98   2e-19
UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca...    97   4e-19
UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox...    96   8e-19
UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h...    95   1e-18
UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ...    95   2e-18
UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=...    95   2e-18
UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica...    94   2e-18
UniRef50_Q24E33 Cluster: Papain family cysteine protease contain...    94   3e-18
UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; D...    94   3e-18
UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy...    93   4e-18
UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n...    93   5e-18
UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ...    93   7e-18
UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D...    92   9e-18
UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|...    92   1e-17
UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ...    91   2e-17
UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain...    91   3e-17
UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35...    91   3e-17
UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat...    90   5e-17
UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcopt...    89   7e-17
UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [C...    89   7e-17
UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n...    89   9e-17
UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy...    89   1e-16
UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus...    89   1e-16
UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory...    89   1e-16
UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain...    88   2e-16
UniRef50_Q239L8 Cluster: Papain family cysteine protease contain...    88   2e-16
UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ...    87   4e-16
UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole...    87   5e-16
UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio...    87   5e-16
UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ...    86   8e-16
UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus...    86   8e-16
UniRef50_Q24HI2 Cluster: Papain family cysteine protease contain...    86   8e-16
UniRef50_Q22A69 Cluster: Papain family cysteine protease contain...    85   1e-15
UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal...    85   1e-15
UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re...    85   1e-15
UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ...    85   2e-15
UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;...    84   3e-15
UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryz...    84   3e-15
UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No...    83   4e-15
UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt...    83   4e-15
UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Ac...    83   4e-15
UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus ...    83   8e-15
UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ...    83   8e-15
UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome s...    82   1e-14
UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1; ...    82   1e-14
UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ...    81   2e-14
UniRef50_Q22W19 Cluster: Papain family cysteine protease contain...    81   2e-14
UniRef50_Q23EG5 Cluster: Papain family cysteine protease contain...    81   2e-14
UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; ...    81   3e-14
UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia ...    81   3e-14
UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhip...    81   3e-14
UniRef50_Q22NW9 Cluster: Papain family cysteine protease contain...    81   3e-14
UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=...    81   3e-14
UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi...    80   4e-14
UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R...    80   4e-14
UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae...    80   5e-14
UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e...    80   5e-14
UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster...    79   7e-14
UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio...    79   7e-14
UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc...    79   7e-14
UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er...    78   2e-13
UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ...    78   2e-13
UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr...    78   2e-13
UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R...    78   2e-13
UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty...    77   3e-13
UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;...    77   3e-13
UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ...    77   5e-13
UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryz...    76   7e-13
UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, wh...    76   7e-13
UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re...    76   9e-13
UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-...    76   9e-13
UniRef50_Q235G6 Cluster: Papain family cysteine protease contain...    76   9e-13
UniRef50_O16454 Cluster: Temporarily assigned gene name protein ...    75   1e-12
UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n...    75   2e-12
UniRef50_Q23TW3 Cluster: Papain family cysteine protease contain...    75   2e-12
UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t...    74   3e-12
UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1...    74   3e-12
UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ...    74   3e-12
UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8; Magnoliophyt...    74   4e-12
UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine prot...    74   4e-12
UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep: L...    73   5e-12
UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph...    73   5e-12
UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh...    73   5e-12
UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|...    73   5e-12
UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ...    73   6e-12
UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep...    73   6e-12
UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium...    73   6e-12
UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine p...    73   8e-12
UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dv...    73   8e-12
UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve...    72   1e-11
UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ...    72   1e-11
UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, wh...    72   1e-11
UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryo...    72   1e-11
UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ...    71   2e-11
UniRef50_A2Q4E7 Cluster: Peptidase C1A, papain; n=1; Medicago tr...    71   2e-11
UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaste...    71   2e-11
UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ...    71   2e-11
UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb...    71   2e-11
UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ...    71   3e-11
UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome s...    70   6e-11
UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa...    70   6e-11
UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:...    70   6e-11
UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n...    70   6e-11
UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C...    70   6e-11
UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia...    69   8e-11
UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain...    69   1e-10
UniRef50_Q7KWP5 Cluster: Similar to Dictyostelium discoideum (Sl...    69   1e-10
UniRef50_A3EXS2 Cluster: Cathepsin L-like cysteine proteinase-li...    69   1e-10
UniRef50_P25773 Cluster: Cathepsin L; n=38; Eukaryota|Rep: Cathe...    69   1e-10
UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel...    69   1e-10
UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ...    68   2e-10
UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; ...    68   2e-10
UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2...    68   2e-10
UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ...    68   2e-10
UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease ...    67   3e-10
UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; ...    67   3e-10
UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ...    67   4e-10
UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ...    67   4e-10
UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadida...    67   4e-10
UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2; Arab...    66   7e-10
UniRef50_Q23RT7 Cluster: Papain family cysteine protease contain...    66   7e-10
UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ...    66   9e-10
UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w...    65   1e-09
UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi...    65   1e-09
UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:...    65   1e-09
UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ...    65   2e-09
UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum...    65   2e-09
UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi...    64   2e-09
UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin ...    64   3e-09
UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schist...    64   3e-09
UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|...    64   3e-09
UniRef50_Q23FS6 Cluster: Papain family cysteine protease contain...    64   4e-09
UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted...    63   7e-09
UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v...    63   7e-09
UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov...    63   7e-09
UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza sativa...    62   9e-09
UniRef50_Q26988 Cluster: Cysteine proteinase 4; n=1; Tritrichomo...    62   1e-08
UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foet...    62   1e-08
UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ...    62   1e-08
UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sa...    62   2e-08
UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa...    61   3e-08
UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa...    61   3e-08
UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis...    61   3e-08
UniRef50_Q23H32 Cluster: Papain family cysteine protease contain...    61   3e-08
UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, w...    61   3e-08
UniRef50_Q7M1Q7 Cluster: Actinidain; n=1; Actinidia chinensis|Re...    60   4e-08
UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve...    60   4e-08
UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endop...    60   4e-08
UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ...    60   5e-08
UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ...    60   5e-08
UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic...    60   5e-08
UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ...    60   5e-08
UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease ...    60   6e-08
UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain...    60   6e-08
UniRef50_Q248G1 Cluster: Papain family cysteine protease contain...    59   8e-08
UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis...    58   1e-07
UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Pl...    58   2e-07
UniRef50_Q231X3 Cluster: Papain family cysteine protease contain...    58   2e-07
UniRef50_Q22RC9 Cluster: Papain family cysteine protease contain...    58   2e-07
UniRef50_Q22DX2 Cluster: Papain family cysteine protease contain...    58   2e-07
UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh...    58   2e-07
UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ...    57   3e-07
UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n...    57   3e-07
UniRef50_Q53K53 Cluster: Cysteine protease 1, putative; n=5; Ory...    57   4e-07
UniRef50_Q9XZM9 Cluster: Cysteine proteinase CPW2; n=1; Acantham...    57   4e-07
UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain...    57   4e-07
UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G...    56   6e-07
UniRef50_Q26989 Cluster: Cysteine proteinase 5; n=1; Tritrichomo...    56   6e-07
UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy...    56   6e-07
UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, wh...    56   6e-07
UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|...    56   8e-07
UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckei...    56   8e-07
UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep...    56   8e-07
UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru...    56   8e-07
UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ...    56   1e-06
UniRef50_Q26987 Cluster: Cysteine proteinase 3; n=1; Tritrichomo...    56   1e-06
UniRef50_Q17IZ6 Cluster: Procathepsin L3, putative; n=2; Culicid...    56   1e-06
UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen...    56   1e-06
UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, wh...    56   1e-06
UniRef50_Q23894 Cluster: Cysteine proteinase 3; n=2; Dictyosteli...    56   1e-06
UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Re...    55   1e-06
UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil...    55   1e-06
UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba hi...    55   1e-06
UniRef50_Q5C1E8 Cluster: SJCHGC04937 protein; n=1; Schistosoma j...    55   1e-06
UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy...    55   2e-06
UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H...    55   2e-06
UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:...    55   2e-06
UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA,...    54   2e-06
UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|...    54   2e-06
UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, wh...    54   2e-06
UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease ...    54   3e-06
UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryz...    54   3e-06
UniRef50_Q9BL26 Cluster: Putative uncharacterized protein; n=4; ...    54   3e-06
UniRef50_Q23H06 Cluster: Papain family cysteine protease contain...    54   3e-06
UniRef50_A0CHZ5 Cluster: Chromosome undetermined scaffold_186, w...    54   3e-06
UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr...    54   4e-06
UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Re...    54   4e-06
UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: ...    53   5e-06
UniRef50_Q23H10 Cluster: Papain family cysteine protease contain...    53   5e-06
UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy...    53   5e-06
UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precu...    53   5e-06
UniRef50_UPI0001509FF9 Cluster: Papain family cysteine protease ...    53   7e-06
UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain...    53   7e-06
UniRef50_UPI000155637A Cluster: PREDICTED: similar to ENSANGP000...    52   9e-06
UniRef50_UPI0000E224BB Cluster: PREDICTED: hypothetical protein;...    52   9e-06
UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain...    52   9e-06
UniRef50_UPI0000E492F4 Cluster: PREDICTED: similar to cathepsin ...    52   1e-05
UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestiv...    52   1e-05
UniRef50_Q650X9 Cluster: Putative cysteine proteinase; n=1; Oryz...    52   1e-05
UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag...    52   1e-05
UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, w...    52   2e-05
UniRef50_UPI0000D566F0 Cluster: PREDICTED: similar to CathePsin ...    51   2e-05
UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Pir...    51   2e-05
UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cy...    51   3e-05
UniRef50_A0DKP6 Cluster: Chromosome undetermined scaffold_54, wh...    51   3e-05
UniRef50_A0DCA5 Cluster: Chromosome undetermined scaffold_45, wh...    51   3e-05
UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain...    50   4e-05
UniRef50_Q650W8 Cluster: Putative cysteine proteinase; n=2; Oryz...    50   5e-05
UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba hi...    50   5e-05
UniRef50_Q24F16 Cluster: Papain family cysteine protease contain...    50   5e-05
UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease ...    50   7e-05
UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG...    50   7e-05
UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big...    50   7e-05
UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina...    50   7e-05
UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain ...    50   7e-05
UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;...    49   9e-05
UniRef50_A2F4G0 Cluster: Clan CA, family C1, cathepsin L-like cy...    49   9e-05
UniRef50_Q9VNK7 Cluster: CG1075-PA; n=1; Drosophila melanogaster...    49   1e-04
UniRef50_Q5NE16 Cluster: Putative cathepsin L-like protein 3; n=...    49   1e-04
UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin ...    48   2e-04
UniRef50_Q0MYX5 Cluster: Putative cysteine protease; n=1; Emilia...    48   2e-04
UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, wh...    48   2e-04
UniRef50_A0BLR4 Cluster: Chromosome undetermined scaffold_115, w...    48   2e-04
UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; The...    48   2e-04
UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto...    48   2e-04
UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain...    48   3e-04
UniRef50_A0BNM1 Cluster: Chromosome undetermined scaffold_119, w...    48   3e-04
UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p; ...    47   4e-04
UniRef50_UPI0000EBEFA5 Cluster: PREDICTED: similar to Cathepsin ...    47   4e-04
UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome s...    47   4e-04
UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi...    47   4e-04
UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lambli...    47   4e-04
UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa...    47   4e-04
UniRef50_Q26993 Cluster: Cysteine proteinase 9; n=1; Tritrichomo...    47   4e-04
UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:...    47   4e-04
UniRef50_A0CS14 Cluster: Chromosome undetermined scaffold_26, wh...    47   4e-04
UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; ...    47   5e-04
UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali...    47   5e-04
UniRef50_Q4TI44 Cluster: Chromosome undetermined SCAF2412, whole...    46   6e-04
UniRef50_A2WPN4 Cluster: Putative uncharacterized protein; n=2; ...    46   6e-04
UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1...    46   6e-04
UniRef50_Q23H15 Cluster: Papain family cysteine protease contain...    46   6e-04
UniRef50_UPI000155C322 Cluster: PREDICTED: similar to cathepsin ...    46   8e-04
UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lambli...    46   8e-04
UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like cy...    46   8e-04
UniRef50_Q5YER1 Cluster: Cysteine proteinase; n=1; Bigelowiella ...    46   0.001
UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - ...    46   0.001
UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ...    45   0.001
UniRef50_Q9XXQ7 Cluster: Putative uncharacterized protein; n=2; ...    45   0.001
UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1; ...    45   0.001
UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ...    45   0.002
UniRef50_Q9SIE8 Cluster: Putative cysteine proteinase; n=1; Arab...    44   0.002
UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-...    44   0.002
UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; The...    44   0.002
UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,...    44   0.003
UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin ...    44   0.003
UniRef50_Q2XWW8 Cluster: Cysteine protease Mir1; n=1; Zea diplop...    44   0.003
UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Pl...    44   0.003
UniRef50_Q94715 Cluster: Putative cathepsin L2 precursor; n=4; P...    44   0.003
UniRef50_UPI0000E815AE Cluster: PREDICTED: similar to glucocorti...    44   0.004
UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomus...    43   0.006
UniRef50_Q3L7L7 Cluster: Sar s 1 allergen Yv5020C01; n=1; Sarcop...    43   0.006
UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-li...    43   0.008
UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; M...    43   0.008
UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ...    42   0.010
UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n...    42   0.010
UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;...    42   0.010
UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cy...    42   0.010
UniRef50_Q3L7L0 Cluster: Sar s 1 allergen SMIPP-C Yv5009F04; n=3...    42   0.013
UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella ve...    42   0.013
UniRef50_A2DKX2 Cluster: Clan CA, family C1, cathepsin L-like cy...    42   0.013
UniRef50_P56202 Cluster: Cathepsin W precursor; n=15; Eutheria|R...    42   0.013
UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcop...    42   0.018
UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcop...    42   0.018
UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precurs...    42   0.018
UniRef50_Q5CYC8 Cluster: Cathepsin like thiol protease possibly ...    41   0.023
UniRef50_A7APS9 Cluster: Papain family cysteine protease contain...    41   0.023
UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; ...    41   0.031
UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Try...    41   0.031
UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;...    40   0.040
UniRef50_Q01E11 Cluster: Cysteine proteinase Cathepsin F; n=3; O...    40   0.040
UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2; ...    40   0.040
UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intest...    40   0.040
UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin ...    40   0.053
UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcop...    40   0.053
UniRef50_A6N590 Cluster: Extracellular cysteine protease 8; n=10...    40   0.053
UniRef50_Q2QS15 Cluster: Papain family cysteine protease contain...    40   0.071
UniRef50_A0DZB1 Cluster: Chromosome undetermined scaffold_7, who...    40   0.071
UniRef50_A5HC50 Cluster: Cathepsin W; n=2; Theria|Rep: Cathepsin...    39   0.093
UniRef50_Q54R55 Cluster: Putative uncharacterized protein; n=1; ...    39   0.093
UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1; ...    39   0.093
UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like cy...    39   0.093
UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Pl...    39   0.093
UniRef50_Q60EQ9 Cluster: Putative uncharacterized protein OJ1280...    39   0.12
UniRef50_A7TZ14 Cluster: Cysteine proteinase; n=1; Lepeophtheiru...    39   0.12
UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: C...    38   0.16
UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like prote...    38   0.16
UniRef50_P84789 Cluster: Philibertain g 1; n=5; core eudicotyled...    38   0.16
UniRef50_UPI00006CBCFB Cluster: Papain family cysteine protease ...    38   0.22
UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; ...    38   0.22
UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|R...    38   0.22
UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathep...    38   0.22
UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, who...    38   0.22
UniRef50_P21381 Cluster: Thaumatopain; n=10; Eukaryota|Rep: Thau...    38   0.22
UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3....    38   0.22
UniRef50_A6GBK8 Cluster: Peptidase C1A, papain; n=1; Plesiocysti...    38   0.28
UniRef50_A0DIY3 Cluster: Chromosome undetermined scaffold_52, wh...    38   0.28
UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-li...    38   0.28
UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin ...    37   0.38
UniRef50_UPI00006CBB5F Cluster: Papain family cysteine protease ...    37   0.38
UniRef50_Q962W3 Cluster: Cysteine protease; n=3; Giardia intesti...    37   0.38
UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein...    37   0.38
UniRef50_Q18740 Cluster: Putative uncharacterized protein tag-32...    37   0.38
UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like cy...    37   0.38
UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2; cel...    37   0.38
UniRef50_Q9UJW2 Cluster: Tubulointerstitial nephritis antigen; n...    37   0.38
UniRef50_Q6XM39 Cluster: FirrV-1-A48 precursor; n=1; Feldmannia ...    37   0.50
UniRef50_Q945E4 Cluster: Cysteine proteinase; n=1; Vasconcellea ...    37   0.50
UniRef50_Q0JP65 Cluster: Os01g0240900 protein; n=3; Oryza sativa...    37   0.50
UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; R...    36   0.66
UniRef50_Q75JH0 Cluster: Similar to Dictyostelium discoideum (Sl...    36   0.66
UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3; Theil...    36   0.66
UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|R...    36   0.66
UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyl...    36   0.87
UniRef50_Q01FU9 Cluster: Cathepsin Z; n=2; Ostreococcus|Rep: Cat...    36   1.1
UniRef50_Q9TWP8 Cluster: Cysteine protease; n=5; Eukaryota|Rep: ...    35   1.5
UniRef50_A7AX75 Cluster: Preprocathepsin c, putative; n=1; Babes...    35   1.5
UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ...    35   2.0
UniRef50_Q7M1Q8 Cluster: Proteinase omega; n=1; Carica papaya|Re...    35   2.0
UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia...    35   2.0
UniRef50_A4VE98 Cluster: Cathepsin z; n=3; Tetrahymena thermophi...    35   2.0
UniRef50_Q8GFF2 Cluster: Putative uncharacterized protein; n=1; ...    34   2.7
UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3; ...    34   2.7
UniRef50_Q7Q9Y5 Cluster: ENSANGP00000012222; n=1; Anopheles gamb...    34   2.7
UniRef50_Q23FP9 Cluster: Papain family cysteine protease contain...    34   2.7
UniRef50_A7SNM3 Cluster: Predicted protein; n=1; Nematostella ve...    34   2.7
UniRef50_Q0UAL2 Cluster: Putative uncharacterized protein; n=1; ...    34   2.7
UniRef50_UPI000069FB13 Cluster: UPI000069FB13 related cluster; n...    34   3.5
UniRef50_Q0AY53 Cluster: Putative uncharacterized protein; n=1; ...    34   3.5
UniRef50_Q9W0L7 Cluster: CG32479-PA; n=1; Drosophila melanogaste...    34   3.5
UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestin...    34   3.5
UniRef50_Q6A1H9 Cluster: Cathepsin X/O; n=1; Suberites domuncula...    34   3.5
UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpw...    34   3.5
UniRef50_A1XG92 Cluster: Putative cathepsin B-like like proteina...    34   3.5
UniRef50_Q8IIJ9 Cluster: Probable cathepsin C precursor; n=11; P...    34   3.5
UniRef50_UPI00006CFA59 Cluster: Papain family cysteine protease ...    33   4.6
UniRef50_UPI0000D8B388 Cluster: hornerin; n=2; Euteleostomi|Rep:...    33   4.6
UniRef50_A4FJR8 Cluster: Secreted protein; n=2; Bacteria|Rep: Se...    33   4.6
UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia...    33   4.6
UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma j...    33   4.6
UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theil...    33   4.6
UniRef50_Q3JQL5 Cluster: Putative uncharacterized protein; n=1; ...    33   6.1
UniRef50_A0UP06 Cluster: Cell divisionFtsK/SpoIIIE; n=1; Burkhol...    33   6.1
UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=...    33   6.1
UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep...    33   6.1
UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lambl...    33   6.1
UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8...    33   6.1
UniRef50_Q61CG7 Cluster: Putative uncharacterized protein CBG129...    33   6.1
UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep: Ca...    33   6.1
UniRef50_A7LFV2 Cluster: Cathepsin L protease inhibitor 1; n=1; ...    33   6.1
UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase pr...    33   6.1
UniRef50_UPI000023E9E1 Cluster: predicted protein; n=1; Gibberel...    33   8.1
UniRef50_Q3JUP4 Cluster: Putative uncharacterized protein; n=2; ...    33   8.1
UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gamb...    33   8.1
UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|...    33   8.1
UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-li...    33   8.1
UniRef50_A4RJ84 Cluster: Putative uncharacterized protein; n=2; ...
33 8.1 >UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain] - Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina) Length = 339 Score = 151 bits (365), Expect = 2e-35 Identities = 63/83 (75%), Positives = 74/83 (89%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS+TGALEGQHFR++G LVSLSEQNL+DCS +YGNNGCNGGLMDNAF+YIKDNGGIDTE Sbjct: 148 AFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTE 207 Query: 437 QTYPYEGVDDKCRYNPKNTGAED 505 ++YPYEG+DD C +N GA D Sbjct: 208 KSYPYEGIDDSCHFNKATIGATD 230 Score = 108 bits (259), Expect = 1e-22 Identities = 48/84 (57%), Positives = 61/84 (72%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 + G VSYKLG+NKY DMLHHEF +TMNG+N T + L + + GA +I PA+V + Sbjct: 66 FAQGKVSYKLGLNKYADMLHHEFKETMNGYNHTLRQ---LMRERTGLVGATYIPPAHVTV 122 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P+ VDWR+HGAVT +KDQG CGSC Sbjct: 123 PKSVDWREHGAVTGVKDQGHCGSC 146 Score = 89.0 bits (211), Expect = 9e-17 Identities = 40/52 (76%), Positives = 45/52 (86%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 GFVDIPEGDE+K+ +AVAT+GPVSVAIDASH SFQLYS GVYNE EC +L Sbjct: 232 GFVDIPEGDEEKMKKAVATMGPVSVAIDASHESFQLYSEGVYNEPECDEQNL 283 >UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (Major excreted protein) (MEP) [Contains: Cathepsin L heavy chain; Cathepsin L light chain]; n=19; Eukaryota|Rep: Cathepsin L precursor (EC 3.4.22.15) (Major excreted protein) (MEP) [Contains: Cathepsin L heavy chain; Cathepsin L light chain] - Homo sapiens (Human) Length = 333 Score = 129 bits (312), Expect = 5e-29 Identities = 54/91 (59%), Positives = 71/91 (78%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS TGALEGQ FR++G L+SLSEQNL+DCS GN GCNGGLMD 
AF+Y++DNGG+D+E Sbjct: 140 AFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSE 199 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVASWTSPR 529 ++YPYE ++ C+YNPK + A D P+ Sbjct: 200 ESYPYEATEESCKYNPKYSVANDTGFVDIPK 230 Score = 70.9 bits (166), Expect = 2e-11 Identities = 33/52 (63%), Positives = 40/52 (76%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 GFVDIP+ E+ LM+AVATVGP+SVAIDA H SF Y G+Y E +CSS D+ Sbjct: 224 GFVDIPK-QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDM 274 Score = 62.1 bits (144), Expect = 1e-08 Identities = 32/84 (38%), Positives = 42/84 (50%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 Y G S+ + MN +GDM EF + MNGF +G F P + Sbjct: 66 YREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPR-----------KGKVFQEPLFYEA 114 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P VDWR+ G VT +K+QG+CGSC Sbjct: 115 PRSVDWREKGYVTPVKNQGQCGSC 138 >UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin L - Misgurnus mizolepis (Mud loach) Length = 337 Score = 127 bits (306), Expect = 3e-28 Identities = 56/84 (66%), Positives = 66/84 (78%), Gaps = 1/84 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FSTTGA+EGQ FR+ G LVSLSEQNL+DCS GN GCNGGLMD AF+YIKDN G+D+E Sbjct: 142 AFSTTGAMEGQMFRKQGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNNGLDSE 201 Query: 437 QTYPYEGVDDK-CRYNPKNTGAED 505 + YPY G DD+ C Y+PK A D Sbjct: 202 EAYPYLGTDDQPCHYDPKYNAAND 225 Score = 81.0 bits (191), Expect = 2e-14 Identities = 37/52 (71%), Positives = 42/52 (80%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 GFVDIP G E LM+AVA+VGPVSVAIDA H SFQ Y SG+Y E+ECSS +L Sbjct: 227 GFVDIPSGKEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYFEKECSSEEL 278 Score = 79.4 bits (187), Expect = 7e-14 Identities = 37/84 (44%), Positives = 54/84 (64%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 + MG+ +Y+LGMN +GDM H EF + MNG+ KH KG + F+ P +++ Sbjct: 66 
HSMGIHTYRLGMNHFGDMNHEEFRQVMNGY----KHKTERKFKG-----SLFMEPNFLEV 116 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P ++DWR+ G VT +KDQG+CGSC Sbjct: 117 PSKLDWREKGYVTPVKDQGECGSC 140 >UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata|Rep: Cathepsin L2 precursor - Homo sapiens (Human) Length = 334 Score = 124 bits (300), Expect = 1e-27 Identities = 52/83 (62%), Positives = 67/83 (80%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS TGALEGQ FR++G LVSLSEQNL+DCS GN GCNGG M AF+Y+K+NGG+D+E Sbjct: 140 AFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSE 199 Query: 437 QTYPYEGVDDKCRYNPKNTGAED 505 ++YPY VD+ C+Y P+N+ A D Sbjct: 200 ESYPYVAVDEICKYRPENSVAND 222 Score = 70.5 bits (165), Expect = 3e-11 Identities = 31/52 (59%), Positives = 40/52 (76%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 GF + G E+ LM+AVATVGP+SVA+DA H+SFQ Y SG+Y E +CSS +L Sbjct: 224 GFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNL 275 Score = 57.6 bits (133), Expect = 2e-07 Identities = 32/84 (38%), Positives = 44/84 (52%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 Y G + + MN +GDM + EF + M F +N + G V F P + L Sbjct: 66 YSQGKHGFTMAMNAFGDMTNEEFRQMMGCF-------RNQKFRKGKV----FREPLFLDL 114 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P+ VDWRK G VT +K+Q +CGSC Sbjct: 115 PKSVDWRKKGYVTPVKNQKQCGSC 138 >UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar to human SRY (sex determining region Y)-box 30 (SOX30),transcript variant 1,; n=3; Mammalia|Rep: Testis cDNA clone: QtsA-12228, similar to human SRY (sex determining region Y)-box 30 (SOX30),transcript variant 1, - Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey) Length = 433 Score = 124 bits (299), Expect = 2e-27 Identities = 51/83 (61%), Positives = 68/83 (81%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS TGALEGQ FR++G LVSLSEQNL+DCS GN GCNGG M++AF+Y+K+NGG+D+E Sbjct: 140 
AFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQGCNGGFMNSAFRYVKENGGLDSE 199 Query: 437 QTYPYEGVDDKCRYNPKNTGAED 505 ++YPY +D C+Y P+N+ A D Sbjct: 200 ESYPYVAMDGICKYRPENSVAND 222 Score = 58.0 bits (134), Expect = 2e-07 Identities = 32/84 (38%), Positives = 44/84 (52%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 Y G + + MN +GDM + EF + M F N+ L +G F P + L Sbjct: 66 YSQGKHGFAMAMNAFGDMTNEEFRQVMGCFR-----NQKLR------KGKLFREPLFLDL 114 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P+ VDWRK G VT +K+Q +CGSC Sbjct: 115 PKSVDWRKKGYVTPVKNQKQCGSC 138 >UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome shotgun sequence; n=3; Tetraodontidae|Rep: Chromosome 12 SCAF14996, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 362 Score = 124 bits (298), Expect = 3e-27 Identities = 56/105 (53%), Positives = 68/105 (64%), Gaps = 1/105 (0%) Frame = +2 Query: 233 PREVWLMRSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIK 412 P VWL+ GQHFRQ+G LVSLSEQNL+DCS GN GCNGGLMD AF+YIK Sbjct: 166 PGSVWLLLGLQHHRGPGGQHFRQTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIK 225 Query: 413 DNGGIDTEQTYPYEGVDDK-CRYNPKNTGAEDVASWTSPRATNRS 544 DNGG+D+E +YPY DD+ C Y+P N A + P + R+ Sbjct: 226 DNGGLDSEASYPYLATDDQPCHYDPSNNSANETGFVDVPSGSERA 270 Score = 81.8 bits (193), Expect = 1e-14 Identities = 36/52 (69%), Positives = 43/52 (82%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 GFVD+P G E+ LM+AVA+VGPVSVAIDA H SFQ Y SG+Y E+ECSS +L Sbjct: 259 GFVDVPSGSERALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEEL 310 Score = 64.9 bits (151), Expect = 2e-09 Identities = 35/80 (43%), Positives = 45/80 (56%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 + MG SY+LGMN +GDM H EF + MNG+ KH RG+ F+ P ++ Sbjct: 65 HSMGQHSYRLGMNHFGDMTHEEFRQIMNGY----KHKPQ-----RKFRGSLFMEPNFLEA 115 Query: 183 PEQVDWRKHGAVTDIKDQGK 242 P VDWR G VT +KDQ K Sbjct: 116 PRAVDWRDKGYVTPVKDQLK 135 >UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: 
Cathepsin - Geodia cydonium (Sponge) Length = 322 Score = 120 bits (289), Expect = 3e-26 Identities = 52/81 (64%), Positives = 62/81 (76%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS TG+LEGQHF +G LVSLSEQNL+DCS GN GCNGGL D+AFKY+ NGGIDTE Sbjct: 129 AFSATGSLEGQHFNATGKLVSLSEQNLVDCSSAEGNEGCNGGLPDDAFKYVIKNGGIDTE 188 Query: 437 QTYPYEGVDDKCRYNPKNTGA 499 +YPY D+KC Y+ N G+ Sbjct: 189 ASYPYVARDEKCHYSSANIGS 209 Score = 60.1 bits (139), Expect = 5e-08 Identities = 28/51 (54%), Positives = 33/51 (64%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 +VDI E +L A ATVGP+ V IDASH FQLY GVY+ + CS T L Sbjct: 214 YVDIESKSEAQLQVASATVGPIPVGIDASHLGFQLYDGGVYHSDLCSQTRL 264 Score = 47.2 bits (107), Expect = 4e-04 Identities = 28/77 (36%), Positives = 39/77 (50%) Frame = +3 Query: 24 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 203 Y + MN++ D+ EFV NG + H + G + +S LP VDWR Sbjct: 60 YTVAMNEFADLDPREFVSHYNGLRRRP-HTSS----GEPCTLGEDVSA----LPTTVDWR 110 Query: 204 KHGAVTDIKDQGKCGSC 254 G VT +K+QG+CGSC Sbjct: 111 TKGYVTGVKNQGQCGSC 127 >UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n=21; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Globodera pallida Length = 379 Score = 120 bits (289), Expect = 3e-26 Identities = 53/84 (63%), Positives = 65/84 (77%), Gaps = 1/84 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS+TGALE QH RQ+G L+SLSEQNLIDCS++YGN GCNGG+MDNAF+YIKDN G+D E Sbjct: 187 AFSSTGALEAQHARQTGQLISLSEQNLIDCSKKYGNMGCNGGIMDNAFQYIKDNNGVDKE 246 Query: 437 QTYPYEG-VDDKCRYNPKNTGAED 505 YPY+ KC + + GA D Sbjct: 247 LDYPYKAKTGKKCLFKRNDVGATD 270 Score = 75.4 bits (177), Expect = 1e-12 Identities = 36/52 (69%), Positives = 40/52 (76%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 GF DI EGDE+KL AVAT GP SVAIDA H SFQLY+ GVY E+ECS +L Sbjct: 272 GFFDIAEGDEEKLKIAVATQGPASVAIDAGHRSFQLYTHGVYFEKECSPENL 323 Score = 58.8 bits 
(136), Expect = 1e-07 Identities = 32/85 (37%), Positives = 47/85 (55%), Gaps = 1/85 (1%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANV-K 179 Y G V++++G N D+ E+ K +NG+ + N + F++P NV Sbjct: 109 YIEGKVTFRVGENHIADLPFSEY-KKLNGYRRLLGDNLRR-------NASTFLAPMNVGD 160 Query: 180 LPEQVDWRKHGAVTDIKDQGKCGSC 254 LPE VDWR G VT++K+QG CGSC Sbjct: 161 LPESVDWRDKGWVTEVKNQGMCGSC 185 >UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsin L - Suberites domuncula (Sponge) Length = 324 Score = 117 bits (282), Expect = 2e-25 Identities = 49/85 (57%), Positives = 64/85 (75%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 SFS TG+LEGQH + G LVSLSEQNL+DCS ++GN+GC GG+MD+AF+Y+ N G+DTE Sbjct: 134 SFSATGSLEGQHALKMGRLVSLSEQNLMDCSSRFGNHGCKGGIMDDAFRYVISNHGVDTE 193 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVA 511 +YPY D CR+N N GA + + Sbjct: 194 SSYPYTAKDGYCRFNQNNVGATETS 218 Score = 63.7 bits (148), Expect = 4e-09 Identities = 29/49 (59%), Positives = 34/49 (69%) Frame = +1 Query: 517 DIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 DI G E L +A A +GP+SVAIDASH SFQ Y +GVY E CSS+ L Sbjct: 221 DIARGSESSLTQASAQIGPISVAIDASHRSFQFYKNGVYYEPSCSSSRL 269 Score = 49.2 bits (112), Expect = 9e-05 Identities = 26/77 (33%), Positives = 41/77 (53%) Frame = +3 Query: 24 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 203 Y L MN++GD+ EF + NG+ + N + ++ PA VDWR Sbjct: 66 YTLEMNEFGDLSGVEFKQIYNGYIMQERANDTKLFTA-----SPYMEPA-----ASVDWR 115 Query: 204 KHGAVTDIKDQGKCGSC 254 + G V+++K+QG+CGSC Sbjct: 116 QKGVVSEVKNQGQCGSC 132 >UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Longidorus elongatus Length = 358 Score = 116 bits (278), Expect = 7e-25 Identities = 48/83 (57%), Positives = 64/83 (77%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS TG+LEGQH++Q+G LVSLSEQNL+DC + GCNGG MD AF+Y++ N GIDTE Sbjct: 165 
AFSATGSLEGQHYKQTGKLVSLSEQNLVDCDVNGDDEGCNGGYMDGAFQYVETNKGIDTE 224 Query: 437 QTYPYEGVDDKCRYNPKNTGAED 505 +YPY+G D +CR+ ++ GA D Sbjct: 225 ASYPYKGRDGRCRFKSEDVGATD 247 Score = 79.4 bits (187), Expect = 7e-14 Identities = 40/84 (47%), Positives = 49/84 (58%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 YE G S+ L +NK+ DM + EF + MNGF AK K + G F P NV + Sbjct: 81 YEAGQHSFALSLNKFADMTNAEFRQRMNGFKLPAKR-KLAKSQPLKEDGMIFEMPDNVTI 139 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P+ VDWRK G VT +KDQG CGSC Sbjct: 140 PDSVDWRKEGYVTKVKDQGSCGSC 163 Score = 68.5 bits (160), Expect = 1e-10 Identities = 32/48 (66%), Positives = 36/48 (75%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECS 651 GFVDIPEG+E L A+ATVGPVSVAIDA+ FQ YS GVY + CS Sbjct: 249 GFVDIPEGNETLLEAAIATVGPVSVAIDAASFKFQFYSHGVYYDRSCS 296 >UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviatus|Rep: Cathepsin L 2 - Diaprepes abbreviatus (Sugarcane rootstalk borer weevil) Length = 348 Score = 115 bits (276), Expect = 1e-24 Identities = 53/95 (55%), Positives = 65/95 (68%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 SFS TGALE Q F+++ L+SLSEQ L+DCS +YGN+GC+GG M AF YIK+NGGIDTE Sbjct: 161 SFSATGALEAQWFKKTNKLISLSEQQLVDCSGRYGNHGCHGGWMHWAFGYIKENGGIDTE 220 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVASWTSPRATNR 541 Q+YPY D +C Y P N A PR N+ Sbjct: 221 QSYPYTAKDGRCAYKPGNKAATVSQVIMVPRGENQ 255 Score = 56.8 bits (131), Expect = 4e-07 Identities = 34/94 (36%), Positives = 48/94 (51%), Gaps = 10/94 (10%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGG------SVRG-AKFI 161 YEMGL SY++ MN GD+ EF++ ++NL ++G + Sbjct: 66 YEMGLSSYQMAMNHLGDLTKDEFMRIYTVNMPQLPQSENLSDSEPWLDLPQDLQGFVTYA 125 Query: 162 SPAN---VKLPEQVDWRKHGAVTDIKDQGKCGSC 254 P N V LP +DWR+ GAVT +K+Q CGSC Sbjct: 126 LPTNLDEVDLPTDIDWRQKGAVTPVKNQRNCGSC 159 Score = 46.8 bits (106), Expect = 5e-04 Identities = 21/43 (48%), Positives = 30/43 (69%) Frame = +1 Query: 520 
IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEEC 648 +P G+ Q L V++VGP+S+A + SH FQ Y SGVY+E +C Sbjct: 249 VPRGENQ-LAAKVSSVGPISIAAEVSH-KFQFYHSGVYDEPQC 289 >UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|Rep: Cysteine proteinase - Acanthamoeba healyi Length = 330 Score = 113 bits (272), Expect = 4e-24 Identities = 53/81 (65%), Positives = 62/81 (76%), Gaps = 1/81 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 SFSTTG+ EG +F ++G LVSLSEQNLIDCS YGNNGCNGGLMD AF+YI +N GIDTE Sbjct: 140 SFSTTGSTEGANFLKTGRLVSLSEQNLIDCSVSYGNNGCNGGLMDYAFEYIINNRGIDTE 199 Query: 437 QTYPYEGVDD-KCRYNPKNTG 496 +YPY+ C+YN N G Sbjct: 200 ASYPYQTAGPLTCQYNAANKG 220 Score = 65.3 bits (152), Expect = 1e-09 Identities = 32/52 (61%), Positives = 35/52 (67%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 G+ D+ GDE L+ A A PVSVAIDASH SFQ YS GVY E CSST L Sbjct: 225 GYTDVTSGDENALLNA-AVKEPVSVAIDASHNSFQFYSGGVYYESACSSTQL 275 Score = 56.0 bits (129), Expect = 8e-07 Identities = 30/78 (38%), Positives = 43/78 (55%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 200 SY L MN++GD+ + EF + G Y K + A +PA +P + DW Sbjct: 69 SYFLAMNQFGDLTNAEFNRLFKGLAFD-------YSKHAKIHTAAPEAPAT-GIPSEFDW 120 Query: 201 RKHGAVTDIKDQGKCGSC 254 R+ GAVT +K+QG+CGSC Sbjct: 121 RQKGAVTHVKNQGQCGSC 138 >UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precursor; n=3; Metazoa|Rep: Digestive cysteine proteinase 2 precursor - Homarus americanus (American lobster) Length = 323 Score = 112 bits (270), Expect = 6e-24 Identities = 48/81 (59%), Positives = 60/81 (74%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FSTTG+LEGQHF ++G L+SL+EQ L+DCS YG GCNGG M++AF YIK N GIDTE Sbjct: 133 AFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTE 192 Query: 437 QTYPYEGVDDKCRYNPKNTGA 499 YPYE D CR++ + A Sbjct: 193 AAYPYEARDGSCRFDSNSVAA 213 Score = 61.7 bits (143), Expect = 2e-08 Identities = 28/52 
(53%), Positives = 35/52 (67%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 G +I G E L +AV +GP+SV IDA+H+SFQ YSSGVY E CS + L Sbjct: 217 GHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSCSPSYL 268 Score = 57.2 bits (132), Expect = 3e-07 Identities = 35/86 (40%), Positives = 44/86 (51%), Gaps = 2/86 (2%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 YE G V++ L MNK+GDM EF M G N+ + V P Sbjct: 58 YENGEVTFNLAMNKFGDMTLEEFNAVMKG---------NIPRRSAPV---SVFYPKKETG 105 Query: 183 PE--QVDWRKHGAVTDIKDQGKCGSC 254 P+ +VDWR GAVT +KDQG+CGSC Sbjct: 106 PQATEVDWRTKGAVTPVKDQGQCGSC 131 >UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep: Cathepsin - Petromyzon marinus (Sea lamprey) Length = 333 Score = 111 bits (268), Expect = 1e-23 Identities = 47/82 (57%), Positives = 59/82 (71%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS TG+LEGQHF +G L SLSEQ L+DC++ Y NNGCNGG + A +YI DN GID+E Sbjct: 143 AFSATGSLEGQHFAATGNLTSLSEQQLVDCTKSYYNNGCNGGRSERALQYIIDNNGIDSE 202 Query: 437 QTYPYEGVDDKCRYNPKNTGAE 502 +YPYE D KCR+ P N + Sbjct: 203 LSYPYEHADGKCRFKPANVATK 224 Score = 58.8 bits (136), Expect = 1e-07 Identities = 36/80 (45%), Positives = 43/80 (53%) Frame = +3 Query: 12 GLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQ 191 G VS+ LG+NKY D+ HE+ K NL G RGA F + LPEQ Sbjct: 68 GNVSFHLGINKYSDLELHEY------HEKVVGRFWNL-RNGTRRRGAPFPLRSMDNLPEQ 120 Query: 192 VDWRKHGAVTDIKDQGKCGS 251 VDWR G VT +K+QG CGS Sbjct: 121 VDWRLKGYVTPVKEQGLCGS 140 Score = 48.8 bits (111), Expect = 1e-04 Identities = 21/55 (38%), Positives = 37/55 (67%) Frame = +1 Query: 493 RC*GRGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSST 657 +C FV+ P +E+ L +AVA+VGP+++A++A +F+ Y SG++NE C + Sbjct: 224 KCSSYQFVE-PSSNEEVLRQAVASVGPIAIAMNADLDTFKHYKSGLFNEPSCDKS 277 >UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; Dictyostelium discoideum|Rep: Cysteine proteinase 7 precursor - Dictyostelium discoideum 
(Slime mold) Length = 460 Score = 110 bits (265), Expect = 3e-23 Identities = 54/85 (63%), Positives = 62/85 (72%), Gaps = 3/85 (3%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGY--LVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGID 430 SFSTTGA EG + +G LVSLSEQNLIDCS YGNNGC GGLM AF+YI +N GID Sbjct: 136 SFSTTGATEGAQYLANGKKNLVSLSEQNLIDCSGSYGNNGCEGGLMTLAFEYIINNKGID 195 Query: 431 TEQTYPYEGVD-DKCRYNPKNTGAE 502 TE +YPY D KC++NPKN A+ Sbjct: 196 TESSYPYTAEDGKKCKFNPKNVAAQ 220 Score = 60.1 bits (139), Expect = 5e-08 Identities = 30/51 (58%), Positives = 35/51 (68%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 +V++ G E L V T GP SVAIDAS+ SFQLY SG+YNE CSST L Sbjct: 224 YVNVTSGSESDLAAKV-TQGPTSVAIDASNQSFQLYVSGIYNEPACSSTQL 273 Score = 41.9 bits (94), Expect = 0.013 Identities = 16/22 (72%), Positives = 18/22 (81%) Frame = +3 Query: 189 QVDWRKHGAVTDIKDQGKCGSC 254 QVDWR GAVT IK+QG+CG C Sbjct: 113 QVDWRTQGAVTPIKNQGQCGGC 134 >UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Rep: Cathepsin R precursor - Mus musculus (Mouse) Length = 334 Score = 109 bits (262), Expect = 6e-23 Identities = 48/82 (58%), Positives = 60/82 (73%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+ TGA+E Q Q+G L LS QNL+DCS+ GNNGC GG NAF+Y+ NGG+++E Sbjct: 141 AFAVTGAIEAQAIWQTGKLTPLSVQNLVDCSKPQGNNGCLGGDTYNAFQYVLHNGGLESE 200 Query: 437 QTYPYEGVDDKCRYNPKNTGAE 502 TYPYEG D CRYNPKN+ AE Sbjct: 201 ATYPYEGKDGPCRYNPKNSKAE 222 Score = 60.9 bits (141), Expect = 3e-08 Identities = 27/49 (55%), Positives = 35/49 (71%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSS 654 GFV +P+ E LM AVAT+GP++ IDASH SF+ Y G+Y+E CSS Sbjct: 225 GFVSLPQS-EDILMAAVATIGPITAGIDASHESFKNYKGGIYHEPNCSS 272 Score = 41.9 bits (94), Expect = 0.013 Identities = 28/82 (34%), Positives = 38/82 (46%) Frame = +3 Query: 9 MGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPE 188 +G + + MN++GD EF K M + MK R A I LP+ Sbjct: 68 
LGKNGFTMKMNEFGDQTDEEFRKMMIEISVWTHREGKSIMK----REAGSI------LPK 117 Query: 189 QVDWRKHGAVTDIKDQGKCGSC 254 VDWRK G VT ++ QG C +C Sbjct: 118 FVDWRKKGYVTPVRRQGDCDAC 139 >UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine proteinase precursor - Heterodera glycines (Soybean cyst nematode worm) Length = 353 Score = 109 bits (261), Expect = 8e-23 Identities = 45/86 (52%), Positives = 66/86 (76%), Gaps = 1/86 (1%) Frame = +2 Query: 257 SFSTTGALEGQHF-RQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDT 433 +FS TGA+EG +++ ++SLSEQNL+DCS +YGN GC+GGLMD+AF+Y++DN G+DT Sbjct: 161 AFSATGAIEGALAQKKASKIISLSEQNLVDCSSKYGNEGCDGGLMDSAFEYVRDNNGLDT 220 Query: 434 EQTYPYEGVDDKCRYNPKNTGAEDVA 511 E++YPYE V KC++ + G V+ Sbjct: 221 EESYPYEAVTGKCQFKNETVGGTVVS 246 Score = 64.5 bits (150), Expect = 2e-09 Identities = 28/48 (58%), Positives = 38/48 (79%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSS 654 F D+ +GDE++L AVAT+GP+SVA+DAS+ SFQ Y +GVY E CS+ Sbjct: 247 FKDLKKGDEEQLKIAVATIGPISVALDASNLSFQFYKTGVYYERWCSN 294 Score = 49.6 bits (113), Expect = 7e-05 Identities = 18/25 (72%), Positives = 23/25 (92%) Frame = +3 Query: 180 LPEQVDWRKHGAVTDIKDQGKCGSC 254 LPE++DWR+ GAVT++KDQG CGSC Sbjct: 135 LPEKLDWREKGAVTEVKDQGDCGSC 159 >UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2; Brugia malayi|Rep: Cahepsin L-like cysteine protease - Brugia malayi (Filarial nematode worm) Length = 371 Score = 108 bits (260), Expect = 1e-22 Identities = 50/91 (54%), Positives = 62/91 (68%), Gaps = 1/91 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQ-YGNNGCNGGLMDNAFKYIKDNGGIDT 433 +FS GALEGQHF Q+G LV LS QNL+DCS+ YGN GC+GGLM AF+Y+ N GIDT Sbjct: 169 TFSAVGALEGQHFLQTGKLVELSMQNLLDCSDDTYGNYGCDGGLMMEAFEYVVKNDGIDT 228 Query: 434 EQTYPYEGVDDKCRYNPKNTGAEDVASWTSP 526 E++YPY+G + CRY+ G A P Sbjct: 229 EKSYPYQGYQNTCRYSNSTRGTTAYAGKLLP 259 Score = 59.7 bits (138), Expect = 6e-08 Identities = 34/84 (40%), Positives = 46/84 (54%) Frame = 
+3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 YE +Y+L +N DML EF K ++GF +KN + ++R N L Sbjct: 92 YERNEETYELAINHLADMLPEEFRK-LHGFQSRKITSKNNFKN--TIR-----MKINGPL 143 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P+ +DWR GAVT +KDQG CGSC Sbjct: 144 PKSIDWRTSGAVTKVKDQGYCGSC 167 Score = 48.8 bits (111), Expect = 1e-04 Identities = 20/45 (44%), Positives = 32/45 (71%) Frame = +1 Query: 520 IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSS 654 +PEGDE +L A+AT+GP+SVA+DA F Y G+++ +C++ Sbjct: 258 LPEGDELQLQAAIATIGPISVAVDAKLMKF--YRRGIFSTSKCTT 300 >UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma japonicum|Rep: SJCHGC06231 protein - Schistosoma japonicum (Blood fluke) Length = 372 Score = 108 bits (260), Expect = 1e-22 Identities = 49/86 (56%), Positives = 65/86 (75%), Gaps = 4/86 (4%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS+TGA+EGQH+R++ LV+LSEQ LIDCS+ YGNNGC GGLMD AF+Y++DN GID+E Sbjct: 176 AFSSTGAIEGQHYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDLAFQYVRDNKGIDSE 235 Query: 437 QTYPYEGVDD----KCRYNPKNTGAE 502 +YPY D +C +N N A+ Sbjct: 236 ISYPYISGDGDENVRCLFNSTNIMAQ 261 Score = 70.1 bits (164), Expect = 4e-11 Identities = 33/84 (39%), Positives = 52/84 (61%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 Y+ G +YK+G+N + D +E K + G+ + K +G+ FIS + KL Sbjct: 100 YQEGKATYKMGVNNFTDKTEYELRK-LRGYRSACRIAKP--------KGSTFISSEHAKL 150 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P++VDWR++GAVT +K+QG+CGSC Sbjct: 151 PDRVDWRRNGAVTPVKNQGQCGSC 174 Score = 68.5 bits (160), Expect = 1e-10 Identities = 29/49 (59%), Positives = 40/49 (81%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSS 654 G+++I EGDE+ LM AVAT+GPVSVAI+A SF +Y SG+Y++ EC+S Sbjct: 264 GYINIHEGDERALMNAVATIGPVSVAINAGLPSFSMYKSGIYSDPECAS 312 >UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegleria fowleri|Rep: Cysteine proteinase homolog - Naegleria fowleri Length = 347 Score = 108 bits (260), Expect = 
1e-22 Identities = 53/97 (54%), Positives = 69/97 (71%), Gaps = 8/97 (8%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDC--------SEQYGNNGCNGGLMDNAFKYIK 412 +FSTTG +EGQ + G LVSLSEQ L+DC ++Q ++GCNGGLM +AF+Y+ Sbjct: 148 TFSTTGNVEGQWAIKKGKLVSLSEQQLVDCDHNCVTYQNQQACDSGCNGGLMWSAFQYVI 207 Query: 413 DNGGIDTEQTYPYEGVDDKCRYNPKNTGAEDVASWTS 523 NGG+DTE +YPYEGVDD CR+N N A ++SWTS Sbjct: 208 KNGGLDTEDSYPYEGVDDTCRFNKSNVAA-TISSWTS 243 Score = 46.4 bits (105), Expect = 6e-04 Identities = 27/75 (36%), Positives = 37/75 (49%), Gaps = 1/75 (1%) Frame = +3 Query: 33 GMNKYGDMLHHEFVKTMNGFNKTAKHNKN-LYMKGGSVRGAKFISPANVKLPEQVDWRKH 209 G+ K+ D+ EF + T + K L +V K + A P DWR+H Sbjct: 76 GITKFSDLTPEEFKRMFLMKTYTPEEAKKILAAPQHAVLSEKEVQTA----PTSFDWRQH 131 Query: 210 GAVTDIKDQGKCGSC 254 GAVT +K+QG CGSC Sbjct: 132 GAVTRVKNQGACGSC 146 >UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicatein a3 - Lubomirskia baicalensis Length = 344 Score = 107 bits (256), Expect = 3e-22 Identities = 47/81 (58%), Positives = 59/81 (72%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+ GALEG + LV+LSEQN+IDCS YGN+GC+GG + AFKY+ DNGGIDTE Sbjct: 154 AFAAAGALEGATALAADKLVALSEQNIIDCSVPYGNHGCSGGDVYTAFKYVVDNGGIDTE 213 Query: 437 QTYPYEGVDDKCRYNPKNTGA 499 +YPY+G C+YN KN GA Sbjct: 214 SSYPYKGKKSSCQYNSKNVGA 234 Score = 56.8 bits (131), Expect = 4e-07 Identities = 25/52 (48%), Positives = 35/52 (67%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 G V I G E L+ AVA+VGP++VA+DAS +F Y SGV++ CS++ L Sbjct: 238 GVVKIASGSETDLLSAVASVGPIAVAVDASVNAFMFYQSGVFDSSTCSTSKL 289 Score = 48.0 bits (109), Expect = 2e-04 Identities = 27/79 (34%), Positives = 40/79 (50%) Frame = +3 Query: 15 LVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQV 194 L Y L MN +GD++ EF + T KH++ ++ F SP V + + Sbjct: 84 LFGYTLAMNGFGDLMSAEFTERY----LTHKHSQRSGLQ-------TFESPKGVTYADSL 132 Query: 195 DWRKHGAVTDIKDQGKCGS 251 DWR G VT ++ QG+CGS Sbjct: 133 DWRTRGVVTSVQSQGQCGS 151 
>UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]; n=11; Eutheria|Rep: Testin-2 precursor [Contains: Testin-1] - Mus musculus (Mouse) Length = 333 Score = 105 bits (252), Expect = 9e-22 Identities = 45/81 (55%), Positives = 61/81 (75%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS TG+LEGQ F+++G LV LSEQNL+DC + C+GG M NAF+Y+KDNGG+ TE Sbjct: 140 AFSATGSLEGQMFKKTGRLVPLSEQNLLDCMGSNVTHDCSGGFMQNAFQYVKDNGGLATE 199 Query: 437 QTYPYEGVDDKCRYNPKNTGA 499 ++YPY G KCRY+ +N+ A Sbjct: 200 ESYPYIGPGRKCRYHAENSAA 220 Score = 67.7 bits (158), Expect = 2e-10 Identities = 32/53 (60%), Positives = 38/53 (71%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 R FV IP G E+ LM+AVA VGP+SVA+DASH SFQ Y SG+Y E +C L Sbjct: 223 RDFVQIP-GREEALMKAVAKVGPISVAVDASHDSFQFYDSGIYYEPQCKRVHL 274 Score = 46.0 bits (104), Expect = 8e-04 Identities = 27/83 (32%), Positives = 43/83 (51%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 Y G + + MN +GD+ + EFVK M GF + +++ + +F+ + Sbjct: 66 YLEGKHDFTMTMNAFGDLTNTEFVKMMTGFRRQKIKRMHVF------QDHQFLY-----V 114 Query: 183 PEQVDWRKHGAVTDIKDQGKCGS 251 P+ VDWR G VT +K+QG C S Sbjct: 115 PKYVDWRMLGYVTPVKNQGYCAS 137 >UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18; Magnoliophyta|Rep: Thiol protease aleurain precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 358 Score = 104 bits (249), Expect = 2e-21 Identities = 43/82 (52%), Positives = 59/82 (71%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FSTTGALE + + G +SLSEQ L+DC+ + N GCNGGL AF+YIK NGG+DTE Sbjct: 167 TFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTE 226 Query: 437 QTYPYEGVDDKCRYNPKNTGAE 502 + YPY G D+ C+++ +N G + Sbjct: 227 KAYPYTGKDETCKFSAENVGVQ 248 Score = 56.0 bits (129), Expect = 8e-07 Identities = 31/79 (39%), Positives = 42/79 (53%) Frame = +3 Query: 18 VSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVD 
197 +SYKLG+N++ D+ EF +T G A N + +KG LPE D Sbjct: 98 LSYKLGVNQFADLTWQEFQRTKLG----AAQNCSATLKGSH-------KVTEAALPETKD 146 Query: 198 WRKHGAVTDIKDQGKCGSC 254 WR+ G V+ +KDQG CGSC Sbjct: 147 WREDGIVSPVKDQGGCGSC 165 Score = 48.4 bits (110), Expect = 2e-04 Identities = 24/50 (48%), Positives = 31/50 (62%) Frame = +1 Query: 514 VDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 V+I G E +L AV V PVS+A + H SF+LY SGVY + C ST + Sbjct: 253 VNITLGAEDELKHAVGLVRPVSIAFEVIH-SFRLYKSGVYTDSHCGSTPM 301 >UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n=1; Rattus norvegicus|Rep: UPI0000501FDB UniRef100 entry - Rattus norvegicus Length = 338 Score = 103 bits (247), Expect = 4e-21 Identities = 42/79 (53%), Positives = 57/79 (72%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F GA+EGQ F+++G L LS QNL+DCS+ GN GC GG NAF+Y+ NGG+++E Sbjct: 147 AFPVVGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVLQNGGLESE 206 Query: 437 QTYPYEGVDDKCRYNPKNT 493 TYPYEG + CRYNP ++ Sbjct: 207 ATYPYEGKEGLCRYNPNSS 225 Score = 41.5 bits (93), Expect = 0.018 Identities = 18/44 (40%), Positives = 29/44 (65%) Frame = +1 Query: 523 PEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSS 654 P+ +E LM+AVAT PV+ I H+S + Y G+Y+E +C++ Sbjct: 235 PQKNEDVLMDAVATK-PVAAGIHVVHSSLRFYKKGIYHEPKCNN 277 >UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens (Human) Length = 331 Score = 102 bits (245), Expect = 7e-21 Identities = 47/82 (57%), Positives = 60/82 (73%), Gaps = 1/82 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCS-EQYGNNGCNGGLMDNAFKYIKDNGGIDT 433 +FS GALE Q ++G LVSLS QNL+DCS E+YGN GCNGG M AF+YI DN GID+ Sbjct: 141 AFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDS 200 Query: 434 EQTYPYEGVDDKCRYNPKNTGA 499 + +YPY+ +D KC+Y+ K A Sbjct: 201 DASYPYKAMDQKCQYDSKYRAA 222 Score = 62.1 bits (144), Expect = 1e-08 Identities = 32/84 (38%), Positives = 45/84 (53%) Frame = +3 Query: 3 
YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 + MG+ SY LGMN GDM E + M+ ++ +N+ K S N L Sbjct: 66 HSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNITYK----------SNPNRIL 115 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P+ VDWR+ G VT++K QG CG+C Sbjct: 116 PDSVDWREKGCVTEVKYQGSCGAC 139 Score = 56.8 bits (131), Expect = 4e-07 Identities = 26/47 (55%), Positives = 31/47 (65%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECS 651 + ++P G E L EAVA GPVSV +DA H SF LY SGVY E C+ Sbjct: 227 YTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCT 273 >UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 4 - Rhipicephalus appendiculatus (Brown ear tick) Length = 345 Score = 101 bits (243), Expect = 1e-20 Identities = 45/77 (58%), Positives = 63/77 (81%), Gaps = 2/77 (2%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQ-YGNNGCNGGLMDNAFKYIKDNGGIDT 433 +FS+TGALEGQ F+++ L+SLSEQNL+DC+ Q YGNNGCNGG M AF+Y++D GG+DT Sbjct: 152 AFSSTGALEGQVFKRTRRLISLSEQNLMDCAGQRYGNNGCNGGQMPGAFQYVQDAGGLDT 211 Query: 434 EQTYPY-EGVDDKCRYN 481 E YPY +G + +C+++ Sbjct: 212 EARYPYRQGTNFQCQFS 228 Score = 52.0 bits (119), Expect = 1e-05 Identities = 22/52 (42%), Positives = 32/52 (61%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 G +P +E+ L +AVA VGP+S+AI+AS +F Y +G+Y E C L Sbjct: 240 GHTRVPPRNERVLQDAVANVGPISIAINASPQTFMFYKNGIYGEPNCDPRGL 291 Score = 45.6 bits (103), Expect = 0.001 Identities = 23/84 (27%), Positives = 40/84 (47%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 ++ G + Y + +N + DM E V G+ + + +P Sbjct: 76 FKNGTLLYSVAVNHFADMTPDEVVANYTGYKPPSAQQ---------LAEIPLYAPLFGDT 126 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 PE ++WR++G VT +K+QG+CGSC Sbjct: 127 PEFIEWRENGFVTPVKNQGQCGSC 150 >UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn 
rootworm) Length = 326 Score = 101 bits (242), Expect = 2e-20 Identities = 45/82 (54%), Positives = 61/82 (74%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 SFSTTG +EG +F ++G LVSLSEQNL+DC+++ GC+GG MD A +YI+ GGI +E Sbjct: 136 SFSTTGTVEGAYFLKTGKLVSLSEQNLVDCAKE-DCYGCSGGYMDKALEYIETAGGIMSE 194 Query: 437 QTYPYEGVDDKCRYNPKNTGAE 502 YPYEG+DDKCR++ A+ Sbjct: 195 NDYPYEGIDDKCRFDSSKVAAK 216 Score = 61.7 bits (143), Expect = 2e-08 Identities = 32/84 (38%), Positives = 49/84 (58%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 Y+ GL ++KLG+ K+ D+ EF M G +++ K ++ R ++P L Sbjct: 61 YDHGLSTFKLGVTKFADLTEKEF-SDMLGISRSTKSSRP--------RVIHSLTPVK-DL 110 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P + DWR+ GAVT++KDQG CGSC Sbjct: 111 PSKFDWREKGAVTEVKDQGSCGSC 134 Score = 47.6 bits (108), Expect = 3e-04 Identities = 24/48 (50%), Positives = 30/48 (62%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSS 654 F I + DE L AV GP+SVAIDAS +FQLY SG+ ++ C S Sbjct: 220 FTYIKKNDEDDLKNAVIAKGPISVAIDASF-NFQLYDSGILDDSSCYS 266 >UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheirus salmonis|Rep: Putative cathepsin L - Lepeophtheirus salmonis (salmon louse) Length = 257 Score = 101 bits (242), Expect = 2e-20 Identities = 47/87 (54%), Positives = 60/87 (68%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FSTTG++EGQ+F ++ L+S SEQ L+DCS + N GCNGG MDNAFKY+ N GI TE Sbjct: 64 AFSTTGSVEGQYFIKNKKLLSFSEQQLVDCSSDFRNEGCNGGWMDNAFKYLIANKGIATE 123 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVASW 517 TYPY D C YN K A ++S+ Sbjct: 124 DTYPYTATDGVCVYN-KTMAAGRISSF 149 Score = 52.4 bits (120), Expect = 9e-06 Identities = 24/44 (54%), Positives = 29/44 (65%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEE 642 F D+ G E +L AVA +GP+SVAIDAS FQ Y GVY +E Sbjct: 149 FKDVKHGSEDQLKLAVAQIGPISVAIDASSGDFQFYKKGVYVDE 192 Score = 49.6 bits (113), Expect = 7e-05 Identities = 27/73 (36%), 
Positives = 37/73 (50%) Frame = +3 Query: 36 MNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGA 215 MN+YGD+L EF++ G K + N + S +P V+W K+GA Sbjct: 1 MNQYGDLLQSEFLQGYTGLAKGSYSGDNTVILDNSA-----------PVPSYVNWTKNGA 49 Query: 216 VTDIKDQGKCGSC 254 VT +KDQ CGSC Sbjct: 50 VTAVKDQKDCGSC 62 >UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor; n=17; Magnoliophyta|Rep: Thiol protease aleurain-like precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 358 Score = 100 bits (240), Expect = 3e-20 Identities = 43/82 (52%), Positives = 58/82 (70%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FSTTGALE + + G +SLSEQ L+DC+ + N GC+GGL AF+YIK NGG+DTE Sbjct: 167 TFSTTGALEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTE 226 Query: 437 QTYPYEGVDDKCRYNPKNTGAE 502 + YPY G D C+++ KN G + Sbjct: 227 EAYPYTGKDGGCKFSAKNIGVQ 248 Score = 48.8 bits (111), Expect = 1e-04 Identities = 29/79 (36%), Positives = 43/79 (54%) Frame = +3 Query: 18 VSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVD 197 +SYKL +N++ D+ EF + G A N + +KG I+ A V P+ D Sbjct: 98 LSYKLSLNQFADLTWQEFQRYKLG----AAQNCSATLKGSHK-----ITEATV--PDTKD 146 Query: 198 WRKHGAVTDIKDQGKCGSC 254 WR+ G V+ +K+QG CGSC Sbjct: 147 WREDGIVSPVKEQGHCGSC 165 >UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like midgut cysteine proteinase - Tenebrio molitor (Yellow mealworm) Length = 330 Score = 99.5 bits (237), Expect = 6e-20 Identities = 48/75 (64%), Positives = 54/75 (72%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 SFSTTGA+EGQ Q G L SLSEQNLIDCS YGN GC+GG MD+AF YI D GI +E Sbjct: 142 SFSTTGAVEGQLALQRGRLTSLSEQNLIDCSSSYGNAGCDGGWMDSAFSYIHDY-GIMSE 200 Query: 437 QTYPYEGVDDKCRYN 481 YPYE D CR++ Sbjct: 201 SAYPYEAQGDYCRFD 215 Score = 57.6 bits (133), Expect = 2e-07 Identities = 34/85 (40%), Positives = 50/85 (58%), Gaps = 1/85 (1%) Frame = +3 Query: 3 
YEMGLVSYKLGMNKYGDMLHHEFVKTMN-GFNKTAKHNKNLYMKGGSVRGAKFISPANVK 179 +E G V+Y MN++GDM EF+ +N G + KH +NL M ++S + Sbjct: 66 FEKGEVTYSKAMNQFGDMSKEEFLAYVNRGKAQKPKHPENLRMP--------YVS-SKKP 116 Query: 180 LPEQVDWRKHGAVTDIKDQGKCGSC 254 L VDWR + AV+++KDQG+CGSC Sbjct: 117 LAASVDWRSN-AVSEVKDQGQCGSC 140 Score = 56.0 bits (129), Expect = 8e-07 Identities = 24/52 (46%), Positives = 35/52 (67%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 G+ D+P GDE L +AV GPV+VAIDA+ Q YS G++ ++ C+ +DL Sbjct: 225 GYYDLPSGDENSLADAVGQAGPVAVAIDAT-DELQFYSGGLFYDQTCNQSDL 275 >UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; n=35; Fasciola|Rep: Cathepsin L-like proteinase precursor - Fasciola hepatica (Liver fluke) Length = 326 Score = 99.5 bits (237), Expect = 6e-20 Identities = 43/87 (49%), Positives = 61/87 (70%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FSTTG +EGQ+ + +S SEQ L+DCS +GNNGC+GGLM+NA++Y+K G++TE Sbjct: 134 AFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLK-QFGLETE 192 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVASW 517 +YPY V+ +CRYN K G V + Sbjct: 193 SSYPYTAVEGQCRYN-KQLGVAKVTGY 218 Score = 58.4 bits (135), Expect = 1e-07 Identities = 29/84 (34%), Positives = 46/84 (54%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 +++GLV+Y LG+N++ DM EF AK+ + + N + Sbjct: 58 HDLGLVTYTLGLNQFTDMTFEEF---------KAKYLTEMSRASDILSHGVPYEANNRAV 108 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P+++DWR+ G VT++KDQG CGSC Sbjct: 109 PDKIDWRESGYVTEVKDQGNCGSC 132 Score = 37.1 bits (82), Expect = 0.38 Identities = 16/48 (33%), Positives = 25/48 (52%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECS 651 G+ + G E +L V P +VA+D + F +Y SG+Y + CS Sbjct: 217 GYYTVHSGSEVELKNLVGARRPAAVAVDVE-SDFMMYRSGIYQSQTCS 263 >UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 664 Score = 
99.1 bits (236), Expect = 8e-20 Identities = 42/77 (54%), Positives = 57/77 (74%), Gaps = 2/77 (2%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDC--SEQYGNNGCNGGLMDNAFKYIKDNGGID 430 +FST GALE ++R++ ++ LSEQNL+DC S +Y N GC+GG M N + YI++NGGI+ Sbjct: 496 AFSTVGALESHYYRKNNRMLDLSEQNLVDCTASNKYRNGGCSGGWMHNCYSYIQENGGIN 555 Query: 431 TEQTYPYEGVDDKCRYN 481 E TYPYEG +CRYN Sbjct: 556 QESTYPYEGKFGQCRYN 572 Score = 52.4 bits (120), Expect = 9e-06 Identities = 24/47 (51%), Positives = 31/47 (65%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECS 651 FV I + DE+ L + VA+VGPVSVA DAS F YS G+Y + C+ Sbjct: 583 FVMIKQHDEEDLADTVASVGPVSVAYDASTREFMYYSRGIYYSDNCN 629 Score = 37.5 bits (83), Expect = 0.28 Identities = 13/24 (54%), Positives = 17/24 (70%) Frame = +3 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P +DWR G V+ +K+QG CGSC Sbjct: 471 PISIDWRTWGMVSKVKNQGSCGSC 494 >UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteinase A; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like tick cysteine proteinase A - Haemaphysalis longicornis (Bush tick) Length = 312 Score = 99.1 bits (236), Expect = 8e-20 Identities = 43/62 (69%), Positives = 53/62 (85%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FSTTG+LEGQHFR++ V+ EQNL+DCS+ +GN GCNGGLMDN F+YIK NGGIDTE Sbjct: 119 AFSTTGSLEGQHFRKTESRVT-GEQNLVDCSDDFGNQGCNGGLMDNGFQYIKANGGIDTE 177 Query: 437 QT 442 +T Sbjct: 178 ET 179 Score = 37.1 bits (82), Expect = 0.38 Identities = 13/25 (52%), Positives = 18/25 (72%) Frame = +3 Query: 180 LPEQVDWRKHGAVTDIKDQGKCGSC 254 LP VDW + G+ +K+QG+CGSC Sbjct: 93 LPTTVDWAQEGSRAPVKNQGQCGSC 117 >UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 protein, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to MGC81823 protein, partial - Ornithorhynchus anatinus Length = 361 Score = 97.9 bits (233), Expect = 2e-19 Identities = 38/66 (57%), Positives = 54/66 (81%) Frame = +2 Query: 257 
SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F +TG LEGQ FR++G L ++SEQNL+DCS + GN GC+GGLM +F Y++DNGG+D+E Sbjct: 216 AFGSTGVLEGQLFRRTGRLAAVSEQNLMDCSRKQGNRGCDGGLMQQSFLYVRDNGGVDSE 275 Query: 437 QTYPYE 454 + YPY+ Sbjct: 276 EAYPYD 281 Score = 56.4 bits (130), Expect = 6e-07 Identities = 27/63 (42%), Positives = 36/63 (57%) Frame = +3 Query: 66 EFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKC 245 EF MNG+ K A+ + S + F+ P + PE +DWR HG VT +KDQG+C Sbjct: 157 EFAAAMNGY-KAARGVE----ASASASASAFLGPNGTEPPEALDWRDHGYVTPVKDQGRC 211 Query: 246 GSC 254 GSC Sbjct: 212 GSC 214 >UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3; Bilateria|Rep: Cathepsin L-like cysteine protease - Neobenedenia melleni Length = 335 Score = 97.9 bits (233), Expect = 2e-19 Identities = 42/88 (47%), Positives = 62/88 (70%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS+TG++EG R +G L+S SEQ L+DCS +GN+GCNGG+MDN+F Y+ N G+++E Sbjct: 144 AFSSTGSIEGAVKRATGKLISFSEQQLVDCSTAFGNHGCNGGIMDNSFNYLIHNKGLESE 203 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVASWT 520 +YPYE +CRY K ++S+T Sbjct: 204 ASYPYEAQKKECRYK-KALSKGTISSFT 230 Score = 66.9 bits (156), Expect = 4e-10 Identities = 31/51 (60%), Positives = 37/51 (72%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 F D+ + DE+ L AV VGPVS+AIDAS SF LY SGVY+EE+CS T L Sbjct: 229 FTDVSQFDEKDLKRAVGLVGPVSIAIDASQFSFHLYDSGVYDEEDCSQTML 279 Score = 40.7 bits (91), Expect = 0.031 Identities = 25/84 (29%), Positives = 34/84 (40%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 Y G SY L MN D+ EF K + + G G Sbjct: 65 YAQGKKSYTLAMNHMADLSSEEF----KALYLVPKFDATKVPRKGKAAGEH--RQIKNDP 118 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P ++DW + G VT +K+Q +CGSC Sbjct: 119 PSEIDWVRKGHVTAVKNQAQCGSC 142 >UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Cathepsin K - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 333 Score = 96.7 bits (230), Expect = 
4e-19 Identities = 42/81 (51%), Positives = 55/81 (67%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS+ GALEGQ + G LV LS QNL+DC + N+GC GG M NAF+Y+ +N GID+E Sbjct: 144 AFSSVGALEGQLMKTKGQLVDLSPQNLVDCVTE--NDGCGGGYMTNAFRYVSNNQGIDSE 201 Query: 437 QTYPYEGVDDKCRYNPKNTGA 499 ++YPY G D +C YN A Sbjct: 202 ESYPYVGTDQQCAYNTSGVAA 222 Score = 61.7 bits (143), Expect = 2e-08 Identities = 27/53 (50%), Positives = 37/53 (69%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 RG+ +IP+G+E+ L AVA VGPVSV IDA ++F Y SGVY + C+ D+ Sbjct: 225 RGYKEIPQGNERALTAAVANVGPVSVGIDAMQSTFLYYKSGVYYDPNCNKEDV 277 Score = 56.8 bits (131), Expect = 4e-07 Identities = 32/85 (37%), Positives = 46/85 (54%), Gaps = 1/85 (1%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANV-K 179 YE+G+ +Y LGMN +GDM E + + G +Y + F+ V K Sbjct: 68 YELGIHTYDLGMNHFGDMTLEEVAEKVMGLQMP------MYRDPANT----FVPDDRVGK 117 Query: 180 LPEQVDWRKHGAVTDIKDQGKCGSC 254 LP+ +D+RK G VT +K+QG CGSC Sbjct: 118 LPKSIDYRKLGYVTSVKNQGSCGSC 142 >UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Toxopain-2 - Toxoplasma gondii Length = 422 Score = 95.9 bits (228), Expect = 8e-19 Identities = 42/73 (57%), Positives = 55/73 (75%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FSTTGALEG H ++G LVSLSEQ L+DCS GN C+GG M++AF+Y+ D+GGI +E Sbjct: 231 AFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSE 290 Query: 437 QTYPYEGVDDKCR 475 YPY D++CR Sbjct: 291 DAYPYLARDEECR 303 Score = 55.6 bits (128), Expect = 1e-06 Identities = 31/78 (39%), Positives = 41/78 (52%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 200 SY L MN +GD+ EF + GF K+ +NL V + ++ +LP VDW Sbjct: 157 SYSLKMNHFGDLSRDEFRRKYLGFKKS----RNLKSHHLGV-ATELLNVLPSELPAGVDW 211 Query: 201 RKHGAVTDIKDQGKCGSC 254 R G VT +KDQ CGSC Sbjct: 212 RSRGCVTPVKDQRDCGSC 229 Score = 39.1 bits (87), Expect = 0.093 Identities = 22/52 (42%), 
Positives = 29/52 (55%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 GF D+P E + A+A PVS+AI+A FQ Y GV+ + C TDL Sbjct: 315 GFKDVPRRSEAAMKAALAK-SPVSIAIEADQMPFQFYHEGVF-DASC-GTDL 363 >UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin heavy chain; n=3; Amniota|Rep: PREDICTED: similar to ferritin heavy chain - Ornithorhynchus anatinus Length = 338 Score = 95.1 bits (226), Expect = 1e-18 Identities = 44/76 (57%), Positives = 54/76 (71%), Gaps = 1/76 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS TGALE F+ +G +VSLSEQNL+DCS + GN GC GG AF+Y++ NGGID E Sbjct: 146 AFSATGALEALVFKTTGKMVSLSEQNLVDCSWRQGNVGCRGGQYIGAFEYVRANGGIDAE 205 Query: 437 QTYPYEGVDD-KCRYN 481 YPY G DD CRY+ Sbjct: 206 DLYPYLGRDDISCRYS 221 Score = 60.9 bits (141), Expect = 3e-08 Identities = 32/81 (39%), Positives = 48/81 (59%) Frame = +3 Query: 12 GLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQ 191 G SY+L MN +GD + E + +NGF + + ++ G + A+F S + + PE+ Sbjct: 69 GKHSYRLAMNHFGDQTNEELHERLNGF----RPDLGGALRSGREQ-ARFRSKTSWEGPEE 123 Query: 192 VDWRKHGAVTDIKDQGKCGSC 254 VDWR G VT +K+QG CGSC Sbjct: 124 VDWRTKGYVTPVKNQGLCGSC 144 Score = 45.2 bits (102), Expect = 0.001 Identities = 21/47 (44%), Positives = 32/47 (68%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECS 651 ++ + + +EQ L +AVATVGPVSVA+DA F Y SG+++ C+ Sbjct: 232 YMVVDQDNEQALEQAVATVGPVSVAVDA--RPFFFYHSGIFSSHSCT 276 >UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine protease; n=11; Callosobruchus maculatus|Rep: Putative gut cathepsin L-like cysteine protease - Callosobruchus maculatus (Southern cowpea weevil) (Pulse bruchid) Length = 326 Score = 94.7 bits (225), Expect = 2e-18 Identities = 42/74 (56%), Positives = 55/74 (74%), Gaps = 1/74 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDC-SEQYGNNGCNGGLMDNAFKYIKDNGGIDT 433 +FS GA+EGQ F+++G LVSLS Q L+DC +E YGNNGC GGLM AF +++D GI T Sbjct: 138 
AFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQDE-GIQT 196 Query: 434 EQTYPYEGVDDKCR 475 E++YPYEG C+ Sbjct: 197 EESYPYEGRRSSCK 210 Score = 46.8 bits (106), Expect = 5e-04 Identities = 27/84 (32%), Positives = 41/84 (48%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 YE G S+ + ++ DM H EF+ + A + +V F +++ Sbjct: 61 YERGEESFAKKVTQFADMTHEEFLDLLKLQGVPA-------LPSNAVHFDNF-EDIDMEE 112 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 + VDWR+ GAVT +KDQ CGSC Sbjct: 113 KDAVDWREEGAVTPVKDQANCGSC 136 Score = 37.9 bits (84), Expect = 0.22 Identities = 20/42 (47%), Positives = 27/42 (64%), Gaps = 1/42 (2%) Frame = +1 Query: 532 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEE-ECSS 654 DEQ++ VA GPV+VAI+AS SF Y G+ +E CS+ Sbjct: 227 DEQEMARTVAAKGPVAVAIEASQLSF--YDKGIVDERCRCSN 266 >UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=176; Viridiplantae|Rep: Cysteine proteinase RD21a precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 462 Score = 94.7 bits (225), Expect = 2e-18 Identities = 45/87 (51%), Positives = 59/87 (67%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FST GA+EG + +G L++LSEQ L+DC Y N GCNGGLMD AF++I NGGIDT+ Sbjct: 163 AFSTIGAVEGINQIVTGDLITLSEQELVDCDTSY-NEGCNGGLMDYAFEFIIKNGGIDTD 221 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVASW 517 + YPY+GVD C KN + S+ Sbjct: 222 KDYPYKGVDGTCDQIRKNAKVVTIDSY 248 Score = 58.8 bits (136), Expect = 1e-07 Identities = 30/79 (37%), Positives = 45/79 (56%) Frame = +3 Query: 18 VSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVD 197 +SY+LG+ ++ D+ + E+ G AK K KG ++ + +LPE +D Sbjct: 91 LSYRLGLTRFADLTNDEYRSKYLG----AKMEK----KGERRTSLRYEARVGDELPESID 142 Query: 198 WRKHGAVTDIKDQGKCGSC 254 WRK GAV ++KDQG CGSC Sbjct: 143 WRKKGAVAEVKDQGGCGSC 161 Score = 39.9 bits (89), Expect = 0.053 Identities = 18/42 (42%), Positives = 29/42 (69%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYN 636 + D+P E+ L +AVA P+S+AI+A +FQLY SG+++ Sbjct: 248 
YEDVPTYSEESLKKAVAHQ-PISIAIEAGGRAFQLYDSGIFD 288 >UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropicalis|Rep: LOC594890 protein - Xenopus tropicalis (Western clawed frog) (Silurana tropicalis) Length = 355 Score = 94.3 bits (224), Expect = 2e-18 Identities = 44/77 (57%), Positives = 57/77 (74%), Gaps = 1/77 (1%) Frame = +2 Query: 257 SFSTTGALEGQHF-RQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDT 433 +FS+ GALE Q+ R++G L SLS QNL+DCS+ YGNNGC GG + ++F+YI DN GI+ Sbjct: 165 AFSSIGALECQNMKRRTGKLESLSVQNLLDCSQTYGNNGCKGGWVVSSFRYIIDN-GIEL 223 Query: 434 EQTYPYEGVDDKCRYNP 484 E YPY+G D KC Y P Sbjct: 224 ESNYPYQGKDGKCSYTP 240 Score = 56.0 bits (129), Expect = 8e-07 Identities = 24/46 (52%), Positives = 33/46 (71%) Frame = +1 Query: 520 IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSST 657 +P GDE L + V +GPVSVAIDAS +F++Y +GVY + CSS+ Sbjct: 253 LPYGDEATLKQVVGLMGPVSVAIDASRKTFRMYKNGVYYDPNCSSS 298 Score = 47.2 bits (107), Expect = 4e-04 Identities = 29/82 (35%), Positives = 43/82 (52%), Gaps = 1/82 (1%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFV-KTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK 179 Y MGL +Y++GMN GDM+ E K MN + + ++ ++ IS ++ Sbjct: 90 YSMGLHTYEVGMNHLGDMVAEEMTDKQMNFIPQVIANITDVPVE---------ISKSSP- 139 Query: 180 LPEQVDWRKHGAVTDIKDQGKC 245 PE +DWR VT +KDQG C Sbjct: 140 -PESIDWRNKNCVTSVKDQGSC 160 >UniRef50_Q24E33 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 328 Score = 93.9 bits (223), Expect = 3e-18 Identities = 42/73 (57%), Positives = 53/73 (72%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FSTTGALEG +F ++ L+S SEQ L+DCS Y N GCNGGLM AF+Y+K + GI TE Sbjct: 153 AFSTTGALEGSYFLKNNQLISFSEQQLVDCSRLYLNMGCNGGLMPRAFRYVKAH-GITTE 211 Query: 437 QTYPYEGVDDKCR 475 + YPY D KC+ Sbjct: 212 EEYPYTAKDGKCQ 224 Score = 40.7 bits (91), Expect = 0.031 Identities = 14/25 (56%), Positives = 19/25 (76%) Frame = +3 
Query: 180 LPEQVDWRKHGAVTDIKDQGKCGSC 254 +P +V+W GAVT +K+QG CGSC Sbjct: 127 IPSEVNWTAQGAVTPVKNQGSCGSC 151 Score = 36.3 bits (80), Expect = 0.66 Identities = 19/44 (43%), Positives = 29/44 (65%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYN 636 + F +P G+ KL A+A PVSV +DA T+F+ Y+SGV++ Sbjct: 233 KSFSTVPRGNCDKLAAAIAQQ-PVSVGVDA--TNFKFYTSGVFD 273 >UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; Dictyostelium discoideum|Rep: Cysteine proteinase 2 precursor - Dictyostelium discoideum (Slime mold) Length = 376 Score = 93.9 bits (223), Expect = 3e-18 Identities = 46/82 (56%), Positives = 55/82 (67%), Gaps = 1/82 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 SFSTTG+ EG H ++ LVSLSEQNL+DCS N GC+GGLM+NAF YI N GIDTE Sbjct: 149 SFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAFDYIIKNKGIDTE 208 Query: 437 QTYPYEG-VDDKCRYNPKNTGA 499 +YPY C +N + GA Sbjct: 209 SSYPYTAETGSTCLFNKSDIGA 230 Score = 63.3 bits (147), Expect = 5e-09 Identities = 32/53 (60%), Positives = 39/53 (73%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 +G+V+I G E L E A GPVSVAIDASH SFQLY+SG+Y E +CS T+L Sbjct: 233 KGYVNITAGSEISL-ENGAQHGPVSVAIDASHNSFQLYTSGIYYEPKCSPTEL 284 Score = 53.2 bits (122), Expect = 5e-06 Identities = 30/75 (40%), Positives = 41/75 (54%) Frame = +3 Query: 30 LGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKH 209 LG+N + D+ + E+ KT G A H+ N Y G V + + P+ +DWR Sbjct: 79 LGLNNFADITNEEYRKTYLGTRVNA-HSYNGY-DGREVLNVEDLQTN----PKSIDWRTK 132 Query: 210 GAVTDIKDQGKCGSC 254 AVT IKDQG+CGSC Sbjct: 133 NAVTPIKDQGQCGSC 147 >UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platyhelminthes|Rep: Cathepsin L-like proteinase - Echinococcus multilocularis Length = 338 Score = 93.5 bits (222), Expect = 4e-18 Identities = 42/75 (56%), Positives = 55/75 (73%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS TGALEGQ R++G L+SLSEQ L+DCS GN GCNGG M++AF+Y 
NG ++E Sbjct: 148 AFSATGALEGQLKRKTGKLISLSEQQLVDCSTYTGNEGCNGGDMNDAFRYWMRNGA-ESE 206 Query: 437 QTYPYEGVDDKCRYN 481 YPY +D KC++N Sbjct: 207 SDYPYTAMDGKCKFN 221 Score = 55.2 bits (127), Expect = 1e-06 Identities = 29/84 (34%), Positives = 40/84 (47%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 Y +GL +Y +N + D+ EF + +T M V P + + Sbjct: 68 YYLGLETYSTALNAFADLTLEEFAEKYLTLKQTPMEGIWQDMSTQYVE-----RPTRMLV 122 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P+ +DWRK G VT IKDQG CGSC Sbjct: 123 PDSIDWRKKGLVTPIKDQGDCGSC 146 Score = 54.4 bits (125), Expect = 2e-06 Identities = 24/47 (51%), Positives = 32/47 (68%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECS 651 FV +P+ E +L +VA VGPVSVAIDA+ + F LY G+Y + CS Sbjct: 232 FVKVPKKREDQLKLSVAQVGPVSVAIDATSSGFMLYKKGIYQDNTCS 278 >UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2; Taeniidae|Rep: Cathepsin L-like cysteine proteinase - Taenia solium (Pork tapeworm) Length = 339 Score = 93.1 bits (221), Expect = 5e-18 Identities = 41/75 (54%), Positives = 55/75 (73%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS+TGALEG +++G L+SLSEQ L+DCS + GN+GCNGG M AFKY++++ I+ E Sbjct: 150 AFSSTGALEGAFAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYMSYAFKYLEEH-FIEPE 208 Query: 437 QTYPYEGVDDKCRYN 481 YPY D CRYN Sbjct: 209 SAYPYRATDGPCRYN 223 Score = 64.9 bits (151), Expect = 2e-09 Identities = 29/46 (63%), Positives = 33/46 (71%) Frame = +1 Query: 517 DIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSS 654 DIPEG+E LMEAVATVGP+S+AIDAS F Y G+Y CSS Sbjct: 236 DIPEGNETALMEAVATVGPISIAIDASSLGFMFYRHGIYKSHWCSS 281 Score = 52.0 bits (119), Expect = 1e-05 Identities = 29/84 (34%), Positives = 44/84 (52%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 + GL SY G+N++ D+ EF + G ++ + G R K ++ A L Sbjct: 72 FNAGLESYSTGLNQFADLESSEFSERFLGTRPESR------VAGRRGRIWKALASA-AGL 124 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P+ VDWR VT++K+QG CGSC Sbjct: 125 
PDTVDWRDKNLVTEVKNQGNCGSC 148 >UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; n=6; Chrysomelinae|Rep: Cathepsin L-like proteinase precursor - Phaedon cochleariae (Mustard beetle) Length = 324 Score = 92.7 bits (220), Expect = 7e-18 Identities = 41/96 (42%), Positives = 59/96 (61%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 + ST A+E Q +SG V LS Q L+DCS YGN+GCNGG N F+Y+KDN G++++ Sbjct: 136 ALSTAAAIESQSAIKSGSKVPLSPQQLVDCSTSYGNHGCNGGFAVNGFEYVKDN-GLESD 194 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVASWTSPRATNRS 544 YPY G +DKC+ N K+ ++ + A+ S Sbjct: 195 ADYPYSGKEDKCKANDKSRSVVELTGYKKVTASETS 230 Score = 48.0 bits (109), Expect = 2e-04 Identities = 27/84 (32%), Positives = 42/84 (50%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 YE G +Y L +NK+ D+ EF + M N+ ++ N + G + Sbjct: 61 YENGESTYYLAINKFSDITDEEF-RDMLMKNEASRPN---------LEGLEVADLTVGAA 110 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 PE +DWR G V +++QG+CGSC Sbjct: 111 PESIDWRSKGVVLPVRNQGECGSC 134 >UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; Dictyostelium discoideum|Rep: Cysteine proteinase 1 precursor - Dictyostelium discoideum (Slime mold) Length = 343 Score = 92.3 bits (219), Expect = 9e-18 Identities = 47/91 (51%), Positives = 56/91 (61%), Gaps = 9/91 (9%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDC--------SEQYGNNGCNGGLMDNAFKYIK 412 SFSTTG +EGQHF LVSLSEQNL+DC E+ + GCNGGL NA+ YI Sbjct: 144 SFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYII 203 Query: 413 DNGGIDTEQTYPYEG-VDDKCRYNPKNTGAE 502 NGGI TE +YPY +C +N N GA+ Sbjct: 204 KNGGIQTESSYPYTAETGTQCNFNSANIGAK 234 Score = 47.2 bits (107), Expect = 4e-04 Identities = 29/76 (38%), Positives = 40/76 (52%) Frame = +3 Query: 27 KLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRK 206 K G+NK+ D+ EF K NK A +L + +FI+ +P DWR Sbjct: 74 KFGVNKFADLSSDEF-KNYYLNNKEAIFTDDLPV--ADYLDDEFIN----SIPTAFDWRT 126 Query: 207 HGAVTDIKDQGKCGSC 254 GAVT +K+QG+CGSC Sbjct: 
127 RGAVTPVKNQGQCGSC 142 >UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|Rep: LD36817p - Drosophila melanogaster (Fruit fly) Length = 352 Score = 91.9 bits (218), Expect = 1e-17 Identities = 39/75 (52%), Positives = 54/75 (72%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 SF+TTGALEG FR++G L SLS+QNL+DC++ YGN GC+GG + F+YI+D+ G+ Sbjct: 157 SFATTGALEGHLFRRTGVLASLSQQNLVDCADDYGNMGCDGGFQEYGFEYIRDH-GVTLA 215 Query: 437 QTYPYEGVDDKCRYN 481 YPY + +CR N Sbjct: 216 NKYPYTQTEMQCRQN 230 Score = 57.2 bits (132), Expect = 3e-07 Identities = 22/53 (41%), Positives = 37/53 (69%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 R + I GDE+K+ E +AT+GP++ +++A SF+ YS G+Y +EEC+ +L Sbjct: 245 RDYATITPGDEEKMKEVIATLGPLACSMNADTISFEQYSGGIYEDEECNQGEL 297 Score = 44.0 bits (99), Expect = 0.003 Identities = 27/82 (32%), Positives = 42/82 (51%), Gaps = 1/82 (1%) Frame = +3 Query: 12 GLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQ 191 G+ ++LG+N DM E + T+ G +K ++ + G + +PA+ LPE Sbjct: 78 GVSGFRLGVNTLADMTRKE-IATLLG-SKISEFGERY--TNGHINFVTARNPASANLPEM 133 Query: 192 VDWRKHGAVTDIKDQG-KCGSC 254 DWR+ G VT QG CG+C Sbjct: 134 FDWREKGGVTPPGFQGVGCGAC 155 >UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina). Cathepsin L; n=2; Dictyostelium discoideum|Rep: Similar to Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina). 
Cathepsin L - Dictyostelium discoideum (Slime mold) Length = 265 Score = 91.1 bits (216), Expect = 2e-17 Identities = 39/82 (47%), Positives = 54/82 (65%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 SFS GALEG ++ + G L+ LSEQNL+DC+ +G GC G M +AFKYI +GG++ E Sbjct: 73 SFSALGALEGHYYIKYGELLDLSEQNLVDCATPFGPKGCKTGWMHDAFKYIISSGGVNLE 132 Query: 437 QTYPYEGVDDKCRYNPKNTGAE 502 YPY G D+ C++N A+ Sbjct: 133 SQYPYTGKDEVCKFNQSEKEAK 154 Score = 53.6 bits (123), Expect = 4e-06 Identities = 25/47 (53%), Positives = 30/47 (63%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEEC 648 GFV IP+ DE LMEA+A GPV+V ID S FQ S G+Y + C Sbjct: 157 GFVMIPKFDESALMEAIALYGPVAVPIDTSTKEFQHLSGGIYYSDSC 203 Score = 48.4 bits (110), Expect = 2e-04 Identities = 23/75 (30%), Positives = 36/75 (48%) Frame = +3 Query: 30 LGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKH 209 + +N+Y D+ EF F K ++ + ++ F N +P+ DWR H Sbjct: 1 MDLNEYSDLTQKEFADKF--FEKLVPEPRSGPIN--DIKATPFKHNVNATIPKSFDWRDH 56 Query: 210 GAVTDIKDQGKCGSC 254 GAV +K+QG C SC Sbjct: 57 GAVGKVKNQGSCASC 71 >UniRef50_Q22RR8 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 90.6 bits (215), Expect = 3e-17 Identities = 43/73 (58%), Positives = 50/73 (68%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS TGALE F +G L SLSEQ L+DCS YGN GC+GG MD AFK+I DN I TE Sbjct: 151 AFSATGALESATFISTGTLPSLSEQELVDCSTSYGNEGCDGGDMDAAFKFIHDN-NIATE 209 Query: 437 QTYPYEGVDDKCR 475 + Y Y G D KC+ Sbjct: 210 KEYTYRGFDQKCK 222 Score = 42.3 bits (95), Expect = 0.010 Identities = 20/49 (40%), Positives = 29/49 (59%), Gaps = 2/49 (4%) Frame = +3 Query: 171 NVKLPEQV--DWRKHGAVTDIKDQGKCGSCGPSARLELWKDSTSVSPAT 311 N+KL + + DW K GAVT +KDQ +CGSC + + +T +S T Sbjct: 120 NMKLGDDIIIDWTKKGAVTPVKDQEQCGSCWAFSATGALESATFISTGT 168 >UniRef50_P25804 
Cluster: Cysteine proteinase 15A precursor; n=35; Viridiplantae|Rep: Cysteine proteinase 15A precursor - Pisum sativum (Garden pea) Length = 363 Score = 90.6 bits (215), Expect = 3e-17 Identities = 43/82 (52%), Positives = 59/82 (71%), Gaps = 7/82 (8%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCS-----EQYGN--NGCNGGLMDNAFKYIKD 415 +FSTTGALEG H+ +G LVSLSEQ L+DC EQ G+ +GCNGGLM+NAF+Y+ + Sbjct: 158 AFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLE 217 Query: 416 NGGIDTEQTYPYEGVDDKCRYN 481 +GG+ E+ Y Y G D C+++ Sbjct: 218 SGGVVQEKDYAYTGRDGSCKFD 239 Score = 48.8 bits (111), Expect = 1e-04 Identities = 27/74 (36%), Positives = 37/74 (50%) Frame = +3 Query: 33 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHG 212 G+ K+ D+ EF + G K + + + A + N LPE DWR+ G Sbjct: 92 GITKFSDLTASEFRRQFLGLKKRLRLPAH-------AQKAPILPTTN--LPEDFDWREKG 142 Query: 213 AVTDIKDQGKCGSC 254 AVT +KDQG CGSC Sbjct: 143 AVTPVKDQGSCGSC 156 >UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cathepsin L; n=4; Danio rerio|Rep: Novel protein similar to vertebrate cathepsin L - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 334 Score = 89.8 bits (213), Expect = 5e-17 Identities = 42/94 (44%), Positives = 59/94 (62%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 SFSTTGA+EGQ ++ +G LVSLSEQ L+DCS YG GC+G M NA+ Y+ +N +++ Sbjct: 144 SFSTTGAIEGQMYKHTGRLVSLSEQQLVDCSRSYGTYGCSGAWMANAYDYVINN-ALESS 202 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVASWTSPRATN 538 TYPY VD + + KN ++ + A N Sbjct: 203 DTYPYTSVDTQPCFYEKNLAMAGISDYRFVPAGN 236 Score = 65.3 bits (152), Expect = 1e-09 Identities = 29/48 (60%), Positives = 36/48 (75%) Frame = +1 Query: 520 IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 +P G+EQ L +AVATVGPVSVAIDA + SF YSSG+Y E C+ +L Sbjct: 232 VPAGNEQALADAVATVGPVSVAIDADNPSFLFYSSGIYKESNCNPNNL 279 Score = 52.8 bits (121), Expect = 7e-06 Identities = 29/85 (34%), Positives = 43/85 (50%), Gaps = 1/85 (1%) Frame = +3 Query: 3 
YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVR-GAKFISPANVK 179 + GL +K+ MNKYGD+ E+ + + K + K +R AK + N+ Sbjct: 64 FSFGLSMFKMAMNKYGDLTSVEYKRLLGSKIKGTGNRKGKITSAQMLRLNAKRLGVTNI- 122 Query: 180 LPEQVDWRKHGAVTDIKDQGKCGSC 254 D+R G VT++KDQG CGSC Sbjct: 123 -----DYRAKGYVTEVKDQGYCGSC 142 >UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcoptes scabiei type hominis|Rep: Cathepsin L-like protease - Sarcoptes scabiei type hominis Length = 245 Score = 89.4 bits (212), Expect = 7e-17 Identities = 39/78 (50%), Positives = 54/78 (69%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS ++E Q+ ++G LV LSEQ L+DCS GN GC+GG MD+AF+++ GIDTE Sbjct: 146 AFSAVASMESQNALKTGQLVELSEQELVDCSVGEGNEGCDGGWMDSAFEFVIKADGIDTE 205 Query: 437 QTYPYEGVDDKCRYNPKN 490 ++YPY GV+ CR KN Sbjct: 206 KSYPYHGVNQVCRSYQKN 223 Score = 50.8 bits (116), Expect = 3e-05 Identities = 30/84 (35%), Positives = 46/84 (54%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 YE GL +Y+LG+N++ D+ + E+ MN KH+ ++ V + +S L Sbjct: 71 YEAGLSTYELGVNQFTDLTNKEYNDQMNRLK--VKHD----VQSEHVFDNEDVSD----L 120 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P++VDW V IKDQ +CGSC Sbjct: 121 PDEVDWTLKNVVAPIKDQKQCGSC 144 >UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [Contains: Cathepsin H mini chain; Cathepsin H heavy chain; Cathepsin H light chain]; n=37; Eukaryota|Rep: Cathepsin H precursor (EC 3.4.22.16) [Contains: Cathepsin H mini chain; Cathepsin H heavy chain; Cathepsin H light chain] - Homo sapiens (Human) Length = 335 Score = 89.4 bits (212), Expect = 7e-17 Identities = 44/90 (48%), Positives = 60/90 (66%), Gaps = 2/90 (2%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FSTTGALE +G ++SL+EQ L+DC++ + N+GC GGL AF+YI N GI E Sbjct: 143 TFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGE 202 Query: 437 QTYPYEGVDDKCRYNP-KNTG-AEDVASWT 520 TYPY+G D C++ P K G +DVA+ T Sbjct: 203 DTYPYQGKDGYCKFQPGKAIGFVKDVANIT 
232 Score = 37.9 bits (84), Expect = 0.22 Identities = 16/42 (38%), Positives = 25/42 (59%) Frame = +1 Query: 532 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSST 657 DE+ ++EAVA PVS A + + F +Y +G+Y+ C T Sbjct: 235 DEEAMVEAVALYNPVSFAFEVTQ-DFMMYRTGIYSSTSCHKT 275 Score = 36.3 bits (80), Expect = 0.66 Identities = 15/25 (60%), Positives = 18/25 (72%), Gaps = 1/25 (4%) Frame = +3 Query: 183 PEQVDWRKHGA-VTDIKDQGKCGSC 254 P VDWRK G V+ +K+QG CGSC Sbjct: 117 PPSVDWRKKGNFVSPVKNQGACGSC 141 >UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4; core eudicotyledons|Rep: Papain-like cysteine peptidase XBCP3 - Arabidopsis thaliana (Mouse-ear cress) Length = 437 Score = 89.0 bits (211), Expect = 9e-17 Identities = 42/73 (57%), Positives = 53/73 (72%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 SFS TGA+EG + +G L+SLSEQ LIDC + Y N GCNGGLMD AF+++ N GIDTE Sbjct: 144 SFSATGAMEGINQIVTGDLISLSEQELIDCDKSY-NAGCNGGLMDYAFEFVIKNHGIDTE 202 Query: 437 QTYPYEGVDDKCR 475 + YPY+ D C+ Sbjct: 203 KDYPYQERDGTCK 215 Score = 67.7 bits (158), Expect = 2e-10 Identities = 33/78 (42%), Positives = 49/78 (62%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 200 +Y L +N + D+ HHEF + G + +A + + KG S+ G+ VK+P+ VDW Sbjct: 73 TYSLSLNAFADLTHHEFKASRLGLSVSAP-SVIMASKGQSLGGS-------VKVPDSVDW 124 Query: 201 RKHGAVTDIKDQGKCGSC 254 RK GAVT++KDQG CG+C Sbjct: 125 RKKGAVTNVKDQGSCGAC 142 Score = 43.2 bits (97), Expect = 0.006 Identities = 23/50 (46%), Positives = 31/50 (62%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTD 660 + + DE+ LMEAVA PVSV I S +FQLYSSG+++ +S D Sbjct: 229 YAGVKSNDEKALMEAVAAQ-PVSVGICGSERAFQLYSSGIFSGPCSTSLD 277 >UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophyta|Rep: Cysteine protease Cp5 - Actinidia deliciosa (Kiwi) Length = 509 Score = 88.6 bits (210), Expect = 1e-16 Identities = 41/84 (48%), Positives = 54/84 (64%) Frame = +2 Query: 257 
SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS+TGA+EG + +G L+SLSEQ L+DC N+GC GG MD AF+++ NGGIDTE Sbjct: 173 AFSSTGAIEGINALANGDLISLSEQELVDCDST--NDGCEGGYMDYAFEWVMSNGGIDTE 230 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDV 508 YPY G D C + T A + Sbjct: 231 TDYPYTGEDGTCNTTKEETKAVSI 254 Score = 54.4 bits (125), Expect = 2e-06 Identities = 29/77 (37%), Positives = 41/77 (53%), Gaps = 2/77 (2%) Frame = +3 Query: 30 LGMNKYGDMLHHEF--VKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 203 +G+NK+ DM + EF V T+K + G AK ++ + P +DWR Sbjct: 97 VGLNKFADMSNEEFREVYVSKVKKPTSKRMAIERRRQGKAAAAKAVAACDG--PTSLDWR 154 Query: 204 KHGAVTDIKDQGKCGSC 254 K+G VT +KDQG CGSC Sbjct: 155 KYGIVTGVKDQGDCGSC 171 Score = 34.7 bits (76), Expect = 2.0 Identities = 20/48 (41%), Positives = 28/48 (58%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECS 651 G+ D+ E +E L AV P+SV ID FQLY+ G+Y + +CS Sbjct: 256 GYEDVAE-EESALFCAVLKQ-PISVGIDGGAIDFQLYTGGIY-DGDCS 300 >UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus|Rep: Cathepsin L - Aphrocallistes vastus Length = 329 Score = 88.6 bits (210), Expect = 1e-16 Identities = 46/94 (48%), Positives = 59/94 (62%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 SFS TG+LEGQ+ +SG LVS SEQ L+DCS GN+GC GGLMD AFKY + N + E Sbjct: 141 SFSATGSLEGQYAIKSGKLVSFSEQELVDCSTSLGNHGCQGGLMDYAFKYWETNLA-EKE 199 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVASWTSPRATN 538 Y Y + KC+YN + G +S+T + N Sbjct: 200 SDYTYTAKNGKCKYNAQ-LGVTKDSSFTDIPSEN 232 Score = 62.5 bits (145), Expect = 9e-09 Identities = 29/51 (56%), Positives = 35/51 (68%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 F DIP + L EAVA GP++VA+DASHTSFQ+Y SG+Y CS T L Sbjct: 225 FTDIPSENCDALKEAVANKGPIAVAMDASHTSFQMYHSGIYTPFLCSKTKL 275 Score = 53.6 bits (123), Expect = 4e-06 Identities = 28/78 (35%), Positives = 44/78 (56%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 200 SYKL N++ D+ + E+ + G++ A+ ++ + G V K 
+ LP VDW Sbjct: 68 SYKLAANQFADLTNLEYRQIYLGYDNEARLSRK---REGKVFQRKM---KDEDLPTTVDW 121 Query: 201 RKHGAVTDIKDQGKCGSC 254 R G VT +K+QG+CGSC Sbjct: 122 RSKGVVTPVKNQGQCGSC 139 >UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Oryza sativa|Rep: Cysteine protease 1 precursor - Oryza sativa subsp. japonica (Rice) Length = 490 Score = 88.6 bits (210), Expect = 1e-16 Identities = 38/72 (52%), Positives = 52/72 (72%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS A+EG + +G LVSLSEQ L++C+ N+GCNGG+MD+AF +I NGG+DTE Sbjct: 182 AFSAVAAVEGINKIVTGELVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTE 241 Query: 437 QTYPYEGVDDKC 472 + YPY +D KC Sbjct: 242 EDYPYTAMDGKC 253 Score = 50.4 bits (115), Expect = 4e-05 Identities = 26/42 (61%), Positives = 29/42 (69%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVY 633 GF D+PE DE L +AVA PVSVAIDA FQLY SGV+ Sbjct: 267 GFEDVPENDELSLQKAVAHQ-PVSVAIDAGGREFQLYDSGVF 307 Score = 46.8 bits (106), Expect = 5e-04 Identities = 27/78 (34%), Positives = 39/78 (50%), Gaps = 1/78 (1%) Frame = +3 Query: 24 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 203 ++LGMN++ D+ + EF T G + G G + LP+ VDWR Sbjct: 112 FRLGMNRFADLTNGEFRATYLGTTPAGR---------GRRVGEAYRHDGVEALPDSVDWR 162 Query: 204 KHGAVT-DIKDQGKCGSC 254 GAV +K+QG+CGSC Sbjct: 163 DKGAVVAPVKNQGQCGSC 180 >UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain; n=9; Cucujiformia|Rep: Digestive cysteine proteinase intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 326 Score = 87.8 bits (208), Expect = 2e-16 Identities = 40/80 (50%), Positives = 56/80 (70%), Gaps = 1/80 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGC-NGGLMDNAFKYIKDNGGIDT 433 +FS TGALEGQ+ + + LSEQ L+DCS+ YGN+ C +GGLM AF Y+ D GI+ Sbjct: 136 AFSATGALEGQNAIVNNVKIPLSEQQLLDCSKPYGNDDCEHGGLMSFAFDYVLDK-GIEA 194 Query: 434 EQTYPYEGVDDKCRYNPKNT 493 + +YPY+G+D C+Y+ K T Sbjct: 195 DSSYPYKGIDTPCQYDAKKT 214 Score = 54.4 bits (125), Expect = 
2e-06 Identities = 30/84 (35%), Positives = 43/84 (51%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 Y+ G SY LG+ + D+ H EF + KT K N V + P +++ Sbjct: 61 YDKGEESYFLGVTPFADLTHDEFKDELRRQIKT-KPN---------VEATLAVFPEGLEV 110 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P+ +DW + GAV D+K QG CGSC Sbjct: 111 PDSIDWTQKGAVLDVKYQGGCGSC 134 Score = 38.3 bits (85), Expect = 0.16 Identities = 21/49 (42%), Positives = 31/49 (63%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECS 651 +G+ ++ +E+ L +AV TVGPVSVAIDA QLY G+ + C+ Sbjct: 219 KGYKNVSNSEEE-LKKAVGTVGPVSVAIDAD--PIQLYFGGILDGLFCT 264 >UniRef50_Q239L8 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 323 Score = 87.8 bits (208), Expect = 2e-16 Identities = 43/73 (58%), Positives = 51/73 (69%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 SFSTTGA+EG F + L SLSEQ L+DCS+ GN GCNGGLMD AF +I + GI TE Sbjct: 149 SFSTTGAVEGALFLSTKKLTSLSEQYLVDCSKD-GNEGCNGGLMDTAFDFISQH-GIPTE 206 Query: 437 QTYPYEGVDDKCR 475 YPY+ VD C+ Sbjct: 207 AAYPYKAVDGTCK 219 Score = 40.7 bits (91), Expect = 0.031 Identities = 14/22 (63%), Positives = 18/22 (81%) Frame = +3 Query: 189 QVDWRKHGAVTDIKDQGKCGSC 254 ++DW GAVT +KDQG+CGSC Sbjct: 126 EIDWTTKGAVTPVKDQGQCGSC 147 >UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin L-like protease; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to cathepsin L-like protease - Nasonia vitripennis Length = 353 Score = 87.0 bits (206), Expect = 4e-16 Identities = 37/84 (44%), Positives = 55/84 (65%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS GALE Q+F+++G L +LS QNLIDC+ +YGN GC GG +F+++ D G++ E Sbjct: 159 AFSAAGALEAQYFKKTGVLTALSAQNLIDCTMEYGNLGCGGGSAALSFQFVVDQKGLEPE 218 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDV 508 Y YEG +C YN + E++ Sbjct: 219 
ANYSYEGRTKECPYNTSDDEDEEL 242 Score = 71.3 bits (167), Expect = 2e-11 Identities = 35/86 (40%), Positives = 54/86 (62%), Gaps = 2/86 (2%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK- 179 +++GL +YK+ +N++GDM+ E+ M+ N T K + RG +FI P + + Sbjct: 78 HDLGLFTYKVRINQFGDMMFEEYKNYMHAANNTITQLKRI------PRGDEFIKPKSAEN 131 Query: 180 LPEQVDWRKHGAVTDIKDQG-KCGSC 254 +PE VDWR+ GAVT ++DQG CGSC Sbjct: 132 VPEHVDWRQRGAVTPVRDQGLTCGSC 157 Score = 61.3 bits (142), Expect = 2e-08 Identities = 28/51 (54%), Positives = 34/51 (66%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 F+ + GDE L AVATVGP S AID SH +F+ YS GVY + EC+ DL Sbjct: 246 FIYVNGGDEATLKVAVATVGPFSAAIDGSHDTFRFYSEGVYYQPECNEDDL 296 >UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF6860, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 251 Score = 86.6 bits (205), Expect = 5e-16 Identities = 37/65 (56%), Positives = 52/65 (80%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FSTTGA+EGQ ++++G LVSLSEQNL+DCS+ YG GC+G M NA+ Y+ +N G+++ Sbjct: 8 AFSTTGAIEGQIYKKTGQLVSLSEQNLVDCSKSYGTYGCSGAWMANAYDYVVNN-GLEST 66 Query: 437 QTYPY 451 TYPY Sbjct: 67 GTYPY 71 Score = 72.1 bits (169), Expect = 1e-11 Identities = 31/53 (58%), Positives = 41/53 (77%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 R + IP+GDEQ L +AVAT+GP++VAIDASH+SF YSSG+Y E C+ +L Sbjct: 117 RDYRFIPKGDEQALADAVATIGPITVAIDASHSSFLFYSSGIYEESNCNPNNL 169 >UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays (Maize) Length = 493 Score = 86.6 bits (205), Expect = 5e-16 Identities = 42/87 (48%), Positives = 57/87 (65%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS A+EG + +G L+SLSEQ LIDC +++ + GC+GGLMDNAF ++ NGGIDTE Sbjct: 190 
AFSAVAAVEGINKIVTGSLISLSEQELIDC-DKFQDQGCDGGLMDNAFVFMIKNGGIDTE 248 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVASW 517 YP+ G D C KNT + S+ Sbjct: 249 ADYPFTGHDGTCDLKLKNTRVVSIDSF 275 Score = 53.6 bits (123), Expect = 4e-06 Identities = 29/87 (33%), Positives = 47/87 (54%), Gaps = 4/87 (4%) Frame = +3 Query: 6 EMGLVSYKLGMNKYGDMLHHEFVKTM----NGFNKTAKHNKNLYMKGGSVRGAKFISPAN 173 + GL ++LG+ ++ D+ E+ + G N TA G V +++ A Sbjct: 111 DAGLHGFRLGLTRFADLTLEEYRARLLLGSRGRNGTAV---------GVVGRRRYLPLAG 161 Query: 174 VKLPEQVDWRKHGAVTDIKDQGKCGSC 254 +LP+ VDWR+ GAV ++KDQG+CG C Sbjct: 162 EQLPDAVDWRERGAVAEVKDQGQCGGC 188 Score = 38.7 bits (86), Expect = 0.12 Identities = 20/42 (47%), Positives = 29/42 (69%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYN 636 F +P E+ L +AVA PVS +I+AS +FQLYSSG+++ Sbjct: 275 FERVPINYERALQKAVAHQ-PVSASIEASRRAFQLYSSGIFD 315 >UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: Cysteine protease - Saprolegnia parasitica Length = 523 Score = 85.8 bits (203), Expect = 8e-16 Identities = 39/72 (54%), Positives = 50/72 (69%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FSTTGA+EG F S LVS+SEQ L+DC G+ GCNGGLMDNAFK++K + G+ E Sbjct: 142 AFSTTGAIEGAAFVSSKQLVSVSEQELVDCDHN-GDMGCNGGLMDNAFKWVKTHKGLCKE 200 Query: 437 QTYPYEGVDDKC 472 + YPY + C Sbjct: 201 EDYPYHAKEGTC 212 Score = 45.2 bits (102), Expect = 0.001 Identities = 23/43 (53%), Positives = 28/43 (65%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNE 639 F D+P DEQ L AVA PVSVAI+A FQ Y SGV+++ Sbjct: 226 FHDVPANDEQALKAAVAKQ-PVSVAIEADQPEFQFYKSGVFDK 267 Score = 44.8 bits (101), Expect = 0.002 Identities = 23/78 (29%), Positives = 39/78 (50%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 200 S+ +G N+Y + EF K G + + + + A ++ +V P ++DW Sbjct: 68 SFTMGHNEYSHLTFDEFKKLRTGLRVSPSY---IQSRAKYALMAPAVNMTDV--PNEMDW 122 Query: 201 RKHGAVTDIKDQGKCGSC 254 + G VT +K+QG CGSC Sbjct: 123 VEQGGVTPVKNQGMCGSC 140 >UniRef50_Q86GS8 Cluster: 
Cathepsin H; n=1; Sterkiella histriomuscorum|Rep: Cathepsin H - Oxytricha trifallax (Sterkiella histriomuscorum) Length = 366 Score = 85.8 bits (203), Expect = 8e-16 Identities = 35/72 (48%), Positives = 50/72 (69%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FST G +E + + G +LSEQ L+DC+ Y N+GC+GGL +AF+YIKDNGG+ E Sbjct: 161 TFSTVGCVESHYLLKYGAFRNLSEQQLVDCAGDYDNHGCSGGLPSHAFEYIKDNGGLALE 220 Query: 437 QTYPYEGVDDKC 472 TYPY+ + +C Sbjct: 221 TTYPYKAANGQC 232 Score = 53.2 bits (122), Expect = 5e-06 Identities = 29/81 (35%), Positives = 41/81 (50%) Frame = +3 Query: 12 GLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQ 191 G +YK G+N + DM EF + +N A+ N S K +N +P + Sbjct: 89 GTNTYKKGLNAFSDMTDEEF---FDYYNIKAEQNC-------SATNRKSFGNSNANIPTE 138 Query: 192 VDWRKHGAVTDIKDQGKCGSC 254 DWR G V+ +K+QGKCGSC Sbjct: 139 WDWRTFGVVSPVKNQGKCGSC 159 >UniRef50_Q24HI2 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 356 Score = 85.8 bits (203), Expect = 8e-16 Identities = 40/82 (48%), Positives = 54/82 (65%), Gaps = 1/82 (1%) Frame = +2 Query: 257 SFSTTGALEGQH-FRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDT 433 +FSTTGA+E + + SLSEQ LIDC+ + NNGC+GGL AF+YIK NGGI Sbjct: 153 TFSTTGAIESHYAIFEDVEPTSLSEQQLIDCAGAFNNNGCSGGLPSQAFEYIKYNGGISY 212 Query: 434 EQTYPYEGVDDKCRYNPKNTGA 499 E +Y Y D +C+++P+ GA Sbjct: 213 ENSYYYIAQDQECQFSPETVGA 234 Score = 53.2 bits (122), Expect = 5e-06 Identities = 25/50 (50%), Positives = 34/50 (68%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSST 657 G +I +GDE +L +AV TVGPVS+A F+LY SGVY+ +CSS+ Sbjct: 239 GSFNITQGDEDQLKQAVGTVGPVSIAFQVM-GDFKLYKSGVYSNPDCSSS 287 Score = 38.7 bits (86), Expect = 0.12 Identities = 14/34 (41%), Positives = 22/34 (64%) Frame = +3 Query: 153 KFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSC 254 K + NV++PE ++W+ V+ +KDQ CGSC Sbjct: 118 
KIQNKKNVQVPESINWKDLNKVSPVKDQQNCGSC 151 >UniRef50_Q22A69 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 330 Score = 85.4 bits (202), Expect = 1e-15 Identities = 43/88 (48%), Positives = 59/88 (67%), Gaps = 1/88 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQ-SGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDT 433 +FSTTG++EGQ+ Q L S SEQ L+DC + + GCNGGLMDNAF Y+ ++ ++T Sbjct: 138 AFSTTGSIEGQYVLQLKQNLTSFSEQQLVDCDTKE-DQGCNGGLMDNAFTYL-ESAKLET 195 Query: 434 EQTYPYEGVDDKCRYNPKNTGAEDVASW 517 E YPY VD C+YN ++ G VAS+ Sbjct: 196 ESAYPYTAVDGSCKYN-QSLGVVGVASF 222 Score = 43.2 bits (97), Expect = 0.006 Identities = 22/74 (29%), Positives = 36/74 (48%) Frame = +3 Query: 33 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHG 212 G+ ++ D+ H EF G+ ++++ S+ F +P +DW G Sbjct: 73 GITQFADLTHEEFADMYLGYKPQLRNSQAKV----SLSSTPFTAPT------AIDWTTKG 122 Query: 213 AVTDIKDQGKCGSC 254 AVT +K+QG CGSC Sbjct: 123 AVTPVKNQGSCGSC 136 >UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thaliana|Rep: F9P14.12 protein - Arabidopsis thaliana (Mouse-ear cress) Length = 343 Score = 85.0 bits (201), Expect = 1e-15 Identities = 38/72 (52%), Positives = 50/72 (69%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS A+EG + ++G LVSLSEQ LIDC N GC+GGLM+ AF++IK NGG+ TE Sbjct: 153 AFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKTNGGLATE 212 Query: 437 QTYPYEGVDDKC 472 YPY G++ C Sbjct: 213 TDYPYTGIEGTC 224 Score = 50.4 bits (115), Expect = 4e-05 Identities = 30/77 (38%), Positives = 41/77 (53%) Frame = +3 Query: 24 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 203 +KL N++ DM + EF G N ++ L+ K V PA +P+ VDWR Sbjct: 84 FKLTDNRFADMTNSEFKAHFLGLNTSSLR---LHKKQRPV-----CDPAG-NVPDAVDWR 134 Query: 204 KHGAVTDIKDQGKCGSC 254 GAVT I++QGKCG C Sbjct: 135 TQGAVTPIRNQGKCGGC 151 Score = 33.5 bits (73), Expect = 4.6 Identities = 
16/29 (55%), Positives = 19/29 (65%) Frame = +1 Query: 547 MEAVATVGPVSVAIDASHTSFQLYSSGVY 633 ++ A PVSV IDA FQLYSSGV+ Sbjct: 249 LQIAAAQQPVSVGIDAGGFIFQLYSSGVF 277 >UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Rep: Cathepsin L - Stylonychia lemnae Length = 340 Score = 85.0 bits (201), Expect = 1e-15 Identities = 40/83 (48%), Positives = 52/83 (62%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FST +LE ++F ++G L SLSEQ L+DCS+ GN GCNGG M A YI GG++TE Sbjct: 151 AFSTIASLESRYFIETGKLQSLSEQQLVDCSKN-GNEGCNGGDMGLAMDYIASAGGVETE 209 Query: 437 QTYPYEGVDDKCRYNPKNTGAED 505 + YPY G D C + A D Sbjct: 210 KDYPYVGKDQTCAFEASKEVATD 232 Score = 55.6 bits (128), Expect = 1e-06 Identities = 32/79 (40%), Positives = 42/79 (53%), Gaps = 1/79 (1%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK-LPEQVD 197 S+ LG N D H E+ K M G+ K K +Y S N+K +PE +D Sbjct: 84 SFTLGPNHLADYTHDEY-KKMLGYKPRNKTGKEVY------------STPNLKDIPESID 130 Query: 198 WRKHGAVTDIKDQGKCGSC 254 WR+ GAV +KDQG+CGSC Sbjct: 131 WREKGAVNAVKDQGQCGSC 149 Score = 40.3 bits (90), Expect = 0.040 Identities = 20/50 (40%), Positives = 29/50 (58%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSS 654 +G ++I G L A+A GPVSVAI+A FQ Y SG+++ C + Sbjct: 233 KGHINIVPGKFATLQAAIAE-GPVSVAIEADSLFFQFYRSGIFDSSWCGT 281 >UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F like protease; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to cathepsin F like protease - Nasonia vitripennis Length = 1036 Score = 84.6 bits (200), Expect = 2e-15 Identities = 39/86 (45%), Positives = 59/86 (68%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS TG +EGQ+ + G L+SLSEQ L+DC + ++GCNGGL D A++ I++ GG++ E Sbjct: 843 AFSVTGNIEGQYAIKHGELLSLSEQELVDCDKL--DSGCNGGLPDTAYRAIEELGGLELE 900 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVAS 514 YPY+ D+KC +N KN ++ S Sbjct: 901 SDYPYDAEDEKCHFN-KNKVKVNIVS 925 Score = 51.2 bits (117), Expect = 
2e-05 Identities = 28/83 (33%), Positives = 40/83 (48%) Frame = +3 Query: 6 EMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLP 185 EMG Y G+ ++ D+ EF G T K ++ M ++ +++LP Sbjct: 769 EMGTGRY--GVTQFTDLTKAEFKARHLGLKPTLKSENDIPMPMATI--------PDIELP 818 Query: 186 EQVDWRKHGAVTDIKDQGKCGSC 254 DWR H VT +KDQG CGSC Sbjct: 819 SDYDWRHHNVVTPVKDQGSCGSC 841 >UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to CG5367-PA - Nasonia vitripennis Length = 362 Score = 83.8 bits (198), Expect = 3e-15 Identities = 37/87 (42%), Positives = 55/87 (63%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 ++S G++ GQ FRQ+G +V LSEQ L+DCS Q GN GC+GG + N +Y++ + G+ T+ Sbjct: 177 AYSIAGSIAGQIFRQTGIVVPLSEQQLVDCSTQTGNLGCSGGSLRNTLRYLERSKGLMTD 236 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVASW 517 TYPY C++ K +V SW Sbjct: 237 ATYPYTAHQGVCKFQRK-LSVVNVTSW 262 Score = 53.2 bits (122), Expect = 5e-06 Identities = 22/45 (48%), Positives = 33/45 (73%) Frame = +1 Query: 520 IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSS 654 +P DE+ L AVAT+GP++ +I+A +FQLY SG+Y++ CSS Sbjct: 265 LPARDERALEAAVATIGPIAASINAGPRTFQLYHSGIYDDPTCSS 309 Score = 34.7 bits (76), Expect = 2.0 Identities = 12/26 (46%), Positives = 19/26 (73%) Frame = +3 Query: 177 KLPEQVDWRKHGAVTDIKDQGKCGSC 254 ++P+ +DWR+ G VT ++Q CGSC Sbjct: 150 RIPKSLDWREKGFVTKPENQRDCGSC 175 >UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryza sativa|Rep: Putative cysteine proteinase - Oryza sativa subsp. 
japonica (Rice) Length = 352 Score = 83.8 bits (198), Expect = 3e-15 Identities = 40/89 (44%), Positives = 59/89 (66%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FST A+EG H +G LVSLSEQ L+DC++ N GC GG +DNAF+Y+ ++GG+ TE Sbjct: 155 AFSTVAAVEGIHQITTGELVSLSEQQLLDCAD---NGGCTGGSLDNAFQYMANSGGVTTE 211 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVASWTS 523 Y Y+G C+++ ++ A VA+ S Sbjct: 212 AAYAYQGAQGACQFD-ASSSASGVAATIS 239 Score = 48.4 bits (110), Expect = 2e-04 Identities = 24/77 (31%), Positives = 38/77 (49%) Frame = +3 Query: 24 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 203 Y+L N++ D+ EF G+N +Y + +S + + P +VDWR Sbjct: 84 YRLATNRFTDLTDAEFAAMYTGYNPA----NTMY---AAANATTRLSSEDDQQPAEVDWR 136 Query: 204 KHGAVTDIKDQGKCGSC 254 + GAVT +K+Q CG C Sbjct: 137 QQGAVTGVKNQRSCGCC 153 Score = 38.3 bits (85), Expect = 0.16 Identities = 20/49 (40%), Positives = 28/49 (57%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSS 654 G+ + DE L AVA+ PVSVAI+ S F+ Y SGV+ + C + Sbjct: 240 GYQRVNPNDEGSLAAAVASQ-PVSVAIEGSGAMFRHYGSGVFTADSCGT 287 >UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel protein - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 328 Score = 83.4 bits (197), Expect = 4e-15 Identities = 41/87 (47%), Positives = 52/87 (59%), Gaps = 1/87 (1%) Frame = +2 Query: 224 HQGP-REVWLMRSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAF 400 +QGP W +FS G+LE Q R++ LV LS QNL+DCS GN GC GG + AF Sbjct: 130 NQGPCGSCW---AFSAVGSLEAQMKRRTAALVPLSAQNLLDCSVSLGNRGCKGGFLSRAF 186 Query: 401 KYIKDNGGIDTEQTYPYEGVDDKCRYN 481 Y+ N GID+ YPYE + CRY+ Sbjct: 187 LYVIQNRGIDSSTFYPYEHKEGVCRYS 213 Score = 54.8 bits (126), Expect = 2e-06 Identities = 25/49 (51%), Positives = 32/49 (65%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSS 654 GF +P +E L AVA +GPVSV I+A SF Y SG+YN+ +CSS Sbjct: 223 GFRIVPRHNEAALQSAVANIGPVSVGINAKLLSFHRYRSGIYNDPKCSS 271 Score = 52.8 bits (121), Expect = 7e-06 Identities = 31/82 (37%), 
Positives = 45/82 (54%) Frame = +3 Query: 9 MGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPE 188 +GL SY LG+N+ DM E V MNG + + N A F P+ LP+ Sbjct: 67 VGLHSYTLGLNQLSDMTADE-VNDMNGLLEEDFPDVN----------ATFSPPSLQTLPQ 115 Query: 189 QVDWRKHGAVTDIKDQGKCGSC 254 +V+W +HG V+ +++QG CGSC Sbjct: 116 RVNWTEHGMVSPVQNQGPCGSC 137 >UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa (Rice) Length = 339 Score = 83.4 bits (197), Expect = 4e-15 Identities = 41/96 (42%), Positives = 54/96 (56%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS A+EG +G L+SLSEQ L+DC + GC GGLMD+AFK+I NGG+ TE Sbjct: 149 AFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTE 208 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVASWTSPRATNRS 544 YPY D KC N + A + + A N + Sbjct: 209 SKYPYTAADGKC--NGGSNSAATIKGYEDVPANNEA 242 Score = 47.2 bits (107), Expect = 4e-04 Identities = 31/87 (35%), Positives = 42/87 (48%), Gaps = 3/87 (3%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK- 179 + G + L +N++ D+ ++EF + K NK +VR NV Sbjct: 71 FNAGNHKFWLSVNQFADLTNYEF--------RATKTNKGFIPS--TVRVPTTFRYENVSI 120 Query: 180 --LPEQVDWRKHGAVTDIKDQGKCGSC 254 LP VDWR GAVT IKDQG+CG C Sbjct: 121 DTLPATVDWRTKGAVTPIKDQGQCGCC 147 Score = 44.4 bits (100), Expect = 0.002 Identities = 21/42 (50%), Positives = 28/42 (66%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGV 630 +G+ D+P +E LM+AVA PVSVA+D +FQ YS GV Sbjct: 231 KGYEDVPANNEAALMKAVANQ-PVSVAVDGGDMTFQFYSGGV 271 >UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Actinidin Act3a - Actinidia eriantha Length = 380 Score = 83.4 bits (197), Expect = 4e-15 Identities = 38/87 (43%), Positives = 56/87 (64%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+T +E + +G L+SLSEQ L+DC+ N GC GG MD+A+++I +NGGI+TE Sbjct: 152 AFATIATVESINQIITGDLISLSEQELVDCNRTPINEGCKGGFMDDAYEFIINNGGINTE 211 Query: 437 
QTYPYEGVDDKCRYNPKNTGAEDVASW 517 + YPY G DD+C KN + S+ Sbjct: 212 ENYPYIGQDDQCDEPKKNQNYVTIDSY 238 Score = 52.8 bits (121), Expect = 7e-06 Identities = 31/80 (38%), Positives = 42/80 (52%), Gaps = 2/80 (2%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHN-KNLYM-KGGSVRGAKFISPANVKLPEQV 194 SY +G+N++ D+ E+ T GF + K N YM + G V LP+ V Sbjct: 83 SYTVGLNQFADLTDEEYRSTYLGFKSSLKSKVSNRYMPQVGEV------------LPDYV 130 Query: 195 DWRKHGAVTDIKDQGKCGSC 254 DWR GAV D+K+QG C SC Sbjct: 131 DWRTTGAVVDVKNQGLCSSC 150 Score = 39.9 bits (89), Expect = 0.053 Identities = 20/49 (40%), Positives = 27/49 (55%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSST 657 + +P DE + AVA PVSVAIDA F+ Y SG++ C +T Sbjct: 238 YEQVPPNDELAMKRAVA-YQPVSVAIDAYCLGFRFYQSGIFTGGSCGTT 285 >UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus tauri|Rep: Cysteine protease-1 - Ostreococcus tauri Length = 430 Score = 82.6 bits (195), Expect = 8e-15 Identities = 40/65 (61%), Positives = 50/65 (76%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FSTTGA+EG ++G LVSLSEQ ++ CS+Q N GCNGGLMD AF++I NGGID+E Sbjct: 227 AFSTTGAVEGITKIRTGRLVSLSEQEMVSCSKQ--NMGCNGGLMDYAFRWIVKNGGIDSE 284 Query: 437 QTYPY 451 YPY Sbjct: 285 FQYPY 289 Score = 58.0 bits (134), Expect = 2e-07 Identities = 27/49 (55%), Positives = 36/49 (73%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSS 654 GF D+P GDE++L +AV+ PVS+AI+A SFQLY GVY+ +EC S Sbjct: 310 GFKDVPPGDEKELEKAVSQQ-PVSIAIEADTKSFQLYDGGVYDSKECGS 357 Score = 41.5 bits (93), Expect = 0.018 Identities = 28/89 (31%), Positives = 45/89 (50%), Gaps = 5/89 (5%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGS----VRGAKFI-SP 167 Y +G VS+ +G+N E+ + + G+ + + + M + V K Sbjct: 138 YAIGEVSHWVGLNSLAATTREEY-RALLGYKPELRSSGDAEMLEATSTDKVEQYKASWEY 196 Query: 168 ANVKLPEQVDWRKHGAVTDIKDQGKCGSC 254 A+V PE +DW + GAVT K+QG+CGSC Sbjct: 197 ASVDPPEAIDWVELGAVTPPKNQGQCGSC 225 >UniRef50_Q10716 Cluster: Cysteine proteinase 1 
precursor; n=46; Eukaryota|Rep: Cysteine proteinase 1 precursor - Zea mays (Maize) Length = 371 Score = 82.6 bits (195), Expect = 8e-15 Identities = 37/82 (45%), Positives = 53/82 (64%), Gaps = 7/82 (8%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNN-------GCNGGLMDNAFKYIKD 415 SFS +GALEG H+ +G L LSEQ +DC + ++ GCNGGLM AF Y++ Sbjct: 163 SFSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLMTTAFSYLQK 222 Query: 416 NGGIDTEQTYPYEGVDDKCRYN 481 GG+++E+ YPY G D KC+++ Sbjct: 223 AGGLESEKDYPYTGSDGKCKFD 244 Score = 51.2 bits (117), Expect = 2e-05 Identities = 29/74 (39%), Positives = 40/74 (54%) Frame = +3 Query: 33 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHG 212 G+ K+ D+ EF +T G K+ + L G S A + P + LP+ DWR HG Sbjct: 92 GVTKFSDLTPAEFRRTYLGLRKSRR--ALLRELGESAHEAPVL-PTD-GLPDDFDWRDHG 147 Query: 213 AVTDIKDQGKCGSC 254 AV +K+QG CGSC Sbjct: 148 AVGPVKNQGSCGSC 161 >UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 21 SCAF14577, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 406 Score = 81.8 bits (193), Expect = 1e-14 Identities = 35/66 (53%), Positives = 48/66 (72%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS+ GALEGQ +++G+LV LS QNL+DCS GN GC GG + ++ YI NGG+D++ Sbjct: 181 AFSSLGALEGQMKKRTGFLVPLSPQNLLDCSISDGNLGCRGGYISKSYSYIIRNGGVDSD 240 Query: 437 QTYPYE 454 YPYE Sbjct: 241 SFYPYE 246 Score = 35.9 bits (79), Expect = 0.87 Identities = 13/24 (54%), Positives = 17/24 (70%) Frame = +3 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P VDWRK G V+ +++QG C SC Sbjct: 156 PPSVDWRKAGLVSPVQNQGFCNSC 179 >UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 339 Score = 81.8 bits (193), Expect = 1e-14 Identities = 41/82 (50%), Positives = 50/82 (60%), Gaps = 7/82 (8%) Frame = +2 Query: 257 
SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 SFS G +E HF ++ L++LSEQN+IDC+ GNNGC GGL AF YI GID+E Sbjct: 141 SFSAIGVIESSHFIKNKELITLSEQNIIDCTTDMGNNGCMGGLALIAFDYIIKQKGIDSE 200 Query: 437 QTYPYEGV-------DDKCRYN 481 YPYEG +CRYN Sbjct: 201 FNYPYEGYLIEPYEGRGRCRYN 222 Score = 48.4 bits (110), Expect = 2e-04 Identities = 24/51 (47%), Positives = 33/51 (64%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 +++I +E +L +++ PVSV IDAS SF LY SGVY + CSST L Sbjct: 233 YIEIERFNENELTQSLIK-SPVSVMIDASQLSFMLYKSGVYKDPSCSSTIL 282 Score = 32.7 bits (71), Expect = 8.1 Identities = 21/78 (26%), Positives = 36/78 (46%) Frame = +3 Query: 30 LGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKH 209 L +N + D+ +E++ N + + N+ K G + N + + +DWR Sbjct: 69 LELNFFADLSRNEYI---NNYLASFIDISNIEQKNTKYEG-NLKNNFNNSI-KSIDWRNF 123 Query: 210 GAVTDIKDQGKCGSCGPS 263 AVT +K+QG C G S Sbjct: 124 DAVTPVKNQGLCSGAGYS 141 >UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; n=23; Magnoliophyta|Rep: Senescence-specific cysteine protease - Arabidopsis thaliana (Mouse-ear cress) Length = 346 Score = 81.4 bits (192), Expect = 2e-14 Identities = 44/90 (48%), Positives = 55/90 (61%), Gaps = 6/90 (6%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS A+EG + G L+SLSEQ L+DC + GC GGLMD AF++IK GG+ TE Sbjct: 156 AFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN--DFGCEGGLMDTAFEHIKATGGLTTE 213 Query: 437 QTYPYEGVDDKC---RYNPKN---TGAEDV 508 YPY+G D C + NPK TG EDV Sbjct: 214 SNYPYKGEDATCNSKKTNPKATSITGYEDV 243 Score = 56.8 bits (131), Expect = 4e-07 Identities = 30/78 (38%), Positives = 40/78 (51%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 200 ++KL +N++ D+ + EF GF + + K R S A LP VDW Sbjct: 80 TFKLAVNQFADLTNDEFRSMYTGFKGVSALSSQSQTKMSPFRYQNVSSGA---LPVSVDW 136 Query: 201 RKHGAVTDIKDQGKCGSC 254 RK GAVT IK+QG CG C Sbjct: 137 RKKGAVTPIKNQGSCGCC 154 Score = 46.8 bits (106), Expect = 5e-04 Identities = 24/45 (53%), 
Positives = 29/45 (64%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEE 642 G+ D+P DEQ LM+AVA PVSV I+ FQ YSSGV+ E Sbjct: 239 GYEDVPVNDEQALMKAVAHQ-PVSVGIEGGGFDFQFYSSGVFTGE 282 >UniRef50_Q22W19 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 81.4 bits (192), Expect = 2e-14 Identities = 38/75 (50%), Positives = 51/75 (68%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FST G LEG + +G L S SEQ ++DCS+ N GCNGG + A+KY+ N GI+TE Sbjct: 149 AFSTVGGLEGAYAIATGNLTSFSEQQIVDCSK--ANAGCNGGDLPPAYKYVVQN-GIETE 205 Query: 437 QTYPYEGVDDKCRYN 481 YPY+GV+ KC Y+ Sbjct: 206 ADYPYKGVNQKCAYD 220 Score = 42.3 bits (95), Expect = 0.010 Identities = 22/60 (36%), Positives = 30/60 (50%), Gaps = 1/60 (1%) Frame = +3 Query: 78 TMNGFNKTAKHNKNLYMKGGSVRG-AKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSC 254 T+N F K N KG R + I + +DWR+ AVT +K+QG+CGSC Sbjct: 88 TLNAFAIYTKDEFNQLFKGYQKRQKSHLIYSLKGDVAPSIDWRQKNAVTPVKNQGQCGSC 147 >UniRef50_Q23EG5 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 429 Score = 81.0 bits (191), Expect = 2e-14 Identities = 36/88 (40%), Positives = 58/88 (65%), Gaps = 1/88 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYL-VSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDT 433 +FS TGA+E ++G +LS+Q L+DC+ ++ N GC+GGL AF+YI GGI++ Sbjct: 152 TFSATGAIESHLALKTGKAPFNLSQQQLVDCAGKFDNQGCDGGLPSRAFEYIAYAGGIES 211 Query: 434 EQTYPYEGVDDKCRYNPKNTGAEDVASW 517 + YPY+G D KC++ P+ A+ +S+ Sbjct: 212 SRDYPYKGKDGKCKFKPQKVVAKVQSSF 239 Score = 35.1 bits (77), Expect = 1.5 Identities = 16/41 (39%), Positives = 25/41 (60%) Frame = +1 Query: 532 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSS 654 DE +L+ +A GPVS+A + F+ Y G+Y+ ECS+ Sbjct: 245 DENELIYHLAKNGPVSIAYQVTD-DFENYEGGIYSNPECST 284 
Score = 34.3 bits (75), Expect = 2.7 Identities = 14/30 (46%), Positives = 20/30 (66%), Gaps = 4/30 (13%) Frame = +3 Query: 177 KLPEQVDWRKHGAVTDIKDQ----GKCGSC 254 ++P+ VDWR+ G V+ +KDQ CGSC Sbjct: 121 EIPDYVDWREKGIVSSVKDQDAVGDDCGSC 150 >UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; n=2; Danio rerio|Rep: hypothetical protein LOC550326 - Danio rerio Length = 531 Score = 80.6 bits (190), Expect = 3e-14 Identities = 38/83 (45%), Positives = 56/83 (67%), Gaps = 1/83 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 SF+TTG LEG F ++G L SLS+Q L+DC+ +GNNGC+GG AF++I +GGI T Sbjct: 338 SFATTGTLEGALFLKTGQLTSLSQQMLVDCTWGFGNNGCDGGEEWRAFEWIMKHGGISTA 397 Query: 437 QTY-PYEGVDDKCRYNPKNTGAE 502 ++Y Y G++ C Y+ + A+ Sbjct: 398 ESYGAYMGMNGLCHYDKTSMVAQ 420 Score = 53.6 bits (123), Expect = 4e-06 Identities = 23/49 (46%), Positives = 32/49 (65%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSS 654 G+ ++ GD L A+ GPV+V+IDA+H SF YS+GVY E EC + Sbjct: 423 GYTNVTSGDILALKAAIFKFGPVAVSIDAAHRSFAFYSNGVYYEPECKN 471 Score = 44.8 bits (101), Expect = 0.002 Identities = 24/79 (30%), Positives = 36/79 (45%) Frame = +3 Query: 18 VSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVD 197 ++Y +G+N + D E + G K + +R ++ P VD Sbjct: 268 LTYSVGINHFADKTKEELARMTGGL--LPKKEEKAQPFPSEIR--------SIATPNSVD 317 Query: 198 WRKHGAVTDIKDQGKCGSC 254 WR +GAVT +KDQ CGSC Sbjct: 318 WRLYGAVTPVKDQAVCGSC 336 >UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia theta|Rep: Cathepsin H precursor - Guillardia theta (Cryptomonas phi) Length = 353 Score = 80.6 bits (190), Expect = 3e-14 Identities = 36/72 (50%), Positives = 46/72 (63%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FST ALE H ++G +V LSEQ L+DC+ + NNGCNGGL AF+YI NGG+ Sbjct: 149 TFSTAAALESLHAIKTGEMVLLSEQQLVDCAADFKNNGCNGGLPSQAFEYIMYNGGLSKM 208 Query: 437 QTYPYEGVDDKC 472 + YPY D C Sbjct: 209 EEYPYVCGDGHC 220 >UniRef50_Q86GZ5 Cluster: 
Midgut cysteine proteinase 2; n=1; Rhipicephalus appendiculatus|Rep: Midgut cysteine proteinase 2 - Rhipicephalus appendiculatus (Brown ear tick) Length = 564 Score = 80.6 bits (190), Expect = 3e-14 Identities = 42/95 (44%), Positives = 55/95 (57%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 SF T G LEG +FR++G LV LSEQ L+DCS GNNGC+GG A++YI D+G E Sbjct: 371 SFGTVGELEGAYFRKTGRLVRLSEQQLVDCSWNNGNNGCDGGEDFRAYEYIADHGLASDE 430 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVASWTSPRATNR 541 Y G D C + N+ + S+ + TNR Sbjct: 431 DYGAYIGQDGVCHDSKVNSTISSIKSYVN--ITNR 463 Score = 51.2 bits (117), Expect = 2e-05 Identities = 24/46 (52%), Positives = 28/46 (60%), Gaps = 1/46 (2%) Frame = +3 Query: 120 LYMKGGSVRGAKFISPA-NVKLPEQVDWRKHGAVTDIKDQGKCGSC 254 L K GS R F KLP+Q+DWR +GAVT +KDQ CGSC Sbjct: 324 LQSKDGSSRAEPFPRHRFTAKLPDQIDWRPYGAVTPVKDQAVCGSC 369 Score = 33.9 bits (74), Expect = 3.5 Identities = 18/38 (47%), Positives = 25/38 (65%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLY 618 + +V+I D+ L A+A VGPVSV+IDA+ SF Y Sbjct: 455 KSYVNITNRDD--LPTALANVGPVSVSIDAALRSFSFY 490 >UniRef50_Q22NW9 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 894 Score = 80.6 bits (190), Expect = 3e-14 Identities = 40/79 (50%), Positives = 51/79 (64%), Gaps = 1/79 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FSTTGALEG H SEQ +IDCS + GN+GC+GG M+NAF ++ +N GI E Sbjct: 709 AFSTTGALEGIHKISGKDWKGFSEQQIIDCSRKQGNSGCHGGFMENAFDFVIEN-GILQE 767 Query: 437 QTYPYEG-VDDKCRYNPKN 490 YPYEG + KC+ N N Sbjct: 768 NDYPYEGHANFKCKKNNSN 786 Score = 36.7 bits (81), Expect = 0.50 Identities = 13/25 (52%), Positives = 18/25 (72%) Frame = +3 Query: 177 KLPEQVDWRKHGAVTDIKDQGKCGS 251 ++P +DWR AVT +K+QG CGS Sbjct: 682 EVPSSIDWRDLNAVTPVKNQGSCGS 706 >UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; 
n=15; Magnoliophyta|Rep: Cysteine proteinase RD19a precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 368 Score = 80.6 bits (190), Expect = 3e-14 Identities = 40/78 (51%), Positives = 51/78 (65%), Gaps = 7/78 (8%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYG-------NNGCNGGLMDNAFKYIKD 415 SFS TGALEG +F +G LVSLSEQ L+DC + ++GCNGGLM++AF+Y Sbjct: 161 SFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLK 220 Query: 416 NGGIDTEQTYPYEGVDDK 469 GG+ E+ YPY G D K Sbjct: 221 TGGLMKEEDYPYTGKDGK 238 Score = 48.8 bits (111), Expect = 1e-04 Identities = 28/74 (37%), Positives = 36/74 (48%) Frame = +3 Query: 33 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHG 212 G+ ++ D+ EF K G K K+ A + N LPE DWR HG Sbjct: 95 GVTQFSDLTRSEFRKKHLGVRSGFKLPKD-------ANKAPILPTEN--LPEDFDWRDHG 145 Query: 213 AVTDIKDQGKCGSC 254 AVT +K+QG CGSC Sbjct: 146 AVTPVKNQGSCGSC 159 >UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schistosoma|Rep: Preprocathepsin cathepsin L - Schistosoma japonicum (Blood fluke) Length = 331 Score = 80.2 bits (189), Expect = 4e-14 Identities = 41/84 (48%), Positives = 52/84 (61%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS TGA+EGQ R+ LV LSEQ L+DC YGN+GC GG MD AF Y++ + I++E Sbjct: 142 AFSATGAIEGQLRRKHKKLVKLSEQQLVDCRYNYGNDGCEGGTMDLAFNYLEKH-YIESE 200 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDV 508 Y Y G D C Y K+ G V Sbjct: 201 NDYKYLGHDANCHYR-KSKGVVKV 223 Score = 52.0 bits (119), Expect = 1e-05 Identities = 31/84 (36%), Positives = 44/84 (52%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 +++GL Y +G+N++ DM E + M F K N L+ G+ + N + Sbjct: 65 HDLGLEGYTMGLNQFCDMEWEEVNRIM--FPKVFG-NSPLWNDDGNE-----LELTNKPV 116 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P DWR HGAVT +K QG CGSC Sbjct: 117 PSTWDWRDHGAVTAVKHQGLCGSC 140 Score = 43.6 bits (98), Expect = 0.004 Identities = 22/51 (43%), Positives = 30/51 (58%) Frame = +1 Query: 511 
FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 F D+P DE+ L +AV GP+SV I A S LY SG+Y ++C D+ Sbjct: 226 FGDLPARDEKTLEKAVYQYGPISVGIVAL-DSLILYKSGIYESKDCKYADI 275 >UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|Rep: CG4847-PD, isoform D - Drosophila melanogaster (Fruit fly) Length = 420 Score = 80.2 bits (189), Expect = 4e-14 Identities = 38/84 (45%), Positives = 54/84 (64%), Gaps = 3/84 (3%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCS--EQYGNNGCNGGLMDNAFKYIKD-NGGI 427 +F+TTGA+EG FR++G L +LSEQNL+DC E +G NGC+GG + AF +I + G+ Sbjct: 229 AFATTGAIEGHTFRKTGSLPNLSEQNLVDCGPVEDFGLNGCDGGFQEAAFCFIDEVQKGV 288 Query: 428 DTEQTYPYEGVDDKCRYNPKNTGA 499 E YPY C+Y+ +GA Sbjct: 289 SQEGAYPYIDNKGTCKYDGSKSGA 312 Score = 58.4 bits (135), Expect = 1e-07 Identities = 25/84 (29%), Positives = 44/84 (52%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 + G+ ++K +N + D+ H EF+ + G ++ + K + K ++ + Sbjct: 150 FAQGVHTFKQAVNAFADLTHSEFLSQLTGLKRSPE------AKARAAASLKLVNLPAKPI 203 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P+ DWR+HG VT +K QG CGSC Sbjct: 204 PDAFDWREHGGVTPVKFQGTCGSC 227 Score = 49.2 bits (112), Expect = 9e-05 Identities = 20/49 (40%), Positives = 35/49 (71%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECS 651 +GF IP DE++L + VAT+GPV+ +++ T + Y+ G+YN++EC+ Sbjct: 315 QGFAAIPPKDEEQLKKVVATLGPVACSVNGLET-LKNYAGGIYNDDECN 362 >UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae|Rep: Cysteine proteinase - Hypera postica (alfalfa weevil) Length = 324 Score = 79.8 bits (188), Expect = 5e-14 Identities = 40/89 (44%), Positives = 59/89 (66%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS TG+ EG + R+SG LVSLSEQ LIDC + GC+GG +D+ FKY+ + G+ +E Sbjct: 138 AFSITGSTEGAYARKSGKLVSLSEQQLIDCCTD-TSAGCDGGSLDDNFKYVMKD-GLQSE 195 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVASWTS 523 ++Y Y+G D C+YN + V+ +TS Sbjct: 196 ESYTYKGEDGACKYNVASV-VTKVSKYTS 223 Score = 65.7 bits (153), Expect = 
9e-10 Identities = 39/86 (45%), Positives = 47/86 (54%), Gaps = 2/86 (2%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNL--YMKGGSVRGAKFISPANV 176 YE G VSYK G+NK+ DM EF KTM + + K Y+K G V Sbjct: 64 YEQGKVSYKKGINKFTDMSQEEF-KTMLTLSASRKPTLETTSYVKTG------------V 110 Query: 177 KLPEQVDWRKHGAVTDIKDQGKCGSC 254 ++P VDWRK G VT +KDQG CGSC Sbjct: 111 EIPSSVDWRKEGRVTGVKDQGDCGSC 136 Score = 56.8 bits (131), Expect = 4e-07 Identities = 27/51 (52%), Positives = 35/51 (68%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 + IP DE L+EAVATVGPVSV +DAS+ S Y SG+Y +++CS L Sbjct: 221 YTSIPAEDEDALLEAVATVGPVSVGMDASYLS--SYDSGIYEDQDCSPAGL 269 >UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase) (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1; Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor (EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase) (Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1; Vignain-2] - Vigna mungo (Rice bean) (Black gram) Length = 362 Score = 79.8 bits (188), Expect = 5e-14 Identities = 37/72 (51%), Positives = 50/72 (69%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FST A+EG + ++ LVSLSEQ L+DC ++ N GCNGGLM++AF++IK GGI TE Sbjct: 154 AFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEE-NQGCNGGLMESAFEFIKQKGGITTE 212 Query: 437 QTYPYEGVDDKC 472 YPY + C Sbjct: 213 SNYPYTAQEGTC 224 Score = 72.1 bits (169), Expect = 1e-11 Identities = 35/77 (45%), Positives = 45/77 (58%) Frame = +3 Query: 24 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 203 YKL +NK+ DM +HEF T G +K N + +G F+ +P VDWR Sbjct: 80 YKLKLNKFADMTNHEFRSTYAG----SKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWR 135 Query: 204 KHGAVTDIKDQGKCGSC 254 K GAVTD+KDQG+CGSC Sbjct: 136 KKGAVTDVKDQGQCGSC 152 Score = 45.2 bits (102), Expect = 0.001 Identities = 26/52 (50%), Positives = 34/52 (65%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 G ++P DE L++AVA PVSVAIDA + 
FQ YS GV+ +C +TDL Sbjct: 238 GHENVPVNDENALLKAVANQ-PVSVAIDAGGSDFQFYSEGVFT-GDC-NTDL 286 >UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster|Rep: CG5367-PA - Drosophila melanogaster (Fruit fly) Length = 338 Score = 79.4 bits (187), Expect = 7e-14 Identities = 34/87 (39%), Positives = 54/87 (62%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS ++ GQ F+++G ++SLS+Q ++DCS +GN GC GG + N Y++ GGI + Sbjct: 153 AFSIAESIMGQVFKRTGKILSLSKQQIVDCSVSHGNQGCVGGSLRNTLSYLQSTGGIMRD 212 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVASW 517 Q YPY KC++ P + +V SW Sbjct: 213 QDYPYVARKGKCQFVP-DLSVVNVTSW 238 Score = 53.2 bits (122), Expect = 5e-06 Identities = 22/48 (45%), Positives = 34/48 (70%) Frame = +1 Query: 520 IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 +P DEQ + AV +GPV+++I+AS +FQLYS G+Y++ CSS + Sbjct: 241 LPVRDEQAIQAAVTHIGPVAISINASPKTFQLYSDGIYDDPLCSSASV 288 Score = 43.2 bits (97), Expect = 0.006 Identities = 27/85 (31%), Positives = 43/85 (50%), Gaps = 1/85 (1%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI-SPANVK 179 Y+ G S++L N + DM ++K GF + K N ++ + A+ + SP Sbjct: 74 YKEGQTSFRLKPNIFADMSTDGYLK---GFLRLLKSN----IEDSADNMAEIVGSPLMAN 126 Query: 180 LPEQVDWRKHGAVTDIKDQGKCGSC 254 +PE +DWR G +T +Q CGSC Sbjct: 127 VPESLDWRSKGFITPPYNQLSCGSC 151 >UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio molitor (Yellow mealworm) Length = 336 Score = 79.4 bits (187), Expect = 7e-14 Identities = 39/83 (46%), Positives = 51/83 (61%), Gaps = 2/83 (2%) Frame = +2 Query: 257 SFSTTGALEGQH--FRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGID 430 +FS+TGA+E Q +GY S+SEQ L+DC GC+GG M++AF Y+ NGGID Sbjct: 147 AFSSTGAIESQMKIANGAGYDSSVSEQQLVDCVPNA--LGCSGGWMNDAFTYVAQNGGID 204 Query: 431 TEQTYPYEGVDDKCRYNPKNTGA 499 +E YPYE D C Y+P A Sbjct: 205 SEGAYPYEMADGNCHYDPNQVAA 227 Score = 58.0 bits (134), Expect = 2e-07 Identities = 32/85 (37%), Positives = 45/85 (52%), Gaps = 1/85 (1%) Frame = 
+3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFIS-PANVK 179 Y GLVSY LG+N + DM E +G A +KN G ++ + + A+V+ Sbjct: 65 YRQGLVSYTLGVNLFTDMTPEEMKAYTHGLIMPADLHKN----GIPIKTREDLGLNASVR 120 Query: 180 LPEQVDWRKHGAVTDIKDQGKCGSC 254 P DWR G V+ +K+QG CGSC Sbjct: 121 YPASFDWRDQGMVSPVKNQGSCGSC 145 Score = 40.7 bits (91), Expect = 0.031 Identities = 22/49 (44%), Positives = 27/49 (55%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSS 654 G+V + DE L + VAT GPV+VA DA F YS GVY C + Sbjct: 231 GYVYLSGPDENMLADMVATKGPVAVAFDAD-DPFGSYSGGVYYNPTCET 278 >UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchocercidae|Rep: Cathepsin L-like precursor - Brugia pahangi (Filarial nematode worm) Length = 395 Score = 79.4 bits (187), Expect = 7e-14 Identities = 34/74 (45%), Positives = 47/74 (63%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+T ALE H + +G L+ LS QN++DC+ GNNGC+GG M AF+Y GI E Sbjct: 208 AFATAAALEAYHKQMTGRLLDLSPQNIVDCTRNLGNNGCSGGYMPTAFQY-ASRYGIAME 266 Query: 437 QTYPYEGVDDKCRY 478 YPY G + +CR+ Sbjct: 267 SRYPYVGTEQRCRW 280 Score = 60.5 bits (140), Expect = 4e-08 Identities = 31/84 (36%), Positives = 46/84 (54%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 YE GLVSY +N D+ EF+ NG + + ++G + + +L Sbjct: 128 YEQGLVSYTTALNDLADLTDEEFM-VRNGLRLPNQTD----LRGKRQTSEFYRYDKSERL 182 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P+QVDWR GAVT +++QG+CGSC Sbjct: 183 PDQVDWRTKGAVTPVRNQGECGSC 206 Score = 50.4 bits (115), Expect = 4e-05 Identities = 25/51 (49%), Positives = 28/51 (54%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTD 660 GF +I GDE L AVA GPV V I S SF+ Y GVY+E C D Sbjct: 291 GFNEIQPGDELALKHAVAKRGPVVVGISGSKRSFRFYKDGVYSEGNCGRPD 341 >UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra erinaceieuropaei|Rep: Cysteine proteinase - Spirometra erinaceieuropaei (Tapeworm) Length = 336 Score = 78.2 bits (184), Expect = 2e-13 Identities = 39/74 (52%), 
Positives = 47/74 (63%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 SFS GA+EG ++G L SLSEQ L+DCS YGN GCNGGLM AF+Y + G++ E Sbjct: 147 SFSANGAIEGAIQIKTGALRSLSEQQLMDCSWDYGNQGCNGGLMPQAFQYAQ-RYGVEAE 205 Query: 437 QTYPYEGVDDKCRY 478 Y Y D CRY Sbjct: 206 VDYRYTERDGVCRY 219 Score = 58.8 bits (136), Expect = 1e-07 Identities = 25/48 (52%), Positives = 33/48 (68%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECS 651 G+ ++PEGDE L AVAT+GP+SV IDA+ F YS GV+ + CS Sbjct: 230 GYAELPEGDEGGLQRAVATIGPISVGIDAADPGFMSYSHGVFVSKTCS 277 Score = 47.2 bits (107), Expect = 4e-04 Identities = 17/30 (56%), Positives = 23/30 (76%) Frame = +3 Query: 165 PANVKLPEQVDWRKHGAVTDIKDQGKCGSC 254 P LP+ V+WR+ GAVT +K+QG+CGSC Sbjct: 116 PLKENLPDSVNWRERGAVTSVKNQGQCGSC 145 >UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; n=16; Chrysomelidae|Rep: Digestive cysteine protease intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 326 Score = 78.2 bits (184), Expect = 2e-13 Identities = 38/80 (47%), Positives = 52/80 (65%), Gaps = 1/80 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGC-NGGLMDNAFKYIKDNGGIDT 433 +FS TGALEGQ+ + +SLSEQ L+DCS YGN C GG M AF+Y++D GI + Sbjct: 136 AFSATGALEGQNAILNNVKISLSEQQLLDCSAAYGNGNCKEGGDMSAAFEYVRDY-GIQS 194 Query: 434 EQTYPYEGVDDKCRYNPKNT 493 E++YPY +C+Y+ T Sbjct: 195 EKSYPYIRKQTECQYDASKT 214 Score = 55.6 bits (128), Expect = 1e-06 Identities = 27/84 (32%), Positives = 45/84 (53%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 Y+ G +Y LG+ ++ D+ H EF + G K NK + + P ++++ Sbjct: 61 YDKGEETYLLGVTRFADLTHEEFKDILKGQIK----NKP------RLNATPTVFPEDLEV 110 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P+ +DW + GAV ++KDQ CGSC Sbjct: 111 PDSIDWTEKGAVLEVKDQNPCGSC 134 >UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 precursor; n=4; Schizophora|Rep: Putative cysteine proteinase CG12163 precursor - Drosophila melanogaster (Fruit fly) Length = 614 
Score = 78.2 bits (184), Expect = 2e-13 Identities = 35/75 (46%), Positives = 50/75 (66%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS TG +EG + ++G L SEQ L+DC ++ CNGGLMDNA+K IKD GG++ E Sbjct: 420 AFSVTGNIEGLYAVKTGELKEFSEQELLDCDTT--DSACNGGLMDNAYKAIKDIGGLEYE 477 Query: 437 QTYPYEGVDDKCRYN 481 YPY+ ++C +N Sbjct: 478 AEYPYKAKKNQCHFN 492 Score = 46.4 bits (105), Expect = 6e-04 Identities = 30/83 (36%), Positives = 44/83 (53%) Frame = +3 Query: 6 EMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLP 185 EMG S K G+ ++ DM E+ K G + + GGS A + + +LP Sbjct: 346 EMG--SAKYGITEFADMTSSEY-KERTGLWQRDEAKAT----GGS---AAVVPAYHGELP 395 Query: 186 EQVDWRKHGAVTDIKDQGKCGSC 254 ++ DWR+ AVT +K+QG CGSC Sbjct: 396 KEFDWRQKDAVTQVKNQGSCGSC 418 Score = 38.3 bits (85), Expect = 0.16 Identities = 17/41 (41%), Positives = 27/41 (65%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGV 630 GFVD+P+G+E + E + GP+S+ I+A+ + Q Y GV Sbjct: 502 GFVDLPKGNETAMQEWLLANGPISIGINAN--AMQFYRGGV 540 >UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|Rep: Cathepsin L precursor - Schistosoma mansoni (Blood fluke) Length = 319 Score = 77.8 bits (183), Expect = 2e-13 Identities = 36/72 (50%), Positives = 50/72 (69%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FSTTG +E Q FR++G L+SLSEQ L+DC ++GCNGGL NA++ I GG+ E Sbjct: 131 AFSTTGNVESQWFRKTGKLLSLSEQQLVDCDGL--DDGCNGGLPSNAYESIIKMGGLMLE 188 Query: 437 QTYPYEGVDDKC 472 YPY+ ++KC Sbjct: 189 DNYPYDAKNEKC 200 Score = 44.0 bits (99), Expect = 0.003 Identities = 15/25 (60%), Positives = 21/25 (84%) Frame = +3 Query: 180 LPEQVDWRKHGAVTDIKDQGKCGSC 254 +P+ DWR+ GAVT++K+QG CGSC Sbjct: 105 IPKNFDWREKGAVTEVKNQGMCGSC 129 >UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicotyledons|Rep: Cysteine proteinase - Mesembryanthemum crystallinum (Common ice plant) Length = 367 Score = 77.4 bits (182), Expect = 3e-13 Identities = 37/75 (49%), Positives = 47/75 (62%) 
Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS A+EG + +G L+SLSEQ LIDC Q N+GC GG M AF+YIK GGI +E Sbjct: 152 AFSAAAAVEGINQITTGQLISLSEQQLIDCDTQ--NSGCRGGTMGRAFEYIKQRGGITSE 209 Query: 437 QTYPYEGVDDKCRYN 481 YPY+ C+ N Sbjct: 210 ANYPYKAQAGMCKNN 224 Score = 54.8 bits (126), Expect = 2e-06 Identities = 29/77 (37%), Positives = 44/77 (57%) Frame = +3 Query: 24 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 203 YKL +N++GD+ EF +T +K + +N GG + NV++P +DWR Sbjct: 84 YKLRLNQFGDLTPSEFARTYAN-SKIIEGTRN--ESGGFMY-------ENVEVPRSIDWR 133 Query: 204 KHGAVTDIKDQGKCGSC 254 GAVT +K+QG+CG C Sbjct: 134 VKGAVTPVKNQGRCGGC 150 >UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin L-like proteinase" precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 315 Score = 77.4 bits (182), Expect = 3e-13 Identities = 40/93 (43%), Positives = 55/93 (59%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FSTTG+LEGQ V LSEQ L+DC + N GCNGGLM +AF Y+K + G+ +E Sbjct: 136 AFSTTGSLEGQLAIHKNQRVPLSEQELVDC-DTSRNAGCNGGLMTDAFNYVKRH-GLSSE 193 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVASWTSPRAT 535 Y Y G DD+C+ N +N ++ + T Sbjct: 194 SQYAYTGRDDRCK-NVENKPLSSISGYVELETT 225 Score = 50.4 bits (115), Expect = 4e-05 Identities = 31/84 (36%), Positives = 44/84 (52%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 YE G +Y L +NK+ D EF + + A K ++ AK ++ NV+ Sbjct: 61 YESGEETYYLAVNKFADWSSAEFQAMLA--RQMANKPKQSFI-------AKHVADPNVQA 111 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 E+VDWR AV +KDQG+CGSC Sbjct: 112 VEEVDWRD-SAVLGVKDQGQCGSC 134 Score = 46.0 bits (104), Expect = 8e-04 Identities = 22/47 (46%), Positives = 33/47 (70%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEEC 648 G+V++ E E L AVA+VGPVS+A+DA ++QLY G++N + C Sbjct: 218 GYVEL-ETTEDALASAVASVGPVSIAVDAD--TWQLYGGGLFNNKNC 261 >UniRef50_Q70EX0 Cluster: 
Cathepsin L-like proteinase precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 317 Score = 76.6 bits (180), Expect = 5e-13 Identities = 37/69 (53%), Positives = 43/69 (62%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS GALEGQ F + G L LS Q L+DCS Y N GCNGG A+ YIKDN G+ E Sbjct: 130 AFSAAGALEGQRFLKEGKLEVLSTQQLVDCSRDYKNEGCNGGWPHWAYDYIKDN-GLCLE 188 Query: 437 QTYPYEGVD 463 Y Y+G D Sbjct: 189 SKYKYQGYD 197 Score = 58.0 bits (134), Expect = 2e-07 Identities = 29/84 (34%), Positives = 48/84 (57%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 Y+ G VS+ LG+N++ DM EF K M K +++ ++F++ + + Sbjct: 54 YQNGEVSFYLGVNQFADMTSEEF-KAMLDSQLIHKPKRDIT--------SRFVADPQLTV 104 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 PE +DWR+ GAV ++DQ +CGSC Sbjct: 105 PESIDWREKGAVNPVRDQEQCGSC 128 Score = 38.3 bits (85), Expect = 0.16 Identities = 16/38 (42%), Positives = 25/38 (65%) Frame = +1 Query: 535 EQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEEC 648 E+ L EAV T GP++V ++A + +QLYS G+ + C Sbjct: 221 EEALKEAVGTAGPIAVCVNA-NDDWQLYSGGILESQSC 257 >UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative cysteine proteinase - Oryza sativa subsp. 
japonica (Rice) Length = 357 Score = 76.2 bits (179), Expect = 7e-13 Identities = 36/66 (54%), Positives = 43/66 (65%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS A+EG H +S LV+LS Q L+DCS N+GCN G MD AF+YI NGGI E Sbjct: 161 AFSAVAAVEGIHQIRSHNLVALSTQQLLDCSTGRNNHGCNRGDMDEAFRYITSNGGIAAE 220 Query: 437 QTYPYE 454 YPYE Sbjct: 221 SDYPYE 226 Score = 38.3 bits (85), Expect = 0.16 Identities = 25/81 (30%), Positives = 38/81 (46%) Frame = +3 Query: 12 GLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQ 191 G S +L NK+ D+ + EF + T + GGS G + + +P Sbjct: 88 GKKSPRLTTNKFADLTNEEFAEYYGRPFSTP-------VIGGS--GFMYGNVRTSDVPAN 138 Query: 192 VDWRKHGAVTDIKDQGKCGSC 254 ++WR GAVT +K+Q C SC Sbjct: 139 INWRDRGAVTQVKNQKDCASC 159 Score = 37.9 bits (84), Expect = 0.22 Identities = 24/55 (43%), Positives = 32/55 (58%), Gaps = 2/55 (3%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYN--EEECSSTDL 663 RGF +P +E L+ AVA PVSVA+D Q +SSGV+ + E +TDL Sbjct: 245 RGFQYVPPNNETALLLAVAHQ-PVSVALDGVGKVSQFFSSGVFGAMQNETCTTDL 298 >UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_79, whole genome shotgun sequence - Paramecium tetraurelia Length = 324 Score = 76.2 bits (179), Expect = 7e-13 Identities = 35/77 (45%), Positives = 48/77 (62%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS G LE + G +LSEQ+++DCS YGN GC+GG MD+ F+Y++D+ GI Sbjct: 145 AFSAVGVLEINSNIEFGLETTLSEQDMLDCSGPYGNQGCSGGWMDSGFEYVRDH-GIANG 203 Query: 437 QTYPYEGVDDKCRYNPK 487 YPY G D CR + K Sbjct: 204 SVYPYVGSDQTCRTSVK 220 >UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Rep: Cysteine protease - Solanum lycopersicum (Tomato) (Lycopersicon esculentum) Length = 345 Score = 75.8 bits (178), Expect = 9e-13 Identities = 39/87 (44%), Positives = 51/87 (58%) Frame = +2 Query: 257 
SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS G+LEG + +G L+ SEQ L+DC+ N GCNGG M NAF +I +NGGI E Sbjct: 157 AFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRE 214 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVASW 517 Y Y G CR K T A ++S+ Sbjct: 215 SDYEYLGQQYTCRSQEK-TAAVQISSY 240 Score = 60.5 bits (140), Expect = 4e-08 Identities = 30/81 (37%), Positives = 43/81 (53%) Frame = +3 Query: 12 GLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQ 191 G +SYKLGMN++ D+ EF+ G N + M S K ++ +P Sbjct: 77 GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS--STEFKKINDLSDDYMPSN 134 Query: 192 VDWRKHGAVTDIKDQGKCGSC 254 +DWR+ GAVT +K QG+CG C Sbjct: 135 LDWRESGAVTQVKHQGRCGCC 155 >UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-PA - Drosophila melanogaster (Fruit fly) Length = 549 Score = 75.8 bits (178), Expect = 9e-13 Identities = 38/77 (49%), Positives = 48/77 (62%), Gaps = 2/77 (2%) Frame = +2 Query: 257 SFSTTGALEGQHF-RQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDT 433 SF T G LEG F + G LV LS+Q LIDCS YGNNGC+GG ++++ +GG+ T Sbjct: 356 SFGTIGHLEGAFFLKNGGNLVRLSQQALIDCSWAYGNNGCDGGEDFRVYQWMLQSGGVPT 415 Query: 434 EQTY-PYEGVDDKCRYN 481 E+ Y PY G D C N Sbjct: 416 EEEYGPYLGQDGYCHVN 432 Score = 47.2 bits (107), Expect = 4e-04 Identities = 22/50 (44%), Positives = 29/50 (58%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSS 654 +GFV++ D A+ GP+SVAIDAS +F YS GVY E C + Sbjct: 441 KGFVNVTSNDPNAFKLALLKHGPLSVAIDASPKTFSFYSHGVYYEPTCKN 490 Score = 44.8 bits (101), Expect = 0.002 Identities = 16/26 (61%), Positives = 21/26 (80%) Frame = +3 Query: 177 KLPEQVDWRKHGAVTDIKDQGKCGSC 254 ++P+Q DWR +GAVT +KDQ CGSC Sbjct: 329 EIPDQYDWRLYGAVTPVKDQSVCGSC 354 >UniRef50_Q235G6 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 325 Score = 75.8 bits (178), Expect = 9e-13 Identities = 37/78 (47%), Positives = 49/78 (62%) 
Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 SF+TTG +EG +F L +LS+Q LIDC+ Q N GC GGL D A Y+K+ G+ TE Sbjct: 143 SFATTGGVEGANFVYKNVLPNLSQQQLIDCNTQ--NKGCGGGLRDIALNYVKET-GLTTE 199 Query: 437 QTYPYEGVDDKCRYNPKN 490 + Y YE + KCR K+ Sbjct: 200 EEYSYEAKNGKCRLQGKS 217 Score = 35.1 bits (77), Expect = 1.5 Identities = 12/21 (57%), Positives = 16/21 (76%) Frame = +3 Query: 192 VDWRKHGAVTDIKDQGKCGSC 254 +DW + GAVT +K+QG CG C Sbjct: 121 IDWVEKGAVTPVKNQGGCGGC 141 >UniRef50_O16454 Cluster: Temporarily assigned gene name protein 196; n=4; Bilateria|Rep: Temporarily assigned gene name protein 196 - Caenorhabditis elegans Length = 477 Score = 75.4 bits (177), Expect = 1e-12 Identities = 36/72 (50%), Positives = 45/72 (62%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FSTTG +EG F LVSLSEQ L+DC + GCNGGL NA+K I GG++ E Sbjct: 290 AFSTTGNVEGAWFIAKNKLVSLSEQELVDCDSM--DQGCNGGLPSNAYKEIIRMGGLEPE 347 Query: 437 QTYPYEGVDDKC 472 YPY+G + C Sbjct: 348 DAYPYDGRGETC 359 Score = 47.6 bits (108), Expect = 3e-04 Identities = 29/83 (34%), Positives = 40/83 (48%) Frame = +3 Query: 6 EMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLP 185 E G Y G K+ DM EF K M + + + +Y + ++ LP Sbjct: 212 EQGTAVY--GFTKFSDMTTMEFKKIMLPY----QWEQPVYPMEQANFEKHDVTINEEDLP 265 Query: 186 EQVDWRKHGAVTDIKDQGKCGSC 254 E DWR+ GAVT +K+QG CGSC Sbjct: 266 ESFDWREKGAVTQVKNQGNCGSC 288 >UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n=1; Xenopus tropicalis|Rep: UPI00006A2275 UniRef100 entry - Xenopus tropicalis Length = 272 Score = 74.9 bits (176), Expect = 2e-12 Identities = 38/81 (46%), Positives = 51/81 (62%), Gaps = 1/81 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS GALE Q +++ LV+ S Q L+DCS+ GN+GCNGG ++ AFKY+K G ++ E Sbjct: 106 AFSAVGALECQWKKKTVRLVTFSPQELVDCSDGEGNHGCNGGKIEKAFKYMKKYGVME-E 164 Query: 437 QTYPYEGVDDKCR-YNPKNTG 496 YPY G CR P N G Sbjct: 165 SAYPYTGQKGLCRKKQPGNIG 185 
Score = 51.6 bits (118), Expect = 2e-05 Identities = 22/44 (50%), Positives = 29/44 (65%) Frame = +1 Query: 517 DIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEEC 648 D+P G+E LM V T+GPVSV+I+AS F + SGVY +C Sbjct: 192 DLPSGNETLLMNTVGTIGPVSVSINASSEKFHQFKSGVYYNPDC 235 Score = 49.2 bits (112), Expect = 9e-05 Identities = 28/85 (32%), Positives = 39/85 (45%), Gaps = 1/85 (1%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 Y +GL +Y++GMN GDM E TM G+ + N+ + A Sbjct: 28 YSLGLHTYEVGMNHLGDMTGEEVAATMTGYTGSGDSLANMSHVPKEILEA--------LA 79 Query: 183 PEQVDWRKHGAVTDIKDQGK-CGSC 254 P +DWR VT ++DQG C SC Sbjct: 80 PPSIDWRTQNCVTPVRDQGSFCRSC 104 >UniRef50_Q23TW3 Cluster: Papain family cysteine protease containing protein; n=5; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 437 Score = 74.9 bits (176), Expect = 2e-12 Identities = 32/80 (40%), Positives = 46/80 (57%), Gaps = 1/80 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYL-VSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDT 433 +F+ ALE + ++G + SEQ L+DC+ ++ GC+GGL F+Y+ GGI Sbjct: 232 AFAAVAALESHYALKTGKKPIQFSEQQLVDCARKFDTKGCSGGLPSKGFEYLAYAGGIQN 291 Query: 434 EQTYPYEGVDDKCRYNPKNT 493 E YPYEG D CR+N T Sbjct: 292 EADYPYEGEDKNCRFNSSKT 311 Score = 42.3 bits (95), Expect = 0.010 Identities = 17/27 (62%), Positives = 21/27 (77%), Gaps = 1/27 (3%) Frame = +3 Query: 177 KLPEQVDWRKHGAVTDIKDQGK-CGSC 254 +LP+ VDWR+ G VT +K QGK CGSC Sbjct: 204 QLPQYVDWREKGVVTQVKSQGKDCGSC 230 >UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis thaliana|Rep: Cysteine proteinase - Arabidopsis thaliana (Mouse-ear cress) Length = 348 Score = 74.1 bits (174), Expect = 3e-12 Identities = 36/72 (50%), Positives = 42/72 (58%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS A+EG G LVSLSEQ L+DC Y N GC GG+M AF+YI N GI TE Sbjct: 154 AFSAVAAVEGITKITKGELVSLSEQQLLDCDRDY-NQGCRGGIMSKAFEYIIKNQGITTE 212 Query: 437 QTYPYEGVDDKC 472 
YPY+ C Sbjct: 213 DNYPYQESQQTC 224 Score = 45.6 bits (103), Expect = 0.001 Identities = 25/80 (31%), Positives = 39/80 (48%), Gaps = 1/80 (1%) Frame = +3 Query: 18 VSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK-LPEQV 194 ++YK+ +N++ D+ EF T G + + G + NV E + Sbjct: 75 ITYKVDINEFSDLTDEEFRATHTGLVVPEAITRISTLSSG--KNTVPFRYGNVSDNGESM 132 Query: 195 DWRKHGAVTDIKDQGKCGSC 254 DWR+ GAVT +K QG+CG C Sbjct: 133 DWRQEGAVTPVKYQGRCGGC 152 Score = 39.5 bits (88), Expect = 0.071 Identities = 22/52 (42%), Positives = 34/52 (65%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 G+ +P +E+ L++AV+ PVSV I+ + +F+ YS GV+N EC TDL Sbjct: 241 GYETVPMNNEEALLQAVSQQ-PVSVGIEGTGAAFRHYSGGVFN-GEC-GTDL 289 >UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1; Dictyostelium discoideum AX4|Rep: Counting factor associated protein - Dictyostelium discoideum AX4 Length = 531 Score = 74.1 bits (174), Expect = 3e-12 Identities = 33/73 (45%), Positives = 47/73 (64%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F +TG+LEG + +G LVSLSEQ L+DC+ G+ GC GG +AF+Y+ + G + TE Sbjct: 335 TFGSTGSLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGSLATE 394 Query: 437 QTYPYEGVDDKCR 475 YPY + CR Sbjct: 395 SNYPYLMQNGLCR 407 Score = 57.2 bits (132), Expect = 3e-07 Identities = 24/49 (48%), Positives = 32/49 (65%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSS 654 G+V++ G E L A+AT GPV++AIDAS F+ Y SGVYN C + Sbjct: 420 GYVNVTSGSESALQNAIATTGPVAIAIDASVDDFRYYMSGVYNNPACKN 468 Score = 49.6 bits (113), Expect = 7e-05 Identities = 32/80 (40%), Positives = 39/80 (48%), Gaps = 2/80 (2%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANV--KLPEQV 194 SYKLGMN Y D+ + EF + K A+ SV GA + +P V Sbjct: 265 SYKLGMNHYADLSNKEFNTLVKP--KVARP---------SVTGADSVHDDESLRSIPSTV 313 Query: 195 DWRKHGAVTDIKDQGKCGSC 254 DWR VT +KDQG CGSC Sbjct: 314 DWRNQNCVTPVKDQGICGSC 333 >UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 
precursor; n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1 precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 355 Score = 74.1 bits (174), Expect = 3e-12 Identities = 36/65 (55%), Positives = 44/65 (67%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FST A+EG + +G L SLSEQ LIDC + N+GCNGGLMD AF+YI GG+ E Sbjct: 163 AFSTVAAVEGINQITTGNLSSLSEQELIDCDTTF-NSGCNGGLMDYAFQYIISTGGLHKE 221 Query: 437 QTYPY 451 YPY Sbjct: 222 DDYPY 226 Score = 58.0 bits (134), Expect = 2e-07 Identities = 32/78 (41%), Positives = 39/78 (50%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 200 SY LG+N++ D+ H EF G K K A F LP+ VDW Sbjct: 91 SYWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQ-------PSANFRYRDITDLPKSVDW 143 Query: 201 RKHGAVTDIKDQGKCGSC 254 RK GAV +KDQG+CGSC Sbjct: 144 RKKGAVAPVKDQGQCGSC 161 Score = 48.4 bits (110), Expect = 2e-04 Identities = 26/52 (50%), Positives = 36/52 (69%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 G+ D+PE D++ L++A+A PVSVAI+AS FQ Y GV+N +C TDL Sbjct: 247 GYEDVPENDDESLVKALAHQ-PVSVAIEASGRDFQFYKGGVFN-GKC-GTDL 295 >UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8; Magnoliophyta|Rep: Os12g0273800 protein - Oryza sativa subsp. 
japonica (Rice) Length = 504 Score = 73.7 bits (173), Expect = 4e-12 Identities = 36/80 (45%), Positives = 47/80 (58%) Frame = +2 Query: 275 ALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYE 454 A+EG +G L+SLSEQ L+DC + GC GG +D AF++I NGG+ E YPY Sbjct: 156 AMEGFVKLSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYT 215 Query: 455 GVDDKCRYNPKNTGAEDVAS 514 D +C K T A DVA+ Sbjct: 216 AEDGRC----KTTAAADVAA 231 Score = 52.4 bits (120), Expect = 9e-06 Identities = 29/78 (37%), Positives = 40/78 (51%) Frame = +3 Query: 24 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 203 Y LG+N++ D+ EF TM + N + + G K+ + + LP VDWR Sbjct: 86 YWLGVNQFADLTSEEFKATMTNSKGFSTPNNGVRVS----TGFKYENVSADALPASVDWR 141 Query: 204 KHGAVTDIKDQGKCGSCG 257 GAVT IKDQG+C G Sbjct: 142 TKGAVTRIKDQGQCAMEG 159 Score = 45.2 bits (102), Expect = 0.001 Identities = 27/52 (51%), Positives = 32/52 (61%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTD 660 RG+ D+P DE LM+AVA PVSVA+DAS FQ Y GV E +S D Sbjct: 234 RGYEDVPANDEPSLMKAVAG-QPVSVAVDAS--KFQFYGGGVMAGECGTSLD 282 >UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine protease; n=1; Maconellicoccus hirsutus|Rep: Putative cathepsin L-like cysteine protease - Maconellicoccus hirsutus (hibiscus mealybug) Length = 339 Score = 73.7 bits (173), Expect = 4e-12 Identities = 31/78 (39%), Positives = 47/78 (60%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 ++S GALEGQ +S QN+IDCSE GN GC+GG +++ YI GG+D + Sbjct: 147 AWSAIGALEGQLASDKKKFQGISVQNVIDCSESTGNKGCSGGNQHHSYFYIYKQGGVDDD 206 Query: 437 QTYPYEGVDDKCRYNPKN 490 +YPY+ ++ C + +N Sbjct: 207 VSYPYKDAEEPCAFKKEN 224 Score = 53.2 bits (122), Expect = 5e-06 Identities = 22/49 (44%), Positives = 31/49 (63%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSS 654 G + +P+G E L E+VA GPV+ IDA+H SF Y G+Y E +C + Sbjct: 231 GEITLPDGYETNLHESVAVYGPVAATIDATHQSFHSYKGGIYFEPDCGN 279 Score = 48.8 bits (111), Expect = 1e-04 Identities = 
27/83 (32%), Positives = 42/83 (50%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 + GLV+++ G+N+Y DML EF + M + + + +N G + +F NV Sbjct: 67 FHKGLVTFEQGINEYSDMLQSEFNEKM---GQKSSNQRNTEANG--LPSIRFTPLHNVNP 121 Query: 183 PEQVDWRKHGAVTDIKDQGKCGS 251 P+ VDWR G V + Q C S Sbjct: 122 PDSVDWRTKGLVGPVGKQVNCSS 144 >UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep: LOC443661 protein - Xenopus laevis (African clawed frog) Length = 346 Score = 73.3 bits (172), Expect = 5e-12 Identities = 35/81 (43%), Positives = 51/81 (62%), Gaps = 1/81 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS GALE Q ++ G LV+ S Q L+DCS GN GC GG + ++F Y+K +G ++ + Sbjct: 166 AFSAVGALECQWKKKKGTLVTFSPQELVDCSYSEGNKGCKGGSIRSSFTYMKKSGVME-D 224 Query: 437 QTYPYEGVDDKC-RYNPKNTG 496 YPY G ++KC + P TG Sbjct: 225 FNYPYTGKEEKCKKKKPSKTG 245 Score = 56.0 bits (129), Expect = 8e-07 Identities = 31/84 (36%), Positives = 42/84 (50%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 Y +GL +Y++GMN GDM E TM G+ + N+ R K + A Sbjct: 89 YSLGLHTYEVGMNHLGDMTGEEVEATMTGYTSSDDSLANM------TRVPKKLLEAQP-- 140 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P +DWR G VT ++ Q KCGSC Sbjct: 141 PASIDWRTKGCVTSVRRQRKCGSC 164 Score = 35.5 bits (78), Expect = 1.1 Identities = 17/31 (54%), Positives = 21/31 (67%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDAS 597 + F +P DE LM+ V TVGPVSVAI+ S Sbjct: 248 KDFHSVPARDEILLMKVVGTVGPVSVAINCS 278 >UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber officinale (Ginger) Length = 475 Score = 73.3 bits (172), Expect = 5e-12 Identities = 35/87 (40%), Positives = 53/87 (60%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+ A+EG + +G L+SLSEQ L+DCS + N GC GG AF+YI +NGG+++E Sbjct: 169 AFAAIAAVEGINQIVTGDLISLSEQQLVDCSTR--NYGCEGGWPYRAFQYIINNGGVNSE 226 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVASW 517 + 
YPY G + C +N + S+ Sbjct: 227 EHYPYTGTNGTCNTTKENAHVVSIDSY 253 Score = 52.8 bits (121), Expect = 7e-06 Identities = 25/84 (29%), Positives = 47/84 (55%), Gaps = 1/84 (1%) Frame = +3 Query: 6 EMGLVSYKLGMNKYGDMLHHEF-VKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 + G +Y+LGMN++ D+ + E+ + + ++ + G + + +V L Sbjct: 91 DRGEHAYRLGMNRFADLTNEEYRARFLRDLSRLGRSTS------GEISNQYRLREGDV-L 143 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P+ +DWR+ GAV +K+QG+CGSC Sbjct: 144 PDSIDWREKGAVVAVKNQGRCGSC 167 Score = 41.9 bits (94), Expect = 0.013 Identities = 19/39 (48%), Positives = 27/39 (69%) Frame = +1 Query: 517 DIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVY 633 ++P DE+ L +A A P+SV IDAS +FQLY SG++ Sbjct: 255 NVPSNDEKSLQKAAANQ-PISVGIDASGRNFQLYHSGIF 292 >UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_23, whole genome shotgun sequence - Paramecium tetraurelia Length = 321 Score = 73.3 bits (172), Expect = 5e-12 Identities = 36/78 (46%), Positives = 50/78 (64%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS GALE Q +V LSEQ+L+DC+ YGN GC+GG M++A YI D+G +T Sbjct: 145 AFSAVGALEINTKIQFNEIVDLSEQDLVDCAGPYGNAGCDGGWMESALDYIIDSGIAET- 203 Query: 437 QTYPYEGVDDKCRYNPKN 490 + YPY+G D C+ +N Sbjct: 204 KVYPYKGEDGICKSVERN 221 Score = 43.2 bits (97), Expect = 0.006 Identities = 30/78 (38%), Positives = 39/78 (50%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 200 SYK +NK+GD+ EF+ A+ KN+ K P V+ E+VDW Sbjct: 78 SYKQKINKFGDLTDQEFLTIYLNLQMPARV-KNIQ---------KNEEPFLVQ--EEVDW 125 Query: 201 RKHGAVTDIKDQGKCGSC 254 + G V IKDQG CGSC Sbjct: 126 VQKGKVPAIKDQGDCGSC 143 >UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|Rep: Cathepsin F precursor - Homo sapiens (Human) Length = 484 Score = 73.3 bits (172), Expect = 5e-12 Identities = 34/75 (45%), Positives = 47/75 (62%) Frame = +2 Query: 257 
SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS TG +EGQ F G L+SLSEQ L+DC + + C GGL NA+ IK+ GG++TE Sbjct: 297 AFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKM--DKACMGGLPSNAYSAIKNLGGLETE 354 Query: 437 QTYPYEGVDDKCRYN 481 Y Y+G C ++ Sbjct: 355 DDYSYQGHMQSCNFS 369 Score = 43.2 bits (97), Expect = 0.006 Identities = 27/75 (36%), Positives = 37/75 (49%), Gaps = 1/75 (1%) Frame = +3 Query: 33 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMK-GGSVRGAKFISPANVKLPEQVDWRKH 209 G+ K+ D+ EF +T N L + G ++ AK + P + DWR Sbjct: 232 GVTKFSDLTEEEF--------RTIYLNTLLRKEPGNKMKQAKSVGDL---APPEWDWRSK 280 Query: 210 GAVTDIKDQGKCGSC 254 GAVT +KDQG CGSC Sbjct: 281 GAVTKVKDQGMCGSC 295 >UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue of Sarcophaga 26,29kDa proteinase; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to homologue of Sarcophaga 26,29kDa proteinase - Nasonia vitripennis Length = 553 Score = 72.9 bits (171), Expect = 6e-12 Identities = 36/73 (49%), Positives = 49/73 (67%), Gaps = 1/73 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 SF TTGA+EG +F + LV LS+Q LIDCS +GNNGC+GG ++++I +GG+ TE Sbjct: 360 SFGTTGAVEGAYFMKYKKLVRLSQQALIDCSWGFGNNGCDGGEDFRSYQWIIKHGGLPTE 419 Query: 437 QTY-PYEGVDDKC 472 + Y Y G D C Sbjct: 420 EEYGGYLGQDGYC 432 Score = 53.2 bits (122), Expect = 5e-06 Identities = 23/52 (44%), Positives = 34/52 (65%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTD 660 +GFV++ + + A+ GP+SVAIDASH +F YS+GVY E C +T+ Sbjct: 444 KGFVNVDTNNVDAMKLALFKHGPISVAIDASHKTFSFYSNGVYYEPACGNTE 495 Score = 41.9 bits (94), Expect = 0.013 Identities = 15/25 (60%), Positives = 19/25 (76%) Frame = +3 Query: 180 LPEQVDWRKHGAVTDIKDQGKCGSC 254 +P+ DWR +GAVT +KDQ CGSC Sbjct: 334 VPDSFDWRLYGAVTPVKDQSVCGSC 358 >UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep: Cysteine proteinase - Cryptobia salmositica Length = 443 Score = 72.9 bits (171), Expect = 6e-12 Identities = 39/83 (46%), Positives = 52/83 (62%), Gaps = 5/83 (6%) 
Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYI--KDNGGID 430 SFSTTG +EGQH +G LV++SEQ L+ C ++GCNGGLMDNAF ++ G I Sbjct: 140 SFSTTGNIEGQHAIATGQLVAVSEQELVSCDPI--DDGCNGGLMDNAFGWLISAHKGQIA 197 Query: 431 TEQTYPY---EGVDDKCRYNPKN 490 TE YPY G+ C +P++ Sbjct: 198 TEANYPYVSGNGIVPACSSSPES 220 Score = 46.0 bits (104), Expect = 8e-04 Identities = 27/76 (35%), Positives = 38/76 (50%), Gaps = 2/76 (2%) Frame = +3 Query: 33 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK--LPEQVDWRK 206 G N++ DM EF N A+H K + K + +K + +Q+DWR Sbjct: 69 GPNEFADMTSEEFQTRHNA----ARHYAAA--KARPPKNTKTFTAEEIKAAVGQQIDWRL 122 Query: 207 HGAVTDIKDQGKCGSC 254 GAVT +K+QG CGSC Sbjct: 123 KGAVTPVKNQGACGSC 138 >UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium tetraurelia|Rep: Cathepsin L1 precursor - Paramecium tetraurelia Length = 314 Score = 72.9 bits (171), Expect = 6e-12 Identities = 35/77 (45%), Positives = 47/77 (61%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS GALE + LSEQ+L+DCS Y N+GCNGG MD+AF+Y+ DN G+ Sbjct: 137 AFSAVGALEINTDIELNRKYELSEQDLVDCSGPYDNDGCNGGWMDSAFEYVADN-GLAEA 195 Query: 437 QTYPYEGVDDKCRYNPK 487 + YPY D C+ + K Sbjct: 196 KDYPYTAKDGTCKTSVK 212 >UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine protease; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cysteine protease - Strongylocentrotus purpuratus Length = 494 Score = 72.5 bits (170), Expect = 8e-12 Identities = 34/75 (45%), Positives = 49/75 (65%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS G +EGQ + G L+SLSEQ L+DC + G GC GG M +A++ I GG +E Sbjct: 266 AFSAIGNMEGQWQIKKGELISLSEQELVDCDKVDG--GCEGGEMSDAYEAIIKLGGAMSE 323 Query: 437 QTYPYEGVDDKCRYN 481 + YPY G ++KC++N Sbjct: 324 EKYPYRGENEKCKFN 338 Score = 52.0 bits (119), Expect = 1e-05 Identities = 32/84 (38%), Positives = 39/84 (46%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 +E G Y G K+ DM EF K 
+G K K + G V Sbjct: 196 FEQGTAKY--GPTKFADMTEAEFRKLQSGPLKKTGIKKQAAIPQGPV------------- 240 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 PE+ DWR HGAVT +K+QG CGSC Sbjct: 241 PEEYDWRTHGAVTPVKNQGMCGSC 264 >UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dvir_CG5367 - Drosophila virilis (Fruit fly) Length = 298 Score = 72.5 bits (170), Expect = 8e-12 Identities = 31/87 (35%), Positives = 51/87 (58%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS ++EGQ F+++G +V+LSEQ ++DCS +GN GC GG + N +Y++ GG+ Sbjct: 113 AFSIAQSIEGQVFKRTGKIVALSEQQIVDCSVSHGNQGCIGGSLRNTLRYLQATGGLMRS 172 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVASW 517 Y Y +C++ +V SW Sbjct: 173 LDYKYASKKGECQF-VSELAVVNVTSW 198 Score = 55.2 bits (127), Expect = 1e-06 Identities = 23/48 (47%), Positives = 35/48 (72%) Frame = +1 Query: 520 IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 +P DE + AVA +GPV+V+I+AS +FQLYS G+Y++ C+ST + Sbjct: 201 LPAKDENAIQAAVAHIGPVAVSINASPKTFQLYSEGIYDDVSCTSTSV 248 Score = 40.7 bits (91), Expect = 0.031 Identities = 27/85 (31%), Positives = 40/85 (47%), Gaps = 1/85 (1%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI-SPANVK 179 YE G S++L N DM ++K G+ + + + S A + SP Sbjct: 34 YETGKSSFRLATNTMADMNTDSYLK---GYLRLLRSPEI----SDSDNIADIVGSPLMNN 86 Query: 180 LPEQVDWRKHGAVTDIKDQGKCGSC 254 +PE DWRK G +T + +Q CGSC Sbjct: 87 VPESFDWRKKGFITPLYNQQSCGSC 111 >UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 513 Score = 71.7 bits (168), Expect = 1e-11 Identities = 36/97 (37%), Positives = 56/97 (57%), Gaps = 1/97 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+ GALEG HF ++G + LSEQ ++DC+ +GN GC GG A ++I +GG+ TE Sbjct: 322 AFAVAGALEGAHFIKTGLKLDLSEQQIVDCTWGFGNRGCKGGYPYRAMQWILKHGGLATE 381 Query: 437 QTY-PYEGVDDKCRYNPKNTGAEDVASWTSPRATNRS 544 ++Y Y + C + + GA + + S R N S Sbjct: 382 ESYGRYLAQEGYCHFKNTSIGAR-LDKYMSIRQGNTS 417 
Score = 44.4 bits (100), Expect = 0.002 Identities = 18/27 (66%), Positives = 19/27 (70%) Frame = +3 Query: 174 VKLPEQVDWRKHGAVTDIKDQGKCGSC 254 V LP VDWRK GAV +K QG CGSC Sbjct: 294 VPLPPHVDWRKAGAVNSVKSQGICGSC 320 Score = 41.9 bits (94), Expect = 0.013 Identities = 16/47 (34%), Positives = 30/47 (63%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECS 651 ++ I +G+ +L AVA GPVS+ ++ +F+ Y SG+Y + +C+ Sbjct: 408 YMSIRQGNTSQLKLAVAFYGPVSILVNTQPKTFKFYGSGIYYDTQCT 454 >UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: Cathepsin L - Kudoa thyrsites Length = 300 Score = 71.7 bits (168), Expect = 1e-11 Identities = 35/78 (44%), Positives = 51/78 (65%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 SFS GA+E + ++G LV+ SEQ L+DCS + N+GCNGGL + AF Y+ +N GI Sbjct: 128 SFSAAGAIESAYAIKTGELVNFSEQQLVDCSTE--NHGCNGGLPEIAFLYVINN-GIMKL 184 Query: 437 QTYPYEGVDDKCRYNPKN 490 + YPY C+Y+P++ Sbjct: 185 KDYPYTAKQGTCQYSPED 202 Score = 44.8 bits (101), Expect = 0.002 Identities = 20/46 (43%), Positives = 30/46 (65%) Frame = +1 Query: 526 EGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 E +E+ +ME+VA GP S+ I+A+ SFQ Y G+Y++ SS L Sbjct: 213 ENNEESVMESVANNGPNSIGINAASRSFQFYGGGIYSDPWASSYPL 258 Score = 40.7 bits (91), Expect = 0.031 Identities = 15/25 (60%), Positives = 18/25 (72%) Frame = +3 Query: 180 LPEQVDWRKHGAVTDIKDQGKCGSC 254 LP VDW+ G VT +K+QG CGSC Sbjct: 102 LPSSVDWKALGKVTSVKNQGHCGSC 126 >UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, whole genome shotgun sequence; n=7; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_22, whole genome shotgun sequence - Paramecium tetraurelia Length = 350 Score = 71.7 bits (168), Expect = 1e-11 Identities = 35/72 (48%), Positives = 46/72 (63%), Gaps = 1/72 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYG-NNGCNGGLMDNAFKYIKDNGGIDT 433 +FSTTG LEG + Q+G L LSEQ L+DCS N GC+GG+ A Y+K N G+ T Sbjct: 168 
AFSTTGVLEGFYKVQTGELPDLSEQQLVDCSTLIDFNQGCDGGMPSRALNYVKRN-GLTT 226 Query: 434 EQTYPYEGVDDK 469 + YPYE + +K Sbjct: 227 QDAYPYEHIQNK 238 Score = 40.7 bits (91), Expect = 0.031 Identities = 14/21 (66%), Positives = 17/21 (80%) Frame = +3 Query: 192 VDWRKHGAVTDIKDQGKCGSC 254 +DWR GAV +KDQG+CGSC Sbjct: 146 IDWRTRGAVNKVKDQGQCGSC 166 >UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryota|Rep: Cysteine proteinase GP-I - Zingiber officinale (Ginger) Length = 221 Score = 71.7 bits (168), Expect = 1e-11 Identities = 34/72 (47%), Positives = 48/72 (66%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F A+EG + +G L+SLSEQ L+DCS + N+GC GG AF+YI +NGGI++E Sbjct: 29 AFDAIAAVEGINQIVTGDLISLSEQQLVDCSTR--NHGCEGGWPYRAFQYIINNGGINSE 86 Query: 437 QTYPYEGVDDKC 472 + YPY G + C Sbjct: 87 EHYPYTGTNGTC 98 Score = 43.2 bits (97), Expect = 0.006 Identities = 15/25 (60%), Positives = 20/25 (80%) Frame = +3 Query: 180 LPEQVDWRKHGAVTDIKDQGKCGSC 254 LP+ +DWR+ GAV +K+QG CGSC Sbjct: 3 LPDSIDWREKGAVVPVKNQGGCGSC 27 Score = 39.9 bits (89), Expect = 0.053 Identities = 18/39 (46%), Positives = 27/39 (69%) Frame = +1 Query: 517 DIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVY 633 ++P DE+ L +AVA PVSV +DA+ FQLY +G++ Sbjct: 114 NVPSNDEKSLQKAVANQ-PVSVTMDAAGRDFQLYRNGIF 151 >UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin l; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin l - Strongylocentrotus purpuratus Length = 489 Score = 71.3 bits (167), Expect = 2e-11 Identities = 34/76 (44%), Positives = 47/76 (61%), Gaps = 1/76 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 SF + +EG F QSG V LS+Q L+DC+ GNNGC+GG ++++ NGGI E Sbjct: 293 SFGSAETIEGAVFMQSGKRVRLSQQMLMDCTWAAGNNGCDGGEEWRVYEWLMKNGGIPLE 352 Query: 437 QTY-PYEGVDDKCRYN 481 +TY PY G + C Y+ Sbjct: 353 ETYGPYLGQNGMCHYD 368 Score = 47.6 bits (108), Expect = 3e-04 Identities = 19/49 (38%), Positives = 32/49 (65%) Frame = +1 Query: 511 
FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSST 657 + ++ G+++ L +A+AT GP++V IDA+ SF YS G Y + C +T Sbjct: 379 YYNVTSGNQKDLKKALATKGPIAVGIDAAVPSFSFYSYGTYYDASCGNT 427 Score = 45.6 bits (103), Expect = 0.001 Identities = 26/79 (32%), Positives = 38/79 (48%) Frame = +3 Query: 18 VSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVD 197 + Y L +N D H E +K M G + + N L G V ++ +P+ +D Sbjct: 222 LGYVLDINHMADQSHQE-LKRMRGRLRQTRPNNGLPYDGSDV--------SDDAVPDHID 272 Query: 198 WRKHGAVTDIKDQGKCGSC 254 W GAV+ +KDQ CGSC Sbjct: 273 WNVLGAVSPVKDQAVCGSC 291 >UniRef50_A2Q4E7 Cluster: Peptidase C1A, papain; n=1; Medicago truncatula|Rep: Peptidase C1A, papain - Medicago truncatula (Barrel medic) Length = 263 Score = 71.3 bits (167), Expect = 2e-11 Identities = 34/61 (55%), Positives = 41/61 (67%) Frame = +2 Query: 278 LEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEG 457 +EG SG LVS SEQ L+DC NGCNGG +AFK+I +NGGI TE +YPY+G Sbjct: 187 IEGIQQIISGNLVSFSEQQLVDCVTSNWTNGCNGGNKIDAFKFILENGGIATEASYPYKG 246 Query: 458 V 460 V Sbjct: 247 V 247 >UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaster|Rep: CG11459-PA - Drosophila melanogaster (Fruit fly) Length = 336 Score = 70.9 bits (166), Expect = 2e-11 Identities = 34/72 (47%), Positives = 49/72 (68%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FST+G LE ++ G LV LS ++L+DC Y NNGC+GG + AF Y +D+ GI T+ Sbjct: 145 AFSTSGVLEAHMAKKYGNLVPLSPKHLVDC-VPYPNNGCSGGWVSVAFNYTRDH-GIATK 202 Query: 437 QTYPYEGVDDKC 472 ++YPYE V +C Sbjct: 203 ESYPYEPVSGEC 214 Score = 45.6 bits (103), Expect = 0.001 Identities = 21/49 (42%), Positives = 29/49 (59%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSS 654 G+V + DE++L E V +GPV+V+ID H F YS GV + C S Sbjct: 227 GYVTLGNYDERELAEVVYNIGPVAVSIDHLHEEFDQYSGGVLSIPACRS 275 >UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: Cysteine proteinase - Paragonimus westermani Length = 272 Score = 70.9 bits (166), Expect = 2e-11 Identities = 
32/72 (44%), Positives = 48/72 (66%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FST G +EGQ F ++G LVSLS+Q L+DC +GCNGG +++ I GG++++ Sbjct: 80 AFSTAGNVEGQWFIKTGQLVSLSKQQLVDCDR--AADGCNGGWPASSYLEIMHMGGLESQ 137 Query: 437 QTYPYEGVDDKC 472 YPY GV ++C Sbjct: 138 DDYPYAGVKEQC 149 Score = 45.2 bits (102), Expect = 0.001 Identities = 18/35 (51%), Positives = 25/35 (71%), Gaps = 1/35 (2%) Frame = +3 Query: 153 KFISPANVKL-PEQVDWRKHGAVTDIKDQGKCGSC 254 K + P +K PE++DWR GAVT +++QG CGSC Sbjct: 44 KRVRPTGLKAAPERIDWRAKGAVTAVENQGSCGSC 78 >UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gambiae str. PEST|Rep: ENSANGP00000018713 - Anopheles gambiae str. PEST Length = 559 Score = 70.9 bits (166), Expect = 2e-11 Identities = 36/76 (47%), Positives = 47/76 (61%), Gaps = 1/76 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS G +EG H ++ L S SEQ LIDC + +NGC GG MD+AFK I+ GG++ E Sbjct: 365 AFSAVGNVEGLHQIKTKKLESYSEQELIDCDKV--DNGCGGGYMDDAFKAIEQLGGLELE 422 Query: 437 QTYPYEGVDDK-CRYN 481 YPYE K C +N Sbjct: 423 NDYPYEAKAQKSCHFN 438 Score = 52.8 bits (121), Expect = 7e-06 Identities = 30/84 (35%), Positives = 45/84 (53%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 +E G Y G+ K+ DM E+ + G KH++ ++ G V + ++ L Sbjct: 286 FERGTAKY--GVTKFADMTVAEY-RAHTGL-VVPKHDRANHV-GNRVASEEDVAGVG-DL 339 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P DWR HGAVT++K+QG CGSC Sbjct: 340 PRSFDWRDHGAVTEVKNQGSCGSC 363 >UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 precursor; n=2; Arabidopsis thaliana|Rep: Probable cysteine proteinase At3g43960 precursor - Arabidopsis thaliana (Mouse-ear cress) Length = 376 Score = 70.5 bits (165), Expect = 3e-11 Identities = 35/69 (50%), Positives = 46/69 (66%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+ TGA+EG + +G LVSLSEQ LIDC N GC GG AF++IK+NGGI ++ Sbjct: 154 
AFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSD 213 Query: 437 QTYPYEGVD 463 + Y Y G D Sbjct: 214 EVYGYTGED 222 Score = 45.2 bits (102), Expect = 0.001 Identities = 29/79 (36%), Positives = 41/79 (51%), Gaps = 1/79 (1%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 200 SY+ G+NK+ D+ EF + G K K K S ++ LP++VDW Sbjct: 82 SYERGLNKFSDLTADEFQASYLG----GKMEK----KSLSDVAERYQYKEGDVLPDEVDW 133 Query: 201 RKHGAVTD-IKDQGKCGSC 254 R+ GAV +K QG+CGSC Sbjct: 134 RERGAVVPRVKRQGECGSC 152 >UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 21 SCAF14577, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 478 Score = 69.7 bits (163), Expect = 6e-11 Identities = 34/69 (49%), Positives = 48/69 (69%), Gaps = 1/69 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 SF+TTG +EG F ++G L LS+Q LIDCS +GNN C+GG A+++I +GGI + Sbjct: 231 SFATTGTIEGALFLKTGSLQVLSQQMLIDCSWGFGNNACDGGEEWRAYEWIMKHGGIASA 290 Query: 437 QTY-PYEGV 460 +TY PY G+ Sbjct: 291 ETYGPYLGM 299 Score = 63.7 bits (148), Expect = 4e-09 Identities = 35/95 (36%), Positives = 54/95 (56%), Gaps = 2/95 (2%) Frame = +2 Query: 245 WLMRSFSTTGA-LEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNG 421 W+M+ A G + +G L LS+Q LIDCS +GNN C+GG A+++I +G Sbjct: 280 WIMKHGGIASAETYGPYLGMTGSLQVLSQQMLIDCSWGFGNNACDGGEEWRAYEWIMKHG 339 Query: 422 GIDTEQTY-PYEGVDDKCRYNPKNTGAEDVASWTS 523 GI + +TY PY G++ C N A+ + S+T+ Sbjct: 340 GIASAETYGPYLGMNGFCHVNSSELTAQ-IQSYTN 373 Score = 51.6 bits (118), Expect = 2e-05 Identities = 24/51 (47%), Positives = 32/51 (62%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSST 657 + + ++ GD L A+ GPV+V+IDASH SF YS+GVY E C ST Sbjct: 369 QSYTNVTSGDALALKLALFKNGPVAVSIDASHRSFVFYSNGVYYEPACGST 419 Score = 49.6 bits (113), Expect = 7e-05 Identities = 30/79 (37%), Positives = 39/79 (49%) Frame = +3 Query: 18 
VSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVD 197 +SY LG+N D E TM G + N L F +V++PE +D Sbjct: 160 LSYTLGLNSLSDRTMSELA-TMRGRKQRKTTNAGLPFP--------FKLYQHVEVPESLD 210 Query: 198 WRKHGAVTDIKDQGKCGSC 254 WR +GAVT +KDQ CGSC Sbjct: 211 WRLYGAVTPVKDQAICGSC 229 >UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa|Rep: Os09g0497500 protein - Oryza sativa subsp. japonica (Rice) Length = 349 Score = 69.7 bits (163), Expect = 6e-11 Identities = 34/87 (39%), Positives = 51/87 (58%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS A+EG + ++G LVSLSEQ L+DC ++ GC GG M AF+++ N G+ TE Sbjct: 148 AFSAVAAIEGINQIKNGELVSLSEQELVDCDDEAV--GCGGGYMSWAFEFVVGNHGLTTE 205 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVASW 517 +YPY + C+ N A +A + Sbjct: 206 ASYPYHAANGACQAAKLNQSAVAIAGY 232 Score = 54.8 bits (126), Expect = 2e-06 Identities = 31/79 (39%), Positives = 42/79 (53%), Gaps = 2/79 (2%) Frame = +3 Query: 24 YKLGMNKYGDMLHHEFVKTMNGFNK--TAKHNKNLYMKGGSVRGAKFISPANVKLPEQVD 197 YKL NK+ D+ + EF M GF T N ++ G ++ LP+ VD Sbjct: 72 YKLADNKFADLTNEEFRAKMLGFRPHVTIPQISNTCSADIAMPGES----SDDILPKSVD 127 Query: 198 WRKHGAVTDIKDQGKCGSC 254 WRK GAV ++K+QG CGSC Sbjct: 128 WRKKGAVVEVKNQGDCGSC 146 Score = 36.3 bits (80), Expect = 0.66 Identities = 19/42 (45%), Positives = 23/42 (54%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVY 633 G+ ++ E L A A PVSVA+D FQLY SGVY Sbjct: 231 GYRNVTPSSEPDLARAAAAQ-PVSVAVDGGSFMFQLYGSGVY 271 >UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep: Silicatein beta - Suberites domuncula (Sponge) Length = 383 Score = 69.7 bits (163), Expect = 6e-11 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 5/86 (5%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS +LEG + G LV+LSEQN++DCS YGN+GC G ++ A Y+ +N G+DT Sbjct: 188 AFSAMASLEGINALSYGSLVTLSEQNIVDCSVTYGNHGCACGDVNRALLYVIENDGVDTW 247 Query: 437 QTY-----PYEGVDDKCRYNPKNTGA 499 + Y PY C+Y + GA Sbjct: 248 
KGYPSGGDPYRSKQYSCKYERQYRGA 273 Score = 62.5 bits (145), Expect = 9e-09 Identities = 30/53 (56%), Positives = 35/53 (66%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 RG V + GDE L+ AVA GPVSV +DA+ TSFQ YS GV N CSS+ L Sbjct: 276 RGIVSLASGDENTLLTAVANSGPVSVYVDATSTSFQFYSDGVLNVPYCSSSTL 328 Score = 50.4 bits (115), Expect = 4e-05 Identities = 31/89 (34%), Positives = 44/89 (49%), Gaps = 11/89 (12%) Frame = +3 Query: 18 VSYKLGMNKYGDMLHHEFVK---------TMNGFNKTAKHNKNLYMKGGS-VRGAKFISP 167 + Y L MNK+GD+ EF++ N + KH + ++ G VRG Sbjct: 97 LGYTLKMNKFGDLTTKEFIEGYHCVQDYQPTNASHLNKKHKTHAFVDYGDFVRGGTGEGV 156 Query: 168 ANV-KLPEQVDWRKHGAVTDIKDQGKCGS 251 V +PE +DWR G VT +KDQ +CGS Sbjct: 157 RGVGNMPETMDWRTSGVVTKVKDQLRCGS 185 >UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1; Brugia malayi|Rep: Cathepsin F-like cysteine proteinase - Brugia malayi (Filarial nematode worm) Length = 461 Score = 69.7 bits (163), Expect = 6e-11 Identities = 35/72 (48%), Positives = 45/72 (62%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS TG +E ++G L+SLSEQ LIDC + GCNGGL NAF+ IK GG++ E Sbjct: 274 AFSVTGNIESLWAIKTGKLISLSEQELIDCDVI--DKGCNGGLPINAFREIKRMGGLEPE 331 Query: 437 QTYPYEGVDDKC 472 YPYE + C Sbjct: 332 DQYPYEAKNGTC 343 Score = 44.4 bits (100), Expect = 0.002 Identities = 29/83 (34%), Positives = 35/83 (42%) Frame = +3 Query: 6 EMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLP 185 E G Y G K+ DM EF K M + N G + + LP Sbjct: 197 EKGTAIY--GATKFSDMTAEEFQKIMLPSIWWDRVESN-----GITFNLNDFNLSIYNLP 249 Query: 186 EQVDWRKHGAVTDIKDQGKCGSC 254 + DWR G VT +KDQG CGSC Sbjct: 250 SKFDWRTEGVVTPVKDQGSCGSC 272 >UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: Cysteine protease - Clonorchis sinensis Length = 328 Score = 69.7 bits (163), Expect = 6e-11 Identities = 33/75 (44%), Positives = 45/75 (60%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS G +EGQ FR++G L++LSEQ L+DC 
+ GCNGG + I+ GG++ Sbjct: 141 AFSVIGNVEGQWFRKTGDLLALSEQQLVDC--DHLEKGCNGGYPPKTYGEIEKMGGLELA 198 Query: 437 QTYPYEGVDDKCRYN 481 YPY GVD C N Sbjct: 199 SDYPYTGVDGICYMN 213 Score = 43.6 bits (98), Expect = 0.004 Identities = 16/23 (69%), Positives = 19/23 (82%) Frame = +3 Query: 186 EQVDWRKHGAVTDIKDQGKCGSC 254 E+ DWR+HGAV + DQGKCGSC Sbjct: 117 EKFDWREHGAVGPVLDQGKCGSC 139 >UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia circumcincta|Rep: Secreted cathepsin F - Teladorsagia circumcincta Length = 364 Score = 69.3 bits (162), Expect = 8e-11 Identities = 35/78 (44%), Positives = 46/78 (58%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS TG +EGQ F LVSLS Q L+DC + GCNGG +A+K I GG++ E Sbjct: 179 AFSVTGNIEGQWFLAKKKLVSLSAQQLLDCDVV--DEGCNGGFPLDAYKEIVRMGGLEPE 236 Query: 437 QTYPYEGVDDKCRYNPKN 490 YPYE ++CR P + Sbjct: 237 DKYPYEAKAEQCRLVPSD 254 Score = 46.4 bits (105), Expect = 6e-04 Identities = 25/74 (33%), Positives = 37/74 (50%) Frame = +3 Query: 33 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHG 212 G+N++ D+ EF KT + N + A+ + P LPE DWR+HG Sbjct: 109 GINQFADLSPEEFKKTHLPHTWKQPDHPNRIVD----LAAEGVDPKE-PLPESFDWREHG 163 Query: 213 AVTDIKDQGKCGSC 254 AVT +K +G C +C Sbjct: 164 AVTKVKTEGHCAAC 177 >UniRef50_Q22AB1 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 344 Score = 68.9 bits (161), Expect = 1e-10 Identities = 34/88 (38%), Positives = 52/88 (59%), Gaps = 1/88 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+TTG +E Q+ + G L+ SEQ L+DC N GC GGLM +A+++++ +GGI T Sbjct: 157 TFATTGVIESQYALKYGELLHFSEQMLLDCDNI--NQGCRGGLMTDAYQFLQQSGGIQTA 214 Query: 437 QTY-PYEGVDDKCRYNPKNTGAEDVASW 517 TY Y+ D C ++ A+ V W Sbjct: 215 DTYGDYKNKKDICNFDKAKVKAK-VVDW 241 Score = 45.6 bits (103), Expect = 0.001 Identities = 30/90 (33%), Positives = 44/90 
(48%), Gaps = 6/90 (6%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFN----KTAKHNKNLYMKGGSVRG--AKFIS 164 ++M + K G K+ DM EF M F+ K AK ++ + +K ++G + + Sbjct: 67 HQMENPNAKFGHTKFSDMSPEEFENKMLNFDFSLFKKAK-SQGIKLKAEPMKGYLRQGEN 125 Query: 165 PANVKLPEQVDWRKHGAVTDIKDQGKCGSC 254 N LPE DWR G +T K Q CGSC Sbjct: 126 VDNSDLPESFDWRDKGIITPAKFQNTCGSC 155 >UniRef50_Q7KWP5 Cluster: Similar to Dictyostelium discoideum (Slime mold). Cysteine proteinase 5; n=2; Dictyostelium discoideum|Rep: Similar to Dictyostelium discoideum (Slime mold). Cysteine proteinase 5 - Dictyostelium discoideum (Slime mold) Length = 345 Score = 68.5 bits (160), Expect = 1e-10 Identities = 38/97 (39%), Positives = 57/97 (58%), Gaps = 3/97 (3%) Frame = +2 Query: 263 STTGALEGQHF--RQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 + GA E HF +SLS QNLIDCS N C G ++ AF+YI +NGGID+E Sbjct: 148 TAVGATESAHFLANPKDPFISLSMQNLIDCSNL--NKQCYQGTVNEAFQYIIENGGIDSE 205 Query: 437 QTYPYEGVD-DKCRYNPKNTGAEDVASWTSPRATNRS 544 ++Y + G + KC+YN N+ A+ + S+ ++ + S Sbjct: 206 ESYKFSGGEPGKCKYNSSNSVAK-ITSYEKVKSGSES 241 Score = 50.4 bits (115), Expect = 4e-05 Identities = 25/48 (52%), Positives = 32/48 (66%) Frame = +1 Query: 520 IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 + G E L AV+ + PV+ IDAS +SFQ YSSG+Y E C+STDL Sbjct: 235 VKSGSESSLESAVS-LKPVAAYIDASLSSFQFYSSGIYYEPSCNSTDL 281 Score = 32.7 bits (71), Expect = 8.1 Identities = 21/75 (28%), Positives = 34/75 (45%), Gaps = 1/75 (1%) Frame = +3 Query: 30 LGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKH 209 L +N++ D+ + E+ K + +L + + K S + +DWRK Sbjct: 71 LALNEFADISNEEYRKNYLRNDNNINKLSSLLINDKEDKEIKSSSSSGSG-SSGIDWRKK 129 Query: 210 GAVTDIKDQ-GKCGS 251 GAV +K Q G CGS Sbjct: 130 GAVPSVKSQIGGCGS 144 >UniRef50_A3EXS2 Cluster: Cathepsin L-like cysteine proteinase-like protein; n=1; Maconellicoccus hirsutus|Rep: Cathepsin L-like cysteine proteinase-like protein - Maconellicoccus hirsutus (hibiscus mealybug) Length = 253 Score = 68.5 bits (160), Expect = 1e-10 Identities = 31/78 
(39%), Positives = 50/78 (64%), Gaps = 3/78 (3%) Frame = +2 Query: 257 SFSTTGALEGQH-FRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDT 433 +FS TGALE + + V LSEQNLI+CS +GN C+GG ++N +KY+ + GI+ Sbjct: 59 AFSVTGALESEKAIKYEAAPVKLSEQNLIECSGGFGNKRCSGGNLENTYKYVNHSRGIEK 118 Query: 434 EQTY--PYEGVDDKCRYN 481 E +Y + ++ +C+Y+ Sbjct: 119 EDSYRDNFRHINSRCQYD 136 >UniRef50_P25773 Cluster: Cathepsin L; n=38; Eukaryota|Rep: Cathepsin L - Felis silvestris catus (Cat) Length = 139 Score = 68.5 bits (160), Expect = 1e-10 Identities = 26/54 (48%), Positives = 37/54 (68%) Frame = +2 Query: 377 GGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKCRYNPKNTGAEDVASWTSPRATN 538 GGL+D+AF+Y+KDNGG+D+E++YPY D C+Y P+N+ A W P N Sbjct: 1 GGLIDDAFQYVKDNGGLDSEESYPYHAQGDSCKYRPENSVANVTDYWDIPSKEN 54 Score = 51.2 bits (117), Expect = 2e-05 Identities = 24/49 (48%), Positives = 32/49 (65%) Frame = +1 Query: 517 DIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 DIP E +LM +A VGP+S AIDAS +F+ Y G+Y + CSS D+ Sbjct: 48 DIPS-KENELMITLAAVGPISAAIDASLDTFRFYKEGIYYDPSCSSEDV 95 >UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromeliaceae|Rep: Fruit bromelain precursor - Ananas comosus (Pineapple) Length = 351 Score = 68.5 bits (160), Expect = 1e-10 Identities = 31/75 (41%), Positives = 45/75 (60%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 SF+ +EG + ++GYLVSLSEQ ++DC+ Y GC GG ++ A+ +I N G+ TE Sbjct: 149 SFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVSY---GCKGGWVNKAYDFIISNNGVTTE 205 Query: 437 QTYPYEGVDDKCRYN 481 + YPY C N Sbjct: 206 ENYPYLAYQGTCNAN 220 Score = 49.6 bits (113), Expect = 7e-05 Identities = 27/78 (34%), Positives = 40/78 (51%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 200 SY LG+N++ DM EFV G + + + V IS +P+ +DW Sbjct: 78 SYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVSFDDVN----ISA----VPQSIDW 129 Query: 201 RKHGAVTDIKDQGKCGSC 254 R +GAV ++K+Q CGSC Sbjct: 130 RDYGAVNEVKNQNPCGSC 147 >UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba culbertsoni|Rep: 
Cysteine proteinase - Acanthamoeba culbertsoni Length = 482 Score = 68.1 bits (159), Expect = 2e-10 Identities = 36/97 (37%), Positives = 56/97 (57%), Gaps = 1/97 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY-IKDNGGIDT 433 +F TGA+EG G LVSLS+Q L+DC+ GN GC+GG ++ +++ I +N + T Sbjct: 182 AFVATGAVEGVRKIAGGSLVSLSDQMLLDCAVGTGNQGCSGGNVEITYRWMISNNARLMT 241 Query: 434 EQTYPYEGVDDKCRYNPKNTGAEDVASWTSPRATNRS 544 + +YPY CRY P + G + + + RA + S Sbjct: 242 QASYPYIARQSTCRYVP-SQGVQGIRNIMRVRAGSES 277 Score = 47.6 bits (108), Expect = 3e-04 Identities = 24/53 (45%), Positives = 31/53 (58%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 R + + G E L+ A A + PV+VAID S SF YS G Y + CSST+L Sbjct: 266 RNIMRVRAGSESDLL-AKAAIAPVTVAIDGSKRSFMFYSGGYYYDPTCSSTNL 317 Score = 44.8 bits (101), Expect = 0.002 Identities = 23/84 (27%), Positives = 35/84 (41%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 + G ++ + MN++GD+ EF + G A + + Sbjct: 97 FNRGNHTFTVAMNEHGDLTPEEFARLYMGQVSPASEQELQERIAAESAMEDEHHHTRASI 156 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P DWR GAVT +K+QG C SC Sbjct: 157 PANWDWRTKGAVTPVKNQGSCASC 180 >UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; Entamoeba|Rep: Cysteine proteinase 2 precursor - Entamoeba histolytica Length = 315 Score = 68.1 bits (159), Expect = 2e-10 Identities = 32/81 (39%), Positives = 48/81 (59%), Gaps = 3/81 (3%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSG---YLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGI 427 +F + ALEG+ + G + LSE++++ C+ GNNGCNGGL N + YI ++ G+ Sbjct: 120 TFGSLAALEGRLLIEKGGDANTLDLSEEHMVQCTRDNGNNGCNGGLGSNVYDYIIEH-GV 178 Query: 428 DTEQTYPYEGVDDKCRYNPKN 490 E YPY G D C+ N K+ Sbjct: 179 AKESDYPYTGSDSTCKTNVKS 199 Score = 48.8 bits (111), Expect = 1e-04 Identities = 18/28 (64%), Positives = 22/28 (78%) Frame = +3 Query: 171 NVKLPEQVDWRKHGAVTDIKDQGKCGSC 254 N++ PE VDWRK G VT I+DQ +CGSC Sbjct: 91 NIQAPESVDWRKEGKVTPIRDQAQCGSC 118 Score = 44.4 bits (100), Expect 
= 0.002 Identities = 20/49 (40%), Positives = 30/49 (61%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSS 654 G+ +P +E +L A++ G V V+IDAS FQLY SG Y + +C + Sbjct: 205 GYTKVPRNNEAELKAALSQ-GLVDVSIDASSAKFQLYKSGAYTDTKCKN 252 >UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2; Entamoeba|Rep: Cysteine proteinase ACP1 precursor - Entamoeba histolytica Length = 308 Score = 68.1 bits (159), Expect = 2e-10 Identities = 31/73 (42%), Positives = 43/73 (58%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F TT LEG+ + G L S SEQ L+DC +NGC GG N+ K+I++N G+ E Sbjct: 117 TFCTTAVLEGRVNKDLGKLYSFSEQQLVDCDAS--DNGCEGGHPSNSLKFIQENNGLGLE 174 Query: 437 QTYPYEGVDDKCR 475 YPY+ V C+ Sbjct: 175 SDYPYKAVAGTCK 187 Score = 44.0 bits (99), Expect = 0.003 Identities = 20/46 (43%), Positives = 29/46 (63%), Gaps = 1/46 (2%) Frame = +1 Query: 520 IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSG-VYNEEECSS 654 + +G E L +A GPV+V +DAS SFQLY G +Y++ +C S Sbjct: 201 VTDGSETGLQTIIAENGPVAVGMDASRPSFQLYKKGTIYSDTKCRS 246 Score = 37.9 bits (84), Expect = 0.22 Identities = 25/73 (34%), Positives = 35/73 (47%) Frame = +3 Query: 36 MNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGA 215 +N + DM H EF++T G + +V+ A + A PE VDWR Sbjct: 57 LNVFADMTHEEFIQTHLGMTYEVPETTS------NVKAA--VKAA----PESVDWR--SI 102 Query: 216 VTDIKDQGKCGSC 254 + KDQG+CGSC Sbjct: 103 MNPAKDQGQCGSC 115 >UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativa|Rep: Os01g0347600 protein - Oryza sativa subsp. 
japonica (Rice) Length = 343 Score = 67.7 bits (158), Expect = 2e-10 Identities = 32/73 (43%), Positives = 41/73 (56%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+ A+EG ++G L LSEQ L+DC +NGC GG D AF+ + GGI E Sbjct: 151 AFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN--SNGCGGGHTDRAFELVASKGGITAE 208 Query: 437 QTYPYEGVDDKCR 475 Y YEG KCR Sbjct: 209 SDYRYEGFQGKCR 221 Score = 44.8 bits (101), Expect = 0.002 Identities = 27/75 (36%), Positives = 38/75 (50%) Frame = +3 Query: 30 LGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKH 209 +G+N++ D+ + EFV T G H K + + P + P +DWR Sbjct: 88 VGINQFADLTNDEFVATYTGAKPP--HPKE---------APRPVDP--IWTPCCIDWRFR 134 Query: 210 GAVTDIKDQGKCGSC 254 GAVT +KDQG CGSC Sbjct: 135 GAVTGVKDQGACGSC 149 Score = 40.7 bits (91), Expect = 0.031 Identities = 21/42 (50%), Positives = 28/42 (66%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVY 633 G+ +P DE++L AVA PV+V IDAS +FQ Y SGV+ Sbjct: 235 GYRAVPPNDERQLATAVARQ-PVTVYIDASGPAFQFYKSGVF 275 >UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 336 Score = 67.3 bits (157), Expect = 3e-10 Identities = 35/90 (38%), Positives = 51/90 (56%), Gaps = 3/90 (3%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDC---SEQYGNNGCNGGLMDNAFKYIKDNGGI 427 +FSTTG LE +F ++ +S SEQ L+DC S + + GC+GG + A KY+ GI Sbjct: 151 AFSTTGILEALYFMENRQKISFSEQQLVDCATNSNGFNSYGCSGGWPEEALKYVA-KFGI 209 Query: 428 DTEQTYPYEGVDDKCRYNPKNTGAEDVASW 517 E+ YPY VD KC+ + + V S+ Sbjct: 210 LKEEQYPYLAVDSKCKVSSPTSDGFKVQSF 239 Score = 52.4 bits (120), Expect = 9e-06 Identities = 25/78 (32%), Positives = 41/78 (52%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 200 +YKL N++ DM EF + + +N + + + +V+LP DW Sbjct: 73 TYKLAHNQFSDMPQEEFASRVL-MKSSQLIPRNAVQAQNNNSTTQQHTAQDVQLPASFDW 131 Query: 201 RKHGAVTDIKDQGKCGSC 254 R 
+G ++D+KDQG+CGSC Sbjct: 132 RDYGILSDVKDQGQCGSC 149 >UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 383 Score = 67.3 bits (157), Expect = 3e-10 Identities = 32/80 (40%), Positives = 52/80 (65%), Gaps = 1/80 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+T ++E Q+ + G LVSLSEQ ++DC + NNGC+GG A K++K+N G+++E Sbjct: 194 AFATVASVEAQNAIKKGKLVSLSEQEMVDCDGR--NNGCSGGYRPYAMKFVKEN-GLESE 250 Query: 437 QTYPYEGV-DDKCRYNPKNT 493 + YPY + D+C +T Sbjct: 251 KEYPYSALKHDQCFLKENDT 270 Score = 43.2 bits (97), Expect = 0.006 Identities = 26/75 (34%), Positives = 39/75 (52%) Frame = +3 Query: 30 LGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKH 209 L +N++ D E K + NK K++ + GS I PA++ DWR+ Sbjct: 125 LDVNEFTDWTDEELQKMVQE-NKYTKYDFDTPKFEGSYLETGVIRPASI------DWREQ 177 Query: 210 GAVTDIKDQGKCGSC 254 G +T IK+QG+CGSC Sbjct: 178 GKLTPIKNQGQCGSC 192 >UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 360 Score = 66.9 bits (156), Expect = 4e-10 Identities = 32/75 (42%), Positives = 48/75 (64%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FST ++EG +F ++G L SLS Q +IDC + +GC GG + AF+ I++NGGI TE Sbjct: 157 AFSTVQSIEGLYFLKTGKLESLSTQQVIDCC-RIDESGCLGGDPEPAFRCIQNNGGIMTE 215 Query: 437 QTYPYEGVDDKCRYN 481 YPY C+++ Sbjct: 216 TEYPYIAKQQSCKFD 230 Score = 45.2 bits (102), Expect = 0.001 Identities = 27/80 (33%), Positives = 39/80 (48%) Frame = +3 Query: 15 LVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQV 194 LV K+G+N++ D+ H EF G KH+K+ + + P + LP Sbjct: 83 LVFSKVGVNQFADLTHEEFKALYTGH----KHSKD--DDDDDNKNKQPHLPTD-NLPASF 135 Query: 195 DWRKHGAVTDIKDQGKCGSC 254 DWR GA+T +K Q CG C Sbjct: 136 DWRDKGAITPVKVQNGCGGC 155 Score = 39.5 bits (88), Expect 
= 0.071 Identities = 17/46 (36%), Positives = 30/46 (65%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEE 645 G++D+P +Q ++A + P+S+ +++S TSF+ Y SGV E E Sbjct: 240 GYIDVPS--DQSQVKAALLIQPLSICLNSSDTSFKYYKSGVITECE 283 >UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza sativa|Rep: Putative cysteine protease - Oryza sativa subsp. japonica (Rice) Length = 357 Score = 66.9 bits (156), Expect = 4e-10 Identities = 32/74 (43%), Positives = 44/74 (59%), Gaps = 1/74 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNN-GCNGGLMDNAFKYIKDNGGIDT 433 +F+ A+EG ++G L LSEQ L+DC + G++ GC GG D AF+ + D GGI Sbjct: 159 AFAAVAAMEGLMKIRTGQLTPLSEQELVDCVDGGGDSDGCGGGHTDAAFQLVVDKGGITA 218 Query: 434 EQTYPYEGVDDKCR 475 E Y YEG +CR Sbjct: 219 ESEYRYEGYKGRCR 232 Score = 40.7 bits (91), Expect = 0.031 Identities = 23/72 (31%), Positives = 36/72 (50%) Frame = +3 Query: 36 MNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGA 215 +N++ D+ + EFV T G + + + + P + +P +DWR GA Sbjct: 90 INQFADLTNGEFVATYTGVKQPPPAT---HPHPHPEEAPRPVDP--IWMPCCIDWRFKGA 144 Query: 216 VTDIKDQGKCGS 251 VT +KDQG CGS Sbjct: 145 VTGVKDQGACGS 156 Score = 38.7 bits (86), Expect = 0.12 Identities = 19/42 (45%), Positives = 27/42 (64%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVY 633 G+ +P DE++L AVA PV+ +DAS +FQ Y SGV+ Sbjct: 246 GYRAVPPADERQLATAVARQ-PVTAYVDASGPAFQFYGSGVF 286 >UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadidae|Rep: Cysteine protease - Tritrichomonas foetus (Trichomonas foetus) Length = 315 Score = 66.9 bits (156), Expect = 4e-10 Identities = 32/51 (62%), Positives = 38/51 (74%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 F+ I E DE+ L V T GPV+VAIDASH SFQLY SG+Y+E ECS+T L Sbjct: 211 FLYIAENDEEDLAANVETHGPVAVAIDASHQSFQLYKSGIYDEPECSATFL 261 Score = 60.5 bits (140), Expect = 4e-08 Identities = 33/76 (43%), Positives = 44/76 (57%), Gaps = 2/76 (2%) Frame = +2 Query: 257 
SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKD--NGGID 430 +FS A E + +G L S SEQNL+DC + G GC+GGLMD A+KYI D G + Sbjct: 126 AFSAIQAAESAYAISTGTLESYSEQNLVDCVQ--GCYGCSGGLMDYAYKYIIDRQKGKMI 183 Query: 431 TEQTYPYEGVDDKCRY 478 E Y Y +D C++ Sbjct: 184 LESDYVYTALDGVCKF 199 Score = 41.5 bits (93), Expect = 0.018 Identities = 15/42 (35%), Positives = 26/42 (61%) Frame = +3 Query: 186 EQVDWRKHGAVTDIKDQGKCGSCGPSARLELWKDSTSVSPAT 311 + +DWR+ G V +IKDQ CGSC + ++ + + ++S T Sbjct: 102 DSIDWREKGVVNEIKDQAACGSCWAFSAIQAAESAYAISTGT 143 >UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2; Arabidopsis thaliana|Rep: Putative cysteine proteinase - Arabidopsis thaliana (Mouse-ear cress) Length = 365 Score = 66.1 bits (154), Expect = 7e-10 Identities = 30/59 (50%), Positives = 38/59 (64%) Frame = +2 Query: 311 LVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKCRYNPK 487 L++LSEQ LIDC + N GCNGG + AFKYI NGG+ E YPY+ + CR N + Sbjct: 192 LLTLSEQQLIDCDIEK-NGGCNGGEFEEAFKYIIKNGGVSLETEYPYQVKKESCRANAR 249 Score = 45.6 bits (103), Expect = 0.001 Identities = 28/79 (35%), Positives = 38/79 (48%) Frame = +3 Query: 9 MGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPE 188 MG SY LG+N++ D EF+ T G L+ K R +S +++ E Sbjct: 75 MGNQSYTLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSRNWN-MSDIDME-DE 132 Query: 189 QVDWRKHGAVTDIKDQGKC 245 DWR GAVT +K QG C Sbjct: 133 SKDWRDEGAVTPVKYQGAC 151 Score = 41.1 bits (92), Expect = 0.023 Identities = 23/50 (46%), Positives = 29/50 (58%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSS 654 RGF +P +E+ L+EAV PVSV IDA SF Y GVY +C + Sbjct: 257 RGFQMVPSHNERALLEAVRRQ-PVSVLIDARADSFGHYKGGVYAGLDCGT 305 >UniRef50_Q23RT7 Cluster: Papain family cysteine protease containing protein; n=7; Hymenostomatida|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 387 Score = 66.1 bits (154), Expect = 7e-10 Identities = 35/78 (44%), Positives = 44/78 (56%), Gaps = 1/78 (1%) Frame = +3 Query: 24 
YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK-LPEQVDW 200 YK G+N++ D E +T G++KT K+ N K R K NVK LP+ VDW Sbjct: 83 YKKGINQFTDRTAEELRETTLGYSKTVKNAAN---KQNMFRNLKTSDKINVKDLPKSVDW 139 Query: 201 RKHGAVTDIKDQGKCGSC 254 R G VT +KDQG CGSC Sbjct: 140 RDAGVVTPVKDQGHCGSC 157 Score = 44.0 bits (99), Expect = 0.003 Identities = 25/83 (30%), Positives = 42/83 (50%), Gaps = 7/83 (8%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQY----GNNGCNGGLMDNAFKYIKDNGG 424 +F+TT +E +G L +LS Q L+ C + G GCNG + + A+ Y++ G Sbjct: 159 AFATTAVIESYAAIATGQLKTLSTQQLVSCVQNSYQCGGQGGCNGAVSELAYNYVQ-LFG 217 Query: 425 IDTEQTY---PYEGVDDKCRYNP 484 + +E Y Y+G C ++P Sbjct: 218 LTSEYKYSYSSYQGQTGNCTFDP 240 Score = 44.0 bits (99), Expect = 0.003 Identities = 20/43 (46%), Positives = 30/43 (69%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYN 636 G++ +PE D LM AVAT GP+ +++DAS +F Y SGV++ Sbjct: 251 GYLKVPENDYASLMNAVATQGPLVISVDAS--NFHDYESGVFH 291 >UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2); n=2; Tribolium castaneum|Rep: PREDICTED: similar to Cathepsin K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2) - Tribolium castaneum Length = 332 Score = 65.7 bits (153), Expect = 9e-10 Identities = 31/74 (41%), Positives = 45/74 (60%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+T GA+E + + +SLSEQ L+DC + G GC GG + A+ YI N G++ Sbjct: 144 AFATIGAIESHYKIRHKRAISLSEQQLVDCVGRGG--GCGGGWIPTAYSYIARNKGVNYN 201 Query: 437 QTYPYEGVDDKCRY 478 + YPY G + KCRY Sbjct: 202 RDYPYLGRNGKCRY 215 Score = 43.2 bits (97), Expect = 0.006 Identities = 19/39 (48%), Positives = 24/39 (61%) Frame = +1 Query: 532 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEEC 648 +E+++ VAT GPVSVAI +F Y SGVYN C Sbjct: 235 NEERVRRLVATKGPVSVAIHVDSRTFHKYKSGVYNNPSC 273 Score = 42.7 bits (96), Expect = 0.008 Identities = 22/84 (26%), Positives = 40/84 (47%) Frame = +3 Query: 3 
YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 + G +Y++G+NK+ D E + + G + + L + + + Sbjct: 65 FRNGSETYEMGVNKFSDFTDEE-LSNLTGLQVPLEFEQPL-----NETEDPLLPSLGRGI 118 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 +DWR+ G VT +K+QG+CGSC Sbjct: 119 SASLDWRQRGGVTPVKNQGQCGSC 142 >UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_184, whole genome shotgun sequence - Paramecium tetraurelia Length = 331 Score = 65.3 bits (152), Expect = 1e-09 Identities = 31/75 (41%), Positives = 44/75 (58%), Gaps = 3/75 (4%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQ---YGNNGCNGGLMDNAFKYIKDNGGI 427 +F+ A+E V++SEQ +DC+ + Y + GCNGG MD+AF Y N G+ Sbjct: 141 AFAAAAAIEAGFQHHKKNKVNISEQEFVDCTTEKLGYESQGCNGGWMDDAFDYTV-NYGV 199 Query: 428 DTEQTYPYEGVDDKC 472 TE+ YPY+GVD C Sbjct: 200 TTEEEYPYKGVDQPC 214 Score = 41.9 bits (94), Expect = 0.013 Identities = 26/84 (30%), Positives = 39/84 (46%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 +E+G ++ LGMN+Y D+ EF + + KN+ G + Sbjct: 71 FELGQETFTLGMNQYADLTPEEFQASFLTLKTKVQDRKNVKSYSG------------LSF 118 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P+ VDW K G +K+QG CGSC Sbjct: 119 PDTVDW-KDGLT--VKNQGSCGSC 139 Score = 38.3 bits (85), Expect = 0.16 Identities = 22/49 (44%), Positives = 27/49 (55%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSST 657 FVD+ L EA+A PV+VAI A FQLYS GVY+ + T Sbjct: 227 FVDVEPLSSDALHEAIAKT-PVAVAIKADGILFQLYSGGVYSRSCTAKT 274 >UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa zeasingle nucleocapsid nuclear polyhedrosis virus) Length = 367 Score = 65.3 bits (152), Expect = 1e-09 Identities = 31/72 (43%), Positives = 44/72 (61%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F G +E Q+ + L+ LSEQ L+DC E + GCNGGLM AF+ + GG++TE 
Sbjct: 182 AFVAIGNIESQYAIRHNKLIDLSEQQLLDCDEV--DLGCNGGLMHLAFQELLLMGGVETE 239 Query: 437 QTYPYEGVDDKC 472 YPY+G + C Sbjct: 240 ADYPYQGSEQMC 251 Score = 49.6 bits (113), Expect = 7e-05 Identities = 28/78 (35%), Positives = 40/78 (51%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 200 S + G+NK+ D E + + GF + L + V+GA +++LP+ DW Sbjct: 109 SAQFGVNKFSDKTPDEVLHSNTGFFLNLSQHYTL-CENRIVKGAP-----DIRLPDYYDW 162 Query: 201 RKHGAVTDIKDQGKCGSC 254 R VT IKDQG CGSC Sbjct: 163 RDTNKVTPIKDQGVCGSC 180 >UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep: Viral cathepsin - Xestia c-nigrum granulosis virus (XnGV) (Xestia c-nigrumgranulovirus) Length = 346 Score = 65.3 bits (152), Expect = 1e-09 Identities = 33/73 (45%), Positives = 42/73 (57%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS +E + + + LSEQ L+DC + NNGCNGGLM AF+ I GGI E Sbjct: 159 AFSAVANIESLYHIKHNVSLDLSEQQLVDCDKV--NNGCNGGLMSWAFEGIIRAGGISYE 216 Query: 437 QTYPYEGVDDKCR 475 YPY GVD C+ Sbjct: 217 APYPYTGVDGVCK 229 Score = 37.1 bits (82), Expect = 0.38 Identities = 13/26 (50%), Positives = 18/26 (69%) Frame = +3 Query: 177 KLPEQVDWRKHGAVTDIKDQGKCGSC 254 K+P+ DWR +VT +K Q +CGSC Sbjct: 132 KVPDSFDWRDRNSVTSVKMQKECGSC 157 >UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 64.9 bits (151), Expect = 2e-09 Identities = 30/74 (40%), Positives = 49/74 (66%), Gaps = 2/74 (2%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSE-QYGNNGCNGGLMDNAFKYIKDNGGIDT 433 +F++TGALEG + ++G L S Q ++DC++ Q+ GC+GG F ++K+N G++ Sbjct: 153 AFASTGALEGLYQIKTGKLEVFSPQYIVDCAKHQFSRGGCHGGYSSGVFTFVKEN-GMNL 211 Query: 434 EQTYPYEGVD-DKC 472 E YPY+G + DKC Sbjct: 212 ESRYPYKGEENDKC 225 Score = 54.0 bits (124), Expect = 3e-06 Identities = 30/75 (40%), Positives = 42/75 (56%), Gaps = 1/75 (1%) Frame = +3 
Query: 33 GMNKYGDMLHHEF-VKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKH 209 G+NK+ + EF K +N + A MK S+ ++ + KLPE VDWRK Sbjct: 84 GINKFSHLTKEEFKAKYLNRPQRPASE-----MKTNSILSSQ--QKTDEKLPESVDWRKL 136 Query: 210 GAVTDIKDQGKCGSC 254 GAV+ ++DQG CGSC Sbjct: 137 GAVSPVRDQGNCGSC 151 >UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum|Rep: Falcipain 2 - Plasmodium falciparum Length = 484 Score = 64.9 bits (151), Expect = 2e-09 Identities = 30/65 (46%), Positives = 46/65 (70%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS+ G++E Q+ + L++LSEQ L+DCS + N GCNGGL++NAF+ + + GGI + Sbjct: 287 AFSSIGSVESQYAIRKNKLITLSEQELVDCS--FKNYGCNGGLINNAFEDMIELGGICPD 344 Query: 437 QTYPY 451 YPY Sbjct: 345 GDYPY 349 Score = 42.7 bits (96), Expect = 0.008 Identities = 25/79 (31%), Positives = 35/79 (44%), Gaps = 2/79 (2%) Frame = +3 Query: 24 YKLGMNKYGDMLHHEFVKTMNGF--NKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVD 197 YK +N++ D+ +HEF +K K++K L + K D Sbjct: 207 YKKELNRFADLTYHEFKNKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYRGEENFDHAAYD 266 Query: 198 WRKHGAVTDIKDQGKCGSC 254 WR H VT +KDQ CGSC Sbjct: 267 WRLHSGVTPVKDQKNCGSC 285 >UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovirus|Rep: Viral cathepsin - Spodoptera litura multicapsid nucleopolyhedrovirus (SpltMNPV) Length = 337 Score = 64.5 bits (150), Expect = 2e-09 Identities = 31/76 (40%), Positives = 45/76 (59%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+ G +E Q+ L+ LSEQ L+DC + GC+GGLM AF+ I GG++ E Sbjct: 152 AFAAIGNIESQYAIMHDSLIDLSEQQLLDCDRV--DQGCDGGLMHLAFQEIIRIGGVEHE 209 Query: 437 QTYPYEGVDDKCRYNP 484 YPY+G++ CR P Sbjct: 210 IDYPYQGIEYACRLAP 225 Score = 42.7 bits (96), Expect = 0.008 Identities = 22/74 (29%), Positives = 35/74 (47%) Frame = +3 Query: 33 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHG 212 G+NK+ D+ FV G ++ + + ++ + + PE DWRK Sbjct: 77 GINKFSDIDKITFVNEHAGLVSNLINSTDSNFDPYRLCEYVTVAGPSARTPESFDWRKLN 136 Query: 213 AVTDIKDQGKCGSC 254 VT 
+K+QG CGSC Sbjct: 137 KVTKVKEQGVCGSC 150 >UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin S preproprotein; n=2; Tribolium castaneum|Rep: PREDICTED: similar to cathepsin S preproprotein - Tribolium castaneum Length = 525 Score = 64.1 bits (149), Expect = 3e-09 Identities = 29/76 (38%), Positives = 43/76 (56%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+ GA E + +Q G V LSEQ L+DC + G C G +D ++YI ++ GI+ + Sbjct: 61 AFAILGATEAHYRKQRGSFVILSEQQLVDCVREVGT--CKGVWLDEVYEYIINSNGINYD 118 Query: 437 QTYPYEGVDDKCRYNP 484 Q Y YE CR+ P Sbjct: 119 QDYRYESAPGSCRFKP 134 Score = 53.6 bits (123), Expect = 4e-06 Identities = 26/74 (35%), Positives = 39/74 (52%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+ GA E Q+ G V LSEQ L+DC + + C G + +KYI + GI+ + Sbjct: 337 AFAIIGATEAQYRIHRGSFVILSEQQLVDCVREVSS--CRGVYLHETYKYIVKSEGINYD 394 Query: 437 QTYPYEGVDDKCRY 478 Q Y Y+ CR+ Sbjct: 395 QDYRYQSAPGTCRF 408 Score = 44.8 bits (101), Expect = 0.002 Identities = 17/25 (68%), Positives = 19/25 (76%) Frame = +3 Query: 180 LPEQVDWRKHGAVTDIKDQGKCGSC 254 LP+ VDWR G VT +K QGKCGSC Sbjct: 35 LPDMVDWRLQGVVTPVKRQGKCGSC 59 Score = 43.6 bits (98), Expect = 0.004 Identities = 16/25 (64%), Positives = 19/25 (76%) Frame = +3 Query: 180 LPEQVDWRKHGAVTDIKDQGKCGSC 254 LP+ VDWR G VT +K QGKCG+C Sbjct: 311 LPKMVDWRLRGVVTPVKHQGKCGTC 335 Score = 37.5 bits (83), Expect = 0.28 Identities = 16/46 (34%), Positives = 25/46 (54%) Frame = +1 Query: 520 IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSST 657 + E E+ L VA +GP +V+ DA + + YS G+Y C+ T Sbjct: 147 LAEISEEDLQWIVAKIGPATVSFDARGSQLKSYSGGIYYNRTCTKT 192 Score = 36.3 bits (80), Expect = 0.66 Identities = 16/39 (41%), Positives = 23/39 (58%) Frame = +1 Query: 535 EQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECS 651 E+ L VA VGPV+V+ D F+ YS GV+ + C+ Sbjct: 428 EEDLQWIVANVGPVTVSFDGRGKQFKSYSGGVFYNKTCT 466 >UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schistosoma japonicum|Rep: Clone 
ZZD209 mRNA sequence - Schistosoma japonicum (Blood fluke) Length = 339 Score = 64.1 bits (149), Expect = 3e-09 Identities = 29/84 (34%), Positives = 47/84 (55%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+ T + E Q+ + ++LS Q IDC+ YGN GC+GG F Y++ + G++TE Sbjct: 147 AFAVTASTESQYALHTSNHMNLSVQQFIDCTRIYGNMGCHGGYTFTLFIYLQ-SFGLETE 205 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDV 508 Q YP+ G D C N + + + Sbjct: 206 QMYPFTGEDQDCMANSSDVVVQSI 229 >UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|Rep: Cruzipain precursor - Trypanosoma cruzi Length = 467 Score = 64.1 bits (149), Expect = 3e-09 Identities = 35/86 (40%), Positives = 52/86 (60%), Gaps = 5/86 (5%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYI--KDNGGID 430 +FS G +E Q F L +LSEQ L+ C + ++GC+GGLM+NAF++I ++NG + Sbjct: 149 AFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKT--DSGCSGGLMNNAFEWIVQENNGAVY 206 Query: 431 TEQTYPY---EGVDDKCRYNPKNTGA 499 TE +YPY EG+ C + GA Sbjct: 207 TEDSYPYASGEGISPPCTTSGHTVGA 232 Score = 47.2 bits (107), Expect = 4e-04 Identities = 25/74 (33%), Positives = 34/74 (45%) Frame = +3 Query: 33 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHG 212 G+ + D+ EF ++ HN + R + V P VDWR G Sbjct: 82 GVTPFSDLTREEF--------RSRYHNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARG 133 Query: 213 AVTDIKDQGKCGSC 254 AVT +KDQG+CGSC Sbjct: 134 AVTAVKDQGQCGSC 147 Score = 33.5 bits (73), Expect = 4.6 Identities = 18/41 (43%), Positives = 28/41 (68%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGV 630 G V++P+ DE ++ +A GPV+VA+DAS S+ Y+ GV Sbjct: 236 GHVELPQ-DEAQIAAWLAVNGPVAVAVDAS--SWMTYTGGV 273 >UniRef50_Q23FS6 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 330 Score = 63.7 bits (148), Expect = 4e-09 Identities = 35/75 (46%), Positives = 46/75 (61%), Gaps = 3/75 (4%) Frame = +2 Query: 257 
SFSTTGALEGQHFRQSGYL---VSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGI 427 +FSTTG++E +GY + LSEQ L+DCS N GC GG MDNAF+YI+++ + Sbjct: 143 AFSTTGSVESALII-AGYANQTIDLSEQQLVDCSAT--NYGCGGGWMDNAFEYIEES-PL 198 Query: 428 DTEQTYPYEGVDDKC 472 T YPY VD C Sbjct: 199 TTNSNYPYVAVDQAC 213 Score = 37.1 bits (82), Expect = 0.38 Identities = 15/31 (48%), Positives = 20/31 (64%) Frame = +3 Query: 162 SPANVKLPEQVDWRKHGAVTDIKDQGKCGSC 254 SP+ K V+W G V+ +KDQG+CGSC Sbjct: 111 SPSTPKGQYDVNWVTRGKVSAVKDQGQCGSC 141 >UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted, possible transmembrane domain near N-terminus; n=4; Cryptosporidium|Rep: Cryptopain-cysteine proteinase secreted, possible transmembrane domain near N-terminus - Cryptosporidium parvum Iowa II Length = 401 Score = 62.9 bits (146), Expect = 7e-09 Identities = 33/73 (45%), Positives = 41/73 (56%), Gaps = 1/73 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGY-LVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDT 433 +FS ALEG Q+ L SLSEQ +DCS+Q GN GC+GG M AF+Y N + T Sbjct: 202 AFSAVAALEGATCAQTNRGLPSLSEQQFVDCSKQNGNFGCDGGTMGLAFQYAIKNKYLCT 261 Query: 434 EQTYPYEGVDDKC 472 YPY + C Sbjct: 262 NDDYPYFAEEKTC 274 Score = 50.0 bits (114), Expect = 5e-05 Identities = 25/78 (32%), Positives = 42/78 (53%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 200 SY L MN++GD+ EF+ G+ K +K ++ ++ K V ++ S P ++W Sbjct: 126 SYVLEMNEFGDLSKEEFMARFTGYIKDSKDDERVF-KSSRVSASE--SEEEFVPPNSINW 182 Query: 201 RKHGAVTDIKDQGKCGSC 254 + G V I++Q CGSC Sbjct: 183 VEAGCVNPIRNQKNCGSC 200 Score = 38.3 bits (85), Expect = 0.16 Identities = 17/31 (54%), Positives = 21/31 (67%) Frame = +1 Query: 544 LMEAVATVGPVSVAIDASHTSFQLYSSGVYN 636 L A+A GP+SVAI A T FQ Y SGV++ Sbjct: 301 LKTALAKYGPISVAIQADQTPFQFYKSGVFD 331 >UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 392 Score = 62.9 bits (146), Expect = 7e-09 Identities = 33/86 (38%), Positives = 48/86 (55%), Gaps = 5/86 (5%) Frame = +2 
Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCS-----EQYGNNGCNGGLMDNAFKYIKDNG 421 +F+T GA+E HF Q G L++L+EQ L+DC+ +GNNGC GG AF ++K G Sbjct: 203 AFATAGAVEAAHFIQKGELLNLAEQQLLDCTWSTPGVYHGNNGCLGGWTWKAFSWVKKFG 262 Query: 422 GIDTEQTYPYEGVDDKCRYNPKNTGA 499 T+ Y G + C+ + GA Sbjct: 263 IATTKSYGHYRGQEGFCKTSNLTVGA 288 Score = 44.0 bits (99), Expect = 0.003 Identities = 25/77 (32%), Positives = 37/77 (48%) Frame = +3 Query: 24 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 203 YKL N + D+ EF + +K N + + + S ++P+Q+DWR Sbjct: 129 YKLEPNHFADLTDDEFKSYKGALDDESKDVMNDH--DDVIDDDR--SKRMFEVPDQLDWR 184 Query: 204 KHGAVTDIKDQGKCGSC 254 +GAV K QG CGSC Sbjct: 185 NYGAVNPAKGQGTCGSC 201 Score = 33.1 bits (72), Expect = 6.1 Identities = 12/37 (32%), Positives = 26/37 (70%) Frame = +1 Query: 544 LMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSS 654 L +A++ GP +++I+A+ S + YS G+ +++ CS+ Sbjct: 304 LKKALSYHGPATISINANPKSLKFYSDGIMSDKHCSN 340 >UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrovirus|Rep: Viral cathepsin - Lymantria dispar multicapsid nuclear polyhedrosis virus (LdMNPV) Length = 356 Score = 62.9 bits (146), Expect = 7e-09 Identities = 30/72 (41%), Positives = 43/72 (59%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+T ++E Q + L+ LSEQ LIDC + GCNGGL+ AF+ I GG+ TE Sbjct: 170 AFATLASVESQFAMRHNRLIDLSEQQLIDCDSV--DMGCNGGLLHTAFEEIMRMGGVQTE 227 Query: 437 QTYPYEGVDDKC 472 YP+ G + +C Sbjct: 228 LDYPFVGRNRRC 239 Score = 37.1 bits (82), Expect = 0.38 Identities = 16/32 (50%), Positives = 19/32 (59%) Frame = +3 Query: 177 KLPEQVDWRKHGAVTDIKDQGKCGSCGPSARL 272 K P DWR+ VT IK+QG CG+C A L Sbjct: 143 KGPLHFDWREQNKVTSIKNQGACGACWAFATL 174 >UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza sativa|Rep: Os09g0381400 protein - Oryza sativa subsp. 
japonica (Rice) Length = 362 Score = 62.5 bits (145), Expect = 9e-09 Identities = 30/72 (41%), Positives = 41/72 (56%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F T +E + ++G LVSLSEQ L+DC G GCN G A+K++ +NGG+ TE Sbjct: 171 AFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDG--GCNLGSYGRAYKWVVENGGLTTE 228 Query: 437 QTYPYEGVDDKC 472 YPY C Sbjct: 229 ADYPYTARRGPC 240 Score = 41.5 bits (93), Expect = 0.018 Identities = 26/83 (31%), Positives = 38/83 (45%), Gaps = 2/83 (2%) Frame = +3 Query: 12 GLVSYKLGMNKYGDMLHHEFVKTMNGFNK-TAKHNKNLYMKGGSVRGAKFISPANVKLPE 188 G ++Y+L N++ D+ EF+ T G+ + ++ G A F V +P Sbjct: 89 GDLTYQLAENEFADLTEEEFLATYTGYYAGDGPVDDSVITTGAGDVDASF--SYRVDVPA 146 Query: 189 QVDWRKHGAVTDIKDQ-GKCGSC 254 VDWR GAV K Q C SC Sbjct: 147 SVDWRAQGAVVPPKSQTSTCSSC 169 >UniRef50_Q26988 Cluster: Cysteine proteinase 4; n=1; Tritrichomonas foetus|Rep: Cysteine proteinase 4 - Tritrichomonas foetus (Trichomonas foetus) Length = 152 Score = 62.1 bits (144), Expect = 1e-08 Identities = 27/52 (51%), Positives = 37/52 (71%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 GF+ + E+ L + VA+VGP++V IDAS SF YSSG+YN+ +CSST L Sbjct: 86 GFMSVQAQSEEDLFKCVASVGPIAVCIDASLASFNSYSSGIYNDRQCSSTVL 137 Score = 61.3 bits (142), Expect = 2e-08 Identities = 33/79 (41%), Positives = 47/79 (59%), Gaps = 3/79 (3%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIK--DNGGID 430 +F+TT +E + + L S SEQNL+DC Q +NGC GG +AF +I NG I+ Sbjct: 1 AFATTQCMESINALRFKSLFSFSEQNLVDCDPQ--SNGCAGGSPFSAFMFISRTQNGQIN 58 Query: 431 TEQTYPYEGVD-DKCRYNP 484 E YPY G D + C+++P Sbjct: 59 LEDDYPYTGTDTNDCKFDP 77 >UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foetus|Rep: TFCP2 protein - Tritrichomonas foetus (Trichomonas foetus) Length = 270 Score = 62.1 bits (144), Expect = 1e-08 Identities = 34/99 (34%), Positives = 51/99 (51%), Gaps = 3/99 (3%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDC-SEQYGNNGCNGGLMDNAFKYI--KDNGGI 427 +FS A E H +G 
L+ SEQ+L+DC + Y GC+GG D A KY+ + NG Sbjct: 76 AFSAIAAQESCHAIATGELLRFSEQSLVDCVTSDYSCQGCSGGWPDQAMKYVIEQQNGKF 135 Query: 428 DTEQTYPYEGVDDKCRYNPKNTGAEDVASWTSPRATNRS 544 E+ Y Y G C Y+ K+ + VA P++ ++ Sbjct: 136 ILEENYQYSGHKGACLYDEKSKVSNIVAVTMFPQSDEQN 174 Score = 47.2 bits (107), Expect = 4e-04 Identities = 20/37 (54%), Positives = 24/37 (64%) Frame = +1 Query: 523 PEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVY 633 P+ DEQ L +A GPVS +DA H SFQLY G+Y Sbjct: 168 PQSDEQNLKGHIAANGPVSCNVDAGHYSFQLYQGGIY 204 Score = 37.9 bits (84), Expect = 0.22 Identities = 14/24 (58%), Positives = 15/24 (62%) Frame = +3 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P DWR G V IK+QG CGSC Sbjct: 51 PTSFDWRSEGKVNPIKNQGSCGSC 74 >UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; Leishmania|Rep: Cysteine proteinase 2 precursor - Leishmania pifanoi Length = 444 Score = 62.1 bits (144), Expect = 1e-08 Identities = 31/67 (46%), Positives = 42/67 (62%), Gaps = 2/67 (2%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYI--KDNGGID 430 +FS G +EGQ + LVSLSEQ L+ C + N+GC+GGLM AF ++ NG + Sbjct: 152 AFSAVGNIEGQWYLAGHELVSLSEQQLVSCDDM--NDGCDGGLMLQAFDWLLQNTNGHLH 209 Query: 431 TEQTYPY 451 TE +YPY Sbjct: 210 TEDSYPY 216 Score = 48.0 bits (109), Expect = 2e-04 Identities = 29/80 (36%), Positives = 41/80 (51%), Gaps = 4/80 (5%) Frame = +3 Query: 27 KLGMNKYGDMLHHEFV-KTMNG---FNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQV 194 + G+ K+ D+ EF + +NG F +H Y K + A +P+ V Sbjct: 80 QFGITKFFDLSEAEFAARYLNGAAYFAAAKRHAAQHYRKARADLSA---------VPDAV 130 Query: 195 DWRKHGAVTDIKDQGKCGSC 254 DWR+ GAVT +KDQG CGSC Sbjct: 131 DWREKGAVTPVKDQGACGSC 150 >UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sativa|Rep: OSJNBb0085F13.15 protein - Oryza sativa subsp. 
japonica (Rice) Length = 383 Score = 61.7 bits (143), Expect = 2e-08 Identities = 28/73 (38%), Positives = 43/73 (58%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+ A+E H + G L+SLSEQ L+DC + G C+ G D+AF ++ N GI ++ Sbjct: 186 AFAAVAAIESLHKIKGGDLISLSEQELVDCDDT-GEATCSKGYSDDAFLWVSKNKGIASD 244 Query: 437 QTYPYEGVDDKCR 475 YPY G + C+ Sbjct: 245 LIYPYVGHKESCK 257 Score = 52.8 bits (121), Expect = 7e-06 Identities = 31/92 (33%), Positives = 43/92 (46%), Gaps = 11/92 (11%) Frame = +3 Query: 12 GLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLY-----------MKGGSVRGAKF 158 G +++KLG + D+ H EF+ T G + + + G V GA Sbjct: 94 GSLTFKLGETPFTDLTHEEFLATYTGDVRLPPERRGMQDDSDEEDAVITTSAGYVAGAG- 152 Query: 159 ISPANVKLPEQVDWRKHGAVTDIKDQGKCGSC 254 V +PE VDWRK GAVT K QG+C +C Sbjct: 153 AGRRTVAVPESVDWRKEGAVTPAKHQGQCAAC 184 Score = 35.9 bits (79), Expect = 0.87 Identities = 24/59 (40%), Positives = 30/59 (50%), Gaps = 1/59 (1%) Frame = +1 Query: 490 HRC*GRGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLY-SSGVYNEEECSSTDL 663 H RG V +PE E +M AVA PV+V DA FQ Y +GVY ST++ Sbjct: 264 HNATVRGVVTLPENREDLIMAAVAR-QPVAVVFDAGDPLFQNYRGNGVYKGGTGCSTNV 321 >UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sativa|Rep: Cysteine proteinase-like - Oryza sativa subsp. 
japonica (Rice) Length = 360 Score = 60.9 bits (141), Expect = 3e-08 Identities = 35/91 (38%), Positives = 47/91 (51%), Gaps = 4/91 (4%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+ A EG +G LVSLSEQ ++DC+ G N C+GG + A +YI +GG+ TE Sbjct: 163 AFAAVAATEGLVQLATGNLVSLSEQQVLDCTG--GANTCSGGDVSAALRYIAASGGLQTE 220 Query: 437 QTYPYEGVDDKCRYN----PKNTGAEDVASW 517 Y Y G CR P + A A W Sbjct: 221 AAYAYGGQQGACRAGGFAAPNSAAAVGGARW 251 Score = 55.2 bits (127), Expect = 1e-06 Identities = 25/78 (32%), Positives = 42/78 (53%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 200 +Y LG+N++ D+ EF +T G++ + + G + + +P+ VDW Sbjct: 85 TYTLGLNQFSDLTDDEFAQTHLGYSWAPPPPSHRHGHRAE-NGTAAAAADDTDVPDSVDW 143 Query: 201 RKHGAVTDIKDQGKCGSC 254 R GAVT++K+Q CGSC Sbjct: 144 RARGAVTEVKNQRSCGSC 161 >UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa (japonica cultivar-group)|Rep: Os09g0562700 protein - Oryza sativa subsp. japonica (Rice) Length = 235 Score = 60.9 bits (141), Expect = 3e-08 Identities = 30/65 (46%), Positives = 40/65 (61%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FST +EG + G LVSLSEQ L+DC ++GC+GG+ A ++I NGGI T Sbjct: 35 AFSTVAVVEGIQKIKKGKLVSLSEQELVDCDTL--DSGCDGGVSYRALEWITANGGITTR 92 Query: 437 QTYPY 451 YPY Sbjct: 93 DDYPY 97 Score = 34.7 bits (76), Expect = 2.0 Identities = 12/15 (80%), Positives = 15/15 (100%) Frame = +3 Query: 210 GAVTDIKDQGKCGSC 254 GAVT++KDQG+CGSC Sbjct: 19 GAVTEVKDQGRCGSC 33 >UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis|Rep: Cathepsin L - Culicoides sonorensis Length = 331 Score = 60.9 bits (141), Expect = 3e-08 Identities = 28/78 (35%), Positives = 50/78 (64%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F++ ++E ++ R +L+EQ L+DC ++GC+GG D A +Y++DNG + E Sbjct: 142 AFASVASVEMRYKRFHNKSYTLAEQELVDCETT--SHGCSGGWSDLALQYMRDNG-LSFE 198 Query: 437 QTYPYEGVDDKCRYNPKN 490 + YPY+G D+KC + 
+N Sbjct: 199 KDYPYKGKDEKCHASNEN 216 >UniRef50_Q23H32 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 365 Score = 60.9 bits (141), Expect = 3e-08 Identities = 36/86 (41%), Positives = 51/86 (59%), Gaps = 3/86 (3%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKD-NGGIDT 433 +FST ALEG + +Q+G ++ SEQNLIDC + NNGCNGG + A + + GI Sbjct: 160 AFSTVIALEGAYAKQTGNVIKFSEQNLIDCC-RIENNGCNGGDPEPALDCVMNVLKGIMK 218 Query: 434 EQTYPYEGVDDK-CRYN-PKNTGAED 505 Q YPY+ + K C ++ KN + D Sbjct: 219 NQDYPYQAITRKECDHDQSKNVFSPD 244 Score = 48.8 bits (111), Expect = 1e-04 Identities = 25/76 (32%), Positives = 41/76 (53%) Frame = +3 Query: 27 KLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRK 206 +L +N++ D+ EF + G+N + KHN + GS + + + +PE VDWR+ Sbjct: 87 QLEVNEFADLSLQEFRELYFGYNSSKKHNN---QQNGSTKNLRQSFLLSDSVPESVDWRE 143 Query: 207 HGAVTDIKDQGKCGSC 254 V ++ QG CGSC Sbjct: 144 K-LVAPVQKQGGCGSC 158 >UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_101, whole genome shotgun sequence - Paramecium tetraurelia Length = 306 Score = 60.9 bits (141), Expect = 3e-08 Identities = 33/89 (37%), Positives = 42/89 (47%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 SFS GA E G SEQNL+DC ++GC+GG A Y+ NG E Sbjct: 134 SFSAVGAFEAFFIFVKGTHFQYSEQNLVDCDT--NSHGCDGGYPAKAIDYLNKNGAF-LE 190 Query: 437 QTYPYEGVDDKCRYNPKNTGAEDVASWTS 523 YPY +KCR +T A +WT+ Sbjct: 191 SEYPYVASKEKCRKTQGSTKANSRKTWTT 219 >UniRef50_Q7M1Q7 Cluster: Actinidain; n=1; Actinidia chinensis|Rep: Actinidain - Actinidia chinensis (Kiwi) (Yangtao) Length = 110 Score = 60.5 bits (140), Expect = 4e-08 Identities = 27/57 (47%), Positives = 38/57 (66%) Frame = +2 Query: 302 SGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKC 472 +G 
L+SLSEQ LIDC GC+GG + + F++I ++GGI+TE+ YPY D C Sbjct: 12 TGVLISLSEQELIDCGR-----GCDGGYITDGFQFIINDGGINTEENYPYTAQDGDC 63 >UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 514 Score = 60.5 bits (140), Expect = 4e-08 Identities = 29/73 (39%), Positives = 40/73 (54%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 + + GA+EG +F ++G L LS Q +IDCS GN GC GG + A +I +G E Sbjct: 329 ALAAVGAVEGAYFMKTGKLKELSAQQVIDCSWGSGNRGCKGGYYNKAMSWIYLHGIASAE 388 Query: 437 QTYPYEGVDDKCR 475 PY G + CR Sbjct: 389 SYGPYLGQEGTCR 401 Score = 41.5 bits (93), Expect = 0.018 Identities = 14/27 (51%), Positives = 22/27 (81%) Frame = +3 Query: 174 VKLPEQVDWRKHGAVTDIKDQGKCGSC 254 V +P+++DWR +GAV+ ++ QG CGSC Sbjct: 301 VDVPDELDWRDYGAVSPVRGQGICGSC 327 Score = 35.5 bits (78), Expect = 1.1 Identities = 16/46 (34%), Positives = 27/46 (58%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEEC 648 F +P+ + L +VA GP V+I+ + S + YS G+Y++ EC Sbjct: 414 FAFVPKYNNTALKISVARFGPAVVSINENPLSLKFYSWGLYDDPEC 459 >UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endopterygota|Rep: Cathepsin L-like proteinase - Bombyx mori (Silk moth) Length = 402 Score = 60.5 bits (140), Expect = 4e-08 Identities = 29/75 (38%), Positives = 43/75 (57%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+ T AL+ Q +++ G LS Q ++DCS + GN GC+GG + A +Y G+ E Sbjct: 219 AFAVTHALQAQLYKRHGEWNELSPQQIVDCSIKDGNMGCDGGSLRGALRYAA-REGLVME 277 Query: 437 QTYPYEGVDDKCRYN 481 YPY G CRY+ Sbjct: 278 SHYPYVGKKGYCRYD 292 Score = 52.8 bits (121), Expect = 7e-06 Identities = 25/53 (47%), Positives = 39/53 (73%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 R + +P GDE+ + +A+ATVGP++VA++A+ +FQLY SGVY++ C S L Sbjct: 301 RRWATLPSGDEEAMEKALATVGPLAVAVNAAPFTFQLY-SGVYDDPFCVSWHL 352 Score = 37.9 bits (84), Expect = 0.22 Identities = 25/86 (29%), Positives = 39/86 
(45%), Gaps = 2/86 (2%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKN--LYMKGGSVRGAKFISPANV 176 Y G+ SY L +N +GDM E+ F K K K L+ + Sbjct: 138 YLAGIQSYSLHLNHFGDMHVTEY------FGKVLKLIKAFPLFDPAEDHHKTAYRHNRRC 191 Query: 177 KLPEQVDWRKHGAVTDIKDQGKCGSC 254 K+P+++DWR G ++Q +CG+C Sbjct: 192 KVPKRIDWRDQGFKPRREEQWQCGAC 217 >UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin L preproprotein; n=1; Monodelphis domestica|Rep: PREDICTED: similar to cathepsin L preproprotein - Monodelphis domestica Length = 356 Score = 60.1 bits (139), Expect = 5e-08 Identities = 32/65 (49%), Positives = 43/65 (66%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FST G+LEGQ FR++G LV LS+Q LIDCS Y C GG + A +I+ G+ +E Sbjct: 141 AFSTIGSLEGQLFRKTGRLVELSKQMLIDCSGYY---TCMGGSLTGALDFIR-RYGVVSE 196 Query: 437 QTYPY 451 + YPY Sbjct: 197 RCYPY 201 Score = 56.8 bits (131), Expect = 4e-07 Identities = 31/84 (36%), Positives = 46/84 (54%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 ++ G SY +GMN++GDM EF +N + +N K R + +L Sbjct: 66 FKEGKKSYFMGMNQFGDMTDKEFESRLNLRIAPVRTRRNYTFK----RRIYY------RL 115 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P+ VDWR HG VT I++QG+CG+C Sbjct: 116 PKSVDWRTHGYVTPIRNQGECGAC 139 Score = 54.8 bits (126), Expect = 2e-06 Identities = 27/48 (56%), Positives = 33/48 (68%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEEC 648 R +V +P GDE+ LM+AVATVGPV+VAI A SF+ Y G Y E C Sbjct: 232 RDYVTLPSGDERALMQAVATVGPVAVAIHAP-PSFRYYQGGPYIEPRC 278 >UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative uncharacterized protein - Oryza sativa subsp. 
japonica (Rice) Length = 326 Score = 60.1 bits (139), Expect = 5e-08 Identities = 32/79 (40%), Positives = 43/79 (54%) Frame = +3 Query: 18 VSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVD 197 +SYKLG+NK+ D+ EF G N +K G+ G+ ++ P D Sbjct: 69 MSYKLGLNKFADLTLEEFTAKYTGANPGPITG----LKNGT--GSPPLAAVAGDAPPAWD 122 Query: 198 WRKHGAVTDIKDQGKCGSC 254 WR+HGAVT +KDQG CGSC Sbjct: 123 WREHGAVTRVKDQGPCGSC 141 Score = 37.1 bits (82), Expect = 0.38 Identities = 20/42 (47%), Positives = 28/42 (66%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYN 636 FVD DE+ L +AV + GPVSV I+AS+ F +Y GV++ Sbjct: 209 FVD--PNDEEALKQAVYSQGPVSVLIEASY-EFMIYQGGVFS 247 >UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudicotyledons|Rep: Chymopapain precursor - Carica papaya (Papaya) Length = 352 Score = 60.1 bits (139), Expect = 5e-08 Identities = 29/77 (37%), Positives = 44/77 (57%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FST +EG + +G L+ LSEQ L+DC + + GC GG + +Y+ +N G+ T Sbjct: 161 AFSTIATVEGINKIVTGNLLELSEQELVDCDKH--SYGCKGGYQTTSLQYVANN-GVHTS 217 Query: 437 QTYPYEGVDDKCRYNPK 487 + YPY+ KCR K Sbjct: 218 KVYPYQAKQYKCRATDK 234 Score = 50.0 bits (114), Expect = 5e-05 Identities = 29/78 (37%), Positives = 40/78 (51%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 200 SY LG+N + D+ + EF K GF A+ L K ++ P+ +DW Sbjct: 88 SYWLGLNGFADLSNDEFKKKYVGF--VAEDFTGLEHFDNEDFTYKHVT----NYPQSIDW 141 Query: 201 RKHGAVTDIKDQGKCGSC 254 R GAVT +K+QG CGSC Sbjct: 142 RAKGAVTPVKNQGACGSC 159 Score = 33.1 bits (72), Expect = 6.1 Identities = 16/43 (37%), Positives = 25/43 (58%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYN 636 G+ +P E + A+A P+SV ++A FQLY SGV++ Sbjct: 243 GYKRVPSNCETSFLGALANQ-PLSVLVEAGGKPFQLYKSGVFD 284 >UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; Leishmania|Rep: Cysteine proteinase 1 precursor - Leishmania pifanoi Length = 354 Score = 60.1 bits (139), Expect = 5e-08 
Identities = 32/67 (47%), Positives = 39/67 (58%), Gaps = 2/67 (2%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYI--KDNGGID 430 +FS G +EGQ LVSLSEQ L+ C + GCNGGLMD A +I NG + Sbjct: 155 AFSAIGNIEGQWAASGHSLVSLSEQMLVSCDNI--DEGCNGGLMDQAMNWIMQSHNGSVF 212 Query: 431 TEQTYPY 451 TE +YPY Sbjct: 213 TEASYPY 219 Score = 41.1 bits (92), Expect = 0.023 Identities = 26/71 (36%), Positives = 38/71 (53%) Frame = +3 Query: 42 KYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVT 221 K+ D+ EF K + A+H K+ + + V + +P+ V VDWR GAVT Sbjct: 90 KFADLTPQEFAKLYLNPDYYARHLKD-HKEDVHVDDS---APSGVM---SVDWRDKGAVT 142 Query: 222 DIKDQGKCGSC 254 +K+QG CGSC Sbjct: 143 PVKNQGLCGSC 153 Score = 38.3 bits (85), Expect = 0.16 Identities = 20/41 (48%), Positives = 29/41 (70%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGV 630 GF+ +P DE+++ E V GPV+VA+DA T++QLY GV Sbjct: 241 GFLSLPH-DEERIAEWVEKRGPVAVAVDA--TTWQLYFGGV 278 >UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 280 Score = 59.7 bits (138), Expect = 6e-08 Identities = 31/83 (37%), Positives = 44/83 (53%), Gaps = 2/83 (2%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQ--YGNNGCNGGLMDNAFKYIKDNGGID 430 +F+ TG E + ++ + SEQ L+DCS Y N+GC GG AF+Y K N GI Sbjct: 94 AFTITGLFESINLIRNKTVELYSEQELLDCSSNGIYRNSGCQGGWPHLAFEYSKKN-GIS 152 Query: 431 TEQTYPYEGVDDKCRYNPKNTGA 499 YPY+G+ + C N + A Sbjct: 153 LSSQYPYKGIQENCTVNQQTKKA 175 Score = 54.4 bits (125), Expect = 2e-06 Identities = 29/82 (35%), Positives = 45/82 (54%), Gaps = 4/82 (4%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVK-TMNG--FNKTAKHNKNLYMKGGSVRGAKFISPANVK-LPE 188 SY++GMN++ D+ EF ++N FN ++ +N+ + + N LP+ Sbjct: 11 SYQIGMNQFSDLTIEEFQSISLNQQLFNSESRKLENIKNENQQADFYLQLLKTNASSLPQ 70 Query: 189 QVDWRKHGAVTDIKDQGKCGSC 254 Q DWR G VT +K+QG CGSC Sbjct: 71 QFDWRNLGKVTQVKNQGNCGSC 92 >UniRef50_Q24FN2 
Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 367 Score = 59.7 bits (138), Expect = 6e-08 Identities = 31/78 (39%), Positives = 42/78 (53%), Gaps = 3/78 (3%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCS---EQYGNNGCNGGLMDNAFKYIKDNGGI 427 +FS E + ++ L SEQ L+DC+ QY N GC GG A++YIKD GI Sbjct: 181 AFSAVALAESVNLLRNNSLALYSEQELVDCTYKNPQYYNYGCQGGWPSVAYRYIKDQ-GI 239 Query: 428 DTEQTYPYEGVDDKCRYN 481 ++Q YPY G + C N Sbjct: 240 SSQQNYPYIGQNRNCSIN 257 Score = 39.9 bits (89), Expect = 0.053 Identities = 13/23 (56%), Positives = 19/23 (82%) Frame = +3 Query: 186 EQVDWRKHGAVTDIKDQGKCGSC 254 + +DWR+ GAV+ +K+QG CGSC Sbjct: 157 QSIDWRQSGAVSPVKNQGSCGSC 179 >UniRef50_Q248G1 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 334 Score = 59.3 bits (137), Expect = 8e-08 Identities = 30/79 (37%), Positives = 45/79 (56%), Gaps = 3/79 (3%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDC---SEQYGNNGCNGGLMDNAFKYIKDNGGI 427 +FS G +E + + G VS +EQ ++DC S Y ++GCNGG + A +Y+ + G + Sbjct: 148 TFSIAGIVESHYVLKHGSYVSYAEQEILDCVSVSAGYQSDGCNGGWPEEALQYVIEYGIV 207 Query: 428 DTEQTYPYEGVDDKCRYNP 484 +E YPY V KCR P Sbjct: 208 KSE-VYPYVAVQGKCRDIP 225 Score = 35.1 bits (77), Expect = 1.5 Identities = 15/27 (55%), Positives = 18/27 (66%), Gaps = 1/27 (3%) Frame = +3 Query: 177 KLPEQVDWRK-HGAVTDIKDQGKCGSC 254 ++PE VDWR V IK+QG CGSC Sbjct: 120 QIPESVDWRNVTNVVGPIKNQGHCGSC 146 >UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis|Rep: Cysteine protease 2 - Babesia bovis Length = 445 Score = 58.4 bits (135), Expect = 1e-07 Identities = 33/72 (45%), Positives = 40/72 (55%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+ G++E RQ V LSEQ L+ C Q GN GCNGG D A YIK N GI Sbjct: 
262 AFAAVGSVESLLKRQKTD-VRLSEQELVSC--QLGNQGCNGGYSDYALNYIKFN-GIHRS 317 Query: 437 QTYPYEGVDDKC 472 + +PY D KC Sbjct: 318 EEWPYLAADGKC 329 Score = 41.5 bits (93), Expect = 0.018 Identities = 15/23 (65%), Positives = 18/23 (78%) Frame = +3 Query: 186 EQVDWRKHGAVTDIKDQGKCGSC 254 E +DWR+ AVT +KDQG CGSC Sbjct: 238 EDIDWRRADAVTPVKDQGMCGSC 260 >UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Plasmodium|Rep: Cysteine protease falcipain-3 - Plasmodium falciparum Length = 492 Score = 58.0 bits (134), Expect = 2e-07 Identities = 28/65 (43%), Positives = 41/65 (63%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS+ G++E Q+ + L SEQ L+DCS + NNGC GG + NAF + D GG+ ++ Sbjct: 295 AFSSVGSVESQYAIRKKALFLFSEQELVDCSVK--NNGCYGGYITNAFDDMIDLGGLCSQ 352 Query: 437 QTYPY 451 YPY Sbjct: 353 DDYPY 357 Score = 51.2 bits (117), Expect = 2e-05 Identities = 35/84 (41%), Positives = 41/84 (48%), Gaps = 7/84 (8%) Frame = +3 Query: 24 YKLGMNKYGDMLHHEF------VKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLP 185 YK GMNK+GD+ EF +KT F KT + V K PA+ KL Sbjct: 213 YKRGMNKFGDLSPEEFRSKYLNLKTHGPF-KTLSPPVSYEANYEDV--IKKYKPADAKLD 269 Query: 186 E-QVDWRKHGAVTDIKDQGKCGSC 254 DWR HG VT +KDQ CGSC Sbjct: 270 RIAYDWRLHGGVTPVKDQALCGSC 293 >UniRef50_Q231X3 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 323 Score = 58.0 bits (134), Expect = 2e-07 Identities = 30/57 (52%), Positives = 34/57 (59%), Gaps = 1/57 (1%) Frame = +2 Query: 314 VSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVD-DKCRYN 481 +SLSEQ LIDCS YGN GC G + A YIK I TEQ YPY D KC ++ Sbjct: 162 ISLSEQQLIDCSGDYGNYGCAAGQKEQALVYIK-RYSITTEQNYPYTEKDVQKCYFD 217 >UniRef50_Q22RC9 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 389 Score = 58.0 
bits (134), Expect = 2e-07 Identities = 31/71 (43%), Positives = 43/71 (60%), Gaps = 6/71 (8%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDC------SEQYGNNGCNGGLMDNAFKYIKDN 418 +FSTTG +EGQ F LVSLSE+ ++DC S + + G GG AF Y+ + Sbjct: 151 TFSTTGNIEGQWFLAGNPLVSLSEEQIVDCDGSQEPSTGHADCGVFGGWPYLAFDYVINA 210 Query: 419 GGIDTEQTYPY 451 GG+ +E+TYPY Sbjct: 211 GGLPSEETYPY 221 Score = 40.3 bits (90), Expect = 0.040 Identities = 29/83 (34%), Positives = 37/83 (44%) Frame = +3 Query: 6 EMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLP 185 E G Y G+ ++ DM EF K+ T N G G + IS P Sbjct: 77 EEGTAEY--GITQFSDMTTEEF-KSQILIPSTYARN----FTGSRYHGFQKISQ---DAP 126 Query: 186 EQVDWRKHGAVTDIKDQGKCGSC 254 DWR HGAVT +K+QG G+C Sbjct: 127 TSYDWRDHGAVTPVKNQGTVGTC 149 Score = 36.3 bits (80), Expect = 0.66 Identities = 17/44 (38%), Positives = 26/44 (59%) Frame = +1 Query: 532 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 DE + + + +GP+SVA+DAS+ Q Y G+ + CS T L Sbjct: 275 DEDSIKQQLFEIGPLSVALDASY--LQFYKKGISAPKFCSKTTL 316 >UniRef50_Q22DX2 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 358 Score = 57.6 bits (133), Expect = 2e-07 Identities = 31/83 (37%), Positives = 49/83 (59%), Gaps = 4/83 (4%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FST+GA+E + + ++LS+Q L+DC Y + GC+GG ++AFKYI+ G + Sbjct: 175 AFSTSGAVESYYSAKKNITLNLSKQQLVDCV--YDHGGCDGGWFNDAFKYIQSVGIVLNA 232 Query: 437 QTYPYEGVD--DKCRYN--PKNT 493 YPY D + C+ + PK T Sbjct: 233 TYYPYINKDQTEPCQLSKLPKGT 255 >UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_21, whole genome shotgun sequence - Paramecium tetraurelia Length = 349 Score = 57.6 bits (133), Expect = 2e-07 Identities = 33/75 (44%), Positives = 43/75 (57%), Gaps = 3/75 (4%) Frame = +2 Query: 
257 SFSTTGALEGQHFRQSGYL-VSLSEQNLIDCS--EQYGNNGCNGGLMDNAFKYIKDNGGI 427 +FS ALE RQ G V LSEQ L+DC+ +++ + GC+GG M + F+Y G I Sbjct: 151 AFSAVAALETA-LRQGGVKNVELSEQELVDCAVKDEFESEGCDGGEMYDGFQYASKYG-I 208 Query: 428 DTEQTYPYEGVDDKC 472 YPY GVD KC Sbjct: 209 AIRSEYPYAGVDQKC 223 Score = 48.4 bits (110), Expect = 2e-04 Identities = 27/84 (32%), Positives = 44/84 (52%), Gaps = 1/84 (1%) Frame = +3 Query: 6 EMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKN-LYMKGGSVRGAKFISPANVKL 182 E GL +++LG+N + D+ EF + T + N +Y + G ++ Sbjct: 78 EAGLETFELGLNDFADLSVEEFEAKYLKYRSTPREQTNQVYRRTGK------------QV 125 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSC 254 P +VD RK G V+++K+QG CGSC Sbjct: 126 PIEVDLRKDGVVSEVKNQGSCGSC 149 Score = 34.3 bits (75), Expect = 2.7 Identities = 17/43 (39%), Positives = 27/43 (62%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYN 636 G+VD+ Q +EA A+ +S+ I+AS +FQLY G+Y+ Sbjct: 236 GYVDVEPLSAQAYVEA-ASEHALSIGINASGINFQLYKKGIYS 277 >UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 344 Score = 57.2 bits (132), Expect = 3e-07 Identities = 31/78 (39%), Positives = 41/78 (52%), Gaps = 3/78 (3%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGY-LVSLSEQNLIDC--SEQYGNNGCNGGLMDNAFKYIKDNGGI 427 +F++T LE F ++G L + SEQ ++DC Y +NGCNGG A Y NG Sbjct: 161 TFASTAVLESFSFIKNGAPLTNFSEQQILDCVYGSGYYSNGCNGGFGSEALNYAIQNGIA 220 Query: 428 DTEQTYPYEGVDDKCRYN 481 Q YPY G C+YN Sbjct: 221 PLSQ-YPYVGKQQGCKYN 237 Score = 54.0 bits (124), Expect = 3e-06 Identities = 30/81 (37%), Positives = 40/81 (49%), Gaps = 3/81 (3%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEF-VKTMNGFNKTAKHNKNLYMKGGSVRGAK-FISPA-NVKLPEQ 191 SY LG N DM H EF + +N +K +K G S + ++ P K Sbjct: 79 SYTLGHNHLSDMTHEEFSLYQLNPARTASKSSKGGNNSGNSSGSSNPYVDPPITTKNAPP 138 Query: 192 VDWRKHGAVTDIKDQGKCGSC 254 +DWR A+T +K QGKCGSC Sbjct: 139 MDWRNASAITPVKQQGKCGSC 159 >UniRef50_Q26888 Cluster: Cathepsin L-like 
cysteine proteinase; n=1; Toxocara canis|Rep: Cathepsin L-like cysteine proteinase - Toxocara canis (Canine roundworm) Length = 360 Score = 57.2 bits (132), Expect = 3e-07 Identities = 31/80 (38%), Positives = 46/80 (57%), Gaps = 1/80 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+T G +E + +G L SLSEQ L+DC+ + NN C+GG +D A +Y+ D G+ E Sbjct: 171 AFATVGTVESAYALGTGELRSLSEQQLLDCNLE--NNACDGGDVDKALRYVYDE-GLMRE 227 Query: 437 QTYPYEG-VDDKCRYNPKNT 493 YPY D C+ + T Sbjct: 228 YDYPYVAHRQDTCQLRGETT 247 Score = 38.3 bits (85), Expect = 0.16 Identities = 13/26 (50%), Positives = 18/26 (69%) Frame = +3 Query: 177 KLPEQVDWRKHGAVTDIKDQGKCGSC 254 ++P+ DWR + VT +K Q KCGSC Sbjct: 144 EIPDHFDWRPYNVVTPVKSQFKCGSC 169 >UniRef50_Q53K53 Cluster: Cysteine protease 1, putative; n=5; Oryza sativa|Rep: Cysteine protease 1, putative - Oryza sativa subsp. japonica (Rice) Length = 472 Score = 56.8 bits (131), Expect = 4e-07 Identities = 27/61 (44%), Positives = 37/61 (60%) Frame = +2 Query: 269 TGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYP 448 T +E + ++ LVSLSEQ L+DC G GCN G A+K++ +NGG+ TE YP Sbjct: 324 TATIESLNMIKTRRLVSLSEQQLVDCDSYDG--GCNLGSYGRAYKWVVENGGLTTEADYP 381 Query: 449 Y 451 Y Sbjct: 382 Y 382 Score = 39.5 bits (88), Expect = 0.071 Identities = 30/103 (29%), Positives = 44/103 (42%), Gaps = 1/103 (0%) Frame = +3 Query: 12 GLVSYKLGMNKYGDMLHHEFVKTMNGFN-KTAKHNKNLYMKGGSVRGAKFISPANVKLPE 188 G ++Y+L N++ D+ EF+ T G+ + ++ G A F V +P Sbjct: 89 GDLTYQLAENEFADLTEEEFLATYTGYYIGDGPVDDFVFTTGAGDVDASF--SYRVDVPA 146 Query: 189 QVDWRKHGAVTDIKDQGKCGSCGPSARLELWKDSTSVSPATWC 317 VDWR GAV K Q S P + + S SV A C Sbjct: 147 SVDWRAQGAVVPPKSQTSTCSTTPRPKSAV---SESVGKAPMC 186 >UniRef50_Q9XZM9 Cluster: Cysteine proteinase CPW2; n=1; Acanthamoeba royreba|Rep: Cysteine proteinase CPW2 - Acanthamoeba royreba Length = 142 Score = 56.8 bits (131), Expect = 4e-07 Identities = 28/80 (35%), Positives = 42/80 (52%) Frame = +2 Query: 278 
LEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEG 457 +E Q L LS Q ++DCS + ++GC GG A+ Y+ + G+DT +YPY Sbjct: 3 IESQWALAGHNLTELSMQQIVDCS--WWDSGCGGGWPSYAYDYVVNAPGLDTLASYPYTA 60 Query: 458 VDDKCRYNPKNTGAEDVASW 517 D C YN N A +++W Sbjct: 61 QDGSCAYNQNNVVA-TISTW 79 >UniRef50_Q23FR0 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 394 Score = 56.8 bits (131), Expect = 4e-07 Identities = 32/83 (38%), Positives = 42/83 (50%), Gaps = 3/83 (3%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYG--NNGCNGGLMDNAFKYIKDNGGID 430 +F G +E + +G L S SEQ L+DC Q G ++GCNGG + +Y GI Sbjct: 210 TFGAAGVMESFNAITNGVLKSFSEQQLVDCVHQAGFSSDGCNGGFQSDGVEY-AIKFGIV 268 Query: 431 TEQTYPYEGVDDKCRY-NPKNTG 496 TE YPY V C+ NP G Sbjct: 269 TEDKYPYTAVGGDCQISNPTTDG 291 Score = 33.1 bits (72), Expect = 6.1 Identities = 13/29 (44%), Positives = 17/29 (58%), Gaps = 1/29 (3%) Frame = +3 Query: 171 NVKLPEQVDWRK-HGAVTDIKDQGKCGSC 254 N + VDWR + +KDQG+CGSC Sbjct: 180 NTTVAASVDWRNVKNVLNPVKDQGQCGSC 208 >UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease Gip1p; n=4; Tetrahymena thermophila|Rep: Granule-biosynthesis induced protease Gip1p - Tetrahymena thermophila Length = 345 Score = 56.4 bits (130), Expect = 6e-07 Identities = 29/80 (36%), Positives = 40/80 (50%), Gaps = 2/80 (2%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTM--NGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQV 194 SY G+N++ DM EF + + +K A NK + + P N LP V Sbjct: 79 SYSKGLNQFSDMTKEEFKQRVLNKKISKKASSNKGGRNLAADPAVSNLVFPTN-NLPLSV 137 Query: 195 DWRKHGAVTDIKDQGKCGSC 254 DWRK G + +K+QG CGSC Sbjct: 138 DWRKRGVLNPVKNQGTCGSC 157 Score = 50.4 bits (115), Expect = 4e-05 Identities = 24/75 (32%), Positives = 43/75 (57%), Gaps = 2/75 (2%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSE--QYGNNGCNGGLMDNAFKYIKDNGGID 430 +F+T G LE + ++ L+ SEQ L+DC Y ++GC+GG ++ +Y + G + Sbjct: 159 
TFATAGILESFNQIKNKQLLKFSEQQLVDCVSLAGYDSDGCDGGFQEDGVRYAIEYGIVQ 218 Query: 431 TEQTYPYEGVDDKCR 475 + + YPY G +C+ Sbjct: 219 SYK-YPYVGYQGRCK 232 >UniRef50_Q26989 Cluster: Cysteine proteinase 5; n=1; Tritrichomonas foetus|Rep: Cysteine proteinase 5 - Tritrichomonas foetus (Trichomonas foetus) Length = 155 Score = 56.4 bits (130), Expect = 6e-07 Identities = 30/67 (44%), Positives = 41/67 (61%), Gaps = 2/67 (2%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYI--KDNGGID 430 +FST A EG H ++G L+ LSEQNL+DC++ +GC+GG AF Y+ K G Sbjct: 1 AFSTIVAQEGCHQIETGELLRLSEQNLVDCADNC--HGCDGGWPIEAFNYVLNKQGGKYC 58 Query: 431 TEQTYPY 451 T+ YPY Sbjct: 59 TDDDYPY 65 Score = 52.8 bits (121), Expect = 7e-06 Identities = 20/43 (46%), Positives = 30/43 (69%) Frame = +1 Query: 520 IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEEC 648 IP+GDE+ + E VA GPV++ +D+++ SF Y G+Y EE C Sbjct: 91 IPQGDEEAMKEVVANWGPVAINVDSNYGSFNFYDGGIYVEESC 133 >UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 291 Score = 56.4 bits (130), Expect = 6e-07 Identities = 30/79 (37%), Positives = 41/79 (51%), Gaps = 2/79 (2%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYI--KDNGGID 430 +F T A E + L LSEQN+IDC+ GC GG++ A +I K G I Sbjct: 104 AFGTVAACESNYALLYSNLPQLSEQNIIDCATTC--YGCGGGIIQAAMSFIINKQGGAIM 161 Query: 431 TEQTYPYEGVDDKCRYNPK 487 YPY+GVD C+++ K Sbjct: 162 KLSDYPYQGVDGACKFDAK 180 Score = 51.6 bits (118), Expect = 2e-05 Identities = 24/51 (47%), Positives = 31/51 (60%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 FV +P G E+ L V G V +D S SFQLYSSG+Y++ CSS +L Sbjct: 189 FVSVPSGSERDLANYVYQYGVAVVVLDCSRISFQLYSSGIYSDPCCSSQNL 239 Score = 37.5 bits (83), Expect = 0.28 Identities = 25/78 (32%), Positives = 37/78 (47%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 200 +YKL 
+N + E+ + K +KNL +G VR P P +D+ Sbjct: 36 NYKLSLNSLSHLTPTEYQSLLG-----TKIDKNLVSQGKKVR------PQIKDSPGILDY 84 Query: 201 RKHGAVTDIKDQGKCGSC 254 R+ G V I+DQ +CGSC Sbjct: 85 REMGVVNPIRDQKQCGSC 102 >UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_56, whole genome shotgun sequence - Paramecium tetraurelia Length = 314 Score = 56.4 bits (130), Expect = 6e-07 Identities = 26/55 (47%), Positives = 39/55 (70%), Gaps = 1/55 (1%) Frame = +2 Query: 320 LSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVDDK-CRYN 481 LSEQ L+DC ++ NNGCNGG + ++ K N G+ T++ YPY+GV +K C+Y+ Sbjct: 159 LSEQQLVDC-DKGTNNGCNGGFENLGIQWAKKN-GLTTDKQYPYDGVQNKQCKYS 211 >UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|Rep: Phospholipase A1 - Tetrahymena thermophila Length = 320 Score = 56.0 bits (129), Expect = 8e-07 Identities = 32/78 (41%), Positives = 43/78 (55%), Gaps = 5/78 (6%) Frame = +2 Query: 257 SFSTTGALEGQHF---RQSGYLVSLSEQNLIDC--SEQYGNNGCNGGLMDNAFKYIKDNG 421 +FST GA+E + + ++L+EQ +DC S +Y + GCNGG M FKYI DN Sbjct: 138 AFSTIGAVESALWIAGQGEQNTLNLAEQEQVDCAKSPKYDSEGCNGGWMVEGFKYIIDN- 196 Query: 422 GIDTEQTYPYEGVDDKCR 475 I YPY D KC+ Sbjct: 197 KISQTANYPYTAKDGKCK 214 Score = 40.3 bits (90), Expect = 0.040 Identities = 20/41 (48%), Positives = 29/41 (70%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVY 633 + +IP+GD L A+ GP+SVA+DA T+FQ Y+SGV+ Sbjct: 227 YAEIPQGDCNSLNSALEQ-GPISVAVDA--TNFQFYTSGVF 264 Score = 36.7 bits (81), Expect = 0.50 Identities = 13/22 (59%), Positives = 16/22 (72%) Frame = +3 Query: 189 QVDWRKHGAVTDIKDQGKCGSC 254 +VDW G VT +K+QG CGSC Sbjct: 115 EVDWTAKGKVTPVKNQGSCGSC 136 >UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckeia)|Rep: Berghepain-2 - Plasmodium yoelii yoelii Length = 472 Score = 56.0 bits (129), Expect = 8e-07 Identities = 27/76 (35%), Positives = 45/76 (59%), Gaps = 1/76 (1%) Frame = +2 Query: 257 
SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+T G + Q+ + VSLSEQ L+DC++ N GC+GG++ AF+ + D G+ + Sbjct: 276 AFATAGVVAAQYAIRKNQKVSLSEQQLVDCAQ--NNFGCDGGILPYAFEDLIDMNGLCED 333 Query: 437 QTYPY-EGVDDKCRYN 481 + YPY + + C N Sbjct: 334 KYYPYVSNLPELCEIN 349 Score = 52.0 bits (119), Expect = 1e-05 Identities = 29/80 (36%), Positives = 39/80 (48%), Gaps = 3/80 (3%) Frame = +3 Query: 24 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKG---GSVRGAKFISPANVKLPEQV 194 Y G+N + DM H EF M N K N + ++ ++ K+ SP + Sbjct: 197 YTKGINAFSDMRHEEF--KMKYLNNKLKENHQIDLRHLIPYTIAINKYKSPTDQINYTSF 254 Query: 195 DWRKHGAVTDIKDQGKCGSC 254 DWR H A+ DIKDQ KC SC Sbjct: 255 DWRDHNAIIDIKDQQKCASC 274 >UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep: Cysteine proteinase - Entamoeba histolytica Length = 320 Score = 56.0 bits (129), Expect = 8e-07 Identities = 26/56 (46%), Positives = 36/56 (64%) Frame = +2 Query: 314 VSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKCRYN 481 + LSEQ ++DCS + NNGCNGG + F Y K NG I+ E+ YPY + C+Y+ Sbjct: 147 LDLSEQQIVDCSNK--NNGCNGGSILYVFAYTKRNGVIE-EKDYPYTATNGTCQYD 199 Score = 52.8 bits (121), Expect = 7e-06 Identities = 27/52 (51%), Positives = 35/52 (67%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 G V + + +E L+EA+A GPV+VAIDA SFQLY SGVY+E +C L Sbjct: 209 GQVIVEQRNEVALVEAIAE-GPVAVAIDAGQASFQLYKSGVYDEPKCKKVIL 259 Score = 37.1 bits (82), Expect = 0.38 Identities = 19/67 (28%), Positives = 32/67 (47%), Gaps = 3/67 (4%) Frame = +3 Query: 63 HEFVKTMNG-FNKTAKHNKNLYMKGGSVRGAKFISPANVK--LPEQVDWRKHGAVTDIKD 233 H F +++G + N +K +V+ +K +P +DWR G +T I+D Sbjct: 55 HNFQLSVDGPYAAMTNAEYNTLLKARTVKNVNAPVRKAIKGDIPTAIDWRAEGKLTPIRD 114 Query: 234 QGKCGSC 254 +CGSC Sbjct: 115 HTQCGSC 121 >UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheirus salmonis|Rep: Cysteine proteinase - Lepeophtheirus salmonis (salmon louse) Length = 372 Score = 56.0 bits (129), Expect = 8e-07 Identities = 27/79 (34%), Positives = 42/79 (53%), Gaps = 1/79 (1%) 
Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK-LPEQVD 197 ++ +G+N++ D+ EF G++ + G V N+K LPE VD Sbjct: 68 TWDMGINEFSDLTDEEFESKYMGYSPMSS-------SAGLVTRTAAPKQGNIKDLPESVD 120 Query: 198 WRKHGAVTDIKDQGKCGSC 254 WR+ G +TD+K+QG CGSC Sbjct: 121 WREKGVITDVKNQGSCGSC 139 Score = 33.9 bits (74), Expect = 3.5 Identities = 23/62 (37%), Positives = 33/62 (53%), Gaps = 8/62 (12%) Frame = +2 Query: 320 LSEQNLIDCSEQ-Y---GNNGCNGGLMDNAFKYIKDNGGIDTEQTYPY-EGVDD---KCR 475 LS Q + CS Y G+ GC G + + A+ Y + G I+TE+ YPY G + +C Sbjct: 164 LSTQQITSCSSNPYSCGGSGGCKGAINEIAYMYTQLYG-IETEKEYPYTSGFTEESGECL 222 Query: 476 YN 481 YN Sbjct: 223 YN 224 Score = 33.1 bits (72), Expect = 6.1 Identities = 16/44 (36%), Positives = 25/44 (56%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYN 636 RG+ +P D +ME +A GP+ V++ A F+ Y SG+ N Sbjct: 236 RGYEVLPPNDMYSVMEHLANKGPLGVSVYAGR--FKSYKSGILN 277 >UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; Phytophthora infestans|Rep: Cathepsin-like cysteine protease - Phytophthora infestans (Potato late blight fungus) Length = 376 Score = 55.6 bits (128), Expect = 1e-06 Identities = 30/85 (35%), Positives = 45/85 (52%), Gaps = 1/85 (1%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK- 179 YE G S+ LG+N D+ E+ + ++ + +K S F+ P NV+ Sbjct: 82 YERGEHSFTLGLNDLADLADAEYKQLLSYRTRDSK---------SSSASETFVKPENVED 132 Query: 180 LPEQVDWRKHGAVTDIKDQGKCGSC 254 LP DWR+H VT +K+QG+CGSC Sbjct: 133 LPATWDWREHSTVTPVKNQGQCGSC 157 Score = 46.8 bits (106), Expect = 5e-04 Identities = 22/48 (45%), Positives = 30/48 (62%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSS 654 + ++ GDE L A+AT G +VAIDAS +FQLY GVY+ C + Sbjct: 247 YANVTSGDEAALQAAIATKGVQAVAIDASSFTFQLYRHGVYSWPLCGN 294 Score = 44.8 bits (101), Expect = 0.002 Identities = 28/68 (41%), Positives = 38/68 (55%), Gaps = 3/68 (4%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCN-GGLMDNAFKYIKDN--GGI 427 +FS A+E + +G L SLSEQ L+DC+ G 
+ CN GG M ++ I N G I Sbjct: 159 AFSAVAAMECAYALSTGTLESLSEQELVDCTLN-GIDTCNHGGEMSEGYEEIITNHKGKI 217 Query: 428 DTEQTYPY 451 D E+ Y Y Sbjct: 218 DREEVYRY 225 >UniRef50_Q26987 Cluster: Cysteine proteinase 3; n=1; Tritrichomonas foetus|Rep: Cysteine proteinase 3 - Tritrichomonas foetus (Trichomonas foetus) Length = 157 Score = 55.6 bits (128), Expect = 1e-06 Identities = 31/76 (40%), Positives = 40/76 (52%), Gaps = 2/76 (2%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY-IKDNGG-ID 430 SF+ A EG F SG LV +SEQ +DC + GC GG D A+ + I +N G + Sbjct: 1 SFAACAAFEGAWFASSGKLVKISEQLFVDCCKYC--FGCYGGSADAAYNWAIHENDGKVC 58 Query: 431 TEQTYPYEGVDDKCRY 478 + YPY G CRY Sbjct: 59 LHEDYPYTGTQGVCRY 74 Score = 44.0 bits (99), Expect = 0.003 Identities = 17/43 (39%), Positives = 27/43 (62%) Frame = +1 Query: 532 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTD 660 DE + + + +GP++VAIDA F+LY SG+Y ++ C D Sbjct: 97 DEDLMCQTLEEIGPLTVAIDADGAKFRLYDSGIYYDDTCVQGD 139 >UniRef50_Q17IZ6 Cluster: Procathepsin L3, putative; n=2; Culicidae|Rep: Procathepsin L3, putative - Aedes aegypti (Yellowfever mosquito) Length = 313 Score = 55.6 bits (128), Expect = 1e-06 Identities = 26/71 (36%), Positives = 38/71 (53%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS AL GQ R+ G + +S Q ++DCS GN GC GG + +Y++++ GI Sbjct: 160 AFSIGHALNGQIMRRIGRVEYVSTQQMVDCSTSAGNKGCAGGSLRFTMQYLQNSQGIMRS 219 Query: 437 QTYPYEGVDDK 469 YPY K Sbjct: 220 SDYPYTSSSSK 230 Score = 48.0 bits (109), Expect = 2e-04 Identities = 24/90 (26%), Positives = 42/90 (46%), Gaps = 6/90 (6%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK------NLYMKGGSVRGAKFIS 164 YE G ++++G+N+ DM ++K M H K + ++ + G +F+ Sbjct: 69 YEQGKSTFQMGVNELADMDKSSYLKKMVRMTDAIDHRKLDVDFNDEMLQATNAFGEEFVQ 128 Query: 165 PANVKLPEQVDWRKHGAVTDIKDQGKCGSC 254 +P+ +DWR G T +Q CGSC Sbjct: 129 ATQNSMPDSLDWRDKGFTTMAVNQKTCGSC 158 >UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanensis|Rep: Sui m 1 allergen - Suidasia 
medanensis Length = 336 Score = 55.6 bits (128), Expect = 1e-06 Identities = 28/83 (33%), Positives = 46/83 (55%), Gaps = 5/83 (6%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSE-----QYGNNGCNGGLMDNAFKYIKDNG 421 +F+T +E Q+ + V+LSEQ L+DC QY ++GC GG A+ Y++ G Sbjct: 140 AFATAATVEAQYAIRKNVHVTLSEQQLVDCDHRPFQGQYEDHGCQGGNPIIAYAYVQQTG 199 Query: 422 GIDTEQTYPYEGVDDKCRYNPKN 490 ++ E YPY+ D +C+ + N Sbjct: 200 LVE-ESAYPYQARDGQCQSSTVN 221 Score = 34.7 bits (76), Expect = 2.0 Identities = 23/75 (30%), Positives = 36/75 (48%) Frame = +3 Query: 30 LGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKH 209 L +N++ D+ EF N+ A L+ + V S +V LP DWR+ Sbjct: 69 LEVNEHADLTAEEFSSMYATLNQEAFLKSPLHKEFVQVPE----SDISVALPAAFDWRQQ 124 Query: 210 GAVTDIKDQGKCGSC 254 T +++QG+CGSC Sbjct: 125 WN-TAVRNQGQCGSC 138 >UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_46, whole genome shotgun sequence - Paramecium tetraurelia Length = 336 Score = 55.6 bits (128), Expect = 1e-06 Identities = 35/90 (38%), Positives = 48/90 (53%), Gaps = 4/90 (4%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F GA E + ++ V LSEQ LIDC Q + GCNGG + A KYI N G++ Sbjct: 165 AFGAVGAAEAWFYVKNKTTVLLSEQQLIDCDTQ--SFGCNGGYQNLALKYIA-NHGLNDA 221 Query: 437 QTYPY-EGVDDKCRYNP---KNTGAEDVAS 514 + YPY + C+Y K GA+ V+S Sbjct: 222 RVYPYTQKQSAYCKYESGPYKTNGAQGVSS 251 Score = 34.7 bits (76), Expect = 2.0 Identities = 28/99 (28%), Positives = 41/99 (41%), Gaps = 6/99 (6%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 200 +Y + MN++ D+ EF N M+ + + N K VDW Sbjct: 94 TYFMKMNQFSDLSQEEF-----SLIYLTHDNAEEVMEQNLIIDELQKTQENDKTINSVDW 148 Query: 201 RKHGAVTDIKDQGKCGSC---GPSARLELW---KDSTSV 299 RK +T +KDQG+C C G E W K+ T+V Sbjct: 149 RK---ITQVKDQGQCSGCWAFGAVGAAEAWFYVKNKTTV 184 >UniRef50_Q23894 Cluster: Cysteine proteinase 3; n=2; Dictyostelium discoideum|Rep: Cysteine 
proteinase 3 - Dictyostelium discoideum (Slime mold) Length = 151 Score = 55.6 bits (128), Expect = 1e-06 Identities = 25/42 (59%), Positives = 32/42 (76%) Frame = +2 Query: 263 STTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLM 388 STTG++EG ++G LVSLSEQN++ S +GN GCNGGLM Sbjct: 103 STTGSVEGVTAIKTGKLVSLSEQNILRLSSSFGNEGCNGGLM 144 Score = 50.4 bits (115), Expect = 4e-05 Identities = 29/75 (38%), Positives = 41/75 (54%) Frame = +3 Query: 30 LGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKH 209 LG+N++ D+ + E+ +N A N Y K G + P + K P VDWR+ Sbjct: 31 LGLNQHADLSNEEY--RLNYLGTRAHIKLNGYHKRNL--GLRLNRP-HFKQPLNVDWREK 85 Query: 210 GAVTDIKDQGKCGSC 254 AVT +KDQG+CGSC Sbjct: 86 DAVTPVKDQGQCGSC 100 >UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Rep: Cathepsin W - Xenopus tropicalis (Western clawed frog) (Silurana tropicalis) Length = 303 Score = 55.2 bits (127), Expect = 1e-06 Identities = 28/73 (38%), Positives = 42/73 (57%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+ +E Q + G +SLSEQ +IDC+ NGC+GG +AF + GG+ +E Sbjct: 105 AFAAVANIEAQ-WAILGQTISLSEQQVIDCNTC--RNGCSGGYAWDAFMTVLQQGGLTSE 161 Query: 437 QTYPYEGVDDKCR 475 ++YPY G CR Sbjct: 162 KSYPYTGHVSNCR 174 >UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Liliopsida|Rep: Putative cysteine proteinase - Oryza sativa subsp. 
japonica (Rice) Length = 416 Score = 55.2 bits (127), Expect = 1e-06 Identities = 30/79 (37%), Positives = 44/79 (55%) Frame = +3 Query: 18 VSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVD 197 +SY LG+NK+ D+ + EF G K + + + + + + P V P D Sbjct: 66 MSYVLGLNKFSDLTYEEFAAKYTG----VKVDASAFATATTSSPDEEL-PVGVP-PATWD 119 Query: 198 WRKHGAVTDIKDQGKCGSC 254 WR +GAVTD+KDQG+CGSC Sbjct: 120 WRLNGAVTDVKDQGQCGSC 138 Score = 38.7 bits (86), Expect = 0.12 Identities = 22/54 (40%), Positives = 31/54 (57%) Frame = +2 Query: 260 FSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNG 421 FS GA+EG + +G L++LSEQ ++DCS + GG A +YI NG Sbjct: 141 FSAVGAVEGINAIMTGNLLTLSEQQVLDCSNT--GDCLKGGDPRAALQYIVKNG 192 >UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba histolytica|Rep: Cysteine protease 10 - Entamoeba histolytica Length = 297 Score = 55.2 bits (127), Expect = 1e-06 Identities = 25/59 (42%), Positives = 38/59 (64%), Gaps = 1/59 (1%) Frame = +2 Query: 314 VSLSEQNLIDCSE-QYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKCRYNPK 487 + LSEQ ++DCS+ +Y N GC G + N+F Y++D+ GI E+ YPY G + C + K Sbjct: 154 IDLSEQQIVDCSQGEYSNWGCTCGNVGNSFNYVRDH-GILLERDYPYTGKANNCSIDGK 211 Score = 38.7 bits (86), Expect = 0.12 Identities = 15/30 (50%), Positives = 20/30 (66%) Frame = +1 Query: 571 PVSVAIDASHTSFQLYSSGVYNEEECSSTD 660 PV+V+ID+S SFQ Y G+Y+E C D Sbjct: 239 PVAVSIDSSQLSFQFYEGGIYDEPNCKWVD 268 Score = 37.1 bits (82), Expect = 0.38 Identities = 13/28 (46%), Positives = 19/28 (67%) Frame = +3 Query: 171 NVKLPEQVDWRKHGAVTDIKDQGKCGSC 254 N ++ + +DWR G VT +K+Q KC SC Sbjct: 105 NKEVLDSIDWRSEGKVTPVKNQRKCASC 132 >UniRef50_Q5C1E8 Cluster: SJCHGC04937 protein; n=1; Schistosoma japonicum|Rep: SJCHGC04937 protein - Schistosoma japonicum (Blood fluke) Length = 235 Score = 55.2 bits (127), Expect = 1e-06 Identities = 26/50 (52%), Positives = 33/50 (66%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY 406 +F++ GALEGQ S L SLS Q L+DC++ YGN GC GLM A+ Y Sbjct: 186 
AFASVGALEGQMKLHSIPLQSLSTQQLVDCTQDYGNYGCASGLMKYAYDY 235 Score = 46.8 bits (106), Expect = 5e-04 Identities = 27/90 (30%), Positives = 48/90 (53%), Gaps = 5/90 (5%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSV---RGAKFISP-- 167 Y++ LV+Y LG+N++ D+ E + T + NKN + ++ + F + Sbjct: 97 YDLNLVTYTLGINQFSDLTWIE-LSTFYLHELSVNLNKNKLLNSLNMFKLQSYNFTTTLL 155 Query: 168 ANVKLPEQVDWRKHGAVTDIKDQGKCGSCG 257 + + +P+ DWR VT++K+Q KCG CG Sbjct: 156 STLNIPDNFDWRTKNVVTNVKNQEKCG-CG 184 >UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 234 Score = 54.8 bits (126), Expect = 2e-06 Identities = 29/77 (37%), Positives = 44/77 (57%), Gaps = 2/77 (2%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIK--DNGGID 430 +F + A+E F + G L SLSEQ L+DC + GC+G L AF+Y+K +G + Sbjct: 44 AFGSCAAMESSWFLKHGTLYSLSEQCLVDCC--HDCLGCHGCLPSLAFEYVKIFMHGLFE 101 Query: 431 TEQTYPYEGVDDKCRYN 481 TE YPY+ C+++ Sbjct: 102 TEDNYPYQAEHHSCKFD 118 Score = 39.5 bits (88), Expect = 0.071 Identities = 14/25 (56%), Positives = 20/25 (80%) Frame = +3 Query: 180 LPEQVDWRKHGAVTDIKDQGKCGSC 254 +P+++D+R GAV +IKDQ CGSC Sbjct: 18 IPDEIDYRTKGAVNEIKDQKHCGSC 42 Score = 38.7 bits (86), Expect = 0.12 Identities = 17/41 (41%), Positives = 26/41 (63%) Frame = +1 Query: 526 EGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEEC 648 + +E +L VA GP +V I+A F+LYSSGV++ +C Sbjct: 133 KSNEDQLKTEVAANGPYAVMINADSEQFRLYSSGVFDNPKC 173 >UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H-like cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L, S or H-like cysteine peptidase - Trichomonas vaginalis G3 Length = 473 Score = 54.8 bits (126), Expect = 2e-06 Identities = 30/85 (35%), Positives = 43/85 (50%), Gaps = 1/85 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFK-YIKDNGGIDT 433 +F T +LE Q ++G LS ++DC+ Y 
N+ C GG AF+ I N + Sbjct: 278 AFGTAESLESQLALKTGVFRELSVNQIMDCTWDYNNSACGGGEAGPAFRSLINQNFKLFL 337 Query: 434 EQTYPYEGVDDKCRYNPKNTGAEDV 508 E+ YPY GV C NP++ A V Sbjct: 338 EKDYPYIGVAGYCNRNPEHPVARVV 362 >UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep: Viral cathepsin - Cydia pomonella granulosis virus (CpGV) (Cydia pomonellagranulovirus) Length = 333 Score = 54.8 bits (126), Expect = 2e-06 Identities = 28/76 (36%), Positives = 43/76 (56%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FST +E + + ++LSEQ+L++C NNGC GGLM A + I GG+ + Sbjct: 150 AFSTIANIESLYNIKYDKALNLSEQHLVNCDNI--NNGCAGGLMHWALESILQEGGVVSA 207 Query: 437 QTYPYEGVDDKCRYNP 484 + PY G D C+ +P Sbjct: 208 ENEPYYGFDGVCKKSP 223 Score = 44.0 bits (99), Expect = 0.003 Identities = 28/75 (37%), Positives = 40/75 (53%), Gaps = 2/75 (2%) Frame = +3 Query: 36 MNKYGDMLHHEFVKTMNGFNKTAKHNKNLY-MKGGSVRGAKFISPANVKLPEQVDWR-KH 209 +N+Y D+ + ++ GF K N + + M SV K LPE +DWR KH Sbjct: 77 INEYSDLNKNALLRRTTGFRLGLKKNPSAFTMTECSVVVIK--DEPQALLPETLDWRDKH 134 Query: 210 GAVTDIKDQGKCGSC 254 G VT +K+Q +CGSC Sbjct: 135 G-VTPVKNQMECGSC 148 >UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA, isoform A; n=2; Endopterygota|Rep: PREDICTED: similar to CG3074-PA, isoform A - Tribolium castaneum Length = 445 Score = 54.4 bits (125), Expect = 2e-06 Identities = 24/54 (44%), Positives = 35/54 (64%) Frame = +2 Query: 314 VSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKCR 475 V+LS Q+L+ C + G CNGG +D A+ YI+ G +D EQ +PY ++KCR Sbjct: 246 VTLSAQHLLSCDRR-GQQSCNGGYLDRAWSYIRKIGLVD-EQCFPYSATNEKCR 297 >UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|Rep: Thiol protease - Triticum aestivum (Wheat) Length = 374 Score = 54.4 bits (125), Expect = 2e-06 Identities = 26/82 (31%), Positives = 40/82 (48%), Gaps = 1/82 (1%) Frame = +3 Query: 12 GLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQ 191 G +SY LG+N++ D+ H EF+ T + + G V PA +P Sbjct: 88 
GRLSYTLGVNQFADLTHEEFLATHTSRRVVPSEEMVITTRAGVVVEGANCQPAPNAVPRS 147 Query: 192 VDWRKHGAVTDIKDQGK-CGSC 254 ++W VT +K+QGK CG+C Sbjct: 148 INWVNQSKVTPVKNQGKVCGAC 169 Score = 54.0 bits (124), Expect = 3e-06 Identities = 29/73 (39%), Positives = 38/73 (52%), Gaps = 1/73 (1%) Frame = +2 Query: 257 SFSTTGALEGQH-FRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDT 433 +FS +E + + G LSEQ LIDC + GC G M NA+ ++ NGGI Sbjct: 171 AFSAVATIESAYAIAKRGEPPVLSEQELIDCDTF--DRGCTSGEMYNAYFWVLRNGGIAN 228 Query: 434 EQTYPYEGVDDKC 472 TYPY+ D KC Sbjct: 229 SSTYPYKETDGKC 241 Score = 34.3 bits (75), Expect = 2.7 Identities = 18/50 (36%), Positives = 29/50 (58%) Frame = +1 Query: 487 EHRC*GRGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYN 636 EH R + + E++LM AVA V PV+V D++ F+ Y +G+Y+ Sbjct: 248 EHAATIRDYKFVKHNCEEQLMAAVA-VRPVAVGFDSNDECFKFYQAGLYD 296 >UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_36, whole genome shotgun sequence - Paramecium tetraurelia Length = 307 Score = 54.4 bits (125), Expect = 2e-06 Identities = 30/73 (41%), Positives = 40/73 (54%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS GA+EG + G+ LSEQ L+DC+ G GCNGG D A YI + G + E Sbjct: 132 TFSAIGAVEGFLAIRKGFKGVLSEQQLVDCAVDAG-EGCNGGNSDLALDYIAEVGSV-YE 189 Query: 437 QTYPYEGVDDKCR 475 + Y Y D C+ Sbjct: 190 RDYEYTAKDGVCK 202 >UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 325 Score = 54.0 bits (124), Expect = 3e-06 Identities = 32/79 (40%), Positives = 42/79 (53%), Gaps = 6/79 (7%) Frame = +2 Query: 257 SFSTTGALEGQHFRQS---GYLVSLSEQNLIDCSEQYGNN---GCNGGLMDNAFKYIKDN 418 +FSTTGA+E +SLSEQ ++DC ++ N GC G MD +FKYI N Sbjct: 142 AFSTTGAIESALLISGVGEANTLSLSEQEIVDCVKEPEYNQLGGCQDGYMDESFKYIIKN 201 Query: 419 GGIDTEQTYPYEGVDDKCR 475 I YPY V+ 
KC+ Sbjct: 202 -KISKAADYPYTAVEGKCK 219 Score = 38.3 bits (85), Expect = 0.16 Identities = 21/42 (50%), Positives = 29/42 (69%) Frame = +1 Query: 511 FVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYN 636 +VD+P GD + L+ A+ PVSVAIDA + Q Y+SGVY+ Sbjct: 232 YVDVPSGDCKALLTALQD-HPVSVAIDAK--NLQYYTSGVYS 270 Score = 35.1 bits (77), Expect = 1.5 Identities = 14/35 (40%), Positives = 22/35 (62%) Frame = +3 Query: 150 AKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSC 254 +K P + +++D+ G VT +KDQG+CGSC Sbjct: 106 SKIYKPKDDVEIKEIDFTTLGKVTPVKDQGRCGSC 140 >UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative cysteine proteinase - Oryza sativa subsp. japonica (Rice) Length = 385 Score = 54.0 bits (124), Expect = 3e-06 Identities = 31/82 (37%), Positives = 46/82 (56%), Gaps = 5/82 (6%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNG-GIDT 433 +FS A+E + ++G L++LSEQ ++DCS G CNGG +AF Y+ G +D Sbjct: 172 AFSVAAAVESINMIRTGNLLTLSEQQILDCS---GAGDCNGGYPYDAFDYVIKTGISLDN 228 Query: 434 --EQTY--PYEGVDDKCRYNPK 487 Y PYE KCR++P+ Sbjct: 229 RGNPPYYPPYENQKQKCRFDPR 250 Score = 41.5 bits (93), Expect = 0.018 Identities = 25/76 (32%), Positives = 40/76 (52%) Frame = +3 Query: 18 VSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVD 197 ++Y+LG+N++ DM EF G +T +L + G+V K PA +P + Sbjct: 87 MTYRLGLNQFSDMTFEEFAGKFTG-GRTGSIAGDL--RDGAVTYCK--PPAVGYVPPSWN 141 Query: 198 WRKHGAVTDIKDQGKC 245 W K+G VT +K+Q C Sbjct: 142 WTKYGVVTPVKNQLTC 157 >UniRef50_Q9BL26 Cluster: Putative uncharacterized protein; n=4; Caenorhabditis elegans|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 345 Score = 54.0 bits (124), Expect = 3e-06 Identities = 29/77 (37%), Positives = 47/77 (61%), Gaps = 2/77 (2%) Frame = +2 Query: 257 SFSTTGALEGQHFRQS-GYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDT 433 +F+ T ++E + + + G L+S SEQ LIDC++Q G GC NA Y+ + GI+T Sbjct: 108 AFAITSSIESMYAKATNGTLLSFSEQQLIDCNDQ-GYKGCEEQFAMNAIGYLATH-GIET 165 Query: 434 
EQTYPY-EGVDDKCRYN 481 E YPY + ++KC ++ Sbjct: 166 EADYPYVDKTNEKCTFD 182 Score = 32.7 bits (71), Expect = 8.1 Identities = 12/22 (54%), Positives = 16/22 (72%) Frame = +3 Query: 186 EQVDWRKHGAVTDIKDQGKCGS 251 E +DWR+ G V +KDQGKC + Sbjct: 84 EFLDWREKGIVGPVKDQGKCNA 105 >UniRef50_Q23H06 Cluster: Papain family cysteine protease containing protein; n=18; Tetrahymena thermophila|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 349 Score = 54.0 bits (124), Expect = 3e-06 Identities = 33/90 (36%), Positives = 45/90 (50%), Gaps = 3/90 (3%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDC---SEQYGNNGCNGGLMDNAFKYIKDNGGI 427 +FS TG +E +F Q+ LV SEQ L+DC + Y ++GC+GG Y GI Sbjct: 167 AFSATGVMESFNFIQNKALVEFSEQQLLDCVIPANGYPSSGCHGGWPVQCIDY-ASKVGI 225 Query: 428 DTEQTYPYEGVDDKCRYNPKNTGAEDVASW 517 + Y Y GV +CR N G + SW Sbjct: 226 LNQDRYYYFGVQMQCRVTGTNNGFKP-KSW 254 Score = 39.9 bits (89), Expect = 0.053 Identities = 14/32 (43%), Positives = 20/32 (62%) Frame = +3 Query: 159 ISPANVKLPEQVDWRKHGAVTDIKDQGKCGSC 254 ++ N + +DWR GAVT +K QG CG+C Sbjct: 134 LNSKNFTIATSIDWRSRGAVTQVKWQGNCGAC 165 >UniRef50_A0CHZ5 Cluster: Chromosome undetermined scaffold_186, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_186, whole genome shotgun sequence - Paramecium tetraurelia Length = 311 Score = 54.0 bits (124), Expect = 3e-06 Identities = 23/52 (44%), Positives = 30/52 (57%) Frame = +2 Query: 320 LSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKCR 475 LS+Q+LIDCS YGN GC GG + Y+KD G+ E+ YP C+ Sbjct: 158 LSQQDLIDCSGSYGNQGCQGGFISGTLNYVKDK-GLAYEKDYPTTQTSGVCK 208 >UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin L-like cysteine proteinase precursor - Acanthoscelides obtectus (Bean weevil) Length = 321 Score = 53.6 bits (123), Expect = 4e-06 Identities = 30/85 (35%), Positives = 47/85 (55%), Gaps = 1/85 (1%) Frame = +3 Query: 3 
YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK- 179 Y G ++++G+N++GDM EF + + A + + G +S NV Sbjct: 61 YHNGEETFEMGINQFGDMTQEEFKRML------ALQKPQMPLPRGDE-----VSFDNVND 109 Query: 180 LPEQVDWRKHGAVTDIKDQGKCGSC 254 +P+ VDWR+ GAVT++K QG CGSC Sbjct: 110 IPKTVDWREKGAVTEVKKQGNCGSC 134 Score = 48.4 bits (110), Expect = 2e-04 Identities = 22/40 (55%), Positives = 30/40 (75%), Gaps = 1/40 (2%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSE-QYGNNGC 373 +FS G++EGQ F ++G L SLS QNL+DC+ +YGN GC Sbjct: 136 AFSAVGSIEGQVFLKNGSLESLSAQNLVDCAGIEYGNFGC 175 Score = 42.7 bits (96), Expect = 0.008 Identities = 18/44 (40%), Positives = 30/44 (68%) Frame = +1 Query: 508 GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNE 639 G+ + +GDE L +AVAT+GP+S+A+D +H F Y G+ ++ Sbjct: 218 GYQAVSKGDEVVLAQAVATIGPISIALDGNHIMF--YRRGIVSK 259 >UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Rep: Cysteine protease - Babesia equi Length = 438 Score = 53.6 bits (123), Expect = 4e-06 Identities = 25/72 (34%), Positives = 41/72 (56%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+ G++E + + G + LSEQ L++C E +NGC G L + A +YIK GI Sbjct: 250 AFAAVGSVESLYLIKKGQALDLSEQELVNCEE--NSNGCEGDLPNKALEYIKAK-GISHS 306 Query: 437 QTYPYEGVDDKC 472 + PY +++C Sbjct: 307 KDLPYHAANEEC 318 Score = 46.0 bits (104), Expect = 8e-04 Identities = 31/93 (33%), Positives = 43/93 (46%), Gaps = 10/93 (10%) Frame = +3 Query: 6 EMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGS----------VRGAK 155 + G SY+ G+NK+ DM EF + + K+L + VR AK Sbjct: 157 QTGEESYEKGINKFSDMTDEEFNLRFPALS-VEELKKSLEVSASEEFTSPEHLDKVRIAK 215 Query: 156 FISPANVKLPEQVDWRKHGAVTDIKDQGKCGSC 254 + + E +DWRK VT +KDQG CGSC Sbjct: 216 GLGVEDSVDGEDLDWRKLNGVTPVKDQGNCGSC 248 >UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: Vivapain-4 - Plasmodium vivax Length = 484 Score = 53.2 bits (122), Expect = 5e-06 Identities = 28/67 (41%), Positives = 39/67 (58%) Frame = +2 Query: 257 
SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F GA+E Q+ + V +SEQ L+DCS++ N GC GGL AF + D G + +E Sbjct: 288 AFGAVGAVESQYAIRKNQHVLISEQELVDCSDK--NFGCFGGLASLAFDDMIDLGYLCSE 345 Query: 437 QTYPYEG 457 YPY G Sbjct: 346 SDYPYVG 352 Score = 51.6 bits (118), Expect = 2e-05 Identities = 31/82 (37%), Positives = 45/82 (54%), Gaps = 3/82 (3%) Frame = +3 Query: 18 VSYKLGMNKYGDMLHHEFVKTMNG--FNKTAKHNKNLYMKGGSVRGAKFISPANVKLP-E 188 + YK G N+Y D+ EF KTM F+ K + Y+ K+ PA+ + E Sbjct: 206 ILYKKGTNQYSDISFEEFRKTMLTLRFDLKKKLANSPYVSNYDDVLKKY-KPADAVVDNE 264 Query: 189 QVDWRKHGAVTDIKDQGKCGSC 254 + DWR+H AV++IK+Q CGSC Sbjct: 265 KYDWREHNAVSEIKNQNLCGSC 286 >UniRef50_Q23H10 Cluster: Papain family cysteine protease containing protein; n=14; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 336 Score = 53.2 bits (122), Expect = 5e-06 Identities = 34/98 (34%), Positives = 43/98 (43%), Gaps = 4/98 (4%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDC---SEQYGNNGCNGGLMDNAFKYIKDNGGI 427 SFS +E +F Q+ LV SEQ L+DC + Y + GCNGG Y GI Sbjct: 153 SFSAAAVMESFNFIQNKALVDFSEQQLVDCVIPANGYNSYGCNGGWPVQCLDY-ASKVGI 211 Query: 428 DTEQTYPYEGVDDKCRYNPKNTGAEDVASWTS-PRATN 538 T YPY V C + G + SW P +N Sbjct: 212 TTLDKYPYVAVQKNCNVTGTDNGFKP-KSWIQIPNTSN 248 Score = 46.0 bits (104), Expect = 8e-04 Identities = 24/82 (29%), Positives = 42/82 (51%), Gaps = 4/82 (4%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKH-NKNLYMKG---GSVRGAKFISPANVKLPE 188 +Y + +N++ DM EF + + + H K + + + +S ++ L + Sbjct: 70 TYSVHLNQFSDMTKEEFAEKILMKSDLVDHLMKGISQEATHNDTNNNETQLSSNSLTLAD 129 Query: 189 QVDWRKHGAVTDIKDQGKCGSC 254 +DWR GAVT +K+QG CGSC Sbjct: 130 SIDWRTKGAVTSVKNQGGCGSC 151 >UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 306 Score = 53.2 bits (122), Expect = 5e-06 Identities = 23/43 (53%), 
Positives = 30/43 (69%) Frame = +1 Query: 535 EQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 E +L +AVAT GP ++IDAS SF LY G+Y+E +CS DL Sbjct: 208 ETELAKAVATYGPAMISIDASQHSFMLYKEGIYDEPKCSEEDL 250 Score = 49.2 bits (112), Expect = 9e-05 Identities = 28/79 (35%), Positives = 36/79 (45%), Gaps = 2/79 (2%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYI--KDNGGID 430 +FS +E Q + L LSEQNL+DC GC GG A +Y+ K N Sbjct: 114 AFSAIQVIESQVAKNQKQLYDLSEQNLLDCVTSC--FGCGGGWSPGALEYVYEKQNSKFM 171 Query: 431 TEQTYPYEGVDDKCRYNPK 487 YPY V C+Y+ K Sbjct: 172 LTTDYPYTAVQGTCKYDNK 190 Score = 41.1 bits (92), Expect = 0.023 Identities = 16/30 (53%), Positives = 22/30 (73%), Gaps = 2/30 (6%) Frame = +3 Query: 171 NVK--LPEQVDWRKHGAVTDIKDQGKCGSC 254 N+K +P ++DWR+ G V IK+QG CGSC Sbjct: 83 NIKNDVPTEIDWREQGIVNKIKNQGACGSC 112 >UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precursor; n=20; Psoroptidia|Rep: Major mite fecal allergen Der f 1 precursor - Dermatophagoides farinae (House-dust mite) Length = 321 Score = 53.2 bits (122), Expect = 5e-06 Identities = 25/73 (34%), Positives = 41/73 (56%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS A E + + LSEQ L+DC+ Q+ GC+G + +YI+ NG ++ E Sbjct: 135 AFSGVAATESAYLAYRNTSLDLSEQELVDCASQH---GCHGDTIPRGIEYIQQNGVVE-E 190 Query: 437 QTYPYEGVDDKCR 475 ++YPY + +CR Sbjct: 191 RSYPYVAREQRCR 203 >UniRef50_UPI0001509FF9 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 348 Score = 52.8 bits (121), Expect = 7e-06 Identities = 27/78 (34%), Positives = 43/78 (55%), Gaps = 1/78 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+ +E F ++G + +SEQNL+DC + N CNGG + A +YI N G+ ++ Sbjct: 166 AFAAVAGVESALFLKNGKIPDVSEQNLLDCDQ--SNQDCNGGDREKAIQYIL-NQGLTSQ 222 Query: 437 QTYPYEGV-DDKCRYNPK 487 T PY KC++ K Sbjct: 223 
LTNPYRAYKQKKCKFQVK 240 >UniRef50_Q22LI1 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 987 Score = 52.8 bits (121), Expect = 7e-06 Identities = 29/84 (34%), Positives = 39/84 (46%), Gaps = 5/84 (5%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDC-----SEQYGNNGCNGGLMDNAFKYIKDNG 421 SFS G +E + ++G L+ LSEQ L+DC + Y +NGCNGG A +Y G Sbjct: 149 SFSAAGLMEAFQYFKTGNLIDLSEQQLVDCDNSSFDKSYYSNGCNGGYPQEAVEYASKYG 208 Query: 422 GIDTEQTYPYEGVDDKCRYNPKNT 493 + YPY C T Sbjct: 209 IVPLTD-YPYVKQQQPCAIKSPTT 231 Score = 46.0 bits (104), Expect = 8e-04 Identities = 25/78 (32%), Positives = 37/78 (47%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 200 +++LG+N+Y M EF + + + K K + V + +DW Sbjct: 71 TFQLGLNEYAHMTSQEFAEVFLTPSISKSQQKQPKPKPQPQPHPNNSTNTTVTITP-IDW 129 Query: 201 RKHGAVTDIKDQGKCGSC 254 R GAVT +K QGKCGSC Sbjct: 130 RNKGAVTSVKRQGKCGSC 147 >UniRef50_UPI000155637A Cluster: PREDICTED: similar to ENSANGP00000013730, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to ENSANGP00000013730, partial - Ornithorhynchus anatinus Length = 229 Score = 52.4 bits (120), Expect = 9e-06 Identities = 28/50 (56%), Positives = 34/50 (68%), Gaps = 1/50 (2%) Frame = +2 Query: 257 SFSTTGALEGQHF-RQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFK 403 SF+TTG LEG F + + LV LS+Q LIDCS GN GC+GGL AF+ Sbjct: 81 SFATTGTLEGALFLKVTVQLVPLSQQMLIDCSWDVGNFGCDGGLEWQAFR 130 Score = 50.4 bits (115), Expect = 4e-05 Identities = 20/29 (68%), Positives = 23/29 (79%) Frame = +3 Query: 168 ANVKLPEQVDWRKHGAVTDIKDQGKCGSC 254 ANV LPE +DWR +GAVT +KDQ CGSC Sbjct: 51 ANVALPESLDWRLYGAVTPVKDQAVCGSC 79 >UniRef50_UPI0000E224BB Cluster: PREDICTED: hypothetical protein; n=1; Pan troglodytes|Rep: PREDICTED: hypothetical protein - Pan troglodytes Length = 143 Score = 52.4 bits (120), Expect = 9e-06 Identities = 23/40 (57%), Positives = 27/40 (67%) Frame = +1 
Query: 544 LMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 L +AVATVGP+SVA+ ASH SFQ Y G+Y E C L Sbjct: 45 LAKAVATVGPISVAVGASHVSFQFYKKGIYFEPRCDPEGL 84 >UniRef50_Q23FQ5 Cluster: Papain family cysteine protease containing protein; n=4; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 332 Score = 52.4 bits (120), Expect = 9e-06 Identities = 28/75 (37%), Positives = 38/75 (50%), Gaps = 2/75 (2%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYG--NNGCNGGLMDNAFKYIKDNGGID 430 +F T G LE ++ +S L+ SEQ L+DC+ Q G GC+G FKY GI Sbjct: 151 AFGTAGVLESFYYLKSKQLLKFSEQQLLDCARQAGFDTYGCDGAWQQEYFKY-AIKYGIV 209 Query: 431 TEQTYPYEGVDDKCR 475 +YPY G C+ Sbjct: 210 QGSSYPYVGYQTTCK 224 Score = 44.4 bits (100), Expect = 0.002 Identities = 26/78 (33%), Positives = 39/78 (50%) Frame = +3 Query: 21 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 200 +Y + +N++ D EFV+ + NK + K G + A V P VDW Sbjct: 77 TYTVSLNQFSDYSQEEFVQRI--LNKHISRSDADIQKEQEPNGN--LRKA-VNYPTSVDW 131 Query: 201 RKHGAVTDIKDQGKCGSC 254 R GA+ I++QG+CGSC Sbjct: 132 RNSGALNPIQNQGQCGSC 149 >UniRef50_UPI0000E492F4 Cluster: PREDICTED: similar to cathepsin L-like proteinase; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin L-like proteinase - Strongylocentrotus purpuratus Length = 329 Score = 52.0 bits (119), Expect = 1e-05 Identities = 27/46 (58%), Positives = 32/46 (69%) Frame = +1 Query: 520 IPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSST 657 + +G+E L EAV PV VAIDAS SFQLY SGVY++ CSST Sbjct: 226 VTQGNESALAEAVYFT-PVVVAIDASQPSFQLYVSGVYSDPNCSST 270 Score = 41.1 bits (92), Expect = 0.023 Identities = 27/86 (31%), Positives = 42/86 (48%) Frame = +3 Query: 3 YEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKL 182 Y+ G S+K+ MN++ D + K N F+ A NL + R + S ++ L Sbjct: 66 YDEGRRSFKMAMNEFADQ---DMSKVRNKFDVQA----NL-LNAERKRKSSGTSSSSSTL 117 Query: 183 PEQVDWRKHGAVTDIKDQGKCGSCGP 260 P DWRK G V +++QG+ S P Sbjct: 118 
PSSWDWRKEGKVNPVRNQGQMNSALP 143 >UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestivum|Rep: Cysteine protease - Triticum aestivum (Wheat) Length = 371 Score = 52.0 bits (119), Expect = 1e-05 Identities = 31/97 (31%), Positives = 46/97 (47%), Gaps = 1/97 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+ +E + G LV LS Q L+DCS ++ C G +A +IK GG+ TE Sbjct: 179 AFAAAATVESLNKINGGELVDLSVQELVDCSTGVFSSPCGYGWPKSALAWIKSKGGLLTE 238 Query: 437 QTYPYEGVDDKCR-YNPKNTGAEDVASWTSPRATNRS 544 YPY +C ++ A+ AS RA R+ Sbjct: 239 AEYPYMAKRGRCAVHDTARVSAKSPASRMYGRAAARA 275 Score = 44.4 bits (100), Expect = 0.002 Identities = 31/92 (33%), Positives = 44/92 (47%), Gaps = 13/92 (14%) Frame = +3 Query: 18 VSYKLGMNKYGDMLHHEFV-KTMNGFNKTAKHNKNLY--MKGGSVRGAKFISPA-----N 173 + Y+LG N++ D+ + EF+ + + G A L + G V GA A N Sbjct: 86 LGYELGENEFTDLTNEEFMARYVGGAYGGAGDGGGLITTLAGDVVEGAASSKNAIEEDRN 145 Query: 174 VKL-----PEQVDWRKHGAVTDIKDQGKCGSC 254 + + P Q DWR+HG VT K QG CG C Sbjct: 146 LTMTASDPPRQFDWREHGVVTPAKQQGACGCC 177 >UniRef50_Q650X9 Cluster: Putative cysteine proteinase; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative cysteine proteinase - Oryza sativa subsp. 
japonica (Rice) Length = 361 Score = 52.0 bits (119), Expect = 1e-05 Identities = 32/79 (40%), Positives = 37/79 (46%), Gaps = 1/79 (1%) Frame = +3 Query: 18 VSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANV-KLPEQV 194 +SYKLG+NK+ DM EF G A V A P V P Sbjct: 78 MSYKLGLNKFSDMTVEEFAAKYTGVQVDAG--------AAVVTSAPDEQPVLVGDAPPVW 129 Query: 195 DWRKHGAVTDIKDQGKCGS 251 DWR HGAVT +KDQG CG+ Sbjct: 130 DWRDHGAVTPVKDQGSCGT 148 >UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag-RP - Bombyx mori (Silk moth) Length = 404 Score = 52.0 bits (119), Expect = 1e-05 Identities = 24/54 (44%), Positives = 35/54 (64%) Frame = +2 Query: 314 VSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKCR 475 V +S Q L+ C + G GCNGG +D AF ++K + G+ +EQ +PYEG +CR Sbjct: 234 VRMSSQTLLSCHLK-GQRGCNGGNLDIAFDFVKTH-GLVSEQCFPYEGAVTQCR 285 >UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_158, whole genome shotgun sequence - Paramecium tetraurelia Length = 308 Score = 51.6 bits (118), Expect = 2e-05 Identities = 27/73 (36%), Positives = 41/73 (56%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F+ GA+E S + LSEQ LIDC + N GC G ++N+ + ++NG + T Sbjct: 136 AFAAIGAVESVLRINSVTNLDLSEQQLIDCDLE--NQGCEDGNLNNSLNWAQNNG-VTTS 192 Query: 437 QTYPYEGVDDKCR 475 +YPY G D C+ Sbjct: 193 ASYPYTGQTDGCK 205 >UniRef50_UPI0000D566F0 Cluster: PREDICTED: similar to CathePsin L family member (cpl-1); n=1; Tribolium castaneum|Rep: PREDICTED: similar to CathePsin L family member (cpl-1) - Tribolium castaneum Length = 185 Score = 51.2 bits (117), Expect = 2e-05 Identities = 31/82 (37%), Positives = 47/82 (57%), Gaps = 7/82 (8%) Frame = +2 Query: 275 ALEGQ---HFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNA----FKYIKDNGGIDT 433 ALEG H Q +LS++NLIDC Y + C + +A ++Y+ ++GGIDT Sbjct: 29 ALEGHVGIHLGQKNQ--TLSQENLIDCV--YSDFQCKQEMKRSALVDCYQYMVNSGGIDT 84 Query: 434 EQTYPYEGVDDKCRYNPKNTGA 499 ++YPY+ CR+ 
P+N GA Sbjct: 85 LESYPYDQKPPLCRFKPENIGA 106 Score = 40.3 bits (90), Expect = 0.040 Identities = 19/43 (44%), Positives = 27/43 (62%) Frame = +1 Query: 505 RGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVY 633 +G+ + EGDE++L V T+GPVSV + A F LY G+Y Sbjct: 109 QGYGTVTEGDEEELKAVVGTLGPVSVIVTAD-LIFILYRKGIY 150 >UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Piroplasmida|Rep: Cysteine proteinase, putative - Theileria parva Length = 460 Score = 51.2 bits (117), Expect = 2e-05 Identities = 25/72 (34%), Positives = 39/72 (54%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +F++ ++E + + LSEQ L+DC + + GC GG D A KYI+ N G+ T+ Sbjct: 276 AFASVSSVESLYKIYRNVTLDLSEQELVDC--ETSSKGCEGGFGDTALKYIQ-NKGVSTD 332 Query: 437 QTYPYEGVDDKC 472 PY G + C Sbjct: 333 SEIPYLGKKNNC 344 Score = 35.1 bits (77), Expect = 1.5 Identities = 17/35 (48%), Positives = 23/35 (65%), Gaps = 1/35 (2%) Frame = +3 Query: 153 KFISPANVKLPEQVDWRKHGAVTDIKDQG-KCGSC 254 K + P N+ E +DWRK V+ IK+QG +CGSC Sbjct: 241 KDVDPKNIT-GEGLDWRKADGVSKIKNQGLECGSC 274 >UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=17; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3 Length = 318 Score = 50.8 bits (116), Expect = 3e-05 Identities = 25/44 (56%), Positives = 30/44 (68%) Frame = +1 Query: 532 DEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 663 +E +L A G VS+AIDAS FQLYSSG+YN + CSST L Sbjct: 219 NEDELKAGCAKGGVVSIAIDASGYDFQLYSSGIYNPKSCSSTFL 262 Score = 49.2 bits (112), Expect = 9e-05 Identities = 33/95 (34%), Positives = 46/95 (48%), Gaps = 2/95 (2%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKY-IKDNGGI-D 430 +FS A E Q + G L+SL+EQN++DC + GC+GG A+ Y IK G+ Sbjct: 126 AFSVVQAQESQWALKKGQLLSLAEQNMVDCVDTC--YGCDGGDEYLAYDYVIKHQKGLWM 183 Query: 431 TEQTYPYEGVDDKCRYNPKNTGAEDVASWTSPRAT 535 E YPY D C++ G S+ P T Sbjct: 184 LETDYPYTARDGSCKFKAAK-GVTLTKSYVRPTTT 217 Score = 37.1 bits (82), 
Expect = 0.38 Identities = 14/25 (56%), Positives = 17/25 (68%) Frame = +3 Query: 180 LPEQVDWRKHGAVTDIKDQGKCGSC 254 +P+ VDWR V IKDQ +CGSC Sbjct: 100 VPDAVDWRNAKIVNPIKDQAQCGSC 124 >UniRef50_A0DKP6 Cluster: Chromosome undetermined scaffold_54, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_54, whole genome shotgun sequence - Paramecium tetraurelia Length = 312 Score = 50.8 bits (116), Expect = 3e-05 Identities = 30/73 (41%), Positives = 38/73 (52%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 436 +FS TG LE VSLSEQ+LIDC + + GC G N +K+ N GI T Sbjct: 139 AFSVTGTLEVYQKIYQKKNVSLSEQHLIDCDQL--SRGCTDGSNINGYKFAISN-GIATN 195 Query: 437 QTYPYEGVDDKCR 475 YPY G + C+ Sbjct: 196 IEYPYVGYNQTCK 208 >UniRef50_A0DCA5 Cluster: Chromosome undetermined scaffold_45, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_45, whole genome shotgun sequence - Paramecium tetraurelia Length = 603 Score = 50.8 bits (116), Expect = 3e-05 Identities = 28/73 (38%), Positives = 37/73 (50%), Gaps = 1/73 (1%) Frame = +2 Query: 257 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGN-NGCNGGLMDNAFKYIKDNGGIDT 433 + S+ LE +Q+ V LS Q +IDCS+ GC G+ +A KYIK N G+ Sbjct: 426 AISSKNCLESYFRQQTSKNVKLSLQQVIDCSDSLDPLKGCQNGIPTDALKYIKSN-GLHW 484 Query: 434 EQTYPYEGVDDKC 472 E YPY G C Sbjct: 485 ESKYPYTGKAQAC 497 Score = 33.1 bits (72), Expect = 6.1 Identities = 14/31 (45%), Positives = 20/31 (64%), Gaps = 1/31 (3%) Frame = +3 Query: 156 FISPANVKLPEQVDWR-KHGAVTDIKDQGKC 245 FIS AN+ E +DWR + V+++ DQG C Sbjct: 391 FISDANLTADEDIDWRVNNNIVSEVFDQGDC 421 Database: uniref50 Posted date: Oct 5, 2007 11:19 AM Number of letters in database: 575,637,011 Number of sequences in database: 1,657,284 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.279 0.0580 0.190 Matrix: BLOSUM62 Gap Penalties: Existence: 9, Extension: 2 Number of Hits to DB: 637,629,055 Number of Sequences: 1657284 Number of extensions: 13124829 Number of 
successful extensions: 51269
Number of sequences better than 10.0: 411
Number of HSP's better than 10.0 without gapping: 47817
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 50892
length of database: 575,637,011
effective HSP length: 98
effective length of database: 413,223,179
effective search space used: 50413227838
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)