BLASTX 2.2.12 [Aug-07-2005]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= fner12p14f
(584 letters)
Database: uniref50
1,657,284 sequences; 575,637,011 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-... 283 2e-75
UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ... 276 3e-73
UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhip... 243 3e-63
UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; ... 225 5e-58
UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome s... 221 7e-57
UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ... 193 2e-48
UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve... 187 1e-46
UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve... 169 6e-41
UniRef50_UPI000155637A Cluster: PREDICTED: similar to ENSANGP000... 165 7e-40
UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v... 162 5e-39
UniRef50_Q24E33 Cluster: Papain family cysteine protease contain... 134 1e-30
UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox... 128 7e-29
UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n... 126 3e-28
UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei... 126 4e-28
UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; ... 125 7e-28
UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j... 125 9e-28
UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt... 124 2e-27
UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ... 124 2e-27
UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1... 124 2e-27
UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi... 124 2e-27
UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ... 120 2e-26
UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs... 120 3e-26
UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h... 119 4e-26
UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=... 119 6e-26
UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata... 119 6e-26
UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Ac... 118 8e-26
UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ... 118 8e-26
UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re... 118 1e-25
UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=... 118 1e-25
UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [C... 118 1e-25
UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ... 117 2e-25
UniRef50_Q239L8 Cluster: Papain family cysteine protease contain... 117 2e-25
UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar ... 116 3e-25
UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;... 116 3e-25
UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ... 116 3e-25
UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re... 116 4e-25
UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy... 116 4e-25
UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C... 116 4e-25
UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;... 116 5e-25
UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ... 115 7e-25
UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca... 115 9e-25
UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35... 114 1e-24
UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory... 114 1e-24
UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy... 114 2e-24
UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat... 114 2e-24
UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr... 114 2e-24
UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ... 114 2e-24
UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18... 114 2e-24
UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ... 113 2e-24
UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae... 113 3e-24
UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|... 113 3e-24
UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryz... 112 5e-24
UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep... 112 7e-24
UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa... 112 7e-24
UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryz... 112 7e-24
UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; D... 112 7e-24
UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel... 112 7e-24
UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph... 111 1e-23
UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ... 110 3e-23
UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi... 110 3e-23
UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er... 110 3e-23
UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal... 109 4e-23
UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ... 109 4e-23
UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ... 109 4e-23
UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ... 108 8e-23
UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e... 108 8e-23
UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty... 107 1e-22
UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|... 107 1e-22
UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3... 107 2e-22
UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n... 107 3e-22
UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; ... 107 3e-22
UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ... 107 3e-22
UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Re... 106 3e-22
UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:... 106 4e-22
UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus... 106 4e-22
UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;... 105 6e-22
UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; ... 105 8e-22
UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio... 105 1e-21
UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain... 105 1e-21
UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ... 105 1e-21
UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n... 104 1e-21
UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M... 104 1e-21
UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep... 103 2e-21
UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc... 103 2e-21
UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t... 103 3e-21
UniRef50_Q22A69 Cluster: Papain family cysteine protease contain... 103 4e-21
UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil... 102 5e-21
UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip... 102 5e-21
UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcopt... 102 7e-21
UniRef50_Q22RC9 Cluster: Papain family cysteine protease contain... 102 7e-21
UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ... 101 9e-21
UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n... 101 9e-21
UniRef50_Q22W19 Cluster: Papain family cysteine protease contain... 101 9e-21
UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic... 101 9e-21
UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ... 101 1e-20
UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ... 101 1e-20
UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|... 101 1e-20
UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat... 101 2e-20
UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa... 100 3e-20
UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr... 100 3e-20
UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ... 99 4e-20
UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted... 99 4e-20
UniRef50_Q23EG5 Cluster: Papain family cysteine protease contain... 99 4e-20
UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|... 100 5e-20
UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb... 100 5e-20
UniRef50_Q24HI2 Cluster: Papain family cysteine protease contain... 99 7e-20
UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|... 99 7e-20
UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca... 99 9e-20
UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine p... 98 1e-19
UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, w... 98 1e-19
UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza sativa... 98 2e-19
UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n... 98 2e-19
UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n... 98 2e-19
UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cy... 98 2e-19
UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus... 97 2e-19
UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio... 97 2e-19
UniRef50_Q23RT7 Cluster: Papain family cysteine protease contain... 97 4e-19
UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicate... 97 4e-19
UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ... 97 4e-19
UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto... 97 4e-19
UniRef50_O16454 Cluster: Temporarily assigned gene name protein ... 96 5e-19
UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh... 96 6e-19
UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi... 96 6e-19
UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin ... 95 8e-19
UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease ... 95 8e-19
UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum... 95 8e-19
UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, wh... 95 8e-19
UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ... 95 1e-18
UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ... 95 1e-18
UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr... 95 1e-18
UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C... 95 1e-18
UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ... 94 2e-18
UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryo... 94 2e-18
UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:... 94 2e-18
UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep: L... 94 3e-18
UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ... 94 3e-18
UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R... 94 3e-18
UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia ... 93 3e-18
UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, wh... 93 3e-18
UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R... 93 4e-18
UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No... 93 6e-18
UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl... 93 6e-18
UniRef50_Q248G1 Cluster: Papain family cysteine protease contain... 93 6e-18
UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheir... 93 6e-18
UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen... 93 6e-18
UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Pl... 92 8e-18
UniRef50_Q23TW3 Cluster: Papain family cysteine protease contain... 92 8e-18
UniRef50_Q235G6 Cluster: Papain family cysteine protease contain... 92 8e-18
UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG... 92 1e-17
UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lambli... 92 1e-17
UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, wh... 92 1e-17
UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 p... 91 1e-17
UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain... 91 1e-17
UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy... 91 1e-17
UniRef50_Q231X3 Cluster: Papain family cysteine protease contain... 91 2e-17
UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica... 91 2e-17
UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain ... 91 2e-17
UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; ... 90 3e-17
UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov... 90 3e-17
UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain... 90 4e-17
UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lambli... 89 5e-17
UniRef50_Q22NW9 Cluster: Papain family cysteine protease contain... 89 5e-17
UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh... 89 7e-17
UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia... 89 9e-17
UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba hi... 88 1e-16
UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n... 88 1e-16
UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:... 88 2e-16
UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8; Magnoliophyt... 87 2e-16
UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis... 87 2e-16
UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]... 87 2e-16
UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome s... 87 3e-16
UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestiv... 87 3e-16
UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D... 87 3e-16
UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadida... 87 4e-16
UniRef50_Q23H32 Cluster: Papain family cysteine protease contain... 87 4e-16
UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain... 85 9e-16
UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ... 85 9e-16
UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus ... 85 1e-15
UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ... 85 2e-15
UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ... 85 2e-15
UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa... 84 2e-15
UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Try... 84 2e-15
UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foet... 84 2e-15
UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep... 83 4e-15
UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium... 83 4e-15
UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G... 83 5e-15
UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain... 83 5e-15
UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ... 83 6e-15
UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sa... 83 6e-15
UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto... 83 6e-15
UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain... 82 8e-15
UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease ... 82 1e-14
UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ... 82 1e-14
UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckei... 82 1e-14
UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru... 82 1e-14
UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ... 81 1e-14
UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin ... 81 1e-14
UniRef50_Q23H10 Cluster: Papain family cysteine protease contain... 81 1e-14
UniRef50_A7APS9 Cluster: Papain family cysteine protease contain... 81 1e-14
UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; ... 81 2e-14
UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1; ... 81 3e-14
UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain... 81 3e-14
UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; The... 81 3e-14
UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ... 80 3e-14
UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like cy... 80 3e-14
UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endop... 80 3e-14
UniRef50_P56202 Cluster: Cathepsin W precursor; n=15; Eutheria|R... 80 3e-14
UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: ... 80 4e-14
UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy... 80 4e-14
UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w... 80 4e-14
UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin ... 79 6e-14
UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Re... 79 6e-14
UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaste... 79 8e-14
UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteina... 79 8e-14
UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precu... 79 8e-14
UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome s... 79 1e-13
UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali... 79 1e-13
UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, who... 79 1e-13
UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2... 79 1e-13
UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa... 78 1e-13
UniRef50_Q23894 Cluster: Cysteine proteinase 3; n=2; Dictyosteli... 78 1e-13
UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain... 78 2e-13
UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, wh... 78 2e-13
UniRef50_Q23FS6 Cluster: Papain family cysteine protease contain... 77 2e-13
UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,... 77 3e-13
UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|... 77 3e-13
UniRef50_Q3L7L2 Cluster: Sar s 1 allergen SMIPP-C Yv6008G08; n=2... 77 3e-13
UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy... 77 3e-13
UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, wh... 77 3e-13
UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precurs... 77 3e-13
UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;... 77 4e-13
UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dv... 77 4e-13
UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, w... 77 4e-13
UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease ... 76 5e-13
UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine prot... 76 7e-13
UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Re... 75 1e-12
UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi... 75 1e-12
UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ... 75 2e-12
UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain... 74 2e-12
UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:... 74 2e-12
UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1... 74 3e-12
UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H... 74 3e-12
UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-li... 74 3e-12
UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, wh... 73 4e-12
UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Pl... 73 4e-12
UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-... 73 5e-12
UniRef50_A3EXS2 Cluster: Cathepsin L-like cysteine proteinase-li... 73 5e-12
UniRef50_A2DKX2 Cluster: Clan CA, family C1, cathepsin L-like cy... 73 5e-12
UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n... 73 7e-12
UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis... 73 7e-12
UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2; Arab... 71 2e-11
UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster... 71 2e-11
UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n... 71 2e-11
UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:... 71 2e-11
UniRef50_Q23H06 Cluster: Papain family cysteine protease contain... 71 2e-11
UniRef50_A0CHZ5 Cluster: Chromosome undetermined scaffold_186, w... 71 2e-11
UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryz... 71 3e-11
UniRef50_Q5C1E8 Cluster: SJCHGC04937 protein; n=1; Schistosoma j... 71 3e-11
UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theil... 71 3e-11
UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole... 70 4e-11
UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schist... 70 4e-11
UniRef50_Q23H15 Cluster: Papain family cysteine protease contain... 70 5e-11
UniRef50_Q5YER1 Cluster: Cysteine proteinase; n=1; Bigelowiella ... 69 6e-11
UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big... 69 6e-11
UniRef50_A2EED5 Cluster: Clan CA, family C1, cathepsin L-like cy... 69 6e-11
UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2; cel... 69 8e-11
UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella ve... 69 1e-10
UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like cy... 69 1e-10
UniRef50_A0CS14 Cluster: Chromosome undetermined scaffold_26, wh... 69 1e-10
UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease ... 68 1e-10
UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;... 67 2e-10
UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1; ... 67 2e-10
UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Pl... 67 2e-10
UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin ... 67 3e-10
UniRef50_Q3L7L6 Cluster: Sar s 1 allergen Yv5032C08; n=1; Sarcop... 67 3e-10
UniRef50_Q6DGW1 Cluster: 26-29kD-proteinase protein; n=23; Danio... 66 4e-10
UniRef50_Q7KWP5 Cluster: Similar to Dictyostelium discoideum (Sl... 66 4e-10
UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin ... 66 8e-10
UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyl... 66 8e-10
UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Pir... 66 8e-10
UniRef50_Q22DX2 Cluster: Papain family cysteine protease contain... 66 8e-10
UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus co... 65 1e-09
UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Ca... 65 1e-09
UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; ... 64 2e-09
UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathep... 64 2e-09
UniRef50_A0BLR4 Cluster: Chromosome undetermined scaffold_115, w... 64 2e-09
UniRef50_UPI0001509FF9 Cluster: Papain family cysteine protease ... 64 2e-09
UniRef50_Q9XXQ7 Cluster: Putative uncharacterized protein; n=2; ... 64 2e-09
UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi... 64 3e-09
UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2; ... 63 4e-09
UniRef50_Q650W8 Cluster: Putative cysteine proteinase; n=2; Oryz... 63 5e-09
UniRef50_Q2QA00 Cluster: Cathepsin B-like cysteine protease 2; n... 63 5e-09
UniRef50_Q17IZ6 Cluster: Procathepsin L3, putative; n=2; Culicid... 62 7e-09
UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; M... 62 7e-09
UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 ... 62 7e-09
UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;... 62 9e-09
UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcop... 62 9e-09
UniRef50_A0EI50 Cluster: Chromosome undetermined scaffold_98, wh... 62 9e-09
UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: C... 62 1e-08
UniRef50_A5HC50 Cluster: Cathepsin W; n=2; Theria|Rep: Cathepsin... 62 1e-08
UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoid... 62 1e-08
UniRef50_Q0JP65 Cluster: Os01g0240900 protein; n=3; Oryza sativa... 61 2e-08
UniRef50_Q18740 Cluster: Putative uncharacterized protein tag-32... 61 2e-08
UniRef50_Q9BL26 Cluster: Putative uncharacterized protein; n=4; ... 61 2e-08
UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcop... 61 2e-08
UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|... 61 2e-08
UniRef50_A0DKP6 Cluster: Chromosome undetermined scaffold_54, wh... 61 2e-08
UniRef50_A0LES1 Cluster: Peptidase C1A, papain precursor; n=2; S... 60 3e-08
UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpw... 60 3e-08
UniRef50_Q60EQ9 Cluster: Putative uncharacterized protein OJ1280... 60 4e-08
UniRef50_Q2QS15 Cluster: Papain family cysteine protease contain... 60 4e-08
UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep... 60 4e-08
UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba hi... 60 4e-08
UniRef50_A0EAG2 Cluster: Chromosome undetermined scaffold_86, wh... 60 4e-08
UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase pr... 60 4e-08
UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Gi... 60 4e-08
UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; R... 60 5e-08
UniRef50_Q237A1 Cluster: Papain family cysteine protease contain... 60 5e-08
UniRef50_O16289 Cluster: Putative uncharacterized protein W07B8.... 60 5e-08
UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-li... 60 5e-08
UniRef50_A0MA79 Cluster: TIN-ag-RP; n=1; Bombyx mori|Rep: TIN-ag... 60 5e-08
UniRef50_Q54IS1 Cluster: Putative uncharacterized protein; n=1; ... 59 7e-08
UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like cy... 59 7e-08
UniRef50_UPI00015B5773 Cluster: PREDICTED: similar to GM06507p; ... 59 9e-08
UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; The... 59 9e-08
UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3.... 59 9e-08
UniRef50_A2Q4E7 Cluster: Peptidase C1A, papain; n=1; Medicago tr... 58 1e-07
UniRef50_Q93512 Cluster: Putative uncharacterized protein; n=2; ... 58 2e-07
UniRef50_Q8MUI2 Cluster: Cysteine proteinase; n=4; Ancylostomati... 58 2e-07
UniRef50_Q3L7L7 Cluster: Sar s 1 allergen Yv5020C01; n=1; Sarcop... 58 2e-07
UniRef50_A6N590 Cluster: Extracellular cysteine protease 8; n=10... 58 2e-07
UniRef50_A1XG92 Cluster: Putative cathepsin B-like like proteina... 58 2e-07
UniRef50_UPI000155D183 Cluster: PREDICTED: similar to Cathepsin ... 58 2e-07
UniRef50_Q75JH0 Cluster: Similar to Dictyostelium discoideum (Sl... 58 2e-07
UniRef50_A0BNM1 Cluster: Chromosome undetermined scaffold_119, w... 58 2e-07
UniRef50_Q0AY53 Cluster: Putative uncharacterized protein; n=1; ... 57 3e-07
UniRef50_Q3L7L0 Cluster: Sar s 1 allergen SMIPP-C Yv5009F04; n=3... 57 3e-07
UniRef50_Q8T659 Cluster: Cathepsin B; n=1; Apriona germari|Rep: ... 57 4e-07
UniRef50_Q54VR1 Cluster: Putative uncharacterized protein; n=1; ... 57 4e-07
UniRef50_O61515 Cluster: Cathepsin B-like cysteine protease GCP7... 57 4e-07
UniRef50_A7LM75 Cluster: Cathepsin B preproprotein precursor; n=... 57 4e-07
UniRef50_Q94715 Cluster: Putative cathepsin L2 precursor; n=4; P... 57 4e-07
UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome sh... 56 5e-07
UniRef50_Q01E11 Cluster: Cysteine proteinase Cathepsin F; n=3; O... 56 5e-07
UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=... 56 5e-07
UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|R... 56 6e-07
UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7; Rhabditida|... 56 6e-07
UniRef50_Q2M436 Cluster: Cathepsin-like cysteine protease; n=3; ... 56 8e-07
UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase pr... 56 8e-07
UniRef50_Q9XZI2 Cluster: Cathepsin Z1 preproprotein; n=3; Bilate... 55 1e-06
UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ... 55 1e-06
UniRef50_Q23FP9 Cluster: Papain family cysteine protease contain... 55 1e-06
UniRef50_Q650X9 Cluster: Putative cysteine proteinase; n=1; Oryz... 54 2e-06
UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestin... 54 2e-06
UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome s... 54 3e-06
UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; ... 53 4e-06
UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8; Leishma... 52 8e-06
UniRef50_Q86GZ6 Cluster: Midgut cysteine proteinase 1; n=1; Rhip... 52 1e-05
UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep... 52 1e-05
UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcop... 52 1e-05
UniRef50_A0DZB1 Cluster: Chromosome undetermined scaffold_7, who... 52 1e-05
UniRef50_Q015J8 Cluster: Cathepsin; n=2; Ostreococcus|Rep: Cathe... 52 1e-05
UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lambl... 51 2e-05
UniRef50_O01850 Cluster: Cathepsin z protein 1; n=4; Caenorhabdi... 51 2e-05
UniRef50_A0BLX3 Cluster: Chromosome undetermined scaffold_115, w... 51 2e-05
UniRef50_Q8TQM7 Cluster: Putative uncharacterized protein; n=1; ... 51 2e-05
UniRef50_Q9UBR2 Cluster: Cathepsin Z precursor; n=40; Bilateria|... 51 2e-05
UniRef50_UPI00006CBCFB Cluster: Papain family cysteine protease ... 51 2e-05
UniRef50_UPI00006CB653 Cluster: Papain family cysteine protease ... 51 2e-05
UniRef50_P43507 Cluster: Cathepsin B-like cysteine proteinase 3 ... 51 2e-05
UniRef50_A6GBK8 Cluster: Peptidase C1A, papain; n=1; Plesiocysti... 50 3e-05
UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein... 50 3e-05
UniRef50_Q6R7Z5 Cluster: Cathepsin B-like cysteine protease; n=8... 50 3e-05
UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|R... 50 3e-05
UniRef50_O16288 Cluster: Putative uncharacterized protein W07B8.... 50 3e-05
UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator ame... 50 3e-05
UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4 ... 50 3e-05
UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;... 50 4e-05
UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomus... 50 4e-05
UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n... 50 4e-05
UniRef50_UPI0000E815AE Cluster: PREDICTED: similar to glucocorti... 50 5e-05
UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma... 50 5e-05
UniRef50_UPI0000E492F4 Cluster: PREDICTED: similar to cathepsin ... 49 7e-05
UniRef50_Q6A1H9 Cluster: Cathepsin X/O; n=1; Suberites domuncula... 49 7e-05
UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3; Theil... 49 7e-05
UniRef50_P25802 Cluster: Cathepsin B-like cysteine proteinase 1 ... 49 7e-05
UniRef50_Q3A8I3 Cluster: Putative serine protease; n=1; Pelobact... 49 9e-05
UniRef50_Q53K53 Cluster: Cysteine protease 1, putative; n=5; Ory... 49 9e-05
UniRef50_A4VE98 Cluster: Cathepsin z; n=3; Tetrahymena thermophi... 49 9e-05
UniRef50_Q6XM39 Cluster: FirrV-1-A48 precursor; n=1; Feldmannia ... 48 1e-04
UniRef50_Q945E4 Cluster: Cysteine proteinase; n=1; Vasconcellea ... 48 1e-04
UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Catheps... 48 1e-04
UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intest... 48 1e-04
UniRef50_Q54R55 Cluster: Putative uncharacterized protein; n=1; ... 48 1e-04
UniRef50_Q16V93 Cluster: Cathepsin b; n=2; Aedes aegypti|Rep: Ca... 48 1e-04
UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina... 48 1e-04
UniRef50_Q8PS79 Cluster: Putative uncharacterized protein; n=1; ... 48 1e-04
UniRef50_UPI0000D567C7 Cluster: PREDICTED: similar to CG3074-PA,... 48 2e-04
UniRef50_P90850 Cluster: Uncharacterized peptidase C1-like prote... 48 2e-04
UniRef50_Q5CYC8 Cluster: Cathepsin like thiol protease possibly ... 47 3e-04
UniRef50_O15555 Cluster: Cysteine protease; n=1; Giardia muris|R... 47 3e-04
UniRef50_UPI0000498E2F Cluster: cysteine proteinase; n=1; Entamo... 47 4e-04
UniRef50_Q8QNJ8 Cluster: EsV-1-75; n=1; Ectocarpus siliculosus v... 46 5e-04
UniRef50_Q54D62 Cluster: Putative uncharacterized protein; n=1; ... 46 5e-04
UniRef50_UPI0000E4866E Cluster: PREDICTED: similar to oxidized-L... 46 7e-04
UniRef50_Q86GK0 Cluster: Cathepsin Z-like cysteine proteinase; n... 46 7e-04
UniRef50_Q649T1 Cluster: Cathepsin C; n=1; uncultured archaeon G... 46 7e-04
UniRef50_UPI00015B5D85 Cluster: PREDICTED: similar to cathepsin ... 46 9e-04
UniRef50_UPI0000D56B9D Cluster: PREDICTED: similar to Cathepsin ... 46 9e-04
UniRef50_UPI00006CBB5F Cluster: Papain family cysteine protease ... 46 9e-04
UniRef50_Q9XZM9 Cluster: Cysteine proteinase CPW2; n=1; Acantham... 46 9e-04
UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2... 46 9e-04
UniRef50_A0LQG9 Cluster: Peptidase C1A, papain; n=1; Syntrophoba... 45 0.001
UniRef50_Q7QQ83 Cluster: GLP_42_16392_14707; n=1; Giardia lambli... 45 0.001
UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma j... 45 0.001
UniRef50_A0BZE1 Cluster: Chromosome undetermined scaffold_139, w... 45 0.001
UniRef50_Q70EW7 Cluster: Cathepsin B-like proteinase precursor; ... 45 0.002
UniRef50_Q4N5Z7 Cluster: Cysteine proteinase, putative; n=2; The... 45 0.002
UniRef50_A3FQ13 Cluster: Cathepsin like thiol protease possibly ... 45 0.002
UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cy... 45 0.002
UniRef50_UPI00006CBAC7 Cluster: Papain family cysteine protease ... 44 0.002
UniRef50_Q2NG83 Cluster: Member of asn/thr-rich large protein fa... 44 0.003
UniRef50_P25793 Cluster: Cathepsin B-like cysteine proteinase 2 ... 44 0.003
UniRef50_Q9GZM7-2 Cluster: Isoform 2 of Q9GZM7 ; n=1; Homo sapie... 44 0.004
UniRef50_Q7Q9Y2 Cluster: ENSANGP00000012227; n=1; Anopheles gamb... 44 0.004
UniRef50_A5K8Y0 Cluster: Preprocathepsin c, putative; n=1; Plasm... 44 0.004
UniRef50_Q2FLD5 Cluster: PKD precursor; n=1; Methanospirillum hu... 44 0.004
UniRef50_Q3LFN3 Cluster: Cysteine proteinase; n=1; Dianthus cary... 43 0.005
UniRef50_Q4UCF5 Cluster: Cysteine proteinase, tacP, putative; n=... 43 0.006
UniRef50_Q9GZM7 Cluster: Tubulointerstitial nephritis antigen-li... 43 0.006
UniRef50_Q7M1Q7 Cluster: Actinidain; n=1; Actinidia chinensis|Re... 42 0.011
UniRef50_Q01FL5 Cluster: Cysteine protease-5; n=3; Ostreococcus|... 42 0.011
UniRef50_A0DTZ2 Cluster: Chromosome undetermined scaffold_63, wh... 42 0.011
UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia... 42 0.014
UniRef50_Q4PGS1 Cluster: Putative uncharacterized protein; n=1; ... 42 0.014
UniRef50_Q70SU8 Cluster: Cystein proteinase inhibitor protein pr... 41 0.019
UniRef50_A4S004 Cluster: Predicted protein; n=2; Ostreococcus|Re... 41 0.019
UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep... 41 0.019
UniRef50_Q8TKH5 Cluster: Cell surface protein; n=3; Methanosarci... 41 0.019
UniRef50_UPI0000D56B9C Cluster: PREDICTED: similar to CG10992-PA... 41 0.025
UniRef50_Q5YER6 Cluster: Cathepsin Z; n=1; Bigelowiella natans|R... 41 0.025
UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1; O... 40 0.033
UniRef50_Q9BL59 Cluster: Putative uncharacterized protein; n=2; ... 40 0.033
UniRef50_Q8I0V1 Cluster: Preprocathepsin c, putative; n=1; Plasm... 40 0.033
UniRef50_Q7M4N9 Cluster: Dipeptidyl-peptidase I; n=1; Homo sapie... 40 0.033
UniRef50_Q8TMY7 Cluster: Cell surface protein; n=2; Methanosarci... 40 0.033
UniRef50_Q9SIE8 Cluster: Putative cysteine proteinase; n=1; Arab... 40 0.043
UniRef50_A4RYN2 Cluster: Predicted protein; n=1; Ostreococcus lu... 40 0.043
UniRef50_Q962W3 Cluster: Cysteine protease; n=3; Giardia intesti... 40 0.043
UniRef50_Q7R5X2 Cluster: GLP_81_104117_102504; n=1; Giardia lamb... 40 0.043
UniRef50_Q26989 Cluster: Cysteine proteinase 5; n=1; Tritrichomo... 40 0.043
UniRef50_UPI0000E48EBC Cluster: PREDICTED: hypothetical protein;... 40 0.057
UniRef50_Q42312 Cluster: Cysteine protease; n=1; Arabidopsis tha... 40 0.057
UniRef50_UPI0000E46ABB Cluster: PREDICTED: similar to SCO-spondi... 39 0.076
UniRef50_Q7JWQ7 Cluster: RE01730p; n=5; Diptera|Rep: RE01730p - ... 39 0.076
UniRef50_A2F4G0 Cluster: Clan CA, family C1, cathepsin L-like cy... 39 0.076
UniRef50_Q8TQ91 Cluster: Putative uncharacterized protein; n=1; ... 39 0.076
UniRef50_A1SVF0 Cluster: Peptidase C1A, papain; n=1; Psychromona... 39 0.100
UniRef50_A7LFV3 Cluster: Cathepsin L protease inhibitor 2; n=1; ... 39 0.100
UniRef50_A5Z7Z2 Cluster: Putative uncharacterized protein; n=1; ... 38 0.13
UniRef50_Q7QZB3 Cluster: GLP_567_6496_7413; n=1; Giardia lamblia... 38 0.13
UniRef50_Q5DEC9 Cluster: SJCHGC06356 protein; n=1; Schistosoma j... 38 0.13
UniRef50_Q9NHY1 Cluster: Cysteine protease cp2; n=1; Theileria c... 38 0.17
UniRef50_O17431 Cluster: Thiol protease; n=1; Trichuris suis|Rep... 38 0.17
UniRef50_Q6CPZ4 Cluster: Kluyveromyces lactis strain NRRL Y-1140... 38 0.17
UniRef50_Q61CG7 Cluster: Putative uncharacterized protein CBG129... 38 0.23
UniRef50_Q5Y801 Cluster: Cysteine proteinase; n=1; Petunia x hyb... 37 0.30
UniRef50_Q01FU9 Cluster: Cathepsin Z; n=2; Ostreococcus|Rep: Cat... 37 0.30
UniRef50_Q9VTF1 Cluster: CG32071-PA; n=2; Drosophila melanogaste... 37 0.30
UniRef50_Q54MB6 Cluster: Putative uncharacterized protein; n=1; ... 37 0.30
UniRef50_Q2FLC7 Cluster: Periplasmic copper-binding precursor; n... 37 0.30
UniRef50_UPI00006CFA59 Cluster: Papain family cysteine protease ... 37 0.40
UniRef50_Q84YE7 Cluster: Cysteine proteinase-like protein; n=1; ... 37 0.40
UniRef50_A5K7H0 Cluster: Putative uncharacterized protein; n=1; ... 37 0.40
UniRef50_A0DIY3 Cluster: Chromosome undetermined scaffold_52, wh... 37 0.40
UniRef50_A0DCA5 Cluster: Chromosome undetermined scaffold_45, wh... 37 0.40
UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Gi... 37 0.40
UniRef50_UPI0000DB7B97 Cluster: PREDICTED: hypothetical protein,... 36 0.53
UniRef50_O65214 Cluster: Cysteine protease; n=2; Volvox carteri ... 36 0.53
UniRef50_Q1AMF2 Cluster: Cathepsin C2; n=1; Toxoplasma gondii|Re... 36 0.53
UniRef50_A2E6N1 Cluster: Clan CA, family C1, cathepsin B-like cy... 36 0.53
UniRef50_UPI0000ECA2BE Cluster: UPI0000ECA2BE related cluster; n... 36 0.70
UniRef50_A6W7B6 Cluster: Methyl-accepting chemotaxis sensory tra... 36 0.70
UniRef50_Q7JYA0 Cluster: RE20049p; n=2; Sophophora|Rep: RE20049p... 36 0.70
UniRef50_Q4XZE6 Cluster: Preprocathepsin c, putative; n=6; Plasm... 36 0.70
UniRef50_P91991 Cluster: Putative uncharacterized protein; n=2; ... 36 0.70
UniRef50_A0CGA3 Cluster: Chromosome undetermined scaffold_179, w... 36 0.70
UniRef50_Q7MTY9 Cluster: Cysteine peptidase, putative; n=8; Bact... 36 0.93
UniRef50_A7NM03 Cluster: Putative uncharacterized protein precur... 36 0.93
UniRef50_A7DL96 Cluster: Putative uncharacterized protein precur... 36 0.93
UniRef50_Q8I3C0 Cluster: Papain family cysteine protease, putati... 36 0.93
UniRef50_Q55CB6 Cluster: Putative uncharacterized protein; n=1; ... 36 0.93
UniRef50_Q171M0 Cluster: Cathepsin b; n=7; Aedes aegypti|Rep: Ca... 36 0.93
UniRef50_Q4PH90 Cluster: Putative uncharacterized protein; n=1; ... 36 0.93
UniRef50_P21381 Cluster: Thaumatopain; n=10; Eukaryota|Rep: Thau... 36 0.93
UniRef50_Q6M7X4 Cluster: Conserved secreted protein; n=3; Coryne... 35 1.2
UniRef50_Q1CXI7 Cluster: Putative uncharacterized protein; n=1; ... 35 1.2
UniRef50_A6LML6 Cluster: Peptidase C1A, papain precursor; n=1; T... 35 1.2
UniRef50_A1ZZE0 Cluster: Aminopeptidase C; n=1; Microscilla mari... 35 1.2
UniRef50_Q7QWN9 Cluster: GLP_26_50243_51811; n=2; Giardia lambli... 35 1.2
>UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-PA
- Drosophila melanogaster (Fruit fly)
Length = 549
Score = 283 bits (695), Expect = 2e-75
Identities = 128/195 (65%), Positives = 154/195 (78%), Gaps = 1/195 (0%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
K KH Y SD EHE R NIFRQ+LRYIHS NRA +T++VNHLAD+T++EL A RG +
Sbjct: 249 KRKHGVAYHSDTEHEHRKNIFRQNLRYIHSKNRAKLTYTLAVNHLADKTEEELKARRGYK 308
Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
SG G PFPY + ++ ++P ++DWRL+GAVTPVKDQSVCGSCWSFGT+G +EG
Sbjct: 309 SSGIYNTGKPFPYDVPKYKD---EIPDQYDWRLYGAVTPVKDQSVCGSCWSFGTIGHLEG 365
Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYLGQ 538
A FL NGG+LVRLSQQALIDCSW +GNNGCDGGEDFR Y+W ++ G+PTEE+YG YLGQ
Sbjct: 366 AFFLKNGGNLVRLSQQALIDCSWAYGNNGCDGGEDFRVYQWMLQSGGVPTEEEYGPYLGQ 425
Query: 539 DGYCHVDNVTAVTSI 583
DGYCHV+NVT V I
Sbjct: 426 DGYCHVNNVTLVAPI 440
>UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue of
Sarcophaga 26,29kDa proteinase; n=1; Nasonia
vitripennis|Rep: PREDICTED: similar to homologue of
Sarcophaga 26,29kDa proteinase - Nasonia vitripennis
Length = 553
Score = 276 bits (676), Expect = 3e-73
Identities = 127/196 (64%), Positives = 151/196 (77%), Gaps = 2/196 (1%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
K H + YA DLEH++R FR +LR+IHS NRAN GFT+ VNHLADR + EL LRG++
Sbjct: 252 KKTHNKNYAHDLEHKQRKEHFRHNLRFIHSINRANLGFTLDVNHLADRNEAELKVLRGKQ 311
Query: 182 YSGPSPHG-LPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
Y+ +G +PFP+ VE+ +P DWRL+GAVTPVKDQSVCGSCWSFGT GAVE
Sbjct: 312 YTQHGYNGGMPFPHD---VEKEKADVPDSFDWRLYGAVTPVKDQSVCGSCWSFGTTGAVE 368
Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLG 535
GA F+ LVRLSQQALIDCSWGFGNNGCDGGEDFR+Y+WI +H GLPTEE+YGGYLG
Sbjct: 369 GAYFMKYK-KLVRLSQQALIDCSWGFGNNGCDGGEDFRSYQWIIKHGGLPTEEEYGGYLG 427
Query: 536 QDGYCHVDNVTAVTSI 583
QDGYCH+ NVT + +
Sbjct: 428 QDGYCHIKNVTQIAKL 443
>UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1;
Rhipicephalus appendiculatus|Rep: Midgut cysteine
proteinase 2 - Rhipicephalus appendiculatus (Brown ear
tick)
Length = 564
Score = 243 bits (594), Expect = 3e-63
Identities = 114/194 (58%), Positives = 139/194 (71%), Gaps = 1/194 (0%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
K H+R Y D EH++R +IFRQ+LR+I S NRAN G+ ++VNHLADRT +E++ LRGR
Sbjct: 265 KETHKRTYELDTEHDRRRDIFRQNLRFIDSKNRANLGYNLAVNHLADRTREEISVLRGRL 324
Query: 182 YSGP-SPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
S S PFP + + KLP + DWR +GAVTPVKDQ+VCGSCWSFGTVG +E
Sbjct: 325 QSKDGSSRAEPFPRHR-----FTAKLPDQIDWRPYGAVTPVKDQAVCGSCWSFGTVGELE 379
Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQ 538
GA F G LVRLS+Q L+DCSW GNNGCDGGEDFRAYE+I HGL ++EDYG Y+GQ
Sbjct: 380 GAYF-RKTGRLVRLSEQQLVDCSWNNGNNGCDGGEDFRAYEYIADHGLASDEDYGAYIGQ 438
Query: 539 DGYCHVDNVTAVTS 580
DG CH V + S
Sbjct: 439 DGVCHDSKVNSTIS 452
>UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326;
n=2; Danio rerio|Rep: hypothetical protein LOC550326 -
Danio rerio
Length = 531
Score = 225 bits (551), Expect = 5e-58
Identities = 103/195 (52%), Positives = 132/195 (67%), Gaps = 1/195 (0%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
K K RQY S+ EHE+R N+F + R++HSNNRA +++ +NH AD+T +ELA + G
Sbjct: 233 KEKFNRQYESEKEHEERENLFLHTFRFVHSNNRAGLTYSVGINHFADKTKEELARMTGGL 292
Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
PFP S+ R S+ P DWRL+GAVTPVKDQ+VCGSCWSF T G +EG
Sbjct: 293 LPKKEEKAQPFP-SEIR----SIATPNSVDWRLYGAVTPVKDQAVCGSCWSFATTGTLEG 347
Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLGQ 538
ALFL G L LSQQ L+DC+WGFGNNGCDGGE++RA+EWI +H G+ T E YG Y+G
Sbjct: 348 ALFLKT-GQLTSLSQQMLVDCTWGFGNNGCDGGEEWRAFEWIMKHGGISTAESYGAYMGM 406
Query: 539 DGYCHVDNVTAVTSI 583
+G CH D + V +
Sbjct: 407 NGLCHYDKTSMVAQL 421
>UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome
shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome 21 SCAF14577, whole genome shotgun sequence -
Tetraodon nigroviridis (Green puffer)
Length = 478
Score = 221 bits (541), Expect = 7e-57
Identities = 105/186 (56%), Positives = 127/186 (68%), Gaps = 1/186 (0%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
K K QRQY D EHE R F +LRY+HS NRA +T+ +N L+DRT ELA +RGR+
Sbjct: 125 KEKFQRQYEDDKEHELRQQAFIHNLRYVHSKNRAGLSYTLGLNSLSDRTMSELATMRGRK 184
Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
+ GLPFP+ + V++P DWRL+GAVTPVKDQ++CGSCWSF T G +EG
Sbjct: 185 QRKTTNAGLPFPFKLYQ----HVEVPESLDWRLYGAVTPVKDQAICGSCWSFATTGTIEG 240
Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLGQ 538
ALFL G L LSQQ LIDCSWGFGNN CDGGE++RAYEWI +H G+ + E YG YLG
Sbjct: 241 ALFLKTGS-LQVLSQQMLIDCSWGFGNNACDGGEEWRAYEWIMKHGGIASAETYGPYLGM 299
Query: 539 DGYCHV 556
G V
Sbjct: 300 TGSLQV 305
Score = 97.1 bits (231), Expect = 3e-19
Identities = 41/66 (62%), Positives = 51/66 (77%), Gaps = 1/66 (1%)
Frame = +2
Query: 368 FLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLGQDG 544
+L G L LSQQ LIDCSWGFGNN CDGGE++RAYEWI +H G+ + E YG YLG +G
Sbjct: 296 YLGMTGSLQVLSQQMLIDCSWGFGNNACDGGEEWRAYEWIMKHGGIASAETYGPYLGMNG 355
Query: 545 YCHVDN 562
+CHV++
Sbjct: 356 FCHVNS 361
>UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin l;
n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
similar to cathepsin l - Strongylocentrotus purpuratus
Length = 489
Score = 193 bits (471), Expect = 2e-48
Identities = 92/169 (54%), Positives = 118/169 (69%), Gaps = 2/169 (1%)
Frame = +2
Query: 83 IHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPP 262
IHS NRAN G+ + +NH+AD++ EL +RGR +GLP Y S V + +V P
Sbjct: 214 IHSINRANLGYVLDINHMADQSHQELKRMRGRLRQTRPNNGLP--YDGSDVSDDAV---P 268
Query: 263 EH-DWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFG 439
+H DW + GAV+PVKDQ+VCGSCWSFG+ +EGA+F+ +G VRLSQQ L+DC+W G
Sbjct: 269 DHIDWNVLGAVSPVKDQAVCGSCWSFGSAETIEGAVFMQSGKR-VRLSQQMLMDCTWAAG 327
Query: 440 NNGCDGGEDFRAYEWI-KRHGLPTEEDYGGYLGQDGYCHVDNVTAVTSI 583
NNGCDGGE++R YEW+ K G+P EE YG YLGQ+G CH D AV SI
Sbjct: 328 NNGCDGGEEWRVYEWLMKNGGIPLEETYGPYLGQNGMCHYDKSKAVASI 376
>UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 513
Score = 187 bits (456), Expect = 1e-46
Identities = 88/190 (46%), Positives = 123/190 (64%), Gaps = 1/190 (0%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
K ++++Y S EHEKR +I+R ++R+I S NR + G+++ NH+AD TD E+ ++G
Sbjct: 214 KASYRKRYPSAHEHEKRKDIYRHNMRFIKSRNRQHLGYSLKPNHMADMTDAEVNRMKGLL 273
Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
+ P G P+S ++ V LPP DWR GAV VK Q +CGSC++F GA+EG
Sbjct: 274 HEEPPLIG-DSPFSIPD-KDRGVPLPPHVDWRKAGAVNSVKSQGICGSCYAFAVAGALEG 331
Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLGQ 538
A F+ G L LS+Q ++DC+WGFGN GC GG +RA +WI +H GL TEE YG YL Q
Sbjct: 332 AHFIKTGLKL-DLSEQQIVDCTWGFGNRGCKGGYPYRAMQWILKHGGLATEESYGRYLAQ 390
Query: 539 DGYCHVDNVT 568
+GYCH N +
Sbjct: 391 EGYCHFKNTS 400
>UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 514
Score = 169 bits (410), Expect = 6e-41
Identities = 85/192 (44%), Positives = 113/192 (58%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
+H +QY S+ E KR +IFR ++RYI S NR N + ++ NH D TD E ++
Sbjct: 226 QHNKQYDSEHEVSKRKHIFRHNMRYIRSINRKNLKYKLAPNHFVDLTDGEYD-----QHK 280
Query: 188 GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGAL 367
G S L PYS V +P E DWR +GAV+PV+ Q +CGSC++ VGAVEGA
Sbjct: 281 GDSIITLYGPYSNMSHVLQRVDVPDELDWRDYGAVSPVRGQGICGSCYALAAVGAVEGAY 340
Query: 368 FLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGY 547
F+ G L LS Q +IDCSWG GN GC GG +A WI HG+ + E YG YLGQ+G
Sbjct: 341 FMKTG-KLKELSAQQVIDCSWGSGNRGCKGGYYNKAMSWIYLHGIASAESYGPYLGQEGT 399
Query: 548 CHVDNVTAVTSI 583
C ++ + +I
Sbjct: 400 CRIEGLRRAAAI 411
>UniRef50_UPI000155637A Cluster: PREDICTED: similar to
ENSANGP00000013730, partial; n=1; Ornithorhynchus
anatinus|Rep: PREDICTED: similar to ENSANGP00000013730,
partial - Ornithorhynchus anatinus
Length = 229
Score = 165 bits (401), Expect = 7e-40
Identities = 80/133 (60%), Positives = 94/133 (70%)
Frame = +2
Query: 80 YIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLP 259
+I S+NRANR F ++ NHL DRT ELAALRGR S HG PFP+ + +V LP
Sbjct: 1 FIDSHNRANRPFRLAPNHLTDRTPGELAALRGRLRSSRPNHGQPFPHEQLA----NVALP 56
Query: 260 PEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFG 439
DWRL+GAVTPVKDQ+VCGSCWSF T G +EGALFL LV LSQQ LIDCSW G
Sbjct: 57 ESLDWRLYGAVTPVKDQAVCGSCWSFATTGTLEGALFLKVTVQLVPLSQQMLIDCSWDVG 116
Query: 440 NNGCDGGEDFRAY 478
N GCDGG +++A+
Sbjct: 117 NFGCDGGLEWQAF 129
>UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 392
Score = 162 bits (394), Expect = 5e-39
Identities = 78/194 (40%), Positives = 113/194 (58%), Gaps = 7/194 (3%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR-- 181
+H + Y D EH +R +IFR ++RYI S NR + + + NH AD TDDE + +G
Sbjct: 94 QHDKVYEDDSEHRRRKHIFRHNVRYIRSMNRRSLPYKLEPNHFADLTDDEFKSYKGALDD 153
Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
S + R + + ++P + DWR +GAV P K Q CGSCW+F T GAVE
Sbjct: 154 ESKDVMNDHDDVIDDDRSKRM-FEVPDQLDWRNYGAVNPAKGQGTCGSCWAFATAGAVEA 212
Query: 362 ALFLHNGGHLVRLSQQALIDCSWG-----FGNNGCDGGEDFRAYEWIKRHGLPTEEDYGG 526
A F+ G L+ L++Q L+DC+W GNNGC GG ++A+ W+K+ G+ T + YG
Sbjct: 213 AHFIQK-GELLNLAEQQLLDCTWSTPGVYHGNNGCLGGWTWKAFSWVKKFGIATTKSYGH 271
Query: 527 YLGQDGYCHVDNVT 568
Y GQ+G+C N+T
Sbjct: 272 YRGQEGFCKTSNLT 285
>UniRef50_Q24E33 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 328
Score = 134 bits (324), Expect = 1e-30
Identities = 73/183 (39%), Positives = 105/183 (57%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
K++H + + E R NIF Q++RYI S N N F +++N +A TD+E ++L
Sbjct: 46 KLEHNIVFQNSEEDLYRQNIFFQNVRYIQSENAKNNTFKLAINIMAILTDEEYSSLY--- 102
Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
+ + S E +P E +W GAVTPVK+Q CGSCW+F T GA+EG
Sbjct: 103 LNLDQQESIDIFDSLVDDNETVGDIPSEVNWTAQGAVTPVKNQGSCGSCWAFSTTGALEG 162
Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQD 541
+ FL N L+ S+Q L+DCS + N GC+GG RA+ ++K HG+ TEE+Y Y +D
Sbjct: 163 SYFLKN-NQLISFSEQQLVDCSRLYLNMGCNGGLMPRAFRYVKAHGITTEEEY-PYTAKD 220
Query: 542 GYC 550
G C
Sbjct: 221 GKC 223
>UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep:
Toxopain-2 - Toxoplasma gondii
Length = 422
Score = 128 bits (310), Expect = 7e-29
Identities = 74/195 (37%), Positives = 104/195 (53%), Gaps = 4/195 (2%)
Frame = +2
Query: 11 HQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSG 190
+ + YA++ E ++R IF+ +L YIH++N+ +++ +NH D + DE R+Y G
Sbjct: 124 YAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGYSYSLKMNHFGDLSRDEFR----RKYLG 179
Query: 191 -PSPHGLPFPYSKSRVEELSV---KLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
L + E L+V +LP DWR G VTPVKDQ CGSCW+F T GA+E
Sbjct: 180 FKKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALE 239
Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQ 538
GA G LV LS+Q L+DCS GN C GGE A++++ G ED YL +
Sbjct: 240 GA-HCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLAR 298
Query: 539 DGYCHVDNVTAVTSI 583
D C + V I
Sbjct: 299 DEECRAQSCEKVVKI 313
>UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4;
core eudicotyledons|Rep: Papain-like cysteine peptidase
XBCP3 - Arabidopsis thaliana (Mouse-ear cress)
Length = 437
Score = 126 bits (305), Expect = 3e-28
Identities = 78/189 (41%), Positives = 107/189 (56%), Gaps = 3/189 (1%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNR-ANRGFTMSVNHLADRTDDELAALR-GRR 181
KH + Y S+ E ++R+ IF+ + ++ +N N +++S+N AD T E A R G
Sbjct: 38 KHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKASRLGLS 97
Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
S PS SK + SVK+P DWR GAVT VKDQ CG+CWSF GA+EG
Sbjct: 98 VSAPSV----IMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEG 153
Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYLGQ 538
+ G L+ LS+Q LIDC + N GC+GG A+E+ IK HG+ TE+DY Y +
Sbjct: 154 INQIVT-GDLISLSEQELIDCDKSY-NAGCNGGLMDYAFEFVIKNHGIDTEKDY-PYQER 210
Query: 539 DGYCHVDNV 565
DG C D +
Sbjct: 211 DGTCKKDKL 219
>UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine
proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like
midgut cysteine proteinase - Tenebrio molitor (Yellow
mealworm)
Length = 330
Score = 126 bits (304), Expect = 4e-28
Identities = 76/200 (38%), Positives = 111/200 (55%), Gaps = 6/200 (3%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRG---FTMSVNHLADRTDDELAAL 169
K+ H++ Y+S +E +R IF+ ++ I +N + +G ++ ++N D + +E A
Sbjct: 32 KLTHKKSYSSPIEEIRRQLIFKDNVAKIAEHNAKFEKGEVTYSKAMNQFGDMSKEEFLAY 91
Query: 170 --RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGT 343
RG+ P L PY S+ + L+ + DWR AV+ VKDQ CGSCWSF T
Sbjct: 92 VNRGKAQKPKHPENLRMPYVSSK-KPLAASV----DWRS-NAVSEVKDQGQCGSCWSFST 145
Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYG 523
GAVEG L L G L LS+Q LIDCS +GN GCDGG A+ +I +G+ +E Y
Sbjct: 146 TGAVEGQLALQR-GRLTSLSEQNLIDCSSSYGNAGCDGGWMDSAFSYIHDYGIMSESAY- 203
Query: 524 GYLGQDGYCHVDNVTAVTSI 583
Y Q YC D+ +VT++
Sbjct: 204 PYEAQGDYCRFDSSQSVTTL 223
>UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10;
Dictyostelium discoideum|Rep: Cysteine proteinase 7
precursor - Dictyostelium discoideum (Slime mold)
Length = 460
Score = 125 bits (302), Expect = 7e-28
Identities = 75/184 (40%), Positives = 101/184 (54%), Gaps = 4/184 (2%)
Frame = +2
Query: 5 VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRY 184
+ HQR Y+S+ E R NIF+ ++ Y++ N + +N AD +++E A Y
Sbjct: 35 IAHQRHYSSE-EFNGRYNIFKANMDYVNEWNTKGSETVLGLNVFADISNEEYRAT----Y 89
Query: 185 SGPSPHGLPFPYSKSRVEELS--VKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
G PF S + E + DWR GAVTP+K+Q CG CWSF T GA E
Sbjct: 90 LGT-----PFDASSLEMTESDKIFDASAQVDWRTQGAVTPIKNQGQCGGCWSFSTTGATE 144
Query: 359 GALFLHNG-GHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYL 532
GA +L NG +LV LS+Q LIDCS +GNNGC+GG A+E+ I G+ TE Y Y
Sbjct: 145 GAQYLANGKKNLVSLSEQNLIDCSGSYGNNGCEGGLMTLAFEYIINNKGIDTESSY-PYT 203
Query: 533 GQDG 544
+DG
Sbjct: 204 AEDG 207
>UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma
japonicum|Rep: SJCHGC06231 protein - Schistosoma
japonicum (Blood fluke)
Length = 372
Score = 125 bits (301), Expect = 9e-28
Identities = 72/186 (38%), Positives = 103/186 (55%), Gaps = 5/186 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRG----FTMSVNHLADRTDDELAAL 169
K+ +R Y + +E KR IF + + +NRA + + M VN+ D+T+ EL L
Sbjct: 66 KINFKRAYGNVMEETKRFLIFGTNFIKMMEHNRAYQEGKATYKMGVNNFTDKTEYELRKL 125
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
RG R S + P + + KLP DWR GAVTPVK+Q CGSCW+F + G
Sbjct: 126 RGYR----SACRIAKPKGSTFISSEHAKLPDRVDWRRNGAVTPVKNQGQCGSCWAFSSTG 181
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIK-RHGLPTEEDYGG 526
A+EG + LV LS+Q LIDCS +GNNGC+GG A+++++ G+ +E Y
Sbjct: 182 AIEGQHY-RKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDLAFQYVRDNKGIDSEISY-P 239
Query: 527 YLGQDG 544
Y+ DG
Sbjct: 240 YISGDG 245
>UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24;
Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa
(Rice)
Length = 339
Score = 124 bits (299), Expect = 2e-27
Identities = 74/192 (38%), Positives = 105/192 (54%), Gaps = 2/192 (1%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
++ R Y E +R IF+ ++ +I S N N F +SVN AD T+ E A + +
Sbjct: 43 QYGRVYKDATEKARRFEIFKANVAFIESFNAGNHKFWLSVNQFADLTNYEFRATKTNKGF 102
Query: 188 GPSPHGLPFPYSKSRVEELSVK-LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
PS +P + R E +S+ LP DWR GAVTP+KDQ CG CW+F V A+EG
Sbjct: 103 IPSTVRVPTTF---RYENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGI 159
Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYLGQD 541
+ L + G L+ LS+Q L+DC + GC+GG A+++ IK GL TE Y Y D
Sbjct: 160 VKL-STGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKY-PYTAAD 217
Query: 542 GYCHVDNVTAVT 577
G C+ + +A T
Sbjct: 218 GKCNGGSNSAAT 229
>UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor;
n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
L-like proteinase precursor - Diabrotica virgifera
virgifera (western corn rootworm)
Length = 317
Score = 124 bits (299), Expect = 2e-27
Identities = 74/198 (37%), Positives = 102/198 (51%), Gaps = 4/198 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRG---FTMSVNHLADRTDDELAAL 169
KV H ++Y E + R +F Q+L+ I +N R G F + VN AD T +E A+
Sbjct: 20 KVNHSKKYGHLKEEQVRFQVFSQNLQKIEQHNARYQNGEVSFYLGVNQFADMTSEEFKAM 79
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
+ + + V + + +P DWR GAV PV+DQ CGSCW+F G
Sbjct: 80 LDSQLIHKPKRDITSRF----VADPQLTVPESIDWREKGAVNPVRDQEQCGSCWAFSAAG 135
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGY 529
A+EG FL G L LS Q L+DCS + N GC+GG AY++IK +GL E Y Y
Sbjct: 136 ALEGQRFLKE-GKLEVLSTQQLVDCSRDYKNEGCNGGWPHWAYDYIKDNGLCLESKY-KY 193
Query: 530 LGQDGYCHVDNVTAVTSI 583
G DGY + + A+ I
Sbjct: 194 QGYDGYYCKECIPAIKKI 211
>UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1;
Dictyostelium discoideum AX4|Rep: Counting factor
associated protein - Dictyostelium discoideum AX4
Length = 531
Score = 124 bits (299), Expect = 2e-27
Identities = 68/190 (35%), Positives = 106/190 (55%), Gaps = 1/190 (0%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
K ++ ++Y+S EH++R F+ + + I ++N + + +NH AD ++ E L +
Sbjct: 229 KAQYNKEYSSQDEHDERFINFKAARKIIATHNAKESSYKLGMNHYADLSNKEFNTLVKPK 288
Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
+ PS G + +E +P DWR VTPVKDQ +CGSCW+FG+ G++EG
Sbjct: 289 VARPSVTGADSVHD----DESLRSIPSTVDWRNQNCVTPVKDQGICGSCWTFGSTGSLEG 344
Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHG-LPTEEDYGGYLGQ 538
+ N G LV LS+Q L+DC+ G+ GC GG A++++ G L TE +Y YL Q
Sbjct: 345 TNCVTN-GELVSLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGSLATESNY-PYLMQ 402
Query: 539 DGYCHVDNVT 568
+G C VT
Sbjct: 403 NGLCRDRTVT 412
>UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18;
Schistosoma|Rep: Preprocathepsin cathepsin L -
Schistosoma japonicum (Blood fluke)
Length = 331
Score = 124 bits (298), Expect = 2e-27
Identities = 67/198 (33%), Positives = 106/198 (53%), Gaps = 4/198 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRAN----RGFTMSVNHLADRTDDELAAL 169
K+K+ + Y S+ + +R IF + + I +N + G+TM +N D +E+ +
Sbjct: 31 KLKYNKTYTSNDDEMRRKMIFMRRIGKIQEHNLRHDLGLEGYTMGLNQFCDMEWEEVNRI 90
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
+ G SP + + +E + +P DWR GAVT VK Q +CGSCW+F G
Sbjct: 91 MFPKVFGNSPL---WNDDGNELELTNKPVPSTWDWRDHGAVTAVKHQGLCGSCWAFSATG 147
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGY 529
A+EG L LV+LS+Q L+DC + +GN+GC+GG A+ ++++H + +E DY Y
Sbjct: 148 AIEGQL-RRKHKKLVKLSEQQLVDCRYNYGNDGCEGGTMDLAFNYLEKHYIESENDY-KY 205
Query: 530 LGQDGYCHVDNVTAVTSI 583
LG D CH V +
Sbjct: 206 LGHDANCHYRKSKGVVKV 223
>UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine
protease; n=11; Callosobruchus maculatus|Rep: Putative
gut cathepsin L-like cysteine protease - Callosobruchus
maculatus (Southern cowpea weevil) (Pulse bruchid)
Length = 326
Score = 120 bits (290), Expect = 2e-26
Identities = 71/189 (37%), Positives = 105/189 (55%), Gaps = 6/189 (3%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDE-LAA 166
K+ H + Y S +E ++R ++F+++L I +N R F V AD T +E L
Sbjct: 27 KLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEEFLDL 86
Query: 167 LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
L+ + + + F E++ ++ DWR GAVTPVKDQ+ CGSCW+F V
Sbjct: 87 LKLQGVPALPSNAVHF----DNFEDIDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAV 142
Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSW-GFGNNGCDGGEDFRAYEWIKRHGLPTEEDYG 523
GA+EG F N G LV LS Q L+DC+ +GNNGC GG +A+++++ G+ TEE Y
Sbjct: 143 GAIEGQFFKKN-GTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQDEGIQTEESY- 200
Query: 524 GYLGQDGYC 550
Y G+ C
Sbjct: 201 PYEGRRSSC 209
>UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precursor;
n=3; Metazoa|Rep: Digestive cysteine proteinase 2
precursor - Homarus americanus (American lobster)
Length = 323
Score = 120 bits (289), Expect = 3e-26
Identities = 76/199 (38%), Positives = 107/199 (53%), Gaps = 7/199 (3%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHS-NNRANRG---FTMSVNHLADRTDDEL-AA 166
K K+ RQY E R IF Q+ +YI N + G F +++N D T +E A
Sbjct: 24 KGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEFNAV 83
Query: 167 LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
++G +P + +P ++ + V DWR GAVTPVKDQ CGSCW+F T
Sbjct: 84 MKGNIPRRSAPVSVFYPKKETGPQATEV------DWRTKGAVTPVKDQGQCGSCWAFSTT 137
Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIK-RHGLPTEEDYG 523
G++EG FL G L+ L++Q L+DCS +G GC+GG A+++IK +G+ TE Y
Sbjct: 138 GSLEGQHFLKTGS-LISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAY- 195
Query: 524 GYLGQDGYCHVD-NVTAVT 577
Y +DG C D N A T
Sbjct: 196 PYEARDGSCRFDSNSVAAT 214
>UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin
heavy chain; n=3; Amniota|Rep: PREDICTED: similar to
ferritin heavy chain - Ornithorhynchus anatinus
Length = 338
Score = 119 bits (287), Expect = 4e-26
Identities = 70/187 (37%), Positives = 103/187 (55%), Gaps = 7/187 (3%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDEL-AA 166
KV H + Y+ + E R + +++R I +N + + +++NH D+T++EL
Sbjct: 32 KVLHGKNYSVEAEEVFRRAAWEKNVRVIERHNEEMSQGKHSYRLAMNHFGDQTNEELHER 91
Query: 167 LRGRR--YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFG 340
L G R G G +S+ S + P E DWR G VTPVK+Q +CGSCW+F
Sbjct: 92 LNGFRPDLGGALRSGREQARFRSKT---SWEGPEEVDWRTKGYVTPVKNQGLCGSCWAFS 148
Query: 341 TVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
GA+E AL G +V LS+Q L+DCSW GN GC GG+ A+E+++ +G ED
Sbjct: 149 ATGALE-ALVFKTTGKMVSLSEQNLVDCSWRQGNVGCRGGQYIGAFEYVRANGGIDAEDL 207
Query: 521 GGYLGQD 541
YLG+D
Sbjct: 208 YPYLGRD 214
>UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor;
n=176; Viridiplantae|Rep: Cysteine proteinase RD21a
precursor - Arabidopsis thaliana (Mouse-ear cress)
Length = 462
Score = 119 bits (286), Expect = 6e-26
Identities = 70/186 (37%), Positives = 103/186 (55%), Gaps = 4/186 (2%)
Frame = +2
Query: 5 VKHQRQYASD--LEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGR 178
VKH + + + +E ++R IF+ +LR++ +N N + + + AD T+DE +
Sbjct: 55 VKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRS---- 110
Query: 179 RYSGPSPHGLPFPYSKSRVE-ELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
+Y G + R E + +LP DWR GAV VKDQ CGSCW+F T+GAV
Sbjct: 111 KYLGAKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAV 170
Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYL 532
EG + G L+ LS+Q L+DC + N GC+GG A+E+ IK G+ T++DY Y
Sbjct: 171 EGINQIVT-GDLITLSEQELVDCDTSY-NEGCNGGLMDYAFEFIIKNGGIDTDKDY-PYK 227
Query: 533 GQDGYC 550
G DG C
Sbjct: 228 GVDGTC 233
>UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42;
Coelomata|Rep: Cathepsin L2 precursor - Homo sapiens
(Human)
Length = 334
Score = 119 bits (286), Expect = 6e-26
Identities = 71/188 (37%), Positives = 104/188 (55%), Gaps = 5/188 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDELAAL 169
K H+R Y ++ E +R ++ ++++ I +N + GFTM++N D T++E +
Sbjct: 33 KATHRRLYGANEEGWRRA-VWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQM 91
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
G + G F E L + LP DWR G VTPVK+Q CGSCW+F G
Sbjct: 92 MGCFRNQKFRKGKVFR------EPLFLDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATG 145
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGG 526
A+EG +F G LV LS+Q L+DCS GN GC+GG RA++++K + GL +EE Y
Sbjct: 146 ALEGQMF-RKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESY-P 203
Query: 527 YLGQDGYC 550
Y+ D C
Sbjct: 204 YVAVDEIC 211
>UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep:
Actinidin Act3a - Actinidia eriantha
Length = 380
Score = 118 bits (285), Expect = 8e-26
Identities = 72/184 (39%), Positives = 101/184 (54%), Gaps = 2/184 (1%)
Frame = +2
Query: 5 VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNR-ANRGFTMSVNHLADRTDDELAALRGRR 181
VK+ + Y S E E R+ IF+++LR+I +N NR +T+ +N AD TD+E +
Sbjct: 47 VKYGKSYNSLGEREMRIEIFKENLRFIDEHNADPNRSYTVGLNQFADLTDEEYRST---- 102
Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
Y G L S + ++ LP DWR GAV VK+Q +C SCW+F T+ VE
Sbjct: 103 YLG-FKSSLKSKVSNRYMPQVGEVLPDYVDWRTTGAVVDVKNQGLCSSCWAFATIATVES 161
Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYLGQ 538
+ G L+ LS+Q L+DC+ N GC GG AYE+ I G+ TEE+Y Y+GQ
Sbjct: 162 INQIIT-GDLISLSEQELVDCNRTPINEGCKGGFMDDAYEFIINNGGINTEENY-PYIGQ 219
Query: 539 DGYC 550
D C
Sbjct: 220 DDQC 223
>UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba
culbertsoni|Rep: Cysteine proteinase - Acanthamoeba
culbertsoni
Length = 482
Score = 118 bits (285), Expect = 8e-26
Identities = 71/188 (37%), Positives = 100/188 (53%), Gaps = 7/188 (3%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
+H R Y++D E +R N +R+++ +I NR N FT+++N D T +E A L + S
Sbjct: 70 RHARSYSND-EFLERYNTWRENMDFIEEFNRGNHTFTVAMNEHGDLTPEEFARLYMGQVS 128
Query: 188 GPSPHGLPFPYS-KSRVEE----LSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGA 352
S L + +S +E+ +P DWR GAVTPVK+Q C SCW+F GA
Sbjct: 129 PASEQELQERIAAESAMEDEHHHTRASIPANWDWRTKGAVTPVKNQGSCASCWAFVATGA 188
Query: 353 VEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHG--LPTEEDYGG 526
VEG + GG LV LS Q L+DC+ G GN GC GG Y W+ + L T+ Y
Sbjct: 189 VEGVRKI-AGGSLVSLSDQMLLDCAVGTGNQGCSGGNVEITYRWMISNNARLMTQASY-P 246
Query: 527 YLGQDGYC 550
Y+ + C
Sbjct: 247 YIARQSTC 254
>UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Rep:
Cathepsin L - Stylonychia lemnae
Length = 340
Score = 118 bits (284), Expect = 1e-25
Identities = 71/184 (38%), Positives = 103/184 (55%), Gaps = 3/184 (1%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRG--FTMSVNHLADRTDDELAALRGRR 181
+ + Y S E E RL ++ ++ +I+++N N G FT+ NHLAD T DE + G
Sbjct: 48 RFSKAYKSKEEFEMRLQQYKSNIAFINNHNSQNDGTSFTLGPNHLADYTHDEYKKMLG-- 105
Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
Y + G YS ++++ P DWR GAV VKDQ CGSCW+F T+ ++E
Sbjct: 106 YKPRNKTGKEV-YSTPNLKDI----PESIDWREKGAVNAVKDQGQCGSCWAFSTIASLES 160
Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI-KRHGLPTEEDYGGYLGQ 538
F+ G L LS+Q L+DCS GN GC+GG+ A ++I G+ TE+DY Y+G+
Sbjct: 161 RYFIET-GKLQSLSEQQLVDCSKN-GNEGCNGGDMGLAMDYIASAGGVETEKDY-PYVGK 217
Query: 539 DGYC 550
D C
Sbjct: 218 DQTC 221
>UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=15;
Magnoliophyta|Rep: Cysteine proteinase RD19a precursor -
Arabidopsis thaliana (Mouse-ear cress)
Length = 368
Score = 118 bits (283), Expect = 1e-25
Identities = 78/203 (38%), Positives = 107/203 (52%), Gaps = 9/203 (4%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
K K + YAS+ EH+ R ++F+ +LR + + + T V +D T E R +
Sbjct: 55 KRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEF---RKKH 111
Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
S LP +K+ + LP + DWR GAVTPVK+Q CGSCWSF GA+EG
Sbjct: 112 LGVRSGFKLPKDANKAPILPTE-NLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEG 170
Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFG-------NNGCDGGEDFRAYEW-IKRHGLPTEED 517
A FL G LV LS+Q L+DC ++GC+GG A+E+ +K GL EED
Sbjct: 171 ANFLAT-GKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEED 229
Query: 518 YGGYLGQDG-YCHVDNVTAVTSI 583
Y Y G+DG C +D V S+
Sbjct: 230 Y-PYTGKDGKTCKLDKSKIVASV 251
>UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16)
[Contains: Cathepsin H mini chain; Cathepsin H heavy
chain; Cathepsin H light chain]; n=37; Eukaryota|Rep:
Cathepsin H precursor (EC 3.4.22.16) [Contains:
Cathepsin H mini chain; Cathepsin H heavy chain;
Cathepsin H light chain] - Homo sapiens (Human)
Length = 335
Score = 118 bits (283), Expect = 1e-25
Identities = 66/183 (36%), Positives = 103/183 (56%), Gaps = 2/183 (1%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
KH++ Y+++ E+ RL F + R I+++N N F M++N +D + E+ +Y
Sbjct: 41 KHRKTYSTE-EYHHRLQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIK----HKYL 95
Query: 188 GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGA-VTPVKDQSVCGSCWSFGTVGAVEGA 364
P +KS + PP DWR G V+PVK+Q CGSCW+F T GA+E A
Sbjct: 96 WSEPQNCSA--TKSNYLRGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESA 153
Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI-KRHGLPTEEDYGGYLGQD 541
+ + G ++ L++Q L+DC+ F N+GC GG +A+E+I G+ E+ Y Y G+D
Sbjct: 154 IAIATG-KMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTY-PYQGKD 211
Query: 542 GYC 550
GYC
Sbjct: 212 GYC 214
>UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor;
n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase
precursor - Diabrotica virgifera virgifera (western corn
rootworm)
Length = 326
Score = 117 bits (282), Expect = 2e-25
Identities = 79/199 (39%), Positives = 108/199 (54%), Gaps = 5/199 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRG---FTMSVNHLADRTDDELAAL 169
KV++ + Y + +E +KR IF+ SLR I ++N + + G F + V AD T+ E + +
Sbjct: 27 KVRNNKSYRNYIEEQKRFTIFQGSLRKIENHNDKYDHGLSTFKLGVTKFADLTEKEFSDM 86
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
G S S +S + V++L P + DWR GAVT VKDQ CGSCWSF T G
Sbjct: 87 LGISRSTKSSRPRVI-HSLTPVKDL----PSKFDWREKGAVTEVKDQGSCGSCWSFSTTG 141
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIK-RHGLPTEEDYGG 526
VEGA FL G LV LS+Q L+DC+ GC GG +A E+I+ G+ +E DY
Sbjct: 142 TVEGAYFLKT-GKLVSLSEQNLVDCA-KEDCYGCSGGYMDKALEYIETAGGIMSENDY-P 198
Query: 527 YLGQDGYCHVDNVTAVTSI 583
Y G D C D+ I
Sbjct: 199 YEGIDDKCRFDSSKVAAKI 217
>UniRef50_Q239L8 Cluster: Papain family cysteine protease containing
protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 323
Score = 117 bits (281), Expect = 2e-25
Identities = 71/183 (38%), Positives = 99/183 (54%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
K K+ ++YA R+ IF ++L+ + SN + N G T ++ + L+ +
Sbjct: 52 KTKYNKKYADPDFERYRIEIFTENLKVVESNTK-NYGITQFMDITREEFKQTYLTLKMKN 110
Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
SP ++ + V++ DW GAVTPVKDQ CGSCWSF T GAVEG
Sbjct: 111 GLKASPF--------AKFNDAGVEI----DWTTKGAVTPVKDQGQCGSCWSFSTTGAVEG 158
Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQD 541
ALFL + L LS+Q L+DCS GN GC+GG A+++I +HG+PTE Y Y D
Sbjct: 159 ALFL-STKKLTSLSEQYLVDCSKD-GNEGCNGGLMDTAFDFISQHGIPTEAAY-PYKAVD 215
Query: 542 GYC 550
G C
Sbjct: 216 GTC 218
>UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar to
human SRY (sex determining region Y)-box 30
(SOX30),transcript variant 1,; n=3; Mammalia|Rep: Testis
cDNA clone: QtsA-12228, similar to human SRY (sex
determining region Y)-box 30 (SOX30),transcript variant
1, - Macaca fascicularis (Crab eating macaque)
(Cynomolgus monkey)
Length = 433
Score = 116 bits (280), Expect = 3e-25
Identities = 70/188 (37%), Positives = 101/188 (53%), Gaps = 5/188 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDELAAL 169
K H+R Y + E +R ++ ++++ I +N + GF M++N D T++E +
Sbjct: 33 KATHRRLYGASEEGWRRA-VWEKNMKMIELHNGEYSQGKHGFAMAMNAFGDMTNEEFRQV 91
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
G + G F E L + LP DWR G VTPVK+Q CGSCW+F G
Sbjct: 92 MGCFRNQKLRKGKLFR------EPLFLDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATG 145
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGG 526
A+EG +F G LV LS+Q L+DCS GN GC+GG A+ ++K + GL +EE Y
Sbjct: 146 ALEGQMF-RKTGKLVSLSEQNLVDCSHPQGNQGCNGGFMNSAFRYVKENGGLDSEESY-P 203
Query: 527 YLGQDGYC 550
Y+ DG C
Sbjct: 204 YVAMDGIC 211
>UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;
Brugia malayi|Rep: Cahepsin L-like cysteine protease -
Brugia malayi (Filarial nematode worm)
Length = 371
Score = 116 bits (280), Expect = 3e-25
Identities = 72/196 (36%), Positives = 104/196 (53%), Gaps = 6/196 (3%)
Frame = +2
Query: 11 HQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDELAALRGR 178
++R +LEH +R + ++++ I +N R + +++NHLAD +E L G
Sbjct: 62 NKRDEEINLEH-RRFMTYLKNVKEIEKHNERYERNEETYELAINHLADMLPEEFRKLHGF 120
Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
+ + + + +++ LP DWR GAVT VKDQ CGSCW+F VGA+E
Sbjct: 121 QSRKITSKN---NFKNTIRMKINGPLPKSIDWRTSGAVTKVKDQGYCGSCWTFSAVGALE 177
Query: 359 GALFLHNGGHLVRLSQQALIDCSWG-FGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYL 532
G FL G LV LS Q L+DCS +GN GCDGG A+E+ +K G+ TE+ Y Y
Sbjct: 178 GQHFLQT-GKLVELSMQNLLDCSDDTYGNYGCDGGLMMEAFEYVVKNDGIDTEKSY-PYQ 235
Query: 533 GQDGYCHVDNVTAVTS 580
G C N T T+
Sbjct: 236 GYQNTCRYSNSTRGTT 251
>UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor;
n=6; Chrysomelinae|Rep: Cathepsin L-like proteinase
precursor - Phaedon cochleariae (Mustard beetle)
Length = 324
Score = 116 bits (280), Expect = 3e-25
Identities = 69/193 (35%), Positives = 102/193 (52%), Gaps = 6/193 (3%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDELA-A 166
K H R Y S E + R NIF+ +LR I +N + +++N +D TD+E
Sbjct: 27 KKTHARTYKSLREEKLRFNIFQDTLRQIAEHNVKYENGESTYYLAINKFSDITDEEFRDM 86
Query: 167 LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSFGT 343
L S P+ GL V +L+V PE DWR G V PV++Q CGSCW+ T
Sbjct: 87 LMKNEASRPNLEGL-------EVADLTVGAAPESIDWRSKGVVLPVRNQGECGSCWALST 139
Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYG 523
A+E + +G V LS Q L+DCS +GN+GC+GG +E++K +GL ++ DY
Sbjct: 140 AAAIESQSAIKSGSK-VPLSPQQLVDCSTSYGNHGCNGGFAVNGFEYVKDNGLESDADY- 197
Query: 524 GYLGQDGYCHVDN 562
Y G++ C ++
Sbjct: 198 PYSGKEDKCKAND 210
>UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Rep:
Cysteine protease - Solanum lycopersicum (Tomato)
(Lycopersicon esculentum)
Length = 345
Score = 116 bits (279), Expect = 4e-25
Identities = 73/199 (36%), Positives = 109/199 (54%), Gaps = 7/199 (3%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRA-NRGFTMSVNHLADRTDDE-LAALRGRR 181
+H R Y ++E +R IF++++++I S N+A N + + +N AD T E LA G
Sbjct: 45 RHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLN 104
Query: 182 YSGPSPHGLPFPYSKS---RVEELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSFGTVG 349
P+ + P P S + ++ +LS P + DWR GAVT VK Q CG CW+F VG
Sbjct: 105 I--PNSYLSPSPMSSTEFKKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 162
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGG 526
++EGA + G+L+ S+Q L+DC+ N GC+GG A+++ I+ G+ E DY
Sbjct: 163 SLEGAYKIAT-GNLMEFSEQELLDCT--TNNYGCNGGFMTNAFDFIIENGGISRESDY-E 218
Query: 527 YLGQDGYCHVDNVTAVTSI 583
YLGQ C TA I
Sbjct: 219 YLGQQYTCRSQEKTAAVQI 237
>UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6;
Magnoliophyta|Rep: Cysteine protease Cp5 - Actinidia
deliciosa (Kiwi)
Length = 509
Score = 116 bits (279), Expect = 4e-25
Identities = 68/191 (35%), Positives = 100/191 (52%), Gaps = 9/191 (4%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNR---ANRGFTMSVNHLADRTDDELAALRGR 178
KH + Y E EK+ FR +LRY+ N A+ G + +N AD +++E +
Sbjct: 57 KHGKVYKHGQEVEKKFQNFRDNLRYVMEKNGERGASGGHLVGLNKFADMSNEEFREVYVS 116
Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKL-----PPEHDWRLFGAVTPVKDQSVCGSCWSFGT 343
+ P+ + + + + P DWR +G VT VKDQ CGSCW+F +
Sbjct: 117 KVKKPTSKRMAIERRRQGKAAAAKAVAACDGPTSLDWRKYGIVTGVKDQGDCGSCWAFSS 176
Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDY 520
GA+EG L N G L+ LS+Q L+DC N+GC+GG A+EW+ + G+ TE DY
Sbjct: 177 TGAIEGINALAN-GDLISLSEQELVDCD--STNDGCEGGYMDYAFEWVMSNGGIDTETDY 233
Query: 521 GGYLGQDGYCH 553
Y G+DG C+
Sbjct: 234 -PYTGEDGTCN 243
>UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15)
[Contains: Cathepsin L heavy chain; Cathepsin L light
chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC
3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin
L light chain] - Sarcophaga peregrina (Flesh fly)
(Boettcherisca peregrina)
Length = 339
Score = 116 bits (279), Expect = 4e-25
Identities = 69/196 (35%), Positives = 103/196 (52%), Gaps = 7/196 (3%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNR----ANRGFTMSVNHLADRTDDELA-A 166
K++H++ YA+++E R+ IF ++ I +N+ + + +N AD E
Sbjct: 32 KLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADMLHHEFKET 91
Query: 167 LRGRRYS-GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGT 343
+ G ++ + + V +P DWR GAVT VKDQ CGSCW+F +
Sbjct: 92 MNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHCGSCWAFSS 151
Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDY 520
GA+EG F G LV LS+Q L+DCS +GNNGC+GG A+ +IK + G+ TE+ Y
Sbjct: 152 TGALEGQHF-RKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSY 210
Query: 521 GGYLGQDGYCHVDNVT 568
Y G D CH + T
Sbjct: 211 -PYEGIDDSCHFNKAT 225
>UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;
n=17; Magnoliophyta|Rep: Thiol protease aleurain-like
precursor - Arabidopsis thaliana (Mouse-ear cress)
Length = 358
Score = 116 bits (278), Expect = 5e-25
Identities = 69/183 (37%), Positives = 103/183 (56%), Gaps = 2/183 (1%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
++ ++Y S E + R ++F+++L I S N+ + +S+N AD T E +RY
Sbjct: 65 RYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEF-----QRYK 119
Query: 188 -GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
G + + ++ E +V P DWR G V+PVK+Q CGSCW+F T GA+E A
Sbjct: 120 LGAAQNCSATLKGSHKITEATV--PDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAA 177
Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLGQD 541
+ G + LS+Q L+DC+ F N GC GG +A+E+IK + GL TEE Y Y G+D
Sbjct: 178 -YHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAY-PYTGKD 235
Query: 542 GYC 550
G C
Sbjct: 236 GGC 238
>UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep:
Cysteine protease - Saprolegnia parasitica
Length = 523
Score = 115 bits (277), Expect = 7e-25
Identities = 70/185 (37%), Positives = 98/185 (52%), Gaps = 2/185 (1%)
Frame = +2
Query: 35 LEHEKRLNIFRQSLRYIHSNNR-ANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLP 211
LE R +F + + I ++N+ A+ FTM N + T DE LR PS
Sbjct: 42 LEWVHRFEVFILNDQRIEAHNKDASSSFTMGHNEYSHLTFDEFKKLRTGLRVSPSYIQSR 101
Query: 212 FPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHL 391
Y+ +P E DW G VTPVK+Q +CGSCW+F T GA+EGA F+ + L
Sbjct: 102 AKYALMAPAVNMTDVPNEMDWVEQGGVTPVKNQGMCGSCWAFSTTGAIEGAAFV-SSKQL 160
Query: 392 VRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLGQDGYCHVDNVT 568
V +S+Q L+DC G+ GC+GG A++W+K H GL EEDY Y ++G C +
Sbjct: 161 VSVSEQELVDCDHN-GDMGCNGGLMDNAFKWVKTHKGLCKEEDY-PYHAKEGTCALKKCK 218
Query: 569 AVTSI 583
VT +
Sbjct: 219 PVTKV 223
>UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep:
Cathepsin - Geodia cydonium (Sponge)
Length = 322
Score = 115 bits (276), Expect = 9e-25
Identities = 68/186 (36%), Positives = 101/186 (54%), Gaps = 2/186 (1%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAA-LRGR 178
K+K+ +QY+S E R ++ +L+++ + G+T+++N AD E + G
Sbjct: 23 KLKYNKQYSSQEEDYLRQRVWLSNLKFVEEFDSEREGYTVAMNEFADLDPREFVSHYNGL 82
Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
R + G P E++S LP DWR G VT VK+Q CGSCW+F G++E
Sbjct: 83 RRRPHTSSGEPCTLG----EDVSA-LPTTVDWRTKGYVTGVKNQGQCGSCWAFSATGSLE 137
Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYLG 535
G F + G LV LS+Q L+DCS GN GC+GG A+++ IK G+ TE Y Y+
Sbjct: 138 GQHF-NATGKLVSLSEQNLVDCSSAEGNEGCNGGLPDDAFKYVIKNGGIDTEASY-PYVA 195
Query: 536 QDGYCH 553
+D CH
Sbjct: 196 RDEKCH 201
>UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35;
Viridiplantae|Rep: Cysteine proteinase 15A precursor -
Pisum sativum (Garden pea)
Length = 363
Score = 114 bits (275), Expect = 1e-24
Identities = 76/203 (37%), Positives = 106/203 (52%), Gaps = 9/203 (4%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
K K + YA+ EH+ R +F+ +L I + NR T H + D A+ R+
Sbjct: 52 KSKFSKSYATKEEHDYRFGVFKSNL--IKAKLHQNRDPT--AEHGITKFSDLTASEFRRQ 107
Query: 182 YSGPSPHGLPFPYSKSRVEEL-SVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
+ G L P + L + LP + DWR GAVTPVKDQ CGSCW+F T GA+E
Sbjct: 108 FLGLKKR-LRLPAHAQKAPILPTTNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALE 166
Query: 359 GALFLHNGGHLVRLSQQALIDCSW-------GFGNNGCDGGEDFRAYEW-IKRHGLPTEE 514
GA +L G LV LS+Q L+DC G ++GC+GG A+E+ ++ G+ E+
Sbjct: 167 GAHYLAT-GKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLESGGVVQEK 225
Query: 515 DYGGYLGQDGYCHVDNVTAVTSI 583
DY Y G+DG C D V S+
Sbjct: 226 DY-AYTGRDGSCKFDKSKVVASV 247
>UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Oryza
sativa|Rep: Cysteine protease 1 precursor - Oryza sativa
subsp. japonica (Rice)
Length = 490
Score = 114 bits (275), Expect = 1e-24
Identities = 71/178 (39%), Positives = 101/178 (56%), Gaps = 5/178 (2%)
Frame = +2
Query: 38 EHEKRLNIFRQSLRYIHSNN-RANR--GFTMSVNHLADRTDDELAALRGRRYSGPSPHGL 208
EHE+R +F +L+++ ++N RA+ GF + +N AD T+ E A Y G +P G
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRAT----YLGTTPAGR 139
Query: 209 PFPYSKSRVEELSVKLPPEHDWRLFGAVT-PVKDQSVCGSCWSFGTVGAVEGALFLHNGG 385
++ + LP DWR GAV PVK+Q CGSCW+F V AVEG + G
Sbjct: 140 GRRVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTG- 198
Query: 386 HLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLGQDGYCHV 556
LV LS+Q L++C+ N+GC+GG A+ +I R+ GL TEEDY Y DG C++
Sbjct: 199 ELVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDY-PYTAMDGKCNL 255
>UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2;
Platyhelminthes|Rep: Cathepsin L-like proteinase -
Echinococcus multilocularis
Length = 338
Score = 114 bits (274), Expect = 2e-24
Identities = 72/199 (36%), Positives = 102/199 (51%), Gaps = 5/199 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIH-SNNRANRG---FTMSVNHLADRTDDELAAL 169
KV + + YA+ E R+ IF + ++ N R G ++ ++N AD T +E A
Sbjct: 34 KVANNKTYATLREEHLRMRIFINNYLFVRWHNERYYLGLETYSTALNAFADLTLEEFAEK 93
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSFGTV 346
P G+ S VE + L P+ DWR G VTP+KDQ CGSCW+F
Sbjct: 94 YLTLKQTPM-EGIWQDMSTQYVERPTRMLVPDSIDWRKKGLVTPIKDQGDCGSCWAFSAT 152
Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGG 526
GA+EG L G L+ LS+Q L+DCS GN GC+GG+ A+ + R+G +E DY
Sbjct: 153 GALEGQL-KRKTGKLISLSEQQLVDCSTYTGNEGCNGGDMNDAFRYWMRNGAESESDY-P 210
Query: 527 YLGQDGYCHVDNVTAVTSI 583
Y DG C ++ VT +
Sbjct: 211 YTAMDGKCKFNSSKVVTKV 229
>UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes
abbreviatus|Rep: Cathepsin L 2 - Diaprepes abbreviatus
(Sugarcane rootstalk borer weevil)
Length = 348
Score = 114 bits (274), Expect = 2e-24
Identities = 74/203 (36%), Positives = 106/203 (52%), Gaps = 20/203 (9%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANR----GFTMSVNHLADRTDDELAAL 169
K++H + Y S+ E+E R ++F ++L I+ +N+ + M++NHL D T DE +
Sbjct: 32 KLEHGKVYESESENEYRQSVFMENLFQINEHNKLYEMGLSSYQMAMNHLGDLTKDEFMRI 91
Query: 170 ---------RGRRYSGPSPH-GLPFPYSKSRVEEL-----SVKLPPEHDWRLFGAVTPVK 304
+ S P LP L V LP + DWR GAVTPVK
Sbjct: 92 YTVNMPQLPQSENLSDSEPWLDLPQDLQGFVTYALPTNLDEVDLPTDIDWRQKGAVTPVK 151
Query: 305 DQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW 484
+Q CGSCWSF GA+E A + L+ LS+Q L+DCS +GN+GC GG A+ +
Sbjct: 152 NQRNCGSCWSFSATGALE-AQWFKKTNKLISLSEQQLVDCSGRYGNHGCHGGWMHWAFGY 210
Query: 485 IKRH-GLPTEEDYGGYLGQDGYC 550
IK + G+ TE+ Y Y +DG C
Sbjct: 211 IKENGGIDTEQSY-PYTAKDGRC 232
>UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase
precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine
proteinase precursor - Heterodera glycines (Soybean cyst
nematode worm)
Length = 353
Score = 114 bits (274), Expect = 2e-24
Identities = 68/182 (37%), Positives = 99/182 (54%), Gaps = 5/182 (2%)
Frame = +2
Query: 38 EHEKRLNIFRQSLRYIHSNNRA-NRG---FTMSVNHLADRTDDELAALRGRRYSGPSPHG 205
E +R+N F ++ ++I ++N A +G F ++ NHL T + +RG +
Sbjct: 64 EKMERMNEFIKAKKFIDAHNLAFEKGEVSFKVAPNHLMHFTPAQYNRIRGLQMRSNRQR- 122
Query: 206 LPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGG 385
++ + + S LP + DWR GAVT VKDQ CGSCW+F GA+EGAL
Sbjct: 123 ----HNMATLAGNSSTLPEKLDWREKGAVTEVKDQGDCGSCWAFSATGAIEGALAQKKAS 178
Query: 386 HLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIK-RHGLPTEEDYGGYLGQDGYCHVDN 562
++ LS+Q L+DCS +GN GCDGG A+E+++ +GL TEE Y Y G C N
Sbjct: 179 KIISLSEQNLVDCSSKYGNEGCDGGLMDSAFEYVRDNNGLDTEESY-PYEAVTGKCQFKN 237
Query: 563 VT 568
T
Sbjct: 238 ET 239
>UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor;
n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1
precursor - Arabidopsis thaliana (Mouse-ear cress)
Length = 355
Score = 114 bits (274), Expect = 2e-24
Identities = 70/182 (38%), Positives = 97/182 (53%), Gaps = 1/182 (0%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
+H + Y S E R +FR++L +I N + + +N AD T +E R +
Sbjct: 57 EHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFKG-RYLGLA 115
Query: 188 GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGAL 367
P P + R +++ LP DWR GAV PVKDQ CGSCW+F TV AVEG
Sbjct: 116 KPQFSRKRQPSANFRYRDIT-DLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGIN 174
Query: 368 FLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYLGQDG 544
+ G+L LS+Q LIDC F N+GC+GG A+++ I GL E+DY YL ++G
Sbjct: 175 QI-TTGNLSSLSEQELIDCDTTF-NSGCNGGLMDYAFQYIISTGGLHKEDDY-PYLMEEG 231
Query: 545 YC 550
C
Sbjct: 232 IC 233
>UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18;
Magnoliophyta|Rep: Thiol protease aleurain precursor -
Arabidopsis thaliana (Mouse-ear cress)
Length = 358
Score = 114 bits (274), Expect = 2e-24
Identities = 68/182 (37%), Positives = 100/182 (54%), Gaps = 1/182 (0%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
++ ++Y + E + R +IF+++L I S N+ + + VN AD T E R
Sbjct: 65 RYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQ----RTKL 120
Query: 188 GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGAL 367
G + + +V E + LP DWR G V+PVKDQ CGSCW+F T GA+E A
Sbjct: 121 GAAQNCSATLKGSHKVTEAA--LPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAA- 177
Query: 368 FLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLGQDG 544
+ G + LS+Q L+DC+ F N GC+GG +A+E+IK + GL TE+ Y Y G+D
Sbjct: 178 YHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAY-PYTGKDE 236
Query: 545 YC 550
C
Sbjct: 237 TC 238
>UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor;
n=35; Fasciola|Rep: Cathepsin L-like proteinase
precursor - Fasciola hepatica (Liver fluke)
Length = 326
Score = 113 bits (273), Expect = 2e-24
Identities = 68/201 (33%), Positives = 105/201 (52%), Gaps = 7/201 (3%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRG---FTMSVNHLADRTDDELAA- 166
K + ++Y + + + R NI+ +++++I +N R + G +T+ +N D T +E A
Sbjct: 25 KRMYNKEY-NGADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAK 83
Query: 167 --LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFG 340
R S HG+P+ + V P + DWR G VT VKDQ CGSCW+F
Sbjct: 84 YLTEMSRASDILSHGVPYEANNRAV-------PDKIDWRESGYVTEVKDQGNCGSCWAFS 136
Query: 341 TVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
T G +EG ++ N + S+Q L+DCS +GNNGC GG AY+++K+ GL TE Y
Sbjct: 137 TTGTMEGQ-YMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQFGLETESSY 195
Query: 521 GGYLGQDGYCHVDNVTAVTSI 583
Y +G C + V +
Sbjct: 196 -PYTAVEGQCRYNKQLGVAKV 215
>UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3;
Curculionidae|Rep: Cysteine proteinase - Hypera postica
(alfalfa weevil)
Length = 324
Score = 113 bits (272), Expect = 3e-24
Identities = 69/198 (34%), Positives = 100/198 (50%), Gaps = 4/198 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDELAAL 169
K++H + Y + E KR NIF ++R I ++N + + +N D + +E +
Sbjct: 30 KLEHGKTYLNQAEESKRFNIFTDNVRAIEAHNALYEQGKVSYKKGINKFTDMSQEEFKTM 89
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
S P Y K+ VE +P DWR G VT VKDQ CGSCW+F G
Sbjct: 90 LTLSASR-KPTLETTSYVKTGVE-----IPSSVDWRKEGRVTGVKDQGDCGSCWAFSITG 143
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGY 529
+ EGA + G LV LS+Q LIDC + GCDGG ++++ + GL +EE Y Y
Sbjct: 144 STEGA-YARKSGKLVSLSEQQLIDCCTD-TSAGCDGGSLDDNFKYVMKDGLQSEESY-TY 200
Query: 530 LGQDGYCHVDNVTAVTSI 583
G+DG C + + VT +
Sbjct: 201 KGEDGACKYNVASVVTKV 218
>UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2;
Acanthamoeba|Rep: Cysteine proteinase - Acanthamoeba
healyi
Length = 330
Score = 113 bits (272), Expect = 3e-24
Identities = 68/161 (42%), Positives = 89/161 (55%), Gaps = 7/161 (4%)
Frame = +2
Query: 59 IFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSK---- 226
I+R ++ +NR N+ + +++N D T+ E L GL F YSK
Sbjct: 52 IYRWNVWRDEEHNRQNKSYFLAMNQFGDLTNAEFNRLF---------KGLAFDYSKHAKI 102
Query: 227 --SRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRL 400
+ E + +P E DWR GAVT VK+Q CGSCWSF T G+ EGA FL G LV L
Sbjct: 103 HTAAPEAPATGIPSEFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKT-GRLVSL 161
Query: 401 SQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDY 520
S+Q LIDCS +GNNGC+GG A+E+ I G+ TE Y
Sbjct: 162 SEQNLIDCSVSYGNNGCNGGLMDYAFEYIINNRGIDTEASY 202
>UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryza
sativa|Rep: Putative cysteine proteinase - Oryza sativa
subsp. japonica (Rice)
Length = 352
Score = 112 bits (270), Expect = 5e-24
Identities = 67/195 (34%), Positives = 100/195 (51%), Gaps = 3/195 (1%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRA-NRGFTMSVNHLADRTDDELAALRGRRY 184
+H R Y E +R +F+ ++ I +N A N+ + ++ N D TD E AA+ Y
Sbjct: 48 EHGRTYKDAAEKARRFRVFKANVDLIDRSNAAGNKRYRLATNRFTDLTDAEFAAM----Y 103
Query: 185 SGPSPHGLPFPYSKS--RVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
+G +P + + + R+ + P E DWR GAVT VK+Q CG CW+F TV AVE
Sbjct: 104 TGYNPANTMYAAANATTRLSSEDDQQPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAAVE 163
Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQ 538
G + G LV LS+Q L+DC+ N GC GG A++++ G T E Y G
Sbjct: 164 G-IHQITTGELVSLSEQQLLDCA---DNGGCTGGSLDNAFQYMANSGGVTTEAAYAYQGA 219
Query: 539 DGYCHVDNVTAVTSI 583
G C D ++ + +
Sbjct: 220 QGACQFDASSSASGV 234
>UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin
L - Misgurnus mizolepis (Mud loach)
Length = 337
Score = 112 bits (269), Expect = 7e-24
Identities = 73/193 (37%), Positives = 104/193 (53%), Gaps = 7/193 (3%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRAN----RGFTMSVNHLADRTDDELAAL 169
K H + Y E +R+ I+ ++LR I +N + + + +NH D +E
Sbjct: 33 KTWHGKNYHEKEEGWRRM-IWEKNLRKIQFHNLEHSMGIHTYRLGMNHFGDMNHEEF--- 88
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELS-VKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
R+ H + S E + +++P + DWR G VTPVKDQ CGSCW+F T
Sbjct: 89 --RQVMNGYKHKTERKFKGSLFMEPNFLEVPSKLDWREKGYVTPVKDQGECGSCWAFSTT 146
Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIK-RHGLPTEEDYG 523
GA+EG +F G LV LS+Q L+DCS GN GC+GG +A+++IK +GL +EE Y
Sbjct: 147 GAMEGQMF-RKQGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNNGLDSEEAY- 204
Query: 524 GYLGQDGY-CHVD 559
YLG D CH D
Sbjct: 205 PYLGTDDQPCHYD 217
>UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza
sativa|Rep: Cysteine proteinase-like - Oryza sativa
subsp. japonica (Rice)
Length = 360
Score = 112 bits (269), Expect = 7e-24
Identities = 72/195 (36%), Positives = 97/195 (49%), Gaps = 7/195 (3%)
Frame = +2
Query: 17 RQYASDLEHEKRLNIFRQSLRYIHSNNRA--NRGFTMSVNHLADRTDDELAALR-GRRYS 187
R YA E +R+ +F + + + NRA +R +T+ +N +D TDDE A G ++
Sbjct: 52 RAYADAAEKARRMEVFAANAERVDAANRAGGDRTYTLGLNQFSDLTDDEFAQTHLGYSWA 111
Query: 188 GPSP---HGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
P P HG + +P DWR GAVT VK+Q CGSCW+F V A E
Sbjct: 112 PPPPSHRHGHRAENGTAAAAADDTDVPDSVDWRARGAVTEVKNQRSCGSCWAFAAVAATE 171
Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLG 535
G + L G+LV LS+Q ++DC+ G N C GG+ A +I GL TE Y Y G
Sbjct: 172 GLVQLAT-GNLVSLSEQQVLDCTG--GANTCSGGDVSAALRYIAASGGLQTEAAY-AYGG 227
Query: 536 QDGYCHVDNVTAVTS 580
Q G C A S
Sbjct: 228 QQGACRAGGFAAPNS 242
>UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryza
sativa (japonica cultivar-group)|Rep: Putative cysteine
proteinase - Oryza sativa subsp. japonica (Rice)
Length = 357
Score = 112 bits (269), Expect = 7e-24
Identities = 66/173 (38%), Positives = 91/173 (52%), Gaps = 3/173 (1%)
Frame = +2
Query: 11 HQRQYASDLEHEKRLNIFRQSLRYIHSNNRAN--RGFTMSVNHLADRTDDELAALRGRRY 184
H R Y LE +R +FR + +I S N A + ++ N AD T++E A GR +
Sbjct: 56 HGRTYKDSLEKARRFEVFRTNALFIDSFNAAGGKKSPRLTTNKFADLTNEEFAEYYGRPF 115
Query: 185 SGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
S P G F Y R ++ P +WR GAVT VK+Q C SCW+F V AVEG
Sbjct: 116 STPVIGGSGFMYGNVRTSDV----PANINWRDRGAVTQVKNQKDCASCWAFSAVAAVEG- 170
Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDY 520
+ +LV LS Q L+DCS G N+GC+ G+ A+ +I + G+ E DY
Sbjct: 171 IHQIRSHNLVALSTQQLLDCSTGRNNHGCNRGDMDEAFRYITSNGGIAAESDY 223
>UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2;
Dictyostelium discoideum|Rep: Cysteine proteinase 2
precursor - Dictyostelium discoideum (Slime mold)
Length = 376
Score = 112 bits (269), Expect = 7e-24
Identities = 69/175 (39%), Positives = 96/175 (54%), Gaps = 3/175 (1%)
Frame = +2
Query: 5 VKHQRQYASDLEHEKRLNIFRQSLRYIHS-NNRANRGFTMSVNHLADRTDDELA-ALRGR 178
+K RQY+S E R +IF+ ++ Y+ + N++ + + +N+ AD T++E G
Sbjct: 41 LKFNRQYSSS-EFSNRYSIFKSNMDYVDNWNSKGDSQTVLGLNNFADITNEEYRKTYLGT 99
Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
R + S +G VE+L P DWR AVTP+KDQ CGSCWSF T G+ E
Sbjct: 100 RVNAHSYNGYD-GREVLNVEDLQTN-PKSIDWRTKNAVTPIKDQGQCGSCWSFSTTGSTE 157
Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDY 520
GA L LV LS+Q L+DCS N GCDGG A+++ IK G+ TE Y
Sbjct: 158 GAHALKT-KKLVSLSEQNLVDCSGPEENFGCDGGLMNNAFDYIIKNKGIDTESSY 211
>UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16;
Bromeliaceae|Rep: Fruit bromelain precursor - Ananas
comosus (Pineapple)
Length = 351
Score = 112 bits (269), Expect = 7e-24
Identities = 66/191 (34%), Positives = 109/191 (57%), Gaps = 6/191 (3%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHS-NNRANRGFTMSVNHLADRTDDELAALRGRRY 184
++ R Y D E +R IF+ ++++I + N+R +T+ +N D T E A +Y
Sbjct: 43 EYGRVYKDDDEKMRRFQIFKNNVKHIETFNSRNENSYTLGINQFTDMTKSEFVA----QY 98
Query: 185 SGPSPHGLPFPYSKSRV---EELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSFGTVGA 352
+G S LP + V +++++ P+ DWR +GAV VK+Q+ CGSCWSF +
Sbjct: 99 TGVS---LPLNIEREPVVSFDDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWSFAAIAT 155
Query: 353 VEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGY 529
VEG ++ G+LV LS+Q ++DC+ + GC GG +AY++ I +G+ TEE+Y Y
Sbjct: 156 VEG-IYKIKTGYLVSLSEQEVLDCAVSY---GCKGGWVNKAYDFIISNNGVTTEENY-PY 210
Query: 530 LGQDGYCHVDN 562
L G C+ ++
Sbjct: 211 LAYQGTCNANS 221
>UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3;
Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber
officinale (Ginger)
Length = 475
Score = 111 bits (267), Expect = 1e-23
Identities = 74/201 (36%), Positives = 110/201 (54%), Gaps = 7/201 (3%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRA-NRG---FTMSVNHLADRTDDELAA- 166
+VKH+ + RL +F+++LR++ +N A +RG + + +N AD T++E A
Sbjct: 56 RVKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNRFADLTNEEYRAR 115
Query: 167 -LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGT 343
LR G S G ++ R+ E V LP DWR GAV VK+Q CGSCW+F
Sbjct: 116 FLRDLSRLGRSTSGEIS--NQYRLREGDV-LPDSIDWREKGAVVAVKNQGRCGSCWAFAA 172
Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYG 523
+ AVEG + G L+ LS+Q L+DCS N GC+GG +RA+++I +G E++
Sbjct: 173 IAAVEGINQIVT-GDLISLSEQQLVDCS--TRNYGCEGGWPYRAFQYIINNGGVNSEEHY 229
Query: 524 GYLGQDGYCHVDNVTA-VTSI 583
Y G +G C+ A V SI
Sbjct: 230 PYTGTNGTCNTTKENAHVVSI 250
>UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza
sativa|Rep: Os01g0347600 protein - Oryza sativa subsp.
japonica (Rice)
Length = 343
Score = 110 bits (264), Expect = 3e-23
Identities = 70/188 (37%), Positives = 93/188 (49%), Gaps = 2/188 (1%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRGFTMSVNHLADRTDDELAALRGRRY 184
K + Y E E R IFR ++ +I + + +N AD T+DE A Y
Sbjct: 50 KFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVAT----Y 105
Query: 185 SGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
+G P P P R + + P DWR GAVT VKDQ CGSCW+F V A+EG
Sbjct: 106 TGAKP---PHPKEAPRPVD-PIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGL 161
Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI-KRHGLPTEEDYGGYLGQD 541
+ G L LS+Q L+DC +NGC GG RA+E + + G+ E DY Y G
Sbjct: 162 TKIRT-GQLTPLSEQELVDCD--TNSNGCGGGHTDRAFELVASKGGITAESDY-RYEGFQ 217
Query: 542 GYCHVDNV 565
G C VD++
Sbjct: 218 GKCRVDDM 225
>UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsin L
- Suberites domuncula (Sponge)
Length = 324
Score = 110 bits (264), Expect = 3e-23
Identities = 67/188 (35%), Positives = 103/188 (54%), Gaps = 5/188 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANR--GFTMSVNHLADRTDDELAALRG 175
K +H ++Y +LE +R I++ + ++I S+N + G+T+ +N D + E +
Sbjct: 27 KQEHSKEYTEELEELRRHTIWQSNKKFIDSHNSVSDKFGYTLEMNEFGDLSGVEFKQI-- 84
Query: 176 RRYSGPSPHGLPFPYSKSRVEELSVKLPPEH--DWRLFGAVTPVKDQSVCGSCWSFGTVG 349
Y+G + + + +++ S + P DWR G V+ VK+Q CGSCWSF G
Sbjct: 85 --YNG---YIMQERANDTKLFTASPYMEPAASVDWRQKGVVSEVKNQGQCGSCWSFSATG 139
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGG 526
++EG L G LV LS+Q L+DCS FGN+GC GG A+ + I HG+ TE Y
Sbjct: 140 SLEGQHALKMG-RLVSLSEQNLMDCSSRFGNHGCKGGIMDDAFRYVISNHGVDTESSY-P 197
Query: 527 YLGQDGYC 550
Y +DGYC
Sbjct: 198 YTAKDGYC 205
>UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra
erinaceieuropaei|Rep: Cysteine proteinase - Spirometra
erinaceieuropaei (Tapeworm)
Length = 336
Score = 110 bits (264), Expect = 3e-23
Identities = 70/199 (35%), Positives = 101/199 (50%), Gaps = 5/199 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRAN----RGFTMSVNHLADRTDDELAAL 169
K+ +++Y S E R F +L +I +N+ + + +N +D T E A
Sbjct: 36 KLAFKKEYFSSEEELHRKRAFFNNLDFIIRHNQRYYQQLESYAVRLNDFSDLTPGEFA-- 93
Query: 170 RGRRYSGPSPHGLPFPYSKSRVE-ELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
RY L K V L LP +WR GAVT VK+Q CGSCWSF
Sbjct: 94 --ERYLCLRGIVLTKLRRKEAVSVPLKENLPDSVNWRERGAVTSVKNQGQCGSCWSFSAN 151
Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGG 526
GA+EGA+ + G L LS+Q L+DCSW +GN GC+GG +A+++ +R+G+ E DY
Sbjct: 152 GAIEGAIQIKTGA-LRSLSEQQLMDCSWDYGNQGCNGGLMPQAFQYAQRYGVEAEVDY-R 209
Query: 527 YLGQDGYCHVDNVTAVTSI 583
Y +DG C V ++
Sbjct: 210 YTERDGVCRYRQDLVVANV 228
>UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis
thaliana|Rep: F9P14.12 protein - Arabidopsis thaliana
(Mouse-ear cress)
Length = 343
Score = 109 bits (263), Expect = 4e-23
Identities = 66/181 (36%), Positives = 94/181 (51%), Gaps = 1/181 (0%)
Frame = +2
Query: 11 HQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSG 190
H + Y E R I++ +++ I N + F ++ N AD T+ E A + G
Sbjct: 50 HSKLYGGRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEFKA----HFLG 105
Query: 191 PSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALF 370
+ L + V + + +P DWR GAVTP+++Q CG CW+F V A+EG
Sbjct: 106 LNTSSLRLHKKQRPVCDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINK 165
Query: 371 LHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLGQDGY 547
+ G+LV LS+Q LIDC G N GC GG A+E+IK + GL TE DY Y G +G
Sbjct: 166 IKT-GNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKTNGGLATETDY-PYTGIEGT 223
Query: 548 C 550
C
Sbjct: 224 C 224
>UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease;
n=23; Magnoliophyta|Rep: Senescence-specific cysteine
protease - Arabidopsis thaliana (Mouse-ear cress)
Length = 346
Score = 109 bits (263), Expect = 4e-23
Identities = 72/187 (38%), Positives = 99/187 (52%), Gaps = 5/187 (2%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYI-HSNN-RANRGFTMSVNHLADRTDDELAAL-RGR 178
KH R YA E R +F+ ++ I H N+ A R F ++VN AD T+DE ++ G
Sbjct: 44 KHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFRSMYTGF 103
Query: 179 RYSGPSPHGLPFPYSKSRVEELSV-KLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
+ S R + +S LP DWR GAVTP+K+Q CG CW+F V A+
Sbjct: 104 KGVSALSSQSQTKMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAI 163
Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIK-RHGLPTEEDYGGYL 532
EGA + G L+ LS+Q L+DC + GC+GG A+E IK GL TE +Y Y
Sbjct: 164 EGATQIKK-GKLISLSEQQLVDCD--TNDFGCEGGLMDTAFEHIKATGGLTTESNY-PYK 219
Query: 533 GQDGYCH 553
G+D C+
Sbjct: 220 GEDATCN 226
>UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960
precursor; n=2; Arabidopsis thaliana|Rep: Probable
cysteine proteinase At3g43960 precursor - Arabidopsis
thaliana (Mouse-ear cress)
Length = 376
Score = 109 bits (263), Expect = 4e-23
Identities = 72/182 (39%), Positives = 97/182 (53%), Gaps = 3/182 (1%)
Frame = +2
Query: 5 VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNR-ANRGFTMSVNHLADRTDDEL-AALRGR 178
V++ + Y E E+R IF+ +L+ I +N NR + +N +D T DE A+ G
Sbjct: 46 VENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQASYLGG 105
Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTP-VKDQSVCGSCWSFGTVGAV 355
+ S + Y + +E V LP E DWR GAV P VK Q CGSCW+F GAV
Sbjct: 106 KMEKKSLSDVAERY---QYKEGDV-LPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAV 161
Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLG 535
EG + G LV LS+Q LIDC G N GC GG A+E+IK +G ++ GY G
Sbjct: 162 EGINQITTG-ELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEVYGYTG 220
Query: 536 QD 541
+D
Sbjct: 221 ED 222
>UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza
sativa|Rep: Putative cysteine protease - Oryza sativa
subsp. japonica (Rice)
Length = 357
Score = 108 bits (260), Expect = 8e-23
Identities = 68/189 (35%), Positives = 95/189 (50%), Gaps = 3/189 (1%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRGFTMSVNHLADRTDDE-LAALRGRR 181
K + Y E E R +FR ++R+I S A + +N AD T+ E +A G +
Sbjct: 50 KFGKTYKCHGEKEHRFAVFRDNVRFIRSYRPEATYDSAVRINQFADLTNGEFVATYTGVK 109
Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
P+ H P P R + + +P DWR GAVT VKDQ CGS W+F V A+EG
Sbjct: 110 QPPPATHPHPHPEEAPRPVD-PIWMPCCIDWRFKGAVTGVKDQGACGSSWAFAAVAAMEG 168
Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFG-NNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQ 538
+ + G L LS+Q L+DC G G ++GC GG A++ + G T E Y G
Sbjct: 169 LMKIRT-GQLTPLSEQELVDCVDGGGDSDGCGGGHTDAAFQLVVDKGGITAESEYRYEGY 227
Query: 539 DGYCHVDNV 565
G C VD++
Sbjct: 228 KGRCRVDDM 236
>UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean
endopeptidase) (Cysteine proteinase)
(Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor
(EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase)
(Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
Vignain-2] - Vigna mungo (Rice bean) (Black gram)
Length = 362
Score = 108 bits (260), Expect = 8e-23
Identities = 63/177 (35%), Positives = 93/177 (52%), Gaps = 1/177 (0%)
Frame = +2
Query: 38 EHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDEL-AALRGRRYSGPSPHGLPF 214
E KR N+F+ ++ ++H+ N+ ++ + + +N AD T+ E + G + +
Sbjct: 55 EKHKRFNVFKANVMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHHKMFRGSQ 114
Query: 215 PYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLV 394
S + + E +P DWR GAVT VKDQ CGSCW+F T+ AVEG + LV
Sbjct: 115 HGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKT-NKLV 173
Query: 395 RLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYCHVDNV 565
LS+Q L+DC N GC+GG A+E+IK+ G T E Y Q+G C V
Sbjct: 174 SLSEQELVDCD-KEENQGCNGGLMESAFEFIKQKGGITTESNYPYTAQEGTCDESKV 229
>UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core
eudicotyledons|Rep: Cysteine proteinase -
Mesembryanthemum crystallinum (Common ice plant)
Length = 367
Score = 107 bits (258), Expect = 1e-22
Identities = 62/173 (35%), Positives = 91/173 (52%), Gaps = 2/173 (1%)
Frame = +2
Query: 38 EHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAAL--RGRRYSGPSPHGLP 211
E + R ++F+++++YI+ N+ ++ + + +N D T E A + G
Sbjct: 59 EKQNRFHVFKENVKYINEVNKMDKPYKLRLNQFGDLTPSEFARTYANSKIIEGTRNESGG 118
Query: 212 FPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHL 391
F Y +V++P DWR+ GAVTPVK+Q CG CW+F AVEG + G L
Sbjct: 119 FMYE-------NVEVPRSIDWRVKGAVTPVKNQGRCGGCWAFSAAAAVEGINQI-TTGQL 170
Query: 392 VRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYC 550
+ LS+Q LIDC N+GC GG RA+E+IK+ G T E Y Q G C
Sbjct: 171 ISLSEQQLIDCD--TQNSGCRGGTMGRAFEYIKQRGGITSEANYPYKAQAGMC 221
>UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila
melanogaster|Rep: LD36817p - Drosophila melanogaster
(Fruit fly)
Length = 352
Score = 107 bits (258), Expect = 1e-22
Identities = 71/171 (41%), Positives = 88/171 (51%), Gaps = 7/171 (4%)
Frame = +2
Query: 29 SDLEHEKRLNIFRQSLRYIH-SNNRANRG---FTMSVNHLADRTDDELAALRGRRYS--G 190
SD E R +IF + I SN A+ G F + VN LAD T E+A L G + S G
Sbjct: 50 SDEERVYRESIFAAKMSLITLSNKNADNGVSGFRLGVNTLADMTRKEIATLLGSKISEFG 109
Query: 191 PSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSV-CGSCWSFGTVGAVEGAL 367
+ +R S LP DWR G VTP Q V CG+CWSF T GA+EG L
Sbjct: 110 ERYTNGHINFVTAR-NPASANLPEMFDWREKGGVTPPGFQGVGCGACWSFATTGALEGHL 168
Query: 368 FLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
F G L LSQQ L+DC+ +GN GCDGG +E+I+ HG+ Y
Sbjct: 169 FRRTGV-LASLSQQNLVDCADDYGNMGCDGGFQEYGFEYIRDHGVTLANKY 218
>UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3;
Bilateria|Rep: Cathepsin L-like cysteine protease -
Neobenedenia melleni
Length = 335
Score = 107 bits (257), Expect = 2e-22
Identities = 63/181 (34%), Positives = 99/181 (54%), Gaps = 8/181 (4%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDELAAL 169
KVK+Q+ Y S + +L + ++L + +N + + +T+++NH+AD + +E AL
Sbjct: 31 KVKYQKDYLSSEDELNKLLTWSKNLETVRKHNELYAQGKKSYTLAMNHMADLSSEEFKAL 90
Query: 170 RGRRYSGPSPHGLPFPYS-KSRVEELSVKLPP--EHDWRLFGAVTPVKDQSVCGSCWSFG 340
Y P P K+ E +K P E DW G VT VK+Q+ CGSCW+F
Sbjct: 91 ----YLVPKFDATKVPRKGKAAGEHRQIKNDPPSEIDWVRKGHVTAVKNQAQCGSCWAFS 146
Query: 341 TVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEED 517
+ G++EGA+ G L+ S+Q L+DCS FGN+GC+GG ++ + I GL +E
Sbjct: 147 STGSIEGAV-KRATGKLISFSEQQLVDCSTAFGNHGCNGGIMDNSFNYLIHNKGLESEAS 205
Query: 518 Y 520
Y
Sbjct: 206 Y 206
>UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n=1;
Xenopus tropicalis|Rep: UPI00006A2275 UniRef100 entry -
Xenopus tropicalis
Length = 272
Score = 107 bits (256), Expect = 3e-22
Identities = 69/182 (37%), Positives = 100/182 (54%), Gaps = 6/182 (3%)
Frame = +2
Query: 23 YASDLEHEKRLNIFRQSLRYIHSNN-RANRG---FTMSVNHLADRTDDELAA-LRGRRYS 187
Y S E R I+ ++L++I +N + G + + +NHL D T +E+AA + G S
Sbjct: 1 YNSQEEERARRTIWEETLKFISVHNLEYSLGLHTYEVGMNHLGDMTGEEVAATMTGYTGS 60
Query: 188 GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQ-SVCGSCWSFGTVGAVEGA 364
G S + S E L PP DWR VTPV+DQ S C SC++F VGA+E
Sbjct: 61 GDSLANM----SHVPKEILEALAPPSIDWRTQNCVTPVRDQGSFCRSCYAFSAVGALE-C 115
Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDG 544
+ LV S Q L+DCS G GN+GC+GG+ +A++++K++G+ E Y Y GQ G
Sbjct: 116 QWKKKTVRLVTFSPQELVDCSDGEGNHGCNGGKIEKAFKYMKKYGVMEESAY-PYTGQKG 174
Query: 545 YC 550
C
Sbjct: 175 LC 176
>UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2;
Caenorhabditis|Rep: Putative uncharacterized protein -
Caenorhabditis elegans
Length = 383
Score = 107 bits (256), Expect = 3e-22
Identities = 61/173 (35%), Positives = 91/173 (52%), Gaps = 1/173 (0%)
Frame = +2
Query: 5 VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAAL-RGRR 181
+K R+Y S E E R IF +++ + N G + VN D TD+EL + + +
Sbjct: 87 LKFDRKYTSVEEFEYRYQIFLRNVIEFEAEEERNLGLDLDVNEFTDWTDEELQKMVQENK 146
Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
Y+ + P + E V P DWR G +TP+K+Q CGSCW+F TV +VE
Sbjct: 147 YT---KYDFDTPKFEGSYLETGVIRPASIDWREQGKLTPIKNQGQCGSCWAFATVASVEA 203
Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
+ G LV LS+Q ++DC NNGC GG A +++K +GL +E++Y
Sbjct: 204 QNAIKK-GKLVSLSEQEMVDCDG--RNNGCSGGYRPYAMKFVKENGLESEKEY 253
>UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61;
Leishmania|Rep: Cysteine proteinase 2 precursor -
Leishmania pifanoi
Length = 444
Score = 107 bits (256), Expect = 3e-22
Identities = 67/187 (35%), Positives = 97/187 (51%), Gaps = 5/187 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAA--LRG 175
K + R Y + E ++RL F ++L + + N + D ++ E AA L G
Sbjct: 42 KRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAARYLNG 101
Query: 176 RRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
Y + Y K+R + +V P DWR GAVTPVKDQ CGSCW+F VG +
Sbjct: 102 AAYFAAAKRHAAQHYRKARADLSAV--PDAVDWREKGAVTPVKDQGACGSCWAFSAVGNI 159
Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH---GLPTEEDYGG 526
EG +L G LV LS+Q L+ C N+GCDGG +A++W+ ++ L TE+ Y
Sbjct: 160 EGQWYL-AGHELVSLSEQQLVSCD--DMNDGCDGGLMLQAFDWLLQNTNGHLHTEDSY-P 215
Query: 527 YLGQDGY 547
Y+ +GY
Sbjct: 216 YVSGNGY 222
>UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Rep:
Cathepsin R precursor - Mus musculus (Mouse)
Length = 334
Score = 106 bits (255), Expect = 3e-22
Identities = 69/188 (36%), Positives = 100/188 (53%), Gaps = 5/188 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRAN----RGFTMSVNHLADRTDDELAAL 169
K+K+ + Y+ E KR+ ++ + L+ I +NR N GFTM +N D+TD+E +
Sbjct: 33 KIKYNKSYSLKEEKLKRV-VWEEKLKMIKLHNRENSLGKNGFTMKMNEFGDQTDEEFRKM 91
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
G S + E S+ LP DWR G VTPV+ Q C +CW+F G
Sbjct: 92 MIEISVWTHREGK----SIMKREAGSI-LPKFVDWRKKGYVTPVRRQGDCDACWAFAVTG 146
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGG 526
A+E A + G L LS Q L+DCS GNNGC GG+ + A++++ + GL +E Y
Sbjct: 147 AIE-AQAIWQTGKLTPLSVQNLVDCSKPQGNNGCLGGDTYNAFQYVLHNGGLESEATY-P 204
Query: 527 YLGQDGYC 550
Y G+DG C
Sbjct: 205 YEGKDGPC 212
>UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:
Cathepsin - Petromyzon marinus (Sea lamprey)
Length = 333
Score = 106 bits (254), Expect = 4e-22
Identities = 68/201 (33%), Positives = 100/201 (49%), Gaps = 9/201 (4%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDEL-AA 166
K + + Y S+ E R ++F Q+L+ + +N N F + +N +D E
Sbjct: 31 KSTYGKHYGSEQEDAHRRDVFEQNLKRVLQHNLLADEGNVSFHLGINKYSDLELHEYHEK 90
Query: 167 LRGRRYS---GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSF 337
+ GR ++ G G PFP LP + DWRL G VTPVK+Q +CGS W+F
Sbjct: 91 VVGRFWNLRNGTRRRGAPFPLRSMD------NLPEQVDWRLKGYVTPVKEQGLCGSSWAF 144
Query: 338 GTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEE 514
G++EG F G+L LS+Q L+DC+ + NNGC+GG RA ++ I +G+ +E
Sbjct: 145 SATGSLEGQHFAAT-GNLTSLSEQQLVDCTKSYYNNGCNGGRSERALQYIIDNNGIDSEL 203
Query: 515 DYGGYLGQDGYCHVDNVTAVT 577
Y Y DG C T
Sbjct: 204 SY-PYEHADGKCRFKPANVAT 223
>UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes
vastus|Rep: Cathepsin L - Aphrocallistes vastus
Length = 329
Score = 106 bits (254), Expect = 4e-22
Identities = 65/195 (33%), Positives = 96/195 (49%), Gaps = 3/195 (1%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
K+K+ R Y L+ E R I+ ++ Y+ N + ++ N AD T+ E +
Sbjct: 34 KLKYNRSYG--LDEELRKKIWANNMLYVKEFNAEGHSYKLAANQFADLTNLEYRQI---- 87
Query: 182 YSGPSPHGLPFPYSKSRVEELSVK---LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGA 352
Y G + +V + +K LP DWR G VTPVK+Q CGSCWSF G+
Sbjct: 88 YLGYDNEARLSRKREGKVFQRKMKDEDLPTTVDWRSKGVVTPVKNQGQCGSCWSFSATGS 147
Query: 353 VEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYL 532
+EG + G LV S+Q L+DCS GN+GC GG A+++ + + E DY Y
Sbjct: 148 LEGQ-YAIKSGKLVSFSEQELVDCSTSLGNHGCQGGLMDYAFKYWETNLAEKESDY-TYT 205
Query: 533 GQDGYCHVDNVTAVT 577
++G C + VT
Sbjct: 206 AKNGKCKYNAQLGVT 220
>UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;
n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
L-like proteinase" precursor - Diabrotica virgifera
virgifera (western corn rootworm)
Length = 315
Score = 105 bits (253), Expect = 6e-22
Identities = 66/192 (34%), Positives = 100/192 (52%), Gaps = 5/192 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNR----ANRGFTMSVNHLADRTDDELAAL 169
K H + Y + +E + R +F+ +L+ I +N + ++VN AD + E A+
Sbjct: 28 KATHNKSY-NVIEDKLRFAVFQDNLKKIEEHNAKYESGEETYYLAVNKFADWSSAEFQAM 86
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
R+ + + V + +V+ E DWR AV VKDQ CGSCW+F T G
Sbjct: 87 LARQMANKPKQS----FIAKHVADPNVQAVEEVDWR-DSAVLGVKDQGQCGSCWAFSTTG 141
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGY 529
++EG L +H V LS+Q L+DC N GC+GG A+ ++KRHGL +E Y Y
Sbjct: 142 SLEGQLAIHK-NQRVPLSEQELVDCDTS-RNAGCNGGLMTDAFNYVKRHGLSSESQY-AY 198
Query: 530 LGQDGYC-HVDN 562
G+D C +V+N
Sbjct: 199 TGRDDRCKNVEN 210
>UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11;
Entamoeba|Rep: Cysteine proteinase 2 precursor -
Entamoeba histolytica
Length = 315
Score = 105 bits (252), Expect = 8e-22
Identities = 62/195 (31%), Positives = 105/195 (53%), Gaps = 3/195 (1%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNH-LADRTDDELAALRGRRY 184
K+ + + + +E +R IF + +++ S N+ F +SV+ A T++E L +
Sbjct: 22 KNNKHFTA-IEKLRRRAIFNMNAKFVDSFNKIG-SFKLSVDGPFAAMTNEEYRTLLKSKR 79
Query: 185 SGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
+ +V+ L+++ P DWR G VTP++DQ+ CGSC++FG++ A+EG
Sbjct: 80 TTEE---------NGQVKYLNIQAPESVDWRKEGKVTPIRDQAQCGSCYTFGSLAALEGR 130
Query: 365 LFLHNGG--HLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQ 538
L + GG + + LS++ ++ C+ GNNGC+GG Y++I HG+ E DY Y G
Sbjct: 131 LLIEKGGDANTLDLSEEHMVQCTRDNGNNGCNGGLGSNVYDYIIEHGVAKESDY-PYTGS 189
Query: 539 DGYCHVDNVTAVTSI 583
D C NV + I
Sbjct: 190 DSTCKT-NVKSFAKI 203
>UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3;
Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays
(Maize)
Length = 493
Score = 105 bits (251), Expect = 1e-21
Identities = 76/195 (38%), Positives = 101/195 (51%), Gaps = 9/195 (4%)
Frame = +2
Query: 26 ASDLEHEKRLNIFRQSLRYIHSNNRAN----RGFTMSVNHLADRTDDELAA--LRGRRYS 187
A + + +RL +FR +LRYI ++N GF + + AD T +E A L G R
Sbjct: 84 AGEDDDARRLEVFRDNLRYIDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSRGR 143
Query: 188 GPSPHGLPFPYSKSRVEELS-VKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
+ G+ + R L+ +LP DWR GAV VKDQ CG CW+F V AVEG
Sbjct: 144 NGTAVGV---VGRRRYLPLAGEQLPDAVDWRERGAVAEVKDQGQCGGCWAFSAVAAVEGI 200
Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYLGQD 541
+ G L+ LS+Q LIDC F + GCDGG A+ + IK G+ TE DY + G D
Sbjct: 201 NKIVTGS-LISLSEQELIDCD-KFQDQGCDGGLMDNAFVFMIKNGGIDTEADY-PFTGHD 257
Query: 542 GYCHVD-NVTAVTSI 583
G C + T V SI
Sbjct: 258 GTCDLKLKNTRVVSI 272
>UniRef50_Q22RR8 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 332
Score = 105 bits (251), Expect = 1e-21
Identities = 69/185 (37%), Positives = 100/185 (54%), Gaps = 4/185 (2%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
KH Y + E R +FR +L+ I ++ N G T + D T +E +RY
Sbjct: 49 KHSITYKTIEEKLHRFAVFRDNLKKIEGHS--NYGITKFM----DLTSEEFQ----QRYL 98
Query: 188 GPSPHGLPFPYSKSRVE--ELSVKLPPEH--DWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
+ + KS + +L++KL + DW GAVTPVKDQ CGSCW+F GA+
Sbjct: 99 RLKTNTIKRQNFKSNPKNAQLNMKLGDDIIIDWTKKGAVTPVKDQEQCGSCWAFSATGAL 158
Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLG 535
E A F+ + G L LS+Q L+DCS +GN GCDGG+ A+++I + + TE++Y Y G
Sbjct: 159 ESATFI-STGTLPSLSEQELVDCSTSYGNEGCDGGDMDAAFKFIHDNNIATEKEY-TYRG 216
Query: 536 QDGYC 550
D C
Sbjct: 217 FDQKC 221
>UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain;
n=16; Chrysomelidae|Rep: Digestive cysteine protease
intestain - Leptinotarsa decemlineata (Colorado potato
beetle)
Length = 326
Score = 105 bits (251), Expect = 1e-21
Identities = 67/200 (33%), Positives = 106/200 (53%), Gaps = 6/200 (3%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRG---FTMSVNHLADRTDDELA-A 166
K H + Y + LE + R IF+++L I +N R ++G + + V AD T +E
Sbjct: 27 KQTHGKTYKNLLEEKTRFGIFQRNLIKIKEHNARYDKGEETYLLGVTRFADLTHEEFKDI 86
Query: 167 LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
L+G+ + P + P + E+L V P DW GAV VKDQ+ CGSCW+F
Sbjct: 87 LKGQIKNKPRLNATPTVFP----EDLEV--PDSIDWTEKGAVLEVKDQNPCGSCWAFSAT 140
Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGC-DGGEDFRAYEWIKRHGLPTEEDYG 523
GA+EG + N + LS+Q L+DCS +GN C +GG+ A+E+++ +G+ +E+ Y
Sbjct: 141 GALEGQNAILNNVK-ISLSEQQLLDCSAAYGNGNCKEGGDMSAAFEYVRDYGIQSEKSY- 198
Query: 524 GYLGQDGYCHVDNVTAVTSI 583
Y+ + C D + I
Sbjct: 199 PYIRKQTECQYDASKTILKI 218
>UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2;
Taeniidae|Rep: Cathepsin L-like cysteine proteinase -
Taenia solium (Pork tapeworm)
Length = 339
Score = 104 bits (250), Expect = 1e-21
Identities = 69/201 (34%), Positives = 101/201 (50%), Gaps = 7/201 (3%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRA-NRG---FTMSVNHLADRTDDELAAL 169
K++H R Y S E R +F ++L YI NR N G ++ +N AD E +
Sbjct: 39 KLQHGRVY-SGKEEAYRRGVFARNLLYIKGQNRRFNAGLESYSTGLNQFADLESSEFS-- 95
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEEL---SVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFG 340
R+ G P + R+ + + LP DWR VT VK+Q CGSCW+F
Sbjct: 96 --ERFLGTRPESR-VAGRRGRIWKALASAAGLPDTVDWRDKNLVTEVKNQGNCGSCWAFS 152
Query: 341 TVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
+ GA+EGA F G L+ LS+Q L+DCS GN+GC+GG A+++++ H + E Y
Sbjct: 153 STGALEGA-FAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYMSYAFKYLEEHFIEPESAY 211
Query: 521 GGYLGQDGYCHVDNVTAVTSI 583
Y DG C + V ++
Sbjct: 212 -PYRATDGPCRYNESLGVGTV 231
>UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15)
(Major excreted protein) (MEP) [Contains: Cathepsin L
heavy chain; Cathepsin L light chain]; n=19;
Eukaryota|Rep: Cathepsin L precursor (EC 3.4.22.15)
(Major excreted protein) (MEP) [Contains: Cathepsin L
heavy chain; Cathepsin L light chain] - Homo sapiens
(Human)
Length = 333
Score = 104 bits (250), Expect = 1e-21
Identities = 63/178 (35%), Positives = 94/178 (52%), Gaps = 5/178 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANR----GFTMSVNHLADRTDDELAAL 169
K H R Y + E +R ++ ++++ I +N+ R FTM++N D T +E +
Sbjct: 33 KAMHNRLYGMNEEGWRRA-VWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQV 91
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
+ G F E L + P DWR G VTPVK+Q CGSCW+F G
Sbjct: 92 MNGFQNRKPRKGKVFQ------EPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATG 145
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDY 520
A+EG +F G L+ LS+Q L+DCS GN GC+GG A+++++ + GL +EE Y
Sbjct: 146 ALEGQMF-RKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESY 202
>UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep:
Cysteine proteinase - Cryptobia salmositica
Length = 443
Score = 103 bits (248), Expect = 2e-21
Identities = 65/177 (36%), Positives = 87/177 (49%), Gaps = 4/177 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
K H R YAS E KR IF +++ NR N T N AD T +E
Sbjct: 29 KAAHARNYASPDEERKRFEIFAGNMKKAAVLNRKNPMATFGPNEFADMTSEEFQTRHNAA 88
Query: 182 YSGPSPHGLPFPYSKS-RVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
+ P +K+ EE+ + + DWRL GAVTPVK+Q CGSCWSF T G +E
Sbjct: 89 RHYAAAKARPPKNTKTFTAEEIKAAVGQQIDWRLKGAVTPVKNQGACGSCWSFSTTGNIE 148
Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRH--GLPTEEDY 520
G + G LV +S+Q L+ C ++GC+GG A+ W I H + TE +Y
Sbjct: 149 GQHAIAT-GQLVAVSEQELVSCD--PIDDGCNGGLMDNAFGWLISAHKGQIATEANY 202
>UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9;
Onchocercidae|Rep: Cathepsin L-like precursor - Brugia
pahangi (Filarial nematode worm)
Length = 395
Score = 103 bits (248), Expect = 2e-21
Identities = 64/185 (34%), Positives = 95/185 (51%), Gaps = 4/185 (2%)
Frame = +2
Query: 38 EHEKRLNIFRQS-LRYIHSNNRANRG---FTMSVNHLADRTDDELAALRGRRYSGPSPHG 205
E+ R+ IF + L N + +G +T ++N LAD TD+E G R +
Sbjct: 106 ENNFRMAIFESNELMTERINKKYEQGLVSYTTALNDLADLTDEEFMVRNGLRLPNQTDLR 165
Query: 206 LPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGG 385
S+ + S +LP + DWR GAVTPV++Q CGSC++F T A+E A G
Sbjct: 166 GKRQTSEFYRYDKSERLPDQVDWRTKGAVTPVRNQGECGSCYAFATAAALE-AYHKQMTG 224
Query: 386 HLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYCHVDNV 565
L+ LS Q ++DC+ GNNGC GG A+++ R+G+ E Y Y+G + C
Sbjct: 225 RLLDLSPQNIVDCTRNLGNNGCSGGYMPTAFQYASRYGIAMESRY-PYVGTEQRCRWQQS 283
Query: 566 TAVTS 580
AV +
Sbjct: 284 IAVVT 288
>UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis
thaliana|Rep: Cysteine proteinase - Arabidopsis thaliana
(Mouse-ear cress)
Length = 348
Score = 103 bits (247), Expect = 3e-21
Identities = 64/181 (35%), Positives = 93/181 (51%), Gaps = 10/181 (5%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANR-GFTMSVNHLADRTDDELAALRG--- 175
+ R Y+ + E R NIF+++L ++ + N N+ + + +N +D TD+E A
Sbjct: 41 RFNRVYSDETEKRNRFNIFKKNLEFVQNFNMNNKITYKVDINEFSDLTDEEFRATHTGLV 100
Query: 176 -----RRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFG 340
R S S P+ V + + DWR GAVTPVK Q CG CW+F
Sbjct: 101 VPEAITRISTLSSGKNTVPFRYGNVSDNGESM----DWRQEGAVTPVKYQGRCGGCWAFS 156
Query: 341 TVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEED 517
V AVEG + G LV LS+Q L+DC + N GC GG +A+E+ IK G+ TE++
Sbjct: 157 AVAAVEGITKI-TKGELVSLSEQQLLDCDRDY-NQGCRGGIMSKAFEYIIKNQGITTEDN 214
Query: 518 Y 520
Y
Sbjct: 215 Y 215
>UniRef50_Q22A69 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 330
Score = 103 bits (246), Expect = 4e-21
Identities = 63/191 (32%), Positives = 94/191 (49%)
Frame = +2
Query: 11 HQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSG 190
+ ++Y+S+ + RL+IF+++LR I N+ N + AD T +E A + Y G
Sbjct: 37 YNKKYSSEEHYNARLSIFKENLRRIELFNK-NDEAQHGITQFADLTHEEFADM----YLG 91
Query: 191 PSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALF 370
P L +K + P DW GAVTPVK+Q CGSCW+F T G++EG
Sbjct: 92 YKPQ-LRNSQAKVSLSSTPFTAPTAIDWTTKGAVTPVKNQGSCGSCWAFSTTGSIEGQYV 150
Query: 371 LHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYC 550
L +L S+Q L+DC + GC+GG A+ +++ L TE Y Y DG C
Sbjct: 151 LQLKQNLTSFSEQQLVDCDTK-EDQGCNGGLMDNAFTYLESAKLETESAY-PYTAVDGSC 208
Query: 551 HVDNVTAVTSI 583
+ V +
Sbjct: 209 KYNQSLGVVGV 219
>UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10;
Liliopsida|Rep: Putative cysteine proteinase - Oryza
sativa subsp. japonica (Rice)
Length = 416
Score = 102 bits (245), Expect = 5e-21
Identities = 63/163 (38%), Positives = 92/163 (56%), Gaps = 6/163 (3%)
Frame = +2
Query: 44 EKRLNIFRQSLRYIHSNNRANRG--FTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFP 217
E R +F+ + RYIH N+ ++G + + +N +D T +E AA +Y+G F
Sbjct: 43 ESRFEVFKANARYIHEFNQKSKGMSYVLGLNKFSDLTYEEFAA----KYTGVKVDASAFA 98
Query: 218 YS--KSRVEELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGH 388
+ S EEL V +PP DWRL GAVT VKDQ CGSCW F VGAVEG + G+
Sbjct: 99 TATTSSPDEELPVGVPPATWDWRLNGAVTDVKDQGQCGSCWVFSAVGAVEGINAIMT-GN 157
Query: 389 LVRLSQQALIDCSWGFGNNGC-DGGEDFRAYEWIKRHGLPTEE 514
L+ LS+Q ++DCS C GG+ A ++I ++G+ ++
Sbjct: 158 LLTLSEQQVLDCS---NTGDCLKGGDPRAALQYIVKNGVTLDQ 197
>UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1;
Rhipicephalus appendiculatus|Rep: Midgut cysteine
proteinase 4 - Rhipicephalus appendiculatus (Brown ear
tick)
Length = 345
Score = 102 bits (245), Expect = 5e-21
Identities = 63/190 (33%), Positives = 99/190 (52%), Gaps = 6/190 (3%)
Frame = +2
Query: 11 HQRQYASDLEHEKRLNIFRQSLRYIHS-NNRANRG---FTMSVNHLADRTDDELAALRGR 178
+ + Y + E R +FR++ ++ + + + G ++++VNH AD T DE+ A
Sbjct: 45 YNKTYGTSEETVYREQVFRRTFNFLRTVDEKFKNGTLLYSVAVNHFADMTPDEVVA---- 100
Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
Y+G P L P +WR G VTPVK+Q CGSCW+F + GA+E
Sbjct: 101 NYTGYKPPSAQQLAEIPLYAPLFGDTPEFIEWRENGFVTPVKNQGQCGSCWAFSSTGALE 160
Query: 359 GALFLHNGGHLVRLSQQALIDCS-WGFGNNGCDGGEDFRAYEWIK-RHGLPTEEDYGGYL 532
G +F L+ LS+Q L+DC+ +GNNGC+GG+ A+++++ GL TE Y
Sbjct: 161 GQVFKRT-RRLISLSEQNLMDCAGQRYGNNGCNGGQMPGAFQYVQDAGGLDTEARYPYRQ 219
Query: 533 GQDGYCHVDN 562
G + C N
Sbjct: 220 GTNFQCQFSN 229
>UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcoptes
scabiei type hominis|Rep: Cathepsin L-like protease -
Sarcoptes scabiei type hominis
Length = 245
Score = 102 bits (244), Expect = 7e-21
Identities = 68/188 (36%), Positives = 94/188 (50%), Gaps = 5/188 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRG----FTMSVNHLADRTDDELAAL 169
K K+ RQ+ + + R IF+++ YI +N + + VN D T+ E
Sbjct: 37 KAKYNRQFRTVYDELLRKLIFQRNYIYIRKHNEKYEAGLSTYELGVNQFTDLTNKEYNDQ 96
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
R H + + E++S LP E DW L V P+KDQ CGSCW+F V
Sbjct: 97 MNRL---KVKHDVQSEHVFDN-EDVS-DLPDEVDWTLKNVVAPIKDQKQCGSCWAFSAVA 151
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGG 526
++E L G LV LS+Q L+DCS G GN GCDGG A+E+ IK G+ TE+ Y
Sbjct: 152 SMESQNALKT-GQLVELSEQELVDCSVGEGNEGCDGGWMDSAFEFVIKADGIDTEKSY-P 209
Query: 527 YLGQDGYC 550
Y G + C
Sbjct: 210 YHGVNQVC 217
>UniRef50_Q22RC9 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 389
Score = 102 bits (244), Expect = 7e-21
Identities = 66/190 (34%), Positives = 100/190 (52%), Gaps = 9/190 (4%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFT-MSVNHLADRTDDELAALRGR 178
K +H++ Y + LE ++R IFRQ+L I N+ G + +D T +E +
Sbjct: 44 KAEHKKFY-NFLEEQRRFEIFRQNLDIISELNQVEEGTAEYGITQFSDMTTEEFKS---- 98
Query: 179 RYSGPSPHGLPFPYSKSR-VEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
+ PS + F S+ +++S P +DWR GAVTPVK+Q G+CW+F T G +
Sbjct: 99 QILIPSTYARNFTGSRYHGFQKISQDAPTSYDWRDHGAVTPVKNQGTVGTCWTFSTTGNI 158
Query: 356 EGALFLHNGGHLVRLSQQALIDC------SWGFGNNGCDGGEDFRAYEW-IKRHGLPTEE 514
EG FL G LV LS++ ++DC S G + G GG + A+++ I GLP+EE
Sbjct: 159 EGQWFL-AGNPLVSLSEEQIVDCDGSQEPSTGHADCGVFGGWPYLAFDYVINAGGLPSEE 217
Query: 515 DYGGYLGQDG 544
Y +G G
Sbjct: 218 TYPYCVGNGG 227
>UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin K
precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2);
n=2; Tribolium castaneum|Rep: PREDICTED: similar to
Cathepsin K precursor (Cathepsin O) (Cathepsin X)
(Cathepsin O2) - Tribolium castaneum
Length = 332
Score = 101 bits (243), Expect = 9e-21
Identities = 67/189 (35%), Positives = 97/189 (51%), Gaps = 6/189 (3%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRG----FTMSVNHLADRTDDELAAL 169
K H R + LE R ++F ++L + +N R + M VN +D TD+EL+ L
Sbjct: 31 KAMHARAFFDPLEETFRKSLFTKNLEIVEEHNERFRNGSETYEMGVNKFSDFTDEELSNL 90
Query: 170 RGRRYSGPSPHGLPFPYSKSRV-EELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
G + P P ++ + L + DWR G VTPVK+Q CGSCW+F T+
Sbjct: 91 TGLQV--PLEFEQPLNETEDPLLPSLGRGISASLDWRQRGGVTPVKNQGQCGSCWAFATI 148
Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYG 523
GA+E + + + LS+Q L+DC G G GC GG AY +I R+ G+ DY
Sbjct: 149 GAIESHYKIRH-KRAISLSEQQLVDCV-GRG-GGCGGGWIPTAYSYIARNKGVNYNRDY- 204
Query: 524 GYLGQDGYC 550
YLG++G C
Sbjct: 205 PYLGRNGKC 213
>UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3;
Bilateria|Rep: Cathepsin L-like cysteine proteinase -
Longidorus elongatus
Length = 358
Score = 101 bits (243), Expect = 9e-21
Identities = 63/192 (32%), Positives = 96/192 (50%), Gaps = 9/192 (4%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDELAA- 166
K+KH + Y + E R +F + + I +N F +S+N AD T+ E
Sbjct: 47 KLKHAKSYKTKDEELLRFQVFASNHKVIEQHNIEYEAGQHSFALSLNKFADMTNAEFRQR 106
Query: 167 LRGRRYSGPSPHGLPFPYSKS-RVEEL--SVKLPPEHDWRLFGAVTPVKDQSVCGSCWSF 337
+ G + P + + E+ +V +P DWR G VT VKDQ CGSCW+F
Sbjct: 107 MNGFKLPAKRKLAKSQPLKEDGMIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCWAF 166
Query: 338 GTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEE 514
G++EG + G LV LS+Q L+DC + GC+GG A+++++ + G+ TE
Sbjct: 167 SATGSLEGQHYKQT-GKLVSLSEQNLVDCDVNGDDEGCNGGYMDGAFQYVETNKGIDTEA 225
Query: 515 DYGGYLGQDGYC 550
Y Y G+DG C
Sbjct: 226 SY-PYKGRDGRC 236
>UniRef50_Q22W19 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 332
Score = 101 bits (243), Expect = 9e-21
Identities = 65/184 (35%), Positives = 96/184 (52%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
K+ ++A +++ + R +IF Q+ + N N G ++N A T DE L
Sbjct: 50 KYGFKFADEVQLQYRRSIFYQNKDLVEQLNSENNGTFHTLNAFAIYTKDEFNQLFKGYQK 109
Query: 188 GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGAL 367
H + YS L + P DWR AVTPVK+Q CGSCW+F TVG +EGA
Sbjct: 110 RQKSHLI---YS------LKGDVAPSIDWRQKNAVTPVKNQGQCGSCWAFSTVGGLEGAY 160
Query: 368 FLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGY 547
+ G+L S+Q ++DCS N GC+GG+ AY+++ ++G+ TE DY Y G +
Sbjct: 161 AIAT-GNLTSFSEQQIVDCS--KANAGCNGGDLPPAYKYVVQNGIETEADY-PYKGVNQK 216
Query: 548 CHVD 559
C D
Sbjct: 217 CAYD 220
>UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core
eudicotyledons|Rep: Chymopapain precursor - Carica
papaya (Papaya)
Length = 352
Score = 101 bits (243), Expect = 9e-21
Identities = 58/172 (33%), Positives = 88/172 (51%)
Frame = +2
Query: 5 VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRY 184
+KH + Y S E R IFR +L YI N+ N + + +N AD ++DE + +
Sbjct: 53 LKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKK-KYVGF 111
Query: 185 SGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
GL ++ + P DWR GAVTPVK+Q CGSCW+F T+ VEG
Sbjct: 112 VAEDFTGLEHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGI 171
Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
+ G+L+ LS+Q L+DC + GC GG + +++ +G+ T + Y
Sbjct: 172 NKIVT-GNLLELSEQELVDCD--KHSYGCKGGYQTTSLQYVANNGVHTSKVY 220
>UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F
like protease; n=1; Nasonia vitripennis|Rep: PREDICTED:
similar to cathepsin F like protease - Nasonia
vitripennis
Length = 1036
Score = 101 bits (242), Expect = 1e-20
Identities = 65/185 (35%), Positives = 92/185 (49%), Gaps = 3/185 (1%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGF-TMSVNHLADRTDDELAALR-GRR 181
K+++ Y + E E R IF+ +L I R G V D T E A G +
Sbjct: 737 KYKKMYHNKEEKEMRFQIFKDNLNLIEELQRNEMGTGRYGVTQFTDLTKAEFKARHLGLK 796
Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
+ S + +P P + ++LP ++DWR VTPVKDQ CGSCW+F G +EG
Sbjct: 797 PTLKSENDIPMPMATIP----DIELPSDYDWRHHNVVTPVKDQGSCGSCWAFSVTGNIEG 852
Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKR-HGLPTEEDYGGYLGQ 538
+ + G L+ LS+Q L+DC ++GC+GG AY I+ GL E DY Y +
Sbjct: 853 QYAIKH-GELLSLSEQELVDCD--KLDSGCNGGLPDTAYRAIEELGGLELESDY-PYDAE 908
Query: 539 DGYCH 553
D CH
Sbjct: 909 DEKCH 913
>UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease
containing protein; n=1; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 360
Score = 101 bits (242), Expect = 1e-20
Identities = 66/189 (34%), Positives = 97/189 (51%), Gaps = 3/189 (1%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAAL-RGR 178
KVK+ + Y D E + R ++F + I+ +N+ + VN AD T +E AL G
Sbjct: 49 KVKYAKTYKDDTEEQYRFSVFTNNYVEIYRHNKFLVFSKVGVNQFADLTHEEFKALYTGH 108
Query: 179 RYSGPSPHGLPFPYSKSRVEELSV-KLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
++S +K++ L LP DWR GA+TPVK Q+ CG CW+F TV ++
Sbjct: 109 KHSKDDDDD----DNKNKQPHLPTDNLPASFDWRDKGAITPVKVQNGCGGCWAFSTVQSI 164
Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYL 532
EG FL G L LS Q +IDC +GC GG+ A+ I+ + G+ TE +Y Y+
Sbjct: 165 EGLYFLKT-GKLESLSTQQVIDCC-RIDESGCLGGDPEPAFRCIQNNGGIMTETEY-PYI 221
Query: 533 GQDGYCHVD 559
+ C D
Sbjct: 222 AKQQSCKFD 230
>UniRef50_P25779 Cluster: Cruzipain precursor; n=54;
Trypanosoma|Rep: Cruzipain precursor - Trypanosoma cruzi
Length = 467
Score = 101 bits (242), Expect = 1e-20
Identities = 68/184 (36%), Positives = 91/184 (49%), Gaps = 3/184 (1%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
K KH R Y S E RL++FR++L + AN T V +D T +E R R
Sbjct: 42 KQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEF---RSRY 98
Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
++G + ++ V+ V P DWR GAVT VKDQ CGSCW+F +G VE
Sbjct: 99 HNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVEC 158
Query: 362 ALFLHNGGH-LVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI--KRHGLPTEEDYGGYL 532
FL GH L LS+Q L+ C ++GC GG A+EWI + +G ED Y
Sbjct: 159 QWFL--AGHPLTNLSEQMLVSCD--KTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYA 214
Query: 533 GQDG 544
+G
Sbjct: 215 SGEG 218
>UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate
cathepsin L; n=4; Danio rerio|Rep: Novel protein similar
to vertebrate cathepsin L - Danio rerio (Zebrafish)
(Brachydanio rerio)
Length = 334
Score = 101 bits (241), Expect = 2e-20
Identities = 62/179 (34%), Positives = 90/179 (50%), Gaps = 6/179 (3%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRA-NRG---FTMSVNHLADRTDDELAAL 169
K KH+ Y + E R I+ +++ I NN + G F M++N D T E L
Sbjct: 30 KKKHEISYDEESEDVHRKTIWETNMQKIWKNNNDFSFGLSMFKMAMNKYGDLTSVEYKRL 89
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKL--PPEHDWRLFGAVTPVKDQSVCGSCWSFGT 343
G + G + +++ L+ K D+R G VT VKDQ CGSCWSF T
Sbjct: 90 LGSKIKGTGNR--KGKITSAQMLRLNAKRLGVTNIDYRAKGYVTEVKDQGYCGSCWSFST 147
Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
GA+EG ++ H G LV LS+Q L+DCS +G GC G AY+++ + L + + Y
Sbjct: 148 TGAIEGQMYKHT-GRLVSLSEQQLVDCSRSYGTYGCSGAWMANAYDYVINNALESSDTY 205
>UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza
sativa|Rep: Os09g0497500 protein - Oryza sativa subsp.
japonica (Rice)
Length = 349
Score = 100 bits (239), Expect = 3e-20
Identities = 64/190 (33%), Positives = 97/190 (51%), Gaps = 8/190 (4%)
Frame = +2
Query: 5 VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRY 184
++H R Y E ++R ++R+++ + + N + G+ ++ N AD T++E A +
Sbjct: 36 IRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNGYKLADNKFADLTNEEFRA----KM 91
Query: 185 SGPSPHGLPFPYSKSRVEELSVK-------LPPEHDWRLFGAVTPVKDQSVCGSCWSFGT 343
G PH S + ++++ LP DWR GAV VK+Q CGSCW+F
Sbjct: 92 LGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDCGSCWAFSA 151
Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDY 520
V A+EG + N G LV LS+Q L+DC GC GG A+E+ + HGL TE Y
Sbjct: 152 VAAIEGINQIKN-GELVSLSEQELVDCD--DEAVGCGGGYMSWAFEFVVGNHGLTTEASY 208
Query: 521 GGYLGQDGYC 550
Y +G C
Sbjct: 209 -PYHAANGAC 217
>UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase
precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin
L-like cysteine proteinase precursor - Acanthoscelides
obtectus (Bean weevil)
Length = 321
Score = 100 bits (239), Expect = 3e-20
Identities = 61/156 (39%), Positives = 84/156 (53%), Gaps = 5/156 (3%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRG---FTMSVNHLADRTDDELAAL 169
K++H R Y + LE ++R IF+ +LR I +N R + G F M +N D T +E
Sbjct: 27 KIQHGRTYRTLLEEKRRFEIFKFNLRTIEEHNERYHNGEETFEMGINQFGDMTQEEFK-- 84
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
R + P +P P + +P DWR GAVT VK Q CGSCW+F VG
Sbjct: 85 --RMLALQKPQ-MPLPRGDEVSFDNVNDIPKTVDWREKGAVTEVKKQGNCGSCWAFSAVG 141
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSW-GFGNNGCD 454
++EG +FL NG L LS Q L+DC+ +GN GC+
Sbjct: 142 SIEGQVFLKNGS-LESLSAQNLVDCAGIEYGNFGCE 176
>UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin L
preproprotein; n=1; Monodelphis domestica|Rep:
PREDICTED: similar to cathepsin L preproprotein -
Monodelphis domestica
Length = 356
Score = 99 bits (238), Expect = 4e-20
Identities = 61/177 (34%), Positives = 94/177 (53%), Gaps = 4/177 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNR----ANRGFTMSVNHLADRTDDELAAL 169
K + + Y S+ E R ++ ++L+ I+ +NR + + M +N D TD E +
Sbjct: 33 KTTYGKNY-SEKEESFRRQVWEKNLKLINDHNRLFKEGKKSYFMGMNQFGDMTDKEFESR 91
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
R + P Y+ R + +LP DWR G VTP+++Q CG+CW+F T+G
Sbjct: 92 LNLRIA---PVRTRRNYTFKR--RIYYRLPKSVDWRTHGYVTPIRNQGECGACWAFSTIG 146
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
++EG LF G LV LS+Q LIDCS G C GG A ++I+R+G+ +E Y
Sbjct: 147 SLEGQLF-RKTGRLVELSKQMLIDCS---GYYTCMGGSLTGALDFIRRYGVVSERCY 199
>UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted,
possible transmembrane domain near N-terminus; n=4;
Cryptosporidium|Rep: Cryptopain-cysteine proteinase
secreted, possible transmembrane domain near N-terminus
- Cryptosporidium parvum Iowa II
Length = 401
Score = 99 bits (238), Expect = 4e-20
Identities = 57/186 (30%), Positives = 89/186 (47%), Gaps = 3/186 (1%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
K K+ + Y+S E +R I++Q++ +I + N + + +N D + +E A
Sbjct: 90 KKKYHKVYSSMEEENQRFEIYKQNMNFIKTTNSQGFSYVLEMNEFGDLSKEEFMARFTGY 149
Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEH--DWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
F S+ E + P + +W G V P+++Q CGSCW+F V A+
Sbjct: 150 IKDSKDDERVFKSSRVSASESEEEFVPPNSINWVEAGCVNPIRNQKNCGSCWAFSAVAAL 209
Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYL 532
EGA L LS+Q +DCS GN GCDGG A+++ IK L T +DY Y
Sbjct: 210 EGATCAQTNRGLPSLSEQQFVDCSKQNGNFGCDGGTMGLAFQYAIKNKYLCTNDDY-PYF 268
Query: 533 GQDGYC 550
++ C
Sbjct: 269 AEEKTC 274
>UniRef50_Q23EG5 Cluster: Papain family cysteine protease containing
protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 429
Score = 99 bits (238), Expect = 4e-20
Identities = 61/178 (34%), Positives = 94/178 (52%), Gaps = 7/178 (3%)
Frame = +2
Query: 38 EHEKRLNIFRQSLRYIHSNN-RANRGFTMSVNHLADRTDDELAALRG-RRYSGPSPHGLP 211
++ +R +F++ + I +N N+ +T ++ T++E++ L+G + S +
Sbjct: 53 QNSERFQLFKKRVAKIAEHNLNPNKKYTQKISKFTFYTNEEISKLKGSQNCSATAKENTR 112
Query: 212 FPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSV----CGSCWSFGTVGAVEGALFLHN 379
+ +LS ++P DWR G V+ VKDQ CGSCW+F GA+E L L
Sbjct: 113 I----LQTYDLS-EIPDYVDWREKGIVSSVKDQDAVGDDCGSCWTFSATGAIESHLALKT 167
Query: 380 GGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIK-RHGLPTEEDYGGYLGQDGYC 550
G LSQQ L+DC+ F N GCDGG RA+E+I G+ + DY Y G+DG C
Sbjct: 168 GKAPFNLSQQQLVDCAGKFDNQGCDGGLPSRAFEYIAYAGGIESSRDY-PYKGKDGKC 224
>UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum
aestivum|Rep: Thiol protease - Triticum aestivum (Wheat)
Length = 374
Score = 99.5 bits (237), Expect = 5e-20
Identities = 64/188 (34%), Positives = 91/188 (48%), Gaps = 7/188 (3%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANR-GFTMSVNHLADRTDDELAALRGRRY 184
KH + YA E +R +IFR+++ +I + NR R +T+ VN AD T +E A R
Sbjct: 56 KHGKSYAGVEEKLRRFDIFRRNVEFIEAANRDGRLSYTLGVNQFADLTHEEFLATHTSRR 115
Query: 185 SGPSPHGLPFPYSKSRVEELSVK-----LPPEHDWRLFGAVTPVKDQS-VCGSCWSFGTV 346
PS + + VE + + +P +W VTPVK+Q VCG+CW+F V
Sbjct: 116 VVPSEEMVITTRAGVVVEGANCQPAPNAVPRSINWVNQSKVTPVKNQGKVCGACWAFSAV 175
Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGG 526
+E A + G LS+Q LIDC + GC GE + AY W+ R+G
Sbjct: 176 ATIESAYAIAKRGEPPVLSEQELIDCD--TFDRGCTSGEMYNAYFWVLRNGGIANSSTYP 233
Query: 527 YLGQDGYC 550
Y DG C
Sbjct: 234 YKETDGKC 241
>UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gambiae
str. PEST|Rep: ENSANGP00000018713 - Anopheles gambiae
str. PEST
Length = 559
Score = 99.5 bits (237), Expect = 5e-20
Identities = 73/184 (39%), Positives = 86/184 (46%), Gaps = 3/184 (1%)
Frame = +2
Query: 11 HQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTM-SVNHLADRTDDELAALRGRRYS 187
H+RQYAS +EHE R NIFR +L I N+ RG V AD T E A G
Sbjct: 256 HRRQYASSMEHEMRFNIFRNNLFKIEQLNKFERGTAKYGVTKFADMTVAEYRAHTGLVVP 315
Query: 188 GPSPHGLPFPYSKSRVEELSV-KLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
S + V LP DWR GAVT VK+Q CGSCW+F VG VEG
Sbjct: 316 KHDRANHVGNRVASEEDVAGVGDLPRSFDWRDHGAVTEVKNQGSCGSCWAFSAVGNVEG- 374
Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKR-HGLPTEEDYGGYLGQD 541
L L S+Q LIDC +NGC GG A++ I++ GL E DY
Sbjct: 375 LHQIKTKKLESYSEQELIDCD--KVDNGCGGGYMDDAFKAIEQLGGLELENDYPYEAKAQ 432
Query: 542 GYCH 553
CH
Sbjct: 433 KSCH 436
>UniRef50_Q24HI2 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 356
Score = 99.1 bits (236), Expect = 7e-20
Identities = 56/167 (33%), Positives = 90/167 (53%), Gaps = 4/167 (2%)
Frame = +2
Query: 62 FRQSLRYIHSNNR-ANRGFTMSVNH-LADRTDDELAA--LRGRRYSGPSPHGLPFPYSKS 229
F++S+R + +N+ N +T+S++ A +D++ L + S + L P
Sbjct: 61 FKESVRRVREHNKKVNATYTLSIDSPFAFMSDEQFVTEYLGSQDCSATAELTLKKPMKIQ 120
Query: 230 RVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQ 409
+ +V++P +W+ V+PVKDQ CGSCW+F T GA+E + LS+Q
Sbjct: 121 NKK--NVQVPESINWKDLNKVSPVKDQQNCGSCWTFSTTGAIESHYAIFEDVEPTSLSEQ 178
Query: 410 ALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYC 550
LIDC+ F NNGC GG +A+E+IK +G + E+ Y+ QD C
Sbjct: 179 QLIDCAGAFNNNGCSGGLPSQAFEYIKYNGGISYENSYYYIAQDQEC 225
>UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19;
Bilateria|Rep: Cathepsin F precursor - Homo sapiens
(Human)
Length = 484
Score = 99.1 bits (236), Expect = 7e-20
Identities = 70/185 (37%), Positives = 92/185 (49%), Gaps = 2/185 (1%)
Frame = +2
Query: 5 VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTM-SVNHLADRTDDELAALRGRR 181
+ + R Y S E RL++F ++ +RG V +D T++E +
Sbjct: 192 ITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNT 251
Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
P G +KS V +L+ PPE DWR GAVT VKDQ +CGSCW+F G VEG
Sbjct: 252 LLRKEP-GNKMKQAKS-VGDLA---PPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEG 306
Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKR-HGLPTEEDYGGYLGQ 538
FL N G L+ LS+Q L+DC + C GG AY IK GL TE+DY Y G
Sbjct: 307 QWFL-NQGTLLSLSEQELLDCD--KMDKACMGGLPSNAYSAIKNLGGLETEDDY-SYQGH 362
Query: 539 DGYCH 553
C+
Sbjct: 363 MQSCN 367
>UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep:
Cathepsin K - Danio rerio (Zebrafish) (Brachydanio
rerio)
Length = 333
Score = 98.7 bits (235), Expect = 9e-20
Identities = 63/188 (33%), Positives = 98/188 (52%), Gaps = 5/188 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANR----GFTMSVNHLADRTDDELAAL 169
K+ H+R+Y E R I+ +++ +I ++N+ + + +NH D T +E+A
Sbjct: 34 KITHKREYNGLNEESIRRTIWEKNMLFIEAHNKEYELGIHTYDLGMNHFGDMTLEEVA-- 91
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
+ G P + ++ KLP D+R G VT VK+Q CGSCW+F +VG
Sbjct: 92 --EKVMGLQMPMYRDPANTFVPDDRVGKLPKSIDYRKLGYVTSVKNQGSCGSCWAFSSVG 149
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGG 526
A+EG L + G LV LS Q L+DC N+GC GG A+ ++ + G+ +EE Y
Sbjct: 150 ALEGQL-MKTKGQLVDLSPQNLVDCV--TENDGCGGGYMTNAFRYVSNNQGIDSEESY-P 205
Query: 527 YLGQDGYC 550
Y+G D C
Sbjct: 206 YVGTDQQC 213
>UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine
protease; n=1; Strongylocentrotus purpuratus|Rep:
PREDICTED: similar to cysteine protease -
Strongylocentrotus purpuratus
Length = 494
Score = 98.3 bits (234), Expect = 1e-19
Identities = 70/191 (36%), Positives = 98/191 (51%), Gaps = 2/191 (1%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTM-SVNHLADRTDDELAALRGRRY 184
+ RQ E+E R ++F Q++ + N+ +G AD T+ E L+
Sbjct: 165 REYRQNDGTNEYEYRYSVFVQNMLTVEMFNQFEQGTAKYGPTKFADMTEAEFRKLQ---- 220
Query: 185 SGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
SGP L K + +P E+DWR GAVTPVK+Q +CGSCW+F +G +EG
Sbjct: 221 SGP----LKKTGIKKQAAIPQGPVPEEYDWRTHGAVTPVKNQGMCGSCWAFSAIGNMEGQ 276
Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYE-WIKRHGLPTEEDYGGYLGQD 541
+ G L+ LS+Q L+DC G GC+GGE AYE IK G +EE Y Y G++
Sbjct: 277 WQIKK-GELISLSEQELVDCDKVDG--GCEGGEMSDAYEAIIKLGGAMSEEKY-PYRGEN 332
Query: 542 GYCHVDNVTAV 574
C N+T V
Sbjct: 333 EKCKF-NMTDV 342
>UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101,
whole genome shotgun sequence; n=2; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_101,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 306
Score = 98.3 bits (234), Expect = 1e-19
Identities = 58/173 (33%), Positives = 86/173 (49%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
K K+Q +Y S E E R IF+Q+ Y N +T+ +N A TD+E +
Sbjct: 34 KQKYQTRYTSQFEDEYRFEIFKQNYNYYQEVNSRQSSYTLGINQFATLTDEEFEQI---- 89
Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
Y G + P +S ++ S+ LP DW + PVK+Q CGS WSF VGA E
Sbjct: 90 YLGRADSS-PIEIDES-ID--SINLPESVDWS--SKMNPVKNQGTCGSGWSFSAVGAFEA 143
Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
G H + S+Q L+DC ++GCDGG +A +++ ++G E +Y
Sbjct: 144 FFIFVKGTHF-QYSEQNLVDCD--TNSHGCDGGYPAKAIDYLNKNGAFLESEY 193
>UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza
sativa|Rep: Os09g0381400 protein - Oryza sativa subsp.
japonica (Rice)
Length = 362
Score = 97.9 bits (233), Expect = 2e-19
Identities = 66/188 (35%), Positives = 97/188 (51%), Gaps = 7/188 (3%)
Frame = +2
Query: 11 HQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRGFTMSVNHLADRTDDELAALRGRRYS 187
H R Y S E +R +++R++ +I + N R + + ++ N AD T++E A Y+
Sbjct: 58 HNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEEEFLATYTGYYA 117
Query: 188 GPSPHGLPFPYSKSRVEELS----VKLPPEHDWRLFGAVTPVKDQ-SVCGSCWSFGTVGA 352
G P + + + S V +P DWR GAV P K Q S C SCW+F T
Sbjct: 118 GDGPVDDSVITTGAGDVDASFSYRVDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAAT 177
Query: 353 VEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGY 529
+E +L + G LV LS+Q L+DC G GC+ G RAY+W ++ GL TE DY Y
Sbjct: 178 IE-SLNMIKTGKLVSLSEQQLVDCDSYDG--GCNLGSYGRAYKWVVENGGLTTEADY-PY 233
Query: 530 LGQDGYCH 553
+ G C+
Sbjct: 234 TARRGPCN 241
>UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1;
Brugia malayi|Rep: Cathepsin F-like cysteine proteinase
- Brugia malayi (Filarial nematode worm)
Length = 461
Score = 97.9 bits (233), Expect = 2e-19
Identities = 63/195 (32%), Positives = 93/195 (47%), Gaps = 3/195 (1%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTM-SVNHLADRTDDELAA--LRGR 178
K +R+Y+S E R I+ Q++ + +G + +D T +E L
Sbjct: 165 KFKREYSSIEEQLDRFRIYLQNMNFAKKLQFEEKGTAIYGATKFSDMTAEEFQKIMLPSI 224
Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
+ +G+ F + + + LP + DWR G VTPVKDQ CGSCW+F G +E
Sbjct: 225 WWDRVESNGITFNLNDFNLSIYN--LPSKFDWRTEGVVTPVKDQGSCGSCWAFSVTGNIE 282
Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQ 538
+L+ G L+ LS+Q LIDC + GC+GG A+ IKR G ED Y +
Sbjct: 283 -SLWAIKTGKLISLSEQELIDCD--VIDKGCNGGLPINAFREIKRMGGLEPEDQYPYEAK 339
Query: 539 DGYCHVDNVTAVTSI 583
+G CH+ SI
Sbjct: 340 NGTCHLVRAQIAVSI 354
>UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase;
n=21; Bilateria|Rep: Cathepsin L-like cysteine
proteinase - Globodera pallida
Length = 379
Score = 97.9 bits (233), Expect = 2e-19
Identities = 70/182 (38%), Positives = 102/182 (56%), Gaps = 9/182 (4%)
Frame = +2
Query: 2 KVKHQRQ-YAS-DLEHEKRLNIFRQSLRYIHSNNRAN-RG---FTMSVNHLADRTDDELA 163
K KH R+ YA D+E+E+ L + + ++I +N+A G F + NH+AD E
Sbjct: 74 KQKHGRKAYADQDVENERMLT-YLSAKQFIDKHNQAYIEGKVTFRVGENHIADLPFSEYK 132
Query: 164 ALRG-RRYSGPSPHGLPFPYSKSRVEELSV-KLPPEHDWRLFGAVTPVKDQSVCGSCWSF 337
L G RR G + + + + ++V LP DWR G VT VK+Q +CGSCW+F
Sbjct: 133 KLNGYRRLLGDNLRR----NASTFLAPMNVGDLPESVDWRDKGWVTEVKNQGMCGSCWAF 188
Query: 338 GTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIK-RHGLPTEE 514
+ GA+E A G L+ LS+Q LIDCS +GN GC+GG A+++IK +G+ E
Sbjct: 189 SSTGALE-AQHARQTGQLISLSEQNLIDCSKKYGNMGCNGGIMDNAFQYIKDNNGVDKEL 247
Query: 515 DY 520
DY
Sbjct: 248 DY 249
>UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like
cysteine peptidase; n=17; Trichomonas vaginalis|Rep:
Clan CA, family C1, cathepsin L-like cysteine peptidase
- Trichomonas vaginalis G3
Length = 318
Score = 97.9 bits (233), Expect = 2e-19
Identities = 60/183 (32%), Positives = 92/183 (50%), Gaps = 3/183 (1%)
Frame = +2
Query: 38 EHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFP 217
E+ RL ++ + R + +NRAN G+ +++NHL+ T E L G + +
Sbjct: 37 EYHFRLGVYNTNKRRVQEHNRANSGYQLTMNHLSCMTPSEYKVLLGHKQTKKI------- 89
Query: 218 YSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVR 397
+ + +P DWR V P+KDQ+ CGSCW+F V A E L G L+
Sbjct: 90 --EGEAKIFKGDVPDAVDWRNAKIVNPIKDQAQCGSCWAFSVVQAQESQWALKK-GQLLS 146
Query: 398 LSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH--GL-PTEEDYGGYLGQDGYCHVDNVT 568
L++Q ++DC GCDGG+++ AY+++ +H GL E DY Y +DG C
Sbjct: 147 LAEQNMVDCV--DTCYGCDGGDEYLAYDYVIKHQKGLWMLETDY-PYTARDGSCKFKAAK 203
Query: 569 AVT 577
VT
Sbjct: 204 GVT 206
>UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella
histriomuscorum|Rep: Cathepsin H - Oxytricha trifallax
(Sterkiella histriomuscorum)
Length = 366
Score = 97.5 bits (232), Expect = 2e-19
Identities = 57/159 (35%), Positives = 79/159 (49%), Gaps = 1/159 (0%)
Frame = +2
Query: 83 IHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPP 262
I N+ + +N +D TD+E Y+ + KS + +P
Sbjct: 83 IKHNSDGTNTYKKGLNAFSDMTDEEFFDY----YNIKAEQNCSATNRKS-FGNSNANIPT 137
Query: 263 EHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGN 442
E DWR FG V+PVK+Q CGSCW+F TVG VE L G LS+Q L+DC+ + N
Sbjct: 138 EWDWRTFGVVSPVKNQGKCGSCWTFSTVGCVESHYLLKYGA-FRNLSEQQLVDCAGDYDN 196
Query: 443 NGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLGQDGYCHV 556
+GC GG A+E+IK + GL E Y Y +G C +
Sbjct: 197 HGCSGGLPSHAFEYIKDNGGLALETTY-PYKAANGQCSI 234
>UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2;
Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio
molitor (Yellow mealworm)
Length = 336
Score = 97.5 bits (232), Expect = 2e-19
Identities = 62/193 (32%), Positives = 90/193 (46%), Gaps = 7/193 (3%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANR----GFTMSVNHLADRTDDELAAL 169
K + R Y + E R IF++ L +N R +T+ VN D T +E+ A
Sbjct: 31 KTTYARSYVNAKEETFRKQIFQKKLETFEEHNEKYRQGLVSYTLGVNLFTDMTPEEMKAY 90
Query: 170 RGRRYSGPSPH--GLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGT 343
H G+P + SV+ P DWR G V+PVK+Q CGSCW+F +
Sbjct: 91 THGLIMPADLHKNGIPIKTREDLGLNASVRYPASFDWRDQGMVSPVKNQGSCGSCWAFSS 150
Query: 344 VGAVEGALFLHNG-GHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
GA+E + + NG G+ +S+Q L+DC GC GG A+ ++ ++G E
Sbjct: 151 TGAIESQMKIANGAGYDSSVSEQQLVDCV--PNALGCSGGWMNDAFTYVAQNGGIDSEGA 208
Query: 521 GGYLGQDGYCHVD 559
Y DG CH D
Sbjct: 209 YPYEMADGNCHYD 221
>UniRef50_Q23RT7 Cluster: Papain family cysteine protease containing
protein; n=7; Hymenostomatida|Rep: Papain family
cysteine protease containing protein - Tetrahymena
thermophila SB210
Length = 387
Score = 96.7 bits (230), Expect = 4e-19
Identities = 63/185 (34%), Positives = 88/185 (47%), Gaps = 11/185 (5%)
Frame = +2
Query: 38 EHEKRLNIFRQSLRYIHS-NNRANRGFTMSVNHLADRTDDELAALR---GRRYSGPSPHG 205
E+ +R IF Q L+ I + N+ + G+ +N DRT +EL + +
Sbjct: 57 EYNQRKRIFEQKLKEIKAFNSNSENGYKKGINQFTDRTAEELRETTLGYSKTVKNAANKQ 116
Query: 206 LPFPYSKSRVEELSVK-LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNG 382
F K+ ++++VK LP DWR G VTPVKDQ CGSCW+F T +E +
Sbjct: 117 NMFRNLKTS-DKINVKDLPKSVDWRDAGVVTPVKDQGHCGSCWAFATTAVIESYAAIAT- 174
Query: 383 GHLVRLSQQALIDCSWGF----GNNGCDGGEDFRAYEWIKRHGLPTE--EDYGGYLGQDG 544
G L LS Q L+ C G GC+G AY +++ GL +E Y Y GQ G
Sbjct: 175 GQLKTLSTQQLVSCVQNSYQCGGQGGCNGAVSELAYNYVQLFGLTSEYKYSYSSYQGQTG 234
Query: 545 YCHVD 559
C D
Sbjct: 235 NCTFD 239
>UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicatein
a3 - Lubomirskia baicalensis
Length = 344
Score = 96.7 bits (230), Expect = 4e-19
Identities = 68/198 (34%), Positives = 100/198 (50%), Gaps = 5/198 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYI-HSNNRANR-GFTMSVNHLADRTDDELAALRG 175
K HQR Y S L+ +R +I+ + +YI H N A+ G+T+++N D E R
Sbjct: 48 KGHHQRSYESQLQEMERHSIWVANKKYIEHHNANADLFGYTLAMNGFGDLMSAEFTE-RY 106
Query: 176 RRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
+ GL S V DWR G VT V+ Q CGS ++F GA+
Sbjct: 107 LTHKHSQRSGLQTFESPK-----GVTYADSLDWRTRGVVTSVQSQGQCGSSYAFAAAGAL 161
Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYL 532
EGA L LV LS+Q +IDCS +GN+GC GG+ + A+++ + G+ TE Y Y
Sbjct: 162 EGATALA-ADKLVALSEQNIIDCSVPYGNHGCSGGDVYTAFKYVVDNGGIDTESSY-PYK 219
Query: 533 GQDGYCHVD--NVTAVTS 580
G+ C + NV A+++
Sbjct: 220 GKKSSCQYNSKNVGAIST 237
>UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep:
Cathepsin L - Kudoa thyrsites
Length = 300
Score = 96.7 bits (230), Expect = 4e-19
Identities = 63/188 (33%), Positives = 101/188 (53%), Gaps = 5/188 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSV-NHLADRTDDELAA---L 169
K++H + S E +RL F+++ ++IH+ N N + NHL+ + +E A L
Sbjct: 15 KLEHNIIFDSIEEERRRLCNFKENHQFIHNFNLHNTHYHYCRHNHLSHWSHEEYMAWLTL 74
Query: 170 RGRRYSGPSP-HGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
+ + +P HG+ P ++ +++ LP DW+ G VT VK+Q CGSCWSF
Sbjct: 75 KPKLPVVSTPTHGIT-P-KETATKDIKSTLPSSVDWKALGKVTSVKNQGHCGSCWSFSAA 132
Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGG 526
GA+E A + G LV S+Q L+DCS N+GC+GG A+ ++ +G+ +DY
Sbjct: 133 GAIESAYAIKT-GELVNFSEQQLVDCS--TENHGCNGGLPEIAFLYVINNGIMKLKDY-P 188
Query: 527 YLGQDGYC 550
Y + G C
Sbjct: 189 YTAKQGTC 196
>UniRef50_P25774 Cluster: Cathepsin S precursor; n=78;
Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens
(Human)
Length = 331
Score = 96.7 bits (230), Expect = 4e-19
Identities = 63/194 (32%), Positives = 97/194 (50%), Gaps = 7/194 (3%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRAN----RGFTMSVNHLADRTDDELAAL 169
K + +QY E R I+ ++L+++ +N + + + +NHL D T +E+ +L
Sbjct: 32 KKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSL 91
Query: 170 RGR-RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
R + + + +R+ LP DWR G VT VK Q CG+CW+F V
Sbjct: 92 MSSLRVPSQWQRNITYKSNPNRI------LPDSVDWREKGCVTEVKYQGSCGACWAFSAV 145
Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSW-GFGNNGCDGGEDFRAYEW-IKRHGLPTEEDY 520
GA+E L L G LV LS Q L+DCS +GN GC+GG A+++ I G+ ++ Y
Sbjct: 146 GALEAQLKLKT-GKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASY 204
Query: 521 GGYLGQDGYCHVDN 562
Y D C D+
Sbjct: 205 -PYKAMDQKCQYDS 217
>UniRef50_O16454 Cluster: Temporarily assigned gene name protein
196; n=4; Bilateria|Rep: Temporarily assigned gene name
protein 196 - Caenorhabditis elegans
Length = 477
Score = 96.3 bits (229), Expect = 5e-19
Identities = 64/189 (33%), Positives = 93/189 (49%), Gaps = 6/189 (3%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTM-SVNHLADRTDDELAALRGRRY 184
+H+++Y + E KR +F+++ + I + +G + +D T E + Y
Sbjct: 180 RHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFKKIM-LPY 238
Query: 185 SGPSPHGLPFPYSKSRVEELSVK-----LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
P +P ++ E+ V LP DWR GAVT VK+Q CGSCW+F T G
Sbjct: 239 QWEQP---VYPMEQANFEKHDVTINEEDLPESFDWREKGAVTQVKNQGNCGSCWAFSTTG 295
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGY 529
VEGA F+ LV LS+Q L+DC + GC+GG AY+ I R G ED Y
Sbjct: 296 NVEGAWFIAK-NKLVSLSEQELVDCD--SMDQGCNGGLPSNAYKEIIRMGGLEPEDAYPY 352
Query: 530 LGQDGYCHV 556
G+ CH+
Sbjct: 353 DGRGETCHL 361
>UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, whole
genome shotgun sequence; n=1; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_23,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 321
Score = 95.9 bits (228), Expect = 6e-19
Identities = 58/181 (32%), Positives = 87/181 (48%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
K+ ++Y + E R +I++Q++ I N N + +N D TD E + Y
Sbjct: 44 KYNKRYPTQNEQIYRFSIYQQNIMKIEDFNSQNNSYKQKINKFGDLTDQEFLTI----YL 99
Query: 188 GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGAL 367
+P + E + E DW G V +KDQ CGSCW+F VGA+E
Sbjct: 100 NLQ---MPARVKNIQKNEEPFLVQEEVDWVQKGKVPAIKDQGDCGSCWAFSAVGALEINT 156
Query: 368 FLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGY 547
+ +V LS+Q L+DC+ +GN GCDGG A ++I G+ + Y Y G+DG
Sbjct: 157 KI-QFNEIVDLSEQDLVDCAGPYGNAGCDGGWMESALDYIIDSGIAETKVY-PYKGEDGI 214
Query: 548 C 550
C
Sbjct: 215 C 215
>UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4;
Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis
zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa
zeasingle nucleocapsid nuclear polyhedrosis virus)
Length = 367
Score = 95.9 bits (228), Expect = 6e-19
Identities = 63/200 (31%), Positives = 101/200 (50%), Gaps = 15/200 (7%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRAN------------RGFTMSVNHLADRTD 151
++ + Y E++ R N+F+ +L I+S NR N VN +D+T
Sbjct: 63 QYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNKFSDKTP 122
Query: 152 DELAALRGRRYSGPSPHGLPFPYSKSRVEELS--VKLPPEHDWRLFGAVTPVKDQSVCGS 325
DE+ + S H + ++R+ + + ++LP +DWR VTP+KDQ VCGS
Sbjct: 123 DEVLHSNTGFFLNLSQH---YTLCENRIVKGAPDIRLPDYYDWRDTNKVTPIKDQGVCGS 179
Query: 326 CWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAY-EWIKRHGL 502
CW+F +G +E + + L+ LS+Q L+DC + GC+GG A+ E + G+
Sbjct: 180 CWAFVAIGNIESQYAIRH-NKLIDLSEQQLLDCD--EVDLGCNGGLMHLAFQELLLMGGV 236
Query: 503 PTEEDYGGYLGQDGYCHVDN 562
TE DY Y G + C +DN
Sbjct: 237 ETEADY-PYQGSEQMCTLDN 255
>UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin S
preproprotein; n=2; Tribolium castaneum|Rep: PREDICTED:
similar to cathepsin S preproprotein - Tribolium
castaneum
Length = 525
Score = 95.5 bits (227), Expect = 8e-19
Identities = 65/188 (34%), Positives = 96/188 (51%), Gaps = 5/188 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYI-HSNNRANRG---FTMSVNHLADRTDDELAAL 169
K K++R+Y + E R IF ++ + I H N R +G + + +N L+D TD+E++
Sbjct: 229 KRKYERRYPNLEEENFRRAIFEKTFQEIKHHNERYRKGLETYYLRINDLSDYTDEEMSCC 288
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
+ PS LP + SR LP DWRL G VTPVK Q CG+CW+F +G
Sbjct: 289 -SEKAPKPSITILPNVSTSSRQN-----LPKMVDWRLRGVVTPVKHQGKCGTCWAFAIIG 342
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI-KRHGLPTEEDYGG 526
A E +H G ++ LS+Q L+DC + C G Y++I K G+ ++DY
Sbjct: 343 ATEAQYRIHRGSFVI-LSEQQLVDCVREV--SSCRGVYLHETYKYIVKSEGINYDQDY-R 398
Query: 527 YLGQDGYC 550
Y G C
Sbjct: 399 YQSAPGTC 406
Score = 74.5 bits (175), Expect = 2e-12
Identities = 48/121 (39%), Positives = 62/121 (51%), Gaps = 1/121 (0%)
Frame = +2
Query: 191 PSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALF 370
P+P + FP +R + LP DWRL G VTPVK Q CGSCW+F +GA E A +
Sbjct: 17 PNPSIVIFPNMSARPQS---DLPDMVDWRLQGVVTPVKRQGKCGSCWAFAILGATE-AHY 72
Query: 371 LHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYLGQDGY 547
G V LS+Q L+DC G C G YE+ I +G+ ++DY Y G
Sbjct: 73 RKQRGSFVILSEQQLVDCVREVGT--CKGVWLDEVYEYIINSNGINYDQDY-RYESAPGS 129
Query: 548 C 550
C
Sbjct: 130 C 130
>UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease
containing protein; n=1; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 336
Score = 95.5 bits (227), Expect = 8e-19
Identities = 61/195 (31%), Positives = 99/195 (50%), Gaps = 8/195 (4%)
Frame = +2
Query: 11 HQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRGFTMSVNHLADRTDDELAALRGRRYS 187
++R Y S E + R IF ++ I ++N + ++ N +D +E A+ + S
Sbjct: 39 NKRTYFSLEEQQFRQQIFFETHERIQNHNSNPEATYKLAHNQFSDMPQEEFASRVLMKSS 98
Query: 188 GPSP-HGLPFPYSKSRVEELS---VKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
P + + + S ++ + V+LP DWR +G ++ VKDQ CGSCW+F T G +
Sbjct: 99 QLIPRNAVQAQNNNSTTQQHTAQDVQLPASFDWRDYGILSDVKDQGQCGSCWAFSTTGIL 158
Query: 356 EGALFLHNGGHLVRLSQQALIDC---SWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGG 526
E F+ N + S+Q L+DC S GF + GC GG A +++ + G+ EE Y
Sbjct: 159 EALYFMEN-RQKISFSEQQLVDCATNSNGFNSYGCSGGWPEEALKYVAKFGILKEEQY-P 216
Query: 527 YLGQDGYCHVDNVTA 571
YL D C V + T+
Sbjct: 217 YLAVDSKCKVSSPTS 231
>UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium
falciparum|Rep: Falcipain 2 - Plasmodium falciparum
Length = 484
Score = 95.5 bits (227), Expect = 8e-19
Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 10/196 (5%)
Frame = +2
Query: 11 HQRQYASDLEHEKRLNIFRQSLRYIHS-NNRANRGFTMSVNHLADRTDDELA-ALRGRRY 184
+ +QY S E ++R +F Q+ ++ NN N + +N AD T E R
Sbjct: 172 NNKQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKELNRFADLTYHEFKNKYLSLRS 231
Query: 185 SGPSPHGLPFPYSKSRVEELSVKLPPE-------HDWRLFGAVTPVKDQSVCGSCWSFGT 343
S P + + + EE+ K E +DWRL VTPVKDQ CGSCW+F +
Sbjct: 232 SKPLKNS-KYLLDQMNYEEVIKKYRGEENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSS 290
Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYE-WIKRHGLPTEEDY 520
+G+VE + L+ LS+Q L+DCS F N GC+GG A+E I+ G+ + DY
Sbjct: 291 IGSVESQYAIRK-NKLITLSEQELVDCS--FKNYGCNGGLINNAFEDMIELGGICPDGDY 347
Query: 521 GGYLGQDGYCHVDNVT 568
C++D T
Sbjct: 348 PYVSDAPNLCNIDRCT 363
>UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, whole
genome shotgun sequence; n=7; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_22,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 350
Score = 95.5 bits (227), Expect = 8e-19
Identities = 63/169 (37%), Positives = 85/169 (50%), Gaps = 1/169 (0%)
Frame = +2
Query: 17 RQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPS 196
+QY+ E RL ++ +L I + N+ D TD+E AA P
Sbjct: 71 KQYSGS-ELLYRLQVYEANLADIKARNQKLGREIFGETQFTDLTDEEFAATYLTLKVNPD 129
Query: 197 PHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLH 376
+P K++ E +V P DWR GAV VKDQ CGSCW+F T G +EG +
Sbjct: 130 DLEVP----KAQFE--NVNATPI-DWRTRGAVNKVKDQGQCGSCWAFSTTGVLEG-FYKV 181
Query: 377 NGGHLVRLSQQALIDCSWGFG-NNGCDGGEDFRAYEWIKRHGLPTEEDY 520
G L LS+Q L+DCS N GCDGG RA ++KR+GL T++ Y
Sbjct: 182 QTGELPDLSEQQLVDCSTLIDFNQGCDGGMPSRALNYVKRNGLTTQDAY 230
>UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin
L-like protease; n=1; Nasonia vitripennis|Rep:
PREDICTED: similar to cathepsin L-like protease -
Nasonia vitripennis
Length = 353
Score = 95.1 bits (226), Expect = 1e-18
Identities = 60/190 (31%), Positives = 96/190 (50%), Gaps = 7/190 (3%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANR----GFTMSVNHLADRTDDELAAL 169
K+++++ Y D+E R ++F ++ R I +N+ + + + +N D +E
Sbjct: 44 KLRYKKNYNGDVEENFRRSVFHENQRKIAEHNQKHDLGLFTYKVRINQFGDMMFEEYKNY 103
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEH-DWRLFGAVTPVKDQSV-CGSCWSFGT 343
+ P ++ S + PEH DWR GAVTPV+DQ + CGSCW+F
Sbjct: 104 M-HAANNTITQLKRIPRGDEFIKPKSAENVPEHVDWRQRGAVTPVRDQGLTCGSCWAFSA 162
Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDY 520
GA+E A + G L LS Q LIDC+ +GN GC GG ++++ + + GL E +Y
Sbjct: 163 AGALE-AQYFKKTGVLTALSAQNLIDCTMEYGNLGCGGGSAALSFQFVVDQKGLEPEANY 221
Query: 521 GGYLGQDGYC 550
Y G+ C
Sbjct: 222 -SYEGRTKEC 230
>UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease
containing protein; n=1; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 344
Score = 95.1 bits (226), Expect = 1e-18
Identities = 58/200 (29%), Positives = 94/200 (47%), Gaps = 11/200 (5%)
Frame = +2
Query: 2 KVKHQRQYA-SDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDEL------ 160
K H +Y S +E ++ + I N+ + +T+ NHL+D T +E
Sbjct: 42 KKTHNVKYEDSSIEAYRKAIFLDNHNKIIEHNSDPSHSYTLGHNHLSDMTHEEFSLYQLN 101
Query: 161 -AALRGRRYSGPSPHGLPFPYSKSRVEE-LSVKLPPEHDWRLFGAVTPVKDQSVCGSCWS 334
A + G + G S V+ ++ K P DWR A+TPVK Q CGSCW+
Sbjct: 102 PARTASKSSKGGNNSGNSSGSSNPYVDPPITTKNAPPMDWRNASAITPVKQQGKCGSCWT 161
Query: 335 FGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFG--NNGCDGGEDFRAYEWIKRHGLPT 508
F + +E F+ NG L S+Q ++DC +G G +NGC+GG A + ++G+
Sbjct: 162 FASTAVLESFSFIKNGAPLTNFSEQQILDCVYGSGYYSNGCNGGFGSEALNYAIQNGIAP 221
Query: 509 EEDYGGYLGQDGYCHVDNVT 568
Y Y+G+ C ++ +
Sbjct: 222 LSQY-PYVGKQQGCKYNSTS 240
>UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163
precursor; n=4; Schizophora|Rep: Putative cysteine
proteinase CG12163 precursor - Drosophila melanogaster
(Fruit fly)
Length = 614
Score = 95.1 bits (226), Expect = 1e-18
Identities = 67/188 (35%), Positives = 90/188 (47%), Gaps = 4/188 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTM-SVNHLADRTDDELAALRG- 175
+V+ R+Y S E + RL IFRQ+L+ I N G + AD T E G
Sbjct: 312 QVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSEYKERTGL 371
Query: 176 -RRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGA 352
+R + G S + V +LP E DWR AVT VK+Q CGSCW+F G
Sbjct: 372 WQRDEAKATGG-----SAAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCGSCWAFSVTGN 426
Query: 353 VEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKR-HGLPTEEDYGGY 529
+EG L+ G L S+Q L+DC ++ C+GG AY+ IK GL E +Y Y
Sbjct: 427 IEG-LYAVKTGELKEFSEQELLDCD--TTDSACNGGLMDNAYKAIKDIGGLEYEAEY-PY 482
Query: 530 LGQDGYCH 553
+ CH
Sbjct: 483 KAKKNQCH 490
>UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep:
Cysteine protease - Clonorchis sinensis
Length = 328
Score = 94.7 bits (225), Expect = 1e-18
Identities = 65/189 (34%), Positives = 91/189 (48%), Gaps = 3/189 (1%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTM-SVNHLADRTDDELAALRGR 178
K+K+++ Y++D + E R IF+ +L +G V +D T +E R
Sbjct: 36 KLKYKKTYSND-DDELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYLR 94
Query: 179 -RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
R+ GP P P ++ + DWR GAV PV DQ CGSCW+F +G V
Sbjct: 95 MRFDGPIVSEDPSPEEDVTMDN------EKFDWREHGAVGPVLDQGKCGSCWAFSVIGNV 148
Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAY-EWIKRHGLPTEEDYGGYL 532
EG F G L+ LS+Q L+DC GC+GG + Y E K GL DY Y
Sbjct: 149 EGQWF-RKTGDLLALSEQQLVDCD--HLEKGCNGGYPPKTYGEIEKMGGLELASDY-PYT 204
Query: 533 GQDGYCHVD 559
G DG C+++
Sbjct: 205 GVDGICYMN 213
>UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease
containing protein; n=2; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 332
Score = 94.3 bits (224), Expect = 2e-18
Identities = 56/182 (30%), Positives = 92/182 (50%), Gaps = 2/182 (1%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRGFTMSVNHLADRTDDELAALRGR 178
K ++ ++ E + RL +F ++ + I +N ++ GF +N + T +E A
Sbjct: 43 KNRYNLEFNDIQEEQYRLFVFHENFKQIELDNMNSDNGFISGINKFSHLTKEEFKAKYLN 102
Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
R P+ S+ ++ KLP DWR GAV+PV+DQ CGSC++F + GA+E
Sbjct: 103 RPQRPASEMKTNSILSSQ-QKTDEKLPESVDWRKLGAVSPVRDQGNCGSCYAFASTGALE 161
Query: 359 GALFLHNGGHLVRLSQQALIDCS-WGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLG 535
G L+ G L S Q ++DC+ F GC GG + ++K +G+ E Y Y G
Sbjct: 162 G-LYQIKTGKLEVFSPQYIVDCAKHQFSRGGCHGGYSSGVFTFVKENGMNLESRY-PYKG 219
Query: 536 QD 541
++
Sbjct: 220 EE 221
>UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27;
Eukaryota|Rep: Cysteine proteinase GP-I - Zingiber
officinale (Ginger)
Length = 221
Score = 94.3 bits (224), Expect = 2e-18
Identities = 48/110 (43%), Positives = 65/110 (59%)
Frame = +2
Query: 254 LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWG 433
LP DWR GAV PVK+Q CGSCW+F + AVEG + G L+ LS+Q L+DCS
Sbjct: 3 LPDSIDWREKGAVVPVKNQGGCGSCWAFDAIAAVEGINQIVT-GDLISLSEQQLVDCS-- 59
Query: 434 FGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYCHVDNVTAVTSI 583
N+GC+GG +RA+++I +G E++ Y G +G C V SI
Sbjct: 60 TRNHGCEGGWPYRAFQYIINNGGINSEEHYPYTGTNGTCDTKENAHVVSI 109
>UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:
Viral cathepsin - Xestia c-nigrum granulosis virus
(XnGV) (Xestia c-nigrumgranulovirus)
Length = 346
Score = 94.3 bits (224), Expect = 2e-18
Identities = 67/186 (36%), Positives = 90/186 (48%), Gaps = 4/186 (2%)
Frame = +2
Query: 5 VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAA-LRGRR 181
VK+ + Y D E E R IF+Q+L I++ N +N AD + +EL L G +
Sbjct: 48 VKYNKVYKDDQEKEARFEIFKQNLADINARNALEDSAMFEINSRADISSNELLQKLTGLK 107
Query: 182 YS---GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGA 352
S G + P S + S K+P DWR +VT VK Q CGSCW+F V
Sbjct: 108 LSLMRGEKKNSFCTPTVISG--DSSGKVPDSFDWRDRNSVTSVKMQKECGSCWAFSAVAN 165
Query: 353 VEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYL 532
+E + + L LS+Q L+DC NNGC+GG A+E I R G + E Y
Sbjct: 166 IESLYHIKHNVSL-DLSEQQLVDCD--KVNNGCNGGLMSWAFEGIIRAGGISYEAPYPYT 222
Query: 533 GQDGYC 550
G DG C
Sbjct: 223 GVDGVC 228
>UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep:
LOC443661 protein - Xenopus laevis (African clawed frog)
Length = 346
Score = 93.9 bits (223), Expect = 3e-18
Identities = 60/185 (32%), Positives = 96/185 (51%), Gaps = 5/185 (2%)
Frame = +2
Query: 11 HQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRG---FTMSVNHLADRTDDEL-AALRG 175
HQ+ Y E R I+ ++L++I +N + G + + +NHL D T +E+ A + G
Sbjct: 58 HQKIYKDAEEERARRTIWEETLKFITVHNLEYSLGLHTYEVGMNHLGDMTGEEVEATMTG 117
Query: 176 RRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
S S + ++ + L + P DWR G VT V+ Q CGSC++F VGA+
Sbjct: 118 YTSSDDSLANM----TRVPKKLLEAQPPASIDWRTKGCVTSVRRQRKCGSCYAFSAVGAL 173
Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLG 535
E + G LV S Q L+DCS+ GN GC GG ++ ++K+ G+ + +Y Y G
Sbjct: 174 E-CQWKKKKGTLVTFSPQELVDCSYSEGNKGCKGGSIRSSFTYMKKSGVMEDFNY-PYTG 231
Query: 536 QDGYC 550
++ C
Sbjct: 232 KEEKC 236
>UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46;
Eukaryota|Rep: Cysteine proteinase 1 precursor - Zea
mays (Maize)
Length = 371
Score = 93.9 bits (223), Expect = 3e-18
Identities = 70/202 (34%), Positives = 95/202 (47%), Gaps = 13/202 (6%)
Frame = +2
Query: 17 RQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSG-- 190
+ Y EH RL++F+ +LR + + V +D T E R Y G
Sbjct: 57 KSYKDADEHAYRLSVFKDNLRRARRHQLLDPSAEHGVTKFSDLTPAEFR----RTYLGLR 112
Query: 191 PSPHGLPFPYSKSRVEELSVK---LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
S L +S E + LP + DWR GAV PVK+Q CGSCWSF GA+EG
Sbjct: 113 KSRRALLRELGESAHEAPVLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEG 172
Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFG-------NNGCDGGEDFRAYEWI-KRHGLPTEED 517
A +L G L LS+Q +DC ++GC+GG A+ ++ K GL +E+D
Sbjct: 173 AHYLAT-GKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLMTTAFSYLQKAGGLESEKD 231
Query: 518 YGGYLGQDGYCHVDNVTAVTSI 583
Y Y G DG C D V S+
Sbjct: 232 Y-PYTGSDGKCKFDKSKIVASV 252
>UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|Rep:
Cathepsin L precursor - Schistosoma mansoni (Blood
fluke)
Length = 319
Score = 93.9 bits (223), Expect = 3e-18
Identities = 70/188 (37%), Positives = 99/188 (52%), Gaps = 3/188 (1%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTM-SVNHLADRTDDELAALR-G 175
K+K+++QY + E E R NIF+ ++ RG + V +D T DE A
Sbjct: 24 KLKYRKQY-HETEDEIRFNIFKSNILKAQLYQVFVRGSAIYGVTPYSDLTTDEFARTHLT 82
Query: 176 RRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
+ PS P S + E++ +P DWR GAVT VK+Q +CGSCW+F T G V
Sbjct: 83 ASWVVPSSRSNT-PTSLGK--EVN-NIPKNFDWREKGAVTEVKNQGMCGSCWAFSTTGNV 138
Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYE-WIKRHGLPTEEDYGGYL 532
E F G L+ LS+Q L+DC G ++GC+GG AYE IK GL E++Y Y
Sbjct: 139 ESQWF-RKTGKLLSLSEQQLVDCD-GL-DDGCNGGLPSNAYESIIKMGGLMLEDNY-PYD 194
Query: 533 GQDGYCHV 556
++ CH+
Sbjct: 195 AKNEKCHL 202
>UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia
theta|Rep: Cathepsin H precursor - Guillardia theta
(Cryptomonas phi)
Length = 353
Score = 93.5 bits (222), Expect = 3e-18
Identities = 65/192 (33%), Positives = 103/192 (53%), Gaps = 9/192 (4%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHS-NNRANRGFTMSVNHLADRTDDEL--AALRGR 178
KH + Y D + +RL F SL+ + + N+R + ++N +D T +E A L
Sbjct: 39 KHSKVYEDDTTYLRRLASFCVSLKEVEAINSRPGTTWRAALNQYSDLTWEEFKHAKLMAE 98
Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRL-----FGAVTPVKDQSVCGSCWSFGT 343
+ G + + P K + ++ + + E DWR V+ VK+Q CGSCW+F T
Sbjct: 99 QNCGAT---VTTPVEK--LVKMGI-VADEFDWRNQTCGETSCVSMVKNQGTCGSCWTFST 152
Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDY 520
A+E +L G +V LS+Q L+DC+ F NNGC+GG +A+E+I + GL E+Y
Sbjct: 153 AAALE-SLHAIKTGEMVLLSEQQLVDCAADFKNNGCNGGLPSQAFEYIMYNGGLSKMEEY 211
Query: 521 GGYLGQDGYCHV 556
Y+ DG+C+V
Sbjct: 212 -PYVCGDGHCNV 222
>UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, whole
genome shotgun sequence; n=2; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_36,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 307
Score = 93.5 bits (222), Expect = 3e-18
Identities = 62/184 (33%), Positives = 92/184 (50%), Gaps = 1/184 (0%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRGFTMSVNHLADRTDDELAALRGRRY 184
K+ ++ + E R IF Q++ I+ +N N+ ++M+VN AD TD+E ++ Y
Sbjct: 34 KNFNKFYTSNEETYRQVIFNQNVELINKHNSNPNKSYSMAVNQFADLTDEEFQSM----Y 89
Query: 185 SGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
G P +E + DW + P+K+Q CGSCW+F +GAVEG
Sbjct: 90 LGK-----PTYVKIDNIELSKGNTLGDADWA--SKMNPIKNQGNCGSCWTFSAIGAVEGF 142
Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDG 544
L + G V LS+Q L+DC+ G GC+GG A ++I G E DY Y +DG
Sbjct: 143 LAIRKGFKGV-LSEQQLVDCAVDAG-EGCNGGNSDLALDYIAEVGSVYERDY-EYTAKDG 199
Query: 545 YCHV 556
C V
Sbjct: 200 VCKV 203
>UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|Rep:
CG4847-PD, isoform D - Drosophila melanogaster (Fruit
fly)
Length = 420
Score = 93.1 bits (221), Expect = 4e-18
Identities = 62/155 (40%), Positives = 78/155 (50%), Gaps = 6/155 (3%)
Frame = +2
Query: 113 FTMSVNHLADRTDDE-LAALRGRRYSGPSPHGLPFPYSKSRVEELSVK-LPPEHDWRLFG 286
F +VN AD T E L+ L G + S P + ++ L K +P DWR G
Sbjct: 157 FKQAVNAFADLTHSEFLSQLTGLKRS---PEAKARAAASLKLVNLPAKPIPDAFDWREHG 213
Query: 287 AVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCS--WGFGNNGCDGG 460
VTPVK Q CGSCW+F T GA+EG F G L LS+Q L+DC FG NGCDGG
Sbjct: 214 GVTPVKFQGTCGSCWAFATTGAIEGHTF-RKTGSLPNLSEQNLVDCGPVEDFGLNGCDGG 272
Query: 461 EDFRAYEWIK--RHGLPTEEDYGGYLGQDGYCHVD 559
A+ +I + G+ E Y Y+ G C D
Sbjct: 273 FQEAAFCFIDEVQKGVSQEGAY-PYIDNKGTCKYD 306
>UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel
protein - Danio rerio (Zebrafish) (Brachydanio rerio)
Length = 328
Score = 92.7 bits (220), Expect = 6e-18
Identities = 62/189 (32%), Positives = 96/189 (50%), Gaps = 6/189 (3%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRAN----RGFTMSVNHLADRTDDELAAL 169
K +H + Y + E R ++++Q+L+ I +N A +T+ +N L+D T DE+ +
Sbjct: 31 KSQHNKTYRNTREERLRRSVWKQNLQDILLHNEAAAVGLHSYTLGLNQLSDMTADEVNDM 90
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVK-LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
G FP + S++ LP +W G V+PV++Q CGSCW+F V
Sbjct: 91 NGLLEED-------FPDVNATFSPPSLQTLPQRVNWTEHGMVSPVQNQGPCGSCWAFSAV 143
Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYG 523
G++E A LV LS Q L+DCS GN GC GG RA+ + I+ G+ + Y
Sbjct: 144 GSLE-AQMKRRTAALVPLSAQNLLDCSVSLGNRGCKGGFLSRAFLYVIQNRGIDSSTFY- 201
Query: 524 GYLGQDGYC 550
Y ++G C
Sbjct: 202 PYEHKEGVC 210
>UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1;
Naegleria fowleri|Rep: Cysteine proteinase homolog -
Naegleria fowleri
Length = 347
Score = 92.7 bits (220), Expect = 6e-18
Identities = 67/204 (32%), Positives = 95/204 (46%), Gaps = 13/204 (6%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
K+ + Y ++ EH R IF+ ++ N + + +D T +E + +
Sbjct: 39 KYAKVYGTE-EHNNRYQIFKANVEKSRYYNHVGKRENFGITKFSDLTPEEFKRMFLMKTY 97
Query: 188 GPSPHG--LPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
P L P E+ P DWR GAVT VK+Q CGSCW+F T G VEG
Sbjct: 98 TPEEAKKILAAPQHAVLSEKEVQTAPTSFDWRQHGAVTRVKNQGACGSCWTFSTTGNVEG 157
Query: 362 ALFLHNGGHLVRLSQQALIDCSWG---FGN-----NGCDGGEDFRAYEW-IKRHGLPTEE 514
+ G LV LS+Q L+DC + N +GC+GG + A+++ IK GL TE+
Sbjct: 158 QWAIKK-GKLVSLSEQQLVDCDHNCVTYQNQQACDSGCNGGLMWSAFQYVIKNGGLDTED 216
Query: 515 DYGGYLGQDGYCHVD--NVTAVTS 580
Y Y G D C + NV A S
Sbjct: 217 SY-PYEGVDDTCRFNKSNVAATIS 239
>UniRef50_Q248G1 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 334
Score = 92.7 bits (220), Expect = 6e-18
Identities = 59/182 (32%), Positives = 91/182 (50%), Gaps = 4/182 (2%)
Frame = +2
Query: 17 RQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPS 196
R Y S+ E R IF ++ R + S+N N FT S+N AD TD+E + R +
Sbjct: 45 RVYNSEEEQFFRQLIFVENKRQVDSHNSQNPTFTQSLNQFADFTDEEF---KYRVLNTKV 101
Query: 197 PHGLPFPYSKSRVEELSVKLPPEHDWR-LFGAVTPVKDQSVCGSCWSFGTVGAVEGALFL 373
P + L ++P DWR + V P+K+Q CGSCW+F G VE L
Sbjct: 102 SQTRPKKGRRLESRVLDQQIPESVDWRNVTNVVGPIKNQGHCGSCWTFSIAGIVESHYVL 161
Query: 374 HNGGHLVRLSQQALIDC---SWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDG 544
+G + V ++Q ++DC S G+ ++GC+GG A +++ +G+ E Y Y+ G
Sbjct: 162 KHGSY-VSYAEQEILDCVSVSAGYQSDGCNGGWPEEALQYVIEYGIVKSEVY-PYVAVQG 219
Query: 545 YC 550
C
Sbjct: 220 KC 221
>UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheirus
salmonis|Rep: Putative cathepsin L - Lepeophtheirus
salmonis (salmon louse)
Length = 257
Score = 92.7 bits (220), Expect = 6e-18
Identities = 49/114 (42%), Positives = 64/114 (56%), Gaps = 1/114 (0%)
Frame = +2
Query: 245 SVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDC 424
S +P +W GAVT VKDQ CGSCW+F T G+VEG F+ N L+ S+Q L+DC
Sbjct: 35 SAPVPSYVNWTKNGAVTAVKDQKDCGSCWAFSTTGSVEGQYFIKN-KKLLSFSEQQLVDC 93
Query: 425 SWGFGNNGCDGGEDFRAYEW-IKRHGLPTEEDYGGYLGQDGYCHVDNVTAVTSI 583
S F N GC+GG A+++ I G+ TE+ Y Y DG C + A I
Sbjct: 94 SSDFRNEGCNGGWMDNAFKYLIANKGIATEDTY-PYTATDGVCVYNKTMAAGRI 146
>UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia
medanensis|Rep: Sui m 1 allergen - Suidasia medanensis
Length = 336
Score = 92.7 bits (220), Expect = 6e-18
Identities = 65/198 (32%), Positives = 99/198 (50%), Gaps = 10/198 (5%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
K + +QY ++ E ++R IF ++LR+I N+ G + VN AD T +E +++
Sbjct: 32 KELYGKQYTAEEEPQRRA-IFEENLRWIQENH-GKHGAGLEVNEHADLTAEEFSSM---- 85
Query: 182 YSGPSPHG-LPFPYSKSRVE----ELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
Y+ + L P K V+ ++SV LP DWR T V++Q CGSCW+F T
Sbjct: 86 YATLNQEAFLKSPLHKEFVQVPESDISVALPAAFDWRQQWN-TAVRNQGQCGSCWAFATA 144
Query: 347 GAVEGALFLHNGGHLVRLSQQALIDC-----SWGFGNNGCDGGEDFRAYEWIKRHGLPTE 511
VE + H V LS+Q L+DC + ++GC GG AY ++++ GL E
Sbjct: 145 ATVEAQYAIRKNVH-VTLSEQQLVDCDHRPFQGQYEDHGCQGGNPIIAYAYVQQTGLVEE 203
Query: 512 EDYGGYLGQDGYCHVDNV 565
Y Y +DG C V
Sbjct: 204 SAY-PYQARDGQCQSSTV 220
>UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13;
Plasmodium|Rep: Cysteine protease falcipain-3 -
Plasmodium falciparum
Length = 492
Score = 92.3 bits (219), Expect = 8e-18
Identities = 64/187 (34%), Positives = 98/187 (52%), Gaps = 16/187 (8%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNR-ANRGFTMSVNHLADRTDDELAALRGRRY 184
++ ++Y + E +KR IF ++ R I +N+ N + +N D + +E + +Y
Sbjct: 177 ENNKKYETSEEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKFGDLSPEEFRS----KY 232
Query: 185 SGPSPHGLPF-----PYS-KSRVEELSVKLPPE--------HDWRLFGAVTPVKDQSVCG 322
HG PF P S ++ E++ K P +DWRL G VTPVKDQ++CG
Sbjct: 233 LNLKTHG-PFKTLSPPVSYEANYEDVIKKYKPADAKLDRIAYDWRLHGGVTPVKDQALCG 291
Query: 323 SCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAY-EWIKRHG 499
SCW+F +VG+VE + L S+Q L+DCS NNGC GG A+ + I G
Sbjct: 292 SCWAFSSVGSVESQYAIRKKA-LFLFSEQELVDCS--VKNNGCYGGYITNAFDDMIDLGG 348
Query: 500 LPTEEDY 520
L +++DY
Sbjct: 349 LCSQDDY 355
>UniRef50_Q23TW3 Cluster: Papain family cysteine protease containing
protein; n=5; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 437
Score = 92.3 bits (219), Expect = 8e-18
Identities = 57/184 (30%), Positives = 90/184 (48%), Gaps = 3/184 (1%)
Frame = +2
Query: 41 HEKRLNIFRQSL-RYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFP 217
+ +R +F+ L + I N+ ++ ++ +N L +TD EL R + +
Sbjct: 137 NSERFQLFKSRLAKIIEHNSNPDKKYSQIINKLTFQTDLELKKFRASQNCSATAQANTRS 196
Query: 218 YSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSV-CGSCWSFGTVGAVEGALFLHNGGHLV 394
+ K +LS +LP DWR G VT VK Q CGSCW+F V A+E L G +
Sbjct: 197 FRKY---DLS-QLPQYVDWREKGVVTQVKSQGKDCGSCWAFAAVAALESHYALKTGKKPI 252
Query: 395 RLSQQALIDCSWGFGNNGCDGGEDFRAYEWIK-RHGLPTEEDYGGYLGQDGYCHVDNVTA 571
+ S+Q L+DC+ F GC GG + +E++ G+ E DY Y G+D C ++
Sbjct: 253 QFSEQQLVDCARKFDTKGCSGGLPSKGFEYLAYAGGIQNEADY-PYEGEDKNCRFNSSKT 311
Query: 572 VTSI 583
V +
Sbjct: 312 VVQV 315
>UniRef50_Q235G6 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 325
Score = 92.3 bits (219), Expect = 8e-18
Identities = 70/193 (36%), Positives = 95/193 (49%), Gaps = 8/193 (4%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEK--RLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRG 175
K + ++YA + E+ R+N+F +L + + TM V D T E A L
Sbjct: 44 KQTYNKKYADQDDDEEVYRMNVFFDNLEFTKKDP------TMGVTKFMDLTHTEFAEL-- 95
Query: 176 RRYSGPSPHGLPFPYSKSRVEELSVKLPPEH------DWRLFGAVTPVKDQSVCGSCWSF 337
Y P+ ++ EE+ P +H DW GAVTPVK+Q CG CWSF
Sbjct: 96 --YLNPA---------ENIDEEIDSLQPIQHNEDIVIDWVEKGAVTPVKNQGGCGGCWSF 144
Query: 338 GTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEED 517
T G VEGA F++ L LSQQ LIDC+ N GC GG A ++K GL TEE+
Sbjct: 145 ATTGGVEGANFVYK-NVLPNLSQQQLIDCN--TQNKGCGGGLRDIALNYVKETGLTTEEE 201
Query: 518 YGGYLGQDGYCHV 556
Y Y ++G C +
Sbjct: 202 Y-SYEAKNGKCRL 213
>UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep:
MGC107932 protein - Xenopus tropicalis (Western clawed
frog) (Silurana tropicalis)
Length = 333
Score = 91.9 bits (218), Expect = 1e-17
Identities = 60/196 (30%), Positives = 99/196 (50%), Gaps = 5/196 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNR-ANRG---FTMSVNHLADRTDDELAAL 169
K K++++Y + + R + + + +N+ A++G + M++N AD TD+E ++
Sbjct: 31 KSKYEKKYVTLDKELNRRKAWEATWEKVQKHNQLADQGLKSYRMAMNQFADLTDNERSS- 89
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSV-CGSCWSFGTV 346
+ P L P S+ +P E DWR VTPVK+Q CGSCW+F TV
Sbjct: 90 --KSCLLPREKSLN-PVKAESYSYTSITIPKEVDWRKSNCVTPVKNQGTFCGSCWAFATV 146
Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGG 526
G +E + L+ LS+Q L+DC N GC GG +A E++ +HG+ ++Y
Sbjct: 147 GVMESRYCIRT-KELLNLSEQQLVDCD--EINEGCCGGFPIKALEYVAQHGVMRNKEY-E 202
Query: 527 YLGQDGYCHVDNVTAV 574
Y + C D+ A+
Sbjct: 203 YSQKKATCEYDSDKAI 218
>UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lamblia
ATCC 50803|Rep: GLP_26_47548_45815 - Giardia lamblia
ATCC 50803
Length = 577
Score = 91.9 bits (218), Expect = 1e-17
Identities = 46/109 (42%), Positives = 66/109 (60%), Gaps = 10/109 (9%)
Frame = +2
Query: 254 LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGAL----FLHNG-GH--LVRLSQQA 412
LP E DWR+ G + KDQ CGSCW+FG +G +EG + + G H L S+Q+
Sbjct: 344 LPQELDWRVRGIMNMAKDQVACGSCWTFGAIGTIEGRINKLRVVEEGLRHEPLKAYSEQS 403
Query: 413 LIDCSWGFGNNGCDGGEDFRAYEW-IKRHG--LPTEEDYGGYLGQDGYC 550
++DC WGFG+ GCDGG+ A +W ++ +G + E +Y YLGQ+ C
Sbjct: 404 IVDCYWGFGSFGCDGGDTLAALKWLVENNGGRVAFESEY-PYLGQNDLC 451
>UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, whole
genome shotgun sequence; n=1; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_79,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 324
Score = 91.9 bits (218), Expect = 1e-17
Identities = 60/187 (32%), Positives = 95/187 (50%), Gaps = 4/187 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRG---FTMSVNHLADRTDDELAALR 172
K+++ ++++S+ E R +F+Q+ + I ++N G +TM N AD T+ E A
Sbjct: 40 KIQYNKKFSSEKEEMYRYLVFQQNAQLIEAHNNDKSGKYTYTMETNQFADLTEQEFA--- 96
Query: 173 GRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQ-SVCGSCWSFGTVG 349
++Y P +KS+ + V DW G V P+KDQ S CGS W+F VG
Sbjct: 97 -QKYLTFRPKST----NKSKSTDY-VPNGQARDWVEEGKVPPIKDQGSSCGSSWAFSAVG 150
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGY 529
+E + G LS+Q ++DCS +GN GC GG +E+++ HG+ Y Y
Sbjct: 151 VLEINSNIEFGLETT-LSEQDMLDCSGPYGNQGCSGGWMDSGFEYVRDHGIANGSVY-PY 208
Query: 530 LGQDGYC 550
+G D C
Sbjct: 209 VGSDQTC 215
>UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823
protein, partial; n=1; Ornithorhynchus anatinus|Rep:
PREDICTED: similar to MGC81823 protein, partial -
Ornithorhynchus anatinus
Length = 361
Score = 91.5 bits (217), Expect = 1e-17
Identities = 45/90 (50%), Positives = 60/90 (66%), Gaps = 2/90 (2%)
Frame = +2
Query: 257 PPEH-DWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWG 433
PPE DWR G VTPVKDQ CGSCW+FG+ G +EG LF G L +S+Q L+DCS
Sbjct: 190 PPEALDWRDHGYVTPVKDQGRCGSCWAFGSTGVLEGQLF-RRTGRLAAVSEQNLMDCSRK 248
Query: 434 FGNNGCDGGEDFRAYEWIKRH-GLPTEEDY 520
GN GCDGG +++ +++ + G+ +EE Y
Sbjct: 249 QGNRGCDGGLMQQSFLYVRDNGGVDSEEAY 278
>UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain;
n=9; Cucujiformia|Rep: Digestive cysteine proteinase
intestain - Leptinotarsa decemlineata (Colorado potato
beetle)
Length = 326
Score = 91.5 bits (217), Expect = 1e-17
Identities = 64/200 (32%), Positives = 94/200 (47%), Gaps = 6/200 (3%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDELA-A 166
K H + Y S LE R IF+ +LR I +N + + + V AD T DE
Sbjct: 27 KQTHGKTYKSLLEERTRFGIFQSNLRKIEEHNAKYDKGEESYFLGVTPFADLTHDEFKDE 86
Query: 167 LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
LR + + P+ + + +++P DW GAV VK Q CGSCW+F
Sbjct: 87 LRRQIKTKPNVEATLAVFPEG------LEVPDSIDWTQKGAVLDVKYQGGCGSCWAFSAT 140
Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCD-GGEDFRAYEWIKRHGLPTEEDYG 523
GA+EG + N + LS+Q L+DCS +GN+ C+ GG A++++ G+ + Y
Sbjct: 141 GALEGQNAIVNNVK-IPLSEQQLLDCSKPYGNDDCEHGGLMSFAFDYVLDKGIEADSSY- 198
Query: 524 GYLGQDGYCHVDNVTAVTSI 583
Y G D C D V I
Sbjct: 199 PYKGIDTPCQYDAKKTVLKI 218
>UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like
cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan
CA, family C1, cathepsin L-like cysteine peptidase -
Trichomonas vaginalis G3
Length = 306
Score = 91.5 bits (217), Expect = 1e-17
Identities = 59/181 (32%), Positives = 87/181 (48%), Gaps = 3/181 (1%)
Frame = +2
Query: 38 EHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFP 217
E+ RL I+ + RY+ NR N GFT+++N A T++E ++ G +Y S +P
Sbjct: 25 EYHFRLGIWLSNKRYVQEKNRVNLGFTLALNRFAHLTENEYRSMLGYKYGHKS-----YP 79
Query: 218 YSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVR 397
+K+ + +P E DWR G V +K+Q CGSCW+F + +E + N L
Sbjct: 80 ITKN----IKNDVPTEIDWREQGIVNKIKNQGACGSCWAFSAIQVIESQV-AKNQKQLYD 134
Query: 398 LSQQALIDCSWGFGNNGCDGGEDFRAYEWI---KRHGLPTEEDYGGYLGQDGYCHVDNVT 568
LS+Q L+DC GC GG A E++ + DY Y G C DN
Sbjct: 135 LSEQNLLDCVTSC--FGCGGGWSPGALEYVYEKQNSKFMLTTDY-PYTAVQGTCKYDNKK 191
Query: 569 A 571
A
Sbjct: 192 A 192
>UniRef50_Q231X3 Cluster: Papain family cysteine protease containing
protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 323
Score = 91.1 bits (216), Expect = 2e-17
Identities = 61/190 (32%), Positives = 93/190 (48%), Gaps = 3/190 (1%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
K KH ++Y + RL +F ++L + ++ G T + D TDDE A
Sbjct: 43 KQKHNKRYENTDYESYRLEVFAENLEVVKNDQTGTYGITKFL----DLTDDEFAG----- 93
Query: 182 YSGPSPHGLPFPYSKSRV-EELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
+ L Y + + E++ V +W G V+ VK Q CGSCW+F +VE
Sbjct: 94 ----NFLNLKAQYPEDSIAEDIEVDPKININWVEAGKVSNVKSQGNCGSCWAFSATASVE 149
Query: 359 GALFLHNG-GHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLG 535
AL + + LS+Q LIDCS +GN GC G+ +A +IKR+ + TE++Y Y
Sbjct: 150 SALIIAGKVDKSISLSEQQLIDCSGDYGNYGCAAGQKEQALVYIKRYSITTEQNY-PYTE 208
Query: 536 QD-GYCHVDN 562
+D C+ DN
Sbjct: 209 KDVQKCYFDN 218
>UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus
tropicalis|Rep: LOC594890 protein - Xenopus tropicalis
(Western clawed frog) (Silurana tropicalis)
Length = 355
Score = 90.6 bits (215), Expect = 2e-17
Identities = 60/185 (32%), Positives = 92/185 (49%), Gaps = 5/185 (2%)
Frame = +2
Query: 11 HQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRG---FTMSVNHLADRTDDELAALRGR 178
H++ Y ++ E R I+ +L++I +N + G + + +NHL D +E+ +
Sbjct: 59 HKKIYKNEGEELARRLIWEDTLKFIMLHNLEYSMGLHTYEVGMNHLGDMVAEEMTDKQMN 118
Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
+ P E+S PPE DWR VT VKDQ C + W+F ++GA+
Sbjct: 119 FIPQVIANITDVPV------EISKSSPPESIDWRNKNCVTSVKDQGSCIASWAFSSIGAL 172
Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLG 535
E G L LS Q L+DCS +GNNGC GG ++ +I +G+ E +Y Y G
Sbjct: 173 ECQNMKRRTGKLESLSVQNLLDCSQTYGNNGCKGGWVVSSFRYIIDNGIELESNY-PYQG 231
Query: 536 QDGYC 550
+DG C
Sbjct: 232 KDGKC 236
>UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain -
Tetrahymena pyriformis
Length = 330
Score = 90.6 bits (215), Expect = 2e-17
Identities = 60/181 (33%), Positives = 87/181 (48%), Gaps = 5/181 (2%)
Frame = +2
Query: 23 YASDLEHEKRLNIFRQSLRYIHSNNRANRGFTM--SVNHLADRTDDELAALRGRRYSGPS 196
Y + E RL++F ++L+ I +NN AN T VN D T++E AA R P
Sbjct: 47 YKNQGEESYRLSVFLENLKSIEANN-ANPLSTHVEEVNSFTDLTEEEFAA-RYLMKDLPQ 104
Query: 197 PHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLH 376
P +E ++ P DW + PVK+Q CGSCW+F T G +EG +H
Sbjct: 105 QMNKDLPI----LEMETLAAPQVIDWTAKNVLPPVKNQQQCGSCWAFSTAGMLEGVYNIH 160
Query: 377 NGGHL-VRLSQQALIDC--SWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGY 547
+ S+Q L+DC + GFG GC+G A + ++ G+ E Y Y +DG
Sbjct: 161 ESPQTPISFSEQQLVDCCGAQGFGCEGCNGAWPTDAVAYTQKFGIVQESQY-AYTAKDGS 219
Query: 548 C 550
C
Sbjct: 220 C 220
>UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2;
Caenorhabditis|Rep: Putative uncharacterized protein -
Caenorhabditis elegans
Length = 343
Score = 90.2 bits (214), Expect = 3e-17
Identities = 59/180 (32%), Positives = 93/180 (51%), Gaps = 5/180 (2%)
Frame = +2
Query: 5 VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRG-FTMSVNHLADRTDDELAALRGRR 181
VK+ R+Y ++ E KR IF ++L + N+ + G T +N +D T++E +
Sbjct: 56 VKYLREYPNEYEIVKRFTIFSRNLDLVERYNKEDAGKVTYELNDFSDLTEEEWK----KY 111
Query: 182 YSGPSP-HGLPFPYSKSRVEELSVKLPPEHDWRLFGA---VTPVKDQSVCGSCWSFGTVG 349
P P H K+ +++ + LP DWR VT +K Q CGSCW+F T
Sbjct: 112 LMTPKPDHSEKSLKPKTLIDKKN--LPNSVDWRNVNGTNHVTGIKYQGPCGSCWAFATAA 169
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGY 529
A+E A+ + +GG L LS Q L+DC+ ++ C GGE A ++ + HG+ T +Y Y
Sbjct: 170 AIESAVSI-SGGGLQSLSSQQLLDCT--VVSDKCGGGEPVEALKYAQSHGITTAHNYPYY 226
>UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30;
Nucleopolyhedrovirus|Rep: Viral cathepsin - Lymantria
dispar multicapsid nuclear polyhedrosis virus (LdMNPV)
Length = 356
Score = 90.2 bits (214), Expect = 3e-17
Identities = 64/188 (34%), Positives = 98/188 (52%), Gaps = 5/188 (2%)
Frame = +2
Query: 11 HQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRG--FTMSVNHLADRTDDELAALRGRR 181
+ + Y SD E KR +IF+ +L I++ N A G T +N +D + EL A +
Sbjct: 63 YNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKINKFSDLSKSELIA----K 118
Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
++G S + K+ + P H DWR VT +K+Q CG+CW+F T+ +VE
Sbjct: 119 FTGLSIPERVSNFCKTIILNQPPDKGPLHFDWREQNKVTSIKNQGACGACWAFATLASVE 178
Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKR-HGLPTEEDYGGYLG 535
+ F L+ LS+Q LIDC + GC+GG A+E I R G+ TE DY ++G
Sbjct: 179 -SQFAMRHNRLIDLSEQQLIDCD--SVDMGCNGGLLHTAFEEIMRMGGVQTELDY-PFVG 234
Query: 536 QDGYCHVD 559
++ C +D
Sbjct: 235 RNRRCGLD 242
>UniRef50_Q22LI1 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 987
Score = 89.8 bits (213), Expect = 4e-17
Identities = 61/199 (30%), Positives = 98/199 (49%), Gaps = 12/199 (6%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRGFTMSVNHLADRTDDELA------A 166
KH + + + + + RL+IF ++ + I +N ++ F + +N A T E A +
Sbjct: 37 KHNKVFDPE-QLKYRLSIFAENYKKIKEHNYNSSNTFQLGLNEYAHMTSQEFAEVFLTPS 95
Query: 167 LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
+ + P P P P+ + +V + P DWR GAVT VK Q CGSCWSF
Sbjct: 96 ISKSQQKQPKPKPQPQPHPNNSTNT-TVTITPI-DWRNKGAVTSVKRQGKCGSCWSFSAA 153
Query: 347 GAVEGALFLHNGGHLVRLSQQALIDC-----SWGFGNNGCDGGEDFRAYEWIKRHGLPTE 511
G +E + G+L+ LS+Q L+DC + +NGC+GG A E+ ++G+
Sbjct: 154 GLMEAFQYFKT-GNLIDLSEQQLVDCDNSSFDKSYYSNGCNGGYPQEAVEYASKYGIVPL 212
Query: 512 EDYGGYLGQDGYCHVDNVT 568
DY Y+ Q C + + T
Sbjct: 213 TDY-PYVKQQQPCAIKSPT 230
>UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lamblia
ATCC 50803|Rep: GLP_26_49243_47612 - Giardia lamblia
ATCC 50803
Length = 543
Score = 89.4 bits (212), Expect = 5e-17
Identities = 48/117 (41%), Positives = 65/117 (55%), Gaps = 8/117 (6%)
Frame = +2
Query: 227 SRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG---ALFLHNGGH--- 388
S + V+ P + DWR+ G +TPVKDQ+ CGSCWSFG G +EG AL G
Sbjct: 307 SEENQKRVQFPRQLDWRVRGVITPVKDQAACGSCWSFGAAGTIEGRLNALKWKRGERDTP 366
Query: 389 LVRLSQQALIDCSWGFGNNGCDGGEDFRAY-EWIKR-HGLPTEEDYGGYLGQDGYCH 553
L+R+S+Q++I C W NNGC+GG + A +I G E YLG + C+
Sbjct: 367 LLRVSEQSIISCVWNEDNNGCNGGLTYEALTAYINEFSGRIAYEMDSPYLGVESLCN 423
>UniRef50_Q22NW9 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein - Tetrahymena
thermophila SB210
Length = 894
Score = 89.4 bits (212), Expect = 5e-17
Identities = 58/164 (35%), Positives = 91/164 (55%), Gaps = 3/164 (1%)
Frame = +2
Query: 38 EHEKRLNIFRQSLRYIHSNNR-ANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPF 214
E+ RLNIF ++L+ I ++N+ +N+ + +N T++E + Y L
Sbjct: 617 EYMYRLNIFAKNLQNIKNHNQISNKPYIEGINQFTHLTEEEFE----QTYLT-----LQI 667
Query: 215 PYSKS-RVEE-LSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGH 388
P SK + +E L ++P DWR AVTPVK+Q CGS ++F T GA+EG + +G
Sbjct: 668 PASKQYKTQEFLGDEVPSSIDWRDLNAVTPVKNQGSCGSGYAFSTTGALEG-IHKISGKD 726
Query: 389 LVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
S+Q +IDCS GN+GC GG A++++ +G+ E DY
Sbjct: 727 WKGFSEQQIIDCSRKQGNSGCHGGFMENAFDFVIENGILQENDY 770
>UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole
genome shotgun sequence; n=2; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_21,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 349
Score = 89.0 bits (211), Expect = 7e-17
Identities = 59/187 (31%), Positives = 94/187 (50%), Gaps = 6/187 (3%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRG---FTMSVNHLADRTDDELAALRG 175
+H ++Y + E+ R IF+++ +YI + R G F + +N AD + +E A +
Sbjct: 46 EHGKRY-TQFENSHRFGIFKKNYQYIQEHQQRVEAGLETFELGLNDFADLSVEEFEA-KY 103
Query: 176 RRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
+Y P + ++P E D R G V+ VK+Q CGSCW+F V A+
Sbjct: 104 LKY-----RSTPREQTNQVYRRTGKQVPIEVDLRKDGVVSEVKNQGSCGSCWAFSAVAAL 158
Query: 356 EGALFLHNGGHLVRLSQQALIDCSW--GFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGY 529
E AL G V LS+Q L+DC+ F + GCDGGE + +++ ++G+ +Y Y
Sbjct: 159 ETAL-RQGGVKNVELSEQELVDCAVKDEFESEGCDGGEMYDGFQYASKYGIAIRSEY-PY 216
Query: 530 LGQDGYC 550
G D C
Sbjct: 217 AGVDQKC 223
>UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia
circumcincta|Rep: Secreted cathepsin F - Teladorsagia
circumcincta
Length = 364
Score = 88.6 bits (210), Expect = 9e-17
Identities = 59/180 (32%), Positives = 87/180 (48%), Gaps = 9/180 (5%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTM-SVNHLADRTDDELAALRGRRY 184
+H + Y ++ E KR IF+++L I S ++G + +N AD + +E
Sbjct: 70 RHDKVYRNESEALKRFGIFKRNLEIIRSAQENDKGTAIYGINQFADLSPEEFKKTH---- 125
Query: 185 SGPSPHGLPFPYSKSRVEELSVK-------LPPEHDWRLFGAVTPVKDQSVCGSCWSFGT 343
PH P +R+ +L+ + LP DWR GAVT VK + C +CW+F
Sbjct: 126 ---LPHTWKQPDHPNRIVDLAAEGVDPKEPLPESFDWREHGAVTKVKTEGHCAACWAFSV 182
Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAY-EWIKRHGLPTEEDY 520
G +EG FL LV LS Q L+DC + GC+GG AY E ++ GL E+ Y
Sbjct: 183 TGNIEGQWFLAK-KKLVSLSAQQLLDCD--VVDEGCNGGFPLDAYKEIVRMGGLEPEDKY 239
>UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba
histolytica|Rep: Cysteine protease 10 - Entamoeba
histolytica
Length = 297
Score = 88.2 bits (209), Expect = 1e-16
Identities = 58/197 (29%), Positives = 97/197 (49%), Gaps = 3/197 (1%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMS-VNHLADRTDDELAALRGR 178
K+K+ +Y+ E +R IF Q+ + I N+ N FT++ + T++E L R
Sbjct: 24 KIKYNTKYSGS-EALRRRAIFLQNSKLIQMINKQNLSFTVTNEGPFSVLTNEEYRMLHHR 82
Query: 179 RYSGPSPHGLPFPYSKSRVEELSVK-LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
L + + V+++ K + DWR G VTPVK+Q C SC++FG++ +
Sbjct: 83 IDIEKEIKQLK-SHRMNLVKKMDNKEVLDSIDWRSEGKVTPVKNQRKCASCYAFGSIATI 141
Query: 356 EGALFLHNGGHLVRLSQQALIDCSWG-FGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYL 532
E + + LS+Q ++DCS G + N GC G ++ +++ HG+ E DY Y
Sbjct: 142 ESLIMQETSIKEIDLSEQQIVDCSQGEYSNWGCTCGNVGNSFNYVRDHGILLERDY-PYT 200
Query: 533 GQDGYCHVDNVTAVTSI 583
G+ C +D V I
Sbjct: 201 GKANNCSIDGKKPVIKI 217
>UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n=1;
Toxocara canis|Rep: Cathepsin L-like cysteine proteinase
- Toxocara canis (Canine roundworm)
Length = 360
Score = 88.2 bits (209), Expect = 1e-16
Identities = 43/90 (47%), Positives = 54/90 (60%)
Frame = +2
Query: 251 KLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSW 430
++P DWR + VTPVK Q CGSCW+F TVG VE A L G L LS+Q L+DC+
Sbjct: 144 EIPDHFDWRPYNVVTPVKSQFKCGSCWAFATVGTVESAYAL-GTGELRSLSEQQLLDCN- 201
Query: 431 GFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
NN CDGG+ +A ++ GL E DY
Sbjct: 202 -LENNACDGGDVDKALRYVYDEGLMREYDY 230
>UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:
Viral cathepsin - Cydia pomonella granulosis virus
(CpGV) (Cydia pomonellagranulovirus)
Length = 333
Score = 87.8 bits (208), Expect = 2e-16
Identities = 58/186 (31%), Positives = 87/186 (46%), Gaps = 4/186 (2%)
Frame = +2
Query: 5 VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRY 184
+K+ + Y SD E +L F+ +L+ I+ N A++ +N +D + L
Sbjct: 37 IKYNKTYVSDEERAIKLENFKNNLKMINEKNMASKYAVFDINEYSDLNKNALLRRTTGFR 96
Query: 185 SGPSPHGLPFPYSKSRV----EELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGA 352
G + F ++ V +E LP DWR VTPVK+Q CGSCW+F T+
Sbjct: 97 LGLKKNPSAFTMTECSVVVIKDEPQALLPETLDWRDKHGVTPVKNQMECGSCWAFSTIAN 156
Query: 353 VEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYL 532
+E +L+ + LS+Q L++C NNGC GG A E I + G + Y
Sbjct: 157 IE-SLYNIKYDKALNLSEQHLVNCD--NINNGCAGGLMHWALESILQEGGVVSAENEPYY 213
Query: 533 GQDGYC 550
G DG C
Sbjct: 214 GFDGVC 219
>UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8;
Magnoliophyta|Rep: Os12g0273800 protein - Oryza sativa
subsp. japonica (Rice)
Length = 504
Score = 87.4 bits (207), Expect = 2e-16
Identities = 59/183 (32%), Positives = 88/183 (48%), Gaps = 2/183 (1%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRG-FTMSVNHLADRTDDELAALRGRRY 184
+H R Y E +RL +F+ ++ +I S N + + + VN AD T +E A
Sbjct: 50 QHGRVYKDAAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSK 109
Query: 185 SGPSPHGLPFPYSKSRVEELSVK-LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
+P+ + + E +S LP DWR GAVT +KDQ C A+EG
Sbjct: 110 GFSTPNNGVRVSTGFKYENVSADALPASVDWRTKGAVTRIKDQGQC----------AMEG 159
Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQD 541
+ L G L+ LS+Q L+DC + GC+GGE A+++I +G T E Y +D
Sbjct: 160 FVKLSTG-KLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAED 218
Query: 542 GYC 550
G C
Sbjct: 219 GRC 221
>UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia
bovis|Rep: Cysteine protease 2 - Babesia bovis
Length = 445
Score = 87.4 bits (207), Expect = 2e-16
Identities = 45/94 (47%), Positives = 56/94 (59%)
Frame = +2
Query: 269 DWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNG 448
DWR AVTPVKDQ +CGSCW+F VG+VE L VRLS+Q L+ C GN G
Sbjct: 241 DWRRADAVTPVKDQGMCGSCWAFAAVGSVESLLKRQKTD--VRLSEQELVSCQ--LGNQG 296
Query: 449 CDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYC 550
C+GG A +IK +G+ E++ YL DG C
Sbjct: 297 CNGGYSDYALNYIKFNGIHRSEEW-PYLAADGKC 329
>UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1];
n=11; Eutheria|Rep: Testin-2 precursor [Contains:
Testin-1] - Mus musculus (Mouse)
Length = 333
Score = 87.4 bits (207), Expect = 2e-16
Identities = 62/199 (31%), Positives = 95/199 (47%), Gaps = 9/199 (4%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDELAAL 169
+ KH + Y + E +R ++ ++ + I +N FTM++N D T+ E +
Sbjct: 33 RTKHGKAYNVNEERLRRA-VWEKNFKMIELHNWEYLEGKHDFTMTMNAFGDLTNTEFVKM 91
Query: 170 RG--RRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGT 343
RR H + + +P DWR+ G VTPVK+Q C S W+F
Sbjct: 92 MTGFRRQKIKRMHVFQ--------DHQFLYVPKYVDWRMLGYVTPVKNQGYCASSWAFSA 143
Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDY 520
G++EG +F G LV LS+Q L+DC + C GG A++++K + GL TEE Y
Sbjct: 144 TGSLEGQMF-KKTGRLVPLSEQNLLDCMGSNVTHDCSGGFMQNAFQYVKDNGGLATEESY 202
Query: 521 GGYLGQDGYC--HVDNVTA 571
Y+G C H +N A
Sbjct: 203 -PYIGPGRKCRYHAENSAA 220
>UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome
shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome 21 SCAF14577, whole genome shotgun sequence -
Tetraodon nigroviridis (Green puffer)
Length = 406
Score = 87.0 bits (206), Expect = 3e-16
Identities = 41/88 (46%), Positives = 56/88 (63%)
Frame = +2
Query: 236 EELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQAL 415
E+L + PP DWR G V+PV++Q C SCW+F ++GA+EG + G LV LS Q L
Sbjct: 149 EKLGFETPPSVDWRKAGLVSPVQNQGFCNSCWAFSSLGALEGQM-KKRTGFLVPLSPQNL 207
Query: 416 IDCSWGFGNNGCDGGEDFRAYEWIKRHG 499
+DCS GN GC GG ++Y +I R+G
Sbjct: 208 LDCSISDGNLGCRGGYISKSYSYIIRNG 235
>UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum
aestivum|Rep: Cysteine protease - Triticum aestivum
(Wheat)
Length = 371
Score = 87.0 bits (206), Expect = 3e-16
Identities = 47/109 (43%), Positives = 60/109 (55%), Gaps = 1/109 (0%)
Frame = +2
Query: 257 PPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGF 436
P + DWR G VTP K Q CG CW+F VE +L NGG LV LS Q L+DCS G
Sbjct: 154 PRQFDWREHGVVTPAKQQGACGCCWAFAAAATVE-SLNKINGGELVDLSVQELVDCSTGV 212
Query: 437 GNNGCDGGEDFRAYEWIK-RHGLPTEEDYGGYLGQDGYCHVDNVTAVTS 580
++ C G A WIK + GL TE +Y Y+ + G C V + V++
Sbjct: 213 FSSPCGYGWPKSALAWIKSKGGLLTEAEY-PYMAKRGRCAVHDTARVSA 260
>UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3;
Dictyostelium discoideum|Rep: Cysteine proteinase 1
precursor - Dictyostelium discoideum (Slime mold)
Length = 343
Score = 87.0 bits (206), Expect = 3e-16
Identities = 65/186 (34%), Positives = 89/186 (47%), Gaps = 15/186 (8%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHS------NNRANRGFTMSVNHLADRTDDELAAL 169
K ++Y+ + E+ +R IF+ +L I N++A+ F VN AD + DE
Sbjct: 35 KFNKKYSHE-EYLERFEIFKSNLGKIEELNLIAINHKADTKF--GVNKFADLSSDEFKNY 91
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
LP + +E +P DWR GAVTPVK+Q CGSCWSF T G
Sbjct: 92 YLNNKEAIFTDDLPV--ADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTG 149
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSW--------GFGNNGCDGGEDFRAYEW-IKRHGL 502
VEG F+ + LV LS+Q L+DC + GC+GG AY + IK G+
Sbjct: 150 NVEGQHFI-SQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGI 208
Query: 503 PTEEDY 520
TE Y
Sbjct: 209 QTESSY 214
>UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11;
Trichomonadidae|Rep: Cysteine protease - Tritrichomonas
foetus (Trichomonas foetus)
Length = 315
Score = 86.6 bits (205), Expect = 4e-16
Identities = 58/185 (31%), Positives = 89/185 (48%), Gaps = 3/185 (1%)
Frame = +2
Query: 5 VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRY 184
+++ Q+ E++ R IF + RY+ +N + FT+S+N A T E + G +
Sbjct: 26 MRNTNQFYVGNEYQLRFGIFLSNARYVQEHNAGDSKFTVSLNKFAALTPSEYKVMLGYK- 84
Query: 185 SGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
+G + K V+ + DWR G V +KDQ+ CGSCW+F + A E A
Sbjct: 85 TGMKAEKVSRGMKKPNVDSI--------DWREKGVVNEIKDQAACGSCWAFSAIQAAESA 136
Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI---KRHGLPTEEDYGGYLG 535
+ + G L S+Q L+DC G GC GG AY++I ++ + E DY Y
Sbjct: 137 -YAISTGTLESYSEQNLVDCVQGC--YGCSGGLMDYAYKYIIDRQKGKMILESDY-VYTA 192
Query: 536 QDGYC 550
DG C
Sbjct: 193 LDGVC 197
>UniRef50_Q23H32 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 365
Score = 86.6 bits (205), Expect = 4e-16
Identities = 61/182 (33%), Positives = 90/182 (49%), Gaps = 9/182 (4%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRG----FTMSVNHLADRTDDELAAL 169
K +++YA D E + R IF ++ YIH+ N+ N + VN AD + E L
Sbjct: 46 KKTFRKRYA-DSEGDYRFQIFAENYNYIHNYNQINENSQDNIQLEVNEFADLSLQEFREL 104
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEE---LSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFG 340
Y+ H S + + LS +P DWR V PV+ Q CGSCW+F
Sbjct: 105 YFG-YNSSKKHNNQQNGSTKNLRQSFLLSDSVPESVDWRE-KLVAPVQKQGGCGSCWAFS 162
Query: 341 TVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKR--HGLPTEE 514
TV A+EGA + G++++ S+Q LIDC NNGC+GG+ A + + G+ +
Sbjct: 163 TVIALEGA-YAKQTGNVIKFSEQNLIDCC-RIENNGCNGGDPEPALDCVMNVLKGIMKNQ 220
Query: 515 DY 520
DY
Sbjct: 221 DY 222
>UniRef50_Q24FN2 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 367
Score = 85.4 bits (202), Expect = 9e-16
Identities = 41/103 (39%), Positives = 61/103 (59%), Gaps = 3/103 (2%)
Frame = +2
Query: 269 DWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSW---GFG 439
DWR GAV+PVK+Q CGSCW+F V E L N L S+Q L+DC++ +
Sbjct: 160 DWRQSGAVSPVKNQGSCGSCWAFSAVALAESVNLLRNNS-LALYSEQELVDCTYKNPQYY 218
Query: 440 NNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYCHVDNVT 568
N GC GG AY +IK G+ ++++Y Y+GQ+ C +++ +
Sbjct: 219 NYGCQGGWPSVAYRYIKDQGISSQQNY-PYIGQNRNCSINSAS 260
>UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14;
Leishmania|Rep: Cysteine proteinase 1 precursor -
Leishmania pifanoi
Length = 354
Score = 85.4 bits (202), Expect = 9e-16
Identities = 56/165 (33%), Positives = 81/165 (49%), Gaps = 3/165 (1%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVN-HLADRTDDELAALRGR 178
K +H + + D E R N F+Q+++ + N N V+ AD T E A L
Sbjct: 46 KKRHGKAFGGDAEEGHRFNAFKQNMQTAYFLNTQNPHAHYDVSGKFADLTPQEFAKL--- 102
Query: 179 RYSGPSPHGLPFPYSKS--RVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGA 352
Y P + K V++ + DWR GAVTPVK+Q +CGSCW+F +G
Sbjct: 103 -YLNPDYYARHLKDHKEDVHVDDSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGN 161
Query: 353 VEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI 487
+EG + +G LV LS+Q L+ C + GC+GG +A WI
Sbjct: 162 IEGQ-WAASGHSLVSLSEQMLVSCD--NIDEGCNGGLMDQAMNWI 203
>UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus
tauri|Rep: Cysteine protease-1 - Ostreococcus tauri
Length = 430
Score = 85.0 bits (201), Expect = 1e-15
Identities = 63/175 (36%), Positives = 84/175 (48%), Gaps = 14/175 (8%)
Frame = +2
Query: 38 EHEKRLNIFRQSLRYIHSNNR----ANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHG 205
E+ KRL F ++ Y+ +N + +N LA T +E AL G + S
Sbjct: 116 EYAKRLATFAENAAYVVEHNALYAIGEVSHWVGLNSLAATTREEYRALLGYKPELRSSGD 175
Query: 206 LPF--PYSKSRVEELSVKL------PPEH-DWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
S +VE+ PPE DW GAVTP K+Q CGSCW+F T GAVE
Sbjct: 176 AEMLEATSTDKVEQYKASWEYASVDPPEAIDWVELGAVTPPKNQGQCGSCWAFSTTGAVE 235
Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI-KRHGLPTEEDY 520
G + G LV LS+Q ++ CS N GC+GG A+ WI K G+ +E Y
Sbjct: 236 GITKIRT-GRLVSLSEQEMVSCS--KQNMGCNGGLMDYAFRWIVKNGGIDSEFQY 287
>UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep:
Cysteine proteinase - Paragonimus westermani
Length = 272
Score = 84.6 bits (200), Expect = 2e-15
Identities = 49/120 (40%), Positives = 66/120 (55%), Gaps = 2/120 (1%)
Frame = +2
Query: 230 RVEELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQ 406
RV +K PE DWR GAVT V++Q CGSCW+F T G VEG F+ G LV LS+
Sbjct: 45 RVRPTGLKAAPERIDWRAKGAVTAVENQGSCGSCWAFSTAGNVEGQWFIKT-GQLVSLSK 103
Query: 407 QALIDCSWGFGNNGCDGGEDFRAY-EWIKRHGLPTEEDYGGYLGQDGYCHVDNVTAVTSI 583
Q L+DC +GC+GG +Y E + GL +++DY Y G C ++ + I
Sbjct: 104 QQLVDCD--RAADGCNGGWPASSYLEIMHMGGLESQDDY-PYAGVKEQCFMEKERLLAKI 160
>UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh
fly) (Boettcherisca peregrina). Cathepsin L; n=2;
Dictyostelium discoideum|Rep: Similar to Sarcophaga
peregrina (Flesh fly) (Boettcherisca peregrina).
Cathepsin L - Dictyostelium discoideum (Slime mold)
Length = 265
Score = 84.6 bits (200), Expect = 2e-15
Identities = 51/146 (34%), Positives = 71/146 (48%), Gaps = 2/146 (1%)
Frame = +2
Query: 119 MSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRV--EELSVKLPPEHDWRLFGAV 292
M +N +D T E A + P P P K+ ++ +P DWR GAV
Sbjct: 1 MDLNEYSDLTQKEFADKFFEKLV-PEPRSGPINDIKATPFKHNVNATIPKSFDWRDHGAV 59
Query: 293 TPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFR 472
VK+Q C SCWSF +GA+EG ++ G L+ LS+Q L+DC+ FG GC G
Sbjct: 60 GKVKNQGSCASCWSFSALGALEGHYYI-KYGELLDLSEQNLVDCATPFGPKGCKTGWMHD 118
Query: 473 AYEWIKRHGLPTEEDYGGYLGQDGYC 550
A+++I G E Y G+D C
Sbjct: 119 AFKYIISSGGVNLESQYPYTGKDEVC 144
>UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa
(japonica cultivar-group)|Rep: Os09g0562700 protein -
Oryza sativa subsp. japonica (Rice)
Length = 235
Score = 84.2 bits (199), Expect = 2e-15
Identities = 46/93 (49%), Positives = 57/93 (61%), Gaps = 1/93 (1%)
Frame = +2
Query: 245 SVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDC 424
S LP +H GAVT VKDQ CGSCW+F TV VEG + G LV LS+Q L+DC
Sbjct: 10 SCLLPVDHG----GAVTEVKDQGRCGSCWAFSTVAVVEGIQKIKK-GKLVSLSEQELVDC 64
Query: 425 SWGFGNNGCDGGEDFRAYEWIKRH-GLPTEEDY 520
++GCDGG +RA EWI + G+ T +DY
Sbjct: 65 D--TLDSGCDGGVSYRALEWITANGGITTRDDY 95
>UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2;
Trypanosoma cruzi|Rep: Cysteine proteinase, putative -
Trypanosoma cruzi
Length = 392
Score = 84.2 bits (199), Expect = 2e-15
Identities = 66/192 (34%), Positives = 97/192 (50%), Gaps = 10/192 (5%)
Frame = +2
Query: 38 EHEKRLNIFRQSLRYIHSNNRA-NRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPF 214
E+ +R +F Q+L + ++N A N + M +NH++D T +ELA+L G R S H L
Sbjct: 70 EYVRRRALFEQTLARVRTHNEAGNHLYVMGINHMSDWTPEELASLNGARPRMMS-H-LAQ 127
Query: 215 PYSKSRVEELSVKLPPEHDWRLFGA--VTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGH 388
+ R + ++P E D+R +T VKDQ CGSCW+ G +E + F G
Sbjct: 128 KSLQRRYQSSGGRIPDEVDYRNSSPAILTAVKDQGRCGSCWAHGAAEEME-SHFAILTGR 186
Query: 389 LVRLSQQALIDCSWG----FGNNGCDGGEDFRAYEWIKRHGLPTE--EDYGGYLGQDGYC 550
L LSQQ L C+ G GC G AYE+ K+ G+ +E Y Y G+ G C
Sbjct: 187 LHVLSQQQLTSCAPNPKKCGGTGGCYGSTADLAYEYAKQ-GITSEWVYSYTSYRGETGDC 245
Query: 551 HVD-NVTAVTSI 583
+ +V AV +
Sbjct: 246 RNELDVIAVAQV 257
>UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas
foetus|Rep: TFCP2 protein - Tritrichomonas foetus
(Trichomonas foetus)
Length = 270
Score = 84.2 bits (199), Expect = 2e-15
Identities = 54/167 (32%), Positives = 81/167 (48%), Gaps = 4/167 (2%)
Frame = +2
Query: 95 NRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYSKSRVEELSVK-LPPEHD 271
N G+T+S+ H A T E A+L S H S E + K P D
Sbjct: 3 NSKGHGYTLSLYHFATYTSSEYASLLNVPSGRMSSH-------HSHHERIQYKDTPTSFD 55
Query: 272 WRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDC-SWGFGNNG 448
WR G V P+K+Q CGSCW+F + A E + G L+R S+Q+L+DC + + G
Sbjct: 56 WRSEGKVNPIKNQGSCGSCWAFSAIAAQESCHAIAT-GELLRFSEQSLVDCVTSDYSCQG 114
Query: 449 CDGGEDFRAYEWI--KRHGLPTEEDYGGYLGQDGYCHVDNVTAVTSI 583
C GG +A +++ +++G E+ Y G G C D + V++I
Sbjct: 115 CSGGWPDQAMKYVIEQQNGKFILEENYQYSGHKGACLYDEKSKVSNI 161
>UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep:
Cysteine proteinase - Entamoeba histolytica
Length = 320
Score = 83.4 bits (197), Expect = 4e-15
Identities = 42/107 (39%), Positives = 64/107 (59%), Gaps = 5/107 (4%)
Frame = +2
Query: 254 LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFL-----HNGGHLVRLSQQALI 418
+P DWR G +TP++D + CGSC+SFG++ A+E L + +N +L LS+Q ++
Sbjct: 97 IPTAIDWRAEGKLTPIRDHTQCGSCYSFGSLAAIESRLLIGGSQTYNADNL-DLSEQQIV 155
Query: 419 DCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYCHVD 559
DCS NNGC+GG + + KR+G+ E+DY Y +G C D
Sbjct: 156 DCS--NKNNGCNGGSILYVFAYTKRNGVIEEKDY-PYTATNGTCQYD 199
>UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium
tetraurelia|Rep: Cathepsin L1 precursor - Paramecium
tetraurelia
Length = 314
Score = 83.4 bits (197), Expect = 4e-15
Identities = 59/192 (30%), Positives = 92/192 (47%), Gaps = 9/192 (4%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHS--NNRANRGFTMSVNHLADRTDDELA---- 163
K+K+ R+Y + + R +F +L YI + + FT+ +N AD + E A
Sbjct: 30 KMKYNRRYTNQRDEMYRYKVFTDNLNYIRAFYESPEEATFTLELNQFADMSQQEFAQTYL 89
Query: 164 ALRGRRYSGPSPHGLPFPYSKSRVE---ELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWS 334
+L+ R + + F Y + V+ VK P VK+Q CGSCW+
Sbjct: 90 SLKVPRTAKLNAANSNFQYKGAEVDWTDNKKVKYPA------------VKNQGSCGSCWA 137
Query: 335 FGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEE 514
F VGA+E + LS+Q L+DCS + N+GC+GG A+E++ +GL +
Sbjct: 138 FSAVGALEINTDIELN-RKYELSEQDLVDCSGPYDNDGCNGGWMDSAFEYVADNGLAEAK 196
Query: 515 DYGGYLGQDGYC 550
DY Y +DG C
Sbjct: 197 DY-PYTAKDGTC 207
>UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease
Gip1p; n=4; Tetrahymena thermophila|Rep:
Granule-biosynthesis induced protease Gip1p -
Tetrahymena thermophila
Length = 345
Score = 83.0 bits (196), Expect = 5e-15
Identities = 53/198 (26%), Positives = 95/198 (47%), Gaps = 8/198 (4%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAA----- 166
+ ++R Y ++ E R +F ++L ++ + +++ ++ +N +D T +E
Sbjct: 44 RFNYKRVYLNEEEQIYRQIVFFENLASVNKHP-SHKSYSKGLNQFSDMTKEEFKQRVLNK 102
Query: 167 -LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGT 343
+ + S L + S + + LP DWR G + PVK+Q CGSCW+F T
Sbjct: 103 KISKKASSNKGGRNLAADPAVSNLVFPTNNLPLSVDWRKRGVLNPVKNQGTCGSCWTFAT 162
Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDC--SWGFGNNGCDGGEDFRAYEWIKRHGLPTEED 517
G +E + N L++ S+Q L+DC G+ ++GCDGG + +G+
Sbjct: 163 AGILESFNQIKN-KQLLKFSEQQLVDCVSLAGYDSDGCDGGFQEDGVRYAIEYGIVQSYK 221
Query: 518 YGGYLGQDGYCHVDNVTA 571
Y Y+G G C V + T+
Sbjct: 222 Y-PYVGYQGRCKVTSPTS 238
>UniRef50_Q24FA8 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 335
Score = 83.0 bits (196), Expect = 5e-15
Identities = 56/194 (28%), Positives = 97/194 (50%), Gaps = 12/194 (6%)
Frame = +2
Query: 23 YASDLEHEKRLNIFRQSLRYIHSNNR-ANRGFTMSVNHLADRTDDELAALRGRRYSGPSP 199
Y+S+ E R ++F ++ + + +N+ +N +++ +N +D T L + R SP
Sbjct: 43 YSSEAEKIYRQSVFLENYQSVQEHNKNSNHTYSVGINQFSDIT---LQEYQQRILMKNSP 99
Query: 200 HGLPFPYSKSRVEELS-------VKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
+K+R+ + S ++ DWR G V+PVK+Q CG CW+F G +E
Sbjct: 100 LN-ELAKNKNRLLQSSPIQNSNDTQIASSIDWRKKGGVSPVKNQGECGGCWTFSATGLME 158
Query: 359 GALFLHNGGHLVRL-SQQALIDC---SWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGG 526
+HN V L SQQ L+DC G+ + GC+GG A ++ G+ ++ +Y
Sbjct: 159 SFNLIHNKPQNVSLYSQQQLLDCVTLENGYFSEGCEGGVPSDAVQYAADFGVLSDNEY-P 217
Query: 527 YLGQDGYCHVDNVT 568
Y G G C++ + T
Sbjct: 218 YTGIQGQCNITSKT 231
>UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin O;
n=1; Monodelphis domestica|Rep: PREDICTED: similar to
cathepsin O - Monodelphis domestica
Length = 414
Score = 82.6 bits (195), Expect = 6e-15
Identities = 55/176 (31%), Positives = 82/176 (46%), Gaps = 6/176 (3%)
Frame = +2
Query: 44 EKRLNIFRQSLR---YIHS-NNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLP 211
E R FR+SL+ Y++S ++ N +N + +E + Y P LP
Sbjct: 131 ENRSTAFRESLKRHHYLNSFSSSDNTSAIYGINQFSYLFPEEFKDI----YLRSKPSVLP 186
Query: 212 FPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHL 391
++ + LP DWR VT V++Q +CG CW+F VG++E A + G L
Sbjct: 187 LYSEALKMPTTHMPLPVRFDWRDKHVVTKVRNQQMCGGCWAFSVVGSIESA-YAIKGESL 245
Query: 392 VRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH--GLPTEEDYGGYLGQDGYCH 553
LS Q +IDCS + N GC GG A W+ + L + +Y + Q G CH
Sbjct: 246 EDLSVQQVIDCS--YNNFGCSGGSTVNALNWLNKTQVRLVKDSEY-SFKAQTGLCH 298
>UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza
sativa|Rep: OSJNBb0085F13.15 protein - Oryza sativa
subsp. japonica (Rice)
Length = 383
Score = 82.6 bits (195), Expect = 6e-15
Identities = 60/197 (30%), Positives = 88/197 (44%), Gaps = 17/197 (8%)
Frame = +2
Query: 11 HQRQYASDLEHEKRLNIFRQSLRYIHSNNR-ANRGFTMSVNHLADRTDDE-LAALRGRRY 184
H R YAS E +R ++R ++ +I + NR + F + D T +E LA G
Sbjct: 63 HNRSYASADEKLRRFEVYRSNMEFIEATNRNGSLTFKLGETPFTDLTHEEFLATYTGDVR 122
Query: 185 SGPSPHGLPFPYSK--------------SRVEELSVKLPPEHDWRLFGAVTPVKDQSVCG 322
P G+ + + +V +P DWR GAVTP K Q C
Sbjct: 123 LPPERRGMQDDSDEEDAVITTSAGYVAGAGAGRRTVAVPESVDWRKEGAVTPAKHQGQCA 182
Query: 323 SCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI-KRHG 499
+CW+F V A+E +L GG L+ LS+Q L+DC G C G A+ W+ K G
Sbjct: 183 ACWAFAAVAAIE-SLHKIKGGDLISLSEQELVDCD-DTGEATCSKGYSDDAFLWVSKNKG 240
Query: 500 LPTEEDYGGYLGQDGYC 550
+ ++ Y Y+G C
Sbjct: 241 IASDLIY-PYVGHKESC 256
>UniRef50_P43234 Cluster: Cathepsin O precursor; n=22;
Euteleostomi|Rep: Cathepsin O precursor - Homo sapiens
(Human)
Length = 321
Score = 82.6 bits (195), Expect = 6e-15
Identities = 56/175 (32%), Positives = 83/175 (47%), Gaps = 5/175 (2%)
Frame = +2
Query: 44 EKRLNIFRQSL---RYIHSNNRANRGFTM-SVNHLADRTDDELAALRGRRYSGPSPHGLP 211
E+ FR+SL RY++S + +N + +E A+ Y P P
Sbjct: 38 EREAAAFRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAI----YLRSKPSKFP 93
Query: 212 FPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHL 391
++ + +V LP DWR VT V++Q +CG CW+F VGAVE A + G L
Sbjct: 94 RYSAEVHMSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESA-YAIKGKPL 152
Query: 392 VRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYG-GYLGQDGYCH 553
LS Q +IDCS + N GC+GG A W+ + + +D + Q+G CH
Sbjct: 153 EDLSVQQVIDCS--YNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCH 205
>UniRef50_Q22AB1 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 344
Score = 82.2 bits (194), Expect = 8e-15
Identities = 41/103 (39%), Positives = 57/103 (55%), Gaps = 1/103 (0%)
Frame = +2
Query: 254 LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWG 433
LP DWR G +TP K Q+ CGSCW+F T G +E L G L+ S+Q L+DC
Sbjct: 131 LPESFDWRDKGIITPAKFQNTCGSCWTFATTGVIESQYAL-KYGELLHFSEQMLLDCD-- 187
Query: 434 FGNNGCDGGEDFRAYEWIKRH-GLPTEEDYGGYLGQDGYCHVD 559
N GC GG AY+++++ G+ T + YG Y + C+ D
Sbjct: 188 NINQGCRGGLMTDAYQFLQQSGGIQTADTYGDYKNKKDICNFD 230
>UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease
containing protein; n=1; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 350
Score = 81.8 bits (193), Expect = 1e-14
Identities = 57/190 (30%), Positives = 89/190 (46%), Gaps = 8/190 (4%)
Frame = +2
Query: 17 RQYASDLEHEKRLNIFRQSLRYIHSNNR-ANRGFTMSVNHLADRTDDELA--ALRGRRYS 187
R Y S+ E R +F Q+ + I +N +N + + N +D T DE A L + +
Sbjct: 56 RTYLSEEERTYRQIVFLQNDQNIQKHNSDSNNTYKLQHNQFSDMTKDEFAHRVLNSQLKT 115
Query: 188 GPSPHGLPFPYSKSRVE-ELSVKLPPEHDWRLF-GAVTPVKDQSVCGSCWSFGTVGAVEG 361
S P + R + S+ DWR + G + VK+Q CGSCW+F T G +E
Sbjct: 116 SASSSSQPAQTPQLRGSVDASLNASQGFDWRNYQGVLGNVKNQGQCGSCWTFATAGVLES 175
Query: 362 ALFLHNGGHLVRLSQQALIDC---SWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYL 532
L L+ S+Q ++DC S+G+ ++GC+GG ++ GL + DY Y+
Sbjct: 176 YYALKYQQSLI-FSEQDIVDCASRSYGYQSDGCNGGFPSEGLQYASTVGL-VQSDYYPYV 233
Query: 533 GQDGYCHVDN 562
G C N
Sbjct: 234 AVQGTCRQVN 243
>UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1;
Phytophthora infestans|Rep: Cathepsin-like cysteine
protease - Phytophthora infestans (Potato late blight
fungus)
Length = 376
Score = 81.8 bits (193), Expect = 1e-14
Identities = 67/200 (33%), Positives = 93/200 (46%), Gaps = 12/200 (6%)
Frame = +2
Query: 11 HQRQYASDL-EHEK---RLNIFRQSLRYIHSNNRA-NRG---FTMSVNHLADRTDDELAA 166
+++ Y +D +H+ R F +L I ++N A RG FT+ +N LAD D E
Sbjct: 47 YEKSYRNDANDHDVVQLRFRSFATNLERIQTHNEAYERGEHSFTLGLNDLADLADAEYKQ 106
Query: 167 LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
L R + K E LP DWR VTPVK+Q CGSCW+F V
Sbjct: 107 LLSYRTRDSKSSSASETFVKPENVE---DLPATWDWREHSTVTPVKNQGQCGSCWAFSAV 163
Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCD-GGEDFRAYEWI---KRHGLPTEE 514
A+E A L + G L LS+Q L+DC+ G + C+ GGE YE I + + EE
Sbjct: 164 AAMECAYAL-STGTLESLSEQELVDCTLN-GIDTCNHGGEMSEGYEEIITNHKGKIDREE 221
Query: 515 DYGGYLGQDGYCHVDNVTAV 574
Y G C+ + A+
Sbjct: 222 VYRYTAESKGVCNAKDDKAI 241
>UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium
(Vinckeia)|Rep: Berghepain-2 - Plasmodium yoelii yoelii
Length = 472
Score = 81.8 bits (193), Expect = 1e-14
Identities = 55/182 (30%), Positives = 92/182 (50%), Gaps = 7/182 (3%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDE--LAALRGR- 178
K+ ++Y+S E ++R IF + L+ I +N+ N +T +N +D +E + L +
Sbjct: 162 KYNKEYSSAEEMQERFYIFSEKLKKIEKHNKENHLYTKGINAFSDMRHEEFKMKYLNNKL 221
Query: 179 --RYSGPSPHGLPFPYSKSRVEELSVKLP-PEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
+ H +P+ + ++ + + ++ DWR A+ +KDQ C SCW+F T G
Sbjct: 222 KENHQIDLRHLIPYTIAINKYKSPTDQINYTSFDWRDHNAIIDIKDQQKCASCWAFATAG 281
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYE-WIKRHGLPTEEDYGG 526
V A + V LS+Q L+DC+ N GCDGG A+E I +GL E+ Y
Sbjct: 282 VV-AAQYAIRKNQKVSLSEQQLVDCAQ--NNFGCDGGILPYAFEDLIDMNGL-CEDKYYP 337
Query: 527 YL 532
Y+
Sbjct: 338 YV 339
>UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheirus
salmonis|Rep: Cysteine proteinase - Lepeophtheirus
salmonis (salmon louse)
Length = 372
Score = 81.8 bits (193), Expect = 1e-14
Identities = 59/202 (29%), Positives = 97/202 (48%), Gaps = 12/202 (5%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRGFTMSVNHLADRTDDELAALRGRRY 184
++ + Y + +L +F +LR I +N R + M +N +D TD+E + +Y
Sbjct: 33 EYSKSYHNRALRSLKLKVFVDNLREIEEHNANPKRTWDMGINEFSDLTDEEFES----KY 88
Query: 185 SGPSP--HGLPFPYSKSRVEELSVK-LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
G SP + ++ ++K LP DWR G +T VK+Q CGSCW F V +
Sbjct: 89 MGYSPMSSSAGLVTRTAAPKQGNIKDLPESVDWREKGVITDVKNQGSCGSCWVFSAVEQI 148
Query: 356 EGALFLHNG-GHLVRLSQQALIDCSWG----FGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
E + + N LS Q + CS G+ GC G + AY + + +G+ TE++Y
Sbjct: 149 ESYVAIENNMTSPPLLSTQQITSCSSNPYSCGGSGGCKGAINEIAYMYTQLYGIETEKEY 208
Query: 521 ---GGYLGQDGYCHVDNVTAVT 577
G+ + G C + N ++VT
Sbjct: 209 PYTSGFTEESGEC-LYNASSVT 229
>UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin W,
partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED:
similar to Cathepsin W, partial - Ornithorhynchus
anatinus
Length = 229
Score = 81.4 bits (192), Expect = 1e-14
Identities = 56/162 (34%), Positives = 77/162 (47%), Gaps = 4/162 (2%)
Frame = +2
Query: 47 KRLNIFRQSLRYIHSNNRANRGFT-MSVNHLADRTDDELAALRGRRYSGPSPHGLPFPYS 223
+R IF Q+L + G V +D +++E +L R+ G+P ++
Sbjct: 3 RRFKIFVQNLARARKLQEEDLGTAEYGVTPFSDLSEEEFLSLYAPRF------GMPSGWA 56
Query: 224 KSRVEELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRL 400
L E DWR GA+T VK+Q CGSCW+F VG E +L G LV L
Sbjct: 57 NQMASIPEGPLRKETCDWRKRGAITSVKNQGSCGSCWAFAAVGNAESMWYLRAGKRLVSL 116
Query: 401 SQQALIDCSWGFGNNGCDGG--EDFRAYEWIKRHGLPTEEDY 520
S Q ++DC G +GC GG ED W R GL +E+DY
Sbjct: 117 SVQEVLDC--GRCRDGCQGGYPEDAFVTMWFNR-GLASEKDY 155
>UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin O
precursor; n=2; Apocrita|Rep: PREDICTED: similar to
Cathepsin O precursor - Apis mellifera
Length = 374
Score = 81.4 bits (192), Expect = 1e-14
Identities = 55/204 (26%), Positives = 98/204 (48%), Gaps = 13/204 (6%)
Frame = +2
Query: 5 VKHQRQYASD-LEHEKRLNIFRQSLRYIHSNN---RANRGFTMSVNHLADRTDDELAA-- 166
+++ + Y ++ E+E+R F++SL++I N + + +D +++E
Sbjct: 62 IRYNKSYRNNPSEYEERFKRFQRSLQHIERMNGLRSSQESAYYGLTEFSDMSENEFLLHT 121
Query: 167 ------LRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSC 328
+RG ++ S H S R++ S+ +P DWR G +TPV+ Q CG+C
Sbjct: 122 LLPDLPIRGEKHMNASYHR-KHQISIDRMKR-SISIPLRFDWRDKGVITPVRSQGSCGAC 179
Query: 329 WSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLP- 505
W+F T+ +E ++F G L LS Q +IDC+ N GC+GG+ W+ +
Sbjct: 180 WAFSTIEVIE-SMFAIKNGTLHSLSVQEMIDCAKN-SNFGCEGGDICSLLSWLLISKVQI 237
Query: 506 TEEDYGGYLGQDGYCHVDNVTAVT 577
+E +G G C + +T T
Sbjct: 238 LQESIYPLVGMTGTCKLGKMTDKT 261
>UniRef50_Q23H10 Cluster: Papain family cysteine protease containing
protein; n=14; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 336
Score = 81.4 bits (192), Expect = 1e-14
Identities = 57/196 (29%), Positives = 95/196 (48%), Gaps = 13/196 (6%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNR-ANRGFTMSVNHLADRTDDELAA------ 166
++QR Y ++ E R +F ++ + I +N N +++ +N +D T +E A
Sbjct: 35 QNQRVYLNEHEKLFRQMVFFENFQKIQEHNSDPNNTYSVHLNQFSDMTKEEFAEKILMKS 94
Query: 167 -LRGRRYSGPSPHGL--PFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSF 337
L G S +++++ S+ L DWR GAVT VK+Q CGSCWSF
Sbjct: 95 DLVDHLMKGISQEATHNDTNNNETQLSSNSLTLADSIDWRTKGAVTSVKNQGGCGSCWSF 154
Query: 338 GTVGAVEGALFLHNGGHLVRLSQQALIDC---SWGFGNNGCDGGEDFRAYEWIKRHGLPT 508
+E F+ N LV S+Q L+DC + G+ + GC+GG + ++ + G+ T
Sbjct: 155 SAAAVMESFNFIQNKA-LVDFSEQQLVDCVIPANGYNSYGCNGGWPVQCLDYASKVGITT 213
Query: 509 EEDYGGYLGQDGYCHV 556
+ Y Y+ C+V
Sbjct: 214 LDKY-PYVAVQKNCNV 228
>UniRef50_A7APS9 Cluster: Papain family cysteine protease containing
protein; n=1; Babesia bovis|Rep: Papain family cysteine
protease containing protein - Babesia bovis
Length = 435
Score = 81.4 bits (192), Expect = 1e-14
Identities = 61/186 (32%), Positives = 86/186 (46%), Gaps = 17/186 (9%)
Frame = +2
Query: 44 EKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPS-PHGLPF-- 214
E+ +R R N ++ +TM +N AD T ++ +L+G R S G+P
Sbjct: 139 ERFATFYRNVTRIREFNMNVHKTYTMKINQFADMTPEQFMSLQGTRASKIRVSKGIPDSQ 198
Query: 215 ---------PYSKSRVEELSVK---LPPEH--DWRLFGAVTPVKDQSVCGSCWSFGTVGA 352
P KS V + + + PE D R +TPVKDQ CGSCW+F +G
Sbjct: 199 VAAVGNQKGPNLKSEVRQTGNRFADISPEDFIDLRKDNYMTPVKDQGNCGSCWAFSLIGV 258
Query: 353 VEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYL 532
E F H V LS+Q L+DC +GCD G + AYE+I+ HG+ Y Y
Sbjct: 259 AE-PFFKHKRDIDVVLSEQNLVDCVKEC--HGCDYGNSYFAYEYIRDHGVYRLASY-PYT 314
Query: 533 GQDGYC 550
+ G C
Sbjct: 315 AKSGPC 320
>UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3;
Dictyostelium discoideum AX4|Rep: Putative
uncharacterized protein - Dictyostelium discoideum AX4
Length = 664
Score = 81.0 bits (191), Expect = 2e-14
Identities = 47/123 (38%), Positives = 68/123 (55%), Gaps = 2/123 (1%)
Frame = +2
Query: 221 SKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRL 400
SKSR+ L P DWR +G V+ VK+Q CGSC++F TVGA+E + N ++ L
Sbjct: 461 SKSRL--LKWSRPISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALESHYYRKN-NRMLDL 517
Query: 401 SQQALIDC--SWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYCHVDNVTAV 574
S+Q L+DC S + N GC GG Y +I+ +G +E Y G+ G C ++ A
Sbjct: 518 SEQNLVDCTASNKYRNGGCSGGWMHNCYSYIQENGGINQESTYPYEGKFGQCRYNSGDAQ 577
Query: 575 TSI 583
+ I
Sbjct: 578 SRI 580
>UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1;
Dictyostelium discoideum AX4|Rep: Putative
uncharacterized protein - Dictyostelium discoideum AX4
Length = 339
Score = 80.6 bits (190), Expect = 3e-14
Identities = 42/92 (45%), Positives = 61/92 (66%), Gaps = 4/92 (4%)
Frame = +2
Query: 269 DWRLFGAVTPVKDQSVC-GSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNN 445
DWR F AVTPVK+Q +C G+ +SF +G +E + F+ N L+ LS+Q +IDC+ GNN
Sbjct: 119 DWRNFDAVTPVKNQGLCSGAGYSFSAIGVIESSHFIKNK-ELITLSEQNIIDCTTDMGNN 177
Query: 446 GCDGGEDFRAYEW-IKRHGLPTEED--YGGYL 532
GC GG A+++ IK+ G+ +E + Y GYL
Sbjct: 178 GCMGGLALIAFDYIIKQKGIDSEFNYPYEGYL 209
>UniRef50_Q23FQ5 Cluster: Papain family cysteine protease containing
protein; n=4; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 332
Score = 80.6 bits (190), Expect = 3e-14
Identities = 47/183 (25%), Positives = 90/183 (49%), Gaps = 3/183 (1%)
Frame = +2
Query: 11 HQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRG-FTMSVNHLADRTDDELAALRGRRYS 187
++R + ++ E R +F ++L+ + ++ + +T+S+N +D + +E ++
Sbjct: 43 YRRVFLNEDEETYRQLVFFENLQKLKTHEKNTEATYTVSLNQFSDYSQEEFVQRILNKHI 102
Query: 188 GPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGAL 367
S + + +V P DWR GA+ P+++Q CGSC +FGT G +E
Sbjct: 103 SRSDADIQKEQEPNGNLRKAVNYPTSVDWRNSGALNPIQNQGQCGSCAAFGTAGVLESFY 162
Query: 368 FLHNGGHLVRLSQQALIDCS--WGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQD 541
+L L++ S+Q L+DC+ GF GCDG +++ ++G+ Y Y+G
Sbjct: 163 YL-KSKQLLKFSEQQLLDCARQAGFDTYGCDGAWQQEYFKYAIKYGIVQGSSY-PYVGYQ 220
Query: 542 GYC 550
C
Sbjct: 221 TTC 223
>UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5;
Theileria|Rep: Cysteine proteinase precursor - Theileria
parva
Length = 440
Score = 80.6 bits (190), Expect = 3e-14
Identities = 61/187 (32%), Positives = 92/187 (49%), Gaps = 17/187 (9%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAAL------ 169
K+ R++A+ E RL FR + + + + + +N +D T+ E L
Sbjct: 131 KYNRRHATQQERLNRLVTFRSNYLEV-KEQKGDEPYVKGINRFSDLTEREFYKLFPVMKP 189
Query: 170 RGRRYSGPS---PHGLPFPYSKSRVEELSV-------KLPPEH-DWRLFGAVTPVKDQSV 316
YS H Y K+ + L+ KL E+ DWR +VT VKDQS
Sbjct: 190 PKATYSNGYYLLSHMANKTYLKNLKKALNTDEDVDLAKLTGENLDWRRSSSVTSVKDQSN 249
Query: 317 CGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRH 496
CG CW+F TVG+VEG ++ + LS Q L+DC F +NGC GG AYE+++++
Sbjct: 250 CGGCWAFSTVGSVEG-YYMSHFDKSYELSVQELLDCD-SF-SNGCQGGLLESAYEYVRKY 306
Query: 497 GLPTEED 517
GL + +D
Sbjct: 307 GLVSAKD 313
>UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1;
Oryza sativa (japonica cultivar-group)|Rep: Putative
uncharacterized protein - Oryza sativa subsp. japonica
(Rice)
Length = 326
Score = 80.2 bits (189), Expect = 3e-14
Identities = 47/124 (37%), Positives = 63/124 (50%), Gaps = 4/124 (3%)
Frame = +2
Query: 50 RLNIFRQSLRYIHSNNRAN-RGFTMSVNHLADRTDDELAALRGRRYSGPSPH---GLPFP 217
R +F+++ RYIH NR + + +N AD T +E A +Y+G +P GL
Sbjct: 49 RFEVFKKNARYIHDFNRKKGMSYKLGLNKFADLTLEEFTA----KYTGANPGPITGLKNG 104
Query: 218 YSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVR 397
+ ++ PP DWR GAVT VKDQ CGSCW+F V AVEG + G L
Sbjct: 105 TGSPPLAAVAGDAPPAWDWREHGAVTRVKDQGPCGSCWAFSVVEAVEGINEIMTGNFLTL 164
Query: 398 LSQQ 409
QQ
Sbjct: 165 SEQQ 168
>UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like
cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
Clan CA, family C1, cathepsin L-like cysteine peptidase
- Trichomonas vaginalis G3
Length = 293
Score = 80.2 bits (189), Expect = 3e-14
Identities = 52/181 (28%), Positives = 88/181 (48%), Gaps = 3/181 (1%)
Frame = +2
Query: 38 EHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFP 217
E+ RL I+ ++RYI +N+A + + N A T E ++ + P L
Sbjct: 12 EYAFRLGIYLSNMRYIKEHNKAGSSYKLEGNRFAAFTPAEYRSMLSK------PKSLAKK 65
Query: 218 YSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVR 397
+ + ++ +P E DWR G VTPV+ Q CG+ W+F + E +++ G L
Sbjct: 66 FESAPLKHKEGAIPAEFDWRTKGVVTPVRYQEGCGAGWAFASAALQESMWAIYDRG-LAH 124
Query: 398 LSQQALIDCSWGFGNNGCDGGEDFRA--YEWIKRHGL-PTEEDYGGYLGQDGYCHVDNVT 568
LS Q L+DC + ++GCDGG A + + ++G+ ++ DY + G C D+
Sbjct: 125 LSVQQLLDCD--YNDDGCDGGSSDGASYFVLLNQYGMWMSDSDY-PFKPYVGECKFDSSM 181
Query: 569 A 571
A
Sbjct: 182 A 182
>UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2;
Endopterygota|Rep: Cathepsin L-like proteinase - Bombyx
mori (Silk moth)
Length = 402
Score = 80.2 bits (189), Expect = 3e-14
Identities = 58/196 (29%), Positives = 89/196 (45%), Gaps = 9/196 (4%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRAN----RGFTMSVNHLADRTDDE---- 157
K H + Y+S L +RQ+LR + +NR + +++ +NH D E
Sbjct: 104 KAIHNKLYSSTHHEMAALMKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTEYFGK 163
Query: 158 -LAALRGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWS 334
L ++ P+ Y +R K+P DWR G ++Q CG+C++
Sbjct: 164 VLKLIKAFPLFDPAEDHHKTAYRHNR----RCKVPKRIDWRDQGFKPRREEQWQCGACYA 219
Query: 335 FGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEE 514
F A++ L+ +G LS Q ++DCS GN GCDGG A + R GL E
Sbjct: 220 FAVTHALQAQLYKRHG-EWNELSPQQIVDCSIKDGNMGCDGGSLRGALRYAAREGLVMES 278
Query: 515 DYGGYLGQDGYCHVDN 562
Y Y+G+ GYC D+
Sbjct: 279 HY-PYVGKKGYCRYDS 293
>UniRef50_P56202 Cluster: Cathepsin W precursor; n=15; Eutheria|Rep:
Cathepsin W precursor - Homo sapiens (Human)
Length = 376
Score = 80.2 bits (189), Expect = 3e-14
Identities = 57/177 (32%), Positives = 86/177 (48%), Gaps = 4/177 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFT-MSVNHLADRTDDELAALRG- 175
+++ R Y S EH RL+IF +L + G V +D T++E L G
Sbjct: 46 QIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQLYGY 105
Query: 176 RRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWR-LFGAVTPVKDQSVCGSCWSFGTVGA 352
RR +G G+P + R EE +P DWR + GA++P+KDQ C CW+ G
Sbjct: 106 RRAAG----GVPSMGREIRSEEPEESVPFSCDWRKVAGAISPIKDQKNCNCCWAMAAAGN 161
Query: 353 VEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAY-EWIKRHGLPTEEDY 520
+E L+ + V +S L+DC G +GC GG + A+ + GL +E+DY
Sbjct: 162 IE-TLWRISFWDFVDVSVHELLDC--GRCGDGCHGGFVWDAFITVLNNSGLASEKDY 215
>UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep:
Vivapain-4 - Plasmodium vivax
Length = 484
Score = 79.8 bits (188), Expect = 4e-14
Identities = 59/185 (31%), Positives = 98/185 (52%), Gaps = 9/185 (4%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRGFTMSVNHLADRTDDE----LAALR 172
+H ++Y ++ E ++R F ++L I+S+N +AN + N +D + +E + LR
Sbjct: 172 EHGKKYKTEEEMQQRYLAFTENLARINSHNSKANILYKKGTNQYSDISFEEFRKTMLTLR 231
Query: 173 G--RRYSGPSPHGLPFPYSKSRVEELSVKLPPE-HDWRLFGAVTPVKDQSVCGSCWSFGT 343
++ SP+ + + + + E +DWR AV+ +K+Q++CGSCW+FG
Sbjct: 232 FDLKKKLANSPYVSNYDDVLKKYKPADAVVDNEKYDWREHNAVSEIKNQNLCGSCWAFGA 291
Query: 344 VGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAY-EWIKRHGLPTEEDY 520
VGAVE + H V +S+Q L+DCS N GC GG A+ + I L +E DY
Sbjct: 292 VGAVESQYAIRKNQH-VLISEQELVDCS--DKNFGCFGGLASLAFDDMIDLGYLCSESDY 348
Query: 521 GGYLG 535
Y+G
Sbjct: 349 -PYVG 352
>UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like
cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
Clan CA, family C1, cathepsin L-like cysteine peptidase
- Trichomonas vaginalis G3
Length = 291
Score = 79.8 bits (188), Expect = 4e-14
Identities = 59/181 (32%), Positives = 88/181 (48%), Gaps = 2/181 (1%)
Frame = +2
Query: 38 EHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFP 217
E++ R I+ + ++ ++N+AN + +S+N L+ T E +L G + L
Sbjct: 12 EYKFRFGIWMANKNFVETHNKANANYKLSLNSLSHLTPTEYQSLLGTKID----KNLVSQ 67
Query: 218 YSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVR 397
K R + P D+R G V P++DQ CGSCW+FGTV A E L +L +
Sbjct: 68 GKKVRPQIKDS--PGILDYREMGVVNPIRDQKQCGSCWAFGTVAACESNYALLY-SNLPQ 124
Query: 398 LSQQALIDCSWGFGNNGCDGGEDFRAYEWI--KRHGLPTEEDYGGYLGQDGYCHVDNVTA 571
LS+Q +IDC+ GC GG A +I K+ G + Y G DG C D TA
Sbjct: 125 LSEQNIIDCATTC--YGCGGGIIQAAMSFIINKQGGAIMKLSDYPYQGVDGACKFDAKTA 182
Query: 572 V 574
+
Sbjct: 183 M 183
>UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184,
whole genome shotgun sequence; n=1; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_184,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 331
Score = 79.8 bits (188), Expect = 4e-14
Identities = 59/190 (31%), Positives = 91/190 (47%), Gaps = 7/190 (3%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNR----ANRGFTMSVNHLADRTDDELAAL 169
K H ++Y SD E R ++F Q+L + +N FT+ +N AD T +E A
Sbjct: 38 KQLHGKRY-SDFEEVHRFSVFAQNLAVVMEHNSKFELGQETFTLGMNQYADLTPEEFQAS 96
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
+ YS + P DW+ +T VK+Q CGSCW+F
Sbjct: 97 FLTLKTKVQDRKNVKSYS-------GLSFPDTVDWK--DGLT-VKNQGSCGSCWAFAAAA 146
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCS---WGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
A+E A F H+ + V +S+Q +DC+ G+ + GC+GG A+++ +G+ TEE+Y
Sbjct: 147 AIE-AGFQHHKKNKVNISEQEFVDCTTEKLGYESQGCNGGWMDDAFDYTVNYGVTTEEEY 205
Query: 521 GGYLGQDGYC 550
Y G D C
Sbjct: 206 -PYKGVDQPC 214
>UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin O;
n=1; Danio rerio|Rep: PREDICTED: similar to cathepsin O
- Danio rerio
Length = 327
Score = 79.4 bits (187), Expect = 6e-14
Identities = 45/115 (39%), Positives = 62/115 (53%), Gaps = 2/115 (1%)
Frame = +2
Query: 212 FPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHL 391
F SKS ++ + PP DWR G V PV +Q CG CW+F V A+E ++ G L
Sbjct: 107 FDQSKSEIK-VKANNPPRFDWRDHGVVGPVHNQGSCGGCWAFSIVEAIE-SVSAKVGEKL 164
Query: 392 VRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLP--TEEDYGGYLGQDGYC 550
+LS Q +IDCS + N GC+GG A W+ + L +E +Y + G DG C
Sbjct: 165 QQLSVQQVIDCS--YQNQGCNGGSPVEALYWLTQSKLKLVSEAEY-PFKGADGVC 216
>UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Rep:
Cysteine protease - Babesia equi
Length = 438
Score = 79.4 bits (187), Expect = 6e-14
Identities = 38/83 (45%), Positives = 52/83 (62%)
Frame = +2
Query: 269 DWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNG 448
DWR VTPVKDQ CGSCW+F VG+VE +L+L G + LS+Q L++C +NG
Sbjct: 229 DWRKLNGVTPVKDQGNCGSCWAFAAVGSVE-SLYLIKKGQALDLSEQELVNCE--ENSNG 285
Query: 449 CDGGEDFRAYEWIKRHGLPTEED 517
C+G +A E+IK G+ +D
Sbjct: 286 CEGDLPNKALEYIKAKGISHSKD 308
>UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila
melanogaster|Rep: CG11459-PA - Drosophila melanogaster
(Fruit fly)
Length = 336
Score = 79.0 bits (186), Expect = 8e-14
Identities = 58/183 (31%), Positives = 94/183 (51%), Gaps = 10/183 (5%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNR----ANRGFTMSVNHLADRTDDELAAL 169
K K+ +QY + ++ + L + Q + + S+N+ F M +N +D TD + L
Sbjct: 34 KAKYNKQYRNRDKYHRAL--YEQRVLAVESHNQLYLQGKVAFKMGLNKFSD-TDQRI--L 88
Query: 170 RGRRYSGPSP-----HGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSV-CGSCW 331
R S P+P + L + R ++++ + DWR +G ++PV DQ C SCW
Sbjct: 89 FNYRSSIPAPLETSTNALTETVNYKRYDQITEGI----DWRQYGYISPVGDQGTECLSCW 144
Query: 332 SFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTE 511
+F T G +E A G+LV LS + L+DC + NNGC GG A+ + + HG+ T+
Sbjct: 145 AFSTSGVLE-AHMAKKYGNLVPLSPKHLVDCV-PYPNNGCSGGWVSVAFNYTRDHGIATK 202
Query: 512 EDY 520
E Y
Sbjct: 203 ESY 205
>UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteinase
A; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like
tick cysteine proteinase A - Haemaphysalis longicornis
(Bush tick)
Length = 312
Score = 79.0 bits (186), Expect = 8e-14
Identities = 38/88 (43%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Frame = +2
Query: 254 LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWG 433
LP DW G+ PVK+Q CGSCW+F T G++EG F + +Q L+DCS
Sbjct: 93 LPTTVDWAQEGSRAPVKNQGQCGSCWAFSTTGSLEGQHFRKTESRVT--GEQNLVDCSDD 150
Query: 434 FGNNGCDGGEDFRAYEWIKRH-GLPTEE 514
FGN GC+GG +++IK + G+ TEE
Sbjct: 151 FGNQGCNGGLMDNGFQYIKANGGIDTEE 178
>UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1
precursor; n=20; Psoroptidia|Rep: Major mite fecal
allergen Der f 1 precursor - Dermatophagoides farinae
(House-dust mite)
Length = 321
Score = 79.0 bits (186), Expect = 8e-14
Identities = 60/191 (31%), Positives = 91/191 (47%), Gaps = 4/191 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELA---ALR 172
K + YA+ E E F +SL+Y+ AN+G ++NHL+D + DE +
Sbjct: 30 KKAFNKNYATVEEEEVARKNFLESLKYVE----ANKG---AINHLSDLSLDEFKNRYLMS 82
Query: 173 GRRYSG-PSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
+ + L S R+ SV +P E D R VTP++ Q CGSCW+F V
Sbjct: 83 AEAFEQLKTQFDLNAETSACRIN--SVNVPSELDLRSLRTVTPIRMQGGCGSCWAFSGVA 140
Query: 350 AVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGY 529
A E A + L LS+Q L+DC+ +GC G R E+I+++G+ E Y Y
Sbjct: 141 ATESAYLAYRNTSL-DLSEQELVDCA---SQHGCHGDTIPRGIEYIQQNGVVEERSY-PY 195
Query: 530 LGQDGYCHVDN 562
+ ++ C N
Sbjct: 196 VAREQRCRRPN 206
>UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome
shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome 20 SCAF14744, whole genome shotgun sequence -
Tetraodon nigroviridis (Green puffer)
Length = 175
Score = 78.6 bits (185), Expect = 1e-13
Identities = 37/78 (47%), Positives = 47/78 (60%)
Frame = +2
Query: 254 LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWG 433
LP DWR V PV++Q CGSCW+F VGAV+ ++ LV LS Q ++DCS
Sbjct: 59 LPARFDWRDNAVVGPVQNQQACGSCWAFSVVGAVQ-SVHAIGSSPLVELSVQQVLDCS-- 115
Query: 434 FGNNGCDGGEDFRAYEWI 487
F NNGCDGG A +W+
Sbjct: 116 FQNNGCDGGTPINALKWL 133
>UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia
tropicalis|Rep: Blo t 1 allergen - Blomia tropicalis
(Mite)
Length = 333
Score = 78.6 bits (185), Expect = 1e-13
Identities = 53/189 (28%), Positives = 91/189 (48%), Gaps = 10/189 (5%)
Frame = +2
Query: 23 YASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPH 202
Y + E +R + F++ L+++ +N + G ++N +D ++ E + SG
Sbjct: 39 YRNAEEEARREHHFKEQLKWVEEHNGID-GVEYAINEYSDMSEQEFSF----HLSGG--- 90
Query: 203 GLPFPYSKSRVEELSV-----KLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGAL 367
GL F Y K + + LP DWR +T ++ Q CGSCW+F G E +L
Sbjct: 91 GLNFTYMKMEAAKEPLINTYGSLPQNFDWRQKARLTRIRQQGSCGSCWAFAAAGVAE-SL 149
Query: 368 FLHNGGHLVRLSQQALIDCSW-----GFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYL 532
+ + LS+Q L+DC++ + NGC G A++++ R GL EE+Y Y
Sbjct: 150 YSIQKQQSIELSEQELVDCTYNRYDSSYQCNGCGSGYSTEAFKYMIRTGLVEEENY-PYN 208
Query: 533 GQDGYCHVD 559
+ +C+ D
Sbjct: 209 MRTQWCNPD 217
>UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, whole
genome shotgun sequence; n=3; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_2,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 376
Score = 78.6 bits (185), Expect = 1e-13
Identities = 41/108 (37%), Positives = 59/108 (54%)
Frame = +2
Query: 227 SRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQ 406
++ E+ + K PP DW VT V+ Q CGSCW+F V L + N L +LS+
Sbjct: 156 TKTEKATPKNPPSLDW--LKQVTEVQQQGRCGSCWAFAVQDVVISRLAIANKNKLDQLSK 213
Query: 407 QALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYC 550
LIDC+ G GCDGG A+++I ++G E+DY Y ++G C
Sbjct: 214 THLIDCADG-NTEGCDGGSVSDAFDFINKYGTVYEKDYREYDQKEGQC 260
>UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2;
Entamoeba|Rep: Cysteine proteinase ACP1 precursor -
Entamoeba histolytica
Length = 308
Score = 78.6 bits (185), Expect = 1e-13
Identities = 64/193 (33%), Positives = 93/193 (48%), Gaps = 4/193 (2%)
Frame = +2
Query: 11 HQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALR-GRRYS 187
H + +A+ E+ R +F + +++ +N AN +N AD T +E G Y
Sbjct: 25 HNKVFANRAEYLYRFAVFLDNKKFVEAN--ANT----ELNVFADMTHEEFIQTHLGMTYE 78
Query: 188 GPSPHGLPFPYSKSRVEELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
P + S V+ +VK PE DWR + P KDQ CGSCW+F T +EG
Sbjct: 79 VPE--------TTSNVKA-AVKAAPESVDWR--SIMNPAKDQGQCGSCWTFCTTAVLEGR 127
Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI-KRHGLPTEEDYGGYLGQD 541
+ + G L S+Q L+DC +NGC+GG + ++I + +GL E DY Y
Sbjct: 128 V-NKDLGKLYSFSEQQLVDCD--ASDNGCEGGHPSNSLKFIQENNGLGLESDY-PYKAVA 183
Query: 542 GYC-HVDNVTAVT 577
G C V NV VT
Sbjct: 184 GTCKKVKNVATVT 196
>UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1;
Trypanosoma cruzi|Rep: Cysteine protease, putative -
Trypanosoma cruzi
Length = 434
Score = 78.2 bits (184), Expect = 1e-13
Identities = 50/168 (29%), Positives = 80/168 (47%), Gaps = 7/168 (4%)
Frame = +2
Query: 17 RQYASDLEHEKRLNIFRQSLRYIHSNNRA-NRGFTMSVNHLADRTDDELAALRGRRYSGP 193
++YA EH KR IF+++L + + N A R + + +N +D T +E A R + P
Sbjct: 48 KRYADPEEHRKRAAIFKENLAKVRAFNGALGRSYRLGINKFSDMTKEEFNAKFNGRVAAP 107
Query: 194 SPHGLPFPYSKSRVEELSVKLPPEHDWRLFG--AVTPVKDQSVCGSCWSFGTVGAVEGAL 367
P ++ + P +W+ +TPVKDQ CGSCW+ +VE ++
Sbjct: 108 QSTQSP---QRAPYKRTKATFPEALNWQEAKNPVLTPVKDQGSCGSCWAHAATESVE-SM 163
Query: 368 FLHNGGHLVRLSQQALIDCSWGF----GNNGCDGGEDFRAYEWIKRHG 499
+ + G L+ LS Q + C G+ GC GG A+E+I G
Sbjct: 164 YAISSGKLLTLSTQQITSCVNNTRKCGGSGGCGGGTAQLAWEYIMNTG 211
>UniRef50_Q23894 Cluster: Cysteine proteinase 3; n=2; Dictyostelium
discoideum|Rep: Cysteine proteinase 3 - Dictyostelium
discoideum (Slime mold)
Length = 151
Score = 78.2 bits (184), Expect = 1e-13
Identities = 52/145 (35%), Positives = 70/145 (48%), Gaps = 4/145 (2%)
Frame = +2
Query: 38 EHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFP 217
E R F++++ Y+H+ N + +N AD +++E Y G H
Sbjct: 4 EFMPRYEEFKKNMDYVHNWNSKGSKTVLGLNQHADLSNEEYRL----NYLGTRAHIKLNG 59
Query: 218 YSKS----RVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGG 385
Y K R+ K P DWR AVTPVKDQ CGSC T G+VEG + G
Sbjct: 60 YHKRNLGLRLNRPHFKQPLNVDWREKDAVTPVKDQGQCGSC-IISTTGSVEGVTAIKT-G 117
Query: 386 HLVRLSQQALIDCSWGFGNNGCDGG 460
LV LS+Q ++ S FGN GC+GG
Sbjct: 118 KLVSLSEQNILRLSSSFGNEGCNGG 142
>UniRef50_Q23FR0 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 394
Score = 77.8 bits (183), Expect = 2e-13
Identities = 41/103 (39%), Positives = 56/103 (54%), Gaps = 3/103 (2%)
Frame = +2
Query: 269 DWR-LFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDC--SWGFG 439
DWR + + PVKDQ CGSCW+FG G +E + N G L S+Q L+DC GF
Sbjct: 188 DWRNVKNVLNPVKDQGQCGSCWTFGAAGVMESFNAITN-GVLKSFSEQQLVDCVHQAGFS 246
Query: 440 NNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYCHVDNVT 568
++GC+GG E+ + G+ TE+ Y Y G C + N T
Sbjct: 247 SDGCNGGFQSDGVEYAIKFGIVTEDKY-PYTAVGGDCQISNPT 288
>UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, whole
genome shotgun sequence; n=1; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_56,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 314
Score = 77.8 bits (183), Expect = 2e-13
Identities = 54/169 (31%), Positives = 84/169 (49%), Gaps = 2/169 (1%)
Frame = +2
Query: 20 QYASDLEHEKRLNIFRQSLRYIHS-NNRANRGFTMSVNHLADRTDDELAALRGRRYSGPS 196
++ + E R +++ +++ I N+ N D T++E AAL R S
Sbjct: 41 KFYTPAERAYRFQVYQDAMKQIQILNSEENSTTVFGETQFTDLTNEEFAALLLTRKE--S 98
Query: 197 PHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLH 376
P L ++ V + +K DW +T VK+Q CGSCW+F VGAVE L +
Sbjct: 99 PMNLD---AELYVPQGPLKASA--DW---SKITSVKNQGNCGSCWAFSAVGAVETLLTIK 150
Query: 377 NG-GHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
+ LS+Q L+DC G NNGC+GG + +W K++GL T++ Y
Sbjct: 151 GVISKDLWLSEQQLVDCDKG-TNNGCNGGFENLGIQWAKKNGLTTDKQY 198
>UniRef50_Q23FS6 Cluster: Papain family cysteine protease containing
protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 330
Score = 77.4 bits (182), Expect = 2e-13
Identities = 60/196 (30%), Positives = 88/196 (44%), Gaps = 5/196 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAA----L 169
K + ++YA + RL +F + + S+ T V D T++E AA L
Sbjct: 44 KSRFNKRYADPITESYRLQVFASNYLRVLSDVTG----TFGVTQFFDLTEEEFAATYLTL 99
Query: 170 RGRRYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVG 349
R +R + P + V +W G V+ VKDQ CGSCW+F T G
Sbjct: 100 RVQRNVNATVSSPSTPKGQYDV-----------NWVTRGKVSAVKDQGQCGSCWAFSTTG 148
Query: 350 AVEGALFLHN-GGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGG 526
+VE AL + + LS+Q L+DCS N GC GG A+E+I+ L T +Y
Sbjct: 149 SVESALIIAGYANQTIDLSEQQLVDCS--ATNYGCGGGWMDNAFEYIEESPLTTNSNY-P 205
Query: 527 YLGQDGYCHVDNVTAV 574
Y+ D C+ + V
Sbjct: 206 YVAVDQACNSTEIYGV 221
>UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,
partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED:
hypothetical protein, partial - Ornithorhynchus anatinus
Length = 224
Score = 77.0 bits (181), Expect = 3e-13
Identities = 52/163 (31%), Positives = 78/163 (47%), Gaps = 4/163 (2%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFT-MSVNHLADRTDDELAALRGR 178
++++ + Y EH +R IF Q+L ++G V +D ++DE +L
Sbjct: 51 QIRYNKSYEDQAEHARRFEIFVQNLARARKLQEEDQGTAEFGVTPFSDLSEDEFLSLYAP 110
Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVE 358
R+ P+ + +R+ ++ DWR GAVTPVK+Q CGSCW+F VG VE
Sbjct: 111 RFRMPTS----WVNQTARIPAGPLRAET-CDWRKEGAVTPVKNQGDCGSCWAFAAVGNVE 165
Query: 359 GALFLHNGGHLVRLSQQALIDCSWGFGNN-GCDGGED--FRAY 478
+L LV LS+Q W N+ G + GE FR Y
Sbjct: 166 SMWYLRASNRLVSLSEQDGGYPQWILKNSWGPEWGEKGYFRLY 208
>UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4;
Hymenostomatida|Rep: Phospholipase A1 - Tetrahymena
thermophila
Length = 320
Score = 77.0 bits (181), Expect = 3e-13
Identities = 39/100 (39%), Positives = 59/100 (59%), Gaps = 4/100 (4%)
Frame = +2
Query: 263 EHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGG--HLVRLSQQALIDC--SW 430
E DW G VTPVK+Q CGSCW+F T+GAVE AL++ G + + L++Q +DC S
Sbjct: 115 EVDWTAKGKVTPVKNQGSCGSCWAFSTIGAVESALWIAGQGEQNTLNLAEQEQVDCAKSP 174
Query: 431 GFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYC 550
+ + GC+GG +++I + + +Y Y +DG C
Sbjct: 175 KYDSEGCNGGWMVEGFKYIIDNKISQTANY-PYTAKDGKC 213
>UniRef50_Q3L7L2 Cluster: Sar s 1 allergen SMIPP-C Yv6008G08; n=2;
Sarcoptes scabiei type hominis|Rep: Sar s 1 allergen
SMIPP-C Yv6008G08 - Sarcoptes scabiei type hominis
Length = 341
Score = 77.0 bits (181), Expect = 3e-13
Identities = 42/104 (40%), Positives = 58/104 (55%), Gaps = 3/104 (2%)
Frame = +2
Query: 251 KLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALF--LHNGGHLVRLSQQALIDC 424
KLP E D R + PV++Q C + W+FG +GAVE AL H +LS Q L+DC
Sbjct: 114 KLPKEFDLRKLKVIPPVRNQKRCNASWAFGPLGAVESALIHRFHLPHRHFQLSTQELVDC 173
Query: 425 SWGFGNNGCDGGEDF-RAYEWIKRHGLPTEEDYGGYLGQDGYCH 553
+ GN GC GG D +A+ ++ G+ TE +Y Y + G CH
Sbjct: 174 A---GNQGCRGGVDVTQAFSYLMEKGVVTEFEY-PYTAKKGICH 213
>UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like
cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
Clan CA, family C1, cathepsin L-like cysteine peptidase
- Trichomonas vaginalis G3
Length = 234
Score = 77.0 bits (181), Expect = 3e-13
Identities = 45/116 (38%), Positives = 61/116 (52%), Gaps = 2/116 (1%)
Frame = +2
Query: 233 VEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQA 412
+E + +P E D+R GAV +KDQ CGSCW+FG+ A+E + FL + G L LS+Q
Sbjct: 11 LETIVGDIPDEIDYRTKGAVNEIKDQKHCGSCWAFGSCAAMESSWFLKH-GTLYSLSEQC 69
Query: 413 LIDCSWGFGNNGCDGGEDFRAYEWIK--RHGLPTEEDYGGYLGQDGYCHVDNVTAV 574
L+DC GC G A+E++K HGL ED Y + C D V
Sbjct: 70 LVDCC--HDCLGCHGCLPSLAFEYVKIFMHGLFETEDNYPYQAEHHSCKFDKTRGV 123
>UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, whole
genome shotgun sequence; n=1; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_26,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 358
Score = 77.0 bits (181), Expect = 3e-13
Identities = 53/189 (28%), Positives = 83/189 (43%), Gaps = 14/189 (7%)
Frame = +2
Query: 5 VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR- 181
+++ + Y +D R+ IF ++ + I +N + +N +D+ DEL+
Sbjct: 48 LEYGKSYDNDFTAIHRMQIFMRNKKNIEKHNHVGAKYKAKLNEFSDQDYDELSLKMFMHL 107
Query: 182 -YSGPS-PHGLPFPYSKSRVEEL-----------SVKLPPEHDWRLFGAVTPVKDQSVCG 322
+S G P +SK ++EL + DW VTP + Q CG
Sbjct: 108 DFSDDDFKFGNPHFFSKEDIKELRNHPILTQMREQARKGDSLDWTK--QVTPSRPQGTCG 165
Query: 323 SCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGL 502
SCW+F + L L L +LS+ LIDC G N GC+GG AY++I +G
Sbjct: 166 SCWAFSSSDVAISRLALKGKEDLTQLSKTHLIDCCVGDKNKGCNGGSPIGAYKFINENGA 225
Query: 503 PTEEDYGGY 529
E +Y Y
Sbjct: 226 LKENEYREY 234
>UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precursor;
n=3; Plasmodium (Laverania)|Rep: Trophozoite cysteine
proteinase precursor - Plasmodium falciparum
Length = 569
Score = 77.0 bits (181), Expect = 3e-13
Identities = 53/201 (26%), Positives = 98/201 (48%), Gaps = 20/201 (9%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRG--FTMSVNHLADRTDDELAALRG-- 175
+H + Y + E ++ IF+ + I ++N+ N+ + VN +D +++EL
Sbjct: 231 EHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTL 290
Query: 176 --------RRYSGPSPHGLP--------FPYSKSRVEELSVKLPPEHDWRLFGAVTPVKD 307
+YS P + L + K +++ K+P D+R G V KD
Sbjct: 291 LHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKD 350
Query: 308 QSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI 487
Q +CGSCW+F +VG +E ++F +++ S+Q ++DCS N GCDGG F ++ ++
Sbjct: 351 QGLCGSCWAFASVGNIE-SVFAKKNKNILSFSEQEVVDCS--KDNFGCDGGHPFYSFLYV 407
Query: 488 KRHGLPTEEDYGGYLGQDGYC 550
++ L ++Y D +C
Sbjct: 408 LQNELCLGDEYKYKAKDDMFC 428
>UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;
n=1; Nasonia vitripennis|Rep: PREDICTED: similar to
CG5367-PA - Nasonia vitripennis
Length = 362
Score = 76.6 bits (180), Expect = 4e-13
Identities = 57/200 (28%), Positives = 92/200 (46%), Gaps = 6/200 (3%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN----RANRGFTMSVNHLADRTDDELAAL 169
K++H + Y LE +R + +L I+ +N + + + NH+AD + +
Sbjct: 65 KMRHNKTYTGTLEAVRR-EAWEDNLLKIYEHNLLAAAGHHEYILRDNHIADLSTSSY--M 121
Query: 170 RGRRYSGPSPHG-LPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTV 346
R PS L + V ++P DWR G VT ++Q CGSC+++
Sbjct: 122 RELVKLVPSRRRRLDDDEMVAAVLHDPRRIPKSLDWREKGFVTKPENQRDCGSCYAYSIA 181
Query: 347 GAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKR-HGLPTEEDYG 523
G++ G +F G +V LS+Q L+DCS GN GC GG +++R GL T+ Y
Sbjct: 182 GSIAGQIF-RQTGIVVPLSEQQLVDCSTQTGNLGCSGGSLRNTLRYLERSKGLMTDATY- 239
Query: 524 GYLGQDGYCHVDNVTAVTSI 583
Y G C +V ++
Sbjct: 240 PYTAHQGVCKFQRKLSVVNV 259
>UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep:
Dvir_CG5367 - Drosophila virilis (Fruit fly)
Length = 298
Score = 76.6 bits (180), Expect = 4e-13
Identities = 54/197 (27%), Positives = 87/197 (44%), Gaps = 6/197 (3%)
Frame = +2
Query: 11 HQRQYASDLEHEKRLNIFRQSLRYIHSNNR----ANRGFTMSVNHLADRTDDELAALRGR 178
+ R YA + + + ++ ++ +N F ++ N +AD D L+G
Sbjct: 3 NNRSYARSHDEMRSYEAYEENQIIVNEHNTYYETGKSSFRLATNTMADMNTDSY--LKGY 60
Query: 179 RYSGPSPHGLPFPYSKSRV-EELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
SP V L +P DWR G +TP+ +Q CGSC++F ++
Sbjct: 61 LRLLRSPEISDSDNIADIVGSPLMNNVPESFDWRKKGFITPLYNQQSCGSCYAFSIAQSI 120
Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIK-RHGLPTEEDYGGYL 532
EG +F G +V LS+Q ++DCS GN GC GG +++ GL DY Y
Sbjct: 121 EGQVFKRT-GKIVALSEQQIVDCSVSHGNQGCIGGSLRNTLRYLQATGGLMRSLDY-KYA 178
Query: 533 GQDGYCHVDNVTAVTSI 583
+ G C + AV ++
Sbjct: 179 SKKGECQFVSELAVVNV 195
>UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158,
whole genome shotgun sequence; n=1; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_158,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 308
Score = 76.6 bits (180), Expect = 4e-13
Identities = 58/182 (31%), Positives = 85/182 (46%), Gaps = 1/182 (0%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNR-ANRGFTMSVNHLADRTDDELAALRGRRY 184
K+Q+ Y E R IF + ++ ++N + FTM N D T +E A+ RR
Sbjct: 38 KYQKFYGPS-EKIYRAKIFEERIKLFEAHNADKTQTFTMGENQFTDLTQEEFKAIYLRRR 96
Query: 185 SGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
S P L ++ V L + W +T VKDQ CG+ W+F +GAVE
Sbjct: 97 S---PQKL---VNEKYVPTNEANLTSAN-W---AGLTSVKDQGYCGAAWAFAAIGAVESV 146
Query: 365 LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDG 544
L +++ +L LS+Q LIDC N GC+ G + W + +G+ T Y Y GQ
Sbjct: 147 LRINSVTNL-DLSEQQLIDCD--LENQGCEDGNLNNSLNWAQNNGVTTSASY-PYTGQTD 202
Query: 545 YC 550
C
Sbjct: 203 GC 204
>UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease
containing protein; n=1; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 280
Score = 76.2 bits (179), Expect = 5e-13
Identities = 43/108 (39%), Positives = 57/108 (52%), Gaps = 3/108 (2%)
Frame = +2
Query: 254 LPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRL-SQQALIDCSW 430
LP + DWR G VT VK+Q CGSCW+F G E + N V L S+Q L+DCS
Sbjct: 68 LPQQFDWRNLGKVTQVKNQGNCGSCWAFTITGLFESINLIRN--KTVELYSEQELLDCSS 125
Query: 431 G--FGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYCHVDNVT 568
+ N+GC GG A+E+ K++G+ Y Y G C V+ T
Sbjct: 126 NGIYRNSGCQGGWPHLAFEYSKKNGISLSSQY-PYKGIQENCTVNQQT 172
>UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine
protease; n=1; Maconellicoccus hirsutus|Rep: Putative
cathepsin L-like cysteine protease - Maconellicoccus
hirsutus (hibiscus mealybug)
Length = 339
Score = 75.8 bits (178), Expect = 7e-13
Identities = 57/202 (28%), Positives = 90/202 (44%), Gaps = 8/202 (3%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRA-NRG---FTMSVNHLADRTDDELAAL 169
K ++ ++Y +D+E R+ IF + I +N+ ++G F +N +D E
Sbjct: 33 KTQYSKKYTTDIEDRLRMKIFIDNKYRIAQHNKLFHKGLVTFEQGINEYSDMLQSEFNEK 92
Query: 170 RGRRYSGP---SPHGLPFPYSKSRVEELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSF 337
G++ S +GLP R L PP+ DWR G V PV Q C S +++
Sbjct: 93 MGQKSSNQRNTEANGLP----SIRFTPLHNVNPPDSVDWRTKGLVGPVGKQVNCSSGYAW 148
Query: 338 GTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEED 517
+GA+EG L + +S Q +IDCS GN GC GG +Y +I + G ++
Sbjct: 149 SAIGALEGQL-ASDKKKFQGISVQNVIDCSESTGNKGCSGGNQHHSYFYIYKQGGVDDDV 207
Query: 518 YGGYLGQDGYCHVDNVTAVTSI 583
Y + C VT +
Sbjct: 208 SYPYKDAEEPCAFKKENVVTRV 229
>UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Rep:
Cathepsin W - Xenopus tropicalis (Western clawed frog)
(Silurana tropicalis)
Length = 303
Score = 74.9 bits (176), Expect = 1e-12
Identities = 54/183 (29%), Positives = 84/183 (45%), Gaps = 1/183 (0%)
Frame = +2
Query: 5 VKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTM-SVNHLADRTDDELAALRGRR 181
+++ R Y + E + RL IF ++L+ R G V +D TD+E +
Sbjct: 2 LQYNRSYKTREEFKYRLRIFSENLKEASRLQREELGTAQYGVTKFSDLTDEEFSI----- 56
Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
Y P+ + LP P + EE+ + P DWR ++ K+Q C SCW+F V +E
Sbjct: 57 YHLPT-NILPTPPILKQSEEV-LPFPTSCDWRTQNVISKAKNQRTCHSCWAFAAVANIEA 114
Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQD 541
+ G + LS+Q +IDC+ NGC GG + A+ + + G T E Y G
Sbjct: 115 QWAIL--GQTISLSEQQVIDCN--TCRNGCSGGYAWDAFMTVLQQGGLTSEKSYPYTGHV 170
Query: 542 GYC 550
C
Sbjct: 171 SNC 173
>UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2;
Nucleopolyhedrovirus|Rep: Viral cathepsin - Spodoptera
litura multicapsid nucleopolyhedrovirus (SpltMNPV)
Length = 337
Score = 74.9 bits (176), Expect = 1e-12
Identities = 54/190 (28%), Positives = 90/190 (47%), Gaps = 7/190 (3%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTD----DELAALRG 175
+H ++Y + + + F+++L +++ N + +N +D +E A L
Sbjct: 39 QHNKEYTTPDQRDAAFVNFKRNLADMNAMNNVSNQAVYGINKFSDIDKITFVNEHAGLVS 98
Query: 176 RRYSGPSPHGLPFPYSKS-RVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGA 352
+ + P+ + V S + P DWR VT VK+Q VCGSCW+F +G
Sbjct: 99 NLINSTDSNFDPYRLCEYVTVAGPSARTPESFDWRKLNKVTKVKEQGVCGSCWAFAAIGN 158
Query: 353 VEGA-LFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAY-EWIKRHGLPTEEDYGG 526
+E +H+ L+ LS+Q L+DC + GCDGG A+ E I+ G+ E DY
Sbjct: 159 IESQYAIMHDS--LIDLSEQQLLDCD--RVDQGCDGGLMHLAFQEIIRIGGVEHEIDY-P 213
Query: 527 YLGQDGYCHV 556
Y G + C +
Sbjct: 214 YQGIEYACRL 223
>UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1;
Oryza sativa (japonica cultivar-group)|Rep: Putative
uncharacterized protein - Oryza sativa subsp. japonica
(Rice)
Length = 289
Score = 74.5 bits (175), Expect = 2e-12
Identities = 54/162 (33%), Positives = 73/162 (45%), Gaps = 2/162 (1%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRGFTMSVNHLADRTDDELAALRGRRY 184
K + Y E E R IFR ++ +I + + +N AD T+DE A Y
Sbjct: 49 KFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVAT----Y 104
Query: 185 SGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGA 364
+G P P P R + + P DWR GAVT VKDQ CGSCW+F V A+EG
Sbjct: 105 TGAKP---PHPKEAPRPVD-PIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGL 160
Query: 365 LFLHNGGHLVRLSQ-QALIDCSWGFGNNGCDGGEDFRAYEWI 487
+ G L LS + L++ G D RA+E +
Sbjct: 161 TKIRT-GQLTPLSDARTLVELRNQHATGAAAGTPD-RAFELV 200
>UniRef50_Q23VA1 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 397
Score = 74.1 bits (174), Expect = 2e-12
Identities = 41/118 (34%), Positives = 60/118 (50%), Gaps = 5/118 (4%)
Frame = +2
Query: 215 PYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLV 394
P V +L V +P DWR+ G V+PVKDQ CG CW+F E + N L
Sbjct: 168 PNPNPPVNQLKV-VPQSVDWRIQGKVSPVKDQGRCGCCWAFSATALAESVNLMRN-NTLQ 225
Query: 395 RLSQQALIDCS-----WGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQDGYCH 553
+ S+Q L+DC+ + + GC GG + A +++R G+ E Y Y Q+G C+
Sbjct: 226 QYSEQELVDCTNNQYQEDYSSLGCGGGWAYNALVYMQRKGIFLESQY-PYKAQNGVCN 282
>UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:
Aca s 1 allergen - Acarus siro (Dust mite)
Length = 331
Score = 74.1 bits (174), Expect = 2e-12
Identities = 49/171 (28%), Positives = 83/171 (48%), Gaps = 5/171 (2%)
Frame = +2
Query: 23 YASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPH 202
YA+ E R F SL++I N+R + G ++VN AD +E + G +
Sbjct: 37 YATPEEESIRRANFEASLKWIQENDRKDGGAHLAVNQFADLGANESVGVNLTARRGEA-- 94
Query: 203 GLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNG 382
F + + LP DWR + P+++Q CG+CW+F ++ VE A +
Sbjct: 95 ---FFEAVTIHVTPEGNLPETFDWR--SKLGPIENQGRCGACWAFASLATVEAAFAIKYN 149
Query: 383 GHLVRLSQQALIDCS-----WGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
H +RLS+Q L++C+ + N+GC GG + A ++++ G+ E Y
Sbjct: 150 TH-IRLSKQELVECTRESDHTPYENSGCQGGYSWEALKYVQVTGVVEEAAY 199
>UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1;
Uronema marinum|Rep: Cathepsin L-like cysteine protease
- Uronema marinum
Length = 333
Score = 73.7 bits (173), Expect = 3e-12
Identities = 56/175 (32%), Positives = 84/175 (48%), Gaps = 2/175 (1%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSV-NHLADRTDDELAALRGR 178
K H Y+S E R ++ ++ +++ N AN FT+ V N A T++E A +
Sbjct: 40 KQNHNLVYSSS-EDAYRFQVYFENFQFVEEFN-ANNSFTLGVENQFAAMTNEEFKA---Q 94
Query: 179 RYSGPSPHGLPFPYSKSRVEELSVKLPPEH-DWRLFGAVTPVKDQSVCGSCWSFGTVGAV 355
S G + V E +V P +W GAV V++Q VCGSCW+F V ++
Sbjct: 95 FTSEIISEGYNYQQVDRNVYE-AVNAPSGSVNWVSKGAVQGVQNQGVCGSCWAFSAVCSL 153
Query: 356 EGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDY 520
E L+ N G L+ S+Q L+ C + GCDGG A+ + HGL + Y
Sbjct: 154 E-RLYKINTGKLLSFSEQQLVSCE--PKSYGCDGGWPEAAFAYSATHGLESSASY 205
>UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or
H-like cysteine peptidase; n=1; Trichomonas vaginalis
G3|Rep: Clan CA, family C1, cathepsin L, S or H-like
cysteine peptidase - Trichomonas vaginalis G3
Length = 473
Score = 73.7 bits (173), Expect = 3e-12
Identities = 40/103 (38%), Positives = 56/103 (54%), Gaps = 3/103 (2%)
Frame = +2
Query: 254 LPPEHDWR-LFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSW 430
LP E WR + V +DQ CGSCW+FGT ++E L L G LS ++DC+W
Sbjct: 251 LPAEFSWRDVPNVVGKPRDQVACGSCWAFGTAESLESQLALKT-GVFRELSVNQIMDCTW 309
Query: 431 GFGNNGCDGGEDFRAYEWI--KRHGLPTEEDYGGYLGQDGYCH 553
+ N+ C GGE A+ + + L E+DY Y+G GYC+
Sbjct: 310 DYNNSACGGGEAGPAFRSLINQNFKLFLEKDY-PYIGVAGYCN 351
>UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-like
cysteine peptidase; n=3; Trichomonas vaginalis|Rep: Clan
CA, family C1, cathepsin L or K-like cysteine peptidase
- Trichomonas vaginalis G3
Length = 320
Score = 73.7 bits (173), Expect = 3e-12
Identities = 58/189 (30%), Positives = 85/189 (44%), Gaps = 5/189 (2%)
Frame = +2
Query: 23 YASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPH 202
Y D E R IF + R++ N NR + +S+N + T+ E +L G + S +
Sbjct: 33 YVGD-EFHFRFGIFLANKRFVQEQNSINRNYRLSLNQFSFLTNSEYKSLLGGKVSSKNND 91
Query: 203 G--LPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLH 376
L P SK E DWR G + P+++Q CG CW+F T+ VE A +
Sbjct: 92 DSHLFSPQSKKSSEVT-------FDWRTKGIINPIRNQGQCGLCWAFSTICCVE-ARWAQ 143
Query: 377 NGGHLVRLSQQALIDCSWGFGNNGCDGG--EDFRAYEWIKRHG-LPTEEDYGGYLGQDGY 547
L++LS+Q L+DC GC GG +D A+ G T DY Y+ +
Sbjct: 144 AYNTLLQLSEQMLVDCV--DTCYGCMGGYADDAAAFVIENYEGKFMTAADY-PYIARASI 200
Query: 548 CHVDNVTAV 574
C D +V
Sbjct: 201 CKFDKTKSV 209
>UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, whole
genome shotgun sequence; n=2; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_46,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 336
Score = 73.3 bits (172), Expect = 4e-12
Identities = 50/187 (26%), Positives = 82/187 (43%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRR 181
K+++ + Y+ E + N + N+ N+ + M +N +D + +E + +
Sbjct: 58 KIEYGKSYSGQQEVFRFFNFQINRNKVNKHNSDPNKTYFMKMNQFSDLSQEEFSLIYLTH 117
Query: 182 YSGPSPHGLPFPYSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEG 361
+ + + + + K DWR +T VKDQ C CW+FG VGA E
Sbjct: 118 DNAEEVMEQNLIIDELQKTQENDKTINSVDWR---KITQVKDQGQCSGCWAFGAVGAAEA 174
Query: 362 ALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEEDYGGYLGQD 541
++ N V LS+Q LIDC + GC+GG A ++I HGL Y Q
Sbjct: 175 WFYVKN-KTTVLLSEQQLIDCD--TQSFGCNGGYQNLALKYIANHGLNDARVYPYTQKQS 231
Query: 542 GYCHVDN 562
YC ++
Sbjct: 232 AYCKYES 238
>UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18;
Plasmodium|Rep: Cysteine proteinase precursor -
Plasmodium vivax (strain Salvador I)
Length = 583
Score = 73.3 bits (172), Expect = 4e-12
Identities = 51/185 (27%), Positives = 85/185 (45%), Gaps = 14/185 (7%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYS 187
K++R Y E ++ F+ + I +N N+ + M VN +D + + + +
Sbjct: 243 KYKRSYKDINEQMEKYKNFKMNYLKIKKHNETNQMYKMKVNQFSDYSKKDFESYFRKLVP 302
Query: 188 GPS----PHGLPFP----------YSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGS 325
P + +PF + S L +P D+R G V KDQ +CGS
Sbjct: 303 IPDHLKKKYVVPFSSMNNGKGKNVVTSSSGANLLADVPEILDYREKGIVHEPKDQGLCGS 362
Query: 326 CWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLP 505
CW+F +VG VE + ++ LS+Q ++DCS N GCDGG F ++ + +G+
Sbjct: 363 CWAFASVGNVECMYAKEHNKTILTLSEQEVVDCS--KLNFGCDGGHPFYSFIYAIENGIC 420
Query: 506 TEEDY 520
+DY
Sbjct: 421 MGDDY 425
>UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-ear
cress). SAG12 protein; n=2; Dictyostelium
discoideum|Rep: Similar to Arabidopsis thaliana
(Mouse-ear cress). SAG12 protein - Dictyostelium
discoideum (Slime mold)
Length = 358
Score = 72.9 bits (171), Expect = 5e-12
Identities = 55/195 (28%), Positives = 86/195 (44%), Gaps = 14/195 (7%)
Frame = +2
Query: 8 KHQRQYASDLEHEKRLNIFRQSLRYIHSNNRANRGFT-MSVNHLADRTDDELAALR-GRR 181
KH + Y +E E R + F+++++ N + G N +D +++E + +
Sbjct: 50 KHSKIYKDSIEMENRFSNFKENMKKNIELNSMHAGKAKFESNGFSDLSEEEFSNFHLNKA 109
Query: 182 YSGPSPH------GLPFPYSK-----SRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSC 328
+ G H P P+ +E + DWR G VTPVKDQ CGSC
Sbjct: 110 FKGKPSHLRNSIKPQPTPHHSLINGYKEMENGDLNELYSIDWRKKGLVTPVKDQGQCGSC 169
Query: 329 WSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKR-HGLP 505
+ F V +E A ++ G + LS+Q +DC G C GG+ + YE+ + G+
Sbjct: 170 YIFSAVEQIETA-WIKAGNKPILLSEQQAVDCDPYDGQ--CGGGDPYTVYEYFSQVGGVS 226
Query: 506 TEEDYGGYLGQDGYC 550
T Y Y DG C
Sbjct: 227 TNAQY-PYTATDGTC 240
>UniRef50_A3EXS2 Cluster: Cathepsin L-like cysteine proteinase-like
protein; n=1; Maconellicoccus hirsutus|Rep: Cathepsin
L-like cysteine proteinase-like protein -
Maconellicoccus hirsutus (hibiscus mealybug)
Length = 253
Score = 72.9 bits (171), Expect = 5e-12
Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 2/113 (1%)
Frame = +2
Query: 251 KLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSW 430
++P E +W G VTPV +Q C W+F GA+E + V+LS+Q LI+CS
Sbjct: 32 EIPNEINWVAKGKVTPVGNQGKCNVGWAFSVTGALESEKAIKYEAAPVKLSEQNLIECSG 91
Query: 431 GFGNNGCDGGEDFRAYEWIKR-HGLPTEEDY-GGYLGQDGYCHVDNVTAVTSI 583
GFGN C GG Y+++ G+ E+ Y + + C D+ + SI
Sbjct: 92 GFGNKRCSGGNLENTYKYVNHSRGIEKEDSYRDNFRHINSRCQYDSTKSAVSI 144
>UniRef50_A2DKX2 Cluster: Clan CA, family C1, cathepsin L-like
cysteine peptidase; n=1; Trichomonas vaginalis G3|Rep:
Clan CA, family C1, cathepsin L-like cysteine peptidase
- Trichomonas vaginalis G3
Length = 317
Score = 72.9 bits (171), Expect = 5e-12
Identities = 61/183 (33%), Positives = 83/183 (45%), Gaps = 3/183 (1%)
Frame = +2
Query: 38 EHEKRLNIFRQSLRYIHSNNRANRGFTMSVNHLADRTDDELAALRGRRYSGPSPHGLPFP 217
E+ RL I+ + RYI NR R T++ N + T E AL S P H P
Sbjct: 36 EYAFRLGIYLTTDRYIKQFNRGKRSHTLAHNKFSAYTHAEYKALLN---SKPI-H--PRN 89
Query: 218 YSKSRVEELSVKLPPEHDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVR 397
KS++ V++P DWR A PV+DQ C S ++F + E ++ L
Sbjct: 90 VQKSQITTQKVQVPDTWDWRDRVAFNPVRDQMECASGFAFASCACQEVTWNIYY-NKLYL 148
Query: 398 LSQQALIDCSWGFGNNGCDGGEDFRAYEWI--KRHG-LPTEEDYGGYLGQDGYCHVDNVT 568
LS Q ++DC+ + GCDGGE RA +I + G E DY GYC D
Sbjct: 149 LSPQNMLDCA--YNEEGCDGGEADRAVGYIVTDQDGKFGLESDYPYKSESMGYCEFDPSK 206
Query: 569 AVT 577
VT
Sbjct: 207 GVT 209
>UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n=1;
Rattus norvegicus|Rep: UPI0000501FDB UniRef100 entry -
Rattus norvegicus
Length = 338
Score = 72.5 bits (170), Expect = 7e-12
Identities = 36/82 (43%), Positives = 50/82 (60%), Gaps = 1/82 (1%)
Frame = +2
Query: 308 QSVCGSCWSFGTVGAVEGALFLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWI 487
Q C SCW+F VGA+EG +F G L LS Q L+DCS GN GC GG + A++++
Sbjct: 139 QGRCNSCWAFPVVGAIEGQMFKKTG-KLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYV 197
Query: 488 KRH-GLPTEEDYGGYLGQDGYC 550
++ GL +E Y Y G++G C
Sbjct: 198 LQNGGLESEATY-PYEGKEGLC 218
>UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides
sonorensis|Rep: Cathepsin L - Culicoides sonorensis
Length = 331
Score = 72.5 bits (170), Expect = 7e-12
Identities = 61/195 (31%), Positives = 96/195 (49%), Gaps = 8/195 (4%)
Frame = +2
Query: 2 KVKHQRQYASDLEHEKRLNIFRQSLRYIHSNN-RANRG---FTMSVNHLADRTDDELAAL 169
K+++ + Y E R IF ++L + +N R G + VN +D T +E A L
Sbjct: 31 KLEYNKVYPLSTEENLRKGIFERNLADVMEHNARYLSGMETYEKGVNQFSDLTYEEFAKL 90
Query: 170 R-GRRYSGPSPHGLPFPYSKSRVEE-LSVKLPPE-HDWRLFGAVTPVKDQSVCGSCWSFG 340
G + S + +E+ L +L PE + W PVK+Q+ CGSCW+F
Sbjct: 91 YLGEKIS----FNELMTNADGWIEKPLRRQLAPESYAWDTKDV--PVKNQAQCGSCWAFA 144
Query: 341 TVGAVEGAL-FLHNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYEWIKRHGLPTEED 517
+V +VE HN + L++Q L+DC ++GC GG A ++++ +GL E+D
Sbjct: 145 SVASVEMRYKRFHNKSY--TLAEQELVDCE--TTSHGCSGGWSDLALQYMRDNGLSFEKD 200
Query: 518 YGGYLGQDGYCHVDN 562
Y Y G+D CH N
Sbjct: 201 Y-PYKGKDEKCHASN 214
Database: uniref50
Posted date: Oct 5, 2007 11:19 AM
Number of letters in database: 575,637,011
Number of sequences in database: 1,657,284
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.279 0.0580 0.190
Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 574,653,192
Number of Sequences: 1657284
Number of extensions: 12517963
Number of successful extensions: 65452
Number of sequences better than 10.0: 500
Number of HSP's better than 10.0 without gapping: 57911
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 64504
length of database: 575,637,011
effective HSP length: 96
effective length of database: 416,537,747
effective search space used: 40820699206
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
- SilkBase 1999-2023 -