BLASTX 2.2.12 [Aug-07-2005]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= e96h0134
(708 letters)
Database: uniref50
1,657,284 sequences; 575,637,011 total letters
Searching..................................................done
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C... 128 1e-28
UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n... 91 2e-17
UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ... 89 7e-17
UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep... 88 2e-16
UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j... 79 8e-14
UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n... 75 2e-12
UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M... 74 3e-12
UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h... 74 4e-12
UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae... 74 4e-12
UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ... 74 4e-12
UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei... 73 5e-12
UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;... 73 5e-12
UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e... 73 9e-12
UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n... 72 2e-11
UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio... 72 2e-11
UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ... 71 4e-11
UniRef50_Q23RT7 Cluster: Papain family cysteine protease contain... 69 9e-11
UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto... 69 9e-11
UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs... 69 1e-10
UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ... 69 1e-10
UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca... 68 3e-10
UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcopt... 68 3e-10
UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat... 68 3e-10
UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar ... 67 3e-10
UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ... 67 5e-10
UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata... 67 5e-10
UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ... 66 6e-10
UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome s... 66 6e-10
UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:... 66 8e-10
UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R... 66 8e-10
UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=... 66 8e-10
UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ... 66 1e-09
UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ... 66 1e-09
UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy... 66 1e-09
UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re... 65 1e-09
UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph... 65 1e-09
UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain... 65 1e-09
UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc... 65 2e-09
UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr... 64 2e-09
UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;... 64 4e-09
UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi... 63 6e-09
UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ... 63 6e-09
UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No... 63 7e-09
UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio... 63 7e-09
UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep: L... 62 1e-08
UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er... 62 1e-08
UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re... 62 2e-08
UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G... 61 2e-08
UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18... 61 2e-08
UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat... 61 3e-08
UniRef50_Q23H32 Cluster: Papain family cysteine protease contain... 61 3e-08
UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ... 61 3e-08
UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus... 60 4e-08
UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox... 60 4e-08
UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ... 60 5e-08
UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|... 60 5e-08
UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 p... 60 7e-08
UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ... 60 7e-08
UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine prot... 60 7e-08
UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w... 60 7e-08
UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy... 59 9e-08
UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh... 59 9e-08
UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG... 59 1e-07
UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty... 59 1e-07
UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa... 58 2e-07
UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Ac... 58 2e-07
UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease ... 58 2e-07
UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n... 58 2e-07
UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus... 58 2e-07
UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ... 58 2e-07
UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease ... 58 3e-07
UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n... 58 3e-07
UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|... 58 3e-07
UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; D... 58 3e-07
UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ... 57 4e-07
UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa... 57 4e-07
UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sa... 57 4e-07
UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1... 57 4e-07
UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckei... 57 5e-07
UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb... 57 5e-07
UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: ... 57 5e-07
UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteina... 57 5e-07
UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr... 57 5e-07
UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v... 57 5e-07
UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ... 56 6e-07
UniRef50_Q17IZ6 Cluster: Procathepsin L3, putative; n=2; Culicid... 56 6e-07
UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ... 56 9e-07
UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ... 56 9e-07
UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine p... 56 1e-06
UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome s... 56 1e-06
UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhip... 56 1e-06
UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi... 56 1e-06
UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil... 55 1e-06
UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster... 55 1e-06
UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Pl... 55 1e-06
UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ... 55 1e-06
UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru... 55 1e-06
UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endop... 55 1e-06
UniRef50_Q5NE16 Cluster: Putative cathepsin L-like protein 3; n=... 55 1e-06
UniRef50_UPI000155637A Cluster: PREDICTED: similar to ENSANGP000... 55 2e-06
UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica... 55 2e-06
UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:... 55 2e-06
UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted... 55 2e-06
UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]... 55 2e-06
UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|... 54 3e-06
UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel... 54 3e-06
UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain... 54 3e-06
UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain... 54 3e-06
UniRef50_O16454 Cluster: Temporarily assigned gene name protein ... 54 3e-06
UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3... 54 3e-06
UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;... 54 3e-06
UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal... 54 5e-06
UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheir... 54 5e-06
UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ... 54 5e-06
UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=... 53 6e-06
UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic... 53 6e-06
UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D... 53 6e-06
UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus ... 53 8e-06
UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh... 53 8e-06
UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi... 53 8e-06
UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin ... 52 1e-05
UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-... 52 1e-05
UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip... 52 1e-05
UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35... 52 1e-05
UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory... 52 1e-05
UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Re... 52 1e-05
UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ... 52 2e-05
UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease ... 52 2e-05
UniRef50_Q5C1E8 Cluster: SJCHGC04937 protein; n=1; Schistosoma j... 52 2e-05
UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryz... 51 2e-05
UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt... 51 2e-05
UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia... 51 2e-05
UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina... 51 2e-05
UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|... 51 2e-05
UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ... 51 2e-05
UniRef50_Q5YER1 Cluster: Cysteine proteinase; n=1; Bigelowiella ... 51 3e-05
UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca... 51 3e-05
UniRef50_Q23H10 Cluster: Papain family cysteine protease contain... 51 3e-05
UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain... 51 3e-05
UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dv... 51 3e-05
UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep... 50 4e-05
UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicate... 50 4e-05
UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Re... 50 4e-05
UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; ... 50 6e-05
UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl... 50 6e-05
UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n... 50 7e-05
UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:... 50 7e-05
UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t... 49 1e-04
UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ... 49 1e-04
UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ... 49 1e-04
UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C... 49 1e-04
UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr... 49 1e-04
UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestiv... 49 1e-04
UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ... 49 1e-04
UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ... 49 1e-04
UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n... 49 1e-04
UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ... 48 2e-04
UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8; Magnoliophyt... 48 2e-04
UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; ... 48 2e-04
UniRef50_Q650X9 Cluster: Putative cysteine proteinase; n=1; Oryz... 48 2e-04
UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ... 48 2e-04
UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Pl... 48 2e-04
UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryo... 48 2e-04
UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|... 48 2e-04
UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin ... 48 3e-04
UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza sativa... 48 3e-04
UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lambli... 48 3e-04
UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis... 48 3e-04
UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve... 48 3e-04
UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ... 47 4e-04
UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,... 47 4e-04
UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaste... 47 4e-04
UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ... 46 7e-04
UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum... 46 7e-04
UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadida... 46 7e-04
UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa... 46 7e-04
UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain... 46 7e-04
UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; ... 46 7e-04
UniRef50_Q22A69 Cluster: Papain family cysteine protease contain... 46 7e-04
UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; ... 46 7e-04
UniRef50_Q23894 Cluster: Cysteine proteinase 3; n=2; Dictyosteli... 46 7e-04
UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain... 46 0.001
UniRef50_Q23H15 Cluster: Papain family cysteine protease contain... 46 0.001
UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain... 46 0.001
UniRef50_Q22RC9 Cluster: Papain family cysteine protease contain... 46 0.001
UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy... 46 0.001
UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi... 46 0.001
UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R... 46 0.001
UniRef50_Q24HI2 Cluster: Papain family cysteine protease contain... 46 0.001
UniRef50_Q22W19 Cluster: Papain family cysteine protease contain... 46 0.001
UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ... 46 0.001
UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, wh... 46 0.001
UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome s... 45 0.002
UniRef50_Q23TW3 Cluster: Papain family cysteine protease contain... 45 0.002
UniRef50_Q239L8 Cluster: Papain family cysteine protease contain... 45 0.002
UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big... 45 0.002
UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain... 45 0.002
UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis... 45 0.002
UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve... 44 0.003
UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ... 44 0.003
UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome s... 44 0.004
UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2; ... 44 0.004
UniRef50_Q24E33 Cluster: Papain family cysteine protease contain... 44 0.004
UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali... 44 0.004
UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2... 44 0.004
UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lambli... 44 0.005
UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intest... 44 0.005
UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy... 44 0.005
UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ... 43 0.006
UniRef50_Q2QS15 Cluster: Papain family cysteine protease contain... 43 0.006
UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy... 43 0.006
UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-li... 43 0.006
UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precurs... 43 0.006
UniRef50_UPI0000E492F4 Cluster: PREDICTED: similar to cathepsin ... 43 0.009
UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foet... 43 0.009
UniRef50_Q23H06 Cluster: Papain family cysteine protease contain... 43 0.009
UniRef50_Q23FS6 Cluster: Papain family cysteine protease contain... 43 0.009
UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; The... 43 0.009
UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin ... 42 0.011
UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2; Arab... 42 0.011
UniRef50_Q5ZC39 Cluster: CRK1 protein-like; n=2; Oryza sativa (j... 42 0.015
UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-... 42 0.015
UniRef50_A6N590 Cluster: Extracellular cysteine protease 8; n=10... 42 0.015
UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cy... 42 0.015
UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n... 42 0.020
UniRef50_A0BNM1 Cluster: Chromosome undetermined scaffold_119, w... 42 0.020
UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin ... 41 0.026
UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryz... 41 0.026
UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen... 41 0.026
UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; The... 41 0.026
UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [C... 41 0.026
UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryz... 41 0.034
UniRef50_Q248G1 Cluster: Papain family cysteine protease contain... 41 0.034
UniRef50_Q22NW9 Cluster: Papain family cysteine protease contain... 41 0.034
UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathep... 41 0.034
UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; M... 41 0.034
UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:... 41 0.034
UniRef50_Q94715 Cluster: Putative cathepsin L2 precursor; n=4; P... 41 0.034
UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Pir... 40 0.045
UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2; cel... 40 0.045
UniRef50_Q9SIE8 Cluster: Putative cysteine proteinase; n=1; Arab... 40 0.060
UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|... 40 0.060
UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma j... 40 0.060
UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, wh... 40 0.060
UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, who... 40 0.060
UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Pl... 40 0.060
UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov... 40 0.060
UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; R... 40 0.079
UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; ... 40 0.079
UniRef50_Q23EG5 Cluster: Papain family cysteine protease contain... 40 0.079
UniRef50_Q235G6 Cluster: Papain family cysteine protease contain... 40 0.079
UniRef50_A7LFV2 Cluster: Cathepsin L protease inhibitor 1; n=1; ... 40 0.079
UniRef50_Q60EQ9 Cluster: Putative uncharacterized protein OJ1280... 39 0.10
UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi... 39 0.10
UniRef50_P21381 Cluster: Thaumatopain; n=10; Eukaryota|Rep: Thau... 39 0.10
UniRef50_UPI0000D566EC Cluster: PREDICTED: similar to CG10460-PA... 39 0.14
UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; ... 39 0.14
UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;... 39 0.14
UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain ... 39 0.14
UniRef50_Q962W3 Cluster: Cysteine protease; n=3; Giardia intesti... 38 0.18
UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto... 38 0.18
UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium... 38 0.18
UniRef50_Q93512 Cluster: Putative uncharacterized protein; n=2; ... 38 0.24
UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcop... 38 0.24
UniRef50_Q3L7L7 Cluster: Sar s 1 allergen Yv5020C01; n=1; Sarcop... 38 0.24
UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep... 38 0.24
UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain... 38 0.24
UniRef50_Q5KH32 Cluster: Putative uncharacterized protein; n=2; ... 38 0.24
UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase pr... 38 0.24
UniRef50_P56202 Cluster: Cathepsin W precursor; n=15; Eutheria|R... 38 0.24
UniRef50_Q75JH0 Cluster: Similar to Dictyostelium discoideum (Sl... 38 0.32
UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcop... 38 0.32
UniRef50_A7APS9 Cluster: Papain family cysteine protease contain... 38 0.32
UniRef50_A2SQ75 Cluster: Cysteine protease-like protein; n=1; Me... 38 0.32
UniRef50_Q0AY53 Cluster: Putative uncharacterized protein; n=1; ... 37 0.42
UniRef50_Q01E11 Cluster: Cysteine proteinase Cathepsin F; n=3; O... 37 0.42
UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Try... 37 0.42
UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:... 37 0.42
UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, wh... 37 0.42
UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome sh... 37 0.56
UniRef50_Q75ZL3 Cluster: Putative uncharacterized protein; n=1; ... 37 0.56
UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa... 37 0.56
UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba hi... 37 0.56
UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lambl... 37 0.56
UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella ve... 37 0.56
UniRef50_UPI0000D566ED Cluster: PREDICTED: similar to CTLA-2-alp... 36 0.74
UniRef50_A5HC50 Cluster: Cathepsin W; n=2; Theria|Rep: Cathepsin... 36 0.74
UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schist... 36 0.74
UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpw... 36 0.74
UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3; Theil... 36 0.74
UniRef50_Q231X3 Cluster: Papain family cysteine protease contain... 36 0.74
UniRef50_A1ZBK7 Cluster: CG10460-PA; n=1; Drosophila melanogaste... 36 0.74
UniRef50_A1YUM6 Cluster: Cysteine proteinase 4; n=1; Necator ame... 36 0.74
UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase pr... 36 0.74
UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 ... 36 0.74
UniRef50_P92132 Cluster: Cathepsin B-like CP2 precursor; n=4; Gi... 36 0.74
UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ... 36 0.98
UniRef50_Q53K53 Cluster: Cysteine protease 1, putative; n=5; Ory... 36 0.98
UniRef50_Q7QWN9 Cluster: GLP_26_50243_51811; n=2; Giardia lambli... 36 0.98
UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1... 36 0.98
UniRef50_A1Z9I0 Cluster: CG6357-PA; n=3; Drosophila melanogaster... 36 0.98
UniRef50_A0D5R4 Cluster: Chromosome undetermined scaffold_39, wh... 36 0.98
UniRef50_P84789 Cluster: Philibertain g 1; n=5; core eudicotyled... 36 0.98
UniRef50_Q3W780 Cluster: Peptidase S1, chymotrypsin:PDZ/DHR/GLGF... 36 1.3
UniRef50_A0LES1 Cluster: Peptidase C1A, papain precursor; n=2; S... 36 1.3
UniRef50_Q945E4 Cluster: Cysteine proteinase; n=1; Vasconcellea ... 36 1.3
UniRef50_Q0JP65 Cluster: Os01g0240900 protein; n=3; Oryza sativa... 36 1.3
UniRef50_Q8I5D0 Cluster: Putative uncharacterized protein; n=2; ... 36 1.3
UniRef50_Q7QQ83 Cluster: GLP_42_16392_14707; n=1; Giardia lambli... 36 1.3
UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcop... 36 1.3
UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Ca... 36 1.3
UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin ... 35 1.7
UniRef50_UPI0000DA404B Cluster: PREDICTED: similar to cathepsin ... 35 1.7
UniRef50_Q55FL7 Cluster: Putative uncharacterized protein; n=1; ... 35 1.7
UniRef50_Q4YNP3 Cluster: Putative uncharacterized protein; n=1; ... 35 1.7
UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theil... 35 1.7
UniRef50_Q3L7L6 Cluster: Sar s 1 allergen Yv5032C08; n=1; Sarcop... 35 1.7
UniRef50_Q2FLC7 Cluster: Periplasmic copper-binding precursor; n... 35 1.7
UniRef50_P12400 Cluster: Protein CTLA-2-beta; n=6; Mus musculus|... 35 1.7
UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ... 35 2.3
UniRef50_UPI0000DA2FCA Cluster: PREDICTED: similar to alpha 3 ty... 35 2.3
UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease ... 35 2.3
UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyl... 35 2.3
UniRef50_Q7KWP5 Cluster: Similar to Dictyostelium discoideum (Sl... 35 2.3
UniRef50_Q6SSE0 Cluster: Cathepsin B; n=2; Oligohymenophorea|Rep... 35 2.3
UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|... 35 2.3
UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, w... 35 2.3
UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precu... 35 2.3
UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;... 34 3.0
UniRef50_UPI0000D9BE07 Cluster: PREDICTED: hypothetical protein;... 34 3.0
UniRef50_UPI0000D9B393 Cluster: PREDICTED: hypothetical protein;... 34 3.0
UniRef50_UPI00001CC928 Cluster: PREDICTED: similar to CTLA-2-bet... 34 3.0
UniRef50_Q4SUM3 Cluster: Ephrin receptor; n=4; Tetraodon nigrovi... 34 3.0
UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=... 34 3.0
UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoid... 34 3.0
UniRef50_Q1KYN0 Cluster: Cathepsin B; n=1; Streblomastix strix|R... 34 3.0
UniRef50_A0EAG2 Cluster: Chromosome undetermined scaffold_86, wh... 34 3.0
UniRef50_A0BV23 Cluster: Chromosome undetermined scaffold_13, wh... 34 3.0
UniRef50_UPI00015B5D85 Cluster: PREDICTED: similar to cathepsin ... 34 3.9
UniRef50_Q207N1 Cluster: Cathepsin S; n=2; Clupeocephala|Rep: Ca... 34 3.9
UniRef50_A6GAX3 Cluster: Putative uncharacterized protein; n=1; ... 34 3.9
UniRef50_A4G7B4 Cluster: Putative uncharacterized protein; n=1; ... 34 3.9
UniRef50_Q4Y2Z9 Cluster: Putative uncharacterized protein; n=3; ... 34 3.9
UniRef50_Q3L7L2 Cluster: Sar s 1 allergen SMIPP-C Yv6008G08; n=2... 34 3.9
UniRef50_A0DCA5 Cluster: Chromosome undetermined scaffold_45, wh... 34 3.9
UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, wh... 34 3.9
UniRef50_Q2H7E7 Cluster: Putative uncharacterized protein; n=1; ... 34 3.9
UniRef50_Q0TZH4 Cluster: Predicted protein; n=1; Phaeosphaeria n... 34 3.9
UniRef50_Q8TKH5 Cluster: Cell surface protein; n=3; Methanosarci... 34 3.9
UniRef50_UPI0000499884 Cluster: hypothetical protein 25.t00008; ... 33 5.2
UniRef50_UPI000023E712 Cluster: hypothetical protein FG04225.1; ... 33 5.2
UniRef50_Q7QPH8 Cluster: GLP_41_8294_9919; n=2; Giardia intestin... 33 5.2
UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1; ... 33 5.2
UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H... 33 5.2
UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, wh... 33 5.2
UniRef50_UPI0000DB6CBD Cluster: PREDICTED: similar to rhinoceros... 33 6.9
UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein... 33 6.9
UniRef50_Q8IKV2 Cluster: Putative uncharacterized protein; n=1; ... 33 6.9
UniRef50_Q86GF5 Cluster: Cathepsin B; n=1; Pandalus borealis|Rep... 33 6.9
UniRef50_Q7R5X2 Cluster: GLP_81_104117_102504; n=1; Giardia lamb... 33 6.9
UniRef50_Q292E5 Cluster: GA10327-PA; n=1; Drosophila pseudoobscu... 33 6.9
UniRef50_Q0IEH6 Cluster: Putative uncharacterized protein; n=1; ... 33 6.9
UniRef50_A7SW33 Cluster: Predicted protein; n=3; Eumetazoa|Rep: ... 33 6.9
UniRef50_A0DKP6 Cluster: Chromosome undetermined scaffold_54, wh... 33 6.9
UniRef50_A3LZM2 Cluster: Predicted protein; n=1; Pichia stipitis... 33 6.9
UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Re... 33 9.1
UniRef50_Q489L3 Cluster: Putative uncharacterized protein; n=1; ... 33 9.1
UniRef50_Q9BL26 Cluster: Putative uncharacterized protein; n=4; ... 33 9.1
UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cy... 33 9.1
UniRef50_Q6CS17 Cluster: Similarities with sp|Q25662 Plasmodium ... 33 9.1
UniRef50_A4RJ84 Cluster: Putative uncharacterized protein; n=2; ... 33 9.1
>UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15)
[Contains: Cathepsin L heavy chain; Cathepsin L light
chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC
3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin
L light chain] - Sarcophaga peregrina (Flesh fly)
(Boettcherisca peregrina)
Length = 339
Score = 128 bits (309), Expect = 1e-28
Identities = 57/99 (57%), Positives = 72/99 (72%)
Frame = +2
Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVR 436
E++H IAKHNQ + G VSYKLG+NKY DMLHHEF +TMNG+N T + L + +
Sbjct: 54 ENRHKIAKHNQLFAQGKVSYKLGLNKYADMLHHEFKETMNGYNHTL---RQLMRERTGLV 110
Query: 437 GAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
GA +I PA+V +P+ VDWR+HGAV KDQG CGSCW+F
Sbjct: 111 GATYIPPAHVTVPKSVDWREHGAVTGVKDQGHCGSCWAF 149
Score = 53.6 bits (123), Expect = 5e-06
Identities = 21/31 (67%), Positives = 26/31 (83%)
Frame = +3
Query: 162 DLVKEEWSAFKLQHRLNYESEVEDNFRMKIY 254
DL+KEEW +KLQHR NY +EVE+ FRMKI+
Sbjct: 22 DLIKEEWHTYKLQHRKNYANEVEERFRMKIF 52
Score = 43.2 bits (97), Expect = 0.006
Identities = 24/42 (57%), Positives = 29/42 (69%)
Frame = +1
Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLI 690
GALEGQHFR++G LVS L + +DCS GNNG GGL+
Sbjct: 153 GALEGQHFRKAGVLVS-LSEQNLVDCS-TKYGNNGCN-GGLM 191
>UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3;
Bilateria|Rep: Cathepsin L-like cysteine proteinase -
Longidorus elongatus
Length = 358
Score = 91.1 bits (216), Expect = 2e-17
Identities = 46/98 (46%), Positives = 58/98 (59%)
Frame = +2
Query: 260 HKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRG 439
HK +I +HN +YE G S+ L +NK+ DM + EF + MNGF AK K + G
Sbjct: 71 HK-VIEQHNIEYEAGQHSFALSLNKFADMTNAEFRQRMNGFKLPAKR-KLAKSQPLKEDG 128
Query: 440 AKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
F P NV +P+ VDWRK G V KDQG CGSCW+F
Sbjct: 129 MIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCWAF 166
Score = 34.7 bits (76), Expect = 2.3
Identities = 18/37 (48%), Positives = 25/37 (67%), Gaps = 2/37 (5%)
Frame = +1
Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDC--SGAVTGNNG 669
G+LEGQH++Q+G LVS L + +DC +G G NG
Sbjct: 170 GSLEGQHYKQTGKLVS-LSEQNLVDCDVNGDDEGCNG 205
>UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin
L-like protease; n=1; Nasonia vitripennis|Rep:
PREDICTED: similar to cathepsin L-like protease -
Nasonia vitripennis
Length = 353
Score = 89.4 bits (212), Expect = 7e-17
Identities = 43/101 (42%), Positives = 65/101 (64%), Gaps = 2/101 (1%)
Frame = +2
Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVR 436
E++ IA+HNQK+++GL +YK+ +N++GDM+ E+ M+ N T K + R
Sbjct: 66 ENQRKIAEHNQKHDLGLFTYKVRINQFGDMMFEEYKNYMHAANNTITQLKRI------PR 119
Query: 437 GAKFISPANVK-LPEQVDWRKHGAVPTFKDQG-KCGSCWSF 553
G +FI P + + +PE VDWR+ GAV +DQG CGSCW+F
Sbjct: 120 GDEFIKPKSAENVPEHVDWRQRGAVTPVRDQGLTCGSCWAF 160
Score = 38.7 bits (86), Expect = 0.14
Identities = 12/27 (44%), Positives = 22/27 (81%)
Frame = +3
Query: 174 EEWSAFKLQHRLNYESEVEDNFRMKIY 254
++W+AFKL+++ NY +VE+NFR ++
Sbjct: 38 DDWAAFKLRYKKNYNGDVEENFRRSVF 64
>UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin
L - Misgurnus mizolepis (Mud loach)
Length = 337
Score = 87.8 bits (208), Expect = 2e-16
Identities = 40/91 (43%), Positives = 58/91 (63%)
Frame = +2
Query: 281 HNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPA 460
HN ++ MG+ +Y+LGMN +GDM H EF + MNG+ KH KG + F+ P
Sbjct: 62 HNLEHSMGIHTYRLGMNHFGDMNHEEFRQVMNGY----KHKTERKFKG-----SLFMEPN 112
Query: 461 NVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+++P ++DWR+ G V KDQG+CGSCW+F
Sbjct: 113 FLEVPSKLDWREKGYVTPVKDQGECGSCWAF 143
Score = 35.9 bits (79), Expect = 0.98
Identities = 24/48 (50%), Positives = 29/48 (60%)
Frame = +1
Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLIGXTAFQ 708
GA+EGQ FR+ G LVS L + +DCS GN G GGL+ AFQ
Sbjct: 147 GAMEGQMFRKQGKLVS-LSEQNLVDCS-RPEGNEGCN-GGLM-DQAFQ 190
>UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma
japonicum|Rep: SJCHGC06231 protein - Schistosoma
japonicum (Blood fluke)
Length = 372
Score = 79.4 bits (187), Expect = 8e-14
Identities = 36/92 (39%), Positives = 57/92 (61%)
Frame = +2
Query: 278 KHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISP 457
+HN+ Y+ G +YK+G+N + D +E K + G+ + K +G+ FIS
Sbjct: 95 EHNRAYQEGKATYKMGVNNFTDKTEYELRK-LRGYRSACRIAKP--------KGSTFISS 145
Query: 458 ANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ KLP++VDWR++GAV K+QG+CGSCW+F
Sbjct: 146 EHAKLPDRVDWRRNGAVTPVKNQGQCGSCWAF 177
Score = 39.5 bits (88), Expect = 0.079
Identities = 24/48 (50%), Positives = 33/48 (68%)
Frame = +1
Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLIGXTAFQ 708
GA+EGQH+R++ LV+ L + IDCS + GNNG + GGL+ AFQ
Sbjct: 181 GAIEGQHYRKTNRLVN-LSEQQLIDCSKSY-GNNGCE-GGLM-DLAFQ 224
>UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase;
n=21; Bilateria|Rep: Cathepsin L-like cysteine
proteinase - Globodera pallida
Length = 379
Score = 74.5 bits (175), Expect = 2e-12
Identities = 43/122 (35%), Positives = 62/122 (50%), Gaps = 9/122 (7%)
Frame = +2
Query: 215 RKRGRRQFPHEDIPEH--------KHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKT 370
+K GR+ + +D+ K I KHNQ Y G V++++G N D+ E+ K
Sbjct: 75 QKHGRKAYADQDVENERMLTYLSAKQFIDKHNQAYIEGKVTFRVGENHIADLPFSEY-KK 133
Query: 371 MNGFNKTAKHNKNLYMKGGSVRGAKFISPANV-KLPEQVDWRKHGAVPTFKDQGKCGSCW 547
+NG+ + N + F++P NV LPE VDWR G V K+QG CGSCW
Sbjct: 134 LNGYRRLLGDNLRR-------NASTFLAPMNVGDLPESVDWRDKGWVTEVKNQGMCGSCW 186
Query: 548 SF 553
+F
Sbjct: 187 AF 188
Score = 36.3 bits (80), Expect = 0.74
Identities = 24/48 (50%), Positives = 29/48 (60%)
Frame = +1
Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLIGXTAFQ 708
GALE QH RQ+G L+S L + IDCS GN G GG++ AFQ
Sbjct: 192 GALEAQHARQTGQLIS-LSEQNLIDCSKKY-GNMGCN-GGIM-DNAFQ 235
>UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15)
(Major excreted protein) (MEP) [Contains: Cathepsin L
heavy chain; Cathepsin L light chain]; n=19;
Eukaryota|Rep: Cathepsin L precursor (EC 3.4.22.15)
(Major excreted protein) (MEP) [Contains: Cathepsin L
heavy chain; Cathepsin L light chain] - Homo sapiens
(Human)
Length = 333
Score = 74.1 bits (174), Expect = 3e-12
Identities = 37/95 (38%), Positives = 49/95 (51%)
Frame = +2
Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
+I HNQ+Y G S+ + MN +GDM EF + MNGF +G F
Sbjct: 58 MIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPR-----------KGKVF 106
Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
P + P VDWR+ G V K+QG+CGSCW+F
Sbjct: 107 QEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAF 141
Score = 39.9 bits (89), Expect = 0.060
Identities = 25/48 (52%), Positives = 31/48 (64%)
Frame = +1
Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLIGXTAFQ 708
GALEGQ FR++G L+S L + +DCSG GN G GGL+ AFQ
Sbjct: 145 GALEGQMFRKTGRLIS-LSEQNLVDCSGP-QGNEGCN-GGLMDY-AFQ 188
>UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin
heavy chain; n=3; Amniota|Rep: PREDICTED: similar to
ferritin heavy chain - Ornithorhynchus anatinus
Length = 338
Score = 73.7 bits (173), Expect = 4e-12
Identities = 36/95 (37%), Positives = 56/95 (58%)
Frame = +2
Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
+I +HN++ G SY+L MN +GD + E + +NGF + + ++ G + A+F
Sbjct: 58 VIERHNEEMSQGKHSYRLAMNHFGDQTNEELHERLNGF----RPDLGGALRSGREQ-ARF 112
Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
S + + PE+VDWR G V K+QG CGSCW+F
Sbjct: 113 RSKTSWEGPEEVDWRTKGYVTPVKNQGLCGSCWAF 147
>UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3;
Curculionidae|Rep: Cysteine proteinase - Hypera postica
(alfalfa weevil)
Length = 324
Score = 73.7 bits (173), Expect = 4e-12
Identities = 43/96 (44%), Positives = 51/96 (53%), Gaps = 2/96 (2%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNL--YMKGGSVRGAK 445
I HN YE G VSYK G+NK+ DM EF KTM + + K Y+K G
Sbjct: 57 IEAHNALYEQGKVSYKKGINKFTDMSQEEF-KTMLTLSASRKPTLETTSYVKTG------ 109
Query: 446 FISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
V++P VDWRK G V KDQG CGSCW+F
Sbjct: 110 ------VEIPSSVDWRKEGRVTGVKDQGDCGSCWAF 139
>UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor;
n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase
precursor - Diabrotica virgifera virgifera (western corn
rootworm)
Length = 326
Score = 73.7 bits (173), Expect = 4e-12
Identities = 38/94 (40%), Positives = 53/94 (56%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
I HN KY+ GL ++KLG+ K+ D+ EF M G +++ K ++ R +
Sbjct: 54 IENHNDKYDHGLSTFKLGVTKFADLTEKEF-SDMLGISRSTKSSRP--------RVIHSL 104
Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+P LP + DWR+ GAV KDQG CGSCWSF
Sbjct: 105 TPVK-DLPSKFDWREKGAVTEVKDQGSCGSCWSF 137
>UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine
proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like
midgut cysteine proteinase - Tenebrio molitor (Yellow
mealworm)
Length = 330
Score = 73.3 bits (172), Expect = 5e-12
Identities = 42/95 (44%), Positives = 56/95 (58%), Gaps = 1/95 (1%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMN-GFNKTAKHNKNLYMKGGSVRGAKF 448
IA+HN K+E G V+Y MN++GDM EF+ +N G + KH +NL M +
Sbjct: 59 IAEHNAKFEKGEVTYSKAMNQFGDMSKEEFLAYVNRGKAQKPKHPENLRM--------PY 110
Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+S + L VDWR + AV KDQG+CGSCWSF
Sbjct: 111 VS-SKKPLAASVDWRSN-AVSEVKDQGQCGSCWSF 143
Score = 33.1 bits (72), Expect = 6.9
Identities = 13/30 (43%), Positives = 20/30 (66%)
Frame = +3
Query: 165 LVKEEWSAFKLQHRLNYESEVEDNFRMKIY 254
L +E+WS FKL H+ +Y S +E+ R I+
Sbjct: 23 LFQEQWSQFKLTHKKSYSSPIEEIRRQLIF 52
>UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;
Brugia malayi|Rep: Cahepsin L-like cysteine protease -
Brugia malayi (Filarial nematode worm)
Length = 371
Score = 73.3 bits (172), Expect = 5e-12
Identities = 39/94 (41%), Positives = 53/94 (56%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
I KHN++YE +Y+L +N DML EF K ++GF +KN + ++R
Sbjct: 85 IEKHNERYERNEETYELAINHLADMLPEEFRK-LHGFQSRKITSKNNFKN--TIR----- 136
Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
N LP+ +DWR GAV KDQG CGSCW+F
Sbjct: 137 MKINGPLPKSIDWRTSGAVTKVKDQGYCGSCWTF 170
Score = 40.3 bits (90), Expect = 0.045
Identities = 22/43 (51%), Positives = 27/43 (62%)
Frame = +1
Query: 562 LGALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLI 690
+GALEGQHF Q+G LV L + +DCS GN G GGL+
Sbjct: 173 VGALEGQHFLQTGKLVE-LSMQNLLDCSDDTYGNYGCD-GGLM 213
>UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean
endopeptidase) (Cysteine proteinase)
(Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor
(EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase)
(Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
Vignain-2] - Vigna mungo (Rice bean) (Black gram)
Length = 362
Score = 72.5 bits (170), Expect = 9e-12
Identities = 35/80 (43%), Positives = 45/80 (56%)
Frame = +2
Query: 314 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 493
YKL +NK+ DM +HEF T G +K N + +G F+ +P VDWR
Sbjct: 80 YKLKLNKFADMTNHEFRSTYAG----SKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWR 135
Query: 494 KHGAVPTFKDQGKCGSCWSF 553
K GAV KDQG+CGSCW+F
Sbjct: 136 KKGAVTDVKDQGQCGSCWAF 155
>UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4;
core eudicotyledons|Rep: Papain-like cysteine peptidase
XBCP3 - Arabidopsis thaliana (Mouse-ear cress)
Length = 437
Score = 71.7 bits (168), Expect = 2e-11
Identities = 35/81 (43%), Positives = 49/81 (60%)
Frame = +2
Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
+Y L +N + D+ HHEF + G + +A + + KG S+ G+ VK+P+ VDW
Sbjct: 73 TYSLSLNAFADLTHHEFKASRLGLSVSAP-SVIMASKGQSLGGS-------VKVPDSVDW 124
Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
RK GAV KDQG CG+CWSF
Sbjct: 125 RKKGAVTNVKDQGSCGACWSF 145
>UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2;
Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio
molitor (Yellow mealworm)
Length = 336
Score = 71.7 bits (168), Expect = 2e-11
Identities = 37/93 (39%), Positives = 51/93 (54%), Gaps = 1/93 (1%)
Frame = +2
Query: 278 KHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFIS- 454
+HN+KY GLVSY LG+N + DM E +G A +KN G ++ + +
Sbjct: 60 EHNEKYRQGLVSYTLGVNLFTDMTPEEMKAYTHGLIMPADLHKN----GIPIKTREDLGL 115
Query: 455 PANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
A+V+ P DWR G V K+QG CGSCW+F
Sbjct: 116 NASVRYPASFDWRDQGMVSPVKNQGSCGSCWAF 148
>UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor;
n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
L-like proteinase precursor - Diabrotica virgifera
virgifera (western corn rootworm)
Length = 317
Score = 70.5 bits (165), Expect = 4e-11
Identities = 34/94 (36%), Positives = 55/94 (58%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
I +HN +Y+ G VS+ LG+N++ DM EF K M K +++ ++F+
Sbjct: 47 IEQHNARYQNGEVSFYLGVNQFADMTSEEF-KAMLDSQLIHKPKRDIT--------SRFV 97
Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ + +PE +DWR+ GAV +DQ +CGSCW+F
Sbjct: 98 ADPQLTVPESIDWREKGAVNPVRDQEQCGSCWAF 131
>UniRef50_Q23RT7 Cluster: Papain family cysteine protease containing
protein; n=7; Hymenostomatida|Rep: Papain family
cysteine protease containing protein - Tetrahymena
thermophila SB210
Length = 387
Score = 69.3 bits (162), Expect = 9e-11
Identities = 36/81 (44%), Positives = 45/81 (55%), Gaps = 1/81 (1%)
Frame = +2
Query: 314 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK-LPEQVDW 490
YK G+N++ D E +T G++KT K+ N K R K NVK LP+ VDW
Sbjct: 83 YKKGINQFTDRTAEELRETTLGYSKTVKNAAN---KQNMFRNLKTSDKINVKDLPKSVDW 139
Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
R G V KDQG CGSCW+F
Sbjct: 140 RDAGVVTPVKDQGHCGSCWAF 160
>UniRef50_P25774 Cluster: Cathepsin S precursor; n=78;
Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens
(Human)
Length = 331
Score = 69.3 bits (162), Expect = 9e-11
Identities = 35/91 (38%), Positives = 48/91 (52%)
Frame = +2
Query: 281 HNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPA 460
HN ++ MG+ SY LGMN GDM E + M+ ++ +N+ K S
Sbjct: 62 HNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNITYK----------SNP 111
Query: 461 NVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
N LP+ VDWR+ G V K QG CG+CW+F
Sbjct: 112 NRILPDSVDWREKGCVTEVKYQGSCGACWAF 142
Score = 32.7 bits (71), Expect = 9.1
Identities = 22/49 (44%), Positives = 29/49 (59%)
Frame = +1
Query: 562 LGALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLIGXTAFQ 708
+GALE Q ++G LVSL ++ +DCS GN G GG + TAFQ
Sbjct: 145 VGALEAQLKLKTGKLVSL-SAQNLVDCSTEKYGNKGCN-GGFM-TTAFQ 190
>UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precursor;
n=3; Metazoa|Rep: Digestive cysteine proteinase 2
precursor - Homarus americanus (American lobster)
Length = 323
Score = 68.5 bits (160), Expect = 1e-10
Identities = 48/134 (35%), Positives = 66/134 (49%), Gaps = 12/134 (8%)
Frame = +2
Query: 188 LQAAAPSQLRKRGR--RQFPHEDIPEHKHIIAKHNQKY--------EMGLVSYKLGMNKY 337
L AA+PS +G+ RQ+ + ++ +I + NQKY E G V++ L MNK+
Sbjct: 13 LAAASPSWEHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKF 72
Query: 338 GDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPE--QVDWRKHGAVP 511
GDM EF M G N+ + V P P+ +VDWR GAV
Sbjct: 73 GDMTLEEFNAVMKG---------NIPRRSAPV---SVFYPKKETGPQATEVDWRTKGAVT 120
Query: 512 TFKDQGKCGSCWSF 553
KDQG+CGSCW+F
Sbjct: 121 PVKDQGQCGSCWAF 134
Score = 32.7 bits (71), Expect = 9.1
Identities = 14/27 (51%), Positives = 20/27 (74%)
Frame = +1
Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCS 645
G+LEGQHF ++G L+SL + +DCS
Sbjct: 138 GSLEGQHFLKTGSLISLAEQQ-LVDCS 163
>UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor;
n=35; Fasciola|Rep: Cathepsin L-like proteinase
precursor - Fasciola hepatica (Liver fluke)
Length = 326
Score = 68.5 bits (160), Expect = 1e-10
Identities = 35/97 (36%), Positives = 53/97 (54%)
Frame = +2
Query: 263 KHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGA 442
KHI +HN ++++GLV+Y LG+N++ DM EF AK+ + +
Sbjct: 49 KHI-QEHNLRHDLGLVTYTLGLNQFTDMTFEEF---------KAKYLTEMSRASDILSHG 98
Query: 443 KFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
N +P+++DWR+ G V KDQG CGSCW+F
Sbjct: 99 VPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAF 135
>UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep:
Cathepsin K - Danio rerio (Zebrafish) (Brachydanio
rerio)
Length = 333
Score = 67.7 bits (158), Expect = 3e-10
Identities = 36/95 (37%), Positives = 53/95 (55%), Gaps = 1/95 (1%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
I HN++YE+G+ +Y LGMN +GDM E + + G +Y + F+
Sbjct: 61 IEAHNKEYELGIHTYDLGMNHFGDMTLEEVAEKVMGLQMP------MYRDPANT----FV 110
Query: 452 SPANV-KLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
V KLP+ +D+RK G V + K+QG CGSCW+F
Sbjct: 111 PDDRVGKLPKSIDYRKLGYVTSVKNQGSCGSCWAF 145
>UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcoptes
scabiei type hominis|Rep: Cathepsin L-like protease -
Sarcoptes scabiei type hominis
Length = 245
Score = 67.7 bits (158), Expect = 3e-10
Identities = 36/94 (38%), Positives = 54/94 (57%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
I KHN+KYE GL +Y+LG+N++ D+ + E+ MN KH+ ++ V + +
Sbjct: 64 IRKHNEKYEAGLSTYELGVNQFTDLTNKEYNDQMNRLK--VKHD----VQSEHVFDNEDV 117
Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
S LP++VDW V KDQ +CGSCW+F
Sbjct: 118 S----DLPDEVDWTLKNVVAPIKDQKQCGSCWAF 147
>UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes
abbreviatus|Rep: Cathepsin L 2 - Diaprepes abbreviatus
(Sugarcane rootstalk borer weevil)
Length = 348
Score = 67.7 bits (158), Expect = 3e-10
Identities = 39/104 (37%), Positives = 54/104 (51%), Gaps = 10/104 (9%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGG------SV 433
I +HN+ YEMGL SY++ MN GD+ EF++ ++NL +
Sbjct: 59 INEHNKLYEMGLSSYQMAMNHLGDLTKDEFMRIYTVNMPQLPQSENLSDSEPWLDLPQDL 118
Query: 434 RG-AKFISPAN---VKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+G + P N V LP +DWR+ GAV K+Q CGSCWSF
Sbjct: 119 QGFVTYALPTNLDEVDLPTDIDWRQKGAVTPVKNQRNCGSCWSF 162
Score = 38.3 bits (85), Expect = 0.18
Identities = 14/31 (45%), Positives = 22/31 (70%)
Frame = +3
Query: 165 LVKEEWSAFKLQHRLNYESEVEDNFRMKIYL 257
LV+E+W FKL+H YESE E+ +R +++
Sbjct: 23 LVQEQWEQFKLEHGKVYESESENEYRQSVFM 53
>UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar to
human SRY (sex determining region Y)-box 30
(SOX30),transcript variant 1,; n=3; Mammalia|Rep: Testis
cDNA clone: QtsA-12228, similar to human SRY (sex
determining region Y)-box 30 (SOX30),transcript variant
1, - Macaca fascicularis (Crab eating macaque)
(Cynomolgus monkey)
Length = 433
Score = 67.3 bits (157), Expect = 3e-10
Identities = 36/95 (37%), Positives = 50/95 (52%)
Frame = +2
Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
+I HN +Y G + + MN +GDM + EF + M F N+ L +G F
Sbjct: 58 MIELHNGEYSQGKHGFAMAMNAFGDMTNEEFRQVMGCFR-----NQKLR------KGKLF 106
Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
P + LP+ VDWRK G V K+Q +CGSCW+F
Sbjct: 107 REPLFLDLPKSVDWRKKGYVTPVKNQKQCGSCWAF 141
Score = 35.5 bits (78), Expect = 1.3
Identities = 20/39 (51%), Positives = 24/39 (61%)
Frame = +1
Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPG 681
GALEGQ FR++G LVS L + +DCS GN G G
Sbjct: 145 GALEGQMFRKTGKLVS-LSEQNLVDCSHP-QGNQGCNGG 181
>UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain;
n=16; Chrysomelidae|Rep: Digestive cysteine protease
intestain - Leptinotarsa decemlineata (Colorado potato
beetle)
Length = 326
Score = 66.9 bits (156), Expect = 5e-10
Identities = 32/94 (34%), Positives = 51/94 (54%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
I +HN +Y+ G +Y LG+ ++ D+ H EF + G K NK + +
Sbjct: 54 IKEHNARYDKGEETYLLGVTRFADLTHEEFKDILKGQIK----NKP------RLNATPTV 103
Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
P ++++P+ +DW + GAV KDQ CGSCW+F
Sbjct: 104 FPEDLEVPDSIDWTEKGAVLEVKDQNPCGSCWAF 137
>UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42;
Coelomata|Rep: Cathepsin L2 precursor - Homo sapiens
(Human)
Length = 334
Score = 66.9 bits (156), Expect = 5e-10
Identities = 36/95 (37%), Positives = 50/95 (52%)
Frame = +2
Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
+I HN +Y G + + MN +GDM + EF + M F +N + G V F
Sbjct: 58 MIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCF-------RNQKFRKGKV----F 106
Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
P + LP+ VDWRK G V K+Q +CGSCW+F
Sbjct: 107 REPLFLDLPKSVDWRKKGYVTPVKNQKQCGSCWAF 141
Score = 35.9 bits (79), Expect = 0.98
Identities = 24/48 (50%), Positives = 29/48 (60%)
Frame = +1
Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLIGXTAFQ 708
GALEGQ FR++G LVS L + +DCS GN G GG + AFQ
Sbjct: 145 GALEGQMFRKTGKLVS-LSEQNLVDCS-RPQGNQGCN-GGFMA-RAFQ 188
>UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin L
preproprotein; n=1; Monodelphis domestica|Rep:
PREDICTED: similar to cathepsin L preproprotein -
Monodelphis domestica
Length = 356
Score = 66.5 bits (155), Expect = 6e-10
Identities = 34/95 (35%), Positives = 52/95 (54%)
Frame = +2
Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
+I HN+ ++ G SY +GMN++GDM EF +N + +N K R +
Sbjct: 58 LINDHNRLFKEGKKSYFMGMNQFGDMTDKEFESRLNLRIAPVRTRRNYTFK----RRIYY 113
Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+LP+ VDWR HG V ++QG+CG+CW+F
Sbjct: 114 ------RLPKSVDWRTHGYVTPIRNQGECGACWAF 142
Score = 36.7 bits (81), Expect = 0.56
Identities = 20/42 (47%), Positives = 26/42 (61%)
Frame = +1
Query: 562 LGALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGL 687
+G+LEGQ FR++G LV L + + IDCSG T G G L
Sbjct: 145 IGSLEGQLFRKTGRLVELSK-QMLIDCSGYYTCMGGSLTGAL 185
>UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome
shotgun sequence; n=3; Tetraodontidae|Rep: Chromosome 12
SCAF14996, whole genome shotgun sequence - Tetraodon
nigroviridis (Green puffer)
Length = 362
Score = 66.5 bits (155), Expect = 6e-10
Identities = 37/87 (42%), Positives = 47/87 (54%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
I HN ++ MG SY+LGMN +GDM H EF + MNG+ KH RG+ F+
Sbjct: 58 IELHNLEHSMGQHSYRLGMNHFGDMTHEEFRQIMNGY----KHKPQ-----RKFRGSLFM 108
Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGK 532
P ++ P VDWR G V KDQ K
Sbjct: 109 EPNFLEAPRAVDWRDKGYVTPVKDQLK 135
Score = 42.7 bits (96), Expect = 0.009
Identities = 30/62 (48%), Positives = 35/62 (56%)
Frame = +1
Query: 523 PREVWLMLVLSARLGALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLIGXTA 702
P VWL+L L G GQHFRQ+G LVS L + +DCS GN G GGL+ A
Sbjct: 166 PGSVWLLLGLQHHRGP-GGQHFRQTGKLVS-LSEQNLVDCS-RPEGNEGCN-GGLM-DQA 220
Query: 703 FQ 708
FQ
Sbjct: 221 FQ 222
>UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:
Cathepsin - Petromyzon marinus (Sea lamprey)
Length = 333
Score = 66.1 bits (154), Expect = 8e-10
Identities = 39/94 (41%), Positives = 49/94 (52%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
+ +HN + G VS+ LG+NKY D+ HE+ K NL G RGA F
Sbjct: 58 VLQHNLLADEGNVSFHLGINKYSDLELHEY------HEKVVGRFWNL-RNGTRRRGAPFP 110
Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ LPEQVDWR G V K+QG CGS W+F
Sbjct: 111 LRSMDNLPEQVDWRLKGYVTPVKEQGLCGSSWAF 144
>UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|Rep:
CG4847-PD, isoform D - Drosophila melanogaster (Fruit
fly)
Length = 420
Score = 66.1 bits (154), Expect = 8e-10
Identities = 28/97 (28%), Positives = 50/97 (51%)
Frame = +2
Query: 263 KHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGA 442
K+++ N + G+ ++K +N + D+ H EF+ + G ++ + K +
Sbjct: 140 KNLVEAGNAAFAQGVHTFKQAVNAFADLTHSEFLSQLTGLKRSPE------AKARAAASL 193
Query: 443 KFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
K ++ +P+ DWR+HG V K QG CGSCW+F
Sbjct: 194 KLVNLPAKPIPDAFDWREHGGVTPVKFQGTCGSCWAF 230
>UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor;
n=176; Viridiplantae|Rep: Cysteine proteinase RD21a
precursor - Arabidopsis thaliana (Mouse-ear cress)
Length = 462
Score = 66.1 bits (154), Expect = 8e-10
Identities = 40/121 (33%), Positives = 64/121 (52%)
Frame = +2
Query: 191 QAAAPSQLRKRGRRQFPHEDIPEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKT 370
+A + + L ++ RR E ++ + +HN+K +SY+LG+ ++ D+ + E+
Sbjct: 59 KAQSQNSLVEKDRR---FEIFKDNLRFVDEHNEKN----LSYRLGLTRFADLTNDEYRSK 111
Query: 371 MNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWS 550
G AK K KG ++ + +LPE +DWRK GAV KDQG CGSCW+
Sbjct: 112 YLG----AKMEK----KGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWA 163
Query: 551 F 553
F
Sbjct: 164 F 164
>UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1;
Phytophthora infestans|Rep: Cathepsin-like cysteine
protease - Phytophthora infestans (Potato late blight
fungus)
Length = 376
Score = 65.7 bits (153), Expect = 1e-09
Identities = 34/95 (35%), Positives = 50/95 (52%), Gaps = 1/95 (1%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
I HN+ YE G S+ LG+N D+ E+ + ++ + +K S F+
Sbjct: 75 IQTHNEAYERGEHSFTLGLNDLADLADAEYKQLLSYRTRDSK---------SSSASETFV 125
Query: 452 SPANVK-LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
P NV+ LP DWR+H V K+QG+CGSCW+F
Sbjct: 126 KPENVEDLPATWDWREHSTVTPVKNQGQCGSCWAF 160
>UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1;
Oryza sativa (japonica cultivar-group)|Rep: Putative
uncharacterized protein - Oryza sativa subsp. japonica
(Rice)
Length = 326
Score = 65.7 bits (153), Expect = 1e-09
Identities = 42/120 (35%), Positives = 59/120 (49%)
Frame = +2
Query: 194 AAAPSQLRKRGRRQFPHEDIPEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTM 373
+++P L +G R E ++ I N+K M SYKLG+NK+ D+ EF
Sbjct: 37 SSSPRDLADKGSR---FEVFKKNARYIHDFNRKKGM---SYKLGLNKFADLTLEEFTAKY 90
Query: 374 NGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
G N +K G+ G+ ++ P DWR+HGAV KDQG CGSCW+F
Sbjct: 91 TGANPGPITG----LKNGT--GSPPLAAVAGDAPPAWDWREHGAVTRVKDQGPCGSCWAF 144
>UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2;
Platyhelminthes|Rep: Cathepsin L-like proteinase -
Echinococcus multilocularis
Length = 338
Score = 65.7 bits (153), Expect = 1e-09
Identities = 31/91 (34%), Positives = 45/91 (49%)
Frame = +2
Query: 281 HNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPA 460
HN++Y +GL +Y +N + D+ EF + +T M V P
Sbjct: 64 HNERYYLGLETYSTALNAFADLTLEEFAEKYLTLKQTPMEGIWQDMSTQYVE-----RPT 118
Query: 461 NVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ +P+ +DWRK G V KDQG CGSCW+F
Sbjct: 119 RMLVPDSIDWRKKGLVTPIKDQGDCGSCWAF 149
Score = 34.3 bits (75), Expect = 3.0
Identities = 19/41 (46%), Positives = 25/41 (60%)
Frame = +1
Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGL 687
GALEGQ R++G L+S L + +DCS TGN G G +
Sbjct: 153 GALEGQLKRKTGKLIS-LSEQQLVDCS-TYTGNEGCNGGDM 191
>UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Rep:
Cysteine protease - Solanum lycopersicum (Tomato)
(Lycopersicon esculentum)
Length = 345
Score = 65.3 bits (152), Expect = 1e-09
Identities = 38/116 (32%), Positives = 56/116 (48%), Gaps = 5/116 (4%)
Frame = +2
Query: 221 RGRRQFPHEDIPEHKHIIAKHNQKY-----EMGLVSYKLGMNKYGDMLHHEFVKTMNGFN 385
R R + E + +I K N K+ + G +SYKLGMN++ D+ EF+ G N
Sbjct: 45 RHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLN 104
Query: 386 KTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ M S K ++ +P +DWR+ GAV K QG+CG CW+F
Sbjct: 105 IPNSYLSPSPMS--STEFKKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAF 158
>UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3;
Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber
officinale (Ginger)
Length = 475
Score = 65.3 bits (152), Expect = 1e-09
Identities = 31/104 (29%), Positives = 56/104 (53%), Gaps = 1/104 (0%)
Frame = +2
Query: 245 EDIPEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEF-VKTMNGFNKTAKHNKNLYMK 421
E E+ + +HN + G +Y+LGMN++ D+ + E+ + + ++ +
Sbjct: 74 EVFKENLRFVDEHNAAADRGEHAYRLGMNRFADLTNEEYRARFLRDLSRLGRSTS----- 128
Query: 422 GGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
G + + +V LP+ +DWR+ GAV K+QG+CGSCW+F
Sbjct: 129 -GEISNQYRLREGDV-LPDSIDWREKGAVVAVKNQGRCGSCWAF 170
>UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain;
n=9; Cucujiformia|Rep: Digestive cysteine proteinase
intestain - Leptinotarsa decemlineata (Colorado potato
beetle)
Length = 326
Score = 65.3 bits (152), Expect = 1e-09
Identities = 35/94 (37%), Positives = 49/94 (52%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
I +HN KY+ G SY LG+ + D+ H EF + KT K N V +
Sbjct: 54 IEEHNAKYDKGEESYFLGVTPFADLTHDEFKDELRRQIKT-KPN---------VEATLAV 103
Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
P +++P+ +DW + GAV K QG CGSCW+F
Sbjct: 104 FPEGLEVPDSIDWTQKGAVLDVKYQGGCGSCWAF 137
>UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9;
Onchocercidae|Rep: Cathepsin L-like precursor - Brugia
pahangi (Filarial nematode worm)
Length = 395
Score = 64.9 bits (151), Expect = 2e-09
Identities = 33/90 (36%), Positives = 50/90 (55%)
Frame = +2
Query: 284 NQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPAN 463
N+KYE GLVSY +N D+ EF+ NG + + ++G + +
Sbjct: 125 NKKYEQGLVSYTTALNDLADLTDEEFM-VRNGLRLPNQTD----LRGKRQTSEFYRYDKS 179
Query: 464 VKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+LP+QVDWR GAV ++QG+CGSC++F
Sbjct: 180 ERLPDQVDWRTKGAVTPVRNQGECGSCYAF 209
>UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase
precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin
L-like cysteine proteinase precursor - Acanthoscelides
obtectus (Bean weevil)
Length = 321
Score = 64.5 bits (150), Expect = 2e-09
Identities = 34/95 (35%), Positives = 53/95 (55%), Gaps = 1/95 (1%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
I +HN++Y G ++++G+N++GDM EF + + A + + G +
Sbjct: 54 IEEHNERYHNGEETFEMGINQFGDMTQEEFKRML------ALQKPQMPLPRGDE-----V 102
Query: 452 SPANVK-LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
S NV +P+ VDWR+ GAV K QG CGSCW+F
Sbjct: 103 SFDNVNDIPKTVDWREKGAVTEVKKQGNCGSCWAF 137
>UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;
n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
L-like proteinase" precursor - Diabrotica virgifera
virgifera (western corn rootworm)
Length = 315
Score = 63.7 bits (148), Expect = 4e-09
Identities = 37/94 (39%), Positives = 51/94 (54%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
I +HN KYE G +Y L +NK+ D EF + + A K ++ AK +
Sbjct: 54 IEEHNAKYESGEETYYLAVNKFADWSSAEFQAMLA--RQMANKPKQSFI-------AKHV 104
Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ NV+ E+VDWR AV KDQG+CGSCW+F
Sbjct: 105 ADPNVQAVEEVDWRD-SAVLGVKDQGQCGSCWAF 137
>UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18;
Schistosoma|Rep: Preprocathepsin cathepsin L -
Schistosoma japonicum (Blood fluke)
Length = 331
Score = 63.3 bits (147), Expect = 6e-09
Identities = 35/94 (37%), Positives = 50/94 (53%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
I +HN ++++GL Y +G+N++ DM E + M F K N L+ G+ +
Sbjct: 58 IQEHNLRHDLGLEGYTMGLNQFCDMEWEEVNRIM--FPKVFG-NSPLWNDDGNE-----L 109
Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
N +P DWR HGAV K QG CGSCW+F
Sbjct: 110 ELTNKPVPSTWDWRDHGAVTAVKHQGLCGSCWAF 143
>UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor;
n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1
precursor - Arabidopsis thaliana (Mouse-ear cress)
Length = 355
Score = 63.3 bits (147), Expect = 6e-09
Identities = 34/81 (41%), Positives = 41/81 (50%)
Frame = +2
Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
SY LG+N++ D+ H EF G K K A F LP+ VDW
Sbjct: 91 SYWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQ-------PSANFRYRDITDLPKSVDW 143
Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
RK GAV KDQG+CGSCW+F
Sbjct: 144 RKKGAVAPVKDQGQCGSCWAF 164
>UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel
protein - Danio rerio (Zebrafish) (Brachydanio rerio)
Length = 328
Score = 62.9 bits (146), Expect = 7e-09
Identities = 36/94 (38%), Positives = 50/94 (53%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
I HN+ +GL SY LG+N+ DM E V MNG + + N A F
Sbjct: 58 ILLHNEAAAVGLHSYTLGLNQLSDMTADE-VNDMNGLLEEDFPDVN----------ATFS 106
Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
P+ LP++V+W +HG V ++QG CGSCW+F
Sbjct: 107 PPSLQTLPQRVNWTEHGMVSPVQNQGPCGSCWAF 140
>UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3;
Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays
(Maize)
Length = 493
Score = 62.9 bits (146), Expect = 7e-09
Identities = 34/98 (34%), Positives = 52/98 (53%), Gaps = 4/98 (4%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTM----NGFNKTAKHNKNLYMKGGSVRG 439
I HN + + GL ++LG+ ++ D+ E+ + G N TA G V
Sbjct: 103 IDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSRGRNGTAV---------GVVGR 153
Query: 440 AKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+++ A +LP+ VDWR+ GAV KDQG+CG CW+F
Sbjct: 154 RRYLPLAGEQLPDAVDWRERGAVAEVKDQGQCGGCWAF 191
>UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep:
LOC443661 protein - Xenopus laevis (African clawed frog)
Length = 346
Score = 62.1 bits (144), Expect = 1e-08
Identities = 34/94 (36%), Positives = 49/94 (52%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
I HN +Y +GL +Y++GMN GDM E TM G+ + N+ R K +
Sbjct: 82 ITVHNLEYSLGLHTYEVGMNHLGDMTGEEVEATMTGYTSSDDSLANM------TRVPKKL 135
Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
A + P +DWR G V + + Q KCGSC++F
Sbjct: 136 LEA--QPPASIDWRTKGCVTSVRRQRKCGSCYAF 167
>UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra
erinaceieuropaei|Rep: Cysteine proteinase - Spirometra
erinaceieuropaei (Tapeworm)
Length = 336
Score = 62.1 bits (144), Expect = 1e-08
Identities = 36/97 (37%), Positives = 50/97 (51%), Gaps = 3/97 (3%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKT---MNGFNKTAKHNKNLYMKGGSVRGA 442
I +HNQ+Y L SY + +N + D+ EF + + G T K SV
Sbjct: 63 IIRHNQRYYQQLESYAVRLNDFSDLTPGEFAERYLCLRGIVLTKLRRKEAV----SV--- 115
Query: 443 KFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
P LP+ V+WR+ GAV + K+QG+CGSCWSF
Sbjct: 116 ----PLKENLPDSVNWRERGAVTSVKNQGQCGSCWSF 148
>UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Rep:
Cathepsin L - Stylonychia lemnae
Length = 340
Score = 61.7 bits (143), Expect = 2e-08
Identities = 37/95 (38%), Positives = 49/95 (51%), Gaps = 1/95 (1%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
I HN + + S+ LG N D H E+ K M G+ K K +Y
Sbjct: 73 INNHNSQNDG--TSFTLGPNHLADYTHDEY-KKMLGYKPRNKTGKEVY------------ 117
Query: 452 SPANVK-LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
S N+K +PE +DWR+ GAV KDQG+CGSCW+F
Sbjct: 118 STPNLKDIPESIDWREKGAVNAVKDQGQCGSCWAF 152
>UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease
Gip1p; n=4; Tetrahymena thermophila|Rep:
Granule-biosynthesis induced protease Gip1p -
Tetrahymena thermophila
Length = 345
Score = 61.3 bits (142), Expect = 2e-08
Identities = 31/83 (37%), Positives = 42/83 (50%), Gaps = 2/83 (2%)
Frame = +2
Query: 311 SYKLGMNKYGDMLHHEFVKTM--NGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQV 484
SY G+N++ DM EF + + +K A NK + + P N LP V
Sbjct: 79 SYSKGLNQFSDMTKEEFKQRVLNKKISKKASSNKGGRNLAADPAVSNLVFPTN-NLPLSV 137
Query: 485 DWRKHGAVPTFKDQGKCGSCWSF 553
DWRK G + K+QG CGSCW+F
Sbjct: 138 DWRKRGVLNPVKNQGTCGSCWTF 160
>UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18;
Magnoliophyta|Rep: Thiol protease aleurain precursor -
Arabidopsis thaliana (Mouse-ear cress)
Length = 358
Score = 61.3 bits (142), Expect = 2e-08
Identities = 39/99 (39%), Positives = 51/99 (51%)
Frame = +2
Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVR 436
E+ +I N+K GL SYKLG+N++ D+ EF +T G A N + +KG
Sbjct: 85 ENLDLIRSTNKK---GL-SYKLGVNQFADLTWQEFQRTKLG----AAQNCSATLKGSH-- 134
Query: 437 GAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
LPE DWR+ G V KDQG CGSCW+F
Sbjct: 135 -----KVTEAALPETKDWREDGIVSPVKDQGGCGSCWTF 168
>UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate
cathepsin L; n=4; Danio rerio|Rep: Novel protein similar
to vertebrate cathepsin L - Danio rerio (Zebrafish)
(Brachydanio rerio)
Length = 334
Score = 60.9 bits (141), Expect = 3e-08
Identities = 34/95 (35%), Positives = 47/95 (49%), Gaps = 1/95 (1%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVR-GAKF 448
I K+N + GL +K+ MNKYGD+ E+ + + K + K +R AK
Sbjct: 57 IWKNNNDFSFGLSMFKMAMNKYGDLTSVEYKRLLGSKIKGTGNRKGKITSAQMLRLNAKR 116
Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ N+ D+R G V KDQG CGSCWSF
Sbjct: 117 LGVTNI------DYRAKGYVTEVKDQGYCGSCWSF 145
>UniRef50_Q23H32 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 365
Score = 60.9 bits (141), Expect = 3e-08
Identities = 33/99 (33%), Positives = 52/99 (52%)
Frame = +2
Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVR 436
E+ + I +NQ E + +L +N++ D+ EF + G+N + KHN + GS +
Sbjct: 67 ENYNYIHNYNQINENSQDNIQLEVNEFADLSLQEFRELYFGYNSSKKHNN---QQNGSTK 123
Query: 437 GAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ + +PE VDWR+ P K QG CGSCW+F
Sbjct: 124 NLRQSFLLSDSVPESVDWREKLVAPVQK-QGGCGSCWAF 161
>UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor;
n=6; Chrysomelinae|Rep: Cathepsin L-like proteinase
precursor - Phaedon cochleariae (Mustard beetle)
Length = 324
Score = 60.9 bits (141), Expect = 3e-08
Identities = 33/93 (35%), Positives = 49/93 (52%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
IA+HN KYE G +Y L +NK+ D+ EF + M N+ ++ N + G +
Sbjct: 54 IAEHNVKYENGESTYYLAINKFSDITDEEF-RDMLMKNEASRPN---------LEGLEVA 103
Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWS 550
PE +DWR G V ++QG+CGSCW+
Sbjct: 104 DLTVGAAPESIDWRSKGVVLPVRNQGECGSCWA 136
>UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella
histriomuscorum|Rep: Cathepsin H - Oxytricha trifallax
(Sterkiella histriomuscorum)
Length = 366
Score = 60.5 bits (140), Expect = 4e-08
Identities = 35/94 (37%), Positives = 46/94 (48%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
I KHN G +YK G+N + DM EF + +N A+ N S K
Sbjct: 82 IIKHNSD---GTNTYKKGLNAFSDMTDEEF---FDYYNIKAEQNC-------SATNRKSF 128
Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+N +P + DWR G V K+QGKCGSCW+F
Sbjct: 129 GNSNANIPTEWDWRTFGVVSPVKNQGKCGSCWTF 162
>UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep:
Toxopain-2 - Toxoplasma gondii
Length = 422
Score = 60.5 bits (140), Expect = 4e-08
Identities = 38/113 (33%), Positives = 56/113 (49%), Gaps = 5/113 (4%)
Frame = +2
Query: 230 RQFPHEDIPEHKHIIAKHNQKY-----EMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTA 394
+ + E+ + ++ I K+N Y + G SY L MN +GD+ EF + GF K+
Sbjct: 126 KSYATEEEKQRRYAIFKNNLVYIHTHNQQGY-SYSLKMNHFGDLSRDEFRRKYLGFKKS- 183
Query: 395 KHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+NL V + ++ +LP VDWR G V KDQ CGSCW+F
Sbjct: 184 ---RNLKSHHLGV-ATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAF 232
>UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease;
n=23; Magnoliophyta|Rep: Senescence-specific cysteine
protease - Arabidopsis thaliana (Mouse-ear cress)
Length = 346
Score = 60.1 bits (139), Expect = 5e-08
Identities = 30/81 (37%), Positives = 41/81 (50%)
Frame = +2
Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
++KL +N++ D+ + EF GF + + K R S A LP VDW
Sbjct: 80 TFKLAVNQFADLTNDEFRSMYTGFKGVSALSSQSQTKMSPFRYQNVSSGA---LPVSVDW 136
Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
RK GAV K+QG CG CW+F
Sbjct: 137 RKKGAVTPIKNQGSCGCCWAF 157
>UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2;
Acanthamoeba|Rep: Cysteine proteinase - Acanthamoeba
healyi
Length = 330
Score = 60.1 bits (139), Expect = 5e-08
Identities = 32/81 (39%), Positives = 44/81 (54%)
Frame = +2
Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
SY L MN++GD+ + EF + G Y K + A +PA +P + DW
Sbjct: 69 SYFLAMNQFGDLTNAEFNRLFKGLAFD-------YSKHAKIHTAAPEAPAT-GIPSEFDW 120
Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
R+ GAV K+QG+CGSCWSF
Sbjct: 121 RQKGAVTHVKNQGQCGSCWSF 141
>UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823
protein, partial; n=1; Ornithorhynchus anatinus|Rep:
PREDICTED: similar to MGC81823 protein, partial -
Ornithorhynchus anatinus
Length = 361
Score = 59.7 bits (138), Expect = 7e-08
Identities = 28/66 (42%), Positives = 37/66 (56%)
Frame = +2
Query: 356 EFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKC 535
EF MNG+ K A+ + S + F+ P + PE +DWR HG V KDQG+C
Sbjct: 157 EFAAAMNGY-KAARGVE----ASASASASAFLGPNGTEPPEALDWRDHGYVTPVKDQGRC 211
Query: 536 GSCWSF 553
GSCW+F
Sbjct: 212 GSCWAF 217
>UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine
protease; n=11; Callosobruchus maculatus|Rep: Putative
gut cathepsin L-like cysteine protease - Callosobruchus
maculatus (Southern cowpea weevil) (Pulse bruchid)
Length = 326
Score = 59.7 bits (138), Expect = 7e-08
Identities = 32/94 (34%), Positives = 48/94 (51%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
I +HN+KYE G S+ + ++ DM H EF+ + A + +V F
Sbjct: 54 IQEHNKKYERGEESFAKKVTQFADMTHEEFLDLLKLQGVPA-------LPSNAVHFDNF- 105
Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+++ + VDWR+ GAV KDQ CGSCW+F
Sbjct: 106 EDIDMEEKDAVDWREEGAVTPVKDQANCGSCWAF 139
Score = 41.9 bits (94), Expect = 0.015
Identities = 21/44 (47%), Positives = 32/44 (72%)
Frame = +1
Query: 562 LGALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLIG 693
+GA+EGQ F+++G LVS L ++ +DC+ GNNG + GGL+G
Sbjct: 142 VGAIEGQFFKKNGTLVS-LSAQELVDCATEDYGNNGCK-GGLMG 183
>UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine
protease; n=1; Maconellicoccus hirsutus|Rep: Putative
cathepsin L-like cysteine protease - Maconellicoccus
hirsutus (hibiscus mealybug)
Length = 339
Score = 59.7 bits (138), Expect = 7e-08
Identities = 34/100 (34%), Positives = 53/100 (53%), Gaps = 2/100 (2%)
Frame = +2
Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVR 436
++K+ IA+HN+ + GLV+++ G+N+Y DML EF + M + + + +N G +
Sbjct: 55 DNKYRIAQHNKLFHKGLVTFEQGINEYSDMLQSEFNEKM---GQKSSNQRNTEANG--LP 109
Query: 437 GAKFISPANVKLPEQVDWRKHGAVPTFKDQGKC--GSCWS 550
+F NV P+ VDWR G V Q C G WS
Sbjct: 110 SIRFTPLHNVNPPDSVDWRTKGLVGPVGKQVNCSSGYAWS 149
Score = 37.9 bits (84), Expect = 0.24
Identities = 14/32 (43%), Positives = 21/32 (65%)
Frame = +3
Query: 162 DLVKEEWSAFKLQHRLNYESEVEDNFRMKIYL 257
+L EEW FK Q+ Y +++ED RMKI++
Sbjct: 23 NLFHEEWQLFKTQYSKKYTTDIEDRLRMKIFI 54
>UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184,
whole genome shotgun sequence; n=1; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_184,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 331
Score = 59.7 bits (138), Expect = 7e-08
Identities = 32/95 (33%), Positives = 48/95 (50%)
Frame = +2
Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
++ +HN K+E+G ++ LGMN+Y D+ EF + + KN+ G
Sbjct: 63 VVMEHNSKFELGQETFTLGMNQYADLTPEEFQASFLTLKTKVQDRKNVKSYSG------- 115
Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ P+ VDW K G T K+QG CGSCW+F
Sbjct: 116 -----LSFPDTVDW-KDGL--TVKNQGSCGSCWAF 142
>UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6;
Magnoliophyta|Rep: Cysteine protease Cp5 - Actinidia
deliciosa (Kiwi)
Length = 509
Score = 59.3 bits (137), Expect = 9e-08
Identities = 32/99 (32%), Positives = 52/99 (52%), Gaps = 2/99 (2%)
Frame = +2
Query: 263 KHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEF--VKTMNGFNKTAKHNKNLYMKGGSVR 436
++++ K+ ++ G + +G+NK+ DM + EF V T+K + G
Sbjct: 80 RYVMEKNGERGASG--GHLVGLNKFADMSNEEFREVYVSKVKKPTSKRMAIERRRQGKAA 137
Query: 437 GAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
AK ++ + P +DWRK+G V KDQG CGSCW+F
Sbjct: 138 AAKAVAACDG--PTSLDWRKYGIVTGVKDQGDCGSCWAF 174
>UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole
genome shotgun sequence; n=2; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_21,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 349
Score = 59.3 bits (137), Expect = 9e-08
Identities = 32/95 (33%), Positives = 49/95 (51%), Gaps = 1/95 (1%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKN-LYMKGGSVRGAKF 448
I +H Q+ E GL +++LG+N + D+ EF + T + N +Y + G
Sbjct: 70 IQEHQQRVEAGLETFELGLNDFADLSVEEFEAKYLKYRSTPREQTNQVYRRTGK------ 123
Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
++P +VD RK G V K+QG CGSCW+F
Sbjct: 124 ------QVPIEVDLRKDGVVSEVKNQGSCGSCWAF 152
>UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep:
MGC107932 protein - Xenopus tropicalis (Western clawed
frog) (Silurana tropicalis)
Length = 333
Score = 58.8 bits (136), Expect = 1e-07
Identities = 33/95 (34%), Positives = 53/95 (55%), Gaps = 1/95 (1%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
+ KHNQ + GL SY++ MN++ D+ +E + + K+L V+ A+
Sbjct: 58 VQKHNQLADQGLKSYRMAMNQFADLTDNE----RSSKSCLLPREKSL----NPVK-AESY 108
Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGK-CGSCWSF 553
S ++ +P++VDWRK V K+QG CGSCW+F
Sbjct: 109 SYTSITIPKEVDWRKSNCVTPVKNQGTFCGSCWAF 143
>UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core
eudicotyledons|Rep: Cysteine proteinase -
Mesembryanthemum crystallinum (Common ice plant)
Length = 367
Score = 58.8 bits (136), Expect = 1e-07
Identities = 35/103 (33%), Positives = 56/103 (54%), Gaps = 4/103 (3%)
Frame = +2
Query: 257 EHKHIIAKHNQKY--EMGLVS--YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKG 424
+++ + K N KY E+ + YKL +N++GD+ EF +T +K + +N G
Sbjct: 61 QNRFHVFKENVKYINEVNKMDKPYKLRLNQFGDLTPSEFARTYAN-SKIIEGTRN--ESG 117
Query: 425 GSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
G + NV++P +DWR GAV K+QG+CG CW+F
Sbjct: 118 GFMY-------ENVEVPRSIDWRVKGAVTPVKNQGRCGGCWAF 153
>UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza
sativa|Rep: Os09g0497500 protein - Oryza sativa subsp.
japonica (Rice)
Length = 349
Score = 58.4 bits (135), Expect = 2e-07
Identities = 33/82 (40%), Positives = 43/82 (52%), Gaps = 2/82 (2%)
Frame = +2
Query: 314 YKLGMNKYGDMLHHEFVKTMNGFNK--TAKHNKNLYMKGGSVRGAKFISPANVKLPEQVD 487
YKL NK+ D+ + EF M GF T N ++ G ++ LP+ VD
Sbjct: 72 YKLADNKFADLTNEEFRAKMLGFRPHVTIPQISNTCSADIAMPGES----SDDILPKSVD 127
Query: 488 WRKHGAVPTFKDQGKCGSCWSF 553
WRK GAV K+QG CGSCW+F
Sbjct: 128 WRKKGAVVEVKNQGDCGSCWAF 149
>UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep:
Actinidin Act3a - Actinidia eriantha
Length = 380
Score = 58.4 bits (135), Expect = 2e-07
Identities = 39/112 (34%), Positives = 53/112 (47%), Gaps = 2/112 (1%)
Frame = +2
Query: 224 GRRQFPHEDIPEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHN 403
G R+ E E+ I +HN SY +G+N++ D+ E+ T GF + K
Sbjct: 57 GEREMRIEIFKENLRFIDEHNADPNR---SYTVGLNQFADLTDEEYRSTYLGFKSSLKSK 113
Query: 404 -KNLYM-KGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
N YM + G V LP+ VDWR GAV K+QG C SCW+F
Sbjct: 114 VSNRYMPQVGEV------------LPDYVDWRTTGAVVDVKNQGLCSSCWAF 153
>UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease
containing protein; n=1; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 280
Score = 58.0 bits (134), Expect = 2e-07
Identities = 33/96 (34%), Positives = 51/96 (53%), Gaps = 4/96 (4%)
Frame = +2
Query: 278 KHNQKYEMGLVSYKLGMNKYGDMLHHEFVK-TMNG--FNKTAKHNKNLYMKGGSVRGAKF 448
+HNQ+ SY++GMN++ D+ EF ++N FN ++ +N+ +
Sbjct: 3 QHNQEKNN---SYQIGMNQFSDLTIEEFQSISLNQQLFNSESRKLENIKNENQQADFYLQ 59
Query: 449 ISPANVK-LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ N LP+Q DWR G V K+QG CGSCW+F
Sbjct: 60 LLKTNASSLPQQFDWRNLGKVTQVKNQGNCGSCWAF 95
>UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2;
Taeniidae|Rep: Cathepsin L-like cysteine proteinase -
Taenia solium (Pork tapeworm)
Length = 339
Score = 58.0 bits (134), Expect = 2e-07
Identities = 32/94 (34%), Positives = 48/94 (51%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
I N+++ GL SY G+N++ D+ EF + G ++ + G R K +
Sbjct: 65 IKGQNRRFNAGLESYSTGLNQFADLESSEFSERFLGTRPESR------VAGRRGRIWKAL 118
Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ A LP+ VDWR V K+QG CGSCW+F
Sbjct: 119 ASA-AGLPDTVDWRDKNLVTEVKNQGNCGSCWAF 151
>UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes
vastus|Rep: Cathepsin L - Aphrocallistes vastus
Length = 329
Score = 58.0 bits (134), Expect = 2e-07
Identities = 30/81 (37%), Positives = 45/81 (55%)
Frame = +2
Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
SYKL N++ D+ + E+ + G++ A+ ++ + G V K + LP VDW
Sbjct: 68 SYKLAANQFADLTNLEYRQIYLGYDNEARLSRK---REGKVFQRKM---KDEDLPTTVDW 121
Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
R G V K+QG+CGSCWSF
Sbjct: 122 RSKGVVTPVKNQGQCGSCWSF 142
Score = 33.5 bits (73), Expect = 5.2
Identities = 20/42 (47%), Positives = 29/42 (69%)
Frame = +1
Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLI 690
G+LEGQ+ +SG LVS + +DCS ++ GN+G Q GGL+
Sbjct: 146 GSLEGQYAIKSGKLVS-FSEQELVDCSTSL-GNHGCQ-GGLM 184
>UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46;
Eukaryota|Rep: Cysteine proteinase 1 precursor - Zea
mays (Maize)
Length = 371
Score = 58.0 bits (134), Expect = 2e-07
Identities = 32/77 (41%), Positives = 42/77 (54%)
Frame = +2
Query: 323 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHG 502
G+ K+ D+ EF +T G K+ + L G S A + P + LP+ DWR HG
Sbjct: 92 GVTKFSDLTPAEFRRTYLGLRKSRR--ALLRELGESAHEAPVL-PTD-GLPDDFDWRDHG 147
Query: 503 AVPTFKDQGKCGSCWSF 553
AV K+QG CGSCWSF
Sbjct: 148 AVGPVKNQGSCGSCWSF 164
>UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease
containing protein; n=1; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 336
Score = 57.6 bits (133), Expect = 3e-07
Identities = 33/108 (30%), Positives = 50/108 (46%)
Frame = +2
Query: 230 RQFPHEDIPEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKN 409
+QF + E I HN E +YKL N++ DM EF + + +N
Sbjct: 49 QQFRQQIFFETHERIQNHNSNPE---ATYKLAHNQFSDMPQEEFASRVL-MKSSQLIPRN 104
Query: 410 LYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ + + +V+LP DWR +G + KDQG+CGSCW+F
Sbjct: 105 AVQAQNNNSTTQQHTAQDVQLPASFDWRDYGILSDVKDQGQCGSCWAF 152
>UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n=1;
Xenopus tropicalis|Rep: UPI00006A2275 UniRef100 entry -
Xenopus tropicalis
Length = 272
Score = 57.6 bits (133), Expect = 3e-07
Identities = 41/131 (31%), Positives = 57/131 (43%), Gaps = 4/131 (3%)
Frame = +2
Query: 206 SQLRKRGRRQFPHEDIPEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFN 385
SQ +R RR E + I+ HN +Y +GL +Y++GMN GDM E TM G+
Sbjct: 3 SQEEERARRTIWEETLK----FISVHNLEYSLGLHTYEVGMNHLGDMTGEEVAATMTGYT 58
Query: 386 KTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGK-CGSCWSFQHD 562
+ N+ + A P +DWR V +DQG C SC++F
Sbjct: 59 GSGDSLANMSHVPKEILEA--------LAPPSIDWRTQNCVTPVRDQGSFCRSCYAFSAV 110
Query: 563 WEL---WKDST 586
L WK T
Sbjct: 111 GALECQWKKKT 121
>UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum
aestivum|Rep: Thiol protease - Triticum aestivum (Wheat)
Length = 374
Score = 57.6 bits (133), Expect = 3e-07
Identities = 27/85 (31%), Positives = 41/85 (48%), Gaps = 1/85 (1%)
Frame = +2
Query: 302 GLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQ 481
G +SY LG+N++ D+ H EF+ T + + G V PA +P
Sbjct: 88 GRLSYTLGVNQFADLTHEEFLATHTSRRVVPSEEMVITTRAGVVVEGANCQPAPNAVPRS 147
Query: 482 VDWRKHGAVPTFKDQGK-CGSCWSF 553
++W V K+QGK CG+CW+F
Sbjct: 148 INWVNQSKVTPVKNQGKVCGACWAF 172
>UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2;
Dictyostelium discoideum|Rep: Cysteine proteinase 2
precursor - Dictyostelium discoideum (Slime mold)
Length = 376
Score = 57.6 bits (133), Expect = 3e-07
Identities = 31/78 (39%), Positives = 42/78 (53%)
Frame = +2
Query: 320 LGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKH 499
LG+N + D+ + E+ KT G A H+ N Y G V + + P+ +DWR
Sbjct: 79 LGLNNFADITNEEYRKTYLGTRVNA-HSYNGY-DGREVLNVEDLQTN----PKSIDWRTK 132
Query: 500 GAVPTFKDQGKCGSCWSF 553
AV KDQG+CGSCWSF
Sbjct: 133 NAVTPIKDQGQCGSCWSF 150
>UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease
containing protein; n=1; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 344
Score = 57.2 bits (132), Expect = 4e-07
Identities = 31/84 (36%), Positives = 41/84 (48%), Gaps = 3/84 (3%)
Frame = +2
Query: 311 SYKLGMNKYGDMLHHEF-VKTMNGFNKTAKHNKNLYMKGGSVRGAK-FISPA-NVKLPEQ 481
SY LG N DM H EF + +N +K +K G S + ++ P K
Sbjct: 79 SYTLGHNHLSDMTHEEFSLYQLNPARTASKSSKGGNNSGNSSGSSNPYVDPPITTKNAPP 138
Query: 482 VDWRKHGAVPTFKDQGKCGSCWSF 553
+DWR A+ K QGKCGSCW+F
Sbjct: 139 MDWRNASAITPVKQQGKCGSCWTF 162
>UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza
sativa|Rep: Cysteine proteinase-like - Oryza sativa
subsp. japonica (Rice)
Length = 360
Score = 57.2 bits (132), Expect = 4e-07
Identities = 26/81 (32%), Positives = 42/81 (51%)
Frame = +2
Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
+Y LG+N++ D+ EF +T G++ + + G + + +P+ VDW
Sbjct: 85 TYTLGLNQFSDLTDDEFAQTHLGYSWAPPPPSHRHGHRAE-NGTAAAAADDTDVPDSVDW 143
Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
R GAV K+Q CGSCW+F
Sbjct: 144 RARGAVTEVKNQRSCGSCWAF 164
>UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza
sativa|Rep: OSJNBb0085F13.15 protein - Oryza sativa
subsp. japonica (Rice)
Length = 383
Score = 57.2 bits (132), Expect = 4e-07
Identities = 32/95 (33%), Positives = 45/95 (47%), Gaps = 11/95 (11%)
Frame = +2
Query: 302 GLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLY-----------MKGGSVRGAKF 448
G +++KLG + D+ H EF+ T G + + + G V GA
Sbjct: 94 GSLTFKLGETPFTDLTHEEFLATYTGDVRLPPERRGMQDDSDEEDAVITTSAGYVAGAG- 152
Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
V +PE VDWRK GAV K QG+C +CW+F
Sbjct: 153 AGRRTVAVPESVDWRKEGAVTPAKHQGQCAACWAF 187
>UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1;
Dictyostelium discoideum AX4|Rep: Counting factor
associated protein - Dictyostelium discoideum AX4
Length = 531
Score = 57.2 bits (132), Expect = 4e-07
Identities = 39/99 (39%), Positives = 47/99 (47%), Gaps = 2/99 (2%)
Frame = +2
Query: 263 KHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGA 442
+ IIA HN K SYKLGMN Y D+ + EF + K A+ SV GA
Sbjct: 253 RKIIATHNAKES----SYKLGMNHYADLSNKEFNTLVKP--KVARP---------SVTGA 297
Query: 443 KFISPANV--KLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ +P VDWR V KDQG CGSCW+F
Sbjct: 298 DSVHDDESLRSIPSTVDWRNQNCVTPVKDQGICGSCWTF 336
>UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium
(Vinckeia)|Rep: Berghepain-2 - Plasmodium yoelii yoelii
Length = 472
Score = 56.8 bits (131), Expect = 5e-07
Identities = 33/97 (34%), Positives = 47/97 (48%), Gaps = 3/97 (3%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKG---GSVRGA 442
I KHN++ + Y G+N + DM H EF M N K N + ++ ++
Sbjct: 187 IEKHNKENHL----YTKGINAFSDMRHEEF--KMKYLNNKLKENHQIDLRHLIPYTIAIN 240
Query: 443 KFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
K+ SP + DWR H A+ KDQ KC SCW+F
Sbjct: 241 KYKSPTDQINYTSFDWRDHNAIIDIKDQQKCASCWAF 277
>UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gambiae
str. PEST|Rep: ENSANGP00000018713 - Anopheles gambiae
str. PEST
Length = 559
Score = 56.8 bits (131), Expect = 5e-07
Identities = 32/88 (36%), Positives = 46/88 (52%)
Frame = +2
Query: 290 KYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK 469
K+E G Y G+ K+ DM E+ + G KH++ ++ G V + ++
Sbjct: 285 KFERGTAKY--GVTKFADMTVAEY-RAHTGL-VVPKHDRANHV-GNRVASEEDVAGVG-D 338
Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
LP DWR HGAV K+QG CGSCW+F
Sbjct: 339 LPRSFDWRDHGAVTEVKNQGSCGSCWAF 366
>UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep:
Vivapain-4 - Plasmodium vivax
Length = 484
Score = 56.8 bits (131), Expect = 5e-07
Identities = 36/97 (37%), Positives = 49/97 (50%), Gaps = 3/97 (3%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNG--FNKTAKHNKNLYMKGGSVRGAK 445
I HN K + YK G N+Y D+ EF KTM F+ K + Y+ K
Sbjct: 197 INSHNSKAN---ILYKKGTNQYSDISFEEFRKTMLTLRFDLKKKLANSPYVSNYDDVLKK 253
Query: 446 FISPANVKLP-EQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ PA+ + E+ DWR+H AV K+Q CGSCW+F
Sbjct: 254 Y-KPADAVVDNEKYDWREHNAVSEIKNQNLCGSCWAF 289
>UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteinase
A; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like
tick cysteine proteinase A - Haemaphysalis longicornis
(Bush tick)
Length = 312
Score = 56.8 bits (131), Expect = 5e-07
Identities = 40/126 (31%), Positives = 59/126 (46%), Gaps = 4/126 (3%)
Frame = +2
Query: 188 LQAAAPSQLRKRGRRQFPHEDIPEHKHIIAKHNQKYEMGLVSYKLG-MNKYGDMLHHEFV 364
LQ AA S ++ RR + E+ ++AKHN KY GL ++G GD +V
Sbjct: 4 LQIAAQSGVQFPRRRTIEVKIFTENTLLVAKHNAKYAKGLGVLQVGPWTSLGDFAA-AWV 62
Query: 365 KTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK---LPEQVDWRKHGAVPTFKDQGKC 535
+ ++ A +N G AN+ LP VDW + G+ K+QG+C
Sbjct: 63 RQNGQWDTAASRTRN--------SGPHLFHQANLNDSSLPTTVDWAQEGSRAPVKNQGQC 114
Query: 536 GSCWSF 553
GSCW+F
Sbjct: 115 GSCWAF 120
>UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase
precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine
proteinase precursor - Heterodera glycines (Soybean cyst
nematode worm)
Length = 353
Score = 56.8 bits (131), Expect = 5e-07
Identities = 36/98 (36%), Positives = 49/98 (50%), Gaps = 1/98 (1%)
Frame = +2
Query: 263 KHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGA 442
K I HN +E G VS+K+ N ++H T +N+ + L M+ R
Sbjct: 76 KKFIDAHNLAFEKGEVSFKVAPNH---LMHF----TPAQYNRI----RGLQMRSNRQRHN 124
Query: 443 KFISPANVK-LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
N LPE++DWR+ GAV KDQG CGSCW+F
Sbjct: 125 MATLAGNSSTLPEKLDWREKGAVTEVKDQGDCGSCWAF 162
>UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 392
Score = 56.8 bits (131), Expect = 5e-07
Identities = 37/111 (33%), Positives = 55/111 (49%), Gaps = 7/111 (6%)
Frame = +2
Query: 242 HEDIPEH---KHIIAKHNQKYEMGL----VSYKLGMNKYGDMLHHEFVKTMNGFNKTAKH 400
+ED EH KHI +HN +Y + + YKL N + D+ EF + +K
Sbjct: 99 YEDDSEHRRRKHIF-RHNVRYIRSMNRRSLPYKLEPNHFADLTDDEFKSYKGALDDESKD 157
Query: 401 NKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
N + + + S ++P+Q+DWR +GAV K QG CGSCW+F
Sbjct: 158 VMNDH--DDVIDDDR--SKRMFEVPDQLDWRNYGAVNPAKGQGTCGSCWAF 204
>UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F
like protease; n=1; Nasonia vitripennis|Rep: PREDICTED:
similar to cathepsin F like protease - Nasonia
vitripennis
Length = 1036
Score = 56.4 bits (130), Expect = 6e-07
Identities = 30/89 (33%), Positives = 43/89 (48%)
Frame = +2
Query: 287 QKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANV 466
Q+ EMG Y G+ ++ D+ EF G T K ++ M ++ ++
Sbjct: 766 QRNEMGTGRY--GVTQFTDLTKAEFKARHLGLKPTLKSENDIPMPMATI--------PDI 815
Query: 467 KLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+LP DWR H V KDQG CGSCW+F
Sbjct: 816 ELPSDYDWRHHNVVTPVKDQGSCGSCWAF 844
>UniRef50_Q17IZ6 Cluster: Procathepsin L3, putative; n=2;
Culicidae|Rep: Procathepsin L3, putative - Aedes aegypti
(Yellowfever mosquito)
Length = 313
Score = 56.4 bits (130), Expect = 6e-07
Identities = 27/100 (27%), Positives = 48/100 (48%), Gaps = 6/100 (6%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK------NLYMKGGSV 433
I +HN YE G ++++G+N+ DM ++K M H K + ++ +
Sbjct: 62 IEEHNANYEQGKSTFQMGVNELADMDKSSYLKKMVRMTDAIDHRKLDVDFNDEMLQATNA 121
Query: 434 RGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
G +F+ +P+ +DWR G +Q CGSC++F
Sbjct: 122 FGEEFVQATQNSMPDSLDWRDKGFTTMAVNQKTCGSCYAF 161
>UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin K
precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2);
n=2; Tribolium castaneum|Rep: PREDICTED: similar to
Cathepsin K precursor (Cathepsin O) (Cathepsin X)
(Cathepsin O2) - Tribolium castaneum
Length = 332
Score = 56.0 bits (129), Expect = 9e-07
Identities = 26/95 (27%), Positives = 48/95 (50%)
Frame = +2
Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
I+ +HN+++ G +Y++G+NK+ D E + + G + + L +
Sbjct: 57 IVEEHNERFRNGSETYEMGVNKFSDFTDEE-LSNLTGLQVPLEFEQPL-----NETEDPL 110
Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ + +DWR+ G V K+QG+CGSCW+F
Sbjct: 111 LPSLGRGISASLDWRQRGGVTPVKNQGQCGSCWAF 145
>UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease
containing protein; n=2; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 332
Score = 56.0 bits (129), Expect = 9e-07
Identities = 34/91 (37%), Positives = 48/91 (52%), Gaps = 1/91 (1%)
Frame = +2
Query: 284 NQKYEMGLVSYKLGMNKYGDMLHHEF-VKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPA 460
N + G +S G+NK+ + EF K +N + A MK S+ ++
Sbjct: 74 NMNSDNGFIS---GINKFSHLTKEEFKAKYLNRPQRPASE-----MKTNSILSSQ--QKT 123
Query: 461 NVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ KLPE VDWRK GAV +DQG CGSC++F
Sbjct: 124 DEKLPESVDWRKLGAVSPVRDQGNCGSCYAF 154
>UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine
protease; n=1; Strongylocentrotus purpuratus|Rep:
PREDICTED: similar to cysteine protease -
Strongylocentrotus purpuratus
Length = 494
Score = 55.6 bits (128), Expect = 1e-06
Identities = 33/88 (37%), Positives = 41/88 (46%)
Frame = +2
Query: 290 KYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK 469
++E G Y G K+ DM EF K +G K K + G V
Sbjct: 195 QFEQGTAKY--GPTKFADMTEAEFRKLQSGPLKKTGIKKQAAIPQGPV------------ 240
Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
PE+ DWR HGAV K+QG CGSCW+F
Sbjct: 241 -PEEYDWRTHGAVTPVKNQGMCGSCWAF 267
>UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome
shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome 21 SCAF14577, whole genome shotgun sequence -
Tetraodon nigroviridis (Green puffer)
Length = 478
Score = 55.6 bits (128), Expect = 1e-06
Identities = 40/118 (33%), Positives = 55/118 (46%), Gaps = 5/118 (4%)
Frame = +2
Query: 215 RKRGRRQFPHEDIPEHKHIIAKHNQKY-----EMGLVSYKLGMNKYGDMLHHEFVKTMNG 379
+++ +RQ+ + E + HN +Y GL SY LG+N D E TM G
Sbjct: 125 KEKFQRQYEDDKEHELRQQAFIHNLRYVHSKNRAGL-SYTLGLNSLSDRTMSELA-TMRG 182
Query: 380 FNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ N L F +V++PE +DWR +GAV KDQ CGSCWSF
Sbjct: 183 RKQRKTTNAGLPFP--------FKLYQHVEVPESLDWRLYGAVTPVKDQAICGSCWSF 232
>UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1;
Rhipicephalus appendiculatus|Rep: Midgut cysteine
proteinase 2 - Rhipicephalus appendiculatus (Brown ear
tick)
Length = 564
Score = 55.6 bits (128), Expect = 1e-06
Identities = 26/49 (53%), Positives = 29/49 (59%), Gaps = 1/49 (2%)
Frame = +2
Query: 410 LYMKGGSVRGAKFISPA-NVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
L K GS R F KLP+Q+DWR +GAV KDQ CGSCWSF
Sbjct: 324 LQSKDGSSRAEPFPRHRFTAKLPDQIDWRPYGAVTPVKDQAVCGSCWSF 372
Score = 33.1 bits (72), Expect = 6.9
Identities = 18/40 (45%), Positives = 24/40 (60%)
Frame = +1
Query: 562 LGALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPG 681
+G LEG +FR++G LV L + +DCS GNNG G
Sbjct: 375 VGELEGAYFRKTGRLVR-LSEQQLVDCSWN-NGNNGCDGG 412
>UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsin L
- Suberites domuncula (Sponge)
Length = 324
Score = 55.6 bits (128), Expect = 1e-06
Identities = 33/98 (33%), Positives = 47/98 (47%)
Frame = +2
Query: 260 HKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRG 439
+K I HN + Y L MN++GD+ EF + NG+ + N
Sbjct: 50 NKKFIDSHNSVSDK--FGYTLEMNEFGDLSGVEFKQIYNGYIMQERANDTKLFTA----- 102
Query: 440 AKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ ++ PA VDWR+ G V K+QG+CGSCWSF
Sbjct: 103 SPYMEPA-----ASVDWRQKGVVSEVKNQGQCGSCWSF 135
>UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10;
Liliopsida|Rep: Putative cysteine proteinase - Oryza
sativa subsp. japonica (Rice)
Length = 416
Score = 55.2 bits (127), Expect = 1e-06
Identities = 35/94 (37%), Positives = 50/94 (53%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
I + NQK + G+ SY LG+NK+ D+ + EF G K + + + + + +
Sbjct: 56 IHEFNQKSK-GM-SYVLGLNKFSDLTYEEFAAKYTG----VKVDASAFATATTSSPDEEL 109
Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
P V P DWR +GAV KDQG+CGSCW F
Sbjct: 110 -PVGVP-PATWDWRLNGAVTDVKDQGQCGSCWVF 141
>UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila
melanogaster|Rep: CG5367-PA - Drosophila melanogaster
(Fruit fly)
Length = 338
Score = 55.2 bits (127), Expect = 1e-06
Identities = 32/100 (32%), Positives = 53/100 (53%), Gaps = 1/100 (1%)
Frame = +2
Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVR 436
E+ +I +HNQ Y+ G S++L N + DM ++K GF + K N ++ +
Sbjct: 62 ENFKVIEEHNQNYKEGQTSFRLKPNIFADMSTDGYLK---GFLRLLKSN----IEDSADN 114
Query: 437 GAKFI-SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
A+ + SP +PE +DWR G + +Q CGSC++F
Sbjct: 115 MAEIVGSPLMANVPESLDWRSKGFITPPYNQLSCGSCYAF 154
>UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13;
Plasmodium|Rep: Cysteine protease falcipain-3 -
Plasmodium falciparum
Length = 492
Score = 55.2 bits (127), Expect = 1e-06
Identities = 41/106 (38%), Positives = 49/106 (46%), Gaps = 7/106 (6%)
Frame = +2
Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEF------VKTMNGFNKTAKHNKNLYM 418
E+ I HN+K YK GMNK+GD+ EF +KT F KT +
Sbjct: 197 ENYRKIELHNKKTNS---LYKRGMNKFGDLSPEEFRSKYLNLKTHGPF-KTLSPPVSYEA 252
Query: 419 KGGSVRGAKFISPANVKLPE-QVDWRKHGAVPTFKDQGKCGSCWSF 553
V K PA+ KL DWR HG V KDQ CGSCW+F
Sbjct: 253 NYEDV--IKKYKPADAKLDRIAYDWRLHGGVTPVKDQALCGSCWAF 296
>UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh
fly) (Boettcherisca peregrina). Cathepsin L; n=2;
Dictyostelium discoideum|Rep: Similar to Sarcophaga
peregrina (Flesh fly) (Boettcherisca peregrina).
Cathepsin L - Dictyostelium discoideum (Slime mold)
Length = 265
Score = 55.2 bits (127), Expect = 1e-06
Identities = 26/78 (33%), Positives = 38/78 (48%)
Frame = +2
Query: 320 LGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKH 499
+ +N+Y D+ EF F K ++ + ++ F N +P+ DWR H
Sbjct: 1 MDLNEYSDLTQKEFADKF--FEKLVPEPRSGPIN--DIKATPFKHNVNATIPKSFDWRDH 56
Query: 500 GAVPTFKDQGKCGSCWSF 553
GAV K+QG C SCWSF
Sbjct: 57 GAVGKVKNQGSCASCWSF 74
>UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheirus
salmonis|Rep: Cysteine proteinase - Lepeophtheirus
salmonis (salmon louse)
Length = 372
Score = 55.2 bits (127), Expect = 1e-06
Identities = 27/82 (32%), Positives = 41/82 (50%), Gaps = 1/82 (1%)
Frame = +2
Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK-LPEQVD 487
++ +G+N++ D+ EF G++ + G V N+K LPE VD
Sbjct: 68 TWDMGINEFSDLTDEEFESKYMGYSPMSS-------SAGLVTRTAAPKQGNIKDLPESVD 120
Query: 488 WRKHGAVPTFKDQGKCGSCWSF 553
WR+ G + K+QG CGSCW F
Sbjct: 121 WREKGVITDVKNQGSCGSCWVF 142
>UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2;
Endopterygota|Rep: Cathepsin L-like proteinase - Bombyx
mori (Silk moth)
Length = 402
Score = 55.2 bits (127), Expect = 1e-06
Identities = 30/96 (31%), Positives = 50/96 (52%), Gaps = 2/96 (2%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKN--LYMKGGSVRGAK 445
+A+HN++Y G+ SY L +N +GDM E+ F K K K L+
Sbjct: 131 VARHNREYLAGIQSYSLHLNHFGDMHVTEY------FGKVLKLIKAFPLFDPAEDHHKTA 184
Query: 446 FISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ K+P+++DWR G P ++Q +CG+C++F
Sbjct: 185 YRHNRRCKVPKRIDWRDQGFKPRREEQWQCGACYAF 220
>UniRef50_Q5NE16 Cluster: Putative cathepsin L-like protein 3; n=3;
Homo sapiens|Rep: Putative cathepsin L-like protein 3 -
Homo sapiens (Human)
Length = 218
Score = 55.2 bits (127), Expect = 1e-06
Identities = 39/120 (32%), Positives = 55/120 (45%)
Frame = +2
Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
+I +HNQ+Y G S+ + MN +G+M EF + +NGF + KH K G
Sbjct: 3 MIEQHNQEYREGKHSFTMAMNAFGEMTSEEFRQVVNGF-QNQKHRK----------GKVL 51
Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSFQHDWELWKDSTSVSPATWCRFFGAK 628
P + + VDWR+ G V KDQ G S + D + S+S TW G K
Sbjct: 52 QEPLLHDIRKSVDWREKGYVTPVKDQCNWG---SVRTDVRKTEKLVSLSVQTWWTALGFK 108
>UniRef50_UPI000155637A Cluster: PREDICTED: similar to
ENSANGP00000013730, partial; n=1; Ornithorhynchus
anatinus|Rep: PREDICTED: similar to ENSANGP00000013730,
partial - Ornithorhynchus anatinus
Length = 229
Score = 54.8 bits (126), Expect = 2e-06
Identities = 22/32 (68%), Positives = 24/32 (75%)
Frame = +2
Query: 458 ANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
ANV LPE +DWR +GAV KDQ CGSCWSF
Sbjct: 51 ANVALPESLDWRLYGAVTPVKDQAVCGSCWSF 82
>UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus
tropicalis|Rep: LOC594890 protein - Xenopus tropicalis
(Western clawed frog) (Silurana tropicalis)
Length = 355
Score = 54.8 bits (126), Expect = 2e-06
Identities = 33/95 (34%), Positives = 50/95 (52%), Gaps = 1/95 (1%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFV-KTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
I HN +Y MGL +Y++GMN GDM+ E K MN + + ++ ++
Sbjct: 83 IMLHNLEYSMGLHTYEVGMNHLGDMVAEEMTDKQMNFIPQVIANITDVPVE--------- 133
Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
IS ++ PE +DWR V + KDQG C + W+F
Sbjct: 134 ISKSSP--PESIDWRNKNCVTSVKDQGSCIASWAF 166
>UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:
Silicatein beta - Suberites domuncula (Sponge)
Length = 383
Score = 54.8 bits (126), Expect = 2e-06
Identities = 36/109 (33%), Positives = 53/109 (48%), Gaps = 11/109 (10%)
Frame = +2
Query: 260 HKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVK---------TMNGFNKTAKHNKNL 412
+K I +HNQ + + Y L MNK+GD+ EF++ N + KH +
Sbjct: 83 NKEYIDQHNQNAQR--LGYTLKMNKFGDLTTKEFIEGYHCVQDYQPTNASHLNKKHKTHA 140
Query: 413 YMKGGS-VRGAKFISPANV-KLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
++ G VRG V +PE +DWR G V KDQ +CGS ++F
Sbjct: 141 FVDYGDFVRGGTGEGVRGVGNMPETMDWRTSGVVTKVKDQLRCGSSYAF 189
>UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted,
possible transmembrane domain near N-terminus; n=4;
Cryptosporidium|Rep: Cryptopain-cysteine proteinase
secreted, possible transmembrane domain near N-terminus
- Cryptosporidium parvum Iowa II
Length = 401
Score = 54.8 bits (126), Expect = 2e-06
Identities = 26/81 (32%), Positives = 44/81 (54%)
Frame = +2
Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
SY L MN++GD+ EF+ G+ K +K ++ ++ K V ++ S P ++W
Sbjct: 126 SYVLEMNEFGDLSKEEFMARFTGYIKDSKDDERVF-KSSRVSASE--SEEEFVPPNSINW 182
Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
+ G V ++Q CGSCW+F
Sbjct: 183 VEAGCVNPIRNQKNCGSCWAF 203
>UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1];
n=11; Eutheria|Rep: Testin-2 precursor [Contains:
Testin-1] - Mus musculus (Mouse)
Length = 333
Score = 54.8 bits (126), Expect = 2e-06
Identities = 32/95 (33%), Positives = 49/95 (51%)
Frame = +2
Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
+I HN +Y G + + MN +GD+ + EFVK M GF + + K +++ F
Sbjct: 58 MIELHNWEYLEGKHDFTMTMNAFGDLTNTEFVKMMTGFRR--QKIKRMHV---------F 106
Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ +P+ VDWR G V K+QG C S W+F
Sbjct: 107 QDHQFLYVPKYVDWRMLGYVTPVKNQGYCASSWAF 141
>UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila
melanogaster|Rep: LD36817p - Drosophila melanogaster
(Fruit fly)
Length = 352
Score = 54.4 bits (125), Expect = 3e-06
Identities = 32/97 (32%), Positives = 51/97 (52%), Gaps = 2/97 (2%)
Frame = +2
Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
+I N+ + G+ ++LG+N DM E + T+ G +K ++ + G +
Sbjct: 67 LITLSNKNADNGVSGFRLGVNTLADMTRKE-IATLLG-SKISEFGERY--TNGHINFVTA 122
Query: 449 ISPANVKLPEQVDWRKHGAV--PTFKDQGKCGSCWSF 553
+PA+ LPE DWR+ G V P F+ G CG+CWSF
Sbjct: 123 RNPASANLPEMFDWREKGGVTPPGFQGVG-CGACWSF 158
>UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16;
Bromeliaceae|Rep: Fruit bromelain precursor - Ananas
comosus (Pineapple)
Length = 351
Score = 54.4 bits (125), Expect = 3e-06
Identities = 30/81 (37%), Positives = 41/81 (50%)
Frame = +2
Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
SY LG+N++ DM EFV G + + + V IS +P+ +DW
Sbjct: 78 SYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVSFDDVN----ISA----VPQSIDW 129
Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
R +GAV K+Q CGSCWSF
Sbjct: 130 RDYGAVNEVKNQNPCGSCWSF 150
>UniRef50_Q24FA8 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 335
Score = 54.0 bits (124), Expect = 3e-06
Identities = 28/103 (27%), Positives = 55/103 (53%), Gaps = 4/103 (3%)
Frame = +2
Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTM----NGFNKTAKHNKNLYMKG 424
E+ + +HN+ +Y +G+N++ D+ E+ + + + N+ AK NKN ++
Sbjct: 58 ENYQSVQEHNKNSNH---TYSVGINQFSDITLQEYQQRILMKNSPLNELAK-NKNRLLQS 113
Query: 425 GSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
++ + + ++ +DWRK G V K+QG+CG CW+F
Sbjct: 114 SPIQNSN-----DTQIASSIDWRKKGGVSPVKNQGECGGCWTF 151
>UniRef50_Q22AB1 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 344
Score = 54.0 bits (124), Expect = 3e-06
Identities = 36/122 (29%), Positives = 55/122 (45%), Gaps = 6/122 (4%)
Frame = +2
Query: 206 SQLRKRGRRQFPHEDIPEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFN 385
S+ K + H + +H K++M + K G K+ DM EF M F+
Sbjct: 38 SKFNKYYHNEHEHHSSFHNYKTSREHIVKHQMENPNAKFGHTKFSDMSPEEFENKMLNFD 97
Query: 386 ----KTAKHNKNLYMKGGSVRG--AKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCW 547
K AK ++ + +K ++G + + N LPE DWR G + K Q CGSCW
Sbjct: 98 FSLFKKAK-SQGIKLKAEPMKGYLRQGENVDNSDLPESFDWRDKGIITPAKFQNTCGSCW 156
Query: 548 SF 553
+F
Sbjct: 157 TF 158
>UniRef50_O16454 Cluster: Temporarily assigned gene name protein
196; n=4; Bilateria|Rep: Temporarily assigned gene name
protein 196 - Caenorhabditis elegans
Length = 477
Score = 54.0 bits (124), Expect = 3e-06
Identities = 32/94 (34%), Positives = 45/94 (47%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
+ + QK E G Y G K+ DM EF K M + + + +Y + +
Sbjct: 204 VIRELQKNEQGTAVY--GFTKFSDMTTMEFKKIMLPY----QWEQPVYPMEQANFEKHDV 257
Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ LPE DWR+ GAV K+QG CGSCW+F
Sbjct: 258 TINEEDLPESFDWREKGAVTQVKNQGNCGSCWAF 291
>UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3;
Bilateria|Rep: Cathepsin L-like cysteine protease -
Neobenedenia melleni
Length = 335
Score = 54.0 bits (124), Expect = 3e-06
Identities = 39/141 (27%), Positives = 55/141 (39%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
+ KHN+ Y G SY L MN D+ EF K + + G G
Sbjct: 58 VRKHNELYAQGKKSYTLAMNHMADLSSEEF----KALYLVPKFDATKVPRKGKAAGEH-- 111
Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSFQHDWELWKDSTSVSPATWCRFFGAKP 631
P ++DW + G V K+Q +CGSCW+F + +V AT ++
Sbjct: 112 RQIKNDPPSEIDWVRKGHVTAVKNQAQCGSCWAFSSTGSI---EGAVKRATGKLISFSEQ 168
Query: 632 SSTAREQLRGTTGCNRGGSLD 694
G GCN GG +D
Sbjct: 169 QLVDCSTAFGNHGCN-GGIMD 188
>UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;
n=17; Magnoliophyta|Rep: Thiol protease aleurain-like
precursor - Arabidopsis thaliana (Mouse-ear cress)
Length = 358
Score = 54.0 bits (124), Expect = 3e-06
Identities = 37/99 (37%), Positives = 52/99 (52%)
Frame = +2
Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVR 436
E+ +I N+K GL SYKL +N++ D+ EF + G A N + +KG
Sbjct: 85 ENLDLIRSTNKK---GL-SYKLSLNQFADLTWQEFQRYKLG----AAQNCSATLKGSHK- 135
Query: 437 GAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
I+ A V P+ DWR+ G V K+QG CGSCW+F
Sbjct: 136 ----ITEATV--PDTKDWREDGIVSPVKEQGHCGSCWTF 168
>UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis
thaliana|Rep: F9P14.12 protein - Arabidopsis thaliana
(Mouse-ear cress)
Length = 343
Score = 53.6 bits (123), Expect = 5e-06
Identities = 30/80 (37%), Positives = 42/80 (52%)
Frame = +2
Query: 314 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 493
+KL N++ DM + EF G N ++ L+ K V PA +P+ VDWR
Sbjct: 84 FKLTDNRFADMTNSEFKAHFLGLNTSSLR---LHKKQRPV-----CDPAG-NVPDAVDWR 134
Query: 494 KHGAVPTFKDQGKCGSCWSF 553
GAV ++QGKCG CW+F
Sbjct: 135 TQGAVTPIRNQGKCGGCWAF 154
>UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheirus
salmonis|Rep: Putative cathepsin L - Lepeophtheirus
salmonis (salmon louse)
Length = 257
Score = 53.6 bits (123), Expect = 5e-06
Identities = 28/76 (36%), Positives = 38/76 (50%)
Frame = +2
Query: 326 MNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGA 505
MN+YGD+L EF++ G K + N + S +P V+W K+GA
Sbjct: 1 MNQYGDLLQSEFLQGYTGLAKGSYSGDNTVILDNSA-----------PVPSYVNWTKNGA 49
Query: 506 VPTFKDQGKCGSCWSF 553
V KDQ CGSCW+F
Sbjct: 50 VTAVKDQKDCGSCWAF 65
>UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960
precursor; n=2; Arabidopsis thaliana|Rep: Probable
cysteine proteinase At3g43960 precursor - Arabidopsis
thaliana (Mouse-ear cress)
Length = 376
Score = 53.6 bits (123), Expect = 5e-06
Identities = 32/82 (39%), Positives = 44/82 (53%), Gaps = 1/82 (1%)
Frame = +2
Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
SY+ G+NK+ D+ EF + G K K K S ++ LP++VDW
Sbjct: 82 SYERGLNKFSDLTADEFQASYLG----GKMEK----KSLSDVAERYQYKEGDVLPDEVDW 133
Query: 491 RKHGAV-PTFKDQGKCGSCWSF 553
R+ GAV P K QG+CGSCW+F
Sbjct: 134 RERGAVVPRVKRQGECGSCWAF 155
>UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=15;
Magnoliophyta|Rep: Cysteine proteinase RD19a precursor -
Arabidopsis thaliana (Mouse-ear cress)
Length = 368
Score = 53.2 bits (122), Expect = 6e-06
Identities = 30/77 (38%), Positives = 37/77 (48%)
Frame = +2
Query: 323 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHG 502
G+ ++ D+ EF K G K K+ A + N LPE DWR HG
Sbjct: 95 GVTQFSDLTRSEFRKKHLGVRSGFKLPKD-------ANKAPILPTEN--LPEDFDWRDHG 145
Query: 503 AVPTFKDQGKCGSCWSF 553
AV K+QG CGSCWSF
Sbjct: 146 AVTPVKNQGSCGSCWSF 162
>UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core
eudicotyledons|Rep: Chymopapain precursor - Carica
papaya (Papaya)
Length = 352
Score = 53.2 bits (122), Expect = 6e-06
Identities = 30/81 (37%), Positives = 41/81 (50%)
Frame = +2
Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
SY LG+N + D+ + EF K GF A+ L K ++ P+ +DW
Sbjct: 88 SYWLGLNGFADLSNDEFKKKYVGF--VAEDFTGLEHFDNEDFTYKHVT----NYPQSIDW 141
Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
R GAV K+QG CGSCW+F
Sbjct: 142 RAKGAVTPVKNQGACGSCWAF 162
>UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3;
Dictyostelium discoideum|Rep: Cysteine proteinase 1
precursor - Dictyostelium discoideum (Slime mold)
Length = 343
Score = 53.2 bits (122), Expect = 6e-06
Identities = 38/122 (31%), Positives = 59/122 (48%), Gaps = 7/122 (5%)
Frame = +2
Query: 209 QLRKRGRRQFPHEDIPEHKHIIAKHNQKYE-MGLVSY------KLGMNKYGDMLHHEFVK 367
+ + + +++ HE+ E I + K E + L++ K G+NK+ D+ EF K
Sbjct: 31 EFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEF-K 89
Query: 368 TMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCW 547
NK A +L + +FI+ +P DWR GAV K+QG+CGSCW
Sbjct: 90 NYYLNNKEAIFTDDLPV--ADYLDDEFIN----SIPTAFDWRTRGAVTPVKNQGQCGSCW 143
Query: 548 SF 553
SF
Sbjct: 144 SF 145
>UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus
tauri|Rep: Cysteine protease-1 - Ostreococcus tauri
Length = 430
Score = 52.8 bits (121), Expect = 8e-06
Identities = 32/104 (30%), Positives = 53/104 (50%), Gaps = 5/104 (4%)
Frame = +2
Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGS-- 430
E+ + +HN Y +G VS+ +G+N E+ + + G+ + + + M +
Sbjct: 126 ENAAYVVEHNALYAIGEVSHWVGLNSLAATTREEY-RALLGYKPELRSSGDAEMLEATST 184
Query: 431 --VRGAKFI-SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
V K A+V PE +DW + GAV K+QG+CGSCW+F
Sbjct: 185 DKVEQYKASWEYASVDPPEAIDWVELGAVTPPKNQGQCGSCWAF 228
>UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, whole
genome shotgun sequence; n=1; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_23,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 321
Score = 52.8 bits (121), Expect = 8e-06
Identities = 32/81 (39%), Positives = 42/81 (51%)
Frame = +2
Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
SYK +NK+GD+ EF+ A+ KN+ K P V+ E+VDW
Sbjct: 78 SYKQKINKFGDLTDQEFLTIYLNLQMPARV-KNIQ---------KNEEPFLVQ--EEVDW 125
Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
+ G VP KDQG CGSCW+F
Sbjct: 126 VQKGKVPAIKDQGDCGSCWAF 146
>UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4;
Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis
zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa
zea single nucleocapsid nuclear polyhedrosis virus)
Length = 367
Score = 52.8 bits (121), Expect = 8e-06
Identities = 28/81 (34%), Positives = 41/81 (50%)
Frame = +2
Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
S + G+NK+ D E + + GF + L + V+GA +++LP+ DW
Sbjct: 109 SAQFGVNKFSDKTPDEVLHSNTGFFLNLSQHYTL-CENRIVKGAP-----DIRLPDYYDW 162
Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
R V KDQG CGSCW+F
Sbjct: 163 RDTNKVTPIKDQGVCGSCWAF 183
>UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin S
preproprotein; n=2; Tribolium castaneum|Rep: PREDICTED:
similar to cathepsin S preproprotein - Tribolium
castaneum
Length = 525
Score = 52.4 bits (120), Expect = 1e-05
Identities = 36/117 (30%), Positives = 51/117 (43%)
Frame = +2
Query: 203 PSQLRKRGRRQFPHEDIPEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGF 382
P+ + RR + E KH HN++Y GL +Y L +N D E M+
Sbjct: 237 PNLEEENFRRAIFEKTFQEIKH----HNERYRKGLETYYLRINDLSDYTDEE----MSCC 288
Query: 383 NKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
++ A + S + LP+ VDWR G V K QGKCG+CW+F
Sbjct: 289 SEKAPKPSITILPNVSTSSRQ-------NLPKMVDWRLRGVVTPVKHQGKCGTCWAF 338
Score = 48.0 bits (109), Expect = 2e-04
Identities = 18/28 (64%), Positives = 20/28 (71%)
Frame = +2
Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
LP+ VDWR G V K QGKCGSCW+F
Sbjct: 35 LPDMVDWRLQGVVTPVKRQGKCGSCWAF 62
>UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-PA
- Drosophila melanogaster (Fruit fly)
Length = 549
Score = 52.4 bits (120), Expect = 1e-05
Identities = 35/109 (32%), Positives = 50/109 (45%), Gaps = 5/109 (4%)
Frame = +2
Query: 242 HEDIP-EHKHIIAKHNQKY----EMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 406
H D EH+ I + N +Y ++Y L +N D E +K G+ + +N
Sbjct: 257 HSDTEHEHRKNIFRQNLRYIHSKNRAKLTYTLAVNHLADKTEEE-LKARRGYKSSGIYNT 315
Query: 407 NLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
K K+ ++P+Q DWR +GAV KDQ CGSCWSF
Sbjct: 316 G---KPFPYDVPKYKD----EIPDQYDWRLYGAVTPVKDQSVCGSCWSF 357
>UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1;
Rhipicephalus appendiculatus|Rep: Midgut cysteine
proteinase 4 - Rhipicephalus appendiculatus (Brown ear
tick)
Length = 345
Score = 52.0 bits (119), Expect = 1e-05
Identities = 25/90 (27%), Positives = 44/90 (48%)
Frame = +2
Query: 284 NQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPAN 463
++K++ G + Y + +N + DM E V G+ + + +P
Sbjct: 73 DEKFKNGTLLYSVAVNHFADMTPDEVVANYTGYKPPSAQQ---------LAEIPLYAPLF 123
Query: 464 VKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
PE ++WR++G V K+QG+CGSCW+F
Sbjct: 124 GDTPEFIEWRENGFVTPVKNQGQCGSCWAF 153
Score = 38.7 bits (86), Expect = 0.14
Identities = 22/48 (45%), Positives = 30/48 (62%)
Frame = +1
Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGLIGXTAFQ 708
GALEGQ F+++ L+SL + +DC+G GNNG G + G AFQ
Sbjct: 157 GALEGQVFKRTRRLISL-SEQNLMDCAGQRYGNNGCNGGQMPG--AFQ 201
>UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35;
Viridiplantae|Rep: Cysteine proteinase 15A precursor -
Pisum sativum (Garden pea)
Length = 363
Score = 52.0 bits (119), Expect = 1e-05
Identities = 28/77 (36%), Positives = 38/77 (49%)
Frame = +2
Query: 323 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHG 502
G+ K+ D+ EF + G K + + + A + N LPE DWR+ G
Sbjct: 92 GITKFSDLTASEFRRQFLGLKKRLRLPAH-------AQKAPILPTTN--LPEDFDWREKG 142
Query: 503 AVPTFKDQGKCGSCWSF 553
AV KDQG CGSCW+F
Sbjct: 143 AVTPVKDQGSCGSCWAF 159
>UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Oryza
sativa|Rep: Cysteine protease 1 precursor - Oryza sativa
subsp. japonica (Rice)
Length = 490
Score = 52.0 bits (119), Expect = 1e-05
Identities = 29/81 (35%), Positives = 41/81 (50%), Gaps = 1/81 (1%)
Frame = +2
Query: 314 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 493
++LGMN++ D+ + EF T G + G G + LP+ VDWR
Sbjct: 112 FRLGMNRFADLTNGEFRATYLGTTPAGR---------GRRVGEAYRHDGVEALPDSVDWR 162
Query: 494 KHGAVPT-FKDQGKCGSCWSF 553
GAV K+QG+CGSCW+F
Sbjct: 163 DKGAVVAPVKNQGQCGSCWAF 183
>UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Rep:
Cathepsin R precursor - Mus musculus (Mouse)
Length = 334
Score = 52.0 bits (119), Expect = 1e-05
Identities = 33/99 (33%), Positives = 46/99 (46%)
Frame = +2
Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVR 436
E +I HN++ +G + + MN++GD EF K M + MK R
Sbjct: 54 EKLKMIKLHNRENSLGKNGFTMKMNEFGDQTDEEFRKMMIEISVWTHREGKSIMK----R 109
Query: 437 GAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
A I LP+ VDWRK G V + QG C +CW+F
Sbjct: 110 EAGSI------LPKFVDWRKKGYVTPVRRQGDCDACWAF 142
>UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin l;
n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
similar to cathepsin l - Strongylocentrotus purpuratus
Length = 489
Score = 51.6 bits (118), Expect = 2e-05
Identities = 29/82 (35%), Positives = 39/82 (47%)
Frame = +2
Query: 308 VSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVD 487
+ Y L +N D H E +K M G + + N L G V ++ +P+ +D
Sbjct: 222 LGYVLDINHMADQSHQE-LKRMRGRLRQTRPNNGLPYDGSDV--------SDDAVPDHID 272
Query: 488 WRKHGAVPTFKDQGKCGSCWSF 553
W GAV KDQ CGSCWSF
Sbjct: 273 WNVLGAVSPVKDQAVCGSCWSF 294
>UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease
containing protein; n=1; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 350
Score = 51.6 bits (118), Expect = 2e-05
Identities = 31/96 (32%), Positives = 50/96 (52%), Gaps = 2/96 (2%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFV-KTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
I KHN +YKL N++ DM EF + +N KT+ + + + +RG+
Sbjct: 78 IQKHNSDSNN---TYKLQHNQFSDMTKDEFAHRVLNSQLKTSASSSSQPAQTPQLRGSV- 133
Query: 449 ISPANVKLPEQVDWRKH-GAVPTFKDQGKCGSCWSF 553
A++ + DWR + G + K+QG+CGSCW+F
Sbjct: 134 --DASLNASQGFDWRNYQGVLGNVKNQGQCGSCWTF 167
>UniRef50_Q5C1E8 Cluster: SJCHGC04937 protein; n=1; Schistosoma
japonicum|Rep: SJCHGC04937 protein - Schistosoma
japonicum (Blood fluke)
Length = 235
Score = 51.6 bits (118), Expect = 2e-05
Identities = 29/99 (29%), Positives = 49/99 (49%), Gaps = 5/99 (5%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSV---RGA 442
I HN Y++ LV+Y LG+N++ D+ E + T + NKN + ++ +
Sbjct: 90 IGLHNLHYDLNLVTYTLGINQFSDLTWIE-LSTFYLHELSVNLNKNKLLNSLNMFKLQSY 148
Query: 443 KFISP--ANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
F + + + +P+ DWR V K+Q KCG W+F
Sbjct: 149 NFTTTLLSTLNIPDNFDWRTKNVVTNVKNQEKCGCGWAF 187
>UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryza
sativa|Rep: Putative cysteine proteinase - Oryza sativa
subsp. japonica (Rice)
Length = 352
Score = 51.2 bits (117), Expect = 2e-05
Identities = 25/80 (31%), Positives = 39/80 (48%)
Frame = +2
Query: 314 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 493
Y+L N++ D+ EF G+N +Y + +S + + P +VDWR
Sbjct: 84 YRLATNRFTDLTDAEFAAMYTGYNPA----NTMY---AAANATTRLSSEDDQQPAEVDWR 136
Query: 494 KHGAVPTFKDQGKCGSCWSF 553
+ GAV K+Q CG CW+F
Sbjct: 137 QQGAVTGVKNQRSCGCCWAF 156
>UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24;
Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa
(Rice)
Length = 339
Score = 51.2 bits (117), Expect = 2e-05
Identities = 31/92 (33%), Positives = 44/92 (47%), Gaps = 3/92 (3%)
Frame = +2
Query: 287 QKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANV 466
+ + G + L +N++ D+ ++EF + K NK +VR NV
Sbjct: 69 ESFNAGNHKFWLSVNQFADLTNYEF--------RATKTNKGFIPS--TVRVPTTFRYENV 118
Query: 467 K---LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
LP VDWR GAV KDQG+CG CW+F
Sbjct: 119 SIDTLPATVDWRTKGAVTPIKDQGQCGCCWAF 150
>UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia
circumcincta|Rep: Secreted cathepsin F - Teladorsagia
circumcincta
Length = 364
Score = 51.2 bits (117), Expect = 2e-05
Identities = 30/94 (31%), Positives = 45/94 (47%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
I + Q+ + G Y G+N++ D+ EF KT + N + A+ +
Sbjct: 94 IIRSAQENDKGTAIY--GINQFADLSPEEFKKTHLPHTWKQPDHPNRIVD----LAAEGV 147
Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
P LPE DWR+HGAV K +G C +CW+F
Sbjct: 148 DPKE-PLPESFDWREHGAVTKVKTEGHCAACWAF 180
>UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteinase
B; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like
tick cysteine proteinase B - Haemaphysalis longicornis
(Bush tick)
Length = 332
Score = 51.2 bits (117), Expect = 2e-05
Identities = 36/104 (34%), Positives = 54/104 (51%), Gaps = 6/104 (5%)
Frame = +2
Query: 257 EHKHIIAKHNQKY-EMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLY--MKGG 427
E++ IA+HN KY GLV + HE V + + +H + L + G
Sbjct: 52 ENRLKIARHNAKYANNGLVQAR-----------HERVWRLVA-PRVCEHPQRLQAQLPGP 99
Query: 428 SVRGAKFISPANVK---LPEQVDWRKHGAVPTFKDQGKCGSCWS 550
G+ +I P ++ LP+ +DWRK GAV K+QG+CGSCW+
Sbjct: 100 PTWGSTYIEPEGLEDEHLPKTMDWRKKGAVTPVKNQGQCGSCWA 143
>UniRef50_P25779 Cluster: Cruzipain precursor; n=54;
Trypanosoma|Rep: Cruzipain precursor - Trypanosoma cruzi
Length = 467
Score = 51.2 bits (117), Expect = 2e-05
Identities = 26/77 (33%), Positives = 35/77 (45%)
Frame = +2
Query: 323 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHG 502
G+ + D+ EF ++ HN + R + V P VDWR G
Sbjct: 82 GVTPFSDLTREEF--------RSRYHNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARG 133
Query: 503 AVPTFKDQGKCGSCWSF 553
AV KDQG+CGSCW+F
Sbjct: 134 AVTAVKDQGQCGSCWAF 150
>UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61;
Leishmania|Rep: Cysteine proteinase 2 precursor -
Leishmania pifanoi
Length = 444
Score = 51.2 bits (117), Expect = 2e-05
Identities = 30/83 (36%), Positives = 42/83 (50%), Gaps = 4/83 (4%)
Frame = +2
Query: 317 KLGMNKYGDMLHHEFV-KTMNG---FNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQV 484
+ G+ K+ D+ EF + +NG F +H Y K + A +P+ V
Sbjct: 80 QFGITKFFDLSEAEFAARYLNGAAYFAAAKRHAAQHYRKARADLSA---------VPDAV 130
Query: 485 DWRKHGAVPTFKDQGKCGSCWSF 553
DWR+ GAV KDQG CGSCW+F
Sbjct: 131 DWREKGAVTPVKDQGACGSCWAF 153
>UniRef50_Q5YER1 Cluster: Cysteine proteinase; n=1; Bigelowiella
natans|Rep: Cysteine proteinase - Bigelowiella natans
(Pedinomonas minutissima) (Chlorarachnion sp. (strain
CCMP 621))
Length = 140
Score = 50.8 bits (116), Expect = 3e-05
Identities = 27/88 (30%), Positives = 44/88 (50%)
Frame = +2
Query: 290 KYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK 469
++ +G SY + +N++ D+ + EF +G A+ G + + + K
Sbjct: 61 RHNVGGYSYTVELNEFADLTNAEFRSLYHGLKPNAQ-------------GPRRTANLSTK 107
Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ VDW GAV K+QG+CGSCWSF
Sbjct: 108 SADSVDWVSKGAVTPVKNQGQCGSCWSF 135
>UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep:
Cathepsin - Geodia cydonium (Sponge)
Length = 322
Score = 50.8 bits (116), Expect = 3e-05
Identities = 30/92 (32%), Positives = 45/92 (48%)
Frame = +2
Query: 278 KHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISP 457
K ++++ Y + MN++ D+ EFV NG + H + G + +S
Sbjct: 48 KFVEEFDSEREGYTVAMNEFADLDPREFVSHYNGLRRRP-HTSS----GEPCTLGEDVSA 102
Query: 458 ANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
LP VDWR G V K+QG+CGSCW+F
Sbjct: 103 ----LPTTVDWRTKGYVTGVKNQGQCGSCWAF 130
Score = 38.7 bits (86), Expect = 0.14
Identities = 22/41 (53%), Positives = 26/41 (63%)
Frame = +1
Query: 565 GALEGQHFRQSGYLVSLLRSKTFIDCSGAVTGNNGLQPGGL 687
G+LEGQHF +G LVS L + +DCS A GN G GGL
Sbjct: 134 GSLEGQHFNATGKLVS-LSEQNLVDCSSA-EGNEGCN-GGL 171
>UniRef50_Q23H10 Cluster: Papain family cysteine protease containing
protein; n=14; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 336
Score = 50.8 bits (116), Expect = 3e-05
Identities = 26/85 (30%), Positives = 44/85 (51%), Gaps = 4/85 (4%)
Frame = +2
Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKH-NKNLYMKG---GSVRGAKFISPANVKLPE 478
+Y + +N++ DM EF + + + H K + + + +S ++ L +
Sbjct: 70 TYSVHLNQFSDMTKEEFAEKILMKSDLVDHLMKGISQEATHNDTNNNETQLSSNSLTLAD 129
Query: 479 QVDWRKHGAVPTFKDQGKCGSCWSF 553
+DWR GAV + K+QG CGSCWSF
Sbjct: 130 SIDWRTKGAVTSVKNQGGCGSCWSF 154
>UniRef50_Q22LI1 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 987
Score = 50.8 bits (116), Expect = 3e-05
Identities = 27/81 (33%), Positives = 39/81 (48%)
Frame = +2
Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
+++LG+N+Y M EF + + + K K + V + +DW
Sbjct: 71 TFQLGLNEYAHMTSQEFAEVFLTPSISKSQQKQPKPKPQPQPHPNNSTNTTVTITP-IDW 129
Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
R GAV + K QGKCGSCWSF
Sbjct: 130 RNKGAVTSVKRQGKCGSCWSF 150
>UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep:
Dvir_CG5367 - Drosophila virilis (Fruit fly)
Length = 298
Score = 50.8 bits (116), Expect = 3e-05
Identities = 32/105 (30%), Positives = 51/105 (48%), Gaps = 1/105 (0%)
Frame = +2
Query: 242 HEDIPEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMK 421
+E E++ I+ +HN YE G S++L N DM ++K G+ + + +
Sbjct: 17 YEAYEENQIIVNEHNTYYETGKSSFRLATNTMADMNTDSYLK---GYLRLLRSPEI---- 69
Query: 422 GGSVRGAKFI-SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
S A + SP +PE DWRK G + +Q CGSC++F
Sbjct: 70 SDSDNIADIVGSPLMNNVPESFDWRKKGFITPLYNQQSCGSCYAF 114
>UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep:
Cysteine proteinase - Cryptobia salmositica
Length = 443
Score = 50.4 bits (115), Expect = 4e-05
Identities = 29/79 (36%), Positives = 39/79 (49%), Gaps = 2/79 (2%)
Frame = +2
Query: 323 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVK--LPEQVDWRK 496
G N++ DM EF N A+H K + K + +K + +Q+DWR
Sbjct: 69 GPNEFADMTSEEFQTRHNA----ARHYAAA--KARPPKNTKTFTAEEIKAAVGQQIDWRL 122
Query: 497 HGAVPTFKDQGKCGSCWSF 553
GAV K+QG CGSCWSF
Sbjct: 123 KGAVTPVKNQGACGSCWSF 141
>UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicatein
a3 - Lubomirskia baicalensis
Length = 344
Score = 50.4 bits (115), Expect = 4e-05
Identities = 37/121 (30%), Positives = 57/121 (47%)
Frame = +2
Query: 191 QAAAPSQLRKRGRRQFPHEDIPEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKT 370
Q + SQL++ R H +K I HN + L Y L MN +GD++ EF +
Sbjct: 52 QRSYESQLQEMER----HSIWVANKKYIEHHNANAD--LFGYTLAMNGFGDLMSAEFTER 105
Query: 371 MNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWS 550
T KH++ ++ F SP V + +DWR G V + + QG+CGS ++
Sbjct: 106 Y----LTHKHSQRSGLQ-------TFESPKGVTYADSLDWRTRGVVTSVQSQGQCGSSYA 154
Query: 551 F 553
F
Sbjct: 155 F 155
>UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Rep:
Cysteine protease - Babesia equi
Length = 438
Score = 50.4 bits (115), Expect = 4e-05
Identities = 33/98 (33%), Positives = 45/98 (45%), Gaps = 10/98 (10%)
Frame = +2
Query: 290 KYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGS----------VRG 439
K + G SY+ G+NK+ DM EF + + K+L + VR
Sbjct: 155 KAQTGEESYEKGINKFSDMTDEEFNLRFPALS-VEELKKSLEVSASEEFTSPEHLDKVRI 213
Query: 440 AKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
AK + + E +DWRK V KDQG CGSCW+F
Sbjct: 214 AKGLGVEDSVDGEDLDWRKLNGVTPVKDQGNCGSCWAF 251
>UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326;
n=2; Danio rerio|Rep: hypothetical protein LOC550326 -
Danio rerio
Length = 531
Score = 50.0 bits (114), Expect = 6e-05
Identities = 33/118 (27%), Positives = 51/118 (43%), Gaps = 5/118 (4%)
Frame = +2
Query: 215 RKRGRRQFPHEDIPEHKHIIAKHNQKY-----EMGLVSYKLGMNKYGDMLHHEFVKTMNG 379
+++ RQ+ E E + + H ++ GL +Y +G+N + D E + G
Sbjct: 233 KEKFNRQYESEKEHEERENLFLHTFRFVHSNNRAGL-TYSVGINHFADKTKEELARMTGG 291
Query: 380 FNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
K + +R ++ P VDWR +GAV KDQ CGSCWSF
Sbjct: 292 L--LPKKEEKAQPFPSEIR--------SIATPNSVDWRLYGAVTPVKDQAVCGSCWSF 339
>UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1;
Naegleria fowleri|Rep: Cysteine proteinase homolog -
Naegleria fowleri
Length = 347
Score = 50.0 bits (114), Expect = 6e-05
Identities = 28/78 (35%), Positives = 38/78 (48%), Gaps = 1/78 (1%)
Frame = +2
Query: 323 GMNKYGDMLHHEFVKTMNGFNKTAKHNKN-LYMKGGSVRGAKFISPANVKLPEQVDWRKH 499
G+ K+ D+ EF + T + K L +V K + A P DWR+H
Sbjct: 76 GITKFSDLTPEEFKRMFLMKTYTPEEAKKILAAPQHAVLSEKEVQTA----PTSFDWRQH 131
Query: 500 GAVPTFKDQGKCGSCWSF 553
GAV K+QG CGSCW+F
Sbjct: 132 GAVTRVKNQGACGSCWTF 149
>UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1;
Brugia malayi|Rep: Cathepsin F-like cysteine proteinase
- Brugia malayi (Filarial nematode worm)
Length = 461
Score = 49.6 bits (113), Expect = 7e-05
Identities = 33/93 (35%), Positives = 39/93 (41%)
Frame = +2
Query: 275 AKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFIS 454
AK Q E G Y G K+ DM EF K M + N G +
Sbjct: 190 AKKLQFEEKGTAIY--GATKFSDMTAEEFQKIMLPSIWWDRVESN-----GITFNLNDFN 242
Query: 455 PANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ LP + DWR G V KDQG CGSCW+F
Sbjct: 243 LSIYNLPSKFDWRTEGVVTPVKDQGSCGSCWAF 275
>UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:
Viral cathepsin - Cydia pomonella granulosis virus
(CpGV) (Cydia pomonella granulovirus)
Length = 333
Score = 49.6 bits (113), Expect = 7e-05
Identities = 29/78 (37%), Positives = 41/78 (52%), Gaps = 2/78 (2%)
Frame = +2
Query: 326 MNKYGDMLHHEFVKTMNGFNKTAKHNKNLY-MKGGSVRGAKFISPANVKLPEQVDWR-KH 499
+N+Y D+ + ++ GF K N + + M SV K LPE +DWR KH
Sbjct: 77 INEYSDLNKNALLRRTTGFRLGLKKNPSAFTMTECSVVVIK--DEPQALLPETLDWRDKH 134
Query: 500 GAVPTFKDQGKCGSCWSF 553
G P K+Q +CGSCW+F
Sbjct: 135 GVTPV-KNQMECGSCWAF 151
>UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis
thaliana|Rep: Cysteine proteinase - Arabidopsis thaliana
(Mouse-ear cress)
Length = 348
Score = 49.2 bits (112), Expect = 1e-04
Identities = 35/121 (28%), Positives = 53/121 (43%), Gaps = 6/121 (4%)
Frame = +2
Query: 209 QLRKRGRRQFPHEDIPEHKHIIAKHN----QKYEMG-LVSYKLGMNKYGDMLHHEFVKTM 373
Q R R + E ++ I K N Q + M ++YK+ +N++ D+ EF T
Sbjct: 37 QWMARFNRVYSDETEKRNRFNIFKKNLEFVQNFNMNNKITYKVDINEFSDLTDEEFRATH 96
Query: 374 NGFNKTAKHNKNLYMKGGSVRGAKFISPANVK-LPEQVDWRKHGAVPTFKDQGKCGSCWS 550
G + + G + NV E +DWR+ GAV K QG+CG CW+
Sbjct: 97 TGLVVPEAITRISTLSSG--KNTVPFRYGNVSDNGESMDWRQEGAVTPVKYQGRCGGCWA 154
Query: 551 F 553
F
Sbjct: 155 F 155
>UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba
culbertsoni|Rep: Cysteine proteinase - Acanthamoeba
culbertsoni
Length = 482
Score = 49.2 bits (112), Expect = 1e-04
Identities = 24/89 (26%), Positives = 38/89 (42%)
Frame = +2
Query: 287 QKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANV 466
+++ G ++ + MN++GD+ EF + G A +
Sbjct: 95 EEFNRGNHTFTVAMNEHGDLTPEEFARLYMGQVSPASEQELQERIAAESAMEDEHHHTRA 154
Query: 467 KLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+P DWR GAV K+QG C SCW+F
Sbjct: 155 SIPANWDWRTKGAVTPVKNQGSCASCWAF 183
>UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep:
Cysteine proteinase - Paragonimus westermani
Length = 272
Score = 49.2 bits (112), Expect = 1e-04
Identities = 19/38 (50%), Positives = 26/38 (68%), Gaps = 1/38 (2%)
Frame = +2
Query: 443 KFISPANVKL-PEQVDWRKHGAVPTFKDQGKCGSCWSF 553
K + P +K PE++DWR GAV ++QG CGSCW+F
Sbjct: 44 KRVRPTGLKAAPERIDWRAKGAVTAVENQGSCGSCWAF 81
>UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep:
Cysteine protease - Clonorchis sinensis
Length = 328
Score = 49.2 bits (112), Expect = 1e-04
Identities = 18/26 (69%), Positives = 21/26 (80%)
Frame = +2
Query: 476 EQVDWRKHGAVPTFKDQGKCGSCWSF 553
E+ DWR+HGAV DQGKCGSCW+F
Sbjct: 117 EKFDWREHGAVGPVLDQGKCGSCWAF 142
>UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163
precursor; n=4; Schizophora|Rep: Putative cysteine
proteinase CG12163 precursor - Drosophila melanogaster
(Fruit fly)
Length = 614
Score = 49.2 bits (112), Expect = 1e-04
Identities = 31/86 (36%), Positives = 45/86 (52%)
Frame = +2
Query: 296 EMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLP 475
EMG S K G+ ++ DM E+ K G + + GGS A + + +LP
Sbjct: 346 EMG--SAKYGITEFADMTSSEY-KERTGLWQRDEAKAT----GGS---AAVVPAYHGELP 395
Query: 476 EQVDWRKHGAVPTFKDQGKCGSCWSF 553
++ DWR+ AV K+QG CGSCW+F
Sbjct: 396 KEFDWRQKDAVTQVKNQGSCGSCWAF 421
>UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum
aestivum|Rep: Cysteine protease - Triticum aestivum
(Wheat)
Length = 371
Score = 48.8 bits (111), Expect = 1e-04
Identities = 32/95 (33%), Positives = 46/95 (48%), Gaps = 13/95 (13%)
Frame = +2
Query: 308 VSYKLGMNKYGDMLHHEFV-KTMNGFNKTAKHNKNLY--MKGGSVRGAKFISPA-----N 463
+ Y+LG N++ D+ + EF+ + + G A L + G V GA A N
Sbjct: 86 LGYELGENEFTDLTNEEFMARYVGGAYGGAGDGGGLITTLAGDVVEGAASSKNAIEEDRN 145
Query: 464 VKL-----PEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ + P Q DWR+HG V K QG CG CW+F
Sbjct: 146 LTMTASDPPRQFDWREHGVVTPAKQQGACGCCWAF 180
>UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza
sativa|Rep: Os01g0347600 protein - Oryza sativa subsp.
japonica (Rice)
Length = 343
Score = 48.8 bits (111), Expect = 1e-04
Identities = 35/104 (33%), Positives = 50/104 (48%), Gaps = 5/104 (4%)
Frame = +2
Query: 257 EHKHIIAKHNQKYEMGL---VSYK--LGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMK 421
EH+ I + N + G V+Y +G+N++ D+ + EFV T G H K
Sbjct: 62 EHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPP--HPKE---- 115
Query: 422 GGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ + P + P +DWR GAV KDQG CGSCW+F
Sbjct: 116 -----APRPVDP--IWTPCCIDWRFRGAVTGVKDQGACGSCWAF 152
>UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1;
Oryza sativa (japonica cultivar-group)|Rep: Putative
uncharacterized protein - Oryza sativa subsp. japonica
(Rice)
Length = 289
Score = 48.8 bits (111), Expect = 1e-04
Identities = 35/104 (33%), Positives = 50/104 (48%), Gaps = 5/104 (4%)
Frame = +2
Query: 257 EHKHIIAKHNQKYEMGL---VSYK--LGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMK 421
EH+ I + N + G V+Y +G+N++ D+ + EFV T G H K
Sbjct: 61 EHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPP--HPKE---- 114
Query: 422 GGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ + P + P +DWR GAV KDQG CGSCW+F
Sbjct: 115 -----APRPVDP--IWTPCCIDWRFRGAVTGVKDQGACGSCWAF 151
>UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n=3;
Brugia malayi|Rep: Cathepsin L-like cysteine proteinase
- Brugia malayi (Filarial nematode worm)
Length = 353
Score = 48.8 bits (111), Expect = 1e-04
Identities = 30/95 (31%), Positives = 46/95 (48%)
Frame = +2
Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
+I +HNQ+Y GL +YK+ +NK D E + + G+ N Y +G R +
Sbjct: 74 MIDEHNQRYSKGLETYKVDLNKMSDWTEEE-KERLRGYYP----NLTEYAEGDLSRIIR- 127
Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+P+ D+RK V DQG+CG C+ F
Sbjct: 128 -GNITTTIPKSFDYRKKITVLPASDQGRCGVCFIF 161
>UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease
containing protein; n=1; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 360
Score = 48.4 bits (110), Expect = 2e-04
Identities = 28/83 (33%), Positives = 40/83 (48%)
Frame = +2
Query: 305 LVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQV 484
LV K+G+N++ D+ H EF G KH+K+ + + P + LP
Sbjct: 83 LVFSKVGVNQFADLTHEEFKALYTGH----KHSKD--DDDDDNKNKQPHLPTD-NLPASF 135
Query: 485 DWRKHGAVPTFKDQGKCGSCWSF 553
DWR GA+ K Q CG CW+F
Sbjct: 136 DWRDKGAITPVKVQNGCGGCWAF 158
>UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8;
Magnoliophyta|Rep: Os12g0273800 protein - Oryza sativa
subsp. japonica (Rice)
Length = 504
Score = 48.4 bits (110), Expect = 2e-04
Identities = 26/74 (35%), Positives = 37/74 (50%)
Frame = +2
Query: 314 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 493
Y LG+N++ D+ EF TM + N + + G K+ + + LP VDWR
Sbjct: 86 YWLGVNQFADLTSEEFKATMTNSKGFSTPNNGVRVS----TGFKYENVSADALPASVDWR 141
Query: 494 KHGAVPTFKDQGKC 535
GAV KDQG+C
Sbjct: 142 TKGAVTRIKDQGQC 155
>UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11;
Entamoeba|Rep: Cysteine proteinase 2 precursor -
Entamoeba histolytica
Length = 315
Score = 48.4 bits (110), Expect = 2e-04
Identities = 17/31 (54%), Positives = 23/31 (74%)
Frame = +2
Query: 461 NVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
N++ PE VDWRK G V +DQ +CGSC++F
Sbjct: 91 NIQAPESVDWRKEGKVTPIRDQAQCGSCYTF 121
>UniRef50_Q650X9 Cluster: Putative cysteine proteinase; n=1; Oryza
sativa (japonica cultivar-group)|Rep: Putative cysteine
proteinase - Oryza sativa subsp. japonica (Rice)
Length = 361
Score = 48.0 bits (109), Expect = 2e-04
Identities = 31/79 (39%), Positives = 35/79 (44%), Gaps = 1/79 (1%)
Frame = +2
Query: 308 VSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANV-KLPEQV 484
+SYKLG+NK+ DM EF G A V A P V P
Sbjct: 78 MSYKLGLNKFSDMTVEEFAAKYTGVQVDAG--------AAVVTSAPDEQPVLVGDAPPVW 129
Query: 485 DWRKHGAVPTFKDQGKCGS 541
DWR HGAV KDQG CG+
Sbjct: 130 DWRDHGAVTPVKDQGSCGT 148
>UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep:
Cysteine protease - Saprolegnia parasitica
Length = 523
Score = 48.0 bits (109), Expect = 2e-04
Identities = 24/81 (29%), Positives = 40/81 (49%)
Frame = +2
Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
S+ +G N+Y + EF K G + + + + A ++ +V P ++DW
Sbjct: 68 SFTMGHNEYSHLTFDEFKKLRTGLRVSPSY---IQSRAKYALMAPAVNMTDV--PNEMDW 122
Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
+ G V K+QG CGSCW+F
Sbjct: 123 VEQGGVTPVKNQGMCGSCWAF 143
>UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18;
Plasmodium|Rep: Cysteine proteinase precursor -
Plasmodium vivax (strain Salvador I)
Length = 583
Score = 48.0 bits (109), Expect = 2e-04
Identities = 34/103 (33%), Positives = 50/103 (48%), Gaps = 9/103 (8%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSV----RG 439
I KHN+ +M YK+ +N++ D +F H K Y+ S +G
Sbjct: 268 IKKHNETNQM----YKMKVNQFSDYSKKDFESYFRKLVPIPDHLKKKYVVPFSSMNNGKG 323
Query: 440 AKFI---SPANV--KLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ S AN+ +PE +D+R+ G V KDQG CGSCW+F
Sbjct: 324 KNVVTSSSGANLLADVPEILDYREKGIVHEPKDQGLCGSCWAF 366
>UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27;
Eukaryota|Rep: Cysteine proteinase GP-I - Zingiber
officinale (Ginger)
Length = 221
Score = 48.0 bits (109), Expect = 2e-04
Identities = 17/28 (60%), Positives = 22/28 (78%)
Frame = +2
Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
LP+ +DWR+ GAV K+QG CGSCW+F
Sbjct: 3 LPDSIDWREKGAVVPVKNQGGCGSCWAF 30
>UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19;
Bilateria|Rep: Cathepsin F precursor - Homo sapiens
(Human)
Length = 484
Score = 48.0 bits (109), Expect = 2e-04
Identities = 32/94 (34%), Positives = 44/94 (46%), Gaps = 1/94 (1%)
Frame = +2
Query: 275 AKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMK-GGSVRGAKFI 451
A+ Q + G Y G+ K+ D+ EF +T N L + G ++ AK +
Sbjct: 218 AQKIQALDRGTAQY--GVTKFSDLTEEEF--------RTIYLNTLLRKEPGNKMKQAKSV 267
Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
P + DWR GAV KDQG CGSCW+F
Sbjct: 268 GDL---APPEWDWRSKGAVTKVKDQGMCGSCWAF 298
>UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin O
precursor; n=1; Tribolium castaneum|Rep: PREDICTED:
similar to Cathepsin O precursor - Tribolium castaneum
Length = 326
Score = 47.6 bits (108), Expect = 3e-04
Identities = 28/90 (31%), Positives = 42/90 (46%)
Frame = +2
Query: 284 NQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPAN 463
N K G Y G+ K+ D+L EF +T N + K + N + R
Sbjct: 70 NSKKRNGSALY--GLTKFSDLLPEEFFQTYLQSNLSQKTHSNEPKRHHHKRAT------- 120
Query: 464 VKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+P +VDWR+ AV +QG CG+CW++
Sbjct: 121 --VPNKVDWREKNAVTRIYNQGSCGACWAY 148
>UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza
sativa|Rep: Os09g0381400 protein - Oryza sativa subsp.
japonica (Rice)
Length = 362
Score = 47.6 bits (108), Expect = 3e-04
Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 2/86 (2%)
Frame = +2
Query: 302 GLVSYKLGMNKYGDMLHHEFVKTMNGFNK-TAKHNKNLYMKGGSVRGAKFISPANVKLPE 478
G ++Y+L N++ D+ EF+ T G+ + ++ G A F V +P
Sbjct: 89 GDLTYQLAENEFADLTEEEFLATYTGYYAGDGPVDDSVITTGAGDVDASF--SYRVDVPA 146
Query: 479 QVDWRKHGAVPTFKDQ-GKCGSCWSF 553
VDWR GAV K Q C SCW+F
Sbjct: 147 SVDWRAQGAVVPPKSQTSTCSSCWAF 172
>UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lamblia
ATCC 50803|Rep: GLP_26_49243_47612 - Giardia lamblia
ATCC 50803
Length = 543
Score = 47.6 bits (108), Expect = 3e-04
Identities = 17/30 (56%), Positives = 20/30 (66%)
Frame = +2
Query: 464 VKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
V+ P Q+DWR G + KDQ CGSCWSF
Sbjct: 314 VQFPRQLDWRVRGVITPVKDQAACGSCWSF 343
>UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides
sonorensis|Rep: Cathepsin L - Culicoides sonorensis
Length = 331
Score = 47.6 bits (108), Expect = 3e-04
Identities = 29/95 (30%), Positives = 48/95 (50%), Gaps = 1/95 (1%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 451
+ +HN +Y G+ +Y+ G+N++ D+ + EF K G + N+ + G +
Sbjct: 58 VMEHNARYLSGMETYEKGVNQFSDLTYEEFAKLYLG--EKISFNELMTNADGWIE----- 110
Query: 452 SPANVKL-PEQVDWRKHGAVPTFKDQGKCGSCWSF 553
P +L PE W VP K+Q +CGSCW+F
Sbjct: 111 KPLRRQLAPESYAWDTKD-VPV-KNQAQCGSCWAF 143
>UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 513
Score = 47.6 bits (108), Expect = 3e-04
Identities = 37/116 (31%), Positives = 52/116 (44%), Gaps = 7/116 (6%)
Frame = +2
Query: 227 RRQFPHEDIPEHKHIIAKHNQKYEMGL----VSYKLGMNKYGDMLHHEFVKTMNGFNKTA 394
R+++P E + I +HN ++ + Y L N DM E V M G
Sbjct: 218 RKRYPSAHEHEKRKDIYRHNMRFIKSRNRQHLGYSLKPNHMADMTDAE-VNRMKGL---- 272
Query: 395 KHNKNLYMKGGSVRGAKFISP---ANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
L+ + + + F P V LP VDWRK GAV + K QG CGSC++F
Sbjct: 273 -----LHEEPPLIGDSPFSIPDKDRGVPLPPHVDWRKAGAVNSVKSQGICGSCYAF 323
>UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue of
Sarcophaga 26,29kDa proteinase; n=1; Nasonia
vitripennis|Rep: PREDICTED: similar to homologue of
Sarcophaga 26,29kDa proteinase - Nasonia vitripennis
Length = 553
Score = 47.2 bits (107), Expect = 4e-04
Identities = 31/119 (26%), Positives = 53/119 (44%), Gaps = 4/119 (3%)
Frame = +2
Query: 209 QLRKRGRRQFPHEDIPEHKHIIAKHNQKYEMGL----VSYKLGMNKYGDMLHHEFVKTMN 376
+ +K + + H+ + + +HN ++ + + + L +N D E +K +
Sbjct: 250 RFKKTHNKNYAHDLEHKQRKEHFRHNLRFIHSINRANLGFTLDVNHLADRNEAE-LKVLR 308
Query: 377 GFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
G T +H N G + + +P+ DWR +GAV KDQ CGSCWSF
Sbjct: 309 GKQYT-QHGYN-----GGMPFPHDVEKEKADVPDSFDWRLYGAVTPVKDQSVCGSCWSF 361
>UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,
partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED:
hypothetical protein, partial - Ornithorhynchus anatinus
Length = 224
Score = 47.2 bits (107), Expect = 4e-04
Identities = 19/33 (57%), Positives = 21/33 (63%)
Frame = +2
Query: 455 PANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
PA E DWRK GAV K+QG CGSCW+F
Sbjct: 126 PAGPLRAETCDWRKEGAVTPVKNQGDCGSCWAF 158
>UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila
melanogaster|Rep: CG11459-PA - Drosophila melanogaster
(Fruit fly)
Length = 336
Score = 47.2 bits (107), Expect = 4e-04
Identities = 30/113 (26%), Positives = 56/113 (49%), Gaps = 2/113 (1%)
Frame = +2
Query: 221 RGRRQFPHEDIPEHKHI-IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAK 397
R R ++ H + E + + + HNQ Y G V++K+G+NK+ D + +
Sbjct: 42 RNRDKY-HRALYEQRVLAVESHNQLYLQGKVAFKMGLNKFSDTDQRILFNYRSSIPAPLE 100
Query: 398 HNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQG-KCGSCWSF 553
+ N + +V ++ ++ E +DWR++G + DQG +C SCW+F
Sbjct: 101 TSTNALTE--TVNYKRYD-----QITEGIDWRQYGYISPVGDQGTECLSCWAF 146
>UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin W,
partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED:
similar to Cathepsin W, partial - Ornithorhynchus
anatinus
Length = 229
Score = 46.4 bits (105), Expect = 7e-04
Identities = 16/26 (61%), Positives = 20/26 (76%)
Frame = +2
Query: 476 EQVDWRKHGAVPTFKDQGKCGSCWSF 553
E DWRK GA+ + K+QG CGSCW+F
Sbjct: 70 ETCDWRKRGAITSVKNQGSCGSCWAF 95
>UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium
falciparum|Rep: Falcipain 2 - Plasmodium falciparum
Length = 484
Score = 46.4 bits (105), Expect = 7e-04
Identities = 29/101 (28%), Positives = 42/101 (41%), Gaps = 2/101 (1%)
Frame = +2
Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGF--NKTAKHNKNLYMKGGS 430
++ H + HN YK +N++ D+ +HEF +K K++K L +
Sbjct: 191 QNAHKVNMHNNNKNS---LYKKELNRFADLTYHEFKNKYLSLRSSKPLKNSKYLLDQMNY 247
Query: 431 VRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
K DWR H V KDQ CGSCW+F
Sbjct: 248 EEVIKKYRGEENFDHAAYDWRLHSGVTPVKDQKNCGSCWAF 288
>UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11;
Trichomonadidae|Rep: Cysteine protease - Tritrichomonas
foetus (Trichomonas foetus)
Length = 315
Score = 46.4 bits (105), Expect = 7e-04
Identities = 30/93 (32%), Positives = 45/93 (48%)
Frame = +2
Query: 275 AKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFIS 454
A++ Q++ G + + +NK+ + E+ K M G+ K K RG K
Sbjct: 49 ARYVQEHNAGDSKFTVSLNKFAALTPSEY-KVMLGYKTGMKAEK-------VSRGMK--- 97
Query: 455 PANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
NV + +DWR+ G V KDQ CGSCW+F
Sbjct: 98 KPNV---DSIDWREKGVVNEIKDQAACGSCWAF 127
>UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1;
Trypanosoma cruzi|Rep: Cysteine protease, putative -
Trypanosoma cruzi
Length = 434
Score = 46.4 bits (105), Expect = 7e-04
Identities = 27/82 (32%), Positives = 39/82 (47%), Gaps = 2/82 (2%)
Frame = +2
Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
SY+LG+NK+ DM EF NG + A + + K PE ++W
Sbjct: 80 SYRLGINKFSDMTKEEFNAKFNG--RVAAPQSTQSPQRAPYKRTK------ATFPEALNW 131
Query: 491 R--KHGAVPTFKDQGKCGSCWS 550
+ K+ + KDQG CGSCW+
Sbjct: 132 QEAKNPVLTPVKDQGSCGSCWA 153
>UniRef50_Q23FQ5 Cluster: Papain family cysteine protease containing
protein; n=4; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 332
Score = 46.4 bits (105), Expect = 7e-04
Identities = 36/118 (30%), Positives = 56/118 (47%), Gaps = 5/118 (4%)
Frame = +2
Query: 215 RKRGRRQFPHEDIPEHKHIIAKHN-QK---YEMGL-VSYKLGMNKYGDMLHHEFVKTMNG 379
R RR F +ED ++ ++ N QK +E +Y + +N++ D EFV+ +
Sbjct: 40 RSSYRRVFLNEDEETYRQLVFFENLQKLKTHEKNTEATYTVSLNQFSDYSQEEFVQRI-- 97
Query: 380 FNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
NK + K G + A V P VDWR GA+ ++QG+CGSC +F
Sbjct: 98 LNKHISRSDADIQKEQEPNGN--LRKA-VNYPTSVDWRNSGALNPIQNQGQCGSCAAF 152
>UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2;
Caenorhabditis|Rep: Putative uncharacterized protein -
Caenorhabditis elegans
Length = 383
Score = 46.4 bits (105), Expect = 7e-04
Identities = 26/78 (33%), Positives = 40/78 (51%)
Frame = +2
Query: 320 LGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKH 499
L +N++ D E K + NK K++ + GS I PA++ DWR+
Sbjct: 125 LDVNEFTDWTDEELQKMVQE-NKYTKYDFDTPKFEGSYLETGVIRPASI------DWREQ 177
Query: 500 GAVPTFKDQGKCGSCWSF 553
G + K+QG+CGSCW+F
Sbjct: 178 GKLTPIKNQGQCGSCWAF 195
>UniRef50_Q22A69 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 330
Score = 46.4 bits (105), Expect = 7e-04
Identities = 23/77 (29%), Positives = 37/77 (48%)
Frame = +2
Query: 323 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHG 502
G+ ++ D+ H EF G+ ++++ S+ F +P +DW G
Sbjct: 73 GITQFADLTHEEFADMYLGYKPQLRNSQAKV----SLSSTPFTAPT------AIDWTTKG 122
Query: 503 AVPTFKDQGKCGSCWSF 553
AV K+QG CGSCW+F
Sbjct: 123 AVTPVKNQGSCGSCWAF 139
>UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10;
Dictyostelium discoideum|Rep: Cysteine proteinase 7
precursor - Dictyostelium discoideum (Slime mold)
Length = 460
Score = 46.4 bits (105), Expect = 7e-04
Identities = 17/25 (68%), Positives = 19/25 (76%)
Frame = +2
Query: 479 QVDWRKHGAVPTFKDQGKCGSCWSF 553
QVDWR GAV K+QG+CG CWSF
Sbjct: 113 QVDWRTQGAVTPIKNQGQCGGCWSF 137
>UniRef50_Q23894 Cluster: Cysteine proteinase 3; n=2; Dictyostelium
discoideum|Rep: Cysteine proteinase 3 - Dictyostelium
discoideum (Slime mold)
Length = 151
Score = 46.4 bits (105), Expect = 7e-04
Identities = 28/75 (37%), Positives = 39/75 (52%)
Frame = +2
Query: 320 LGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKH 499
LG+N++ D+ + E+ +N A N Y K G + P + K P VDWR+
Sbjct: 31 LGLNQHADLSNEEY--RLNYLGTRAHIKLNGYHKRNL--GLRLNRP-HFKQPLNVDWREK 85
Query: 500 GAVPTFKDQGKCGSC 544
AV KDQG+CGSC
Sbjct: 86 DAVTPVKDQGQCGSC 100
>UniRef50_Q23VA1 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 397
Score = 46.0 bits (104), Expect = 0.001
Identities = 16/28 (57%), Positives = 20/28 (71%)
Frame = +2
Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+P+ VDWR G V KDQG+CG CW+F
Sbjct: 180 VPQSVDWRIQGKVSPVKDQGRCGCCWAF 207
>UniRef50_Q23H15 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 370
Score = 46.0 bits (104), Expect = 0.001
Identities = 17/28 (60%), Positives = 20/28 (71%)
Frame = +2
Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
L +DWR GAV + K+QG CGSCWSF
Sbjct: 162 LAASIDWRTKGAVTSVKNQGNCGSCWSF 189
>UniRef50_Q22RR8 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 332
Score = 46.0 bits (104), Expect = 0.001
Identities = 34/117 (29%), Positives = 54/117 (46%), Gaps = 2/117 (1%)
Frame = +2
Query: 260 HKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRG 439
H+ + + N K G +Y G+ K+ D+ EF + +N + +
Sbjct: 62 HRFAVFRDNLKKIEGHSNY--GITKFMDLTSEEFQQRYLRLKTNTIKRQNFK---SNPKN 116
Query: 440 AKFISPANVKLPEQV--DWRKHGAVPTFKDQGKCGSCWSFQHDWELWKDSTSVSPAT 604
A+ N+KL + + DW K GAV KDQ +CGSCW+F L + +T +S T
Sbjct: 117 AQL----NMKLGDDIIIDWTKKGAVTPVKDQEQCGSCWAFSATGAL-ESATFISTGT 168
>UniRef50_Q22RC9 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 389
Score = 46.0 bits (104), Expect = 0.001
Identities = 34/95 (35%), Positives = 44/95 (46%)
Frame = +2
Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
II++ NQ E G Y G+ ++ DM EF K+ T N G G +
Sbjct: 69 IISELNQ-VEEGTAEY--GITQFSDMTTEEF-KSQILIPSTYARN----FTGSRYHGFQK 120
Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
IS P DWR HGAV K+QG G+CW+F
Sbjct: 121 ISQ---DAPTSYDWRDHGAVTPVKNQGTVGTCWTF 152
>UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like
cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan
CA, family C1, cathepsin L-like cysteine peptidase -
Trichomonas vaginalis G3
Length = 306
Score = 46.0 bits (104), Expect = 0.001
Identities = 17/33 (51%), Positives = 24/33 (72%), Gaps = 2/33 (6%)
Frame = +2
Query: 461 NVK--LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
N+K +P ++DWR+ G V K+QG CGSCW+F
Sbjct: 83 NIKNDVPTEIDWREQGIVNKIKNQGACGSCWAF 115
>UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2;
Nucleopolyhedrovirus|Rep: Viral cathepsin - Spodoptera
litura multicapsid nucleopolyhedrovirus (SpltMNPV)
Length = 337
Score = 46.0 bits (104), Expect = 0.001
Identities = 23/77 (29%), Positives = 36/77 (46%)
Frame = +2
Query: 323 GMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHG 502
G+NK+ D+ FV G ++ + + ++ + + PE DWRK
Sbjct: 77 GINKFSDIDKITFVNEHAGLVSNLINSTDSNFDPYRLCEYVTVAGPSARTPESFDWRKLN 136
Query: 503 AVPTFKDQGKCGSCWSF 553
V K+QG CGSCW+F
Sbjct: 137 KVTKVKEQGVCGSCWAF 153
>UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|Rep:
Cathepsin L precursor - Schistosoma mansoni (Blood
fluke)
Length = 319
Score = 46.0 bits (104), Expect = 0.001
Identities = 16/28 (57%), Positives = 21/28 (75%)
Frame = +2
Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+P+ DWR+ GAV K+QG CGSCW+F
Sbjct: 105 IPKNFDWREKGAVTEVKNQGMCGSCWAF 132
>UniRef50_Q24HI2 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 356
Score = 45.6 bits (103), Expect = 0.001
Identities = 36/140 (25%), Positives = 62/140 (44%), Gaps = 2/140 (1%)
Frame = +2
Query: 272 IAKHNQKYEMGLVSYKLGMNK-YGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
+ +HN+K +Y L ++ + M +FV G ++ L +K + K
Sbjct: 68 VREHNKKVN---ATYTLSIDSPFAFMSDEQFVTEYLG-SQDCSATAELTLK----KPMKI 119
Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSFQHDWELWKDSTSVSPATWCRFFGAK 628
+ NV++PE ++W+ V KDQ CGSCW+F +T + + F +
Sbjct: 120 QNKKNVQVPESINWKDLNKVSPVKDQQNCGSCWTF--------STTGAIESHYAIFEDVE 171
Query: 629 PSSTAREQLRGTTGC-NRGG 685
P+S + +QL G N G
Sbjct: 172 PTSLSEQQLIDCAGAFNNNG 191
>UniRef50_Q22W19 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 332
Score = 45.6 bits (103), Expect = 0.001
Identities = 23/63 (36%), Positives = 31/63 (49%), Gaps = 1/63 (1%)
Frame = +2
Query: 368 TMNGFNKTAKHNKNLYMKGGSVRG-AKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSC 544
T+N F K N KG R + I + +DWR+ AV K+QG+CGSC
Sbjct: 88 TLNAFAIYTKDEFNQLFKGYQKRQKSHLIYSLKGDVAPSIDWRQKNAVTPVKNQGQCGSC 147
Query: 545 WSF 553
W+F
Sbjct: 148 WAF 150
>UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep:
Cathepsin L - Kudoa thyrsites
Length = 300
Score = 45.6 bits (103), Expect = 0.001
Identities = 17/28 (60%), Positives = 20/28 (71%)
Frame = +2
Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
LP VDW+ G V + K+QG CGSCWSF
Sbjct: 102 LPSSVDWKALGKVTSVKNQGHCGSCWSF 129
>UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, whole
genome shotgun sequence; n=7; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_22,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 350
Score = 45.6 bits (103), Expect = 0.001
Identities = 16/24 (66%), Positives = 19/24 (79%)
Frame = +2
Query: 482 VDWRKHGAVPTFKDQGKCGSCWSF 553
+DWR GAV KDQG+CGSCW+F
Sbjct: 146 IDWRTRGAVNKVKDQGQCGSCWAF 169
>UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome
shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome 20 SCAF14744, whole genome shotgun sequence -
Tetraodon nigroviridis (Green puffer)
Length = 175
Score = 45.2 bits (102), Expect = 0.002
Identities = 25/81 (30%), Positives = 38/81 (46%)
Frame = +2
Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDW 490
S K G+N++ D+ EF K+LY++ + R F LP + DW
Sbjct: 20 SAKYGINQFSDLSEREF--------------KDLYLRASADRAPVFTGQKIKGLPARFDW 65
Query: 491 RKHGAVPTFKDQGKCGSCWSF 553
R + V ++Q CGSCW+F
Sbjct: 66 RDNAVVGPVQNQQACGSCWAF 86
>UniRef50_Q23TW3 Cluster: Papain family cysteine protease containing
protein; n=5; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 437
Score = 45.2 bits (102), Expect = 0.002
Identities = 18/30 (60%), Positives = 22/30 (73%), Gaps = 1/30 (3%)
Frame = +2
Query: 467 KLPEQVDWRKHGAVPTFKDQGK-CGSCWSF 553
+LP+ VDWR+ G V K QGK CGSCW+F
Sbjct: 204 QLPQYVDWREKGVVTQVKSQGKDCGSCWAF 233
>UniRef50_Q239L8 Cluster: Papain family cysteine protease containing
protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 323
Score = 45.2 bits (102), Expect = 0.002
Identities = 16/25 (64%), Positives = 19/25 (76%)
Frame = +2
Query: 479 QVDWRKHGAVPTFKDQGKCGSCWSF 553
++DW GAV KDQG+CGSCWSF
Sbjct: 126 EIDWTTKGAVTPVKDQGQCGSCWSF 150
>UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1;
Bigelowiella natans|Rep: Digestive cysteine proteinase -
Bigelowiella natans (Pedinomonas minutissima)
(Chlorarachnion sp.(strain CCMP 621))
Length = 360
Score = 44.8 bits (101), Expect = 0.002
Identities = 32/100 (32%), Positives = 45/100 (45%), Gaps = 1/100 (1%)
Frame = +2
Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVR 436
E++ II N+ E+G Y G ++ DM +F M F +N
Sbjct: 50 ENERIIQGLNEN-ELGSAVY--GHTRFSDMSPEQFRAMMTPFKYHTDEAEN--------- 97
Query: 437 GAKFISPAN-VKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
A + N VK+ + DWR A+ KDQG CGSCW+F
Sbjct: 98 -AAYDQNKNAVKVTDSFDWRDFNALTPVKDQGGCGSCWAF 136
>UniRef50_Q24FN2 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 367
Score = 44.8 bits (101), Expect = 0.002
Identities = 15/26 (57%), Positives = 20/26 (76%)
Frame = +2
Query: 476 EQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ +DWR+ GAV K+QG CGSCW+F
Sbjct: 157 QSIDWRQSGAVSPVKNQGSCGSCWAF 182
>UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia
bovis|Rep: Cysteine protease 2 - Babesia bovis
Length = 445
Score = 44.8 bits (101), Expect = 0.002
Identities = 16/26 (61%), Positives = 19/26 (73%)
Frame = +2
Query: 476 EQVDWRKHGAVPTFKDQGKCGSCWSF 553
E +DWR+ AV KDQG CGSCW+F
Sbjct: 238 EDIDWRRADAVTPVKDQGMCGSCWAF 263
>UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 514
Score = 44.4 bits (100), Expect = 0.003
Identities = 31/108 (28%), Positives = 54/108 (50%), Gaps = 1/108 (0%)
Frame = +2
Query: 230 RQFPHE-DIPEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 406
+Q+ E ++ + KHI +HN +Y + L KY +H FV +G K +
Sbjct: 229 KQYDSEHEVSKRKHIF-RHNMRYIRSINRKNL---KYKLAPNH-FVDLTDGEYDQHKGDS 283
Query: 407 NLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWS 550
+ + G + + V +P+++DWR +GAV + QG CGSC++
Sbjct: 284 IITLYGPYSNMSHVLQ--RVDVPDELDWRDYGAVSPVRGQGICGSCYA 329
>UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14;
Leishmania|Rep: Cysteine proteinase 1 precursor -
Leishmania pifanoi
Length = 354
Score = 44.4 bits (100), Expect = 0.003
Identities = 27/74 (36%), Positives = 39/74 (52%)
Frame = +2
Query: 332 KYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVP 511
K+ D+ EF K + A+H K+ + + V + +P+ V VDWR GAV
Sbjct: 90 KFADLTPQEFAKLYLNPDYYARHLKD-HKEDVHVDDS---APSGVM---SVDWRDKGAVT 142
Query: 512 TFKDQGKCGSCWSF 553
K+QG CGSCW+F
Sbjct: 143 PVKNQGLCGSCWAF 156
>UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome
shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome 21 SCAF14577, whole genome shotgun sequence -
Tetraodon nigroviridis (Green puffer)
Length = 406
Score = 44.0 bits (99), Expect = 0.004
Identities = 27/103 (26%), Positives = 47/103 (45%), Gaps = 8/103 (7%)
Frame = +2
Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAK-------HNKNLYMKGG 427
++A+HN + G S+ L +N D++ + + ++ + NL ++
Sbjct: 80 LVARHNLEASAGKHSFTLELNHLADLVRRVLLLQPSLASERVRLTAEEINEMNNLKVEER 139
Query: 428 S-VRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ VR + P VDWRK G V ++QG C SCW+F
Sbjct: 140 APVRNGTSEEKLGFETPPSVDWRKAGLVSPVQNQGFCNSCWAF 182
>UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2;
Oryza sativa|Rep: Putative uncharacterized protein -
Oryza sativa subsp. indica (Rice)
Length = 149
Score = 44.0 bits (99), Expect = 0.004
Identities = 16/28 (57%), Positives = 20/28 (71%)
Frame = +2
Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+P+ +DWRK GAV K Q CGSCW+F
Sbjct: 17 MPKSIDWRKKGAVVEVKYQEDCGSCWAF 44
>UniRef50_Q24E33 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 328
Score = 44.0 bits (99), Expect = 0.004
Identities = 15/28 (53%), Positives = 20/28 (71%)
Frame = +2
Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+P +V+W GAV K+QG CGSCW+F
Sbjct: 127 IPSEVNWTAQGAVTPVKNQGSCGSCWAF 154
>UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia
tropicalis|Rep: Blo t 1 allergen - Blomia tropicalis
(Mite)
Length = 333
Score = 44.0 bits (99), Expect = 0.004
Identities = 24/84 (28%), Positives = 39/84 (46%)
Frame = +2
Query: 302 GLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQ 481
G+ + +N+Y DM EF F+ + YMK + + + + LP+
Sbjct: 64 GIDGVEYAINEYSDMSEQEF-----SFHLSGGGLNFTYMKMEAAKEPLINTYGS--LPQN 116
Query: 482 VDWRKHGAVPTFKDQGKCGSCWSF 553
DWR+ + + QG CGSCW+F
Sbjct: 117 FDWRQKARLTRIRQQGSCGSCWAF 140
>UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2;
Entamoeba|Rep: Cysteine proteinase ACP1 precursor -
Entamoeba histolytica
Length = 308
Score = 44.0 bits (99), Expect = 0.004
Identities = 27/76 (35%), Positives = 38/76 (50%)
Frame = +2
Query: 326 MNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGA 505
+N + DM H EF++T G + +V+ A + A PE VDWR
Sbjct: 57 LNVFADMTHEEFIQTHLGMTYEVPETTS------NVKAA--VKAA----PESVDWR--SI 102
Query: 506 VPTFKDQGKCGSCWSF 553
+ KDQG+CGSCW+F
Sbjct: 103 MNPAKDQGQCGSCWTF 118
>UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lamblia
ATCC 50803|Rep: GLP_26_47548_45815 - Giardia lamblia
ATCC 50803
Length = 577
Score = 43.6 bits (98), Expect = 0.005
Identities = 16/31 (51%), Positives = 21/31 (67%)
Frame = +2
Query: 461 NVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
N LP+++DWR G + KDQ CGSCW+F
Sbjct: 341 NEDLPQELDWRVRGIMNMAKDQVACGSCWTF 371
>UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia
intestinalis|Rep: GLP_90_15278_13989 - Giardia lamblia
ATCC 50803
Length = 429
Score = 43.6 bits (98), Expect = 0.005
Identities = 16/32 (50%), Positives = 23/32 (71%)
Frame = +2
Query: 458 ANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
A LP+ VD R++G + ++QGKCGSCW+F
Sbjct: 56 AEDNLPQSVDLREYGLMTPVRNQGKCGSCWAF 87
>UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like
cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
Clan CA, family C1, cathepsin L-like cysteine peptidase
- Trichomonas vaginalis G3
Length = 291
Score = 43.6 bits (98), Expect = 0.005
Identities = 29/98 (29%), Positives = 46/98 (46%)
Frame = +2
Query: 260 HKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRG 439
+K+ + HN+ +YKL +N + E+ + K +KNL +G VR
Sbjct: 23 NKNFVETHNKAN----ANYKLSLNSLSHLTPTEYQSLLG-----TKIDKNLVSQGKKVR- 72
Query: 440 AKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
P P +D+R+ G V +DQ +CGSCW+F
Sbjct: 73 -----PQIKDSPGILDYREMGVVNPIRDQKQCGSCWAF 105
>UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza
sativa|Rep: Putative cysteine protease - Oryza sativa
subsp. japonica (Rice)
Length = 357
Score = 43.2 bits (97), Expect = 0.006
Identities = 24/76 (31%), Positives = 37/76 (48%)
Frame = +2
Query: 326 MNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGA 505
+N++ D+ + EFV T G + + + + P + +P +DWR GA
Sbjct: 90 INQFADLTNGEFVATYTGVKQPPPAT---HPHPHPEEAPRPVDP--IWMPCCIDWRFKGA 144
Query: 506 VPTFKDQGKCGSCWSF 553
V KDQG CGS W+F
Sbjct: 145 VTGVKDQGACGSSWAF 160
>UniRef50_Q2QS15 Cluster: Papain family cysteine protease containing
protein; n=1; Oryza sativa (japonica
cultivar-group)|Rep: Papain family cysteine protease
containing protein - Oryza sativa subsp. japonica (Rice)
Length = 351
Score = 43.2 bits (97), Expect = 0.006
Identities = 17/28 (60%), Positives = 19/28 (67%)
Frame = +2
Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
LP+ VDWRK GAV K CGSCW+F
Sbjct: 145 LPKSVDWRKKGAVVEVKYHEDCGSCWAF 172
>UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like
cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
Clan CA, family C1, cathepsin L-like cysteine peptidase
- Trichomonas vaginalis G3
Length = 234
Score = 43.2 bits (97), Expect = 0.006
Identities = 15/28 (53%), Positives = 21/28 (75%)
Frame = +2
Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+P+++D+R GAV KDQ CGSCW+F
Sbjct: 18 IPDEIDYRTKGAVNEIKDQKHCGSCWAF 45
>UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-like
cysteine peptidase; n=3; Trichomonas vaginalis|Rep: Clan
CA, family C1, cathepsin L or K-like cysteine peptidase
- Trichomonas vaginalis G3
Length = 320
Score = 43.2 bits (97), Expect = 0.006
Identities = 23/82 (28%), Positives = 45/82 (54%), Gaps = 1/82 (1%)
Frame = +2
Query: 311 SYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQV-D 487
+Y+L +N++ + + E+ K++ G ++K+N + ++ SP + K E D
Sbjct: 61 NYRLSLNQFSFLTNSEY-KSLLGGKVSSKNNDDSHL----------FSPQSKKSSEVTFD 109
Query: 488 WRKHGAVPTFKDQGKCGSCWSF 553
WR G + ++QG+CG CW+F
Sbjct: 110 WRTKGIINPIRNQGQCGLCWAF 131
>UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precursor;
n=3; Plasmodium (Laverania)|Rep: Trophozoite cysteine
proteinase precursor - Plasmodium falciparum
Length = 569
Score = 43.2 bits (97), Expect = 0.006
Identities = 17/29 (58%), Positives = 22/29 (75%)
Frame = +2
Query: 467 KLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
K+PE +D+R+ G V KDQG CGSCW+F
Sbjct: 332 KVPEILDYREKGIVHEPKDQGLCGSCWAF 360
>UniRef50_UPI0000E492F4 Cluster: PREDICTED: similar to cathepsin
L-like proteinase; n=2; Strongylocentrotus
purpuratus|Rep: PREDICTED: similar to cathepsin L-like
proteinase - Strongylocentrotus purpuratus
Length = 329
Score = 42.7 bits (96), Expect = 0.009
Identities = 27/95 (28%), Positives = 48/95 (50%)
Frame = +2
Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVR 436
++ ++ ++N+ Y+ G S+K+ MN++ D + K N F+ A NL + R
Sbjct: 54 KNNRLVDENNRAYDEGRRSFKMAMNEFADQ---DMSKVRNKFDVQA----NL-LNAERKR 105
Query: 437 GAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGS 541
+ S ++ LP DWRK G V ++QG+ S
Sbjct: 106 KSSGTSSSSSTLPSSWDWRKEGKVNPVRNQGQMNS 140
>UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas
foetus|Rep: TFCP2 protein - Tritrichomonas foetus
(Trichomonas foetus)
Length = 270
Score = 42.7 bits (96), Expect = 0.009
Identities = 15/27 (55%), Positives = 17/27 (62%)
Frame = +2
Query: 473 PEQVDWRKHGAVPTFKDQGKCGSCWSF 553
P DWR G V K+QG CGSCW+F
Sbjct: 51 PTSFDWRSEGKVNPIKNQGSCGSCWAF 77
>UniRef50_Q23H06 Cluster: Papain family cysteine protease containing
protein; n=18; Tetrahymena thermophila|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 349
Score = 42.7 bits (96), Expect = 0.009
Identities = 15/35 (42%), Positives = 21/35 (60%)
Frame = +2
Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
++ N + +DWR GAV K QG CG+CW+F
Sbjct: 134 LNSKNFTIATSIDWRSRGAVTQVKWQGNCGACWAF 168
>UniRef50_Q23FS6 Cluster: Papain family cysteine protease containing
protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 330
Score = 42.7 bits (96), Expect = 0.009
Identities = 17/34 (50%), Positives = 21/34 (61%)
Frame = +2
Query: 452 SPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
SP+ K V+W G V KDQG+CGSCW+F
Sbjct: 111 SPSTPKGQYDVNWVTRGKVSAVKDQGQCGSCWAF 144
>UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5;
Theileria|Rep: Cysteine proteinase precursor - Theileria
parva
Length = 440
Score = 42.7 bits (96), Expect = 0.009
Identities = 27/101 (26%), Positives = 45/101 (44%), Gaps = 13/101 (12%)
Frame = +2
Query: 290 KYEMGLVSYKLGMNKYGDMLHHEFVKTM-----------NGFNKTAKHNKNLYMKG--GS 430
K + G Y G+N++ D+ EF K NG+ + Y+K +
Sbjct: 157 KEQKGDEPYVKGINRFSDLTEREFYKLFPVMKPPKATYSNGYYLLSHMANKTYLKNLKKA 216
Query: 431 VRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ + + A + E +DWR+ +V + KDQ CG CW+F
Sbjct: 217 LNTDEDVDLAKLT-GENLDWRRSSSVTSVKDQSNCGGCWAF 256
>UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin O;
n=1; Danio rerio|Rep: PREDICTED: similar to cathepsin O
- Danio rerio
Length = 327
Score = 42.3 bits (95), Expect = 0.011
Identities = 21/58 (36%), Positives = 29/58 (50%), Gaps = 5/58 (8%)
Frame = +2
Query: 395 KHNKNLYMKGGSVRGAKFI-SPANVKL----PEQVDWRKHGAVPTFKDQGKCGSCWSF 553
K K Y+ + KF S + +K+ P + DWR HG V +QG CG CW+F
Sbjct: 90 KQFKEQYLTARAEAAPKFDQSKSEIKVKANNPPRFDWRDHGVVGPVHNQGSCGGCWAF 147
>UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2;
Arabidopsis thaliana|Rep: Putative cysteine proteinase -
Arabidopsis thaliana (Mouse-ear cress)
Length = 365
Score = 42.3 bits (95), Expect = 0.011
Identities = 33/107 (30%), Positives = 46/107 (42%), Gaps = 5/107 (4%)
Frame = +2
Query: 230 RQFPHEDIPEHKHIIAKHNQKY-----EMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTA 394
R + E E + + K N K+ MG SY LG+N++ D EF+ T G
Sbjct: 47 RVYKDESEKEMRLKVFKKNLKFIENFNNMGNQSYTLGVNEFTDWKTEEFLATHTGLRVNV 106
Query: 395 KHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKC 535
L+ K R +S +++ E DWR GAV K QG C
Sbjct: 107 TSLSELFNKTKPSRNWN-MSDIDME-DESKDWRDEGAVTPVKYQGAC 151
>UniRef50_Q5ZC39 Cluster: CRK1 protein-like; n=2; Oryza sativa
(japonica cultivar-group)|Rep: CRK1 protein-like - Oryza
sativa subsp. japonica (Rice)
Length = 374
Score = 41.9 bits (94), Expect = 0.015
Identities = 24/65 (36%), Positives = 31/65 (47%)
Frame = +3
Query: 489 GGSTAPSRHSRTKGSVAHAGPFSTTGSFGRTALPSVRLPGVASSEQNLHRLLGSSYGEQR 668
GG+ APS S + G A P S+ G+ TA P G A E+ L +G GE+R
Sbjct: 287 GGTAAPSSSSSSAGQSRSAVPSSSAGAAPATAGPMPASAGAAKRERGLEPTMGEREGERR 346
Query: 669 AATGG 683
A G
Sbjct: 347 GAGDG 351
>UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-ear
cress). SAG12 protein; n=2; Dictyostelium
discoideum|Rep: Similar to Arabidopsis thaliana
(Mouse-ear cress). SAG12 protein - Dictyostelium
discoideum (Slime mold)
Length = 358
Score = 41.9 bits (94), Expect = 0.015
Identities = 17/41 (41%), Positives = 24/41 (58%)
Frame = +2
Query: 431 VRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+ G K + ++ +DWRK G V KDQG+CGSC+ F
Sbjct: 132 INGYKEMENGDLNELYSIDWRKKGLVTPVKDQGQCGSCYIF 172
>UniRef50_A6N590 Cluster: Extracellular cysteine protease 8; n=10;
Eukaryota|Rep: Extracellular cysteine protease 8 -
Tritrichomonas foetus (Trichomonas foetus)
Length = 315
Score = 41.9 bits (94), Expect = 0.015
Identities = 27/78 (34%), Positives = 38/78 (48%)
Frame = +2
Query: 314 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 493
+ G+NK+ M E+ K + GF K +V+ K A+V E +DWR
Sbjct: 62 FTTGLNKFAAMTPSEY-KALLGFRMDLAQRK-------AVKSTK---KASV---ESLDWR 107
Query: 494 KHGAVPTFKDQGKCGSCW 547
+ G V KDQ +CGSCW
Sbjct: 108 EKGVVNPIKDQAQCGSCW 125
>UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like
cysteine peptidase; n=17; Trichomonas vaginalis|Rep:
Clan CA, family C1, cathepsin L-like cysteine peptidase
- Trichomonas vaginalis G3
Length = 318
Score = 41.9 bits (94), Expect = 0.015
Identities = 15/28 (53%), Positives = 19/28 (67%)
Frame = +2
Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
+P+ VDWR V KDQ +CGSCW+F
Sbjct: 100 VPDAVDWRNAKIVNPIKDQAQCGSCWAF 127
>UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n=1;
Toxocara canis|Rep: Cathepsin L-like cysteine proteinase
- Toxocara canis (Canine roundworm)
Length = 360
Score = 41.5 bits (93), Expect = 0.020
Identities = 14/29 (48%), Positives = 19/29 (65%)
Frame = +2
Query: 467 KLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
++P+ DWR + V K Q KCGSCW+F
Sbjct: 144 EIPDHFDWRPYNVVTPVKSQFKCGSCWAF 172
>UniRef50_A0BNM1 Cluster: Chromosome undetermined scaffold_119,
whole genome shotgun sequence; n=1; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_119,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 341
Score = 41.5 bits (93), Expect = 0.020
Identities = 25/99 (25%), Positives = 54/99 (54%), Gaps = 1/99 (1%)
Frame = +2
Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKT-MNGFNKTAKHNKNLYMKGGSV 433
++K +I +HN++ E ++ +G N++ + + EFV +N + ++ ++ ++ +
Sbjct: 54 QNKQMIEEHNKRSEF---TFLMGENQFMAITNEEFVSLYLNPISPEKQNEQDQIIRKTNP 110
Query: 434 RGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWS 550
+ + I N+K + VDWR + V K+ G CGS W+
Sbjct: 111 KSPEPIREYNLK--DDVDWRGYAPV---KNSGNCGSSWA 144
>UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin O
precursor; n=2; Apocrita|Rep: PREDICTED: similar to
Cathepsin O precursor - Apis mellifera
Length = 374
Score = 41.1 bits (92), Expect = 0.026
Identities = 12/31 (38%), Positives = 20/31 (64%)
Frame = +2
Query: 461 NVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
++ +P + DWR G + + QG CG+CW+F
Sbjct: 152 SISIPLRFDWRDKGVITPVRSQGSCGACWAF 182
>UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryza
sativa (japonica cultivar-group)|Rep: Putative cysteine
proteinase - Oryza sativa subsp. japonica (Rice)
Length = 357
Score = 41.1 bits (92), Expect = 0.026
Identities = 26/84 (30%), Positives = 39/84 (46%)
Frame = +2
Query: 302 GLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQ 481
G S +L NK+ D+ + EF + T + GGS G + + +P
Sbjct: 88 GKKSPRLTTNKFADLTNEEFAEYYGRPFSTP-------VIGGS--GFMYGNVRTSDVPAN 138
Query: 482 VDWRKHGAVPTFKDQGKCGSCWSF 553
++WR GAV K+Q C SCW+F
Sbjct: 139 INWRDRGAVTQVKNQKDCASCWAF 162
>UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia
medanensis|Rep: Sui m 1 allergen - Suidasia medanensis
Length = 336
Score = 41.1 bits (92), Expect = 0.026
Identities = 29/117 (24%), Positives = 54/117 (46%), Gaps = 2/117 (1%)
Frame = +2
Query: 209 QLRKRGRRQFPHEDIPEHKHIIAKHNQKYEMGLVSYKLGM--NKYGDMLHHEFVKTMNGF 382
Q ++ +Q+ E+ P+ + I ++ + + + G+ N++ D+ EF
Sbjct: 30 QFKELYGKQYTAEEEPQRRAIFEENLRWIQENHGKHGAGLEVNEHADLTAEEFSSMYATL 89
Query: 383 NKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
N+ A L+ + V S +V LP DWR+ ++QG+CGSCW+F
Sbjct: 90 NQEAFLKSPLHKEFVQVPE----SDISVALPAAFDWRQQWNTAV-RNQGQCGSCWAF 141
>UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3;
Theileria|Rep: Cysteine proteinase precursor - Theileria
annulata
Length = 441
Score = 41.1 bits (92), Expect = 0.026
Identities = 29/96 (30%), Positives = 44/96 (45%), Gaps = 16/96 (16%)
Frame = +2
Query: 314 YKLGMNKYGDMLHHEFV---------KTMNGFNKTAK-----HNKNLYM-KGGSVRGAKF 448
Y L +NK+ D+ EF KT +K + H +Y+ K +G +
Sbjct: 160 YSLDLNKFSDLSDEEFKALYPVITPPKTYTSLSKHLEFKKMSHKNPIYISKLKKAKGIEE 219
Query: 449 ISPANVKLPEQVDWRKHGAVPTFKDQG-KCGSCWSF 553
I ++ E ++W + AV KDQG CGSCW+F
Sbjct: 220 IKDLSLITGENLNWARTDAVSPIKDQGDHCGSCWAF 255
>UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16)
[Contains: Cathepsin H mini chain; Cathepsin H heavy
chain; Cathepsin H light chain]; n=37; Eukaryota|Rep:
Cathepsin H precursor (EC 3.4.22.16) [Contains:
Cathepsin H mini chain; Cathepsin H heavy chain;
Cathepsin H light chain] - Homo sapiens (Human)
Length = 335
Score = 41.1 bits (92), Expect = 0.026
Identities = 17/28 (60%), Positives = 19/28 (67%), Gaps = 1/28 (3%)
Frame = +2
Query: 473 PEQVDWRKHGA-VPTFKDQGKCGSCWSF 553
P VDWRK G V K+QG CGSCW+F
Sbjct: 117 PPSVDWRKKGNFVSPVKNQGACGSCWTF 144
>UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryza
sativa (japonica cultivar-group)|Rep: Putative cysteine
proteinase - Oryza sativa subsp. japonica (Rice)
Length = 385
Score = 40.7 bits (91), Expect = 0.034
Identities = 29/92 (31%), Positives = 44/92 (47%), Gaps = 10/92 (10%)
Frame = +2
Query: 308 VSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVD 487
++Y+LG+N++ DM EF G +T +L + G+V K PA +P +
Sbjct: 87 MTYRLGLNQFSDMTFEEFAGKFTG-GRTGSIAGDL--RDGAVTYCK--PPAVGYVPPSWN 141
Query: 488 WRKHGAVPTFKDQGKC----------GSCWSF 553
W K+G V K+Q C GSCW+F
Sbjct: 142 WTKYGVVTPVKNQLTCVNTIKMSMYEGSCWAF 173
>UniRef50_Q248G1 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 334
Score = 40.7 bits (91), Expect = 0.034
Identities = 16/30 (53%), Positives = 20/30 (66%), Gaps = 1/30 (3%)
Frame = +2
Query: 467 KLPEQVDWRK-HGAVPTFKDQGKCGSCWSF 553
++PE VDWR V K+QG CGSCW+F
Sbjct: 120 QIPESVDWRNVTNVVGPIKNQGHCGSCWTF 149
>UniRef50_Q22NW9 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 894
Score = 40.7 bits (91), Expect = 0.034
Identities = 23/72 (31%), Positives = 36/72 (50%)
Frame = +2
Query: 467 KLPEQVDWRKHGAVPTFKDQGKCGSCWSFQHDWELWKDSTSVSPATWCRFFGAKPSSTAR 646
++P +DWR AV K+QG CGS ++F L + +S W F + +R
Sbjct: 682 EVPSSIDWRDLNAVTPVKNQGSCGSGYAFSTTGAL-EGIHKISGKDWKGFSEQQIIDCSR 740
Query: 647 EQLRGTTGCNRG 682
+Q G +GC+ G
Sbjct: 741 KQ--GNSGCHGG 750
>UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathepsin
o - Aedes aegypti (Yellowfever mosquito)
Length = 375
Score = 40.7 bits (91), Expect = 0.034
Identities = 14/27 (51%), Positives = 18/27 (66%)
Frame = +2
Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWS 550
LP+ VDWR G V + QG CG+CW+
Sbjct: 153 LPKVVDWRDKGVVAPVRSQGSCGACWA 179
>UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1;
Methanospirillum hungatei JF-1|Rep: Peptidase C1A,
papain precursor - Methanospirillum hungatei (strain
JF-1 / DSM 864)
Length = 1096
Score = 40.7 bits (91), Expect = 0.034
Identities = 23/65 (35%), Positives = 34/65 (52%), Gaps = 4/65 (6%)
Frame = +2
Query: 416 MKGGSVRGAKFISPANVKLPEQVDWRKHGAVPT--FKDQGKCGSCWSFQHD--WELWKDS 583
+K ++ I+P LP DWR +G T K+QG CGSCW+F +E +K+
Sbjct: 304 LKSSTIVSGAGITPME-GLPTSFDWRNNGGDYTTPIKNQGSCGSCWAFATTGAFESYKEI 362
Query: 584 TSVSP 598
S +P
Sbjct: 363 KSGNP 367
>UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:
Viral cathepsin - Xestia c-nigrum granulosis virus
(XnGV) (Xestia c-nigrumgranulovirus)
Length = 346
Score = 40.7 bits (91), Expect = 0.034
Identities = 14/29 (48%), Positives = 20/29 (68%)
Frame = +2
Query: 467 KLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
K+P+ DWR +V + K Q +CGSCW+F
Sbjct: 132 KVPDSFDWRDRNSVTSVKMQKECGSCWAF 160
>UniRef50_Q94715 Cluster: Putative cathepsin L2 precursor; n=4;
Paramecium tetraurelia|Rep: Putative cathepsin L2
precursor - Paramecium tetraurelia
Length = 294
Score = 40.7 bits (91), Expect = 0.034
Identities = 30/107 (28%), Positives = 53/107 (49%), Gaps = 2/107 (1%)
Frame = +2
Query: 260 HKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRG 439
+K +I +HNQ+ + V+Y++G N++ + H EFV K + ++ + G S
Sbjct: 41 NKRMIEEHNQRED---VTYQMGENQFMTLSHEEFVDLY-----LQKSDSSVNIMGAS--- 89
Query: 440 AKFISPANVKLPEQVDWRKHGAVPTFKDQGKCGSCWSF--QHDWELW 574
+ ++ VDWR + T K+QG+C S W+F + E W
Sbjct: 90 ---LPEVQLEGLGAVDWRNY---TTVKEQGQCASGWAFSVSNSLEAW 130
>UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5;
Piroplasmida|Rep: Cysteine proteinase, putative -
Theileria parva
Length = 460
Score = 40.3 bits (90), Expect = 0.045
Identities = 32/105 (30%), Positives = 44/105 (41%), Gaps = 17/105 (16%)
Frame = +2
Query: 290 KYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFN-----KTAKHNKNL---------YMKG- 424
K G +Y +N + DM EF K T H L Y+K
Sbjct: 174 KIHQGHETYSREINSFADMTEEEFNKLFPPIKVPESKSTTSHVDRLMARMVSDETYLKNL 233
Query: 425 -GSVRGAKFISPANVKLPEQVDWRKHGAVPTFKDQG-KCGSCWSF 553
++ K + P N+ E +DWRK V K+QG +CGSCW+F
Sbjct: 234 KKALNTDKDVDPKNIT-GEGLDWRKADGVSKIKNQGLECGSCWAF 277
>UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2;
cellular organisms|Rep: Cysteine proteinase, putative -
Archaeoglobus fulgidus
Length = 1088
Score = 40.3 bits (90), Expect = 0.045
Identities = 13/27 (48%), Positives = 18/27 (66%)
Frame = +2
Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWS 550
LP + DWR + + +DQG CGSCW+
Sbjct: 594 LPSRFDWRDYTGLSAVRDQGSCGSCWA 620
>UniRef50_Q9SIE8 Cluster: Putative cysteine proteinase; n=1;
Arabidopsis thaliana|Rep: Putative cysteine proteinase -
Arabidopsis thaliana (Mouse-ear cress)
Length = 105
Score = 39.9 bits (89), Expect = 0.060
Identities = 25/72 (34%), Positives = 35/72 (48%)
Frame = +2
Query: 314 YKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWR 493
YKL +NK+ ++ EFV F+ + H K L K F + P+ +DWR
Sbjct: 35 YKLKLNKFANLTDVEFVNAHTCFDMS-DHKKILDSK-------PFFYENMTQAPDSLDWR 86
Query: 494 KHGAVPTFKDQG 529
+ GAV KDQG
Sbjct: 87 EKGAVTNVKDQG 98
>UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4;
Hymenostomatida|Rep: Phospholipase A1 - Tetrahymena
thermophila
Length = 320
Score = 39.9 bits (89), Expect = 0.060
Identities = 14/25 (56%), Positives = 17/25 (68%)
Frame = +2
Query: 479 QVDWRKHGAVPTFKDQGKCGSCWSF 553
+VDW G V K+QG CGSCW+F
Sbjct: 115 EVDWTAKGKVTPVKNQGSCGSCWAF 139
>UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma
japonicum|Rep: SJCHGC02853 protein - Schistosoma
japonicum (Blood fluke)
Length = 181
Score = 39.9 bits (89), Expect = 0.060
Identities = 15/35 (42%), Positives = 22/35 (62%), Gaps = 4/35 (11%)
Frame = +2
Query: 461 NVKLPEQVD----WRKHGAVPTFKDQGKCGSCWSF 553
N+KLP+ D W+ ++ T +DQ CGSCW+F
Sbjct: 79 NIKLPKYFDSRKYWKNCSSIRTIRDQSSCGSCWAF 113
>UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, whole
genome shotgun sequence; n=1; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_79,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 324
Score = 39.9 bits (89), Expect = 0.060
Identities = 29/102 (28%), Positives = 42/102 (41%), Gaps = 3/102 (2%)
Frame = +2
Query: 257 EHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFN--KTAKHNKNLYMKGGS 430
++ +I HN + G +Y + N++ D+ EF + F T K Y+ G
Sbjct: 62 QNAQLIEAHNND-KSGKYTYTMETNQFADLTEQEFAQKYLTFRPKSTNKSKSTDYVPNGQ 120
Query: 431 VRGAKFISPANVKLPEQVDWRKHGAVPTFKDQG-KCGSCWSF 553
R DW + G VP KDQG CGS W+F
Sbjct: 121 AR----------------DWVEEGKVPPIKDQGSSCGSSWAF 146
>UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, whole
genome shotgun sequence; n=3; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_2,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 376
Score = 39.9 bits (89), Expect = 0.060
Identities = 23/74 (31%), Positives = 36/74 (48%)
Frame = +2
Query: 332 KYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVP 511
K + H+ F++ +KT K K + K + + +P N P +DW K V
Sbjct: 126 KSDSLSHNSFLQA----DKTVKVVKKVVKKASATTKTEKATPKN---PPSLDWLKQ--VT 176
Query: 512 TFKDQGKCGSCWSF 553
+ QG+CGSCW+F
Sbjct: 177 EVQQQGRCGSCWAF 190
>UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16;
Plasmodium (Vinckeia)|Rep: Cysteine proteinase precursor
- Plasmodium vinckei
Length = 506
Score = 39.9 bits (89), Expect = 0.060
Identities = 29/97 (29%), Positives = 41/97 (42%), Gaps = 5/97 (5%)
Frame = +2
Query: 278 KHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKG--GSVRGAKFI 451
KHN+ ++Y +N+Y D EF K+ Y+ + I
Sbjct: 195 KHNEMVGKNGLTYVQKVNQYSDFSKEEFDNYFKKLLSVPMDLKSKYIVPLKKHLANTNLI 254
Query: 452 SPANVK--LPEQVDWR-KHGAVPTFKDQGKCGSCWSF 553
S N P+ D+R K +P KDQG CGSCW+F
Sbjct: 255 SVDNKSKDFPDSRDYRSKFNFLPP-KDQGNCGSCWAF 290
>UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30;
Nucleopolyhedrovirus|Rep: Viral cathepsin - Lymantria
dispar multicapsid nuclear polyhedrosis virus (LdMNPV)
Length = 356
Score = 39.9 bits (89), Expect = 0.060
Identities = 14/29 (48%), Positives = 19/29 (65%)
Frame = +2
Query: 467 KLPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
K P DWR+ V + K+QG CG+CW+F
Sbjct: 143 KGPLHFDWREQNKVTSIKNQGACGACWAF 171
>UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2;
Roseiflexus|Rep: Peptidase C1A, papain precursor -
Roseiflexus sp. RS-1
Length = 1202
Score = 39.5 bits (88), Expect = 0.079
Identities = 15/28 (53%), Positives = 17/28 (60%)
Frame = +2
Query: 470 LPEQVDWRKHGAVPTFKDQGKCGSCWSF 553
LP +W GA KDQG CGSCW+F
Sbjct: 169 LPAAFNWCDQGACTPVKDQGVCGSCWAF 196
>UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2;
Caenorhabditis|Rep: Putative uncharacterized protein -
Caenorhabditis elegans
Length = 343
Score = 39.5 bits (88), Expect = 0.079
Identities = 31/98 (31%), Positives = 48/98 (48%), Gaps = 3/98 (3%)
Frame = +2
Query: 269 IIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKF 448
++ ++N K + G V+Y+L N + D+ E+ K + H++ S++
Sbjct: 81 LVERYN-KEDAGKVTYEL--NDFSDLTEEEWKKYL--MTPKPDHSEK------SLKPKTL 129
Query: 449 ISPANVKLPEQVDWRK-HGA--VPTFKDQGKCGSCWSF 553
I N LP VDWR +G V K QG CGSCW+F
Sbjct: 130 IDKKN--LPNSVDWRNVNGTNHVTGIKYQGPCGSCWAF 165
>UniRef50_Q23EG5 Cluster: Papain family cysteine protease containing
protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 429
Score = 39.5 bits (88), Expect = 0.079
Identities = 16/33 (48%), Positives = 22/33 (66%), Gaps = 4/33 (12%)
Frame = +2
Query: 467 KLPEQVDWRKHGAVPTFKDQ----GKCGSCWSF 553
++P+ VDWR+ G V + KDQ CGSCW+F
Sbjct: 121 EIPDYVDWREKGIVSSVKDQDAVGDDCGSCWTF 153
Database: uniref50
Posted date: Oct 5, 2007 11:19 AM
Number of letters in database: 575,637,011
Number of sequences in database: 1,657,284
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.279 0.0580 0.190
Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 707,051,345
Number of Sequences: 1657284
Number of extensions: 15102863
Number of successful extensions: 55030
Number of sequences better than 10.0: 371
Number of HSP's better than 10.0 without gapping: 51322
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 54798
length of database: 575,637,011
effective HSP length: 98
effective length of database: 413,223,179
effective search space used: 56611575523
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)