BLASTX 2.2.12 [Aug-07-2005]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= tesV0482.Seq
(797 letters)
Database: uniref50
1,657,284 sequences; 575,637,011 total letters
Searching..................................................done
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C... 112 8e-24
UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ... 86 1e-15
UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n... 84 4e-15
UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j... 81 3e-14
UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M... 81 4e-14
UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep... 80 6e-14
UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;... 80 6e-14
UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823 p... 79 1e-13
UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi... 79 1e-13
UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs... 79 1e-13
UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n... 79 2e-13
UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca... 78 2e-13
UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar ... 77 5e-13
UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1; Rhip... 77 5e-13
UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata... 77 5e-13
UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|... 76 9e-13
UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ... 76 9e-13
UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae... 76 1e-12
UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy... 76 1e-12
UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3; D... 76 1e-12
UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei... 75 3e-12
UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ... 74 5e-12
UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Re... 74 5e-12
UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=... 74 5e-12
UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ... 73 6e-12
UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|R... 73 6e-12
UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35... 73 6e-12
UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean e... 73 6e-12
UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n... 73 8e-12
UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep: ... 73 8e-12
UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep: C... 73 8e-12
UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; D... 73 8e-12
UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes vastus... 72 2e-11
UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr... 71 3e-11
UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10; ... 71 3e-11
UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat... 71 3e-11
UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep: Tox... 71 3e-11
UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine p... 71 4e-11
UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|R... 71 4e-11
UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16; Bromel... 71 4e-11
UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3; Magnolio... 70 6e-11
UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1; Naegl... 70 6e-11
UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ... 70 6e-11
UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza sativa... 70 8e-11
UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1; Rhip... 70 8e-11
UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326; ... 69 1e-10
UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis thal... 69 1e-10
UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24; Magnoliophyt... 69 1e-10
UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh ... 69 1e-10
UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain... 69 1e-10
UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat... 69 1e-10
UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61; ... 69 1e-10
UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19; Bilateria|... 69 1e-10
UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6; Magnoliophy... 69 2e-10
UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gamb... 69 2e-10
UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h... 68 2e-10
UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome s... 68 2e-10
UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza sa... 68 2e-10
UniRef50_Q23H15 Cluster: Papain family cysteine protease contain... 68 2e-10
UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:... 68 3e-10
UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3; Magnolioph... 68 3e-10
UniRef50_Q22W19 Cluster: Papain family cysteine protease contain... 68 3e-10
UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, wh... 68 3e-10
UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor; ... 68 3e-10
UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease; ... 67 4e-10
UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10; Lil... 67 4e-10
UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor; n=... 67 4e-10
UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin ... 67 6e-10
UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin ... 67 6e-10
UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n... 67 6e-10
UniRef50_Q239L8 Cluster: Papain family cysteine protease contain... 67 6e-10
UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2; ... 67 6e-10
UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core eudic... 67 6e-10
UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease ... 66 7e-10
UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryz... 66 7e-10
UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep... 66 7e-10
UniRef50_Q23H10 Cluster: Papain family cysteine protease contain... 66 7e-10
UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto... 66 7e-10
UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome s... 66 1e-09
UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr... 66 1e-09
UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v... 66 1e-09
UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca... 66 1e-09
UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza sa... 66 1e-09
UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core eudicoty... 66 1e-09
UniRef50_Q24E33 Cluster: Papain family cysteine protease contain... 66 1e-09
UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46; ... 66 1e-09
UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza sativ... 65 2e-09
UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1... 65 2e-09
UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18; Schi... 65 2e-09
UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheir... 65 2e-09
UniRef50_Q22RR8 Cluster: Papain family cysteine protease contain... 65 2e-09
UniRef50_O16454 Cluster: Temporarily assigned gene name protein ... 64 3e-09
UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue ... 64 4e-09
UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-... 64 4e-09
UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ... 64 4e-09
UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er... 64 4e-09
UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella ve... 64 4e-09
UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27; Eukaryo... 64 4e-09
UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11; Trichomonadida... 64 5e-09
UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Re... 64 5e-09
UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|... 64 5e-09
UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ... 63 7e-09
UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium falciparum... 63 7e-09
UniRef50_Q22RC9 Cluster: Papain family cysteine protease contain... 63 7e-09
UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]... 63 7e-09
UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc... 63 7e-09
UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin ... 63 9e-09
UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ... 63 9e-09
UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy... 63 9e-09
UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis t... 62 1e-08
UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium (Vinckei... 62 1e-08
UniRef50_Q23H06 Cluster: Papain family cysteine protease contain... 62 1e-08
UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteina... 62 1e-08
UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like cy... 62 1e-08
UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14; ... 62 2e-08
UniRef50_UPI000155637A Cluster: PREDICTED: similar to ENSANGP000... 62 2e-08
UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ... 62 2e-08
UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza ... 62 2e-08
UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa... 62 2e-08
UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep: Ac... 62 2e-08
UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella ve... 62 2e-08
UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ... 62 2e-08
UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep: ... 61 3e-08
UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1; ... 61 3e-08
UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain... 61 3e-08
UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3; ... 61 3e-08
UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No... 61 4e-08
UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Re... 61 4e-08
UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2; ... 61 4e-08
UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila melanogaster|... 61 4e-08
UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n... 61 4e-08
UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960 ... 61 4e-08
UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4; Nucleopolyhedrovi... 61 4e-08
UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcopt... 60 5e-08
UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep: Dv... 60 5e-08
UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2; Nucleopolyhedrovi... 60 5e-08
UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Re... 60 5e-08
UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin ... 60 6e-08
UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13; Pl... 60 6e-08
UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia... 60 6e-08
UniRef50_Q24FN2 Cluster: Papain family cysteine protease contain... 60 6e-08
UniRef50_Q22A69 Cluster: Papain family cysteine protease contain... 60 6e-08
UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30; Nucleopolyhedrov... 60 6e-08
UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease ... 60 8e-08
UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep: MG... 60 8e-08
UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus ... 60 8e-08
UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas foet... 60 8e-08
UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3... 60 8e-08
UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Ory... 60 8e-08
UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ... 60 8e-08
UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep: ... 59 1e-07
UniRef50_Q235G6 Cluster: Papain family cysteine protease contain... 59 1e-07
UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18... 59 1e-07
UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ... 59 1e-07
UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2; R... 59 1e-07
UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum aestiv... 59 1e-07
UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n... 59 1e-07
UniRef50_Q2QS15 Cluster: Papain family cysteine protease contain... 58 2e-07
UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ... 58 2e-07
UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella histriomus... 58 2e-07
UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy... 58 2e-07
UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;... 58 3e-07
UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin ... 58 3e-07
UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryz... 58 3e-07
UniRef50_Q0JP65 Cluster: Os01g0240900 protein; n=3; Oryza sativa... 58 3e-07
UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain... 58 3e-07
UniRef50_Q23TW3 Cluster: Papain family cysteine protease contain... 58 3e-07
UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;... 58 3e-07
UniRef50_Q23RT7 Cluster: Papain family cysteine protease contain... 58 3e-07
UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,... 57 4e-07
UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease ... 57 4e-07
UniRef50_Q60EQ9 Cluster: Putative uncharacterized protein OJ1280... 57 4e-07
UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease G... 57 4e-07
UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:... 57 4e-07
UniRef50_Q23H32 Cluster: Papain family cysteine protease contain... 57 4e-07
UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicate... 57 4e-07
UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine prot... 57 4e-07
UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11; ... 57 4e-07
UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;... 56 8e-07
UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163 pr... 56 8e-07
UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica... 56 1e-06
UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-... 56 1e-06
UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep: L... 56 1e-06
UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina... 56 1e-06
UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila melanogaster... 55 2e-06
UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1; ... 55 2e-06
UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy... 55 2e-06
UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16) [C... 55 2e-06
UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2... 55 2e-06
UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, wh... 54 3e-06
UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease ... 54 4e-06
UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain... 54 4e-06
UniRef50_Q23894 Cluster: Cysteine proteinase 3; n=2; Dictyosteli... 54 4e-06
UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole... 54 5e-06
UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia bovis... 54 5e-06
UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, wh... 54 5e-06
UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1; Big... 53 7e-06
UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1; ... 53 7e-06
UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia tropicali... 53 7e-06
UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome s... 53 1e-05
UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza sativa... 53 1e-05
UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio... 53 1e-05
UniRef50_Q23FQ5 Cluster: Papain family cysteine protease contain... 53 1e-05
UniRef50_Q22AB1 Cluster: Papain family cysteine protease contain... 53 1e-05
UniRef50_UPI0001509FF9 Cluster: Papain family cysteine protease ... 52 2e-05
UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain... 52 2e-05
UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precurs... 52 2e-05
UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1... 52 2e-05
UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1; Trypa... 52 2e-05
UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin ... 51 3e-05
UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin ... 51 3e-05
UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba hi... 51 3e-05
UniRef50_A7APS9 Cluster: Papain family cysteine protease contain... 51 3e-05
UniRef50_Q5C1E8 Cluster: SJCHGC04937 protein; n=1; Schistosoma j... 51 4e-05
UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3; Theil... 51 4e-05
UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia medanen... 51 4e-05
UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5; The... 51 4e-05
UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia ... 50 5e-05
UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1; ... 50 5e-05
UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, wh... 50 5e-05
UniRef50_Q9BL26 Cluster: Putative uncharacterized protein; n=4; ... 50 7e-05
UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5; Pir... 50 9e-05
UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-li... 50 9e-05
UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease ... 49 1e-04
UniRef50_Q22NW9 Cluster: Papain family cysteine protease contain... 49 1e-04
UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18; Pl... 49 1e-04
UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:... 49 1e-04
UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n... 49 2e-04
UniRef50_Q7KWP5 Cluster: Similar to Dictyostelium discoideum (Sl... 49 2e-04
UniRef50_Q75JH0 Cluster: Similar to Dictyostelium discoideum (Sl... 49 2e-04
UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted... 49 2e-04
UniRef50_Q248G1 Cluster: Papain family cysteine protease contain... 49 2e-04
UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru... 49 2e-04
UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2; cel... 49 2e-04
UniRef50_UPI0000D566ED Cluster: PREDICTED: similar to CTLA-2-alp... 48 2e-04
UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2; ... 48 2e-04
UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaste... 48 2e-04
UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcop... 48 2e-04
UniRef50_A1ZBK7 Cluster: CG10460-PA; n=1; Drosophila melanogaste... 48 3e-04
UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n... 48 4e-04
UniRef50_Q23FS6 Cluster: Papain family cysteine protease contain... 48 4e-04
UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4; Hymenostomatida|... 47 6e-04
UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lambli... 47 6e-04
UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184, w... 47 6e-04
UniRef50_Q5YER1 Cluster: Cysteine proteinase; n=1; Bigelowiella ... 46 8e-04
UniRef50_Q17IZ6 Cluster: Procathepsin L3, putative; n=2; Culicid... 46 8e-04
UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, wh... 46 8e-04
UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:... 46 8e-04
UniRef50_UPI0000D566EC Cluster: PREDICTED: similar to CG10460-PA... 46 0.001
UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome s... 46 0.001
UniRef50_Q0INY9 Cluster: Os12g0273800 protein; n=8; Magnoliophyt... 46 0.001
UniRef50_Q3L7L7 Cluster: Sar s 1 allergen Yv5020C01; n=1; Sarcop... 46 0.001
UniRef50_Q231X3 Cluster: Papain family cysteine protease contain... 46 0.001
UniRef50_Q01E11 Cluster: Cysteine proteinase Cathepsin F; n=3; O... 46 0.001
UniRef50_A0E5A3 Cluster: Chromosome undetermined scaffold_79, wh... 46 0.001
UniRef50_Q28F45 Cluster: Cathepsin W; n=2; Xenopus tropicalis|Re... 45 0.002
UniRef50_P16311 Cluster: Major mite fecal allergen Der f 1 precu... 45 0.002
UniRef50_Q4N640 Cluster: Cysteine protease, putative; n=2; Theil... 45 0.003
UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep... 45 0.003
UniRef50_P25781 Cluster: Cysteine proteinase precursor; n=3; The... 45 0.003
UniRef50_Q2XWW8 Cluster: Cysteine protease Mir1; n=1; Zea diplop... 44 0.003
UniRef50_Q7QVF5 Cluster: GLP_90_15278_13989; n=3; Giardia intest... 44 0.003
UniRef50_Q4UCF4 Cluster: Cysteine protease, tacP, putative; n=2;... 44 0.003
UniRef50_Q4DC63 Cluster: Cysteine proteinase, putative; n=2; Try... 44 0.003
UniRef50_Q3L7L5 Cluster: Sar s 1 allergen Yv6030H07; n=1; Sarcop... 44 0.003
UniRef50_A7LFV2 Cluster: Cathepsin L protease inhibitor 1; n=1; ... 44 0.003
UniRef50_A0BNM1 Cluster: Chromosome undetermined scaffold_119, w... 44 0.003
UniRef50_O76852 Cluster: Tetrain; n=2; Tetrahymena|Rep: Tetrain ... 44 0.004
UniRef50_A3EXS2 Cluster: Cathepsin L-like cysteine proteinase-li... 44 0.004
UniRef50_Q24HI2 Cluster: Papain family cysteine protease contain... 44 0.006
UniRef50_Q5NE16 Cluster: Putative cathepsin L-like protein 3; n=... 44 0.006
UniRef50_UPI00015B5D85 Cluster: PREDICTED: similar to cathepsin ... 43 0.008
UniRef50_Q41522 Cluster: Thiol protease; n=2; Triticum aestivum|... 43 0.008
UniRef50_Q23EG5 Cluster: Papain family cysteine protease contain... 43 0.008
UniRef50_A7UNU3 Cluster: Aca s 1 allergen; n=1; Acarus siro|Rep:... 43 0.010
UniRef50_A0DKP6 Cluster: Chromosome undetermined scaffold_54, wh... 43 0.010
UniRef50_P43234 Cluster: Cathepsin O precursor; n=22; Euteleosto... 43 0.010
UniRef50_UPI0000F2D685 Cluster: PREDICTED: similar to cathepsin ... 42 0.014
UniRef50_UPI0000588EBB Cluster: PREDICTED: hypothetical protein;... 42 0.014
UniRef50_Q650Y1 Cluster: Putative cysteine proteinase; n=1; Oryz... 42 0.014
UniRef50_Q8I8D1 Cluster: Cysteine protease 17; n=2; Entamoeba hi... 42 0.014
UniRef50_Q9XXQ7 Cluster: Putative uncharacterized protein; n=2; ... 42 0.018
UniRef50_Q237A1 Cluster: Papain family cysteine protease contain... 42 0.018
UniRef50_A6N590 Cluster: Extracellular cysteine protease 8; n=10... 42 0.018
UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endop... 42 0.018
UniRef50_Q94715 Cluster: Putative cathepsin L2 precursor; n=4; P... 42 0.018
UniRef50_A6GBK8 Cluster: Peptidase C1A, papain; n=1; Plesiocysti... 42 0.024
UniRef50_Q3L7L6 Cluster: Sar s 1 allergen Yv5032C08; n=1; Sarcop... 42 0.024
UniRef50_P46102 Cluster: Cysteine proteinase precursor; n=16; Pl... 42 0.024
UniRef50_Q9U2X1 Cluster: Putative uncharacterized protein; n=2; ... 41 0.031
UniRef50_Q8MXZ4 Cluster: Gamete and mating-type specific protein... 41 0.031
UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis... 41 0.031
UniRef50_Q3L7L4 Cluster: Sar s 1 allergen Yv9053H09; n=1; Sarcop... 41 0.031
UniRef50_A0DCA5 Cluster: Chromosome undetermined scaffold_45, wh... 41 0.031
UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, who... 41 0.031
UniRef50_Q2FUI8 Cluster: Peptidase C1A, papain precursor; n=1; M... 41 0.031
UniRef50_Q94714 Cluster: Cathepsin L1 precursor; n=3; Paramecium... 41 0.031
UniRef50_A2Q4E7 Cluster: Peptidase C1A, papain; n=1; Medicago tr... 41 0.041
UniRef50_Q54MB6 Cluster: Putative uncharacterized protein; n=1; ... 41 0.041
UniRef50_Q22DX2 Cluster: Papain family cysteine protease contain... 41 0.041
UniRef50_A0BEB2 Cluster: Chromosome undetermined scaffold_101, w... 41 0.041
UniRef50_UPI00006CA6D4 Cluster: Papain family cysteine protease ... 40 0.055
UniRef50_Q54J84 Cluster: Putative uncharacterized protein; n=1; ... 40 0.055
UniRef50_Q292E5 Cluster: GA10327-PA; n=1; Drosophila pseudoobscu... 40 0.055
UniRef50_A2DJ33 Cluster: Clan CA, family C1, cathepsin L-like cy... 40 0.055
UniRef50_Q03107 Cluster: Cathepsin B; n=20; Magnoliophyta|Rep: C... 40 0.072
UniRef50_UPI00001CC928 Cluster: PREDICTED: similar to CTLA-2-bet... 40 0.096
UniRef50_Q650X9 Cluster: Putative cysteine proteinase; n=1; Oryz... 40 0.096
UniRef50_A1Z9I0 Cluster: CG6357-PA; n=3; Drosophila melanogaster... 40 0.096
UniRef50_P25792 Cluster: Cathepsin B-like cysteine proteinase pr... 40 0.096
UniRef50_P12400 Cluster: Protein CTLA-2-beta; n=6; Mus musculus|... 39 0.13
UniRef50_UPI0000DA404B Cluster: PREDICTED: similar to cathepsin ... 39 0.17
UniRef50_Q962W3 Cluster: Cysteine protease; n=3; Giardia intesti... 39 0.17
UniRef50_Q7QWP3 Cluster: GLP_26_47548_45815; n=1; Giardia lambli... 39 0.17
UniRef50_Q17P70 Cluster: Cathepsin o; n=2; Culicidae|Rep: Cathep... 39 0.17
UniRef50_A0EAG2 Cluster: Chromosome undetermined scaffold_86, wh... 39 0.17
UniRef50_A0C8B1 Cluster: Chromosome undetermined scaffold_158, w... 39 0.17
UniRef50_Q93512 Cluster: Putative uncharacterized protein; n=2; ... 38 0.22
UniRef50_A7SDR5 Cluster: Predicted protein; n=1; Nematostella ve... 38 0.22
UniRef50_A2EWP9 Cluster: Clan CA, family C1, cathepsin L-like cy... 38 0.22
UniRef50_A0DM19 Cluster: Chromosome undetermined scaffold_56, wh... 38 0.22
UniRef50_Q9GU75 Cluster: Thiolproteinase; n=2; Babesia|Rep: Thio... 38 0.29
UniRef50_A2EBQ0 Cluster: Clan CA, family C1, cathepsin B-like cy... 38 0.29
UniRef50_UPI0000E490F4 Cluster: PREDICTED: similar to cathepsin ... 37 0.51
UniRef50_A2F844 Cluster: Clan CA, family C1, cathepsin L, S or H... 37 0.51
UniRef50_Q8TKH5 Cluster: Cell surface protein; n=3; Methanosarci... 37 0.51
UniRef50_UPI0000ECBFDF Cluster: UPI0000ECBFDF related cluster; n... 37 0.67
UniRef50_Q4S572 Cluster: Tyrosine-protein kinase receptor; n=2; ... 37 0.67
UniRef50_Q4RKR3 Cluster: Chromosome 5 SCAF15026, whole genome sh... 37 0.67
UniRef50_Q7QPZ7 Cluster: GLP_113_4299_5381; n=1; Giardia lamblia... 37 0.67
UniRef50_A2ECZ7 Cluster: Clan CA, family C1, cathepsin L or H-li... 37 0.67
UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, wh... 37 0.67
UniRef50_Q8TQ91 Cluster: Putative uncharacterized protein; n=1; ... 37 0.67
UniRef50_P43157 Cluster: Cathepsin B-like cysteine proteinase pr... 37 0.67
UniRef50_P56202 Cluster: Cathepsin W precursor; n=15; Eutheria|R... 37 0.67
UniRef50_A6EGZ3 Cluster: Aminopeptidase C; n=1; Pedobacter sp. B... 36 0.89
UniRef50_Q93VC9 Cluster: At1g02300/T6A9_10; n=11; core eudicotyl... 36 0.89
UniRef50_Q01FL5 Cluster: Cysteine protease-5; n=3; Ostreococcus|... 36 0.89
UniRef50_Q017I3 Cluster: Cysteine proteinase Cathepsin F; n=1; O... 36 0.89
UniRef50_A7LFV3 Cluster: Cathepsin L protease inhibitor 2; n=1; ... 36 0.89
UniRef50_A3FQ13 Cluster: Cathepsin like thiol protease possibly ... 36 0.89
UniRef50_Q2NG83 Cluster: Member of asn/thr-rich large protein fa... 36 0.89
UniRef50_Q0AY53 Cluster: Putative uncharacterized protein; n=1; ... 36 1.2
UniRef50_Q8MNY7 Cluster: Cathepsin B-like protease precursor; n=... 36 1.2
UniRef50_A0LES1 Cluster: Peptidase C1A, papain precursor; n=2; S... 36 1.6
UniRef50_Q7R1W9 Cluster: GLP_163_69918_68548; n=1; Giardia lambl... 36 1.6
UniRef50_Q4N5Z7 Cluster: Cysteine proteinase, putative; n=2; The... 36 1.6
UniRef50_Q25026 Cluster: Cysteine proteinase; n=3; Haemonchus co... 36 1.6
UniRef50_P43510 Cluster: Cathepsin B-like cysteine proteinase 6 ... 36 1.6
UniRef50_P53634 Cluster: Dipeptidyl-peptidase 1 precursor (EC 3.... 36 1.6
UniRef50_Q7X6B4 Cluster: OSJNBa0079F16.1 protein; n=41; Euphyllo... 35 2.1
UniRef50_Q8I8C9 Cluster: Cysteine protease 19; n=1; Entamoeba hi... 35 2.1
UniRef50_Q86GS9 Cluster: Cathepsin B; n=1; Sterkiella histriomus... 35 2.1
UniRef50_Q5VJM8 Cluster: Cathepsin B-like cysteine protease; n=2... 35 2.1
UniRef50_Q55FL7 Cluster: Putative uncharacterized protein; n=1; ... 35 2.1
UniRef50_P90627 Cluster: Cathepsin B-like protease; n=8; Leishma... 35 2.1
UniRef50_UPI0000DA2FCA Cluster: PREDICTED: similar to alpha 3 ty... 35 2.7
UniRef50_Q945E4 Cluster: Cysteine proteinase; n=1; Vasconcellea ... 35 2.7
UniRef50_Q650W8 Cluster: Putative cysteine proteinase; n=2; Oryz... 35 2.7
UniRef50_Q8IU31 Cluster: Cathepsin L-like cysteine proteinase; n... 35 2.7
UniRef50_Q86FI9 Cluster: Clone ZZD209 mRNA sequence; n=3; Schist... 35 2.7
UniRef50_Q7RPJ9 Cluster: Mature parasite-infected erythrocyte su... 35 2.7
UniRef50_A4GVW7 Cluster: Cathepsin B5; n=5; Clonorchis sinensis|... 35 2.7
UniRef50_A2EGT2 Cluster: Clan CA, family C1, cathepsin L-like cy... 35 2.7
UniRef50_A1XG93 Cluster: Putative cathepsin B-like proteinase; n... 35 2.7
UniRef50_P84789 Cluster: Philibertain g 1; n=5; core eudicotyled... 35 2.7
UniRef50_UPI0000E4A619 Cluster: PREDICTED: similar to cathepsin ... 34 3.6
UniRef50_Q207N1 Cluster: Cathepsin S; n=2; Clupeocephala|Rep: Ca... 34 3.6
UniRef50_A0LQG9 Cluster: Peptidase C1A, papain; n=1; Syntrophoba... 34 3.6
UniRef50_Q8MTU2 Cluster: Cysteine proteinase; n=2; Eukaryota|Rep... 34 3.6
UniRef50_Q7QWN9 Cluster: GLP_26_50243_51811; n=2; Giardia lambli... 34 3.6
UniRef50_Q23FP9 Cluster: Papain family cysteine protease contain... 34 3.6
UniRef50_A5K8Y0 Cluster: Preprocathepsin c, putative; n=1; Plasm... 34 3.6
UniRef50_A0BV23 Cluster: Chromosome undetermined scaffold_13, wh... 34 3.6
UniRef50_A6H8W3 Cluster: GPR124 protein; n=4; Euteleostomi|Rep: ... 34 3.6
UniRef50_A4YDW2 Cluster: Major facilitator superfamily MFS_1 pre... 34 3.6
UniRef50_P21381 Cluster: Thaumatopain; n=10; Eukaryota|Rep: Thau... 34 3.6
UniRef50_Q96PE1 Cluster: Probable G-protein coupled receptor 124... 34 3.6
UniRef50_P07858 Cluster: Cathepsin B precursor (EC 3.4.22.1) (Ca... 34 3.6
UniRef50_UPI0000E45E63 Cluster: PREDICTED: hypothetical protein;... 34 4.8
UniRef50_UPI00006CCC39 Cluster: hypothetical protein TTHERM_0033... 34 4.8
UniRef50_Q4AI35 Cluster: Cysteine peptidase, putative precursor;... 34 4.8
UniRef50_A1ZZ62 Cluster: Aminopeptidase C; n=1; Microscilla mari... 34 4.8
UniRef50_Q6L8N8 Cluster: Cathepsin B-S precursor; n=15; Aphidoid... 34 4.8
UniRef50_Q4Y2Z9 Cluster: Putative uncharacterized protein; n=3; ... 34 4.8
UniRef50_Q1KYN8 Cluster: Cathepsin B; n=1; Streblomastix strix|R... 34 4.8
UniRef50_Q2H7E7 Cluster: Putative uncharacterized protein; n=1; ... 34 4.8
UniRef50_A3LQQ7 Cluster: Putative uncharacterized protein ALS4; ... 34 4.8
UniRef50_Q649T1 Cluster: Cathepsin C; n=1; uncultured archaeon G... 34 4.8
UniRef50_P92131 Cluster: Cathepsin B-like CP1 precursor; n=3; Gi... 34 4.8
UniRef50_Q70SU8 Cluster: Cystein proteinase inhibitor protein pr... 33 6.3
UniRef50_Q9SIE8 Cluster: Putative cysteine proteinase; n=1; Arab... 33 6.3
UniRef50_Q9LP42 Cluster: Putative cysteine proteinase; n=2; Arab... 33 6.3
UniRef50_Q86MW7 Cluster: Cathepsin B; n=9; Fasciola|Rep: Catheps... 33 6.3
UniRef50_Q5MBV5 Cluster: Parcxpwnx02; n=3; Neoptera|Rep: Parcxpw... 33 6.3
UniRef50_Q4YWX6 Cluster: Putative uncharacterized protein; n=1; ... 33 6.3
UniRef50_A2F4T7 Cluster: Clan CA, family C1, cathepsin L-like cy... 33 6.3
UniRef50_A4RJ84 Cluster: Putative uncharacterized protein; n=2; ... 33 6.3
UniRef50_Q26563 Cluster: Cathepsin C precursor; n=6; Schistosoma... 33 6.3
UniRef50_Q7MTY9 Cluster: Cysteine peptidase, putative; n=8; Bact... 33 8.3
UniRef50_Q5BZ34 Cluster: SJCHGC02853 protein; n=1; Schistosoma j... 33 8.3
UniRef50_Q4Q6W9 Cluster: Putative uncharacterized protein; n=3; ... 33 8.3
UniRef50_Q22ST4 Cluster: Von Willebrand factor type A domain con... 33 8.3
UniRef50_A1YUM5 Cluster: Cysteine proteinase 3; n=7; Rhabditida|... 33 8.3
UniRef50_P43508 Cluster: Cathepsin B-like cysteine proteinase 4 ... 33 8.3
>UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15)
[Contains: Cathepsin L heavy chain; Cathepsin L light
chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC
3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin
L light chain] - Sarcophaga peregrina (Flesh fly)
(Boettcherisca peregrina)
Length = 339
Score = 112 bits (270), Expect = 8e-24
Identities = 70/182 (38%), Positives = 91/182 (50%), Gaps = 6/182 (3%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
           E+ FRMKI+ E++H IAKHNQ +  G VSYKLG+NKY DMLHHEF +TMNG+N T    +
Sbjct: 44  EERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADMLHHEFKETMNGYNHTL---R 100
Query: 436 NLYMKGGSVRGAKFISPANVKLPE----RWTGGSTAPSPTSRTKG--SVAHAGLQHDWSF 597
            L  +   + GA +I PA+V +P+    R  G  T            + +  G      F
Sbjct: 101 QLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHCGSCWAFSSTGALEGQHF 160
Query: 598 GKDSTSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGGIDTEQTYLTRGV 777
            K    VS                         G     F+YIKDNGGIDTE++Y   G+
Sbjct: 161 RKAGVLVS-----LSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGI 215
Query: 778 DD 783
           DD
Sbjct: 216 DD 217
Score = 90.2 bits (214), Expect = 5e-17
Identities = 37/54 (68%), Positives = 45/54 (83%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           +VDWR+HGAVT +KDQG CGSCW+ +     EGQHFR++G LVSLSEQNL+DCS
Sbjct: 125 SVDWREHGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCS 178
Score = 40.3 bits (90), Expect = 0.055
Identities = 24/49 (48%), Positives = 27/49 (55%), Gaps = 3/49 (6%)
Frame = +2
Query: 578 FSTTGALGRTALPSVRLPGVALGAKPHRLLGA---YGNNGCNGGLMDNA 715
FS+TGAL R GV + L+ YGNNGCNGGLMDNA
Sbjct: 149 FSSTGALEGQHF---RKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNA 194
Score = 38.3 bits (85), Expect = 0.22
Identities = 14/23 (60%), Positives = 18/23 (78%)
Frame = +2
Query: 191 DLVKEEWSAFKLQHRLNYESEAK 259
           DL+KEEW  +KLQHR NY +E +
Sbjct: 22  DLIKEEWHTYKLQHRKNYANEVE 44
>UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor;
n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase
precursor - Diabrotica virgifera virgifera (western corn
rootworm)
Length = 326
Score = 85.8 bits (203), Expect = 1e-15
Identities = 42/105 (40%), Positives = 57/105 (54%)
Frame = +3
Query: 459 RPRG*VHIAGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVS 638
RPR + ++ DWR+ GAVT++KDQG CGSCWS + EG +F ++G LVS
Sbjct: 97 RPRVIHSLTPVKDLPSKFDWREKGAVTEVKDQGSCGSCWSFSTTGTVEGAYFLKTGKLVS 156
Query: 639 LSEQNLIDCSEHXXXXXXXXXXXXXLQVHQGQRGDRHRADLPYEG 773
LSEQNL+DC++ L+ + G D PYEG
Sbjct: 157 LSEQNLVDCAKEDCYGCSGGYMDKALEYIETAGGIMSENDYPYEG 201
Score = 42.3 bits (95), Expect = 0.014
Identities = 20/60 (33%), Positives = 33/60 (55%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
E+ R I+ I HN KY+ GL ++KLG+ K+ D+ EF M G +++ K ++
Sbjct: 39 EEQKRFTIFQGSLRKIENHNDKYDHGLSTFKLGVTKFADLTEKEF-SDMLGISRSTKSSR 97
>UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3;
Bilateria|Rep: Cathepsin L-like cysteine proteinase -
Longidorus elongatus
Length = 358
Score = 83.8 bits (198), Expect = 4e-15
Identities = 40/90 (44%), Positives = 52/90 (57%), Gaps = 1/90 (1%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH-XXX 683
+VDWRK G VT +KDQG CGSCW+ + EGQH++Q+G LVSLSEQNL+DC +
Sbjct: 142 SVDWRKEGYVTKVKDQGSCGSCWAFSATGSLEGQHYKQTGKLVSLSEQNLVDCDVNGDDE 201
Query: 684 XXXXXXXXXXLQVHQGQRGDRHRADLPYEG 773
Q + +G A PY+G
Sbjct: 202 GCNGGYMDGAFQYVETNKGIDTEASYPYKG 231
Score = 59.3 bits (137), Expect = 1e-07
Identities = 45/180 (25%), Positives = 73/180 (40%), Gaps = 1/180 (0%)
Frame = +1
Query: 244 RKRGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTA 423
+ + E+ R +++A + +I +HN +YE G S+ L +NK+ DM + EF + MNGF A
Sbjct: 55 KTKDEELLRFQVFASNHKVIEQHNIEYEAGQHSFALSLNKFADMTNAEFRQRMNGFKLPA 114
Query: 424 KHNKNLYMKGGSVRGAKFISPANVKLPERWTGGSTAPSPTSRTKGSVAHA-GLQHDWSFG 600
K K + G F P NV +P+ + +GS S
Sbjct: 115 K-RKLAKSQPLKEDGMIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCWAFSATGSLE 173
Query: 601 KDSTSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGGIDTEQTYLTRGVD 780
+ ++ G F+Y++ N GIDTE +Y +G D
Sbjct: 174 GQHYKQTGKLVSLSEQNLVDCDVNGDDEGCNGGYMDGAFQYVETNKGIDTEASYPYKGRD 233
>UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma
japonicum|Rep: SJCHGC06231 protein - Schistosoma
japonicum (Blood fluke)
Length = 372
Score = 81.0 bits (191), Expect = 3e-14
Identities = 35/72 (48%), Positives = 52/72 (72%), Gaps = 1/72 (1%)
Frame = +3
Query: 459 RPRG*VHIAGQR-EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLV 635
+P+G I+ + + VDWR++GAVT +K+QG+CGSCW+ + EGQH+R++ LV
Sbjct: 136 KPKGSTFISSEHAKLPDRVDWRRNGAVTPVKNQGQCGSCWAFSSTGAIEGQHYRKTNRLV 195
Query: 636 SLSEQNLIDCSE 671
+LSEQ LIDCS+
Sbjct: 196 NLSEQQLIDCSK 207
Score = 45.2 bits (102), Expect = 0.002
Identities = 41/175 (23%), Positives = 73/175 (41%), Gaps = 6/175 (3%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
E+ R I+ + + +HN+ Y+ G +YK+G+N + D +E ++ + G+ + K
Sbjct: 78 EETKRFLIFGTNFIKMMEHNRAYQEGKATYKMGVNNFTDKTEYE-LRKLRGYRSACRIAK 136
Query: 436 NLYMKGGSVRGAKFISPANVKLPER--W-TGGSTAPSPTSRTKGS---VAHAGLQHDWSF 597
+G+ FIS + KLP+R W G+ P GS + G +
Sbjct: 137 --------PKGSTFISSEHAKLPDRVDWRRNGAVTPVKNQGQCGSCWAFSSTGAIEGQHY 188
Query: 598 GKDSTSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGGIDTEQTY 762
K + V+ + G F+Y++DN GID+E +Y
Sbjct: 189 RKTNRLVN-----LSEQQLIDCSKSYGNNGCEGGLMDLAFQYVRDNKGIDSEISY 238
>UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15)
(Major excreted protein) (MEP) [Contains: Cathepsin L
heavy chain; Cathepsin L light chain]; n=19;
Eukaryota|Rep: Cathepsin L precursor (EC 3.4.22.15)
(Major excreted protein) (MEP) [Contains: Cathepsin L
heavy chain; Cathepsin L light chain] - Homo sapiens
(Human)
Length = 333
Score = 80.6 bits (190), Expect = 4e-14
Identities = 34/58 (58%), Positives = 45/58 (77%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           EA  +VDWR+ G VT +K+QG+CGSCW+ +     EGQ FR++G L+SLSEQNL+DCS
Sbjct: 113 EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCS 170
Score = 53.6 bits (123), Expect = 5e-06
Identities = 43/182 (23%), Positives = 74/182 (40%), Gaps = 3/182 (1%)
Frame = +1
Query: 226 AAPSQLRKRGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMN 405
A ++L E+ +R ++ ++ +I HNQ+Y G S+ + MN +GDM EF + MN
Sbjct: 34 AMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMN 93
Query: 406 GFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPERWTGGSTAPSPTSRTKGS---VAHAG 576
GF ++ + + +P +V E+ G P GS + G
Sbjct: 94 GFQNRKPRKGKVFQE-----PLFYEAPRSVDWREK---GYVTPVKNQGQCGSCWAFSATG 145
Query: 577 LQHDWSFGKDSTSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGGIDTEQ 756
F K +S + G F+Y++DNGG+D+E+
Sbjct: 146 ALEGQMFRKTGRLIS-----LSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEE 200
Query: 757 TY 762
+Y
Sbjct: 201 SY 202
>UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin
L - Misgurnus mizolepis (Mud loach)
Length = 337
Score = 80.2 bits (189), Expect = 6e-14
Identities = 34/58 (58%), Positives = 42/58 (72%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           E    +DWR+ G VT +KDQG+CGSCW+ +     EGQ FR+ G LVSLSEQNL+DCS
Sbjct: 115 EVPSKLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKQGKLVSLSEQNLVDCS 172
Score = 72.1 bits (169), Expect = 1e-11
Identities = 54/181 (29%), Positives = 77/181 (42%), Gaps = 4/181 (2%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
E+ +R I+ ++ I HN ++ MG+ +Y+LGMN +GDM H EF + MNG+ KH
Sbjct: 44 EEGWRRMIWEKNLRKIQFHNLEHSMGIHTYRLGMNHFGDMNHEEFRQVMNGY----KHKT 99
Query: 436 NLYMKGGSVRGAKFIS-PANVKLPERWTGGSTAPSPTSRTKGS---VAHAGLQHDWSFGK 603
KG F+ P+ + E+ G P GS + G F K
Sbjct: 100 ERKFKGSLFMEPNFLEVPSKLDWREK---GYVTPVKDQGECGSCWAFSTTGAMEGQMFRK 156
Query: 604 DSTSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGGIDTEQTYLTRGVDD 783
VS + G Q F+YIKDN G+D+E+ Y G DD
Sbjct: 157 QGKLVS-----LSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNNGLDSEEAYPYLGTDD 211
Query: 784 Q 786
Q
Sbjct: 212 Q 212
>UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;
Brugia malayi|Rep: Cahepsin L-like cysteine protease -
Brugia malayi (Filarial nematode worm)
Length = 371
Score = 80.2 bits (189), Expect = 6e-14
Identities = 33/55 (60%), Positives = 42/55 (76%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           ++DWR  GAVT +KDQG CGSCW+ + +   EGQHF Q+G LV LS QNL+DCS+
Sbjct: 146 SIDWRTSGAVTKVKDQGYCGSCWTFSAVGALEGQHFLQTGKLVELSMQNLLDCSD 200
Score = 40.3 bits (90), Expect = 0.055
Identities = 21/57 (36%), Positives = 31/57 (54%)
Frame = +1
Query: 268 RMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKN 438
R Y ++ I KHN++YE +Y+L +N DML EF K ++GF +KN
Sbjct: 74 RFMTYLKNVKEIEKHNERYERNEETYELAINHLADMLPEEFRK-LHGFQSRKITSKN 129
>UniRef50_UPI00015559A2 Cluster: PREDICTED: similar to MGC81823
protein, partial; n=1; Ornithorhynchus anatinus|Rep:
PREDICTED: similar to MGC81823 protein, partial -
Ornithorhynchus anatinus
Length = 361
Score = 79.4 bits (187), Expect = 1e-13
Identities = 33/58 (56%), Positives = 43/58 (74%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           E   A+DWR HG VT +KDQG+CGSCW+     + EGQ FR++G L ++SEQNL+DCS
Sbjct: 189 EPPEALDWRDHGYVTPVKDQGRCGSCWAFGSTGVLEGQLFRRTGRLAAVSEQNLMDCS 246
>UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsin L
- Suberites domuncula (Sponge)
Length = 324
Score = 79.4 bits (187), Expect = 1e-13
Identities = 34/58 (58%), Positives = 44/58 (75%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           E A +VDWR+ G V+++K+QG+CGSCWS +     EGQH  + G LVSLSEQNL+DCS
Sbjct: 107 EPAASVDWRQKGVVSEVKNQGQCGSCWSFSATGSLEGQHALKMGRLVSLSEQNLMDCS 164
>UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precursor;
n=3; Metazoa|Rep: Digestive cysteine proteinase 2
precursor - Homarus americanus (American lobster)
Length = 323
Score = 79.0 bits (186), Expect = 1e-13
Identities = 33/56 (58%), Positives = 42/56 (75%)
Frame = +3
Query: 501 AGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           A  VDWR  GAVT +KDQG+CGSCW+ +     EGQHF ++G L+SL+EQ L+DCS
Sbjct: 108 ATEVDWRTKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKTGSLISLAEQQLVDCS 163
Score = 51.6 bits (118), Expect = 2e-05
Identities = 52/179 (29%), Positives = 73/179 (40%), Gaps = 10/179 (5%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
ED++R I+ +++ I + N+KYE G V++ L MNK+GDM EF
Sbjct: 36 EDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEF--------------- 80
Query: 436 NLYMKGGSVRGAKFISPANVKLPERWTGGSTAPSPTSRTKGSVAHAGLQHD----WSFGK 603
N MKG R + +P +V P++ T G A RTKG+V Q W+F
Sbjct: 81 NAVMKGNIPRRS---APVSVFYPKKET-GPQATEVDWRTKGAVTPVKDQGQCGSCWAFST 136
Query: 604 DST------SVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGGIDTEQTY 762
+ + + Q G F YIK N GIDTE Y
Sbjct: 137 TGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAY 195
>UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase;
n=21; Bilateria|Rep: Cathepsin L-like cysteine
proteinase - Globodera pallida
Length = 379
Score = 78.6 bits (185), Expect = 2e-13
Identities = 33/55 (60%), Positives = 42/55 (76%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
           +VDWR  G VT++K+QG CGSCW+ +     E QH RQ+G L+SLSEQNLIDCS+
Sbjct: 164 SVDWRDKGWVTEVKNQGMCGSCWAFSSTGALEAQHARQTGQLISLSEQNLIDCSK 218
Score = 35.9 bits (79), Expect = 1.2
Identities = 21/49 (42%), Positives = 25/49 (51%), Gaps = 3/49 (6%)
Frame = +2
Query: 578 FSTTGALGRTALPSVRLPGVALGAKPHRLLGA---YGNNGCNGGLMDNA 715
FS+TGAL R G + L+ YGN GCNGG+MDNA
Sbjct: 188 FSSTGAL---EAQHARQTGQLISLSEQNLIDCSKKYGNMGCNGGIMDNA 233
>UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep:
Cathepsin - Geodia cydonium (Sponge)
Length = 322
Score = 78.2 bits (184), Expect = 2e-13
Identities = 33/53 (62%), Positives = 40/53 (75%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           VDWR  G VT +K+QG+CGSCW+ +     EGQHF  +G LVSLSEQNL+DCS
Sbjct: 107 VDWRTKGYVTGVKNQGQCGSCWAFSATGSLEGQHFNATGKLVSLSEQNLVDCS 159
>UniRef50_Q4R8L0 Cluster: Testis cDNA clone: QtsA-12228, similar to
human SRY (sex determining region Y)-box 30
(SOX30),transcript variant 1,; n=3; Mammalia|Rep: Testis
cDNA clone: QtsA-12228, similar to human SRY (sex
determining region Y)-box 30 (SOX30),transcript variant
1, - Macaca fascicularis (Crab eating macaque)
(Cynomolgus monkey)
Length = 433
Score = 77.0 bits (181), Expect = 5e-13
Identities = 33/54 (61%), Positives = 42/54 (77%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           +VDWRK G VT +K+Q +CGSCW+ +     EGQ FR++G LVSLSEQNL+DCS
Sbjct: 117 SVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCS 170
Score = 49.6 bits (113), Expect = 9e-05
Identities = 46/183 (25%), Positives = 74/183 (40%), Gaps = 4/183 (2%)
Frame = +1
Query: 226 AAPSQLRKRGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMN 405
A +L E+ +R ++ ++ +I HN +Y G + + MN +GDM + EF + M
Sbjct: 34 ATHRRLYGASEEGWRRAVWEKNMKMIELHNGEYSQGKHGFAMAMNAFGDMTNEEFRQVMG 93
Query: 406 GFNKTAKHNKNLYMKGGSVRGAKFIS-PANVKLPERWTGGSTAPSPTSRTKGS---VAHA 573
F N+ L KG R F+ P +V ++ G P + GS +
Sbjct: 94 CF-----RNQKL-RKGKLFREPLFLDLPKSVDWRKK---GYVTPVKNQKQCGSCWAFSAT 144
Query: 574 GLQHDWSFGKDSTSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGGIDTE 753
G F K VS Q G F+Y+K+NGG+D+E
Sbjct: 145 GALEGQMFRKTGKLVS-----LSEQNLVDCSHPQGNQGCNGGFMNSAFRYVKENGGLDSE 199
Query: 754 QTY 762
++Y
Sbjct: 200 ESY 202
>UniRef50_Q86GZ5 Cluster: Midgut cysteine proteinase 2; n=1;
Rhipicephalus appendiculatus|Rep: Midgut cysteine
proteinase 2 - Rhipicephalus appendiculatus (Brown ear
tick)
Length = 564
Score = 77.0 bits (181), Expect = 5e-13
Identities = 43/107 (40%), Positives = 58/107 (54%), Gaps = 1/107 (0%)
Frame = +3
Query: 351 GHEQVRRHAPPRVREDYERLQQNCQTQQESVH-EGWERPRG*VHIAGQREAAGAVDWRKH 527
G+ H R RE+ L+ Q++ S E + R R + Q +DWR +
Sbjct: 301 GYNLAVNHLADRTREEISVLRGRLQSKDGSSRAEPFPRHRFTAKLPDQ------IDWRPY 354
Query: 528 GAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
GAVT +KDQ CGSCWS + EG +FR++G LV LSEQ L+DCS
Sbjct: 355 GAVTPVKDQAVCGSCWSFGTVGELEGAYFRKTGRLVRLSEQQLVDCS 401
>UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42;
Coelomata|Rep: Cathepsin L2 precursor - Homo sapiens
(Human)
Length = 334
Score = 77.0 bits (181), Expect = 5e-13
Identities = 33/54 (61%), Positives = 42/54 (77%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           +VDWRK G VT +K+Q +CGSCW+ +     EGQ FR++G LVSLSEQNL+DCS
Sbjct: 117 SVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCS 170
Score = 50.8 bits (116), Expect = 4e-05
Identities = 47/190 (24%), Positives = 79/190 (41%), Gaps = 4/190 (2%)
Frame = +1
Query: 226 AAPSQLRKRGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMN 405
A +L E+ +R ++ ++ +I HN +Y G + + MN +GDM + EF + M
Sbjct: 34 ATHRRLYGANEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMG 93
Query: 406 GFNKTAKHNKNLYMKGGSVRGAKFIS-PANVKLPERWTGGSTAPSPTSRTKGS---VAHA 573
F ++ K + KG R F+ P +V ++ G P + GS +
Sbjct: 94 CF----RNQK--FRKGKVFREPLFLDLPKSVDWRKK---GYVTPVKNQKQCGSCWAFSAT 144
Query: 574 GLQHDWSFGKDSTSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGGIDTE 753
G F K VS Q G + F+Y+K+NGG+D+E
Sbjct: 145 GALEGQMFRKTGKLVS-----LSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSE 199
Query: 754 QTYLTRGVDD 783
++Y VD+
Sbjct: 200 ESYPYVAVDE 209
>UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2;
Acanthamoeba|Rep: Cysteine proteinase - Acanthamoeba
healyi
Length = 330
Score = 76.2 bits (179), Expect = 9e-13
Identities = 33/52 (63%), Positives = 41/52 (78%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
DWR+ GAVT +K+QG+CGSCWS + EG +F ++G LVSLSEQNLIDCS
Sbjct: 119 DWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKTGRLVSLSEQNLIDCS 170
Score = 35.1 bits (77), Expect = 2.1
Identities = 25/50 (50%), Positives = 31/50 (62%), Gaps = 4/50 (8%)
Frame = +2
Query: 578 FSTTGAL-GRTALPSVRLPGVALGAKPHRLLG---AYGNNGCNGGLMDNA 715
FSTTG+ G L + RL V+L + L+ +YGNNGCNGGLMD A
Sbjct: 141 FSTTGSTEGANFLKTGRL--VSLSEQ--NLIDCSVSYGNNGCNGGLMDYA 186
>UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine
protease; n=11; Callosobruchus maculatus|Rep: Putative
gut cathepsin L-like cysteine protease - Callosobruchus
maculatus (Southern cowpea weevil) (Pulse bruchid)
Length = 326
Score = 76.2 bits (179), Expect = 9e-13
Identities = 32/58 (55%), Positives = 42/58 (72%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
           E   AVDWR+ GAVT +KDQ  CGSCW+ + +   EGQ F+++G LVSLS Q L+DC+
Sbjct: 111 EEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCA 168
Score = 36.7 bits (81), Expect = 0.67
Identities = 15/46 (32%), Positives = 27/46 (58%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFV 393
           E+  R  ++ ++   I +HN+KYE G  S+   + ++ DM H EF+
Sbjct: 39  EEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEEFL 84
>UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3;
Curculionidae|Rep: Cysteine proteinase - Hypera postica
(alfalfa weevil)
Length = 324
Score = 75.8 bits (178), Expect = 1e-12
Identities = 34/57 (59%), Positives = 40/57 (70%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
E +VDWRK G VT +KDQG CGSCW+ + EG + R+SG LVSLSEQ LIDC
Sbjct: 111 EIPSSVDWRKEGRVTGVKDQGDCGSCWAFSITGSTEGAYARKSGKLVSLSEQQLIDC 167
Score = 47.6 bits (108), Expect = 4e-04
Identities = 23/51 (45%), Positives = 31/51 (60%)
Frame = +1
Query: 250 RGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTM 402
+ E++ R I+ ++ I HN YE G VSYK G+NK+ DM EF KTM
Sbjct: 40 QAEESKRFNIFTDNVRAIEAHNALYEQGKVSYKKGINKFTDMSQEEF-KTM 89
>UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2;
Platyhelminthes|Rep: Cathepsin L-like proteinase -
Echinococcus multilocularis
Length = 338
Score = 75.8 bits (178), Expect = 1e-12
Identities = 36/87 (41%), Positives = 48/87 (55%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEHXXXX 686
++DWRK G VT IKDQG CGSCW+ + EGQ R++G L+SLSEQ L+DCS +
Sbjct: 125 SIDWRKKGLVTPIKDQGDCGSCWAFSATGALEGQLKRKTGKLISLSEQQLVDCSTYTGNE 184
Query: 687 XXXXXXXXXLQVHQGQRGDRHRADLPY 767
+ + G +D PY
Sbjct: 185 GCNGGDMNDAFRYWMRNGAESESDYPY 211
Score = 39.1 bits (87), Expect = 0.13
Identities = 14/47 (29%), Positives = 28/47 (59%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVK 396
E++ RM+I+ + + HN++Y +GL +Y +N + D+ EF +
Sbjct: 46 EEHLRMRIFINNYLFVRWHNERYYLGLETYSTALNAFADLTLEEFAE 92
>UniRef50_P04988 Cluster: Cysteine proteinase 1 precursor; n=3;
Dictyostelium discoideum|Rep: Cysteine proteinase 1
precursor - Dictyostelium discoideum (Slime mold)
Length = 343
Score = 75.8 bits (178), Expect = 1e-12
Identities = 33/53 (62%), Positives = 38/53 (71%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
           A DWR  GAVT +K+QG+CGSCWS +     EGQHF     LVSLSEQNL+DC
Sbjct: 121 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDC 173
>UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine
proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like
midgut cysteine proteinase - Tenebrio molitor (Yellow
mealworm)
Length = 330
Score = 74.5 bits (175), Expect = 3e-12
Identities = 34/64 (53%), Positives = 46/64 (71%)
Frame = +3
Query: 477 HIAGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNL 656
+++ ++ A +VDWR + AV+++KDQG+CGSCWS + EGQ Q G L SLSEQNL
Sbjct: 110 YVSSKKPLAASVDWRSN-AVSEVKDQGQCGSCWSFSTTGAVEGQLALQRGRLTSLSEQNL 168
Query: 657 IDCS 668
IDCS
Sbjct: 169 IDCS 172
Score = 49.2 bits (112), Expect = 1e-04
Identities = 26/65 (40%), Positives = 39/65 (60%), Gaps = 1/65 (1%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMN-GFNKTAKHN 432
E+ R I+ ++ IA+HN K+E G V+Y MN++GDM EF+ +N G + KH
Sbjct: 44 EEIRRQLIFKDNVAKIAEHNAKFEKGEVTYSKAMNQFGDMSKEEFLAYVNRGKAQKPKHP 103
Query: 433 KNLYM 447
+NL M
Sbjct: 104 ENLRM 108
Score = 33.9 bits (74), Expect = 4.8
Identities = 21/47 (44%), Positives = 28/47 (59%), Gaps = 1/47 (2%)
Frame = +2
Query: 578 FSTTGAL-GRTALPSVRLPGVALGAKPHRLLGAYGNNGCNGGLMDNA 715
FSTTGA+ G+ AL RL ++ +YGN GC+GG MD+A
Sbjct: 143 FSTTGAVEGQLALQRGRLTSLS-EQNLIDCSSSYGNAGCDGGWMDSA 188
>UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin L
preproprotein; n=1; Monodelphis domestica|Rep:
PREDICTED: similar to cathepsin L preproprotein -
Monodelphis domestica
Length = 356
Score = 73.7 bits (173), Expect = 5e-12
Identities = 31/54 (57%), Positives = 42/54 (77%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
+VDWR HG VT I++QG+CG+CW+ + + EGQ FR++G LV LS+Q LIDCS
Sbjct: 118 SVDWRTHGYVTPIRNQGECGACWAFSTIGSLEGQLFRKTGRLVELSKQMLIDCS 171
Score = 45.6 bits (103), Expect = 0.001
Identities = 18/50 (36%), Positives = 33/50 (66%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMN 405
E++FR +++ ++ +I HN+ ++ G SY +GMN++GDM EF +N
Sbjct: 44 EESFRRQVWEKNLKLINDHNRLFKEGKKSYFMGMNQFGDMTDKEFESRLN 93
>UniRef50_Q9NH99 Cluster: Cathepsin L; n=1; Stylonychia lemnae|Rep:
Cathepsin L - Stylonychia lemnae
Length = 340
Score = 73.7 bits (173), Expect = 5e-12
Identities = 33/89 (37%), Positives = 49/89 (55%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEHXXXX 686
++DWR+ GAV +KDQG+CGSCW+ + + E ++F ++G L SLSEQ L+DCS++
Sbjct: 128 SIDWREKGAVNAVKDQGQCGSCWAFSTIASLESRYFIETGKLQSLSEQQLVDCSKNGNEG 187
Query: 687 XXXXXXXXXLQVHQGQRGDRHRADLPYEG 773
+ G D PY G
Sbjct: 188 CNGGDMGLAMDYIASAGGVETEKDYPYVG 216
>UniRef50_P43296 Cluster: Cysteine proteinase RD19a precursor; n=15;
Magnoliophyta|Rep: Cysteine proteinase RD19a precursor -
Arabidopsis thaliana (Mouse-ear cress)
Length = 368
Score = 73.7 bits (173), Expect = 5e-12
Identities = 31/51 (60%), Positives = 37/51 (72%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
DWR HGAVT +K+QG CGSCWS + EG +F +G LVSLSEQ L+DC
Sbjct: 140 DWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDC 190
>UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin
L-like protease; n=1; Nasonia vitripennis|Rep:
PREDICTED: similar to cathepsin L-like protease -
Nasonia vitripennis
Length = 353
Score = 73.3 bits (172), Expect = 6e-12
Identities = 38/96 (39%), Positives = 53/96 (55%), Gaps = 2/96 (2%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQG-KCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS-EHXXX 683
VDWR+ GAVT ++DQG CGSCW+ + E Q+F+++G L +LS QNLIDC+ E+
Sbjct: 136 VDWRQRGAVTPVRDQGLTCGSCWAFSAAGALEAQYFKKTGVLTALSAQNLIDCTMEYGNL 195
Query: 684 XXXXXXXXXXLQVHQGQRGDRHRADLPYEGS*RPIP 791
Q Q+G A+ YEG + P
Sbjct: 196 GCGGGSAALSFQFVVDQKGLEPEANYSYEGRTKECP 231
Score = 63.7 bits (148), Expect = 5e-09
Identities = 31/85 (36%), Positives = 53/85 (62%), Gaps = 1/85 (1%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
E+NFR ++ E++ IA+HNQK+++GL +YK+ +N++GDM+ E+ M+ N T K
Sbjct: 56 EENFRRSVFHENQRKIAEHNQKHDLGLFTYKVRINQFGDMMFEEYKNYMHAANNTITQLK 115
Query: 436 NLYMKGGSVRGAKFISPANVK-LPE 507
+ RG +FI P + + +PE
Sbjct: 116 RI------PRGDEFIKPKSAENVPE 134
>UniRef50_A1ZAU4 Cluster: CG4847-PD, isoform D; n=4; Sophophora|Rep:
CG4847-PD, isoform D - Drosophila melanogaster (Fruit
fly)
Length = 420
Score = 73.3 bits (172), Expect = 6e-12
Identities = 31/53 (58%), Positives = 38/53 (71%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
A DWR+HG VT +K QG CGSCW+ A EG FR++G L +LSEQNL+DC
Sbjct: 206 AFDWREHGGVTPVKFQGTCGSCWAFATTGAIEGHTFRKTGSLPNLSEQNLVDC 258
>UniRef50_P25804 Cluster: Cysteine proteinase 15A precursor; n=35;
Viridiplantae|Rep: Cysteine proteinase 15A precursor -
Pisum sativum (Garden pea)
Length = 363
Score = 73.3 bits (172), Expect = 6e-12
Identities = 30/51 (58%), Positives = 37/51 (72%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
DWR+ GAVT +KDQG CGSCW+ + EG H+ +G LVSLSEQ L+DC
Sbjct: 137 DWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDC 187
>UniRef50_P12412 Cluster: Vignain precursor (EC 3.4.22.-) (Bean
endopeptidase) (Cysteine proteinase)
(Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
Vignain-2]; n=16; Magnoliophyta|Rep: Vignain precursor
(EC 3.4.22.-) (Bean endopeptidase) (Cysteine proteinase)
(Sulfhydryl-endopeptidase) (SH-EP) [Contains: Vignain-1;
Vignain-2] - Vigna mungo (Rice bean) (Black gram)
Length = 362
Score = 73.3 bits (172), Expect = 6e-12
Identities = 31/55 (56%), Positives = 42/55 (76%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
+VDWRK GAVTD+KDQG+CGSCW+ + + EG + ++ LVSLSEQ L+DC +
Sbjct: 131 SVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDK 185
>UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4;
core eudicotyledons|Rep: Papain-like cysteine peptidase
XBCP3 - Arabidopsis thaliana (Mouse-ear cress)
Length = 437
Score = 72.9 bits (171), Expect = 8e-12
Identities = 32/64 (50%), Positives = 43/64 (67%)
Frame = +3
Query: 480 IAGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLI 659
+ G + +VDWRK GAVT++KDQG CG+CWS + EG + +G L+SLSEQ LI
Sbjct: 112 LGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELI 171
Query: 660 DCSE 671
DC +
Sbjct: 172 DCDK 175
>UniRef50_Q9U0C9 Cluster: Cysteine proteinase; n=4; Digenea|Rep:
Cysteine proteinase - Paragonimus westermani
Length = 272
Score = 72.9 bits (171), Expect = 8e-12
Identities = 39/101 (38%), Positives = 53/101 (52%), Gaps = 1/101 (0%)
Frame = +3
Query: 474 VHIAGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQN 653
V G + A +DWR GAVT +++QG CGSCW+ + EGQ F ++G LVSLS+Q
Sbjct: 46 VRPTGLKAAPERIDWRAKGAVTAVENQGSCGSCWAFSTAGNVEGQWFIKTGQLVSLSKQQ 105
Query: 654 LIDCSEHXXXXXXXXXXXXXLQV-HQGQRGDRHRADLPYEG 773
L+DC L++ H G G + D PY G
Sbjct: 106 LVDCDRAADGCNGGWPASSYLEIMHMG--GLESQDDYPYAG 144
>UniRef50_Q0ZM40 Cluster: Cysteine protease; n=32; Digenea|Rep:
Cysteine protease - Clonorchis sinensis
Length = 328
Score = 72.9 bits (171), Expect = 8e-12
Identities = 30/51 (58%), Positives = 40/51 (78%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
DWR+HGAV + DQGKCGSCW+ + + EGQ FR++G L++LSEQ L+DC
Sbjct: 120 DWREHGAVGPVLDQGKCGSCWAFSVIGNVEGQWFRKTGDLLALSEQQLVDC 170
>UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2;
Dictyostelium discoideum|Rep: Cysteine proteinase 2
precursor - Dictyostelium discoideum (Slime mold)
Length = 376
Score = 72.9 bits (171), Expect = 8e-12
Identities = 32/54 (59%), Positives = 39/54 (72%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
++DWR AVT IKDQG+CGSCWS + EG H ++ LVSLSEQNL+DCS
Sbjct: 126 SIDWRTKNAVTPIKDQGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCS 179
>UniRef50_Q1MTY3 Cluster: Cathepsin L; n=1; Aphrocallistes
vastus|Rep: Cathepsin L - Aphrocallistes vastus
Length = 329
Score = 71.7 bits (168), Expect = 2e-11
Identities = 31/53 (58%), Positives = 38/53 (71%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
VDWR G VT +K+QG+CGSCWS + EGQ+ +SG LVS SEQ L+DCS
Sbjct: 119 VDWRSKGVVTPVKNQGQCGSCWSFSATGSLEGQYAIKSGKLVSFSEQELVDCS 171
>UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase
precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin
L-like cysteine proteinase precursor - Acanthoscelides
obtectus (Bean weevil)
Length = 321
Score = 71.3 bits (167), Expect = 3e-11
Identities = 30/53 (56%), Positives = 40/53 (75%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
VDWR+ GAVT++K QG CGSCW+ + + EGQ F ++G L SLS QNL+DC+
Sbjct: 114 VDWREKGAVTEVKKQGNCGSCWAFSAVGSIEGQVFLKNGSLESLSAQNLVDCA 166
Score = 40.7 bits (91), Expect = 0.041
Identities = 15/49 (30%), Positives = 31/49 (63%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTM 402
E+ R +I+ + I +HN++Y G ++++G+N++GDM EF + +
Sbjct: 39 EEKRRFEIFKFNLRTIEEHNERYHNGEETFEMGINQFGDMTQEEFKRML 87
>UniRef50_Q94504 Cluster: Cysteine proteinase 7 precursor; n=10;
Dictyostelium discoideum|Rep: Cysteine proteinase 7
precursor - Dictyostelium discoideum (Slime mold)
Length = 460
Score = 71.3 bits (167), Expect = 3e-11
Identities = 34/60 (56%), Positives = 41/60 (68%), Gaps = 2/60 (3%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSG--YLVSLSEQNLIDCS 668
+A+ VDWR GAVT IK+QG+CG CWS + EG + +G LVSLSEQNLIDCS
Sbjct: 109 DASAQVDWRTQGAVTPIKNQGQCGGCWSFSTTGATEGAQYLANGKKNLVSLSEQNLIDCS 168
>UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate
cathepsin L; n=4; Danio rerio|Rep: Novel protein similar
to vertebrate cathepsin L - Danio rerio (Zebrafish)
(Brachydanio rerio)
Length = 334
Score = 70.9 bits (166), Expect = 3e-11
Identities = 30/53 (56%), Positives = 39/53 (73%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
+D+R G VT++KDQG CGSCWS + EGQ ++ +G LVSLSEQ L+DCS
Sbjct: 122 IDYRAKGYVTEVKDQGYCGSCWSFSTTGAIEGQMYKHTGRLVSLSEQQLVDCS 174
Score = 39.9 bits (89), Expect = 0.072
Identities = 23/84 (27%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Frame = +1
Query: 247 KRGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAK 426
+ ED R I+ + I K+N + GL +K+ MNKYGD+ E+ + + K
Sbjct: 39 EESEDVHRKTIWETNMQKIWKNNNDFSFGLSMFKMAMNKYGDLTSVEYKRLLGSKIKGTG 98
Query: 427 HNKNLYMKGGSVR-GAKFISPANV 495
+ K +R AK + N+
Sbjct: 99 NRKGKITSAQMLRLNAKRLGVTNI 122
>UniRef50_Q6DMN0 Cluster: Toxopain-2; n=4; Sarcocystidae|Rep:
Toxopain-2 - Toxoplasma gondii
Length = 422
Score = 70.9 bits (166), Expect = 3e-11
Identities = 31/58 (53%), Positives = 37/58 (63%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
E VDWR G VT +KDQ CGSCW+ + EG H ++G LVSLSEQ L+DCS
Sbjct: 204 ELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCS 261
>UniRef50_UPI0000E469FF Cluster: PREDICTED: similar to cysteine
protease; n=1; Strongylocentrotus purpuratus|Rep:
PREDICTED: similar to cysteine protease -
Strongylocentrotus purpuratus
Length = 494
Score = 70.5 bits (165), Expect = 4e-11
Identities = 29/53 (54%), Positives = 38/53 (71%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
DWR HGAVT +K+QG CGSCW+ + + EGQ + G L+SLSEQ L+DC +
Sbjct: 245 DWRTHGAVTPVKNQGMCGSCWAFSAIGNMEGQWQIKKGELISLSEQELVDCDK 297
>UniRef50_Q26534 Cluster: Cathepsin L precursor; n=8; Eukaryota|Rep:
Cathepsin L precursor - Schistosoma mansoni (Blood
fluke)
Length = 319
Score = 70.5 bits (165), Expect = 4e-11
Identities = 29/51 (56%), Positives = 39/51 (76%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
DWR+ GAVT++K+QG CGSCW+ + E Q FR++G L+SLSEQ L+DC
Sbjct: 110 DWREKGAVTEVKNQGMCGSCWAFSTTGNVESQWFRKTGKLLSLSEQQLVDC 160
>UniRef50_O23791 Cluster: Fruit bromelain precursor; n=16;
Bromeliaceae|Rep: Fruit bromelain precursor - Ananas
comosus (Pineapple)
Length = 351
Score = 70.5 bits (165), Expect = 4e-11
Identities = 28/54 (51%), Positives = 41/54 (75%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
++DWR +GAV ++K+Q CGSCWS A + EG + ++GYLVSLSEQ ++DC+
Sbjct: 126 SIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCA 179
>UniRef50_O22499 Cluster: Cysteine proteinase Mir2; n=3;
Magnoliophyta|Rep: Cysteine proteinase Mir2 - Zea mays
(Maize)
Length = 493
Score = 70.1 bits (164), Expect = 6e-11
Identities = 31/64 (48%), Positives = 45/64 (70%)
Frame = +3
Query: 480 IAGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLI 659
+AG+ + AVDWR+ GAV ++KDQG+CG CW+ + + EG + +G L+SLSEQ LI
Sbjct: 159 LAGE-QLPDAVDWRERGAVAEVKDQGQCGGCWAFSAVAAVEGINKIVTGSLISLSEQELI 217
Query: 660 DCSE 671
DC +
Sbjct: 218 DCDK 221
>UniRef50_Q25547 Cluster: Cysteine proteinase homolog; n=1;
Naegleria fowleri|Rep: Cysteine proteinase homolog -
Naegleria fowleri
Length = 347
Score = 70.1 bits (164), Expect = 6e-11
Identities = 31/59 (52%), Positives = 40/59 (67%)
Frame = +3
Query: 498 AAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
A + DWR+HGAVT +K+QG CGSCW+ + EGQ + G LVSLSEQ L+DC +
Sbjct: 122 APTSFDWRQHGAVTRVKNQGACGSCWTFSTTGNVEGQWAIKKGKLVSLSEQQLVDCDHN 180
Score = 33.9 bits (74), Expect = 4.8
Identities = 13/23 (56%), Positives = 17/23 (73%)
Frame = +1
Query: 715 FKYIKDNGGIDTEQTYLTRGVDD 783
F+Y+ NGG+DTE +Y GVDD
Sbjct: 203 FQYVIKNGGLDTEDSYPYEGVDD 225
>UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor;
n=35; Fasciola|Rep: Cathepsin L-like proteinase
precursor - Fasciola hepatica (Liver fluke)
Length = 326
Score = 70.1 bits (164), Expect = 6e-11
Identities = 28/62 (45%), Positives = 38/62 (61%)
Frame = +3
Query: 483 AGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLID 662
A R +DWR+ G VT++KDQG CGSCW+ + EGQ+ + +S SEQ L+D
Sbjct: 103 ANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVD 162
Query: 663 CS 668
CS
Sbjct: 163 CS 164
Score = 43.2 bits (97), Expect = 0.008
Identities = 17/45 (37%), Positives = 30/45 (66%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEF 390
+D R I+ ++ I +HN ++++GLV+Y LG+N++ DM EF
Sbjct: 36 DDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEF 80
Score = 34.7 bits (76), Expect = 2.7
Identities = 19/49 (38%), Positives = 29/49 (59%), Gaps = 3/49 (6%)
Frame = +2
Query: 578 FSTTGALGRTALPSVRLPGVALGAKPHRLL---GAYGNNGCNGGLMDNA 715
FSTTG + + + R ++ +L+ G +GNNGC+GGLM+NA
Sbjct: 135 FSTTGTMEGQYMKNER---TSISFSEQQLVDCSGPWGNNGCSGGLMENA 180
>UniRef50_Q0J0J4 Cluster: Os09g0497500 protein; n=9; Oryza
sativa|Rep: Os09g0497500 protein - Oryza sativa subsp.
japonica (Rice)
Length = 349
Score = 69.7 bits (163), Expect = 8e-11
Identities = 29/55 (52%), Positives = 41/55 (74%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
+VDWRK GAV ++K+QG CGSCW+ + + EG + ++G LVSLSEQ L+DC +
Sbjct: 125 SVDWRKKGAVVEVKNQGDCGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDD 179
>UniRef50_Q86GZ3 Cluster: Midgut cysteine proteinase 4; n=1;
Rhipicephalus appendiculatus|Rep: Midgut cysteine
proteinase 4 - Rhipicephalus appendiculatus (Brown ear
tick)
Length = 345
Score = 69.7 bits (163), Expect = 8e-11
Identities = 27/53 (50%), Positives = 42/53 (79%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
++WR++G VT +K+QG+CGSCW+ + EGQ F+++ L+SLSEQNL+DC+
Sbjct: 130 IEWRENGFVTPVKNQGQCGSCWAFSSTGALEGQVFKRTRRLISLSEQNLMDCA 182
>UniRef50_UPI00015A56AC Cluster: hypothetical protein LOC550326;
n=2; Danio rerio|Rep: hypothetical protein LOC550326 -
Danio rerio
Length = 531
Score = 69.3 bits (162), Expect = 1e-10
Identities = 30/54 (55%), Positives = 38/54 (70%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
+VDWR +GAVT +KDQ CGSCWS A EG F ++G L SLS+Q L+DC+
Sbjct: 315 SVDWRLYGAVTPVKDQAVCGSCWSFATTGTLEGALFLKTGQLTSLSQQMLVDCT 368
>UniRef50_Q9LNC1 Cluster: F9P14.12 protein; n=1; Arabidopsis
thaliana|Rep: F9P14.12 protein - Arabidopsis thaliana
(Mouse-ear cress)
Length = 343
Score = 69.3 bits (162), Expect = 1e-10
Identities = 31/53 (58%), Positives = 39/53 (73%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
AVDWR GAVT I++QGKCG CW+ + + EG + ++G LVSLSEQ LIDC
Sbjct: 130 AVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDC 182
>UniRef50_Q01LY2 Cluster: H0825G02.10 protein; n=24;
Magnoliophyta|Rep: H0825G02.10 protein - Oryza sativa
(Rice)
Length = 339
Score = 69.3 bits (162), Expect = 1e-10
Identities = 30/55 (54%), Positives = 37/55 (67%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
VDWR GAVT IKDQG+CG CW+ + + EG +G L+SLSEQ L+DC H
Sbjct: 127 VDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVH 181
>UniRef50_Q75JR5 Cluster: Similar to Sarcophaga peregrina (Flesh
fly) (Boettcherisca peregrina). Cathepsin L; n=2;
Dictyostelium discoideum|Rep: Similar to Sarcophaga
peregrina (Flesh fly) (Boettcherisca peregrina).
Cathepsin L - Dictyostelium discoideum (Slime mold)
Length = 265
Score = 69.3 bits (162), Expect = 1e-10
Identities = 28/52 (53%), Positives = 37/52 (71%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
DWR HGAV +K+QG C SCWS + L EG ++ + G L+ LSEQNL+DC+
Sbjct: 52 DWRDHGAVGKVKNQGSCASCWSFSALGALEGHYYIKYGELLDLSEQNLVDCA 103
>UniRef50_Q22LI1 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 987
Score = 68.9 bits (161), Expect = 1e-10
Identities = 28/52 (53%), Positives = 36/52 (69%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
+DWR GAVT +K QGKCGSCWS + L E + ++G L+ LSEQ L+DC
Sbjct: 127 IDWRNKGAVTSVKRQGKCGSCWSFSAAGLMEAFQYFKTGNLIDLSEQQLVDC 178
>UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes
abbreviatus|Rep: Cathepsin L 2 - Diaprepes abbreviatus
(Sugarcane rootstalk borer weevil)
Length = 348
Score = 68.9 bits (161), Expect = 1e-10
Identities = 28/53 (52%), Positives = 38/53 (71%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
+DWR+ GAVT +K+Q CGSCWS + E Q F+++ L+SLSEQ L+DCS
Sbjct: 139 IDWRQKGAVTPVKNQRNCGSCWSFSATGALEAQWFKKTNKLISLSEQQLVDCS 191
Score = 55.6 bits (128), Expect = 1e-06
Identities = 51/182 (28%), Positives = 73/182 (40%), Gaps = 13/182 (7%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
E+ +R ++ E+ I +HN+ YEMGL SY++ MN GD+ EF++ ++
Sbjct: 44 ENEYRQSVFMENLFQINEHNKLYEMGLSSYQMAMNHLGDLTKDEFMRIYTVNMPQLPQSE 103
Query: 436 NLYMKGGSVRGAKFISP-ANVKLPERWTGGSTAPSPTSRTKGSVAHAGLQHD----WSFG 600
NL + + + LP R KG+V Q + WSF
Sbjct: 104 NLSDSEPWLDLPQDLQGFVTYALPTNLDEVDLPTDIDWRQKGAVTPVKNQRNCGSCWSF- 162
Query: 601 KDSTSVSPATWCXXXXXXXXXXXXXXEQRLQR----GAHG----QRFKYIKDNGGIDTEQ 756
+T A W R G HG F YIK+NGGIDTEQ
Sbjct: 163 -SATGALEAQWFKKTNKLISLSEQQLVDCSGRYGNHGCHGGWMHWAFGYIKENGGIDTEQ 221
Query: 757 TY 762
+Y
Sbjct: 222 SY 223
>UniRef50_Q05094 Cluster: Cysteine proteinase 2 precursor; n=61;
Leishmania|Rep: Cysteine proteinase 2 precursor -
Leishmania pifanoi
Length = 444
Score = 68.9 bits (161), Expect = 1e-10
Identities = 30/55 (54%), Positives = 38/55 (69%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
AVDWR+ GAVT +KDQG CGSCW+ + + EGQ + LVSLSEQ L+ C +
Sbjct: 129 AVDWREKGAVTPVKDQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDD 183
>UniRef50_Q9UBX1 Cluster: Cathepsin F precursor; n=19;
Bilateria|Rep: Cathepsin F precursor - Homo sapiens
(Human)
Length = 484
Score = 68.9 bits (161), Expect = 1e-10
Identities = 30/53 (56%), Positives = 36/53 (67%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
DWR GAVT +KDQG CGSCW+ + EGQ F G L+SLSEQ L+DC +
Sbjct: 276 DWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK 328
>UniRef50_A5HIJ5 Cluster: Cysteine protease Cp5; n=6;
Magnoliophyta|Rep: Cysteine protease Cp5 - Actinidia
deliciosa (Kiwi)
Length = 509
Score = 68.5 bits (160), Expect = 2e-10
Identities = 41/118 (34%), Positives = 58/118 (49%), Gaps = 4/118 (3%)
Frame = +3
Query: 324 RNGPRFLQAGHEQVRRHAPPRVREDYERLQQNCQTQQESVHEGWERPRG*VHIAGQREAA 503
+NG R GH E++ + + + S ER R A + AA
Sbjct: 85 KNGERGASGGHLVGLNKFADMSNEEFREVYVSKVKKPTSKRMAIERRRQGKAAAAKAVAA 144
Query: 504 ----GAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
++DWRK+G VT +KDQG CGSCW+ + EG + +G L+SLSEQ L+DC
Sbjct: 145 CDGPTSLDWRKYGIVTGVKDQGDCGSCWAFSSTGAIEGINALANGDLISLSEQELVDC 202
>UniRef50_Q7QCZ7 Cluster: ENSANGP00000018713; n=1; Anopheles gambiae
str. PEST|Rep: ENSANGP00000018713 - Anopheles gambiae
str. PEST
Length = 559
Score = 68.5 bits (160), Expect = 2e-10
Identities = 30/64 (46%), Positives = 42/64 (65%)
Frame = +3
Query: 480 IAGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLI 659
+AG + + DWR HGAVT++K+QG CGSCW+ + + EG H ++ L S SEQ LI
Sbjct: 333 VAGVGDLPRSFDWRDHGAVTEVKNQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELI 392
Query: 660 DCSE 671
DC +
Sbjct: 393 DCDK 396
>UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin
heavy chain; n=3; Amniota|Rep: PREDICTED: similar to
ferritin heavy chain - Ornithorhynchus anatinus
Length = 338
Score = 68.1 bits (159), Expect = 2e-10
Identities = 30/58 (51%), Positives = 38/58 (65%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
E VDWR G VT +K+QG CGSCW+ + E F+ +G +VSLSEQNL+DCS
Sbjct: 119 EGPEEVDWRTKGYVTPVKNQGLCGSCWAFSATGALEALVFKTTGKMVSLSEQNLVDCS 176
Score = 51.6 bits (118), Expect = 2e-05
Identities = 47/183 (25%), Positives = 71/183 (38%), Gaps = 7/183 (3%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
E+ FR + ++ +I +HN++ G SY+L MN +GD + E + +NGF
Sbjct: 44 EEVFRRAAWEKNVRVIERHNEEMSQGKHSYRLAMNHFGDQTNEELHERLNGFRPDL---- 99
Query: 436 NLYMKGGSVRGAKFISPANVKLPERWTGGSTAPSPTSRTKGSVAHAGLQHD-WSF---GK 603
GG++R + A + W G T V + GL W+F G
Sbjct: 100 -----GGALRSGR--EQARFRSKTSWEGPEEVDWRTKGYVTPVKNQGLCGSCWAFSATGA 152
Query: 604 DSTSVSPATWCXXXXXXXXXXXXXXEQ---RLQRGAHGQRFKYIKDNGGIDTEQTYLTRG 774
V T Q + G + F+Y++ NGGID E Y G
Sbjct: 153 LEALVFKTTGKMVSLSEQNLVDCSWRQGNVGCRGGQYIGAFEYVRANGGIDAEDLYPYLG 212
Query: 775 VDD 783
DD
Sbjct: 213 RDD 215
>UniRef50_Q4SJ28 Cluster: Chromosome 21 SCAF14577, whole genome
shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome 21 SCAF14577, whole genome shotgun sequence -
Tetraodon nigroviridis (Green puffer)
Length = 406
Score = 68.1 bits (159), Expect = 2e-10
Identities = 29/58 (50%), Positives = 41/58 (70%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
E +VDWRK G V+ +++QG C SCW+ + L EGQ +++G+LV LS QNL+DCS
Sbjct: 154 ETPPSVDWRKAGLVSPVQNQGFCNSCWAFSSLGALEGQMKKRTGFLVPLSPQNLLDCS 211
>UniRef50_Q7XRA0 Cluster: OSJNBb0085F13.15 protein; n=3; Oryza
sativa|Rep: OSJNBb0085F13.15 protein - Oryza sativa
subsp. japonica (Rice)
Length = 383
Score = 68.1 bits (159), Expect = 2e-10
Identities = 32/65 (49%), Positives = 42/65 (64%), Gaps = 2/65 (3%)
Frame = +3
Query: 483 AGQREAA--GAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNL 656
AG+R A +VDWRK GAVT K QG+C +CW+ A + E H + G L+SLSEQ L
Sbjct: 153 AGRRTVAVPESVDWRKEGAVTPAKHQGQCAACWAFAAVAAIESLHKIKGGDLISLSEQEL 212
Query: 657 IDCSE 671
+DC +
Sbjct: 213 VDCDD 217
>UniRef50_Q23H15 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 370
Score = 68.1 bits (159), Expect = 2e-10
Identities = 29/55 (52%), Positives = 37/55 (67%)
Frame = +3
Query: 501 AGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
A ++DWR GAVT +K+QG CGSCWS + L E +F Q+ LV SEQ L+DC
Sbjct: 163 AASIDWRTKGAVTSVKNQGNCGSCWSFSAAGLMESFNFIQNKALVDFSEQQLLDC 217
>UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:
Cathepsin - Petromyzon marinus (Sea lamprey)
Length = 333
Score = 67.7 bits (158), Expect = 3e-10
Identities = 29/54 (53%), Positives = 37/54 (68%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
VDWR G VT +K+QG CGS W+ + EGQHF +G L SLSEQ L+DC++
Sbjct: 121 VDWRLKGYVTPVKEQGLCGSSWAFSATGSLEGQHFAATGNLTSLSEQQLVDCTK 174
Score = 38.3 bits (85), Expect = 0.22
Identities = 17/51 (33%), Positives = 30/51 (58%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNG 408
ED R ++ ++ + +HN + G VS+ LG+NKY D+ HE+ + + G
Sbjct: 43 EDAHRRDVFEQNLKRVLQHNLLADEGNVSFHLGINKYSDLELHEYHEKVVG 93
>UniRef50_Q5ILG5 Cluster: Cysteine protease gp3a; n=3;
Magnoliophyta|Rep: Cysteine protease gp3a - Zingiber
officinale (Ginger)
Length = 475
Score = 67.7 bits (158), Expect = 3e-10
Identities = 28/54 (51%), Positives = 40/54 (74%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
++DWR+ GAV +K+QG+CGSCW+ A + EG + +G L+SLSEQ L+DCS
Sbjct: 146 SIDWREKGAVVAVKNQGRCGSCWAFAAIAAVEGINQIVTGDLISLSEQQLVDCS 199
Score = 38.7 bits (86), Expect = 0.17
Identities = 12/43 (27%), Positives = 29/43 (67%)
Frame = +1
Query: 262 NFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEF 390
++R++++ E+ + +HN + G +Y+LGMN++ D+ + E+
Sbjct: 70 DYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNRFADLTNEEY 112
>UniRef50_Q22W19 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 332
Score = 67.7 bits (158), Expect = 3e-10
Identities = 34/93 (36%), Positives = 50/93 (53%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
+ A ++DWR+ AVT +K+QG+CGSCW+ + + EG + +G L S SEQ ++DCS+
Sbjct: 122 DVAPSIDWRQKNAVTPVKNQGQCGSCWAFSTVGGLEGAYAIATGNLTSFSEQQIVDCSKA 181
Query: 675 XXXXXXXXXXXXXLQVHQGQRGDRHRADLPYEG 773
V Q G AD PY+G
Sbjct: 182 NAGCNGGDLPPAYKYV--VQNGIETEADYPYKG 212
>UniRef50_A0CNM8 Cluster: Chromosome undetermined scaffold_22, whole
genome shotgun sequence; n=7; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_22,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 350
Score = 67.7 bits (158), Expect = 3e-10
Identities = 29/56 (51%), Positives = 38/56 (67%)
Frame = +3
Query: 501 AGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
A +DWR GAV +KDQG+CGSCW+ + + EG + Q+G L LSEQ L+DCS
Sbjct: 143 ATPIDWRTRGAVNKVKDQGQCGSCWAFSTTGVLEGFYKVQTGELPDLSEQQLVDCS 198
>UniRef50_O65493 Cluster: Xylem cysteine proteinase 1 precursor;
n=8; Magnoliophyta|Rep: Xylem cysteine proteinase 1
precursor - Arabidopsis thaliana (Mouse-ear cress)
Length = 355
Score = 67.7 bits (158), Expect = 3e-10
Identities = 30/53 (56%), Positives = 38/53 (71%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
+VDWRK GAV +KDQG+CGSCW+ + + EG + +G L SLSEQ LIDC
Sbjct: 140 SVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDC 192
>UniRef50_Q9FJ47 Cluster: Senescence-specific cysteine protease;
n=23; Magnoliophyta|Rep: Senescence-specific cysteine
protease - Arabidopsis thaliana (Mouse-ear cress)
Length = 346
Score = 67.3 bits (157), Expect = 4e-10
Identities = 29/53 (54%), Positives = 37/53 (69%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
+VDWRK GAVT IK+QG CG CW+ + + EG + G L+SLSEQ L+DC
Sbjct: 133 SVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDC 185
>UniRef50_Q650Y2 Cluster: Putative cysteine proteinase; n=10;
Liliopsida|Rep: Putative cysteine proteinase - Oryza
sativa subsp. japonica (Rice)
Length = 416
Score = 67.3 bits (157), Expect = 4e-10
Identities = 28/52 (53%), Positives = 39/52 (75%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
DWR +GAVTD+KDQG+CGSCW + + EG + +G L++LSEQ ++DCS
Sbjct: 119 DWRLNGAVTDVKDQGQCGSCWVFSAVGAVEGINAIMTGNLLTLSEQQVLDCS 170
>UniRef50_P43297 Cluster: Cysteine proteinase RD21a precursor;
n=176; Viridiplantae|Rep: Cysteine proteinase RD21a
precursor - Arabidopsis thaliana (Mouse-ear cress)
Length = 462
Score = 67.3 bits (157), Expect = 4e-10
Identities = 28/57 (49%), Positives = 40/57 (70%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
E ++DWRK GAV ++KDQG CGSCW+ + + EG + +G L++LSEQ L+DC
Sbjct: 136 ELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDC 192
>UniRef50_UPI00015B524C Cluster: PREDICTED: similar to cathepsin F
like protease; n=1; Nasonia vitripennis|Rep: PREDICTED:
similar to cathepsin F like protease - Nasonia
vitripennis
Length = 1036
Score = 66.9 bits (156), Expect = 6e-10
Identities = 28/53 (52%), Positives = 36/53 (67%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
DWR H VT +KDQG CGSCW+ + EGQ+ + G L+SLSEQ L+DC +
Sbjct: 822 DWRHHNVVTPVKDQGSCGSCWAFSVTGNIEGQYAIKHGELLSLSEQELVDCDK 874
>UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin S
preproprotein; n=2; Tribolium castaneum|Rep: PREDICTED:
similar to cathepsin S preproprotein - Tribolium
castaneum
Length = 525
Score = 66.9 bits (156), Expect = 6e-10
Identities = 30/59 (50%), Positives = 36/59 (61%)
Frame = +3
Query: 489 QREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
Q + VDWR G VT +K QGKCGSCW+ A L E + +Q G V LSEQ L+DC
Sbjct: 32 QSDLPDMVDWRLQGVVTPVKRQGKCGSCWAFAILGATEAHYRKQRGSFVILSEQQLVDC 90
Score = 62.5 bits (145), Expect = 1e-08
Identities = 27/52 (51%), Positives = 33/52 (63%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
VDWR G VT +K QGKCG+CW+ A + E Q+ G V LSEQ L+DC
Sbjct: 315 VDWRLRGVVTPVKHQGKCGTCWAFAIIGATEAQYRIHRGSFVILSEQQLVDC 366
Score = 34.7 bits (76), Expect = 2.7
Identities = 16/44 (36%), Positives = 23/44 (52%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 387
E+NFR I+ + I HN++Y GL +Y L +N D E
Sbjct: 241 EENFRRAIFEKTFQEIKHHNERYRKGLETYYLRINDLSDYTDEE 284
>UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2;
Taeniidae|Rep: Cathepsin L-like cysteine proteinase -
Taenia solium (Pork tapeworm)
Length = 339
Score = 66.9 bits (156), Expect = 6e-10
Identities = 27/53 (50%), Positives = 37/53 (69%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
VDWR VT++K+QG CGSCW+ + EG +++G L+SLSEQ L+DCS
Sbjct: 128 VDWRDKNLVTEVKNQGNCGSCWAFSSTGALEGAFAKKTGKLISLSEQQLVDCS 180
>UniRef50_Q239L8 Cluster: Papain family cysteine protease containing
protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 323
Score = 66.9 bits (156), Expect = 6e-10
Identities = 30/59 (50%), Positives = 38/59 (64%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
+A +DW GAVT +KDQG+CGSCWS + EG F + L SLSEQ L+DCS+
Sbjct: 122 DAGVEIDWTTKGAVTPVKDQGQCGSCWSFSTTGAVEGALFLSTKKLTSLSEQYLVDCSK 180
>UniRef50_Q23030 Cluster: Putative uncharacterized protein; n=2;
Caenorhabditis|Rep: Putative uncharacterized protein -
Caenorhabditis elegans
Length = 383
Score = 66.9 bits (156), Expect = 6e-10
Identities = 27/53 (50%), Positives = 39/53 (73%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
++DWR+ G +T IK+QG+CGSCW+ A + E Q+ + G LVSLSEQ ++DC
Sbjct: 171 SIDWREQGKLTPIKNQGQCGSCWAFATVASVEAQNAIKKGKLVSLSEQEMVDC 223
>UniRef50_P14080 Cluster: Chymopapain precursor; n=22; core
eudicotyledons|Rep: Chymopapain precursor - Carica
papaya (Papaya)
Length = 352
Score = 66.9 bits (156), Expect = 6e-10
Identities = 27/56 (48%), Positives = 39/56 (69%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
++DWR GAVT +K+QG CGSCW+ + + EG + +G L+ LSEQ L+DC +H
Sbjct: 138 SIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH 193
>UniRef50_UPI00006CFE70 Cluster: Papain family cysteine protease
containing protein; n=1; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 336
Score = 66.5 bits (155), Expect = 7e-10
Identities = 25/64 (39%), Positives = 41/64 (64%)
Frame = +3
Query: 477 HIAGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNL 656
H A + + DWR +G ++D+KDQG+CGSCW+ + + E +F ++ +S SEQ L
Sbjct: 118 HTAQDVQLPASFDWRDYGILSDVKDQGQCGSCWAFSTTGILEALYFMENRQKISFSEQQL 177
Query: 657 IDCS 668
+DC+
Sbjct: 178 VDCA 181
>UniRef50_Q8LIN6 Cluster: Putative cysteine proteinase; n=3; Oryza
sativa|Rep: Putative cysteine proteinase - Oryza sativa
subsp. japonica (Rice)
Length = 352
Score = 66.5 bits (155), Expect = 7e-10
Identities = 28/55 (50%), Positives = 39/55 (70%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
VDWR+ GAVT +K+Q CG CW+ + + EG H +G LVSLSEQ L+DC+++
Sbjct: 133 VDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLSEQQLLDCADN 187
>UniRef50_Q64F55 Cluster: Cysteine proteinase; n=5; Bodonidae|Rep:
Cysteine proteinase - Cryptobia salmositica
Length = 443
Score = 66.5 bits (155), Expect = 7e-10
Identities = 28/52 (53%), Positives = 36/52 (69%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
+DWR GAVT +K+QG CGSCWS + EGQH +G LV++SEQ L+ C
Sbjct: 118 IDWRLKGAVTPVKNQGACGSCWSFSTTGNIEGQHAIATGQLVAVSEQELVSC 169
>UniRef50_Q23H10 Cluster: Papain family cysteine protease containing
protein; n=14; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 336
Score = 66.5 bits (155), Expect = 7e-10
Identities = 28/55 (50%), Positives = 37/55 (67%)
Frame = +3
Query: 501 AGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
A ++DWR GAVT +K+QG CGSCWS + + E +F Q+ LV SEQ L+DC
Sbjct: 128 ADSIDWRTKGAVTSVKNQGGCGSCWSFSAAAVMESFNFIQNKALVDFSEQQLVDC 182
>UniRef50_P25774 Cluster: Cathepsin S precursor; n=78;
Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens
(Human)
Length = 331
Score = 66.5 bits (155), Expect = 7e-10
Identities = 28/54 (51%), Positives = 39/54 (72%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
+VDWR+ G VT++K QG CG+CW+ + + E Q ++G LVSLS QNL+DCS
Sbjct: 118 SVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCS 171
Score = 48.0 bits (109), Expect = 3e-04
Identities = 48/192 (25%), Positives = 81/192 (42%), Gaps = 7/192 (3%)
Frame = +1
Query: 238 QLRKRGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNK 417
Q +++ E+ R I+ ++ + HN ++ MG+ SY LGMN GDM E + M+
Sbjct: 38 QYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRV 97
Query: 418 TAKHNKNLYMKGGSVRGAKFISPANVKLPERWTGGSTAPSPTSRTKGSVAHAGLQHDWSF 597
++ +N+ K R I P +V E+ G T + +GS W+F
Sbjct: 98 PSQWQRNITYKSNPNR----ILPDSVDWREK--GCVT----EVKYQGSCGAC-----WAF 142
Query: 598 ---GKDSTSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHG----QRFKYIKDNGGIDTEQ 756
G + T E+ +G +G F+YI DN GID++
Sbjct: 143 SAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDA 202
Query: 757 TYLTRGVDDQFQ 792
+Y + +D + Q
Sbjct: 203 SYPYKAMDQKCQ 214
>UniRef50_Q4SIQ6 Cluster: Chromosome 21 SCAF14577, whole genome
shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome 21 SCAF14577, whole genome shotgun sequence -
Tetraodon nigroviridis (Green puffer)
Length = 478
Score = 66.1 bits (154), Expect = 1e-09
Identities = 31/58 (53%), Positives = 38/58 (65%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
E ++DWR +GAVT +KDQ CGSCWS A EG F ++G L LS+Q LIDCS
Sbjct: 204 EVPESLDWRLYGAVTPVKDQAICGSCWSFATTGTIEGALFLKTGSLQVLSQQMLIDCS 261
>UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase
precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine
proteinase precursor - Heterodera glycines (Soybean cyst
nematode worm)
Length = 353
Score = 66.1 bits (154), Expect = 1e-09
Identities = 28/54 (51%), Positives = 40/54 (74%), Gaps = 1/54 (1%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQ-HFRQSGYLVSLSEQNLIDCS 668
+DWR+ GAVT++KDQG CGSCW+ + EG +++ ++SLSEQNL+DCS
Sbjct: 139 LDWREKGAVTEVKDQGDCGSCWAFSATGAIEGALAQKKASKIISLSEQNLVDCS 192
>UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 392
Score = 66.1 bits (154), Expect = 1e-09
Identities = 28/58 (48%), Positives = 36/58 (62%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
E +DWR +GAV K QG CGSCW+ A E HF Q G L++L+EQ L+DC+
Sbjct: 176 EVPDQLDWRNYGAVNPAKGQGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCT 233
>UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep:
Cathepsin K - Danio rerio (Zebrafish) (Brachydanio
rerio)
Length = 333
Score = 65.7 bits (153), Expect = 1e-09
Identities = 27/53 (50%), Positives = 37/53 (69%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
++D+RK G VT +K+QG CGSCW+ + + EGQ + G LV LS QNL+DC
Sbjct: 121 SIDYRKLGYVTSVKNQGSCGSCWAFSSVGALEGQLMKTKGQLVDLSPQNLVDC 173
Score = 45.6 bits (103), Expect = 0.001
Identities = 19/51 (37%), Positives = 32/51 (62%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNG 408
E++ R I+ ++ I HN++YE+G+ +Y LGMN +GDM E + + G
Sbjct: 46 EESIRRTIWEKNMLFIEAHNKEYELGIHTYDLGMNHFGDMTLEEVAEKVMG 96
>UniRef50_Q9FTI3 Cluster: Cysteine proteinase-like; n=5; Oryza
sativa|Rep: Cysteine proteinase-like - Oryza sativa
subsp. japonica (Rice)
Length = 360
Score = 65.7 bits (153), Expect = 1e-09
Identities = 29/62 (46%), Positives = 40/62 (64%)
Frame = +3
Query: 483 AGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLID 662
A + +VDWR GAVT++K+Q CGSCW+ A + EG +G LVSLSEQ ++D
Sbjct: 132 ADDTDVPDSVDWRARGAVTEVKNQRSCGSCWAFAAVAATEGLVQLATGNLVSLSEQQVLD 191
Query: 663 CS 668
C+
Sbjct: 192 CT 193
>UniRef50_Q40261 Cluster: Cysteine proteinase; n=3; core
eudicotyledons|Rep: Cysteine proteinase -
Mesembryanthemum crystallinum (Common ice plant)
Length = 367
Score = 65.7 bits (153), Expect = 1e-09
Identities = 28/57 (49%), Positives = 38/57 (66%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
E ++DWR GAVT +K+QG+CG CW+ + EG + +G L+SLSEQ LIDC
Sbjct: 125 EVPRSIDWRVKGAVTPVKNQGRCGGCWAFSAAAAVEGINQITTGQLISLSEQQLIDC 181
>UniRef50_Q24E33 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 328
Score = 65.7 bits (153), Expect = 1e-09
Identities = 26/53 (49%), Positives = 36/53 (67%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
V+W GAVT +K+QG CGSCW+ + EG +F ++ L+S SEQ L+DCS
Sbjct: 131 VNWTAQGAVTPVKNQGSCGSCWAFSTTGALEGSYFLKNNQLISFSEQQLVDCS 183
>UniRef50_Q10716 Cluster: Cysteine proteinase 1 precursor; n=46;
Eukaryota|Rep: Cysteine proteinase 1 precursor - Zea
mays (Maize)
Length = 371
Score = 65.7 bits (153), Expect = 1e-09
Identities = 27/51 (52%), Positives = 33/51 (64%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
DWR HGAV +K+QG CGSCWS + EG H+ +G L LSEQ +DC
Sbjct: 142 DWRDHGAVGPVKNQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDC 192
>UniRef50_Q0JMZ4 Cluster: Os01g0347600 protein; n=16; Oryza
sativa|Rep: Os01g0347600 protein - Oryza sativa subsp.
japonica (Rice)
Length = 343
Score = 65.3 bits (152), Expect = 2e-09
Identities = 28/52 (53%), Positives = 35/52 (67%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
+DWR GAVT +KDQG CGSCW+ A + EG ++G L LSEQ L+DC
Sbjct: 129 IDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDC 180
>UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1;
Dictyostelium discoideum AX4|Rep: Counting factor
associated protein - Dictyostelium discoideum AX4
Length = 531
Score = 65.3 bits (152), Expect = 2e-09
Identities = 29/59 (49%), Positives = 35/59 (59%)
Frame = +3
Query: 492 REAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
R VDWR VT +KDQG CGSCW+ EG + +G LVSLSEQ L+DC+
Sbjct: 307 RSIPSTVDWRNQNCVTPVKDQGICGSCWTFGSTGSLEGTNCVTNGELVSLSEQQLVDCA 365
>UniRef50_Q10834 Cluster: Preprocathepsin cathepsin L; n=18;
Schistosoma|Rep: Preprocathepsin cathepsin L -
Schistosoma japonicum (Blood fluke)
Length = 331
Score = 65.3 bits (152), Expect = 2e-09
Identities = 29/51 (56%), Positives = 34/51 (66%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
DWR HGAVT +K QG CGSCW+ + EGQ R+ LV LSEQ L+DC
Sbjct: 121 DWRDHGAVTAVKHQGLCGSCWAFSATGAIEGQLRRKHKKLVKLSEQQLVDC 171
Score = 33.1 bits (72), Expect = 8.3
Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 1/50 (2%)
Frame = +1
Query: 256 EDNFRMK-IYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTM 402
+D R K I+ I +HN ++++GL Y +G+N++ DM E + M
Sbjct: 42 DDEMRRKMIFMRRIGKIQEHNLRHDLGLEGYTMGLNQFCDMEWEEVNRIM 91
>UniRef50_A7TZB4 Cluster: Putative cathepsin L; n=1; Lepeophtheirus
salmonis|Rep: Putative cathepsin L - Lepeophtheirus
salmonis (salmon louse)
Length = 257
Score = 65.3 bits (152), Expect = 2e-09
Identities = 28/53 (52%), Positives = 38/53 (71%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
V+W K+GAVT +KDQ CGSCW+ + EGQ+F ++ L+S SEQ L+DCS
Sbjct: 42 VNWTKNGAVTAVKDQKDCGSCWAFSTTGSVEGQYFIKNKKLLSFSEQQLVDCS 94
>UniRef50_Q22RR8 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 332
Score = 64.9 bits (151), Expect = 2e-09
Identities = 28/53 (52%), Positives = 35/53 (66%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
+DW K GAVT +KDQ +CGSCW+ + E F +G L SLSEQ L+DCS
Sbjct: 129 IDWTKKGAVTPVKDQEQCGSCWAFSATGALESATFISTGTLPSLSEQELVDCS 181
>UniRef50_O16454 Cluster: Temporarily assigned gene name protein
196; n=4; Bilateria|Rep: Temporarily assigned gene name
protein 196 - Caenorhabditis elegans
Length = 477
Score = 64.5 bits (150), Expect = 3e-09
Identities = 28/51 (54%), Positives = 34/51 (66%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
DWR+ GAVT +K+QG CGSCW+ + EG F LVSLSEQ L+DC
Sbjct: 269 DWREKGAVTQVKNQGNCGSCWAFSTTGNVEGAWFIAKNKLVSLSEQELVDC 319
>UniRef50_UPI00015B60C0 Cluster: PREDICTED: similar to homologue of
Sarcophaga 26,29kDa proteinase; n=1; Nasonia
vitripennis|Rep: PREDICTED: similar to homologue of
Sarcophaga 26,29kDa proteinase - Nasonia vitripennis
Length = 553
Score = 64.1 bits (149), Expect = 4e-09
Identities = 29/52 (55%), Positives = 34/52 (65%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
DWR +GAVT +KDQ CGSCWS EG +F + LV LS+Q LIDCS
Sbjct: 339 DWRLYGAVTPVKDQSVCGSCWSFGTTGAVEGAYFMKYKKLVRLSQQALIDCS 390
>UniRef50_Q9V3U6 Cluster: CG8947-PA; n=22; Eumetazoa|Rep: CG8947-PA
- Drosophila melanogaster (Fruit fly)
Length = 549
Score = 64.1 bits (149), Expect = 4e-09
Identities = 30/53 (56%), Positives = 35/53 (66%), Gaps = 1/53 (1%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHF-RQSGYLVSLSEQNLIDCS 668
DWR +GAVT +KDQ CGSCWS + EG F + G LV LS+Q LIDCS
Sbjct: 335 DWRLYGAVTPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDCS 387
>UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor;
n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
L-like proteinase precursor - Diabrotica virgifera
virgifera (western corn rootworm)
Length = 317
Score = 64.1 bits (149), Expect = 4e-09
Identities = 26/54 (48%), Positives = 36/54 (66%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
++DWR+ GAV ++DQ +CGSCW+ + EGQ F + G L LS Q L+DCS
Sbjct: 107 SIDWREKGAVNPVRDQEQCGSCWAFSAAGALEGQRFLKEGKLEVLSTQQLVDCS 160
Score = 42.7 bits (96), Expect = 0.010
Identities = 16/45 (35%), Positives = 30/45 (66%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEF 390
E+ R ++++++ I +HN +Y+ G VS+ LG+N++ DM EF
Sbjct: 32 EEQVRFQVFSQNLQKIEQHNARYQNGEVSFYLGVNQFADMTSEEF 76
>UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra
erinaceieuropaei|Rep: Cysteine proteinase - Spirometra
erinaceieuropaei (Tapeworm)
Length = 336
Score = 64.1 bits (149), Expect = 4e-09
Identities = 29/54 (53%), Positives = 39/54 (72%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
+V+WR+ GAVT +K+QG+CGSCWS + EG ++G L SLSEQ L+DCS
Sbjct: 124 SVNWRERGAVTSVKNQGQCGSCWSFSANGAIEGAIQIKTGALRSLSEQQLMDCS 177
Score = 33.1 bits (72), Expect = 8.3
Identities = 14/47 (29%), Positives = 25/47 (53%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVK 396
E+ R + + + I +HNQ+Y L SY + +N + D+ EF +
Sbjct: 48 EELHRKRAFFNNLDFIIRHNQRYYQQLESYAVRLNDFSDLTPGEFAE 94
>UniRef50_A7RWG6 Cluster: Predicted protein; n=3; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 513
Score = 64.1 bits (149), Expect = 4e-09
Identities = 27/53 (50%), Positives = 36/53 (67%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
VDWRK GAV +K QG CGSC++ A EG HF ++G + LSEQ ++DC+
Sbjct: 300 VDWRKAGAVNSVKSQGICGSCYAFAVAGALEGAHFIKTGLKLDLSEQQIVDCT 352
>UniRef50_P82473 Cluster: Cysteine proteinase GP-I; n=27;
Eukaryota|Rep: Cysteine proteinase GP-I - Zingiber
officinale (Ginger)
Length = 221
Score = 64.1 bits (149), Expect = 4e-09
Identities = 27/54 (50%), Positives = 38/54 (70%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
++DWR+ GAV +K+QG CGSCW+ + EG + +G L+SLSEQ L+DCS
Sbjct: 6 SIDWREKGAVVPVKNQGGCGSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCS 59
>UniRef50_Q6U8A7 Cluster: Cysteine protease; n=11;
Trichomonadidae|Rep: Cysteine protease - Tritrichomonas
foetus (Trichomonas foetus)
Length = 315
Score = 63.7 bits (148), Expect = 5e-09
Identities = 25/53 (47%), Positives = 36/53 (67%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
++DWR+ G V +IKDQ CGSCW+ + ++ E + +G L S SEQNL+DC
Sbjct: 103 SIDWREKGVVNEIKDQAACGSCWAFSAIQAAESAYAISTGTLESYSEQNLVDC 155
>UniRef50_O61165 Cluster: Cysteine protease; n=2; Babesia equi|Rep:
Cysteine protease - Babesia equi
Length = 438
Score = 63.7 bits (148), Expect = 5e-09
Identities = 31/89 (34%), Positives = 44/89 (49%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEHXXXXX 689
+DWRK VT +KDQG CGSCW+ A + E + + G + LSEQ L++C E+
Sbjct: 228 LDWRKLNGVTPVKDQGNCGSCWAFAAVGSVESLYLIKKGQALDLSEQELVNCEENSNGCE 287
Query: 690 XXXXXXXXLQVHQGQRGDRHRADLPYEGS 776
+ +G H DLPY +
Sbjct: 288 GDLPNKALEYIK--AKGISHSKDLPYHAA 314
>UniRef50_P25779 Cluster: Cruzipain precursor; n=54;
Trypanosoma|Rep: Cruzipain precursor - Trypanosoma cruzi
Length = 467
Score = 63.7 bits (148), Expect = 5e-09
Identities = 29/58 (50%), Positives = 37/58 (63%)
Frame = +3
Query: 498 AAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
A AVDWR GAVT +KDQG+CGSCW+ + + E Q F L +LSEQ L+ C +
Sbjct: 123 APAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDK 180
>UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1;
Phytophthora infestans|Rep: Cathepsin-like cysteine
protease - Phytophthora infestans (Potato late blight
fungus)
Length = 376
Score = 63.3 bits (147), Expect = 7e-09
Identities = 25/52 (48%), Positives = 36/52 (69%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
DWR+H VT +K+QG+CGSCW+ + + E + +G L SLSEQ L+DC+
Sbjct: 138 DWREHSTVTPVKNQGQCGSCWAFSAVAAMECAYALSTGTLESLSEQELVDCT 189
Score = 35.5 bits (78), Expect = 1.6
Identities = 23/83 (27%), Positives = 38/83 (45%), Gaps = 1/83 (1%)
Frame = +1
Query: 268 RMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYM 447
R + +A + I HN+ YE G S+ LG+N D+ E+ + ++ + +K
Sbjct: 64 RFRSFATNLERIQTHNEAYERGEHSFTLGLNDLADLADAEYKQLLSYRTRDSK------- 116
Query: 448 KGGSVRGAKFISPANVK-LPERW 513
S F+ P NV+ LP W
Sbjct: 117 --SSSASETFVKPENVEDLPATW 137
>UniRef50_Q9N6S8 Cluster: Falcipain 2; n=8; Plasmodium
falciparum|Rep: Falcipain 2 - Plasmodium falciparum
Length = 484
Score = 63.3 bits (147), Expect = 7e-09
Identities = 26/54 (48%), Positives = 35/54 (64%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
A DWR H VT +KDQ CGSCW+ + + E Q+ + L++LSEQ L+DCS
Sbjct: 264 AYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCS 317
>UniRef50_Q22RC9 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 389
Score = 63.3 bits (147), Expect = 7e-09
Identities = 27/58 (46%), Positives = 38/58 (65%)
Frame = +3
Query: 492 REAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
++A + DWR HGAVT +K+QG G+CW+ + EGQ F LVSLSE+ ++DC
Sbjct: 123 QDAPTSYDWRDHGAVTPVKNQGTVGTCWTFSTTGNIEGQWFLAGNPLVSLSEEQIVDC 180
>UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1];
n=11; Eutheria|Rep: Testin-2 precursor [Contains:
Testin-1] - Mus musculus (Mouse)
Length = 333
Score = 63.3 bits (147), Expect = 7e-09
Identities = 28/52 (53%), Positives = 36/52 (69%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
VDWR G VT +K+QG C S W+ + EGQ F+++G LV LSEQNL+DC
Sbjct: 118 VDWRMLGYVTPVKNQGYCASSWAFSATGSLEGQMFKKTGRLVPLSEQNLLDC 169
Score = 42.3 bits (95), Expect = 0.014
Identities = 18/54 (33%), Positives = 31/54 (57%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNK 417
E+ R ++ ++ +I HN +Y G + + MN +GD+ + EFVK M GF +
Sbjct: 44 EERLRRAVWEKNFKMIELHNWEYLEGKHDFTMTMNAFGDLTNTEFVKMMTGFRR 97
>UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9;
Onchocercidae|Rep: Cathepsin L-like precursor - Brugia
pahangi (Filarial nematode worm)
Length = 395
Score = 63.3 bits (147), Expect = 7e-09
Identities = 25/55 (45%), Positives = 38/55 (69%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
VDWR GAVT +++QG+CGSC++ A E H + +G L+ LS QN++DC+ +
Sbjct: 186 VDWRTKGAVTPVRNQGECGSCYAFATAAALEAYHKQMTGRLLDLSPQNIVDCTRN 240
Score = 44.8 bits (101), Expect = 0.003
Identities = 19/46 (41%), Positives = 29/46 (63%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFV 393
E+NFRM I+ ++ + + N+KYE GLVSY +N D+ EF+
Sbjct: 106 ENNFRMAIFESNELMTERINKKYEQGLVSYTTALNDLADLTDEEFM 151
>UniRef50_UPI0000E4978C Cluster: PREDICTED: similar to cathepsin l;
n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
similar to cathepsin l - Strongylocentrotus purpuratus
Length = 489
Score = 62.9 bits (146), Expect = 9e-09
Identities = 28/53 (52%), Positives = 34/53 (64%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
+DW GAV+ +KDQ CGSCWS E EG F QSG V LS+Q L+DC+
Sbjct: 271 IDWNVLGAVSPVKDQAVCGSCWSFGSAETIEGAVFMQSGKRVRLSQQMLMDCT 323
>UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain;
n=16; Chrysomelidae|Rep: Digestive cysteine protease
intestain - Leptinotarsa decemlineata (Colorado potato
beetle)
Length = 326
Score = 62.9 bits (146), Expect = 9e-09
Identities = 26/58 (44%), Positives = 37/58 (63%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
E ++DW + GAV ++KDQ CGSCW+ + EGQ+ + +SLSEQ L+DCS
Sbjct: 109 EVPDSIDWTEKGAVLEVKDQNPCGSCWAFSATGALEGQNAILNNVKISLSEQQLLDCS 166
Score = 39.1 bits (87), Expect = 0.13
Identities = 16/51 (31%), Positives = 28/51 (54%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNG 408
E+ R I+ + I +HN +Y+ G +Y LG+ ++ D+ H EF + G
Sbjct: 39 EEKTRFGIFQRNLIKIKEHNARYDKGEETYLLGVTRFADLTHEEFKDILKG 89
>UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like
cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan
CA, family C1, cathepsin L-like cysteine peptidase -
Trichomonas vaginalis G3
Length = 306
Score = 62.9 bits (146), Expect = 9e-09
Identities = 25/52 (48%), Positives = 35/52 (67%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
+DWR+ G V IK+QG CGSCW+ + +++ E Q + L LSEQNL+DC
Sbjct: 92 IDWREQGIVNKIKNQGACGSCWAFSAIQVIESQVAKNQKQLYDLSEQNLLDC 143
>UniRef50_Q9ZQH7 Cluster: Cysteine proteinase; n=7; Arabidopsis
thaliana|Rep: Cysteine proteinase - Arabidopsis thaliana
(Mouse-ear cress)
Length = 348
Score = 62.5 bits (145), Expect = 1e-08
Identities = 27/53 (50%), Positives = 36/53 (67%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
++DWR+ GAVT +K QG+CG CW+ + + EG G LVSLSEQ L+DC
Sbjct: 131 SMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQLLDC 183
>UniRef50_Q7RRE8 Cluster: Berghepain-2; n=10; Plasmodium
(Vinckeia)|Rep: Berghepain-2 - Plasmodium yoelii yoelii
Length = 472
Score = 62.5 bits (145), Expect = 1e-08
Identities = 26/54 (48%), Positives = 35/54 (64%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
DWR H A+ DIKDQ KC SCW+ A + Q+ + VSLSEQ L+DC+++
Sbjct: 255 DWRDHNAIIDIKDQQKCASCWAFATAGVVAAQYAIRKNQKVSLSEQQLVDCAQN 308
>UniRef50_Q23H06 Cluster: Papain family cysteine protease containing
protein; n=18; Tetrahymena thermophila|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 349
Score = 62.5 bits (145), Expect = 1e-08
Identities = 26/55 (47%), Positives = 36/55 (65%)
Frame = +3
Query: 501 AGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
A ++DWR GAVT +K QG CG+CW+ + + E +F Q+ LV SEQ L+DC
Sbjct: 142 ATSIDWRSRGAVTQVKWQGNCGACWAFSATGVMESFNFIQNKALVEFSEQQLLDC 196
>UniRef50_O96087 Cluster: Cathepsin L-like tick cysteine proteinase
A; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like
tick cysteine proteinase A - Haemaphysalis longicornis
(Bush tick)
Length = 312
Score = 62.5 bits (145), Expect = 1e-08
Identities = 26/54 (48%), Positives = 38/54 (70%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
VDW + G+ +K+QG+CGSCW+ + EGQHFR++ V+ EQNL+DCS+
Sbjct: 97 VDWAQEGSRAPVKNQGQCGSCWAFSTTGSLEGQHFRKTESRVT-GEQNLVDCSD 149
Score = 42.3 bits (95), Expect = 0.014
Identities = 48/186 (25%), Positives = 73/186 (39%), Gaps = 5/186 (2%)
Frame = +1
Query: 217 LQAAAPSQLRKRGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLG-MNKYGDMLHHEFV 393
LQ AA S ++ +KI+ E+ ++AKHN KY GL ++G GD +V
Sbjct: 4 LQIAAQSGVQFPRRRTIEVKIFTENTLLVAKHNAKYAKGLGVLQVGPWTSLGDFA-AAWV 62
Query: 394 KTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPERWT-GGSTAPSPTSRTKGS--- 561
+ ++ A +N G + ++ +++ W GS AP GS
Sbjct: 63 RQNGQWDTAASRTRN---SGPHLFHQANLNDSSLPTTVDWAQEGSRAPVKNQGQCGSCWA 119
Query: 562 VAHAGLQHDWSFGKDSTSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGG 741
+ G F K + V+ Q G F+YIK NGG
Sbjct: 120 FSTTGSLEGQHFRKTESRVT------GEQNLVDCSDDFGNQGCNGGLMDNGFQYIKANGG 173
Query: 742 IDTEQT 759
IDTE+T
Sbjct: 174 IDTEET 179
>UniRef50_A2EZN7 Cluster: Clan CA, family C1, cathepsin L-like
cysteine peptidase; n=17; Trichomonas vaginalis|Rep:
Clan CA, family C1, cathepsin L-like cysteine peptidase
- Trichomonas vaginalis G3
Length = 318
Score = 62.5 bits (145), Expect = 1e-08
Identities = 26/53 (49%), Positives = 36/53 (67%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
AVDWR V IKDQ +CGSCW+ + ++ E Q + G L+SL+EQN++DC
Sbjct: 103 AVDWRNAKIVNPIKDQAQCGSCWAFSVVQAQESQWALKKGQLLSLAEQNMVDC 155
>UniRef50_P35591 Cluster: Cysteine proteinase 1 precursor; n=14;
Leishmania|Rep: Cysteine proteinase 1 precursor -
Leishmania pifanoi
Length = 354
Score = 62.1 bits (144), Expect = 2e-08
Identities = 28/53 (52%), Positives = 35/53 (66%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
+VDWR GAVT +K+QG CGSCW+ + + EGQ LVSLSEQ L+ C
Sbjct: 132 SVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQMLVSC 184
>UniRef50_UPI000155637A Cluster: PREDICTED: similar to
ENSANGP00000013730, partial; n=1; Ornithorhynchus
anatinus|Rep: PREDICTED: similar to ENSANGP00000013730,
partial - Ornithorhynchus anatinus
Length = 229
Score = 61.7 bits (143), Expect = 2e-08
Identities = 30/55 (54%), Positives = 37/55 (67%), Gaps = 1/55 (1%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHF-RQSGYLVSLSEQNLIDCS 668
++DWR +GAVT +KDQ CGSCWS A EG F + + LV LS+Q LIDCS
Sbjct: 58 SLDWRLYGAVTPVKDQAVCGSCWSFATTGTLEGALFLKVTVQLVPLSQQMLIDCS 112
>UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin K
precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2);
n=2; Tribolium castaneum|Rep: PREDICTED: similar to
Cathepsin K precursor (Cathepsin O) (Cathepsin X)
(Cathepsin O2) - Tribolium castaneum
Length = 332
Score = 61.7 bits (143), Expect = 2e-08
Identities = 25/58 (43%), Positives = 38/58 (65%)
Frame = +3
Query: 492 REAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
R + ++DWR+ G VT +K+QG+CGSCW+ A + E + + +SLSEQ L+DC
Sbjct: 116 RGISASLDWRQRGGVTPVKNQGQCGSCWAFATIGAIESHYKIRHKRAISLSEQQLVDC 173
Score = 40.7 bits (91), Expect = 0.041
Identities = 13/44 (29%), Positives = 28/44 (63%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 387
E+ FR ++ ++ I+ +HN+++ G +Y++G+NK+ D E
Sbjct: 43 EETFRKSLFTKNLEIVEEHNERFRNGSETYEMGVNKFSDFTDEE 86
>UniRef50_Q9ASM1 Cluster: Putative cysteine protease; n=6; Oryza
sativa|Rep: Putative cysteine protease - Oryza sativa
subsp. japonica (Rice)
Length = 357
Score = 61.7 bits (143), Expect = 2e-08
Identities = 27/52 (51%), Positives = 34/52 (65%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
+DWR GAVT +KDQG CGS W+ A + EG ++G L LSEQ L+DC
Sbjct: 137 IDWRFKGAVTGVKDQGACGSSWAFAAVAAMEGLMKIRTGQLTPLSEQELVDC 188
>UniRef50_Q0IZL5 Cluster: Os09g0562700 protein; n=2; Oryza sativa
(japonica cultivar-group)|Rep: Os09g0562700 protein -
Oryza sativa subsp. japonica (Rice)
Length = 235
Score = 61.7 bits (143), Expect = 2e-08
Identities = 26/46 (56%), Positives = 35/46 (76%)
Frame = +3
Query: 528 GAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
GAVT++KDQG+CGSCW+ + + + EG + G LVSLSEQ L+DC
Sbjct: 19 GAVTEVKDQGRCGSCWAFSTVAVVEGIQKIKKGKLVSLSEQELVDC 64
>UniRef50_A5HII8 Cluster: Actinidin Act3a; n=5; Actinidia|Rep:
Actinidin Act3a - Actinidia eriantha
Length = 380
Score = 61.7 bits (143), Expect = 2e-08
Identities = 27/53 (50%), Positives = 36/53 (67%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
VDWR GAV D+K+QG C SCW+ A + E + +G L+SLSEQ L+DC+
Sbjct: 130 VDWRTTGAVVDVKNQGLCSSCWAFATIATVESINQIITGDLISLSEQELVDCN 182
Score = 37.9 bits (84), Expect = 0.29
Identities = 21/66 (31%), Positives = 33/66 (50%), Gaps = 1/66 (1%)
Frame = +1
Query: 253 GEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHN 432
GE R++I+ E+ I +HN SY +G+N++ D+ E+ T GF + K
Sbjct: 57 GEREMRIEIFKENLRFIDEHNADPNR---SYTVGLNQFADLTDEEYRSTYLGFKSSLKSK 113
Query: 433 -KNLYM 447
N YM
Sbjct: 114 VSNRYM 119
>UniRef50_A7SM85 Cluster: Predicted protein; n=2; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 514
Score = 61.7 bits (143), Expect = 2e-08
Identities = 26/64 (40%), Positives = 41/64 (64%)
Frame = +3
Query: 477 HIAGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNL 656
H+ + + +DWR +GAV+ ++ QG CGSC++ A + EG +F ++G L LS Q +
Sbjct: 296 HVLQRVDVPDELDWRDYGAVSPVRGQGICGSCYALAAVGAVEGAYFMKTGKLKELSAQQV 355
Query: 657 IDCS 668
IDCS
Sbjct: 356 IDCS 359
>UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep:
Cathepsin L - Kudoa thyrsites
Length = 300
Score = 61.7 bits (143), Expect = 2e-08
Identities = 26/54 (48%), Positives = 36/54 (66%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
+VDW+ G VT +K+QG CGSCWS + E + ++G LV+ SEQ L+DCS
Sbjct: 105 SVDWKALGKVTSVKNQGHCGSCWSFSAAGAIESAYAIKTGELVNFSEQQLVDCS 158
>UniRef50_Q3ZD77 Cluster: Cysteine protease; n=2; Oomycetes|Rep:
Cysteine protease - Saprolegnia parasitica
Length = 523
Score = 61.3 bits (142), Expect = 3e-08
Identities = 26/55 (47%), Positives = 35/55 (63%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
+DW + G VT +K+QG CGSCW+ + EG F S LVS+SEQ L+DC +
Sbjct: 120 MDWVEQGGVTPVKNQGMCGSCWAFSTTGAIEGAAFVSSKQLVSVSEQELVDCDHN 174
>UniRef50_A3C1K2 Cluster: Putative uncharacterized protein; n=1;
Oryza sativa (japonica cultivar-group)|Rep: Putative
uncharacterized protein - Oryza sativa subsp. japonica
(Rice)
Length = 326
Score = 61.3 bits (142), Expect = 3e-08
Identities = 27/52 (51%), Positives = 37/52 (71%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQ 650
+A A DWR+HGAVT +KDQG CGSCW+ + +E EG + +G ++LSEQ
Sbjct: 116 DAPPAWDWREHGAVTRVKDQGPCGSCWAFSVVEAVEGINEIMTGNFLTLSEQ 167
Score = 37.1 bits (82), Expect = 0.51
Identities = 21/63 (33%), Positives = 31/63 (49%)
Frame = +1
Query: 226 AAPSQLRKRGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMN 405
AA S R + R +++ ++ I N+K M SYKLG+NK+ D+ EF
Sbjct: 35 AASSSPRDLADKGSRFEVFKKNARYIHDFNRKKGM---SYKLGLNKFADLTLEEFTAKYT 91
Query: 406 GFN 414
G N
Sbjct: 92 GAN 94
>UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain;
n=9; Cucujiformia|Rep: Digestive cysteine proteinase
intestain - Leptinotarsa decemlineata (Colorado potato
beetle)
Length = 326
Score = 61.3 bits (142), Expect = 3e-08
Identities = 26/59 (44%), Positives = 37/59 (62%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
E ++DW + GAV D+K QG CGSCW+ + EGQ+ + + LSEQ L+DCS+
Sbjct: 109 EVPDSIDWTQKGAVLDVKYQGGCGSCWAFSATGALEGQNAIVNNVKIPLSEQQLLDCSK 167
Score = 39.1 bits (87), Expect = 0.13
Identities = 17/45 (37%), Positives = 25/45 (55%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEF 390
E+ R I+ + I +HN KY+ G SY LG+ + D+ H EF
Sbjct: 39 EERTRFGIFQSNLRKIEEHNAKYDKGEESYFLGVTPFADLTHDEF 83
>UniRef50_Q54UH2 Cluster: Putative uncharacterized protein; n=3;
Dictyostelium discoideum AX4|Rep: Putative
uncharacterized protein - Dictyostelium discoideum AX4
Length = 664
Score = 61.3 bits (142), Expect = 3e-08
Identities = 22/54 (40%), Positives = 39/54 (72%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
++DWR G V+ +K+QG CGSC++ + + E ++R++ ++ LSEQNL+DC+
Sbjct: 473 SIDWRTWGMVSKVKNQGSCGSCYAFSTVGALESHYYRKNNRMLDLSEQNLVDCT 526
>UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel
protein - Danio rerio (Zebrafish) (Brachydanio rerio)
Length = 328
Score = 60.9 bits (141), Expect = 4e-08
Identities = 25/53 (47%), Positives = 37/53 (69%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
V+W +HG V+ +++QG CGSCW+ + + E Q R++ LV LS QNL+DCS
Sbjct: 117 VNWTEHGMVSPVQNQGPCGSCWAFSAVGSLEAQMKRRTAALVPLSAQNLLDCS 169
Score = 38.3 bits (85), Expect = 0.22
Identities = 20/55 (36%), Positives = 29/55 (52%)
Frame = +1
Query: 244 RKRGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNG 408
R E+ R ++ ++ I HN+ +GL SY LG+N+ DM E V MNG
Sbjct: 39 RNTREERLRRSVWKQNLQDILLHNEAAAVGLHSYTLGLNQLSDMTADE-VNDMNG 92
>UniRef50_Q8S333 Cluster: Cysteine protease; n=4; Lycopersicon|Rep:
Cysteine protease - Solanum lycopersicum (Tomato)
(Lycopersicon esculentum)
Length = 345
Score = 60.9 bits (141), Expect = 4e-08
Identities = 24/53 (45%), Positives = 36/53 (67%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
+DWR+ GAVT +K QG+CG CW+ + + EG + +G L+ SEQ L+DC+
Sbjct: 135 LDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 187
Score = 35.5 bits (78), Expect = 1.6
Identities = 19/53 (35%), Positives = 27/53 (50%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFN 414
E R I+ E+ I N+ G +SYKLGMN++ D+ EF+ G N
Sbjct: 55 EKGERFMIFKENMKFIESVNKA---GNLSYKLGMNEFADITSQEFLAKFTGLN 104
>UniRef50_A2Y5I0 Cluster: Putative uncharacterized protein; n=2;
Oryza sativa|Rep: Putative uncharacterized protein -
Oryza sativa subsp. indica (Rice)
Length = 149
Score = 60.9 bits (141), Expect = 4e-08
Identities = 26/55 (47%), Positives = 38/55 (69%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
++DWRK GAV ++K Q CGSCW+ + + EG ++G LVSLS+Q L+DC +
Sbjct: 20 SIDWRKKGAVVEVKYQEDCGSCWAFSAVAAIEG--INKNGELVSLSKQELVDCDD 72
>UniRef50_Q7K0S6 Cluster: LD36817p; n=1; Drosophila
melanogaster|Rep: LD36817p - Drosophila melanogaster
(Fruit fly)
Length = 352
Score = 60.9 bits (141), Expect = 4e-08
Identities = 28/54 (51%), Positives = 36/54 (66%), Gaps = 1/54 (1%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGK-CGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
DWR+ G VT QG CG+CWS A EG FR++G L SLS+QNL+DC++
Sbjct: 135 DWREKGGVTPPGFQGVGCGACWSFATTGALEGHLFRRTGVLASLSQQNLVDCAD 188
>UniRef50_Q6E7B2 Cluster: Cathepsin F-like cysteine proteinase; n=1;
Brugia malayi|Rep: Cathepsin F-like cysteine proteinase
- Brugia malayi (Filarial nematode worm)
Length = 461
Score = 60.9 bits (141), Expect = 4e-08
Identities = 27/51 (52%), Positives = 33/51 (64%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
DWR G VT +KDQG CGSCW+ + E ++G L+SLSEQ LIDC
Sbjct: 253 DWRTEGVVTPVKDQGSCGSCWAFSVTGNIESLWAIKTGKLISLSEQELIDC 303
>UniRef50_Q9LXW3 Cluster: Probable cysteine proteinase At3g43960
precursor; n=2; Arabidopsis thaliana|Rep: Probable
cysteine proteinase At3g43960 precursor - Arabidopsis
thaliana (Mouse-ear cress)
Length = 376
Score = 60.9 bits (141), Expect = 4e-08
Identities = 30/53 (56%), Positives = 36/53 (67%), Gaps = 1/53 (1%)
Frame = +3
Query: 510 VDWRKHGAVTD-IKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
VDWR+ GAV +K QG+CGSCW+ A EG + +G LVSLSEQ LIDC
Sbjct: 131 VDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDC 183
>UniRef50_Q8V5U0 Cluster: Viral cathepsin; n=4;
Nucleopolyhedrovirus|Rep: Viral cathepsin - Heliothis
zea nuclear polyhedrosis virus (HzSNPV) (Helicoverpa
zea single nucleocapsid nuclear polyhedrosis virus)
Length = 367
Score = 60.9 bits (141), Expect = 4e-08
Identities = 34/88 (38%), Positives = 41/88 (46%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEHXXXXXX 692
DWR VT IKDQG CGSCW+ + E Q+ + L+ LSEQ L+DC E
Sbjct: 161 DWRDTNKVTPIKDQGVCGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCDE-VDLGCN 219
Query: 693 XXXXXXXLQVHQGQRGDRHRADLPYEGS 776
Q G AD PY+GS
Sbjct: 220 GGLMHLAFQELLLMGGVETEADYPYQGS 247
>UniRef50_Q8I9S0 Cluster: Cathepsin L-like protease; n=1; Sarcoptes
scabiei type hominis|Rep: Cathepsin L-like protease -
Sarcoptes scabiei type hominis
Length = 245
Score = 60.5 bits (140), Expect = 5e-08
Identities = 26/53 (49%), Positives = 34/53 (64%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
VDW V IKDQ +CGSCW+ + + E Q+ ++G LV LSEQ L+DCS
Sbjct: 124 VDWTLKNVVAPIKDQKQCGSCWAFSAVASMESQNALKTGQLVELSEQELVDCS 176
Score = 46.0 bits (104), Expect = 0.001
Identities = 21/56 (37%), Positives = 34/56 (60%)
Frame = +1
Query: 238 QLRKRGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMN 405
Q R ++ R I+ + I KHN+KYE GL +Y+LG+N++ D+ + E+ MN
Sbjct: 43 QFRTVYDELLRKLIFQRNYIYIRKHNEKYEAGLSTYELGVNQFTDLTNKEYNDQMN 98
>UniRef50_Q20C76 Cluster: Dvir_CG5367; n=3; Endopterygota|Rep:
Dvir_CG5367 - Drosophila virilis (Fruit fly)
Length = 298
Score = 60.5 bits (140), Expect = 5e-08
Identities = 24/52 (46%), Positives = 38/52 (73%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
DWRK G +T + +Q CGSC++ + + EGQ F+++G +V+LSEQ ++DCS
Sbjct: 92 DWRKKGFITPLYNQQSCGSCYAFSIAQSIEGQVFKRTGKIVALSEQQIVDCS 143
>UniRef50_Q91BH1 Cluster: Viral cathepsin; n=2;
Nucleopolyhedrovirus|Rep: Viral cathepsin - Spodoptera
litura multicapsid nucleopolyhedrovirus (SpltMNPV)
Length = 337
Score = 60.5 bits (140), Expect = 5e-08
Identities = 31/87 (35%), Positives = 42/87 (48%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEHXXXXXX 692
DWRK VT +K+QG CGSCW+ A + E Q+ L+ LSEQ L+DC
Sbjct: 131 DWRKLNKVTKVKEQGVCGSCWAFAAIGNIESQYAIMHDSLIDLSEQQLLDCDRVDQGCDG 190
Query: 693 XXXXXXXLQVHQGQRGDRHRADLPYEG 773
++ + G H D PY+G
Sbjct: 191 GLMHLAFQEIIR-IGGVEHEIDYPYQG 216
>UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Rep:
Cathepsin R precursor - Mus musculus (Mouse)
Length = 334
Score = 60.5 bits (140), Expect = 5e-08
Identities = 36/95 (37%), Positives = 45/95 (47%), Gaps = 3/95 (3%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE---HXX 680
VDWRK G VT ++ QG C +CW+ A E Q Q+G L LS QNL+DCS+ +
Sbjct: 119 VDWRKKGYVTPVRRQGDCDACWAFAVTGAIEAQAIWQTGKLTPLSVQNLVDCSKPQGNNG 178
Query: 681 XXXXXXXXXXXLQVHQGQRGDRHRADLPYEGS*RP 785
+H G G A PYEG P
Sbjct: 179 CLGGDTYNAFQYVLHNG--GLESEATYPYEGKDGP 211
Score = 34.7 bits (76), Expect = 2.7
Identities = 42/179 (23%), Positives = 68/179 (37%), Gaps = 4/179 (2%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNK-TAKHN 432
E+ + ++ E +I HN++ +G + + MN++GD EF K M + T +
Sbjct: 44 EEKLKRVVWEEKLKMIKLHNRENSLGKNGFTMKMNEFGDQTDEEFRKMMIEISVWTHREG 103
Query: 433 KNLYMKGGSVRGAKFIS--PANVKLPERWTGGSTAPSPTSRTKGSVAHAGLQHDWSFGK- 603
K++ + KF+ P R G A + T G++ Q W GK
Sbjct: 104 KSIMKREAGSILPKFVDWRKKGYVTPVRRQGDCDACWAFAVT-GAIE---AQAIWQTGKL 159
Query: 604 DSTSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGGIDTEQTYLTRGVD 780
SV C G F+Y+ NGG+++E TY G D
Sbjct: 160 TPLSVQNLVDCSKPQGNNGCLG---------GDTYNAFQYVLHNGGLESEATYPYEGKD 209
>UniRef50_UPI0001556527 Cluster: PREDICTED: similar to Cathepsin W,
partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED:
similar to Cathepsin W, partial - Ornithorhynchus
anatinus
Length = 229
Score = 60.1 bits (139), Expect = 6e-08
Identities = 26/52 (50%), Positives = 36/52 (69%), Gaps = 1/52 (1%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSG-YLVSLSEQNLIDC 665
DWRK GA+T +K+QG CGSCW+ A + E + ++G LVSLS Q ++DC
Sbjct: 73 DWRKRGAITSVKNQGSCGSCWAFAAVGNAESMWYLRAGKRLVSLSVQEVLDC 124
>UniRef50_Q9NAW4 Cluster: Cysteine protease falcipain-3; n=13;
Plasmodium|Rep: Cysteine protease falcipain-3 -
Plasmodium falciparum
Length = 492
Score = 60.1 bits (139), Expect = 6e-08
Identities = 26/54 (48%), Positives = 33/54 (61%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
A DWR HG VT +KDQ CGSCW+ + + E Q+ + L SEQ L+DCS
Sbjct: 272 AYDWRLHGGVTPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVDCS 325
>UniRef50_Q2I8Y2 Cluster: Secreted cathepsin F; n=1; Teladorsagia
circumcincta|Rep: Secreted cathepsin F - Teladorsagia
circumcincta
Length = 364
Score = 60.1 bits (139), Expect = 6e-08
Identities = 26/51 (50%), Positives = 33/51 (64%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
DWR+HGAVT +K +G C +CW+ + EGQ F LVSLS Q L+DC
Sbjct: 158 DWREHGAVTKVKTEGHCAACWAFSVTGNIEGQWFLAKKKLVSLSAQQLLDC 208
>UniRef50_Q24FN2 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 367
Score = 60.1 bits (139), Expect = 6e-08
Identities = 23/54 (42%), Positives = 37/54 (68%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
++DWR+ GAV+ +K+QG CGSCW+ + + L E + ++ L SEQ L+DC+
Sbjct: 158 SIDWRQSGAVSPVKNQGSCGSCWAFSAVALAESVNLLRNNSLALYSEQELVDCT 211
>UniRef50_Q22A69 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 330
Score = 60.1 bits (139), Expect = 6e-08
Identities = 28/57 (49%), Positives = 35/57 (61%), Gaps = 1/57 (1%)
Frame = +3
Query: 498 AAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQ-SGYLVSLSEQNLIDC 665
A A+DW GAVT +K+QG CGSCW+ + EGQ+ Q L S SEQ L+DC
Sbjct: 112 APTAIDWTTKGAVTPVKNQGSCGSCWAFSTTGSIEGQYVLQLKQNLTSFSEQQLVDC 168
>UniRef50_Q9YMP9 Cluster: Viral cathepsin; n=30;
Nucleopolyhedrovirus|Rep: Viral cathepsin - Lymantria
dispar multicapsid nuclear polyhedrosis virus (LdMNPV)
Length = 356
Score = 60.1 bits (139), Expect = 6e-08
Identities = 26/51 (50%), Positives = 32/51 (62%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
DWR+ VT IK+QG CG+CW+ A L E Q + L+ LSEQ LIDC
Sbjct: 149 DWREQNKVTSIKNQGACGACWAFATLASVESQFAMRHNRLIDLSEQQLIDC 199
>UniRef50_UPI00006CCD5A Cluster: Papain family cysteine protease
containing protein; n=2; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 332
Score = 59.7 bits (138), Expect = 8e-08
Identities = 25/56 (44%), Positives = 38/56 (67%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
+VDWRK GAV+ ++DQG CGSC++ A EG + ++G L S Q ++DC++H
Sbjct: 130 SVDWRKLGAVSPVRDQGNCGSCYAFASTGALEGLYQIKTGKLEVFSPQYIVDCAKH 185
>UniRef50_Q5FW00 Cluster: MGC107932 protein; n=6; Xenopus|Rep:
MGC107932 protein - Xenopus tropicalis (Western clawed
frog) (Silurana tropicalis)
Length = 333
Score = 59.7 bits (138), Expect = 8e-08
Identities = 26/55 (47%), Positives = 38/55 (69%), Gaps = 1/55 (1%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGK-CGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
VDWRK VT +K+QG CGSCW+ A + + E ++ ++ L++LSEQ L+DC E
Sbjct: 119 VDWRKSNCVTPVKNQGTFCGSCWAFATVGVMESRYCIRTKELLNLSEQQLVDCDE 173
Score = 33.9 bits (74), Expect = 4.8
Identities = 12/29 (41%), Positives = 20/29 (68%)
Frame = +1
Query: 301 IAKHNQKYEMGLVSYKLGMNKYGDMLHHE 387
+ KHNQ + GL SY++ MN++ D+ +E
Sbjct: 58 VQKHNQLADQGLKSYRMAMNQFADLTDNE 86
>UniRef50_Q00XY3 Cluster: Cysteine protease-1; n=1; Ostreococcus
tauri|Rep: Cysteine protease-1 - Ostreococcus tauri
Length = 430
Score = 59.7 bits (138), Expect = 8e-08
Identities = 27/55 (49%), Positives = 38/55 (69%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
A+DW + GAVT K+QG+CGSCW+ + EG ++G LVSLSEQ ++ CS+
Sbjct: 204 AIDWVELGAVTPPKNQGQCGSCWAFSTTGAVEGITKIRTGRLVSLSEQEMVSCSK 258
>UniRef50_Q26986 Cluster: TFCP2 protein; n=1; Tritrichomonas
foetus|Rep: TFCP2 protein - Tritrichomonas foetus
(Trichomonas foetus)
Length = 270
Score = 59.7 bits (138), Expect = 8e-08
Identities = 24/58 (41%), Positives = 35/58 (60%)
Frame = +3
Query: 492 REAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
++ + DWR G V IK+QG CGSCW+ + + E H +G L+ SEQ+L+DC
Sbjct: 48 KDTPTSFDWRSEGKVNPIKNQGSCGSCWAFSAIAAQESCHAIATGELLRFSEQSLVDC 105
>UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3;
Bilateria|Rep: Cathepsin L-like cysteine protease -
Neobenedenia melleni
Length = 335
Score = 59.7 bits (138), Expect = 8e-08
Identities = 25/53 (47%), Positives = 35/53 (66%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
+DW + G VT +K+Q +CGSCW+ + EG R +G L+S SEQ L+DCS
Sbjct: 122 IDWVRKGHVTAVKNQAQCGSCWAFSSTGSIEGAVKRATGKLISFSEQQLVDCS 174
Score = 33.5 bits (73), Expect = 6.3
Identities = 11/15 (73%), Positives = 15/15 (100%)
Frame = +2
Query: 671 AYGNNGCNGGLMDNA 715
A+GN+GCNGG+MDN+
Sbjct: 176 AFGNHGCNGGIMDNS 190
>UniRef50_Q7XR52 Cluster: Cysteine protease 1 precursor; n=5; Oryza
sativa|Rep: Cysteine protease 1 precursor - Oryza sativa
subsp. japonica (Rice)
Length = 490
Score = 59.7 bits (138), Expect = 8e-08
Identities = 27/57 (47%), Positives = 40/57 (70%), Gaps = 1/57 (1%)
Frame = +3
Query: 507 AVDWRKHGAVT-DIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
+VDWR GAV +K+QG+CGSCW+ + + EG + +G LVSLSEQ L++C+ +
Sbjct: 158 SVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARN 214
>UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor;
n=6; Chrysomelinae|Rep: Cathepsin L-like proteinase
precursor - Phaedon cochleariae (Mustard beetle)
Length = 324
Score = 59.7 bits (138), Expect = 8e-08
Identities = 31/92 (33%), Positives = 42/92 (45%)
Frame = +3
Query: 498 AAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEHX 677
A ++DWR G V +++QG+CGSCW+ + E Q +SG V LS Q L+DCS
Sbjct: 110 APESIDWRSKGVVLPVRNQGECGSCWALSTAAAIESQSAIKSGSKVPLSPQQLVDCSTSY 169
Query: 678 XXXXXXXXXXXXLQVHQGQRGDRHRADLPYEG 773
+ G AD PY G
Sbjct: 170 GNHGCNGGFAVNGFEYVKDNGLESDADYPYSG 201
Score = 40.7 bits (91), Expect = 0.041
Identities = 18/45 (40%), Positives = 26/45 (57%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEF 390
E+ R I+ + IA+HN KYE G +Y L +NK+ D+ EF
Sbjct: 39 EEKLRFNIFQDTLRQIAEHNVKYENGESTYYLAINKFSDITDEEF 83
>UniRef50_Q5IZD8 Cluster: Vivapain-4; n=1; Plasmodium vivax|Rep:
Vivapain-4 - Plasmodium vivax
Length = 484
Score = 59.3 bits (137), Expect = 1e-07
Identities = 24/53 (45%), Positives = 35/53 (66%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
DWR+H AV++IK+Q CGSCW+ + E Q+ + V +SEQ L+DCS+
Sbjct: 267 DWREHNAVSEIKNQNLCGSCWAFGAVGAVESQYAIRKNQHVLISEQELVDCSD 319
>UniRef50_Q235G6 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 325
Score = 59.3 bits (137), Expect = 1e-07
Identities = 26/53 (49%), Positives = 34/53 (64%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
+DW + GAVT +K+QG CG CWS A EG +F L +LS+Q LIDC+
Sbjct: 121 IDWVEKGAVTPVKNQGGCGGCWSFATTGGVEGANFVYKNVLPNLSQQQLIDCN 173
>UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18;
Magnoliophyta|Rep: Thiol protease aleurain precursor -
Arabidopsis thaliana (Mouse-ear cress)
Length = 358
Score = 59.3 bits (137), Expect = 1e-07
Identities = 24/52 (46%), Positives = 34/52 (65%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
DWR+ G V+ +KDQG CGSCW+ + E + + G +SLSEQ L+DC+
Sbjct: 146 DWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCA 197
Score = 44.0 bits (99), Expect = 0.004
Identities = 52/179 (29%), Positives = 76/179 (42%), Gaps = 3/179 (1%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGF--NKTAKH 429
E R I+ E+ +I N+K GL SYKLG+N++ D+ EF +T G N +A
Sbjct: 75 EMKLRFSIFKENLDLIRSTNKK---GL-SYKLGVNQFADLTWQEFQRTKLGAAQNCSATL 130
Query: 430 NKNLYMKGGSVRGAKFISPANVKLPERWTGGSTAPSPTSRTKGSVAHAGLQHDWSFGKD- 606
+ + ++ K + P + GG + T T G++ A Q +FGK
Sbjct: 131 KGSHKVTEAALPETKDWREDGIVSPVKDQGGCGS-CWTFSTTGALEAAYHQ---AFGKGI 186
Query: 607 STSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGGIDTEQTYLTRGVDD 783
S S C G Q F+YIK NGG+DTE+ Y G D+
Sbjct: 187 SLSEQQLVDCAGAFNNYGCNG---------GLPSQAFEYIKSNGGLDTEKAYPYTGKDE 236
>UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease
containing protein; n=1; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 360
Score = 58.8 bits (136), Expect = 1e-07
Identities = 24/51 (47%), Positives = 34/51 (66%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
DWR GA+T +K Q CG CW+ + ++ EG +F ++G L SLS Q +IDC
Sbjct: 136 DWRDKGAITPVKVQNGCGGCWAFSTVQSIEGLYFLKTGKLESLSTQQVIDC 186
>UniRef50_A5UW25 Cluster: Peptidase C1A, papain precursor; n=2;
Roseiflexus|Rep: Peptidase C1A, papain precursor -
Roseiflexus sp. RS-1
Length = 1202
Score = 58.8 bits (136), Expect = 1e-07
Identities = 37/108 (34%), Positives = 46/108 (42%), Gaps = 3/108 (2%)
Frame = +3
Query: 474 VHIAGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQN 653
V + Q A +W GA T +KDQG CGSCW+ A + E R G LSEQ
Sbjct: 161 VVMGAQEGLPAAFNWCDQGACTPVKDQGVCGSCWAFATTGVVESALKRIDGVERDLSEQY 220
Query: 654 LIDCSEH---XXXXXXXXXXXXXLQVHQGQRGDRHRADLPYEGS*RPI 788
LI H L HQ + G + +DLPY G P+
Sbjct: 221 LISAGTHGTCNGGGPAYDLFIGDLPAHQTEAGAVYESDLPYLGQDVPL 268
>UniRef50_Q76CZ3 Cluster: Cysteine protease; n=1; Triticum
aestivum|Rep: Cysteine protease - Triticum aestivum
(Wheat)
Length = 371
Score = 58.8 bits (136), Expect = 1e-07
Identities = 26/52 (50%), Positives = 30/52 (57%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
DWR+HG VT K QG CG CW+ A E + G LV LS Q L+DCS
Sbjct: 158 DWREHGVVTPAKQQGACGCCWAFAAAATVESLNKINGGELVDLSVQELVDCS 209
>UniRef50_Q26888 Cluster: Cathepsin L-like cysteine proteinase; n=1;
Toxocara canis|Rep: Cathepsin L-like cysteine proteinase
- Toxocara canis (Canine roundworm)
Length = 360
Score = 58.8 bits (136), Expect = 1e-07
Identities = 27/63 (42%), Positives = 37/63 (58%)
Frame = +3
Query: 480 IAGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLI 659
+A + E DWR + VT +K Q KCGSCW+ A + E + +G L SLSEQ L+
Sbjct: 139 LARREEIPDHFDWRPYNVVTPVKSQFKCGSCWAFATVGTVESAYALGTGELRSLSEQQLL 198
Query: 660 DCS 668
DC+
Sbjct: 199 DCN 201
>UniRef50_Q2QS15 Cluster: Papain family cysteine protease containing
protein; n=1; Oryza sativa (japonica
cultivar-group)|Rep: Papain family cysteine protease
containing protein - Oryza sativa subsp. japonica (Rice)
Length = 351
Score = 58.4 bits (135), Expect = 2e-07
Identities = 26/55 (47%), Positives = 36/55 (65%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
+VDWRK GAV ++K CGSCW+ + + EG ++G LVSL EQ L+DC +
Sbjct: 148 SVDWRKKGAVVEVKYHEDCGSCWAFSAVAAIEG--INKNGELVSLLEQELVDCDD 200
>UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba
culbertsoni|Rep: Cysteine proteinase - Acanthamoeba
culbertsoni
Length = 482
Score = 58.4 bits (135), Expect = 2e-07
Identities = 26/52 (50%), Positives = 32/52 (61%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
DWR GAVT +K+QG C SCW+ EG G LVSLS+Q L+DC+
Sbjct: 161 DWRTKGAVTPVKNQGSCASCWAFVATGAVEGVRKIAGGSLVSLSDQMLLDCA 212
>UniRef50_Q86GS8 Cluster: Cathepsin H; n=1; Sterkiella
histriomuscorum|Rep: Cathepsin H - Oxytricha trifallax
(Sterkiella histriomuscorum)
Length = 366
Score = 58.4 bits (135), Expect = 2e-07
Identities = 23/52 (44%), Positives = 34/52 (65%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
DWR G V+ +K+QGKCGSCW+ + + E + + G +LSEQ L+DC+
Sbjct: 140 DWRTFGVVSPVKNQGKCGSCWTFSTVGCVESHYLLKYGAFRNLSEQQLVDCA 191
>UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like
cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
Clan CA, family C1, cathepsin L-like cysteine peptidase
- Trichomonas vaginalis G3
Length = 234
Score = 58.4 bits (135), Expect = 2e-07
Identities = 26/52 (50%), Positives = 32/52 (61%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
+D+R GAV +IKDQ CGSCW+ E F + G L SLSEQ L+DC
Sbjct: 22 IDYRTKGAVNEIKDQKHCGSCWAFGSCAAMESSWFLKHGTLYSLSEQCLVDC 73
>UniRef50_UPI00015B5508 Cluster: PREDICTED: similar to CG5367-PA;
n=1; Nasonia vitripennis|Rep: PREDICTED: similar to
CG5367-PA - Nasonia vitripennis
Length = 362
Score = 58.0 bits (134), Expect = 3e-07
Identities = 27/59 (45%), Positives = 38/59 (64%)
Frame = +3
Query: 492 REAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
R ++DWR+ G VT ++Q CGSC++ + GQ FRQ+G +V LSEQ L+DCS
Sbjct: 149 RRIPKSLDWREKGFVTKPENQRDCGSCYAYSIAGSIAGQIFRQTGIVVPLSEQQLVDCS 207
>UniRef50_UPI0000519B9B Cluster: PREDICTED: similar to Cathepsin O
precursor; n=2; Apocrita|Rep: PREDICTED: similar to
Cathepsin O precursor - Apis mellifera
Length = 374
Score = 58.0 bits (134), Expect = 3e-07
Identities = 22/54 (40%), Positives = 36/54 (66%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
DWR G +T ++ QG CG+CW+ + +E+ E ++G L SLS Q +IDC+++
Sbjct: 160 DWRDKGVITPVRSQGSCGACWAFSTIEVIESMFAIKNGTLHSLSVQEMIDCAKN 213
>UniRef50_Q5VP99 Cluster: Putative cysteine proteinase; n=1; Oryza
sativa (japonica cultivar-group)|Rep: Putative cysteine
proteinase - Oryza sativa subsp. japonica (Rice)
Length = 357
Score = 58.0 bits (134), Expect = 3e-07
Identities = 25/53 (47%), Positives = 35/53 (66%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
++WR GAVT +K+Q C SCW+ + + EG H +S LV+LS Q L+DCS
Sbjct: 139 INWRDRGAVTQVKNQKDCASCWAFSAVAAVEGIHQIRSHNLVALSTQQLLDCS 191
>UniRef50_Q0JP65 Cluster: Os01g0240900 protein; n=3; Oryza
sativa|Rep: Os01g0240900 protein - Oryza sativa subsp.
japonica (Rice)
Length = 166
Score = 58.0 bits (134), Expect = 3e-07
Identities = 28/57 (49%), Positives = 36/57 (63%), Gaps = 3/57 (5%)
Frame = +3
Query: 504 GAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSG---YLVSLSEQNLIDC 665
GA WR GAVTD+K QG C SCW+ + EG +F SG L++LSEQ L++C
Sbjct: 100 GASIWRDRGAVTDVKMQGTCASCWAFSTTGAVEGDNFLASGNLRNLLNLSEQQLVNC 156
>UniRef50_Q23VA1 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 397
Score = 58.0 bits (134), Expect = 3e-07
Identities = 23/56 (41%), Positives = 35/56 (62%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
+VDWR G V+ +KDQG+CG CW+ + L E + ++ L SEQ L+DC+ +
Sbjct: 183 SVDWRIQGKVSPVKDQGRCGCCWAFSATALAESVNLMRNNTLQQYSEQELVDCTNN 238
>UniRef50_Q23TW3 Cluster: Papain family cysteine protease containing
protein; n=5; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 437
Score = 58.0 bits (134), Expect = 3e-07
Identities = 32/91 (35%), Positives = 46/91 (50%), Gaps = 3/91 (3%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGK-CGSCWSSARLELWEGQHFRQSGYL-VSLSEQNLIDCS-EHXX 680
VDWR+ G VT +K QGK CGSCW+ A + E + ++G + SEQ L+DC+ +
Sbjct: 209 VDWREKGVVTQVKSQGKDCGSCWAFAAVAALESHYALKTGKKPIQFSEQQLVDCARKFDT 268
Query: 681 XXXXXXXXXXXLQVHQGQRGDRHRADLPYEG 773
+ G ++ AD PYEG
Sbjct: 269 KGCSGGLPSKGFEYLAYAGGIQNEADYPYEG 299
>UniRef50_Q8RWQ9 Cluster: Thiol protease aleurain-like precursor;
n=17; Magnoliophyta|Rep: Thiol protease aleurain-like
precursor - Arabidopsis thaliana (Mouse-ear cress)
Length = 358
Score = 58.0 bits (134), Expect = 3e-07
Identities = 23/52 (44%), Positives = 34/52 (65%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
DWR+ G V+ +K+QG CGSCW+ + E + + G +SLSEQ L+DC+
Sbjct: 146 DWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCA 197
Score = 36.7 bits (81), Expect = 0.67
Identities = 49/178 (27%), Positives = 71/178 (39%), Gaps = 3/178 (1%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGF--NKTAKH 429
E R ++ E+ +I N+K GL SYKL +N++ D+ EF + G N +A
Sbjct: 75 EMKLRFSVFKENLDLIRSTNKK---GL-SYKLSLNQFADLTWQEFQRYKLGAAQNCSATL 130
Query: 430 NKNLYMKGGSVRGAKFISPANVKLPERWTGGSTAPSPTSRTKGSVAHAGLQHDWSFGKD- 606
+ + +V K + P + G T T G++ A Q +FGK
Sbjct: 131 KGSHKITEATVPDTKDWREDGIVSPVK-EQGHCGSCWTFSTTGALEAAYHQ---AFGKGI 186
Query: 607 STSVSPATWCXXXXXXXXXXXXXXEQRLQRGAHGQRFKYIKDNGGIDTEQTYLTRGVD 780
S S C G Q F+YIK NGG+DTE+ Y G D
Sbjct: 187 SLSEQQLVDCAGTFNNFGCHG---------GLPSQAFEYIKYNGGLDTEEAYPYTGKD 235
>UniRef50_Q23RT7 Cluster: Papain family cysteine protease containing
protein; n=7; Hymenostomatida|Rep: Papain family
cysteine protease containing protein - Tetrahymena
thermophila SB210
Length = 387
Score = 57.6 bits (133), Expect = 3e-07
Identities = 25/56 (44%), Positives = 34/56 (60%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
+VDWR G VT +KDQG CGSCW+ A + E +G L +LS Q L+ C ++
Sbjct: 136 SVDWRDAGVVTPVKDQGHCGSCWAFATTAVIESYAAIATGQLKTLSTQQLVSCVQN 191
Score = 34.7 bits (76), Expect = 2.7
Identities = 28/85 (32%), Positives = 39/85 (45%), Gaps = 1/85 (1%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
E N R +I+ + I N E G YK G+N++ D E +T G++KT K+
Sbjct: 57 EYNQRKRIFEQKLKEIKAFNSNSENG---YKKGINQFTDRTAEELRETTLGYSKTVKNAA 113
Query: 436 NLYMKGGSVRGAKFISPANVK-LPE 507
N K R K NVK LP+
Sbjct: 114 N---KQNMFRNLKTSDKINVKDLPK 135
>UniRef50_UPI00015565F5 Cluster: PREDICTED: hypothetical protein,
partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED:
hypothetical protein, partial - Ornithorhynchus anatinus
Length = 224
Score = 57.2 bits (132), Expect = 4e-07
Identities = 28/52 (53%), Positives = 34/52 (65%), Gaps = 1/52 (1%)
Frame = +3
Query: 501 AGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQ-HFRQSGYLVSLSEQN 653
A DWRK GAVT +K+QG CGSCW+ A + E + R S LVSLSEQ+
Sbjct: 132 AETCDWRKEGAVTPVKNQGDCGSCWAFAAVGNVESMWYLRASNRLVSLSEQD 183
>UniRef50_UPI00006CC36D Cluster: Papain family cysteine protease
containing protein; n=1; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 344
Score = 57.2 bits (132), Expect = 4e-07
Identities = 25/56 (44%), Positives = 35/56 (62%), Gaps = 1/56 (1%)
Frame = +3
Query: 501 AGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGY-LVSLSEQNLIDC 665
A +DWR A+T +K QGKCGSCW+ A + E F ++G L + SEQ ++DC
Sbjct: 136 APPMDWRNASAITPVKQQGKCGSCWTFASTAVLESFSFIKNGAPLTNFSEQQILDC 191
>UniRef50_Q60EQ9 Cluster: Putative uncharacterized protein
OJ1280_A04.4; n=1; Oryza sativa (japonica
cultivar-group)|Rep: Putative uncharacterized protein
OJ1280_A04.4 - Oryza sativa subsp. japonica (Rice)
Length = 340
Score = 57.2 bits (132), Expect = 4e-07
Identities = 33/95 (34%), Positives = 48/95 (50%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEHXXXX 686
++D RK GAV ++K Q CGSCW+ + + EG ++G LVSLSEQ L+DC +
Sbjct: 133 SIDRRKKGAVVEVKYQEDCGSCWAFSAVAAIEG--INKNGELVSLSEQELVDCDDEAVGC 190
Query: 687 XXXXXXXXXLQVHQGQRGDRHRADLPYEGS*RPIP 791
H+ +R A+ P G R +P
Sbjct: 191 GGGHHGGELAVPHRERRVPGGEAE-PERGQHRGLP 224
>UniRef50_Q8T930 Cluster: Granule-biosynthesis induced protease
Gip1p; n=4; Tetrahymena thermophila|Rep:
Granule-biosynthesis induced protease Gip1p -
Tetrahymena thermophila
Length = 345
Score = 57.2 bits (132), Expect = 4e-07
Identities = 23/53 (43%), Positives = 34/53 (64%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
+VDWRK G + +K+QG CGSCW+ A + E + ++ L+ SEQ L+DC
Sbjct: 136 SVDWRKRGVLNPVKNQGTCGSCWTFATAGILESFNQIKNKQLLKFSEQQLVDC 188
>UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:
Silicatein beta - Suberites domuncula (Sponge)
Length = 383
Score = 57.2 bits (132), Expect = 4e-07
Identities = 25/53 (47%), Positives = 36/53 (67%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
+DWR G VT +KDQ +CGS ++ + + EG + G LV+LSEQN++DCS
Sbjct: 166 MDWRTSGVVTKVKDQLRCGSSYAFSAMASLEGINALSYGSLVTLSEQNIVDCS 218
>UniRef50_Q23H32 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 365
Score = 57.2 bits (132), Expect = 4e-07
Identities = 24/53 (45%), Positives = 36/53 (67%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
+VDWR+ V ++ QG CGSCW+ + + EG + +Q+G ++ SEQNLIDC
Sbjct: 138 SVDWREK-LVAPVQKQGGCGSCWAFSTVIALEGAYAKQTGNVIKFSEQNLIDC 189
Score = 41.1 bits (92), Expect = 0.031
Identities = 19/59 (32%), Positives = 35/59 (59%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHN 432
E ++R +I+AE+ + I +NQ E + +L +N++ D+ EF + G+N + KHN
Sbjct: 57 EGDYRFQIFAENYNYIHNYNQINENSQDNIQLEVNEFADLSLQEFRELYFGYNSSKKHN 115
>UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicatein
a3 - Lubomirskia baicalensis
Length = 344
Score = 57.2 bits (132), Expect = 4e-07
Identities = 26/56 (46%), Positives = 36/56 (64%)
Frame = +3
Query: 501 AGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
A ++DWR G VT ++ QG+CGS ++ A EG + LV+LSEQN+IDCS
Sbjct: 129 ADSLDWRTRGVVTSVQSQGQCGSSYAFAAAGALEGATALAADKLVALSEQNIIDCS 184
Score = 35.9 bits (79), Expect = 1.2
Identities = 45/176 (25%), Positives = 70/176 (39%), Gaps = 7/176 (3%)
Frame = +1
Query: 268 RMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYM 447
R I+ +K I HN + L Y L MN +GD++ EF + T KH++ +
Sbjct: 64 RHSIWVANKKYIEHHNANAD--LFGYTLAMNGFGDLMSAEFTERY----LTHKHSQRSGL 117
Query: 448 KGGSVRGAKFISPANVKLPERWTGGSTAPSPTSRTKGSVAHAGLQHDWSFGKDSTSVSPA 627
+ + K ++ A+ L R G T+ + S A A + ++ A
Sbjct: 118 Q--TFESPKGVTYAD-SLDWRTRGVVTSVQSQGQCGSSYAFAA----------AGALEGA 164
Query: 628 TWCXXXXXXXXXXXXXXEQRLQRGAHG-------QRFKYIKDNGGIDTEQTYLTRG 774
T + + G HG FKY+ DNGGIDTE +Y +G
Sbjct: 165 TALAADKLVALSEQNIIDCSVPYGNHGCSGGDVYTAFKYVVDNGGIDTESSYPYKG 220
>UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine
protease; n=1; Maconellicoccus hirsutus|Rep: Putative
cathepsin L-like cysteine protease - Maconellicoccus
hirsutus (hibiscus mealybug)
Length = 339
Score = 57.2 bits (132), Expect = 4e-07
Identities = 39/115 (33%), Positives = 60/115 (52%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
ED RMKI+ ++K+ IA+HN+ + GLV+++ G+N+Y DML EF + M + + + +
Sbjct: 45 EDRLRMKIFIDNKYRIAQHNKLFHKGLVTFEQGINEYSDMLQSEFNEKM---GQKSSNQR 101
Query: 436 NLYMKGGSVRGAKFISPANVKLPERWTGGSTAPSPTSRTKGSVAHAGLQHDWSFG 600
N G + +F NV P+ S RTKG V G Q + S G
Sbjct: 102 NTEANG--LPSIRFTPLHNVNPPD---------SVDWRTKGLVGPVGKQVNCSSG 145
Score = 39.5 bits (88), Expect = 0.096
Identities = 20/55 (36%), Positives = 28/55 (50%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
+VDWR G V + Q C S ++ + + EGQ +S QN+IDCSE
Sbjct: 124 SVDWRTKGLVGPVGKQVNCSSGYAWSAIGALEGQLASDKKKFQGISVQNVIDCSE 178
>UniRef50_Q01958 Cluster: Cysteine proteinase 2 precursor; n=11;
Entamoeba|Rep: Cysteine proteinase 2 precursor -
Entamoeba histolytica
Length = 315
Score = 57.2 bits (132), Expect = 4e-07
Identities = 30/97 (30%), Positives = 48/97 (49%), Gaps = 3/97 (3%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSG---YLVSLSEQNLIDC 665
+A +VDWRK G VT I+DQ +CGSC++ L EG+ + G + LSE++++ C
Sbjct: 93 QAPESVDWRKEGKVTPIRDQAQCGSCYTFGSLAALEGRLLIEKGGDANTLDLSEEHMVQC 152
Query: 666 SEHXXXXXXXXXXXXXLQVHQGQRGDRHRADLPYEGS 776
+ + + + G +D PY GS
Sbjct: 153 TRDNGNNGCNGGLGSNVYDYIIEHGVAKESDYPYTGS 189
>UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;
n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
L-like proteinase" precursor - Diabrotica virgifera
virgifera (western corn rootworm)
Length = 315
Score = 56.4 bits (130), Expect = 8e-07
Identities = 29/64 (45%), Positives = 36/64 (56%), Gaps = 1/64 (1%)
Frame = +3
Query: 477 HIAGQR-EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQN 653
H+A +A VDWR AV +KDQG+CGSCW+ + EGQ V LSEQ
Sbjct: 103 HVADPNVQAVEEVDWRD-SAVLGVKDQGQCGSCWAFSTTGSLEGQLAIHKNQRVPLSEQE 161
Query: 654 LIDC 665
L+DC
Sbjct: 162 LVDC 165
Score = 39.1 bits (87), Expect = 0.13
Identities = 17/45 (37%), Positives = 25/45 (55%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEF 390
ED R ++ ++ I +HN KYE G +Y L +NK+ D EF
Sbjct: 39 EDKLRFAVFQDNLKKIEEHNAKYESGEETYYLAVNKFADWSSAEF 83
>UniRef50_Q9VN93 Cluster: Putative cysteine proteinase CG12163
precursor; n=4; Schizophora|Rep: Putative cysteine
proteinase CG12163 precursor - Drosophila melanogaster
(Fruit fly)
Length = 614
Score = 56.4 bits (130), Expect = 8e-07
Identities = 24/51 (47%), Positives = 33/51 (64%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
DWR+ AVT +K+QG CGSCW+ + EG + ++G L SEQ L+DC
Sbjct: 399 DWRQKDAVTQVKNQGSCGSCWAFSVTGNIEGLYAVKTGELKEFSEQELLDC 449
>UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus
tropicalis|Rep: LOC594890 protein - Xenopus tropicalis
(Western clawed frog) (Silurana tropicalis)
Length = 355
Score = 56.0 bits (129), Expect = 1e-06
Identities = 25/56 (44%), Positives = 37/56 (66%), Gaps = 1/56 (1%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHF-RQSGYLVSLSEQNLIDCSE 671
++DWR VT +KDQG C + W+ + + E Q+ R++G L SLS QNL+DCS+
Sbjct: 142 SIDWRNKNCVTSVKDQGSCIASWAFSSIGALECQNMKRRTGKLESLSVQNLLDCSQ 197
Score = 43.2 bits (97), Expect = 0.008
Identities = 22/55 (40%), Positives = 31/55 (56%), Gaps = 1/55 (1%)
Frame = +1
Query: 244 RKRGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFV-KTMN 405
+ GE+ R I+ + I HN +Y MGL +Y++GMN GDM+ E K MN
Sbjct: 64 KNEGEELARRLIWEDTLKFIMLHNLEYSMGLHTYEVGMNHLGDMVAEEMTDKQMN 118
>UniRef50_Q86KD4 Cluster: Similar to Arabidopsis thaliana (Mouse-ear
cress). SAG12 protein; n=2; Dictyostelium
discoideum|Rep: Similar to Arabidopsis thaliana
(Mouse-ear cress). SAG12 protein - Dictyostelium
discoideum (Slime mold)
Length = 358
Score = 56.0 bits (129), Expect = 1e-06
Identities = 23/56 (41%), Positives = 34/56 (60%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
++DWRK G VT +KDQG+CGSC+ + +E E + + LSEQ +DC +
Sbjct: 148 SIDWRKKGLVTPVKDQGQCGSCYIFSAVEQIETAWIKAGNKPILLSEQQAVDCDPY 203
>UniRef50_Q6GNG5 Cluster: LOC443661 protein; n=13; Xenopus|Rep:
LOC443661 protein - Xenopus laevis (African clawed frog)
Length = 346
Score = 55.6 bits (128), Expect = 1e-06
Identities = 23/60 (38%), Positives = 37/60 (61%)
Frame = +3
Query: 489 QREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
+ + ++DWR G VT ++ Q KCGSC++ + + E Q ++ G LV+ S Q L+DCS
Sbjct: 137 EAQPPASIDWRTKGCVTSVRRQRKCGSCYAFSAVGALECQWKKKKGTLVTFSPQELVDCS 196
Score = 46.0 bits (104), Expect = 0.001
Identities = 21/55 (38%), Positives = 30/55 (54%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKT 420
E+ R I+ E I HN +Y +GL +Y++GMN GDM E TM G+ +
Sbjct: 67 EERARRTIWEETLKFITVHNLEYSLGLHTYEVGMNHLGDMTGEEVEATMTGYTSS 121
>UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteinase
B; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like
tick cysteine proteinase B - Haemaphysalis longicornis
(Bush tick)
Length = 332
Score = 55.6 bits (128), Expect = 1e-06
Identities = 32/80 (40%), Positives = 42/80 (52%)
Frame = +3
Query: 342 LQAGHEQVRRHAPPRVREDYERLQQNCQTQQESVHEGWERPRG*VHIAGQREAAGAVDWR 521
+QA HE+V R PRV E +RLQ + + P G +DWR
Sbjct: 70 VQARHERVWRLVAPRVCEHPQRLQAQLPGPP-TWGSTYIEPEG----LEDEHLPKTMDWR 124
Query: 522 KHGAVTDIKDQGKCGSCWSS 581
K GAVT +K+QG+CGSCW+S
Sbjct: 125 KKGAVTPVKNQGQCGSCWAS 144
>UniRef50_Q9VKY4 Cluster: CG5367-PA; n=2; Drosophila
melanogaster|Rep: CG5367-PA - Drosophila melanogaster
(Fruit fly)
Length = 338
Score = 54.8 bits (126), Expect = 2e-06
Identities = 29/88 (32%), Positives = 44/88 (50%), Gaps = 1/88 (1%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS-EHXXX 683
++DWR G +T +Q CGSC++ + E GQ F+++G ++SLS+Q ++DCS H
Sbjct: 130 SLDWRSKGFITPPYNQLSCGSCYAFSIAESIMGQVFKRTGKILSLSKQQIVDCSVSHGNQ 189
Query: 684 XXXXXXXXXXLQVHQGQRGDRHRADLPY 767
L Q G D PY
Sbjct: 190 GCVGGSLRNTLSYLQSTGGIMRDQDYPY 217
Score = 34.7 bits (76), Expect = 2.7
Identities = 18/53 (33%), Positives = 29/53 (54%)
Frame = +1
Query: 274 KIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHN 432
K + E+ +I +HNQ Y+ G S++L N + DM ++K GF + K N
Sbjct: 58 KAFEENFKVIEEHNQNYKEGQTSFRLKPNIFADMSTDGYLK---GFLRLLKSN 107
>UniRef50_Q54RQ2 Cluster: Putative uncharacterized protein; n=1;
Dictyostelium discoideum AX4|Rep: Putative
uncharacterized protein - Dictyostelium discoideum AX4
Length = 339
Score = 54.8 bits (126), Expect = 2e-06
Identities = 24/55 (43%), Positives = 39/55 (70%), Gaps = 1/55 (1%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKC-GSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
++DWR AVT +K+QG C G+ +S + + + E HF ++ L++LSEQN+IDC+
Sbjct: 117 SIDWRNFDAVTPVKNQGLCSGAGYSFSAIGVIESSHFIKNKELITLSEQNIIDCT 171
>UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like
cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep:
Clan CA, family C1, cathepsin L-like cysteine peptidase
- Trichomonas vaginalis G3
Length = 291
Score = 54.8 bits (126), Expect = 2e-06
Identities = 23/59 (38%), Positives = 36/59 (61%)
Frame = +3
Query: 492 REAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
+++ G +D+R+ G V I+DQ +CGSCW+ + E + L LSEQN+IDC+
Sbjct: 76 KDSPGILDYREMGVVNPIRDQKQCGSCWAFGTVAACESNYALLYSNLPQLSEQNIIDCA 134
>UniRef50_P09668 Cluster: Cathepsin H precursor (EC 3.4.22.16)
[Contains: Cathepsin H mini chain; Cathepsin H heavy
chain; Cathepsin H light chain]; n=37; Eukaryota|Rep:
Cathepsin H precursor (EC 3.4.22.16) [Contains:
Cathepsin H mini chain; Cathepsin H heavy chain;
Cathepsin H light chain] - Homo sapiens (Human)
Length = 335
Score = 54.8 bits (126), Expect = 2e-06
Identities = 24/56 (42%), Positives = 37/56 (66%), Gaps = 1/56 (1%)
Frame = +3
Query: 507 AVDWRKHGA-VTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
+VDWRK G V+ +K+QG CGSCW+ + E +G ++SL+EQ L+DC++
Sbjct: 119 SVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQ 174
>UniRef50_P36184 Cluster: Cysteine proteinase ACP1 precursor; n=2;
Entamoeba|Rep: Cysteine proteinase ACP1 precursor -
Entamoeba histolytica
Length = 308
Score = 54.8 bits (126), Expect = 2e-06
Identities = 26/61 (42%), Positives = 35/61 (57%)
Frame = +3
Query: 483 AGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLID 662
A + A +VDWR + KDQG+CGSCW+ + EG+ + G L S SEQ L+D
Sbjct: 88 AAVKAAPESVDWRS--IMNPAKDQGQCGSCWTFCTTAVLEGRVNKDLGKLYSFSEQQLVD 145
Query: 663 C 665
C
Sbjct: 146 C 146
>UniRef50_A0CPL8 Cluster: Chromosome undetermined scaffold_23, whole
genome shotgun sequence; n=1; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_23,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 321
Score = 54.4 bits (125), Expect = 3e-06
Identities = 25/53 (47%), Positives = 33/53 (62%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
VDW + G V IKDQG CGSCW+ + + E Q +V LSEQ+L+DC+
Sbjct: 123 VDWVQKGKVPAIKDQGDCGSCWAFSAVGALEINTKIQFNEIVDLSEQDLVDCA 175
>UniRef50_UPI00006CF360 Cluster: Papain family cysteine protease
containing protein; n=1; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 280
Score = 54.0 bits (124), Expect = 4e-06
Identities = 23/54 (42%), Positives = 33/54 (61%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
DWR G VT +K+QG CGSCW+ L+E + ++ + SEQ L+DCS +
Sbjct: 73 DWRNLGKVTQVKNQGNCGSCWAFTITGLFESINLIRNKTVELYSEQELLDCSSN 126
>UniRef50_Q23FR0 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 394
Score = 54.0 bits (124), Expect = 4e-06
Identities = 24/56 (42%), Positives = 33/56 (58%), Gaps = 1/56 (1%)
Frame = +3
Query: 501 AGAVDWRK-HGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
A +VDWR + +KDQG+CGSCW+ + E + +G L S SEQ L+DC
Sbjct: 184 AASVDWRNVKNVLNPVKDQGQCGSCWTFGAAGVMESFNAITNGVLKSFSEQQLVDC 239
>UniRef50_Q23894 Cluster: Cysteine proteinase 3; n=2; Dictyostelium
discoideum|Rep: Cysteine proteinase 3 - Dictyostelium
discoideum (Slime mold)
Length = 151
Score = 54.0 bits (124), Expect = 4e-06
Identities = 28/53 (52%), Positives = 36/53 (67%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
VDWR+ AVT +KDQG+CGSC S + EG ++G LVSLSEQN++ S
Sbjct: 80 VDWREKDAVTPVKDQGQCGSCIISTTGSV-EGVTAIKTGKLVSLSEQNILRLS 131
Score = 34.7 bits (76), Expect = 2.7
Identities = 19/45 (42%), Positives = 29/45 (64%), Gaps = 1/45 (2%)
Frame = +2
Query: 575 VFSTTGAL-GRTALPSVRLPGVALGAKPHRLLGAYGNNGCNGGLM 706
+ STTG++ G TA+ + +L ++ RL ++GN GCNGGLM
Sbjct: 101 IISTTGSVEGVTAIKTGKLVSLS-EQNILRLSSSFGNEGCNGGLM 144
>UniRef50_Q4TCK1 Cluster: Chromosome undetermined SCAF6860, whole
genome shotgun sequence; n=1; Tetraodon
nigroviridis|Rep: Chromosome undetermined SCAF6860,
whole genome shotgun sequence - Tetraodon nigroviridis
(Green puffer)
Length = 251
Score = 53.6 bits (123), Expect = 5e-06
Identities = 22/39 (56%), Positives = 30/39 (76%)
Frame = +3
Query: 555 GKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
G CGSCW+ + EGQ ++++G LVSLSEQNL+DCS+
Sbjct: 1 GYCGSCWAFSTTGAIEGQIYKKTGQLVSLSEQNLVDCSK 39
>UniRef50_A7ARF8 Cluster: Cysteine protease 2; n=1; Babesia
bovis|Rep: Cysteine protease 2 - Babesia bovis
Length = 445
Score = 53.6 bits (123), Expect = 5e-06
Identities = 26/52 (50%), Positives = 32/52 (61%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
+DWR+ AVT +KDQG CGSCW+ A + E RQ V LSEQ L+ C
Sbjct: 240 IDWRRADAVTPVKDQGMCGSCWAFAAVGSVESLLKRQKTD-VRLSEQELVSC 290
>UniRef50_A0CLZ5 Cluster: Chromosome undetermined scaffold_21, whole
genome shotgun sequence; n=2; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_21,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 349
Score = 53.6 bits (123), Expect = 5e-06
Identities = 26/54 (48%), Positives = 35/54 (64%), Gaps = 1/54 (1%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYL-VSLSEQNLIDCS 668
VD RK G V+++K+QG CGSCW+ + + E RQ G V LSEQ L+DC+
Sbjct: 129 VDLRKDGVVSEVKNQGSCGSCWAFSAVAALE-TALRQGGVKNVELSEQELVDCA 181
Score = 39.9 bits (89), Expect = 0.072
Identities = 19/68 (27%), Positives = 36/68 (52%), Gaps = 1/68 (1%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
E++ R I+ ++ I +H Q+ E GL +++LG+N + D+ EF + T +
Sbjct: 55 ENSHRFGIFKKNYQYIQEHQQRVEAGLETFELGLNDFADLSVEEFEAKYLKYRSTPREQT 114
Query: 436 N-LYMKGG 456
N +Y + G
Sbjct: 115 NQVYRRTG 122
>UniRef50_Q5YER0 Cluster: Digestive cysteine proteinase; n=1;
Bigelowiella natans|Rep: Digestive cysteine proteinase -
Bigelowiella natans (Pedinomonas minutissima)
(Chlorarachnion sp.(strain CCMP 621))
Length = 360
Score = 53.2 bits (122), Expect = 7e-06
Identities = 22/58 (37%), Positives = 36/58 (62%), Gaps = 4/58 (6%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHF-RQSGYL---VSLSEQNLIDCSEH 674
DWR A+T +KDQG CGSCW+ + + E H+ + + L ++LS + L++C +H
Sbjct: 114 DWRDFNALTPVKDQGGCGSCWAFSATQALESAHYIKHNDTLDSPIALSTEQLVECDQH 171
>UniRef50_A2ZSV3 Cluster: Putative uncharacterized protein; n=1;
Oryza sativa (japonica cultivar-group)|Rep: Putative
uncharacterized protein - Oryza sativa subsp. japonica
(Rice)
Length = 289
Score = 53.2 bits (122), Expect = 7e-06
Identities = 23/46 (50%), Positives = 30/46 (65%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSE 647
+DWR GAVT +KDQG CGSCW+ A + EG ++G L LS+
Sbjct: 128 IDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSD 173
>UniRef50_A1KXI0 Cluster: Blo t 1 allergen; n=2; Blomia
tropicalis|Rep: Blo t 1 allergen - Blomia tropicalis
(Mite)
Length = 333
Score = 53.2 bits (122), Expect = 7e-06
Identities = 22/52 (42%), Positives = 31/52 (59%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
DWR+ +T I+ QG CGSCW+ A + E + Q + LSEQ L+DC+
Sbjct: 118 DWRQKARLTRIRQQGSCGSCWAFAAAGVAESLYSIQKQQSIELSEQELVDCT 169
>UniRef50_Q4RTU4 Cluster: Chromosome 12 SCAF14996, whole genome
shotgun sequence; n=3; Tetraodontidae|Rep: Chromosome 12
SCAF14996, whole genome shotgun sequence - Tetraodon
nigroviridis (Green puffer)
Length = 362
Score = 52.8 bits (121), Expect = 1e-05
Identities = 22/52 (42%), Positives = 34/52 (65%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGF 411
E+ +R ++ ++ I HN ++ MG SY+LGMN +GDM H EF + MNG+
Sbjct: 43 EEGWRRMVWEKNLKKIELHNLEHSMGQHSYRLGMNHFGDMTHEEFRQIMNGY 94
Score = 45.2 bits (102), Expect = 0.002
Identities = 19/22 (86%), Positives = 21/22 (95%)
Frame = +3
Query: 603 GQHFRQSGYLVSLSEQNLIDCS 668
GQHFRQ+G LVSLSEQNL+DCS
Sbjct: 183 GQHFRQTGKLVSLSEQNLVDCS 204
Score = 37.9 bits (84), Expect = 0.29
Identities = 16/30 (53%), Positives = 20/30 (66%)
Frame = +1
Query: 697 GAHGQRFKYIKDNGGIDTEQTYLTRGVDDQ 786
G Q F+YIKDNGG+D+E +Y DDQ
Sbjct: 215 GLMDQAFQYIKDNGGLDSEASYPYLATDDQ 244
>UniRef50_Q0J238 Cluster: Os09g0381400 protein; n=5; Oryza
sativa|Rep: Os09g0381400 protein - Oryza sativa subsp.
japonica (Rice)
Length = 362
Score = 52.8 bits (121), Expect = 1e-05
Identities = 25/61 (40%), Positives = 33/61 (54%), Gaps = 1/61 (1%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQ-GKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
+ +VDWR GAV K Q C SCW+ E + ++G LVSLSEQ L+DC
Sbjct: 143 DVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCDS 202
Query: 672 H 674
+
Sbjct: 203 Y 203
>UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2;
Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio
molitor (Yellow mealworm)
Length = 336
Score = 52.8 bits (121), Expect = 1e-05
Identities = 24/53 (45%), Positives = 32/53 (60%), Gaps = 2/53 (3%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQH--FRQSGYLVSLSEQNLIDC 665
DWR G V+ +K+QG CGSCW+ + E Q +GY S+SEQ L+DC
Sbjct: 126 DWRDQGMVSPVKNQGSCGSCWAFSSTGAIESQMKIANGAGYDSSVSEQQLVDC 178
Score = 48.4 bits (110), Expect = 2e-04
Identities = 23/61 (37%), Positives = 33/61 (54%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
E+ FR +I+ + +HN+KY GLVSY LG+N + DM E +G A +K
Sbjct: 43 EETFRKQIFQKKLETFEEHNEKYRQGLVSYTLGVNLFTDMTPEEMKAYTHGLIMPADLHK 102
Query: 436 N 438
N
Sbjct: 103 N 103
>UniRef50_Q23FQ5 Cluster: Papain family cysteine protease containing
protein; n=4; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 332
Score = 52.8 bits (121), Expect = 1e-05
Identities = 22/54 (40%), Positives = 35/54 (64%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
+VDWR GA+ I++QG+CGSC + + E ++ +S L+ SEQ L+DC+
Sbjct: 128 SVDWRNSGALNPIQNQGQCGSCAAFGTAGVLESFYYLKSKQLLKFSEQQLLDCA 181
>UniRef50_Q22AB1 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 344
Score = 52.8 bits (121), Expect = 1e-05
Identities = 23/51 (45%), Positives = 30/51 (58%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
DWR G +T K Q CGSCW+ A + E Q+ + G L+ SEQ L+DC
Sbjct: 136 DWRDKGIITPAKFQNTCGSCWTFATTGVIESQYALKYGELLHFSEQMLLDC 186
>UniRef50_UPI0001509FF9 Cluster: Papain family cysteine protease
containing protein; n=1; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 348
Score = 52.0 bits (119), Expect = 2e-05
Identities = 22/59 (37%), Positives = 35/59 (59%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
E ++DW AV+++K QG C S W+ A + E F ++G + +SEQNL+DC +
Sbjct: 139 EPVNSIDWISKNAVSNVKTQGMCQSSWAFAAVAGVESALFLKNGKIPDVSEQNLLDCDQ 197
>UniRef50_Q24FA8 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 335
Score = 52.0 bits (119), Expect = 2e-05
Identities = 24/59 (40%), Positives = 36/59 (61%), Gaps = 2/59 (3%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQH-FRQSGYLVSL-SEQNLIDC 665
+ A ++DWRK G V+ +K+QG+CG CW+ + L E + VSL S+Q L+DC
Sbjct: 123 QIASSIDWRKKGGVSPVKNQGECGGCWTFSATGLMESFNLIHNKPQNVSLYSQQQLLDC 181
>UniRef50_P25805 Cluster: Trophozoite cysteine proteinase precursor;
n=3; Plasmodium (Laverania)|Rep: Trophozoite cysteine
proteinase precursor - Plasmodium falciparum
Length = 569
Score = 52.0 bits (119), Expect = 2e-05
Identities = 22/54 (40%), Positives = 36/54 (66%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
+D+R+ G V + KDQG CGSCW+ A + E +++ ++S SEQ ++DCS+
Sbjct: 337 LDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSK 390
>UniRef50_Q58HF6 Cluster: Cathepsin L-like cysteine protease; n=1;
Uronema marinum|Rep: Cathepsin L-like cysteine protease
- Uronema marinum
Length = 333
Score = 51.6 bits (118), Expect = 2e-05
Identities = 22/55 (40%), Positives = 35/55 (63%)
Frame = +3
Query: 501 AGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
+G+V+W GAV +++QG CGSCW+ + + E + +G L+S SEQ L+ C
Sbjct: 121 SGSVNWVSKGAVQGVQNQGVCGSCWAFSAVCSLERLYKINTGKLLSFSEQQLVSC 175
>UniRef50_Q4DV75 Cluster: Cysteine protease, putative; n=1;
Trypanosoma cruzi|Rep: Cysteine protease, putative -
Trypanosoma cruzi
Length = 434
Score = 51.6 bits (118), Expect = 2e-05
Identities = 23/55 (41%), Positives = 34/55 (61%), Gaps = 2/55 (3%)
Frame = +3
Query: 507 AVDWR--KHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
A++W+ K+ +T +KDQG CGSCW+ A E E + SG L++LS Q + C
Sbjct: 128 ALNWQEAKNPVLTPVKDQGSCGSCWAHAATESVESMYAISSGKLLTLSTQQITSC 182
>UniRef50_UPI0000F1FB3A Cluster: PREDICTED: similar to cathepsin O;
n=1; Danio rerio|Rep: PREDICTED: similar to cathepsin O
- Danio rerio
Length = 327
Score = 51.2 bits (117), Expect = 3e-05
Identities = 22/52 (42%), Positives = 29/52 (55%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
DWR HG V + +QG CG CW+ + +E E + L LS Q +IDCS
Sbjct: 125 DWRDHGVVGPVHNQGSCGGCWAFSIVEAIESVSAKVGEKLQQLSVQQVIDCS 176
>UniRef50_UPI0000D55E8B Cluster: PREDICTED: similar to Cathepsin O
precursor; n=1; Tribolium castaneum|Rep: PREDICTED:
similar to Cathepsin O precursor - Tribolium castaneum
Length = 326
Score = 51.2 bits (117), Expect = 3e-05
Identities = 22/53 (41%), Positives = 33/53 (62%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
VDWR+ AVT I +QG CG+CW+ + +E E + ++ LS Q +IDC+
Sbjct: 125 VDWREKNAVTRIYNQGSCGACWAYSVIETVESMNAIKTNKSEELSVQEIIDCA 177
>UniRef50_Q8I8D8 Cluster: Cysteine protease 10; n=1; Entamoeba
histolytica|Rep: Cysteine protease 10 - Entamoeba
histolytica
Length = 297
Score = 51.2 bits (117), Expect = 3e-05
Identities = 21/61 (34%), Positives = 36/61 (59%), Gaps = 1/61 (1%)
Frame = +3
Query: 492 REAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYL-VSLSEQNLIDCS 668
+E ++DWR G VT +K+Q KC SC++ + E +++ + LSEQ ++DCS
Sbjct: 106 KEVLDSIDWRSEGKVTPVKNQRKCASCYAFGSIATIESLIMQETSIKEIDLSEQQIVDCS 165
Query: 669 E 671
+
Sbjct: 166 Q 166
>UniRef50_A7APS9 Cluster: Papain family cysteine protease containing
protein; n=1; Babesia bovis|Rep: Papain family cysteine
protease containing protein - Babesia bovis
Length = 435
Score = 51.2 bits (117), Expect = 3e-05
Identities = 23/52 (44%), Positives = 32/52 (61%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
+D RK +T +KDQG CGSCW+ + + + E + V LSEQNL+DC
Sbjct: 230 IDLRKDNYMTPVKDQGNCGSCWAFSLIGVAEPFFKHKRDIDVVLSEQNLVDC 281
>UniRef50_Q5C1E8 Cluster: SJCHGC04937 protein; n=1; Schistosoma
japonicum|Rep: SJCHGC04937 protein - Schistosoma
japonicum (Blood fluke)
Length = 235
Score = 50.8 bits (116), Expect = 4e-05
Identities = 24/53 (45%), Positives = 32/53 (60%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
DWR VT++K+Q KCG W+ A + EGQ S L SLS Q L+DC++
Sbjct: 165 DWRTKNVVTNVKNQEKCGCGWAFASVGALEGQMKLHSIPLQSLSTQQLVDCTQ 217
Score = 34.7 bits (76), Expect = 2.7
Identities = 14/40 (35%), Positives = 25/40 (62%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDM 375
E+ +R I+ + I HN Y++ LV+Y LG+N++ D+
Sbjct: 75 EEIYRRHIWNMYVSRIGLHNLHYDLNLVTYTLGINQFSDL 114
>UniRef50_Q4UDG2 Cluster: Cysteine protease, putative; n=3;
Theileria|Rep: Cysteine protease, putative - Theileria
annulata
Length = 580
Score = 50.8 bits (116), Expect = 4e-05
Identities = 20/52 (38%), Positives = 31/52 (59%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
VDWR+ G V ++ +QG CGSCW+ A +++ + L+ S Q L+DC
Sbjct: 368 VDWRESGFVNEVVNQGSCGSCWAIASEDIFSTFKSIKKNKLMKFSSQQLVDC 419
>UniRef50_A1KYY1 Cluster: Sui m 1 allergen; n=1; Suidasia
medanensis|Rep: Sui m 1 allergen - Suidasia medanensis
Length = 336
Score = 50.8 bits (116), Expect = 4e-05
Identities = 23/53 (43%), Positives = 33/53 (62%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
A DWR+ T +++QG+CGSCW+ A E Q+ + V+LSEQ L+DC
Sbjct: 118 AFDWRQQWN-TAVRNQGQCGSCWAFATAATVEAQYAIRKNVHVTLSEQQLVDC 169
>UniRef50_P22497 Cluster: Cysteine proteinase precursor; n=5;
Theileria|Rep: Cysteine proteinase precursor - Theileria
parva
Length = 440
Score = 50.8 bits (116), Expect = 4e-05
Identities = 20/52 (38%), Positives = 29/52 (55%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
+DWR+ +VT +KDQ CG CW+ + + EG + LS Q L+DC
Sbjct: 233 LDWRRSSSVTSVKDQSNCGGCWAFSTVGSVEGYYMSHFDKSYELSVQELLDC 284
>UniRef50_Q1ENA8 Cluster: Cathepsin H precursor; n=1; Guillardia
theta|Rep: Cathepsin H precursor - Guillardia theta
(Cryptomonas phi)
Length = 353
Score = 50.4 bits (115), Expect = 5e-05
Identities = 24/61 (39%), Positives = 34/61 (55%), Gaps = 5/61 (8%)
Frame = +3
Query: 501 AGAVDWRKH-----GAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
A DWR V+ +K+QG CGSCW+ + E H ++G +V LSEQ L+DC
Sbjct: 119 ADEFDWRNQTCGETSCVSMVKNQGTCGSCWTFSTAAALESLHAIKTGEMVLLSEQQLVDC 178
Query: 666 S 668
+
Sbjct: 179 A 179
>UniRef50_O17255 Cluster: Putative uncharacterized protein; n=1;
Caenorhabditis elegans|Rep: Putative uncharacterized
protein - Caenorhabditis elegans
Length = 299
Score = 50.4 bits (115), Expect = 5e-05
Identities = 22/54 (40%), Positives = 35/54 (64%), Gaps = 1/54 (1%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFR-QSGYLVSLSEQNLIDCS 668
+DWR+ G V +KDQGKC + ++ A + E + + +G L+S SEQ +IDC+
Sbjct: 84 LDWREKGIVGPVKDQGKCNASYAFAAIAAIESMYAKANNGKLLSFSEQQIIDCA 137
>UniRef50_A0DD55 Cluster: Chromosome undetermined scaffold_46, whole
genome shotgun sequence; n=2; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_46,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 336
Score = 50.4 bits (115), Expect = 5e-05
Identities = 23/53 (43%), Positives = 32/53 (60%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
+VDWRK +T +KDQG+C CW+ + E + ++ V LSEQ LIDC
Sbjct: 145 SVDWRK---ITQVKDQGQCSGCWAFGAVGAAEAWFYVKNKTTVLLSEQQLIDC 194
>UniRef50_Q9BL26 Cluster: Putative uncharacterized protein; n=4;
Caenorhabditis elegans|Rep: Putative uncharacterized
protein - Caenorhabditis elegans
Length = 345
Score = 50.0 bits (114), Expect = 7e-05
Identities = 28/74 (37%), Positives = 41/74 (55%), Gaps = 1/74 (1%)
Frame = +3
Query: 453 WERPRG*VHIAGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFR-QSGY 629
WE P +H+ R +DWR+ G V +KDQGKC + + A E + + +G
Sbjct: 72 WETP---IHM--DRTTEEFLDWREKGIVGPVKDQGKCNASHAFAITSSIESMYAKATNGT 126
Query: 630 LVSLSEQNLIDCSE 671
L+S SEQ LIDC++
Sbjct: 127 LLSFSEQQLIDCND 140
>UniRef50_Q4N071 Cluster: Cysteine proteinase, putative; n=5;
Piroplasmida|Rep: Cysteine proteinase, putative -
Theileria parva
Length = 460
Score = 49.6 bits (113), Expect = 9e-05
Identities = 23/53 (43%), Positives = 32/53 (60%), Gaps = 1/53 (1%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQG-KCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
+DWRK V+ IK+QG +CGSCW+ A + E + + LSEQ L+DC
Sbjct: 253 LDWRKADGVSKIKNQGLECGSCWAFASVSSVESLYKIYRNVTLDLSEQELVDC 305
>UniRef50_A2DLD4 Cluster: Clan CA, family C1, cathepsin L or K-like
cysteine peptidase; n=3; Trichomonas vaginalis|Rep: Clan
CA, family C1, cathepsin L or K-like cysteine peptidase
- Trichomonas vaginalis G3
Length = 320
Score = 49.6 bits (113), Expect = 9e-05
Identities = 20/51 (39%), Positives = 31/51 (60%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
DWR G + I++QG+CG CW+ + + E + + L+ LSEQ L+DC
Sbjct: 109 DWRTKGIINPIRNQGQCGLCWAFSTICCVEARWAQAYNTLLQLSEQMLVDC 159
>UniRef50_UPI00006CEBCD Cluster: Papain family cysteine protease
containing protein; n=1; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 350
Score = 49.2 bits (112), Expect = 1e-04
Identities = 24/88 (27%), Positives = 45/88 (51%), Gaps = 1/88 (1%)
Frame = +3
Query: 408 LQQNCQTQQESVHEGWERPRG*VHIAGQREAAGAVDWRKH-GAVTDIKDQGKCGSCWSSA 584
L +T S + + P+ + A+ DWR + G + ++K+QG+CGSCW+ A
Sbjct: 109 LNSQLKTSASSSSQPAQTPQLRGSVDASLNASQGFDWRNYQGVLGNVKNQGQCGSCWTFA 168
Query: 585 RLELWEGQHFRQSGYLVSLSEQNLIDCS 668
+ E + + + SEQ+++DC+
Sbjct: 169 TAGVLESYYALKYQQSLIFSEQDIVDCA 196
>UniRef50_Q22NW9 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 894
Score = 49.2 bits (112), Expect = 1e-04
Identities = 29/93 (31%), Positives = 38/93 (40%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSEH 674
E ++DWR AVT +K+QG CGS ++ + EG H SEQ +IDCS
Sbjct: 682 EVPSSIDWRDLNAVTPVKNQGSCGSGYAFSTTGALEGIHKISGKDWKGFSEQQIIDCSRK 741
Query: 675 XXXXXXXXXXXXXLQVHQGQRGDRHRADLPYEG 773
+ G D PYEG
Sbjct: 742 QGNSGCHGGFMENAFDFVIENGILQENDYPYEG 774
>UniRef50_P42666 Cluster: Cysteine proteinase precursor; n=18;
Plasmodium|Rep: Cysteine proteinase precursor -
Plasmodium vivax (strain Salvador I)
Length = 583
Score = 49.2 bits (112), Expect = 1e-04
Identities = 22/55 (40%), Positives = 38/55 (69%), Gaps = 1/55 (1%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQ-SGYLVSLSEQNLIDCSE 671
+D+R+ G V + KDQG CGSCW+ A + E + ++ + +++LSEQ ++DCS+
Sbjct: 343 LDYREKGIVHEPKDQGLCGSCWAFASVGNVECMYAKEHNKTILTLSEQEVVDCSK 397
>UniRef50_Q9PYY5 Cluster: Viral cathepsin; n=3; Granulovirus|Rep:
Viral cathepsin - Xestia c-nigrum granulosis virus
(XnGV) (Xestia c-nigrumgranulovirus)
Length = 346
Score = 49.2 bits (112), Expect = 1e-04
Identities = 20/53 (37%), Positives = 31/53 (58%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
DWR +VT +K Q +CGSCW+ + + E + + + LSEQ L+DC +
Sbjct: 138 DWRDRNSVTSVKMQKECGSCWAFSAVANIESLYHIKHNVSLDLSEQQLVDCDK 190
>UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n=1;
Rattus norvegicus|Rep: UPI0000501FDB UniRef100 entry -
Rattus norvegicus
Length = 338
Score = 48.8 bits (111), Expect = 2e-04
Identities = 20/40 (50%), Positives = 28/40 (70%)
Frame = +3
Query: 552 QGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
QG+C SCW+ + EGQ F+++G L LS QNL+DCS+
Sbjct: 139 QGRCNSCWAFPVVGAIEGQMFKKTGKLTPLSVQNLVDCSK 178
>UniRef50_Q7KWP5 Cluster: Similar to Dictyostelium discoideum (Slime
mold). Cysteine proteinase 5; n=2; Dictyostelium
discoideum|Rep: Similar to Dictyostelium discoideum
(Slime mold). Cysteine proteinase 5 - Dictyostelium
discoideum (Slime mold)
Length = 345
Score = 48.8 bits (111), Expect = 2e-04
Identities = 27/59 (45%), Positives = 32/59 (54%), Gaps = 3/59 (5%)
Frame = +3
Query: 501 AGAVDWRKHGAVTDIKDQ-GKCGSCWSSARLELWEGQHF--RQSGYLVSLSEQNLIDCS 668
+ +DWRK GAV +K Q G CGS W + E HF +SLS QNLIDCS
Sbjct: 121 SSGIDWRKKGAVPSVKSQIGGCGS-WPITAVGATESAHFLANPKDPFISLSMQNLIDCS 178
>UniRef50_Q75JH0 Cluster: Similar to Dictyostelium discoideum (Slime
mold). Gamete and mating- type specific protein A; n=2;
Dictyostelium discoideum|Rep: Similar to Dictyostelium
discoideum (Slime mold). Gamete and mating- type
specific protein A - Dictyostelium discoideum (Slime
mold)
Length = 415
Score = 48.8 bits (111), Expect = 2e-04
Identities = 23/59 (38%), Positives = 33/59 (55%), Gaps = 3/59 (5%)
Frame = +3
Query: 498 AAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYL---VSLSEQNLIDC 665
+ G VDW+ G VT IK+QG+CG C+S A E + ++ + LSEQN + C
Sbjct: 209 STGDVDWKSLGFVTSIKNQGQCGGCYSFATCAALESAYLIKNNLPNTDIDLSEQNFVSC 267
>UniRef50_Q5CWJ0 Cluster: Cryptopain-cysteine proteinase secreted,
possible transmembrane domain near N-terminus; n=4;
Cryptosporidium|Rep: Cryptopain-cysteine proteinase
secreted, possible transmembrane domain near N-terminus
- Cryptosporidium parvum Iowa II
Length = 401
Score = 48.8 bits (111), Expect = 2e-04
Identities = 22/56 (39%), Positives = 34/56 (60%), Gaps = 1/56 (1%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGY-LVSLSEQNLIDCSE 671
+++W + G V I++Q CGSCW+ + + EG Q+ L SLSEQ +DCS+
Sbjct: 179 SINWVEAGCVNPIRNQKNCGSCWAFSAVAALEGATCAQTNRGLPSLSEQQFVDCSK 234
Score = 38.3 bits (85), Expect = 0.22
Identities = 24/87 (27%), Positives = 44/87 (50%), Gaps = 3/87 (3%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
E+N R +IY ++ + I N + G SY L MN++GD+ EF+ G+ K +K ++
Sbjct: 102 EENQRFEIYKQNMNFIKTTNSQ---GF-SYVLEMNEFGDLSKEEFMARFTGYIKDSKDDE 157
Query: 436 NLYMK---GGSVRGAKFISPANVKLPE 507
++ S +F+ P ++ E
Sbjct: 158 RVFKSSRVSASESEEEFVPPNSINWVE 184
>UniRef50_Q248G1 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 334
Score = 48.8 bits (111), Expect = 2e-04
Identities = 22/54 (40%), Positives = 32/54 (59%), Gaps = 1/54 (1%)
Frame = +3
Query: 507 AVDWRK-HGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
+VDWR V IK+QG CGSCW+ + + E + + G VS +EQ ++DC
Sbjct: 124 SVDWRNVTNVVGPIKNQGHCGSCWTFSIAGIVESHYVLKHGSYVSYAEQEILDC 177
>UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheirus
salmonis|Rep: Cysteine proteinase - Lepeophtheirus
salmonis (salmon louse)
Length = 372
Score = 48.8 bits (111), Expect = 2e-04
Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Frame = +3
Query: 507 AVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVS--LSEQNLIDCSEH 674
+VDWR+ G +TD+K+QG CGSCW + +E E ++ LS Q + CS +
Sbjct: 118 SVDWREKGVITDVKNQGSCGSCWVFSAVEQIESYVAIENNMTSPPLLSTQQITSCSSN 175
>UniRef50_O28333 Cluster: Cysteine proteinase, putative; n=2;
cellular organisms|Rep: Cysteine proteinase, putative -
Archaeoglobus fulgidus
Length = 1088
Score = 48.8 bits (111), Expect = 2e-04
Identities = 20/55 (36%), Positives = 33/55 (60%), Gaps = 2/55 (3%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSG--YLVSLSEQNLIDCSE 671
DWR + ++ ++DQG CGSCW+ + + E +SG + LSEQ+L+ C +
Sbjct: 599 DWRDYTGLSAVRDQGSCGSCWAHSAVAALESALIVESGASSSIDLSEQHLLSCEQ 653
>UniRef50_UPI0000D566ED Cluster: PREDICTED: similar to CTLA-2-alpha
protein precursor; n=1; Tribolium castaneum|Rep:
PREDICTED: similar to CTLA-2-alpha protein precursor -
Tribolium castaneum
Length = 101
Score = 48.4 bits (110), Expect = 2e-04
Identities = 18/44 (40%), Positives = 32/44 (72%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 387
E+NFR +++A++ I +HN+KYE G V+Y +G+N++ D+ E
Sbjct: 45 EENFRKQLFAKNLEKIEEHNKKYEQGQVTYTMGVNQFSDLTPEE 88
>UniRef50_Q9XWA4 Cluster: Putative uncharacterized protein; n=2;
Caenorhabditis|Rep: Putative uncharacterized protein -
Caenorhabditis elegans
Length = 343
Score = 48.4 bits (110), Expect = 2e-04
Identities = 27/63 (42%), Positives = 34/63 (53%), Gaps = 3/63 (4%)
Frame = +3
Query: 489 QREAAGAVDWRK-HGA--VTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLI 659
++ +VDWR +G VT IK QG CGSCW+ A E G L SLS Q L+
Sbjct: 132 KKNLPNSVDWRNVNGTNHVTGIKYQGPCGSCWAFATAAAIESAVSISGGGLQSLSSQQLL 191
Query: 660 DCS 668
DC+
Sbjct: 192 DCT 194
>UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila
melanogaster|Rep: CG11459-PA - Drosophila melanogaster
(Fruit fly)
Length = 336
Score = 48.4 bits (110), Expect = 2e-04
Identities = 20/53 (37%), Positives = 35/53 (66%), Gaps = 1/53 (1%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQG-KCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
+DWR++G ++ + DQG +C SCW+ + + E ++ G LV LS ++L+DC
Sbjct: 122 IDWRQYGYISPVGDQGTECLSCWAFSTSGVLEAHMAKKYGNLVPLSPKHLVDC 174
Score = 38.7 bits (86), Expect = 0.17
Identities = 14/41 (34%), Positives = 23/41 (56%)
Frame = +1
Query: 250 RGEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGD 372
R D + +Y + + HNQ Y G V++K+G+NK+ D
Sbjct: 42 RNRDKYHRALYEQRVLAVESHNQLYLQGKVAFKMGLNKFSD 82
>UniRef50_Q3L7L8 Cluster: Sar s 1 allergen Yv4003H01; n=1; Sarcoptes
scabiei type hominis|Rep: Sar s 1 allergen Yv4003H01 -
Sarcoptes scabiei type hominis
Length = 330
Score = 48.4 bits (110), Expect = 2e-04
Identities = 25/56 (44%), Positives = 35/56 (62%), Gaps = 3/56 (5%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHF--RQ-SGYLVSLSEQNLIDCS 668
+D RK G VT +KDQ KCG+CW+ + + E + RQ S + LSEQ L+DC+
Sbjct: 117 IDLRKCGFVTPVKDQKKCGACWAFSTVCTTESLYLSSRQVSPWKFGLSEQELVDCA 172
>UniRef50_A1ZBK7 Cluster: CG10460-PA; n=1; Drosophila
melanogaster|Rep: CG10460-PA - Drosophila melanogaster
(Fruit fly)
Length = 79
Score = 48.0 bits (109), Expect = 3e-04
Identities = 21/47 (44%), Positives = 31/47 (65%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVK 396
ED R +IYAE K I +HN+K+E G V++K+G+N D+ EF +
Sbjct: 24 EDLMRRRIYAESKARIEEHNRKFEKGEVTWKMGINHLADLTPEEFAQ 70
>UniRef50_UPI00006A2275 Cluster: UPI00006A2275 related cluster; n=1;
Xenopus tropicalis|Rep: UPI00006A2275 UniRef100 entry -
Xenopus tropicalis
Length = 272
Score = 47.6 bits (108), Expect = 4e-04
Identities = 22/59 (37%), Positives = 36/59 (61%), Gaps = 1/59 (1%)
Frame = +3
Query: 498 AAGAVDWRKHGAVTDIKDQGK-CGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCSE 671
A ++DWR VT ++DQG C SC++ + + E Q +++ LV+ S Q L+DCS+
Sbjct: 79 APPSIDWRTQNCVTPVRDQGSFCRSCYAFSAVGALECQWKKKTVRLVTFSPQELVDCSD 137
Score = 47.2 bits (107), Expect = 5e-04
Identities = 22/62 (35%), Positives = 33/62 (53%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
E+ R I+ E I+ HN +Y +GL +Y++GMN GDM E TM G+ +
Sbjct: 6 EERARRTIWEETLKFISVHNLEYSLGLHTYEVGMNHLGDMTGEEVAATMTGYTGSGDSLA 65
Query: 436 NL 441
N+
Sbjct: 66 NM 67
>UniRef50_Q23FS6 Cluster: Papain family cysteine protease containing
protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 330
Score = 47.6 bits (108), Expect = 4e-04
Identities = 24/56 (42%), Positives = 33/56 (58%), Gaps = 3/56 (5%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGY---LVSLSEQNLIDCS 668
V+W G V+ +KDQG+CGSCW+ + E +GY + LSEQ L+DCS
Sbjct: 121 VNWVTRGKVSAVKDQGQCGSCWAFSTTGSVESA-LIIAGYANQTIDLSEQQLVDCS 175
>UniRef50_Q8IEV8 Cluster: Phospholipase A1; n=4;
Hymenostomatida|Rep: Phospholipase A1 - Tetrahymena
thermophila
Length = 320
Score = 46.8 bits (106), Expect = 6e-04
Identities = 23/62 (37%), Positives = 35/62 (56%), Gaps = 5/62 (8%)
Frame = +3
Query: 501 AGAVDWRKHGAVTDIKDQGKCGSCWSSARL-----ELWEGQHFRQSGYLVSLSEQNLIDC 665
A VDW G VT +K+QG CGSCW+ + + LW Q+ ++L+EQ +DC
Sbjct: 113 ATEVDWTAKGKVTPVKNQGSCGSCWAFSTIGAVESALWIAGQGEQN--TLNLAEQEQVDC 170
Query: 666 SE 671
++
Sbjct: 171 AK 172
>UniRef50_Q7QWP2 Cluster: GLP_26_49243_47612; n=1; Giardia lamblia
ATCC 50803|Rep: GLP_26_49243_47612 - Giardia lamblia
ATCC 50803
Length = 543
Score = 46.8 bits (106), Expect = 6e-04
Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 7/59 (11%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWS-------SARLELWEGQHFRQSGYLVSLSEQNLIDC 665
+DWR G +T +KDQ CGSCWS RL + + + L+ +SEQ++I C
Sbjct: 320 LDWRVRGVITPVKDQAACGSCWSFGAAGTIEGRLNALKWKRGERDTPLLRVSEQSIISC 378
>UniRef50_A0CHV8 Cluster: Chromosome undetermined scaffold_184,
whole genome shotgun sequence; n=1; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_184,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 331
Score = 46.8 bits (106), Expect = 6e-04
Identities = 19/67 (28%), Positives = 36/67 (53%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK 435
E+ R ++A++ ++ +HN K+E+G ++ LGMN+Y D+ EF + + K
Sbjct: 49 EEVHRFSVFAQNLAVVMEHNSKFELGQETFTLGMNQYADLTPEEFQASFLTLKTKVQDRK 108
Query: 436 NLYMKGG 456
N+ G
Sbjct: 109 NVKSYSG 115
Score = 41.9 bits (94), Expect = 0.018
Identities = 21/53 (39%), Positives = 28/53 (52%)
Frame = +3
Query: 510 VDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
VDW K G +K+QG CGSCW+ A E V++SEQ +DC+
Sbjct: 122 VDW-KDGLT--VKNQGSCGSCWAFAAAAAIEAGFQHHKKNKVNISEQEFVDCT 171
>UniRef50_Q5YER1 Cluster: Cysteine proteinase; n=1; Bigelowiella
natans|Rep: Cysteine proteinase - Bigelowiella natans
(Pedinomonas minutissima) (Chlorarachnion sp.(strain
CCMP 621))
Length = 140
Score = 46.4 bits (105), Expect = 8e-04
Identities = 17/28 (60%), Positives = 23/28 (82%)
Frame = +3
Query: 495 EAAGAVDWRKHGAVTDIKDQGKCGSCWS 578
++A +VDW GAVT +K+QG+CGSCWS
Sbjct: 107 KSADSVDWVSKGAVTPVKNQGQCGSCWS 134
>UniRef50_Q17IZ6 Cluster: Procathepsin L3, putative; n=2;
Culicidae|Rep: Procathepsin L3, putative - Aedes aegypti
(Yellowfever mosquito)
Length = 313
Score = 46.4 bits (105), Expect = 8e-04
Identities = 28/99 (28%), Positives = 42/99 (42%), Gaps = 1/99 (1%)
Frame = +3
Query: 483 AGQREAAGAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLID 662
A Q ++DWR G T +Q CGSC++ + GQ R+ G + +S Q ++D
Sbjct: 129 ATQNSMPDSLDWRDKGFTTMAVNQKTCGSCYAFSIGHALNGQIMRRIGRVEYVSTQQMVD 188
Query: 663 CS-EHXXXXXXXXXXXXXLQVHQGQRGDRHRADLPYEGS 776
CS +Q Q +G +D PY S
Sbjct: 189 CSTSAGNKGCAGGSLRFTMQYLQNSQGIMRSSDYPYTSS 227
Score = 36.7 bits (81), Expect = 0.67
Identities = 25/115 (21%), Positives = 49/115 (42%), Gaps = 9/115 (7%)
Frame = +1
Query: 268 RMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNK---- 435
R + + ++ I +HN YE G ++++G+N+ DM ++K M H K
Sbjct: 51 RKRAFKKNMQEIEEHNANYEQGKSTFQMGVNELADMDKSSYLKKMVRMTDAIDHRKLDVD 110
Query: 436 --NLYMKGGSVRGAKFISPANVKLPER--WTG-GSTAPSPTSRTKGSVAHAGLQH 585
+ ++ + G +F+ +P+ W G T + +T GS + H
Sbjct: 111 FNDEMLQATNAFGEEFVQATQNSMPDSLDWRDKGFTTMAVNQKTCGSCYAFSIGH 165
>UniRef50_A0D3D1 Cluster: Chromosome undetermined scaffold_36, whole
genome shotgun sequence; n=2; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_36,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 307
Score = 46.4 bits (105), Expect = 8e-04
Identities = 22/55 (40%), Positives = 31/55 (56%)
Frame = +3
Query: 504 GAVDWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
G DW + IK+QG CGSCW+ + + EG + G+ LSEQ L+DC+
Sbjct: 110 GDADWASK--MNPIKNQGNCGSCWTFSAIGAVEGFLAIRKGFKGVLSEQQLVDCA 162
>UniRef50_O91466 Cluster: Viral cathepsin; n=2; Granulovirus|Rep:
Viral cathepsin - Cydia pomonella granulosis virus
(CpGV) (Cydia pomonellagranulovirus)
Length = 333
Score = 46.4 bits (105), Expect = 8e-04
Identities = 22/53 (41%), Positives = 36/53 (67%), Gaps = 1/53 (1%)
Frame = +3
Query: 510 VDWR-KHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDC 665
+DWR KHG VT +K+Q +CGSCW+ + + E + + ++LSEQ+L++C
Sbjct: 128 LDWRDKHG-VTPVKNQMECGSCWAFSTIANIESLYNIKYDKALNLSEQHLVNC 179
>UniRef50_UPI0000D566EC Cluster: PREDICTED: similar to CG10460-PA;
n=1; Tribolium castaneum|Rep: PREDICTED: similar to
CG10460-PA - Tribolium castaneum
Length = 80
Score = 46.0 bits (104), Expect = 0.001
Identities = 16/44 (36%), Positives = 30/44 (68%)
Frame = +1
Query: 256 EDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 387
E+++R ++ + ++ HN+KYE GLV+YK+G+N++ D E
Sbjct: 30 EESYRKSLFVANLQMVESHNEKYEDGLVNYKMGINQFADYSKEE 73
>UniRef50_Q4S406 Cluster: Chromosome 20 SCAF14744, whole genome
shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome 20 SCAF14744, whole genome shotgun sequence -
Tetraodon nigroviridis (Green puffer)
Length = 175
Score = 46.0 bits (104), Expect = 0.001
Identities = 20/52 (38%), Positives = 30/52 (57%)
Frame = +3
Query: 513 DWRKHGAVTDIKDQGKCGSCWSSARLELWEGQHFRQSGYLVSLSEQNLIDCS 668
DWR + V +++Q CGSCW+ + + + H S LV LS Q ++DCS
Sbjct: 64 DWRDNAVVGPVQNQQACGSCWAFSVVGAVQSVHAIGSSPLVELSVQQVLDCS 115
Database: uniref50
Posted date: Oct 5, 2007 11:19 AM
Number of letters in database: 575,637,011
Number of sequences in database: 1,657,284
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.279 0.0580 0.190
Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 776,152,799
Number of Sequences: 1657284
Number of extensions: 16417678
Number of successful extensions: 56151
Number of sequences better than 10.0: 405
Number of HSP's better than 10.0 without gapping: 52437
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 55977
length of database: 575,637,011
effective HSP length: 99
effective length of database: 411,565,895
effective search space used: 68319938570
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
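
Note on the statistics above: the bit scores and Expect values in each alignment header can be approximately reproduced from the gapped Lambda and K and the effective search space. The sketch below assumes the standard Karlin-Altschul relations, S' = (Lambda*S - ln K) / ln 2 and E = N * 2^(-S'), where S is the raw score shown in parentheses after the bit score. The function names are illustrative and not part of BLAST.

```python
import math

# Values copied from the "Gapped" statistics block and footer of this report.
GAPPED_LAMBDA = 0.279
GAPPED_K = 0.0580
EFFECTIVE_SEARCH_SPACE = 68319938570


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=EFFECTIVE_SEARCH_SPACE):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    # Raw scores 118 and 104 appear in the report as 51.6 and 46.0 bits.
    for raw in (118, 104):
        bits = bit_score(raw)
        print(f"raw={raw} bits={bits:.1f} E={expect_value(bits):.1e}")
```

For a raw score of 118 this should land close to the 51.6 bits and Expect = 2e-05 reported for the corresponding hits; small differences come from the rounded Lambda and K printed in the report.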