BLASTX 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= I09A02NGRL0001_H14
         (450 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C...    76   3e-13
UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine prot...    58   7e-08
UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat...    56   4e-07
UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ...    46   4e-04
UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ...    46   4e-04
UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ...    45   9e-04
UniRef50_Q8I880 Cluster: Digestive cysteine protease intestain; ...    43   0.003
UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ...    42   0.008
UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi...    42   0.008
UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis...    42   0.008
UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr...    41   0.011
UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei...    41   0.014
UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No...    40   0.025
UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;...    40   0.025
UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ...    40   0.025
UniRef50_UPI00015B5D85 Cluster: PREDICTED: similar to cathepsin ...    40   0.033
UniRef50_A2A5X5 Cluster: Ortholog of keratin associated protein ...    40   0.033
UniRef50_A1ZBK7 Cluster: CG10460-PA; n=1; Drosophila melanogaste...    39   0.044
UniRef50_UPI0000D566ED Cluster: PREDICTED: similar to CTLA-2-alp...    39   0.058
UniRef50_UPI0000D566EC Cluster: PREDICTED: similar to CG10460-PA...    39   0.058
UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v...    39   0.058
UniRef50_Q9BYR4 Cluster: Keratin-associated protein 4-3; n=53; M...    39   0.058
UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain...    38   0.076
UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain...    38   0.10
UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina...    38   0.13
UniRef50_A7LFV3 Cluster: Cathepsin L protease inhibitor 2; n=1; ...    37   0.18
UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, wh...    37   0.18
UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n...    37   0.23
UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain...    37   0.23
UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin ...    36   0.31
UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat...    36   0.31
UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ...    36   0.31
UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio...    36   0.31
UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ...    36   0.31
UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae...    36   0.41
UniRef50_Q248G1 Cluster: Papain family cysteine protease contain...    36   0.41
UniRef50_Q23H06 Cluster: Papain family cysteine protease contain...    36   0.41
UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:...    36   0.54
UniRef50_Q17641 Cluster: Putative uncharacterized protein; n=11;...    36   0.54
UniRef50_Q9BYP8 Cluster: Keratin-associated protein 17-1; n=28; ...    36   0.54
UniRef50_Q94715 Cluster: Putative cathepsin L2 precursor; n=4; P...    36   0.54
UniRef50_Q14C04 Cluster: Keratin associated protein 4-7; n=17; M...    35   0.71
UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain...    35   0.71
UniRef50_A7LFV2 Cluster: Cathepsin L protease inhibitor 1; n=1; ...    35   0.71
UniRef50_Q231X3 Cluster: Papain family cysteine protease contain...    35   0.94
UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicate...    35   0.94
UniRef50_UPI0001553357 Cluster: PREDICTED: similar to novel memb...    34   1.2
UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ...    34   1.2
UniRef50_Q86MN0 Cluster: Cathepsin L-like cysteine protease; n=1...    34   1.2
UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, who...    34   1.2
UniRef50_Q86UY7 Cluster: Similar to RIKEN cDNA 2310002J15 gene; ...    34   1.2
UniRef50_UPI0001555AB0 Cluster: PREDICTED: hypothetical protein;...    34   1.6
UniRef50_UPI0000E22843 Cluster: PREDICTED: hypothetical protein;...    34   1.6
UniRef50_UPI000155F1D6 Cluster: PREDICTED: similar to keratin as...    33   2.2
UniRef50_UPI000155BC4F Cluster: PREDICTED: hypothetical protein,...    33   2.2
UniRef50_UPI000155483C Cluster: PREDICTED: similar to keratin as...    33   2.2
UniRef50_Q24E33 Cluster: Papain family cysteine protease contain...    33   2.2
UniRef50_A0CAI9 Cluster: Chromosome undetermined scaffold_161, w...    33   2.2
UniRef50_Q9BYR0 Cluster: Keratin-associated protein 4-7; n=149; ...    33   2.2
UniRef50_Q207N1 Cluster: Cathepsin S; n=2; Clupeocephala|Rep: Ca...    33   2.9
UniRef50_A2A4R5 Cluster: Novel member of the keratin associated ...    33   2.9
UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:...    33   2.9
UniRef50_Q8IUG1 Cluster: Keratin-associated protein 1-3; n=65; M...    33   2.9
UniRef50_UPI0001554838 Cluster: PREDICTED: similar to solute car...    33   3.8
UniRef50_UPI00006CB160 Cluster: hypothetical protein TTHERM_0029...    33   3.8
UniRef50_UPI0000499201 Cluster: RIO1 family protein; n=2; Entamo...    33   3.8
UniRef50_Q70SU8 Cluster: Cystein proteinase inhibitor protein pr...    33   3.8
UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n...    33   3.8
UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain...    33   3.8
UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru...    33   3.8
UniRef50_Q7YWV7 Cluster: Putative uncharacterized protein; n=2; ...    32   5.0
UniRef50_Q23H10 Cluster: Papain family cysteine protease contain...    32   5.0
UniRef50_A0EI50 Cluster: Chromosome undetermined scaffold_98, wh...    32   5.0
UniRef50_A0BNM1 Cluster: Chromosome undetermined scaffold_119, w...    32   5.0
UniRef50_P60410 Cluster: Keratin-associated protein 10-8; n=12; ...    32   5.0
UniRef50_P60372 Cluster: Keratin-associated protein 10-4; n=18; ...    32   5.0
UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|...    32   5.0
UniRef50_Q9BXQ6 Cluster: Cat eye syndrome critical region protei...    32   5.0
UniRef50_UPI0001555234 Cluster: PREDICTED: similar to hCG2041354...    32   6.6
UniRef50_Q4MW49 Cluster: Putative uncharacterized protein; n=1; ...    32   6.6
UniRef50_A1Z9I0 Cluster: CG6357-PA; n=3; Drosophila melanogaster...    32   6.6
UniRef50_Q4PEV5 Cluster: Putative uncharacterized protein; n=1; ...    32   6.6
UniRef50_Q6L8H1 Cluster: Keratin-associated protein 5-4; n=160; ...    32   6.6
UniRef50_P60412 Cluster: Keratin-associated protein 10-11; n=80;...    32   6.6
UniRef50_P60368 Cluster: Keratin-associated protein 10-2; n=64; ...    32   6.6
UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h...    31   8.8
UniRef50_UPI0001552953 Cluster: PREDICTED: hypothetical protein;...    31   8.8
UniRef50_UPI0000EBE77C Cluster: PREDICTED: hypothetical protein;...    31   8.8
UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n...    31   8.8
UniRef50_Q7RH27 Cluster: Transmembrane amino acid transporter pr...    31   8.8
UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ...    31   8.8
UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endop...    31   8.8
UniRef50_A0CKJ6 Cluster: Chromosome undetermined scaffold_2, who...
31 8.8 >UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain] - Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina) Length = 339 Score = 76.2 bits (179), Expect = 3e-13 Identities = 31/52 (59%), Positives = 42/52 (80%) Frame = +2 Query: 143 ATVGAVSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQ 298 A A+S DL+KEEW+T+K++H K Y NE+E++FRMKI+ EN+H IAKHNQ Sbjct: 13 ALTQAISPLDLIKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQ 64 >UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine protease; n=1; Maconellicoccus hirsutus|Rep: Putative cathepsin L-like cysteine protease - Maconellicoccus hirsutus (hibiscus mealybug) Length = 339 Score = 58.4 bits (135), Expect = 7e-08 Identities = 23/52 (44%), Positives = 38/52 (73%) Frame = +2 Query: 143 ATVGAVSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQ 298 +++ A+S +L EEW FK ++SK Y ++ED+ RMKI+ +NK+ IA+HN+ Sbjct: 14 SSIQAISPVNLFHEEWQLFKTQYSKKYTTDIEDRLRMKIFIDNKYRIAQHNK 65 >UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviatus|Rep: Cathepsin L 2 - Diaprepes abbreviatus (Sugarcane rootstalk borer weevil) Length = 348 Score = 56.0 bits (129), Expect = 4e-07 Identities = 21/48 (43%), Positives = 35/48 (72%) Frame = +2 Query: 155 AVSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQ 298 A+S+ LV+E+W FK+EH KVY++E E+++R ++ EN I +HN+ Sbjct: 17 AISYQVLVQEQWEQFKLEHGKVYESESENEYRQSVFMENLFQINEHNK 64 >UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin L-like protease; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to cathepsin L-like protease - Nasonia vitripennis Length = 353 Score = 46.0 bits (104), Expect = 4e-04 Identities = 15/40 (37%), Positives = 28/40 (70%) Frame = +2 Query: 182 EEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 ++W FK+ + K Y+ ++E+ FR ++ EN+ IA+HNQ+ Sbjct: 38 DDWAAFKLRYKKNYNGDVEENFRRSVFHENQRKIAEHNQK 77 
>UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2); n=2; Tribolium castaneum|Rep: PREDICTED: similar to Cathepsin K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2) - Tribolium castaneum Length = 332 Score = 46.0 bits (104), Expect = 4e-04 Identities = 16/44 (36%), Positives = 30/44 (68%) Frame = +2 Query: 170 DLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 +LV+EEWN FK H++ + + +E+ FR ++ +N + +HN+R Sbjct: 21 NLVEEEWNKFKAMHARAFFDPLEETFRKSLFTKNLEIVEEHNER 64 >UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine protease; n=11; Callosobruchus maculatus|Rep: Putative gut cathepsin L-like cysteine protease - Callosobruchus maculatus (Southern cowpea weevil) (Pulse bruchid) Length = 326 Score = 44.8 bits (101), Expect = 9e-04 Identities = 19/53 (35%), Positives = 33/53 (62%) Frame = +2 Query: 143 ATVGAVSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 AT+ V+ V EEW FK++H K Y + +E+K R ++ +N +I +HN++ Sbjct: 8 ATLVLVAGASSVYEEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKK 60 >UniRef50_Q8I880 Cluster: Digestive cysteine protease intestain; n=1; Leptinotarsa decemlineata|Rep: Digestive cysteine protease intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 60 Score = 43.2 bits (97), Expect = 0.003 Identities = 17/42 (40%), Positives = 26/42 (61%) Frame = +2 Query: 173 LVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQ 298 +++E+W FK+ SK Y N +E+K R I+ N I +HNQ Sbjct: 18 VIREKWQNFKINFSKSYQNVVEEKGRFNIFLSNLLRIEEHNQ 59 >UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 317 Score = 41.5 bits (93), Expect = 0.008 Identities = 15/42 (35%), Positives = 27/42 (64%) Frame = +2 Query: 176 VKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 V ++W FK+ HSK Y + E++ R +++++N I +HN R Sbjct: 12 VHQQWAQFKVNHSKKYGHLKEEQVRFQVFSQNLQKIEQHNAR 53 >UniRef50_Q6A1I0 
Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsin L - Suberites domuncula (Sponge) Length = 324 Score = 41.5 bits (93), Expect = 0.008 Identities = 21/47 (44%), Positives = 27/47 (57%) Frame = +2 Query: 155 AVSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295 +V+ FD EEW +K EHSK Y E+E+ R I+ NK I HN Sbjct: 13 SVAAFDF-PEEWVAWKQEHSKEYTEELEELRRHTIWQSNKKFIDSHN 58 >UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis|Rep: Cathepsin L - Culicoides sonorensis Length = 331 Score = 41.5 bits (93), Expect = 0.008 Identities = 17/42 (40%), Positives = 25/42 (59%) Frame = +2 Query: 182 EEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQRRL 307 EEW FK+E++KVY E+ R I+ N ++ +HN R L Sbjct: 25 EEWKKFKLEYNKVYPLSTEENLRKGIFERNLADVMEHNARYL 66 >UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin L-like cysteine proteinase precursor - Acanthoscelides obtectus (Bean weevil) Length = 321 Score = 41.1 bits (92), Expect = 0.011 Identities = 15/41 (36%), Positives = 26/41 (63%) Frame = +2 Query: 179 KEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 +E+W FK++H + Y +E+K R +I+ N I +HN+R Sbjct: 20 QEKWQQFKIQHGRTYRTLLEEKRRFEIFKFNLRTIEEHNER 60 >UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like midgut cysteine proteinase - Tenebrio molitor (Yellow mealworm) Length = 330 Score = 40.7 bits (91), Expect = 0.014 Identities = 16/43 (37%), Positives = 27/43 (62%) Frame = +2 Query: 173 LVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 L +E+W+ FK+ H K Y + +E+ R I+ +N IA+HN + Sbjct: 23 LFQEQWSQFKLTHKKSYSSPIEEIRRQLIFKDNVAKIAEHNAK 65 >UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel protein - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 328 Score = 39.9 bits (89), Expect = 0.025 Identities = 13/38 (34%), Positives = 24/38 (63%) Frame = +2 Query: 185 EWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQ 298 +W T+K +H+K Y N E++ R ++ +N +I HN+ Sbjct: 26 
QWTTWKSQHNKTYRNTREERLRRSVWKQNLQDILLHNE 63 >UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor; n=1; Diabrotica virgifera virgifera|Rep: Cathepsin L-like proteinase" precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 315 Score = 39.9 bits (89), Expect = 0.025 Identities = 16/40 (40%), Positives = 25/40 (62%) Frame = +2 Query: 182 EEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 E+W +FK H+K Y N +EDK R ++ +N I +HN + Sbjct: 22 EKWTSFKATHNKSY-NVIEDKLRFAVFQDNLKKIEEHNAK 60 >UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; n=16; Chrysomelidae|Rep: Digestive cysteine protease intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 326 Score = 39.9 bits (89), Expect = 0.025 Identities = 16/41 (39%), Positives = 23/41 (56%) Frame = +2 Query: 179 KEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 +++W FK H K Y N +E+K R I+ N I +HN R Sbjct: 20 EDQWIAFKQTHGKTYKNLLEEKTRFGIFQRNLIKIKEHNAR 60 >UniRef50_UPI00015B5D85 Cluster: PREDICTED: similar to cathepsin L-like proteinase; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to cathepsin L-like proteinase - Nasonia vitripennis Length = 96 Score = 39.5 bits (88), Expect = 0.033 Identities = 15/41 (36%), Positives = 25/41 (60%) Frame = +2 Query: 173 LVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295 L +EW +K++ +K Y N E++ R KIY + K + +HN Sbjct: 18 LADDEWEQYKIKFNKKYANPEEEQRRYKIYLDTKKKVEEHN 58 >UniRef50_A2A5X5 Cluster: Ortholog of keratin associated protein 16-1 KRTAP16-1; n=7; Murinae|Rep: Ortholog of keratin associated protein 16-1 KRTAP16-1 - Mus musculus (Mouse) Length = 502 Score = 39.5 bits (88), Expect = 0.033 Identities = 17/35 (48%), Positives = 19/35 (54%) Frame = -2 Query: 305 GGAGCASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201 G +GC SQCC S +SSC C P C C PS Sbjct: 50 GSSGCGSQCCQPSCSVSSC--CQPVCCEATICEPS 82 >UniRef50_A1ZBK7 Cluster: CG10460-PA; n=1; Drosophila melanogaster|Rep: CG10460-PA - Drosophila melanogaster (Fruit fly) Length = 79 Score = 39.1 bits (87), Expect = 0.044 Identities = 
18/40 (45%), Positives = 26/40 (65%) Frame = +2 Query: 182 EEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 EEW +K + K Y+ E ED R +IYAE+K I +HN++ Sbjct: 7 EEWVEYKSKFDKNYEAE-EDLMRRRIYAESKARIEEHNRK 45 >UniRef50_UPI0000D566ED Cluster: PREDICTED: similar to CTLA-2-alpha protein precursor; n=1; Tribolium castaneum|Rep: PREDICTED: similar to CTLA-2-alpha protein precursor - Tribolium castaneum Length = 101 Score = 38.7 bits (86), Expect = 0.058 Identities = 14/42 (33%), Positives = 28/42 (66%) Frame = +2 Query: 176 VKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 V +++N FK ++ K Y + E+ FR +++A+N I +HN++ Sbjct: 25 VTQKFNEFKTKYGKTYADANEENFRKQLFAKNLEKIEEHNKK 66 >UniRef50_UPI0000D566EC Cluster: PREDICTED: similar to CG10460-PA; n=1; Tribolium castaneum|Rep: PREDICTED: similar to CG10460-PA - Tribolium castaneum Length = 80 Score = 38.7 bits (86), Expect = 0.058 Identities = 12/44 (27%), Positives = 26/44 (59%) Frame = +2 Query: 170 DLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 + ++E+WN FK ++ K Y + E+ +R ++ N + HN++ Sbjct: 8 EFIEEKWNEFKAKYRKNYTDAEEESYRKSLFVANLQMVESHNEK 51 >UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 392 Score = 38.7 bits (86), Expect = 0.058 Identities = 16/46 (34%), Positives = 28/46 (60%) Frame = +2 Query: 170 DLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQRRL 307 DLV ++++ F+ +H KVY+++ E + R I+ N I N+R L Sbjct: 82 DLVDDDFDEFRQQHDKVYEDDSEHRRRKHIFRHNVRYIRSMNRRSL 127 >UniRef50_Q9BYR4 Cluster: Keratin-associated protein 4-3; n=53; Mammalia|Rep: Keratin-associated protein 4-3 - Homo sapiens (Human) Length = 195 Score = 38.7 bits (86), Expect = 0.058 Identities = 18/31 (58%), Positives = 18/31 (58%) Frame = -2 Query: 293 CASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201 C S CC S ISSC C P CRT CC PS Sbjct: 45 CISSCCRPSCCISSC--CKPSCCRTTCCRPS 73 Score = 38.7 bits (86), Expect = 0.058 Identities = 18/31 (58%), Positives = 18/31 (58%) Frame = -2 Query: 293 CASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201 C S CC S ISSC C 
P CRT CC PS Sbjct: 75 CISSCCRPSCCISSC--CKPSCCRTTCCRPS 103 Score = 37.1 bits (82), Expect = 0.18 Identities = 17/31 (54%), Positives = 18/31 (58%) Frame = -2 Query: 293 CASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201 C S CC S ISSC C P C+T CC PS Sbjct: 105 CISSCCRPSCCISSC--CKPSCCQTTCCRPS 133 Score = 32.3 bits (70), Expect = 5.0 Identities = 16/30 (53%), Positives = 17/30 (56%) Frame = -2 Query: 299 AGCASQCCVCSQRISSCGTCPPFRCRTLCC 210 A C S CC S +SSC C PF C T CC Sbjct: 153 ACCISSCCHPSCCVSSC-RC-PFSCPTTCC 180 >UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain; n=9; Cucujiformia|Rep: Digestive cysteine proteinase intestain - Leptinotarsa decemlineata (Colorado potato beetle) Length = 326 Score = 38.3 bits (85), Expect = 0.076 Identities = 14/41 (34%), Positives = 23/41 (56%) Frame = +2 Query: 179 KEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 K++W FK H K Y + +E++ R I+ N I +HN + Sbjct: 20 KDQWVAFKQTHGKTYKSLLEERTRFGIFQSNLRKIEEHNAK 60 >UniRef50_Q23FR0 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 394 Score = 37.9 bits (84), Expect = 0.10 Identities = 19/48 (39%), Positives = 29/48 (60%) Frame = +2 Query: 158 VSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 VS DL+ +N ++ +H +VY NE E FR I+ EN + +HNQ+ Sbjct: 28 VSVKDLLT--YNQWRNKHQRVYLNEHEQLFRQLIFLENLAKVNEHNQK 73 >UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteinase B; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like tick cysteine proteinase B - Haemaphysalis longicornis (Bush tick) Length = 332 Score = 37.5 bits (83), Expect = 0.13 Identities = 22/53 (41%), Positives = 29/53 (54%), Gaps = 1/53 (1%) Frame = +2 Query: 146 TVGAVSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMK-IYAENKHNIAKHNQR 301 +V AVS +LV EW+ FK H K D + K IY EN+ IA+HN + Sbjct: 13 SVAAVSHQELVGAEWSAFKALHGK--DTSRKQKSTTGWIYMENRLKIARHNAK 63 >UniRef50_A7LFV3 Cluster: Cathepsin L protease inhibitor 2; n=1; Diaprepes 
abbreviatus|Rep: Cathepsin L protease inhibitor 2 - Diaprepes abbreviatus (Sugarcane rootstalk borer weevil) Length = 91 Score = 37.1 bits (82), Expect = 0.18 Identities = 14/41 (34%), Positives = 25/41 (60%) Frame = +2 Query: 179 KEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 +EEW FK ++ YD+ E+ R I+ +N +I +HN++ Sbjct: 14 QEEWEKFKTGFNRNYDSSDEEAKRFNIFQQNLQSIREHNEK 54 >UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_26, whole genome shotgun sequence - Paramecium tetraurelia Length = 358 Score = 37.1 bits (82), Expect = 0.18 Identities = 15/36 (41%), Positives = 22/36 (61%) Frame = +2 Query: 188 WNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295 + + +E+ K YDN+ RM+I+ NK NI KHN Sbjct: 43 FRNWMLEYGKSYDNDFTAIHRMQIFMRNKKNIEKHN 78 >UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Longidorus elongatus Length = 358 Score = 36.7 bits (81), Expect = 0.23 Identities = 13/36 (36%), Positives = 22/36 (61%) Frame = +2 Query: 188 WNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295 W FK++H+K Y + E+ R +++A N I +HN Sbjct: 43 WTNFKLKHAKSYKTKDEELLRFQVFASNHKVIEQHN 78 >UniRef50_Q22LI1 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 987 Score = 36.7 bits (81), Expect = 0.23 Identities = 16/37 (43%), Positives = 26/37 (70%) Frame = +2 Query: 185 EWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295 E+N + +H+KV+D E + K+R+ I+AEN I +HN Sbjct: 30 EFNKWSAKHNKVFDPE-QLKYRLSIFAENYKKIKEHN 65 >UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin S preproprotein; n=2; Tribolium castaneum|Rep: PREDICTED: similar to cathepsin S preproprotein - Tribolium castaneum Length = 525 Score = 36.3 bits (80), Expect = 0.31 Identities = 14/42 (33%), Positives = 23/42 (54%) Frame = +2 Query: 176 
VKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 + +EW FK ++ + Y N E+ FR I+ + I HN+R Sbjct: 221 LNKEWENFKRKYERRYPNLEEENFRRAIFEKTFQEIKHHNER 262 >UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cathepsin L; n=4; Danio rerio|Rep: Novel protein similar to vertebrate cathepsin L - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 334 Score = 36.3 bits (80), Expect = 0.31 Identities = 16/37 (43%), Positives = 20/37 (54%) Frame = +2 Query: 185 EWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295 EWN +K +H YD E ED R I+ N I K+N Sbjct: 25 EWNLWKKKHEISYDEESEDVHRKTIWETNMQKIWKNN 61 >UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm) Length = 326 Score = 36.3 bits (80), Expect = 0.31 Identities = 15/41 (36%), Positives = 24/41 (58%) Frame = +2 Query: 179 KEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 KEEW FK+ ++K Y N +E++ R I+ + I HN + Sbjct: 20 KEEWVQFKVRNNKSYRNYIEEQKRFTIFQGSLRKIENHNDK 60 >UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio molitor (Yellow mealworm) Length = 336 Score = 36.3 bits (80), Expect = 0.31 Identities = 13/42 (30%), Positives = 24/42 (57%) Frame = +2 Query: 176 VKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 V E+W FK +++ Y N E+ FR +I+ + +HN++ Sbjct: 23 VAEKWENFKTTYARSYVNAKEETFRKQIFQKKLETFEEHNEK 64 >UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; n=6; Chrysomelinae|Rep: Cathepsin L-like proteinase precursor - Phaedon cochleariae (Mustard beetle) Length = 324 Score = 36.3 bits (80), Expect = 0.31 Identities = 14/39 (35%), Positives = 22/39 (56%) Frame = +2 Query: 179 KEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295 +E W FK H++ Y + E+K R I+ + IA+HN Sbjct: 20 QELWADFKKTHARTYKSLREEKLRFNIFQDTLRQIAEHN 58 >UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae|Rep: Cysteine proteinase - Hypera postica (alfalfa weevil) Length = 324 Score = 35.9 bits 
(79), Expect = 0.41 Identities = 14/37 (37%), Positives = 21/37 (56%) Frame = +2 Query: 185 EWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295 ++ FK+EH K Y N+ E+ R I+ +N I HN Sbjct: 25 KFQAFKLEHGKTYLNQAEESKRFNIFTDNVRAIEAHN 61 >UniRef50_Q248G1 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 334 Score = 35.9 bits (79), Expect = 0.41 Identities = 13/38 (34%), Positives = 23/38 (60%) Frame = +2 Query: 188 WNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 +N ++ + +VY++E E FR I+ ENK + HN + Sbjct: 36 YNLWRQNNGRVYNSEEEQFFRQLIFVENKRQVDSHNSQ 73 >UniRef50_Q23H06 Cluster: Papain family cysteine protease containing protein; n=18; Tetrahymena thermophila|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 349 Score = 35.9 bits (79), Expect = 0.41 Identities = 15/36 (41%), Positives = 20/36 (55%) Frame = +2 Query: 188 WNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295 +N + EH +VY NE E FR ++ EN I HN Sbjct: 29 YNKWSSEHQRVYLNEHEKLFRQMVFFENLQKIQDHN 64 >UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep: Cathepsin - Petromyzon marinus (Sea lamprey) Length = 333 Score = 35.5 bits (78), Expect = 0.54 Identities = 12/37 (32%), Positives = 22/37 (59%) Frame = +2 Query: 185 EWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295 +W+T+K + K Y +E ED R ++ +N + +HN Sbjct: 26 QWDTWKSTYGKHYGSEQEDAHRRDVFEQNLKRVLQHN 62 >UniRef50_Q17641 Cluster: Putative uncharacterized protein; n=11; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 197 Score = 35.5 bits (78), Expect = 0.54 Identities = 18/41 (43%), Positives = 20/41 (48%), Gaps = 4/41 (9%) Frame = -2 Query: 314 GIGGGAGCASQ----CCVCSQRISSCGTCPPFRCRTLCCAP 204 G GGG GC CC C + + C TC RC T CC P Sbjct: 79 GGGGGCGCCCCRPRCCCCCRRCCTCCRTCCCTRCCT-CCRP 118 Score = 33.5 bits (73), Expect = 2.2 Identities = 16/44 (36%), Positives = 18/44 (40%) Frame = -2 Query: 317 
EGIGGGAGCASQCCVCSQRISSCGTCPPFRCRTLCCAPS*KCST 186 +G G GC CC C CG C CR CC +C T Sbjct: 62 QGGCGCCGCGCGCCGCGGGGGGCGCC---CCRPRCCCCCRRCCT 102 >UniRef50_Q9BYP8 Cluster: Keratin-associated protein 17-1; n=28; Coelomata|Rep: Keratin-associated protein 17-1 - Homo sapiens (Human) Length = 105 Score = 35.5 bits (78), Expect = 0.54 Identities = 18/43 (41%), Positives = 20/43 (46%) Frame = -2 Query: 314 GIGGGAGCASQCCVCSQRISSCGTCPPFRCRTLCCAPS*KCST 186 G GG GC CC S SSC C C +CC P+ C T Sbjct: 64 GCGGCGGCGGGCCGSSCCGSSC--CGSGCCGPVCCQPTPICDT 104 >UniRef50_Q94715 Cluster: Putative cathepsin L2 precursor; n=4; Paramecium tetraurelia|Rep: Putative cathepsin L2 precursor - Paramecium tetraurelia Length = 294 Score = 35.5 bits (78), Expect = 0.54 Identities = 18/49 (36%), Positives = 31/49 (63%), Gaps = 3/49 (6%) Frame = +2 Query: 164 FFDLVKEEWNTFK---MEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 ++ L +++ N F+ ++++K Y E E +RM+IY NK I +HNQR Sbjct: 4 YYHLQEDDTNDFERWALKNNKFY-TESEKLYRMEIYNSNKRMIEEHNQR 51 >UniRef50_Q14C04 Cluster: Keratin associated protein 4-7; n=17; Mammalia|Rep: Keratin associated protein 4-7 - Mus musculus (Mouse) Length = 168 Score = 35.1 bits (77), Expect = 0.71 Identities = 14/32 (43%), Positives = 20/32 (62%) Frame = -2 Query: 296 GCASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201 GC+ CC S +SSC C P C+++CC P+ Sbjct: 14 GCSQGCCQPSCCVSSC--CRPQCCQSVCCQPT 43 Score = 33.1 bits (72), Expect = 2.9 Identities = 16/31 (51%), Positives = 16/31 (51%) Frame = -2 Query: 293 CASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201 C CC S ISSC C P CR CC PS Sbjct: 40 CQPTCCRPSCCISSC--CRPSCCRPSCCRPS 68 Score = 31.5 bits (68), Expect = 8.8 Identities = 14/30 (46%), Positives = 16/30 (53%), Gaps = 2/30 (6%) Frame = -2 Query: 287 SQCC--VCSQRISSCGTCPPFRCRTLCCAP 204 S CC VCS+ S G C P C + CC P Sbjct: 3 SSCCGSVCSEEGCSQGCCQPSCCVSSCCRP 32 >UniRef50_Q24FA8 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 
Length = 335 Score = 35.1 bits (77), Expect = 0.71 Identities = 12/37 (32%), Positives = 24/37 (64%) Frame = +2 Query: 188 WNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQ 298 +N ++ E+ KVY +E E +R ++ EN ++ +HN+ Sbjct: 32 YNKWREENGKVYSSEAEKIYRQSVFLENYQSVQEHNK 68 >UniRef50_A7LFV2 Cluster: Cathepsin L protease inhibitor 1; n=1; Diaprepes abbreviatus|Rep: Cathepsin L protease inhibitor 1 - Diaprepes abbreviatus (Sugarcane rootstalk borer weevil) Length = 109 Score = 35.1 bits (77), Expect = 0.71 Identities = 13/42 (30%), Positives = 25/42 (59%) Frame = +2 Query: 176 VKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 V+E WN FK + ++ Y++ E+ R +I+ N +I H ++ Sbjct: 31 VEEHWNNFKTKFNRNYESPEEESKRFEIFKNNLKDIQAHQKK 72 >UniRef50_Q231X3 Cluster: Papain family cysteine protease containing protein; n=3; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 323 Score = 34.7 bits (76), Expect = 0.94 Identities = 13/47 (27%), Positives = 31/47 (65%) Frame = +2 Query: 158 VSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQ 298 ++ + V + W+ +K +H+K Y+N + +R++++AEN + K++Q Sbjct: 29 ITIDESVTKIWSQWKQKHNKRYENTDYESYRLEVFAENL-EVVKNDQ 74 >UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicatein a3 - Lubomirskia baicalensis Length = 344 Score = 34.7 bits (76), Expect = 0.94 Identities = 12/38 (31%), Positives = 23/38 (60%) Frame = +2 Query: 182 EEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295 +EW+ +K H + Y++++++ R I+ NK I HN Sbjct: 42 QEWSVWKGHHQRSYESQLQEMERHSIWVANKKYIEHHN 79 >UniRef50_UPI0001553357 Cluster: PREDICTED: similar to novel member of the keratin associated protein 4 (Krtap4) family; n=1; Mus musculus|Rep: PREDICTED: similar to novel member of the keratin associated protein 4 (Krtap4) family - Mus musculus Length = 292 Score = 34.3 bits (75), Expect = 1.2 Identities = 16/40 (40%), Positives = 23/40 (57%) Frame = -2 Query: 320 DEGIGGGAGCASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201 +EG G C + CC S +SSC C P C+++CC P+ Sbjct: 12 
EEGCGQSC-CQTTCCRPSCCVSSC--CRPQCCQSVCCQPT 48 >UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 360 Score = 34.3 bits (75), Expect = 1.2 Identities = 10/41 (24%), Positives = 26/41 (63%) Frame = +2 Query: 176 VKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQ 298 ++ + FK++++K Y ++ E+++R ++ N I +HN+ Sbjct: 41 IERAFKNFKVKYAKTYKDDTEEQYRFSVFTNNYVEIYRHNK 81 >UniRef50_Q86MN0 Cluster: Cathepsin L-like cysteine protease; n=1; Panagrolaimus davidi|Rep: Cathepsin L-like cysteine protease - Panagrolaimus davidi Length = 56 Score = 34.3 bits (75), Expect = 1.2 Identities = 14/40 (35%), Positives = 22/40 (55%) Frame = +2 Query: 182 EEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 +EW +K + K Y +E +K R +IY +N HN+R Sbjct: 9 KEWQDYKQKFDKSYPDEETEKQRYQIYKKNVEENETHNKR 48 >UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, whole genome shotgun sequence; n=3; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_2, whole genome shotgun sequence - Paramecium tetraurelia Length = 376 Score = 34.3 bits (75), Expect = 1.2 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 3/50 (6%) Frame = +2 Query: 161 SFFDLVKEEWNTFK---MEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 S + +EE FK E+ K Y+NE E +RM+++ +N + HN++ Sbjct: 32 SIYGWREEEQKQFKNWVQENQKTYNNEFEMIYRMEVFVKNYRTMKHHNEQ 81 >UniRef50_Q86UY7 Cluster: Similar to RIKEN cDNA 2310002J15 gene; n=12; Eutheria|Rep: Similar to RIKEN cDNA 2310002J15 gene - Homo sapiens (Human) Length = 144 Score = 34.3 bits (75), Expect = 1.2 Identities = 15/35 (42%), Positives = 17/35 (48%), Gaps = 1/35 (2%) Frame = -2 Query: 311 IGGGAGCASQCCVCSQRISSCGTCPPF-RCRTLCC 210 +G G A CC C C CPPF RC + CC Sbjct: 108 VGRGDDIAHHCCCCP--CCHCCHCPPFCRCHSCCC 140 >UniRef50_UPI0001555AB0 Cluster: PREDICTED: hypothetical protein; n=4; Ornithorhynchus anatinus|Rep: PREDICTED: hypothetical protein - Ornithorhynchus anatinus 
Length = 288 Score = 33.9 bits (74), Expect = 1.6 Identities = 13/31 (41%), Positives = 15/31 (48%) Frame = -2 Query: 302 GAGCASQCCVCSQRISSCGTCPPFRCRTLCC 210 G GC + C C S CPP C+T CC Sbjct: 16 GRGCCQETC-CEPSCCSSPCCPPTCCQTTCC 45 >UniRef50_UPI0000E22843 Cluster: PREDICTED: hypothetical protein; n=1; Pan troglodytes|Rep: PREDICTED: hypothetical protein - Pan troglodytes Length = 304 Score = 33.9 bits (74), Expect = 1.6 Identities = 14/41 (34%), Positives = 19/41 (46%), Gaps = 3/41 (7%) Frame = -2 Query: 299 AGCASQCC---VCSQRISSCGTCPPFRCRTLCCAPS*KCST 186 +GC S CC C C P+ C++ CC P CS+ Sbjct: 234 SGCGSSCCQSSCCKPYCCQSSCCKPYCCQSSCCKPC-SCSS 273 >UniRef50_UPI000155F1D6 Cluster: PREDICTED: similar to keratin associated protein 9.3; n=1; Equus caballus|Rep: PREDICTED: similar to keratin associated protein 9.3 - Equus caballus Length = 302 Score = 33.5 bits (73), Expect = 2.2 Identities = 13/33 (39%), Positives = 17/33 (51%) Frame = -2 Query: 299 AGCASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201 + C Q C S C CPP C+T+CC P+ Sbjct: 127 SSCCGQTCSRSSCCQPC--CPPACCQTICCQPA 157 >UniRef50_UPI000155BC4F Cluster: PREDICTED: hypothetical protein, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: hypothetical protein, partial - Ornithorhynchus anatinus Length = 309 Score = 33.5 bits (73), Expect = 2.2 Identities = 13/31 (41%), Positives = 15/31 (48%) Frame = -2 Query: 302 GAGCASQCCVCSQRISSCGTCPPFRCRTLCC 210 G GC + C C S CPP C+T CC Sbjct: 16 GRGCCQETC-CQPGCCSSPCCPPTCCQTTCC 45 Score = 33.1 bits (72), Expect = 2.9 Identities = 12/31 (38%), Positives = 19/31 (61%) Frame = -2 Query: 293 CASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201 C + CCV S +C C P+ C+++CC P+ Sbjct: 277 CQATCCVTSCCRPTC--CSPYCCQSVCCQPT 305 >UniRef50_UPI000155483C Cluster: PREDICTED: similar to keratin associated protein; n=8; Ornithorhynchus anatinus|Rep: PREDICTED: similar to keratin associated protein - Ornithorhynchus anatinus Length = 399 Score = 33.5 bits (73), Expect = 2.2 Identities = 13/36 (36%), Positives = 19/36 (52%) Frame = -2 Query: 308 
GGGAGCASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201 G AGC S C + +++ G+C P C+ CC S Sbjct: 222 GAPAGCQSSCGPSTCQLACTGSCSPSCCQDSCCQQS 257 >UniRef50_Q24E33 Cluster: Papain family cysteine protease containing protein; n=1; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 328 Score = 33.5 bits (73), Expect = 2.2 Identities = 14/40 (35%), Positives = 22/40 (55%) Frame = +2 Query: 182 EEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 E + FK+EH+ V+ N ED +R I+ +N I N + Sbjct: 40 EMYAEFKLEHNIVFQNSEEDLYRQNIFFQNVRYIQSENAK 79 >UniRef50_A0CAI9 Cluster: Chromosome undetermined scaffold_161, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_161, whole genome shotgun sequence - Paramecium tetraurelia Length = 3076 Score = 33.5 bits (73), Expect = 2.2 Identities = 12/24 (50%), Positives = 15/24 (62%) Frame = -2 Query: 293 CASQCCVCSQRISSCGTCPPFRCR 222 C+ +C CSQR C +CPPF R Sbjct: 957 CSYRCKTCSQREEQCLSCPPFSLR 980 >UniRef50_Q9BYR0 Cluster: Keratin-associated protein 4-7; n=149; Eukaryota|Rep: Keratin-associated protein 4-7 - Homo sapiens (Human) Length = 210 Score = 33.5 bits (73), Expect = 2.2 Identities = 14/31 (45%), Positives = 19/31 (61%) Frame = -2 Query: 293 CASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201 C S CC S +SSC C P C+++CC P+ Sbjct: 80 CISSCCRPSCCMSSC--CKPQCCQSVCCQPT 108 Score = 31.9 bits (69), Expect = 6.6 Identities = 13/31 (41%), Positives = 18/31 (58%) Frame = -2 Query: 293 CASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201 C S CC S +S C C P C+++CC P+ Sbjct: 115 CISSCCRPSCCVSRC--CRPQCCQSVCCQPT 143 >UniRef50_Q207N1 Cluster: Cathepsin S; n=2; Clupeocephala|Rep: Cathepsin S - Ictalurus punctatus (Channel catfish) Length = 84 Score = 33.1 bits (72), Expect = 2.9 Identities = 14/36 (38%), Positives = 20/36 (55%) Frame = +2 Query: 188 WNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295 W +K HSK Y +E+E+ R +I+ N I HN Sbjct: 26 WLMWKKNHSKTYTSELEELGRREIWERNLRLITVHN 61 >UniRef50_A2A4R5 Cluster: Novel member of 
the keratin associated protein 4 (Krtap4) family; n=10; Theria|Rep: Novel member of the keratin associated protein 4 (Krtap4) family - Mus musculus (Mouse) Length = 167 Score = 33.1 bits (72), Expect = 2.9 Identities = 14/34 (41%), Positives = 19/34 (55%), Gaps = 3/34 (8%) Frame = -2 Query: 293 CASQCCVCSQRISSC---GTCPPFRCRTLCCAPS 201 C S CC S +SSC C P C+++CC P+ Sbjct: 39 CVSSCCRPSCCVSSCCRPSCCRPQCCQSVCCQPT 72 >UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep: Silicatein beta - Suberites domuncula (Sponge) Length = 383 Score = 33.1 bits (72), Expect = 2.9 Identities = 13/40 (32%), Positives = 22/40 (55%) Frame = +2 Query: 179 KEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQ 298 +E+W + +H KVY + E + ++ NK I +HNQ Sbjct: 53 EEDWKQWTTDHHKVYSDVRERVDKYTVWRANKEYIDQHNQ 92 >UniRef50_Q8IUG1 Cluster: Keratin-associated protein 1-3; n=65; Mammalia|Rep: Keratin-associated protein 1-3 - Homo sapiens (Human) Length = 177 Score = 33.1 bits (72), Expect = 2.9 Identities = 15/31 (48%), Positives = 16/31 (51%) Frame = -2 Query: 293 CASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201 C S CC S +SC C P C T CC PS Sbjct: 20 CGSSCCQPSCCETSC--CQPSCCETSCCQPS 48 >UniRef50_UPI0001554838 Cluster: PREDICTED: similar to solute carrier family 5 (sodium/glucose cotransporter), member 9; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to solute carrier family 5 (sodium/glucose cotransporter), member 9 - Ornithorhynchus anatinus Length = 300 Score = 32.7 bits (71), Expect = 3.8 Identities = 13/30 (43%), Positives = 15/30 (50%) Frame = -2 Query: 293 CASQCCVCSQRISSCGTCPPFRCRTLCCAP 204 C S CC S R+ C C P C +CC P Sbjct: 5 CCSPCCRPSGRVPVC--CKPVCCEPVCCKP 32 Score = 32.3 bits (70), Expect = 5.0 Identities = 15/39 (38%), Positives = 19/39 (48%), Gaps = 4/39 (10%) Frame = -2 Query: 308 GGGAGCASQCCVCSQ--RISSCGT--CPPFRCRTLCCAP 204 G + C + CC CS R S C C P C+ +CC P Sbjct: 143 GRDSCCGTVCCCCSPCCRPSGCVPVCCEPVCCKPVCCVP 181 Score = 31.5 bits (68), Expect = 8.8 Identities = 13/32 (40%), Positives = 15/32 (46%) Frame = -2 
Query: 299 AGCASQCCVCSQRISSCGTCPPFRCRTLCCAP 204 A C S CC S + C C P C +CC P Sbjct: 58 ASCCSPCCRPSGCVPVC--CKPVCCEPVCCVP 87 >UniRef50_UPI00006CB160 Cluster: hypothetical protein TTHERM_00298410; n=1; Tetrahymena thermophila SB210|Rep: hypothetical protein TTHERM_00298410 - Tetrahymena thermophila SB210 Length = 1366 Score = 32.7 bits (71), Expect = 3.8 Identities = 14/39 (35%), Positives = 16/39 (41%) Frame = -2 Query: 320 DEGIGGGAGCASQCCVCSQRISSCGTCPPFRCRTLCCAP 204 D G CA +C CS C TC P R + C P Sbjct: 1064 DTGAANCQQCAPKCATCSTSSVQCLTCAPGRINSDCSCP 1102 >UniRef50_UPI0000499201 Cluster: RIO1 family protein; n=2; Entamoeba histolytica HM-1:IMSS|Rep: RIO1 family protein - Entamoeba histolytica HM-1:IMSS Length = 474 Score = 32.7 bits (71), Expect = 3.8 Identities = 14/41 (34%), Positives = 22/41 (53%) Frame = +2 Query: 179 KEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 KEE + EH K Y E +K + K+ + K + KH++R Sbjct: 430 KEEEKKLRKEHKKQYKIERREKLKHKMPKKKKEQLIKHSKR 470 >UniRef50_Q70SU8 Cluster: Cystein proteinase inhibitor protein precursor; n=4; Salmonidae|Rep: Cystein proteinase inhibitor protein precursor - Salmo salar (Atlantic salmon) Length = 342 Score = 32.7 bits (71), Expect = 3.8 Identities = 12/42 (28%), Positives = 27/42 (64%) Frame = +2 Query: 176 VKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 V +E+ T+K+++ K Y + +E+ R +I+ + + +HN+R Sbjct: 270 VHKEFETWKVKYGKTYPSTVEEAKRKEIWLATRKMVMEHNKR 311 Score = 32.3 bits (70), Expect = 5.0 Identities = 13/42 (30%), Positives = 25/42 (59%) Frame = +2 Query: 176 VKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 V +E+ T+K++H K Y + E+ R I+ + + +HN+R Sbjct: 193 VDKEFETWKVQHGKNYGSTEEEAKRKGIWLATRTRVMEHNKR 234 >UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n=21; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Globodera pallida Length = 379 Score = 32.7 bits (71), Expect = 3.8 Identities = 15/39 (38%), Positives = 21/39 (53%), Gaps = 1/39 (2%) Frame = +2 Query: 185 EWNTFKMEHS-KVYDNEMEDKFRMKIYAENKHNIAKHNQ 298 +WN +K +H K Y ++ + 
RM Y K I KHNQ Sbjct: 69 DWNAYKQKHGRKAYADQDVENERMLTYLSAKQFIDKHNQ 107 >UniRef50_Q23VA1 Cluster: Papain family cysteine protease containing protein; n=2; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 397 Score = 32.7 bits (71), Expect = 3.8 Identities = 17/50 (34%), Positives = 26/50 (52%) Frame = +2 Query: 146 TVGAVSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295 ++ +V DL+ ++N +K +H K Y N E FR IY N +HN Sbjct: 28 SLSSVQIKDLL--DFNKWKYQHGKKYFNADEANFRQLIYLMNLQKFNEHN 75 >UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheirus salmonis|Rep: Cysteine proteinase - Lepeophtheirus salmonis (salmon louse) Length = 372 Score = 32.7 bits (71), Expect = 3.8 Identities = 12/38 (31%), Positives = 22/38 (57%) Frame = +2 Query: 182 EEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295 +E+ +F E+SK Y N ++K++ +N I +HN Sbjct: 25 QEFESFVKEYSKSYHNRALRSLKLKVFVDNLREIEEHN 62 >UniRef50_Q7YWV7 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 212 Score = 32.3 bits (70), Expect = 5.0 Identities = 16/42 (38%), Positives = 16/42 (38%), Gaps = 3/42 (7%) Frame = -2 Query: 308 GGGAGCAS---QCCVCSQRISSCGTCPPFRCRTLCCAPS*KC 192 GGG GC CC C R C CRT CC C Sbjct: 74 GGGCGCCGCGCGCCCCRPRCCCCCRRCCTCCRTCCCTRCCTC 115 Score = 31.9 bits (69), Expect = 6.6 Identities = 16/39 (41%), Positives = 18/39 (46%), Gaps = 2/39 (5%) Frame = -2 Query: 314 GIGGGAGCASQ--CCVCSQRISSCGTCPPFRCRTLCCAP 204 G G G C CC C + + C TC RC T CC P Sbjct: 81 GCGCGCCCCRPRCCCCCRRCCTCCRTCCCTRCCT-CCRP 118 >UniRef50_Q23H10 Cluster: Papain family cysteine protease containing protein; n=14; Tetrahymena thermophila SB210|Rep: Papain family cysteine protease containing protein - Tetrahymena thermophila SB210 Length = 336 Score = 32.3 bits (70), Expect = 5.0 Identities = 13/36 (36%), Positives = 21/36 (58%) Frame = +2 Query: 188 WNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295 +N + ++ +VY NE E FR ++ EN 
I +HN Sbjct: 29 YNQWSSQNQRVYLNEHEKLFRQMVFFENFQKIQEHN 64 >UniRef50_A0EI50 Cluster: Chromosome undetermined scaffold_98, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_98, whole genome shotgun sequence - Paramecium tetraurelia Length = 336 Score = 32.3 bits (70), Expect = 5.0 Identities = 13/40 (32%), Positives = 24/40 (60%) Frame = +2 Query: 182 EEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 +E++ +K H K+Y +ED +R +I+ +N + HN R Sbjct: 27 DEYSKWKQHHQKLYQG-VEDTYRKQIFHQNLQIVNDHNAR 65 >UniRef50_A0BNM1 Cluster: Chromosome undetermined scaffold_119, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_119, whole genome shotgun sequence - Paramecium tetraurelia Length = 341 Score = 32.3 bits (70), Expect = 5.0 Identities = 14/39 (35%), Positives = 24/39 (61%) Frame = +2 Query: 185 EWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301 ++ + ++H K Y + E K+R IY +NK I +HN+R Sbjct: 28 DFERWALKHGKHYFGD-EKKYRQAIYFQNKQMIEEHNKR 65 >UniRef50_P60410 Cluster: Keratin-associated protein 10-8; n=12; Eutheria|Rep: Keratin-associated protein 10-8 - Homo sapiens (Human) Length = 259 Score = 32.3 bits (70), Expect = 5.0 Identities = 12/32 (37%), Positives = 16/32 (50%) Frame = -2 Query: 299 AGCASQCCVCSQRISSCGTCPPFRCRTLCCAP 204 + C S CC S +C C P C+ +CC P Sbjct: 149 SSCQSACCTFSPCQQAC--CVPICCKPICCVP 178 >UniRef50_P60372 Cluster: Keratin-associated protein 10-4; n=18; Eutheria|Rep: Keratin-associated protein 10-4 - Homo sapiens (Human) Length = 401 Score = 32.3 bits (70), Expect = 5.0 Identities = 12/32 (37%), Positives = 16/32 (50%) Frame = -2 Query: 299 AGCASQCCVCSQRISSCGTCPPFRCRTLCCAP 204 + C CC S +C C P C+T+CC P Sbjct: 212 SSCQPACCTSSSCQQAC--CVPVCCKTVCCKP 241 Score = 31.5 bits (68), Expect = 8.8 Identities = 12/32 (37%), Positives = 16/32 (50%) Frame = -2 Query: 299 AGCASQCCVCSQRISSCGTCPPFRCRTLCCAP 204 + C CC S +C C P C+T+CC P Sbjct: 93 SSCQLACCASSPCQQAC--CVPVCCKTVCCKP 122 >UniRef50_P25779 Cluster: Cruzipain 
precursor; n=54; Trypanosoma|Rep: Cruzipain precursor - Trypanosoma cruzi Length = 467 Score = 32.3 bits (70), Expect = 5.0 Identities = 10/34 (29%), Positives = 22/34 (64%) Frame = +2 Query: 170 DLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAEN 271 + + ++ FK +H +VY++ E+ FR+ ++ EN Sbjct: 32 ETLTSQFAEFKQKHGRVYESAAEEAFRLSVFREN 65 >UniRef50_Q9BXQ6 Cluster: Cat eye syndrome critical region protein 6; n=11; Mammalia|Rep: Cat eye syndrome critical region protein 6 - Homo sapiens (Human) Length = 578 Score = 32.3 bits (70), Expect = 5.0 Identities = 17/41 (41%), Positives = 18/41 (43%) Frame = -2 Query: 314 GIGGGAGCASQCCVCSQRISSCGTCPPFRCRTLCCAPS*KC 192 G G GA C CC C C P R R CAPS +C Sbjct: 161 GTGSGASCCPCCCCC-----GCPDRPGRRGRRRGCAPSPRC 196 >UniRef50_UPI0001555234 Cluster: PREDICTED: similar to hCG2041354; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to hCG2041354 - Ornithorhynchus anatinus Length = 121 Score = 31.9 bits (69), Expect = 6.6 Identities = 16/37 (43%), Positives = 16/37 (43%), Gaps = 1/37 (2%) Frame = -2 Query: 308 GGGAGCASQCCVCSQRISSCGTCPPF-RCRTLCCAPS 201 G G A CC C SC CPP RC CC S Sbjct: 87 GPGDDIAHNCCCCP--CCSCCHCPPCCRCHPCCCVVS 121 >UniRef50_Q4MW49 Cluster: Putative uncharacterized protein; n=1; Bacillus cereus G9241|Rep: Putative uncharacterized protein - Bacillus cereus G9241 Length = 165 Score = 31.9 bits (69), Expect = 6.6 Identities = 13/26 (50%), Positives = 13/26 (50%) Frame = -2 Query: 308 GGGAGCASQCCVCSQRISSCGTCPPF 231 G GAGC C VCS C TC F Sbjct: 31 GCGAGCCGSCFVCSCWTGCCATCCSF 56 >UniRef50_A1Z9I0 Cluster: CG6357-PA; n=3; Drosophila melanogaster|Rep: CG6357-PA - Drosophila melanogaster (Fruit fly) Length = 439 Score = 31.9 bits (69), Expect = 6.6 Identities = 15/50 (30%), Positives = 23/50 (46%) Frame = +2 Query: 146 TVGAVSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295 T G F + + W F ++ YDN+ E + R I+ EN + HN Sbjct: 58 TSGLSEFEEECQFAWQRFLVDFDVHYDNDYERQKRRDIFCENWQKVRDHN 107 >UniRef50_Q4PEV5 Cluster: Putative uncharacterized protein; n=1; Ustilago maydis|Rep: 
Putative uncharacterized protein - Ustilago maydis (Smut fungus) Length = 531 Score = 31.9 bits (69), Expect = 6.6 Identities = 14/30 (46%), Positives = 18/30 (60%) Frame = -2 Query: 290 ASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201 AS+C + S +S+ CPP RC TL A S Sbjct: 2 ASRCSLRSSWLSAVAVCPPQRCSTLTIAAS 31 >UniRef50_Q6L8H1 Cluster: Keratin-associated protein 5-4; n=160; Fungi/Metazoa group|Rep: Keratin-associated protein 5-4 - Homo sapiens (Human) Length = 288 Score = 31.9 bits (69), Expect = 6.6 Identities = 14/32 (43%), Positives = 18/32 (56%) Frame = -2 Query: 299 AGCASQCCVCSQRISSCGTCPPFRCRTLCCAP 204 +GC S CC SSC C P+ C++ CC P Sbjct: 228 SGCGSSCCQ-----SSC--CKPYCCQSSCCKP 252 >UniRef50_P60412 Cluster: Keratin-associated protein 10-11; n=80; Eutheria|Rep: Keratin-associated protein 10-11 - Homo sapiens (Human) Length = 298 Score = 31.9 bits (69), Expect = 6.6 Identities = 12/32 (37%), Positives = 16/32 (50%) Frame = -2 Query: 299 AGCASQCCVCSQRISSCGTCPPFRCRTLCCAP 204 + C CC S +C C P C+T+CC P Sbjct: 83 SSCQPACCTSSPCQQAC--CVPVCCKTVCCKP 112 >UniRef50_P60368 Cluster: Keratin-associated protein 10-2; n=64; Coelomata|Rep: Keratin-associated protein 10-2 - Homo sapiens (Human) Length = 255 Score = 31.9 bits (69), Expect = 6.6 Identities = 16/36 (44%), Positives = 18/36 (50%), Gaps = 5/36 (13%) Frame = -2 Query: 293 CASQCCV--CSQRISSC---GTCPPFRCRTLCCAPS 201 C S CCV CS S C +C P C + CC PS Sbjct: 193 CKSICCVPVCSGASSPCCQQSSCQPACCTSSCCRPS 228 >UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin heavy chain; n=3; Amniota|Rep: PREDICTED: similar to ferritin heavy chain - Ornithorhynchus anatinus Length = 338 Score = 31.5 bits (68), Expect = 8.8 Identities = 14/39 (35%), Positives = 21/39 (53%) Frame = +2 Query: 182 EEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQ 298 E W +K+ H K Y E E+ FR + +N I +HN+ Sbjct: 26 EGWWRWKVLHGKNYSVEAEEVFRRAAWEKNVRVIERHNE 64 >UniRef50_UPI0001552953 Cluster: PREDICTED: hypothetical protein; n=1; Mus musculus|Rep: PREDICTED: hypothetical protein - Mus musculus Length = 
205 Score = 31.5 bits (68), Expect = 8.8 Identities = 15/37 (40%), Positives = 17/37 (45%), Gaps = 2/37 (5%) Frame = -2 Query: 314 GIGGGAGCASQC--CVCSQRISSCGTCPPFRCRTLCC 210 G GG GC+S C C C CG C R +CC Sbjct: 55 GCGGCGGCSSCCGGCGCGGCGGCCGCCGCCRPTVVCC 91 >UniRef50_UPI0000EBE77C Cluster: PREDICTED: hypothetical protein; n=7; Theria|Rep: PREDICTED: hypothetical protein - Bos taurus Length = 372 Score = 31.5 bits (68), Expect = 8.8 Identities = 12/31 (38%), Positives = 14/31 (45%) Frame = -2 Query: 296 GCASQCCVCSQRISSCGTCPPFRCRTLCCAP 204 GC S C C CG+C C + CC P Sbjct: 208 GCGSSCGGCGSSCGGCGSCG--GCGSSCCVP 236 >UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4; core eudicotyledons|Rep: Papain-like cysteine peptidase XBCP3 - Arabidopsis thaliana (Mouse-ear cress) Length = 437 Score = 31.5 bits (68), Expect = 8.8 Identities = 14/49 (28%), Positives = 28/49 (57%) Frame = +2 Query: 149 VGAVSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295 V + S D + E ++ + +H K Y +E E + R++I+ +N + +HN Sbjct: 19 VSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHN 67 >UniRef50_Q7RH27 Cluster: Transmembrane amino acid transporter protein, putative; n=6; Plasmodium|Rep: Transmembrane amino acid transporter protein, putative - Plasmodium yoelii yoelii Length = 645 Score = 31.5 bits (68), Expect = 8.8 Identities = 15/38 (39%), Positives = 23/38 (60%) Frame = +2 Query: 203 MEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQRRLLSP 316 ++H K D E+ +K + KIY EN+ N K ++R SP Sbjct: 148 IDHIKDDDQEINEKEKNKIYEENQTNKKKTWKKRTFSP 185 >UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: Cathepsin L - Kudoa thyrsites Length = 300 Score = 31.5 bits (68), Expect = 8.8 Identities = 15/49 (30%), Positives = 27/49 (55%), Gaps = 3/49 (6%) Frame = +2 Query: 158 VSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENK---HNIAKHN 295 +S D+ W+ K+EH+ ++D+ E++ R+ + EN HN HN Sbjct: 1 MSLEDVAIRLWSAHKLEHNIIFDSIEEERRRLCNFKENHQFIHNFNLHN 49 >UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endopterygota|Rep: Cathepsin L-like proteinase 
- Bombyx mori (Silk moth) Length = 402 Score = 31.5 bits (68), Expect = 8.8 Identities = 11/45 (24%), Positives = 24/45 (53%) Frame = +2 Query: 173 LVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQRRL 307 L + W+ +K H+K+Y + + + + +N +A+HN+ L Sbjct: 95 LPRRHWHEYKAIHNKLYSSTHHEMAALMKWRQNLRRVARHNREYL 139 >UniRef50_A0CKJ6 Cluster: Chromosome undetermined scaffold_2, whole genome shotgun sequence; n=3; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_2, whole genome shotgun sequence - Paramecium tetraurelia Length = 354 Score = 31.5 bits (68), Expect = 8.8 Identities = 17/51 (33%), Positives = 30/51 (58%), Gaps = 2/51 (3%) Frame = -1 Query: 276 CLFSAYIFMRNLSSISLSYTLLCSILKVFH--SSFTRSKKETAPTVATTRT 130 C++S ++NL S+ + L S+L H SSFTRSK +++ ++ R+ Sbjct: 45 CIYSTMSNIQNLQSLKNQISQLQSVLTQQHRKSSFTRSKADSSTNMSNDRS 95 Database: uniref50 Posted date: Oct 5, 2007 11:19 AM Number of letters in database: 575,637,011 Number of sequences in database: 1,657,284 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.279 0.0580 0.190 Matrix: BLOSUM62 Gap Penalties: Existence: 9, Extension: 2 Number of Hits to DB: 313,289,215 Number of Sequences: 1657284 Number of extensions: 4808949 Number of successful extensions: 18114 Number of sequences better than 10.0: 93 Number of HSP's better than 10.0 without gapping: 16823 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 18021 length of database: 575,637,011 effective HSP length: 93 effective length of database: 421,509,599 effective search space used: 23604537544 frameshift window, decay const: 40, 0.1 T: 12 A: 40 X1: 16 ( 7.3 bits) X2: 37 (14.9 bits) X3: 62 (25.0 bits) S1: 41 (21.7 bits)