BLASTX 2.2.12 [Aug-07-2005]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= I09A02NGRL0001_H14
(450 letters)
Database: uniref50
1,657,284 sequences; 575,637,011 total letters
Searching..................................................done
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C... 76 3e-13
UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine prot... 58 7e-08
UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat... 56 4e-07
UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ... 46 4e-04
UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ... 46 4e-04
UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine ... 45 9e-04
UniRef50_Q8I880 Cluster: Digestive cysteine protease intestain; ... 43 0.003
UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor; ... 42 0.008
UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi... 42 0.008
UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides sonorensis... 42 0.008
UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase pr... 41 0.011
UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei... 41 0.014
UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: No... 40 0.025
UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;... 40 0.025
UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain; ... 40 0.025
UniRef50_UPI00015B5D85 Cluster: PREDICTED: similar to cathepsin ... 40 0.033
UniRef50_A2A5X5 Cluster: Ortholog of keratin associated protein ... 40 0.033
UniRef50_A1ZBK7 Cluster: CG10460-PA; n=1; Drosophila melanogaste... 39 0.044
UniRef50_UPI0000D566ED Cluster: PREDICTED: similar to CTLA-2-alp... 39 0.058
UniRef50_UPI0000D566EC Cluster: PREDICTED: similar to CG10460-PA... 39 0.058
UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella v... 39 0.058
UniRef50_Q9BYR4 Cluster: Keratin-associated protein 4-3; n=53; M... 39 0.058
UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain... 38 0.076
UniRef50_Q23FR0 Cluster: Papain family cysteine protease contain... 38 0.10
UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina... 38 0.13
UniRef50_A7LFV3 Cluster: Cathepsin L protease inhibitor 2; n=1; ... 37 0.18
UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, wh... 37 0.18
UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n... 37 0.23
UniRef50_Q22LI1 Cluster: Papain family cysteine protease contain... 37 0.23
UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin ... 36 0.31
UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat... 36 0.31
UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ... 36 0.31
UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio... 36 0.31
UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor; ... 36 0.31
UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3; Curculionidae... 36 0.41
UniRef50_Q248G1 Cluster: Papain family cysteine protease contain... 36 0.41
UniRef50_Q23H06 Cluster: Papain family cysteine protease contain... 36 0.41
UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:... 36 0.54
UniRef50_Q17641 Cluster: Putative uncharacterized protein; n=11;... 36 0.54
UniRef50_Q9BYP8 Cluster: Keratin-associated protein 17-1; n=28; ... 36 0.54
UniRef50_Q94715 Cluster: Putative cathepsin L2 precursor; n=4; P... 36 0.54
UniRef50_Q14C04 Cluster: Keratin associated protein 4-7; n=17; M... 35 0.71
UniRef50_Q24FA8 Cluster: Papain family cysteine protease contain... 35 0.71
UniRef50_A7LFV2 Cluster: Cathepsin L protease inhibitor 1; n=1; ... 35 0.71
UniRef50_Q231X3 Cluster: Papain family cysteine protease contain... 35 0.94
UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicate... 35 0.94
UniRef50_UPI0001553357 Cluster: PREDICTED: similar to novel memb... 34 1.2
UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease ... 34 1.2
UniRef50_Q86MN0 Cluster: Cathepsin L-like cysteine protease; n=1... 34 1.2
UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, who... 34 1.2
UniRef50_Q86UY7 Cluster: Similar to RIKEN cDNA 2310002J15 gene; ... 34 1.2
UniRef50_UPI0001555AB0 Cluster: PREDICTED: hypothetical protein;... 34 1.6
UniRef50_UPI0000E22843 Cluster: PREDICTED: hypothetical protein;... 34 1.6
UniRef50_UPI000155F1D6 Cluster: PREDICTED: similar to keratin as... 33 2.2
UniRef50_UPI000155BC4F Cluster: PREDICTED: hypothetical protein,... 33 2.2
UniRef50_UPI000155483C Cluster: PREDICTED: similar to keratin as... 33 2.2
UniRef50_Q24E33 Cluster: Papain family cysteine protease contain... 33 2.2
UniRef50_A0CAI9 Cluster: Chromosome undetermined scaffold_161, w... 33 2.2
UniRef50_Q9BYR0 Cluster: Keratin-associated protein 4-7; n=149; ... 33 2.2
UniRef50_Q207N1 Cluster: Cathepsin S; n=2; Clupeocephala|Rep: Ca... 33 2.9
UniRef50_A2A4R5 Cluster: Novel member of the keratin associated ... 33 2.9
UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:... 33 2.9
UniRef50_Q8IUG1 Cluster: Keratin-associated protein 1-3; n=65; M... 33 2.9
UniRef50_UPI0001554838 Cluster: PREDICTED: similar to solute car... 33 3.8
UniRef50_UPI00006CB160 Cluster: hypothetical protein TTHERM_0029... 33 3.8
UniRef50_UPI0000499201 Cluster: RIO1 family protein; n=2; Entamo... 33 3.8
UniRef50_Q70SU8 Cluster: Cystein proteinase inhibitor protein pr... 33 3.8
UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n... 33 3.8
UniRef50_Q23VA1 Cluster: Papain family cysteine protease contain... 33 3.8
UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheiru... 33 3.8
UniRef50_Q7YWV7 Cluster: Putative uncharacterized protein; n=2; ... 32 5.0
UniRef50_Q23H10 Cluster: Papain family cysteine protease contain... 32 5.0
UniRef50_A0EI50 Cluster: Chromosome undetermined scaffold_98, wh... 32 5.0
UniRef50_A0BNM1 Cluster: Chromosome undetermined scaffold_119, w... 32 5.0
UniRef50_P60410 Cluster: Keratin-associated protein 10-8; n=12; ... 32 5.0
UniRef50_P60372 Cluster: Keratin-associated protein 10-4; n=18; ... 32 5.0
UniRef50_P25779 Cluster: Cruzipain precursor; n=54; Trypanosoma|... 32 5.0
UniRef50_Q9BXQ6 Cluster: Cat eye syndrome critical region protei... 32 5.0
UniRef50_UPI0001555234 Cluster: PREDICTED: similar to hCG2041354... 32 6.6
UniRef50_Q4MW49 Cluster: Putative uncharacterized protein; n=1; ... 32 6.6
UniRef50_A1Z9I0 Cluster: CG6357-PA; n=3; Drosophila melanogaster... 32 6.6
UniRef50_Q4PEV5 Cluster: Putative uncharacterized protein; n=1; ... 32 6.6
UniRef50_Q6L8H1 Cluster: Keratin-associated protein 5-4; n=160; ... 32 6.6
UniRef50_P60412 Cluster: Keratin-associated protein 10-11; n=80;... 32 6.6
UniRef50_P60368 Cluster: Keratin-associated protein 10-2; n=64; ... 32 6.6
UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h... 31 8.8
UniRef50_UPI0001552953 Cluster: PREDICTED: hypothetical protein;... 31 8.8
UniRef50_UPI0000EBE77C Cluster: PREDICTED: hypothetical protein;... 31 8.8
UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n... 31 8.8
UniRef50_Q7RH27 Cluster: Transmembrane amino acid transporter pr... 31 8.8
UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep: ... 31 8.8
UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2; Endop... 31 8.8
UniRef50_A0CKJ6 Cluster: Chromosome undetermined scaffold_2, who... 31 8.8
>UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15)
[Contains: Cathepsin L heavy chain; Cathepsin L light
chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC
3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin
L light chain] - Sarcophaga peregrina (Flesh fly)
(Boettcherisca peregrina)
Length = 339
Score = 76.2 bits (179), Expect = 3e-13
Identities = 31/52 (59%), Positives = 42/52 (80%)
Frame = +2
Query: 143 ATVGAVSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQ 298
A A+S DL+KEEW+T+K++H K Y NE+E++FRMKI+ EN+H IAKHNQ
Sbjct: 13 ALTQAISPLDLIKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQ 64
>UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine
protease; n=1; Maconellicoccus hirsutus|Rep: Putative
cathepsin L-like cysteine protease - Maconellicoccus
hirsutus (hibiscus mealybug)
Length = 339
Score = 58.4 bits (135), Expect = 7e-08
Identities = 23/52 (44%), Positives = 38/52 (73%)
Frame = +2
Query: 143 ATVGAVSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQ 298
+++ A+S +L EEW FK ++SK Y ++ED+ RMKI+ +NK+ IA+HN+
Sbjct: 14 SSIQAISPVNLFHEEWQLFKTQYSKKYTTDIEDRLRMKIFIDNKYRIAQHNK 65
>UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes
abbreviatus|Rep: Cathepsin L 2 - Diaprepes abbreviatus
(Sugarcane rootstalk borer weevil)
Length = 348
Score = 56.0 bits (129), Expect = 4e-07
Identities = 21/48 (43%), Positives = 35/48 (72%)
Frame = +2
Query: 155 AVSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQ 298
A+S+ LV+E+W FK+EH KVY++E E+++R ++ EN I +HN+
Sbjct: 17 AISYQVLVQEQWEQFKLEHGKVYESESENEYRQSVFMENLFQINEHNK 64
>UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin
L-like protease; n=1; Nasonia vitripennis|Rep:
PREDICTED: similar to cathepsin L-like protease -
Nasonia vitripennis
Length = 353
Score = 46.0 bits (104), Expect = 4e-04
Identities = 15/40 (37%), Positives = 28/40 (70%)
Frame = +2
Query: 182 EEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
++W FK+ + K Y+ ++E+ FR ++ EN+ IA+HNQ+
Sbjct: 38 DDWAAFKLRYKKNYNGDVEENFRRSVFHENQRKIAEHNQK 77
>UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin K
precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2);
n=2; Tribolium castaneum|Rep: PREDICTED: similar to
Cathepsin K precursor (Cathepsin O) (Cathepsin X)
(Cathepsin O2) - Tribolium castaneum
Length = 332
Score = 46.0 bits (104), Expect = 4e-04
Identities = 16/44 (36%), Positives = 30/44 (68%)
Frame = +2
Query: 170 DLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
+LV+EEWN FK H++ + + +E+ FR ++ +N + +HN+R
Sbjct: 21 NLVEEEWNKFKAMHARAFFDPLEETFRKSLFTKNLEIVEEHNER 64
>UniRef50_Q717S4 Cluster: Putative gut cathepsin L-like cysteine
protease; n=11; Callosobruchus maculatus|Rep: Putative
gut cathepsin L-like cysteine protease - Callosobruchus
maculatus (Southern cowpea weevil) (Pulse bruchid)
Length = 326
Score = 44.8 bits (101), Expect = 9e-04
Identities = 19/53 (35%), Positives = 33/53 (62%)
Frame = +2
Query: 143 ATVGAVSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
AT+ V+ V EEW FK++H K Y + +E+K R ++ +N +I +HN++
Sbjct: 8 ATLVLVAGASSVYEEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKK 60
>UniRef50_Q8I880 Cluster: Digestive cysteine protease intestain;
n=1; Leptinotarsa decemlineata|Rep: Digestive cysteine
protease intestain - Leptinotarsa decemlineata (Colorado
potato beetle)
Length = 60
Score = 43.2 bits (97), Expect = 0.003
Identities = 17/42 (40%), Positives = 26/42 (61%)
Frame = +2
Query: 173 LVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQ 298
+++E+W FK+ SK Y N +E+K R I+ N I +HNQ
Sbjct: 18 VIREKWQNFKINFSKSYQNVVEEKGRFNIFLSNLLRIEEHNQ 59
>UniRef50_Q70EX0 Cluster: Cathepsin L-like proteinase precursor;
n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
L-like proteinase precursor - Diabrotica virgifera
virgifera (western corn rootworm)
Length = 317
Score = 41.5 bits (93), Expect = 0.008
Identities = 15/42 (35%), Positives = 27/42 (64%)
Frame = +2
Query: 176 VKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
V ++W FK+ HSK Y + E++ R +++++N I +HN R
Sbjct: 12 VHQQWAQFKVNHSKKYGHLKEEQVRFQVFSQNLQKIEQHNAR 53
>UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsin L
- Suberites domuncula (Sponge)
Length = 324
Score = 41.5 bits (93), Expect = 0.008
Identities = 21/47 (44%), Positives = 27/47 (57%)
Frame = +2
Query: 155 AVSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295
+V+ FD EEW +K EHSK Y E+E+ R I+ NK I HN
Sbjct: 13 SVAAFDF-PEEWVAWKQEHSKEYTEELEELRRHTIWQSNKKFIDSHN 58
>UniRef50_Q5QBL6 Cluster: Cathepsin L; n=1; Culicoides
sonorensis|Rep: Cathepsin L - Culicoides sonorensis
Length = 331
Score = 41.5 bits (93), Expect = 0.008
Identities = 17/42 (40%), Positives = 25/42 (59%)
Frame = +2
Query: 182 EEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQRRL 307
EEW FK+E++KVY E+ R I+ N ++ +HN R L
Sbjct: 25 EEWKKFKLEYNKVYPLSTEENLRKGIFERNLADVMEHNARYL 66
>UniRef50_Q6VAM5 Cluster: Cathepsin L-like cysteine proteinase
precursor; n=1; Acanthoscelides obtectus|Rep: Cathepsin
L-like cysteine proteinase precursor - Acanthoscelides
obtectus (Bean weevil)
Length = 321
Score = 41.1 bits (92), Expect = 0.011
Identities = 15/41 (36%), Positives = 26/41 (63%)
Frame = +2
Query: 179 KEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
+E+W FK++H + Y +E+K R +I+ N I +HN+R
Sbjct: 20 QEKWQQFKIQHGRTYRTLLEEKRRFEIFKFNLRTIEEHNER 60
>UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine
proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like
midgut cysteine proteinase - Tenebrio molitor (Yellow
mealworm)
Length = 330
Score = 40.7 bits (91), Expect = 0.014
Identities = 16/43 (37%), Positives = 27/43 (62%)
Frame = +2
Query: 173 LVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
L +E+W+ FK+ H K Y + +E+ R I+ +N IA+HN +
Sbjct: 23 LFQEQWSQFKLTHKKSYSSPIEEIRRQLIFKDNVAKIAEHNAK 65
>UniRef50_Q1L8W8 Cluster: Novel protein; n=4; Danio rerio|Rep: Novel
protein - Danio rerio (Zebrafish) (Brachydanio rerio)
Length = 328
Score = 39.9 bits (89), Expect = 0.025
Identities = 13/38 (34%), Positives = 24/38 (63%)
Frame = +2
Query: 185 EWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQ 298
+W T+K +H+K Y N E++ R ++ +N +I HN+
Sbjct: 26 QWTTWKSQHNKTYRNTREERLRRSVWKQNLQDILLHNE 63
>UniRef50_Q70EX2 Cluster: Cathepsin L-like proteinase" precursor;
n=1; Diabrotica virgifera virgifera|Rep: Cathepsin
L-like proteinase" precursor - Diabrotica virgifera
virgifera (western corn rootworm)
Length = 315
Score = 39.9 bits (89), Expect = 0.025
Identities = 16/40 (40%), Positives = 25/40 (62%)
Frame = +2
Query: 182 EEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
E+W +FK H+K Y N +EDK R ++ +N I +HN +
Sbjct: 22 EKWTSFKATHNKSY-NVIEDKLRFAVFQDNLKKIEEHNAK 60
>UniRef50_A2I7P3 Cluster: Digestive cysteine protease intestain;
n=16; Chrysomelidae|Rep: Digestive cysteine protease
intestain - Leptinotarsa decemlineata (Colorado potato
beetle)
Length = 326
Score = 39.9 bits (89), Expect = 0.025
Identities = 16/41 (39%), Positives = 23/41 (56%)
Frame = +2
Query: 179 KEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
+++W FK H K Y N +E+K R I+ N I +HN R
Sbjct: 20 EDQWIAFKQTHGKTYKNLLEEKTRFGIFQRNLIKIKEHNAR 60
>UniRef50_UPI00015B5D85 Cluster: PREDICTED: similar to cathepsin
L-like proteinase; n=1; Nasonia vitripennis|Rep:
PREDICTED: similar to cathepsin L-like proteinase -
Nasonia vitripennis
Length = 96
Score = 39.5 bits (88), Expect = 0.033
Identities = 15/41 (36%), Positives = 25/41 (60%)
Frame = +2
Query: 173 LVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295
L +EW +K++ +K Y N E++ R KIY + K + +HN
Sbjct: 18 LADDEWEQYKIKFNKKYANPEEEQRRYKIYLDTKKKVEEHN 58
>UniRef50_A2A5X5 Cluster: Ortholog of keratin associated protein
16-1 KRTAP16-1; n=7; Murinae|Rep: Ortholog of keratin
associated protein 16-1 KRTAP16-1 - Mus musculus (Mouse)
Length = 502
Score = 39.5 bits (88), Expect = 0.033
Identities = 17/35 (48%), Positives = 19/35 (54%)
Frame = -2
Query: 305 GGAGCASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201
G +GC SQCC S +SSC C P C C PS
Sbjct: 50 GSSGCGSQCCQPSCSVSSC--CQPVCCEATICEPS 82
>UniRef50_A1ZBK7 Cluster: CG10460-PA; n=1; Drosophila
melanogaster|Rep: CG10460-PA - Drosophila melanogaster
(Fruit fly)
Length = 79
Score = 39.1 bits (87), Expect = 0.044
Identities = 18/40 (45%), Positives = 26/40 (65%)
Frame = +2
Query: 182 EEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
EEW +K + K Y+ E ED R +IYAE+K I +HN++
Sbjct: 7 EEWVEYKSKFDKNYEAE-EDLMRRRIYAESKARIEEHNRK 45
>UniRef50_UPI0000D566ED Cluster: PREDICTED: similar to CTLA-2-alpha
protein precursor; n=1; Tribolium castaneum|Rep:
PREDICTED: similar to CTLA-2-alpha protein precursor -
Tribolium castaneum
Length = 101
Score = 38.7 bits (86), Expect = 0.058
Identities = 14/42 (33%), Positives = 28/42 (66%)
Frame = +2
Query: 176 VKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
V +++N FK ++ K Y + E+ FR +++A+N I +HN++
Sbjct: 25 VTQKFNEFKTKYGKTYADANEENFRKQLFAKNLEKIEEHNKK 66
>UniRef50_UPI0000D566EC Cluster: PREDICTED: similar to CG10460-PA;
n=1; Tribolium castaneum|Rep: PREDICTED: similar to
CG10460-PA - Tribolium castaneum
Length = 80
Score = 38.7 bits (86), Expect = 0.058
Identities = 12/44 (27%), Positives = 26/44 (59%)
Frame = +2
Query: 170 DLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
+ ++E+WN FK ++ K Y + E+ +R ++ N + HN++
Sbjct: 8 EFIEEKWNEFKAKYRKNYTDAEEESYRKSLFVANLQMVESHNEK 51
>UniRef50_A7SY62 Cluster: Predicted protein; n=18; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 392
Score = 38.7 bits (86), Expect = 0.058
Identities = 16/46 (34%), Positives = 28/46 (60%)
Frame = +2
Query: 170 DLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQRRL 307
DLV ++++ F+ +H KVY+++ E + R I+ N I N+R L
Sbjct: 82 DLVDDDFDEFRQQHDKVYEDDSEHRRRKHIFRHNVRYIRSMNRRSL 127
>UniRef50_Q9BYR4 Cluster: Keratin-associated protein 4-3; n=53;
Mammalia|Rep: Keratin-associated protein 4-3 - Homo
sapiens (Human)
Length = 195
Score = 38.7 bits (86), Expect = 0.058
Identities = 18/31 (58%), Positives = 18/31 (58%)
Frame = -2
Query: 293 CASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201
C S CC S ISSC C P CRT CC PS
Sbjct: 45 CISSCCRPSCCISSC--CKPSCCRTTCCRPS 73
Score = 38.7 bits (86), Expect = 0.058
Identities = 18/31 (58%), Positives = 18/31 (58%)
Frame = -2
Query: 293 CASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201
C S CC S ISSC C P CRT CC PS
Sbjct: 75 CISSCCRPSCCISSC--CKPSCCRTTCCRPS 103
Score = 37.1 bits (82), Expect = 0.18
Identities = 17/31 (54%), Positives = 18/31 (58%)
Frame = -2
Query: 293 CASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201
C S CC S ISSC C P C+T CC PS
Sbjct: 105 CISSCCRPSCCISSC--CKPSCCQTTCCRPS 133
Score = 32.3 bits (70), Expect = 5.0
Identities = 16/30 (53%), Positives = 17/30 (56%)
Frame = -2
Query: 299 AGCASQCCVCSQRISSCGTCPPFRCRTLCC 210
A C S CC S +SSC C PF C T CC
Sbjct: 153 ACCISSCCHPSCCVSSC-RC-PFSCPTTCC 180
>UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain;
n=9; Cucujiformia|Rep: Digestive cysteine proteinase
intestain - Leptinotarsa decemlineata (Colorado potato
beetle)
Length = 326
Score = 38.3 bits (85), Expect = 0.076
Identities = 14/41 (34%), Positives = 23/41 (56%)
Frame = +2
Query: 179 KEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
K++W FK H K Y + +E++ R I+ N I +HN +
Sbjct: 20 KDQWVAFKQTHGKTYKSLLEERTRFGIFQSNLRKIEEHNAK 60
>UniRef50_Q23FR0 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 394
Score = 37.9 bits (84), Expect = 0.10
Identities = 19/48 (39%), Positives = 29/48 (60%)
Frame = +2
Query: 158 VSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
VS DL+ +N ++ +H +VY NE E FR I+ EN + +HNQ+
Sbjct: 28 VSVKDLLT--YNQWRNKHQRVYLNEHEQLFRQLIFLENLAKVNEHNQK 73
>UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteinase
B; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like
tick cysteine proteinase B - Haemaphysalis longicornis
(Bush tick)
Length = 332
Score = 37.5 bits (83), Expect = 0.13
Identities = 22/53 (41%), Positives = 29/53 (54%), Gaps = 1/53 (1%)
Frame = +2
Query: 146 TVGAVSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMK-IYAENKHNIAKHNQR 301
+V AVS +LV EW+ FK H K D + K IY EN+ IA+HN +
Sbjct: 13 SVAAVSHQELVGAEWSAFKALHGK--DTSRKQKSTTGWIYMENRLKIARHNAK 63
>UniRef50_A7LFV3 Cluster: Cathepsin L protease inhibitor 2; n=1;
Diaprepes abbreviatus|Rep: Cathepsin L protease
inhibitor 2 - Diaprepes abbreviatus (Sugarcane rootstalk
borer weevil)
Length = 91
Score = 37.1 bits (82), Expect = 0.18
Identities = 14/41 (34%), Positives = 25/41 (60%)
Frame = +2
Query: 179 KEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
+EEW FK ++ YD+ E+ R I+ +N +I +HN++
Sbjct: 14 QEEWEKFKTGFNRNYDSSDEEAKRFNIFQQNLQSIREHNEK 54
>UniRef50_A0CSW6 Cluster: Chromosome undetermined scaffold_26, whole
genome shotgun sequence; n=1; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_26,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 358
Score = 37.1 bits (82), Expect = 0.18
Identities = 15/36 (41%), Positives = 22/36 (61%)
Frame = +2
Query: 188 WNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295
+ + +E+ K YDN+ RM+I+ NK NI KHN
Sbjct: 43 FRNWMLEYGKSYDNDFTAIHRMQIFMRNKKNIEKHN 78
>UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3;
Bilateria|Rep: Cathepsin L-like cysteine proteinase -
Longidorus elongatus
Length = 358
Score = 36.7 bits (81), Expect = 0.23
Identities = 13/36 (36%), Positives = 22/36 (61%)
Frame = +2
Query: 188 WNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295
W FK++H+K Y + E+ R +++A N I +HN
Sbjct: 43 WTNFKLKHAKSYKTKDEELLRFQVFASNHKVIEQHN 78
>UniRef50_Q22LI1 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 987
Score = 36.7 bits (81), Expect = 0.23
Identities = 16/37 (43%), Positives = 26/37 (70%)
Frame = +2
Query: 185 EWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295
E+N + +H+KV+D E + K+R+ I+AEN I +HN
Sbjct: 30 EFNKWSAKHNKVFDPE-QLKYRLSIFAENYKKIKEHN 65
>UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin S
preproprotein; n=2; Tribolium castaneum|Rep: PREDICTED:
similar to cathepsin S preproprotein - Tribolium
castaneum
Length = 525
Score = 36.3 bits (80), Expect = 0.31
Identities = 14/42 (33%), Positives = 23/42 (54%)
Frame = +2
Query: 176 VKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
+ +EW FK ++ + Y N E+ FR I+ + I HN+R
Sbjct: 221 LNKEWENFKRKYERRYPNLEEENFRRAIFEKTFQEIKHHNER 262
>UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate
cathepsin L; n=4; Danio rerio|Rep: Novel protein similar
to vertebrate cathepsin L - Danio rerio (Zebrafish)
(Brachydanio rerio)
Length = 334
Score = 36.3 bits (80), Expect = 0.31
Identities = 16/37 (43%), Positives = 20/37 (54%)
Frame = +2
Query: 185 EWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295
EWN +K +H YD E ED R I+ N I K+N
Sbjct: 25 EWNLWKKKHEISYDEESEDVHRKTIWETNMQKIWKNN 61
>UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor;
n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase
precursor - Diabrotica virgifera virgifera (western corn
rootworm)
Length = 326
Score = 36.3 bits (80), Expect = 0.31
Identities = 15/41 (36%), Positives = 24/41 (58%)
Frame = +2
Query: 179 KEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
KEEW FK+ ++K Y N +E++ R I+ + I HN +
Sbjct: 20 KEEWVQFKVRNNKSYRNYIEEQKRFTIFQGSLRKIENHNDK 60
>UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2;
Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio
molitor (Yellow mealworm)
Length = 336
Score = 36.3 bits (80), Expect = 0.31
Identities = 13/42 (30%), Positives = 24/42 (57%)
Frame = +2
Query: 176 VKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
V E+W FK +++ Y N E+ FR +I+ + +HN++
Sbjct: 23 VAEKWENFKTTYARSYVNAKEETFRKQIFQKKLETFEEHNEK 64
>UniRef50_O97397 Cluster: Cathepsin L-like proteinase precursor;
n=6; Chrysomelinae|Rep: Cathepsin L-like proteinase
precursor - Phaedon cochleariae (Mustard beetle)
Length = 324
Score = 36.3 bits (80), Expect = 0.31
Identities = 14/39 (35%), Positives = 22/39 (56%)
Frame = +2
Query: 179 KEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295
+E W FK H++ Y + E+K R I+ + IA+HN
Sbjct: 20 QELWADFKKTHARTYKSLREEKLRFNIFQDTLRQIAEHN 58
>UniRef50_Q9Y0D2 Cluster: Cysteine proteinase; n=3;
Curculionidae|Rep: Cysteine proteinase - Hypera postica
(alfalfa weevil)
Length = 324
Score = 35.9 bits (79), Expect = 0.41
Identities = 14/37 (37%), Positives = 21/37 (56%)
Frame = +2
Query: 185 EWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295
++ FK+EH K Y N+ E+ R I+ +N I HN
Sbjct: 25 KFQAFKLEHGKTYLNQAEESKRFNIFTDNVRAIEAHN 61
>UniRef50_Q248G1 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 334
Score = 35.9 bits (79), Expect = 0.41
Identities = 13/38 (34%), Positives = 23/38 (60%)
Frame = +2
Query: 188 WNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
+N ++ + +VY++E E FR I+ ENK + HN +
Sbjct: 36 YNLWRQNNGRVYNSEEEQFFRQLIFVENKRQVDSHNSQ 73
>UniRef50_Q23H06 Cluster: Papain family cysteine protease containing
protein; n=18; Tetrahymena thermophila|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 349
Score = 35.9 bits (79), Expect = 0.41
Identities = 15/36 (41%), Positives = 20/36 (55%)
Frame = +2
Query: 188 WNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295
+N + EH +VY NE E FR ++ EN I HN
Sbjct: 29 YNKWSSEHQRVYLNEHEKLFRQMVFFENLQKIQDHN 64
>UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:
Cathepsin - Petromyzon marinus (Sea lamprey)
Length = 333
Score = 35.5 bits (78), Expect = 0.54
Identities = 12/37 (32%), Positives = 22/37 (59%)
Frame = +2
Query: 185 EWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295
+W+T+K + K Y +E ED R ++ +N + +HN
Sbjct: 26 QWDTWKSTYGKHYGSEQEDAHRRDVFEQNLKRVLQHN 62
>UniRef50_Q17641 Cluster: Putative uncharacterized protein; n=11;
Caenorhabditis|Rep: Putative uncharacterized protein -
Caenorhabditis elegans
Length = 197
Score = 35.5 bits (78), Expect = 0.54
Identities = 18/41 (43%), Positives = 20/41 (48%), Gaps = 4/41 (9%)
Frame = -2
Query: 314 GIGGGAGCASQ----CCVCSQRISSCGTCPPFRCRTLCCAP 204
G GGG GC CC C + + C TC RC T CC P
Sbjct: 79 GGGGGCGCCCCRPRCCCCCRRCCTCCRTCCCTRCCT-CCRP 118
Score = 33.5 bits (73), Expect = 2.2
Identities = 16/44 (36%), Positives = 18/44 (40%)
Frame = -2
Query: 317 EGIGGGAGCASQCCVCSQRISSCGTCPPFRCRTLCCAPS*KCST 186
+G G GC CC C CG C CR CC +C T
Sbjct: 62 QGGCGCCGCGCGCCGCGGGGGGCGCC---CCRPRCCCCCRRCCT 102
>UniRef50_Q9BYP8 Cluster: Keratin-associated protein 17-1; n=28;
Coelomata|Rep: Keratin-associated protein 17-1 - Homo
sapiens (Human)
Length = 105
Score = 35.5 bits (78), Expect = 0.54
Identities = 18/43 (41%), Positives = 20/43 (46%)
Frame = -2
Query: 314 GIGGGAGCASQCCVCSQRISSCGTCPPFRCRTLCCAPS*KCST 186
G GG GC CC S SSC C C +CC P+ C T
Sbjct: 64 GCGGCGGCGGGCCGSSCCGSSC--CGSGCCGPVCCQPTPICDT 104
>UniRef50_Q94715 Cluster: Putative cathepsin L2 precursor; n=4;
Paramecium tetraurelia|Rep: Putative cathepsin L2
precursor - Paramecium tetraurelia
Length = 294
Score = 35.5 bits (78), Expect = 0.54
Identities = 18/49 (36%), Positives = 31/49 (63%), Gaps = 3/49 (6%)
Frame = +2
Query: 164 FFDLVKEEWNTFK---MEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
++ L +++ N F+ ++++K Y E E +RM+IY NK I +HNQR
Sbjct: 4 YYHLQEDDTNDFERWALKNNKFY-TESEKLYRMEIYNSNKRMIEEHNQR 51
>UniRef50_Q14C04 Cluster: Keratin associated protein 4-7; n=17;
Mammalia|Rep: Keratin associated protein 4-7 - Mus
musculus (Mouse)
Length = 168
Score = 35.1 bits (77), Expect = 0.71
Identities = 14/32 (43%), Positives = 20/32 (62%)
Frame = -2
Query: 296 GCASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201
GC+ CC S +SSC C P C+++CC P+
Sbjct: 14 GCSQGCCQPSCCVSSC--CRPQCCQSVCCQPT 43
Score = 33.1 bits (72), Expect = 2.9
Identities = 16/31 (51%), Positives = 16/31 (51%)
Frame = -2
Query: 293 CASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201
C CC S ISSC C P CR CC PS
Sbjct: 40 CQPTCCRPSCCISSC--CRPSCCRPSCCRPS 68
Score = 31.5 bits (68), Expect = 8.8
Identities = 14/30 (46%), Positives = 16/30 (53%), Gaps = 2/30 (6%)
Frame = -2
Query: 287 SQCC--VCSQRISSCGTCPPFRCRTLCCAP 204
S CC VCS+ S G C P C + CC P
Sbjct: 3 SSCCGSVCSEEGCSQGCCQPSCCVSSCCRP 32
>UniRef50_Q24FA8 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 335
Score = 35.1 bits (77), Expect = 0.71
Identities = 12/37 (32%), Positives = 24/37 (64%)
Frame = +2
Query: 188 WNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQ 298
+N ++ E+ KVY +E E +R ++ EN ++ +HN+
Sbjct: 32 YNKWREENGKVYSSEAEKIYRQSVFLENYQSVQEHNK 68
>UniRef50_A7LFV2 Cluster: Cathepsin L protease inhibitor 1; n=1;
Diaprepes abbreviatus|Rep: Cathepsin L protease
inhibitor 1 - Diaprepes abbreviatus (Sugarcane rootstalk
borer weevil)
Length = 109
Score = 35.1 bits (77), Expect = 0.71
Identities = 13/42 (30%), Positives = 25/42 (59%)
Frame = +2
Query: 176 VKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
V+E WN FK + ++ Y++ E+ R +I+ N +I H ++
Sbjct: 31 VEEHWNNFKTKFNRNYESPEEESKRFEIFKNNLKDIQAHQKK 72
>UniRef50_Q231X3 Cluster: Papain family cysteine protease containing
protein; n=3; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 323
Score = 34.7 bits (76), Expect = 0.94
Identities = 13/47 (27%), Positives = 31/47 (65%)
Frame = +2
Query: 158 VSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQ 298
++ + V + W+ +K +H+K Y+N + +R++++AEN + K++Q
Sbjct: 29 ITIDESVTKIWSQWKQKHNKRYENTDYESYRLEVFAENL-EVVKNDQ 74
>UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicatein
a3 - Lubomirskia baicalensis
Length = 344
Score = 34.7 bits (76), Expect = 0.94
Identities = 12/38 (31%), Positives = 23/38 (60%)
Frame = +2
Query: 182 EEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295
+EW+ +K H + Y++++++ R I+ NK I HN
Sbjct: 42 QEWSVWKGHHQRSYESQLQEMERHSIWVANKKYIEHHN 79
>UniRef50_UPI0001553357 Cluster: PREDICTED: similar to novel member
of the keratin associated protein 4 (Krtap4) family;
n=1; Mus musculus|Rep: PREDICTED: similar to novel
member of the keratin associated protein 4 (Krtap4)
family - Mus musculus
Length = 292
Score = 34.3 bits (75), Expect = 1.2
Identities = 16/40 (40%), Positives = 23/40 (57%)
Frame = -2
Query: 320 DEGIGGGAGCASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201
+EG G C + CC S +SSC C P C+++CC P+
Sbjct: 12 EEGCGQSC-CQTTCCRPSCCVSSC--CRPQCCQSVCCQPT 48
>UniRef50_UPI00006CFB97 Cluster: Papain family cysteine protease
containing protein; n=1; Tetrahymena thermophila
SB210|Rep: Papain family cysteine protease containing
protein - Tetrahymena thermophila SB210
Length = 360
Score = 34.3 bits (75), Expect = 1.2
Identities = 10/41 (24%), Positives = 26/41 (63%)
Frame = +2
Query: 176 VKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQ 298
++ + FK++++K Y ++ E+++R ++ N I +HN+
Sbjct: 41 IERAFKNFKVKYAKTYKDDTEEQYRFSVFTNNYVEIYRHNK 81
>UniRef50_Q86MN0 Cluster: Cathepsin L-like cysteine protease; n=1;
Panagrolaimus davidi|Rep: Cathepsin L-like cysteine
protease - Panagrolaimus davidi
Length = 56
Score = 34.3 bits (75), Expect = 1.2
Identities = 14/40 (35%), Positives = 22/40 (55%)
Frame = +2
Query: 182 EEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
+EW +K + K Y +E +K R +IY +N HN+R
Sbjct: 9 KEWQDYKQKFDKSYPDEETEKQRYQIYKKNVEENETHNKR 48
>UniRef50_A0CK16 Cluster: Chromosome undetermined scaffold_2, whole
genome shotgun sequence; n=3; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_2,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 376
Score = 34.3 bits (75), Expect = 1.2
Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 3/50 (6%)
Frame = +2
Query: 161 SFFDLVKEEWNTFK---MEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
S + +EE FK E+ K Y+NE E +RM+++ +N + HN++
Sbjct: 32 SIYGWREEEQKQFKNWVQENQKTYNNEFEMIYRMEVFVKNYRTMKHHNEQ 81
>UniRef50_Q86UY7 Cluster: Similar to RIKEN cDNA 2310002J15 gene;
n=12; Eutheria|Rep: Similar to RIKEN cDNA 2310002J15
gene - Homo sapiens (Human)
Length = 144
Score = 34.3 bits (75), Expect = 1.2
Identities = 15/35 (42%), Positives = 17/35 (48%), Gaps = 1/35 (2%)
Frame = -2
Query: 311 IGGGAGCASQCCVCSQRISSCGTCPPF-RCRTLCC 210
+G G A CC C C CPPF RC + CC
Sbjct: 108 VGRGDDIAHHCCCCP--CCHCCHCPPFCRCHSCCC 140
>UniRef50_UPI0001555AB0 Cluster: PREDICTED: hypothetical protein;
n=4; Ornithorhynchus anatinus|Rep: PREDICTED:
hypothetical protein - Ornithorhynchus anatinus
Length = 288
Score = 33.9 bits (74), Expect = 1.6
Identities = 13/31 (41%), Positives = 15/31 (48%)
Frame = -2
Query: 302 GAGCASQCCVCSQRISSCGTCPPFRCRTLCC 210
G GC + C C S CPP C+T CC
Sbjct: 16 GRGCCQETC-CEPSCCSSPCCPPTCCQTTCC 45
>UniRef50_UPI0000E22843 Cluster: PREDICTED: hypothetical protein;
n=1; Pan troglodytes|Rep: PREDICTED: hypothetical
protein - Pan troglodytes
Length = 304
Score = 33.9 bits (74), Expect = 1.6
Identities = 14/41 (34%), Positives = 19/41 (46%), Gaps = 3/41 (7%)
Frame = -2
Query: 299 AGCASQCC---VCSQRISSCGTCPPFRCRTLCCAPS*KCST 186
+GC S CC C C P+ C++ CC P CS+
Sbjct: 234 SGCGSSCCQSSCCKPYCCQSSCCKPYCCQSSCCKPC-SCSS 273
>UniRef50_UPI000155F1D6 Cluster: PREDICTED: similar to keratin
associated protein 9.3; n=1; Equus caballus|Rep:
PREDICTED: similar to keratin associated protein 9.3 -
Equus caballus
Length = 302
Score = 33.5 bits (73), Expect = 2.2
Identities = 13/33 (39%), Positives = 17/33 (51%)
Frame = -2
Query: 299 AGCASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201
+ C Q C S C CPP C+T+CC P+
Sbjct: 127 SSCCGQTCSRSSCCQPC--CPPACCQTICCQPA 157
>UniRef50_UPI000155BC4F Cluster: PREDICTED: hypothetical protein,
partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED:
hypothetical protein, partial - Ornithorhynchus anatinus
Length = 309
Score = 33.5 bits (73), Expect = 2.2
Identities = 13/31 (41%), Positives = 15/31 (48%)
Frame = -2
Query: 302 GAGCASQCCVCSQRISSCGTCPPFRCRTLCC 210
G GC + C C S CPP C+T CC
Sbjct: 16 GRGCCQETC-CQPGCCSSPCCPPTCCQTTCC 45
Score = 33.1 bits (72), Expect = 2.9
Identities = 12/31 (38%), Positives = 19/31 (61%)
Frame = -2
Query: 293 CASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201
C + CCV S +C C P+ C+++CC P+
Sbjct: 277 CQATCCVTSCCRPTC--CSPYCCQSVCCQPT 305
>UniRef50_UPI000155483C Cluster: PREDICTED: similar to keratin
associated protein; n=8; Ornithorhynchus anatinus|Rep:
PREDICTED: similar to keratin associated protein -
Ornithorhynchus anatinus
Length = 399
Score = 33.5 bits (73), Expect = 2.2
Identities = 13/36 (36%), Positives = 19/36 (52%)
Frame = -2
Query: 308 GGGAGCASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201
G AGC S C + +++ G+C P C+ CC S
Sbjct: 222 GAPAGCQSSCGPSTCQLACTGSCSPSCCQDSCCQQS 257
>UniRef50_Q24E33 Cluster: Papain family cysteine protease containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 328
Score = 33.5 bits (73), Expect = 2.2
Identities = 14/40 (35%), Positives = 22/40 (55%)
Frame = +2
Query: 182 EEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
E + FK+EH+ V+ N ED +R I+ +N I N +
Sbjct: 40 EMYAEFKLEHNIVFQNSEEDLYRQNIFFQNVRYIQSENAK 79
>UniRef50_A0CAI9 Cluster: Chromosome undetermined scaffold_161, whole
genome shotgun sequence; n=1; Paramecium tetraurelia|Rep:
Chromosome undetermined scaffold_161, whole genome
shotgun sequence - Paramecium tetraurelia
Length = 3076
Score = 33.5 bits (73), Expect = 2.2
Identities = 12/24 (50%), Positives = 15/24 (62%)
Frame = -2
Query: 293 CASQCCVCSQRISSCGTCPPFRCR 222
C+ +C CSQR C +CPPF R
Sbjct: 957 CSYRCKTCSQREEQCLSCPPFSLR 980
>UniRef50_Q9BYR0 Cluster: Keratin-associated protein 4-7; n=149;
Eukaryota|Rep: Keratin-associated protein 4-7 - Homo
sapiens (Human)
Length = 210
Score = 33.5 bits (73), Expect = 2.2
Identities = 14/31 (45%), Positives = 19/31 (61%)
Frame = -2
Query: 293 CASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201
C S CC S +SSC C P C+++CC P+
Sbjct: 80 CISSCCRPSCCMSSC--CKPQCCQSVCCQPT 108
Score = 31.9 bits (69), Expect = 6.6
Identities = 13/31 (41%), Positives = 18/31 (58%)
Frame = -2
Query: 293 CASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201
C S CC S +S C C P C+++CC P+
Sbjct: 115 CISSCCRPSCCVSRC--CRPQCCQSVCCQPT 143
>UniRef50_Q207N1 Cluster: Cathepsin S; n=2; Clupeocephala|Rep:
Cathepsin S - Ictalurus punctatus (Channel catfish)
Length = 84
Score = 33.1 bits (72), Expect = 2.9
Identities = 14/36 (38%), Positives = 20/36 (55%)
Frame = +2
Query: 188 WNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295
W +K HSK Y +E+E+ R +I+ N I HN
Sbjct: 26 WLMWKKNHSKTYTSELEELGRREIWERNLRLITVHN 61
>UniRef50_A2A4R5 Cluster: Novel member of the keratin associated
protein 4 (Krtap4) family; n=10; Theria|Rep: Novel
member of the keratin associated protein 4 (Krtap4)
family - Mus musculus (Mouse)
Length = 167
Score = 33.1 bits (72), Expect = 2.9
Identities = 14/34 (41%), Positives = 19/34 (55%), Gaps = 3/34 (8%)
Frame = -2
Query: 293 CASQCCVCSQRISSC---GTCPPFRCRTLCCAPS 201
C S CC S +SSC C P C+++CC P+
Sbjct: 39 CVSSCCRPSCCVSSCCRPSCCRPQCCQSVCCQPT 72
>UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:
Silicatein beta - Suberites domuncula (Sponge)
Length = 383
Score = 33.1 bits (72), Expect = 2.9
Identities = 13/40 (32%), Positives = 22/40 (55%)
Frame = +2
Query: 179 KEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQ 298
+E+W + +H KVY + E + ++ NK I +HNQ
Sbjct: 53 EEDWKQWTTDHHKVYSDVRERVDKYTVWRANKEYIDQHNQ 92
>UniRef50_Q8IUG1 Cluster: Keratin-associated protein 1-3; n=65;
Mammalia|Rep: Keratin-associated protein 1-3 - Homo
sapiens (Human)
Length = 177
Score = 33.1 bits (72), Expect = 2.9
Identities = 15/31 (48%), Positives = 16/31 (51%)
Frame = -2
Query: 293 CASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201
C S CC S +SC C P C T CC PS
Sbjct: 20 CGSSCCQPSCCETSC--CQPSCCETSCCQPS 48
>UniRef50_UPI0001554838 Cluster: PREDICTED: similar to solute
carrier family 5 (sodium/glucose cotransporter), member
9; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar
to solute carrier family 5 (sodium/glucose
cotransporter), member 9 - Ornithorhynchus anatinus
Length = 300
Score = 32.7 bits (71), Expect = 3.8
Identities = 13/30 (43%), Positives = 15/30 (50%)
Frame = -2
Query: 293 CASQCCVCSQRISSCGTCPPFRCRTLCCAP 204
C S CC S R+ C C P C +CC P
Sbjct: 5 CCSPCCRPSGRVPVC--CKPVCCEPVCCKP 32
Score = 32.3 bits (70), Expect = 5.0
Identities = 15/39 (38%), Positives = 19/39 (48%), Gaps = 4/39 (10%)
Frame = -2
Query: 308 GGGAGCASQCCVCSQ--RISSCGT--CPPFRCRTLCCAP 204
G + C + CC CS R S C C P C+ +CC P
Sbjct: 143 GRDSCCGTVCCCCSPCCRPSGCVPVCCEPVCCKPVCCVP 181
Score = 31.5 bits (68), Expect = 8.8
Identities = 13/32 (40%), Positives = 15/32 (46%)
Frame = -2
Query: 299 AGCASQCCVCSQRISSCGTCPPFRCRTLCCAP 204
A C S CC S + C C P C +CC P
Sbjct: 58 ASCCSPCCRPSGCVPVC--CKPVCCEPVCCVP 87
>UniRef50_UPI00006CB160 Cluster: hypothetical protein TTHERM_00298410;
n=1; Tetrahymena thermophila SB210|Rep: hypothetical
protein TTHERM_00298410 - Tetrahymena thermophila SB210
Length = 1366
Score = 32.7 bits (71), Expect = 3.8
Identities = 14/39 (35%), Positives = 16/39 (41%)
Frame = -2
Query: 320 DEGIGGGAGCASQCCVCSQRISSCGTCPPFRCRTLCCAP 204
D G CA +C CS C TC P R + C P
Sbjct: 1064 DTGAANCQQCAPKCATCSTSSVQCLTCAPGRINSDCSCP 1102
>UniRef50_UPI0000499201 Cluster: RIO1 family protein; n=2; Entamoeba
histolytica HM-1:IMSS|Rep: RIO1 family protein -
Entamoeba histolytica HM-1:IMSS
Length = 474
Score = 32.7 bits (71), Expect = 3.8
Identities = 14/41 (34%), Positives = 22/41 (53%)
Frame = +2
Query: 179 KEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
KEE + EH K Y E +K + K+ + K + KH++R
Sbjct: 430 KEEEKKLRKEHKKQYKIERREKLKHKMPKKKKEQLIKHSKR 470
>UniRef50_Q70SU8 Cluster: Cystein proteinase inhibitor protein
precursor; n=4; Salmonidae|Rep: Cystein proteinase
inhibitor protein precursor - Salmo salar (Atlantic
salmon)
Length = 342
Score = 32.7 bits (71), Expect = 3.8
Identities = 12/42 (28%), Positives = 27/42 (64%)
Frame = +2
Query: 176 VKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
V +E+ T+K+++ K Y + +E+ R +I+ + + +HN+R
Sbjct: 270 VHKEFETWKVKYGKTYPSTVEEAKRKEIWLATRKMVMEHNKR 311
Score = 32.3 bits (70), Expect = 5.0
Identities = 13/42 (30%), Positives = 25/42 (59%)
Frame = +2
Query: 176 VKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
V +E+ T+K++H K Y + E+ R I+ + + +HN+R
Sbjct: 193 VDKEFETWKVQHGKNYGSTEEEAKRKGIWLATRTRVMEHNKR 234
>UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase;
n=21; Bilateria|Rep: Cathepsin L-like cysteine
proteinase - Globodera pallida
Length = 379
Score = 32.7 bits (71), Expect = 3.8
Identities = 15/39 (38%), Positives = 21/39 (53%), Gaps = 1/39 (2%)
Frame = +2
Query: 185 EWNTFKMEHS-KVYDNEMEDKFRMKIYAENKHNIAKHNQ 298
+WN +K +H K Y ++ + RM Y K I KHNQ
Sbjct: 69 DWNAYKQKHGRKAYADQDVENERMLTYLSAKQFIDKHNQ 107
>UniRef50_Q23VA1 Cluster: Papain family cysteine protease containing
protein; n=2; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 397
Score = 32.7 bits (71), Expect = 3.8
Identities = 17/50 (34%), Positives = 26/50 (52%)
Frame = +2
Query: 146 TVGAVSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295
++ +V DL+ ++N +K +H K Y N E FR IY N +HN
Sbjct: 28 SLSSVQIKDLL--DFNKWKYQHGKKYFNADEANFRQLIYLMNLQKFNEHN 75
>UniRef50_A7TZ36 Cluster: Cysteine proteinase; n=1; Lepeophtheirus
salmonis|Rep: Cysteine proteinase - Lepeophtheirus
salmonis (salmon louse)
Length = 372
Score = 32.7 bits (71), Expect = 3.8
Identities = 12/38 (31%), Positives = 22/38 (57%)
Frame = +2
Query: 182 EEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295
+E+ +F E+SK Y N ++K++ +N I +HN
Sbjct: 25 QEFESFVKEYSKSYHNRALRSLKLKVFVDNLREIEEHN 62
>UniRef50_Q7YWV7 Cluster: Putative uncharacterized protein; n=2;
Caenorhabditis|Rep: Putative uncharacterized protein -
Caenorhabditis elegans
Length = 212
Score = 32.3 bits (70), Expect = 5.0
Identities = 16/42 (38%), Positives = 16/42 (38%), Gaps = 3/42 (7%)
Frame = -2
Query: 308 GGGAGCAS---QCCVCSQRISSCGTCPPFRCRTLCCAPS*KC 192
GGG GC CC C R C CRT CC C
Sbjct: 74 GGGCGCCGCGCGCCCCRPRCCCCCRRCCTCCRTCCCTRCCTC 115
Score = 31.9 bits (69), Expect = 6.6
Identities = 16/39 (41%), Positives = 18/39 (46%), Gaps = 2/39 (5%)
Frame = -2
Query: 314 GIGGGAGCASQ--CCVCSQRISSCGTCPPFRCRTLCCAP 204
G G G C CC C + + C TC RC T CC P
Sbjct: 81 GCGCGCCCCRPRCCCCCRRCCTCCRTCCCTRCCT-CCRP 118
>UniRef50_Q23H10 Cluster: Papain family cysteine protease containing
protein; n=14; Tetrahymena thermophila SB210|Rep: Papain
family cysteine protease containing protein -
Tetrahymena thermophila SB210
Length = 336
Score = 32.3 bits (70), Expect = 5.0
Identities = 13/36 (36%), Positives = 21/36 (58%)
Frame = +2
Query: 188 WNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295
+N + ++ +VY NE E FR ++ EN I +HN
Sbjct: 29 YNQWSSQNQRVYLNEHEKLFRQMVFFENFQKIQEHN 64
>UniRef50_A0EI50 Cluster: Chromosome undetermined scaffold_98, whole
genome shotgun sequence; n=1; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_98,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 336
Score = 32.3 bits (70), Expect = 5.0
Identities = 13/40 (32%), Positives = 24/40 (60%)
Frame = +2
Query: 182 EEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
+E++ +K H K+Y +ED +R +I+ +N + HN R
Sbjct: 27 DEYSKWKQHHQKLYQG-VEDTYRKQIFHQNLQIVNDHNAR 65
>UniRef50_A0BNM1 Cluster: Chromosome undetermined scaffold_119,
whole genome shotgun sequence; n=1; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_119,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 341
Score = 32.3 bits (70), Expect = 5.0
Identities = 14/39 (35%), Positives = 24/39 (61%)
Frame = +2
Query: 185 EWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQR 301
++ + ++H K Y + E K+R IY +NK I +HN+R
Sbjct: 28 DFERWALKHGKHYFGD-EKKYRQAIYFQNKQMIEEHNKR 65
>UniRef50_P60410 Cluster: Keratin-associated protein 10-8; n=12;
Eutheria|Rep: Keratin-associated protein 10-8 - Homo
sapiens (Human)
Length = 259
Score = 32.3 bits (70), Expect = 5.0
Identities = 12/32 (37%), Positives = 16/32 (50%)
Frame = -2
Query: 299 AGCASQCCVCSQRISSCGTCPPFRCRTLCCAP 204
+ C S CC S +C C P C+ +CC P
Sbjct: 149 SSCQSACCTFSPCQQAC--CVPICCKPICCVP 178
>UniRef50_P60372 Cluster: Keratin-associated protein 10-4; n=18;
Eutheria|Rep: Keratin-associated protein 10-4 - Homo
sapiens (Human)
Length = 401
Score = 32.3 bits (70), Expect = 5.0
Identities = 12/32 (37%), Positives = 16/32 (50%)
Frame = -2
Query: 299 AGCASQCCVCSQRISSCGTCPPFRCRTLCCAP 204
+ C CC S +C C P C+T+CC P
Sbjct: 212 SSCQPACCTSSSCQQAC--CVPVCCKTVCCKP 241
Score = 31.5 bits (68), Expect = 8.8
Identities = 12/32 (37%), Positives = 16/32 (50%)
Frame = -2
Query: 299 AGCASQCCVCSQRISSCGTCPPFRCRTLCCAP 204
+ C CC S +C C P C+T+CC P
Sbjct: 93 SSCQLACCASSPCQQAC--CVPVCCKTVCCKP 122
>UniRef50_P25779 Cluster: Cruzipain precursor; n=54;
Trypanosoma|Rep: Cruzipain precursor - Trypanosoma cruzi
Length = 467
Score = 32.3 bits (70), Expect = 5.0
Identities = 10/34 (29%), Positives = 22/34 (64%)
Frame = +2
Query: 170 DLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAEN 271
+ + ++ FK +H +VY++ E+ FR+ ++ EN
Sbjct: 32 ETLTSQFAEFKQKHGRVYESAAEEAFRLSVFREN 65
>UniRef50_Q9BXQ6 Cluster: Cat eye syndrome critical region protein
6; n=11; Mammalia|Rep: Cat eye syndrome critical region
protein 6 - Homo sapiens (Human)
Length = 578
Score = 32.3 bits (70), Expect = 5.0
Identities = 17/41 (41%), Positives = 18/41 (43%)
Frame = -2
Query: 314 GIGGGAGCASQCCVCSQRISSCGTCPPFRCRTLCCAPS*KC 192
G G GA C CC C C P R R CAPS +C
Sbjct: 161 GTGSGASCCPCCCCC-----GCPDRPGRRGRRRGCAPSPRC 196
>UniRef50_UPI0001555234 Cluster: PREDICTED: similar to hCG2041354;
n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to
hCG2041354 - Ornithorhynchus anatinus
Length = 121
Score = 31.9 bits (69), Expect = 6.6
Identities = 16/37 (43%), Positives = 16/37 (43%), Gaps = 1/37 (2%)
Frame = -2
Query: 308 GGGAGCASQCCVCSQRISSCGTCPPF-RCRTLCCAPS 201
G G A CC C SC CPP RC CC S
Sbjct: 87 GPGDDIAHNCCCCP--CCSCCHCPPCCRCHPCCCVVS 121
>UniRef50_Q4MW49 Cluster: Putative uncharacterized protein; n=1;
Bacillus cereus G9241|Rep: Putative uncharacterized
protein - Bacillus cereus G9241
Length = 165
Score = 31.9 bits (69), Expect = 6.6
Identities = 13/26 (50%), Positives = 13/26 (50%)
Frame = -2
Query: 308 GGGAGCASQCCVCSQRISSCGTCPPF 231
G GAGC C VCS C TC F
Sbjct: 31 GCGAGCCGSCFVCSCWTGCCATCCSF 56
>UniRef50_A1Z9I0 Cluster: CG6357-PA; n=3; Drosophila
melanogaster|Rep: CG6357-PA - Drosophila melanogaster
(Fruit fly)
Length = 439
Score = 31.9 bits (69), Expect = 6.6
Identities = 15/50 (30%), Positives = 23/50 (46%)
Frame = +2
Query: 146 TVGAVSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295
T G F + + W F ++ YDN+ E + R I+ EN + HN
Sbjct: 58 TSGLSEFEEECQFAWQRFLVDFDVHYDNDYERQKRRDIFCENWQKVRDHN 107
>UniRef50_Q4PEV5 Cluster: Putative uncharacterized protein; n=1;
Ustilago maydis|Rep: Putative uncharacterized protein -
Ustilago maydis (Smut fungus)
Length = 531
Score = 31.9 bits (69), Expect = 6.6
Identities = 14/30 (46%), Positives = 18/30 (60%)
Frame = -2
Query: 290 ASQCCVCSQRISSCGTCPPFRCRTLCCAPS 201
AS+C + S +S+ CPP RC TL A S
Sbjct: 2 ASRCSLRSSWLSAVAVCPPQRCSTLTIAAS 31
>UniRef50_Q6L8H1 Cluster: Keratin-associated protein 5-4; n=160;
Fungi/Metazoa group|Rep: Keratin-associated protein 5-4
- Homo sapiens (Human)
Length = 288
Score = 31.9 bits (69), Expect = 6.6
Identities = 14/32 (43%), Positives = 18/32 (56%)
Frame = -2
Query: 299 AGCASQCCVCSQRISSCGTCPPFRCRTLCCAP 204
+GC S CC SSC C P+ C++ CC P
Sbjct: 228 SGCGSSCCQ-----SSC--CKPYCCQSSCCKP 252
>UniRef50_P60412 Cluster: Keratin-associated protein 10-11; n=80;
Eutheria|Rep: Keratin-associated protein 10-11 - Homo
sapiens (Human)
Length = 298
Score = 31.9 bits (69), Expect = 6.6
Identities = 12/32 (37%), Positives = 16/32 (50%)
Frame = -2
Query: 299 AGCASQCCVCSQRISSCGTCPPFRCRTLCCAP 204
+ C CC S +C C P C+T+CC P
Sbjct: 83 SSCQPACCTSSPCQQAC--CVPVCCKTVCCKP 112
>UniRef50_P60368 Cluster: Keratin-associated protein 10-2; n=64;
Coelomata|Rep: Keratin-associated protein 10-2 - Homo
sapiens (Human)
Length = 255
Score = 31.9 bits (69), Expect = 6.6
Identities = 16/36 (44%), Positives = 18/36 (50%), Gaps = 5/36 (13%)
Frame = -2
Query: 293 CASQCCV--CSQRISSC---GTCPPFRCRTLCCAPS 201
C S CCV CS S C +C P C + CC PS
Sbjct: 193 CKSICCVPVCSGASSPCCQQSSCQPACCTSSCCRPS 228
>UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin
heavy chain; n=3; Amniota|Rep: PREDICTED: similar to
ferritin heavy chain - Ornithorhynchus anatinus
Length = 338
Score = 31.5 bits (68), Expect = 8.8
Identities = 14/39 (35%), Positives = 21/39 (53%)
Frame = +2
Query: 182 EEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQ 298
E W +K+ H K Y E E+ FR + +N I +HN+
Sbjct: 26 EGWWRWKVLHGKNYSVEAEEVFRRAAWEKNVRVIERHNE 64
>UniRef50_UPI0001552953 Cluster: PREDICTED: hypothetical protein;
n=1; Mus musculus|Rep: PREDICTED: hypothetical protein -
Mus musculus
Length = 205
Score = 31.5 bits (68), Expect = 8.8
Identities = 15/37 (40%), Positives = 17/37 (45%), Gaps = 2/37 (5%)
Frame = -2
Query: 314 GIGGGAGCASQC--CVCSQRISSCGTCPPFRCRTLCC 210
G GG GC+S C C C CG C R +CC
Sbjct: 55 GCGGCGGCSSCCGGCGCGGCGGCCGCCGCCRPTVVCC 91
>UniRef50_UPI0000EBE77C Cluster: PREDICTED: hypothetical protein;
n=7; Theria|Rep: PREDICTED: hypothetical protein - Bos
taurus
Length = 372
Score = 31.5 bits (68), Expect = 8.8
Identities = 12/31 (38%), Positives = 14/31 (45%)
Frame = -2
Query: 296 GCASQCCVCSQRISSCGTCPPFRCRTLCCAP 204
GC S C C CG+C C + CC P
Sbjct: 208 GCGSSCGGCGSSCGGCGSCG--GCGSSCCVP 236
>UniRef50_Q0WVJ5 Cluster: Papain-like cysteine peptidase XBCP3; n=4;
core eudicotyledons|Rep: Papain-like cysteine peptidase
XBCP3 - Arabidopsis thaliana (Mouse-ear cress)
Length = 437
Score = 31.5 bits (68), Expect = 8.8
Identities = 14/49 (28%), Positives = 28/49 (57%)
Frame = +2
Query: 149 VGAVSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHN 295
V + S D + E ++ + +H K Y +E E + R++I+ +N + +HN
Sbjct: 19 VSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHN 67
>UniRef50_Q7RH27 Cluster: Transmembrane amino acid transporter
protein, putative; n=6; Plasmodium|Rep: Transmembrane
amino acid transporter protein, putative - Plasmodium
yoelii yoelii
Length = 645
Score = 31.5 bits (68), Expect = 8.8
Identities = 15/38 (39%), Positives = 23/38 (60%)
Frame = +2
Query: 203 MEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQRRLLSP 316
++H K D E+ +K + KIY EN+ N K ++R SP
Sbjct: 148 IDHIKDDDQEINEKEKNKIYEENQTNKKKTWKKRTFSP 185
>UniRef50_A1YHR6 Cluster: Cathepsin L; n=7; Kudoa thyrsites|Rep:
Cathepsin L - Kudoa thyrsites
Length = 300
Score = 31.5 bits (68), Expect = 8.8
Identities = 15/49 (30%), Positives = 27/49 (55%), Gaps = 3/49 (6%)
Frame = +2
Query: 158 VSFFDLVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENK---HNIAKHN 295
+S D+ W+ K+EH+ ++D+ E++ R+ + EN HN HN
Sbjct: 1 MSLEDVAIRLWSAHKLEHNIIFDSIEEERRRLCNFKENHQFIHNFNLHN 49
>UniRef50_A0FDR3 Cluster: Cathepsin L-like proteinase; n=2;
Endopterygota|Rep: Cathepsin L-like proteinase - Bombyx
mori (Silk moth)
Length = 402
Score = 31.5 bits (68), Expect = 8.8
Identities = 11/45 (24%), Positives = 24/45 (53%)
Frame = +2
Query: 173 LVKEEWNTFKMEHSKVYDNEMEDKFRMKIYAENKHNIAKHNQRRL 307
L + W+ +K H+K+Y + + + + +N +A+HN+ L
Sbjct: 95 LPRRHWHEYKAIHNKLYSSTHHEMAALMKWRQNLRRVARHNREYL 139
>UniRef50_A0CKJ6 Cluster: Chromosome undetermined scaffold_2, whole
genome shotgun sequence; n=3; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_2,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 354
Score = 31.5 bits (68), Expect = 8.8
Identities = 17/51 (33%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Frame = -1
Query: 276 CLFSAYIFMRNLSSISLSYTLLCSILKVFH--SSFTRSKKETAPTVATTRT 130
C++S ++NL S+ + L S+L H SSFTRSK +++ ++ R+
Sbjct: 45 CIYSTMSNIQNLQSLKNQISQLQSVLTQQHRKSSFTRSKADSSTNMSNDRS 95
Database: uniref50
Posted date: Oct 5, 2007 11:19 AM
Number of letters in database: 575,637,011
Number of sequences in database: 1,657,284
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.279 0.0580 0.190
Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 313,289,215
Number of Sequences: 1657284
Number of extensions: 4808949
Number of successful extensions: 18114
Number of sequences better than 10.0: 93
Number of HSP's better than 10.0 without gapping: 16823
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 18021
length of database: 575,637,011
effective HSP length: 93
effective length of database: 421,509,599
effective search space used: 23604537544
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)