BLASTX 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= I10A02NGRL0001_H19
         (643 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precurs...    51   3e-05
UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathep...    48   3e-04
UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (M...    48   3e-04
UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n...    47   4e-04
UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra er...    47   4e-04
UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [C...    46   6e-04
UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|...    46   0.001
UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine prot...    46   0.001
UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata...    45   0.001
UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin ...    45   0.002
UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase pr...    45   0.002
UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteina...    44   0.002
UniRef50_UPI0000E224BB Cluster: PREDICTED: hypothetical protein;...    44   0.003
UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsi...    44   0.003
UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n...    44   0.004
UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin ...    43   0.005
UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin ...    43   0.005
UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleosto...    43   0.005
UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]...    42   0.010
UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; ...    42   0.010
UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Re...    42   0.013
UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Ca...    41   0.022
UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n...    41   0.022
UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3...    41   0.022
UniRef50_Q4TI44 Cluster: Chromosome undetermined SCAF2412, whole...    41   0.029
UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma j...    41   0.029
UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviat...    41   0.029
UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrio...    40   0.039
UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropica...    40   0.051
UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; ...    40   0.067
UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicate...    40   0.067
UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platy...    39   0.089
UniRef50_UPI0000E46171 Cluster: PREDICTED: hypothetical protein;...    39   0.12
UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; ...    39   0.12
UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cy...    39   0.12
UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; D...    39   0.12
UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchoc...    39   0.12
UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba ...    38   0.16
UniRef50_Q5BTK3 Cluster: SJCHGC00358 protein; n=1; Schistosoma j...    38   0.16
UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin h...    38   0.21
UniRef50_UPI0000E492F4 Cluster: PREDICTED: similar to cathepsin ...    38   0.21
UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaste...    38   0.21
UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep:...    38   0.21
UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1...    38   0.21
UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep...    38   0.21
UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Ca...    38   0.27
UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep:...    37   0.36
UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2;...    37   0.36
UniRef50_A4VEB9 Cluster: Putative uncharacterized protein; n=1; ...    37   0.48
UniRef50_A3EXS2 Cluster: Cathepsin L-like cysteine proteinase-li...    36   0.63
UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cat...    36   0.83
UniRef50_A1VZU0 Cluster: Cation efflux family protein; n=12; Cam...    36   0.83
UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin ...    36   1.1
UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine protei...    36   1.1
UniRef50_Q2FUP2 Cluster: Uncharacterized membrane protein requir...    36   1.1
UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cy...    35   1.5
UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain...    35   1.9
UniRef50_Q4C5V5 Cluster: Protein kinase; n=1; Crocosphaera watso...    33   4.4
UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n...    33   4.4
UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cy...    33   4.4
UniRef50_Q9BJX3 Cluster: Putative ATPase; n=1; Trypanosoma cruzi...    33   5.9
UniRef50_UPI0000E4A202 Cluster: PREDICTED: similar to Transmembr...    33   7.7
UniRef50_A4Y8H0 Cluster: Putative uncharacterized protein; n=1; ...    33   7.7
UniRef50_Q09FA3 Cluster: Heme maturase; n=2; Tetrahymena|Rep: He...    33   7.7
UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18...    33   7.7

>UniRef50_P25782 Cluster: Digestive cysteine proteinase 2 precursor; n=3; Metazoa|Rep: Digestive cysteine proteinase 2 precursor - Homarus americanus (American lobster)
          Length = 323

 Score = 50.8 bits (116), Expect = 3e-05
 Identities = 21/26 (80%), Positives = 24/26 (92%)
 Frame = +3

Query: 3   DLGYIKMARNRNNHCGIASAASYPLV 80
           D GYIKM+RNRNN+CGIA+ ASYPLV
Sbjct: 298 DAGYIKMSRNRNNNCGIATVASYPLV 323

>UniRef50_A5HJW4 Cluster: Cathepsin L; n=3; Coelomata|Rep: Cathepsin L - Misgurnus mizolepis (Mud loach)
          Length = 337

 Score = 47.6 bits (108), Expect = 3e-04
 Identities = 20/26 (76%), Positives = 23/26 (88%)
 Frame = +3

Query: 3   DLGYIKMARNRNNHCGIASAASYPLV 80
           D GYI MA++R NHCGIA+AASYPLV
Sbjct: 312 DKGYIYMAKDRKNHCGIATAASYPLV 337

>UniRef50_P07711 Cluster: Cathepsin L precursor (EC 3.4.22.15) (Major excreted protein) (MEP) [Contains: Cathepsin L heavy chain; Cathepsin L light chain]; n=19; Eukaryota|Rep: Cathepsin L precursor (EC 3.4.22.15) (Major excreted protein) (MEP) [Contains: Cathepsin L heavy chain; Cathepsin L light chain] - Homo sapiens (Human)
          Length = 333

 Score = 47.6 bits (108), Expect = 3e-04
 Identities = 19/24 (79%), Positives = 22/24 (91%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GY+KMA++R NHCGIASAASYP V
Sbjct: 310 GYVKMAKDRRNHCGIASAASYPTV 333

>UniRef50_Q3YPH2 Cluster: Cathepsin L-like cysteine proteinase; n=21; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Globodera pallida
          Length = 379

 Score = 46.8 bits (106), Expect = 4e-04
 Identities = 20/24 (83%), Positives = 22/24 (91%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GYI+MARNR N+CGIAS ASYPLV
Sbjct: 356 GYIRMARNRKNNCGIASHASYPLV 379

>UniRef50_O02586 Cluster: Cysteine proteinase; n=3; Spirometra erinaceieuropaei|Rep: Cysteine proteinase - Spirometra erinaceieuropaei (Tapeworm)
          Length = 336

 Score = 46.8 bits (106), Expect = 4e-04
 Identities = 20/24 (83%), Positives = 21/24 (87%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GY+KMARNRNN CGIAS ASYP V
Sbjct: 313 GYLKMARNRNNMCGIASMASYPTV 336
>UniRef50_Q26636 Cluster: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain]; n=71; Coelomata|Rep: Cathepsin L precursor (EC 3.4.22.15) [Contains: Cathepsin L heavy chain; Cathepsin L light chain] - Sarcophaga peregrina (Flesh fly) (Boettcherisca peregrina)
          Length = 339

 Score = 46.4 bits (105), Expect = 6e-04
 Identities = 19/24 (79%), Positives = 22/24 (91%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GYIKMARN+NN CGIA+A+SYP V
Sbjct: 316 GYIKMARNQNNQCGIATASSYPTV 339

>UniRef50_Q8WQZ8 Cluster: Cysteine proteinase; n=2; Acanthamoeba|Rep: Cysteine proteinase - Acanthamoeba healyi
          Length = 330

 Score = 45.6 bits (103), Expect = 0.001
 Identities = 18/22 (81%), Positives = 22/22 (100%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYP 74
           GYIKM+RN+NN+CGIA+AASYP
Sbjct: 307 GYIKMSRNQNNNCGIATAASYP 328

>UniRef50_A2I407 Cluster: Putative cathepsin L-like cysteine protease; n=1; Maconellicoccus hirsutus|Rep: Putative cathepsin L-like cysteine protease - Maconellicoccus hirsutus (hibiscus mealybug)
          Length = 339

 Score = 45.6 bits (103), Expect = 0.001
 Identities = 17/24 (70%), Positives = 23/24 (95%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GYI+MARN+NNHCGIA++AS P++
Sbjct: 316 GYIRMARNKNNHCGIATSASVPML 339

>UniRef50_O60911 Cluster: Cathepsin L2 precursor; n=42; Coelomata|Rep: Cathepsin L2 precursor - Homo sapiens (Human)
          Length = 334

 Score = 45.2 bits (102), Expect = 0.001
 Identities = 17/24 (70%), Positives = 23/24 (95%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GY+K+A+++NNHCGIA+AASYP V
Sbjct: 311 GYVKIAKDKNNHCGIATAASYPNV 334

>UniRef50_UPI00015B62BC Cluster: PREDICTED: similar to cathepsin L-like protease; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to cathepsin L-like protease - Nasonia vitripennis
          Length = 353

 Score = 44.8 bits (101), Expect = 0.002
 Identities = 17/24 (70%), Positives = 21/24 (87%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GY K+ARNR NHCGIA+AA YP++
Sbjct: 330 GYFKVARNRRNHCGIAAAAVYPVI 353
>UniRef50_O18456 Cluster: Cathepsin S-like cysteine proteinase precursor; n=2; Bilateria|Rep: Cathepsin S-like cysteine proteinase precursor - Heterodera glycines (Soybean cyst nematode worm)
          Length = 353

 Score = 44.8 bits (101), Expect = 0.002
 Identities = 17/24 (70%), Positives = 22/24 (91%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GYI++ARN+ NHCGIA+ ASYP+V
Sbjct: 330 GYIRIARNKQNHCGIATMASYPVV 353

>UniRef50_O96086 Cluster: Cathepsin L-like tick cysteine proteinase B; n=1; Haemaphysalis longicornis|Rep: Cathepsin L-like tick cysteine proteinase B - Haemaphysalis longicornis (Bush tick)
          Length = 332

 Score = 44.4 bits (100), Expect = 0.002
 Identities = 19/26 (73%), Positives = 22/26 (84%)
 Frame = +3

Query: 3   DLGYIKMARNRNNHCGIASAASYPLV 80
           D GYI M RN++N CGIAS+ASYPLV
Sbjct: 307 DEGYIYMTRNQDNQCGIASSASYPLV 332

>UniRef50_UPI0000E224BB Cluster: PREDICTED: hypothetical protein; n=1; Pan troglodytes|Rep: PREDICTED: hypothetical protein - Pan troglodytes
          Length = 143

 Score = 44.0 bits (99), Expect = 0.003
 Identities = 18/24 (75%), Positives = 22/24 (91%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GYIKMA++R N+CGIA+AASYP V
Sbjct: 120 GYIKMAKDRRNNCGIATAASYPTV 143

>UniRef50_Q6A1I0 Cluster: Cathepsin L; n=3; Metazoa|Rep: Cathepsin L - Suberites domuncula (Sponge)
          Length = 324

 Score = 44.0 bits (99), Expect = 0.003
 Identities = 18/24 (75%), Positives = 21/24 (87%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GYI M+RNR N+CGIAS ASYP+V
Sbjct: 301 GYIMMSRNRRNNCGIASQASYPIV 324

>UniRef50_UPI0000501FDB Cluster: UPI0000501FDB related cluster; n=1; Rattus norvegicus|Rep: UPI0000501FDB UniRef100 entry - Rattus norvegicus
          Length = 338

 Score = 43.6 bits (98), Expect = 0.004
 Identities = 16/24 (66%), Positives = 22/24 (91%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GY+K+A++RNNHCGIA+ A YP+V
Sbjct: 315 GYMKIAKDRNNHCGIATFAQYPIV 338

>UniRef50_UPI0000D566EF Cluster: PREDICTED: similar to cathepsin S preproprotein; n=2; Tribolium castaneum|Rep: PREDICTED: similar to cathepsin S preproprotein - Tribolium castaneum
          Length = 525

 Score = 43.2 bits (97), Expect = 0.005
 Identities = 15/23 (65%), Positives = 20/23 (86%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPL 77
           GY+K+ARNRNNHCGI +  +YP+
Sbjct: 501 GYVKIARNRNNHCGITNRITYPI 523

>UniRef50_UPI0000D55A76 Cluster: PREDICTED: similar to Cathepsin K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2); n=2; Tribolium castaneum|Rep: PREDICTED: similar to Cathepsin K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2) - Tribolium castaneum
          Length = 332

 Score = 43.2 bits (97), Expect = 0.005
 Identities = 17/23 (73%), Positives = 20/23 (86%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPL 77
           GY+KMARNR N CGIA+ ASYP+
Sbjct: 309 GYVKMARNRRNQCGIATHASYPV 331

>UniRef50_P25774 Cluster: Cathepsin S precursor; n=78; Euteleostomi|Rep: Cathepsin S precursor - Homo sapiens (Human)
          Length = 331

 Score = 43.2 bits (97), Expect = 0.005
 Identities = 17/22 (77%), Positives = 19/22 (86%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYP 74
           GYI+MARN+ NHCGIAS  SYP
Sbjct: 308 GYIRMARNKGNHCGIASFPSYP 329

>UniRef50_Q80UB0 Cluster: Testin-2 precursor [Contains: Testin-1]; n=11; Eutheria|Rep: Testin-2 precursor [Contains: Testin-1] - Mus musculus (Mouse)
          Length = 333

 Score = 42.3 bits (95), Expect = 0.010
 Identities = 16/24 (66%), Positives = 22/24 (91%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GYIK+A++ NNHCGIA+ A+YP+V
Sbjct: 310 GYIKIAKDWNNHCGIATLATYPIV 333

>UniRef50_Q24940 Cluster: Cathepsin L-like proteinase precursor; n=35; Fasciola|Rep: Cathepsin L-like proteinase precursor - Fasciola hepatica (Liver fluke)
          Length = 326

 Score = 42.3 bits (95), Expect = 0.010
 Identities = 19/26 (73%), Positives = 21/26 (80%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV*R 86
           GYI+MARNR N CGIAS AS P+V R
Sbjct: 299 GYIRMARNRGNMCGIASLASLPMVAR 324

>UniRef50_Q9JIA9 Cluster: Cathepsin R precursor; n=30; Muridae|Rep: Cathepsin R precursor - Mus musculus (Mouse)
          Length = 334

 Score = 41.9 bits (94), Expect = 0.013
 Identities = 15/24 (62%), Positives = 21/24 (87%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GY+K+A+++NNHCGIAS A YP +
Sbjct: 311 GYMKLAKDKNNHCGIASYAHYPTI 334

>UniRef50_Q568D6 Cluster: Cathepsin K; n=13; Euteleostomi|Rep: Cathepsin K - Danio rerio (Zebrafish) (Brachydanio rerio)
          Length = 333

 Score = 41.1 bits (92), Expect = 0.022
 Identities = 16/24 (66%), Positives = 21/24 (87%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GY+ MARNRNN CGIA+ AS+P++
Sbjct: 310 GYVLMARNRNNACGIANLASFPVM 333

>UniRef50_Q3ZCX6 Cluster: Cathepsin L-like cysteine proteinase; n=3; Bilateria|Rep: Cathepsin L-like cysteine proteinase - Longidorus elongatus
          Length = 358

 Score = 41.1 bits (92), Expect = 0.022
 Identities = 17/26 (65%), Positives = 21/26 (80%)
 Frame = +3

Query: 3   DLGYIKMARNRNNHCGIASAASYPLV 80
           D GYI M+R +NN+CGIA+ ASYP V
Sbjct: 331 DDGYILMSRRKNNNCGIATMASYPFV 356

>UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3; Bilateria|Rep: Cathepsin L-like cysteine protease - Neobenedenia melleni
          Length = 335

 Score = 41.1 bits (92), Expect = 0.022
 Identities = 15/24 (62%), Positives = 21/24 (87%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GYI M+RN++N CG+A+ ASYP+V
Sbjct: 312 GYILMSRNKDNQCGVATVASYPIV 335

>UniRef50_Q4TI44 Cluster: Chromosome undetermined SCAF2412, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF2412, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer)
          Length = 123

 Score = 40.7 bits (91), Expect = 0.029
 Identities = 17/24 (70%), Positives = 20/24 (83%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GYI MARNR N CGIA+ ASYP++
Sbjct: 100 GYILMARNRGNLCGIANLASYPIM 123

>UniRef50_Q5DAH1 Cluster: SJCHGC06231 protein; n=1; Schistosoma japonicum|Rep: SJCHGC06231 protein - Schistosoma japonicum (Blood fluke)
          Length = 372

 Score = 40.7 bits (91), Expect = 0.029
 Identities = 16/26 (61%), Positives = 21/26 (80%)
 Frame = +3

Query: 3   DLGYIKMARNRNNHCGIASAASYPLV 80
           D GY+K+ ++  N CG+ASAASYPLV
Sbjct: 347 DKGYVKILKDSKNMCGVASAASYPLV 372

>UniRef50_Q0PZI3 Cluster: Cathepsin L 2; n=1; Diaprepes abbreviatus|Rep: Cathepsin L 2 - Diaprepes abbreviatus (Sugarcane rootstalk borer weevil)
          Length = 348

 Score = 40.7 bits (91), Expect = 0.029
 Identities = 16/24 (66%), Positives = 20/24 (83%)
 Frame = +3

Query: 3   DLGYIKMARNRNNHCGIASAASYP 74
           D GYI+MA+++NN CGIA  ASYP
Sbjct: 323 DQGYIRMAKDKNNQCGIALMASYP 346

>UniRef50_Q69G21 Cluster: Cathepsin L-like protein; n=2; Tenebrionidae|Rep: Cathepsin L-like protein - Tenebrio molitor (Yellow mealworm)
          Length = 336

 Score = 40.3 bits (90), Expect = 0.039
 Identities = 16/22 (72%), Positives = 17/22 (77%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYP 74
           GY K+ARN NNHCGIA  AS P
Sbjct: 313 GYFKIARNANNHCGIAGVASVP 334

>UniRef50_Q5BL90 Cluster: LOC594890 protein; n=9; Xenopus tropicalis|Rep: LOC594890 protein - Xenopus tropicalis (Western clawed frog) (Silurana tropicalis)
          Length = 355

 Score = 39.9 bits (89), Expect = 0.051
 Identities = 16/26 (61%), Positives = 21/26 (80%)
 Frame = +3

Query: 3   DLGYIKMARNRNNHCGIASAASYPLV 80
           D GYIKMARN +N+CGIA+   +P+V
Sbjct: 330 DEGYIKMARNHHNNCGIANFGCFPVV 355

>UniRef50_Q70EW9 Cluster: Cathepsin L-like proteinase precursor; n=6; Chrysomelidae|Rep: Cathepsin L-like proteinase precursor - Diabrotica virgifera virgifera (western corn rootworm)
          Length = 326

 Score = 39.5 bits (88), Expect = 0.067
 Identities = 15/24 (62%), Positives = 20/24 (83%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GYI M+RN+NN CGIA+ A+YP +
Sbjct: 303 GYIWMSRNKNNQCGIATDATYPTI 326

>UniRef50_Q1MTY8 Cluster: Silicatein a3; n=15; root|Rep: Silicatein a3 - Lubomirskia baicalensis
          Length = 344

 Score = 39.5 bits (88), Expect = 0.067
 Identities = 16/24 (66%), Positives = 19/24 (79%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GYIKM RN+ N CGIAS A YP++
Sbjct: 321 GYIKMVRNKYNQCGIASDALYPML 344

>UniRef50_Q0WYD8 Cluster: Cathepsin L-like proteinase; n=2; Platyhelminthes|Rep: Cathepsin L-like proteinase - Echinococcus multilocularis
          Length = 338

 Score = 39.1 bits (87), Expect = 0.089
 Identities = 16/24 (66%), Positives = 20/24 (83%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GYI MAR++ N CGIA+ ASYPL+
Sbjct: 315 GYIWMARDKGNMCGIATMASYPLI 338

>UniRef50_UPI0000E46171 Cluster: PREDICTED: hypothetical protein; n=4; Strongylocentrotus purpuratus|Rep: PREDICTED: hypothetical protein - Strongylocentrotus purpuratus
          Length = 892

 Score = 38.7 bits (86), Expect = 0.12
 Identities = 16/22 (72%), Positives = 18/22 (81%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYP 74
           GYI +ARNRNN CGIA+ A YP
Sbjct: 868 GYINIARNRNNMCGIATDAIYP 889

>UniRef50_Q2M437 Cluster: Cathepsin-like cysteine protease; n=1; Phytophthora infestans|Rep: Cathepsin-like cysteine protease - Phytophthora infestans (Potato late blight fungus)
          Length = 376

 Score = 38.7 bits (86), Expect = 0.12
 Identities = 14/24 (58%), Positives = 21/24 (87%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GYI M+RN++N CGIA+ A+YP++
Sbjct: 331 GYIMMSRNKDNQCGIATDATYPIM 354

>UniRef50_A2GD33 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3
          Length = 234

 Score = 38.7 bits (86), Expect = 0.12
 Identities = 15/24 (62%), Positives = 20/24 (83%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GYIKM+RN++N CGIA+ A  PL+
Sbjct: 210 GYIKMSRNKDNQCGIATEAVIPLI 233

>UniRef50_P04989 Cluster: Cysteine proteinase 2 precursor; n=2; Dictyostelium discoideum|Rep: Cysteine proteinase 2 precursor - Dictyostelium discoideum (Slime mold)
          Length = 376

 Score = 38.7 bits (86), Expect = 0.12
 Identities = 15/23 (65%), Positives = 20/23 (86%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPL 77
           GYI M+++R N+CGIAS +SYPL
Sbjct: 353 GYILMSKDRKNNCGIASVSSYPL 375

>UniRef50_O17473 Cluster: Cathepsin L-like precursor; n=9; Onchocercidae|Rep: Cathepsin L-like precursor - Brugia pahangi (Filarial nematode worm)
          Length = 395

 Score = 38.7 bits (86), Expect = 0.12
 Identities = 16/23 (69%), Positives = 19/23 (82%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPL 77
           GY+ MARNR N C IASAAS+P+
Sbjct: 373 GYVYMARNRGNMCHIASAASFPI 395

>UniRef50_Q9XY38 Cluster: Cysteine proteinase; n=1; Acanthamoeba culbertsoni|Rep: Cysteine proteinase - Acanthamoeba culbertsoni
          Length = 482

 Score = 38.3 bits (85), Expect = 0.16
 Identities = 16/24 (66%), Positives = 19/24 (79%)
 Frame = +3

Query: 3   DLGYIKMARNRNNHCGIASAASYP 74
           D GY+ MARN+NN+CGIAS A  P
Sbjct: 348 DDGYVYMARNKNNNCGIASLAVLP 371

>UniRef50_Q5BTK3 Cluster: SJCHGC00358 protein; n=1; Schistosoma japonicum|Rep: SJCHGC00358 protein - Schistosoma japonicum (Blood fluke)
          Length = 78

 Score = 38.3 bits (85), Expect = 0.16
 Identities = 16/26 (61%), Positives = 20/26 (76%)
 Frame = +3

Query: 3  DLGYIKMARNRNNHCGIASAASYPLV 80
          D GYIK+ARN +N C IAS A YP++
Sbjct: 53 DQGYIKLARNHSNMCHIASYAYYPVI 78

>UniRef50_UPI0001554DAA Cluster: PREDICTED: similar to ferritin heavy chain; n=3; Amniota|Rep: PREDICTED: similar to ferritin heavy chain - Ornithorhynchus anatinus
          Length = 338

 Score = 37.9 bits (84), Expect = 0.21
 Identities = 12/24 (50%), Positives = 20/24 (83%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GY+++ +  NNHCG+AS AS+P++
Sbjct: 315 GYMRLLKGANNHCGVASVASFPVL 338

>UniRef50_UPI0000E492F4 Cluster: PREDICTED: similar to cathepsin L-like proteinase; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to cathepsin L-like proteinase - Strongylocentrotus purpuratus
          Length = 329

 Score = 37.9 bits (84), Expect = 0.21
 Identities = 16/24 (66%), Positives = 18/24 (75%)
 Frame = +3

Query: 3   DLGYIKMARNRNNHCGIASAASYP 74
           D GYI +ARN NN CGIA+ A YP
Sbjct: 303 DNGYINIARNHNNMCGIATDAIYP 326

>UniRef50_Q9VNK6 Cluster: CG11459-PA; n=1; Drosophila melanogaster|Rep: CG11459-PA - Drosophila melanogaster (Fruit fly)
          Length = 336

 Score = 37.9 bits (84), Expect = 0.21
 Identities = 14/22 (63%), Positives = 17/22 (77%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYP 74
           GY+K+ARN NN CG+AS   YP
Sbjct: 313 GYLKLARNANNMCGVASLPQYP 334

>UniRef50_Q70TB1 Cluster: Silicatein beta; n=3; Demospongiae|Rep: Silicatein beta - Suberites domuncula (Sponge)
          Length = 383

 Score = 37.9 bits (84), Expect = 0.21
 Identities = 15/22 (68%), Positives = 19/22 (86%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYP 74
           GY K+ARN+ N CGIA+AAS+P
Sbjct: 360 GYGKLARNKGNKCGIATAASFP 381

>UniRef50_Q54TR1 Cluster: Counting factor associated protein; n=1; Dictyostelium discoideum AX4|Rep: Counting factor associated protein - Dictyostelium discoideum AX4
          Length = 531

 Score = 37.9 bits (84), Expect = 0.21
 Identities = 14/23 (60%), Positives = 19/23 (82%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPL 77
           GY+ MARN NN CG++S A+YP+
Sbjct: 505 GYVYMARNDNNLCGVSSQATYPI 527

>UniRef50_Q27645 Cluster: Cysteine proteinase; n=7; Entamoeba|Rep: Cysteine proteinase - Entamoeba histolytica
          Length = 320

 Score = 37.9 bits (84), Expect = 0.21
 Identities = 15/22 (68%), Positives = 18/22 (81%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYP 74
           GYI M+RN+NN CGIA+ A YP
Sbjct: 291 GYILMSRNKNNQCGIANDAIYP 312

>UniRef50_Q967D5 Cluster: Cathepsin; n=1; Geodia cydonium|Rep: Cathepsin - Geodia cydonium (Sponge)
          Length = 322

 Score = 37.5 bits (83), Expect = 0.27
 Identities = 15/24 (62%), Positives = 21/24 (87%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           G + M+RNR+N+CGIA+ ASYP+V
Sbjct: 296 GDMMMSRNRDNNCGIATMASYPVV 319

>UniRef50_Q7SXQ7 Cluster: Cathepsin; n=1; Petromyzon marinus|Rep: Cathepsin - Petromyzon marinus (Sea lamprey)
          Length = 333

 Score = 37.1 bits (82), Expect = 0.36
 Identities = 14/24 (58%), Positives = 18/24 (75%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GYI M RN++N CGIAS   YP++
Sbjct: 310 GYIYMIRNKDNQCGIASIGIYPII 333

>UniRef50_Q75PZ5 Cluster: Cahepsin L-like cysteine protease; n=2; Brugia malayi|Rep: Cahepsin L-like cysteine protease - Brugia malayi (Filarial nematode worm)
          Length = 371

 Score = 37.1 bits (82), Expect = 0.36
 Identities = 15/24 (62%), Positives = 18/24 (75%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GY+K+ARN+ N CGI   A YPLV
Sbjct: 343 GYLKLARNQENMCGIGFYACYPLV 366

>UniRef50_A4VEB9 Cluster: Putative uncharacterized protein; n=1; Tetrahymena thermophila SB210|Rep: Putative uncharacterized protein - Tetrahymena thermophila SB210
          Length = 175

 Score = 36.7 bits (81), Expect = 0.48
 Identities = 24/76 (31%), Positives = 38/76 (50%), Gaps = 3/76 (3%)
 Frame = -2

Query: 399 LFNFIKNICKCVITSSIRYDSLFKI*QL--TLGAVILLTFIYLRDVIIFFSLLLFCNYFF 226
           LFN  K IC+     SI    ++ I  L   L  +ILL F++   +  FF  +L C +F
Sbjct: 36  LFNLKKIICRQFYFKSIYECIIYNIIYLFFQLKCLILLAFLFQNKINGFFIFILICGFFV 95

Query: 225 LILFRIPDVK-YNVYV 181
           L  F+   +K +N ++
Sbjct: 96  LFYFKSSCLKGFNTFL 111

>UniRef50_A3EXS2 Cluster: Cathepsin L-like cysteine proteinase-like protein; n=1; Maconellicoccus hirsutus|Rep: Cathepsin L-like cysteine proteinase-like protein - Maconellicoccus hirsutus (hibiscus mealybug)
          Length = 253

 Score = 36.3 bits (80), Expect = 0.63
 Identities = 14/24 (58%), Positives = 19/24 (79%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GY+++ARNRNN CGIA    YP++
Sbjct: 229 GYMRLARNRNNLCGIAHIFYYPVL 252

>UniRef50_Q1LUB5 Cluster: Novel protein similar to vertebrate cathepsin L; n=4; Danio rerio|Rep: Novel protein similar to vertebrate cathepsin L - Danio rerio (Zebrafish) (Brachydanio rerio)
          Length = 334

 Score = 35.9 bits (79), Expect = 0.83
 Identities = 14/24 (58%), Positives = 18/24 (75%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GY++M RN  N CGIAS A YP++
Sbjct: 311 GYMRMIRNGKNTCGIASYALYPII 334

>UniRef50_A1VZU0 Cluster: Cation efflux family protein; n=12; Campylobacter|Rep: Cation efflux family protein - Campylobacter jejuni subsp. jejuni serotype O:23/36 (strain 81-176)
          Length = 295

 Score = 35.9 bits (79), Expect = 0.83
 Identities = 31/92 (33%), Positives = 45/92 (48%), Gaps = 7/92 (7%)
 Frame = -2

Query: 498 FIFYSSKLKITY----KDKGICFHF*NSVLII*DFLVLF-NFIKNICKCVITSS--IRYD 340
           FIFY S LKI Y    KD     +     LI+  FLVLF N++    K +I  S  + Y
Sbjct: 91  FIFYESILKIYYKEEIKDLNSSIYVMIFALIMTFFLVLFLNYVAKKTKSLIIESDALHYK 150

Query: 339 SLFKI*QLTLGAVILLTFIYLRDVIIFFSLLL 244
           +       TLGA++L+ F  L  +   F +++
Sbjct: 151 TDCLTNACTLGALVLIYFTNLHIIDAIFGIVI 182

>UniRef50_UPI0000F2B3F1 Cluster: PREDICTED: similar to cathepsin L preproprotein; n=1; Monodelphis domestica|Rep: PREDICTED: similar to cathepsin L preproprotein - Monodelphis domestica
          Length = 356

 Score = 35.5 bits (78), Expect = 1.1
 Identities = 14/26 (53%), Positives = 20/26 (76%)
 Frame = +3

Query: 3   DLGYIKMARNRNNHCGIASAASYPLV 80
           D GYI + ++R N CGIAS A+YP++
Sbjct: 331 DRGYIYIPKDRYNQCGIASNANYPIL 356

>UniRef50_Q7YXL2 Cluster: Cathepsin-L-like midgut cysteine proteinase; n=4; Tenebrio molitor|Rep: Cathepsin-L-like midgut cysteine proteinase - Tenebrio molitor (Yellow mealworm)
          Length = 330

 Score = 35.5 bits (78), Expect = 1.1
 Identities = 14/22 (63%), Positives = 17/22 (77%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYP 74
           GY +  RN  N+CGIA+AASYP
Sbjct: 307 GYWRQVRNYGNNCGIATAASYP 328

>UniRef50_Q2FUP2 Cluster: Uncharacterized membrane protein required for N-linked glycosylation- like precursor; n=1; Methanospirillum hungatei JF-1|Rep: Uncharacterized membrane protein required for N-linked glycosylation- like precursor - Methanospirillum hungatei (strain JF-1 / DSM 864)
          Length = 808

 Score = 35.5 bits (78), Expect = 1.1
 Identities = 30/94 (31%), Positives = 46/94 (48%), Gaps = 1/94 (1%)
 Frame = -2

Query: 405 LVLFNFIKNICKCVITSSIRYDSLFKI*QLTLGAVILLTFIYLRDVIIF-FSLLLFCNYF 229
           ++LF  I  +   +IT   +++S      L  G V +LT I +R  I+F   L L   YF
Sbjct: 369 VMLFLLIPALILLLITGIRKHNSNIVYLLLWTGFVGVLTIINIRFEILFAVPLTLTSAYF 428

Query: 228 FLILFRIPDVKYNVYVN*ESDQSSRKRYVNMRNR 127
              LF   D +  +Y N ES Q+S  + +N + R
Sbjct: 429 LDWLFHFNDQRTEIY-NEESCQNSTYKIINWKKR 461

>UniRef50_A2F139 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3
          Length = 291

 Score = 35.1 bits (77), Expect = 1.5
 Identities = 12/23 (52%), Positives = 20/23 (86%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPL 77
           GY+++A+++NN CG+A+ AS PL
Sbjct: 267 GYMRLAKDKNNMCGVATMASIPL 289

>UniRef50_Q6QRP7 Cluster: Digestive cysteine proteinase intestain; n=9; Cucujiformia|Rep: Digestive cysteine proteinase intestain - Leptinotarsa decemlineata (Colorado potato beetle)
          Length = 326

 Score = 34.7 bits (76), Expect = 1.9
 Identities = 13/24 (54%), Positives = 18/24 (75%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GY ++ R+ NN CGIA  ASYP++
Sbjct: 303 GYFRIKRDANNLCGIADKASYPIL 326

>UniRef50_Q4C5V5 Cluster: Protein kinase; n=1; Crocosphaera watsonii WH 8501|Rep: Protein kinase - Crocosphaera watsonii
          Length = 590

 Score = 33.5 bits (73), Expect = 4.4
 Identities = 21/80 (26%), Positives = 35/80 (43%), Gaps = 1/80 (1%)
 Frame = -2

Query: 444 FHF*NSVLII*DFLVLFNFIKNICKCVITSSIRYDSLFKI*QLTLGAVILLTFIY-LRDV 268
           F F   +++    L+++NF K   K    S   Y    K     +  +  L  +     +
Sbjct: 366 FLFVTHIILSPGLLIIYNFQKRYRKIRYLSRKLYRKFKKTYLPQIAMIFFLMILRSFLSI 425

Query: 267 IIFFSLLLFCNYFFLILFRI 208
            IF  + LFC+Y  LILF++
Sbjct: 426 SIFIGIYLFCSYLVLILFKL 445

>UniRef50_Q6R3Q1 Cluster: Cathepsin L-like cysteine proteinase; n=2; Taeniidae|Rep: Cathepsin L-like cysteine proteinase - Taenia solium (Pork tapeworm)
          Length = 339

 Score = 33.5 bits (73), Expect = 4.4
 Identities = 13/24 (54%), Positives = 18/24 (75%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAASYPLV 80
           GYI MA++ +N CG+AS A +P V
Sbjct: 316 GYIMMAKDYHNMCGVASLADFPYV 339

>UniRef50_A2E346 Cluster: Clan CA, family C1, cathepsin L-like cysteine peptidase; n=9; Trichomonas vaginalis|Rep: Clan CA, family C1, cathepsin L-like cysteine peptidase - Trichomonas vaginalis G3
          Length = 306

 Score = 33.5 bits (73), Expect = 4.4
 Identities = 11/19 (57%), Positives = 16/19 (84%)
 Frame = +3

Query: 9   GYIKMARNRNNHCGIASAA 65
           GY++M RN+NN CG+A+ A
Sbjct: 282 GYVRMIRNKNNQCGVATEA 300

>UniRef50_Q9BJX3 Cluster: Putative ATPase; n=1; Trypanosoma cruzi|Rep: Putative ATPase - Trypanosoma cruzi
          Length = 201

 Score = 33.1 bits (72), Expect = 5.9
 Identities = 14/27 (51%), Positives = 20/27 (74%)
 Frame = -2

Query: 294 LTFIYLRDVIIFFSLLLFCNYFFLILF 214
           L F++L   ++FFSLLLFC+Y F + F
Sbjct: 61  LFFVFL---LLFFSLLLFCHYLFFVCF 84

>UniRef50_UPI0000E4A202 Cluster: PREDICTED: similar to Transmembrane protein 39a; n=3; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to Transmembrane protein 39a - Strongylocentrotus purpuratus
          Length = 526

 Score = 32.7 bits (71), Expect = 7.7
 Identities = 24/81 (29%), Positives = 44/81 (54%), Gaps = 7/81 (8%)
 Frame = -2

Query: 429 SVLII*DFLVLFNFIKN---ICKCVITSSIRYDSLFKI*QLTLGAVILLTFIYLRD---- 271
           S+L+  ++ VLF  +++   + K +    +R  +LF    L LG +    ++ +R
Sbjct: 431 SLLLFCNYYVLFKLLRSRFILGKFMFHDPLRVLNLFVF--LQLGVIATQLYLLIRHSHWH 488

Query: 270 VIIFFSLLLFCNYFFLILFRI 208
            +I  SLLLFCNY+  +LF++
Sbjct: 489 YVISLSLLLFCNYY--VLFKL 507

>UniRef50_A4Y8H0 Cluster: Putative uncharacterized protein; n=1; Shewanella putrefaciens CN-32|Rep: Putative uncharacterized protein - Shewanella putrefaciens CN-32
          Length = 382

 Score = 32.7 bits (71), Expect = 7.7
 Identities = 21/55 (38%), Positives = 33/55 (60%)
 Frame = -2

Query: 429 SVLII*DFLVLFNFIKNICKCVITSSIRYDSLFKI*QLTLGAVILLTFIYLRDVI 265
           S+LI+  FL+LF FI +I   + +++IR+  LF +    LG +    FIYL  V+
Sbjct: 149 SILILNRFLILFIFISSIIVYLCSANIRFKKLFVV---ILGVIF---FIYLFGVL 197

>UniRef50_Q09FA3 Cluster: Heme maturase; n=2; Tetrahymena|Rep: Heme maturase - Tetrahymena malaccensis
          Length = 519

 Score = 32.7 bits (71), Expect = 7.7
 Identities = 21/84 (25%), Positives = 44/84 (52%)
 Frame = -2

Query: 429 SVLII*DFLVLFNFIKNICKCVITSSIRYDSLFKI*QLTLGAVILLTFIYLRDVIIFFSL 250
           +++++  F +LF ++++I   ++ + I  +  F I       ++L+  ++    I FFS
Sbjct: 31  NLILLKSFFILFYYVESIYVSILNNLINLNYNFII-------ILLIFLLFFNQKIKFFSY 83

Query: 249 LLFCNYFFLILFRIPDVKYNVYVN 178
           +LF   FF+I F +    YN  +N
Sbjct: 84  ILF---FFIIFFELL-TSYNYTIN 103

>UniRef50_Q8H166 Cluster: Thiol protease aleurain precursor; n=18; Magnoliophyta|Rep: Thiol protease aleurain precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 358

 Score = 32.7 bits (71), Expect = 7.7
 Identities = 15/26 (57%), Positives = 18/26 (69%)
 Frame = +3

Query: 3   DLGYIKMARNRNNHCGIASAASYPLV 80
           D GY KM   +N  CGIA+ ASYP+V
Sbjct: 333 DKGYFKMEMGKNM-CGIATCASYPVV 357

  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284

Lambda     K      H
   0.318    0.134    0.401

Gapped
Lambda     K      H
   0.279   0.0580    0.190

Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 411,632,798
Number of Sequences: 1657284
Number of extensions: 6253141
Number of successful extensions: 13865
Number of sequences better than 10.0: 65
Number of HSP's better than 10.0 without gapping: 13371
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 13850
length of database: 575,637,011
effective HSP length: 97
effective length of database: 414,880,463
effective search space used: 48126133708
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)