SilkBase

Last updated: 2022/11/18
BLASTX 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= msgV0723.Seq
         (748 letters)

Database: uniref50 
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_Q8GEG0 Cluster: Putative uncharacterized protein; n=1; ...    95   2e-18
UniRef50_Q47336 Cluster: LacZ-alpha peptide; n=2; cellular organ...    95   2e-18
UniRef50_Q37953 Cluster: LacZ protein; n=1; Phage M13mp18|Rep: L...    95   2e-18
UniRef50_P00722 Cluster: Beta-galactosidase; n=35; root|Rep: Bet...    95   2e-18
UniRef50_UPI0000498F17 Cluster: beta-galactosidase; n=3; Eukaryo...    77   4e-13
UniRef50_Q669R9 Cluster: Beta-galactosidase; n=14; Yersinia|Rep:...    65   2e-09
UniRef50_A2VBJ9 Cluster: Non-ribosomal peptide synthetase; n=1; ...    58   2e-07
UniRef50_A7MN76 Cluster: Putative uncharacterized protein; n=1; ...    55   2e-06
UniRef50_P06219 Cluster: Beta-galactosidase; n=11; Gammaproteoba...    53   9e-06
UniRef50_P07856 Cluster: Sericin 1 precursor; n=4; Bombyx mori|R...    47   4e-04
UniRef50_Q61343 Cluster: Beta-D-galactosidase fusion protein; n=...    44   0.004
UniRef50_P81650 Cluster: Beta-galactosidase; n=26; Gammaproteoba...    42   0.016
UniRef50_A2U9U3 Cluster: GCN5-related N-acetyltransferase; n=1; ...    36   0.80 
UniRef50_Q48727 Cluster: Beta-galactosidase; n=3; Lactococcus la...    36   1.1  
UniRef50_A0UVE2 Cluster: Glycoside hydrolase family 2, TIM barre...    36   1.4  
UniRef50_Q15XN9 Cluster: Glycoside hydrolase family 2, TIM barre...    35   1.8  
UniRef50_Q40538 Cluster: Nicotiana tabacum (clone 5) activating ...    35   1.8  
UniRef50_Q9JN59 Cluster: Beta-galactosidase; n=16; Vibrio choler...    35   2.4  
UniRef50_A7LU08 Cluster: Putative uncharacterized protein; n=1; ...    34   4.3  
UniRef50_A0M224 Cluster: Beta-galactosidase; n=1; Gramella forse...    34   4.3  
UniRef50_Q4X214 Cluster: C6 finger domain protein, putative; n=7...    34   4.3  
UniRef50_Q99GT5 Cluster: ORF131; n=3; Nucleopolyhedrovirus|Rep: ...    33   5.6  
UniRef50_A0ZLG1 Cluster: Beta-D-galactosidase; n=1; Nodularia sp...    33   7.5  
UniRef50_Q5DC94 Cluster: SJCHGC09076 protein; n=1; Schistosoma j...    33   7.5  
UniRef50_A6G4K3 Cluster: Putative uncharacterized protein; n=1; ...    33   9.9  
UniRef50_A6FJQ2 Cluster: 50S ribosomal protein L5; n=8; Bacteria...    33   9.9  

>UniRef50_Q8GEG0 Cluster: Putative uncharacterized protein; n=1;
           Erwinia amylovora|Rep: Putative uncharacterized protein
           - Erwinia amylovora (Fire blight bacteria)
          Length = 123

 Score = 94.7 bits (225), Expect = 2e-18
 Identities = 42/42 (100%), Positives = 42/42 (100%)
 Frame = +2

Query: 383 LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPS 508
           LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPS
Sbjct: 68  LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPS 109


>UniRef50_Q47336 Cluster: LacZ-alpha peptide; n=2; cellular
           organisms|Rep: LacZ-alpha peptide - Escherichia coli
          Length = 90

 Score = 94.7 bits (225), Expect = 2e-18
 Identities = 42/42 (100%), Positives = 42/42 (100%)
 Frame = +2

Query: 383 LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPS 508
           LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPS
Sbjct: 22  LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPS 63


>UniRef50_Q37953 Cluster: LacZ protein; n=1; Phage M13mp18|Rep: LacZ
           protein - Phage M13mp18
          Length = 102

 Score = 94.7 bits (225), Expect = 2e-18
 Identities = 42/42 (100%), Positives = 42/42 (100%)
 Frame = +2

Query: 383 LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPS 508
           LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPS
Sbjct: 26  LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPS 67


>UniRef50_P00722 Cluster: Beta-galactosidase; n=35; root|Rep:
           Beta-galactosidase - Escherichia coli (strain K12)
          Length = 1024

 Score = 94.7 bits (225), Expect = 2e-18
 Identities = 42/42 (100%), Positives = 42/42 (100%)
 Frame = +2

Query: 383 LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPS 508
           LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPS
Sbjct: 8   LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPS 49


>UniRef50_UPI0000498F17 Cluster: beta-galactosidase; n=3;
           Eukaryota|Rep: beta-galactosidase - Entamoeba
           histolytica HM-1:IMSS
          Length = 86

 Score = 77.4 bits (182), Expect = 4e-13
 Identities = 39/55 (70%), Positives = 44/55 (80%), Gaps = 2/55 (3%)
 Frame = +3

Query: 381 HWPSFYNVVTGKTLALPNLIALQHIPLSPAGVIAKRPAPIALPT--VAQPEWRMA 539
           HWPSFYNVVTGKTLALPNLIALQHIPLSPAGVI++  A    P+  +   +WRMA
Sbjct: 5   HWPSFYNVVTGKTLALPNLIALQHIPLSPAGVISEE-ARTDRPSQQLRSLKWRMA 58


>UniRef50_Q669R9 Cluster: Beta-galactosidase; n=14; Yersinia|Rep:
           Beta-galactosidase - Yersinia pseudotuberculosis
          Length = 1066

 Score = 65.3 bits (152), Expect = 2e-09
 Identities = 27/42 (64%), Positives = 32/42 (76%)
 Frame = +2

Query: 383 LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPS 508
           L  +L RRDWENP +TQ +RL AHPPF SWR+ E A+ DRPS
Sbjct: 15  LPQILSRRDWENPQITQYHRLEAHPPFHSWRDVESAQKDRPS 56


>UniRef50_A2VBJ9 Cluster: Non-ribosomal peptide synthetase; n=1;
           uncultured bacterium|Rep: Non-ribosomal peptide
           synthetase - uncultured bacterium
          Length = 338

 Score = 58.4 bits (135), Expect = 2e-07
 Identities = 33/57 (57%), Positives = 35/57 (61%), Gaps = 2/57 (3%)
 Frame = -2

Query: 432 WVTPGFSQSRRCKTTASEL*YDSL*GELGTGPPLEVDGIDKLDIEF--AAALLHKPG 268
           W   GF     C        YDSL GELGTGPPLEVDGIDKLDIEF   +A L+K G
Sbjct: 260 WSKTGFRPF--CLEAGRRAYYDSLYGELGTGPPLEVDGIDKLDIEFPIESARLYKTG 314


>UniRef50_A7MN76 Cluster: Putative uncharacterized protein; n=1;
           Enterobacter sakazakii ATCC BAA-894|Rep: Putative
           uncharacterized protein - Enterobacter sakazakii ATCC
           BAA-894
          Length = 1043

 Score = 55.2 bits (127), Expect = 2e-06
 Identities = 20/44 (45%), Positives = 30/44 (68%)
 Frame = +2

Query: 383 LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSNS 514
           LA +L R DW+NP +T +NRL +H P   WR+++ AR   PS++
Sbjct: 18  LATILARNDWQNPAITSVNRLPSHTPLHGWRDADRARRGEPSDA 61


>UniRef50_P06219 Cluster: Beta-galactosidase; n=11;
           Gammaproteobacteria|Rep: Beta-galactosidase - Klebsiella
           pneumoniae
          Length = 1034

 Score = 52.8 bits (121), Expect = 9e-06
 Identities = 23/40 (57%), Positives = 27/40 (67%)
 Frame = +2

Query: 392 VLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSN 511
           VL R DW N  +T LNRL AHP FASWR+   AR + PS+
Sbjct: 17  VLAREDWHNQTITHLNRLPAHPVFASWRDELAARDNLPSS 56


>UniRef50_P07856 Cluster: Sericin 1 precursor; n=4; Bombyx mori|Rep:
            Sericin 1 precursor - Bombyx mori (Silk moth)
          Length = 1186

 Score = 47.2 bits (107), Expect = 4e-04
 Identities = 20/22 (90%), Positives = 20/22 (90%)
 Frame = -1

Query: 262  KNILCFENIFDIPYHLRKNIGV 197
            K  LCFENIFDIPYHLRKNIGV
Sbjct: 1165 KICLCFENIFDIPYHLRKNIGV 1186


>UniRef50_Q61343 Cluster: Beta-D-galactosidase fusion protein; n=1;
           Mus musculus|Rep: Beta-D-galactosidase fusion protein -
           Mus musculus (Mouse)
          Length = 121

 Score = 44.0 bits (99), Expect = 0.004
 Identities = 19/20 (95%), Positives = 20/20 (100%)
 Frame = -3

Query: 353 NWVPGPPSRSTVSISLISNS 294
           +WVPGPPSRSTVSISLISNS
Sbjct: 18  SWVPGPPSRSTVSISLISNS 37


>UniRef50_P81650 Cluster: Beta-galactosidase; n=26;
           Gammaproteobacteria|Rep: Beta-galactosidase -
           Pseudoalteromonas haloplanktis (Alteromonas
           haloplanktis)
          Length = 1039

 Score = 41.9 bits (94), Expect = 0.016
 Identities = 16/39 (41%), Positives = 25/39 (64%)
 Frame = +2

Query: 392 VLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPS 508
           ++ RRDWENP   Q+N++ AH P   ++  E+AR +  S
Sbjct: 7   IINRRDWENPITVQVNQVKAHSPLNGFKTIEDARENTQS 45


>UniRef50_A2U9U3 Cluster: GCN5-related N-acetyltransferase; n=1;
           Bacillus coagulans 36D1|Rep: GCN5-related
           N-acetyltransferase - Bacillus coagulans 36D1
          Length = 180

 Score = 36.3 bits (80), Expect = 0.80
 Identities = 16/19 (84%), Positives = 18/19 (94%)
 Frame = -3

Query: 347 VPGPPSRSTVSISLISNSR 291
           VPGPPSRSTVSISL+ NS+
Sbjct: 2   VPGPPSRSTVSISLMFNSK 20


>UniRef50_Q48727 Cluster: Beta-galactosidase; n=3; Lactococcus
           lactis|Rep: Beta-galactosidase - Lactococcus lactis
           subsp. lactis (Streptococcus lactis)
          Length = 998

 Score = 35.9 bits (79), Expect = 1.1
 Identities = 14/23 (60%), Positives = 17/23 (73%)
 Frame = +2

Query: 392 VLQRRDWENPGVTQLNRLAAHPP 460
           VL+R+DWENP V+  NRL  H P
Sbjct: 9   VLERKDWENPVVSNWNRLPMHTP 31


>UniRef50_A0UVE2 Cluster: Glycoside hydrolase family 2, TIM barrel;
           n=1; Clostridium cellulolyticum H10|Rep: Glycoside
           hydrolase family 2, TIM barrel - Clostridium
           cellulolyticum H10
          Length = 1033

 Score = 35.5 bits (78), Expect = 1.4
 Identities = 12/29 (41%), Positives = 20/29 (68%)
 Frame = +2

Query: 404 RDWENPGVTQLNRLAAHPPFASWRNSEEA 490
           R+WEN  +TQ+NR   H P+ ++ + E+A
Sbjct: 3   REWENQYITQINRYPMHSPYGAYESVEQA 31


>UniRef50_Q15XN9 Cluster: Glycoside hydrolase family 2, TIM barrel
           precursor; n=1; Pseudoalteromonas atlantica T6c|Rep:
           Glycoside hydrolase family 2, TIM barrel precursor -
           Pseudoalteromonas atlantica (strain T6c / BAA-1087)
          Length = 1079

 Score = 35.1 bits (77), Expect = 1.8
 Identities = 19/47 (40%), Positives = 24/47 (51%)
 Frame = +2

Query: 401 RRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSNSCAA*MANGK 541
           + DWENP V Q+NRL A     S+   E+A T R  N       NG+
Sbjct: 31  KNDWENPDVIQINRLPARATSYSFDTPEQALT-RDRNQSTIQSLNGQ 76


>UniRef50_Q40538 Cluster: Nicotiana tabacum (clone 5) activating
           factor DNA sequence; n=2; Eukaryota|Rep: Nicotiana
           tabacum (clone 5) activating factor DNA sequence -
           Nicotiana tabacum (Common tobacco)
          Length = 56

 Score = 35.1 bits (77), Expect = 1.8
 Identities = 15/16 (93%), Positives = 15/16 (93%)
 Frame = -2

Query: 348 GTGPPLEVDGIDKLDI 301
           GTGPPLEV GIDKLDI
Sbjct: 22  GTGPPLEVRGIDKLDI 37


>UniRef50_Q9JN59 Cluster: Beta-galactosidase; n=16; Vibrio
           cholerae|Rep: Beta-galactosidase - Vibrio cholerae
          Length = 56

 Score = 34.7 bits (76), Expect = 2.4
 Identities = 14/40 (35%), Positives = 22/40 (55%)
 Frame = +2

Query: 392 VLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSN 511
           +L  +DW+NP + + +    H P  S+R  +EAR D   N
Sbjct: 7   ILLSQDWQNPHIVKWHCRTPHVPLHSYRTEQEARLDVGGN 46


>UniRef50_A7LU08 Cluster: Putative uncharacterized protein; n=1;
           Bacteroides ovatus ATCC 8483|Rep: Putative
           uncharacterized protein - Bacteroides ovatus ATCC 8483
          Length = 1046

 Score = 33.9 bits (74), Expect = 4.3
 Identities = 15/47 (31%), Positives = 22/47 (46%)
 Frame = +2

Query: 398 QRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSNSCAA*MANG 538
           Q  +WENP   + N+   H  F  +  +E+A  D+P  S      NG
Sbjct: 26  QNNEWENPAKYEWNKERPHADFRLYEQAEDAVNDKPRKSSWQHSLNG 72


>UniRef50_A0M224 Cluster: Beta-galactosidase; n=1; Gramella forsetii
           KT0803|Rep: Beta-galactosidase - Gramella forsetii
           (strain KT0803)
          Length = 1049

 Score = 33.9 bits (74), Expect = 4.3
 Identities = 17/44 (38%), Positives = 21/44 (47%)
 Frame = +2

Query: 407 DWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSNSCAA*MANG 538
           DWENP VT +N+L A     S+ N + A      NS      NG
Sbjct: 26  DWENPAVTGINKLPARATMYSFSNKQAAINLNKENSDRVKSLNG 69


>UniRef50_Q4X214 Cluster: C6 finger domain protein, putative; n=7;
           Trichocomaceae|Rep: C6 finger domain protein, putative -
           Aspergillus fumigatus (Sartorya fumigata)
          Length = 1148

 Score = 33.9 bits (74), Expect = 4.3
 Identities = 12/24 (50%), Positives = 16/24 (66%)
 Frame = -3

Query: 386 PVNCNTTHYRANWVPGPPSRSTVS 315
           PV  N   +R  W+PGPP+RS +S
Sbjct: 619 PVTDNPPDFRKEWIPGPPTRSVLS 642


>UniRef50_Q99GT5 Cluster: ORF131; n=3; Nucleopolyhedrovirus|Rep:
           ORF131 - Helicoverpa zea SNPV
          Length = 192

 Score = 33.5 bits (73), Expect = 5.6
 Identities = 15/35 (42%), Positives = 22/35 (62%)
 Frame = +2

Query: 152 LKHYKEYSKSCLVVLNTDILTEMVRNIEYVFEAKD 256
           L H +  +K  +V LN D + E V+NI++VFE  D
Sbjct: 149 LNHLRSINKQKIVFLNGDHVEEYVQNIKHVFERND 183


>UniRef50_A0ZLG1 Cluster: Beta-D-galactosidase; n=1; Nodularia
           spumigena CCY 9414|Rep: Beta-D-galactosidase - Nodularia
           spumigena CCY 9414
          Length = 72

 Score = 33.1 bits (72), Expect = 7.5
 Identities = 13/13 (100%), Positives = 13/13 (100%)
 Frame = +2

Query: 470 WRNSEEARTDRPS 508
           WRNSEEARTDRPS
Sbjct: 47  WRNSEEARTDRPS 59


>UniRef50_Q5DC94 Cluster: SJCHGC09076 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC09076 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 109

 Score = 33.1 bits (72), Expect = 7.5
 Identities = 17/43 (39%), Positives = 25/43 (58%)
 Frame = +2

Query: 386 AVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSNS 514
           A  L+RR+ +NPG  QLN L A P F     +++A  +R S +
Sbjct: 57  AAFLKRREGKNPGCPQLNPLEALPLFPGGEKTKKAPPNRLSKN 99


>UniRef50_A6G4K3 Cluster: Putative uncharacterized protein; n=1;
           Plesiocystis pacifica SIR-1|Rep: Putative
           uncharacterized protein - Plesiocystis pacifica SIR-1
          Length = 531

 Score = 32.7 bits (71), Expect = 9.9
 Identities = 15/34 (44%), Positives = 22/34 (64%)
 Frame = +3

Query: 405 VTGKTLALPNLIALQHIPLSPAGVIAKRPAPIAL 506
           V+G  L LP+L+     PL P G++A++  PIAL
Sbjct: 64  VSGPNLQLPHLVDELATPLPPTGLLARKMDPIAL 97


>UniRef50_A6FJQ2 Cluster: 50S ribosomal protein L5; n=8;
           Bacteria|Rep: 50S ribosomal protein L5 - Moritella sp.
           PE36
          Length = 45

 Score = 32.7 bits (71), Expect = 9.9
 Identities = 15/19 (78%), Positives = 16/19 (84%)
 Frame = -1

Query: 511 VGRAIGAGLFAITPAGERG 455
           +GRAIGAGLFAITP  E G
Sbjct: 20  LGRAIGAGLFAITPEFELG 38


  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 734,906,775
Number of Sequences: 1657284
Number of extensions: 14750369
Number of successful extensions: 35538
Number of sequences better than 10.0: 26
Number of HSP's better than 10.0 without gapping: 34417
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 35526
length of database: 575,637,011
effective HSP length: 99
effective length of database: 411,565,895
effective search space used: 61323318355
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
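
Note on the statistics above: the sketch below is an illustrative check, not part of the BLAST output. It assumes the standard Karlin-Altschul relation E = (search space) * 2^(-bit score) and assumes the effective database length is the raw length minus (number of sequences * effective HSP length). The function names are hypothetical. The results should land close to the reported Expect values; small differences are expected because the printed bit scores are rounded and BLAST applies its own length adjustments.

```python
# Illustrative check of the reported statistics (assumed formulas, see note above).

def effective_db_length(db_letters: int, n_sequences: int, hsp_length: int) -> int:
    """Raw database length minus one effective HSP length per sequence (assumed)."""
    return db_letters - n_sequences * hsp_length


def evalue_from_bits(bit_score: float, search_space: int) -> float:
    """Expected number of chance hits for a bit score in a given search space."""
    return search_space * 2.0 ** (-bit_score)


# Values copied from the report above.
DB_LETTERS = 575_637_011
N_SEQUENCES = 1_657_284
HSP_LENGTH = 99
SEARCH_SPACE = 61_323_318_355

if __name__ == "__main__":
    print("effective database length:",
          effective_db_length(DB_LETTERS, N_SEQUENCES, HSP_LENGTH))
    # Bit scores of the top hit and the Sericin 1 hit, as printed in the report.
    for bits in (94.7, 47.2):
        print(f"bits={bits}: E ~ {evalue_from_bits(bits, SEARCH_SPACE):.1e}")
```

The effective database length computed this way agrees with the "effective length of database" line, which suggests the subtraction assumption holds for this run.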
