SilkBase

Last updated: 2022/11/18
BLASTX 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= heS30197
         (623 letters)

Database: uniref50 
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_Q8GEG0 Cluster: Putative uncharacterized protein; n=1; ...   102   8e-21
UniRef50_Q47336 Cluster: LacZ-alpha peptide; n=2; cellular organ...   102   8e-21
UniRef50_Q37953 Cluster: LacZ protein; n=1; Phage M13mp18|Rep: L...   102   8e-21
UniRef50_P00722 Cluster: Beta-galactosidase; n=35; root|Rep: Bet...   102   8e-21
UniRef50_UPI0000498F17 Cluster: beta-galactosidase; n=3; Eukaryo...    77   3e-13
UniRef50_Q669R9 Cluster: Beta-galactosidase; n=14; Yersinia|Rep:...    67   4e-10
UniRef50_A7MN76 Cluster: Putative uncharacterized protein; n=1; ...    54   2e-06
UniRef50_P06219 Cluster: Beta-galactosidase; n=11; Gammaproteoba...    54   3e-06
UniRef50_P81650 Cluster: Beta-galactosidase; n=26; Gammaproteoba...    44   0.002
UniRef50_A0ZLG1 Cluster: Beta-D-galactosidase; n=1; Nodularia sp...    41   0.028
UniRef50_A6FJQ2 Cluster: 50S ribosomal protein L5; n=8; Bacteria...    40   0.064
UniRef50_A7BPF2 Cluster: LacZ alpha peptide; n=1; Beggiatoa sp. ...    38   0.20 
UniRef50_Q4Z0C1 Cluster: Putative uncharacterized protein; n=3; ...    37   0.34 
UniRef50_Q48727 Cluster: Beta-galactosidase; n=3; Lactococcus la...    36   0.79 
UniRef50_A0UVE2 Cluster: Glycoside hydrolase family 2, TIM barre...    36   1.0  
UniRef50_A2VBJ9 Cluster: Non-ribosomal peptide synthetase; n=1; ...    35   1.8  
UniRef50_Q9JN59 Cluster: Beta-galactosidase; n=16; Vibrio choler...    34   2.4  
UniRef50_Q4X214 Cluster: C6 finger domain protein, putative; n=7...    34   2.4  
UniRef50_Q15XN9 Cluster: Glycoside hydrolase family 2, TIM barre...    34   3.2  
UniRef50_Q5DC94 Cluster: SJCHGC09076 protein; n=1; Schistosoma j...    33   4.2  
UniRef50_A6G4K3 Cluster: Putative uncharacterized protein; n=1; ...    33   7.3  
UniRef50_Q2WGJ9 Cluster: C8orfK23 protein; n=24; Euteleostomi|Re...    33   7.3  
UniRef50_UPI0000F1EDC6 Cluster: PREDICTED: hypothetical protein;...    32   9.7  
UniRef50_UPI000038DE68 Cluster: COG0457: FOG: TPR repeat; n=1; N...    32   9.7  
UniRef50_A7LU08 Cluster: Putative uncharacterized protein; n=1; ...    32   9.7  
UniRef50_A0M224 Cluster: Beta-galactosidase; n=1; Gramella forse...    32   9.7  
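
Each row of the summary table above ends with a bit score and an E-value, preceded by the sequence identifier and a truncated description. A minimal sketch of how such a row might be split into fields follows; the function name `parse_hit_line` and the regular expression are illustrative assumptions, not part of BLAST or SilkBase.

```python
import re

# Hypothetical pattern for one row of the "Sequences producing significant
# alignments" table: identifier, free-text description, bit score, E-value.
HIT_LINE = re.compile(r"^(\S+)\s+(.*?)\s+(\d+(?:\.\d+)?)\s+([\de]\S*)\s*$")


def parse_hit_line(line):
    """Return (seq_id, description, bits, e_value), or None if no match."""
    m = HIT_LINE.match(line)
    if m is None:
        return None
    seq_id, desc, bits, evalue = m.groups()
    # Legacy BLAST can print very small E-values without a mantissa
    # (for example "e-100"); prepend "1" so float() accepts them.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return seq_id, desc, float(bits), float(evalue)


if __name__ == "__main__":
    row = "UniRef50_Q669R9 Cluster: Beta-galactosidase; n=14; Yersinia|Rep:...    67   4e-10"
    print(parse_hit_line(row))
```

The description is captured lazily, so the last two whitespace-separated tokens on the row are always taken as score and E-value, even when the description itself contains digits such as `n=14;`.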

>UniRef50_Q8GEG0 Cluster: Putative uncharacterized protein; n=1;
           Erwinia amylovora|Rep: Putative uncharacterized protein
           - Erwinia amylovora (Fire blight bacteria)
          Length = 123

 Score =  102 bits (244), Expect = 8e-21
 Identities = 46/46 (100%), Positives = 46/46 (100%)
 Frame = +3

Query: 72  LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQLR 209
           LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQLR
Sbjct: 68  LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQLR 113
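
In BLASTX output the Query coordinates are nucleotide positions while the Sbjct coordinates are amino-acid positions, so the query span above (72 to 209) covers three times as many letters as the 46 aligned residues. The sketch below shows one way to check a reported frame and aligned length from the coordinates. The helper names are hypothetical, and the reverse-strand rule is an assumption that happens to agree with the negative-frame alignments in this report.

```python
def blastx_frame(query_from, query_to, query_len):
    """Infer the reading frame from 1-based nucleotide coordinates."""
    if query_from <= query_to:
        # Forward strand: frame +1 starts at nucleotide 1, +2 at 2, +3 at 3.
        return (query_from - 1) % 3 + 1
    # Reverse strand (assumed convention): frame -1 starts at the last
    # nucleotide of the query, -2 one position earlier, and so on.
    return -((query_len - query_from) % 3 + 1)


def aligned_codons(query_from, query_to):
    """Number of query codons covered by the alignment (gaps excluded)."""
    return (abs(query_to - query_from) + 1) // 3


if __name__ == "__main__":
    # First alignment in this report: Query 72..209, query length 623.
    print(blastx_frame(72, 209, 623), aligned_codons(72, 209))
```

For the first alignment this gives frame +3 and 46 codons, matching the `Frame = +3` and `Identities = 46/46` lines.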


>UniRef50_Q47336 Cluster: LacZ-alpha peptide; n=2; cellular
           organisms|Rep: LacZ-alpha peptide - Escherichia coli
          Length = 90

 Score =  102 bits (244), Expect = 8e-21
 Identities = 46/46 (100%), Positives = 46/46 (100%)
 Frame = +3

Query: 72  LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQLR 209
           LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQLR
Sbjct: 22  LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQLR 67


>UniRef50_Q37953 Cluster: LacZ protein; n=1; Phage M13mp18|Rep: LacZ
           protein - Phage M13mp18
          Length = 102

 Score =  102 bits (244), Expect = 8e-21
 Identities = 46/46 (100%), Positives = 46/46 (100%)
 Frame = +3

Query: 72  LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQLR 209
           LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQLR
Sbjct: 26  LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQLR 71


>UniRef50_P00722 Cluster: Beta-galactosidase; n=35; root|Rep:
           Beta-galactosidase - Escherichia coli (strain K12)
          Length = 1024

 Score =  102 bits (244), Expect = 8e-21
 Identities = 46/46 (100%), Positives = 46/46 (100%)
 Frame = +3

Query: 72  LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQLR 209
           LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQLR
Sbjct: 8   LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQLR 53


>UniRef50_UPI0000498F17 Cluster: beta-galactosidase; n=3;
           Eukaryota|Rep: beta-galactosidase - Entamoeba
           histolytica HM-1:IMSS
          Length = 86

 Score = 77.4 bits (182), Expect = 3e-13
 Identities = 39/55 (70%), Positives = 44/55 (80%), Gaps = 2/55 (3%)
 Frame = +1

Query: 70  HWPSFYNVVTGKTLALPNLIALQHIPLSPAGVIAKRPAPIALPNS--CAXEWRMA 228
           HWPSFYNVVTGKTLALPNLIALQHIPLSPAGVI++  A    P+    + +WRMA
Sbjct: 5   HWPSFYNVVTGKTLALPNLIALQHIPLSPAGVISEE-ARTDRPSQQLRSLKWRMA 58


>UniRef50_Q669R9 Cluster: Beta-galactosidase; n=14; Yersinia|Rep:
           Beta-galactosidase - Yersinia pseudotuberculosis
          Length = 1066

 Score = 66.9 bits (156), Expect = 4e-10
 Identities = 28/44 (63%), Positives = 33/44 (75%)
 Frame = +3

Query: 72  LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQ 203
           L  +L RRDWENP +TQ +RL AHPPF SWR+ E A+ DRPS Q
Sbjct: 15  LPQILSRRDWENPQITQYHRLEAHPPFHSWRDVESAQKDRPSPQ 58


>UniRef50_A7MN76 Cluster: Putative uncharacterized protein; n=1;
           Enterobacter sakazakii ATCC BAA-894|Rep: Putative
           uncharacterized protein - Enterobacter sakazakii ATCC
           BAA-894
          Length = 1043

 Score = 54.4 bits (125), Expect = 2e-06
 Identities = 20/42 (47%), Positives = 28/42 (66%)
 Frame = +3

Query: 72  LAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPS 197
           LA +L R DW+NP +T +NRL +H P   WR+++ AR   PS
Sbjct: 18  LATILARNDWQNPAITSVNRLPSHTPLHGWRDADRARRGEPS 59


>UniRef50_P06219 Cluster: Beta-galactosidase; n=11;
           Gammaproteobacteria|Rep: Beta-galactosidase - Klebsiella
           pneumoniae
          Length = 1034

 Score = 54.0 bits (124), Expect = 3e-06
 Identities = 24/43 (55%), Positives = 28/43 (65%)
 Frame = +3

Query: 81  VLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQLR 209
           VL R DW N  +T LNRL AHP FASWR+   AR + PS + R
Sbjct: 17  VLAREDWHNQTITHLNRLPAHPVFASWRDELAARDNLPSSRRR 59


>UniRef50_P81650 Cluster: Beta-galactosidase; n=26;
           Gammaproteobacteria|Rep: Beta-galactosidase -
           Pseudoalteromonas haloplanktis (Alteromonas
           haloplanktis)
          Length = 1039

 Score = 44.4 bits (100), Expect = 0.002
 Identities = 17/41 (41%), Positives = 27/41 (65%)
 Frame = +3

Query: 81  VLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQ 203
           ++ RRDWENP   Q+N++ AH P   ++  E+AR +  SQ+
Sbjct: 7   IINRRDWENPITVQVNQVKAHSPLNGFKTIEDARENTQSQK 47


>UniRef50_A0ZLG1 Cluster: Beta-D-galactosidase; n=1; Nodularia
           spumigena CCY 9414|Rep: Beta-D-galactosidase - Nodularia
           spumigena CCY 9414
          Length = 72

 Score = 40.7 bits (91), Expect = 0.028
 Identities = 17/17 (100%), Positives = 17/17 (100%)
 Frame = +3

Query: 159 WRNSEEARTDRPSQQLR 209
           WRNSEEARTDRPSQQLR
Sbjct: 47  WRNSEEARTDRPSQQLR 63


>UniRef50_A6FJQ2 Cluster: 50S ribosomal protein L5; n=8;
           Bacteria|Rep: 50S ribosomal protein L5 - Moritella sp.
           PE36
          Length = 45

 Score = 39.5 bits (88), Expect = 0.064
 Identities = 22/29 (75%), Positives = 23/29 (79%)
 Frame = -1

Query: 230 FAIRHSXAQLLGRAIGAGLFAITPAGERG 144
           FAI+   AQLLGRAIGAGLFAITP  E G
Sbjct: 12  FAIQ--AAQLLGRAIGAGLFAITPEFELG 38


>UniRef50_A7BPF2 Cluster: LacZ alpha peptide; n=1; Beggiatoa sp.
           SS|Rep: LacZ alpha peptide - Beggiatoa sp. SS
          Length = 73

 Score = 37.9 bits (84), Expect = 0.20
 Identities = 18/28 (64%), Positives = 18/28 (64%)
 Frame = -2

Query: 553 GFPRQALNRGLPLGFDLVLYGTSTPKNL 470
           GFPRQALNRGLPLGF         PK L
Sbjct: 45  GFPRQALNRGLPLGFRFSALRHLDPKKL 72


>UniRef50_Q4Z0C1 Cluster: Putative uncharacterized protein; n=3;
           Plasmodium (Vinckeia)|Rep: Putative uncharacterized
           protein - Plasmodium berghei
          Length = 275

 Score = 37.1 bits (82), Expect = 0.34
 Identities = 16/16 (100%), Positives = 16/16 (100%)
 Frame = +1

Query: 19  RGGARYPIRPIVSRIT 66
           RGGARYPIRPIVSRIT
Sbjct: 260 RGGARYPIRPIVSRIT 275


>UniRef50_Q48727 Cluster: Beta-galactosidase; n=3; Lactococcus
           lactis|Rep: Beta-galactosidase - Lactococcus lactis
           subsp. lactis (Streptococcus lactis)
          Length = 998

 Score = 35.9 bits (79), Expect = 0.79
 Identities = 14/23 (60%), Positives = 17/23 (73%)
 Frame = +3

Query: 81  VLQRRDWENPGVTQLNRLAAHPP 149
           VL+R+DWENP V+  NRL  H P
Sbjct: 9   VLERKDWENPVVSNWNRLPMHTP 31


>UniRef50_A0UVE2 Cluster: Glycoside hydrolase family 2, TIM barrel;
           n=1; Clostridium cellulolyticum H10|Rep: Glycoside
           hydrolase family 2, TIM barrel - Clostridium
           cellulolyticum H10
          Length = 1033

 Score = 35.5 bits (78), Expect = 1.0
 Identities = 12/29 (41%), Positives = 20/29 (68%)
 Frame = +3

Query: 93  RDWENPGVTQLNRLAAHPPFASWRNSEEA 179
           R+WEN  +TQ+NR   H P+ ++ + E+A
Sbjct: 3   REWENQYITQINRYPMHSPYGAYESVEQA 31


>UniRef50_A2VBJ9 Cluster: Non-ribosomal peptide synthetase; n=1;
           uncultured bacterium|Rep: Non-ribosomal peptide
           synthetase - uncultured bacterium
          Length = 338

 Score = 34.7 bits (76), Expect = 1.8
 Identities = 18/39 (46%), Positives = 19/39 (48%)
 Frame = -2

Query: 121 WVTPGFSQSRRCKTTASEL*YDSL*GELGTGPPLETSSL 5
           W   GF     C        YDSL GELGTGPPLE   +
Sbjct: 260 WSKTGFRPF--CLEAGRRAYYDSLYGELGTGPPLEVDGI 296


>UniRef50_Q9JN59 Cluster: Beta-galactosidase; n=16; Vibrio
           cholerae|Rep: Beta-galactosidase - Vibrio cholerae
          Length = 56

 Score = 34.3 bits (75), Expect = 2.4
 Identities = 13/36 (36%), Positives = 21/36 (58%)
 Frame = +3

Query: 81  VLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTD 188
           +L  +DW+NP + + +    H P  S+R  +EAR D
Sbjct: 7   ILLSQDWQNPHIVKWHCRTPHVPLHSYRTEQEARLD 42


>UniRef50_Q4X214 Cluster: C6 finger domain protein, putative; n=7;
           Trichocomaceae|Rep: C6 finger domain protein, putative -
           Aspergillus fumigatus (Sartorya fumigata)
          Length = 1148

 Score = 34.3 bits (75), Expect = 2.4
 Identities = 13/24 (54%), Positives = 16/24 (66%)
 Frame = -3

Query: 75  PVNCNTTHYRANWVPGPPSRLVLS 4
           PV  N   +R  W+PGPP+R VLS
Sbjct: 619 PVTDNPPDFRKEWIPGPPTRSVLS 642


>UniRef50_Q15XN9 Cluster: Glycoside hydrolase family 2, TIM barrel
           precursor; n=1; Pseudoalteromonas atlantica T6c|Rep:
           Glycoside hydrolase family 2, TIM barrel precursor -
           Pseudoalteromonas atlantica (strain T6c / BAA-1087)
          Length = 1079

 Score = 33.9 bits (74), Expect = 3.2
 Identities = 15/32 (46%), Positives = 19/32 (59%)
 Frame = +3

Query: 90  RRDWENPGVTQLNRLAAHPPFASWRNSEEART 185
           + DWENP V Q+NRL A     S+   E+A T
Sbjct: 31  KNDWENPDVIQINRLPARATSYSFDTPEQALT 62


>UniRef50_Q5DC94 Cluster: SJCHGC09076 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC09076 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 109

 Score = 33.5 bits (73), Expect = 4.2
 Identities = 24/66 (36%), Positives = 29/66 (43%)
 Frame = +1

Query: 22  GGARYPIRPIVSRITIHWPSFYNVVTGKTLALPNLIALQHIPLSPAGVIAKRPAPIALPN 201
           GGAR PI P      I   +F     GK    P L  L+ +PL P G   K+ AP   PN
Sbjct: 39  GGARDPISPKGGPNKISGAAFLKRREGKNPGCPQLNPLEALPLFPGGEKTKK-AP---PN 94

Query: 202 SCAXEW 219
             +  W
Sbjct: 95  RLSKNW 100



 Score = 33.1 bits (72), Expect = 5.6
 Identities = 17/42 (40%), Positives = 25/42 (59%)
 Frame = +3

Query: 75  AVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQ 200
           A  L+RR+ +NPG  QLN L A P F     +++A  +R S+
Sbjct: 57  AAFLKRREGKNPGCPQLNPLEALPLFPGGEKTKKAPPNRLSK 98


>UniRef50_A6G4K3 Cluster: Putative uncharacterized protein; n=1;
           Plesiocystis pacifica SIR-1|Rep: Putative
           uncharacterized protein - Plesiocystis pacifica SIR-1
          Length = 531

 Score = 32.7 bits (71), Expect = 7.3
 Identities = 15/34 (44%), Positives = 22/34 (64%)
 Frame = +1

Query: 94  VTGKTLALPNLIALQHIPLSPAGVIAKRPAPIAL 195
           V+G  L LP+L+     PL P G++A++  PIAL
Sbjct: 64  VSGPNLQLPHLVDELATPLPPTGLLARKMDPIAL 97


>UniRef50_Q2WGJ9 Cluster: C8orfK23 protein; n=24; Euteleostomi|Rep:
            C8orfK23 protein - Homo sapiens (Human)
          Length = 1857

 Score = 32.7 bits (71), Expect = 7.3
 Identities = 21/68 (30%), Positives = 32/68 (47%), Gaps = 3/68 (4%)
 Frame = +1

Query: 58   RITIHWP-SFYNVVTGKTLALPNLIALQHIPLSPAGVIAKRPAPIALPN--SCAXEWRMA 228
            R+   WP S    +TGK  A  +L+  +    +P G   K P P+A PN    +  W M+
Sbjct: 1749 RVRGWWPFSKSKELTGKVEAEFHLVTAEEAEKNPVGKARKEPEPLAKPNRPDTSFSWFMS 1808

Query: 229  NCKR*YFV 252
              K  Y++
Sbjct: 1809 PFKCLYYL 1816


>UniRef50_UPI0000F1EDC6 Cluster: PREDICTED: hypothetical protein;
           n=1; Danio rerio|Rep: PREDICTED: hypothetical protein -
           Danio rerio
          Length = 195

 Score = 32.3 bits (70), Expect = 9.7
 Identities = 13/13 (100%), Positives = 13/13 (100%)
 Frame = +3

Query: 72  LAVVLQRRDWENP 110
           LAVVLQRRDWENP
Sbjct: 179 LAVVLQRRDWENP 191


>UniRef50_UPI000038DE68 Cluster: COG0457: FOG: TPR repeat; n=1;
           Nostoc punctiforme PCC 73102|Rep: COG0457: FOG: TPR
           repeat - Nostoc punctiforme PCC 73102
          Length = 532

 Score = 32.3 bits (70), Expect = 9.7
 Identities = 14/37 (37%), Positives = 23/37 (62%), Gaps = 1/37 (2%)
 Frame = +3

Query: 72  LAVVLQRRDWENPGVTQLNRLAAH-PPFASWRNSEEA 179
           + V+L+  DWE P + QL+ L ++  P  SW + +EA
Sbjct: 96  IPVLLRYADWETPPIDQLSPLPSNRKPIKSWNDRDEA 132


>UniRef50_A7LU08 Cluster: Putative uncharacterized protein; n=1;
           Bacteroides ovatus ATCC 8483|Rep: Putative
           uncharacterized protein - Bacteroides ovatus ATCC 8483
          Length = 1046

 Score = 32.3 bits (70), Expect = 9.7
 Identities = 12/36 (33%), Positives = 19/36 (52%)
 Frame = +3

Query: 87  QRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRP 194
           Q  +WENP   + N+   H  F  +  +E+A  D+P
Sbjct: 26  QNNEWENPAKYEWNKERPHADFRLYEQAEDAVNDKP 61


>UniRef50_A0M224 Cluster: Beta-galactosidase; n=1; Gramella forsetii
           KT0803|Rep: Beta-galactosidase - Gramella forsetii
           (strain KT0803)
          Length = 1049

 Score = 32.3 bits (70), Expect = 9.7
 Identities = 13/28 (46%), Positives = 17/28 (60%)
 Frame = +3

Query: 96  DWENPGVTQLNRLAAHPPFASWRNSEEA 179
           DWENP VT +N+L A     S+ N + A
Sbjct: 26  DWENPAVTGINKLPARATMYSFSNKQAA 53


  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 621,146,179
Number of Sequences: 1657284
Number of extensions: 12877944
Number of successful extensions: 29420
Number of sequences better than 10.0: 26
Number of HSP's better than 10.0 without gapping: 28676
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 29415
length of database: 575,637,011
effective HSP length: 97
effective length of database: 414,880,463
effective search space used: 45636850930
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
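
The statistics above are enough to roughly reproduce the scores reported in the alignments. The sketch below uses the standard Karlin-Altschul relations: bit score S' = (lambda * S - ln K) / ln 2 and E = (effective search space) * 2^(-S'). The constants are copied from this report, the function names are hypothetical, and because Lambda and K are printed to only three significant digits the results are approximate.

```python
import math

# Values copied from the statistics block of this report.
GAPPED_LAMBDA = 0.279
GAPPED_K = 0.0580
QUERY_NT = 623            # query length in nucleotides
DB_LETTERS = 575_637_011
DB_SEQS = 1_657_284
HSP_LEN = 97              # "effective HSP length"


def bit_score(raw, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw - math.log(k)) / math.log(2)


def effective_search_space():
    """Effective query length (in residues) times effective database length."""
    eff_query = QUERY_NT // 3 - HSP_LEN        # translated query, trimmed
    eff_db = DB_LETTERS - DB_SEQS * HSP_LEN    # each sequence trimmed
    return eff_query * eff_db


def expect(raw):
    """Approximate E-value for a raw score against this database."""
    return effective_search_space() * 2.0 ** (-bit_score(raw))


if __name__ == "__main__":
    print(effective_search_space())   # compare with "effective search space used"
    print(bit_score(244), expect(244))
```

With these inputs the effective search space matches the reported 45636850930 exactly. A raw score of 244 converts to about 102 bits with an E-value on the order of 1e-20, in line with the top four hits; the small difference from the printed 8e-21 is consistent with rounding in the printed Lambda and K.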
