BLASTP 2.2.12 [Aug-07-2005]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= BGIBMGA000736-TA|BGIBMGA000736-PA|undefined
(99 letters)
Database: uniref50
1,657,284 sequences; 575,637,011 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef50_UPI0000D57570 Cluster: PREDICTED: hypothetical protein;... 52 2e-06
UniRef50_UPI00015B4118 Cluster: PREDICTED: hypothetical protein;... 52 3e-06
UniRef50_A3W017 Cluster: Probable flavohemoprotein; n=2; Alphapr... 35 0.37
UniRef50_A0CN60 Cluster: Chromosome undetermined scaffold_22, wh... 34 0.49
UniRef50_A3DI35 Cluster: SMC protein-like protein; n=1; Clostrid... 33 1.5
UniRef50_Q6BRR0 Cluster: Similar to CA4605|IPF6226 Candida albic... 33 1.5
UniRef50_A4B8X7 Cluster: Sensor protein; n=1; Reinekea sp. MED29... 32 2.0
UniRef50_A7FVH4 Cluster: Sortase family protein; n=4; Clostridiu... 32 2.6
UniRef50_A3WI90 Cluster: Putative uncharacterized protein; n=1; ... 32 2.6
UniRef50_A1TTI9 Cluster: Amino acid adenylation domain; n=1; Aci... 32 2.6
UniRef50_A4S3P0 Cluster: Predicted protein; n=1; Ostreococcus lu... 31 3.5
UniRef50_Q18DG2 Cluster: Type II restriction enzyme, methylase s... 31 3.5
UniRef50_A7HN81 Cluster: Putative uncharacterized protein; n=1; ... 31 4.6
UniRef50_Q7PT84 Cluster: ENSANGP00000016024; n=3; Culicidae|Rep:... 31 4.6
UniRef50_Q9W9L2 Cluster: IIIa; n=3; Mastadenovirus|Rep: IIIa - P... 30 8.0
UniRef50_A1WZL7 Cluster: Lipopolysaccharide biosynthesis; n=1; H... 30 8.0
UniRef50_A7S5V5 Cluster: Predicted protein; n=2; Nematostella ve... 30 8.0
UniRef50_Q0UNS0 Cluster: Putative uncharacterized protein; n=1; ... 30 8.0
>UniRef50_UPI0000D57570 Cluster: PREDICTED: hypothetical protein;
n=1; Tribolium castaneum|Rep: PREDICTED: hypothetical
protein - Tribolium castaneum
Length = 270
Score = 52.0 bits (119), Expect = 2e-06
Identities = 37/95 (38%), Positives = 47/95 (49%), Gaps = 13/95 (13%)
Query: 9 RTRVYDCNYNKGESYYRPVLDRLDGK-APIVREPERESIRADVDNRIKSALHDMETP--- 64
RTR+YD NYN G++YYR LD +D K +P P R SI D+ +R A D +
Sbjct: 11 RTRIYDANYNIGQNYYRSALDSIDRKYSPRAPSPLRPSIARDILDRHDEAFADEDLATSR 70
Query: 65 --------NDELFDFRGAR-AQRGRPLSSALEDDE 90
+ LFD RG R AQR L D+E
Sbjct: 71 LRAEKTIRQNHLFDSRGGRIAQRALDLVENDIDEE 105
>UniRef50_UPI00015B4118 Cluster: PREDICTED: hypothetical protein;
n=1; Nasonia vitripennis|Rep: PREDICTED: hypothetical
protein - Nasonia vitripennis
Length = 240
Score = 51.6 bits (118), Expect = 3e-06
Identities = 31/85 (36%), Positives = 42/85 (49%), Gaps = 4/85 (4%)
Query: 1 MSRVGRTR-RTRVYDCNYNKGESYYRPVLDRLDGK---APIVREPERESIRADVDNRIKS 56
MSR +R RTRVY CNY+KGESYY+P ++ LD K P+ EP R
Sbjct: 1 MSRPILSRPRTRVYGCNYDKGESYYKPTIENLDRKYSGRPLFSEPRNSLADEIAARRSDI 60
Query: 57 ALHDMETPNDELFDFRGARAQRGRP 81
++ N++ +DF R P
Sbjct: 61 GSRNLGGRNEDDYDFYYGRGYSEDP 85
>UniRef50_A3W017 Cluster: Probable flavohemoprotein; n=2;
Alphaproteobacteria|Rep: Probable flavohemoprotein -
Roseovarius sp. 217
Length = 545
Score = 34.7 bits (76), Expect = 0.37
Identities = 14/49 (28%), Positives = 26/49 (53%)
Query: 20 GESYYRPVLDRLDGKAPIVREPERESIRADVDNRIKSALHDMETPNDEL 68
G+S RP + ++G+ +R P+R S++ D + + LHD P +
Sbjct: 56 GDSAGRPWVTLMEGREGFIRSPDRHSLKVDANLHHQDPLHDTFAPGSSV 104
>UniRef50_A0CN60 Cluster: Chromosome undetermined scaffold_22, whole
genome shotgun sequence; n=2; Paramecium tetraurelia|Rep:
Chromosome undetermined scaffold_22, whole genome shotgun
sequence - Paramecium tetraurelia
Length = 1128
Score = 34.3 bits (75), Expect = 0.49
Identities = 20/68 (29%), Positives = 33/68 (48%), Gaps = 3/68 (4%)
Query: 29 DRLDGKAPIVREPERESIRADVDNRIKSALHDMETPNDELFDFRGARAQRGRPLSSALED 88
D L K I++ PE++ R + ++ ++ E NDE FD R G+PL +
Sbjct: 969 DMLKSKTQILKSPEQQQ-REQKEQQLFFSVFGDELQNDEEFDLEVIRQNMGKPLE--IMQ 1025
Query: 89 DEFNEDAM 96
D ED++
Sbjct: 1026 DHLKEDSI 1033
>UniRef50_A3DI35 Cluster: SMC protein-like protein; n=1; Clostridium
thermocellum ATCC 27405|Rep: SMC protein-like protein -
Clostridium thermocellum (strain ATCC 27405 / DSM 1237)
Length = 883
Score = 32.7 bits (71), Expect = 1.5
Identities = 20/78 (25%), Positives = 37/78 (47%), Gaps = 1/78 (1%)
Query: 11 RVYDCNYNKGESYYRPVLDRLDGKA-PIVREPERESIRADVDNRIKSALHDMETPNDELF 69
++YD + ESY +L L+G+ IV E R+ + I++ + ++E EL
Sbjct: 658 KIYDAEFQSAESYIENLLSELNGRIDEIVLEMSRDWALVERGCLIENEIQELEVKTAELE 717
Query: 70 DFRGARAQRGRPLSSALE 87
+ G+ L +AL+
Sbjct: 718 REKERLLDIGKSLKTALD 735
>UniRef50_Q6BRR0 Cluster: Similar to CA4605|IPF6226 Candida albicans
IPF6226; n=1; Debaryomyces hansenii|Rep: Similar to
CA4605|IPF6226 Candida albicans IPF6226 - Debaryomyces
hansenii (Yeast) (Torulaspora hansenii)
Length = 574
Score = 32.7 bits (71), Expect = 1.5
Identities = 18/69 (26%), Positives = 35/69 (50%), Gaps = 1/69 (1%)
Query: 31 LDGKAPIVREPERESIRADVDNRIKSALHDMETPNDELFDFRGARAQRGRPLSSALEDDE 90
+D + ++ E I + +N ++ HD+ET D+L+D R + R L + L+ D+
Sbjct: 460 VDFNSTVLNNSLAEKINDNYNNVNENIPHDVETSTDKLYD-ASYRLSQARELIAKLQTDK 518
Query: 91 FNEDAMVVV 99
N+ +V
Sbjct: 519 LNDKVSSLV 527
>UniRef50_A4B8X7 Cluster: Sensor protein; n=1; Reinekea sp.
MED297|Rep: Sensor protein - Reinekea sp. MED297
Length = 662
Score = 32.3 bits (70), Expect = 2.0
Identities = 15/38 (39%), Positives = 22/38 (57%)
Query: 31 LDGKAPIVREPERESIRADVDNRIKSALHDMETPNDEL 68
L G+A RE + E I AD++ I D++T ND+L
Sbjct: 346 LTGQAIQKREQQLEDINADLEQHIAERTRDLQTANDDL 383
>UniRef50_A7FVH4 Cluster: Sortase family protein; n=4; Clostridium
botulinum|Rep: Sortase family protein - Clostridium
botulinum (strain ATCC 19397 / Type A)
Length = 212
Score = 31.9 bits (69), Expect = 2.6
Identities = 14/51 (27%), Positives = 30/51 (58%)
Query: 8 RRTRVYDCNYNKGESYYRPVLDRLDGKAPIVREPERESIRADVDNRIKSAL 58
+ T +YD N KG+++ ++ L K PI++ + + ++ V + ++SAL
Sbjct: 70 KSTVIYDVNPEKGDNFGSLIMPTLKQKFPIIQGTDDKELKKGVGHFLESAL 120
>UniRef50_A3WI90 Cluster: Putative uncharacterized protein; n=1;
Erythrobacter sp. NAP1|Rep: Putative uncharacterized
protein - Erythrobacter sp. NAP1
Length = 1003
Score = 31.9 bits (69), Expect = 2.6
Identities = 22/77 (28%), Positives = 35/77 (45%), Gaps = 7/77 (9%)
Query: 24 YRPVLDRLDGKAPIVREPERESIRADV------DNRIKSALHDMETPNDELFDFRG-ARA 76
+ PV DRL K IV EP+ ++ + D ++ + P + FDFRG A
Sbjct: 669 FSPVADRLPSKKHIVFEPDGAMLKLPINLLVTDDESVERYNEQLAGPEGDEFDFRGVAWL 728
Query: 77 QRGRPLSSALEDDEFNE 93
R +S+A+ F +
Sbjct: 729 GRNSQISTAVAPTAFRD 745
>UniRef50_A1TTI9 Cluster: Amino acid adenylation domain; n=1;
Acidovorax avenae subsp. citrulli AAC00-1|Rep: Amino
acid adenylation domain - Acidovorax avenae subsp.
citrulli (strain AAC00-1)
Length = 2651
Score = 31.9 bits (69), Expect = 2.6
Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 2/50 (4%)
Query: 30 RLDGKAPIVREPERESIRADVDNRIKSALHDMETPNDELFDFRGARAQRG 79
RL G+ P+++ P RAD R ++A+H + P D + R A RG
Sbjct: 245 RLGGEQPVLQLPAEHPRRAD--GRYRAAVHAWDLPADRVRALRSAAQARG 292
>UniRef50_A4S3P0 Cluster: Predicted protein; n=1; Ostreococcus
lucimarinus CCE9901|Rep: Predicted protein -
Ostreococcus lucimarinus CCE9901
Length = 1008
Score = 31.5 bits (68), Expect = 3.5
Identities = 19/66 (28%), Positives = 31/66 (46%), Gaps = 1/66 (1%)
Query: 1 MSRVGRTRRTRVYDCNYNKGESYYRPVLDRLDGKAPIVREPERESIRADVDNRIKSALHD 60
+ + RT + DC+ + R +LD+L A R P R + +AD+ + AL D
Sbjct: 793 LDAIANRARTEIVDCDATVSWTNIRALLDQLKTGAT-KRTPVRANAQADLARDVDFALLD 851
Query: 61 METPND 66
+ ND
Sbjct: 852 VVVDND 857
>UniRef50_Q18DG2 Cluster: Type II restriction enzyme, methylase
subunits; n=1; Haloquadratum walsbyi DSM 16790|Rep: Type
II restriction enzyme, methylase subunits - Haloquadratum
walsbyi (strain DSM 16790)
Length = 1208
Score = 31.5 bits (68), Expect = 3.5
Identities = 15/44 (34%), Positives = 24/44 (54%), Gaps = 3/44 (6%)
Query: 24 YRPVLDRLDGKAPIVREPERESIRADVDNRIKSALHDMETPNDE 67
++ + +RLDG P E ER ++A++D A H E N+E
Sbjct: 1134 FKEMRERLDGIEPATEEGERRQLQAEID---AGAFHAYELENEE 1174
>UniRef50_A7HN81 Cluster: Putative uncharacterized protein; n=1;
Fervidobacterium nodosum Rt17-B1|Rep: Putative
uncharacterized protein - Fervidobacterium nodosum
Rt17-B1
Length = 328
Score = 31.1 bits (67), Expect = 4.6
Identities = 18/58 (31%), Positives = 30/58 (51%), Gaps = 7/58 (12%)
Query: 21 ESYYRPVLDRLDGKAPIVR-------EPERESIRADVDNRIKSALHDMETPNDELFDF 71
E Y VLD L K ++ +P+ ++++ DVDN IK L D++ +L D+
Sbjct: 71 EKYKLKVLDTLIRKIAFIQFAESLKVQPDPQTVKNDVDNEIKKMLVDLKMTETQLNDY 128
>UniRef50_Q7PT84 Cluster: ENSANGP00000016024; n=3; Culicidae|Rep:
ENSANGP00000016024 - Anopheles gambiae str. PEST
Length = 287
Score = 31.1 bits (67), Expect = 4.6
Identities = 14/27 (51%), Positives = 18/27 (66%)
Query: 6 RTRRTRVYDCNYNKGESYYRPVLDRLD 32
R RTRVYD N + E+ YR L+RL+
Sbjct: 5 RGLRTRVYDANMSMVENNYRAALERLE 31
>UniRef50_Q9W9L2 Cluster: IIIa; n=3; Mastadenovirus|Rep: IIIa -
Porcine adenovirus 3 (PAdV-3)
Length = 622
Score = 30.3 bits (65), Expect = 8.0
Identities = 20/56 (35%), Positives = 27/56 (48%), Gaps = 1/56 (1%)
Query: 39 REPERESIRADVDNRIKSALHDMETPNDELFDFRGARAQRGRPLSSALEDDEFNED 94
R R S+ D+D I S L T E+ + ++R +P ALEDDE ED
Sbjct: 543 RRASRSSLEEDID-AISSRLFTWRTRAQEMGLPVASFSRRHQPRPGALEDDEEEED 597
>UniRef50_A1WZL7 Cluster: Lipopolysaccharide biosynthesis; n=1;
Halorhodospira halophila SL1|Rep: Lipopolysaccharide
biosynthesis - Halorhodospira halophila (strain DSM 244
/ SL1) (Ectothiorhodospirahalophila (strain DSM 244 /
SL1))
Length = 518
Score = 30.3 bits (65), Expect = 8.0
Identities = 18/51 (35%), Positives = 25/51 (49%)
Query: 40 EPERESIRADVDNRIKSALHDMETPNDELFDFRGARAQRGRPLSSALEDDE 90
+ + E +RA R + A H +E D L A AQR L+ ALE+ E
Sbjct: 186 DADLEELRAKRAERRQEAEHRVEAHKDRLERLESAGAQRKEQLTHALEEAE 236
>UniRef50_A7S5V5 Cluster: Predicted protein; n=2; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 408
Score = 30.3 bits (65), Expect = 8.0
Identities = 19/54 (35%), Positives = 31/54 (57%), Gaps = 3/54 (5%)
Query: 17 YNKGESYYRPVLDRL-DGKAPI-VREPERESIRADVDNRIKSALHDMETPNDEL 68
Y KGE + + D KA + +R+ ES+ A V N++ A++D+E N+EL
Sbjct: 293 YEKGEYGLSEAVQEIKDHKAQLQIRDQNIESLTASV-NKLSLAVNDLEMENEEL 345
>UniRef50_Q0UNS0 Cluster: Putative uncharacterized protein; n=1;
Phaeosphaeria nodorum|Rep: Putative uncharacterized
protein - Phaeosphaeria nodorum (Septoria nodorum)
Length = 2233
Score = 30.3 bits (65), Expect = 8.0
Identities = 15/48 (31%), Positives = 23/48 (47%)
Query: 40 EPERESIRADVDNRIKSALHDMETPNDELFDFRGARAQRGRPLSSALE 87
+ E+++I AD+ R+ + T N EL + RG Q LS E
Sbjct: 1029 QEEKDAIIADLQRRVDDISSEFTTTNSELTELRGRHEQETMRLSQEKE 1076
Database: uniref50
Posted date: Oct 5, 2007 11:19 AM
Number of letters in database: 575,637,011
Number of sequences in database: 1,657,284
Lambda K H
0.318 0.136 0.390
Gapped
Lambda K H
0.279 0.0580 0.190
Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 112,251,841
Number of Sequences: 1657284
Number of extensions: 4243458
Number of successful extensions: 10382
Number of sequences better than 10.0: 18
Number of HSP's better than 10.0 without gapping: 8
Number of HSP's successfully gapped in prelim test: 10
Number of HSP's that attempted gapping in prelim test: 10373
Number of HSP's gapped (non-prelim): 18
length of query: 99
length of database: 575,637,011
effective HSP length: 76
effective length of query: 23
effective length of database: 449,683,427
effective search space: 10342718821
effective search space used: 10342718821
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
S2: 65 (30.3 bits)
- SilkBase 1999-2023 -