BLASTP 2.2.12 [Aug-07-2005]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= BGIBMGA001025-TA|BGIBMGA001025-PA|undefined
(174 letters)
Database: uniref50
1,657,284 sequences; 575,637,011 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef50_Q9VXB3 Cluster: CG4756-PA; n=2; Sophophora|Rep: CG4756-... 227 8e-59
UniRef50_Q9HAJ7 Cluster: SAP30-like protein; n=6; Theria|Rep: SA... 198 4e-50
UniRef50_Q504M9 Cluster: Sap30l protein; n=23; Eumetazoa|Rep: Sa... 196 3e-49
UniRef50_O75446 Cluster: Histone deacetylase complex subunit SAP... 182 5e-45
UniRef50_UPI000155F71B Cluster: PREDICTED: hypothetical protein;... 134 1e-30
UniRef50_UPI0000DC0FE7 Cluster: sin3 associated polypeptide; n=3... 102 5e-21
UniRef50_Q54QG3 Cluster: Putative uncharacterized protein; n=1; ... 58 1e-07
UniRef50_Q6C130 Cluster: Similar to DEHA0F20284g Debaryomyces ha... 42 0.006
UniRef50_Q756I3 Cluster: AER278Wp; n=4; Saccharomycetales|Rep: A... 41 0.018
UniRef50_Q9LN65 Cluster: F18O14.5; n=4; Arabidopsis thaliana|Rep... 40 0.024
UniRef50_Q59NY8 Cluster: Possible Sin3-Rpd3 histone deacetylase ... 40 0.042
UniRef50_A7EWN0 Cluster: Putative uncharacterized protein; n=1; ... 38 0.096
UniRef50_A5DAD8 Cluster: Putative uncharacterized protein; n=1; ... 38 0.096
UniRef50_Q553R0 Cluster: ARID/BRIGHT DNA binding domain-containi... 38 0.17
UniRef50_A6SGM8 Cluster: Putative uncharacterized protein; n=1; ... 38 0.17
UniRef50_Q01HC0 Cluster: OSIGBa0136O08-OSIGBa0153H12.4 protein; ... 37 0.22
UniRef50_Q892K7 Cluster: Membrane associated protein; n=2; Clost... 35 0.90
UniRef50_A5KCJ6 Cluster: Putative uncharacterized protein; n=1; ... 33 2.7
UniRef50_P38429 Cluster: Transcriptional regulatory protein SAP3... 33 2.7
UniRef50_Q1DNG6 Cluster: Putative uncharacterized protein; n=1; ... 32 6.3
UniRef50_A5E2L1 Cluster: Putative uncharacterized protein; n=1; ... 32 6.3
UniRef50_UPI000065F66A Cluster: Fibulin-2 precursor.; n=1; Takif... 32 8.4
>UniRef50_Q9VXB3 Cluster: CG4756-PA; n=2; Sophophora|Rep: CG4756-PA
- Drosophila melanogaster (Fruit fly)
Length = 173
Score = 227 bits (556), Expect = 8e-59
Identities = 103/170 (60%), Positives = 126/170 (74%)
Query: 1 MNNGFSTGEEDSRGNSDQICCLVDDGDRCRRPAGNASYSKRIQKTVALRRLKLNIDGQAR 60
MNNGFSTGEEDSRG++DQ CCL+DD +RCR AG ASYSKRIQKTVA +RLKL+ D A+
Sbjct: 1 MNNGFSTGEEDSRGHTDQTCCLIDDMERCRNQAGYASYSKRIQKTVAQKRLKLSSDPAAQ 60
Query: 61 HTYICDYHKTMIQCARTKQRRTKXXXXXXXXXXXXXXXVDWFQLQVNTLRRYKRHYKVPT 120
H YICD+HK IQ RTK+RR D +QL V+TLRRYKRH+KV T
Sbjct: 61 HIYICDHHKERIQSVRTKRRRKDSEDDSNETDTDLHEFPDLYQLGVSTLRRYKRHFKVQT 120
Query: 121 RPGLNKAQLADAVQKHFKSLPVNEREIMTYFIYMVKTNGNKLDQKNGMNN 170
R G+ +AQLAD + KHFK++P+ E+EI+T+F+YMVK NKLDQKNG+ N
Sbjct: 121 RQGMKRAQLADTIMKHFKTIPIKEKEIITFFVYMVKMGSNKLDQKNGLGN 170
>UniRef50_Q9HAJ7 Cluster: SAP30-like protein; n=6; Theria|Rep:
SAP30-like protein - Homo sapiens (Human)
Length = 183
Score = 198 bits (484), Expect = 4e-50
Identities = 97/176 (55%), Positives = 123/176 (69%), Gaps = 13/176 (7%)
Query: 3 NGFSTGEEDSRGNSD-----------QICCLVDDGDRCRRPAGNASYSKRIQKTVALRRL 51
NGFST EEDSR Q CCL++DG+RC RPAGNAS+SKR+QK+++ ++L
Sbjct: 2 NGFST-EEDSREGPPAAPAAAAPGYGQSCCLIEDGERCVRPAGNASFSKRVQKSISQKKL 60
Query: 52 KLNIDGQARHTYICDYHKTMIQCARTKQRR-TKXXXXXXXXXXXXXXXVDWFQLQVNTLR 110
KL+ID RH YICD+HK IQ R K++R T VD FQLQVNTLR
Sbjct: 61 KLDIDKSVRHLYICDFHKNFIQSVRNKRKRKTSDDGGDSPEHDTDIPEVDLFQLQVNTLR 120
Query: 111 RYKRHYKVPTRPGLNKAQLADAVQKHFKSLPVNEREIMTYFIYMVKTNGNKLDQKN 166
RYKRHYK+ TRPG NKAQLA+ V +HF+++PVNE+E + YFIYMVK+N ++LDQK+
Sbjct: 121 RYKRHYKLQTRPGFNKAQLAETVSRHFRNIPVNEKETLAYFIYMVKSNKSRLDQKS 176
>UniRef50_Q504M9 Cluster: Sap30l protein; n=23; Eumetazoa|Rep:
Sap30l protein - Mus musculus (Mouse)
Length = 240
Score = 196 bits (477), Expect = 3e-49
Identities = 96/175 (54%), Positives = 121/175 (69%), Gaps = 12/175 (6%)
Query: 3 NGFSTGEEDSRGNSD----------QICCLVDDGDRCRRPAGNASYSKRIQKTVALRRLK 52
NGFST EEDSR Q CCL+ DG+RC RPAGNAS+SKR+QK+++ ++LK
Sbjct: 60 NGFST-EEDSREGPPAAPAAAPGYGQSCCLIADGERCVRPAGNASFSKRVQKSISQKKLK 118
Query: 53 LNIDGQARHTYICDYHKTMIQCARTKQRRTKXXXXXXXXXXXXXX-XVDWFQLQVNTLRR 111
L+ID RH YICD+HK IQ R K++R VD FQLQVNTLRR
Sbjct: 119 LDIDKSVRHLYICDFHKNFIQSVRNKRKRKASDDGGDSPEHDADIPEVDLFQLQVNTLRR 178
Query: 112 YKRHYKVPTRPGLNKAQLADAVQKHFKSLPVNEREIMTYFIYMVKTNGNKLDQKN 166
YKRHYK+ TRPG NKAQLA+ V +HF+++PVNE+E + YFIYMVK+N ++LDQK+
Sbjct: 179 YKRHYKLQTRPGFNKAQLAETVSRHFRNIPVNEKETLAYFIYMVKSNRSRLDQKS 233
>UniRef50_O75446 Cluster: Histone deacetylase complex subunit SAP30;
n=22; Tetrapoda|Rep: Histone deacetylase complex subunit
SAP30 - Homo sapiens (Human)
Length = 220
Score = 182 bits (442), Expect = 5e-45
Identities = 82/150 (54%), Positives = 109/150 (72%), Gaps = 2/150 (1%)
Query: 18 QICCLVDDGDRCRRPAGNASYSKRIQKTVALRRLKLNIDGQARHTYICDYHKTMIQCART 77
Q+CCL +DG+RC R AGNAS+SKRIQK+++ +++K+ +D ARH YICDYHK +IQ R
Sbjct: 65 QLCCLREDGERCGRAAGNASFSKRIQKSISQKKVKIELDKSARHLYICDYHKNLIQSVRN 124
Query: 78 KQRR--TKXXXXXXXXXXXXXXXVDWFQLQVNTLRRYKRHYKVPTRPGLNKAQLADAVQK 135
+++R + VD +QLQVNTLRRYKRH+K+PTRPGLNKAQL + V
Sbjct: 125 RRKRKGSDDDGGDSPVQDIDTPEVDLYQLQVNTLRRYKRHFKLPTRPGLNKAQLVEIVGC 184
Query: 136 HFKSLPVNEREIMTYFIYMVKTNGNKLDQK 165
HF+S+PVNE++ +TYFIY VK + NK D K
Sbjct: 185 HFRSIPVNEKDTLTYFIYSVKNDKNKSDLK 214
>UniRef50_UPI000155F71B Cluster: PREDICTED: hypothetical protein;
n=1; Equus caballus|Rep: PREDICTED: hypothetical protein
- Equus caballus
Length = 245
Score = 134 bits (324), Expect = 1e-30
Identities = 62/113 (54%), Positives = 78/113 (69%), Gaps = 1/113 (0%)
Query: 54 NIDGQARHTYICDYHKTMIQCARTKQRR-TKXXXXXXXXXXXXXXXVDWFQLQVNTLRRY 112
N+ +H YICD+HK IQ R K++R T VD FQLQVNTLRRY
Sbjct: 125 NLAKLLQHLYICDFHKNFIQSVRNKRKRKTSDDGGDSPEHDTDIPEVDLFQLQVNTLRRY 184
Query: 113 KRHYKVPTRPGLNKAQLADAVQKHFKSLPVNEREIMTYFIYMVKTNGNKLDQK 165
KRHYK+ TRPG NKAQLA+ V +HF+++PVNE+E + YFIYMVK+N ++LDQK
Sbjct: 185 KRHYKLQTRPGFNKAQLAETVSRHFRNIPVNEKETLAYFIYMVKSNKSRLDQK 237
>UniRef50_UPI0000DC0FE7 Cluster: sin3 associated polypeptide; n=3;
Rattus norvegicus|Rep: sin3 associated polypeptide -
Rattus norvegicus
Length = 221
Score = 102 bits (244), Expect = 5e-21
Identities = 50/151 (33%), Positives = 84/151 (55%), Gaps = 3/151 (1%)
Query: 18 QICCLVDDGDRCRRPAGNASYSKRIQKTVALRRLKLNIDGQARHTYICDYHKTMIQCAR- 76
Q+CCL +DG+RC R AGNAS+SKRIQK+++ +++K+ +D + Y+ ++ ++Q R
Sbjct: 65 QLCCLREDGERCGRAAGNASFSKRIQKSISQKKVKIELDKSVTYKYLLEFITNLVQSPRS 124
Query: 77 -TKQRRTKXXXXXXXXXXXXXXXVDWFQLQVNTLRRYKRHYKVPTRPGLNK-AQLADAVQ 134
T + + Q+ + Y + +P+ P L K +++ V
Sbjct: 125 ITNSQDLMQNSDSRKAGSLQSTNISCVVSQLLVIVAYMLDFSIPSNPMLTKGSKIHRIVG 184
Query: 135 KHFKSLPVNEREIMTYFIYMVKTNGNKLDQK 165
HFKS+PVNE++ +T FIY V+ + NK D K
Sbjct: 185 CHFKSIPVNEKDTLTCFIYSVRNDKNKSDLK 215
>UniRef50_Q54QG3 Cluster: Putative uncharacterized protein; n=1;
Dictyostelium discoideum AX4|Rep: Putative
uncharacterized protein - Dictyostelium discoideum AX4
Length = 179
Score = 58.0 bits (134), Expect = 1e-07
Identities = 26/54 (48%), Positives = 37/54 (68%)
Query: 103 QLQVNTLRRYKRHYKVPTRPGLNKAQLADAVQKHFKSLPVNEREIMTYFIYMVK 156
+L V L+RYK+HY++ T+ +KA+LA V+ HF LPVNE EI+ F+ VK
Sbjct: 123 RLDVAALKRYKKHYRLKTKHNSSKAELASIVRSHFDGLPVNEGEIIENFMVKVK 176
>UniRef50_Q6C130 Cluster: Similar to DEHA0F20284g Debaryomyces
hansenii; n=1; Yarrowia lipolytica|Rep: Similar to
DEHA0F20284g Debaryomyces hansenii - Yarrowia lipolytica
(Candida lipolytica)
Length = 162
Score = 42.3 bits (95), Expect = 0.006
Identities = 19/46 (41%), Positives = 28/46 (60%)
Query: 113 KRHYKVPTRPGLNKAQLADAVQKHFKSLPVNEREIMTYFIYMVKTN 158
K+ + R +NK +LA A +KHF + PV E EI+ F+Y VK +
Sbjct: 107 KKTWSYKHRTRVNKPELASAAKKHFMAQPVRESEIIVDFVYAVKNH 152
>UniRef50_Q756I3 Cluster: AER278Wp; n=4; Saccharomycetales|Rep:
AER278Wp - Ashbya gossypii (Yeast) (Eremothecium
gossypii)
Length = 224
Score = 40.7 bits (91), Expect = 0.018
Identities = 21/54 (38%), Positives = 30/54 (55%), Gaps = 2/54 (3%)
Query: 110 RRYKRHYKVPTRPG--LNKAQLADAVQKHFKSLPVNEREIMTYFIYMVKTNGNK 161
R Y +P++P + K QLAD V++HF + V E E + FIY VK + K
Sbjct: 164 RTYSYRRNIPSQPDSRVTKKQLADEVRRHFTASMVKETECIPAFIYKVKNSKKK 217
>UniRef50_Q9LN65 Cluster: F18O14.5; n=4; Arabidopsis thaliana|Rep:
F18O14.5 - Arabidopsis thaliana (Mouse-ear cress)
Length = 242
Score = 40.3 bits (90), Expect = 0.024
Identities = 21/59 (35%), Positives = 33/59 (55%), Gaps = 1/59 (1%)
Query: 99 VDWFQLQVNTLRRYKRHYK-VPTRPGLNKAQLADAVQKHFKSLPVNEREIMTYFIYMVK 156
VD +L++ L Y RH+ V P +K QL D VQ+HF S ++E +++ F+ K
Sbjct: 162 VDLSKLEMPALLNYWRHFNLVDAIPNPSKEQLIDIVQRHFMSQQMDELQVIVGFVQAAK 220
>UniRef50_Q59NY8 Cluster: Possible Sin3-Rpd3 histone deacetylase
complex component Sap30p; n=3; Saccharomycetales|Rep:
Possible Sin3-Rpd3 histone deacetylase complex component
Sap30p - Candida albicans (Yeast)
Length = 168
Score = 39.5 bits (88), Expect = 0.042
Identities = 19/47 (40%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 110 RRYKRHYKVPTRPGLNKAQLADAVQKHFKSLPVNEREIMTYFIYMVK 156
+R K++ K R ++K ++A+ V+ HF SLP E +I+T F+Y VK
Sbjct: 113 KRSKKNQKDTGR--ISKLEIANHVKNHFSSLPAKENDIITGFLYKVK 157
>UniRef50_A7EWN0 Cluster: Putative uncharacterized protein; n=1;
Sclerotinia sclerotiorum 1980|Rep: Putative
uncharacterized protein - Sclerotinia sclerotiorum 1980
Length = 187
Score = 38.3 bits (85), Expect = 0.096
Identities = 16/37 (43%), Positives = 24/37 (64%)
Query: 125 NKAQLADAVQKHFKSLPVNEREIMTYFIYMVKTNGNK 161
+K QLA+ V+KHF S VNE +++ F+Y V+ K
Sbjct: 130 SKEQLANTVRKHFNSQSVNENDVIVKFLYKVRYQDKK 166
>UniRef50_A5DAD8 Cluster: Putative uncharacterized protein; n=1;
Pichia guilliermondii|Rep: Putative uncharacterized
protein - Pichia guilliermondii (Yeast) (Candida
guilliermondii)
Length = 148
Score = 38.3 bits (85), Expect = 0.096
Identities = 19/45 (42%), Positives = 28/45 (62%), Gaps = 1/45 (2%)
Query: 113 KRH-YKVPTRPGLNKAQLADAVQKHFKSLPVNEREIMTYFIYMVK 156
K H YK + + K +LA+ ++ HF +LP E EI+T F+Y VK
Sbjct: 94 KTHSYKKSLQTLMPKRELAEKMRDHFLALPFRENEIITNFLYKVK 138
>UniRef50_Q553R0 Cluster: ARID/BRIGHT DNA binding domain-containing
protein; n=1; Dictyostelium discoideum AX4|Rep:
ARID/BRIGHT DNA binding domain-containing protein -
Dictyostelium discoideum AX4
Length = 478
Score = 37.5 bits (83), Expect = 0.17
Identities = 15/61 (24%), Positives = 35/61 (57%)
Query: 104 LQVNTLRRYKRHYKVPTRPGLNKAQLADAVQKHFKSLPVNEREIMTYFIYMVKTNGNKLD 163
L++ +L++Y ++++ ++ +L AV HF ++E I+T F+ VK+ +++
Sbjct: 414 LEIKSLKKYNAYHRLKVSHSPSRTELVSAVIHHFSHQNIDEDTIITSFLKRVKSESSRIR 473
Query: 164 Q 164
Q
Sbjct: 474 Q 474
>UniRef50_A6SGM8 Cluster: Putative uncharacterized protein; n=1;
Botryotinia fuckeliana B05.10|Rep: Putative
uncharacterized protein - Botryotinia fuckeliana B05.10
Length = 169
Score = 37.5 bits (83), Expect = 0.17
Identities = 17/38 (44%), Positives = 24/38 (63%)
Query: 125 NKAQLADAVQKHFKSLPVNEREIMTYFIYMVKTNGNKL 162
+K QLA+AV+KHF S VNE E + +Y V+ G +
Sbjct: 128 SKEQLANAVRKHFNSQSVNENECVAKTLYKVRYQGEPI 165
>UniRef50_Q01HC0 Cluster: OSIGBa0136O08-OSIGBa0153H12.4 protein;
n=14; Magnoliophyta|Rep: OSIGBa0136O08-OSIGBa0153H12.4
protein - Oryza sativa (Rice)
Length = 219
Score = 37.1 bits (82), Expect = 0.22
Identities = 21/59 (35%), Positives = 33/59 (55%), Gaps = 1/59 (1%)
Query: 99 VDWFQLQVNTLRRYKRHYK-VPTRPGLNKAQLADAVQKHFKSLPVNEREIMTYFIYMVK 156
V+ +L TL RY +H+ V P +K QL VQ+HF+S ++E +++ FI K
Sbjct: 153 VNLTKLGTPTLWRYWKHFNLVSMNPNPSKEQLFHGVQQHFQSQQLDELQVILGFIQAAK 211
>UniRef50_Q892K7 Cluster: Membrane associated protein; n=2;
Clostridium tetani|Rep: Membrane associated protein -
Clostridium tetani
Length = 429
Score = 35.1 bits (77), Expect = 0.90
Identities = 18/55 (32%), Positives = 34/55 (61%), Gaps = 4/55 (7%)
Query: 120 TRPGLNKAQLADAVQKHFKSLPVNEREIMTY-FIYMVKTNGNKLDQKNGMNNSDS 173
T LNK+++ + + +K + +N++ TY +IY +K+ NK++ K NNS+S
Sbjct: 124 TTSALNKSEIEELFKDRYK-VAINKKSEETYKYIYSIKSGSNKMESK--FNNSNS 175
>UniRef50_A5KCJ6 Cluster: Putative uncharacterized protein; n=1;
Plasmodium vivax|Rep: Putative uncharacterized protein -
Plasmodium vivax
Length = 3459
Score = 33.5 bits (73), Expect = 2.7
Identities = 19/67 (28%), Positives = 33/67 (49%), Gaps = 1/67 (1%)
Query: 105 QVNTLRRYKRHYKVPTRPGLNKAQLADAVQKHFKSLPVNEREIMTYFIYMVKTNGNKLDQ 164
Q N + R++ H + TR L + + +K K++P N + M Y +Y+ + N N D+
Sbjct: 2339 QTNPISRHRTHDEAATRQMLMRGEEPSQPKKK-KNVPNNNKLEMLYKLYLDQNNLNYSDE 2397
Query: 165 KNGMNNS 171
N S
Sbjct: 2398 NENENES 2404
>UniRef50_P38429 Cluster: Transcriptional regulatory protein SAP30;
n=2; Saccharomyces cerevisiae|Rep: Transcriptional
regulatory protein SAP30 - Saccharomyces cerevisiae
(Baker's yeast)
Length = 201
Score = 33.5 bits (73), Expect = 2.7
Identities = 16/50 (32%), Positives = 26/50 (52%)
Query: 112 YKRHYKVPTRPGLNKAQLADAVQKHFKSLPVNEREIMTYFIYMVKTNGNK 161
YKR+ + ++K LA+ V++HF + E + + FIY VK K
Sbjct: 145 YKRNTQGQHDKRIHKRDLANVVRRHFDEHSIKETDCIPQFIYKVKNQKKK 194
>UniRef50_Q1DNG6 Cluster: Putative uncharacterized protein; n=1;
Coccidioides immitis|Rep: Putative uncharacterized
protein - Coccidioides immitis
Length = 218
Score = 32.3 bits (70), Expect = 6.3
Identities = 12/32 (37%), Positives = 22/32 (68%)
Query: 124 LNKAQLADAVQKHFKSLPVNEREIMTYFIYMV 155
++K LA AV+KHF + ++E++ + F+Y V
Sbjct: 171 VSKEHLASAVRKHFNNTAISEQDAIAKFVYKV 202
>UniRef50_A5E2L1 Cluster: Putative uncharacterized protein; n=1;
Lodderomyces elongisporus NRRL YB-4239|Rep: Putative
uncharacterized protein - Lodderomyces elongisporus
(Yeast) (Saccharomyces elongisporus)
Length = 226
Score = 32.3 bits (70), Expect = 6.3
Identities = 13/33 (39%), Positives = 22/33 (66%)
Query: 124 LNKAQLADAVQKHFKSLPVNEREIMTYFIYMVK 156
++K + A+ V+ HF SL + +I+T F+Y VK
Sbjct: 183 ISKKEFANHVRNHFLSLTAKDNDIITGFLYKVK 215
>UniRef50_UPI000065F66A Cluster: Fibulin-2 precursor.; n=1; Takifugu
rubripes|Rep: Fibulin-2 precursor. - Takifugu rubripes
Length = 1169
Score = 31.9 bits (69), Expect = 8.4
Identities = 16/45 (35%), Positives = 24/45 (53%)
Query: 5 FSTGEEDSRGNSDQICCLVDDGDRCRRPAGNASYSKRIQKTVALR 49
FS GEE GN+ VDD D+C+ AGN+ + + + + R
Sbjct: 592 FSIGEEQDGGNAVDGPIEVDDMDKCQLYAGNSCHHRCVNTPSSFR 636
Database: uniref50
Posted date: Oct 5, 2007 11:19 AM
Number of letters in database: 575,637,011
Number of sequences in database: 1,657,284
Lambda K H
0.320 0.133 0.400
Gapped
Lambda K H
0.279 0.0580 0.190
Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 182,774,648
Number of Sequences: 1657284
Number of extensions: 6294319
Number of successful extensions: 12441
Number of sequences better than 10.0: 22
Number of HSP's better than 10.0 without gapping: 18
Number of HSP's successfully gapped in prelim test: 4
Number of HSP's that attempted gapping in prelim test: 12417
Number of HSP's gapped (non-prelim): 23
length of query: 174
length of database: 575,637,011
effective HSP length: 95
effective length of query: 79
effective length of database: 418,195,031
effective search space: 33037407449
effective search space used: 33037407449
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.8 bits)
S2: 69 (31.9 bits)
- SilkBase 1999-2023 -