BLASTP 2.2.12 [Aug-07-2005]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= BGIBMGA000749-TA|BGIBMGA000749-PA|undefined
(104 letters)
Database: arabidopsis
28,952 sequences; 12,070,560 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
At5g25500.1 68418.m03034 expressed protein ; expression supporte... 31 0.19
At4g21800.2 68417.m03154 ATP-binding family protein contains Pfa... 29 0.59
At4g21800.1 68417.m03153 ATP-binding family protein contains Pfa... 29 0.59
At1g70320.1 68414.m08090 ubiquitin-protein ligase 2 (UPL2) nearl... 28 1.4
At5g03950.1 68418.m00375 hypothetical protein 27 1.8
At4g27500.1 68417.m03950 expressed protein non-consensus GA dono... 27 2.4
At1g72440.1 68414.m08377 CCAAT-box-binding transcription factor-... 27 2.4
At5g59920.1 68418.m07514 DC1 domain-containing protein contains ... 27 3.1
At5g40110.1 68418.m04865 hypothetical protein 27 3.1
At5g07170.1 68418.m00817 hypothetical protein 27 3.1
At1g13220.2 68414.m01534 nuclear matrix constituent protein-rela... 27 3.1
At5g21970.1 68418.m02553 expressed protein supported by full-len... 26 4.2
At3g30640.1 68416.m03878 Ulp1 protease family protein contains P... 26 4.2
At4g19000.1 68417.m02798 IWS1 C-terminus family protein contains... 26 5.5
At4g07526.1 68417.m01177 hypothetical protein 26 5.5
At3g53310.1 68416.m05881 transcriptional factor B3 family protei... 26 5.5
At1g48400.1 68414.m05406 F-box family protein contains F-box dom... 26 5.5
At5g56930.1 68418.m07107 zinc finger (CCCH-type) family protein ... 25 7.3
At5g51940.1 68418.m06444 DNA-directed RNA polymerase II, putativ... 25 7.3
At3g15357.1 68416.m01947 expressed protein 25 7.3
At1g23890.2 68414.m03013 NHL repeat-containing protein contains ... 25 7.3
At1g13160.1 68414.m01526 SDA1 family protein contains Pfam PF052... 25 7.3
At5g01660.1 68418.m00082 kelch repeat-containing protein similar... 25 9.6
At3g61030.1 68416.m06828 C2 domain-containing protein similar to... 25 9.6
At3g60950.1 68416.m06819 C2 domain-containing protein similar to... 25 9.6
At3g17160.1 68416.m02189 expressed protein 25 9.6
At3g14660.1 68416.m01855 cytochrome P450, putative similar to GB... 25 9.6
At2g24710.1 68415.m02952 glutamate receptor family protein (GLR2... 25 9.6
At2g19480.1 68415.m02277 nucleosome assembly protein (NAP), puta... 25 9.6
At1g21430.1 68414.m02680 flavin-containing monooxygenase family ... 25 9.6
>At5g25500.1 68418.m03034 expressed protein ; expression supported
by MPSS
Length = 420
Score = 30.7 bits (66), Expect = 0.19
Identities = 14/35 (40%), Positives = 22/35 (62%)
Query: 51 KTKIEVRPFLNKNIRVADVTDKVYDDLNDLIVEDD 85
+TK+ V LN+ +R D+ D +D+N +IV DD
Sbjct: 51 ETKLPVLKKLNRALRDVDLVDGKLEDINGVIVYDD 85
>At4g21800.2 68417.m03154 ATP-binding family protein contains Pfam
domain, PF03029: Conserved hypothetical ATP binding
protein
Length = 379
Score = 29.1 bits (62), Expect = 0.59
Identities = 15/47 (31%), Positives = 24/47 (51%), Gaps = 2/47 (4%)
Query: 60 LNKNIRVADVTDKVY--DDLNDLIVEDDESDTDNVTNSVEEDTGDNY 104
LN ++ D T+K+ +D D VED+E D + E+D +Y
Sbjct: 331 LNTGLKDRDATEKMMLEEDDEDFQVEDEEDSDDAIDEDDEDDETKHY 377
>At4g21800.1 68417.m03153 ATP-binding family protein contains Pfam
domain, PF03029: Conserved hypothetical ATP binding
protein
Length = 379
Score = 29.1 bits (62), Expect = 0.59
Identities = 15/47 (31%), Positives = 24/47 (51%), Gaps = 2/47 (4%)
Query: 60 LNKNIRVADVTDKVY--DDLNDLIVEDDESDTDNVTNSVEEDTGDNY 104
LN ++ D T+K+ +D D VED+E D + E+D +Y
Sbjct: 331 LNTGLKDRDATEKMMLEEDDEDFQVEDEEDSDDAIDEDDEDDETKHY 377
>At1g70320.1 68414.m08090 ubiquitin-protein ligase 2 (UPL2) nearly
identical to ubiquitin-protein ligase 2 [Arabidopsis
thaliana] GI:7108523; E3, HECT-domain protein family;
similar to ubiquitin-protein ligase 2 GI:7108523 from
[Arabidopsis thaliana]
Length = 3658
Score = 27.9 bits (59), Expect = 1.4
Identities = 14/41 (34%), Positives = 23/41 (56%), Gaps = 1/41 (2%)
Query: 63 NIRVADVTDKVYDDLNDLIVEDDESDTDNVTNS-VEEDTGD 102
N + D ++ D+ ND +V++DE D D + V+ED D
Sbjct: 2170 NDDMVDEDEEDEDEYNDDMVDEDEDDEDEYNDDMVDEDEDD 2210
Score = 26.6 bits (56), Expect = 3.1
Identities = 12/29 (41%), Positives = 18/29 (62%), Gaps = 1/29 (3%)
Query: 75 DDLNDLIVEDDESDTDNVTNS-VEEDTGD 102
D+ ND +V++DE D D + V+ED D
Sbjct: 2167 DEYNDDMVDEDEEDEDEYNDDMVDEDEDD 2195
>At5g03950.1 68418.m00375 hypothetical protein
Length = 252
Score = 27.5 bits (58), Expect = 1.8
Identities = 16/42 (38%), Positives = 24/42 (57%), Gaps = 3/42 (7%)
Query: 62 KNIRVADVTDKVYDDLNDLIVEDDESDTDNVTNSVEEDTGDN 103
++ RV D V D N+ V++ + D D+ TN EE+ GDN
Sbjct: 102 ESYRVCDSVSNV--DENNEAVDEQDDDEDDKTNEDEEE-GDN 140
>At4g27500.1 68417.m03950 expressed protein non-consensus GA donor
splice site at exon 6
Length = 612
Score = 27.1 bits (57), Expect = 2.4
Identities = 13/43 (30%), Positives = 25/43 (58%)
Query: 52 TKIEVRPFLNKNIRVADVTDKVYDDLNDLIVEDDESDTDNVTN 94
TK E+ N+ V++ DK Y +++DL + DE++++ N
Sbjct: 275 TKDEITVLENELKTVSEKRDKAYSNIHDLRRQRDETNSEYYQN 317
>At1g72440.1 68414.m08377 CCAAT-box-binding transcription
factor-related similar to CCAAT-box-binding
transcription factor (CCAAT-binding factor) (CBF)
(Swiss-Prot:Q03701) [Homo sapiens], GB:P53569 [Mus
musculus]
Length = 1056
Score = 27.1 bits (57), Expect = 2.4
Identities = 16/38 (42%), Positives = 22/38 (57%), Gaps = 1/38 (2%)
Query: 66 VADVTDKVYDDLNDLIVEDDESDTDNVTNSVEEDTGDN 103
VADV+D D D+ + DDE D +NV + D GD+
Sbjct: 956 VADVSDAEMDTDMDMDLIDDEDD-NNVDDDGTGDGGDD 992
>At5g59920.1 68418.m07514 DC1 domain-containing protein contains
Pfam profile PF03107: DC1 domain
Length = 710
Score = 26.6 bits (56), Expect = 3.1
Identities = 13/37 (35%), Positives = 23/37 (62%), Gaps = 1/37 (2%)
Query: 68 DVTDKVYDD-LNDLIVEDDESDTDNVTNSVEEDTGDN 103
DV+D V DD ND+ + + D+D V++ V +D ++
Sbjct: 647 DVSDDVSDDPSNDVSDDTSDDDSDVVSDVVSDDASND 683
>At5g40110.1 68418.m04865 hypothetical protein
Length = 280
Score = 26.6 bits (56), Expect = 3.1
Identities = 16/39 (41%), Positives = 22/39 (56%), Gaps = 3/39 (7%)
Query: 65 RVADVTDKVYDDLNDLIVEDDESDTDNVTNSVEEDTGDN 103
RV D V D N+ V++ + D D+ TN EE+ GDN
Sbjct: 105 RVCDSVSNV--DENNEAVDEKDDDEDDKTNEDEEE-GDN 140
>At5g07170.1 68418.m00817 hypothetical protein
Length = 542
Score = 26.6 bits (56), Expect = 3.1
Identities = 13/36 (36%), Positives = 21/36 (58%)
Query: 68 DVTDKVYDDLNDLIVEDDESDTDNVTNSVEEDTGDN 103
D D DD +D +DD+ D ++ + VEE+ GD+
Sbjct: 122 DDDDDDDDDDDDDDDDDDDDDDESKDSEVEEEEGDD 157
>At1g13220.2 68414.m01534 nuclear matrix constituent protein-related
similar to nuclear matrix constituent protein 1 (NMCP1)
[Daucus carota] GI:2190187
Length = 1128
Score = 26.6 bits (56), Expect = 3.1
Identities = 17/44 (38%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 61 NKNIRVADVTDKVYDDLN-DLIVEDDES-DTDNVTNSVEEDTGD 102
N ++ VA+V V +D N D E+DE+ D DN N ++D D
Sbjct: 1061 NGDVPVANVEPTVNEDTNEDGDEEEDEAQDDDNEENQDDDDDDD 1104
>At5g21970.1 68418.m02553 expressed protein supported by full-length
cDNA gi:22531216 from [Arabidopsis thaliana]
Length = 449
Score = 26.2 bits (55), Expect = 4.2
Identities = 9/25 (36%), Positives = 17/25 (68%)
Query: 76 DLNDLIVEDDESDTDNVTNSVEEDT 100
D +D++ +DDE D V ++ EE++
Sbjct: 423 DYDDVVTDDDEMDIGEVNDAYEENS 447
>At3g30640.1 68416.m03878 Ulp1 protease family protein contains Pfam
profile PF02902: Ulp1 protease family, C-terminal
catalytic domain
Length = 661
Score = 26.2 bits (55), Expect = 4.2
Identities = 9/27 (33%), Positives = 17/27 (62%)
Query: 66 VADVTDKVYDDLNDLIVEDDESDTDNV 92
+ DVTD++ + N L+ E DE + + +
Sbjct: 398 ILDVTDQIVSEYNRLLPESDEDEEETM 424
>At4g19000.1 68417.m02798 IWS1 C-terminus family protein contains
Pfam profile PF05909: IWS1 C-terminus
Length = 406
Score = 25.8 bits (54), Expect = 5.5
Identities = 11/37 (29%), Positives = 22/37 (59%)
Query: 50 DKTKIEVRPFLNKNIRVADVTDKVYDDLNDLIVEDDE 86
+++K R + K+I V ++ D+V +DL+D D+
Sbjct: 20 EESKFTGRRLVKKSISVPELVDEVEEDLDDFTEPADD 56
>At4g07526.1 68417.m01177 hypothetical protein
Length = 112
Score = 25.8 bits (54), Expect = 5.5
Identities = 11/30 (36%), Positives = 17/30 (56%)
Query: 75 DDLNDLIVEDDESDTDNVTNSVEEDTGDNY 104
+D+ +L+VED E D DN + G N+
Sbjct: 66 EDIMNLVVEDSEIDEDNSLWGKPKRKGSNF 95
>At3g53310.1 68416.m05881 transcriptional factor B3 family protein
contains Pfam profile PF02362: B3 DNA binding domain
Length = 286
Score = 25.8 bits (54), Expect = 5.5
Identities = 12/43 (27%), Positives = 23/43 (53%)
Query: 61 NKNIRVADVTDKVYDDLNDLIVEDDESDTDNVTNSVEEDTGDN 103
+K +R + D +D +VED++ TD V + +ED ++
Sbjct: 104 SKEVRAEIQAIPLSDSDSDSVVEDEKDSTDVVEDDDDEDEDED 146
>At1g48400.1 68414.m05406 F-box family protein contains F-box domain
Pfam:PF00646
Length = 513
Score = 25.8 bits (54), Expect = 5.5
Identities = 13/37 (35%), Positives = 20/37 (54%)
Query: 68 DVTDKVYDDLNDLIVEDDESDTDNVTNSVEEDTGDNY 104
D D DD +D +DD+ D D+ + ++D GD Y
Sbjct: 293 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDGDYY 329
>At5g56930.1 68418.m07107 zinc finger (CCCH-type) family protein
contains Pfam domain, PF00642: Zinc finger
C-x8-C-x5-C-x3-H type (and similar)
Length = 675
Score = 25.4 bits (53), Expect = 7.3
Identities = 12/31 (38%), Positives = 21/31 (67%), Gaps = 1/31 (3%)
Query: 75 DDLNDLIVEDDESDTDNVTNSVEED-TGDNY 104
DD +D++VEDDE+ + V +D TG+++
Sbjct: 240 DDNDDMLVEDDETVERHEEYQVSQDGTGNSH 270
>At5g51940.1 68418.m06444 DNA-directed RNA polymerase II, putative
similar to SP|O88828 DNA-directed RNA polymerase II
14.4 kDa polypeptide (EC 2.7.7.6) (RPB6) (RPB14.4)
{Rattus norvegicus}; contains Pfam profile PF01192: RNA
polymerases K / 14 to 18 kDa subunit
Length = 144
Score = 25.4 bits (53), Expect = 7.3
Identities = 11/31 (35%), Positives = 21/31 (67%), Gaps = 1/31 (3%)
Query: 69 VTDKVYDDLNDLIVEDDESDTDNVTNSVEED 99
+ D+ Y+D++DL ED+ ++ + + VEED
Sbjct: 1 MADEDYNDVDDLGYEDEPAEPE-IEEGVEED 30
>At3g15357.1 68416.m01947 expressed protein
Length = 143
Score = 25.4 bits (53), Expect = 7.3
Identities = 14/39 (35%), Positives = 22/39 (56%), Gaps = 1/39 (2%)
Query: 61 NKNIRVADVTDKVYDDLNDLIVEDDESDTDNVTNSVEED 99
NK+ + TD D+++D+ EDDE D+D E+D
Sbjct: 99 NKDADSEEDTDFDEDEIDDVDFEDDE-DSDEDDEEEEDD 136
>At1g23890.2 68414.m03013 NHL repeat-containing protein contains
Pfam profile PF01436: NHL repeat
Length = 400
Score = 25.4 bits (53), Expect = 7.3
Identities = 13/29 (44%), Positives = 17/29 (58%)
Query: 72 KVYDDLNDLIVEDDESDTDNVTNSVEEDT 100
K DDL DLI DDE + +N + E+T
Sbjct: 338 KPSDDLIDLISFDDEQEPNNDKDCRNEET 366
>At1g13160.1 68414.m01526 SDA1 family protein contains Pfam PF05285:
SDA1 domain; similar to mystery 45A
(GI:16797816){Drosophila melanogaster}
Length = 804
Score = 25.4 bits (53), Expect = 7.3
Identities = 9/36 (25%), Positives = 20/36 (55%)
Query: 67 ADVTDKVYDDLNDLIVEDDESDTDNVTNSVEEDTGD 102
+D+ + D ++ + + DE+DTD+ +E + D
Sbjct: 578 SDIDTSIGGDEDEEVNDSDEADTDSENEEIESEEED 613
>At5g01660.1 68418.m00082 kelch repeat-containing protein similar to
SP|P57790 Kelch-like ECH-associated protein 1 (Cytosolic
inhibitor of Nrf2) {Rattus norvegicus}; contains Pfam
profile PF01344: Kelch motif
Length = 621
Score = 25.0 bits (52), Expect = 9.6
Identities = 10/28 (35%), Positives = 17/28 (60%)
Query: 71 DKVYDDLNDLIVEDDESDTDNVTNSVEE 98
D V + L DL+ DE +++T +VE+
Sbjct: 195 DHVLEKLKDLVFSHDEHGDNSLTETVEQ 222
>At3g61030.1 68416.m06828 C2 domain-containing protein similar to
CLB1 [Lycopersicon esculentum] GI:2789434; contains Pfam
profile PF00168: C2 domain
Length = 592
Score = 25.0 bits (52), Expect = 9.6
Identities = 18/55 (32%), Positives = 31/55 (56%), Gaps = 4/55 (7%)
Query: 50 DKTKIEVRPFLNKNIRVADVTDKVYDDLNDLIVEDDESDTDNVTNSV-EEDTGDN 103
D + E++P K I + + + V+D +LIVED E T ++T V ++D G +
Sbjct: 201 DLSDFELKP-QRKLIAIENNLNPVWDQTFELIVEDKE--TQSLTVEVFDKDVGQD 252
>At3g60950.1 68416.m06819 C2 domain-containing protein similar to
CLB1 [Lycopersicon esculentum] GI:2789434; contains Pfam
profile PF00168: C2 domain
Length = 592
Score = 25.0 bits (52), Expect = 9.6
Identities = 18/55 (32%), Positives = 31/55 (56%), Gaps = 4/55 (7%)
Query: 50 DKTKIEVRPFLNKNIRVADVTDKVYDDLNDLIVEDDESDTDNVTNSV-EEDTGDN 103
D + E++P K I + + + V+D +LIVED E T ++T V ++D G +
Sbjct: 201 DLSDFELKP-QRKLIAIENNLNPVWDQTFELIVEDKE--TQSLTVEVFDKDVGQD 252
>At3g17160.1 68416.m02189 expressed protein
Length = 165
Score = 25.0 bits (52), Expect = 9.6
Identities = 10/20 (50%), Positives = 13/20 (65%)
Query: 83 EDDESDTDNVTNSVEEDTGD 102
EDD SD D N ++E+ GD
Sbjct: 98 EDDASDFDPEENGLDEEEGD 117
>At3g14660.1 68416.m01855 cytochrome P450, putative similar to
GB:Q05047 from [Catharanthus roseus]
Length = 512
Score = 25.0 bits (52), Expect = 9.6
Identities = 13/46 (28%), Positives = 22/46 (47%)
Query: 53 KIEVRPFLNKNIRVADVTDKVYDDLNDLIVEDDESDTDNVTNSVEE 98
K +R +NK +R + + DDL +++E + T S EE
Sbjct: 266 KFILRGIVNKRLRAREAGEAPSDDLLGILLESNLGQTKGNGMSTEE 311
>At2g24710.1 68415.m02952 glutamate receptor family protein (GLR2.3)
plant glutamate receptor family, PMID:11379626
Length = 895
Score = 25.0 bits (52), Expect = 9.6
Identities = 13/30 (43%), Positives = 17/30 (56%)
Query: 74 YDDLNDLIVEDDESDTDNVTNSVEEDTGDN 103
YD L V +E+ T+N+T S DTG N
Sbjct: 313 YDATTALAVAIEEAGTNNMTFSKVVDTGRN 342
>At2g19480.1 68415.m02277 nucleosome assembly protein (NAP),
putative similar to nucleosome assembly protein 1
[Glycine max] GI:1161252; contains Pfam profile PF00956:
Nucleosome assembly protein (NAP)
Length = 379
Score = 25.0 bits (52), Expect = 9.6
Identities = 13/37 (35%), Positives = 22/37 (59%), Gaps = 1/37 (2%)
Query: 67 ADVTDKVYDDLNDLIVEDDESDTDNVTNSVEEDTGDN 103
AD D + DD +++ +DDE D ++ + EED D+
Sbjct: 301 ADDLD-IEDDDDEIDEDDDEEDEEDDEDDEEEDDEDD 336
>At1g21430.1 68414.m02680 flavin-containing monooxygenase family
protein / FMO family protein similar to
flavin-containing monooxygenases YUCCA [gi:16555352],
YUCCA2 [gi:16555354], and YUCCA3 [gi:16555356] from
Arabidopsis thaliana
Length = 391
Score = 25.0 bits (52), Expect = 9.6
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 44 INYLRKDKTKIEVRPFLNKNIRVADVTD 71
INYL + T+ V P N+N++ A D
Sbjct: 83 INYLDEYATRFNVNPRYNRNVKSAYFKD 110
Database: arabidopsis
Posted date: Oct 3, 2007 3:31 PM
Number of letters in database: 12,070,560
Number of sequences in database: 28,952
Lambda K H
0.314 0.137 0.374
Gapped
Lambda K H
0.279 0.0580 0.190
Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 2,163,547
Number of Sequences: 28952
Number of extensions: 74783
Number of successful extensions: 379
Number of sequences better than 10.0: 30
Number of HSP's better than 10.0 without gapping: 17
Number of HSP's successfully gapped in prelim test: 13
Number of HSP's that attempted gapping in prelim test: 328
Number of HSP's gapped (non-prelim): 61
length of query: 104
length of database: 12,070,560
effective HSP length: 71
effective length of query: 33
effective length of database: 10,014,968
effective search space: 330493944
effective search space used: 330493944
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 42 (21.9 bits)
S2: 52 (25.0 bits)
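The figures in the statistics block above can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, not BLAST's own code: it assumes the standard Karlin-Altschul relations (bit score S' = (lambda*S - ln K) / ln 2, and E = search space * 2^-S') and assumes the gapped Lambda and K values listed above are the ones that apply to the reported alignments. The function names are hypothetical, and because the report prints rounded parameters, results should agree with the report only to within rounding.

```python
import math

# Values copied from the statistics block of this report.
QUERY_LEN = 104
DB_LETTERS = 12_070_560
DB_SEQS = 28_952
EFF_HSP_LEN = 71
GAPPED_LAMBDA = 0.279   # assumption: the gapped parameters apply to these hits
GAPPED_K = 0.0580


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space).

    The effective HSP length is subtracted once from the query and once
    per database sequence, as the figures in the report suggest.
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score (Karlin-Altschul)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Expected number of chance hits scoring at least raw_score."""
    return search_space * 2.0 ** (-bit_score(raw_score, lam, k))


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN
    )
    print("effective query length:", eff_q)
    print("effective database length:", eff_db)
    print("effective search space:", space)
    # Raw score 66 is the top hit (At5g25500.1); 52 is the S2 cutoff.
    for raw in (66, 52):
        print(raw, round(bit_score(raw), 1), round(expect_value(raw, space), 2))
```

Under these assumptions, the top hit's raw score of 66 comes out close to the reported 30.7 bits and an Expect of about 0.19, and the S2 cutoff of 52 lands near 25.0 bits, which is consistent with the weakest hits listed at E = 9.6.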