BLASTP 2.2.12 [Aug-07-2005]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= BGIBMGA001524-TA|BGIBMGA001524-PA|undefined
(130 letters)
Database: arabidopsis
28,952 sequences; 12,070,560 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At2g36230.1 68415.m04447 N'-5'-phosphoribosyl-formimino-5-aminoi... 31 0.33
At2g22795.1 68415.m02704 expressed protein 30 0.44
At1g64370.1 68414.m07295 expressed protein 29 1.0
At4g27790.1 68417.m03991 calcium-binding EF hand family protein ... 28 1.8
At2g42190.1 68415.m05221 expressed protein ; similar to GP|9826|... 28 2.3
At3g47260.1 68416.m05133 Ulp1 protease family protein contains P... 27 3.1
At2g14130.1 68415.m01574 Ulp1 protease family protein contains P... 27 3.1
At1g17460.1 68414.m02141 myb family transcription factor contain... 27 3.1
At1g08040.1 68414.m00878 expressed protein contains Pfam profile... 27 3.1
At5g50100.1 68418.m06204 expressed protein contains Pfam domain ... 27 4.1
At3g25680.1 68416.m03196 expressed protein 27 4.1
At5g39865.1 68418.m04835 glutaredoxin family protein 27 5.4
At4g36120.1 68417.m05141 expressed protein 27 5.4
At4g24560.1 68417.m03520 ubiquitin-specific protease 16, putativ... 27 5.4
At3g56570.1 68416.m06290 SET domain-containing protein low simil... 27 5.4
At1g21640.1 68414.m02709 ATP-NAD kinase family protein contains ... 27 5.4
At5g55480.1 68418.m06910 glycerophosphoryl diester phosphodieste... 26 7.1
At5g28740.1 68418.m03530 transcription-coupled DNA repair protei... 26 7.1
At5g27120.1 68418.m03237 SAR DNA-binding protein, putative stron... 26 7.1
At4g37180.2 68417.m05264 myb family transcription factor contain... 26 9.4
At4g37180.1 68417.m05263 myb family transcription factor contain... 26 9.4
At3g19080.1 68416.m02423 SWIB complex BAF60b domain-containing p... 26 9.4
At2g30800.1 68415.m03755 DEIH-box RNA/DNA helicase, putative sim... 26 9.4
At1g56660.1 68414.m06516 expressed protein 26 9.4
>At2g36230.1 68415.m04447
N'-5'-phosphoribosyl-formimino-5-aminoimidazole-4-
carboxamide ribonucleotide isomerase identical to
N'-5'-phosphoribosyl-formimino-5-aminoimidazole-
4-carboxamide ribonucleotide isomerase GI:3449284 from
[Arabidopsis thaliana]; contains Pfam profile: PF00977
histidine biosynthesis protein
Length = 304
Score = 30.7 bits (66), Expect = 0.33
Identities = 13/24 (54%), Positives = 17/24 (70%)
Query: 4 SVLPVTYATGETVFDDLSRSEEIG 27
S +PVTYA G TV DD+ R ++ G
Sbjct: 246 SPIPVTYAGGVTVMDDVERIKDAG 269
>At2g22795.1 68415.m02704 expressed protein
Length = 734
Score = 30.3 bits (65), Expect = 0.44
Identities = 23/56 (41%), Positives = 29/56 (51%), Gaps = 5/56 (8%)
Query: 3 KSVLPVTYATGETVFDDLSRSEEIGHE--KLIGSKDSEDQNEEKKGFDLQNKRINE 56
KSVLP T GE+ D+ S GHE L G K SE ++ EK +L K N+
Sbjct: 345 KSVLPNTTDNGESSSDEKSTGSSSGHESDSLEGIK-SEGESMEKN--ELLEKEFND 397
Score = 26.6 bits (56), Expect = 5.4
Identities = 13/43 (30%), Positives = 26/43 (60%)
Query: 14 ETVFDDLSRSEEIGHEKLIGSKDSEDQNEEKKGFDLQNKRINE 56
ET + S S+E EK +KD+E+ + +++ D +N++I +
Sbjct: 507 ETKEKEESSSQEKTEEKETETKDNEESSSQEETKDKENEKIEK 549
>At1g64370.1 68414.m07295 expressed protein
Length = 178
Score = 29.1 bits (62), Expect = 1.0
Identities = 13/35 (37%), Positives = 18/35 (51%)
Query: 27 GHEKLIGSKDSEDQNEEKKGFDLQNKRINEGTKKY 61
GH KL+G D + +E G + IN G KK+
Sbjct: 115 GHGKLLGQHDQKKTHETNSGLNGLGMFINNGEKKH 149
>At4g27790.1 68417.m03991 calcium-binding EF hand family protein
contains INTERPRO:IPR002048 calcium-binding EF-hand
domain
Length = 345
Score = 28.3 bits (60), Expect = 1.8
Identities = 18/45 (40%), Positives = 25/45 (55%), Gaps = 5/45 (11%)
Query: 16 VFDDL-SRSEEIGHEKLIGSKDSEDQNEEK----KGFDLQNKRIN 55
VFD L +R E + HEK G+K E EEK + + Q +R+N
Sbjct: 47 VFDPLVTRIERLSHEKEAGTKTVEAAKEEKDDMFEEYFAQERRLN 91
>At2g42190.1 68415.m05221 expressed protein ; similar to
GP|9826|X07453
Length = 141
Score = 27.9 bits (59), Expect = 2.3
Identities = 13/46 (28%), Positives = 23/46 (50%)
Query: 18 DDLSRSEEIGHEKLIGSKDSEDQNEEKKGFDLQNKRINEGTKKYIN 63
DD EE E+++ + ED++EE + +K + E +K N
Sbjct: 42 DDFEEEEEEDDEEMVEKMEVEDEDEEIDDGSVTSKDLKERKRKMSN 87
>At3g47260.1 68416.m05133 Ulp1 protease family protein contains Pfam
profile PF02902: Ulp1 protease family, C-terminal
catalytic domain; similar to At1g32840, At4g04010,
At2g06430, At2g15140, At2g04980, At2g14130, At3g44500,
At2g15190, At5g34900, At3g29210, At2g02210, At3g32900
Length = 820
Score = 27.5 bits (58), Expect = 3.1
Identities = 17/60 (28%), Positives = 24/60 (40%), Gaps = 3/60 (5%)
Query: 42 EEKKGFDLQNKRINEGTKKYINCLDFISIVCTINFFTGV---LIPSSVKYFIKLPQGLRN 98
+ K+GF Q++ I E KKY+ N + P S+K PQ RN
Sbjct: 713 DNKEGFQFQSEEIKEVKKKYVAAYSIAMPYIVRNILKKEDMDVSPFSIKVLTTFPQAPRN 772
>At2g14130.1 68415.m01574 Ulp1 protease family protein contains Pfam
profile PF02902: Ulp1 protease family, C-terminal
catalytic domain; similar to At1g32840, At4g04010,
At2g06430, At2g15140, At2g04980, At3g44500, At2g15190,
At3g47260, At5g34900, At3g29210, At2g02210, At3g32900
Length = 808
Score = 27.5 bits (58), Expect = 3.1
Identities = 17/60 (28%), Positives = 24/60 (40%), Gaps = 3/60 (5%)
Query: 42 EEKKGFDLQNKRINEGTKKYINCLDFISIVCTINFFTGV---LIPSSVKYFIKLPQGLRN 98
+ K+GF Q++ I E KKY+ N + P S+K PQ RN
Sbjct: 701 DNKEGFQFQSEEIKEVKKKYVAAYSIAMPYIVRNILKKEDMDVSPFSIKVLTTFPQAPRN 760
>At1g17460.1 68414.m02141 myb family transcription factor contains
Pfam PF00249: Myb-like DNA-binding domain
Length = 604
Score = 27.5 bits (58), Expect = 3.1
Identities = 13/40 (32%), Positives = 21/40 (52%)
Query: 23 SEEIGHEKLIGSKDSEDQNEEKKGFDLQNKRINEGTKKYI 62
S + GH + D NE+ +G D KR+ + T++YI
Sbjct: 298 SNDFGHSPVETFVDHYSGNEDFEGEDGSAKRVRKPTRRYI 337
>At1g08040.1 68414.m00878 expressed protein contains Pfam profile
PF05212: Protein of unknown function (DUF707)
Length = 382
Score = 27.5 bits (58), Expect = 3.1
Identities = 19/60 (31%), Positives = 28/60 (46%), Gaps = 4/60 (6%)
Query: 52 KRINEGTKKYINCLDFISIVCTINFFTGVLIPSSVKYFIKLPQGLRNSPLLVPLNDKAFS 111
+R NE K I + + +V FF G+ +P I LP GL +S + + K FS
Sbjct: 11 RRTNENAKLIITTI--VGVV--FGFFVGITLPLGSFRKISLPSGLMSSLDVAMSDGKLFS 66
>At5g50100.1 68418.m06204 expressed protein contains Pfam domain
PF04134: Protein of unknown function, DUF393
Length = 214
Score = 27.1 bits (57), Expect = 4.1
Identities = 10/32 (31%), Positives = 18/32 (56%)
Query: 77 FTGVLIPSSVKYFIKLPQGLRNSPLLVPLNDK 108
F +IP+ KY ++ QG P++ PL ++
Sbjct: 34 FNHRIIPTGFKYQVRAIQGTSTDPVITPLKNR 65
>At3g25680.1 68416.m03196 expressed protein
Length = 558
Score = 27.1 bits (57), Expect = 4.1
Identities = 14/41 (34%), Positives = 22/41 (53%)
Query: 20 LSRSEEIGHEKLIGSKDSEDQNEEKKGFDLQNKRINEGTKK 60
LSR E+ EK K S ++ +EK D Q + +N T++
Sbjct: 434 LSRVNEVEEEKTTQEKWSAERLKEKAAIDCQKQLLNSLTEE 474
>At5g39865.1 68418.m04835 glutaredoxin family protein
Length = 390
Score = 26.6 bits (56), Expect = 5.4
Identities = 26/90 (28%), Positives = 39/90 (43%), Gaps = 2/90 (2%)
Query: 23 SEEIGHEKLIGSKDSEDQNEEKKGFDLQNKRINEGTKKYINCLDFISIVCTINFFTGVLI 82
S+++G E LI +EEK DLQ K I IV T
Sbjct: 80 SDQMGEETLISGNGFLHGDEEKMNLDLQAKVIEAKVWSSTINEKIPKIVAKTPIVTPPGE 139
Query: 83 PSSVKYFIKLPQGLRN-SPLLVPLNDKAFS 111
P ++ + +L +GL + SPL P + ++FS
Sbjct: 140 PETINTW-ELMEGLEDVSPLRSPNHLRSFS 168
>At4g36120.1 68417.m05141 expressed protein
Length = 981
Score = 26.6 bits (56), Expect = 5.4
Identities = 15/33 (45%), Positives = 20/33 (60%), Gaps = 4/33 (12%)
Query: 32 IGSKDSEDQNEEK----KGFDLQNKRINEGTKK 60
+ SK+ E +NEEK K D+ NK+ EG KK
Sbjct: 250 VASKEVEIRNEEKNMSLKSADIANKQHLEGVKK 282
>At4g24560.1 68417.m03520 ubiquitin-specific protease 16, putative
(UBP16) similar to ubiquitin-specific protease 16
GI:11993477 [Arabidopsis thaliana]
Length = 1008
Score = 26.6 bits (56), Expect = 5.4
Identities = 13/37 (35%), Positives = 20/37 (54%)
Query: 7 PVTYATGETVFDDLSRSEEIGHEKLIGSKDSEDQNEE 43
PVT GE++ + +RS E G+ + +KD EE
Sbjct: 149 PVTIPIGESLLSNRARSPEDGNGDIADNKDDLIDKEE 185
>At3g56570.1 68416.m06290 SET domain-containing protein low
similarity to SP|Q43088 Ribulose-1,5 bisphosphate
carboxylase/oxygenase large subunit N-
methyltransferase, chloroplast precursor (EC 2.1.1.127)
{Pisum sativum}; contains Pfam profile PF00856: SET
domain
Length = 531
Score = 26.6 bits (56), Expect = 5.4
Identities = 13/39 (33%), Positives = 19/39 (48%)
Query: 13 GETVFDDLSRSEEIGHEKLIGSKDSEDQNEEKKGFDLQN 51
GE D+ EE E+ G ++ E + EE+ LQN
Sbjct: 249 GENTDDEAKEEEEEEEEEEEGEEEEEGEEEEENSSMLQN 287
>At1g21640.1 68414.m02709 ATP-NAD kinase family protein contains
similarity to NAD kinase [Homo sapiens]
gi|20070086|gb|AAM01195; contains Pfam domain, PF01513:
ATP-NAD kinase
Length = 985
Score = 26.6 bits (56), Expect = 5.4
Identities = 21/68 (30%), Positives = 29/68 (42%), Gaps = 3/68 (4%)
Query: 62 INCLDFISIVCTINFFTGVLIPSSVKYFIKLPQGLRNSPLLVPLNDKAFSPL--GGFVRY 119
+ C+ F I F V++P S K +K+P R S V + K L G VR
Sbjct: 889 VPCMLFTPICPHSLSFRPVILPDSAKLELKIPDDAR-SNAWVSFDGKRRQQLSRGDSVRI 947
Query: 120 YKEVPPIP 127
Y P+P
Sbjct: 948 YMSQHPLP 955
>At5g55480.1 68418.m06910 glycerophosphoryl diester
phosphodiesterase family protein contains Pfam PF03009 :
Glycerophosphoryl diester phosphodiesterase family;
similar to Glycerophosphoryl diester phosphodiesterase
precursor (Glycerophosphodiester phosphodiesterase)
(Surface-exposed lipoprotein D) (Protein D)
(ImmunoglobulinD-binding protein) (IGD-binding protein)
(SP:Q06282) {Haemophilus influenzae}
Length = 766
Score = 26.2 bits (55), Expect = 7.1
Identities = 13/57 (22%), Positives = 27/57 (47%)
Query: 40 QNEEKKGFDLQNKRINEGTKKYINCLDFISIVCTINFFTGVLIPSSVKYFIKLPQGL 96
+N + F + + K+++ DF+++ + TGVLI +++ QGL
Sbjct: 461 ENPYSREFTMFRNPRERSSGKFVSLSDFLNLAKNSSSLTGVLISVENATYLREKQGL 517
>At5g28740.1 68418.m03530 transcription-coupled DNA repair
protein-related similar to XAB2 (XPA-binding protein 2)
[Homo sapiens] GI:10566459
Length = 917
Score = 26.2 bits (55), Expect = 7.1
Identities = 16/39 (41%), Positives = 20/39 (51%), Gaps = 3/39 (7%)
Query: 8 VTYATGETVFDDLSRSEEIGHEK---LIGSKDSEDQNEE 43
VT +FD SR EE K ++ S D ED+NEE
Sbjct: 280 VTVRDFSVIFDVYSRFEESTVAKKMEMMSSSDEEDENEE 318
>At5g27120.1 68418.m03237 SAR DNA-binding protein, putative strong
similarity to SAR DNA-binding protein-1 [Pisum sativum]
GI:3132696; contains Pfam profile PF01798: Putative
snoRNA binding domain
Length = 533
Score = 26.2 bits (55), Expect = 7.1
Identities = 15/52 (28%), Positives = 26/52 (50%), Gaps = 2/52 (3%)
Query: 12 TGETVFDDLSRSEEIGHEKLIGSKDSEDQNEEKKGFDLQNKRINEGTKKYIN 63
T T D L ++ + E G K+ +D+ ++KK D + + E +KK N
Sbjct: 434 TYNTAADSLLQTPTVDSEN--GVKEKKDKKKKKKADDEEEAKTEEPSKKKSN 483
>At4g37180.2 68417.m05264 myb family transcription factor contains
Pfam domain, PF00249: Myb-like DNA-binding domain
Length = 363
Score = 25.8 bits (54), Expect = 9.4
Identities = 14/44 (31%), Positives = 24/44 (54%), Gaps = 3/44 (6%)
Query: 23 SEEIGHEKLIGSKDSEDQNEEKKGFDLQNKRINEGTKKYINCLD 66
S +GH +D E+++EE++ D ++ N+ TKK LD
Sbjct: 320 SSNVGHSS---EEDEEEEDEEEEKSDGRSSCRNDETKKKRQVLD 360
>At4g37180.1 68417.m05263 myb family transcription factor contains
Pfam domain, PF00249: Myb-like DNA-binding domain
Length = 356
Score = 25.8 bits (54), Expect = 9.4
Identities = 14/44 (31%), Positives = 24/44 (54%), Gaps = 3/44 (6%)
Query: 23 SEEIGHEKLIGSKDSEDQNEEKKGFDLQNKRINEGTKKYINCLD 66
S +GH +D E+++EE++ D ++ N+ TKK LD
Sbjct: 313 SSNVGHSS---EEDEEEEDEEEEKSDGRSSCRNDETKKKRQVLD 353
>At3g19080.1 68416.m02423 SWIB complex BAF60b domain-containing
protein contains Pfam profile PF02201: BAF60b domain of
the SWIB complex
Length = 462
Score = 25.8 bits (54), Expect = 9.4
Identities = 15/43 (34%), Positives = 24/43 (55%), Gaps = 3/43 (6%)
Query: 3 KSVLPVTYATGETVFDDLSRSEEIGHEKLIGSKDSEDQNEEKK 45
K + P+ GE+V S+S + G +K+ DSE+ NE+ K
Sbjct: 336 KHIWPLEDNAGESVS---SKSPKNGKQKMESDGDSEEPNEKDK 375
>At2g30800.1 68415.m03755 DEIH-box RNA/DNA helicase, putative similar
to DEIH-box RNA/DNA helicase [Arabidopsis thaliana]
GI:5881579; contains Pfam profiles PF04408: Helicase
associated domain (HA2), PF00271: Helicase conserved
C-terminal domain
Length = 1299
Score = 25.8 bits (54), Expect = 9.4
Identities = 12/48 (25%), Positives = 20/48 (41%)
Query: 12 TGETVFDDLSRSEEIGHEKLIGSKDSEDQNEEKKGFDLQNKRINEGTK 59
TG + DD + EE E + + N ++G D+ + G K
Sbjct: 988 TGSSDSDDSNEEEEDDEEVAANTNEEVAANTNEEGMDIHKEESRRGAK 1035
>At1g56660.1 68414.m06516 expressed protein
Length = 522
Score = 25.8 bits (54), Expect = 9.4
Identities = 14/38 (36%), Positives = 18/38 (47%)
Query: 23 SEEIGHEKLIGSKDSEDQNEEKKGFDLQNKRINEGTKK 60
S +I E+ K ED EEKK ++ EG KK
Sbjct: 393 SRDIKLEEPEAEKKEEDDTEEKKKSKVEGGESEEGKKK 430
Database: arabidopsis
Posted date: Oct 3, 2007 3:31 PM
Number of letters in database: 12,070,560
Number of sequences in database: 28,952
Lambda K H
0.318 0.139 0.419
Gapped
Lambda K H
0.279 0.0580 0.190
Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 3,601,048
Number of Sequences: 28952
Number of extensions: 161303
Number of successful extensions: 516
Number of sequences better than 10.0: 24
Number of HSP's better than 10.0 without gapping: 12
Number of HSP's successfully gapped in prelim test: 12
Number of HSP's that attempted gapping in prelim test: 500
Number of HSP's gapped (non-prelim): 27
length of query: 130
length of database: 12,070,560
effective HSP length: 73
effective length of query: 57
effective length of database: 9,957,064
effective search space: 567552648
effective search space used: 567552648
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.6 bits)
S2: 54 (25.8 bits)
- SilkBase 1999-2023 -