BLASTX 2.2.12 [Aug-07-2005]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= fmgV11l24f
(607 letters)
Database: arabidopsis
28,952 sequences; 12,070,560 total letters
Searching..................................................done
                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value
At1g08080.1 68414.m00884 carbonic anhydrase family protein simil...   36   0.016
At2g28100.1 68415.m03413 glycosyl hydrolase family 29 / alpha-L-...   35   0.048
At2g28210.1 68415.m03425 carbonic anhydrase family protein simil...   31   0.59
At4g17240.1 68417.m02592 expressed protein                            30   1.4
At3g52720.2 68416.m05809 carbonic anhydrase family protein low s...   30   1.4
At3g52720.1 68416.m05808 carbonic anhydrase family protein low s...   30   1.4
At5g35210.2 68418.m04175 peptidase M50 family protein / sterol-r...   29   1.8
At5g35210.1 68418.m04174 peptidase M50 family protein / sterol-r...   29   1.8
At3g19870.1 68416.m02516 expressed protein                            29   1.8
At1g08065.1 68414.m00882 carbonic anhydrase family protein simil...   29   1.8
At5g04180.1 68418.m00406 carbonic anhydrase family protein simil...   29   3.2
At5g22760.1 68418.m02658 PHD finger family protein contains Pfam...   28   4.2
At4g39450.1 68417.m05582 expressed protein                            28   4.2
At4g36250.1 68417.m05156 aldehyde dehydrogenase family protein c...   28   4.2
At5g49110.1 68418.m06079 expressed protein ; expression support...    28   5.5
At5g59670.1 68418.m07481 leucine-rich repeat protein kinase, put...   27   7.3
At3g26820.1 68416.m03355 esterase/lipase/thioesterase family pro...   27   7.3
At1g07910.1 68414.m00860 expressed protein identical to GB:AAB07...   27   7.3
At4g20910.1 68417.m03031 double-stranded RNA binding protein-rel...   27   9.6
At4g11670.1 68417.m01865 expressed protein contains Pfam PF05664...   27   9.6
At2g01990.1 68415.m00134 expressed protein                            27   9.6
At1g21630.1 68414.m02708 calcium-binding EF hand family protein ...   27   9.6
>At1g08080.1 68414.m00884 carbonic anhydrase family protein similar
to storage protein (dioscorin) [Dioscorea cayenensis]
GI:433463; contains Pfam profile PF00194:
Eukaryotic-type carbonic anhydrase
Length = 275
Score = 36.3 bits (80), Expect = 0.016
Identities = 31/97 (31%), Positives = 43/97 (44%)
Frame = +1
Query: 187 QSPIAISLSRCPTWSSLDPLKFKGYWDSNANAILLNNGSTAYFTFNDASVRPTLSGGPLI 366
QSPI + R S L L + Y SNA L N G F D + ++G
Sbjct: 69 QSPIDLMNERVNIVSHLGRLN-RDYNPSNAT--LKNRGHDIMLKFEDGAGTIKINGF--- 122
Query: 367 GEYIFEQMHFHWSVDDFTGCEHVLDGHGYAAECHFVH 477
EY +Q+H+H + EH ++G +A E H VH
Sbjct: 123 -EYELQQLHWH------SPSEHTINGRRFALELHMVH 152
>At2g28100.1 68415.m03413 glycosyl hydrolase family 29 /
alpha-L-fucosidase, putative similar to
alpha-L-fucosidase SP:P10901 from [Dictyostelium
discoideum]
Length = 506
Score = 34.7 bits (76), Expect = 0.048
Identities = 24/66 (36%), Positives = 31/66 (46%), Gaps = 2/66 (3%)
Frame = -3
Query: 584 SNLGLGASTVSRRNPTTANPSGWPTAVSKDSYL--LL*WTKWHSAAYPWPSSTCSHPVKS 411
S G G + S NPT N S W ++KDS ++ K H WPS + VKS
Sbjct: 64 SEWGTGKANPSIFNPTHLNASQW-VQIAKDSGFSRVILTAKHHDGFCLWPSEYTDYSVKS 122
Query: 410 STDQWK 393
S QW+
Sbjct: 123 S--QWR 126
>At2g28210.1 68415.m03425 carbonic anhydrase family protein similar
to storage protein (dioscorin) [Dioscorea cayenensis]
GI:433463; contains Pfam profile PF00194:
Eukaryotic-type carbonic anhydrase
Length = 217
Score = 31.1 bits (67), Expect = 0.59
Identities = 21/69 (30%), Positives = 28/69 (40%)
Frame = +1
Query: 277 NAILLNNGSTAYFTFNDASVRPTLSGGPLIGEYIFEQMHFHWSVDDFTGCEHVLDGHGYA 456
NA L N G F + G EY Q+H+H + EH ++G +A
Sbjct: 67 NATLKNRGHDMMLKFGEEGSGSITVNGT---EYKLLQLHWH------SPSEHTMNGRRFA 117
Query: 457 AECHFVHYN 483
E H VH N
Sbjct: 118 LELHMVHEN 126
>At4g17240.1 68417.m02592 expressed protein
Length = 200
Score = 29.9 bits (64), Expect = 1.4
Identities = 14/30 (46%), Positives = 20/30 (66%), Gaps = 1/30 (3%)
Frame = +1
Query: 451 YAAE-CHFVHYNSKYESLETAVGHPDGLAV 537
Y +E C++ S Y+SLE + G PDGLA+
Sbjct: 96 YLSESCYWQAQTSSYDSLEFSSGSPDGLAL 125
>At3g52720.2 68416.m05809 carbonic anhydrase family protein low
similarity to storage protein (dioscorin) [Dioscorea
cayenensis] GI:433463; contains Pfam profile PF00194:
Eukaryotic-type carbonic anhydrase
Length = 230
Score = 29.9 bits (64), Expect = 1.4
Identities = 17/35 (48%), Positives = 18/35 (51%)
Frame = +1
Query: 373 YIFEQMHFHWSVDDFTGCEHVLDGHGYAAECHFVH 477
Y QMH+H T EH L G YAAE H VH
Sbjct: 113 YTLLQMHWH------TPSEHHLHGVQYAAELHMVH 141
>At3g52720.1 68416.m05808 carbonic anhydrase family protein low
similarity to storage protein (dioscorin) [Dioscorea
cayenensis] GI:433463; contains Pfam profile PF00194:
Eukaryotic-type carbonic anhydrase
Length = 284
Score = 29.9 bits (64), Expect = 1.4
Identities = 17/35 (48%), Positives = 18/35 (51%)
Frame = +1
Query: 373 YIFEQMHFHWSVDDFTGCEHVLDGHGYAAECHFVH 477
Y QMH+H T EH L G YAAE H VH
Sbjct: 113 YTLLQMHWH------TPSEHHLHGVQYAAELHMVH 141
>At5g35210.2 68418.m04175 peptidase M50 family protein /
sterol-regulatory element binding protein (SREBP) site 2
protease family protein contains PFam PF02163:
sterol-regulatory element binding protein (SREBP) site 2
protease
Length = 1409
Score = 29.5 bits (63), Expect = 1.8
Identities = 14/33 (42%), Positives = 18/33 (54%)
Frame = +1
Query: 163 HISRLRPSQSPIAISLSRCPTWSSLDPLKFKGY 261
H+ RL S+S +A RC WS LD L + Y
Sbjct: 249 HLERLSSSKSVLASKCLRCIDWSLLDVLTWPVY 281
>At5g35210.1 68418.m04174 peptidase M50 family protein /
sterol-regulatory element binding protein (SREBP) site 2
protease family protein contains PFam PF02163:
sterol-regulatory element binding protein (SREBP) site 2
protease
Length = 1576
Score = 29.5 bits (63), Expect = 1.8
Identities = 14/33 (42%), Positives = 18/33 (54%)
Frame = +1
Query: 163 HISRLRPSQSPIAISLSRCPTWSSLDPLKFKGY 261
H+ RL S+S +A RC WS LD L + Y
Sbjct: 249 HLERLSSSKSVLASKCLRCIDWSLLDVLTWPVY 281
>At3g19870.1 68416.m02516 expressed protein
Length = 1117
Score = 29.5 bits (63), Expect = 1.8
Identities = 20/56 (35%), Positives = 29/56 (51%), Gaps = 1/56 (1%)
Frame = +1
Query: 439 DGHGYAAECHFVHYNSKYESLET-AVGHPDGLAVVGFLLETVDAPNPRFDRLVQGL 603
DG E V + S S+++ +VGH + AVV LL V+ PN FDR + +
Sbjct: 102 DGSSGLKEQAMVSFTSVLVSIDSFSVGHVE--AVVDLLLALVNRPNHGFDRQARAI 155
>At1g08065.1 68414.m00882 carbonic anhydrase family protein similar
to storage protein (dioscorin) [Dioscorea cayenensis]
GI:433463; contains Pfam profile PF00194:
Eukaryotic-type carbonic anhydrase
Length = 263
Score = 29.5 bits (63), Expect = 1.8
Identities = 28/97 (28%), Positives = 44/97 (45%)
Frame = +1
Query: 187 QSPIAISLSRCPTWSSLDPLKFKGYWDSNANAILLNNGSTAYFTFNDASVRPTLSGGPLI 366
QSPI ++ R +L L+ + Y SNA + F +A + T++G
Sbjct: 50 QSPIDLTDKRVLIDHNLGYLRSQ-YLPSNATIKNRGHDIMMKFEGGNAGLGITINGT--- 105
Query: 367 GEYIFEQMHFHWSVDDFTGCEHVLDGHGYAAECHFVH 477
EY +Q+H+H + EH L+G + E H VH
Sbjct: 106 -EYKLQQIHWH------SPSEHTLNGKRFVLEEHMVH 135
>At5g04180.1 68418.m00406 carbonic anhydrase family protein similar
to storage protein (dioscorin) [Dioscorea cayenensis]
GI:433463; contains Pfam profile PF00194:
Eukaryotic-type carbonic anhydrase
Length = 277
Score = 28.7 bits (61), Expect = 3.2
Identities = 26/82 (31%), Positives = 38/82 (46%)
Frame = +1
Query: 349 SGGPLIGEYIFEQMHFHWSVDDFTGCEHVLDGHGYAAECHFVHYNSKYESLETAVGHPDG 528
+G +I + ++ + HW EH LDG A E H VH +S+E GH
Sbjct: 101 AGKIVINDTDYKLVQSHWHAPS----EHFLDGQRLAMELHMVH-----KSVE---GH--- 145
Query: 529 LAVVGFLLETVDAPNPRFDRLV 594
LAV+G L + PN R++
Sbjct: 146 LAVIGVLFREGE-PNAFISRIM 166
>At5g22760.1 68418.m02658 PHD finger family protein contains Pfam
domain, PF00628: PHD-finger
Length = 1566
Score = 28.3 bits (60), Expect = 4.2
Identities = 20/68 (29%), Positives = 28/68 (41%)
Frame = +1
Query: 163 HISRLRPSQSPIAISLSRCPTWSSLDPLKFKGYWDSNANAILLNNGSTAYFTFNDASVRP 342
H+ RL S +A RC WS LD L + Y A+ +G F FN+ V
Sbjct: 243 HLERLSSEGSEVASKCLRCIDWSLLDALTWPVYLVQYFAAMGHASGPLWRF-FNEFVVEK 301
Query: 343 TLSGGPLI 366
P++
Sbjct: 302 EYCSSPVV 309
>At4g39450.1 68417.m05582 expressed protein
Length = 1553
Score = 28.3 bits (60), Expect = 4.2
Identities = 26/99 (26%), Positives = 41/99 (41%), Gaps = 1/99 (1%)
Frame = +1
Query: 223 TWSSLDPLKFKGYWDSNANAILLNNGSTAYFTFNDASVRPTLSGGPLIGEYIFEQMHFHW 402
T+ L L F GY LL+ G + +D + +S +I ++I E++H
Sbjct: 7 TFGPLGWLSFSGY---PTGEYLLHRGVEFFINVDDPTEISAISWEAIIQKHIEEELHHTK 63
Query: 403 SVDDFTGCEHVLD-GHGYAAECHFVHYNSKYESLETAVG 516
+ G EH L G AA F+ + + LE G
Sbjct: 64 TEGTELGLEHFLHRGRPLAAFNAFLEHRVEKLKLEDQSG 102
>At4g36250.1 68417.m05156 aldehyde dehydrogenase family protein
contais aldehyde dehydrogenase (NADP) family protein
domain, Pfam:PF00171
Length = 484
Score = 28.3 bits (60), Expect = 4.2
Identities = 19/77 (24%), Positives = 35/77 (45%), Gaps = 3/77 (3%)
Frame = +1
Query: 265 DSNANAILLNNGSTAYFTFNDASVRPTLSGGPL--IGEYIFEQMHFHWSVDDFTGCEHVL 438
D N +L+ S+ TFND ++ P +GE + H +S D F+ + ++
Sbjct: 382 DENLKTRILSETSSGSVTFNDVMIQYMCDALPFGGVGESGIGRYHGKYSFDCFSHEKAIM 441
Query: 439 DGH-GYAAECHFVHYNS 486
+G G E + +N+
Sbjct: 442 EGSLGMDLEARYPPWNN 458
>At5g49110.1 68418.m06079 expressed protein ; expression supported by
MPSS
Length = 1487
Score = 27.9 bits (59), Expect = 5.5
Identities = 21/72 (29%), Positives = 33/72 (45%), Gaps = 1/72 (1%)
Frame = -3
Query: 356 PPLRVGLTDASLNVKYAVDPLFKSMALALLS-QYPLNLRGSSEDQVGHRLNDIAIGDCEG 180
PPL G T+ + VD + + LALLS + LN+ S G + +A+ E
Sbjct: 941 PPLATGQTNKEKKGRKDVDGRKQCLHLALLSLKELLNIYSSGSGLTGLLEDLLAVPASED 1000
Query: 179 LNLEMCSASEEV 144
LE C + ++
Sbjct: 1001 ATLEECREASKI 1012
>At5g59670.1 68418.m07481 leucine-rich repeat protein kinase,
putative similar to light repressible receptor protein
kinase [Arabidopsis thaliana] gi|1321686|emb|CAA66376;
contains leucine rich repeat (LRR) domains,
Pfam:PF00560; contains protein kinase domain,
Pfam:PF00069
Length = 868
Score = 27.5 bits (58), Expect = 7.3
Identities = 13/55 (23%), Positives = 28/55 (50%)
Frame = -3
Query: 338 LTDASLNVKYAVDPLFKSMALALLSQYPLNLRGSSEDQVGHRLNDIAIGDCEGLN 174
+ D +L Y ++ ++++ LA+ YP + + S QV H L + + G++
Sbjct: 791 IMDPNLRKDYNINSAWRALELAMSCAYPSSSKRPSMSQVIHELKECIACENTGIS 845
>At3g26820.1 68416.m03355 esterase/lipase/thioesterase family
protein contains Interpro entry IPR000379
Length = 634
Score = 27.5 bits (58), Expect = 7.3
Identities = 12/26 (46%), Positives = 14/26 (53%)
Frame = +3
Query: 498 LGDGGGPSRWIGCGRIPPGDCRRSQP 575
+GDGGGP RW P +CR P
Sbjct: 65 VGDGGGPPRWFS-----PLECRAQAP 85
>At1g07910.1 68414.m00860 expressed protein identical to GB:AAB07881
AT.I.24-9 gene product from [Arabidopsis thaliana] (Mol.
Gen. Genet. 219 (1-2), 106-112 (1989))
Length = 1081
Score = 27.5 bits (58), Expect = 7.3
Identities = 12/27 (44%), Positives = 17/27 (62%)
Frame = -2
Query: 477 VDEVAFCGVPVAIEHVLASGEVIYGPV 397
+DEVA VP + +HV GE++ G V
Sbjct: 280 LDEVADISVPASKDHVKVQGEILEGLV 306
>At4g20910.1 68417.m03031 double-stranded RNA binding
protein-related / DsRBD protein-related contains weak
similarity to Pfam profile PF00035: Double-stranded RNA
binding motif
Length = 942
Score = 27.1 bits (57), Expect = 9.6
Identities = 22/111 (19%), Positives = 48/111 (43%), Gaps = 2/111 (1%)
Frame = +1
Query: 88 MDNRTVKIDPKFLSAQPKKTSSDAEHISRLRPSQSPIAISLSR--CPTWSSLDPLKFKGY 261
+D V++D ++S+ S AE + +Q I+ + C + L K Y
Sbjct: 227 IDEEVVELDTLYISSNRHYLDSIAERLGLKDGNQVMISRMFGKASCGSECRLYSEIPKKY 286
Query: 262 WDSNANAILLNNGSTAYFTFNDASVRPTLSGGPLIGEYIFEQMHFHWSVDD 414
D++++A +N +++ + + + G + G+ I + + W DD
Sbjct: 287 LDNSSDASGTSNEDSSHIVKSRNARASYICGQDIHGDAILASVGYRWKSDD 337
>At4g11670.1 68417.m01865 expressed protein contains Pfam PF05664:
Protein of unknown function (DUF810)
Length = 985
Score = 27.1 bits (57), Expect = 9.6
Identities = 18/57 (31%), Positives = 28/57 (49%)
Frame = -3
Query: 443 PSSTCSHPVKSSTDQWKCICSKMYSPIRGPPLRVGLTDASLNVKYAVDPLFKSMALA 273
PS+ ++ K T K + P+ PPLR GL+D L + A + + SM L+
Sbjct: 131 PSARDNYVFKEETPDIKPVKPIKIIPLGLPPLRTGLSDDDLR-EAAYELMIASMLLS 186
>At2g01990.1 68415.m00134 expressed protein
Length = 221
Score = 27.1 bits (57), Expect = 9.6
Identities = 12/33 (36%), Positives = 17/33 (51%)
Frame = -2
Query: 438 EHVLASGEVIYGPVEVHLLEDVLSDQGAPAEGR 340
EH+ S E+ Y V LED L + G ++ R
Sbjct: 14 EHIFGSPEIDYSDVSTGYLEDALIESGERSKRR 46
>At1g21630.1 68414.m02708 calcium-binding EF hand family protein
contains INTERPRO:IPR002048 calcium-binding EF-hand
domain; ESTs gb|T44428 and gb|AA395440 come from this
gene
Length = 1218
Score = 27.1 bits (57), Expect = 9.6
Identities = 11/25 (44%), Positives = 16/25 (64%)
Frame = -1
Query: 574 GWERRQSPGGIRPQPIHRDGPPPSP 500
G++++ PGG+RP P G PP P
Sbjct: 533 GFQQQPHPGGLRP-PAGPKGKPPRP 556
Database: arabidopsis
Posted date: Oct 4, 2007 10:56 AM
Number of letters in database: 12,070,560
Number of sequences in database: 28,952
Lambda     K      H
   0.318   0.134   0.401
Gapped
Lambda     K      H
   0.279   0.0580  0.190
Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 13,458,114
Number of Sequences: 28952
Number of extensions: 292518
Number of successful extensions: 1012
Number of sequences better than 10.0: 22
Number of HSP's better than 10.0 without gapping: 941
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1008
length of database: 12,070,560
effective HSP length: 78
effective length of database: 9,812,304
effective search space used: 1206913392
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)