
Last updated: 2022/11/18
BLASTX 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= fmgV10n08r
         (773 letters)

Database: arabidopsis 
           28,952 sequences; 12,070,560 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

At1g08080.1 68414.m00884 carbonic anhydrase family protein simil...    60   1e-09
At5g04180.1 68418.m00406 carbonic anhydrase family protein simil...    56   3e-08
At4g20990.1 68417.m03038 carbonic anhydrase family protein simil...    51   7e-07
At3g52720.1 68416.m05808 carbonic anhydrase family protein low s...    51   7e-07
At4g21000.1 68417.m03039 carbonic anhydrase family protein simil...    46   4e-05
At2g28210.1 68415.m03425 carbonic anhydrase family protein simil...    44   9e-05
At1g08065.1 68414.m00882 carbonic anhydrase family protein simil...    43   2e-04
At3g52720.2 68416.m05809 carbonic anhydrase family protein low s...    42   6e-04
At2g28100.1 68415.m03413 glycosyl hydrolase family 29 / alpha-L-...    35   0.069
At5g13540.2 68418.m01564 expressed protein HERC2 - Homo sapiens,...    30   2.0  
At4g17240.1 68417.m02592 expressed protein                             30   2.0  
At3g19870.1 68416.m02516 expressed protein                             29   2.6  
At1g12810.1 68414.m01488 proline-rich family protein contains pr...    29   2.6  
At3g52710.1 68416.m05807 expressed protein predicted protein, Ar...    29   4.5  

>At1g08080.1 68414.m00884 carbonic anhydrase family protein similar
           to storage protein (dioscorin) [Dioscorea cayenensis]
           GI:433463; contains Pfam profile PF00194:
           Eukaryotic-type carbonic anhydrase
          Length = 275

 Score = 60.5 bits (140), Expect = 1e-09
 Identities = 49/165 (29%), Positives = 77/165 (46%), Gaps = 2/165 (1%)
 Frame = -3

Query: 771 EYIFEQMHFHWSVDDFTGCEHVLDGHGYAAECHFVHYNSKYESLETAVGHPDGLAVVGFL 592
           EY  +Q+H+H      +  EH ++G  +A E H VH            G    +AVV  L
Sbjct: 123 EYELQQLHWH------SPSEHTINGRRFALELHMVH-----------EGRNRRMAVVTVL 165

Query: 591 LETVDAPNPRFDRLVQGLEGIQKRESVMNVTSESLLWMDREDLQIGN--YVTYKGSLTTP 418
            +       R D  ++ LE   +  + M    +++  +D   ++IG+  Y  Y GSLTTP
Sbjct: 166 YKI-----GRADTFIRSLEKELEGIAEMEEAEKNVGMIDPTKIKIGSRKYYRYTGSLTTP 220

Query: 417 PYTECVTWIIYEKPVQIGSEQLGLLRQLEGPDSQPIERNVRPTQR 283
           P T+ VTW +  K   +  +Q+ LLR     D+    R V+PT +
Sbjct: 221 PCTQNVTWSVVRKVRTVTRKQVKLLRVAVHDDANSNARPVQPTNK 265


>At5g04180.1 68418.m00406 carbonic anhydrase family protein similar
           to storage protein (dioscorin) [Dioscorea cayenensis]
           GI:433463; contains Pfam profile PF00194:
           Eukaryotic-type carbonic anhydrase
          Length = 277

 Score = 55.6 bits (128), Expect = 3e-08
 Identities = 53/179 (29%), Positives = 77/179 (43%), Gaps = 4/179 (2%)
 Frame = -3

Query: 762 FEQMHFHWSVDDFTGCEHVLDGHGYAAECHFVHYNSKYESLETAVGHPDGLAVVGFLLET 583
           ++ +  HW        EH LDG   A E H VH     +S+E   GH   LAV+G L   
Sbjct: 111 YKLVQSHWHAPS----EHFLDGQRLAMELHMVH-----KSVE---GH---LAVIGVLFRE 155

Query: 582 VDAPNPRFDRL---VQGLEGIQKRE-SVMNVTSESLLWMDREDLQIGNYVTYKGSLTTPP 415
            + PN    R+   +  +  +Q  E S+  +      W       +  +  Y+GSLTTPP
Sbjct: 156 GE-PNAFISRIMDKIHKIADVQDGEVSIGKIDPREFGW------DLTKFYEYRGSLTTPP 208

Query: 414 YTECVTWIIYEKPVQIGSEQLGLLRQLEGPDSQPIERNVRPTQRHPPGHSVIYVKQVRS 238
            TE V W I  K   +  EQ+ +L           E+N RP Q  P    ++Y+ +  S
Sbjct: 209 CTEDVMWTIINKVGTVSREQIDVLTDAR---RGGYEKNARPAQ--PLNGRLVYLNEQSS 262


>At4g20990.1 68417.m03038 carbonic anhydrase family protein similar
           to storage protein (dioscorin) [Dioscorea cayenensis]
           GI:433463; contains Pfam profile PF00194:
           Eukaryotic-type carbonic anhydrase
          Length = 267

 Score = 51.2 bits (117), Expect = 7e-07
 Identities = 43/159 (27%), Positives = 68/159 (42%)
 Frame = -3

Query: 762 FEQMHFHWSVDDFTGCEHVLDGHGYAAECHFVHYNSKYESLETAVGHPDGLAVVGFLLET 583
           F  +  HW     +  EH ++G  Y  E H VH +++  +           AV+G L + 
Sbjct: 119 FNLVQCHWH----SPSEHTVNGTRYDLELHMVHTSARGRT-----------AVIGVLYK- 162

Query: 582 VDAPNPRFDRLVQGLEGIQKRESVMNVTSESLLWMDREDLQIGNYVTYKGSLTTPPYTEC 403
           +  PN    +L+ G++ +  +E  + +     +       Q   +  Y GSLT PP TE 
Sbjct: 163 LGEPNEFLTKLLNGIKAVGNKEINLGMIDPREI-----RFQTRKFYRYIGSLTVPPCTEG 217

Query: 402 VTWIIYEKPVQIGSEQLGLLRQLEGPDSQPIERNVRPTQ 286
           V W + ++   I  EQ+  LRQ         E N RP Q
Sbjct: 218 VIWTVVKRVNTISMEQITALRQAV---DDGFETNSRPVQ 253


>At3g52720.1 68416.m05808 carbonic anhydrase family protein low
           similarity to storage protein (dioscorin) [Dioscorea
           cayenensis] GI:433463; contains Pfam profile PF00194:
           Eukaryotic-type carbonic anhydrase
          Length = 284

 Score = 51.2 bits (117), Expect = 7e-07
 Identities = 50/164 (30%), Positives = 69/164 (42%), Gaps = 3/164 (1%)
 Frame = -3

Query: 768 YIFEQMHFHWSVDDFTGCEHVLDGHGYAAECHFVHYNSKYESLETAVGHPDGLAVVGFLL 589
           Y   QMH+H      T  EH L G  YAAE H VH               DG   V   L
Sbjct: 113 YTLLQMHWH------TPSEHHLHGVQYAAELHMVHQAK------------DGSFAVVASL 154

Query: 588 ETVDAPNPRFDRLVQGLEGIQKRESVMNVTSESLLW-MDREDLQ--IGNYVTYKGSLTTP 418
             +    P   ++ + L  +++     N T++  +  +D   ++     Y  Y GSLTTP
Sbjct: 155 FKIGTEEPFLSQMKEKLVKLKEERLKGNHTAQVEVGRIDTRHIERKTRKYYRYIGSLTTP 214

Query: 417 PYTECVTWIIYEKPVQIGSEQLGLLRQLEGPDSQPIERNVRPTQ 286
           P +E V+W I  K   +  EQ+ LLR    P     + N RP Q
Sbjct: 215 PCSENVSWTILGKVRSMSKEQVELLR---SPLDTSFKNNSRPCQ 255


>At4g21000.1 68417.m03039 carbonic anhydrase family protein similar
           to storage protein (dioscorin) [Dioscorea cayenensis]
           GI:433463; contains Pfam profile PF00194:
           Eukaryotic-type carbonic anhydrase
          Length = 260

 Score = 45.6 bits (103), Expect = 4e-05
 Identities = 35/133 (26%), Positives = 63/133 (47%), Gaps = 2/133 (1%)
 Frame = -3

Query: 771 EYIFEQMHFHWSVDDFTGCEHVLDGHGYAAECHFVHYNSKYESLETAVGHPDGLAVVGFL 592
           +Y   Q H+H      +  EH ++G  Y  E H VH ++  ++            VVG L
Sbjct: 119 DYKLVQCHWH------SPSEHTINGTSYDLELHMVHTSASGKT-----------TVVGVL 161

Query: 591 LETVDAPNPRFDRLVQGLEGIQKRESVMNVTSESLLWMDREDLQI--GNYVTYKGSLTTP 418
            + +  P+    +++ G++G+ K+E  + +       +D  D++    N+  Y GSLT P
Sbjct: 162 YK-LGEPDEFLTKILNGIKGVGKKEIDLGI-------VDPRDIRFETNNFYRYIGSLTIP 213

Query: 417 PYTECVTWIIYEK 379
           P TE V W + ++
Sbjct: 214 PCTEGVIWTVQKR 226


>At2g28210.1 68415.m03425 carbonic anhydrase family protein similar
           to storage protein (dioscorin) [Dioscorea cayenensis]
           GI:433463; contains Pfam profile PF00194:
           Eukaryotic-type carbonic anhydrase
          Length = 217

 Score = 44.4 bits (100), Expect = 9e-05
 Identities = 40/133 (30%), Positives = 60/133 (45%), Gaps = 2/133 (1%)
 Frame = -3

Query: 771 EYIFEQMHFHWSVDDFTGCEHVLDGHGYAAECHFVHYNSKYESLETAVGHPDGLAVVGFL 592
           EY   Q+H+H      +  EH ++G  +A E H VH N               LAVV  L
Sbjct: 95  EYKLLQLHWH------SPSEHTMNGRRFALELHMVHENIN-----------GSLAVVTVL 137

Query: 591 LETVDAPNPRFDRLVQGLEGIQKRESVMNVTSESLLWMDREDLQIGN--YVTYKGSLTTP 418
            + +  P+     L   L  I  +    N   + +  +D  D++IG+  +  Y GSLTTP
Sbjct: 138 YK-IGRPDSFLGLLENKLSAITDQ----NEAEKYVDVIDPRDIKIGSRKFYRYIGSLTTP 192

Query: 417 PYTECVTWIIYEK 379
           P T+ V W + +K
Sbjct: 193 PCTQNVIWTVVKK 205


>At1g08065.1 68414.m00882 carbonic anhydrase family protein similar
           to storage protein (dioscorin) [Dioscorea cayenensis]
           GI:433463; contains Pfam profile PF00194:
           Eukaryotic-type carbonic anhydrase
          Length = 263

 Score = 43.2 bits (97), Expect = 2e-04
 Identities = 44/180 (24%), Positives = 75/180 (41%)
 Frame = -3

Query: 771 EYIFEQMHFHWSVDDFTGCEHVLDGHGYAAECHFVHYNSKYESLETAVGHPDGLAVVGFL 592
           EY  +Q+H+H      +  EH L+G  +  E H VH +    +   A  +   L    + 
Sbjct: 106 EYKLQQIHWH------SPSEHTLNGKRFVLEEHMVHQSKDGRNAVVAFFYK--LGKPDYF 157

Query: 591 LETVDAPNPRFDRLVQGLEGIQKRESVMNVTSESLLWMDREDLQIGNYVTYKGSLTTPPY 412
           L T++       R ++ +    + +  + +        + +     +Y  + GSLTTPP 
Sbjct: 158 LLTLE-------RYLKRITDTHESQEFVEMVHPRTFGFESK-----HYYRFIGSLTTPPC 205

Query: 411 TECVTWIIYEKPVQIGSEQLGLLRQLEGPDSQPIERNVRPTQRHPPGHSVIYVKQVRSKL 232
           +E V W I ++   +  +QL +LR      S     N RP QR       +Y+    SKL
Sbjct: 206 SENVIWTISKEMRTVTLKQLIMLRVTVHDQS---NSNARPLQRKNERPVALYIPTWHSKL 262


>At3g52720.2 68416.m05809 carbonic anhydrase family protein low
           similarity to storage protein (dioscorin) [Dioscorea
           cayenensis] GI:433463; contains Pfam profile PF00194:
           Eukaryotic-type carbonic anhydrase
          Length = 230

 Score = 41.5 bits (93), Expect = 6e-04
 Identities = 40/135 (29%), Positives = 57/135 (42%), Gaps = 3/135 (2%)
 Frame = -3

Query: 768 YIFEQMHFHWSVDDFTGCEHVLDGHGYAAECHFVHYNSKYESLETAVGHPDGLAVVGFLL 589
           Y   QMH+H      T  EH L G  YAAE H VH               DG   V   L
Sbjct: 113 YTLLQMHWH------TPSEHHLHGVQYAAELHMVHQAK------------DGSFAVVASL 154

Query: 588 ETVDAPNPRFDRLVQGLEGIQKRESVMNVTSESLLW-MDREDLQ--IGNYVTYKGSLTTP 418
             +    P   ++ + L  +++     N T++  +  +D   ++     Y  Y GSLTTP
Sbjct: 155 FKIGTEEPFLSQMKEKLVKLKEERLKGNHTAQVEVGRIDTRHIERKTRKYYRYIGSLTTP 214

Query: 417 PYTECVTWIIYEKPV 373
           P +E V+W I  K +
Sbjct: 215 PCSENVSWTILGKVI 229


>At2g28100.1 68415.m03413 glycosyl hydrolase family 29 /
           alpha-L-fucosidase, putative similar to
           alpha-L-fucosidase SP:P10901 from [Dictyostelium
           discoideum]
          Length = 506

 Score = 34.7 bits (76), Expect = 0.069
 Identities = 24/66 (36%), Positives = 31/66 (46%), Gaps = 2/66 (3%)
 Frame = +2

Query: 557 SNLGLGASTVSRRNPTTANPSGWPTAVSKDSYL--LL*WTKWHSAAYPWPSSTCSHPVKS 730
           S  G G +  S  NPT  N S W   ++KDS    ++   K H     WPS    + VKS
Sbjct: 64  SEWGTGKANPSIFNPTHLNASQW-VQIAKDSGFSRVILTAKHHDGFCLWPSEYTDYSVKS 122

Query: 731 STDQWK 748
           S  QW+
Sbjct: 123 S--QWR 126


>At5g13540.2 68418.m01564 expressed protein HERC2 - Homo sapiens,
           EMBL:AF071172; isoform contains non-consensus GG
           acceptor splice site at intron 6
          Length = 788

 Score = 29.9 bits (64), Expect = 2.0
 Identities = 11/23 (47%), Positives = 18/23 (78%)
 Frame = -3

Query: 480 MDREDLQIGNYVTYKGSLTTPPY 412
           ++ E+L+IG++V  K S+TTP Y
Sbjct: 656 IEEEELKIGDWVRVKASITTPTY 678


>At4g17240.1 68417.m02592 expressed protein
          Length = 200

 Score = 29.9 bits (64), Expect = 2.0
 Identities = 14/30 (46%), Positives = 20/30 (66%), Gaps = 1/30 (3%)
 Frame = -3

Query: 690 YAAE-CHFVHYNSKYESLETAVGHPDGLAV 604
           Y +E C++    S Y+SLE + G PDGLA+
Sbjct: 96  YLSESCYWQAQTSSYDSLEFSSGSPDGLAL 125


>At3g19870.1 68416.m02516 expressed protein 
          Length = 1117

 Score = 29.5 bits (63), Expect = 2.6
 Identities = 20/56 (35%), Positives = 29/56 (51%), Gaps = 1/56 (1%)
 Frame = -3

Query: 702 DGHGYAAECHFVHYNSKYESLET-AVGHPDGLAVVGFLLETVDAPNPRFDRLVQGL 538
           DG     E   V + S   S+++ +VGH +  AVV  LL  V+ PN  FDR  + +
Sbjct: 102 DGSSGLKEQAMVSFTSVLVSIDSFSVGHVE--AVVDLLLALVNRPNHGFDRQARAI 155


>At1g12810.1 68414.m01488 proline-rich family protein contains
           proline rich extensin domains, INTERPRO:IPR002965
          Length = 129

 Score = 29.5 bits (63), Expect = 2.6
 Identities = 15/42 (35%), Positives = 17/42 (40%), Gaps = 1/42 (2%)
 Frame = +3

Query: 519 PSSESPPDLVLVYRTSGWERRQSPGGIRPQPIHRDG-PPPSP 641
           P S  PP     Y   G+     P G    P H +G PPP P
Sbjct: 8   PESYPPPGYQSHYPPPGYPSAPPPPGYPSPPSHHEGYPPPQP 49


>At3g52710.1 68416.m05807 expressed protein predicted protein,
           Arabidopsis thaliana
          Length = 289

 Score = 28.7 bits (61), Expect = 4.5
 Identities = 13/30 (43%), Positives = 17/30 (56%)
 Frame = -2

Query: 604 GRIPPGDCRRSQPEVR*TSTRSGGDSEEGV 515
           GR+PPGD  +S P+   T +R   D  E V
Sbjct: 238 GRLPPGDVGKSSPQRNSTGSRRSIDGGEPV 267


  Database: arabidopsis
    Posted date:  Oct 4, 2007 10:56 AM
  Number of letters in database: 12,070,560
  Number of sequences in database:  28,952
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 17,176,123
Number of Sequences: 28952
Number of extensions: 384557
Number of successful extensions: 1214
Number of sequences better than 10.0: 14
Number of HSP's better than 10.0 without gapping: 1120
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1204
length of database: 12,070,560
effective HSP length: 80
effective length of database: 9,754,400
effective search space used: 1726528800
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
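
The statistics above can be cross-checked against the scores reported for each hit. The sketch below is a minimal illustration, not part of the BLAST output. It assumes the standard Karlin-Altschul relations (bit score = (lambda * S - ln K) / ln 2, and E roughly equal to search space * 2^-bits), and it assumes that the effective query length is the translated query length (773 nt taken as 257 residues) minus the effective HSP length. The function names are hypothetical. With these assumptions the numbers reproduce the "effective search space used" line and the top hit's 60.5 bits; E-values computed this way are approximate and may differ slightly from the reported values for weaker hits.

```python
import math

# Values copied from the report footer and header above.
LAMBDA_GAPPED = 0.279      # gapped Lambda
K_GAPPED = 0.0580          # gapped K
QUERY_NT = 773             # query length in nucleotides
DB_LETTERS = 12_070_560    # length of database
DB_SEQS = 28_952           # number of sequences in database
HSP_LEN = 80               # effective HSP length


def bit_score(raw_score):
    """Convert a raw alignment score to bits (Karlin-Altschul normalization)."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def effective_search_space():
    """Assumed recipe: (translated query - HSP length) * (db - seqs * HSP length)."""
    eff_query = QUERY_NT // 3 - HSP_LEN          # 257 - 80 residues
    eff_db = DB_LETTERS - DB_SEQS * HSP_LEN      # matches "effective length of database"
    return eff_query * eff_db


def expect(bits):
    """Approximate E-value for a given bit score."""
    return effective_search_space() * 2.0 ** (-bits)


if __name__ == "__main__":
    # Raw scores of the two best hits, taken from the "Score =" lines.
    for raw in (140, 128):
        bits = bit_score(raw)
        print(f"raw={raw}  bits={bits:.1f}  E~{expect(bits):.1e}")
```

Running the script prints the bit score and an approximate E-value for the two best hits, which can be compared with the "Score =" and "Expect =" lines in the alignments.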

- SilkBase 1999-2023 -