BLASTX 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= fmgV11l24r
         (759 letters)

Database: arabidopsis
           28,952 sequences; 12,070,560 total letters

Searching..................................................done

                                                                     Score  E
Sequences producing significant alignments:                          (bits) Value

At1g08080.1 68414.m00884 carbonic anhydrase family protein simil...    57   1e-08
At5g04180.1 68418.m00406 carbonic anhydrase family protein simil...    55   4e-08
At4g20990.1 68417.m03038 carbonic anhydrase family protein simil...    50   1e-06
At3g52720.1 68416.m05808 carbonic anhydrase family protein low s...    50   2e-06
At4g21000.1 68417.m03039 carbonic anhydrase family protein simil...    45   6e-05
At2g28210.1 68415.m03425 carbonic anhydrase family protein simil...    42   4e-04
At3g52720.2 68416.m05809 carbonic anhydrase family protein low s...    40   0.001
At1g08065.1 68414.m00882 carbonic anhydrase family protein simil...    40   0.002
At2g28100.1 68415.m03413 glycosyl hydrolase family 29 / alpha-L-...    35   0.067
At5g13540.2 68418.m01564 expressed protein HERC2 - Homo sapiens,...    30   1.9
At4g17240.1 68417.m02592 expressed protein                             30   1.9
At3g19870.1 68416.m02516 expressed protein                             29   2.5
At1g12810.1 68414.m01488 proline-rich family protein contains pr...    29   2.5
At3g52710.1 68416.m05807 expressed protein predicted protein, Ar...    29   4.4

>At1g08080.1 68414.m00884 carbonic anhydrase family protein similar to
           storage protein (dioscorin) [Dioscorea cayenensis] GI:433463;
           contains Pfam profile PF00194: Eukaryotic-type carbonic anhydrase
          Length = 275

 Score = 56.8 bits (131), Expect = 1e-08
 Identities = 47/161 (29%), Positives = 75/161 (46%), Gaps = 2/161 (1%)
 Frame = -1

Query: 759 EQMHFHWSVDDFTGCEHVLDGHGYAAECHFVHYNSKYESLETAVGHPDGLAVVGFLLETV 580
           +Q+H+H + EH ++G +A E H VH G +AVV L +
Sbjct: 127 QQLHWH------SPSEHTINGRRFALELHMVH-----------EGRNRRMAVVTVLYKI- 168

Query: 579 DAPNPRFDRLVQGLEGIQKRESVMNVTSESLLWMDREDLQIGN--YVTYKGSLTTPPYTE 406
           R D ++ LE + + M +++ +D ++IG+ Y Y GSLTTPP T+
Sbjct: 169 ----GRADTFIRSLEKELEGIAEMEEAEKNVGMIDPTKIKIGSRKYYRYTGSLTTPPCTQ 224

Query: 405 CVTWIIYEKPVQIGSEQLGLLRQLEGPDSQPIERNVRPTQR 283
           VTW + K + +Q+ LLR D+ R V+PT +
Sbjct: 225 NVTWSVVRKVRTVTRKQVKLLRVAVHDDANSNARPVQPTNK 265

>At5g04180.1 68418.m00406 carbonic anhydrase family protein similar to
           storage protein (dioscorin) [Dioscorea cayenensis] GI:433463;
           contains Pfam profile PF00194: Eukaryotic-type carbonic anhydrase
          Length = 277

 Score = 55.2 bits (127), Expect = 4e-08
 Identities = 51/163 (31%), Positives = 72/163 (44%), Gaps = 4/163 (2%)
 Frame = -1

Query: 714 EHVLDGHGYAAECHFVHYNSKYESLETAVGHPDGLAVVGFLLETVDAPNPRFDRL---VQ 544
           EH LDG A E H VH +S+E GH LAV+G L + PN R+ +
Sbjct: 123 EHFLDGQRLAMELHMVH-----KSVE---GH---LAVIGVLFREGE-PNAFISRIMDKIH 170

Query: 543 GLEGIQKRE-SVMNVTSESLLWMDREDLQIGNYVTYKGSLTTPPYTECVTWIIYEKPVQI 367
           + +Q E S+ + W + + Y+GSLTTPP TE V W I K +
Sbjct: 171 KIADVQDGEVSIGKIDPREFGW------DLTKFYEYRGSLTTPPCTEDVMWTIINKVGTV 224

Query: 366 GSEQLGLLRQLEGPDSQPIERNVRPTQRHPPGHSVIYVKQVRS 238
           EQ+ +L E+N RP Q P ++Y+ + S
Sbjct: 225 SREQIDVLTDAR---RGGYEKNARPAQ--PLNGRLVYLNEQSS 262

>At4g20990.1 68417.m03038 carbonic anhydrase family protein similar to
           storage protein (dioscorin) [Dioscorea cayenensis] GI:433463;
           contains Pfam profile PF00194: Eukaryotic-type carbonic anhydrase
          Length = 267

 Score = 50.4 bits (115), Expect = 1e-06
 Identities = 40/143 (27%), Positives = 63/143 (44%)
 Frame = -1

Query: 714 EHVLDGHGYAAECHFVHYNSKYESLETAVGHPDGLAVVGFLLETVDAPNPRFDRLVQGLE 535
           EH ++G Y E H VH +++ + AV+G L + + PN +L+ G++
Sbjct: 131 EHTVNGTRYDLELHMVHTSARGRT-----------AVIGVLYK-LGEPNEFLTKLLNGIK 178

Query: 534 GIQKRESVMNVTSESLLWMDREDLQIGNYVTYKGSLTTPPYTECVTWIIYEKPVQIGSEQ 355
           + +E + + + Q + Y GSLT PP TE V W + ++ I EQ
Sbjct: 179 AVGNKEINLGMIDPREI-----RFQTRKFYRYIGSLTVPPCTEGVIWTVVKRVNTISMEQ 233

Query: 354 LGLLRQLEGPDSQPIERNVRPTQ 286
           + LRQ E N RP Q
Sbjct: 234 ITALRQAV---DDGFETNSRPVQ 253

>At3g52720.1 68416.m05808 carbonic anhydrase family protein low similarity
           to storage protein (dioscorin) [Dioscorea cayenensis] GI:433463;
           contains Pfam profile PF00194: Eukaryotic-type carbonic anhydrase
          Length = 284

 Score = 50.0 bits (114), Expect = 2e-06
 Identities = 49/160 (30%), Positives = 68/160 (42%), Gaps = 3/160 (1%)
 Frame = -1

Query: 756 QMHFHWSVDDFTGCEHVLDGHGYAAECHFVHYNSKYESLETAVGHPDGLAVVGFLLETVD 577
           QMH+H T EH L G YAAE H VH DG V L +
Sbjct: 117 QMHWH------TPSEHHLHGVQYAAELHMVHQAK------------DGSFAVVASLFKIG 158

Query: 576 APNPRFDRLVQGLEGIQKRESVMNVTSESLLW-MDREDLQ--IGNYVTYKGSLTTPPYTE 406
           P ++ + L +++ N T++ + +D ++ Y Y GSLTTPP +E
Sbjct: 159 TEEPFLSQMKEKLVKLKEERLKGNHTAQVEVGRIDTRHIERKTRKYYRYIGSLTTPPCSE 218

Query: 405 CVTWIIYEKPVQIGSEQLGLLRQLEGPDSQPIERNVRPTQ 286
           V+W I K + EQ+ LLR P + N RP Q
Sbjct: 219 NVSWTILGKVRSMSKEQVELLR---SPLDTSFKNNSRPCQ 255

>At4g21000.1 68417.m03039 carbonic anhydrase family protein similar to
           storage protein (dioscorin) [Dioscorea cayenensis] GI:433463;
           contains Pfam profile PF00194: Eukaryotic-type carbonic anhydrase
          Length = 260

 Score = 44.8 bits (101), Expect = 6e-05
 Identities = 31/114 (27%), Positives = 56/114 (49%), Gaps = 2/114 (1%)
 Frame = -1

Query: 714 EHVLDGHGYAAECHFVHYNSKYESLETAVGHPDGLAVVGFLLETVDAPNPRFDRLVQGLE 535
           EH ++G Y E H VH ++ ++ VVG L + + P+ +++ G++
Sbjct: 132 EHTINGTSYDLELHMVHTSASGKT-----------TVVGVLYK-LGEPDEFLTKILNGIK 179

Query: 534 GIQKRESVMNVTSESLLWMDREDLQI--GNYVTYKGSLTTPPYTECVTWIIYEK 379
           G+ K+E + + +D D++ N+ Y GSLT PP TE V W + ++
Sbjct: 180 GVGKKEIDLGI-------VDPRDIRFETNNFYRYIGSLTIPPCTEGVIWTVQKR 226

>At2g28210.1 68415.m03425 carbonic anhydrase family protein similar to
           storage protein (dioscorin) [Dioscorea cayenensis] GI:433463;
           contains Pfam profile PF00194: Eukaryotic-type carbonic anhydrase
          Length = 217

 Score = 41.9 bits (94), Expect = 4e-04
 Identities = 37/127 (29%), Positives = 56/127 (44%), Gaps = 2/127 (1%)
 Frame = -1

Query: 753 MHFHWSVDDFTGCEHVLDGHGYAAECHFVHYNSKYESLETAVGHPDGLAVVGFLLETVDA 574
           + HW + EH ++G +A E H VH N LAVV L + +
Sbjct: 99  LQLHWH----SPSEHTMNGRRFALELHMVHENIN-----------GSLAVVTVLYK-IGR 142

Query: 573 PNPRFDRLVQGLEGIQKRESVMNVTSESLLWMDREDLQIGN--YVTYKGSLTTPPYTECV 400
           P+ L L I + N + + +D D++IG+ + Y GSLTTPP T+ V
Sbjct: 143 PDSFLGLLENKLSAITDQ----NEAEKYVDVIDPRDIKIGSRKFYRYIGSLTTPPCTQNV 198

Query: 399 TWIIYEK 379
           W + +K
Sbjct: 199 IWTVVKK 205

>At3g52720.2 68416.m05809 carbonic anhydrase family protein low similarity
           to storage protein (dioscorin) [Dioscorea cayenensis] GI:433463;
           contains Pfam profile PF00194: Eukaryotic-type carbonic anhydrase
          Length = 230

 Score = 40.3 bits (90), Expect = 0.001
 Identities = 39/131 (29%), Positives = 56/131 (42%), Gaps = 3/131 (2%)
 Frame = -1

Query: 756 QMHFHWSVDDFTGCEHVLDGHGYAAECHFVHYNSKYESLETAVGHPDGLAVVGFLLETVD 577
           QMH+H T EH L G YAAE H VH DG V L +
Sbjct: 117 QMHWH------TPSEHHLHGVQYAAELHMVHQAK------------DGSFAVVASLFKIG 158

Query: 576 APNPRFDRLVQGLEGIQKRESVMNVTSESLLW-MDREDLQ--IGNYVTYKGSLTTPPYTE 406
           P ++ + L +++ N T++ + +D ++ Y Y GSLTTPP +E
Sbjct: 159 TEEPFLSQMKEKLVKLKEERLKGNHTAQVEVGRIDTRHIERKTRKYYRYIGSLTTPPCSE 218

Query: 405 CVTWIIYEKPV 373
           V+W I K +
Sbjct: 219 NVSWTILGKVI 229

>At1g08065.1 68414.m00882 carbonic anhydrase family protein similar to
           storage protein (dioscorin) [Dioscorea cayenensis] GI:433463;
           contains Pfam profile PF00194: Eukaryotic-type carbonic anhydrase
          Length = 263

 Score = 39.9 bits (89), Expect = 0.002
 Identities = 26/74 (35%), Positives = 36/74 (48%)
 Frame = -1

Query: 453 NYVTYKGSLTTPPYTECVTWIIYEKPVQIGSEQLGLLRQLEGPDSQPIERNVRPTQRHPP 274
           +Y + GSLTTPP +E V W I ++ + +QL +LR S N RP QR
Sbjct: 192 HYYRFIGSLTTPPCSENVIWTISKEMRTVTLKQLIMLRVTVHDQS---NSNARPLQRKNE 248

Query: 273 GHSVIYVKQVRSKL 232
           +Y+ SKL
Sbjct: 249 RPVALYIPTWHSKL 262

>At2g28100.1 68415.m03413 glycosyl hydrolase family 29 / alpha-L-fucosidase,
           putative similar to alpha-L-fucosidase SP:P10901 from
           [Dictyostelium discoideum]
          Length = 506

 Score = 34.7 bits (76), Expect = 0.067
 Identities = 24/66 (36%), Positives = 31/66 (46%), Gaps = 2/66 (3%)
 Frame = +2

Query: 557 SNLGLGASTVSRRNPTTANPSGWPTAVSKDSYL--LL*WTKWHSAAYPWPSSTCSHPVKS 730
           S G G + S NPT N S W ++KDS ++ K H WPS + VKS
Sbjct: 64  SEWGTGKANPSIFNPTHLNASQW-VQIAKDSGFSRVILTAKHHDGFCLWPSEYTDYSVKS 122

Query: 731 STDQWK 748
           S QW+
Sbjct: 123 S--QWR 126

>At5g13540.2 68418.m01564 expressed protein HERC2 - Homo sapiens,
           EMBL:AF071172; isoform contains non-consensus GG acceptor splice
           site at intron 6
          Length = 788

 Score = 29.9 bits (64), Expect = 1.9
 Identities = 11/23 (47%), Positives = 18/23 (78%)
 Frame = -1

Query: 480 MDREDLQIGNYVTYKGSLTTPPY 412
           ++ E+L+IG++V K S+TTP Y
Sbjct: 656 IEEEELKIGDWVRVKASITTPTY 678

>At4g17240.1 68417.m02592 expressed protein
          Length = 200

 Score = 29.9 bits (64), Expect = 1.9
 Identities = 14/30 (46%), Positives = 20/30 (66%), Gaps = 1/30 (3%)
 Frame = -1

Query: 690 YAAE-CHFVHYNSKYESLETAVGHPDGLAV 604
           Y +E C++ S Y+SLE + G PDGLA+
Sbjct: 96  YLSESCYWQAQTSSYDSLEFSSGSPDGLAL 125

>At3g19870.1 68416.m02516 expressed protein
          Length = 1117

 Score = 29.5 bits (63), Expect = 2.5
 Identities = 20/56 (35%), Positives = 29/56 (51%), Gaps = 1/56 (1%)
 Frame = -1

Query: 702 DGHGYAAECHFVHYNSKYESLET-AVGHPDGLAVVGFLLETVDAPNPRFDRLVQGL 538
           DG E V + S S+++ +VGH + AVV LL V+ PN FDR + +
Sbjct: 102 DGSSGLKEQAMVSFTSVLVSIDSFSVGHVE--AVVDLLLALVNRPNHGFDRQARAI 155

>At1g12810.1 68414.m01488 proline-rich family protein contains proline rich
           extensin domains, INTERPRO:IPR002965
          Length = 129

 Score = 29.5 bits (63), Expect = 2.5
 Identities = 15/42 (35%), Positives = 17/42 (40%), Gaps = 1/42 (2%)
 Frame = +3

Query: 519 PSSESPPDLVLVYRTSGWERRQSPGGIRPQPIHRDG-PPPSP 641
           P S PP Y G+ P G P H +G PPP P
Sbjct: 8   PESYPPPGYQSHYPPPGYPSAPPPPGYPSPPSHHEGYPPPQP 49

>At3g52710.1 68416.m05807 expressed protein predicted protein, Arabidopsis
           thaliana
          Length = 289

 Score = 28.7 bits (61), Expect = 4.4
 Identities = 13/30 (43%), Positives = 17/30 (56%)
 Frame = -3

Query: 604 GRIPPGDCRRSQPEVR*TSTRSGGDSEEGV 515
           GR+PPGD +S P+ T +R D E V
Sbjct: 238 GRLPPGDVGKSSPQRNSTGSRRSIDGGEPV 267

  Database: arabidopsis
    Posted date: Oct 4, 2007 10:56 AM
  Number of letters in database: 12,070,560
  Number of sequences in database: 28,952

Lambda     K        H
   0.318    0.134    0.401

Gapped
Lambda     K        H
   0.279   0.0580    0.190

Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 16,824,590
Number of Sequences: 28952
Number of extensions: 376196
Number of successful extensions: 1169
Number of sequences better than 10.0: 14
Number of HSP's better than 10.0 without gapping: 1078
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1160
length of database: 12,070,560
effective HSP length: 79
effective length of database: 9,783,352
effective search space used: 1692519896
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)