SilkBase

Last updated: 2022/11/18
BLASTP 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= BGIBMGA001781-TA|BGIBMGA001781-PA|IPR012934|Zinc finger,
AD-type
         (1137 letters)

Database: arabidopsis 
           28,952 sequences; 12,070,560 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

At4g05280.1 68417.m00799 Ulp1 protease family protein contains P...    38   0.038
At3g23780.1 68416.m02989 DNA-directed RNA polymerase family prot...    36   0.20 
At4g00760.1 68417.m00106 two-component responsive regulator fami...    34   0.47 
At2g22795.1 68415.m02704 expressed protein                             34   0.47 
At3g18090.1 68416.m02300 DNA-directed RNA polymerase family prot...    33   1.1  
At3g07340.1 68416.m00875 basic helix-loop-helix (bHLH) family pr...    33   1.1  
At5g14370.1 68418.m01679 expressed protein                             33   1.4  
At1g21630.1 68414.m02708 calcium-binding EF hand family protein ...    33   1.4  
At5g62110.1 68418.m07796 hypothetical protein                          32   2.5  
At5g15560.1 68418.m01822 hypothetical protein                          32   2.5  
At1g72480.1 68414.m08381 expressed protein                             32   2.5  
At3g42530.1 68416.m04410 Ulp1 protease family similar to At5g281...    31   4.4  
At2g14010.1 68415.m01557 hypothetical protein  and genefinder          31   4.4  
At3g10360.1 68416.m01242 pumilio/Puf RNA-binding domain-containi...    31   5.8  
At3g55990.1 68416.m06221 expressed protein contains Pfam profile...    30   7.7  
At1g45616.1 68414.m05200 leucine-rich repeat family protein cont...    30   7.7  

>At4g05280.1 68417.m00799 Ulp1 protease family protein contains Pfam
           profile PF02902: Ulp1 protease family, C-terminal
           catalytic domain; similar to At3g24380, At5g36840,
           At5g35010, At3g42740, At4g05290, At2g14770, At3g43390,
           At2g05560, At4g08880, At1g34730, At1g27790, At1g34740,
           At1g27780, At5g36850, At3g42730, At1g52020, At3g24390,
           At1g25886, At4g03300
          Length = 1312

 Score = 37.9 bits (84), Expect = 0.038
 Identities = 32/136 (23%), Positives = 62/136 (45%), Gaps = 9/136 (6%)

Query: 318 DICNPTDVQIVFMQNEDINENSTECFLSEVPHEMSDICNPTNVEIVFVQNEDINENSTEC 377
           D+   TDV++V M+   I E S     +E P+  SD  + T++     +N D ++N+   
Sbjct: 765 DLAAATDVEVV-MEEPGIVEGSVG---TEDPNPGSDEADKTDIP----KNNDESDNAVAV 816

Query: 378 FLNEVLHEMSDICNPTNVEIVFAQAEDINENSGRVNQDLISSVDHQQQDAVVNAKSVFRE 437
              E       +      ++V+ Q +D++ +  +    L+  V +QQ + V+ A++V   
Sbjct: 817 EAKEEKKSSPKVPKKVKNQLVYEQ-DDVHPHGFKAKTVLVPDVPNQQIEVVIRAENVSYN 875

Query: 438 KFQRWSPKKLKTKSKI 453
                 P K++  S I
Sbjct: 876 PLDMVDPSKVEEFSNI 891


>At3g23780.1 68416.m02989 DNA-directed RNA polymerase family protein
           similar to SP|P38420 DNA-directed RNA polymerase II 135
           kDa polypeptide (EC 2.7.7.6) (RNA polymerase II subunit
           2) {Arabidopsis thaliana}; contains Pfam profiles
           PF04560: RNA polymerase Rpb2 domain 7, PF04561: RNA
           polymerase Rpb2 domain 2, PF04565: RNA polymerase Rpb2
           domain 3, PF04566: RNA polymerase Rpb2 domain 4,
           PF04567: RNA polymerase Rpb2 domain 5
          Length = 946

 Score = 35.5 bits (78), Expect = 0.20
 Identities = 44/175 (25%), Positives = 76/175 (43%), Gaps = 16/175 (9%)

Query: 355 CNPTNVEIVFVQNEDINENSTECFLNEVLHEMSDICNPTNVEIVFAQAEDINENSGRVNQ 414
           C+  + ++ + Q       ++EC   EVL    +     NV + + Q + I  N   + +
Sbjct: 512 CDTLSQQLFYPQKPLFKTLASECLKKEVLFNGQNAIVAVNVHLGYNQEDSIVMNKASLER 571

Query: 415 DLISSVDHQQQDAVVNAKSVFREKFQRWSPKKLKTKSKI----SLDKEKFTTCFVDDLPR 470
            +  S   +   A V+AK   + K      +  KT SKI    SL+ + F   F+     
Sbjct: 572 GMFRSEQIRSYKAEVDAKDSEKRKKMDELVQFGKTHSKIGKVDSLEDDGFP--FI-GANM 628

Query: 471 ATSDI----CNPTNA-HSVFVQDEDKNENSGRVNQDLISSVDHQQQDALVNVRSV 520
           +T DI    C  + A HS+ +    K+   G V + ++SS D  +  A V++R V
Sbjct: 629 STGDIVIGRCTESGADHSIKL----KHTERGIVQKVVLSSNDEGKNFAAVSLRQV 679


>At4g00760.1 68417.m00106 two-component responsive regulator family
           protein / response regulator family protein contains
           Pfam profile: PF00072 response regulator receiver domain
          Length = 367

 Score = 34.3 bits (75), Expect = 0.47
 Identities = 36/157 (22%), Positives = 61/157 (38%), Gaps = 6/157 (3%)

Query: 676 SDISKPKNLQSAFVQDEDKNEDSGRDIQISXXXXXXXXXXXXXEKNGFS-GSKEDIETAF 734
           +++ K  N      +D DK +D                     + NG   G ++  +  F
Sbjct: 158 AELKKNNNNSEVETEDLDKYKDELGQGNKRKERADTDTGEHTEKNNGSDLGDQKKPKLLF 217

Query: 735 EDDDLNEYLQTEVSSPPRSKITNNKPTAISSNADCSG--SPKASVDLKDIAKNEVNQ--D 790
            DD  NE L+  V +   +      PT I  N + S   SP+     +++ K       D
Sbjct: 218 ADDLQNETLEA-VPNIEEANNERKAPTEIKKNGESSEKKSPELVCMEEELQKWSAESFID 276

Query: 791 LTSSVDHQQQDALVNARSVPKRNSQHRSPKKLKTNNI 827
           LT+SV+++ QD L +       +    SP +   NN+
Sbjct: 277 LTASVENESQDPLESVGDSVGPHEIPLSPPESSNNNV 313


>At2g22795.1 68415.m02704 expressed protein
          Length = 734

 Score = 34.3 bits (75), Expect = 0.47
 Identities = 36/241 (14%), Positives = 90/241 (37%), Gaps = 6/241 (2%)

Query: 668 VNELPRAKSDISKPKNLQSAFVQDEDKNEDSGRDIQISXXXXXXXXXXXXXEKNGFSGSK 727
           V    + K++  + + ++S+F+++  + ED  ++ + S             + N  S S+
Sbjct: 477 VESSSQEKNEDKETEKIESSFLEETKEKEDETKEKEESSSQEKTEEKETETKDNEESSSQ 536

Query: 728 EDIETAFEDDDLNEYLQTEVSSPPRSKITNNKPTAISSNADCSGSPKASVDLKDIAKNEV 787
           E+ +     D  NE ++ E +S       N   T     +      K   + K   +   
Sbjct: 537 EETK-----DKENEKIEKEEASSQEESKENETETKEKEESSSQEETKEKENEKIEKEESA 591

Query: 788 NQDLTSSVDHQQQDALVNARSVPKRNSQHRSPKKLKTNNIQISXXXXXXXXXXXXXETNG 847
            Q+ T   ++++ +   +A     +  +  + +K ++++ +               E N 
Sbjct: 592 PQEETKEKENEKIEKEESASQEETKEKETETKEKEESSSNESQENVNTESEKKEQVEENE 651

Query: 848 FSGSKEDIETAFEDD-DLNEYLQTEVSSPPRSKITNNKPTAISSNADCSGSPKASVDLED 906
               ++  E++ E+     E  Q+E +S       N +       +D S       +++D
Sbjct: 652 KKTDEDTSESSKENSVSDTEQKQSEETSEKEESNKNGETEVTQEQSDSSSDTNLPQEVKD 711

Query: 907 I 907
           +
Sbjct: 712 V 712


>At3g18090.1 68416.m02300 DNA-directed RNA polymerase family protein
           similar to SP|P38420 DNA-directed RNA polymerase II 135
           kDa polypeptide (EC 2.7.7.6) (RNA polymerase II subunit
           2) {Arabidopsis thaliana}; contains Pfam profiles
           PF04560: RNA polymerase Rpb2 domain 7, PF04561: RNA
           polymerase Rpb2 domain 2, PF04565: RNA polymerase Rpb2
           domain 3, PF04566: RNA polymerase Rpb2 domain 4,
           PF04567: RNA polymerase Rpb2 domain 5
          Length = 1038

 Score = 33.1 bits (72), Expect = 1.1
 Identities = 43/175 (24%), Positives = 75/175 (42%), Gaps = 16/175 (9%)

Query: 355 CNPTNVEIVFVQNEDINENSTECFLNEVLHEMSDICNPTNVEIVFAQAEDINENSGRVNQ 414
           C+  + ++ + Q       ++EC   EVL    +     NV + + Q + I  N   + +
Sbjct: 603 CDTLSQQLFYPQKPLFKTLASECLEKEVLFNGQNAIVAVNVHLGYNQEDSIVMNKASLER 662

Query: 415 DLISSVDHQQQDAVVNAKSVFREKFQRWSPKKLKTKSKI----SLDKEKFTTCFVDDLPR 470
            +  S   +   A V+ K   + K      +  KT SKI    SL+ + F   F+     
Sbjct: 663 GMFRSEQIRSYKAEVDTKDSEKRKKMDELVQFGKTYSKIGKVDSLEDDGFP--FI-GANM 719

Query: 471 ATSDI----CNPTNA-HSVFVQDEDKNENSGRVNQDLISSVDHQQQDALVNVRSV 520
           +T DI    C  + A HS+ +    K+   G V + ++SS D  +  A V++R V
Sbjct: 720 STGDIVIGRCTESGADHSIKL----KHTERGIVQKVVLSSNDEGKNFAAVSLRQV 770


>At3g07340.1 68416.m00875 basic helix-loop-helix (bHLH) family
           protein contains Pfam profile: PF00010 helix-loop-helix
           DNA-binding domain
          Length = 456

 Score = 33.1 bits (72), Expect = 1.1
 Identities = 21/62 (33%), Positives = 31/62 (50%), Gaps = 2/62 (3%)

Query: 746 EVSSPPRSKITNNKPTAISSNADCSGSPKASVDLKDIAKNEVNQDLTSSVDHQQQDALVN 805
           E+S   ++K   N P+A+SS+ +     K   D K   K+E N D T S+D  +    V 
Sbjct: 202 ELSRKRKTKSKQNSPSAVSSSKEIE--EKEDSDPKRCKKSEENGDKTKSIDPYKDYIHVR 259

Query: 806 AR 807
           AR
Sbjct: 260 AR 261


>At5g14370.1 68418.m01679 expressed protein
          Length = 339

 Score = 32.7 bits (71), Expect = 1.4
 Identities = 18/77 (23%), Positives = 36/77 (46%)

Query: 617 TRRIKKRKNLQRAFVQDDDKNENSGRGEYSNSASPLLVHVGVRVRKFTACLVNELPRAKS 676
           ++RIKKRKN +     +D  + N        + +     + +++   T+  +NE PR+K 
Sbjct: 20  SKRIKKRKNREATTTMEDKSSSNLDASRKIRTKTKKPKFLSLKLELNTSHEINENPRSKK 79

Query: 677 DISKPKNLQSAFVQDED 693
              K  N + +  ++ D
Sbjct: 80  SKKKNNNKKQSKKKEPD 96


>At1g21630.1 68414.m02708 calcium-binding EF hand family protein
           contains INTERPRO:IPR002048 calcium-binding EF-hand
           domain; ESTs gb|T44428 and gb|AA395440 come from this
           gene
          Length = 1218

 Score = 32.7 bits (71), Expect = 1.4
 Identities = 29/132 (21%), Positives = 50/132 (37%), Gaps = 7/132 (5%)

Query: 678 ISKPKNLQSAFVQDEDKNEDSGRDIQISXXXXXXXXXXXXXEKNG--FSGSKEDIETAFE 735
           I+ PK   SA+ ++ D +   G D+  S             E++     G   D++   +
Sbjct: 790 IAPPKEKSSAWRKEVDVSSKEGEDVSFSDADSKTGKKQSSGEEDSEQSEGKTSDVDARDK 849

Query: 736 D---DDLNEYLQTEVSSPPRSKITNNKPTAISSNADCSGSPKASVDLKD--IAKNEVNQD 790
           +   DD       E  S PR+K T ++       +  S     + D  D   + + VN D
Sbjct: 850 NGSLDDSKVRKGIEADSSPRTKDTRSENGHDDGESTASAGKTVNYDSHDETDSVSSVNPD 909

Query: 791 LTSSVDHQQQDA 802
                DH + D+
Sbjct: 910 NGKDKDHGKYDS 921


>At5g62110.1 68418.m07796 hypothetical protein
          Length = 691

 Score = 31.9 bits (69), Expect = 2.5
 Identities = 23/77 (29%), Positives = 34/77 (44%), Gaps = 6/77 (7%)

Query: 264 NINENSTECFLNEVTHETSDICNPMNPQIVFVQN-EDINGNSTECFLNEVPHETSDICNP 322
           N+N+NS E  LN +        N MN  I+ V     +NGNS E  LN +   +      
Sbjct: 539 NVNDNSNEETLNSIAAN-----NEMNFTILDVNTINQVNGNSNEETLNSIVANSEINFTT 593

Query: 323 TDVQIVFMQNEDINENS 339
            DV  +   + + NE +
Sbjct: 594 LDVNTINQVDNNFNEET 610


>At5g15560.1 68418.m01822 hypothetical protein
          Length = 239

 Score = 31.9 bits (69), Expect = 2.5
 Identities = 22/81 (27%), Positives = 37/81 (45%), Gaps = 4/81 (4%)

Query: 682 KNLQSAFVQDEDKNEDSGR----DIQISXXXXXXXXXXXXXEKNGFSGSKEDIETAFEDD 737
           K L ++  +DE+K ED  +    D + +               N  S  K+   T  +DD
Sbjct: 36  KPLVASGKEDEEKQEDGNKTKREDDESNIEEFIKFNSKEQDISNVTSKQKDKATTEDDDD 95

Query: 738 DLNEYLQTEVSSPPRSKITNN 758
           D+ E +Q ++S   RS+ +NN
Sbjct: 96  DVKEIIQDDISICCRSEASNN 116


>At1g72480.1 68414.m08381 expressed protein 
          Length = 509

 Score = 31.9 bits (69), Expect = 2.5
 Identities = 24/71 (33%), Positives = 33/71 (46%), Gaps = 4/71 (5%)

Query: 1033 WKEKKNSVPNKFSGRNGDIETAFEDDDLNEYLQTEVSSPPSLKISNNKPTAISSNADCSG 1092
            W   +NS    FSG +GD    FE DD    L  + S  PS ++ N   T +   AD  G
Sbjct: 441  WAPSQNSTRYAFSGSSGDTSAEFEKDDYTLTL-IKPSPIPSHEVKNLSETRLLP-AD-EG 497

Query: 1093 SPNASVDLKDK 1103
             P   ++ +DK
Sbjct: 498  EPEKDLE-EDK 507


>At3g42530.1 68416.m04410 Ulp1 protease family similar to At5g28170,
           At1g35110, At1g44880, At4g19320, At5g36020, At4g03970,
           At3g43010, At2g10350; contains Pfam profile PF02902:
           Ulp1 protease family, C-terminal catalytic domain
          Length = 889

 Score = 31.1 bits (67), Expect = 4.4
 Identities = 18/70 (25%), Positives = 31/70 (44%), Gaps = 1/70 (1%)

Query: 728 EDIETAFEDDDLNEYLQTEVSSPPRSKITNNKPTAISSNADCS-GSPKASVDLKDIAKNE 786
           + ++ A +  D    + TE++  P         +  S+N D + G+P    D+KD   NE
Sbjct: 480 DHLDKAQDSSDSTPLINTEITDDPMDVFVTPLQSEHSNNDDANEGNPVYDTDVKDQNANE 539

Query: 787 VNQDLTSSVD 796
            + D    VD
Sbjct: 540 EDVDSQMQVD 549


>At2g14010.1 68415.m01557 hypothetical protein  and genefinder
          Length = 833

 Score = 31.1 bits (67), Expect = 4.4
 Identities = 19/71 (26%), Positives = 30/71 (42%), Gaps = 1/71 (1%)

Query: 1032 GWKEKKNSVPNKFSGRNGDIETAFEDDDLNEYLQTEVSSPPSLKISNNKPTAISSNADCS 1091
            G  E+  + P   S    ++     DD ++E     +S PP L    N  +++  NA   
Sbjct: 559  GKDEETTAAPPTISDA-ANLSKPSADDIVSERYAERISDPPGLNDEANLSSSVMRNASDP 617

Query: 1092 GSPNASVDLKD 1102
            GSP   V  +D
Sbjct: 618  GSPPTEVVARD 628


>At3g10360.1 68416.m01242 pumilio/Puf RNA-binding domain-containing
           protein similar to RNA binding protein PufA GB:AAD39751
           [Dictyostelium discoideum] and similar to Pumilio
           protein GB:A46221 [Drosophila sp.]
          Length = 1003

 Score = 30.7 bits (66), Expect = 5.8
 Identities = 22/90 (24%), Positives = 40/90 (44%), Gaps = 3/90 (3%)

Query: 313 PHETSDICNPTDVQIVFMQNEDINENSTE-CFLSEVPHEMSDICNPTNVEIVFVQNEDIN 371
           PHE S+I N    QIV M  +    N  E C     P E   + N   +     +NE + 
Sbjct: 872 PHERSEIINKLAGQIVKMSQQKFASNVVEKCLTFGGPEERQVLVN--EMLGYTDENEPLQ 929

Query: 372 ENSTECFLNEVLHEMSDICNPTNVEIVFAQ 401
               + F N V+ ++ + C+  ++ ++ ++
Sbjct: 930 AMMKDPFGNYVVQKVLETCDDQSLALILSR 959


>At3g55990.1 68416.m06221 expressed protein contains Pfam profile
           PF03005: Arabidopsis proteins of unknown function
          Length = 487

 Score = 30.3 bits (65), Expect = 7.7
 Identities = 17/74 (22%), Positives = 33/74 (44%), Gaps = 4/74 (5%)

Query: 223 VFMQNEDINENSTECFLSEVPHETSDICNP-TNVQIVSVQDENINENSTECFL---NEVT 278
           +FM NED+   +   F +  PH+  D   P T +  + VQ+   N +  +  +   + V 
Sbjct: 39  IFMYNEDVKSIAEFPFSTSKPHDVHDEATPITEITTLPVQESIKNSDPIQESIKNADSVQ 98

Query: 279 HETSDICNPMNPQI 292
               D+  P+  ++
Sbjct: 99  DSVKDVAEPVQEEV 112


>At1g45616.1 68414.m05200 leucine-rich repeat family protein
           contains leucine rich-repeat (LRR) domains Pfam:PF00560,
           INTERPRO:IPR001611; similar to disease resistance
           protein [Lycopersicon esculentum] gi|3894383|gb|AAC78591
          Length = 994

 Score = 30.3 bits (65), Expect = 7.7
 Identities = 27/115 (23%), Positives = 47/115 (40%), Gaps = 11/115 (9%)

Query: 225 MQNEDINENSTECFLSEVPHETSDICNPTNVQIVSVQDENINENSTECFLNEVTHETSDI 284
           + N  ++EN+   F+ E+P   S + N   + +  V D N+N N     LN       DI
Sbjct: 328 LSNLVLSENN---FVGEIP---SSVSNLKQLTLFDVSDNNLNGNFPSSLLNLNQLRYIDI 381

Query: 285 CNP-----MNPQIVFVQNEDINGNSTECFLNEVPHETSDICNPTDVQIVFMQNED 334
           C+      + P I  + N +        F   +P    +I + T + + + Q  D
Sbjct: 382 CSNHFTGFLPPTISQLSNLEFFSACDNSFTGSIPSSLFNISSLTTLGLSYNQLND 436


  Database: arabidopsis
    Posted date:  Oct 3, 2007  3:31 PM
  Number of letters in database: 12,070,560
  Number of sequences in database:  28,952
  
Lambda     K      H
   0.311    0.128    0.362 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 26,075,934
Number of Sequences: 28952
Number of extensions: 1105836
Number of successful extensions: 2752
Number of sequences better than 10.0: 16
Number of HSP's better than 10.0 without gapping: 1
Number of HSP's successfully gapped in prelim test: 15
Number of HSP's that attempted gapping in prelim test: 2739
Number of HSP's gapped (non-prelim): 32
length of query: 1137
length of database: 12,070,560
effective HSP length: 89
effective length of query: 1048
effective length of database: 9,493,832
effective search space: 9949535936
effective search space used: 9949535936
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 42 (21.8 bits)
S2: 65 (30.3 bits)
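
The bit scores and Expect values in the alignments above can be cross-checked against the statistical parameters in this footer. The following is a minimal sketch, not part of the BLAST output: it assumes the usual Karlin-Altschul relations S' = (lambda*S - ln K) / ln 2 and E = (effective search space) * 2^(-S'), and it assumes the gapped Lambda and K and the "effective search space used" value are the ones applied to the reported alignments. The function and constant names are hypothetical.

```python
import math

# Values copied from the statistics footer of this report.
GAPPED_LAMBDA = 0.279
GAPPED_K = 0.0580
EFFECTIVE_SEARCH_SPACE = 9_949_535_936


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: S' = (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=EFFECTIVE_SEARCH_SPACE):
    """Expected number of chance hits: E = search_space * 2^(-S')."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    # Raw scores are the numbers in parentheses on the "Score =" lines.
    for raw in (84, 78, 65):
        bits = bit_score(raw)
        print(f"raw={raw}  bits={bits:.1f}  E={expect_value(bits):.2g}")
```

Under these assumptions, raw scores of 84 and 78 should land close to the reported 37.9 bits (Expect 0.038) and 35.5 bits (Expect 0.20). Small differences are expected because the report prints Lambda and K to only three significant figures.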

- SilkBase 1999-2023 -