SilkBase

Last updated: 2022/11/18
BLASTX 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= an--0336
         (746 letters)

Database: arabidopsis 
           28,952 sequences; 12,070,560 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

At4g21710.1 68417.m03144 DNA-directed RNA polymerase II 135 kDa ...   367   e-102
At5g45140.1 68418.m05542 DNA-directed RNA polymerase, putative s...   232   2e-61
At3g23780.1 68416.m02989 DNA-directed RNA polymerase family prot...   185   2e-47
At3g18090.1 68416.m02300 DNA-directed RNA polymerase family prot...   185   2e-47
At1g29940.1 68414.m03658 DNA-directed RNA polymerase family prot...   181   5e-46
At1g72040.1 68414.m08327 deoxynucleoside kinase family contains ...    31   0.61 
At1g70020.1 68414.m08058 hypothetical protein                          30   1.9  
At4g13150.1 68417.m02047 expressed protein                             29   4.3  
At3g46640.1 68416.m05063 myb family transcription factor contain...    29   4.3  
At1g02910.1 68414.m00258 tetratricopeptide repeat (TPR)-containi...    29   4.3  
At5g58350.1 68418.m07306 protein kinase family protein contains ...    28   5.7  
At3g07150.1 68416.m00852 hypothetical protein                          28   5.7  
At5g58880.1 68418.m07377 hypothetical protein                          28   7.6  
At1g73875.1 68414.m08555 endonuclease/exonuclease/phosphatase fa...    28   7.6  
At1g15660.1 68414.m01880 expressed protein similar to CENPCA pro...    28   7.6  
At1g64810.1 68414.m07348 expressed protein contains Pfam PF05634...    27   10.0 
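
The summary table above can be read programmatically. The following is a minimal sketch, not part of BLAST itself: `parse_hit_line` and `HIT_LINE` are hypothetical names, and the regular expression assumes the layout shown here (subject ID, free-text description, bit score, E-value). Note that this BLAST version prints very small E-values without a leading digit (`e-102`), which Python's `float` does not accept as written.

```python
import re

# Hypothetical helper: subject ID, description, bit score, E-value.
HIT_LINE = re.compile(r"^(\S+)\s+(.*?)\s+(\d+(?:\.\d+)?)\s+(\S+)\s*$")

def parse_hit_line(line):
    """Parse one row of the 'Sequences producing significant alignments' table."""
    m = HIT_LINE.match(line)
    if m is None:
        return None
    subject_id, description, bits, evalue = m.groups()
    # The report writes "e-102" for 1e-102; restore the leading digit.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return subject_id, description, float(bits), float(evalue)

top = ("At4g21710.1 68417.m03144 DNA-directed RNA polymerase II 135 kDa ..."
       "   367   e-102")
weak = "At1g70020.1 68414.m08058 hypothetical protein" + " " * 26 + "30   1.9  "
print(parse_hit_line(top))
print(parse_hit_line(weak))
```

The description is kept truncated exactly as it appears in the table; the full text is available from the `>` header of each alignment record.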

>At4g21710.1 68417.m03144 DNA-directed RNA polymerase II 135 kDa
            polypeptide / RNA polymerase II subunit 2 (RPB135) (RPB2)
            (RP140) identical to SP|P38420 DNA-directed RNA
            polymerase II 135 kDa polypeptide (EC 2.7.7.6) (RNA
            polymerase II subunit 2) {Arabidopsis thaliana}
          Length = 1188

 Score =  367 bits (904), Expect = e-102
 Identities = 167/227 (73%), Positives = 189/227 (83%)
 Frame = +3

Query: 3    GIQYRQEDMPFTCEGITPDIIINPHAIPSRMTIGHLIECIQGKVSSNKGEIGDATPFNDA 182
            G+ Y QEDMP+T EG+TPDII+NPHAIPSRMTIG LIECI GKV+++ G+ GDATPF D 
Sbjct: 952  GMTYTQEDMPWTIEGVTPDIIVNPHAIPSRMTIGQLIECIMGKVAAHMGKEGDATPFTD- 1010

Query: 183  VNVQKISSLLQDYGYHLRGNEVMYNGHTGRKINAQVFLGPTYYQRLKHMVDDKIHSRARG 362
            V V  IS  L   GY +RG E MYNGHTGR + A +FLGPTYYQRLKHMVDDKIHSR RG
Sbjct: 1011 VTVDNISKALHKCGYQMRGFERMYNGHTGRPLTAMIFLGPTYYQRLKHMVDDKIHSRGRG 1070

Query: 363  PVQILVRQPMEGRARDGGLRFGEMERDCQIAHGAAQFLRERLFEVSDPYRIHVCNFCGLI 542
            PVQIL RQP EGR+RDGGLRFGEMERDC IAHGAA FL+ERLF+ SD YR+HVC  CGLI
Sbjct: 1071 PVQILTRQPAEGRSRDGGLRFGEMERDCMIAHGAAHFLKERLFDQSDAYRVHVCEVCGLI 1130

Query: 543  AIANLRNNTFECKGCKNKTQISQIRLPYAAKLLFQELMSMNIAPRLM 683
            AIANL+ N+FEC+GCKNKT I Q+ +PYA KLLFQELMSM IAPR++
Sbjct: 1131 AIANLKKNSFECRGCKNKTDIVQVYIPYACKLLFQELMSMAIAPRML 1177
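
In a BLASTX alignment the Query coordinates are nucleotide positions while the Sbjct coordinates are amino-acid positions. The sketch below shows how the `Frame` value and the number of translated query residues relate to the Query coordinates. The function names are hypothetical, and the frame convention (1-based positions, minus frames counted from the 3' end of the 746-letter query) is an assumption that agrees with the records in this report.

```python
def blastx_frame(q_start, q_end, query_len):
    """Reading frame implied by 1-based nucleotide coordinates of an HSP."""
    if q_start <= q_end:
        # Plus strand: frame is set by the offset of the first codon.
        return (q_start - 1) % 3 + 1
    # Minus strand: q_start is the higher coordinate; count from the 3' end.
    return -((query_len - q_start) % 3 + 1)

def translated_length(q_start, q_end):
    """Number of query codons (amino acids) spanned by the HSP."""
    return (abs(q_end - q_start) + 1) // 3

QUERY_LEN = 746  # from "(746 letters)" in the report header

# Top hit: Query 3..683, reported as Frame = +3 with 227 aligned positions.
print(blastx_frame(3, 683, QUERY_LEN), translated_length(3, 683))
# A minus-strand hit later in the report: Query 513..409, Frame = -3.
print(blastx_frame(513, 409, QUERY_LEN), translated_length(513, 409))
```

For ungapped alignments such as the one above, the translated length equals the denominator of the `Identities` line; when the query row contains gap characters, the denominator is larger by the number of gap columns in the query row.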


>At5g45140.1 68418.m05542 DNA-directed RNA polymerase, putative
            similar to SP|P22276 DNA-directed RNA polymerase III 130
            kDa polypeptide (EC 2.7.7.6) (RNA polymerase III subunit
            2) {Saccharomyces cerevisiae}; contains Pfam profiles
            PF04560: RNA polymerase Rpb2 domain 7, PF04561: RNA
            polymerase Rpb2 domain 2, PF04565: RNA polymerase Rpb2
            domain 3, PF04566: RNA polymerase Rpb2 domain 4, PF04567:
            RNA polymerase Rpb2 domain 5
          Length = 1150

 Score =  232 bits (567), Expect = 2e-61
 Identities = 108/230 (46%), Positives = 150/230 (65%), Gaps = 4/230 (1%)
 Frame = +3

Query: 3    GIQYRQEDMPFTCEGITPDIIINPHAIPSRMTIGHLIECIQGKVSSNKGEIGDATPFND- 179
            GI  +QED PF+  GI PD+I+NPH  PSRMT+G +IE +  K   + G     + F + 
Sbjct: 916  GIIIQQEDFPFSELGICPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGER 975

Query: 180  ---AVNVQKISSLLQDYGYHLRGNEVMYNGHTGRKINAQVFLGPTYYQRLKHMVDDKIHS 350
               A  V+ IS+ L + G+   G +++Y+G +G  + A +F+GP YYQ+LKHMV DK+H+
Sbjct: 976  SGHADKVETISATLVEKGFSYSGKDLLYSGISGEPVEAYIFMGPIYYQKLKHMVLDKMHA 1035

Query: 351  RARGPVQILVRQPMEGRARDGGLRFGEMERDCQIAHGAAQFLRERLFEVSDPYRIHVCNF 530
            R  GP  ++ RQP EG++++GGLR GEMERDC IA+GA+  + ERL   SDP+ + VC  
Sbjct: 1036 RGSGPRVMMTRQPTEGKSKNGGLRVGEMERDCLIAYGASMLIYERLMISSDPFEVQVCRA 1095

Query: 531  CGLIAIANLRNNTFECKGCKNKTQISQIRLPYAAKLLFQELMSMNIAPRL 680
            CGL+   N +     C  CKN   I+ ++LPYA KLLFQEL SMN+ PRL
Sbjct: 1096 CGLLGYYNYKLKKAVCTTCKNGDNIATMKLPYACKLLFQELQSMNVVPRL 1145


>At3g23780.1 68416.m02989 DNA-directed RNA polymerase family protein
            similar to SP|P38420 DNA-directed RNA polymerase II 135
            kDa polypeptide (EC 2.7.7.6) (RNA polymerase II subunit
            2) {Arabidopsis thaliana}; contains Pfam profiles
            PF04560: RNA polymerase Rpb2 domain 7, PF04561: RNA
            polymerase Rpb2 domain 2, PF04565: RNA polymerase Rpb2
            domain 3, PF04566: RNA polymerase Rpb2 domain 4, PF04567:
            RNA polymerase Rpb2 domain 5
          Length = 946

 Score =  185 bits (451), Expect = 2e-47
 Identities = 101/240 (42%), Positives = 137/240 (57%), Gaps = 18/240 (7%)
 Frame = +3

Query: 3    GIQYRQEDMPFTCEGITPDIIINPHAIPSRMTIGHLIECI-----------QGKVSSNKG 149
            G    Q++ PFT +GI PDI+INPHA PSR T G L+E             +G  ++   
Sbjct: 699  GYLEEQQNFPFTIQGIVPDIVINPHAFPSRQTPGQLLEAALSKGIACPIQKEGSSAAYTK 758

Query: 150  EIGDATPFNDAVNVQKISSLLQDYGYHLRGNEVMYNGHTGRKINAQVFLGPTYYQRLKHM 329
                ATPF+    V +I+  L   G+   GNE +YNG +G  + + +F+GPT+YQRL HM
Sbjct: 759  LTRHATPFSTP-GVTEITEQLHRAGFSRWGNERVYNGRSGEMMRSMIFMGPTFYQRLVHM 817

Query: 330  VDDKIHSRARGPVQILVRQPMEGRARDGGLRFGEMERDCQIAHGAAQFLRERLFEVSDPY 509
             +DK+  R  GPV  L RQP+  R R GG++FGEMERDC IAHGA+  L ERLF +SD  
Sbjct: 818  SEDKVKFRNTGPVHPLTRQPVADRKRFGGIKFGEMERDCLIAHGASANLHERLFTLSDSS 877

Query: 510  RIHVCNFCGLIAIANLRNNTF-------ECKGCKNKTQISQIRLPYAAKLLFQELMSMNI 668
            ++H+C  C   A    R  +         C+ C +   + ++ +PY AKLL QEL SM I
Sbjct: 878  QMHICRKCKTYANVIERTPSSGRKIRGPYCRVCVSSDHVVRVYVPYGAKLLCQELFSMGI 937


>At3g18090.1 68416.m02300 DNA-directed RNA polymerase family protein
            similar to SP|P38420 DNA-directed RNA polymerase II 135
            kDa polypeptide (EC 2.7.7.6) (RNA polymerase II subunit
            2) {Arabidopsis thaliana}; contains Pfam profiles
            PF04560: RNA polymerase Rpb2 domain 7, PF04561: RNA
            polymerase Rpb2 domain 2, PF04565: RNA polymerase Rpb2
            domain 3, PF04566: RNA polymerase Rpb2 domain 4, PF04567:
            RNA polymerase Rpb2 domain 5
          Length = 1038

 Score =  185 bits (451), Expect = 2e-47
 Identities = 102/241 (42%), Positives = 137/241 (56%), Gaps = 19/241 (7%)
 Frame = +3

Query: 3    GIQYRQEDMPFTCEGITPDIIINPHAIPSRMTIGHLIECIQGK-----VSSNKGEIG--- 158
            G    Q++ PFT +GI PDI+INPHA PSR T G L+E    K     +   +G      
Sbjct: 790  GYLEEQQNFPFTIQGIVPDIVINPHAFPSRQTPGQLLEAALSKGIACPIQKKEGSSAAYT 849

Query: 159  ----DATPFNDAVNVQKISSLLQDYGYHLRGNEVMYNGHTGRKINAQVFLGPTYYQRLKH 326
                 ATPF+    V +I+  L   G+   GNE +YNG +G  + + +F+GPT+YQRL H
Sbjct: 850  KLTRHATPFSTP-GVTEITEQLHRAGFSRWGNERVYNGRSGEMMRSLIFMGPTFYQRLVH 908

Query: 327  MVDDKIHSRARGPVQILVRQPMEGRARDGGLRFGEMERDCQIAHGAAQFLRERLFEVSDP 506
            M ++K+  R  GPV  L RQP+  R R GG+RFGEMERDC IAHGA+  L ERLF +SD 
Sbjct: 909  MSENKVKFRNTGPVHPLTRQPVADRKRFGGIRFGEMERDCLIAHGASANLHERLFTLSDS 968

Query: 507  YRIHVCNFCGLIAIANLRNNTF-------ECKGCKNKTQISQIRLPYAAKLLFQELMSMN 665
             ++H+C  C   A    R  +         C+ C +   + ++ +PY AKLL QEL SM 
Sbjct: 969  SQMHICRKCKTYANVIERTPSSGRKIRGPYCRVCASSDHVVRVYVPYGAKLLCQELFSMG 1028

Query: 666  I 668
            I
Sbjct: 1029 I 1029


>At1g29940.1 68414.m03658 DNA-directed RNA polymerase family protein
            similar to SP|P22138 DNA-directed RNA polymerase I 135
            kDa polypeptide (EC 2.7.7.6) (RNA polymerase I subunit 2)
            {Saccharomyces cerevisiae}; contains Pfam profiles
            PF04563; RNA polymerase beta subunit, PF04560: RNA
            polymerase Rpb2 domain 7, PF04561: RNA polymerase Rpb2
            domain 2, PF04565: RNA polymerase Rpb2 domain 3, PF00562:
            RNA polymerase Rpb2 domain 6
          Length = 1114

 Score =  181 bits (440), Expect = 5e-46
 Identities = 101/254 (39%), Positives = 138/254 (54%), Gaps = 33/254 (12%)
 Frame = +3

Query: 24   DMPFT-CEGITPDIIINPHAIPSRMTIGHLIECIQGKVSSNKGEIGDATPFNDAVN---- 188
            DMPF    G+ PD+IINPHA PSRMTI  L+E I  K  S  G+  DATPF DAV     
Sbjct: 853  DMPFNGVTGMRPDLIINPHAFPSRMTIAMLLESIAAKGGSLHGKFVDATPFRDAVKKTNG 912

Query: 189  ---------VQKISSLLQDYGYHLRGNEVMYNGHTGRKINAQVFLGPTYYQRLKHMVDDK 341
                     V  + S+L++ G++  G E +Y+G+ G ++  ++F+GP YYQRL+HMV DK
Sbjct: 913  EEESKSSLLVDDLGSMLKEKGFNHYGTETLYSGYLGVELKCEIFMGPVYYQRLRHMVSDK 972

Query: 342  IHSRARGPVQILVRQPMEGRARDGGLRFGEMERDCQIAHGAAQFLRERLFEVSDPYRIHV 521
               R+ G V  L  QP++GR R GG+RFGEMERD  +AHGA+  L +RL   SD +   V
Sbjct: 973  FQVRSTGQVDQLTHQPIKGRKRGGGIRFGEMERDSLLAHGASYLLHDRLHTSSDHHIADV 1032

Query: 522  CNFCGLIAIANLRN-------------------NTFECKGCKNKTQISQIRLPYAAKLLF 644
            C+ CG +  +++ N                       C  CK    +  + +PY  + L 
Sbjct: 1033 CSLCGSLLTSSVVNVQQKKLIQEIGKLPPGRTPKKVTCYSCKTSKGMETVAMPYVFRYLA 1092

Query: 645  QELMSMNIAPRLMV 686
             EL SMNI   L +
Sbjct: 1093 AELASMNIKMTLQL 1106
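
The percentages on the `Identities` line of each record can be recomputed from the counts. In this report they look truncated rather than rounded (for example, 33/254 is about 12.99% but is printed as 12%). The sketch below parses such a line and checks that reading; `STAT`, `parse_stats`, and `truncated_percent` are hypothetical names, and truncation is an inference from the numbers shown here, not a documented rule.

```python
import re

# Matches "Identities = 101/254 (39%)" and the Positives/Gaps equivalents.
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")

def parse_stats(line):
    """Return {name: (count, alignment_length, printed_percent)}."""
    return {name: (int(num), int(den), int(pct))
            for name, num, den, pct in STAT.findall(line)}

def truncated_percent(count, length):
    """Integer percentage with the fractional part dropped."""
    return 100 * count // length

line = " Identities = 101/254 (39%), Positives = 138/254 (54%), Gaps = 33/254 (12%)"
for name, (count, length, printed) in parse_stats(line).items():
    print(name, count, length, printed, truncated_percent(count, length))
```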


>At1g72040.1 68414.m08327 deoxynucleoside kinase family contains
           Pfam profile: PF01712 deoxynucleoside kinase
          Length = 580

 Score = 31.5 bits (68), Expect = 0.61
 Identities = 16/39 (41%), Positives = 21/39 (53%)
 Frame = +1

Query: 346 ILELEVQYKF**DSLWKVGQEMVDYVLVKWNVIVRLHTE 462
           + EL+  YK   D  WK  Q+MVDY+     +I R H E
Sbjct: 202 VAELKKLYK---DKFWKASQKMVDYLRSSVGIIHRNHAE 237


>At1g70020.1 68414.m08058 hypothetical protein
          Length = 225

 Score = 29.9 bits (64), Expect = 1.9
 Identities = 14/40 (35%), Positives = 24/40 (60%)
 Frame = +1

Query: 427 VKWNVIVRLHTELPNFYVNVYLKYQILTVYTCVISAA*SL 546
           VKW++++RL ++LP +Y+ +    Q   +Y  V  A  SL
Sbjct: 92  VKWDLVIRLPSDLPGYYMCLKGDLQTFILYKGVTIANSSL 131


>At4g13150.1 68417.m02047 expressed protein 
          Length = 119

 Score = 28.7 bits (61), Expect = 4.3
 Identities = 12/40 (30%), Positives = 22/40 (55%)
 Frame = +3

Query: 144 KGEIGDATPFNDAVNVQKISSLLQDYGYHLRGNEVMYNGH 263
           KG  G  +PF +  ++ K+SS  + + ++  GN  M  G+
Sbjct: 50  KGSGGGGSPFEEVKSLAKVSSSSEGFTFNAFGNRFMIPGN 89


>At3g46640.1 68416.m05063 myb family transcription factor contains
           Pfam profile: PF00249 myb-like DNA-binding domain
          Length = 323

 Score = 28.7 bits (61), Expect = 4.3
 Identities = 14/38 (36%), Positives = 21/38 (55%), Gaps = 3/38 (7%)
 Frame = -3

Query: 513 YGKDLILQINV---HVKIGQLRVQSDNHVPFHQNVIHH 409
           YG   ++Q+ V   H+ +     Q+ NH P+HQN  HH
Sbjct: 248 YGTQQMMQMPVYAHHMGMQGYHHQNHNHDPYHQNHRHH 285


>At1g02910.1 68414.m00258 tetratricopeptide repeat (TPR)-containing
           protein contains Pfam profile PF00515: TPR Domain
          Length = 453

 Score = 28.7 bits (61), Expect = 4.3
 Identities = 14/44 (31%), Positives = 21/44 (47%)
 Frame = -2

Query: 499 DTSNKRSRKNWAAPCAI*QSRSISPKRNPPSLALPSIGCLTRIC 368
           D +   SR+NW +   +  S S SP  +PPS   P+      +C
Sbjct: 35  DATLFNSRRNWDSHLFVYASSSSSPSSSPPSPNSPTDDLTAELC 78


>At5g58350.1 68418.m07306 protein kinase family protein contains
           protein kinase domain, Pfam:PF00069
          Length = 571

 Score = 28.3 bits (60), Expect = 5.7
 Identities = 21/73 (28%), Positives = 32/73 (43%), Gaps = 3/73 (4%)
 Frame = +3

Query: 396 GRARDGGLRFGEMERDCQIAH---GAAQFLRERLFEVSDPYRIHVCNFCGLIAIANLRNN 566
           G+ + G L    M RDC  AH   G  +F+   L+E +    I V +F G+     +  +
Sbjct: 160 GQVKIGDLGLARMLRDCHSAHSIIGTPEFMAPELYEENYNELIDVYSF-GM-CFLEMITS 217

Query: 567 TFECKGCKNKTQI 605
            F    C +  QI
Sbjct: 218 EFPYSECNHPAQI 230


>At3g07150.1 68416.m00852 hypothetical protein
          Length = 199

 Score = 28.3 bits (60), Expect = 5.7
 Identities = 17/50 (34%), Positives = 22/50 (44%)
 Frame = +3

Query: 414 GLRFGEMERDCQIAHGAAQFLRERLFEVSDPYRIHVCNFCGLIAIANLRN 563
           GL  G+ E D  + HG    L  +    SDP     CN  GLI    L++
Sbjct: 78  GLSTGKHEADALLVHGKLSKLGTKRAR-SDPLEDFACNDLGLIKTKKLKD 126


>At5g58880.1 68418.m07377 hypothetical protein
          Length = 1088

 Score = 27.9 bits (59), Expect = 7.6
 Identities = 10/27 (37%), Positives = 15/27 (55%)
 Frame = -2

Query: 502  SDTSNKRSRKNWAAPCAI*QSRSISPK 422
            +DT N +  + W   C I  S+ ISP+
Sbjct: 955  ADTQNSQDSQTWTQQCGIDSSQGISPR 981


>At1g73875.1 68414.m08555 endonuclease/exonuclease/phosphatase
           family protein contains Pfam profile PF03372:
           Endonuclease/Exonuclease/phosphatase family
          Length = 454

 Score = 27.9 bits (59), Expect = 7.6
 Identities = 15/42 (35%), Positives = 21/42 (50%)
 Frame = +3

Query: 294 LGPTYYQRLKHMVDDKIHSRARGPVQILVRQPMEGRARDGGL 419
           L  TY+ R    VD   H++   PV++L   P +   R GGL
Sbjct: 390 LATTYHSRFLGTVDYIWHTKELVPVRVLETLPADVLRRTGGL 431


>At1g15660.1 68414.m01880 expressed protein similar to CENPCA
           protein (GI:11863170) {Zea mays}
          Length = 705

 Score = 27.9 bits (59), Expect = 7.6
 Identities = 27/118 (22%), Positives = 49/118 (41%)
 Frame = +3

Query: 84  PSRMTIGHLIECIQGKVSSNKGEIGDATPFNDAVNVQKISSLLQDYGYHLRGNEVMYNGH 263
           PS + +  + + I     +N G +  A+PFND+V V++      D   H      ++  H
Sbjct: 350 PSEVNVQPIAKDIPNTSPTNVGTVDVASPFNDSV-VKRSGE--DDSHIH----SGIHRSH 402

Query: 264 TGRKINAQVFLGPTYYQRLKHMVDDKIHSRARGPVQILVRQPMEGRARDGGLRFGEME 437
             R  N  + +  +   R   M+   +  R +G  ++ V     G  R+ G R  + E
Sbjct: 403 LSRDGNPDICVMDSISNRSSAMLQKNVDMRTKGK-EVDVPMSESGANRNTGDRENDAE 459


>At1g64810.1 68414.m07348 expressed protein contains Pfam PF05634:
           Arabidopsis thaliana protein of unknown function
           (DUF794)
          Length = 436

 Score = 27.5 bits (58), Expect = 10.0
 Identities = 9/27 (33%), Positives = 16/27 (59%)
 Frame = +3

Query: 513 IHVCNFCGLIAIANLRNNTFECKGCKN 593
           +  C+ CG + +AN+ +N  +C G  N
Sbjct: 153 VFACSECGAVHVANVGHNIRDCNGPTN 179


  Database: arabidopsis
    Posted date:  Oct 4, 2007 10:56 AM
  Number of letters in database: 12,070,560
  Number of sequences in database:  28,952
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 
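
The gapped Lambda and K values above connect the raw scores (shown in parentheses in each `Score =` line) to the bit scores through the standard Karlin-Altschul normalization, S' = (Lambda * S - ln K) / ln 2. The sketch below applies it to the low-scoring hits, where the report prints one decimal place. The constant and function names are illustrative, and the printed Lambda and K are themselves rounded, so small discrepancies are possible.

```python
import math

GAPPED_LAMBDA = 0.279   # from the "Gapped" statistics block
GAPPED_K = 0.0580

def bit_score(raw_score):
    """Convert a raw alignment score to a bit score."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)

# Raw scores 68, 64 and 61 are listed in the report as 31.5, 29.9 and 28.7 bits.
for raw in (68, 64, 61):
    print(raw, round(bit_score(raw), 1))
```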


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 16,317,742
Number of Sequences: 28952
Number of extensions: 349501
Number of successful extensions: 930
Number of sequences better than 10.0: 16
Number of HSP's better than 10.0 without gapping: 881
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 923
length of database: 12,070,560
effective HSP length: 79
effective length of database: 9,783,352
effective search space used: 1653386488
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
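
The length figures in the statistics block are related by simple arithmetic, sketched below. The assumptions are: the effective query length is the translated query length (746 // 3) minus the effective HSP length, and the effective database length subtracts one effective HSP length per sequence. These assumptions reproduce the reported effective database length and search space, but they are inferred from the numbers here, not taken from the BLAST source. The `expect` function uses E = K * space * exp(-Lambda * S); with the reported search space it gives roughly 0.55 for the raw score 68 hit, close to but not identical to the printed 0.61, so treat it as an estimate only.

```python
import math

QUERY_NT = 746            # "(746 letters)"
DB_LETTERS = 12_070_560   # "length of database"
DB_SEQS = 28_952          # "Number of Sequences"
HSP_LEN = 79              # "effective HSP length"
GAPPED_LAMBDA, GAPPED_K = 0.279, 0.0580

def effective_search_space():
    """Return (effective query length, effective db length, search space)."""
    eff_query = QUERY_NT // 3 - HSP_LEN
    eff_db = DB_LETTERS - DB_SEQS * HSP_LEN
    return eff_query, eff_db, eff_query * eff_db

def expect(raw_score, space):
    """Approximate E-value for a raw score in a given search space."""
    return GAPPED_K * space * math.exp(-GAPPED_LAMBDA * raw_score)

eff_query, eff_db, space = effective_search_space()
print(eff_query, eff_db, space)
print(expect(68, space))
```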

- SilkBase 1999-2023 -