BLASTX 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= an--0336
         (746 letters)

Database: arabidopsis
           28,952 sequences; 12,070,560 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

At4g21710.1 68417.m03144 DNA-directed RNA polymerase II 135 kDa ...   367   e-102
At5g45140.1 68418.m05542 DNA-directed RNA polymerase, putative s...   232   2e-61
At3g23780.1 68416.m02989 DNA-directed RNA polymerase family prot...   185   2e-47
At3g18090.1 68416.m02300 DNA-directed RNA polymerase family prot...   185   2e-47
At1g29940.1 68414.m03658 DNA-directed RNA polymerase family prot...   181   5e-46
At1g72040.1 68414.m08327 deoxynucleoside kinase family contains ...    31   0.61
At1g70020.1 68414.m08058 hypothetical protein                          30   1.9
At4g13150.1 68417.m02047 expressed protein                             29   4.3
At3g46640.1 68416.m05063 myb family transcription factor contain...    29   4.3
At1g02910.1 68414.m00258 tetratricopeptide repeat (TPR)-containi...    29   4.3
At5g58350.1 68418.m07306 protein kinase family protein contains ...    28   5.7
At3g07150.1 68416.m00852 hypothetical protein                          28   5.7
At5g58880.1 68418.m07377 hypothetical protein                          28   7.6
At1g73875.1 68414.m08555 endonuclease/exonuclease/phosphatase fa...    28   7.6
At1g15660.1 68414.m01880 expressed protein similar to CENPCA pro...    28   7.6
At1g64810.1 68414.m07348 expressed protein contains Pfam PF05634...
27 10.0 >At4g21710.1 68417.m03144 DNA-directed RNA polymerase II 135 kDa polypeptide / RNA polymerase II subunit 2 (RPB135) (RPB2) (RP140) identical to SP|P38420 DNA-directed RNA polymerase II 135 kDa polypeptide (EC 2.7.7.6) (RNA polymerase II subunit 2) {Arabidopsis thaliana} Length = 1188 Score = 367 bits (904), Expect = e-102 Identities = 167/227 (73%), Positives = 189/227 (83%) Frame = +3 Query: 3 GIQYRQEDMPFTCEGITPDIIINPHAIPSRMTIGHLIECIQGKVSSNKGEIGDATPFNDA 182 G+ Y QEDMP+T EG+TPDII+NPHAIPSRMTIG LIECI GKV+++ G+ GDATPF D Sbjct: 952 GMTYTQEDMPWTIEGVTPDIIVNPHAIPSRMTIGQLIECIMGKVAAHMGKEGDATPFTD- 1010 Query: 183 VNVQKISSLLQDYGYHLRGNEVMYNGHTGRKINAQVFLGPTYYQRLKHMVDDKIHSRARG 362 V V IS L GY +RG E MYNGHTGR + A +FLGPTYYQRLKHMVDDKIHSR RG Sbjct: 1011 VTVDNISKALHKCGYQMRGFERMYNGHTGRPLTAMIFLGPTYYQRLKHMVDDKIHSRGRG 1070 Query: 363 PVQILVRQPMEGRARDGGLRFGEMERDCQIAHGAAQFLRERLFEVSDPYRIHVCNFCGLI 542 PVQIL RQP EGR+RDGGLRFGEMERDC IAHGAA FL+ERLF+ SD YR+HVC CGLI Sbjct: 1071 PVQILTRQPAEGRSRDGGLRFGEMERDCMIAHGAAHFLKERLFDQSDAYRVHVCEVCGLI 1130 Query: 543 AIANLRNNTFECKGCKNKTQISQIRLPYAAKLLFQELMSMNIAPRLM 683 AIANL+ N+FEC+GCKNKT I Q+ +PYA KLLFQELMSM IAPR++ Sbjct: 1131 AIANLKKNSFECRGCKNKTDIVQVYIPYACKLLFQELMSMAIAPRML 1177 >At5g45140.1 68418.m05542 DNA-directed RNA polymerase, putative similar to SP|P22276 DNA-directed RNA polymerase III 130 kDa polypeptide (EC 2.7.7.6) (RNA polymerase III subunit 2) {Saccharomyces cerevisiae}; contains Pfam profiles PF04560: RNA polymerase Rpb2 domain 7, PF04561: RNA polymerase Rpb2 domain 2, PF04565: RNA polymerase Rpb2 domain 3, PF04566: RNA polymerase Rpb2 domain 4, PF04567: RNA polymerase Rpb2 domain 5 Length = 1150 Score = 232 bits (567), Expect = 2e-61 Identities = 108/230 (46%), Positives = 150/230 (65%), Gaps = 4/230 (1%) Frame = +3 Query: 3 GIQYRQEDMPFTCEGITPDIIINPHAIPSRMTIGHLIECIQGKVSSNKGEIGDATPFND- 179 GI +QED PF+ GI PD+I+NPH PSRMT+G +IE + K + G + F + Sbjct: 916 GIIIQQEDFPFSELGICPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGER 975 Query: 180 
---AVNVQKISSLLQDYGYHLRGNEVMYNGHTGRKINAQVFLGPTYYQRLKHMVDDKIHS 350 A V+ IS+ L + G+ G +++Y+G +G + A +F+GP YYQ+LKHMV DK+H+ Sbjct: 976 SGHADKVETISATLVEKGFSYSGKDLLYSGISGEPVEAYIFMGPIYYQKLKHMVLDKMHA 1035 Query: 351 RARGPVQILVRQPMEGRARDGGLRFGEMERDCQIAHGAAQFLRERLFEVSDPYRIHVCNF 530 R GP ++ RQP EG++++GGLR GEMERDC IA+GA+ + ERL SDP+ + VC Sbjct: 1036 RGSGPRVMMTRQPTEGKSKNGGLRVGEMERDCLIAYGASMLIYERLMISSDPFEVQVCRA 1095 Query: 531 CGLIAIANLRNNTFECKGCKNKTQISQIRLPYAAKLLFQELMSMNIAPRL 680 CGL+ N + C CKN I+ ++LPYA KLLFQEL SMN+ PRL Sbjct: 1096 CGLLGYYNYKLKKAVCTTCKNGDNIATMKLPYACKLLFQELQSMNVVPRL 1145 >At3g23780.1 68416.m02989 DNA-directed RNA polymerase family protein similar to SP|P38420 DNA-directed RNA polymerase II 135 kDa polypeptide (EC 2.7.7.6) (RNA polymerase II subunit 2) {Arabidopsis thaliana}; contains Pfam profiles PF04560: RNA polymerase Rpb2 domain 7, PF04561: RNA polymerase Rpb2 domain 2, PF04565: RNA polymerase Rpb2 domain 3, PF04566: RNA polymerase Rpb2 domain 4, PF04567: RNA polymerase Rpb2 domain 5 Length = 946 Score = 185 bits (451), Expect = 2e-47 Identities = 101/240 (42%), Positives = 137/240 (57%), Gaps = 18/240 (7%) Frame = +3 Query: 3 GIQYRQEDMPFTCEGITPDIIINPHAIPSRMTIGHLIECI-----------QGKVSSNKG 149 G Q++ PFT +GI PDI+INPHA PSR T G L+E +G ++ Sbjct: 699 GYLEEQQNFPFTIQGIVPDIVINPHAFPSRQTPGQLLEAALSKGIACPIQKEGSSAAYTK 758 Query: 150 EIGDATPFNDAVNVQKISSLLQDYGYHLRGNEVMYNGHTGRKINAQVFLGPTYYQRLKHM 329 ATPF+ V +I+ L G+ GNE +YNG +G + + +F+GPT+YQRL HM Sbjct: 759 LTRHATPFSTP-GVTEITEQLHRAGFSRWGNERVYNGRSGEMMRSMIFMGPTFYQRLVHM 817 Query: 330 VDDKIHSRARGPVQILVRQPMEGRARDGGLRFGEMERDCQIAHGAAQFLRERLFEVSDPY 509 +DK+ R GPV L RQP+ R R GG++FGEMERDC IAHGA+ L ERLF +SD Sbjct: 818 SEDKVKFRNTGPVHPLTRQPVADRKRFGGIKFGEMERDCLIAHGASANLHERLFTLSDSS 877 Query: 510 RIHVCNFCGLIAIANLRNNTF-------ECKGCKNKTQISQIRLPYAAKLLFQELMSMNI 668 ++H+C C A R + C+ C + + ++ +PY AKLL QEL SM I Sbjct: 878 QMHICRKCKTYANVIERTPSSGRKIRGPYCRVCVSSDHVVRVYVPYGAKLLCQELFSMGI 937 >At3g18090.1 68416.m02300 DNA-directed RNA polymerase family 
protein similar to SP|P38420 DNA-directed RNA polymerase II 135 kDa polypeptide (EC 2.7.7.6) (RNA polymerase II subunit 2) {Arabidopsis thaliana}; contains Pfam profiles PF04560: RNA polymerase Rpb2 domain 7, PF04561: RNA polymerase Rpb2 domain 2, PF04565: RNA polymerase Rpb2 domain 3, PF04566: RNA polymerase Rpb2 domain 4, PF04567: RNA polymerase Rpb2 domain 5 Length = 1038 Score = 185 bits (451), Expect = 2e-47 Identities = 102/241 (42%), Positives = 137/241 (56%), Gaps = 19/241 (7%) Frame = +3 Query: 3 GIQYRQEDMPFTCEGITPDIIINPHAIPSRMTIGHLIECIQGK-----VSSNKGEIG--- 158 G Q++ PFT +GI PDI+INPHA PSR T G L+E K + +G Sbjct: 790 GYLEEQQNFPFTIQGIVPDIVINPHAFPSRQTPGQLLEAALSKGIACPIQKKEGSSAAYT 849 Query: 159 ----DATPFNDAVNVQKISSLLQDYGYHLRGNEVMYNGHTGRKINAQVFLGPTYYQRLKH 326 ATPF+ V +I+ L G+ GNE +YNG +G + + +F+GPT+YQRL H Sbjct: 850 KLTRHATPFSTP-GVTEITEQLHRAGFSRWGNERVYNGRSGEMMRSLIFMGPTFYQRLVH 908 Query: 327 MVDDKIHSRARGPVQILVRQPMEGRARDGGLRFGEMERDCQIAHGAAQFLRERLFEVSDP 506 M ++K+ R GPV L RQP+ R R GG+RFGEMERDC IAHGA+ L ERLF +SD Sbjct: 909 MSENKVKFRNTGPVHPLTRQPVADRKRFGGIRFGEMERDCLIAHGASANLHERLFTLSDS 968 Query: 507 YRIHVCNFCGLIAIANLRNNTF-------ECKGCKNKTQISQIRLPYAAKLLFQELMSMN 665 ++H+C C A R + C+ C + + ++ +PY AKLL QEL SM Sbjct: 969 SQMHICRKCKTYANVIERTPSSGRKIRGPYCRVCASSDHVVRVYVPYGAKLLCQELFSMG 1028 Query: 666 I 668 I Sbjct: 1029 I 1029 >At1g29940.1 68414.m03658 DNA-directed RNA polymerase family protein similar to SP|P22138 DNA-directed RNA polymerase I 135 kDa polypeptide (EC 2.7.7.6) (RNA polymerase I subunit 2) {Saccharomyces cerevisiae}; contains Pfam profiles PF04563; RNA polymerase beta subunit, PF04560: RNA polymerase Rpb2 domain 7, PF04561: RNA polymerase Rpb2 domain 2, PF04565: RNA polymerase Rpb2 domain 3, PF00562: RNA polymerase Rpb2 domain 6 Length = 1114 Score = 181 bits (440), Expect = 5e-46 Identities = 101/254 (39%), Positives = 138/254 (54%), Gaps = 33/254 (12%) Frame = +3 Query: 24 DMPFT-CEGITPDIIINPHAIPSRMTIGHLIECIQGKVSSNKGEIGDATPFNDAVN---- 188 DMPF G+ PD+IINPHA PSRMTI 
L+E I K S G+ DATPF DAV Sbjct: 853 DMPFNGVTGMRPDLIINPHAFPSRMTIAMLLESIAAKGGSLHGKFVDATPFRDAVKKTNG 912 Query: 189 ---------VQKISSLLQDYGYHLRGNEVMYNGHTGRKINAQVFLGPTYYQRLKHMVDDK 341 V + S+L++ G++ G E +Y+G+ G ++ ++F+GP YYQRL+HMV DK Sbjct: 913 EEESKSSLLVDDLGSMLKEKGFNHYGTETLYSGYLGVELKCEIFMGPVYYQRLRHMVSDK 972 Query: 342 IHSRARGPVQILVRQPMEGRARDGGLRFGEMERDCQIAHGAAQFLRERLFEVSDPYRIHV 521 R+ G V L QP++GR R GG+RFGEMERD +AHGA+ L +RL SD + V Sbjct: 973 FQVRSTGQVDQLTHQPIKGRKRGGGIRFGEMERDSLLAHGASYLLHDRLHTSSDHHIADV 1032 Query: 522 CNFCGLIAIANLRN-------------------NTFECKGCKNKTQISQIRLPYAAKLLF 644 C+ CG + +++ N C CK + + +PY + L Sbjct: 1033 CSLCGSLLTSSVVNVQQKKLIQEIGKLPPGRTPKKVTCYSCKTSKGMETVAMPYVFRYLA 1092 Query: 645 QELMSMNIAPRLMV 686 EL SMNI L + Sbjct: 1093 AELASMNIKMTLQL 1106 >At1g72040.1 68414.m08327 deoxynucleoside kinase family contains Pfam profile: PF01712 deoxynucleoside kinase Length = 580 Score = 31.5 bits (68), Expect = 0.61 Identities = 16/39 (41%), Positives = 21/39 (53%) Frame = +1 Query: 346 ILELEVQYKF**DSLWKVGQEMVDYVLVKWNVIVRLHTE 462 + EL+ YK D WK Q+MVDY+ +I R H E Sbjct: 202 VAELKKLYK---DKFWKASQKMVDYLRSSVGIIHRNHAE 237 >At1g70020.1 68414.m08058 hypothetical protein Length = 225 Score = 29.9 bits (64), Expect = 1.9 Identities = 14/40 (35%), Positives = 24/40 (60%) Frame = +1 Query: 427 VKWNVIVRLHTELPNFYVNVYLKYQILTVYTCVISAA*SL 546 VKW++++RL ++LP +Y+ + Q +Y V A SL Sbjct: 92 VKWDLVIRLPSDLPGYYMCLKGDLQTFILYKGVTIANSSL 131 >At4g13150.1 68417.m02047 expressed protein Length = 119 Score = 28.7 bits (61), Expect = 4.3 Identities = 12/40 (30%), Positives = 22/40 (55%) Frame = +3 Query: 144 KGEIGDATPFNDAVNVQKISSLLQDYGYHLRGNEVMYNGH 263 KG G +PF + ++ K+SS + + ++ GN M G+ Sbjct: 50 KGSGGGGSPFEEVKSLAKVSSSSEGFTFNAFGNRFMIPGN 89 >At3g46640.1 68416.m05063 myb family transcription factor contains Pfam profile: PF00249 myb-like DNA-binding domain Length = 323 Score = 28.7 bits (61), Expect = 4.3 Identities = 14/38 (36%), Positives = 21/38 (55%), Gaps = 3/38 (7%) Frame = -3 Query: 513 
YGKDLILQINV---HVKIGQLRVQSDNHVPFHQNVIHH 409 YG ++Q+ V H+ + Q+ NH P+HQN HH Sbjct: 248 YGTQQMMQMPVYAHHMGMQGYHHQNHNHDPYHQNHRHH 285 >At1g02910.1 68414.m00258 tetratricopeptide repeat (TPR)-containing protein contains Pfam profile PF00515: TPR Domain Length = 453 Score = 28.7 bits (61), Expect = 4.3 Identities = 14/44 (31%), Positives = 21/44 (47%) Frame = -2 Query: 499 DTSNKRSRKNWAAPCAI*QSRSISPKRNPPSLALPSIGCLTRIC 368 D + SR+NW + + S S SP +PPS P+ +C Sbjct: 35 DATLFNSRRNWDSHLFVYASSSSSPSSSPPSPNSPTDDLTAELC 78 >At5g58350.1 68418.m07306 protein kinase family protein contains protein kinase domain, Pfam:PF00069 Length = 571 Score = 28.3 bits (60), Expect = 5.7 Identities = 21/73 (28%), Positives = 32/73 (43%), Gaps = 3/73 (4%) Frame = +3 Query: 396 GRARDGGLRFGEMERDCQIAH---GAAQFLRERLFEVSDPYRIHVCNFCGLIAIANLRNN 566 G+ + G L M RDC AH G +F+ L+E + I V +F G+ + + Sbjct: 160 GQVKIGDLGLARMLRDCHSAHSIIGTPEFMAPELYEENYNELIDVYSF-GM-CFLEMITS 217 Query: 567 TFECKGCKNKTQI 605 F C + QI Sbjct: 218 EFPYSECNHPAQI 230 >At3g07150.1 68416.m00852 hypothetical protein Length = 199 Score = 28.3 bits (60), Expect = 5.7 Identities = 17/50 (34%), Positives = 22/50 (44%) Frame = +3 Query: 414 GLRFGEMERDCQIAHGAAQFLRERLFEVSDPYRIHVCNFCGLIAIANLRN 563 GL G+ E D + HG L + SDP CN GLI L++ Sbjct: 78 GLSTGKHEADALLVHGKLSKLGTKRAR-SDPLEDFACNDLGLIKTKKLKD 126 >At5g58880.1 68418.m07377 hypothetical protein Length = 1088 Score = 27.9 bits (59), Expect = 7.6 Identities = 10/27 (37%), Positives = 15/27 (55%) Frame = -2 Query: 502 SDTSNKRSRKNWAAPCAI*QSRSISPK 422 +DT N + + W C I S+ ISP+ Sbjct: 955 ADTQNSQDSQTWTQQCGIDSSQGISPR 981 >At1g73875.1 68414.m08555 endonuclease/exonuclease/phosphatase family protein contains Pfam profile PF03372: Endonuclease/Exonuclease/phosphatase family Length = 454 Score = 27.9 bits (59), Expect = 7.6 Identities = 15/42 (35%), Positives = 21/42 (50%) Frame = +3 Query: 294 LGPTYYQRLKHMVDDKIHSRARGPVQILVRQPMEGRARDGGL 419 L TY+ R VD H++ PV++L P + R GGL Sbjct: 390 LATTYHSRFLGTVDYIWHTKELVPVRVLETLPADVLRRTGGL 
431

>At1g15660.1 68414.m01880 expressed protein similar to CENPCA protein
 (GI:11863170) {Zea mays}
          Length = 705

 Score = 27.9 bits (59), Expect = 7.6
 Identities = 27/118 (22%), Positives = 49/118 (41%)
 Frame = +3

Query: 84 PSRMTIGHLIECIQGKVSSNKGEIGDATPFNDAVNVQKISSLLQDYGYHLRGNEVMYNGH 263
PS + + + + I +N G + A+PFND+V V++ D H ++ H
Sbjct: 350 PSEVNVQPIAKDIPNTSPTNVGTVDVASPFNDSV-VKRSGE--DDSHIH----SGIHRSH 402

Query: 264 TGRKINAQVFLGPTYYQRLKHMVDDKIHSRARGPVQILVRQPMEGRARDGGLRFGEME 437
R N + + + R M+ + R +G ++ V G R+ G R + E
Sbjct: 403 LSRDGNPDICVMDSISNRSSAMLQKNVDMRTKGK-EVDVPMSESGANRNTGDRENDAE 459

>At1g64810.1 68414.m07348 expressed protein contains Pfam PF05634:
 Arabidopsis thaliana protein of unknown function (DUF794)
          Length = 436

 Score = 27.5 bits (58), Expect = 10.0
 Identities = 9/27 (33%), Positives = 16/27 (59%)
 Frame = +3

Query: 513 IHVCNFCGLIAIANLRNNTFECKGCKN 593
+ C+ CG + +AN+ +N +C G N
Sbjct: 153 VFACSECGAVHVANVGHNIRDCNGPTN 179

  Database: arabidopsis
    Posted date: Oct 4, 2007 10:56 AM
  Number of letters in database: 12,070,560
  Number of sequences in database: 28,952

Lambda     K      H
   0.318    0.134    0.401

Gapped
Lambda     K      H
   0.279   0.0580    0.190

Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 16,317,742
Number of Sequences: 28952
Number of extensions: 349501
Number of successful extensions: 930
Number of sequences better than 10.0: 16
Number of HSP's better than 10.0 without gapping: 881
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 923
length of database: 12,070,560
effective HSP length: 79
effective length of database: 9,783,352
effective search space used: 1653386488
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)