BLASTX 2.2.12 [Aug-07-2005] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= FWDP01_FL5_J09 (900 letters) Database: arabidopsis 28,952 sequences; 12,070,560 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value At1g01040.1 68414.m00004 DEAD/DEAH box helicase carpel factory /... 139 2e-33 At3g03300.1 68416.m00327 DEAD/DEAH box helicase carpel factory-r... 137 8e-33 At3g43920.1 68416.m04701 ribonuclease III family protein similar... 104 9e-23 At5g20320.1 68418.m02418 DEAD/DEAH box helicase, putative simila... 103 2e-22 At4g15417.1 68417.m02358 ribonuclease III family protein similar... 91 9e-19 At3g20420.1 68416.m02586 ribonuclease III family protein similar... 88 9e-18 At5g45150.1 68418.m05543 ribonuclease III family protein similar... 73 3e-13 At4g13340.1 68417.m02084 leucine-rich repeat family protein / ex... 33 0.26 At5g11990.1 68418.m01402 proline-rich family protein contains pr... 33 0.34 At3g26090.1 68416.m03249 expressed protein 32 0.60 At3g60500.2 68416.m06767 3' exoribonuclease family protein simil... 31 1.4 At3g60500.1 68416.m06766 3' exoribonuclease family protein simil... 31 1.4 At4g36120.1 68417.m05141 expressed protein 30 1.8 At5g39430.1 68418.m04776 hypothetical protein 30 2.4 At2g04170.1 68415.m00402 meprin and TRAF homology domain-contain... 30 2.4 At5g56330.1 68418.m07031 carbonic anhydrase family protein conta... 29 4.2 At4g35800.1 68417.m05087 DNA-directed RNA polymerase II largest ... 29 4.2 At4g15160.1 68417.m02327 protease inhibitor/seed storage/lipid t... 29 4.2 At1g12040.1 68414.m01390 leucine-rich repeat family protein / ex... 29 4.2 At3g24540.1 68416.m03082 protein kinase family protein contains ... 
29 5.6 At2g44710.1 68415.m05564 RNA recognition motif (RRM)-containing ... 29 5.6 At2g14890.2 68415.m01692 arabinogalactan-protein (AGP9) identica... 29 5.6 At2g14890.1 68415.m01693 arabinogalactan-protein (AGP9) identica... 29 5.6 At1g61080.1 68414.m06877 proline-rich family protein 29 5.6 At1g34370.2 68414.m04268 zinc finger (C2H2 type) family protein ... 29 5.6 At1g34370.1 68414.m04267 zinc finger (C2H2 type) family protein ... 29 5.6 At3g22800.1 68416.m02874 leucine-rich repeat family protein / ex... 28 7.3 At3g15400.1 68416.m01954 anther development protein, putative si... 28 7.3 At3g11310.1 68416.m01375 hypothetical protein 28 7.3 At1g68725.1 68414.m07853 arabinogalactan-protein, putative (AGP1... 28 7.3 At2g27390.1 68415.m03306 proline-rich family protein contains pr... 28 9.7 At1g70620.2 68414.m08137 cyclin-related contains weak similarity... 28 9.7 At1g70620.1 68414.m08138 cyclin-related contains weak similarity... 28 9.7 At1g49900.1 68414.m05596 zinc finger (C2H2 type) family protein ... 
28 9.7 >At1g01040.1 68414.m00004 DEAD/DEAH box helicase carpel factory / CAF identical to RNA helicase/RNAseIII CAF protein GB:AAF03534 GI:6102610 from [Arabidopsis thaliana] Length = 1909 Score = 139 bits (337), Expect = 2e-33 Identities = 79/176 (44%), Positives = 110/176 (62%), Gaps = 1/176 (0%) Frame = +2 Query: 32 LRYVEDPEGELEKMVSGYEQLERVLQYRFRDRSLLLQAMTHASHHRNVLTDCYQRLEFLG 211 L+ V PE L+ + + LER L+Y F+++ LL++A+THAS + ++ CYQRLEF+G Sbjct: 1544 LKNVNVPESVLKSI--DFVGLERALKYEFKEKGLLVEAITHASRPSSGVS-CYQRLEFVG 1600 Query: 212 DAILDYLITRHLYEDKRCHSPGALTDLRSALVNNTIFATLAARHGFHKYFRHMSPGLNEV 391 DA+LD+LITRHL+ PG LTDLR+A VNN FA +A +H H Y RH S L + Sbjct: 1601 DAVLDHLITRHLFFTYTSLPPGRLTDLRAAAVNNENFARVAVKHKLHLYLRHGSSALEKQ 1660 Query: 392 LKKYVK-IQEENGHSISEEHYLIHEDEMEQAEDVEVPKALGDLFESVAGAIFLDSG 556 ++++VK +Q E+ L D + PK LGD+ ES+AGAIFLDSG Sbjct: 1661 IREFVKEVQTESSKPGFNSFGL---------GDCKAPKVLGDIVESIAGAIFLDSG 1707 Score = 56.4 bits (130), Expect = 2e-08 Identities = 52/178 (29%), Positives = 81/178 (45%), Gaps = 22/178 (12%) Frame = +2 Query: 89 QLERVLQYRFRDRSLLLQAMTHASHHRNVLTDCYQRLEFLGDAILDYLITRHLYEDKRCH 268 QL+ ++ Y S +L+A+T AS T CY+R E LGDA L ++++R L+ Sbjct: 1345 QLKNLISYPI-PTSKILEALTAASCQE---TFCYERAELLGDAYLKWVVSRFLFLKYPQK 1400 Query: 269 SPGALTDLRSALVNNTIFATLAARHGFHKYF--------RHMSPGLNEVLKKYVK----- 409 G LT +R +V+N + A G Y R +PG+ V + K Sbjct: 1401 HEGQLTRMRQQMVSNMVLYQFALVKGLQSYIQADRFAPSRWSAPGVPPVFDEDTKDGGSS 1460 Query: 410 IQEENGHSISEEHYLIHED-EMEQAE--------DVEVPKALGDLFESVAGAIFLDSG 556 +E +SEE+ + ED EME E V K L D+ E++ G +++ G Sbjct: 1461 FFDEEQKPVSEENSDVFEDGEMEDGELEGDLSSYRVLSSKTLADVVEALIGVYYVEGG 1518 >At3g03300.1 68416.m00327 DEAD/DEAH box helicase carpel factory-related similar to RNA helicase GB:AAF03534 Length = 1317 Score = 137 bits (332), Expect = 8e-33 Identities = 73/167 (43%), Positives = 105/167 (62%), Gaps = 1/167 (0%) Frame = +2 Query: 59 ELEKMVS-GYEQLERVLQYRFRDRSLLLQAMTHASHHRNVLTDCYQRLEFLGDAILDYLI 235 + EK+V+ GY +E +L Y F D+SLL++A+TH S+ + CYQRLEFLGD++LDYLI Sbjct: 1071 
QAEKLVNVGY--MESLLNYSFEDKSLLVEALTHGSYMMPEIPRCYQRLEFLGDSVLDYLI 1128 Query: 236 TRHLYEDKRCHSPGALTDLRSALVNNTIFATLAARHGFHKYFRHMSPGLNEVLKKYVKIQ 415 T+HLY+ C SPG LTD+RSA VNN +A +A + HK+ + S L++ + + Sbjct: 1129 TKHLYDKYPCLSPGLLTDMRSASVNNECYALVAVKANLHKHILYASHHLHKHISR----- 1183 Query: 416 EENGHSISEEHYLIHEDEMEQAEDVEVPKALGDLFESVAGAIFLDSG 556 ++SE + D+ PK LGD+ ES+AGAIF+DSG Sbjct: 1184 -----TVSEFEQSSLQSTFGWESDISFPKVLGDVIESLAGAIFVDSG 1225 >At3g43920.1 68416.m04701 ribonuclease III family protein similar to RNA helicase/RNAseIII CAF protein [Arabidopsis thaliana] GI:6102610; contains Pfam profiles PF02170: PAZ domain, PF00636: RNase3 domain Length = 1531 Score = 104 bits (249), Expect = 9e-23 Identities = 61/162 (37%), Positives = 93/162 (57%), Gaps = 3/162 (1%) Frame = +2 Query: 89 QLERVLQYRFRDRSLLLQAMTHASHHRNVLTDCYQRLEFLGDAILDYLITRHLYEDKRCH 268 +LER +Q+ F + LL +A+TH+S + Y+RLEFLGD++LD+LITRHL+ Sbjct: 1151 ELERKIQHEFSAKFLLKEAITHSSLRESY---SYERLEFLGDSVLDFLITRHLFNTYEQT 1207 Query: 269 SPGALTDLRSALVNNTIFATLAARHGFHKYFRHMSPGLNEVLKKYV---KIQEENGHSIS 439 PG +TDLRSA VNN FA +A ++ H + + + L + Y+ + +E G SI Sbjct: 1208 GPGEMTDLRSACVNNENFAQVAVKNNLHTHLQRCATVLETQINDYLMSFQKPDETGRSI- 1266 Query: 440 EEHYLIHEDEMEQAEDVEVPKALGDLFESVAGAIFLDSGMSL 565 ++ PKALGD+ ES+AGA+ +D+ + L Sbjct: 1267 --------------PSIQGPKALGDVVESIAGALLIDTRLDL 1294 Score = 35.1 bits (77), Expect = 0.064 Identities = 17/59 (28%), Positives = 30/59 (50%) Frame = +2 Query: 191 QRLEFLGDAILDYLITRHLYEDKRCHSPGALTDLRSALVNNTIFATLAARHGFHKYFRH 367 +RLE LGD++L Y+ + HL+ G L+ R ++++N+ L Y R+ Sbjct: 973 ERLELLGDSVLKYVASCHLFLKYPDKDEGQLSRQRQSIISNSNLHRLTTSRKLQGYIRN 1031 >At5g20320.1 68418.m02418 DEAD/DEAH box helicase, putative similar to CAF protein [Arabidopsis thaliana] GI:6102610; contains Pfam profiles PF00270: DEAD/DEAH box helicase, PF00271: Helicase conserved C-terminal domain, PF03368: Domain of unknown function, PF00636: RNase3 domain, PF00035: Double-stranded RNA binding motif Length = 1676 Score = 103 bits (246), Expect = 2e-22 Identities = 
64/160 (40%), Positives = 85/160 (53%) Frame = +2 Query: 86 EQLERVLQYRFRDRSLLLQAMTHASHHRNVLTDCYQRLEFLGDAILDYLITRHLYEDKRC 265 E LE L Y+F + LL+QA H S++R+ CYQRLEFLGDA+LDYL+T + + Sbjct: 1267 ETLENQLDYKFLHKGLLVQAFIHPSYNRHG-GGCYQRLEFLGDAVLDYLMTSYFFTVFPK 1325 Query: 266 HSPGALTDLRSALVNNTIFATLAARHGFHKYFRHMSPGLNEVLKKYVKIQEENGHSISEE 445 PG LTDLRS VNN A +A ++ S L+EV++ Y + + Sbjct: 1326 LKPGQLTDLRSLSVNNEALANVAVSFSLKRFLFCESIYLHEVIEDYTNFLASSPLASG-- 1383 Query: 446 HYLIHEDEMEQAEDVEVPKALGDLFESVAGAIFLDSGMSL 565 Q+E PK LGDL ES GA+FLD G +L Sbjct: 1384 ----------QSEGPRCPKVLGDLVESCLGALFLDCGFNL 1413 Score = 34.7 bits (76), Expect = 0.084 Identities = 38/145 (26%), Positives = 61/145 (42%), Gaps = 4/145 (2%) Frame = +2 Query: 134 LLQAMTHASHHRNVLTDCYQRLEFLGDAILDYLITRHLYEDKRCHSPGALTDLRSALVNN 313 +L+A+T H + +RLE LGDA L + ++RHL+ G LT RS N Sbjct: 1094 VLEALTTEKCHERL---SLERLEVLGDAFLKFAVSRHLFLHHDSLDEGELTRRRS---NV 1147 Query: 314 TIFATLAARHGFHKYFRHMSPGLNEVLKKYVKIQEENGHSISEEHYLIHEDEME----QA 481 I F + +EV K V HS++ + ++ + E + Sbjct: 1148 YIRDQALDPTQFFAFGHPCRVTCDEVASKEV-------HSLNRDLGILESNTGEIRCSKG 1200 Query: 482 EDVEVPKALGDLFESVAGAIFLDSG 556 K + D+ E++ GA +DSG Sbjct: 1201 HHWLYKKTIADVVEALVGAFLVDSG 1225 >At4g15417.1 68417.m02358 ribonuclease III family protein similar to CAF protein (RNA helicase/RNAseIII) [Arabidopsis thaliana] GI:6102610; contains Pfam profile PF00636 RNase3 domain Length = 213 Score = 91.1 bits (216), Expect = 9e-19 Identities = 56/155 (36%), Positives = 87/155 (56%) Frame = +2 Query: 86 EQLERVLQYRFRDRSLLLQAMTHASHHRNVLTDCYQRLEFLGDAILDYLITRHLYEDKRC 265 E LE++L Y+F+D+SLLL+A T AS+ + ++ Y+ LE LGD+IL+ I + Sbjct: 30 ESLEKILNYKFKDKSLLLKAFTDASYVDDK-SESYELLELLGDSILNMGIIYDFIKLYPK 88 Query: 266 HSPGALTDLRSALVNNTIFATLAARHGFHKYFRHMSPGLNEVLKKYVKIQEENGHSISEE 445 +PG LT LR+ V+ A +A H + Y RH P L E + ++V+ E+ Sbjct: 89 EAPGPLTKLRAVNVDTEKLARVAVNHQLYSYLRHKKPLLEEQILEFVEAMEK-------- 140 Query: 446 HYLIHEDEMEQAEDVEVPKALGDLFESVAGAIFLD 550 Y +H + + ++VPK L D+ ES GAIF+D Sbjct: 141 
-YPLHSNGL-----LKVPKVLADIVESTIGAIFMD 169 >At3g20420.1 68416.m02586 ribonuclease III family protein similar to CAF protein (RNA helicase/RNAseIII) [Arabidopsis thaliana] GI:6102610; contains Pfam profiles: PF00636 RNase3 domain, PF00035 Double-stranded RNA binding motif Length = 391 Score = 87.8 bits (208), Expect = 9e-18 Identities = 56/176 (31%), Positives = 93/176 (52%), Gaps = 2/176 (1%) Frame = +2 Query: 50 PEGELEKMVSGYEQLERVLQYRFRDRSLLLQAMTHASHHRNVLTD--CYQRLEFLGDAIL 223 P + + E +E++L Y+F ++SLL +A+TH S TD Y+RLEF+GD+ + Sbjct: 49 PSVPVSSEMESMEAVEKILNYKFSNKSLLKEAITHTS-----CTDFPSYERLEFIGDSAI 103 Query: 224 DYLITRHLYEDKRCHSPGALTDLRSALVNNTIFATLAARHGFHKYFRHMSPGLNEVLKKY 403 I+ +LY P L+ LR+A V+ A ++ HG + + R +P L+E +K++ Sbjct: 104 GLAISNYLYLTYPSLEPHDLSLLRAANVSTEKLARVSLNHGLYSFLRRNAPSLDEKVKEF 163 Query: 404 VKIQEENGHSISEEHYLIHEDEMEQAEDVEVPKALGDLFESVAGAIFLDSGMSLGR 571 ++ +E L + V+ PK L DLFES+AGA+++D L R Sbjct: 164 -------SEAVGKEDDL----SVSYGGLVKAPKVLADLFESLAGAVYVDVNFDLQR 208 >At5g45150.1 68418.m05543 ribonuclease III family protein similar to CAF protein (RNA helicase/RNAseIII) [Arabidopsis thaliana] GI:6102610; contains Pfam profiles PF00035: Double-stranded RNA binding motif, PF00636 RNase3 domain Length = 957 Score = 72.9 bits (171), Expect = 3e-13 Identities = 49/173 (28%), Positives = 85/173 (49%) Frame = +2 Query: 74 VSGYEQLERVLQYRFRDRSLLLQAMTHASHHRNVLTDCYQRLEFLGDAILDYLITRHLYE 253 ++ E +E++L Y F +++LL +A+T S + RLEF GD+IL+ T ++ Sbjct: 1 MNSVEAVEKILNYSFVNKTLLKEAITQKS-------PLFDRLEFFGDSILEVAFTNYICH 53 Query: 254 DKRCHSPGALTDLRSALVNNTIFATLAARHGFHKYFRHMSPGLNEVLKKYVKIQEENGHS 433 L DLR+A V+N FA +A H H + +P L + +K + + + Sbjct: 54 TYPNLKVKELRDLRTANVSNEKFARIAVNHNLHHFLLLQNPSLFKKVKNFAE-------A 106 Query: 434 ISEEHYLIHEDEMEQAEDVEVPKALGDLFESVAGAIFLDSGMSLGRRVEVVRA 592 + +E +D + V+ PK L D ES+A +F+D + R E+ R+ Sbjct: 107 VRKE-----DDPVPYGGLVKAPKILADTLESIAATVFIDVNYDVKRLWEIFRS 154 Score = 59.3 bits (137), Expect = 3e-09 Identities = 44/174 (25%), Positives = 86/174 (49%) Frame = +2 Query: 
68 KMVSGYEQLERVLQYRFRDRSLLLQAMTHASHHRNVLTDCYQRLEFLGDAILDYLITRHL 247 +M S E +E++L Y F +++LL + +TH + + +Q L F+G++ L T+HL Sbjct: 410 EMDSSVEAVEKILNYSFVNKTLLKELLTHNN------SPLFQGLMFVGESALSLAFTKHL 463 Query: 248 YEDKRCHSPGALTDLRSALVNNTIFATLAARHGFHKYFRHMSPGLNEVLKKYVKIQEENG 427 Y P L+ LR A + +A +A + G ++ F P ++ ++++ + Sbjct: 464 YLTYPMLEPKDLSVLRDANTCHDKYACVAVKKGIYQSFIGSVPKPEKMTTDFIELMGK-- 521 Query: 428 HSISEEHYLIHEDEMEQAEDVEVPKALGDLFESVAGAIFLDSGMSLGRRVEVVR 589 ++ Y + V+ PK L +L VAGA+++D ++ R +E+ R Sbjct: 522 ---EDDPYRV----------VKAPKILVNLLAGVAGAVYIDVKYNVQRLLEIFR 562 >At4g13340.1 68417.m02084 leucine-rich repeat family protein / extensin family protein similar to extensin-like protein [Lycopersicon esculentum] gi|5917664|gb|AAD55979; contains leucine-rich repeats, Pfam:PF00560; contains proline rich extensin domains, INTERPRO:IPR002965 Length = 760 Score = 33.1 bits (72), Expect = 0.26 Identities = 23/79 (29%), Positives = 26/79 (32%) Frame = +2 Query: 632 PXPXPLFXXPXLGPSPXTXQVSANPXPAXPAXXGAFRXXCXWXLXPPPXXLSPGXXXXPX 811 P P P++ P P P V + P P P PPP SP P Sbjct: 443 PPPPPVYSPPPPPPPPPPPPVYSPPPPPPPPP-------------PPPPVYSPPPPSPPP 489 Query: 812 PRIPHGXPXXPPAPXEXXP 868 P P P PP P P Sbjct: 490 PPPPVYSPPPPPPPPPPPP 508 Score = 30.3 bits (65), Expect = 1.8 Identities = 19/74 (25%), Positives = 25/74 (33%) Frame = +2 Query: 632 PXPXPLFXXPXLGPSPXTXQVSANPXPAXPAXXGAFRXXCXWXLXPPPXXLSPGXXXXPX 811 P P P++ P P P +P P P + PPP P P Sbjct: 458 PPPPPVYSPPPPPPPPPPPPPVYSPPPPSPPPP----PPPVYSPPPPPPPPPPPPVYSPP 513 Query: 812 PRIPHGXPXXPPAP 853 P + P PP+P Sbjct: 514 PPPVYSSPPPPPSP 527 >At5g11990.1 68418.m01402 proline-rich family protein contains proline-rich extensin domains, INTERPRO:IPR002965 Length = 181 Score = 32.7 bits (71), Expect = 0.34 Identities = 15/49 (30%), Positives = 17/49 (34%) Frame = +2 Query: 749 CXWXLXPPPXXLSPGXXXXPXPRIPHGXPXXPPAPXEXXPXXXGXGXSP 895 C PPP SP P P +P PP P + P SP Sbjct: 38 CPTICSPPPSKPSPSMSPPPSPSLPLSSSPPPPPPHKHSPPPLSQSLSP 86 >At3g26090.1 68416.m03249 expressed protein 
Length = 459 Score = 31.9 bits (69), Expect = 0.60 Identities = 20/89 (22%), Positives = 40/89 (44%), Gaps = 3/89 (3%) Frame = +2 Query: 11 KLPRSPLLRYVEDPEGELEKMVSGYEQLERVLQYRFRDRSLLLQAMTHASHHRNVLTDCY 190 K+P +R + +EK + ++E L ++ R L Q +TH +N L + Sbjct: 333 KIPEDDSIRRIYMARHIMEKFIVAGAEMELNLSHKTRQEILTTQDLTHTDLFKNALNEVM 392 Query: 191 QRLEFLGDAILDYLITRHLY---EDKRCH 268 Q ++ + + DY + + E++ CH Sbjct: 393 QLIKM--NLVRDYWSSIYFIKFKEEESCH 419 >At3g60500.2 68416.m06767 3' exoribonuclease family protein similar to SP|Q06265 Exosome complex exonuclease RRP45 [Homo sapiens]; contains Pfam profiles PF01138: 3' exoribonuclease family, domain 1, PF03725: 3' exoribonuclease family, domain 2 Length = 438 Score = 30.7 bits (66), Expect = 1.4 Identities = 19/57 (33%), Positives = 27/57 (47%) Frame = +1 Query: 268 LAGSPHRPPLGAGQQHHLRDPGGQTRLSQIFQTHVSRLERGAEEIREDTRGERTLYK 438 LA S P A ++ H R Q R ++I + HV RL+ EE+R E +K Sbjct: 295 LAKSEVSGPTVAVKEEH-RKSSDQERAAEISREHVERLKLSTEEVRSSKEEEAANFK 350 >At3g60500.1 68416.m06766 3' exoribonuclease family protein similar to SP|Q06265 Exosome complex exonuclease RRP45 [Homo sapiens]; contains Pfam profiles PF01138: 3' exoribonuclease family, domain 1, PF03725: 3' exoribonuclease family, domain 2 Length = 438 Score = 30.7 bits (66), Expect = 1.4 Identities = 19/57 (33%), Positives = 27/57 (47%) Frame = +1 Query: 268 LAGSPHRPPLGAGQQHHLRDPGGQTRLSQIFQTHVSRLERGAEEIREDTRGERTLYK 438 LA S P A ++ H R Q R ++I + HV RL+ EE+R E +K Sbjct: 295 LAKSEVSGPTVAVKEEH-RKSSDQERAAEISREHVERLKLSTEEVRSSKEEEAANFK 350 >At4g36120.1 68417.m05141 expressed protein Length = 981 Score = 30.3 bits (65), Expect = 1.8 Identities = 14/49 (28%), Positives = 21/49 (42%) Frame = +1 Query: 304 GQQHHLRDPGGQTRLSQIFQTHVSRLERGAEEIREDTRGERTLYKRRAL 450 G H DP Q +SQ H+++ E + E+ + E RR L Sbjct: 316 GLGHEFTDPRAQRNMSQNHNAHIAKAEISTDHKLEECKRENVYLTRRTL 364 >At5g39430.1 68418.m04776 hypothetical protein Length = 511 Score = 29.9 bits (64), Expect = 2.4 Identities = 14/43 (32%), Positives = 20/43 (46%) Frame = 
+1 Query: 319 LRDPGGQTRLSQIFQTHVSRLERGAEEIREDTRGERTLYKRRA 447 L P Q R S ++Q R EE+ E +R LY+ +A Sbjct: 189 LNSPTSQNRKSAVYQVSFKRRSCDGEEVTEHRSSKRLLYRPKA 231 >At2g04170.1 68415.m00402 meprin and TRAF homology domain-containing protein / MATH domain-containing protein weak similarity to NtN2 [Medicago truncatula] GI:3776084; contains Pfam profile PF00917: MATH domain Length = 420 Score = 29.9 bits (64), Expect = 2.4 Identities = 19/56 (33%), Positives = 19/56 (33%), Gaps = 1/56 (1%) Frame = -2 Query: 899 GXGKXPXPXXXGFXXXGXXXXXXPRG-GFGXXGXXXGPGXXFXGAGTTSXXXTGER 735 G G P GF G P G GFG G GPG G G G R Sbjct: 5 GCGGGPGRGGRGFGGRGGGPGFGPGGPGFGPGGPGFGPGGPGFGPGGPGFGGRGPR 60 >At5g56330.1 68418.m07031 carbonic anhydrase family protein contains proline-rich extensin domains, INTERPRO:IPR002965; contains Pfam profile PF00194: Eukaryotic-type carbonic anhydrase Length = 350 Score = 29.1 bits (62), Expect = 4.2 Identities = 20/79 (25%), Positives = 24/79 (30%) Frame = +2 Query: 632 PXPXPLFXXPXLGPSPXTXQVSANPXPAXPAXXGAFRXXCXWXLXPPPXXLSPGXXXXPX 811 P P P P P+P + P P P A P P P P Sbjct: 33 PAPAPTPPKPKPTPAPTPPKPKPKPAPTPPKPKPA-PAPTPPKPKPAPAPTPPKPKPKPA 91 Query: 812 PRIPHGXPXXPPAPXEXXP 868 P P+ P P P + P Sbjct: 92 PTPPNPKPTPAPTPPKPKP 110 Score = 27.9 bits (59), Expect = 9.7 Identities = 20/74 (27%), Positives = 23/74 (31%) Frame = +2 Query: 632 PXPXPLFXXPXLGPSPXTXQVSANPXPAXPAXXGAFRXXCXWXLXPPPXXLSPGXXXXPX 811 P P P P P+P + + P P P A P P P P Sbjct: 46 PAPTPPKPKPKPAPTPPKPKPAPAPTPPKPKPAPAPTPPKP---KPKPAPTPPNPKPTPA 102 Query: 812 PRIPHGXPXXPPAP 853 P P P PAP Sbjct: 103 PTPPKPKPAPAPAP 116 >At4g35800.1 68417.m05087 DNA-directed RNA polymerase II largest subunit (RPB205) (RPII) (RPB1) nearly identical to P|P18616 DNA-directed RNA polymerase II largest subunit (EC 2.7.7.6) {Arabidopsis thaliana} Length = 1840 Score = 29.1 bits (62), Expect = 4.2 Identities = 16/55 (29%), Positives = 27/55 (49%) Frame = +1 Query: 280 PHRPPLGAGQQHHLRDPGGQTRLSQIFQTHVSRLERGAEEIREDTRGERTLYKRR 444 PH PP G ++ +RD G + L + ++ LE 
G + R G+ L+ R+ Sbjct: 408 PHPPPGKTGAKYIIRDDGQRLDLRYLKKSSDQHLELGYKVERHLQDGDFVLFNRQ 462 >At4g15160.1 68417.m02327 protease inhibitor/seed storage/lipid transfer protein (LTP) family protein similar to SP|Q00451|PRF1_LYCES 36.4 kDa proline-rich protein Lycopersicon esculentum, proline-rich cell wall protein [Medicago sativa] GI:3818416; contains Pfam profile PF00234 Protease inhibitor/seed storage/LTP family Length = 428 Score = 29.1 bits (62), Expect = 4.2 Identities = 21/77 (27%), Positives = 26/77 (33%) Frame = +2 Query: 638 PXPLFXXPXLGPSPXTXQVSANPXPAXPAXXGAFRXXCXWXLXPPPXXLSPGXXXXPXPR 817 P P + P P P T + P P P + PPP ++P P P Sbjct: 106 PPPPYVKP---PPPPTVKPPPPPTPYTPPPPTPYTPPPPTVKPPPPPVVTP-PPPTPTPE 161 Query: 818 IPHGXPXXPPAPXEXXP 868 P P PP P P Sbjct: 162 AP--CPPPPPTPYPPPP 176 >At1g12040.1 68414.m01390 leucine-rich repeat family protein / extensin family protein (LRX1) similar to extensin-like protein [Lycopersicon esculentum] gi|5917664|gb|AAD55979; contains leucine-rich repeats, Pfam:PF00560; contains proline rich extensin domains, INTERPRO:IPR002965 Length = 744 Score = 29.1 bits (62), Expect = 4.2 Identities = 21/86 (24%), Positives = 29/86 (33%) Frame = +2 Query: 611 LEAFXRGPXPXPLFXXPXLGPSPXTXQVSANPXPAXPAXXGAFRXXCXWXLXPPPXXLSP 790 + A+ P P P P + SP V ++P P + PPP S Sbjct: 453 VRAYPPPPPPSPSPPPPYVYSSPPPPYVYSSPPPPPYVYSSPPPPPYVYSSPPPPYVYS- 511 Query: 791 GXXXXPXPRIPHGXPXXPPAPXEXXP 868 P P + P PP+P P Sbjct: 512 ---SPPPPYVYSSPPPPPPSPPPPCP 534 >At3g24540.1 68416.m03082 protein kinase family protein contains protein kinase domain, Pfam:PF00069 Length = 509 Score = 28.7 bits (61), Expect = 5.6 Identities = 22/73 (30%), Positives = 27/73 (36%), Gaps = 1/73 (1%) Frame = +2 Query: 638 PXPLFXXPXLGP-SPXTXQVSANPXPAXPAXXGAFRXXCXWXLXPPPXXLSPGXXXXPXP 814 P PLF P P +P +S P P P+ R PPP SP P Sbjct: 50 PEPLFSEPPPPPKAPVNVSLSPPPPPRSPSTSTPPRLG---NRNPPPPA-SPSGQEPTTP 105 Query: 815 RIPHGXPXXPPAP 853 + G PP+P Sbjct: 106 TMTPGFSLSPPSP 118 >At2g44710.1 68415.m05564 RNA recognition motif 
(RRM)-containing protein Length = 809 Score = 28.7 bits (61), Expect = 5.6 Identities = 14/39 (35%), Positives = 24/39 (61%), Gaps = 1/39 (2%) Frame = +2 Query: 440 EEHYLIHEDEMEQAEDVEVPKALGDLFESVA-GAIFLDS 553 E+H L++E+ E E++EV + G+ + + GA LDS Sbjct: 138 EDHELVNEEGEELEEEIEVEEEAGEFADEIGDGAEDLDS 176 >At2g14890.2 68415.m01692 arabinogalactan-protein (AGP9) identical to gi|10880495|gb|AAG24277 Length = 176 Score = 28.7 bits (61), Expect = 5.6 Identities = 22/78 (28%), Positives = 26/78 (33%), Gaps = 1/78 (1%) Frame = +2 Query: 638 PXPLFXXPXLGPSPXTXQVSANPXPAXPAXXGAFRXXCXWX-LXPPPXXLSPGXXXXPXP 814 P P+ P + SP V+ P PA P + PPP P P P Sbjct: 45 PPPVSAPPPVTTSPPP--VTTAPPPANPPPPVSSPPPASPPPATPPPVASPPPPVASPPP 102 Query: 815 RIPHGXPXXPPAPXEXXP 868 P PPAP P Sbjct: 103 ATPPPVATPPPAPLASPP 120 >At2g14890.1 68415.m01693 arabinogalactan-protein (AGP9) identical to gi|10880495|gb|AAG24277 Length = 191 Score = 28.7 bits (61), Expect = 5.6 Identities = 22/78 (28%), Positives = 26/78 (33%), Gaps = 1/78 (1%) Frame = +2 Query: 638 PXPLFXXPXLGPSPXTXQVSANPXPAXPAXXGAFRXXCXWX-LXPPPXXLSPGXXXXPXP 814 P P+ P + SP V+ P PA P + PPP P P P Sbjct: 45 PPPVSAPPPVTTSPPP--VTTAPPPANPPPPVSSPPPASPPPATPPPVASPPPPVASPPP 102 Query: 815 RIPHGXPXXPPAPXEXXP 868 P PPAP P Sbjct: 103 ATPPPVATPPPAPLASPP 120 >At1g61080.1 68414.m06877 proline-rich family protein Length = 907 Score = 28.7 bits (61), Expect = 5.6 Identities = 25/91 (27%), Positives = 27/91 (29%), Gaps = 3/91 (3%) Frame = +2 Query: 632 PXPXPL---FXXPXLGPSPXTXQVSANPXPAXPAXXGAFRXXCXWXLXPPPXXLSPGXXX 802 P P PL P P P V+ P P P A PPP PG Sbjct: 511 PPPPPLPTTIAAPPPPPPPPRAAVAPPPPPPPPGTAAA----------PPPPPPPPGTQA 560 Query: 803 XPXPRIPHGXPXXPPAPXEXXPXXXGXGXSP 895 P P P P+P G G P Sbjct: 561 APPPPPPPPMQNRAPSPPPMPMGNSGSGGPP 591 >At1g34370.2 68414.m04268 zinc finger (C2H2 type) family protein contains Pfam domain, PF00096: Zinc finger, C2H2 type Length = 499 Score = 28.7 bits (61), Expect = 5.6 Identities = 13/33 (39%), Positives = 21/33 (63%) Frame = +2 Query: 392 
LKKYVKIQEENGHSISEEHYLIHEDEMEQAEDV 490 L K V + E GH + EEH + ED++E+ E++ Sbjct: 192 LPKPVLVDEREGHVV-EEHEMKDEDDVEEGENL 223 >At1g34370.1 68414.m04267 zinc finger (C2H2 type) family protein contains Pfam domain, PF00096: Zinc finger, C2H2 type Length = 499 Score = 28.7 bits (61), Expect = 5.6 Identities = 13/33 (39%), Positives = 21/33 (63%) Frame = +2 Query: 392 LKKYVKIQEENGHSISEEHYLIHEDEMEQAEDV 490 L K V + E GH + EEH + ED++E+ E++ Sbjct: 192 LPKPVLVDEREGHVV-EEHEMKDEDDVEEGENL 223 >At3g22800.1 68416.m02874 leucine-rich repeat family protein / extensin family protein similar to extensin-like protein [Lycsimilar to extensin-like protein [Lycopersicon esculentum] gi|5917664|gb|AAD55979; contains leucine-rich repeats, Pfam:PF00560; contains proline rich extensin domains, INTERPRO:IPR002965 Length = 470 Score = 28.3 bits (60), Expect = 7.3 Identities = 21/74 (28%), Positives = 26/74 (35%) Frame = +2 Query: 632 PXPXPLFXXPXLGPSPXTXQVSANPXPAXPAXXGAFRXXCXWXLXPPPXXLSPGXXXXPX 811 P P P P P P V +P P P+ + PPP P P Sbjct: 389 PPPPPPPPPPPPPPPPPPPYVYPSPPPPPPSPP-------PYVYPPPP----PPYVYPPP 437 Query: 812 PRIPHGXPXXPPAP 853 P P+ P PP+P Sbjct: 438 PSPPYVYPPPPPSP 451 >At3g15400.1 68416.m01954 anther development protein, putative similar to anther development protein ATA20 GB:AAC50042 GI:2708813 from [Arabidopsis thaliana] Length = 416 Score = 28.3 bits (60), Expect = 7.3 Identities = 10/21 (47%), Positives = 12/21 (57%) Frame = +1 Query: 280 PHRPPLGAGQQHHLRDPGGQT 342 PH PP +GQ H+ D G T Sbjct: 384 PHCPPFTSGQDKHMSDKGAMT 404 >At3g11310.1 68416.m01375 hypothetical protein Length = 539 Score = 28.3 bits (60), Expect = 7.3 Identities = 9/17 (52%), Positives = 10/17 (58%) Frame = +3 Query: 42 WKTLRVSWRRWCPATSS 92 W R +WRRWC A S Sbjct: 236 WNITRDAWRRWCQAVGS 252 >At1g68725.1 68414.m07853 arabinogalactan-protein, putative (AGP19) non-consensus splice site at the intron:exon boundary (AT:exon) Length = 247 Score = 28.3 bits (60), Expect = 7.3 Identities = 22/75 (29%), Positives = 24/75 (32%), Gaps = 1/75 
(1%) Frame = +2 Query: 632 PXPXPLFXXPXLGPSPXTXQVSANPXPAXPAXXGAFRXXCXWXLXPPPXXLSPGXXXXPX 811 P P P P P+ VS P P P A PPP SP Sbjct: 96 PPPQPPQSPPASAPTVSPPPVSPPPAPTSPPPTPASPPPA--PASPPPAPASPPPAPVSP 153 Query: 812 PRIPHGXP-XXPPAP 853 P + P PPAP Sbjct: 154 PPVQAPSPISLPPAP 168 >At2g27390.1 68415.m03306 proline-rich family protein contains proline-rich extensin domains, INTERPRO:IPR002965 Length = 134 Score = 27.9 bits (59), Expect = 9.7 Identities = 23/79 (29%), Positives = 24/79 (30%), Gaps = 4/79 (5%) Frame = +2 Query: 644 PLFXXPXLGPSPXTXQVSANPXPAX----PAXXGAFRXXCXWXLXPPPXXLSPGXXXXPX 811 PL P PSP + P PA P F PPP P P Sbjct: 38 PLSPPPSPPPSPSSPPRLPPPFPALFPPEPPLPPRFELPPP-LFPPPPLPRLPPPLLPPP 96 Query: 812 PRIPHGXPXXPPAPXEXXP 868 P P PP P E P Sbjct: 97 EEPPREPPPPPPPPEEPPP 115 >At1g70620.2 68414.m08137 cyclin-related contains weak similarity to Swiss-Prot:P35662 cylicin I (Multiple-band polypeptide I) [Bos taurus] Length = 884 Score = 27.9 bits (59), Expect = 9.7 Identities = 25/92 (27%), Positives = 27/92 (29%), Gaps = 11/92 (11%) Frame = +2 Query: 638 PXPLFXXPXLGPSPXTX------QVSANPXPAXPAXXGA----FRXXCXWXLXPPPXXLS 787 P P + P GP P T Q A P P G + P P Sbjct: 6 PPPQYLRPPSGPPPPTDPYHQYYQHQARPPVPPPTQPGGPPAWYSNQFHHPHSPSPPPPP 65 Query: 788 PGXXXXPXPRIPHGXPXXPPA-PXEXXPXXXG 880 P P P P G P PA P P G Sbjct: 66 PPQWGPPSPHYPQGQPYSSPAYPPHQPPFNAG 97 >At1g70620.1 68414.m08138 cyclin-related contains weak similarity to Swiss-Prot:P35662 cylicin I (Multiple-band polypeptide I) [Bos taurus] Length = 897 Score = 27.9 bits (59), Expect = 9.7 Identities = 25/92 (27%), Positives = 27/92 (29%), Gaps = 11/92 (11%) Frame = +2 Query: 638 PXPLFXXPXLGPSPXTX------QVSANPXPAXPAXXGA----FRXXCXWXLXPPPXXLS 787 P P + P GP P T Q A P P G + P P Sbjct: 6 PPPQYLRPPSGPPPPTDPYHQYYQHQARPPVPPPTQPGGPPAWYSNQFHHPHSPSPPPPP 65 Query: 788 PGXXXXPXPRIPHGXPXXPPA-PXEXXPXXXG 880 P P P P G P PA P P G Sbjct: 66 PPQWGPPSPHYPQGQPYSSPAYPPHQPPFNAG 97 >At1g49900.1 68414.m05596 zinc finger (C2H2 type) family protein contains Pfam profile: PF00096 
zinc finger, C2H2 type
          Length = 917

 Score = 27.9 bits (59), Expect = 9.7
 Identities = 13/51 (25%), Positives = 27/51 (52%)
 Frame = +2

Query: 413 QEENGHSISEEHYLIHEDEMEQAEDVEVPKALGDLFESVAGAIFLDSGMSL 565
           + + GH  S++  ++HED+   + +  +   + D  +SV   I L++  SL
Sbjct: 346 ESDGGHKDSQDEVVVHEDKCSPSSNGSIVTNVSDPEQSVRRLIDLNNPPSL 396

  Database: arabidopsis
    Posted date:  Oct 4, 2007 10:56 AM
  Number of letters in database: 12,070,560
  Number of sequences in database:  28,952

Lambda     K      H
   0.318    0.134    0.401

Gapped
Lambda     K      H
   0.279   0.0580    0.190

Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 14,443,222
Number of Sequences: 28952
Number of extensions: 270825
Number of successful extensions: 1304
Number of sequences better than 10.0: 34
Number of HSP's better than 10.0 without gapping: 953
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1163
length of database: 12,070,560
effective HSP length: 81
effective length of database: 9,725,448
effective search space used: 2120147664
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)