BLASTX 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= I09A02NGRL0002_B20
         (464 letters)

Database: arabidopsis
           28,952 sequences; 12,070,560 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

At1g02305.1 68414.m00175 cathepsin B-like cysteine protease, put...   128   1e-30
At4g01610.2 68417.m00211 cathepsin B-like cysteine protease, put...   119   1e-27
At4g01610.1 68417.m00210 cathepsin B-like cysteine protease, put...   119   1e-27
At1g02300.1 68414.m00173 cathepsin B-like cysteine protease, put...   118   3e-27
At3g45310.1 68416.m04892 cysteine proteinase, putative similar t...    71   3e-13
At5g60360.1 68418.m07568 cysteine proteinase, putative / AALP pr...    70   7e-13
At1g29110.1 68414.m03563 cysteine proteinase, putative contains ...    59   2e-09
At3g48340.1 68416.m05276 cysteine proteinase, putative similar t...    58   2e-09
At3g48350.1 68416.m05277 cysteine proteinase, putative similar t...    57   5e-09
At3g43960.1 68416.m04706 cysteine proteinase, putative contains ...    57   5e-09
At5g50260.1 68418.m06224 cysteine proteinase, putative similar t...    57   7e-09
At3g19390.1 68416.m02459 cysteine proteinase, putative / thiol p...    56   1e-08
At1g06260.1 68414.m00662 cysteine proteinase, putative contains ...    56   2e-08
At3g19400.1 68416.m02461 cysteine proteinase, putative non-conse...    55   2e-08
At3g49340.1 68416.m05394 cysteine proteinase, putative contains ...    54   5e-08
At4g23520.1 68417.m03390 cysteine proteinase, putative contains ...    53   8e-08
At2g21430.1 68415.m02550 cysteine proteinase A494, putative / th...    53   1e-07
At2g27420.1 68415.m03314 cysteine proteinase, putative contains ...    52   3e-07
At2g34080.1 68415.m04172 cysteine proteinase, putative contains ...    51   3e-07
At1g47128.1 68414.m05222 cysteine proteinase (RD21A) / thiol pro...    51   3e-07
At5g43060.1 68418.m05256 cysteine proteinase, putative / thiol p...    51   4e-07
At1g29080.1 68414.m03560 peptidase C1A papain family protein con...    51   4e-07
At4g36880.1 68417.m05229 cysteine proteinase, putative strong si...    50   1e-06
At4g39090.1 68417.m05535 cysteine proteinase RD19a (RD19A) / thi...    49   1e-06
At1g09850.1 68414.m01109 cysteine protease, papain-like (XBCP3) ...    49   1e-06
At4g11320.1 68417.m01828 cysteine proteinase, putative contains ...    48   2e-06
At1g29090.1 68414.m03561 peptidase C1A papain family protein con...    48   3e-06
At5g45890.1 68418.m05644 senescence-specific SAG12 protein (SAG1...    48   4e-06
At4g11310.1 68417.m01827 cysteine proteinase, putative contains ...    47   7e-06
At1g03720.1 68414.m00352 cathepsin-related contains weak similar...    47   7e-06
At4g35350.1 68417.m05023 cysteine endopeptidase, papain-type (XC...    45   2e-05
At4g16190.1 68417.m02457 cysteine proteinase, putative contains ...    45   2e-05
At3g54940.3 68416.m06091 cysteine proteinase, putative contains ...    43   1e-04
At5g17140.1 68418.m02008 cysteine proteinase-related low similar...    39   0.001
At1g20850.1 68414.m02612 cysteine endopeptidase, papain-type (XC...    38   0.003
At5g43900.1 68418.m05368 myosin heavy chain (MYA2) nearly identi...    32   0.17
At4g13345.2 68417.m02086 TMS membrane family protein / tumour di...    32   0.22
At4g13345.1 68417.m02085 TMS membrane family protein / tumour di...    32   0.22
At1g04160.1 68414.m00406 myosin family protein contains Pfam pro...    29   1.2
At2g27395.1 68415.m03308 cysteine protease-related contains simi...    29   1.5
At1g10170.1 68414.m01147 NF-X1 type zinc finger family protein c...    29   2.0
At5g05050.1 68418.m00536 peptidase C1A papain family protein wea...    28   2.7
At3g24460.1 68416.m03069 TMS membrane family protein / tumour di...    28   2.7
At5g39680.1 68418.m04805 pentatricopeptide (PPR) repeat-containi...    28   3.6
At2g44260.2 68415.m05508 expressed protein                             28   3.6
At3g29060.1 68416.m03635 EXS family protein / ERD1/XPR1/SYG1 fam...    27   6.2
At2g36310.1 68415.m04457 inosine-uridine preferring nucleoside h...    27   6.2
At1g61460.1 68414.m06925 S-locus protein kinase, putative contai...    27   6.2
At1g43700.1 68414.m05020 VirE2-interacting protein (VIP1) identi...    27   6.2
At2g33240.1 68415.m04072 myosin, putative similar to myosin (GI:...    27   8.3

>At1g02305.1 68414.m00175 cathepsin B-like cysteine protease, putative similar to cathepsin B-like cysteine proteinase [Nicotiana rustica] GI:609175; contains Pfam profile PF00112: Papain family cysteine protease
          Length = 362

 Score = 128 bits (310), Expect = 1e-30
 Identities = 52/96 (54%), Positives = 67/96 (69%), Gaps = 1/96 (1%)
 Frame = +1

Query: 178 NVPFKKDKRYGKHVYSVSGHEDHIKAELFKNGPVEAAFTVYSDLLSYKNGVYKHTEGNAL 357
           N +++ K YG Y V H D I AE++KNGPVE AFTVY D YK+GVYKH G +
Sbjct: 227 NQLWRESKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNI 286

Query: 358 GGHAIKIIGWGVENNNK-YWLIANSWNSDWGDNGFF 462
           GGHA+K+IGWG ++ + YWL+AN WN WGD+G+F
Sbjct: 287 GGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYF 322

 Score = 31.5 bits (68), Expect = 0.29
 Identities = 21/57 (36%), Positives = 27/57 (47%), Gaps = 1/57 (1%)
 Frame = +3

Query: 6   AWEYWKHVGLVSGGNYNSSQGCRPY-EIPPCEHHVPGNRMPCNGDTKTPKCQKNCES 173
           AW Y+KH G+V ++ C PY + C H PG C TPKC + C S
Sbjct: 182 AWRYFKHHGVV-------TEECDPYFDNTGCSH--PG----CEPAYPTPKCARKCVS 225

>At4g01610.2 68417.m00211 cathepsin B-like cysteine protease, putative similar to cathepsin B-like cysteine proteinase GI:609175 from [Nicotiana rustica]; contains an unusually short, 5nt exon
          Length = 359

 Score = 119 bits (286), Expect = 1e-27
 Identities = 48/96 (50%), Positives = 65/96 (67%), Gaps = 1/96 (1%)
 Frame = +1

Query: 178 NVPFKKDKRYGKHVYSVSGHEDHIKAELFKNGPVEAAFTVYSDLLSYKNGVYKHTEGNAL 357
           N + + K Y Y+V + I AE++KNGPVE +FTVY D YK+GVYKH G+ +
Sbjct: 224 NKLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNI 283

Query: 358 GGHAIKIIGWGVENNNK-YWLIANSWNSDWGDNGFF 462
           GGHA+K+IGWG + + YWL+AN WN WGD+G+F
Sbjct: 284 GGHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYF 319

>At4g01610.1 68417.m00210 cathepsin B-like cysteine protease, putative similar to cathepsin B-like cysteine proteinase GI:609175 from [Nicotiana rustica]; contains an unusually short, 5nt exon
          Length = 359

 Score = 119 bits (286), Expect = 1e-27
 Identities = 48/96 (50%), Positives = 65/96 (67%), Gaps = 1/96 (1%)
 Frame = +1

Query: 178 NVPFKKDKRYGKHVYSVSGHEDHIKAELFKNGPVEAAFTVYSDLLSYKNGVYKHTEGNAL 357
           N + + K Y Y+V + I AE++KNGPVE +FTVY D YK+GVYKH G+ +
Sbjct: 224 NKLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNI 283

Query: 358 GGHAIKIIGWGVENNNK-YWLIANSWNSDWGDNGFF 462
           GGHA+K+IGWG + + YWL+AN WN WGD+G+F
Sbjct: 284 GGHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYF 319

>At1g02300.1 68414.m00173 cathepsin B-like cysteine protease, putative similar to cathepsin B-like cysteine proteinase GI:609175 from [Nicotiana rustica]
          Length = 379

 Score = 118 bits (283), Expect = 3e-27
 Identities = 47/91 (51%), Positives = 63/91 (69%), Gaps = 1/91 (1%)
 Frame = +1

Query: 193 KDKRYGKHVYSVSGHEDHIKAELFKNGPVEAAFTVYSDLLSYKNGVYKHTEGNALGGHAI 372
           + K YG Y ++ I AE++KNGPVE AFTVY D YK+GVYK+ G +GGHA+
Sbjct: 249 ESKHYGVGAYRINPDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAV 308

Query: 373 KIIGWGVENNNK-YWLIANSWNSDWGDNGFF 462
           K+IGWG ++ + YWL+AN WN WGD+G+F
Sbjct: 309 KLIGWGTSDDGEDYWLLANQWNRSWGDDGYF 339

 Score = 30.7 bits (66), Expect = 0.51
 Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 1/57 (1%)
 Frame = +3

Query: 6   AWEYWKHVGLVSGGNYNSSQGCRPY-EIPPCEHHVPGNRMPCNGDTKTPKCQKNCES 173
           AW Y+K+ G+V+ Q C PY + C H PG C TPKC++ C S
Sbjct: 199 AWLYFKYHGVVT-------QECDPYFDNTGCSH--PG----CEPTYPTPKCERKCVS 242

>At3g45310.1 68416.m04892 cysteine proteinase, putative similar to AALP protein GI:7230640 from [Arabidopsis thaliana] and barley aleurain
          Length = 358

 Score = 71.3 bits (167), Expect = 3e-13
 Identities = 35/80 (43%), Positives = 47/80 (58%), Gaps = 3/80 (3%)
 Frame = +1

Query: 232 GHEDHIKAELFKNGPVEAAFTVYSDLLSYKNGVY-KHTEGNALG--GHAIKIIGWGVENN 402
           G ED +K + PV AF V + YK GV+ +T GN HA+ +G+GVE++
Sbjct: 258 GAEDELKHAVGLVRPVSVAFEVVHEFRFYKKGVFTSNTCGNTPMDVNHAVLAVGYGVEDD 317

Query: 403 NKYWLIANSWNSDWGDNGFF 462
           YWLI NSW +WGDNG+F
Sbjct: 318 VPYWLIKNSWGGEWGDNGYF 337

>At5g60360.1 68418.m07568 cysteine proteinase, putative / AALP protein (AALP) identical to AALP protein GI:7230640 from [Arabidopsis thaliana]; similar to barley aleurain
          Length = 358

 Score = 70.1 bits (164), Expect = 7e-13
 Identities = 34/80 (42%), Positives = 44/80 (55%), Gaps = 3/80 (3%)
 Frame = +1

Query: 232 GHEDHIKAELFKNGPVEAAFTVYSDLLSYKNGVY--KHTEGNALG-GHAIKIIGWGVENN 402
           G ED +K + PV AF V YK+GVY H + HA+ +G+GVE+
Sbjct: 258 GAEDELKHAVGLVRPVSIAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDG 317

Query: 403 NKYWLIANSWNSDWGDNGFF 462
           YWLI NSW +DWGD G+F
Sbjct: 318 VPYWLIKNSWGADWGDKGYF 337

>At1g29110.1 68414.m03563 cysteine proteinase, putative contains similarity to cysteine protease SPCP1 GI:13491750 from [Ipomoea batatas]
          Length = 334

 Score = 58.8 bits (136), Expect = 2e-09
 Identities = 25/79 (31%), Positives = 37/79 (46%), Gaps = 1/79 (1%)
 Frame = +1

Query: 226 VSGHEDHIKAELFKNGPVEAAFTVYSDLLS-YKNGVYKHTEGNALGGHAIKIIGWGVENN 402
           V H + E + PV +D YK GVY + HA+ I+G+G +
Sbjct: 231 VPSHNERALLEAVRRQPVSVLIDARADSFGHYKGGVYAGLDCGTDVNHAVTIVGYGTMSG 290

Query: 403 NKYWLIANSWNSDWGDNGF 459
           YW++ NSW WG+NG+
Sbjct: 291 LNYWVLKNSWGESWGENGY 309

>At3g48340.1 68416.m05276 cysteine proteinase, putative similar to cysteine endopeptidase precursor [Ricinus communis] GI:2944446; contains Pfam profile PF00112: Papain family cysteine protease
          Length = 351

 Score = 58.4 bits (135), Expect = 2e-09
 Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 7/86 (8%)
 Frame = +1

Query: 223 SVSGHED---HIKAELFK---NGPVEAAFTV-YSDLLSYKNGVYKHTEGNALGGHAIKII 381
           ++ GHED + + L K N PV A SD Y GV+ + G L H + +
Sbjct: 225 TIDGHEDVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELN-HGVAAV 283

Query: 382 GWGVENNNKYWLIANSWNSDWGDNGF 459
           G+G E KYW++ NSW ++WG+ G+
Sbjct: 284 GYGSERGKKYWIVRNSWGAEWGEGGY 309

>At3g48350.1 68416.m05277 cysteine proteinase, putative similar to cysteine endopeptidase precursor [Ricinus communis] GI:2944446; contains Pfam profile PF00112: Papain family cysteine protease
          Length = 364

 Score = 57.2 bits (132), Expect = 5e-09
 Identities = 33/102 (32%), Positives = 51/102 (50%), Gaps = 8/102 (7%)
 Frame = +1

Query: 178 NVPFKKDKRYGKHVYSVSGHE---DHIKAELFK---NGPVEAAFTV-YSDLLSYKNGVYK 336
           +V F + G ++ GHE ++ + EL K + PV A SD Y GV+
Sbjct: 219 DVQFCRANSIGGETVTIDGHEHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSEGVFI 278

Query: 337 HTEGNALGGHAIKIIGWG-VENNNKYWLIANSWNSDWGDNGF 459
           G L H + I+G+G +N KYW++ NSW +WG+ G+
Sbjct: 279 GECGTQLN-HGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGY 319

>At3g43960.1 68416.m04706 cysteine proteinase, putative contains similarity to cysteine proteinase RD21A (thiol protease) GI:435619, SP:P43297 from [Arabidopsis thaliana]
          Length = 376

 Score = 57.2 bits (132), Expect = 5e-09
 Identities = 22/54 (40%), Positives = 34/54 (62%), Gaps = 1/54 (1%)
 Frame = +1

Query: 301 SDLLSYKNGVYKHTEGNALGGHAIKIIGWGVENNN-KYWLIANSWNSDWGDNGF 459
           +++ YK+GVYK N G H + I+G+G ++ YWLI NSW +WG+ G+
Sbjct: 269 ANMSDYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGY 322

>At5g50260.1 68418.m06224 cysteine proteinase, putative similar to cysteine endopeptidase precursor CysEP GI:2944446 from [Ricinus communis]
          Length = 361

 Score = 56.8 bits (131), Expect = 7e-09
 Identities = 31/89 (34%), Positives = 46/89 (51%), Gaps = 8/89 (8%)
 Frame = +1

Query: 217 VYSVSGHED---HIKAELFK---NGPVEAAFTVY-SDLLSYKNGVYKHTEGNALGGHAIK 375
           V S+ GHED + + +L K N PV A SD Y GV+ G L H +
Sbjct: 231 VVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGVFTGRCGTELN-HGVA 289

Query: 376 IIGWGVE-NNNKYWLIANSWNSDWGDNGF 459
           ++G+G + KYW++ NSW +WG+ G+
Sbjct: 290 VVGYGTTIDGTKYWIVKNSWGEEWGEKGY 318

>At3g19390.1 68416.m02459 cysteine proteinase, putative / thiol protease, putative contains similarity to cysteine proteinase RD21A (thiol protease) GI:435619, SP:P43297 from [Arabidopsis thaliana]
          Length = 452

 Score = 56.0 bits (129), Expect = 1e-08
 Identities = 34/103 (33%), Positives = 53/103 (51%), Gaps = 7/103 (6%)
 Frame = +1

Query: 175 VNVPFKKDKRYGKHVYSVSGHED---HIKAELFK---NGPVEAAFTVYSDLLS-YKNGVY 333
           VNV DK+ V ++ G+ED + + L K N P+ A Y +GV+
Sbjct: 223 VNV-CNSDKK-NTRVVTIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVF 280

Query: 334 KHTEGNALGGHAIKIIGWGVENNNKYWLIANSWNSDWGDNGFF 462
           T G +L H + +G+G E YW++ NSW S+WG++G+F
Sbjct: 281 TGTCGTSLD-HGVVAVGYGSEGGQDYWIVRNSWGSNWGESGYF 322

>At1g06260.1 68414.m00662 cysteine proteinase, putative contains similarity to thiol-protease, pre-pro-TPE4A protein GI:3688528 [Pisum sativum]
          Length = 343

 Score = 55.6 bits (128), Expect = 2e-08
 Identities = 19/48 (39%), Positives = 32/48 (66%)
 Frame = +1

Query: 316 YKNGVYKHTEGNALGGHAIKIIGWGVENNNKYWLIANSWNSDWGDNGF 459
           Y +GV+ + G L H + ++G+GVE + KYW++ NSW + WG+ G+
Sbjct: 272 YSSGVFTNYCGTNLN-HGVTVVGYGVEGDQKYWIVKNSWGTGWGEEGY 318

>At3g19400.1 68416.m02461 cysteine proteinase, putative non-consensus AT acceptor site at exon 3; contains similarity to cysteine protease CYP1 GI:2828252, TDI-65 GI:5726641 from [Lycopersicon esculentum]
          Length = 362

 Score = 55.2 bits (127), Expect = 2e-08
 Identities = 30/95 (31%), Positives = 47/95 (49%), Gaps = 7/95 (7%)
 Frame = +1

Query: 196 DKRYGKHVYSVSGHEDHIKAE------LFKNGPVEAAFTVYSDLLS-YKNGVYKHTEGNA 354
           DK V ++ G+ED + + + PV A S YK+GV T G +
Sbjct: 231 DKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKSGVMTGTCGIS 290

Query: 355 LGGHAIKIIGWGVENNNKYWLIANSWNSDWGDNGF 459
           L H + ++G+G + YW+I NSW +WGD+G+
Sbjct: 291 LD-HGVVVVGYGSTSGEDYWIIRNSWGLNWGDSGY 324

>At3g49340.1 68416.m05394 cysteine proteinase, putative contains PS00640: Eukaryotic thiol (cysteine) proteases asparagine active site; similar to cysteine proteinase GI:535454 from [Alnus glutinosam]
          Length = 341

 Score = 54.0 bits (124), Expect = 5e-08
 Identities = 22/53 (41%), Positives = 32/53 (60%), Gaps = 1/53 (1%)
 Frame = +1

Query: 304 DLLSYKNGVYKHTEGNALGGHAIKIIGWGV-ENNNKYWLIANSWNSDWGDNGF 459
           + + Y G++ G L HA+ I+G+GV E KYWL+ NSW WG+NG+
Sbjct: 265 EFIHYSGGIFNGECGTQLT-HAVTIVGYGVSEEGIKYWLLKNSWGESWGENGY 316

>At4g23520.1 68417.m03390 cysteine proteinase, putative contains similarity to cysteine proteinase (thiol protease) RD21A GI:435619, SP:P43297 from [Arabidopsis thaliana]
          Length = 356

 Score = 53.2 bits (122), Expect = 8e-08
 Identities = 20/52 (38%), Positives = 32/52 (61%)
 Frame = +1

Query: 304 DLLSYKNGVYKHTEGNALGGHAIKIIGWGVENNNKYWLIANSWNSDWGDNGF 459
           + + Y++ +Y G L HA+ I+G+G EN YW++ NSW + WGD G+
Sbjct: 275 EFMLYRSCIYNGPCGTNLD-HALVIVGYGSENGQDYWIVRNSWGTTWGDAGY 325

>At2g21430.1 68415.m02550 cysteine proteinase A494, putative / thiol protease, putative identical to SP:P43295 Probable cysteine proteinase A494 precursor [Arabidopsis thaliana]; strong similarity to cysteine proteinase RD19A (thiol protease) GI:435618, SP:P43296 from [Arabidopsis thaliana]
          Length = 361

 Score = 52.8 bits (121), Expect = 1e-07
 Identities = 29/86 (33%), Positives = 44/86 (51%), Gaps = 7/86 (8%)
 Frame = +1

Query: 226 VSGHEDHIKAELFKNGPVEAAFTVYSDLLSYKNGVYKHTEGNALGGHAIKIIGWGVENNN 405
           VS +ED I A L KNGP+ A + + +Y GV + H + ++G+G +
Sbjct: 254 VSINEDQIAANLIKNGPLAVAINA-AYMQTYIGGVSCPYICSRRLNHGVLLVGYGSAGFS 312

Query: 406 K-------YWLIANSWNSDWGDNGFF 462
           + YW+I NSW WG+NGF+
Sbjct: 313 QARLKEKPYWIIKNSWGESWGENGFY 338

>At2g27420.1 68415.m03314 cysteine proteinase, putative contains similarity to cysteine protease SPCP1 GI:13491750 from [Ipomoea batatas]
          Length = 348

 Score = 51.6 bits (118), Expect = 3e-07
 Identities = 21/49 (42%), Positives = 30/49 (61%), Gaps = 1/49 (2%)
 Frame = +1

Query: 316 YKNGVYKHTEGNALGGHAIKIIGWGV-ENNNKYWLIANSWNSDWGDNGF 459
           Y GV+ G L HA+ I+G+G+ E KYW++ NSW WG+NG+
Sbjct: 276 YSGGVFNGECGTDLH-HAVTIVGYGMSEEGTKYWVVKNSWGETWGENGY 323

>At2g34080.1 68415.m04172 cysteine proteinase, putative contains similarity to cysteine protease SPCP1 GI:13491750 from [Ipomoea batatas]
          Length = 345

 Score = 51.2 bits (117), Expect = 3e-07
 Identities = 24/81 (29%), Positives = 37/81 (45%), Gaps = 2/81 (2%)
 Frame = +1

Query: 223 SVSGHEDHIKAELFKNGPVEAAFTVYSD-LLSYKNGVYKHTEGNALGGHAIKIIGWGV-E 396
           +V + + E PV + D + Y GVY G + HA+ +G+G +
Sbjct: 241 TVPSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTS-SNHAVTFVGYGTSQ 299

Query: 397 NNNKYWLIANSWNSDWGDNGF 459
           + KYWL NSW WG+ G+
Sbjct: 300 DGTKYWLAKNSWGETWGEKGY 320

>At1g47128.1 68414.m05222 cysteine proteinase (RD21A) / thiol protease identical to SP|P43297 Cysteine proteinase RD21A precursor (EC 3.4.22.-) {Arabidopsis thaliana}, thiol protease RD21A SP:P43297 from [Arabidopsis thaliana]
          Length = 462

 Score = 51.2 bits (117), Expect = 3e-07
 Identities = 17/48 (35%), Positives = 29/48 (60%)
 Frame = +1

Query: 316 YKNGVYKHTEGNALGGHAIKIIGWGVENNNKYWLIANSWNSDWGDNGF 459
           Y +G++ + G L H + +G+G EN YW++ NSW WG++G+
Sbjct: 282 YDSGIFDGSCGTQLD-HGVVAVGYGTENGKDYWIVRNSWGKSWGESGY 328

>At5g43060.1 68418.m05256 cysteine proteinase, putative / thiol protease, putative similar to cysteine proteinase RD21A precursor (thiol protease) GI:435619, SP:P43297 from [Arabidopsis thaliana]
          Length = 463

 Score = 50.8 bits (116), Expect = 4e-07
 Identities = 18/48 (37%), Positives = 29/48 (60%)
 Frame = +1

Query: 316 YKNGVYKHTEGNALGGHAIKIIGWGVENNNKYWLIANSWNSDWGDNGF 459
           Y +GV+ G L H + +G+G EN YW++ NSW + WG++G+
Sbjct: 283 YSSGVFDGLCGTELD-HGVVAVGYGTENGKDYWIVRNSWGNRWGESGY 329

>At1g29080.1 68414.m03560 peptidase C1A papain family protein contains similarity to cysteine protease SPCP1 GI:13491750 from [Ipomoea batatas]; contains Pfam profile PF00112: Papain family cysteine protease
          Length = 346

 Score = 50.8 bits (116), Expect = 4e-07
 Identities = 24/81 (29%), Positives = 35/81 (43%), Gaps = 2/81 (2%)
 Frame = +1

Query: 223 SVSGHEDHIKAELFKNGPVEAAFTVY-SDLLSYKNGVYKHTEGNALGGHAIKIIGWGVEN 399
           +V + + E PV A + + Y GVY HA+ ++G+G
Sbjct: 241 NVPSNNERALLEAVSRQPVAVAIDASEAGFVHYSGGVYNARNCGTSVNHAVTLVGYGTSP 300

Query: 400 NN-KYWLIANSWNSDWGDNGF 459
           KYWL NSW WG+NG+
Sbjct: 301 EGMKYWLAKNSWGKTWGENGY 321

>At4g36880.1 68417.m05229 cysteine proteinase, putative strong similarity to cysteine proteinase COT44 precursor SP:P25251 from [Brassica napus] (Rape)
          Length = 376

 Score = 49.6 bits (113), Expect = 1e-06
 Identities = 18/48 (37%), Positives = 30/48 (62%)
 Frame = +1

Query: 316 YKNGVYKHTEGNALGGHAIKIIGWGVENNNKYWLIANSWNSDWGDNGF 459
           Y++G++ + G L HA+ +G+G EN YW++ NSW WG+ G+
Sbjct: 290 YQSGIFTGSCGTNLD-HAVVAVGYGSENGVDYWIVRNSWGPRWGEEGY 336

>At4g39090.1 68417.m05535 cysteine proteinase RD19a (RD19A) / thiol protease identical to cysteine proteinase RD19a, thiol protease SP:P43296, GI:435618 from [Arabidopsis thaliana]
          Length = 368

 Score = 49.2 bits (112), Expect = 1e-06
 Identities = 27/86 (31%), Positives = 39/86 (45%), Gaps = 7/86 (8%)
 Frame = +1

Query: 226 VSGHEDHIKAELFKNGPVEAAFTVYSDLLSYKNGVYKHTEGNALGGHAIKIIGWGVEN-- 399
           +S E+ I A L KNGP+ A + +Y GV H + ++G+G
Sbjct: 257 ISIDEEQIAANLVKNGPLAVAINA-GYMQTYIGGVSCPYICTRRLNHGVLLVGYGAAGYA 315

Query: 400 -----NNKYWLIANSWNSDWGDNGFF 462
           YW+I NSW WG+NGF+
Sbjct: 316 PARFKEKPYWIIKNSWGETWGENGFY 341

>At1g09850.1 68414.m01109 cysteine protease, papain-like (XBCP3) identical to papain-like cysteine peptidase XBCP3 GI:14600257 from [Arabidopsis thaliana]; contains Pfam profiles PF00112: Papain family cysteine protease and PF00396: Granulin
          Length = 437

 Score = 49.2 bits (112), Expect = 1e-06
 Identities = 18/48 (37%), Positives = 29/48 (60%)
 Frame = +1

Query: 316 YKNGVYKHTEGNALGGHAIKIIGWGVENNNKYWLIANSWNSDWGDNGF 459
           Y +G++ +L HA+ I+G+G +N YW++ NSW WG +GF
Sbjct: 263 YSSGIFSGPCSTSLD-HAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGF 309

>At4g11320.1 68417.m01828 cysteine proteinase, putative contains similarity to cysteine proteinase RD21A (thiol protease) GI:435619, SP:P43297 from [Arabidopsis thaliana]
          Length = 371

 Score = 48.4 bits (110), Expect = 2e-06
 Identities = 18/48 (37%), Positives = 29/48 (60%)
 Frame = +1

Query: 316 YKNGVYKHTEGNALGGHAIKIIGWGVENNNKYWLIANSWNSDWGDNGF 459
           Y++GV+ T G L H + ++G+G EN YW++ NS WG+ G+
Sbjct: 289 YESGVFDGTCGTNLN-HGVVVVGYGTENGRDYWIVKNSRGDTWGEAGY 335

>At1g29090.1 68414.m03561 peptidase C1A papain family protein contains similarity to cysteine protease SPCP1 GI:13491750 from [Ipomoea batatas]; contains Pfam profile PF00112: Papain family cysteine protease
          Length = 355

 Score = 48.0 bits (109), Expect = 3e-06
 Identities = 19/51 (37%), Positives = 25/51 (49%), Gaps = 1/51 (1%)
 Frame = +1

Query: 310 LSYKNGVYKHTEGNALGGHAIKIIGWGVENNN-KYWLIANSWNSDWGDNGF 459
           + Y GVY HA+ +G+G KYWL NSW WG+NG+
Sbjct: 280 MHYSGGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGY 330

>At5g45890.1 68418.m05644 senescence-specific SAG12 protein (SAG12) / cysteine proteinase, putative identical to senescence-specific protein SAG12 GI:1046373 from [Arabidopsis thaliana]
          Length = 346

 Score = 47.6 bits (108), Expect = 4e-06
 Identities = 21/53 (39%), Positives = 31/53 (58%), Gaps = 1/53 (1%)
 Frame = +1

Query: 304 DLLSYKNGVYKHTEGNALGGHAIKIIGWGVENN-NKYWLIANSWNSDWGDNGF 459
           D Y +GV+ E HA+ IG+G N +KYW+I NSW + WG++G+
Sbjct: 270 DFQFYSSGVFTG-ECTTYLDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGY 321

>At4g11310.1 68417.m01827 cysteine proteinase, putative contains similarity to cysteine proteinase RD21A (thiol protease) GI:435619, SP:P43297 from [Arabidopsis thaliana]
          Length = 364

 Score = 46.8 bits (106), Expect = 7e-06
 Identities = 18/48 (37%), Positives = 29/48 (60%)
 Frame = +1

Query: 316 YKNGVYKHTEGNALGGHAIKIIGWGVENNNKYWLIANSWNSDWGDNGF 459
           Y++GV+ + G L H + ++G+G EN YWL+ NS WG+ G+
Sbjct: 282 YESGVFDGSCGTNLN-HGVVVVGYGTENGRDYWLVKNSRGITWGEAGY 328

>At1g03720.1 68414.m00352 cathepsin-related contains weak similarity to Cathepsin L precursor (EC 3.4.22.15) (Major excreted protein) (MEP) (Cyclic protein-2) (CP-2) (Swiss-Prot:P07154) [Rattus norvegicus]
          Length = 274

 Score = 46.8 bits (106), Expect = 7e-06
 Identities = 20/40 (50%), Positives = 27/40 (67%), Gaps = 1/40 (2%)
 Frame = +1

Query: 343 EGNALGGHAIKIIGWGVENNNK-YWLIANSWNSDWGDNGF 459
           +G+A GGH + I+G+G NK ++LI NSW DWG GF
Sbjct: 216 DGDA-GGHVVLIVGYGYTKENKLFFLIQNSWGEDWGVKGF 254

>At4g35350.1 68417.m05023 cysteine endopeptidase, papain-type (XCP1) identical to papain-type cysteine endopeptidase XCP1 GI:6708181 from [Arabidopsis thaliana]
          Length = 355

 Score = 45.2 bits (102), Expect = 2e-05
 Identities = 23/79 (29%), Positives = 35/79 (44%), Gaps = 1/79 (1%)
 Frame = +1

Query: 226 VSGHEDHIKAELFKNGPVEAAFTVYS-DLLSYKNGVYKHTEGNALGGHAIKIIGWGVENN 402
           V ++D + + PV A D YK GV+ G L H + +G+G
Sbjct: 251 VPENDDESLVKALAHQPVSVAIEASGRDFQFYKGGVFNGKCGTDLD-HGVAAVGYGSSKG 309

Query: 403 NKYWLIANSWNSDWGDNGF 459
           + Y ++ NSW WG+ GF
Sbjct: 310 SDYVIVKNSWGPRWGEKGF 328

>At4g16190.1 68417.m02457 cysteine proteinase, putative contains similarity to papain-like cysteine proteinase isoform I GI:7381219 from [Ipomoea batatas]
          Length = 373

 Score = 45.2 bits (102), Expect = 2e-05
 Identities = 25/86 (29%), Positives = 41/86 (47%), Gaps = 7/86 (8%)
 Frame = +1

Query: 226 VSGHEDHIKAELFKNGPVEAAFTVYSDLLSYKNGVYKHTEGNALGGHAIKIIGWGVEN-- 399
           VS ED I A L ++GP+ A + +Y GV + H + ++G+G
Sbjct: 262 VSSDEDQIAANLVQHGPLAIAINAMW-MQTYIGGVSCPYVCSKSQDHGVLLVGFGSSGYA 320

Query: 400 -----NNKYWLIANSWNSDWGDNGFF 462
           YW+I NSW + WG++G++
Sbjct: 321 PIRLKEKPYWIIKNSWGAMWGEHGYY 346

>At3g54940.3 68416.m06091 cysteine proteinase, putative contains similarity to cysteine proteinase GI:479060 from [Glycine max]
          Length = 368

 Score = 42.7 bits (96), Expect = 1e-04
 Identities = 23/83 (27%), Positives = 40/83 (48%), Gaps = 8/83 (9%)
 Frame = +1

Query: 238 EDHIKAELFKNGPVEAAFTVYSDLLSYKNGVYKHTEGNALG-GHAIKIIGWGVE------ 396
           E+ I A L ++GP+ + +Y GV + H + ++G+G +
Sbjct: 263 ENQIAANLVRHGPLAVGLNAVF-MQTYIGGVSCPLICSKRNVNHGVLLVGYGSKGFSILR 321

Query: 397 -NNNKYWLIANSWNSDWGDNGFF 462
           +N YW+I NSW WG+NG++
Sbjct: 322 LSNKPYWIIKNSWGKKWGENGYY 344

>At5g17140.1 68418.m02008 cysteine proteinase-related low similarity to cysteine proteinase [Sitophilus zeamais] GI:2804262
          Length = 112

 Score = 39.1 bits (87), Expect = 0.001
 Identities = 20/78 (25%), Positives = 42/78 (53%), Gaps = 5/78 (6%)
 Frame = +1

Query: 241 DHIKAELFKNGPVEAAFT---VYSDLLSYKNGVYKHTEGNA-LGGHAIKIIGWGVENNNK 408
           + I+ L GP+ + ++ + +K G+Y E + HA+ I+G+G ++K
Sbjct: 17  EEIRGLLLTQGPIGISVDLCGIFRQVYEFK-GIYVLPEPKENMERHALIIVGFGTTKDSK 75

Query: 409 -YWLIANSWNSDWGDNGF 459
           ++++ N+W + WG NG+
Sbjct: 76  LFFIVQNTWGTKWGFNGY 93

>At1g20850.1 68414.m02612 cysteine endopeptidase, papain-type (XCP2) identical to papain-type cysteine endopeptidase XCP2 GI:6708183 from [Arabidopsis thaliana]
          Length = 356

 Score = 37.9 bits (84), Expect = 0.003
 Identities = 15/48 (31%), Positives = 24/48 (50%)
 Frame = +1

Query: 316 YKNGVYKHTEGNALGGHAIKIIGWGVENNNKYWLIANSWNSDWGDNGF 459
           Y GV+ G L H + +G+G + Y ++ NSW WG+ G+
Sbjct: 283 YSGGVFDGRCGVDLD-HGVAAVGYGSSKGSDYIIVKNSWGPKWGEKGY 329

>At5g43900.1 68418.m05368 myosin heavy chain (MYA2) nearly identical to PIR|S51824 myosin heavy chain MYA2 [Arabidopsis thaliana]
          Length = 1505

 Score = 32.3 bits (70), Expect = 0.17
 Identities = 24/79 (30%), Positives = 39/79 (49%), Gaps = 6/79 (7%)
 Frame = +1

Query: 211  KHVYSVSGHEDHIKAEL-FKNGPVEAAFTVYSDLLSYKNGVYKHTEGNALGGHAIKIIGW 387
            + + +V D +K + F NG AAFT+Y LL +K ++ + N I++IG
Sbjct: 1100 RQIMNVDALIDCVKDNIGFSNGKPVAAFTIYKCLLHWK--CFESEKTNVF-DRLIQMIGS 1156

Query: 388  GVENNN-----KYWLIANS 429
            +EN + YWL + S
Sbjct: 1157 AIENEDDNSHLAYWLTSTS 1175

>At4g13345.2 68417.m02086 TMS membrane family protein / tumour differentially expressed (TDE) family protein contains Pfam domain, PF03348: TMS membrane protein/tumour differentially expressed protein (TDE)
          Length = 394

 Score = 31.9 bits (69), Expect = 0.22
 Identities = 12/47 (25%), Positives = 20/47 (42%)
 Frame = +1

Query: 304 DLLSYKNGVYKHTEGNALGGHAIKIIGWGVENNNKYWLIANSWNSDW 444
           D + Y G + A+ ++GW + ++ K W I W S W
Sbjct: 319 DAIPYGYGFFHFVFATGAMYFAMLLVGWNIHHSMKKWTIDVGWTSTW 365

>At4g13345.1 68417.m02085 TMS membrane family protein / tumour differentially expressed (TDE) family protein contains Pfam domain, PF03348: TMS membrane protein/tumour differentially expressed protein (TDE)
          Length = 394

 Score = 31.9 bits (69), Expect = 0.22
 Identities = 12/47 (25%), Positives = 20/47 (42%)
 Frame = +1

Query: 304 DLLSYKNGVYKHTEGNALGGHAIKIIGWGVENNNKYWLIANSWNSDW 444
           D + Y G + A+ ++GW + ++ K W I W S W
Sbjct: 319 DAIPYGYGFFHFVFATGAMYFAMLLVGWNIHHSMKKWTIDVGWTSTW 365

>At1g04160.1 68414.m00406 myosin family protein contains Pfam profiles: PF02736 myosin N-terminal SH3-like domain, PF00063 myosin head (motor domain), PF00612 IQ calmodulin-binding motif, PF01843: DIL domain
          Length = 1500

 Score = 29.5 bits (63), Expect = 1.2
 Identities = 23/69 (33%), Positives = 33/69 (47%), Gaps = 6/69 (8%)
 Frame = +1

Query: 241  DHIKAEL-FKNGPVEAAFTVYSDLLSYKNGVYKHTEGNALGGHAIKIIGWGVENNN---- 405
            D +K + F NG AAFT+Y LL +K +E + I++IG +EN +
Sbjct: 1108 DCVKENIGFSNGKPIAAFTIYKCLLHWK---CFESEKTSAFDRLIEMIGSAIENEDDNGH 1164

Query: 406  -KYWLIANS 429
            YWL S
Sbjct: 1165 LAYWLTNTS 1173

>At2g27395.1 68415.m03308 cysteine protease-related contains similarity to senescence-specific cysteine protease GI:5823018 from [Brassica napus]
          Length = 86

 Score = 29.1 bits (62), Expect = 1.5
 Identities = 11/22 (50%), Positives = 14/22 (63%)
 Frame = +1

Query: 394 ENNNKYWLIANSWNSDWGDNGF 459
           E KYW++ NS WG+NGF
Sbjct: 52  EEGTKYWMVKNS----WGENGF 69

>At1g10170.1 68414.m01147 NF-X1 type zinc finger family protein contains Pfam PF01422: NF-X1 type zinc finger; similar to transcriptional repressor NF-X1 (SP:Q12986) [Homo sapiens]; similar to EST gb|T21002
          Length = 1188

 Score = 28.7 bits (61), Expect = 2.0
 Identities = 10/31 (32%), Positives = 13/31 (41%)
 Frame = +3

Query: 69  CRPYEIPPCEHHVPGNRMPCNGDTKTPKCQK 161
           C P PPC+ P PC T +C +
Sbjct: 343 CHPGPCPPCKAFAPPRSCPCGKKMVTTRCSE 373

>At5g05050.1 68418.m00536 peptidase C1A papain family protein weak similarity to berghepain-2 [Plasmodium berghei] GI:17978639; contains Pfam profile PF00112: Papain family cysteine protease
          Length = 299

 Score = 28.3 bits (60), Expect = 2.7
 Identities = 9/16 (56%), Positives = 11/16 (68%)
 Frame = +1

Query: 409 YWLIANSWNSDWGDNG 456
           YWLI NS+ WG+ G
Sbjct: 235 YWLIQNSYGEAWGEKG 250

>At3g24460.1 68416.m03069 TMS membrane family protein / tumour differentially expressed (TDE) family protein contains Pfam domain, PF03348: TMS membrane protein/tumour differentially expressed protein (TDE)
          Length = 409

 Score = 28.3 bits (60), Expect = 2.7
 Identities = 10/26 (38%), Positives = 13/26 (50%)
 Frame = +1

Query: 367 AIKIIGWGVENNNKYWLIANSWNSDW 444
           A+ +IGW + K W I W S W
Sbjct: 351 AMLLIGWNTHHPMKKWTIDVGWTSTW 376

>At5g39680.1 68418.m04805 pentatricopeptide (PPR) repeat-containing protein contains INTERPRO:IPR002885 PPR repeats
          Length = 710

 Score = 27.9 bits (59), Expect = 3.6
 Identities = 15/35 (42%), Positives = 17/35 (48%)
 Frame = -3

Query: 297 YSKSGFDRTILKQFSFDMVFMSRYRINMFTVSFVF 193
           Y SGFD +LK F M F R N F + VF
Sbjct: 110 YQNSGFDFEVLKLFK-SMFFSGESRPNEFVATVVF 143

>At2g44260.2 68415.m05508 expressed protein
          Length = 583

 Score = 27.9 bits (59), Expect = 3.6
 Identities = 15/35 (42%), Positives = 21/35 (60%)
 Frame = +1

Query: 235 HEDHIKAELFKNGPVEAAFTVYSDLLSYKNGVYKH 339
           HED + +L K PV+AAF S L ++ G+Y H
Sbjct: 12  HED-VSKKLPKALPVDAAFKFPSPLPTFTRGLYYH 45

>At3g29060.1 68416.m03635 EXS family protein / ERD1/XPR1/SYG1 family protein similar to PHO1 protein [Arabidopsis thaliana] GI:20069032; contains Pfam profiles PF03105: SPX domain, PF03124: EXS family
          Length = 794

 Score = 27.1 bits (57), Expect = 6.2
 Identities = 12/28 (42%), Positives = 17/28 (60%)
 Frame = -3

Query: 300 VYSKSGFDRTILKQFSFDMVFMSRYRIN 217
           +YS GF L ++ D+ F SRYR+N
Sbjct: 434 LYSLFGFVAVHLFMYAADIYFWSRYRVN 461

>At2g36310.1 68415.m04457 inosine-uridine preferring nucleoside hydrolase family protein similar to Chain A, Crystal Structure Of Nucleoside Hydrolase From Leishmania MajorGI:8569431; contains Pfam profile PF01156: Inosine-uridine preferring nucleoside hydrolase
          Length = 336

 Score = 27.1 bits (57), Expect = 6.2
 Identities = 21/75 (28%), Positives = 36/75 (48%), Gaps = 9/75 (12%)
 Frame = +1

Query: 244 HIKAE----LFKNGPVEAAFTVYSDLLSYKNGVYK-HTEGNALGGHAIKIIGWGVENNNK 408
           H+K++ ++ + PV V DL +YK GV + T+G + GH + G N +
Sbjct: 248 HVKSDGVYGVYLHDPVSFVAVVRPDLFTYKKGVVRVETQGICV-GHTLMDQGLKRWNGSN 306

Query: 409 YWL----IANSWNSD 441
           W+ I+ +W D
Sbjct: 307 PWVGYSPISVAWTVD 321

>At1g61460.1 68414.m06925 S-locus protein kinase, putative contains similarity to KI domain interacting kinase 1 [Zea mays] gi|2735017|gb|AAB93834; contains S-locus glycoprotein family domain, Pfam:PF00954
          Length = 598

 Score = 27.1 bits (57), Expect = 6.2
 Identities = 11/30 (36%), Positives = 15/30 (50%)
 Frame = +3

Query: 6   AWEYWKHVGLVSGGNYNSSQGCRPYEIPPC 95
           AWE W G V + + + CRP E+ C
Sbjct: 502 AWESWCETGGVDLLDKDVADSCRPLEVERC 531

>At1g43700.1 68414.m05020 VirE2-interacting protein (VIP1) identical to VirE2-interacting protein VIP1 GB:AAF37279 GI:7258340 from [Arabidopsis thaliana]
          Length = 341

 Score = 27.1 bits (57), Expect = 6.2
 Identities = 21/81 (25%), Positives = 39/81 (48%), Gaps = 1/81 (1%)
 Frame = +1

Query: 124 PVTVILKHQNAKRTVNLVNVPFKKDKRYGKHVYSVSGHEDHIKAELFKN-GPVEAAFTVY 300
           P++V + ++ V ++P K + R+G+HV S S + ++ F + G E F
Sbjct: 80  PMSVDSEETSSNGVVPPNSLPPKPEARFGRHVRSFS-----VDSDFFDDLGVTEEKFIAT 134

Query: 301 SDLLSYKNGVYKHTEGNALGG 363
           S K G + H+ N++ G
Sbjct: 135 SS-GEKKKGNHHHSRSNSMDG 154

>At2g33240.1 68415.m04072 myosin, putative similar to myosin (GI:433663) [Arabidopsis thaliana]; myosin my5A (SP:Q02440) {Gallus gallus}
          Length = 1770

 Score = 26.6 bits (56), Expect = 8.3
 Identities = 19/61 (31%), Positives = 27/61 (44%), Gaps = 5/61 (8%)
 Frame = +1

Query: 262  FKNGPVEAAFTVYSDLLSYKNGVYKHTEGNALGGHAIKIIGWGVEN-----NNKYWLIAN 426
            F +G AAFT+Y L+ +K E ++ + I G +EN N YWL
Sbjct: 1317 FSHGKPVAAFTIYKCLIHWK---LFEAEKTSVFDRIVPIFGSAIENPEDDSNLAYWLTNT 1373

Query: 427  S 429
            S
Sbjct: 1374 S 1374

  Database: arabidopsis
    Posted date:  Oct 4, 2007 10:56 AM
  Number of letters in database: 12,070,560
  Number of sequences in database:  28,952

Lambda     K      H
   0.318    0.134    0.401

Gapped
Lambda     K      H
   0.279   0.0580    0.190

Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 11,059,684
Number of Sequences: 28952
Number of extensions: 236371
Number of successful extensions: 646
Number of sequences better than 10.0: 50
Number of HSP's better than 10.0 without gapping: 625
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 634
length of database: 12,070,560
effective HSP length: 75
effective length of database: 9,899,160
effective search space used: 782033640
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)