BLASTX 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= S06A01NCLL0001_H21
         (515 letters)

Database: arabidopsis
           28,952 sequences; 12,070,560 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

At3g45310.1 68416.m04892 cysteine proteinase, putative similar t...  120   8e-28
At5g60360.1 68418.m07568 cysteine proteinase, putative / AALP pr...  115   2e-26
At5g43060.1 68418.m05256 cysteine proteinase, putative / thiol p...  112   2e-25
At1g47128.1 68414.m05222 cysteine proteinase (RD21A) / thiol pro...  109   1e-24
At4g36880.1 68417.m05229 cysteine proteinase, putative strong si...  107   4e-24
At1g09850.1 68414.m01109 cysteine protease, papain-like (XBCP3) ...  106   8e-24
At3g48340.1 68416.m05276 cysteine proteinase, putative similar t...  105   2e-23
At3g19390.1 68416.m02459 cysteine proteinase, putative / thiol p...  102   2e-22
At1g20850.1 68414.m02612 cysteine endopeptidase, papain-type (XC...  102   2e-22
At3g19400.1 68416.m02461 cysteine proteinase, putative non-conse...  101   4e-22
At1g29080.1 68414.m03560 peptidase C1A papain family protein con...  100   7e-22
At5g45890.1 68418.m05644 senescence-specific SAG12 protein (SAG1...   99   2e-21
At4g35350.1 68417.m05023 cysteine endopeptidase, papain-type (XC...   99   2e-21
At4g11310.1 68417.m01827 cysteine proteinase, putative contains ...   97   6e-21
At5g50260.1 68418.m06224 cysteine proteinase, putative similar t...   94   4e-20
At4g11320.1 68417.m01828 cysteine proteinase, putative contains ...   94   4e-20
At2g34080.1 68415.m04172 cysteine proteinase, putative contains ...   94   6e-20
At1g29090.1 68414.m03561 peptidase C1A papain family protein con...   94   6e-20
At3g49340.1 68416.m05394 cysteine proteinase, putative contains ...   93   8e-20
At4g23520.1 68417.m03390 cysteine proteinase, putative contains ...   90   7e-19
At3g48350.1 68416.m05277 cysteine proteinase, putative similar t...   90   7e-19
At1g06260.1 68414.m00662 cysteine proteinase, putative contains ...   90   7e-19
At1g29110.1 68414.m03563 cysteine proteinase, putative contains ...   89   2e-18
At3g54940.3 68416.m06091 cysteine proteinase, putative contains ...   85   3e-17
At2g21430.1 68415.m02550 cysteine proteinase A494, putative / th...   83   8e-17
At2g27420.1 68415.m03314 cysteine proteinase, putative contains ...   83   1e-16
At4g39090.1 68417.m05535 cysteine proteinase RD19a (RD19A) / thi...   82   2e-16
At3g43960.1 68416.m04706 cysteine proteinase, putative contains ...   78   3e-15
At4g16190.1 68417.m02457 cysteine proteinase, putative contains ...   73   1e-13
At4g01610.2 68417.m00211 cathepsin B-like cysteine protease, put...   67   8e-12
At4g01610.1 68417.m00210 cathepsin B-like cysteine protease, put...   67   8e-12
At1g02305.1 68414.m00175 cathepsin B-like cysteine protease, put...   64   5e-11
At1g02300.1 68414.m00173 cathepsin B-like cysteine protease, put...   64   5e-11
At3g19400.2 68416.m02460 cysteine proteinase, putative non-conse...   53   1e-07
At4g35350.2 68417.m05022 cysteine endopeptidase, papain-type (XC...   52   2e-07
At5g17140.1 68418.m02008 cysteine proteinase-related low similar...   48   3e-06
At2g27395.1 68415.m03308 cysteine protease-related contains simi...   47   7e-06
At1g03720.1 68414.m00352 cathepsin-related contains weak similar...   38   0.003
At5g17080.1 68418.m02001 cathepsin-related contains weak similar...   37   0.009
At1g61730.1 68414.m06962 DNA-binding storekeeper protein-related...   32   0.26
At5g05050.1 68418.m00536 peptidase C1A papain family protein wea...   30   1.1
At1g20670.1 68414.m02589 DNA-binding bromodomain-containing prot...   30   1.1
At5g06600.2 68418.m00746 ubiquitin-specific protease 12 (UBP12) ...   28   4.3
At5g06600.1 68418.m00745 ubiquitin-specific protease 12 (UBP12) ...   28   4.3
At3g26990.1 68416.m03377 expressed protein contains Pfam domain,...   27   5.7
At3g11910.1 68416.m01460 ubiquitin-specific protease, putative s...   27   5.7
At2g40930.1 68415.m05052 ubiquitin-specific protease 5, putative...   27   5.7
At1g53110.1 68414.m06014 expressed protein                            27   7.5
At4g18320.1 68417.m02717 hypothetical protein                         27   9.9
At3g19515.1 68416.m02473 expressed protein                            27   9.9
At1g12740.1 68414.m01479 cytochrome P450 family protein similar ...   27   9.9

>At3g45310.1 68416.m04892 cysteine proteinase, putative similar to AALP protein GI:7230640 from [Arabidopsis thaliana] and barley aleurain
Length = 358
Score = 120 bits (288), Expect = 8e-28
Identities = 62/153 (40%), Positives = 89/153 (58%), Gaps = 1/153 (0%)
Frame = +1

Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
GG +A+++I +G L TEE Y Y G+DG C ++ VN+T E+ LK
Sbjct: 207 GGLPSQAFEYIKYNGGLDTEEAYP-YTGKDGGCKFSAKNIGVQVRDSVNITLGAEDELKH 265

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
A+ P+SVA + H+ F FY GV+ C N +++HAVLAVGYGV + YWL+K
Sbjct: 266 AVGLVRPVSVAFEVVHE-FRFYKKGVFTSNTCGNTPMDVNHAVLAVGYGVEDDVPYWLIK 324

Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
NSW WG++GY M M +N CGV + +Y ++
Sbjct: 325 NSWGGEWGDNGYFKMEMGKNMCGVATCSSYPVV 357

>At5g60360.1 68418.m07568 cysteine proteinase, putative / AALP protein (AALP) identical to AALP protein GI:7230640 from [Arabidopsis thaliana]; similar to barley aleurain
Length = 358
Score = 115 bits (277), Expect = 2e-26
Identities = 58/153 (37%), Positives = 89/153 (58%), Gaps = 1/153 (0%)
Frame = +1

Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186
GG +A+++I +G L TE+ Y Y G+D C ++ VN+T E+ LK
Sbjct: 207 GGLPSQAFEYIKSNGGLDTEKAYP-YTGKDETCKFSAENVGVQVLNSVNITLGAEDELKH 265

Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366
A+ P+S+A + H +F Y +GVY + C + +++HAVLAVGYGV +G YWL+K
Sbjct: 266 AVGLVRPVSIAFEVIH-SFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIK 324

Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI 465
NSW WG+ GY M M +N CG+ + +Y ++
Sbjct: 325
NSWGADWGDKGYFKMEMGKNMCGIATCASYPVV 357 >At5g43060.1 68418.m05256 cysteine proteinase, putative / thiol protease, putative similar to cysteine proteinase RD21A precursor (thiol protease) GI:435619, SP:P43297 from [Arabidopsis thaliana] Length = 463 Score = 112 bits (269), Expect = 2e-25 Identities = 66/156 (42%), Positives = 90/156 (57%), Gaps = 6/156 (3%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTA-ITKITGWVNVTTNNENAL 180 GG D+ A+++I+K+G + TE DY Y DG C + A + I + +V N+E +L Sbjct: 203 GGLMDY-AFEFIIKNGGIDTEADYP-YKAADGRCDQNRKNAKVVTIDSYEDVPENSEASL 260 Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360 K AL H PISVAI+A + F YS+GV F+ C ELDH V+AVGYG NG YW+ Sbjct: 261 KKAL-AHQPISVAIEAGGRAFQLYSSGV-FDGLCGT---ELDHGVVAVGYGTENGKDYWI 315 Query: 361 VKNSWSNMWGNDGYVLMSMR----ENNCGVQSAPTY 456 V+NSW N WG GY+ M+ CG+ +Y Sbjct: 316 VRNSWGNRWGESGYIKMARNIEAPTGKCGIAMEASY 351 >At1g47128.1 68414.m05222 cysteine proteinase (RD21A) / thiol protease identical to SP|P43297 Cysteine proteinase RD21A precursor (EC 3.4.22.-) {Arabidopsis thaliana}, thiol protease RD21A SP:P43297 from [Arabidopsis thaliana] Length = 462 Score = 109 bits (262), Expect = 1e-24 Identities = 61/156 (39%), Positives = 90/156 (57%), Gaps = 6/156 (3%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYC-HIDNVTAITKITGWVNVTTNNENAL 180 GG D+ A+++I+K+G + T++DY Y G DG C I + I + +V T +E +L Sbjct: 202 GGLMDY-AFEFIIKNGGIDTDKDYP-YKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESL 259 Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360 K A+ H PIS+AI+A + F Y +G+ F+ C +LDH V+AVGYG NG YW+ Sbjct: 260 KKAV-AHQPISIAIEAGGRAFQLYDSGI-FDGSCGT---QLDHGVVAVGYGTENGKDYWI 314 Query: 361 VKNSWSNMWGNDGYVLMSMR----ENNCGVQSAPTY 456 V+NSW WG GY+ M+ CG+ P+Y Sbjct: 315 VRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSY 350 >At4g36880.1 68417.m05229 cysteine proteinase, putative strong similarity to cysteine proteinase COT44 precursor SP:P25251 from [Brassica napus] (Rape) Length = 376 Score = 107 bits (257), Expect = 4e-24 
Identities = 65/158 (41%), Positives = 93/158 (58%), Gaps = 8/158 (5%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCH--IDNVTAITKITGWVNVTTNNENA 177 GG D+ A+Q+IMK+G L TE+DY Y G G C+ + N + + I G+ +V T +E A Sbjct: 210 GGLMDY-AFQFIMKNGGLNTEKDYP-YRGFGGKCNSFLKN-SRVVSIDGYEDVPTKDETA 266 Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357 LK A+ + P+SVAI+A + F Y +G+ F C LDHAV+AVGYG NG YW Sbjct: 267 LKKAI-SYQPVSVAIEAGGRIFQHYQSGI-FTGSCGTN---LDHAVVAVGYGSENGVDYW 321 Query: 358 LVKNSWSNMWGNDGYV-----LMSMRENNCGVQSAPTY 456 +V+NSW WG +GY+ L + + CG+ +Y Sbjct: 322 IVRNSWGPRWGEEGYIRMERNLAASKSGKCGIAVEASY 359 >At1g09850.1 68414.m01109 cysteine protease, papain-like (XBCP3) identical to papain-like cysteine peptidase XBCP3 GI:14600257 from [Arabidopsis thaliana]; contains Pfam profiles PF00112: Papain family cysteine protease and PF00396: Granulin Length = 437 Score = 106 bits (255), Expect = 8e-24 Identities = 61/156 (39%), Positives = 87/156 (55%), Gaps = 6/156 (3%) Frame = +1 Query: 7 GGGEDFRAYQWIMK-HGLPTEEDYGGYLGQDGYCHIDNVTA-ITKITGWVNVTTNNENAL 180 GG D+ A+++++K HG+ TE+DY Y +DG C D + + I + V +N+E AL Sbjct: 183 GGLMDY-AFEFVIKNHGIDTEKDYP-YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKAL 240 Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360 A+ P+SV I + + F YS+G++ P C LDHAVL VGYG NG YW+ Sbjct: 241 MEAVAAQ-PVSVGICGSERAFQLYSSGIFSGP-CSTS---LDHAVLIVGYGSQNGVDYWI 295 Query: 361 VKNSWSNMWGNDGYVLMSMRENN----CGVQSAPTY 456 VKNSW WG DG++ M N CG+ +Y Sbjct: 296 VKNSWGKSWGMDGFMHMQRNTENSDGVCGINMLASY 331 >At3g48340.1 68416.m05276 cysteine proteinase, putative similar to cysteine endopeptidase precursor [Ricinus communis] GI:2944446; contains Pfam profile PF00112: Papain family cysteine protease Length = 351 Score = 105 bits (251), Expect = 2e-23 Identities = 64/149 (42%), Positives = 80/149 (53%), Gaps = 6/149 (4%) Frame = +1 Query: 28 AYQWIMKHGLPTEEDYGGYLGQDGYCHI--DNVTAITKITGWVNVTTNNENALKLALFKH 201 A+++I K+G T ED Y G DG C DN +T I G +V N+ENAL A+ Sbjct: 189 
AFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVT-IDGHEDVPENDENALLKAVANQ 247 Query: 202 GPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSN 381 P+SVAIDA F FYS GV F C EL+H V AVGYG G KYW+V+NSW Sbjct: 248 -PVSVAIDAGSSDFQFYSEGV-FTGSCGT---ELNHGVAAVGYGSERGKKYWIVRNSWGA 302 Query: 382 MWGNDGYVLMSMR----ENNCGVQSAPTY 456 WG GY+ + E CG+ +Y Sbjct: 303 EWGEGGYIKIEREIDEPEGRCGIAMEASY 331 >At3g19390.1 68416.m02459 cysteine proteinase, putative / thiol protease, putative contains similarity to cysteine proteinase RD21A (thiol protease) GI:435619, SP:P43297 from [Arabidopsis thaliana] Length = 452 Score = 102 bits (244), Expect = 2e-22 Identities = 63/157 (40%), Positives = 92/157 (58%), Gaps = 7/157 (4%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHG-LPTEEDYGGYLGQD-GYCHIDNV-TAITKITGWVNVTTNNENA 177 GGG A+++I+++G + TEEDY Y+ D C+ D T + I G+ +V N+E + Sbjct: 193 GGGLMDYAFKFIIENGGIDTEEDYP-YIATDVNVCNSDKKNTRVVTIDGYEDVPQNDEKS 251 Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357 LK AL PISVAI+A + F Y++GV F C LDH V+AVGYG G YW Sbjct: 252 LKKALANQ-PISVAIEAGGRAFQLYTSGV-FTGTCGTS---LDHGVVAVGYGSEGGQDYW 306 Query: 358 LVKNSWSNMWGNDGYVLM--SMRENN--CGVQSAPTY 456 +V+NSW + WG GY + +++E++ CGV +Y Sbjct: 307 IVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASY 343 >At1g20850.1 68414.m02612 cysteine endopeptidase, papain-type (XCP2) identical to papain-type cysteine endopeptidase XCP2 GI:6708183 from [Arabidopsis thaliana] Length = 356 Score = 102 bits (244), Expect = 2e-22 Identities = 66/156 (42%), Positives = 90/156 (57%), Gaps = 6/156 (3%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAIT-KITGWVNVTTNNENAL 180 GG D+ A+++I+K+G L EEDY Y ++G C + + T I G +V TN+E +L Sbjct: 203 GGLMDY-AFEYIVKNGGLRKEEDYP-YSMEEGTCEMQKDESETVTINGHQDVPTNDEKSL 260 Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360 AL H P+SVAIDA+ + F FYS GV F+ +C VD LDH V AVGYG G Y + Sbjct: 261 LKAL-AHQPLSVAIDASGREFQFYSGGV-FDGRCG--VD-LDHGVAAVGYGSSKGSDYII 315 Query: 361 VKNSWSNMWGNDGYVLMSMR----ENNCGVQSAPTY 456 VKNSW 
WG GY+ + E CG+ ++ Sbjct: 316 VKNSWGPKWGEKGYIRLKRNTGKPEGLCGINKMASF 351 >At3g19400.1 68416.m02461 cysteine proteinase, putative non-consensus AT acceptor site at exon 3; contains similarity to cysteine protease CYP1 GI:2828252, TDI-65 GI:5726641 from [Lycopersicon esculentum] Length = 362 Score = 101 bits (241), Expect = 4e-22 Identities = 57/156 (36%), Positives = 88/156 (56%), Gaps = 7/156 (4%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHID--NVTAITKITGWVNVTTNNENAL 180 GG A+++IMK+G + T++DY G C+ D N T + I G+ +V ++E +L Sbjct: 196 GGIMNYAFEFIMKNGGIETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSL 255 Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWL 360 K A+ H P+SVAI+A+ + F Y +GV C LDH V+ VGYG +G YW+ Sbjct: 256 KKAV-AHQPVSVAIEASSQAFQLYKSGV-MTGTCGIS---LDHGVVVVGYGSTSGEDYWI 310 Query: 361 VKNSWSNMWGNDGYVLMSMRENN----CGVQSAPTY 456 ++NSW WG+ GYV + ++ CG+ P+Y Sbjct: 311 IRNSWGLNWGDSGYVKLQRNIDDPFGKCGIAMMPSY 346 >At1g29080.1 68414.m03560 peptidase C1A papain family protein contains similarity to cysteine protease SPCP1 GI:13491750 from [Ipomoea batatas]; contains Pfam profile PF00112: Papain family cysteine protease Length = 346 Score = 100 bits (239), Expect = 7e-22 Identities = 62/155 (40%), Positives = 84/155 (54%), Gaps = 6/155 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG A+ +I+KH G+ +E +Y Y ++G C + AI I G+ NV +NNE AL Sbjct: 195 GGTFVNAFNYIIKHRGISSENEYP-YQVKEGPCRSNARPAIL-IRGFENVPSNNERALLE 252 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL-NGHKYWLV 363 A+ + P++VAIDA+ F YS GVY C V+ HAV VGYG G KYWL Sbjct: 253 AVSRQ-PVAVAIDASEAGFVHYSGGVYNARNCGTSVN---HAVTLVGYGTSPEGMKYWLA 308 Query: 364 KNSWSNMWGNDGYVL----MSMRENNCGVQSAPTY 456 KNSW WG +GY+ + + CGV +Y Sbjct: 309 KNSWGKTWGENGYIRIRRDVEWPQGMCGVAQYASY 343 >At5g45890.1 68418.m05644 senescence-specific SAG12 protein (SAG12) / cysteine proteinase, putative identical to senescence-specific protein SAG12 GI:1046373 from [Arabidopsis thaliana] Length 
= 346 Score = 99.1 bits (236), Expect = 2e-21 Identities = 58/144 (40%), Positives = 78/144 (54%), Gaps = 6/144 (4%) Frame = +1 Query: 52 GLPTEEDYGGYLGQDGYCHIDNVTA-ITKITGWVNVTTNNENALKLALFKHGPISVAIDA 228 GL TE +Y Y G+D C+ T ITG+ +V N+E AL A+ H P+SV I+ Sbjct: 209 GLTTESNYP-YKGEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAV-AHQPVSVGIEG 266 Query: 229 AHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLVKNSWSNMWGNDGYV 405 F FYS+GV F +C LDHAV A+GYG NG KYW++KNSW WG GY+ Sbjct: 267 GGFDFQFYSSGV-FTGECTTY---LDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGYM 322 Query: 406 LMSM----RENNCGVQSAPTYVLI 465 + ++ CG+ +Y I Sbjct: 323 RIQKDVKDKQGLCGLAMKASYPTI 346 >At4g35350.1 68417.m05023 cysteine endopeptidase, papain-type (XCP1) identical to papain-type cysteine endopeptidase XCP1 GI:6708181 from [Arabidopsis thaliana] Length = 355 Score = 98.7 bits (235), Expect = 2e-21 Identities = 63/157 (40%), Positives = 89/157 (56%), Gaps = 7/157 (4%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHI--DNVTAITKITGWVNVTTNNENA 177 GG D+ A+Q+I+ G L E+DY YL ++G C ++V +T I+G+ +V N++ + Sbjct: 202 GGLMDY-AFQYIISTGGLHKEDDYP-YLMEEGICQEQKEDVERVT-ISGYEDVPENDDES 258 Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357 L AL H P+SVAI+A+ + F FY GV F KC +LDH V AVGYG G Y Sbjct: 259 LVKAL-AHQPVSVAIEASGRDFQFYKGGV-FNGKCGT---DLDHGVAAVGYGSSKGSDYV 313 Query: 358 LVKNSWSNMWGNDGYVLMSMR----ENNCGVQSAPTY 456 +VKNSW WG G++ M E CG+ +Y Sbjct: 314 IVKNSWGPRWGEKGFIRMKRNTGKPEGLCGINKMASY 350 >At4g11310.1 68417.m01827 cysteine proteinase, putative contains similarity to cysteine proteinase RD21A (thiol protease) GI:435619, SP:P43297 from [Arabidopsis thaliana] Length = 364 Score = 97.1 bits (231), Expect = 6e-21 Identities = 60/159 (37%), Positives = 85/159 (53%), Gaps = 7/159 (4%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYC--HIDNVTAITKITGWVNVTTNNENA 177 GGG+ AY++IMK+G L T+ DY Y +G C + I G+ N+ N+E+A Sbjct: 200 GGGKLETAYEFIMKNGGLGTDNDYP-YKAVNGVCDGRLKENNKNVMIDGYENLPANDESA 258 Query: 178 
LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357 L A+ H P++ ID++ + F Y +GV F+ C L+H V+ VGYG NG YW Sbjct: 259 LMKAV-AHQPVTAVIDSSSREFQLYESGV-FDGSCGTN---LNHGVVVVGYGTENGRDYW 313 Query: 358 LVKNSWSNMWGNDGYVLMSMRENN----CGVQSAPTYVL 462 LVKNS WG GY+ M+ N CG+ +Y L Sbjct: 314 LVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPL 352 >At5g50260.1 68418.m06224 cysteine proteinase, putative similar to cysteine endopeptidase precursor CysEP GI:2944446 from [Ricinus communis] Length = 361 Score = 94.3 bits (224), Expect = 4e-20 Identities = 60/157 (38%), Positives = 80/157 (50%), Gaps = 6/157 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTA-ITKITGWVNVTTNNENALKL 186 GG A+++I + G T E Y D C + A + I G +V N+E+ L Sbjct: 191 GGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMK 250 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-LNGHKYWLV 363 A+ P+SVAIDA F FYS GV F +C EL+H V VGYG ++G KYW+V Sbjct: 251 AVANQ-PVSVAIDAGGSDFQFYSEGV-FTGRCGT---ELNHGVAVVGYGTTIDGTKYWIV 305 Query: 364 KNSWSNMWGNDGYVLMSM----RENNCGVQSAPTYVL 462 KNSW WG GY+ M +E CG+ +Y L Sbjct: 306 KNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342 >At4g11320.1 68417.m01828 cysteine proteinase, putative contains similarity to cysteine proteinase RD21A (thiol protease) GI:435619, SP:P43297 from [Arabidopsis thaliana] Length = 371 Score = 94.3 bits (224), Expect = 4e-20 Identities = 57/159 (35%), Positives = 84/159 (52%), Gaps = 7/159 (4%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCH--IDNVTAITKITGWVNVTTNNENA 177 GGG+ AY++IM +G L T+ DY Y +G C + I G+ N+ N+E A Sbjct: 207 GGGKVETAYEFIMNNGGLGTDNDYP-YKALNGVCEGRLKEDNKNVMIDGYENLPANDEAA 265 Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357 L A+ H P++ +D++ + F Y +GV F+ C L+H V+ VGYG NG YW Sbjct: 266 LMKAV-AHQPVTAVVDSSSREFQLYESGV-FDGTCGTN---LNHGVVVVGYGTENGRDYW 320 Query: 358 LVKNSWSNMWGNDGYVLMSMRENN----CGVQSAPTYVL 462 +VKNS + WG GY+ M+ N CG+ +Y L Sbjct: 321 IVKNSRGDTWGEAGYMKMARNIANPRGLCGIAMRASYPL 359 >At2g34080.1 68415.m04172 
cysteine proteinase, putative contains similarity to cysteine protease SPCP1 GI:13491750 from [Ipomoea batatas] Length = 345 Score = 93.9 bits (223), Expect = 6e-20 Identities = 55/134 (41%), Positives = 75/134 (55%), Gaps = 2/134 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG A+ +++++ G+ +E DY Y G DG C N +I+G+ V +NNE AL Sbjct: 195 GGIMSDAFNYVVQNRGIASENDYS-YQGSDGGCR-SNARPAARISGFQTVPSNNERALLE 252 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL-NGHKYWLV 363 A+ + P+SV++DA F YS GVY P C + HAV VGYG +G KYWL Sbjct: 253 AVSRQ-PVSVSMDATGDGFMHYSGGVYDGP-CGTSSN---HAVTFVGYGTSQDGTKYWLA 307 Query: 364 KNSWSNMWGNDGYV 405 KNSW WG GY+ Sbjct: 308 KNSWGETWGEKGYI 321 >At1g29090.1 68414.m03561 peptidase C1A papain family protein contains similarity to cysteine protease SPCP1 GI:13491750 from [Ipomoea batatas]; contains Pfam profile PF00112: Papain family cysteine protease Length = 355 Score = 93.9 bits (223), Expect = 6e-20 Identities = 57/134 (42%), Positives = 74/134 (55%), Gaps = 2/134 (1%) Frame = +1 Query: 10 GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GG A+ +I+K+ G+ +E Y Y +G C + + I G+ V +NNE AL Sbjct: 204 GGIMSDAFSYIIKNRGIASEASYP-YQAAEGTCRYNGKPSAW-IRGFQTVPSNNERALLE 261 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVL-NGHKYWLV 363 A+ K P+SV+IDA F YS GVY EP C V+ HAV VGYG G KYWL Sbjct: 262 AVSKQ-PVSVSIDADGPGFMHYSGGVYDEPYCGTNVN---HAVTFVGYGTSPEGIKYWLA 317 Query: 364 KNSWSNMWGNDGYV 405 KNSW WG +GY+ Sbjct: 318 KNSWGETWGENGYI 331 >At3g49340.1 68416.m05394 cysteine proteinase, putative contains PS00640: Eukaryotic thiol (cysteine) proteases asparagine active site; similar to cysteine proteinase GI:535454 from [Alnus glutinosam] Length = 341 Score = 93.5 bits (222), Expect = 8e-20 Identities = 59/156 (37%), Positives = 88/156 (56%), Gaps = 6/156 (3%) Frame = +1 Query: 7 GGGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALK 183 GGG ++A+ +I ++ G+ TE++Y Y G C +++ A T I+G+ V N+E AL Sbjct: 
190 GGGIMWKAFDYIKENQGITTEDNYP-YQGAQQTCESNHLAAAT-ISGYETVPQNDEEALL 247 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN-GHKYWL 360 A+ + P+SVAI+ + F YS G+ F +C +L HAV VGYGV G KYWL Sbjct: 248 KAVSQQ-PVSVAIEGSGYEFIHYSGGI-FNGECGT---QLTHAVTIVGYGVSEEGIKYWL 302 Query: 361 VKNSWSNMWGNDGYVL----MSMRENNCGVQSAPTY 456 +KNSW WG +GY+ + + CG+ S Y Sbjct: 303 LKNSWGESWGENGYMRIMRDVDSPQGMCGLASLAYY 338 >At4g23520.1 68417.m03390 cysteine proteinase, putative contains similarity to cysteine proteinase (thiol protease) RD21A GI:435619, SP:P43297 from [Arabidopsis thaliana] Length = 356 Score = 90.2 bits (214), Expect = 7e-19 Identities = 53/157 (33%), Positives = 85/157 (54%), Gaps = 7/157 (4%) Frame = +1 Query: 7 GGGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTA--ITKITGWVNVTTNNENA 177 G G A+Q+++ + GL +E+DY Y G G C+ T+ + I + +V N+E + Sbjct: 197 GSGLMDTAFQFLINNNGLDSEKDYP-YQGTQGSCNRKQSTSNKVITIDSYEDVPANDEIS 255 Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYW 357 L+ A+ H P+SV +D + F Y + +Y P N LDHA++ VGYG NG YW Sbjct: 256 LQKAV-AHQPVSVGVDKKSQEFMLYRSCIYNGPCGTN----LDHALVIVGYGSENGQDYW 310 Query: 358 LVKNSWSNMWGNDGYVLMSMR----ENNCGVQSAPTY 456 +V+NSW WG+ GY+ ++ + CG+ +Y Sbjct: 311 IVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASY 347 >At3g48350.1 68416.m05277 cysteine proteinase, putative similar to cysteine endopeptidase precursor [Ricinus communis] GI:2944446; contains Pfam profile PF00112: Papain family cysteine protease Length = 364 Score = 90.2 bits (214), Expect = 7e-19 Identities = 58/156 (37%), Positives = 81/156 (51%), Gaps = 7/156 (4%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHIDNVTAIT-KITGWVNVTTNNENALK 183 GG A+++I +G + TEE Y +C +++ T I G +V N+E L Sbjct: 191 GGLMEPAFEFIKNNGGIKTEETYPYDSSDVQFCRANSIGGETVTIDGHEHVPENDEEELL 250 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYG-VLNGHKYWL 360 A+ H P+SVAIDA F YS GV+ +C +L+H V+ VGYG NG KYW+ Sbjct: 251 KAV-AHQPVSVAIDAGSSDFQLYSEGVFIG-ECGT---QLNHGVVIVGYGETKNGTKYWI 305 Query: 361 
VKNSWSNMWGNDGYVL----MSMRENNCGVQSAPTY 456 V+NSW WG GYV +S E CG+ +Y Sbjct: 306 VRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASY 341 >At1g06260.1 68414.m00662 cysteine proteinase, putative contains similarity to thiol-protease, pre-pro-TPE4A protein GI:3688528 [Pisum sativum] Length = 343 Score = 90.2 bits (214), Expect = 7e-19 Identities = 60/157 (38%), Positives = 81/157 (51%), Gaps = 6/157 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHID-NVTAITKITGWVNVTTNNENALK 183 GG A+++I +G L TE DY Y G +G C + + + I G+ V NE +L+ Sbjct: 193 GGLMETAFEFIKTNGGLATETDYP-YTGIEGTCDQEKSKNKVVTIQGYQKVA-QNEASLQ 250 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLV 363 +A + P+SV IDA F YS+GV F C L+H V VGYGV KYW+V Sbjct: 251 IAAAQQ-PVSVGIDAGGFIFQLYSSGV-FTNYCGTN---LNHGVTVVGYGVEGDQKYWIV 305 Query: 364 KNSWSNMWGNDGYVLM----SMRENNCGVQSAPTYVL 462 KNSW WG +GY+ M S CG+ +Y L Sbjct: 306 KNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASYPL 342 >At1g29110.1 68414.m03563 cysteine proteinase, putative contains similarity to cysteine protease SPCP1 GI:13491750 from [Ipomoea batatas] Length = 334 Score = 89.0 bits (211), Expect = 2e-18 Identities = 54/154 (35%), Positives = 79/154 (51%), Gaps = 5/154 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAI-TKITGWVNVTTNNENALKL 186 GGE A+++I+K+G + E Y + C + A T+I G+ V ++NE AL L Sbjct: 182 GGEFEEAFKYIIKNGGVSLETEYPYQVKKESCRANARRAPHTQIRGFQMVPSHNERAL-L 240 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVK 366 + P+SV IDA +F Y GVY C V+ HAV VGYG ++G YW++K Sbjct: 241 EAVRRQPVSVLIDARADSFGHYKGGVYAGLDCGTDVN---HAVTIVGYGTMSGLNYWVLK 297 Query: 367 NSWSNMWGNDGYVL----MSMRENNCGVQSAPTY 456 NSW WG +GY+ + + CG+ Y Sbjct: 298 NSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAY 331 >At3g54940.3 68416.m06091 cysteine proteinase, putative contains similarity to cysteine proteinase GI:479060 from [Glycine max] Length = 368 Score = 85.0 bits (201), Expect = 3e-17 Identities = 49/164 (29%), Positives = 79/164 (48%), Gaps = 7/164 (4%) Frame = +1 Query: 7 
GGGEDFRAYQWIMKHGLPTEEDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKL 186 GGG AY+++M+ G EE Y G+ G+C D ++ + + + EN + Sbjct: 210 GGGLMTNAYEYLMEAGGLEEERSYPYTGKRGHCKFDPEKVAVRVLNFTTIPLD-ENQIAA 268 Query: 187 ALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-------LNG 345 L +HGP++V ++A Y GV C + ++H VL VGYG L+ Sbjct: 269 NLVRHGPLAVGLNAVF--MQTYIGGVSCPLICSKR--NVNHGVLLVGYGSKGFSILRLSN 324 Query: 346 HKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI*IST 477 YW++KNSW WG +GY + + CG+ S + V +S+ Sbjct: 325 KPYWIIKNSWGKKWGENGYYKLCRGHDICGINSMVSAVATQVSS 368 >At2g21430.1 68415.m02550 cysteine proteinase A494, putative / thiol protease, putative identical to SP:P43295 Probable cysteine proteinase A494 precursor [Arabidopsis thaliana]; strong similarity to cysteine proteinase RD19A (thiol protease) GI:435618, SP:P43296 from [Arabidopsis thaliana] Length = 361 Score = 83.4 bits (197), Expect = 8e-17 Identities = 54/154 (35%), Positives = 80/154 (51%), Gaps = 9/154 (5%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDG-YCHIDNVTAITKITGWVNVTTNNENALK 183 GG A+++ +K G L E+DY Y G DG C +D + ++ + +V + NE+ + Sbjct: 205 GGLMNSAFEYTLKTGGLMREKDYP-YTGTDGGSCKLDRSKIVASVSNF-SVVSINEDQIA 262 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-------LN 342 L K+GP++VAI+AA+ Y GV C + L+H VL VGYG L Sbjct: 263 ANLIKNGPLAVAINAAY--MQTYIGGVSCPYICSRR---LNHGVLLVGYGSAGFSQARLK 317 Query: 343 GHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQS 444 YW++KNSW WG +G+ + N CGV S Sbjct: 318 EKPYWIIKNSWGESWGENGFYKICKGRNICGVDS 351 >At2g27420.1 68415.m03314 cysteine proteinase, putative contains similarity to cysteine protease SPCP1 GI:13491750 from [Ipomoea batatas] Length = 348 Score = 82.6 bits (195), Expect = 1e-16 Identities = 51/137 (37%), Positives = 76/137 (55%), Gaps = 5/137 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKH-GLPTEEDYGGYLGQDGYCHIDNVTAITK---ITGWVNVTTNNENA 177 GG +A+++I+K+ G+ TE++Y Q +++ + I+G+ V NNE A Sbjct: 193 GGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATISGYETVPMNNEEA 252 Query: 178 
LKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN-GHKY 354 L A+ + P+SV I+ F YS GV F +C +L HAV VGYG+ G KY Sbjct: 253 LLQAVSQQ-PVSVGIEGTGAAFRHYSGGV-FNGECGT---DLHHAVTIVGYGMSEEGTKY 307 Query: 355 WLVKNSWSNMWGNDGYV 405 W+VKNSW WG +GY+ Sbjct: 308 WVVKNSWGETWGENGYM 324 >At4g39090.1 68417.m05535 cysteine proteinase RD19a (RD19A) / thiol protease identical to cysteine proteinase RD19a, thiol protease SP:P43296, GI:435618 from [Arabidopsis thaliana] Length = 368 Score = 82.2 bits (194), Expect = 2e-16 Identities = 55/165 (33%), Positives = 83/165 (50%), Gaps = 9/165 (5%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGY-CHIDNVTAITKITGWVNVTTNNENALK 183 GG A+++ +K G L EEDY Y G+DG C +D + ++ + +V + +E + Sbjct: 208 GGLMNSAFEYTLKTGGLMKEEDYP-YTGKDGKTCKLDKSKIVASVSNF-SVISIDEEQIA 265 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-------LN 342 L K+GP++VAI+A + Y GV C + L+H VL VGYG Sbjct: 266 ANLVKNGPLAVAINAGY--MQTYIGGVSCPYICTRR---LNHGVLLVGYGAAGYAPARFK 320 Query: 343 GHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAPTYVLI*IST 477 YW++KNSW WG +G+ + N CGV S + V +ST Sbjct: 321 EKPYWIIKNSWGETWGENGFYKICKGRNICGVDSMVSTVAATVST 365 >At3g43960.1 68416.m04706 cysteine proteinase, putative contains similarity to cysteine proteinase RD21A (thiol protease) GI:435619, SP:P43297 from [Arabidopsis thaliana] Length = 376 Score = 78.2 bits (184), Expect = 3e-15 Identities = 56/157 (35%), Positives = 78/157 (49%), Gaps = 8/157 (5%) Frame = +1 Query: 10 GGEDFRAYQWIMKHGLPTEEDYGGYLGQD-GYCHIDNV--TAITKITGWVNVTTNNENAL 180 GG A+++I ++G ++ GY G+D C + T + I G V N+E +L Sbjct: 194 GGGAVWAFEFIKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSL 253 Query: 181 KLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGH-KYW 357 K A+ + PISV I AA+ S Y +GVY + C N DH VL VGYG + YW Sbjct: 254 KKAV-AYQPISVMISAAN--MSDYKSGVY-KGACSNLWG--DHNVLIVGYGTSSDEGDYW 307 Query: 358 LVKNSWSNMWGNDGYVLMSMR----ENNCGVQSAPTY 456 L++NSW WG GY+ + C V AP Y Sbjct: 308 LIRNSWGPEWGEGGYLRLQRNFHEPTGKCAVAVAPVY 344 >At4g16190.1 68417.m02457 cysteine 
proteinase, putative contains similarity to papain-like cysteine proteinase isoform I GI:7381219 from [Ipomoea batatas] Length = 373 Score = 72.9 bits (171), Expect = 1e-13 Identities = 50/155 (32%), Positives = 77/155 (49%), Gaps = 10/155 (6%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGY-CHIDNVTAITKITGWVNVTTNNENALK 183 GG A+++ +K G L EEDY Y G+D C D + ++ + +V +++E+ + Sbjct: 213 GGLMNNAFEYALKAGGLMKEEDYP-YTGRDHTACKFDKSKIVASVSNF-SVVSSDEDQIA 270 Query: 184 LALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGV-------LN 342 L +HGP+++AI+A Y GV C D H VL VG+G L Sbjct: 271 ANLVQHGPLAIAINAMW--MQTYIGGVSCPYVCSKSQD---HGVLLVGFGSSGYAPIRLK 325 Query: 343 GHKYWLVKNSWSNMWGNDGYVLMSMRENN-CGVQS 444 YW++KNSW MWG GY + +N CG+ + Sbjct: 326 EKPYWIIKNSWGAMWGEHGYYKICRGPHNMCGMDT 360 >At4g01610.2 68417.m00211 cathepsin B-like cysteine protease, putative similar to cathepsin B-like cysteine proteinase GI:609175 from [Nicotiana rustica]; contains an unusually short, 5nt exon Length = 359 Score = 66.9 bits (156), Expect = 8e-12 Identities = 31/97 (31%), Positives = 52/97 (53%), Gaps = 1/97 (1%) Frame = +1 Query: 163 NNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN 342 +N + ++K+GP+ V+ ++ F+ Y +GVY N HAV +G+G + Sbjct: 242 SNPQDIMAEVYKNGPVEVSF-TVYEDFAHYKSGVYKHITGSNIGG---HAVKLIGWGTSS 297 Query: 343 -GHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAP 450 G YWL+ N W+ WG+DGY ++ N CG++ P Sbjct: 298 EGEDYWLMANQWNRGWGDDGYFMIRRGTNECGIEDEP 334 >At4g01610.1 68417.m00210 cathepsin B-like cysteine protease, putative similar to cathepsin B-like cysteine proteinase GI:609175 from [Nicotiana rustica]; contains an unusually short, 5nt exon Length = 359 Score = 66.9 bits (156), Expect = 8e-12 Identities = 31/97 (31%), Positives = 52/97 (53%), Gaps = 1/97 (1%) Frame = +1 Query: 163 NNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN 342 +N + ++K+GP+ V+ ++ F+ Y +GVY N HAV +G+G + Sbjct: 242 SNPQDIMAEVYKNGPVEVSF-TVYEDFAHYKSGVYKHITGSNIGG---HAVKLIGWGTSS 297 Query: 343 -GHKYWLVKNSWSNMWGNDGYVLMSMRENNCGVQSAP 450 
G YWL+ N W+ WG+DGY ++ N CG++ P Sbjct: 298 EGEDYWLMANQWNRGWGDDGYFMIRRGTNECGIEDEP 334 >At1g02305.1 68414.m00175 cathepsin B-like cysteine protease, putative similar to cathepsin B-like cysteine proteinase [Nicotiana rustica] GI:609175; contains Pfam profile PF00112: Papain family cysteine protease Length = 362 Score = 64.1 bits (149), Expect = 5e-11 Identities = 30/85 (35%), Positives = 48/85 (56%), Gaps = 1/85 (1%) Frame = +1 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN-GHKYWLVK 366 ++K+GP+ VA ++ F+ Y +GVY K + HAV +G+G + G YWL+ Sbjct: 254 VYKNGPVEVAF-TVYEDFAHYKSGVY---KHITGTNIGGHAVKLIGWGTSDDGEDYWLLA 309 Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQ 441 N W+ WG+DGY + N CG++ Sbjct: 310 NQWNRSWGDDGYFKIRRGTNECGIE 334 >At1g02300.1 68414.m00173 cathepsin B-like cysteine protease, putative similar to cathepsin B-like cysteine proteinase GI:609175 from [Nicotiana rustica] Length = 379 Score = 64.1 bits (149), Expect = 5e-11 Identities = 30/87 (34%), Positives = 48/87 (55%), Gaps = 1/87 (1%) Frame = +1 Query: 190 LFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN-GHKYWLVK 366 ++K+GP+ VA ++ F+ Y +GVY K HAV +G+G + G YWL+ Sbjct: 271 VYKNGPVEVAF-TVYEDFAHYKSGVY---KYITGTKIGGHAVKLIGWGTSDDGEDYWLLA 326 Query: 367 NSWSNMWGNDGYVLMSMRENNCGVQSA 447 N W+ WG+DGY + N CG++ + Sbjct: 327 NQWNRSWGDDGYFKIRRGTNECGIEQS 353 >At3g19400.2 68416.m02460 cysteine proteinase, putative non-consensus AT acceptor site at exon 3; contains similarity to cysteine protease CYP1 GI:2828252, TDI-65 GI:5726641 from [Lycopersicon esculentum] Length = 290 Score = 52.8 bits (121), Expect = 1e-07 Identities = 31/86 (36%), Positives = 51/86 (59%), Gaps = 3/86 (3%) Frame = +1 Query: 10 GGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHID--NVTAITKITGWVNVTTNNENAL 180 GG A+++IMK+G + T++DY G C+ D N T + I G+ +V ++E +L Sbjct: 196 GGIMNYAFEFIMKNGGIETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSL 255 Query: 181 KLALFKHGPISVAIDAAHKTFSFYSN 258 K A+ H P+SVAI+A+ + F Y + Sbjct: 256 KKAV-AHQPVSVAIEASSQAFQLYKS 280 >At4g35350.2 68417.m05022 
cysteine endopeptidase, papain-type (XCP1) identical to papain-type cysteine endopeptidase XCP1 GI:6708181 from [Arabidopsis thaliana] Length = 288 Score = 52.0 bits (119), Expect = 2e-07 Identities = 37/90 (41%), Positives = 57/90 (63%), Gaps = 3/90 (3%) Frame = +1 Query: 7 GGGEDFRAYQWIMKHG-LPTEEDYGGYLGQDGYCHI--DNVTAITKITGWVNVTTNNENA 177 GG D+ A+Q+I+ G L E+DY YL ++G C ++V +T I+G+ +V N++ + Sbjct: 202 GGLMDY-AFQYIISTGGLHKEDDYP-YLMEEGICQEQKEDVERVT-ISGYEDVPENDDES 258 Query: 178 LKLALFKHGPISVAIDAAHKTFSFYSNGVY 267 L AL H P+SVAI+A+ + F FY GVY Sbjct: 259 LVKAL-AHQPVSVAIEASGRDFQFY-KGVY 286 >At5g17140.1 68418.m02008 cysteine proteinase-related low similarity to cysteine proteinase [Sitophilus zeamais] GI:2804262 Length = 112 Score = 48.4 bits (110), Expect = 3e-06 Identities = 23/80 (28%), Positives = 43/80 (53%), Gaps = 2/80 (2%) Frame = +1 Query: 190 LFKHGPISVAIDAAHKTFSFYS-NGVYFEPKCKNKVDELDHAVLAVGYGVLNGHK-YWLV 363 L GPI +++D Y G+Y P+ K ++ HA++ VG+G K +++V Sbjct: 23 LLTQGPIGISVDLCGIFRQVYEFKGIYVLPEPKENMER--HALIIVGFGTTKDSKLFFIV 80 Query: 364 KNSWSNMWGNDGYVLMSMRE 423 +N+W WG +GY + +++ Sbjct: 81 QNTWGTKWGFNGYARIIIKK 100 >At2g27395.1 68415.m03308 cysteine protease-related contains similarity to senescence-specific cysteine protease GI:5823018 from [Brassica napus] Length = 86 Score = 47.2 bits (107), Expect = 7e-06 Identities = 30/72 (41%), Positives = 39/72 (54%), Gaps = 1/72 (1%) Frame = +1 Query: 163 NNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLN 342 NNE AL A+ +H P+S+ I+ F YS G+ F +C E D A + GY Sbjct: 2 NNEEALVQAVSQH-PVSMGIEGTGAAFRHYSGGI-FNGEC-----ETDFA--SCGYNCCE 52 Query: 343 -GHKYWLVKNSW 375 G KYW+VKNSW Sbjct: 53 EGTKYWMVKNSW 64 >At1g03720.1 68414.m00352 cathepsin-related contains weak similarity to Cathepsin L precursor (EC 3.4.22.15) (Major excreted protein) (MEP) (Cyclic protein-2) (CP-2) (Swiss-Prot:P07154) [Rattus norvegicus] Length = 274 Score = 38.3 bits (85), Expect = 0.003 Identities = 25/76 (32%), Positives = 38/76 (50%), Gaps = 10/76 (13%) Frame = 
+1 Query: 205 PISVAIDAAHKTFSFY------SNGVYFEPKCKNKVDELD---HAVLAVGYGVLNGHK-Y 354 P+++A D H F F SNG+ +++ D H VL VGYG +K + Sbjct: 180 PVAIAFDITHN-FQFIGNVSKKSNGLSIYNVSGVDMEDGDAGGHVVLIVGYGYTKENKLF 238 Query: 355 WLVKNSWSNMWGNDGY 402 +L++NSW WG G+ Sbjct: 239 FLIQNSWGEDWGVKGF 254 >At5g17080.1 68418.m02001 cathepsin-related contains weak similarity to Cathepsin L (EC 3.4.22.15) (Progesterone-dependent protein) (PDP) (Fragment) (Swiss-Prot:P25773) [Felis silvestris catus] Length = 298 Score = 36.7 bits (81), Expect = 0.009 Identities = 19/67 (28%), Positives = 36/67 (53%), Gaps = 1/67 (1%) Frame = +1 Query: 202 GPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHK-YWLVKNSWS 378 GP+ + ID + +G+Y PK K+ + HA+ V YG+ + +++V+N+W Sbjct: 204 GPVGITIDMSVGLRKL-KDGIYMVPKPKDGAPK--HALTIVAYGMTKEDELFFVVQNTWG 260 Query: 379 NMWGNDG 399 +W +G Sbjct: 261 TIWRVNG 267 >At1g61730.1 68414.m06962 DNA-binding storekeeper protein-related contains Pfam profile: PF04504 protein of unknown function, DUF573; similar to storekeeper protein GI:14268476 [Solanum tuberosum] Length = 376 Score = 31.9 bits (69), Expect = 0.26 Identities = 20/66 (30%), Positives = 33/66 (50%) Frame = +1 Query: 154 VTTNNENALKLALFKHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYG 333 V +++ A +L+ F GP +A+D+ K SNGV + K K+D + ++ G Sbjct: 229 VKAHDKKAFELSKFIWGPKGIALDSNVK-----SNGVSKKSVAKKKIDSVKQELVFAGGS 283 Query: 334 VLNGHK 351 NG K Sbjct: 284 STNGKK 289 >At5g05050.1 68418.m00536 peptidase C1A papain family protein weak similarity to berghepain-2 [Plasmodium berghei] GI:17978639; contains Pfam profile PF00112: Papain family cysteine protease Length = 299 Score = 29.9 bits (64), Expect = 1.1 Identities = 9/19 (47%), Positives = 12/19 (63%) Frame = +1 Query: 343 GHKYWLVKNSWSNMWGNDG 399 G YWL++NS+ WG G Sbjct: 232 GEHYWLIQNSYGEAWGEKG 250 >At1g20670.1 68414.m02589 DNA-binding bromodomain-containing protein contains bromodomain, INTERPRO:IPR001487 Length = 652 Score = 29.9 bits (64), Expect = 1.1 Identities = 14/41 (34%), Positives = 21/41 (51%) 
Frame = +2 Query: 191 CSNTARYQSQSMRPTRPSVSTRTVYISNRNAKTKWMS*TTP 313 CSN S R P+ S + +I NR+A ++ + TTP Sbjct: 499 CSNDLASDDHSNRILSPTASVSSAFIGNRHASSQAIEETTP 539 >At5g06600.2 68418.m00746 ubiquitin-specific protease 12 (UBP12) almost identical to ubiquitin-specific protease 12 GI:11993471 [Arabidopsis thaliana], one amino acid difference Length = 1115 Score = 27.9 bits (59), Expect = 4.3 Identities = 15/46 (32%), Positives = 21/46 (45%), Gaps = 2/46 (4%) Frame = +1 Query: 256 NGVYFEPKCKNKVDELD--HAVLAVGYGVLNGHKYWLVKNSWSNMW 387 +G Y P V L H+VL GV GH Y ++ + S+ W Sbjct: 422 DGKYLSPDADRSVRNLYTLHSVLVHSGGVHGGHYYAFIRPTLSDQW 467 >At5g06600.1 68418.m00745 ubiquitin-specific protease 12 (UBP12) almost identical to ubiquitin-specific protease 12 GI:11993471 [Arabidopsis thaliana], one amino acid difference Length = 1116 Score = 27.9 bits (59), Expect = 4.3 Identities = 15/46 (32%), Positives = 21/46 (45%), Gaps = 2/46 (4%) Frame = +1 Query: 256 NGVYFEPKCKNKVDELD--HAVLAVGYGVLNGHKYWLVKNSWSNMW 387 +G Y P V L H+VL GV GH Y ++ + S+ W Sbjct: 423 DGKYLSPDADRSVRNLYTLHSVLVHSGGVHGGHYYAFIRPTLSDQW 468 >At3g26990.1 68416.m03377 expressed protein contains Pfam domain, PF04818: Protein of unknown function, DUF618 Length = 513 Score = 27.5 bits (58), Expect = 5.7 Identities = 11/28 (39%), Positives = 17/28 (60%) Frame = +1 Query: 58 PTEEDYGGYLGQDGYCHIDNVTAITKIT 141 P+E Y + GQDG+ I++ IT +T Sbjct: 484 PSENSYQKFQGQDGFYGINSSVPITPVT 511 >At3g11910.1 68416.m01460 ubiquitin-specific protease, putative strong similarity to ubiquitin-specific protease 12 (UBP12) [Arabidopsis thaliana] GI:11993471; contains Pfam profiles PF00443: Ubiquitin carboxyl-terminal hydrolase, PF00917: MATH domain Length = 1115 Score = 27.5 bits (58), Expect = 5.7 Identities = 15/46 (32%), Positives = 21/46 (45%), Gaps = 2/46 (4%) Frame = +1 Query: 256 NGVYFEPKCKNKVDELD--HAVLAVGYGVLNGHKYWLVKNSWSNMW 387 +G Y P V L H+VL GV GH Y ++ + S+ W Sbjct: 422 DGRYLSPDADKSVRNLYTLHSVLVHSGGVHGGHYYAFIRPTLSDQW 467 >At2g40930.1 68415.m05052 
ubiquitin-specific protease 5, putative (UBP5) similar to GI:6648604 Length = 924 Score = 27.5 bits (58), Expect = 5.7 Identities = 10/21 (47%), Positives = 14/21 (66%) Frame = -1 Query: 440 WTPQLFSLIDISTYPSLPHIL 378 WTP+L + DI+ SLP +L Sbjct: 734 WTPELSGMYDITCLESLPEVL 754 >At1g53110.1 68414.m06014 expressed protein Length = 439 Score = 27.1 bits (57), Expect = 7.5 Identities = 11/25 (44%), Positives = 14/25 (56%) Frame = +1 Query: 241 FSFYSNGVYFEPKCKNKVDELDHAV 315 F F Y +P+ K K+DE DH V Sbjct: 52 FYFVKQFAYDDPEIKAKIDEADHEV 76 >At4g18320.1 68417.m02717 hypothetical protein Length = 320 Score = 26.6 bits (56), Expect = 9.9 Identities = 11/30 (36%), Positives = 19/30 (63%) Frame = +2 Query: 377 PICGATMDTYLCL*EKIIVVSKAHRLMYSF 466 PICG+T+D+ +KI ++ H L ++F Sbjct: 215 PICGSTIDSLSPSVDKISIILWGHTLDFTF 244 >At3g19515.1 68416.m02473 expressed protein Length = 507 Score = 26.6 bits (56), Expect = 9.9 Identities = 32/145 (22%), Positives = 55/145 (37%), Gaps = 7/145 (4%) Frame = +1 Query: 37 WIMKHGLPTE-------EDYGGYLGQDGYCHIDNVTAITKITGWVNVTTNNENALKLALF 195 +I +HG+P E +D+ D + H + ++ KI E AL L Sbjct: 215 YIKQHGIPREICKKFDCKDWQPPNADDPHMHTTKLKSVRKIESM-------EEAL--LLL 265 Query: 196 KHGPISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSW 375 H PI + K +Y+ P ++ HAV+ + G + K S Sbjct: 266 PHFPIGADL-VVFKELWTVGEEIYYGPGTNSRGFRSYHAVIVSSIEIYKGEVVAICKMSN 324 Query: 376 SNMWGNDGYVLMSMRENNCGVQSAP 450 ++GYV +S+ V ++P Sbjct: 325 GTKVCDEGYVRVSLATTYMVVGASP 349 >At1g12740.1 68414.m01479 cytochrome P450 family protein similar to Cytochrome P450 90A1 (SP:Q42569) [Arabidopsis thaliana] Length = 472 Score = 26.6 bits (56), Expect = 9.9 Identities = 19/65 (29%), Positives = 27/65 (41%) Frame = +1 Query: 205 PISVAIDAAHKTFSFYSNGVYFEPKCKNKVDELDHAVLAVGYGVLNGHKYWLVKNSWSNM 384 P+ V+ DA F F G F+ D H G L+G Y +KN + Sbjct: 79 PVIVSTDADLSYFVFNQEGRCFQSWYP---DTFTHIFGKKNVGSLHGFMYKYLKNMVLTL 135 Query: 385 WGNDG 399 +G+DG Sbjct: 136 FGHDG 140 Database: arabidopsis Posted date: Oct 4, 2007 10:56 AM Number of letters in 
database: 12,070,560
Number of sequences in database: 28,952

Lambda     K      H
 0.318   0.134   0.401

Gapped
Lambda     K      H
 0.279   0.0580  0.190

Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 11,406,869
Number of Sequences: 28952
Number of extensions: 224065
Number of successful extensions: 697
Number of sequences better than 10.0: 51
Number of HSP's better than 10.0 without gapping: 631
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 647
length of database: 12,070,560
effective HSP length: 76
effective length of database: 9,870,208
effective search space used: 937669760
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)