BLASTX 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= bmov12a04
         (665 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

UniRef50_UPI000051A875 Cluster: PREDICTED: similar to CG9953-PA;...  212  7e-54
UniRef50_Q9VS02 Cluster: CG9953-PA; n=6; Endopterygota|Rep: CG99...  205  9e-52
UniRef50_A1L226 Cluster: Zgc:158605; n=8; Deuterostomia|Rep: Zgc...  191  1e-47
UniRef50_A7RYG7 Cluster: Predicted protein; n=1; Nematostella ve...  169  4e-41
UniRef50_Q555E5 Cluster: Putative uncharacterized protein; n=1; ...  151  1e-35
UniRef50_Q54CF7 Cluster: Putative uncharacterized protein; n=1; ...  145  7e-34
UniRef50_P90893 Cluster: Putative serine protease F56F10.1 precu...  138  8e-32
UniRef50_P34528 Cluster: Putative serine protease K12H4.7 precur...  138  8e-32
UniRef50_Q54GI7 Cluster: Putative uncharacterized protein; n=1; ...  136  4e-31
UniRef50_Q5HZ74 Cluster: MGC85068 protein; n=6; Xenopus|Rep: MGC...  136  6e-31
UniRef50_Q7R4U6 Cluster: GLP_440_23177_21609; n=1; Giardia lambl...  132  1e-29
UniRef50_Q010M0 Cluster: Prolylcarboxypeptidase; n=2; Ostreococc...  126  5e-28
UniRef50_A5CG77 Cluster: Intestinal prolyl carboxypeptidase 2; n...  126  5e-28
UniRef50_UPI0000DB6BB8 Cluster: PREDICTED: similar to CG3734-PA;...  122  6e-27
UniRef50_A7SYK4 Cluster: Predicted protein; n=1; Nematostella ve...  122  8e-27
UniRef50_Q9NQE7 Cluster: Thymus-specific serine protease precurs...  122  1e-26
UniRef50_Q8SXS7 Cluster: RE36938p; n=1; Drosophila melanogaster|...  116  4e-25
UniRef50_Q54G47 Cluster: Putative uncharacterized protein; n=1; ...  114  2e-24
UniRef50_Q22N05 Cluster: Serine carboxypeptidase S28 family prot...  113  3e-24
UniRef50_Q67ZA2 Cluster: Prolyl carboxypeptidase like protein; n...  113  5e-24
UniRef50_Q54D54 Cluster: Putative uncharacterized protein; n=1; ...  113  5e-24
UniRef50_A2DLX9 Cluster: Clan SC, family S28, unassigned serine ...  112  6e-24
UniRef50_UPI000150A973 Cluster: Serine carboxypeptidase S28 fami...  111  1e-23
UniRef50_A0C0B8 Cluster: Chromosome undetermined scaffold_14, wh...  111  2e-23
UniRef50_Q16Y06 Cluster: Lysosomal pro-X carboxypeptidase, putat...  109  8e-23
UniRef50_A2G2H0 Cluster: Clan SC, family S28, unassigned serine ...  109  8e-23
UniRef50_Q7XCY0 Cluster: Prolyl carboxypeptidase like protein, p...  108  1e-22
UniRef50_Q22N04 Cluster: Serine carboxypeptidase S28 family prot...  108  1e-22
UniRef50_Q18198 Cluster: Putative uncharacterized protein; n=2; ...  105  1e-21
UniRef50_Q4RYV8 Cluster: Chromosome 16 SCAF14974, whole genome s...  104  2e-21
UniRef50_Q9VDX6 Cluster: CG18493-PA; n=4; Sophophora|Rep: CG1849...  103  3e-21
UniRef50_Q9VDX5 Cluster: CG3739-PA; n=5; Drosophila|Rep: CG3739-...  103  3e-21
UniRef50_Q19589 Cluster: Putative uncharacterized protein F19C7....  103  3e-21
UniRef50_A0DE29 Cluster: Chromosome undetermined scaffold_47, wh...  103  3e-21
UniRef50_Q7PX68 Cluster: ENSANGP00000013861; n=3; Culicimorpha|R...  103  4e-21
UniRef50_Q16Y05 Cluster: Prolylcarboxypeptidase, putative; n=2; ...  103  4e-21
UniRef50_Q93Z34 Cluster: At2g24280/F27D4.19; n=6; core eudicotyl...  101  1e-20
UniRef50_UPI000049885B Cluster: serine protease; n=1; Entamoeba ...  101  2e-20
UniRef50_Q23AY4 Cluster: Serine carboxypeptidase S28 family prot...  100  4e-20
UniRef50_Q5YEQ9 Cluster: Serine peptidase; n=1; Bigelowiella nat...   99  5e-20
UniRef50_Q5DC37 Cluster: SJCHGC02147 protein; n=1; Schistosoma j...   99  5e-20
UniRef50_Q19590 Cluster: Putative uncharacterized protein F19C7....   99  5e-20
UniRef50_O01979 Cluster: Putative uncharacterized protein pcp-2;...  100  6e-20
UniRef50_Q4DW34 Cluster: Serine carboxypeptidase S28, putative; ...   98  2e-19
UniRef50_P42785 Cluster: Lysosomal Pro-X carboxypeptidase precur...   98  2e-19
UniRef50_Q54YD0 Cluster: Putative uncharacterized protein; n=1; ...   97  4e-19
UniRef50_P34610 Cluster: Putative serine protease pcp-1 precurso...   97  4e-19
UniRef50_A2FRR3 Cluster: Clan SC, family S28, unassigned serine ...   95  1e-18
UniRef50_A2ET59 Cluster: Clan SC, family S28, unassigned serine ...   95  1e-18
UniRef50_A1C859 Cluster: Extracelular serine carboxypeptidase, p...   94  2e-18
UniRef50_Q7PJN6 Cluster: ENSANGP00000023762; n=1; Anopheles gamb...   94  3e-18
UniRef50_A2FRQ0 Cluster: Clan SC, family S28, unassigned serine ...   93  4e-18
UniRef50_A2F801 Cluster: Clan SC, family S28, unassigned serine ...   93  5e-18
UniRef50_Q7QAL7 Cluster: ENSANGP00000011396; n=2; Anopheles gamb...   91  2e-17
UniRef50_A0CB90 Cluster: Chromosome undetermined scaffold_163, w...   91  2e-17
UniRef50_UPI00004996CF Cluster: serine protease; n=1; Entamoeba ...   89  1e-16
UniRef50_Q1DJJ2 Cluster: Putative uncharacterized protein; n=2; ...   89  1e-16
UniRef50_Q9FLH1 Cluster: Lysosomal Pro-X carboxypeptidase; n=6; ...   88  2e-16
UniRef50_Q67WZ5 Cluster: Putative prolylcarboxypeptidase isoform...   88  2e-16
UniRef50_A2FGL0 Cluster: Clan SC, family S28, unassigned serine ...   88  2e-16
UniRef50_Q53ND8 Cluster: At2g24280/F27D4.19; n=4; Oryza sativa|R...   87  4e-16
UniRef50_UPI00015B5213 Cluster: PREDICTED: similar to prolylcarb...   85  1e-15
UniRef50_A3C6E7 Cluster: Putative uncharacterized protein; n=2; ...   85  1e-15
UniRef50_Q7QAL4 Cluster: ENSANGP00000011387; n=1; Anopheles gamb...   85  2e-15
UniRef50_A7PQM2 Cluster: Chromosome chr6 scaffold_25, whole geno...   84  3e-15
UniRef50_UPI0000E4A528 Cluster: PREDICTED: similar to prolylcarb...   83  4e-15
UniRef50_Q9FFC2 Cluster: Prolylcarboxypeptidase-like protein; n=...   83  4e-15
UniRef50_Q16Y07 Cluster: Prolylcarboxypeptidase, putative; n=1; ...   83  4e-15
UniRef50_Q9GRV9 Cluster: Putative uncharacterized protein pcp-4;...   82  1e-14
UniRef50_Q16LF2 Cluster: Prolylcarboxypeptidase, putative; n=4; ...   82  1e-14
UniRef50_A1CFV7 Cluster: Serine peptidase, putative; n=5; Pezizo...   82  1e-14
UniRef50_UPI0000499072 Cluster: serine protease; n=2; Entamoeba ...   81  2e-14
UniRef50_Q5CZT1 Cluster: Zgc:113564; n=12; Eumetazoa|Rep: Zgc:11...   80  4e-14
UniRef50_A6S9T4 Cluster: Putative uncharacterized protein; n=3; ...   80  4e-14
UniRef50_P34676 Cluster: Putative serine protease tag-282 precur...   80  4e-14
UniRef50_A4RKL9 Cluster: Putative uncharacterized protein; n=1; ...   79  7e-14
UniRef50_A4QUS9 Cluster: Putative uncharacterized protein; n=1; ...   79  7e-14
UniRef50_Q7SEA3 Cluster: Putative uncharacterized protein NCU008...   79  1e-13
UniRef50_Q0U1V1 Cluster: Putative uncharacterized protein; n=2; ...   79  1e-13
UniRef50_Q9UHL4 Cluster: Dipeptidyl-peptidase 2 precursor; n=19;...   78  2e-13
UniRef50_Q2HER6 Cluster: Putative uncharacterized protein; n=1; ...   78  2e-13
UniRef50_Q29MX0 Cluster: GA15377-PA; n=4; Endopterygota|Rep: GA1...   77  3e-13
UniRef50_Q7Z5N6 Cluster: Thymus specific serine peptidase; n=4; ...   77  5e-13
UniRef50_Q7Z5N5 Cluster: Thymus specific serine peptidase; n=3; ...   77  5e-13
UniRef50_Q5KFY9 Cluster: Putative uncharacterized protein; n=4; ...   77  5e-13
UniRef50_UPI0000078353 Cluster: C46C2.4; n=1; Caenorhabditis ele...   76  7e-13
UniRef50_Q54HT4 Cluster: Putative uncharacterized protein; n=1; ...   75  1e-12
UniRef50_Q9VIM0 Cluster: CG2493-PA; n=3; Diptera|Rep: CG2493-PA ...   75  2e-12
UniRef50_Q5BYD1 Cluster: SJCHGC06818 protein; n=2; Schistosoma j...   75  2e-12
UniRef50_A7EU48 Cluster: Putative uncharacterized protein; n=1; ...   75  2e-12
UniRef50_Q22MF3 Cluster: Serine carboxypeptidase S28 family prot...   75  2e-12
UniRef50_Q2GU64 Cluster: Putative uncharacterized protein; n=1; ...   75  2e-12
UniRef50_Q54H23 Cluster: Putative uncharacterized protein; n=1; ...   74  3e-12
UniRef50_A2E983 Cluster: Clan SC, family S28, unassigned serine ...   74  3e-12
UniRef50_Q0UTR3 Cluster: Predicted protein; n=1; Phaeosphaeria n...   74  4e-12
UniRef50_Q7S134 Cluster: Putative uncharacterized protein NCU099...   73  5e-12
UniRef50_A6SA13 Cluster: Putative uncharacterized protein; n=1; ...   73  6e-12
UniRef50_A4RA99 Cluster: Putative uncharacterized protein; n=1; ...   73  8e-12
UniRef50_Q7QQ95 Cluster: GLP_243_15169_16578; n=1; Giardia lambl...   71  2e-11
UniRef50_Q4PHW9 Cluster: Putative uncharacterized protein; n=1; ...   70  4e-11
UniRef50_Q4DM56 Cluster: Serine carboxypeptidase S28, putative; ...   66  7e-10
UniRef50_A2FA76 Cluster: Clan SC, family S28, unassigned serine ...   66  7e-10
UniRef50_A2ERP5 Cluster: Clan SC, family S28, unassigned serine ...   66  9e-10
UniRef50_Q0V7E6 Cluster: Putative uncharacterized protein; n=1; ...   65  1e-09
UniRef50_A7EHM7 Cluster: Putative uncharacterized protein; n=1; ...   64  3e-09
UniRef50_A4R3D5 Cluster: Putative uncharacterized protein; n=1; ...   59  8e-08
UniRef50_Q9VDX1 Cluster: CG11626-PA; n=2; Sophophora|Rep: CG1162...   57  4e-07
UniRef50_A2WVG2 Cluster: Putative uncharacterized protein; n=3; ...   54  2e-06
UniRef50_A2FQM0 Cluster: Putative uncharacterized protein; n=1; ...   52  1e-05
UniRef50_Q2UKB6 Cluster: Predicted protein; n=1; Aspergillus ory...   52  1e-05
UniRef50_Q64YV4 Cluster: Putative secreted tripeptidyl aminopept...   51  3e-05
UniRef50_Q2U0Q2 Cluster: Hydrolytic enzymes of the alpha/beta hy...   42  0.010
UniRef50_UPI0000D56B19 Cluster: PREDICTED: similar to CG31349-PB...   29  0.21
UniRef50_A6SFQ5 Cluster: Putative uncharacterized protein; n=1; ...   34  2.7
UniRef50_A5UT10 Cluster: Spermine synthase; n=2; Roseiflexus|Rep...   33  4.7
UniRef50_A1IEM6 Cluster: Hydrolase of the alpha/beta-hydrolase f...   33  4.7
UniRef50_UPI0000D9A547 Cluster: PREDICTED: similar to mucin 4, p...   33  6.2
UniRef50_A3LVZ5 Cluster: Predicted protein; n=1; Pichia stipitis...   33  6.2
UniRef50_Q1KMD3 Cluster: Heterogeneous nuclear ribonucleoprotein...   33  6.2
UniRef50_Q73LY9 Cluster: Putative uncharacterized protein; n=1; ...
33 8.2 UniRef50_A3Y6D3 Cluster: Sensor protein; n=1; Marinomonas sp. ME... 33 8.2 >UniRef50_UPI000051A875 Cluster: PREDICTED: similar to CG9953-PA; n=2; Apocrita|Rep: PREDICTED: similar to CG9953-PA - Apis mellifera Length = 493 Score = 212 bits (517), Expect = 7e-54 Identities = 98/159 (61%), Positives = 114/159 (71%) Frame = +2 Query: 188 GRSNGGNLGIPGGDYQSNLPPPQWFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVF 367 GRS GNLG P LP QWF Q LDH +P+D R W+QRY++N +Y K GPVF Sbjct: 3 GRSKYGNLGAPILSENYKLPNEQWFTQFLDHFDPTDARVWQQRYFINGEYY--KKGGPVF 60 Query: 368 LMIGGEGPADARWMVTGTWINYAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQA 547 LMI GE A A+WMV G WI YAK+F ALC +EHRFYG+SHPT DLS+KNL++LSS QA Sbjct: 61 LMISGESTATAKWMVKGQWIEYAKQFGALCFQVEHRFYGKSHPTSDLSVKNLKYLSSQQA 120 Query: 548 LADLANFISSMKQKFRLNEKVKWIAFGGSYPGSLAAWLR 664 LADLA FI M ++L+ KWIAFGGSY GSLAAWLR Sbjct: 121 LADLAYFIEIMNIDYKLSNDTKWIAFGGSYAGSLAAWLR 159 >UniRef50_Q9VS02 Cluster: CG9953-PA; n=6; Endopterygota|Rep: CG9953-PA - Drosophila melanogaster (Fruit fly) Length = 508 Score = 205 bits (500), Expect = 9e-52 Identities = 97/165 (58%), Positives = 116/165 (70%), Gaps = 3/165 (1%) Frame = +2 Query: 179 FHLGRSNGGNLGIPGG--DYQSNLPPPQ-WFKQKLDHSNPSDLRTWKQRYYVNDSFYDFK 349 F GR G LG P Q +L WF+Q+LDH SD RTW+QRY+VN FY Sbjct: 29 FRRGRLTKGFLGEPSKIPTLQRSLHSEDLWFEQRLDHFKSSDKRTWQQRYFVNADFYRND 88 Query: 350 NQGPVFLMIGGEGPADARWMVTGTWINYAKKFNALCINLEHRFYGESHPTLDLSIKNLQF 529 + PVFLMIGGEG A A+WM G W++YA+ F ALC+ LEHRFYG+SHPT DLS +NL + Sbjct: 89 SSAPVFLMIGGEGEASAKWMREGAWVHYAEHFGALCLQLEHRFYGKSHPTADLSTENLHY 148 Query: 530 LSSYQALADLANFISSMKQKFRLNEKVKWIAFGGSYPGSLAAWLR 664 LSS QAL DLA+F+++MK KF L + KWIAFGGSYPGSLAAW R Sbjct: 149 LSSEQALEDLASFVTAMKVKFNLGDGQKWIAFGGSYPGSLAAWAR 193 >UniRef50_A1L226 Cluster: Zgc:158605; n=8; Deuterostomia|Rep: Zgc:158605 - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 488 Score = 191 bits (466), Expect = 1e-47 Identities = 90/137 (65%), Positives = 105/137 (76%) Frame = +2 Query: 254 
QWFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINY 433 QWF Q+LDH N +D R WKQRY+VNDSFY + GPVFLMIGGEGPA+ WM GTW+ Y Sbjct: 47 QWFIQRLDHFNGADSRVWKQRYFVNDSFY--RVGGPVFLMIGGEGPANPAWMQYGTWLTY 104 Query: 434 AKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVK 613 A+K ALC+ LEHRFYG+SHPT DLS +NL+FLSS QALADLA+F ++ R K Sbjct: 105 AQKLGALCLLLEHRFYGKSHPTEDLSTENLRFLSSRQALADLAHF-RTVTAAARGLTNSK 163 Query: 614 WIAFGGSYPGSLAAWLR 664 W+AFGGSYPGSLAAW R Sbjct: 164 WVAFGGSYPGSLAAWFR 180 >UniRef50_A7RYG7 Cluster: Predicted protein; n=1; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 444 Score = 169 bits (412), Expect = 4e-41 Identities = 78/140 (55%), Positives = 99/140 (70%) Frame = +2 Query: 245 PPPQWFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTW 424 PP WF Q+LDH + S+ TWKQR+Y ND+F K+ PVFLM+GGEG W++ G Sbjct: 14 PPENWFIQRLDHFDDSNTETWKQRFYYNDTFRKTKDS-PVFLMVGGEGAISPVWVLIGNM 72 Query: 425 INYAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNE 604 + YA+ F A+ LEHRFYG+SHP D+S NL++L+S QALADLA F +M KF L + Sbjct: 73 MKYAEGFGAMAFILEHRFYGQSHPRSDMSDANLKYLNSEQALADLAAFRQAMSVKFNLTD 132 Query: 605 KVKWIAFGGSYPGSLAAWLR 664 KWI+FGGSYPGSL+AWLR Sbjct: 133 S-KWISFGGSYPGSLSAWLR 151 >UniRef50_Q555E5 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 487 Score = 151 bits (367), Expect = 1e-35 Identities = 80/194 (41%), Positives = 120/194 (61%), Gaps = 10/194 (5%) Frame = +2 Query: 113 MKLYTILFNLYVALISVDGVKKFHLGRSNGGNLGIPGGDYQS--------NLPPPQWFKQ 268 MK+ I+ +L + ++G + + G +PG D + N PP QWF Sbjct: 1 MKIIFIILSLLFFIGIINGHRNHDSPLNKGLKHRVPGFDSRPSSDRRVNPNDPPVQWFTN 60 Query: 269 KLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWI--NYAKK 442 ++DH +P + T+KQ++YVND++Y PVF ++GGEGP A + VTG ++ YA+K Sbjct: 61 RVDHYDPQNRNTFKQKFYVNDTYYT--PGSPVFYILGGEGPVGASY-VTGHFVFNQYAQK 117 Query: 443 FNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKWIA 622 FNAL + 
+EHRFYG+S P LS++NL++L++ QALAD A F+ + QK+ KWI+ Sbjct: 118 FNALLVAIEHRFYGDSIPMGSLSLENLKYLTTQQALADYAAFVPFLTQKYNTGSS-KWIS 176 Query: 623 FGGSYPGSLAAWLR 664 FGGSY G+L+ WLR Sbjct: 177 FGGSYSGNLSGWLR 190 >UniRef50_Q54CF7 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 486 Score = 145 bits (352), Expect = 7e-34 Identities = 66/137 (48%), Positives = 98/137 (71%) Frame = +2 Query: 254 QWFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINY 433 QWF Q +DH NP++ T++QRY +ND ++D GPVF+MI GEGP D + ++ + Sbjct: 52 QWFTQSVDHFNPANPTTFQQRYLINDQYWD--GTGPVFIMINGEGPMDINTVTQLQFVVW 109 Query: 434 AKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVK 613 AK+ +AL ++LEHR+YG S T DLS++NLQ+L+S QALAD A F + + Q++ + ++ K Sbjct: 110 AKQVSALVVSLEHRYYGASFVTEDLSLENLQWLNSAQALADNAVFRNFVAQQYNVPKESK 169 Query: 614 WIAFGGSYPGSLAAWLR 664 WI+FGGSY G+L +W R Sbjct: 170 WISFGGSYSGALTSWFR 186 >UniRef50_P90893 Cluster: Putative serine protease F56F10.1 precursor; n=2; Caenorhabditis|Rep: Putative serine protease F56F10.1 precursor - Caenorhabditis elegans Length = 540 Score = 138 bits (335), Expect = 8e-32 Identities = 78/191 (40%), Positives = 108/191 (56%), Gaps = 13/191 (6%) Frame = +2 Query: 131 LFNLYVALISVDGVKKFHLGRSNGGNL---------GIPGGDYQSNLPPPQW--FKQKLD 277 L L + L+ + F LGR NG L G Q P Q F QKLD Sbjct: 5 LLLLLLPLLIEAKLPPFFLGRLNGKTLLNHHLDRLTASDGASIQETYPNLQVHNFTQKLD 64 Query: 278 HSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGT--WINYAKKFNA 451 H +P + +TW Q+Y+ N F +N +FLMIGGEGP + +W ++ +AK+F A Sbjct: 65 HFDPYNTKTWNQKYFYNPVFS--RNNSIIFLMIGGEGPENGKWAANPNVQYLQWAKEFGA 122 Query: 452 LCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKWIAFGG 631 +LEHRF+G+S P D+ +L++L++ QALADLA FI M Q++ + +W+ FGG Sbjct: 123 DVFDLEHRFFGDSWPIPDMQTSSLRYLTTQQALADLAFFIEFMNQQYGF-KNPRWVTFGG 181 Query: 632 SYPGSLAAWLR 664 SYPGSLAAW R Sbjct: 182 SYPGSLAAWFR 192 >UniRef50_P34528 Cluster: Putative serine protease K12H4.7 
precursor; n=3; Caenorhabditis|Rep: Putative serine protease K12H4.7 precursor - Caenorhabditis elegans Length = 510 Score = 138 bits (335), Expect = 8e-32 Identities = 70/137 (51%), Positives = 90/137 (65%), Gaps = 2/137 (1%) Frame = +2 Query: 260 FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWM-VTGTWI-NY 433 F Q LDH + S +T++QRYY N+ +Y K GP FLM+GGEGP + W+ G I N Sbjct: 63 FTQTLDHFDSSVGKTFQQRYYHNNQWY--KAGGPAFLMLGGEGPESSYWVSYPGLEITNL 120 Query: 434 AKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVK 613 A K A ++EHRFYGE+HPT D+S+ NL++LSS QA+ D A FI +M KF K Sbjct: 121 AAKQGAWVFDIEHRFYGETHPTSDMSVPNLKYLSSAQAIEDAAAFIKAMTAKFPQLANAK 180 Query: 614 WIAFGGSYPGSLAAWLR 664 W+ FGGSY G+LAAW R Sbjct: 181 WVTFGGSYSGALAAWTR 197 >UniRef50_Q54GI7 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 481 Score = 136 bits (329), Expect = 4e-31 Identities = 77/194 (39%), Positives = 113/194 (58%), Gaps = 1/194 (0%) Frame = +2 Query: 86 FNSNQMSTNMKLYTILFNLYVALISVDGVKKFHLGRSNGGNLGIPGGDYQSNLPPPQWFK 265 F +N+M+ +K+ ++F + V +I+ K+ N L + G S P QWF Sbjct: 1 FQNNKMNKLIKII-VIFTIIVNVINGLAYPKY-----NAEELILDG----SGSFPAQWFT 50 Query: 266 QKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADA-RWMVTGTWINYAKK 442 Q LDH N + +T++Q+YYVND +Y++KN GP+ L I GEGP + + + YA+ Sbjct: 51 QTLDHFNFQNNQTFQQKYYVNDQYYNYKNGGPIILYINGEGPVSSPPYSSDDGVVIYAQA 110 Query: 443 FNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKWIA 622 N + + LEHRFYGES P +L+I+NLQ+LS QAL DLA F+ + K L + Sbjct: 111 LNCMIVTLEHRFYGESSPFSELTIENLQYLSHQQALEDLATFVVDFQSK--LVGAGHIVT 168 Query: 623 FGGSYPGSLAAWLR 664 GGSY G+L+AW R Sbjct: 169 IGGSYSGALSAWFR 182 >UniRef50_Q5HZ74 Cluster: MGC85068 protein; n=6; Xenopus|Rep: MGC85068 protein - Xenopus laevis (African clawed frog) Length = 506 Score = 136 bits (328), Expect = 6e-31 Identities = 66/153 (43%), Positives = 97/153 (63%) Frame = +2 Query: 206 
NLGIPGGDYQSNLPPPQWFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGE 385 N +P G + + + Q LDH N + T+ QRY++N+ ++++ + GPVFL IGGE Sbjct: 47 NRWMPKGAFPNTPSVESFIVQPLDHFNRRNNGTYNQRYWINEQYWNYPD-GPVFLYIGGE 105 Query: 386 GPADARWMVTGTWINYAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLAN 565 G +++G + A+ AL ++LEHRFYG S L+++N++FLSS QALADLA+ Sbjct: 106 GSLSEFSVLSGEHVELAQTHRALLVSLEHRFYGSSINIDGLTLENIKFLSSQQALADLAS 165 Query: 566 FISSMKQKFRLNEKVKWIAFGGSYPGSLAAWLR 664 F + QK+ L + WI FGGSYPGSL+AW R Sbjct: 166 FHMFISQKYNLTRQNTWICFGGSYPGSLSAWFR 198 >UniRef50_Q7R4U6 Cluster: GLP_440_23177_21609; n=1; Giardia lamblia ATCC 50803|Rep: GLP_440_23177_21609 - Giardia lamblia ATCC 50803 Length = 522 Score = 132 bits (318), Expect = 1e-29 Identities = 70/142 (49%), Positives = 93/142 (65%), Gaps = 2/142 (1%) Frame = +2 Query: 245 PPPQWFK-QKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGT 421 P WF+ Q +DH + ++ + W QRYY ND++Y K GPVFLMIGGEGPA R + Sbjct: 54 PGELWFREQHVDHFDSTNTKKWSQRYYYNDTYY--KAGGPVFLMIGGEGPATPRDVGDYF 111 Query: 422 WINY-AKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRL 598 I+Y AK N L + LEHRFYG S P+ + + NL L S QALAD+A F++ +K+++ L Sbjct: 112 SIDYFAKNMNGLKVALEHRFYGASFPSTNSA--NLSLLRSDQALADIATFLAYLKREYNL 169 Query: 599 NEKVKWIAFGGSYPGSLAAWLR 664 E K +A GGSY G+LAAW R Sbjct: 170 PEGTKIVAVGGSYSGNLAAWAR 191 >UniRef50_Q010M0 Cluster: Prolylcarboxypeptidase; n=2; Ostreococcus|Rep: Prolylcarboxypeptidase - Ostreococcus tauri Length = 542 Score = 126 bits (304), Expect = 5e-28 Identities = 78/188 (41%), Positives = 107/188 (56%), Gaps = 20/188 (10%) Frame = +2 Query: 161 VDGVKKFHLGRSNGGNLGIPGGDYQSN---LPPPQWFKQKLDHSNPSDLRTWKQRYYVND 331 VDG+++ + R+ GG GD++ N +WF Q LDH + D R W QRY+VN+ Sbjct: 29 VDGLRRASVARALGG----ARGDFEINDDVEDAERWFDQTLDHFDHVDRRRWSQRYFVNE 84 Query: 332 SFYD-FKNQGPVFLMIGGEGPA-DARWMVTG-----TWINYAKKFNALCINLEHRFYGES 490 F D + PVF+ +GGEGPA AR ++ G T I+ AKK + + LEHRFYG S Sbjct: 85 GFVDKIEASTPVFVCVGGEGPALTARAVLDGGTHCGTMIDLAKKHRGIALALEHRFYGAS 144 Query: 491 
HPTLDLSIKNLQFLSSYQALADLANFISSMKQKF----------RLNEKVKWIAFGGSYP 640 PT DLS ++L++L+S QAL D+ F+ + + R + IAFGGSYP Sbjct: 145 QPTGDLSRESLRYLTSAQALEDVVAFVKYVADAYGLRTTPSDDGRNGSYSRVIAFGGSYP 204 Query: 641 GSLAAWLR 664 G LAAW R Sbjct: 205 GMLAAWSR 212 >UniRef50_A5CG77 Cluster: Intestinal prolyl carboxypeptidase 2; n=2; Haemonchus contortus|Rep: Intestinal prolyl carboxypeptidase 2 - Haemonchus contortus (Barber pole worm) Length = 1143 Score = 126 bits (304), Expect = 5e-28 Identities = 63/139 (45%), Positives = 84/139 (60%), Gaps = 3/139 (2%) Frame = +2 Query: 257 WFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGT--WIN 430 +FKQKLDH+ TW QRY+ + +Y K FLM+GG G D W+ ++ Sbjct: 53 YFKQKLDHTKDDGEGTWPQRYFYSQRYYR-KGGNVFFLMLGGMGVMDIGWVTNEKLPFVQ 111 Query: 431 YAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRL-NEK 607 + K+ A LEHRFYG+S PT +LS++NL +L+ QA+ D+ANFI M K R+ +E Sbjct: 112 WGKERGAQLYALEHRFYGKSRPTPNLSVRNLAYLTIDQAIGDVANFIKEMNAKHRIXDED 171 Query: 608 VKWIAFGGSYPGSLAAWLR 664 KWI FGGSY SLA W R Sbjct: 172 AKWIVFGGSYAASLALWAR 190 Score = 107 bits (258), Expect = 2e-22 Identities = 70/176 (39%), Positives = 101/176 (57%), Gaps = 11/176 (6%) Frame = +2 Query: 170 VKKFHLGRSNGGNLGIPGGDYQSNLPPPQ---WFKQKLDHSNPSDLRTWKQRYYVNDSFY 340 +++ HLGR G P D ++P +F Q +DH N + T++QRY+ ND + Sbjct: 576 LRRVHLGRPPHGLF--PDPDPLPDMPVQYEAGYFTQPVDHFNNKNPYTFEQRYFKNDQWA 633 Query: 341 DFKNQGPVFLMIGGEGPADARWMVTG--TWINYAKKFNALCINLEHRFYGESH--PTLD- 505 K GP+FLMIGGE D+ W++ T++ +A +F A LE R+YG+S +LD Sbjct: 634 --KPNGPIFLMIGGESERDSSWVLNENLTYLKWADEFGATVYALEXRYYGKSDLFDSLDP 691 Query: 506 -LSIKNLQ--FLSSYQALADLANFISSMKQKFRLNEKVKWIAFGGSYPGSLAAWLR 664 +S KN +LSS Q L D+ANFI ++ + + KWI FGGSY GSLA W+R Sbjct: 692 AVSKKNTYTTYLSSLQMLYDVANFIRAVDAE--RGQHGKWIMFGGSYAGSLALWMR 745 >UniRef50_UPI0000DB6BB8 Cluster: PREDICTED: similar to CG3734-PA; n=2; Apocrita|Rep: PREDICTED: similar to CG3734-PA - Apis mellifera Length = 478 Score = 122 bits (295), Expect = 6e-27 Identities = 65/137 (47%), Positives = 81/137 (59%), Gaps = 1/137 
(0%) Frame = +2 Query: 257 WFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYA 436 W +Q LDH NP D RTW RY N F FK GP+ +MIGGE ++ G A Sbjct: 48 WIQQPLDHFNPRDNRTWSMRYLENSRF--FKENGPILIMIGGEWAISKGFLRAGLMYELA 105 Query: 437 KKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQ-KFRLNEKVK 613 +A EHR+YG+S PT D S +NLQ+LS QALADLA FI + K+ + R N V Sbjct: 106 SNHSASMYYTEHRYYGKSKPTNDTSSRNLQYLSVDQALADLAYFIKTKKKDESRRNSTV- 164 Query: 614 WIAFGGSYPGSLAAWLR 664 I FGGSY G++A+W R Sbjct: 165 -IVFGGSYAGNVASWAR 180 >UniRef50_A7SYK4 Cluster: Predicted protein; n=1; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 502 Score = 122 bits (294), Expect = 8e-27 Identities = 61/136 (44%), Positives = 89/136 (65%), Gaps = 1/136 (0%) Frame = +2 Query: 260 FKQKLDHSNPSDL-RTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYA 436 F+Q +DH + RT+ QRY++N +F+ + GPV L +GGE ++ G ++ A Sbjct: 61 FEQYIDHFEFTPRPRTYLQRYWMNRAFWKGPD-GPVLLYVGGESVLSGGYIAGGHIVDIA 119 Query: 437 KKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKW 616 K++ AL +EHR+YG+S+ L KN+++LSS ALADLA F++ K KF L +K KW Sbjct: 120 KEYGALLFAVEHRYYGKSNFFGCLKTKNMRYLSSQLALADLAQFVAHAKNKFGLTDKNKW 179 Query: 617 IAFGGSYPGSLAAWLR 664 I +GGSYPGSL+AW R Sbjct: 180 ITYGGSYPGSLSAWFR 195 >UniRef50_Q9NQE7 Cluster: Thymus-specific serine protease precursor; n=14; Theria|Rep: Thymus-specific serine protease precursor - Homo sapiens (Human) Length = 514 Score = 122 bits (293), Expect = 1e-26 Identities = 67/157 (42%), Positives = 90/157 (57%) Frame = +2 Query: 194 SNGGNLGIPGGDYQSNLPPPQWFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLM 373 S+ LG+ G + LP W +Q LD N SD R++ QRY+VND + GP+FL Sbjct: 40 SSAQGLGLSLGPGAAALPKVGWLEQLLDPFNVSDRRSFLQRYWVNDQHW-VGQDGPIFLH 98 Query: 374 IGGEGPADARWMVTGTWINYAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALA 553 +GGEG ++ G A + AL I+LEHRFYG S P L + L+FLSS ALA Sbjct: 99 LGGEGSLGPGSVMRGHPAALAPAWGALVISLEHRFYGLSIPAGGLEMAQLRFLSSRLALA 158 Query: 554 DLANFISSMKQKFRLNEKVKWIAFGGSYPGSLAAWLR 664 D+ + ++ + F ++ WI FGGSY GSLAAW R Sbjct: 159 
DVVSARLALSRLFNISSSSPWICFGGSYAGSLAAWAR 195 >UniRef50_Q8SXS7 Cluster: RE36938p; n=1; Drosophila melanogaster|Rep: RE36938p - Drosophila melanogaster (Fruit fly) Length = 473 Score = 116 bits (280), Expect = 4e-25 Identities = 56/150 (37%), Positives = 90/150 (60%) Frame = +2 Query: 215 IPGGDYQSNLPPPQWFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPA 394 +P ++++ W +QKLDH +P + RTW+ RY +ND+ Y ++ P+F+ +GGE Sbjct: 35 LPTTQNRADVVQTLWIEQKLDHFDPEETRTWQMRYMLNDALY--QSGAPLFIYLGGEWEI 92 Query: 395 DARWMVTGTWINYAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFIS 574 + + G + AK+ NAL EHR+YG+S P DLS +N+++L+ Q+LADLA FI+ Sbjct: 93 SSGRITGGHLYDMAKEHNALLAYTEHRYYGQSKPLPDLSNENIKYLNVNQSLADLAYFIN 152 Query: 575 SMKQKFRLNEKVKWIAFGGSYPGSLAAWLR 664 ++KQ K I GGSY ++ W + Sbjct: 153 TIKQNHEGLSDSKVIIVGGSYSATMVTWFK 182 >UniRef50_Q54G47 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 469 Score = 114 bits (275), Expect = 2e-24 Identities = 60/135 (44%), Positives = 84/135 (62%) Frame = +2 Query: 260 FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYAK 439 F+Q +DH + + T+KQRY V D + F GP+F + GE P +N+A+ Sbjct: 52 FEQNVDHYDYFNNNTFKQRYIVVDDY--FTGDGPIFFYLAGEAPMGFFGFQEVQVVNWAQ 109 Query: 440 KFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKWI 619 F AL I LEHR+YGES+P DLS NL++L+S QAL+D ANF+S+ KQ L + + + Sbjct: 110 DFGALFIVLEHRYYGESYPVDDLSTHNLKYLTSQQALSDAANFLSTYKQDNNLIDN-QVV 168 Query: 620 AFGGSYPGSLAAWLR 664 FG SY G+L+AW R Sbjct: 169 VFGCSYSGALSAWFR 183 >UniRef50_Q22N05 Cluster: Serine carboxypeptidase S28 family protein; n=1; Tetrahymena thermophila SB210|Rep: Serine carboxypeptidase S28 family protein - Tetrahymena thermophila SB210 Length = 480 Score = 113 bits (273), Expect = 3e-24 Identities = 64/142 (45%), Positives = 85/142 (59%), Gaps = 6/142 (4%) Frame = +2 Query: 257 WFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYA 436 ++ Q LDH NP+D RTW+QRY + Y+ N G VF+ IGGEG G + A Sbjct: 42 
YYTQVLDHFNPNDQRTWQQRYAIYSDEYNPVN-GTVFVYIGGEGKQKGLSPGLGWMVELA 100 Query: 437 KKFNALCINLEHRFYGESHP----TLDLSIKNLQFLSSYQALADLANFISSMK--QKFRL 598 KKF+AL + +EHRFYG S P S +NL +LS QAL DLA I++ K + L Sbjct: 101 KKFSALFLIVEHRFYGASQPFGKDENSYSNQNLAYLSVEQALEDLAQIIANFKTLRLHGL 160 Query: 599 NEKVKWIAFGGSYPGSLAAWLR 664 +E V +I GGSYPG+++AW R Sbjct: 161 SENVPFITIGGSYPGAVSAWFR 182 >UniRef50_Q67ZA2 Cluster: Prolyl carboxypeptidase like protein; n=13; core eudicotyledons|Rep: Prolyl carboxypeptidase like protein - Arabidopsis thaliana (Mouse-ear cress) Length = 488 Score = 113 bits (271), Expect = 5e-24 Identities = 66/145 (45%), Positives = 88/145 (60%), Gaps = 9/145 (6%) Frame = +2 Query: 257 WFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKN--QGPVFLMIGGEGPADARWMVTGTWIN 430 WF Q LDH +PSD R +KQRYY + D GP+F+MI GEGP + + +I Sbjct: 49 WFNQTLDHYSPSDHREFKQRYY---EYLDHLRVPDGPIFMMICGEGPCNG---IPNDYIT 102 Query: 431 Y-AKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANF----ISSMKQKFR 595 AKKF+A ++LEHR+YG+S P L+ +NL++LSS QAL DLA F S+ KF Sbjct: 103 VLAKKFDAGIVSLEHRYYGKSSPFKSLATENLKYLSSKQALFDLAAFRQYYQDSLNVKFN 162 Query: 596 LNEKVK--WIAFGGSYPGSLAAWLR 664 + V+ W FG SY G+L+AW R Sbjct: 163 RSGDVENPWFFFGASYSGALSAWFR 187 >UniRef50_Q54D54 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 485 Score = 113 bits (271), Expect = 5e-24 Identities = 67/184 (36%), Positives = 108/184 (58%), Gaps = 1/184 (0%) Frame = +2 Query: 113 MKLYTILFNLYVALISVDGVKKFHLGRSNGGNLGIPGGDYQSNLPPPQWFKQKLDHSNPS 292 +K+Y + + L ++++S+ F + + ++ I Q + Q F QK+DH N Sbjct: 4 LKIYILFYILIISMVSISQCNSFLI--KSKPDVMIVDESIQIHEIVYQLFVQKVDHFNLL 61 Query: 293 DLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYAKKFNALCINLEH 472 D RT+ QR+ VN +++ GPVF +I GE +A + + + +AK+ NAL ++LEH Sbjct: 62 DDRTFFQRFVVNSKYWN--GTGPVFFIISGEQNMEASSVNSCQYTIWAKQLNALIVSLEH 119 Query: 473 RFYGESHPTLDLSIKNLQFLSSYQALADLANFIS-SMKQKFRLNEKVKWIAFGGSYPGSL 649 R+YG S+ T DLS NL++L++ QALAD FI K + + K I+FGGSY G+L 
Sbjct: 120 RYYGGSYVTEDLSTDNLKYLTTQQALADCVVFIDWFTKVYYHVPSSSKIISFGGSYAGTL 179 Query: 650 AAWL 661 +A+L Sbjct: 180 SAYL 183 >UniRef50_A2DLX9 Cluster: Clan SC, family S28, unassigned serine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan SC, family S28, unassigned serine peptidase - Trichomonas vaginalis G3 Length = 518 Score = 112 bits (270), Expect = 6e-24 Identities = 58/136 (42%), Positives = 83/136 (61%) Frame = +2 Query: 257 WFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYA 436 +F Q LDH N SD RT+KQRYY ND+F + + IGGE R + G ++ A Sbjct: 23 YFDQFLDHFNTSDNRTFKQRYYYNDTFCQNTTTKKLIVFIGGEAAITERRVQKGAYMKLA 82 Query: 437 KKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKW 616 K+ ++ + LEHR++GES P +L NL++L+S QALADLA FI S K + + Sbjct: 83 KETDSCVVALEHRYFGESQPFEELITPNLKYLTSDQALADLAYFIESF-IKIKYQSRPTI 141 Query: 617 IAFGGSYPGSLAAWLR 664 + GGSYPG+L+++ R Sbjct: 142 LVVGGSYPGTLSSYFR 157 >UniRef50_UPI000150A973 Cluster: Serine carboxypeptidase S28 family protein; n=1; Tetrahymena thermophila SB210|Rep: Serine carboxypeptidase S28 family protein - Tetrahymena thermophila SB210 Length = 490 Score = 111 bits (267), Expect = 1e-23 Identities = 60/142 (42%), Positives = 84/142 (59%), Gaps = 6/142 (4%) Frame = +2 Query: 257 WFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYA 436 +F+QKLDH P D RTW QRY+V D +++ Q V L I GEG + + A Sbjct: 49 YFQQKLDHYAPLDNRTWAQRYFVMDHWFNKTAQPLVILYICGEGECNGVQYNSSFTSKIA 108 Query: 437 KKFNALCINLEHRFYGESHP----TLDLSIKNLQFLSSYQALADLANFISSMK--QKFRL 598 + N + ++LEHRFYG+S P ++ NL++L++ QAL DLA FI +K Q F + Sbjct: 109 EIHNGIVLSLEHRFYGKSQPFGFGNDSYALPNLKYLTAQQALNDLAWFIQYVKDNQLFGI 168 Query: 599 NEKVKWIAFGGSYPGSLAAWLR 664 + WI GGSYPG+L+AW R Sbjct: 169 TPNMPWITIGGSYPGALSAWFR 190 >UniRef50_A0C0B8 Cluster: Chromosome undetermined scaffold_14, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_14, whole genome shotgun sequence - Paramecium tetraurelia Length = 464 Score = 111 bits (266), Expect = 2e-23 
Identities = 57/139 (41%), Positives = 83/139 (59%), Gaps = 2/139 (1%) Frame = +2 Query: 254 QWFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINY 433 +WF QKLDH++P+ +KQR ++ + + V L I GE D + G + Sbjct: 36 EWFTQKLDHNDPTSQEVFKQRVHIYNEYVKDDQPEAVILYICGEWTCDG--IGKGLTFDA 93 Query: 434 AKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEK-- 607 A++ NA+ + LEHR+YG+S P D S NL++L+ +QAL D+A FI+S+K N K Sbjct: 94 AQQLNAVVLVLEHRYYGQSQPFEDWSTPNLKYLNIHQALDDIAYFITSIKANGNYNIKPD 153 Query: 608 VKWIAFGGSYPGSLAAWLR 664 WI GGSYPG+L+AW R Sbjct: 154 TPWIHLGGSYPGALSAWFR 172 >UniRef50_Q16Y06 Cluster: Lysosomal pro-X carboxypeptidase, putative; n=2; Culicidae|Rep: Lysosomal pro-X carboxypeptidase, putative - Aedes aegypti (Yellowfever mosquito) Length = 467 Score = 109 bits (261), Expect = 8e-23 Identities = 54/134 (40%), Positives = 78/134 (58%) Frame = +2 Query: 257 WFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYA 436 WF+ K+DH NP ++ T+ RYY ND + + +GP+F+++G GP + R++ G + + A Sbjct: 24 WFETKVDHFNPRNVDTFSMRYYSNDE-HSYP-KGPIFVIVGSNGPIETRYLSEGLFYDVA 81 Query: 437 KKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKW 616 A EHR++G S P D S NL FL+ QALADLA F+ +K + N + K Sbjct: 82 YLEGAFLFANEHRYFGHSLPVDDASTNNLDFLTIDQALADLAAFVHHIKHEVVRNPEAKV 141 Query: 617 IAFGGSYPGSLAAW 658 I G Y GSLA W Sbjct: 142 ILMGYGYGGSLATW 155 >UniRef50_A2G2H0 Cluster: Clan SC, family S28, unassigned serine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan SC, family S28, unassigned serine peptidase - Trichomonas vaginalis G3 Length = 496 Score = 109 bits (261), Expect = 8e-23 Identities = 59/135 (43%), Positives = 86/135 (63%) Frame = +2 Query: 260 FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYAK 439 F QKLDH + S T+ QRYY + N +F IGGE P + M++ ++ A+ Sbjct: 8 FTQKLDHFDASSQETFNQRYY-KITKNSTANVSALFFYIGGEAPLIGKRMLSLAPVDLAE 66 Query: 440 KFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKWI 619 K NA+ LEHRF+G S PT +L+I+NL++L+ Q LADLA+FI++MKQ + + V+ Sbjct: 67 
KNNAVLFGLEHRFFGNSAPT-NLTIENLKYLTIEQGLADLAHFINAMKQDY--DHTVRIG 123 Query: 620 AFGGSYPGSLAAWLR 664 GGSYPG+L++W R Sbjct: 124 VIGGSYPGALSSWFR 138 >UniRef50_Q7XCY0 Cluster: Prolyl carboxypeptidase like protein, putative, expressed; n=8; Oryza sativa|Rep: Prolyl carboxypeptidase like protein, putative, expressed - Oryza sativa subsp. japonica (Rice) Length = 507 Score = 108 bits (260), Expect = 1e-22 Identities = 65/161 (40%), Positives = 89/161 (55%), Gaps = 6/161 (3%) Frame = +2 Query: 200 GGNLGIPGGDYQSNLPPPQWFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIG 379 GG LG + +W Q LDH NP+D R +KQRYY +Y +GP+FL I Sbjct: 37 GGRLGGAAAPGRYLTQEERWMDQTLDHFNPTDHRQFKQRYYEFLDYYRAP-KGPIFLYIC 95 Query: 380 GEGPADARWMVTGTWINY-AKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALAD 556 GE + + +++ AKKF A ++ EHR+YG+S P L+ +NL+FLSS QAL D Sbjct: 96 GESSCNG---IPNSYLAVMAKKFGAAVVSPEHRYYGKSSPFESLTTENLRFLSSKQALFD 152 Query: 557 LANF----ISSMKQKF-RLNEKVKWIAFGGSYPGSLAAWLR 664 LA F ++ K+ R W FGGSY G+L+AW R Sbjct: 153 LAVFRQYYQETLNAKYNRSGADSSWFVFGGSYAGALSAWFR 193 >UniRef50_Q22N04 Cluster: Serine carboxypeptidase S28 family protein; n=1; Tetrahymena thermophila SB210|Rep: Serine carboxypeptidase S28 family protein - Tetrahymena thermophila SB210 Length = 485 Score = 108 bits (260), Expect = 1e-22 Identities = 59/142 (41%), Positives = 89/142 (62%), Gaps = 6/142 (4%) Frame = +2 Query: 257 WFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYA 436 +F QK+DH +PS T+ QR+ V ++ N G VF+ IGGEGP +G ++ A Sbjct: 42 YFTQKVDHFDPSSTDTYNQRFTVYSEAFNPAN-GTVFIFIGGEGPQQGLTTGSGWYMLVA 100 Query: 437 KKFNALCINLEHRFYGESHPTLD----LSIKNLQFLSSYQALADLANFISSMKQK--FRL 598 ++F+A+ I +EHRFYG S P ++ +L+FL+ Q+LADLA FIS +K R+ Sbjct: 101 QQFSAMVICVEHRFYGVSQPFGQGQDAWTVDHLKFLTVDQSLADLAYFISYIKANNFLRI 160 Query: 599 NEKVKWIAFGGSYPGSLAAWLR 664 N++ +I GGSYPG+++AW R Sbjct: 161 NDRNPFITVGGSYPGAMSAWFR 182 >UniRef50_Q18198 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 516 Score = 
105 bits (252), Expect = 1e-21 Identities = 56/135 (41%), Positives = 81/135 (60%), Gaps = 2/135 (1%) Frame = +2 Query: 266 QKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTW--INYAK 439 QK+D+ + ++ + QRY+ N +F KN VFLMI GE PA W+ + + +AK Sbjct: 63 QKVDNFDANNNAMYNQRYWYNPTFTQNKNI--VFLMIQGEAPATDTWISNPNYQYLQWAK 120 Query: 440 KFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKWI 619 +F A LEHR +G+S P D S+ ++ + QALAD+ NFI M ++F + KWI Sbjct: 121 EFGADVFQLEHRCFGQSRPYPDTSMPGIKVCTMTQALADIHNFIQQMNRRFNF-QNPKWI 179 Query: 620 AFGGSYPGSLAAWLR 664 FGGSYPG+L+A R Sbjct: 180 TFGGSYPGTLSALFR 194 >UniRef50_Q4RYV8 Cluster: Chromosome 16 SCAF14974, whole genome shotgun sequence; n=3; Clupeocephala|Rep: Chromosome 16 SCAF14974, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 418 Score = 104 bits (249), Expect = 2e-21 Identities = 55/118 (46%), Positives = 75/118 (63%) Frame = +2 Query: 311 QRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYAKKFNALCINLEHRFYGES 490 QR+ VN++F+ GPVFL IGGEGP ++ G ++ A++ +AL + LEHRFYG+S Sbjct: 4 QRFLVNEAFWR-NPDGPVFLYIGGEGPIFEYDVLAGHHVDMAQQHSALLLALEHRFYGDS 62 Query: 491 HPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKWIAFGGSYPGSLAAWLR 664 L ++L LSS QALADLA F + F L+ WI+FGGSY G+L+AW R Sbjct: 63 VNPDGLKTEHLAHLSSKQALADLAVFHQYISGSFNLSHGNTWISFGGSYAGALSAWFR 120 >UniRef50_Q9VDX6 Cluster: CG18493-PA; n=4; Sophophora|Rep: CG18493-PA - Drosophila melanogaster (Fruit fly) Length = 480 Score = 103 bits (248), Expect = 3e-21 Identities = 50/138 (36%), Positives = 83/138 (60%), Gaps = 1/138 (0%) Frame = +2 Query: 254 QWFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQG-PVFLMIGGEGPADARWMVTGTWIN 430 +W QKLD+ N S+ +T++ RY +ND +F+ +G P+F+ +GGE + + G W + Sbjct: 57 KWITQKLDNFNASNTQTYQMRYLLND---EFQTEGSPIFIYLGGEWEIEESMVSAGHWYD 113 Query: 431 YAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKV 610 A++ N + + EHR+YG+S PT +S ++L++L QALAD+A FI + K + Sbjct: 114 MAQEHNGVLVYTEHRYYGQSIPTSTMSTEDLKYLDVKQALADVAVFIETFKAENPQLANS 173 Query: 611 KWIAFGGSYPGSLAAWLR 664 K I GGSY ++ W + 
Sbjct: 174 KVILAGGSYSATMVVWFK 191 >UniRef50_Q9VDX5 Cluster: CG3739-PA; n=5; Drosophila|Rep: CG3739-PA - Drosophila melanogaster (Fruit fly) Length = 547 Score = 103 bits (248), Expect = 3e-21 Identities = 51/138 (36%), Positives = 84/138 (60%), Gaps = 1/138 (0%) Frame = +2 Query: 254 QWFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINY 433 +W QKLD+ + S+ TW+ R Y+N+ + F + P+F+ +GGE D + +G W + Sbjct: 117 RWITQKLDNFDDSNNATWQDRIYINNKY--FVDGSPIFIYLGGEWAIDPSGITSGLWKDI 174 Query: 434 AKKFNALCINLEHRFYGESHPTLDLSIKNL-QFLSSYQALADLANFISSMKQKFRLNEKV 610 AK+ N + EHRF+G+S P LS +NL ++ S QALAD+ N I+++KQ+ + + Sbjct: 175 AKQHNGSLLYTEHRFFGQSIPITPLSTENLAKYQSVEQALADVINVIATLKQEDKYKDS- 233 Query: 611 KWIAFGGSYPGSLAAWLR 664 K + G SY ++A W+R Sbjct: 234 KVVVSGCSYSATMATWIR 251 >UniRef50_Q19589 Cluster: Putative uncharacterized protein F19C7.2; n=3; Caenorhabditis elegans|Rep: Putative uncharacterized protein F19C7.2 - Caenorhabditis elegans Length = 582 Score = 103 bits (248), Expect = 3e-21 Identities = 58/142 (40%), Positives = 84/142 (59%), Gaps = 9/142 (6%) Frame = +2 Query: 266 QKLDH-SNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADA----RWM--VTGTW 424 QK+DH SN ++ W+QRY N FY+ K G VFLM+GGEG + +W+ T Sbjct: 55 QKVDHFSNGTNNGVWQQRYQYNSKFYN-KTTGYVFLMLGGEGSINVTNGDKWVRHEGETM 113 Query: 425 INYAKKFNALCINLEHRFYG--ESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRL 598 + + +F A +EHRFYG E P D + +++ L+ QALAD+ FI+ M + Sbjct: 114 MKWVAEFQAAAFQVEHRFYGSKEYSPIGDQTTASMKLLTIDQALADIKEFITQMNALYFK 173 Query: 599 NEKVKWIAFGGSYPGSLAAWLR 664 ++K W+ FGGSYPGSL+A+ R Sbjct: 174 DDKPIWVTFGGSYPGSLSAFFR 195 >UniRef50_A0DE29 Cluster: Chromosome undetermined scaffold_47, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_47, whole genome shotgun sequence - Paramecium tetraurelia Length = 462 Score = 103 bits (248), Expect = 3e-21 Identities = 58/140 (41%), Positives = 86/140 (61%), Gaps = 5/140 (3%) Frame = +2 Query: 260 FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTW-INYA 436 F Q 
LDHS+P++ +TW+QRY+V +++ +G V L I GE + + + ++ A Sbjct: 34 FTQLLDHSDPANTQTWQQRYHVYSQYFN-PTKGGVILYICGEW--NCQGVSDNSFSFQLA 90 Query: 437 KKFNALCINLEHRFYGESHP--TLDLSIKNLQFLSSYQALADLANFISSMK--QKFRLNE 604 K A+ I LEHRFYG+S P S++NL +L+ +QAL DLA FI MK + ++ Sbjct: 91 KDLGAIVIALEHRFYGQSQPFGADSWSLENLSYLNVHQALDDLAYFILQMKRLKLHSIDS 150 Query: 605 KVKWIAFGGSYPGSLAAWLR 664 + W A GGSYPG+L+AW R Sbjct: 151 TLPWYAIGGSYPGALSAWFR 170 >UniRef50_Q7PX68 Cluster: ENSANGP00000013861; n=3; Culicimorpha|Rep: ENSANGP00000013861 - Anopheles gambiae str. PEST Length = 494 Score = 103 bits (247), Expect = 4e-21 Identities = 51/133 (38%), Positives = 73/133 (54%) Frame = +2 Query: 266 QKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYAKKF 445 Q+LDH +P ++ TW RY N Y GP+F+ +GGE + G + A + Sbjct: 65 QRLDHFDPQNVNTWSMRYMANGEHY--VEGGPLFIYVGGEWEISEGSISRGHVYDMAAEL 122 Query: 446 NALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKWIAF 625 EHRFYG+SHPT+DL L++L+ QALADLA+F+ M++ EK I Sbjct: 123 KGYLFYTEHRFYGQSHPTVDLRTDKLKYLNIDQALADLAHFVVEMRKTIPGAEKSGVIMI 182 Query: 626 GGSYPGSLAAWLR 664 GGSY ++ +W R Sbjct: 183 GGSYSATMVSWFR 195 >UniRef50_Q16Y05 Cluster: Prolylcarboxypeptidase, putative; n=2; Aedes aegypti|Rep: Prolylcarboxypeptidase, putative - Aedes aegypti (Yellowfever mosquito) Length = 500 Score = 103 bits (247), Expect = 4e-21 Identities = 49/135 (36%), Positives = 78/135 (57%) Frame = +2 Query: 260 FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYAK 439 F+ ++DH NP + T++ YY ND FY + GP+F+ +GG P D ++ G + + A Sbjct: 58 FRTRVDHFNPQNRDTFEFEYYSNDEFY--RPGGPIFIFVGGNWPLDQYYIEHGHFHDIAN 115 Query: 440 KFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKWI 619 NA EHR+YG S P +LS++NLQ+L+ QA+ DLA I ++ ++ + I Sbjct: 116 YENAWMFANEHRYYGHSFPVPNLSVENLQYLTVEQAMVDLAELIYHVRHNVVRDDDARVI 175 Query: 620 AFGGSYPGSLAAWLR 664 G Y G++A W+R Sbjct: 176 LLGTGYAGAIATWMR 190 >UniRef50_Q93Z34 Cluster: At2g24280/F27D4.19; n=6; core eudicotyledons|Rep: At2g24280/F27D4.19 - Arabidopsis thaliana (Mouse-ear cress) Length = 494 
Score = 101 bits (243), Expect = 1e-20 Identities = 59/142 (41%), Positives = 81/142 (57%), Gaps = 5/142 (3%) Frame = +2 Query: 254 QWFKQKLDHSN--PSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWI 427 ++F Q LDH + P + + Q+Y +N+ F+ + GP+F+ G EG D TG + Sbjct: 49 RYFPQNLDHFSFTPDSYKVFHQKYLINNRFW--RKGGPIFVYTGNEGDIDWFASNTGFML 106 Query: 428 NYAKKFNALCINLEHRFYGESHP---TLDLSIKNLQFLSSYQALADLANFISSMKQKFRL 598 + A KF AL + +EHRFYGES P S + L +L+S QALAD A I S+KQ Sbjct: 107 DIAPKFRALLVFIEHRFYGESTPFGKKSHKSAETLGYLNSQQALADYAILIRSLKQNLS- 165 Query: 599 NEKVKWIAFGGSYPGSLAAWLR 664 +E + FGGSY G LAAW R Sbjct: 166 SEASPVVVFGGSYGGMLAAWFR 187 >UniRef50_UPI000049885B Cluster: serine protease; n=1; Entamoeba histolytica HM-1:IMSS|Rep: serine protease - Entamoeba histolytica HM-1:IMSS Length = 466 Score = 101 bits (242), Expect = 2e-20 Identities = 54/135 (40%), Positives = 86/135 (63%), Gaps = 2/135 (1%) Frame = +2 Query: 266 QKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWI--NYAK 439 Q +DH + ++ +T RY++ND+ Y + P+ + +GGEG A V G ++ YA+ Sbjct: 36 QPIDHFDLTNKKTINIRYFINDTIYS--KEAPLLVDLGGEGTQRAA-AVGGRFVINKYAE 92 Query: 440 KFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKWI 619 K+N+L + +EHRFYG+S P LS +NL +LS+ QAL D I+ +K+++++ V I Sbjct: 93 KYNSLMLAIEHRFYGKSVPEGGLSQENLGYLSAAQALEDYIMIINQIKKEYQITGPV--I 150 Query: 620 AFGGSYPGSLAAWLR 664 FGGSY G+LA W+R Sbjct: 151 VFGGSYSGNLATWIR 165 >UniRef50_Q23AY4 Cluster: Serine carboxypeptidase S28 family protein; n=1; Tetrahymena thermophila SB210|Rep: Serine carboxypeptidase S28 family protein - Tetrahymena thermophila SB210 Length = 873 Score = 100 bits (239), Expect = 4e-20 Identities = 59/134 (44%), Positives = 79/134 (58%), Gaps = 4/134 (2%) Frame = +2 Query: 275 DHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYAKKFNAL 454 DH N ++ RTW QRY+V D +Y+ +N G V L I GE I A+KF++L Sbjct: 434 DHFNITNNRTWSQRYWVLDQYYNPQN-GSVLLYICGEYTCPGIPEERQFPILLAQKFSSL 492 Query: 455 CINLEHRFYGESHPTLDLSIK--NLQFLSSYQALADLANFISSMKQK--FRLNEKVKWIA 622 + LEHRFYG S P D S+K NL L+ 
QALADLA FI+ +K + + W+ Sbjct: 493 VLVLEHRFYGNSMPFGDQSMKQHNLYLLNVDQALADLAYFITYVKDHHLHGVQNHIPWLT 552 Query: 623 FGGSYPGSLAAWLR 664 GGSYPG+++AW R Sbjct: 553 IGGSYPGAMSAWFR 566 Score = 37.1 bits (82), Expect = 0.38 Identities = 34/135 (25%), Positives = 61/135 (45%), Gaps = 5/135 (3%) Frame = +2 Query: 263 KQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQ-GPVFLMIGGEGPA--DARWMVTGTWINY 433 +Q+ DH + ++ W QRY++ + Q G V ++ + D + + + Sbjct: 34 EQRYDHFS-NNFELWDQRYFIAKNEKSQNGQLGKVNIIFVCDKDLTHDILSCIPPFFDSQ 92 Query: 434 AKKFNALCINLEHRFYGESHPTLD--LSIKNLQFLSSYQALADLANFISSMKQKFRLNEK 607 + + LE R+YGES P L I L + S Q +AD+A F+S +K+ ++ Sbjct: 93 RRNSDVNIFLLEMRYYGESQPYSSRYLGIDYLSYQSIQQNIADIALFVSFLKKDNMVSSD 152 Query: 608 VKWIAFGGSYPGSLA 652 K I + G +A Sbjct: 153 SKKIKYPHLIDGVIA 167 >UniRef50_Q5YEQ9 Cluster: Serine peptidase; n=1; Bigelowiella natans|Rep: Serine peptidase - Bigelowiella natans (Pedinomonas minutissima) (Chlorarachnion sp.(strain CCMP 621)) Length = 546 Score = 99 bits (238), Expect = 5e-20 Identities = 70/179 (39%), Positives = 95/179 (53%), Gaps = 18/179 (10%) Frame = +2 Query: 182 HLGRSNGGNLGIPGGDYQSNLPPPQWFKQKLDH-SNPSDLRTWKQRYYVNDSFYDFKNQG 358 H+ + GN + SN + LDH SD + W QRYYV+ S + + Sbjct: 34 HVSFTLQGNQSLLESHAGSNSTTHFYKNALLDHFGGLSDEKHWLQRYYVDSSQWGGEGY- 92 Query: 359 PVFLMIGGEGPADARWMVTGTWINY--AKKFNALCINLEHRFYGESHPTLDLSIKNLQFL 532 PVFL IGGEGP V+ + Y A + AL + LEHRFYGES P D+S NL+FL Sbjct: 93 PVFLYIGGEGPQGP---VSSSLFMYELAVEHKALVLALEHRFYGESRPVEDMSDANLKFL 149 Query: 533 SSYQALADLANFISSMK-QKFRLN--------------EKVKWIAFGGSYPGSLAAWLR 664 +S+QAL DLA F+ +K +N ++ ++AFGGSYPG+LAAW + Sbjct: 150 TSHQALGDLARFVEYIKAYDPNVNDAKSSPPLSLPASAQESPFVAFGGSYPGNLAAWFK 208 >UniRef50_Q5DC37 Cluster: SJCHGC02147 protein; n=1; Schistosoma japonicum|Rep: SJCHGC02147 protein - Schistosoma japonicum (Blood fluke) Length = 472 Score = 99 bits (238), Expect = 5e-20 Identities = 61/143 (42%), Positives = 78/143 (54%), Gaps = 3/143 (2%) Frame = +2 Query: 245 PPPQWFKQKLDH-SNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGT 421 P +F Q 
LDH S + T+KQRY D + FK GP+F G EG W TG Sbjct: 28 PKENYFDQTLDHFSFQARNLTFKQRYLYEDKW--FKPNGPIFFYCGNEGEIGGFWNNTGL 85 Query: 422 WINYAKKFNALCINLEHRFYGESHPTLDLSIKN--LQFLSSYQALADLANFISSMKQKFR 595 A FNA + EHR+YG+S P D S + +Q+LS QALAD A I +K KF Sbjct: 86 VFELAPSFNAFILFAEHRYYGKSLP-FDKSFQQPYIQYLSIGQALADYAYLIEGIKSKFN 144 Query: 596 LNEKVKWIAFGGSYPGSLAAWLR 664 + + +AFGGSY G LAA++R Sbjct: 145 MT-RSPVVAFGGSYGGMLAAYMR 166 >UniRef50_Q19590 Cluster: Putative uncharacterized protein F19C7.4; n=2; Caenorhabditis|Rep: Putative uncharacterized protein F19C7.4 - Caenorhabditis elegans Length = 542 Score = 99 bits (238), Expect = 5e-20 Identities = 57/142 (40%), Positives = 83/142 (58%), Gaps = 9/142 (6%) Frame = +2 Query: 266 QKLDH-SNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADA----RWM--VTGTW 424 QK+DH SN +++ W+Q Y N FY+ K G VFLMIGGE + RW+ T Sbjct: 55 QKVDHFSNGTNIGVWQQHYQYNWKFYN-KTTGYVFLMIGGESSINKTNGDRWIRHEGETM 113 Query: 425 INYAKKFNALCINLEHRFYG--ESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRL 598 + + +F A +EHRFYG E P D + +++ L+ QALAD+ FI+ + + Sbjct: 114 MKWVAEFQAAAFQVEHRFYGSKEYSPIGDQTTASMKLLTIDQALADIKEFITQINALYFK 173 Query: 599 NEKVKWIAFGGSYPGSLAAWLR 664 ++K W+ FGGSYPGSL+A+ R Sbjct: 174 DDKPIWVTFGGSYPGSLSAFFR 195 >UniRef50_O01979 Cluster: Putative uncharacterized protein pcp-2; n=3; Caenorhabditis|Rep: Putative uncharacterized protein pcp-2 - Caenorhabditis elegans Length = 1080 Score = 99.5 bits (237), Expect = 6e-20 Identities = 68/174 (39%), Positives = 92/174 (52%), Gaps = 10/174 (5%) Frame = +2 Query: 173 KKFHLGRSNGGNLGIPGGDYQSNLPPPQW--------FKQKLDHSNPSDLRTWKQRYYVN 328 KK LGR G L P D+ N+ P + F+Q+ DH N + ++QR++ N Sbjct: 546 KKVFLGRPPHGFL--PESDF--NMSPDDYPAGFETGSFRQRQDHFNNQNADFFQQRFFKN 601 Query: 329 DSFYDFKNQGPVFLMIGGEGPADARWMVTGT--WINYAKKFNALCINLEHRFYGESHPTL 502 + K GP FLMIGGEGP A W++ ++ +AKK+ A LEHRFYGES Sbjct: 602 TQWA--KPGGPNFLMIGGEGPDKASWVLNENLPYLIWAKKYGATVYMLEHRFYGESRVG- 658 Query: 503 DLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKWIAFGGSYPGSLAAWLR 664 N LSS Q + D+A+FI S+ K + WI FGGSY G ++AW R Sbjct: 
659 --DNTNFNRLSSLQMIYDIADFIRSVNIKSGTSN--PWITFGGSYSGLISAWTR 708 Score = 82.6 bits (195), Expect = 8e-15 Identities = 63/193 (32%), Positives = 98/193 (50%), Gaps = 5/193 (2%) Frame = +2 Query: 101 MSTNMKLYTILFNLYVALISVDGVKKFHLGRSNGGNLGIPGGDYQSNLPPPQWFKQKLDH 280 M+ N+ L T+L + +A+I K HL R + G+ ++ + + Q LDH Sbjct: 1 MTRNLLLLTLLVSFVLAIIPNHYHFKKHLKRGSRKY-----GNSETAMTTG-YMAQNLDH 54 Query: 281 SNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGG-EGP---ADARWMVTGTWINYAKKFN 448 + T+ QRY + + +Q FL + G EGP D R + T AK+F Sbjct: 55 LIGNASGTFTQRYLYSQQYT--LHQRTAFLYVSGVEGPNVVLDDRTPIVKT----AKQFG 108 Query: 449 ALCINLEHRFYGESHPTLD-LSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKWIAF 625 A LEHR+YGES P +D L NL+ L+S+QA D+ +FI +F +++ V+W+ + Sbjct: 109 ATIFTLEHRYYGESKPNVDKLDAYNLRHLNSFQATQDVISFIKYANVQFNMDQDVRWVVW 168 Query: 626 GGSYPGSLAAWLR 664 G Y G +AA R Sbjct: 169 GIGYGGIIAAEAR 181 >UniRef50_Q4DW34 Cluster: Serine carboxypeptidase S28, putative; n=1; Trypanosoma cruzi|Rep: Serine carboxypeptidase S28, putative - Trypanosoma cruzi Length = 483 Score = 97.9 bits (233), Expect = 2e-19 Identities = 53/138 (38%), Positives = 86/138 (62%), Gaps = 1/138 (0%) Frame = +2 Query: 254 QWFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWIN- 430 +++ Q++DH++ + L T++QR++V+ S +D N GP L++ GEG A + G ++ Sbjct: 71 RYYNQRVDHADVT-LGTFRQRWWVDRSSWD-ANSGPAILLVNGEGTAPG--LPDGGFVGE 126 Query: 431 YAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKV 610 Y K A+ +LEHR+YGES P + L++L+ ALADL F ++K + +KV Sbjct: 127 YGKSVKAIIFSLEHRYYGESMPAPLTNRSMLKYLTVENALADLQAFKKYAEKKV-VKKKV 185 Query: 611 KWIAFGGSYPGSLAAWLR 664 KW+ GGSY G+L+AW R Sbjct: 186 KWLIVGGSYAGALSAWAR 203 >UniRef50_P42785 Cluster: Lysosomal Pro-X carboxypeptidase precursor; n=37; Eumetazoa|Rep: Lysosomal Pro-X carboxypeptidase precursor - Homo sapiens (Human) Length = 496 Score = 97.9 bits (233), Expect = 2e-19 Identities = 59/141 (41%), Positives = 79/141 (56%), Gaps = 5/141 (3%) Frame = +2 Query: 257 WFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMV--TGTWIN 430 +F+QK+DH + ++T+ 
QRY V D ++ KN G + G EG D W TG + Sbjct: 52 YFQQKVDHFGFNTVKTFNQRYLVADKYWK-KNGGSILFYTGNEG--DIIWFCNNTGFMWD 108 Query: 431 YAKKFNALCINLEHRFYGESHPTLDLSIKN---LQFLSSYQALADLANFISSMKQKFRLN 601 A++ A+ + EHR+YGES P D S K+ L FL+S QALAD A I +K+ Sbjct: 109 VAEELKAMLVFAEHRYYGESLPFGDNSFKDSRHLNFLTSEQALADFAELIKHLKRTIPGA 168 Query: 602 EKVKWIAFGGSYPGSLAAWLR 664 E IA GGSY G LAAW R Sbjct: 169 ENQPVIAIGGSYGGMLAAWFR 189 >UniRef50_Q54YD0 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 635 Score = 96.7 bits (230), Expect = 4e-19 Identities = 48/136 (35%), Positives = 80/136 (58%), Gaps = 1/136 (0%) Frame = +2 Query: 260 FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGP-VFLMIGGEGPADARWMVTGTWINYA 436 F+Q ++H + + T++QR+ VN F + VF ++ GEGP + + ++ A Sbjct: 76 FQQTINHLSYDTIGTFEQRFSVNKKFVPINGKPKAVFFLVSGEGPLSSEIVNHNPFVQIA 135 Query: 437 KKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKW 616 + AL + LE R+YGES P L+++ N+ +L++ Q L DLA F K++LN+ +KW Sbjct: 136 NETQALIVALELRYYGESMPFLNMNNSNMAYLTTDQILEDLATFQVFFTNKYQLND-IKW 194 Query: 617 IAFGGSYPGSLAAWLR 664 I G SY G+++AW R Sbjct: 195 IIMGCSYAGTISAWYR 210 >UniRef50_P34610 Cluster: Putative serine protease pcp-1 precursor; n=2; Caenorhabditis|Rep: Putative serine protease pcp-1 precursor - Caenorhabditis elegans Length = 565 Score = 96.7 bits (230), Expect = 4e-19 Identities = 69/195 (35%), Positives = 101/195 (51%), Gaps = 11/195 (5%) Frame = +2 Query: 113 MKLYTILFNLYVALISVDGVKKFHLGRSNGGNLGIPGGDYQSNLPPPQ--WFKQ-KLDHS 283 M+ + +L L VAL+SV+ ++ L + L +Y + Q W+K KLDH Sbjct: 1 MRWFLVL--LLVALVSVEASRRSRLFKK----LYQKASNYDAAPSNVQTVWYKNMKLDHF 54 Query: 284 NPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYAKKFNALCIN 463 D RT+ R N++FY K GP+F G EG ++ TG + A FNA I Sbjct: 55 TWGDTRTFDMRVMWNNTFY--KPGGPIFFYTGNEGGLESFVTATGMMFDLAPMFNASIIF 112 Query: 464 LEHRFYGESHPTLD---LSIKNLQFLSSYQALADLANFISSMKQ-----KFRLNEKVKWI 619 EHRFYG++ P + S+ N+ +L+S QALAD A ++ +K+ K + I Sbjct: 113 
AEHRFYGQTQPFGNQSYASLANVGYLTSEQALADYAELLTELKRDNNQFKMTFPAATQVI 172 Query: 620 AFGGSYPGSLAAWLR 664 +FGGSY G L+AW R Sbjct: 173 SFGGSYGGMLSAWFR 187 >UniRef50_A2FRR3 Cluster: Clan SC, family S28, unassigned serine peptidase; n=3; Trichomonas vaginalis G3|Rep: Clan SC, family S28, unassigned serine peptidase - Trichomonas vaginalis G3 Length = 504 Score = 95.5 bits (227), Expect = 1e-18 Identities = 59/144 (40%), Positives = 87/144 (60%), Gaps = 2/144 (1%) Frame = +2 Query: 239 NLPPPQWFKQKLDHSNPSDL-RTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVT 415 N+ WF QKLDH SDL T+KQRYY+N + Y K++ V + IGGE P + Sbjct: 23 NIGDQMWFDQKLDHF--SDLAETFKQRYYINTN-YSKKSKNLV-VYIGGEAPLLESSLKY 78 Query: 416 GTWINYAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFR 595 + A ++ + LEHR++GES P +L ++N ++L+ QA+ DLANFI+ MKQ + Sbjct: 79 DVQ-HIASVTKSVILALEHRYFGESIPHGNLELENFKYLTVDQAIEDLANFITQMKQNYC 137 Query: 596 LN-EKVKWIAFGGSYPGSLAAWLR 664 + K K + GGSYPG+L++ R Sbjct: 138 QDASKCKALMVGGSYPGALSSRFR 161 >UniRef50_A2ET59 Cluster: Clan SC, family S28, unassigned serine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan SC, family S28, unassigned serine peptidase - Trichomonas vaginalis G3 Length = 440 Score = 95.1 bits (226), Expect = 1e-18 Identities = 50/135 (37%), Positives = 80/135 (59%) Frame = +2 Query: 260 FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYAK 439 F Q +DH N S+ T+KQ++ +N+ + P+ L I GE V AK Sbjct: 26 FDQLIDH-NHSETGTFKQKFVINNQYGG--PDSPIILEISGESDGYYVGGVGDFEETLAK 82 Query: 440 KFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKWI 619 +FN + L+HRFYGES+P + + +NLQ+LS QA+ D++ F+ K+ ++ +K KW+ Sbjct: 83 EFNCTVVTLQHRFYGESYPFEESTTENLQYLSVEQAVEDISYFVDYYKKTYKA-DKNKWL 141 Query: 620 AFGGSYPGSLAAWLR 664 +GGSYPG L+A+ + Sbjct: 142 LYGGSYPGLLSAYTK 156 >UniRef50_A1C859 Cluster: Extracelular serine carboxypeptidase, putative; n=7; Trichocomaceae|Rep: Extracelular serine carboxypeptidase, putative - Aspergillus clavatus Length = 582 Score = 94.3 bits (224), Expect = 2e-18 Identities = 58/171 (33%), Positives 
= 92/171 (53%), Gaps = 7/171 (4%) Frame = +2 Query: 173 KKFHLGRSNGGNLGIPGGDYQSNLPPPQWFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKN 352 K G L + ++ ++P + + + D ++ RY+ + S Y K Sbjct: 39 KSIEKGEFRSQALSVSFAEHNFSVPVDHFHNESRYEPHSDD--SFNLRYWFDASHY--KE 94 Query: 353 QGPVFLMIGGEGPADARW--MVTGTWINYAKKFNALCINLEHRFYGESHPTLDLSIKNLQ 526 GPVFL+ GE A R+ + G AK +N L + LEHR+YGES+P ++L+++N++ Sbjct: 95 GGPVFLIAAGETDATDRFPFLSQGIVAQLAKTYNGLGVILEHRYYGESYPFVNLTVENIR 154 Query: 527 FLSSYQALADLANFISSMK----QKFRLNE-KVKWIAFGGSYPGSLAAWLR 664 FLS+ QALAD A+F S++ + L V WI +GGSY G+ A+LR Sbjct: 155 FLSTEQALADYAHFASNVAFPGLEHLNLTAGAVPWIGYGGSYAGAFVAFLR 205 >UniRef50_Q7PJN6 Cluster: ENSANGP00000023762; n=1; Anopheles gambiae str. PEST|Rep: ENSANGP00000023762 - Anopheles gambiae str. PEST Length = 500 Score = 93.9 bits (223), Expect = 3e-18 Identities = 54/161 (33%), Positives = 77/161 (47%), Gaps = 1/161 (0%) Frame = +2 Query: 185 LGRSNGGNLGIPGGDYQSNLPPPQWFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPV 364 L R +G+ +N+ ++F ++DH N DL TW RY F GP+ Sbjct: 36 LNRLRSATVGLKPSQRNANITE-EFFTTEVDHFNNQDLTTWSNRYLA--LMDHFVEGGPM 92 Query: 365 FLMIGGEGPADARWMVTGTWIN-YAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSY 541 + + G+ P D + GT IN A+ LE RFYG+S P D S +NL+FL S Sbjct: 93 LIFLTGDAPLDPSMIDDGTLINEMARDLGGAVFALETRFYGKSQPVGDYSTENLRFLKSE 152 Query: 542 QALADLANFISSMKQKFRLNEKVKWIAFGGSYPGSLAAWLR 664 QAL DL +I ++ + K + G Y G+LA W R Sbjct: 153 QALMDLIEWIDYLRNTVVGDPNAKVVLMGTGYAGALATWAR 193 >UniRef50_A2FRQ0 Cluster: Clan SC, family S28, unassigned serine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan SC, family S28, unassigned serine peptidase - Trichomonas vaginalis G3 Length = 543 Score = 93.5 bits (222), Expect = 4e-18 Identities = 46/136 (33%), Positives = 78/136 (57%), Gaps = 1/136 (0%) Frame = +2 Query: 260 FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYAK 439 F+Q +DH + + Q YYVND +D K + ++G + + +M +GT N AK Sbjct: 19 FEQYIDHEAKKE--KYNQTYYVND--FDLKKSNNLVFLVGNQESFNQEFMTSGTAFNIAK 74 Query: 440 
KFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLN-EKVKW 616 A+ +EHR++GES PT LS + LQ+L+ Q + D+ +FI+ M+ ++ + K + Sbjct: 75 DLKAILFGIEHRYFGESKPTESLSTEELQYLTVEQTIEDVHDFIAQMRNQYCKDLNKCQS 134 Query: 617 IAFGGSYPGSLAAWLR 664 + G Y GS+AAW++ Sbjct: 135 LTVGQGYGGSIAAWVK 150 >UniRef50_A2F801 Cluster: Clan SC, family S28, unassigned serine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan SC, family S28, unassigned serine peptidase - Trichomonas vaginalis G3 Length = 436 Score = 93.1 bits (221), Expect = 5e-18 Identities = 53/136 (38%), Positives = 76/136 (55%), Gaps = 1/136 (0%) Frame = +2 Query: 260 FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPV-FLMIGGEGPADARWMVTGTWINYA 436 F Q +DHS+P T+KQRY ++ +D+ L IGGE Sbjct: 21 FSQNIDHSDPQK-GTFKQRY---EALFDYTTDNKTAILFIGGESDTFRPRAFNDYMATLC 76 Query: 437 KKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKW 616 K+FNA LEHR++GES PT DLS N+++L+ A+ DL NF M +++++ + KW Sbjct: 77 KEFNAAFFMLEHRYFGESFPT-DLSYPNIKYLTVDNAIDDLYNFKVKMVEQYKMTDS-KW 134 Query: 617 IAFGGSYPGSLAAWLR 664 I GGSYPG L+A+ R Sbjct: 135 ILVGGSYPGLLSAYTR 150 >UniRef50_Q7QAL7 Cluster: ENSANGP00000011396; n=2; Anopheles gambiae str. PEST|Rep: ENSANGP00000011396 - Anopheles gambiae str. 
PEST Length = 500 Score = 91.1 bits (216), Expect = 2e-17 Identities = 46/135 (34%), Positives = 76/135 (56%) Frame = +2 Query: 260 FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYAK 439 F +++H +P + T++ Y ND +Y + GP+F+++GG P + +M + + A Sbjct: 65 FTSRVNHFDPQNRDTFEFNYLHNDQYY--RQGGPLFIVVGGHYPVNPYFMENSHFRDVAA 122 Query: 440 KFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKWI 619 A EHR++GES+PT DLS +NL+F+ + Q L DL +I +K++ + + I Sbjct: 123 LEGAWLATNEHRYFGESYPTEDLSTENLRFMRTEQVLFDLIEWIDFLKREVMGDPNARVI 182 Query: 620 AFGGSYPGSLAAWLR 664 G Y GSLA W R Sbjct: 183 LHGVGYGGSLATWAR 197 >UniRef50_A0CB90 Cluster: Chromosome undetermined scaffold_163, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_163, whole genome shotgun sequence - Paramecium tetraurelia Length = 452 Score = 91.1 bits (216), Expect = 2e-17 Identities = 59/141 (41%), Positives = 79/141 (56%), Gaps = 5/141 (3%) Frame = +2 Query: 257 WFKQKL-DHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINY 433 WF+ +L DH + + + QRY+V + + G V I GE + I Sbjct: 27 WFEHQLVDHYDKLNKNVFHQRYWVVEENF-VPETGVVLFQICGEYTCINDIKLRLFIIQL 85 Query: 434 AKKFNALCINLEHRFYGESHPTLDLSIK--NLQFLSSYQALADLANFISSM--KQKFRLN 601 AK+FNAL I LEHR+YG+S P S+K NL++LS+ QAL DLA F M +K + Sbjct: 86 AKEFNALIIILEHRYYGKSMPLGKESLKDENLRYLSTRQALDDLAYFQRFMVLNKKHGIK 145 Query: 602 EKVKWIAFGGSYPGSLAAWLR 664 + WIA GGSYPG+LAAW R Sbjct: 146 SQNPWIAIGGSYPGALAAWYR 166 >UniRef50_UPI00004996CF Cluster: serine protease; n=1; Entamoeba histolytica HM-1:IMSS|Rep: serine protease - Entamoeba histolytica HM-1:IMSS Length = 457 Score = 88.6 bits (210), Expect = 1e-16 Identities = 50/133 (37%), Positives = 78/133 (58%), Gaps = 2/133 (1%) Frame = +2 Query: 272 LDHSNPSDLRTWKQRYYVNDSFYDFKN-QGPVFLMIGGEGPADARWMVTGTWIN-YAKKF 445 LDH N ++ + +Y+VN F D + P+F+++GGEGPA + + I+ AKK Sbjct: 46 LDHFNANNQNDFDIQYFVNKKFLDANDPNAPLFVLLGGEGPASPKVLQNNYVIDSLAKKH 105 Query: 446 NALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKWIAF 625 L +++EHRFYG 
S P+L++ L + ++ QAL D IS ++++ L I Sbjct: 106 KGLMLSVEHRFYGASTPSLEMD--KLIYCTAEQALMDYVEVISHVQEENNLVGH-PVIVL 162 Query: 626 GGSYPGSLAAWLR 664 GGSY G+LAAW+R Sbjct: 163 GGSYSGNLAAWMR 175 >UniRef50_Q1DJJ2 Cluster: Putative uncharacterized protein; n=2; Coccidioides immitis|Rep: Putative uncharacterized protein - Coccidioides immitis Length = 555 Score = 88.6 bits (210), Expect = 1e-16 Identities = 56/157 (35%), Positives = 88/157 (56%), Gaps = 13/157 (8%) Frame = +2 Query: 233 QSNLPPPQWFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADAR--- 403 +S P ++ + +DH NP +K R++VNDS Y K+ GPVFL GGE A Sbjct: 64 KSGAPEAEFTEIPIDHENPD--AKYKNRFWVNDSKY--KSGGPVFLFDGGEANAQRYADF 119 Query: 404 WMVTGT--WINYAKKFNALCINLEHRFYGESHP---TLDLSIKNLQFLSSYQALADLANF 568 ++V T ++ ++F+ + I EHR+YGES+P LD ++ Q+L++ QALAD+ F Sbjct: 120 YLVNETSFFVQLLEEFHGMGIVWEHRYYGESNPFPVNLDTPAEHFQYLNNEQALADIPYF 179 Query: 569 ISSMKQKFRLNE-----KVKWIAFGGSYPGSLAAWLR 664 + K++ ++ W+ GGSYPG AA+ R Sbjct: 180 AKNFKRENFPDDDLTPKSTPWVMIGGSYPGMRAAFTR 216 >UniRef50_Q9FLH1 Cluster: Lysosomal Pro-X carboxypeptidase; n=6; core eudicotyledons|Rep: Lysosomal Pro-X carboxypeptidase - Arabidopsis thaliana (Mouse-ear cress) Length = 529 Score = 88.2 bits (209), Expect = 2e-16 Identities = 57/149 (38%), Positives = 81/149 (54%), Gaps = 8/149 (5%) Frame = +2 Query: 224 GDYQSNLPPPQWFKQKLDHSNPSDLRTWKQRYYVN-DSFYDFKNQGPVFLMIGGEGPADA 400 GD ++F Q+LDH + +DL + QRY +N D + GP+FL G EG D Sbjct: 51 GDRNEYRYETKFFSQQLDHFSFADLPKFSQRYLINSDHWLGASALGPIFLYCGNEG--DI 108 Query: 401 RWMVTGTWI--NYAKKFNALCINLEHRFYGESHP--TLDLSIKN---LQFLSSYQALADL 559 W T + + A KF AL + EHR+YGES P + + + KN L +L++ QALAD Sbjct: 109 EWFATNSGFIWDIAPKFGALLVFPEHRYYGESMPYGSREEAYKNATTLSYLTTEQALADF 168 Query: 560 ANFISSMKQKFRLNEKVKWIAFGGSYPGS 646 A F++ +K+ E + FGGSY GS Sbjct: 169 AVFVTDLKRNLSA-EACPVVLFGGSYGGS 196 >UniRef50_Q67WZ5 Cluster: Putative prolylcarboxypeptidase isoform 1; n=4; Oryza sativa|Rep: Putative prolylcarboxypeptidase isoform 1 - Oryza sativa subsp. 
japonica (Rice) Length = 539 Score = 88.2 bits (209), Expect = 2e-16 Identities = 58/149 (38%), Positives = 78/149 (52%), Gaps = 13/149 (8%) Frame = +2 Query: 257 WFKQKLDHSN--PSDLRTWKQRYYVNDSFYDFKNQ------GPVFLMIGGEGPADARWMV 412 +F Q+LDH P+ + Q+Y VND+F+ GP+F+ G EG D W Sbjct: 86 YFPQELDHFTFTPNASAVFYQKYLVNDTFWRRSAAAGETPAGPIFVYTGNEG--DIEWFA 143 Query: 413 TGTWINY--AKKFNALCINLEHRFYGESHP---TLDLSIKNLQFLSSYQALADLANFISS 577 T T + A F AL + +EHRFYGES P + S + L +L+S QALAD A I+S Sbjct: 144 TNTGFMFDIAPSFGALLVFIEHRFYGESKPFGNESNSSPEKLGYLTSTQALADFAVLITS 203 Query: 578 MKQKFRLNEKVKWIAFGGSYPGSLAAWLR 664 +K + FGGSY G LA+W R Sbjct: 204 LKHNLSAVSS-PVVVFGGSYGGMLASWFR 231 >UniRef50_A2FGL0 Cluster: Clan SC, family S28, unassigned serine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan SC, family S28, unassigned serine peptidase - Trichomonas vaginalis G3 Length = 527 Score = 87.8 bits (208), Expect = 2e-16 Identities = 49/137 (35%), Positives = 76/137 (55%), Gaps = 2/137 (1%) Frame = +2 Query: 260 FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYAK 439 F+ ++DH + D + +R+ N +F + K L IGGE R++ G+++ A Sbjct: 26 FQNRIDHFDTHDSSYYMERFLENLTFVN-KTFKKALLYIGGESTLSPRYVQAGSYLELAA 84 Query: 440 KFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEK--VK 613 + NA LEHRF+G+S P L+ +N ++L+ QALADLA FI L ++ V Sbjct: 85 RENAAVFALEHRFFGKSMPFDQLTKENYKYLTIPQALADLAEFIERYIYTHHLADQDGVT 144 Query: 614 WIAFGGSYPGSLAAWLR 664 GGSYPG+L++W R Sbjct: 145 VAVVGGSYPGALSSWFR 161 >UniRef50_Q53ND8 Cluster: At2g24280/F27D4.19; n=4; Oryza sativa|Rep: At2g24280/F27D4.19 - Oryza sativa subsp. 
japonica (Rice) Length = 511 Score = 87.0 bits (206), Expect = 4e-16 Identities = 60/155 (38%), Positives = 81/155 (52%), Gaps = 15/155 (9%) Frame = +2 Query: 245 PPPQ-------WFKQKLDHSN--PSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPAD 397 PPPQ +F Q+LDH N P+ T++QRY VN +F+ PVF+ G EG Sbjct: 46 PPPQVVQYETRYFTQRLDHFNELPASNGTFRQRYLVNGTFWGGA-AAPVFVYAGNEGDVA 104 Query: 398 ARWMVTGTWINYAKKFNALCINLEHRFYGESHP---TLDLSIKNLQ---FLSSYQALADL 559 TG A +F A+ + +EHR+YGES P T + + +L++ QALAD Sbjct: 105 LFASNTGFMWEAAPRFRAMLVFVEHRYYGESLPFGGTRAAAFADASAAGYLTTAQALADF 164 Query: 560 ANFISSMKQKFRLNEKVKWIAFGGSYPGSLAAWLR 664 A I S+K K + FGGSY G LAAW+R Sbjct: 165 AELILSLKSNLTAC-KAPVVIFGGSYGGMLAAWMR 198 >UniRef50_UPI00015B5213 Cluster: PREDICTED: similar to prolylcarboxypeptidase, putative; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to prolylcarboxypeptidase, putative - Nasonia vitripennis Length = 425 Score = 85.4 bits (202), Expect = 1e-15 Identities = 52/143 (36%), Positives = 70/143 (48%), Gaps = 6/143 (4%) Frame = +2 Query: 254 QWFKQKLDHSNPSDLRTWKQ----RYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGT 421 QWF Q LDH +P+ RTWKQ + N++ + N+ + + W+ Sbjct: 45 QWFSQMLDHYDPASTRTWKQDSRLKIAHNNTLRENWNRQQITSRT-TDSTKKVAWIADDA 103 Query: 422 WIN--YAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFR 595 + AKKF A LEHRFYG+S PT + QALAD A FI M++ Sbjct: 104 SLESYLAKKFGAKIFFLEHRFYGKSQPT---------YTRVDQALADTAYFIEGMQRSHN 154 Query: 596 LNEKVKWIAFGGSYPGSLAAWLR 664 + KWI FG SY GSL +W+R Sbjct: 155 IPRSTKWILFGASYAGSLVSWMR 177 >UniRef50_A3C6E7 Cluster: Putative uncharacterized protein; n=2; Oryza sativa|Rep: Putative uncharacterized protein - Oryza sativa subsp. 
japonica (Rice) Length = 616 Score = 85.0 bits (201), Expect = 1e-15 Identities = 56/143 (39%), Positives = 75/143 (52%), Gaps = 8/143 (5%) Frame = +2 Query: 254 QWFKQKLDHSNPSDLRTWKQRYYVNDSFYDFK-NQGPVFLMIGGEGPADARWMVTGTWIN 430 +W Q+LDH +P+D R +KQRYY F D+ GPVFL I GE + + ++ Sbjct: 53 RWMDQRLDHFSPTDHRQFKQRYY---EFADYHAGGGPVFLRICGESSCNG---IPNDYLA 106 Query: 431 Y-AKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQ--KFRLN 601 +KKF A + EHR+YG+S P L+ +NL+FLSS QAL DL F ++ R N Sbjct: 107 VLSKKFGAAVVTPEHRYYGKSSPFESLTTENLRFLSSKQALFDLVAFRQHYQEILNARYN 166 Query: 602 EKV----KWIAFGGSYPGSLAAW 658 W FG P SL W Sbjct: 167 RSSGFDNPWFVFGAQVP-SLDMW 188 >UniRef50_Q7QAL4 Cluster: ENSANGP00000011387; n=1; Anopheles gambiae str. PEST|Rep: ENSANGP00000011387 - Anopheles gambiae str. PEST Length = 439 Score = 84.6 bits (200), Expect = 2e-15 Identities = 42/135 (31%), Positives = 74/135 (54%) Frame = +2 Query: 260 FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYAK 439 F+ ++DH + + T++ Y N +Y + GP+F+++GG +A ++ G + + A+ Sbjct: 6 FRTRVDHFDVQNRATFEFNYVSNGEYY--RPGGPIFIVVGGNNALNAYFIENGLFHDIAR 63 Query: 440 KFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKWI 619 + + EHR+YG S P D S N++FLS QAL DL +I ++++ + K I Sbjct: 64 RQGGWLFSNEHRYYGRSSPVEDYSAPNMRFLSVEQALIDLIEWIDHLRREVVRDPNAKVI 123 Query: 620 AFGGSYPGSLAAWLR 664 G Y G++A W R Sbjct: 124 LHGLGYGGAVAIWAR 138 >UniRef50_A7PQM2 Cluster: Chromosome chr6 scaffold_25, whole genome shotgun sequence; n=9; Vitis vinifera|Rep: Chromosome chr6 scaffold_25, whole genome shotgun sequence - Vitis vinifera (Grape) Length = 510 Score = 83.8 bits (198), Expect = 3e-15 Identities = 51/144 (35%), Positives = 78/144 (54%), Gaps = 8/144 (5%) Frame = +2 Query: 257 WFKQKLDHSN--PSDLRTWKQRYYVNDSFYDFKN-QGPVFLMIGGEGPADARWMVTGTWI 427 ++ Q LDH N P T++QRY +N ++ N P+F +G E D G + Sbjct: 66 FYNQTLDHFNYRPESYYTFQQRYVMNFKYWGGANASAPIFAYLGAEAALDFDLTGVGFPV 125 Query: 428 NYAKKFNALCINLEHRFYGESHP--TLDLSIKNLQ---FLSSYQALADLANFISSMKQKF 592 + A +F AL + +EHR+YG+S P + + ++KN + +S 
QA+AD A + +K+K Sbjct: 126 DNALQFKALLVYIEHRYYGQSIPFGSREEALKNASTRGYFNSAQAIADYAEVLEYIKKKL 185 Query: 593 RLNEKVKWIAFGGSYPGSLAAWLR 664 L E I GGSY G LA+W R Sbjct: 186 -LAENSPVIVIGGSYGGMLASWFR 208 >UniRef50_UPI0000E4A528 Cluster: PREDICTED: similar to prolylcarboxypeptidase; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to prolylcarboxypeptidase - Strongylocentrotus purpuratus Length = 496 Score = 83.4 bits (197), Expect = 4e-15 Identities = 54/143 (37%), Positives = 81/143 (56%), Gaps = 6/143 (4%) Frame = +2 Query: 254 QWFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMV--TGTWI 427 ++F+Q++DH + ++ T++ RY V+D + GP+F G EG D W TG Sbjct: 53 EYFEQQVDHFSFTNSDTFQMRYLVSDELWT--KGGPIFFYTGNEG--DITWFCQNTGFVW 108 Query: 428 NYAKKFNALCINLEHRFYGESHPTLDLSIK---NLQFLSSYQALADLANFISSMKQKFRL 598 + A ++ A+ I EHR+YG+S P + S K +L +L++ QALAD A F+ K R Sbjct: 109 DLAVEYKAIVIFAEHRYYGKSLPYGNDSYKDAAHLGYLTAEQALADFAVFLDWYKANTRG 168 Query: 599 NEK-VKWIAFGGSYPGSLAAWLR 664 +AFGGSY G LAAW+R Sbjct: 169 GAAGSPVVAFGGSYGGMLAAWMR 191 >UniRef50_Q9FFC2 Cluster: Prolylcarboxypeptidase-like protein; n=7; core eudicotyledons|Rep: Prolylcarboxypeptidase-like protein - Arabidopsis thaliana (Mouse-ear cress) Length = 502 Score = 83.4 bits (197), Expect = 4e-15 Identities = 53/152 (34%), Positives = 81/152 (53%), Gaps = 8/152 (5%) Frame = +2 Query: 233 QSNLPPPQWFKQKLDHSN--PSDLRTWKQRYYVNDSFYD-FKNQGPVFLMIGGEGPADAR 403 +SNL +F Q LDH P T++QRY ++ + + K P+ +G E D+ Sbjct: 51 ESNLKM-YYFNQTLDHFTFTPESYMTFQQRYAIDSTHWGGAKANAPILAFLGEESSLDSD 109 Query: 404 WMVTGTWINYAKKFNALCINLEHRFYGESHP--TLDLSIKN---LQFLSSYQALADLANF 568 G + + NAL + +EHR+YGE+ P + + ++KN L +L++ QALAD A Sbjct: 110 LAAIGFLRDNGPRLNALLVYIEHRYYGETMPFGSAEEALKNASTLGYLNAAQALADYAAI 169 Query: 569 ISSMKQKFRLNEKVKWIAFGGSYPGSLAAWLR 664 + +K+K+ N I GGSY G LAAW R Sbjct: 170 LLHVKEKYSTNHS-PIIVIGGSYGGMLAAWFR 200 >UniRef50_Q16Y07 Cluster: Prolylcarboxypeptidase, putative; n=1; Aedes aegypti|Rep: Prolylcarboxypeptidase, putative - Aedes aegypti (Yellowfever mosquito) 
Length = 512 Score = 83.4 bits (197), Expect = 4e-15 Identities = 45/137 (32%), Positives = 67/137 (48%), Gaps = 1/137 (0%) Frame = +2 Query: 257 WFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWI-NY 433 +F ++DH N + W RY+ +Y GP+ + +GG P + T I + Sbjct: 62 FFTTRVDHFNSQNTAEWTLRYFAVTDYY--MPGGPILIFLGGNQPILTSMVDESTLIYDM 119 Query: 434 AKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVK 613 A++ N E RFYG+S T D S +NL L++ Q LADLA F+ +K+ N Sbjct: 120 AREMNGAVYAFESRFYGQSFVTEDASTENLSLLNTDQILADLAEFVQYLKRDVLKNPNAP 179 Query: 614 WIAFGGSYPGSLAAWLR 664 + G Y G+LA W R Sbjct: 180 VMVSGSEYGGALATWFR 196 >UniRef50_Q9GRV9 Cluster: Putative uncharacterized protein pcp-4; n=1; Caenorhabditis elegans|Rep: Putative uncharacterized protein pcp-4 - Caenorhabditis elegans Length = 1042 Score = 81.8 bits (193), Expect = 1e-14 Identities = 50/137 (36%), Positives = 75/137 (54%), Gaps = 2/137 (1%) Frame = +2 Query: 260 FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGT--WINY 433 F+Q+ DH + ++ ++Q++Y N + + GP FLMIGG+ W++ W+ Sbjct: 550 FRQRQDHFDNLNVDFFQQKFYKNSQWA--RPGGPNFLMIGGQEAEGESWVLNEKLPWLIS 607 Query: 434 AKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVK 613 A+K+ A LEHRFYG+S L + NL LSS Q L D A FI ++ ++ Sbjct: 608 AQKYGATVYLLEHRFYGDS---LVGNNTNLNLLSSLQVLYDSAEFIKAI--NYKTQSSTP 662 Query: 614 WIAFGGSYPGSLAAWLR 664 WI FG S+P L+AW R Sbjct: 663 WITFGRSFP--LSAWTR 677 Score = 75.8 bits (178), Expect = 9e-13 Identities = 44/139 (31%), Positives = 68/139 (48%), Gaps = 3/139 (2%) Frame = +2 Query: 257 WFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVT--GTWIN 430 + QKLDH + + Q+Y+ + NQ FL + EG + M + Sbjct: 43 YLSQKLDHFSNDSQVFFTQQYFYTERL-SVSNQKVAFLYVNTEGNEEIAVMTDERSPVVK 101 Query: 431 YAKKFNALCINLEHRFYGESHPTL-DLSIKNLQFLSSYQALADLANFISSMKQKFRLNEK 607 AK+F A L+HR+YG S P + L++L+S QA+ D+ +FI +F +N Sbjct: 102 AAKRFGAQLFALKHRYYGASKPNFQNFDASALRYLTSRQAIQDILSFIKYANTQFNMNPD 161 Query: 608 VKWIAFGGSYPGSLAAWLR 664 V+W+ +G Y G LAA R Sbjct: 162 VRWVLWGTGYGGILAAEAR 180 >UniRef50_Q16LF2 
Cluster: Prolylcarboxypeptidase, putative; n=4; Aedes aegypti|Rep: Prolylcarboxypeptidase, putative - Aedes aegypti (Yellowfever mosquito) Length = 493 Score = 81.8 bits (193), Expect = 1e-14 Identities = 48/153 (31%), Positives = 74/153 (48%), Gaps = 2/153 (1%) Frame = +2 Query: 212 GIPGGDYQSNLPPPQW--FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGE 385 G P N +W F Q+ HSN + + RY N FY + GP+FL +GG Sbjct: 40 GPPSDSIVDNGNYTEWRVFDQRQSHSNAHSVDMFPMRYVSNSKFY--RPGGPIFLFVGGP 97 Query: 386 GPADARWMVTGTWINYAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLAN 565 + ++ G +++ A++ NA + E R+YGES P + S NL+ L QA D+A Sbjct: 98 WELEQHFVEQGHFVDLAEENNAFVVANEMRYYGESLPVPNASRGNLRLLHIVQACTDIAR 157 Query: 566 FISSMKQKFRLNEKVKWIAFGGSYPGSLAAWLR 664 I ++ + + + I G + GSLA W R Sbjct: 158 LIVHIRYEVLRDPNARVIVAGVGFSGSLAHWTR 190 >UniRef50_A1CFV7 Cluster: Serine peptidase, putative; n=5; Pezizomycotina|Rep: Serine peptidase, putative - Aspergillus clavatus Length = 531 Score = 81.8 bits (193), Expect = 1e-14 Identities = 50/142 (35%), Positives = 79/142 (55%), Gaps = 9/142 (6%) Frame = +2 Query: 260 FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQG-PVFLMIGGEG--PADARWMVTGTWIN 430 F Q +DH NP +L T++QR++ + F+ K G PV L GE P ++ T Sbjct: 54 FDQLIDHDNP-ELGTFQQRFWWSSEFW--KGPGSPVVLFTPGEADAPGYTGYLTNQTLPG 110 Query: 431 -YAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLN-- 601 +A++ I LEHR++G S P +L+ + LQ+L+ Q++ADL +F ++ F N Sbjct: 111 RFAQEIGGAVILLEHRYWGTSSPYTNLNTETLQYLTLEQSIADLTHFAKTVDLAFDSNHS 170 Query: 602 ---EKVKWIAFGGSYPGSLAAW 658 +K W+ GGSY G+L+AW Sbjct: 171 SNADKAPWVLTGGSYSGALSAW 192 >UniRef50_UPI0000499072 Cluster: serine protease; n=2; Entamoeba histolytica HM-1:IMSS|Rep: serine protease - Entamoeba histolytica HM-1:IMSS Length = 480 Score = 81.4 bits (192), Expect = 2e-14 Identities = 46/133 (34%), Positives = 77/133 (57%), Gaps = 2/133 (1%) Frame = +2 Query: 272 LDHSNPSDLRTWKQRYYVNDSFYDFKN-QGPVFLMIGGEGPADARWMVTGTWI-NYAKKF 445 LDH N ++ + +Y+++ + D + P+F+++GGEGP D + + + AKK Sbjct: 46 
LDHFNANNQIDFDIQYFISTDYLDNNSPNAPLFVLLGGEGPEDETGLQNYFVVTDLAKKH 105 Query: 446 NALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKWIAF 625 L +++EHRFYG S P+L++ L + ++ QAL D IS ++++ L I Sbjct: 106 KGLMLSVEHRFYGASTPSLEMD--KLIYCTAEQALMDYVEVISHVQEENNLVGH-PVIVL 162 Query: 626 GGSYPGSLAAWLR 664 GGSY G+LAAW+R Sbjct: 163 GGSYSGNLAAWMR 175 >UniRef50_Q5CZT1 Cluster: Zgc:113564; n=12; Eumetazoa|Rep: Zgc:113564 - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 500 Score = 80.2 bits (189), Expect = 4e-14 Identities = 51/141 (36%), Positives = 73/141 (51%), Gaps = 4/141 (2%) Frame = +2 Query: 254 QWFKQKLDHSNPSDLR--TWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWI 427 ++FKQ LDH N + L T+ QRY + D ++ K GP+F G EG +G + Sbjct: 53 KYFKQILDHFNYNSLGNGTYDQRYLITDKYWK-KGYGPIFFYTGNEGDISEFARNSGFMV 111 Query: 428 NYAKKFNALCINLEHRFYGESHP--TLDLSIKNLQFLSSYQALADLANFISSMKQKFRLN 601 A AL I EHR+YG+S P I + L+ QALAD A I+ +K++ Sbjct: 112 ELAAAQGALLIFAEHRYYGKSLPFGKNSFKIPEVGLLTVEQALADYAVMITELKEELG-G 170 Query: 602 EKVKWIAFGGSYPGSLAAWLR 664 + I FGGSY G L+ ++R Sbjct: 171 QTCPVIVFGGSYGGMLSVYMR 191 >UniRef50_A6S9T4 Cluster: Putative uncharacterized protein; n=3; Sclerotiniaceae|Rep: Putative uncharacterized protein - Botryotinia fuckeliana B05.10 Length = 544 Score = 80.2 bits (189), Expect = 4e-14 Identities = 48/141 (34%), Positives = 73/141 (51%), Gaps = 7/141 (4%) Frame = +2 Query: 257 WFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWIN-- 430 +F Q LDH NPS T++Q+++ N F+ VF G A+ +T + Sbjct: 54 FFTQLLDHDNPSK-GTFQQKFWWNSEFWAGPGSPIVFFTPGEIAAANYGAYLTNVTVTGL 112 Query: 431 YAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNE-- 604 +A++ + +EHRF+GES P +L+ NLQ L+ QA+AD +F ++ F N Sbjct: 113 FAQEIKGAVVMVEHRFWGESSPYDNLTTTNLQLLTLKQAIADFVHFAKTVDLPFDSNHSS 172 Query: 605 ---KVKWIAFGGSYPGSLAAW 658 WI GGSY G+L+AW Sbjct: 173 NAASAPWINSGGSYSGALSAW 193 >UniRef50_P34676 Cluster: Putative serine protease tag-282 precursor; n=3; Caenorhabditis|Rep: Putative serine protease tag-282 precursor - Caenorhabditis elegans 
Length = 507 Score = 80.2 bits (189), Expect = 4e-14 Identities = 49/121 (40%), Positives = 68/121 (56%), Gaps = 4/121 (3%) Frame = +2 Query: 314 RYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYAKKFNALCINLEHRFYGESH 493 RY++N Y+ GP+ G EG +A TG + A + A + +EHRFYG+S Sbjct: 64 RYFLNIDHYE--TGGPILFYTGNEGSLEAFAENTGFMWDLAPELKAAVVFVEHRFYGKSQ 121 Query: 494 PTLDLS---IKNLQFLSSYQALADLANFISSMK-QKFRLNEKVKWIAFGGSYPGSLAAWL 661 P + S I++L +LSS QALAD A + K +K + +K IAFGGSY G L+AW Sbjct: 122 PFKNESYTDIRHLGYLSSQQALADFALSVQFFKNEKIKGAQKSAVIAFGGSYGGMLSAWF 181 Query: 662 R 664 R Sbjct: 182 R 182 >UniRef50_A4RKL9 Cluster: Putative uncharacterized protein; n=1; Magnaporthe grisea|Rep: Putative uncharacterized protein - Magnaporthe grisea (Rice blast fungus) (Pyricularia grisea) Length = 489 Score = 79.4 bits (187), Expect = 7e-14 Identities = 53/143 (37%), Positives = 76/143 (53%), Gaps = 10/143 (6%) Frame = +2 Query: 260 FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADAR--WMVTGTWIN- 430 F Q LDH NPS T+KQRY+ + S + PVFL GE AD ++ T Sbjct: 21 FDQLLDHHNPSK-GTFKQRYFWDASSWAGPGS-PVFLFNPGEDAADGYVGYLDNHTLPGL 78 Query: 431 YAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLN--- 601 YA F I +EHR++G+S P L+ + LQ+L Q++ D+ +F +++ F + Sbjct: 79 YADTFQGAVIVIEHRYWGKSIPFDILTAETLQYLDVPQSIMDMTHFAKTVQLSFDSSGDG 138 Query: 602 ----EKVKWIAFGGSYPGSLAAW 658 EK W+ GGSY G+LAAW Sbjct: 139 GANAEKAPWVLIGGSYSGALAAW 161 >UniRef50_A4QUS9 Cluster: Putative uncharacterized protein; n=1; Magnaporthe grisea|Rep: Putative uncharacterized protein - Magnaporthe grisea (Rice blast fungus) (Pyricularia grisea) Length = 400 Score = 79.4 bits (187), Expect = 7e-14 Identities = 47/133 (35%), Positives = 75/133 (56%), Gaps = 7/133 (5%) Frame = +2 Query: 287 PSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADAR--WMVTGTWINYAKKFNALCI 460 P T++ RY+ + S Y N GPV +++GGE R +M G A+ + + Sbjct: 66 PHSNDTFELRYWFDASHY--VNGGPVIVLLGGETSGAERLPFMEKGILYRLARATRGMAV 123 Query: 461 NLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSM-----KQKFRLNEKVKWIAF 625 LEHR+YG S PT +L+ +NL+FL++ QALAD A F 
++ + + + + A+ Sbjct: 124 VLEHRYYGASFPTPNLTTENLRFLTTDQALADTAYFAKNVVFHGYENRNLTSHTTPYFAY 183 Query: 626 GGSYPGSLAAWLR 664 GGSY G+ AA++R Sbjct: 184 GGSYAGAFAAFVR 196 >UniRef50_Q7SEA3 Cluster: Putative uncharacterized protein NCU00831.1; n=6; Pezizomycotina|Rep: Putative uncharacterized protein NCU00831.1 - Neurospora crassa Length = 561 Score = 78.6 bits (185), Expect = 1e-13 Identities = 54/154 (35%), Positives = 81/154 (52%), Gaps = 13/154 (8%) Frame = +2 Query: 242 LPPPQWFKQKLDHSN------PSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGE--GPAD 397 L P + K +DH + P T+ RY+ + ++Y K GPV ++ GE G Sbjct: 60 LYPARTIKVPVDHFHNDTKYEPHTNDTFDLRYWFDATYY--KKGGPVIVLAAGETSGVGR 117 Query: 398 ARWMVTGTWINYAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISS 577 +++ G AK + + LEHR+YG+S PT D S KNL+FL++ QALAD F + Sbjct: 118 LQFLQKGIVYQLAKATGGVGVILEHRYYGKSLPTSDFSTKNLRFLTTDQALADTVYFAKN 177 Query: 578 MK----QKFRLN-EKVKWIAFGGSYPGSLAAWLR 664 +K + L +IA+GGSY G+ A+LR Sbjct: 178 VKFAGLEHLDLTAPNTPYIAYGGSYAGAFVAFLR 211 >UniRef50_Q0U1V1 Cluster: Putative uncharacterized protein; n=2; Phaeosphaeria nodorum|Rep: Putative uncharacterized protein - Phaeosphaeria nodorum (Septoria nodorum) Length = 582 Score = 78.6 bits (185), Expect = 1e-13 Identities = 56/155 (36%), Positives = 77/155 (49%), Gaps = 16/155 (10%) Frame = +2 Query: 248 PPQWFKQKLDHSNPSDLR-TWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMV---- 412 P ++ LDH +PS T+ RY+ S Y K GPVF+ GEG A + Sbjct: 80 PEEYVTLPLDHFDPSKNHGTFNNRYWAASSSY--KPGGPVFIYDVGEGNASTNALFRIQN 137 Query: 413 -TGTWINYAKKFNALCINLEHRFYGESHP----TLDLSIKNLQFLSSYQALADLANFISS 577 T + K+N + I EHRFYG S P +D + +FL++ Q+LAD+A F S Sbjct: 138 STSFFKQIVDKYNGIGIVWEHRFYGNSSPGGPVNIDTPAEQFRFLNTEQSLADVAAFASQ 197 Query: 578 MKQKFR-LN-----EKVKWIAFGGSYPGSLAAWLR 664 K R +N E W+ GGSYPG AA++R Sbjct: 198 FSLKNRGINYTLTPETTPWVFVGGSYPGMRAAFMR 232 >UniRef50_Q9UHL4 Cluster: Dipeptidyl-peptidase 2 precursor; n=19; Euteleostomi|Rep: Dipeptidyl-peptidase 2 precursor - Homo sapiens (Human) Length = 492 Score = 78.2 bits (184), Expect = 2e-13 
Identities = 50/141 (35%), Positives = 79/141 (56%), Gaps = 4/141 (2%) Frame = +2 Query: 254 QWFKQKLDHSNPSDL--RTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWI 427 ++F+Q+LDH N +T+ QR+ V+D F+ + +GP+F G EG A +G Sbjct: 34 RFFQQRLDHFNFERFGNKTFPQRFLVSDRFW-VRGEGPIFFYTGNEGDVWAFANNSGFVA 92 Query: 428 NYAKKFNALCINLEHRFYGESHPTLDLSIK--NLQFLSSYQALADLANFISSMKQKFRLN 601 A + AL + EHR+YG+S P S + + + L+ QALAD A + ++++ Sbjct: 93 ELAAERGALLVFAEHRYYGKSLPFGAQSTQRGHTELLTVEQALADFAELLRALRRDLGAQ 152 Query: 602 EKVKWIAFGGSYPGSLAAWLR 664 + IAFGGSY G L+A+LR Sbjct: 153 D-APAIAFGGSYGGMLSAYLR 172 >UniRef50_Q2HER6 Cluster: Putative uncharacterized protein; n=1; Chaetomium globosum|Rep: Putative uncharacterized protein - Chaetomium globosum (Soil fungus) Length = 506 Score = 77.8 bits (183), Expect = 2e-13 Identities = 47/150 (31%), Positives = 77/150 (51%), Gaps = 14/150 (9%) Frame = +2 Query: 254 QWFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARW-------MV 412 +WF Q +DH NP DL TW+Q Y VN ++ PV +M GE P + + Sbjct: 46 EWFPQPIDHKNP-DLGTWQQLYCVNPQWW--APGAPVVVMTPGEMPITSAINSGFGYSYL 102 Query: 413 TGTWIN--YAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSM-- 580 + T ++ YA+ A + +EHR++G S P + LQ+L+ QA D+ NF ++ Sbjct: 103 SNTTMSGTYAETLGAAAVVVEHRYFGGSSPYDGFDSETLQYLTMEQAAEDIVNFAKNVVF 162 Query: 581 ---KQKFRLNEKVKWIAFGGSYPGSLAAWL 661 K++ + K W+ +G SY +L +W+ Sbjct: 163 PFDKEQTSVATKTPWVYWGASYAATLGSWI 192 >UniRef50_Q29MX0 Cluster: GA15377-PA; n=4; Endopterygota|Rep: GA15377-PA - Drosophila pseudoobscura (Fruit fly) Length = 444 Score = 77.4 bits (182), Expect = 3e-13 Identities = 54/141 (38%), Positives = 71/141 (50%), Gaps = 6/141 (4%) Frame = +2 Query: 260 FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKN-QGPVFLMIGGEGPADARWMVTGTWINYA 436 F+ LDH + T+ RY NDSF D KN P+F G EG + TG A Sbjct: 9 FQVPLDHFSFLSNATFNIRYLYNDSFVDKKNAHTPIFFYTGNEGDIELFAQNTGFMWELA 68 Query: 437 KKFNALCINLEHRFYGESHP----TLDLSI-KNLQFLSSYQALADLANFISSMKQKFRLN 601 +K AL I EHR+YG+S P T + S+ +L + + Q L D A I+ ++ L Sbjct: 69 
EKQRALLIFAEHRYYGKSLPFGASTFNASMPDHLAYFTVEQTLEDYAMLITFLRNDLPL- 127 Query: 602 EKVKWIAFGGSYPGSLAAWLR 664 +AFGGSY G LAAW R Sbjct: 128 ---PVVAFGGSYGGMLAAWFR 145 >UniRef50_Q7Z5N6 Cluster: Thymus specific serine peptidase; n=4; Homo/Pan/Gorilla group|Rep: Thymus specific serine peptidase - Homo sapiens (Human) Length = 138 Score = 76.6 bits (180), Expect = 5e-13 Identities = 38/77 (49%), Positives = 49/77 (63%) Frame = +2 Query: 434 AKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVK 613 A + AL I+LEHRFYG S P L + L+FLSS ALAD+ + ++ + F ++ Sbjct: 10 APAWGALVISLEHRFYGLSIPAGGLEMAQLRFLSSRLALADVVSARLALSRLFNISSSSP 69 Query: 614 WIAFGGSYPGSLAAWLR 664 WI FGGSY GSLAAW R Sbjct: 70 WICFGGSYAGSLAAWAR 86 >UniRef50_Q7Z5N5 Cluster: Thymus specific serine peptidase; n=3; Catarrhini|Rep: Thymus specific serine peptidase - Homo sapiens (Human) Length = 155 Score = 76.6 bits (180), Expect = 5e-13 Identities = 38/77 (49%), Positives = 49/77 (63%) Frame = +2 Query: 434 AKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVK 613 A + AL I+LEHRFYG S P L + L+FLSS ALAD+ + ++ + F ++ Sbjct: 10 APAWGALVISLEHRFYGLSIPAGGLEMAQLRFLSSRLALADVVSARLALSRLFNISSSSP 69 Query: 614 WIAFGGSYPGSLAAWLR 664 WI FGGSY GSLAAW R Sbjct: 70 WICFGGSYAGSLAAWAR 86 >UniRef50_Q5KFY9 Cluster: Putative uncharacterized protein; n=4; Filobasidiella neoformans|Rep: Putative uncharacterized protein - Cryptococcus neoformans (Filobasidiella neoformans) Length = 561 Score = 76.6 bits (180), Expect = 5e-13 Identities = 55/159 (34%), Positives = 81/159 (50%), Gaps = 16/159 (10%) Frame = +2 Query: 236 SNLPPPQWFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADAR--WM 409 S++ P F Q + H + S T+ QRY+V+ S Y + GP++L+ GGE + R ++ Sbjct: 73 SSIFEPYCFPQFISHFDESVNGTFCQRYWVDASSY--RPGGPIYLLDGGETSGEYRLPFL 130 Query: 410 VTGTWINYAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMK-- 583 G + L + LEHR+YGES P S +L+FL++ +AL D A FI + K Sbjct: 131 EKGILDILSNATGGLSVVLEHRYYGESVPVSSFSTDDLRFLNNAEALEDSAYFIENFKLP 190 Query: 584 
------QKFRLNE------KVKWIAFGGSYPGSLAAWLR 664 F L E WI +GGSY G+ AA +R Sbjct: 191 ASLSNALPFELEETAFHPNNTPWIYYGGSYAGARAAHMR 229 >UniRef50_UPI0000078353 Cluster: C46C2.4; n=1; Caenorhabditis elegans|Rep: C46C2.4 - Caenorhabditis elegans Length = 614 Score = 76.2 bits (179), Expect = 7e-13 Identities = 52/122 (42%), Positives = 67/122 (54%), Gaps = 3/122 (2%) Frame = +2 Query: 308 KQRYYVNDSFYDFKNQGPVFLMIGGEG---PADARWMVTGTWINYAKKFNALCINLEHRF 478 +QR++ N + K GP FL IG EG P R+ V + A+KF A LEHRF Sbjct: 199 QQRFFKNSKYA--KEGGPNFLCIGQEGREDPNSIRFDVFAV-VEKAQKFGATVYVLEHRF 255 Query: 479 YGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKWIAFGGSYPGSLAAW 658 YG+S+ D S +L LSS Q L DLA I ++ + N WI FGGSY G L+AW Sbjct: 256 YGDSNVG-DNS--DLSKLSSLQMLYDLAEIIK--EENLKTNTSNPWITFGGSYSGMLSAW 310 Query: 659 LR 664 +R Sbjct: 311 MR 312 Score = 66.9 bits (156), Expect = 4e-10 Identities = 35/102 (34%), Positives = 59/102 (57%), Gaps = 1/102 (0%) Frame = +2 Query: 311 QRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYAKKFNALCINLEHRFYGES 490 QRY + SF N+ L + G+ + + G ++ A++F A LEHRFYG S Sbjct: 15 QRYLYSHSF-SLNNKKIALLYVSGQNTFNENILKQGPFVQAAEEFGASMFALEHRFYGNS 73 Query: 491 HP-TLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVK 613 P +++L+ K+L++L S +A+ D+ +FI+ +KF +N VK Sbjct: 74 KPRSMNLTSKDLRYLKSSEAVQDIISFINYSNKKFNMNPGVK 115 >UniRef50_Q54HT4 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 513 Score = 75.4 bits (177), Expect = 1e-12 Identities = 57/151 (37%), Positives = 76/151 (50%), Gaps = 15/151 (9%) Frame = +2 Query: 257 WFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQG--------PVFLMIGGEGPA----DA 400 WF Q LDH N + QR + D +++ K++ P+ G EG + Sbjct: 60 WFNQTLDHFNFETSGYFNQRVLIIDQYFNEKSKNEIDQICTKPLIFFCGNEGDVTFFYEN 119 Query: 401 RWMVTGTWINYAKKFNALCINLEHRFYGESHPTLDLSI--KNLQFLSSYQALADLANFIS 574 +T T A++ NAL I EHR+YGES P + S +N Q+LSS QALAD + I Sbjct: 120 SLFITNT---LAQEMNALVIFAEHRYYGESLPFGNQSYTNENFQYLSSEQALADYSKIIP 176 Query: 575 
S-MKQKFRLNEKVKWIAFGGSYPGSLAAWLR 664 S +KQ LN V GSY G LAAW+R Sbjct: 177 SILKQYNALNCPV--FTTSGSYGGDLAAWMR 205 >UniRef50_Q9VIM0 Cluster: CG2493-PA; n=3; Diptera|Rep: CG2493-PA - Drosophila melanogaster (Fruit fly) Length = 475 Score = 74.9 bits (176), Expect = 2e-12 Identities = 52/141 (36%), Positives = 74/141 (52%), Gaps = 6/141 (4%) Frame = +2 Query: 260 FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKN-QGPVFLMIGGEGPADARWMVTGTWINYA 436 F+ LDH + T+ RY NDSF D N + P+F G EG + TG A Sbjct: 40 FQVPLDHFSFLINATFNIRYLYNDSFVDKSNARTPIFFYTGNEGDIELFAQNTGFLWEQA 99 Query: 437 KKFNALCINLEHRFYGESHP----TLDLSI-KNLQFLSSYQALADLANFISSMKQKFRLN 601 ++ AL I EHR+YG+S P T + S+ ++L + + Q L D A I+ + R + Sbjct: 100 ERQRALVIFAEHRYYGKSLPFGSSTFNTSLPEHLAYFTVEQTLEDYAMLITFL----RND 155 Query: 602 EKVKWIAFGGSYPGSLAAWLR 664 ++ +AFGGSY G LAAW R Sbjct: 156 RQMPVVAFGGSYGGMLAAWFR 176 >UniRef50_Q5BYD1 Cluster: SJCHGC06818 protein; n=2; Schistosoma japonicum|Rep: SJCHGC06818 protein - Schistosoma japonicum (Blood fluke) Length = 271 Score = 74.9 bits (176), Expect = 2e-12 Identities = 45/140 (32%), Positives = 74/140 (52%), Gaps = 3/140 (2%) Frame = +2 Query: 254 QWFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINY 433 ++F+ K+DH + ++ +Y +N+ F + GP+ G EG + +G Sbjct: 37 KYFRTKIDHFSFVTDGEFEIKYLINNE--SFSSGGPILFYTGNEGAIETFAENSGFIWKL 94 Query: 434 AKKFNALCINLEHRFYGESHPTLDLSIKNLQ---FLSSYQALADLANFISSMKQKFRLNE 604 A++ NA + EHR+YG S P + S K+ Q +L++ QALAD I+ +K + Sbjct: 95 AEELNASVVFAEHRYYGTSLPFGNDSFKDRQYFGYLTAEQALADYVLLINQLKVNYSCFA 154 Query: 605 KVKWIAFGGSYPGSLAAWLR 664 I+FGGSY G L+AW+R Sbjct: 155 SSPVISFGGSYGGMLSAWIR 174 >UniRef50_A7EU48 Cluster: Putative uncharacterized protein; n=1; Sclerotinia sclerotiorum 1980|Rep: Putative uncharacterized protein - Sclerotinia sclerotiorum 1980 Length = 588 Score = 74.9 bits (176), Expect = 2e-12 Identities = 51/149 (34%), Positives = 74/149 (49%), Gaps = 7/149 (4%) Frame = +2 Query: 239 NLPPPQWFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADAR--WMV 412 NL P + P T+ RY+ + ++Y K GPV ++ GE A R 
++ Sbjct: 135 NLSVPIDYFHNESRYEPHSNGTFPLRYWFDATYY--KPGGPVIVLQSGETDATGRLPFLQ 192 Query: 413 TGTWINYAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANF-----ISS 577 G A N + + LEHR+YGES PT D S KNL+FL++ QAL D F Sbjct: 193 NGLLHQLAVATNGIGVVLEHRYYGESIPTPDFSTKNLRFLTTEQALMDEVYFARNIVFPG 252 Query: 578 MKQKFRLNEKVKWIAFGGSYPGSLAAWLR 664 ++ + V +I +GGSY G+ A+LR Sbjct: 253 LEDQNLTAPNVAYIGYGGSYAGAFNAFLR 281 >UniRef50_Q22MF3 Cluster: Serine carboxypeptidase S28 family protein; n=2; Tetrahymena thermophila SB210|Rep: Serine carboxypeptidase S28 family protein - Tetrahymena thermophila SB210 Length = 502 Score = 74.5 bits (175), Expect = 2e-12 Identities = 50/154 (32%), Positives = 83/154 (53%), Gaps = 17/154 (11%) Frame = +2 Query: 254 QWFKQKLDHSN-PSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTW-I 427 ++F Q +DH + +T+KQ+Y + D +Y + ++GP+ G E P D + G Sbjct: 22 KYFDQLVDHIGFETGDKTFKQKYLIKDDYYRY-DKGPILFYCGNEAPVDFSFGGAGFMHT 80 Query: 428 NYAKKFNALCINLEHRFYGESHP--TLDLSIK--NLQFLSSYQALADLANFISSMKQKFR 595 A++ NAL + +EHR++GES P T S K N ++L+S+QA+ D A F+ K+ Sbjct: 81 TLAQELNALVVFMEHRYFGESQPFGTEKESFKKGNNKYLTSFQAINDYAKFLVWFKKSLG 140 Query: 596 L-NEKVKWIAFG----------GSYPGSLAAWLR 664 +++ +AFG SY G L+AW+R Sbjct: 141 CGDDECPVVAFGALSNIFINYKASYGGMLSAWIR 174 >UniRef50_Q2GU64 Cluster: Putative uncharacterized protein; n=1; Chaetomium globosum|Rep: Putative uncharacterized protein - Chaetomium globosum (Soil fungus) Length = 472 Score = 74.5 bits (175), Expect = 2e-12 Identities = 48/144 (33%), Positives = 75/144 (52%), Gaps = 10/144 (6%) Frame = +2 Query: 260 FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQG-PVFLMIGGEGPA---DARWMVTGTWI 427 F Q +DH++P+ L T+KQRY+ F+ K G P++L+ GE + W+ + Sbjct: 54 FDQLIDHADPA-LGTFKQRYWYGTEFW--KGPGSPIYLVTPGEQTGTGFNRTWLGSARLS 110 Query: 428 NY-AKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKF---- 592 A + + LEHR++G S P +L+++NLQ+L+ +L DL F + F Sbjct: 111 GLMANQTGGAVVILEHRYWGGSSPYANLTVENLQYLTLDNSLKDLTYFAKNFVPPFDDSG 170 Query: 593 -RLNEKVKWIAFGGSYPGSLAAWL 661 K W+ GGSY G+LA WL Sbjct: 171 
ASSAGKAPWVFAGGSYAGALAGWL 194 >UniRef50_Q54H23 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 513 Score = 74.1 bits (174), Expect = 3e-12 Identities = 55/152 (36%), Positives = 75/152 (49%), Gaps = 12/152 (7%) Frame = +2 Query: 245 PPPQ---WFKQKLDHSNPSDLRTWKQRYYVNDSFY------DFKNQGPVFLMIGGEGPAD 397 PPP +F Q LDH N + QRY V+D ++ D QGP+ G EG Sbjct: 59 PPPYQELFFLQTLDHFNFQSKGEFAQRYLVSDVYWKKPSPNDKVCQGPILFYTGNEGDIT 118 Query: 398 ARWMVTGTWINY-AKKFNALCINLEHRFYGESHP--TLDLSIKNLQFLSSYQALADLANF 568 + + N A++ NAL I EHR+YGES P + N+ +L+S QALAD A Sbjct: 119 LFYDNSQFVTNVLAQEMNALLIFAEHRYYGESLPFGNDSWTSDNIGYLTSEQALADYAQL 178 Query: 569 ISSMKQKFRLNEKVKWIAFGGSYPGSLAAWLR 664 I ++ + E ++ GGSY G L AW R Sbjct: 179 IPAVLSEMGA-EHCPVLSVGGSYGGMLTAWFR 209 >UniRef50_A2E983 Cluster: Clan SC, family S28, unassigned serine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan SC, family S28, unassigned serine peptidase - Trichomonas vaginalis G3 Length = 437 Score = 74.1 bits (174), Expect = 3e-12 Identities = 47/136 (34%), Positives = 71/136 (52%), Gaps = 1/136 (0%) Frame = +2 Query: 260 FKQKLDHSNPSDLRTWKQRYY-VNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYA 436 FKQ LDH N T+ Q YY V D + + IG E V+ A Sbjct: 20 FKQTLDHENTGS-ETFDQYYYEVTDHVVG--QPKAIIVKIGAESDKLVASGVSDFNAVLA 76 Query: 437 KKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKW 616 K++NA+ + ++HRF+G+S P L++ L+FL+ QA+ D F + + +LN + W Sbjct: 77 KRYNAIVLTIQHRFFGKSIPQDGLTVDKLKFLTVEQAVQDYKVFHDYYQNEKKLN--LPW 134 Query: 617 IAFGGSYPGSLAAWLR 664 + GGSYPG L+A +R Sbjct: 135 LVVGGSYPGLLSALIR 150 >UniRef50_Q0UTR3 Cluster: Predicted protein; n=1; Phaeosphaeria nodorum|Rep: Predicted protein - Phaeosphaeria nodorum (Septoria nodorum) Length = 353 Score = 73.7 bits (173), Expect = 4e-12 Identities = 48/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%) Frame = +2 Query: 275 DHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARW--MVTGTWINYAKKFN 448 D P T+KQRY + S+Y K 
GPVFL IGGE ++R+ + TG +KFN Sbjct: 49 DRYVPHTNDTFKQRYVFDSSYY--KPGGPVFLYIGGETSVESRFSNLQTGIIQILMEKFN 106 Query: 449 ALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMK----QKFRLNEKVKW 616 + + LE+R+YG+S+P + L+FL++ Q +AD A F + V W Sbjct: 107 GIGVILENRYYGKSYPYKTSTTDELRFLTTEQTIADNAYFRQHATFPGVNESLSGPDVPW 166 Query: 617 IAFGGSYPGSLAAW 658 I +GGS G+ A+ Sbjct: 167 IMYGGSLAGAHTAF 180 >UniRef50_Q7S134 Cluster: Putative uncharacterized protein NCU09992.1; n=1; Neurospora crassa|Rep: Putative uncharacterized protein NCU09992.1 - Neurospora crassa Length = 547 Score = 73.3 bits (172), Expect = 5e-12 Identities = 50/144 (34%), Positives = 75/144 (52%), Gaps = 10/144 (6%) Frame = +2 Query: 260 FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQG-PVFLMIGGEGPADA---RWMVTGTWI 427 F Q +DH+ P +L T+KQR++ F +K G P+ L+ GE AD ++ Sbjct: 54 FDQLIDHNTP-ELGTFKQRFWYG--FQYWKGPGSPIILVNPGEQAADGFNKSYLSDQRLA 110 Query: 428 NY-AKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNE 604 + AK A + +EHR++G S P +L++KNLQ+L+ +L D+ F + F Sbjct: 111 GWMAKDMGAAVVIMEHRYWGNSSPFDELTVKNLQYLTLENSLKDINYFAEHIDLPFDKTN 170 Query: 605 KVK-----WIAFGGSYPGSLAAWL 661 K WI GGSY G+LA WL Sbjct: 171 GSKPANAPWIFSGGSYSGALAGWL 194 >UniRef50_A6SA13 Cluster: Putative uncharacterized protein; n=1; Botryotinia fuckeliana B05.10|Rep: Putative uncharacterized protein - Botryotinia fuckeliana B05.10 Length = 563 Score = 72.9 bits (171), Expect = 6e-12 Identities = 49/149 (32%), Positives = 75/149 (50%), Gaps = 7/149 (4%) Frame = +2 Query: 239 NLPPPQWFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADAR--WMV 412 NL P + P T+ RY+ + ++Y K GPV ++ GE A+ R ++ Sbjct: 60 NLTVPIDYFHNESRYEPHSNGTFPLRYWFDATYY--KPGGPVIVLQSGETDAEGRLPFLQ 117 Query: 413 TGTWINYAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANF-----ISS 577 G A N + + LEHR+YG+S PT D S +NL+FL++ QAL D F Sbjct: 118 KGILHQLAVATNGIGVVLEHRYYGQSIPTPDFSTENLRFLTTEQALMDEVYFARNIVFPG 177 Query: 578 MKQKFRLNEKVKWIAFGGSYPGSLAAWLR 664 ++ + V +I +GGSY G+ A+LR Sbjct: 178 LEDQNLTAPNVAYIGYGGSYAGAFNAFLR 206 >UniRef50_A4RA99 Cluster: Putative 
uncharacterized protein; n=1; Magnaporthe grisea|Rep: Putative uncharacterized protein - Magnaporthe grisea (Rice blast fungus) (Pyricularia grisea) Length = 542 Score = 72.5 bits (170), Expect = 8e-12 Identities = 46/147 (31%), Positives = 74/147 (50%), Gaps = 14/147 (9%) Frame = +2 Query: 260 FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWIN--- 430 F Q +DH NPS L T+KQRY+ + ++Y P+ + GE + + T T+++ Sbjct: 56 FDQLVDHGNPS-LGTFKQRYWWDTTYYAGAGH-PIVIYNAGE--FNGEYATTNTYVHNRS 111 Query: 431 ----YAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRL 598 A + + +EHR++G+S+P ++ NL L+ ++AD+ NF + K F Sbjct: 112 IPGMVAAEVGGAVVIIEHRYFGQSNPFSQYTVANLSHLNLNNSIADMVNFARTAKLPFAN 171 Query: 599 N-------EKVKWIAFGGSYPGSLAAW 658 +V WI G SY GSLA W Sbjct: 172 GNASATDPSRVPWINVGSSYSGSLADW 198 >UniRef50_Q7QQ95 Cluster: GLP_243_15169_16578; n=1; Giardia lamblia ATCC 50803|Rep: GLP_243_15169_16578 - Giardia lamblia ATCC 50803 Length = 469 Score = 71.3 bits (167), Expect = 2e-11 Identities = 56/165 (33%), Positives = 79/165 (47%), Gaps = 26/165 (15%) Frame = +2 Query: 245 PPPQWF-KQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGT 421 P Q F + ++DH NP + ++QRYY N F + + L IGGEG ++ GT Sbjct: 23 PSTQLFIENRVDHFNPFNQDVFRQRYYYNSEFVRDGSHVAI-LEIGGEGEINSA--PGGT 79 Query: 422 WIN------YAKKFNALCINLEHRFYGESHP------TLDLSIKNLQFLSSYQALADLAN 565 N A + A LEHRFYG SHP D+ L++LSS QA +DL Sbjct: 80 KSNPDILGRIADNYGAHIFVLEHRFYGISHPFQHTSEKYDVGTDKLRYLSSKQAQSDLLY 139 Query: 566 FISSMKQKF-----------RLNEKV--KWIAFGGSYPGSLAAWL 661 FIS M + R+ + +W+ GGSYPG++ W+ Sbjct: 140 FISVMDDRLCPANSKDGSFKRIEGRTCFQWVIVGGSYPGAVTGWI 184 >UniRef50_Q4PHW9 Cluster: Putative uncharacterized protein; n=1; Ustilago maydis|Rep: Putative uncharacterized protein - Ustilago maydis (Smut fungus) Length = 583 Score = 70.1 bits (164), Expect = 4e-11 Identities = 53/165 (32%), Positives = 86/165 (52%), Gaps = 16/165 (9%) Frame = +2 Query: 218 PGGDYQSNLPPPQWFKQKLDHSNPSDLRTWKQRYYVNDSFY---DFKNQG---PVFLMIG 379 P + ++ P + +Q LDH + + + QR++ + Y +N+G P++++ 
Sbjct: 126 PNKKSKHDIKEPAYHRQPLDHFDNTTQAQFDQRFFYSTRHYKPASARNKGEAVPIYILDS 185 Query: 380 GEGPADAR--WMVTGTWINYAKKFNALCINLEHRFYGESHPT-LDLS------IKNLQFL 532 GE A AR ++ TG +K + I LEHR+YG S P DL + L++L Sbjct: 186 GEADATARIPFLDTGILDILSKATGGIGIVLEHRYYGTSLPNRTDLGPGDTWGVDQLRWL 245 Query: 533 SSYQALADLANFISSMKQKFRLN-EKVKWIAFGGSYPGSLAAWLR 664 ++ QAL D A+FI + N EK K I +GGSYPG+ +A +R Sbjct: 246 TNKQALEDSADFIRHLSIPGTDNSEKRKIIYYGGSYPGARSAHMR 290 >UniRef50_Q4DM56 Cluster: Serine carboxypeptidase S28, putative; n=3; Trypanosoma cruzi|Rep: Serine carboxypeptidase S28, putative - Trypanosoma cruzi Length = 631 Score = 66.1 bits (154), Expect = 7e-10 Identities = 45/139 (32%), Positives = 72/139 (51%), Gaps = 1/139 (0%) Frame = +2 Query: 251 PQWFKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWIN 430 P F+Q +DHS T+ QRY+V+ S ++ +++ IG R G Sbjct: 57 PATFRQLVDHSKNGG-STFDQRYWVDYSAWNKSELAMLYIRIGSGDFTSPR----GYPGI 111 Query: 431 YAKKFNALCINLEHRFYGESHP-TLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEK 607 Y + N L LE R+YG+S P L + K ++L+ AL D+ F +++K L +K Sbjct: 112 YGHERNMLLFTLEGRYYGKSLPFPLTETEKLKKYLNVDIALEDIRGFQKFVEEKL-LQKK 170 Query: 608 VKWIAFGGSYPGSLAAWLR 664 ++W+ GGSY G+LA W + Sbjct: 171 LRWLIVGGSYAGALAVWFK 189 >UniRef50_A2FA76 Cluster: Clan SC, family S28, unassigned serine peptidase; n=2; Trichomonas vaginalis G3|Rep: Clan SC, family S28, unassigned serine peptidase - Trichomonas vaginalis G3 Length = 515 Score = 66.1 bits (154), Expect = 7e-10 Identities = 42/135 (31%), Positives = 69/135 (51%), Gaps = 1/135 (0%) Frame = +2 Query: 260 FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYAK 439 F Q LDH+NP +T+ Q+Y+V+ K + + IGG A + A Sbjct: 19 FTQTLDHANPG--KTFSQKYFVSTDHG--KKSDYLIVYIGGFTSLSASDLTDSPMNRIAN 74 Query: 440 KFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLN-EKVKW 616 + + LE+R++G S PT DLS +NL++ + Q L D+ FI +MK+++ + K + Sbjct: 75 NTQSPIVALENRYFGNSIPTDDLSTENLKYNTIDQHLDDIKEFIIAMKKEYCNDASKCRV 134 Query: 617 IAFGGSYPGSLAAWL 661 G + SLA W+ Sbjct: 135 ATIGRGFGASLATWI 149 >UniRef50_A2ERP5 
Cluster: Clan SC, family S28, unassigned serine peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan SC, family S28, unassigned serine peptidase - Trichomonas vaginalis G3 Length = 491 Score = 65.7 bits (153), Expect = 9e-10 Identities = 40/121 (33%), Positives = 67/121 (55%), Gaps = 5/121 (4%) Frame = +2 Query: 311 QRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYAKKFNALCINLEHRFYGES 490 QRY+VN S Y K++ + L +GG D + G + A + ++ I LEHR++G+S Sbjct: 34 QRYFVN-SDYANKSRN-IILYLGGANELDPNEITPGPILEIASQTKSVIIGLEHRYFGKS 91 Query: 491 HPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLN-----EKVKWIAFGGSYPGSLAA 655 PT+++S N+Q+ S QA+ D+ +F+ ++ K R + + K+ G Y G LA Sbjct: 92 VPTVNMSQFNMQYCSVPQAILDIKSFV--LQGKIRNDYCTEPDFCKFFLMGKGYGGGLAT 149 Query: 656 W 658 W Sbjct: 150 W 150 >UniRef50_Q0V7E6 Cluster: Putative uncharacterized protein; n=1; Phaeosphaeria nodorum|Rep: Putative uncharacterized protein - Phaeosphaeria nodorum (Septoria nodorum) Length = 536 Score = 65.3 bits (152), Expect = 1e-09 Identities = 37/106 (34%), Positives = 59/106 (55%), Gaps = 6/106 (5%) Frame = +2 Query: 365 FLMIGGEGPADAR--WMVTGTWINYAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSS 538 F+++GGE R ++ G + L + LEHR+YG+S P DL+ KN++FLS+ Sbjct: 89 FVLLGGETDGAGRLPFLQKGIVHQVIEATGGLGVILEHRYYGKSFPVDDLTTKNMRFLST 148 Query: 539 YQALADLANFISSMK----QKFRLNEKVKWIAFGGSYPGSLAAWLR 664 QALA++ F ++K W+ +GGSY G+ AA++R Sbjct: 149 DQALAEIDYFARNVKFEGIDADLTAPNTPWVVYGGSYAGAQAAFMR 194 >UniRef50_A7EHM7 Cluster: Putative uncharacterized protein; n=1; Sclerotinia sclerotiorum 1980|Rep: Putative uncharacterized protein - Sclerotinia sclerotiorum 1980 Length = 440 Score = 64.1 bits (149), Expect = 3e-09 Identities = 45/130 (34%), Positives = 66/130 (50%), Gaps = 7/130 (5%) Frame = +2 Query: 287 PSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARW--MVTGTWINYAKKFNALCI 460 P T+KQRY+ + ++Y K GP++L IGGE R+ + TG + N L I Sbjct: 36 PHTNATFKQRYWFDATYY--KPGGPIYLYIGGETNGQYRFSNLQTGIIQILMEATNGLGI 93 Query: 461 NLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKV-----KWIAF 625 LE+R+YGES P + L +L++ Q +AD A F + 
+N + KWI + Sbjct: 94 ILENRYYGESFPFNTSTTDQLAYLTNQQTVADNAYFAQHVSLP-GVNASITAPNTKWILY 152 Query: 626 GGSYPGSLAA 655 GGS G A Sbjct: 153 GGSLAGGQTA 162 >UniRef50_A4R3D5 Cluster: Putative uncharacterized protein; n=1; Magnaporthe grisea|Rep: Putative uncharacterized protein - Magnaporthe grisea (Rice blast fungus) (Pyricularia grisea) Length = 529 Score = 59.3 bits (137), Expect = 8e-08 Identities = 47/139 (33%), Positives = 69/139 (49%), Gaps = 5/139 (3%) Frame = +2 Query: 260 FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWIN--- 430 F Q +DH NP L T+KQRY+ +++ P+ + GE AD + VT T Sbjct: 59 FDQLIDHENPQ-LGTFKQRYWYGTQYWNGTGS-PIVITTPGEQAADG-FNVTYTTKRRLT 115 Query: 431 --YAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNE 604 A+K A I LEHR++GES P +L+ +NL++L+ ++ DL Sbjct: 116 GLMAEKTGAAVIVLEHRYWGESSPYQELTTENLKYLTLNNSIHDL--------------- 160 Query: 605 KVKWIAFGGSYPGSLAAWL 661 I GGSY G+LA W+ Sbjct: 161 ----IYSGGSYSGALAGWI 175 >UniRef50_Q9VDX1 Cluster: CG11626-PA; n=2; Sophophora|Rep: CG11626-PA - Drosophila melanogaster (Fruit fly) Length = 270 Score = 56.8 bits (131), Expect = 4e-07 Identities = 33/67 (49%), Positives = 41/67 (61%), Gaps = 2/67 (2%) Frame = +2 Query: 467 EHRFYGESHPTLDLS--IKNLQFLSSYQALADLANFISSMKQKFRLNEKVKWIAFGGSYP 640 EHR+YG S P + S + NL+ LS +Q+LADLA+FI K E K I GGSY Sbjct: 13 EHRYYGLSLPFGNESYRLSNLKQLSLHQSLADLAHFIRHQKSNDPEMEDSKVILVGGSYS 72 Query: 641 GSLAAWL 661 GSL AW+ Sbjct: 73 GSLVAWM 79 >UniRef50_A2WVG2 Cluster: Putative uncharacterized protein; n=3; Oryza sativa|Rep: Putative uncharacterized protein - Oryza sativa subsp. 
indica (Rice) Length = 549 Score = 54.4 bits (125), Expect = 2e-06 Identities = 31/73 (42%), Positives = 46/73 (63%), Gaps = 5/73 (6%) Frame = +2 Query: 461 NLEHRFYGESHP--TLDLSIKN---LQFLSSYQALADLANFISSMKQKFRLNEKVKWIAF 625 +L+HR+YGES P + D + N L +L++ QALAD A ++ +K+ +E + F Sbjct: 158 SLQHRYYGESMPFGSKDKAYNNSKSLAYLTAEQALADYAVLLTDLKKNLS-SEGSPVVLF 216 Query: 626 GGSYPGSLAAWLR 664 GGSY G LAAW+R Sbjct: 217 GGSYGGMLAAWMR 229 >UniRef50_A2FQM0 Cluster: Putative uncharacterized protein; n=1; Trichomonas vaginalis G3|Rep: Putative uncharacterized protein - Trichomonas vaginalis G3 Length = 323 Score = 52.0 bits (119), Expect = 1e-05 Identities = 32/80 (40%), Positives = 44/80 (55%), Gaps = 1/80 (1%) Frame = +2 Query: 428 NYAKKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEK 607 N A K N+ I++EHRF+G S P +S ++Q+LS L DL+ + +K N Sbjct: 7 NLAIKTNSTLISIEHRFFGTSKPA--VSYFSIQYLSIQNILEDLSLVLQDIKNN---NPN 61 Query: 608 VKWIAFGG-SYPGSLAAWLR 664 +K I G Y GSLAAW R Sbjct: 62 IKRIFVAGCGYAGSLAAWFR 81 >UniRef50_Q2UKB6 Cluster: Predicted protein; n=1; Aspergillus oryzae|Rep: Predicted protein - Aspergillus oryzae Length = 541 Score = 52.0 bits (119), Expect = 1e-05 Identities = 51/142 (35%), Positives = 69/142 (48%), Gaps = 11/142 (7%) Frame = +2 Query: 272 LDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPA--DARWMVTGT---WINYA 436 +DH +PS + T++ RY+V+ FY K GPVF++ GEG A A+ + G+ + Y Sbjct: 76 IDHEDPS-MGTYQNRYWVSADFY--KPGGPVFVLDAGEGNAYSVAQSYLGGSDNFFAEYL 132 Query: 437 KKFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLN----- 601 K+FN L + EHR QALADL F +KF LN Sbjct: 133 KEFNGLGLVWEHR----------------------QALADLPYF----AEKFTLNGTDLS 166 Query: 602 -EKVKWIAFGGSYPGSLAAWLR 664 + WI GGSYPG AA+ R Sbjct: 167 PKSSPWIMLGGSYPGMRAAFTR 188 >UniRef50_Q64YV4 Cluster: Putative secreted tripeptidyl aminopeptidase; n=6; Bacteroidales|Rep: Putative secreted tripeptidyl aminopeptidase - Bacteroides fragilis Length = 455 Score = 50.8 bits (116), Expect = 3e-05 Identities = 44/135 (32%), Positives = 67/135 (49%) Frame = +2 
Query: 260 FKQKLDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYAK 439 FKQ LDHS+P + ++ QR V YD P ++ G G A R + G + +K Sbjct: 81 FKQPLDHSHP-EKGSFSQRVIVAHVGYD----RPTLMVTEGYGAA--RSLNPGYYEELSK 133 Query: 440 KFNALCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNEKVKWI 619 FN I +EHR++ ES P K+ ++L+++ + DL ++++ FR KWI Sbjct: 134 LFNTNIIAVEHRYFLESTP----KPKDWKYLTAWNSARDL----HAIREAFRSIYPGKWI 185 Query: 620 AFGGSYPGSLAAWLR 664 A G S G A R Sbjct: 186 ATGISKGGQTAMLYR 200 >UniRef50_Q2U0Q2 Cluster: Hydrolytic enzymes of the alpha/beta hydrolase fold; n=6; Trichocomaceae|Rep: Hydrolytic enzymes of the alpha/beta hydrolase fold - Aspergillus oryzae Length = 569 Score = 42.3 bits (95), Expect = 0.010 Identities = 33/136 (24%), Positives = 64/136 (47%), Gaps = 5/136 (3%) Frame = +2 Query: 272 LDHSNPSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARWMVTGTWINYAKKFNA 451 +DH N + + T++ R++VND +Y+ P+ + GE A++ + + + F Sbjct: 76 IDH-NDTSVGTYQNRFWVNDDYYEAGR--PIIMYDAGETNAES---IAKNHLTSSLSFFR 129 Query: 452 LCINLEHRFYGESHPTLDLSIKNLQFLSSYQALADLANFISSM-KQKFRLNE----KVKW 616 + H + D ++ ++L++ QAL D+ F + + KF ++ W Sbjct: 130 KILEDTHAMGIIWEHSRDTPPEHFKYLTTKQALEDIPYFARNFSRPKFAEHDLTPSSTPW 189 Query: 617 IAFGGSYPGSLAAWLR 664 + GGSY G AA+ R Sbjct: 190 VLVGGSYAGIRAAFAR 205 >UniRef50_UPI0000D56B19 Cluster: PREDICTED: similar to CG31349-PB, isoform B; n=1; Tribolium castaneum|Rep: PREDICTED: similar to CG31349-PB, isoform B - Tribolium castaneum Length = 1543 Score = 29.5 bits (63), Expect(2) = 0.21 Identities = 11/25 (44%), Positives = 16/25 (64%) Frame = +2 Query: 182 HLGRSNGGNLGIPGGDYQSNLPPPQ 256 H G NGG++G P G+ +PPP+ Sbjct: 1152 HNGFDNGGHMGPPNGNGYKPVPPPK 1176 Score = 27.5 bits (58), Expect(2) = 0.21 Identities = 11/28 (39%), Positives = 13/28 (46%) Frame = +2 Query: 245 PPPQWFKQKLDHSNPSDLRTWKQRYYVN 328 PP ++ HSNP D R YY N Sbjct: 1197 PPMPTYQHGKSHSNPVDQRAQNMNYYYN 1224 >UniRef50_A6SFQ5 Cluster: Putative uncharacterized protein; n=1; Botryotinia fuckeliana B05.10|Rep: Putative uncharacterized protein - Botryotinia 
fuckeliana B05.10 Length = 450 Score = 34.3 bits (75), Expect = 2.7 Identities = 24/63 (38%), Positives = 34/63 (53%), Gaps = 8/63 (12%) Frame = +2 Query: 257 WFKQKLDHSN------PSDLRTWKQRYYVNDSFYDFKNQGPVFLMIGGEGPADARW--MV 412 W+ Q++DH P T+KQRY+ + +Y K GPV+L IGGE R+ + Sbjct: 11 WY-QRIDHFPNDPAYAPHTNATFKQRYWYDAKYY--KPGGPVYLYIGGETNGQNRFSNLQ 67 Query: 413 TGT 421 TGT Sbjct: 68 TGT 70 >UniRef50_A5UT10 Cluster: Spermine synthase; n=2; Roseiflexus|Rep: Spermine synthase - Roseiflexus sp. RS-1 Length = 819 Score = 33.5 bits (73), Expect = 4.7 Identities = 21/63 (33%), Positives = 27/63 (42%), Gaps = 3/63 (4%) Frame = +2 Query: 218 PGGDYQSNLPPPQWFKQKLDHSNPSDLR-TWKQRYYVNDSFYDFKNQGPVF--LMIGGEG 388 P G Y NL ++F+ LDH NP L TW V +F F ++IG Sbjct: 690 PNGAYSGNLYSEEYFRLLLDHLNPGGLAVTWVPTERVRTTFIQVFPHYVDFGDILIGSNA 749 Query: 389 PAD 397 P D Sbjct: 750 PID 752 >UniRef50_A1IEM6 Cluster: Hydrolase of the alpha/beta-hydrolase fold; n=1; Candidatus Desulfococcus oleovorans Hxd3|Rep: Hydrolase of the alpha/beta-hydrolase fold - Candidatus Desulfococcus oleovorans Hxd3 Length = 325 Score = 33.5 bits (73), Expect = 4.7 Identities = 19/51 (37%), Positives = 29/51 (56%) Frame = +2 Query: 341 DFKNQGPVFLMIGGEGPADARWMVTGTWINYAKKFNALCINLEHRFYGESH 493 D N+G V L+ G EG +D+ ++V+ Y + N +NL R +GESH Sbjct: 62 DGPNKGLVILIHGWEGSSDSMYLVSSAGHLYNQGLNVFRLNL--RDHGESH 110 >UniRef50_UPI0000D9A547 Cluster: PREDICTED: similar to mucin 4, partial; n=2; Macaca mulatta|Rep: PREDICTED: similar to mucin 4, partial - Macaca mulatta Length = 496 Score = 33.1 bits (72), Expect = 6.2 Identities = 31/112 (27%), Positives = 49/112 (43%) Frame = -1 Query: 494 DDSLHKNDVLNLYKVH*TSLHN*SKFLSPSILHQLDPHHQSSKILGLDF*NHKNCHLHNS 315 D LH +D+ + LH+ S P+ +L H S I ++ +H+ HLH+S Sbjct: 101 DCHLHHSDIQTPDHLSDCHLHH-SNIHPPN---KLSDRHSPSHIHTPNYLSHR--HLHHS 154 Query: 314 AVSKFEDRMDLSGLTFA*TTVEVANLIDNHHQVFLNYHHYYGPNEIFLHHQH 159 V +D D + + N + +HH L Y H + PN + HH H Sbjct: 155 HVHIPDDLPDHH---LCHSHIHTPNNLPDHH---LRYSHIHTPNHLADHHLH 200 >UniRef50_A3LVZ5 
Cluster: Predicted protein; n=1; Pichia stipitis|Rep: Predicted protein - Pichia stipitis (Yeast) Length = 414 Score = 33.1 bits (72), Expect = 6.2 Identities = 19/45 (42%), Positives = 29/45 (64%), Gaps = 1/45 (2%) Frame = +2 Query: 473 RFY-GESHPTLDLSIKNLQFLSSYQALADLANFISSMKQKFRLNE 604 RF+ E+H ++DL IK+L +L +A LA+F++S FR NE Sbjct: 17 RFHQDENHDSIDLQIKSLLYLFKRCQIAKLAHFLNS-DGSFRYNE 60 >UniRef50_Q1KMD3 Cluster: Heterogeneous nuclear ribonucleoprotein U-like protein 2; n=18; Theria|Rep: Heterogeneous nuclear ribonucleoprotein U-like protein 2 - Homo sapiens (Human) Length = 747 Score = 33.1 bits (72), Expect = 6.2 Identities = 22/84 (26%), Positives = 37/84 (44%), Gaps = 3/84 (3%) Frame = +2 Query: 362 VFLMIGGEGPADARWMVTGTWINYAKKFNAL---CINLEHRFYGESHPTLDLSIKNLQFL 532 V LM+G G +W + N K++N L + + R G P +D ++L Sbjct: 454 VILMVGLPGSGKTQWALKYAKENPEKRYNVLGAETVLNQMRMKGLEEPEMDPKSRDLLVQ 513 Query: 533 SSYQALADLANFISSMKQKFRLNE 604 + Q L+ L S K+ F L++ Sbjct: 514 QASQCLSKLVQIASRTKRNFILDQ 537 >UniRef50_Q73LY9 Cluster: Putative uncharacterized protein; n=1; Treponema denticola|Rep: Putative uncharacterized protein - Treponema denticola Length = 354 Score = 32.7 bits (71), Expect = 8.2 Identities = 15/29 (51%), Positives = 19/29 (65%) Frame = +2 Query: 104 STNMKLYTILFNLYVALISVDGVKKFHLG 190 S+N K+YTILF L + V VKK H+G Sbjct: 158 SSNEKVYTILFGLIDEITEVFSVKKIHVG 186 >UniRef50_A3Y6D3 Cluster: Sensor protein; n=1; Marinomonas sp. MED121|Rep: Sensor protein - Marinomonas sp. 
MED121 Length = 2128 Score = 32.7 bits (71), Expect = 8.2 Identities = 15/34 (44%), Positives = 22/34 (64%) Frame = -1 Query: 248 VANLIDNHHQVFLNYHHYYGPNEIFLHHQH*LAL 147 V NLID++H + LNY+H+ E F+ + LAL Sbjct: 1099 VNNLIDSNHVLCLNYYHFSKTMEAFIFGDYPLAL 1132

  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284

Lambda     K      H
   0.318    0.134    0.401

Gapped
Lambda     K      H
   0.279   0.0580    0.190

Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 674,312,767
Number of Sequences: 1657284
Number of extensions: 13611566
Number of successful extensions: 34689
Number of sequences better than 10.0: 121
Number of HSP's better than 10.0 without gapping: 33078
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 34416
length of database: 575,637,011
effective HSP length: 98
effective length of database: 413,223,179
effective search space used: 50826451017
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
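
The search statistics above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the BLAST output: it assumes the usual Karlin-Altschul relations (effective lengths obtained by subtracting the effective HSP length, bit score S' = (lambda*S - ln K) / ln 2, and E = search space * 2^-S'), and it assumes the 665-letter nucleotide query counts as 665 // 3 residues after translation. The function names are illustrative only. Because lambda and K are printed rounded, the bit scores and E-values should agree with the report only approximately.

```python
import math

# Values copied from the statistics footer of this report.
DB_LETTERS = 575_637_011
DB_SEQUENCES = 1_657_284
HSP_LENGTH = 98          # "effective HSP length"
GAPPED_LAMBDA = 0.279    # gapped Lambda
GAPPED_K = 0.0580        # gapped K

# Assumption: the translated query length is the nucleotide length // 3.
QUERY_RESIDUES = 665 // 3


def effective_db_length():
    """Database letters minus one effective HSP length per sequence."""
    return DB_LETTERS - HSP_LENGTH * DB_SEQUENCES


def effective_search_space():
    """Effective query length times effective database length."""
    return (QUERY_RESIDUES - HSP_LENGTH) * effective_db_length()


def bit_score(raw_score):
    """Convert a raw score to bits using the gapped Lambda and K."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def e_value(raw_score):
    """Expected number of chance hits with at least this score."""
    return effective_search_space() * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    print("effective length of database:", effective_db_length())
    print("effective search space:", effective_search_space())
    # Raw score 137 is the Magnaporthe grisea hit (reported 59.3 bits, 8e-08).
    print("bits for raw score 137: %.1f" % bit_score(137))
    print("E-value for raw score 137: %.1e" % e_value(137))
```

With the rounded parameters, the two integer quantities reproduce the footer exactly, while the bit score and E-value for a given raw score land close to, but not necessarily identical with, the values printed in the alignment headers.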