SilkBase

Last updated: 2022/11/18
BLASTP 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= BGIBMGA001098-TA|BGIBMGA001098-PA|IPR000917|Sulfatase
         (455 letters)

Database: uniref50 
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_Q8SZ72 Cluster: RE14504p; n=9; Eumetazoa|Rep: RE14504p ...   502   e-141
UniRef50_A7RFN2 Cluster: Predicted protein; n=2; Nematostella ve...   360   4e-98
UniRef50_UPI0000E49A98 Cluster: PREDICTED: similar to ENSANGP000...   355   1e-96
UniRef50_Q9VVM4 Cluster: CG7402-PA; n=3; Diptera|Rep: CG7402-PA ...   355   2e-96
UniRef50_UPI0000D55F5E Cluster: PREDICTED: similar to CG8646-PA;...   352   1e-95
UniRef50_Q17B03 Cluster: Arylsulfatase b; n=3; Culicidae|Rep: Ar...   350   4e-95
UniRef50_UPI00015B5C4D Cluster: PREDICTED: similar to ENSANGP000...   347   3e-94
UniRef50_UPI0000D55D4D Cluster: PREDICTED: similar to CG8646-PA;...   330   6e-89
UniRef50_UPI00015B40BD Cluster: PREDICTED: similar to RE14504p; ...   323   7e-87
UniRef50_Q8IQS4 Cluster: CG32191-PA; n=2; Sophophora|Rep: CG3219...   317   4e-85
UniRef50_UPI0000DB708B Cluster: PREDICTED: similar to CG7402-PA ...   316   1e-84
UniRef50_UPI0000D56522 Cluster: PREDICTED: similar to CG7402-PA;...   311   2e-83
UniRef50_Q9VVM1 Cluster: CG7408-PB; n=2; Drosophila melanogaster...   310   4e-83
UniRef50_UPI00015B51A4 Cluster: PREDICTED: similar to arylsulfat...   309   9e-83
UniRef50_UPI0000DB708D Cluster: PREDICTED: similar to CG8646-PA;...   305   1e-81
UniRef50_UPI0000D56521 Cluster: PREDICTED: similar to CG7402-PA;...   303   8e-81
UniRef50_A7SBG5 Cluster: Predicted protein; n=1; Nematostella ve...   293   5e-78
UniRef50_Q8MPH9 Cluster: Glucosinolate sulphatase; n=3; Plutella...   288   2e-76
UniRef50_P50429 Cluster: Arylsulfatase B precursor; n=17; Eumeta...   266   6e-70
UniRef50_Q5FYB0 Cluster: Arylsulfatase J precursor; n=69; Eumeta...   259   9e-68
UniRef50_Q9NJU8 Cluster: Sulfatase 1; n=3; Coelomata|Rep: Sulfat...   258   2e-67
UniRef50_A7SRP2 Cluster: Predicted protein; n=2; Nematostella ve...   257   5e-67
UniRef50_UPI0000586CBA Cluster: PREDICTED: similar to arylsulfat...   253   6e-66
UniRef50_UPI0000E46777 Cluster: PREDICTED: similar to arylsulfat...   223   1e-56
UniRef50_A7S8Q2 Cluster: Predicted protein; n=2; Nematostella ve...   223   1e-56
UniRef50_A7SPY2 Cluster: Predicted protein; n=4; Eumetazoa|Rep: ...   214   5e-54
UniRef50_A3ZMN6 Cluster: Arylsulfatase B; n=1; Blastopirellula m...   197   4e-49
UniRef50_A7IPG5 Cluster: Sulfatase precursor; n=1; Xanthobacter ...   186   1e-45
UniRef50_Q18837 Cluster: Sulfatase domain protein protein 3, iso...   179   1e-43
UniRef50_UPI0000587D99 Cluster: PREDICTED: similar to arylsulfat...   163   7e-39
UniRef50_UPI0000E48607 Cluster: PREDICTED: similar to arylsulfat...   163   1e-38
UniRef50_UPI0000F20AE2 Cluster: PREDICTED: similar to Arylsulfat...   156   1e-36
UniRef50_A6DKP3 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...   150   9e-35
UniRef50_A6DSH3 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...   149   1e-34
UniRef50_UPI0000660330 Cluster: Arylsulfatase I precursor (EC 3....   149   2e-34
UniRef50_A3HWU7 Cluster: N-acetylgalactosamine 6-sulfatase; n=2;...   143   1e-32
UniRef50_Q4SNM7 Cluster: Chromosome 15 SCAF14542, whole genome s...   142   2e-32
UniRef50_Q15XG7 Cluster: Sulfatase precursor; n=2; Bacteria|Rep:...   141   3e-32
UniRef50_A6DKC9 Cluster: Sulfatase; n=1; Lentisphaera araneosa H...   138   2e-31
UniRef50_Q7UGD7 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...   136   1e-30
UniRef50_A4CMB0 Cluster: Arylsulfatase A; n=5; Bacteria|Rep: Ary...   134   6e-30
UniRef50_A6DLE2 Cluster: Sulfatase; n=1; Lentisphaera araneosa H...   133   8e-30
UniRef50_Q7UYW3 Cluster: Arylsulfatase B; n=1; Pirellula sp.|Rep...   132   2e-29
UniRef50_A0IXQ0 Cluster: Sulfatase; n=1; Shewanella woodyi ATCC ...   132   2e-29
UniRef50_A6CAY0 Cluster: N-acetylgalactosamine 6-sulfate sulfata...   130   1e-28
UniRef50_A3XJJ9 Cluster: Arylsulfatase B; n=1; Leeuwenhoekiella ...   130   1e-28
UniRef50_A0Z632 Cluster: Arylsulfatase B; n=1; marine gamma prot...   130   1e-28
UniRef50_A3ZLN5 Cluster: N-acetylgalactosamine 6-sulfate sulfata...   129   2e-28
UniRef50_A0JAA8 Cluster: Sulfatase precursor; n=1; Shewanella wo...   128   4e-28
UniRef50_Q8A219 Cluster: Arylsulfatase B; n=2; Bacteroides|Rep: ...   127   7e-28
UniRef50_A0Z9E1 Cluster: Sulfatase family protein; n=3; Proteoba...   127   7e-28
UniRef50_Q1VP00 Cluster: Arylsulfatase B; n=1; Psychroflexus tor...   126   1e-27
UniRef50_A6DKB8 Cluster: N-acetylgalactosamine 6-sulfatase; n=3;...   125   3e-27
UniRef50_UPI0000E4801A Cluster: PREDICTED: similar to sulfatase ...   124   6e-27
UniRef50_A4XED5 Cluster: Sulfatase precursor; n=1; Novosphingobi...   123   9e-27
UniRef50_A5ZER6 Cluster: Putative uncharacterized protein; n=1; ...   122   1e-26
UniRef50_A6LED1 Cluster: Arylsulfatase A; n=1; Parabacteroides d...   121   3e-26
UniRef50_A4GJF1 Cluster: Sulfatase; n=1; uncultured marine bacte...   121   5e-26
UniRef50_A6CAW6 Cluster: N-acetylgalactosamine-4-sulfatase; n=1;...   120   8e-26
UniRef50_A6C4L0 Cluster: N-acetylgalactosamine-6-sulfate sulfata...   120   8e-26
UniRef50_A6DKD8 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...   118   2e-25
UniRef50_A6CD52 Cluster: Twin-arginine translocation pathway sig...   118   3e-25
UniRef50_A6C4W7 Cluster: Twin-arginine translocation pathway sig...   118   4e-25
UniRef50_Q7UX97 Cluster: Arylsulfatase B [Precursor]; n=1; Pirel...   117   6e-25
UniRef50_A0HG49 Cluster: Sulfatase; n=6; Comamonadaceae|Rep: Sul...   117   6e-25
UniRef50_Q7UGB4 Cluster: N-acetylgalactosamine 6-sulfate sulfata...   117   7e-25
UniRef50_Q3M597 Cluster: Twin-arginine translocation pathway sig...   116   1e-24
UniRef50_Q15XH3 Cluster: Sulfatase precursor; n=1; Pseudoalterom...   116   1e-24
UniRef50_Q1YSH0 Cluster: Sulfatase family protein; n=4; cellular...   116   2e-24
UniRef50_Q02AN8 Cluster: Sulfatase precursor; n=1; Solibacter us...   116   2e-24
UniRef50_A6C383 Cluster: Sulfatase; n=1; Planctomyces maris DSM ...   115   2e-24
UniRef50_A6DG78 Cluster: Sulfatase; n=1; Lentisphaera araneosa H...   115   3e-24
UniRef50_A4CMB1 Cluster: Arylsulphatase A; n=3; Bacteria|Rep: Ar...   114   4e-24
UniRef50_A6DKP2 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...   113   9e-24
UniRef50_Q7UHJ9 Cluster: Iduronate-sulfatase or arylsulfatase A;...   113   1e-23
UniRef50_Q2GB51 Cluster: Sulfatase; n=2; Proteobacteria|Rep: Sul...   113   1e-23
UniRef50_A6DMV0 Cluster: N-acetylgalactosamine-6-sulfate sulfata...   112   2e-23
UniRef50_A4CGL5 Cluster: Arylsulfatase A; n=4; Bacteria|Rep: Ary...   111   4e-23
UniRef50_A3I0L2 Cluster: Arylsulfatase A; n=2; Bacteroidetes|Rep...   111   4e-23
UniRef50_A0YAF7 Cluster: Arylsulfatase A; n=1; marine gamma prot...   111   4e-23
UniRef50_A6LDP6 Cluster: Arylsulfatase A; n=1; Parabacteroides d...   110   6e-23
UniRef50_A6KZI6 Cluster: Sulfatase; n=2; Bacteroides|Rep: Sulfat...   110   6e-23
UniRef50_Q7UKJ5 Cluster: Arylsulfatase A; n=3; Bacteria|Rep: Ary...   108   3e-22
UniRef50_A6DPC8 Cluster: Arylsulfatase A; n=1; Lentisphaera aran...   108   3e-22
UniRef50_A6DSP6 Cluster: Sulfatase; n=1; Lentisphaera araneosa H...   107   5e-22
UniRef50_A5FAW4 Cluster: Sulfatase precursor; n=1; Flavobacteriu...   107   5e-22
UniRef50_A3HZ22 Cluster: Putative exported uslfatase; n=1; Algor...   107   6e-22
UniRef50_P34059 Cluster: N-acetylgalactosamine-6-sulfatase precu...   107   6e-22
UniRef50_A3ZUT0 Cluster: Arylsulphatase A; n=1; Blastopirellula ...   107   8e-22
UniRef50_A6DR20 Cluster: N-acetyl-galactosamine-6-sulfatase; n=1...   106   1e-21
UniRef50_A6DGX5 Cluster: N-acetylgalactosamine 6-sulfatase; n=2;...   106   1e-21
UniRef50_Q7UMZ5 Cluster: N-acetylgalactosamine-6-sulfate sulfata...   105   2e-21
UniRef50_A7AKS6 Cluster: Putative uncharacterized protein; n=1; ...   105   2e-21
UniRef50_A6DMX7 Cluster: N-acetyl-galactosamine-6-sulfatase; n=1...   105   2e-21
UniRef50_A6CGG6 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...   105   2e-21
UniRef50_A0Z718 Cluster: Twin-arginine translocation pathway sig...   105   2e-21
UniRef50_Q482D6 Cluster: Sulfatase family protein; n=2; Bacteria...   105   3e-21
UniRef50_Q7UPK7 Cluster: Arylsulphatase A; n=1; Pirellula sp.|Re...   104   4e-21
UniRef50_A6C3C8 Cluster: Putative uncharacterized protein; n=1; ...   104   6e-21
UniRef50_Q15XI1 Cluster: Sulfatase; n=2; Bacteria|Rep: Sulfatase...   103   7e-21
UniRef50_A6DF72 Cluster: Putative secreted sulfatase ydeN; n=1; ...   103   7e-21
UniRef50_Q47Q78 Cluster: N-acetylgalactosamine-6-sulfate sulfata...   102   2e-20
UniRef50_A6P2X1 Cluster: Putative uncharacterized protein; n=1; ...   102   2e-20
UniRef50_A6DMY9 Cluster: Putative uncharacterized protein; n=2; ...   102   2e-20
UniRef50_A6C8S3 Cluster: Arylsulphatase A; n=1; Planctomyces mar...   102   2e-20
UniRef50_A6LHS9 Cluster: Arylsulfatase; n=4; Bacteroidetes|Rep: ...   102   2e-20
UniRef50_A6C1V3 Cluster: Putative secreted sulfatase ydeN; n=1; ...   102   2e-20
UniRef50_A6DHS2 Cluster: N-acetylgalactosamine-6-sulfate sulfata...   101   3e-20
UniRef50_A6C4V9 Cluster: Sulfatase; n=1; Planctomyces maris DSM ...   101   5e-20
UniRef50_A6CBI6 Cluster: Putative uncharacterized protein; n=1; ...   100   9e-20
UniRef50_Q1GWE7 Cluster: Sulfatase precursor; n=4; Alphaproteoba...    99   1e-19
UniRef50_Q3JD43 Cluster: Sulfatase; n=1; Nitrosococcus oceani AT...   100   2e-19
UniRef50_Q0C069 Cluster: Sulfatase family protein; n=2; Bacteria...   100   2e-19
UniRef50_A6DSG6 Cluster: Arylsulfatase A; n=1; Lentisphaera aran...   100   2e-19
UniRef50_A6DNJ0 Cluster: Sulfatase; n=1; Lentisphaera araneosa H...   100   2e-19
UniRef50_A6DKM2 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...   100   2e-19
UniRef50_Q7UJ66 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    99   3e-19
UniRef50_Q7UYD6 Cluster: N-acetyl-galactosamine-6-sulfatase; n=3...    97   6e-19
UniRef50_A6LCL3 Cluster: Arylsulfatase A; n=1; Parabacteroides d...    97   8e-19
UniRef50_A4A2W0 Cluster: Arylsulfatase A; n=1; Blastopirellula m...    97   8e-19
UniRef50_Q01N83 Cluster: Sulfatase precursor; n=1; Solibacter us...    97   1e-18
UniRef50_Q7UZ43 Cluster: N-acetylgalactosamine-4-sulfatase; n=1;...    96   1e-18
UniRef50_Q7US96 Cluster: Arylsulphatase A; n=1; Pirellula sp.|Re...    96   1e-18
UniRef50_Q7URY7 Cluster: Aryl-sulphate sulphohydrolase; n=1; Pir...    96   1e-18
UniRef50_Q7ULE7 Cluster: Iduronate-sulfatase and sulfatase 1; n=...    95   3e-18
UniRef50_Q64P90 Cluster: Putative secreted sulfatase ydeN; n=2; ...    95   3e-18
UniRef50_A6LIX6 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...    95   3e-18
UniRef50_A0JAV8 Cluster: Sulfatase precursor; n=1; Shewanella wo...    95   3e-18
UniRef50_Q7UL93 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...    94   6e-18
UniRef50_A3XZF1 Cluster: Sulfatase family protein; n=5; Proteoba...    94   6e-18
UniRef50_A6DS95 Cluster: Arylsulfatase A; n=2; Lentisphaera aran...    94   8e-18
UniRef50_A3J5W3 Cluster: Putative arylsulfatase; n=1; Flavobacte...    94   8e-18
UniRef50_Q7UHK0 Cluster: Arylsulphatase A; n=1; Pirellula sp.|Re...    93   1e-17
UniRef50_A6KWS8 Cluster: Arylsulfatase; n=1; Bacteroides vulgatu...    93   1e-17
UniRef50_A6DSM5 Cluster: Arylsulfatase A; n=1; Lentisphaera aran...    93   1e-17
UniRef50_A6DGL0 Cluster: Arylsulfatase A; n=1; Lentisphaera aran...    93   1e-17
UniRef50_A6C4Q9 Cluster: Arylsulphatase A; n=1; Planctomyces mar...    93   1e-17
UniRef50_Q7URW3 Cluster: N-acetylgalactosamine-4-sulfatase; n=1;...    93   2e-17
UniRef50_Q7UGB8 Cluster: Arylsulfatase homolog b1498; n=1; Pirel...    92   2e-17
UniRef50_A6DQ01 Cluster: N-acetylgalactosamine-4-sulfatase; n=2;...    92   2e-17
UniRef50_A6C4W8 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    92   2e-17
UniRef50_Q7UWW9 Cluster: Arylsulfatase; n=2; Planctomycetaceae|R...    92   3e-17
UniRef50_Q7UTH7 Cluster: Arylsulfatase A; n=5; Bacteria|Rep: Ary...    91   6e-17
UniRef50_Q64WT3 Cluster: N-acetylgalactosamine-6-sulfatase; n=5;...    91   6e-17
UniRef50_A6DFR6 Cluster: N-acetylgalactosamine-4-sulfatase; n=1;...    91   6e-17
UniRef50_A4AAM5 Cluster: Sulfatase; n=1; Congregibacter litorali...    91   6e-17
UniRef50_A6BZT7 Cluster: Putative arylsulfatase; n=1; Planctomyc...    91   7e-17
UniRef50_Q7UX95 Cluster: Arylsulfatase; n=3; Planctomycetaceae|R...    90   1e-16
UniRef50_A6LED2 Cluster: Arylsulfatase A; n=1; Parabacteroides d...    90   1e-16
UniRef50_A4AQQ7 Cluster: N-acetylgalactosamine 6-sulfatase; n=4;...    90   1e-16
UniRef50_A6DHI0 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    89   2e-16
UniRef50_UPI00005887B4 Cluster: PREDICTED: similar to galactosam...    89   2e-16
UniRef50_Q7UG72 Cluster: Arylsulfatase A [precursor]; n=1; Pirel...    89   2e-16
UniRef50_A6PEH5 Cluster: Sulfatase precursor; n=1; Shewanella se...    89   2e-16
UniRef50_UPI00005846A1 Cluster: PREDICTED: similar to arylsulfat...    89   3e-16
UniRef50_Q7UYH3 Cluster: Arylsulfatase; n=1; Pirellula sp.|Rep: ...    89   3e-16
UniRef50_A6DGD3 Cluster: Putative exported uslfatase; n=3; Bacte...    89   3e-16
UniRef50_A6C6V5 Cluster: N-acetylgalactosamine-6-sulfatase; n=1;...    89   3e-16
UniRef50_Q7UMZ6 Cluster: Arylsulfatase A; n=1; Pirellula sp.|Rep...    88   5e-16
UniRef50_Q7UER7 Cluster: Sulfatase 1; n=6; Bacteria|Rep: Sulfata...    88   5e-16
UniRef50_A6DSG4 Cluster: Arylsulphatase A; n=1; Lentisphaera ara...    88   5e-16
UniRef50_A6DID9 Cluster: Putative sulfatase protein; n=1; Lentis...    88   5e-16
UniRef50_A6CAR8 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    87   7e-16
UniRef50_A3I2G9 Cluster: Putative secreted sulfatase; n=1; Algor...    87   7e-16
UniRef50_A6KZ75 Cluster: Putative secreted sulfatase; n=1; Bacte...    87   9e-16
UniRef50_Q9NJU7 Cluster: Sulfatase 2; n=1; Helix pomatia|Rep: Su...    87   9e-16
UniRef50_Q7UPG6 Cluster: Arylsulphatase A; n=2; Bacteria|Rep: Ar...    87   1e-15
UniRef50_Q8A222 Cluster: N-acetylgalactosamine-6-sulfatase; n=1;...    86   2e-15
UniRef50_Q7UQ05 Cluster: Arylsulfatase A; n=2; Planctomycetaceae...    86   2e-15
UniRef50_A6DTN4 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    86   2e-15
UniRef50_A6DSF1 Cluster: N-acetyl-galactosamine-6-sulfatase; n=1...    86   2e-15
UniRef50_A6DHI1 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    86   2e-15
UniRef50_Q5DYT4 Cluster: Arylsulfatase; n=10; Gammaproteobacteri...    85   3e-15
UniRef50_A6DKN7 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...    85   3e-15
UniRef50_A4GIB1 Cluster: Arylsulfatase; n=1; uncultured marine b...    85   4e-15
UniRef50_A0JAV3 Cluster: Sulfatase precursor; n=1; Shewanella wo...    85   4e-15
UniRef50_A6CA27 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...    85   5e-15
UniRef50_A6C8R8 Cluster: Arylsulfatase A; n=1; Planctomyces mari...    85   5e-15
UniRef50_A6CEC4 Cluster: Aryl-sulphate sulphohydrolase; n=1; Pla...    84   6e-15
UniRef50_A4B5Y4 Cluster: Iduronate-sulfatase and sulfatase 1; n=...    84   6e-15
UniRef50_Q0BZE9 Cluster: Sulfatase family protein; n=1; Hyphomon...    83   1e-14
UniRef50_Q4RJR3 Cluster: Chromosome 13 SCAF15035, whole genome s...    83   1e-14
UniRef50_A6DM48 Cluster: Arylsulfatase A; n=1; Lentisphaera aran...    83   1e-14
UniRef50_A6DG54 Cluster: Arylsulphatase A; n=1; Lentisphaera ara...    83   1e-14
UniRef50_A6DHY0 Cluster: N-acetylgalactosamine 6-sulfatase; n=2;...    83   2e-14
UniRef50_A4W906 Cluster: Sulfatase precursor; n=10; Enterobacter...    83   2e-14
UniRef50_Q7UYW2 Cluster: Arylsulfatase; n=2; Planctomycetaceae|R...    82   3e-14
UniRef50_A6DMX9 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    82   3e-14
UniRef50_A6DMX6 Cluster: Arylsulphatase A; n=1; Lentisphaera ara...    82   3e-14
UniRef50_P77318 Cluster: Uncharacterized sulfatase ydeN precurso...    82   3e-14
UniRef50_A6DFN4 Cluster: Arylsulfatase; n=1; Lentisphaera araneo...    82   3e-14
UniRef50_Q7UYA5 Cluster: Arylsulfatase; n=1; Pirellula sp.|Rep: ...    81   6e-14
UniRef50_Q7UN55 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    81   8e-14
UniRef50_A6C176 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    80   1e-13
UniRef50_Q1MJX8 Cluster: Putative arylsulfatase precursor; n=1; ...    80   1e-13
UniRef50_A6DQW6 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    80   1e-13
UniRef50_Q7UYA9 Cluster: N-acetylgalactosamine-6-sulfatase; n=1;...    79   2e-13
UniRef50_A4AVA7 Cluster: Aryl-sulphate sulphohydrolase; n=2; Bac...    79   2e-13
UniRef50_A6DHI2 Cluster: Aryl-sulphate sulphohydrolase; n=2; Len...    79   2e-13
UniRef50_A4ANR8 Cluster: Arylsulfatase; n=2; Bacteroidetes|Rep: ...    79   2e-13
UniRef50_A3ZY29 Cluster: Aryl-sulphate sulphohydrolase; n=1; Bla...    79   2e-13
UniRef50_A3XSU6 Cluster: Sulfatase family protein; n=2; Vibrio|R...    79   2e-13
UniRef50_Q8A171 Cluster: Putative secreted sulfatase ydeN; n=10;...    79   3e-13
UniRef50_Q15XP0 Cluster: Sulfatase precursor; n=1; Pseudoalterom...    79   3e-13
UniRef50_A6DMW2 Cluster: Putative exported uslfatase; n=1; Lenti...    79   3e-13
UniRef50_P25549 Cluster: Arylsulfatase precursor; n=12; Proteoba...    79   3e-13
UniRef50_Q7UYD2 Cluster: Sulfatase 1; n=2; Bacteria|Rep: Sulfata...    78   4e-13
UniRef50_A6DNI1 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...    78   4e-13
UniRef50_A6DMU3 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...    78   6e-13
UniRef50_UPI0000586CBD Cluster: PREDICTED: similar to MGC86251 p...    77   7e-13
UniRef50_Q8D7K3 Cluster: Arylsulfatase A; n=16; Bacteria|Rep: Ar...    77   7e-13
UniRef50_A0IXQ1 Cluster: Sulfatase precursor; n=1; Shewanella wo...    77   7e-13
UniRef50_Q7UYA6 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    77   1e-12
UniRef50_A6DU75 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    77   1e-12
UniRef50_A6DI17 Cluster: N-acetylgalactosamine-4-sulfatase; n=1;...    77   1e-12
UniRef50_A3ZLD4 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...    77   1e-12
UniRef50_Q7UIU1 Cluster: Arylsulfatase A; n=1; Pirellula sp.|Rep...    77   1e-12
UniRef50_A6BYR0 Cluster: N-acetyl-galactosamine-6-sulfatase; n=1...    77   1e-12
UniRef50_P50473 Cluster: Arylsulfatase precursor; n=7; Echinoida...    77   1e-12
UniRef50_UPI0000E0F7DD Cluster: aryl-sulphate sulphohydrolase; n...    76   2e-12
UniRef50_UPI00006A2B15 Cluster: UPI00006A2B15 related cluster; n...    76   2e-12
UniRef50_Q4SI19 Cluster: Chromosome 5 SCAF14581, whole genome sh...    76   2e-12
UniRef50_A6DHW4 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...    76   2e-12
UniRef50_A6DG52 Cluster: Arylsulphatase A; n=1; Lentisphaera ara...    76   2e-12
UniRef50_A3ZVD1 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    76   2e-12
UniRef50_Q15SA2 Cluster: Sulfatase precursor; n=1; Pseudoalterom...    75   3e-12
UniRef50_A6DID7 Cluster: Aryl-sulphate sulphohydrolase; n=2; Len...    75   4e-12
UniRef50_A6C430 Cluster: Arylsulphatase A; n=1; Planctomyces mar...    75   4e-12
UniRef50_A6C284 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...    75   4e-12
UniRef50_A3ZWK4 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...    75   4e-12
UniRef50_A3HYT7 Cluster: Arylsulphatase A; n=1; Algoriphagus sp....    75   4e-12
UniRef50_A6LEC5 Cluster: Arylsulfatase A; n=1; Parabacteroides d...    75   5e-12
UniRef50_A6DJ11 Cluster: Arylsulfatase A; n=1; Lentisphaera aran...    75   5e-12
UniRef50_Q7UL40 Cluster: Arylsulfatase A; n=1; Pirellula sp.|Rep...    74   7e-12
UniRef50_Q4BZ10 Cluster: Similar to Arylsulfatase A and related ...    74   7e-12
UniRef50_A6DM53 Cluster: Arylsulfatase; n=1; Lentisphaera araneo...    74   7e-12
UniRef50_A6DRX0 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    74   9e-12
UniRef50_A6DJ15 Cluster: Putative arylsulfatase; n=2; Lentisphae...    74   9e-12
UniRef50_A6CBM1 Cluster: Arylsulphatase A; n=1; Planctomyces mar...    73   1e-11
UniRef50_UPI0000E1104B Cluster: N-acetylgalactosamine 6-sulfate ...    73   2e-11
UniRef50_A6DNI9 Cluster: N-acetyl-galactosamine-6-sulfatase; n=1...    73   2e-11
UniRef50_A4A218 Cluster: Arylsulfatase A; n=1; Blastopirellula m...    73   2e-11
UniRef50_Q89L10 Cluster: Bll4738 protein; n=6; Proteobacteria|Re...    72   3e-11
UniRef50_A6EGE7 Cluster: N-acetylgalactosamine-6-sulfatase; n=3;...    72   3e-11
UniRef50_A6DJL2 Cluster: Putative exported uslfatase; n=1; Lenti...    72   3e-11
UniRef50_A6DJE5 Cluster: Sulfatase 1; n=1; Lentisphaera araneosa...    72   3e-11
UniRef50_A6DIC6 Cluster: Arylsulfatase; n=1; Lentisphaera araneo...    72   3e-11
UniRef50_A6C4Q6 Cluster: Arylsulfatase; n=1; Planctomyces maris ...    72   3e-11
UniRef50_A4AR92 Cluster: N-acetylgalactosamine-6-sulfatase; n=1;...    72   3e-11
UniRef50_Q612A1 Cluster: Putative uncharacterized protein CBG168...    72   3e-11
UniRef50_Q89L07 Cluster: Bll4741 protein; n=4; Bacteria|Rep: Bll...    72   4e-11
UniRef50_A6DUI7 Cluster: Putative exported uslfatase; n=1; Lenti...    72   4e-11
UniRef50_A6DJF1 Cluster: Sulfatase 1; n=1; Lentisphaera araneosa...    72   4e-11
UniRef50_A6DMY7 Cluster: Iduronate-sulfatase and sulfatase 1; n=...    71   5e-11
UniRef50_Q7UH63 Cluster: Arylsulphatase A; n=3; Bacteria|Rep: Ar...    71   6e-11
UniRef50_A4CK82 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...    71   6e-11
UniRef50_A0Z6R0 Cluster: Putative arylsulfatase; n=1; marine gam...    71   6e-11
UniRef50_A6DPC9 Cluster: Arylsulphatase A; n=1; Lentisphaera ara...    71   8e-11
UniRef50_A6DIG7 Cluster: Iduronate-sulfatase or arylsulfatase A;...    71   8e-11
UniRef50_Q98BQ3 Cluster: Arylsulfatase; n=77; cellular organisms...    70   1e-10
UniRef50_A6DMW5 Cluster: Iduronate-sulfatase and sulfatase 1; n=...    70   1e-10
UniRef50_A6DJI7 Cluster: Sulfatase 1; n=2; Lentisphaera araneosa...    70   1e-10
UniRef50_Q7UHJ6 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    70   1e-10
UniRef50_Q64MS8 Cluster: Arylsulfatase; n=7; Bacteria|Rep: Aryls...    69   2e-10
UniRef50_A6DI94 Cluster: Arylsulfatase A; n=1; Lentisphaera aran...    69   2e-10
UniRef50_A6DI30 Cluster: N-acetylgalactosamine-6-sulfatase; n=1;...    69   2e-10
UniRef50_Q7UH85 Cluster: N-acetylgalactosamine-6-sulfatase; n=1;...    69   3e-10
UniRef50_A6DNW5 Cluster: Arylsulfatase; n=1; Lentisphaera araneo...    69   3e-10
UniRef50_A0J9Y8 Cluster: Sulfatase precursor; n=1; Shewanella wo...    69   3e-10
UniRef50_Q7UUG3 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...    69   3e-10
UniRef50_Q7UM38 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...    69   3e-10
UniRef50_A6DMW1 Cluster: N-acetyl-galactosamine-6-sulfatase; n=1...    69   3e-10
UniRef50_A6DI98 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    69   3e-10
UniRef50_A6DTP6 Cluster: Arylsulfatase; n=1; Lentisphaera araneo...    68   6e-10
UniRef50_A6DKM6 Cluster: Arylsulfatase A; n=1; Lentisphaera aran...    68   6e-10
UniRef50_A3XZ25 Cluster: Arylsulfatase A; n=2; Vibrionaceae|Rep:...    68   6e-10
UniRef50_UPI0000E4A9B1 Cluster: PREDICTED: similar to MGC86251 p...    67   8e-10
UniRef50_A6C1Q0 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    67   8e-10
UniRef50_A6DNY9 Cluster: Arylsulphatase A; n=3; Lentisphaera ara...    67   1e-09
UniRef50_Q7UVD9 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    66   1e-09
UniRef50_A6DPE4 Cluster: Arylsulfatase; n=1; Lentisphaera araneo...    66   1e-09
UniRef50_Q8A348 Cluster: Arylsulfatase; n=3; Bacteroides|Rep: Ar...    66   2e-09
UniRef50_Q7UH46 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...    66   2e-09
UniRef50_Q1VDY3 Cluster: Probable sulfatase; n=1; Vibrio alginol...    66   2e-09
UniRef50_P15289 Cluster: Arylsulfatase A precursor (EC 3.1.6.8) ...    66   2e-09
UniRef50_Q8A362 Cluster: Arylsulfatase; n=1; Bacteroides thetaio...    66   2e-09
UniRef50_A6DPE1 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    65   3e-09
UniRef50_A6DFS2 Cluster: N-acetylgalactosamine-6-sulfatase; n=1;...    65   3e-09
UniRef50_A6CEG5 Cluster: Arylsulphatase A; n=2; Bacteria|Rep: Ar...    65   3e-09
UniRef50_A6BYP9 Cluster: Arylsulphatase A; n=1; Planctomyces mar...    65   3e-09
UniRef50_Q7ULF9 Cluster: Arylsulfatase; n=2; Bacteria|Rep: Aryls...    65   4e-09
UniRef50_A6DF77 Cluster: Arylsulphatase A; n=2; Lentisphaera ara...    65   4e-09
UniRef50_Q5AJI4 Cluster: Potential arylsulfatase; n=5; Saccharom...    65   4e-09
UniRef50_A6DLW9 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    64   6e-09
UniRef50_A6DJJ7 Cluster: Arylsulfatase; n=1; Lentisphaera araneo...    64   6e-09
UniRef50_A6DJ57 Cluster: Arylsulphatase A; n=2; Lentisphaera ara...    64   6e-09
UniRef50_A6CB33 Cluster: Arylsulfatase; n=1; Planctomyces maris ...    64   6e-09
UniRef50_Q64YV7 Cluster: Arylsulfatase; n=4; Bacteroides fragili...    64   7e-09
UniRef50_Q64R82 Cluster: N-acetylgalactosamine-6-sulfatase; n=8;...    64   7e-09
UniRef50_A6DR28 Cluster: Arylsulphatase A; n=2; Lentisphaera ara...    64   7e-09
UniRef50_UPI00015A6252 Cluster: Arylsulfatase E precursor (EC 3....    64   1e-08
UniRef50_Q15YX5 Cluster: Sulfatase; n=1; Pseudoalteromonas atlan...    64   1e-08
UniRef50_A7LZ49 Cluster: Putative uncharacterized protein; n=1; ...    64   1e-08
UniRef50_A3HUP5 Cluster: Arylsulfatase; n=1; Algoriphagus sp. PR...    64   1e-08
UniRef50_Q7UX23 Cluster: Arylsulfatase; n=1; Pirellula sp.|Rep: ...    63   1e-08
UniRef50_Q7UUA9 Cluster: N-acetylgalactosamine 6-sulfatase; n=2;...    63   1e-08
UniRef50_A2TWL0 Cluster: N-acetylgalactosamine 6-sulfatase; n=2;...    63   1e-08
UniRef50_Q7UTJ1 Cluster: Aryl-sulphate sulphohydrolase; n=1; Pir...    63   2e-08
UniRef50_Q15XN4 Cluster: Sulfatase precursor; n=1; Pseudoalterom...    63   2e-08
UniRef50_A6DR18 Cluster: Arylsulfatase; n=1; Lentisphaera araneo...    63   2e-08
UniRef50_Q89K44 Cluster: ArsA protein; n=4; Rhizobiales|Rep: Ars...    62   2e-08
UniRef50_Q7UXA2 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    62   2e-08
UniRef50_A4GIB2 Cluster: Putative secreted sulfatase; n=1; uncul...    62   2e-08
UniRef50_A6DR15 Cluster: Arylsulfatase; n=2; Lentisphaera araneo...    62   4e-08
UniRef50_A6CZV9 Cluster: Putative arylsulfatase; n=1; Vibrio shi...    62   4e-08
UniRef50_A5FF56 Cluster: Sulfatase precursor; n=2; Bacteria|Rep:...    62   4e-08
UniRef50_A6DJ49 Cluster: Arylsulphatase A; n=1; Lentisphaera ara...    61   5e-08
UniRef50_A5NY74 Cluster: Sulfatase precursor; n=11; Bacteria|Rep...    61   5e-08
UniRef50_A3ZV95 Cluster: N-acetylgalactosamine 6-sulfatase; n=3;...    61   5e-08
UniRef50_Q7UNI8 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    61   7e-08
UniRef50_A6CA66 Cluster: N-acetylgalactosamine 6-sulfatase; n=3;...    61   7e-08
UniRef50_A7LY79 Cluster: Putative uncharacterized protein; n=1; ...    60   9e-08
UniRef50_A3I0S5 Cluster: Putative sulfatase yidJ; n=1; Algoripha...    60   9e-08
UniRef50_UPI0000588E05 Cluster: PREDICTED: similar to steroid su...    60   1e-07
UniRef50_Q7UJQ8 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    60   1e-07
UniRef50_Q02B50 Cluster: Sulfatase precursor; n=1; Solibacter us...    60   1e-07
UniRef50_A6DJU1 Cluster: Sulfatase 1; n=1; Lentisphaera araneosa...    60   1e-07
UniRef50_A1FH14 Cluster: Sulfatase precursor; n=4; Pseudomonas p...    60   1e-07
UniRef50_Q0UZB2 Cluster: Putative uncharacterized protein; n=2; ...    60   1e-07
UniRef50_Q15US6 Cluster: Sulfatase precursor; n=3; Alteromonadal...    60   2e-07
UniRef50_A6DJ37 Cluster: Arylsulphatase A; n=1; Lentisphaera ara...    60   2e-07
UniRef50_A6DF76 Cluster: Arylsulfatase A; n=1; Lentisphaera aran...    60   2e-07
UniRef50_A0PKV5 Cluster: Arylsulfatase, AslA; n=1; Mycobacterium...    60   2e-07
UniRef50_P51691 Cluster: Arylsulfatase; n=14; cellular organisms...    60   2e-07
UniRef50_Q96EG1 Cluster: Arylsulfatase G precursor; n=20; Eutele...    60   2e-07
UniRef50_A6DR29 Cluster: N-acetylgalactosamine-6-sulfatase; n=2;...    59   2e-07
UniRef50_A0X0X5 Cluster: Sulfatase precursor; n=1; Shewanella pe...    59   2e-07
UniRef50_A6DJ33 Cluster: Arylsulphatase A; n=1; Lentisphaera ara...    59   3e-07
UniRef50_Q8TMK9 Cluster: Arylsulfatase; n=12; cellular organisms...    59   3e-07
UniRef50_Q32KK0 Cluster: Arylsulfatase E; n=1; Rattus norvegicus...    58   4e-07
UniRef50_Q7UXA8 Cluster: N-acetylgalactosamine-6-sulfate sulfata...    58   4e-07
UniRef50_Q7ULY7 Cluster: Arylsulphatase A; n=1; Pirellula sp.|Re...    58   4e-07
UniRef50_Q1YR77 Cluster: N-acetylgalactosamine 6-sulfatase; n=2;...    58   4e-07
UniRef50_A6DRV5 Cluster: Arylsulfatase A; n=1; Lentisphaera aran...    58   4e-07
UniRef50_A6DJD5 Cluster: Sulfatase 1; n=1; Lentisphaera araneosa...    58   4e-07
UniRef50_Q9CKE0 Cluster: Putative uncharacterized protein PM1682...    58   5e-07
UniRef50_Q7UJR3 Cluster: Arylsulfatase; n=2; Bacteria|Rep: Aryls...    58   5e-07
UniRef50_A6DMX8 Cluster: Iduronate-sulfatase or arylsulfatase A;...    58   5e-07
UniRef50_A6DJJ6 Cluster: Sulfatase 1; n=1; Lentisphaera araneosa...    58   5e-07
UniRef50_A6DGK3 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    58   5e-07
UniRef50_Q7UYE0 Cluster: Arylsulfatase; n=1; Pirellula sp.|Rep: ...    58   6e-07
UniRef50_A6V872 Cluster: Arylsulfatase; n=1; Pseudomonas aerugin...    58   6e-07
UniRef50_A0Q2E3 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    58   6e-07
UniRef50_Q8A168 Cluster: Putative sulfatase yidJ; n=5; Bacteroid...    57   8e-07
UniRef50_Q7UYH4 Cluster: Arylsulfatase; n=1; Pirellula sp.|Rep: ...    57   8e-07
UniRef50_Q650Q8 Cluster: Arylsulfatase; n=5; Bacteria|Rep: Aryls...    57   8e-07
UniRef50_A6CEL4 Cluster: Arylsulfatase A; n=1; Planctomyces mari...    57   8e-07
UniRef50_A6C8S0 Cluster: Arylsulphatase A; n=1; Planctomyces mar...    57   8e-07
UniRef50_Q0SBH5 Cluster: Arylsulfatase; n=1; Rhodococcus sp. RHA...    57   1e-06
UniRef50_A6DIE0 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...    57   1e-06
UniRef50_A6BZV9 Cluster: Arylsulfatase; n=3; Bacteria|Rep: Aryls...    57   1e-06
UniRef50_A4AWR5 Cluster: Arylsulphatase A; n=1; Flavobacteriales...    57   1e-06
UniRef50_A6DG53 Cluster: Arylsulfatase A; n=1; Lentisphaera aran...    56   1e-06
UniRef50_Q7US20 Cluster: Arylsulphatase A; n=1; Pirellula sp.|Re...    56   2e-06
UniRef50_Q01RE9 Cluster: Sulfatase precursor; n=4; Bacteria|Rep:...    56   2e-06
UniRef50_A3ZMT9 Cluster: Arylsulfatase; n=2; Planctomycetaceae|R...    56   2e-06
UniRef50_Q7UH28 Cluster: Mucin-desulfating sulfatase; n=2; Bacte...    56   3e-06
UniRef50_Q15XN1 Cluster: Sulfatase precursor; n=1; Pseudoalterom...    56   3e-06
UniRef50_A6DS43 Cluster: Arylsulfatase A; n=1; Lentisphaera aran...    56   3e-06
UniRef50_A6DI18 Cluster: Arylsulfatase A; n=2; Lentisphaera aran...    56   3e-06
UniRef50_A5FAW6 Cluster: Sulfatase precursor; n=1; Flavobacteriu...    56   3e-06
UniRef50_A7SK50 Cluster: Predicted protein; n=1; Nematostella ve...    56   3e-06
UniRef50_Q8A349 Cluster: Arylsulfatase; n=1; Bacteroides thetaio...    55   3e-06
UniRef50_A6DM25 Cluster: Sulfatase 1; n=1; Lentisphaera araneosa...    55   3e-06
UniRef50_A4ASX5 Cluster: Mucin-desulfating sulfatase; n=1; Flavo...    55   3e-06
UniRef50_UPI000065CD18 Cluster: Arylsulfatase G precursor (EC 3....    55   4e-06
UniRef50_A6DI59 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...    55   4e-06
UniRef50_Q4WVQ5 Cluster: Arylsulfatase, putative; n=13; Pezizomy...    55   4e-06
UniRef50_Q7UGA0 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    54   6e-06
UniRef50_A6DU78 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    54   6e-06
UniRef50_A6DRW5 Cluster: Putative sulfatase; n=2; Lentisphaera a...    54   6e-06
UniRef50_A6DLY1 Cluster: Putative sulfatase; n=1; Lentisphaera a...    54   6e-06
UniRef50_Q93P97 Cluster: MS134, putative arylsulfatase; n=1; Mic...    54   8e-06
UniRef50_Q1YUH3 Cluster: Arylsulfatase; n=1; gamma proteobacteri...    54   8e-06
UniRef50_Q9X759 Cluster: Arylsulfatase precursor; n=8; Enterobac...    54   8e-06
UniRef50_Q7NMX5 Cluster: Gll0640 protein; n=1; Gloeobacter viola...    54   1e-05
UniRef50_Q15XJ0 Cluster: Sulfatase; n=1; Pseudoalteromonas atlan...    54   1e-05
UniRef50_A7LZQ6 Cluster: Putative uncharacterized protein; n=1; ...    54   1e-05
UniRef50_A6C2T4 Cluster: Sulfatase; n=1; Planctomyces maris DSM ...    54   1e-05
UniRef50_Q9L5W0 Cluster: Mucin-desulfating sulfatase MdsA precur...    53   1e-05
UniRef50_Q6XUN3 Cluster: Arylsulfatase; n=1; Pseudomonas sp. ND6...    53   1e-05
UniRef50_Q1YP24 Cluster: Arylsulfatase A; n=1; gamma proteobacte...    53   1e-05
UniRef50_A3HT92 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...    53   1e-05
UniRef50_UPI0000E47BCC Cluster: PREDICTED: similar to arylsulfat...    53   2e-05
UniRef50_UPI0000ECD579 Cluster: UPI0000ECD579 related cluster; n...    53   2e-05
UniRef50_A6DKC6 Cluster: Sulfatase; n=1; Lentisphaera araneosa H...    53   2e-05
UniRef50_A3ZSK1 Cluster: Arylsulphatase A; n=1; Blastopirellula ...    53   2e-05
UniRef50_Q7UGI8 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;...    52   2e-05
UniRef50_Q15SD1 Cluster: Sulfatase precursor; n=1; Pseudoalterom...    52   2e-05
UniRef50_A6DHS3 Cluster: Arylsulfatase A; n=1; Lentisphaera aran...    52   2e-05
UniRef50_A6DGL5 Cluster: N-acetylgalactosamine 6-sulfate sulfata...    52   2e-05
UniRef50_A6DG39 Cluster: Arylsulfatase; n=1; Lentisphaera araneo...    52   2e-05
UniRef50_Q8A221 Cluster: Arylsulfatase; n=6; Bacteroidetes|Rep: ...    52   3e-05
UniRef50_A3UPZ2 Cluster: Arylsulfatase; n=2; Vibrio|Rep: Arylsul...    52   3e-05
UniRef50_UPI000023D942 Cluster: hypothetical protein FG08053.1; ...    52   4e-05
UniRef50_Q8FTJ9 Cluster: Putative arylsulfatase; n=1; Corynebact...    52   4e-05
UniRef50_Q15NY5 Cluster: Sulfatase precursor; n=1; Pseudoalterom...    52   4e-05
UniRef50_A6C9F6 Cluster: Iduronate-2-sulfatase; n=1; Planctomyce...    52   4e-05
UniRef50_A3HSW7 Cluster: Arylsulfatase A; n=1; Algoriphagus sp. ...    52   4e-05
UniRef50_P51690 Cluster: Arylsulfatase E precursor; n=7; Mammali...    52   4e-05
UniRef50_A6QA55 Cluster: Arylsulfatase; n=5; Proteobacteria|Rep:...    51   6e-05
UniRef50_A6EGE6 Cluster: Sulfatase; n=1; Pedobacter sp. BAL39|Re...    51   6e-05
UniRef50_A6DLD9 Cluster: Sulfatase; n=1; Lentisphaera araneosa H...    51   6e-05
UniRef50_A2SJ95 Cluster: Arylsulfatase; n=1; Methylibium petrole...    51   6e-05
UniRef50_Q32KI0 Cluster: Arylsulfatase F; n=2; Canis lupus famil...    51   6e-05
UniRef50_A6DKC5 Cluster: Putative sulfatase yidj; n=1; Lentispha...    51   7e-05
UniRef50_UPI0000E4880B Cluster: PREDICTED: similar to RE14504p, ...    50   1e-04
UniRef50_Q7UH86 Cluster: Arylsulfatase A; n=3; Bacteria|Rep: Ary...    50   1e-04
UniRef50_A6DFU7 Cluster: Mucin-desulfating sulfatase; n=1; Lenti...    50   1e-04
UniRef50_A0LYA0 Cluster: Sulfatase; n=3; Bacteria|Rep: Sulfatase...    50   1e-04
UniRef50_P51689 Cluster: Arylsulfatase D precursor; n=55; Eutele...    50   1e-04
UniRef50_UPI0000E484C0 Cluster: PREDICTED: similar to arylsulfat...    50   1e-04
UniRef50_A6DSF3 Cluster: Putative uncharacterized protein; n=1; ...    50   1e-04
UniRef50_A6DG55 Cluster: Arylsulfatase; n=1; Lentisphaera araneo...    50   1e-04
UniRef50_A6CGJ8 Cluster: Arylsulfatase A; n=1; Planctomyces mari...    50   1e-04
UniRef50_A6DQC0 Cluster: Mucin-desulfating sulfatase; n=1; Lenti...    50   2e-04
UniRef50_A6DJ52 Cluster: Arylsulfatase; n=1; Lentisphaera araneo...    50   2e-04
UniRef50_A4GIA7 Cluster: Iduronate sulfatase; n=1; uncultured ma...    50   2e-04
UniRef50_A3M3B5 Cluster: Arylsulfatase; n=1; Acinetobacter bauma...    50   2e-04
UniRef50_Q1YQ29 Cluster: Arylsulfatase; n=1; gamma proteobacteri...    49   2e-04
UniRef50_Q7UMT6 Cluster: Mucin-desulfating sulfatase; n=2; Bacte...    49   3e-04
UniRef50_A6DG34 Cluster: Choline sulfatase; n=1; Lentisphaera ar...    49   3e-04
UniRef50_A3HV62 Cluster: Arylsulfatase; n=1; Algoriphagus sp. PR...    49   3e-04
UniRef50_Q7UYS6 Cluster: Arylsulfatase A; n=3; Bacteria|Rep: Ary...    48   4e-04
UniRef50_A6UB68 Cluster: Sulfatase; n=1; Sinorhizobium medicae W...    48   4e-04
UniRef50_A6DJ74 Cluster: Arylsulfatase A; n=1; Lentisphaera aran...    48   4e-04
UniRef50_A6C781 Cluster: Putative sulfatase; n=1; Planctomyces m...    48   4e-04
UniRef50_A5VAS7 Cluster: Sulfatase precursor; n=3; Proteobacteri...    48   4e-04
UniRef50_A4A0M2 Cluster: Heparan N-sulfatase; n=1; Blastopirellu...    48   4e-04
UniRef50_Q4RQR4 Cluster: Chromosome 2 SCAF15004, whole genome sh...    48   5e-04
UniRef50_Q7UNN1 Cluster: Arylsulphatase A; n=3; Bacteria|Rep: Ar...    48   5e-04
UniRef50_A6DP41 Cluster: Arylsulfatase A; n=1; Lentisphaera aran...    48   5e-04
UniRef50_Q89YS5 Cluster: N-acetylglucosamine-6-sulfatase; n=2; B...    48   7e-04
UniRef50_A6DJ72 Cluster: Mucin-desulfating sulfatase; n=1; Lenti...    48   7e-04
UniRef50_A0YAK5 Cluster: Sulfatase; n=3; unclassified Gammaprote...    48   7e-04
UniRef50_UPI000065DE05 Cluster: Arylsulfatase E precursor (EC 3....    47   9e-04
UniRef50_A6U8K1 Cluster: Sulfatase; n=4; cellular organisms|Rep:...    47   9e-04
UniRef50_Q7UFA5 Cluster: Putative sulfatase yidj; n=1; Pirellula...    47   0.001
UniRef50_A6DKP1 Cluster: Arylsulphatase A; n=1; Lentisphaera ara...    47   0.001
UniRef50_A4AP83 Cluster: Putative sulfatase; n=1; Flavobacterial...    47   0.001
UniRef50_A0Z7Y7 Cluster: Arylsulfatase; n=1; marine gamma proteo...    47   0.001
UniRef50_P08842 Cluster: Steryl-sulfatase precursor; n=28; Eutel...    47   0.001
UniRef50_UPI0000E47F5E Cluster: PREDICTED: similar to arylsulfat...    46   0.002
UniRef50_A6M2E5 Cluster: Sulfatase; n=4; Clostridium|Rep: Sulfat...    46   0.002
UniRef50_A6EGE8 Cluster: Heparan N-sulfatase; n=1; Pedobacter sp...    46   0.002
UniRef50_Q7UMT5 Cluster: Probable sulfatase atsG; n=2; Planctomy...    46   0.002
UniRef50_A6DLR4 Cluster: Probable sulfatase atsG; n=1; Lentispha...    46   0.002
UniRef50_A6DHY1 Cluster: Mucin-desulfating sulfatase; n=1; Lenti...    46   0.002
UniRef50_UPI0000E0E27F Cluster: probable sulfatase atsG; n=1; al...    46   0.003
UniRef50_P95059 Cluster: POSSIBLE ARYLSULFATASE ATSA; n=21; Acti...    46   0.003
UniRef50_A7LZQ4 Cluster: Putative uncharacterized protein; n=1; ...    46   0.003
UniRef50_A7BT68 Cluster: Arylsulfatase; n=1; Beggiatoa sp. PS|Re...    46   0.003
UniRef50_A3ZTV8 Cluster: Mucin-desulfating sulfatase; n=1; Blast...    46   0.003
UniRef50_Q650K5 Cluster: Choline-sulfatase; n=7; Bacteroidales|R...    45   0.004
UniRef50_Q0HVG5 Cluster: Sulfatase precursor; n=7; Bacteria|Rep:...    45   0.004
UniRef50_A6DIZ7 Cluster: Arylsulfatase; n=1; Lentisphaera araneo...    45   0.004
UniRef50_A7SK49 Cluster: Predicted protein; n=2; Nematostella ve...    45   0.004
UniRef50_Q4WBJ6 Cluster: Arylsulfatase, putative; n=4; Pezizomyc...    45   0.004
UniRef50_Q061A4 Cluster: Putative sulfatase; n=1; Synechococcus ...    45   0.005
UniRef50_A4AM21 Cluster: Arylsulfatase A; n=2; Bacteroidetes|Rep...    45   0.005
UniRef50_Q4RYA1 Cluster: Chromosome 3 SCAF14978, whole genome sh...    44   0.006
UniRef50_Q392C1 Cluster: Sulfatase; n=11; Burkholderiaceae|Rep: ...    44   0.006
UniRef50_A7A9X1 Cluster: Putative uncharacterized protein; n=1; ...    44   0.006
UniRef50_A6DNI8 Cluster: Putative N-acetylglucosamine-6-sulfatas...    44   0.006
UniRef50_A6DGT7 Cluster: Sulfatase family protein; n=1; Lentisph...    44   0.006
UniRef50_UPI0000EBF0AD Cluster: PREDICTED: similar to arylsulfat...    44   0.008
UniRef50_Q482B9 Cluster: Sulfatase family protein; n=1; Colwelli...    44   0.008
UniRef50_Q028N3 Cluster: Sulfatase; n=1; Solibacter usitatus Ell...    44   0.008
UniRef50_A6DSG8 Cluster: Iduronate sulfatase; n=1; Lentisphaera ...    44   0.008
UniRef50_UPI0001555E0A Cluster: PREDICTED: similar to arylsulfat...    44   0.011
UniRef50_Q7UYC5 Cluster: N-acetyl-galactosamine-6-sulfatase; n=2...    44   0.011
UniRef50_Q1YSK8 Cluster: Mucin-desulfating sulfatase; n=1; gamma...    44   0.011
UniRef50_A6DHI3 Cluster: Probable sulfatase atsG; n=1; Lentispha...    44   0.011
UniRef50_A6DG59 Cluster: Arylsulfatase; n=1; Lentisphaera araneo...    44   0.011
UniRef50_Q2U8N6 Cluster: Sulfatases; n=1; Aspergillus oryzae|Rep...    44   0.011
UniRef50_Q7D5R3 Cluster: Sulfatase family protein; n=10; Mycobac...    43   0.015
UniRef50_Q4C1V0 Cluster: Similar to Arylsulfatase A and related ...    43   0.015
UniRef50_A6E7U2 Cluster: Putative exported sulfatase; n=1; Pedob...    43   0.015
UniRef50_A6CAZ0 Cluster: Probable sulfatase atsG; n=1; Planctomy...    43   0.015
UniRef50_A3VUB6 Cluster: Sulfatase; n=1; Parvularcula bermudensi...    43   0.015
UniRef50_Q18924 Cluster: Sulfatase domain protein protein 2; n=2...    43   0.015
UniRef50_Q2U5H2 Cluster: Sulfatases; n=9; Pezizomycotina|Rep: Su...    43   0.015
UniRef50_Q7UY39 Cluster: Similar to sulfatase 1; n=1; Pirellula ...    43   0.019
UniRef50_Q5LNC6 Cluster: Arylsulfatase; n=1; Silicibacter pomero...    43   0.019
UniRef50_Q2CEI6 Cluster: Putative choline-sulfatase; n=1; Oceani...    43   0.019
UniRef50_A7HUP5 Cluster: Sulfatase precursor; n=2; Alphaproteoba...    43   0.019
UniRef50_A6DPD1 Cluster: Probable sulfatase atsG; n=1; Lentispha...    43   0.019
UniRef50_A6DJ46 Cluster: Arylsulfatase; n=1; Lentisphaera araneo...    43   0.019
UniRef50_A6DFG6 Cluster: Arylsulfatase; n=1; Lentisphaera araneo...    43   0.019
UniRef50_A4XU46 Cluster: Sulfatase; n=1; Pseudomonas mendocina y...    43   0.019

>UniRef50_Q8SZ72 Cluster: RE14504p; n=9; Eumetazoa|Rep: RE14504p -
           Drosophila melanogaster (Fruit fly)
          Length = 562

 Score =  502 bits (1239), Expect = e-141
 Identities = 236/455 (51%), Positives = 304/455 (66%), Gaps = 9/455 (1%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           MQH V+Y AEPRGLPL EKILPQYL +LGY +H+ GKWHLG +K +Y PL RGF SHVGF
Sbjct: 91  MQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLYRGFSSHVGF 150

Query: 61  WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLF 119
           W+G  D  DHT +E   WG D R G +VA+DL G Y TDV TD ++KV+ +HN ++ PLF
Sbjct: 151 WSGHQDYNDHTAVENNQWGLDMRNGTQVAYDLHGHYTTDVITDHSVKVIANHNATKGPLF 210

Query: 120 LMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRG 179
           L +AH+A HS NPY P+  P   +    +I +  R+KFAA++SK+D SVG++V  L    
Sbjct: 211 LYVAHAACHSSNPYNPLPVPDNDVIKMSHIPNYKRRKFAAMVSKMDNSVGQIVDQLRKSN 270

Query: 180 LLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVA 239
           +LENSI++FS+DNGGPA GFN N ASNYPLKGVKNTLWEGGVR AG +WSPLL    RV+
Sbjct: 271 MLENSIIIFSSDNGGPAQGFNLNFASNYPLKGVKNTLWEGGVRAAGLMWSPLLKKSQRVS 330

Query: 240 YQKMHISDWLPTLYSAAGGD--LSVL-ENLDGVNQWDALSKNTESPRTSVLHNIDDIWGI 296
            Q MHI DWLPTL  AAGG   LS L + +DG + W AL ++  SPR +VLHNIDDIWG 
Sbjct: 331 NQTMHIIDWLPTLLEAAGGQPALSNLSKQIDGQSIWRALVQDKASPRLNVLHNIDDIWGS 390

Query: 297 AALTVDKYKLIKGTIYKGVWDNWYGPSGREG--AYNASLLYDSHAGRILDKLNLMPPKEK 354
           AAL+V  +KL+KGT Y+G WD WYGP+G      Y+  L+  S AG+ L+ L ++P +  
Sbjct: 391 AALSVGDWKLVKGTNYRGSWDGWYGPAGERDPRLYDWQLVGRSRAGKALEALKMLPSRAD 450

Query: 355 VMELRDEATVKC-NDSIEVIQCKPR--DAPCVFNIDEDPCERRNXXXXXXXXXXXXXXXM 411
              +R  ATV C   S +   C      APC+F+I +DPCE+ N               +
Sbjct: 451 QQRIRAAATVSCPGQSSQGTSCVATAFSAPCLFHIRDDPCEQYNLAKQYPEVVNALMTEL 510

Query: 412 HKLNVSAVAPNAQPIDARGDPQYWGRVYTNFGNYE 446
            + N +AV P+ +P D R DP++W   +TNFG+Y+
Sbjct: 511 ERFNATAVPPSNKPADPRADPRFWNYTWTNFGDYQ 545


>UniRef50_A7RFN2 Cluster: Predicted protein; n=2; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 512

 Score =  360 bits (886), Expect = 4e-98
 Identities = 187/441 (42%), Positives = 257/441 (58%), Gaps = 27/441 (6%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           MQH VI  A+P GL LNE ++PQYLK LGY TH VGKWHLG +K EY P+ RGFDS+ G+
Sbjct: 89  MQHSVILAAQPYGLGLNETLMPQYLKRLGYATHGVGKWHLGFFKYEYTPIQRGFDSYFGY 148

Query: 61  WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
           W G+ D +DH+  E+  WG D     +     +G Y++D++ ++A+ V+++HN S PLFL
Sbjct: 149 WCGKGDYWDHSNNEKYGWGLDLHDSEQDVWTEWGHYSSDLFAEKAVNVISTHNASVPLFL 208

Query: 121 MLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGL 180
            L   AVHS N  +P++AP  LID FK I D  R+ FAA++S +D ++ KVV +L  R +
Sbjct: 209 YLPFQAVHSANFIQPLQAPPDLIDKFKNIKDERRRIFAAMVSSMDGAIKKVVDSLKARSM 268

Query: 181 LENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAY 240
             NSI+VF+TDNGGPA GF+ N ASN+PL+GVK TLWEGG+RG  F+ SPL+    RV  
Sbjct: 269 YNNSIIVFTTDNGGPANGFDSNMASNFPLRGVKRTLWEGGIRGTAFIHSPLITKPGRVMT 328

Query: 241 QKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALT 300
           + MH+SDWLPTLY+ AGGD+  L+NLDG + WD++S +  SPR  ++HNID +   AA  
Sbjct: 329 ELMHVSDWLPTLYTVAGGDIHDLQNLDGFDLWDSISTDAMSPREEMVHNIDPVNWEAAYR 388

Query: 301 VDKYKL-IKGTIYKGVWDNWYGPSGREGAYNASLLYDSHAGRILDKLNLMPPKEKVMELR 359
             ++K+ +  T Y   W  +  P+  E   + + L D+        +   PP E      
Sbjct: 389 FREWKIVVNQTKYMSGW--YPLPNIEEREPHPATLRDA-------VVKCGPPPE------ 433

Query: 360 DEATVKCNDSIEVIQCKPRDAPCVFNIDEDPCERRNXXXXXXXXXXXXXXXMHKLNVSAV 419
                     I V  C   D PC+FNI  DPCE  N               +       V
Sbjct: 434 ----------IPV-NCTASDGPCLFNIKNDPCEYVNLAKKELEILNNMLIWLEGYKKGMV 482

Query: 420 APNAQPIDARGDPQYWGRVYT 440
                P+D   +P  +G V+T
Sbjct: 483 PIRNTPLDPSANPANYGGVWT 503


>UniRef50_UPI0000E49A98 Cluster: PREDICTED: similar to
           ENSANGP00000029647, partial; n=7; Strongylocentrotus
           purpuratus|Rep: PREDICTED: similar to
           ENSANGP00000029647, partial - Strongylocentrotus
           purpuratus
          Length = 474

 Score =  355 bits (874), Expect = 1e-96
 Identities = 168/396 (42%), Positives = 244/396 (61%), Gaps = 22/396 (5%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           +Q+ VI   EP GL  NE I+PQYL+ LGY+TH+VGKWHLG +K+   P +RGF+S+ G+
Sbjct: 94  LQYSVIIADEPYGLGTNETIMPQYLRSLGYRTHMVGKWHLGFFKESLTPSHRGFESYYGY 153

Query: 61  WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
           + G  D + H + E    G DF     +   +FG Y+T++YT++  +++ +HN  EPL++
Sbjct: 154 YGGMQDYFTHESTEHTLTGFDFHVNGSIYKPVFGQYSTEIYTEKTQEIIRNHNPQEPLYI 213

Query: 121 MLAHSAVHSGNPY-EPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRG 179
            LAH AVHS N   + ++AP K  + F  I +  R+KFAA++S LD+S+G + + L    
Sbjct: 214 YLAHQAVHSANYNGQRLQAPYKYYERFPNITNENRRKFAAMVSALDDSLGNITQTLKESS 273

Query: 180 LLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVA 239
           L  N+++VF+TDNGGPA GF+ N A+N+PL+GVK+T WEGG+RGAGFLW  L++   R +
Sbjct: 274 LYNNTVIVFTTDNGGPAHGFDANYANNWPLRGVKDTTWEGGLRGAGFLWGALIEKPGRTS 333

Query: 240 YQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAAL 299
              MH+ DW+PTLY  AGG+ S L++LDG++ W  LS+   SPR  +LHNID +  ++A+
Sbjct: 334 DGMMHVCDWVPTLYGLAGGNTSTLQHLDGIDVWPMLSRAEPSPREEILHNIDPVRNVSAI 393

Query: 300 TVDKYKLIKGTIYKGVWDNWYGPSGREGAYNASLLYDSHAGRILDKLNLMPPKEKVMELR 359
            +  YKL++G  Y G W +WY P G      +S+  DS           +P    V    
Sbjct: 394 RIGDYKLVQGQNYNGSWSDWYPPEG-----ESSVDVDSKP---------VPNAFVVSCPS 439

Query: 360 DEATVKCNDSIEVIQCKPRDAPCVFNIDEDPCERRN 395
             A    N       C P++ PC+FNI  DPCE  N
Sbjct: 440 KPANASTN-------CDPKEKPCLFNIRHDPCEFNN 468


>UniRef50_Q9VVM4 Cluster: CG7402-PA; n=3; Diptera|Rep: CG7402-PA -
           Drosophila melanogaster (Fruit fly)
          Length = 579

 Score =  355 bits (872), Expect = 2e-96
 Identities = 179/464 (38%), Positives = 262/464 (56%), Gaps = 13/464 (2%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           MQH VI   EP GLP  E+++P+  +D GY THLVGKWHLG ++K+  P  RGFD H G+
Sbjct: 93  MQHFVIITDEPWGLPQRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMRGFDHHFGY 152

Query: 61  WTGRIDMYDHTTM---EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEP 117
           + G ID YDH         S G DFRR  E   +  G YAT+ +T EA +++  H+KS+P
Sbjct: 153 YNGYIDYYDHQVRMLDRNYSAGLDFRRDLEPCPEANGTYATEAFTSEAKRIIEQHDKSKP 212

Query: 118 LFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHT 177
           LF++L+H AVH+GN   P++AP++ +  F +I D  R+ +A ++S LD+SV + + AL  
Sbjct: 213 LFMVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLDKSVAQTIGALKD 272

Query: 178 RGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKAR 237
            G+L NSI++  +DNG P  G + NA SNYP +G K + WEGG+R AG LWSPLL  +  
Sbjct: 273 NGMLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQKESPWEGGIRSAGALWSPLLKERGY 332

Query: 238 VAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIA 297
           V+ Q +H  DWLPTL  AAG  L     LDG+N W  LS N E    +++H +D+++G +
Sbjct: 333 VSNQAIHAVDWLPTLAGAAGVSLPQDLPLDGINLWPMLSGNEEPKPRTMIHVLDEVFGYS 392

Query: 298 ALTVDKYKLIKGTIYKGVWDNWYGPSGREGAYNASLLYDSH--AGRILDKL-NLMPPKEK 354
           +   D  K + G+ +KG +D W G             Y+ H  A  +   L N    K++
Sbjct: 393 SYMRDTLKYVNGSSFKGRYDQWLGELETNEDDPLGESYEQHVLASDVQSLLGNRGLTKDR 452

Query: 355 VMELRDEATVKC------NDSIEVIQCKPRDAPCVFNIDEDPCERRNXXXXXXXXXXXXX 408
           + ++R EAT  C      N      +C+P  APC F++ +DPCER N             
Sbjct: 453 IRQMRSEATETCPPIEGQNPLESHFKCEPLKAPCFFDLAKDPCERYNLAQMYPLQLQQLA 512

Query: 409 XXMHKLNVSAVAPNAQP-IDARGDPQYWGRVYTNFGNYETQHGS 451
             + ++  +A+     P  D+R +P +    +  + N +TQ GS
Sbjct: 513 DELEQIRKTAIPSARVPHSDSRANPTFHNGNWEWWNNTDTQSGS 556


>UniRef50_UPI0000D55F5E Cluster: PREDICTED: similar to CG8646-PA;
           n=1; Tribolium castaneum|Rep: PREDICTED: similar to
           CG8646-PA - Tribolium castaneum
          Length = 626

 Score =  352 bits (866), Expect = 1e-95
 Identities = 192/470 (40%), Positives = 259/470 (55%), Gaps = 44/470 (9%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           MQH VI   EP GLPLNE ILPQYLK  GY TH +GKWHLG ++KEY P  RGFDSH G+
Sbjct: 88  MQHLVILEPEPWGLPLNETILPQYLKRNGYATHAIGKWHLGFFRKEYTPTYRGFDSHYGY 147

Query: 61  WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
                               D RR   V     G Y+T ++TDEA++++  HN   P+F+
Sbjct: 148 --------------------DMRRNMTVDWSAQGKYSTTLFTDEAVRLIREHNTENPMFM 187

Query: 121 MLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGL 180
            LAH A HSGN  +P++AP + I  F +I D  R+ +AA++S LD+SVG V+ AL  + +
Sbjct: 188 YLAHLAPHSGNDDDPLQAPDEEIAKFGHIADPERRIYAAMVSMLDKSVGSVIAALRDKHM 247

Query: 181 LENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAY 240
           LENSI+VF +DNG    G + N  SNYPL+G KN+ WEG +R    +WSPL+    RV+ 
Sbjct: 248 LENSIIVFMSDNGAKPDGIHANHGSNYPLRGNKNSAWEGAMRCVAAIWSPLIKKPQRVSN 307

Query: 241 QKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALT 300
             MHISDWLPT Y+AAG + + L  +DGV+ W ++S+  +SPRT +LHNID+I+   AL 
Sbjct: 308 SLMHISDWLPTFYTAAGLNKTELPKMDGVDMWASISEGKDSPRTELLHNIDEIYNYGALR 367

Query: 301 VDKYKLIKGTIYKGVWDNWYGPSGREG--AYNASLLYDSHAGRILDKLNLMPP-KEK--- 354
           V  +K + G+   G  D WYG SGR+    Y+ S +  S  G  L  L      KEK   
Sbjct: 368 VGNWKYLYGSTTNGKSDGWYGSSGRDPLYTYDDSAVLASQTGSTLAGLTTYQQIKEKHQG 427

Query: 355 -------------VMELRDEATVKC-----NDSIEVIQCKPRDAPCVFNIDEDPCERRNX 396
                        +  LR  A VKC      +  E  +C   ++PC+FNI EDPCE+ N 
Sbjct: 428 DTNFTHKLLDSETIKTLRGAAEVKCPRVNFEEIPESKKCNAVESPCLFNIKEDPCEQINL 487

Query: 397 XXXXXXXXXXXXXXMHKLNVSAVAPNAQPIDARGDPQYWGRVYTNFGNYE 446
                         + +   +A+     P D   DP  W   + N+ +YE
Sbjct: 488 AAERPMIVLNMEMALARFKQTALPIRNVPRDPNADPAKWNNTWVNWQDYE 537


>UniRef50_Q17B03 Cluster: Arylsulfatase b; n=3; Culicidae|Rep:
           Arylsulfatase b - Aedes aegypti (Yellowfever mosquito)
          Length = 675

 Score =  350 bits (861), Expect = 4e-95
 Identities = 175/451 (38%), Positives = 260/451 (57%), Gaps = 14/451 (3%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           MQH VI   EP GL L++KI+P+Y K+ GY+THLVGKWHLG   K+Y P  RGFD+HVG+
Sbjct: 97  MQHYVIVSDEPWGLGLDQKIMPEYFKEAGYRTHLVGKWHLGFSAKQYTPTMRGFDTHVGY 156

Query: 61  WTGRIDMYDHT---TMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEP 117
               +D +D+T   +  +   G D R    V +D  G YATD +T  A  ++  H+  +P
Sbjct: 157 LGPYVDYWDYTLKFSPPKSFQGYDMRNNLNVDYDSNGTYATDHFTKAASSIIERHDTKDP 216

Query: 118 LFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHT 177
           LFL++ H A H+ N  +P++AP++ I  F YI D  R+ +AA++SKLD+SVG++  +L +
Sbjct: 217 LFLVVNHLAPHAANDDDPLQAPEEDIRKFDYISDERRRIYAAMVSKLDDSVGQIFNSLRS 276

Query: 178 RGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKAR 237
           + +L+NSI++F +DNG P A  + N  SNYPL+G+K+  WE   R    +WSPLL  + R
Sbjct: 277 KNMLDNSIILFMSDNGAPTAALHANTGSNYPLRGIKSVPWEAATRCVAAIWSPLLQERQR 336

Query: 238 VAYQKMHISDWLPTLYSAAGGDLSVLEN---LDGVNQWDALSKNTESPRTSVLHNIDDIW 294
           V+ Q +HISDWLPTL SAAG D+   ++   +DG +QW+ALS +T +PR  VL+ ID+I+
Sbjct: 337 VSNQFIHISDWLPTLASAAGIDIPFSKDHSEIDGQDQWEALSYDTGNPRRVVLNMIDEIY 396

Query: 295 GIAALTVDKYKLIKGTIYKGVWDNWYGPSGREGAYNASLLYDSHAGRILDK-----LNLM 349
           G ++   + +K + GT   G +D WY   G+    + +L  D +   +L           
Sbjct: 397 GYSSYMENGFKFVNGTYSNGSYDGWY---GQPNTSDQTLSDDQYIDLVLQTEITRWAGET 453

Query: 350 PPKEKVMELRDEATVKCNDSIEVIQCKPRDAPCVFNIDEDPCERRNXXXXXXXXXXXXXX 409
             ++ +  LR  A V CN   E  +C P   PC+F+I  DPCE  +              
Sbjct: 454 ISRDTIKYLRKHARVNCNHQPEANKCNPLKRPCLFDIINDPCELNDLSHKFPMKFRELRS 513

Query: 410 XMHKLNVSAVAPNAQPIDARGDPQYWGRVYT 440
            +      A  P  +P D   +P  +G V+T
Sbjct: 514 TVQTYRRLATKPRNKPADPAANPANFGGVWT 544


>UniRef50_UPI00015B5C4D Cluster: PREDICTED: similar to
           ENSANGP00000018435; n=1; Nasonia vitripennis|Rep:
           PREDICTED: similar to ENSANGP00000018435 - Nasonia
           vitripennis
          Length = 710

 Score =  347 bits (854), Expect = 3e-94
 Identities = 169/347 (48%), Positives = 230/347 (66%), Gaps = 13/347 (3%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           MQH VI  AEPRGLPL+EKILPQYLK+ GY TH +GKWH G +++EY P  RGFDSH G+
Sbjct: 109 MQHLVILEAEPRGLPLHEKILPQYLKEAGYATHAIGKWHQGFHRREYTPTYRGFDSHFGY 168

Query: 61  WTGRIDMYDH----TTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHN-KS 115
           W G  D Y H    +  ++G  G D RR   +A D +G Y+TD++TDEA++++  H  ++
Sbjct: 169 WQGLQDYYTHEVGSSNPKEGFLGFDMRRNMSLARDTYGKYSTDLFTDEAVRLIEEHRPEA 228

Query: 116 EPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKAL 175
            P+FL LAH A HSGN  EP++AP + +  F Y++D  R+ +AA++SKLD+SVG+VV AL
Sbjct: 229 GPMFLYLAHLAPHSGNDNEPLQAPDEEVAKFSYVEDPERRIYAAMMSKLDQSVGEVVSAL 288

Query: 176 HTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSK 235
             + +L+NSIVVF  DNG    G + N  SNYPL+G+K + WEG VRGA  +WSPL+   
Sbjct: 289 RRKNMLQNSIVVFMADNGAATQGIHYNRGSNYPLRGIKASAWEGAVRGAAAVWSPLIQRP 348

Query: 236 ARVAYQKMHISDWLPTLYSAAG--GDLSVLENLDGVNQWDALSKNTES-PRTSVLHNIDD 292
            R+  + M I+DWLPTL SA+G    + V  N+DGV+QW A+S    S PR  +L NID 
Sbjct: 349 KRIYNELMSIADWLPTLLSASGLRDVVRVSANIDGVDQWPAISGVAPSPPRNEILVNIDP 408

Query: 293 IWGIAALTVDKYKLIKGTIYKGVWDNWYGPSGR---EGAYNASLLYD 336
           I+  +AL   ++K + GT+  G  + WYG +GR   +G   AS  YD
Sbjct: 409 IFNYSALRRGEFKYVLGTVGNG--EEWYGETGRPENQGLEGASPTYD 453



 Score = 69.3 bits (162), Expect = 2e-10
 Identities = 28/91 (30%), Positives = 49/91 (53%), Gaps = 1/91 (1%)

Query: 353 EKVMELRDEATVKCN-DSIEVIQCKPRDAPCVFNIDEDPCERRNXXXXXXXXXXXXXXXM 411
           +++++LR  A+++C     E + C P  +PC+FNI EDPCE+RN               +
Sbjct: 503 DELLKLRSSASLRCTVPESERVACHPLQSPCLFNIKEDPCEQRNLAASRAMILATLEEAL 562

Query: 412 HKLNVSAVAPNAQPIDARGDPQYWGRVYTNF 442
            K  V+A+ P+  P D + +P +W   + N+
Sbjct: 563 LKYRVTALPPSNVPNDPKANPAFWNHTWVNW 593


>UniRef50_UPI0000D55D4D Cluster: PREDICTED: similar to CG8646-PA;
           n=1; Tribolium castaneum|Rep: PREDICTED: similar to
           CG8646-PA - Tribolium castaneum
          Length = 558

 Score =  330 bits (810), Expect = 6e-89
 Identities = 177/407 (43%), Positives = 250/407 (61%), Gaps = 13/407 (3%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           +Q   I  AE R LP   KI+ +Y KD+GY THLVGKWHLG  +    P  RGFD   GF
Sbjct: 95  LQGPSITPAEGRSLP-EGKIMSEYFKDMGYATHLVGKWHLGHSRWNDTPTFRGFDHFFGF 153

Query: 61  WTGRIDMYDHTTM----EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE 116
           + G    YD+ +     ++   G D RR    + +  G YATD++ + A+ V+  HN + 
Sbjct: 154 YNGFTSYYDYVSNWKINDKEYSGFDLRRDTVPSWNDAGKYATDLFAEHAVDVIQKHNVNT 213

Query: 117 PLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALH 176
           PLF+M+AH AVH GN  + + APQ+ ++ FK+I D  R+ +AA++SKLD+S+G V +AL 
Sbjct: 214 PLFMMIAHLAVHVGNEGKWLEAPQETVNKFKHIRDPNRRTYAAMVSKLDDSIGAVFEALE 273

Query: 177 TRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKA 236
            + +L+N+IVVF +DNG P  G + N  SNYPL+G+K+TL+EGGVR    +WSPLL   +
Sbjct: 274 AKNMLQNTIVVFISDNGAPTVGPHHNWGSNYPLRGIKDTLFEGGVRTVACIWSPLLVQSS 333

Query: 237 RVAYQKMHISDWLPTLYSAAGGDLSVLE-NLDGVNQWDALSKNTESPRTSVLHNIDDIWG 295
           RV+   +HI+DWLPTL++A GGDLSVL+ +LDG++QW +L  +  S R  +  NID+   
Sbjct: 334 RVSTDLIHITDWLPTLFTAVGGDLSVLDPDLDGIDQWSSLVYDLPSARNDIPLNIDEKTR 393

Query: 296 IAALTVDKYKLIKGTIYKGVWDNWYG----PSGREGAYNASLLYDSHAGRILDKLNLMPP 351
            AAL    +KLI GT   G ++ ++G     +  E  YN S + DS  GRI  K+N  P 
Sbjct: 394 NAALRFSYWKLIVGTSGNGSYNGYFGAPLNENIEEQQYNTSAINDSPVGRIAKKINYNPL 453

Query: 352 KEKVME-LRDEATVKCNDS-IEVIQCKPRD-APCVFNIDEDPCERRN 395
            E   + LR  AT+KC D+  +   C P   A C++NI  DPCE  +
Sbjct: 454 SETDFDGLRRVATLKCLDAKAKRNPCDPASGAVCLYNIPNDPCEEND 500


>UniRef50_UPI00015B40BD Cluster: PREDICTED: similar to RE14504p;
           n=1; Nasonia vitripennis|Rep: PREDICTED: similar to
           RE14504p - Nasonia vitripennis
          Length = 571

 Score =  323 bits (793), Expect = 7e-87
 Identities = 168/397 (42%), Positives = 249/397 (62%), Gaps = 16/397 (4%)

Query: 9   AEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMY 68
           AEPRG+PL+E++LP+YL++LGY T LVGKWHLG Y  ++ P  RGFDS VG++ G I  +
Sbjct: 99  AEPRGVPLHERLLPEYLRELGYVTRLVGKWHLGYYTDKHTPTRRGFDSFVGYYGGVITYF 158

Query: 69  DHTTMEQGSWGTDFRRGFEVAHDLFG--VYATDVYTDEAIKVVNSHNKSEPLFLMLAHSA 126
           +HT  +    G D+        + F    Y TD  +D+A  V+ +H++ +PLFL LAH A
Sbjct: 159 NHTVTKDKHTGIDYHWDTSGKIEPFDNDQYVTDFISDQAEAVIKNHDRKKPLFLQLAHVA 218

Query: 127 VHSGNPYEPI--RAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENS 184
            H+    +PI  R   ++ D   YI D  R+K+A V++ +D+SVG+VVKAL    +L NS
Sbjct: 219 AHASENRDPIEVRNMTEVNDTLSYIPDINRRKYAGVVTAMDDSVGRVVKALKDANMLSNS 278

Query: 185 IVVFSTDNGGPAAGFN-DNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKM 243
           I++F +DNG P A     N  SNYPL+G+K T++EGGVR    ++SP L  + RV+ +  
Sbjct: 279 IIIFMSDNGSPTAEAPYTNYGSNYPLRGIKATVFEGGVRVPACVFSPRLKDRFRVSDELF 338

Query: 244 HISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALTVDK 303
           HI+DW PTLY  AGGDLS +++LDGV+QW ++S + +S R S+L NID++    A     
Sbjct: 339 HITDWFPTLYKLAGGDLSKIQDLDGVDQWSSISGSQKSNRESLLVNIDEVSNPEAAISGY 398

Query: 304 YKLIKGTIYKGVWDNWYGPSGRE---GAYNASLLYDSHAGRILDKL--NLMPPKEKVMEL 358
           YKLI+G      +D++YG  G +     Y+ + +  S AGR +  L    +PP++++ EL
Sbjct: 399 YKLIRGI---NRYDDYYGKDGNDYSPKTYDVTGVLSSLAGRAIASLGNQYLPPQKRITEL 455

Query: 359 RDEATVKCNDSIEVIQCKPRDAPCVFNIDEDPCERRN 395
           R++AT++C    +   C  RD  C+F+I +DPCE  N
Sbjct: 456 RNKATLRCEKKDDRPSC--RDT-CLFDIVKDPCETTN 489


>UniRef50_Q8IQS4 Cluster: CG32191-PA; n=2; Sophophora|Rep:
           CG32191-PA - Drosophila melanogaster (Fruit fly)
          Length = 554

 Score =  317 bits (778), Expect = 4e-85
 Identities = 165/407 (40%), Positives = 242/407 (59%), Gaps = 16/407 (3%)

Query: 2   QHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFW 61
           QH VI   EP  L LN  ++P+  K+ GY T+LVGKWHLG  + EY P  RGFD H G+W
Sbjct: 93  QHFVISNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYW 152

Query: 62  TGRIDMYDHTT---MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSH-NKSEP 117
              ID +   +   +   S G DFRR  E+     GVY TD+ T EA +++  H +K +P
Sbjct: 153 GAYIDYFQRRSKMPVANYSLGYDFRRNMELECRDRGVYVTDLLTAEAERLIKDHADKEQP 212

Query: 118 LFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHT 177
           LFLML+H A H+ N  +P++AP++ I  F YI D  R+K+AA++SKLD+SVG+++ AL +
Sbjct: 213 LFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDPNRRKYAAMISKLDQSVGRIITALSS 272

Query: 178 RGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKAR 237
              LENSIV+F +DNG P+ G   N  SN+PL+G KNT WEGGVR AG +WS  L ++  
Sbjct: 273 TDQLENSIVIFYSDNGAPSVGMFSNTGSNFPLRGQKNTPWEGGVRVAGAIWSSGLQARGS 332

Query: 238 VAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRT--SVLHNIDDIWG 295
           +  Q ++++DWLPTL  AA  +L     LDG++ W  LS + ++P     +LH +DD+W 
Sbjct: 333 IFRQPLYVADWLPTLSRAADIELDSSLKLDGIDLWPELSGSADAPHVPREILHILDDVWR 392

Query: 296 IAALTVDKYKLIKGTIYKGVWDN--WYGP----SGREGAYNASLLYDSHAGRILDKLNLM 349
           ++AL + ++K + GT   G +D+   Y        R+  Y A  + +S   R L + +L 
Sbjct: 393 LSALQMGQWKYVNGTTASGRYDSVLTYRELDDLDPRDSRY-AVTVRNSATSRALSRYDLR 451

Query: 350 P-PKEKVMELRDEATVKCNDSIEVIQCKPRDAPCVFNIDEDPCERRN 395
              ++++   R  A V+C D      C P    C+++I  DPCE+ N
Sbjct: 452 RLTQQRISLTRRLAAVRCGDLQR--SCNPLLEECLYDILSDPCEQNN 496


>UniRef50_UPI0000DB708B Cluster: PREDICTED: similar to CG7402-PA
           isoform 2; n=2; Apocrita|Rep: PREDICTED: similar to
           CG7402-PA isoform 2 - Apis mellifera
          Length = 609

 Score =  316 bits (775), Expect = 1e-84
 Identities = 173/442 (39%), Positives = 249/442 (56%), Gaps = 17/442 (3%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           MQ   I G EPRGLPL+ KILP++L+ LGY T L+GKWH+G +  +Y PL+RGFD+  GF
Sbjct: 96  MQGDGIRGGEPRGLPLHIKILPEHLRGLGYVTKLIGKWHMGFHTLQYTPLHRGFDTFFGF 155

Query: 61  WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
           +   I  YD+    Q   G D   G + A+ +   YATD++T+EAIK++ +H    PL+L
Sbjct: 156 YNSHITYYDYEYSNQNMTGYDMHCGDDPAYGMKREYATDLFTNEAIKIIENHELPRPLYL 215

Query: 121 MLAHSAVHSGNPYEPIRAPQKLI-DAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRG 179
            ++H AVH+     PI  P     D    I +  R+K+A ++SKLDESVG+VV AL  +G
Sbjct: 216 QISHLAVHA-----PIEQPDDSSRDEIVQIREPNRRKYAKMVSKLDESVGRVVHALGEKG 270

Query: 180 LLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVA 239
           +L +S+++F TDNG  + G   N  SNYPL+G K TL+EGGVRG   LWS  L+  ARV 
Sbjct: 271 MLRDSLILFLTDNGAASIGRYRNYGSNYPLRGTKYTLYEGGVRGVAALWSSRLEKGARVF 330

Query: 240 YQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAAL 299
            + +HI+DWLPTLYSAAGGDL  L  +DG++QW  LS+     R  +L NID++      
Sbjct: 331 KKLIHITDWLPTLYSAAGGDLKDLGKIDGIDQWRVLSEGQGHGREKLLLNIDEVMITEGA 390

Query: 300 TVDKYKLIKGTIYKGVWDNWYGPSGR---EGAYNASLLYDSHAGRILDKL-NLMPPKEKV 355
              ++KL++G    G +D +YG SGR      Y   +L  + +  I   L   +     +
Sbjct: 391 IYSRFKLLRG---NGYYDKYYGDSGRTLETPPYTEVVLKSAVSQSITYHLGGPVTQPSTM 447

Query: 356 MELRDEATVKCNDSIEVIQCKP----RDAPCVFNIDEDPCERRNXXXXXXXXXXXXXXXM 411
           ++LR EATV+C+ ++               C+F+I  DPCE +N               +
Sbjct: 448 VQLRREATVQCHPNMSYYYRHSFTFCNVTECLFDIVNDPCETKNIAEAYARIARDLDLYL 507

Query: 412 HKLNVSAVAPNAQPIDARGDPQ 433
                  +    +P+D   DP+
Sbjct: 508 EHYGRVLMKQIRKPVDWLADPK 529


>UniRef50_UPI0000D56522 Cluster: PREDICTED: similar to CG7402-PA;
           n=1; Tribolium castaneum|Rep: PREDICTED: similar to
           CG7402-PA - Tribolium castaneum
          Length = 558

 Score =  311 bits (764), Expect = 2e-83
 Identities = 159/408 (38%), Positives = 240/408 (58%), Gaps = 16/408 (3%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           MQ   I   E R LPLN  ++PQ+LK+LGY+TH+VGKWHLGS  +   P  +GFDSH G+
Sbjct: 91  MQGLPIVAGENRSLPLNMPLMPQHLKNLGYRTHIVGKWHLGSAYRSSTPTEKGFDSHFGY 150

Query: 61  WTGRIDMYDHTTMEQGSW--GTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPL 118
           W G    YD+ T    +   G D    FE      G YAT V+T+ A+ ++  HN + PL
Sbjct: 151 WNGFTGYYDYFTDFNSTAIEGFDLHDRFETERGYQGQYATRVFTERALDIIEGHNTTRPL 210

Query: 119 FLMLAHSAVHSGNPYEPIRAPQKLID--AFKYIDDSARQKFAAVLSKLDESVGKVVKALH 176
           FL++ H A H+G     +  P ++     + YI D  R+ +A ++++LD S+G+VV+ L 
Sbjct: 211 FLLMTHLAAHAGRDGTELGVPNEVEAQRTYSYIQDPRRRLYAEIVAELDRSIGQVVRKLS 270

Query: 177 TRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKA 236
            R +LENSI++F +DNG P  G   N+ SN+PL+G+K T +EGG+RG   ++SPLL  + 
Sbjct: 271 ERQMLENSIILFFSDNGAPTVGPYTNSGSNWPLRGIKLTNFEGGIRGTATIFSPLLKKRG 330

Query: 237 RVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGI 296
            V  + +H+SDWLPT Y+AAGG+L+ L  +DGVNQW  LS +T SPR+ +L NI++    
Sbjct: 331 YVNKELIHVSDWLPTFYAAAGGNLADLGPIDGVNQWPTLSLDTPSPRSEILVNINEQDNT 390

Query: 297 AALTVD--KYKLIKGTIYKGVWDNWYGPSGRE---GAYNASLLYDSHAGRILDKLNLMP- 350
            ++  D  ++KL+ G    G +D ++G SGR      Y+   +  S     + +L   P 
Sbjct: 391 TSIITDNGRFKLVTGAFEGGTYDGYFGDSGRSPDTPPYDPFAVLQSETNIAIQELTQTPI 450

Query: 351 PKEKVMELRDEATVK-C-NDSIE-VIQCKPRDAPCVFNIDEDPCERRN 395
            ++++   R +  +  C NDS    + C     PC+F+++ DPCE  N
Sbjct: 451 TRQQIRVTRAQIDLSWCRNDSFRPPLNC---SQPCLFDLENDPCETTN 495


>UniRef50_Q9VVM1 Cluster: CG7408-PB; n=2; Drosophila
           melanogaster|Rep: CG7408-PB - Drosophila melanogaster
           (Fruit fly)
          Length = 585

 Score =  310 bits (762), Expect = 4e-83
 Identities = 166/453 (36%), Positives = 250/453 (55%), Gaps = 13/453 (2%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           MQH VI   +P GLPLNE  + +  ++ GY+T L+GKWHLG  ++ + P  RGFD H+G+
Sbjct: 100 MQHYVIVNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGFDRHLGY 159

Query: 61  WTGRIDMYDHTTMEQ--GSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSH---NKS 115
               +D Y  +  +Q  G  G DFR   +  HD  G Y TD+ TD A+K +  H   N S
Sbjct: 160 LGAYVDYYTQSYEQQNKGYNGHDFRDSLKSTHDHVGHYVTDLLTDAAVKEIEDHGSKNSS 219

Query: 116 EPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKAL 175
           +PLFL+L H A H+ N  +P++AP + +  F+YI +   + +AA++S+LD+SVG V+ AL
Sbjct: 220 QPLFLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKTHRYYAAMVSRLDKSVGSVIDAL 279

Query: 176 HTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSK 235
             + +L+NSI++F +DNGGP  G +   ASNYPL+G KN+ WEG +R +  +WS   +  
Sbjct: 280 ARQEMLQNSIILFLSDNGGPTQGQHSTTASNYPLRGQKNSPWEGALRSSAAIWSTEFERL 339

Query: 236 ARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWG 295
             V  Q+++I D LPTL +AAG       +LDG+N W AL    ES    ++H ID+   
Sbjct: 340 GSVWKQQIYIGDLLPTLAAAAGISPDPALHLDGLNLWSALKYGYESVEREIVHVIDEDVA 399

Query: 296 IAAL--TVDKYKLIKGTIYKGVWDNWYGPSGREGAYNASLLYDSHAGRILDKLNLMP--- 350
              L  T  K+K+I GT  +G++D W G          ++ Y+         L L     
Sbjct: 400 EPHLSYTRGKWKVISGTTNQGLYDGWLGHRETSEVDPRAVEYEELVRNTSVWLQLQQVSF 459

Query: 351 PKEKVMELRDEATVKCND-SIEVIQCKPRDAPCVFNIDEDPCERRNXXXXXXXXXXXXX- 408
            +  + ELRD++ ++C D +  V  C P + PC+F+I+ DPCER N              
Sbjct: 460 GERNISELRDQSRIECPDPATGVKPCLPLEGPCLFDIEADPCERSNLYAEYQNSTIFLDL 519

Query: 409 -XXMHKLNVSAVAPNAQPIDARGDPQYWGRVYT 440
              + +    A  PN +P D   DP+++   +T
Sbjct: 520 WSRIQQFAKQAHPPNNKPGDPNCDPRFYHNEWT 552


>UniRef50_UPI00015B51A4 Cluster: PREDICTED: similar to arylsulfatase
           b; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to
           arylsulfatase b - Nasonia vitripennis
          Length = 581

 Score =  309 bits (759), Expect = 9e-83
 Identities = 171/462 (37%), Positives = 260/462 (56%), Gaps = 31/462 (6%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           MQ   +  AEPRG+PLN  ++P+ ++ LGY+T LVGKWHLG   ++Y P+ RGFD+  G+
Sbjct: 100 MQGTPMRPAEPRGIPLNVSLMPEQMRRLGYETRLVGKWHLGYTTEDYTPVRRGFDTFFGY 159

Query: 61  WTGRIDMYDHTTMEQGS---WGTDFRR----GFEVAHDLFGVYATDVYTDEAIKVVNSHN 113
           + G I  YD+      +    G D  R     FE+AH     Y TD+ TDEA K++ ++ 
Sbjct: 160 YNGFISYYDYWIGWNDTNEVTGYDLHRDESDSFELAHS--SEYFTDLITDEAEKIIRNNK 217

Query: 114 KSEPLFLMLAHSAVHSGNPY--EP--IRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVG 169
            ++PLFL ++H AVH+G+    +P  +R    +  +F YI+D   +K+A +++ LDESVG
Sbjct: 218 NAKPLFLEISHLAVHAGSKVHDDPLEVRRTDDVNASFPYIEDYQHRKYAGMMAALDESVG 277

Query: 170 KVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWS 229
           +VVKAL    +LENSI++F +DNG P  G  +N  SNYP++G+K  ++EG  R A  ++S
Sbjct: 278 RVVKALKEAEMLENSIIIFMSDNGAPTVGLYNNTGSNYPMRGIKGGMFEGAARAAACIFS 337

Query: 230 PLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLEN-------LDGVNQWDALSKNTESP 282
           PL+ + +RV+ + MHI DWLPTLY+AAGG+   L++       LDGV+QW ++     S 
Sbjct: 338 PLIKAHSRVSEELMHIVDWLPTLYTAAGGNPMDLQSQFDGALPLDGVSQWSSIVAGGPSS 397

Query: 283 RTSVLHNIDDIWGIAALTVDKYKLIKGTIYKGVWDNWYGPSGRE---GAYNASLLYDSHA 339
           R S+L NID+  G  A  + ++KL+KG   +   D +YG SG +    AYN   +  S A
Sbjct: 398 RQSLLVNIDEAQGFEAAIIGRHKLVKGMTKE---DGYYGNSGNDPSFPAYNVKKVLSSTA 454

Query: 340 GRILDKLN--LMPPKEKVMELRDEATVKCNDSIEVIQCKPRDAPCVFNIDEDPCERRNXX 397
           G  + KL     P   + + LR ++ + C        C      C+F++ +DPCE R+  
Sbjct: 455 GASIGKLAGFASPSARRALWLRQKSVITCKPFTSAANC---SGTCLFDLSKDPCETRDLS 511

Query: 398 XXXXXXXXXXXXXMHKLNVSAVAPNAQPIDARGDPQYWGRVY 439
                        + +     +     P DA G P+Y+  VY
Sbjct: 512 SKLPLIVKKLESFLGEYRRVLMPQTNSPQDACGLPKYFNGVY 553


>UniRef50_UPI0000DB708D Cluster: PREDICTED: similar to CG8646-PA;
           n=1; Apis mellifera|Rep: PREDICTED: similar to CG8646-PA
           - Apis mellifera
          Length = 506

 Score =  305 bits (749), Expect = 1e-81
 Identities = 160/398 (40%), Positives = 239/398 (60%), Gaps = 32/398 (8%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           MQ   +   EPR +PLN  +LP+YL+ LGY THLVGKWH+G Y   + P  RGFD+  G+
Sbjct: 73  MQGYPLKAGEPRAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPTRRGFDTFFGY 132

Query: 61  WTGRIDMYDHTTMEQGSWGTD--FRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPL 118
           ++G I  ++HT  +    G D  +     ++ D    Y TD+ T+ A  ++ +H++ +PL
Sbjct: 133 YSGYISYFNHTIKQDDHIGYDLHYDNSKNLSIDYNFEYTTDLITERAENIIKNHDRRKPL 192

Query: 119 FLMLAHSAVHSGNPYE--PIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALH 176
           +L L H A HS +  E   +R  Q+     KYI+D  R+K+A V++ +DESVG+V+KAL 
Sbjct: 193 YLQLCHLAAHSSDAKEVMEVRDEQETNATLKYIEDYNRRKYAGVVTAMDESVGRVIKALG 252

Query: 177 TRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKA 236
              +LENSI+VF +DNG    G  +N  SNYPL+G+K TL+EGG+RG   ++S L+ + +
Sbjct: 253 QSSMLENSIIVFISDNGAQTEGLLENYGSNYPLRGLKFTLFEGGIRGVACVYSRLIQNSS 312

Query: 237 RVAYQKMHISDWLPTLYSAAGGDLSVL-ENLDGVNQWDALSKNTESPRTSVLHNIDDIWG 295
           R++ + MHI+DWLPT YSAAGG+L  L EN+DGV+QWD +    ES R SVL NID++  
Sbjct: 313 RISNELMHITDWLPTFYSAAGGNLENLEENMDGVDQWDTIVSGKESKRESVLLNIDEVED 372

Query: 296 IAALTVDKYKLIKGTIYKGV-WDNWYGPSGREGAYNASLLYDSHAGRILDKLNLMPPKEK 354
           +++  + KYKLI     K + ++++YG +G   +Y                     P+  
Sbjct: 373 VSSALIGKYKLIING--KNIQYNDYYGDNGTSVSY---------------------PEYN 409

Query: 355 VMELRDEATVKCNDSIEVIQCKPRDAPCVFNIDEDPCE 392
           V  LR++A V CN+     +C  +   C+F+I  DPCE
Sbjct: 410 VSSLRNKARVVCNNFTSYSKCVDK---CLFDIYNDPCE 444


>UniRef50_UPI0000D56521 Cluster: PREDICTED: similar to CG7402-PA;
           n=1; Tribolium castaneum|Rep: PREDICTED: similar to
           CG7402-PA - Tribolium castaneum
          Length = 531

 Score =  303 bits (743), Expect = 8e-81
 Identities = 164/446 (36%), Positives = 246/446 (55%), Gaps = 30/446 (6%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           MQ   +   E R LPLN   +P + ++LGYKTHLVGKWHLG+  KE  PL +GFDSH G+
Sbjct: 89  MQGYPLKAGENRSLPLNMPTMPLHFQNLGYKTHLVGKWHLGAAYKEDTPLGKGFDSHFGY 148

Query: 61  WTGRIDMYDHTT---MEQGSW--GTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKS 115
           W G +  +D+ +   M+ G+   G D    FE      G YAT+++T+ ++ V+  H+  
Sbjct: 149 WNGFVGYFDYVSFSKMDNGTLVKGLDLHDQFEPVWGSQGRYATELFTERSLDVIEGHDVR 208

Query: 116 EPLFLMLAHSAVHSGNPYEPIRAPQ--KLIDAFKYIDDSARQKFAAVLSKLDESVGKVVK 173
            PLFL+++H A H+G     +  P   +    F YI D  R+ +A V+S LD S+G+++ 
Sbjct: 209 VPLFLVVSHLAAHTGQNGSELGVPDVDQTNHEFSYIQDPRRRLYAGVVSHLDASIGRIMA 268

Query: 174 ALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLD 233
            L  + +L+NSIV+F +DNG    G  +N+ SN+PL+GVK + +EGGVR A  ++SPL  
Sbjct: 269 KLDEKQMLDNSIVLFFSDNGAQTVGMYENSGSNWPLRGVKFSDFEGGVRVAATIYSPLFH 328

Query: 234 SKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDI 293
            K  V+   +HISDWLPTLYSAAGGD++ L  +DG++QWDAL+ N  S RT +L NID++
Sbjct: 329 KKGYVSEHLIHISDWLPTLYSAAGGDVAHLGQIDGIDQWDALTNNNPSNRTEILINIDEV 388

Query: 294 WGIAALTVDKYKLIKGTIYKGVWDNWYGPSGREGAYNASLLYDSHAGRILDKLNLMPPKE 353
               A+  DK+KLI+G+ ++G +D +YG SGR G  N                   P   
Sbjct: 389 DENFAIIRDKFKLIQGSYHEGTFDQYYGDSGR-GPEN-------------------PTPN 428

Query: 354 KVMELRDEATVKCNDSIEVIQCKPRDAPCVFNIDEDPCERRNXXXXXXXXXXXXXXXMHK 413
                 D +  +  D   ++ C      C+F++D+DPCE  N               + +
Sbjct: 429 PNHTTTDLSWCRAPDQTPILNC---TKGCLFDLDKDPCETTNIIESEPEIANQLYEKIAQ 485

Query: 414 LNVSAVAPNAQPIDARGDPQYWGRVY 439
                V    +  D + DP ++   +
Sbjct: 486 FWKELVPQRNKDTDPKSDPIFYNNTW 511


>UniRef50_A7SBG5 Cluster: Predicted protein; n=1; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 584

 Score =  293 bits (720), Expect = 5e-78
 Identities = 160/416 (38%), Positives = 226/416 (54%), Gaps = 38/416 (9%)

Query: 35  VGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFG 94
           +G WHLG + KEY P+ RGFDS  GFW  + D ++H++ E   WG D R   E      G
Sbjct: 90  LGMWHLGFFTKEYTPVYRGFDSFYGFWNAKTDYWNHSSYENNFWGVDLRDNMEPVQSEDG 149

Query: 95  VYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDA--------F 146
            Y T+++T EA+KV+ +H+ S PLFL +AH AVH+ NP EP++APQ  ID         F
Sbjct: 150 TYGTELFTREAVKVIEAHDTSTPLFLYVAHQAVHTANPNEPLQAPQDKIDVSLKQRQQRF 209

Query: 147 K-YIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAAS 205
           K  IDD  RQ +AA+++ LD+SVG +  AL  R +L +S+V+F+TDNGG   G N N  S
Sbjct: 210 KGTIDDDQRQVYAAMVTSLDQSVGDIFAALSKRHMLRDSVVIFTTDNGGAPYGLNWNRGS 269

Query: 206 NYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVL-E 264
           N+PL+G K+ LWEGGV+G  F++S L+  K RV+ + + ++DW+PT+Y  AGG    L  
Sbjct: 270 NFPLRGGKDMLWEGGVKGVAFVYSDLIKQKGRVSKELIDVTDWVPTIYHLAGGTAEFLVP 329

Query: 265 NLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALTVDKYKLIKGT--IYKGVWDNWYGP 322
           N+DG N W  +S+   SPR  +LHNID     A L   KYK+++G    YKGV   WY  
Sbjct: 330 NMDGKNVWSTISEGAPSPRDEILHNIDPWRKFAGLRKGKYKIVQGMDDTYKGV--GWY-- 385

Query: 323 SGREGAYNASLLYDSHAGRILDKLNLMPPKEKVMELRDEATVKCNDSI-EVIQCKPRDAP 381
                        D + G  L  +       K  EL   A + C  +  E  +C   D  
Sbjct: 386 -------------DRYPGHALSSM-------KQPELLPGAVIDCKKTFDEERKCDSSDGK 425

Query: 382 -CVFNIDEDPCERRNXXXXXXXXXXXXXXXMHKLNVSAVAPNAQPIDARGDPQYWG 436
            C+F+++EDPCE  +               +      A+ P   PI+   +P  +G
Sbjct: 426 FCLFDMEEDPCEYHDLSNQLPEVLAEMKTRLEYYKNIALPPWFPPINKAANPANFG 481


>UniRef50_Q8MPH9 Cluster: Glucosinolate sulphatase; n=3; Plutella
           xylostella|Rep: Glucosinolate sulphatase - Plutella
           xylostella (Diamondback moth)
          Length = 547

 Score =  288 bits (706), Expect = 2e-76
 Identities = 164/460 (35%), Positives = 253/460 (55%), Gaps = 20/460 (4%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           MQ   +  AE RG+PL E+++ QYL+D GY+T +VGKWH+G    E LP  RGF++H G 
Sbjct: 88  MQGMPLSNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFFEQLPTYRGFENHFGV 147

Query: 61  WTGRIDMYDHTTMEQ--GSWGTDFRRGFEVAHDLFGV-YATDVYTDEAIKVVNSHNKSEP 117
             G ID Y++   EQ  G   T      ++  D     Y TDVYT+++  ++ +HN SEP
Sbjct: 148 RGGFIDYYEYNAQEQLDGRPVTGLCLFDDLQPDWTTEGYITDVYTEKSTTIIENHNVSEP 207

Query: 118 LFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHT 177
           L+L+L H A H+GN    ++AP + + A ++++   R+ FAA++ KLD+S+G++V  L  
Sbjct: 208 LYLLLTHHAPHNGNEDASLQAPPEEVRAQRHVELHPRRIFAAMVKKLDDSIGEIVATLEK 267

Query: 178 RGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLW-----SPLL 232
           +G+LEN+I+ FSTDNG P  G   N+ SNYPL+GVK + WEGG+RG   +W     +P  
Sbjct: 268 KGMLENTIITFSTDNGAPTVGLGANSGSNYPLRGVKKSPWEGGIRGNAMIWAGPEVAPGN 327

Query: 233 DSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDD 292
             + +V    MH +DW+PTL  A G  +     LDG+  W  + +N  SPRT +   IDD
Sbjct: 328 AWRGKVYDGNMHAADWVPTLLEAIGEKIPA--GLDGIPMWSHIIENKPSPRTEIF-EIDD 384

Query: 293 IWGIAALTVDKYKLIKGTI----YKGVWDNWYGPSGREGAYNASLLYDSHAGRILDKLNL 348
            +  +++T+ ++KL+KGTI     K   ++  G  G    Y    L DS A   L+ + +
Sbjct: 385 YFNHSSVTLGRHKLVKGTIDESLSKHYGEDLRGIIGTPPDYKQK-LRDSKAWESLETIGI 443

Query: 349 MPPKEKVMELRDEATVKCNDSIEVIQCKP-RDAPCVFNIDEDPCERRNXXXXXXXXXXXX 407
            P    VM  RDEA V C + +    C P  ++ C+++I EDPCE R+            
Sbjct: 444 -PLDADVMADRDEAIVTCGNVVPK-PCSPSAESWCLYDIIEDPCELRDLSEELPQLAQIL 501

Query: 408 XXXMHKLNVSAVAPNAQPI-DARGDPQYWGRVYTNFGNYE 446
              + +     +    Q + D +  P+Y+   +  + + E
Sbjct: 502 LYRLEQEEAKIIPREGQYVADPKSAPKYFNYTWDAYLSVE 541


>UniRef50_P50429 Cluster: Arylsulfatase B precursor; n=17;
           Eumetazoa|Rep: Arylsulfatase B precursor - Mus musculus
           (Mouse)
          Length = 534

 Score =  266 bits (653), Expect = 6e-70
 Identities = 130/297 (43%), Positives = 186/297 (62%), Gaps = 15/297 (5%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           +QH +I   +P  +PL+EK+LPQ LK+ GY TH+VGKWHLG Y+KE LP  RGFD++ G+
Sbjct: 110 LQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGY 169

Query: 61  WTGRIDMYDHTTME--QGSWGT----DFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNK 114
             G  D Y H      +   GT    D R G E A +   +Y+T+++T  A  V+ +H  
Sbjct: 170 LLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTKRATTVIANHPP 229

Query: 115 SEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKA 174
            +PLFL LA  +VH     +P++ P++ ++ + +I D  R+ +A ++S +DE+VG V KA
Sbjct: 230 EKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSLMDEAVGNVTKA 284

Query: 175 LHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDS 234
           L + GL  N++ +FSTDNGG       +  +N+PL+G K TLWEGG+RG GF+ SPLL  
Sbjct: 285 LKSHGLWNNTVFIFSTDNGGQTR----SGGNNWPLRGRKGTLWEGGIRGTGFVASPLLKQ 340

Query: 235 KARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNID 291
           K   + + MHI+DWLPTL   AGG  +  + LDG N W  +S+   SPR  +LHNID
Sbjct: 341 KGVKSRELMHITDWLPTLVDLAGGSTNGTKPLDGFNMWKTISEGHPSPRVELLHNID 397


>UniRef50_Q5FYB0 Cluster: Arylsulfatase J precursor; n=69;
           Eumetazoa|Rep: Arylsulfatase J precursor - Homo sapiens
           (Human)
          Length = 599

 Score =  259 bits (635), Expect = 9e-68
 Identities = 129/297 (43%), Positives = 183/297 (61%), Gaps = 13/297 (4%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           +QH +I   +P  LPL+   LPQ LK++GY TH+VGKWHLG Y+KE +P  RGFD+  G 
Sbjct: 140 LQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGS 199

Query: 61  WTGRIDMYDHTTMEQ-GSWGTDFRRGFEVAHDLF-GVYATDVYTDEAIKVVNSHNKSEPL 118
             G  D Y H   +  G  G D       A D   G+Y+T +YT    +++ SHN ++P+
Sbjct: 200 LLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPI 259

Query: 119 FLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTR 178
           FL +A+ AVHS     P++AP +  + ++ I +  R+++AA+LS LDE++  V  AL T 
Sbjct: 260 FLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTY 314

Query: 179 GLLENSIVVFSTDNGG-PAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKAR 237
           G   NSI+++S+DNGG P AG      SN+PL+G K T WEGG+R  GF+ SPLL +K  
Sbjct: 315 GFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGT 369

Query: 238 VAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIW 294
           V  + +HI+DW PTL S A G +     LDG + W+ +S+   SPR  +LHNID I+
Sbjct: 370 VCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSPRVDILHNIDPIY 426


>UniRef50_Q9NJU8 Cluster: Sulfatase 1; n=3; Coelomata|Rep: Sulfatase
           1 - Helix pomatia (Roman snail) (Edible snail)
          Length = 503

 Score =  258 bits (632), Expect = 2e-67
 Identities = 139/332 (41%), Positives = 191/332 (57%), Gaps = 26/332 (7%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           +QHG+I   +P  LP +   L   LK+ GY TH+VGKWHLG YK+EYLP NRGFD++ G+
Sbjct: 98  LQHGIINSCQPNALPNDSPTLADKLKESGYATHMVGKWHLGFYKQEYLPWNRGFDTYFGY 157

Query: 61  WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
                D ++H    +     D R       +  G Y+  ++T +AI VV SHN S+PLFL
Sbjct: 158 LNAAEDYFNHNVPWRQVRYLDLRDNNGPVRNETGQYSAHLFTGKAIDVVQSHNTSKPLFL 217

Query: 121 MLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGL 180
            LA+ +VH+     P+  P+K    ++ I D  R+ FA ++S LDE V  + +AL  +GL
Sbjct: 218 YLAYQSVHA-----PLEVPEKYEHKYRNITDKNRRTFAGMVSALDEGVANLTQALKDKGL 272

Query: 181 LENSIVVFSTDNGGPA-AGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVA 239
             N++++FSTDNGG   AG N     NYPL+G K +LWEGG  G GF+    L     V+
Sbjct: 273 WNNTVLIFSTDNGGQIHAGGN-----NYPLRGWKASLWEGGFHGVGFVSGGALKRSGAVS 327

Query: 240 YQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIW---GI 296
              +H+SDW PTL + AGG+L+  + LDG NQWD +S  T SPR  +LHNID ++   G+
Sbjct: 328 KGLIHVSDWFPTLVTLAGGNLNGTKPLDGFNQWDTISNETPSPREILLHNIDILYPQKGV 387

Query: 297 ------------AALTVDKYKLIKGTIYKGVW 316
                       AA+ V  YKLI G    G W
Sbjct: 388 PLYSNTWDTRVRAAIRVGDYKLITGDPGNGSW 419


>UniRef50_A7SRP2 Cluster: Predicted protein; n=2; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 491

 Score =  257 bits (629), Expect = 5e-67
 Identities = 125/293 (42%), Positives = 180/293 (61%), Gaps = 13/293 (4%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           +QHG+I+   P GLPLN  +LPQ L+  GY TH++GKWHLG Y  E  P  RGFD+  GF
Sbjct: 89  LQHGIIHNGRPYGLPLNLTLLPQKLRKAGYSTHMLGKWHLGFYNWESTPTYRGFDTFYGF 148

Query: 61  WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
           ++G  + Y H    Q  +  D R   E+  D  G Y+  ++T  A ++V +H+ S PLF+
Sbjct: 149 YSGAENHYTHV---QDHY-LDLRDNEEIVRDQNGTYSAHLFTKRAEQIVRAHDPSTPLFM 204

Query: 121 MLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGL 180
            +A   VHS     P++AP++ ID + +I D  R+ +AA+++ +D+++G + +A    GL
Sbjct: 205 YMAFQNVHS-----PVQAPKEYIDRYSFIKDPLRRTYAAMVTIMDDALGNLTRAFDKAGL 259

Query: 181 LENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAY 240
            EN+I++FSTDNGG       N   +YPL+G K+TLWEGGVRG  F+    L+       
Sbjct: 260 WENTILIFSTDNGGVPK----NGGYDYPLRGRKDTLWEGGVRGVAFVHGVALEQSGVKCK 315

Query: 241 QKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDI 293
             MH++DW PTL S AGG L   E+LDG + W+++S   ESPR  +LHNID I
Sbjct: 316 ALMHVTDWYPTLVSLAGGSLDEDEDLDGYDVWESISHGVESPRKELLHNIDTI 368


>UniRef50_UPI0000586CBA Cluster: PREDICTED: similar to arylsulfatase
           B; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
           similar to arylsulfatase B - Strongylocentrotus
           purpuratus
          Length = 596

 Score =  253 bits (620), Expect = 6e-66
 Identities = 132/302 (43%), Positives = 183/302 (60%), Gaps = 21/302 (6%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           +QH VI   +P  LPLNE  LPQ LK+ GY THLVGKWHLG YK E +PL RGFDS  G+
Sbjct: 163 LQHFVIIAPQPNCLPLNETTLPQKLKESGYATHLVGKWHLGFYKNECMPLQRGFDSSFGY 222

Query: 61  WTGRIDMYDHTTM-------EQGSW-GTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSH 112
            +G  D + H          E   W G DF     VA +  G Y+  V+T+ A +V+  H
Sbjct: 223 LSGMQDYWTHFRSGSFPGFPEGNHWLGIDFWDNNRVAWEYTGNYSQFVFTERAQRVIQQH 282

Query: 113 NKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVV 172
           N ++PLFL L   +VH      P++ P+K +  + +  D  RQ +A +++ +DE+VGKVV
Sbjct: 283 NPNQPLFLYLPLQSVHG-----PLQVPEKYMKPYAHFQDVGRQTYAGMVATMDEAVGKVV 337

Query: 173 KALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLL 232
            +L   GL  ++++VF+TDNGG        + +N+PL+G KNTLWEGGV G GF+  P++
Sbjct: 338 DSLQEAGLWNDTVLVFTTDNGGTPG----KSGNNWPLRGTKNTLWEGGVHGVGFITGPMI 393

Query: 233 DS--KARVAYQKMHISDWLPTLY-SAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHN 289
            +  +  V+   MHISDW PTL    AGG+ + L  LD  N W++++K T SPR  +LHN
Sbjct: 394 PAGVQGTVSKHFMHISDWFPTLIEGVAGGNTAGLA-LDSYNMWNSITKGTPSPRKELLHN 452

Query: 290 ID 291
           ID
Sbjct: 453 ID 454


>UniRef50_UPI0000E46777 Cluster: PREDICTED: similar to arylsulfatase
           J; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED:
           similar to arylsulfatase J - Strongylocentrotus
           purpuratus
          Length = 588

 Score =  223 bits (544), Expect = 1e-56
 Identities = 149/445 (33%), Positives = 227/445 (51%), Gaps = 36/445 (8%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           MQH  ++   P  LPL+E  L Q LK  GY TH VGKWHLG   K+ LP  RGF+S  G 
Sbjct: 162 MQHLNLFPGRPCCLPLDETTLAQALKKQGYSTHAVGKWHLGYAWKDCLPSRRGFESFFGN 221

Query: 61  WTGRIDMYDHTTM----EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE 116
             G  D + H       ++   G        +     G ++T +YT+ A +++    +++
Sbjct: 222 IMGSADHWSHNKTALFGDKLVMGKSMYYNERIYWKHEGTFSTTLYTNRARQLIRKQPRNK 281

Query: 117 PLFLMLAHSAVHSGNPYEPIRAPQKLIDAFK-YIDDSARQKFAAVLSKLDESVGKVVKAL 175
           PLFL L++ AVH+     P+  P++    ++  I +S R+++A +++ LDE+V  V +AL
Sbjct: 282 PLFLYLSYEAVHT-----PLNVPEQYAKPYEGIIHNSKRRRYAGLVNILDEAVRNVTEAL 336

Query: 176 HTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLL--D 233
              GL +NS+++F+TDNGG       +  +N+PL+G K+TLWEGG+RG GF+ SPL+  +
Sbjct: 337 KYNGLYDNSVIIFTTDNGGRPK--PRSVGNNWPLRGGKSTLWEGGIRGVGFVHSPLIPWE 394

Query: 234 SKARVAYQKMHISDWLPTLYSA-AGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDD 292
            +  V  Q +H+SDW PT+    AGG L   + LDG +QW  +SK TES R  +LHNID 
Sbjct: 395 LRGTVNRQLIHVSDWFPTIVXGIAGGKLVTNKPLDGXHQWKTISKGTESNRHEILHNIDP 454

Query: 293 IWGIAALTVDKYKLIKGTIYKGVWDNWYGPSGREGAYNASLLYDSHAGRILDKLNLMPPK 352
           I+  A  T +  +        G   N          +NA++      G    KL+   P 
Sbjct: 455 IYPAAHWTRENERDF------GALSNL--------PFNATMRASIRVGNW--KLSTGLPH 498

Query: 353 EKVMELRDEATVKCNDSIEVIQCKPRDAPCVFNIDEDPCERRNXXXXXXXXXXXXXXXMH 412
           E   E   E+ +      E+   +      ++NI +DP ER+N               + 
Sbjct: 499 EDFWEPPKESEM----PPEMNDIRWSTPVRLYNIKKDPNERQNMAPYQKKIVYRLLKRLQ 554

Query: 413 KLNVSAVAP-NAQPIDARGDPQYWG 436
               +AV P +  P D RG+P+Y G
Sbjct: 555 DYQNTAVTPIHLGPKDERGNPKYHG 579


>UniRef50_A7S8Q2 Cluster: Predicted protein; n=2; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 540

 Score =  223 bits (544), Expect = 1e-56
 Identities = 114/292 (39%), Positives = 172/292 (58%), Gaps = 16/292 (5%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           MQH VI    P G+P     +PQ L+ LGY+T ++GKWHLG +  +Y PL RGFDS +GF
Sbjct: 101 MQHFVINITSPWGMPRRFPTIPQKLRTLGYRTSMIGKWHLGFFDWDYTPLRRGFDSFLGF 160

Query: 61  WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
           + G  D + H+ M       DFRR  E A++  G ++TDV+T EAI +   HN S+PLFL
Sbjct: 161 FAGEQDHWRHSKM----GFLDFRRDEEPANEYGGQHSTDVFTQEAINIAMRHNASQPLFL 216

Query: 121 MLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGL 180
           +L+++AVH+     P++A    ++    + D  RQ +  ++   D S+G+++      GL
Sbjct: 217 LLSYAAVHT-----PLQAHPNDVNKIGGVSDKDRQNYLGMMGAADWSIGRLIDVYKRNGL 271

Query: 181 LENSIVVFSTDNGG-PAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVA 239
             N+++++++DNG  P  G       N+PL+G K++L+EGGVR   F+   +L  K    
Sbjct: 272 WNNTLMIWASDNGAQPGKG----GGYNWPLRGYKSSLFEGGVRVPAFVHGEMLQRKGGTV 327

Query: 240 YQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNID 291
               H++DW PTL   AGG+  V  ++DGV+QW  LS+   S R  +LHNID
Sbjct: 328 NDLFHVTDWYPTLVKLAGGE--VEPDIDGVDQWPTLSEGKPSKREEILHNID 377


>UniRef50_A7SPY2 Cluster: Predicted protein; n=4; Eumetazoa|Rep:
           Predicted protein - Nematostella vectensis
          Length = 270

 Score =  214 bits (522), Expect = 5e-54
 Identities = 94/198 (47%), Positives = 133/198 (67%), Gaps = 1/198 (0%)

Query: 3   HGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWT 62
           H  ++G +P GLPL E   PQY+K LGY TH +GKWHLG ++KEY P  RGFDS  GFW 
Sbjct: 73  HATVFGTQPYGLPLGETTTPQYMKSLGYVTHGIGKWHLGFFEKEYTPTYRGFDSFYGFWN 132

Query: 63  GRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLML 122
           G+ D +DH++ E   WGTD R   +   +  G Y T+++ + A ++++ HN+++PL+L L
Sbjct: 133 GKEDYWDHSSQED-VWGTDLRDNEKPVRNESGHYGTELFAERAAQIIHLHNQTKPLYLYL 191

Query: 123 AHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLE 182
           A   VHS N  EP++AP++LI  F +I    R+ +AA++S LDESV  V KAL   G+L 
Sbjct: 192 AQQGVHSANGNEPLQAPKRLIKKFSHISSPKRRIYAAMVSSLDESVETVHKALSETGMLN 251

Query: 183 NSIVVFSTDNGGPAAGFN 200
           N+++VF+TDNGG   GFN
Sbjct: 252 NTVLVFTTDNGGAPRGFN 269


>UniRef50_A3ZMN6 Cluster: Arylsulfatase B; n=1; Blastopirellula
           marina DSM 3645|Rep: Arylsulfatase B - Blastopirellula
           marina DSM 3645
          Length = 455

 Score =  197 bits (481), Expect = 4e-49
 Identities = 117/315 (37%), Positives = 173/315 (54%), Gaps = 21/315 (6%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           +Q GV+      GLPL+E+ L + L+D GY+T +VGKWHLG     YLP+ RGFD   G 
Sbjct: 93  LQVGVVRPWADYGLPLDERTLAEALQDAGYETAIVGKWHLGHVSPAYLPMARGFDHQYGH 152

Query: 61  WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
           + G +D + H        G D+ +   V  D    YAT +   EA++V+   +K +PLFL
Sbjct: 153 YNGALDYFTH----DRDGGHDWHKDDHVNRD--EGYATHLIAQEAVRVIQDRDKKKPLFL 206

Query: 121 MLAHSAVHSGNPYEPIRAPQKLIDAFKYID-DSARQKFAAVLSKLDESVGKVVKALHTRG 179
            +  +AVHS     P++ P+    A  Y D    RQ +A +++ LDE+VG++V  +  + 
Sbjct: 207 YVPFNAVHS-----PLQVPESY--AAPYGDMKKRRQAYAGMVAALDEAVGQIVDEIQRQE 259

Query: 180 LLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARV 238
           +L+N++ +FS+DNGGP  G       N PL+G K+TL+EGGVR   F  W   +   ++V
Sbjct: 260 MLDNTLFIFSSDNGGPEPG---KLTDNGPLRGGKHTLYEGGVRVCAFASWKGRIAPGSKV 316

Query: 239 AYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAA 298
               +HI DW PTL   AGG L   + LDG N W +++    SP   ++ NI    G  A
Sbjct: 317 E-APLHIVDWYPTLIELAGGSLQQAKPLDGRNIWPSITTGEPSPHDVIVCNITPTEG--A 373

Query: 299 LTVDKYKLIKGTIYK 313
           + V  +KL+   I K
Sbjct: 374 IRVGDWKLVVHNIGK 388


>UniRef50_A7IPG5 Cluster: Sulfatase precursor; n=1; Xanthobacter
           autotrophicus Py2|Rep: Sulfatase precursor -
           Xanthobacter sp. (strain Py2)
          Length = 491

 Score =  186 bits (453), Expect = 1e-45
 Identities = 113/316 (35%), Positives = 173/316 (54%), Gaps = 25/316 (7%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           +Q G I      GL  +E +LPQ LKD+GY+T LVGKWHLG   +++ P  RGFDS  G 
Sbjct: 113 LQVGAIPSGAKYGLATDEFLLPQALKDVGYRTALVGKWHLGHADQKFWPRQRGFDSFYGP 172

Query: 61  WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
             G ID + H       W  D  +  E  +D      T+++  EA++++ +H+   PLFL
Sbjct: 173 LVGEIDHFKHEAHGVTDWYHDNTQVKEEGYD------TELFGKEAVRLIAAHDPKTPLFL 226

Query: 121 MLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGL 180
            LA +A     P+ P +APQ  +D + +I    R+ +AA+++ +D+ +G VV AL +RG+
Sbjct: 227 YLAFTA-----PHTPFQAPQSYLDQYAHIAAPQRRAYAAMITAMDDQIGHVVAALTSRGM 281

Query: 181 LENSIVVFSTDNG--------GPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPL 231
            EN+++VF +DNG        G  A   D  ASN P +  K +L+EGG R      W   
Sbjct: 282 RENTLIVFHSDNGGTRSKMFAGEGAVAGDLPASNAPYRDGKGSLYEGGTRVVALANWPGR 341

Query: 232 LDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNID 291
           +   A  A   MH+ D LPTL   AG  L+  + LDGV+ W AL+   ++ R  +++N++
Sbjct: 342 IAPGA--AEGVMHVVDMLPTLAKLAGASLAKSKPLDGVDVWPALAAG-QAGRAGIVYNVE 398

Query: 292 DIWGIAALTVDKYKLI 307
              G  A+   ++KL+
Sbjct: 399 PTQG--AVRDGRWKLV 412


>UniRef50_Q18837 Cluster: Sulfatase domain protein protein 3,
           isoform a; n=2; Caenorhabditis elegans|Rep: Sulfatase
           domain protein protein 3, isoform a - Caenorhabditis
           elegans
          Length = 488

 Score =  179 bits (436), Expect = 1e-43
 Identities = 109/324 (33%), Positives = 174/324 (53%), Gaps = 25/324 (7%)

Query: 2   QHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFW 61
           Q+GV    EP G+P     L + ++ L Y T+LVGKWHLG  KKE+LP NRGFD   GF+
Sbjct: 98  QNGVFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYGFY 157

Query: 62  TGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLF-----------GVYATDVYTDEAIKVVN 110
             +   ++H+  +         +G ++  ++            GVY+TD++TD A+ V++
Sbjct: 158 GPQTGYFNHSADQYHRELKRVVKGLDLFEEVGSGKSVPDFSQNGVYSTDLFTDVAMSVLD 217

Query: 111 SHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAA--VLSKLDESV 168
           +HN S+P F+ L++ AVH   P   +    K I   K      R    +  +L+ +D ++
Sbjct: 218 NHNNSKPFFMFLSYQAVH---PPLQVSQQSKTIGQGKEATFILRSHAHSTRMLTAMDFAI 274

Query: 169 GKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLW 228
           G++V+ L    L EN+++VF++DNGG A    +  ASN PL+G K+T+WEGG +   F+ 
Sbjct: 275 GRLVEYLKASNLYENTVIVFTSDNGGTA----NFGASNAPLRGEKDTIWEGGTKTTTFVH 330

Query: 229 SPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDAL-SKNTESPRTSVL 287
           SP+   +        H+ DW  T+ S  G  L +    DG+NQW+ L +   +  R   +
Sbjct: 331 SPMYIEEGGTRDMMFHVVDWHATILSITG--LEIDSYGDGINQWEYLKTGRPKFRRFQFV 388

Query: 288 HNIDDIWGIAALTVDKYKLIKGTI 311
           +NID+    +A+    YKLI G +
Sbjct: 389 YNIDNHG--SAIRDGDYKLIVGNV 410


>UniRef50_UPI0000587D99 Cluster: PREDICTED: similar to arylsulfatase
           B; ARSB; n=1; Strongylocentrotus purpuratus|Rep:
           PREDICTED: similar to arylsulfatase B; ARSB -
           Strongylocentrotus purpuratus
          Length = 365

 Score =  163 bits (397), Expect = 7e-39
 Identities = 93/239 (38%), Positives = 138/239 (57%), Gaps = 16/239 (6%)

Query: 59  GFWTGRIDMYDHTTMEQGSW-GTDFRRGFE-VAHDLFGVYATDVYTDEAIKVVNSHNKSE 116
           GF+T +        ++  +W G D R   E VA D  GVY+T ++T ++  ++  HN+S+
Sbjct: 15  GFYTHKHYGGHPGLVDSKNWSGYDLRDNLEQVAQDYQGVYSTHLFTQKSQNIIRRHNRSK 74

Query: 117 PLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALH 176
           PLFL  +  AVH      P+  P + ++ F YI D  R+ +A ++  +DE+VG + + L 
Sbjct: 75  PLFLYHSFQAVHY-----PLEVPPRYMEDFNYIADERRRTYAGMVKCMDEAVGNLTRTLK 129

Query: 177 TRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKA 236
             GL  N+I++FS+DNG   A FN    SN+PL+G+K +LWEGG++  GF+ SPLL    
Sbjct: 130 KTGLWNNTIIIFSSDNG---ANFN-YGGSNWPLRGMKRSLWEGGIKSVGFIASPLLPKLV 185

Query: 237 R--VAYQKMHISDWLPTLY-SAAGGDLSVLENLDGVNQWDALSKNTES-PRTSVLHNID 291
           R  V     H++DW PTL    A G L    +LDG N W  L++  +S PR  +LHNID
Sbjct: 186 RGTVNNNLFHVTDWFPTLVRGVARGSLKG-THLDGHNLWKHLTRGKDSWPRKEILHNID 243


>UniRef50_UPI0000E48607 Cluster: PREDICTED: similar to arylsulfatase
           B; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED:
           similar to arylsulfatase B - Strongylocentrotus
           purpuratus
          Length = 531

 Score =  163 bits (395), Expect = 1e-38
 Identities = 84/194 (43%), Positives = 120/194 (61%), Gaps = 9/194 (4%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           +Q+GVI  A+P  LPL+E  LPQ LK+  Y TH+VGKWH+G YK    P  RGFDS+ G+
Sbjct: 97  LQYGVIRPAQPHCLPLDEVTLPQKLKERDYATHMVGKWHIGFYKDACTPTERGFDSYFGY 156

Query: 61  WTGRIDMYDHT-TMEQGS---WGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE 116
            +G  D Y H+ + + GS    G D       A    G Y+T ++T +AI V+N+H +S+
Sbjct: 157 LSGAEDYYSHSRSFQIGSKTLKGLDLMANKTPAFQYKGQYSTHLFTSKAIDVINNHERSK 216

Query: 117 PLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALH 176
           PLFL LA+ AVHS     P++ P K  + +  I  SAR+ +A ++S +DE +G V +AL 
Sbjct: 217 PLFLYLAYQAVHS-----PLQVPSKYEEPYANITSSARRAYAGMVSCMDEGIGNVTRALV 271

Query: 177 TRGLLENSIVVFST 190
             GL  N+I++FST
Sbjct: 272 DAGLYNNTIIIFST 285


>UniRef50_UPI0000F20AE2 Cluster: PREDICTED: similar to Arylsulfatase
           B precursor (ASB) (N-acetylgalactosamine-4-sulfatase)
           (G4S), partial; n=1; Danio rerio|Rep: PREDICTED: similar
           to Arylsulfatase B precursor (ASB)
           (N-acetylgalactosamine-4-sulfatase) (G4S), partial -
           Danio rerio
          Length = 373

 Score =  156 bits (379), Expect = 1e-36
 Identities = 74/196 (37%), Positives = 118/196 (60%), Gaps = 11/196 (5%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           +QH +I+  +P  +PL+EK+LPQ L++ GY TH+VGKWHLG ++K+ LP +RGF S  G+
Sbjct: 183 LQHQIIWPCQPYCVPLDEKLLPQVLRERGYHTHMVGKWHLGMFQKDCLPTHRGFQSFFGY 242

Query: 61  WTGRIDMYDH------TTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNK 114
            TG  D Y H        +       D R G  VA +  G Y+T++ T+ A  ++  H  
Sbjct: 243 LTGSEDYYTHKRCSLIAPLNVTRCALDLRDGDAVALNYSGRYSTELLTERATHIITQHTP 302

Query: 115 SEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKA 174
            +PLFL +A  AVH+     P++ P + I  + +I D  R+++A ++S +DE+VG +   
Sbjct: 303 DQPLFLYVALQAVHA-----PLQVPDRYIAPYSFIQDPHRRRYAGMVSAMDEAVGNITHT 357

Query: 175 LHTRGLLENSIVVFST 190
           L   GL +N++++FST
Sbjct: 358 LQETGLWDNTVLIFST 373


>UniRef50_A6DKP3 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
           Lentisphaera araneosa HTCC2155|Rep:
           N-acetylgalactosamine 6-sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 465

 Score =  150 bits (363), Expect = 9e-35
 Identities = 91/264 (34%), Positives = 143/264 (54%), Gaps = 21/264 (7%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMY---- 68
           GLPL++K++P+ L   GY T +VGKWH G   K + P NRGF    GF  G I+ +    
Sbjct: 106 GLPLSQKLIPEILVKEGYATGMVGKWHDGDQHK-FWPYNRGFQEFYGFNNGAINNWVLKG 164

Query: 69  -DHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAV 127
            +HT  E   WG   R    V +   G Y T+ +  EA++ ++ H K+EP FL L+ +AV
Sbjct: 165 ENHTVDE---WGAVHRENKRVENS--GEYMTEAFGREAVEFIDRH-KTEPFFLYLSFNAV 218

Query: 128 HSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVV 187
           H      P++AP+   + FK+I    R    A+L  +D+++G V++ L   GL EN+I+ 
Sbjct: 219 HG-----PLQAPKSYTNQFKHIKPENRALCLAMLKSMDDNIGLVLEKLRKEGLEENTIIF 273

Query: 188 FSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVR-GAGFLWSPLLDSKARVAYQKMHIS 246
           F++DNGG   G   N + N   +G KNT+++GG+       W   + ++ +     +H  
Sbjct: 274 FTSDNGGKLKG---NYSFNGKYRGEKNTVFDGGLHVPYAVQWKAQIPAQTKALEAPVHSI 330

Query: 247 DWLPTLYSAAGGDLSVLENLDGVN 270
           D   T+++AAG ++     LDG N
Sbjct: 331 DLAHTIFAAAGVEIKDEYKLDGRN 354


>UniRef50_A6DSH3 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
           Lentisphaera araneosa HTCC2155|Rep:
           N-acetylgalactosamine 6-sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 455

 Score =  149 bits (362), Expect = 1e-34
 Identities = 105/297 (35%), Positives = 152/297 (51%), Gaps = 22/297 (7%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFW-TGRIDMYDHT 71
           G  LN K +P YLK+ GYK+   GKWHLG ++ +Y PL+RGFD   GF   G  D +   
Sbjct: 102 GTDLNAKFIPNYLKEAGYKSMAFGKWHLG-HEMKYHPLHRGFDDFYGFMGRGAHDFFRLE 160

Query: 72  TMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGN 131
               G +G    RG E   D    Y T   T+E +K +   NK +P F  +A++AVH+  
Sbjct: 161 KEYDGKFGGPIYRGLEPIDD--KGYLTTRITEETVKFI-EENKDKPFFAYVAYNAVHT-- 215

Query: 132 PYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
              P +AP + I A     D  R    A+L  LD  VG++VK L    + EN+I+++ +D
Sbjct: 216 ---PAQAPAEDIKAVS--GDETRDILVAMLKHLDLGVGEIVKTLKKHDIYENTIIIYLSD 270

Query: 192 NGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHIS-DWLP 250
           NGG         A+N PL+GVK+ +++GG+R   FL S     KA    Q   IS D LP
Sbjct: 271 NGGA----KSMVANNKPLRGVKHDIYDGGIR-VPFLMSWPAQIKAGQDTQSPVISLDILP 325

Query: 251 TLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALTVDKYKLI 307
           TL  AAG  L  L ++DG +    +  + ++       N  D  G   + ++ +KL+
Sbjct: 326 TLLDAAG--LPALSDIDGESMLPVIRGDKDNLDRPFFWNHGD--GQTGIQLNNWKLV 378


>UniRef50_UPI0000660330 Cluster: Arylsulfatase I precursor (EC
           3.1.6.-) (ASI).; n=1; Takifugu rubripes|Rep:
           Arylsulfatase I precursor (EC 3.1.6.-) (ASI). - Takifugu
           rubripes
          Length = 620

 Score =  149 bits (360), Expect = 2e-34
 Identities = 80/193 (41%), Positives = 117/193 (60%), Gaps = 12/193 (6%)

Query: 76  GSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHSGNPYE 134
           G  G D   G  VA    G Y+T ++T  A K++ SHN +E PLFL+L+  AVH+     
Sbjct: 200 GVCGYDLHDGEGVAWGQEGKYSTTLFTQRARKILESHNPTEKPLFLLLSLQAVHT----- 254

Query: 135 PIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGG 194
           P++ P+  I  ++ + + AR+K AA++S +DE+V  V  AL   G   NS++++STDNG 
Sbjct: 255 PLQTPKSYIYPYRDMANIARRKLAAMVSTVDEAVRNVTYALRKYGFYRNSVIIYSTDNGA 314

Query: 195 -PAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLY 253
            P  G      SN+PL+G K T WEGG+RG  F+ SPLL  + RV+   +HI+DW PTL 
Sbjct: 315 QPFTG-----GSNWPLRGRKGTYWEGGIRGVAFVHSPLLKRRRRVSKALLHITDWFPTLV 369

Query: 254 SAAGGDLSVLENL 266
             AGG++S +  +
Sbjct: 370 GLAGGNISQVSGM 382


>UniRef50_A3HWU7 Cluster: N-acetylgalactosamine 6-sulfatase; n=2;
           Bacteria|Rep: N-acetylgalactosamine 6-sulfatase -
           Algoriphagus sp. PR1
          Length = 472

 Score =  143 bits (346), Expect = 1e-32
 Identities = 83/258 (32%), Positives = 143/258 (55%), Gaps = 13/258 (5%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           G+PL++K +  +L  LGY   L+GKWHLG  + ++ PL RGFD   G+  G  D ++   
Sbjct: 118 GMPLSQKTIADHLNKLGYVNGLIGKWHLGK-EPQFHPLKRGFDEFWGYTGGGHDYFESLP 176

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
             +G +       F+    +   Y TD   +E++  +  H K EP FL  A +A     P
Sbjct: 177 NGKG-YKEPLESNFKTPDPI--TYITDDVGNESVDFIERH-KDEPFFLFAAFNA-----P 227

Query: 133 YEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDN 192
           + P++A ++ +  +++I+D  R+ +AA++ +LD +VGK++ +L  +GL EN++VVF +DN
Sbjct: 228 HTPMQALEEDLALYQHIEDKKRRTYAAMVHRLDLNVGKIMTSLEEQGLSENTLVVFFSDN 287

Query: 193 GGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTL 252
           GGP    + NA+ N P +G K  L EGG+     +  P L  +  +  +++   D +PT 
Sbjct: 288 GGPT---DSNASLNAPYRGQKGILLEGGIHVPFVMNLPGLLPEGLIYQEQVTSLDVVPTF 344

Query: 253 YSAAGGDLSVLENLDGVN 270
            + AG   + ++   GV+
Sbjct: 345 LALAGDTETSMDMFSGVD 362


>UniRef50_Q4SNM7 Cluster: Chromosome 15 SCAF14542, whole genome
           shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
           Chromosome 15 SCAF14542, whole genome shotgun sequence -
           Tetraodon nigroviridis (Green puffer)
          Length = 650

 Score =  142 bits (343), Expect = 2e-32
 Identities = 77/191 (40%), Positives = 116/191 (60%), Gaps = 12/191 (6%)

Query: 76  GSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHSGNPYE 134
           G  G D   G  V     G Y+T ++T  A +++ SH+ +E PLFL+L+  AVH+     
Sbjct: 198 GVCGYDLHDGEGVVWGQEGKYSTALFTRRARQILESHDPAERPLFLLLSLQAVHT----- 252

Query: 135 PIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGG 194
           P++ P+  I  ++ + + AR+K AA++S +DE+V  V  AL   G   NS++++STDNG 
Sbjct: 253 PLQTPKSYIYPYRDMTNVARRKLAAMVSTVDEAVRNVTYALRKYGYYRNSVIIYSTDNGA 312

Query: 195 -PAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLY 253
            P  G      SN+PL+G K T WEGG+RG  F+ SPLL  + RV+   +HI+DW PTL 
Sbjct: 313 QPFTG-----GSNWPLRGRKGTYWEGGIRGVAFVHSPLLRRRRRVSKALLHITDWFPTLV 367

Query: 254 SAAGGDLSVLE 264
             AGG++S ++
Sbjct: 368 GLAGGNVSQIQ 378


>UniRef50_Q15XG7 Cluster: Sulfatase precursor; n=2; Bacteria|Rep:
           Sulfatase precursor - Pseudoalteromonas atlantica
           (strain T6c / BAA-1087)
          Length = 471

 Score =  141 bits (342), Expect = 3e-32
 Identities = 100/291 (34%), Positives = 153/291 (52%), Gaps = 20/291 (6%)

Query: 2   QHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFW 61
           +H  I GAE  G+PL+E  +  Y+K LGY+T   GKWHLG    E  P++RGFD   GF 
Sbjct: 104 EHSAIKGAE-MGIPLDEVTMGDYMKSLGYRTAFYGKWHLGG-TDELHPMHRGFDEFYGFR 161

Query: 62  TGRIDM--YDHTTMEQGSWG-TD--FRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE 116
            G      Y+    E+ S   TD     G +   +  G Y TDV  ++A + +      +
Sbjct: 162 GGDRSYWAYEVNAPERKSAVFTDKKLEHGIDQFQEHEG-YLTDVLAEKANQFIEKA-PDK 219

Query: 117 PLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALH 176
           P F+ L+ +AVH+     P+ A  + +  F  +    R++ AA+   LD + G V+  L 
Sbjct: 220 PFFIFLSFNAVHT-----PMEATPEDLAKFPQLKGK-RKEVAAMTLALDRASGAVLNKLK 273

Query: 177 TRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKA 236
             GL ++++VVFS DNGGP    + NA+SNYPL G K+   EGG+R    +  P   +  
Sbjct: 274 ELGLEDDTLVVFSNDNGGPT---DKNASSNYPLAGTKSNFLEGGIRVPFLVKWPAKLAAG 330

Query: 237 RVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALS-KNTESPRTSV 286
           +V  + +   D LPT + A GG+  V+  LDGV+    ++ +N ++P  S+
Sbjct: 331 KVYDKPVSTLDLLPTFFKAGGGE-EVMSELDGVDLMPYITGQNNKAPHESM 380


>UniRef50_A6DKC9 Cluster: Sulfatase; n=1; Lentisphaera araneosa
           HTCC2155|Rep: Sulfatase - Lentisphaera araneosa HTCC2155
          Length = 454

 Score =  138 bits (335), Expect = 2e-31
 Identities = 86/248 (34%), Positives = 132/248 (53%), Gaps = 19/248 (7%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGS---YKKEYLPLNRGFDSHVGFWTGRIDMYD 69
           G+P   K L QY ++ GY T L GKWHLG    + K  +P +RGFD   G   G   +YD
Sbjct: 102 GMPREVKTLAQYFQEAGYATGLFGKWHLGGERLFDKTLMPTSRGFDEFFGILEGA-SLYD 160

Query: 70  HTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHS 129
            T   +  +    R+  +   D  G Y TD    EA+  + +    +P FL L  +AVH+
Sbjct: 161 DTVNRERKY---IRQ--DTVIDYEGEYFTDAIGREAVSFI-TRKGDKPFFLYLPFTAVHA 214

Query: 130 GNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFS 189
                P++A +K +  F +I D  R+ FAA+LS +D+++G+V  AL  +G+L+N+++VF 
Sbjct: 215 -----PMQASEKYMQRFAHIADPNRRVFAAMLSAMDDNIGRVFDALEHQGILDNTLIVFW 269

Query: 190 TDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVR-GAGFLWSPLLDSKARVAYQKMHISDW 248
           +DNGG     ++N + N+PLKG K   +EGG+R  A   W        +   Q + + D 
Sbjct: 270 SDNGGKP---DNNYSLNHPLKGQKTQFYEGGIRVPACVRWPKGQIPAGKTLDQPVFLMDI 326

Query: 249 LPTLYSAA 256
            P+   AA
Sbjct: 327 FPSALEAA 334


>UniRef50_Q7UGD7 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
           Pirellula sp.|Rep: N-acetylgalactosamine 6-sulfatase -
           Rhodopirellula baltica
          Length = 543

 Score =  136 bits (328), Expect = 1e-30
 Identities = 88/274 (32%), Positives = 135/274 (49%), Gaps = 14/274 (5%)

Query: 7   YGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRID 66
           +G +  G+PL+E  L   LK+ GY T  +GKWHLG   K + P  RGFD   GF  G   
Sbjct: 122 HGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLGD-AKPFWPNRRGFDEWFGFSGGGFS 180

Query: 67  MYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSA 126
            +    M+    G    RG E        + TD ++ EA+K +  H ++EP FL LA++A
Sbjct: 181 YWGDLGMKDPLLGV--HRGDEPVDPKTLTHLTDDFSTEAVKFIQRH-ETEPFFLYLAYNA 237

Query: 127 VHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIV 186
                P+ P  A +  +    +I+   R  + A+++ +DE +G+VV  +   GL EN+++
Sbjct: 238 -----PHAPDHATRAHLQKTAHIEYGGRAVYGAMVAGMDEGIGRVVDQIRESGLGENTMI 292

Query: 187 VFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHIS 246
           +F +DNGG         A N+P +G K  L+EGG+R    +  P            +   
Sbjct: 293 IFYSDNGG-----RREHAVNFPYRGHKGMLFEGGIRVPFLVSWPGTVRSGMKEESPITAL 347

Query: 247 DWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTE 280
           D  PT  +AAG D S  + LDG N    L+ + +
Sbjct: 348 DLFPTALAAAGMDPSQNDKLDGQNLLPVLTDDKQ 381


>UniRef50_A4CMB0 Cluster: Arylsulfatase A; n=5; Bacteria|Rep:
           Arylsulfatase A - Robiginitalea biformata HTCC2501
          Length = 492

 Score =  134 bits (323), Expect = 6e-30
 Identities = 99/302 (32%), Positives = 143/302 (47%), Gaps = 32/302 (10%)

Query: 4   GVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF--- 60
           GV +     G+P +E  L + LK  GY T +VGKWHLG +K+EYLP N GFD + G    
Sbjct: 122 GVFFPDSHNGMPASEITLAEQLKKAGYATGMVGKWHLG-HKEEYLPPNHGFDDYFGIPYS 180

Query: 61  ----WTGRIDMYD---------HTTMEQGSWGTDFRRGFE-VAHDLFGVYATDVYTDEAI 106
               +TG+   Y          + +++   +     RG E +   +     T  Y DEA+
Sbjct: 181 NDMDFTGQFTSYQDYFGRYTERYESLKTEEYNVPLIRGTEEIERPVNQNTITKRYNDEAV 240

Query: 107 KVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDE 166
           K +  H K EP F+ LAHS  H          P    D F+    SAR  +  V+ ++D 
Sbjct: 241 KWIREH-KDEPFFMYLAHSLPH---------VPLFTSDEFR--GTSARGLYGDVVEEIDH 288

Query: 167 SVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGF 226
            VG++++ L   GL EN+IVVF++DN GP      +  S   L+  K T WEGG+R    
Sbjct: 289 GVGQIMELLEAEGLAENTIVVFTSDN-GPWLPTGISGGSAGLLREGKGTTWEGGMREPTI 347

Query: 227 LWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSV 286
            W+P +   A+V        D   T  S AG  +     +DGV+    L  + ESPR  +
Sbjct: 348 FWAPGM-LPAKVVMDMGSTLDLFNTFSSLAGVPMPDDREMDGVDLSPILFGDAESPRKEM 406

Query: 287 LH 288
            +
Sbjct: 407 FY 408


>UniRef50_A6DLE2 Cluster: Sulfatase; n=1; Lentisphaera araneosa
           HTCC2155|Rep: Sulfatase - Lentisphaera araneosa HTCC2155
          Length = 441

 Score =  133 bits (322), Expect = 8e-30
 Identities = 87/269 (32%), Positives = 139/269 (51%), Gaps = 14/269 (5%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           GLP+ E  L   LK+LGY TH +GKWHLG     + P  RGFD+  GF +G    +    
Sbjct: 105 GLPVTEITLADSLKELGYSTHCIGKWHLGE-ADHFHPNARGFDNFYGFLSGARTYFLGGE 163

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
           + +G       R  E A    G Y T+V+T EAI+++    + +P F+ L+H+AVH    
Sbjct: 164 L-RGDMDR-IMRNKEFAEPSSG-YTTEVFTQEAIRII-QEEQDKPFFIYLSHNAVHG--- 216

Query: 133 YEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDN 192
             P+ A  + I ++ +  +  R+K++ ++  LD+  G +++AL      EN+++ F +DN
Sbjct: 217 --PMDAKDEDIMSYDF-KNPLRKKYSGLMKNLDDQTGLLLQALKDSKQYENTLIFFMSDN 273

Query: 193 GGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTL 252
           GGP      N +SN+PL+G K + +EGG R    L  P   S    + + +   D   T 
Sbjct: 274 GGPT---THNGSSNWPLRGFKGSEFEGGNRTPFLLQWPEKISAGLSSDKPIIAYDVFATC 330

Query: 253 YSAAGGDLSVLENLDGVNQWDALSKNTES 281
             AAGG+L       G++    ++K  E+
Sbjct: 331 IQAAGGELVTDRTYHGIDLLPVINKPQET 359


>UniRef50_Q7UYW3 Cluster: Arylsulfatase B; n=1; Pirellula sp.|Rep:
           Arylsulfatase B - Rhodopirellula baltica
          Length = 520

 Score =  132 bits (319), Expect = 2e-29
 Identities = 89/259 (34%), Positives = 132/259 (50%), Gaps = 23/259 (8%)

Query: 7   YGAEPR--GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGR 64
           Y   P   GLP +EK L  +L   GY T L+GKWHLG  +  + P  RGFD   G  TG 
Sbjct: 133 YATRPELLGLPTSEKTLADHLGAAGYATALIGKWHLGMGEMHH-PNRRGFDHFCGMLTGS 191

Query: 65  IDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSH---NKSEPLFLM 121
              Y   TM+         R  +   D    Y TD +TDE ++ ++ H   N  +P F+ 
Sbjct: 192 -HHYFPATMKHV-----IERNGKRVDDFSSEYLTDFFTDEGLRFIDQHKSANPDQPWFVF 245

Query: 122 LAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLL 181
            +++A     P+ P+ A +  +  F  I +  R+ +AA++  LD  VG++ + L   G  
Sbjct: 246 FSYNA-----PHTPMHATEADLARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQW 300

Query: 182 ENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQ 241
           EN+++VF +DNGG      +N + N PL+GVK ++ EGG+R    +W+      A V Y 
Sbjct: 301 ENTLLVFFSDNGGA----TNNGSWNGPLRGVKGSMREGGIR-VPMIWTWPAKFPAGVLYD 355

Query: 242 KMHIS-DWLPTLYSAAGGD 259
            +  S D LPT  SAAG +
Sbjct: 356 GVVSSLDLLPTFCSAAGAE 374


>UniRef50_A0IXQ0 Cluster: Sulfatase; n=1; Shewanella woodyi ATCC
           51908|Rep: Sulfatase - Shewanella woodyi ATCC 51908
          Length = 379

 Score =  132 bits (318), Expect = 2e-29
 Identities = 97/279 (34%), Positives = 134/279 (48%), Gaps = 28/279 (10%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHL--GSYKKEYL----PLNRGFDSHVGFWTGRID 66
           GLP+ E +L    +  GY+T  VGKWHL  G  K  Y     PL+RGFD   GF      
Sbjct: 16  GLPVEENVLANNFRKAGYRTGAVGKWHLTKGEKKASYTLAQHPLDRGFDFFFGFDRSGTP 75

Query: 67  MYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSA 126
            YD   +E        R+  +        Y TD  T+ AI  +N  +KS+P FL +A++A
Sbjct: 76  YYDSKILELN------RKPVKAEG-----YLTDQLTNHAIDFINQ-DKSKPFFLYMAYNA 123

Query: 127 VHSG-NPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSI 185
           VH   N   P           +Y+D      F + L  LD+ V K++K L + G L+N+I
Sbjct: 124 VHGPLNKAAPKEYQAPFNSGDRYLD-----YFYSYLYALDQGVAKIIKQLDSNGQLDNTI 178

Query: 186 VVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSP-LLDSKARVAYQKMH 244
           ++F +DNG P  G      +N P  G K  +W+GG R    +W P  L +  RV    + 
Sbjct: 179 IMFLSDNGAP-GGKPFPLPANAPFTGYKGQVWQGGTRVPVVIWGPKALVNGGRVDDAVIS 237

Query: 245 ISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPR 283
             D +PT  +AAG DLS  +NLDG N    L +  E  R
Sbjct: 238 SMDLIPTALAAAGVDLS--DNLDGNNLLPKLKRVEEDER 274


>UniRef50_A6CAY0 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
           n=1; Planctomyces maris DSM 8797|Rep:
           N-acetylgalactosamine 6-sulfate sulfatase - Planctomyces
           maris DSM 8797
          Length = 466

 Score =  130 bits (313), Expect = 1e-28
 Identities = 102/308 (33%), Positives = 154/308 (50%), Gaps = 25/308 (8%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           GL  +E ++P+YLK  GY+T   GKW++G +     P  RGFD   GF  G ID Y H  
Sbjct: 114 GLRKSEVLIPEYLKQQGYRTACFGKWNVG-FSPGSRPTERGFDEFFGFAAGNIDYYHHYY 172

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVH--SG 130
             +     D  RG +    + G Y+TD++ D A + +++ +  +P F+ L  +A H  S 
Sbjct: 173 AGRH----DLWRGLKEVF-VEG-YSTDLFADAACQYISAES-DQPFFIYLPFNAPHFPSQ 225

Query: 131 NPYEP-----IRAPQKLIDAFKYIDDSA--RQKFAAVLSKLDESVGKVVKALHTRGLLEN 183
              +P      +AP    + + Y   +   ++++ AV++ LD ++G+V+K L T GL + 
Sbjct: 226 RNKQPGQGNEWQAPDLAFEKYGYDPQTKNPQERYRAVVTALDSAIGRVLKQLDTSGLRDQ 285

Query: 184 SIVVFSTDNGGPAAGFND-NAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQK 242
           +IV++ +DNG           ASN PL+    TLWEGG+R    +  P    KA    Q 
Sbjct: 286 TIVIWYSDNGAFMLKERGLEVASNKPLRDGGVTLWEGGIRVPAIIRYP-GHLKAGTVNQS 344

Query: 243 MHIS-DWLPTLYSAAGGDLSVLENLDGVNQWDALSKNT-ESPRTSVLHNIDDIWGIAALT 300
             IS D LPTL + AGG L     LDG +   AL+  T   PRT            +A+ 
Sbjct: 345 PLISLDILPTLITLAGGPLPAERILDGQDMLPALAAQTAPEPRTFFF----QYRNFSAVR 400

Query: 301 VDKYKLIK 308
             KYKL++
Sbjct: 401 RGKYKLVR 408


>UniRef50_A3XJJ9 Cluster: Arylsulfatase B; n=1; Leeuwenhoekiella
           blandensis MED217|Rep: Arylsulfatase B -
           Leeuwenhoekiella blandensis MED217
          Length = 461

 Score =  130 bits (313), Expect = 1e-28
 Identities = 86/277 (31%), Positives = 144/277 (51%), Gaps = 26/277 (9%)

Query: 6   IYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRI 65
           I G     LP +   LPQ L  L YKT L+GKWHLG  K E  P   GFD   GF  G++
Sbjct: 107 ISGRSELNLPDSITTLPQALSKLNYKTALMGKWHLG-LKPESGPEVYGFDFSYGFLHGQL 165

Query: 66  DMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHS 125
           D Y HT     S  T +R G  ++      + TD+ T  A+  +++    +  +L +A+S
Sbjct: 166 DQYAHTYKNGDS--TWYRNGKFISEK---GHVTDLLTQSAVHYIDTLQTDQNFYLQVAYS 220

Query: 126 AVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSI 185
           A     P+ P++ PQ+ ++ +  I DS+R+ +AA ++ +D  +G++++ L  + L +N++
Sbjct: 221 A-----PHIPLQEPQEWLEKYTGIKDSSRRAYAAAMTHMDAGIGEILQKLKDKDLEKNTV 275

Query: 186 VVFSTDNGGPAA-----------GFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLD 233
           V+F +DNG               G N +  SN PL+  K + +EG +R    + W   L+
Sbjct: 276 VLFVSDNGAQEKWVPNTQYDGKYGPNYSLGSNLPLRDFKTSNYEGALRVPAIISWPENLN 335

Query: 234 SKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
           S     Y  ++++DW+PT  + A  +  +   ++GVN
Sbjct: 336 SGTSTNY--INVTDWMPTFLNWANAE-ELPSTVEGVN 369


>UniRef50_A0Z632 Cluster: Arylsulfatase B; n=1; marine gamma
           proteobacterium HTCC2080|Rep: Arylsulfatase B - marine
           gamma proteobacterium HTCC2080
          Length = 545

 Score =  130 bits (313), Expect = 1e-28
 Identities = 94/303 (31%), Positives = 148/303 (48%), Gaps = 35/303 (11%)

Query: 3   HGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWT 62
           +GVI+  +  G+  +E  +P+  +  GY+T ++GKWHLG  +  Y P NRGF+   G   
Sbjct: 99  YGVIFPWDNIGVHPDEHFMPETFQAAGYQTAIIGKWHLGHAQMTYHPNNRGFEHFYGHLH 158

Query: 63  GRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLML 122
             +  Y   +  QG  G DF+R   V+ D  G Y T +  DE  + +   ++  P  + +
Sbjct: 159 TEVGFYPPFS-NQG--GKDFQRN-GVSIDDQG-YETYLLADEVSRYIRERDRDRPFLVYM 213

Query: 123 AHSAVHSGNPYEPIRAPQKLIDAFKYI------------DD-----------SARQKFAA 159
              A     P+ P+ AP +L D +K I            DD           SAR  +AA
Sbjct: 214 PFIA-----PHTPLDAPVELQDKYKDIETDLPMARSRQTDDTRLISRVMLQPSARPMYAA 268

Query: 160 VLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEG 219
           V+  +D+++G+V+  L   G+ +N+IV+F +DNGG  A ++   A+N PL+G K   +EG
Sbjct: 269 VVDAMDQAIGRVLDTLDQEGISDNTIVLFFSDNGG--AAYSYGGANNAPLRGGKGETFEG 326

Query: 220 GVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNT 279
           G+R    +  P +    ++  Q M + D  PTL  AA         LDG + W AL    
Sbjct: 327 GIRVTSLMRWPAMLEPGQIFEQIMSVMDVFPTLVDAADVRPGNNFALDGRSMWTALKSGD 386

Query: 280 ESP 282
           + P
Sbjct: 387 QVP 389


>UniRef50_A3ZLN5 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
           n=1; Blastopirellula marina DSM 3645|Rep:
           N-acetylgalactosamine 6-sulfate sulfatase -
           Blastopirellula marina DSM 3645
          Length = 468

 Score =  129 bits (311), Expect = 2e-28
 Identities = 99/313 (31%), Positives = 149/313 (47%), Gaps = 24/313 (7%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           G  L E  L   LK  GY + + GKW  G  K+ YLPL RGFD + GF    +D + H  
Sbjct: 127 GTDLQEVFLADVLKQAGYVSAVFGKWDGGQLKR-YLPLQRGFDQYYGFANTGVDYFTH-- 183

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
            E+    + FR       D  G Y TD++  EAI+ ++  N   P FL L  +A HS + 
Sbjct: 184 -ERYGVPSMFRDNQPTEEDK-GTYLTDLFEREAIRFIDE-NHDRPFFLYLPFNAPHSASN 240

Query: 133 YE-PIR----APQKLIDAF---KYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENS 184
            +  IR    APQ+ +D F   +   +  RQ + A + ++DE++GKVV  L    + +N+
Sbjct: 241 LDRSIRGFAQAPQEYLDHFPGGESKQEKRRQAYLAAVERMDEAIGKVVDQLQQHQIADNT 300

Query: 185 IVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMH 244
           +++F +DNGG         A N PL+G K  ++EGG R    +  P      +V+ Q + 
Sbjct: 301 LIIFLSDNGG------GGGADNSPLRGGKAKMFEGGNRVPCIVHWPGKVPAGKVSNQFLT 354

Query: 245 ISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALTVDKY 304
             +  PT+ +A GG L      DG +    L+    SPR  +        G  A  V  +
Sbjct: 355 SLEVFPTVIAAIGGKLPDDVIYDGFDMLPVLN-GASSPREEMFWKRR---GDVAARVGDW 410

Query: 305 KLIKGTIYKGVWD 317
           K +     KG++D
Sbjct: 411 KWVDSAAGKGLFD 423


>UniRef50_A0JAA8 Cluster: Sulfatase precursor; n=1; Shewanella
           woodyi ATCC 51908|Rep: Sulfatase precursor - Shewanella
           woodyi ATCC 51908
          Length = 548

 Score =  128 bits (308), Expect = 4e-28
 Identities = 89/273 (32%), Positives = 139/273 (50%), Gaps = 25/273 (9%)

Query: 12  RGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDH- 70
           RGLP +E ++P+ LK+ GY T  +GKWHLG    E +P  +GFD  +   +G     DH 
Sbjct: 172 RGLPGSEILIPEILKESGYHTMHIGKWHLGR-SPEMMPNAQGFDESLMMDSGLYLPVDHP 230

Query: 71  ---------TTMEQGSWGTDFRRGFEVAHDLF--GVYATDVYTDEAIKVVNSHNKSEPLF 119
                    + +++  W T          ++F    Y TD +T+EA K + + N + P F
Sbjct: 231 ESVNAPVESSGLDRFIWATMRYSVNWNGGEIFKPNGYLTDYFTEEAEKAIEA-NANRPFF 289

Query: 120 LMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRG 179
           L LAH       P+ P++A +   +A   I    ++ +AA+L  +D SV +V+  L  +G
Sbjct: 290 LYLAH-----WGPHNPVQAKRADYEAVGDIQPHNKRVYAAMLRSIDRSVERVMAKLEKQG 344

Query: 180 LLENSIVVFSTDNGG-PAAGFNDNAASNYPLKGVKNTLWEGGVR-GAGFLWSPLLDSKAR 237
           + +N+IV+ S+DNGG      ND    N P +G KNT +EGG+R      W  ++D    
Sbjct: 345 IADNTIVILSSDNGGADYVAIND---LNKPYRGWKNTFFEGGIRVPFSVTWPNVIDESTV 401

Query: 238 VAYQKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
           +     HI D +PT+ + A  DL     +DGV+
Sbjct: 402 IEEPVNHI-DLMPTIINMANADLPQDREIDGVD 433


>UniRef50_Q8A219 Cluster: Arylsulfatase B; n=2; Bacteroides|Rep:
           Arylsulfatase B - Bacteroides thetaiotaomicron
          Length = 458

 Score =  127 bits (306), Expect = 7e-28
 Identities = 86/268 (32%), Positives = 137/268 (51%), Gaps = 27/268 (10%)

Query: 13  GLPLNEKILPQYLKDLGYKTH-LVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHT 71
           GL  NE+ L   L   GY    ++GKWHLG  +K + P+NRGF    G   G ID +DH 
Sbjct: 102 GLDENEETLADMLARNGYSNRAIIGKWHLGHTRKVHYPINRGFSHFYGHLNGAIDYFDH- 160

Query: 72  TMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGN 131
            M +G    D+   +E  +D    Y+T++ T EA++ +N++ K  P  L +A++A     
Sbjct: 161 -MREGE--LDWHNDWETCYD--KGYSTELITQEAVRCINTYEKEGPFLLYVAYNA----- 210

Query: 132 PYEPIRAPQKLIDAFKYIDD--------SARQKFAAVLSKLDESVGKVVKALHTRGLLEN 183
           P+ P++A +K I+   Y DD          R  + A++S +D  +G +V AL  +G+++N
Sbjct: 211 PHTPLQAQEKDIEL--YCDDFGSLTPKEQKRVTYQAMVSCMDRGIGTIVDALKKKGIMDN 268

Query: 184 SIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVR-GAGFLWSPLLDSKARVAYQK 242
           + ++F +DN GPA       +S+  L+G K   W+GGVR  A F W     +   ++ Q 
Sbjct: 269 TFLIFFSDN-GPA---GVPGSSSGKLRGRKFDEWDGGVRVPAVFYWKRAESNYKNLSSQV 324

Query: 243 MHISDWLPTLYSAAGGDLSVLENLDGVN 270
               D +PTL    G         DG++
Sbjct: 325 TGFVDIVPTLKELVGDKNRPERAYDGIS 352


>UniRef50_A0Z9E1 Cluster: Sulfatase family protein; n=3;
           Proteobacteria|Rep: Sulfatase family protein - marine
           gamma proteobacterium HTCC2080
          Length = 558

 Score =  127 bits (306), Expect = 7e-28
 Identities = 92/276 (33%), Positives = 142/276 (51%), Gaps = 25/276 (9%)

Query: 10  EPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHV----GFWTGRI 65
           E +GLP +E  + + LK  GY T  +GKWHLG  +    P  +GFD  +    G +    
Sbjct: 175 ERKGLPASEVTIAETLKAKGYYTAHIGKWHLGR-ENGMAPHEQGFDDSLLMQSGMYLPEN 233

Query: 66  D------MYDHTTMEQGSW-GTDFRRGFEVAH-DLF--GVYATDVYTDEAIKVVNSHNKS 115
           D            +++  W G  F   +     D F  G Y TD +TDE+IKV+ + NK+
Sbjct: 234 DPNVVNAKVSFDPIDKFLWAGMGFSATYNSGEADKFKPGGYLTDYWTDESIKVIKA-NKN 292

Query: 116 EPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKAL 175
            P FL LAH       P+ P++A ++  DA + I+   ++ +A ++  +D SVG+++  L
Sbjct: 293 RPFFLYLAH-----WGPHTPLQATREDFDALEGIEPHRKRVYAGMIRAVDRSVGRILDTL 347

Query: 176 HTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDS 234
              G+  N++VVF++DNGG  AG+      N P +G K T++EGG+R   F+ W   +  
Sbjct: 348 EEEGIANNTVVVFTSDNGG--AGYIGIPEVNSPFRGFKITMFEGGLRVPLFVRWPAKIAP 405

Query: 235 KARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
              V     HI D +PTL +AAG        +DGV+
Sbjct: 406 GISVNEPVAHI-DVMPTLAAAAGASEPEGVVIDGVD 440


>UniRef50_Q1VP00 Cluster: Arylsulfatase B; n=1; Psychroflexus
           torquis ATCC 700755|Rep: Arylsulfatase B - Psychroflexus
           torquis ATCC 700755
          Length = 386

 Score =  126 bits (304), Expect = 1e-27
 Identities = 84/218 (38%), Positives = 122/218 (55%), Gaps = 23/218 (10%)

Query: 17  NEKILPQYLKDLG-YKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQ 75
           +E  + + L+D G Y T L+GKWHLG   +  LP + GF++ +G   G ID +   TM  
Sbjct: 109 HETTIAEVLRDEGAYDTALIGKWHLGHGDESMLPHHHGFNTFIGHTGGCIDFF---TMTY 165

Query: 76  GSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHN--KSEPLFLMLAHSAVHSGNPY 133
           G    D+    EV  +    YAT++ T+EAI  ++  N  ++EP FL LA++A H G  Y
Sbjct: 166 GII-PDWYHQSEVVSE--NGYATELITEEAIAFLSERNQKRTEPFFLYLAYNAPHFGKGY 222

Query: 134 EPI-RAPQKLIDA-------FKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSI 185
            P   AP  L+           +I+D  R++FAA+   LD+ +G+V+  L    L EN++
Sbjct: 223 SPSDEAPVNLMQPQAAELKRVHFIEDKIRREFAAMTVSLDDGIGQVLDCLEENDLKENTL 282

Query: 186 VVFSTDNGG-PAAGFNDNAASNYPLKGVKNTLWEGGVR 222
           V+F TD+GG P  G      SN PL+G K TL+EGGVR
Sbjct: 283 VIFLTDHGGDPTYG-----GSNLPLRGDKATLFEGGVR 315


>UniRef50_A6DKB8 Cluster: N-acetylgalactosamine 6-sulfatase; n=3;
           Lentisphaera araneosa HTCC2155|Rep:
           N-acetylgalactosamine 6-sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 465

 Score =  125 bits (301), Expect = 3e-27
 Identities = 87/283 (30%), Positives = 144/283 (50%), Gaps = 22/283 (7%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTG--RIDMYD-- 69
           LP +E  + + L  +GY   ++GKWHLG+ +    P  RGFD   G   G  R    D  
Sbjct: 102 LPKSEMTIAESLTQVGYHCGIIGKWHLGA-EPSLRPNKRGFDEFFGHLGGGHRFMPEDLV 160

Query: 70  --HTTMEQGSWGTDFRRGFEVAHDL---FGVYATDVYTDEAIKVVNSHNKSEPLFLMLAH 124
             HT  E+     D  R +   +D       Y T+ ++DEA+  +   N  +P FL L++
Sbjct: 161 IQHT--EEVKNELDSYRSWITRNDTPVKTTKYLTEEFSDEAVSFI-KRNHQKPFFLFLSY 217

Query: 125 SAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENS 184
           +A     P+ P++A +K +  F +I D  R+ +AA++S +D+ V +V+++L    + +N+
Sbjct: 218 NA-----PHLPLQATEKYLARFPHIKDPKRKTYAAMVSAVDDGVSQVMQSLKETNIADNT 272

Query: 185 IVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMH 244
           IV F +DNGGP+   + N + N+PLKG K+ +WEGG R    +  P      +V    + 
Sbjct: 273 IVFFLSDNGGPS---HKNKSDNFPLKGQKSDVWEGGFRVPFAMQYPAAIQAKQVYDHPVS 329

Query: 245 ISDWLPTLYSAAGGDLSVLENLDGVNQWDALS-KNTESPRTSV 286
             D   T+ S A       + LDGVN    ++ + T++P   +
Sbjct: 330 SLDIFATIASLAQSPTHADKPLDGVNLIPFITGEKTQAPHAQI 372


>UniRef50_UPI0000E4801A Cluster: PREDICTED: similar to sulfatase 1
           precursor; n=2; Strongylocentrotus purpuratus|Rep:
           PREDICTED: similar to sulfatase 1 precursor -
           Strongylocentrotus purpuratus
          Length = 470

 Score =  124 bits (298), Expect = 6e-27
 Identities = 68/183 (37%), Positives = 109/183 (59%), Gaps = 14/183 (7%)

Query: 115 SEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFK-YIDDSARQKFAAVLSKLDESVGKVVK 173
           ++P+F+ L++ A     P+ P   P +   +++  I++  R+ +A +++ LDES+GK+  
Sbjct: 165 TKPMFMYLSYQA-----PHLPFEVPDEYFVSYRGKINNRNRRTYAGMVTMLDESIGKLTD 219

Query: 174 ALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLD 233
            L   GL  +++ +FSTDNGG       NA +N+PL+GVK   +EGG+RG GF+  PLL 
Sbjct: 220 TLKEEGLWNDTVFIFSTDNGGVG---KKNAGNNWPLRGVKGNYFEGGIRGVGFVAGPLLS 276

Query: 234 SKAR--VAYQKMHISDWLPTLY-SAAGGDLSVLE-NLDGVNQWDALSK-NTESPRTSVLH 288
           +  +  ++   MHISDW PTL    A   L+  E  LDGVN WD +S+  +  P   +++
Sbjct: 277 TNVQGTISTDLMHISDWYPTLVEGVAKVTLNHTELGLDGVNMWDVISQGESGDPDREIVY 336

Query: 289 NID 291
           NID
Sbjct: 337 NID 339



 Score = 74.9 bits (176), Expect = 4e-12
 Identities = 36/64 (56%), Positives = 40/64 (62%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           MQH VI    PR LPL +  +   L D GY THLVGKWHLG YK+E  PLNRGF S  G 
Sbjct: 94  MQHLVIDPRVPRCLPLGDDTMANKLTDAGYATHLVGKWHLGFYKQECWPLNRGFQSFFGM 153

Query: 61  WTGR 64
             G+
Sbjct: 154 LLGQ 157


>UniRef50_A4XED5 Cluster: Sulfatase precursor; n=1; Novosphingobium
           aromaticivorans DSM 12444|Rep: Sulfatase precursor -
           Novosphingobium aromaticivorans (strain DSM 12444)
          Length = 462

 Score =  123 bits (297), Expect = 9e-27
 Identities = 84/260 (32%), Positives = 132/260 (50%), Gaps = 13/260 (5%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           G+PL+   +   +K LGY+T LVGKWHLG     + PL  G+D  +G   G  D + H  
Sbjct: 116 GVPLDRPTIASVMKALGYRTSLVGKWHLGE-PPAHGPLKHGYDHFLGIVEGGADYFVHRM 174

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVH-SGN 131
           +  G          +   D  G Y TD++ DEA++V+     ++P FL L  +A H    
Sbjct: 175 VMSGKPAGVGLAEDDAQTDRTG-YLTDIFGDEAVRVI-EEGGNQPFFLSLHFTAPHWPWE 232

Query: 132 PYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
             E  +  + L  +F Y +     K+  ++  +D++V KV+ A+   G  +N++VVF++D
Sbjct: 233 GREDEKLARALPSSFHY-EGGNLAKYREMVETMDQNVAKVLAAIDRSGKADNTVVVFTSD 291

Query: 192 NGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWLP 250
           NGG    F+D     +P  G K  + EGGVR    + W   + + +R + Q M   D+LP
Sbjct: 292 NGGER--FSD----TWPFVGHKGEVLEGGVRVPLMVRWPRRIKAGSR-SEQVMVSMDFLP 344

Query: 251 TLYSAAGGDLSVLENLDGVN 270
           TL   AGGD + +   DG +
Sbjct: 345 TLLGMAGGDAARIGRFDGAD 364


>UniRef50_A5ZER6 Cluster: Putative uncharacterized protein; n=1;
           Bacteroides caccae ATCC 43185|Rep: Putative
           uncharacterized protein - Bacteroides caccae ATCC 43185
          Length = 463

 Score =  122 bits (295), Expect = 1e-26
 Identities = 75/210 (35%), Positives = 113/210 (53%), Gaps = 16/210 (7%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           GLPL E+ + +  K  GY+T  +GKWHLGS  +++ P NRGFD   G   G  D Y +  
Sbjct: 108 GLPLEEETIAEVFKTNGYRTAAIGKWHLGSRDEQH-PNNRGFDLFYGMKAGGRD-YFYNE 165

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
            +    G +           F  Y TD ++++A++ +N    S+P  + LA++AVH+   
Sbjct: 166 KKSDRPGDERNLLLNDRQVKFEKYLTDAFSEKAVEFINE--SSQPFMMYLAYNAVHT--- 220

Query: 133 YEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDN 192
             P++A  +  D  K+ +   RQK AA+   LD  VG V++ L   G  +N+++ F +DN
Sbjct: 221 --PMQATDE--DMAKF-EGHPRQKLAAMTYALDRGVGTVIRGLKDSGKFDNTLIFFLSDN 275

Query: 193 GGPAAGFNDNAASNYPLKGVKNTLWEGGVR 222
           GG       N +SNYPLKG K   +EGG R
Sbjct: 276 GGATT----NQSSNYPLKGFKGNKFEGGHR 301


>UniRef50_A6LED1 Cluster: Arylsulfatase A; n=1; Parabacteroides
           distasonis ATCC 8503|Rep: Arylsulfatase A -
           Parabacteroides distasonis (strain ATCC 8503 / DSM 20701
           / NCTC11152)
          Length = 459

 Score =  121 bits (292), Expect = 3e-26
 Identities = 98/305 (32%), Positives = 145/305 (47%), Gaps = 28/305 (9%)

Query: 5   VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGR 64
           V++     GL  +E  + + L+  GY T  VGKWHLG++   YLP + GFD++ G     
Sbjct: 104 VLFPNSKAGLGQDEVTIAKVLQQSGYATGCVGKWHLGAFSP-YLPTDHGFDTYFGIPYSN 162

Query: 65  IDMYDHTTMEQGSWGTDFRR------GFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPL 118
            DM       +G+   +F        G ++  +      T  YT++A+  + +H+K EP 
Sbjct: 163 -DM--SPVQNKGAHARNFPPTPLIVDGKQIESEPDQGELTRRYTEKAVSFIKNHSK-EPF 218

Query: 119 FLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTR 178
           FL  AH+      P+ P      L    ++   S R  +  V+ ++D SVG+V+KAL   
Sbjct: 219 FLYFAHTF-----PHIP------LYTNARFEGTSKRGLYGDVVEEIDWSVGEVLKALREN 267

Query: 179 GLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARV 238
           GL EN+ V+F++DN GP    ++N  S  PLK  K T WEGG R     W P   + A +
Sbjct: 268 GLDENTFVIFTSDN-GPWLTEHENGGSAGPLKDGKGTWWEGGFRVPAICWMPGKINPA-I 325

Query: 239 AYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAA 298
             + M   D  PT  S AG +      LDGVNQ   L +   S R  V +     WG   
Sbjct: 326 NDEIMTSMDLYPTFLSMAGIEQPKDLVLDGVNQTGLLFEEKHSARDEVYY----WWGSEL 381

Query: 299 LTVDK 303
           + + K
Sbjct: 382 MAIRK 386


>UniRef50_A4GJF1 Cluster: Sulfatase; n=1; uncultured marine
           bacterium EB0_50A10|Rep: Sulfatase - uncultured marine
           bacterium EB0_50A10
          Length = 544

 Score =  121 bits (291), Expect = 5e-26
 Identities = 82/282 (29%), Positives = 144/282 (51%), Gaps = 24/282 (8%)

Query: 12  RGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDH- 70
           +G+P  +  + + L+D GY T  +GKWHLG ++    P+++GF   +G         DH 
Sbjct: 172 QGMPTEQITIAEVLRDAGYYTAHIGKWHLG-HEYGMDPMSQGFQDSLGLVGPLYLPEDHP 230

Query: 71  --------TTMEQGSWGT-DFRRGFEVAHDLFGV--YATDVYTDEAIKVVNSHNKSEPLF 119
                   T +++  WG   +   F    DLF    Y TD YTDEA+KV+ + NK+ P F
Sbjct: 231 DVVNAKFDTRIDKMIWGMGQYSANFN-GGDLFAPDKYVTDYYTDEALKVIEN-NKNRPFF 288

Query: 120 LMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRG 179
           L L+H A+H  NP + +R+     +   ++     Q ++ +++ LD SVGK+++ L    
Sbjct: 289 LYLSHWAIH--NPLQALRSD---FEQMSHMHGHNLQVYSGMINSLDRSVGKIIEKLKELD 343

Query: 180 LLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVA 239
           +   ++++F++DNGG  A + +    N P +G K + ++GG+R    +  P   +  + +
Sbjct: 344 IYGKTLIIFTSDNGG--ANYIELNDINKPYRGWKISFFDGGIRVPYIISWPDEINPGKKS 401

Query: 240 YQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTES 281
              +H  D  PT+  AAG  +     LDGV+    +  ++ S
Sbjct: 402 ENAVHHFDIFPTILKAAG--IESTNELDGVDLMPFIKNDSSS 441


>UniRef50_A6CAW6 Cluster: N-acetylgalactosamine-4-sulfatase; n=1;
           Planctomyces maris DSM 8797|Rep:
           N-acetylgalactosamine-4-sulfatase - Planctomyces maris
           DSM 8797
          Length = 472

 Score =  120 bits (289), Expect = 8e-26
 Identities = 84/227 (37%), Positives = 124/227 (54%), Gaps = 21/227 (9%)

Query: 96  YATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQ 155
           Y TD +T EA+  +N H + +P FL LA++AVHS     P++  +K I  F  I+D  RQ
Sbjct: 221 YLTDAFTREAVSFINRH-QDKPFFLYLAYNAVHS-----PLQGKKKDIQHFTQIEDIHRQ 274

Query: 156 KFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNT 215
            FAA+LS +D+S+GK++K +   GL E +++VF +DNGGP     +  +SN PL+G K +
Sbjct: 275 IFAAMLSSMDQSIGKILKQVQQSGLDEKTLIVFLSDNGGPT---RELTSSNLPLRGEKGS 331

Query: 216 LWEGGVRGAGFL--WSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWD 273
           ++EGG+R   FL  W+  L  K  +      + D  PT  + AG  L   +NLDG N   
Sbjct: 332 MYEGGLR-VPFLMRWTGTLAPKQTIDVPVSSL-DIFPTSVALAGASLP--QNLDGRNLLP 387

Query: 274 -ALSKNTESPRTSVLHNIDDIWGIAALTVDKYKLI--KGTIYKGVWD 317
             L + TE P              AAL    +K++  +GT  K VW+
Sbjct: 388 LLLQQKTELPVADFFWRQG---RKAALRSGDWKIVQMRGTREKPVWE 431


>UniRef50_A6C4L0 Cluster: N-acetylgalactosamine-6-sulfate sulfatase;
           n=1; Planctomyces maris DSM 8797|Rep:
           N-acetylgalactosamine-6-sulfate sulfatase - Planctomyces
           maris DSM 8797
          Length = 413

 Score =  120 bits (289), Expect = 8e-26
 Identities = 89/277 (32%), Positives = 138/277 (49%), Gaps = 26/277 (9%)

Query: 4   GVIYGAEPR-----GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHV 58
           GV+Y A P+     GL  NE  L Q L+D GY+T + GKWHLG Y+++Y P  RGF   V
Sbjct: 63  GVVY-ANPKKNRHHGLQKNEITLAQCLQDAGYQTGMFGKWHLG-YQRQYNPTFRGFQQFV 120

Query: 59  GFWTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPL 118
           G+ +G +D + H     G+   D+    E+  +  G Y T +  D A++ +    + +P 
Sbjct: 121 GYVSGNVDYFAHL---DGTGVFDWWHNAELNREEQG-YVTHLINDHALEFIR-QQQEKPF 175

Query: 119 FLMLAHSAVHSGNPYE-PIRAPQKLIDAFKYIDDSARQKFA----AVLSKLDESVGKVVK 173
           F+ +AH AVHS  PY+ P   P +  +    I  + R+  A     + +++D+ +G++V 
Sbjct: 176 FVYIAHEAVHS--PYQGPHDQPMRK-EGGGDIKSAKRKDIANAYREMNTEMDKGIGQIVD 232

Query: 174 ALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLD 233
            L    L E + + F +DNG      N N  SN  L+G K +LWEGG R       P   
Sbjct: 233 VLKEVNLTEKTFIFFLSDNGA-----NKN-GSNGKLRGFKGSLWEGGHRVPAIACWPGRI 286

Query: 234 SKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
            +  V  + +   D +PT+   A   +     LDGV+
Sbjct: 287 PEGTVCDEPVISIDLMPTILELANAKIPAGHKLDGVS 323


>UniRef50_A6DKD8 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
           Lentisphaera araneosa HTCC2155|Rep:
           N-acetylgalactosamine 6-sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 455

 Score =  118 bits (285), Expect = 2e-25
 Identities = 86/292 (29%), Positives = 141/292 (48%), Gaps = 19/292 (6%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           G+PL+E+++   LK   Y T ++GKWH+G    E  P  R  D + GF  G     +   
Sbjct: 106 GIPLDEQMIFDLLKPAAYTTGVIGKWHMG-LSHEQRPTQRSVDYYYGFLNGAHSYREAKM 164

Query: 73  MEQGSWGT--DFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSG 130
             +G+  T   FR    V    F  Y T+V+ DE +  +   NK +P FL +++++VH  
Sbjct: 165 DMKGAPMTWPIFRNNEPVP---FSGYTTEVFNDEGVNFIK-RNKDKPFFLYMSYNSVHG- 219

Query: 131 NPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFST 190
            P+E   A  K +    +I    R+ ++A+L  +D+ VG++++ L   G+ EN++V+F +
Sbjct: 220 -PWE---AQPKDLQRSDHIKKKWRRIYSAMLISMDDGVGRLIQTLKDEGIYENTLVIFMS 275

Query: 191 DNGGP-----AAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHI 245
           DNG P     A    D  ASN  L+G K   +EGG+R    +  P +  K       +  
Sbjct: 276 DNGAPNNLHEAERAGDYLASNGSLRGRKGDTYEGGIRVPYIMSWPQVIPKQSTYQHPVSG 335

Query: 246 SDWLPTLYSAAGGDLSVLENLDGVNQWDALS-KNTESPRTSVLHNIDDIWGI 296
            D +PTL   +       + L GVN    ++ + T  P  ++    DD + I
Sbjct: 336 LDIVPTLIHISQA-APAKKELSGVNLMPYITGEKTSRPHKTLYWRRDDDYAI 386


>UniRef50_A6CD52 Cluster: Twin-arginine translocation pathway
           signal; n=1; Planctomyces maris DSM 8797|Rep:
           Twin-arginine translocation pathway signal -
           Planctomyces maris DSM 8797
          Length = 460

 Score =  118 bits (284), Expect = 3e-25
 Identities = 86/277 (31%), Positives = 131/277 (47%), Gaps = 20/277 (7%)

Query: 12  RGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHT 71
           RG+   E  +   L+  GY+T L+GKWHLG   + +LP   GFD   G   G ID +  T
Sbjct: 109 RGIQPGETTIADVLQQNGYQTALLGKWHLGHGTESFLPTAHGFDLFRGHTGGCIDYFTMT 168

Query: 72  TMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHSG 130
                 W  + R      H     YATD+ T+EA   +     ++ P FL L+++A H G
Sbjct: 169 YGNIPDWYHNQR------HVSENGYATDLITEEAEHFLKDQQTTDKPFFLFLSYNAPHFG 222

Query: 131 NPYEP-IRAPQKLIDA-------FKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLE 182
             + P  ++P  ++ A          I D  R++FAA+   LD+ +G+V+ +L   GL +
Sbjct: 223 KGWSPGDQSPVNIMQARGDDLKRVGTIKDKVRREFAAMTVSLDDGIGRVMSSLKNNGLDQ 282

Query: 183 NSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQK 242
           N++V+F TD+GG          +N P +G K TL+EGG+R    +  P          + 
Sbjct: 283 NTLVIFMTDHGGDYV----YGGNNQPFRGAKATLFEGGIRVPCIIRWPGKIKAGTETNEV 338

Query: 243 MHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNT 279
               D  PT+   A  D   L  LDG +    L++ T
Sbjct: 339 AWALDLFPTICHFANVDTDGL-TLDGKDISGLLTRQT 374


>UniRef50_A6C4W7 Cluster: Twin-arginine translocation pathway
           signal; n=1; Planctomyces maris DSM 8797|Rep:
           Twin-arginine translocation pathway signal -
           Planctomyces maris DSM 8797
          Length = 459

 Score =  118 bits (283), Expect = 4e-25
 Identities = 90/255 (35%), Positives = 130/255 (50%), Gaps = 19/255 (7%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           GLP     + + LK  GY T   GKWHLG Y+  +LP N+GFD   G  +G     DH T
Sbjct: 116 GLPHQAVTMAELLKQQGYATACFGKWHLG-YQPPWLPTNQGFDLFRGLTSGD---GDHHT 171

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVH---S 129
               S   D+    E++ +    Y  D+ +  ++  + + N++ P FL + H A+H    
Sbjct: 172 HVDRSGNEDWWHNNEISMEKG--YTADLLSKYSVAFMEA-NRTRPFFLYVPHLAIHFPWQ 228

Query: 130 GNPYEPIRAPQKLIDAFKY--IDD--SARQKFAAVLSKLDESVGKVVKALHTRGLLENSI 185
           G    P R   +   A K+  I D  +      A++  LD+SVGK++ AL    L +N++
Sbjct: 229 GPQDPPHRKAGQDYHAGKWGIIPDPGNVSPHTTAMIESLDQSVGKILSALKRLDLEQNTL 288

Query: 186 VVFSTDNGGPAA-GFN-DNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQK 242
           V+F++DNGG    G N  N +SN PL+G K TL+EGG R    + W  ++   A V  Q 
Sbjct: 289 VIFTSDNGGYLTYGKNFQNISSNGPLRGQKATLYEGGHRVPCLISWPGVI--TAGVTDQT 346

Query: 243 MHISDWLPTLYSAAG 257
            H  D LPTL  AAG
Sbjct: 347 AHSVDLLPTLAQAAG 361


>UniRef50_Q7UX97 Cluster: Arylsulfatase B [Precursor]; n=1;
           Pirellula sp.|Rep: Arylsulfatase B [Precursor] -
           Rhodopirellula baltica
          Length = 579

 Score =  117 bits (282), Expect = 6e-25
 Identities = 96/336 (28%), Positives = 157/336 (46%), Gaps = 49/336 (14%)

Query: 4   GVIYGAEPRGLPLNEKILPQYLKDLGYKTH-LVGKWHLGSYKKEYLPLNRGFDSHVGFWT 62
           GV+  ++  GLP   +  P++L  LGY    + GKWHLG     + PL+ G     G + 
Sbjct: 203 GVVSPSKKHGLPPQLETAPEHLSKLGYDHRAMFGKWHLGLASTLFHPLHHGMTEFYGHYN 262

Query: 63  GRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLML 122
           G ID +      Q  W     R F+  H+    Y+T++  +  +  ++ +  + P++  +
Sbjct: 263 GAIDYFSRERFGQLDW----HRDFDSVHE--EGYSTELVGNAVVDFIDRNANAGPVYAYV 316

Query: 123 AHSAVHSGNPYEPIR--------------AP---QKLIDAFKYID-------DSARQKFA 158
           A +A HS  P + +R              AP   +K+    K +D       +S RQ FA
Sbjct: 317 AFNAPHS--PLQALRSDLDEYGFDPNNKLAPNTDRKIAKREKALDYGKRGKGNSIRQTFA 374

Query: 159 AVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGG-PAAGFNDNAASNYPLKGVKNTLW 217
           A+ + +D  +G+++ A+   G+ EN++VVF +DNG  P  G N     N PL+G K T W
Sbjct: 375 AMTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNGADPKHGGN-----NEPLRGNKFTTW 429

Query: 218 EGGVRGAGFLWSPLLDSKARVAYQKM-HISDWLPTLYSAAGGDLSVLENLDGVNQWDALS 276
           EGGVR    +  P  +  A + Y  +    D LP++  AAG      E  DG+N    LS
Sbjct: 430 EGGVRVVAMMRWP-NELPAGITYDSVTSYVDLLPSMVGAAGSPPP--EETDGINLLPFLS 486

Query: 277 KNTESPRTSVLHNIDDIWGIAALTVDKYKLIKGTIY 312
                P  ++L + + +        D++KL  G ++
Sbjct: 487 GKASPPERTILLDAETV------VSDRWKLKAGELF 516


>UniRef50_A0HG49 Cluster: Sulfatase; n=6; Comamonadaceae|Rep:
           Sulfatase - Comamonas testosteroni KF-1
          Length = 457

 Score =  117 bits (282), Expect = 6e-25
 Identities = 82/269 (30%), Positives = 132/269 (49%), Gaps = 15/269 (5%)

Query: 4   GVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTG 63
           G + GA+  GLP     +   LK  GY+T L+GKWHLG Y   + PL  G++ + G  +G
Sbjct: 102 GTLLGAK-LGLPPEIPTVASLLKGAGYRTALIGKWHLG-YPPHFGPLRSGYEEYFGPMSG 159

Query: 64  RIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLML 122
            +D + H +    S   D   G E  HD    Y TD+ +  ++  VN  ++ + P FL L
Sbjct: 160 GVDYFTHLS---SSGQHDLWVGEEEHHD--EGYLTDLLSQRSVDFVNRMSEGDAPFFLSL 214

Query: 123 AHSAVH-SGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLL 181
            ++A H      +     ++L     ++      ++  ++  +DE +G +V+AL   G L
Sbjct: 215 HYTAPHWPWETRDDRETAEQLGAGITHLAGGNIHQYRRMIHHMDEGIGWIVEALRRNGQL 274

Query: 182 ENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQ 241
           +N+++VF++DNGG    F+D    ++PL G K  L EGG+R       P +    R + Q
Sbjct: 275 DNTLIVFTSDNGGER--FSD----SWPLVGGKMDLTEGGIRVPWIAHWPAVIEAHRSSAQ 328

Query: 242 KMHISDWLPTLYSAAGGDLSVLENLDGVN 270
                DW  T+  AAG        LDG++
Sbjct: 329 PCMSMDWSATVLDAAGVSADPDYPLDGIS 357


>UniRef50_Q7UGB4 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
           n=1; Pirellula sp.|Rep: N-acetylgalactosamine 6-sulfate
           sulfatase - Rhodopirellula baltica
          Length = 485

 Score =  117 bits (281), Expect = 7e-25
 Identities = 87/298 (29%), Positives = 138/298 (46%), Gaps = 39/298 (13%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           G+   E ILP  L+  GYK+ + GKW LG+ ++  LP +RGFD   GF    ID + H  
Sbjct: 124 GMDEREVILPAVLRPAGYKSGIFGKWDLGALQR-MLPTSRGFDDFYGFVNTGIDYFTHE- 181

Query: 73  MEQGSWGTD-FRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGN 131
                +G     R  E      G Y T ++  EA++ ++ H  +EP FL +  +A H+ +
Sbjct: 182 ----RYGVPCMVRNLEPTEADKGTYCTYLFQREALRFLDEHAGNEPFFLYVPFNAPHNSS 237

Query: 132 P------------------YEPIRAPQKLIDAFKY-------IDDSARQKFAAVLSKLDE 166
                              Y P+    ++ D ++Y          + R+ + A ++ +D 
Sbjct: 238 SLVPTIRSSVQAPDQFKAMYPPVEVETRVTDRYRYGSPATVATPQARRRDYRAAVTCMDA 297

Query: 167 SVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGF 226
           ++G+++  L  + +L+ +IVVF +DNGG         A N PL+G K   WEGG+R    
Sbjct: 298 AIGEILDRLEAKQMLDETIVVFFSDNGG------SGGADNSPLRGHKAQTWEGGIRVPCL 351

Query: 227 LWSPLLDSKARVAYQKMHIS-DWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPR 283
           +  P     A V   +   S + LP+  +AAG +      LDG + W  L    ESPR
Sbjct: 352 VRWPAGQIPAGVVNDEFLTSLELLPSFAAAAGVEPPPGVVLDGFDWWPTLRGEAESPR 409


>UniRef50_Q3M597 Cluster: Twin-arginine translocation pathway signal
           precursor; n=1; Anabaena variabilis ATCC 29413|Rep:
           Twin-arginine translocation pathway signal precursor -
           Anabaena variabilis (strain ATCC 29413 / PCC 7937)
          Length = 457

 Score =  116 bits (280), Expect = 1e-24
 Identities = 80/259 (30%), Positives = 129/259 (49%), Gaps = 15/259 (5%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           G+P N+  +   LK  GY+T LVGKWH G Y   + PL +GFD + G  +G I+ + HT 
Sbjct: 126 GIPANQPTIASLLKANGYETALVGKWHAG-YPPNFGPLQKGFDEYFGHLSGGIEYFTHTG 184

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
            ++     D     +V     G Y TD++TD A++ +   + S P +L L ++A H    
Sbjct: 185 TDRI---LDLYEN-DVPVQRSG-YVTDLFTDRAVEFIQRPH-SRPFYLSLHYNAPHWPWQ 238

Query: 133 YEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDN 192
               +A         Y    ++  +AA++  LD+ VG+V+ AL   G  +N++V+F++DN
Sbjct: 239 GPNDQASTAFYLTNGYTVGGSQATYAAMVKSLDDGVGRVLDALEASGQADNTLVIFTSDN 298

Query: 193 GGPAAGFNDNAASNY-PLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPT 251
           GG          SN+ P +G K +L+EGG+R    +  P +    +V+ Q +   D   T
Sbjct: 299 GG-------ERFSNFGPFRGQKASLYEGGIRVPAIIRYPGVTQANQVSNQVIITFDLTAT 351

Query: 252 LYSAAGGDLSVLENLDGVN 270
           + +A G         DG N
Sbjct: 352 ILAATGTSFHPNYPPDGQN 370


>UniRef50_Q15XH3 Cluster: Sulfatase precursor; n=1;
           Pseudoalteromonas atlantica T6c|Rep: Sulfatase precursor
           - Pseudoalteromonas atlantica (strain T6c / BAA-1087)
          Length = 500

 Score =  116 bits (279), Expect = 1e-24
 Identities = 90/313 (28%), Positives = 149/313 (47%), Gaps = 31/313 (9%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTG-------RI 65
           G+  +E  + Q +K  GY T  +GKWHLG    EY P   GFD   GF  G       + 
Sbjct: 118 GVSADELFIAQTMKSAGYFTGAMGKWHLGE-ASEYHPNKHGFDEFYGFLGGGHNYFPEQF 176

Query: 66  DMYDHTTMEQGSWGTDF------RRGFEVAHDLFGVYATDVYTDEAIKVVN-SHNKSEPL 118
           +   +  + QG    +         G EV       Y TD  + EA+  V+ +  K +P 
Sbjct: 177 EAAYNKRVAQGMTNINMYLTPLEHNGKEVRET---EYITDGLSREAVNFVDKAAAKKKPF 233

Query: 119 FLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTR 178
           FL LA++A     P+ P++A ++ +  F  I D  R+ +A ++  +D  VG++V+ L   
Sbjct: 234 FLYLAYNA-----PHVPLQAKEEDMAMFSQIKDKKRRTYAGMVYAVDRGVGRIVEQLKKN 288

Query: 179 GLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKAR 237
           G  +N+++VF++DNGG         A+NYPLK  K ++ EGG R    + W   + + +R
Sbjct: 289 GQFDNTVIVFTSDNGGKLG----QGANNYPLKEGKGSVQEGGFRTPMLVHWPKHMKAGSR 344

Query: 238 VAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGI- 296
            ++  + + D  PT     G  L   + LDG + W  +  NT   +   ++ +    G  
Sbjct: 345 FSHPVLAL-DLYPTFAGLGGAVLPEDKKLDGKDIWADIQANTAPHKDEFIYVLRHRNGYS 403

Query: 297 -AALTVDKYKLIK 308
            AA   +++K +K
Sbjct: 404 DAAARRNQFKAVK 416


>UniRef50_Q1YSH0 Cluster: Sulfatase family protein; n=4; cellular
           organisms|Rep: Sulfatase family protein - gamma
           proteobacterium HTCC2207
          Length = 557

 Score =  116 bits (278), Expect = 2e-24
 Identities = 89/290 (30%), Positives = 137/290 (47%), Gaps = 28/290 (9%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDS-------------HVG 59
           G+P  E  + + L+   Y T  +GKWHLGS   +  P  +GFD              H  
Sbjct: 177 GMPAAEITIGEVLQQQDYYTAHIGKWHLGS-NGDMRPEQQGFDDSLSMKGIFYLPPDHPD 235

Query: 60  FWTGRI--DMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEP 117
               +I  D  D      GS+   +  G     +  G Y TD +TD A+ V+ + N+  P
Sbjct: 236 VVNAKIPGDSIDSMVWAVGSYEVQWNGG--PPFEPKG-YLTDYFTDAAVDVIEA-NRHRP 291

Query: 118 LFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHT 177
            FL LAH       P+ P++A ++  DA  +I D   + +AA+L  LD SV K+  +L  
Sbjct: 292 FFLYLAH-----WGPHNPVQASREDYDALPHIKDHRLRTYAAMLRALDRSVEKIEASLQE 346

Query: 178 RGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKAR 237
            GL +N++++F++DNGG  AG+ D    N P +G K T +EGG         P      +
Sbjct: 347 NGLSDNTLIIFTSDNGG--AGYLDLTDLNKPYRGWKLTHFEGGTHVPYMAKWPAQIEAGQ 404

Query: 238 VAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDAL-SKNTESPRTSV 286
            + + +H  D   T+ +AAG  +     LDGVN    +  K T +P  ++
Sbjct: 405 SSDEAIHHIDMFHTIAAAAGASVPTDRTLDGVNLLPFMQGKQTGAPHKTL 454


>UniRef50_Q02AN8 Cluster: Sulfatase precursor; n=1; Solibacter
           usitatus Ellin6076|Rep: Sulfatase precursor - Solibacter
           usitatus (strain Ellin6076)
          Length = 443

 Score =  116 bits (278), Expect = 2e-24
 Identities = 86/274 (31%), Positives = 131/274 (47%), Gaps = 24/274 (8%)

Query: 21  LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGT 80
           L   LK  GY+T   GKWHLGS   E  P   GFDS  GF +G +D Y H       WG 
Sbjct: 107 LASVLKGSGYQTGCFGKWHLGS-TDETAPTGHGFDSFYGFHSGCVDYYSHRFY----WGD 161

Query: 81  DFRRGFEVAHDLF--GVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRA 138
           ++   +    ++F  G Y T+   DEA   +    ++ P    +A +A     P+ P+ A
Sbjct: 162 NYHDLWHNRTEIFEDGRYLTERIADEAAGFIG---RNRPFLGYVAFNA-----PHYPMHA 213

Query: 139 PQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPA-- 196
           P +    F  +    RQ +AA+++ +D+ +G++ +AL T G  EN+++ F  DNG     
Sbjct: 214 PAQYKARFPNLAPE-RQTYAAMIAAVDDGIGQIQRALETTGAAENTLMFFIGDNGATTEK 272

Query: 197 -AGFNDN---AASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTL 252
            AG N +   A  N   KG K +L++GG+   GF+  P    K     +     D LPT+
Sbjct: 273 RAGLNGDFATAGDNGVFKGYKFSLFDGGMHVPGFVSWPAGIRKGGWTDELAMSMDILPTI 332

Query: 253 YSAAGGDLSVLENLDGVNQWDALSKNTESPRTSV 286
             A G  L     +DG +  + ++ N  SP  S+
Sbjct: 333 CRATGAPLP--PRVDGSDLLNTIASNAPSPHKSL 364


>UniRef50_A6C383 Cluster: Sulfatase; n=1; Planctomyces maris DSM
           8797|Rep: Sulfatase - Planctomyces maris DSM 8797
          Length = 405

 Score =  115 bits (277), Expect = 2e-24
 Identities = 82/287 (28%), Positives = 135/287 (47%), Gaps = 14/287 (4%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           G+P  +  + + ++  GY+T  +GKWHLG Y  E +P  +GF++  G   G ID Y H  
Sbjct: 87  GMPTEQITIAEMMQQAGYQTAHIGKWHLG-YTPETMPHGQGFETSFGHMGGCIDNYSHFF 145

Query: 73  MEQGSWGTD-FRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGN 131
              G    D +  G EV  D  G +  D+  ++    +      +P FL  A +      
Sbjct: 146 YWNGPNRHDLWENGKEVWRD--GAFFPDLMVEQCQDYIRKAG-DKPFFLYWAINV----- 197

Query: 132 PYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
           P+ P++  +K    + ++  S R K+AA +S +D+ +G+V+  L    L E +I++F +D
Sbjct: 198 PHYPLQGKEKWRKTYAHLS-SPRDKYAAFVSTMDDCIGEVLATLDACQLREKTIIIFQSD 256

Query: 192 NG-GPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLP 250
           +G            S  P +G K +L+EGG+R    +  P   ++  V  Q     DWLP
Sbjct: 257 HGHSHEERTFGGGGSAGPYRGAKFSLFEGGIRVPAMISWPGTIAEGEVRDQLATGCDWLP 316

Query: 251 TLYSAAGGDLSVLENLDGVNQWDALSKNT-ESPRTSVLHNIDDIWGI 296
           T+ +  G  L    +LDG N    +  +T +SP  +    I   W I
Sbjct: 317 TISALTGAPLPA-HHLDGKNLKAVIESSTAKSPHENFYWQIGKSWAI 362


>UniRef50_A6DG78 Cluster: Sulfatase; n=1; Lentisphaera araneosa
           HTCC2155|Rep: Sulfatase - Lentisphaera araneosa HTCC2155
          Length = 464

 Score =  115 bits (276), Expect = 3e-24
 Identities = 81/266 (30%), Positives = 129/266 (48%), Gaps = 33/266 (12%)

Query: 8   GAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDM 67
           G + R + L E  L + LKD GYKT L GKWHLG++  +Y P  +GFD   G   G ID 
Sbjct: 107 GPKTRNMNLEEYTLAEALKDSGYKTALFGKWHLGAH-LDYGPTKQGFDEFYGIRGGFIDN 165

Query: 68  YDHTTMEQGSWGTDFRRGFEVAHDLF--GVYATDVYTDEAIKVVNSHNKSEPLFLMLAHS 125
           Y+H  +     G  F   +E   ++F  G Y  ++ TD A+  ++  NK+ P FL LA +
Sbjct: 166 YNHYFLH----GEGFHDLYEGTKEVFDEGKYFPNLVTDRALNFID-RNKNNPFFLFLAFN 220

Query: 126 AVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSI 185
                 P+ P +A  K  + +K +    RQ +A ++S  D+ +G+++  L   G+ +N+I
Sbjct: 221 I-----PHYPEQADPKFDERYKNM-KMPRQSYAKMISTTDDHMGQIMSKLQEHGIYDNTI 274

Query: 186 VVFSTDNG-----------GPAAGFNDN--------AASNYPLKGVKNTLWEGGVRGAGF 226
           ++F +DNG              +G   N          +    +G K+  +EGG+R    
Sbjct: 275 IIFMSDNGHSRERNHIKFDNHKSGLAKNTKYGALGGGGNTGKWRGNKSNFYEGGIRVPAI 334

Query: 227 LWSPLLDSKARVAYQKMHISDWLPTL 252
           +  P    K  V  Q +   DW+PT+
Sbjct: 335 ITFPNKLPKGAVRDQAITAMDWMPTV 360


>UniRef50_A4CMB1 Cluster: Arylsulphatase A; n=3; Bacteria|Rep:
           Arylsulphatase A - Robiginitalea biformata HTCC2501
          Length = 459

 Score =  114 bits (275), Expect = 4e-24
 Identities = 93/275 (33%), Positives = 141/275 (51%), Gaps = 25/275 (9%)

Query: 20  ILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMY-DHTTMEQGSW 78
           ++P  L   GY T ++GKWHLG  + +  P +RGF    GF    +D Y DH    +G  
Sbjct: 129 LIPSELNPAGYHTGIIGKWHLGLEEPD-TPNDRGFTYFKGFLGDMMDDYWDH---RRG-- 182

Query: 79  GTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHSGNPYEPIR 137
           G ++ R      D  G +ATD++TD  I  +      E P FL LA++A     P+ PI+
Sbjct: 183 GINWMRLNREEIDPKG-HATDLFTDWTIDFLKERQGEEQPFFLYLAYNA-----PHFPIQ 236

Query: 138 APQKLIDAFKYIDDSARQKFA---AVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGG 194
            P++ +D  +  + +  +K A   A +  LD SVG+V++AL T GL EN++VVF +DNGG
Sbjct: 237 PPREWLDKVREREPNLTEKRAKNVAFVEHLDYSVGRVMEALKTTGLEENTLVVFVSDNGG 296

Query: 195 PAAGFNDNAASNYPLKGVKNTLWEGGVR-GAGFLWSPLLDSKARVAYQKMHISDWLPTLY 253
            A  +   A SN PL+G K  ++EGG+R  A F W   + +    +     + D  PT  
Sbjct: 297 -ALWY---AQSNGPLRGGKQDMYEGGIRVPAIFYWKGKI-APGTTSDNTALLMDLFPTFC 351

Query: 254 SAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLH 288
             AG      EN+DG++    L+   +      L+
Sbjct: 352 ELAG--RKPPENVDGISLVPTLTGQAQDTANRYLY 384


>UniRef50_A6DKP2 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
           Lentisphaera araneosa HTCC2155|Rep:
           N-acetylgalactosamine 6-sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 446

 Score =  113 bits (272), Expect = 9e-24
 Identities = 83/266 (31%), Positives = 128/266 (48%), Gaps = 17/266 (6%)

Query: 5   VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGR 64
           V  G   +G+P ++K + + LK  GYK+   GKWHLGS KK   P +RGFD+  GF  G 
Sbjct: 87  VTNGDADKGIPKSQKNIAELLKPAGYKSGAFGKWHLGS-KKGQFPNDRGFDTFYGFHFGA 145

Query: 65  IDMY--DHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLML 122
            D Y  D    ++           ++     G Y T+  TD A++ +   NK +P F+ +
Sbjct: 146 HDYYRADKKLNKKKKGYAPIYFNQDIVDYKEGDYLTEKITDHAVEFI-EENKDQPFFMYV 204

Query: 123 AHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLE 182
           A+++VHS     P + P + +        + R+ F A++  +D+ VG++   L    L E
Sbjct: 205 AYNSVHS-----PWQVPDEYLARIPESVPAYRRLFLAMVLAMDDGVGRIRAKLKELNLDE 259

Query: 183 NSIVVFSTDNGGPAAGFNDNAASNYPL------KGVKNTLWEGGVRGAGFLWSPLLDSKA 236
           N+I VF+TDNG P  G        Y +      +G K   +EGG+R   F  S     K+
Sbjct: 260 NTIFVFTTDNGSPKIGNKKPNEGQYRMSMSQGFRGYKGDTYEGGIR-VPFCMSWPKKIKS 318

Query: 237 RVAYQKMHIS-DWLPTLYSAAGGDLS 261
              ++   I+ D  PT  SAA  + S
Sbjct: 319 GNKFEAPVIAYDLAPTFLSAASLEYS 344


>UniRef50_Q7UHJ9 Cluster: Iduronate-sulfatase or arylsulfatase A;
           n=5; cellular organisms|Rep: Iduronate-sulfatase or
           arylsulfatase A - Rhodopirellula baltica
          Length = 1012

 Score =  113 bits (271), Expect = 1e-23
 Identities = 92/326 (28%), Positives = 155/326 (47%), Gaps = 20/326 (6%)

Query: 4   GVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF-WT 62
           GV+   +P+GL  +E  + + LK  GY+T + GKWHLG  + E+LP  +GFD   G  ++
Sbjct: 644 GVLLAGDPKGLHPDEITIAEVLKTAGYRTGMFGKWHLGD-QPEFLPTKQGFDEFFGIPYS 702

Query: 63  GRIDMYDHTTMEQGSWGTD--FRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
             I  + H       +      +    +  D    + T   T++A+  +   NK +P FL
Sbjct: 703 HDIHPF-HPRQNHYHFPPLPLLQNDTVIEMDPDADFLTKRLTEQAVSFIE-RNKDQPFFL 760

Query: 121 MLAH----SAVHSGNPYEPIRAPQKLIDAFKY---IDDSARQK-FAAVLSKLDESVGKVV 172
            L H    + +H+  P+    A   +    K    ID + R   F   ++++D SVG+++
Sbjct: 761 YLPHPIPHAPLHASPPFMEGVADDVIAAIEKEDGNIDYATRANLFRQAIAEIDWSVGQIL 820

Query: 173 KALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLL 232
            AL + GL E ++V+F++DNG P    N   AS   L+G K T +EGG+R    +  P  
Sbjct: 821 DALRSNGLDEKTMVLFTSDNGPPK---NTLYASPGELRGHKGTTFEGGMREPTVVRWPGQ 877

Query: 233 DSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDD 292
                   + M   D LPT    AG  +     +DG + W  L   T++P  +  ++  +
Sbjct: 878 IPAGHQNDELMTAMDLLPTFAKLAGAAIPTDRVIDGKDIWPTLKGETQTPHDAFFYHRGN 937

Query: 293 IWGIAALTVDKYKL-IKGTIYKGVWD 317
              +AA+   K+KL +   + K ++D
Sbjct: 938 --QLAAVRSGKWKLHVNNGVAKQLYD 961



 Score = 57.2 bits (132), Expect = 8e-07
 Identities = 58/206 (28%), Positives = 96/206 (46%), Gaps = 19/206 (9%)

Query: 89  AHDLFGVYATD-VYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFK 147
           AH+++    T  + T+ A+K + +  K+EP FL  A   +H  +P+ P  AP+     FK
Sbjct: 233 AHEIYDDEKTGTLLTERAVKWI-TEKKNEPFFLYFATPNIH--HPFTP--APR-----FK 282

Query: 148 YIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGG--PAAGFNDNAAS 205
               S    +   + +LD  VG++V++L   GL +N++V+F++DNG     AG +   A 
Sbjct: 283 --GTSQCGLYGDFVHELDWMVGEIVQSLEDNGLTDNTLVLFTSDNGAMLNRAGRDAIKAG 340

Query: 206 NYP---LKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSV 262
           + P   L G K  +WEGG R       P        + Q +   D   T  +    ++  
Sbjct: 341 HQPNGELLGFKFGVWEGGHRVPLIAKWPGKIKAGTQSDQLISQVDLFATFSALTEQEMPS 400

Query: 263 LENLDGVNQWDALSKNTESP-RTSVL 287
            E  D +N   AL  +   P RT ++
Sbjct: 401 SEQKDSINMLPALLDDPNEPLRTELV 426


>UniRef50_Q2GB51 Cluster: Sulfatase; n=2; Proteobacteria|Rep:
           Sulfatase - Novosphingobium aromaticivorans (strain DSM
           12444)
          Length = 491

 Score =  113 bits (271), Expect = 1e-23
 Identities = 90/280 (32%), Positives = 134/280 (47%), Gaps = 23/280 (8%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           GLP +   LP  L   GY+T L+GKWHLGS   ++ PL  G+ +  G  +G +D Y H T
Sbjct: 135 GLPPSHPTLPSLLAKAGYRTSLIGKWHLGSL-PDFDPLKSGYQTFWGIRSGGVDYYTHAT 193

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHSGN 131
                   D     E A      Y TD+  D A+  +   +  E P F+ L  +A H   
Sbjct: 194 SNGQPDLWDGPTPVERAG-----YLTDLLADRAVSEIREASSGEAPWFMSLHFTAPHW-- 246

Query: 132 PYE-PIRAPQ-----KLID--AFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLEN 183
           P+E P  A +     KL D  A  + D  +   +AA++ +LD  +G+V++AL      ++
Sbjct: 247 PWEGPDDASESARIAKLKDPSALFHFDGGSAAIYAAMVRRLDYQIGRVLEALKANRAEQD 306

Query: 184 SIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKM 243
           +IVVF++DNGG    F+D     +P  G K  L EGG+R    +  P +      +  ++
Sbjct: 307 TIVVFTSDNGGER--FSD----TWPFSGRKTELLEGGLRIPAIVRWPGVTRAGTTSDAQI 360

Query: 244 HISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPR 283
              DWLPT  +AAG         DGV+   AL   + + R
Sbjct: 361 ISMDWLPTFLAAAGSAPDPGHPSDGVDVTPALGGGSLAER 400


>UniRef50_A6DMV0 Cluster: N-acetylgalactosamine-6-sulfate sulfatase;
           n=1; Lentisphaera araneosa HTCC2155|Rep:
           N-acetylgalactosamine-6-sulfate sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 443

 Score =  112 bits (270), Expect = 2e-23
 Identities = 92/296 (31%), Positives = 139/296 (46%), Gaps = 20/296 (6%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           GL   +  LP+ LK  GYKT   GKWHLGS  K + P++ GFD + G   G  D Y +  
Sbjct: 114 GLLPEKNHLPKLLKKAGYKTGAFGKWHLGSQDK-FNPIHHGFDEYYGPLLGHCDYYTYKY 172

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
            +        R G +V  D    Y T    + A+  ++ H   +P F+ + H AVHS  P
Sbjct: 173 YDD---TYTLREGAKVIKD--SGYLTTNINERAVDFIDRH-ADKPFFMYVPHMAVHS--P 224

Query: 133 YEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDN 192
           Y+      K I     ++D  R  +AA++ ++D+ V  ++  L  + +   ++ V S+DN
Sbjct: 225 YQSADKKPKQITKTN-LNDGNRADYAAMVEEVDKGVEMIIAKLKEKKIFHKTLFVVSSDN 283

Query: 193 GGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTL 252
           GG  A F+DNA    PL   K TL+EGG+R    +  P    K  V+ Q     D   T 
Sbjct: 284 GG--AHFSDNA----PLFHRKTTLFEGGIRVPCIMHWPEKIGKGVVSDQIAITMDLSKTF 337

Query: 253 YSAAGGDLSVLENLDGVNQWDALS-KNTESPRTSVLHNIDDIWGIAALTVDKYKLI 307
            + AG D     + DG+N    ++ KN +  RT    +        A+ + K+K I
Sbjct: 338 LALAGID---EPSYDGINLLPMMTDKNNKVERTLFWRSNSKARRQKAVRMGKWKYI 390


>UniRef50_A4CGL5 Cluster: Arylsulfatase A; n=4; Bacteria|Rep:
           Arylsulfatase A - Robiginitalea biformata HTCC2501
          Length = 526

 Score =  111 bits (267), Expect = 4e-23
 Identities = 92/332 (27%), Positives = 148/332 (44%), Gaps = 20/332 (6%)

Query: 3   HGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWT 62
           H  +    P GL   E+ L + L+  GY+T + GKWHLG +  ++LP   GFD   G   
Sbjct: 141 HNALMPNSPVGLNPAEETLAELLRQQGYRTGIFGKWHLGDHP-DFLPTRHGFDEFFGIPY 199

Query: 63  GRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGV----YATDVYTDEAIKVVNSHNKSEPL 118
              DM+    ++   +       +E    +  +      T   T+ ++  +N H K EP 
Sbjct: 200 SN-DMWPLHPLQGPVFDFGPLPLYEQERVVDTLEDQRLLTRQITERSVDFINRH-KEEPF 257

Query: 119 FLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTR 178
           FL + H   H          P  + DAF+    S R  +  V+ ++D SVG+V+ AL   
Sbjct: 258 FLYVPHPQPH---------VPLFVSDAFR--GKSGRGLYGDVIMEIDWSVGQVLGALEDN 306

Query: 179 GLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARV 238
           GL +++ V+F++DNG P   + +++    PL+  K T WEGGVR    +  P    + +V
Sbjct: 307 GLTDDTWVIFTSDNG-PWLAYGNHSGRAEPLREGKGTNWEGGVREPCIMKFPGRLPRGKV 365

Query: 239 AYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAA 298
             + +   D LPT+ S  G      E +DG N W  LS           +    +  + A
Sbjct: 366 LDEPLMAIDLLPTIASVTGSPQPGRE-IDGKNAWGLLSGAEARGPQDAYYFYYRVNELQA 424

Query: 299 LTVDKYKLIKGTIYKGVWDNWYGPSGREGAYN 330
           +    +KL+    Y+ +     G  G  GAY+
Sbjct: 425 VRDGDWKLVLPHNYRTMQGQEPGADGLPGAYD 456


>UniRef50_A3I0L2 Cluster: Arylsulfatase A; n=2; Bacteroidetes|Rep:
           Arylsulfatase A - Algoriphagus sp. PR1
          Length = 481

 Score =  111 bits (267), Expect = 4e-23
 Identities = 90/268 (33%), Positives = 128/268 (47%), Gaps = 20/268 (7%)

Query: 4   GVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTG 63
           G +  +   GL   E  + + LK  GY T +VGKWHLG ++  +LP  +GFDS+ G    
Sbjct: 106 GALDHSAKHGLNPEETTIAEMLKANGYATGIVGKWHLG-HQAPFLPTEQGFDSYYGLPYS 164

Query: 64  RIDMYDHTTMEQGSWGT-DFRRGFEVAHDLFGV-YATDVYTDEAIKVVNSHNKSEPLFLM 121
             DM+ H    +G +          V   L      T  YT++A++ + + +K +P FL 
Sbjct: 165 N-DMWPHHPEVKGYYPPLPLYENTAVIDTLDDQSMLTTNYTEKALEFIEN-SKDKPFFLY 222

Query: 122 LAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLL 181
           LAHS  H          P  + D FK    S    +  V+ ++D SVG+V   L   GL 
Sbjct: 223 LAHSMTH---------VPLYVSDKFK--GKSEHGLYGDVMMEVDWSVGQVRNKLDELGLA 271

Query: 182 ENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAG-FLWSPLLDSKARVAY 240
           EN+IV+F++DNG P   +  +A     LK  K T W+GG+R  G F+W P      +V  
Sbjct: 272 ENTIVIFTSDNG-PWLSYGGHAGLTGGLKEGKGTSWDGGIREPGIFVW-PDHFPAGKVET 329

Query: 241 QKMHISDWLPTLYSAAGGDLSVLENLDG 268
           Q     D LPTL    G  L  L  +DG
Sbjct: 330 QAAMTIDILPTLAEITGSKLPELP-IDG 356


>UniRef50_A0YAF7 Cluster: Arylsulfatase A; n=1; marine gamma
           proteobacterium HTCC2143|Rep: Arylsulfatase A - marine
           gamma proteobacterium HTCC2143
          Length = 479

 Score =  111 bits (267), Expect = 4e-23
 Identities = 87/293 (29%), Positives = 137/293 (46%), Gaps = 24/293 (8%)

Query: 5   VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF-WTG 63
           V++     GLP  E  + + LK+  Y+T LVGKWHLG +   + PL+ GFD + G  ++ 
Sbjct: 111 VLFPTSTGGLPTTEITIAKALKEKDYRTALVGKWHLG-HLPGFQPLDHGFDEYFGIPYSN 169

Query: 64  RIDMYDH-------TTMEQGSWGTDFRRGFEVAHDLFGVYA-TDVYTDEAIKVVNSHNKS 115
             D+          T  + G +     +   +          T  YT EA+  +   N +
Sbjct: 170 DHDLKKELSYIQTITHAKDGDFNVPLMQNRSIIERPANQNTITKRYTQEAVSFIKK-NSN 228

Query: 116 EPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKAL 175
           +P FL LAHS      P+ P+ A     D F+   D  R  +  V+ ++D SVG+V+  L
Sbjct: 229 QPFFLYLAHSM-----PHVPLFAS----DQFRGSSD--RGLYGDVIEEIDWSVGQVLSTL 277

Query: 176 HTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSK 235
             +G+ EN++VVF++DN GP      +  S   LK  K T +EGG+R     W P    K
Sbjct: 278 SEQGISENTLVVFTSDN-GPWLIMGAHGGSAGLLKSGKGTSYEGGMREPAIFWWP-EKIK 335

Query: 236 ARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLH 288
             VA+      D  PT+ S AG D+    + DG +    + +   + R ++ +
Sbjct: 336 PAVAHNTASTLDLFPTIMSIAGIDMPSDRSYDGYDLSPTMFEQKSNERKNIFY 388


>UniRef50_A6LDP6 Cluster: Arylsulfatase A; n=1; Parabacteroides
           distasonis ATCC 8503|Rep: Arylsulfatase A -
           Parabacteroides distasonis (strain ATCC 8503 / DSM 20701
           / NCTC11152)
          Length = 452

 Score =  110 bits (265), Expect = 6e-23
 Identities = 82/286 (28%), Positives = 135/286 (47%), Gaps = 20/286 (6%)

Query: 2   QHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF- 60
           Q  V++     GLP  E  + + LK  GY T  +GKWHLG +  EY+PL  GFD   G+ 
Sbjct: 96  QRRVLFPDSKGGLPSEELTIAELLKQAGYHTACIGKWHLG-HLPEYMPLRHGFDYFYGYP 154

Query: 61  WTGRIDMYDHTTMEQGSWGTDF---RRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEP 117
           ++  +   +   +    +  ++    +  E+  +      T   T+ AI+ + S N++ P
Sbjct: 155 YSNDMSRKEQIKLGNTKYPYEYIIYEQEKELEREPQQYNLTQQVTEAAIRYIKS-NENSP 213

Query: 118 LFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHT 177
            FL LAH       P+ P+ A      +  +   SAR K+   + +LD SVG++++ L +
Sbjct: 214 FFLYLAHPM-----PHMPVYA------STDFQGKSARGKYGDTVEELDWSVGQILQTLKS 262

Query: 178 RGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKAR 237
            GL +N++V+F++DN GP         S  PLK  K +++EGG R    +W  ++  K  
Sbjct: 263 EGLDKNTLVIFTSDN-GPWLLCKQEGGSPGPLKDGKASMFEGGFRVPCIMWGAMV--KPG 319

Query: 238 VAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPR 283
                    D LPT    AG  L    + DG++  + L   +   R
Sbjct: 320 YITDMASTLDLLPTFCEIAGIPLPSDRHYDGISLLNVLKDKSTCKR 365


>UniRef50_A6KZI6 Cluster: Sulfatase; n=2; Bacteroides|Rep: Sulfatase
           - Bacteroides vulgatus (strain ATCC 8482 / DSM 1447 /
           NCTC 11154)
          Length = 473

 Score =  110 bits (265), Expect = 6e-23
 Identities = 82/275 (29%), Positives = 136/275 (49%), Gaps = 28/275 (10%)

Query: 21  LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRID---MYDHTTMEQGS 77
           + + L++ GY+   +GKWHLG  +    PL++GF  +VG           Y +   ++  
Sbjct: 130 MAEALQEQGYQCGHIGKWHLGDDEDGTGPLSQGFIWNVGGNRAGAPYSYFYPYCLPDKSK 189

Query: 78  WGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIR 137
                  G      + G Y TD  T+EA+  + SH++  P FL L+H AVH+      ++
Sbjct: 190 CHVGLEEG------ILGEYLTDRLTEEAVSFIKSHSEG-PFFLHLSHHAVHT-----VLQ 237

Query: 138 APQKLIDAFKYIDDSARQK---FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGG 194
           AP  LI+ ++        K   +AA++ KLD+SVG++ + + T G+ + +IV+F +DNGG
Sbjct: 238 APDSLINKYRNKTPGKYHKNPIYAAMIEKLDDSVGRICQVIKTLGIADRTIVIFYSDNGG 297

Query: 195 PAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWLPTLY 253
                ++    NYPL G K   +EGG R    + W+  ++   R +     + D+ PT  
Sbjct: 298 -----SEPVTDNYPLNGGKGMPYEGGSRVPLIIRWTGKIEGGIRSSVPITGV-DFYPTFV 351

Query: 254 SAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLH 288
           + A G +    NLDG + +  L  N E+ R    H
Sbjct: 352 TLAQGKIPA--NLDGKDIF-TLINNNETERDLFWH 383


>UniRef50_Q7UKJ5 Cluster: Arylsulfatase A; n=3; Bacteria|Rep:
           Arylsulfatase A - Rhodopirellula baltica
          Length = 489

 Score =  108 bits (260), Expect = 3e-22
 Identities = 86/285 (30%), Positives = 134/285 (47%), Gaps = 31/285 (10%)

Query: 2   QHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF- 60
           QH V++     GL  +E  +  +LK  GY T  VGKWHLG +K E LP + GFDS+ G  
Sbjct: 115 QH-VLFPQSTYGLHPDEVTIADHLKSAGYATACVGKWHLGHHK-ETLPTSNGFDSYYGIP 172

Query: 61  -----------WTGRIDMYDHTTMEQGS---WGTDFRRGFEVAH-DLFGVYATDVYTDEA 105
                        G++   D  T +  +   W T   +  E+    +     T  YTD A
Sbjct: 173 YSNDMNHPDNKRLGKMSSDDRWTDQSSAVTLWNTPLVQDEEIIELPVDQRTVTRRYTDRA 232

Query: 106 IKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLD 165
           I+ V + N+ +P FL L HS      P+ P+  P+   D +   D   +  +  V+  +D
Sbjct: 233 IEFVEA-NQDKPFFLYLPHSM-----PHIPLYVPE---DVY---DPDPQNAYKCVIEHID 280

Query: 166 ESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAG 225
             VG++V+ +   GL E +++V+++DN GP   F ++  S  PL+  K T +EGG R   
Sbjct: 281 TEVGRLVQTVRDLGLSEKTLIVYTSDN-GPWLQFKNHGGSAGPLRAGKGTTFEGGQRVPC 339

Query: 226 FLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
            +W+P        +       D LPT+ S  G  L     +DG++
Sbjct: 340 IMWAPGRIPAGTSSNAFATNMDLLPTIASFTGVALENDRKIDGID 384


>UniRef50_A6DPC8 Cluster: Arylsulfatase A; n=1; Lentisphaera
           araneosa HTCC2155|Rep: Arylsulfatase A - Lentisphaera
           araneosa HTCC2155
          Length = 598

 Score =  108 bits (259), Expect = 3e-22
 Identities = 87/295 (29%), Positives = 140/295 (47%), Gaps = 44/295 (14%)

Query: 5   VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF---- 60
           V Y    +GL  +E  + + LK  GY+T ++GKWHLG  + ++LP N+GFDS+ G     
Sbjct: 93  VYYPGASQGLKPSEITIAEVLKTAGYRTSIIGKWHLGD-RNQFLPTNQGFDSYFGIPFSN 151

Query: 61  --WTGR-------IDMYDHTTMEQGSWGTDFR------RGFEVA---------HDLFGVY 96
             W  +       I ++   T+EQ   G   +      RG +V          + +   Y
Sbjct: 152 DMWMSKDLALADDIKLFGGVTVEQIKSGEASKAVKGEKRGGKVPLMRDEEVVEYPVDQTY 211

Query: 97  ATDVYTDEAIKVVN-SHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQ 155
            T  YTDEA+K++  S  K +P F+ LA++      P+ P+ A  K      +   SAR 
Sbjct: 212 ITQRYTDEALKIIKESEKKKQPYFIYLAYAM-----PHVPLYASPK------FAGKSARG 260

Query: 156 KFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNT 215
            +   + ++D  VG+++K L + G  +N++V+F++DNG    G  +   S  PL+G K +
Sbjct: 261 PYGDTVEEMDYHVGRILKHLKSSGADKNTLVIFTSDNGPWNLG--ERGGSALPLRGAKFS 318

Query: 216 LWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
            +EGG R    +W P        + +     D++PT    A   L     LDG N
Sbjct: 319 TYEGGHRVPCVMWWPGTIPAGTDSAEIATTLDFMPTFAKLANAQLP-NRTLDGKN 372


>UniRef50_A6DSP6 Cluster: Sulfatase; n=1; Lentisphaera araneosa
           HTCC2155|Rep: Sulfatase - Lentisphaera araneosa HTCC2155
          Length = 512

 Score =  107 bits (258), Expect = 5e-22
 Identities = 86/279 (30%), Positives = 132/279 (47%), Gaps = 40/279 (14%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           GLP ++ ++ + LK LGY   ++GKWH+G +     P  RG+D   GF  G  D Y   T
Sbjct: 106 GLPQSQSMISEELKTLGYTNGMIGKWHMG-FDMSLRPNQRGYDFFYGFINGSHD-YTEWT 163

Query: 73  ME----QGSWGTDFRRGFEVAH-----DLF---GV------YATDVYTDEAIKVVNSHNK 114
            E    +  W        E A+     D+F   GV      Y TD++TDEA+  ++  N 
Sbjct: 164 QEFAKGKSRWPIFRNEEMEPANKAQYIDVFKEKGVKVVDENYLTDLFTDEAVNFID-RNA 222

Query: 115 SEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYI-DDSARQKFAAVLSKLDESVGKVVK 173
            +P FL LA++AVH      P +  Q  +D   ++ DD     FA+++  +DE +GKV+K
Sbjct: 223 DKPFFLYLAYNAVH-----HPWQTTQHALDKTAHLKDDKNYHVFASMVYAMDEGIGKVMK 277

Query: 174 ALHTRGLLENSIVVFSTDNGGP-AAGFNDN------------AASNYPLKGVKNTLWEGG 220
            L  + + +N+I++F +DNG P   G   +             +S    +G K   +EGG
Sbjct: 278 KLKEKNIDDNTIIIFLSDNGSPQGQGIEHSPKDPNRHRGGFTMSSTGIFRGYKGDTYEGG 337

Query: 221 VRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGD 259
           +R    +  P    K       +   D  PTL  AAGG+
Sbjct: 338 IRVPFCIKWPQQIQKGTKYDMPISALDLQPTLVKAAGGN 376


>UniRef50_A5FAW4 Cluster: Sulfatase precursor; n=1; Flavobacterium
           johnsoniae UW101|Rep: Sulfatase precursor -
           Flavobacterium johnsoniae UW101
          Length = 539

 Score =  107 bits (258), Expect = 5e-22
 Identities = 85/289 (29%), Positives = 135/289 (46%), Gaps = 32/289 (11%)

Query: 12  RGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTG-------- 63
           +GLP +E       K  GY T ++GKWHLG + K + PL+RGFD H GF+          
Sbjct: 173 QGLPKSEITFADLAKKQGYSTAIIGKWHLG-HTKGFFPLDRGFDYHYGFYQAFSLFAPED 231

Query: 64  ---------RIDMYDHTTMEQGSWGT-DFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHN 113
                      D  D T    G  GT   RR   +  +    Y T+ + +EA   ++  N
Sbjct: 232 NNPDIINHHHTDFTDKTIWGNGRVGTGQIRRDSTIIDEK--KYLTEKFAEEAEAFIDK-N 288

Query: 114 KSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVK 173
           K++P  L +  +A     P+ P +  +K  D F  + D  ++ + A++S LD+++G +  
Sbjct: 289 KNKPFLLYVPFNA-----PHTPFQVRKKYYDRFPNVKDENKRVYFAMISALDDAIGLIRA 343

Query: 174 ALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLD 233
            +   GL EN+++ F++DNGG    +   A +N PLKG K + +EGGV    F  S    
Sbjct: 344 KVKKEGLEENTLIFFASDNGGADYTY---ATTNAPLKGGKFSHFEGGV-NVPFALSWKGK 399

Query: 234 SKARVAYQKMHIS-DWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTES 281
            K    Y+    S D   T+ +     L      DGV+  D ++ N ++
Sbjct: 400 IKPHTIYKTPVSSLDIFSTIAAVTHSGLPKDRVYDGVDLVDVVNNNKQA 448


>UniRef50_A3HZ22 Cluster: Putative exported uslfatase; n=1;
           Algoriphagus sp. PR1|Rep: Putative exported uslfatase -
           Algoriphagus sp. PR1
          Length = 489

 Score =  107 bits (257), Expect = 6e-22
 Identities = 77/260 (29%), Positives = 130/260 (50%), Gaps = 14/260 (5%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWT-GRIDMYDHTT 72
           LPL E  + + +K  GY T  VGKWHLG  ++ + P ++GFD ++G    G+   Y    
Sbjct: 146 LPLEEITIAERMKAHGYGTLHVGKWHLG--EEGFYPEDQGFDVNIGGNDLGQPPSYFDPY 203

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
           +       +F     +     G + TD   DE +  + +  K +  F+  A  AVH+   
Sbjct: 204 LPAKP--REFYEITTLKPRKEGEFLTDREGDEVVNYIQNQ-KGKKFFVHWAPYAVHT--- 257

Query: 133 YEPIRAPQKLIDAFKYIDDSARQK--FAAVLSKLDESVGKVVKALHTRGLLENSIVVFST 190
             PI     L++ ++  +   ++   +AA++  +D++VGKV+  L   GL EN++V+F++
Sbjct: 258 --PIMGKPDLVEKYEQKEPGNQRNPVYAALVESVDQNVGKVLSELERMGLRENTLVIFTS 315

Query: 191 DNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLP 250
           DNGG    +++   +NYPLK  K   +EGG+R    +  P    +  V    +   DW+P
Sbjct: 316 DNGGLIGNYDNPITNNYPLKSQKGYPYEGGIRIPTIVSWPGKIPQGFVDETPIITMDWIP 375

Query: 251 TLYSAAGGDLSVLENLDGVN 270
           T+    G D   L  L+GV+
Sbjct: 376 TILDFMGED-PTLPELEGVS 394


>UniRef50_P34059 Cluster: N-acetylgalactosamine-6-sulfatase
           precursor; n=32; Deuterostomia|Rep:
           N-acetylgalactosamine-6-sulfatase precursor - Homo
           sapiens (Human)
          Length = 522

 Score =  107 bits (257), Expect = 6e-22
 Identities = 84/313 (26%), Positives = 140/313 (44%), Gaps = 22/313 (7%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           G+P +E++LP+ LK  GY + +VGKWHLG ++ ++ PL  GFD   G        YD+  
Sbjct: 116 GIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKA 174

Query: 73  MEQ----GSWGTDFRRGFEVAHDLFGVYA--TDVYTDEAIKVVNSHNKSEPLFLMLAHSA 126
                    W    R   E   +L    A  T +Y  EA+  +    +  P FL  A  A
Sbjct: 175 RPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDA 234

Query: 127 VHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIV 186
            H+     P+ A +       ++  S R ++   + ++D+S+GK+++ L    + +N+ V
Sbjct: 235 THA-----PVYASKP------FLGTSQRGRYGDAVREIDDSIGKILELLQDLHVADNTFV 283

Query: 187 VFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHIS 246
            F++DNG       +   SN P    K T +EGG+R     W P   +  +V++Q   I 
Sbjct: 284 FFTSDNGAALISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIM 343

Query: 247 DWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALTVDKYKL 306
           D   T  + AG        +DG+N    L +     R    +  D    + A T+ ++K 
Sbjct: 344 DLFTTSLALAGLTPPSDRAIDGLNLLPTLLQGRLMDRPIFYYRGDT---LMAATLGQHKA 400

Query: 307 IKGTIYKGVWDNW 319
              T +   W+N+
Sbjct: 401 HFWT-WTNSWENF 412


>UniRef50_A3ZUT0 Cluster: Arylsulphatase A; n=1; Blastopirellula
           marina DSM 3645|Rep: Arylsulphatase A - Blastopirellula
           marina DSM 3645
          Length = 457

 Score =  107 bits (256), Expect = 8e-22
 Identities = 96/303 (31%), Positives = 146/303 (48%), Gaps = 27/303 (8%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGS-YKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           LPL+EK + Q L   GY+  ++GKWHLG  +  EY P NRGFD  V      I  Y +  
Sbjct: 122 LPLDEKTIAQELHADGYQCAMIGKWHLGEEHGPEYRPQNRGFDRVVLSEHHGIFNYFYPF 181

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
           ++Q  W      G    +   G Y  D  TDEAI  V   N+  P FL L+H +VH G  
Sbjct: 182 VDQQKWPY---AGPLPGNP--GDYLPDRLTDEAIDFVRE-NRERPFFLYLSHWSVH-GRY 234

Query: 133 YEPIRAPQKLIDAFKYIDDSARQK-FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
           +    AP+ LI  ++      R   +AA++  +D SVG+++  L    L +N++ VF +D
Sbjct: 235 F----APESLIAKYRERGLEERPAIYAAMMETVDNSVGRLMATLDELNLADNTLFVFMSD 290

Query: 192 NGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPT 251
           NGG      +   S  PL+G K +L+EGGVR    +  P +          +   D  PT
Sbjct: 291 NGG------ERITSMAPLRGSKGSLYEGGVRVPLIVRYPGVVKPNTTCSVPVISHDLFPT 344

Query: 252 LYSAAGGDLSVLEN-LDGVNQWDALS-KNTESPRTSVLHNIDDIWG----IAALTVDKYK 305
               A  + S  +N LDG +    L+ + +E  R ++  +    WG     +A+   ++K
Sbjct: 345 FLDFA--ERSYRDNKLDGHSIAGLLTGEQSELDRDALYWHFPHYWGSTRPCSAMRQGRWK 402

Query: 306 LIK 308
           L++
Sbjct: 403 LVE 405


>UniRef50_A6DR20 Cluster: N-acetyl-galactosamine-6-sulfatase; n=1;
           Lentisphaera araneosa HTCC2155|Rep:
           N-acetyl-galactosamine-6-sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 608

 Score =  106 bits (255), Expect = 1e-21
 Identities = 75/206 (36%), Positives = 105/206 (50%), Gaps = 23/206 (11%)

Query: 21  LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGT 80
           L +  K+ GYKT   GKWHLG  K  Y PL  GFD  +  W G            GS+  
Sbjct: 127 LGKVFKNAGYKTAHFGKWHLG--KSPYSPLEHGFDIDIPHWPG--------PGPAGSFVA 176

Query: 81  DFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQ 140
            +R       +  G +  D   DE  K + S NK +P F+     +VH+     P  A Q
Sbjct: 177 PWRYP-NFKENYPGEHIDDRLGDEIAKYI-SENKDQPFFINFWQFSVHA-----PFNAKQ 229

Query: 141 KLIDAF-KYIDDSARQK---FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPA 196
           +LID + K ID +  Q    +AA++  +D+S+GKV+ AL T  L+E +I+VF +DNGG  
Sbjct: 230 ELIDKYRKLIDKNNPQHNPVYAAMVESMDDSIGKVIDALETNKLMEKTIIVFFSDNGGNI 289

Query: 197 AGFND--NAASNYPLKGVKNTLWEGG 220
               D   A SN P +G K +++EGG
Sbjct: 290 HSVVDGTTATSNKPFRGGKASIYEGG 315


>UniRef50_A6DGX5 Cluster: N-acetylgalactosamine 6-sulfatase; n=2;
           Lentisphaera araneosa HTCC2155|Rep:
           N-acetylgalactosamine 6-sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 446

 Score =  106 bits (254), Expect = 1e-21
 Identities = 83/292 (28%), Positives = 140/292 (47%), Gaps = 24/292 (8%)

Query: 26  KDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGTDFRRG 85
           K   Y+T L+GKWHLG     + P  RGF+   GF    +D Y     E    G  +   
Sbjct: 116 KSNNYRTSLIGKWHLGLQSPNH-PNERGFEIFHGFLGDMMDDY----WEHTRHGVAYMYH 170

Query: 86  FEVAHDLFGVYATDVYTDEAIKVVNSHNKS-EPLFLMLAHSAVHSGNPYEPIRAPQKLID 144
              A +  G +AT+++T+ AI+ +    K   P F  L+++A     P++PI  P+K  +
Sbjct: 171 NSTAVETKGTHATELFTNWAIEEIKQAQKDPRPFFQFLSYNA-----PHDPIHPPKKYYE 225

Query: 145 AFKYIDDSA---RQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFND 201
            FK    +    R K   ++  LD S+G+V+  L+   + +N++V+F++DNGG       
Sbjct: 226 YFKKKQPNTSEKRAKIGGLIEHLDYSIGRVLDTLNELEIDKNTLVIFTSDNGGKI----K 281

Query: 202 NAASNYPLKGVKNTLWEGGVR-GAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDL 260
             A N  L+  K  ++EGG+R    F W   + SK+   ++ M + D++PTL  A   D 
Sbjct: 282 YGADNGELRADKTHMYEGGLRVCTSFTWPEKIRSKSLSDFRAMTM-DFMPTLLDAVNIDY 340

Query: 261 SVLENLDGVNQWDAL--SKNTESPRTSVLHNIDDIWGIAALTVDKYKLIKGT 310
           S   ++DG +    L   +     +    +    I+   AL +D +KL+  +
Sbjct: 341 S--GHMDGKSFLPELLFGQQENFTKRKQFYTWLQIYKKHALRIDDWKLVNNS 390


>UniRef50_Q7UMZ5 Cluster: N-acetylgalactosamine-6-sulfate sulfatase;
           n=1; Pirellula sp.|Rep: N-acetylgalactosamine-6-sulfate
           sulfatase - Rhodopirellula baltica
          Length = 484

 Score =  105 bits (253), Expect = 2e-21
 Identities = 82/279 (29%), Positives = 145/279 (51%), Gaps = 22/279 (7%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           GLP N   L + L  +GY+T L GKWHLG Y+ ++ P+  GFD  +    G +D Y +  
Sbjct: 131 GLPANRPTLAKRLSSVGYETALFGKWHLG-YEAKFSPMMHGFDEALYCIGGAMDYYHY-- 187

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHSGN 131
           ++  +    F  G  ++ +    Y TD  TD+A++ +   N ++ P FL L ++A H+  
Sbjct: 188 LDSVATYNLFHNGRPISGE---GYFTDTITDQAVRFIGDRNANDKPFFLYLPYTAPHT-- 242

Query: 132 PYE-PIRAPQKL--IDAFKYIDDSARQK-FAAVLSKLDESVGKVVKALHTRGLLENSIVV 187
           PY+ P  +P     ID+  +  ++     + A++  +DE +GKV+ A+    + + ++V+
Sbjct: 243 PYQAPGESPVDPLPIDSPLWKQNADPPGVYRAMVRHMDEGIGKVLHAIEESKMTDRTLVI 302

Query: 188 FSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISD 247
           F++DNGG +A  N+      PL+G K   +EGG+R       P    +  V+ Q     D
Sbjct: 303 FASDNGGTSASRNE------PLRGFKGQAFEGGIRVPLIARWPGHLPEGVVSDQVTITFD 356

Query: 248 WLPTLYSAAGGDLSVLENLDGVNQWDALSKNTE--SPRT 284
              ++ +AAG   +  + ++G++   +L+ N E   PRT
Sbjct: 357 LTASMLAAAGITPTQEDAMEGIDVL-SLAANDEPVQPRT 394


>UniRef50_A7AKS6 Cluster: Putative uncharacterized protein; n=1;
           Parabacteroides merdae ATCC 43184|Rep: Putative
           uncharacterized protein - Parabacteroides merdae ATCC
           43184
          Length = 464

 Score =  105 bits (253), Expect = 2e-21
 Identities = 74/212 (34%), Positives = 109/212 (51%), Gaps = 20/212 (9%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTG-RIDMYDHT 71
           GLP +E++LP  LK   Y+T  +GKWHLGS   +  P  +GFD+  G   G R   YD  
Sbjct: 110 GLPDDEELLPALLKRYDYRTGCIGKWHLGSEPSQR-PNAKGFDTFYGLLAGHRSYFYDPE 168

Query: 72  TMEQGSWGTDFR-RGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSG 130
           T ++      ++  G +++ D    Y TD    +A + V      +P  L ++ +A HS 
Sbjct: 169 TSDKDGNLQQYQYNGRKLSFD---GYFTDELASKAQQFVTE--SEQPFMLYMSFTAPHSP 223

Query: 131 NPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFST 190
           N      A ++ +  F   +   RQK+AA++  LD  VGK+V  L   G  +N+I+ F +
Sbjct: 224 N-----EATEEDLARF---EGQPRQKYAAMMYALDRGVGKIVDELKAAGKFDNTIIFFLS 275

Query: 191 DNGGPAAGFNDNAASNYPLKGVKNTLWEGGVR 222
           DNGG       N +SN PLKG K   +EGG R
Sbjct: 276 DNGGSTT----NQSSNLPLKGFKGNKFEGGQR 303


>UniRef50_A6DMX7 Cluster: N-acetyl-galactosamine-6-sulfatase; n=1;
           Lentisphaera araneosa HTCC2155|Rep:
           N-acetyl-galactosamine-6-sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 578

 Score =  105 bits (253), Expect = 2e-21
 Identities = 79/270 (29%), Positives = 136/270 (50%), Gaps = 25/270 (9%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
           L  N   + + +K  GY+T   GKWHLG   + Y PL  GFD  +   TG      +   
Sbjct: 126 LDTNFPTIGKMMKQAGYETGHFGKWHLGP--EPYSPLQHGFDVDIPHHTGAGPGKSYVA- 182

Query: 74  EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPY 133
               W  +      +  +    Y  D   +E +K V+  +  +P F+     +VH+    
Sbjct: 183 ---PWSQE-----HIKPNYEKEYIEDRMVEECLKWVDGLSGDKPFFMNYWMFSVHA---- 230

Query: 134 EPIRAPQKLIDAFK-YIDDSARQK---FAAVLSKLDESVGKVVKALHTRGLLENSIVVFS 189
            P  A Q+LID +K  ID +++Q+   +AA++  LD++VG +++ L +RGL++N++++F+
Sbjct: 231 -PFDAKQELIDKYKKVIDPNSKQRSALYAAMVQSLDDAVGALLEGLESRGLMDNTVIIFT 289

Query: 190 TDNGGPAAGFNDNA---ASNYPLKGVKNTLWEGGVR-GAGFLWSPLLDSKARVAYQKMHI 245
           +DNGG      D      SN+PL G K ++ EGGVR     +W  +  + +R + + +  
Sbjct: 290 SDNGGNIYSQLDEGIVPTSNFPLSGGKASMCEGGVRVPCTVVWPGVTKAGSR-SDEIVQT 348

Query: 246 SDWLPTLYSAAGGDLSVLENLDGVNQWDAL 275
           SD+  T+   +G  L     +DG++   AL
Sbjct: 349 SDFYTTIIKGSGIALPEGHVVDGIDIRPAL 378


>UniRef50_A6CGG6 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
           Planctomyces maris DSM 8797|Rep: N-acetylgalactosamine
           6-sulfatase - Planctomyces maris DSM 8797
          Length = 461

 Score =  105 bits (252), Expect = 2e-21
 Identities = 80/259 (30%), Positives = 127/259 (49%), Gaps = 23/259 (8%)

Query: 29  GYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGTDFRRGFEV 88
           GY+T ++GKWHLG  +    P  RGFD   GF     DM D   + +   G ++ R  + 
Sbjct: 129 GYQTAIIGKWHLG-LESPNTPNERGFDLFRGFLG---DMMDDYYLHRRH-GVNYMRRNQK 183

Query: 89  AHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHSGNPYEPIRAPQKLIDAFK 147
             D  G +ATD++TD   + +     SE P FL LA++A     P+ PI+ P+  +   K
Sbjct: 184 TVDPQG-HATDLFTDWTCEYLKQQATSESPFFLYLAYNA-----PHTPIQPPEDWLGKVK 237

Query: 148 YID---DSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAA 204
             +   D  R +  A++  LD  +GKV++ L    L +N+++VFS+DNGG         A
Sbjct: 238 QRETGIDPDRARLVALIEHLDAGIGKVIQTLDETKLSDNTLIVFSSDNGGQLG----VGA 293

Query: 205 SNYPLKGVKNTLWEGGVR-GAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVL 263
           +N  L+  K +++EGG++   G +W   +       +  M + D  PT+  A G  + V 
Sbjct: 294 NNGALRDGKQSMYEGGLKVPTGVVWKKHIAPHTETDFMAMSM-DIFPTVCEAGG--IKVP 350

Query: 264 ENLDGVNQWDALSKNTESP 282
             LD V+    L    + P
Sbjct: 351 SGLDAVSFLPTLQGRQQKP 369


>UniRef50_A0Z718 Cluster: Twin-arginine translocation pathway
           signal; n=1; marine gamma proteobacterium HTCC2080|Rep:
           Twin-arginine translocation pathway signal - marine
           gamma proteobacterium HTCC2080
          Length = 653

 Score =  105 bits (252), Expect = 2e-21
 Identities = 80/279 (28%), Positives = 131/279 (46%), Gaps = 21/279 (7%)

Query: 8   GAEPRGLPLNEKI--LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRI 65
           G  P G  ++ ++  LP  L+  GY TH VGKWHLG   ++  P+ +GFDS  GF    +
Sbjct: 120 GFRPAGRGISPEVITLPDMLRGAGYTTHHVGKWHLGFVSEQAWPIQQGFDSFFGFLDQFL 179

Query: 66  DMYDHT----TMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVV-NSHNKSEPLFL 120
               HT     +++ ++     +    A +    + +DV  +EAI ++    ++ +P F+
Sbjct: 180 LRGPHTGAGYNLKRPTYVNPLLQRDNGAFEKKSGHLSDVLVEEAIDLLARVKDQKQPWFI 239

Query: 121 MLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGL 180
                   +  P+ P+    +    F    D+   K+ A+L ++D +VG+V++ L    L
Sbjct: 240 -----NYWTYLPHTPLTPATRFASKF---PDTPEGKYNAMLMQVDAAVGRVLETLDASDL 291

Query: 181 LENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAY 240
             +++V+  +DNGG          SN P  GVKNT  EGG+R    +  P +  K  V  
Sbjct: 292 TRSTLVIVVSDNGGT----EKQLPSNQPFIGVKNTFTEGGLRTPLLMRWPEVIPKNMVID 347

Query: 241 QKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNT 279
           + +   D+ PTL S   G+ S    L G N W     NT
Sbjct: 348 ETVSYLDYFPTLESLVTGNTS--GGLPGRNLWPLFVDNT 384


>UniRef50_Q482D6 Cluster: Sulfatase family protein; n=2;
           Bacteria|Rep: Sulfatase family protein - Colwellia
           psychrerythraea (strain 34H / ATCC BAA-681)
           (Vibriopsychroerythus)
          Length = 492

 Score =  105 bits (251), Expect = 3e-21
 Identities = 90/301 (29%), Positives = 146/301 (48%), Gaps = 36/301 (11%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHV--GFWTGRIDMY-DH 70
           LPL+     ++LK+ GY+T  +GKWHLG  K+   P  +GFDS +  G W      Y  +
Sbjct: 108 LPLSAVTFGEHLKEAGYQTGYIGKWHLG--KEGGDPTKQGFDSSIMAGHWGAPPSYYFPY 165

Query: 71  TTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHS- 129
           T M +    +   +GF         Y TD  TDEA+  +    K +P  L+LAH AVH+ 
Sbjct: 166 TKMSK----SGKNKGFAKVEGSEEEYLTDRLTDEALTFI-EQKKDQPFLLVLAHYAVHTP 220

Query: 130 --GNP--YEPIRAPQKLI---------DAFKYIDDSARQK-------FAAVLSKLDESVG 169
             G P   +  +   K +         DA    D +   K       +AA++  +D SVG
Sbjct: 221 IEGKPALVKKYKTKMKKLGIANAGPKSDADLIKDSTGYHKTIQNNPDYAAMVESVDISVG 280

Query: 170 KVVKALHTRGLLENSIVVFSTDNGG-PAAGFNDN---AASNYPLKGVKNTLWEGGVRGAG 225
           ++ + L   GL +N+I++ ++D+GG  + G   N   A SN P +  K  +++GG R   
Sbjct: 281 RIEQQLKRLGLEDNTIIILTSDHGGLSSRGLKSNRVLATSNNPYRHGKGWIYDGGTRVPL 340

Query: 226 FLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTS 285
            +  P       ++  ++  +D  PT+   AG  LS  ++ DGV+   AL+ + E+PR +
Sbjct: 341 IVKWPEKVKAGSISQVQVTGTDHYPTILQMAGLSLSPKDHQDGVSYLAALNSD-ETPRKA 399

Query: 286 V 286
           +
Sbjct: 400 M 400


>UniRef50_Q7UPK7 Cluster: Arylsulphatase A; n=1; Pirellula sp.|Rep:
           Arylsulphatase A - Rhodopirellula baltica
          Length = 482

 Score =  104 bits (250), Expect = 4e-21
 Identities = 84/255 (32%), Positives = 127/255 (49%), Gaps = 26/255 (10%)

Query: 17  NEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQG 76
           +E  +   LKD GY T LVGKWH G     + PL+RGFD   GF+ G  D+        G
Sbjct: 139 DETTIADVLKDAGYATGLVGKWHTGR-GDGFHPLDRGFDEFEGFF-GSDDV--------G 188

Query: 77  SWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPI 136
            +   F    +++ D+   Y TD     AI+ V  H++  P FL LAH A     P+ P+
Sbjct: 189 YFRYPFSEQRQIS-DVDESYLTDDLNRRAIEFVRRHHE-HPFFLHLAHYA-----PHRPL 241

Query: 137 RAPQKLIDAFKYID-DSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNG-G 194
            AP ++I  ++    D +     A++  +D  +G+++  +   GL E++IV+F++DNG  
Sbjct: 242 EAPPEVIARYREQGFDESTATIYAMIEVMDRGIGELLAEIDDLGLSEDTIVLFASDNGPD 301

Query: 195 PAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWLPTLY 253
           P  G       N  L+G K  + EGG+R   F+ WS  L    R   Q +   D +PT+ 
Sbjct: 302 PLTG----ERFNRELRGTKYQVNEGGIRVPLFVRWSKRLAPGQR--DQMVTFVDLMPTIL 355

Query: 254 SAAGGDLSVLENLDG 268
                D+S+L  LDG
Sbjct: 356 DLCRVDVSMLNRLDG 370


>UniRef50_A6C3C8 Cluster: Putative uncharacterized protein; n=1;
           Planctomyces maris DSM 8797|Rep: Putative
           uncharacterized protein - Planctomyces maris DSM 8797
          Length = 600

 Score =  104 bits (249), Expect = 6e-21
 Identities = 101/354 (28%), Positives = 157/354 (44%), Gaps = 35/354 (9%)

Query: 17  NEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQG 76
           NE  + Q L+  GYKT L GKWHLG Y  +Y P  RGFD   G + G I+ Y +      
Sbjct: 113 NETTIAQVLQKAGYKTGLFGKWHLGRY-AQYQPQRRGFDHFFGHYHGHIERYTNPDQVVV 171

Query: 77  SWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPI 136
           +      RG          Y TD++TD AI  +   N+ +P F  LA++A HS    +  
Sbjct: 172 NGTPVETRG----------YVTDLFTDAAIDFI-QRNQQQPFFCYLAYNAPHSPFLLDTS 220

Query: 137 RAPQ----KLIDAFKYIDDSARQ-KFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
              Q    KLI+ +       R+ +  A++ ++D+++ ++++ +H   L + ++V+F++D
Sbjct: 221 HFGQPEGDKLIEKYLAKGLPLREARIYAMIERIDQNLSRLLQTVHDLKLDQETVVIFTSD 280

Query: 192 NGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWLP 250
           NGG + GF         LKG K + +EGG R    + W+    +  +     +  +D  P
Sbjct: 281 NGGVSRGFKAG------LKGSKASAYEGGTRVPFVVRWTDHFPA-GKTTDAMVAQTDLFP 333

Query: 251 TLYSAAGGDLSVLENLDGVNQWDALSK-NTESPRTSVLHNIDDIWGIAALTVDKYK--LI 307
           T    AG  +     LDG +    + +   +SP   + H  D        T + Y    I
Sbjct: 334 TFCQLAGVPVPSNVKLDGESILSLMEQGGGKSPHQYLYHTWD------RYTPNPYHRWAI 387

Query: 308 KGTIYKGVWDNWYGPSGREGAYNASLLYDSHAGRILDKLNLMPPKEKVMELRDE 361
            G  +K V  +  G   +EG      LYD        K       EKV ELR E
Sbjct: 388 HGPRFKLVGHDPQGKKKKEGEPQGQ-LYDLQEDPGEKKNVADQYPEKVSELRGE 440


>UniRef50_Q15XI1 Cluster: Sulfatase; n=2; Bacteria|Rep: Sulfatase -
           Pseudoalteromonas atlantica (strain T6c / BAA-1087)
          Length = 510

 Score =  103 bits (248), Expect = 7e-21
 Identities = 82/279 (29%), Positives = 138/279 (49%), Gaps = 33/279 (11%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
           LPL+E  L +  K  GY T  +GKWHLG   ++  P N+GFD ++           + + 
Sbjct: 131 LPLSEITLAEAFKQNGYNTAFLGKWHLGK-TEDLWPENQGFDVNIAGTKNGHPAAGYFSP 189

Query: 74  EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHS--G 130
            + +  TD  +G          Y T   T+EAI +V+ ++K   P F+ML+   VH+   
Sbjct: 190 YKNARLTDGPKG---------EYLTQRLTNEAISLVDKYSKQTVPFFMMLSFYTVHTPLA 240

Query: 131 NPYEPIRAPQKLIDAFKYIDDSARQK-------------------FAAVLSKLDESVGKV 171
            P + ++  Q  I  + + D+  R++                   +AA++ ++D  VG++
Sbjct: 241 APNKDVQEYQAKIRQYAHNDEFQREEQVWPTAEKREVRVKQNHPTYAAMVKQMDTQVGRL 300

Query: 172 VKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPL 231
           +  L   G+ E+++VVF++DNGG ++    +  SN PL+G K  L+EGG+R    +  P 
Sbjct: 301 LAKLKQAGMEESTLVVFTSDNGGLSSA-EGSPTSNLPLRGGKGWLYEGGIRVPLLVKLPQ 359

Query: 232 LDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
              K     + +  +D  PTL SA   DL   ++LDGV+
Sbjct: 360 KKHKHLQINEPVTSTDLYPTLLSAGHLDLLPQQHLDGVD 398


>UniRef50_A6DF72 Cluster: Putative secreted sulfatase ydeN; n=1;
           Lentisphaera araneosa HTCC2155|Rep: Putative secreted
           sulfatase ydeN - Lentisphaera araneosa HTCC2155
          Length = 481

 Score =  103 bits (248), Expect = 7e-21
 Identities = 78/267 (29%), Positives = 133/267 (49%), Gaps = 23/267 (8%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHV-GFWTGRIDMYDHTT 72
           L   E  L +  K  GYKT  +GKWHLG     + P N+GFD ++ GF  G    +    
Sbjct: 113 LTAEEITLAEAFKATGYKTVHIGKWHLGEESVSW-PENQGFDENIAGFRAGSPSAHGG-- 169

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNK-SEPLFLMLAHSAVHSGN 131
              G + + +     +     G Y T+    EA + + S  K  +P F+ L    VH+  
Sbjct: 170 ---GGYFSPYNNP-RLKDGPKGEYLTERLAQEASQYIQSTAKLKKPFFMNLWLYNVHT-- 223

Query: 132 PYEPIRAPQKLIDAFKYIDDSARQK----FAAVLSKLDESVGKVVKALHTRGLLENSIVV 187
              P++A Q+ ID +  +     Q     +AA++  +D++VG V++A+   G+ +N+I++
Sbjct: 224 ---PLQARQEKIDKYTRLIQKGYQHTNPVYAAMVEHMDDAVGTVMQAVKDAGIEDNTIII 280

Query: 188 FSTDNGGPAAGFNDN---AASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKM 243
           F++DNGG    + +N     SNYPL+  K  ++EGGVR    + WS  + +  + +   +
Sbjct: 281 FNSDNGGLRGNYENNRQKVTSNYPLRSGKGDMYEGGVRVPMIIKWSRKIKA-GQTSSSPV 339

Query: 244 HISDWLPTLYSAAGGDLSVLENLDGVN 270
              D  PTL      D+S  +++DG++
Sbjct: 340 ISHDIYPTLLDLCKIDVSKKQDIDGIS 366


>UniRef50_Q47Q78 Cluster: N-acetylgalactosamine-6-sulfate sulfatase;
           n=1; Thermobifida fusca YX|Rep:
           N-acetylgalactosamine-6-sulfate sulfatase - Thermobifida
           fusca (strain YX)
          Length = 471

 Score =  102 bits (245), Expect = 2e-20
 Identities = 85/329 (25%), Positives = 150/329 (45%), Gaps = 26/329 (7%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           ++  ++  +   G+P     L   L + GY T + GKWH G +   Y PL  GF++  G 
Sbjct: 89  LEEPLVTRSPENGIPEGHPTLSSLLVEAGYATAMFGKWHCG-WLPWYSPLRIGFETFFGN 147

Query: 61  WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
           + G +D ++H     G    D   G E   +  G Y T++ ++ A + + +H ++ P ++
Sbjct: 148 FDGALDYFEHVDT-LGK--ADLYEG-ETPVEEVGYY-TEIISERAAEYITAH-RNRPFYV 201

Query: 121 MLAHSAVH---SGNPYEPI------RAPQKLIDA-FKYIDDSARQKFAAVLSKLDESVGK 170
            L ++A H    G     +      R  Q+   +   ++D  +  K+  ++  +D  +G+
Sbjct: 202 QLNYTAPHWPWEGPDDHEVGQEIRRRYQQRWEHSPLMHLDGGSIAKYGELVEAMDAGIGQ 261

Query: 171 VVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSP 230
           V+ AL   G  +N+IVVFS+DNGG      +  + N+P  G K  L EGG+R    +  P
Sbjct: 262 VLAALDRAGAADNTIVVFSSDNGG------ERWSKNWPFVGEKGDLTEGGIRVPLIVAWP 315

Query: 231 LLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNI 290
              +  +V+   +   DW  TL +AAG +      LDGV+    L    + P   +    
Sbjct: 316 EAIAGNQVSDHPVITMDWTATLLAAAGTEPHPDWPLDGVDLLPWLVDGADFPAHDLFWRT 375

Query: 291 DDIWGIAALTVDKYKLIKGTIYKGVWDNW 319
            +     AL   ++K ++    + V  NW
Sbjct: 376 SN---QGALRRGRFKYLRDRRDRAVLGNW 401


>UniRef50_A6P2X1 Cluster: Putative uncharacterized protein; n=1;
           Bacteroides capillosus ATCC 29799|Rep: Putative
           uncharacterized protein - Bacteroides capillosus ATCC
           29799
          Length = 494

 Score =  102 bits (245), Expect = 2e-20
 Identities = 91/305 (29%), Positives = 139/305 (45%), Gaps = 36/305 (11%)

Query: 7   YGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRID 66
           Y  +  GLP +E +LP+ L+  GY+T LVGKWHLG  ++E  P NRGFD   G       
Sbjct: 162 YPYQNDGLPTDEILLPEVLQQAGYETALVGKWHLG-IREEERPYNRGFDLFYG------A 214

Query: 67  MYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVN---SHNKSEPLFLMLA 123
           +Y         +  D     EV HD    Y     T E  +V       N+  P FL  A
Sbjct: 215 LYSDDNDPHRIYHND-----EVVHD--EPYDQSGMTKELTQVAKQFIDDNQDGPFFLYYA 267

Query: 124 HSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLEN 183
                S  P+ P  A +      +++  S    +   + ++D SVG+++  L   GLLEN
Sbjct: 268 -----SPFPHWPSNASE------EWLGTSQAGIYGDCMQEVDWSVGEIMDTLEENGLLEN 316

Query: 184 SIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKM 243
           ++V+F++DNG     + D A      +G K+T + GG       + P    +  V    M
Sbjct: 317 TLVIFTSDNG----PWYDGATGGQ--RGRKDTNYNGGSHVPFIAYMPGTIPEGEVYDGLM 370

Query: 244 HISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALTVDK 303
              D  PT+ +  G +L     +DG++ W  L+  ++SPRT +  N D      AL  D 
Sbjct: 371 SGVDVFPTILNLLGIELPQDRVIDGMDMWPFLTGQSDSPRTELFLNKDK--DTFALIEDN 428

Query: 304 YKLIK 308
           +K ++
Sbjct: 429 FKYLE 433


>UniRef50_A6DMY9 Cluster: Putative uncharacterized protein; n=2;
           Lentisphaera araneosa HTCC2155|Rep: Putative
           uncharacterized protein - Lentisphaera araneosa HTCC2155
          Length = 590

 Score =  102 bits (245), Expect = 2e-20
 Identities = 85/277 (30%), Positives = 129/277 (46%), Gaps = 30/277 (10%)

Query: 12  RGLPL---NEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRI-DM 67
           RGL +    E  + +  K  GY+T L GKWH G +     P  +GFD + GF  G I D 
Sbjct: 96  RGLEIMRSEEATIAEVFKAQGYETGLFGKWHNGEHYPNNPP-GQGFDEYFGFCAGHIGDF 154

Query: 68  YDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAV 127
           +D T         D  + F         + TDV TD AI  +    + +P F  + ++A 
Sbjct: 155 FDATL--------DHNKTFVKTKG----FITDVLTDRAIDWIEKQ-QDKPFFAYIPYNA- 200

Query: 128 HSGNPYEPIRAPQKLIDAFKYIDDSARQKFA-AVLSKLDESVGKVVKALHTRGLLENSIV 186
               P+ P +   K  D F     SA    A  ++  LD+++G+++K L    L +N+IV
Sbjct: 201 ----PHAPYQVEDKYYDEFAAKGYSAAHSAAYGMIENLDDNIGRLLKILDDLNLTDNTIV 256

Query: 187 VFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHIS 246
           +F TDNG      N     N  +KG K ++ EGGVR   F+  P   +K R  +      
Sbjct: 257 IFLTDNGP-----NSPTRFNGGMKGSKGSVDEGGVRVPFFIRWPGKIAKGRTIHDLAAHI 311

Query: 247 DWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPR 283
           D LPTL   AG ++ +   LDG     +L  ++++P+
Sbjct: 312 DVLPTLMELAGVNVDLPNKLDG-RSLTSLISSSKTPK 347


>UniRef50_A6C8S3 Cluster: Arylsulphatase A; n=1; Planctomyces maris
           DSM 8797|Rep: Arylsulphatase A - Planctomyces maris DSM
           8797
          Length = 481

 Score =  102 bits (245), Expect = 2e-20
 Identities = 97/340 (28%), Positives = 152/340 (44%), Gaps = 47/340 (13%)

Query: 2   QHGVIYGAEPRG--------LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRG 53
           +HGV Y   P G        +  +E +L + LK+ GY T  +GKWHLG +  EY P   G
Sbjct: 103 RHGVWYNPAPDGQQFRSGVGIAESELLLSELLKENGYATICIGKWHLG-HDPEYYPTRHG 161

Query: 54  FDSHVGFWTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHN 113
           FD ++G      DM     M+    G        + + +     T  YT+ A+K +   N
Sbjct: 162 FDDYLGILYSN-DMRPVNLMQ----GEKL-----LEYPVIQANLTKRYTERAVKFIQE-N 210

Query: 114 KSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVK 173
           +  P FL L H+      P++P+ A +       +   S    +  V+++LD SVG++ K
Sbjct: 211 QEGPFFLYLPHAM-----PHKPLAASEA------FYKKSGAGLYGDVIAELDWSVGEIFK 259

Query: 174 ALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLD 233
            L    L EN++V+F++DNG     F  N A    L G+K+T WEGG+R       P   
Sbjct: 260 TLRELNLDENTLVIFASDNG---PWFGGNTAG---LSGMKSTTWEGGLRVPMIARWPGKI 313

Query: 234 SKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDI 293
              +V        D  PT+   AG  +     +DG + +  L+K   +P  ++      +
Sbjct: 314 PPRQVIDTVCGSIDVFPTILKQAGIPVPADRVIDGKDLFPVLTKQAPTPHQALY----SM 369

Query: 294 WGIAALTV--DKYKL-IKGT---IYKGVWDNWYGPSGREG 327
            G +  TV    +KL +K +   +  G   NW  P G +G
Sbjct: 370 KGNSLFTVRSGPWKLHVKPSPRQVLAGKGKNWIDPRGPDG 409


>UniRef50_A6LHS9 Cluster: Arylsulfatase; n=4; Bacteroidetes|Rep:
           Arylsulfatase - Parabacteroides distasonis (strain ATCC
           8503 / DSM 20701 / NCTC11152)
          Length = 471

 Score =  102 bits (244), Expect = 2e-20
 Identities = 90/313 (28%), Positives = 144/313 (46%), Gaps = 31/313 (9%)

Query: 21  LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGT 80
           L + +K  GY T + GKW LG+     +P   GFD   G+   R     H+      W  
Sbjct: 116 LGKLMKSAGYTTGIFGKWGLGNPGSVSIPNKMGFDEFYGYNCQR---QSHSFYPDHLWHN 172

Query: 81  DFRRGF-EVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHS--GNPYEPI- 136
           + +  F E  ++    Y+ D+  ++A+K +  H K +P F ML ++  H+    P++ I 
Sbjct: 173 EEKVLFPENENNACKTYSQDLIHEQALKFIRDH-KEQPFFAMLTYTLPHAELNLPHDSIY 231

Query: 137 RAPQKLIDAFKYI------------DDSARQKFAAVLSKLDESVGKVVKALHTRGLLENS 184
           +  +   +   YI             +     FAA++S+LD+ VG V+  L   GL +N+
Sbjct: 232 KMYENSFEETPYIGKFDKVYGGYNTSEKPLASFAAMVSRLDKYVGDVMAELKELGLDKNT 291

Query: 185 IVVFSTDNGGPAAGFND-NAASNY-PLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQK 242
           IV+F++DNG    G  D +   +Y P +G+K  ++EGG+R     W P      +   Q 
Sbjct: 292 IVIFTSDNGPHHEGGADPDFFKSYGPFRGIKRDVYEGGIRIPMVAWCP---GTIKAGAQS 348

Query: 243 MHIS---DWLPTLYSAAGGDLSVLENLDGVNQWDA-LSKNTESPRTSVLHNIDDIWGIAA 298
            HIS   D +PTL    G  L   E  DG++     LSK  +     +     ++ G  A
Sbjct: 349 DHISAFWDVMPTLAELTGTVLP--EKTDGISFLPTLLSKKDQQAHDYLYWEFHELNGREA 406

Query: 299 LTVDKYKLIKGTI 311
           L   K+KLI+  I
Sbjct: 407 LRSGKWKLIRQPI 419


>UniRef50_A6C1V3 Cluster: Putative secreted sulfatase ydeN; n=1;
           Planctomyces maris DSM 8797|Rep: Putative secreted
           sulfatase ydeN - Planctomyces maris DSM 8797
          Length = 470

 Score =  102 bits (244), Expect = 2e-20
 Identities = 72/255 (28%), Positives = 122/255 (47%), Gaps = 20/255 (7%)

Query: 21  LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGT 80
           LP+ L+  GY+T  VGKWHLG   +  LP + GFD ++      +    H      +   
Sbjct: 132 LPEALRTAGYQTFHVGKWHLGG--RGNLPQDHGFDVNISGTNRGLPRSYHFPYGGDAMKW 189

Query: 81  DFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQ 140
           D         D    Y TD   DEA+ ++    + +P FL  +  +VHS     PI+   
Sbjct: 190 DSSLTEAERQDR---YLTDRMADEAVALIR-QQQDKPFFLYCSFYSVHS-----PIQGRP 240

Query: 141 KLIDAFKYIDDSARQK---FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAA 197
            L+  +K +    R K   +AA++  +DE++G+V   L   G+ + +++VF++DNG    
Sbjct: 241 DLVKKYKGLPAGKRHKNPEYAAMIQSVDEAIGRVRAQLKESGIADRTLIVFTSDNG---- 296

Query: 198 GFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAG 257
           G     ++N PL+G K   WEGG R    +  P +     V  + +   D+ PT+ +  G
Sbjct: 297 GVRRKTSNNDPLRGEKGQHWEGGTRVPAIVLWPGVTPAGSVCAEPIITMDFYPTILNITG 356

Query: 258 --GDLSVLENLDGVN 270
             G+    +++DG++
Sbjct: 357 VAGNTEHNQSVDGLS 371


>UniRef50_A6DHS2 Cluster: N-acetylgalactosamine-6-sulfate sulfatase;
           n=1; Lentisphaera araneosa HTCC2155|Rep:
           N-acetylgalactosamine-6-sulfate sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 447

 Score =  101 bits (243), Expect = 3e-20
 Identities = 93/310 (30%), Positives = 141/310 (45%), Gaps = 31/310 (10%)

Query: 12  RGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDH- 70
           RG+   E   P+ +K   Y T + GKWH+G YK E+ P+N GFD  VGF +G ID   H 
Sbjct: 102 RGIRDEEWTFPEAMKSADYATAVFGKWHIG-YKAEFHPMNHGFDEFVGFISGNIDAQSHY 160

Query: 71  TTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHS- 129
             M    W        E  H       +D+ T+ ++  +   NK +P FL +AH   HS 
Sbjct: 161 DRMSTFDWWQARELKDEKGHH------SDLITEHSLDFI-ERNKEKPFFLYVAHGTPHSP 213

Query: 130 --GNPYEPIRAPQK-LIDAF----KYI----DDSARQKFAAVLSKLDESVGKVVKALHTR 178
                 +  R P K  + A+    +Y     DD+   K   +   +DE V +++  L   
Sbjct: 214 FQARGSKIQRGPNKGQVPAWAPKIEYSKTPGDDNWLMKHFTL--PVDEGVNRILDKLVEL 271

Query: 179 GLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARV 238
            + +N+IV F +DNG  AA  N + + N   +G K +++EGG R    +W+P       V
Sbjct: 272 KIDKNTIVWFLSDNG--AAKGNHSHSEN--TRGAKGSMYEGGHRVPALVWAPGRIKAGSV 327

Query: 239 AYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTE-SPRTSVLHNIDDIWGIA 297
           + Q M   D   +   AAG  +     LDGV+    +  N + + RT +  N     G  
Sbjct: 328 SDQTMMTFDITASSIKAAGVAIPANHQLDGVDIHPTVFNNKKLNERTLIWENGK---GSG 384

Query: 298 ALTVDKYKLI 307
           AL    +KL+
Sbjct: 385 ALRKGPWKLV 394


>UniRef50_A6C4V9 Cluster: Sulfatase; n=1; Planctomyces maris DSM
           8797|Rep: Sulfatase - Planctomyces maris DSM 8797
          Length = 480

 Score =  101 bits (241), Expect = 5e-20
 Identities = 78/267 (29%), Positives = 128/267 (47%), Gaps = 20/267 (7%)

Query: 12  RGLPLNEKILPQYLKDLGYKTHLVGKWHLG--SYKKEYLPLNRGFDSHVGFWTGRIDMYD 69
           +GL  +E    + LK  GY+T L+GKWH G      E+ P N GFD+ VG+ +G ID   
Sbjct: 118 KGLRKSENTFAELLKQAGYRTALIGKWHQGYPHNSAEFHPDNHGFDTFVGYHSGNIDFIS 177

Query: 70  HT-TMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVH 128
           H     +  W    +   E        Y+T +    A++ +   ++++P  L LAH A+H
Sbjct: 178 HVGDHVKHDWWHGRKETQETG------YSTHLINQYALQFI-KESRNQPFCLYLAHEAIH 230

Query: 129 S--GNPYEPIRAPQKL-IDAFKYIDDSAR-QKFAAVLSKLDESVGKVVKALHTRGLLENS 184
           +    P +PIR  +      +K   ++ R +KF  +   +D  VG++ + L   GL +N+
Sbjct: 231 NPVQVPGDPIRRTEAAGWKRWKPASEAERIEKFRGMTLPVDAGVGQIREFLVKSGLDKNT 290

Query: 185 IVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSP-LLDSKARVAYQKM 243
            V+F +DN GP+  F   +      +G K +++EGG R     W P  + +        +
Sbjct: 291 FVLFFSDN-GPSRDFPSGSPK---WRGAKGSVYEGGHRVPAIAWWPGKIQAGTETDVPAI 346

Query: 244 HISDWLPTLYSAAGGDLSVLENLDGVN 270
            + D +PTL   A  D+     LDGV+
Sbjct: 347 SL-DVMPTLLGIAHIDMPKERPLDGVD 372


>UniRef50_A6CBI6 Cluster: Putative uncharacterized protein; n=1;
           Planctomyces maris DSM 8797|Rep: Putative
           uncharacterized protein - Planctomyces maris DSM 8797
          Length = 599

 Score =  100 bits (239), Expect = 9e-20
 Identities = 85/269 (31%), Positives = 126/269 (46%), Gaps = 31/269 (11%)

Query: 3   HGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWT 62
           HGV  G E   +   E  + +  K  GYKT   GKWH G +   + P  +GFD   GF  
Sbjct: 97  HGVTRGFE--NMRAEETTIAEMFKAAGYKTGAFGKWHNGRHYPMH-PNGQGFDEFFGFCG 153

Query: 63  GRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLML 122
           G  + Y  T +E                     Y TDV TD AI  +   NK +P F  +
Sbjct: 154 GHWNRYFDTNLEHNKQPVKTEG-----------YITDVLTDRAIDFIKQ-NKDQPFFCYV 201

Query: 123 AHSAVHSGNPYEPIRAPQKLIDAF--KYIDDSARQKFAAVLSKLDESVGKVVKALHTRGL 180
            ++A HS     P   P+K  D +  K +DD AR  +A V   +D+++G++++ L    L
Sbjct: 202 PYNAPHS-----PWIVPEKYWDKYANKGLDDKARCAYAMV-ECVDDNLGRLMQTLDDLKL 255

Query: 181 LENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSP-LLDSKARVA 239
            +N+IV+F TDNG  +  +N N      ++G K ++ EGG+R   F+  P  + +   V 
Sbjct: 256 SDNTIVLFLTDNGPNSNRYNGN------MRGRKGSIHEGGIRVPLFVRYPGKIKAGTVVK 309

Query: 240 YQKMHISDWLPTLYSAAGGDLSVLENLDG 268
               HI D LPTL      + +  + LDG
Sbjct: 310 PIAAHI-DILPTLLELCSVENTADQPLDG 337


>UniRef50_Q1GWE7 Cluster: Sulfatase precursor; n=4;
           Alphaproteobacteria|Rep: Sulfatase precursor -
           Sphingopyxis alaskensis (Sphingomonas alaskensis)
          Length = 543

 Score =   99 bits (238), Expect = 1e-19
 Identities = 77/270 (28%), Positives = 128/270 (47%), Gaps = 25/270 (9%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           G+P +E  + + +K  GY T  +GKWHLG    E  P  +GFD  +    G   +     
Sbjct: 173 GVPASEVTIAEAVKAAGYHTVHIGKWHLGE-APELQPHAQGFDESLAVLAGAAMLLPEDD 231

Query: 73  MEQGS----WGTDFRRGF-EVAHDL-------FGV--YATDVYTDEAIKVVNSHNKSEPL 118
            +  +    W    R  +  + H +       F    + TD + DEAIK + + N++ P 
Sbjct: 232 PDAVNAKLPWDPIDRFIWANLRHAVTFNGSKRFAAQGHMTDYFADEAIKAIEA-NRNRPF 290

Query: 119 FLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTR 178
           FL LA +A     P+ P++A +   D    I D   + + A+++++D  +G V+  L   
Sbjct: 291 FLYLAFTA-----PHTPLQATRADYDRLAAIKDHRTRVYGAMIAQMDRRIGDVMAKLKEA 345

Query: 179 GLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKAR 237
           G+ +N++V+F++DNGG  A +N     N P +G K T +EGG+R   F+ W   +     
Sbjct: 346 GIDDNTLVIFTSDNGG--AWYNGMPGLNAPFRGWKATFFEGGIRAPLFMRWPARIAPGTE 403

Query: 238 VAYQKMHISDWLPTLYSAAGGDLSVLENLD 267
                 H+ D   T+ +AAG  L     +D
Sbjct: 404 RGDVTGHL-DLFATIAAAAGAALPADRTID 432


>UniRef50_Q3JD43 Cluster: Sulfatase; n=1; Nitrosococcus oceani ATCC
           19707|Rep: Sulfatase - Nitrosococcus oceani (strain ATCC
           19707 / NCIMB 11848)
          Length = 440

 Score = 99.5 bits (237), Expect = 2e-19
 Identities = 90/278 (32%), Positives = 136/278 (48%), Gaps = 41/278 (14%)

Query: 9   AEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMY 68
           A  + + L E    + LK +GY T LVGKWHLG  +  +LP  +GFD + G        Y
Sbjct: 95  AMAKAMSLEEITFAEALKSVGYSTALVGKWHLGD-RPAFLPPRQGFDEYFGI------PY 147

Query: 69  DHTTMEQGSWGTDF-----RRGFEVAH---DLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
            H   +   W   F      RG E+     DL   + T   T+EA+K + S NK  P  L
Sbjct: 148 SH---DMHPWRKSFPPLPLMRGEEIVELNPDLD--HLTQYCTEEAVKFI-SKNKDRPFLL 201

Query: 121 MLAH----SAVHSGNPYEPIRAPQKLIDAFKYIDDSARQ-KFAAVLSKLDESVGKVVKAL 175
            + H      VH    +   R  ++ + A K  D  +R+  ++A + ++D SVG+++KA+
Sbjct: 202 YMPHPMPHQPVHVSERFAK-RFSKEQLAAIKGEDKKSRKFLYSATIEEIDWSVGEIIKAV 260

Query: 176 HTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL--WSPLLD 233
              G+ E++ V F++DN GPA G      S  PL+G K  LWEGG R   F+  W   + 
Sbjct: 261 RALGIEESTFVAFTSDN-GPAIG------SAGPLRGKKRELWEGGHR-VPFIAYWQEKI- 311

Query: 234 SKARVAYQKMHIS-DWLPTLYSAAGGDLSVLENLDGVN 270
            +  V   ++ +S D  PT+ +A G      + +DGVN
Sbjct: 312 -RPGVVIDEIAMSMDLFPTM-AAMGRAPLPRKKIDGVN 347


>UniRef50_Q0C069 Cluster: Sulfatase family protein; n=2;
           Bacteria|Rep: Sulfatase family protein - Hyphomonas
           neptunium (strain ATCC 15444)
          Length = 505

 Score = 99.5 bits (237), Expect = 2e-19
 Identities = 86/310 (27%), Positives = 138/310 (44%), Gaps = 35/310 (11%)

Query: 5   VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF---- 60
           V++     GLP +E  + + L+  GY +   GKWH+G +  E+LP + GF S+ G     
Sbjct: 121 VLFPTSTGGLPQSEVTIAELLQQEGYVSAAFGKWHMG-HLPEFLPTSHGFQSYFGIPYSN 179

Query: 61  -----------WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYA-TDVYTDEAIKV 108
                      W+  ID++      Q +W     +  E+       +  T  YT+ AI+ 
Sbjct: 180 DMNMPGGGETPWS--IDLFFEPPNIQ-NWDVPLMQDEEIIERPADQFTLTQRYTERAIEF 236

Query: 109 VN-SHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDES 167
           +  SH + +P FL LAH+  H+         P    + F  +  SA   +  V+ +LD S
Sbjct: 237 METSHAEGQPFFLYLAHNMPHT---------PLFTSEGFTGV--SAGGAYGDVIEELDWS 285

Query: 168 VGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL 227
           VG++V AL    + +N++V+F++DN GP      ++ S   L+  K T WEGG+R     
Sbjct: 286 VGEIVDALKDMKIEKNTLVIFTSDN-GPWLAMKTHSGSAGMLRDGKGTTWEGGMRVPAIF 344

Query: 228 WSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPR-TSV 286
           W P      R         D +PT  + +G  L      DG +   AL     SPR T  
Sbjct: 345 WWP-GQIAPRTVTDLGSALDLMPTFAAISGARLPEDRVYDGFDLSPALFSEGSSPRETLY 403

Query: 287 LHNIDDIWGI 296
            +   D++ +
Sbjct: 404 YYRFTDVFAV 413


>UniRef50_A6DSG6 Cluster: Arylsulfatase A; n=1; Lentisphaera
           araneosa HTCC2155|Rep: Arylsulfatase A - Lentisphaera
           araneosa HTCC2155
          Length = 499

 Score = 99.5 bits (237), Expect = 2e-19
 Identities = 86/292 (29%), Positives = 137/292 (46%), Gaps = 39/292 (13%)

Query: 5   VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF-WTG 63
           ++Y     GL      +P+ LK+ GY T L+GKWHLG +   YLP ++GFD + G   T 
Sbjct: 91  IVYPNSTTGLIKEVVTIPEMLKEKGYTTALIGKWHLG-HTAGYLPRDQGFDYYFGVPGTN 149

Query: 64  RIDMYDHTT-MEQG----------SWGTDFRRGFE------VAHDLFGVYATDV------ 100
             D   H   + +G           +  D  +G        + +D    + TD+      
Sbjct: 150 HGDAKTHKLPVAEGFKPSGEFTIEDYWADKGKGVHGNSTILMKNDNVIEWPTDITQLTKR 209

Query: 101 YTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAV 160
           YT +A++ +   NK +P FL  AH   H  +PY         +DA  +   S    +  +
Sbjct: 210 YTHDAVRYIKE-NKDKPFFLYFAHGTPH--HPYT--------VDA-AFRGKSDHGLYGDM 257

Query: 161 LSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNA--ASNYPLKGVKNTLWE 218
           + ++D SVG+V+KAL   G+ + +I+ F++DNG  +    ++A   SN PLKG K +  E
Sbjct: 258 IEEIDWSVGEVIKALQENGIEKKTIIAFTSDNGADSKPNKEHAEKGSNLPLKGWKGSSEE 317

Query: 219 GGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
           GGVR    L  P    + +   +   + D  PT  + AG +  V + +DG N
Sbjct: 318 GGVRVPFVLSWPGTLPEGKKTNEIASLMDIFPTYAALAGIEPEVPQKIDGNN 369


>UniRef50_A6DNJ0 Cluster: Sulfatase; n=1; Lentisphaera araneosa
           HTCC2155|Rep: Sulfatase - Lentisphaera araneosa HTCC2155
          Length = 630

 Score = 99.5 bits (237), Expect = 2e-19
 Identities = 79/280 (28%), Positives = 127/280 (45%), Gaps = 15/280 (5%)

Query: 7   YGAEPRGLPL-NEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRI 65
           Y   P G+   N+  +  +LK+ GY T   GKW++G  K    P   GFD  +       
Sbjct: 105 YRGSPDGVVAKNDPTIAMWLKEAGYATAAYGKWNIGESKDVSWPGAHGFDDWL-IIDHNT 163

Query: 66  DMYDHTTMEQGSWGTD--FRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLA 123
             + H    +   G    F  G E   +L G Y TD++TD+AI  +    K +P F+ L 
Sbjct: 164 GYFQHKNANKDCEGRPMLFETGGERVTNLEGQYLTDIWTDKAIDFIQE-TKDQPFFIYLP 222

Query: 124 HSAVHSGNPYEPIRAPQKLIDA-FKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLE 182
            S  H+    +P   P    DA  K      R+ +  ++  LD  + ++ K+L  +G  +
Sbjct: 223 WSIPHTPLQ-DPASDPSLAFDAGAKPKTVEGREVYVKMVEYLDSHIARIFKSLKEQGKYD 281

Query: 183 NSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQK 242
           N++++F++DNGG        +A+ +PLK  K  L EGG+R    +  P       V  + 
Sbjct: 282 NTLIIFTSDNGGMV------SANCWPLKKTKQHLEEGGIRVPFLMQWPSKIKAGTVDQRA 335

Query: 243 MHISDWLPTLYSAAGGDLSVLEN--LDGVNQWDALSKNTE 280
             + D   T+ +AA     V ++  LDGVN +    +N E
Sbjct: 336 AIMMDASVTVLAAADAMKYVPKDRELDGVNLFANKEENRE 375


>UniRef50_A6DKM2 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
           Lentisphaera araneosa HTCC2155|Rep:
           N-acetylgalactosamine 6-sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 472

 Score = 99.5 bits (237), Expect = 2e-19
 Identities = 84/288 (29%), Positives = 133/288 (46%), Gaps = 31/288 (10%)

Query: 11  PRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDH 70
           P  +P     L Q  KD GY T + GKWHLG ++ +  P   GFD ++ F  G      +
Sbjct: 116 PYHMPEGTITLGQAFKDAGYATAMFGKWHLG-HRPQDQPDKMGFDEYLTF-QGMKHFAPY 173

Query: 71  TTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHS 129
           T   +   G               VY TD+  D+AI  +     +E P FL      VH+
Sbjct: 174 TLPNKVQHGEK-------------VYLTDLTCDKAIDFMERKVAAEKPFFLYYPDFLVHA 220

Query: 130 GNPYEPIRAPQKLIDAF--KYIDDSARQKFAAVLSK-LDESVGKVVKALHTRGLLENSIV 186
                P+ A Q +I  F  K I    +    A ++K LD++VG++VK +   G+ EN+I+
Sbjct: 221 -----PMEAKQAMIQYFEKKTIGQHHKSVIGAAMTKHLDDTVGRLVKKVDELGIAENTII 275

Query: 187 VFSTDNGG----PAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQK 242
           +F++DNGG       G+ D   SNYP +  K++ +EGG R       P +     ++++ 
Sbjct: 276 IFTSDNGGLGYKSDGGYGDKGTSNYPYRSAKSSHYEGGSRVPLIFHWPGVTEANSLSHEV 335

Query: 243 MHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTES--PRTSVLH 288
           +   D  PTL   A       + LDG++ + ++ KN +   P   + H
Sbjct: 336 VSGIDIYPTLLKIAQVAKPQEQILDGID-FSSILKNPKQKLPARDLFH 382


>UniRef50_Q7UJ66 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
           n=1; Pirellula sp.|Rep: N-acetylgalactosamine 6-sulfate
           sulfatase - Rhodopirellula baltica
          Length = 616

 Score = 98.7 bits (235), Expect = 3e-19
 Identities = 77/282 (27%), Positives = 126/282 (44%), Gaps = 17/282 (6%)

Query: 17  NEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQG 76
           +E  + +  ++ GY+T + GKWHLG     + P  RG ++ V    G  D   + T    
Sbjct: 135 DETTMAETFRESGYRTGMFGKWHLGD-PPPFAPRERGLETVVRHMAGGADEIGNPTGNDY 193

Query: 77  SWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPI 136
              T +R G     + F  Y TD++ +EAI  +   ++ +P F  +  +A+HS     P 
Sbjct: 194 FDDTYYRNG---TPESFDGYCTDIWFEEAIDFIQKESE-QPFFAYIPTNAMHS-----PY 244

Query: 137 RAPQKLIDAFKYID-DSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGP 195
               +  D FK    +  R  F  ++   DE++G+++K L    L +N++++F +DNG  
Sbjct: 245 LVADRYSDPFKRQGIEPQRAAFYGMIQNFDENLGRLLKRLDQDNLRDNTMLIFMSDNGTA 304

Query: 196 --AAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWLPTL 252
             A+  N     N  ++G K +++EGG R   F  W    D    V     H  DWLPTL
Sbjct: 305 QGASEQNRKVGFNAGMRGKKGSVYEGGHRVPCFASWPAKWDGNRPVDQLTCH-RDWLPTL 363

Query: 253 YSAAGGDLSVLENLDGVNQWDALSKNTES--PRTSVLHNIDD 292
                         DG +    LS +++    RT V+    D
Sbjct: 364 IELCDLKRPADVTFDGRSMAGLLSHSSQQWPERTLVIERQPD 405


>UniRef50_Q7UYD6 Cluster: N-acetyl-galactosamine-6-sulfatase; n=3;
           Bacteria|Rep: N-acetyl-galactosamine-6-sulfatase -
           Rhodopirellula baltica
          Length = 889

 Score = 97.5 bits (232), Expect = 6e-19
 Identities = 77/262 (29%), Positives = 125/262 (47%), Gaps = 23/262 (8%)

Query: 21  LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGT 80
           L +  +D GY T   GKWHLG   + Y PL  GFD  V    G            GS+  
Sbjct: 376 LAEMFRDNGYATGHFGKWHLGP--EPYSPLEHGFDVDVPHHPG--------PGPAGSYVA 425

Query: 81  DFR-RGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAP 139
            ++ + F+    +   +  D    EA++ +  H  +EP FL     +VH+     P  A 
Sbjct: 426 PWKFKDFDHDPVIPDEHLEDRMAKEAVRFLEQHT-NEPFFLNYWMFSVHA-----PFDAK 479

Query: 140 QKLIDAFK-YIDDSARQK---FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGP 195
           ++LI+ ++  +D    Q+   +AA++  +D+++G ++  L   G+ + +I+VF++DNGG 
Sbjct: 480 KELIEEYRDRVDPKDPQRCPTYAAMIESMDDAIGTLLDTLDRLGIADETIIVFASDNGGN 539

Query: 196 AAGFND--NAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLY 253
                D   A SN PL+G K T++EGGVRG   +  P +      +   +   D+ PTL 
Sbjct: 540 MYNEVDGTTATSNAPLRGGKATMYEGGVRGPAIVVQPGVVESGSRSDAIIQSIDFYPTLL 599

Query: 254 SAAGGDLSVLENLDGVNQWDAL 275
                D    +  DGV+   AL
Sbjct: 600 EMLAIDAQPNQRFDGVSIVPAL 621


>UniRef50_A6LCL3 Cluster: Arylsulfatase A; n=1; Parabacteroides
           distasonis ATCC 8503|Rep: Arylsulfatase A -
           Parabacteroides distasonis (strain ATCC 8503 / DSM 20701
           / NCTC11152)
          Length = 476

 Score = 97.1 bits (231), Expect = 8e-19
 Identities = 71/215 (33%), Positives = 111/215 (51%), Gaps = 21/215 (9%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           G+   E  + + LK  GY T + GKWHLGS +KE+LPL  GFD + G      DM+    
Sbjct: 101 GVHPEEMTIAEVLKQKGYSTAIFGKWHLGS-QKEFLPLQNGFDEYYGLPYSN-DMWPFHP 158

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATD---VYTDEAIKVVN--SHNKSEPLFLMLAHSAV 127
            +   +       ++  +++ G Y TD   + TD   + VN    NK++P FL LAH+  
Sbjct: 159 QQGEVFNFPDLPTYD-GNEIIG-YNTDQTRLTTDYTTRSVNFIKKNKNKPFFLYLAHNMP 216

Query: 128 HSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVV 187
           H          P  + D FK    S +  +  V+ ++D SVG++ KAL   GL +N++V+
Sbjct: 217 H---------VPLAVSDKFK--GKSEQGLYGDVMMEIDWSVGEIFKALRELGLEDNTLVI 265

Query: 188 FSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVR 222
            ++DN GP   + ++A S   L+  K T ++GG R
Sbjct: 266 LTSDN-GPWTNYGNHAGSAGGLREAKATTFDGGNR 299


>UniRef50_A4A2W0 Cluster: Arylsulfatase A; n=1; Blastopirellula
           marina DSM 3645|Rep: Arylsulfatase A - Blastopirellula
           marina DSM 3645
          Length = 477

 Score = 97.1 bits (231), Expect = 8e-19
 Identities = 86/296 (29%), Positives = 139/296 (46%), Gaps = 36/296 (12%)

Query: 5   VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGR 64
           V++     G+  NE  + + +K+ GY T ++GKWHLG  + ++LP  +GFD + G     
Sbjct: 100 VLFPGNAEGMSPNEVTIAELMKEQGYATAIIGKWHLGD-QPDFLPTRQGFDYYYGLPYSN 158

Query: 65  IDMYDHTTMEQGSWGTDF--RRGF-----------EVAHDLFGVYATDV---YTDEAIKV 108
            DM       + ++G     R+G             V   +     T++   YT+EAI+ 
Sbjct: 159 -DMGPAADGVKSNYGAPIPQRKGKGQPPLPLLRNETVLQRVLAKDQTELVTNYTEEAIQF 217

Query: 109 VNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESV 168
           +  H + +P FL L HSAVH          P    DAF+    ++   +   + ++D SV
Sbjct: 218 IRDHQE-KPFFLYLPHSAVHF---------PMYPGDAFR--GKNSHGLYNDWVEEVDWSV 265

Query: 169 GKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLW 228
           G+V++AL   GL + ++V+F++DNGG         A N PL+  K T +EGG+R    + 
Sbjct: 266 GQVLQALKDLGLDQRTLVIFTSDNGGQTR----FGAVNKPLRAGKATTYEGGMRVPTIVR 321

Query: 229 SPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALS--KNTESP 282
            P        +   + + D LPTL   AGG       +DG +    L+  K  +SP
Sbjct: 322 WPGKVPAGSSSDAVVGMIDVLPTLVKLAGGTTPTDRKIDGADIGPILAGVKEAKSP 377


>UniRef50_Q01N83 Cluster: Sulfatase precursor; n=1; Solibacter
           usitatus Ellin6076|Rep: Sulfatase precursor - Solibacter
           usitatus (strain Ellin6076)
          Length = 461

 Score = 96.7 bits (230), Expect = 1e-18
 Identities = 96/321 (29%), Positives = 135/321 (42%), Gaps = 32/321 (9%)

Query: 5   VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGR 64
           V+ G    GLP +E  + Q LK  GY+T  +GKWH+GS    YLP NRGFD   G     
Sbjct: 95  VVLGPGDAGLPDSEITMAQVLKSAGYRTSCIGKWHIGS-TPGYLPTNRGFDEFFGV-PYS 152

Query: 65  IDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAH 124
            D+     M   S          VA  +     T  +T EA+  +    +  P FL LAH
Sbjct: 153 ADITPCPLMRGSS---------VVAPAVDCSTLTSSFTQEALDFMR-RAQDNPFFLYLAH 202

Query: 125 SAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENS 184
           +A     P+ P+ A  +      +   S    +A V+ +LD S G+V+ AL   GL  N+
Sbjct: 203 TA-----PHLPLAASPR------FAGQSGLGMYADVVQELDWSTGQVMAALKATGLDSNT 251

Query: 185 IVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMH 244
           +V+FS+DNG    G      S   L+G K   +EGG+R       P +            
Sbjct: 252 LVMFSSDNGPWYQG------SQGKLRGRKGETYEGGMREPFLARYPGVIPSGIGCAGLAT 305

Query: 245 ISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALTVDKY 304
             D LPTL   AG   +    LDGV+ W  L+         V    D ++ +    + ++
Sbjct: 306 TMDLLPTLARLAGAQ-TPSNPLDGVDIWPVLTGERAEVDRDVFLYFDAVY-LQCARLGRW 363

Query: 305 KLIKGTIYKGVWDNWYGPSGR 325
           KL         W     P GR
Sbjct: 364 KLHLSRYNTKAWSP-LPPGGR 383


>UniRef50_Q7UZ43 Cluster: N-acetylgalactosamine-4-sulfatase; n=1;
           Pirellula sp.|Rep: N-acetylgalactosamine-4-sulfatase -
           Rhodopirellula baltica
          Length = 608

 Score = 96.3 bits (229), Expect = 1e-18
 Identities = 89/306 (29%), Positives = 141/306 (46%), Gaps = 31/306 (10%)

Query: 17  NEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDS---HVGFWTGRI-DMYDHTT 72
           NE    +   D GY+T + GKWHLG     Y   + GF     H G   G+  D +D+  
Sbjct: 110 NEVTFGEIFSDAGYQTGMFGKWHLGD-NYPYRAEDNGFTEVYRHGGGGVGQTPDFWDNAY 168

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKS-EPLFLMLAHSAVHSGN 131
            +    G+ F  G  V  + F    TDV+  E  + +    ++ EP F  +A +A     
Sbjct: 169 FD----GSYFHNGKAVKAEGF---CTDVFFKEGNRFIRECVEADEPFFAYIATNA----- 216

Query: 132 PYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
           P+ P+ APQK ID +  ++D+    F  +++ +D++VG+  K L   G+ +N+I +F+TD
Sbjct: 217 PHGPLHAPQKYIDMYPEMNDNVAT-FFGMITNVDDNVGQTRKLLRELGVHDNTIFIFTTD 275

Query: 192 NGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLD-SKARVAYQKMHISDWLP 250
           NG   AG    +  N  ++G K + +EGG R    +  P    +K+R      H  D +P
Sbjct: 276 NG--TAG--GASVYNAGMRGKKGSPYEGGHRVPFVMHYPEGGFAKSRTNNTLCHAVDVVP 331

Query: 251 TLYSAAGGDLSVLENLDGVNQWDALSKNTESP------RTSVLHNIDDI-WGIAALTVDK 303
           TL    G +       DG +    L    +S        T     ID I W  +++  DK
Sbjct: 332 TLLDMCGVEAPESVKFDGTSIVSLLKDEVDSSFNDRMLITDSQRVIDPIKWRQSSVMQDK 391

Query: 304 YKLIKG 309
           ++LI G
Sbjct: 392 WRLING 397


>UniRef50_Q7US96 Cluster: Arylsulphatase A; n=1; Pirellula sp.|Rep:
           Arylsulphatase A - Rhodopirellula baltica
          Length = 498

 Score = 96.3 bits (229), Expect = 1e-18
 Identities = 80/283 (28%), Positives = 130/283 (45%), Gaps = 24/283 (8%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
           LPL+   + + LK  GY T  VGKWHLG+   E+ P  +G+D         +        
Sbjct: 121 LPLDTVTIAESLKASGYTTGYVGKWHLGN-GPEFQPDRQGYDFSAVIGGPHLP------- 172

Query: 74  EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPY 133
             G +    R   +   +    Y TD   D  I  +   NK +P FLML+  AVH   P 
Sbjct: 173 --GRYRVQGRSDLKPKPNQ---YRTDFEADLCIDFMRQ-NKDQPFFLMLSPFAVHI--PL 224

Query: 134 EPIRAPQKLIDAF-KYIDDSARQK-FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
             +    +  +A  K   +S     +AA++   D+ VG++V +L    + +++++VF++D
Sbjct: 225 AAMSEKVQKYEAMAKQTGNSLPHPVYAAMIEHCDDMVGRLVDSLEQLDIADDTMIVFTSD 284

Query: 192 NGGPAAGFN------DNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHI 245
           NGG    ++      D  +S  PLKG K +L EGG+R    +  P     A V  +    
Sbjct: 285 NGGLYKRYDYRESADDLVSSQAPLKGEKGSLHEGGIRVPLIIRHPATVKSAGVCDEPTIS 344

Query: 246 SDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLH 288
            D+ PT    AGG+L + + +DG +    ++  T++     LH
Sbjct: 345 HDFYPTFVEMAGGELPINQTIDGHSLLPLMTAPTQTLDRDALH 387


>UniRef50_Q7URY7 Cluster: Aryl-sulphate sulphohydrolase; n=1;
           Pirellula sp.|Rep: Aryl-sulphate sulphohydrolase -
           Rhodopirellula baltica
          Length = 490

 Score = 96.3 bits (229), Expect = 1e-18
 Identities = 82/252 (32%), Positives = 126/252 (50%), Gaps = 33/252 (13%)

Query: 25  LKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHV-GFWTGRIDMYDHTTMEQGSWGTDFR 83
           ++D GY+T ++GKWHL        PL  GFD +V G  +G        +  +G +    +
Sbjct: 136 VRDAGYRTGIIGKWHLSDD-----PLPYGFDINVAGTHSG--------SPPKGYFPPHPK 182

Query: 84  -RGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKL 142
             G +   D    Y TD  TDEAI  + + N+    FL L+H AVH+     P++A   L
Sbjct: 183 VPGLQDTSD--DEYLTDRLTDEAIGFIEA-NQEWSWFLYLSHFAVHT-----PLQAKPDL 234

Query: 143 IDAFKYIDDSARQK---FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGF 199
           +  +K             AA++  +DE VG++V+ L   GL EN+ +VF++DNG    GF
Sbjct: 235 VAKYKAKQPGTLHDHAVMAAMIESVDEGVGRMVETLRELGLEENTAIVFTSDNG----GF 290

Query: 200 NDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWLPTLYSAAGG 258
              A S  PL+G K T +EGG+R   F+ W  ++D+  + +   +  +D  PT     G 
Sbjct: 291 GP-ATSMKPLRGYKGTYYEGGIREPFFVTWPGVVDAGTK-SDVPVIAADLYPTFIEMTGA 348

Query: 259 DLSVLENLDGVN 270
            L   + LDGV+
Sbjct: 349 KLPADQPLDGVS 360


>UniRef50_Q7ULE7 Cluster: Iduronate-sulfatase and sulfatase 1; n=1;
           Pirellula sp.|Rep: Iduronate-sulfatase and sulfatase 1 -
           Rhodopirellula baltica
          Length = 1049

 Score = 95.5 bits (227), Expect = 3e-18
 Identities = 91/327 (27%), Positives = 150/327 (45%), Gaps = 43/327 (13%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLG------SYKKEYLPLNRGFDSH-VGFWTGRID 66
           LP N   + ++L+  GYKT  VGKWHL        + +  LP   G     V     +I+
Sbjct: 657 LPTNAVTIAEHLQPKGYKTGFVGKWHLEPNVTCIDWMRRELPAMAGKPRRKVRIPWNKIE 716

Query: 67  MYDHTTM--EQGSWG--TDFRRGFEVAH-DLFGV--------YATDVYTDEAIKVVNSHN 113
            Y  +    ++  WG  T++R  F++   +L           +  DV T+ A+K +   N
Sbjct: 717 PYSPSQQGFDEYYWGERTNYRTNFDLTSGELLAEMKPIRDERFRIDVQTNAAVKFIQ-RN 775

Query: 114 KSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVK 173
             +P +L L +       P+ P+ A QK +D F       R+   A++S +D+ VG++V 
Sbjct: 776 HDQPFYLQLNYYG-----PHTPLEATQKYLDRFPGPMPERRRYALAMISAIDDGVGQIVD 830

Query: 174 ALHTRGLLENSIVVFSTDNGGPA------AGFNDNAAS-----NYPLKGVKNTLWEGGVR 222
            L   G+L+N+++V ++DNG P       +  N +A       N P  G K  L EGG+R
Sbjct: 831 QLKAEGVLDNTLIVMTSDNGAPLKMTKTDSPINGDAGGWDGSLNDPWVGEKGMLSEGGIR 890

Query: 223 GAGFLWSPLLDSKARVAYQ-KMHISDWLPTLYSAAGGDL-SVLENLDGVNQWDALSKNTE 280
               +WS      + + Y   +   D  P++   AGG+L S     DG++    L+ + +
Sbjct: 891 -VPMIWSLPTQLPSGITYDWPVSALDIAPSVLKLAGGELPSGDAAFDGIDLIPRLN-DIQ 948

Query: 281 SPRTSVLHNIDDIWGIAALTVDKYKLI 307
           +P T  L+     W  AA+   K+K I
Sbjct: 949 NPPTRTLY--FRFWDQAAIRRGKWKYI 973


>UniRef50_Q64P90 Cluster: Putative secreted sulfatase ydeN; n=2;
           Bacteroides fragilis|Rep: Putative secreted sulfatase
           ydeN - Bacteroides fragilis
          Length = 493

 Score = 95.5 bits (227), Expect = 3e-18
 Identities = 82/275 (29%), Positives = 128/275 (46%), Gaps = 30/275 (10%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
           L  +E  + +  +  GY T + GKWHL     EY P   GFD ++G   G    +     
Sbjct: 120 LSKDEITMAEAFRQNGYSTFMAGKWHLAE-SAEYYPEQNGFDINIG---GNNTGHPSKGY 175

Query: 74  EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSG--N 131
                    + G E      G Y TD  TDE I+ + S  K +P F+ L++  VH     
Sbjct: 176 FSPYGNPQLKDGPE------GEYLTDRLTDEVIRYI-SEPKEKPFFVYLSYYTVHLPLQA 228

Query: 132 PYEPIRAPQKLIDAFKYIDDSARQK-------------FAAVLSKLDESVGKVVKALHTR 178
             E I   ++ +      D S  +K             +AA++  LDE++G+++  LH  
Sbjct: 229 KAEKIAKYRRKLSRAVPADSSFVKKGETYHKLVQDIPAYAAMVESLDENIGRLLDTLHRS 288

Query: 179 GLLENSIVVFSTDNGGPAAG--FNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSK 235
           GL E +IVVF++DNGG A      +   SN PL+  K  L+EGG++    + WS  L  +
Sbjct: 289 GLDERTIVVFTSDNGGMATSNTTRNIPTSNLPLRAGKGYLYEGGIKVPAIIRWSGHLKGR 348

Query: 236 ARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
            +V+   +  +D+ PTL    G  L   +++DGV+
Sbjct: 349 -QVSDTPIIGTDYYPTLLDLCGLPLLPGQHVDGVS 382


>UniRef50_A6LIX6 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
           Parabacteroides distasonis ATCC 8503|Rep:
           N-acetylgalactosamine 6-sulfatase - Parabacteroides
           distasonis (strain ATCC 8503 / DSM 20701 / NCTC11152)
          Length = 589

 Score = 95.5 bits (227), Expect = 3e-18
 Identities = 79/274 (28%), Positives = 129/274 (47%), Gaps = 27/274 (9%)

Query: 16  LNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQ 75
           L EK + +Y ++ GY T L GKWH G+ +  Y P  RGF+   GF +G    Y +  +E 
Sbjct: 104 LGEKTIAEYFREAGYATSLFGKWHSGT-QYPYHPNARGFEEFYGFCSGHWGNYWNPVLE- 161

Query: 76  GSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEP 135
                    G  ++ + F +   D  TD+A+  +  H K  P F+ L+++  HS      
Sbjct: 162 -------HNGEIISGEGFII---DDLTDKALDYIRDH-KEHPFFMFLSYNTPHSPMQVPD 210

Query: 136 I---RAPQKLID---AFKYIDDSARQKFAAVLSK-LDESVGKVVKALHTRGLLENSIVVF 188
               R   + +     F   +D+   K A  L++ LD ++G+V+  LH+  L + +IV++
Sbjct: 211 SWWNRVKDRTLSQRATFPEQEDTTFTKAALALAENLDWNIGRVLSLLHSLDLEQETIVIY 270

Query: 189 STDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDW 248
            +DNG  +  +N        +KG K +  EGGVR    +  P    K  V  Q     D 
Sbjct: 271 FSDNGPNSFRWNGG------MKGRKGSTDEGGVRSPFCIRWPGHIRKGAVETQLSGAIDL 324

Query: 249 LPTLYSAAGGDLSVLENLDGVNQWDALSKNTESP 282
           +PTL   AG + + L  LDG++ W     + ++P
Sbjct: 325 IPTLLGLAGIEYTPLRKLDGID-WGQRLLDEKAP 357


>UniRef50_A0JAV8 Cluster: Sulfatase precursor; n=1; Shewanella
           woodyi ATCC 51908|Rep: Sulfatase precursor - Shewanella
           woodyi ATCC 51908
          Length = 365

 Score = 95.1 bits (226), Expect = 3e-18
 Identities = 67/213 (31%), Positives = 105/213 (49%), Gaps = 26/213 (12%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
           LPL    + + +K LGY T   GKWHLGS  ++Y P+ +GFD   G  T       H   
Sbjct: 159 LPLEVTSIAEAVKPLGYYTAFSGKWHLGS--EDYFPIKQGFDEQFGVSTA-----GHPKS 211

Query: 74  EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPY 133
               +   +R  +  A    G   T+  TD+ +  +N ++K +P  L   + +VH+  P+
Sbjct: 212 YHAPFWEAYRNPYPDAPK--GKNLTERLTDDVVNFINGYDKDQPFMLTNFYYSVHT--PH 267

Query: 134 E-PIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDN 192
           + P  A QK +D      D     F +++  LD SVG++++AL   G  +N++V+F +D 
Sbjct: 268 QGPKAATQKYLDRGL---DKRYANFGSMVESLDTSVGRILQALEDSGQADNTVVIFYSDQ 324

Query: 193 GGPAAGFNDNAASNYPLKGVK---NTLWEGGVR 222
           GG          +N PL+G K     L+EGG R
Sbjct: 325 GG--------YFTNAPLRGGKIGGRALYEGGAR 349


>UniRef50_Q7UL93 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
           Pirellula sp.|Rep: N-acetylgalactosamine 6-sulfatase -
           Rhodopirellula baltica
          Length = 470

 Score = 94.3 bits (224), Expect = 6e-18
 Identities = 81/258 (31%), Positives = 119/258 (46%), Gaps = 26/258 (10%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVG-FWTGRIDMYDHTT 72
           LP     + + LK  GY T   GKWHLG  KK Y P   GFD +VG    G    Y    
Sbjct: 139 LPHETTTMAERLKAAGYTTGFFGKWHLGGDKK-YWPTEHGFDVNVGGCGLGGPPTY---- 193

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
                   D  R   +     G Y TD   DE I  +    K +P+F+ L      + NP
Sbjct: 194 -------FDPYRIPALPPRKEGEYLTDRLADETIAFMR-REKDKPMFVCL-----WTYNP 240

Query: 133 YEPIRAPQKLIDAFKYIDDSARQK--FAAVLSKLDESVGKVVKALHTRGLLENSIVVFST 190
           + P  AP+ LI+ +K  + +  +   +   +   D  VG+V++ L + G+ + ++VVF++
Sbjct: 241 HYPFEAPEDLIEHYKGKEGTGLKNPIYGGQIEATDRGVGRVLRELDSLGIADETLVVFTS 300

Query: 191 DNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLP 250
           DNGG +      A  N PL+  K  L+EGG+R    +  P +   A V    +   D   
Sbjct: 301 DNGGWS-----GATDNRPLREGKGFLFEGGLRVPLIVRWPGVTEAATVNETPVVSMDLTA 355

Query: 251 TLYSAAGGDLSVLENLDG 268
           T+  AAG  L+  E+LDG
Sbjct: 356 TILDAAGVSLANGESLDG 373


>UniRef50_A3XZF1 Cluster: Sulfatase family protein; n=5;
           Proteobacteria|Rep: Sulfatase family protein - Vibrio
           sp. MED222
          Length = 500

 Score = 94.3 bits (224), Expect = 6e-18
 Identities = 84/312 (26%), Positives = 132/312 (42%), Gaps = 32/312 (10%)

Query: 3   HGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWT 62
           H V     P GL  +   LP+ LK +GY T   GK HLG  + E+LP   GFD + G W 
Sbjct: 95  HSVGLPGGPVGLSADTPTLPEILKTMGYVTGQFGKNHLGD-RDEFLPTMHGFDEYWG-WL 152

Query: 63  GRIDMYDHTTMEQGSWGTDFRR-GFEVAHDLF----GVYATDVYTDEAIKVVNSHNKSEP 117
             ++  ++T  E   W  D     F   + ++    G     +  D A+ +       + 
Sbjct: 153 YHLNAMEYT--EDPDWPKDGSLDAFAPRNVIYARSDGKGGQTIEDDGALSIERMRTLDDE 210

Query: 118 L---FLMLAHSAVHSGNPYEPIRAPQK------LIDAFK-YIDDSARQKFAAVLSKLDES 167
           +    +     AV +  P+     P +      L   ++  +  +       V+  LD+ 
Sbjct: 211 VNKHAINFIERAVEADKPFFTWYCPSRGHVWTHLSPEYEAMLGQNGWGLQEVVMKDLDDH 270

Query: 168 VGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL 227
           VG+++  +   G+ +N+I++F+ DNG     + D   +  P  G K T WEGGVR    +
Sbjct: 271 VGEMMAKMEELGIADNTIIIFTADNGPEIMTWPDGGMT--PYHGEKGTTWEGGVRAPALV 328

Query: 228 WSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLE-----------NLDGVNQWDALS 276
             P       V        DWLPTL +AAGG   + E           +LDG NQ D L+
Sbjct: 329 SWPGKIPAGTVGNGIFDGMDWLPTLVAAAGGPTDLKEKLLKGHDGFKAHLDGYNQVDMLT 388

Query: 277 KNTESPRTSVLH 288
           +  ES R  + +
Sbjct: 389 EKGESNRKEIYY 400


>UniRef50_A6DS95 Cluster: Arylsulfatase A; n=2; Lentisphaera
           araneosa HTCC2155|Rep: Arylsulfatase A - Lentisphaera
           araneosa HTCC2155
          Length = 491

 Score = 93.9 bits (223), Expect = 8e-18
 Identities = 89/297 (29%), Positives = 144/297 (48%), Gaps = 35/297 (11%)

Query: 2   QHGVIYGAEPRG-LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           ++GV +  +PR  L      +   LK+ GYKT  VGKWHLG+  K Y P  RGFD +   
Sbjct: 90  RNGVTHTVQPREKLYKGALTIADILKEGGYKTGFVGKWHLGN-DKGYAPQYRGFDWYAKN 148

Query: 61  WTGRIDMYDHTTMEQGSWGTDFR-RGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLF 119
             G    ++H  +E    G  F+ +GF            D + DEA+  +    + +P F
Sbjct: 149 AKG---PHNHFDVEMIRNGKRFQTKGFR----------EDAFFDEAMTFMKEAGE-QPFF 194

Query: 120 LMLAHSAVHSGNPYEPIRAPQKLIDAFKY--IDDSARQKFAAVLSKLDESVGKVVKALHT 177
           L L      + +P+ P+ AP+ L+  +K   ++D+    + A++  +D+++G++ + L  
Sbjct: 195 LYLC-----TYSPHTPLGAPEDLLKKYKAKGLNDN-HAAYLAMIENIDDNLGRLDQFLKK 248

Query: 178 RGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWS-PLLDSKA 236
             L +++I++F  DN G   G +     N  ++G K T+WEGG R A  LW  P      
Sbjct: 249 ENLYDDTILIFMNDN-GVTVGLD---VYNADMRGPKCTIWEGGTR-AFSLWRWPKKWQPK 303

Query: 237 RVAYQKMHISDWLPTLYSAAGGDL--SVLENLDGVNQWDALS-KNTESPRTSVLHNI 290
            V     H+ D LPTL   AG D+   V   L+G +    L+ K+ E     + HN+
Sbjct: 304 TVENLTAHL-DVLPTLCELAGVDVPEKVQGELEGYSLSPLLNGKDWEHNNRLLFHNV 359


>UniRef50_A3J5W3 Cluster: Putative arylsulfatase; n=1; Flavobacteria
           bacterium BAL38|Rep: Putative arylsulfatase -
           Flavobacteria bacterium BAL38
          Length = 468

 Score = 93.9 bits (223), Expect = 8e-18
 Identities = 80/294 (27%), Positives = 141/294 (47%), Gaps = 37/294 (12%)

Query: 8   GAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDM 67
           G EP  +P +E  + + LK  GY T   GKW LG    E  P N+GFD   G+  G+I  
Sbjct: 105 GNEP--IPASEITVAEILKTAGYTTGAFGKWGLGYPASEGSPNNQGFDQFYGY-NGQIHA 161

Query: 68  YDH-TTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSA 126
           +++ T+  + +   +     +     + VY+ D+  D A++ V   NK+ P FL    + 
Sbjct: 162 HNYFTSYLRKNDLVELNANIDAP---YSVYSADIIKDRALEFVEV-NKNNPFFLYFCPTL 217

Query: 127 VHSGNPYEPIRAPQKLIDAFK----------YIDDSARQKFAAVLSKLDESVGKVVKALH 176
            H  NPY   +   K ++ +           + ++ +  K+AA+ S+LD+ VG+++  L 
Sbjct: 218 PH--NPYH--QPDDKTLEYYAKKTGFPIGDAHSEEFSVPKYAALSSRLDQQVGEIMAKLK 273

Query: 177 TRGLLENSIVVFSTDNGGPAAGFNDN-AASNYPLKGVKNTLWEGGVRGAGFLWSPLL--- 232
              LL+N++++F++DNG       D+   +   L+G K+ ++EGG++      SPL+   
Sbjct: 274 ELNLLDNTLIIFASDNGSALTKEEDSYLRTGGDLRGRKSEVYEGGIK------SPLIAFW 327

Query: 233 DSKARVAYQKMHIS---DWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPR 283
             K        HIS   D+LPT            +N+DG++    L   T++ +
Sbjct: 328 KGKIIPGSSSNHISAFWDFLPTCAEIV--KAKTPDNIDGISYLPTLLGKTDNQK 379


>UniRef50_Q7UHK0 Cluster: Arylsulphatase A; n=1; Pirellula sp.|Rep:
           Arylsulphatase A - Rhodopirellula baltica
          Length = 478

 Score = 93.5 bits (222), Expect = 1e-17
 Identities = 79/273 (28%), Positives = 130/273 (47%), Gaps = 29/273 (10%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           G   +E  +P+ L   GY++ +VGKWHLG   +   PL+ GFD ++G  +     Y+   
Sbjct: 124 GFAPDEITIPELLGPAGYRSLMVGKWHLGMELEGSHPLDAGFDEYLGIPSN----YE--- 176

Query: 73  MEQGSWGTDFRRGFEVAH-DLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGN 131
             +G       RG +V   ++     T  YTDE I  +    K +P F+ ++H  VH  N
Sbjct: 177 PRRGKNHNTLYRGKQVEQKNVACEELTKRYTDEVIDFI-ERQKDDPFFIYVSHHIVH--N 233

Query: 132 PYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
           P +P  +P        ++  S + K+   + +LD S G++++ +   GL EN++V+F++D
Sbjct: 234 PLKP--SPD-------FVGTSEKGKYGDFIKELDHSTGRIMQTIRDAGLDENTLVIFTSD 284

Query: 192 NGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAG-FLWSPLLDSKARVAYQKMHISDWLP 250
           NG    G      S+  L G K    EGG R  G F W+  + +  +V+   +   D LP
Sbjct: 285 NGPTRNG------SSGELSGGKYCTMEGGHRVPGMFRWTSKI-APNQVSDVTLTSMDLLP 337

Query: 251 TLYSAAGGDLSVLENLDGVNQWDA-LSKNTESP 282
                AG  +     +DG +     L + +ESP
Sbjct: 338 LFCELAGVPIPDDRQIDGKSILPVLLGQTSESP 370


>UniRef50_A6KWS8 Cluster: Arylsulfatase; n=1; Bacteroides vulgatus
           ATCC 8482|Rep: Arylsulfatase - Bacteroides vulgatus
           (strain ATCC 8482 / DSM 1447 / NCTC 11154)
          Length = 464

 Score = 93.5 bits (222), Expect = 1e-17
 Identities = 84/311 (27%), Positives = 144/311 (46%), Gaps = 29/311 (9%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
           LP  E  +    K   Y T  VGKW +G    E +P   GFD   G+   R   + H   
Sbjct: 114 LPAGEVTVADIFKTKNYVTGCVGKWGMGGPGTEGMPGKHGFDYFYGYLGQR---FAH--- 167

Query: 74  EQGSWGTDFRRGFEVAHDLFG-VYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSG-- 130
              S+  +F    E    L G  Y+ D+  ++A+  ++  N  +P FL  + +  H+   
Sbjct: 168 ---SYYPEFLHENEQKIMLDGKYYSHDLMLEKALNFIDE-NAQKPFFLYFSPTIPHADLD 223

Query: 131 ------NPYEPIRAPQKL---IDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLL 181
                   YE            D +K    + R  +AA+++ LD+SVG ++K L  +GL 
Sbjct: 224 IMGEAMTEYEGEFCETPFGGSRDGYKS-QQNPRAAYAAMVTYLDKSVGLIIKELKEKGLY 282

Query: 182 ENSIVVFSTDNGGPAAGFNDNA--ASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVA 239
           +++I+VF++DNG  + G +D +   SN P +G K  L+EGG+R    +  P +  +  V 
Sbjct: 283 DHTIIVFTSDNGVHSEGGHDPSYFDSNGPFRGQKRDLYEGGIRTPFVIQWPGVIPQGVVT 342

Query: 240 YQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALS-KNTESPRTSVLHNIDDIWG-IA 297
                  D+LPT+      D+   +N+DG++    L+ K T+     + +   +  G  +
Sbjct: 343 NHISAFWDFLPTIGELVQADIP--QNIDGISYLPTLTGKGTQKEHDCIYYEFFEFGGKQS 400

Query: 298 ALTVDKYKLIK 308
            +T D +KL++
Sbjct: 401 IMTPDGWKLVR 411


>UniRef50_A6DSM5 Cluster: Arylsulfatase A; n=1; Lentisphaera
           araneosa HTCC2155|Rep: Arylsulfatase A - Lentisphaera
           araneosa HTCC2155
          Length = 401

 Score = 93.5 bits (222), Expect = 1e-17
 Identities = 81/285 (28%), Positives = 128/285 (44%), Gaps = 26/285 (9%)

Query: 7   YGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRID 66
           +G   RG P     + + LK  GY T   GKWH G       PL RGFD H G      D
Sbjct: 40  HGPNGRGPPTEFATIAEPLKKSGYNTVHFGKWHCGDTNATR-PLARGFDEHAGLMYSN-D 97

Query: 67  MYDHTTMEQGSWGTDFRRGFEVAH-DLFGVYATDVY------TDEAIKVVNSHNKSEPLF 119
           M+    M+   WG    R +     ++  +   D        T++++  +   NK +P F
Sbjct: 98  MWHLHPMQPKHWGKFPLRFWNNGEIEIEDIQPKDQKNLTKWATEKSVDFIK-RNKDQPFF 156

Query: 120 LMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRG 179
           L   HS  H          P  +   F+ I  S +  +  VL++LD SVG++ +AL   G
Sbjct: 157 LYTTHSMPH---------VPLYVSKEFEGI--SGQGLYGDVLAELDWSVGQINQALKDNG 205

Query: 180 LLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVA 239
           + + ++++FS+DN GP AG+ D+A    P +  K T ++GG R    +  P +      +
Sbjct: 206 IEDKTMIIFSSDN-GPWAGYGDHAGKP-PYREAKATSFDGGTRSPLIVKYPKMIPPNSAS 263

Query: 240 YQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALS--KNTESP 282
            +     D +PT+   AGG       +DG N  D ++  K  ++P
Sbjct: 264 KKVFCSIDLMPTILDLAGGP-HPDNKIDGKNVLDLMTDKKGAKNP 307


>UniRef50_A6DGL0 Cluster: Arylsulfatase A; n=1; Lentisphaera
           araneosa HTCC2155|Rep: Arylsulfatase A - Lentisphaera
           araneosa HTCC2155
          Length = 506

 Score = 93.5 bits (222), Expect = 1e-17
 Identities = 73/252 (28%), Positives = 115/252 (45%), Gaps = 22/252 (8%)

Query: 22  PQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGTD 81
           PQ L+  GYKT L GKWHLG   +EY P NRGFD  +    G I  Y+    +  +    
Sbjct: 116 PQALQKSGYKTGLFGKWHLGD-GEEYRPQNRGFDEVLMHGAGGIGQYNFGDFKPNATNKY 174

Query: 82  FRRGFEVAHDLFGV--YATDVYTDEAIK-VVNSHNKSEPLFLMLAHSAVHSGNPYEPIRA 138
           F         +     + TDV+   A+  +   H  ++  F  ++ +A     P+ P+ A
Sbjct: 175 FDNVLLHNDTIVQTKGFCTDVFFKAALSWIKKQHENNQTYFAYISLNA-----PHGPLIA 229

Query: 139 PQKLIDAFKYIDDSARQKFAA---VLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGP 195
           P+K     ++ID+   Q  AA   ++  +D++ G +V+ L     L+N++++F TDNG  
Sbjct: 230 PEKY--KKRFIDEGYNQSVAARYGMIENIDDNFGLMVEKLKEWKALDNTLIIFMTDNGMA 287

Query: 196 AAGFNDNA------ASNYPLKGVKNTLWEGGVRGAGF-LWSPLLDSKARVAYQKMHISDW 248
                         A N  +KG K++ WEGG R   F  W  +L     ++    HI D 
Sbjct: 288 MKSIGKKGVKGKFNAWNAGMKGHKDSAWEGGSRVPSFWYWKGVLGEGVDISALSAHI-DL 346

Query: 249 LPTLYSAAGGDL 260
             T    AG ++
Sbjct: 347 YRTFCELAGTNI 358


>UniRef50_A6C4Q9 Cluster: Arylsulphatase A; n=1; Planctomyces maris
           DSM 8797|Rep: Arylsulphatase A - Planctomyces maris DSM
           8797
          Length = 490

 Score = 93.1 bits (221), Expect = 1e-17
 Identities = 76/266 (28%), Positives = 125/266 (46%), Gaps = 34/266 (12%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
           LPL      + L+   Y T   GKWHLG   + + P  +G+                T++
Sbjct: 124 LPLEIVTPGELLQSANYNTAYFGKWHLGP--ESHNPDQQGYQ---------------TSL 166

Query: 74  EQGSWGTDFRRGFEVAHDLF---GVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSG 130
             G  G  F   F            Y  D  TD+ I+ +   NKS+P F+ L+H AVH  
Sbjct: 167 VTG--GRHFAPRFRTTPSTRIPNKAYLADFLTDKTIEFIRQ-NKSKPFFVQLSHYAVHI- 222

Query: 131 NPYEPIRAPQKLIDAFKYIDDSA----RQKFAAVLSKLDESVGKVVKALHTRGLLENSIV 186
               P+ A Q++I  ++     A       +AA+++ +D+SVG++V AL    L EN++V
Sbjct: 223 ----PLEAKQQMIRKYQQKPKPAYGINNPVYAAMVAHVDDSVGRIVAALEELKLTENTVV 278

Query: 187 VFSTDNGGPAAGFN--DNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMH 244
           +F++DNGG    F+  D  ++N PL+  K +L+EGG+R    +  P + +  +   +   
Sbjct: 279 IFTSDNGGLRQSFSGGDIVSTNAPLRDEKGSLYEGGIRVPLIIKWPGVAAAGKTCAEPTI 338

Query: 245 ISDWLPTLYSAAGGDLSVLENLDGVN 270
             D+ PT    A   L   + +DG++
Sbjct: 339 SIDFWPTFAEIAHTTLQEHQTIDGLS 364


>UniRef50_Q7URW3 Cluster: N-acetylgalactosamine-4-sulfatase; n=1;
           Pirellula sp.|Rep: N-acetylgalactosamine-4-sulfatase -
           Rhodopirellula baltica
          Length = 480

 Score = 92.7 bits (220), Expect = 2e-17
 Identities = 67/191 (35%), Positives = 105/191 (54%), Gaps = 13/191 (6%)

Query: 96  YATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQ 155
           Y TD  TD+AI  + +   S+P  ++++++AVHS     P++A  +   A + IDD  R+
Sbjct: 229 YLTDSITDKAIDAI-TQTASKPFAMVVSYNAVHS-----PMQASLEDHAAMELIDDPQRR 282

Query: 156 KFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNT 215
            FA +L  LD  VG++++ L  + L ++++VVF +DNGGP A   +  +SN PL+G K +
Sbjct: 283 IFAGMLIALDRGVGRIIEKLDQQKLRQDTLVVFFSDNGGPTA---ELTSSNAPLRGGKGS 339

Query: 216 LWEGGVRGAGFLWSPLLDSKARVAYQKMHIS-DWLPTLYSAAGGDLSVLENLDGVNQWDA 274
           L+EGGVR    +WS      A        +S D   +    A G+ S LE  DG N    
Sbjct: 340 LYEGGVR-IPMIWSMPGTIPAGAEEDTPILSLDIAASFLPLAVGEASQLET-DGTNVLPW 397

Query: 275 LSKNT-ESPRT 284
           + + T + PRT
Sbjct: 398 IGRGTFKLPRT 408



 Score = 51.6 bits (118), Expect = 4e-05
 Identities = 21/48 (43%), Positives = 32/48 (66%), Gaps = 1/48 (2%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           GLP  +K   ++L+  GY+T L+GKWHLG+ +   +P ++GFD   GF
Sbjct: 116 GLPPQQKTFVEHLQSAGYQTSLIGKWHLGT-RPSQVPTSKGFDRFFGF 162


>UniRef50_Q7UGB8 Cluster: Arylsulfatase homolog b1498; n=1;
           Pirellula sp.|Rep: Arylsulfatase homolog b1498 -
           Rhodopirellula baltica
          Length = 656

 Score = 92.3 bits (219), Expect = 2e-17
 Identities = 79/256 (30%), Positives = 121/256 (47%), Gaps = 34/256 (13%)

Query: 18  EKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGS 77
           E  L +  +  GY T   GKWH G+    + P  +GF+   GF  G  ++YD   +E+  
Sbjct: 181 ETTLAELYRSAGYATGCFGKWHNGAQMPLH-PNGQGFNEFFGFCGGHFNLYDDALLERN- 238

Query: 78  WGTDFR-RGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPI 136
            GT  + +G          Y TDV TD A++ + +H+   P F  +  +A     P+ P 
Sbjct: 239 -GTPVQTKG----------YITDVLTDAAVEFIQNHH-DRPFFCYVPFNA-----PHGPF 281

Query: 137 RAPQKLIDAFKYIDDSARQKFAAV---LSKLDESVGKVVKALHTRGLLENSIVVFSTDNG 193
           +  + L D  +Y D S  +K AAV   +  +D +V +++K L    L E +IVVF TDNG
Sbjct: 282 QVRRDLFD--RYNDGSIDEKTAAVYAMVQNIDTNVSRLLKCLSDHSLDEETIVVFLTDNG 339

Query: 194 GPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWLPTL 252
                FN        ++G K ++ EGG R   F+ W+  +  ++ ++    HI D LPTL
Sbjct: 340 PNGKRFNGG------MRGTKGSVHEGGCRVPCFIRWTGNIQPQS-ISQVAAHI-DLLPTL 391

Query: 253 YSAAGGDLSVLENLDG 268
                  L     LDG
Sbjct: 392 MQWCDIPLPTKVPLDG 407


>UniRef50_A6DQ01 Cluster: N-acetylgalactosamine-4-sulfatase; n=2;
           Lentisphaera araneosa HTCC2155|Rep:
           N-acetylgalactosamine-4-sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 616

 Score = 92.3 bits (219), Expect = 2e-17
 Identities = 68/226 (30%), Positives = 114/226 (50%), Gaps = 23/226 (10%)

Query: 4   GVIYGAEPRGLPLNEKI-LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWT 62
           GV +  + R L    +I +   LKD GY T + GKWHLG     Y P +RGF   V    
Sbjct: 86  GVWHTVQGRHLMREREITMANILKDNGYATGIFGKWHLGD-AYPYRPEDRGFTHVVTHGA 144

Query: 63  GRIDMYDHTTMEQGSWGTD-FRRGFEVAHDL--FGVYATDVYTDEAIKVVNSH-NKSEPL 118
           G +            WG D F   + V  +   F  + TDV+ DEA K + +  +K +P 
Sbjct: 145 GGVGQVPDY------WGNDYFNDTYYVNGEFVKFEGFCTDVWFDEAKKFMKTQISKKKPF 198

Query: 119 FLMLAHSAVHSGNPYEPIRAPQKLIDAFKY--IDDSARQKFAAVLSKLDESVGKVVKALH 176
           F  +  +A     P+ P+RAPQK +D +    +  +  + F  +++ +D++ G++ + L 
Sbjct: 199 FTFITPNA-----PHGPMRAPQKYLDMYNQTKVKGTKLEAFFGMITNIDDNFGELREFLK 253

Query: 177 TRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVR 222
             G+ +N++++F+TDNG  ++G       N  + G KN+ ++GG R
Sbjct: 254 DEGVADNTLLIFTTDNGS-SSGI---GVYNAGMTGAKNSNFDGGHR 295


>UniRef50_A6C4W8 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
           n=1; Planctomyces maris DSM 8797|Rep:
           N-acetylgalactosamine 6-sulfate sulfatase - Planctomyces
           maris DSM 8797
          Length = 459

 Score = 92.3 bits (219), Expect = 2e-17
 Identities = 72/259 (27%), Positives = 122/259 (47%), Gaps = 13/259 (5%)

Query: 21  LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGT 80
           + + L+  GY+   VGKW LG         N+GFD     W G ++  DH       +  
Sbjct: 113 IAEVLQKSGYRCGGVGKWSLGDAGTVGRATNQGFD----MWFGYLNQ-DHAHYYFTEYLD 167

Query: 81  DFRRGFEVAHDLFG--VYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVH-SGNPYEPIR 137
           D     E+  +      Y+ D+ T+ A++ +   + ++P FL  A++  H S    +P  
Sbjct: 168 DNEGRLELKGNTKNRQQYSHDLLTERALQFIRD-SAAQPFFLYAAYTLPHFSAKAEDPHG 226

Query: 138 APQKLIDAFKYID-DSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPA 196
                 + +   D D   +K+AA++ +LD  VG+++  ++   L E ++++F++DNGG  
Sbjct: 227 LAVPDTEPYSDRDWDIKSKKYAAMIHRLDRDVGRIMSLVNELQLRERTLIIFTSDNGG-H 285

Query: 197 AGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAA 256
            G      +N PL+G K  L EGG+R       P      +V+ + +   D LPT    A
Sbjct: 286 RGVPAQLHTNGPLRGFKRDLTEGGIRVPFIANWPGTIPAGKVSDEVIAFQDMLPTFAELA 345

Query: 257 GGDLSVLENLDGVNQWDAL 275
           G  +S   NLDG++   AL
Sbjct: 346 GAQVSA--NLDGISVLPAL 362


>UniRef50_Q7UWW9 Cluster: Arylsulfatase; n=2; Planctomycetaceae|Rep:
           Arylsulfatase - Rhodopirellula baltica
          Length = 622

 Score = 91.9 bits (218), Expect = 3e-17
 Identities = 83/277 (29%), Positives = 128/277 (46%), Gaps = 25/277 (9%)

Query: 19  KILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSW 78
           K +    +D GY+T + GKWHLG     + P +RGFD  + F +  I+            
Sbjct: 118 KTMADVFQDAGYRTGIFGKWHLGD-NYPFRPEDRGFDETLWFPSSHINSVPDFWDNDYFD 176

Query: 79  GTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHSGNPY---E 134
            T  R G  VAH     Y TDV+ DEAI+     + ++ P F  +  ++ H   P+   +
Sbjct: 177 DTYIRNGKRVAHS---GYCTDVFFDEAIEWAKQTSPTDSPFFAFIPLNSAHW--PWFVPD 231

Query: 135 PIRAPQKLI-----DAFKYIDDSARQ-----KFAAVLSKLDESVGKVVKALHTRGLLENS 184
             RA  + +     +  + +D +         F A+   +D++VG + + L   GL EN+
Sbjct: 232 QYRARVRTMLGDTTELKRQLDTTPSNLEDLISFLAMGLNIDDNVGTLTQYLDESGLSENT 291

Query: 185 IVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMH 244
           IVVF TDNG   + F D+   N  ++G K  LWEGG R    +  P   +  ++     H
Sbjct: 292 IVVFLTDNG---STFGDH-YFNAGMRGKKTQLWEGGHRVPCLIRWPEQITAQKID-DLTH 346

Query: 245 ISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTES 281
           + D LPTL + A  D  +   LDG +    L   T+S
Sbjct: 347 VQDLLPTLAALADCDEHLPGPLDGTSLAPRLLGETDS 383


>UniRef50_Q7UTH7 Cluster: Arylsulfatase A; n=5; Bacteria|Rep:
           Arylsulfatase A - Rhodopirellula baltica
          Length = 496

 Score = 91.1 bits (216), Expect = 6e-17
 Identities = 73/261 (27%), Positives = 120/261 (45%), Gaps = 18/261 (6%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
           + L    + + LK  GY T + GKWHLG  +  Y P  RGFD       G I   +    
Sbjct: 110 MALTSTTIAEVLKSAGYTTGIFGKWHLGD-EDAYQPDRRGFDETFIHGAGGIGQ-NFAGS 167

Query: 74  EQGSWGTDFRRGFEVAHDLFGV---YATDVYTDEAIKVVNSHNKSE--PLFLMLAHSAVH 128
           +  + GT +       +  F     Y TDV+  +A+  +    KS+  P F  +  +A  
Sbjct: 168 QSDAPGTSYFNPIIKHNGTFVQTEGYCTDVFFQQALGWIRLQTKSDTKPFFAYIPTNA-- 225

Query: 129 SGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVF 188
              P+ P +  ++  D F+    S + +F  ++  +D+++GK++  L    L +N++++F
Sbjct: 226 ---PHAPYKVEKRYSDRFRDKCSSPQSEFLGMIVNIDDNMGKLMGKLDEWDLADNTLLIF 282

Query: 189 STDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSP-LLDSKARVAYQKMHISD 247
            TDNG  A G   +   N  +KG K T+ EGG R   F+  P   +S   +     H+ D
Sbjct: 283 MTDNGS-AKG---SKIYNAGMKGGKGTVNEGGSRVPLFMRLPGFTNSGVDIETMTRHV-D 337

Query: 248 WLPTLYSAAGGDLSVLENLDG 268
             PTL   A  ++    +LDG
Sbjct: 338 LFPTLAEIAHAEIPAEADLDG 358


>UniRef50_Q64WT3 Cluster: N-acetylgalactosamine-6-sulfatase; n=5;
           Bacteria|Rep: N-acetylgalactosamine-6-sulfatase -
           Bacteroides fragilis
          Length = 509

 Score = 91.1 bits (216), Expect = 6e-17
 Identities = 85/277 (30%), Positives = 128/277 (46%), Gaps = 24/277 (8%)

Query: 12  RGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKE-YLPLNRGFDSHV-GFWTGRIDMYD 69
           +GL   + I P  L+  GYKT  VGK H G  K E   P N GFD ++ G   G    Y 
Sbjct: 128 KGLTHQDMIYPYLLQQAGYKTIHVGKAHFGCLKSEGENPTNLGFDVNIAGSAIGHPGSYH 187

Query: 70  HTTMEQGSWGTDFRR--GFEVAHDLFGVYATDVYTDEAIKVVNSH-NKSEPLFLMLAHSA 126
                    G   R     E  H     + +D  T EA K +     + +P +L +AH A
Sbjct: 188 GENGYGWIKGQRARAVPDLEQYHKTH-TFLSDALTLEAGKEIEKAVAEKKPFYLNMAHYA 246

Query: 127 VHSGNPYEPIRAPQKLIDAFKYIDDSARQK-FAAVLSKLDESVGKVVKALHTRGLLENSI 185
           VHS     P    ++ I  +   + S + + FA ++  +D+S+G ++  L   G+ EN++
Sbjct: 247 VHS-----PFETDERFISHYTDPNKSQQARAFATLIEGMDKSLGDILDKLEDMGIAENTL 301

Query: 186 VVFSTDNGGPA-AGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WS-PLLDSKARVAY-- 240
           ++F  DNGG A  G   +  S+ P KG K + +EGGVR    + W+ P  ++K + AY  
Sbjct: 302 IIFLGDNGGDAPLGDAADYGSSAPFKGKKGSEYEGGVRVPFIVSWAHPNPNNKFQKAYPI 361

Query: 241 -------QKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
                  Q   + D  PT+ S AG   +    LDG +
Sbjct: 362 ARNAIQTQMGTVMDIYPTVLSVAGVKPAPNHILDGAD 398


>UniRef50_A6DFR6 Cluster: N-acetylgalactosamine-4-sulfatase; n=1;
           Lentisphaera araneosa HTCC2155|Rep:
           N-acetylgalactosamine-4-sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 573

 Score = 91.1 bits (216), Expect = 6e-17
 Identities = 86/302 (28%), Positives = 138/302 (45%), Gaps = 23/302 (7%)

Query: 17  NEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQG 76
           +EK +  +    GYKT +VGKWHLG     Y P +RGF        G I           
Sbjct: 99  DEKTIADHFVAAGYKTGMVGKWHLGD-NAPYRPEDRGFQDVFRIGGGSIGQLPDYWKNDL 157

Query: 77  SWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPI 136
             G  + +G  V    F    TDV  D A+  V   NK  P FL ++ +A HS     P 
Sbjct: 158 WDGHYWNKGQWVKTKGF---CTDVQFDYALDFV-EENKKSPFFLFISTTAPHS-----PT 208

Query: 137 RAPQKLIDAFKYID-DSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGP 195
            A +K ++ ++ +  D     F  +++ +D+++G++   L    L EN+I++FS+DNG  
Sbjct: 209 GADKKYLEPYEKLGLDKGICAFYGMVTNIDDNIGRLRNKLRELKLEENTILIFSSDNGSA 268

Query: 196 AAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSP---LLDSKARVAYQKMHISDWLPTL 252
                D  + N  ++G K +L+EGG R   FL+ P    +  K ++     HI D LPTL
Sbjct: 269 CDKKGD--SFNGGMQGKKGSLYEGGHRVPCFLYWPKGGWIGGK-QLDQVTAHI-DILPTL 324

Query: 253 YSAAGGDLSVLENLDGVNQWDALSKNTES-PRTSVLHN----IDDIWGIAALTVDKYKLI 307
             A   +  +    DG+     ++K  +   R  +  N     D  +  + +  D+++LI
Sbjct: 325 LKACAIENPLNTAFDGIELNGIIAKPAQKLSRLLITENKANKRDQEFQNSVVLTDEWRLI 384

Query: 308 KG 309
            G
Sbjct: 385 DG 386


>UniRef50_A4AAM5 Cluster: Sulfatase; n=1; Congregibacter litoralis
           KT71|Rep: Sulfatase - Congregibacter litoralis KT71
          Length = 500

 Score = 91.1 bits (216), Expect = 6e-17
 Identities = 84/287 (29%), Positives = 133/287 (46%), Gaps = 41/287 (14%)

Query: 18  EKILPQYLKDLGYKTHLVGKWHL--GSYKKEY-LPLNRGFDSHVGF--WTGRIDMYDHTT 72
           E  L    K  GY+T ++GKWHL  G + ++   P + GFD   G   W     + D T 
Sbjct: 128 ETTLADLAKARGYRTAVIGKWHLNGGLHMRDVPQPRDFGFDYQYGLAAWVKNASVADSTE 187

Query: 73  MEQGSWGTDFRRGFEVAHDLFGV---YATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHS 129
           + +   G  F       ++  GV   Y+ ++ +DEAI  + +   S+P FL+L +S VH+
Sbjct: 188 LPRR--GPMFPDNMYRNNEPVGVTDKYSAELVSDEAIGWLQA--SSDPFFLLLTYSEVHT 243

Query: 130 GNPYEPIRAPQKLIDAFK-YIDDSA------------------RQKFAAVLSKLDESVGK 170
                PI +P   +DA++ Y+ D A                  R ++ A +S LD  +G+
Sbjct: 244 -----PIASPPAYLDAYREYLSDEAKHNPFLYYFDWRNRPWRGRGEYYANISFLDAQLGR 298

Query: 171 VVKALHTRGLLENSIVVFSTDNGGPA-AGFN----DNAASNYPLKGVKNTLWEGGVRGAG 225
           V+  L  + +L+N+++VFS+DNG    A         A     L+G K  L+EGG+R  G
Sbjct: 299 VIGHLRDQKILDNTLIVFSSDNGPVTDAALTPWELGMAGETGGLRGKKRFLFEGGIRVPG 358

Query: 226 FLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQW 272
            +  P      RV ++ +   D  PTL      D+     LDG + W
Sbjct: 359 IIRYPHRIEAGRVEHRAVTALDIFPTLAEWLDVDVEPRVPLDGQSLW 405


>UniRef50_A6BZT7 Cluster: Putative arylsulfatase; n=1; Planctomyces
           maris DSM 8797|Rep: Putative arylsulfatase -
           Planctomyces maris DSM 8797
          Length = 459

 Score = 90.6 bits (215), Expect = 7e-17
 Identities = 75/280 (26%), Positives = 126/280 (45%), Gaps = 18/280 (6%)

Query: 23  QYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGTDF 82
           + LK  GY T   GKW LG       P  +GFD   G     + ++ H       W  + 
Sbjct: 101 EVLKIAGYATGAFGKWGLGYEGTPGRPGQQGFDDFTG---QLLQVHAHFYYPFWIWNNEH 157

Query: 83  RRGF-EVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVH--------SGNPY 133
           R    E  ++  G Y  D+  ++A K     NK++P F  L +   H        S  PY
Sbjct: 158 RLMLPENENNQRGRYIHDLIHEDA-KAFIQKNKAQPFFAYLPYIIPHVELVVPEESEKPY 216

Query: 134 EPIRAPQKLIDAFK-YI-DDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
                 ++++D    YI  +     FA ++S+LD+ VG++V  L   G+ +N++++F++D
Sbjct: 217 RGQFPKKQILDPRPGYIGSEDGLTTFAGMVSRLDDHVGEIVTLLEDLGIRDNTLIIFTSD 276

Query: 192 NGGPAAGF---NDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDW 248
           NGG    +    D    N PL+G K +++EGG+R       P   +  + +  ++   D 
Sbjct: 277 NGGQGGTWKEMTDFFNGNAPLRGHKGSMYEGGIRVPFIANWPGKIAAGKTSDLQIAFWDV 336

Query: 249 LPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLH 288
           LPTL   AG  +    ++DG++    L    + P    L+
Sbjct: 337 LPTLAQVAGTTVPSGVDIDGISFLPTLLGKGKQPEHEYLY 376


>UniRef50_Q7UX95 Cluster: Arylsulfatase; n=3; Planctomycetaceae|Rep:
           Arylsulfatase - Rhodopirellula baltica
          Length = 538

 Score = 90.2 bits (214), Expect = 1e-16
 Identities = 79/282 (28%), Positives = 137/282 (48%), Gaps = 37/282 (13%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
           LP++E  + +YLK +GY+T   GKW LG +     P  +GFD   GF       + H   
Sbjct: 166 LPVDEVTIAEYLKSVGYRTGAFGKWGLGHFGTTGDPNEQGFDLFYGF---NCQRHAHNHY 222

Query: 74  EQGSWGTDFRRGFEVAHD--LFG-VYATDVYTDEAIKVVN---SHNKSEPLFLMLAHSAV 127
               W    +   +  +D  L G  Y+ D + +EA + +    + +K++P F  L  +  
Sbjct: 223 PNFLWRNRVKE-VQPGNDRTLHGETYSQDQFVNEACEFIRQSVAEDKTQPFFAYLPFAV- 280

Query: 128 HSGNPYEPIRAPQKLIDAFKYIDDSA-------------RQKFAAVLSKLDESVGKVVKA 174
               P+  I+ P++ +DA+  + + A             R  +AA+++++DE VG+VV  
Sbjct: 281 ----PHLSIQVPEEEVDAYDGVIEEADYEHHGYLKHPRPRAGYAAMVTRMDEGVGQVVDL 336

Query: 175 LHTRGLLENSIVVFSTDNGG--PAAGFND----NAASNYPLKGVKNTLWEGGVRGAGFLW 228
           + + GL EN++++F++DNG      G +D    N+AS   +KG+K  L EGG+R      
Sbjct: 337 VDSLGLGENTLIMFTSDNGPTYDRLGGSDSDYFNSASG--MKGLKGQLDEGGIRVPMIAR 394

Query: 229 SPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
              +    R +       D+LPT+  AAG ++      DG++
Sbjct: 395 QTGVVPAGRTSDWIGAWWDFLPTITDAAGVEVDA-STTDGIS 435


>UniRef50_A6LED2 Cluster: Arylsulfatase A; n=1; Parabacteroides
           distasonis ATCC 8503|Rep: Arylsulfatase A -
           Parabacteroides distasonis (strain ATCC 8503 / DSM 20701
           / NCTC11152)
          Length = 468

 Score = 89.8 bits (213), Expect = 1e-16
 Identities = 76/282 (26%), Positives = 130/282 (46%), Gaps = 28/282 (9%)

Query: 5   VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGR 64
           V++ A  +GL   E  + + +K+ GY T  +GKWHLG  +  +LP  +GFD + G     
Sbjct: 108 VLFPASHKGLNPGEITIAELMKEQGYATACIGKWHLGD-QLPFLPTRQGFDYYYGIPYSN 166

Query: 65  IDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAH 124
            DM D           +      V HD   +     YT++ ++ + SH +S P F+ L H
Sbjct: 167 -DM-DRPYCPLPLMEQEEVIVAPVGHDSLTIR----YTNKTVEFIKSHKES-PFFIYLCH 219

Query: 125 SAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENS 184
           +  H+     P+ A      AFK    S    +     +LD S+G +++ L   GL +N+
Sbjct: 220 NMTHN-----PLAASP----AFK--GKSQNGLYGDATEELDWSMGVLLETLKEEGLDQNT 268

Query: 185 IVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMH 244
           +++F++DNG           +N PL+G K T +EGG R    +  P      +     + 
Sbjct: 269 LIIFTSDNGAD----EHFGGTNRPLRGQKGTTYEGGFRVPCIMRWPAKIPAGQETDNLVT 324

Query: 245 ISDWLPTL-----YSAAGGDLSVLENLDGVNQWDALSKNTES 281
             D+LPTL     Y+     +    N+ G+ + ++++  TE+
Sbjct: 325 SMDFLPTLAHYCSYAVPSDRVIDGHNVSGILEGESMASPTET 366


>UniRef50_A4AQQ7 Cluster: N-acetylgalactosamine 6-sulfatase; n=4;
           Bacteria|Rep: N-acetylgalactosamine 6-sulfatase -
           Flavobacteriales bacterium HTCC2170
          Length = 596

 Score = 89.8 bits (213), Expect = 1e-16
 Identities = 85/312 (27%), Positives = 142/312 (45%), Gaps = 34/312 (10%)

Query: 6   IYGAEPRGLPLNEK--ILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTG 63
           +Y     G   N K   + +  K  GYKT   GKWH G  +  Y P +RGFD + GF +G
Sbjct: 102 VYSTSTGGERFNSKETTIAEIFKKAGYKTTAYGKWHSGM-QPPYHPNSRGFDDYYGFTSG 160

Query: 64  RIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLA 123
               Y    +E          G  V  + F V   D  T++ +  + + NK+ P FL L 
Sbjct: 161 HWGNYFSPMLEHN--------GEIVKGEGFLV---DDLTNKGLDFI-TENKNNPFFLYLP 208

Query: 124 HSAVHSG----NPYEPIRAPQKLIDAFKYIDDSARQKFA----AVLSKLDESVGKVVKAL 175
           ++  HS     N Y   R  +K +D     ++   + F     A++  +D ++G++   L
Sbjct: 209 YNTPHSPMQVPNEYWE-RFEKKKLDMRYQGNEEESENFTRAALAMVENIDFNMGRLTNKL 267

Query: 176 HTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDS 234
              GL EN+I+V+ +DNG     +N        ++G K +  EGGVR   F+ W   +  
Sbjct: 268 KELGLEENTIIVYLSDNGPNGWRWNGG------MRGRKGSTDEGGVRSPFFIQWKNTIPK 321

Query: 235 KARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIW 294
             +++ Q     D LPTL S AG +   ++++DG +    ++   ++P     H ++   
Sbjct: 322 NKKIS-QIAGAIDILPTLTSLAGINQPTIKSIDGKDLKTLIA--DKNPTWESRHIVNHWR 378

Query: 295 GIAALTVDKYKL 306
           G  ++   KY+L
Sbjct: 379 GKTSIRTQKYRL 390


>UniRef50_A6DHI0 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
           n=1; Lentisphaera araneosa HTCC2155|Rep:
           N-acetylgalactosamine 6-sulfate sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 456

 Score = 89.4 bits (212), Expect = 2e-16
 Identities = 81/282 (28%), Positives = 126/282 (44%), Gaps = 26/282 (9%)

Query: 8   GAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDM 67
           G EP  +P     + + +K+ GY T L+GKW LG    E  P  +GFD   G+       
Sbjct: 95  GQEP--IPAETITVAEKMKEAGYATALIGKWGLGYPGSEGEPNKQGFDYFFGY---NDQK 149

Query: 68  YDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAV 127
           + H    +     +     +        Y+  + TDEA   +   NK  P FL LA+   
Sbjct: 150 HAHNHFPKFLLRNEETLTLKNNSGKEIEYSQYMLTDEAKGFIKK-NKDNPFFLYLAYVIP 208

Query: 128 HSGNPYEPIRAPQKLIDAFKYIDDS---ARQKFAAVLSKLDESVGKVVKALHTRGLLENS 184
           HS      ++ P       +Y D+S    ++K A ++S+LD+ VG ++  L    L EN+
Sbjct: 209 HSR-----LQIPGDDECYLQYKDESWPEKQKKHAGMISRLDKDVGSILDLLKEMNLAENT 263

Query: 185 IVVFSTDNGGPAAG------FNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARV 238
           +VVF++DNG    G      FND+     PL G+K +++EGGVR       P +    +V
Sbjct: 264 LVVFTSDNGAHREGGARPEFFNDSG----PLSGIKRSMYEGGVRVPFIAHWPGVIKPGQV 319

Query: 239 AYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTE 280
           +       D +PT     G  +   E +DG++    L  N E
Sbjct: 320 SNHIGAHWDLMPTACELGG--VQPPEGIDGISYVPLLKGNME 359


>UniRef50_UPI00005887B4 Cluster: PREDICTED: similar to galactosamine
           (N-acetyl)-6-sulfate sulfatase; n=1; Strongylocentrotus
           purpuratus|Rep: PREDICTED: similar to galactosamine
           (N-acetyl)-6-sulfate sulfatase - Strongylocentrotus
           purpuratus
          Length = 465

 Score = 89.0 bits (211), Expect = 2e-16
 Identities = 73/245 (29%), Positives = 111/245 (45%), Gaps = 31/245 (12%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           G+P +E +LP+ LK  GYK+ +VGKWHLG +  +YLPL  GFD   G     I    +  
Sbjct: 81  GIPDSEILLPKLLKLSGYKSKIVGKWHLG-HLPQYLPLKHGFDEWFGAPNCHIKSLPNIP 139

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVN-SHNKSEPLFLMLAHSAVHSGN 131
           + + S             ++ G Y    +  E +  +  S    +P FL     A H   
Sbjct: 140 VYRDS-------------EMIGRY----FEQEGLNFIEKSAEAKQPFFLYWTPDATH--- 179

Query: 132 PYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
             EP+ A +       ++  S R  +   + +LDE VG+++  L    +  N+ VVF++D
Sbjct: 180 --EPVYASKP------FLGRSQRGLYGDAVIELDEGVGQILGKLKELQIDTNTFVVFTSD 231

Query: 192 NGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPT 251
           NG  A    +N  +N P    K T +EGG+R     W P      RV +Q  +I D   T
Sbjct: 232 NGA-ATYAKENGGTNGPYLCGKRTTYEGGMRVPTIAWWPTHIKPGRVTHQIGNIMDLFTT 290

Query: 252 LYSAA 256
             + A
Sbjct: 291 ALNLA 295


>UniRef50_Q7UG72 Cluster: Arylsulfatase A [precursor]; n=1;
           Pirellula sp.|Rep: Arylsulfatase A [precursor] -
           Rhodopirellula baltica
          Length = 503

 Score = 89.0 bits (211), Expect = 2e-16
 Identities = 85/338 (25%), Positives = 138/338 (40%), Gaps = 36/338 (10%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF------WT---G 63
           GL   E    +  K  GY+T   GKWHLG + K +LP N+GFD   G       W     
Sbjct: 110 GLAPAETTFAEVCKSAGYRTACHGKWHLGHHPK-FLPTNQGFDQFYGIPYSNDMWPLHPD 168

Query: 64  RIDMYDHTTMEQGSW----------GTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHN 113
            I        + G+W          G   R   +          T   T  +++ + + +
Sbjct: 169 TIRRQQKDPNDPGNWPPLPIIESIAGQPPRIVNDNVQPADQEQMTVELTRRSVEFIKNQS 228

Query: 114 KSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVK 173
             +P  L L H  VH          P  + + F+    S    F  V+ ++D SVG+++ 
Sbjct: 229 SDKPFLLYLPHPMVH---------VPLYVSERFR--GKSGAGLFGDVMMEVDWSVGEILS 277

Query: 174 ALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLD 233
           A+ +    +N++V+F++DNG P   + ++A S  PL+  K T WEGGVR    +W P   
Sbjct: 278 AIESIDQQKNTLVIFTSDNG-PWLSYGNHAGSAAPLREGKGTQWEGGVREPTLMWWPETI 336

Query: 234 SKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDAL--SKNTESPRTSVLHNID 291
                        D LPT+    GG+ +    +DG +  D +      +SP  S +    
Sbjct: 337 PAGTTCETFCSTIDVLPTIVELTGGE-APERKIDGHSIVDLMLDVPGAKSPHESFVGYYG 395

Query: 292 DIWGIAALTVDKYKLIKGTIYKGVWDNWYGPSGREGAY 329
               +  +  +++KL+    Y+ + D   G  G    Y
Sbjct: 396 G-GQLQTIRNERFKLVFPHAYRTLGDREPGKDGMPDGY 432


>UniRef50_A6PEH5 Cluster: Sulfatase precursor; n=1; Shewanella
           sediminis HAW-EB3|Rep: Sulfatase precursor - Shewanella
           sediminis HAW-EB3
          Length = 517

 Score = 89.0 bits (211), Expect = 2e-16
 Identities = 89/331 (26%), Positives = 137/331 (41%), Gaps = 41/331 (12%)

Query: 12  RGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHT 71
           RGL   +  L + LKD GY T  VGK HLG    ++LP   GFD   GF      M  H 
Sbjct: 104 RGLQKEDPTLAEMLKDKGYATVHVGKSHLGD-NNDHLPTVHGFDEFYGFLYHLNVMEMHE 162

Query: 72  TMEQGSWGTDFRRGFEVAHDL------------FGVYATDVYTDEAIKVVNSHNKSEPLF 119
             E         RG  + H +            FGV      +D+           +  F
Sbjct: 163 QPEFPKDPNFKGRGRNMIHTVATDKFDDTVDPRFGVIGKQTISDQGELGAKRMQTVDGEF 222

Query: 120 LMLA------HSAVHSGNPY----EPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVG 169
           L  A      H A +   PY     P R  QK     +Y   S    +   L +LD+ +G
Sbjct: 223 LDFAINWLEKHEATNDDQPYFMWYNPTRMHQKTHVRPEYQGASQHNTYYDGLVELDDQIG 282

Query: 170 KVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWS 229
            ++  L   G ++N+I++F++DNG     + D+ A+++  +G K T W+GG R    +  
Sbjct: 283 VLLDKLEATGEIDNTIILFTSDNGVNLDHWPDSGAASF--RGQKGTTWDGGFRVPMLVSW 340

Query: 230 PLLDSKARVAYQKMHISDWLPTLYSAAG--------------GDLSVLENLDGVNQWDAL 275
           P    +       M   DW+PT+ +AAG               D +   ++DG NQ D L
Sbjct: 341 PAKIPQGEYTDGLMSAEDWVPTIMAAAGDADIKQDLLTGKKINDETYKVHIDGYNQLDML 400

Query: 276 SKNTESPRTSVLHNIDDIWGIAALTVDKYKL 306
           ++  +S R       ++   + A  VD++K+
Sbjct: 401 TEGGKSNRHEFFFYNEN--SLNAFRVDEWKV 429


>UniRef50_UPI00005846A1 Cluster: PREDICTED: similar to
           arylsulfatase; n=1; Strongylocentrotus purpuratus|Rep:
           PREDICTED: similar to arylsulfatase - Strongylocentrotus
           purpuratus
          Length = 552

 Score = 88.6 bits (210), Expect = 3e-16
 Identities = 77/290 (26%), Positives = 125/290 (43%), Gaps = 24/290 (8%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKE-----YLPLNRGFD--SHVGFWTGRI 65
           GLP  E  + + LK+ GY T + GKWHLG   +      +LP++ GFD   H+  +T  +
Sbjct: 142 GLPSTELTIAEALKEEGYTTGMAGKWHLGLNSETRDDGVHLPMHHGFDFVGHILPFTNSM 201

Query: 66  DMYDHTTMEQGSWGTD---FRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLML 122
              D          T    ++R   VA      Y T  + ++A+  +   N  +P F   
Sbjct: 202 ACDDTGRFVDFPDVTKCFLYKRDQIVAQPFNHTYLTQTFVNDAVSFIED-NAHDPFFFYF 260

Query: 123 AHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLE 182
             S     +P+ P+ A        ++   S R ++   ++++  +VG+V+ AL  +GL +
Sbjct: 261 PFS-----HPHVPLYASP------RFAGKSQRGEYGDNINEMSWAVGEVIDALEAKGLSQ 309

Query: 183 NSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQK 242
           N++V+F  D+ GP   +  +       KG K   WEGG+R     + P      R +   
Sbjct: 310 NTLVLFLADH-GPQPEYCAHGGDPSIFKGYKTNTWEGGIRVPFVAYWP-GQITPRESDAL 367

Query: 243 MHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDD 292
           +   D + T+   A G L      DG    D L KN  SP   + H   D
Sbjct: 368 VSTLDIMRTVVDLANGTLPDDTAYDGEVITDVLLKNAPSPHDVLYHYCKD 417


>UniRef50_Q7UYH3 Cluster: Arylsulfatase; n=1; Pirellula sp.|Rep:
           Arylsulfatase - Rhodopirellula baltica
          Length = 598

 Score = 88.6 bits (210), Expect = 3e-16
 Identities = 79/273 (28%), Positives = 127/273 (46%), Gaps = 36/273 (13%)

Query: 17  NEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQG 76
           +E  L + L + GY+T + GKWHLG       P+++GFD  +    G I         +G
Sbjct: 114 DEVTLAERLSEAGYQTGIFGKWHLGD-NYPMRPMDQGFDESLIHRGGGIGQPSDPIGAEG 172

Query: 77  SW--GTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKS-EPLFLMLAHSAVHSGNPY 133
            +   T F  G EVA +    Y TD++ D AI       +S +P F  +A +A H   P+
Sbjct: 173 KYTDPTLFHNGDEVAME---GYCTDIFFDAAIDFARKQTESGKPFFTYIATNAPH--GPF 227

Query: 134 EPIRAPQKLIDAFKYID--------------DSARQKFA---AVLSKLDESVGKVVKALH 176
           + +  P +L + +K +D              D+   K A   A+++ +D++VGK+  +L 
Sbjct: 228 DDV--PNELYEEYKQVDFTPILVSDLPAKRRDAEFDKLARISAMITNIDQNVGKLFASLD 285

Query: 177 TRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRG-AGFLWSPLLDSK 235
              + EN+IV++  DNG  +  +  N      ++G K  + +GG+R    F W   +D+ 
Sbjct: 286 ELKIRENTIVLYLNDNGPNSRRYVGN------MRGNKTQVDDGGIRSPLLFHWPAKVDAS 339

Query: 236 ARVAYQKMHISDWLPTLYSAAGGDLSVLENLDG 268
                   HI D +PTL  A G   S    LDG
Sbjct: 340 DTTDVMLAHI-DLMPTLLDACGVAASESPALDG 371


>UniRef50_A6DGD3 Cluster: Putative exported uslfatase; n=3;
           Bacteria|Rep: Putative exported uslfatase - Lentisphaera
           araneosa HTCC2155
          Length = 713

 Score = 88.6 bits (210), Expect = 3e-16
 Identities = 85/324 (26%), Positives = 144/324 (44%), Gaps = 35/324 (10%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSY---KKEYLPLNRGFDSHV-GFWTGRIDMYD 69
           +PL +  L + LK++GYKT  +GKWHL ++    + + P   GFD ++ G   G+   + 
Sbjct: 331 MPLEDITLAEALKEVGYKTAHIGKWHLQAHHDTSRNHFPEKHGFDLNIAGHRMGQPGSFY 390

Query: 70  HTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHS 129
                +    T+     ++A    G Y TD  TD+AI  +   NK  P FL   +  VH+
Sbjct: 391 FPYKSKQHPSTNVP---DMADGQEGDYLTDKLTDKAIHYI-KENKDTPFFLNFWYYTVHT 446

Query: 130 --------GNPYEP------IRAPQKLIDAFKYIDDSARQ--KFAAVLSKLDESVGKVVK 173
                      YE       I   Q  I   K    S++    +AA++  +DE++G++ K
Sbjct: 447 PIIPRQDLKKKYEAKANELGINKNQPGIPVLKSFARSSQNNPSYAAMVEAMDENIGRIFK 506

Query: 174 ALHTRGLLENSIVVFSTDNGGPAAGFNDNA-ASNYPLKGVKNTLWEGGVRGAGFL-WSPL 231
            L    + + +I++F +DNGG +     N   S  PLK  K  ++EGG+R    + W   
Sbjct: 507 TLKELQIDDETIIIFCSDNGGLSTSTGPNCPTSQLPLKAGKAWVYEGGIRIPFIIKWPGK 566

Query: 232 LDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVL---- 287
              K   A   +  +D  PTL           ++LDGV+    ++   +  +   L    
Sbjct: 567 KGGKELQA--PVCTTDIYPTLLDMLKLPAKPEQHLDGVSLTSLMNGQAKELQREALFIHY 624

Query: 288 ---HNIDDIWGIAALTVDKYKLIK 308
              H+I+ +    A+ +  YKL++
Sbjct: 625 PHYHHINSMGPAGAVRMGDYKLVE 648


>UniRef50_A6C6V5 Cluster: N-acetylgalactosamine-6-sulfatase; n=1;
           Planctomyces maris DSM 8797|Rep:
           N-acetylgalactosamine-6-sulfatase - Planctomyces maris
           DSM 8797
          Length = 520

 Score = 88.6 bits (210), Expect = 3e-16
 Identities = 70/220 (31%), Positives = 107/220 (48%), Gaps = 19/220 (8%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYL-PLNRGFDSHV-GFWTGRIDMYDH 70
           GL  ++  LP+ L+  GY+T  VGK H G+       PLN GFD ++ G   G    Y  
Sbjct: 139 GLKKDDVTLPRLLEKAGYRTIHVGKGHFGADGFPGAEPLNLGFDVNIAGSSFGAPGSYHG 198

Query: 71  TTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE------PLFLMLAH 124
             M++   GT  RR  +    L   + TD++  EA+ +  +   +E      P FL +AH
Sbjct: 199 --MKKFGLGT--RRAHQAVPHLEKYHDTDIFLTEALTIEANATLAETVKADQPFFLYMAH 254

Query: 125 SAVHSGNPYEPIRAPQKLIDAFKYIDDSAR-QKFAAVLSKLDESVGKVVKALHTRGLLEN 183
            AVH+     P  +  +  D +K  D     Q FA ++  +D+S+G ++  L   G+ EN
Sbjct: 255 YAVHA-----PFDSDPRFADHYKDSDKPKNAQAFATLIEGMDKSLGDIMNQLDQLGVAEN 309

Query: 184 SIVVFSTDNGGPA-AGFNDNAASNYPLKGVKNTLWEGGVR 222
           +++ F  DNG  A  G     A   PL+G K   +EGG+R
Sbjct: 310 TLIFFLGDNGSDAPLGHQHAVACAAPLRGKKGAHYEGGMR 349


>UniRef50_Q7UMZ6 Cluster: Arylsulfatase A; n=1; Pirellula sp.|Rep:
           Arylsulfatase A - Rhodopirellula baltica
          Length = 492

 Score = 87.8 bits (208), Expect = 5e-16
 Identities = 81/271 (29%), Positives = 119/271 (43%), Gaps = 21/271 (7%)

Query: 5   VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGR 64
           V+    P GL  +E  + + LK   Y T LVGKWHLG  + E+LP ++GFD   G     
Sbjct: 104 VLRPVSPYGLHPDEITIAEVLKQQNYATALVGKWHLGD-QPEFLPTHQGFDWFFGV-PYS 161

Query: 65  IDMYDHTTMEQGS-WG----TDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLF 119
            DM +    + GS W      +     E   +  G+  T  YT+ A++ +  H K EP F
Sbjct: 162 DDMTERIWKQDGSHWPPLPLMENETVIEAPCNRDGL--TKRYTERAMQWIAEH-KDEPFF 218

Query: 120 LMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRG 179
           L    +    G+   P  +     DAF+    S    +   + +LD S+G+++  L   G
Sbjct: 219 LYFPQAM--PGSTKTPFSS-----DAFR--GKSRNGPWGDAVEELDWSIGQMLDQLVKLG 269

Query: 180 LLENSIVVFSTDNGGPAAGFNDNAA--SNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKAR 237
           + E + V++++DNG P     D+ +  SN PL G   T  EG  R     W P       
Sbjct: 270 IAEKTFVIWTSDNGAPINRDPDDLSRGSNLPLHGRGYTTSEGAFRVPTIAWHPGKVPAGT 329

Query: 238 VAYQKMHISDWLPTLYSAAGGDLSVLENLDG 268
              +     D LPT  + AG  L     LDG
Sbjct: 330 QCDELATTMDLLPTFANLAGCKLPTNRKLDG 360


>UniRef50_Q7UER7 Cluster: Sulfatase 1; n=6; Bacteria|Rep: Sulfatase
           1 - Rhodopirellula baltica
          Length = 553

 Score = 87.8 bits (208), Expect = 5e-16
 Identities = 73/267 (27%), Positives = 122/267 (45%), Gaps = 22/267 (8%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWT------GRIDM 67
           +P  +  LP+ L++ GYKT   GKWHLG   +  +P + GFD ++G         G    
Sbjct: 151 MPAKDVTLPEALRESGYKTFFAGKWHLGG--EGSMPTDHGFDINIGGHHRGSPPGGFFAP 208

Query: 68  YDHTTMEQGSWGTDFRR--GFEVAHDLFGVYATDVYTDEAIKVVNSH-NKSEPLFLMLAH 124
           + +  ME G  G    R  G E A  + G      +   +   V+     ++ L+     
Sbjct: 209 FKNPVMEDGPDGESLTRRLGKETASFIEGQDDQPYFAMLSFYAVHGPIQTTQELWQKYRE 268

Query: 125 SAVHSGNPYEPIRAPQKLIDA---FKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLL 181
           SA     P  P    +  ID     + I D+    +A ++  LD +VG V+ A+   G  
Sbjct: 269 SA-----PAPPADGNRFKIDRTLPVRQIQDNP--VYAGMMETLDNAVGDVMAAIEASGKA 321

Query: 182 ENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQ 241
           +N++V+F+ DNGG ++G +  + SN P +G K   WEGG+R   ++  P +  +   +  
Sbjct: 322 DNTLVIFTGDNGGVSSG-DAYSTSNLPHRGGKGRQWEGGLREPYYVSMPAIVPENSTSDV 380

Query: 242 KMHISDWLPTLYSAAGGDLSVLENLDG 268
            +  SD  PT+       L   +++DG
Sbjct: 381 PVIGSDLYPTILDVCNLPLRPQQHIDG 407


>UniRef50_A6DSG4 Cluster: Arylsulphatase A; n=1; Lentisphaera
           araneosa HTCC2155|Rep: Arylsulphatase A - Lentisphaera
           araneosa HTCC2155
          Length = 489

 Score = 87.8 bits (208), Expect = 5e-16
 Identities = 85/273 (31%), Positives = 120/273 (43%), Gaps = 24/273 (8%)

Query: 11  PRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDH 70
           P GL  +E  LP+ +K  GY T LVGKWHLG +K  + PLN G+D   GF          
Sbjct: 106 PIGLNPSEITLPELMKTAGYNTALVGKWHLGEWKP-FHPLNHGYDYFYGFLK-------- 156

Query: 71  TTMEQGSWGTDFRRGFEVAHDLFGVYA-TDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHS 129
             +E     +      E+A  +             AI  +  H K+ P FL+ +    H+
Sbjct: 157 -VIEGSEKPSLIENRKELASKIQKTEGQAPGMVKAAINFMTKHKKN-PFFLVYSDPMPHA 214

Query: 130 GNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFS 189
             PY P        + FK    S R  +  V+ ++D     ++ AL   GL EN+IVVF+
Sbjct: 215 --PYFPS-------EQFK--GTSKRGNYGEVIHEIDWQFKHLMDALDELGLKENTIVVFT 263

Query: 190 TDNGGPAAGFND-NAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDW 248
           +DNG P       +   + PL+  K T +EGGVR    +  P        +   + I D 
Sbjct: 264 SDNGPPVERQKKYDVGLSGPLRDGKWTNFEGGVRVPFIIRWPGKVKVDASSDAMIGIIDM 323

Query: 249 LPTLYSAAGGDLSVLENLDGVNQWDALSKNTES 281
           LPT    AG D+     +DGVN    L  + ES
Sbjct: 324 LPTFCELAGVDVPNDRVIDGVNILPQLLGDQES 356


>UniRef50_A6DID9 Cluster: Putative sulfatase protein; n=1;
           Lentisphaera araneosa HTCC2155|Rep: Putative sulfatase
           protein - Lentisphaera araneosa HTCC2155
          Length = 483

 Score = 87.8 bits (208), Expect = 5e-16
 Identities = 82/280 (29%), Positives = 129/280 (46%), Gaps = 44/280 (15%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLG-SYKKEYLPLNRGFDSHVGFWTGRIDMYDHT 71
           G+   + +LP  LK+ GY+T  +GK H G  +  +  PLN GFD              H 
Sbjct: 129 GIQQGDILLPALLKETGYRTICIGKAHFGMGFSAD--PLNLGFDRK------------HY 174

Query: 72  TMEQGS-WGTDF--RRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAV 127
             E GS  G  F  R  + V  D   V+ ++  T EA K ++   K E P FL L+H A+
Sbjct: 175 ANESGSPIGRRFGGRDPYHVKRDGEQVHLSEALTLEAKKEISDAVKEEKPFFLYLSHYAI 234

Query: 128 HSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVV 187
           H+     PI   ++    +  +D   R  +  ++   D+S+G V+  +   G+ E+++ +
Sbjct: 235 HT-----PIIEDKRFSKNYPNLDTKIRA-YVTLVEGADKSLGDVMDHIEKLGIAEDTLFI 288

Query: 188 FSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSK----------A 236
           ++ DNGG          SN P+KG+KN  +EGG R    + W    +++           
Sbjct: 289 WTADNGG--------LRSNAPMKGLKNDAYEGGHRIPNMVAWGAQDETRVHQKRMPLKPG 340

Query: 237 RVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALS 276
           RV  +     DW+PTL S AG      + LDG +  + LS
Sbjct: 341 RVENRPYIHQDWMPTLLSLAGAQHPKPDLLDGYDITELLS 380


>UniRef50_A6CAR8 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
           n=1; Planctomyces maris DSM 8797|Rep:
           N-acetylgalactosamine 6-sulfate sulfatase - Planctomyces
           maris DSM 8797
          Length = 501

 Score = 87.4 bits (207), Expect = 7e-16
 Identities = 82/302 (27%), Positives = 128/302 (42%), Gaps = 34/302 (11%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           G+ + EK+LP  LK  GY + + GKW LG +K+ +LPL RGFD   GF    ID + H  
Sbjct: 133 GMDVREKLLPALLKPAGYVSAIYGKWDLGIHKR-FLPLARGFDDFYGFTNTGIDYFTH-- 189

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
            E+    + +R       D  G Y T ++  EA++ +   N  +P FL L  +A H  + 
Sbjct: 190 -ERYGVPSMYRNNQPTEEDK-GTYCTYLFQREAVRFI-KENHQKPFFLYLPFNAPHGASS 246

Query: 133 YEP-----IRAPQKLIDAFKYIDDS-ARQKFAAVLSKLDESVGKVVK---ALHTRGLLEN 183
            +P      +AP+K  + + ++ D+   +K        +   G V+    +   R L   
Sbjct: 247 LDPRIRGGAQAPEKYKNMYPHLKDTLVTKKKTGRYEFRERPDGPVIHQGVSASKRRLEYV 306

Query: 184 SIVVFSTDNGGPAAG---------------FNDN----AASNYPLKGVKNTLWEGGVRGA 224
           + +    D  G   G               F+DN     A N PLKG K  ++EGG+R  
Sbjct: 307 ASITCMDDAIGEVLGLLDEYQIADNTIVVFFSDNGGSGGADNSPLKGKKGMMFEGGIRVP 366

Query: 225 GFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRT 284
             +  P       V  + +   + +PT    A   L     +DG +    L   T SPR 
Sbjct: 367 CLVRYPAKIKPGTVNDELLTSLELVPTFLKEAAIPLPENVVIDGYDMLPVLMGKTTSPRN 426

Query: 285 SV 286
            +
Sbjct: 427 EM 428


>UniRef50_A3I2G9 Cluster: Putative secreted sulfatase; n=1;
           Algoriphagus sp. PR1|Rep: Putative secreted sulfatase -
           Algoriphagus sp. PR1
          Length = 512

 Score = 87.4 bits (207), Expect = 7e-16
 Identities = 78/275 (28%), Positives = 127/275 (46%), Gaps = 23/275 (8%)

Query: 18  EKILPQYLKDLGYKTHLVGKWH---LGSYKKEYLPLNRGFDSHV---GFWTGRIDMYDHT 71
           E +LP  LK  GY+T + GK+H   L    K   P   GFD ++   GF   +   Y   
Sbjct: 127 ENMLPAMLKKQGYRTIISGKYHACDLCPEDKSPTPEAAGFDVNIAGTGFGAPK-SYYGID 185

Query: 72  TMEQGSWGTDFRRGFEVAHDLFG--VYATDVYTDEAIKVVN-SHNKSEPLFLMLAHSAVH 128
           + ++ +  T    G E     FG  ++ T+  T EA+K    + +K +P FL L+H AVH
Sbjct: 186 SFQRKNTETQPMPGLE---SYFGKEIHLTEALTIEALKASKVAVDKGQPFFLYLSHHAVH 242

Query: 129 SG-NPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVV 187
           +     +P R    L +     +  A   +A ++  +D S+G+V+KAL   G+  N++++
Sbjct: 243 TPIQEQKPYRENYTLTEG----EPEAEAAYATMIEGVDNSLGEVIKALDDWGIANNTLLI 298

Query: 188 FSTDNGGPA-----AGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQK 242
           F +DNGG            +   NYPL+  K + +EGG+R    +  P    K  V+   
Sbjct: 299 FYSDNGGRVLFRGKKSLYGDFEFNYPLRSGKASNYEGGIRVPCVVRWPGKVKKQTVSDAP 358

Query: 243 MHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSK 277
           + I D   T+  A    +     +DG++    L K
Sbjct: 359 LVIEDIYTTVLEATHTKIPDDYAIDGMSWLPVLEK 393


>UniRef50_A6KZ75 Cluster: Putative secreted sulfatase; n=1;
           Bacteroides vulgatus ATCC 8482|Rep: Putative secreted
           sulfatase - Bacteroides vulgatus (strain ATCC 8482 / DSM
           1447 / NCTC 11154)
          Length = 517

 Score = 87.0 bits (206), Expect = 9e-16
 Identities = 81/294 (27%), Positives = 140/294 (47%), Gaps = 33/294 (11%)

Query: 23  QYLKDLGYKTHLVGKWHLGSYKKE-YLPLNRGFDSHV-GFWTGRIDMYDHTTMEQGSWGT 80
           + L+  GY T   GK H GS       P + GF+ ++ G   G +  Y     EQ    T
Sbjct: 151 ELLRQNGYHTIHCGKAHFGSIDTPGENPTHWGFEVNIAGHAAGGLATY---LSEQNYGHT 207

Query: 81  DFRRGFEVA-----HDLFG--VYATDVYTDEAIKVVNSHNK-SEPLFLMLAHSAVHSGNP 132
              + + +       D +G  ++AT+  T EAIK ++   K ++P +L +AH A+H    
Sbjct: 208 RDGKPYSLMAIPGLEDYWGTGIFATEALTQEAIKALDKAKKYNQPFYLYMAHYAIHV--- 264

Query: 133 YEPIRAPQKLIDAFKYIDDSARQK---FAAVLSKLDESVGKVVKALHTRGLLENSIVVFS 189
             P+    +     KYI      K   +A+++  +D+S+G ++  L      +N++++F 
Sbjct: 265 --PVDKDMRFFP--KYIKKGLSDKEAAYASLIEGMDKSLGDLMNWLEKNDEADNTVIIFM 320

Query: 190 TDNGGPAA--GFNDNA--ASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMH 244
           +DNGG AA  G+ D      N PL   K +L+EGG+R    + W  ++    R   + + 
Sbjct: 321 SDNGGLAAEPGWRDGQIHTQNAPLNSGKGSLYEGGIREPMIVSWPGVVTPNTR-CDKYLI 379

Query: 245 ISDWLPTLYSAAG-GDLSVLENLDGVNQWDALSKNTESPR--TSVLHNIDDIWG 295
           I D+ PT+   AG  +   +  +DG++ +  L K T  P    +++ N  +IWG
Sbjct: 380 IEDFYPTILEMAGITNYKTVNPIDGIS-FMPLLKGTGDPSKGRALVWNFPNIWG 432


>UniRef50_Q9NJU7 Cluster: Sulfatase 2; n=1; Helix pomatia|Rep:
           Sulfatase 2 - Helix pomatia (Roman snail) (Edible snail)
          Length = 266

 Score = 87.0 bits (206), Expect = 9e-16
 Identities = 36/70 (51%), Positives = 47/70 (67%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           +QH +I+ ++P GLPL    +   LK +GY TH +GKWHLG YKKEY PL RGFDS+ G+
Sbjct: 92  LQHDIIWPSQPYGLPLQFPTIADMLKSVGYSTHAIGKWHLGLYKKEYTPLYRGFDSYYGY 151

Query: 61  WTGRIDMYDH 70
             G  D Y +
Sbjct: 152 LEGGEDYYTY 161



 Score = 42.7 bits (96), Expect = 0.019
 Identities = 25/78 (32%), Positives = 40/78 (51%), Gaps = 7/78 (8%)

Query: 74  EQGSW-GTDFRRGFEVAHDLFGVYATDVYTDEAIKVVN-SHNKSEPLFLMLAHSAVHSGN 131
           ++  W G D R   E   D+ G Y+T +YT +AI ++N +    +P  L LA+ AVHS  
Sbjct: 194 DENKWCGYDLRDMNEPVTDMNGTYSTHLYTKKAIDIINGASTGGKPFLLYLAYQAVHS-- 251

Query: 132 PYEPIRAPQKLIDAFKYI 149
              P+  P +    + +I
Sbjct: 252 ---PMEVPAEYTKPYTFI 266


>UniRef50_Q7UPG6 Cluster: Arylsulphatase A; n=2; Bacteria|Rep:
           Arylsulphatase A - Rhodopirellula baltica
          Length = 485

 Score = 86.6 bits (205), Expect = 1e-15
 Identities = 68/242 (28%), Positives = 108/242 (44%), Gaps = 21/242 (8%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEY---LPLNRGFDSHVGFWTGRIDMYDH 70
           L L E  L + L+D GY T  VGKWHLG   +E     P   GFD     W        H
Sbjct: 125 LRLREVTLAEVLRDAGYATAHVGKWHLGLPTEERDKPTPDQHGFDHWFATWNNA--QPSH 182

Query: 71  TTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSG 130
              +      +F R  E    L G Y+  +  DEAI+ ++ H +S+P      +   H  
Sbjct: 183 RNPD------NFIRNGEPVGQLEG-YSCQLVADEAIRWMDRHRESDPDQPFFLNVWFH-- 233

Query: 131 NPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFST 190
            P+ PI AP ++   +  + D     ++  +   D+++ +++  L   G+ EN+++V+++
Sbjct: 234 EPHAPIAAPDEVTQKYGKLSDKG-AVYSGTIDNTDQAIKRLLAKLDALGVRENTLIVYAS 292

Query: 191 DNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLP 250
           DNG       D       L+G K   WEGG+R  G    P       V+ +   + D LP
Sbjct: 293 DNGSYR---TDRVGK---LRGRKGANWEGGIRVPGIFHWPGHIPAGVVSNEPAGLVDVLP 346

Query: 251 TL 252
           T+
Sbjct: 347 TI 348


>UniRef50_Q8A222 Cluster: N-acetylgalactosamine-6-sulfatase; n=1;
           Bacteroides thetaiotaomicron|Rep:
           N-acetylgalactosamine-6-sulfatase - Bacteroides
           thetaiotaomicron
          Length = 453

 Score = 86.2 bits (204), Expect = 2e-15
 Identities = 80/279 (28%), Positives = 134/279 (48%), Gaps = 29/279 (10%)

Query: 16  LNEKI--LPQYLKDLGYKTHLVGKWHLGSYKKEY-LPL--NRGFDSHVGFWTGRIDMYDH 70
           L++K+  + +  ++ GY T  +GKWH+G  +  +  P   N GFD ++  +    D    
Sbjct: 110 LDDKLPSMARAFQNAGYATGHIGKWHMGGGRDVHNAPSIKNYGFDEYLSTYESP-DPDPA 168

Query: 71  TTMEQGSW-GTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHS 129
            T  +  W   D  + ++          T+ + D++I  +  H K  P FL L    +H+
Sbjct: 169 ITASKWIWCDNDSIKRWK---------RTEYFVDKSIDFIKRH-KDSPFFLNLWPDDMHT 218

Query: 130 GNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFS 189
             P+ P    QK   +++      ++ F+ VL ++D+ +G+ +KAL   GL EN+I++F+
Sbjct: 219 --PWVP-EFKQKERKSWE-----TKEAFSPVLGEMDKQIGRFIKALDDMGLSENTIIIFT 270

Query: 190 TDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHIS-DW 248
           +DN GPA  F   A  +  L+G KN+L+EGG+R    +  P      RV    +  + D 
Sbjct: 271 SDN-GPAPSF--KAVRSAYLRGTKNSLYEGGIRMPFIVKYPKKIKPGRVNNSSVLCAVDL 327

Query: 249 LPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVL 287
            PTL S AG         DG N    L   +E+ R + L
Sbjct: 328 YPTLCSVAGIKTEKNYKGDGQNYAKVLLGKSEAKRKTDL 366


>UniRef50_Q7UQ05 Cluster: Arylsulfatase A; n=2;
           Planctomycetaceae|Rep: Arylsulfatase A - Rhodopirellula
           baltica
          Length = 525

 Score = 85.8 bits (203), Expect = 2e-15
 Identities = 89/337 (26%), Positives = 156/337 (46%), Gaps = 58/337 (17%)

Query: 14  LPLNEKILPQYLKDLG-YKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           L L+E  + ++L+D   Y+T  +GKWHLG     +LP ++GF  ++G          H  
Sbjct: 146 LALDEVTIAEHLRDAADYQTFFLGKWHLGDVG--HLPTDQGFQINIGG--------GHKG 195

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHSGN 131
              G + + ++  +  A    G Y T   TDEA+ +V++ ++ + P F+M+++  VHS  
Sbjct: 196 SPPGGYYSPWKNPYLKAKQ-DGEYLTTRLTDEAVSLVDTASREDKPFFMMMSYYNVHS-- 252

Query: 132 PYEPIRAPQKLIDAFKYIDDSA-------------------RQK---FAAVLSKLDESVG 169
              PI   ++ ID F+    ++                   RQ    +A+++  +D SVG
Sbjct: 253 ---PITPDKRTIDHFEEKQSNSPELQGDTPTIAERDAVTRGRQDNPAYASMVKAVDTSVG 309

Query: 170 KVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWS 229
           +++KAL   G+ +N++V+F +DNGG +         N PL+  K  L+EGG+R    +  
Sbjct: 310 RIMKALKEHGVDDNTLVIFFSDNGGLSTLRKFGPTCNSPLRAGKGWLYEGGIREPLLVRL 369

Query: 230 PLL----DSKARVAYQKMHI------SDWLPTLYSAAGGDLSVLENLDGVNQWDAL---- 275
           P       +   V++Q   +      +D  PT+    G  L    + DG++   A+    
Sbjct: 370 PKTMPGGATNETVSHQPKTVDSVACSTDLFPTILDVVGLPLQPESHADGISLLPAIAGEA 429

Query: 276 SKNTESPRT---SVLHNIDDIWGI-AALTVDKYKLIK 308
           ++   SPR       H    +W   AA+    YKLI+
Sbjct: 430 AETDSSPRDLHWHYPHYHGSLWRPGAAIRRGNYKLIE 466


>UniRef50_A6DTN4 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
           n=2; Lentisphaera araneosa HTCC2155|Rep:
           N-acetylgalactosamine 6-sulfate sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 482

 Score = 85.8 bits (203), Expect = 2e-15
 Identities = 88/314 (28%), Positives = 141/314 (44%), Gaps = 32/314 (10%)

Query: 17  NEKILPQYLKDLGYKTHLVGKWHLGSYKKE-YLPLNRGFDSHVGFWTGRIDMYDHTTMEQ 75
           ++ I P+ L+  GY T ++GK  +G    +  LP  +GFD   GF +       H     
Sbjct: 99  HDLIFPKALQKAGYHTAMIGKSGMGCNTDDAALPYQKGFDYFFGFTS---HTQAHWFFPT 155

Query: 76  GSWGTDFR-RGFEVAHDLF---GVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSG- 130
             W  D +    E  ++       Y+++V  +EA+  V    K  P FL LA    H+  
Sbjct: 156 HLWKNDGKVTKVEYPNNTLHEGDNYSSEVVMNEALDYV-ERQKDGPFFLHLAFQIPHASL 214

Query: 131 -------NPYEPIRAPQKLIDAFKY----IDDSARQKFAAVLSKLDESVGKVVKALHTRG 179
                    Y PI   + L    K+     +   +  FAA++S +D +VG + K L   G
Sbjct: 215 RAKEEWKAKYRPILKEKLLPKKDKHPHYSYEREPKTTFAAMVSYMDHNVGLLNKKLEDLG 274

Query: 180 LLENSIVVFSTDNGGPAAGFN--DNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKAR 237
           L EN++++F++DNG    G +  D+  SN  L+G K  ++EGGVR     + P    K +
Sbjct: 275 LAENTLIMFASDNGAMQEGGHKRDSFDSNGVLRGGKRDMYEGGVRTPMIAYWP---GKIK 331

Query: 238 VAYQKMHIS---DWLPTLYSAAGGDLSVLENLDGVNQWDA-LSKNTESPRTSVLHNIDDI 293
                 HIS   D  PT+   AG    V E+ DG++     L K +++    +     + 
Sbjct: 332 AGQTSDHISAFWDISPTVRELAGA--KVQEDTDGISFVPTLLGKGSQTKHDYLYWEFFEQ 389

Query: 294 WGIAALTVDKYKLI 307
            G  A+ + K+KLI
Sbjct: 390 GGKRAIRMGKWKLI 403


>UniRef50_A6DSF1 Cluster: N-acetyl-galactosamine-6-sulfatase; n=1;
           Lentisphaera araneosa HTCC2155|Rep:
           N-acetyl-galactosamine-6-sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 517

 Score = 85.8 bits (203), Expect = 2e-15
 Identities = 72/251 (28%), Positives = 120/251 (47%), Gaps = 35/251 (13%)

Query: 24  YLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGTDFR 83
           YLK+ GY T  +GKWH+     E      G+D   G           T+ ++GS      
Sbjct: 145 YLKNQGYATAHIGKWHIYGGGPE----KHGYDVSSG----------ETSNDEGS-----P 185

Query: 84  RGFEVAHDLFGVYATDVYTDEAIKVVNSH-NKSEPLFLMLAHSAVHSGNPYEP-IRAPQK 141
           +     +D   +++    T  +IK +    NK +P F+ ++H A HS     P   A  +
Sbjct: 186 KNITDPNDPKRIFSI---TKNSIKFIEKQTNKEKPFFIQVSHYAEHSAQMSLPETLASYE 242

Query: 142 LIDAFKYIDDSARQK----FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAA 197
              A K I D   +K      A ++ +D S+G ++  L    +L+N+ V+F++DNG    
Sbjct: 243 NDPAIKKIKDKKFKKEVITHGAAVTDMDTSIGMIIDKLKELNILDNTYVIFTSDNG--KG 300

Query: 198 GFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAG 257
             +D       L+G K +LWEGG+R    +  P +++K+R + + +   D LPT+Y  AG
Sbjct: 301 LLHDKRI----LRGSKWSLWEGGIRVPFMIMGPGIEAKSRCS-ENIIGYDMLPTIYELAG 355

Query: 258 GDLSVLENLDG 268
           G+   + N+DG
Sbjct: 356 GNTEDMPNVDG 366


>UniRef50_A6DHI1 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
           n=1; Lentisphaera araneosa HTCC2155|Rep:
           N-acetylgalactosamine 6-sulfate sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 472

 Score = 85.8 bits (203), Expect = 2e-15
 Identities = 67/278 (24%), Positives = 122/278 (43%), Gaps = 16/278 (5%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
           +P + + L + +K  GY T  +GKW LG +     P  +GFD   G+   R     H   
Sbjct: 101 IPADSETLGKLMKRAGYATACIGKWGLGGFHNAGNPHKQGFDHFYGYTDQR---KAHNYY 157

Query: 74  EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPY 133
            +  W    +      +     Y+ D+ T +A+K +    K +P FL LA+       P+
Sbjct: 158 PEYLWRNGEKEMLNNKNGEENDYSHDLMTVDALKYI-EEKKDQPFFLYLAYLI-----PH 211

Query: 134 EPIRAPQKLIDAFKYIDDSARQKF-AAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDN 192
              + P   +  +K  D     K  AA+ S++D  +G + + L   G+ +N++++F++DN
Sbjct: 212 VKYQVPD--LAQYKDKDWPKEMKIHAAMTSRMDRDIGTIARRLEELGIADNTLIMFNSDN 269

Query: 193 GGPAAGFNDN-AASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPT 251
           G      ++    ++  LKG+K ++++GGVR     + P       V+       D +PT
Sbjct: 270 GAHGKSNSEKFFNTSGDLKGLKRSMYDGGVRSPMIAYWPGTIQAGSVSDHISAFWDMMPT 329

Query: 252 LYSAAGGDLSVLENLDGVNQWDA-LSKNTESPRTSVLH 288
                G         DG++     L K++E  +   L+
Sbjct: 330 FSELTGEPFK--GETDGISMLPTLLGKDSEQKQHKYLY 365


>UniRef50_Q5DYT4 Cluster: Arylsulfatase; n=10;
           Gammaproteobacteria|Rep: Arylsulfatase - Vibrio fischeri
           (strain ATCC 700601 / ES114)
          Length = 537

 Score = 85.4 bits (202), Expect = 3e-15
 Identities = 83/292 (28%), Positives = 127/292 (43%), Gaps = 42/292 (14%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYK-----------------------KEYLP 49
           G+PL+ K+LP   ++ GY+T  +GKWH    K                       K Y P
Sbjct: 137 GIPLDIKLLPALFQENGYRTATIGKWHNAKIKGKNLVDEDKRTRDYHDNQITVTPKGYGP 196

Query: 50  LNRGFDSHVGFWTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVV 109
             RGFD    F+     ++D   + Q   G +      + H+L         T++A+K +
Sbjct: 197 EERGFDYSYSFYASGAALWDSPAIWQN--GKNISAPGYLTHNL---------TEQALKFI 245

Query: 110 NSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVG 169
           +     +P F+ LA S  H   P E   +P K +D F   +  A + FAA+ +  DES+G
Sbjct: 246 DESG-DKPFFVNLAFSVPHI--PLEEA-SPAKYMDRFNTGNVEADKYFAAI-NAADESLG 300

Query: 170 KVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWS 229
            ++  L  +G L+N+I+ F +DNG   A        N   KG K  ++ GGVR     + 
Sbjct: 301 IIMDNLEKKGELDNTIIFFLSDNG---AVHESPMPMNGMDKGFKGQMYNGGVRVPFVAYW 357

Query: 230 PLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTES 281
           P        +   +   D LPT  +AAG D+     +DG N    L   TE+
Sbjct: 358 PKHIPAGGESDSLISALDILPTALAAAGIDIPEDMQVDGKNIMPVLEGKTET 409


>UniRef50_A6DKN7 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
           Lentisphaera araneosa HTCC2155|Rep:
           N-acetylgalactosamine 6-sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 465

 Score = 85.4 bits (202), Expect = 3e-15
 Identities = 83/284 (29%), Positives = 126/284 (44%), Gaps = 36/284 (12%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           G+  +E ++P  +K  GY+T  +GKWHLGS  +E+ P  RGFD   G+  G    Y   +
Sbjct: 104 GVKTSEIMIPALMKKGGYQTCAIGKWHLGS-SEEFQPNARGFDHWFGY-RGSCGFYQFKS 161

Query: 73  MEQGSW-GTDFRR-------GFEVAHDLFGV----YATDVYTDEAIKVVNSHNKSEPLFL 120
             Q +  G + +          +V  +   V    Y TD ++DEA   +   NK  P F+
Sbjct: 162 QVQSAKKGQELKPLPSGEDPNLDVVRNGESVRLEGYLTDHFSDEAANWIKE-NKERPFFM 220

Query: 121 MLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGL 180
             A   VH+     P   P K I       D        V++ LD SV  ++ AL   G+
Sbjct: 221 YFAPYNVHA-----PDTVPNKYIPKGGTAHDG-------VIAALDASVQTILDALKEAGI 268

Query: 181 LENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVR-GAGFLWSPLLDSKARVA 239
            +N++VVFS DNGG      D + +    KG K T +EGG+R      W   +++ ++  
Sbjct: 269 ADNTLVVFSNDNGGK----KDYSKT---FKGNKATFYEGGIRVPFAMRWPKGIEAGSKY- 320

Query: 240 YQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPR 283
              +   D LPT  + A  DL      DG N    +  + +  R
Sbjct: 321 NGVVSTLDLLPTFAALAKVDLPSDRVYDGQNLLPVIKDSAKDQR 364


>UniRef50_A4GIB1 Cluster: Arylsulfatase; n=1; uncultured marine
           bacterium HF10_49E08|Rep: Arylsulfatase - uncultured
           marine bacterium HF10_49E08
          Length = 608

 Score = 85.0 bits (201), Expect = 4e-15
 Identities = 76/284 (26%), Positives = 124/284 (43%), Gaps = 29/284 (10%)

Query: 21  LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGT 80
           +  Y ++ GY T + GKWHLG+    + P +RGF   V + +  I            WG 
Sbjct: 102 IANYYEEAGYSTGVFGKWHLGA-NYPFRPQDRGFQESVWYPSSSIPSVP------AYWGN 154

Query: 81  DFRRGFEVAHDL---FGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHSGNPYEPI 136
           D+     + +     F  Y  DV+ +EA++ ++   KS+ P    LA +  H   P+ P 
Sbjct: 155 DYFDDVYIHNGKEKRFEGYCADVFFNEAMRFMSESAKSKKPFMCYLATNTPHG--PFWPK 212

Query: 137 RAPQKLI------DAFKYIDDSARQKFAAVLS---KLDESVGKVVKALHTRGLLENSIVV 187
              +K I        F  +D++ +++ A  L     +D ++G ++K L    L E++I++
Sbjct: 213 EEDRKEIAEVLAQSKFDNLDNNLKKRLALYLGMIRNIDWNMGNLLKFLKEENLAEDTILI 272

Query: 188 FSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHIS 246
           F TDNG        NA     ++G K  +WEGG R   F+ W      KAR       + 
Sbjct: 273 FKTDNGSLLGPQYFNAG----MRGKKTEIWEGGHRVPCFIRWPNGGFGKARDIGGLTQVQ 328

Query: 247 DWLPTLYSAAGGDLSVLENLDGVNQWDAL--SKNTESPRTSVLH 288
           D LPT+    G         DG++    L   K     RT +++
Sbjct: 329 DILPTVLDLCGIKPRKNTKFDGISLASVLRGKKKVSEDRTIIIN 372


>UniRef50_A0JAV3 Cluster: Sulfatase precursor; n=1; Shewanella
           woodyi ATCC 51908|Rep: Sulfatase precursor - Shewanella
           woodyi ATCC 51908
          Length = 356

 Score = 85.0 bits (201), Expect = 4e-15
 Identities = 66/201 (32%), Positives = 102/201 (50%), Gaps = 25/201 (12%)

Query: 26  KDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF-WTGRIDMYDHTTMEQGSWGTDFRR 84
           K  GY T ++GKWHLG    +  P   GFD+ +     G    Y +      S G     
Sbjct: 141 KQQGYATAVIGKWHLG----KTAPTEYGFDTAIAASHLGHPPSYFYPY----SKGKRKLI 192

Query: 85  GFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLID 144
           G E    L   Y ++  T EA+  ++S    +P FL L   AVH+     PI AP++ ++
Sbjct: 193 GLEEG-GLKDEYLSNRITREAVNYISSQR--QPFFLYLPFYAVHT-----PIEAPKEWVN 244

Query: 145 AFKYIDDSARQK---FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFND 201
                  +   K   +AA+++ LD  VGK+++AL   G  EN++VVF++DNG       D
Sbjct: 245 QHNARQQAGEIKSAAYAAMIANLDRDVGKLLQALDKSGQRENTLVVFASDNGA-----YD 299

Query: 202 NAASNYPLKGVKNTLWEGGVR 222
            A S+ P +G K++L+EGG++
Sbjct: 300 PATSSLPYRGYKSSLFEGGIK 320


>UniRef50_A6CA27 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
           Planctomyces maris DSM 8797|Rep: N-acetylgalactosamine
           6-sulfatase - Planctomyces maris DSM 8797
          Length = 491

 Score = 84.6 bits (200), Expect = 5e-15
 Identities = 74/259 (28%), Positives = 112/259 (43%), Gaps = 26/259 (10%)

Query: 18  EKILPQYLKDLGYKTHLVGKWHLGSYKKE--YLPLNRGFDSHVGFWTGRIDMYDHTTMEQ 75
           E  + + +K +GY T   GKWHLGS +      P N GFD     W    + Y++     
Sbjct: 111 EVTVAEAVKSVGYTTGHFGKWHLGSVQSNSPVSPGNSGFDE----WVSSPNFYENDPYMS 166

Query: 76  GSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEP 135
            +          V   L G  ++ V  D A+  +   +K +  FL    + +  GNP+ P
Sbjct: 167 HNG---------VVKQLKGE-SSRVTVDAALDFIKQADKDKKPFL----AVIWFGNPHTP 212

Query: 136 IRAPQKLIDAFKYIDDSAR-QKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGG 194
             A  +L D   Y D     Q +   +S +D ++G +   L   GL EN+++ F++DNG 
Sbjct: 213 HEAVSELKDL--YPDQKPNFQNYFGEISGVDRAMGHLRSQLRDLGLAENTLLWFTSDNGP 270

Query: 195 PAAGFNDNAASNYP---LKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPT 251
               F    A +     L G K  LWEGGVR    +  P +  K  V+       D  PT
Sbjct: 271 RPPQFKTEEARSQATGGLAGFKGNLWEGGVRVPSLIEWPAVIKKPEVSNVPCGTIDIYPT 330

Query: 252 LYSAAGGDLSVLENLDGVN 270
           + +  G  +S    LDGV+
Sbjct: 331 VLAMTGAKVSHQPQLDGVS 349


>UniRef50_A6C8R8 Cluster: Arylsulfatase A; n=1; Planctomyces maris
           DSM 8797|Rep: Arylsulfatase A - Planctomyces maris DSM
           8797
          Length = 510

 Score = 84.6 bits (200), Expect = 5e-15
 Identities = 79/274 (28%), Positives = 123/274 (44%), Gaps = 22/274 (8%)

Query: 5   VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGR 64
           V+    P GL  +E  + + LK  GYKT ++GKWHLG  +  +LP  +GFD   G     
Sbjct: 123 VLRPISPYGLNPDEITVAEVLKKQGYKTGMIGKWHLGD-QTPFLPTRQGFDYFYGIPYSD 181

Query: 65  IDMYDHTTMEQGSW--GTDFRRGFEVAHDLF---GV---YATDVYTDEAIKVVNSHNKSE 116
            DM        G    G ++     + +D     GV     T  YT++A++ +   NK++
Sbjct: 182 -DMTQAVGQRLGDRLDGKNWPPLPVMLNDTVIEAGVDRNLLTKDYTEKAVEFIEK-NKNQ 239

Query: 117 PLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALH 176
           P FL    +    G+  +P  +     DAF+    S    +   + +LD S G+++  L 
Sbjct: 240 PFFLYFPQAM--PGSTRKPFAS-----DAFR--GKSKNGPWGDSIEELDWSTGQILDKLV 290

Query: 177 TRGLLENSIVVFSTDNGGP-AAGFND-NAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDS 234
             G+ +N++V++++DNG P A   N     +N PL G   T  EG  R    +W P    
Sbjct: 291 ELGIDKNTLVIWTSDNGSPMAKDMNSTERGTNKPLNGRGYTTSEGAFRVPTIVWWPETVP 350

Query: 235 KARVAYQKMHISDWLPTLYSAAGGDLSVLENLDG 268
              V  +     D LPT    AGG +     +DG
Sbjct: 351 AGTVCEELATTMDLLPTFARLAGGKVPSDRIIDG 384


>UniRef50_A6CEC4 Cluster: Aryl-sulphate sulphohydrolase; n=1;
           Planctomyces maris DSM 8797|Rep: Aryl-sulphate
           sulphohydrolase - Planctomyces maris DSM 8797
          Length = 467

 Score = 84.2 bits (199), Expect = 6e-15
 Identities = 70/247 (28%), Positives = 117/247 (47%), Gaps = 28/247 (11%)

Query: 25  LKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGTDFRR 84
           L   GY+   VGKWHLG       PL++GF  ++          + T   +G + + ++ 
Sbjct: 132 LSQAGYRCASVGKWHLGQS-----PLSQGFQVNIAG--------NQTGSPRGGYFSPYQN 178

Query: 85  GFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLID 144
             +++    G + TD  T  A + +   N+  P FL L H AVH+     P++A ++ I 
Sbjct: 179 P-QLSDGEQGEFLTDRLTTAACQFIKD-NQGSPFFLYLTHYAVHT-----PLQAKKEDIA 231

Query: 145 AFKYIDDSARQK---FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFND 201
            F+        +   +AA++  +D+S+G+V++ L  + L +N+IVVF++DNGG       
Sbjct: 232 YFQSKPAGKLHQHATYAAMIRSMDQSIGRVLQTLREQQLDQNTIVVFTSDNGGYGP---- 287

Query: 202 NAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLS 261
            A S  PL+G K  L+EGG+R    +  P +        + +   D  PT        + 
Sbjct: 288 -ATSMLPLRGSKGMLYEGGIRVPLLIKWPGVTQPGSTTGEAVINVDLYPTFLEMTNIPVL 346

Query: 262 VLENLDG 268
             E LDG
Sbjct: 347 ESELLDG 353


>UniRef50_A4B5Y4 Cluster: Iduronate-sulfatase and sulfatase 1; n=1;
           Alteromonas macleodii 'Deep ecotype'|Rep:
           Iduronate-sulfatase and sulfatase 1 - Alteromonas
           macleodii 'Deep ecotype'
          Length = 588

 Score = 84.2 bits (199), Expect = 6e-15
 Identities = 88/324 (27%), Positives = 135/324 (41%), Gaps = 38/324 (11%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGF-DSHVG-FWTGRIDMYDHT 71
           +P N   +     DLGY T +VGKWHL   +         F D+ +  F  GR+      
Sbjct: 208 IPENVVTMGDRYSDLGYTTGMVGKWHLEIDQNSKPWFKENFPDTPISEFNLGRLPSSLKE 267

Query: 72  TMEQGSWGTDFR-----RGFEVAHDLFGV-----------YATDVYTDEAIKVVNSHNKS 115
                S G  +        +   +DL G            Y  DV +D A + ++  N  
Sbjct: 268 RYYPSSKGYKYNYFGYANRYWANYDLKGNQTQLGWISNSDYRLDVVSDAATQFIDI-NHD 326

Query: 116 EPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKAL 175
           EP +L +AH A     P+ P+ A +  +  F     + R+   A++  +D  VG +V  L
Sbjct: 327 EPFYLHVAHYA-----PHVPLEATEDYLSLFPEQSSNRRRYALAMMYAVDAGVGSIVSKL 381

Query: 176 HTRGLLENSIVVFSTDNGGP-AAGFND---------NAASNYPLKGVKNTLWEGGVRGAG 225
              G+LEN+I+ F +DNG P    F D         N + N PL G K  L +GG++   
Sbjct: 382 EEYGILENTIIAFISDNGAPIGLDFTDAPIAEKEAWNGSLNAPLLGEKGMLTDGGIKVPF 441

Query: 226 FL-WSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRT 284
            + W   L S   V  + +   D L +    AG   +VL  LDGV+ +     +T +   
Sbjct: 442 IVHWPEKLQSNT-VIDEPVISLDVLYSAIKRAGASETVLSELDGVDIFPTQGFDTSALMN 500

Query: 285 SVLHNIDDIWGIAALTVDKYKLIK 308
             L      W  +A+ +  YK +K
Sbjct: 501 RPL--FWRFWNQSAVRLGNYKYLK 522


>UniRef50_Q0BZE9 Cluster: Sulfatase family protein; n=1; Hyphomonas
           neptunium ATCC 15444|Rep: Sulfatase family protein -
           Hyphomonas neptunium (strain ATCC 15444)
          Length = 459

 Score = 83.4 bits (197), Expect = 1e-14
 Identities = 83/313 (26%), Positives = 133/313 (42%), Gaps = 32/313 (10%)

Query: 1   MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
           MQH VI+     GLP  E  + + LK+ GY+T +VGKWHLG +++EY P N+GFD   G 
Sbjct: 105 MQH-VIFPHSQDGLPAEEITISEMLKNAGYRTGMVGKWHLG-HQEEYWPTNQGFDWFYGV 162

Query: 61  WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
                DM             D  RG E+           +   +A K     +  +P FL
Sbjct: 163 PYSN-DMAPF----------DLYRGKEIIESPADQSQLSLNYAKAAKEFIEDSSDKPFFL 211

Query: 121 MLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGL 180
             A +      P+ P+  P+           S    +  V+  +D  +G V+  L   G+
Sbjct: 212 YYAETF-----PHIPLFVPEDRSGT------SDAGLYGDVVETVDAGIGIVLDTLDEAGV 260

Query: 181 LENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAY 240
            ++++++F++DNG     F  +A      +G K    EGG R       P    K  V++
Sbjct: 261 ADDTLIIFTSDNG---PWFEGSAGE---FRGRKGETHEGGFRVPFLARWPGHIPKGSVSH 314

Query: 241 QKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALT 300
           +     D LPT  S +G  L     +DG +    L+    +P   +L   D    + A  
Sbjct: 315 EMAMNIDLLPTAASLSGATLPADRVIDGKDLTSLLTAGAPTPH-DILFFFDGNEIVGARD 373

Query: 301 VDKYKLIKGTIYK 313
             +++L+  T Y+
Sbjct: 374 A-RFRLVLNTFYR 385


>UniRef50_Q4RJR3 Cluster: Chromosome 13 SCAF15035, whole genome
           shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
           Chromosome 13 SCAF15035, whole genome shotgun sequence -
           Tetraodon nigroviridis (Green puffer)
          Length = 474

 Score = 83.0 bits (196), Expect = 1e-14
 Identities = 86/310 (27%), Positives = 133/310 (42%), Gaps = 38/310 (12%)

Query: 4   GVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKE---YLPLNRGFD-SHVG 59
           GV+Y     GLPLNE  + + LK  GY T  VGKWHLG   +    + P  + F    VG
Sbjct: 91  GVLYPGSRGGLPLNETTIAEVLKPRGYATAAVGKWHLGGPCQNLTCFPPDVKCFGLCDVG 150

Query: 60  FWTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLF 119
             T  + M+D    +Q     D  + +         +A D  T  A        + +P F
Sbjct: 151 TVTVPL-MHDEVIKQQPVNFLDLEKAYSD-------FAKDFITTSA-------KRKQPFF 195

Query: 120 LMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRG 179
           L       H      P  A + L           R  F   L + D+++G ++  L   G
Sbjct: 196 LYFPSHHTHYPQYAGPGAAGKSL-----------RGPFGDALLEFDQTIGSLLATLERTG 244

Query: 180 LLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGF-LWSPLLDSKARV 238
           ++ N+++ F++DNG P         +  PL+  K T +EGG+R      W  L+  +  V
Sbjct: 245 VINNTLIFFTSDNG-PELMRMSRGGNAGPLRCGKGTTYEGGMREPAIAYWQGLI--QPGV 301

Query: 239 AYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIW---G 295
            ++     D LPT  S AG  L  +  LDGV+  + L    +S R +++    D     G
Sbjct: 302 THEMASTLDILPTFASLAGAKLPQV-MLDGVDMTNILFSQGKSKREAMMFYPTDPSEKNG 360

Query: 296 IAALTVDKYK 305
           + A+ ++KYK
Sbjct: 361 LFAIRLEKYK 370


>UniRef50_A6DM48 Cluster: Arylsulfatase A; n=1; Lentisphaera
           araneosa HTCC2155|Rep: Arylsulfatase A - Lentisphaera
           araneosa HTCC2155
          Length = 484

 Score = 83.0 bits (196), Expect = 1e-14
 Identities = 66/212 (31%), Positives = 103/212 (48%), Gaps = 27/212 (12%)

Query: 20  ILPQYLKDLGYKTHLVGKWHL-------GSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           ILPQ +K  GY+T +VGKWHL       G   K   P  RGFD+ + +   ++  ++ T 
Sbjct: 118 ILPQIMKQGGYQTGMVGKWHLSEPGHKTGLTGKPLEPHRRGFDTAI-YTFNQLGRFNPTL 176

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
              G             +  +  Y  DV  DE IK + S +K +P F  LA S      P
Sbjct: 177 SHNGK------------NSKYEGYCGDVVFDEGIKWMESCSKEKPYFAYLATSI-----P 219

Query: 133 YEPIRAPQKLIDAFKYIDDSARQK-FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
           + P+ APQ+  D +        +K + A++S +DE++GK++  + +R     +I++F TD
Sbjct: 220 HTPLAAPQRYKDLYSGAKLKNNEKNYYAMISAVDENIGKLMTWMASRKDDRETILIFMTD 279

Query: 192 NGGPAAGFNDNAASNYPLKGVKNTLWEGGVRG 223
           NG   +G  D A  +   +  KN L+  G RG
Sbjct: 280 NGHAISG-PDGAGHSRDGRLKKNGLYNFGFRG 310


>UniRef50_A6DG54 Cluster: Arylsulphatase A; n=1; Lentisphaera
           araneosa HTCC2155|Rep: Arylsulphatase A - Lentisphaera
           araneosa HTCC2155
          Length = 469

 Score = 83.0 bits (196), Expect = 1e-14
 Identities = 84/302 (27%), Positives = 131/302 (43%), Gaps = 32/302 (10%)

Query: 11  PRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSY--KKEYLPLNRGFDSHVGFWTGRIDMY 68
           P  LP +E  + + LK  GY T + GKWHLG+   K    P  +GFD    +W       
Sbjct: 102 PMHLPDSEITIAEALKTKGYATGMFGKWHLGNLDGKSHPTPSEQGFD----YWLA----C 153

Query: 69  DHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVH 128
           D+  ++        R G  V       +A  V  DEA + +    ++ P F  +A S  H
Sbjct: 154 DNNLIKHNPKSL-IRNGKPVGK--IAGWAAQVVADEANEWMK--KQTSPFFAYIAFSETH 208

Query: 129 SGNPYEPIRAPQKLIDAFKYI---DDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSI 185
           S     P+ AP++LI   KYI   ++  R  +  +    D +VG ++K L   G+ +N++
Sbjct: 209 S-----PLDAPEELIT--KYIERGENKKRATYRGMTEYSDAAVGSILKTLDDMGVSDNTL 261

Query: 186 VVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHI 245
           V  ++DN GP +   D+      L+G K+  WEGG+R    +  P            +  
Sbjct: 262 VFLASDN-GPTS--EDSCEG---LRGKKSYTWEGGIRVPAIIRWPGKVKPGSEYNDPVGG 315

Query: 246 SDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALTVDKYK 305
            D LPTL    G +L    ++DGV+    L        T +L         A++ +  Y 
Sbjct: 316 IDLLPTLCDIVGAELP-KRHIDGVSIRSVLEGKPFKRNTPILSFFYRTSPAASMRMGDYV 374

Query: 306 LI 307
           LI
Sbjct: 375 LI 376


>UniRef50_A6DHY0 Cluster: N-acetylgalactosamine 6-sulfatase; n=2;
           Lentisphaera araneosa HTCC2155|Rep:
           N-acetylgalactosamine 6-sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 507

 Score = 82.6 bits (195), Expect = 2e-14
 Identities = 87/314 (27%), Positives = 129/314 (41%), Gaps = 21/314 (6%)

Query: 6   IYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRI 65
           I+GA    LP  E  L   LK  GY T   GKWHLG+  K+Y            F     
Sbjct: 86  IWGANVGHLPKEEITLASVLKQQGYVTGHFGKWHLGTLNKDYSTKGESRKPTENFAPPWE 145

Query: 66  DMYDHTTMEQGSWGT---------DFRRGFEVAHDLFGVY--ATDVYTDEAIKVVNSHNK 114
             YD + + + S  T          +  G  +      +Y  A  V  D+AI  +     
Sbjct: 146 RDYDESFVVESSVSTWDPASEKNPFYINGVPMKGTEESLYGGAARVVVDKAIPFMERAVS 205

Query: 115 SEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKA 174
               FL    + V    P+EPI+A  K ++ +K   ++A   +   L+++DE VG++   
Sbjct: 206 EGNPFL----AVVWFNAPHEPIKAGPKYLEMYKEHGEAAH--YYGCLTEMDEQVGRIRAK 259

Query: 175 LHTRGLLENSIVVFSTDNGGPA-AGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLD 233
           L   G+ +N+++ F +DNG          A +   L+G K +L++GGVR       P   
Sbjct: 260 LREMGVEKNTVLFFCSDNGPEGKKAKGAKAGTTSGLRGRKRSLYDGGVRVPALAEWPGKI 319

Query: 234 SKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDI 293
               V    M   D+LPT+ +     +     LDG N   AL    ES R   +  I   
Sbjct: 320 QAGSVIDAAMSTLDYLPTVIALQNHQMPDERPLDGENIL-ALLTGEESQRKRGIPFIHR- 377

Query: 294 WGIAALTVDKYKLI 307
            G A L    YKL+
Sbjct: 378 -GKAVLNRGDYKLV 390


>UniRef50_A4W906 Cluster: Sulfatase precursor; n=10;
           Enterobacteriaceae|Rep: Sulfatase precursor -
           Enterobacter sp. 638
          Length = 501

 Score = 82.6 bits (195), Expect = 2e-14
 Identities = 79/306 (25%), Positives = 135/306 (44%), Gaps = 37/306 (12%)

Query: 17  NEKILPQYLKDLGYKTHLVGKWHLGS-YKKEYLPL--NRGFD----SHVGFWTGRIDMYD 69
           NEK +  YLKD GY T ++GKWHL +   +   P   + GFD    +  GF T  +D   
Sbjct: 117 NEKTIASYLKDQGYDTAMMGKWHLNAGVDRHDQPQAEDAGFDYTLVNAAGFVTSDLDKAK 176

Query: 70  HTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHS 129
                   +   F R  +    +  + + +  + EAI  +N    ++P F+ +A + VH+
Sbjct: 177 ERPRNGVVYPNGFYRNGKALGTVNQI-SGEFVSQEAINWLNDKKDNKPFFMYVAFTEVHT 235

Query: 130 GNPYEPIRAPQKLIDAFK-YIDDSARQ------------------KFAAVLSKLDESVGK 170
                P+ +P+K ++ +K Y+ +  +Q                  ++ A +S +DE VGK
Sbjct: 236 -----PLASPKKYLEIYKNYMSEYEKQHPDMFYADWVDKPYRGPGEYYANISYMDEQVGK 290

Query: 171 VVKALHTRGLLENSIVVFSTDNGGPAAGFN-----DNAASNYPLKGVKNTLWEGGVRGAG 225
           V+  + + G  +N+I++F++DNG            + A     L+G K+ LWEGG+R   
Sbjct: 291 VLAKIKSMGQEDNTIIIFTSDNGPVTREARKWYELNMAGETDGLRGRKDNLWEGGIRVPA 350

Query: 226 FLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTS 285
            +          V    +   D LPTL      +L     +DG +    L   T + +  
Sbjct: 351 IIKYGQHLHAGTVTDTPVSGLDILPTLAELTHFNLPTDRIIDGESIVPVLEGQTMNRQQP 410

Query: 286 VLHNID 291
           +L  ID
Sbjct: 411 LLFAID 416


>UniRef50_Q7UYW2 Cluster: Arylsulfatase; n=2; Planctomycetaceae|Rep:
           Arylsulfatase - Rhodopirellula baltica
          Length = 484

 Score = 82.2 bits (194), Expect = 3e-14
 Identities = 79/289 (27%), Positives = 132/289 (45%), Gaps = 34/289 (11%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
           LPL E+ + + LKD GY+T   GKWH+ S+ + YL  +    +H     G    ++    
Sbjct: 127 LPLEEQTIAECLKDEGYQTAFFGKWHVSSHHERYLGWS---PTHGPAKQG----FEFAEE 179

Query: 74  EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPY 133
           + G+   D++R         G +A D    + +      +   P F M +   VH+    
Sbjct: 180 DYGAHPYDWKRSPVATIKEPGRFAPDSMV-QRVGAFLRQDHDRPYFAMASSFYVHT---- 234

Query: 134 EPIRAP----QKLIDAFKYIDDSARQ---KFAAVLSKLDESVGKVVKALHTRGLLENSIV 186
            P+R P    ++  DA        R    ++AA L   D  VG+++ +L   G  + +IV
Sbjct: 235 -PVRTPCQWLREKYDARVPATSKKRNNRIEYAAFLETFDHHVGQILNSLEASGRADRTIV 293

Query: 187 VFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHI 245
           + ++DNGG     +    +N PL+G K  L+EGG+R    + W  ++  K  +    +  
Sbjct: 294 ILNSDNGG-----HPEYTANAPLRGSKWNLYEGGIRVPMIVRWPGVVQPKTEIDRPVIGY 348

Query: 246 SDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIW 294
            D LPT+ + AGG+       DG  +  A S   +SP T+  H++  IW
Sbjct: 349 -DLLPTMVALAGGN---PPKCDG--ESFAGSLRGDSPPTNEQHSL--IW 389


>UniRef50_A6DMX9 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
           n=3; Lentisphaera araneosa HTCC2155|Rep:
           N-acetylgalactosamine 6-sulfate sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 467

 Score = 82.2 bits (194), Expect = 3e-14
 Identities = 73/298 (24%), Positives = 136/298 (45%), Gaps = 47/298 (15%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKK--EYLPLNRGFDSHVGFW----TGRID 66
           G+P +E    + LK+ GY+T  VGKW + + +     +P  +GFD + G      +G+ID
Sbjct: 92  GMPASEITFAEMLKETGYQTACVGKWDVSNRQPIIPRMPNAQGFDYYYGTLGGNGSGKID 151

Query: 67  MYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHS 125
           +Y++   E+               D+  +  T +YT++AI  +      E P  L LAH+
Sbjct: 152 LYENNKKER------------TTEDMASL--TRLYTNKAIDFLEKQRDPEKPFILYLAHT 197

Query: 126 AVHSGNPYEPIRAPQKLIDAF-KYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENS 184
             H+            ++DA  K+ + +    + A + +LD   G+++  L+   L +N+
Sbjct: 198 MTHT------------VVDASPKFKEKTGDNLYRAAVEELDYETGRLLNKLNQLNLSKNT 245

Query: 185 IVVFSTDNG--GPAAGFNDNAASNYPLKGV-----------KNTLWEGGVRGAGFLWSPL 231
           +V++++DNG        N  A +++P   +           K ++WEGG      +  P 
Sbjct: 246 LVIYTSDNGPWNQPKYINGGAKNDHPENSIFWGDAGEFRDGKASIWEGGAHVPCVMRWPG 305

Query: 232 LDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHN 289
             +  +     M   D+LPTL +  G  +     +DGVNQ   +   +E+ R + ++N
Sbjct: 306 KIAAGKTNDGLMATIDFLPTLAAVTGAKIPDERVIDGVNQLGFICGKSETARETYIYN 363


>UniRef50_A6DMX6 Cluster: Arylsulphatase A; n=1; Lentisphaera
           araneosa HTCC2155|Rep: Arylsulphatase A - Lentisphaera
           araneosa HTCC2155
          Length = 484

 Score = 82.2 bits (194), Expect = 3e-14
 Identities = 84/298 (28%), Positives = 128/298 (42%), Gaps = 42/298 (14%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
           LP N   L   +K  GYKT   GKWHL     +        D            YD + M
Sbjct: 110 LPENIFTLGDAMKSAGYKTGYFGKWHLNDRTAKGKEARHTPDERG---------YDKSYM 160

Query: 74  EQGSWGTDFRRGFEVAHDLFGVYA-TDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
             G  G  +R  F+ A+ L      + V TD  +  +   NK +P FL ++H  VH    
Sbjct: 161 YNG--GGFYRPVFQPAYKLDKPKRLSQVLTDMGVDFIKE-NKDQPFFLFVSHYDVHV--- 214

Query: 133 YEPIRAPQKLIDAF--KYIDDS--ARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVF 188
              + A + LID +  K  D +      +AA++   D+SVG+++KA+  +GL +N++ +F
Sbjct: 215 --QLDADKDLIDKYLNKKRDPNYPGNAVYAAMIEHTDDSVGQLMKAIDDQGLADNTLFIF 272

Query: 189 STDNGGPAAGFND--------------------NAASNYPLKGVKNTLWEGGVRGAGFLW 228
            +DNGG    ++D                     A SN PL+  K T++EGG+R    + 
Sbjct: 273 YSDNGGVDNRYDDIPLLGGRSVNVYPEGHPLRYVATSNAPLRSGKGTVYEGGIRVPLIVR 332

Query: 229 SPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSV 286
            P   S    +      SD+ P+            + LDGV+   AL+KN+  P   V
Sbjct: 333 WPGKVSPGTRSEAVFSSSDFYPSFLEVTKTQAPKNQVLDGVSMVPALTKNSFDPEREV 390


>UniRef50_P77318 Cluster: Uncharacterized sulfatase ydeN precursor;
           n=32; Gammaproteobacteria|Rep: Uncharacterized sulfatase
           ydeN precursor - Escherichia coli (strain K12)
          Length = 560

 Score = 82.2 bits (194), Expect = 3e-14
 Identities = 82/279 (29%), Positives = 128/279 (45%), Gaps = 38/279 (13%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGF-DSHVGFWTGRIDMYDHT 71
           G+PL E  LP+  ++ GY T  VGKWHL       +P ++   D H  F T     +   
Sbjct: 159 GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTT-----FSAE 213

Query: 72  TMEQGSWGTDFRRGFEVAHDLF---------------GVYATDVYTDEAIKVVN-SHNKS 115
             +  + G D+  GF  A   +                 Y +D  TDEAI VV+ +    
Sbjct: 214 EWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLD 273

Query: 116 EPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKAL 175
           +P  L LA++A H  N   P  AP +    F     +A   +A+V S +D+ V ++++ L
Sbjct: 274 QPFMLYLAYNAPHLPND-NP--APDQYQKQFNTGSQTADNYYASVYS-VDQGVKRILEQL 329

Query: 176 HTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSK 235
              G  +N+I++F++DNG   A  +     N   KG K+  + GG     F+W      K
Sbjct: 330 KKNGQYDNTIILFTSDNG---AVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMW---WKGK 383

Query: 236 ARVA-YQKM-HISDWLPTLYSAAGGDLSVLEN--LDGVN 270
            +   Y K+    D+ PT   AA  D+S+ ++  LDGV+
Sbjct: 384 LQPGNYDKLISAMDFYPTALDAA--DISIPKDLKLDGVS 420


>UniRef50_A6DFN4 Cluster: Arylsulfatase; n=1; Lentisphaera araneosa
           HTCC2155|Rep: Arylsulfatase - Lentisphaera araneosa
           HTCC2155
          Length = 481

 Score = 81.8 bits (193), Expect = 3e-14
 Identities = 87/320 (27%), Positives = 135/320 (42%), Gaps = 30/320 (9%)

Query: 8   GAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRI-- 65
           G EP  +P     L Q  KD GY T   GKW LG       P   GFD+  G+   R+  
Sbjct: 96  GQEP--IPEPGMTLAQIFKDKGYATGAFGKWGLGYPGSSSDPKALGFDTFYGYNCQRVAH 153

Query: 66  -----DMYDH----TTMEQ---GSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHN 113
                 M+ +    T  E+   G W       F+ +      YA D+  DEA+K +   N
Sbjct: 154 SFYPPHMWSNDKNITINEKPVPGHWRKAVGPDFDFSQFYAENYAPDLILDEALKFIKD-N 212

Query: 114 KSEPLFLMLA----HSAVHSGNPY-----EPIRAPQKLIDAFKYIDDSARQKFAAVLSKL 164
           K +P F  L     H A+H  + +     +   +P++   A        R  +AA++S L
Sbjct: 213 KDKPFFAYLPFVEPHLAMHPPHSWVDSYPKEWDSPKESYKAAYLPHLRPRAGYAAMISDL 272

Query: 165 DESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNA--ASNYPLKGVKNTLWEGGVR 222
           DE VG V++ L    L+EN++V+F++DNG       D+    S   L+G+K +++EGG+R
Sbjct: 273 DEHVGSVMQLLKELDLVENTLVIFTSDNGASHCIEVDHEFFNSTKDLRGLKGSVYEGGLR 332

Query: 223 GAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESP 282
                  P    KA+V+       D + T            +  DGV+    L    + P
Sbjct: 333 VPMIAHWPGKIKKAQVSDHVSGFVDVMATFCDLL--QTEAPQTSDGVSFLPTLKGEKQEP 390

Query: 283 RTSVLHNIDDIWGIAALTVD 302
           +  +        G  A+ +D
Sbjct: 391 QPVLAWEFQGYSGQQAIILD 410


>UniRef50_Q7UYA5 Cluster: Arylsulfatase; n=1; Pirellula sp.|Rep:
           Arylsulfatase - Rhodopirellula baltica
          Length = 562

 Score = 81.0 bits (191), Expect = 6e-14
 Identities = 83/299 (27%), Positives = 131/299 (43%), Gaps = 33/299 (11%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
           LP +   + + LKD GY T  +GKWHLG    +  P  R   +  G      D Y  T +
Sbjct: 200 LPESATTVAELLKDAGYNTAHIGKWHLGGLHVDE-PGKR-LTNQPGPRQHGFDFYQ-TQI 256

Query: 74  EQ----GSWGTD---FRRGFEV--------AHD--LFGVYATDVYTDEAIKVVNSHNKSE 116
           EQ    G  G D   FR+G  V        + D   +  + TD   D A++++   +  E
Sbjct: 257 EQQPLRGQMGRDKTLFRKGGTVLLRNDQRISQDDPYYHKHFTDANGDFAVEMIEKLSSEE 316

Query: 117 -PLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKAL 175
            P F+ +     H   PYEP   P     A   I D  + +F +++  +D  VG +++ L
Sbjct: 317 DPFFINMWWLVPHK--PYEPAPEPHWSDTAADDITDD-QHRFRSMVQHMDAKVGAILRKL 373

Query: 176 HTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSK 235
               + +N++V+F++DNG    GF       + LKG K  L +GG+R    +  P     
Sbjct: 374 DELKIADNTLVLFTSDNGAAFEGF------IHDLKGGKTELHDGGIRVPMIVRWPDAIPA 427

Query: 236 ARVAYQKMHISDWLPTLYSAAGGDLSVLENLDG---VNQWDALSKNTESPRTSVLHNID 291
            + +    H +D LPT   AA   L     LDG   ++ W   +  ++  R +V   +D
Sbjct: 428 GQTSQTFSHTNDLLPTFCDAASVQLPSDLPLDGLSLLSHWKGGTPPSQVERGTVFWQLD 486


>UniRef50_Q7UN55 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
           n=1; Pirellula sp.|Rep: N-acetylgalactosamine 6-sulfate
           sulfatase - Rhodopirellula baltica
          Length = 501

 Score = 80.6 bits (190), Expect = 8e-14
 Identities = 77/291 (26%), Positives = 128/291 (43%), Gaps = 24/291 (8%)

Query: 6   IYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF--WTG 63
           + G   R L   +  +   L D GY T  VGKW LG+        N G     GF  WTG
Sbjct: 121 LIGNAARNLTGEQPTVASLLSDAGYATGGVGKWALGNVDVPEEIENPGHPLANGFDAWTG 180

Query: 64  RIDMYD-HTTMEQGSWGTDFRRGFE--------VAHDLFGV----YATDVYTDEAIKVVN 110
            ++  + H    +  W    RR F         +A     V    Y+ DV TD A   + 
Sbjct: 181 YMNQSNAHNYYPRFLWQNYERRFFPGNVISTDPIARGRVAVKRESYSHDVMTDAAFDFIR 240

Query: 111 SHNKSEPLFLMLAHSAVHSGNPYEPIRAP-QKLIDAFKYIDD---SARQKFAAVLSKLDE 166
            H +S+P  L +  +  H+ N    +     ++ D   Y D+   +  + FAA+++++D 
Sbjct: 241 EH-RSDPFLLHVHWTIPHANNEGGRLNGDGMEVPDYGIYADEGWPNPEKGFAAMITRMDR 299

Query: 167 SVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNA--ASNYPLKGVKNTLWEGGVRGA 224
            +G+++  L    L E ++V+F++DNG    G + +    S+ PL+G K ++ EGG+R  
Sbjct: 300 DMGRLMDLLEELKLSEKTLVIFTSDNGPHHEGGHSDLFFNSSGPLQGSKRSMHEGGIRVP 359

Query: 225 GFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDAL 275
                P       ++       D+LPT    AG +     ++DG++   AL
Sbjct: 360 FIAKWPGTIEPGTISDHPSAFWDFLPTACELAGAEPPA--DIDGISYLPAL 408


>UniRef50_A6C176 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
           n=1; Planctomyces maris DSM 8797|Rep:
           N-acetylgalactosamine 6-sulfate sulfatase - Planctomyces
           maris DSM 8797
          Length = 599

 Score = 80.2 bits (189), Expect = 1e-13
 Identities = 60/185 (32%), Positives = 96/185 (51%), Gaps = 16/185 (8%)

Query: 17  NEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQG 76
           NE  L +  K  GY+T L GKWHLG       P ++GF + V    G +           
Sbjct: 109 NEVTLAEVFKSNGYRTGLFGKWHLGD-NYPLRPQDQGFGTVVQHGGGGVGQTPDDWQNDY 167

Query: 77  SWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPI 136
              T  R G     + F  Y TD++ DEA+K + + ++++P F  L+ +A HS  PY  +
Sbjct: 168 FSDTYLRNG---KPEKFQGYCTDIWFDEALKFIEA-DRTKPFFAYLSTNAPHS--PY--L 219

Query: 137 RAPQKLIDAFKYIDDSARQKFAA---VLSKLDESVGKVVKALHTRGLLENSIVVFSTDNG 193
             P+    +  Y D    +K AA   +++ +DE++G++++ L   GL +N+I++F TDN 
Sbjct: 220 VDPEY---SDPYEDKGVPKKMAAFYGMITNIDENMGRLLRYLKESGLEKNTILIFMTDN- 275

Query: 194 GPAAG 198
           G AAG
Sbjct: 276 GTAAG 280


>UniRef50_Q1MJX8 Cluster: Putative arylsulfatase precursor; n=1;
           Rhizobium leguminosarum bv. viciae 3841|Rep: Putative
           arylsulfatase precursor - Rhizobium leguminosarum bv.
           viciae (strain 3841)
          Length = 517

 Score = 79.8 bits (188), Expect = 1e-13
 Identities = 77/268 (28%), Positives = 116/268 (43%), Gaps = 20/268 (7%)

Query: 11  PRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDH 70
           P GL   +  L + LK  GY T   GK HLG    E+L  N GFD    +W     +  +
Sbjct: 108 PIGLQKEDITLAEILKTEGYATAQFGKNHLGDLN-EHLLCNHGFDE---YWGNLYHLNAN 163

Query: 71  TTMEQGSWGTD--FRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLF---LMLAHS 125
             +E     +D  FR+ F+    +      DV  +  + V       E +    L     
Sbjct: 164 EDLEDQDRPSDPQFRKKFDPRGIVSCTAGGDVKDEGPLSVKRMETFDEEVATKSLSYLDQ 223

Query: 126 AVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAV-------LSKLDESVGKVVKALHTR 178
               G P+       +        D+S +   A +       L++ D  VG+++  L   
Sbjct: 224 RAKDGKPFFLWHNSTRQHVFIHLKDESRKLSRAGIDDTYGNGLAEHDAQVGELLDKLDQT 283

Query: 179 GLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARV 238
           GL +N+IVV+++DNG     +     S  P KG K T WEGGVR    +  P      RV
Sbjct: 284 GLAKNTIVVYTSDNGAYQYMWPQGGTS--PFKGDKGTTWEGGVRVPAIIRWPGAPG-GRV 340

Query: 239 AYQKMHISDWLPTLYSAAGGDLSVLENL 266
           + + + ++D+LPTL +AA GD  V+E L
Sbjct: 341 SAEIVDMTDFLPTL-AAAAGDNDVVEKL 367


>UniRef50_A6DQW6 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
           n=2; Lentisphaera araneosa HTCC2155|Rep:
           N-acetylgalactosamine 6-sulfate sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 505

 Score = 79.8 bits (188), Expect = 1e-13
 Identities = 74/289 (25%), Positives = 129/289 (44%), Gaps = 28/289 (9%)

Query: 15  PLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWT--GRIDMYDHTT 72
           P    +L    +D GY T   GK   G  +K       GFD  +GF +     D Y H  
Sbjct: 122 PPKHPMLGSVARDAGYATAGFGKLSAGGTEKPETITGYGFDYWLGFLSHFDCRDYYPHHI 181

Query: 73  MEQGSW------GTDFRRGFEVAHDL----------FGVYATDVYTDEAIKVVNSHNK-S 115
            E G          D   G  +  +            G +  ++Y D+AI+ +  +++  
Sbjct: 182 YENGQQIELPKNRPDLLEGTIIPSNKNTSGGVVPPGVGTFTENLYVDKAIEFIKKNSEIK 241

Query: 116 EPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQK-FAAVLSKLDESVGKVVKA 174
           +P F+ LA +  H G P   +R P  +    +Y + + R+K + A+++  D +VG+++ A
Sbjct: 242 KPFFIYLASTVPHGGMP-GGMRVPD-MAGYDQYEELTLREKVYCALMTHHDRNVGRIIDA 299

Query: 175 LHTRGLLENSIVVFSTDNGGPAAGF--NDNAASNYPLKGVKNTLWEGGVRGAGFLWSP-L 231
           +   G+  N+I+++++DNG   + +   D    N  L+  K  L+EGG+R     W P  
Sbjct: 300 VEDLGIQNNTIIMWTSDNGDEDSYYLRTDTFKGNGDLRMYKRYLYEGGIRVPLIAWWPGT 359

Query: 232 LDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTE 280
           ++S +          D +PTL  A G  L+  E +DG++    L   +E
Sbjct: 360 IESNSTCDLPTTQY-DLMPTLADAGGKALT--EEMDGISIMPTLRGKSE 405


>UniRef50_Q7UYA9 Cluster: N-acetylgalactosamine-6-sulfatase; n=1;
           Pirellula sp.|Rep: N-acetylgalactosamine-6-sulfatase -
           Rhodopirellula baltica
          Length = 474

 Score = 79.4 bits (187), Expect = 2e-13
 Identities = 77/288 (26%), Positives = 128/288 (44%), Gaps = 32/288 (11%)

Query: 6   IYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGF---DSHVGF-- 60
           I  A   G+ + E  + + L+  GY T + GKWH+G  K + +   RGF    SH GF  
Sbjct: 99  ILAAHTGGMRVGEITIAEMLQKRGYATGMFGKWHIGWVKPDEVS-TRGFYSPPSHHGFDE 157

Query: 61  ---WTGRIDMYDHTTMEQG--SWGTD----FRRGFEVAHD----LFGVYATD--VYTDEA 105
               T  +  +D T   Q   SWG      ++ GF   H+       +   D  V  D  
Sbjct: 158 YFATTSAVPTWDPTITPQDWDSWGNGPGEPWKGGFPYVHNGREAKENLSGDDSRVIMDRV 217

Query: 106 IKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLD 165
           I  + + N+++P F  +   A     P+EP+ A ++    +     S R+ +   ++ +D
Sbjct: 218 IPFIEA-NQAKPFFATVWFHA-----PHEPVVAGEEFKKLYPKAG-SKRKNYYGCITAMD 270

Query: 166 ESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGF-NDNAASNYPLKGVKNTLWEGGVRGA 224
           + VG++   L   G+ +N++V F +DNG P+ G      AS  P KG K+T++EGG+   
Sbjct: 271 QQVGRLRAKLRELGIEKNTVVFFCSDNG-PSDGLAKKGVASAGPFKGHKHTMYEGGLLVP 329

Query: 225 GFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDL--SVLENLDGVN 270
                P           +    D+LPT+ S  G  +       +DG++
Sbjct: 330 ACAEWPGTIPAGTSTEVRCSTVDFLPTVASIVGDSMVQKATRPIDGID 377


>UniRef50_A4AVA7 Cluster: Aryl-sulphate sulphohydrolase; n=2;
           Bacteroidetes|Rep: Aryl-sulphate sulphohydrolase -
           Flavobacteriales bacterium HTCC2170
          Length = 487

 Score = 79.4 bits (187), Expect = 2e-13
 Identities = 66/239 (27%), Positives = 107/239 (44%), Gaps = 27/239 (11%)

Query: 20  ILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWG 79
           +LP+ L+   YKT   GKWHL        PL+ GFD ++G          H       +G
Sbjct: 145 VLPEVLQLNNYKTIHAGKWHLSES-----PLDYGFDINIGGGHN-----GHPKSYYPPYG 194

Query: 80  TDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAP 139
               R           Y TD+   + I+V+N     EP FL  A  AVH+  P +P+ + 
Sbjct: 195 NVKLRSPNKE------YLTDLIARQTIEVLNK--TIEPFFLNYAPYAVHT--PIQPVDSI 244

Query: 140 QKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGF 199
               +           K+A ++  LD ++G ++ AL   G  +N++++F++DNGG     
Sbjct: 245 LSKYNRKTAWKGQNNAKYATMVENLDRNIGLLIAALKDNGHYKNTLIIFTSDNGGLY--- 301

Query: 200 NDNAASNYPLKGVKNTLWEGGVRGA-GFLWSPLLDSKARVAYQKMHISDWLPTLYSAAG 257
                   PL+  K + +EGG+R    F+W+  + S  +      H+ D  P++  AAG
Sbjct: 302 --GITKQQPLRAGKGSYYEGGIREPFFFMWNDKIKSNTKSNVPISHL-DLFPSIVEAAG 357


>UniRef50_A6DHI2 Cluster: Aryl-sulphate sulphohydrolase; n=2;
           Lentisphaera araneosa HTCC2155|Rep: Aryl-sulphate
           sulphohydrolase - Lentisphaera araneosa HTCC2155
          Length = 493

 Score = 79.0 bits (186), Expect = 2e-13
 Identities = 66/245 (26%), Positives = 116/245 (47%), Gaps = 22/245 (8%)

Query: 25  LKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGTDFRR 84
           +   GY T  +GK+H+    K+  PL  G+  +VG   GR    +      G + + +  
Sbjct: 125 MNSAGYLTATLGKYHVA---KD--PLTHGWKINVG---GR----EFGGPYNGGYHSPYEY 172

Query: 85  GFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLID 144
              +     G Y  D  TDEAI +   H   +P+F+   +  +H+     P   P+    
Sbjct: 173 P-NLKETEKGRYLCDHLTDEAIGIFKEHGAQQPIFMYFPYYTIHAPIQGHPKFEPKYKAK 231

Query: 145 AFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAA 204
           A          K+AA++  LD +VG++V AL  +GL E ++++F++DNGG     +   +
Sbjct: 232 A--KTKGHFNPKYAAMIEALDHNVGRLVAALEEQGLREKTLIMFTSDNGG-----HMKFS 284

Query: 205 SNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVL 263
              PL+  K + +EGG+R   F  W  ++++ +R       + D+ PT+   AG +L   
Sbjct: 285 RQEPLRAGKGSYYEGGIRVPFFASWPGVIEAGSRSQVPVTGL-DFYPTVCELAGVELPDD 343

Query: 264 ENLDG 268
           + +DG
Sbjct: 344 KVVDG 348


>UniRef50_A4ANR8 Cluster: Arylsulfatase; n=2; Bacteroidetes|Rep:
           Arylsulfatase - Flavobacteriales bacterium HTCC2170
          Length = 589

 Score = 79.0 bits (186), Expect = 2e-13
 Identities = 74/276 (26%), Positives = 123/276 (44%), Gaps = 40/276 (14%)

Query: 17  NEKILPQYLKDLGYKTHLVGKWHLG-SYKKEYLPLNRGFDS---HVGFWTGRIDMYDHTT 72
           NE  + + LK   YKT + GKWHLG +Y     P ++GFD    H+    G++  +    
Sbjct: 110 NEVTIAEMLKQANYKTGVFGKWHLGDNYPSR--PNDQGFDESLIHLSGGMGQVGDFTTYF 167

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
            ++ S+  D         + +  Y +D++ + AI  +   N  +P F  L+ +A     P
Sbjct: 168 QKERSY-FDPVLWHNGERESYEGYCSDIFAENAIDFIEK-NHDQPFFCYLSFNA-----P 220

Query: 133 YEPIRAPQKLIDAFKYIDDSA-------------------RQKFAAVLSKLDESVGKVVK 173
           + P++ P K    +K ID S+                    +K  A++S +D+++GK+++
Sbjct: 221 HTPLQVPDKYYQQYKDIDPSSGFEDDSRPFVEMTKKNKEDARKVYAMVSNIDDNIGKLMR 280

Query: 174 ALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSP-LL 232
            L    + EN++VVF TDNG     +         ++G K +++ GGVR   +L  P   
Sbjct: 281 KLDDLKIAENTLVVFMTDNGPQQVRYVAG------MRGRKGSVYRGGVRVPFYLRYPSKW 334

Query: 233 DSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDG 268
                V     HI D LPTL       L     +DG
Sbjct: 335 QGNQDVETTTAHI-DVLPTLSEICDVKLPENRKIDG 369


>UniRef50_A3ZY29 Cluster: Aryl-sulphate sulphohydrolase; n=1;
           Blastopirellula marina DSM 3645|Rep: Aryl-sulphate
           sulphohydrolase - Blastopirellula marina DSM 3645
          Length = 498

 Score = 79.0 bits (186), Expect = 2e-13
 Identities = 80/279 (28%), Positives = 125/279 (44%), Gaps = 34/279 (12%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           GL      + + L+  GY T   GKWHL    +  LP  +GFD  V F     D +    
Sbjct: 129 GLAKENVTMAEALQAAGYVTGHFGKWHLAG-PEGALPSEQGFD--VTF-----DSFGEGE 180

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
           + +GS G   ++G     D  GV+     T +A + + + N+  P F  LAH A+H    
Sbjct: 181 LREGSEGN--KKG--PPDDPKGVFTL---TRKACEFIEA-NQDRPFFCYLAHHAIHG--- 229

Query: 133 YEPIRAPQKLIDAFKY-----IDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVV 187
             P++   + ++ FK      +D  A   +AA    LD SVG ++  L    L + ++V 
Sbjct: 230 --PLQGRAETLEKFKAKTRRKLDPGAM--YAACTYDLDASVGMLLAKLDELKLADKTLVA 285

Query: 188 FSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISD 247
           F++DNG         AAS  PL+G K   +EGG+R    +  P +   +  +   +   D
Sbjct: 286 FTSDNGA------TQAASQEPLRGSKGGYYEGGIREPLIIRWPGVTQPSSTSDVPVINVD 339

Query: 248 WLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSV 286
           + PT  +AAG  +   + LDG +    LS      RT +
Sbjct: 340 FYPTFLAAAGAPVPAGKILDGESLLPLLSGAGPLKRTGI 378


>UniRef50_A3XSU6 Cluster: Sulfatase family protein; n=2; Vibrio|Rep:
           Sulfatase family protein - Vibrio sp. MED222
          Length = 512

 Score = 79.0 bits (186), Expect = 2e-13
 Identities = 93/333 (27%), Positives = 141/333 (42%), Gaps = 47/333 (14%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           GL   +  L + LKD GY T  VGK HLG     +LP   GFD   GF    +++ +   
Sbjct: 104 GLQKEDPTLAEMLKDKGYATVHVGKSHLGD-NNSHLPTVHGFDEFFGFLY-HLNVMEMPE 161

Query: 73  MEQGSWGTDFR-RGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE--PL----------- 118
             +     +FR R   V H +    + D+  D    VV      +  PL           
Sbjct: 162 QPEFPTDPNFRGRPRNVLHTV-ATESVDMQEDPRFGVVGKQTIEDKGPLGSKRMQTVDGE 220

Query: 119 FLMLA------HSAVHSGNPY----EPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESV 168
           FL  A      H A     PY     P R  QK     +Y   S    +   L +LD+ +
Sbjct: 221 FLEFATNWLDRHEAEKDEQPYFMWYNPTRMHQKTHVRPEYQGASQINTYYDGLIELDDQI 280

Query: 169 GKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLW 228
           G ++  L   G ++N+I++F++DNG     + D  ++++  +G K T W+GG R    + 
Sbjct: 281 GVLLDKLEDLGEIDNTIILFTSDNGVNLDHWPDGGSASF--RGQKGTTWDGGFRVPMLVS 338

Query: 229 SPLLDSKARVAYQKMHISDWLPTLYSAAG-GDL--SVLE-----------NLDGVNQWDA 274
            P    +       M   DW+PT+ +A G GD+   +L+           +LDG NQ D 
Sbjct: 339 WPDKIPQGEYTDGFMTSEDWVPTIMAAVGEGDIKQELLDGKELNGERYQVHLDGYNQLDM 398

Query: 275 LSKNTESPRTS-VLHNIDDIWGIAALTVDKYKL 306
           L+K   S R     +N  D   + A  VD +K+
Sbjct: 399 LTKGEPSQRHEFFFYNEQD---LNAFRVDDWKV 428


>UniRef50_Q8A171 Cluster: Putative secreted sulfatase ydeN; n=10;
           Bacteroidetes|Rep: Putative secreted sulfatase ydeN -
           Bacteroides thetaiotaomicron
          Length = 518

 Score = 78.6 bits (185), Expect = 3e-13
 Identities = 75/288 (26%), Positives = 131/288 (45%), Gaps = 21/288 (7%)

Query: 23  QYLKDLGYKTHLVGKWHLGSYKKEYL-PLNRGFDSHV-GFWTGRIDMY-DHTTMEQGSWG 79
           Q LKD GY T   GK H G+       P + GF+ ++ G   G +  Y           G
Sbjct: 151 QLLKDSGYHTIHCGKAHFGAIDTPGEDPHHWGFEVNIAGHAAGGLASYLGEENYGHNKDG 210

Query: 80  TDFR-RGFEVAHDLFGV--YATDVYTDEAIKVVNSHNK-SEPLFLMLAHSAVHSGNPYEP 135
                         +G   + T+  T EAIK +N   K ++P +L ++  A+H      P
Sbjct: 211 KPISLMAVPGLEKYWGTETFVTEALTLEAIKALNKAKKYNQPFYLYMSQYAIHV-----P 265

Query: 136 IRAPQKLIDAFKYIDDSARQK-FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGG 194
           +   ++  D +K    +  +  +A ++  +D+S+G ++  L   G  +N+I++F +DNGG
Sbjct: 266 LDKDKRFYDKYKKKGMTDHEAAYATLIEGMDKSLGDLMDWLEKSGEADNTIIIFMSDNGG 325

Query: 195 PAAG--FNDNA--ASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLP 250
            AA   + D      N+PL   K + +EGG+R    +  P + +        + I D+ P
Sbjct: 326 LAAESYWRDGKLHTQNHPLNSGKGSTYEGGIREPMIVSWPGVVAPGSKCNDYLLIEDFYP 385

Query: 251 TLYSAAG-GDLSVLENLDGVNQWDALSKNTESPR--TSVLHNIDDIWG 295
           T+   AG      ++ +DG++ +  L K T +P    S+  N+ + WG
Sbjct: 386 TILEMAGIKKYKTVQPIDGIS-FMPLLKQTRNPSKGRSLFWNMPNNWG 432


>UniRef50_Q15XP0 Cluster: Sulfatase precursor; n=1;
           Pseudoalteromonas atlantica T6c|Rep: Sulfatase precursor
           - Pseudoalteromonas atlantica (strain T6c / BAA-1087)
          Length = 627

 Score = 78.6 bits (185), Expect = 3e-13
 Identities = 68/230 (29%), Positives = 108/230 (46%), Gaps = 33/230 (14%)

Query: 21  LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDS---HVGFWTGRIDMYDHTTMEQGS 77
           L + L++ GY+T + GKWHLG     Y P ++GFD    H G   G+   Y   T    +
Sbjct: 126 LAESLQENGYRTGIFGKWHLGD-NYPYRPQDQGFDDVLIHGGGGVGQTPDYWGNTQFNDT 184

Query: 78  WGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIR 137
           +   +R G     + F  YAT ++ DEA K ++  + + P F  +A +A     P+ P R
Sbjct: 185 Y---YRNG---TPEKFSGYATKIWFDEAKKFIDKQHDT-PYFAYIALNA-----PHGPYR 232

Query: 138 APQKLIDAFKYID-DSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGG-- 194
           AP+  I+ ++    +     F  ++S +DE VG++   L  +  L+N+I +F TDNG   
Sbjct: 233 APETHIEPYEKRGLNRDMASFYGMISYIDEQVGELRAHLRAQDQLDNTIFIFMTDNGSSY 292

Query: 195 --------------PAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSP 230
                         P A    N   N  ++G K  ++EGG R   F+  P
Sbjct: 293 KPTDAKTHLTKRHLPLAEQYPNWQPNDNMRGYKGEVYEGGHRVPFFISYP 342


>UniRef50_A6DMW2 Cluster: Putative exported uslfatase; n=1;
           Lentisphaera araneosa HTCC2155|Rep: Putative exported
           uslfatase - Lentisphaera araneosa HTCC2155
          Length = 479

 Score = 78.6 bits (185), Expect = 3e-13
 Identities = 74/278 (26%), Positives = 119/278 (42%), Gaps = 38/278 (13%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
           L L      + L+   Y+T + GKWHLG+ ++              F+TG+         
Sbjct: 118 LSLKLPTFARVLQKNDYRTAMFGKWHLGNEER--------------FFTGK--------- 154

Query: 74  EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPY 133
           E  ++G D   G       +     ++ T+  ++ +   NK +P  L L H       P+
Sbjct: 155 EHKAYGFDEAFGVSGKAKAYDKGVNEL-TERTLRFLKE-NKKKPFMLCLMHHV-----PH 207

Query: 134 EPIRAP---QKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFST 190
            P+  P   + L D+          K+A ++S  D S+ KV+ AL   GL +N++V+ ++
Sbjct: 208 VPVACPPYAKALYDSVPKGKHQKNSKYAGMISHFDNSIKKVLDALRALGLDDNTVVIVTS 267

Query: 191 DNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLP 250
           DNGG +     N +SN P  G K +L+EGG R    +  P   +   V    +  +D+ P
Sbjct: 268 DNGGLS-----NLSSNKPYNGGKGSLYEGGTRVPLLIRWPGKITPGSVNKSVVISNDFFP 322

Query: 251 TLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLH 288
           T    AG  L    +LDG +    L   T   RT   H
Sbjct: 323 TFLELAGLPLMPEAHLDGKSMMPLLKGKTLGKRTLYWH 360


>UniRef50_P25549 Cluster: Arylsulfatase precursor; n=12;
           Proteobacteria|Rep: Arylsulfatase precursor -
           Escherichia coli (strain K12)
          Length = 551

 Score = 78.6 bits (185), Expect = 3e-13
 Identities = 100/345 (28%), Positives = 153/345 (44%), Gaps = 62/345 (17%)

Query: 1   MQHGVI----YGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDS 56
           + HG++    YG +P GL      LPQ L D GY T  +GKWH+G   KE  P N GFD 
Sbjct: 150 IHHGILMPPMYG-QPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGE-NKESQPQNVGFDD 206

Query: 57  HVGFWTGRIDMYD-----HTTME------------QGSWGTD----FRRGFEVA-HDLFG 94
             GF     DMY      H   E            Q  +  D     R G + A  D+  
Sbjct: 207 FRGF-NSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITP 265

Query: 95  VYATDV---YTDEAIKVVNSHNKSE-PLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYID 150
            Y  D+   + D  +K ++   KS+ P FL       H  N       P       KY  
Sbjct: 266 KYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN------YPNA-----KYAG 314

Query: 151 DS-ARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPL 209
            S AR  +   + ++++    + K L   G L+N+++VF++DN GP A    +  +  P 
Sbjct: 315 SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDN-GPEAEVPPHGRT--PF 371

Query: 210 KGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENL-- 266
           +G K + WEGGVR   F+ W  ++  + R +   + ++D  PT    AG   + + NL  
Sbjct: 372 RGAKGSTWEGGVRVPTFVYWKGMI--QPRKSDGIVDLADLFPTALDLAGHPGAKVANLVP 429

Query: 267 -----DGVNQWD-ALSKNTESPRTSVLHNIDDIWGIAALTVDKYK 305
                DGV+Q    L  N +S R +  + ++    +AA+ +D++K
Sbjct: 430 KTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG--KLAAVRMDEFK 472


>UniRef50_Q7UYD2 Cluster: Sulfatase 1; n=2; Bacteria|Rep: Sulfatase
           1 - Rhodopirellula baltica
          Length = 478

 Score = 78.2 bits (184), Expect = 4e-13
 Identities = 82/313 (26%), Positives = 136/313 (43%), Gaps = 32/313 (10%)

Query: 25  LKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGTDFRR 84
           LK  GY T L GK+ +GS      PL  GFD+  G ++    +  H       W    + 
Sbjct: 144 LKHAGYDTALFGKYSIGSQMGVTDPLAMGFDTWYGMYS---ILEGHRQYPTILWRDGKKL 200

Query: 85  GFEVAH-DLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLI 143
             E       G YA  ++T EAI+ +   + + P F++LA+S+ H+     P    ++  
Sbjct: 201 RIEENEAGRKGAYAQALFTHEAIQYIKQDHDN-PFFVLLAYSSPHAELAAPP-EFVERYK 258

Query: 144 DAF---KY---IDDSARQKFA-----------AVLS----KLDESVGKVVKALHTRGLLE 182
           DAF   +Y    + +   K+A           AVL+     LD  VG++ ++L ++G+ +
Sbjct: 259 DAFPETRYGGMSNGTPSDKYAWYYPEPVERPHAVLAGMVTALDAYVGQIYQSLESKGIAD 318

Query: 183 NSIVVFSTDNGGPAAGFNDNA--ASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAY 240
           N++++F++DNG    G  D     ++ P KG+K  L++GG+        P      RV  
Sbjct: 319 NTLILFTSDNGPHDEGGGDPTFFRASEPYKGMKRDLYDGGIHVPMIAHWPAAIRSPRVDD 378

Query: 241 QKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALT 300
                +D LPT    AG  L ++  +   N    L +  + PR   L N    W      
Sbjct: 379 TPWAFADVLPTFADIAGVSLDIVPRVK-TNGVSVLPRLRDDPRP--LPNRTLYWEFGKQA 435

Query: 301 VDKYKLIKGTIYK 313
            D    + G +Y+
Sbjct: 436 GDPNSGVVGEVYQ 448


>UniRef50_A6DNI1 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
           Lentisphaera araneosa HTCC2155|Rep:
           N-acetylgalactosamine 6-sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 273

 Score = 78.2 bits (184), Expect = 4e-13
 Identities = 58/194 (29%), Positives = 102/194 (52%), Gaps = 15/194 (7%)

Query: 96  YATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQ 155
           Y+TD +  EAI+ +   NK +P FL +++       P+ P+ A +  +  F++I D  R+
Sbjct: 13  YSTDAFGREAIEFIE-RNKKKPFFLFVSYIT-----PHVPMEAKESDLKRFEHIKDPLRR 66

Query: 156 KFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNT 215
              A+++ +D++VG+++K L    L +++++ F +DNG    G+  NA+   P  G K+ 
Sbjct: 67  TSLAMIACMDDNVGRMLKVLKDNKLEKDTLIFFISDNG----GYPGNASLCTPYSGSKSQ 122

Query: 216 LWEGGVRGAGFL-WSPLLDSKARVAYQKMHIS-DWLPTLYSAAGGDLSVLENLDGVNQWD 273
           + EGG+     + W   +  + +V Y K  IS D  PT   AAG  +     LDGV+   
Sbjct: 123 MLEGGIHVPFIMQWKGTI-PRGKV-YGKPIISLDIKPTALVAAGATIKDQWQLDGVDLIP 180

Query: 274 ALS-KNTESPRTSV 286
            L+ + T  P  S+
Sbjct: 181 YLNGQKTSDPHESL 194


>UniRef50_A6DMU3 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
           Lentisphaera araneosa HTCC2155|Rep:
           N-acetylgalactosamine 6-sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 448

 Score = 77.8 bits (183), Expect = 6e-13
 Identities = 57/209 (27%), Positives = 101/209 (48%), Gaps = 15/209 (7%)

Query: 68  YDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKS-EPLFLMLAHSA 126
           YD +  E G+        F+  H +     T+  TD  I  +     S +P +  +++ A
Sbjct: 141 YDLSDGETGNVTGGMEDKFQPYHIMDDPKRTNSVTDRTIAFIKEQKSSGKPFYAQVSYYA 200

Query: 127 VHSGNPYEPIRAPQKLIDAFKYIDDSARQK---FAAVLSKLDESVGKVVKALHTRGLLEN 183
            H       +   +K +  F+   +  R+    FA +L + D ++G+++ AL    + +N
Sbjct: 201 THLS-----VELEEKSLKKFQGKGEPDRRYTAGFAGMLQETDRAIGRILDALDELEIADN 255

Query: 184 SIVVFSTDNGG----PAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVA 239
           + V+FS+DNGG    P A   +    NYPL G K+TL EGG+R   ++  P +   +  +
Sbjct: 256 TYVIFSSDNGGRGEIPGAA-TEGLDPNYPLTGYKHTLNEGGIRVPFYVRGPGVKPNS-WS 313

Query: 240 YQKMHISDWLPTLYSAAGGDLSVLENLDG 268
           ++ +   D LP+ Y  AGG  ++ E +DG
Sbjct: 314 HEIVSSYDLLPSFYELAGGTEALPETVDG 342


>UniRef50_UPI0000586CBD Cluster: PREDICTED: similar to MGC86251
           protein; n=4; Strongylocentrotus purpuratus|Rep:
           PREDICTED: similar to MGC86251 protein -
           Strongylocentrotus purpuratus
          Length = 525

 Score = 77.4 bits (182), Expect = 7e-13
 Identities = 81/314 (25%), Positives = 130/314 (41%), Gaps = 28/314 (8%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLG-SYKKEYLPLNRGFDSHVGFWTG------RI 65
           GLPLNE ++ + LK  GY++  VGKWHLG      YLP N GFD  +G           +
Sbjct: 103 GLPLNETLISKMLKPEGYRSAAVGKWHLGLGNNSVYLPHNHGFDEFLGLPASPSQCRCSV 162

Query: 66  DMYDHTTMEQGSWGTDFR-----RGFEVAHDLFGVYA-TDVYTDEAIKVVNSH-NKSEPL 118
             Y + T  +     ++       G  +      +    D Y  ++ + + ++     P 
Sbjct: 163 CFYPNVTCHRAPCSPEYSPCALFNGTTIIEQPADLLTLDDKYAMQSRRFIRTNVETGTPF 222

Query: 119 FLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTR 178
           FL  A     S + + P  A ++          S R +F   L+ LD  VG++ + L   
Sbjct: 223 FLYYA-----SHHTHHPQYAGKETSGT------SIRGRFGDSLAALDWEVGQIYEELKEN 271

Query: 179 GLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARV 238
           G+LE++   FS+DN GP+    +   +   +K  K T +EGG+R    +  P   +  R 
Sbjct: 272 GILEDTFFFFSSDN-GPSLSLENFGGNAGLMKCGKATTYEGGIRVPAIVHWPGQITPGR- 329

Query: 239 AYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAA 298
           + +     D LPT+ S     L  +  LDG +    L +   S R S  +    +     
Sbjct: 330 SMELSSTLDVLPTIASITNAKLPNV-TLDGYDMSPFLFQGMPSLRESFFYYPSKVDTEHK 388

Query: 299 LTVDKYKLIKGTIY 312
               +YK  K   Y
Sbjct: 389 SYAVRYKQYKAVFY 402


>UniRef50_Q8D7K3 Cluster: Arylsulfatase A; n=16; Bacteria|Rep:
           Arylsulfatase A - Vibrio vulnificus
          Length = 521

 Score = 77.4 bits (182), Expect = 7e-13
 Identities = 72/254 (28%), Positives = 108/254 (42%), Gaps = 15/254 (5%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           G+P     +   LK+ GY T   GK HLG  + ++LP N GFD   G     ++  +   
Sbjct: 105 GIPDWAPTIADLLKEQGYMTAQFGKNHLGD-QDQHLPTNHGFDEFFGNLY-HLNAEEEPE 162

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEA--IKVVNSHNKSEPLFLMLA--HSAVH 128
                   +FR+ +     +   YA     D     +    H   E L   LA    AV 
Sbjct: 163 TYYYPKDPEFRKNYG-PRGVIKSYADGKIEDTGPMTRKRMEHADEEFLESSLAFMEKAVK 221

Query: 129 SGNPY----EPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENS 184
           +  P+       R         KY   S    +A  + + D+ VG ++  L   G+ +N+
Sbjct: 222 ADKPFFIWHNTTRMHVWTRLQEKYQGKSGVSIYADGMLEHDDQVGILLDKLDELGVADNT 281

Query: 185 IVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKM 243
           IV++STDNG     + D  A+  P  G K T WEGG+R    + W  ++    ++     
Sbjct: 282 IVIYSTDNGAETVTWPDGGAT--PFYGEKGTTWEGGMRVPQLVRWPGVIKPGTKINDMMA 339

Query: 244 HISDWLPTLYSAAG 257
           H  DWLPTL +AAG
Sbjct: 340 H-QDWLPTLMAAAG 352


>UniRef50_A0IXQ1 Cluster: Sulfatase precursor; n=1; Shewanella
           woodyi ATCC 51908|Rep: Sulfatase precursor - Shewanella
           woodyi ATCC 51908
          Length = 358

 Score = 77.4 bits (182), Expect = 7e-13
 Identities = 65/229 (28%), Positives = 111/229 (48%), Gaps = 32/229 (13%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
           L L E  L +  K  GY+T   GKWH+G   + YLP ++GFD ++G          H   
Sbjct: 125 LALTELTLAEAFKSQGYETFFAGKWHMGG--EGYLPTDQGFDINIGGM--------HRGS 174

Query: 74  EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPY 133
             G +   ++    + +   G + T   TDE I  + S    +P F +L++  VH+    
Sbjct: 175 PPGGYYDPYKNP-NLPNRNKGEHLTKRLTDETIDFL-SQKHEKPFFALLSYYGVHTPLQA 232

Query: 134 EPIR-----------APQK--LIDAFKYIDDSARQ---KFAAVLSKLDESVGKVVKALHT 177
            P +           A +K  LID          Q    +A+++  +D+SVG+++++L  
Sbjct: 233 GPDKLAYFKEKTNTVAGEKAFLIDKGHQSRTQINQVDANYASMIWAVDKSVGRILESLEK 292

Query: 178 RGLLENSIVVFSTDNGGPAAGFNDN----AASNYPLKGVKNTLWEGGVR 222
           +GL +N++VV ++DNGG +     +    + +N PL+  K  ++EGGVR
Sbjct: 293 QGLDKNTLVVLTSDNGGFSTRHQGDERVTSTANLPLRSGKGWVYEGGVR 341


>UniRef50_Q7UYA6 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
           n=1; Pirellula sp.|Rep: N-acetylgalactosamine 6-sulfate
           sulfatase - Rhodopirellula baltica
          Length = 490

 Score = 77.0 bits (181), Expect = 1e-12
 Identities = 73/267 (27%), Positives = 121/267 (45%), Gaps = 39/267 (14%)

Query: 17  NEKILPQYLKDLGYKTHLVGKWHLGSYKK-----EYLPLNRGFDSHVGFWTGRIDMYDHT 71
           +E  + + LK  GY +   GKW L  + +     + LP  +GFD   G  T    + +  
Sbjct: 102 DEVTIAEVLKTKGYASACFGKWDLAKHAQSGFFSDLLPTGQGFDYFYGTPTSNDRVANLY 161

Query: 72  TMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGN 131
             E+           E   D+  +  T  YTDEAI  +   N+++P F+ + H+  H+  
Sbjct: 162 RNEEL---------IEPESDMATL--TRRYTDEAISFIEK-NQNQPFFVYIPHTMPHTR- 208

Query: 132 PYEPIRAPQKLIDAFK-YIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFST 190
                      +DA K +   S R  +  V+ ++D +VG+++ +L+   L +N+ V+F++
Sbjct: 209 -----------LDASKDFKGKSKRGLYGDVIEEIDFNVGRILDSLNELNLADNTYVLFTS 257

Query: 191 DNG-------GPAAG--FNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQ 241
           DNG       G A G    D+  S  PL+  K + +EGGVR    LW+P       V   
Sbjct: 258 DNGPWLVKNKGHADGHRLGDHGGSAGPLRSGKVSTFEGGVRVPAILWAPGKVPAGTVCDS 317

Query: 242 KMHISDWLPTLYSAAGGDLSVLENLDG 268
                D +PTL + AG ++     +DG
Sbjct: 318 IATTMDVMPTLAALAGAEIPTDRVIDG 344


>UniRef50_A6DU75 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
           n=1; Lentisphaera araneosa HTCC2155|Rep:
           N-acetylgalactosamine 6-sulfate sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 537

 Score = 77.0 bits (181), Expect = 1e-12
 Identities = 59/181 (32%), Positives = 90/181 (49%), Gaps = 19/181 (10%)

Query: 18  EKILPQYLKDLGYKTHLVGKWHLG-SYKKEYLPLNRGFDS---HVGFWTGRI-DMYDHTT 72
           EK L  + KD GYKT + GKWHLG SY   Y P  RGF+    H G   G++ D + +T 
Sbjct: 98  EKTLANFFKDAGYKTAIFGKWHLGMSY--PYAPRFRGFEESFIHGGGGIGQLEDAHGNTH 155

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
           ++   W      G  V       Y++D+  D+AI  +   NK +P F  ++  A H+  P
Sbjct: 156 IDAHYW----HNGKLVPSK---GYSSDILFDKAIDFIEK-NKDKPFFCFVSTPATHA--P 205

Query: 133 YEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDN 192
           Y+      K I A      +      +++  +D+ VGK++K L    L +N+IV+ +TD 
Sbjct: 206 YQEHPEAAKRIRARGI--TTGNIALYSMIENIDDCVGKILKKLDDLKLKDNTIVIIATDQ 263

Query: 193 G 193
           G
Sbjct: 264 G 264


>UniRef50_A6DI17 Cluster: N-acetylgalactosamine-4-sulfatase; n=1;
           Lentisphaera araneosa HTCC2155|Rep:
           N-acetylgalactosamine-4-sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 590

 Score = 77.0 bits (181), Expect = 1e-12
 Identities = 58/214 (27%), Positives = 98/214 (45%), Gaps = 16/214 (7%)

Query: 21  LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGT 80
           + +  +D GY T   GKWHLG       P+++GFD  V    G +              T
Sbjct: 102 IAEAFRDQGYATGHFGKWHLGD-NYPMRPMDQGFDEVVALGCGAVGQIGDYWANDYFDDT 160

Query: 81  DFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQ 140
               G    +  +  Y TDV+ +E ++ +    K +P F+ LA +  H      P+    
Sbjct: 161 YIHNG---EYKKYEGYCTDVFFNETMRFIKE-TKDKPFFIYLAPNVTHL-----PLIVAD 211

Query: 141 KLIDAFKYIDDSARQKFAA---VLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAA 197
           K     ++ID+    K A    ++  LDE+ G+++  L   G LEN+I++++TD+G   A
Sbjct: 212 KYSQ--RHIDNGINPKLATFYGMVDNLDENFGRLMDCLKEEGELENTILLYTTDDGMQGA 269

Query: 198 GFNDNAASNYP-LKGVKNTLWEGGVRGAGFLWSP 230
             N    + +  ++G K +  EGG R + F+  P
Sbjct: 270 AGNSTPTTWFKGMRGKKGSKEEGGHRVSCFMSWP 303


>UniRef50_A3ZLD4 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
           Blastopirellula marina DSM 3645|Rep:
           N-acetylgalactosamine 6-sulfatase - Blastopirellula
           marina DSM 3645
          Length = 587

 Score = 77.0 bits (181), Expect = 1e-12
 Identities = 80/277 (28%), Positives = 119/277 (42%), Gaps = 38/277 (13%)

Query: 4   GVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTG 63
           GV  G E   L ++E+   +  +D GY T   GKWH G+ +  Y P  RGFD   GF +G
Sbjct: 93  GVSTGQER--LNVDEQTFVEAFRDAGYATAAFGKWHNGT-QFPYHPNARGFDEFCGFCSG 149

Query: 64  RIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLA 123
               Y    +E  +      RG          +  D  T+ AI+ +  H + EP    + 
Sbjct: 150 HWGNYFDPLLEHNN---QLIRGEG--------FIVDDLTNRAIQFIERH-QDEPFLCYVP 197

Query: 124 HSAVHSGNPYEPIRAPQKLIDAFKYID------DSARQKFA------AVLSKLDESVGKV 171
            +  HS     P++ P K  D F  +D      D  ++  A      A+   +D +VG+V
Sbjct: 198 FNTPHS-----PMQVPDKFYDKFADVDFEMKNRDPQKEDLAMTRAALAMCENIDWNVGRV 252

Query: 172 VKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPL 231
           ++ L    L +++IVV+ +DNG  +  +N        +KG K +  EGGVR   F+  P 
Sbjct: 253 LQKLDDLKLTDDTIVVYFSDNGPNSWRWNGG------MKGRKGSTDEGGVRSPLFIRWPK 306

Query: 232 LDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDG 268
             S      Q     D  PTL   AG      + LDG
Sbjct: 307 HISAGLKIEQVAGAIDLGPTLADLAGVKFQPQKRLDG 343


>UniRef50_Q7UIU1 Cluster: Arylsulfatase A; n=1; Pirellula sp.|Rep:
           Arylsulfatase A - Rhodopirellula baltica
          Length = 529

 Score = 76.6 bits (180), Expect = 1e-12
 Identities = 86/357 (24%), Positives = 147/357 (41%), Gaps = 30/357 (8%)

Query: 4   GVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLG----------SYKKEYL--PLN 51
           GV+ G     +P +   L   L+  GY T ++GKWHLG           + K  L  P N
Sbjct: 120 GVLSGTSEHLIPGDRATLGHLLQGAGYHTAMIGKWHLGWDWHKNGKEIDFSKPVLNGPDN 179

Query: 52  RGFDSHVGFWTGRIDMYDHTTMEQGSWGT--DFRRGFEVAHDLFGVYATDVYTDE-AIKV 108
            GFD + G   G +DM  +  ++ G+  +    + G     + +G Y      D+  I+ 
Sbjct: 180 NGFDQYYGH-CGSLDMPPYVWVDTGTPTSVPTRKEGVTKKQNPYGWYRNGPIGDDFEIEQ 238

Query: 109 VNSHNKSEPLFLMLAHSAVHSGNP---YEPIRAPQK-LIDAFKYIDDSARQKFAAVLSKL 164
           V  H   + +  +     V    P   Y P+ AP   ++    + D S    +A  + ++
Sbjct: 239 VLPHLFDKSIAYV--EERVKEDKPFFLYLPLPAPHTPIVPVPPFKDASGMNPYADFVMQM 296

Query: 165 DESVGKVVKALHTRGLLENSIVVFSTDNG-GPAAGFNDNAASNY----PLKGVKNTLWEG 219
           D  +G+++ A+   G+ EN++V+F++DNG  P A F + A   +      +G K  ++EG
Sbjct: 297 DHHMGQLLDAISKAGIDENTLVIFTSDNGCSPEANFGELAKHGHDPSGKYRGHKADIYEG 356

Query: 220 GVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNT 279
           G R    +  P      +       ++D   TL S            DG +  D    + 
Sbjct: 357 GHRVPFIVRWPGKVVAGKTTNALTCLTDVYATLQSITDQPREATGGEDGFDLTDVFGGDD 416

Query: 280 ESPRTSVLHNIDDIWGIAALTVDKYKLIKGTIYKGVWDNWYGPSGREGAYNASLLYD 336
            S R +++ +   I G  A+  D +KL   +   G W N   P  +        L+D
Sbjct: 417 SSDREALVSH--SIGGSFAIRRDSWKLCL-SHGSGGWSNPREPKAKLQGLPPMQLFD 470


>UniRef50_A6BYR0 Cluster: N-acetyl-galactosamine-6-sulfatase; n=1;
           Planctomyces maris DSM 8797|Rep:
           N-acetyl-galactosamine-6-sulfatase - Planctomyces maris
           DSM 8797
          Length = 658

 Score = 76.6 bits (180), Expect = 1e-12
 Identities = 66/235 (28%), Positives = 109/235 (46%), Gaps = 21/235 (8%)

Query: 17  NEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDS--HVGFWTGRIDMYDHTTME 74
           N+  L + L+D GY+T   GKWHLG     + P  +GF++  H     G    +    + 
Sbjct: 130 NQYTLAEALRDAGYRTGHFGKWHLG-LTTPHRPDKQGFETVWHCAPDPGPPSYFSPYGVT 188

Query: 75  QGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYE 134
                T   R   +     G + TD  T EAI+ + +H +SEP FL L H +VH   P++
Sbjct: 189 PTGKPTAQHRVGNITDGPDGEHITDRLTSEAIQFMEAH-RSEPFFLNLWHYSVHG--PWQ 245

Query: 135 PIRAPQKLIDAFKYIDDSARQK---FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
                +   +  K  D    Q+    A++L  +DES+G++++ L    L +N++ +F +D
Sbjct: 246 --HKAEYTAEFAKKQDPRKEQRNPVMASMLRNVDESLGRILQKLDELKLADNTLFIFYSD 303

Query: 192 NGGPAAGFNDN------AASNYPLKGVKNTL--WEGGVRGAGFLWSPLLDSKARV 238
           NGG A  ++ +          +PL    N+   W GG        +PL + K R+
Sbjct: 304 NGGNAHSWSSDDPKLKKITDKHPLYKTINSYRKWAGGEPPTNN--APLREGKGRI 356


>UniRef50_P50473 Cluster: Arylsulfatase precursor; n=7;
           Echinoida|Rep: Arylsulfatase precursor -
           Strongylocentrotus purpuratus (Purple sea urchin)
          Length = 567

 Score = 76.6 bits (180), Expect = 1e-12
 Identities = 76/304 (25%), Positives = 129/304 (42%), Gaps = 28/304 (9%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLG-----SYKKEYLPLNRGFDSHVG--FWTGRI 65
           GLPL E  + + +K  GY T +VGKWHLG     S    +LP NRGFD  VG     G  
Sbjct: 147 GLPLYEVTIAEAMKGAGYTTGMVGKWHLGINENSSSDGAHLPANRGFD-FVGHNLPFGNS 205

Query: 66  DMYDHTTMEQGSWGTD----FRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLM 121
              D T + Q    T+    +     VA        T +  D+ +  +   N ++P F+ 
Sbjct: 206 WRCDDTGLHQDFPDTNACFLYYNSTSVAQPFQHKGLTQLLRDDTVGFIED-NVNKPFFMY 264

Query: 122 LAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLL 181
           ++ + +H+            L  +  +   S R ++   L ++D+++ ++V  L    + 
Sbjct: 265 VSFAHMHT-----------SLFSSDDFSCTSRRGRYGDNLREMDQAIEQIVTTLVDNDID 313

Query: 182 ENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQ 241
           +N+++ F++D+ GP   +          +G K   WEGG R    ++ P   S   V+++
Sbjct: 314 DNTVIFFTSDH-GPHREYCGEGGDANVFRGGKGQSWEGGHRIPYIVYWPGTISPG-VSHE 371

Query: 242 KMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALTV 301
            +   D + T  +  G  L      DG      L +   SP     +   D   + A+ V
Sbjct: 372 IVTSMDIIATAVNLGGSQLPTDRIYDGKCLKSVLLEGASSPHDDFFYYCKDT--LMAVRV 429

Query: 302 DKYK 305
            KYK
Sbjct: 430 GKYK 433


>UniRef50_UPI0000E0F7DD Cluster: aryl-sulphate sulphohydrolase; n=3;
           alpha proteobacterium HTCC2255|Rep: aryl-sulphate
           sulphohydrolase - alpha proteobacterium HTCC2255
          Length = 493

 Score = 76.2 bits (179), Expect = 2e-12
 Identities = 64/216 (29%), Positives = 99/216 (45%), Gaps = 34/216 (15%)

Query: 12  RGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFD-----SHVGFWTGRID 66
           RGL  +   + + LK  GY T   GKWHLG+      P  +GFD     SH G       
Sbjct: 130 RGLTTDIITIGESLKTAGYTTGTFGKWHLGAD-----PDKQGFDVNVAGSHQGMTFHYFS 184

Query: 67  MYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSA 126
            Y    +E G  G                Y T+  T E I  V S +K +P F  + +  
Sbjct: 185 PYQLPNIEDGPKGE---------------YLTERLTTEVIDWVKS-SKDQPFFAYVPYYT 228

Query: 127 VHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIV 186
           VH+  PY+ +          K I       +AA++  +D++VG++   L + GL EN++V
Sbjct: 229 VHT--PYQAVVDKVNKYHE-KGIKSKREATYAAMVEHMDDNVGRIFDMLDSEGLAENTVV 285

Query: 187 VFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVR 222
           +F++DNGG        ++   PL+G K + ++GG+R
Sbjct: 286 IFTSDNGGYRM-----SSFPTPLRGGKGSYYDGGLR 316


>UniRef50_UPI00006A2B15 Cluster: UPI00006A2B15 related cluster; n=1;
           Xenopus tropicalis|Rep: UPI00006A2B15 UniRef100 entry -
           Xenopus tropicalis
          Length = 323

 Score = 75.8 bits (178), Expect = 2e-12
 Identities = 76/261 (29%), Positives = 122/261 (46%), Gaps = 35/261 (13%)

Query: 16  LNEKI--LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
           LN+++  LPQ +KD GY T + GKWHLG+  +   P +RGF+       G    +    M
Sbjct: 78  LNQRVAALPQIMKDGGYWTVMAGKWHLGA-SEGMQPNHRGFERSYALMDGGASHFKQKVM 136

Query: 74  EQGSWG---TDFRRGFEVAHDL-FGVYATDVYTDEAIKVV-NSHNKSEPLFLMLAHSAVH 128
              S     T    G +V  DL    Y++  YTD+ +  + +   + +P F   A++A  
Sbjct: 137 RLASEAPEPTYLENGQKV--DLPDDFYSSRTYTDKLMTYLKDPQREGKPFFAYAAYTA-- 192

Query: 129 SGNPYEPIRAP----QKLIDAFKYIDDSARQK--------FAAVLSKLDESVGKVVKALH 176
              P+ P++AP    QK    +    D   Q+        +AA +  LD +VG+++  L 
Sbjct: 193 ---PHLPLQAPDDELQKKRGQYDVGYDVIAQRRIARTMEVYAAQVRDLDRNVGRLIDNLK 249

Query: 177 TRGLLENSIVVFSTDNGGPAAGFNDNAA-SNYPLKGVKNTLWEGGVRGAGFLWSPLLDSK 235
             G  +N++++F +DNG      ND AA  ++  K  KN   EGG+R   F+  P    K
Sbjct: 250 ASGQYDNTLIIFLSDNGPEG---NDWAADGSFDPKWFKN---EGGIRSPSFVSYP-GHVK 302

Query: 236 ARVAYQKMHISDWLPTLYSAA 256
              + Q + + D  PT+   A
Sbjct: 303 PGKSEQILTVKDIAPTILDVA 323


>UniRef50_Q4SI19 Cluster: Chromosome 5 SCAF14581, whole genome
           shotgun sequence; n=4; Euteleostomi|Rep: Chromosome 5
           SCAF14581, whole genome shotgun sequence - Tetraodon
           nigroviridis (Green puffer)
          Length = 554

 Score = 75.8 bits (178), Expect = 2e-12
 Identities = 53/188 (28%), Positives = 91/188 (48%), Gaps = 19/188 (10%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHT- 71
           G+  +E +LPQ LK  GY + +VGKWHLG ++ +YLPL  GFD  +G        Y+++ 
Sbjct: 92  GISKDEILLPQMLKKRGYISKIVGKWHLG-HRPQYLPLEHGFDEWLGAPNCHFGPYNNSV 150

Query: 72  -----TMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHS 125
                          +   F +   +     T +Y  E++  V    +++ P FL  A  
Sbjct: 151 KPNIPVYNNSEMLGRYYEEFRIDRKMGESNLTQMYLLESLDFVRRQAEAQRPFFLYWAPD 210

Query: 126 AVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSI 185
           A H+     P+ A +       ++  S R ++   + +LD SVG+++  L + G+  N+ 
Sbjct: 211 ATHA-----PVYASK------GFLGKSQRGRYGDAVVELDYSVGEILSLLRSLGIDNNTF 259

Query: 186 VVFSTDNG 193
           V F++DNG
Sbjct: 260 VFFTSDNG 267


>UniRef50_A6DHW4 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
           Lentisphaera araneosa HTCC2155|Rep:
           N-acetylgalactosamine 6-sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 512

 Score = 75.8 bits (178), Expect = 2e-12
 Identities = 84/314 (26%), Positives = 138/314 (43%), Gaps = 56/314 (17%)

Query: 4   GVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF--- 60
           G  Y  E  GLP  E+ + + LK +GYKT  VGK H+    K++ P++ GFD  +GF   
Sbjct: 87  GAYYYGEG-GLPKEEQTIAEALKSIGYKTMKVGKTHMNKGFKQH-PMDHGFDDFLGFIDH 144

Query: 61  -WT------GRIDMYDHTTMEQGSWGT-----DFRRGFEVAHDLFGVYATDVYTDEAIKV 108
            W         +D Y     + G  G         RG+E          TDV+T EA K 
Sbjct: 145 SWDFFMLSQEHLDAYKKRAKKAGHKGNIKFLGPLMRGYEKNASFKDTNITDVFTVEAQKF 204

Query: 109 VNSHNKSEPLFLMLA----HSAVH-------------------SGNPYE-PIRAPQ--KL 142
           +   NK EP +L L+    H+ +H                   + + +E P+  P+  K 
Sbjct: 205 I-VENKDEPFYLRLSFNAVHTPLHLVPEELAKKHGIKQPKWDPNASTWEYPLWDPKTLKY 263

Query: 143 IDAFKYI------DDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPA 196
            + +K +      D   R K+   L  +D+++GK++K L  + + +N+++ FS+DNGG  
Sbjct: 264 NEWYKQVCHLQNPDPYGRLKYLIHLEMIDQAIGKILKTLDEQQIRDNTLIFFSSDNGGS- 322

Query: 197 AGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAA 256
              + + A+N  L   K ++ +G +     +  P    KA  +   +   D   T+    
Sbjct: 323 ---HQSYANNGHLNAFKYSVMDGALHVPFLVSYPAKLPKANKSDALVSHMDIFATIADLT 379

Query: 257 GGDLSVLENLDGVN 270
           G  LS    LDG++
Sbjct: 380 G--LSPKNKLDGLS 391


>UniRef50_A6DG52 Cluster: Arylsulphatase A; n=1; Lentisphaera
           araneosa HTCC2155|Rep: Arylsulphatase A - Lentisphaera
           araneosa HTCC2155
          Length = 419

 Score = 75.8 bits (178), Expect = 2e-12
 Identities = 73/264 (27%), Positives = 115/264 (43%), Gaps = 29/264 (10%)

Query: 25  LKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGTDFRR 84
           +K+ GY T + GKW L + +K  L  + GFD++   W+     Y  T   +  W     R
Sbjct: 96  MKEAGYATAVAGKWQLYTGRKGSLAPDCGFDTYC-LWS-----YPGTERSR-FWNPSLIR 148

Query: 85  GFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLID 144
             +        Y  D+ TD  I  +   NKS+P F       VHS  P+ P   P    D
Sbjct: 149 DGKKVPVTPNSYGPDICTDFIIDFIKK-NKSQPFFAYYPMLLVHS--PFVP--TP----D 199

Query: 145 AFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAA 204
           +      +  + +  ++S +D+ +G+++  L    L +N+IV+F+TDNG           
Sbjct: 200 SKDKNSTNKLENYRDMVSYMDKCIGRIIDTLEETNLRKNTIVLFTTDNG-------TGRP 252

Query: 205 SNYPLKGVKNT-----LWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGD 259
             YP KG K         +GG      +  P + S+  V+   +  SD+LPTL   +G +
Sbjct: 253 LTYPYKGEKRVGEKAYPTDGGSHVPLIVNGPGIVSQGLVSDDIVDFSDFLPTLADISGAN 312

Query: 260 LSVLENLDGVNQWDALSKNTESPR 283
           L  +  LDG + W        SPR
Sbjct: 313 LPNV-TLDGRSFWPQCLGKKGSPR 335


>UniRef50_A3ZVD1 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
           n=2; Planctomycetaceae|Rep: N-acetylgalactosamine
           6-sulfate sulfatase - Blastopirellula marina DSM 3645
          Length = 496

 Score = 75.8 bits (178), Expect = 2e-12
 Identities = 74/279 (26%), Positives = 123/279 (44%), Gaps = 31/279 (11%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
           L L E  L + LK  GY T   GKWHLG   +   P ++GFD ++G    R   Y     
Sbjct: 136 LALEETTLAEALKQRGYATFFAGKWHLGP--EGNWPEDQGFDVNIG-GIDRGGPYGGKKY 192

Query: 74  EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSG--- 130
                      G +      G +  D    E +K +  H + +P    L+  +VH+    
Sbjct: 193 FSPYGNPRLTDGPD------GEHLPDRLASETVKFIEQH-QDQPFLAYLSFYSVHTPLMA 245

Query: 131 -----NPYEPIRAPQKLIDAFKYIDDSARQK---------FAAVLSKLDESVGKVVKALH 176
                  Y+ I+  Q++  A     +  + K         +A ++  +D +VGKV+ AL 
Sbjct: 246 REDLKQKYDEIK--QRIRFAGPIWGEEGKSKLRLVQEHSVYAGMVEAMDAAVGKVLDALD 303

Query: 177 TRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKA 236
              L +N++V+F++DNGG +     +  SN PL+G K  ++EGG+R    +  P + S  
Sbjct: 304 RLKLTDNTLVIFTSDNGGLSTS-EGHPTSNLPLRGGKGWMYEGGIREPLVVRYPGVTSPG 362

Query: 237 RVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDAL 275
             +   +   D+LPT+ +        ++  DGV+   AL
Sbjct: 363 SESDALVTSPDFLPTILAVVDKPGDKIDT-DGVSIISAL 400


>UniRef50_Q15SA2 Cluster: Sulfatase precursor; n=1;
           Pseudoalteromonas atlantica T6c|Rep: Sulfatase precursor
           - Pseudoalteromonas atlantica (strain T6c / BAA-1087)
          Length = 724

 Score = 75.4 bits (177), Expect = 3e-12
 Identities = 78/301 (25%), Positives = 128/301 (42%), Gaps = 29/301 (9%)

Query: 21  LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGT 80
           L    K  GY T   GKWHLG++   Y P   GFD  +  + G            G +  
Sbjct: 133 LSSIAKANGYHTAHFGKWHLGAHP--YSPSEHGFDIDIPNFQG--------AGPTGGYLA 182

Query: 81  DFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQ 140
            +    ++   + G +       EA K + S     P FL     +VH+     P  A  
Sbjct: 183 PWSFAPDIQPQIAGEHIDIRLAKEAKKWIFSVKDDGPFFLNFWAFSVHA-----PFNADA 237

Query: 141 KLIDAF----KYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPA 196
             ID F              +AA++ + D+++G + +AL    + +N+I++F++DNGG  
Sbjct: 238 DEIDYFINKRSGFHSQRNATYAAMVKQFDDAIGVLWQALVEAKVEKNTIIIFTSDNGGNM 297

Query: 197 AGF--NDNAASNYPLKGVKNTLWEGGVR-GAGFLWSPLLDSKARVAYQKMHISDWLPTLY 253
                N +A SN+PLKG K T +EGG++     +W P L     ++   +  +D+ PTL 
Sbjct: 298 YTVVGNTHATSNFPLKGGKATEYEGGLKVPTAVIW-PGLTQPNTLSNTPIQTADFFPTLL 356

Query: 254 SAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLH-----NIDD-IWGIAALTVDKYKLI 307
           +           +DG +    L   T   R    +      + D +   A +T+D +KLI
Sbjct: 357 NGVNLSWPSTHIVDGRDIRPVLQGGTLETRAIFTYYPAEPKVPDWLPPSATVTLDGWKLI 416

Query: 308 K 308
           +
Sbjct: 417 R 417


>UniRef50_A6DID7 Cluster: Aryl-sulphate sulphohydrolase; n=2;
           Lentisphaera araneosa HTCC2155|Rep: Aryl-sulphate
           sulphohydrolase - Lentisphaera araneosa HTCC2155
          Length = 523

 Score = 74.9 bits (176), Expect = 4e-12
 Identities = 69/258 (26%), Positives = 117/258 (45%), Gaps = 27/258 (10%)

Query: 17  NEKI-LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQ 75
           +EK+   + LK +GY T + GKWH+  + +    ++ G    +    G  D+ DH+  + 
Sbjct: 143 DEKVSFAEALKKVGYSTAMYGKWHISGHGRYGSGVDGGVSPQM---QGFDDVIDHSARDL 199

Query: 76  GSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVN-SHNKSEPLFLMLAHSAVHSGNPYE 134
            S    F++  +    +F       YT  AI+    S   ++P  + LAH AVH+GN   
Sbjct: 200 DSL---FKKNGD-PKQMF------TYTKRAIEFAEKSTQDNKPFMIYLAHHAVHTGNDVG 249

Query: 135 PIRAPQKLIDAFKYI---DDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
                +K     K +   ++     +AA L+  D S+G ++  L    + +N++++F +D
Sbjct: 250 SRTETRKYFTDKKSMGKYEEKVNTSYAAHLADTDTSIGLLLDKLEELKIKDNTVIMFLSD 309

Query: 192 NGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPT 251
           NGG     +       PL+  K + +EGG+R   F+  P            M I D  PT
Sbjct: 310 NGGIPTRLHQK-----PLRSWKGSYYEGGIRVPFFISWPKQFKPTETDVPAMAI-DLYPT 363

Query: 252 LYSAAGGDLSVLEN-LDG 268
           +   AG  +  +EN LDG
Sbjct: 364 MLELAG--VKDIENHLDG 379


>UniRef50_A6C430 Cluster: Arylsulphatase A; n=1; Planctomyces maris
           DSM 8797|Rep: Arylsulphatase A - Planctomyces maris DSM
           8797
          Length = 503

 Score = 74.9 bits (176), Expect = 4e-12
 Identities = 66/264 (25%), Positives = 116/264 (43%), Gaps = 19/264 (7%)

Query: 11  PRGLPLNEKILPQYLKDLGYKTHLVGKWHL-GSYKK--EYLPLNRGFDSHVGFWTGRIDM 67
           P  +   E  +   L+  GY T  VGKWHL G +    +  P + GFD         +  
Sbjct: 110 PMHVRKREITIATLLRQAGYATCHVGKWHLNGMFNMVGQPQPSDHGFDHWFSTQNNALPT 169

Query: 68  YDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIK-VVNSHNKSEPLFLMLAHSA 126
           +++          +F R       L G +A+ +  DEA + +    +K +P F+ +    
Sbjct: 170 HENPF--------NFVRNARPVGPLQG-FASQLVADEAEEWLTQLRDKEKPFFMFVCFH- 219

Query: 127 VHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIV 186
                P+EPI + ++    +   + S        ++++D++ G+++K L  + L EN+++
Sbjct: 220 ----EPHEPIASAERFRKLYTAPEGSTLPAHHGNVTQMDDAFGRILKTLDDQKLRENTLI 275

Query: 187 VFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHIS 246
           +F++DN GPA        S+ PL+  K   +EGG+R  G +  P        +   +   
Sbjct: 276 IFTSDN-GPAITRRHPHGSSGPLRDKKGATYEGGIRVPGIVQWPEHVQPGTTSDVPVCGV 334

Query: 247 DWLPTLYSAAGGDLSVLENLDGVN 270
           D LPTL + A         LDG N
Sbjct: 335 DILPTLCAVADIPAPTDRVLDGTN 358


>UniRef50_A6C284 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
           Planctomyces maris DSM 8797|Rep: N-acetylgalactosamine
           6-sulfatase - Planctomyces maris DSM 8797
          Length = 605

 Score = 74.9 bits (176), Expect = 4e-12
 Identities = 84/312 (26%), Positives = 142/312 (45%), Gaps = 43/312 (13%)

Query: 17  NEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQG 76
           +E  + Q  K  GY T   GKWH G+    + P  +GFD + GF +G    Y    ++  
Sbjct: 121 DEYTIAQAFKAAGYATGAFGKWHNGTQYPNH-PNAKGFDEYYGFTSGHWGHYFSPMLDHN 179

Query: 77  SWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKS-EPLFLMLAHSAVHSGNPYEP 135
             GT F +G          Y TD  TD+A+  +    ++ +P F  L +   HS     P
Sbjct: 180 --GT-FVKG--------NGYITDDLTDKAMAFIEQQVQNHKPFFAYLPYCTPHS-----P 223

Query: 136 IRAPQKLIDAFK------YIDDSARQK------FAAVLSKLDESVGKVVKALHTRGLLEN 183
           ++ P +  D FK      +  +  R++        A+   +D +VG+V+K L++  + ++
Sbjct: 224 MQVPDQYWDRFKDKQLKLHNREPDREQPDHLRAALAMCENVDWNVGRVLKKLNSLRITDD 283

Query: 184 SIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQK 242
           +IV++ +DNG     +N +      +KG K +L EGGVR    + W   L +   V  Q 
Sbjct: 284 TIVIYFSDNGPNGVRWNGD------MKGKKGSLDEGGVRSPFVIRWPGHLPAGQEV-NQI 336

Query: 243 MHISDWLPTLYSAAGGDLSVLENLDGVNQWD-ALSKNTESPRTSVLHNIDDIWGIAALTV 301
               D LPTL   AG      + +DGV+     L+   + P   +  ++ +     ++  
Sbjct: 337 AGAIDLLPTLTDLAGIKRPEPKPIDGVSLKPLMLNSKADWPERMIFSSLRN---RVSVRT 393

Query: 302 DKYKLI-KGTIY 312
           D+Y+L  KG +Y
Sbjct: 394 DQYRLSRKGELY 405


>UniRef50_A3ZWK4 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
           Blastopirellula marina DSM 3645|Rep:
           N-acetylgalactosamine 6-sulfatase - Blastopirellula
           marina DSM 3645
          Length = 442

 Score = 74.9 bits (176), Expect = 4e-12
 Identities = 73/266 (27%), Positives = 116/266 (43%), Gaps = 28/266 (10%)

Query: 18  EKILPQYLKDLGYKTHLVGKWHLGSYKKE--YLPLNRGFDSHVGFWTGRIDMYDHTTMEQ 75
           E  L + L+  GY T   GKWHLGS +K+    P   GFD     W    + YD+  +  
Sbjct: 72  EITLAERLQAAGYATSHFGKWHLGSVRKDSPVSPGKCGFDD----WISAPNFYDNDPIM- 126

Query: 76  GSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEP 135
               +D  R  +   +     ++DV  D AI  + +  K E  F     S V  G+P+ P
Sbjct: 127 ----SDQGRAVQYHGE-----SSDVTADLAIDWIRAQAKEEKPFF----SVVWFGSPHSP 173

Query: 136 IRAPQKLIDAFKYIDDSAR-QKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGG 194
             A     D   Y D+ A+ + +   ++ +D + GK+   L   G+ +N+I+ + +DNG 
Sbjct: 174 HIAAD--ADRELYKDEPAKFRDYYGEVTGIDRAYGKIRSTLKELGISDNTILWYCSDNGA 231

Query: 195 PAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYS 254
             A       S  P +  K +++EGG+   G L  P      +    +    D  PT+ +
Sbjct: 232 DKA-----KGSAGPFREKKGSIYEGGLLVPGILDWPARFPAPQTTSLRATTCDIFPTVLA 286

Query: 255 AAGGDLSVLENLDGVNQWDALSKNTE 280
           AAG        LDG+N    L+  T+
Sbjct: 287 AAGLSPDKQRPLDGINLLPLLTAKTD 312


>UniRef50_A3HYT7 Cluster: Arylsulphatase A; n=1; Algoriphagus sp.
           PR1|Rep: Arylsulphatase A - Algoriphagus sp. PR1
          Length = 437

 Score = 74.9 bits (176), Expect = 4e-12
 Identities = 81/314 (25%), Positives = 128/314 (40%), Gaps = 18/314 (5%)

Query: 14  LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
           L  ++    + LKD GYKT + GKW LG  K+   P + GF+     W   +   D    
Sbjct: 101 LDRSQTTFAKLLKDAGYKTAIAGKWQLG--KESDSPQHFGFEESC-LWQHMLGATDKNGN 157

Query: 74  EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPY 133
           +               H   G ++TD+ +D  I  +   NK +P F        H   P+
Sbjct: 158 DTRYSNPVLEINGVPKHFDGGQFSTDITSDFLIDFMEK-NKDQPFFAYYPMIITHC--PF 214

Query: 134 EPIRAPQKLIDAFKYIDDSAR---QKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFST 190
            P     K  D       + +   Q F  +++ +D++VGK++  +   GL E +I++F+ 
Sbjct: 215 VPT-PDSKDWDPSSPGSPTYKGDPQYFGDMVAYMDKTVGKIIAKVEEMGLSEETIIIFTG 273

Query: 191 DNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWL 249
           DNG      +     +YP  G K    E G+     + W   +DS  +     +  SD+L
Sbjct: 274 DNGTDQPIVSSYRGKDYP--GGKKFTTENGIHVPLVVKWKGKIDSGIQ-NEDLIDFSDFL 330

Query: 250 PTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRT---SVLHNIDDIWGIAALTVDK-YK 305
           PTL   AG        LDGV+    L     +PR    S      D+  +     +K YK
Sbjct: 331 PTLLDLAGIKAVHGIPLDGVSFMPQLMGKEGNPRNWIYSWYSRNGDLESLQEFVWNKEYK 390

Query: 306 LIKGTIYKGVWDNW 319
           L K   +  + D+W
Sbjct: 391 LYKTGEFFNIQDDW 404


>UniRef50_A6LEC5 Cluster: Arylsulfatase A; n=1; Parabacteroides
           distasonis ATCC 8503|Rep: Arylsulfatase A -
           Parabacteroides distasonis (strain ATCC 8503 / DSM 20701
           / NCTC11152)
          Length = 483

 Score = 74.5 bits (175), Expect = 5e-12
 Identities = 71/262 (27%), Positives = 122/262 (46%), Gaps = 29/262 (11%)

Query: 11  PRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEY-LPLNRGFDSHVGFWTGRIDMYD 69
           P  L  +E  + + LK   Y T   GKWHL S + +   P ++GFD    F+     +  
Sbjct: 117 PMHLRDSEVTIAEVLKQADYATGHFGKWHLSSGRPDQPYPNDQGFD--YSFYALNNSVPS 174

Query: 70  HTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHS 129
           H         T+F R  E   ++ G Y+ D+   EA++ ++  NK EP FL      V  
Sbjct: 175 HHNP------TNFFRNGEPQGEIEG-YSCDIVVTEALQWLDK-NKQEPFFLN-----VWF 221

Query: 130 GNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFS 189
             P+ P+ AP++L         +   ++   +  +D ++GK++  L  + L +N+IV+F+
Sbjct: 222 NEPHFPMEAPEELKKRH-----AINPEYYGCIENMDIAIGKLMNYLKEQNLEDNTIVIFA 276

Query: 190 TDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDW 248
           +DNG      +    SN P +G K+  +EGG+R    + W   + +     +     +D 
Sbjct: 277 SDNG------SQWDYSNLPFRGEKHFNYEGGLRVPCIVRWHKHVPTGVISEFNGC-FTDI 329

Query: 249 LPTLYSAAGGDLSVLENLDGVN 270
           LPTL S A   +     +DG++
Sbjct: 330 LPTLASLADAPVPTDRVIDGMD 351


>UniRef50_A6DJ11 Cluster: Arylsulfatase A; n=1; Lentisphaera
           araneosa HTCC2155|Rep: Arylsulfatase A - Lentisphaera
           araneosa HTCC2155
          Length = 462

 Score = 74.5 bits (175), Expect = 5e-12
 Identities = 73/278 (26%), Positives = 121/278 (43%), Gaps = 22/278 (7%)

Query: 12  RGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF-----WTGRID 66
           +GL    + + + LK +GY T  VGKWHLG  + E+LP N+GFDS+ G       T    
Sbjct: 100 KGLDPKHQTIAKLLKSVGYATKAVGKWHLGD-ELEFLPTNQGFDSYYGIPYSNDMTPAFS 158

Query: 67  M-YDHTTM-EQGSWGTDFRRGFEVAHDLFGVYATD----VYTDEAIKVVNSHNKSEPLF- 119
           M Y    +  +G      ++ FE A+ +  V   D    +  DE I++    +     F 
Sbjct: 159 MKYSENCLYREGVDQEALKKAFE-ANKIKPVGMKDKVPLMRNDECIEMPADQSTITKRFT 217

Query: 120 ---LMLAHSAVHSGNP---YEPIRAPQKLIDAFK-YIDDSARQKFAAVLSKLDESVGKVV 172
              +     +  S  P   Y     P   +   K +   SA   +  V+ ++D +VG+++
Sbjct: 218 DESIKFIDESTASNKPFFLYLAHSMPHTPLYVSKDFEGKSAGGIYGDVIEEIDYNVGRII 277

Query: 173 KALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLL 232
             L+ + + EN++ ++++DN GP      +  S  PL   K T +EGG R    +  P  
Sbjct: 278 DHLNEKNIAENTLFIYTSDN-GPWLIKKSHGGSALPLFEGKMTSFEGGQRVPAIIRWPAK 336

Query: 233 DSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
             K  V+ +     D  PTL    G      + ++G N
Sbjct: 337 IPKDSVSNEMTLSMDIFPTLAKITGAKAQDADLINGKN 374


>UniRef50_Q7UL40 Cluster: Arylsulfatase A; n=1; Pirellula sp.|Rep:
           Arylsulfatase A - Rhodopirellula baltica
          Length = 592

 Score = 74.1 bits (174), Expect = 7e-12
 Identities = 69/259 (26%), Positives = 114/259 (44%), Gaps = 26/259 (10%)

Query: 18  EKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPL---NRGFDSHVGFWTGRIDMY-DHTTM 73
           E  + +     GY+T + GKWHLG    E  P+   ++GF   V    G I  + D+   
Sbjct: 126 ETTIAEVFAGAGYRTGIFGKWHLG----ENFPMRAEDQGFQKVVVHGGGGIGQFADYPGN 181

Query: 74  EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPY 133
                   +   F+ A      Y TDV+ DE+I+ +    + +P F  L  +  HS  P+
Sbjct: 182 TYWDPTLQYNDSFKKAKG----YCTDVFIDESIQFMKDSGE-QPFFCYLPLNVPHS--PF 234

Query: 134 EPIRAPQKLIDAFKYIDDSARQKFAAV---LSKLDESVGKVVKALHTRGLLENSIVVFST 190
           +     +   D     D   R+  A +   +++ D + G++++A+   G  EN+I++F +
Sbjct: 235 DVADEFRADYDNQNLADPDGRKWVAPIYGMITQFDGAFGRLLEAVEDMGQRENTIILFMS 294

Query: 191 DNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWL 249
           DNG  +  F         L+  K +++E G+R    + W   L    +     MHI D L
Sbjct: 295 DNGPNSTYFTAG------LRAKKGSVYENGIRSPFVIQWPKTLQGGRKFDTPAMHI-DLL 347

Query: 250 PTLYSAAGGDLSVLENLDG 268
           PTL  A G  L     +DG
Sbjct: 348 PTLADACGIGLPADLQVDG 366


>UniRef50_Q4BZ10 Cluster: Similar to Arylsulfatase A and related
           enzymes; n=1; Crocosphaera watsonii WH 8501|Rep: Similar
           to Arylsulfatase A and related enzymes - Crocosphaera
           watsonii
          Length = 407

 Score = 74.1 bits (174), Expect = 7e-12
 Identities = 63/200 (31%), Positives = 97/200 (48%), Gaps = 26/200 (13%)

Query: 21  LPQYLKDLGYKTHLVG--KWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSW 78
           +P+ L+D GY T LVG  KW +GS+ +   PL+RGF           +M  H    +   
Sbjct: 1   MPETLRDAGYVTGLVGALKWDIGSWNQG--PLDRGFT----------EMALHPPRTEP-- 46

Query: 79  GTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNK--SEPLFLMLAHSAVHSGNPYEPI 136
            T F  G      + G Y T+V     ++ +  H K   +P FL  A  A+H  +   P 
Sbjct: 47  -TIFGGGSTYL-GVDGSYLTEVEGQYVLEFLERHGKRRDKPFFLYFAPLAIHIPHTEVPK 104

Query: 137 RAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGG-P 195
           +  ++L       + S RQ   A L  LD+ +G+++K +   G+ EN++V+FS+DNGG P
Sbjct: 105 KYLKRLYPEHTEKEYSKRQYLQANLLALDDQIGRMIKKISELGIKENTLVMFSSDNGGDP 164

Query: 196 AAGFNDNAASNYPLKGVKNT 215
            A    +     P +G KNT
Sbjct: 165 LADHRPD-----PYRGGKNT 179


>UniRef50_A6DM53 Cluster: Arylsulfatase; n=1; Lentisphaera araneosa
           HTCC2155|Rep: Arylsulfatase - Lentisphaera araneosa
           HTCC2155
          Length = 540

 Score = 74.1 bits (174), Expect = 7e-12
 Identities = 55/147 (37%), Positives = 73/147 (49%), Gaps = 20/147 (13%)

Query: 8   GAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKE----YLPLNRGFDSHVGFWTG 63
           G+    L  N   +PQ LK  GYKT +VGKWHLG    +      P+NRGFD   G   G
Sbjct: 108 GSYDNYLNKNRITIPQVLKTTGYKTAMVGKWHLGGKSFDPNGPNAPMNRGFDDFYGTLHG 167

Query: 64  RIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLML 122
               YD  T+      T  R+  E  H+ F  Y TD   +EA++ + +  K+E P F  +
Sbjct: 168 AGSYYDPMTL------TRNRKSMEPDHESF--YYTDKIGEEAVRQIKALAKAEQPFFQYI 219

Query: 123 AHSAVHSGNPYEPIRAPQKLIDAFKYI 149
           A +A     P+ PI AP+K I   KYI
Sbjct: 220 AFTA-----PHWPIHAPEKTIQ--KYI 239


>UniRef50_A6DRX0 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
           n=3; Bacteria|Rep: N-acetylgalactosamine 6-sulfate
           sulfatase - Lentisphaera araneosa HTCC2155
          Length = 486

 Score = 73.7 bits (173), Expect = 9e-12
 Identities = 65/239 (27%), Positives = 100/239 (41%), Gaps = 18/239 (7%)

Query: 25  LKDLGYKTHLVGKWHLGSYKKEYLPLNR-GFDSHVGFWTGRIDMYDHTTMEQGS---WGT 80
           +KDLGY+T   GKW L  ++ E L + + GFD     WTG     D T  ++ +   W  
Sbjct: 126 MKDLGYRTFATGKWQLNDFRLEPLAMQKHGFDDWA-MWTGCETSKDKTHEKKSTQRYWNA 184

Query: 81  DFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQ 140
                 E +    G +  D+YTD  I  +   NK +P+ +           P+ P+ A  
Sbjct: 185 HINTK-EGSKTYKGQFGPDLYTDHLINFMRK-NKDKPMCIYYPMVL-----PHTPVAATP 237

Query: 141 KLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNG-GPAAGF 199
               A   +      K  A++  +D+ VGK+V  L   G+ E +I++F+TDNG  P    
Sbjct: 238 DEPKAKGVLG-----KHKAMVRYIDKMVGKLVNELDELGIRERTIIIFTTDNGSAPPPRG 292

Query: 200 NDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGG 258
                +   + G K+T  E G+     +  P L             +D LPT     GG
Sbjct: 293 VIGTRNGRKIVGAKSTETEVGICAPFIVNGPGLVPAGVETDALTDFTDMLPTFLELGGG 351


>UniRef50_A6DJ15 Cluster: Putative arylsulfatase; n=2; Lentisphaera
           araneosa HTCC2155|Rep: Putative arylsulfatase -
           Lentisphaera araneosa HTCC2155
          Length = 469

 Score = 73.7 bits (173), Expect = 9e-12
 Identities = 83/306 (27%), Positives = 134/306 (43%), Gaps = 40/306 (13%)

Query: 3   HGVI---YGAEPRG----LPLNEK--ILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRG 53
           HG++   Y   P G    LPL  +   L + +K  GY T L+GKW +G       P  +G
Sbjct: 84  HGLVRGNYEVGPHGFGGELPLRPEDVSLAEVMKSAGYATGLIGKWGMGMDGTTGEPRKKG 143

Query: 54  FDSHVGFWTGRIDMYDHTTMEQGSWGTDFRRGF-EVAHDLFGVYATDVYTDEAIKVVNSH 112
           FD   GF       + H    +  +    +    E   D  G+Y +D + ++ I+ V   
Sbjct: 144 FDYSYGFLN---QAHAHHYYPEYIYENGEKLMIPENKDDARGLYISDTFAEKGIEFVEE- 199

Query: 113 NKSEPLFLMLAHSAVHS-------------GN-PYEPIRAPQKLIDAFK-----YID-DS 152
           NK +P FL  A    H+             G  P  P    ++  D        Y   D 
Sbjct: 200 NKDKPFFLFWAFVTPHAELLVPDDSLNEFKGKWPETPFVMGKQGGDGTDNPFGVYASQDH 259

Query: 153 ARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDN--AASNYPLK 210
            R  F+ ++++LD+ VG +   L   G+ +N+I++FS+DNG    G  D     SN  L 
Sbjct: 260 PRAAFSGMITRLDKRVGDLFDKLEELGIDDNTIIMFSSDNGPHKEGGADPDFFDSNAELT 319

Query: 211 GVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGV 269
           G K  L EGG+R    + W  ++ ++++ ++      D +PT+   A  D    E++DG+
Sbjct: 320 GYKRDLTEGGIRVPFMVRWPNVVKARSKSSHASA-FWDVMPTIAEIANTDSP--EDIDGL 376

Query: 270 NQWDAL 275
           +   AL
Sbjct: 377 SFLPAL 382


>UniRef50_A6CBM1 Cluster: Arylsulphatase A; n=1; Planctomyces maris
           DSM 8797|Rep: Arylsulphatase A - Planctomyces maris DSM
           8797
          Length = 497

 Score = 73.3 bits (172), Expect = 1e-11
 Identities = 73/291 (25%), Positives = 128/291 (43%), Gaps = 28/291 (9%)

Query: 11  PRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSY---KKEYLPLNRGFDSHVGFWTGRIDM 67
           P  L  +E  + Q L+  GY T  VGKWH       K++  P + GF          +  
Sbjct: 108 PMHLKRDEVTVAQLLQQAGYDTAHVGKWHCNGMFNSKEQPQPGDHGFRHWFSTQNNALPT 167

Query: 68  YDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNS-HNKSEPLFLMLAHSA 126
           +++          +F R  +   ++ G ++  +  DE I+ ++    K +P FL   H  
Sbjct: 168 HENPN--------NFVRNGKPLGEIEG-FSCQIVADEGIRWLSDWREKEKPFFL---HVC 215

Query: 127 VHSGNPYEPIRAPQKLIDAF--KYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENS 184
            H   P+E + +P  L++ +  K + +   Q FA V + +D +VGK++  L    + +N+
Sbjct: 216 FHE--PHERVASPPALVETYLDKSLYEDQAQYFANV-ANMDRAVGKLLIKLDELKVADNT 272

Query: 185 IVVFSTDNGGP-----AAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARV 238
           +V F++DNG         G   +  S   L+G+K  ++EGG+R  G + W   + +   +
Sbjct: 273 LVFFTSDNGPETLNRYGKGSRRSWGSPGVLRGMKLHIYEGGIRVPGIVRWPGKIKAGQEI 332

Query: 239 AYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHN 289
           A     + D LPT    AG  +     LDG +     + N     T +  N
Sbjct: 333 ATPVCSV-DLLPTFCEIAGVAVPDQRPLDGASLLPLFAGNKIERTTPLFWN 382


>UniRef50_UPI0000E1104B Cluster: N-acetylgalactosamine 6-sulfate
           sulfatase; n=1; alpha proteobacterium HTCC2255|Rep:
           N-acetylgalactosamine 6-sulfate sulfatase - alpha
           proteobacterium HTCC2255
          Length = 485

 Score = 72.9 bits (171), Expect = 2e-11
 Identities = 71/274 (25%), Positives = 123/274 (44%), Gaps = 39/274 (14%)

Query: 13  GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
           G+  + +  P+ L+ +GYKT L+GKWHLG Y+ E+ P   G+D  +GF  G     D   
Sbjct: 110 GIEQSYETWPEILQKVGYKTGLIGKWHLG-YQPEHHPTQHGYDEFIGFLAGGTTPEDPRL 168

Query: 73  MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVH---- 128
              G           V  +  G+   +V T+ AI  +N H K +   L L + A H    
Sbjct: 169 EVNG-----------VETNELGL-TVEVLTNHAIAFLNRH-KDDKFALSLHYRAPHYRFL 215

Query: 129 -----SGNPYEPIRAPQKLIDAFKYIDDSAR---QKFAAVLSKLDESVGKVVKALHTRGL 180
                   PYE +       D      + AR   +++ + ++ +D +VG +++ L   GL
Sbjct: 216 PVAPEDAAPYEDVEIALPHPDYPGLNTERARKLMREYMSSVTGIDRNVGLLMQTLEQLGL 275

Query: 181 LENSIVVFSTDNG-GPAAGFNDNAASNY------PL------KGVKNTLWEGGVRGAGFL 227
            +N++V+F++D+G   A     +  + Y      PL      +G +  +++  ++    +
Sbjct: 276 SQNTVVIFTSDHGYNIAHNGMWHKGNGYWLLYEPPLGTPNVPRGQRPNMYDNSLKVPTIV 335

Query: 228 WSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLS 261
             P +  KA +    M   DW PTL + A G +S
Sbjct: 336 RWPGVIPKASINDSTMSNLDWFPTLVAIARGKVS 369


>UniRef50_A6DNI9 Cluster: N-acetyl-galactosamine-6-sulfatase; n=1;
           Lentisphaera araneosa HTCC2155|Rep:
           N-acetyl-galactosamine-6-sulfatase - Lentisphaera
           araneosa HTCC2155
          Length = 500

 Score = 72.9 bits (171), Expect = 2e-11
 Identities = 86/300 (28%), Positives = 123/300 (41%), Gaps = 51/300 (17%)

Query: 11  PRGLPLNEKILPQYLKDL---GYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDM 67
           P+     + + P Y K L   GY T   GKWHLG   + Y PL  GFD  V         
Sbjct: 119 PKSTTRLDTVFPTYAKVLKAQGYVTGHYGKWHLGH--EPYTPLEHGFDVDV--------- 167

Query: 68  YDHTTMEQGSWGTDFRRGFEVAHDLF----GVYATDVYTDEAIKVVNSHNKSEPLFLMLA 123
             HT    G  G+ F  G +   D F    G +  D    EAI+ +   NK  P  L   
Sbjct: 168 -PHTK-SHGPKGSYF--GPKKYSDSFTLKKGEHLEDRMGQEAIEFIKE-NKDRPFLLNYW 222

Query: 124 HSAVHSGNPYEPIRAPQKLIDAFKY----IDDSARQK---FAAVLSKLDESVGKVVKALH 176
             +VHS     P+ A   L+D ++     +   A+Q+   FA ++   D++VG ++KA+ 
Sbjct: 223 AFSVHS-----PMFAKLDLLDKYRKKATKLPTDAQQRNPIFAGMIETFDDNVGLLLKAID 277

Query: 177 TRGLLENSIVVFSTDNGGPAAGFNDN----------------AASNYPLKGVKNTLWEGG 220
             G+ + +I+V S+DNGG       +                A SNYPLK  K T+ +GG
Sbjct: 278 EAGIADRTIIVLSSDNGGTIESAYTHEAYWGNGTVEEIVDIPATSNYPLKSGKGTIHDGG 337

Query: 221 VRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTE 280
                 +  P        +       D  PT    AG  +     +DGV+Q  AL    E
Sbjct: 338 TAVPFIVVWPGKIKAGTKSDSYFSGVDVFPTFVEMAGAKMPSGVAIDGVSQVPALITGEE 397


>UniRef50_A4A218 Cluster: Arylsulfatase A; n=1; Blastopirellula
           marina DSM 3645|Rep: Arylsulfatase A - Blastopirellula
           marina DSM 3645
          Length = 491

 Score = 72.9 bits (171), Expect = 2e-11
 Identities = 71/264 (26%), Positives = 113/264 (42%), Gaps = 15/264 (5%)

Query: 5   VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGR 64
           V+   + +GL   E  + + L  +GY T + GKWHLG  + E+LP  +GFD+  G     
Sbjct: 115 VLRPLDTKGLNPKETTMAEVLHSVGYATGIFGKWHLGD-QPEFLPTQQGFDTFFGIPYSD 173

Query: 65  IDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAH 124
            DM      +        R    +   +         T+EAI  +   N+  P F+ + H
Sbjct: 174 -DMTKDLRPQLWPELPLMRDEQVIEAPVDRDLLVKRCTEEAIAFIEQ-NQERPFFVYIPH 231

Query: 125 SAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENS 184
           +    G+   P  +P     AF+    S    +   + +LD S G+V++ L    L E +
Sbjct: 232 TM--PGSTKRPFSSP-----AFQ--GKSKNGPYGDSVEELDWSTGQVMETLKRLDLDEQT 282

Query: 185 IVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMH 244
           +V++++DNG P    N    SN P +G      EG +R    +  P   S  ++      
Sbjct: 283 LVIWTSDNGAPHR--NPPQGSNLPYQGDGYNTSEGAMRMPCVMRWPGKISAGQINDALCT 340

Query: 245 ISDWLPTLYSAAGGDLSVLENLDG 268
             D LPT    AG  +S  E +DG
Sbjct: 341 TMDLLPTFGKLAGATMSKTE-IDG 363


>UniRef50_Q89L10 Cluster: Bll4738 protein; n=6; Proteobacteria|Rep:
           Bll4738 protein - Bradyrhizobium japonicum
          Length = 487

 Score = 72.1 bits (169), Expect = 3e-11
 Identities = 74/292 (25%), Positives = 122/292 (41%), Gaps = 20/292 (6%)

Query: 8   GAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF-WTGRID 66
           G  P GL   E    + L   GY T + GKWHLGS  ++ +P N+GFD   G   T    
Sbjct: 107 GGIPDGLTQWEITTAELLSGQGYATGMWGKWHLGS-AEDRMPTNQGFDEWYGIPRTYDEA 165

Query: 67  MYDHTTMEQGSW-GTDFRRGF--EVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLA 123
           M+      +  W     R+G+  +V H      A        + V++  ++   +   + 
Sbjct: 166 MWPSLDETRSMWPSVGNRQGWNAKVVHPQHIYEARKGDKPRQVAVLDE-DRRRTMDAEIT 224

Query: 124 HSAVH-------SGNP---YEPI-RAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVV 172
             AV        SG P   Y P        +   ++   +    +A  L+++D   G+++
Sbjct: 225 SRAVEFIKRNASSGKPFYAYVPFAHVHMPTLPNLEFAGRTGNGDWADCLAEMDYRTGQIL 284

Query: 173 KALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLL 232
            A+   G+  +++V+F++DNG  A   N     N P +G   T  EG +R    +  P  
Sbjct: 285 DAIKQAGIENDTLVIFASDNGPEAT--NPWEGDNGPWRGTYFTAMEGSLRAPFIIRWPGK 342

Query: 233 DSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWD-ALSKNTESPR 283
               R++ + +H  D   TL    G ++     +DGV+Q D  L K   S R
Sbjct: 343 VPAGRISNEIVHTVDLFTTLARVGGAEVPTDRAIDGVDQLDFFLGKQEASNR 394


>UniRef50_A6EGE7 Cluster: N-acetylgalactosamine-6-sulfatase; n=3;
           Bacteroidetes|Rep: N-acetylgalactosamine-6-sulfatase -
           Pedobacter sp. BAL39
          Length = 464

 Score = 72.1 bits (169), Expect = 3e-11
 Identities = 65/259 (25%), Positives = 116/259 (44%), Gaps = 21/259 (8%)

Query: 21  LPQYLKDLGYKTHLVGKWHLGSYKKEY-LPL--NRGFDSHVGFWTGRIDMYDHTTMEQGS 77
           + ++ ++ GY T   GKWH+G  +     P     G D HV         Y+    +   
Sbjct: 128 MARFFQEAGYATGHFGKWHMGGGRDVTGAPTFDQYGIDEHVS-------TYESPEPDPAI 180

Query: 78  WGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIR 137
             T++    + +   +    T  + D+ +  +  H K  P F+ L    VH+  P+ P R
Sbjct: 181 TATNWIWSDQDSIKRWD--RTKYFVDKTLDFMKRH-KGTPCFVNLWPDDVHT--PWVP-R 234

Query: 138 APQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAA 197
           +  +    F  +D    + F  VL   D  +G+++  L   GL EN+I++F++DN GPA 
Sbjct: 235 SGDEFNGKFP-MDPQEEEAFKGVLKTYDVQIGRLLDGLQELGLAENTIIIFTSDN-GPAP 292

Query: 198 GFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWLPTLYSAA 256
            F  +    +  +G K +L+EGG+R    + W     +       +++ +D LP+L   +
Sbjct: 293 SFRGSRTGGF--RGAKASLYEGGIRMPFIISWPGHTPAGKTDDRSELNATDLLPSLAKLS 350

Query: 257 GGDLSVLENLDGVNQWDAL 275
           G  L      DG+++ D L
Sbjct: 351 GVKLPDSYAGDGIDRSDLL 369


>UniRef50_A6DJL2 Cluster: Putative exported uslfatase; n=1;
           Lentisphaera araneosa HTCC2155|Rep: Putative exported
           uslfatase - Lentisphaera araneosa HTCC2155
          Length = 493

 Score = 72.1 bits (169), Expect = 3e-11
 Identities = 62/242 (25%), Positives = 112/242 (46%), Gaps = 22/242 (9%)

Query: 30  YKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFW-TGRIDMYDHTTMEQGSWGTDFRRGFEV 88
           Y T  +GKWH+   + E      GFD H G+   G  + Y+         G D ++    
Sbjct: 130 YATAHLGKWHVPKLQPEVA----GFDVHDGYTGNGGGEYYE------AHKGKDKKK--LP 177

Query: 89  AHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHSG--NPYEPIRAPQKLIDA 145
             D   +Y     ++ A   +    K++ P +L ++H AVH    +  + +   +K +  
Sbjct: 178 PEDPKQIYTI---SERACDFIAQQAKAKKPFYLQISHYAVHVSLQSRAKTLERTKKRLAT 234

Query: 146 FKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAAS 205
                      FAA++  LD  VG ++  +  +G+ +N+ ++F++DNGG    + + +  
Sbjct: 235 THPKLHQRTIDFAAMVEDLDIGVGMILDEVEKQGIKDNTYIIFTSDNGG--FSYANTSGQ 292

Query: 206 NYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLEN 265
           N PLKG K  L+EGG+R    +  P + +      Q +   D+LPT Y   GG  ++ ++
Sbjct: 293 NTPLKGGKRWLYEGGIRVPFVIQGPKIKA-GTYCNQPIINWDFLPTFYDLVGGTEALSQD 351

Query: 266 LD 267
           L+
Sbjct: 352 LE 353


>UniRef50_A6DJE5 Cluster: Sulfatase 1; n=1; Lentisphaera araneosa
           HTCC2155|Rep: Sulfatase 1 - Lentisphaera araneosa
           HTCC2155
          Length = 490

 Score = 72.1 bits (169), Expect = 3e-11
 Identities = 51/181 (28%), Positives = 95/181 (52%), Gaps = 15/181 (8%)

Query: 102 TDEAIKVVNSHNKS-EPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYID-DSARQKFAA 159
           T  ++  +N+  K+ +P FLM++H AVH  +      A ++ I  ++  D D    ++AA
Sbjct: 198 TKSSVDFINTQAKANKPFFLMVSHYAVHVKHA-----ALEETIKKYQIGDVDYKDARYAA 252

Query: 160 VLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEG 219
           ++  LD+S+G ++KAL   G+ +N+ V+F++DNGG   G       N  L+G K  + EG
Sbjct: 253 LIEHLDDSLGAMLKALDDNGIADNTYVIFTSDNGGGHGG-------NPSLQGGKAKMMEG 305

Query: 220 GVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNT 279
           G+R    +  P + + ++     +   D+L TL+  +G    + +++DG +  D      
Sbjct: 306 GLRVPTVVRGPGIPADSQCDVPIVQY-DFLATLHELSGNPNPLPDDIDGGSLVDVFRNGN 364

Query: 280 E 280
           E
Sbjct: 365 E 365


>UniRef50_A6DIC6 Cluster: Arylsulfatase; n=1; Lentisphaera araneosa
           HTCC2155|Rep: Arylsulfatase - Lentisphaera araneosa
           HTCC2155
          Length = 528

 Score = 72.1 bits (169), Expect = 3e-11
 Identities = 79/282 (28%), Positives = 127/282 (45%), Gaps = 25/282 (8%)

Query: 18  EKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFD-SHVGFWTGRIDMYDHTTMEQG 76
           E  L +  KD  Y T L GKWHLG     Y  +++GFD S +    G     DH    + 
Sbjct: 74  EYTLAEAFKDNQYSTGLFGKWHLGDC-YPYRAMDQGFDYSLIHRGGGLGQPADHPENNRA 132

Query: 77  SWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSH-NKSEPLFLMLAHSAVHSGNPYEP 135
              +   R  EVA    G + TDV+  EA K ++     ++P F  +  +A HS  P+  
Sbjct: 133 YTNSMLYRN-EVAFRSEG-FCTDVFFREARKWISEKVENNKPFFACIMPNAPHS--PFHD 188

Query: 136 IRAPQKLIDAFKYID-----DSARQKFAAV---LSKLDESVGKVVKALHTRGLLENSIVV 187
           +  P  L+  +K  D      S + K AA+   +  +D+++  +   L    + + +I++
Sbjct: 189 V--PADLLKKYKNADWSQHKGSDKDKVAAIYAMVENIDQNIADLRDELKKLNIDKKTIIL 246

Query: 188 FSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISD 247
           FS+DNGG    F+        L+G K++ +EGGV     L+ P   SK   + + +   D
Sbjct: 247 FSSDNGGWGERFDAG------LRGSKSSSFEGGVLSPLMLFVPGQASKQ--STEAIAHYD 298

Query: 248 WLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHN 289
            LPTL       +     LDG +    LS  +   R+ +L +
Sbjct: 299 VLPTLVDLCDLKVDFPNELDGRSFLPILSGESLPERSIILQS 340


>UniRef50_A6C4Q6 Cluster: Arylsulfatase; n=1; Planctomyces maris DSM
           8797|Rep: Arylsulfatase - Planctomyces maris DSM 8797
          Length = 574

 Score = 72.1 bits (169), Expect = 3e-11
 Identities = 76/286 (26%), Positives = 128/286 (44%), Gaps = 22/286 (7%)

Query: 8   GAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDM 67
           GA+ +G    E  + + L+  GY+T + GKWHLG       P ++GF   +   +G I  
Sbjct: 107 GAKMQG---EEVTVAELLQQAGYQTGIFGKWHLGD-NYPMRPQDQGFAESLIHKSGGIGQ 162

Query: 68  YDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSA 126
              +  +  S+         VA    G Y TDV+ D A+  ++   K+E P F+ LA +A
Sbjct: 163 ---SPDQPNSYFHPKLWKNGVAFQSTG-YCTDVFFDAALDFIDRQTKTEKPFFVYLATNA 218

Query: 127 VHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIV 186
            H+  P E   +  K     + +D++  + +  +++ LDE++GK++  L    L E ++V
Sbjct: 219 PHT--PLEIAESYWKPYQR-QGLDETTARVYG-MITNLDENIGKLLSHLERSALAEKTVV 274

Query: 187 VFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHI 245
           +F  DNG     +         L+G K+  +EGG+R      W        ++     HI
Sbjct: 275 LFLGDNGPQQKRYTGG------LRGRKSWTYEGGIRVPCLAQWPGHFREGEKIDQIAAHI 328

Query: 246 SDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTES-PRTSVLHNI 290
            D +PTL +           LDGV+    L+   E  P  S+   +
Sbjct: 329 -DLMPTLLALTETRCPESLKLDGVDLSPLLTGRKEKLPARSLFFQV 373


>UniRef50_A4AR92 Cluster: N-acetylgalactosamine-6-sulfatase; n=1;
           Flavobacteriales bacterium HTCC2170|Rep:
           N-acetylgalactosamine-6-sulfatase - Flavobacteriales
           bacterium HTCC2170
          Length = 479

 Score = 72.1 bits (169), Expect = 3e-11
 Identities = 83/297 (27%), Positives = 121/297 (40%), Gaps = 39/297 (13%)

Query: 13  GLPLNEKI-LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRG----FDSHVGFWT----- 62
           G  L E+I LP+ LK  GY T   GKWHLG+  K+ L  NRG    FD+H    T     
Sbjct: 104 GHMLPEEITLPELLKGQGYATGHFGKWHLGTLTKDTLDANRGGREKFDAHYSLPTEHGYD 163

Query: 63  ------GRIDMYDHTTM-EQGSWGTDFRRGFEVAHDLFGV--YATDVYTDEAIKVVNSHN 113
                  ++  YD     E    G   R G+       G   Y T  +T E  KV  +  
Sbjct: 164 EFFSTESKVPTYDPMIYPENFDEGESLRYGWRSVESNEGTKPYGTAYWTGENQKVTTNIE 223

Query: 114 KSEPLFLM-----LAHSAVHSGNPYEP---IRAPQKLI---DAFK--YID-DSARQKFAA 159
                 +M         A+    P+     +  P   +    A +  Y D D  +Q +  
Sbjct: 224 GDNSRVIMDRVLPFIDRAITEEKPFFSTLWLHTPHLPVVSDSAHRSLYPDLDLQQQIYNG 283

Query: 160 VLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEG 219
            L+ +DE +G++   L    + EN+I+ F +DNG      ND   S    +  K +L+EG
Sbjct: 284 TLTAMDEQIGRLWSKLEALDIQENTIIFFCSDNGPE----NDTPGSAGVFRERKRSLYEG 339

Query: 220 GVRGAGFL-WSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDAL 275
           GVR   F+ W   +    R +Y     SD+LPTL             +DG + W+ +
Sbjct: 340 GVRVPAFMVWKNHVTGGQR-SYFPSVTSDYLPTLLDILNITYPDNRPVDGESLWEVV 395


>UniRef50_Q612A1 Cluster: Putative uncharacterized protein CBG16830;
           n=1; Caenorhabditis briggsae|Rep: Putative
           uncharacterized protein CBG16830 - Caenorhabditis
           briggsae
          Length = 268

 Score = 72.1 bits (169), Expect = 3e-11
 Identities = 32/70 (45%), Positives = 43/70 (61%)

Query: 2   QHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFW 61
           Q GV    EP G+P     L + ++ L Y T+LVGKWHLG  KKE+LP NRGFD   GF+
Sbjct: 31  QAGVFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYGFY 90

Query: 62  TGRIDMYDHT 71
             +   ++H+
Sbjct: 91  GPQTGYFNHS 100



 Score = 67.3 bits (157), Expect = 8e-10
 Identities = 45/126 (35%), Positives = 66/126 (52%), Gaps = 10/126 (7%)

Query: 187 VFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHIS 246
           V+ST NGG +    +  ASN PL+G K+T+WEGG +   F+ SP+   +        H+ 
Sbjct: 135 VYST-NGGTS----NFGASNAPLRGEKDTIWEGGTKTTTFVHSPMYVEEGGNREMMFHVV 189

Query: 247 DWLPTLYSAAGGDLSVLENLDGVNQWDALSKN-TESPRTSVLHNIDDIWGIAALTVDKYK 305
           DW  T+ S  G  L V    DG+NQW+ +  N  +  R   ++NI D    +A+    YK
Sbjct: 190 DWHATILSITG--LEVDSYGDGINQWEYIRTNRPKFRRFQFVYNIADHG--SAIRDGDYK 245

Query: 306 LIKGTI 311
           LI G +
Sbjct: 246 LIVGNV 251


  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284
  
Lambda     K      H
   0.317    0.136    0.425 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 543,781,209
Number of Sequences: 1657284
Number of extensions: 23536543
Number of successful extensions: 49405
Number of sequences better than 10.0: 500
Number of HSP's better than 10.0 without gapping: 569
Number of HSP's successfully gapped in prelim test: 153
Number of HSP's that attempted gapping in prelim test: 47434
Number of HSP's gapped (non-prelim): 1328
length of query: 455
length of database: 575,637,011
effective HSP length: 103
effective length of query: 352
effective length of database: 404,936,759
effective search space: 142537739168
effective search space used: 142537739168
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.6 bits)
S2: 74 (33.9 bits)
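
Note on the statistics above: the trailer values appear to be related by the usual Karlin-Altschul arithmetic. The sketch below recomputes the effective search space, and the bit score and Expect value for a given raw score, from the numbers printed in this report. It assumes the standard relations (bit score = (Lambda*S - ln K) / ln 2, E = search space * 2^-bits) and the gapped Lambda and K. The variable and function names are illustrative, not part of BLAST. Because Lambda and K are printed rounded, the results should only be expected to agree with the report to the displayed precision.

```python
import math

# Values copied from the report trailer above.
query_len = 455            # "length of query"
db_letters = 575_637_011   # "length of database"
db_seqs = 1_657_284        # "Number of Sequences"
hsp_len = 103              # "effective HSP length"
lam, K = 0.279, 0.0580     # gapped Lambda and K (rounded as printed)

# Effective lengths: subtract the HSP length once from the query
# and once per database sequence.
eff_query = query_len - hsp_len
eff_db = db_letters - db_seqs * hsp_len
search_space = eff_query * eff_db


def bit_score(raw):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw - math.log(K)) / math.log(2)


def expect(raw):
    """Expected number of chance hits scoring at least `raw`."""
    return search_space * 2.0 ** (-bit_score(raw))


print(eff_query, eff_db, search_space)
for raw in (171, 169, 157):  # raw scores shown in parentheses above
    print(raw, round(bit_score(raw), 1), f"{expect(raw):.0e}")
```

The three raw scores in the loop are the ones listed in parentheses next to the bit scores in the alignments above (for example "72.1 bits (169)").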
