BLASTP 2.2.12 [Aug-07-2005]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= BGIBMGA001098-TA|BGIBMGA001098-PA|IPR000917|Sulfatase
(455 letters)
Database: uniref50
1,657,284 sequences; 575,637,011 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
UniRef50_Q8SZ72 Cluster: RE14504p; n=9; Eumetazoa|Rep: RE14504p ... 502 e-141
UniRef50_A7RFN2 Cluster: Predicted protein; n=2; Nematostella ve... 360 4e-98
UniRef50_UPI0000E49A98 Cluster: PREDICTED: similar to ENSANGP000... 355 1e-96
UniRef50_Q9VVM4 Cluster: CG7402-PA; n=3; Diptera|Rep: CG7402-PA ... 355 2e-96
UniRef50_UPI0000D55F5E Cluster: PREDICTED: similar to CG8646-PA;... 352 1e-95
UniRef50_Q17B03 Cluster: Arylsulfatase b; n=3; Culicidae|Rep: Ar... 350 4e-95
UniRef50_UPI00015B5C4D Cluster: PREDICTED: similar to ENSANGP000... 347 3e-94
UniRef50_UPI0000D55D4D Cluster: PREDICTED: similar to CG8646-PA;... 330 6e-89
UniRef50_UPI00015B40BD Cluster: PREDICTED: similar to RE14504p; ... 323 7e-87
UniRef50_Q8IQS4 Cluster: CG32191-PA; n=2; Sophophora|Rep: CG3219... 317 4e-85
UniRef50_UPI0000DB708B Cluster: PREDICTED: similar to CG7402-PA ... 316 1e-84
UniRef50_UPI0000D56522 Cluster: PREDICTED: similar to CG7402-PA;... 311 2e-83
UniRef50_Q9VVM1 Cluster: CG7408-PB; n=2; Drosophila melanogaster... 310 4e-83
UniRef50_UPI00015B51A4 Cluster: PREDICTED: similar to arylsulfat... 309 9e-83
UniRef50_UPI0000DB708D Cluster: PREDICTED: similar to CG8646-PA;... 305 1e-81
UniRef50_UPI0000D56521 Cluster: PREDICTED: similar to CG7402-PA;... 303 8e-81
UniRef50_A7SBG5 Cluster: Predicted protein; n=1; Nematostella ve... 293 5e-78
UniRef50_Q8MPH9 Cluster: Glucosinolate sulphatase; n=3; Plutella... 288 2e-76
UniRef50_P50429 Cluster: Arylsulfatase B precursor; n=17; Eumeta... 266 6e-70
UniRef50_Q5FYB0 Cluster: Arylsulfatase J precursor; n=69; Eumeta... 259 9e-68
UniRef50_Q9NJU8 Cluster: Sulfatase 1; n=3; Coelomata|Rep: Sulfat... 258 2e-67
UniRef50_A7SRP2 Cluster: Predicted protein; n=2; Nematostella ve... 257 5e-67
UniRef50_UPI0000586CBA Cluster: PREDICTED: similar to arylsulfat... 253 6e-66
UniRef50_UPI0000E46777 Cluster: PREDICTED: similar to arylsulfat... 223 1e-56
UniRef50_A7S8Q2 Cluster: Predicted protein; n=2; Nematostella ve... 223 1e-56
UniRef50_A7SPY2 Cluster: Predicted protein; n=4; Eumetazoa|Rep: ... 214 5e-54
UniRef50_A3ZMN6 Cluster: Arylsulfatase B; n=1; Blastopirellula m... 197 4e-49
UniRef50_A7IPG5 Cluster: Sulfatase precursor; n=1; Xanthobacter ... 186 1e-45
UniRef50_Q18837 Cluster: Sulfatase domain protein protein 3, iso... 179 1e-43
UniRef50_UPI0000587D99 Cluster: PREDICTED: similar to arylsulfat... 163 7e-39
UniRef50_UPI0000E48607 Cluster: PREDICTED: similar to arylsulfat... 163 1e-38
UniRef50_UPI0000F20AE2 Cluster: PREDICTED: similar to Arylsulfat... 156 1e-36
UniRef50_A6DKP3 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 150 9e-35
UniRef50_A6DSH3 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 149 1e-34
UniRef50_UPI0000660330 Cluster: Arylsulfatase I precursor (EC 3.... 149 2e-34
UniRef50_A3HWU7 Cluster: N-acetylgalactosamine 6-sulfatase; n=2;... 143 1e-32
UniRef50_Q4SNM7 Cluster: Chromosome 15 SCAF14542, whole genome s... 142 2e-32
UniRef50_Q15XG7 Cluster: Sulfatase precursor; n=2; Bacteria|Rep:... 141 3e-32
UniRef50_A6DKC9 Cluster: Sulfatase; n=1; Lentisphaera araneosa H... 138 2e-31
UniRef50_Q7UGD7 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 136 1e-30
UniRef50_A4CMB0 Cluster: Arylsulfatase A; n=5; Bacteria|Rep: Ary... 134 6e-30
UniRef50_A6DLE2 Cluster: Sulfatase; n=1; Lentisphaera araneosa H... 133 8e-30
UniRef50_Q7UYW3 Cluster: Arylsulfatase B; n=1; Pirellula sp.|Rep... 132 2e-29
UniRef50_A0IXQ0 Cluster: Sulfatase; n=1; Shewanella woodyi ATCC ... 132 2e-29
UniRef50_A6CAY0 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 130 1e-28
UniRef50_A3XJJ9 Cluster: Arylsulfatase B; n=1; Leeuwenhoekiella ... 130 1e-28
UniRef50_A0Z632 Cluster: Arylsulfatase B; n=1; marine gamma prot... 130 1e-28
UniRef50_A3ZLN5 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 129 2e-28
UniRef50_A0JAA8 Cluster: Sulfatase precursor; n=1; Shewanella wo... 128 4e-28
UniRef50_Q8A219 Cluster: Arylsulfatase B; n=2; Bacteroides|Rep: ... 127 7e-28
UniRef50_A0Z9E1 Cluster: Sulfatase family protein; n=3; Proteoba... 127 7e-28
UniRef50_Q1VP00 Cluster: Arylsulfatase B; n=1; Psychroflexus tor... 126 1e-27
UniRef50_A6DKB8 Cluster: N-acetylgalactosamine 6-sulfatase; n=3;... 125 3e-27
UniRef50_UPI0000E4801A Cluster: PREDICTED: similar to sulfatase ... 124 6e-27
UniRef50_A4XED5 Cluster: Sulfatase precursor; n=1; Novosphingobi... 123 9e-27
UniRef50_A5ZER6 Cluster: Putative uncharacterized protein; n=1; ... 122 1e-26
UniRef50_A6LED1 Cluster: Arylsulfatase A; n=1; Parabacteroides d... 121 3e-26
UniRef50_A4GJF1 Cluster: Sulfatase; n=1; uncultured marine bacte... 121 5e-26
UniRef50_A6CAW6 Cluster: N-acetylgalactosamine-4-sulfatase; n=1;... 120 8e-26
UniRef50_A6C4L0 Cluster: N-acetylgalactosamine-6-sulfate sulfata... 120 8e-26
UniRef50_A6DKD8 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 118 2e-25
UniRef50_A6CD52 Cluster: Twin-arginine translocation pathway sig... 118 3e-25
UniRef50_A6C4W7 Cluster: Twin-arginine translocation pathway sig... 118 4e-25
UniRef50_Q7UX97 Cluster: Arylsulfatase B [Precursor]; n=1; Pirel... 117 6e-25
UniRef50_A0HG49 Cluster: Sulfatase; n=6; Comamonadaceae|Rep: Sul... 117 6e-25
UniRef50_Q7UGB4 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 117 7e-25
UniRef50_Q3M597 Cluster: Twin-arginine translocation pathway sig... 116 1e-24
UniRef50_Q15XH3 Cluster: Sulfatase precursor; n=1; Pseudoalterom... 116 1e-24
UniRef50_Q1YSH0 Cluster: Sulfatase family protein; n=4; cellular... 116 2e-24
UniRef50_Q02AN8 Cluster: Sulfatase precursor; n=1; Solibacter us... 116 2e-24
UniRef50_A6C383 Cluster: Sulfatase; n=1; Planctomyces maris DSM ... 115 2e-24
UniRef50_A6DG78 Cluster: Sulfatase; n=1; Lentisphaera araneosa H... 115 3e-24
UniRef50_A4CMB1 Cluster: Arylsulphatase A; n=3; Bacteria|Rep: Ar... 114 4e-24
UniRef50_A6DKP2 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 113 9e-24
UniRef50_Q7UHJ9 Cluster: Iduronate-sulfatase or arylsulfatase A;... 113 1e-23
UniRef50_Q2GB51 Cluster: Sulfatase; n=2; Proteobacteria|Rep: Sul... 113 1e-23
UniRef50_A6DMV0 Cluster: N-acetylgalactosamine-6-sulfate sulfata... 112 2e-23
UniRef50_A4CGL5 Cluster: Arylsulfatase A; n=4; Bacteria|Rep: Ary... 111 4e-23
UniRef50_A3I0L2 Cluster: Arylsulfatase A; n=2; Bacteroidetes|Rep... 111 4e-23
UniRef50_A0YAF7 Cluster: Arylsulfatase A; n=1; marine gamma prot... 111 4e-23
UniRef50_A6LDP6 Cluster: Arylsulfatase A; n=1; Parabacteroides d... 110 6e-23
UniRef50_A6KZI6 Cluster: Sulfatase; n=2; Bacteroides|Rep: Sulfat... 110 6e-23
UniRef50_Q7UKJ5 Cluster: Arylsulfatase A; n=3; Bacteria|Rep: Ary... 108 3e-22
UniRef50_A6DPC8 Cluster: Arylsulfatase A; n=1; Lentisphaera aran... 108 3e-22
UniRef50_A6DSP6 Cluster: Sulfatase; n=1; Lentisphaera araneosa H... 107 5e-22
UniRef50_A5FAW4 Cluster: Sulfatase precursor; n=1; Flavobacteriu... 107 5e-22
UniRef50_A3HZ22 Cluster: Putative exported uslfatase; n=1; Algor... 107 6e-22
UniRef50_P34059 Cluster: N-acetylgalactosamine-6-sulfatase precu... 107 6e-22
UniRef50_A3ZUT0 Cluster: Arylsulphatase A; n=1; Blastopirellula ... 107 8e-22
UniRef50_A6DR20 Cluster: N-acetyl-galactosamine-6-sulfatase; n=1... 106 1e-21
UniRef50_A6DGX5 Cluster: N-acetylgalactosamine 6-sulfatase; n=2;... 106 1e-21
UniRef50_Q7UMZ5 Cluster: N-acetylgalactosamine-6-sulfate sulfata... 105 2e-21
UniRef50_A7AKS6 Cluster: Putative uncharacterized protein; n=1; ... 105 2e-21
UniRef50_A6DMX7 Cluster: N-acetyl-galactosamine-6-sulfatase; n=1... 105 2e-21
UniRef50_A6CGG6 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 105 2e-21
UniRef50_A0Z718 Cluster: Twin-arginine translocation pathway sig... 105 2e-21
UniRef50_Q482D6 Cluster: Sulfatase family protein; n=2; Bacteria... 105 3e-21
UniRef50_Q7UPK7 Cluster: Arylsulphatase A; n=1; Pirellula sp.|Re... 104 4e-21
UniRef50_A6C3C8 Cluster: Putative uncharacterized protein; n=1; ... 104 6e-21
UniRef50_Q15XI1 Cluster: Sulfatase; n=2; Bacteria|Rep: Sulfatase... 103 7e-21
UniRef50_A6DF72 Cluster: Putative secreted sulfatase ydeN; n=1; ... 103 7e-21
UniRef50_Q47Q78 Cluster: N-acetylgalactosamine-6-sulfate sulfata... 102 2e-20
UniRef50_A6P2X1 Cluster: Putative uncharacterized protein; n=1; ... 102 2e-20
UniRef50_A6DMY9 Cluster: Putative uncharacterized protein; n=2; ... 102 2e-20
UniRef50_A6C8S3 Cluster: Arylsulphatase A; n=1; Planctomyces mar... 102 2e-20
UniRef50_A6LHS9 Cluster: Arylsulfatase; n=4; Bacteroidetes|Rep: ... 102 2e-20
UniRef50_A6C1V3 Cluster: Putative secreted sulfatase ydeN; n=1; ... 102 2e-20
UniRef50_A6DHS2 Cluster: N-acetylgalactosamine-6-sulfate sulfata... 101 3e-20
UniRef50_A6C4V9 Cluster: Sulfatase; n=1; Planctomyces maris DSM ... 101 5e-20
UniRef50_A6CBI6 Cluster: Putative uncharacterized protein; n=1; ... 100 9e-20
UniRef50_Q1GWE7 Cluster: Sulfatase precursor; n=4; Alphaproteoba... 99 1e-19
UniRef50_Q3JD43 Cluster: Sulfatase; n=1; Nitrosococcus oceani AT... 100 2e-19
UniRef50_Q0C069 Cluster: Sulfatase family protein; n=2; Bacteria... 100 2e-19
UniRef50_A6DSG6 Cluster: Arylsulfatase A; n=1; Lentisphaera aran... 100 2e-19
UniRef50_A6DNJ0 Cluster: Sulfatase; n=1; Lentisphaera araneosa H... 100 2e-19
UniRef50_A6DKM2 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 100 2e-19
UniRef50_Q7UJ66 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 99 3e-19
UniRef50_Q7UYD6 Cluster: N-acetyl-galactosamine-6-sulfatase; n=3... 97 6e-19
UniRef50_A6LCL3 Cluster: Arylsulfatase A; n=1; Parabacteroides d... 97 8e-19
UniRef50_A4A2W0 Cluster: Arylsulfatase A; n=1; Blastopirellula m... 97 8e-19
UniRef50_Q01N83 Cluster: Sulfatase precursor; n=1; Solibacter us... 97 1e-18
UniRef50_Q7UZ43 Cluster: N-acetylgalactosamine-4-sulfatase; n=1;... 96 1e-18
UniRef50_Q7US96 Cluster: Arylsulphatase A; n=1; Pirellula sp.|Re... 96 1e-18
UniRef50_Q7URY7 Cluster: Aryl-sulphate sulphohydrolase; n=1; Pir... 96 1e-18
UniRef50_Q7ULE7 Cluster: Iduronate-sulfatase and sulfatase 1; n=... 95 3e-18
UniRef50_Q64P90 Cluster: Putative secreted sulfatase ydeN; n=2; ... 95 3e-18
UniRef50_A6LIX6 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 95 3e-18
UniRef50_A0JAV8 Cluster: Sulfatase precursor; n=1; Shewanella wo... 95 3e-18
UniRef50_Q7UL93 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 94 6e-18
UniRef50_A3XZF1 Cluster: Sulfatase family protein; n=5; Proteoba... 94 6e-18
UniRef50_A6DS95 Cluster: Arylsulfatase A; n=2; Lentisphaera aran... 94 8e-18
UniRef50_A3J5W3 Cluster: Putative arylsulfatase; n=1; Flavobacte... 94 8e-18
UniRef50_Q7UHK0 Cluster: Arylsulphatase A; n=1; Pirellula sp.|Re... 93 1e-17
UniRef50_A6KWS8 Cluster: Arylsulfatase; n=1; Bacteroides vulgatu... 93 1e-17
UniRef50_A6DSM5 Cluster: Arylsulfatase A; n=1; Lentisphaera aran... 93 1e-17
UniRef50_A6DGL0 Cluster: Arylsulfatase A; n=1; Lentisphaera aran... 93 1e-17
UniRef50_A6C4Q9 Cluster: Arylsulphatase A; n=1; Planctomyces mar... 93 1e-17
UniRef50_Q7URW3 Cluster: N-acetylgalactosamine-4-sulfatase; n=1;... 93 2e-17
UniRef50_Q7UGB8 Cluster: Arylsulfatase homolog b1498; n=1; Pirel... 92 2e-17
UniRef50_A6DQ01 Cluster: N-acetylgalactosamine-4-sulfatase; n=2;... 92 2e-17
UniRef50_A6C4W8 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 92 2e-17
UniRef50_Q7UWW9 Cluster: Arylsulfatase; n=2; Planctomycetaceae|R... 92 3e-17
UniRef50_Q7UTH7 Cluster: Arylsulfatase A; n=5; Bacteria|Rep: Ary... 91 6e-17
UniRef50_Q64WT3 Cluster: N-acetylgalactosamine-6-sulfatase; n=5;... 91 6e-17
UniRef50_A6DFR6 Cluster: N-acetylgalactosamine-4-sulfatase; n=1;... 91 6e-17
UniRef50_A4AAM5 Cluster: Sulfatase; n=1; Congregibacter litorali... 91 6e-17
UniRef50_A6BZT7 Cluster: Putative arylsulfatase; n=1; Planctomyc... 91 7e-17
UniRef50_Q7UX95 Cluster: Arylsulfatase; n=3; Planctomycetaceae|R... 90 1e-16
UniRef50_A6LED2 Cluster: Arylsulfatase A; n=1; Parabacteroides d... 90 1e-16
UniRef50_A4AQQ7 Cluster: N-acetylgalactosamine 6-sulfatase; n=4;... 90 1e-16
UniRef50_A6DHI0 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 89 2e-16
UniRef50_UPI00005887B4 Cluster: PREDICTED: similar to galactosam... 89 2e-16
UniRef50_Q7UG72 Cluster: Arylsulfatase A [precursor]; n=1; Pirel... 89 2e-16
UniRef50_A6PEH5 Cluster: Sulfatase precursor; n=1; Shewanella se... 89 2e-16
UniRef50_UPI00005846A1 Cluster: PREDICTED: similar to arylsulfat... 89 3e-16
UniRef50_Q7UYH3 Cluster: Arylsulfatase; n=1; Pirellula sp.|Rep: ... 89 3e-16
UniRef50_A6DGD3 Cluster: Putative exported uslfatase; n=3; Bacte... 89 3e-16
UniRef50_A6C6V5 Cluster: N-acetylgalactosamine-6-sulfatase; n=1;... 89 3e-16
UniRef50_Q7UMZ6 Cluster: Arylsulfatase A; n=1; Pirellula sp.|Rep... 88 5e-16
UniRef50_Q7UER7 Cluster: Sulfatase 1; n=6; Bacteria|Rep: Sulfata... 88 5e-16
UniRef50_A6DSG4 Cluster: Arylsulphatase A; n=1; Lentisphaera ara... 88 5e-16
UniRef50_A6DID9 Cluster: Putative sulfatase protein; n=1; Lentis... 88 5e-16
UniRef50_A6CAR8 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 87 7e-16
UniRef50_A3I2G9 Cluster: Putative secreted sulfatase; n=1; Algor... 87 7e-16
UniRef50_A6KZ75 Cluster: Putative secreted sulfatase; n=1; Bacte... 87 9e-16
UniRef50_Q9NJU7 Cluster: Sulfatase 2; n=1; Helix pomatia|Rep: Su... 87 9e-16
UniRef50_Q7UPG6 Cluster: Arylsulphatase A; n=2; Bacteria|Rep: Ar... 87 1e-15
UniRef50_Q8A222 Cluster: N-acetylgalactosamine-6-sulfatase; n=1;... 86 2e-15
UniRef50_Q7UQ05 Cluster: Arylsulfatase A; n=2; Planctomycetaceae... 86 2e-15
UniRef50_A6DTN4 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 86 2e-15
UniRef50_A6DSF1 Cluster: N-acetyl-galactosamine-6-sulfatase; n=1... 86 2e-15
UniRef50_A6DHI1 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 86 2e-15
UniRef50_Q5DYT4 Cluster: Arylsulfatase; n=10; Gammaproteobacteri... 85 3e-15
UniRef50_A6DKN7 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 85 3e-15
UniRef50_A4GIB1 Cluster: Arylsulfatase; n=1; uncultured marine b... 85 4e-15
UniRef50_A0JAV3 Cluster: Sulfatase precursor; n=1; Shewanella wo... 85 4e-15
UniRef50_A6CA27 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 85 5e-15
UniRef50_A6C8R8 Cluster: Arylsulfatase A; n=1; Planctomyces mari... 85 5e-15
UniRef50_A6CEC4 Cluster: Aryl-sulphate sulphohydrolase; n=1; Pla... 84 6e-15
UniRef50_A4B5Y4 Cluster: Iduronate-sulfatase and sulfatase 1; n=... 84 6e-15
UniRef50_Q0BZE9 Cluster: Sulfatase family protein; n=1; Hyphomon... 83 1e-14
UniRef50_Q4RJR3 Cluster: Chromosome 13 SCAF15035, whole genome s... 83 1e-14
UniRef50_A6DM48 Cluster: Arylsulfatase A; n=1; Lentisphaera aran... 83 1e-14
UniRef50_A6DG54 Cluster: Arylsulphatase A; n=1; Lentisphaera ara... 83 1e-14
UniRef50_A6DHY0 Cluster: N-acetylgalactosamine 6-sulfatase; n=2;... 83 2e-14
UniRef50_A4W906 Cluster: Sulfatase precursor; n=10; Enterobacter... 83 2e-14
UniRef50_Q7UYW2 Cluster: Arylsulfatase; n=2; Planctomycetaceae|R... 82 3e-14
UniRef50_A6DMX9 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 82 3e-14
UniRef50_A6DMX6 Cluster: Arylsulphatase A; n=1; Lentisphaera ara... 82 3e-14
UniRef50_P77318 Cluster: Uncharacterized sulfatase ydeN precurso... 82 3e-14
UniRef50_A6DFN4 Cluster: Arylsulfatase; n=1; Lentisphaera araneo... 82 3e-14
UniRef50_Q7UYA5 Cluster: Arylsulfatase; n=1; Pirellula sp.|Rep: ... 81 6e-14
UniRef50_Q7UN55 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 81 8e-14
UniRef50_A6C176 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 80 1e-13
UniRef50_Q1MJX8 Cluster: Putative arylsulfatase precursor; n=1; ... 80 1e-13
UniRef50_A6DQW6 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 80 1e-13
UniRef50_Q7UYA9 Cluster: N-acetylgalactosamine-6-sulfatase; n=1;... 79 2e-13
UniRef50_A4AVA7 Cluster: Aryl-sulphate sulphohydrolase; n=2; Bac... 79 2e-13
UniRef50_A6DHI2 Cluster: Aryl-sulphate sulphohydrolase; n=2; Len... 79 2e-13
UniRef50_A4ANR8 Cluster: Arylsulfatase; n=2; Bacteroidetes|Rep: ... 79 2e-13
UniRef50_A3ZY29 Cluster: Aryl-sulphate sulphohydrolase; n=1; Bla... 79 2e-13
UniRef50_A3XSU6 Cluster: Sulfatase family protein; n=2; Vibrio|R... 79 2e-13
UniRef50_Q8A171 Cluster: Putative secreted sulfatase ydeN; n=10;... 79 3e-13
UniRef50_Q15XP0 Cluster: Sulfatase precursor; n=1; Pseudoalterom... 79 3e-13
UniRef50_A6DMW2 Cluster: Putative exported uslfatase; n=1; Lenti... 79 3e-13
UniRef50_P25549 Cluster: Arylsulfatase precursor; n=12; Proteoba... 79 3e-13
UniRef50_Q7UYD2 Cluster: Sulfatase 1; n=2; Bacteria|Rep: Sulfata... 78 4e-13
UniRef50_A6DNI1 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 78 4e-13
UniRef50_A6DMU3 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 78 6e-13
UniRef50_UPI0000586CBD Cluster: PREDICTED: similar to MGC86251 p... 77 7e-13
UniRef50_Q8D7K3 Cluster: Arylsulfatase A; n=16; Bacteria|Rep: Ar... 77 7e-13
UniRef50_A0IXQ1 Cluster: Sulfatase precursor; n=1; Shewanella wo... 77 7e-13
UniRef50_Q7UYA6 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 77 1e-12
UniRef50_A6DU75 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 77 1e-12
UniRef50_A6DI17 Cluster: N-acetylgalactosamine-4-sulfatase; n=1;... 77 1e-12
UniRef50_A3ZLD4 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 77 1e-12
UniRef50_Q7UIU1 Cluster: Arylsulfatase A; n=1; Pirellula sp.|Rep... 77 1e-12
UniRef50_A6BYR0 Cluster: N-acetyl-galactosamine-6-sulfatase; n=1... 77 1e-12
UniRef50_P50473 Cluster: Arylsulfatase precursor; n=7; Echinoida... 77 1e-12
UniRef50_UPI0000E0F7DD Cluster: aryl-sulphate sulphohydrolase; n... 76 2e-12
UniRef50_UPI00006A2B15 Cluster: UPI00006A2B15 related cluster; n... 76 2e-12
UniRef50_Q4SI19 Cluster: Chromosome 5 SCAF14581, whole genome sh... 76 2e-12
UniRef50_A6DHW4 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 76 2e-12
UniRef50_A6DG52 Cluster: Arylsulphatase A; n=1; Lentisphaera ara... 76 2e-12
UniRef50_A3ZVD1 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 76 2e-12
UniRef50_Q15SA2 Cluster: Sulfatase precursor; n=1; Pseudoalterom... 75 3e-12
UniRef50_A6DID7 Cluster: Aryl-sulphate sulphohydrolase; n=2; Len... 75 4e-12
UniRef50_A6C430 Cluster: Arylsulphatase A; n=1; Planctomyces mar... 75 4e-12
UniRef50_A6C284 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 75 4e-12
UniRef50_A3ZWK4 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 75 4e-12
UniRef50_A3HYT7 Cluster: Arylsulphatase A; n=1; Algoriphagus sp.... 75 4e-12
UniRef50_A6LEC5 Cluster: Arylsulfatase A; n=1; Parabacteroides d... 75 5e-12
UniRef50_A6DJ11 Cluster: Arylsulfatase A; n=1; Lentisphaera aran... 75 5e-12
UniRef50_Q7UL40 Cluster: Arylsulfatase A; n=1; Pirellula sp.|Rep... 74 7e-12
UniRef50_Q4BZ10 Cluster: Similar to Arylsulfatase A and related ... 74 7e-12
UniRef50_A6DM53 Cluster: Arylsulfatase; n=1; Lentisphaera araneo... 74 7e-12
UniRef50_A6DRX0 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 74 9e-12
UniRef50_A6DJ15 Cluster: Putative arylsulfatase; n=2; Lentisphae... 74 9e-12
UniRef50_A6CBM1 Cluster: Arylsulphatase A; n=1; Planctomyces mar... 73 1e-11
UniRef50_UPI0000E1104B Cluster: N-acetylgalactosamine 6-sulfate ... 73 2e-11
UniRef50_A6DNI9 Cluster: N-acetyl-galactosamine-6-sulfatase; n=1... 73 2e-11
UniRef50_A4A218 Cluster: Arylsulfatase A; n=1; Blastopirellula m... 73 2e-11
UniRef50_Q89L10 Cluster: Bll4738 protein; n=6; Proteobacteria|Re... 72 3e-11
UniRef50_A6EGE7 Cluster: N-acetylgalactosamine-6-sulfatase; n=3;... 72 3e-11
UniRef50_A6DJL2 Cluster: Putative exported uslfatase; n=1; Lenti... 72 3e-11
UniRef50_A6DJE5 Cluster: Sulfatase 1; n=1; Lentisphaera araneosa... 72 3e-11
UniRef50_A6DIC6 Cluster: Arylsulfatase; n=1; Lentisphaera araneo... 72 3e-11
UniRef50_A6C4Q6 Cluster: Arylsulfatase; n=1; Planctomyces maris ... 72 3e-11
UniRef50_A4AR92 Cluster: N-acetylgalactosamine-6-sulfatase; n=1;... 72 3e-11
UniRef50_Q612A1 Cluster: Putative uncharacterized protein CBG168... 72 3e-11
UniRef50_Q89L07 Cluster: Bll4741 protein; n=4; Bacteria|Rep: Bll... 72 4e-11
UniRef50_A6DUI7 Cluster: Putative exported uslfatase; n=1; Lenti... 72 4e-11
UniRef50_A6DJF1 Cluster: Sulfatase 1; n=1; Lentisphaera araneosa... 72 4e-11
UniRef50_A6DMY7 Cluster: Iduronate-sulfatase and sulfatase 1; n=... 71 5e-11
UniRef50_Q7UH63 Cluster: Arylsulphatase A; n=3; Bacteria|Rep: Ar... 71 6e-11
UniRef50_A4CK82 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 71 6e-11
UniRef50_A0Z6R0 Cluster: Putative arylsulfatase; n=1; marine gam... 71 6e-11
UniRef50_A6DPC9 Cluster: Arylsulphatase A; n=1; Lentisphaera ara... 71 8e-11
UniRef50_A6DIG7 Cluster: Iduronate-sulfatase or arylsulfatase A;... 71 8e-11
UniRef50_Q98BQ3 Cluster: Arylsulfatase; n=77; cellular organisms... 70 1e-10
UniRef50_A6DMW5 Cluster: Iduronate-sulfatase and sulfatase 1; n=... 70 1e-10
UniRef50_A6DJI7 Cluster: Sulfatase 1; n=2; Lentisphaera araneosa... 70 1e-10
UniRef50_Q7UHJ6 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 70 1e-10
UniRef50_Q64MS8 Cluster: Arylsulfatase; n=7; Bacteria|Rep: Aryls... 69 2e-10
UniRef50_A6DI94 Cluster: Arylsulfatase A; n=1; Lentisphaera aran... 69 2e-10
UniRef50_A6DI30 Cluster: N-acetylgalactosamine-6-sulfatase; n=1;... 69 2e-10
UniRef50_Q7UH85 Cluster: N-acetylgalactosamine-6-sulfatase; n=1;... 69 3e-10
UniRef50_A6DNW5 Cluster: Arylsulfatase; n=1; Lentisphaera araneo... 69 3e-10
UniRef50_A0J9Y8 Cluster: Sulfatase precursor; n=1; Shewanella wo... 69 3e-10
UniRef50_Q7UUG3 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 69 3e-10
UniRef50_Q7UM38 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 69 3e-10
UniRef50_A6DMW1 Cluster: N-acetyl-galactosamine-6-sulfatase; n=1... 69 3e-10
UniRef50_A6DI98 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 69 3e-10
UniRef50_A6DTP6 Cluster: Arylsulfatase; n=1; Lentisphaera araneo... 68 6e-10
UniRef50_A6DKM6 Cluster: Arylsulfatase A; n=1; Lentisphaera aran... 68 6e-10
UniRef50_A3XZ25 Cluster: Arylsulfatase A; n=2; Vibrionaceae|Rep:... 68 6e-10
UniRef50_UPI0000E4A9B1 Cluster: PREDICTED: similar to MGC86251 p... 67 8e-10
UniRef50_A6C1Q0 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 67 8e-10
UniRef50_A6DNY9 Cluster: Arylsulphatase A; n=3; Lentisphaera ara... 67 1e-09
UniRef50_Q7UVD9 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 66 1e-09
UniRef50_A6DPE4 Cluster: Arylsulfatase; n=1; Lentisphaera araneo... 66 1e-09
UniRef50_Q8A348 Cluster: Arylsulfatase; n=3; Bacteroides|Rep: Ar... 66 2e-09
UniRef50_Q7UH46 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 66 2e-09
UniRef50_Q1VDY3 Cluster: Probable sulfatase; n=1; Vibrio alginol... 66 2e-09
UniRef50_P15289 Cluster: Arylsulfatase A precursor (EC 3.1.6.8) ... 66 2e-09
UniRef50_Q8A362 Cluster: Arylsulfatase; n=1; Bacteroides thetaio... 66 2e-09
UniRef50_A6DPE1 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 65 3e-09
UniRef50_A6DFS2 Cluster: N-acetylgalactosamine-6-sulfatase; n=1;... 65 3e-09
UniRef50_A6CEG5 Cluster: Arylsulphatase A; n=2; Bacteria|Rep: Ar... 65 3e-09
UniRef50_A6BYP9 Cluster: Arylsulphatase A; n=1; Planctomyces mar... 65 3e-09
UniRef50_Q7ULF9 Cluster: Arylsulfatase; n=2; Bacteria|Rep: Aryls... 65 4e-09
UniRef50_A6DF77 Cluster: Arylsulphatase A; n=2; Lentisphaera ara... 65 4e-09
UniRef50_Q5AJI4 Cluster: Potential arylsulfatase; n=5; Saccharom... 65 4e-09
UniRef50_A6DLW9 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 64 6e-09
UniRef50_A6DJJ7 Cluster: Arylsulfatase; n=1; Lentisphaera araneo... 64 6e-09
UniRef50_A6DJ57 Cluster: Arylsulphatase A; n=2; Lentisphaera ara... 64 6e-09
UniRef50_A6CB33 Cluster: Arylsulfatase; n=1; Planctomyces maris ... 64 6e-09
UniRef50_Q64YV7 Cluster: Arylsulfatase; n=4; Bacteroides fragili... 64 7e-09
UniRef50_Q64R82 Cluster: N-acetylgalactosamine-6-sulfatase; n=8;... 64 7e-09
UniRef50_A6DR28 Cluster: Arylsulphatase A; n=2; Lentisphaera ara... 64 7e-09
UniRef50_UPI00015A6252 Cluster: Arylsulfatase E precursor (EC 3.... 64 1e-08
UniRef50_Q15YX5 Cluster: Sulfatase; n=1; Pseudoalteromonas atlan... 64 1e-08
UniRef50_A7LZ49 Cluster: Putative uncharacterized protein; n=1; ... 64 1e-08
UniRef50_A3HUP5 Cluster: Arylsulfatase; n=1; Algoriphagus sp. PR... 64 1e-08
UniRef50_Q7UX23 Cluster: Arylsulfatase; n=1; Pirellula sp.|Rep: ... 63 1e-08
UniRef50_Q7UUA9 Cluster: N-acetylgalactosamine 6-sulfatase; n=2;... 63 1e-08
UniRef50_A2TWL0 Cluster: N-acetylgalactosamine 6-sulfatase; n=2;... 63 1e-08
UniRef50_Q7UTJ1 Cluster: Aryl-sulphate sulphohydrolase; n=1; Pir... 63 2e-08
UniRef50_Q15XN4 Cluster: Sulfatase precursor; n=1; Pseudoalterom... 63 2e-08
UniRef50_A6DR18 Cluster: Arylsulfatase; n=1; Lentisphaera araneo... 63 2e-08
UniRef50_Q89K44 Cluster: ArsA protein; n=4; Rhizobiales|Rep: Ars... 62 2e-08
UniRef50_Q7UXA2 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 62 2e-08
UniRef50_A4GIB2 Cluster: Putative secreted sulfatase; n=1; uncul... 62 2e-08
UniRef50_A6DR15 Cluster: Arylsulfatase; n=2; Lentisphaera araneo... 62 4e-08
UniRef50_A6CZV9 Cluster: Putative arylsulfatase; n=1; Vibrio shi... 62 4e-08
UniRef50_A5FF56 Cluster: Sulfatase precursor; n=2; Bacteria|Rep:... 62 4e-08
UniRef50_A6DJ49 Cluster: Arylsulphatase A; n=1; Lentisphaera ara... 61 5e-08
UniRef50_A5NY74 Cluster: Sulfatase precursor; n=11; Bacteria|Rep... 61 5e-08
UniRef50_A3ZV95 Cluster: N-acetylgalactosamine 6-sulfatase; n=3;... 61 5e-08
UniRef50_Q7UNI8 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 61 7e-08
UniRef50_A6CA66 Cluster: N-acetylgalactosamine 6-sulfatase; n=3;... 61 7e-08
UniRef50_A7LY79 Cluster: Putative uncharacterized protein; n=1; ... 60 9e-08
UniRef50_A3I0S5 Cluster: Putative sulfatase yidJ; n=1; Algoripha... 60 9e-08
UniRef50_UPI0000588E05 Cluster: PREDICTED: similar to steroid su... 60 1e-07
UniRef50_Q7UJQ8 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 60 1e-07
UniRef50_Q02B50 Cluster: Sulfatase precursor; n=1; Solibacter us... 60 1e-07
UniRef50_A6DJU1 Cluster: Sulfatase 1; n=1; Lentisphaera araneosa... 60 1e-07
UniRef50_A1FH14 Cluster: Sulfatase precursor; n=4; Pseudomonas p... 60 1e-07
UniRef50_Q0UZB2 Cluster: Putative uncharacterized protein; n=2; ... 60 1e-07
UniRef50_Q15US6 Cluster: Sulfatase precursor; n=3; Alteromonadal... 60 2e-07
UniRef50_A6DJ37 Cluster: Arylsulphatase A; n=1; Lentisphaera ara... 60 2e-07
UniRef50_A6DF76 Cluster: Arylsulfatase A; n=1; Lentisphaera aran... 60 2e-07
UniRef50_A0PKV5 Cluster: Arylsulfatase, AslA; n=1; Mycobacterium... 60 2e-07
UniRef50_P51691 Cluster: Arylsulfatase; n=14; cellular organisms... 60 2e-07
UniRef50_Q96EG1 Cluster: Arylsulfatase G precursor; n=20; Eutele... 60 2e-07
UniRef50_A6DR29 Cluster: N-acetylgalactosamine-6-sulfatase; n=2;... 59 2e-07
UniRef50_A0X0X5 Cluster: Sulfatase precursor; n=1; Shewanella pe... 59 2e-07
UniRef50_A6DJ33 Cluster: Arylsulphatase A; n=1; Lentisphaera ara... 59 3e-07
UniRef50_Q8TMK9 Cluster: Arylsulfatase; n=12; cellular organisms... 59 3e-07
UniRef50_Q32KK0 Cluster: Arylsulfatase E; n=1; Rattus norvegicus... 58 4e-07
UniRef50_Q7UXA8 Cluster: N-acetylgalactosamine-6-sulfate sulfata... 58 4e-07
UniRef50_Q7ULY7 Cluster: Arylsulphatase A; n=1; Pirellula sp.|Re... 58 4e-07
UniRef50_Q1YR77 Cluster: N-acetylgalactosamine 6-sulfatase; n=2;... 58 4e-07
UniRef50_A6DRV5 Cluster: Arylsulfatase A; n=1; Lentisphaera aran... 58 4e-07
UniRef50_A6DJD5 Cluster: Sulfatase 1; n=1; Lentisphaera araneosa... 58 4e-07
UniRef50_Q9CKE0 Cluster: Putative uncharacterized protein PM1682... 58 5e-07
UniRef50_Q7UJR3 Cluster: Arylsulfatase; n=2; Bacteria|Rep: Aryls... 58 5e-07
UniRef50_A6DMX8 Cluster: Iduronate-sulfatase or arylsulfatase A;... 58 5e-07
UniRef50_A6DJJ6 Cluster: Sulfatase 1; n=1; Lentisphaera araneosa... 58 5e-07
UniRef50_A6DGK3 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 58 5e-07
UniRef50_Q7UYE0 Cluster: Arylsulfatase; n=1; Pirellula sp.|Rep: ... 58 6e-07
UniRef50_A6V872 Cluster: Arylsulfatase; n=1; Pseudomonas aerugin... 58 6e-07
UniRef50_A0Q2E3 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 58 6e-07
UniRef50_Q8A168 Cluster: Putative sulfatase yidJ; n=5; Bacteroid... 57 8e-07
UniRef50_Q7UYH4 Cluster: Arylsulfatase; n=1; Pirellula sp.|Rep: ... 57 8e-07
UniRef50_Q650Q8 Cluster: Arylsulfatase; n=5; Bacteria|Rep: Aryls... 57 8e-07
UniRef50_A6CEL4 Cluster: Arylsulfatase A; n=1; Planctomyces mari... 57 8e-07
UniRef50_A6C8S0 Cluster: Arylsulphatase A; n=1; Planctomyces mar... 57 8e-07
UniRef50_Q0SBH5 Cluster: Arylsulfatase; n=1; Rhodococcus sp. RHA... 57 1e-06
UniRef50_A6DIE0 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 57 1e-06
UniRef50_A6BZV9 Cluster: Arylsulfatase; n=3; Bacteria|Rep: Aryls... 57 1e-06
UniRef50_A4AWR5 Cluster: Arylsulphatase A; n=1; Flavobacteriales... 57 1e-06
UniRef50_A6DG53 Cluster: Arylsulfatase A; n=1; Lentisphaera aran... 56 1e-06
UniRef50_Q7US20 Cluster: Arylsulphatase A; n=1; Pirellula sp.|Re... 56 2e-06
UniRef50_Q01RE9 Cluster: Sulfatase precursor; n=4; Bacteria|Rep:... 56 2e-06
UniRef50_A3ZMT9 Cluster: Arylsulfatase; n=2; Planctomycetaceae|R... 56 2e-06
UniRef50_Q7UH28 Cluster: Mucin-desulfating sulfatase; n=2; Bacte... 56 3e-06
UniRef50_Q15XN1 Cluster: Sulfatase precursor; n=1; Pseudoalterom... 56 3e-06
UniRef50_A6DS43 Cluster: Arylsulfatase A; n=1; Lentisphaera aran... 56 3e-06
UniRef50_A6DI18 Cluster: Arylsulfatase A; n=2; Lentisphaera aran... 56 3e-06
UniRef50_A5FAW6 Cluster: Sulfatase precursor; n=1; Flavobacteriu... 56 3e-06
UniRef50_A7SK50 Cluster: Predicted protein; n=1; Nematostella ve... 56 3e-06
UniRef50_Q8A349 Cluster: Arylsulfatase; n=1; Bacteroides thetaio... 55 3e-06
UniRef50_A6DM25 Cluster: Sulfatase 1; n=1; Lentisphaera araneosa... 55 3e-06
UniRef50_A4ASX5 Cluster: Mucin-desulfating sulfatase; n=1; Flavo... 55 3e-06
UniRef50_UPI000065CD18 Cluster: Arylsulfatase G precursor (EC 3.... 55 4e-06
UniRef50_A6DI59 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 55 4e-06
UniRef50_Q4WVQ5 Cluster: Arylsulfatase, putative; n=13; Pezizomy... 55 4e-06
UniRef50_Q7UGA0 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 54 6e-06
UniRef50_A6DU78 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 54 6e-06
UniRef50_A6DRW5 Cluster: Putative sulfatase; n=2; Lentisphaera a... 54 6e-06
UniRef50_A6DLY1 Cluster: Putative sulfatase; n=1; Lentisphaera a... 54 6e-06
UniRef50_Q93P97 Cluster: MS134, putative arylsulfatase; n=1; Mic... 54 8e-06
UniRef50_Q1YUH3 Cluster: Arylsulfatase; n=1; gamma proteobacteri... 54 8e-06
UniRef50_Q9X759 Cluster: Arylsulfatase precursor; n=8; Enterobac... 54 8e-06
UniRef50_Q7NMX5 Cluster: Gll0640 protein; n=1; Gloeobacter viola... 54 1e-05
UniRef50_Q15XJ0 Cluster: Sulfatase; n=1; Pseudoalteromonas atlan... 54 1e-05
UniRef50_A7LZQ6 Cluster: Putative uncharacterized protein; n=1; ... 54 1e-05
UniRef50_A6C2T4 Cluster: Sulfatase; n=1; Planctomyces maris DSM ... 54 1e-05
UniRef50_Q9L5W0 Cluster: Mucin-desulfating sulfatase MdsA precur... 53 1e-05
UniRef50_Q6XUN3 Cluster: Arylsulfatase; n=1; Pseudomonas sp. ND6... 53 1e-05
UniRef50_Q1YP24 Cluster: Arylsulfatase A; n=1; gamma proteobacte... 53 1e-05
UniRef50_A3HT92 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 53 1e-05
UniRef50_UPI0000E47BCC Cluster: PREDICTED: similar to arylsulfat... 53 2e-05
UniRef50_UPI0000ECD579 Cluster: UPI0000ECD579 related cluster; n... 53 2e-05
UniRef50_A6DKC6 Cluster: Sulfatase; n=1; Lentisphaera araneosa H... 53 2e-05
UniRef50_A3ZSK1 Cluster: Arylsulphatase A; n=1; Blastopirellula ... 53 2e-05
UniRef50_Q7UGI8 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;... 52 2e-05
UniRef50_Q15SD1 Cluster: Sulfatase precursor; n=1; Pseudoalterom... 52 2e-05
UniRef50_A6DHS3 Cluster: Arylsulfatase A; n=1; Lentisphaera aran... 52 2e-05
UniRef50_A6DGL5 Cluster: N-acetylgalactosamine 6-sulfate sulfata... 52 2e-05
UniRef50_A6DG39 Cluster: Arylsulfatase; n=1; Lentisphaera araneo... 52 2e-05
UniRef50_Q8A221 Cluster: Arylsulfatase; n=6; Bacteroidetes|Rep: ... 52 3e-05
UniRef50_A3UPZ2 Cluster: Arylsulfatase; n=2; Vibrio|Rep: Arylsul... 52 3e-05
UniRef50_UPI000023D942 Cluster: hypothetical protein FG08053.1; ... 52 4e-05
UniRef50_Q8FTJ9 Cluster: Putative arylsulfatase; n=1; Corynebact... 52 4e-05
UniRef50_Q15NY5 Cluster: Sulfatase precursor; n=1; Pseudoalterom... 52 4e-05
UniRef50_A6C9F6 Cluster: Iduronate-2-sulfatase; n=1; Planctomyce... 52 4e-05
UniRef50_A3HSW7 Cluster: Arylsulfatase A; n=1; Algoriphagus sp. ... 52 4e-05
UniRef50_P51690 Cluster: Arylsulfatase E precursor; n=7; Mammali... 52 4e-05
UniRef50_A6QA55 Cluster: Arylsulfatase; n=5; Proteobacteria|Rep:... 51 6e-05
UniRef50_A6EGE6 Cluster: Sulfatase; n=1; Pedobacter sp. BAL39|Re... 51 6e-05
UniRef50_A6DLD9 Cluster: Sulfatase; n=1; Lentisphaera araneosa H... 51 6e-05
UniRef50_A2SJ95 Cluster: Arylsulfatase; n=1; Methylibium petrole... 51 6e-05
UniRef50_Q32KI0 Cluster: Arylsulfatase F; n=2; Canis lupus famil... 51 6e-05
UniRef50_A6DKC5 Cluster: Putative sulfatase yidj; n=1; Lentispha... 51 7e-05
UniRef50_UPI0000E4880B Cluster: PREDICTED: similar to RE14504p, ... 50 1e-04
UniRef50_Q7UH86 Cluster: Arylsulfatase A; n=3; Bacteria|Rep: Ary... 50 1e-04
UniRef50_A6DFU7 Cluster: Mucin-desulfating sulfatase; n=1; Lenti... 50 1e-04
UniRef50_A0LYA0 Cluster: Sulfatase; n=3; Bacteria|Rep: Sulfatase... 50 1e-04
UniRef50_P51689 Cluster: Arylsulfatase D precursor; n=55; Eutele... 50 1e-04
UniRef50_UPI0000E484C0 Cluster: PREDICTED: similar to arylsulfat... 50 1e-04
UniRef50_A6DSF3 Cluster: Putative uncharacterized protein; n=1; ... 50 1e-04
UniRef50_A6DG55 Cluster: Arylsulfatase; n=1; Lentisphaera araneo... 50 1e-04
UniRef50_A6CGJ8 Cluster: Arylsulfatase A; n=1; Planctomyces mari... 50 1e-04
UniRef50_A6DQC0 Cluster: Mucin-desulfating sulfatase; n=1; Lenti... 50 2e-04
UniRef50_A6DJ52 Cluster: Arylsulfatase; n=1; Lentisphaera araneo... 50 2e-04
UniRef50_A4GIA7 Cluster: Iduronate sulfatase; n=1; uncultured ma... 50 2e-04
UniRef50_A3M3B5 Cluster: Arylsulfatase; n=1; Acinetobacter bauma... 50 2e-04
UniRef50_Q1YQ29 Cluster: Arylsulfatase; n=1; gamma proteobacteri... 49 2e-04
UniRef50_Q7UMT6 Cluster: Mucin-desulfating sulfatase; n=2; Bacte... 49 3e-04
UniRef50_A6DG34 Cluster: Choline sulfatase; n=1; Lentisphaera ar... 49 3e-04
UniRef50_A3HV62 Cluster: Arylsulfatase; n=1; Algoriphagus sp. PR... 49 3e-04
UniRef50_Q7UYS6 Cluster: Arylsulfatase A; n=3; Bacteria|Rep: Ary... 48 4e-04
UniRef50_A6UB68 Cluster: Sulfatase; n=1; Sinorhizobium medicae W... 48 4e-04
UniRef50_A6DJ74 Cluster: Arylsulfatase A; n=1; Lentisphaera aran... 48 4e-04
UniRef50_A6C781 Cluster: Putative sulfatase; n=1; Planctomyces m... 48 4e-04
UniRef50_A5VAS7 Cluster: Sulfatase precursor; n=3; Proteobacteri... 48 4e-04
UniRef50_A4A0M2 Cluster: Heparan N-sulfatase; n=1; Blastopirellu... 48 4e-04
UniRef50_Q4RQR4 Cluster: Chromosome 2 SCAF15004, whole genome sh... 48 5e-04
UniRef50_Q7UNN1 Cluster: Arylsulphatase A; n=3; Bacteria|Rep: Ar... 48 5e-04
UniRef50_A6DP41 Cluster: Arylsulfatase A; n=1; Lentisphaera aran... 48 5e-04
UniRef50_Q89YS5 Cluster: N-acetylglucosamine-6-sulfatase; n=2; B... 48 7e-04
UniRef50_A6DJ72 Cluster: Mucin-desulfating sulfatase; n=1; Lenti... 48 7e-04
UniRef50_A0YAK5 Cluster: Sulfatase; n=3; unclassified Gammaprote... 48 7e-04
UniRef50_UPI000065DE05 Cluster: Arylsulfatase E precursor (EC 3.... 47 9e-04
UniRef50_A6U8K1 Cluster: Sulfatase; n=4; cellular organisms|Rep:... 47 9e-04
UniRef50_Q7UFA5 Cluster: Putative sulfatase yidj; n=1; Pirellula... 47 0.001
UniRef50_A6DKP1 Cluster: Arylsulphatase A; n=1; Lentisphaera ara... 47 0.001
UniRef50_A4AP83 Cluster: Putative sulfatase; n=1; Flavobacterial... 47 0.001
UniRef50_A0Z7Y7 Cluster: Arylsulfatase; n=1; marine gamma proteo... 47 0.001
UniRef50_P08842 Cluster: Steryl-sulfatase precursor; n=28; Eutel... 47 0.001
UniRef50_UPI0000E47F5E Cluster: PREDICTED: similar to arylsulfat... 46 0.002
UniRef50_A6M2E5 Cluster: Sulfatase; n=4; Clostridium|Rep: Sulfat... 46 0.002
UniRef50_A6EGE8 Cluster: Heparan N-sulfatase; n=1; Pedobacter sp... 46 0.002
UniRef50_Q7UMT5 Cluster: Probable sulfatase atsG; n=2; Planctomy... 46 0.002
UniRef50_A6DLR4 Cluster: Probable sulfatase atsG; n=1; Lentispha... 46 0.002
UniRef50_A6DHY1 Cluster: Mucin-desulfating sulfatase; n=1; Lenti... 46 0.002
UniRef50_UPI0000E0E27F Cluster: probable sulfatase atsG; n=1; al... 46 0.003
UniRef50_P95059 Cluster: POSSIBLE ARYLSULFATASE ATSA; n=21; Acti... 46 0.003
UniRef50_A7LZQ4 Cluster: Putative uncharacterized protein; n=1; ... 46 0.003
UniRef50_A7BT68 Cluster: Arylsulfatase; n=1; Beggiatoa sp. PS|Re... 46 0.003
UniRef50_A3ZTV8 Cluster: Mucin-desulfating sulfatase; n=1; Blast... 46 0.003
UniRef50_Q650K5 Cluster: Choline-sulfatase; n=7; Bacteroidales|R... 45 0.004
UniRef50_Q0HVG5 Cluster: Sulfatase precursor; n=7; Bacteria|Rep:... 45 0.004
UniRef50_A6DIZ7 Cluster: Arylsulfatase; n=1; Lentisphaera araneo... 45 0.004
UniRef50_A7SK49 Cluster: Predicted protein; n=2; Nematostella ve... 45 0.004
UniRef50_Q4WBJ6 Cluster: Arylsulfatase, putative; n=4; Pezizomyc... 45 0.004
UniRef50_Q061A4 Cluster: Putative sulfatase; n=1; Synechococcus ... 45 0.005
UniRef50_A4AM21 Cluster: Arylsulfatase A; n=2; Bacteroidetes|Rep... 45 0.005
UniRef50_Q4RYA1 Cluster: Chromosome 3 SCAF14978, whole genome sh... 44 0.006
UniRef50_Q392C1 Cluster: Sulfatase; n=11; Burkholderiaceae|Rep: ... 44 0.006
UniRef50_A7A9X1 Cluster: Putative uncharacterized protein; n=1; ... 44 0.006
UniRef50_A6DNI8 Cluster: Putative N-acetylglucosamine-6-sulfatas... 44 0.006
UniRef50_A6DGT7 Cluster: Sulfatase family protein; n=1; Lentisph... 44 0.006
UniRef50_UPI0000EBF0AD Cluster: PREDICTED: similar to arylsulfat... 44 0.008
UniRef50_Q482B9 Cluster: Sulfatase family protein; n=1; Colwelli... 44 0.008
UniRef50_Q028N3 Cluster: Sulfatase; n=1; Solibacter usitatus Ell... 44 0.008
UniRef50_A6DSG8 Cluster: Iduronate sulfatase; n=1; Lentisphaera ... 44 0.008
UniRef50_UPI0001555E0A Cluster: PREDICTED: similar to arylsulfat... 44 0.011
UniRef50_Q7UYC5 Cluster: N-acetyl-galactosamine-6-sulfatase; n=2... 44 0.011
UniRef50_Q1YSK8 Cluster: Mucin-desulfating sulfatase; n=1; gamma... 44 0.011
UniRef50_A6DHI3 Cluster: Probable sulfatase atsG; n=1; Lentispha... 44 0.011
UniRef50_A6DG59 Cluster: Arylsulfatase; n=1; Lentisphaera araneo... 44 0.011
UniRef50_Q2U8N6 Cluster: Sulfatases; n=1; Aspergillus oryzae|Rep... 44 0.011
UniRef50_Q7D5R3 Cluster: Sulfatase family protein; n=10; Mycobac... 43 0.015
UniRef50_Q4C1V0 Cluster: Similar to Arylsulfatase A and related ... 43 0.015
UniRef50_A6E7U2 Cluster: Putative exported sulfatase; n=1; Pedob... 43 0.015
UniRef50_A6CAZ0 Cluster: Probable sulfatase atsG; n=1; Planctomy... 43 0.015
UniRef50_A3VUB6 Cluster: Sulfatase; n=1; Parvularcula bermudensi... 43 0.015
UniRef50_Q18924 Cluster: Sulfatase domain protein protein 2; n=2... 43 0.015
UniRef50_Q2U5H2 Cluster: Sulfatases; n=9; Pezizomycotina|Rep: Su... 43 0.015
UniRef50_Q7UY39 Cluster: Similar to sulfatase 1; n=1; Pirellula ... 43 0.019
UniRef50_Q5LNC6 Cluster: Arylsulfatase; n=1; Silicibacter pomero... 43 0.019
UniRef50_Q2CEI6 Cluster: Putative choline-sulfatase; n=1; Oceani... 43 0.019
UniRef50_A7HUP5 Cluster: Sulfatase precursor; n=2; Alphaproteoba... 43 0.019
UniRef50_A6DPD1 Cluster: Probable sulfatase atsG; n=1; Lentispha... 43 0.019
UniRef50_A6DJ46 Cluster: Arylsulfatase; n=1; Lentisphaera araneo... 43 0.019
UniRef50_A6DFG6 Cluster: Arylsulfatase; n=1; Lentisphaera araneo... 43 0.019
UniRef50_A4XU46 Cluster: Sulfatase; n=1; Pseudomonas mendocina y... 43 0.019
>UniRef50_Q8SZ72 Cluster: RE14504p; n=9; Eumetazoa|Rep: RE14504p -
Drosophila melanogaster (Fruit fly)
Length = 562
Score = 502 bits (1239), Expect = e-141
Identities = 236/455 (51%), Positives = 304/455 (66%), Gaps = 9/455 (1%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
MQH V+Y AEPRGLPL EKILPQYL +LGY +H+ GKWHLG +K +Y PL RGF SHVGF
Sbjct: 91 MQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLYRGFSSHVGF 150
Query: 61 WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLF 119
W+G D DHT +E WG D R G +VA+DL G Y TDV TD ++KV+ +HN ++ PLF
Sbjct: 151 WSGHQDYNDHTAVENNQWGLDMRNGTQVAYDLHGHYTTDVITDHSVKVIANHNATKGPLF 210
Query: 120 LMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRG 179
L +AH+A HS NPY P+ P + +I + R+KFAA++SK+D SVG++V L
Sbjct: 211 LYVAHAACHSSNPYNPLPVPDNDVIKMSHIPNYKRRKFAAMVSKMDNSVGQIVDQLRKSN 270
Query: 180 LLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVA 239
+LENSI++FS+DNGGPA GFN N ASNYPLKGVKNTLWEGGVR AG +WSPLL RV+
Sbjct: 271 MLENSIIIFSSDNGGPAQGFNLNFASNYPLKGVKNTLWEGGVRAAGLMWSPLLKKSQRVS 330
Query: 240 YQKMHISDWLPTLYSAAGGD--LSVL-ENLDGVNQWDALSKNTESPRTSVLHNIDDIWGI 296
Q MHI DWLPTL AAGG LS L + +DG + W AL ++ SPR +VLHNIDDIWG
Sbjct: 331 NQTMHIIDWLPTLLEAAGGQPALSNLSKQIDGQSIWRALVQDKASPRLNVLHNIDDIWGS 390
Query: 297 AALTVDKYKLIKGTIYKGVWDNWYGPSGREG--AYNASLLYDSHAGRILDKLNLMPPKEK 354
AAL+V +KL+KGT Y+G WD WYGP+G Y+ L+ S AG+ L+ L ++P +
Sbjct: 391 AALSVGDWKLVKGTNYRGSWDGWYGPAGERDPRLYDWQLVGRSRAGKALEALKMLPSRAD 450
Query: 355 VMELRDEATVKC-NDSIEVIQCKPR--DAPCVFNIDEDPCERRNXXXXXXXXXXXXXXXM 411
+R ATV C S + C APC+F+I +DPCE+ N +
Sbjct: 451 QQRIRAAATVSCPGQSSQGTSCVATAFSAPCLFHIRDDPCEQYNLAKQYPEVVNALMTEL 510
Query: 412 HKLNVSAVAPNAQPIDARGDPQYWGRVYTNFGNYE 446
+ N +AV P+ +P D R DP++W +TNFG+Y+
Sbjct: 511 ERFNATAVPPSNKPADPRADPRFWNYTWTNFGDYQ 545
>UniRef50_A7RFN2 Cluster: Predicted protein; n=2; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 512
Score = 360 bits (886), Expect = 4e-98
Identities = 187/441 (42%), Positives = 257/441 (58%), Gaps = 27/441 (6%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
MQH VI A+P GL LNE ++PQYLK LGY TH VGKWHLG +K EY P+ RGFDS+ G+
Sbjct: 89 MQHSVILAAQPYGLGLNETLMPQYLKRLGYATHGVGKWHLGFFKYEYTPIQRGFDSYFGY 148
Query: 61 WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
W G+ D +DH+ E+ WG D + +G Y++D++ ++A+ V+++HN S PLFL
Sbjct: 149 WCGKGDYWDHSNNEKYGWGLDLHDSEQDVWTEWGHYSSDLFAEKAVNVISTHNASVPLFL 208
Query: 121 MLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGL 180
L AVHS N +P++AP LID FK I D R+ FAA++S +D ++ KVV +L R +
Sbjct: 209 YLPFQAVHSANFIQPLQAPPDLIDKFKNIKDERRRIFAAMVSSMDGAIKKVVDSLKARSM 268
Query: 181 LENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAY 240
NSI+VF+TDNGGPA GF+ N ASN+PL+GVK TLWEGG+RG F+ SPL+ RV
Sbjct: 269 YNNSIIVFTTDNGGPANGFDSNMASNFPLRGVKRTLWEGGIRGTAFIHSPLITKPGRVMT 328
Query: 241 QKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALT 300
+ MH+SDWLPTLY+ AGGD+ L+NLDG + WD++S + SPR ++HNID + AA
Sbjct: 329 ELMHVSDWLPTLYTVAGGDIHDLQNLDGFDLWDSISTDAMSPREEMVHNIDPVNWEAAYR 388
Query: 301 VDKYKL-IKGTIYKGVWDNWYGPSGREGAYNASLLYDSHAGRILDKLNLMPPKEKVMELR 359
++K+ + T Y W + P+ E + + L D+ + PP E
Sbjct: 389 FREWKIVVNQTKYMSGW--YPLPNIEEREPHPATLRDA-------VVKCGPPPE------ 433
Query: 360 DEATVKCNDSIEVIQCKPRDAPCVFNIDEDPCERRNXXXXXXXXXXXXXXXMHKLNVSAV 419
I V C D PC+FNI DPCE N + V
Sbjct: 434 ----------IPV-NCTASDGPCLFNIKNDPCEYVNLAKKELEILNNMLIWLEGYKKGMV 482
Query: 420 APNAQPIDARGDPQYWGRVYT 440
P+D +P +G V+T
Sbjct: 483 PIRNTPLDPSANPANYGGVWT 503
>UniRef50_UPI0000E49A98 Cluster: PREDICTED: similar to
ENSANGP00000029647, partial; n=7; Strongylocentrotus
purpuratus|Rep: PREDICTED: similar to
ENSANGP00000029647, partial - Strongylocentrotus
purpuratus
Length = 474
Score = 355 bits (874), Expect = 1e-96
Identities = 168/396 (42%), Positives = 244/396 (61%), Gaps = 22/396 (5%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
+Q+ VI EP GL NE I+PQYL+ LGY+TH+VGKWHLG +K+ P +RGF+S+ G+
Sbjct: 94 LQYSVIIADEPYGLGTNETIMPQYLRSLGYRTHMVGKWHLGFFKESLTPSHRGFESYYGY 153
Query: 61 WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
+ G D + H + E G DF + +FG Y+T++YT++ +++ +HN EPL++
Sbjct: 154 YGGMQDYFTHESTEHTLTGFDFHVNGSIYKPVFGQYSTEIYTEKTQEIIRNHNPQEPLYI 213
Query: 121 MLAHSAVHSGNPY-EPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRG 179
LAH AVHS N + ++AP K + F I + R+KFAA++S LD+S+G + + L
Sbjct: 214 YLAHQAVHSANYNGQRLQAPYKYYERFPNITNENRRKFAAMVSALDDSLGNITQTLKESS 273
Query: 180 LLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVA 239
L N+++VF+TDNGGPA GF+ N A+N+PL+GVK+T WEGG+RGAGFLW L++ R +
Sbjct: 274 LYNNTVIVFTTDNGGPAHGFDANYANNWPLRGVKDTTWEGGLRGAGFLWGALIEKPGRTS 333
Query: 240 YQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAAL 299
MH+ DW+PTLY AGG+ S L++LDG++ W LS+ SPR +LHNID + ++A+
Sbjct: 334 DGMMHVCDWVPTLYGLAGGNTSTLQHLDGIDVWPMLSRAEPSPREEILHNIDPVRNVSAI 393
Query: 300 TVDKYKLIKGTIYKGVWDNWYGPSGREGAYNASLLYDSHAGRILDKLNLMPPKEKVMELR 359
+ YKL++G Y G W +WY P G +S+ DS +P V
Sbjct: 394 RIGDYKLVQGQNYNGSWSDWYPPEG-----ESSVDVDSKP---------VPNAFVVSCPS 439
Query: 360 DEATVKCNDSIEVIQCKPRDAPCVFNIDEDPCERRN 395
A N C P++ PC+FNI DPCE N
Sbjct: 440 KPANASTN-------CDPKEKPCLFNIRHDPCEFNN 468
>UniRef50_Q9VVM4 Cluster: CG7402-PA; n=3; Diptera|Rep: CG7402-PA -
Drosophila melanogaster (Fruit fly)
Length = 579
Score = 355 bits (872), Expect = 2e-96
Identities = 179/464 (38%), Positives = 262/464 (56%), Gaps = 13/464 (2%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
MQH VI EP GLP E+++P+ +D GY THLVGKWHLG ++K+ P RGFD H G+
Sbjct: 93 MQHFVIITDEPWGLPQRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMRGFDHHFGY 152
Query: 61 WTGRIDMYDHTTM---EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEP 117
+ G ID YDH S G DFRR E + G YAT+ +T EA +++ H+KS+P
Sbjct: 153 YNGYIDYYDHQVRMLDRNYSAGLDFRRDLEPCPEANGTYATEAFTSEAKRIIEQHDKSKP 212
Query: 118 LFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHT 177
LF++L+H AVH+GN P++AP++ + F +I D R+ +A ++S LD+SV + + AL
Sbjct: 213 LFMVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLDKSVAQTIGALKD 272
Query: 178 RGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKAR 237
G+L NSI++ +DNG P G + NA SNYP +G K + WEGG+R AG LWSPLL +
Sbjct: 273 NGMLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQKESPWEGGIRSAGALWSPLLKERGY 332
Query: 238 VAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIA 297
V+ Q +H DWLPTL AAG L LDG+N W LS N E +++H +D+++G +
Sbjct: 333 VSNQAIHAVDWLPTLAGAAGVSLPQDLPLDGINLWPMLSGNEEPKPRTMIHVLDEVFGYS 392
Query: 298 ALTVDKYKLIKGTIYKGVWDNWYGPSGREGAYNASLLYDSH--AGRILDKL-NLMPPKEK 354
+ D K + G+ +KG +D W G Y+ H A + L N K++
Sbjct: 393 SYMRDTLKYVNGSSFKGRYDQWLGELETNEDDPLGESYEQHVLASDVQSLLGNRGLTKDR 452
Query: 355 VMELRDEATVKC------NDSIEVIQCKPRDAPCVFNIDEDPCERRNXXXXXXXXXXXXX 408
+ ++R EAT C N +C+P APC F++ +DPCER N
Sbjct: 453 IRQMRSEATETCPPIEGQNPLESHFKCEPLKAPCFFDLAKDPCERYNLAQMYPLQLQQLA 512
Query: 409 XXMHKLNVSAVAPNAQP-IDARGDPQYWGRVYTNFGNYETQHGS 451
+ ++ +A+ P D+R +P + + + N +TQ GS
Sbjct: 513 DELEQIRKTAIPSARVPHSDSRANPTFHNGNWEWWNNTDTQSGS 556
>UniRef50_UPI0000D55F5E Cluster: PREDICTED: similar to CG8646-PA;
n=1; Tribolium castaneum|Rep: PREDICTED: similar to
CG8646-PA - Tribolium castaneum
Length = 626
Score = 352 bits (866), Expect = 1e-95
Identities = 192/470 (40%), Positives = 259/470 (55%), Gaps = 44/470 (9%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
MQH VI EP GLPLNE ILPQYLK GY TH +GKWHLG ++KEY P RGFDSH G+
Sbjct: 88 MQHLVILEPEPWGLPLNETILPQYLKRNGYATHAIGKWHLGFFRKEYTPTYRGFDSHYGY 147
Query: 61 WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
D RR V G Y+T ++TDEA++++ HN P+F+
Sbjct: 148 --------------------DMRRNMTVDWSAQGKYSTTLFTDEAVRLIREHNTENPMFM 187
Query: 121 MLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGL 180
LAH A HSGN +P++AP + I F +I D R+ +AA++S LD+SVG V+ AL + +
Sbjct: 188 YLAHLAPHSGNDDDPLQAPDEEIAKFGHIADPERRIYAAMVSMLDKSVGSVIAALRDKHM 247
Query: 181 LENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAY 240
LENSI+VF +DNG G + N SNYPL+G KN+ WEG +R +WSPL+ RV+
Sbjct: 248 LENSIIVFMSDNGAKPDGIHANHGSNYPLRGNKNSAWEGAMRCVAAIWSPLIKKPQRVSN 307
Query: 241 QKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALT 300
MHISDWLPT Y+AAG + + L +DGV+ W ++S+ +SPRT +LHNID+I+ AL
Sbjct: 308 SLMHISDWLPTFYTAAGLNKTELPKMDGVDMWASISEGKDSPRTELLHNIDEIYNYGALR 367
Query: 301 VDKYKLIKGTIYKGVWDNWYGPSGREG--AYNASLLYDSHAGRILDKLNLMPP-KEK--- 354
V +K + G+ G D WYG SGR+ Y+ S + S G L L KEK
Sbjct: 368 VGNWKYLYGSTTNGKSDGWYGSSGRDPLYTYDDSAVLASQTGSTLAGLTTYQQIKEKHQG 427
Query: 355 -------------VMELRDEATVKC-----NDSIEVIQCKPRDAPCVFNIDEDPCERRNX 396
+ LR A VKC + E +C ++PC+FNI EDPCE+ N
Sbjct: 428 DTNFTHKLLDSETIKTLRGAAEVKCPRVNFEEIPESKKCNAVESPCLFNIKEDPCEQINL 487
Query: 397 XXXXXXXXXXXXXXMHKLNVSAVAPNAQPIDARGDPQYWGRVYTNFGNYE 446
+ + +A+ P D DP W + N+ +YE
Sbjct: 488 AAERPMIVLNMEMALARFKQTALPIRNVPRDPNADPAKWNNTWVNWQDYE 537
>UniRef50_Q17B03 Cluster: Arylsulfatase b; n=3; Culicidae|Rep:
Arylsulfatase b - Aedes aegypti (Yellowfever mosquito)
Length = 675
Score = 350 bits (861), Expect = 4e-95
Identities = 175/451 (38%), Positives = 260/451 (57%), Gaps = 14/451 (3%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
MQH VI EP GL L++KI+P+Y K+ GY+THLVGKWHLG K+Y P RGFD+HVG+
Sbjct: 97 MQHYVIVSDEPWGLGLDQKIMPEYFKEAGYRTHLVGKWHLGFSAKQYTPTMRGFDTHVGY 156
Query: 61 WTGRIDMYDHT---TMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEP 117
+D +D+T + + G D R V +D G YATD +T A ++ H+ +P
Sbjct: 157 LGPYVDYWDYTLKFSPPKSFQGYDMRNNLNVDYDSNGTYATDHFTKAASSIIERHDTKDP 216
Query: 118 LFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHT 177
LFL++ H A H+ N +P++AP++ I F YI D R+ +AA++SKLD+SVG++ +L +
Sbjct: 217 LFLVVNHLAPHAANDDDPLQAPEEDIRKFDYISDERRRIYAAMVSKLDDSVGQIFNSLRS 276
Query: 178 RGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKAR 237
+ +L+NSI++F +DNG P A + N SNYPL+G+K+ WE R +WSPLL + R
Sbjct: 277 KNMLDNSIILFMSDNGAPTAALHANTGSNYPLRGIKSVPWEAATRCVAAIWSPLLQERQR 336
Query: 238 VAYQKMHISDWLPTLYSAAGGDLSVLEN---LDGVNQWDALSKNTESPRTSVLHNIDDIW 294
V+ Q +HISDWLPTL SAAG D+ ++ +DG +QW+ALS +T +PR VL+ ID+I+
Sbjct: 337 VSNQFIHISDWLPTLASAAGIDIPFSKDHSEIDGQDQWEALSYDTGNPRRVVLNMIDEIY 396
Query: 295 GIAALTVDKYKLIKGTIYKGVWDNWYGPSGREGAYNASLLYDSHAGRILDK-----LNLM 349
G ++ + +K + GT G +D WY G+ + +L D + +L
Sbjct: 397 GYSSYMENGFKFVNGTYSNGSYDGWY---GQPNTSDQTLSDDQYIDLVLQTEITRWAGET 453
Query: 350 PPKEKVMELRDEATVKCNDSIEVIQCKPRDAPCVFNIDEDPCERRNXXXXXXXXXXXXXX 409
++ + LR A V CN E +C P PC+F+I DPCE +
Sbjct: 454 ISRDTIKYLRKHARVNCNHQPEANKCNPLKRPCLFDIINDPCELNDLSHKFPMKFRELRS 513
Query: 410 XMHKLNVSAVAPNAQPIDARGDPQYWGRVYT 440
+ A P +P D +P +G V+T
Sbjct: 514 TVQTYRRLATKPRNKPADPAANPANFGGVWT 544
>UniRef50_UPI00015B5C4D Cluster: PREDICTED: similar to
ENSANGP00000018435; n=1; Nasonia vitripennis|Rep:
PREDICTED: similar to ENSANGP00000018435 - Nasonia
vitripennis
Length = 710
Score = 347 bits (854), Expect = 3e-94
Identities = 169/347 (48%), Positives = 230/347 (66%), Gaps = 13/347 (3%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
MQH VI AEPRGLPL+EKILPQYLK+ GY TH +GKWH G +++EY P RGFDSH G+
Sbjct: 109 MQHLVILEAEPRGLPLHEKILPQYLKEAGYATHAIGKWHQGFHRREYTPTYRGFDSHFGY 168
Query: 61 WTGRIDMYDH----TTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHN-KS 115
W G D Y H + ++G G D RR +A D +G Y+TD++TDEA++++ H ++
Sbjct: 169 WQGLQDYYTHEVGSSNPKEGFLGFDMRRNMSLARDTYGKYSTDLFTDEAVRLIEEHRPEA 228
Query: 116 EPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKAL 175
P+FL LAH A HSGN EP++AP + + F Y++D R+ +AA++SKLD+SVG+VV AL
Sbjct: 229 GPMFLYLAHLAPHSGNDNEPLQAPDEEVAKFSYVEDPERRIYAAMMSKLDQSVGEVVSAL 288
Query: 176 HTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSK 235
+ +L+NSIVVF DNG G + N SNYPL+G+K + WEG VRGA +WSPL+
Sbjct: 289 RRKNMLQNSIVVFMADNGAATQGIHYNRGSNYPLRGIKASAWEGAVRGAAAVWSPLIQRP 348
Query: 236 ARVAYQKMHISDWLPTLYSAAG--GDLSVLENLDGVNQWDALSKNTES-PRTSVLHNIDD 292
R+ + M I+DWLPTL SA+G + V N+DGV+QW A+S S PR +L NID
Sbjct: 349 KRIYNELMSIADWLPTLLSASGLRDVVRVSANIDGVDQWPAISGVAPSPPRNEILVNIDP 408
Query: 293 IWGIAALTVDKYKLIKGTIYKGVWDNWYGPSGR---EGAYNASLLYD 336
I+ +AL ++K + GT+ G + WYG +GR +G AS YD
Sbjct: 409 IFNYSALRRGEFKYVLGTVGNG--EEWYGETGRPENQGLEGASPTYD 453
Score = 69.3 bits (162), Expect = 2e-10
Identities = 28/91 (30%), Positives = 49/91 (53%), Gaps = 1/91 (1%)
Query: 353 EKVMELRDEATVKCN-DSIEVIQCKPRDAPCVFNIDEDPCERRNXXXXXXXXXXXXXXXM 411
+++++LR A+++C E + C P +PC+FNI EDPCE+RN +
Sbjct: 503 DELLKLRSSASLRCTVPESERVACHPLQSPCLFNIKEDPCEQRNLAASRAMILATLEEAL 562
Query: 412 HKLNVSAVAPNAQPIDARGDPQYWGRVYTNF 442
K V+A+ P+ P D + +P +W + N+
Sbjct: 563 LKYRVTALPPSNVPNDPKANPAFWNHTWVNW 593
>UniRef50_UPI0000D55D4D Cluster: PREDICTED: similar to CG8646-PA;
n=1; Tribolium castaneum|Rep: PREDICTED: similar to
CG8646-PA - Tribolium castaneum
Length = 558
Score = 330 bits (810), Expect = 6e-89
Identities = 177/407 (43%), Positives = 250/407 (61%), Gaps = 13/407 (3%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
+Q I AE R LP KI+ +Y KD+GY THLVGKWHLG + P RGFD GF
Sbjct: 95 LQGPSITPAEGRSLP-EGKIMSEYFKDMGYATHLVGKWHLGHSRWNDTPTFRGFDHFFGF 153
Query: 61 WTGRIDMYDHTTM----EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE 116
+ G YD+ + ++ G D RR + + G YATD++ + A+ V+ HN +
Sbjct: 154 YNGFTSYYDYVSNWKINDKEYSGFDLRRDTVPSWNDAGKYATDLFAEHAVDVIQKHNVNT 213
Query: 117 PLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALH 176
PLF+M+AH AVH GN + + APQ+ ++ FK+I D R+ +AA++SKLD+S+G V +AL
Sbjct: 214 PLFMMIAHLAVHVGNEGKWLEAPQETVNKFKHIRDPNRRTYAAMVSKLDDSIGAVFEALE 273
Query: 177 TRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKA 236
+ +L+N+IVVF +DNG P G + N SNYPL+G+K+TL+EGGVR +WSPLL +
Sbjct: 274 AKNMLQNTIVVFISDNGAPTVGPHHNWGSNYPLRGIKDTLFEGGVRTVACIWSPLLVQSS 333
Query: 237 RVAYQKMHISDWLPTLYSAAGGDLSVLE-NLDGVNQWDALSKNTESPRTSVLHNIDDIWG 295
RV+ +HI+DWLPTL++A GGDLSVL+ +LDG++QW +L + S R + NID+
Sbjct: 334 RVSTDLIHITDWLPTLFTAVGGDLSVLDPDLDGIDQWSSLVYDLPSARNDIPLNIDEKTR 393
Query: 296 IAALTVDKYKLIKGTIYKGVWDNWYG----PSGREGAYNASLLYDSHAGRILDKLNLMPP 351
AAL +KLI GT G ++ ++G + E YN S + DS GRI K+N P
Sbjct: 394 NAALRFSYWKLIVGTSGNGSYNGYFGAPLNENIEEQQYNTSAINDSPVGRIAKKINYNPL 453
Query: 352 KEKVME-LRDEATVKCNDS-IEVIQCKPRD-APCVFNIDEDPCERRN 395
E + LR AT+KC D+ + C P A C++NI DPCE +
Sbjct: 454 SETDFDGLRRVATLKCLDAKAKRNPCDPASGAVCLYNIPNDPCEEND 500
>UniRef50_UPI00015B40BD Cluster: PREDICTED: similar to RE14504p;
n=1; Nasonia vitripennis|Rep: PREDICTED: similar to
RE14504p - Nasonia vitripennis
Length = 571
Score = 323 bits (793), Expect = 7e-87
Identities = 168/397 (42%), Positives = 249/397 (62%), Gaps = 16/397 (4%)
Query: 9 AEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMY 68
AEPRG+PL+E++LP+YL++LGY T LVGKWHLG Y ++ P RGFDS VG++ G I +
Sbjct: 99 AEPRGVPLHERLLPEYLRELGYVTRLVGKWHLGYYTDKHTPTRRGFDSFVGYYGGVITYF 158
Query: 69 DHTTMEQGSWGTDFRRGFEVAHDLFG--VYATDVYTDEAIKVVNSHNKSEPLFLMLAHSA 126
+HT + G D+ + F Y TD +D+A V+ +H++ +PLFL LAH A
Sbjct: 159 NHTVTKDKHTGIDYHWDTSGKIEPFDNDQYVTDFISDQAEAVIKNHDRKKPLFLQLAHVA 218
Query: 127 VHSGNPYEPI--RAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENS 184
H+ +PI R ++ D YI D R+K+A V++ +D+SVG+VVKAL +L NS
Sbjct: 219 AHASENRDPIEVRNMTEVNDTLSYIPDINRRKYAGVVTAMDDSVGRVVKALKDANMLSNS 278
Query: 185 IVVFSTDNGGPAAGFN-DNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKM 243
I++F +DNG P A N SNYPL+G+K T++EGGVR ++SP L + RV+ +
Sbjct: 279 IIIFMSDNGSPTAEAPYTNYGSNYPLRGIKATVFEGGVRVPACVFSPRLKDRFRVSDELF 338
Query: 244 HISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALTVDK 303
HI+DW PTLY AGGDLS +++LDGV+QW ++S + +S R S+L NID++ A
Sbjct: 339 HITDWFPTLYKLAGGDLSKIQDLDGVDQWSSISGSQKSNRESLLVNIDEVSNPEAAISGY 398
Query: 304 YKLIKGTIYKGVWDNWYGPSGRE---GAYNASLLYDSHAGRILDKL--NLMPPKEKVMEL 358
YKLI+G +D++YG G + Y+ + + S AGR + L +PP++++ EL
Sbjct: 399 YKLIRGI---NRYDDYYGKDGNDYSPKTYDVTGVLSSLAGRAIASLGNQYLPPQKRITEL 455
Query: 359 RDEATVKCNDSIEVIQCKPRDAPCVFNIDEDPCERRN 395
R++AT++C + C RD C+F+I +DPCE N
Sbjct: 456 RNKATLRCEKKDDRPSC--RDT-CLFDIVKDPCETTN 489
>UniRef50_Q8IQS4 Cluster: CG32191-PA; n=2; Sophophora|Rep:
CG32191-PA - Drosophila melanogaster (Fruit fly)
Length = 554
Score = 317 bits (778), Expect = 4e-85
Identities = 165/407 (40%), Positives = 242/407 (59%), Gaps = 16/407 (3%)
Query: 2 QHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFW 61
QH VI EP L LN ++P+ K+ GY T+LVGKWHLG + EY P RGFD H G+W
Sbjct: 93 QHFVISNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYW 152
Query: 62 TGRIDMYDHTT---MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSH-NKSEP 117
ID + + + S G DFRR E+ GVY TD+ T EA +++ H +K +P
Sbjct: 153 GAYIDYFQRRSKMPVANYSLGYDFRRNMELECRDRGVYVTDLLTAEAERLIKDHADKEQP 212
Query: 118 LFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHT 177
LFLML+H A H+ N +P++AP++ I F YI D R+K+AA++SKLD+SVG+++ AL +
Sbjct: 213 LFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDPNRRKYAAMISKLDQSVGRIITALSS 272
Query: 178 RGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKAR 237
LENSIV+F +DNG P+ G N SN+PL+G KNT WEGGVR AG +WS L ++
Sbjct: 273 TDQLENSIVIFYSDNGAPSVGMFSNTGSNFPLRGQKNTPWEGGVRVAGAIWSSGLQARGS 332
Query: 238 VAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRT--SVLHNIDDIWG 295
+ Q ++++DWLPTL AA +L LDG++ W LS + ++P +LH +DD+W
Sbjct: 333 IFRQPLYVADWLPTLSRAADIELDSSLKLDGIDLWPELSGSADAPHVPREILHILDDVWR 392
Query: 296 IAALTVDKYKLIKGTIYKGVWDN--WYGP----SGREGAYNASLLYDSHAGRILDKLNLM 349
++AL + ++K + GT G +D+ Y R+ Y A + +S R L + +L
Sbjct: 393 LSALQMGQWKYVNGTTASGRYDSVLTYRELDDLDPRDSRY-AVTVRNSATSRALSRYDLR 451
Query: 350 P-PKEKVMELRDEATVKCNDSIEVIQCKPRDAPCVFNIDEDPCERRN 395
++++ R A V+C D C P C+++I DPCE+ N
Sbjct: 452 RLTQQRISLTRRLAAVRCGDLQR--SCNPLLEECLYDILSDPCEQNN 496
>UniRef50_UPI0000DB708B Cluster: PREDICTED: similar to CG7402-PA
isoform 2; n=2; Apocrita|Rep: PREDICTED: similar to
CG7402-PA isoform 2 - Apis mellifera
Length = 609
Score = 316 bits (775), Expect = 1e-84
Identities = 173/442 (39%), Positives = 249/442 (56%), Gaps = 17/442 (3%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
MQ I G EPRGLPL+ KILP++L+ LGY T L+GKWH+G + +Y PL+RGFD+ GF
Sbjct: 96 MQGDGIRGGEPRGLPLHIKILPEHLRGLGYVTKLIGKWHMGFHTLQYTPLHRGFDTFFGF 155
Query: 61 WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
+ I YD+ Q G D G + A+ + YATD++T+EAIK++ +H PL+L
Sbjct: 156 YNSHITYYDYEYSNQNMTGYDMHCGDDPAYGMKREYATDLFTNEAIKIIENHELPRPLYL 215
Query: 121 MLAHSAVHSGNPYEPIRAPQKLI-DAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRG 179
++H AVH+ PI P D I + R+K+A ++SKLDESVG+VV AL +G
Sbjct: 216 QISHLAVHA-----PIEQPDDSSRDEIVQIREPNRRKYAKMVSKLDESVGRVVHALGEKG 270
Query: 180 LLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVA 239
+L +S+++F TDNG + G N SNYPL+G K TL+EGGVRG LWS L+ ARV
Sbjct: 271 MLRDSLILFLTDNGAASIGRYRNYGSNYPLRGTKYTLYEGGVRGVAALWSSRLEKGARVF 330
Query: 240 YQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAAL 299
+ +HI+DWLPTLYSAAGGDL L +DG++QW LS+ R +L NID++
Sbjct: 331 KKLIHITDWLPTLYSAAGGDLKDLGKIDGIDQWRVLSEGQGHGREKLLLNIDEVMITEGA 390
Query: 300 TVDKYKLIKGTIYKGVWDNWYGPSGR---EGAYNASLLYDSHAGRILDKL-NLMPPKEKV 355
++KL++G G +D +YG SGR Y +L + + I L + +
Sbjct: 391 IYSRFKLLRG---NGYYDKYYGDSGRTLETPPYTEVVLKSAVSQSITYHLGGPVTQPSTM 447
Query: 356 MELRDEATVKCNDSIEVIQCKP----RDAPCVFNIDEDPCERRNXXXXXXXXXXXXXXXM 411
++LR EATV+C+ ++ C+F+I DPCE +N +
Sbjct: 448 VQLRREATVQCHPNMSYYYRHSFTFCNVTECLFDIVNDPCETKNIAEAYARIARDLDLYL 507
Query: 412 HKLNVSAVAPNAQPIDARGDPQ 433
+ +P+D DP+
Sbjct: 508 EHYGRVLMKQIRKPVDWLADPK 529
>UniRef50_UPI0000D56522 Cluster: PREDICTED: similar to CG7402-PA;
n=1; Tribolium castaneum|Rep: PREDICTED: similar to
CG7402-PA - Tribolium castaneum
Length = 558
Score = 311 bits (764), Expect = 2e-83
Identities = 159/408 (38%), Positives = 240/408 (58%), Gaps = 16/408 (3%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
MQ I E R LPLN ++PQ+LK+LGY+TH+VGKWHLGS + P +GFDSH G+
Sbjct: 91 MQGLPIVAGENRSLPLNMPLMPQHLKNLGYRTHIVGKWHLGSAYRSSTPTEKGFDSHFGY 150
Query: 61 WTGRIDMYDHTTMEQGSW--GTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPL 118
W G YD+ T + G D FE G YAT V+T+ A+ ++ HN + PL
Sbjct: 151 WNGFTGYYDYFTDFNSTAIEGFDLHDRFETERGYQGQYATRVFTERALDIIEGHNTTRPL 210
Query: 119 FLMLAHSAVHSGNPYEPIRAPQKLID--AFKYIDDSARQKFAAVLSKLDESVGKVVKALH 176
FL++ H A H+G + P ++ + YI D R+ +A ++++LD S+G+VV+ L
Sbjct: 211 FLLMTHLAAHAGRDGTELGVPNEVEAQRTYSYIQDPRRRLYAEIVAELDRSIGQVVRKLS 270
Query: 177 TRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKA 236
R +LENSI++F +DNG P G N+ SN+PL+G+K T +EGG+RG ++SPLL +
Sbjct: 271 ERQMLENSIILFFSDNGAPTVGPYTNSGSNWPLRGIKLTNFEGGIRGTATIFSPLLKKRG 330
Query: 237 RVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGI 296
V + +H+SDWLPT Y+AAGG+L+ L +DGVNQW LS +T SPR+ +L NI++
Sbjct: 331 YVNKELIHVSDWLPTFYAAAGGNLADLGPIDGVNQWPTLSLDTPSPRSEILVNINEQDNT 390
Query: 297 AALTVD--KYKLIKGTIYKGVWDNWYGPSGRE---GAYNASLLYDSHAGRILDKLNLMP- 350
++ D ++KL+ G G +D ++G SGR Y+ + S + +L P
Sbjct: 391 TSIITDNGRFKLVTGAFEGGTYDGYFGDSGRSPDTPPYDPFAVLQSETNIAIQELTQTPI 450
Query: 351 PKEKVMELRDEATVK-C-NDSIE-VIQCKPRDAPCVFNIDEDPCERRN 395
++++ R + + C NDS + C PC+F+++ DPCE N
Sbjct: 451 TRQQIRVTRAQIDLSWCRNDSFRPPLNC---SQPCLFDLENDPCETTN 495
>UniRef50_Q9VVM1 Cluster: CG7408-PB; n=2; Drosophila
melanogaster|Rep: CG7408-PB - Drosophila melanogaster
(Fruit fly)
Length = 585
Score = 310 bits (762), Expect = 4e-83
Identities = 166/453 (36%), Positives = 250/453 (55%), Gaps = 13/453 (2%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
MQH VI +P GLPLNE + + ++ GY+T L+GKWHLG ++ + P RGFD H+G+
Sbjct: 100 MQHYVIVNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGFDRHLGY 159
Query: 61 WTGRIDMYDHTTMEQ--GSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSH---NKS 115
+D Y + +Q G G DFR + HD G Y TD+ TD A+K + H N S
Sbjct: 160 LGAYVDYYTQSYEQQNKGYNGHDFRDSLKSTHDHVGHYVTDLLTDAAVKEIEDHGSKNSS 219
Query: 116 EPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKAL 175
+PLFL+L H A H+ N +P++AP + + F+YI + + +AA++S+LD+SVG V+ AL
Sbjct: 220 QPLFLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKTHRYYAAMVSRLDKSVGSVIDAL 279
Query: 176 HTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSK 235
+ +L+NSI++F +DNGGP G + ASNYPL+G KN+ WEG +R + +WS +
Sbjct: 280 ARQEMLQNSIILFLSDNGGPTQGQHSTTASNYPLRGQKNSPWEGALRSSAAIWSTEFERL 339
Query: 236 ARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWG 295
V Q+++I D LPTL +AAG +LDG+N W AL ES ++H ID+
Sbjct: 340 GSVWKQQIYIGDLLPTLAAAAGISPDPALHLDGLNLWSALKYGYESVEREIVHVIDEDVA 399
Query: 296 IAAL--TVDKYKLIKGTIYKGVWDNWYGPSGREGAYNASLLYDSHAGRILDKLNLMP--- 350
L T K+K+I GT +G++D W G ++ Y+ L L
Sbjct: 400 EPHLSYTRGKWKVISGTTNQGLYDGWLGHRETSEVDPRAVEYEELVRNTSVWLQLQQVSF 459
Query: 351 PKEKVMELRDEATVKCND-SIEVIQCKPRDAPCVFNIDEDPCERRNXXXXXXXXXXXXX- 408
+ + ELRD++ ++C D + V C P + PC+F+I+ DPCER N
Sbjct: 460 GERNISELRDQSRIECPDPATGVKPCLPLEGPCLFDIEADPCERSNLYAEYQNSTIFLDL 519
Query: 409 -XXMHKLNVSAVAPNAQPIDARGDPQYWGRVYT 440
+ + A PN +P D DP+++ +T
Sbjct: 520 WSRIQQFAKQAHPPNNKPGDPNCDPRFYHNEWT 552
>UniRef50_UPI00015B51A4 Cluster: PREDICTED: similar to arylsulfatase
b; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to
arylsulfatase b - Nasonia vitripennis
Length = 581
Score = 309 bits (759), Expect = 9e-83
Identities = 171/462 (37%), Positives = 260/462 (56%), Gaps = 31/462 (6%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
MQ + AEPRG+PLN ++P+ ++ LGY+T LVGKWHLG ++Y P+ RGFD+ G+
Sbjct: 100 MQGTPMRPAEPRGIPLNVSLMPEQMRRLGYETRLVGKWHLGYTTEDYTPVRRGFDTFFGY 159
Query: 61 WTGRIDMYDHTTMEQGS---WGTDFRR----GFEVAHDLFGVYATDVYTDEAIKVVNSHN 113
+ G I YD+ + G D R FE+AH Y TD+ TDEA K++ ++
Sbjct: 160 YNGFISYYDYWIGWNDTNEVTGYDLHRDESDSFELAHS--SEYFTDLITDEAEKIIRNNK 217
Query: 114 KSEPLFLMLAHSAVHSGNPY--EP--IRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVG 169
++PLFL ++H AVH+G+ +P +R + +F YI+D +K+A +++ LDESVG
Sbjct: 218 NAKPLFLEISHLAVHAGSKVHDDPLEVRRTDDVNASFPYIEDYQHRKYAGMMAALDESVG 277
Query: 170 KVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWS 229
+VVKAL +LENSI++F +DNG P G +N SNYP++G+K ++EG R A ++S
Sbjct: 278 RVVKALKEAEMLENSIIIFMSDNGAPTVGLYNNTGSNYPMRGIKGGMFEGAARAAACIFS 337
Query: 230 PLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLEN-------LDGVNQWDALSKNTESP 282
PL+ + +RV+ + MHI DWLPTLY+AAGG+ L++ LDGV+QW ++ S
Sbjct: 338 PLIKAHSRVSEELMHIVDWLPTLYTAAGGNPMDLQSQFDGALPLDGVSQWSSIVAGGPSS 397
Query: 283 RTSVLHNIDDIWGIAALTVDKYKLIKGTIYKGVWDNWYGPSGRE---GAYNASLLYDSHA 339
R S+L NID+ G A + ++KL+KG + D +YG SG + AYN + S A
Sbjct: 398 RQSLLVNIDEAQGFEAAIIGRHKLVKGMTKE---DGYYGNSGNDPSFPAYNVKKVLSSTA 454
Query: 340 GRILDKLN--LMPPKEKVMELRDEATVKCNDSIEVIQCKPRDAPCVFNIDEDPCERRNXX 397
G + KL P + + LR ++ + C C C+F++ +DPCE R+
Sbjct: 455 GASIGKLAGFASPSARRALWLRQKSVITCKPFTSAANC---SGTCLFDLSKDPCETRDLS 511
Query: 398 XXXXXXXXXXXXXMHKLNVSAVAPNAQPIDARGDPQYWGRVY 439
+ + + P DA G P+Y+ VY
Sbjct: 512 SKLPLIVKKLESFLGEYRRVLMPQTNSPQDACGLPKYFNGVY 553
>UniRef50_UPI0000DB708D Cluster: PREDICTED: similar to CG8646-PA;
n=1; Apis mellifera|Rep: PREDICTED: similar to CG8646-PA
- Apis mellifera
Length = 506
Score = 305 bits (749), Expect = 1e-81
Identities = 160/398 (40%), Positives = 239/398 (60%), Gaps = 32/398 (8%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
MQ + EPR +PLN +LP+YL+ LGY THLVGKWH+G Y + P RGFD+ G+
Sbjct: 73 MQGYPLKAGEPRAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPTRRGFDTFFGY 132
Query: 61 WTGRIDMYDHTTMEQGSWGTD--FRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPL 118
++G I ++HT + G D + ++ D Y TD+ T+ A ++ +H++ +PL
Sbjct: 133 YSGYISYFNHTIKQDDHIGYDLHYDNSKNLSIDYNFEYTTDLITERAENIIKNHDRRKPL 192
Query: 119 FLMLAHSAVHSGNPYE--PIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALH 176
+L L H A HS + E +R Q+ KYI+D R+K+A V++ +DESVG+V+KAL
Sbjct: 193 YLQLCHLAAHSSDAKEVMEVRDEQETNATLKYIEDYNRRKYAGVVTAMDESVGRVIKALG 252
Query: 177 TRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKA 236
+LENSI+VF +DNG G +N SNYPL+G+K TL+EGG+RG ++S L+ + +
Sbjct: 253 QSSMLENSIIVFISDNGAQTEGLLENYGSNYPLRGLKFTLFEGGIRGVACVYSRLIQNSS 312
Query: 237 RVAYQKMHISDWLPTLYSAAGGDLSVL-ENLDGVNQWDALSKNTESPRTSVLHNIDDIWG 295
R++ + MHI+DWLPT YSAAGG+L L EN+DGV+QWD + ES R SVL NID++
Sbjct: 313 RISNELMHITDWLPTFYSAAGGNLENLEENMDGVDQWDTIVSGKESKRESVLLNIDEVED 372
Query: 296 IAALTVDKYKLIKGTIYKGV-WDNWYGPSGREGAYNASLLYDSHAGRILDKLNLMPPKEK 354
+++ + KYKLI K + ++++YG +G +Y P+
Sbjct: 373 VSSALIGKYKLIING--KNIQYNDYYGDNGTSVSY---------------------PEYN 409
Query: 355 VMELRDEATVKCNDSIEVIQCKPRDAPCVFNIDEDPCE 392
V LR++A V CN+ +C + C+F+I DPCE
Sbjct: 410 VSSLRNKARVVCNNFTSYSKCVDK---CLFDIYNDPCE 444
>UniRef50_UPI0000D56521 Cluster: PREDICTED: similar to CG7402-PA;
n=1; Tribolium castaneum|Rep: PREDICTED: similar to
CG7402-PA - Tribolium castaneum
Length = 531
Score = 303 bits (743), Expect = 8e-81
Identities = 164/446 (36%), Positives = 246/446 (55%), Gaps = 30/446 (6%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
MQ + E R LPLN +P + ++LGYKTHLVGKWHLG+ KE PL +GFDSH G+
Sbjct: 89 MQGYPLKAGENRSLPLNMPTMPLHFQNLGYKTHLVGKWHLGAAYKEDTPLGKGFDSHFGY 148
Query: 61 WTGRIDMYDHTT---MEQGSW--GTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKS 115
W G + +D+ + M+ G+ G D FE G YAT+++T+ ++ V+ H+
Sbjct: 149 WNGFVGYFDYVSFSKMDNGTLVKGLDLHDQFEPVWGSQGRYATELFTERSLDVIEGHDVR 208
Query: 116 EPLFLMLAHSAVHSGNPYEPIRAPQ--KLIDAFKYIDDSARQKFAAVLSKLDESVGKVVK 173
PLFL+++H A H+G + P + F YI D R+ +A V+S LD S+G+++
Sbjct: 209 VPLFLVVSHLAAHTGQNGSELGVPDVDQTNHEFSYIQDPRRRLYAGVVSHLDASIGRIMA 268
Query: 174 ALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLD 233
L + +L+NSIV+F +DNG G +N+ SN+PL+GVK + +EGGVR A ++SPL
Sbjct: 269 KLDEKQMLDNSIVLFFSDNGAQTVGMYENSGSNWPLRGVKFSDFEGGVRVAATIYSPLFH 328
Query: 234 SKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDI 293
K V+ +HISDWLPTLYSAAGGD++ L +DG++QWDAL+ N S RT +L NID++
Sbjct: 329 KKGYVSEHLIHISDWLPTLYSAAGGDVAHLGQIDGIDQWDALTNNNPSNRTEILINIDEV 388
Query: 294 WGIAALTVDKYKLIKGTIYKGVWDNWYGPSGREGAYNASLLYDSHAGRILDKLNLMPPKE 353
A+ DK+KLI+G+ ++G +D +YG SGR G N P
Sbjct: 389 DENFAIIRDKFKLIQGSYHEGTFDQYYGDSGR-GPEN-------------------PTPN 428
Query: 354 KVMELRDEATVKCNDSIEVIQCKPRDAPCVFNIDEDPCERRNXXXXXXXXXXXXXXXMHK 413
D + + D ++ C C+F++D+DPCE N + +
Sbjct: 429 PNHTTTDLSWCRAPDQTPILNC---TKGCLFDLDKDPCETTNIIESEPEIANQLYEKIAQ 485
Query: 414 LNVSAVAPNAQPIDARGDPQYWGRVY 439
V + D + DP ++ +
Sbjct: 486 FWKELVPQRNKDTDPKSDPIFYNNTW 511
>UniRef50_A7SBG5 Cluster: Predicted protein; n=1; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 584
Score = 293 bits (720), Expect = 5e-78
Identities = 160/416 (38%), Positives = 226/416 (54%), Gaps = 38/416 (9%)
Query: 35 VGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFG 94
+G WHLG + KEY P+ RGFDS GFW + D ++H++ E WG D R E G
Sbjct: 90 LGMWHLGFFTKEYTPVYRGFDSFYGFWNAKTDYWNHSSYENNFWGVDLRDNMEPVQSEDG 149
Query: 95 VYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDA--------F 146
Y T+++T EA+KV+ +H+ S PLFL +AH AVH+ NP EP++APQ ID F
Sbjct: 150 TYGTELFTREAVKVIEAHDTSTPLFLYVAHQAVHTANPNEPLQAPQDKIDVSLKQRQQRF 209
Query: 147 K-YIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAAS 205
K IDD RQ +AA+++ LD+SVG + AL R +L +S+V+F+TDNGG G N N S
Sbjct: 210 KGTIDDDQRQVYAAMVTSLDQSVGDIFAALSKRHMLRDSVVIFTTDNGGAPYGLNWNRGS 269
Query: 206 NYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVL-E 264
N+PL+G K+ LWEGGV+G F++S L+ K RV+ + + ++DW+PT+Y AGG L
Sbjct: 270 NFPLRGGKDMLWEGGVKGVAFVYSDLIKQKGRVSKELIDVTDWVPTIYHLAGGTAEFLVP 329
Query: 265 NLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALTVDKYKLIKGT--IYKGVWDNWYGP 322
N+DG N W +S+ SPR +LHNID A L KYK+++G YKGV WY
Sbjct: 330 NMDGKNVWSTISEGAPSPRDEILHNIDPWRKFAGLRKGKYKIVQGMDDTYKGV--GWY-- 385
Query: 323 SGREGAYNASLLYDSHAGRILDKLNLMPPKEKVMELRDEATVKCNDSI-EVIQCKPRDAP 381
D + G L + K EL A + C + E +C D
Sbjct: 386 -------------DRYPGHALSSM-------KQPELLPGAVIDCKKTFDEERKCDSSDGK 425
Query: 382 -CVFNIDEDPCERRNXXXXXXXXXXXXXXXMHKLNVSAVAPNAQPIDARGDPQYWG 436
C+F+++EDPCE + + A+ P PI+ +P +G
Sbjct: 426 FCLFDMEEDPCEYHDLSNQLPEVLAEMKTRLEYYKNIALPPWFPPINKAANPANFG 481
>UniRef50_Q8MPH9 Cluster: Glucosinolate sulphatase; n=3; Plutella
xylostella|Rep: Glucosinolate sulphatase - Plutella
xylostella (Diamondback moth)
Length = 547
Score = 288 bits (706), Expect = 2e-76
Identities = 164/460 (35%), Positives = 253/460 (55%), Gaps = 20/460 (4%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
MQ + AE RG+PL E+++ QYL+D GY+T +VGKWH+G E LP RGF++H G
Sbjct: 88 MQGMPLSNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFFEQLPTYRGFENHFGV 147
Query: 61 WTGRIDMYDHTTMEQ--GSWGTDFRRGFEVAHDLFGV-YATDVYTDEAIKVVNSHNKSEP 117
G ID Y++ EQ G T ++ D Y TDVYT+++ ++ +HN SEP
Sbjct: 148 RGGFIDYYEYNAQEQLDGRPVTGLCLFDDLQPDWTTEGYITDVYTEKSTTIIENHNVSEP 207
Query: 118 LFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHT 177
L+L+L H A H+GN ++AP + + A ++++ R+ FAA++ KLD+S+G++V L
Sbjct: 208 LYLLLTHHAPHNGNEDASLQAPPEEVRAQRHVELHPRRIFAAMVKKLDDSIGEIVATLEK 267
Query: 178 RGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLW-----SPLL 232
+G+LEN+I+ FSTDNG P G N+ SNYPL+GVK + WEGG+RG +W +P
Sbjct: 268 KGMLENTIITFSTDNGAPTVGLGANSGSNYPLRGVKKSPWEGGIRGNAMIWAGPEVAPGN 327
Query: 233 DSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDD 292
+ +V MH +DW+PTL A G + LDG+ W + +N SPRT + IDD
Sbjct: 328 AWRGKVYDGNMHAADWVPTLLEAIGEKIPA--GLDGIPMWSHIIENKPSPRTEIF-EIDD 384
Query: 293 IWGIAALTVDKYKLIKGTI----YKGVWDNWYGPSGREGAYNASLLYDSHAGRILDKLNL 348
+ +++T+ ++KL+KGTI K ++ G G Y L DS A L+ + +
Sbjct: 385 YFNHSSVTLGRHKLVKGTIDESLSKHYGEDLRGIIGTPPDYKQK-LRDSKAWESLETIGI 443
Query: 349 MPPKEKVMELRDEATVKCNDSIEVIQCKP-RDAPCVFNIDEDPCERRNXXXXXXXXXXXX 407
P VM RDEA V C + + C P ++ C+++I EDPCE R+
Sbjct: 444 -PLDADVMADRDEAIVTCGNVVPK-PCSPSAESWCLYDIIEDPCELRDLSEELPQLAQIL 501
Query: 408 XXXMHKLNVSAVAPNAQPI-DARGDPQYWGRVYTNFGNYE 446
+ + + Q + D + P+Y+ + + + E
Sbjct: 502 LYRLEQEEAKIIPREGQYVADPKSAPKYFNYTWDAYLSVE 541
>UniRef50_P50429 Cluster: Arylsulfatase B precursor; n=17;
Eumetazoa|Rep: Arylsulfatase B precursor - Mus musculus
(Mouse)
Length = 534
Score = 266 bits (653), Expect = 6e-70
Identities = 130/297 (43%), Positives = 186/297 (62%), Gaps = 15/297 (5%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
+QH +I +P +PL+EK+LPQ LK+ GY TH+VGKWHLG Y+KE LP RGFD++ G+
Sbjct: 110 LQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGY 169
Query: 61 WTGRIDMYDHTTME--QGSWGT----DFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNK 114
G D Y H + GT D R G E A + +Y+T+++T A V+ +H
Sbjct: 170 LLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTKRATTVIANHPP 229
Query: 115 SEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKA 174
+PLFL LA +VH +P++ P++ ++ + +I D R+ +A ++S +DE+VG V KA
Sbjct: 230 EKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSLMDEAVGNVTKA 284
Query: 175 LHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDS 234
L + GL N++ +FSTDNGG + +N+PL+G K TLWEGG+RG GF+ SPLL
Sbjct: 285 LKSHGLWNNTVFIFSTDNGGQTR----SGGNNWPLRGRKGTLWEGGIRGTGFVASPLLKQ 340
Query: 235 KARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNID 291
K + + MHI+DWLPTL AGG + + LDG N W +S+ SPR +LHNID
Sbjct: 341 KGVKSRELMHITDWLPTLVDLAGGSTNGTKPLDGFNMWKTISEGHPSPRVELLHNID 397
>UniRef50_Q5FYB0 Cluster: Arylsulfatase J precursor; n=69;
Eumetazoa|Rep: Arylsulfatase J precursor - Homo sapiens
(Human)
Length = 599
Score = 259 bits (635), Expect = 9e-68
Identities = 129/297 (43%), Positives = 183/297 (61%), Gaps = 13/297 (4%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
+QH +I +P LPL+ LPQ LK++GY TH+VGKWHLG Y+KE +P RGFD+ G
Sbjct: 140 LQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGS 199
Query: 61 WTGRIDMYDHTTMEQ-GSWGTDFRRGFEVAHDLF-GVYATDVYTDEAIKVVNSHNKSEPL 118
G D Y H + G G D A D G+Y+T +YT +++ SHN ++P+
Sbjct: 200 LLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPI 259
Query: 119 FLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTR 178
FL +A+ AVHS P++AP + + ++ I + R+++AA+LS LDE++ V AL T
Sbjct: 260 FLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTY 314
Query: 179 GLLENSIVVFSTDNGG-PAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKAR 237
G NSI+++S+DNGG P AG SN+PL+G K T WEGG+R GF+ SPLL +K
Sbjct: 315 GFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGT 369
Query: 238 VAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIW 294
V + +HI+DW PTL S A G + LDG + W+ +S+ SPR +LHNID I+
Sbjct: 370 VCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSPRVDILHNIDPIY 426
>UniRef50_Q9NJU8 Cluster: Sulfatase 1; n=3; Coelomata|Rep: Sulfatase
1 - Helix pomatia (Roman snail) (Edible snail)
Length = 503
Score = 258 bits (632), Expect = 2e-67
Identities = 139/332 (41%), Positives = 191/332 (57%), Gaps = 26/332 (7%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
+QHG+I +P LP + L LK+ GY TH+VGKWHLG YK+EYLP NRGFD++ G+
Sbjct: 98 LQHGIINSCQPNALPNDSPTLADKLKESGYATHMVGKWHLGFYKQEYLPWNRGFDTYFGY 157
Query: 61 WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
D ++H + D R + G Y+ ++T +AI VV SHN S+PLFL
Sbjct: 158 LNAAEDYFNHNVPWRQVRYLDLRDNNGPVRNETGQYSAHLFTGKAIDVVQSHNTSKPLFL 217
Query: 121 MLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGL 180
LA+ +VH+ P+ P+K ++ I D R+ FA ++S LDE V + +AL +GL
Sbjct: 218 YLAYQSVHA-----PLEVPEKYEHKYRNITDKNRRTFAGMVSALDEGVANLTQALKDKGL 272
Query: 181 LENSIVVFSTDNGGPA-AGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVA 239
N++++FSTDNGG AG N NYPL+G K +LWEGG G GF+ L V+
Sbjct: 273 WNNTVLIFSTDNGGQIHAGGN-----NYPLRGWKASLWEGGFHGVGFVSGGALKRSGAVS 327
Query: 240 YQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIW---GI 296
+H+SDW PTL + AGG+L+ + LDG NQWD +S T SPR +LHNID ++ G+
Sbjct: 328 KGLIHVSDWFPTLVTLAGGNLNGTKPLDGFNQWDTISNETPSPREILLHNIDILYPQKGV 387
Query: 297 ------------AALTVDKYKLIKGTIYKGVW 316
AA+ V YKLI G G W
Sbjct: 388 PLYSNTWDTRVRAAIRVGDYKLITGDPGNGSW 419
>UniRef50_A7SRP2 Cluster: Predicted protein; n=2; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 491
Score = 257 bits (629), Expect = 5e-67
Identities = 125/293 (42%), Positives = 180/293 (61%), Gaps = 13/293 (4%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
+QHG+I+ P GLPLN +LPQ L+ GY TH++GKWHLG Y E P RGFD+ GF
Sbjct: 89 LQHGIIHNGRPYGLPLNLTLLPQKLRKAGYSTHMLGKWHLGFYNWESTPTYRGFDTFYGF 148
Query: 61 WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
++G + Y H Q + D R E+ D G Y+ ++T A ++V +H+ S PLF+
Sbjct: 149 YSGAENHYTHV---QDHY-LDLRDNEEIVRDQNGTYSAHLFTKRAEQIVRAHDPSTPLFM 204
Query: 121 MLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGL 180
+A VHS P++AP++ ID + +I D R+ +AA+++ +D+++G + +A GL
Sbjct: 205 YMAFQNVHS-----PVQAPKEYIDRYSFIKDPLRRTYAAMVTIMDDALGNLTRAFDKAGL 259
Query: 181 LENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAY 240
EN+I++FSTDNGG N +YPL+G K+TLWEGGVRG F+ L+
Sbjct: 260 WENTILIFSTDNGGVPK----NGGYDYPLRGRKDTLWEGGVRGVAFVHGVALEQSGVKCK 315
Query: 241 QKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDI 293
MH++DW PTL S AGG L E+LDG + W+++S ESPR +LHNID I
Sbjct: 316 ALMHVTDWYPTLVSLAGGSLDEDEDLDGYDVWESISHGVESPRKELLHNIDTI 368
>UniRef50_UPI0000586CBA Cluster: PREDICTED: similar to arylsulfatase
B; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
similar to arylsulfatase B - Strongylocentrotus
purpuratus
Length = 596
Score = 253 bits (620), Expect = 6e-66
Identities = 132/302 (43%), Positives = 183/302 (60%), Gaps = 21/302 (6%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
+QH VI +P LPLNE LPQ LK+ GY THLVGKWHLG YK E +PL RGFDS G+
Sbjct: 163 LQHFVIIAPQPNCLPLNETTLPQKLKESGYATHLVGKWHLGFYKNECMPLQRGFDSSFGY 222
Query: 61 WTGRIDMYDHTTM-------EQGSW-GTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSH 112
+G D + H E W G DF VA + G Y+ V+T+ A +V+ H
Sbjct: 223 LSGMQDYWTHFRSGSFPGFPEGNHWLGIDFWDNNRVAWEYTGNYSQFVFTERAQRVIQQH 282
Query: 113 NKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVV 172
N ++PLFL L +VH P++ P+K + + + D RQ +A +++ +DE+VGKVV
Sbjct: 283 NPNQPLFLYLPLQSVHG-----PLQVPEKYMKPYAHFQDVGRQTYAGMVATMDEAVGKVV 337
Query: 173 KALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLL 232
+L GL ++++VF+TDNGG + +N+PL+G KNTLWEGGV G GF+ P++
Sbjct: 338 DSLQEAGLWNDTVLVFTTDNGGTPG----KSGNNWPLRGTKNTLWEGGVHGVGFITGPMI 393
Query: 233 DS--KARVAYQKMHISDWLPTLY-SAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHN 289
+ + V+ MHISDW PTL AGG+ + L LD N W++++K T SPR +LHN
Sbjct: 394 PAGVQGTVSKHFMHISDWFPTLIEGVAGGNTAGLA-LDSYNMWNSITKGTPSPRKELLHN 452
Query: 290 ID 291
ID
Sbjct: 453 ID 454
>UniRef50_UPI0000E46777 Cluster: PREDICTED: similar to arylsulfatase
J; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED:
similar to arylsulfatase J - Strongylocentrotus
purpuratus
Length = 588
Score = 223 bits (544), Expect = 1e-56
Identities = 149/445 (33%), Positives = 227/445 (51%), Gaps = 36/445 (8%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
MQH ++ P LPL+E L Q LK GY TH VGKWHLG K+ LP RGF+S G
Sbjct: 162 MQHLNLFPGRPCCLPLDETTLAQALKKQGYSTHAVGKWHLGYAWKDCLPSRRGFESFFGN 221
Query: 61 WTGRIDMYDHTTM----EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE 116
G D + H ++ G + G ++T +YT+ A +++ +++
Sbjct: 222 IMGSADHWSHNKTALFGDKLVMGKSMYYNERIYWKHEGTFSTTLYTNRARQLIRKQPRNK 281
Query: 117 PLFLMLAHSAVHSGNPYEPIRAPQKLIDAFK-YIDDSARQKFAAVLSKLDESVGKVVKAL 175
PLFL L++ AVH+ P+ P++ ++ I +S R+++A +++ LDE+V V +AL
Sbjct: 282 PLFLYLSYEAVHT-----PLNVPEQYAKPYEGIIHNSKRRRYAGLVNILDEAVRNVTEAL 336
Query: 176 HTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLL--D 233
GL +NS+++F+TDNGG + +N+PL+G K+TLWEGG+RG GF+ SPL+ +
Sbjct: 337 KYNGLYDNSVIIFTTDNGGRPK--PRSVGNNWPLRGGKSTLWEGGIRGVGFVHSPLIPWE 394
Query: 234 SKARVAYQKMHISDWLPTLYSA-AGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDD 292
+ V Q +H+SDW PT+ AGG L + LDG +QW +SK TES R +LHNID
Sbjct: 395 LRGTVNRQLIHVSDWFPTIVXGIAGGKLVTNKPLDGXHQWKTISKGTESNRHEILHNIDP 454
Query: 293 IWGIAALTVDKYKLIKGTIYKGVWDNWYGPSGREGAYNASLLYDSHAGRILDKLNLMPPK 352
I+ A T + + G N +NA++ G KL+ P
Sbjct: 455 IYPAAHWTRENERDF------GALSNL--------PFNATMRASIRVGNW--KLSTGLPH 498
Query: 353 EKVMELRDEATVKCNDSIEVIQCKPRDAPCVFNIDEDPCERRNXXXXXXXXXXXXXXXMH 412
E E E+ + E+ + ++NI +DP ER+N +
Sbjct: 499 EDFWEPPKESEM----PPEMNDIRWSTPVRLYNIKKDPNERQNMAPYQKKIVYRLLKRLQ 554
Query: 413 KLNVSAVAP-NAQPIDARGDPQYWG 436
+AV P + P D RG+P+Y G
Sbjct: 555 DYQNTAVTPIHLGPKDERGNPKYHG 579
>UniRef50_A7S8Q2 Cluster: Predicted protein; n=2; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 540
Score = 223 bits (544), Expect = 1e-56
Identities = 114/292 (39%), Positives = 172/292 (58%), Gaps = 16/292 (5%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
MQH VI P G+P +PQ L+ LGY+T ++GKWHLG + +Y PL RGFDS +GF
Sbjct: 101 MQHFVINITSPWGMPRRFPTIPQKLRTLGYRTSMIGKWHLGFFDWDYTPLRRGFDSFLGF 160
Query: 61 WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
+ G D + H+ M DFRR E A++ G ++TDV+T EAI + HN S+PLFL
Sbjct: 161 FAGEQDHWRHSKM----GFLDFRRDEEPANEYGGQHSTDVFTQEAINIAMRHNASQPLFL 216
Query: 121 MLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGL 180
+L+++AVH+ P++A ++ + D RQ + ++ D S+G+++ GL
Sbjct: 217 LLSYAAVHT-----PLQAHPNDVNKIGGVSDKDRQNYLGMMGAADWSIGRLIDVYKRNGL 271
Query: 181 LENSIVVFSTDNGG-PAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVA 239
N+++++++DNG P G N+PL+G K++L+EGGVR F+ +L K
Sbjct: 272 WNNTLMIWASDNGAQPGKG----GGYNWPLRGYKSSLFEGGVRVPAFVHGEMLQRKGGTV 327
Query: 240 YQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNID 291
H++DW PTL AGG+ V ++DGV+QW LS+ S R +LHNID
Sbjct: 328 NDLFHVTDWYPTLVKLAGGE--VEPDIDGVDQWPTLSEGKPSKREEILHNID 377
>UniRef50_A7SPY2 Cluster: Predicted protein; n=4; Eumetazoa|Rep:
Predicted protein - Nematostella vectensis
Length = 270
Score = 214 bits (522), Expect = 5e-54
Identities = 94/198 (47%), Positives = 133/198 (67%), Gaps = 1/198 (0%)
Query: 3 HGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWT 62
H ++G +P GLPL E PQY+K LGY TH +GKWHLG ++KEY P RGFDS GFW
Sbjct: 73 HATVFGTQPYGLPLGETTTPQYMKSLGYVTHGIGKWHLGFFEKEYTPTYRGFDSFYGFWN 132
Query: 63 GRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLML 122
G+ D +DH++ E WGTD R + + G Y T+++ + A ++++ HN+++PL+L L
Sbjct: 133 GKEDYWDHSSQED-VWGTDLRDNEKPVRNESGHYGTELFAERAAQIIHLHNQTKPLYLYL 191
Query: 123 AHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLE 182
A VHS N EP++AP++LI F +I R+ +AA++S LDESV V KAL G+L
Sbjct: 192 AQQGVHSANGNEPLQAPKRLIKKFSHISSPKRRIYAAMVSSLDESVETVHKALSETGMLN 251
Query: 183 NSIVVFSTDNGGPAAGFN 200
N+++VF+TDNGG GFN
Sbjct: 252 NTVLVFTTDNGGAPRGFN 269
>UniRef50_A3ZMN6 Cluster: Arylsulfatase B; n=1; Blastopirellula
marina DSM 3645|Rep: Arylsulfatase B - Blastopirellula
marina DSM 3645
Length = 455
Score = 197 bits (481), Expect = 4e-49
Identities = 117/315 (37%), Positives = 173/315 (54%), Gaps = 21/315 (6%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
+Q GV+ GLPL+E+ L + L+D GY+T +VGKWHLG YLP+ RGFD G
Sbjct: 93 LQVGVVRPWADYGLPLDERTLAEALQDAGYETAIVGKWHLGHVSPAYLPMARGFDHQYGH 152
Query: 61 WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
+ G +D + H G D+ + V D YAT + EA++V+ +K +PLFL
Sbjct: 153 YNGALDYFTH----DRDGGHDWHKDDHVNRD--EGYATHLIAQEAVRVIQDRDKKKPLFL 206
Query: 121 MLAHSAVHSGNPYEPIRAPQKLIDAFKYID-DSARQKFAAVLSKLDESVGKVVKALHTRG 179
+ +AVHS P++ P+ A Y D RQ +A +++ LDE+VG++V + +
Sbjct: 207 YVPFNAVHS-----PLQVPESY--AAPYGDMKKRRQAYAGMVAALDEAVGQIVDEIQRQE 259
Query: 180 LLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARV 238
+L+N++ +FS+DNGGP G N PL+G K+TL+EGGVR F W + ++V
Sbjct: 260 MLDNTLFIFSSDNGGPEPG---KLTDNGPLRGGKHTLYEGGVRVCAFASWKGRIAPGSKV 316
Query: 239 AYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAA 298
+HI DW PTL AGG L + LDG N W +++ SP ++ NI G A
Sbjct: 317 E-APLHIVDWYPTLIELAGGSLQQAKPLDGRNIWPSITTGEPSPHDVIVCNITPTEG--A 373
Query: 299 LTVDKYKLIKGTIYK 313
+ V +KL+ I K
Sbjct: 374 IRVGDWKLVVHNIGK 388
>UniRef50_A7IPG5 Cluster: Sulfatase precursor; n=1; Xanthobacter
autotrophicus Py2|Rep: Sulfatase precursor -
Xanthobacter sp. (strain Py2)
Length = 491
Score = 186 bits (453), Expect = 1e-45
Identities = 113/316 (35%), Positives = 173/316 (54%), Gaps = 25/316 (7%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
+Q G I GL +E +LPQ LKD+GY+T LVGKWHLG +++ P RGFDS G
Sbjct: 113 LQVGAIPSGAKYGLATDEFLLPQALKDVGYRTALVGKWHLGHADQKFWPRQRGFDSFYGP 172
Query: 61 WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
G ID + H W D + E +D T+++ EA++++ +H+ PLFL
Sbjct: 173 LVGEIDHFKHEAHGVTDWYHDNTQVKEEGYD------TELFGKEAVRLIAAHDPKTPLFL 226
Query: 121 MLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGL 180
LA +A P+ P +APQ +D + +I R+ +AA+++ +D+ +G VV AL +RG+
Sbjct: 227 YLAFTA-----PHTPFQAPQSYLDQYAHIAAPQRRAYAAMITAMDDQIGHVVAALTSRGM 281
Query: 181 LENSIVVFSTDNG--------GPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPL 231
EN+++VF +DNG G A D ASN P + K +L+EGG R W
Sbjct: 282 RENTLIVFHSDNGGTRSKMFAGEGAVAGDLPASNAPYRDGKGSLYEGGTRVVALANWPGR 341
Query: 232 LDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNID 291
+ A A MH+ D LPTL AG L+ + LDGV+ W AL+ ++ R +++N++
Sbjct: 342 IAPGA--AEGVMHVVDMLPTLAKLAGASLAKSKPLDGVDVWPALAAG-QAGRAGIVYNVE 398
Query: 292 DIWGIAALTVDKYKLI 307
G A+ ++KL+
Sbjct: 399 PTQG--AVRDGRWKLV 412
>UniRef50_Q18837 Cluster: Sulfatase domain protein protein 3,
isoform a; n=2; Caenorhabditis elegans|Rep: Sulfatase
domain protein protein 3, isoform a - Caenorhabditis
elegans
Length = 488
Score = 179 bits (436), Expect = 1e-43
Identities = 109/324 (33%), Positives = 174/324 (53%), Gaps = 25/324 (7%)
Query: 2 QHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFW 61
Q+GV EP G+P L + ++ L Y T+LVGKWHLG KKE+LP NRGFD GF+
Sbjct: 98 QNGVFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYGFY 157
Query: 62 TGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLF-----------GVYATDVYTDEAIKVVN 110
+ ++H+ + +G ++ ++ GVY+TD++TD A+ V++
Sbjct: 158 GPQTGYFNHSADQYHRELKRVVKGLDLFEEVGSGKSVPDFSQNGVYSTDLFTDVAMSVLD 217
Query: 111 SHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAA--VLSKLDESV 168
+HN S+P F+ L++ AVH P + K I K R + +L+ +D ++
Sbjct: 218 NHNNSKPFFMFLSYQAVH---PPLQVSQQSKTIGQGKEATFILRSHAHSTRMLTAMDFAI 274
Query: 169 GKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLW 228
G++V+ L L EN+++VF++DNGG A + ASN PL+G K+T+WEGG + F+
Sbjct: 275 GRLVEYLKASNLYENTVIVFTSDNGGTA----NFGASNAPLRGEKDTIWEGGTKTTTFVH 330
Query: 229 SPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDAL-SKNTESPRTSVL 287
SP+ + H+ DW T+ S G L + DG+NQW+ L + + R +
Sbjct: 331 SPMYIEEGGTRDMMFHVVDWHATILSITG--LEIDSYGDGINQWEYLKTGRPKFRRFQFV 388
Query: 288 HNIDDIWGIAALTVDKYKLIKGTI 311
+NID+ +A+ YKLI G +
Sbjct: 389 YNIDNHG--SAIRDGDYKLIVGNV 410
>UniRef50_UPI0000587D99 Cluster: PREDICTED: similar to arylsulfatase
B; ARSB; n=1; Strongylocentrotus purpuratus|Rep:
PREDICTED: similar to arylsulfatase B; ARSB -
Strongylocentrotus purpuratus
Length = 365
Score = 163 bits (397), Expect = 7e-39
Identities = 93/239 (38%), Positives = 138/239 (57%), Gaps = 16/239 (6%)
Query: 59 GFWTGRIDMYDHTTMEQGSW-GTDFRRGFE-VAHDLFGVYATDVYTDEAIKVVNSHNKSE 116
GF+T + ++ +W G D R E VA D GVY+T ++T ++ ++ HN+S+
Sbjct: 15 GFYTHKHYGGHPGLVDSKNWSGYDLRDNLEQVAQDYQGVYSTHLFTQKSQNIIRRHNRSK 74
Query: 117 PLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALH 176
PLFL + AVH P+ P + ++ F YI D R+ +A ++ +DE+VG + + L
Sbjct: 75 PLFLYHSFQAVHY-----PLEVPPRYMEDFNYIADERRRTYAGMVKCMDEAVGNLTRTLK 129
Query: 177 TRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKA 236
GL N+I++FS+DNG A FN SN+PL+G+K +LWEGG++ GF+ SPLL
Sbjct: 130 KTGLWNNTIIIFSSDNG---ANFN-YGGSNWPLRGMKRSLWEGGIKSVGFIASPLLPKLV 185
Query: 237 R--VAYQKMHISDWLPTLY-SAAGGDLSVLENLDGVNQWDALSKNTES-PRTSVLHNID 291
R V H++DW PTL A G L +LDG N W L++ +S PR +LHNID
Sbjct: 186 RGTVNNNLFHVTDWFPTLVRGVARGSLKG-THLDGHNLWKHLTRGKDSWPRKEILHNID 243
>UniRef50_UPI0000E48607 Cluster: PREDICTED: similar to arylsulfatase
B; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED:
similar to arylsulfatase B - Strongylocentrotus
purpuratus
Length = 531
Score = 163 bits (395), Expect = 1e-38
Identities = 84/194 (43%), Positives = 120/194 (61%), Gaps = 9/194 (4%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
+Q+GVI A+P LPL+E LPQ LK+ Y TH+VGKWH+G YK P RGFDS+ G+
Sbjct: 97 LQYGVIRPAQPHCLPLDEVTLPQKLKERDYATHMVGKWHIGFYKDACTPTERGFDSYFGY 156
Query: 61 WTGRIDMYDHT-TMEQGS---WGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE 116
+G D Y H+ + + GS G D A G Y+T ++T +AI V+N+H +S+
Sbjct: 157 LSGAEDYYSHSRSFQIGSKTLKGLDLMANKTPAFQYKGQYSTHLFTSKAIDVINNHERSK 216
Query: 117 PLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALH 176
PLFL LA+ AVHS P++ P K + + I SAR+ +A ++S +DE +G V +AL
Sbjct: 217 PLFLYLAYQAVHS-----PLQVPSKYEEPYANITSSARRAYAGMVSCMDEGIGNVTRALV 271
Query: 177 TRGLLENSIVVFST 190
GL N+I++FST
Sbjct: 272 DAGLYNNTIIIFST 285
>UniRef50_UPI0000F20AE2 Cluster: PREDICTED: similar to Arylsulfatase
B precursor (ASB) (N-acetylgalactosamine-4-sulfatase)
(G4S), partial; n=1; Danio rerio|Rep: PREDICTED: similar
to Arylsulfatase B precursor (ASB)
(N-acetylgalactosamine-4-sulfatase) (G4S), partial -
Danio rerio
Length = 373
Score = 156 bits (379), Expect = 1e-36
Identities = 74/196 (37%), Positives = 118/196 (60%), Gaps = 11/196 (5%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
+QH +I+ +P +PL+EK+LPQ L++ GY TH+VGKWHLG ++K+ LP +RGF S G+
Sbjct: 183 LQHQIIWPCQPYCVPLDEKLLPQVLRERGYHTHMVGKWHLGMFQKDCLPTHRGFQSFFGY 242
Query: 61 WTGRIDMYDH------TTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNK 114
TG D Y H + D R G VA + G Y+T++ T+ A ++ H
Sbjct: 243 LTGSEDYYTHKRCSLIAPLNVTRCALDLRDGDAVALNYSGRYSTELLTERATHIITQHTP 302
Query: 115 SEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKA 174
+PLFL +A AVH+ P++ P + I + +I D R+++A ++S +DE+VG +
Sbjct: 303 DQPLFLYVALQAVHA-----PLQVPDRYIAPYSFIQDPHRRRYAGMVSAMDEAVGNITHT 357
Query: 175 LHTRGLLENSIVVFST 190
L GL +N++++FST
Sbjct: 358 LQETGLWDNTVLIFST 373
>UniRef50_A6DKP3 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
Lentisphaera araneosa HTCC2155|Rep:
N-acetylgalactosamine 6-sulfatase - Lentisphaera
araneosa HTCC2155
Length = 465
Score = 150 bits (363), Expect = 9e-35
Identities = 91/264 (34%), Positives = 143/264 (54%), Gaps = 21/264 (7%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMY---- 68
GLPL++K++P+ L GY T +VGKWH G K + P NRGF GF G I+ +
Sbjct: 106 GLPLSQKLIPEILVKEGYATGMVGKWHDGDQHK-FWPYNRGFQEFYGFNNGAINNWVLKG 164
Query: 69 -DHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAV 127
+HT E WG R V + G Y T+ + EA++ ++ H K+EP FL L+ +AV
Sbjct: 165 ENHTVDE---WGAVHRENKRVENS--GEYMTEAFGREAVEFIDRH-KTEPFFLYLSFNAV 218
Query: 128 HSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVV 187
H P++AP+ + FK+I R A+L +D+++G V++ L GL EN+I+
Sbjct: 219 HG-----PLQAPKSYTNQFKHIKPENRALCLAMLKSMDDNIGLVLEKLRKEGLEENTIIF 273
Query: 188 FSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVR-GAGFLWSPLLDSKARVAYQKMHIS 246
F++DNGG G N + N +G KNT+++GG+ W + ++ + +H
Sbjct: 274 FTSDNGGKLKG---NYSFNGKYRGEKNTVFDGGLHVPYAVQWKAQIPAQTKALEAPVHSI 330
Query: 247 DWLPTLYSAAGGDLSVLENLDGVN 270
D T+++AAG ++ LDG N
Sbjct: 331 DLAHTIFAAAGVEIKDEYKLDGRN 354
>UniRef50_A6DSH3 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
Lentisphaera araneosa HTCC2155|Rep:
N-acetylgalactosamine 6-sulfatase - Lentisphaera
araneosa HTCC2155
Length = 455
Score = 149 bits (362), Expect = 1e-34
Identities = 105/297 (35%), Positives = 152/297 (51%), Gaps = 22/297 (7%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFW-TGRIDMYDHT 71
G LN K +P YLK+ GYK+ GKWHLG ++ +Y PL+RGFD GF G D +
Sbjct: 102 GTDLNAKFIPNYLKEAGYKSMAFGKWHLG-HEMKYHPLHRGFDDFYGFMGRGAHDFFRLE 160
Query: 72 TMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGN 131
G +G RG E D Y T T+E +K + NK +P F +A++AVH+
Sbjct: 161 KEYDGKFGGPIYRGLEPIDD--KGYLTTRITEETVKFI-EENKDKPFFAYVAYNAVHT-- 215
Query: 132 PYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
P +AP + I A D R A+L LD VG++VK L + EN+I+++ +D
Sbjct: 216 ---PAQAPAEDIKAVS--GDETRDILVAMLKHLDLGVGEIVKTLKKHDIYENTIIIYLSD 270
Query: 192 NGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHIS-DWLP 250
NGG A+N PL+GVK+ +++GG+R FL S KA Q IS D LP
Sbjct: 271 NGGA----KSMVANNKPLRGVKHDIYDGGIR-VPFLMSWPAQIKAGQDTQSPVISLDILP 325
Query: 251 TLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALTVDKYKLI 307
TL AAG L L ++DG + + + ++ N D G + ++ +KL+
Sbjct: 326 TLLDAAG--LPALSDIDGESMLPVIRGDKDNLDRPFFWNHGD--GQTGIQLNNWKLV 378
>UniRef50_UPI0000660330 Cluster: Arylsulfatase I precursor (EC
3.1.6.-) (ASI).; n=1; Takifugu rubripes|Rep:
Arylsulfatase I precursor (EC 3.1.6.-) (ASI). - Takifugu
rubripes
Length = 620
Score = 149 bits (360), Expect = 2e-34
Identities = 80/193 (41%), Positives = 117/193 (60%), Gaps = 12/193 (6%)
Query: 76 GSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHSGNPYE 134
G G D G VA G Y+T ++T A K++ SHN +E PLFL+L+ AVH+
Sbjct: 200 GVCGYDLHDGEGVAWGQEGKYSTTLFTQRARKILESHNPTEKPLFLLLSLQAVHT----- 254
Query: 135 PIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGG 194
P++ P+ I ++ + + AR+K AA++S +DE+V V AL G NS++++STDNG
Sbjct: 255 PLQTPKSYIYPYRDMANIARRKLAAMVSTVDEAVRNVTYALRKYGFYRNSVIIYSTDNGA 314
Query: 195 -PAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLY 253
P G SN+PL+G K T WEGG+RG F+ SPLL + RV+ +HI+DW PTL
Sbjct: 315 QPFTG-----GSNWPLRGRKGTYWEGGIRGVAFVHSPLLKRRRRVSKALLHITDWFPTLV 369
Query: 254 SAAGGDLSVLENL 266
AGG++S + +
Sbjct: 370 GLAGGNISQVSGM 382
>UniRef50_A3HWU7 Cluster: N-acetylgalactosamine 6-sulfatase; n=2;
Bacteria|Rep: N-acetylgalactosamine 6-sulfatase -
Algoriphagus sp. PR1
Length = 472
Score = 143 bits (346), Expect = 1e-32
Identities = 83/258 (32%), Positives = 143/258 (55%), Gaps = 13/258 (5%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
G+PL++K + +L LGY L+GKWHLG + ++ PL RGFD G+ G D ++
Sbjct: 118 GMPLSQKTIADHLNKLGYVNGLIGKWHLGK-EPQFHPLKRGFDEFWGYTGGGHDYFESLP 176
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
+G + F+ + Y TD +E++ + H K EP FL A +A P
Sbjct: 177 NGKG-YKEPLESNFKTPDPI--TYITDDVGNESVDFIERH-KDEPFFLFAAFNA-----P 227
Query: 133 YEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDN 192
+ P++A ++ + +++I+D R+ +AA++ +LD +VGK++ +L +GL EN++VVF +DN
Sbjct: 228 HTPMQALEEDLALYQHIEDKKRRTYAAMVHRLDLNVGKIMTSLEEQGLSENTLVVFFSDN 287
Query: 193 GGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTL 252
GGP + NA+ N P +G K L EGG+ + P L + + +++ D +PT
Sbjct: 288 GGPT---DSNASLNAPYRGQKGILLEGGIHVPFVMNLPGLLPEGLIYQEQVTSLDVVPTF 344
Query: 253 YSAAGGDLSVLENLDGVN 270
+ AG + ++ GV+
Sbjct: 345 LALAGDTETSMDMFSGVD 362
>UniRef50_Q4SNM7 Cluster: Chromosome 15 SCAF14542, whole genome
shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome 15 SCAF14542, whole genome shotgun sequence -
Tetraodon nigroviridis (Green puffer)
Length = 650
Score = 142 bits (343), Expect = 2e-32
Identities = 77/191 (40%), Positives = 116/191 (60%), Gaps = 12/191 (6%)
Query: 76 GSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHSGNPYE 134
G G D G V G Y+T ++T A +++ SH+ +E PLFL+L+ AVH+
Sbjct: 198 GVCGYDLHDGEGVVWGQEGKYSTALFTRRARQILESHDPAERPLFLLLSLQAVHT----- 252
Query: 135 PIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGG 194
P++ P+ I ++ + + AR+K AA++S +DE+V V AL G NS++++STDNG
Sbjct: 253 PLQTPKSYIYPYRDMTNVARRKLAAMVSTVDEAVRNVTYALRKYGYYRNSVIIYSTDNGA 312
Query: 195 -PAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLY 253
P G SN+PL+G K T WEGG+RG F+ SPLL + RV+ +HI+DW PTL
Sbjct: 313 QPFTG-----GSNWPLRGRKGTYWEGGIRGVAFVHSPLLRRRRRVSKALLHITDWFPTLV 367
Query: 254 SAAGGDLSVLE 264
AGG++S ++
Sbjct: 368 GLAGGNVSQIQ 378
>UniRef50_Q15XG7 Cluster: Sulfatase precursor; n=2; Bacteria|Rep:
Sulfatase precursor - Pseudoalteromonas atlantica
(strain T6c / BAA-1087)
Length = 471
Score = 141 bits (342), Expect = 3e-32
Identities = 100/291 (34%), Positives = 153/291 (52%), Gaps = 20/291 (6%)
Query: 2 QHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFW 61
+H I GAE G+PL+E + Y+K LGY+T GKWHLG E P++RGFD GF
Sbjct: 104 EHSAIKGAE-MGIPLDEVTMGDYMKSLGYRTAFYGKWHLGG-TDELHPMHRGFDEFYGFR 161
Query: 62 TGRIDM--YDHTTMEQGSWG-TD--FRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE 116
G Y+ E+ S TD G + + G Y TDV ++A + + +
Sbjct: 162 GGDRSYWAYEVNAPERKSAVFTDKKLEHGIDQFQEHEG-YLTDVLAEKANQFIEKA-PDK 219
Query: 117 PLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALH 176
P F+ L+ +AVH+ P+ A + + F + R++ AA+ LD + G V+ L
Sbjct: 220 PFFIFLSFNAVHT-----PMEATPEDLAKFPQLKGK-RKEVAAMTLALDRASGAVLNKLK 273
Query: 177 TRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKA 236
GL ++++VVFS DNGGP + NA+SNYPL G K+ EGG+R + P +
Sbjct: 274 ELGLEDDTLVVFSNDNGGPT---DKNASSNYPLAGTKSNFLEGGIRVPFLVKWPAKLAAG 330
Query: 237 RVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALS-KNTESPRTSV 286
+V + + D LPT + A GG+ V+ LDGV+ ++ +N ++P S+
Sbjct: 331 KVYDKPVSTLDLLPTFFKAGGGE-EVMSELDGVDLMPYITGQNNKAPHESM 380
>UniRef50_A6DKC9 Cluster: Sulfatase; n=1; Lentisphaera araneosa
HTCC2155|Rep: Sulfatase - Lentisphaera araneosa HTCC2155
Length = 454
Score = 138 bits (335), Expect = 2e-31
Identities = 86/248 (34%), Positives = 132/248 (53%), Gaps = 19/248 (7%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGS---YKKEYLPLNRGFDSHVGFWTGRIDMYD 69
G+P K L QY ++ GY T L GKWHLG + K +P +RGFD G G +YD
Sbjct: 102 GMPREVKTLAQYFQEAGYATGLFGKWHLGGERLFDKTLMPTSRGFDEFFGILEGA-SLYD 160
Query: 70 HTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHS 129
T + + R+ + D G Y TD EA+ + + +P FL L +AVH+
Sbjct: 161 DTVNRERKY---IRQ--DTVIDYEGEYFTDAIGREAVSFI-TRKGDKPFFLYLPFTAVHA 214
Query: 130 GNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFS 189
P++A +K + F +I D R+ FAA+LS +D+++G+V AL +G+L+N+++VF
Sbjct: 215 -----PMQASEKYMQRFAHIADPNRRVFAAMLSAMDDNIGRVFDALEHQGILDNTLIVFW 269
Query: 190 TDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVR-GAGFLWSPLLDSKARVAYQKMHISDW 248
+DNGG ++N + N+PLKG K +EGG+R A W + Q + + D
Sbjct: 270 SDNGGKP---DNNYSLNHPLKGQKTQFYEGGIRVPACVRWPKGQIPAGKTLDQPVFLMDI 326
Query: 249 LPTLYSAA 256
P+ AA
Sbjct: 327 FPSALEAA 334
>UniRef50_Q7UGD7 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
Pirellula sp.|Rep: N-acetylgalactosamine 6-sulfatase -
Rhodopirellula baltica
Length = 543
Score = 136 bits (328), Expect = 1e-30
Identities = 88/274 (32%), Positives = 135/274 (49%), Gaps = 14/274 (5%)
Query: 7 YGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRID 66
+G + G+PL+E L LK+ GY T +GKWHLG K + P RGFD GF G
Sbjct: 122 HGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLGD-AKPFWPNRRGFDEWFGFSGGGFS 180
Query: 67 MYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSA 126
+ M+ G RG E + TD ++ EA+K + H ++EP FL LA++A
Sbjct: 181 YWGDLGMKDPLLGV--HRGDEPVDPKTLTHLTDDFSTEAVKFIQRH-ETEPFFLYLAYNA 237
Query: 127 VHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIV 186
P+ P A + + +I+ R + A+++ +DE +G+VV + GL EN+++
Sbjct: 238 -----PHAPDHATRAHLQKTAHIEYGGRAVYGAMVAGMDEGIGRVVDQIRESGLGENTMI 292
Query: 187 VFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHIS 246
+F +DNGG A N+P +G K L+EGG+R + P +
Sbjct: 293 IFYSDNGG-----RREHAVNFPYRGHKGMLFEGGIRVPFLVSWPGTVRSGMKEESPITAL 347
Query: 247 DWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTE 280
D PT +AAG D S + LDG N L+ + +
Sbjct: 348 DLFPTALAAAGMDPSQNDKLDGQNLLPVLTDDKQ 381
>UniRef50_A4CMB0 Cluster: Arylsulfatase A; n=5; Bacteria|Rep:
Arylsulfatase A - Robiginitalea biformata HTCC2501
Length = 492
Score = 134 bits (323), Expect = 6e-30
Identities = 99/302 (32%), Positives = 143/302 (47%), Gaps = 32/302 (10%)
Query: 4 GVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF--- 60
GV + G+P +E L + LK GY T +VGKWHLG +K+EYLP N GFD + G
Sbjct: 122 GVFFPDSHNGMPASEITLAEQLKKAGYATGMVGKWHLG-HKEEYLPPNHGFDDYFGIPYS 180
Query: 61 ----WTGRIDMYD---------HTTMEQGSWGTDFRRGFE-VAHDLFGVYATDVYTDEAI 106
+TG+ Y + +++ + RG E + + T Y DEA+
Sbjct: 181 NDMDFTGQFTSYQDYFGRYTERYESLKTEEYNVPLIRGTEEIERPVNQNTITKRYNDEAV 240
Query: 107 KVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDE 166
K + H K EP F+ LAHS H P D F+ SAR + V+ ++D
Sbjct: 241 KWIREH-KDEPFFMYLAHSLPH---------VPLFTSDEFR--GTSARGLYGDVVEEIDH 288
Query: 167 SVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGF 226
VG++++ L GL EN+IVVF++DN GP + S L+ K T WEGG+R
Sbjct: 289 GVGQIMELLEAEGLAENTIVVFTSDN-GPWLPTGISGGSAGLLREGKGTTWEGGMREPTI 347
Query: 227 LWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSV 286
W+P + A+V D T S AG + +DGV+ L + ESPR +
Sbjct: 348 FWAPGM-LPAKVVMDMGSTLDLFNTFSSLAGVPMPDDREMDGVDLSPILFGDAESPRKEM 406
Query: 287 LH 288
+
Sbjct: 407 FY 408
>UniRef50_A6DLE2 Cluster: Sulfatase; n=1; Lentisphaera araneosa
HTCC2155|Rep: Sulfatase - Lentisphaera araneosa HTCC2155
Length = 441
Score = 133 bits (322), Expect = 8e-30
Identities = 87/269 (32%), Positives = 139/269 (51%), Gaps = 14/269 (5%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
GLP+ E L LK+LGY TH +GKWHLG + P RGFD+ GF +G +
Sbjct: 105 GLPVTEITLADSLKELGYSTHCIGKWHLGE-ADHFHPNARGFDNFYGFLSGARTYFLGGE 163
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
+ +G R E A G Y T+V+T EAI+++ + +P F+ L+H+AVH
Sbjct: 164 L-RGDMDR-IMRNKEFAEPSSG-YTTEVFTQEAIRII-QEEQDKPFFIYLSHNAVHG--- 216
Query: 133 YEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDN 192
P+ A + I ++ + + R+K++ ++ LD+ G +++AL EN+++ F +DN
Sbjct: 217 --PMDAKDEDIMSYDF-KNPLRKKYSGLMKNLDDQTGLLLQALKDSKQYENTLIFFMSDN 273
Query: 193 GGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTL 252
GGP N +SN+PL+G K + +EGG R L P S + + + D T
Sbjct: 274 GGPT---THNGSSNWPLRGFKGSEFEGGNRTPFLLQWPEKISAGLSSDKPIIAYDVFATC 330
Query: 253 YSAAGGDLSVLENLDGVNQWDALSKNTES 281
AAGG+L G++ ++K E+
Sbjct: 331 IQAAGGELVTDRTYHGIDLLPVINKPQET 359
>UniRef50_Q7UYW3 Cluster: Arylsulfatase B; n=1; Pirellula sp.|Rep:
Arylsulfatase B - Rhodopirellula baltica
Length = 520
Score = 132 bits (319), Expect = 2e-29
Identities = 89/259 (34%), Positives = 132/259 (50%), Gaps = 23/259 (8%)
Query: 7 YGAEPR--GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGR 64
Y P GLP +EK L +L GY T L+GKWHLG + + P RGFD G TG
Sbjct: 133 YATRPELLGLPTSEKTLADHLGAAGYATALIGKWHLGMGEMHH-PNRRGFDHFCGMLTGS 191
Query: 65 IDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSH---NKSEPLFLM 121
Y TM+ R + D Y TD +TDE ++ ++ H N +P F+
Sbjct: 192 -HHYFPATMKHV-----IERNGKRVDDFSSEYLTDFFTDEGLRFIDQHKSANPDQPWFVF 245
Query: 122 LAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLL 181
+++A P+ P+ A + + F I + R+ +AA++ LD VG++ + L G
Sbjct: 246 FSYNA-----PHTPMHATEADLARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQW 300
Query: 182 ENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQ 241
EN+++VF +DNGG +N + N PL+GVK ++ EGG+R +W+ A V Y
Sbjct: 301 ENTLLVFFSDNGGA----TNNGSWNGPLRGVKGSMREGGIR-VPMIWTWPAKFPAGVLYD 355
Query: 242 KMHIS-DWLPTLYSAAGGD 259
+ S D LPT SAAG +
Sbjct: 356 GVVSSLDLLPTFCSAAGAE 374
>UniRef50_A0IXQ0 Cluster: Sulfatase; n=1; Shewanella woodyi ATCC
51908|Rep: Sulfatase - Shewanella woodyi ATCC 51908
Length = 379
Score = 132 bits (318), Expect = 2e-29
Identities = 97/279 (34%), Positives = 134/279 (48%), Gaps = 28/279 (10%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHL--GSYKKEYL----PLNRGFDSHVGFWTGRID 66
GLP+ E +L + GY+T VGKWHL G K Y PL+RGFD GF
Sbjct: 16 GLPVEENVLANNFRKAGYRTGAVGKWHLTKGEKKASYTLAQHPLDRGFDFFFGFDRSGTP 75
Query: 67 MYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSA 126
YD +E R+ + Y TD T+ AI +N +KS+P FL +A++A
Sbjct: 76 YYDSKILELN------RKPVKAEG-----YLTDQLTNHAIDFINQ-DKSKPFFLYMAYNA 123
Query: 127 VHSG-NPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSI 185
VH N P +Y+D F + L LD+ V K++K L + G L+N+I
Sbjct: 124 VHGPLNKAAPKEYQAPFNSGDRYLD-----YFYSYLYALDQGVAKIIKQLDSNGQLDNTI 178
Query: 186 VVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSP-LLDSKARVAYQKMH 244
++F +DNG P G +N P G K +W+GG R +W P L + RV +
Sbjct: 179 IMFLSDNGAP-GGKPFPLPANAPFTGYKGQVWQGGTRVPVVIWGPKALVNGGRVDDAVIS 237
Query: 245 ISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPR 283
D +PT +AAG DLS +NLDG N L + E R
Sbjct: 238 SMDLIPTALAAAGVDLS--DNLDGNNLLPKLKRVEEDER 274
>UniRef50_A6CAY0 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
n=1; Planctomyces maris DSM 8797|Rep:
N-acetylgalactosamine 6-sulfate sulfatase - Planctomyces
maris DSM 8797
Length = 466
Score = 130 bits (313), Expect = 1e-28
Identities = 102/308 (33%), Positives = 154/308 (50%), Gaps = 25/308 (8%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
GL +E ++P+YLK GY+T GKW++G + P RGFD GF G ID Y H
Sbjct: 114 GLRKSEVLIPEYLKQQGYRTACFGKWNVG-FSPGSRPTERGFDEFFGFAAGNIDYYHHYY 172
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVH--SG 130
+ D RG + + G Y+TD++ D A + +++ + +P F+ L +A H S
Sbjct: 173 AGRH----DLWRGLKEVF-VEG-YSTDLFADAACQYISAES-DQPFFIYLPFNAPHFPSQ 225
Query: 131 NPYEP-----IRAPQKLIDAFKYIDDSA--RQKFAAVLSKLDESVGKVVKALHTRGLLEN 183
+P +AP + + Y + ++++ AV++ LD ++G+V+K L T GL +
Sbjct: 226 RNKQPGQGNEWQAPDLAFEKYGYDPQTKNPQERYRAVVTALDSAIGRVLKQLDTSGLRDQ 285
Query: 184 SIVVFSTDNGGPAAGFND-NAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQK 242
+IV++ +DNG ASN PL+ TLWEGG+R + P KA Q
Sbjct: 286 TIVIWYSDNGAFMLKERGLEVASNKPLRDGGVTLWEGGIRVPAIIRYP-GHLKAGTVNQS 344
Query: 243 MHIS-DWLPTLYSAAGGDLSVLENLDGVNQWDALSKNT-ESPRTSVLHNIDDIWGIAALT 300
IS D LPTL + AGG L LDG + AL+ T PRT +A+
Sbjct: 345 PLISLDILPTLITLAGGPLPAERILDGQDMLPALAAQTAPEPRTFFF----QYRNFSAVR 400
Query: 301 VDKYKLIK 308
KYKL++
Sbjct: 401 RGKYKLVR 408
>UniRef50_A3XJJ9 Cluster: Arylsulfatase B; n=1; Leeuwenhoekiella
blandensis MED217|Rep: Arylsulfatase B -
Leeuwenhoekiella blandensis MED217
Length = 461
Score = 130 bits (313), Expect = 1e-28
Identities = 86/277 (31%), Positives = 144/277 (51%), Gaps = 26/277 (9%)
Query: 6 IYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRI 65
I G LP + LPQ L L YKT L+GKWHLG K E P GFD GF G++
Sbjct: 107 ISGRSELNLPDSITTLPQALSKLNYKTALMGKWHLG-LKPESGPEVYGFDFSYGFLHGQL 165
Query: 66 DMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHS 125
D Y HT S T +R G ++ + TD+ T A+ +++ + +L +A+S
Sbjct: 166 DQYAHTYKNGDS--TWYRNGKFISEK---GHVTDLLTQSAVHYIDTLQTDQNFYLQVAYS 220
Query: 126 AVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSI 185
A P+ P++ PQ+ ++ + I DS+R+ +AA ++ +D +G++++ L + L +N++
Sbjct: 221 A-----PHIPLQEPQEWLEKYTGIKDSSRRAYAAAMTHMDAGIGEILQKLKDKDLEKNTV 275
Query: 186 VVFSTDNGGPAA-----------GFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLD 233
V+F +DNG G N + SN PL+ K + +EG +R + W L+
Sbjct: 276 VLFVSDNGAQEKWVPNTQYDGKYGPNYSLGSNLPLRDFKTSNYEGALRVPAIISWPENLN 335
Query: 234 SKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
S Y ++++DW+PT + A + + ++GVN
Sbjct: 336 SGTSTNY--INVTDWMPTFLNWANAE-ELPSTVEGVN 369
>UniRef50_A0Z632 Cluster: Arylsulfatase B; n=1; marine gamma
proteobacterium HTCC2080|Rep: Arylsulfatase B - marine
gamma proteobacterium HTCC2080
Length = 545
Score = 130 bits (313), Expect = 1e-28
Identities = 94/303 (31%), Positives = 148/303 (48%), Gaps = 35/303 (11%)
Query: 3 HGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWT 62
+GVI+ + G+ +E +P+ + GY+T ++GKWHLG + Y P NRGF+ G
Sbjct: 99 YGVIFPWDNIGVHPDEHFMPETFQAAGYQTAIIGKWHLGHAQMTYHPNNRGFEHFYGHLH 158
Query: 63 GRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLML 122
+ Y + QG G DF+R V+ D G Y T + DE + + ++ P + +
Sbjct: 159 TEVGFYPPFS-NQG--GKDFQRN-GVSIDDQG-YETYLLADEVSRYIRERDRDRPFLVYM 213
Query: 123 AHSAVHSGNPYEPIRAPQKLIDAFKYI------------DD-----------SARQKFAA 159
A P+ P+ AP +L D +K I DD SAR +AA
Sbjct: 214 PFIA-----PHTPLDAPVELQDKYKDIETDLPMARSRQTDDTRLISRVMLQPSARPMYAA 268
Query: 160 VLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEG 219
V+ +D+++G+V+ L G+ +N+IV+F +DNGG A ++ A+N PL+G K +EG
Sbjct: 269 VVDAMDQAIGRVLDTLDQEGISDNTIVLFFSDNGG--AAYSYGGANNAPLRGGKGETFEG 326
Query: 220 GVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNT 279
G+R + P + ++ Q M + D PTL AA LDG + W AL
Sbjct: 327 GIRVTSLMRWPAMLEPGQIFEQIMSVMDVFPTLVDAADVRPGNNFALDGRSMWTALKSGD 386
Query: 280 ESP 282
+ P
Sbjct: 387 QVP 389
>UniRef50_A3ZLN5 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
n=1; Blastopirellula marina DSM 3645|Rep:
N-acetylgalactosamine 6-sulfate sulfatase -
Blastopirellula marina DSM 3645
Length = 468
Score = 129 bits (311), Expect = 2e-28
Identities = 99/313 (31%), Positives = 149/313 (47%), Gaps = 24/313 (7%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
G L E L LK GY + + GKW G K+ YLPL RGFD + GF +D + H
Sbjct: 127 GTDLQEVFLADVLKQAGYVSAVFGKWDGGQLKR-YLPLQRGFDQYYGFANTGVDYFTH-- 183
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
E+ + FR D G Y TD++ EAI+ ++ N P FL L +A HS +
Sbjct: 184 -ERYGVPSMFRDNQPTEEDK-GTYLTDLFEREAIRFIDE-NHDRPFFLYLPFNAPHSASN 240
Query: 133 YE-PIR----APQKLIDAF---KYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENS 184
+ IR APQ+ +D F + + RQ + A + ++DE++GKVV L + +N+
Sbjct: 241 LDRSIRGFAQAPQEYLDHFPGGESKQEKRRQAYLAAVERMDEAIGKVVDQLQQHQIADNT 300
Query: 185 IVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMH 244
+++F +DNGG A N PL+G K ++EGG R + P +V+ Q +
Sbjct: 301 LIIFLSDNGG------GGGADNSPLRGGKAKMFEGGNRVPCIVHWPGKVPAGKVSNQFLT 354
Query: 245 ISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALTVDKY 304
+ PT+ +A GG L DG + L+ SPR + G A V +
Sbjct: 355 SLEVFPTVIAAIGGKLPDDVIYDGFDMLPVLN-GASSPREEMFWKRR---GDVAARVGDW 410
Query: 305 KLIKGTIYKGVWD 317
K + KG++D
Sbjct: 411 KWVDSAAGKGLFD 423
>UniRef50_A0JAA8 Cluster: Sulfatase precursor; n=1; Shewanella
woodyi ATCC 51908|Rep: Sulfatase precursor - Shewanella
woodyi ATCC 51908
Length = 548
Score = 128 bits (308), Expect = 4e-28
Identities = 89/273 (32%), Positives = 139/273 (50%), Gaps = 25/273 (9%)
Query: 12 RGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDH- 70
RGLP +E ++P+ LK+ GY T +GKWHLG E +P +GFD + +G DH
Sbjct: 172 RGLPGSEILIPEILKESGYHTMHIGKWHLGR-SPEMMPNAQGFDESLMMDSGLYLPVDHP 230
Query: 71 ---------TTMEQGSWGTDFRRGFEVAHDLF--GVYATDVYTDEAIKVVNSHNKSEPLF 119
+ +++ W T ++F Y TD +T+EA K + + N + P F
Sbjct: 231 ESVNAPVESSGLDRFIWATMRYSVNWNGGEIFKPNGYLTDYFTEEAEKAIEA-NANRPFF 289
Query: 120 LMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRG 179
L LAH P+ P++A + +A I ++ +AA+L +D SV +V+ L +G
Sbjct: 290 LYLAH-----WGPHNPVQAKRADYEAVGDIQPHNKRVYAAMLRSIDRSVERVMAKLEKQG 344
Query: 180 LLENSIVVFSTDNGG-PAAGFNDNAASNYPLKGVKNTLWEGGVR-GAGFLWSPLLDSKAR 237
+ +N+IV+ S+DNGG ND N P +G KNT +EGG+R W ++D
Sbjct: 345 IADNTIVILSSDNGGADYVAIND---LNKPYRGWKNTFFEGGIRVPFSVTWPNVIDESTV 401
Query: 238 VAYQKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
+ HI D +PT+ + A DL +DGV+
Sbjct: 402 IEEPVNHI-DLMPTIINMANADLPQDREIDGVD 433
>UniRef50_Q8A219 Cluster: Arylsulfatase B; n=2; Bacteroides|Rep:
Arylsulfatase B - Bacteroides thetaiotaomicron
Length = 458
Score = 127 bits (306), Expect = 7e-28
Identities = 86/268 (32%), Positives = 137/268 (51%), Gaps = 27/268 (10%)
Query: 13 GLPLNEKILPQYLKDLGYKTH-LVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHT 71
GL NE+ L L GY ++GKWHLG +K + P+NRGF G G ID +DH
Sbjct: 102 GLDENEETLADMLARNGYSNRAIIGKWHLGHTRKVHYPINRGFSHFYGHLNGAIDYFDH- 160
Query: 72 TMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGN 131
M +G D+ +E +D Y+T++ T EA++ +N++ K P L +A++A
Sbjct: 161 -MREGE--LDWHNDWETCYD--KGYSTELITQEAVRCINTYEKEGPFLLYVAYNA----- 210
Query: 132 PYEPIRAPQKLIDAFKYIDD--------SARQKFAAVLSKLDESVGKVVKALHTRGLLEN 183
P+ P++A +K I+ Y DD R + A++S +D +G +V AL +G+++N
Sbjct: 211 PHTPLQAQEKDIEL--YCDDFGSLTPKEQKRVTYQAMVSCMDRGIGTIVDALKKKGIMDN 268
Query: 184 SIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVR-GAGFLWSPLLDSKARVAYQK 242
+ ++F +DN GPA +S+ L+G K W+GGVR A F W + ++ Q
Sbjct: 269 TFLIFFSDN-GPA---GVPGSSSGKLRGRKFDEWDGGVRVPAVFYWKRAESNYKNLSSQV 324
Query: 243 MHISDWLPTLYSAAGGDLSVLENLDGVN 270
D +PTL G DG++
Sbjct: 325 TGFVDIVPTLKELVGDKNRPERAYDGIS 352
>UniRef50_A0Z9E1 Cluster: Sulfatase family protein; n=3;
Proteobacteria|Rep: Sulfatase family protein - marine
gamma proteobacterium HTCC2080
Length = 558
Score = 127 bits (306), Expect = 7e-28
Identities = 92/276 (33%), Positives = 142/276 (51%), Gaps = 25/276 (9%)
Query: 10 EPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHV----GFWTGRI 65
E +GLP +E + + LK GY T +GKWHLG + P +GFD + G +
Sbjct: 175 ERKGLPASEVTIAETLKAKGYYTAHIGKWHLGR-ENGMAPHEQGFDDSLLMQSGMYLPEN 233
Query: 66 D------MYDHTTMEQGSW-GTDFRRGFEVAH-DLF--GVYATDVYTDEAIKVVNSHNKS 115
D +++ W G F + D F G Y TD +TDE+IKV+ + NK+
Sbjct: 234 DPNVVNAKVSFDPIDKFLWAGMGFSATYNSGEADKFKPGGYLTDYWTDESIKVIKA-NKN 292
Query: 116 EPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKAL 175
P FL LAH P+ P++A ++ DA + I+ ++ +A ++ +D SVG+++ L
Sbjct: 293 RPFFLYLAH-----WGPHTPLQATREDFDALEGIEPHRKRVYAGMIRAVDRSVGRILDTL 347
Query: 176 HTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDS 234
G+ N++VVF++DNGG AG+ N P +G K T++EGG+R F+ W +
Sbjct: 348 EEEGIANNTVVVFTSDNGG--AGYIGIPEVNSPFRGFKITMFEGGLRVPLFVRWPAKIAP 405
Query: 235 KARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
V HI D +PTL +AAG +DGV+
Sbjct: 406 GISVNEPVAHI-DVMPTLAAAAGASEPEGVVIDGVD 440
>UniRef50_Q1VP00 Cluster: Arylsulfatase B; n=1; Psychroflexus
torquis ATCC 700755|Rep: Arylsulfatase B - Psychroflexus
torquis ATCC 700755
Length = 386
Score = 126 bits (304), Expect = 1e-27
Identities = 84/218 (38%), Positives = 122/218 (55%), Gaps = 23/218 (10%)
Query: 17 NEKILPQYLKDLG-YKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQ 75
+E + + L+D G Y T L+GKWHLG + LP + GF++ +G G ID + TM
Sbjct: 109 HETTIAEVLRDEGAYDTALIGKWHLGHGDESMLPHHHGFNTFIGHTGGCIDFF---TMTY 165
Query: 76 GSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHN--KSEPLFLMLAHSAVHSGNPY 133
G D+ EV + YAT++ T+EAI ++ N ++EP FL LA++A H G Y
Sbjct: 166 GII-PDWYHQSEVVSE--NGYATELITEEAIAFLSERNQKRTEPFFLYLAYNAPHFGKGY 222
Query: 134 EPI-RAPQKLIDA-------FKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSI 185
P AP L+ +I+D R++FAA+ LD+ +G+V+ L L EN++
Sbjct: 223 SPSDEAPVNLMQPQAAELKRVHFIEDKIRREFAAMTVSLDDGIGQVLDCLEENDLKENTL 282
Query: 186 VVFSTDNGG-PAAGFNDNAASNYPLKGVKNTLWEGGVR 222
V+F TD+GG P G SN PL+G K TL+EGGVR
Sbjct: 283 VIFLTDHGGDPTYG-----GSNLPLRGDKATLFEGGVR 315
>UniRef50_A6DKB8 Cluster: N-acetylgalactosamine 6-sulfatase; n=3;
Lentisphaera araneosa HTCC2155|Rep:
N-acetylgalactosamine 6-sulfatase - Lentisphaera
araneosa HTCC2155
Length = 465
Score = 125 bits (301), Expect = 3e-27
Identities = 87/283 (30%), Positives = 144/283 (50%), Gaps = 22/283 (7%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTG--RIDMYD-- 69
LP +E + + L +GY ++GKWHLG+ + P RGFD G G R D
Sbjct: 102 LPKSEMTIAESLTQVGYHCGIIGKWHLGA-EPSLRPNKRGFDEFFGHLGGGHRFMPEDLV 160
Query: 70 --HTTMEQGSWGTDFRRGFEVAHDL---FGVYATDVYTDEAIKVVNSHNKSEPLFLMLAH 124
HT E+ D R + +D Y T+ ++DEA+ + N +P FL L++
Sbjct: 161 IQHT--EEVKNELDSYRSWITRNDTPVKTTKYLTEEFSDEAVSFI-KRNHQKPFFLFLSY 217
Query: 125 SAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENS 184
+A P+ P++A +K + F +I D R+ +AA++S +D+ V +V+++L + +N+
Sbjct: 218 NA-----PHLPLQATEKYLARFPHIKDPKRKTYAAMVSAVDDGVSQVMQSLKETNIADNT 272
Query: 185 IVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMH 244
IV F +DNGGP+ + N + N+PLKG K+ +WEGG R + P +V +
Sbjct: 273 IVFFLSDNGGPS---HKNKSDNFPLKGQKSDVWEGGFRVPFAMQYPAAIQAKQVYDHPVS 329
Query: 245 ISDWLPTLYSAAGGDLSVLENLDGVNQWDALS-KNTESPRTSV 286
D T+ S A + LDGVN ++ + T++P +
Sbjct: 330 SLDIFATIASLAQSPTHADKPLDGVNLIPFITGEKTQAPHAQI 372
>UniRef50_UPI0000E4801A Cluster: PREDICTED: similar to sulfatase 1
precursor; n=2; Strongylocentrotus purpuratus|Rep:
PREDICTED: similar to sulfatase 1 precursor -
Strongylocentrotus purpuratus
Length = 470
Score = 124 bits (298), Expect = 6e-27
Identities = 68/183 (37%), Positives = 109/183 (59%), Gaps = 14/183 (7%)
Query: 115 SEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFK-YIDDSARQKFAAVLSKLDESVGKVVK 173
++P+F+ L++ A P+ P P + +++ I++ R+ +A +++ LDES+GK+
Sbjct: 165 TKPMFMYLSYQA-----PHLPFEVPDEYFVSYRGKINNRNRRTYAGMVTMLDESIGKLTD 219
Query: 174 ALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLD 233
L GL +++ +FSTDNGG NA +N+PL+GVK +EGG+RG GF+ PLL
Sbjct: 220 TLKEEGLWNDTVFIFSTDNGGVG---KKNAGNNWPLRGVKGNYFEGGIRGVGFVAGPLLS 276
Query: 234 SKAR--VAYQKMHISDWLPTLY-SAAGGDLSVLE-NLDGVNQWDALSK-NTESPRTSVLH 288
+ + ++ MHISDW PTL A L+ E LDGVN WD +S+ + P +++
Sbjct: 277 TNVQGTISTDLMHISDWYPTLVEGVAKVTLNHTELGLDGVNMWDVISQGESGDPDREIVY 336
Query: 289 NID 291
NID
Sbjct: 337 NID 339
Score = 74.9 bits (176), Expect = 4e-12
Identities = 36/64 (56%), Positives = 40/64 (62%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
MQH VI PR LPL + + L D GY THLVGKWHLG YK+E PLNRGF S G
Sbjct: 94 MQHLVIDPRVPRCLPLGDDTMANKLTDAGYATHLVGKWHLGFYKQECWPLNRGFQSFFGM 153
Query: 61 WTGR 64
G+
Sbjct: 154 LLGQ 157
>UniRef50_A4XED5 Cluster: Sulfatase precursor; n=1; Novosphingobium
aromaticivorans DSM 12444|Rep: Sulfatase precursor -
Novosphingobium aromaticivorans (strain DSM 12444)
Length = 462
Score = 123 bits (297), Expect = 9e-27
Identities = 84/260 (32%), Positives = 132/260 (50%), Gaps = 13/260 (5%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
G+PL+ + +K LGY+T LVGKWHLG + PL G+D +G G D + H
Sbjct: 116 GVPLDRPTIASVMKALGYRTSLVGKWHLGE-PPAHGPLKHGYDHFLGIVEGGADYFVHRM 174
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVH-SGN 131
+ G + D G Y TD++ DEA++V+ ++P FL L +A H
Sbjct: 175 VMSGKPAGVGLAEDDAQTDRTG-YLTDIFGDEAVRVI-EEGGNQPFFLSLHFTAPHWPWE 232
Query: 132 PYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
E + + L +F Y + K+ ++ +D++V KV+ A+ G +N++VVF++D
Sbjct: 233 GREDEKLARALPSSFHY-EGGNLAKYREMVETMDQNVAKVLAAIDRSGKADNTVVVFTSD 291
Query: 192 NGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWLP 250
NGG F+D +P G K + EGGVR + W + + +R + Q M D+LP
Sbjct: 292 NGGER--FSD----TWPFVGHKGEVLEGGVRVPLMVRWPRRIKAGSR-SEQVMVSMDFLP 344
Query: 251 TLYSAAGGDLSVLENLDGVN 270
TL AGGD + + DG +
Sbjct: 345 TLLGMAGGDAARIGRFDGAD 364
>UniRef50_A5ZER6 Cluster: Putative uncharacterized protein; n=1;
Bacteroides caccae ATCC 43185|Rep: Putative
uncharacterized protein - Bacteroides caccae ATCC 43185
Length = 463
Score = 122 bits (295), Expect = 1e-26
Identities = 75/210 (35%), Positives = 113/210 (53%), Gaps = 16/210 (7%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
GLPL E+ + + K GY+T +GKWHLGS +++ P NRGFD G G D Y +
Sbjct: 108 GLPLEEETIAEVFKTNGYRTAAIGKWHLGSRDEQH-PNNRGFDLFYGMKAGGRD-YFYNE 165
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
+ G + F Y TD ++++A++ +N S+P + LA++AVH+
Sbjct: 166 KKSDRPGDERNLLLNDRQVKFEKYLTDAFSEKAVEFINE--SSQPFMMYLAYNAVHT--- 220
Query: 133 YEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDN 192
P++A + D K+ + RQK AA+ LD VG V++ L G +N+++ F +DN
Sbjct: 221 --PMQATDE--DMAKF-EGHPRQKLAAMTYALDRGVGTVIRGLKDSGKFDNTLIFFLSDN 275
Query: 193 GGPAAGFNDNAASNYPLKGVKNTLWEGGVR 222
GG N +SNYPLKG K +EGG R
Sbjct: 276 GGATT----NQSSNYPLKGFKGNKFEGGHR 301
>UniRef50_A6LED1 Cluster: Arylsulfatase A; n=1; Parabacteroides
distasonis ATCC 8503|Rep: Arylsulfatase A -
Parabacteroides distasonis (strain ATCC 8503 / DSM 20701
/ NCTC11152)
Length = 459
Score = 121 bits (292), Expect = 3e-26
Identities = 98/305 (32%), Positives = 145/305 (47%), Gaps = 28/305 (9%)
Query: 5 VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGR 64
V++ GL +E + + L+ GY T VGKWHLG++ YLP + GFD++ G
Sbjct: 104 VLFPNSKAGLGQDEVTIAKVLQQSGYATGCVGKWHLGAFSP-YLPTDHGFDTYFGIPYSN 162
Query: 65 IDMYDHTTMEQGSWGTDFRR------GFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPL 118
DM +G+ +F G ++ + T YT++A+ + +H+K EP
Sbjct: 163 -DM--SPVQNKGAHARNFPPTPLIVDGKQIESEPDQGELTRRYTEKAVSFIKNHSK-EPF 218
Query: 119 FLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTR 178
FL AH+ P+ P L ++ S R + V+ ++D SVG+V+KAL
Sbjct: 219 FLYFAHTF-----PHIP------LYTNARFEGTSKRGLYGDVVEEIDWSVGEVLKALREN 267
Query: 179 GLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARV 238
GL EN+ V+F++DN GP ++N S PLK K T WEGG R W P + A +
Sbjct: 268 GLDENTFVIFTSDN-GPWLTEHENGGSAGPLKDGKGTWWEGGFRVPAICWMPGKINPA-I 325
Query: 239 AYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAA 298
+ M D PT S AG + LDGVNQ L + S R V + WG
Sbjct: 326 NDEIMTSMDLYPTFLSMAGIEQPKDLVLDGVNQTGLLFEEKHSARDEVYY----WWGSEL 381
Query: 299 LTVDK 303
+ + K
Sbjct: 382 MAIRK 386
>UniRef50_A4GJF1 Cluster: Sulfatase; n=1; uncultured marine
bacterium EB0_50A10|Rep: Sulfatase - uncultured marine
bacterium EB0_50A10
Length = 544
Score = 121 bits (291), Expect = 5e-26
Identities = 82/282 (29%), Positives = 144/282 (51%), Gaps = 24/282 (8%)
Query: 12 RGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDH- 70
+G+P + + + L+D GY T +GKWHLG ++ P+++GF +G DH
Sbjct: 172 QGMPTEQITIAEVLRDAGYYTAHIGKWHLG-HEYGMDPMSQGFQDSLGLVGPLYLPEDHP 230
Query: 71 --------TTMEQGSWGT-DFRRGFEVAHDLFGV--YATDVYTDEAIKVVNSHNKSEPLF 119
T +++ WG + F DLF Y TD YTDEA+KV+ + NK+ P F
Sbjct: 231 DVVNAKFDTRIDKMIWGMGQYSANFN-GGDLFAPDKYVTDYYTDEALKVIEN-NKNRPFF 288
Query: 120 LMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRG 179
L L+H A+H NP + +R+ + ++ Q ++ +++ LD SVGK+++ L
Sbjct: 289 LYLSHWAIH--NPLQALRSD---FEQMSHMHGHNLQVYSGMINSLDRSVGKIIEKLKELD 343
Query: 180 LLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVA 239
+ ++++F++DNGG A + + N P +G K + ++GG+R + P + + +
Sbjct: 344 IYGKTLIIFTSDNGG--ANYIELNDINKPYRGWKISFFDGGIRVPYIISWPDEINPGKKS 401
Query: 240 YQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTES 281
+H D PT+ AAG + LDGV+ + ++ S
Sbjct: 402 ENAVHHFDIFPTILKAAG--IESTNELDGVDLMPFIKNDSSS 441
>UniRef50_A6CAW6 Cluster: N-acetylgalactosamine-4-sulfatase; n=1;
Planctomyces maris DSM 8797|Rep:
N-acetylgalactosamine-4-sulfatase - Planctomyces maris
DSM 8797
Length = 472
Score = 120 bits (289), Expect = 8e-26
Identities = 84/227 (37%), Positives = 124/227 (54%), Gaps = 21/227 (9%)
Query: 96 YATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQ 155
Y TD +T EA+ +N H + +P FL LA++AVHS P++ +K I F I+D RQ
Sbjct: 221 YLTDAFTREAVSFINRH-QDKPFFLYLAYNAVHS-----PLQGKKKDIQHFTQIEDIHRQ 274
Query: 156 KFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNT 215
FAA+LS +D+S+GK++K + GL E +++VF +DNGGP + +SN PL+G K +
Sbjct: 275 IFAAMLSSMDQSIGKILKQVQQSGLDEKTLIVFLSDNGGPT---RELTSSNLPLRGEKGS 331
Query: 216 LWEGGVRGAGFL--WSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWD 273
++EGG+R FL W+ L K + + D PT + AG L +NLDG N
Sbjct: 332 MYEGGLR-VPFLMRWTGTLAPKQTIDVPVSSL-DIFPTSVALAGASLP--QNLDGRNLLP 387
Query: 274 -ALSKNTESPRTSVLHNIDDIWGIAALTVDKYKLI--KGTIYKGVWD 317
L + TE P AAL +K++ +GT K VW+
Sbjct: 388 LLLQQKTELPVADFFWRQG---RKAALRSGDWKIVQMRGTREKPVWE 431
>UniRef50_A6C4L0 Cluster: N-acetylgalactosamine-6-sulfate sulfatase;
n=1; Planctomyces maris DSM 8797|Rep:
N-acetylgalactosamine-6-sulfate sulfatase - Planctomyces
maris DSM 8797
Length = 413
Score = 120 bits (289), Expect = 8e-26
Identities = 89/277 (32%), Positives = 138/277 (49%), Gaps = 26/277 (9%)
Query: 4 GVIYGAEPR-----GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHV 58
GV+Y A P+ GL NE L Q L+D GY+T + GKWHLG Y+++Y P RGF V
Sbjct: 63 GVVY-ANPKKNRHHGLQKNEITLAQCLQDAGYQTGMFGKWHLG-YQRQYNPTFRGFQQFV 120
Query: 59 GFWTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPL 118
G+ +G +D + H G+ D+ E+ + G Y T + D A++ + + +P
Sbjct: 121 GYVSGNVDYFAHL---DGTGVFDWWHNAELNREEQG-YVTHLINDHALEFIR-QQQEKPF 175
Query: 119 FLMLAHSAVHSGNPYE-PIRAPQKLIDAFKYIDDSARQKFA----AVLSKLDESVGKVVK 173
F+ +AH AVHS PY+ P P + + I + R+ A + +++D+ +G++V
Sbjct: 176 FVYIAHEAVHS--PYQGPHDQPMRK-EGGGDIKSAKRKDIANAYREMNTEMDKGIGQIVD 232
Query: 174 ALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLD 233
L L E + + F +DNG N N SN L+G K +LWEGG R P
Sbjct: 233 VLKEVNLTEKTFIFFLSDNGA-----NKN-GSNGKLRGFKGSLWEGGHRVPAIACWPGRI 286
Query: 234 SKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
+ V + + D +PT+ A + LDGV+
Sbjct: 287 PEGTVCDEPVISIDLMPTILELANAKIPAGHKLDGVS 323
>UniRef50_A6DKD8 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
Lentisphaera araneosa HTCC2155|Rep:
N-acetylgalactosamine 6-sulfatase - Lentisphaera
araneosa HTCC2155
Length = 455
Score = 118 bits (285), Expect = 2e-25
Identities = 86/292 (29%), Positives = 141/292 (48%), Gaps = 19/292 (6%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
G+PL+E+++ LK Y T ++GKWH+G E P R D + GF G +
Sbjct: 106 GIPLDEQMIFDLLKPAAYTTGVIGKWHMG-LSHEQRPTQRSVDYYYGFLNGAHSYREAKM 164
Query: 73 MEQGSWGT--DFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSG 130
+G+ T FR V F Y T+V+ DE + + NK +P FL +++++VH
Sbjct: 165 DMKGAPMTWPIFRNNEPVP---FSGYTTEVFNDEGVNFIK-RNKDKPFFLYMSYNSVHG- 219
Query: 131 NPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFST 190
P+E A K + +I R+ ++A+L +D+ VG++++ L G+ EN++V+F +
Sbjct: 220 -PWE---AQPKDLQRSDHIKKKWRRIYSAMLISMDDGVGRLIQTLKDEGIYENTLVIFMS 275
Query: 191 DNGGP-----AAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHI 245
DNG P A D ASN L+G K +EGG+R + P + K +
Sbjct: 276 DNGAPNNLHEAERAGDYLASNGSLRGRKGDTYEGGIRVPYIMSWPQVIPKQSTYQHPVSG 335
Query: 246 SDWLPTLYSAAGGDLSVLENLDGVNQWDALS-KNTESPRTSVLHNIDDIWGI 296
D +PTL + + L GVN ++ + T P ++ DD + I
Sbjct: 336 LDIVPTLIHISQA-APAKKELSGVNLMPYITGEKTSRPHKTLYWRRDDDYAI 386
>UniRef50_A6CD52 Cluster: Twin-arginine translocation pathway
signal; n=1; Planctomyces maris DSM 8797|Rep:
Twin-arginine translocation pathway signal -
Planctomyces maris DSM 8797
Length = 460
Score = 118 bits (284), Expect = 3e-25
Identities = 86/277 (31%), Positives = 131/277 (47%), Gaps = 20/277 (7%)
Query: 12 RGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHT 71
RG+ E + L+ GY+T L+GKWHLG + +LP GFD G G ID + T
Sbjct: 109 RGIQPGETTIADVLQQNGYQTALLGKWHLGHGTESFLPTAHGFDLFRGHTGGCIDYFTMT 168
Query: 72 TMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHSG 130
W + R H YATD+ T+EA + ++ P FL L+++A H G
Sbjct: 169 YGNIPDWYHNQR------HVSENGYATDLITEEAEHFLKDQQTTDKPFFLFLSYNAPHFG 222
Query: 131 NPYEP-IRAPQKLIDA-------FKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLE 182
+ P ++P ++ A I D R++FAA+ LD+ +G+V+ +L GL +
Sbjct: 223 KGWSPGDQSPVNIMQARGDDLKRVGTIKDKVRREFAAMTVSLDDGIGRVMSSLKNNGLDQ 282
Query: 183 NSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQK 242
N++V+F TD+GG +N P +G K TL+EGG+R + P +
Sbjct: 283 NTLVIFMTDHGGDYV----YGGNNQPFRGAKATLFEGGIRVPCIIRWPGKIKAGTETNEV 338
Query: 243 MHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNT 279
D PT+ A D L LDG + L++ T
Sbjct: 339 AWALDLFPTICHFANVDTDGL-TLDGKDISGLLTRQT 374
>UniRef50_A6C4W7 Cluster: Twin-arginine translocation pathway
signal; n=1; Planctomyces maris DSM 8797|Rep:
Twin-arginine translocation pathway signal -
Planctomyces maris DSM 8797
Length = 459
Score = 118 bits (283), Expect = 4e-25
Identities = 90/255 (35%), Positives = 130/255 (50%), Gaps = 19/255 (7%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
GLP + + LK GY T GKWHLG Y+ +LP N+GFD G +G DH T
Sbjct: 116 GLPHQAVTMAELLKQQGYATACFGKWHLG-YQPPWLPTNQGFDLFRGLTSGD---GDHHT 171
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVH---S 129
S D+ E++ + Y D+ + ++ + + N++ P FL + H A+H
Sbjct: 172 HVDRSGNEDWWHNNEISMEKG--YTADLLSKYSVAFMEA-NRTRPFFLYVPHLAIHFPWQ 228
Query: 130 GNPYEPIRAPQKLIDAFKY--IDD--SARQKFAAVLSKLDESVGKVVKALHTRGLLENSI 185
G P R + A K+ I D + A++ LD+SVGK++ AL L +N++
Sbjct: 229 GPQDPPHRKAGQDYHAGKWGIIPDPGNVSPHTTAMIESLDQSVGKILSALKRLDLEQNTL 288
Query: 186 VVFSTDNGGPAA-GFN-DNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQK 242
V+F++DNGG G N N +SN PL+G K TL+EGG R + W ++ A V Q
Sbjct: 289 VIFTSDNGGYLTYGKNFQNISSNGPLRGQKATLYEGGHRVPCLISWPGVI--TAGVTDQT 346
Query: 243 MHISDWLPTLYSAAG 257
H D LPTL AAG
Sbjct: 347 AHSVDLLPTLAQAAG 361
>UniRef50_Q7UX97 Cluster: Arylsulfatase B [Precursor]; n=1;
Pirellula sp.|Rep: Arylsulfatase B [Precursor] -
Rhodopirellula baltica
Length = 579
Score = 117 bits (282), Expect = 6e-25
Identities = 96/336 (28%), Positives = 157/336 (46%), Gaps = 49/336 (14%)
Query: 4 GVIYGAEPRGLPLNEKILPQYLKDLGYKTH-LVGKWHLGSYKKEYLPLNRGFDSHVGFWT 62
GV+ ++ GLP + P++L LGY + GKWHLG + PL+ G G +
Sbjct: 203 GVVSPSKKHGLPPQLETAPEHLSKLGYDHRAMFGKWHLGLASTLFHPLHHGMTEFYGHYN 262
Query: 63 GRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLML 122
G ID + Q W R F+ H+ Y+T++ + + ++ + + P++ +
Sbjct: 263 GAIDYFSRERFGQLDW----HRDFDSVHE--EGYSTELVGNAVVDFIDRNANAGPVYAYV 316
Query: 123 AHSAVHSGNPYEPIR--------------AP---QKLIDAFKYID-------DSARQKFA 158
A +A HS P + +R AP +K+ K +D +S RQ FA
Sbjct: 317 AFNAPHS--PLQALRSDLDEYGFDPNNKLAPNTDRKIAKREKALDYGKRGKGNSIRQTFA 374
Query: 159 AVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGG-PAAGFNDNAASNYPLKGVKNTLW 217
A+ + +D +G+++ A+ G+ EN++VVF +DNG P G N N PL+G K T W
Sbjct: 375 AMTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNGADPKHGGN-----NEPLRGNKFTTW 429
Query: 218 EGGVRGAGFLWSPLLDSKARVAYQKM-HISDWLPTLYSAAGGDLSVLENLDGVNQWDALS 276
EGGVR + P + A + Y + D LP++ AAG E DG+N LS
Sbjct: 430 EGGVRVVAMMRWP-NELPAGITYDSVTSYVDLLPSMVGAAGSPPP--EETDGINLLPFLS 486
Query: 277 KNTESPRTSVLHNIDDIWGIAALTVDKYKLIKGTIY 312
P ++L + + + D++KL G ++
Sbjct: 487 GKASPPERTILLDAETV------VSDRWKLKAGELF 516
>UniRef50_A0HG49 Cluster: Sulfatase; n=6; Comamonadaceae|Rep:
Sulfatase - Comamonas testosteroni KF-1
Length = 457
Score = 117 bits (282), Expect = 6e-25
Identities = 82/269 (30%), Positives = 132/269 (49%), Gaps = 15/269 (5%)
Query: 4 GVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTG 63
G + GA+ GLP + LK GY+T L+GKWHLG Y + PL G++ + G +G
Sbjct: 102 GTLLGAK-LGLPPEIPTVASLLKGAGYRTALIGKWHLG-YPPHFGPLRSGYEEYFGPMSG 159
Query: 64 RIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLML 122
+D + H + S D G E HD Y TD+ + ++ VN ++ + P FL L
Sbjct: 160 GVDYFTHLS---SSGQHDLWVGEEEHHD--EGYLTDLLSQRSVDFVNRMSEGDAPFFLSL 214
Query: 123 AHSAVH-SGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLL 181
++A H + ++L ++ ++ ++ +DE +G +V+AL G L
Sbjct: 215 HYTAPHWPWETRDDRETAEQLGAGITHLAGGNIHQYRRMIHHMDEGIGWIVEALRRNGQL 274
Query: 182 ENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQ 241
+N+++VF++DNGG F+D ++PL G K L EGG+R P + R + Q
Sbjct: 275 DNTLIVFTSDNGGER--FSD----SWPLVGGKMDLTEGGIRVPWIAHWPAVIEAHRSSAQ 328
Query: 242 KMHISDWLPTLYSAAGGDLSVLENLDGVN 270
DW T+ AAG LDG++
Sbjct: 329 PCMSMDWSATVLDAAGVSADPDYPLDGIS 357
>UniRef50_Q7UGB4 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
n=1; Pirellula sp.|Rep: N-acetylgalactosamine 6-sulfate
sulfatase - Rhodopirellula baltica
Length = 485
Score = 117 bits (281), Expect = 7e-25
Identities = 87/298 (29%), Positives = 138/298 (46%), Gaps = 39/298 (13%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
G+ E ILP L+ GYK+ + GKW LG+ ++ LP +RGFD GF ID + H
Sbjct: 124 GMDEREVILPAVLRPAGYKSGIFGKWDLGALQR-MLPTSRGFDDFYGFVNTGIDYFTHE- 181
Query: 73 MEQGSWGTD-FRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGN 131
+G R E G Y T ++ EA++ ++ H +EP FL + +A H+ +
Sbjct: 182 ----RYGVPCMVRNLEPTEADKGTYCTYLFQREALRFLDEHAGNEPFFLYVPFNAPHNSS 237
Query: 132 P------------------YEPIRAPQKLIDAFKY-------IDDSARQKFAAVLSKLDE 166
Y P+ ++ D ++Y + R+ + A ++ +D
Sbjct: 238 SLVPTIRSSVQAPDQFKAMYPPVEVETRVTDRYRYGSPATVATPQARRRDYRAAVTCMDA 297
Query: 167 SVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGF 226
++G+++ L + +L+ +IVVF +DNGG A N PL+G K WEGG+R
Sbjct: 298 AIGEILDRLEAKQMLDETIVVFFSDNGG------SGGADNSPLRGHKAQTWEGGIRVPCL 351
Query: 227 LWSPLLDSKARVAYQKMHIS-DWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPR 283
+ P A V + S + LP+ +AAG + LDG + W L ESPR
Sbjct: 352 VRWPAGQIPAGVVNDEFLTSLELLPSFAAAAGVEPPPGVVLDGFDWWPTLRGEAESPR 409
>UniRef50_Q3M597 Cluster: Twin-arginine translocation pathway signal
precursor; n=1; Anabaena variabilis ATCC 29413|Rep:
Twin-arginine translocation pathway signal precursor -
Anabaena variabilis (strain ATCC 29413 / PCC 7937)
Length = 457
Score = 116 bits (280), Expect = 1e-24
Identities = 80/259 (30%), Positives = 129/259 (49%), Gaps = 15/259 (5%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
G+P N+ + LK GY+T LVGKWH G Y + PL +GFD + G +G I+ + HT
Sbjct: 126 GIPANQPTIASLLKANGYETALVGKWHAG-YPPNFGPLQKGFDEYFGHLSGGIEYFTHTG 184
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
++ D +V G Y TD++TD A++ + + S P +L L ++A H
Sbjct: 185 TDRI---LDLYEN-DVPVQRSG-YVTDLFTDRAVEFIQRPH-SRPFYLSLHYNAPHWPWQ 238
Query: 133 YEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDN 192
+A Y ++ +AA++ LD+ VG+V+ AL G +N++V+F++DN
Sbjct: 239 GPNDQASTAFYLTNGYTVGGSQATYAAMVKSLDDGVGRVLDALEASGQADNTLVIFTSDN 298
Query: 193 GGPAAGFNDNAASNY-PLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPT 251
GG SN+ P +G K +L+EGG+R + P + +V+ Q + D T
Sbjct: 299 GG-------ERFSNFGPFRGQKASLYEGGIRVPAIIRYPGVTQANQVSNQVIITFDLTAT 351
Query: 252 LYSAAGGDLSVLENLDGVN 270
+ +A G DG N
Sbjct: 352 ILAATGTSFHPNYPPDGQN 370
>UniRef50_Q15XH3 Cluster: Sulfatase precursor; n=1;
Pseudoalteromonas atlantica T6c|Rep: Sulfatase precursor
- Pseudoalteromonas atlantica (strain T6c / BAA-1087)
Length = 500
Score = 116 bits (279), Expect = 1e-24
Identities = 90/313 (28%), Positives = 149/313 (47%), Gaps = 31/313 (9%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTG-------RI 65
G+ +E + Q +K GY T +GKWHLG EY P GFD GF G +
Sbjct: 118 GVSADELFIAQTMKSAGYFTGAMGKWHLGE-ASEYHPNKHGFDEFYGFLGGGHNYFPEQF 176
Query: 66 DMYDHTTMEQGSWGTDF------RRGFEVAHDLFGVYATDVYTDEAIKVVN-SHNKSEPL 118
+ + + QG + G EV Y TD + EA+ V+ + K +P
Sbjct: 177 EAAYNKRVAQGMTNINMYLTPLEHNGKEVRET---EYITDGLSREAVNFVDKAAAKKKPF 233
Query: 119 FLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTR 178
FL LA++A P+ P++A ++ + F I D R+ +A ++ +D VG++V+ L
Sbjct: 234 FLYLAYNA-----PHVPLQAKEEDMAMFSQIKDKKRRTYAGMVYAVDRGVGRIVEQLKKN 288
Query: 179 GLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKAR 237
G +N+++VF++DNGG A+NYPLK K ++ EGG R + W + + +R
Sbjct: 289 GQFDNTVIVFTSDNGGKLG----QGANNYPLKEGKGSVQEGGFRTPMLVHWPKHMKAGSR 344
Query: 238 VAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGI- 296
++ + + D PT G L + LDG + W + NT + ++ + G
Sbjct: 345 FSHPVLAL-DLYPTFAGLGGAVLPEDKKLDGKDIWADIQANTAPHKDEFIYVLRHRNGYS 403
Query: 297 -AALTVDKYKLIK 308
AA +++K +K
Sbjct: 404 DAAARRNQFKAVK 416
>UniRef50_Q1YSH0 Cluster: Sulfatase family protein; n=4; cellular
organisms|Rep: Sulfatase family protein - gamma
proteobacterium HTCC2207
Length = 557
Score = 116 bits (278), Expect = 2e-24
Identities = 89/290 (30%), Positives = 137/290 (47%), Gaps = 28/290 (9%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDS-------------HVG 59
G+P E + + L+ Y T +GKWHLGS + P +GFD H
Sbjct: 177 GMPAAEITIGEVLQQQDYYTAHIGKWHLGS-NGDMRPEQQGFDDSLSMKGIFYLPPDHPD 235
Query: 60 FWTGRI--DMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEP 117
+I D D GS+ + G + G Y TD +TD A+ V+ + N+ P
Sbjct: 236 VVNAKIPGDSIDSMVWAVGSYEVQWNGG--PPFEPKG-YLTDYFTDAAVDVIEA-NRHRP 291
Query: 118 LFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHT 177
FL LAH P+ P++A ++ DA +I D + +AA+L LD SV K+ +L
Sbjct: 292 FFLYLAH-----WGPHNPVQASREDYDALPHIKDHRLRTYAAMLRALDRSVEKIEASLQE 346
Query: 178 RGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKAR 237
GL +N++++F++DNGG AG+ D N P +G K T +EGG P +
Sbjct: 347 NGLSDNTLIIFTSDNGG--AGYLDLTDLNKPYRGWKLTHFEGGTHVPYMAKWPAQIEAGQ 404
Query: 238 VAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDAL-SKNTESPRTSV 286
+ + +H D T+ +AAG + LDGVN + K T +P ++
Sbjct: 405 SSDEAIHHIDMFHTIAAAAGASVPTDRTLDGVNLLPFMQGKQTGAPHKTL 454
>UniRef50_Q02AN8 Cluster: Sulfatase precursor; n=1; Solibacter
usitatus Ellin6076|Rep: Sulfatase precursor - Solibacter
usitatus (strain Ellin6076)
Length = 443
Score = 116 bits (278), Expect = 2e-24
Identities = 86/274 (31%), Positives = 131/274 (47%), Gaps = 24/274 (8%)
Query: 21 LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGT 80
L LK GY+T GKWHLGS E P GFDS GF +G +D Y H WG
Sbjct: 107 LASVLKGSGYQTGCFGKWHLGS-TDETAPTGHGFDSFYGFHSGCVDYYSHRFY----WGD 161
Query: 81 DFRRGFEVAHDLF--GVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRA 138
++ + ++F G Y T+ DEA + ++ P +A +A P+ P+ A
Sbjct: 162 NYHDLWHNRTEIFEDGRYLTERIADEAAGFIG---RNRPFLGYVAFNA-----PHYPMHA 213
Query: 139 PQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPA-- 196
P + F + RQ +AA+++ +D+ +G++ +AL T G EN+++ F DNG
Sbjct: 214 PAQYKARFPNLAPE-RQTYAAMIAAVDDGIGQIQRALETTGAAENTLMFFIGDNGATTEK 272
Query: 197 -AGFNDN---AASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTL 252
AG N + A N KG K +L++GG+ GF+ P K + D LPT+
Sbjct: 273 RAGLNGDFATAGDNGVFKGYKFSLFDGGMHVPGFVSWPAGIRKGGWTDELAMSMDILPTI 332
Query: 253 YSAAGGDLSVLENLDGVNQWDALSKNTESPRTSV 286
A G L +DG + + ++ N SP S+
Sbjct: 333 CRATGAPLP--PRVDGSDLLNTIASNAPSPHKSL 364
>UniRef50_A6C383 Cluster: Sulfatase; n=1; Planctomyces maris DSM
8797|Rep: Sulfatase - Planctomyces maris DSM 8797
Length = 405
Score = 115 bits (277), Expect = 2e-24
Identities = 82/287 (28%), Positives = 135/287 (47%), Gaps = 14/287 (4%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
G+P + + + ++ GY+T +GKWHLG Y E +P +GF++ G G ID Y H
Sbjct: 87 GMPTEQITIAEMMQQAGYQTAHIGKWHLG-YTPETMPHGQGFETSFGHMGGCIDNYSHFF 145
Query: 73 MEQGSWGTD-FRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGN 131
G D + G EV D G + D+ ++ + +P FL A +
Sbjct: 146 YWNGPNRHDLWENGKEVWRD--GAFFPDLMVEQCQDYIRKAG-DKPFFLYWAINV----- 197
Query: 132 PYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
P+ P++ +K + ++ S R K+AA +S +D+ +G+V+ L L E +I++F +D
Sbjct: 198 PHYPLQGKEKWRKTYAHLS-SPRDKYAAFVSTMDDCIGEVLATLDACQLREKTIIIFQSD 256
Query: 192 NG-GPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLP 250
+G S P +G K +L+EGG+R + P ++ V Q DWLP
Sbjct: 257 HGHSHEERTFGGGGSAGPYRGAKFSLFEGGIRVPAMISWPGTIAEGEVRDQLATGCDWLP 316
Query: 251 TLYSAAGGDLSVLENLDGVNQWDALSKNT-ESPRTSVLHNIDDIWGI 296
T+ + G L +LDG N + +T +SP + I W I
Sbjct: 317 TISALTGAPLPA-HHLDGKNLKAVIESSTAKSPHENFYWQIGKSWAI 362
>UniRef50_A6DG78 Cluster: Sulfatase; n=1; Lentisphaera araneosa
HTCC2155|Rep: Sulfatase - Lentisphaera araneosa HTCC2155
Length = 464
Score = 115 bits (276), Expect = 3e-24
Identities = 81/266 (30%), Positives = 129/266 (48%), Gaps = 33/266 (12%)
Query: 8 GAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDM 67
G + R + L E L + LKD GYKT L GKWHLG++ +Y P +GFD G G ID
Sbjct: 107 GPKTRNMNLEEYTLAEALKDSGYKTALFGKWHLGAH-LDYGPTKQGFDEFYGIRGGFIDN 165
Query: 68 YDHTTMEQGSWGTDFRRGFEVAHDLF--GVYATDVYTDEAIKVVNSHNKSEPLFLMLAHS 125
Y+H + G F +E ++F G Y ++ TD A+ ++ NK+ P FL LA +
Sbjct: 166 YNHYFLH----GEGFHDLYEGTKEVFDEGKYFPNLVTDRALNFID-RNKNNPFFLFLAFN 220
Query: 126 AVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSI 185
P+ P +A K + +K + RQ +A ++S D+ +G+++ L G+ +N+I
Sbjct: 221 I-----PHYPEQADPKFDERYKNM-KMPRQSYAKMISTTDDHMGQIMSKLQEHGIYDNTI 274
Query: 186 VVFSTDNG-----------GPAAGFNDN--------AASNYPLKGVKNTLWEGGVRGAGF 226
++F +DNG +G N + +G K+ +EGG+R
Sbjct: 275 IIFMSDNGHSRERNHIKFDNHKSGLAKNTKYGALGGGGNTGKWRGNKSNFYEGGIRVPAI 334
Query: 227 LWSPLLDSKARVAYQKMHISDWLPTL 252
+ P K V Q + DW+PT+
Sbjct: 335 ITFPNKLPKGAVRDQAITAMDWMPTV 360
>UniRef50_A4CMB1 Cluster: Arylsulphatase A; n=3; Bacteria|Rep:
Arylsulphatase A - Robiginitalea biformata HTCC2501
Length = 459
Score = 114 bits (275), Expect = 4e-24
Identities = 93/275 (33%), Positives = 141/275 (51%), Gaps = 25/275 (9%)
Query: 20 ILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMY-DHTTMEQGSW 78
++P L GY T ++GKWHLG + + P +RGF GF +D Y DH +G
Sbjct: 129 LIPSELNPAGYHTGIIGKWHLGLEEPD-TPNDRGFTYFKGFLGDMMDDYWDH---RRG-- 182
Query: 79 GTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHSGNPYEPIR 137
G ++ R D G +ATD++TD I + E P FL LA++A P+ PI+
Sbjct: 183 GINWMRLNREEIDPKG-HATDLFTDWTIDFLKERQGEEQPFFLYLAYNA-----PHFPIQ 236
Query: 138 APQKLIDAFKYIDDSARQKFA---AVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGG 194
P++ +D + + + +K A A + LD SVG+V++AL T GL EN++VVF +DNGG
Sbjct: 237 PPREWLDKVREREPNLTEKRAKNVAFVEHLDYSVGRVMEALKTTGLEENTLVVFVSDNGG 296
Query: 195 PAAGFNDNAASNYPLKGVKNTLWEGGVR-GAGFLWSPLLDSKARVAYQKMHISDWLPTLY 253
A + A SN PL+G K ++EGG+R A F W + + + + D PT
Sbjct: 297 -ALWY---AQSNGPLRGGKQDMYEGGIRVPAIFYWKGKI-APGTTSDNTALLMDLFPTFC 351
Query: 254 SAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLH 288
AG EN+DG++ L+ + L+
Sbjct: 352 ELAG--RKPPENVDGISLVPTLTGQAQDTANRYLY 384
>UniRef50_A6DKP2 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
Lentisphaera araneosa HTCC2155|Rep:
N-acetylgalactosamine 6-sulfatase - Lentisphaera
araneosa HTCC2155
Length = 446
Score = 113 bits (272), Expect = 9e-24
Identities = 83/266 (31%), Positives = 128/266 (48%), Gaps = 17/266 (6%)
Query: 5 VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGR 64
V G +G+P ++K + + LK GYK+ GKWHLGS KK P +RGFD+ GF G
Sbjct: 87 VTNGDADKGIPKSQKNIAELLKPAGYKSGAFGKWHLGS-KKGQFPNDRGFDTFYGFHFGA 145
Query: 65 IDMY--DHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLML 122
D Y D ++ ++ G Y T+ TD A++ + NK +P F+ +
Sbjct: 146 HDYYRADKKLNKKKKGYAPIYFNQDIVDYKEGDYLTEKITDHAVEFI-EENKDQPFFMYV 204
Query: 123 AHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLE 182
A+++VHS P + P + + + R+ F A++ +D+ VG++ L L E
Sbjct: 205 AYNSVHS-----PWQVPDEYLARIPESVPAYRRLFLAMVLAMDDGVGRIRAKLKELNLDE 259
Query: 183 NSIVVFSTDNGGPAAGFNDNAASNYPL------KGVKNTLWEGGVRGAGFLWSPLLDSKA 236
N+I VF+TDNG P G Y + +G K +EGG+R F S K+
Sbjct: 260 NTIFVFTTDNGSPKIGNKKPNEGQYRMSMSQGFRGYKGDTYEGGIR-VPFCMSWPKKIKS 318
Query: 237 RVAYQKMHIS-DWLPTLYSAAGGDLS 261
++ I+ D PT SAA + S
Sbjct: 319 GNKFEAPVIAYDLAPTFLSAASLEYS 344
>UniRef50_Q7UHJ9 Cluster: Iduronate-sulfatase or arylsulfatase A;
n=5; cellular organisms|Rep: Iduronate-sulfatase or
arylsulfatase A - Rhodopirellula baltica
Length = 1012
Score = 113 bits (271), Expect = 1e-23
Identities = 92/326 (28%), Positives = 155/326 (47%), Gaps = 20/326 (6%)
Query: 4 GVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF-WT 62
GV+ +P+GL +E + + LK GY+T + GKWHLG + E+LP +GFD G ++
Sbjct: 644 GVLLAGDPKGLHPDEITIAEVLKTAGYRTGMFGKWHLGD-QPEFLPTKQGFDEFFGIPYS 702
Query: 63 GRIDMYDHTTMEQGSWGTD--FRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
I + H + + + D + T T++A+ + NK +P FL
Sbjct: 703 HDIHPF-HPRQNHYHFPPLPLLQNDTVIEMDPDADFLTKRLTEQAVSFIE-RNKDQPFFL 760
Query: 121 MLAH----SAVHSGNPYEPIRAPQKLIDAFKY---IDDSARQK-FAAVLSKLDESVGKVV 172
L H + +H+ P+ A + K ID + R F ++++D SVG+++
Sbjct: 761 YLPHPIPHAPLHASPPFMEGVADDVIAAIEKEDGNIDYATRANLFRQAIAEIDWSVGQIL 820
Query: 173 KALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLL 232
AL + GL E ++V+F++DNG P N AS L+G K T +EGG+R + P
Sbjct: 821 DALRSNGLDEKTMVLFTSDNGPPK---NTLYASPGELRGHKGTTFEGGMREPTVVRWPGQ 877
Query: 233 DSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDD 292
+ M D LPT AG + +DG + W L T++P + ++ +
Sbjct: 878 IPAGHQNDELMTAMDLLPTFAKLAGAAIPTDRVIDGKDIWPTLKGETQTPHDAFFYHRGN 937
Query: 293 IWGIAALTVDKYKL-IKGTIYKGVWD 317
+AA+ K+KL + + K ++D
Sbjct: 938 --QLAAVRSGKWKLHVNNGVAKQLYD 961
Score = 57.2 bits (132), Expect = 8e-07
Identities = 58/206 (28%), Positives = 96/206 (46%), Gaps = 19/206 (9%)
Query: 89 AHDLFGVYATD-VYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFK 147
AH+++ T + T+ A+K + + K+EP FL A +H +P+ P AP+ FK
Sbjct: 233 AHEIYDDEKTGTLLTERAVKWI-TEKKNEPFFLYFATPNIH--HPFTP--APR-----FK 282
Query: 148 YIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGG--PAAGFNDNAAS 205
S + + +LD VG++V++L GL +N++V+F++DNG AG + A
Sbjct: 283 --GTSQCGLYGDFVHELDWMVGEIVQSLEDNGLTDNTLVLFTSDNGAMLNRAGRDAIKAG 340
Query: 206 NYP---LKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSV 262
+ P L G K +WEGG R P + Q + D T + ++
Sbjct: 341 HQPNGELLGFKFGVWEGGHRVPLIAKWPGKIKAGTQSDQLISQVDLFATFSALTEQEMPS 400
Query: 263 LENLDGVNQWDALSKNTESP-RTSVL 287
E D +N AL + P RT ++
Sbjct: 401 SEQKDSINMLPALLDDPNEPLRTELV 426
>UniRef50_Q2GB51 Cluster: Sulfatase; n=2; Proteobacteria|Rep:
Sulfatase - Novosphingobium aromaticivorans (strain DSM
12444)
Length = 491
Score = 113 bits (271), Expect = 1e-23
Identities = 90/280 (32%), Positives = 134/280 (47%), Gaps = 23/280 (8%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
GLP + LP L GY+T L+GKWHLGS ++ PL G+ + G +G +D Y H T
Sbjct: 135 GLPPSHPTLPSLLAKAGYRTSLIGKWHLGSL-PDFDPLKSGYQTFWGIRSGGVDYYTHAT 193
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHSGN 131
D E A Y TD+ D A+ + + E P F+ L +A H
Sbjct: 194 SNGQPDLWDGPTPVERAG-----YLTDLLADRAVSEIREASSGEAPWFMSLHFTAPHW-- 246
Query: 132 PYE-PIRAPQ-----KLID--AFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLEN 183
P+E P A + KL D A + D + +AA++ +LD +G+V++AL ++
Sbjct: 247 PWEGPDDASESARIAKLKDPSALFHFDGGSAAIYAAMVRRLDYQIGRVLEALKANRAEQD 306
Query: 184 SIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKM 243
+IVVF++DNGG F+D +P G K L EGG+R + P + + ++
Sbjct: 307 TIVVFTSDNGGER--FSD----TWPFSGRKTELLEGGLRIPAIVRWPGVTRAGTTSDAQI 360
Query: 244 HISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPR 283
DWLPT +AAG DGV+ AL + + R
Sbjct: 361 ISMDWLPTFLAAAGSAPDPGHPSDGVDVTPALGGGSLAER 400
>UniRef50_A6DMV0 Cluster: N-acetylgalactosamine-6-sulfate sulfatase;
n=1; Lentisphaera araneosa HTCC2155|Rep:
N-acetylgalactosamine-6-sulfate sulfatase - Lentisphaera
araneosa HTCC2155
Length = 443
Score = 112 bits (270), Expect = 2e-23
Identities = 92/296 (31%), Positives = 139/296 (46%), Gaps = 20/296 (6%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
GL + LP+ LK GYKT GKWHLGS K + P++ GFD + G G D Y +
Sbjct: 114 GLLPEKNHLPKLLKKAGYKTGAFGKWHLGSQDK-FNPIHHGFDEYYGPLLGHCDYYTYKY 172
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
+ R G +V D Y T + A+ ++ H +P F+ + H AVHS P
Sbjct: 173 YDD---TYTLREGAKVIKD--SGYLTTNINERAVDFIDRH-ADKPFFMYVPHMAVHS--P 224
Query: 133 YEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDN 192
Y+ K I ++D R +AA++ ++D+ V ++ L + + ++ V S+DN
Sbjct: 225 YQSADKKPKQITKTN-LNDGNRADYAAMVEEVDKGVEMIIAKLKEKKIFHKTLFVVSSDN 283
Query: 193 GGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTL 252
GG A F+DNA PL K TL+EGG+R + P K V+ Q D T
Sbjct: 284 GG--AHFSDNA----PLFHRKTTLFEGGIRVPCIMHWPEKIGKGVVSDQIAITMDLSKTF 337
Query: 253 YSAAGGDLSVLENLDGVNQWDALS-KNTESPRTSVLHNIDDIWGIAALTVDKYKLI 307
+ AG D + DG+N ++ KN + RT + A+ + K+K I
Sbjct: 338 LALAGID---EPSYDGINLLPMMTDKNNKVERTLFWRSNSKARRQKAVRMGKWKYI 390
>UniRef50_A4CGL5 Cluster: Arylsulfatase A; n=4; Bacteria|Rep:
Arylsulfatase A - Robiginitalea biformata HTCC2501
Length = 526
Score = 111 bits (267), Expect = 4e-23
Identities = 92/332 (27%), Positives = 148/332 (44%), Gaps = 20/332 (6%)
Query: 3 HGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWT 62
H + P GL E+ L + L+ GY+T + GKWHLG + ++LP GFD G
Sbjct: 141 HNALMPNSPVGLNPAEETLAELLRQQGYRTGIFGKWHLGDHP-DFLPTRHGFDEFFGIPY 199
Query: 63 GRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGV----YATDVYTDEAIKVVNSHNKSEPL 118
DM+ ++ + +E + + T T+ ++ +N H K EP
Sbjct: 200 SN-DMWPLHPLQGPVFDFGPLPLYEQERVVDTLEDQRLLTRQITERSVDFINRH-KEEPF 257
Query: 119 FLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTR 178
FL + H H P + DAF+ S R + V+ ++D SVG+V+ AL
Sbjct: 258 FLYVPHPQPH---------VPLFVSDAFR--GKSGRGLYGDVIMEIDWSVGQVLGALEDN 306
Query: 179 GLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARV 238
GL +++ V+F++DNG P + +++ PL+ K T WEGGVR + P + +V
Sbjct: 307 GLTDDTWVIFTSDNG-PWLAYGNHSGRAEPLREGKGTNWEGGVREPCIMKFPGRLPRGKV 365
Query: 239 AYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAA 298
+ + D LPT+ S G E +DG N W LS + + + A
Sbjct: 366 LDEPLMAIDLLPTIASVTGSPQPGRE-IDGKNAWGLLSGAEARGPQDAYYFYYRVNELQA 424
Query: 299 LTVDKYKLIKGTIYKGVWDNWYGPSGREGAYN 330
+ +KL+ Y+ + G G GAY+
Sbjct: 425 VRDGDWKLVLPHNYRTMQGQEPGADGLPGAYD 456
>UniRef50_A3I0L2 Cluster: Arylsulfatase A; n=2; Bacteroidetes|Rep:
Arylsulfatase A - Algoriphagus sp. PR1
Length = 481
Score = 111 bits (267), Expect = 4e-23
Identities = 90/268 (33%), Positives = 128/268 (47%), Gaps = 20/268 (7%)
Query: 4 GVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTG 63
G + + GL E + + LK GY T +VGKWHLG ++ +LP +GFDS+ G
Sbjct: 106 GALDHSAKHGLNPEETTIAEMLKANGYATGIVGKWHLG-HQAPFLPTEQGFDSYYGLPYS 164
Query: 64 RIDMYDHTTMEQGSWGT-DFRRGFEVAHDLFGV-YATDVYTDEAIKVVNSHNKSEPLFLM 121
DM+ H +G + V L T YT++A++ + + +K +P FL
Sbjct: 165 N-DMWPHHPEVKGYYPPLPLYENTAVIDTLDDQSMLTTNYTEKALEFIEN-SKDKPFFLY 222
Query: 122 LAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLL 181
LAHS H P + D FK S + V+ ++D SVG+V L GL
Sbjct: 223 LAHSMTH---------VPLYVSDKFK--GKSEHGLYGDVMMEVDWSVGQVRNKLDELGLA 271
Query: 182 ENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAG-FLWSPLLDSKARVAY 240
EN+IV+F++DNG P + +A LK K T W+GG+R G F+W P +V
Sbjct: 272 ENTIVIFTSDNG-PWLSYGGHAGLTGGLKEGKGTSWDGGIREPGIFVW-PDHFPAGKVET 329
Query: 241 QKMHISDWLPTLYSAAGGDLSVLENLDG 268
Q D LPTL G L L +DG
Sbjct: 330 QAAMTIDILPTLAEITGSKLPELP-IDG 356
>UniRef50_A0YAF7 Cluster: Arylsulfatase A; n=1; marine gamma
proteobacterium HTCC2143|Rep: Arylsulfatase A - marine
gamma proteobacterium HTCC2143
Length = 479
Score = 111 bits (267), Expect = 4e-23
Identities = 87/293 (29%), Positives = 137/293 (46%), Gaps = 24/293 (8%)
Query: 5 VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF-WTG 63
V++ GLP E + + LK+ Y+T LVGKWHLG + + PL+ GFD + G ++
Sbjct: 111 VLFPTSTGGLPTTEITIAKALKEKDYRTALVGKWHLG-HLPGFQPLDHGFDEYFGIPYSN 169
Query: 64 RIDMYDH-------TTMEQGSWGTDFRRGFEVAHDLFGVYA-TDVYTDEAIKVVNSHNKS 115
D+ T + G + + + T YT EA+ + N +
Sbjct: 170 DHDLKKELSYIQTITHAKDGDFNVPLMQNRSIIERPANQNTITKRYTQEAVSFIKK-NSN 228
Query: 116 EPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKAL 175
+P FL LAHS P+ P+ A D F+ D R + V+ ++D SVG+V+ L
Sbjct: 229 QPFFLYLAHSM-----PHVPLFAS----DQFRGSSD--RGLYGDVIEEIDWSVGQVLSTL 277
Query: 176 HTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSK 235
+G+ EN++VVF++DN GP + S LK K T +EGG+R W P K
Sbjct: 278 SEQGISENTLVVFTSDN-GPWLIMGAHGGSAGLLKSGKGTSYEGGMREPAIFWWP-EKIK 335
Query: 236 ARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLH 288
VA+ D PT+ S AG D+ + DG + + + + R ++ +
Sbjct: 336 PAVAHNTASTLDLFPTIMSIAGIDMPSDRSYDGYDLSPTMFEQKSNERKNIFY 388
>UniRef50_A6LDP6 Cluster: Arylsulfatase A; n=1; Parabacteroides
distasonis ATCC 8503|Rep: Arylsulfatase A -
Parabacteroides distasonis (strain ATCC 8503 / DSM 20701
/ NCTC11152)
Length = 452
Score = 110 bits (265), Expect = 6e-23
Identities = 82/286 (28%), Positives = 135/286 (47%), Gaps = 20/286 (6%)
Query: 2 QHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF- 60
Q V++ GLP E + + LK GY T +GKWHLG + EY+PL GFD G+
Sbjct: 96 QRRVLFPDSKGGLPSEELTIAELLKQAGYHTACIGKWHLG-HLPEYMPLRHGFDYFYGYP 154
Query: 61 WTGRIDMYDHTTMEQGSWGTDF---RRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEP 117
++ + + + + ++ + E+ + T T+ AI+ + S N++ P
Sbjct: 155 YSNDMSRKEQIKLGNTKYPYEYIIYEQEKELEREPQQYNLTQQVTEAAIRYIKS-NENSP 213
Query: 118 LFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHT 177
FL LAH P+ P+ A + + SAR K+ + +LD SVG++++ L +
Sbjct: 214 FFLYLAHPM-----PHMPVYA------STDFQGKSARGKYGDTVEELDWSVGQILQTLKS 262
Query: 178 RGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKAR 237
GL +N++V+F++DN GP S PLK K +++EGG R +W ++ K
Sbjct: 263 EGLDKNTLVIFTSDN-GPWLLCKQEGGSPGPLKDGKASMFEGGFRVPCIMWGAMV--KPG 319
Query: 238 VAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPR 283
D LPT AG L + DG++ + L + R
Sbjct: 320 YITDMASTLDLLPTFCEIAGIPLPSDRHYDGISLLNVLKDKSTCKR 365
>UniRef50_A6KZI6 Cluster: Sulfatase; n=2; Bacteroides|Rep: Sulfatase
- Bacteroides vulgatus (strain ATCC 8482 / DSM 1447 /
NCTC 11154)
Length = 473
Score = 110 bits (265), Expect = 6e-23
Identities = 82/275 (29%), Positives = 136/275 (49%), Gaps = 28/275 (10%)
Query: 21 LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRID---MYDHTTMEQGS 77
+ + L++ GY+ +GKWHLG + PL++GF +VG Y + ++
Sbjct: 130 MAEALQEQGYQCGHIGKWHLGDDEDGTGPLSQGFIWNVGGNRAGAPYSYFYPYCLPDKSK 189
Query: 78 WGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIR 137
G + G Y TD T+EA+ + SH++ P FL L+H AVH+ ++
Sbjct: 190 CHVGLEEG------ILGEYLTDRLTEEAVSFIKSHSEG-PFFLHLSHHAVHT-----VLQ 237
Query: 138 APQKLIDAFKYIDDSARQK---FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGG 194
AP LI+ ++ K +AA++ KLD+SVG++ + + T G+ + +IV+F +DNGG
Sbjct: 238 APDSLINKYRNKTPGKYHKNPIYAAMIEKLDDSVGRICQVIKTLGIADRTIVIFYSDNGG 297
Query: 195 PAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWLPTLY 253
++ NYPL G K +EGG R + W+ ++ R + + D+ PT
Sbjct: 298 -----SEPVTDNYPLNGGKGMPYEGGSRVPLIIRWTGKIEGGIRSSVPITGV-DFYPTFV 351
Query: 254 SAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLH 288
+ A G + NLDG + + L N E+ R H
Sbjct: 352 TLAQGKIPA--NLDGKDIF-TLINNNETERDLFWH 383
>UniRef50_Q7UKJ5 Cluster: Arylsulfatase A; n=3; Bacteria|Rep:
Arylsulfatase A - Rhodopirellula baltica
Length = 489
Score = 108 bits (260), Expect = 3e-22
Identities = 86/285 (30%), Positives = 134/285 (47%), Gaps = 31/285 (10%)
Query: 2 QHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF- 60
QH V++ GL +E + +LK GY T VGKWHLG +K E LP + GFDS+ G
Sbjct: 115 QH-VLFPQSTYGLHPDEVTIADHLKSAGYATACVGKWHLGHHK-ETLPTSNGFDSYYGIP 172
Query: 61 -----------WTGRIDMYDHTTMEQGS---WGTDFRRGFEVAH-DLFGVYATDVYTDEA 105
G++ D T + + W T + E+ + T YTD A
Sbjct: 173 YSNDMNHPDNKRLGKMSSDDRWTDQSSAVTLWNTPLVQDEEIIELPVDQRTVTRRYTDRA 232
Query: 106 IKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLD 165
I+ V + N+ +P FL L HS P+ P+ P+ D + D + + V+ +D
Sbjct: 233 IEFVEA-NQDKPFFLYLPHSM-----PHIPLYVPE---DVY---DPDPQNAYKCVIEHID 280
Query: 166 ESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAG 225
VG++V+ + GL E +++V+++DN GP F ++ S PL+ K T +EGG R
Sbjct: 281 TEVGRLVQTVRDLGLSEKTLIVYTSDN-GPWLQFKNHGGSAGPLRAGKGTTFEGGQRVPC 339
Query: 226 FLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
+W+P + D LPT+ S G L +DG++
Sbjct: 340 IMWAPGRIPAGTSSNAFATNMDLLPTIASFTGVALENDRKIDGID 384
>UniRef50_A6DPC8 Cluster: Arylsulfatase A; n=1; Lentisphaera
araneosa HTCC2155|Rep: Arylsulfatase A - Lentisphaera
araneosa HTCC2155
Length = 598
Score = 108 bits (259), Expect = 3e-22
Identities = 87/295 (29%), Positives = 140/295 (47%), Gaps = 44/295 (14%)
Query: 5 VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF---- 60
V Y +GL +E + + LK GY+T ++GKWHLG + ++LP N+GFDS+ G
Sbjct: 93 VYYPGASQGLKPSEITIAEVLKTAGYRTSIIGKWHLGD-RNQFLPTNQGFDSYFGIPFSN 151
Query: 61 --WTGR-------IDMYDHTTMEQGSWGTDFR------RGFEVA---------HDLFGVY 96
W + I ++ T+EQ G + RG +V + + Y
Sbjct: 152 DMWMSKDLALADDIKLFGGVTVEQIKSGEASKAVKGEKRGGKVPLMRDEEVVEYPVDQTY 211
Query: 97 ATDVYTDEAIKVVN-SHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQ 155
T YTDEA+K++ S K +P F+ LA++ P+ P+ A K + SAR
Sbjct: 212 ITQRYTDEALKIIKESEKKKQPYFIYLAYAM-----PHVPLYASPK------FAGKSARG 260
Query: 156 KFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNT 215
+ + ++D VG+++K L + G +N++V+F++DNG G + S PL+G K +
Sbjct: 261 PYGDTVEEMDYHVGRILKHLKSSGADKNTLVIFTSDNGPWNLG--ERGGSALPLRGAKFS 318
Query: 216 LWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
+EGG R +W P + + D++PT A L LDG N
Sbjct: 319 TYEGGHRVPCVMWWPGTIPAGTDSAEIATTLDFMPTFAKLANAQLP-NRTLDGKN 372
>UniRef50_A6DSP6 Cluster: Sulfatase; n=1; Lentisphaera araneosa
HTCC2155|Rep: Sulfatase - Lentisphaera araneosa HTCC2155
Length = 512
Score = 107 bits (258), Expect = 5e-22
Identities = 86/279 (30%), Positives = 132/279 (47%), Gaps = 40/279 (14%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
GLP ++ ++ + LK LGY ++GKWH+G + P RG+D GF G D Y T
Sbjct: 106 GLPQSQSMISEELKTLGYTNGMIGKWHMG-FDMSLRPNQRGYDFFYGFINGSHD-YTEWT 163
Query: 73 ME----QGSWGTDFRRGFEVAH-----DLF---GV------YATDVYTDEAIKVVNSHNK 114
E + W E A+ D+F GV Y TD++TDEA+ ++ N
Sbjct: 164 QEFAKGKSRWPIFRNEEMEPANKAQYIDVFKEKGVKVVDENYLTDLFTDEAVNFID-RNA 222
Query: 115 SEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYI-DDSARQKFAAVLSKLDESVGKVVK 173
+P FL LA++AVH P + Q +D ++ DD FA+++ +DE +GKV+K
Sbjct: 223 DKPFFLYLAYNAVH-----HPWQTTQHALDKTAHLKDDKNYHVFASMVYAMDEGIGKVMK 277
Query: 174 ALHTRGLLENSIVVFSTDNGGP-AAGFNDN------------AASNYPLKGVKNTLWEGG 220
L + + +N+I++F +DNG P G + +S +G K +EGG
Sbjct: 278 KLKEKNIDDNTIIIFLSDNGSPQGQGIEHSPKDPNRHRGGFTMSSTGIFRGYKGDTYEGG 337
Query: 221 VRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGD 259
+R + P K + D PTL AAGG+
Sbjct: 338 IRVPFCIKWPQQIQKGTKYDMPISALDLQPTLVKAAGGN 376
>UniRef50_A5FAW4 Cluster: Sulfatase precursor; n=1; Flavobacterium
johnsoniae UW101|Rep: Sulfatase precursor -
Flavobacterium johnsoniae UW101
Length = 539
Score = 107 bits (258), Expect = 5e-22
Identities = 85/289 (29%), Positives = 135/289 (46%), Gaps = 32/289 (11%)
Query: 12 RGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTG-------- 63
+GLP +E K GY T ++GKWHLG + K + PL+RGFD H GF+
Sbjct: 173 QGLPKSEITFADLAKKQGYSTAIIGKWHLG-HTKGFFPLDRGFDYHYGFYQAFSLFAPED 231
Query: 64 ---------RIDMYDHTTMEQGSWGT-DFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHN 113
D D T G GT RR + + Y T+ + +EA ++ N
Sbjct: 232 NNPDIINHHHTDFTDKTIWGNGRVGTGQIRRDSTIIDEK--KYLTEKFAEEAEAFIDK-N 288
Query: 114 KSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVK 173
K++P L + +A P+ P + +K D F + D ++ + A++S LD+++G +
Sbjct: 289 KNKPFLLYVPFNA-----PHTPFQVRKKYYDRFPNVKDENKRVYFAMISALDDAIGLIRA 343
Query: 174 ALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLD 233
+ GL EN+++ F++DNGG + A +N PLKG K + +EGGV F S
Sbjct: 344 KVKKEGLEENTLIFFASDNGGADYTY---ATTNAPLKGGKFSHFEGGV-NVPFALSWKGK 399
Query: 234 SKARVAYQKMHIS-DWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTES 281
K Y+ S D T+ + L DGV+ D ++ N ++
Sbjct: 400 IKPHTIYKTPVSSLDIFSTIAAVTHSGLPKDRVYDGVDLVDVVNNNKQA 448
>UniRef50_A3HZ22 Cluster: Putative exported uslfatase; n=1;
Algoriphagus sp. PR1|Rep: Putative exported uslfatase -
Algoriphagus sp. PR1
Length = 489
Score = 107 bits (257), Expect = 6e-22
Identities = 77/260 (29%), Positives = 130/260 (50%), Gaps = 14/260 (5%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWT-GRIDMYDHTT 72
LPL E + + +K GY T VGKWHLG ++ + P ++GFD ++G G+ Y
Sbjct: 146 LPLEEITIAERMKAHGYGTLHVGKWHLG--EEGFYPEDQGFDVNIGGNDLGQPPSYFDPY 203
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
+ +F + G + TD DE + + + K + F+ A AVH+
Sbjct: 204 LPAKP--REFYEITTLKPRKEGEFLTDREGDEVVNYIQNQ-KGKKFFVHWAPYAVHT--- 257
Query: 133 YEPIRAPQKLIDAFKYIDDSARQK--FAAVLSKLDESVGKVVKALHTRGLLENSIVVFST 190
PI L++ ++ + ++ +AA++ +D++VGKV+ L GL EN++V+F++
Sbjct: 258 --PIMGKPDLVEKYEQKEPGNQRNPVYAALVESVDQNVGKVLSELERMGLRENTLVIFTS 315
Query: 191 DNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLP 250
DNGG +++ +NYPLK K +EGG+R + P + V + DW+P
Sbjct: 316 DNGGLIGNYDNPITNNYPLKSQKGYPYEGGIRIPTIVSWPGKIPQGFVDETPIITMDWIP 375
Query: 251 TLYSAAGGDLSVLENLDGVN 270
T+ G D L L+GV+
Sbjct: 376 TILDFMGED-PTLPELEGVS 394
>UniRef50_P34059 Cluster: N-acetylgalactosamine-6-sulfatase
precursor; n=32; Deuterostomia|Rep:
N-acetylgalactosamine-6-sulfatase precursor - Homo
sapiens (Human)
Length = 522
Score = 107 bits (257), Expect = 6e-22
Identities = 84/313 (26%), Positives = 140/313 (44%), Gaps = 22/313 (7%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
G+P +E++LP+ LK GY + +VGKWHLG ++ ++ PL GFD G YD+
Sbjct: 116 GIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKA 174
Query: 73 MEQ----GSWGTDFRRGFEVAHDLFGVYA--TDVYTDEAIKVVNSHNKSEPLFLMLAHSA 126
W R E +L A T +Y EA+ + + P FL A A
Sbjct: 175 RPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDA 234
Query: 127 VHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIV 186
H+ P+ A + ++ S R ++ + ++D+S+GK+++ L + +N+ V
Sbjct: 235 THA-----PVYASKP------FLGTSQRGRYGDAVREIDDSIGKILELLQDLHVADNTFV 283
Query: 187 VFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHIS 246
F++DNG + SN P K T +EGG+R W P + +V++Q I
Sbjct: 284 FFTSDNGAALISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIM 343
Query: 247 DWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALTVDKYKL 306
D T + AG +DG+N L + R + D + A T+ ++K
Sbjct: 344 DLFTTSLALAGLTPPSDRAIDGLNLLPTLLQGRLMDRPIFYYRGDT---LMAATLGQHKA 400
Query: 307 IKGTIYKGVWDNW 319
T + W+N+
Sbjct: 401 HFWT-WTNSWENF 412
>UniRef50_A3ZUT0 Cluster: Arylsulphatase A; n=1; Blastopirellula
marina DSM 3645|Rep: Arylsulphatase A - Blastopirellula
marina DSM 3645
Length = 457
Score = 107 bits (256), Expect = 8e-22
Identities = 96/303 (31%), Positives = 146/303 (48%), Gaps = 27/303 (8%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGS-YKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
LPL+EK + Q L GY+ ++GKWHLG + EY P NRGFD V I Y +
Sbjct: 122 LPLDEKTIAQELHADGYQCAMIGKWHLGEEHGPEYRPQNRGFDRVVLSEHHGIFNYFYPF 181
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
++Q W G + G Y D TDEAI V N+ P FL L+H +VH G
Sbjct: 182 VDQQKWPY---AGPLPGNP--GDYLPDRLTDEAIDFVRE-NRERPFFLYLSHWSVH-GRY 234
Query: 133 YEPIRAPQKLIDAFKYIDDSARQK-FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
+ AP+ LI ++ R +AA++ +D SVG+++ L L +N++ VF +D
Sbjct: 235 F----APESLIAKYRERGLEERPAIYAAMMETVDNSVGRLMATLDELNLADNTLFVFMSD 290
Query: 192 NGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPT 251
NGG + S PL+G K +L+EGGVR + P + + D PT
Sbjct: 291 NGG------ERITSMAPLRGSKGSLYEGGVRVPLIVRYPGVVKPNTTCSVPVISHDLFPT 344
Query: 252 LYSAAGGDLSVLEN-LDGVNQWDALS-KNTESPRTSVLHNIDDIWG----IAALTVDKYK 305
A + S +N LDG + L+ + +E R ++ + WG +A+ ++K
Sbjct: 345 FLDFA--ERSYRDNKLDGHSIAGLLTGEQSELDRDALYWHFPHYWGSTRPCSAMRQGRWK 402
Query: 306 LIK 308
L++
Sbjct: 403 LVE 405
>UniRef50_A6DR20 Cluster: N-acetyl-galactosamine-6-sulfatase; n=1;
Lentisphaera araneosa HTCC2155|Rep:
N-acetyl-galactosamine-6-sulfatase - Lentisphaera
araneosa HTCC2155
Length = 608
Score = 106 bits (255), Expect = 1e-21
Identities = 75/206 (36%), Positives = 105/206 (50%), Gaps = 23/206 (11%)
Query: 21 LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGT 80
L + K+ GYKT GKWHLG K Y PL GFD + W G GS+
Sbjct: 127 LGKVFKNAGYKTAHFGKWHLG--KSPYSPLEHGFDIDIPHWPG--------PGPAGSFVA 176
Query: 81 DFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQ 140
+R + G + D DE K + S NK +P F+ +VH+ P A Q
Sbjct: 177 PWRYP-NFKENYPGEHIDDRLGDEIAKYI-SENKDQPFFINFWQFSVHA-----PFNAKQ 229
Query: 141 KLIDAF-KYIDDSARQK---FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPA 196
+LID + K ID + Q +AA++ +D+S+GKV+ AL T L+E +I+VF +DNGG
Sbjct: 230 ELIDKYRKLIDKNNPQHNPVYAAMVESMDDSIGKVIDALETNKLMEKTIIVFFSDNGGNI 289
Query: 197 AGFND--NAASNYPLKGVKNTLWEGG 220
D A SN P +G K +++EGG
Sbjct: 290 HSVVDGTTATSNKPFRGGKASIYEGG 315
>UniRef50_A6DGX5 Cluster: N-acetylgalactosamine 6-sulfatase; n=2;
Lentisphaera araneosa HTCC2155|Rep:
N-acetylgalactosamine 6-sulfatase - Lentisphaera
araneosa HTCC2155
Length = 446
Score = 106 bits (254), Expect = 1e-21
Identities = 83/292 (28%), Positives = 140/292 (47%), Gaps = 24/292 (8%)
Query: 26 KDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGTDFRRG 85
K Y+T L+GKWHLG + P RGF+ GF +D Y E G +
Sbjct: 116 KSNNYRTSLIGKWHLGLQSPNH-PNERGFEIFHGFLGDMMDDY----WEHTRHGVAYMYH 170
Query: 86 FEVAHDLFGVYATDVYTDEAIKVVNSHNKS-EPLFLMLAHSAVHSGNPYEPIRAPQKLID 144
A + G +AT+++T+ AI+ + K P F L+++A P++PI P+K +
Sbjct: 171 NSTAVETKGTHATELFTNWAIEEIKQAQKDPRPFFQFLSYNA-----PHDPIHPPKKYYE 225
Query: 145 AFKYIDDSA---RQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFND 201
FK + R K ++ LD S+G+V+ L+ + +N++V+F++DNGG
Sbjct: 226 YFKKKQPNTSEKRAKIGGLIEHLDYSIGRVLDTLNELEIDKNTLVIFTSDNGGKI----K 281
Query: 202 NAASNYPLKGVKNTLWEGGVR-GAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDL 260
A N L+ K ++EGG+R F W + SK+ ++ M + D++PTL A D
Sbjct: 282 YGADNGELRADKTHMYEGGLRVCTSFTWPEKIRSKSLSDFRAMTM-DFMPTLLDAVNIDY 340
Query: 261 SVLENLDGVNQWDAL--SKNTESPRTSVLHNIDDIWGIAALTVDKYKLIKGT 310
S ++DG + L + + + I+ AL +D +KL+ +
Sbjct: 341 S--GHMDGKSFLPELLFGQQENFTKRKQFYTWLQIYKKHALRIDDWKLVNNS 390
>UniRef50_Q7UMZ5 Cluster: N-acetylgalactosamine-6-sulfate sulfatase;
n=1; Pirellula sp.|Rep: N-acetylgalactosamine-6-sulfate
sulfatase - Rhodopirellula baltica
Length = 484
Score = 105 bits (253), Expect = 2e-21
Identities = 82/279 (29%), Positives = 145/279 (51%), Gaps = 22/279 (7%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
GLP N L + L +GY+T L GKWHLG Y+ ++ P+ GFD + G +D Y +
Sbjct: 131 GLPANRPTLAKRLSSVGYETALFGKWHLG-YEAKFSPMMHGFDEALYCIGGAMDYYHY-- 187
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHSGN 131
++ + F G ++ + Y TD TD+A++ + N ++ P FL L ++A H+
Sbjct: 188 LDSVATYNLFHNGRPISGE---GYFTDTITDQAVRFIGDRNANDKPFFLYLPYTAPHT-- 242
Query: 132 PYE-PIRAPQKL--IDAFKYIDDSARQK-FAAVLSKLDESVGKVVKALHTRGLLENSIVV 187
PY+ P +P ID+ + ++ + A++ +DE +GKV+ A+ + + ++V+
Sbjct: 243 PYQAPGESPVDPLPIDSPLWKQNADPPGVYRAMVRHMDEGIGKVLHAIEESKMTDRTLVI 302
Query: 188 FSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISD 247
F++DNGG +A N+ PL+G K +EGG+R P + V+ Q D
Sbjct: 303 FASDNGGTSASRNE------PLRGFKGQAFEGGIRVPLIARWPGHLPEGVVSDQVTITFD 356
Query: 248 WLPTLYSAAGGDLSVLENLDGVNQWDALSKNTE--SPRT 284
++ +AAG + + ++G++ +L+ N E PRT
Sbjct: 357 LTASMLAAAGITPTQEDAMEGIDVL-SLAANDEPVQPRT 394
>UniRef50_A7AKS6 Cluster: Putative uncharacterized protein; n=1;
Parabacteroides merdae ATCC 43184|Rep: Putative
uncharacterized protein - Parabacteroides merdae ATCC
43184
Length = 464
Score = 105 bits (253), Expect = 2e-21
Identities = 74/212 (34%), Positives = 109/212 (51%), Gaps = 20/212 (9%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTG-RIDMYDHT 71
GLP +E++LP LK Y+T +GKWHLGS + P +GFD+ G G R YD
Sbjct: 110 GLPDDEELLPALLKRYDYRTGCIGKWHLGSEPSQR-PNAKGFDTFYGLLAGHRSYFYDPE 168
Query: 72 TMEQGSWGTDFR-RGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSG 130
T ++ ++ G +++ D Y TD +A + V +P L ++ +A HS
Sbjct: 169 TSDKDGNLQQYQYNGRKLSFD---GYFTDELASKAQQFVTE--SEQPFMLYMSFTAPHSP 223
Query: 131 NPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFST 190
N A ++ + F + RQK+AA++ LD VGK+V L G +N+I+ F +
Sbjct: 224 N-----EATEEDLARF---EGQPRQKYAAMMYALDRGVGKIVDELKAAGKFDNTIIFFLS 275
Query: 191 DNGGPAAGFNDNAASNYPLKGVKNTLWEGGVR 222
DNGG N +SN PLKG K +EGG R
Sbjct: 276 DNGGSTT----NQSSNLPLKGFKGNKFEGGQR 303
>UniRef50_A6DMX7 Cluster: N-acetyl-galactosamine-6-sulfatase; n=1;
Lentisphaera araneosa HTCC2155|Rep:
N-acetyl-galactosamine-6-sulfatase - Lentisphaera
araneosa HTCC2155
Length = 578
Score = 105 bits (253), Expect = 2e-21
Identities = 79/270 (29%), Positives = 136/270 (50%), Gaps = 25/270 (9%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
L N + + +K GY+T GKWHLG + Y PL GFD + TG +
Sbjct: 126 LDTNFPTIGKMMKQAGYETGHFGKWHLGP--EPYSPLQHGFDVDIPHHTGAGPGKSYVA- 182
Query: 74 EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPY 133
W + + + Y D +E +K V+ + +P F+ +VH+
Sbjct: 183 ---PWSQE-----HIKPNYEKEYIEDRMVEECLKWVDGLSGDKPFFMNYWMFSVHA---- 230
Query: 134 EPIRAPQKLIDAFK-YIDDSARQK---FAAVLSKLDESVGKVVKALHTRGLLENSIVVFS 189
P A Q+LID +K ID +++Q+ +AA++ LD++VG +++ L +RGL++N++++F+
Sbjct: 231 -PFDAKQELIDKYKKVIDPNSKQRSALYAAMVQSLDDAVGALLEGLESRGLMDNTVIIFT 289
Query: 190 TDNGGPAAGFNDNA---ASNYPLKGVKNTLWEGGVR-GAGFLWSPLLDSKARVAYQKMHI 245
+DNGG D SN+PL G K ++ EGGVR +W + + +R + + +
Sbjct: 290 SDNGGNIYSQLDEGIVPTSNFPLSGGKASMCEGGVRVPCTVVWPGVTKAGSR-SDEIVQT 348
Query: 246 SDWLPTLYSAAGGDLSVLENLDGVNQWDAL 275
SD+ T+ +G L +DG++ AL
Sbjct: 349 SDFYTTIIKGSGIALPEGHVVDGIDIRPAL 378
>UniRef50_A6CGG6 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
Planctomyces maris DSM 8797|Rep: N-acetylgalactosamine
6-sulfatase - Planctomyces maris DSM 8797
Length = 461
Score = 105 bits (252), Expect = 2e-21
Identities = 80/259 (30%), Positives = 127/259 (49%), Gaps = 23/259 (8%)
Query: 29 GYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGTDFRRGFEV 88
GY+T ++GKWHLG + P RGFD GF DM D + + G ++ R +
Sbjct: 129 GYQTAIIGKWHLG-LESPNTPNERGFDLFRGFLG---DMMDDYYLHRRH-GVNYMRRNQK 183
Query: 89 AHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHSGNPYEPIRAPQKLIDAFK 147
D G +ATD++TD + + SE P FL LA++A P+ PI+ P+ + K
Sbjct: 184 TVDPQG-HATDLFTDWTCEYLKQQATSESPFFLYLAYNA-----PHTPIQPPEDWLGKVK 237
Query: 148 YID---DSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAA 204
+ D R + A++ LD +GKV++ L L +N+++VFS+DNGG A
Sbjct: 238 QRETGIDPDRARLVALIEHLDAGIGKVIQTLDETKLSDNTLIVFSSDNGGQLG----VGA 293
Query: 205 SNYPLKGVKNTLWEGGVR-GAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVL 263
+N L+ K +++EGG++ G +W + + M + D PT+ A G + V
Sbjct: 294 NNGALRDGKQSMYEGGLKVPTGVVWKKHIAPHTETDFMAMSM-DIFPTVCEAGG--IKVP 350
Query: 264 ENLDGVNQWDALSKNTESP 282
LD V+ L + P
Sbjct: 351 SGLDAVSFLPTLQGRQQKP 369
>UniRef50_A0Z718 Cluster: Twin-arginine translocation pathway
signal; n=1; marine gamma proteobacterium HTCC2080|Rep:
Twin-arginine translocation pathway signal - marine
gamma proteobacterium HTCC2080
Length = 653
Score = 105 bits (252), Expect = 2e-21
Identities = 80/279 (28%), Positives = 131/279 (46%), Gaps = 21/279 (7%)
Query: 8 GAEPRGLPLNEKI--LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRI 65
G P G ++ ++ LP L+ GY TH VGKWHLG ++ P+ +GFDS GF +
Sbjct: 120 GFRPAGRGISPEVITLPDMLRGAGYTTHHVGKWHLGFVSEQAWPIQQGFDSFFGFLDQFL 179
Query: 66 DMYDHT----TMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVV-NSHNKSEPLFL 120
HT +++ ++ + A + + +DV +EAI ++ ++ +P F+
Sbjct: 180 LRGPHTGAGYNLKRPTYVNPLLQRDNGAFEKKSGHLSDVLVEEAIDLLARVKDQKQPWFI 239
Query: 121 MLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGL 180
+ P+ P+ + F D+ K+ A+L ++D +VG+V++ L L
Sbjct: 240 -----NYWTYLPHTPLTPATRFASKF---PDTPEGKYNAMLMQVDAAVGRVLETLDASDL 291
Query: 181 LENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAY 240
+++V+ +DNGG SN P GVKNT EGG+R + P + K V
Sbjct: 292 TRSTLVIVVSDNGGT----EKQLPSNQPFIGVKNTFTEGGLRTPLLMRWPEVIPKNMVID 347
Query: 241 QKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNT 279
+ + D+ PTL S G+ S L G N W NT
Sbjct: 348 ETVSYLDYFPTLESLVTGNTS--GGLPGRNLWPLFVDNT 384
>UniRef50_Q482D6 Cluster: Sulfatase family protein; n=2;
Bacteria|Rep: Sulfatase family protein - Colwellia
psychrerythraea (strain 34H / ATCC BAA-681)
(Vibrio psychroerythus)
Length = 492
Score = 105 bits (251), Expect = 3e-21
Identities = 90/301 (29%), Positives = 146/301 (48%), Gaps = 36/301 (11%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHV--GFWTGRIDMY-DH 70
LPL+ ++LK+ GY+T +GKWHLG K+ P +GFDS + G W Y +
Sbjct: 108 LPLSAVTFGEHLKEAGYQTGYIGKWHLG--KEGGDPTKQGFDSSIMAGHWGAPPSYYFPY 165
Query: 71 TTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHS- 129
T M + + +GF Y TD TDEA+ + K +P L+LAH AVH+
Sbjct: 166 TKMSK----SGKNKGFAKVEGSEEEYLTDRLTDEALTFI-EQKKDQPFLLVLAHYAVHTP 220
Query: 130 --GNP--YEPIRAPQKLI---------DAFKYIDDSARQK-------FAAVLSKLDESVG 169
G P + + K + DA D + K +AA++ +D SVG
Sbjct: 221 IEGKPALVKKYKTKMKKLGIANAGPKSDADLIKDSTGYHKTIQNNPDYAAMVESVDISVG 280
Query: 170 KVVKALHTRGLLENSIVVFSTDNGG-PAAGFNDN---AASNYPLKGVKNTLWEGGVRGAG 225
++ + L GL +N+I++ ++D+GG + G N A SN P + K +++GG R
Sbjct: 281 RIEQQLKRLGLEDNTIIILTSDHGGLSSRGLKSNRVLATSNNPYRHGKGWIYDGGTRVPL 340
Query: 226 FLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTS 285
+ P ++ ++ +D PT+ AG LS ++ DGV+ AL+ + E+PR +
Sbjct: 341 IVKWPEKVKAGSISQVQVTGTDHYPTILQMAGLSLSPKDHQDGVSYLAALNSD-ETPRKA 399
Query: 286 V 286
+
Sbjct: 400 M 400
>UniRef50_Q7UPK7 Cluster: Arylsulphatase A; n=1; Pirellula sp.|Rep:
Arylsulphatase A - Rhodopirellula baltica
Length = 482
Score = 104 bits (250), Expect = 4e-21
Identities = 84/255 (32%), Positives = 127/255 (49%), Gaps = 26/255 (10%)
Query: 17 NEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQG 76
+E + LKD GY T LVGKWH G + PL+RGFD GF+ G D+ G
Sbjct: 139 DETTIADVLKDAGYATGLVGKWHTGR-GDGFHPLDRGFDEFEGFF-GSDDV--------G 188
Query: 77 SWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPI 136
+ F +++ D+ Y TD AI+ V H++ P FL LAH A P+ P+
Sbjct: 189 YFRYPFSEQRQIS-DVDESYLTDDLNRRAIEFVRRHHE-HPFFLHLAHYA-----PHRPL 241
Query: 137 RAPQKLIDAFKYID-DSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNG-G 194
AP ++I ++ D + A++ +D +G+++ + GL E++IV+F++DNG
Sbjct: 242 EAPPEVIARYREQGFDESTATIYAMIEVMDRGIGELLAEIDDLGLSEDTIVLFASDNGPD 301
Query: 195 PAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWLPTLY 253
P G N L+G K + EGG+R F+ WS L R Q + D +PT+
Sbjct: 302 PLTG----ERFNRELRGTKYQVNEGGIRVPLFVRWSKRLAPGQR--DQMVTFVDLMPTIL 355
Query: 254 SAAGGDLSVLENLDG 268
D+S+L LDG
Sbjct: 356 DLCRVDVSMLNRLDG 370
>UniRef50_A6C3C8 Cluster: Putative uncharacterized protein; n=1;
Planctomyces maris DSM 8797|Rep: Putative
uncharacterized protein - Planctomyces maris DSM 8797
Length = 600
Score = 104 bits (249), Expect = 6e-21
Identities = 101/354 (28%), Positives = 157/354 (44%), Gaps = 35/354 (9%)
Query: 17 NEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQG 76
NE + Q L+ GYKT L GKWHLG Y +Y P RGFD G + G I+ Y +
Sbjct: 113 NETTIAQVLQKAGYKTGLFGKWHLGRY-AQYQPQRRGFDHFFGHYHGHIERYTNPDQVVV 171
Query: 77 SWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPI 136
+ RG Y TD++TD AI + N+ +P F LA++A HS +
Sbjct: 172 NGTPVETRG----------YVTDLFTDAAIDFI-QRNQQQPFFCYLAYNAPHSPFLLDTS 220
Query: 137 RAPQ----KLIDAFKYIDDSARQ-KFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
Q KLI+ + R+ + A++ ++D+++ ++++ +H L + ++V+F++D
Sbjct: 221 HFGQPEGDKLIEKYLAKGLPLREARIYAMIERIDQNLSRLLQTVHDLKLDQETVVIFTSD 280
Query: 192 NGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWLP 250
NGG + GF LKG K + +EGG R + W+ + + + +D P
Sbjct: 281 NGGVSRGFKAG------LKGSKASAYEGGTRVPFVVRWTDHFPA-GKTTDAMVAQTDLFP 333
Query: 251 TLYSAAGGDLSVLENLDGVNQWDALSK-NTESPRTSVLHNIDDIWGIAALTVDKYK--LI 307
T AG + LDG + + + +SP + H D T + Y I
Sbjct: 334 TFCQLAGVPVPSNVKLDGESILSLMEQGGGKSPHQYLYHTWD------RYTPNPYHRWAI 387
Query: 308 KGTIYKGVWDNWYGPSGREGAYNASLLYDSHAGRILDKLNLMPPKEKVMELRDE 361
G +K V + G +EG LYD K EKV ELR E
Sbjct: 388 HGPRFKLVGHDPQGKKKKEGEPQGQ-LYDLQEDPGEKKNVADQYPEKVSELRGE 440
>UniRef50_Q15XI1 Cluster: Sulfatase; n=2; Bacteria|Rep: Sulfatase -
Pseudoalteromonas atlantica (strain T6c / BAA-1087)
Length = 510
Score = 103 bits (248), Expect = 7e-21
Identities = 82/279 (29%), Positives = 138/279 (49%), Gaps = 33/279 (11%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
LPL+E L + K GY T +GKWHLG ++ P N+GFD ++ + +
Sbjct: 131 LPLSEITLAEAFKQNGYNTAFLGKWHLGK-TEDLWPENQGFDVNIAGTKNGHPAAGYFSP 189
Query: 74 EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHS--G 130
+ + TD +G Y T T+EAI +V+ ++K P F+ML+ VH+
Sbjct: 190 YKNARLTDGPKG---------EYLTQRLTNEAISLVDKYSKQTVPFFMMLSFYTVHTPLA 240
Query: 131 NPYEPIRAPQKLIDAFKYIDDSARQK-------------------FAAVLSKLDESVGKV 171
P + ++ Q I + + D+ R++ +AA++ ++D VG++
Sbjct: 241 APNKDVQEYQAKIRQYAHNDEFQREEQVWPTAEKREVRVKQNHPTYAAMVKQMDTQVGRL 300
Query: 172 VKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPL 231
+ L G+ E+++VVF++DNGG ++ + SN PL+G K L+EGG+R + P
Sbjct: 301 LAKLKQAGMEESTLVVFTSDNGGLSSA-EGSPTSNLPLRGGKGWLYEGGIRVPLLVKLPQ 359
Query: 232 LDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
K + + +D PTL SA DL ++LDGV+
Sbjct: 360 KKHKHLQINEPVTSTDLYPTLLSAGHLDLLPQQHLDGVD 398
>UniRef50_A6DF72 Cluster: Putative secreted sulfatase ydeN; n=1;
Lentisphaera araneosa HTCC2155|Rep: Putative secreted
sulfatase ydeN - Lentisphaera araneosa HTCC2155
Length = 481
Score = 103 bits (248), Expect = 7e-21
Identities = 78/267 (29%), Positives = 133/267 (49%), Gaps = 23/267 (8%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHV-GFWTGRIDMYDHTT 72
L E L + K GYKT +GKWHLG + P N+GFD ++ GF G +
Sbjct: 113 LTAEEITLAEAFKATGYKTVHIGKWHLGEESVSW-PENQGFDENIAGFRAGSPSAHGG-- 169
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNK-SEPLFLMLAHSAVHSGN 131
G + + + + G Y T+ EA + + S K +P F+ L VH+
Sbjct: 170 ---GGYFSPYNNP-RLKDGPKGEYLTERLAQEASQYIQSTAKLKKPFFMNLWLYNVHT-- 223
Query: 132 PYEPIRAPQKLIDAFKYIDDSARQK----FAAVLSKLDESVGKVVKALHTRGLLENSIVV 187
P++A Q+ ID + + Q +AA++ +D++VG V++A+ G+ +N+I++
Sbjct: 224 ---PLQARQEKIDKYTRLIQKGYQHTNPVYAAMVEHMDDAVGTVMQAVKDAGIEDNTIII 280
Query: 188 FSTDNGGPAAGFNDN---AASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKM 243
F++DNGG + +N SNYPL+ K ++EGGVR + WS + + + + +
Sbjct: 281 FNSDNGGLRGNYENNRQKVTSNYPLRSGKGDMYEGGVRVPMIIKWSRKIKA-GQTSSSPV 339
Query: 244 HISDWLPTLYSAAGGDLSVLENLDGVN 270
D PTL D+S +++DG++
Sbjct: 340 ISHDIYPTLLDLCKIDVSKKQDIDGIS 366
>UniRef50_Q47Q78 Cluster: N-acetylgalactosamine-6-sulfate sulfatase;
n=1; Thermobifida fusca YX|Rep:
N-acetylgalactosamine-6-sulfate sulfatase - Thermobifida
fusca (strain YX)
Length = 471
Score = 102 bits (245), Expect = 2e-20
Identities = 85/329 (25%), Positives = 150/329 (45%), Gaps = 26/329 (7%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
++ ++ + G+P L L + GY T + GKWH G + Y PL GF++ G
Sbjct: 89 LEEPLVTRSPENGIPEGHPTLSSLLVEAGYATAMFGKWHCG-WLPWYSPLRIGFETFFGN 147
Query: 61 WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
+ G +D ++H G D G E + G Y T++ ++ A + + +H ++ P ++
Sbjct: 148 FDGALDYFEHVDT-LGK--ADLYEG-ETPVEEVGYY-TEIISERAAEYITAH-RNRPFYV 201
Query: 121 MLAHSAVH---SGNPYEPI------RAPQKLIDA-FKYIDDSARQKFAAVLSKLDESVGK 170
L ++A H G + R Q+ + ++D + K+ ++ +D +G+
Sbjct: 202 QLNYTAPHWPWEGPDDHEVGQEIRRRYQQRWEHSPLMHLDGGSIAKYGELVEAMDAGIGQ 261
Query: 171 VVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSP 230
V+ AL G +N+IVVFS+DNGG + + N+P G K L EGG+R + P
Sbjct: 262 VLAALDRAGAADNTIVVFSSDNGG------ERWSKNWPFVGEKGDLTEGGIRVPLIVAWP 315
Query: 231 LLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNI 290
+ +V+ + DW TL +AAG + LDGV+ L + P +
Sbjct: 316 EAIAGNQVSDHPVITMDWTATLLAAAGTEPHPDWPLDGVDLLPWLVDGADFPAHDLFWRT 375
Query: 291 DDIWGIAALTVDKYKLIKGTIYKGVWDNW 319
+ AL ++K ++ + V NW
Sbjct: 376 SN---QGALRRGRFKYLRDRRDRAVLGNW 401
>UniRef50_A6P2X1 Cluster: Putative uncharacterized protein; n=1;
Bacteroides capillosus ATCC 29799|Rep: Putative
uncharacterized protein - Bacteroides capillosus ATCC
29799
Length = 494
Score = 102 bits (245), Expect = 2e-20
Identities = 91/305 (29%), Positives = 139/305 (45%), Gaps = 36/305 (11%)
Query: 7 YGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRID 66
Y + GLP +E +LP+ L+ GY+T LVGKWHLG ++E P NRGFD G
Sbjct: 162 YPYQNDGLPTDEILLPEVLQQAGYETALVGKWHLG-IREEERPYNRGFDLFYG------A 214
Query: 67 MYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVN---SHNKSEPLFLMLA 123
+Y + D EV HD Y T E +V N+ P FL A
Sbjct: 215 LYSDDNDPHRIYHND-----EVVHD--EPYDQSGMTKELTQVAKQFIDDNQDGPFFLYYA 267
Query: 124 HSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLEN 183
S P+ P A + +++ S + + ++D SVG+++ L GLLEN
Sbjct: 268 -----SPFPHWPSNASE------EWLGTSQAGIYGDCMQEVDWSVGEIMDTLEENGLLEN 316
Query: 184 SIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKM 243
++V+F++DNG + D A +G K+T + GG + P + V M
Sbjct: 317 TLVIFTSDNG----PWYDGATGGQ--RGRKDTNYNGGSHVPFIAYMPGTIPEGEVYDGLM 370
Query: 244 HISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALTVDK 303
D PT+ + G +L +DG++ W L+ ++SPRT + N D AL D
Sbjct: 371 SGVDVFPTILNLLGIELPQDRVIDGMDMWPFLTGQSDSPRTELFLNKDK--DTFALIEDN 428
Query: 304 YKLIK 308
+K ++
Sbjct: 429 FKYLE 433
>UniRef50_A6DMY9 Cluster: Putative uncharacterized protein; n=2;
Lentisphaera araneosa HTCC2155|Rep: Putative
uncharacterized protein - Lentisphaera araneosa HTCC2155
Length = 590
Score = 102 bits (245), Expect = 2e-20
Identities = 85/277 (30%), Positives = 129/277 (46%), Gaps = 30/277 (10%)
Query: 12 RGLPL---NEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRI-DM 67
RGL + E + + K GY+T L GKWH G + P +GFD + GF G I D
Sbjct: 96 RGLEIMRSEEATIAEVFKAQGYETGLFGKWHNGEHYPNNPP-GQGFDEYFGFCAGHIGDF 154
Query: 68 YDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAV 127
+D T D + F + TDV TD AI + + +P F + ++A
Sbjct: 155 FDATL--------DHNKTFVKTKG----FITDVLTDRAIDWIEKQ-QDKPFFAYIPYNA- 200
Query: 128 HSGNPYEPIRAPQKLIDAFKYIDDSARQKFA-AVLSKLDESVGKVVKALHTRGLLENSIV 186
P+ P + K D F SA A ++ LD+++G+++K L L +N+IV
Sbjct: 201 ----PHAPYQVEDKYYDEFAAKGYSAAHSAAYGMIENLDDNIGRLLKILDDLNLTDNTIV 256
Query: 187 VFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHIS 246
+F TDNG N N +KG K ++ EGGVR F+ P +K R +
Sbjct: 257 IFLTDNGP-----NSPTRFNGGMKGSKGSVDEGGVRVPFFIRWPGKIAKGRTIHDLAAHI 311
Query: 247 DWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPR 283
D LPTL AG ++ + LDG +L ++++P+
Sbjct: 312 DVLPTLMELAGVNVDLPNKLDG-RSLTSLISSSKTPK 347
>UniRef50_A6C8S3 Cluster: Arylsulphatase A; n=1; Planctomyces maris
DSM 8797|Rep: Arylsulphatase A - Planctomyces maris DSM
8797
Length = 481
Score = 102 bits (245), Expect = 2e-20
Identities = 97/340 (28%), Positives = 152/340 (44%), Gaps = 47/340 (13%)
Query: 2 QHGVIYGAEPRG--------LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRG 53
+HGV Y P G + +E +L + LK+ GY T +GKWHLG + EY P G
Sbjct: 103 RHGVWYNPAPDGQQFRSGVGIAESELLLSELLKENGYATICIGKWHLG-HDPEYYPTRHG 161
Query: 54 FDSHVGFWTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHN 113
FD ++G DM M+ G + + + T YT+ A+K + N
Sbjct: 162 FDDYLGILYSN-DMRPVNLMQ----GEKL-----LEYPVIQANLTKRYTERAVKFIQE-N 210
Query: 114 KSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVK 173
+ P FL L H+ P++P+ A + + S + V+++LD SVG++ K
Sbjct: 211 QEGPFFLYLPHAM-----PHKPLAASEA------FYKKSGAGLYGDVIAELDWSVGEIFK 259
Query: 174 ALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLD 233
L L EN++V+F++DNG F N A L G+K+T WEGG+R P
Sbjct: 260 TLRELNLDENTLVIFASDNG---PWFGGNTAG---LSGMKSTTWEGGLRVPMIARWPGKI 313
Query: 234 SKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDI 293
+V D PT+ AG + +DG + + L+K +P ++ +
Sbjct: 314 PPRQVIDTVCGSIDVFPTILKQAGIPVPADRVIDGKDLFPVLTKQAPTPHQALY----SM 369
Query: 294 WGIAALTV--DKYKL-IKGT---IYKGVWDNWYGPSGREG 327
G + TV +KL +K + + G NW P G +G
Sbjct: 370 KGNSLFTVRSGPWKLHVKPSPRQVLAGKGKNWIDPRGPDG 409
>UniRef50_A6LHS9 Cluster: Arylsulfatase; n=4; Bacteroidetes|Rep:
Arylsulfatase - Parabacteroides distasonis (strain ATCC
8503 / DSM 20701 / NCTC11152)
Length = 471
Score = 102 bits (244), Expect = 2e-20
Identities = 90/313 (28%), Positives = 144/313 (46%), Gaps = 31/313 (9%)
Query: 21 LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGT 80
L + +K GY T + GKW LG+ +P GFD G+ R H+ W
Sbjct: 116 LGKLMKSAGYTTGIFGKWGLGNPGSVSIPNKMGFDEFYGYNCQR---QSHSFYPDHLWHN 172
Query: 81 DFRRGF-EVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHS--GNPYEPI- 136
+ + F E ++ Y+ D+ ++A+K + H K +P F ML ++ H+ P++ I
Sbjct: 173 EEKVLFPENENNACKTYSQDLIHEQALKFIRDH-KEQPFFAMLTYTLPHAELNLPHDSIY 231
Query: 137 RAPQKLIDAFKYI------------DDSARQKFAAVLSKLDESVGKVVKALHTRGLLENS 184
+ + + YI + FAA++S+LD+ VG V+ L GL +N+
Sbjct: 232 KMYENSFEETPYIGKFDKVYGGYNTSEKPLASFAAMVSRLDKYVGDVMAELKELGLDKNT 291
Query: 185 IVVFSTDNGGPAAGFND-NAASNY-PLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQK 242
IV+F++DNG G D + +Y P +G+K ++EGG+R W P + Q
Sbjct: 292 IVIFTSDNGPHHEGGADPDFFKSYGPFRGIKRDVYEGGIRIPMVAWCP---GTIKAGAQS 348
Query: 243 MHIS---DWLPTLYSAAGGDLSVLENLDGVNQWDA-LSKNTESPRTSVLHNIDDIWGIAA 298
HIS D +PTL G L E DG++ LSK + + ++ G A
Sbjct: 349 DHISAFWDVMPTLAELTGTVLP--EKTDGISFLPTLLSKKDQQAHDYLYWEFHELNGREA 406
Query: 299 LTVDKYKLIKGTI 311
L K+KLI+ I
Sbjct: 407 LRSGKWKLIRQPI 419
>UniRef50_A6C1V3 Cluster: Putative secreted sulfatase ydeN; n=1;
Planctomyces maris DSM 8797|Rep: Putative secreted
sulfatase ydeN - Planctomyces maris DSM 8797
Length = 470
Score = 102 bits (244), Expect = 2e-20
Identities = 72/255 (28%), Positives = 122/255 (47%), Gaps = 20/255 (7%)
Query: 21 LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGT 80
LP+ L+ GY+T VGKWHLG + LP + GFD ++ + H +
Sbjct: 132 LPEALRTAGYQTFHVGKWHLGG--RGNLPQDHGFDVNISGTNRGLPRSYHFPYGGDAMKW 189
Query: 81 DFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQ 140
D D Y TD DEA+ ++ + +P FL + +VHS PI+
Sbjct: 190 DSSLTEAERQDR---YLTDRMADEAVALIR-QQQDKPFFLYCSFYSVHS-----PIQGRP 240
Query: 141 KLIDAFKYIDDSARQK---FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAA 197
L+ +K + R K +AA++ +DE++G+V L G+ + +++VF++DNG
Sbjct: 241 DLVKKYKGLPAGKRHKNPEYAAMIQSVDEAIGRVRAQLKESGIADRTLIVFTSDNG---- 296
Query: 198 GFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAG 257
G ++N PL+G K WEGG R + P + V + + D+ PT+ + G
Sbjct: 297 GVRRKTSNNDPLRGEKGQHWEGGTRVPAIVLWPGVTPAGSVCAEPIITMDFYPTILNITG 356
Query: 258 --GDLSVLENLDGVN 270
G+ +++DG++
Sbjct: 357 VAGNTEHNQSVDGLS 371
>UniRef50_A6DHS2 Cluster: N-acetylgalactosamine-6-sulfate sulfatase;
n=1; Lentisphaera araneosa HTCC2155|Rep:
N-acetylgalactosamine-6-sulfate sulfatase - Lentisphaera
araneosa HTCC2155
Length = 447
Score = 101 bits (243), Expect = 3e-20
Identities = 93/310 (30%), Positives = 141/310 (45%), Gaps = 31/310 (10%)
Query: 12 RGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDH- 70
RG+ E P+ +K Y T + GKWH+G YK E+ P+N GFD VGF +G ID H
Sbjct: 102 RGIRDEEWTFPEAMKSADYATAVFGKWHIG-YKAEFHPMNHGFDEFVGFISGNIDAQSHY 160
Query: 71 TTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHS- 129
M W E H +D+ T+ ++ + NK +P FL +AH HS
Sbjct: 161 DRMSTFDWWQARELKDEKGHH------SDLITEHSLDFI-ERNKEKPFFLYVAHGTPHSP 213
Query: 130 --GNPYEPIRAPQK-LIDAF----KYI----DDSARQKFAAVLSKLDESVGKVVKALHTR 178
+ R P K + A+ +Y DD+ K + +DE V +++ L
Sbjct: 214 FQARGSKIQRGPNKGQVPAWAPKIEYSKTPGDDNWLMKHFTL--PVDEGVNRILDKLVEL 271
Query: 179 GLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARV 238
+ +N+IV F +DNG AA N + + N +G K +++EGG R +W+P V
Sbjct: 272 KIDKNTIVWFLSDNG--AAKGNHSHSEN--TRGAKGSMYEGGHRVPALVWAPGRIKAGSV 327
Query: 239 AYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTE-SPRTSVLHNIDDIWGIA 297
+ Q M D + AAG + LDGV+ + N + + RT + N G
Sbjct: 328 SDQTMMTFDITASSIKAAGVAIPANHQLDGVDIHPTVFNNKKLNERTLIWENGK---GSG 384
Query: 298 ALTVDKYKLI 307
AL +KL+
Sbjct: 385 ALRKGPWKLV 394
>UniRef50_A6C4V9 Cluster: Sulfatase; n=1; Planctomyces maris DSM
8797|Rep: Sulfatase - Planctomyces maris DSM 8797
Length = 480
Score = 101 bits (241), Expect = 5e-20
Identities = 78/267 (29%), Positives = 128/267 (47%), Gaps = 20/267 (7%)
Query: 12 RGLPLNEKILPQYLKDLGYKTHLVGKWHLG--SYKKEYLPLNRGFDSHVGFWTGRIDMYD 69
+GL +E + LK GY+T L+GKWH G E+ P N GFD+ VG+ +G ID
Sbjct: 118 KGLRKSENTFAELLKQAGYRTALIGKWHQGYPHNSAEFHPDNHGFDTFVGYHSGNIDFIS 177
Query: 70 HT-TMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVH 128
H + W + E Y+T + A++ + ++++P L LAH A+H
Sbjct: 178 HVGDHVKHDWWHGRKETQETG------YSTHLINQYALQFI-KESRNQPFCLYLAHEAIH 230
Query: 129 S--GNPYEPIRAPQKL-IDAFKYIDDSAR-QKFAAVLSKLDESVGKVVKALHTRGLLENS 184
+ P +PIR + +K ++ R +KF + +D VG++ + L GL +N+
Sbjct: 231 NPVQVPGDPIRRTEAAGWKRWKPASEAERIEKFRGMTLPVDAGVGQIREFLVKSGLDKNT 290
Query: 185 IVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSP-LLDSKARVAYQKM 243
V+F +DN GP+ F + +G K +++EGG R W P + + +
Sbjct: 291 FVLFFSDN-GPSRDFPSGSPK---WRGAKGSVYEGGHRVPAIAWWPGKIQAGTETDVPAI 346
Query: 244 HISDWLPTLYSAAGGDLSVLENLDGVN 270
+ D +PTL A D+ LDGV+
Sbjct: 347 SL-DVMPTLLGIAHIDMPKERPLDGVD 372
>UniRef50_A6CBI6 Cluster: Putative uncharacterized protein; n=1;
Planctomyces maris DSM 8797|Rep: Putative
uncharacterized protein - Planctomyces maris DSM 8797
Length = 599
Score = 100 bits (239), Expect = 9e-20
Identities = 85/269 (31%), Positives = 126/269 (46%), Gaps = 31/269 (11%)
Query: 3 HGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWT 62
HGV G E + E + + K GYKT GKWH G + + P +GFD GF
Sbjct: 97 HGVTRGFE--NMRAEETTIAEMFKAAGYKTGAFGKWHNGRHYPMH-PNGQGFDEFFGFCG 153
Query: 63 GRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLML 122
G + Y T +E Y TDV TD AI + NK +P F +
Sbjct: 154 GHWNRYFDTNLEHNKQPVKTEG-----------YITDVLTDRAIDFIKQ-NKDQPFFCYV 201
Query: 123 AHSAVHSGNPYEPIRAPQKLIDAF--KYIDDSARQKFAAVLSKLDESVGKVVKALHTRGL 180
++A HS P P+K D + K +DD AR +A V +D+++G++++ L L
Sbjct: 202 PYNAPHS-----PWIVPEKYWDKYANKGLDDKARCAYAMV-ECVDDNLGRLMQTLDDLKL 255
Query: 181 LENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSP-LLDSKARVA 239
+N+IV+F TDNG + +N N ++G K ++ EGG+R F+ P + + V
Sbjct: 256 SDNTIVLFLTDNGPNSNRYNGN------MRGRKGSIHEGGIRVPLFVRYPGKIKAGTVVK 309
Query: 240 YQKMHISDWLPTLYSAAGGDLSVLENLDG 268
HI D LPTL + + + LDG
Sbjct: 310 PIAAHI-DILPTLLELCSVENTADQPLDG 337
>UniRef50_Q1GWE7 Cluster: Sulfatase precursor; n=4;
Alphaproteobacteria|Rep: Sulfatase precursor -
Sphingopyxis alaskensis (Sphingomonas alaskensis)
Length = 543
Score = 99 bits (238), Expect = 1e-19
Identities = 77/270 (28%), Positives = 128/270 (47%), Gaps = 25/270 (9%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
G+P +E + + +K GY T +GKWHLG E P +GFD + G +
Sbjct: 173 GVPASEVTIAEAVKAAGYHTVHIGKWHLGE-APELQPHAQGFDESLAVLAGAAMLLPEDD 231
Query: 73 MEQGS----WGTDFRRGF-EVAHDL-------FGV--YATDVYTDEAIKVVNSHNKSEPL 118
+ + W R + + H + F + TD + DEAIK + + N++ P
Sbjct: 232 PDAVNAKLPWDPIDRFIWANLRHAVTFNGSKRFAAQGHMTDYFADEAIKAIEA-NRNRPF 290
Query: 119 FLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTR 178
FL LA +A P+ P++A + D I D + + A+++++D +G V+ L
Sbjct: 291 FLYLAFTA-----PHTPLQATRADYDRLAAIKDHRTRVYGAMIAQMDRRIGDVMAKLKEA 345
Query: 179 GLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKAR 237
G+ +N++V+F++DNGG A +N N P +G K T +EGG+R F+ W +
Sbjct: 346 GIDDNTLVIFTSDNGG--AWYNGMPGLNAPFRGWKATFFEGGIRAPLFMRWPARIAPGTE 403
Query: 238 VAYQKMHISDWLPTLYSAAGGDLSVLENLD 267
H+ D T+ +AAG L +D
Sbjct: 404 RGDVTGHL-DLFATIAAAAGAALPADRTID 432
>UniRef50_Q3JD43 Cluster: Sulfatase; n=1; Nitrosococcus oceani ATCC
19707|Rep: Sulfatase - Nitrosococcus oceani (strain ATCC
19707 / NCIMB 11848)
Length = 440
Score = 99.5 bits (237), Expect = 2e-19
Identities = 90/278 (32%), Positives = 136/278 (48%), Gaps = 41/278 (14%)
Query: 9 AEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMY 68
A + + L E + LK +GY T LVGKWHLG + +LP +GFD + G Y
Sbjct: 95 AMAKAMSLEEITFAEALKSVGYSTALVGKWHLGD-RPAFLPPRQGFDEYFGI------PY 147
Query: 69 DHTTMEQGSWGTDF-----RRGFEVAH---DLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
H + W F RG E+ DL + T T+EA+K + S NK P L
Sbjct: 148 SH---DMHPWRKSFPPLPLMRGEEIVELNPDLD--HLTQYCTEEAVKFI-SKNKDRPFLL 201
Query: 121 MLAH----SAVHSGNPYEPIRAPQKLIDAFKYIDDSARQ-KFAAVLSKLDESVGKVVKAL 175
+ H VH + R ++ + A K D +R+ ++A + ++D SVG+++KA+
Sbjct: 202 YMPHPMPHQPVHVSERFAK-RFSKEQLAAIKGEDKKSRKFLYSATIEEIDWSVGEIIKAV 260
Query: 176 HTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL--WSPLLD 233
G+ E++ V F++DN GPA G S PL+G K LWEGG R F+ W +
Sbjct: 261 RALGIEESTFVAFTSDN-GPAIG------SAGPLRGKKRELWEGGHR-VPFIAYWQEKI- 311
Query: 234 SKARVAYQKMHIS-DWLPTLYSAAGGDLSVLENLDGVN 270
+ V ++ +S D PT+ +A G + +DGVN
Sbjct: 312 -RPGVVIDEIAMSMDLFPTM-AAMGRAPLPRKKIDGVN 347
>UniRef50_Q0C069 Cluster: Sulfatase family protein; n=2;
Bacteria|Rep: Sulfatase family protein - Hyphomonas
neptunium (strain ATCC 15444)
Length = 505
Score = 99.5 bits (237), Expect = 2e-19
Identities = 86/310 (27%), Positives = 138/310 (44%), Gaps = 35/310 (11%)
Query: 5 VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF---- 60
V++ GLP +E + + L+ GY + GKWH+G + E+LP + GF S+ G
Sbjct: 121 VLFPTSTGGLPQSEVTIAELLQQEGYVSAAFGKWHMG-HLPEFLPTSHGFQSYFGIPYSN 179
Query: 61 -----------WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYA-TDVYTDEAIKV 108
W+ ID++ Q +W + E+ + T YT+ AI+
Sbjct: 180 DMNMPGGGETPWS--IDLFFEPPNIQ-NWDVPLMQDEEIIERPADQFTLTQRYTERAIEF 236
Query: 109 VN-SHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDES 167
+ SH + +P FL LAH+ H+ P + F + SA + V+ +LD S
Sbjct: 237 METSHAEGQPFFLYLAHNMPHT---------PLFTSEGFTGV--SAGGAYGDVIEELDWS 285
Query: 168 VGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL 227
VG++V AL + +N++V+F++DN GP ++ S L+ K T WEGG+R
Sbjct: 286 VGEIVDALKDMKIEKNTLVIFTSDN-GPWLAMKTHSGSAGMLRDGKGTTWEGGMRVPAIF 344
Query: 228 WSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPR-TSV 286
W P R D +PT + +G L DG + AL SPR T
Sbjct: 345 WWP-GQIAPRTVTDLGSALDLMPTFAAISGARLPEDRVYDGFDLSPALFSEGSSPRETLY 403
Query: 287 LHNIDDIWGI 296
+ D++ +
Sbjct: 404 YYRFTDVFAV 413
>UniRef50_A6DSG6 Cluster: Arylsulfatase A; n=1; Lentisphaera
araneosa HTCC2155|Rep: Arylsulfatase A - Lentisphaera
araneosa HTCC2155
Length = 499
Score = 99.5 bits (237), Expect = 2e-19
Identities = 86/292 (29%), Positives = 137/292 (46%), Gaps = 39/292 (13%)
Query: 5 VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF-WTG 63
++Y GL +P+ LK+ GY T L+GKWHLG + YLP ++GFD + G T
Sbjct: 91 IVYPNSTTGLIKEVVTIPEMLKEKGYTTALIGKWHLG-HTAGYLPRDQGFDYYFGVPGTN 149
Query: 64 RIDMYDHTT-MEQG----------SWGTDFRRGFE------VAHDLFGVYATDV------ 100
D H + +G + D +G + +D + TD+
Sbjct: 150 HGDAKTHKLPVAEGFKPSGEFTIEDYWADKGKGVHGNSTILMKNDNVIEWPTDITQLTKR 209
Query: 101 YTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAV 160
YT +A++ + NK +P FL AH H +PY +DA + S + +
Sbjct: 210 YTHDAVRYIKE-NKDKPFFLYFAHGTPH--HPYT--------VDA-AFRGKSDHGLYGDM 257
Query: 161 LSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNA--ASNYPLKGVKNTLWE 218
+ ++D SVG+V+KAL G+ + +I+ F++DNG + ++A SN PLKG K + E
Sbjct: 258 IEEIDWSVGEVIKALQENGIEKKTIIAFTSDNGADSKPNKEHAEKGSNLPLKGWKGSSEE 317
Query: 219 GGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
GGVR L P + + + + D PT + AG + V + +DG N
Sbjct: 318 GGVRVPFVLSWPGTLPEGKKTNEIASLMDIFPTYAALAGIEPEVPQKIDGNN 369
>UniRef50_A6DNJ0 Cluster: Sulfatase; n=1; Lentisphaera araneosa
HTCC2155|Rep: Sulfatase - Lentisphaera araneosa HTCC2155
Length = 630
Score = 99.5 bits (237), Expect = 2e-19
Identities = 79/280 (28%), Positives = 127/280 (45%), Gaps = 15/280 (5%)
Query: 7 YGAEPRGLPL-NEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRI 65
Y P G+ N+ + +LK+ GY T GKW++G K P GFD +
Sbjct: 105 YRGSPDGVVAKNDPTIAMWLKEAGYATAAYGKWNIGESKDVSWPGAHGFDDWL-IIDHNT 163
Query: 66 DMYDHTTMEQGSWGTD--FRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLA 123
+ H + G F G E +L G Y TD++TD+AI + K +P F+ L
Sbjct: 164 GYFQHKNANKDCEGRPMLFETGGERVTNLEGQYLTDIWTDKAIDFIQE-TKDQPFFIYLP 222
Query: 124 HSAVHSGNPYEPIRAPQKLIDA-FKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLE 182
S H+ +P P DA K R+ + ++ LD + ++ K+L +G +
Sbjct: 223 WSIPHTPLQ-DPASDPSLAFDAGAKPKTVEGREVYVKMVEYLDSHIARIFKSLKEQGKYD 281
Query: 183 NSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQK 242
N++++F++DNGG +A+ +PLK K L EGG+R + P V +
Sbjct: 282 NTLIIFTSDNGGMV------SANCWPLKKTKQHLEEGGIRVPFLMQWPSKIKAGTVDQRA 335
Query: 243 MHISDWLPTLYSAAGGDLSVLEN--LDGVNQWDALSKNTE 280
+ D T+ +AA V ++ LDGVN + +N E
Sbjct: 336 AIMMDASVTVLAAADAMKYVPKDRELDGVNLFANKEENRE 375
>UniRef50_A6DKM2 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
Lentisphaera araneosa HTCC2155|Rep:
N-acetylgalactosamine 6-sulfatase - Lentisphaera
araneosa HTCC2155
Length = 472
Score = 99.5 bits (237), Expect = 2e-19
Identities = 84/288 (29%), Positives = 133/288 (46%), Gaps = 31/288 (10%)
Query: 11 PRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDH 70
P +P L Q KD GY T + GKWHLG ++ + P GFD ++ F G +
Sbjct: 116 PYHMPEGTITLGQAFKDAGYATAMFGKWHLG-HRPQDQPDKMGFDEYLTF-QGMKHFAPY 173
Query: 71 TTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHS 129
T + G VY TD+ D+AI + +E P FL VH+
Sbjct: 174 TLPNKVQHGEK-------------VYLTDLTCDKAIDFMERKVAAEKPFFLYYPDFLVHA 220
Query: 130 GNPYEPIRAPQKLIDAF--KYIDDSARQKFAAVLSK-LDESVGKVVKALHTRGLLENSIV 186
P+ A Q +I F K I + A ++K LD++VG++VK + G+ EN+I+
Sbjct: 221 -----PMEAKQAMIQYFEKKTIGQHHKSVIGAAMTKHLDDTVGRLVKKVDELGIAENTII 275
Query: 187 VFSTDNGG----PAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQK 242
+F++DNGG G+ D SNYP + K++ +EGG R P + ++++
Sbjct: 276 IFTSDNGGLGYKSDGGYGDKGTSNYPYRSAKSSHYEGGSRVPLIFHWPGVTEANSLSHEV 335
Query: 243 MHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTES--PRTSVLH 288
+ D PTL A + LDG++ + ++ KN + P + H
Sbjct: 336 VSGIDIYPTLLKIAQVAKPQEQILDGID-FSSILKNPKQKLPARDLFH 382
>UniRef50_Q7UJ66 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
n=1; Pirellula sp.|Rep: N-acetylgalactosamine 6-sulfate
sulfatase - Rhodopirellula baltica
Length = 616
Score = 98.7 bits (235), Expect = 3e-19
Identities = 77/282 (27%), Positives = 126/282 (44%), Gaps = 17/282 (6%)
Query: 17 NEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQG 76
+E + + ++ GY+T + GKWHLG + P RG ++ V G D + T
Sbjct: 135 DETTMAETFRESGYRTGMFGKWHLGD-PPPFAPRERGLETVVRHMAGGADEIGNPTGNDY 193
Query: 77 SWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPI 136
T +R G + F Y TD++ +EAI + ++ +P F + +A+HS P
Sbjct: 194 FDDTYYRNG---TPESFDGYCTDIWFEEAIDFIQKESE-QPFFAYIPTNAMHS-----PY 244
Query: 137 RAPQKLIDAFKYID-DSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGP 195
+ D FK + R F ++ DE++G+++K L L +N++++F +DNG
Sbjct: 245 LVADRYSDPFKRQGIEPQRAAFYGMIQNFDENLGRLLKRLDQDNLRDNTMLIFMSDNGTA 304
Query: 196 --AAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWLPTL 252
A+ N N ++G K +++EGG R F W D V H DWLPTL
Sbjct: 305 QGASEQNRKVGFNAGMRGKKGSVYEGGHRVPCFASWPAKWDGNRPVDQLTCH-RDWLPTL 363
Query: 253 YSAAGGDLSVLENLDGVNQWDALSKNTES--PRTSVLHNIDD 292
DG + LS +++ RT V+ D
Sbjct: 364 IELCDLKRPADVTFDGRSMAGLLSHSSQQWPERTLVIERQPD 405
>UniRef50_Q7UYD6 Cluster: N-acetyl-galactosamine-6-sulfatase; n=3;
Bacteria|Rep: N-acetyl-galactosamine-6-sulfatase -
Rhodopirellula baltica
Length = 889
Score = 97.5 bits (232), Expect = 6e-19
Identities = 77/262 (29%), Positives = 125/262 (47%), Gaps = 23/262 (8%)
Query: 21 LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGT 80
L + +D GY T GKWHLG + Y PL GFD V G GS+
Sbjct: 376 LAEMFRDNGYATGHFGKWHLGP--EPYSPLEHGFDVDVPHHPG--------PGPAGSYVA 425
Query: 81 DFR-RGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAP 139
++ + F+ + + D EA++ + H +EP FL +VH+ P A
Sbjct: 426 PWKFKDFDHDPVIPDEHLEDRMAKEAVRFLEQHT-NEPFFLNYWMFSVHA-----PFDAK 479
Query: 140 QKLIDAFK-YIDDSARQK---FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGP 195
++LI+ ++ +D Q+ +AA++ +D+++G ++ L G+ + +I+VF++DNGG
Sbjct: 480 KELIEEYRDRVDPKDPQRCPTYAAMIESMDDAIGTLLDTLDRLGIADETIIVFASDNGGN 539
Query: 196 AAGFND--NAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLY 253
D A SN PL+G K T++EGGVRG + P + + + D+ PTL
Sbjct: 540 MYNEVDGTTATSNAPLRGGKATMYEGGVRGPAIVVQPGVVESGSRSDAIIQSIDFYPTLL 599
Query: 254 SAAGGDLSVLENLDGVNQWDAL 275
D + DGV+ AL
Sbjct: 600 EMLAIDAQPNQRFDGVSIVPAL 621
>UniRef50_A6LCL3 Cluster: Arylsulfatase A; n=1; Parabacteroides
distasonis ATCC 8503|Rep: Arylsulfatase A -
Parabacteroides distasonis (strain ATCC 8503 / DSM 20701
/ NCTC11152)
Length = 476
Score = 97.1 bits (231), Expect = 8e-19
Identities = 71/215 (33%), Positives = 111/215 (51%), Gaps = 21/215 (9%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
G+ E + + LK GY T + GKWHLGS +KE+LPL GFD + G DM+
Sbjct: 101 GVHPEEMTIAEVLKQKGYSTAIFGKWHLGS-QKEFLPLQNGFDEYYGLPYSN-DMWPFHP 158
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATD---VYTDEAIKVVN--SHNKSEPLFLMLAHSAV 127
+ + ++ +++ G Y TD + TD + VN NK++P FL LAH+
Sbjct: 159 QQGEVFNFPDLPTYD-GNEIIG-YNTDQTRLTTDYTTRSVNFIKKNKNKPFFLYLAHNMP 216
Query: 128 HSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVV 187
H P + D FK S + + V+ ++D SVG++ KAL GL +N++V+
Sbjct: 217 H---------VPLAVSDKFK--GKSEQGLYGDVMMEIDWSVGEIFKALRELGLEDNTLVI 265
Query: 188 FSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVR 222
++DN GP + ++A S L+ K T ++GG R
Sbjct: 266 LTSDN-GPWTNYGNHAGSAGGLREAKATTFDGGNR 299
>UniRef50_A4A2W0 Cluster: Arylsulfatase A; n=1; Blastopirellula
marina DSM 3645|Rep: Arylsulfatase A - Blastopirellula
marina DSM 3645
Length = 477
Score = 97.1 bits (231), Expect = 8e-19
Identities = 86/296 (29%), Positives = 139/296 (46%), Gaps = 36/296 (12%)
Query: 5 VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGR 64
V++ G+ NE + + +K+ GY T ++GKWHLG + ++LP +GFD + G
Sbjct: 100 VLFPGNAEGMSPNEVTIAELMKEQGYATAIIGKWHLGD-QPDFLPTRQGFDYYYGLPYSN 158
Query: 65 IDMYDHTTMEQGSWGTDF--RRGF-----------EVAHDLFGVYATDV---YTDEAIKV 108
DM + ++G R+G V + T++ YT+EAI+
Sbjct: 159 -DMGPAADGVKSNYGAPIPQRKGKGQPPLPLLRNETVLQRVLAKDQTELVTNYTEEAIQF 217
Query: 109 VNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESV 168
+ H + +P FL L HSAVH P DAF+ ++ + + ++D SV
Sbjct: 218 IRDHQE-KPFFLYLPHSAVHF---------PMYPGDAFR--GKNSHGLYNDWVEEVDWSV 265
Query: 169 GKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLW 228
G+V++AL GL + ++V+F++DNGG A N PL+ K T +EGG+R +
Sbjct: 266 GQVLQALKDLGLDQRTLVIFTSDNGGQTR----FGAVNKPLRAGKATTYEGGMRVPTIVR 321
Query: 229 SPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALS--KNTESP 282
P + + + D LPTL AGG +DG + L+ K +SP
Sbjct: 322 WPGKVPAGSSSDAVVGMIDVLPTLVKLAGGTTPTDRKIDGADIGPILAGVKEAKSP 377
>UniRef50_Q01N83 Cluster: Sulfatase precursor; n=1; Solibacter
usitatus Ellin6076|Rep: Sulfatase precursor - Solibacter
usitatus (strain Ellin6076)
Length = 461
Score = 96.7 bits (230), Expect = 1e-18
Identities = 96/321 (29%), Positives = 135/321 (42%), Gaps = 32/321 (9%)
Query: 5 VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGR 64
V+ G GLP +E + Q LK GY+T +GKWH+GS YLP NRGFD G
Sbjct: 95 VVLGPGDAGLPDSEITMAQVLKSAGYRTSCIGKWHIGS-TPGYLPTNRGFDEFFGV-PYS 152
Query: 65 IDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAH 124
D+ M S VA + T +T EA+ + + P FL LAH
Sbjct: 153 ADITPCPLMRGSS---------VVAPAVDCSTLTSSFTQEALDFMR-RAQDNPFFLYLAH 202
Query: 125 SAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENS 184
+A P+ P+ A + + S +A V+ +LD S G+V+ AL GL N+
Sbjct: 203 TA-----PHLPLAASPR------FAGQSGLGMYADVVQELDWSTGQVMAALKATGLDSNT 251
Query: 185 IVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMH 244
+V+FS+DNG G S L+G K +EGG+R P +
Sbjct: 252 LVMFSSDNGPWYQG------SQGKLRGRKGETYEGGMREPFLARYPGVIPSGIGCAGLAT 305
Query: 245 ISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALTVDKY 304
D LPTL AG + LDGV+ W L+ V D ++ + + ++
Sbjct: 306 TMDLLPTLARLAGAQ-TPSNPLDGVDIWPVLTGERAEVDRDVFLYFDAVY-LQCARLGRW 363
Query: 305 KLIKGTIYKGVWDNWYGPSGR 325
KL W P GR
Sbjct: 364 KLHLSRYNTKAWSP-LPPGGR 383
>UniRef50_Q7UZ43 Cluster: N-acetylgalactosamine-4-sulfatase; n=1;
Pirellula sp.|Rep: N-acetylgalactosamine-4-sulfatase -
Rhodopirellula baltica
Length = 608
Score = 96.3 bits (229), Expect = 1e-18
Identities = 89/306 (29%), Positives = 141/306 (46%), Gaps = 31/306 (10%)
Query: 17 NEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDS---HVGFWTGRI-DMYDHTT 72
NE + D GY+T + GKWHLG Y + GF H G G+ D +D+
Sbjct: 110 NEVTFGEIFSDAGYQTGMFGKWHLGD-NYPYRAEDNGFTEVYRHGGGGVGQTPDFWDNAY 168
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKS-EPLFLMLAHSAVHSGN 131
+ G+ F G V + F TDV+ E + + ++ EP F +A +A
Sbjct: 169 FD----GSYFHNGKAVKAEGF---CTDVFFKEGNRFIRECVEADEPFFAYIATNA----- 216
Query: 132 PYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
P+ P+ APQK ID + ++D+ F +++ +D++VG+ K L G+ +N+I +F+TD
Sbjct: 217 PHGPLHAPQKYIDMYPEMNDNVAT-FFGMITNVDDNVGQTRKLLRELGVHDNTIFIFTTD 275
Query: 192 NGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLD-SKARVAYQKMHISDWLP 250
NG AG + N ++G K + +EGG R + P +K+R H D +P
Sbjct: 276 NG--TAG--GASVYNAGMRGKKGSPYEGGHRVPFVMHYPEGGFAKSRTNNTLCHAVDVVP 331
Query: 251 TLYSAAGGDLSVLENLDGVNQWDALSKNTESP------RTSVLHNIDDI-WGIAALTVDK 303
TL G + DG + L +S T ID I W +++ DK
Sbjct: 332 TLLDMCGVEAPESVKFDGTSIVSLLKDEVDSSFNDRMLITDSQRVIDPIKWRQSSVMQDK 391
Query: 304 YKLIKG 309
++LI G
Sbjct: 392 WRLING 397
>UniRef50_Q7US96 Cluster: Arylsulphatase A; n=1; Pirellula sp.|Rep:
Arylsulphatase A - Rhodopirellula baltica
Length = 498
Score = 96.3 bits (229), Expect = 1e-18
Identities = 80/283 (28%), Positives = 130/283 (45%), Gaps = 24/283 (8%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
LPL+ + + LK GY T VGKWHLG+ E+ P +G+D +
Sbjct: 121 LPLDTVTIAESLKASGYTTGYVGKWHLGN-GPEFQPDRQGYDFSAVIGGPHLP------- 172
Query: 74 EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPY 133
G + R + + Y TD D I + NK +P FLML+ AVH P
Sbjct: 173 --GRYRVQGRSDLKPKPNQ---YRTDFEADLCIDFMRQ-NKDQPFFLMLSPFAVHI--PL 224
Query: 134 EPIRAPQKLIDAF-KYIDDSARQK-FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
+ + +A K +S +AA++ D+ VG++V +L + +++++VF++D
Sbjct: 225 AAMSEKVQKYEAMAKQTGNSLPHPVYAAMIEHCDDMVGRLVDSLEQLDIADDTMIVFTSD 284
Query: 192 NGGPAAGFN------DNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHI 245
NGG ++ D +S PLKG K +L EGG+R + P A V +
Sbjct: 285 NGGLYKRYDYRESADDLVSSQAPLKGEKGSLHEGGIRVPLIIRHPATVKSAGVCDEPTIS 344
Query: 246 SDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLH 288
D+ PT AGG+L + + +DG + ++ T++ LH
Sbjct: 345 HDFYPTFVEMAGGELPINQTIDGHSLLPLMTAPTQTLDRDALH 387
>UniRef50_Q7URY7 Cluster: Aryl-sulphate sulphohydrolase; n=1;
Pirellula sp.|Rep: Aryl-sulphate sulphohydrolase -
Rhodopirellula baltica
Length = 490
Score = 96.3 bits (229), Expect = 1e-18
Identities = 82/252 (32%), Positives = 126/252 (50%), Gaps = 33/252 (13%)
Query: 25 LKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHV-GFWTGRIDMYDHTTMEQGSWGTDFR 83
++D GY+T ++GKWHL PL GFD +V G +G + +G + +
Sbjct: 136 VRDAGYRTGIIGKWHLSDD-----PLPYGFDINVAGTHSG--------SPPKGYFPPHPK 182
Query: 84 -RGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKL 142
G + D Y TD TDEAI + + N+ FL L+H AVH+ P++A L
Sbjct: 183 VPGLQDTSD--DEYLTDRLTDEAIGFIEA-NQEWSWFLYLSHFAVHT-----PLQAKPDL 234
Query: 143 IDAFKYIDDSARQK---FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGF 199
+ +K AA++ +DE VG++V+ L GL EN+ +VF++DNG GF
Sbjct: 235 VAKYKAKQPGTLHDHAVMAAMIESVDEGVGRMVETLRELGLEENTAIVFTSDNG----GF 290
Query: 200 NDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWLPTLYSAAGG 258
A S PL+G K T +EGG+R F+ W ++D+ + + + +D PT G
Sbjct: 291 GP-ATSMKPLRGYKGTYYEGGIREPFFVTWPGVVDAGTK-SDVPVIAADLYPTFIEMTGA 348
Query: 259 DLSVLENLDGVN 270
L + LDGV+
Sbjct: 349 KLPADQPLDGVS 360
>UniRef50_Q7ULE7 Cluster: Iduronate-sulfatase and sulfatase 1; n=1;
Pirellula sp.|Rep: Iduronate-sulfatase and sulfatase 1 -
Rhodopirellula baltica
Length = 1049
Score = 95.5 bits (227), Expect = 3e-18
Identities = 91/327 (27%), Positives = 150/327 (45%), Gaps = 43/327 (13%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLG------SYKKEYLPLNRGFDSH-VGFWTGRID 66
LP N + ++L+ GYKT VGKWHL + + LP G V +I+
Sbjct: 657 LPTNAVTIAEHLQPKGYKTGFVGKWHLEPNVTCIDWMRRELPAMAGKPRRKVRIPWNKIE 716
Query: 67 MYDHTTM--EQGSWG--TDFRRGFEVAH-DLFGV--------YATDVYTDEAIKVVNSHN 113
Y + ++ WG T++R F++ +L + DV T+ A+K + N
Sbjct: 717 PYSPSQQGFDEYYWGERTNYRTNFDLTSGELLAEMKPIRDERFRIDVQTNAAVKFIQ-RN 775
Query: 114 KSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVK 173
+P +L L + P+ P+ A QK +D F R+ A++S +D+ VG++V
Sbjct: 776 HDQPFYLQLNYYG-----PHTPLEATQKYLDRFPGPMPERRRYALAMISAIDDGVGQIVD 830
Query: 174 ALHTRGLLENSIVVFSTDNGGPA------AGFNDNAAS-----NYPLKGVKNTLWEGGVR 222
L G+L+N+++V ++DNG P + N +A N P G K L EGG+R
Sbjct: 831 QLKAEGVLDNTLIVMTSDNGAPLKMTKTDSPINGDAGGWDGSLNDPWVGEKGMLSEGGIR 890
Query: 223 GAGFLWSPLLDSKARVAYQ-KMHISDWLPTLYSAAGGDL-SVLENLDGVNQWDALSKNTE 280
+WS + + Y + D P++ AGG+L S DG++ L+ + +
Sbjct: 891 -VPMIWSLPTQLPSGITYDWPVSALDIAPSVLKLAGGELPSGDAAFDGIDLIPRLN-DIQ 948
Query: 281 SPRTSVLHNIDDIWGIAALTVDKYKLI 307
+P T L+ W AA+ K+K I
Sbjct: 949 NPPTRTLY--FRFWDQAAIRRGKWKYI 973
>UniRef50_Q64P90 Cluster: Putative secreted sulfatase ydeN; n=2;
Bacteroides fragilis|Rep: Putative secreted sulfatase
ydeN - Bacteroides fragilis
Length = 493
Score = 95.5 bits (227), Expect = 3e-18
Identities = 82/275 (29%), Positives = 128/275 (46%), Gaps = 30/275 (10%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
L +E + + + GY T + GKWHL EY P GFD ++G G +
Sbjct: 120 LSKDEITMAEAFRQNGYSTFMAGKWHLAE-SAEYYPEQNGFDINIG---GNNTGHPSKGY 175
Query: 74 EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSG--N 131
+ G E G Y TD TDE I+ + S K +P F+ L++ VH
Sbjct: 176 FSPYGNPQLKDGPE------GEYLTDRLTDEVIRYI-SEPKEKPFFVYLSYYTVHLPLQA 228
Query: 132 PYEPIRAPQKLIDAFKYIDDSARQK-------------FAAVLSKLDESVGKVVKALHTR 178
E I ++ + D S +K +AA++ LDE++G+++ LH
Sbjct: 229 KAEKIAKYRRKLSRAVPADSSFVKKGETYHKLVQDIPAYAAMVESLDENIGRLLDTLHRS 288
Query: 179 GLLENSIVVFSTDNGGPAAG--FNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSK 235
GL E +IVVF++DNGG A + SN PL+ K L+EGG++ + WS L +
Sbjct: 289 GLDERTIVVFTSDNGGMATSNTTRNIPTSNLPLRAGKGYLYEGGIKVPAIIRWSGHLKGR 348
Query: 236 ARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
+V+ + +D+ PTL G L +++DGV+
Sbjct: 349 -QVSDTPIIGTDYYPTLLDLCGLPLLPGQHVDGVS 382
>UniRef50_A6LIX6 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
Parabacteroides distasonis ATCC 8503|Rep:
N-acetylgalactosamine 6-sulfatase - Parabacteroides
distasonis (strain ATCC 8503 / DSM 20701 / NCTC11152)
Length = 589
Score = 95.5 bits (227), Expect = 3e-18
Identities = 79/274 (28%), Positives = 129/274 (47%), Gaps = 27/274 (9%)
Query: 16 LNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQ 75
L EK + +Y ++ GY T L GKWH G+ + Y P RGF+ GF +G Y + +E
Sbjct: 104 LGEKTIAEYFREAGYATSLFGKWHSGT-QYPYHPNARGFEEFYGFCSGHWGNYWNPVLE- 161
Query: 76 GSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEP 135
G ++ + F + D TD+A+ + H K P F+ L+++ HS
Sbjct: 162 -------HNGEIISGEGFII---DDLTDKALDYIRDH-KEHPFFMFLSYNTPHSPMQVPD 210
Query: 136 I---RAPQKLID---AFKYIDDSARQKFAAVLSK-LDESVGKVVKALHTRGLLENSIVVF 188
R + + F +D+ K A L++ LD ++G+V+ LH+ L + +IV++
Sbjct: 211 SWWNRVKDRTLSQRATFPEQEDTTFTKAALALAENLDWNIGRVLSLLHSLDLEQETIVIY 270
Query: 189 STDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDW 248
+DNG + +N +KG K + EGGVR + P K V Q D
Sbjct: 271 FSDNGPNSFRWNGG------MKGRKGSTDEGGVRSPFCIRWPGHIRKGAVETQLSGAIDL 324
Query: 249 LPTLYSAAGGDLSVLENLDGVNQWDALSKNTESP 282
+PTL AG + + L LDG++ W + ++P
Sbjct: 325 IPTLLGLAGIEYTPLRKLDGID-WGQRLLDEKAP 357
>UniRef50_A0JAV8 Cluster: Sulfatase precursor; n=1; Shewanella
woodyi ATCC 51908|Rep: Sulfatase precursor - Shewanella
woodyi ATCC 51908
Length = 365
Score = 95.1 bits (226), Expect = 3e-18
Identities = 67/213 (31%), Positives = 105/213 (49%), Gaps = 26/213 (12%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
LPL + + +K LGY T GKWHLGS ++Y P+ +GFD G T H
Sbjct: 159 LPLEVTSIAEAVKPLGYYTAFSGKWHLGS--EDYFPIKQGFDEQFGVSTA-----GHPKS 211
Query: 74 EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPY 133
+ +R + A G T+ TD+ + +N ++K +P L + +VH+ P+
Sbjct: 212 YHAPFWEAYRNPYPDAPK--GKNLTERLTDDVVNFINGYDKDQPFMLTNFYYSVHT--PH 267
Query: 134 E-PIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDN 192
+ P A QK +D D F +++ LD SVG++++AL G +N++V+F +D
Sbjct: 268 QGPKAATQKYLDRGL---DKRYANFGSMVESLDTSVGRILQALEDSGQADNTVVIFYSDQ 324
Query: 193 GGPAAGFNDNAASNYPLKGVK---NTLWEGGVR 222
GG +N PL+G K L+EGG R
Sbjct: 325 GG--------YFTNAPLRGGKIGGRALYEGGAR 349
>UniRef50_Q7UL93 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
Pirellula sp.|Rep: N-acetylgalactosamine 6-sulfatase -
Rhodopirellula baltica
Length = 470
Score = 94.3 bits (224), Expect = 6e-18
Identities = 81/258 (31%), Positives = 119/258 (46%), Gaps = 26/258 (10%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVG-FWTGRIDMYDHTT 72
LP + + LK GY T GKWHLG KK Y P GFD +VG G Y
Sbjct: 139 LPHETTTMAERLKAAGYTTGFFGKWHLGGDKK-YWPTEHGFDVNVGGCGLGGPPTY---- 193
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
D R + G Y TD DE I + K +P+F+ L + NP
Sbjct: 194 -------FDPYRIPALPPRKEGEYLTDRLADETIAFMR-REKDKPMFVCL-----WTYNP 240
Query: 133 YEPIRAPQKLIDAFKYIDDSARQK--FAAVLSKLDESVGKVVKALHTRGLLENSIVVFST 190
+ P AP+ LI+ +K + + + + + D VG+V++ L + G+ + ++VVF++
Sbjct: 241 HYPFEAPEDLIEHYKGKEGTGLKNPIYGGQIEATDRGVGRVLRELDSLGIADETLVVFTS 300
Query: 191 DNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLP 250
DNGG + A N PL+ K L+EGG+R + P + A V + D
Sbjct: 301 DNGGWS-----GATDNRPLREGKGFLFEGGLRVPLIVRWPGVTEAATVNETPVVSMDLTA 355
Query: 251 TLYSAAGGDLSVLENLDG 268
T+ AAG L+ E+LDG
Sbjct: 356 TILDAAGVSLANGESLDG 373
>UniRef50_A3XZF1 Cluster: Sulfatase family protein; n=5;
Proteobacteria|Rep: Sulfatase family protein - Vibrio
sp. MED222
Length = 500
Score = 94.3 bits (224), Expect = 6e-18
Identities = 84/312 (26%), Positives = 132/312 (42%), Gaps = 32/312 (10%)
Query: 3 HGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWT 62
H V P GL + LP+ LK +GY T GK HLG + E+LP GFD + G W
Sbjct: 95 HSVGLPGGPVGLSADTPTLPEILKTMGYVTGQFGKNHLGD-RDEFLPTMHGFDEYWG-WL 152
Query: 63 GRIDMYDHTTMEQGSWGTDFRR-GFEVAHDLF----GVYATDVYTDEAIKVVNSHNKSEP 117
++ ++T E W D F + ++ G + D A+ + +
Sbjct: 153 YHLNAMEYT--EDPDWPKDGSLDAFAPRNVIYARSDGKGGQTIEDDGALSIERMRTLDDE 210
Query: 118 L---FLMLAHSAVHSGNPYEPIRAPQK------LIDAFK-YIDDSARQKFAAVLSKLDES 167
+ + AV + P+ P + L ++ + + V+ LD+
Sbjct: 211 VNKHAINFIERAVEADKPFFTWYCPSRGHVWTHLSPEYEAMLGQNGWGLQEVVMKDLDDH 270
Query: 168 VGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL 227
VG+++ + G+ +N+I++F+ DNG + D + P G K T WEGGVR +
Sbjct: 271 VGEMMAKMEELGIADNTIIIFTADNGPEIMTWPDGGMT--PYHGEKGTTWEGGVRAPALV 328
Query: 228 WSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLE-----------NLDGVNQWDALS 276
P V DWLPTL +AAGG + E +LDG NQ D L+
Sbjct: 329 SWPGKIPAGTVGNGIFDGMDWLPTLVAAAGGPTDLKEKLLKGHDGFKAHLDGYNQVDMLT 388
Query: 277 KNTESPRTSVLH 288
+ ES R + +
Sbjct: 389 EKGESNRKEIYY 400
>UniRef50_A6DS95 Cluster: Arylsulfatase A; n=2; Lentisphaera
araneosa HTCC2155|Rep: Arylsulfatase A - Lentisphaera
araneosa HTCC2155
Length = 491
Score = 93.9 bits (223), Expect = 8e-18
Identities = 89/297 (29%), Positives = 144/297 (48%), Gaps = 35/297 (11%)
Query: 2 QHGVIYGAEPRG-LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
++GV + +PR L + LK+ GYKT VGKWHLG+ K Y P RGFD +
Sbjct: 90 RNGVTHTVQPREKLYKGALTIADILKEGGYKTGFVGKWHLGN-DKGYAPQYRGFDWYAKN 148
Query: 61 WTGRIDMYDHTTMEQGSWGTDFR-RGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLF 119
G ++H +E G F+ +GF D + DEA+ + + +P F
Sbjct: 149 AKG---PHNHFDVEMIRNGKRFQTKGFR----------EDAFFDEAMTFMKEAGE-QPFF 194
Query: 120 LMLAHSAVHSGNPYEPIRAPQKLIDAFKY--IDDSARQKFAAVLSKLDESVGKVVKALHT 177
L L + +P+ P+ AP+ L+ +K ++D+ + A++ +D+++G++ + L
Sbjct: 195 LYLC-----TYSPHTPLGAPEDLLKKYKAKGLNDN-HAAYLAMIENIDDNLGRLDQFLKK 248
Query: 178 RGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWS-PLLDSKA 236
L +++I++F DN G G + N ++G K T+WEGG R A LW P
Sbjct: 249 ENLYDDTILIFMNDN-GVTVGLD---VYNADMRGPKCTIWEGGTR-AFSLWRWPKKWQPK 303
Query: 237 RVAYQKMHISDWLPTLYSAAGGDL--SVLENLDGVNQWDALS-KNTESPRTSVLHNI 290
V H+ D LPTL AG D+ V L+G + L+ K+ E + HN+
Sbjct: 304 TVENLTAHL-DVLPTLCELAGVDVPEKVQGELEGYSLSPLLNGKDWEHNNRLLFHNV 359
>UniRef50_A3J5W3 Cluster: Putative arylsulfatase; n=1; Flavobacteria
bacterium BAL38|Rep: Putative arylsulfatase -
Flavobacteria bacterium BAL38
Length = 468
Score = 93.9 bits (223), Expect = 8e-18
Identities = 80/294 (27%), Positives = 141/294 (47%), Gaps = 37/294 (12%)
Query: 8 GAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDM 67
G EP +P +E + + LK GY T GKW LG E P N+GFD G+ G+I
Sbjct: 105 GNEP--IPASEITVAEILKTAGYTTGAFGKWGLGYPASEGSPNNQGFDQFYGY-NGQIHA 161
Query: 68 YDH-TTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSA 126
+++ T+ + + + + + VY+ D+ D A++ V NK+ P FL +
Sbjct: 162 HNYFTSYLRKNDLVELNANIDAP---YSVYSADIIKDRALEFVEV-NKNNPFFLYFCPTL 217
Query: 127 VHSGNPYEPIRAPQKLIDAFK----------YIDDSARQKFAAVLSKLDESVGKVVKALH 176
H NPY + K ++ + + ++ + K+AA+ S+LD+ VG+++ L
Sbjct: 218 PH--NPYH--QPDDKTLEYYAKKTGFPIGDAHSEEFSVPKYAALSSRLDQQVGEIMAKLK 273
Query: 177 TRGLLENSIVVFSTDNGGPAAGFNDN-AASNYPLKGVKNTLWEGGVRGAGFLWSPLL--- 232
LL+N++++F++DNG D+ + L+G K+ ++EGG++ SPL+
Sbjct: 274 ELNLLDNTLIIFASDNGSALTKEEDSYLRTGGDLRGRKSEVYEGGIK------SPLIAFW 327
Query: 233 DSKARVAYQKMHIS---DWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPR 283
K HIS D+LPT +N+DG++ L T++ +
Sbjct: 328 KGKIIPGSSSNHISAFWDFLPTCAEIV--KAKTPDNIDGISYLPTLLGKTDNQK 379
>UniRef50_Q7UHK0 Cluster: Arylsulphatase A; n=1; Pirellula sp.|Rep:
Arylsulphatase A - Rhodopirellula baltica
Length = 478
Score = 93.5 bits (222), Expect = 1e-17
Identities = 79/273 (28%), Positives = 130/273 (47%), Gaps = 29/273 (10%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
G +E +P+ L GY++ +VGKWHLG + PL+ GFD ++G + Y+
Sbjct: 124 GFAPDEITIPELLGPAGYRSLMVGKWHLGMELEGSHPLDAGFDEYLGIPSN----YE--- 176
Query: 73 MEQGSWGTDFRRGFEVAH-DLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGN 131
+G RG +V ++ T YTDE I + K +P F+ ++H VH N
Sbjct: 177 PRRGKNHNTLYRGKQVEQKNVACEELTKRYTDEVIDFI-ERQKDDPFFIYVSHHIVH--N 233
Query: 132 PYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
P +P +P ++ S + K+ + +LD S G++++ + GL EN++V+F++D
Sbjct: 234 PLKP--SPD-------FVGTSEKGKYGDFIKELDHSTGRIMQTIRDAGLDENTLVIFTSD 284
Query: 192 NGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAG-FLWSPLLDSKARVAYQKMHISDWLP 250
NG G S+ L G K EGG R G F W+ + + +V+ + D LP
Sbjct: 285 NGPTRNG------SSGELSGGKYCTMEGGHRVPGMFRWTSKI-APNQVSDVTLTSMDLLP 337
Query: 251 TLYSAAGGDLSVLENLDGVNQWDA-LSKNTESP 282
AG + +DG + L + +ESP
Sbjct: 338 LFCELAGVPIPDDRQIDGKSILPVLLGQTSESP 370
>UniRef50_A6KWS8 Cluster: Arylsulfatase; n=1; Bacteroides vulgatus
ATCC 8482|Rep: Arylsulfatase - Bacteroides vulgatus
(strain ATCC 8482 / DSM 1447 / NCTC 11154)
Length = 464
Score = 93.5 bits (222), Expect = 1e-17
Identities = 84/311 (27%), Positives = 144/311 (46%), Gaps = 29/311 (9%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
LP E + K Y T VGKW +G E +P GFD G+ R + H
Sbjct: 114 LPAGEVTVADIFKTKNYVTGCVGKWGMGGPGTEGMPGKHGFDYFYGYLGQR---FAH--- 167
Query: 74 EQGSWGTDFRRGFEVAHDLFG-VYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSG-- 130
S+ +F E L G Y+ D+ ++A+ ++ N +P FL + + H+
Sbjct: 168 ---SYYPEFLHENEQKIMLDGKYYSHDLMLEKALNFIDE-NAQKPFFLYFSPTIPHADLD 223
Query: 131 ------NPYEPIRAPQKL---IDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLL 181
YE D +K + R +AA+++ LD+SVG ++K L +GL
Sbjct: 224 IMGEAMTEYEGEFCETPFGGSRDGYKS-QQNPRAAYAAMVTYLDKSVGLIIKELKEKGLY 282
Query: 182 ENSIVVFSTDNGGPAAGFNDNA--ASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVA 239
+++I+VF++DNG + G +D + SN P +G K L+EGG+R + P + + V
Sbjct: 283 DHTIIVFTSDNGVHSEGGHDPSYFDSNGPFRGQKRDLYEGGIRTPFVIQWPGVIPQGVVT 342
Query: 240 YQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALS-KNTESPRTSVLHNIDDIWG-IA 297
D+LPT+ D+ +N+DG++ L+ K T+ + + + G +
Sbjct: 343 NHISAFWDFLPTIGELVQADIP--QNIDGISYLPTLTGKGTQKEHDCIYYEFFEFGGKQS 400
Query: 298 ALTVDKYKLIK 308
+T D +KL++
Sbjct: 401 IMTPDGWKLVR 411
>UniRef50_A6DSM5 Cluster: Arylsulfatase A; n=1; Lentisphaera
araneosa HTCC2155|Rep: Arylsulfatase A - Lentisphaera
araneosa HTCC2155
Length = 401
Score = 93.5 bits (222), Expect = 1e-17
Identities = 81/285 (28%), Positives = 128/285 (44%), Gaps = 26/285 (9%)
Query: 7 YGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRID 66
+G RG P + + LK GY T GKWH G PL RGFD H G D
Sbjct: 40 HGPNGRGPPTEFATIAEPLKKSGYNTVHFGKWHCGDTNATR-PLARGFDEHAGLMYSN-D 97
Query: 67 MYDHTTMEQGSWGTDFRRGFEVAH-DLFGVYATDVY------TDEAIKVVNSHNKSEPLF 119
M+ M+ WG R + ++ + D T++++ + NK +P F
Sbjct: 98 MWHLHPMQPKHWGKFPLRFWNNGEIEIEDIQPKDQKNLTKWATEKSVDFIK-RNKDQPFF 156
Query: 120 LMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRG 179
L HS H P + F+ I S + + VL++LD SVG++ +AL G
Sbjct: 157 LYTTHSMPH---------VPLYVSKEFEGI--SGQGLYGDVLAELDWSVGQINQALKDNG 205
Query: 180 LLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVA 239
+ + ++++FS+DN GP AG+ D+A P + K T ++GG R + P + +
Sbjct: 206 IEDKTMIIFSSDN-GPWAGYGDHAGKP-PYREAKATSFDGGTRSPLIVKYPKMIPPNSAS 263
Query: 240 YQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALS--KNTESP 282
+ D +PT+ AGG +DG N D ++ K ++P
Sbjct: 264 KKVFCSIDLMPTILDLAGGP-HPDNKIDGKNVLDLMTDKKGAKNP 307
>UniRef50_A6DGL0 Cluster: Arylsulfatase A; n=1; Lentisphaera
araneosa HTCC2155|Rep: Arylsulfatase A - Lentisphaera
araneosa HTCC2155
Length = 506
Score = 93.5 bits (222), Expect = 1e-17
Identities = 73/252 (28%), Positives = 115/252 (45%), Gaps = 22/252 (8%)
Query: 22 PQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGTD 81
PQ L+ GYKT L GKWHLG +EY P NRGFD + G I Y+ + +
Sbjct: 116 PQALQKSGYKTGLFGKWHLGD-GEEYRPQNRGFDEVLMHGAGGIGQYNFGDFKPNATNKY 174
Query: 82 FRRGFEVAHDLFGV--YATDVYTDEAIK-VVNSHNKSEPLFLMLAHSAVHSGNPYEPIRA 138
F + + TDV+ A+ + H ++ F ++ +A P+ P+ A
Sbjct: 175 FDNVLLHNDTIVQTKGFCTDVFFKAALSWIKKQHENNQTYFAYISLNA-----PHGPLIA 229
Query: 139 PQKLIDAFKYIDDSARQKFAA---VLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGP 195
P+K ++ID+ Q AA ++ +D++ G +V+ L L+N++++F TDNG
Sbjct: 230 PEKY--KKRFIDEGYNQSVAARYGMIENIDDNFGLMVEKLKEWKALDNTLIIFMTDNGMA 287
Query: 196 AAGFNDNA------ASNYPLKGVKNTLWEGGVRGAGF-LWSPLLDSKARVAYQKMHISDW 248
A N +KG K++ WEGG R F W +L ++ HI D
Sbjct: 288 MKSIGKKGVKGKFNAWNAGMKGHKDSAWEGGSRVPSFWYWKGVLGEGVDISALSAHI-DL 346
Query: 249 LPTLYSAAGGDL 260
T AG ++
Sbjct: 347 YRTFCELAGTNI 358
>UniRef50_A6C4Q9 Cluster: Arylsulphatase A; n=1; Planctomyces maris
DSM 8797|Rep: Arylsulphatase A - Planctomyces maris DSM
8797
Length = 490
Score = 93.1 bits (221), Expect = 1e-17
Identities = 76/266 (28%), Positives = 125/266 (46%), Gaps = 34/266 (12%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
LPL + L+ Y T GKWHLG + + P +G+ T++
Sbjct: 124 LPLEIVTPGELLQSANYNTAYFGKWHLGP--ESHNPDQQGYQ---------------TSL 166
Query: 74 EQGSWGTDFRRGFEVAHDLF---GVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSG 130
G G F F Y D TD+ I+ + NKS+P F+ L+H AVH
Sbjct: 167 VTG--GRHFAPRFRTTPSTRIPNKAYLADFLTDKTIEFIRQ-NKSKPFFVQLSHYAVHI- 222
Query: 131 NPYEPIRAPQKLIDAFKYIDDSA----RQKFAAVLSKLDESVGKVVKALHTRGLLENSIV 186
P+ A Q++I ++ A +AA+++ +D+SVG++V AL L EN++V
Sbjct: 223 ----PLEAKQQMIRKYQQKPKPAYGINNPVYAAMVAHVDDSVGRIVAALEELKLTENTVV 278
Query: 187 VFSTDNGGPAAGFN--DNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMH 244
+F++DNGG F+ D ++N PL+ K +L+EGG+R + P + + + +
Sbjct: 279 IFTSDNGGLRQSFSGGDIVSTNAPLRDEKGSLYEGGIRVPLIIKWPGVAAAGKTCAEPTI 338
Query: 245 ISDWLPTLYSAAGGDLSVLENLDGVN 270
D+ PT A L + +DG++
Sbjct: 339 SIDFWPTFAEIAHTTLQEHQTIDGLS 364
>UniRef50_Q7URW3 Cluster: N-acetylgalactosamine-4-sulfatase; n=1;
Pirellula sp.|Rep: N-acetylgalactosamine-4-sulfatase -
Rhodopirellula baltica
Length = 480
Score = 92.7 bits (220), Expect = 2e-17
Identities = 67/191 (35%), Positives = 105/191 (54%), Gaps = 13/191 (6%)
Query: 96 YATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQ 155
Y TD TD+AI + + S+P ++++++AVHS P++A + A + IDD R+
Sbjct: 229 YLTDSITDKAIDAI-TQTASKPFAMVVSYNAVHS-----PMQASLEDHAAMELIDDPQRR 282
Query: 156 KFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNT 215
FA +L LD VG++++ L + L ++++VVF +DNGGP A + +SN PL+G K +
Sbjct: 283 IFAGMLIALDRGVGRIIEKLDQQKLRQDTLVVFFSDNGGPTA---ELTSSNAPLRGGKGS 339
Query: 216 LWEGGVRGAGFLWSPLLDSKARVAYQKMHIS-DWLPTLYSAAGGDLSVLENLDGVNQWDA 274
L+EGGVR +WS A +S D + A G+ S LE DG N
Sbjct: 340 LYEGGVR-IPMIWSMPGTIPAGAEEDTPILSLDIAASFLPLAVGEASQLET-DGTNVLPW 397
Query: 275 LSKNT-ESPRT 284
+ + T + PRT
Sbjct: 398 IGRGTFKLPRT 408
Score = 51.6 bits (118), Expect = 4e-05
Identities = 21/48 (43%), Positives = 32/48 (66%), Gaps = 1/48 (2%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
GLP +K ++L+ GY+T L+GKWHLG+ + +P ++GFD GF
Sbjct: 116 GLPPQQKTFVEHLQSAGYQTSLIGKWHLGT-RPSQVPTSKGFDRFFGF 162
>UniRef50_Q7UGB8 Cluster: Arylsulfatase homolog b1498; n=1;
Pirellula sp.|Rep: Arylsulfatase homolog b1498 -
Rhodopirellula baltica
Length = 656
Score = 92.3 bits (219), Expect = 2e-17
Identities = 79/256 (30%), Positives = 121/256 (47%), Gaps = 34/256 (13%)
Query: 18 EKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGS 77
E L + + GY T GKWH G+ + P +GF+ GF G ++YD +E+
Sbjct: 181 ETTLAELYRSAGYATGCFGKWHNGAQMPLH-PNGQGFNEFFGFCGGHFNLYDDALLERN- 238
Query: 78 WGTDFR-RGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPI 136
GT + +G Y TDV TD A++ + +H+ P F + +A P+ P
Sbjct: 239 -GTPVQTKG----------YITDVLTDAAVEFIQNHH-DRPFFCYVPFNA-----PHGPF 281
Query: 137 RAPQKLIDAFKYIDDSARQKFAAV---LSKLDESVGKVVKALHTRGLLENSIVVFSTDNG 193
+ + L D +Y D S +K AAV + +D +V +++K L L E +IVVF TDNG
Sbjct: 282 QVRRDLFD--RYNDGSIDEKTAAVYAMVQNIDTNVSRLLKCLSDHSLDEETIVVFLTDNG 339
Query: 194 GPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWLPTL 252
FN ++G K ++ EGG R F+ W+ + ++ ++ HI D LPTL
Sbjct: 340 PNGKRFNGG------MRGTKGSVHEGGCRVPCFIRWTGNIQPQS-ISQVAAHI-DLLPTL 391
Query: 253 YSAAGGDLSVLENLDG 268
L LDG
Sbjct: 392 MQWCDIPLPTKVPLDG 407
>UniRef50_A6DQ01 Cluster: N-acetylgalactosamine-4-sulfatase; n=2;
Lentisphaera araneosa HTCC2155|Rep:
N-acetylgalactosamine-4-sulfatase - Lentisphaera
araneosa HTCC2155
Length = 616
Score = 92.3 bits (219), Expect = 2e-17
Identities = 68/226 (30%), Positives = 114/226 (50%), Gaps = 23/226 (10%)
Query: 4 GVIYGAEPRGLPLNEKI-LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWT 62
GV + + R L +I + LKD GY T + GKWHLG Y P +RGF V
Sbjct: 86 GVWHTVQGRHLMREREITMANILKDNGYATGIFGKWHLGD-AYPYRPEDRGFTHVVTHGA 144
Query: 63 GRIDMYDHTTMEQGSWGTD-FRRGFEVAHDL--FGVYATDVYTDEAIKVVNSH-NKSEPL 118
G + WG D F + V + F + TDV+ DEA K + + +K +P
Sbjct: 145 GGVGQVPDY------WGNDYFNDTYYVNGEFVKFEGFCTDVWFDEAKKFMKTQISKKKPF 198
Query: 119 FLMLAHSAVHSGNPYEPIRAPQKLIDAFKY--IDDSARQKFAAVLSKLDESVGKVVKALH 176
F + +A P+ P+RAPQK +D + + + + F +++ +D++ G++ + L
Sbjct: 199 FTFITPNA-----PHGPMRAPQKYLDMYNQTKVKGTKLEAFFGMITNIDDNFGELREFLK 253
Query: 177 TRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVR 222
G+ +N++++F+TDNG ++G N + G KN+ ++GG R
Sbjct: 254 DEGVADNTLLIFTTDNGS-SSGI---GVYNAGMTGAKNSNFDGGHR 295
>UniRef50_A6C4W8 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
n=1; Planctomyces maris DSM 8797|Rep:
N-acetylgalactosamine 6-sulfate sulfatase - Planctomyces
maris DSM 8797
Length = 459
Score = 92.3 bits (219), Expect = 2e-17
Identities = 72/259 (27%), Positives = 122/259 (47%), Gaps = 13/259 (5%)
Query: 21 LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGT 80
+ + L+ GY+ VGKW LG N+GFD W G ++ DH +
Sbjct: 113 IAEVLQKSGYRCGGVGKWSLGDAGTVGRATNQGFD----MWFGYLNQ-DHAHYYFTEYLD 167
Query: 81 DFRRGFEVAHDLFG--VYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVH-SGNPYEPIR 137
D E+ + Y+ D+ T+ A++ + + ++P FL A++ H S +P
Sbjct: 168 DNEGRLELKGNTKNRQQYSHDLLTERALQFIRD-SAAQPFFLYAAYTLPHFSAKAEDPHG 226
Query: 138 APQKLIDAFKYID-DSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPA 196
+ + D D +K+AA++ +LD VG+++ ++ L E ++++F++DNGG
Sbjct: 227 LAVPDTEPYSDRDWDIKSKKYAAMIHRLDRDVGRIMSLVNELQLRERTLIIFTSDNGG-H 285
Query: 197 AGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAA 256
G +N PL+G K L EGG+R P +V+ + + D LPT A
Sbjct: 286 RGVPAQLHTNGPLRGFKRDLTEGGIRVPFIANWPGTIPAGKVSDEVIAFQDMLPTFAELA 345
Query: 257 GGDLSVLENLDGVNQWDAL 275
G +S NLDG++ AL
Sbjct: 346 GAQVSA--NLDGISVLPAL 362
>UniRef50_Q7UWW9 Cluster: Arylsulfatase; n=2; Planctomycetaceae|Rep:
Arylsulfatase - Rhodopirellula baltica
Length = 622
Score = 91.9 bits (218), Expect = 3e-17
Identities = 83/277 (29%), Positives = 128/277 (46%), Gaps = 25/277 (9%)
Query: 19 KILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSW 78
K + +D GY+T + GKWHLG + P +RGFD + F + I+
Sbjct: 118 KTMADVFQDAGYRTGIFGKWHLGD-NYPFRPEDRGFDETLWFPSSHINSVPDFWDNDYFD 176
Query: 79 GTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHSGNPY---E 134
T R G VAH Y TDV+ DEAI+ + ++ P F + ++ H P+ +
Sbjct: 177 DTYIRNGKRVAHS---GYCTDVFFDEAIEWAKQTSPTDSPFFAFIPLNSAHW--PWFVPD 231
Query: 135 PIRAPQKLI-----DAFKYIDDSARQ-----KFAAVLSKLDESVGKVVKALHTRGLLENS 184
RA + + + + +D + F A+ +D++VG + + L GL EN+
Sbjct: 232 QYRARVRTMLGDTTELKRQLDTTPSNLEDLISFLAMGLNIDDNVGTLTQYLDESGLSENT 291
Query: 185 IVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMH 244
IVVF TDNG + F D+ N ++G K LWEGG R + P + ++ H
Sbjct: 292 IVVFLTDNG---STFGDH-YFNAGMRGKKTQLWEGGHRVPCLIRWPEQITAQKID-DLTH 346
Query: 245 ISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTES 281
+ D LPTL + A D + LDG + L T+S
Sbjct: 347 VQDLLPTLAALADCDEHLPGPLDGTSLAPRLLGETDS 383
>UniRef50_Q7UTH7 Cluster: Arylsulfatase A; n=5; Bacteria|Rep:
Arylsulfatase A - Rhodopirellula baltica
Length = 496
Score = 91.1 bits (216), Expect = 6e-17
Identities = 73/261 (27%), Positives = 120/261 (45%), Gaps = 18/261 (6%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
+ L + + LK GY T + GKWHLG + Y P RGFD G I +
Sbjct: 110 MALTSTTIAEVLKSAGYTTGIFGKWHLGD-EDAYQPDRRGFDETFIHGAGGIGQ-NFAGS 167
Query: 74 EQGSWGTDFRRGFEVAHDLFGV---YATDVYTDEAIKVVNSHNKSE--PLFLMLAHSAVH 128
+ + GT + + F Y TDV+ +A+ + KS+ P F + +A
Sbjct: 168 QSDAPGTSYFNPIIKHNGTFVQTEGYCTDVFFQQALGWIRLQTKSDTKPFFAYIPTNA-- 225
Query: 129 SGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVF 188
P+ P + ++ D F+ S + +F ++ +D+++GK++ L L +N++++F
Sbjct: 226 ---PHAPYKVEKRYSDRFRDKCSSPQSEFLGMIVNIDDNMGKLMGKLDEWDLADNTLLIF 282
Query: 189 STDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSP-LLDSKARVAYQKMHISD 247
TDNG A G + N +KG K T+ EGG R F+ P +S + H+ D
Sbjct: 283 MTDNGS-AKG---SKIYNAGMKGGKGTVNEGGSRVPLFMRLPGFTNSGVDIETMTRHV-D 337
Query: 248 WLPTLYSAAGGDLSVLENLDG 268
PTL A ++ +LDG
Sbjct: 338 LFPTLAEIAHAEIPAEADLDG 358
>UniRef50_Q64WT3 Cluster: N-acetylgalactosamine-6-sulfatase; n=5;
Bacteria|Rep: N-acetylgalactosamine-6-sulfatase -
Bacteroides fragilis
Length = 509
Score = 91.1 bits (216), Expect = 6e-17
Identities = 85/277 (30%), Positives = 128/277 (46%), Gaps = 24/277 (8%)
Query: 12 RGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKE-YLPLNRGFDSHV-GFWTGRIDMYD 69
+GL + I P L+ GYKT VGK H G K E P N GFD ++ G G Y
Sbjct: 128 KGLTHQDMIYPYLLQQAGYKTIHVGKAHFGCLKSEGENPTNLGFDVNIAGSAIGHPGSYH 187
Query: 70 HTTMEQGSWGTDFRR--GFEVAHDLFGVYATDVYTDEAIKVVNSH-NKSEPLFLMLAHSA 126
G R E H + +D T EA K + + +P +L +AH A
Sbjct: 188 GENGYGWIKGQRARAVPDLEQYHKTH-TFLSDALTLEAGKEIEKAVAEKKPFYLNMAHYA 246
Query: 127 VHSGNPYEPIRAPQKLIDAFKYIDDSARQK-FAAVLSKLDESVGKVVKALHTRGLLENSI 185
VHS P ++ I + + S + + FA ++ +D+S+G ++ L G+ EN++
Sbjct: 247 VHS-----PFETDERFISHYTDPNKSQQARAFATLIEGMDKSLGDILDKLEDMGIAENTL 301
Query: 186 VVFSTDNGGPA-AGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WS-PLLDSKARVAY-- 240
++F DNGG A G + S+ P KG K + +EGGVR + W+ P ++K + AY
Sbjct: 302 IIFLGDNGGDAPLGDAADYGSSAPFKGKKGSEYEGGVRVPFIVSWAHPNPNNKFQKAYPI 361
Query: 241 -------QKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
Q + D PT+ S AG + LDG +
Sbjct: 362 ARNAIQTQMGTVMDIYPTVLSVAGVKPAPNHILDGAD 398
>UniRef50_A6DFR6 Cluster: N-acetylgalactosamine-4-sulfatase; n=1;
Lentisphaera araneosa HTCC2155|Rep:
N-acetylgalactosamine-4-sulfatase - Lentisphaera
araneosa HTCC2155
Length = 573
Score = 91.1 bits (216), Expect = 6e-17
Identities = 86/302 (28%), Positives = 138/302 (45%), Gaps = 23/302 (7%)
Query: 17 NEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQG 76
+EK + + GYKT +VGKWHLG Y P +RGF G I
Sbjct: 99 DEKTIADHFVAAGYKTGMVGKWHLGD-NAPYRPEDRGFQDVFRIGGGSIGQLPDYWKNDL 157
Query: 77 SWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPI 136
G + +G V F TDV D A+ V NK P FL ++ +A HS P
Sbjct: 158 WDGHYWNKGQWVKTKGF---CTDVQFDYALDFV-EENKKSPFFLFISTTAPHS-----PT 208
Query: 137 RAPQKLIDAFKYID-DSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGP 195
A +K ++ ++ + D F +++ +D+++G++ L L EN+I++FS+DNG
Sbjct: 209 GADKKYLEPYEKLGLDKGICAFYGMVTNIDDNIGRLRNKLRELKLEENTILIFSSDNGSA 268
Query: 196 AAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSP---LLDSKARVAYQKMHISDWLPTL 252
D + N ++G K +L+EGG R FL+ P + K ++ HI D LPTL
Sbjct: 269 CDKKGD--SFNGGMQGKKGSLYEGGHRVPCFLYWPKGGWIGGK-QLDQVTAHI-DILPTL 324
Query: 253 YSAAGGDLSVLENLDGVNQWDALSKNTES-PRTSVLHN----IDDIWGIAALTVDKYKLI 307
A + + DG+ ++K + R + N D + + + D+++LI
Sbjct: 325 LKACAIENPLNTAFDGIELNGIIAKPAQKLSRLLITENKANKRDQEFQNSVVLTDEWRLI 384
Query: 308 KG 309
G
Sbjct: 385 DG 386
>UniRef50_A4AAM5 Cluster: Sulfatase; n=1; Congregibacter litoralis
KT71|Rep: Sulfatase - Congregibacter litoralis KT71
Length = 500
Score = 91.1 bits (216), Expect = 6e-17
Identities = 84/287 (29%), Positives = 133/287 (46%), Gaps = 41/287 (14%)
Query: 18 EKILPQYLKDLGYKTHLVGKWHL--GSYKKEY-LPLNRGFDSHVGF--WTGRIDMYDHTT 72
E L K GY+T ++GKWHL G + ++ P + GFD G W + D T
Sbjct: 128 ETTLADLAKARGYRTAVIGKWHLNGGLHMRDVPQPRDFGFDYQYGLAAWVKNASVADSTE 187
Query: 73 MEQGSWGTDFRRGFEVAHDLFGV---YATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHS 129
+ + G F ++ GV Y+ ++ +DEAI + + S+P FL+L +S VH+
Sbjct: 188 LPRR--GPMFPDNMYRNNEPVGVTDKYSAELVSDEAIGWLQA--SSDPFFLLLTYSEVHT 243
Query: 130 GNPYEPIRAPQKLIDAFK-YIDDSA------------------RQKFAAVLSKLDESVGK 170
PI +P +DA++ Y+ D A R ++ A +S LD +G+
Sbjct: 244 -----PIASPPAYLDAYREYLSDEAKHNPFLYYFDWRNRPWRGRGEYYANISFLDAQLGR 298
Query: 171 VVKALHTRGLLENSIVVFSTDNGGPA-AGFN----DNAASNYPLKGVKNTLWEGGVRGAG 225
V+ L + +L+N+++VFS+DNG A A L+G K L+EGG+R G
Sbjct: 299 VIGHLRDQKILDNTLIVFSSDNGPVTDAALTPWELGMAGETGGLRGKKRFLFEGGIRVPG 358
Query: 226 FLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQW 272
+ P RV ++ + D PTL D+ LDG + W
Sbjct: 359 IIRYPHRIEAGRVEHRAVTALDIFPTLAEWLDVDVEPRVPLDGQSLW 405
>UniRef50_A6BZT7 Cluster: Putative arylsulfatase; n=1; Planctomyces
maris DSM 8797|Rep: Putative arylsulfatase -
Planctomyces maris DSM 8797
Length = 459
Score = 90.6 bits (215), Expect = 7e-17
Identities = 75/280 (26%), Positives = 126/280 (45%), Gaps = 18/280 (6%)
Query: 23 QYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGTDF 82
+ LK GY T GKW LG P +GFD G + ++ H W +
Sbjct: 101 EVLKIAGYATGAFGKWGLGYEGTPGRPGQQGFDDFTG---QLLQVHAHFYYPFWIWNNEH 157
Query: 83 RRGF-EVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVH--------SGNPY 133
R E ++ G Y D+ ++A K NK++P F L + H S PY
Sbjct: 158 RLMLPENENNQRGRYIHDLIHEDA-KAFIQKNKAQPFFAYLPYIIPHVELVVPEESEKPY 216
Query: 134 EPIRAPQKLIDAFK-YI-DDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
++++D YI + FA ++S+LD+ VG++V L G+ +N++++F++D
Sbjct: 217 RGQFPKKQILDPRPGYIGSEDGLTTFAGMVSRLDDHVGEIVTLLEDLGIRDNTLIIFTSD 276
Query: 192 NGGPAAGF---NDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDW 248
NGG + D N PL+G K +++EGG+R P + + + ++ D
Sbjct: 277 NGGQGGTWKEMTDFFNGNAPLRGHKGSMYEGGIRVPFIANWPGKIAAGKTSDLQIAFWDV 336
Query: 249 LPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLH 288
LPTL AG + ++DG++ L + P L+
Sbjct: 337 LPTLAQVAGTTVPSGVDIDGISFLPTLLGKGKQPEHEYLY 376
>UniRef50_Q7UX95 Cluster: Arylsulfatase; n=3; Planctomycetaceae|Rep:
Arylsulfatase - Rhodopirellula baltica
Length = 538
Score = 90.2 bits (214), Expect = 1e-16
Identities = 79/282 (28%), Positives = 137/282 (48%), Gaps = 37/282 (13%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
LP++E + +YLK +GY+T GKW LG + P +GFD GF + H
Sbjct: 166 LPVDEVTIAEYLKSVGYRTGAFGKWGLGHFGTTGDPNEQGFDLFYGF---NCQRHAHNHY 222
Query: 74 EQGSWGTDFRRGFEVAHD--LFG-VYATDVYTDEAIKVVN---SHNKSEPLFLMLAHSAV 127
W + + +D L G Y+ D + +EA + + + +K++P F L +
Sbjct: 223 PNFLWRNRVKE-VQPGNDRTLHGETYSQDQFVNEACEFIRQSVAEDKTQPFFAYLPFAV- 280
Query: 128 HSGNPYEPIRAPQKLIDAFKYIDDSA-------------RQKFAAVLSKLDESVGKVVKA 174
P+ I+ P++ +DA+ + + A R +AA+++++DE VG+VV
Sbjct: 281 ----PHLSIQVPEEEVDAYDGVIEEADYEHHGYLKHPRPRAGYAAMVTRMDEGVGQVVDL 336
Query: 175 LHTRGLLENSIVVFSTDNGG--PAAGFND----NAASNYPLKGVKNTLWEGGVRGAGFLW 228
+ + GL EN++++F++DNG G +D N+AS +KG+K L EGG+R
Sbjct: 337 VDSLGLGENTLIMFTSDNGPTYDRLGGSDSDYFNSASG--MKGLKGQLDEGGIRVPMIAR 394
Query: 229 SPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
+ R + D+LPT+ AAG ++ DG++
Sbjct: 395 QTGVVPAGRTSDWIGAWWDFLPTITDAAGVEVDA-STTDGIS 435
>UniRef50_A6LED2 Cluster: Arylsulfatase A; n=1; Parabacteroides
distasonis ATCC 8503|Rep: Arylsulfatase A -
Parabacteroides distasonis (strain ATCC 8503 / DSM 20701
/ NCTC11152)
Length = 468
Score = 89.8 bits (213), Expect = 1e-16
Identities = 76/282 (26%), Positives = 130/282 (46%), Gaps = 28/282 (9%)
Query: 5 VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGR 64
V++ A +GL E + + +K+ GY T +GKWHLG + +LP +GFD + G
Sbjct: 108 VLFPASHKGLNPGEITIAELMKEQGYATACIGKWHLGD-QLPFLPTRQGFDYYYGIPYSN 166
Query: 65 IDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAH 124
DM D + V HD + YT++ ++ + SH +S P F+ L H
Sbjct: 167 -DM-DRPYCPLPLMEQEEVIVAPVGHDSLTIR----YTNKTVEFIKSHKES-PFFIYLCH 219
Query: 125 SAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENS 184
+ H+ P+ A AFK S + +LD S+G +++ L GL +N+
Sbjct: 220 NMTHN-----PLAASP----AFK--GKSQNGLYGDATEELDWSMGVLLETLKEEGLDQNT 268
Query: 185 IVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMH 244
+++F++DNG +N PL+G K T +EGG R + P + +
Sbjct: 269 LIIFTSDNGAD----EHFGGTNRPLRGQKGTTYEGGFRVPCIMRWPAKIPAGQETDNLVT 324
Query: 245 ISDWLPTL-----YSAAGGDLSVLENLDGVNQWDALSKNTES 281
D+LPTL Y+ + N+ G+ + ++++ TE+
Sbjct: 325 SMDFLPTLAHYCSYAVPSDRVIDGHNVSGILEGESMASPTET 366
>UniRef50_A4AQQ7 Cluster: N-acetylgalactosamine 6-sulfatase; n=4;
Bacteria|Rep: N-acetylgalactosamine 6-sulfatase -
Flavobacteriales bacterium HTCC2170
Length = 596
Score = 89.8 bits (213), Expect = 1e-16
Identities = 85/312 (27%), Positives = 142/312 (45%), Gaps = 34/312 (10%)
Query: 6 IYGAEPRGLPLNEK--ILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTG 63
+Y G N K + + K GYKT GKWH G + Y P +RGFD + GF +G
Sbjct: 102 VYSTSTGGERFNSKETTIAEIFKKAGYKTTAYGKWHSGM-QPPYHPNSRGFDDYYGFTSG 160
Query: 64 RIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLA 123
Y +E G V + F V D T++ + + + NK+ P FL L
Sbjct: 161 HWGNYFSPMLEHN--------GEIVKGEGFLV---DDLTNKGLDFI-TENKNNPFFLYLP 208
Query: 124 HSAVHSG----NPYEPIRAPQKLIDAFKYIDDSARQKFA----AVLSKLDESVGKVVKAL 175
++ HS N Y R +K +D ++ + F A++ +D ++G++ L
Sbjct: 209 YNTPHSPMQVPNEYWE-RFEKKKLDMRYQGNEEESENFTRAALAMVENIDFNMGRLTNKL 267
Query: 176 HTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDS 234
GL EN+I+V+ +DNG +N ++G K + EGGVR F+ W +
Sbjct: 268 KELGLEENTIIVYLSDNGPNGWRWNGG------MRGRKGSTDEGGVRSPFFIQWKNTIPK 321
Query: 235 KARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIW 294
+++ Q D LPTL S AG + ++++DG + ++ ++P H ++
Sbjct: 322 NKKIS-QIAGAIDILPTLTSLAGINQPTIKSIDGKDLKTLIA--DKNPTWESRHIVNHWR 378
Query: 295 GIAALTVDKYKL 306
G ++ KY+L
Sbjct: 379 GKTSIRTQKYRL 390
>UniRef50_A6DHI0 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
n=1; Lentisphaera araneosa HTCC2155|Rep:
N-acetylgalactosamine 6-sulfate sulfatase - Lentisphaera
araneosa HTCC2155
Length = 456
Score = 89.4 bits (212), Expect = 2e-16
Identities = 81/282 (28%), Positives = 126/282 (44%), Gaps = 26/282 (9%)
Query: 8 GAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDM 67
G EP +P + + +K+ GY T L+GKW LG E P +GFD G+
Sbjct: 95 GQEP--IPAETITVAEKMKEAGYATALIGKWGLGYPGSEGEPNKQGFDYFFGY---NDQK 149
Query: 68 YDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAV 127
+ H + + + Y+ + TDEA + NK P FL LA+
Sbjct: 150 HAHNHFPKFLLRNEETLTLKNNSGKEIEYSQYMLTDEAKGFIKK-NKDNPFFLYLAYVIP 208
Query: 128 HSGNPYEPIRAPQKLIDAFKYIDDS---ARQKFAAVLSKLDESVGKVVKALHTRGLLENS 184
HS ++ P +Y D+S ++K A ++S+LD+ VG ++ L L EN+
Sbjct: 209 HSR-----LQIPGDDECYLQYKDESWPEKQKKHAGMISRLDKDVGSILDLLKEMNLAENT 263
Query: 185 IVVFSTDNGGPAAG------FNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARV 238
+VVF++DNG G FND+ PL G+K +++EGGVR P + +V
Sbjct: 264 LVVFTSDNGAHREGGARPEFFNDSG----PLSGIKRSMYEGGVRVPFIAHWPGVIKPGQV 319
Query: 239 AYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTE 280
+ D +PT G + E +DG++ L N E
Sbjct: 320 SNHIGAHWDLMPTACELGG--VQPPEGIDGISYVPLLKGNME 359
>UniRef50_UPI00005887B4 Cluster: PREDICTED: similar to galactosamine
(N-acetyl)-6-sulfate sulfatase; n=1; Strongylocentrotus
purpuratus|Rep: PREDICTED: similar to galactosamine
(N-acetyl)-6-sulfate sulfatase - Strongylocentrotus
purpuratus
Length = 465
Score = 89.0 bits (211), Expect = 2e-16
Identities = 73/245 (29%), Positives = 111/245 (45%), Gaps = 31/245 (12%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
G+P +E +LP+ LK GYK+ +VGKWHLG + +YLPL GFD G I +
Sbjct: 81 GIPDSEILLPKLLKLSGYKSKIVGKWHLG-HLPQYLPLKHGFDEWFGAPNCHIKSLPNIP 139
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVN-SHNKSEPLFLMLAHSAVHSGN 131
+ + S ++ G Y + E + + S +P FL A H
Sbjct: 140 VYRDS-------------EMIGRY----FEQEGLNFIEKSAEAKQPFFLYWTPDATH--- 179
Query: 132 PYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
EP+ A + ++ S R + + +LDE VG+++ L + N+ VVF++D
Sbjct: 180 --EPVYASKP------FLGRSQRGLYGDAVIELDEGVGQILGKLKELQIDTNTFVVFTSD 231
Query: 192 NGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPT 251
NG A +N +N P K T +EGG+R W P RV +Q +I D T
Sbjct: 232 NGA-ATYAKENGGTNGPYLCGKRTTYEGGMRVPTIAWWPTHIKPGRVTHQIGNIMDLFTT 290
Query: 252 LYSAA 256
+ A
Sbjct: 291 ALNLA 295
>UniRef50_Q7UG72 Cluster: Arylsulfatase A [precursor]; n=1;
Pirellula sp.|Rep: Arylsulfatase A [precursor] -
Rhodopirellula baltica
Length = 503
Score = 89.0 bits (211), Expect = 2e-16
Identities = 85/338 (25%), Positives = 138/338 (40%), Gaps = 36/338 (10%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF------WT---G 63
GL E + K GY+T GKWHLG + K +LP N+GFD G W
Sbjct: 110 GLAPAETTFAEVCKSAGYRTACHGKWHLGHHPK-FLPTNQGFDQFYGIPYSNDMWPLHPD 168
Query: 64 RIDMYDHTTMEQGSW----------GTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHN 113
I + G+W G R + T T +++ + + +
Sbjct: 169 TIRRQQKDPNDPGNWPPLPIIESIAGQPPRIVNDNVQPADQEQMTVELTRRSVEFIKNQS 228
Query: 114 KSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVK 173
+P L L H VH P + + F+ S F V+ ++D SVG+++
Sbjct: 229 SDKPFLLYLPHPMVH---------VPLYVSERFR--GKSGAGLFGDVMMEVDWSVGEILS 277
Query: 174 ALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLD 233
A+ + +N++V+F++DNG P + ++A S PL+ K T WEGGVR +W P
Sbjct: 278 AIESIDQQKNTLVIFTSDNG-PWLSYGNHAGSAAPLREGKGTQWEGGVREPTLMWWPETI 336
Query: 234 SKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDAL--SKNTESPRTSVLHNID 291
D LPT+ GG+ + +DG + D + +SP S +
Sbjct: 337 PAGTTCETFCSTIDVLPTIVELTGGE-APERKIDGHSIVDLMLDVPGAKSPHESFVGYYG 395
Query: 292 DIWGIAALTVDKYKLIKGTIYKGVWDNWYGPSGREGAY 329
+ + +++KL+ Y+ + D G G Y
Sbjct: 396 G-GQLQTIRNERFKLVFPHAYRTLGDREPGKDGMPDGY 432
>UniRef50_A6PEH5 Cluster: Sulfatase precursor; n=1; Shewanella
sediminis HAW-EB3|Rep: Sulfatase precursor - Shewanella
sediminis HAW-EB3
Length = 517
Score = 89.0 bits (211), Expect = 2e-16
Identities = 89/331 (26%), Positives = 137/331 (41%), Gaps = 41/331 (12%)
Query: 12 RGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHT 71
RGL + L + LKD GY T VGK HLG ++LP GFD GF M H
Sbjct: 104 RGLQKEDPTLAEMLKDKGYATVHVGKSHLGD-NNDHLPTVHGFDEFYGFLYHLNVMEMHE 162
Query: 72 TMEQGSWGTDFRRGFEVAHDL------------FGVYATDVYTDEAIKVVNSHNKSEPLF 119
E RG + H + FGV +D+ + F
Sbjct: 163 QPEFPKDPNFKGRGRNMIHTVATDKFDDTVDPRFGVIGKQTISDQGELGAKRMQTVDGEF 222
Query: 120 LMLA------HSAVHSGNPY----EPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVG 169
L A H A + PY P R QK +Y S + L +LD+ +G
Sbjct: 223 LDFAINWLEKHEATNDDQPYFMWYNPTRMHQKTHVRPEYQGASQHNTYYDGLVELDDQIG 282
Query: 170 KVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWS 229
++ L G ++N+I++F++DNG + D+ A+++ +G K T W+GG R +
Sbjct: 283 VLLDKLEATGEIDNTIILFTSDNGVNLDHWPDSGAASF--RGQKGTTWDGGFRVPMLVSW 340
Query: 230 PLLDSKARVAYQKMHISDWLPTLYSAAG--------------GDLSVLENLDGVNQWDAL 275
P + M DW+PT+ +AAG D + ++DG NQ D L
Sbjct: 341 PAKIPQGEYTDGLMSAEDWVPTIMAAAGDADIKQDLLTGKKINDETYKVHIDGYNQLDML 400
Query: 276 SKNTESPRTSVLHNIDDIWGIAALTVDKYKL 306
++ +S R ++ + A VD++K+
Sbjct: 401 TEGGKSNRHEFFFYNEN--SLNAFRVDEWKV 429
>UniRef50_UPI00005846A1 Cluster: PREDICTED: similar to
arylsulfatase; n=1; Strongylocentrotus purpuratus|Rep:
PREDICTED: similar to arylsulfatase - Strongylocentrotus
purpuratus
Length = 552
Score = 88.6 bits (210), Expect = 3e-16
Identities = 77/290 (26%), Positives = 125/290 (43%), Gaps = 24/290 (8%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKE-----YLPLNRGFD--SHVGFWTGRI 65
GLP E + + LK+ GY T + GKWHLG + +LP++ GFD H+ +T +
Sbjct: 142 GLPSTELTIAEALKEEGYTTGMAGKWHLGLNSETRDDGVHLPMHHGFDFVGHILPFTNSM 201
Query: 66 DMYDHTTMEQGSWGTD---FRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLML 122
D T ++R VA Y T + ++A+ + N +P F
Sbjct: 202 ACDDTGRFVDFPDVTKCFLYKRDQIVAQPFNHTYLTQTFVNDAVSFIED-NAHDPFFFYF 260
Query: 123 AHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLE 182
S +P+ P+ A ++ S R ++ ++++ +VG+V+ AL +GL +
Sbjct: 261 PFS-----HPHVPLYASP------RFAGKSQRGEYGDNINEMSWAVGEVIDALEAKGLSQ 309
Query: 183 NSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQK 242
N++V+F D+ GP + + KG K WEGG+R + P R +
Sbjct: 310 NTLVLFLADH-GPQPEYCAHGGDPSIFKGYKTNTWEGGIRVPFVAYWP-GQITPRESDAL 367
Query: 243 MHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDD 292
+ D + T+ A G L DG D L KN SP + H D
Sbjct: 368 VSTLDIMRTVVDLANGTLPDDTAYDGEVITDVLLKNAPSPHDVLYHYCKD 417
>UniRef50_Q7UYH3 Cluster: Arylsulfatase; n=1; Pirellula sp.|Rep:
Arylsulfatase - Rhodopirellula baltica
Length = 598
Score = 88.6 bits (210), Expect = 3e-16
Identities = 79/273 (28%), Positives = 127/273 (46%), Gaps = 36/273 (13%)
Query: 17 NEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQG 76
+E L + L + GY+T + GKWHLG P+++GFD + G I +G
Sbjct: 114 DEVTLAERLSEAGYQTGIFGKWHLGD-NYPMRPMDQGFDESLIHRGGGIGQPSDPIGAEG 172
Query: 77 SW--GTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKS-EPLFLMLAHSAVHSGNPY 133
+ T F G EVA + Y TD++ D AI +S +P F +A +A H P+
Sbjct: 173 KYTDPTLFHNGDEVAME---GYCTDIFFDAAIDFARKQTESGKPFFTYIATNAPH--GPF 227
Query: 134 EPIRAPQKLIDAFKYID--------------DSARQKFA---AVLSKLDESVGKVVKALH 176
+ + P +L + +K +D D+ K A A+++ +D++VGK+ +L
Sbjct: 228 DDV--PNELYEEYKQVDFTPILVSDLPAKRRDAEFDKLARISAMITNIDQNVGKLFASLD 285
Query: 177 TRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRG-AGFLWSPLLDSK 235
+ EN+IV++ DNG + + N ++G K + +GG+R F W +D+
Sbjct: 286 ELKIRENTIVLYLNDNGPNSRRYVGN------MRGNKTQVDDGGIRSPLLFHWPAKVDAS 339
Query: 236 ARVAYQKMHISDWLPTLYSAAGGDLSVLENLDG 268
HI D +PTL A G S LDG
Sbjct: 340 DTTDVMLAHI-DLMPTLLDACGVAASESPALDG 371
>UniRef50_A6DGD3 Cluster: Putative exported uslfatase; n=3;
Bacteria|Rep: Putative exported uslfatase - Lentisphaera
araneosa HTCC2155
Length = 713
Score = 88.6 bits (210), Expect = 3e-16
Identities = 85/324 (26%), Positives = 144/324 (44%), Gaps = 35/324 (10%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSY---KKEYLPLNRGFDSHV-GFWTGRIDMYD 69
+PL + L + LK++GYKT +GKWHL ++ + + P GFD ++ G G+ +
Sbjct: 331 MPLEDITLAEALKEVGYKTAHIGKWHLQAHHDTSRNHFPEKHGFDLNIAGHRMGQPGSFY 390
Query: 70 HTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHS 129
+ T+ ++A G Y TD TD+AI + NK P FL + VH+
Sbjct: 391 FPYKSKQHPSTNVP---DMADGQEGDYLTDKLTDKAIHYI-KENKDTPFFLNFWYYTVHT 446
Query: 130 --------GNPYEP------IRAPQKLIDAFKYIDDSARQ--KFAAVLSKLDESVGKVVK 173
YE I Q I K S++ +AA++ +DE++G++ K
Sbjct: 447 PIIPRQDLKKKYEAKANELGINKNQPGIPVLKSFARSSQNNPSYAAMVEAMDENIGRIFK 506
Query: 174 ALHTRGLLENSIVVFSTDNGGPAAGFNDNA-ASNYPLKGVKNTLWEGGVRGAGFL-WSPL 231
L + + +I++F +DNGG + N S PLK K ++EGG+R + W
Sbjct: 507 TLKELQIDDETIIIFCSDNGGLSTSTGPNCPTSQLPLKAGKAWVYEGGIRIPFIIKWPGK 566
Query: 232 LDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVL---- 287
K A + +D PTL ++LDGV+ ++ + + L
Sbjct: 567 KGGKELQA--PVCTTDIYPTLLDMLKLPAKPEQHLDGVSLTSLMNGQAKELQREALFIHY 624
Query: 288 ---HNIDDIWGIAALTVDKYKLIK 308
H+I+ + A+ + YKL++
Sbjct: 625 PHYHHINSMGPAGAVRMGDYKLVE 648
>UniRef50_A6C6V5 Cluster: N-acetylgalactosamine-6-sulfatase; n=1;
Planctomyces maris DSM 8797|Rep:
N-acetylgalactosamine-6-sulfatase - Planctomyces maris
DSM 8797
Length = 520
Score = 88.6 bits (210), Expect = 3e-16
Identities = 70/220 (31%), Positives = 107/220 (48%), Gaps = 19/220 (8%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYL-PLNRGFDSHV-GFWTGRIDMYDH 70
GL ++ LP+ L+ GY+T VGK H G+ PLN GFD ++ G G Y
Sbjct: 139 GLKKDDVTLPRLLEKAGYRTIHVGKGHFGADGFPGAEPLNLGFDVNIAGSSFGAPGSYHG 198
Query: 71 TTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE------PLFLMLAH 124
M++ GT RR + L + TD++ EA+ + + +E P FL +AH
Sbjct: 199 --MKKFGLGT--RRAHQAVPHLEKYHDTDIFLTEALTIEANATLAETVKADQPFFLYMAH 254
Query: 125 SAVHSGNPYEPIRAPQKLIDAFKYIDDSAR-QKFAAVLSKLDESVGKVVKALHTRGLLEN 183
AVH+ P + + D +K D Q FA ++ +D+S+G ++ L G+ EN
Sbjct: 255 YAVHA-----PFDSDPRFADHYKDSDKPKNAQAFATLIEGMDKSLGDIMNQLDQLGVAEN 309
Query: 184 SIVVFSTDNGGPA-AGFNDNAASNYPLKGVKNTLWEGGVR 222
+++ F DNG A G A PL+G K +EGG+R
Sbjct: 310 TLIFFLGDNGSDAPLGHQHAVACAAPLRGKKGAHYEGGMR 349
>UniRef50_Q7UMZ6 Cluster: Arylsulfatase A; n=1; Pirellula sp.|Rep:
Arylsulfatase A - Rhodopirellula baltica
Length = 492
Score = 87.8 bits (208), Expect = 5e-16
Identities = 81/271 (29%), Positives = 119/271 (43%), Gaps = 21/271 (7%)
Query: 5 VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGR 64
V+ P GL +E + + LK Y T LVGKWHLG + E+LP ++GFD G
Sbjct: 104 VLRPVSPYGLHPDEITIAEVLKQQNYATALVGKWHLGD-QPEFLPTHQGFDWFFGV-PYS 161
Query: 65 IDMYDHTTMEQGS-WG----TDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLF 119
DM + + GS W + E + G+ T YT+ A++ + H K EP F
Sbjct: 162 DDMTERIWKQDGSHWPPLPLMENETVIEAPCNRDGL--TKRYTERAMQWIAEH-KDEPFF 218
Query: 120 LMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRG 179
L + G+ P + DAF+ S + + +LD S+G+++ L G
Sbjct: 219 LYFPQAM--PGSTKTPFSS-----DAFR--GKSRNGPWGDAVEELDWSIGQMLDQLVKLG 269
Query: 180 LLENSIVVFSTDNGGPAAGFNDNAA--SNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKAR 237
+ E + V++++DNG P D+ + SN PL G T EG R W P
Sbjct: 270 IAEKTFVIWTSDNGAPINRDPDDLSRGSNLPLHGRGYTTSEGAFRVPTIAWHPGKVPAGT 329
Query: 238 VAYQKMHISDWLPTLYSAAGGDLSVLENLDG 268
+ D LPT + AG L LDG
Sbjct: 330 QCDELATTMDLLPTFANLAGCKLPTNRKLDG 360
>UniRef50_Q7UER7 Cluster: Sulfatase 1; n=6; Bacteria|Rep: Sulfatase
1 - Rhodopirellula baltica
Length = 553
Score = 87.8 bits (208), Expect = 5e-16
Identities = 73/267 (27%), Positives = 122/267 (45%), Gaps = 22/267 (8%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWT------GRIDM 67
+P + LP+ L++ GYKT GKWHLG + +P + GFD ++G G
Sbjct: 151 MPAKDVTLPEALRESGYKTFFAGKWHLGG--EGSMPTDHGFDINIGGHHRGSPPGGFFAP 208
Query: 68 YDHTTMEQGSWGTDFRR--GFEVAHDLFGVYATDVYTDEAIKVVNSH-NKSEPLFLMLAH 124
+ + ME G G R G E A + G + + V+ ++ L+
Sbjct: 209 FKNPVMEDGPDGESLTRRLGKETASFIEGQDDQPYFAMLSFYAVHGPIQTTQELWQKYRE 268
Query: 125 SAVHSGNPYEPIRAPQKLIDA---FKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLL 181
SA P P + ID + I D+ +A ++ LD +VG V+ A+ G
Sbjct: 269 SA-----PAPPADGNRFKIDRTLPVRQIQDNP--VYAGMMETLDNAVGDVMAAIEASGKA 321
Query: 182 ENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQ 241
+N++V+F+ DNGG ++G + + SN P +G K WEGG+R ++ P + + +
Sbjct: 322 DNTLVIFTGDNGGVSSG-DAYSTSNLPHRGGKGRQWEGGLREPYYVSMPAIVPENSTSDV 380
Query: 242 KMHISDWLPTLYSAAGGDLSVLENLDG 268
+ SD PT+ L +++DG
Sbjct: 381 PVIGSDLYPTILDVCNLPLRPQQHIDG 407
>UniRef50_A6DSG4 Cluster: Arylsulphatase A; n=1; Lentisphaera
araneosa HTCC2155|Rep: Arylsulphatase A - Lentisphaera
araneosa HTCC2155
Length = 489
Score = 87.8 bits (208), Expect = 5e-16
Identities = 85/273 (31%), Positives = 120/273 (43%), Gaps = 24/273 (8%)
Query: 11 PRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDH 70
P GL +E LP+ +K GY T LVGKWHLG +K + PLN G+D GF
Sbjct: 106 PIGLNPSEITLPELMKTAGYNTALVGKWHLGEWKP-FHPLNHGYDYFYGFLK-------- 156
Query: 71 TTMEQGSWGTDFRRGFEVAHDLFGVYA-TDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHS 129
+E + E+A + AI + H K+ P FL+ + H+
Sbjct: 157 -VIEGSEKPSLIENRKELASKIQKTEGQAPGMVKAAINFMTKHKKN-PFFLVYSDPMPHA 214
Query: 130 GNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFS 189
PY P + FK S R + V+ ++D ++ AL GL EN+IVVF+
Sbjct: 215 --PYFPS-------EQFK--GTSKRGNYGEVIHEIDWQFKHLMDALDELGLKENTIVVFT 263
Query: 190 TDNGGPAAGFND-NAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDW 248
+DNG P + + PL+ K T +EGGVR + P + + I D
Sbjct: 264 SDNGPPVERQKKYDVGLSGPLRDGKWTNFEGGVRVPFIIRWPGKVKVDASSDAMIGIIDM 323
Query: 249 LPTLYSAAGGDLSVLENLDGVNQWDALSKNTES 281
LPT AG D+ +DGVN L + ES
Sbjct: 324 LPTFCELAGVDVPNDRVIDGVNILPQLLGDQES 356
>UniRef50_A6DID9 Cluster: Putative sulfatase protein; n=1;
Lentisphaera araneosa HTCC2155|Rep: Putative sulfatase
protein - Lentisphaera araneosa HTCC2155
Length = 483
Score = 87.8 bits (208), Expect = 5e-16
Identities = 82/280 (29%), Positives = 129/280 (46%), Gaps = 44/280 (15%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLG-SYKKEYLPLNRGFDSHVGFWTGRIDMYDHT 71
G+ + +LP LK+ GY+T +GK H G + + PLN GFD H
Sbjct: 129 GIQQGDILLPALLKETGYRTICIGKAHFGMGFSAD--PLNLGFDRK------------HY 174
Query: 72 TMEQGS-WGTDF--RRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAV 127
E GS G F R + V D V+ ++ T EA K ++ K E P FL L+H A+
Sbjct: 175 ANESGSPIGRRFGGRDPYHVKRDGEQVHLSEALTLEAKKEISDAVKEEKPFFLYLSHYAI 234
Query: 128 HSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVV 187
H+ PI ++ + +D R + ++ D+S+G V+ + G+ E+++ +
Sbjct: 235 HT-----PIIEDKRFSKNYPNLDTKIRA-YVTLVEGADKSLGDVMDHIEKLGIAEDTLFI 288
Query: 188 FSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSK----------A 236
++ DNGG SN P+KG+KN +EGG R + W +++
Sbjct: 289 WTADNGG--------LRSNAPMKGLKNDAYEGGHRIPNMVAWGAQDETRVHQKRMPLKPG 340
Query: 237 RVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALS 276
RV + DW+PTL S AG + LDG + + LS
Sbjct: 341 RVENRPYIHQDWMPTLLSLAGAQHPKPDLLDGYDITELLS 380
>UniRef50_A6CAR8 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
n=1; Planctomyces maris DSM 8797|Rep:
N-acetylgalactosamine 6-sulfate sulfatase - Planctomyces
maris DSM 8797
Length = 501
Score = 87.4 bits (207), Expect = 7e-16
Identities = 82/302 (27%), Positives = 128/302 (42%), Gaps = 34/302 (11%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
G+ + EK+LP LK GY + + GKW LG +K+ +LPL RGFD GF ID + H
Sbjct: 133 GMDVREKLLPALLKPAGYVSAIYGKWDLGIHKR-FLPLARGFDDFYGFTNTGIDYFTH-- 189
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
E+ + +R D G Y T ++ EA++ + N +P FL L +A H +
Sbjct: 190 -ERYGVPSMYRNNQPTEEDK-GTYCTYLFQREAVRFI-KENHQKPFFLYLPFNAPHGASS 246
Query: 133 YEP-----IRAPQKLIDAFKYIDDS-ARQKFAAVLSKLDESVGKVVK---ALHTRGLLEN 183
+P +AP+K + + ++ D+ +K + G V+ + R L
Sbjct: 247 LDPRIRGGAQAPEKYKNMYPHLKDTLVTKKKTGRYEFRERPDGPVIHQGVSASKRRLEYV 306
Query: 184 SIVVFSTDNGGPAAG---------------FNDN----AASNYPLKGVKNTLWEGGVRGA 224
+ + D G G F+DN A N PLKG K ++EGG+R
Sbjct: 307 ASITCMDDAIGEVLGLLDEYQIADNTIVVFFSDNGGSGGADNSPLKGKKGMMFEGGIRVP 366
Query: 225 GFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRT 284
+ P V + + + +PT A L +DG + L T SPR
Sbjct: 367 CLVRYPAKIKPGTVNDELLTSLELVPTFLKEAAIPLPENVVIDGYDMLPVLMGKTTSPRN 426
Query: 285 SV 286
+
Sbjct: 427 EM 428
>UniRef50_A3I2G9 Cluster: Putative secreted sulfatase; n=1;
Algoriphagus sp. PR1|Rep: Putative secreted sulfatase -
Algoriphagus sp. PR1
Length = 512
Score = 87.4 bits (207), Expect = 7e-16
Identities = 78/275 (28%), Positives = 127/275 (46%), Gaps = 23/275 (8%)
Query: 18 EKILPQYLKDLGYKTHLVGKWH---LGSYKKEYLPLNRGFDSHV---GFWTGRIDMYDHT 71
E +LP LK GY+T + GK+H L K P GFD ++ GF + Y
Sbjct: 127 ENMLPAMLKKQGYRTIISGKYHACDLCPEDKSPTPEAAGFDVNIAGTGFGAPK-SYYGID 185
Query: 72 TMEQGSWGTDFRRGFEVAHDLFG--VYATDVYTDEAIKVVN-SHNKSEPLFLMLAHSAVH 128
+ ++ + T G E FG ++ T+ T EA+K + +K +P FL L+H AVH
Sbjct: 186 SFQRKNTETQPMPGLE---SYFGKEIHLTEALTIEALKASKVAVDKGQPFFLYLSHHAVH 242
Query: 129 SG-NPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVV 187
+ +P R L + + A +A ++ +D S+G+V+KAL G+ N++++
Sbjct: 243 TPIQEQKPYRENYTLTEG----EPEAEAAYATMIEGVDNSLGEVIKALDDWGIANNTLLI 298
Query: 188 FSTDNGGPA-----AGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQK 242
F +DNGG + NYPL+ K + +EGG+R + P K V+
Sbjct: 299 FYSDNGGRVLFRGKKSLYGDFEFNYPLRSGKASNYEGGIRVPCVVRWPGKVKKQTVSDAP 358
Query: 243 MHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSK 277
+ I D T+ A + +DG++ L K
Sbjct: 359 LVIEDIYTTVLEATHTKIPDDYAIDGMSWLPVLEK 393
>UniRef50_A6KZ75 Cluster: Putative secreted sulfatase; n=1;
Bacteroides vulgatus ATCC 8482|Rep: Putative secreted
sulfatase - Bacteroides vulgatus (strain ATCC 8482 / DSM
1447 / NCTC 11154)
Length = 517
Score = 87.0 bits (206), Expect = 9e-16
Identities = 81/294 (27%), Positives = 140/294 (47%), Gaps = 33/294 (11%)
Query: 23 QYLKDLGYKTHLVGKWHLGSYKKE-YLPLNRGFDSHV-GFWTGRIDMYDHTTMEQGSWGT 80
+ L+ GY T GK H GS P + GF+ ++ G G + Y EQ T
Sbjct: 151 ELLRQNGYHTIHCGKAHFGSIDTPGENPTHWGFEVNIAGHAAGGLATY---LSEQNYGHT 207
Query: 81 DFRRGFEVA-----HDLFG--VYATDVYTDEAIKVVNSHNK-SEPLFLMLAHSAVHSGNP 132
+ + + D +G ++AT+ T EAIK ++ K ++P +L +AH A+H
Sbjct: 208 RDGKPYSLMAIPGLEDYWGTGIFATEALTQEAIKALDKAKKYNQPFYLYMAHYAIHV--- 264
Query: 133 YEPIRAPQKLIDAFKYIDDSARQK---FAAVLSKLDESVGKVVKALHTRGLLENSIVVFS 189
P+ + KYI K +A+++ +D+S+G ++ L +N++++F
Sbjct: 265 --PVDKDMRFFP--KYIKKGLSDKEAAYASLIEGMDKSLGDLMNWLEKNDEADNTVIIFM 320
Query: 190 TDNGGPAA--GFNDNA--ASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMH 244
+DNGG AA G+ D N PL K +L+EGG+R + W ++ R + +
Sbjct: 321 SDNGGLAAEPGWRDGQIHTQNAPLNSGKGSLYEGGIREPMIVSWPGVVTPNTR-CDKYLI 379
Query: 245 ISDWLPTLYSAAG-GDLSVLENLDGVNQWDALSKNTESPR--TSVLHNIDDIWG 295
I D+ PT+ AG + + +DG++ + L K T P +++ N +IWG
Sbjct: 380 IEDFYPTILEMAGITNYKTVNPIDGIS-FMPLLKGTGDPSKGRALVWNFPNIWG 432
>UniRef50_Q9NJU7 Cluster: Sulfatase 2; n=1; Helix pomatia|Rep:
Sulfatase 2 - Helix pomatia (Roman snail) (Edible snail)
Length = 266
Score = 87.0 bits (206), Expect = 9e-16
Identities = 36/70 (51%), Positives = 47/70 (67%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
+QH +I+ ++P GLPL + LK +GY TH +GKWHLG YKKEY PL RGFDS+ G+
Sbjct: 92 LQHDIIWPSQPYGLPLQFPTIADMLKSVGYSTHAIGKWHLGLYKKEYTPLYRGFDSYYGY 151
Query: 61 WTGRIDMYDH 70
G D Y +
Sbjct: 152 LEGGEDYYTY 161
Score = 42.7 bits (96), Expect = 0.019
Identities = 25/78 (32%), Positives = 40/78 (51%), Gaps = 7/78 (8%)
Query: 74 EQGSW-GTDFRRGFEVAHDLFGVYATDVYTDEAIKVVN-SHNKSEPLFLMLAHSAVHSGN 131
++ W G D R E D+ G Y+T +YT +AI ++N + +P L LA+ AVHS
Sbjct: 194 DENKWCGYDLRDMNEPVTDMNGTYSTHLYTKKAIDIINGASTGGKPFLLYLAYQAVHS-- 251
Query: 132 PYEPIRAPQKLIDAFKYI 149
P+ P + + +I
Sbjct: 252 ---PMEVPAEYTKPYTFI 266
>UniRef50_Q7UPG6 Cluster: Arylsulphatase A; n=2; Bacteria|Rep:
Arylsulphatase A - Rhodopirellula baltica
Length = 485
Score = 86.6 bits (205), Expect = 1e-15
Identities = 68/242 (28%), Positives = 108/242 (44%), Gaps = 21/242 (8%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEY---LPLNRGFDSHVGFWTGRIDMYDH 70
L L E L + L+D GY T VGKWHLG +E P GFD W H
Sbjct: 125 LRLREVTLAEVLRDAGYATAHVGKWHLGLPTEERDKPTPDQHGFDHWFATWNNA--QPSH 182
Query: 71 TTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSG 130
+ +F R E L G Y+ + DEAI+ ++ H +S+P + H
Sbjct: 183 RNPD------NFIRNGEPVGQLEG-YSCQLVADEAIRWMDRHRESDPDQPFFLNVWFH-- 233
Query: 131 NPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFST 190
P+ PI AP ++ + + D ++ + D+++ +++ L G+ EN+++V+++
Sbjct: 234 EPHAPIAAPDEVTQKYGKLSDKG-AVYSGTIDNTDQAIKRLLAKLDALGVRENTLIVYAS 292
Query: 191 DNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLP 250
DNG D L+G K WEGG+R G P V+ + + D LP
Sbjct: 293 DNGSYR---TDRVGK---LRGRKGANWEGGIRVPGIFHWPGHIPAGVVSNEPAGLVDVLP 346
Query: 251 TL 252
T+
Sbjct: 347 TI 348
>UniRef50_Q8A222 Cluster: N-acetylgalactosamine-6-sulfatase; n=1;
Bacteroides thetaiotaomicron|Rep:
N-acetylgalactosamine-6-sulfatase - Bacteroides
thetaiotaomicron
Length = 453
Score = 86.2 bits (204), Expect = 2e-15
Identities = 80/279 (28%), Positives = 134/279 (48%), Gaps = 29/279 (10%)
Query: 16 LNEKI--LPQYLKDLGYKTHLVGKWHLGSYKKEY-LPL--NRGFDSHVGFWTGRIDMYDH 70
L++K+ + + ++ GY T +GKWH+G + + P N GFD ++ + D
Sbjct: 110 LDDKLPSMARAFQNAGYATGHIGKWHMGGGRDVHNAPSIKNYGFDEYLSTYESP-DPDPA 168
Query: 71 TTMEQGSW-GTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHS 129
T + W D + ++ T+ + D++I + H K P FL L +H+
Sbjct: 169 ITASKWIWCDNDSIKRWK---------RTEYFVDKSIDFIKRH-KDSPFFLNLWPDDMHT 218
Query: 130 GNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFS 189
P+ P QK +++ ++ F+ VL ++D+ +G+ +KAL GL EN+I++F+
Sbjct: 219 --PWVP-EFKQKERKSWE-----TKEAFSPVLGEMDKQIGRFIKALDDMGLSENTIIIFT 270
Query: 190 TDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHIS-DW 248
+DN GPA F A + L+G KN+L+EGG+R + P RV + + D
Sbjct: 271 SDN-GPAPSF--KAVRSAYLRGTKNSLYEGGIRMPFIVKYPKKIKPGRVNNSSVLCAVDL 327
Query: 249 LPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVL 287
PTL S AG DG N L +E+ R + L
Sbjct: 328 YPTLCSVAGIKTEKNYKGDGQNYAKVLLGKSEAKRKTDL 366
>UniRef50_Q7UQ05 Cluster: Arylsulfatase A; n=2;
Planctomycetaceae|Rep: Arylsulfatase A - Rhodopirellula
baltica
Length = 525
Score = 85.8 bits (203), Expect = 2e-15
Identities = 89/337 (26%), Positives = 156/337 (46%), Gaps = 58/337 (17%)
Query: 14 LPLNEKILPQYLKDLG-YKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
L L+E + ++L+D Y+T +GKWHLG +LP ++GF ++G H
Sbjct: 146 LALDEVTIAEHLRDAADYQTFFLGKWHLGDVG--HLPTDQGFQINIGG--------GHKG 195
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHSGN 131
G + + ++ + A G Y T TDEA+ +V++ ++ + P F+M+++ VHS
Sbjct: 196 SPPGGYYSPWKNPYLKAKQ-DGEYLTTRLTDEAVSLVDTASREDKPFFMMMSYYNVHS-- 252
Query: 132 PYEPIRAPQKLIDAFKYIDDSA-------------------RQK---FAAVLSKLDESVG 169
PI ++ ID F+ ++ RQ +A+++ +D SVG
Sbjct: 253 ---PITPDKRTIDHFEEKQSNSPELQGDTPTIAERDAVTRGRQDNPAYASMVKAVDTSVG 309
Query: 170 KVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWS 229
+++KAL G+ +N++V+F +DNGG + N PL+ K L+EGG+R +
Sbjct: 310 RIMKALKEHGVDDNTLVIFFSDNGGLSTLRKFGPTCNSPLRAGKGWLYEGGIREPLLVRL 369
Query: 230 PLL----DSKARVAYQKMHI------SDWLPTLYSAAGGDLSVLENLDGVNQWDAL---- 275
P + V++Q + +D PT+ G L + DG++ A+
Sbjct: 370 PKTMPGGATNETVSHQPKTVDSVACSTDLFPTILDVVGLPLQPESHADGISLLPAIAGEA 429
Query: 276 SKNTESPRT---SVLHNIDDIWGI-AALTVDKYKLIK 308
++ SPR H +W AA+ YKLI+
Sbjct: 430 AETDSSPRDLHWHYPHYHGSLWRPGAAIRRGNYKLIE 466
>UniRef50_A6DTN4 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
n=2; Lentisphaera araneosa HTCC2155|Rep:
N-acetylgalactosamine 6-sulfate sulfatase - Lentisphaera
araneosa HTCC2155
Length = 482
Score = 85.8 bits (203), Expect = 2e-15
Identities = 88/314 (28%), Positives = 141/314 (44%), Gaps = 32/314 (10%)
Query: 17 NEKILPQYLKDLGYKTHLVGKWHLGSYKKE-YLPLNRGFDSHVGFWTGRIDMYDHTTMEQ 75
++ I P+ L+ GY T ++GK +G + LP +GFD GF + H
Sbjct: 99 HDLIFPKALQKAGYHTAMIGKSGMGCNTDDAALPYQKGFDYFFGFTS---HTQAHWFFPT 155
Query: 76 GSWGTDFR-RGFEVAHDLF---GVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSG- 130
W D + E ++ Y+++V +EA+ V K P FL LA H+
Sbjct: 156 HLWKNDGKVTKVEYPNNTLHEGDNYSSEVVMNEALDYV-ERQKDGPFFLHLAFQIPHASL 214
Query: 131 -------NPYEPIRAPQKLIDAFKY----IDDSARQKFAAVLSKLDESVGKVVKALHTRG 179
Y PI + L K+ + + FAA++S +D +VG + K L G
Sbjct: 215 RAKEEWKAKYRPILKEKLLPKKDKHPHYSYEREPKTTFAAMVSYMDHNVGLLNKKLEDLG 274
Query: 180 LLENSIVVFSTDNGGPAAGFN--DNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKAR 237
L EN++++F++DNG G + D+ SN L+G K ++EGGVR + P K +
Sbjct: 275 LAENTLIMFASDNGAMQEGGHKRDSFDSNGVLRGGKRDMYEGGVRTPMIAYWP---GKIK 331
Query: 238 VAYQKMHIS---DWLPTLYSAAGGDLSVLENLDGVNQWDA-LSKNTESPRTSVLHNIDDI 293
HIS D PT+ AG V E+ DG++ L K +++ + +
Sbjct: 332 AGQTSDHISAFWDISPTVRELAGA--KVQEDTDGISFVPTLLGKGSQTKHDYLYWEFFEQ 389
Query: 294 WGIAALTVDKYKLI 307
G A+ + K+KLI
Sbjct: 390 GGKRAIRMGKWKLI 403
>UniRef50_A6DSF1 Cluster: N-acetyl-galactosamine-6-sulfatase; n=1;
Lentisphaera araneosa HTCC2155|Rep:
N-acetyl-galactosamine-6-sulfatase - Lentisphaera
araneosa HTCC2155
Length = 517
Score = 85.8 bits (203), Expect = 2e-15
Identities = 72/251 (28%), Positives = 120/251 (47%), Gaps = 35/251 (13%)
Query: 24 YLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGTDFR 83
YLK+ GY T +GKWH+ E G+D G T+ ++GS
Sbjct: 145 YLKNQGYATAHIGKWHIYGGGPE----KHGYDVSSG----------ETSNDEGS-----P 185
Query: 84 RGFEVAHDLFGVYATDVYTDEAIKVVNSH-NKSEPLFLMLAHSAVHSGNPYEP-IRAPQK 141
+ +D +++ T +IK + NK +P F+ ++H A HS P A +
Sbjct: 186 KNITDPNDPKRIFSI---TKNSIKFIEKQTNKEKPFFIQVSHYAEHSAQMSLPETLASYE 242
Query: 142 LIDAFKYIDDSARQK----FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAA 197
A K I D +K A ++ +D S+G ++ L +L+N+ V+F++DNG
Sbjct: 243 NDPAIKKIKDKKFKKEVITHGAAVTDMDTSIGMIIDKLKELNILDNTYVIFTSDNG--KG 300
Query: 198 GFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAG 257
+D L+G K +LWEGG+R + P +++K+R + + + D LPT+Y AG
Sbjct: 301 LLHDKRI----LRGSKWSLWEGGIRVPFMIMGPGIEAKSRCS-ENIIGYDMLPTIYELAG 355
Query: 258 GDLSVLENLDG 268
G+ + N+DG
Sbjct: 356 GNTEDMPNVDG 366
>UniRef50_A6DHI1 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
n=1; Lentisphaera araneosa HTCC2155|Rep:
N-acetylgalactosamine 6-sulfate sulfatase - Lentisphaera
araneosa HTCC2155
Length = 472
Score = 85.8 bits (203), Expect = 2e-15
Identities = 67/278 (24%), Positives = 122/278 (43%), Gaps = 16/278 (5%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
+P + + L + +K GY T +GKW LG + P +GFD G+ R H
Sbjct: 101 IPADSETLGKLMKRAGYATACIGKWGLGGFHNAGNPHKQGFDHFYGYTDQR---KAHNYY 157
Query: 74 EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPY 133
+ W + + Y+ D+ T +A+K + K +P FL LA+ P+
Sbjct: 158 PEYLWRNGEKEMLNNKNGEENDYSHDLMTVDALKYI-EEKKDQPFFLYLAYLI-----PH 211
Query: 134 EPIRAPQKLIDAFKYIDDSARQKF-AAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDN 192
+ P + +K D K AA+ S++D +G + + L G+ +N++++F++DN
Sbjct: 212 VKYQVPD--LAQYKDKDWPKEMKIHAAMTSRMDRDIGTIARRLEELGIADNTLIMFNSDN 269
Query: 193 GGPAAGFNDN-AASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPT 251
G ++ ++ LKG+K ++++GGVR + P V+ D +PT
Sbjct: 270 GAHGKSNSEKFFNTSGDLKGLKRSMYDGGVRSPMIAYWPGTIQAGSVSDHISAFWDMMPT 329
Query: 252 LYSAAGGDLSVLENLDGVNQWDA-LSKNTESPRTSVLH 288
G DG++ L K++E + L+
Sbjct: 330 FSELTGEPFK--GETDGISMLPTLLGKDSEQKQHKYLY 365
>UniRef50_Q5DYT4 Cluster: Arylsulfatase; n=10;
Gammaproteobacteria|Rep: Arylsulfatase - Vibrio fischeri
(strain ATCC 700601 / ES114)
Length = 537
Score = 85.4 bits (202), Expect = 3e-15
Identities = 83/292 (28%), Positives = 127/292 (43%), Gaps = 42/292 (14%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYK-----------------------KEYLP 49
G+PL+ K+LP ++ GY+T +GKWH K K Y P
Sbjct: 137 GIPLDIKLLPALFQENGYRTATIGKWHNAKIKGKNLVDEDKRTRDYHDNQITVTPKGYGP 196
Query: 50 LNRGFDSHVGFWTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVV 109
RGFD F+ ++D + Q G + + H+L T++A+K +
Sbjct: 197 EERGFDYSYSFYASGAALWDSPAIWQN--GKNISAPGYLTHNL---------TEQALKFI 245
Query: 110 NSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVG 169
+ +P F+ LA S H P E +P K +D F + A + FAA+ + DES+G
Sbjct: 246 DESG-DKPFFVNLAFSVPHI--PLEEA-SPAKYMDRFNTGNVEADKYFAAI-NAADESLG 300
Query: 170 KVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWS 229
++ L +G L+N+I+ F +DNG A N KG K ++ GGVR +
Sbjct: 301 IIMDNLEKKGELDNTIIFFLSDNG---AVHESPMPMNGMDKGFKGQMYNGGVRVPFVAYW 357
Query: 230 PLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTES 281
P + + D LPT +AAG D+ +DG N L TE+
Sbjct: 358 PKHIPAGGESDSLISALDILPTALAAAGIDIPEDMQVDGKNIMPVLEGKTET 409
>UniRef50_A6DKN7 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
Lentisphaera araneosa HTCC2155|Rep:
N-acetylgalactosamine 6-sulfatase - Lentisphaera
araneosa HTCC2155
Length = 465
Score = 85.4 bits (202), Expect = 3e-15
Identities = 83/284 (29%), Positives = 126/284 (44%), Gaps = 36/284 (12%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
G+ +E ++P +K GY+T +GKWHLGS +E+ P RGFD G+ G Y +
Sbjct: 104 GVKTSEIMIPALMKKGGYQTCAIGKWHLGS-SEEFQPNARGFDHWFGY-RGSCGFYQFKS 161
Query: 73 MEQGSW-GTDFRR-------GFEVAHDLFGV----YATDVYTDEAIKVVNSHNKSEPLFL 120
Q + G + + +V + V Y TD ++DEA + NK P F+
Sbjct: 162 QVQSAKKGQELKPLPSGEDPNLDVVRNGESVRLEGYLTDHFSDEAANWIKE-NKERPFFM 220
Query: 121 MLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGL 180
A VH+ P P K I D V++ LD SV ++ AL G+
Sbjct: 221 YFAPYNVHA-----PDTVPNKYIPKGGTAHDG-------VIAALDASVQTILDALKEAGI 268
Query: 181 LENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVR-GAGFLWSPLLDSKARVA 239
+N++VVFS DNGG D + + KG K T +EGG+R W +++ ++
Sbjct: 269 ADNTLVVFSNDNGGK----KDYSKT---FKGNKATFYEGGIRVPFAMRWPKGIEAGSKY- 320
Query: 240 YQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPR 283
+ D LPT + A DL DG N + + + R
Sbjct: 321 NGVVSTLDLLPTFAALAKVDLPSDRVYDGQNLLPVIKDSAKDQR 364
>UniRef50_A4GIB1 Cluster: Arylsulfatase; n=1; uncultured marine
bacterium HF10_49E08|Rep: Arylsulfatase - uncultured
marine bacterium HF10_49E08
Length = 608
Score = 85.0 bits (201), Expect = 4e-15
Identities = 76/284 (26%), Positives = 124/284 (43%), Gaps = 29/284 (10%)
Query: 21 LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGT 80
+ Y ++ GY T + GKWHLG+ + P +RGF V + + I WG
Sbjct: 102 IANYYEEAGYSTGVFGKWHLGA-NYPFRPQDRGFQESVWYPSSSIPSVP------AYWGN 154
Query: 81 DFRRGFEVAHDL---FGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHSGNPYEPI 136
D+ + + F Y DV+ +EA++ ++ KS+ P LA + H P+ P
Sbjct: 155 DYFDDVYIHNGKEKRFEGYCADVFFNEAMRFMSESAKSKKPFMCYLATNTPHG--PFWPK 212
Query: 137 RAPQKLI------DAFKYIDDSARQKFAAVLS---KLDESVGKVVKALHTRGLLENSIVV 187
+K I F +D++ +++ A L +D ++G ++K L L E++I++
Sbjct: 213 EEDRKEIAEVLAQSKFDNLDNNLKKRLALYLGMIRNIDWNMGNLLKFLKEENLAEDTILI 272
Query: 188 FSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHIS 246
F TDNG NA ++G K +WEGG R F+ W KAR +
Sbjct: 273 FKTDNGSLLGPQYFNAG----MRGKKTEIWEGGHRVPCFIRWPNGGFGKARDIGGLTQVQ 328
Query: 247 DWLPTLYSAAGGDLSVLENLDGVNQWDAL--SKNTESPRTSVLH 288
D LPT+ G DG++ L K RT +++
Sbjct: 329 DILPTVLDLCGIKPRKNTKFDGISLASVLRGKKKVSEDRTIIIN 372
>UniRef50_A0JAV3 Cluster: Sulfatase precursor; n=1; Shewanella
woodyi ATCC 51908|Rep: Sulfatase precursor - Shewanella
woodyi ATCC 51908
Length = 356
Score = 85.0 bits (201), Expect = 4e-15
Identities = 66/201 (32%), Positives = 102/201 (50%), Gaps = 25/201 (12%)
Query: 26 KDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF-WTGRIDMYDHTTMEQGSWGTDFRR 84
K GY T ++GKWHLG + P GFD+ + G Y + S G
Sbjct: 141 KQQGYATAVIGKWHLG----KTAPTEYGFDTAIAASHLGHPPSYFYPY----SKGKRKLI 192
Query: 85 GFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLID 144
G E L Y ++ T EA+ ++S +P FL L AVH+ PI AP++ ++
Sbjct: 193 GLEEG-GLKDEYLSNRITREAVNYISSQR--QPFFLYLPFYAVHT-----PIEAPKEWVN 244
Query: 145 AFKYIDDSARQK---FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFND 201
+ K +AA+++ LD VGK+++AL G EN++VVF++DNG D
Sbjct: 245 QHNARQQAGEIKSAAYAAMIANLDRDVGKLLQALDKSGQRENTLVVFASDNGA-----YD 299
Query: 202 NAASNYPLKGVKNTLWEGGVR 222
A S+ P +G K++L+EGG++
Sbjct: 300 PATSSLPYRGYKSSLFEGGIK 320
>UniRef50_A6CA27 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
Planctomyces maris DSM 8797|Rep: N-acetylgalactosamine
6-sulfatase - Planctomyces maris DSM 8797
Length = 491
Score = 84.6 bits (200), Expect = 5e-15
Identities = 74/259 (28%), Positives = 112/259 (43%), Gaps = 26/259 (10%)
Query: 18 EKILPQYLKDLGYKTHLVGKWHLGSYKKE--YLPLNRGFDSHVGFWTGRIDMYDHTTMEQ 75
E + + +K +GY T GKWHLGS + P N GFD W + Y++
Sbjct: 111 EVTVAEAVKSVGYTTGHFGKWHLGSVQSNSPVSPGNSGFDE----WVSSPNFYENDPYMS 166
Query: 76 GSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEP 135
+ V L G ++ V D A+ + +K + FL + + GNP+ P
Sbjct: 167 HNG---------VVKQLKGE-SSRVTVDAALDFIKQADKDKKPFL----AVIWFGNPHTP 212
Query: 136 IRAPQKLIDAFKYIDDSAR-QKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGG 194
A +L D Y D Q + +S +D ++G + L GL EN+++ F++DNG
Sbjct: 213 HEAVSELKDL--YPDQKPNFQNYFGEISGVDRAMGHLRSQLRDLGLAENTLLWFTSDNGP 270
Query: 195 PAAGFNDNAASNYP---LKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPT 251
F A + L G K LWEGGVR + P + K V+ D PT
Sbjct: 271 RPPQFKTEEARSQATGGLAGFKGNLWEGGVRVPSLIEWPAVIKKPEVSNVPCGTIDIYPT 330
Query: 252 LYSAAGGDLSVLENLDGVN 270
+ + G +S LDGV+
Sbjct: 331 VLAMTGAKVSHQPQLDGVS 349
>UniRef50_A6C8R8 Cluster: Arylsulfatase A; n=1; Planctomyces maris
DSM 8797|Rep: Arylsulfatase A - Planctomyces maris DSM
8797
Length = 510
Score = 84.6 bits (200), Expect = 5e-15
Identities = 79/274 (28%), Positives = 123/274 (44%), Gaps = 22/274 (8%)
Query: 5 VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGR 64
V+ P GL +E + + LK GYKT ++GKWHLG + +LP +GFD G
Sbjct: 123 VLRPISPYGLNPDEITVAEVLKKQGYKTGMIGKWHLGD-QTPFLPTRQGFDYFYGIPYSD 181
Query: 65 IDMYDHTTMEQGSW--GTDFRRGFEVAHDLF---GV---YATDVYTDEAIKVVNSHNKSE 116
DM G G ++ + +D GV T YT++A++ + NK++
Sbjct: 182 -DMTQAVGQRLGDRLDGKNWPPLPVMLNDTVIEAGVDRNLLTKDYTEKAVEFIEK-NKNQ 239
Query: 117 PLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALH 176
P FL + G+ +P + DAF+ S + + +LD S G+++ L
Sbjct: 240 PFFLYFPQAM--PGSTRKPFAS-----DAFR--GKSKNGPWGDSIEELDWSTGQILDKLV 290
Query: 177 TRGLLENSIVVFSTDNGGP-AAGFND-NAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDS 234
G+ +N++V++++DNG P A N +N PL G T EG R +W P
Sbjct: 291 ELGIDKNTLVIWTSDNGSPMAKDMNSTERGTNKPLNGRGYTTSEGAFRVPTIVWWPETVP 350
Query: 235 KARVAYQKMHISDWLPTLYSAAGGDLSVLENLDG 268
V + D LPT AGG + +DG
Sbjct: 351 AGTVCEELATTMDLLPTFARLAGGKVPSDRIIDG 384
>UniRef50_A6CEC4 Cluster: Aryl-sulphate sulphohydrolase; n=1;
Planctomyces maris DSM 8797|Rep: Aryl-sulphate
sulphohydrolase - Planctomyces maris DSM 8797
Length = 467
Score = 84.2 bits (199), Expect = 6e-15
Identities = 70/247 (28%), Positives = 117/247 (47%), Gaps = 28/247 (11%)
Query: 25 LKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGTDFRR 84
L GY+ VGKWHLG PL++GF ++ + T +G + + ++
Sbjct: 132 LSQAGYRCASVGKWHLGQS-----PLSQGFQVNIAG--------NQTGSPRGGYFSPYQN 178
Query: 85 GFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLID 144
+++ G + TD T A + + N+ P FL L H AVH+ P++A ++ I
Sbjct: 179 P-QLSDGEQGEFLTDRLTTAACQFIKD-NQGSPFFLYLTHYAVHT-----PLQAKKEDIA 231
Query: 145 AFKYIDDSARQK---FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFND 201
F+ + +AA++ +D+S+G+V++ L + L +N+IVVF++DNGG
Sbjct: 232 YFQSKPAGKLHQHATYAAMIRSMDQSIGRVLQTLREQQLDQNTIVVFTSDNGGYGP---- 287
Query: 202 NAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLS 261
A S PL+G K L+EGG+R + P + + + D PT +
Sbjct: 288 -ATSMLPLRGSKGMLYEGGIRVPLLIKWPGVTQPGSTTGEAVINVDLYPTFLEMTNIPVL 346
Query: 262 VLENLDG 268
E LDG
Sbjct: 347 ESELLDG 353
>UniRef50_A4B5Y4 Cluster: Iduronate-sulfatase and sulfatase 1; n=1;
Alteromonas macleodii 'Deep ecotype'|Rep:
Iduronate-sulfatase and sulfatase 1 - Alteromonas
macleodii 'Deep ecotype'
Length = 588
Score = 84.2 bits (199), Expect = 6e-15
Identities = 88/324 (27%), Positives = 135/324 (41%), Gaps = 38/324 (11%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGF-DSHVG-FWTGRIDMYDHT 71
+P N + DLGY T +VGKWHL + F D+ + F GR+
Sbjct: 208 IPENVVTMGDRYSDLGYTTGMVGKWHLEIDQNSKPWFKENFPDTPISEFNLGRLPSSLKE 267
Query: 72 TMEQGSWGTDFR-----RGFEVAHDLFGV-----------YATDVYTDEAIKVVNSHNKS 115
S G + + +DL G Y DV +D A + ++ N
Sbjct: 268 RYYPSSKGYKYNYFGYANRYWANYDLKGNQTQLGWISNSDYRLDVVSDAATQFIDI-NHD 326
Query: 116 EPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKAL 175
EP +L +AH A P+ P+ A + + F + R+ A++ +D VG +V L
Sbjct: 327 EPFYLHVAHYA-----PHVPLEATEDYLSLFPEQSSNRRRYALAMMYAVDAGVGSIVSKL 381
Query: 176 HTRGLLENSIVVFSTDNGGP-AAGFND---------NAASNYPLKGVKNTLWEGGVRGAG 225
G+LEN+I+ F +DNG P F D N + N PL G K L +GG++
Sbjct: 382 EEYGILENTIIAFISDNGAPIGLDFTDAPIAEKEAWNGSLNAPLLGEKGMLTDGGIKVPF 441
Query: 226 FL-WSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRT 284
+ W L S V + + D L + AG +VL LDGV+ + +T +
Sbjct: 442 IVHWPEKLQSNT-VIDEPVISLDVLYSAIKRAGASETVLSELDGVDIFPTQGFDTSALMN 500
Query: 285 SVLHNIDDIWGIAALTVDKYKLIK 308
L W +A+ + YK +K
Sbjct: 501 RPL--FWRFWNQSAVRLGNYKYLK 522
>UniRef50_Q0BZE9 Cluster: Sulfatase family protein; n=1; Hyphomonas
neptunium ATCC 15444|Rep: Sulfatase family protein -
Hyphomonas neptunium (strain ATCC 15444)
Length = 459
Score = 83.4 bits (197), Expect = 1e-14
Identities = 83/313 (26%), Positives = 133/313 (42%), Gaps = 32/313 (10%)
Query: 1 MQHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF 60
MQH VI+ GLP E + + LK+ GY+T +VGKWHLG +++EY P N+GFD G
Sbjct: 105 MQH-VIFPHSQDGLPAEEITISEMLKNAGYRTGMVGKWHLG-HQEEYWPTNQGFDWFYGV 162
Query: 61 WTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFL 120
DM D RG E+ + +A K + +P FL
Sbjct: 163 PYSN-DMAPF----------DLYRGKEIIESPADQSQLSLNYAKAAKEFIEDSSDKPFFL 211
Query: 121 MLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGL 180
A + P+ P+ P+ S + V+ +D +G V+ L G+
Sbjct: 212 YYAETF-----PHIPLFVPEDRSGT------SDAGLYGDVVETVDAGIGIVLDTLDEAGV 260
Query: 181 LENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAY 240
++++++F++DNG F +A +G K EGG R P K V++
Sbjct: 261 ADDTLIIFTSDNG---PWFEGSAGE---FRGRKGETHEGGFRVPFLARWPGHIPKGSVSH 314
Query: 241 QKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALT 300
+ D LPT S +G L +DG + L+ +P +L D + A
Sbjct: 315 EMAMNIDLLPTAASLSGATLPADRVIDGKDLTSLLTAGAPTPH-DILFFFDGNEIVGARD 373
Query: 301 VDKYKLIKGTIYK 313
+++L+ T Y+
Sbjct: 374 A-RFRLVLNTFYR 385
>UniRef50_Q4RJR3 Cluster: Chromosome 13 SCAF15035, whole genome
shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome 13 SCAF15035, whole genome shotgun sequence -
Tetraodon nigroviridis (Green puffer)
Length = 474
Score = 83.0 bits (196), Expect = 1e-14
Identities = 86/310 (27%), Positives = 133/310 (42%), Gaps = 38/310 (12%)
Query: 4 GVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKE---YLPLNRGFD-SHVG 59
GV+Y GLPLNE + + LK GY T VGKWHLG + + P + F VG
Sbjct: 91 GVLYPGSRGGLPLNETTIAEVLKPRGYATAAVGKWHLGGPCQNLTCFPPDVKCFGLCDVG 150
Query: 60 FWTGRIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLF 119
T + M+D +Q D + + +A D T A + +P F
Sbjct: 151 TVTVPL-MHDEVIKQQPVNFLDLEKAYSD-------FAKDFITTSA-------KRKQPFF 195
Query: 120 LMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRG 179
L H P A + L R F L + D+++G ++ L G
Sbjct: 196 LYFPSHHTHYPQYAGPGAAGKSL-----------RGPFGDALLEFDQTIGSLLATLERTG 244
Query: 180 LLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGF-LWSPLLDSKARV 238
++ N+++ F++DNG P + PL+ K T +EGG+R W L+ + V
Sbjct: 245 VINNTLIFFTSDNG-PELMRMSRGGNAGPLRCGKGTTYEGGMREPAIAYWQGLI--QPGV 301
Query: 239 AYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIW---G 295
++ D LPT S AG L + LDGV+ + L +S R +++ D G
Sbjct: 302 THEMASTLDILPTFASLAGAKLPQV-MLDGVDMTNILFSQGKSKREAMMFYPTDPSEKNG 360
Query: 296 IAALTVDKYK 305
+ A+ ++KYK
Sbjct: 361 LFAIRLEKYK 370
>UniRef50_A6DM48 Cluster: Arylsulfatase A; n=1; Lentisphaera
araneosa HTCC2155|Rep: Arylsulfatase A - Lentisphaera
araneosa HTCC2155
Length = 484
Score = 83.0 bits (196), Expect = 1e-14
Identities = 66/212 (31%), Positives = 103/212 (48%), Gaps = 27/212 (12%)
Query: 20 ILPQYLKDLGYKTHLVGKWHL-------GSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
ILPQ +K GY+T +VGKWHL G K P RGFD+ + + ++ ++ T
Sbjct: 118 ILPQIMKQGGYQTGMVGKWHLSEPGHKTGLTGKPLEPHRRGFDTAI-YTFNQLGRFNPTL 176
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
G + + Y DV DE IK + S +K +P F LA S P
Sbjct: 177 SHNGK------------NSKYEGYCGDVVFDEGIKWMESCSKEKPYFAYLATSI-----P 219
Query: 133 YEPIRAPQKLIDAFKYIDDSARQK-FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
+ P+ APQ+ D + +K + A++S +DE++GK++ + +R +I++F TD
Sbjct: 220 HTPLAAPQRYKDLYSGAKLKNNEKNYYAMISAVDENIGKLMTWMASRKDDRETILIFMTD 279
Query: 192 NGGPAAGFNDNAASNYPLKGVKNTLWEGGVRG 223
NG +G D A + + KN L+ G RG
Sbjct: 280 NGHAISG-PDGAGHSRDGRLKKNGLYNFGFRG 310
>UniRef50_A6DG54 Cluster: Arylsulphatase A; n=1; Lentisphaera
araneosa HTCC2155|Rep: Arylsulphatase A - Lentisphaera
araneosa HTCC2155
Length = 469
Score = 83.0 bits (196), Expect = 1e-14
Identities = 84/302 (27%), Positives = 131/302 (43%), Gaps = 32/302 (10%)
Query: 11 PRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSY--KKEYLPLNRGFDSHVGFWTGRIDMY 68
P LP +E + + LK GY T + GKWHLG+ K P +GFD +W
Sbjct: 102 PMHLPDSEITIAEALKTKGYATGMFGKWHLGNLDGKSHPTPSEQGFD----YWLA----C 153
Query: 69 DHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVH 128
D+ ++ R G V +A V DEA + + ++ P F +A S H
Sbjct: 154 DNNLIKHNPKSL-IRNGKPVGK--IAGWAAQVVADEANEWMK--KQTSPFFAYIAFSETH 208
Query: 129 SGNPYEPIRAPQKLIDAFKYI---DDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSI 185
S P+ AP++LI KYI ++ R + + D +VG ++K L G+ +N++
Sbjct: 209 S-----PLDAPEELIT--KYIERGENKKRATYRGMTEYSDAAVGSILKTLDDMGVSDNTL 261
Query: 186 VVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHI 245
V ++DN GP + D+ L+G K+ WEGG+R + P +
Sbjct: 262 VFLASDN-GPTS--EDSCEG---LRGKKSYTWEGGIRVPAIIRWPGKVKPGSEYNDPVGG 315
Query: 246 SDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALTVDKYK 305
D LPTL G +L ++DGV+ L T +L A++ + Y
Sbjct: 316 IDLLPTLCDIVGAELP-KRHIDGVSIRSVLEGKPFKRNTPILSFFYRTSPAASMRMGDYV 374
Query: 306 LI 307
LI
Sbjct: 375 LI 376
>UniRef50_A6DHY0 Cluster: N-acetylgalactosamine 6-sulfatase; n=2;
Lentisphaera araneosa HTCC2155|Rep:
N-acetylgalactosamine 6-sulfatase - Lentisphaera
araneosa HTCC2155
Length = 507
Score = 82.6 bits (195), Expect = 2e-14
Identities = 87/314 (27%), Positives = 129/314 (41%), Gaps = 21/314 (6%)
Query: 6 IYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRI 65
I+GA LP E L LK GY T GKWHLG+ K+Y F
Sbjct: 86 IWGANVGHLPKEEITLASVLKQQGYVTGHFGKWHLGTLNKDYSTKGESRKPTENFAPPWE 145
Query: 66 DMYDHTTMEQGSWGT---------DFRRGFEVAHDLFGVY--ATDVYTDEAIKVVNSHNK 114
YD + + + S T + G + +Y A V D+AI +
Sbjct: 146 RDYDESFVVESSVSTWDPASEKNPFYINGVPMKGTEESLYGGAARVVVDKAIPFMERAVS 205
Query: 115 SEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKA 174
FL + V P+EPI+A K ++ +K ++A + L+++DE VG++
Sbjct: 206 EGNPFL----AVVWFNAPHEPIKAGPKYLEMYKEHGEAAH--YYGCLTEMDEQVGRIRAK 259
Query: 175 LHTRGLLENSIVVFSTDNGGPA-AGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLD 233
L G+ +N+++ F +DNG A + L+G K +L++GGVR P
Sbjct: 260 LREMGVEKNTVLFFCSDNGPEGKKAKGAKAGTTSGLRGRKRSLYDGGVRVPALAEWPGKI 319
Query: 234 SKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDI 293
V M D+LPT+ + + LDG N AL ES R + I
Sbjct: 320 QAGSVIDAAMSTLDYLPTVIALQNHQMPDERPLDGENIL-ALLTGEESQRKRGIPFIHR- 377
Query: 294 WGIAALTVDKYKLI 307
G A L YKL+
Sbjct: 378 -GKAVLNRGDYKLV 390
>UniRef50_A4W906 Cluster: Sulfatase precursor; n=10;
Enterobacteriaceae|Rep: Sulfatase precursor -
Enterobacter sp. 638
Length = 501
Score = 82.6 bits (195), Expect = 2e-14
Identities = 79/306 (25%), Positives = 135/306 (44%), Gaps = 37/306 (12%)
Query: 17 NEKILPQYLKDLGYKTHLVGKWHLGS-YKKEYLPL--NRGFD----SHVGFWTGRIDMYD 69
NEK + YLKD GY T ++GKWHL + + P + GFD + GF T +D
Sbjct: 117 NEKTIASYLKDQGYDTAMMGKWHLNAGVDRHDQPQAEDAGFDYTLVNAAGFVTSDLDKAK 176
Query: 70 HTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHS 129
+ F R + + + + + + EAI +N ++P F+ +A + VH+
Sbjct: 177 ERPRNGVVYPNGFYRNGKALGTVNQI-SGEFVSQEAINWLNDKKDNKPFFMYVAFTEVHT 235
Query: 130 GNPYEPIRAPQKLIDAFK-YIDDSARQ------------------KFAAVLSKLDESVGK 170
P+ +P+K ++ +K Y+ + +Q ++ A +S +DE VGK
Sbjct: 236 -----PLASPKKYLEIYKNYMSEYEKQHPDMFYADWVDKPYRGPGEYYANISYMDEQVGK 290
Query: 171 VVKALHTRGLLENSIVVFSTDNGGPAAGFN-----DNAASNYPLKGVKNTLWEGGVRGAG 225
V+ + + G +N+I++F++DNG + A L+G K+ LWEGG+R
Sbjct: 291 VLAKIKSMGQEDNTIIIFTSDNGPVTREARKWYELNMAGETDGLRGRKDNLWEGGIRVPA 350
Query: 226 FLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTS 285
+ V + D LPTL +L +DG + L T + +
Sbjct: 351 IIKYGQHLHAGTVTDTPVSGLDILPTLAELTHFNLPTDRIIDGESIVPVLEGQTMNRQQP 410
Query: 286 VLHNID 291
+L ID
Sbjct: 411 LLFAID 416
>UniRef50_Q7UYW2 Cluster: Arylsulfatase; n=2; Planctomycetaceae|Rep:
Arylsulfatase - Rhodopirellula baltica
Length = 484
Score = 82.2 bits (194), Expect = 3e-14
Identities = 79/289 (27%), Positives = 132/289 (45%), Gaps = 34/289 (11%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
LPL E+ + + LKD GY+T GKWH+ S+ + YL + +H G ++
Sbjct: 127 LPLEEQTIAECLKDEGYQTAFFGKWHVSSHHERYLGWS---PTHGPAKQG----FEFAEE 179
Query: 74 EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPY 133
+ G+ D++R G +A D + + + P F M + VH+
Sbjct: 180 DYGAHPYDWKRSPVATIKEPGRFAPDSMV-QRVGAFLRQDHDRPYFAMASSFYVHT---- 234
Query: 134 EPIRAP----QKLIDAFKYIDDSARQ---KFAAVLSKLDESVGKVVKALHTRGLLENSIV 186
P+R P ++ DA R ++AA L D VG+++ +L G + +IV
Sbjct: 235 -PVRTPCQWLREKYDARVPATSKKRNNRIEYAAFLETFDHHVGQILNSLEASGRADRTIV 293
Query: 187 VFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHI 245
+ ++DNGG + +N PL+G K L+EGG+R + W ++ K + +
Sbjct: 294 ILNSDNGG-----HPEYTANAPLRGSKWNLYEGGIRVPMIVRWPGVVQPKTEIDRPVIGY 348
Query: 246 SDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIW 294
D LPT+ + AGG+ DG + A S +SP T+ H++ IW
Sbjct: 349 -DLLPTMVALAGGN---PPKCDG--ESFAGSLRGDSPPTNEQHSL--IW 389
>UniRef50_A6DMX9 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
n=3; Lentisphaera araneosa HTCC2155|Rep:
N-acetylgalactosamine 6-sulfate sulfatase - Lentisphaera
araneosa HTCC2155
Length = 467
Score = 82.2 bits (194), Expect = 3e-14
Identities = 73/298 (24%), Positives = 136/298 (45%), Gaps = 47/298 (15%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKK--EYLPLNRGFDSHVGFW----TGRID 66
G+P +E + LK+ GY+T VGKW + + + +P +GFD + G +G+ID
Sbjct: 92 GMPASEITFAEMLKETGYQTACVGKWDVSNRQPIIPRMPNAQGFDYYYGTLGGNGSGKID 151
Query: 67 MYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHS 125
+Y++ E+ D+ + T +YT++AI + E P L LAH+
Sbjct: 152 LYENNKKER------------TTEDMASL--TRLYTNKAIDFLEKQRDPEKPFILYLAHT 197
Query: 126 AVHSGNPYEPIRAPQKLIDAF-KYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENS 184
H+ ++DA K+ + + + A + +LD G+++ L+ L +N+
Sbjct: 198 MTHT------------VVDASPKFKEKTGDNLYRAAVEELDYETGRLLNKLNQLNLSKNT 245
Query: 185 IVVFSTDNG--GPAAGFNDNAASNYPLKGV-----------KNTLWEGGVRGAGFLWSPL 231
+V++++DNG N A +++P + K ++WEGG + P
Sbjct: 246 LVIYTSDNGPWNQPKYINGGAKNDHPENSIFWGDAGEFRDGKASIWEGGAHVPCVMRWPG 305
Query: 232 LDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHN 289
+ + M D+LPTL + G + +DGVNQ + +E+ R + ++N
Sbjct: 306 KIAAGKTNDGLMATIDFLPTLAAVTGAKIPDERVIDGVNQLGFICGKSETARETYIYN 363
>UniRef50_A6DMX6 Cluster: Arylsulphatase A; n=1; Lentisphaera
araneosa HTCC2155|Rep: Arylsulphatase A - Lentisphaera
araneosa HTCC2155
Length = 484
Score = 82.2 bits (194), Expect = 3e-14
Identities = 84/298 (28%), Positives = 128/298 (42%), Gaps = 42/298 (14%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
LP N L +K GYKT GKWHL + D YD + M
Sbjct: 110 LPENIFTLGDAMKSAGYKTGYFGKWHLNDRTAKGKEARHTPDERG---------YDKSYM 160
Query: 74 EQGSWGTDFRRGFEVAHDLFGVYA-TDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
G G +R F+ A+ L + V TD + + NK +P FL ++H VH
Sbjct: 161 YNG--GGFYRPVFQPAYKLDKPKRLSQVLTDMGVDFIKE-NKDQPFFLFVSHYDVHV--- 214
Query: 133 YEPIRAPQKLIDAF--KYIDDS--ARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVF 188
+ A + LID + K D + +AA++ D+SVG+++KA+ +GL +N++ +F
Sbjct: 215 --QLDADKDLIDKYLNKKRDPNYPGNAVYAAMIEHTDDSVGQLMKAIDDQGLADNTLFIF 272
Query: 189 STDNGGPAAGFND--------------------NAASNYPLKGVKNTLWEGGVRGAGFLW 228
+DNGG ++D A SN PL+ K T++EGG+R +
Sbjct: 273 YSDNGGVDNRYDDIPLLGGRSVNVYPEGHPLRYVATSNAPLRSGKGTVYEGGIRVPLIVR 332
Query: 229 SPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSV 286
P S + SD+ P+ + LDGV+ AL+KN+ P V
Sbjct: 333 WPGKVSPGTRSEAVFSSSDFYPSFLEVTKTQAPKNQVLDGVSMVPALTKNSFDPEREV 390
>UniRef50_P77318 Cluster: Uncharacterized sulfatase ydeN precursor;
n=32; Gammaproteobacteria|Rep: Uncharacterized sulfatase
ydeN precursor - Escherichia coli (strain K12)
Length = 560
Score = 82.2 bits (194), Expect = 3e-14
Identities = 82/279 (29%), Positives = 128/279 (45%), Gaps = 38/279 (13%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGF-DSHVGFWTGRIDMYDHT 71
G+PL E LP+ ++ GY T VGKWHL +P ++ D H F T +
Sbjct: 159 GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTT-----FSAE 213
Query: 72 TMEQGSWGTDFRRGFEVAHDLF---------------GVYATDVYTDEAIKVVN-SHNKS 115
+ + G D+ GF A + Y +D TDEAI VV+ +
Sbjct: 214 EWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLD 273
Query: 116 EPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKAL 175
+P L LA++A H N P AP + F +A +A+V S +D+ V ++++ L
Sbjct: 274 QPFMLYLAYNAPHLPND-NP--APDQYQKQFNTGSQTADNYYASVYS-VDQGVKRILEQL 329
Query: 176 HTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSK 235
G +N+I++F++DNG A + N KG K+ + GG F+W K
Sbjct: 330 KKNGQYDNTIILFTSDNG---AVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMW---WKGK 383
Query: 236 ARVA-YQKM-HISDWLPTLYSAAGGDLSVLEN--LDGVN 270
+ Y K+ D+ PT AA D+S+ ++ LDGV+
Sbjct: 384 LQPGNYDKLISAMDFYPTALDAA--DISIPKDLKLDGVS 420
>UniRef50_A6DFN4 Cluster: Arylsulfatase; n=1; Lentisphaera araneosa
HTCC2155|Rep: Arylsulfatase - Lentisphaera araneosa
HTCC2155
Length = 481
Score = 81.8 bits (193), Expect = 3e-14
Identities = 87/320 (27%), Positives = 135/320 (42%), Gaps = 30/320 (9%)
Query: 8 GAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRI-- 65
G EP +P L Q KD GY T GKW LG P GFD+ G+ R+
Sbjct: 96 GQEP--IPEPGMTLAQIFKDKGYATGAFGKWGLGYPGSSSDPKALGFDTFYGYNCQRVAH 153
Query: 66 -----DMYDH----TTMEQ---GSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHN 113
M+ + T E+ G W F+ + YA D+ DEA+K + N
Sbjct: 154 SFYPPHMWSNDKNITINEKPVPGHWRKAVGPDFDFSQFYAENYAPDLILDEALKFIKD-N 212
Query: 114 KSEPLFLMLA----HSAVHSGNPY-----EPIRAPQKLIDAFKYIDDSARQKFAAVLSKL 164
K +P F L H A+H + + + +P++ A R +AA++S L
Sbjct: 213 KDKPFFAYLPFVEPHLAMHPPHSWVDSYPKEWDSPKESYKAAYLPHLRPRAGYAAMISDL 272
Query: 165 DESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNA--ASNYPLKGVKNTLWEGGVR 222
DE VG V++ L L+EN++V+F++DNG D+ S L+G+K +++EGG+R
Sbjct: 273 DEHVGSVMQLLKELDLVENTLVIFTSDNGASHCIEVDHEFFNSTKDLRGLKGSVYEGGLR 332
Query: 223 GAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESP 282
P KA+V+ D + T + DGV+ L + P
Sbjct: 333 VPMIAHWPGKIKKAQVSDHVSGFVDVMATFCDLL--QTEAPQTSDGVSFLPTLKGEKQEP 390
Query: 283 RTSVLHNIDDIWGIAALTVD 302
+ + G A+ +D
Sbjct: 391 QPVLAWEFQGYSGQQAIILD 410
>UniRef50_Q7UYA5 Cluster: Arylsulfatase; n=1; Pirellula sp.|Rep:
Arylsulfatase - Rhodopirellula baltica
Length = 562
Score = 81.0 bits (191), Expect = 6e-14
Identities = 83/299 (27%), Positives = 131/299 (43%), Gaps = 33/299 (11%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
LP + + + LKD GY T +GKWHLG + P R + G D Y T +
Sbjct: 200 LPESATTVAELLKDAGYNTAHIGKWHLGGLHVDE-PGKR-LTNQPGPRQHGFDFYQ-TQI 256
Query: 74 EQ----GSWGTD---FRRGFEV--------AHD--LFGVYATDVYTDEAIKVVNSHNKSE 116
EQ G G D FR+G V + D + + TD D A++++ + E
Sbjct: 257 EQQPLRGQMGRDKTLFRKGGTVLLRNDQRISQDDPYYHKHFTDANGDFAVEMIEKLSSEE 316
Query: 117 -PLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKAL 175
P F+ + H PYEP P A I D + +F +++ +D VG +++ L
Sbjct: 317 DPFFINMWWLVPHK--PYEPAPEPHWSDTAADDITDD-QHRFRSMVQHMDAKVGAILRKL 373
Query: 176 HTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSK 235
+ +N++V+F++DNG GF + LKG K L +GG+R + P
Sbjct: 374 DELKIADNTLVLFTSDNGAAFEGF------IHDLKGGKTELHDGGIRVPMIVRWPDAIPA 427
Query: 236 ARVAYQKMHISDWLPTLYSAAGGDLSVLENLDG---VNQWDALSKNTESPRTSVLHNID 291
+ + H +D LPT AA L LDG ++ W + ++ R +V +D
Sbjct: 428 GQTSQTFSHTNDLLPTFCDAASVQLPSDLPLDGLSLLSHWKGGTPPSQVERGTVFWQLD 486
>UniRef50_Q7UN55 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
n=1; Pirellula sp.|Rep: N-acetylgalactosamine 6-sulfate
sulfatase - Rhodopirellula baltica
Length = 501
Score = 80.6 bits (190), Expect = 8e-14
Identities = 77/291 (26%), Positives = 128/291 (43%), Gaps = 24/291 (8%)
Query: 6 IYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF--WTG 63
+ G R L + + L D GY T VGKW LG+ N G GF WTG
Sbjct: 121 LIGNAARNLTGEQPTVASLLSDAGYATGGVGKWALGNVDVPEEIENPGHPLANGFDAWTG 180
Query: 64 RIDMYD-HTTMEQGSWGTDFRRGFE--------VAHDLFGV----YATDVYTDEAIKVVN 110
++ + H + W RR F +A V Y+ DV TD A +
Sbjct: 181 YMNQSNAHNYYPRFLWQNYERRFFPGNVISTDPIARGRVAVKRESYSHDVMTDAAFDFIR 240
Query: 111 SHNKSEPLFLMLAHSAVHSGNPYEPIRAP-QKLIDAFKYIDD---SARQKFAAVLSKLDE 166
H +S+P L + + H+ N + ++ D Y D+ + + FAA+++++D
Sbjct: 241 EH-RSDPFLLHVHWTIPHANNEGGRLNGDGMEVPDYGIYADEGWPNPEKGFAAMITRMDR 299
Query: 167 SVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNA--ASNYPLKGVKNTLWEGGVRGA 224
+G+++ L L E ++V+F++DNG G + + S+ PL+G K ++ EGG+R
Sbjct: 300 DMGRLMDLLEELKLSEKTLVIFTSDNGPHHEGGHSDLFFNSSGPLQGSKRSMHEGGIRVP 359
Query: 225 GFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDAL 275
P ++ D+LPT AG + ++DG++ AL
Sbjct: 360 FIAKWPGTIEPGTISDHPSAFWDFLPTACELAGAEPPA--DIDGISYLPAL 408
>UniRef50_A6C176 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
n=1; Planctomyces maris DSM 8797|Rep:
N-acetylgalactosamine 6-sulfate sulfatase - Planctomyces
maris DSM 8797
Length = 599
Score = 80.2 bits (189), Expect = 1e-13
Identities = 60/185 (32%), Positives = 96/185 (51%), Gaps = 16/185 (8%)
Query: 17 NEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQG 76
NE L + K GY+T L GKWHLG P ++GF + V G +
Sbjct: 109 NEVTLAEVFKSNGYRTGLFGKWHLGD-NYPLRPQDQGFGTVVQHGGGGVGQTPDDWQNDY 167
Query: 77 SWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPI 136
T R G + F Y TD++ DEA+K + + ++++P F L+ +A HS PY +
Sbjct: 168 FSDTYLRNG---KPEKFQGYCTDIWFDEALKFIEA-DRTKPFFAYLSTNAPHS--PY--L 219
Query: 137 RAPQKLIDAFKYIDDSARQKFAA---VLSKLDESVGKVVKALHTRGLLENSIVVFSTDNG 193
P+ + Y D +K AA +++ +DE++G++++ L GL +N+I++F TDN
Sbjct: 220 VDPEY---SDPYEDKGVPKKMAAFYGMITNIDENMGRLLRYLKESGLEKNTILIFMTDN- 275
Query: 194 GPAAG 198
G AAG
Sbjct: 276 GTAAG 280
>UniRef50_Q1MJX8 Cluster: Putative arylsulfatase precursor; n=1;
Rhizobium leguminosarum bv. viciae 3841|Rep: Putative
arylsulfatase precursor - Rhizobium leguminosarum bv.
viciae (strain 3841)
Length = 517
Score = 79.8 bits (188), Expect = 1e-13
Identities = 77/268 (28%), Positives = 116/268 (43%), Gaps = 20/268 (7%)
Query: 11 PRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDH 70
P GL + L + LK GY T GK HLG E+L N GFD +W + +
Sbjct: 108 PIGLQKEDITLAEILKTEGYATAQFGKNHLGDLN-EHLLCNHGFDE---YWGNLYHLNAN 163
Query: 71 TTMEQGSWGTD--FRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLF---LMLAHS 125
+E +D FR+ F+ + DV + + V E + L
Sbjct: 164 EDLEDQDRPSDPQFRKKFDPRGIVSCTAGGDVKDEGPLSVKRMETFDEEVATKSLSYLDQ 223
Query: 126 AVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAV-------LSKLDESVGKVVKALHTR 178
G P+ + D+S + A + L++ D VG+++ L
Sbjct: 224 RAKDGKPFFLWHNSTRQHVFIHLKDESRKLSRAGIDDTYGNGLAEHDAQVGELLDKLDQT 283
Query: 179 GLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARV 238
GL +N+IVV+++DNG + S P KG K T WEGGVR + P RV
Sbjct: 284 GLAKNTIVVYTSDNGAYQYMWPQGGTS--PFKGDKGTTWEGGVRVPAIIRWPGAPG-GRV 340
Query: 239 AYQKMHISDWLPTLYSAAGGDLSVLENL 266
+ + + ++D+LPTL +AA GD V+E L
Sbjct: 341 SAEIVDMTDFLPTL-AAAAGDNDVVEKL 367
>UniRef50_A6DQW6 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
n=2; Lentisphaera araneosa HTCC2155|Rep:
N-acetylgalactosamine 6-sulfate sulfatase - Lentisphaera
araneosa HTCC2155
Length = 505
Score = 79.8 bits (188), Expect = 1e-13
Identities = 74/289 (25%), Positives = 129/289 (44%), Gaps = 28/289 (9%)
Query: 15 PLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWT--GRIDMYDHTT 72
P +L +D GY T GK G +K GFD +GF + D Y H
Sbjct: 122 PPKHPMLGSVARDAGYATAGFGKLSAGGTEKPETITGYGFDYWLGFLSHFDCRDYYPHHI 181
Query: 73 MEQGSW------GTDFRRGFEVAHDL----------FGVYATDVYTDEAIKVVNSHNK-S 115
E G D G + + G + ++Y D+AI+ + +++
Sbjct: 182 YENGQQIELPKNRPDLLEGTIIPSNKNTSGGVVPPGVGTFTENLYVDKAIEFIKKNSEIK 241
Query: 116 EPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQK-FAAVLSKLDESVGKVVKA 174
+P F+ LA + H G P +R P + +Y + + R+K + A+++ D +VG+++ A
Sbjct: 242 KPFFIYLASTVPHGGMP-GGMRVPD-MAGYDQYEELTLREKVYCALMTHHDRNVGRIIDA 299
Query: 175 LHTRGLLENSIVVFSTDNGGPAAGF--NDNAASNYPLKGVKNTLWEGGVRGAGFLWSP-L 231
+ G+ N+I+++++DNG + + D N L+ K L+EGG+R W P
Sbjct: 300 VEDLGIQNNTIIMWTSDNGDEDSYYLRTDTFKGNGDLRMYKRYLYEGGIRVPLIAWWPGT 359
Query: 232 LDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTE 280
++S + D +PTL A G L+ E +DG++ L +E
Sbjct: 360 IESNSTCDLPTTQY-DLMPTLADAGGKALT--EEMDGISIMPTLRGKSE 405
>UniRef50_Q7UYA9 Cluster: N-acetylgalactosamine-6-sulfatase; n=1;
Pirellula sp.|Rep: N-acetylgalactosamine-6-sulfatase -
Rhodopirellula baltica
Length = 474
Score = 79.4 bits (187), Expect = 2e-13
Identities = 77/288 (26%), Positives = 128/288 (44%), Gaps = 32/288 (11%)
Query: 6 IYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGF---DSHVGF-- 60
I A G+ + E + + L+ GY T + GKWH+G K + + RGF SH GF
Sbjct: 99 ILAAHTGGMRVGEITIAEMLQKRGYATGMFGKWHIGWVKPDEVS-TRGFYSPPSHHGFDE 157
Query: 61 ---WTGRIDMYDHTTMEQG--SWGTD----FRRGFEVAHD----LFGVYATD--VYTDEA 105
T + +D T Q SWG ++ GF H+ + D V D
Sbjct: 158 YFATTSAVPTWDPTITPQDWDSWGNGPGEPWKGGFPYVHNGREAKENLSGDDSRVIMDRV 217
Query: 106 IKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLD 165
I + + N+++P F + A P+EP+ A ++ + S R+ + ++ +D
Sbjct: 218 IPFIEA-NQAKPFFATVWFHA-----PHEPVVAGEEFKKLYPKAG-SKRKNYYGCITAMD 270
Query: 166 ESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGF-NDNAASNYPLKGVKNTLWEGGVRGA 224
+ VG++ L G+ +N++V F +DNG P+ G AS P KG K+T++EGG+
Sbjct: 271 QQVGRLRAKLRELGIEKNTVVFFCSDNG-PSDGLAKKGVASAGPFKGHKHTMYEGGLLVP 329
Query: 225 GFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDL--SVLENLDGVN 270
P + D+LPT+ S G + +DG++
Sbjct: 330 ACAEWPGTIPAGTSTEVRCSTVDFLPTVASIVGDSMVQKATRPIDGID 377
>UniRef50_A4AVA7 Cluster: Aryl-sulphate sulphohydrolase; n=2;
Bacteroidetes|Rep: Aryl-sulphate sulphohydrolase -
Flavobacteriales bacterium HTCC2170
Length = 487
Score = 79.4 bits (187), Expect = 2e-13
Identities = 66/239 (27%), Positives = 107/239 (44%), Gaps = 27/239 (11%)
Query: 20 ILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWG 79
+LP+ L+ YKT GKWHL PL+ GFD ++G H +G
Sbjct: 145 VLPEVLQLNNYKTIHAGKWHLSES-----PLDYGFDINIGGGHN-----GHPKSYYPPYG 194
Query: 80 TDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAP 139
R Y TD+ + I+V+N EP FL A AVH+ P +P+ +
Sbjct: 195 NVKLRSPNKE------YLTDLIARQTIEVLNK--TIEPFFLNYAPYAVHT--PIQPVDSI 244
Query: 140 QKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGF 199
+ K+A ++ LD ++G ++ AL G +N++++F++DNGG
Sbjct: 245 LSKYNRKTAWKGQNNAKYATMVENLDRNIGLLIAALKDNGHYKNTLIIFTSDNGGLY--- 301
Query: 200 NDNAASNYPLKGVKNTLWEGGVRGA-GFLWSPLLDSKARVAYQKMHISDWLPTLYSAAG 257
PL+ K + +EGG+R F+W+ + S + H+ D P++ AAG
Sbjct: 302 --GITKQQPLRAGKGSYYEGGIREPFFFMWNDKIKSNTKSNVPISHL-DLFPSIVEAAG 357
>UniRef50_A6DHI2 Cluster: Aryl-sulphate sulphohydrolase; n=2;
Lentisphaera araneosa HTCC2155|Rep: Aryl-sulphate
sulphohydrolase - Lentisphaera araneosa HTCC2155
Length = 493
Score = 79.0 bits (186), Expect = 2e-13
Identities = 66/245 (26%), Positives = 116/245 (47%), Gaps = 22/245 (8%)
Query: 25 LKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGTDFRR 84
+ GY T +GK+H+ K+ PL G+ +VG GR + G + + +
Sbjct: 125 MNSAGYLTATLGKYHVA---KD--PLTHGWKINVG---GR----EFGGPYNGGYHSPYEY 172
Query: 85 GFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLID 144
+ G Y D TDEAI + H +P+F+ + +H+ P P+
Sbjct: 173 P-NLKETEKGRYLCDHLTDEAIGIFKEHGAQQPIFMYFPYYTIHAPIQGHPKFEPKYKAK 231
Query: 145 AFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAA 204
A K+AA++ LD +VG++V AL +GL E ++++F++DNGG + +
Sbjct: 232 A--KTKGHFNPKYAAMIEALDHNVGRLVAALEEQGLREKTLIMFTSDNGG-----HMKFS 284
Query: 205 SNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVL 263
PL+ K + +EGG+R F W ++++ +R + D+ PT+ AG +L
Sbjct: 285 RQEPLRAGKGSYYEGGIRVPFFASWPGVIEAGSRSQVPVTGL-DFYPTVCELAGVELPDD 343
Query: 264 ENLDG 268
+ +DG
Sbjct: 344 KVVDG 348
>UniRef50_A4ANR8 Cluster: Arylsulfatase; n=2; Bacteroidetes|Rep:
Arylsulfatase - Flavobacteriales bacterium HTCC2170
Length = 589
Score = 79.0 bits (186), Expect = 2e-13
Identities = 74/276 (26%), Positives = 123/276 (44%), Gaps = 40/276 (14%)
Query: 17 NEKILPQYLKDLGYKTHLVGKWHLG-SYKKEYLPLNRGFDS---HVGFWTGRIDMYDHTT 72
NE + + LK YKT + GKWHLG +Y P ++GFD H+ G++ +
Sbjct: 110 NEVTIAEMLKQANYKTGVFGKWHLGDNYPSR--PNDQGFDESLIHLSGGMGQVGDFTTYF 167
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
++ S+ D + + Y +D++ + AI + N +P F L+ +A P
Sbjct: 168 QKERSY-FDPVLWHNGERESYEGYCSDIFAENAIDFIEK-NHDQPFFCYLSFNA-----P 220
Query: 133 YEPIRAPQKLIDAFKYIDDSA-------------------RQKFAAVLSKLDESVGKVVK 173
+ P++ P K +K ID S+ +K A++S +D+++GK+++
Sbjct: 221 HTPLQVPDKYYQQYKDIDPSSGFEDDSRPFVEMTKKNKEDARKVYAMVSNIDDNIGKLMR 280
Query: 174 ALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSP-LL 232
L + EN++VVF TDNG + ++G K +++ GGVR +L P
Sbjct: 281 KLDDLKIAENTLVVFMTDNGPQQVRYVAG------MRGRKGSVYRGGVRVPFYLRYPSKW 334
Query: 233 DSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDG 268
V HI D LPTL L +DG
Sbjct: 335 QGNQDVETTTAHI-DVLPTLSEICDVKLPENRKIDG 369
>UniRef50_A3ZY29 Cluster: Aryl-sulphate sulphohydrolase; n=1;
Blastopirellula marina DSM 3645|Rep: Aryl-sulphate
sulphohydrolase - Blastopirellula marina DSM 3645
Length = 498
Score = 79.0 bits (186), Expect = 2e-13
Identities = 80/279 (28%), Positives = 125/279 (44%), Gaps = 34/279 (12%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
GL + + L+ GY T GKWHL + LP +GFD V F D +
Sbjct: 129 GLAKENVTMAEALQAAGYVTGHFGKWHLAG-PEGALPSEQGFD--VTF-----DSFGEGE 180
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
+ +GS G ++G D GV+ T +A + + + N+ P F LAH A+H
Sbjct: 181 LREGSEGN--KKG--PPDDPKGVFTL---TRKACEFIEA-NQDRPFFCYLAHHAIHG--- 229
Query: 133 YEPIRAPQKLIDAFKY-----IDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVV 187
P++ + ++ FK +D A +AA LD SVG ++ L L + ++V
Sbjct: 230 --PLQGRAETLEKFKAKTRRKLDPGAM--YAACTYDLDASVGMLLAKLDELKLADKTLVA 285
Query: 188 FSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISD 247
F++DNG AAS PL+G K +EGG+R + P + + + + D
Sbjct: 286 FTSDNGA------TQAASQEPLRGSKGGYYEGGIREPLIIRWPGVTQPSSTSDVPVINVD 339
Query: 248 WLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSV 286
+ PT +AAG + + LDG + LS RT +
Sbjct: 340 FYPTFLAAAGAPVPAGKILDGESLLPLLSGAGPLKRTGI 378
>UniRef50_A3XSU6 Cluster: Sulfatase family protein; n=2; Vibrio|Rep:
Sulfatase family protein - Vibrio sp. MED222
Length = 512
Score = 79.0 bits (186), Expect = 2e-13
Identities = 93/333 (27%), Positives = 141/333 (42%), Gaps = 47/333 (14%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
GL + L + LKD GY T VGK HLG +LP GFD GF +++ +
Sbjct: 104 GLQKEDPTLAEMLKDKGYATVHVGKSHLGD-NNSHLPTVHGFDEFFGFLY-HLNVMEMPE 161
Query: 73 MEQGSWGTDFR-RGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE--PL----------- 118
+ +FR R V H + + D+ D VV + PL
Sbjct: 162 QPEFPTDPNFRGRPRNVLHTV-ATESVDMQEDPRFGVVGKQTIEDKGPLGSKRMQTVDGE 220
Query: 119 FLMLA------HSAVHSGNPY----EPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESV 168
FL A H A PY P R QK +Y S + L +LD+ +
Sbjct: 221 FLEFATNWLDRHEAEKDEQPYFMWYNPTRMHQKTHVRPEYQGASQINTYYDGLIELDDQI 280
Query: 169 GKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLW 228
G ++ L G ++N+I++F++DNG + D ++++ +G K T W+GG R +
Sbjct: 281 GVLLDKLEDLGEIDNTIILFTSDNGVNLDHWPDGGSASF--RGQKGTTWDGGFRVPMLVS 338
Query: 229 SPLLDSKARVAYQKMHISDWLPTLYSAAG-GDL--SVLE-----------NLDGVNQWDA 274
P + M DW+PT+ +A G GD+ +L+ +LDG NQ D
Sbjct: 339 WPDKIPQGEYTDGFMTSEDWVPTIMAAVGEGDIKQELLDGKELNGERYQVHLDGYNQLDM 398
Query: 275 LSKNTESPRTS-VLHNIDDIWGIAALTVDKYKL 306
L+K S R +N D + A VD +K+
Sbjct: 399 LTKGEPSQRHEFFFYNEQD---LNAFRVDDWKV 428
>UniRef50_Q8A171 Cluster: Putative secreted sulfatase ydeN; n=10;
Bacteroidetes|Rep: Putative secreted sulfatase ydeN -
Bacteroides thetaiotaomicron
Length = 518
Score = 78.6 bits (185), Expect = 3e-13
Identities = 75/288 (26%), Positives = 131/288 (45%), Gaps = 21/288 (7%)
Query: 23 QYLKDLGYKTHLVGKWHLGSYKKEYL-PLNRGFDSHV-GFWTGRIDMY-DHTTMEQGSWG 79
Q LKD GY T GK H G+ P + GF+ ++ G G + Y G
Sbjct: 151 QLLKDSGYHTIHCGKAHFGAIDTPGEDPHHWGFEVNIAGHAAGGLASYLGEENYGHNKDG 210
Query: 80 TDFR-RGFEVAHDLFGV--YATDVYTDEAIKVVNSHNK-SEPLFLMLAHSAVHSGNPYEP 135
+G + T+ T EAIK +N K ++P +L ++ A+H P
Sbjct: 211 KPISLMAVPGLEKYWGTETFVTEALTLEAIKALNKAKKYNQPFYLYMSQYAIHV-----P 265
Query: 136 IRAPQKLIDAFKYIDDSARQK-FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGG 194
+ ++ D +K + + +A ++ +D+S+G ++ L G +N+I++F +DNGG
Sbjct: 266 LDKDKRFYDKYKKKGMTDHEAAYATLIEGMDKSLGDLMDWLEKSGEADNTIIIFMSDNGG 325
Query: 195 PAAG--FNDNA--ASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLP 250
AA + D N+PL K + +EGG+R + P + + + I D+ P
Sbjct: 326 LAAESYWRDGKLHTQNHPLNSGKGSTYEGGIREPMIVSWPGVVAPGSKCNDYLLIEDFYP 385
Query: 251 TLYSAAG-GDLSVLENLDGVNQWDALSKNTESPR--TSVLHNIDDIWG 295
T+ AG ++ +DG++ + L K T +P S+ N+ + WG
Sbjct: 386 TILEMAGIKKYKTVQPIDGIS-FMPLLKQTRNPSKGRSLFWNMPNNWG 432
>UniRef50_Q15XP0 Cluster: Sulfatase precursor; n=1;
Pseudoalteromonas atlantica T6c|Rep: Sulfatase precursor
- Pseudoalteromonas atlantica (strain T6c / BAA-1087)
Length = 627
Score = 78.6 bits (185), Expect = 3e-13
Identities = 68/230 (29%), Positives = 108/230 (46%), Gaps = 33/230 (14%)
Query: 21 LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDS---HVGFWTGRIDMYDHTTMEQGS 77
L + L++ GY+T + GKWHLG Y P ++GFD H G G+ Y T +
Sbjct: 126 LAESLQENGYRTGIFGKWHLGD-NYPYRPQDQGFDDVLIHGGGGVGQTPDYWGNTQFNDT 184
Query: 78 WGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIR 137
+ +R G + F YAT ++ DEA K ++ + + P F +A +A P+ P R
Sbjct: 185 Y---YRNG---TPEKFSGYATKIWFDEAKKFIDKQHDT-PYFAYIALNA-----PHGPYR 232
Query: 138 APQKLIDAFKYID-DSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGG-- 194
AP+ I+ ++ + F ++S +DE VG++ L + L+N+I +F TDNG
Sbjct: 233 APETHIEPYEKRGLNRDMASFYGMISYIDEQVGELRAHLRAQDQLDNTIFIFMTDNGSSY 292
Query: 195 --------------PAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSP 230
P A N N ++G K ++EGG R F+ P
Sbjct: 293 KPTDAKTHLTKRHLPLAEQYPNWQPNDNMRGYKGEVYEGGHRVPFFISYP 342
>UniRef50_A6DMW2 Cluster: Putative exported uslfatase; n=1;
Lentisphaera araneosa HTCC2155|Rep: Putative exported
uslfatase - Lentisphaera araneosa HTCC2155
Length = 479
Score = 78.6 bits (185), Expect = 3e-13
Identities = 74/278 (26%), Positives = 119/278 (42%), Gaps = 38/278 (13%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
L L + L+ Y+T + GKWHLG+ ++ F+TG+
Sbjct: 118 LSLKLPTFARVLQKNDYRTAMFGKWHLGNEER--------------FFTGK--------- 154
Query: 74 EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPY 133
E ++G D G + ++ T+ ++ + NK +P L L H P+
Sbjct: 155 EHKAYGFDEAFGVSGKAKAYDKGVNEL-TERTLRFLKE-NKKKPFMLCLMHHV-----PH 207
Query: 134 EPIRAP---QKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFST 190
P+ P + L D+ K+A ++S D S+ KV+ AL GL +N++V+ ++
Sbjct: 208 VPVACPPYAKALYDSVPKGKHQKNSKYAGMISHFDNSIKKVLDALRALGLDDNTVVIVTS 267
Query: 191 DNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLP 250
DNGG + N +SN P G K +L+EGG R + P + V + +D+ P
Sbjct: 268 DNGGLS-----NLSSNKPYNGGKGSLYEGGTRVPLLIRWPGKITPGSVNKSVVISNDFFP 322
Query: 251 TLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLH 288
T AG L +LDG + L T RT H
Sbjct: 323 TFLELAGLPLMPEAHLDGKSMMPLLKGKTLGKRTLYWH 360
>UniRef50_P25549 Cluster: Arylsulfatase precursor; n=12;
Proteobacteria|Rep: Arylsulfatase precursor -
Escherichia coli (strain K12)
Length = 551
Score = 78.6 bits (185), Expect = 3e-13
Identities = 100/345 (28%), Positives = 153/345 (44%), Gaps = 62/345 (17%)
Query: 1 MQHGVI----YGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDS 56
+ HG++ YG +P GL LPQ L D GY T +GKWH+G KE P N GFD
Sbjct: 150 IHHGILMPPMYG-QPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGE-NKESQPQNVGFDD 206
Query: 57 HVGFWTGRIDMYD-----HTTME------------QGSWGTD----FRRGFEVA-HDLFG 94
GF DMY H E Q + D R G + A D+
Sbjct: 207 FRGF-NSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITP 265
Query: 95 VYATDV---YTDEAIKVVNSHNKSE-PLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYID 150
Y D+ + D +K ++ KS+ P FL H N P KY
Sbjct: 266 KYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN------YPNA-----KYAG 314
Query: 151 DS-ARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPL 209
S AR + + ++++ + K L G L+N+++VF++DN GP A + + P
Sbjct: 315 SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDN-GPEAEVPPHGRT--PF 371
Query: 210 KGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENL-- 266
+G K + WEGGVR F+ W ++ + R + + ++D PT AG + + NL
Sbjct: 372 RGAKGSTWEGGVRVPTFVYWKGMI--QPRKSDGIVDLADLFPTALDLAGHPGAKVANLVP 429
Query: 267 -----DGVNQWD-ALSKNTESPRTSVLHNIDDIWGIAALTVDKYK 305
DGV+Q L N +S R + + ++ +AA+ +D++K
Sbjct: 430 KTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG--KLAAVRMDEFK 472
>UniRef50_Q7UYD2 Cluster: Sulfatase 1; n=2; Bacteria|Rep: Sulfatase
1 - Rhodopirellula baltica
Length = 478
Score = 78.2 bits (184), Expect = 4e-13
Identities = 82/313 (26%), Positives = 136/313 (43%), Gaps = 32/313 (10%)
Query: 25 LKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGTDFRR 84
LK GY T L GK+ +GS PL GFD+ G ++ + H W +
Sbjct: 144 LKHAGYDTALFGKYSIGSQMGVTDPLAMGFDTWYGMYS---ILEGHRQYPTILWRDGKKL 200
Query: 85 GFEVAH-DLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLI 143
E G YA ++T EAI+ + + + P F++LA+S+ H+ P ++
Sbjct: 201 RIEENEAGRKGAYAQALFTHEAIQYIKQDHDN-PFFVLLAYSSPHAELAAPP-EFVERYK 258
Query: 144 DAF---KY---IDDSARQKFA-----------AVLS----KLDESVGKVVKALHTRGLLE 182
DAF +Y + + K+A AVL+ LD VG++ ++L ++G+ +
Sbjct: 259 DAFPETRYGGMSNGTPSDKYAWYYPEPVERPHAVLAGMVTALDAYVGQIYQSLESKGIAD 318
Query: 183 NSIVVFSTDNGGPAAGFNDNA--ASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAY 240
N++++F++DNG G D ++ P KG+K L++GG+ P RV
Sbjct: 319 NTLILFTSDNGPHDEGGGDPTFFRASEPYKGMKRDLYDGGIHVPMIAHWPAAIRSPRVDD 378
Query: 241 QKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALT 300
+D LPT AG L ++ + N L + + PR L N W
Sbjct: 379 TPWAFADVLPTFADIAGVSLDIVPRVK-TNGVSVLPRLRDDPRP--LPNRTLYWEFGKQA 435
Query: 301 VDKYKLIKGTIYK 313
D + G +Y+
Sbjct: 436 GDPNSGVVGEVYQ 448
>UniRef50_A6DNI1 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
Lentisphaera araneosa HTCC2155|Rep:
N-acetylgalactosamine 6-sulfatase - Lentisphaera
araneosa HTCC2155
Length = 273
Score = 78.2 bits (184), Expect = 4e-13
Identities = 58/194 (29%), Positives = 102/194 (52%), Gaps = 15/194 (7%)
Query: 96 YATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQ 155
Y+TD + EAI+ + NK +P FL +++ P+ P+ A + + F++I D R+
Sbjct: 13 YSTDAFGREAIEFIE-RNKKKPFFLFVSYIT-----PHVPMEAKESDLKRFEHIKDPLRR 66
Query: 156 KFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNT 215
A+++ +D++VG+++K L L +++++ F +DNG G+ NA+ P G K+
Sbjct: 67 TSLAMIACMDDNVGRMLKVLKDNKLEKDTLIFFISDNG----GYPGNASLCTPYSGSKSQ 122
Query: 216 LWEGGVRGAGFL-WSPLLDSKARVAYQKMHIS-DWLPTLYSAAGGDLSVLENLDGVNQWD 273
+ EGG+ + W + + +V Y K IS D PT AAG + LDGV+
Sbjct: 123 MLEGGIHVPFIMQWKGTI-PRGKV-YGKPIISLDIKPTALVAAGATIKDQWQLDGVDLIP 180
Query: 274 ALS-KNTESPRTSV 286
L+ + T P S+
Sbjct: 181 YLNGQKTSDPHESL 194
>UniRef50_A6DMU3 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
Lentisphaera araneosa HTCC2155|Rep:
N-acetylgalactosamine 6-sulfatase - Lentisphaera
araneosa HTCC2155
Length = 448
Score = 77.8 bits (183), Expect = 6e-13
Identities = 57/209 (27%), Positives = 101/209 (48%), Gaps = 15/209 (7%)
Query: 68 YDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKS-EPLFLMLAHSA 126
YD + E G+ F+ H + T+ TD I + S +P + +++ A
Sbjct: 141 YDLSDGETGNVTGGMEDKFQPYHIMDDPKRTNSVTDRTIAFIKEQKSSGKPFYAQVSYYA 200
Query: 127 VHSGNPYEPIRAPQKLIDAFKYIDDSARQK---FAAVLSKLDESVGKVVKALHTRGLLEN 183
H + +K + F+ + R+ FA +L + D ++G+++ AL + +N
Sbjct: 201 THLS-----VELEEKSLKKFQGKGEPDRRYTAGFAGMLQETDRAIGRILDALDELEIADN 255
Query: 184 SIVVFSTDNGG----PAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVA 239
+ V+FS+DNGG P A + NYPL G K+TL EGG+R ++ P + + +
Sbjct: 256 TYVIFSSDNGGRGEIPGAA-TEGLDPNYPLTGYKHTLNEGGIRVPFYVRGPGVKPNS-WS 313
Query: 240 YQKMHISDWLPTLYSAAGGDLSVLENLDG 268
++ + D LP+ Y AGG ++ E +DG
Sbjct: 314 HEIVSSYDLLPSFYELAGGTEALPETVDG 342
>UniRef50_UPI0000586CBD Cluster: PREDICTED: similar to MGC86251
protein; n=4; Strongylocentrotus purpuratus|Rep:
PREDICTED: similar to MGC86251 protein -
Strongylocentrotus purpuratus
Length = 525
Score = 77.4 bits (182), Expect = 7e-13
Identities = 81/314 (25%), Positives = 130/314 (41%), Gaps = 28/314 (8%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLG-SYKKEYLPLNRGFDSHVGFWTG------RI 65
GLPLNE ++ + LK GY++ VGKWHLG YLP N GFD +G +
Sbjct: 103 GLPLNETLISKMLKPEGYRSAAVGKWHLGLGNNSVYLPHNHGFDEFLGLPASPSQCRCSV 162
Query: 66 DMYDHTTMEQGSWGTDFR-----RGFEVAHDLFGVYA-TDVYTDEAIKVVNSH-NKSEPL 118
Y + T + ++ G + + D Y ++ + + ++ P
Sbjct: 163 CFYPNVTCHRAPCSPEYSPCALFNGTTIIEQPADLLTLDDKYAMQSRRFIRTNVETGTPF 222
Query: 119 FLMLAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTR 178
FL A S + + P A ++ S R +F L+ LD VG++ + L
Sbjct: 223 FLYYA-----SHHTHHPQYAGKETSGT------SIRGRFGDSLAALDWEVGQIYEELKEN 271
Query: 179 GLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARV 238
G+LE++ FS+DN GP+ + + +K K T +EGG+R + P + R
Sbjct: 272 GILEDTFFFFSSDN-GPSLSLENFGGNAGLMKCGKATTYEGGIRVPAIVHWPGQITPGR- 329
Query: 239 AYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAA 298
+ + D LPT+ S L + LDG + L + S R S + +
Sbjct: 330 SMELSSTLDVLPTIASITNAKLPNV-TLDGYDMSPFLFQGMPSLRESFFYYPSKVDTEHK 388
Query: 299 LTVDKYKLIKGTIY 312
+YK K Y
Sbjct: 389 SYAVRYKQYKAVFY 402
>UniRef50_Q8D7K3 Cluster: Arylsulfatase A; n=16; Bacteria|Rep:
Arylsulfatase A - Vibrio vulnificus
Length = 521
Score = 77.4 bits (182), Expect = 7e-13
Identities = 72/254 (28%), Positives = 108/254 (42%), Gaps = 15/254 (5%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
G+P + LK+ GY T GK HLG + ++LP N GFD G ++ +
Sbjct: 105 GIPDWAPTIADLLKEQGYMTAQFGKNHLGD-QDQHLPTNHGFDEFFGNLY-HLNAEEEPE 162
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEA--IKVVNSHNKSEPLFLMLA--HSAVH 128
+FR+ + + YA D + H E L LA AV
Sbjct: 163 TYYYPKDPEFRKNYG-PRGVIKSYADGKIEDTGPMTRKRMEHADEEFLESSLAFMEKAVK 221
Query: 129 SGNPY----EPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENS 184
+ P+ R KY S +A + + D+ VG ++ L G+ +N+
Sbjct: 222 ADKPFFIWHNTTRMHVWTRLQEKYQGKSGVSIYADGMLEHDDQVGILLDKLDELGVADNT 281
Query: 185 IVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKM 243
IV++STDNG + D A+ P G K T WEGG+R + W ++ ++
Sbjct: 282 IVIYSTDNGAETVTWPDGGAT--PFYGEKGTTWEGGMRVPQLVRWPGVIKPGTKINDMMA 339
Query: 244 HISDWLPTLYSAAG 257
H DWLPTL +AAG
Sbjct: 340 H-QDWLPTLMAAAG 352
>UniRef50_A0IXQ1 Cluster: Sulfatase precursor; n=1; Shewanella
woodyi ATCC 51908|Rep: Sulfatase precursor - Shewanella
woodyi ATCC 51908
Length = 358
Score = 77.4 bits (182), Expect = 7e-13
Identities = 65/229 (28%), Positives = 111/229 (48%), Gaps = 32/229 (13%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
L L E L + K GY+T GKWH+G + YLP ++GFD ++G H
Sbjct: 125 LALTELTLAEAFKSQGYETFFAGKWHMGG--EGYLPTDQGFDINIGGM--------HRGS 174
Query: 74 EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPY 133
G + ++ + + G + T TDE I + S +P F +L++ VH+
Sbjct: 175 PPGGYYDPYKNP-NLPNRNKGEHLTKRLTDETIDFL-SQKHEKPFFALLSYYGVHTPLQA 232
Query: 134 EPIR-----------APQK--LIDAFKYIDDSARQ---KFAAVLSKLDESVGKVVKALHT 177
P + A +K LID Q +A+++ +D+SVG+++++L
Sbjct: 233 GPDKLAYFKEKTNTVAGEKAFLIDKGHQSRTQINQVDANYASMIWAVDKSVGRILESLEK 292
Query: 178 RGLLENSIVVFSTDNGGPAAGFNDN----AASNYPLKGVKNTLWEGGVR 222
+GL +N++VV ++DNGG + + + +N PL+ K ++EGGVR
Sbjct: 293 QGLDKNTLVVLTSDNGGFSTRHQGDERVTSTANLPLRSGKGWVYEGGVR 341
>UniRef50_Q7UYA6 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
n=1; Pirellula sp.|Rep: N-acetylgalactosamine 6-sulfate
sulfatase - Rhodopirellula baltica
Length = 490
Score = 77.0 bits (181), Expect = 1e-12
Identities = 73/267 (27%), Positives = 121/267 (45%), Gaps = 39/267 (14%)
Query: 17 NEKILPQYLKDLGYKTHLVGKWHLGSYKK-----EYLPLNRGFDSHVGFWTGRIDMYDHT 71
+E + + LK GY + GKW L + + + LP +GFD G T + +
Sbjct: 102 DEVTIAEVLKTKGYASACFGKWDLAKHAQSGFFSDLLPTGQGFDYFYGTPTSNDRVANLY 161
Query: 72 TMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGN 131
E+ E D+ + T YTDEAI + N+++P F+ + H+ H+
Sbjct: 162 RNEEL---------IEPESDMATL--TRRYTDEAISFIEK-NQNQPFFVYIPHTMPHTR- 208
Query: 132 PYEPIRAPQKLIDAFK-YIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFST 190
+DA K + S R + V+ ++D +VG+++ +L+ L +N+ V+F++
Sbjct: 209 -----------LDASKDFKGKSKRGLYGDVIEEIDFNVGRILDSLNELNLADNTYVLFTS 257
Query: 191 DNG-------GPAAG--FNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQ 241
DNG G A G D+ S PL+ K + +EGGVR LW+P V
Sbjct: 258 DNGPWLVKNKGHADGHRLGDHGGSAGPLRSGKVSTFEGGVRVPAILWAPGKVPAGTVCDS 317
Query: 242 KMHISDWLPTLYSAAGGDLSVLENLDG 268
D +PTL + AG ++ +DG
Sbjct: 318 IATTMDVMPTLAALAGAEIPTDRVIDG 344
>UniRef50_A6DU75 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
n=1; Lentisphaera araneosa HTCC2155|Rep:
N-acetylgalactosamine 6-sulfate sulfatase - Lentisphaera
araneosa HTCC2155
Length = 537
Score = 77.0 bits (181), Expect = 1e-12
Identities = 59/181 (32%), Positives = 90/181 (49%), Gaps = 19/181 (10%)
Query: 18 EKILPQYLKDLGYKTHLVGKWHLG-SYKKEYLPLNRGFDS---HVGFWTGRI-DMYDHTT 72
EK L + KD GYKT + GKWHLG SY Y P RGF+ H G G++ D + +T
Sbjct: 98 EKTLANFFKDAGYKTAIFGKWHLGMSY--PYAPRFRGFEESFIHGGGGIGQLEDAHGNTH 155
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNP 132
++ W G V Y++D+ D+AI + NK +P F ++ A H+ P
Sbjct: 156 IDAHYW----HNGKLVPSK---GYSSDILFDKAIDFIEK-NKDKPFFCFVSTPATHA--P 205
Query: 133 YEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDN 192
Y+ K I A + +++ +D+ VGK++K L L +N+IV+ +TD
Sbjct: 206 YQEHPEAAKRIRARGI--TTGNIALYSMIENIDDCVGKILKKLDDLKLKDNTIVIIATDQ 263
Query: 193 G 193
G
Sbjct: 264 G 264
>UniRef50_A6DI17 Cluster: N-acetylgalactosamine-4-sulfatase; n=1;
Lentisphaera araneosa HTCC2155|Rep:
N-acetylgalactosamine-4-sulfatase - Lentisphaera
araneosa HTCC2155
Length = 590
Score = 77.0 bits (181), Expect = 1e-12
Identities = 58/214 (27%), Positives = 98/214 (45%), Gaps = 16/214 (7%)
Query: 21 LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGT 80
+ + +D GY T GKWHLG P+++GFD V G + T
Sbjct: 102 IAEAFRDQGYATGHFGKWHLGD-NYPMRPMDQGFDEVVALGCGAVGQIGDYWANDYFDDT 160
Query: 81 DFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQ 140
G + + Y TDV+ +E ++ + K +P F+ LA + H P+
Sbjct: 161 YIHNG---EYKKYEGYCTDVFFNETMRFIKE-TKDKPFFIYLAPNVTHL-----PLIVAD 211
Query: 141 KLIDAFKYIDDSARQKFAA---VLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAA 197
K ++ID+ K A ++ LDE+ G+++ L G LEN+I++++TD+G A
Sbjct: 212 KYSQ--RHIDNGINPKLATFYGMVDNLDENFGRLMDCLKEEGELENTILLYTTDDGMQGA 269
Query: 198 GFNDNAASNYP-LKGVKNTLWEGGVRGAGFLWSP 230
N + + ++G K + EGG R + F+ P
Sbjct: 270 AGNSTPTTWFKGMRGKKGSKEEGGHRVSCFMSWP 303
>UniRef50_A3ZLD4 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
Blastopirellula marina DSM 3645|Rep:
N-acetylgalactosamine 6-sulfatase - Blastopirellula
marina DSM 3645
Length = 587
Score = 77.0 bits (181), Expect = 1e-12
Identities = 80/277 (28%), Positives = 119/277 (42%), Gaps = 38/277 (13%)
Query: 4 GVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTG 63
GV G E L ++E+ + +D GY T GKWH G+ + Y P RGFD GF +G
Sbjct: 93 GVSTGQER--LNVDEQTFVEAFRDAGYATAAFGKWHNGT-QFPYHPNARGFDEFCGFCSG 149
Query: 64 RIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLA 123
Y +E + RG + D T+ AI+ + H + EP +
Sbjct: 150 HWGNYFDPLLEHNN---QLIRGEG--------FIVDDLTNRAIQFIERH-QDEPFLCYVP 197
Query: 124 HSAVHSGNPYEPIRAPQKLIDAFKYID------DSARQKFA------AVLSKLDESVGKV 171
+ HS P++ P K D F +D D ++ A A+ +D +VG+V
Sbjct: 198 FNTPHS-----PMQVPDKFYDKFADVDFEMKNRDPQKEDLAMTRAALAMCENIDWNVGRV 252
Query: 172 VKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPL 231
++ L L +++IVV+ +DNG + +N +KG K + EGGVR F+ P
Sbjct: 253 LQKLDDLKLTDDTIVVYFSDNGPNSWRWNGG------MKGRKGSTDEGGVRSPLFIRWPK 306
Query: 232 LDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDG 268
S Q D PTL AG + LDG
Sbjct: 307 HISAGLKIEQVAGAIDLGPTLADLAGVKFQPQKRLDG 343
>UniRef50_Q7UIU1 Cluster: Arylsulfatase A; n=1; Pirellula sp.|Rep:
Arylsulfatase A - Rhodopirellula baltica
Length = 529
Score = 76.6 bits (180), Expect = 1e-12
Identities = 86/357 (24%), Positives = 147/357 (41%), Gaps = 30/357 (8%)
Query: 4 GVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLG----------SYKKEYL--PLN 51
GV+ G +P + L L+ GY T ++GKWHLG + K L P N
Sbjct: 120 GVLSGTSEHLIPGDRATLGHLLQGAGYHTAMIGKWHLGWDWHKNGKEIDFSKPVLNGPDN 179
Query: 52 RGFDSHVGFWTGRIDMYDHTTMEQGSWGT--DFRRGFEVAHDLFGVYATDVYTDE-AIKV 108
GFD + G G +DM + ++ G+ + + G + +G Y D+ I+
Sbjct: 180 NGFDQYYGH-CGSLDMPPYVWVDTGTPTSVPTRKEGVTKKQNPYGWYRNGPIGDDFEIEQ 238
Query: 109 VNSHNKSEPLFLMLAHSAVHSGNP---YEPIRAPQK-LIDAFKYIDDSARQKFAAVLSKL 164
V H + + + V P Y P+ AP ++ + D S +A + ++
Sbjct: 239 VLPHLFDKSIAYV--EERVKEDKPFFLYLPLPAPHTPIVPVPPFKDASGMNPYADFVMQM 296
Query: 165 DESVGKVVKALHTRGLLENSIVVFSTDNG-GPAAGFNDNAASNY----PLKGVKNTLWEG 219
D +G+++ A+ G+ EN++V+F++DNG P A F + A + +G K ++EG
Sbjct: 297 DHHMGQLLDAISKAGIDENTLVIFTSDNGCSPEANFGELAKHGHDPSGKYRGHKADIYEG 356
Query: 220 GVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNT 279
G R + P + ++D TL S DG + D +
Sbjct: 357 GHRVPFIVRWPGKVVAGKTTNALTCLTDVYATLQSITDQPREATGGEDGFDLTDVFGGDD 416
Query: 280 ESPRTSVLHNIDDIWGIAALTVDKYKLIKGTIYKGVWDNWYGPSGREGAYNASLLYD 336
S R +++ + I G A+ D +KL + G W N P + L+D
Sbjct: 417 SSDREALVSH--SIGGSFAIRRDSWKLCL-SHGSGGWSNPREPKAKLQGLPPMQLFD 470
>UniRef50_A6BYR0 Cluster: N-acetyl-galactosamine-6-sulfatase; n=1;
Planctomyces maris DSM 8797|Rep:
N-acetyl-galactosamine-6-sulfatase - Planctomyces maris
DSM 8797
Length = 658
Score = 76.6 bits (180), Expect = 1e-12
Identities = 66/235 (28%), Positives = 109/235 (46%), Gaps = 21/235 (8%)
Query: 17 NEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDS--HVGFWTGRIDMYDHTTME 74
N+ L + L+D GY+T GKWHLG + P +GF++ H G + +
Sbjct: 130 NQYTLAEALRDAGYRTGHFGKWHLG-LTTPHRPDKQGFETVWHCAPDPGPPSYFSPYGVT 188
Query: 75 QGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYE 134
T R + G + TD T EAI+ + +H +SEP FL L H +VH P++
Sbjct: 189 PTGKPTAQHRVGNITDGPDGEHITDRLTSEAIQFMEAH-RSEPFFLNLWHYSVHG--PWQ 245
Query: 135 PIRAPQKLIDAFKYIDDSARQK---FAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
+ + K D Q+ A++L +DES+G++++ L L +N++ +F +D
Sbjct: 246 --HKAEYTAEFAKKQDPRKEQRNPVMASMLRNVDESLGRILQKLDELKLADNTLFIFYSD 303
Query: 192 NGGPAAGFNDN------AASNYPLKGVKNTL--WEGGVRGAGFLWSPLLDSKARV 238
NGG A ++ + +PL N+ W GG +PL + K R+
Sbjct: 304 NGGNAHSWSSDDPKLKKITDKHPLYKTINSYRKWAGGEPPTNN--APLREGKGRI 356
>UniRef50_P50473 Cluster: Arylsulfatase precursor; n=7;
Echinoida|Rep: Arylsulfatase precursor -
Strongylocentrotus purpuratus (Purple sea urchin)
Length = 567
Score = 76.6 bits (180), Expect = 1e-12
Identities = 76/304 (25%), Positives = 129/304 (42%), Gaps = 28/304 (9%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLG-----SYKKEYLPLNRGFDSHVG--FWTGRI 65
GLPL E + + +K GY T +VGKWHLG S +LP NRGFD VG G
Sbjct: 147 GLPLYEVTIAEAMKGAGYTTGMVGKWHLGINENSSSDGAHLPANRGFD-FVGHNLPFGNS 205
Query: 66 DMYDHTTMEQGSWGTD----FRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLM 121
D T + Q T+ + VA T + D+ + + N ++P F+
Sbjct: 206 WRCDDTGLHQDFPDTNACFLYYNSTSVAQPFQHKGLTQLLRDDTVGFIED-NVNKPFFMY 264
Query: 122 LAHSAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLL 181
++ + +H+ L + + S R ++ L ++D+++ ++V L +
Sbjct: 265 VSFAHMHT-----------SLFSSDDFSCTSRRGRYGDNLREMDQAIEQIVTTLVDNDID 313
Query: 182 ENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQ 241
+N+++ F++D+ GP + +G K WEGG R ++ P S V+++
Sbjct: 314 DNTVIFFTSDH-GPHREYCGEGGDANVFRGGKGQSWEGGHRIPYIVYWPGTISPG-VSHE 371
Query: 242 KMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHNIDDIWGIAALTV 301
+ D + T + G L DG L + SP + D + A+ V
Sbjct: 372 IVTSMDIIATAVNLGGSQLPTDRIYDGKCLKSVLLEGASSPHDDFFYYCKDT--LMAVRV 429
Query: 302 DKYK 305
KYK
Sbjct: 430 GKYK 433
>UniRef50_UPI0000E0F7DD Cluster: aryl-sulphate sulphohydrolase; n=3;
alpha proteobacterium HTCC2255|Rep: aryl-sulphate
sulphohydrolase - alpha proteobacterium HTCC2255
Length = 493
Score = 76.2 bits (179), Expect = 2e-12
Identities = 64/216 (29%), Positives = 99/216 (45%), Gaps = 34/216 (15%)
Query: 12 RGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFD-----SHVGFWTGRID 66
RGL + + + LK GY T GKWHLG+ P +GFD SH G
Sbjct: 130 RGLTTDIITIGESLKTAGYTTGTFGKWHLGAD-----PDKQGFDVNVAGSHQGMTFHYFS 184
Query: 67 MYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSA 126
Y +E G G Y T+ T E I V S +K +P F + +
Sbjct: 185 PYQLPNIEDGPKGE---------------YLTERLTTEVIDWVKS-SKDQPFFAYVPYYT 228
Query: 127 VHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIV 186
VH+ PY+ + K I +AA++ +D++VG++ L + GL EN++V
Sbjct: 229 VHT--PYQAVVDKVNKYHE-KGIKSKREATYAAMVEHMDDNVGRIFDMLDSEGLAENTVV 285
Query: 187 VFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVR 222
+F++DNGG ++ PL+G K + ++GG+R
Sbjct: 286 IFTSDNGGYRM-----SSFPTPLRGGKGSYYDGGLR 316
>UniRef50_UPI00006A2B15 Cluster: UPI00006A2B15 related cluster; n=1;
Xenopus tropicalis|Rep: UPI00006A2B15 UniRef100 entry -
Xenopus tropicalis
Length = 323
Score = 75.8 bits (178), Expect = 2e-12
Identities = 76/261 (29%), Positives = 122/261 (46%), Gaps = 35/261 (13%)
Query: 16 LNEKI--LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
LN+++ LPQ +KD GY T + GKWHLG+ + P +RGF+ G + M
Sbjct: 78 LNQRVAALPQIMKDGGYWTVMAGKWHLGA-SEGMQPNHRGFERSYALMDGGASHFKQKVM 136
Query: 74 EQGSWG---TDFRRGFEVAHDL-FGVYATDVYTDEAIKVV-NSHNKSEPLFLMLAHSAVH 128
S T G +V DL Y++ YTD+ + + + + +P F A++A
Sbjct: 137 RLASEAPEPTYLENGQKV--DLPDDFYSSRTYTDKLMTYLKDPQREGKPFFAYAAYTA-- 192
Query: 129 SGNPYEPIRAP----QKLIDAFKYIDDSARQK--------FAAVLSKLDESVGKVVKALH 176
P+ P++AP QK + D Q+ +AA + LD +VG+++ L
Sbjct: 193 ---PHLPLQAPDDELQKKRGQYDVGYDVIAQRRIARTMEVYAAQVRDLDRNVGRLIDNLK 249
Query: 177 TRGLLENSIVVFSTDNGGPAAGFNDNAA-SNYPLKGVKNTLWEGGVRGAGFLWSPLLDSK 235
G +N++++F +DNG ND AA ++ K KN EGG+R F+ P K
Sbjct: 250 ASGQYDNTLIIFLSDNGPEG---NDWAADGSFDPKWFKN---EGGIRSPSFVSYP-GHVK 302
Query: 236 ARVAYQKMHISDWLPTLYSAA 256
+ Q + + D PT+ A
Sbjct: 303 PGKSEQILTVKDIAPTILDVA 323
>UniRef50_Q4SI19 Cluster: Chromosome 5 SCAF14581, whole genome
shotgun sequence; n=4; Euteleostomi|Rep: Chromosome 5
SCAF14581, whole genome shotgun sequence - Tetraodon
nigroviridis (Green puffer)
Length = 554
Score = 75.8 bits (178), Expect = 2e-12
Identities = 53/188 (28%), Positives = 91/188 (48%), Gaps = 19/188 (10%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHT- 71
G+ +E +LPQ LK GY + +VGKWHLG ++ +YLPL GFD +G Y+++
Sbjct: 92 GISKDEILLPQMLKKRGYISKIVGKWHLG-HRPQYLPLEHGFDEWLGAPNCHFGPYNNSV 150
Query: 72 -----TMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHS 125
+ F + + T +Y E++ V +++ P FL A
Sbjct: 151 KPNIPVYNNSEMLGRYYEEFRIDRKMGESNLTQMYLLESLDFVRRQAEAQRPFFLYWAPD 210
Query: 126 AVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSI 185
A H+ P+ A + ++ S R ++ + +LD SVG+++ L + G+ N+
Sbjct: 211 ATHA-----PVYASK------GFLGKSQRGRYGDAVVELDYSVGEILSLLRSLGIDNNTF 259
Query: 186 VVFSTDNG 193
V F++DNG
Sbjct: 260 VFFTSDNG 267
>UniRef50_A6DHW4 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
Lentisphaera araneosa HTCC2155|Rep:
N-acetylgalactosamine 6-sulfatase - Lentisphaera
araneosa HTCC2155
Length = 512
Score = 75.8 bits (178), Expect = 2e-12
Identities = 84/314 (26%), Positives = 138/314 (43%), Gaps = 56/314 (17%)
Query: 4 GVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF--- 60
G Y E GLP E+ + + LK +GYKT VGK H+ K++ P++ GFD +GF
Sbjct: 87 GAYYYGEG-GLPKEEQTIAEALKSIGYKTMKVGKTHMNKGFKQH-PMDHGFDDFLGFIDH 144
Query: 61 -WT------GRIDMYDHTTMEQGSWGT-----DFRRGFEVAHDLFGVYATDVYTDEAIKV 108
W +D Y + G G RG+E TDV+T EA K
Sbjct: 145 SWDFFMLSQEHLDAYKKRAKKAGHKGNIKFLGPLMRGYEKNASFKDTNITDVFTVEAQKF 204
Query: 109 VNSHNKSEPLFLMLA----HSAVH-------------------SGNPYE-PIRAPQ--KL 142
+ NK EP +L L+ H+ +H + + +E P+ P+ K
Sbjct: 205 I-VENKDEPFYLRLSFNAVHTPLHLVPEELAKKHGIKQPKWDPNASTWEYPLWDPKTLKY 263
Query: 143 IDAFKYI------DDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPA 196
+ +K + D R K+ L +D+++GK++K L + + +N+++ FS+DNGG
Sbjct: 264 NEWYKQVCHLQNPDPYGRLKYLIHLEMIDQAIGKILKTLDEQQIRDNTLIFFSSDNGGS- 322
Query: 197 AGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAA 256
+ + A+N L K ++ +G + + P KA + + D T+
Sbjct: 323 ---HQSYANNGHLNAFKYSVMDGALHVPFLVSYPAKLPKANKSDALVSHMDIFATIADLT 379
Query: 257 GGDLSVLENLDGVN 270
G LS LDG++
Sbjct: 380 G--LSPKNKLDGLS 391
>UniRef50_A6DG52 Cluster: Arylsulphatase A; n=1; Lentisphaera
araneosa HTCC2155|Rep: Arylsulphatase A - Lentisphaera
araneosa HTCC2155
Length = 419
Score = 75.8 bits (178), Expect = 2e-12
Identities = 73/264 (27%), Positives = 115/264 (43%), Gaps = 29/264 (10%)
Query: 25 LKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGTDFRR 84
+K+ GY T + GKW L + +K L + GFD++ W+ Y T + W R
Sbjct: 96 MKEAGYATAVAGKWQLYTGRKGSLAPDCGFDTYC-LWS-----YPGTERSR-FWNPSLIR 148
Query: 85 GFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQKLID 144
+ Y D+ TD I + NKS+P F VHS P+ P P D
Sbjct: 149 DGKKVPVTPNSYGPDICTDFIIDFIKK-NKSQPFFAYYPMLLVHS--PFVP--TP----D 199
Query: 145 AFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAA 204
+ + + + ++S +D+ +G+++ L L +N+IV+F+TDNG
Sbjct: 200 SKDKNSTNKLENYRDMVSYMDKCIGRIIDTLEETNLRKNTIVLFTTDNG-------TGRP 252
Query: 205 SNYPLKGVKNT-----LWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGD 259
YP KG K +GG + P + S+ V+ + SD+LPTL +G +
Sbjct: 253 LTYPYKGEKRVGEKAYPTDGGSHVPLIVNGPGIVSQGLVSDDIVDFSDFLPTLADISGAN 312
Query: 260 LSVLENLDGVNQWDALSKNTESPR 283
L + LDG + W SPR
Sbjct: 313 LPNV-TLDGRSFWPQCLGKKGSPR 335
>UniRef50_A3ZVD1 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
n=2; Planctomycetaceae|Rep: N-acetylgalactosamine
6-sulfate sulfatase - Blastopirellula marina DSM 3645
Length = 496
Score = 75.8 bits (178), Expect = 2e-12
Identities = 74/279 (26%), Positives = 123/279 (44%), Gaps = 31/279 (11%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
L L E L + LK GY T GKWHLG + P ++GFD ++G R Y
Sbjct: 136 LALEETTLAEALKQRGYATFFAGKWHLGP--EGNWPEDQGFDVNIG-GIDRGGPYGGKKY 192
Query: 74 EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSG--- 130
G + G + D E +K + H + +P L+ +VH+
Sbjct: 193 FSPYGNPRLTDGPD------GEHLPDRLASETVKFIEQH-QDQPFLAYLSFYSVHTPLMA 245
Query: 131 -----NPYEPIRAPQKLIDAFKYIDDSARQK---------FAAVLSKLDESVGKVVKALH 176
Y+ I+ Q++ A + + K +A ++ +D +VGKV+ AL
Sbjct: 246 REDLKQKYDEIK--QRIRFAGPIWGEEGKSKLRLVQEHSVYAGMVEAMDAAVGKVLDALD 303
Query: 177 TRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKA 236
L +N++V+F++DNGG + + SN PL+G K ++EGG+R + P + S
Sbjct: 304 RLKLTDNTLVIFTSDNGGLSTS-EGHPTSNLPLRGGKGWMYEGGIREPLVVRYPGVTSPG 362
Query: 237 RVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDAL 275
+ + D+LPT+ + ++ DGV+ AL
Sbjct: 363 SESDALVTSPDFLPTILAVVDKPGDKIDT-DGVSIISAL 400
>UniRef50_Q15SA2 Cluster: Sulfatase precursor; n=1;
Pseudoalteromonas atlantica T6c|Rep: Sulfatase precursor
- Pseudoalteromonas atlantica (strain T6c / BAA-1087)
Length = 724
Score = 75.4 bits (177), Expect = 3e-12
Identities = 78/301 (25%), Positives = 128/301 (42%), Gaps = 29/301 (9%)
Query: 21 LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSWGT 80
L K GY T GKWHLG++ Y P GFD + + G G +
Sbjct: 133 LSSIAKANGYHTAHFGKWHLGAHP--YSPSEHGFDIDIPNFQG--------AGPTGGYLA 182
Query: 81 DFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQ 140
+ ++ + G + EA K + S P FL +VH+ P A
Sbjct: 183 PWSFAPDIQPQIAGEHIDIRLAKEAKKWIFSVKDDGPFFLNFWAFSVHA-----PFNADA 237
Query: 141 KLIDAF----KYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPA 196
ID F +AA++ + D+++G + +AL + +N+I++F++DNGG
Sbjct: 238 DEIDYFINKRSGFHSQRNATYAAMVKQFDDAIGVLWQALVEAKVEKNTIIIFTSDNGGNM 297
Query: 197 AGF--NDNAASNYPLKGVKNTLWEGGVR-GAGFLWSPLLDSKARVAYQKMHISDWLPTLY 253
N +A SN+PLKG K T +EGG++ +W P L ++ + +D+ PTL
Sbjct: 298 YTVVGNTHATSNFPLKGGKATEYEGGLKVPTAVIW-PGLTQPNTLSNTPIQTADFFPTLL 356
Query: 254 SAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLH-----NIDD-IWGIAALTVDKYKLI 307
+ +DG + L T R + + D + A +T+D +KLI
Sbjct: 357 NGVNLSWPSTHIVDGRDIRPVLQGGTLETRAIFTYYPAEPKVPDWLPPSATVTLDGWKLI 416
Query: 308 K 308
+
Sbjct: 417 R 417
>UniRef50_A6DID7 Cluster: Aryl-sulphate sulphohydrolase; n=2;
Lentisphaera araneosa HTCC2155|Rep: Aryl-sulphate
sulphohydrolase - Lentisphaera araneosa HTCC2155
Length = 523
Score = 74.9 bits (176), Expect = 4e-12
Identities = 69/258 (26%), Positives = 117/258 (45%), Gaps = 27/258 (10%)
Query: 17 NEKI-LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQ 75
+EK+ + LK +GY T + GKWH+ + + ++ G + G D+ DH+ +
Sbjct: 143 DEKVSFAEALKKVGYSTAMYGKWHISGHGRYGSGVDGGVSPQM---QGFDDVIDHSARDL 199
Query: 76 GSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVN-SHNKSEPLFLMLAHSAVHSGNPYE 134
S F++ + +F YT AI+ S ++P + LAH AVH+GN
Sbjct: 200 DSL---FKKNGD-PKQMF------TYTKRAIEFAEKSTQDNKPFMIYLAHHAVHTGNDVG 249
Query: 135 PIRAPQKLIDAFKYI---DDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTD 191
+K K + ++ +AA L+ D S+G ++ L + +N++++F +D
Sbjct: 250 SRTETRKYFTDKKSMGKYEEKVNTSYAAHLADTDTSIGLLLDKLEELKIKDNTVIMFLSD 309
Query: 192 NGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPT 251
NGG + PL+ K + +EGG+R F+ P M I D PT
Sbjct: 310 NGGIPTRLHQK-----PLRSWKGSYYEGGIRVPFFISWPKQFKPTETDVPAMAI-DLYPT 363
Query: 252 LYSAAGGDLSVLEN-LDG 268
+ AG + +EN LDG
Sbjct: 364 MLELAG--VKDIENHLDG 379
>UniRef50_A6C430 Cluster: Arylsulphatase A; n=1; Planctomyces maris
DSM 8797|Rep: Arylsulphatase A - Planctomyces maris DSM
8797
Length = 503
Score = 74.9 bits (176), Expect = 4e-12
Identities = 66/264 (25%), Positives = 116/264 (43%), Gaps = 19/264 (7%)
Query: 11 PRGLPLNEKILPQYLKDLGYKTHLVGKWHL-GSYKK--EYLPLNRGFDSHVGFWTGRIDM 67
P + E + L+ GY T VGKWHL G + + P + GFD +
Sbjct: 110 PMHVRKREITIATLLRQAGYATCHVGKWHLNGMFNMVGQPQPSDHGFDHWFSTQNNALPT 169
Query: 68 YDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIK-VVNSHNKSEPLFLMLAHSA 126
+++ +F R L G +A+ + DEA + + +K +P F+ +
Sbjct: 170 HENPF--------NFVRNARPVGPLQG-FASQLVADEAEEWLTQLRDKEKPFFMFVCFH- 219
Query: 127 VHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIV 186
P+EPI + ++ + + S ++++D++ G+++K L + L EN+++
Sbjct: 220 ----EPHEPIASAERFRKLYTAPEGSTLPAHHGNVTQMDDAFGRILKTLDDQKLRENTLI 275
Query: 187 VFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHIS 246
+F++DN GPA S+ PL+ K +EGG+R G + P + +
Sbjct: 276 IFTSDN-GPAITRRHPHGSSGPLRDKKGATYEGGIRVPGIVQWPEHVQPGTTSDVPVCGV 334
Query: 247 DWLPTLYSAAGGDLSVLENLDGVN 270
D LPTL + A LDG N
Sbjct: 335 DILPTLCAVADIPAPTDRVLDGTN 358
>UniRef50_A6C284 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
Planctomyces maris DSM 8797|Rep: N-acetylgalactosamine
6-sulfatase - Planctomyces maris DSM 8797
Length = 605
Score = 74.9 bits (176), Expect = 4e-12
Identities = 84/312 (26%), Positives = 142/312 (45%), Gaps = 43/312 (13%)
Query: 17 NEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQG 76
+E + Q K GY T GKWH G+ + P +GFD + GF +G Y ++
Sbjct: 121 DEYTIAQAFKAAGYATGAFGKWHNGTQYPNH-PNAKGFDEYYGFTSGHWGHYFSPMLDHN 179
Query: 77 SWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKS-EPLFLMLAHSAVHSGNPYEP 135
GT F +G Y TD TD+A+ + ++ +P F L + HS P
Sbjct: 180 --GT-FVKG--------NGYITDDLTDKAMAFIEQQVQNHKPFFAYLPYCTPHS-----P 223
Query: 136 IRAPQKLIDAFK------YIDDSARQK------FAAVLSKLDESVGKVVKALHTRGLLEN 183
++ P + D FK + + R++ A+ +D +VG+V+K L++ + ++
Sbjct: 224 MQVPDQYWDRFKDKQLKLHNREPDREQPDHLRAALAMCENVDWNVGRVLKKLNSLRITDD 283
Query: 184 SIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQK 242
+IV++ +DNG +N + +KG K +L EGGVR + W L + V Q
Sbjct: 284 TIVIYFSDNGPNGVRWNGD------MKGKKGSLDEGGVRSPFVIRWPGHLPAGQEV-NQI 336
Query: 243 MHISDWLPTLYSAAGGDLSVLENLDGVNQWD-ALSKNTESPRTSVLHNIDDIWGIAALTV 301
D LPTL AG + +DGV+ L+ + P + ++ + ++
Sbjct: 337 AGAIDLLPTLTDLAGIKRPEPKPIDGVSLKPLMLNSKADWPERMIFSSLRN---RVSVRT 393
Query: 302 DKYKLI-KGTIY 312
D+Y+L KG +Y
Sbjct: 394 DQYRLSRKGELY 405
>UniRef50_A3ZWK4 Cluster: N-acetylgalactosamine 6-sulfatase; n=1;
Blastopirellula marina DSM 3645|Rep:
N-acetylgalactosamine 6-sulfatase - Blastopirellula
marina DSM 3645
Length = 442
Score = 74.9 bits (176), Expect = 4e-12
Identities = 73/266 (27%), Positives = 116/266 (43%), Gaps = 28/266 (10%)
Query: 18 EKILPQYLKDLGYKTHLVGKWHLGSYKKE--YLPLNRGFDSHVGFWTGRIDMYDHTTMEQ 75
E L + L+ GY T GKWHLGS +K+ P GFD W + YD+ +
Sbjct: 72 EITLAERLQAAGYATSHFGKWHLGSVRKDSPVSPGKCGFDD----WISAPNFYDNDPIM- 126
Query: 76 GSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEP 135
+D R + + ++DV D AI + + K E F S V G+P+ P
Sbjct: 127 ----SDQGRAVQYHGE-----SSDVTADLAIDWIRAQAKEEKPFF----SVVWFGSPHSP 173
Query: 136 IRAPQKLIDAFKYIDDSAR-QKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGG 194
A D Y D+ A+ + + ++ +D + GK+ L G+ +N+I+ + +DNG
Sbjct: 174 HIAAD--ADRELYKDEPAKFRDYYGEVTGIDRAYGKIRSTLKELGISDNTILWYCSDNGA 231
Query: 195 PAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYS 254
A S P + K +++EGG+ G L P + + D PT+ +
Sbjct: 232 DKA-----KGSAGPFREKKGSIYEGGLLVPGILDWPARFPAPQTTSLRATTCDIFPTVLA 286
Query: 255 AAGGDLSVLENLDGVNQWDALSKNTE 280
AAG LDG+N L+ T+
Sbjct: 287 AAGLSPDKQRPLDGINLLPLLTAKTD 312
>UniRef50_A3HYT7 Cluster: Arylsulphatase A; n=1; Algoriphagus sp.
PR1|Rep: Arylsulphatase A - Algoriphagus sp. PR1
Length = 437
Score = 74.9 bits (176), Expect = 4e-12
Identities = 81/314 (25%), Positives = 128/314 (40%), Gaps = 18/314 (5%)
Query: 14 LPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTM 73
L ++ + LKD GYKT + GKW LG K+ P + GF+ W + D
Sbjct: 101 LDRSQTTFAKLLKDAGYKTAIAGKWQLG--KESDSPQHFGFEESC-LWQHMLGATDKNGN 157
Query: 74 EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPY 133
+ H G ++TD+ +D I + NK +P F H P+
Sbjct: 158 DTRYSNPVLEINGVPKHFDGGQFSTDITSDFLIDFMEK-NKDQPFFAYYPMIITHC--PF 214
Query: 134 EPIRAPQKLIDAFKYIDDSAR---QKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFST 190
P K D + + Q F +++ +D++VGK++ + GL E +I++F+
Sbjct: 215 VPT-PDSKDWDPSSPGSPTYKGDPQYFGDMVAYMDKTVGKIIAKVEEMGLSEETIIIFTG 273
Query: 191 DNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWL 249
DNG + +YP G K E G+ + W +DS + + SD+L
Sbjct: 274 DNGTDQPIVSSYRGKDYP--GGKKFTTENGIHVPLVVKWKGKIDSGIQ-NEDLIDFSDFL 330
Query: 250 PTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRT---SVLHNIDDIWGIAALTVDK-YK 305
PTL AG LDGV+ L +PR S D+ + +K YK
Sbjct: 331 PTLLDLAGIKAVHGIPLDGVSFMPQLMGKEGNPRNWIYSWYSRNGDLESLQEFVWNKEYK 390
Query: 306 LIKGTIYKGVWDNW 319
L K + + D+W
Sbjct: 391 LYKTGEFFNIQDDW 404
>UniRef50_A6LEC5 Cluster: Arylsulfatase A; n=1; Parabacteroides
distasonis ATCC 8503|Rep: Arylsulfatase A -
Parabacteroides distasonis (strain ATCC 8503 / DSM 20701
/ NCTC11152)
Length = 483
Score = 74.5 bits (175), Expect = 5e-12
Identities = 71/262 (27%), Positives = 122/262 (46%), Gaps = 29/262 (11%)
Query: 11 PRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEY-LPLNRGFDSHVGFWTGRIDMYD 69
P L +E + + LK Y T GKWHL S + + P ++GFD F+ +
Sbjct: 117 PMHLRDSEVTIAEVLKQADYATGHFGKWHLSSGRPDQPYPNDQGFD--YSFYALNNSVPS 174
Query: 70 HTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHS 129
H T+F R E ++ G Y+ D+ EA++ ++ NK EP FL V
Sbjct: 175 HHNP------TNFFRNGEPQGEIEG-YSCDIVVTEALQWLDK-NKQEPFFLN-----VWF 221
Query: 130 GNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFS 189
P+ P+ AP++L + ++ + +D ++GK++ L + L +N+IV+F+
Sbjct: 222 NEPHFPMEAPEELKKRH-----AINPEYYGCIENMDIAIGKLMNYLKEQNLEDNTIVIFA 276
Query: 190 TDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDW 248
+DNG + SN P +G K+ +EGG+R + W + + + +D
Sbjct: 277 SDNG------SQWDYSNLPFRGEKHFNYEGGLRVPCIVRWHKHVPTGVISEFNGC-FTDI 329
Query: 249 LPTLYSAAGGDLSVLENLDGVN 270
LPTL S A + +DG++
Sbjct: 330 LPTLASLADAPVPTDRVIDGMD 351
>UniRef50_A6DJ11 Cluster: Arylsulfatase A; n=1; Lentisphaera
araneosa HTCC2155|Rep: Arylsulfatase A - Lentisphaera
araneosa HTCC2155
Length = 462
Score = 74.5 bits (175), Expect = 5e-12
Identities = 73/278 (26%), Positives = 121/278 (43%), Gaps = 22/278 (7%)
Query: 12 RGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF-----WTGRID 66
+GL + + + LK +GY T VGKWHLG + E+LP N+GFDS+ G T
Sbjct: 100 KGLDPKHQTIAKLLKSVGYATKAVGKWHLGD-ELEFLPTNQGFDSYYGIPYSNDMTPAFS 158
Query: 67 M-YDHTTM-EQGSWGTDFRRGFEVAHDLFGVYATD----VYTDEAIKVVNSHNKSEPLF- 119
M Y + +G ++ FE A+ + V D + DE I++ + F
Sbjct: 159 MKYSENCLYREGVDQEALKKAFE-ANKIKPVGMKDKVPLMRNDECIEMPADQSTITKRFT 217
Query: 120 ---LMLAHSAVHSGNP---YEPIRAPQKLIDAFK-YIDDSARQKFAAVLSKLDESVGKVV 172
+ + S P Y P + K + SA + V+ ++D +VG+++
Sbjct: 218 DESIKFIDESTASNKPFFLYLAHSMPHTPLYVSKDFEGKSAGGIYGDVIEEIDYNVGRII 277
Query: 173 KALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLL 232
L+ + + EN++ ++++DN GP + S PL K T +EGG R + P
Sbjct: 278 DHLNEKNIAENTLFIYTSDN-GPWLIKKSHGGSALPLFEGKMTSFEGGQRVPAIIRWPAK 336
Query: 233 DSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVN 270
K V+ + D PTL G + ++G N
Sbjct: 337 IPKDSVSNEMTLSMDIFPTLAKITGAKAQDADLINGKN 374
>UniRef50_Q7UL40 Cluster: Arylsulfatase A; n=1; Pirellula sp.|Rep:
Arylsulfatase A - Rhodopirellula baltica
Length = 592
Score = 74.1 bits (174), Expect = 7e-12
Identities = 69/259 (26%), Positives = 114/259 (44%), Gaps = 26/259 (10%)
Query: 18 EKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPL---NRGFDSHVGFWTGRIDMY-DHTTM 73
E + + GY+T + GKWHLG E P+ ++GF V G I + D+
Sbjct: 126 ETTIAEVFAGAGYRTGIFGKWHLG----ENFPMRAEDQGFQKVVVHGGGGIGQFADYPGN 181
Query: 74 EQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPY 133
+ F+ A Y TDV+ DE+I+ + + +P F L + HS P+
Sbjct: 182 TYWDPTLQYNDSFKKAKG----YCTDVFIDESIQFMKDSGE-QPFFCYLPLNVPHS--PF 234
Query: 134 EPIRAPQKLIDAFKYIDDSARQKFAAV---LSKLDESVGKVVKALHTRGLLENSIVVFST 190
+ + D D R+ A + +++ D + G++++A+ G EN+I++F +
Sbjct: 235 DVADEFRADYDNQNLADPDGRKWVAPIYGMITQFDGAFGRLLEAVEDMGQRENTIILFMS 294
Query: 191 DNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWL 249
DNG + F L+ K +++E G+R + W L + MHI D L
Sbjct: 295 DNGPNSTYFTAG------LRAKKGSVYENGIRSPFVIQWPKTLQGGRKFDTPAMHI-DLL 347
Query: 250 PTLYSAAGGDLSVLENLDG 268
PTL A G L +DG
Sbjct: 348 PTLADACGIGLPADLQVDG 366
>UniRef50_Q4BZ10 Cluster: Similar to Arylsulfatase A and related
enzymes; n=1; Crocosphaera watsonii WH 8501|Rep: Similar
to Arylsulfatase A and related enzymes - Crocosphaera
watsonii
Length = 407
Score = 74.1 bits (174), Expect = 7e-12
Identities = 63/200 (31%), Positives = 97/200 (48%), Gaps = 26/200 (13%)
Query: 21 LPQYLKDLGYKTHLVG--KWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTTMEQGSW 78
+P+ L+D GY T LVG KW +GS+ + PL+RGF +M H +
Sbjct: 1 MPETLRDAGYVTGLVGALKWDIGSWNQG--PLDRGFT----------EMALHPPRTEP-- 46
Query: 79 GTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNK--SEPLFLMLAHSAVHSGNPYEPI 136
T F G + G Y T+V ++ + H K +P FL A A+H + P
Sbjct: 47 -TIFGGGSTYL-GVDGSYLTEVEGQYVLEFLERHGKRRDKPFFLYFAPLAIHIPHTEVPK 104
Query: 137 RAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGG-P 195
+ ++L + S RQ A L LD+ +G+++K + G+ EN++V+FS+DNGG P
Sbjct: 105 KYLKRLYPEHTEKEYSKRQYLQANLLALDDQIGRMIKKISELGIKENTLVMFSSDNGGDP 164
Query: 196 AAGFNDNAASNYPLKGVKNT 215
A + P +G KNT
Sbjct: 165 LADHRPD-----PYRGGKNT 179
>UniRef50_A6DM53 Cluster: Arylsulfatase; n=1; Lentisphaera araneosa
HTCC2155|Rep: Arylsulfatase - Lentisphaera araneosa
HTCC2155
Length = 540
Score = 74.1 bits (174), Expect = 7e-12
Identities = 55/147 (37%), Positives = 73/147 (49%), Gaps = 20/147 (13%)
Query: 8 GAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKE----YLPLNRGFDSHVGFWTG 63
G+ L N +PQ LK GYKT +VGKWHLG + P+NRGFD G G
Sbjct: 108 GSYDNYLNKNRITIPQVLKTTGYKTAMVGKWHLGGKSFDPNGPNAPMNRGFDDFYGTLHG 167
Query: 64 RIDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLML 122
YD T+ T R+ E H+ F Y TD +EA++ + + K+E P F +
Sbjct: 168 AGSYYDPMTL------TRNRKSMEPDHESF--YYTDKIGEEAVRQIKALAKAEQPFFQYI 219
Query: 123 AHSAVHSGNPYEPIRAPQKLIDAFKYI 149
A +A P+ PI AP+K I KYI
Sbjct: 220 AFTA-----PHWPIHAPEKTIQ--KYI 239
>UniRef50_A6DRX0 Cluster: N-acetylgalactosamine 6-sulfate sulfatase;
n=3; Bacteria|Rep: N-acetylgalactosamine 6-sulfate
sulfatase - Lentisphaera araneosa HTCC2155
Length = 486
Score = 73.7 bits (173), Expect = 9e-12
Identities = 65/239 (27%), Positives = 100/239 (41%), Gaps = 18/239 (7%)
Query: 25 LKDLGYKTHLVGKWHLGSYKKEYLPLNR-GFDSHVGFWTGRIDMYDHTTMEQGS---WGT 80
+KDLGY+T GKW L ++ E L + + GFD WTG D T ++ + W
Sbjct: 126 MKDLGYRTFATGKWQLNDFRLEPLAMQKHGFDDWA-MWTGCETSKDKTHEKKSTQRYWNA 184
Query: 81 DFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIRAPQ 140
E + G + D+YTD I + NK +P+ + P+ P+ A
Sbjct: 185 HINTK-EGSKTYKGQFGPDLYTDHLINFMRK-NKDKPMCIYYPMVL-----PHTPVAATP 237
Query: 141 KLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNG-GPAAGF 199
A + K A++ +D+ VGK+V L G+ E +I++F+TDNG P
Sbjct: 238 DEPKAKGVLG-----KHKAMVRYIDKMVGKLVNELDELGIRERTIIIFTTDNGSAPPPRG 292
Query: 200 NDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGG 258
+ + G K+T E G+ + P L +D LPT GG
Sbjct: 293 VIGTRNGRKIVGAKSTETEVGICAPFIVNGPGLVPAGVETDALTDFTDMLPTFLELGGG 351
>UniRef50_A6DJ15 Cluster: Putative arylsulfatase; n=2; Lentisphaera
araneosa HTCC2155|Rep: Putative arylsulfatase -
Lentisphaera araneosa HTCC2155
Length = 469
Score = 73.7 bits (173), Expect = 9e-12
Identities = 83/306 (27%), Positives = 134/306 (43%), Gaps = 40/306 (13%)
Query: 3 HGVI---YGAEPRG----LPLNEK--ILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRG 53
HG++ Y P G LPL + L + +K GY T L+GKW +G P +G
Sbjct: 84 HGLVRGNYEVGPHGFGGELPLRPEDVSLAEVMKSAGYATGLIGKWGMGMDGTTGEPRKKG 143
Query: 54 FDSHVGFWTGRIDMYDHTTMEQGSWGTDFRRGF-EVAHDLFGVYATDVYTDEAIKVVNSH 112
FD GF + H + + + E D G+Y +D + ++ I+ V
Sbjct: 144 FDYSYGFLN---QAHAHHYYPEYIYENGEKLMIPENKDDARGLYISDTFAEKGIEFVEE- 199
Query: 113 NKSEPLFLMLAHSAVHS-------------GN-PYEPIRAPQKLIDAFK-----YID-DS 152
NK +P FL A H+ G P P ++ D Y D
Sbjct: 200 NKDKPFFLFWAFVTPHAELLVPDDSLNEFKGKWPETPFVMGKQGGDGTDNPFGVYASQDH 259
Query: 153 ARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDN--AASNYPLK 210
R F+ ++++LD+ VG + L G+ +N+I++FS+DNG G D SN L
Sbjct: 260 PRAAFSGMITRLDKRVGDLFDKLEELGIDDNTIIMFSSDNGPHKEGGADPDFFDSNAELT 319
Query: 211 GVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGV 269
G K L EGG+R + W ++ ++++ ++ D +PT+ A D E++DG+
Sbjct: 320 GYKRDLTEGGIRVPFMVRWPNVVKARSKSSHASA-FWDVMPTIAEIANTDSP--EDIDGL 376
Query: 270 NQWDAL 275
+ AL
Sbjct: 377 SFLPAL 382
>UniRef50_A6CBM1 Cluster: Arylsulphatase A; n=1; Planctomyces maris
DSM 8797|Rep: Arylsulphatase A - Planctomyces maris DSM
8797
Length = 497
Score = 73.3 bits (172), Expect = 1e-11
Identities = 73/291 (25%), Positives = 128/291 (43%), Gaps = 28/291 (9%)
Query: 11 PRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSY---KKEYLPLNRGFDSHVGFWTGRIDM 67
P L +E + Q L+ GY T VGKWH K++ P + GF +
Sbjct: 108 PMHLKRDEVTVAQLLQQAGYDTAHVGKWHCNGMFNSKEQPQPGDHGFRHWFSTQNNALPT 167
Query: 68 YDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNS-HNKSEPLFLMLAHSA 126
+++ +F R + ++ G ++ + DE I+ ++ K +P FL H
Sbjct: 168 HENPN--------NFVRNGKPLGEIEG-FSCQIVADEGIRWLSDWREKEKPFFL---HVC 215
Query: 127 VHSGNPYEPIRAPQKLIDAF--KYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENS 184
H P+E + +P L++ + K + + Q FA V + +D +VGK++ L + +N+
Sbjct: 216 FHE--PHERVASPPALVETYLDKSLYEDQAQYFANV-ANMDRAVGKLLIKLDELKVADNT 272
Query: 185 IVVFSTDNGGP-----AAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARV 238
+V F++DNG G + S L+G+K ++EGG+R G + W + + +
Sbjct: 273 LVFFTSDNGPETLNRYGKGSRRSWGSPGVLRGMKLHIYEGGIRVPGIVRWPGKIKAGQEI 332
Query: 239 AYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHN 289
A + D LPT AG + LDG + + N T + N
Sbjct: 333 ATPVCSV-DLLPTFCEIAGVAVPDQRPLDGASLLPLFAGNKIERTTPLFWN 382
>UniRef50_UPI0000E1104B Cluster: N-acetylgalactosamine 6-sulfate
sulfatase; n=1; alpha proteobacterium HTCC2255|Rep:
N-acetylgalactosamine 6-sulfate sulfatase - alpha
proteobacterium HTCC2255
Length = 485
Score = 72.9 bits (171), Expect = 2e-11
Identities = 71/274 (25%), Positives = 123/274 (44%), Gaps = 39/274 (14%)
Query: 13 GLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDMYDHTT 72
G+ + + P+ L+ +GYKT L+GKWHLG Y+ E+ P G+D +GF G D
Sbjct: 110 GIEQSYETWPEILQKVGYKTGLIGKWHLG-YQPEHHPTQHGYDEFIGFLAGGTTPEDPRL 168
Query: 73 MEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVH---- 128
G V + G+ +V T+ AI +N H K + L L + A H
Sbjct: 169 EVNG-----------VETNELGL-TVEVLTNHAIAFLNRH-KDDKFALSLHYRAPHYRFL 215
Query: 129 -----SGNPYEPIRAPQKLIDAFKYIDDSAR---QKFAAVLSKLDESVGKVVKALHTRGL 180
PYE + D + AR +++ + ++ +D +VG +++ L GL
Sbjct: 216 PVAPEDAAPYEDVEIALPHPDYPGLNTERARKLMREYMSSVTGIDRNVGLLMQTLEQLGL 275
Query: 181 LENSIVVFSTDNG-GPAAGFNDNAASNY------PL------KGVKNTLWEGGVRGAGFL 227
+N++V+F++D+G A + + Y PL +G + +++ ++ +
Sbjct: 276 SQNTVVIFTSDHGYNIAHNGMWHKGNGYWLLYEPPLGTPNVPRGQRPNMYDNSLKVPTIV 335
Query: 228 WSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLS 261
P + KA + M DW PTL + A G +S
Sbjct: 336 RWPGVIPKASINDSTMSNLDWFPTLVAIARGKVS 369
>UniRef50_A6DNI9 Cluster: N-acetyl-galactosamine-6-sulfatase; n=1;
Lentisphaera araneosa HTCC2155|Rep:
N-acetyl-galactosamine-6-sulfatase - Lentisphaera
araneosa HTCC2155
Length = 500
Score = 72.9 bits (171), Expect = 2e-11
Identities = 86/300 (28%), Positives = 123/300 (41%), Gaps = 51/300 (17%)
Query: 11 PRGLPLNEKILPQYLKDL---GYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDM 67
P+ + + P Y K L GY T GKWHLG + Y PL GFD V
Sbjct: 119 PKSTTRLDTVFPTYAKVLKAQGYVTGHYGKWHLGH--EPYTPLEHGFDVDV--------- 167
Query: 68 YDHTTMEQGSWGTDFRRGFEVAHDLF----GVYATDVYTDEAIKVVNSHNKSEPLFLMLA 123
HT G G+ F G + D F G + D EAI+ + NK P L
Sbjct: 168 -PHTK-SHGPKGSYF--GPKKYSDSFTLKKGEHLEDRMGQEAIEFIKE-NKDRPFLLNYW 222
Query: 124 HSAVHSGNPYEPIRAPQKLIDAFKY----IDDSARQK---FAAVLSKLDESVGKVVKALH 176
+VHS P+ A L+D ++ + A+Q+ FA ++ D++VG ++KA+
Sbjct: 223 AFSVHS-----PMFAKLDLLDKYRKKATKLPTDAQQRNPIFAGMIETFDDNVGLLLKAID 277
Query: 177 TRGLLENSIVVFSTDNGGPAAGFNDN----------------AASNYPLKGVKNTLWEGG 220
G+ + +I+V S+DNGG + A SNYPLK K T+ +GG
Sbjct: 278 EAGIADRTIIVLSSDNGGTIESAYTHEAYWGNGTVEEIVDIPATSNYPLKSGKGTIHDGG 337
Query: 221 VRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTE 280
+ P + D PT AG + +DGV+Q AL E
Sbjct: 338 TAVPFIVVWPGKIKAGTKSDSYFSGVDVFPTFVEMAGAKMPSGVAIDGVSQVPALITGEE 397
>UniRef50_A4A218 Cluster: Arylsulfatase A; n=1; Blastopirellula
marina DSM 3645|Rep: Arylsulfatase A - Blastopirellula
marina DSM 3645
Length = 491
Score = 72.9 bits (171), Expect = 2e-11
Identities = 71/264 (26%), Positives = 113/264 (42%), Gaps = 15/264 (5%)
Query: 5 VIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGR 64
V+ + +GL E + + L +GY T + GKWHLG + E+LP +GFD+ G
Sbjct: 115 VLRPLDTKGLNPKETTMAEVLHSVGYATGIFGKWHLGD-QPEFLPTQQGFDTFFGIPYSD 173
Query: 65 IDMYDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAH 124
DM + R + + T+EAI + N+ P F+ + H
Sbjct: 174 -DMTKDLRPQLWPELPLMRDEQVIEAPVDRDLLVKRCTEEAIAFIEQ-NQERPFFVYIPH 231
Query: 125 SAVHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENS 184
+ G+ P +P AF+ S + + +LD S G+V++ L L E +
Sbjct: 232 TM--PGSTKRPFSSP-----AFQ--GKSKNGPYGDSVEELDWSTGQVMETLKRLDLDEQT 282
Query: 185 IVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMH 244
+V++++DNG P N SN P +G EG +R + P S ++
Sbjct: 283 LVIWTSDNGAPHR--NPPQGSNLPYQGDGYNTSEGAMRMPCVMRWPGKISAGQINDALCT 340
Query: 245 ISDWLPTLYSAAGGDLSVLENLDG 268
D LPT AG +S E +DG
Sbjct: 341 TMDLLPTFGKLAGATMSKTE-IDG 363
>UniRef50_Q89L10 Cluster: Bll4738 protein; n=6; Proteobacteria|Rep:
Bll4738 protein - Bradyrhizobium japonicum
Length = 487
Score = 72.1 bits (169), Expect = 3e-11
Identities = 74/292 (25%), Positives = 122/292 (41%), Gaps = 20/292 (6%)
Query: 8 GAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGF-WTGRID 66
G P GL E + L GY T + GKWHLGS ++ +P N+GFD G T
Sbjct: 107 GGIPDGLTQWEITTAELLSGQGYATGMWGKWHLGS-AEDRMPTNQGFDEWYGIPRTYDEA 165
Query: 67 MYDHTTMEQGSW-GTDFRRGF--EVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLA 123
M+ + W R+G+ +V H A + V++ ++ + +
Sbjct: 166 MWPSLDETRSMWPSVGNRQGWNAKVVHPQHIYEARKGDKPRQVAVLDE-DRRRTMDAEIT 224
Query: 124 HSAVH-------SGNP---YEPI-RAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVV 172
AV SG P Y P + ++ + +A L+++D G+++
Sbjct: 225 SRAVEFIKRNASSGKPFYAYVPFAHVHMPTLPNLEFAGRTGNGDWADCLAEMDYRTGQIL 284
Query: 173 KALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLL 232
A+ G+ +++V+F++DNG A N N P +G T EG +R + P
Sbjct: 285 DAIKQAGIENDTLVIFASDNGPEAT--NPWEGDNGPWRGTYFTAMEGSLRAPFIIRWPGK 342
Query: 233 DSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWD-ALSKNTESPR 283
R++ + +H D TL G ++ +DGV+Q D L K S R
Sbjct: 343 VPAGRISNEIVHTVDLFTTLARVGGAEVPTDRAIDGVDQLDFFLGKQEASNR 394
>UniRef50_A6EGE7 Cluster: N-acetylgalactosamine-6-sulfatase; n=3;
Bacteroidetes|Rep: N-acetylgalactosamine-6-sulfatase -
Pedobacter sp. BAL39
Length = 464
Score = 72.1 bits (169), Expect = 3e-11
Identities = 65/259 (25%), Positives = 116/259 (44%), Gaps = 21/259 (8%)
Query: 21 LPQYLKDLGYKTHLVGKWHLGSYKKEY-LPL--NRGFDSHVGFWTGRIDMYDHTTMEQGS 77
+ ++ ++ GY T GKWH+G + P G D HV Y+ +
Sbjct: 128 MARFFQEAGYATGHFGKWHMGGGRDVTGAPTFDQYGIDEHVS-------TYESPEPDPAI 180
Query: 78 WGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSEPLFLMLAHSAVHSGNPYEPIR 137
T++ + + + T + D+ + + H K P F+ L VH+ P+ P R
Sbjct: 181 TATNWIWSDQDSIKRWD--RTKYFVDKTLDFMKRH-KGTPCFVNLWPDDVHT--PWVP-R 234
Query: 138 APQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAA 197
+ + F +D + F VL D +G+++ L GL EN+I++F++DN GPA
Sbjct: 235 SGDEFNGKFP-MDPQEEEAFKGVLKTYDVQIGRLLDGLQELGLAENTIIIFTSDN-GPAP 292
Query: 198 GFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHISDWLPTLYSAA 256
F + + +G K +L+EGG+R + W + +++ +D LP+L +
Sbjct: 293 SFRGSRTGGF--RGAKASLYEGGIRMPFIISWPGHTPAGKTDDRSELNATDLLPSLAKLS 350
Query: 257 GGDLSVLENLDGVNQWDAL 275
G L DG+++ D L
Sbjct: 351 GVKLPDSYAGDGIDRSDLL 369
>UniRef50_A6DJL2 Cluster: Putative exported uslfatase; n=1;
Lentisphaera araneosa HTCC2155|Rep: Putative exported
uslfatase - Lentisphaera araneosa HTCC2155
Length = 493
Score = 72.1 bits (169), Expect = 3e-11
Identities = 62/242 (25%), Positives = 112/242 (46%), Gaps = 22/242 (9%)
Query: 30 YKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFW-TGRIDMYDHTTMEQGSWGTDFRRGFEV 88
Y T +GKWH+ + E GFD H G+ G + Y+ G D ++
Sbjct: 130 YATAHLGKWHVPKLQPEVA----GFDVHDGYTGNGGGEYYE------AHKGKDKKK--LP 177
Query: 89 AHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSAVHSG--NPYEPIRAPQKLIDA 145
D +Y ++ A + K++ P +L ++H AVH + + + +K +
Sbjct: 178 PEDPKQIYTI---SERACDFIAQQAKAKKPFYLQISHYAVHVSLQSRAKTLERTKKRLAT 234
Query: 146 FKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAAS 205
FAA++ LD VG ++ + +G+ +N+ ++F++DNGG + + +
Sbjct: 235 THPKLHQRTIDFAAMVEDLDIGVGMILDEVEKQGIKDNTYIIFTSDNGG--FSYANTSGQ 292
Query: 206 NYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLEN 265
N PLKG K L+EGG+R + P + + Q + D+LPT Y GG ++ ++
Sbjct: 293 NTPLKGGKRWLYEGGIRVPFVIQGPKIKA-GTYCNQPIINWDFLPTFYDLVGGTEALSQD 351
Query: 266 LD 267
L+
Sbjct: 352 LE 353
>UniRef50_A6DJE5 Cluster: Sulfatase 1; n=1; Lentisphaera araneosa
HTCC2155|Rep: Sulfatase 1 - Lentisphaera araneosa
HTCC2155
Length = 490
Score = 72.1 bits (169), Expect = 3e-11
Identities = 51/181 (28%), Positives = 95/181 (52%), Gaps = 15/181 (8%)
Query: 102 TDEAIKVVNSHNKS-EPLFLMLAHSAVHSGNPYEPIRAPQKLIDAFKYID-DSARQKFAA 159
T ++ +N+ K+ +P FLM++H AVH + A ++ I ++ D D ++AA
Sbjct: 198 TKSSVDFINTQAKANKPFFLMVSHYAVHVKHA-----ALEETIKKYQIGDVDYKDARYAA 252
Query: 160 VLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEG 219
++ LD+S+G ++KAL G+ +N+ V+F++DNGG G N L+G K + EG
Sbjct: 253 LIEHLDDSLGAMLKALDDNGIADNTYVIFTSDNGGGHGG-------NPSLQGGKAKMMEG 305
Query: 220 GVRGAGFLWSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNT 279
G+R + P + + ++ + D+L TL+ +G + +++DG + D
Sbjct: 306 GLRVPTVVRGPGIPADSQCDVPIVQY-DFLATLHELSGNPNPLPDDIDGGSLVDVFRNGN 364
Query: 280 E 280
E
Sbjct: 365 E 365
>UniRef50_A6DIC6 Cluster: Arylsulfatase; n=1; Lentisphaera araneosa
HTCC2155|Rep: Arylsulfatase - Lentisphaera araneosa
HTCC2155
Length = 528
Score = 72.1 bits (169), Expect = 3e-11
Identities = 79/282 (28%), Positives = 127/282 (45%), Gaps = 25/282 (8%)
Query: 18 EKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFD-SHVGFWTGRIDMYDHTTMEQG 76
E L + KD Y T L GKWHLG Y +++GFD S + G DH +
Sbjct: 74 EYTLAEAFKDNQYSTGLFGKWHLGDC-YPYRAMDQGFDYSLIHRGGGLGQPADHPENNRA 132
Query: 77 SWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSH-NKSEPLFLMLAHSAVHSGNPYEP 135
+ R EVA G + TDV+ EA K ++ ++P F + +A HS P+
Sbjct: 133 YTNSMLYRN-EVAFRSEG-FCTDVFFREARKWISEKVENNKPFFACIMPNAPHS--PFHD 188
Query: 136 IRAPQKLIDAFKYID-----DSARQKFAAV---LSKLDESVGKVVKALHTRGLLENSIVV 187
+ P L+ +K D S + K AA+ + +D+++ + L + + +I++
Sbjct: 189 V--PADLLKKYKNADWSQHKGSDKDKVAAIYAMVENIDQNIADLRDELKKLNIDKKTIIL 246
Query: 188 FSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHISD 247
FS+DNGG F+ L+G K++ +EGGV L+ P SK + + + D
Sbjct: 247 FSSDNGGWGERFDAG------LRGSKSSSFEGGVLSPLMLFVPGQASKQ--STEAIAHYD 298
Query: 248 WLPTLYSAAGGDLSVLENLDGVNQWDALSKNTESPRTSVLHN 289
LPTL + LDG + LS + R+ +L +
Sbjct: 299 VLPTLVDLCDLKVDFPNELDGRSFLPILSGESLPERSIILQS 340
>UniRef50_A6C4Q6 Cluster: Arylsulfatase; n=1; Planctomyces maris DSM
8797|Rep: Arylsulfatase - Planctomyces maris DSM 8797
Length = 574
Score = 72.1 bits (169), Expect = 3e-11
Identities = 76/286 (26%), Positives = 128/286 (44%), Gaps = 22/286 (7%)
Query: 8 GAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFWTGRIDM 67
GA+ +G E + + L+ GY+T + GKWHLG P ++GF + +G I
Sbjct: 107 GAKMQG---EEVTVAELLQQAGYQTGIFGKWHLGD-NYPMRPQDQGFAESLIHKSGGIGQ 162
Query: 68 YDHTTMEQGSWGTDFRRGFEVAHDLFGVYATDVYTDEAIKVVNSHNKSE-PLFLMLAHSA 126
+ + S+ VA G Y TDV+ D A+ ++ K+E P F+ LA +A
Sbjct: 163 ---SPDQPNSYFHPKLWKNGVAFQSTG-YCTDVFFDAALDFIDRQTKTEKPFFVYLATNA 218
Query: 127 VHSGNPYEPIRAPQKLIDAFKYIDDSARQKFAAVLSKLDESVGKVVKALHTRGLLENSIV 186
H+ P E + K + +D++ + + +++ LDE++GK++ L L E ++V
Sbjct: 219 PHT--PLEIAESYWKPYQR-QGLDETTARVYG-MITNLDENIGKLLSHLERSALAEKTVV 274
Query: 187 VFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFL-WSPLLDSKARVAYQKMHI 245
+F DNG + L+G K+ +EGG+R W ++ HI
Sbjct: 275 LFLGDNGPQQKRYTGG------LRGRKSWTYEGGIRVPCLAQWPGHFREGEKIDQIAAHI 328
Query: 246 SDWLPTLYSAAGGDLSVLENLDGVNQWDALSKNTES-PRTSVLHNI 290
D +PTL + LDGV+ L+ E P S+ +
Sbjct: 329 -DLMPTLLALTETRCPESLKLDGVDLSPLLTGRKEKLPARSLFFQV 373
>UniRef50_A4AR92 Cluster: N-acetylgalactosamine-6-sulfatase; n=1;
Flavobacteriales bacterium HTCC2170|Rep:
N-acetylgalactosamine-6-sulfatase - Flavobacteriales
bacterium HTCC2170
Length = 479
Score = 72.1 bits (169), Expect = 3e-11
Identities = 83/297 (27%), Positives = 121/297 (40%), Gaps = 39/297 (13%)
Query: 13 GLPLNEKI-LPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRG----FDSHVGFWT----- 62
G L E+I LP+ LK GY T GKWHLG+ K+ L NRG FD+H T
Sbjct: 104 GHMLPEEITLPELLKGQGYATGHFGKWHLGTLTKDTLDANRGGREKFDAHYSLPTEHGYD 163
Query: 63 ------GRIDMYDHTTM-EQGSWGTDFRRGFEVAHDLFGV--YATDVYTDEAIKVVNSHN 113
++ YD E G R G+ G Y T +T E KV +
Sbjct: 164 EFFSTESKVPTYDPMIYPENFDEGESLRYGWRSVESNEGTKPYGTAYWTGENQKVTTNIE 223
Query: 114 KSEPLFLM-----LAHSAVHSGNPYEP---IRAPQKLI---DAFK--YID-DSARQKFAA 159
+M A+ P+ + P + A + Y D D +Q +
Sbjct: 224 GDNSRVIMDRVLPFIDRAITEEKPFFSTLWLHTPHLPVVSDSAHRSLYPDLDLQQQIYNG 283
Query: 160 VLSKLDESVGKVVKALHTRGLLENSIVVFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEG 219
L+ +DE +G++ L + EN+I+ F +DNG ND S + K +L+EG
Sbjct: 284 TLTAMDEQIGRLWSKLEALDIQENTIIFFCSDNGPE----NDTPGSAGVFRERKRSLYEG 339
Query: 220 GVRGAGFL-WSPLLDSKARVAYQKMHISDWLPTLYSAAGGDLSVLENLDGVNQWDAL 275
GVR F+ W + R +Y SD+LPTL +DG + W+ +
Sbjct: 340 GVRVPAFMVWKNHVTGGQR-SYFPSVTSDYLPTLLDILNITYPDNRPVDGESLWEVV 395
>UniRef50_Q612A1 Cluster: Putative uncharacterized protein CBG16830;
n=1; Caenorhabditis briggsae|Rep: Putative
uncharacterized protein CBG16830 - Caenorhabditis
briggsae
Length = 268
Score = 72.1 bits (169), Expect = 3e-11
Identities = 32/70 (45%), Positives = 43/70 (61%)
Query: 2 QHGVIYGAEPRGLPLNEKILPQYLKDLGYKTHLVGKWHLGSYKKEYLPLNRGFDSHVGFW 61
Q GV EP G+P L + ++ L Y T+LVGKWHLG KKE+LP NRGFD GF+
Sbjct: 31 QAGVFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYGFY 90
Query: 62 TGRIDMYDHT 71
+ ++H+
Sbjct: 91 GPQTGYFNHS 100
Score = 67.3 bits (157), Expect = 8e-10
Identities = 45/126 (35%), Positives = 66/126 (52%), Gaps = 10/126 (7%)
Query: 187 VFSTDNGGPAAGFNDNAASNYPLKGVKNTLWEGGVRGAGFLWSPLLDSKARVAYQKMHIS 246
V+ST NGG + + ASN PL+G K+T+WEGG + F+ SP+ + H+
Sbjct: 135 VYST-NGGTS----NFGASNAPLRGEKDTIWEGGTKTTTFVHSPMYVEEGGNREMMFHVV 189
Query: 247 DWLPTLYSAAGGDLSVLENLDGVNQWDALSKN-TESPRTSVLHNIDDIWGIAALTVDKYK 305
DW T+ S G L V DG+NQW+ + N + R ++NI D +A+ YK
Sbjct: 190 DWHATILSITG--LEVDSYGDGINQWEYIRTNRPKFRRFQFVYNIADHG--SAIRDGDYK 245
Query: 306 LIKGTI 311
LI G +
Sbjct: 246 LIVGNV 251
Database: uniref50
Posted date: Oct 5, 2007 11:19 AM
Number of letters in database: 575,637,011
Number of sequences in database: 1,657,284
Lambda K H
0.317 0.136 0.425
Gapped
Lambda K H
0.279 0.0580 0.190
Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 543,781,209
Number of Sequences: 1657284
Number of extensions: 23536543
Number of successful extensions: 49405
Number of sequences better than 10.0: 500
Number of HSP's better than 10.0 without gapping: 569
Number of HSP's successfully gapped in prelim test: 153
Number of HSP's that attempted gapping in prelim test: 47434
Number of HSP's gapped (non-prelim): 1328
length of query: 455
length of database: 575,637,011
effective HSP length: 103
effective length of query: 352
effective length of database: 404,936,759
effective search space: 142537739168
effective search space used: 142537739168
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.6 bits)
S2: 74 (33.9 bits)
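Note: the figures in the statistics block above can be cross-checked against each other. The sketch below is not part of the BLAST output; it assumes the standard Karlin-Altschul relations (effective length = actual length minus the effective HSP length, bit score = (lambda*S - ln K) / ln 2, E = search space * 2^-bits) and uses the gapped Lambda and K printed above. The function names are illustrative only, and the rounded Lambda and K mean the results agree with the report only approximately.

```python
import math

# Values copied from the statistics block of this report.
QUERY_LENGTH = 455
DB_LETTERS = 575_637_011
DB_SEQUENCES = 1_657_284
EFFECTIVE_HSP_LENGTH = 103
GAPPED_LAMBDA = 0.279   # as printed (rounded)
GAPPED_K = 0.0580       # as printed (rounded)


def effective_search_space():
    """Return (effective query length, effective db length, search space)."""
    eff_query = QUERY_LENGTH - EFFECTIVE_HSP_LENGTH
    # Each database sequence loses one effective HSP length.
    eff_db = DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score):
    """Convert a raw alignment score to bits using the gapped parameters."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect_value(raw_score):
    """Approximate E-value for a raw score over the effective search space."""
    _, _, space = effective_search_space()
    return space * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    print(effective_search_space())
    print(round(bit_score(176), 1), expect_value(176))
```

Under these assumptions the effective lengths and search space reproduce the reported 352, 404,936,759 and 142537739168 exactly, and a raw score of 176 comes out near the reported 74.9 bits with an E-value on the order of 4e-12.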