BLASTP 2.2.12 [Aug-07-2005]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= BGIBMGA001064-TA|BGIBMGA001064-PA|IPR001173|Glycosyl
transferase, family 2, IPR000772|Ricin B lectin, IPR008997|Ricin
B-related lectin
(575 letters)
Database: uniref50
1,657,284 sequences; 575,637,011 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef50_Q17NN8 Cluster: N-acetylgalactosaminyltransferase; n=4;... 599 e-170
UniRef50_Q6WV16 Cluster: N-acetylgalactosaminyltransferase 6; n=... 581 e-164
UniRef50_O45947 Cluster: Putative polypeptide N-acetylgalactosam... 577 e-163
UniRef50_Q86SR1 Cluster: Polypeptide N-acetylgalactosaminyltrans... 573 e-162
UniRef50_Q8IA42 Cluster: N-acetylgalactosaminyltransferase 4; n=... 503 e-141
UniRef50_A7SZ28 Cluster: Predicted protein; n=1; Nematostella ve... 417 e-115
UniRef50_Q10472 Cluster: Polypeptide N-acetylgalactosaminyltrans... 400 e-110
UniRef50_Q9VUT6 Cluster: Polypeptide N-acetylgalactosaminyltrans... 383 e-105
UniRef50_Q86SF2 Cluster: N-acetylgalactosaminyltransferase 7; n=... 375 e-102
UniRef50_A2AQQ1 Cluster: UDP-N-acetyl-alpha-D-galactosamine: pol... 374 e-102
UniRef50_A7RRV7 Cluster: Predicted protein; n=1; Nematostella ve... 367 e-100
UniRef50_Q4RQL8 Cluster: Chromosome 2 SCAF15004, whole genome sh... 363 5e-99
UniRef50_Q8IA43 Cluster: Putative polypeptide N-acetylgalactosam... 362 1e-98
UniRef50_Q8MV48 Cluster: N-acetylgalactosaminyltransferase 7; n=... 357 5e-97
UniRef50_UPI00015B5D50 Cluster: PREDICTED: similar to ENSANGP000... 353 6e-96
UniRef50_O61397 Cluster: Probable N-acetylgalactosaminyltransfer... 349 2e-94
UniRef50_Q7Z7M9 Cluster: Polypeptide N-acetylgalactosaminyltrans... 347 6e-94
UniRef50_Q8NCW6 Cluster: Polypeptide N-acetylgalactosaminyltrans... 344 4e-93
UniRef50_Q95ZJ1 Cluster: Polypeptide N-acetylgalactosaminyltrans... 341 4e-92
UniRef50_O61394 Cluster: Probable N-acetylgalactosaminyltransfer... 339 1e-91
UniRef50_Q6WV19 Cluster: Polypeptide N-acetylgalactosaminyltrans... 339 1e-91
UniRef50_Q10471 Cluster: Polypeptide N-acetylgalactosaminyltrans... 338 3e-91
UniRef50_UPI0000E461C0 Cluster: PREDICTED: hypothetical protein,... 333 8e-90
UniRef50_A7SDQ3 Cluster: Predicted protein; n=1; Nematostella ve... 331 3e-89
UniRef50_P34678 Cluster: Polypeptide N-acetylgalactosaminyltrans... 330 6e-89
UniRef50_UPI00015B453F Cluster: PREDICTED: similar to GA20875-PA... 328 2e-88
UniRef50_Q176D5 Cluster: N-acetylgalactosaminyltransferase; n=3;... 326 1e-87
UniRef50_Q8N428 Cluster: Putative polypeptide N-acetylgalactosam... 326 1e-87
UniRef50_UPI00015B515F Cluster: PREDICTED: similar to n-acetylga... 324 4e-87
UniRef50_UPI0000D56CDA Cluster: PREDICTED: similar to CG4445-PA;... 321 4e-86
UniRef50_Q9Y117 Cluster: Polypeptide N-acetylgalactosaminyltrans... 320 8e-86
UniRef50_Q6WV20 Cluster: Polypeptide N-acetylgalactosaminyltrans... 318 2e-85
UniRef50_UPI0000D564C6 Cluster: PREDICTED: similar to CG8182-PA,... 318 3e-85
UniRef50_Q16ZW8 Cluster: N-acetylgalactosaminyltransferase; n=4;... 317 6e-85
UniRef50_A7RGG9 Cluster: Predicted protein; n=3; Eumetazoa|Rep: ... 310 5e-83
UniRef50_Q8IXK2 Cluster: Polypeptide N-acetylgalactosaminyltrans... 310 5e-83
UniRef50_Q17M60 Cluster: N-acetylgalactosaminyltransferase; n=1;... 310 7e-83
UniRef50_Q5DD76 Cluster: SJCHGC09400 protein; n=2; Schistosoma j... 308 2e-82
UniRef50_Q96FL9 Cluster: Polypeptide N-acetylgalactosaminyltrans... 308 4e-82
UniRef50_UPI0000E45D84 Cluster: PREDICTED: hypothetical protein;... 307 6e-82
UniRef50_Q9U2C4 Cluster: Probable N-acetylgalactosaminyltransfer... 305 3e-81
UniRef50_UPI0000586DC6 Cluster: PREDICTED: similar to polypeptid... 303 6e-81
UniRef50_A0NGH9 Cluster: ENSANGP00000031751; n=1; Anopheles gamb... 298 3e-79
UniRef50_Q7K755 Cluster: Putative polypeptide N-acetylgalactosam... 297 4e-79
UniRef50_Q5TWJ3 Cluster: ENSANGP00000028412; n=1; Anopheles gamb... 297 7e-79
UniRef50_Q16ZA7 Cluster: N-acetylgalactosaminyltransferase; n=7;... 295 2e-78
UniRef50_UPI000069E576 Cluster: Polypeptide N-acetylgalactosamin... 293 6e-78
UniRef50_UPI0000E4710F Cluster: PREDICTED: similar to pp-GalNAc-... 291 4e-77
UniRef50_Q16SH9 Cluster: N-acetylgalactosaminyltransferase; n=2;... 288 2e-76
UniRef50_Q14435 Cluster: Polypeptide N-acetylgalactosaminyltrans... 288 3e-76
UniRef50_Q8I136 Cluster: Polypeptide N-acetylgalactosaminyltrans... 287 5e-76
UniRef50_Q6V2D0 Cluster: UDP-N-acetyl-D-galactosamine:polypeptid... 286 1e-75
UniRef50_Q8MVS5 Cluster: Polypeptide N-acetylgalactosaminyltrans... 286 1e-75
UniRef50_Q8IA41 Cluster: Putative polypeptide N-acetylgalactosam... 284 5e-75
UniRef50_Q9D4M9 Cluster: Putative polypeptide N-acetylgalactosam... 282 2e-74
UniRef50_UPI000069E1C8 Cluster: Polypeptide N-acetylgalactosamin... 277 4e-73
UniRef50_Q8N3T1 Cluster: Polypeptide N-acetylgalactosaminyltrans... 277 4e-73
UniRef50_Q7TT15-2 Cluster: Isoform 2 of Q7TT15 ; n=9; Mammalia|R... 269 1e-70
UniRef50_Q4RKI0 Cluster: Chromosome 21 SCAF15029, whole genome s... 261 4e-68
UniRef50_Q9HCQ5 Cluster: Polypeptide N-acetylgalactosaminyltrans... 260 9e-68
UniRef50_Q6P9A2 Cluster: Putative polypeptide N-acetylgalactosam... 258 2e-67
UniRef50_Q8IA44 Cluster: Putative polypeptide N-acetylgalactosam... 255 3e-66
UniRef50_Q6YBY0 Cluster: UDP-N-acetyl-D-galactosamine:polypeptid... 250 6e-65
UniRef50_Q5CY08 Cluster: Extracellular protein with a signal pep... 248 2e-64
UniRef50_Q8MM26 Cluster: UDP-N-acetyl-D-galactosamine:polypeptid... 247 5e-64
UniRef50_Q8K1B9 Cluster: Putative polypeptide N-acetylgalactosam... 244 5e-63
UniRef50_O45293 Cluster: Probable N-acetylgalactosaminyltransfer... 244 5e-63
UniRef50_Q9NY28 Cluster: Probable polypeptide N-acetylgalactosam... 241 5e-62
UniRef50_Q8MYY6 Cluster: Putative polypeptide N-acetylgalactosam... 233 7e-60
UniRef50_Q5CKF0 Cluster: UDP-N-acetyl-D-galactosamine:polypeptid... 233 1e-59
UniRef50_UPI000065D031 Cluster: Probable polypeptide N-acetylgal... 231 5e-59
UniRef50_UPI0000E4974C Cluster: PREDICTED: hypothetical protein;... 225 2e-57
UniRef50_UPI0000D9AA48 Cluster: PREDICTED: similar to UDP-N-acet... 218 4e-55
UniRef50_UPI0000E46FFD Cluster: PREDICTED: similar to n-acetylga... 200 6e-50
UniRef50_Q6TBR4 Cluster: UDP-N-acetyl-D-galactosamine:polypeptid... 200 6e-50
UniRef50_Q4RNJ5 Cluster: Chromosome 21 SCAF15012, whole genome s... 198 3e-49
UniRef50_Q4SKF7 Cluster: Chromosome 13 SCAF14566, whole genome s... 194 5e-48
UniRef50_Q4RQK9 Cluster: Chromosome 2 SCAF15004, whole genome sh... 192 2e-47
UniRef50_UPI0001554C17 Cluster: PREDICTED: similar to Polypeptid... 192 3e-47
UniRef50_UPI0000E46EB4 Cluster: PREDICTED: similar to MGC81846 p... 184 8e-45
UniRef50_Q6YK77 Cluster: UDP-N-acetyl-D-galactosamine:polypeptid... 178 4e-43
UniRef50_UPI000065D57A Cluster: Putative polypeptide N-acetylgal... 174 5e-42
UniRef50_UPI0000E46551 Cluster: PREDICTED: hypothetical protein,... 173 1e-41
UniRef50_Q5CYR4 Cluster: Extracellular protein with a signal pep... 169 2e-40
UniRef50_Q4TCW9 Cluster: Chromosome undetermined SCAF6660, whole... 163 9e-39
UniRef50_UPI0000D8AB1E Cluster: UDP-N-acetyl-alpha-D-galactosami... 159 2e-37
UniRef50_UPI0000E234D0 Cluster: PREDICTED: similar to UDP-GalNAc... 158 4e-37
UniRef50_Q5CHA1 Cluster: Glycosyl transferase; n=4; Cryptosporid... 149 2e-34
UniRef50_UPI0000F20FAB Cluster: PREDICTED: hypothetical protein;... 145 3e-33
UniRef50_UPI00005A4710 Cluster: PREDICTED: similar to GalNAc tra... 144 6e-33
UniRef50_Q5BYW7 Cluster: SJCHGC07375 protein; n=1; Schistosoma j... 116 1e-24
UniRef50_Q4RNJ6 Cluster: Chromosome 21 SCAF15012, whole genome s... 113 2e-23
UniRef50_UPI0000E49DD9 Cluster: PREDICTED: hypothetical protein;... 111 6e-23
UniRef50_Q4T9B6 Cluster: Chromosome undetermined SCAF7602, whole... 103 1e-20
UniRef50_UPI0000E46BB8 Cluster: PREDICTED: similar to UDP-GalNAc... 102 2e-20
UniRef50_Q2B871 Cluster: Glycosyl transferase, group 2 family pr... 95 3e-18
UniRef50_Q4T0W3 Cluster: Chromosome undetermined SCAF10824, whol... 93 1e-17
UniRef50_UPI0000F1FCC6 Cluster: PREDICTED: hypothetical protein;... 91 6e-17
UniRef50_Q3A8Y2 Cluster: Glycosyl transferase, group 2 family; n... 91 6e-17
UniRef50_Q4SIA0 Cluster: Chromosome 5 SCAF14581, whole genome sh... 89 3e-16
UniRef50_Q7Q046 Cluster: ENSANGP00000016624; n=1; Anopheles gamb... 88 7e-16
UniRef50_Q4STJ6 Cluster: Chromosome undetermined SCAF14183, whol... 82 5e-14
UniRef50_UPI000155C133 Cluster: PREDICTED: similar to UDP-N-acet... 80 1e-13
UniRef50_UPI00005A4DE7 Cluster: PREDICTED: similar to Probable p... 78 6e-13
UniRef50_A5D6B4 Cluster: Predicted glycosyltransferases; n=1; Pe... 76 2e-12
UniRef50_Q3A8Y3 Cluster: Glycosyl transferase, group 2 family; n... 74 9e-12
UniRef50_A7T195 Cluster: Predicted protein; n=1; Nematostella ve... 71 6e-11
UniRef50_Q4RAK4 Cluster: Chromosome undetermined SCAF23488, whol... 69 3e-10
UniRef50_Q01V98 Cluster: Glycosyl transferase, family 2; n=1; So... 69 3e-10
UniRef50_Q5BZK1 Cluster: SJCHGC04837 protein; n=1; Schistosoma j... 68 6e-10
UniRef50_A3Z180 Cluster: Glycosyltransferase; n=1; Synechococcus... 64 7e-09
UniRef50_Q4RPK0 Cluster: Chromosome 12 SCAF15007, whole genome s... 63 2e-08
UniRef50_Q01PS5 Cluster: Glycosyl transferase, family 2 precurso... 61 9e-08
UniRef50_Q68VJ7 Cluster: Polypeptide N-acetylgalactosaminyltrans... 59 3e-07
UniRef50_Q4SU00 Cluster: Chromosome undetermined SCAF14054, whol... 59 4e-07
UniRef50_A1BDD3 Cluster: Glycosyl transferase, family 2; n=1; Ch... 59 4e-07
UniRef50_A4J8G0 Cluster: Glycosyl transferase, family 2; n=1; De... 58 5e-07
UniRef50_A7DQM1 Cluster: Glycosyl transferase, family 2; n=1; Ca... 56 3e-06
UniRef50_Q8TNV8 Cluster: Glycosyltransferase; n=1; Methanosarcin... 56 3e-06
UniRef50_Q2S1Y8 Cluster: Glycosyl transferase, group 2 family pr... 55 4e-06
UniRef50_Q0S4J4 Cluster: Possible glycosyltransferase; n=2; Cory... 54 8e-06
UniRef50_Q4TDW9 Cluster: Chromosome undetermined SCAF5986, whole... 54 1e-05
UniRef50_Q8FSL1 Cluster: Putative uncharacterized protein; n=1; ... 54 1e-05
UniRef50_A1AUA8 Cluster: Glycosyl transferase, family 2; n=1; Pe... 54 1e-05
UniRef50_Q5LF31 Cluster: Putative glycosyltransferase O-antigen ... 53 2e-05
UniRef50_Q02W60 Cluster: Glycosyltransferase; n=2; Lactococcus l... 52 3e-05
UniRef50_Q8U2R3 Cluster: Glycosyl transferase; n=2; Pyrococcus|R... 52 3e-05
UniRef50_Q1ARC2 Cluster: Glycosyl transferase, family 2; n=1; Ru... 52 4e-05
UniRef50_Q5CY12 Cluster: UDP-N-acetylgalactosamine: polypeptide ... 52 4e-05
UniRef50_Q1MNP5 Cluster: Predicted glycosyltransferases; n=1; La... 52 6e-05
UniRef50_A0Z0H0 Cluster: Glycosyl transferase, family 2:Glycosyl... 52 6e-05
UniRef50_A1ALI9 Cluster: Glycosyl transferase, family 2; n=1; Pe... 51 1e-04
UniRef50_Q54Q41 Cluster: Putative glycosyltransferase; n=1; Dict... 51 1e-04
UniRef50_Q0VR09 Cluster: Glycosyl transferase, group 2 family pr... 50 1e-04
UniRef50_A5D4N5 Cluster: Hypothetical glycosyltransferase; n=1; ... 50 1e-04
UniRef50_Q2JDU9 Cluster: Glycosyl transferase, family 2; n=3; Fr... 50 2e-04
UniRef50_Q4JYX6 Cluster: Putative glycosyl transferase; n=4; Str... 50 2e-04
UniRef50_Q1PVM1 Cluster: Similar to family 2 glycosyltransferase... 50 2e-04
UniRef50_A5ZFA1 Cluster: Putative uncharacterized protein; n=1; ... 50 2e-04
UniRef50_Q8U0I3 Cluster: Dolichol-phosphate mannose synthase; n=... 50 2e-04
UniRef50_Q4K1A7 Cluster: Putative glycosyl transferase; n=1; Str... 49 3e-04
UniRef50_A5FFS1 Cluster: Glycosyl transferase, family 2; n=15; B... 49 3e-04
UniRef50_A7BZS1 Cluster: Glycosyl transferase, family 2; n=1; Be... 49 4e-04
UniRef50_Q04RA2 Cluster: Glycosyltransferase; n=4; Leptospira|Re... 48 5e-04
UniRef50_A4F5Y3 Cluster: Glycosyl transferase; n=1; Saccharopoly... 48 5e-04
UniRef50_Q82D87 Cluster: Putative glycosyltransferase; n=5; Stre... 48 7e-04
UniRef50_A6PMY9 Cluster: Glycosyl transferase, family 2; n=1; Vi... 48 7e-04
UniRef50_A1SDW3 Cluster: Glycosyl transferase, family 2; n=1; No... 48 7e-04
UniRef50_Q8TRJ1 Cluster: Glycosyltransferase group 2 family prot... 48 7e-04
UniRef50_Q027Q8 Cluster: Glycosyl transferase, family 2; n=1; So... 48 9e-04
UniRef50_Q4K1D1 Cluster: Putative glycosyl transferase; n=1; Str... 47 0.001
UniRef50_Q1N7I9 Cluster: Predicted glycosyltransferase; n=1; Sph... 47 0.001
UniRef50_A6NTT8 Cluster: Putative uncharacterized protein; n=1; ... 47 0.001
UniRef50_UPI0000ECF595 Cluster: UPI0000ECF595 related cluster; n... 47 0.002
UniRef50_Q8A2D5 Cluster: Putative glycosyltransferase yibD; n=1;... 47 0.002
UniRef50_Q2CER4 Cluster: Glycosyltransferase; n=1; Oceanicola gr... 47 0.002
UniRef50_A7LVT4 Cluster: Putative uncharacterized protein; n=1; ... 47 0.002
UniRef50_A6XVK9 Cluster: Glycosyl transferase, group 2 family pr... 47 0.002
UniRef50_A5UT62 Cluster: Glycosyl transferase, family 2; n=5; Ch... 47 0.002
UniRef50_A4AYH1 Cluster: Dolichyl-phosphate mannose synthase rel... 47 0.002
UniRef50_A3Y6T7 Cluster: Putative uncharacterized protein; n=1; ... 47 0.002
UniRef50_Q9AK48 Cluster: Putative glycosyltransferase; n=2; Stre... 46 0.002
UniRef50_Q8XN57 Cluster: Beta-1,4-galactosyltransferase; n=1; Cl... 46 0.002
UniRef50_Q1BYJ1 Cluster: Methyltransferase FkbM; n=2; Burkholder... 46 0.002
UniRef50_A6ELC1 Cluster: TuaG; n=1; unidentified eubacterium SCB... 46 0.002
UniRef50_A1GD35 Cluster: Ricin B lectin precursor; n=2; Salinisp... 46 0.002
UniRef50_Q4JZ78 Cluster: Putative glycosyl transferase; n=1; Str... 46 0.003
UniRef50_A7B4B8 Cluster: Putative uncharacterized protein; n=1; ... 46 0.003
UniRef50_A6M2B0 Cluster: Glycosyl transferase, family 2; n=1; Cl... 46 0.003
UniRef50_A3W6W1 Cluster: Glycosyltransferase; n=2; Roseovarius|R... 46 0.003
UniRef50_A3S9H6 Cluster: Putative uncharacterized protein; n=2; ... 46 0.003
UniRef50_A1ASN8 Cluster: Glycosyl transferase, family 2; n=1; Pe... 46 0.003
UniRef50_P0C0H0 Cluster: Hyaluronan synthase; n=17; Streptococcu... 46 0.003
UniRef50_Q8YNR8 Cluster: Alr4493 protein; n=3; Nostocaceae|Rep: ... 46 0.004
UniRef50_Q8A2D9 Cluster: Putative glycosyltransferase yibD; n=1;... 46 0.004
UniRef50_Q748H9 Cluster: Glycosyl transferase, group 1/2 family ... 46 0.004
UniRef50_Q2IYC9 Cluster: Glycosyl transferase, family 2; n=1; Rh... 46 0.004
UniRef50_A0L7J6 Cluster: Glycosyl transferase, family 2; n=4; Ba... 46 0.004
UniRef50_Q97Q65 Cluster: Glycosyl transferase, family 2; n=13; S... 45 0.005
UniRef50_Q8PGL8 Cluster: Truncated O-antigen biosynthesis protei... 45 0.005
UniRef50_Q311W2 Cluster: Glycosyl transferase, group 2 family pr... 45 0.005
UniRef50_Q4IU61 Cluster: Glycosyl transferase, family 2; n=1; Az... 45 0.005
UniRef50_A6L5V6 Cluster: Glycosyltransferase family 2; n=1; Bact... 45 0.005
UniRef50_P75905 Cluster: Biofilm PGA synthesis N-glycosyltransfe... 45 0.005
UniRef50_Q8FPM5 Cluster: Putative uncharacterized protein; n=1; ... 45 0.006
UniRef50_A7AD36 Cluster: Putative uncharacterized protein; n=2; ... 45 0.006
UniRef50_A6PMZ0 Cluster: Glycosyl transferase, family 2; n=1; Vi... 45 0.006
UniRef50_A3W9Y6 Cluster: Glucosyltransferase; n=1; Erythrobacter... 45 0.006
UniRef50_Q8UCW1 Cluster: UDP-hexose transferase; n=1; Agrobacter... 44 0.008
UniRef50_Q64Q34 Cluster: Putative glycosyltransferase; n=1; Bact... 44 0.008
UniRef50_Q1IPB3 Cluster: Glycosyl transferase, family 2; n=2; Ba... 44 0.008
UniRef50_A0W5F3 Cluster: Glycosyl transferase, family 2; n=1; Ge... 44 0.008
UniRef50_Q1MS29 Cluster: Predicted glycosyltransferases; n=1; La... 44 0.011
UniRef50_A6W918 Cluster: Glycosyl transferase family 2; n=1; Kin... 44 0.011
UniRef50_A5ZFA7 Cluster: Putative uncharacterized protein; n=1; ... 44 0.011
UniRef50_A5D3A4 Cluster: Predicted glycosyltransferases; n=1; Pe... 44 0.011
UniRef50_A0V2D1 Cluster: Glycosyl transferase, family 2; n=1; Cl... 44 0.011
UniRef50_O32268 Cluster: Putative teichuronic acid biosynthesis ... 44 0.011
UniRef50_Q2KA13 Cluster: Putative beta-D-1,6 glucosyltransferase... 44 0.015
UniRef50_Q83VE8 Cluster: EpsN; n=1; Lactococcus lactis subsp. cr... 44 0.015
UniRef50_Q1PUL2 Cluster: Putative uncharacterized protein; n=1; ... 44 0.015
UniRef50_Q1FJU1 Cluster: Glycosyl transferase, family 2:CDP-glyc... 44 0.015
UniRef50_A7BMC2 Cluster: Glycosyl transferase, group 2 family; n... 44 0.015
UniRef50_A5FSS4 Cluster: Histidinol-phosphate phosphatase family... 44 0.015
UniRef50_A0VWC2 Cluster: Glycosyl transferase, family 2; n=3; Rh... 44 0.015
UniRef50_Q8PUV7 Cluster: Glycosyltransferase; n=3; Euryarchaeota... 44 0.015
UniRef50_A0RYV6 Cluster: Dolichol-phosphate mannosyltransferase;... 44 0.015
UniRef50_Q97GN9 Cluster: Glycosyltransferase domain containing p... 43 0.019
UniRef50_Q4UUG0 Cluster: O-antigen biosynthesis protein; n=2; Xa... 43 0.019
UniRef50_Q6TDC7 Cluster: WaaV; n=42; Campylobacter|Rep: WaaV - C... 43 0.019
UniRef50_Q0YLA6 Cluster: Glycosyl transferase, family 2; n=2; Ge... 43 0.019
UniRef50_Q0S4J5 Cluster: Possible glycosyltransferase; n=2; Cory... 43 0.019
UniRef50_P74948 Cluster: ORF35x9 protein; n=4; Vibrio cholerae|R... 43 0.019
UniRef50_A7NHD1 Cluster: Glycosyl transferase family 2; n=1; Ros... 43 0.019
UniRef50_A7CSW3 Cluster: Glycosyl transferase family 2; n=1; Opi... 43 0.019
UniRef50_A6BIG1 Cluster: Putative uncharacterized protein; n=1; ... 43 0.019
UniRef50_Q5LBM4 Cluster: Putative glycosyltransferase protein; n... 43 0.026
UniRef50_Q9X363 Cluster: PXO1-93; n=7; Bacilli|Rep: PXO1-93 - Ba... 43 0.026
UniRef50_Q04TX6 Cluster: UndP-glycosyltransferase; n=2; Leptospi... 43 0.026
UniRef50_A7NQ51 Cluster: Glycosyl transferase family 2; n=1; Ros... 43 0.026
UniRef50_A6C910 Cluster: Truncated O-antigen biosynthesis protei... 43 0.026
UniRef50_A3U4F3 Cluster: Putative fucosyl transferase; n=1; Croc... 43 0.026
UniRef50_A7RJ47 Cluster: Predicted protein; n=1; Nematostella ve... 43 0.026
UniRef50_Q57022 Cluster: Uncharacterized glycosyltransferase HI0... 43 0.026
UniRef50_Q3ADF7 Cluster: Glycosyl transferase, group 2 family; n... 42 0.034
UniRef50_Q1Q4P0 Cluster: Similar to glycosyltransferase family 2... 42 0.034
UniRef50_Q15RB7 Cluster: Glycosyl transferase, family 2; n=1; Ps... 42 0.034
UniRef50_Q027L5 Cluster: Glycosyl transferase, family 2; n=1; So... 42 0.034
UniRef50_A6L4H5 Cluster: Glycosyltransferase family 2; n=1; Bact... 42 0.034
UniRef50_A4AE68 Cluster: Glycosyltransferase family protein; n=1... 42 0.034
UniRef50_A0YST2 Cluster: Putative uncharacterized protein; n=1; ... 42 0.034
UniRef50_Q2NHK4 Cluster: Predicted glycosyltransferase; n=2; cel... 42 0.034
UniRef50_Q8ABR6 Cluster: Putative glycosyltransferase; n=1; Bact... 42 0.045
UniRef50_Q3ASS6 Cluster: Teichuronic acid biosynthesis; n=1; Chl... 42 0.045
UniRef50_Q2RQ74 Cluster: Glycosyl transferase, family 2; n=1; Rh... 42 0.045
UniRef50_Q9S5F4 Cluster: Glycosyl transferase; n=7; Enterobacter... 42 0.045
UniRef50_Q8GPA4 Cluster: Eps7G; n=1; Streptococcus thermophilus|... 42 0.045
UniRef50_Q4MJH5 Cluster: TagF domain protein; n=2; Bacillaceae|R... 42 0.045
UniRef50_Q1PZ07 Cluster: Putative uncharacterized protein; n=1; ... 42 0.045
UniRef50_Q052L4 Cluster: Glycosyltransferase; n=10; Leptospira|R... 42 0.045
UniRef50_A7AFY8 Cluster: Putative uncharacterized protein; n=1; ... 42 0.045
UniRef50_A4EQH0 Cluster: Probable glycosyltransferase protein; n... 42 0.045
UniRef50_A3HRI5 Cluster: Glycosyl transferase, family 2; n=1; Al... 42 0.045
UniRef50_A1G8G0 Cluster: Glycosyl transferase, family 2; n=4; Ac... 42 0.045
UniRef50_A4G0E0 Cluster: Glycosyl transferase, family 2; n=2; ce... 42 0.045
UniRef50_Q64W33 Cluster: Glycosyltransferase; n=1; Bacteroides f... 42 0.059
UniRef50_Q47P42 Cluster: Probable glycosyltransferase; n=1; Ther... 42 0.059
UniRef50_A6Q1D6 Cluster: Glucosaminyltransferase; n=1; Nitratiru... 42 0.059
UniRef50_A3QCT5 Cluster: Glycosyl transferase, family 2; n=1; Sh... 42 0.059
UniRef50_A3D6K1 Cluster: Glycosyl transferase, family 2; n=1; Sh... 42 0.059
UniRef50_A3CKM9 Cluster: Glycosyltransferase, family 2/glycosylt... 42 0.059
UniRef50_A0JQU1 Cluster: Glycosyl transferase, family 2; n=2; Ar... 42 0.059
UniRef50_Q4KF36 Cluster: Glycosyl transferase, group 2 family pr... 41 0.078
UniRef50_A4J3H2 Cluster: Glycosyl transferase, family 2; n=1; De... 41 0.078
UniRef50_A1TVF9 Cluster: Glycosyl transferase, family 2; n=1; Ac... 41 0.078
UniRef50_A0A016 Cluster: MoeGT3; n=1; Streptomyces ghanaensis|Re... 41 0.078
UniRef50_A7RJ49 Cluster: Predicted protein; n=1; Nematostella ve... 41 0.078
UniRef50_UPI0000DD81B4 Cluster: PREDICTED: similar to Polypeptid... 41 0.10
UniRef50_Q9A6L6 Cluster: Glycosyl transferase family protein; n=... 41 0.10
UniRef50_Q8XN34 Cluster: Spore coat polysaccharide biosynthesis ... 41 0.10
UniRef50_Q3B487 Cluster: Glucosaminyltransferase; n=2; Chlorobiu... 41 0.10
UniRef50_Q7BG51 Cluster: WsaE; n=1; Geobacillus stearothermophil... 41 0.10
UniRef50_A6LCI9 Cluster: Glycosyltransferase family 2; n=1; Para... 41 0.10
UniRef50_A6BIH4 Cluster: Putative uncharacterized protein; n=1; ... 41 0.10
UniRef50_A5FN38 Cluster: Glycosyl transferase, family 2; n=1; Fl... 41 0.10
UniRef50_A3ZV34 Cluster: Glycosyl transferase, group 2 family pr... 41 0.10
UniRef50_A0L591 Cluster: Glycosyl transferase, family 2; n=1; Ma... 41 0.10
UniRef50_A0GYU7 Cluster: Glycosyl transferase, family 2; n=4; Ch... 41 0.10
UniRef50_A7DSA8 Cluster: Glycosyl transferase, family 2; n=1; Ca... 41 0.10
UniRef50_Q82KX7 Cluster: Putative galactosamine-containing minor... 40 0.14
UniRef50_Q7U947 Cluster: Putative uncharacterized protein; n=1; ... 40 0.14
UniRef50_Q2JCN5 Cluster: Glycosyl transferase, family 2 precurso... 40 0.14
UniRef50_Q15RB6 Cluster: Glycosyl transferase, family 2; n=1; Ps... 40 0.14
UniRef50_Q13KT8 Cluster: Putative glycosyl transferase; n=1; Bur... 40 0.14
UniRef50_Q0VR08 Cluster: Glycosyl transferase, putative; n=1; Al... 40 0.14
UniRef50_A6M2M4 Cluster: Glycosyl transferase, family 2; n=1; Cl... 40 0.14
UniRef50_A4EJ81 Cluster: Sugar transferase-putative a glycosyl t... 40 0.14
UniRef50_A3I1X7 Cluster: Exopolysaccharide biosynthesis protein,... 40 0.14
UniRef50_A1IAI5 Cluster: Glycosyl transferase; n=1; Candidatus D... 40 0.14
UniRef50_A0YT86 Cluster: Glycosyltransferase; n=1; Lyngbya sp. P... 40 0.14
UniRef50_A0LCS4 Cluster: Glycosyl transferase, family 2; n=1; Ma... 40 0.14
UniRef50_A0KM86 Cluster: Glycosyl transferase, group 2 family pr... 40 0.14
UniRef50_Q5JIZ9 Cluster: Glycosyltransferase, family 2; n=2; The... 40 0.14
UniRef50_Q9A618 Cluster: Glycosyl transferase family protein; n=... 40 0.18
UniRef50_Q88TX5 Cluster: Glycosyltransferase; n=1; Lactobacillus... 40 0.18
UniRef50_Q64NZ8 Cluster: Putative glycosyltransferase; n=2; Bact... 40 0.18
UniRef50_Q5P2T1 Cluster: Glycosyl transferase; n=1; Azoarcus sp.... 40 0.18
UniRef50_Q2K5C3 Cluster: Probable glycosyltransferase protein; n... 40 0.18
UniRef50_Q9ZFD7 Cluster: O-antigen biosynthesis protein RbfC; n=... 40 0.18
UniRef50_Q1NA00 Cluster: Putative glycosyltransferase; n=1; Sphi... 40 0.18
UniRef50_Q116A2 Cluster: Glycosyl transferase, group 1; n=2; cel... 40 0.18
UniRef50_A7NFP6 Cluster: Glycosyl transferase family 2; n=1; Ros... 40 0.18
UniRef50_A7LRY9 Cluster: Putative uncharacterized protein; n=1; ... 40 0.18
UniRef50_A7HI30 Cluster: Glycosyl transferase family 2 precursor... 40 0.18
UniRef50_A7BXP2 Cluster: Glycosyl transferase, family 2; n=1; Be... 40 0.18
UniRef50_A6WEW4 Cluster: Glycosyl transferase family 2; n=3; Act... 40 0.18
UniRef50_A6UIJ8 Cluster: Glycosyl transferase family 2; n=3; Rhi... 40 0.18
UniRef50_A4F0P6 Cluster: Glycosyltransferase; n=1; Roseobacter s... 40 0.18
UniRef50_Q5V4T8 Cluster: Succinoglycan biosynthesis protein; n=4... 40 0.18
UniRef50_Q0W1H0 Cluster: Putative glycosyltransferase; n=2; uncu... 40 0.18
UniRef50_Q45539 Cluster: Putative glycosyltransferase csbB; n=26... 40 0.18
UniRef50_Q8A821 Cluster: Glycosyltransferase; n=1; Bacteroides t... 40 0.24
UniRef50_Q73MS4 Cluster: Glycosyl transferase, group 2 family pr... 40 0.24
UniRef50_Q47DD2 Cluster: Glycosyl transferase, family 2:Polysacc... 40 0.24
UniRef50_Q4ALR3 Cluster: Glycosyl transferase, family 2; n=2; Ch... 40 0.24
UniRef50_Q1L2K4 Cluster: Glucosyltransferase; n=2; Streptomyces ... 40 0.24
UniRef50_Q0C2C3 Cluster: Glycosyl transferase, group 2 family pr... 40 0.24
UniRef50_Q0BWS4 Cluster: Glycosyl transferase, group 2 family pr... 40 0.24
UniRef50_Q03A90 Cluster: Glycosyltransferase related enzyme; n=3... 40 0.24
UniRef50_A7D8V4 Cluster: Glycosyl transferase, family 2; n=2; Me... 40 0.24
UniRef50_A6CCQ7 Cluster: Glycosyltransferase; n=1; Planctomyces ... 40 0.24
UniRef50_A5V1J5 Cluster: Glycosyl transferase, family 2; n=35; B... 40 0.24
UniRef50_A5P6M1 Cluster: Putative uncharacterized protein; n=1; ... 40 0.24
UniRef50_A5KMN1 Cluster: Putative uncharacterized protein; n=1; ... 40 0.24
UniRef50_A4BPR7 Cluster: Glycosyl transferase, group 2 family pr... 40 0.24
UniRef50_A4AVC0 Cluster: Glycosyl transferase; n=5; Bacteroidete... 40 0.24
UniRef50_A0YT87 Cluster: Glycosyl transferase; n=1; Lyngbya sp. ... 40 0.24
UniRef50_A0W4R3 Cluster: Glycosyl transferase, family 2; n=1; Ge... 40 0.24
UniRef50_Q8ZWD5 Cluster: Glycosyl transferase, putative; n=4; Py... 40 0.24
UniRef50_A3CRX6 Cluster: Glycosyl transferase, family 2; n=1; Me... 40 0.24
UniRef50_Q7UND3 Cluster: Putative uncharacterized protein; n=1; ... 39 0.31
UniRef50_Q9AH91 Cluster: WciV; n=3; Streptococcus pneumoniae|Rep... 39 0.31
UniRef50_Q15QW5 Cluster: Glycosyl transferase, family 2; n=1; Ps... 39 0.31
UniRef50_Q03KL3 Cluster: Glycosyltransferase involved in cell wa... 39 0.31
UniRef50_A4X6N0 Cluster: Glycosyl transferase, family 2; n=1; Sa... 39 0.31
UniRef50_A4AYI1 Cluster: Glycosyltransferase; n=1; Alteromonas m... 39 0.31
UniRef50_A0QSC1 Cluster: Probable membrane sugar transferase; n=... 39 0.31
UniRef50_A0LST8 Cluster: Glycosyl transferase, family 2; n=1; Ac... 39 0.31
UniRef50_A5UMV5 Cluster: Glycosyltransferase, GT2 family; n=1; M... 39 0.31
UniRef50_Q65EK9 Cluster: YveT; n=3; Bacillus|Rep: YveT - Bacillu... 39 0.42
UniRef50_Q2JCN6 Cluster: Glycosyl transferase, family 2; n=2; Fr... 39 0.42
UniRef50_Q9AFH3 Cluster: CpsVJ; n=5; Streptococcus agalactiae|Re... 39 0.42
UniRef50_A6WE95 Cluster: Glycosyl transferase family 2; n=1; Kin... 39 0.42
UniRef50_A6NTT3 Cluster: Putative uncharacterized protein; n=1; ... 39 0.42
UniRef50_A3HYL0 Cluster: Glucosaminyltransferase; n=1; Algoripha... 39 0.42
UniRef50_A3ERW8 Cluster: Glycosyltransferase; n=1; Leptospirillu... 39 0.42
UniRef50_A1ZSA7 Cluster: Putative uncharacterized protein; n=1; ... 39 0.42
UniRef50_A1HM87 Cluster: Glycosyl transferase, family 2; n=2; Ba... 39 0.42
UniRef50_A1A3J7 Cluster: Glycosyl transferase; n=1; Bifidobacter... 39 0.42
UniRef50_A0W3Z1 Cluster: Glycosyl transferase, family 2; n=1; Ge... 39 0.42
UniRef50_A5UMV6 Cluster: Predicted glycosyltransferase, GT2 fami... 39 0.42
UniRef50_A3MTE5 Cluster: Glycosyl transferase, family 2; n=1; Py... 39 0.42
UniRef50_Q319Q2 Cluster: Putative uncharacterized protein; n=1; ... 38 0.55
UniRef50_A6D0H3 Cluster: Bactoprenol glucosyl transferase; n=1; ... 38 0.55
UniRef50_A4J4W8 Cluster: Glycosyl transferase, family 2; n=1; De... 38 0.55
UniRef50_A3N3P0 Cluster: Putative uncharacterized protein; n=2; ... 38 0.55
UniRef50_A1W338 Cluster: Glycosyl transferase, family 2 precurso... 38 0.55
UniRef50_A0YT83 Cluster: Putative uncharacterized protein; n=1; ... 38 0.55
UniRef50_A0P222 Cluster: Putative uncharacterized protein; n=1; ... 38 0.55
UniRef50_A0LEJ6 Cluster: Glycosyl transferase, family 2; n=2; Sy... 38 0.55
UniRef50_A5UNM1 Cluster: Glycosyltransferase/CDP-glycerol:poly(G... 38 0.55
UniRef50_A5UMV0 Cluster: Glycosyltransferase, GT2 family; n=2; M... 38 0.55
UniRef50_UPI0001597BBE Cluster: hypothetical protein RBAM_032950... 38 0.73
UniRef50_Q97H42 Cluster: Predicted glycosyltransferase; n=1; Clo... 38 0.73
UniRef50_Q8YW79 Cluster: All1737 protein; n=8; Cyanobacteria|Rep... 38 0.73
UniRef50_Q8ABR4 Cluster: Putative glycosyltransferase; n=1; Bact... 38 0.73
UniRef50_Q2S3U8 Cluster: Rhamnosyl transferase; n=1; Salinibacte... 38 0.73
UniRef50_Q2LPU6 Cluster: Glycosyltransferase; n=1; Syntrophus ac... 38 0.73
UniRef50_Q9AGZ0 Cluster: Putative sugar transferase; n=1; Aneuri... 38 0.73
UniRef50_Q7P6D4 Cluster: Dolichol-phosphate mannosyltransferase;... 38 0.73
UniRef50_Q183K7 Cluster: Putative teichuronic acid biosynthesis ... 38 0.73
UniRef50_A5ZX72 Cluster: Putative uncharacterized protein; n=1; ... 38 0.73
UniRef50_A4F9K9 Cluster: Dolichol-phosphate mannosyltransferase;... 38 0.73
UniRef50_A4BIW6 Cluster: Glycosyltransferase; n=1; Reinekea sp. ... 38 0.73
UniRef50_A4B5Y6 Cluster: Glycosyltransferase; n=1; Alteromonas m... 38 0.73
UniRef50_A4A2A1 Cluster: Glycosyl transferase, family 2; n=1; Bl... 38 0.73
UniRef50_A3U4F4 Cluster: Glycosyl transferase, group 2 family pr... 38 0.73
UniRef50_A3TST7 Cluster: Sugar transferase; n=1; Oceanicola bats... 38 0.73
UniRef50_A1HMB8 Cluster: Glycosyl transferase, family 2; n=1; Th... 38 0.73
UniRef50_Q46FX2 Cluster: Putative uncharacterized protein; n=1; ... 38 0.73
UniRef50_A0RWZ2 Cluster: Glycosyltransferase involved in cell wa... 38 0.73
UniRef50_P55465 Cluster: Uncharacterized protein y4gI; n=1; Rhiz... 38 0.73
UniRef50_P28590 Cluster: Abrin-c precursor [Contains: Abrin-c A ... 38 0.73
UniRef50_UPI000069E575 Cluster: Polypeptide N-acetylgalactosamin... 38 0.96
UniRef50_Q98GK8 Cluster: Mll3280 protein; n=1; Mesorhizobium lot... 38 0.96
UniRef50_Q8KAF3 Cluster: Glycosyl transferase; n=1; Chlorobaculu... 38 0.96
UniRef50_Q7NYW2 Cluster: Probable glycosyl transferase; n=1; Chr... 38 0.96
UniRef50_Q74D09 Cluster: Glycosyl transferase, group 2 family pr... 38 0.96
UniRef50_Q4KG44 Cluster: Glycosyl transferase, group 2 family pr... 38 0.96
UniRef50_Q27GR1 Cluster: Putative galactocerebrosidase; n=1; Act... 38 0.96
UniRef50_Q26CJ3 Cluster: Glycosyl transferase; n=1; Flavobacteri... 38 0.96
UniRef50_Q1J349 Cluster: Glycosyl transferase, family 2; n=2; De... 38 0.96
UniRef50_Q11M89 Cluster: Glycosyl transferase, family 2; n=1; Me... 38 0.96
UniRef50_Q0VM55 Cluster: Glycosyltransferase; n=1; Alcanivorax b... 38 0.96
UniRef50_Q03MS9 Cluster: Glycosyltransferase, probably involved ... 38 0.96
UniRef50_O06035 Cluster: EpsG; n=1; Lactococcus lactis subsp. cr... 38 0.96
UniRef50_O05200 Cluster: Glycosyltransferase; n=4; Rhizobiales|R... 38 0.96
UniRef50_A7CUB6 Cluster: Glycosyl transferase family 2; n=1; Opi... 38 0.96
UniRef50_A6VYP6 Cluster: Glycosyl transferase family 2; n=12; Ga... 38 0.96
UniRef50_A4SKW8 Cluster: Glycosyl transferase, group 2 family pr... 38 0.96
UniRef50_A3ZLM2 Cluster: Glycosyltransferase; n=3; Bacteria|Rep:... 38 0.96
UniRef50_A0LNM4 Cluster: Glycosyl transferase, family 2; n=1; Sy... 38 0.96
UniRef50_Q9HQP4 Cluster: Putative uncharacterized protein; n=1; ... 38 0.96
UniRef50_A1RWN5 Cluster: Glycosyl transferase, family 2; n=1; Th... 38 0.96
UniRef50_A4X219 Cluster: CDP-glycerol:poly(Glycerophosphate) gly... 37 1.3
UniRef50_A4FCB8 Cluster: Probable membrane sugar transferase; n=... 37 1.3
UniRef50_A0RPD7 Cluster: Sugar transferase; n=1; Campylobacter f... 37 1.3
UniRef50_O29674 Cluster: Dolichol-P-glucose synthetase, putative... 37 1.3
UniRef50_A3CVX3 Cluster: Glycosyl transferase, family 2; n=1; Me... 37 1.3
UniRef50_P26024 Cluster: N-acetylglucosaminyltransferase; n=286;... 37 1.3
UniRef50_UPI00015C4F33 Cluster: glycosyl transferase, group 2 fa... 37 1.7
UniRef50_Q8YYD5 Cluster: All0915 protein; n=3; Cyanobacteria|Rep... 37 1.7
UniRef50_Q83DW4 Cluster: Glycosyl transferase, group 2 family pr... 37 1.7
UniRef50_Q72XI4 Cluster: Glycosyl transferase, group 2 family pr... 37 1.7
UniRef50_Q5P2A3 Cluster: Glycosyl transferase, family 2; n=1; Az... 37 1.7
UniRef50_Q39U31 Cluster: Glycosyl transferase, family 2; n=1; Ge... 37 1.7
UniRef50_Q21BJ4 Cluster: Glycosyl transferase, family 2; n=1; Rh... 37 1.7
UniRef50_Q1VNP0 Cluster: Glycosyltransferase; n=1; Psychroflexus... 37 1.7
UniRef50_Q1GJ85 Cluster: Glycosyl transferase family 2; n=4; Rho... 37 1.7
UniRef50_Q01PM1 Cluster: Glycosyl transferase, family 2; n=2; Ba... 37 1.7
UniRef50_A6PP74 Cluster: Glycosyl transferase, family 2 precurso... 37 1.7
UniRef50_A6ERL9 Cluster: Glycosyltransferase involved in cell wa... 37 1.7
UniRef50_A6CCC4 Cluster: Glycosyltransferase, group 2 family pro... 37 1.7
UniRef50_A6BH37 Cluster: Putative uncharacterized protein; n=1; ... 37 1.7
UniRef50_A6BH35 Cluster: Putative uncharacterized protein; n=1; ... 37 1.7
UniRef50_A5G7E0 Cluster: Glycosyl transferase, family 2; n=1; Ge... 37 1.7
UniRef50_A5G2M9 Cluster: Glycosyl transferase, family 2; n=1; Ac... 37 1.7
UniRef50_A1UHQ6 Cluster: Glycosyl transferase, family 2; n=2; My... 37 1.7
UniRef50_A0YLN7 Cluster: Glycosyl transferase, group 2 family pr... 37 1.7
UniRef50_Q8ZZ63 Cluster: Glycosyl transferase, putative; n=3; Py... 37 1.7
UniRef50_Q8U168 Cluster: Glycosyl transferase; n=1; Pyrococcus f... 37 1.7
UniRef50_Q2FTA5 Cluster: Glycosyl transferase, family 2; n=2; Me... 37 1.7
UniRef50_Q9F324 Cluster: Putative glycosyl transferase; n=1; Str... 36 2.2
UniRef50_Q629X5 Cluster: Glycosyl transferase, group 2 family pr... 36 2.2
UniRef50_Q6I3S4 Cluster: Glycosyl transferase, group 2 family pr... 36 2.2
UniRef50_Q4K0T1 Cluster: Putative glycosyl transferase; n=1; Str... 36 2.2
UniRef50_A6SYV8 Cluster: Glucosyl transferase; n=6; Proteobacter... 36 2.2
UniRef50_A6M2A8 Cluster: Glycosyl transferase, family 2; n=1; Cl... 36 2.2
UniRef50_A5ZX61 Cluster: Putative uncharacterized protein; n=2; ... 36 2.2
UniRef50_A2RHU0 Cluster: Similar to glycosyl transferase; n=1; L... 36 2.2
UniRef50_A1R2V1 Cluster: Glycosyl transferase, group 2 family do... 36 2.2
UniRef50_A1FVF9 Cluster: Glycosyl transferase, family 2; n=1; St... 36 2.2
UniRef50_Q97YM4 Cluster: N-acetylglucosaminyltransferase related... 36 2.2
UniRef50_Q7UC63 Cluster: Undecaprenyl-phosphate 4-deoxy-4-formam... 36 2.2
UniRef50_Q8Y828 Cluster: Lmo1090 protein; n=6; Listeria monocyto... 36 2.9
UniRef50_Q831L3 Cluster: Glycosyl transferase, group 2 family pr... 36 2.9
UniRef50_Q7ULA0 Cluster: Glycosyl transferase; n=1; Pirellula sp... 36 2.9
UniRef50_Q74BF3 Cluster: Glycosyl transferase, group 2 family pr... 36 2.9
UniRef50_Q5SGE1 Cluster: Heparosan synthase B; n=5; Pasteurella ... 36 2.9
UniRef50_Q41FN2 Cluster: Glycosyl transferase, family 2; n=1; Ex... 36 2.9
UniRef50_Q06BC1 Cluster: Glycosyltransferase; n=2; Vibrio choler... 36 2.9
UniRef50_A6M9B5 Cluster: Glycosyltransferase family 2; n=1; Esch... 36 2.9
UniRef50_A5ZI15 Cluster: Putative uncharacterized protein; n=1; ... 36 2.9
UniRef50_A4BT75 Cluster: Glycosyl transferase, family 2; n=1; Ni... 36 2.9
UniRef50_A1FZQ1 Cluster: Glycosyl transferase, family 2; n=8; Pr... 36 2.9
UniRef50_A0YLN1 Cluster: Glycosyl transferase, family 2; n=2; Ba... 36 2.9
UniRef50_Q4JC59 Cluster: Conserved Archaeal membrane protein; n=... 36 2.9
UniRef50_Q6DEJ9 Cluster: Dolichyl-phosphate mannosyltransferase ... 36 3.9
UniRef50_Q8RA31 Cluster: Glycosyltransferases involved in cell w... 36 3.9
UniRef50_Q8A2D4 Cluster: Putative glycosyltransferase yibD; n=2;... 36 3.9
UniRef50_Q5YXL3 Cluster: Putative glycosyltransferase; n=1; Noca... 36 3.9
UniRef50_Q47MV8 Cluster: Similar to Putative glycosyl/glyceropho... 36 3.9
UniRef50_Q2K5A1 Cluster: Glycosyltransferase protein; n=8; Rhizo... 36 3.9
UniRef50_Q2J696 Cluster: Glycosyl transferase, family 2; n=1; Fr... 36 3.9
UniRef50_Q4K1T8 Cluster: Putative glycosyl transferase; n=1; Str... 36 3.9
UniRef50_Q2BI44 Cluster: ATP-dependent helicase HrpB; n=8; Gamma... 36 3.9
UniRef50_Q1Z875 Cluster: Glycosyltransferase; n=1; Photobacteriu... 36 3.9
UniRef50_Q1QJ06 Cluster: Glycosyl transferase, family 2; n=1; Ni... 36 3.9
UniRef50_Q03CF8 Cluster: Glycosyltransferase related enzyme; n=1... 36 3.9
UniRef50_A6PMY8 Cluster: Glycosyl transferase, family 2; n=1; Vi... 36 3.9
UniRef50_A5ZI16 Cluster: Putative uncharacterized protein; n=1; ... 36 3.9
UniRef50_A5KX95 Cluster: Protein containing bacterial pre-peptid... 36 3.9
UniRef50_A3J396 Cluster: Glycosyltransferase; n=1; Flavobacteria... 36 3.9
UniRef50_A1VS57 Cluster: Glycosyl transferase, family 2; n=1; Po... 36 3.9
UniRef50_A1K6G6 Cluster: Membrane-anchored glycosyltransferase; ... 36 3.9
UniRef50_UPI00015BCEF9 Cluster: UPI00015BCEF9 related cluster; n... 35 5.1
UniRef50_Q832M8 Cluster: Glycosyl transferase, group 2 family pr... 35 5.1
UniRef50_Q72QG5 Cluster: Glycosyltransferase; n=2; Leptospira in... 35 5.1
UniRef50_Q2RLA5 Cluster: LmbE-like protein; n=1; Moorella thermo... 35 5.1
UniRef50_Q6I3S0 Cluster: Glycosyl transferase, group 2 family pr... 35 5.1
UniRef50_Q1H1J2 Cluster: Glycosyl transferase, family 2; n=1; Me... 35 5.1
UniRef50_Q0RDH4 Cluster: Putative glycosyl transferase; n=1; Fra... 35 5.1
UniRef50_A6ET48 Cluster: Putative uncharacterized protein; n=1; ... 35 5.1
UniRef50_A6A3G7 Cluster: Chitin synthase; n=2; Vibrio cholerae|R... 35 5.1
UniRef50_A5ZM19 Cluster: Putative uncharacterized protein; n=1; ... 35 5.1
UniRef50_A4AYH0 Cluster: Glycosyl transferase; n=1; Alteromonas ... 35 5.1
UniRef50_A3TKW7 Cluster: Putative uncharacterized protein; n=1; ... 35 5.1
UniRef50_A3DK13 Cluster: Glycosyl transferase, family 2; n=4; Cl... 35 5.1
UniRef50_A0ZM92 Cluster: Glycosyltransferase; n=1; Nodularia spu... 35 5.1
UniRef50_A0P221 Cluster: Glycosyltransferase-like protein; n=1; ... 35 5.1
UniRef50_A0LZL8 Cluster: Transmembrane family-2 glycosyl transfe... 35 5.1
UniRef50_Q9UYC7 Cluster: Glucosyltransferase; n=4; Thermococcace... 35 5.1
UniRef50_Q8TN31 Cluster: Glucosaminyltransferase; n=2; Methanosa... 35 5.1
UniRef50_P33697 Cluster: Succinoglycan biosynthesis protein exoO... 35 5.1
UniRef50_Q9K6R9 Cluster: BH3658 protein; n=33; cellular organism... 35 6.8
UniRef50_Q8KFK9 Cluster: Glycosyl transferase; n=10; Chlorobiace... 35 6.8
UniRef50_Q5NQ78 Cluster: Putative dolichol-phosphate mannosyltra... 35 6.8
UniRef50_Q5FT07 Cluster: Glycosyltransferase; n=1; Gluconobacter... 35 6.8
UniRef50_Q50FZ1 Cluster: Cj81-010; n=7; Campylobacter jejuni|Rep... 35 6.8
UniRef50_Q3E565 Cluster: Glycosyl transferase, family 2; n=3; Ch... 35 6.8
UniRef50_Q1EXB6 Cluster: Glycosyl transferase, family 2; n=5; Cl... 35 6.8
UniRef50_Q0P9B5 Cluster: Beta-1,3 galactosyltransferase; n=24; C... 35 6.8
UniRef50_Q0LDR9 Cluster: Glycosyl transferase, family 2; n=2; He... 35 6.8
UniRef50_P72520 Cluster: Synthase of the type 3 pneumococcal cap... 35 6.8
UniRef50_A6QB23 Cluster: Glycosyl transferase; n=1; Sulfurovum s... 35 6.8
UniRef50_A5W7C0 Cluster: Capsule polysaccharide biosynthesis pro... 35 6.8
UniRef50_A5UYR9 Cluster: Ribonuclease III; n=14; Bacteria|Rep: R... 35 6.8
UniRef50_A5NT28 Cluster: Glycosyl transferase, family 2; n=1; Me... 35 6.8
UniRef50_A5FAD6 Cluster: Hyaluronan synthase; n=5; Flavobacteria... 35 6.8
UniRef50_A3W6W3 Cluster: Glycosyltransferase; n=2; Roseovarius|R... 35 6.8
>UniRef50_Q17NN8 Cluster: N-acetylgalactosaminyltransferase; n=4;
Endopterygota|Rep: N-acetylgalactosaminyltransferase -
Aedes aegypti (Yellowfever mosquito)
Length = 613
Score = 599 bits (1480), Expect = e-170
Identities = 292/540 (54%), Positives = 367/540 (67%), Gaps = 31/540 (5%)
Query: 31 KIDWHNYEQISSERKRVGIGEQGLPAHLPK-EESDLEESLYSVNGFNGALSDKIPLSRSL 89
++DWH+Y + E +R G GE G P L K E+ L E L+ NG++
Sbjct: 91 RVDWHDYAYMEYEAEREGPGEHGKPLKLEKLEDIKLNEKLFKENGYSAL----------- 139
Query: 90 PDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDAS 149
SG KK Y++ LP+VSV+V F+NEHWSTLLRT YSVL RSP HL+KEI LV+D S
Sbjct: 140 -----SGVGKKRYLQELPTVSVIVIFYNEHWSTLLRTVYSVLNRSPSHLLKEIVLVNDHS 194
Query: 150 TKDFLGKALDDYLEGNM-PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVN 208
TK+FL + L D++ + PKVK+I LP RSGLITARL GA+ AT DVLI LDSHTE NVN
Sbjct: 195 TKEFLWEPLQDFVRTELAPKVKLISLPVRSGLITARLTGAKAATGDVLIVLDSHTEVNVN 254
Query: 209 WLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKN 268
WLPPL+EPIA DYRT +CPFIDVIA DTF+YRAQDEG RGAFDW+F YKRLP+ +D +
Sbjct: 255 WLPPLIEPIAEDYRTCVCPFIDVIAHDTFQYRAQDEGKRGAFDWKFLYKRLPLRAQDMVD 314
Query: 269 MPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSR 328
EPFESP+MAGGLFAIS FFWELGGYD GLDIWGGEQYELSFK+WQCGGRM+DAPCSR
Sbjct: 315 PTEPFESPIMAGGLFAISAKFFWELGGYDEGLDIWGGEQYELSFKVWQCGGRMVDAPCSR 374
Query: 329 VGHIYRKFAPFPNPGHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQKRL 388
VGH+YR +APFPNP +FV +N++RVAEVWMDEY + LY R P + + D GDL QK L
Sbjct: 375 VGHVYRGYAPFPNPRGTNFVTRNFKRVAEVWMDEYKQFLYERNPQFDQTDAGDLTKQKAL 434
Query: 389 REKLNCKPFKWFMTQIAFDLTAKYPPVEPKPFAEGKIXXXXXXXXXXXLCADAAHGPQLA 448
RE+L CKPFKWF+ ++A DL +YP +PKPFA G++ LC D+ +
Sbjct: 435 RERLQCKPFKWFLEEVAPDLVVRYPLRDPKPFASGRV----QSAANPKLCLDSMNHKAKE 490
Query: 449 ALRLARCDVTTNAEQH---FVLTWHKDIRCKTRNMCWDLPDSSPGSPIVLYACHLGGGNQ 505
+ + C Q+ F LT+++DIR + + C D SS GS ++L+ CH GNQ
Sbjct: 491 PIGVFSCAANRTYPQNNQFFTLTYYRDIRVSSVDKCLDA--SSDGSEVILFNCHESQGNQ 548
Query: 506 LWRYYPDTKQIKHG--GHSNCVDHEPTTLRLHISRCDAAADTQRWTLDTVDAELLAAWDA 563
LW+Y +T+ I+HG + C+D ++ +S+CD TQRW VD E L W++
Sbjct: 549 LWQYDTETQMIRHGKPTRNQCLD--LVERKVVVSKCDHRKKTQRWEWGFVDEENLNNWES 606
>UniRef50_Q6WV16 Cluster: N-acetylgalactosaminyltransferase 6; n=4;
Diptera|Rep: N-acetylgalactosaminyltransferase 6 -
Drosophila melanogaster (Fruit fly)
Length = 666
Score = 581 bits (1435), Expect = e-164
Identities = 279/535 (52%), Positives = 358/535 (66%), Gaps = 6/535 (1%)
Query: 28 SLKKIDWHNYEQISSERKRVGIGEQGLPAHLPKE-ESDLEESLYSVNGFNGALSDKIPLS 86
S+KK DWH+Y + + KRVG+GE G + L E + DLE+ + NGFN LSD I ++
Sbjct: 123 SVKK-DWHDYTFMEKDAKRVGLGEGGKASTLDDESQRDLEKRMSLENGFNALLSDSISVN 181
Query: 87 RSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVD 146
RS+PDIRH C+KK Y+ LP+VSV++ F+NE+ S L+R+ +S++ RSP L+KEI LVD
Sbjct: 182 RSVPDIRHPLCRKKEYVAKLPTVSVIIIFYNEYLSVLMRSVHSLINRSPPELMKEIILVD 241
Query: 147 DASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEAN 206
D S +++LGK L+ Y+ + V+V+RLP+R+GLI AR AGA ATA+VLIFLDSH EAN
Sbjct: 242 DHSDREYLGKELETYIAEHFKWVRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEAN 301
Query: 207 VNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDE 266
NWLPPLLEPIAL+ RT +CPFIDVI F YRAQDEGARGAFDWEFFYKRLP+LP+D
Sbjct: 302 YNWLPPLLEPIALNKRTAVCPFIDVIDHTNFHYRAQDEGARGAFDWEFFYKRLPLLPEDL 361
Query: 267 KNMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPC 326
K+ +PF+SP+MAGGLFAISR FFWELGGYD GLDIWGGEQYELSFKIW CGG M DAPC
Sbjct: 362 KHPADPFKSPIMAGGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPC 421
Query: 327 SRVGHIYRKFAPF-PNPGHGDFVGKNYRRVAEVWMDEYAEHLYRRRPH-YRRIDPGDLAP 384
SR+GHIYR P+P GD++ KNY+RVAEVWMDEY +LY Y +DPGDL
Sbjct: 422 SRIGHIYRGPRNHQPSPRKGDYLHKNYKRVAEVWMDEYKNYLYSHGDGLYESVDPGDLTE 481
Query: 385 QKRLREKLNCKPFKWFMTQIAFDLTAKYPPVEPKPFAEGKIXXXXXXXXXXXLCADAAHG 444
QK +R KLNCK FKWFM ++AFDL YPPV+P +A G + H
Sbjct: 482 QKAIRTKLNCKSFKWFMEEVAFDLMKTYPPVDPPSYAMGALQNVGNQNLCLDTLGRKKHN 541
Query: 445 PQLAALRLARCDVTTNAEQHFVLTWHKDIRCKTRNMCWDLPDSSPGSPIVLYACHLGGGN 504
++ A T Q + L+W +D+R + + C D+ +P+ L+ CH GGN
Sbjct: 542 -KMGMYACADNIKTPQRTQFWELSWKRDLRLRRKKECLDVQIWDANAPVWLWDCHSQGGN 600
Query: 505 QLWRYYPDTKQIKHGGHS-NCVDHEPTTLRLHISRCDAAADTQRWTLDTVDAELL 558
Q W Y KQ+KHG C++ P + + ++CD Q+W + + L
Sbjct: 601 QYWYYDYRHKQLKHGTEGRRCLELLPFSQEVVANKCDTDNRFQQWNFGSFNKTAL 655
>UniRef50_O45947 Cluster: Putative polypeptide
N-acetylgalactosaminyltransferase 10; n=4;
Caenorhabditis|Rep: Putative polypeptide
N-acetylgalactosaminyltransferase 10 - Caenorhabditis
elegans
Length = 622
Score = 577 bits (1424), Expect = e-163
Identities = 289/539 (53%), Positives = 354/539 (65%), Gaps = 16/539 (2%)
Query: 30 KKIDWHNYEQISSERKRVGIGEQGLPAHLPKEESDLEE--SLYSVNGFNGALSDKIPLSR 87
+KI W + E++R G GE G P LP+++ +E SLY NG+N +SD I L+R
Sbjct: 78 EKIKWEDEAAYEKEKRREGPGEWGKPVKLPEDKEVEKEALSLYKANGYNAYISDMISLNR 137
Query: 88 SLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDD 147
S+ DIRH CK MY LP+VSV+ PFH EH STLLR+ YSV+ RSP L+KEI LVDD
Sbjct: 138 SIKDIRHKECKNMMYSAKLPTVSVIFPFHEEHNSTLLRSVYSVINRSPPELLKEIILVDD 197
Query: 148 ASTKDFLGKALDDYLEGNMPK--VKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEA 205
S K L + L+D+L+ N VKV+R KR GLI R GA+ AT ++LIFLD+H+EA
Sbjct: 198 FSEKPALRQPLEDFLKKNKIDHIVKVLRTKKREGLIRGRQLGAQDATGEILIFLDAHSEA 257
Query: 206 NVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKD 265
N NWLPPLL+PIA DYRTV+CPF+DVI +T+E R QDEGARG+FDW F YKRLP+ KD
Sbjct: 258 NYNWLPPLLDPIAEDYRTVVCPFVDVIDCETYEVRPQDEGARGSFDWAFNYKRLPLTKKD 317
Query: 266 EKNMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAP 325
++ +PF SPVMAGG FAIS +FWELGGYD GLDIWGGEQYELSFK+WQC GRM+DAP
Sbjct: 318 RESPTKPFNSPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGRMVDAP 377
Query: 326 CSRVGHIYR-KFAPFPNPGHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAP 384
CSRV HIYR K+APF N G GDFV +NY+RVAEVWMD+Y E LY+ RP D GDL
Sbjct: 378 CSRVAHIYRCKYAPFKNAGMGDFVSRNYKRVAEVWMDDYKETLYKHRPGVGNADAGDLKL 437
Query: 385 QKRLREKLNCKPFKWFMTQIAFDLTAKYPPVEPKPFAEGKIXXXXXXXXXXXLCADAAHG 444
K +REKL CK F WFM +IAFD YP VEPK AEG+I C D
Sbjct: 438 MKGIREKLQCKSFDWFMKEIAFDQDKYYPAVEPKASAEGEI-----RNVGTNFCIDTQFK 492
Query: 445 PQLAALRLARC---DVTTNAEQHFVLT-WHKDIRCKTRNMCWDLPDSSPGSPIVLYACHL 500
Q L +C D EQ LT WH DIR K R +C+D S +P++L+ CH
Sbjct: 493 EQNQRFGLRKCTSDDKDGGGEQDLRLTRWH-DIRPKGRKICFDCSTSVDKAPVILFDCHS 551
Query: 501 GGGNQLWRYYPDTKQIKHGGHSNCVDHEPTTLR-LHISRCDAAADTQRWTLDTVDAELL 558
GNQL++Y KQI H C+ + LH+ +CD+++D Q+W TVD ELL
Sbjct: 552 MKGNQLFKYRVAQKQIYHPISGQCLTADENGKGFLHMKKCDSSSDLQKWAWQTVDNELL 610
>UniRef50_Q86SR1 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 10; n=77;
Coelomata|Rep: Polypeptide
N-acetylgalactosaminyltransferase 10 - Homo sapiens
(Human)
Length = 603
Score = 573 bits (1414), Expect = e-162
Identities = 281/550 (51%), Positives = 363/550 (66%), Gaps = 23/550 (4%)
Query: 20 KENHFVPKSLKKIDWHNYEQISSERKRVGIGEQGLPAHLPKEESDLEESLYSVNGFNGAL 79
K+ F+ K DWH+ E I + +RVG GEQG P P +++ + Y NGFN +
Sbjct: 60 KKTFFLGDGQKLKDWHDKEAIRRDAQRVGNGEQGRP--YPMTDAERVDQAYRENGFNIYV 117
Query: 80 SDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLI 139
SDKI L+RSLPDIRH C K Y+E+LP+ S+++PFHNE WS+LLRT +SVL RSP L+
Sbjct: 118 SDKISLNRSLPDIRHPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELV 177
Query: 140 KEIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFL 199
EI LVDD S ++ L K L+DY+ P V+++R KR GLI R+ GA AT DV+ FL
Sbjct: 178 AEIVLVDDFSDREHLKKPLEDYM-ALFPSVRILRTKKREGLIRTRMLGASVATGDVITFL 236
Query: 200 DSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQ-DEGARGAFDWEFFYKR 258
DSH EANVNWLPPLL+ IA + +T++CP IDVI D F Y Q + RGAFDWE +YKR
Sbjct: 237 DSHCEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKR 296
Query: 259 LPVLPKDEKNMP-EPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQC 317
+P+ P+ +K P +PFESPVMAGGLFA+ R +FWELGGYDPGL+IWGGEQYE+SFK+W C
Sbjct: 297 IPIPPELQKADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMC 356
Query: 318 GGRMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRI 377
GGRM D PCSRVGHIYRK+ P+ P G + +N +RVAEVWMDEYAE++Y+RRP YR +
Sbjct: 357 GGRMEDIPCSRVGHIYRKYVPYKVPA-GVSLARNLKRVAEVWMDEYAEYIYQRRPEYRHL 415
Query: 378 DPGDLAPQKRLREKLNCKPFKWFMTQIAFDLTAKYPPVEPKPFAEGKIXXXXXXXXXXXL 437
GD+A QK+LR LNCK FKWFMT+IA+DL YPPVEP A G+I L
Sbjct: 416 SAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEI-----RNVGTGL 470
Query: 438 CADAAHGPQLAALRLARC-----DVTTNAEQHFVLTWHKDIR----CKTRNMCWDLPDSS 488
CAD HG + LRL C + N Q F TW +DIR T+ C+D S
Sbjct: 471 CADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKFCFDA--IS 528
Query: 489 PGSPIVLYACHLGGGNQLWRYYPDTKQIKHGGHSNCVDHEPTTLRLHISRCDAAADTQRW 548
SP+ LY CH GNQLW+Y D K + H +C+D + R+ ++ C+ ++ TQ+W
Sbjct: 529 HTSPVTLYDCHSMKGNQLWKYRKD-KTLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQW 587
Query: 549 TLDTVDAELL 558
+ ++ +L
Sbjct: 588 LFEHTNSTVL 597
>UniRef50_Q8IA42 Cluster: N-acetylgalactosaminyltransferase 4; n=2;
Sophophora|Rep: N-acetylgalactosaminyltransferase 4 -
Drosophila melanogaster (Fruit fly)
Length = 659
Score = 503 bits (1241), Expect = e-141
Identities = 246/545 (45%), Positives = 346/545 (63%), Gaps = 17/545 (3%)
Query: 26 PKSLKKIDWHNYEQISSERKRVGIGEQGLPAHLPK-EESDLEESLYSVNGFNGALSDKIP 84
PK +K DWH+Y + ++RKR G GE G+ + +E LE+ Y +NGFNG +SD+I
Sbjct: 112 PKGERK-DWHDYAAMEADRKRSGFGEHGVAVKIENPDEKQLEKEHYEMNGFNGLISDRIS 170
Query: 85 LSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFL 144
++RS+PD+R CK + Y+ LP++SV+ F NEH++TLLR+ YSV+ R+P L+K+I L
Sbjct: 171 VNRSVPDLRLEACKTRKYLAKLPNISVIFIFFNEHFNTLLRSIYSVINRTPPELLKQIVL 230
Query: 145 VDDASTKDFLGKALDDYLEGNMPK-VKVIRLPKRSGLITARLAGAERATADVLIFLDSHT 203
VDD S D L + LDDY++ + P V ++R P+R GLI AR+AGA+ A V++F DSH
Sbjct: 231 VDDGSEWDVLKQPLDDYVQQHFPHLVTIVRNPERQGLIGARIAGAKVAVGQVMVFFDSHI 290
Query: 204 EANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEY-RAQDEGARGAFDWEFFYKRLPVL 262
E N NWLPPL+EPIA++ + CP +D I+ + F Y +GARG FDW+ YK+LPVL
Sbjct: 291 EVNYNWLPPLIEPIAINPKISTCPMVDTISHEDFSYFSGNKDGARGGFDWKMLYKQLPVL 350
Query: 263 PKDEKNMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRML 322
P+D + P+ SPVM GGLFAI+ FFW+LGGYD LDIWGGEQYELSFKIW CGG +L
Sbjct: 351 PEDALDKSMPYRSPVMMGGLFAINTDFFWDLGGYDDQLDIWGGEQYELSFKIWMCGGMLL 410
Query: 323 DAPCSRVGHIYR-KFAPFPNPGHGDFVGKNYRRVAEVWMDEYAEHLYRRRPH-YRRIDPG 380
D PCSRV HI+R P NP +FV KN++RVAEVWMDEY +++Y+R P Y +D G
Sbjct: 411 DVPCSRVAHIFRGPMKPRGNPRGHNFVAKNHKRVAEVWMDEYKQYVYKRDPKTYDNLDAG 470
Query: 381 DLAPQKRLREKLNCKPFKWFMTQIAFDLTAKYPPVEPKPFAEGKIXXXXXXXXXXXLCAD 440
DL Q+ +RE+L CK F WFMT++A D K+PPVEP +A G I C D
Sbjct: 471 DLTRQRGVRERLKCKSFHWFMTEVAPDFLVKFPPVEPPSYAAGII----QNVANPVYCLD 526
Query: 441 AAHGPQLAALRLARC-DVTTNAE--QHFVLTWHKDIRCK-TRNMCWDLPDSSPGSPIVLY 496
A+ + C D T+ + Q + L+ +D+R K ++C D+ + P + + ++
Sbjct: 527 NMGKSTEEAVGMFSCADNRTHPQPNQFWELSIFRDLRMKGFDSVCLDVHEGPPNATVWMW 586
Query: 497 ACHLGGGNQLWRYYPDTKQIKHG-GHSNCVDH--EPTTLRLHISRCDAAADTQRWTLDTV 553
+CH GGNQ W Y T+++ HG + C++ E ++ + C+ D QRW V
Sbjct: 587 SCHSQGGNQFWYYDRQTQRLVHGENNKRCLEGFVENGIAKVVANSCENGNDRQRWEFGFV 646
Query: 554 DAELL 558
+ +L
Sbjct: 647 NHTML 651
>UniRef50_A7SZ28 Cluster: Predicted protein; n=1; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 454
Score = 417 bits (1027), Expect = e-115
Identities = 211/461 (45%), Positives = 289/461 (62%), Gaps = 20/461 (4%)
Query: 48 GIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLP 107
G GE G P E+ +++ YS GFN +SD+I L R++ D RH CK++ Y +LP
Sbjct: 1 GPGENGEPVETKAEDESKKDAAYSEFGFNQFVSDQISLERTISDTRHQACKQRSYPINLP 60
Query: 108 SVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP 167
SVV+ FHNE WSTL+RT ++VL RSP H+++EI +VDD S KDFL + LDDY + +
Sbjct: 61 KASVVIVFHNEGWSTLMRTVHTVLLRSPPHMLQEIVMVDDFSNKDFLKQKLDDYTK-KLG 119
Query: 168 KVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCP 227
K+K++R +R GLI AR+ GA A +V+IFLD+H E N WLPPLLE IAL+ RT +CP
Sbjct: 120 KIKIVRTKERVGLIKARVIGANNAVGEVVIFLDAHCECNKGWLPPLLERIALNRRTAVCP 179
Query: 228 FIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEP---FESPVMAGGLFA 284
ID I TF+Y+ D RG F+W F YK V P++ +P +SPVMAGGLFA
Sbjct: 180 TIDFIDHKTFQYKPMDPYIRGTFNWRFDYKERAVRPEEMAKRRDPTQEVKSPVMAGGLFA 239
Query: 285 ISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGH 344
I+R FF ELG YDPG+ IWGGEQYE+SFK+WQCGG++ + PCSRVGH+YR P+ P H
Sbjct: 240 INREFFSELGQYDPGMFIWGGEQYEISFKLWQCGGQLENIPCSRVGHVYRHHVPYTYPKH 299
Query: 345 GDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQI 404
D N+RRVAEVWMDEY + LY +RP + +D GD++ + LR++L CK FKW++ +
Sbjct: 300 -DATLVNFRRVAEVWMDEYKDWLYDKRPEIKSVDYGDISDRIALRKRLKCKSFKWYLENV 358
Query: 405 AFDLTAKYPPVEPKPFAEGKIXXXXXXXXXXXLCADAAHGPQLAALRLARCDVTTNAEQH 464
A D V+ K A ++ +C D+ G + + LA C Q
Sbjct: 359 AND------TVKTKLCACFQV-----RNQGKNMCLDSM-GRKDGHVGLASCH-NMGGNQA 405
Query: 465 FVLTWHKDIRCKTRNMCWDLPDSSPGSPIVLYACHLGGGNQ 505
F T+ +++R T C+D+ +S PG+ + + CH GNQ
Sbjct: 406 FQYTYIRELR--TDETCFDVHESFPGAKVHFFPCHEMKGNQ 444
>UniRef50_Q10472 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 1 (EC 2.4.1.41)
(Protein-UDP acetylgalactosaminyltransferase 1) (UDP-
GalNAc:polypeptide N-acetylgalactosaminyltransferase 1)
(Polypeptide GalNAc transferase 1) (GalNAc-T1)
(pp-GaNTase 1) [Contains: Polypeptide
N-acetylgalactosaminyltransferase 1 soluble form]; n=66;
Eumetazoa|Rep: Polypeptide
N-acetylgalactosaminyltransferase 1 (EC 2.4.1.41)
(Protein-UDP acetylgalactosaminyltransferase 1) (UDP-
GalNAc:polypeptide N-acetylgalactosaminyltransferase 1)
(Polypeptide GalNAc transferase 1) (GalNAc-T1)
(pp-GaNTase 1) [Contains: Polypeptide
N-acetylgalactosaminyltransferase 1 soluble form] - Homo
sapiens (Human)
Length = 559
Score = 400 bits (986), Expect = e-110
Identities = 214/518 (41%), Positives = 299/518 (57%), Gaps = 23/518 (4%)
Query: 43 ERKRVGIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMY 102
++ G GE G P +PKE+ + + ++ +N FN S+ I L+RSLPD+R GCK K+Y
Sbjct: 52 QKPHEGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVY 111
Query: 103 IESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYL 162
++LP+ SVV+ FHNE WSTLLRT +SV+ RSP H+I+EI LVDDAS +DFL + L+ Y+
Sbjct: 112 PDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRPLESYV 171
Query: 163 EGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYR 222
+ V VIR+ +RSGLI ARL GA + V+ FLD+H E V WL PLL I D R
Sbjct: 172 KKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARIKHDRR 231
Query: 223 TVMCPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPV----LPKDEKNMPEPFESPVM 278
TV+CP IDVI+ DTFEY A + G F+W+ ++ PV + + + + P +P M
Sbjct: 232 TVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTM 291
Query: 279 AGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAP 338
AGGLF+I R +F E+G YD G+DIWGGE E+SF+IWQCGG + CS VGH++RK P
Sbjct: 292 AGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFRKATP 351
Query: 339 FPNP-GHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPF 397
+ P G G + KN RR+AEVWMDE+ Y P ++D GD++ + LR KL CKPF
Sbjct: 352 YTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPF 411
Query: 398 KWFMTQIAFDLTAKYPPVEPKPFAEGKIXXXXXXXXXXXLCADAAHGPQLAALRLARCDV 457
W++ I D + F+ G+I C D + + + C
Sbjct: 412 SWYLENIYPD-----SQIPRHYFSLGEIRNVETNQ-----CLDNMARKENEKVGIFNCH- 460
Query: 458 TTNAEQHFVLTWHKDIRCKTRNMCWDLPDSSPGSPIVLYACHLGGGNQLWRYYPDTKQIK 517
Q F T +K+IR T ++C D+ S P+ + CH GNQLW Y P ++
Sbjct: 461 GMGGNQVFSYTANKEIR--TDDLCLDV--SKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQ 516
Query: 518 HGGHSNCVDH--EPTTLRLHISRCDAAADTQRWTLDTV 553
H + C+D E + I C+ + +Q+W L V
Sbjct: 517 HVNSNQCLDKATEEDSQVPSIRDCNGSR-SQQWLLRNV 553
>UniRef50_Q9VUT6 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 8; n=1; Drosophila
melanogaster|Rep: Polypeptide
N-acetylgalactosaminyltransferase 8 - Drosophila
melanogaster (Fruit fly)
Length = 590
Score = 383 bits (942), Expect = e-105
Identities = 215/531 (40%), Positives = 303/531 (57%), Gaps = 29/531 (5%)
Query: 49 IGEQGLPAHLPKEESDLEESLYSVN--GFNGALSDKIPLSRSLPDIRHSGCKKKMY-IES 105
+G G PA E LE S G+N LS +I RSL D+RH CKK Y +E
Sbjct: 67 LGALGRPARGNWTEEQLEAIAKSQRETGYNAWLSKRISPERSLYDMRHRSCKKLKYPMEK 126
Query: 106 LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGN 165
LPSVSVV+ +HNE S LLRT S+ R+P L++E+ LVDD ST+ + L+D+++
Sbjct: 127 LPSVSVVITYHNEEASVLLRTLSSLRSRTPIQLLREVILVDDGSTQ--ADEKLNDFIKIK 184
Query: 166 -MPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTV 224
+ V+ R+ + GL+ AR+ GAE A ADVL+FLDSH E WL PL+ PI D RT
Sbjct: 185 FLNMVQHRRITTQVGLMHARVVGAELALADVLVFLDSHVEVTKGWLEPLIAPILEDNRTC 244
Query: 225 MCPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEPFESPVMAGGLFA 284
P ID I FD F YR + + +RG F+WEF Y +LP+L ++ MP P ++P+M GGLFA
Sbjct: 245 TTPIIDTIDFDNFAYR-RGKPSRGFFNWEFNYIQLPLLKEEAVAMPAPHKNPIMNGGLFA 303
Query: 285 ISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYR------KFAP 338
I R +F ELGGYD GL IWG EQ+ELS K+W CGG++L+ PCSRVGH++R ++
Sbjct: 304 IGREWFSELGGYDKGLKIWGAEQFELSLKLWLCGGQILEVPCSRVGHLFRDGNFQIRYTN 363
Query: 339 FPNPGHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFK 398
+ +NYRRVAE+W+DEY + L+ PH I G+LA Q+ L+ +L+CKPFK
Sbjct: 364 KDKNSEKKLISRNYRRVAEIWLDEYKDKLFANMPHLTVIPVGNLAEQRDLKNRLHCKPFK 423
Query: 399 WFMTQIAFDLTAKYPPVEPKPFAEGKIXXXXXXXXXXXLCADAAHGPQLAALRLARCD-- 456
WF+ +A D YP ++P +A G + LC D P +LA C
Sbjct: 424 WFLDNLATDFLNLYPILDPAEYASGVL----QSISSPKLCLD-RKDPSHGQPKLAPCSSD 478
Query: 457 -VTTNAEQHFVLTWHKDIRCKTRNMCWDLPDSSPGSPIVLYACHLGGGNQLWRYYPDTKQ 515
V + EQ++ LT H+++R + C ++ + G + +Y CH GNQ W + T Q
Sbjct: 479 HVFPSPEQYWSLTNHRELR--SGFYCLEV--RNHGVNVHIYQCHGQSGNQFWSFDSKTHQ 534
Query: 516 IKHGGHSN---CVDHEPTTLRLHISRCDAAADTQRWTLDTVDAELLA-AWD 562
+ G N C++ +P + S CD Q+W ++++ L WD
Sbjct: 535 VISGQQQNFRHCLEAQPELNAVTSSVCDPKNHKQQWKFGYLNSQRLQHFWD 585
>UniRef50_Q86SF2 Cluster: N-acetylgalactosaminyltransferase 7; n=31;
Euteleostomi|Rep: N-acetylgalactosaminyltransferase 7 -
Homo sapiens (Human)
Length = 657
Score = 375 bits (922), Expect = e-102
Identities = 211/520 (40%), Positives = 293/520 (56%), Gaps = 27/520 (5%)
Query: 48 GIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLP 107
G GE+ P L E ++ GFN SD I L RS+ D+R CK Y E+L
Sbjct: 148 GPGEKAKPLVLGPEFKQAIQASIKEFGFNMVASDMISLDRSVNDLRQEECKYWHYDENLL 207
Query: 108 SVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP 167
+ SVV+ FHNE WSTL+RT +SV+ R+P + EI L+DD S K+ L + LD+Y++
Sbjct: 208 TSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIVLIDDFSNKEHLKEKLDEYIKLWNG 267
Query: 168 KVKVIRLPKRSGLITARLAGAERAT-ADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMC 226
VKV R +R GLI AR GA++A VLI+LD+H E VNW PL+ PI+ D
Sbjct: 268 LVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAPLVAPISKDRTICTV 327
Query: 227 PFIDVIAFDTFEYRAQDEG-----ARGAFDWEFFYKRLPVLPKDE---KNMPEPFESPVM 278
P IDVI +T+E Q G ARGA+DW +KR+P+ P+++ K EP+ SP M
Sbjct: 328 PLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTPQEKRLRKTKTEPYRSPAM 387
Query: 279 AGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAP 338
AGGLFAI R FF+ELG YDPGL IWGGE +E+S+KIWQCGG++L PCSRVGHIYR
Sbjct: 388 AGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFVPCSRVGHIYRLEGW 447
Query: 339 FPNPGHGDFVG-----KNYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQKRLREKLN 393
NP +VG KNY RV EVW DEY ++ Y RP + + GD++ K+ RE N
Sbjct: 448 QGNP-PPIYVGSSPTLKNYVRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHN 506
Query: 394 CKPFKWFMTQIAFDLTAKYPPVEPKPFAEGKIXXXXXXXXXXXLCADAAHGPQLAALRLA 453
CK FKWFM +IA+D+T+ Y P+ PK G+I C D+ + L
Sbjct: 507 CKSFKWFMEEIAYDITSHY-PLPPKNVDWGEI-----RGFETAYCIDSMGKTNGGFVELG 560
Query: 454 RCDVTTNAEQHFVLTWHKDIRCKTRNMCWDLPDSSPGSPIVLYACHLGGGNQLWRYYPDT 513
C Q F + ++ + + C L + GS +++ C+L + W+Y+ +
Sbjct: 561 PCH-RMGGNQLFRI--NEANQLMQYDQC--LTKGADGSKVMITHCNLNEFKE-WQYFKNL 614
Query: 514 KQIKHGGHSNCVDHEPTTLRLHISRCDAAADTQRWTLDTV 553
+ H C+D ++ IS CD++ TQ+W ++ +
Sbjct: 615 HRFTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 654
>UniRef50_A2AQQ1 Cluster: UDP-N-acetyl-alpha-D-galactosamine:
polypeptide N- acetylgalactosaminyltransferase 13; n=10;
Coelomata|Rep: UDP-N-acetyl-alpha-D-galactosamine:
polypeptide N- acetylgalactosaminyltransferase 13 - Mus
musculus (Mouse)
Length = 592
Score = 374 bits (919), Expect = e-102
Identities = 176/366 (48%), Positives = 240/366 (65%), Gaps = 5/366 (1%)
Query: 44 RKRVGIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYI 103
R + G GE G +PK++ + + L+ +N FN SD I L+RSLPD+R GCK K+Y
Sbjct: 52 RNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYP 111
Query: 104 ESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLE 163
+ LP+ SVV+ FHNE WSTLLRT YSV+ RSP +L+ E+ LVDDAS +DFL L++Y++
Sbjct: 112 DELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVK 171
Query: 164 GNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRT 223
VK+IR+ +RSGLI ARL GA + V+ FLD+H E + WL PLL I D +T
Sbjct: 172 TLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKT 231
Query: 224 VMCPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPV----LPKDEKNMPEPFESPVMA 279
V+CP IDVI+ DTFEY A + G F+W+ ++ PV + + + + P +P MA
Sbjct: 232 VVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMA 291
Query: 280 GGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPF 339
GGLF+I R +F E+G YD G+DIWGGE E+SF+IWQCGG + CS VGH++RK P+
Sbjct: 292 GGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPY 351
Query: 340 PNP-GHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFK 398
P G G + KN RR+AEVWMDE+ + Y P ++D GD++ +K LRE L CKPF
Sbjct: 352 TFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFS 411
Query: 399 WFMTQI 404
W++ I
Sbjct: 412 WYLENI 417
Score = 47.6 bits (108), Expect = 9e-04
Identities = 27/90 (30%), Positives = 45/90 (50%), Gaps = 5/90 (5%)
Query: 462 EQHFVLTWHKDIRCKTRNMCWDLPDSSPGSPIVLYACHLGGGNQLWRYYPDTKQIKHGGH 521
EQ F T K+IR T ++C D+ S P+++ CH GNQLW Y + ++H
Sbjct: 500 EQVFSYTADKEIR--TDDLCLDV--SRLSGPVIMLKCHHMRGNQLWEYDAERLTLRHANS 555
Query: 522 SNCVDHEPTTLRLHISRCDAAAD-TQRWTL 550
+ C+D ++ + D + +Q+W L
Sbjct: 556 NQCLDEPSEEDKMVPTMQDCSGSRSQQWLL 585
>UniRef50_A7RRV7 Cluster: Predicted protein; n=1; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 513
Score = 367 bits (903), Expect = e-100
Identities = 201/510 (39%), Positives = 291/510 (57%), Gaps = 22/510 (4%)
Query: 49 IGEQGLPAHLPKEESD-LEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKK--MYIES 105
+G G PA L EE+ L E ++ + FN LSDKI L R+L D+R CK K Y
Sbjct: 7 LGGGGKPAFLESEENKKLAEKYFANHSFNWLLSDKISLDRTLDDVRSERCKAKHNTYPAK 66
Query: 106 LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGN 165
LP+ SV++ FH E S LLRT +SV+ R+P L+ E+ +VDD S LGK LDD++
Sbjct: 67 LPTTSVIICFHKERLSVLLRTVHSVINRTPPELLAEVIVVDDFSQDAKLGKPLDDHV-AQ 125
Query: 166 MPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVM 225
KVKV+R+ KR GL+ ARL GA A DVL FLDSH EA W PLL IA D R V+
Sbjct: 126 FTKVKVLRMKKREGLVRARLQGANTAKGDVLTFLDSHCEATPGWAEPLLARIAADRRNVV 185
Query: 226 CPFIDVIAFDTFEYRAQDEG-ARGAFDWEFFYKRLPVLPKDEK---NMPEPFESPVMAGG 281
CP I+VI DTF Y+ RG F W+ F+K + P+++K + +P +P MAGG
Sbjct: 186 CPAIEVINADTFAYQGSTNADQRGGFSWDLFFKWKGIPPEEQKLRNDDSDPIRTPTMAGG 245
Query: 282 LFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKF-APFP 340
LF+I R +F+++G YD +DIWGGE ELSF++W CGGR+ CSRVGH++RK+ +P+
Sbjct: 246 LFSIHRQYFFDIGSYDEEMDIWGGENLELSFRVWMCGGRLEIVTCSRVGHVFRKYTSPYK 305
Query: 341 NP-GHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFKW 399
P G + KN+ R+AEVWMDEY + Y ++P + D GD++ + LR++L CK FKW
Sbjct: 306 FPDGVERTLTKNFNRLAEVWMDEYKDLYYNKKPQAKNSDYGDISKRLELRKRLKCKSFKW 365
Query: 400 FMTQIAFDLTAKYPPVEPKPFAEGKIXXXXXXXXXXXLCADAAHGPQLAALRLARCDVTT 459
++ I D+ + P ++P A G++ L A H A + + C
Sbjct: 366 YINNIYPDV--QMPELDPP--ARGEVRNPSSNQCLDSLGAKPEHN---ARVGIYTCHGQG 418
Query: 460 NAEQHFVLTWHKDIRCKTRNMCWDLPDSSPGSPIVLYACHLGGGNQLWRYYPDTKQIKHG 519
+ + +++ + N C+D+ + PG+P+ L CH GNQ W++ + + H
Sbjct: 419 GNQVSKYMP--RELIFEEEN-CFDVSKTHPGAPVELMKCHGMRGNQEWKHDREKGTLMHF 475
Query: 520 GHSNCVDH-EPTTLRLHISRCDAAADTQRW 548
C+D P+ ++ CD ++QRW
Sbjct: 476 TTQQCLDRGSPSDQYAVMNPCD-GRESQRW 504
>UniRef50_Q4RQL8 Cluster: Chromosome 2 SCAF15004, whole genome
shotgun sequence; n=5; Euteleostomi|Rep: Chromosome 2
SCAF15004, whole genome shotgun sequence - Tetraodon
nigroviridis (Green puffer)
Length = 632
Score = 363 bits (894), Expect = 5e-99
Identities = 201/495 (40%), Positives = 282/495 (56%), Gaps = 45/495 (9%)
Query: 44 RKRVGIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYI 103
R+ G GE G +PK+E + + L+ +N FN SD I L+RSLPD+R GCK K+Y
Sbjct: 52 RRHEGPGEMGKAVVIPKDEQEKMKELFKINQFNLMASDMIALNRSLPDVRLDGCKTKVYP 111
Query: 104 ESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLE 163
+ +P+ SVV+ FHNE WSTLLRT +SV+ RSP HL+ EI LVDDAS +DFL K L++Y+
Sbjct: 112 DDVPNTSVVIVFHNEAWSTLLRTVHSVINRSPRHLLVEIVLVDDASERDFLKKKLENYVR 171
Query: 164 GNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALD--- 220
V+++R+ +RSGLI ARL GA V+ FLD+H E V WL PLL I D
Sbjct: 172 TLEVPVRILRMEQRSGLIRARLRGAAATKGQVITFLDAHCECTVGWLEPLLARIKEDRWD 231
Query: 221 -------------------YRT-VMCPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLP 260
+RT V+CP IDVI+ +TFEY A + G F+W+ ++ P
Sbjct: 232 CNTALCVCVFERPSFRCFLFRTAVVCPIIDVISDETFEYMAGSDMTYGGFNWKLNFRWYP 291
Query: 261 VLPKDEKNMPE-----PFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIW 315
V P+ E + + P +P MAGGLF+I + +F E+G YDPG+DIWGGE E+SF+IW
Sbjct: 292 V-PQREMDRRKGDRTLPVRTPTMAGGLFSIDKTYFEEIGSYDPGMDIWGGENLEMSFRIW 350
Query: 316 QCGGRMLDAPCSRVGHIYRKFAPFPNP-GHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHY 374
QCGG + CS VGH++RK P+ P G G + KN RR+AEVWMD++ + Y P
Sbjct: 351 QCGGSLEIVTCSHVGHVFRKATPYSFPGGTGQVINKNNRRLAEVWMDDFKDFFYIISPGV 410
Query: 375 RRIDPGDLAPQKRLREKLNCKPFKWFMTQIAFDLTAKYPPVEPKPFAEGKIXXXXXXXXX 434
R+D GD++ +K LR+ L+CKPF W++ I D + + ++ G+I
Sbjct: 411 MRVDYGDVSSRKGLRDALHCKPFSWYLENIYPD-----SQIPRRYYSLGEIRNVETNQ-- 463
Query: 435 XXLCADAAHGPQLAALRLARCDVTTNAEQHFVLTWHKDIRCKTRNMCWDLPDSSPGSPIV 494
C D + + C Q F T K+IR T ++C D+ S P++
Sbjct: 464 ---CVDNMGRKENEKVGFFNCH-GMGGNQVFSYTADKEIR--TDDLCLDV--SRLNGPVL 515
Query: 495 LYACHLGGGNQLWRY 509
+ CH GNQ++ Y
Sbjct: 516 MLKCHHMKGNQMFEY 530
>UniRef50_Q8IA43 Cluster: Putative polypeptide
N-acetylgalactosaminyltransferase 10; n=1; Drosophila
melanogaster|Rep: Putative polypeptide
N-acetylgalactosaminyltransferase 10 - Drosophila
melanogaster (Fruit fly)
Length = 630
Score = 362 bits (891), Expect = 1e-98
Identities = 191/510 (37%), Positives = 288/510 (56%), Gaps = 18/510 (3%)
Query: 18 VNKENHF-VPKSLKKIDWHNYEQISSERKRVGIGEQGLPAHLPKEESDLEESLYSVNGFN 76
+ + N F +P+ +W + + R G+GEQG LP + + ++ F
Sbjct: 81 IQRNNSFRLPQQEAIKEWQDAIFFKEDSNRRGLGEQGRAVQLPNAKLNPDD----FQDFY 136
Query: 77 GALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPE 136
LSD+IPL+RSLPD R C+K+ Y+E+LP+V+V++ FH+EH S LLR+ S++ RSP
Sbjct: 137 AELSDRIPLNRSLPDTRPISCRKRKYLENLPNVTVIIAFHDEHLSVLLRSITSIINRSPV 196
Query: 137 HLIKEIFLVDDASTKDFLGKALDDYLEGNMPKV-KVIRLPKRSGLITARLAGAERATADV 195
L+K+I LVDD S LG+ L++ + N PK+ ++RLP+R G I AR+ ++ V
Sbjct: 197 ELLKQIVLVDDDSNLPELGQQLEEIVAQNFPKIIHILRLPERRGSIKARMEAIRVSSCQV 256
Query: 196 LIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGARGAFDWEFF 255
L+FLDSH E N NWLPPLLEPI ++ V P +D I+ TF Y Q+ R F+W
Sbjct: 257 LVFLDSHIEVNTNWLPPLLEPIVINPHIVTRPILDAISRKTFAYAKQNTMTRSGFNWWLE 316
Query: 256 YKRLPVLPKDEKNMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIW 315
+ LP+ P+D+ P+ +PV++G + AI R +F LGG+D LD W E++E+SFK+W
Sbjct: 317 SESLPIFPEDKSPDSTPYRTPVLSGAM-AIDRNYFLNLGGFDEQLDTWEAEKFEISFKVW 375
Query: 316 QCGGRMLDAPCSRVGHI-YRKFAPFPNPGHGDFVGKNYRRVAEVWMDEYAEHLYRRRPH- 373
CGG ML PC+RVGHI R +PG+ +F+ +NY+RVAEVWMD Y +++Y + P
Sbjct: 376 MCGGMMLYVPCARVGHIGKRPMKSISSPGYHNFLARNYKRVAEVWMDNYKKYVYDKNPKL 435
Query: 374 YRRIDPGDLAPQKRLREKLNCKPFKWFMTQIAFDLTAKYPPVEPKPFAEGKIXXXXXXXX 433
Y+ + G L +K R L CK F W+MT++A D +Y ++ G I
Sbjct: 436 YKMANAGLLFQRKTKRNALECKTFDWYMTKVAPDFLKRYLALDSPLVFSGVI----ESVA 491
Query: 434 XXXLCADAAHGPQLAALRLARC---DVTTNAEQHFVLTWHKDIR-CKTRNMCWDLPDSSP 489
C D+ + + LARC + Q++ LT +I+ +++ C +
Sbjct: 492 FPGFCVDSLNCRHTKPVVLARCTGHNSMPGEHQNWSLTQDHEIQLTNSKDDCLEAQGLRS 551
Query: 490 GSPIVLYACHLGGGNQLWRYYPDTKQIKHG 519
S + L+ CH GGNQ W Y + I+ G
Sbjct: 552 KS-VWLFRCHKNGGNQYWYYNHRHRWIQQG 580
>UniRef50_Q8MV48 Cluster: N-acetylgalactosaminyltransferase 7; n=5;
Endopterygota|Rep: N-acetylgalactosaminyltransferase 7 -
Drosophila melanogaster (Fruit fly)
Length = 591
Score = 357 bits (878), Expect = 5e-97
Identities = 201/517 (38%), Positives = 277/517 (53%), Gaps = 25/517 (4%)
Query: 44 RKRVGIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYI 103
+ R G GE G L ++ + ++ G N A SD+I + RS+ D R C+ Y
Sbjct: 79 KPRSGPGENGEAHSLSPDKKHMSDASEMEYGMNIACSDEISMHRSVRDTRLEECRHWDYP 138
Query: 104 ESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLE 163
LP SV++ FHNE +S L+RT +SV+ RSP H++ EI LVDD S K+ L LD+Y+
Sbjct: 139 FDLPRTSVIIVFHNEGFSVLMRTVHSVIDRSPTHMLHEIILVDDFSDKENLRSQLDEYVL 198
Query: 164 GNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRT 223
VKVIR +R GLI R GA AT +V++FLD+H E N NWLPPLL PI D
Sbjct: 199 QFKGLVKVIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEVNTNWLPPLLAPIYRDRTV 258
Query: 224 VMCPFIDVIAFDTFEYR---AQDEGARGAFDWEFFYKRLPVLPKDEK---NMPEPFESPV 277
+ P ID I FEYR D RG F+W YK V ++++ + EP+ SP
Sbjct: 259 MTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVPRREQRRRAHNSEPYRSPT 318
Query: 278 MAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFA 337
AGGLFAI+R +F ELG YDPGL +WGGE +ELSFKIWQCGG + PCSRVGH+YR F
Sbjct: 319 HAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPCSRVGHVYRGFM 378
Query: 338 PF-----PNPGHGDFVGKNYRRVAEVWMDE-YAEHLYRRRPHYRRIDPGDLAPQKRLREK 391
P+ + G + NY+RV E W D+ + E+ Y R P R +D GD++ Q L+++
Sbjct: 379 PYNFGKLASKKKGPLITINYKRVIETWFDDTHKEYFYTREPLARYLDMGDISEQLALKKR 438
Query: 392 LNCKPFKWFMTQIAFDLTAKYPPVEPKPFAEGKIXXXXXXXXXXXLCADAAHGPQLAALR 451
LNCK F+WFM IA+D+ K+P + P G++ C D+ A +
Sbjct: 439 LNCKSFQWFMDHIAYDVYDKFPGL-PANLHWGELRSVASDG-----CLDSMGHQPPAIMG 492
Query: 452 LARCDVTTNAEQHFVLTWHKDIRCKTRNMCWDLPDSSPGSPIVLYACHLGGGNQLWRYYP 511
L C N + ++ + + C + + I L C LG + W+Y
Sbjct: 493 LTYCHGGGNNQ---LVRLNAAGQLGVGERCVE----ADRQGIKLAVCRLGTVDGPWQYNE 545
Query: 512 DTKQIKHGGHSNCVDHEPTTLRLHISRCDAAADTQRW 548
TK + H H C+ P T +L + CD Q+W
Sbjct: 546 HTKHLMHRVHKKCMALHPATQQLSLGHCDVNDSYQQW 582
>UniRef50_UPI00015B5D50 Cluster: PREDICTED: similar to
ENSANGP00000021852; n=1; Nasonia vitripennis|Rep:
PREDICTED: similar to ENSANGP00000021852 - Nasonia
vitripennis
Length = 612
Score = 353 bits (869), Expect = 6e-96
Identities = 197/516 (38%), Positives = 285/516 (55%), Gaps = 19/516 (3%)
Query: 50 GEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGC-KKKMYIESLPS 108
G G P +P ++ + L+ +N +N SD+IPL+R+LPD+R C + + LPS
Sbjct: 100 GNNGEPVQIPAKDFQKMQQLFQINRYNLLASDRIPLNRTLPDVRKKKCITRYANLGDLPS 159
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYL-EGNMP 167
SV++ FHNE WSTLLRT +SV+ RSP L++EI LVDD S +DFL K LD+Y+ + N+P
Sbjct: 160 TSVIIVFHNEAWSTLLRTVHSVINRSPRKLLEEIILVDDNSDRDFLRKPLDEYVAQLNVP 219
Query: 168 KVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCP 227
+V+R KR GL+ ARL GA A +VL FLD+H E WL PLLE I+ + V+ P
Sbjct: 220 -TRVLRSDKRVGLVNARLMGANEAKGEVLTFLDAHCECTAGWLEPLLEAISKNRTRVVSP 278
Query: 228 FIDVIAFDTFEYRAQDEGARGAFDWEFFYKRL----PVLPKDEKNMPEPFESPVMAGGLF 283
ID+I DTF Y E GAF+W+ ++ L +L + +N+ +PF++P MAGGLF
Sbjct: 279 VIDIINDDTFSYTRSFELHWGAFNWDLHFRWLMLNGALLRERRENIVDPFKTPAMAGGLF 338
Query: 284 AISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPG 343
++ R +F+ELG YD + IWGGE ELSF++WQCGG + APCS VGHI+RK +P+ PG
Sbjct: 339 SMDREYFFELGSYDEHMRIWGGENLELSFRVWQCGGSVEIAPCSHVGHIFRKSSPYTFPG 398
Query: 344 HGD-FVGKNYRRVAEVWMDEYAEHLYRRRPHYRRI-DPGDLAPQKRLREKLNCKPFKWFM 401
D + N RVA VWMDE+ + + P +R+ D + + LRE+L CK F+W++
Sbjct: 399 GVDEILYGNLARVALVWMDEWGKFYFNFNPQAQRVRDKQQIRSRLELRERLKCKSFEWYL 458
Query: 402 TQIAFDLTAKYPPVEPKPFAEGKIXXXXXXXXXXXLCADAAHGPQLAALRLARCDVTTNA 461
+ D + P + + F I + A+ + C V N
Sbjct: 459 DNVWPD---HFFPKDDRFFGYVSILHPSNKKCLMRPMSKGAYSQPSGFVAYQDCIVPPNL 515
Query: 462 EQHFVLTWHKD-IRCKTRNMCWDLP--DSSPGSP-IVLYACHLGGGNQLWRYYPDTKQIK 517
Q FV+ KD + ++C D P D+ P + L AC G +Q W Y + I+
Sbjct: 516 SQMFVM--RKDGVIMTDESVCLDAPEKDNRHEKPKVKLMACS-GFASQKWEYDEKERVIR 572
Query: 518 HGGHSNCVDHEPTTLRLHISRCDAAADTQRWTLDTV 553
H C++ P+ + D Q W L +V
Sbjct: 573 HVASGMCLEESPSENESPVISACTGYDEQNWELKSV 608
>UniRef50_O61397 Cluster: Probable N-acetylgalactosaminyltransferase
7; n=5; Bilateria|Rep: Probable
N-acetylgalactosaminyltransferase 7 - Caenorhabditis
elegans
Length = 601
Score = 349 bits (857), Expect = 2e-94
Identities = 179/395 (45%), Positives = 238/395 (60%), Gaps = 16/395 (4%)
Query: 36 NYEQISSERKRVGIGEQGLPAHLPKEESDLE-ESLYSVNGFNGALSDKIPLSRSLPDIRH 94
NYE E GE G P + EE + GFN +SD I ++R++PDIR
Sbjct: 84 NYEPKEPEIPSNQPGEHGKPVPVTDEEGMAAGRAAEKEFGFNTYVSDMISMNRTIPDIRP 143
Query: 95 SGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFL 154
CK Y E LP+VSVVV FHNE W+ LLRT +SVL RSP LI+++ +VDD S K L
Sbjct: 144 EECKHWDYPEKLPTVSVVVVFHNEGWTPLLRTVHSVLLRSPPELIEQVVMVDDDSDKPHL 203
Query: 155 GKALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLL 214
+ LD Y+ KV V+R +R GLI AR GA+ +T +V++FLD+H E N NWLPPLL
Sbjct: 204 KEKLDKYVTRFNGKVIVVRTEQREGLINARSIGAKHSTGEVVLFLDAHCEVNTNWLPPLL 263
Query: 215 EPIALDYRTVMCPFIDVIAFDTFEYR----AQDEGARGAFDWEFFYKRLPVLPKD---EK 267
PI + + + P ID I +++EYR + + G F+W YK + ++ K
Sbjct: 264 APIKRNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHSGIFEWGLLYKETQITERETAHRK 323
Query: 268 NMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCS 327
+ +PF SP AGGLFAI+R +F ELG YD GL IWGGEQYELSFKIWQCGG ++ PCS
Sbjct: 324 HNSQPFRSPTHAGGLFAINRLWFKELGYYDEGLQIWGGEQYELSFKIWQCGGGIVFVPCS 383
Query: 328 RVGHIYRKFAPFPNPGHGDFVGK-----NYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDL 382
VGH+YR P+ G F GK N RV + WMD+Y+++ R P ++PGD+
Sbjct: 384 HVGHVYRSHMPY---SFGKFSGKPVISINMMRVVKTWMDDYSKYYLTREPQATNVNPGDI 440
Query: 383 APQKRLREKLNCKPFKWFMTQIAFDLTAKYPPVEP 417
+ Q LR+KL CK FKW+M +A+D+ YP + P
Sbjct: 441 SAQLALRDKLQCKSFKWYMENVAYDVLKSYPMLPP 475
>UniRef50_Q7Z7M9 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 5; n=29;
Deuterostomia|Rep: Polypeptide
N-acetylgalactosaminyltransferase 5 - Homo sapiens
(Human)
Length = 940
Score = 347 bits (852), Expect = 6e-94
Identities = 176/368 (47%), Positives = 231/368 (62%), Gaps = 8/368 (2%)
Query: 50 GEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSV 109
G+ G P +P + E + FN LSD IP+ R++ D R +GC +++ +LP+
Sbjct: 439 GQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNNLPTT 498
Query: 110 SVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKV 169
SV++ F +E WSTLLR+ +SV+ RSP HLIKEI LVDD STKD+L LD Y+ PKV
Sbjct: 499 SVIMCFVDEVWSTLLRSVHSVINRSPPHLIKEILLVDDFSTKDYLKDNLDKYMS-QFPKV 557
Query: 170 KVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFI 229
+++RL +R GLI ARLAGA+ AT DVL FLDSH E NV WL PLLE + L + V CP I
Sbjct: 558 RILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLSRKKVACPVI 617
Query: 230 DVIAFDTFEYRAQDEGARGAFDW--EFFYKRLP--VLPKDEKNMPEPFESPVMAGGLFAI 285
+VI Y D RG F W F ++ +P V+ K+ + PVMAGGLF+I
Sbjct: 618 EVINDKDMSYMTVDNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDTIRCPVMAGGLFSI 677
Query: 286 SRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNP-GH 344
+++F+ELG YDPGLD+WGGE ELSFK+W CGG + PCSRVGHI+R P+ P
Sbjct: 678 DKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIFRNDNPYSFPKDR 737
Query: 345 GDFVGKNYRRVAEVWMDEYAEHLYRRRPHY--RRIDPGDLAPQKRLREKLNCKPFKWFMT 402
V +N RVAEVW+DEY E Y H + +D G+L Q+ LR+KL CK FKW++
Sbjct: 738 MKTVERNLVRVAEVWLDEYKELFYGHGDHLIDQGLDVGNLTQQRELRKKLKCKSFKWYLE 797
Query: 403 QIAFDLTA 410
+ DL A
Sbjct: 798 NVFPDLRA 805
>UniRef50_Q8NCW6 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 11; n=33;
Eumetazoa|Rep: Polypeptide
N-acetylgalactosaminyltransferase 11 - Homo sapiens
(Human)
Length = 608
Score = 344 bits (845), Expect = 4e-93
Identities = 191/512 (37%), Positives = 283/512 (55%), Gaps = 28/512 (5%)
Query: 60 KEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEH 119
+ + +L + Y + FN +SD++ R +PD R++ CK+K Y LP+ SVV+ F+NE
Sbjct: 104 ERDQELRDLGYQKHAFNMLISDRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEA 163
Query: 120 WSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP-KVKVIRLPKRS 178
+S LLRT +SV+ R+P HL+ EI LVDD S D L LD+Y++ +P K+KVIR KR
Sbjct: 164 FSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKRE 223
Query: 179 GLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFE 238
GLI R+ GA AT +VL+FLDSH E NV WL PLL I D TV+CP ID+I+ DT
Sbjct: 224 GLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTLA 283
Query: 239 YRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPE----PFESPVMAGGLFAISRAFFWELG 294
Y + RG F+W +K ++P E E P +SP MAGGLFA++R +F ELG
Sbjct: 284 YSSSPV-VRGGFNWGLHFK-WDLVPLSELGRAEGATAPIKSPTMAGGLFAMNRQYFHELG 341
Query: 295 GYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKNYRR 354
YD G+DIWGGE E+SF+IW CGG++ PCSRVGHI+RK P+ +P D + N R
Sbjct: 342 QYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMTHNSLR 401
Query: 355 VAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQI--AFDLTAKY 412
+A VW+DEY E + RP + G+++ + LR+KL CK FKW++ + ++ +
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQISGSH 461
Query: 413 P-----------PVEPKPFAEGKIXXXXXXXXXXXLCADAAHGPQLAALRLARCDVTTNA 461
P PK G++ L A + + L CD ++
Sbjct: 462 AKPQQPIFVNRGPKRPKVLQRGRL---YHLQTNKCLVAQGRPSQKGGLVVLKACDY-SDP 517
Query: 462 EQHFVLTWHKDIRCKTRNMCWDLPDSSPGSPIVLYACHLGGGNQLWRYYPDTK--QIKHG 519
Q ++ ++ + +C D+ ++ P L CH GG+Q W + + + Q+ G
Sbjct: 518 NQIWIYNEEHELVLNSL-LCLDMSETRSSDPPRLMKCHGSGGSQQWTFGKNNRLYQVSVG 576
Query: 520 GHSNCVDHEPTTLRLHISRCDAAADTQRWTLD 551
VD + ++ CD ++ +Q+W L+
Sbjct: 577 QCLRAVDPLGQKGSVAMAICDGSS-SQQWHLE 607
>UniRef50_Q95ZJ1 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 5; n=13;
Bilateria|Rep: Polypeptide
N-acetylgalactosaminyltransferase 5 - Caenorhabditis
elegans
Length = 626
Score = 341 bits (837), Expect = 4e-92
Identities = 189/481 (39%), Positives = 269/481 (55%), Gaps = 19/481 (3%)
Query: 73 NGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLY 132
N FN SD I + R+LP + CK + Y E+LP SV++ FHNE WS LLRT +SVL
Sbjct: 141 NAFNQYASDMISVHRTLPTNIDAECKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLE 200
Query: 133 RSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERAT 192
R+P+HL++E+ LVDD S D + L++Y+ KVK++R+ KR GLI ARL GA AT
Sbjct: 201 RTPDHLLEEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVAT 260
Query: 193 ADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDE--GARGAF 250
+VL +LDSH E W+ PLL+ I D TV+CP IDVI +TFEY + G F
Sbjct: 261 GEVLTYLDSHCECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTSVGGF 320
Query: 251 DW--EFFYKRLPVLPKDEKNMP-EPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQ 307
DW +F + +P + + P +P SP MAGGLF+I + +F +LG YDPG DIWGGE
Sbjct: 321 DWGLQFNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGEN 380
Query: 308 YELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKNYRRVAEVWMDEYAEHL 367
ELSFKIW CGG + PCS VGH++RK +P+ + + +N R+AEVW+D+Y +
Sbjct: 381 LELSFKIWMCGGTLEIVPCSHVGHVFRKRSPYKWRTGVNVLKRNSIRLAEVWLDDYKTYY 440
Query: 368 YRRRPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQIAFDLTAKYPPVEPKPFAEGKIXX 427
Y R + + D GD++ +K+LRE L CK FKW++ I +L + P E A+G++
Sbjct: 441 Y-ERINNQLGDFGDISSRKKLREDLGCKSFKWYLDNIYPEL---FVPGE--SVAKGEVRN 494
Query: 428 XXXXXXXXXLCADAAHGPQLAALRLARCDVTTNAEQHFVLTWHKDIRCKTRNMCWDLPDS 487
C H + + +C Q+++L+ KD + C D
Sbjct: 495 SAVQPARCLDCMVGRH-EKNRPVGTYQCH-GQGGNQYWMLS--KDGEIRRDESCVDY--- 547
Query: 488 SPGSPIVLYACHLGGGNQLWRYYPDTKQIKHGGHSNCVDHEPTTLRLHISRCDAAADTQR 547
GS ++++ CH GNQ WRY DT +++H C+ +L + C Q
Sbjct: 548 -AGSDVMVFPCHGMKGNQEWRYNHDTGRLQHAVSQKCLGMTKDGAKLEMVACQYDDPYQH 606
Query: 548 W 548
W
Sbjct: 607 W 607
>UniRef50_O61394 Cluster: Probable N-acetylgalactosaminyltransferase
6; n=4; Caenorhabditis|Rep: Probable
N-acetylgalactosaminyltransferase 6 - Caenorhabditis
elegans
Length = 618
Score = 339 bits (833), Expect = 1e-91
Identities = 173/368 (47%), Positives = 236/368 (64%), Gaps = 9/368 (2%)
Query: 48 GIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLP 107
G G G+ +HL E+ L +S ++VN FN +SD I + RSLP+IR C+ Y ++LP
Sbjct: 99 GEGGAGV-SHLTPEQQKLADSTFAVNQFNLLVSDGISVRRSLPEIRKPSCRNMTYPDNLP 157
Query: 108 SVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLG-KALDDYLEGNM 166
+ SV++ +HNE +STLLRT +SV+ RSP+ L+KEI LVDD S ++FL LD L+
Sbjct: 158 TTSVIIVYHNEAYSTLLRTVWSVIDRSPKELLKEIILVDDFSDREFLRYPTLDTTLKPLP 217
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMC 226
+K+IR +R GLI AR+ GA+ A DVL FLDSH E WL PLL I L+ + V C
Sbjct: 218 TDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLLTRIKLNRKAVPC 277
Query: 227 PFIDVIAFDTFEYRAQDEGARGAFDW--EFFYKRLPV-LPKDEKNMPE-PFESPVMAGGL 282
P ID+I +TF+Y+ E RG F+W +F + +P + K P P ESP MAGGL
Sbjct: 278 PVIDIINDNTFQYQKGIEMFRGGFNWNLQFRWYGMPTAMAKQHLLDPTGPIESPTMAGGL 337
Query: 283 FAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAP--FP 340
F+I+R +F ELG YDPG+DIWGGE E+SF+IWQCGGR+ PCS VGH++RK +P FP
Sbjct: 338 FSINRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCSHVGHVFRKSSPHDFP 397
Query: 341 NPGHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPG-DLAPQKRLREKLNCKPFKW 399
G + N RVAEVWMD++ + Y+ P R+ D++ + LR+KLNCK FKW
Sbjct: 398 GKSSGKVLNTNLLRVAEVWMDDWKHYFYKIAPQAHRMRSSIDVSERVELRKKLNCKSFKW 457
Query: 400 FMTQIAFD 407
++ + D
Sbjct: 458 YLQNVFQD 465
>UniRef50_Q6WV19 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 2; n=2;
Sophophora|Rep: Polypeptide
N-acetylgalactosaminyltransferase 2 - Drosophila
melanogaster (Fruit fly)
Length = 633
Score = 339 bits (833), Expect = 1e-91
Identities = 196/490 (40%), Positives = 267/490 (54%), Gaps = 30/490 (6%)
Query: 67 ESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRT 126
E Y N FN SD +P +R +PD R+ C+ K Y E LP SV++ FHNE STLLRT
Sbjct: 162 EDPYIRNRFNQEASDALPSNRDIPDTRNPMCRTKKYREDLPETSVIITFHNEARSTLLRT 221
Query: 127 AYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLE-GNMPKVKVIRLPKRSGLITARL 185
SVL RSPEHLI+EI LVDD S +D LE + KV+VIR KR GL+ +R+
Sbjct: 222 IVSVLNRSPEHLIREIVLVDDYSDHP------EDGLELAKIDKVRVIRNDKREGLVRSRV 275
Query: 186 AGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEG 245
GA+ A + VL FLDSH E N WL PLLE + D V+CP IDVI+ D F+Y
Sbjct: 276 KGADAAVSSVLTFLDSHVECNEMWLEPLLERVREDPTRVVCPVIDVISMDNFQYIGASAD 335
Query: 246 ARGAFDWEFFYKRLPVLPKD---EKNMP-EPFESPVMAGGLFAISRAFFWELGGYDPGLD 301
RG FDW +K + P + N P +P++AGGLF I +A+F +LG YD +D
Sbjct: 336 LRGGFDWNLIFKWEYLSPSERAMRHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMD 395
Query: 302 IWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNP-GHGDFVGKNYRRVAEVWM 360
+WGGE E+SF++WQCGG + PCSRVGH++RK P+ P G G+ +N RR AEVWM
Sbjct: 396 VWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYTFPGGSGNVFARNTRRAAEVWM 455
Query: 361 DEYAEHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQIAFDLTAKYPPVEPKPF 420
D+Y +H Y P + I G++ + L+EKL+CKPFKW++ + DL A P P
Sbjct: 456 DDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQA------PDPQ 509
Query: 421 AEGKIXXXXXXXXXXXLCADAAHGPQLAALRLARCDVTTNAEQHFVLTWHKDIRCKTRNM 480
G+ C D + + C T Q + T +I K ++
Sbjct: 510 EVGQFRQDSTE------CLDTMGHLIDGTVGIFPCH-NTGGNQEWAFTKRGEI--KHDDL 560
Query: 481 CWDLPDSSPGSPIVLYACHLGGGNQLWRYYPDTKQIKHGGHSNCVD-HEPTTLRLHISRC 539
C L + GS +VL AC NQ W + ++H + C+D + + + C
Sbjct: 561 CLTLVTFARGSQVVLKACD-DSENQRW-IMREGGLVRHYKINVCLDSRDQSQQGVSAQHC 618
Query: 540 DAAADTQRWT 549
++A TQRW+
Sbjct: 619 NSALGTQRWS 628
>UniRef50_Q10471 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 2 (EC 2.4.1.41)
(Protein-UDP acetylgalactosaminyltransferase 2) (UDP-
GalNAc:polypeptide N-acetylgalactosaminyltransferase 2)
(Polypeptide GalNAc transferase 2) (GalNAc-T2)
(pp-GaNTase 2) [Contains: Polypeptide
N-acetylgalactosaminyltransferase 2 soluble form]; n=32;
Coelomata|Rep: Polypeptide
N-acetylgalactosaminyltransferase 2 (EC 2.4.1.41)
(Protein-UDP acetylgalactosaminyltransferase 2) (UDP-
GalNAc:polypeptide N-acetylgalactosaminyltransferase 2)
(Polypeptide GalNAc transferase 2) (GalNAc-T2)
(pp-GaNTase 2) [Contains: Polypeptide
N-acetylgalactosaminyltransferase 2 soluble form] - Homo
sapiens (Human)
Length = 571
Score = 338 bits (830), Expect = 3e-91
Identities = 188/485 (38%), Positives = 263/485 (54%), Gaps = 26/485 (5%)
Query: 70 YSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYS 129
Y+ N FN SDK+ + R++PD RH C++K + LP+ SVV+ FHNE S LLRT S
Sbjct: 99 YARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVS 158
Query: 130 VLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLAGAE 189
VL +SP HLIKEI LVDD S G L G + KV+V+R +R GL+ +R+ GA+
Sbjct: 159 VLKKSPPHLIKEIILVDDYSNDPEDGALL-----GKIEKVRVLRNDRREGLMRSRVRGAD 213
Query: 190 RATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGARGA 249
A A VL FLDSH E N +WL PLLE +A D V+ P IDVI D F+Y +G
Sbjct: 214 AAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLKGG 273
Query: 250 FDWEFFYKRLPVLPKDEK----NMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGG 305
FDW +K + P+ + N P ++P++AGGLF + + +F ELG YD +D+WGG
Sbjct: 274 FDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGG 333
Query: 306 EQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNP-GHGDFVGKNYRRVAEVWMDEYA 364
E E+SF++WQCGG + PCSRVGH++RK P+ P G G +N RR AEVWMDEY
Sbjct: 334 ENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYK 393
Query: 365 EHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQIAFDLTAKYPPVEPKPFAEGK 424
Y P R + G++ + LR+KL+CKPFKW++ + +L + P + + A G
Sbjct: 394 NFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPEL--RVP--DHQDIAFGA 449
Query: 425 IXXXXXXXXXXXLCADAAHGPQLAALRLARCDVTTNAEQHFVLTWHKDIRCKTRNMCWDL 484
+ AD G + C Q + LT K + K ++C +
Sbjct: 450 LQQGTNCLDTLGHFADGVVG-------VYECH-NAGGNQEWALTKEKSV--KHMDLCLTV 499
Query: 485 PDSSPGSPIVLYACHLGGGNQLWRYYPDTKQIKHGGHSNCVDHEPT-TLRLHISRCDAAA 543
D +PGS I L C Q W +++H G + C+D + L + C A
Sbjct: 500 VDRAPGSLIKLQGCRENDSRQKWEQIEGNSKLRHVGSNLCLDSRTAKSGGLSVEVCGPAL 559
Query: 544 DTQRW 548
+Q+W
Sbjct: 560 -SQQW 563
>UniRef50_UPI0000E461C0 Cluster: PREDICTED: hypothetical protein,
partial; n=1; Strongylocentrotus purpuratus|Rep:
PREDICTED: hypothetical protein, partial -
Strongylocentrotus purpuratus
Length = 639
Score = 333 bits (818), Expect = 8e-90
Identities = 167/364 (45%), Positives = 221/364 (60%), Gaps = 10/364 (2%)
Query: 49 IGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPS 108
+GE G P + ++LY N FN SD I +RSLPD+R CK +Y E LP+
Sbjct: 233 LGEMGKPVIFEGDMKTHADALYHKNAFNLLASDMIAFNRSLPDVRPQQCKSLVYPEVLPT 292
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM-- 166
SV++ FHNE +S LLRT +SV+ RSP HL+KEI LVDDAST++ L LDDY+ +
Sbjct: 293 TSVIIIFHNEAFSALLRTVHSVINRSPRHLLKEIILVDDASTQEHLKVKLDDYISRHFHS 352
Query: 167 -PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVM 225
+V++ RLP RSGLI AR+ GA A D+L FLDSH E NV WL PLL I D R V+
Sbjct: 353 SARVRIERLPTRSGLIRARIHGALNAIGDILTFLDSHCEVNVGWLEPLLAVIDKDRRNVV 412
Query: 226 CPFIDVIAFDTFEYRAQDEGAR-GAFDWEFFYKRLPV----LPKDEKNMPEPFESPVMAG 280
P IDVI + Y+ D+ + G+F W ++ + L + ++N P SP MAG
Sbjct: 413 TPTIDVIDDNDLAYKGSDQLPQVGSFGWTMAFRWTAIQTMDLEEAKRNPTLPIRSPTMAG 472
Query: 281 GLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFP 340
GLF+I + +F ELG YDPG IWG E ELSFK W CGG + CS VGHI+RKFAP+
Sbjct: 473 GLFSIDKGYFMELGMYDPGFQIWGAENIELSFKTWMCGGSLYTMACSHVGHIFRKFAPY- 531
Query: 341 NPGHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFKWF 400
G G + +N +R+ EVW+ + Y+ P RID GD+ Q LR+KL+CK F W+
Sbjct: 532 -SGMGSYFHRNNKRLIEVWLGDARAFYYKLHPDVLRIDAGDIQDQINLRKKLDCKSFDWY 590
Query: 401 MTQI 404
+ +
Sbjct: 591 LDNV 594
>UniRef50_A7SDQ3 Cluster: Predicted protein; n=1; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 575
Score = 331 bits (813), Expect = 3e-89
Identities = 187/492 (38%), Positives = 261/492 (53%), Gaps = 26/492 (5%)
Query: 67 ESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRT 126
+ Y+ N +N SD++P+ R +PD+R CK +++ LP+ ++++ FHNE S LLRT
Sbjct: 100 DDAYAKNAYNIKKSDQLPVDREVPDVRDQQCKSQVWPHDLPTTTIIICFHNEGRSALLRT 159
Query: 127 AYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLA 186
S L RSP HL+KEI LVDD S+ G+ L +PKVK+IR KR GLI +R+
Sbjct: 160 VISALNRSPPHLLKEIILVDDFSSDPKDGRRLL-----KLPKVKLIRNTKREGLIRSRVK 214
Query: 187 GAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGA 246
GA A +VL FLDSH E N NWL PLL I +T++ P IDVI DTF+Y
Sbjct: 215 GANLARGEVLTFLDSHCECNKNWLEPLLLRIKESPKTIVSPIIDVINLDTFDYLGSSADL 274
Query: 247 RGAFDWEFFYKR--LP--VLPKDEKNMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDI 302
RG F W +K LP +L + + P +SPV+AGGLF++++ +F LG YD +D+
Sbjct: 275 RGGFGWNLNFKWDFLPPHILAERQGKPTLPIKSPVIAGGLFSVAKKWFETLGKYDMQMDV 334
Query: 303 WGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHG-DFVGKNYRRVAEVWMD 361
WGGE E+SF+ WQCGG M PCSRVGH++R P+ PG + KN RR EVWMD
Sbjct: 335 WGGENLEISFRTWQCGGAMEIIPCSRVGHVFRNRHPYQFPGGSMNVFQKNTRRAVEVWMD 394
Query: 362 EYAEHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQIAFDLTAKYPPVE-PKPF 420
+Y + Y P+ + GD+ + LR KL C+PFKW++ + +L K P E K F
Sbjct: 395 DYKRYYYAAVPYAKNTPYGDIEERVELRRKLRCRPFKWYVQNVYPEL--KLPSDESTKSF 452
Query: 421 AEGKIXXXXXXXXXXXLCADAAHGPQLAALRLARCDVTTNAEQHFVLTWHKDIRCKTRNM 480
E K C D + + L C Q + LT K K M
Sbjct: 453 GEIK---------QGNQCVDTLGHMRGQTIGLFECH-GAGGNQMWSLT--KSSLLKHETM 500
Query: 481 CWDLPDSSPGSPIVLYACHLGGGNQLWRYYPDTKQIKHGGHSNCVDHEP-TTLRLHISRC 539
C + D P+ L C Q W Y T +++H S C+ + T L + +C
Sbjct: 501 CLGVNDGKATEPVQLLDCDENNSMQHWEYEKATSRLRHKPTSLCLSSDKHKTSGLTLEQC 560
Query: 540 DAAADTQRWTLD 551
+ +A +Q W +
Sbjct: 561 NGSAFSQHWAFE 572
>UniRef50_P34678 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 3; n=2;
Caenorhabditis|Rep: Polypeptide
N-acetylgalactosaminyltransferase 3 - Caenorhabditis
elegans
Length = 612
Score = 330 bits (811), Expect = 6e-89
Identities = 155/364 (42%), Positives = 223/364 (61%), Gaps = 10/364 (2%)
Query: 50 GEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCK---KKMYIESL 106
G+ G +P+++ ++E + N FN S+ I ++R+LPD R C+ + +
Sbjct: 110 GQGGTGVTVPEDKKTIKEKRFLENQFNVVASEMISVNRTLPDYRSDACRTSGNNLKTAGM 169
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
P S+++ FHNE W+TLLRT +SV+ RSP HL++EI LVDD S +D+L K LD Y++
Sbjct: 170 PKTSIIIVFHNEAWTTLLRTLHSVINRSPRHLLEEIILVDDKSDRDYLVKPLDSYIKMFP 229
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMC 226
+ ++ L RSGLI ARL G+E A +L+FLD+H E WL PL+ +A D + V+
Sbjct: 230 IPIHLVHLENRSGLIRARLTGSEMAKGKILLFLDAHVEVTDGWLEPLVSRVAEDRKRVVA 289
Query: 227 PFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKN-----MPEPFESPVMAGG 281
P IDVI+ DTFEY E G F+W + R +PK E N P ++P +AGG
Sbjct: 290 PIIDVISDDTFEYVTASETTWGGFNWHLNF-RWYAVPKRELNRRGSDRSMPIQTPTIAGG 348
Query: 282 LFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPN 341
LFAI + FF+++G YD G+ +WGGE E+SF++W CGG + PCSRVGH++RK P+
Sbjct: 349 LFAIDKQFFYDIGSYDEGMQVWGGENLEISFRVWMCGGSLEIHPCSRVGHVFRKQTPYTF 408
Query: 342 P-GHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFKWF 400
P G + N R AEVWMDEY Y+ P R ++ GD++ +K+LRE L CK FKW+
Sbjct: 409 PGGTAKVIHHNAARTAEVWMDEYKAFFYKMVPAARNVEAGDVSERKKLRETLQCKSFKWY 468
Query: 401 MTQI 404
+ I
Sbjct: 469 LENI 472
>UniRef50_UPI00015B453F Cluster: PREDICTED: similar to GA20875-PA;
n=1; Nasonia vitripennis|Rep: PREDICTED: similar to
GA20875-PA - Nasonia vitripennis
Length = 793
Score = 328 bits (806), Expect = 2e-88
Identities = 197/514 (38%), Positives = 268/514 (52%), Gaps = 23/514 (4%)
Query: 49 IGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPS 108
+G+ G A+L E +YS N LS+KIPL R + D+R CK Y LP+
Sbjct: 239 LGDFGEAAYLSDSEKQNGSLVYSKRAVNVVLSNKIPLQRRIRDMRDPLCKSVTYDTKLPT 298
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
SVV+ FHNE WS LLRT YSVL SP +KEI LVDD S ++ L L Y+E +PK
Sbjct: 299 TSVVIIFHNEAWSVLLRTVYSVLQESPPKFLKEIILVDDNSNEEELEDILAYYIETRLPK 358
Query: 169 -VKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCP 227
VK++RLPKR GLI ARLAGA++AT DVL+FLD+H E WL PLL I V+ P
Sbjct: 359 KVKLLRLPKRQGLIRARLAGAQQATGDVLVFLDAHCEVTKGWLSPLLHRIKARPNAVLIP 418
Query: 228 FIDVIAFDTFEYRAQDEGAR---GAFDW--EFFYKRLPVLPKDEKNMP-EPFESPVMAGG 281
IDVI T EY+ G+ G F W +F + + PK P +P +P MAGG
Sbjct: 419 VIDVIDAKTLEYKLAARGSHMPIGGFKWTGDFTWINMEDSPKRTTASPIDPINTPTMAGG 478
Query: 282 LFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPN 341
LFAI R +FW +G YD +D WGGE E+SF+IWQCGG + PCSRVGHI+R F P+
Sbjct: 479 LFAIDRKYFWVIGSYDELMDGWGGENLEMSFRIWQCGGSIEIVPCSRVGHIFRDFFPYEF 538
Query: 342 PGHGDFVGKNYRRVAEVWMDEYAE--HLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFKW 399
P D N R A VWMD+Y L+ + + GDL +K+LRE+L C FKW
Sbjct: 539 PSSRDTYLINTARAAHVWMDDYKRLFFLHHKNMEGNTKEIGDLTARKKLRERLQCASFKW 598
Query: 400 FMTQIAFDLTAKYPPVEPKPFAEGKIXXXXXXXXXXXLCADAAHGPQLAALRLARCDVTT 459
++ + K+ P E A G+ + ++ H P A+ + + D+T
Sbjct: 599 YLQNV---YPEKFIPDE-NVLAYGRARSPRRNLCLDSITSNDEH-PFFLAVNICQDDLT- 652
Query: 460 NAEQHFVLTWHKDIRCKTRNMCWDLPDSSPGSPIVLYACHLGGGNQLWRYYPDTKQIKHG 519
EQ+F + +R N+C + S + + +C+ + WR QI H
Sbjct: 653 -PEQYFSVDEFGQLR--KENVCATIVKDS--FMVQMVSCNKHVEPKKWR-LTKYGQIMHV 706
Query: 520 GHSNCVDHE--PTTLRLHISRCDAAADTQRWTLD 551
C+D ++ ++ C Q+W D
Sbjct: 707 QSGLCLDASGLKSSQKILARTCSKNTPDQQWKFD 740
>UniRef50_Q176D5 Cluster: N-acetylgalactosaminyltransferase; n=3;
Endopterygota|Rep: N-acetylgalactosaminyltransferase -
Aedes aegypti (Yellowfever mosquito)
Length = 661
Score = 326 bits (801), Expect = 1e-87
Identities = 154/362 (42%), Positives = 226/362 (62%), Gaps = 5/362 (1%)
Query: 48 GIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLP 107
G+GE G P + ++ L + L+ +N +N SD++ L+RSLPD+R S C K Y LP
Sbjct: 84 GVGENGDPVVIQAKDLLLMQQLFQINRYNLLASDRVALNRSLPDVRKSKCVSKEYPSKLP 143
Query: 108 SVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP 167
+ S+++ FHNE WS LLRT +SV+ RSP HLIKEI LVDDAS + FL L++Y++
Sbjct: 144 TTSIIIVFHNEAWSVLLRTVWSVIIRSPRHLIKEILLVDDASDRRFLKNDLENYVQKLPV 203
Query: 168 KVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCP 227
+ ++RL KR GL+ ARL GA AT D L FLD+H E + WL PLL + + + V+CP
Sbjct: 204 VISILRLNKREGLVAARLMGARVATGDTLTFLDAHCECSPGWLEPLLARVQENPKKVVCP 263
Query: 228 FIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPV----LPKDEKNMPEPFESPVMAGGLF 283
ID+I+ D F Y E GAF+W+ ++ + L + K+ PF +P MAGGLF
Sbjct: 264 VIDIISDDNFSYIKSFEFHWGAFNWQMHFRWYTLSDEELAERRKDTTMPFHTPAMAGGLF 323
Query: 284 AISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPG 343
I R +F+++G YD L IWGG+ E+SF+IWQCGG + APCS VGH++RK +P+ PG
Sbjct: 324 TIDRKYFFDVGAYDERLKIWGGDNLEMSFRIWQCGGEIEIAPCSHVGHLFRKSSPYTFPG 383
Query: 344 H-GDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFKWFMT 402
+ +N RVA VWMD++A+ ++ +++ + L++ L+CK F W++
Sbjct: 384 GVSGILNENLARVALVWMDDWAKFFFKFNKGTEEFKSLNVSSRVALKKHLSCKSFDWYLR 443
Query: 403 QI 404
+I
Sbjct: 444 KI 445
>UniRef50_Q8N428 Cluster: Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1; n=43;
Eumetazoa|Rep: Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1 - Homo
sapiens (Human)
Length = 558
Score = 326 bits (800), Expect = 1e-87
Identities = 191/506 (37%), Positives = 264/506 (52%), Gaps = 34/506 (6%)
Query: 56 AHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPF 115
A+L ++ E Y + FN SDK+ R + D RH C Y LP+ SV++ F
Sbjct: 72 AYLSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVIITF 131
Query: 116 HNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLP 175
HNE STLLRT SVL R+P +LI+EI LVDD F D L +PKVK +R
Sbjct: 132 HNEARSTLLRTVKSVLNRTPANLIQEIILVDD-----FSSDPEDCLLLTRIPKVKCLRND 186
Query: 176 KRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFD 235
+R GLI +R+ GA+ A A VL FLDSH E N WLPP+L+ + D+ V+ P IDVI+ D
Sbjct: 187 RREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLD 246
Query: 236 TFEYRAQDEGARGAFDWEFFYK--RLPVLPKDEKNMP-EPFESPVMAGGLFAISRAFFWE 292
F Y A RG FDW +K ++P+ K + P P +PV+AGG+F I +++F
Sbjct: 247 NFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNH 306
Query: 293 LGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNP-GHGDFVGKN 351
LG YD +DIWGGE +ELSF++W CGG + PCSRVGH++RK P+ P G+ +N
Sbjct: 307 LGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFPEGNALTYIRN 366
Query: 352 YRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQIAFDLTAK 411
+R AEVWMDEY ++ Y RP G +A + R+K+NCK F+W++ + +LT
Sbjct: 367 TKRTAEVWMDEYKQYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVP 426
Query: 412 YPPVEPKPFAEG-------KIXXXXXXXXXXXLCADAAHGPQLAALRLARCDVTTNAEQH 464
P +G +C +A PQ A L H
Sbjct: 427 VKEALPGIIKQGVNCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWL--------FSDH 478
Query: 465 FVLTWHKDIRCKTRNMCWDLPDSSPGSPIVLYACHLGGGNQLWRYYPDTKQIKHGGHSNC 524
+ K + + M SSPGSP++L C+ G Q WR I+H C
Sbjct: 479 LIQQQGKCLAATSTLM------SSPGSPVILQMCNPREGKQKWR--RKGSFIQHSVSGLC 530
Query: 525 VDHEPTTLRLHISRCDAAADTQRWTL 550
++ +P +L S+C A A Q+W L
Sbjct: 531 LETKPA--QLVTSKCQADAQAQQWQL 554
>UniRef50_UPI00015B515F Cluster: PREDICTED: similar to
n-acetylgalactosaminyltransferase; n=1; Nasonia
vitripennis|Rep: PREDICTED: similar to
n-acetylgalactosaminyltransferase - Nasonia vitripennis
Length = 826
Score = 324 bits (796), Expect = 4e-87
Identities = 171/406 (42%), Positives = 231/406 (56%), Gaps = 12/406 (2%)
Query: 8 LHAPPDLNKAVNKENHFVPKSLKKIDWHNYEQISSERKRVGIGEQGLPAHLPKEESDLEE 67
+H L + +H + + DW RK +GE G PA+L EE
Sbjct: 274 IHEEKTLLSNITHNSHNYETNERYEDWITKYVQGISRK---LGEYGRPAYLSGEEKIKGN 330
Query: 68 SLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTA 127
+ N LS+KIPL R LPD+R CK Y LPS S+++ FHNE +S LLRT
Sbjct: 331 EVLKKKAVNIILSNKIPLQRKLPDVRDPLCKNVTYDSVLPSASIIIIFHNEAFSVLLRTV 390
Query: 128 YSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK-VKVIRLPKRSGLITARLA 186
YSV+ +P L+KEI LVDD S ++ L L+ Y++ +PK VK++RL +R GL+ ARL
Sbjct: 391 YSVIKETPPKLLKEIILVDDKSNEEELLGLLEYYIQTRLPKKVKLLRLDERQGLVRARLK 450
Query: 187 GAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGA 246
GA+ AT DVL+FLD+H E WL PLL+ I V+ P ID I+ +TFEY DE +
Sbjct: 451 GAKSATGDVLMFLDAHCEVTKQWLEPLLQRIKEKKNAVVTPIIDNISEETFEYSHSDEPS 510
Query: 247 R---GAFDWEFFYKRLPVLPKDEKNMPE---PFESPVMAGGLFAISRAFFWELGGYDPGL 300
G F W + + + D K+ P +SP MAGGLFAI+R +FW++G YD +
Sbjct: 511 FFQVGGFTWSGHFTWINIQEADLKSKTSAISPVKSPTMAGGLFAINRKYFWDIGSYDDKM 570
Query: 301 DIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKNYRRVAEVWM 360
+ WGGE E+SF+IWQCGG + PCSRVGH++R F P+ P D G N R+A VWM
Sbjct: 571 EGWGGENLEMSFRIWQCGGVLETIPCSRVGHVFRNFLPYKFPMDKDTHGINTARLANVWM 630
Query: 361 DEYAEHLYRRRPHYRRIDP--GDLAPQKRLREKLNCKPFKWFMTQI 404
D+Y Y R Y+ GD+ + LREKL CK FKW++ +
Sbjct: 631 DDYKRLYYLHREEYKDKPELIGDIKERVNLREKLKCKSFKWYLDNV 676
>UniRef50_UPI0000D56CDA Cluster: PREDICTED: similar to CG4445-PA;
n=1; Tribolium castaneum|Rep: PREDICTED: similar to
CG4445-PA - Tribolium castaneum
Length = 602
Score = 321 bits (788), Expect = 4e-86
Identities = 194/546 (35%), Positives = 285/546 (52%), Gaps = 24/546 (4%)
Query: 23 HFVPKSLKKIDWHNYEQISSER--KRVGIGEQGLPAHLPKEESDLEESLYSVNGFNGALS 80
H++ + I ++ I+ R G G+ G P + ++ + + +N FN S
Sbjct: 63 HYIGNPVHAIPNATHDMINENNYDPRPGAGKNGNPVVIEPKDLLKMQQYFQINRFNLLAS 122
Query: 81 DKIPLSRSLPDIRHSGCKKKMY-IESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLI 139
D+IPL+RSLPD R C + P S+++ FHNE WSTLLRT +SV+ RSP L+
Sbjct: 123 DRIPLNRSLPDFRRKKCATLFGDYPTYPKTSIIIVFHNEAWSTLLRTVWSVINRSPPELL 182
Query: 140 KEIFLVDDASTKDFLGKALDDYLEGNMP-KVKVIRLPKRSGLITARLAGAERATADVLIF 198
+EI LVDD+S + FL K LDDY+ N+P KV+R R GLI ARL GA A VL F
Sbjct: 183 EEIILVDDSSERKFLKKPLDDYV-ANLPVPTKVLRSQARIGLIKARLKGALVAKGPVLTF 241
Query: 199 LDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKR 258
LD+H E WL LL I D V+CP ID+I DTF Y E GAF+W ++
Sbjct: 242 LDAHCECTTGWLEALLSVIKQDRTAVVCPVIDIINDDTFAYVKSFELHWGAFNWNLQFRW 301
Query: 259 LPV----LPKDEKNMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKI 314
+ L + + +PF +P MAGGLFAI R +F+E+G YD G++IWGGE E+SF+I
Sbjct: 302 FTLGGRELKLRKNDATQPFNTPTMAGGLFAIDREYFFEMGAYDDGMNIWGGENLEMSFRI 361
Query: 315 WQCGGRMLDAPCSRVGHIYRKFAPFPNPGH-GDFVGKNYRRVAEVWMDEYAEHLYR-RRP 372
WQCGG++ APCSRVGH++RK +P+ PG + N RVA VWMD++A ++ P
Sbjct: 362 WQCGGKVQIAPCSRVGHLFRKSSPYSFPGGINKTLFSNLARVARVWMDDWARFYFKFNEP 421
Query: 373 HYRRIDPGDLAPQKRLREKLNCKPFKWFMTQIAFDLTAKYPPVEPKPFAEGKIXXXXXXX 432
R + ++ + LR K CK F+W++ + + P + + F G+I
Sbjct: 422 ADRIKNEQNVTSRIELRRKHKCKGFEWYLDNV---WPQHFFPKDDRFF--GRIRNLGQNM 476
Query: 433 XXXXLCADAAHGPQLAALRLARCDVTTNAEQHFVLTWHKDIRCKTRNMCWDLPDSSPGSP 492
+ ++ C + FV+T I ++C D P+ P
Sbjct: 477 CLIKPQKKVVSNQPMGIAKIDMCLGDEVILEMFVMTKEGFIMTDD-SICLDAPEKVVIGP 535
Query: 493 --IVLYACHLGGGNQLWRYYPDTKQIKHGGHSNCVDHEPT---TLRLHISRCDAAADTQR 547
+ + AC G Q W Y + ++I+H + C+D + T L + C + TQ+
Sbjct: 536 SKVRIMACS-GYSRQKWVYDKEKQEIRHLTNQKCLDVSDSSKFTDGLITTNCTGSL-TQK 593
Query: 548 WTLDTV 553
W L++V
Sbjct: 594 WALESV 599
>UniRef50_Q9Y117 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 3; n=2;
Sophophora|Rep: Polypeptide
N-acetylgalactosaminyltransferase 3 - Drosophila
melanogaster (Fruit fly)
Length = 667
Score = 320 bits (785), Expect = 8e-86
Identities = 157/366 (42%), Positives = 215/366 (58%), Gaps = 9/366 (2%)
Query: 48 GIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLP 107
G G G P +P + + + +N FN SD+IPL+R+L D R C+ K Y LP
Sbjct: 91 GEGADGRPVVVPPRDRFRMQRFFRLNSFNLLASDRIPLNRTLKDYRTPECRDKKYASGLP 150
Query: 108 SVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP 167
S SV++ FHNE WS LLRT SV+ RSP HL+KEI LVDDAS + +L + L+ Y++
Sbjct: 151 STSVIIVFHNEAWSVLLRTITSVINRSPRHLLKEIILVDDASDRSYLKRQLESYVKVLAV 210
Query: 168 KVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCP 227
++ R+ KRSGL+ ARL GAE A DVL FLD+H E + WL PLL I + V+CP
Sbjct: 211 PTRIFRMKKRSGLVPARLLGAENARGDVLTFLDAHCECSRGWLEPLLSRIKESRKVVICP 270
Query: 228 FIDVIAFDTFEYRAQDEGARGAFDWEFFY------KRLPVLPKDEKNMPEPFESPVMAGG 281
ID+I+ D F Y E GAF+W+ + ++ K+ +P +P MAGG
Sbjct: 271 VIDIISDDNFSYTKTFENHWGAFNWQLSFRWFSSDRKRQTAGNSSKDSTDPIATPGMAGG 330
Query: 282 LFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPN 341
LFAI R +F+E+G YD + +WGGE E+SF+IWQCGGR+ +PCS VGH++R P+
Sbjct: 331 LFAIDRKYFYEMGSYDSNMRVWGGENVEMSFRIWQCGGRVEISPCSHVGHVFRSSTPYTF 390
Query: 342 P-GHGDFVGKNYRRVAEVWMD--EYAEHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFK 398
P G + + N R A VWMD +Y LY D ++ + LRE+L CKPF
Sbjct: 391 PGGMSEVLTDNLARAATVWMDDWQYFIMLYTSGLTLGAKDKVNVTERVALRERLQCKPFS 450
Query: 399 WFMTQI 404
W++ I
Sbjct: 451 WYLENI 456
>UniRef50_Q6WV20 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 1; n=5; Diptera|Rep:
Polypeptide N-acetylgalactosaminyltransferase 1 -
Drosophila melanogaster (Fruit fly)
Length = 601
Score = 318 bits (782), Expect = 2e-85
Identities = 197/530 (37%), Positives = 274/530 (51%), Gaps = 26/530 (4%)
Query: 39 QISSERKRVGIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCK 98
Q+ ++++VG+GEQG+ HL + + +Y N LS+++ +RS+ D R+ C
Sbjct: 80 QLDLQKQKVGLGEQGVAVHLSGAAKERGDEIYKKIALNEELSEQLTYNRSVGDHRNPLCA 139
Query: 99 KKMY-IESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKA 157
K+ + +SLP+ SVV+ F NE +S LLRT +S L E +KEI LVDD S LG
Sbjct: 140 KQRFDSDSLPTASVVIIFFNEPYSVLLRTVHSTLSTCNEKALKEIILVDDGSDNVELGAK 199
Query: 158 LDDYLEGNMP--KVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLE 215
LD Y+ +P KV ++RL R GLI ARLAGA AT DVLIFLD+H E N+ W PLL+
Sbjct: 200 LDYYVRTRIPSGKVTILRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNIGWCEPLLQ 259
Query: 216 PIALDYRTVMCPFIDVIAFDTFEYRAQDEGAR--GAFDWE--FFYKRLPVLPKDE----- 266
I +V+ P IDVI + F+Y + G F W F + LP K
Sbjct: 260 RIKESRTSVLVPIIDVIDANDFQYSTNGYKSFQVGGFQWNGHFDWINLPEREKQRQRREC 319
Query: 267 KNMPE--PFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDA 324
K E P SP MAGGLFAI R +FWE+G YD +D WGGE E+SF+IWQCGG +
Sbjct: 320 KQEREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETI 379
Query: 325 PCSRVGHIYRKFAPFPNPGHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYR-RIDPGDLA 383
PCSRVGHI+R F P+ P D G N R+A VWMDEY + RP + D GD+
Sbjct: 380 PCSRVGHIFRDFHPYKFPNDRDTHGINTARMALVWMDEYINIFFLNRPDLKFHADIGDVT 439
Query: 384 PQKRLREKLNCKPFKWFMTQIAFDLTAKYPPVEPKPFAEGKIXXXXXXXXXXXLCADAAH 443
+ LR+KL CK F+W++ I K+ P + GK+ L +
Sbjct: 440 HRVMLRKKLRCKSFEWYLKNI---YPEKFVPTKDVQ-GWGKVHAVNSNICLDDLLQN-NE 494
Query: 444 GPQLAALRLARCDVTTNAEQHFVLTWHKDIRCKTRNMCWDLPDSSPGSPIVLYACHLGGG 503
P A L C Q F T +R + +S P +++
Sbjct: 495 KPYNAG--LYPCGKVLQKSQLFSFTNTNVLRNELSCATVQHSESPPYRVVMVPCMENDEF 552
Query: 504 NQLWRYYPDTKQIKHGGHSNCVDHE--PTTLRLHISRCDAAADTQRWTLD 551
N+ WRY + + I H C+DH+ + ++ CD +++QRWT++
Sbjct: 553 NEQWRY--EHQHIIHSNTGMCLDHQGLKSLDDAQVAPCDPHSESQRWTIE 600
>UniRef50_UPI0000D564C6 Cluster: PREDICTED: similar to CG8182-PA,
isoform A; n=1; Tribolium castaneum|Rep: PREDICTED:
similar to CG8182-PA, isoform A - Tribolium castaneum
Length = 545
Score = 318 bits (781), Expect = 3e-85
Identities = 185/475 (38%), Positives = 259/475 (54%), Gaps = 22/475 (4%)
Query: 43 ERKRV-GIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKM 101
ERK + +G G A L +++ E N LSD++PL R L D R+ CK
Sbjct: 36 ERKIIPNLGHNGEAAFLEGQDAKEGEKALKKFALNTVLSDRMPLDRKLRDPRNPKCKTFT 95
Query: 102 YIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDY 161
Y L + SVVV F+NE S +LRT +SV+ ++P+ L++EI LVDDAST++ L LD Y
Sbjct: 96 YNPKLKA-SVVVIFYNELLSVILRTVWSVILQTPKELLEEIILVDDASTEESLKGLLDYY 154
Query: 162 LEGNMP--KVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIAL 219
+E + K+++I L R GLI ARL GA AT DVLIFLD+H EA +W+ PLL I
Sbjct: 155 IETRLSSTKLRLIHLKTRMGLIRARLQGARIATGDVLIFLDAHCEATTDWMEPLLSRIEQ 214
Query: 220 DYRTVMCPFIDVIAFDTFEYRAQDEGAR--GAFDWEFFYKRLPVLPKDEKNMPEPFESPV 277
+ V+ P IDVI +T Y + + G F W + + + +++K+ P +SP
Sbjct: 215 EPTAVLVPIIDVIEANTLAYSTNGDTSYQVGGFSWSGHFTWIDIQNEEDKHKLTPVKSPT 274
Query: 278 MAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFA 337
MAGGLFAI R FFWE+G YD +D WGGE E+SF+IWQCGGR+ PCSRVGHI+R F
Sbjct: 275 MAGGLFAIDRKFFWEIGSYDEQMDGWGGENLEMSFRIWQCGGRLETVPCSRVGHIFRDFH 334
Query: 338 PFPNPGHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDP--GDLAPQKRLREKLNCK 395
P+ P + D G N R+A VWMD+Y + +P +P GDL +K+LR+KL CK
Sbjct: 335 PYSFPDNKDTHGINTARLAHVWMDDYKRFFFMYQPALEN-NPVVGDLTHRKQLRQKLRCK 393
Query: 396 PFKWFMTQIAFDLTAKYPPVEPKPFAEGKIXXXXXXXXXXXLCADAAHGPQLAALRLARC 455
FKW++ + K+ P E +A G++ + GP L L +C
Sbjct: 394 SFKWYLENV---YPEKFIPDE-NVYAHGQVQNDYGMCLDDLQLGEDKIGP----LGLYQC 445
Query: 456 DVTTNAEQHFVLTWHKDIRCKTRNMCWDLPDSSPGSPIVLYACHLGGGNQLWRYY 510
Q+F L + ++R N C ++ + L CH Q W +Y
Sbjct: 446 HPYLAMSQYFSLNFKGELR--KENFC---AETFGVREVQLTECHGHKREQFWMFY 495
>UniRef50_Q16ZW8 Cluster: N-acetylgalactosaminyltransferase; n=4;
Endopterygota|Rep: N-acetylgalactosaminyltransferase -
Aedes aegypti (Yellowfever mosquito)
Length = 662
Score = 317 bits (778), Expect = 6e-85
Identities = 177/469 (37%), Positives = 262/469 (55%), Gaps = 23/469 (4%)
Query: 61 EESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHW 120
EE + + Y + FN +S+KI R +PD RH C ++ Y + LPS S+++ F+NEH
Sbjct: 151 EEQFIRDVGYRKHAFNVLVSNKIGPFRGVPDTRHKLCHEQSYDKVLPSASIIMCFYNEHL 210
Query: 121 STLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEG-NMPKVKVIRLPKRSG 179
TL+R+ S++ R+P +L+ EI LVDD S D L L+ L KV++IR +R G
Sbjct: 211 ETLVRSVTSIIRRTPSYLLHEIILVDDCSDLDDLRDNLEHELNALKNSKVRLIRNAEREG 270
Query: 180 LITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEY 239
L+ +R+ GA AT DVLIFLDSH E NV+W+ PLL+ I + + P ID+I DTF Y
Sbjct: 271 LMRSRVYGARNATGDVLIFLDSHIEVNVDWVEPLLQRIKTNKTILAMPVIDIINSDTFIY 330
Query: 240 RAQDEGARGAFDWEFFYK--RLP--VLPKDEKNMPEPFESPVMAGGLFAISRAFFWELGG 295
+ RG F+W +K LP L K E + PF+SP MAGGLFA+ R +F +LG
Sbjct: 331 SSSPL-VRGGFNWGLHFKWDNLPKGTLAK-ESDFVGPFQSPTMAGGLFAVDRQYFKDLGE 388
Query: 296 YDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKNYRRV 355
YD G+D+WGGE E+SF+ WQCGG + PCSR+GH++RK P+ +P D + +N R+
Sbjct: 389 YDMGMDVWGGENLEISFRTWQCGGSIELVPCSRIGHVFRKRRPYGSPDGSDTMIRNSLRL 448
Query: 356 AEVWMDEYAEHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQIAFDL------T 409
+ VWMD+Y ++ +P +++DPGDL + LR++LNCK F+W++ I L T
Sbjct: 449 SRVWMDDYIKYFLENQPQAKKVDPGDLTDRHDLRKRLNCKSFEWYLKNIYPQLKLPGEKT 508
Query: 410 AKYPPVEPK--PFAEGK---IXXXXXXXXXXXLCADAAHGPQLAALRLARCDVTTNAEQH 464
+PK P+ K I LC + + + V +
Sbjct: 509 TDSNVSQPKFQPWHSRKRNYISSFQIRLSNSSLCVTTESAKEKSLWKKGSHLVLHPCLRV 568
Query: 465 FVLTWHKDIRCKT---RNMCWDLPDS-SPGSPIVLYACHLGGGNQLWRY 509
W++ R + + +C + P S + GSP +L CH GG+Q W++
Sbjct: 569 KAQMWYETERAELVLGQLLCLEAPSSATKGSP-MLNKCHEMGGDQAWKH 616
>UniRef50_A7RGG9 Cluster: Predicted protein; n=3; Eumetazoa|Rep:
Predicted protein - Nematostella vectensis
Length = 353
Score = 310 bits (762), Expect = 5e-83
Identities = 148/311 (47%), Positives = 193/311 (62%), Gaps = 4/311 (1%)
Query: 97 CKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGK 156
C K Y LPS +VV+ FHNE WSTLLRT +SV+ RSP HL++EI L+DD ST D+L
Sbjct: 26 CSSKSYPSYLPSTTVVICFHNEAWSTLLRTVHSVIDRSPAHLLREILLIDDFSTHDYLKS 85
Query: 157 ALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEP 216
L Y+ + V+V+R KR GLI ARL GA A DV+ FLD+H EANV+WL PLL
Sbjct: 86 KLTAYV-AKLRNVRVLRTSKREGLIRARLIGARAAKGDVITFLDAHCEANVDWLQPLLSR 144
Query: 217 IALDYRTVMCPFIDVIAFDTFEYRAQDEGARGAFDWE--FFYKRLPVLPKDE-KNMPEPF 273
I D V P ID+I+ F Y G F W+ F + LP + E K+ P
Sbjct: 145 IHSDRTIVAVPVIDIISSTNFMYSGTPSAVIGGFSWDMQFTWHSLPNNRQSERKDRTAPI 204
Query: 274 ESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIY 333
+P MAGGLF+I R +F+E G YD G+D+WGGE E+SF+IWQCGG++ PCSRVGH++
Sbjct: 205 RTPTMAGGLFSIDRKYFFESGSYDEGMDVWGGENLEMSFRIWQCGGKLEILPCSRVGHVF 264
Query: 334 RKFAPFPNPGHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQKRLREKLN 393
R P+ PG V N RV VWMDEY +++Y +RP + + GD+ + LR KL
Sbjct: 265 RTRFPYSFPGGYSEVSVNLARVVHVWMDEYNQYVYMKRPDLQSLKYGDITSRVALRNKLK 324
Query: 394 CKPFKWFMTQI 404
CK FKW++ +
Sbjct: 325 CKSFKWYLENV 335
>UniRef50_Q8IXK2 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 12; n=41;
Eumetazoa|Rep: Polypeptide
N-acetylgalactosaminyltransferase 12 - Homo sapiens
(Human)
Length = 581
Score = 310 bits (762), Expect = 5e-83
Identities = 163/363 (44%), Positives = 219/363 (60%), Gaps = 10/363 (2%)
Query: 47 VGIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMY-IES 105
+G + + L EE L+E ++ N LSD+I L R LP+ + CK+K Y ++
Sbjct: 75 LGARGEAVRLQLQGEELRLQEESVRLHQINIYLSDRISLHRRLPERWNPLCKEKKYDYDN 134
Query: 106 LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGN 165
LP SV++ F+NE WSTLLRT YSVL SP+ L++E+ LVDD S ++ L + L + L G
Sbjct: 135 LPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLANELSG- 193
Query: 166 MPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVM 225
+PKV++IR KR GL+ ARL GA A DVL FLD H E + WL PLL+ I + V+
Sbjct: 194 LPKVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQRIHEEESAVV 253
Query: 226 CPFIDVIAFDTFEYRAQD-EGARGAFDWEFFYKRLPVLPKDEKNMPEPFE---SPVMAGG 281
CP IDVI ++TFEY E G FDW + V ++ M P + SP MAGG
Sbjct: 254 CPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRMQSPVDVIRSPTMAGG 313
Query: 282 LFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPN 341
LFA+S+ +F LG YD G+++WGGE E SF+IWQCGG + PCS VGH++ K AP+
Sbjct: 314 LFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLETHPCSHVGHVFPKQAPYSR 373
Query: 342 PGHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFKWFM 401
+ N R AEVWMDE+ E Y R P R GD+ +K+LR+KL CK FKWF+
Sbjct: 374 ----NKALANSVRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRDKLQCKDFKWFL 429
Query: 402 TQI 404
+
Sbjct: 430 ETV 432
>UniRef50_Q17M60 Cluster: N-acetylgalactosaminyltransferase; n=1;
Aedes aegypti|Rep: N-acetylgalactosaminyltransferase -
Aedes aegypti (Yellowfever mosquito)
Length = 552
Score = 310 bits (761), Expect = 7e-83
Identities = 201/543 (37%), Positives = 277/543 (51%), Gaps = 33/543 (6%)
Query: 17 AVNKENHFVPKSLKKIDWHNYEQISSERKRVGIGEQGLPAHLPKEES----DLEESLYSV 72
+V+K +H V S ++ + EQ E G+ G P +P + ++ E +
Sbjct: 24 SVDKASHAVTVSSQQ---NATEQPVKEELPAPPGDMGAPVFIPDDAPLGVREVMERQFKT 80
Query: 73 NGFNGALSDKIPLSRSLPDIRHSGCKKKMYI-ESLPSVSVVVPFHNEHWSTLLRTAYSVL 131
N S I R LPD R CK K I E LP +VV+ F NE WS L+RT YSVL
Sbjct: 81 FALNEYASALISAHRRLPDYRDPWCKVKGRIMEHLPETTVVIVFFNEPWSVLVRTVYSVL 140
Query: 132 YRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERA 191
RSP LIKE+ LVDD S L +Y PKV+++R P+R GLI ARL GA A
Sbjct: 141 DRSPPELIKEVLLVDDCSFMPHTKTQLQEYF-AKEPKVRILRSPQRLGLIKARLMGARNA 199
Query: 192 TADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGAR-GAF 250
T ++L FLD+H E WL P L+ +A + TV P ID + + A GA
Sbjct: 200 TTEILTFLDAHCECTTGWLEPQLDRVARNPTTVAIPTIDWVDEHNLAFIANRSHIYYGAC 259
Query: 251 DW--EFFYKRLPVLPKDEKNMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQY 308
DW +F ++ +N EPF +P+MAGGLF+I++ FF +G YD GL I+GGE
Sbjct: 260 DWGLQFGWRGRWDRKVKPENKLEPFPTPIMAGGLFSINKTFFAHIGWYDEGLGIYGGENV 319
Query: 309 ELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKNYRRVAEVWMDEYAEHLY 368
ELS K W CGGR+ PCSRVGHI + P+ + D+V RVAEVWMD+YA+ +Y
Sbjct: 320 ELSLKAWMCGGRLETIPCSRVGHIQKAGHPYLDGVKTDWVRVGSVRVAEVWMDQYAQVVY 379
Query: 369 RR--RPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQIAFDLTAKYPPVEPKPFAEGKIX 426
P +R + GD++ +K+LRE LNCK FKW++ +L PV GK
Sbjct: 380 DMFGGPEFRG-NFGDVSDRKKLRESLNCKSFKWYLENAFPELE---DPVS-YGVGHGKFT 434
Query: 427 XXXXXXXXXXLCADAAHGPQLAALRLARCDVTTNAEQHFVLTWHKDIRCKTRNMCWDLPD 486
A + R+ C T + QH+V +I T N+C D
Sbjct: 435 NLGVGKNFCPRYRKAGY-----TFRMEPC--TDDDYQHWVHNMLGEI--STSNVCLDY-- 483
Query: 487 SSPGSPIVLYACHLGGGNQLWRYYPDTKQIKHGGHSNCVDHEPT-TLRLHISRCDAAADT 545
G + ++ CH G GNQ WRY TKQ + + C+D P ++L +C+A ++
Sbjct: 484 --DGITLYMFECHKGQGNQKWRYSKSTKQFTNVKNKVCLDVGPAPEMKLVAEKCNATKES 541
Query: 546 QRW 548
Q+W
Sbjct: 542 QKW 544
>UniRef50_Q5DD76 Cluster: SJCHGC09400 protein; n=2; Schistosoma
japonicum|Rep: SJCHGC09400 protein - Schistosoma
japonicum (Blood fluke)
Length = 737
Score = 308 bits (757), Expect = 2e-82
Identities = 190/505 (37%), Positives = 272/505 (53%), Gaps = 31/505 (6%)
Query: 42 SERKRVGIGEQGLPAHLPKEE-SDLEESLYSV----NGFNGALSDKIPLSRSLPDIRHSG 96
S+ +G GE G+P + +E+ S E+ ++ N FN SD+I + R LPD R
Sbjct: 175 SDANSIGPGEGGIPYTVNREDISPAEQVIFDKGWKDNAFNQLASDRISVRRYLPDYREGT 234
Query: 97 CKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGK 156
CK Y +LPS S+++ FHNE WS LLR+ +SV+ RSP +L+ EI LVDD S + L +
Sbjct: 235 CKDNKYSRNLPSASIIICFHNEAWSVLLRSVHSVIDRSPSYLLHEIILVDDFSDRPHLKE 294
Query: 157 ALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEP 216
AL++Y++ + VK++R +R GLI AR+ GA +++ VL+FLDSH E WL PLL+
Sbjct: 295 ALEEYMK-MLNVVKIVRTKRREGLIRARMLGAAQSSGKVLVFLDSHIECTTGWLEPLLDR 353
Query: 217 IALDYRTVMCPFIDVIAFDTFEYRAQDEGAR--GAFDWE--FFYKRLPVLPKDEKNMP-E 271
IA + V+ P I VI T +Y G FDW F + K+ P
Sbjct: 354 IAYNSSIVVVPVITVINDKTLKYDLPSPSRVQIGGFDWSLSFIWHEQTERHKNRPGAPYS 413
Query: 272 PFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGH 331
P +SP MAGGLFAISR +F LG YDPG+++WGGE ELSFKIW CGG + CS+VGH
Sbjct: 414 PVQSPTMAGGLFAISREYFNHLGMYDPGMEVWGGENLELSFKIWMCGGSLEIVICSQVGH 473
Query: 332 IYRKFAPFP-NPGHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQKRLRE 390
I+R +P+ + D + +N R+A+VW+D+Y + Y R + +D G+++ +K LRE
Sbjct: 474 IFRDRSPYIWDVDVKDPLKRNLLRLADVWLDDY-KRFYHARIGFEMVDIGNVSERKALRE 532
Query: 391 KLNCKPFKWFMTQIAFDLTAKYPPVEPKPFAEGKIXXXXXXXXXXXLCADA---AHGPQL 447
KL C F W++T I +L V K A G I C DA +
Sbjct: 533 KLKCHSFDWYLTNIYPELF-----VPSKALASGDIESAAGPH-----CLDAPLPSENDSS 582
Query: 448 AALRLARCDVTTNAEQHFVLTWHKDIRCKTRNMCWDLPDSSPGSPIVLYACHLGGGNQLW 507
+ + R Q ++L+ +IR + C+D I LY CH GNQ +
Sbjct: 583 SVIIKTRPCHKQGGNQFWLLSSENEIR--RDDYCFD--SGIQKYSIGLYHCHGSHGNQEF 638
Query: 508 RYYPDTKQIKHGGHSNCVDHEPTTL 532
Y D I+H G VD E +++
Sbjct: 639 TYEKD-DTIRHQGLCLEVDIENSSV 662
>UniRef50_Q96FL9 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 14; n=30;
Tetrapoda|Rep: Polypeptide
N-acetylgalactosaminyltransferase 14 - Homo sapiens
(Human)
Length = 552
Score = 308 bits (755), Expect = 4e-82
Identities = 178/495 (35%), Positives = 257/495 (51%), Gaps = 33/495 (6%)
Query: 70 YSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYS 129
Y + FN S++I +R++PD RH C +Y LP S+++ FHNE STLLRT S
Sbjct: 74 YKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLLRTIRS 133
Query: 130 VLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLAGAE 189
VL R+P HLI+EI LVDD S K L +PKVK +R +R GL+ +R+ GA+
Sbjct: 134 VLNRTPTHLIREIILVDDFSNDPDDCKQLI-----KLPKVKCLRNNERQGLVRSRIRGAD 188
Query: 190 RATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGARGA 249
A L FLDSH E N +WL PLL + DY V+CP ID+I DTF Y RG
Sbjct: 189 IAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIESASELRGG 248
Query: 250 FDWEFFYKRLPVLPKDEKNM---PEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGE 306
FDW ++ + P+ + EP +P++AGGLF I +A+F LG YD +DIWGGE
Sbjct: 249 FDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGE 308
Query: 307 QYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNP-GHGDFVGKNYRRVAEVWMDEYAE 365
+E+SF++W CGG + PCSRVGH++RK P+ P G+ + KN +R AEVWMDEY +
Sbjct: 309 NFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQ 368
Query: 366 HLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQIAFDLTAKYPPVE-PKPFAEGK 424
+ Y RP G++ + LR+ L C+ FKW++ I YP + PK E
Sbjct: 369 YYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENI-------YPELSIPK---ESS 418
Query: 425 IXXXXXXXXXXXLCADAAHGPQLAALRLARCDVTTNAE---QHFVLTWHKDIRCKTRNMC 481
I L + + + L+L+ C + Q + T+ + I +C
Sbjct: 419 IQKGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQI--LQEELC 476
Query: 482 WDLPDSSPGSPIVLYACHLGGGNQLWRYYPDTKQIKHGGHSNCVDHE------PTTLRLH 535
+ PG+P+VL C G Q W I+H C+D + +
Sbjct: 477 LSVITLFPGAPVVLVLCKNGDDRQQWT--KTGSHIEHIASHLCLDTDMFGDGTENGKEIV 534
Query: 536 ISRCDAAADTQRWTL 550
++ C+++ +Q W +
Sbjct: 535 VNPCESSLMSQHWDM 549
>UniRef50_UPI0000E45D84 Cluster: PREDICTED: hypothetical protein;
n=3; Strongylocentrotus purpuratus|Rep: PREDICTED:
hypothetical protein - Strongylocentrotus purpuratus
Length = 662
Score = 307 bits (753), Expect = 6e-82
Identities = 153/305 (50%), Positives = 196/305 (64%), Gaps = 13/305 (4%)
Query: 46 RVGIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIES 105
R G GE G E + L GFN +SD+I L R++ D+R CK Y E+
Sbjct: 343 REGPGEGGAAVRTQPSEKAKVDRLIQEYGFNQYVSDQISLDRNIADLRSQQCKHWHYPET 402
Query: 106 LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGN 165
LP+ SV++ FHNE WSTLLRT +SV RSP L+ EI LVDD STK+ L + L+DY++
Sbjct: 403 LPTTSVIIVFHNEGWSTLLRTVHSVFNRSPSQLLHEIILVDDFSTKEHLKERLEDYVQEA 462
Query: 166 M--PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRT 223
K+K++R +R GLI R+ GA +T DVL++LD+H E VNWLPPLL PIA++ T
Sbjct: 463 RFNGKLKLVRNSRREGLIRTRIIGARHSTGDVLLWLDAHCEVGVNWLPPLLTPIAVNRTT 522
Query: 224 VMCPFIDVIAFDTFEYRAQDEGA----RGAFDWEFFYKRLPVLPKDEKNM----PEPFES 275
+CP IDVI D +YR +G RG FDW ++K LPV P+ EK+ EP+ S
Sbjct: 523 AVCPIIDVI--DNMDYRVYPQGTGDQDRGGFDWSLYWKHLPV-PQFEKSRRQHASEPYRS 579
Query: 276 PVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRK 335
P MAGGLFA+ R +F+ELG YD GL+IWGGE +ELSFKIW CGG +L PCSRVGH+YR
Sbjct: 580 PAMAGGLFAMDRKYFFELGAYDEGLEIWGGENFELSFKIWMCGGSLLWVPCSRVGHVYRI 639
Query: 336 FAPFP 340
P
Sbjct: 640 LGKVP 644
>UniRef50_Q9U2C4 Cluster: Probable N-acetylgalactosaminyltransferase
9; n=3; Caenorhabditis|Rep: Probable
N-acetylgalactosaminyltransferase 9 - Caenorhabditis
elegans
Length = 579
Score = 305 bits (748), Expect = 3e-81
Identities = 154/367 (41%), Positives = 218/367 (59%), Gaps = 7/367 (1%)
Query: 46 RVGIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMY-IE 104
R G GE+G P L ++++L ++ N SDKI L R +PD R CK Y
Sbjct: 72 REGPGEKGKPVVLTGKDAELGQADMKKWFMNVHASDKISLDRDVPDPRIQACKDIKYDYA 131
Query: 105 SLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEG 164
+LP SV++ F +E W+ LLRT +SV+ RSP L++E+ L+DD S + L + LD++++
Sbjct: 132 ALPKTSVIIIFTDEAWTPLLRTVHSVINRSPPELLQEVILLDDNSKRQELQEPLDEHIKR 191
Query: 165 NMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTV 224
KV++IR R GLI A+LAGA A D+++FLDSH EAN WL P+++ I+ + +
Sbjct: 192 FGGKVRLIRKHVRHGLIRAKLAGAREAVGDIIVFLDSHCEANHGWLEPIVQRISDERTAI 251
Query: 225 MCPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEP---FESPVMAGG 281
+CP ID I+ +T Y + G F W + + +++K +P SP MAGG
Sbjct: 252 VCPMIDSISDNTLAYHGDWSLSTGGFSWALHFTWEGLSEEEQKRRTKPTDYIRSPTMAGG 311
Query: 282 LFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPN 341
L A +R +F+E+GGYD +DIWGGE E+SF+ W CGG + PCS VGHI+R P+
Sbjct: 312 LLAANREYFFEVGGYDEEMDIWGGENLEISFRAWMCGGSIEFIPCSHVGHIFRAGHPYNM 371
Query: 342 PGHG---DFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFK 398
G D G N +R+AEVWMD+Y Y R R D GDL + LR++LNCKPFK
Sbjct: 372 TGRNNNKDVHGTNSKRLAEVWMDDYKRLYYMHREDLRTKDVGDLTARHELRKRLNCKPFK 431
Query: 399 WFMTQIA 405
WF+ IA
Sbjct: 432 WFLDNIA 438
>UniRef50_UPI0000586DC6 Cluster: PREDICTED: similar to polypeptide
N-acetylgalactosaminyltransferase 10; n=1;
Strongylocentrotus purpuratus|Rep: PREDICTED: similar to
polypeptide N-acetylgalactosaminyltransferase 10 -
Strongylocentrotus purpuratus
Length = 376
Score = 303 bits (745), Expect = 6e-81
Identities = 147/319 (46%), Positives = 197/319 (61%), Gaps = 16/319 (5%)
Query: 201 SHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQ-DEGARGAFDWEFFYKRL 259
++ +V+ + P E IAL+ R ++CP IDVI+ + F Y +Q + RGAFDWE +YKR+
Sbjct: 7 TYLSVSVHIIFPSTERIALNRRRIVCPMIDVISNEDFHYESQAGDVMRGAFDWELYYKRI 66
Query: 260 PVLPKDEKNMP---EPFESPVMAGGLFAISRAFFWE-LGGYDPGLDIWGGEQYELSFKIW 315
P+ + K +PF +P+MAGGLFA+ R +F E LGGYD GL+IWGGEQY+LSFK+W
Sbjct: 67 PISEAENKRRSHESDPFRTPIMAGGLFAVDRKYFMEELGGYDEGLEIWGGEQYDLSFKVW 126
Query: 316 QCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYR 375
CGG M + PCSRVGHIYRKF + PG + KN RV EVWMDE+ ++ Y RRP+ +
Sbjct: 127 MCGGEMEEIPCSRVGHIYRKFMSYTVPGGAGVINKNLLRVVEVWMDEWGKYFYERRPYLK 186
Query: 376 RIDPGDLAPQKRLREKLNCKPFKWFMTQIAFDLTAKYPPVEPKPFAEGKIXXXXXXXXXX 435
D GD++ Q LRE+L CK F WF+T++A D+ YPPVEP+ A+G I
Sbjct: 187 GQDYGDISKQLALRERLQCKNFTWFLTEVAPDILQYYPPVEPEGGAKGHITHTSTGK--- 243
Query: 436 XLCADAAHGPQLAALRLARCD---VTTNAEQHFVLTWHKDIR--CKTRNMCWDLPDSSPG 490
C + G + LR+ C+ + Q + LTWH D R K+R C D P G
Sbjct: 244 --CLTLSQGGK-DELRVQECNPRSMKQGGSQFWELTWHDDFRPSSKSRKQCVDFPYGREG 300
Query: 491 SPIVLYACHLGGGNQLWRY 509
+ +LY CH GGGNQLW Y
Sbjct: 301 AEPILYPCHHGGGNQLWVY 319
>UniRef50_A0NGH9 Cluster: ENSANGP00000031751; n=1; Anopheles gambiae
str. PEST|Rep: ENSANGP00000031751 - Anopheles gambiae
str. PEST
Length = 499
Score = 298 bits (731), Expect = 3e-79
Identities = 185/516 (35%), Positives = 274/516 (53%), Gaps = 29/516 (5%)
Query: 50 GEQGLPA--HLPKEE-SDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKK-MYIES 105
G+ G P +L E+ + L + GFN SD + + R LP+IR C K ++
Sbjct: 1 GDMGRPVTVNLTSEQIAALTQQGIQTQGFNQYFSDLMSVRRRLPEIRDPWCAKPGRFLAD 60
Query: 106 LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDD-YLEG 164
LP+ S+V+ F NE WS +LRT +SVL RSP HL+KEI LVDD ST G + Y
Sbjct: 61 LPATSIVIVFFNEAWSVVLRTVHSVLDRSPAHLVKEIVLVDDCSTLGEDGLNIFMLYKFL 120
Query: 165 NMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTV 224
M + +++R PKR GLI AR+ G + D++ FLD+H E V WL L++P+ + T+
Sbjct: 121 GMARFRIVRAPKRLGLIRARMLGGKSTKTDLITFLDAHVEVTVGWLEALIQPVVESWTTI 180
Query: 225 MCPFIDVIAFDTFEYRAQDEGAR--GAFDWE--FFYKRLPVLPKDEKNMPEPFESPVMAG 280
P ID I + +YR D+ GA+DW+ F + K N EPF++P MAG
Sbjct: 181 AIPTIDWIDENNMKYR-DDKAPTFVGAYDWDLNFGWWGRWSQKKQNANKMEPFDTPAMAG 239
Query: 281 GLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFP 340
GLFAI+R FF +G YD G DI+G E ELS K W CGG+M+ PCSRVGHI + P+
Sbjct: 240 GLFAINRTFFERIGWYDDGFDIYGIENIELSVKSWMCGGKMVTVPCSRVGHIQKTGHPYL 299
Query: 341 NPGHGDFVGKNYRRVAEVWMDEYAEHLY--RRRPHYRRIDPGDLAPQKRLREKLNCKPFK 398
D V N R+AEVWMDEY ++ PHY + G +A +K +RE CKPF
Sbjct: 300 YKQPKDVVRANSIRLAEVWMDEYKRIIFDIYGIPHYLEEEFGSVATRKAIRESAKCKPFS 359
Query: 399 WFMTQIAFDLTAKYPPVEPKPFAEGKIXXXXXXXXXXXLCADAAHGPQLAALRLARCDVT 458
+++ + AF + P+ P F G++ C + + L +A CD
Sbjct: 360 YYL-ENAF--PEMHNPLVPGAF-RGEVHNMALGNGS---C--LTYRTRDRFLGMAPCD-H 409
Query: 459 TNAEQHFVLTWHKDIRCKTRNMCWDLPDSSPGSPIVLYACHLGGGNQLWRYYPDTKQIKH 518
+Q++ +++++ RN C D + G+ + +Y CH GNQ W+ +++QI
Sbjct: 410 LEKDQYWTHNYYQELN-SYRN-CID----AVGTVVEVYQCHRSRGNQAWKVLVESQQILS 463
Query: 519 GGHSNCVD-HEPTTLRLHISRCDAAADTQRWTLDTV 553
+ C+ + T L + +CDA +Q+W + +
Sbjct: 464 VARNLCLALNLQTKTTLLLEKCDATKPSQQWNVSYI 499
>UniRef50_Q7K755 Cluster: Putative polypeptide
N-acetylgalactosaminyltransferase 11; n=3;
Caenorhabditis|Rep: Putative polypeptide
N-acetylgalactosaminyltransferase 11 - Caenorhabditis
elegans
Length = 605
Score = 297 bits (730), Expect = 4e-79
Identities = 187/512 (36%), Positives = 264/512 (51%), Gaps = 28/512 (5%)
Query: 10 APPDLNKAVNKENHFVPKSLKKIDWHNYEQISSERKRVGIGEQGLPAHLPKEESDLEESL 69
A D AV K+N P K ++ K + I L K E DL+
Sbjct: 67 AEKDEEDAVEKQNIAAPPLPK-----SFTTFPDRSKEIEIDTDLLGKINGKAEDDLQVEG 121
Query: 70 YSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYS 129
Y FNG LSD+I R + D R++ C Y +SLP+ S+VV + NE S L+R S
Sbjct: 122 YKKYQFNGLLSDRIGSRRKIKDSRNARCSSLTYSDSLPAASIVVCYFNESPSVLIRMVNS 181
Query: 130 VLYRS-PEHLIKEIFLVDDASTKDFLGKALDDYLEGNM---PKVKVIRLPKRSGLITARL 185
+ R+ PEHL EI LVDD+S Y E ++ KVK ++ K GLI A++
Sbjct: 182 IFDRTKPEHL-HEILLVDDSSEWSNATDEAIKYREKHIIQWEKVKFLKTDKNEGLIRAKI 240
Query: 186 AGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEG 245
GA RA +VL+FLDSH E N WLPPLL+ I + R V+CP ID+I T +Y +
Sbjct: 241 FGARRANGEVLVFLDSHCEVNEEWLPPLLDQIKQNRRRVVCPIIDIIDAITMKY-VESPV 299
Query: 246 ARGAFDWEFFYK-RLPVLP--KDEKNMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDI 302
G +W +K P +D N P +SP MAGGLFAI + +F+E+G YD G+D+
Sbjct: 300 CTGGVNWAMTFKWDYPHRSYFEDPMNYVNPLKSPTMAGGLFAIDKEYFFEIGSYDEGMDV 359
Query: 303 WGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKNYRRVAEVWMDE 362
WG E E+S +IW CGG +L PCSRVGHI+R+ P+ D +GKN R+A VW+DE
Sbjct: 360 WGAENVEISVRIWTCGGELLIMPCSRVGHIFRRQRPYGI--KTDSMGKNSVRLARVWLDE 417
Query: 363 YAEHLYRRRPHYRRI-DPGDLAPQKRLREKLNCKPFKWFMTQIAFDLTAKYPP--VEPKP 419
Y E+ + RP+YR D GDL + LR L CKPFKW++ I +L P + +
Sbjct: 418 YLENFFEARPNYRTFTDYGDLTSRISLRRNLQCKPFKWYLENIYPELLPDNTPNQLNNQI 477
Query: 420 FAEGKIXXXXXXXXXXXLCADAAHGPQLA---ALRLARCDVTTNAEQHFVLTWHKDIRCK 476
GK L A+ + G ++A + + +C+ +Q + ++
Sbjct: 478 LVAGKKYLIKMANGTHCLSAENSQG-RIANGNRVEMRKCNHMERMQQWKYSSTNELRPMG 536
Query: 477 TRNMCWDLPDSSPGSPIVLYACHLGGGNQLWR 508
+ MC DS G ++L CH G +Q+W+
Sbjct: 537 SSRMCL---DSLRGISVIL--CHNQGAHQMWQ 563
>UniRef50_Q5TWJ3 Cluster: ENSANGP00000028412; n=1; Anopheles gambiae
str. PEST|Rep: ENSANGP00000028412 - Anopheles gambiae
str. PEST
Length = 523
Score = 297 bits (728), Expect = 7e-79
Identities = 161/380 (42%), Positives = 224/380 (58%), Gaps = 26/380 (6%)
Query: 50 GEQGLPAHLPKEESDLEESL-------YSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMY 102
G G P LP+E+ ++ +S+ + G+N +SD I + R LPD+R C+ +
Sbjct: 2 GHMGEPVTLPQEQGNIPDSVQEQIALGWQRQGYNQFVSDLISVRRELPDVRDPWCRDRKR 61
Query: 103 IESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYL 162
+LP VS+V+ FH+E S LLRT +SVL R+P L++EI L+DD S+ L LDDY
Sbjct: 62 -SALPPVSIVIVFHDEALSVLLRTVHSVLNRTPPELVQEILLIDDWSSLVQLKTFLDDYF 120
Query: 163 EGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDY- 221
KV+++R PKR GLIT R+ GA+RA+AD L+FLD+H E WL PLLE +A +
Sbjct: 121 LPYSDKVRILRTPKRLGLITGRIFGAKRASADYLLFLDAHCECLAGWLEPLLELVASNQE 180
Query: 222 --RTVMCPFIDVIAFDTFEYR-AQDEGARGAFDWEFFYK---RLPVLPKDEKNMPEPFES 275
+ V P ID + T + G GAFDW ++ R L ++N+ EPF++
Sbjct: 181 NRKVVAVPTIDWLNETTLALQVGASSGLYGAFDWNLSFQWRPRYDRLQAPQENLLEPFDT 240
Query: 276 PVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRK 335
PVMAGGLF I +AFF +LG YDPGL ++GGE ELSFK+W CGG + PCS V HI ++
Sbjct: 241 PVMAGGLFCIEKAFFAQLGWYDPGLQVYGGENMELSFKVWMCGGAIRTVPCSHVAHIQKR 300
Query: 336 FAPF--PNPGHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYR---------RIDPGDLAP 384
P+ D +N RVAEVWMDEYAE LYR P YR + +L
Sbjct: 301 NNPYIGSYTKERDLTMRNSLRVAEVWMDEYAEFLYRLHPDYRALLASRTSHSLSNVNLDA 360
Query: 385 QKRLREKLNCKPFKWFMTQI 404
+++LR +L CK F+W++ +
Sbjct: 361 RRQLRSELGCKSFRWYLQHV 380
>UniRef50_Q16ZA7 Cluster: N-acetylgalactosaminyltransferase; n=7;
Culicidae|Rep: N-acetylgalactosaminyltransferase - Aedes
aegypti (Yellowfever mosquito)
Length = 648
Score = 295 bits (724), Expect = 2e-78
Identities = 178/496 (35%), Positives = 258/496 (52%), Gaps = 32/496 (6%)
Query: 65 LEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKK-MYIESLPSVSVVVPFHNEHWSTL 123
L E Y+ GFN LSD I + R LPD R S CK+ Y+++LP S+V+ F+NE WS L
Sbjct: 161 LVEEGYNDQGFNQVLSDLISVRRRLPDYRDSWCKQPGRYLKNLPDTSIVIVFYNEAWSVL 220
Query: 124 LRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITA 183
+RT +S+L RSP +L++EI LVDD S LD++ + + PKV+++R R GLI A
Sbjct: 221 VRTVHSILDRSPPNLVREIVLVDDFSFLPHTKTQLDEHFQ-SYPKVRILRAASRLGLIRA 279
Query: 184 RLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQ- 242
++ GA TA ++ FLD+H E V WL PLL +A + + P +D I DT Q
Sbjct: 280 KMLGAWNTTAQIITFLDAHVECEVGWLEPLLNQVARNPTAIAIPSMDWIDGDTMTLDPQV 339
Query: 243 DEGARGAFDWEFFYK---RLPVLPKDEKNMPEPFESPVMAGGLFAISRAFFWELGGYDPG 299
+ G FDW ++ R K+ EPF+SPVM GGLFAI+R F LG YD
Sbjct: 340 SQLIYGKFDWMGNFQWGLRRDRRQPQAKHPMEPFDSPVMPGGLFAINRTLFAHLGWYDEQ 399
Query: 300 LDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPF--PNPGHGDFVGKNYRRVAE 357
+ +G E ELSFK W CGG M PCSRV H+ + P+ G D + +N R+AE
Sbjct: 400 FETYGAEHLELSFKTWMCGGSMQIVPCSRVAHVQKPNHPYITKTSGSEDVIKRNLVRMAE 459
Query: 358 VWMDEYAEHLYRR-RPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQIAFDLTAKYPPVE 416
VWMDEYA + Y +R D GD++ +K+LR+ LNCK F+W++ + + V
Sbjct: 460 VWMDEYALYYYETFGGPDKRGDFGDVSSRKQLRQHLNCKSFRWYLENVFPEQFDPSRAVG 519
Query: 417 PKPFAEGKIXXXXXXXXXXXLCADAAHGPQLAALRLARCDVTT--NAEQHFVLTWHKDIR 474
F G+ C D L +C VT+ +H + + ++
Sbjct: 520 RGEFRNGE--------NGTDRCLD-------WPLARNQCGVTSCHGRGRHQMWYFTREGE 564
Query: 475 CKTRNMCWDLPDSSPGSPIVLYACHLGGGNQLWRYYPDTKQIKHGGHSNCVDHEPTTLRL 534
++ C D G + + CH GGNQLW Y T+Q +H C++ +L
Sbjct: 565 ITRKDHCLDY----DGKTLEMNRCHQMGGNQLWEYAEKTQQFRHFLSKKCLEFSEG--KL 618
Query: 535 HISRCDAAADTQRWTL 550
++ +C + Q+W +
Sbjct: 619 NMKKCMKSGSGQKWII 634
>UniRef50_UPI000069E576 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 5 (EC 2.4.1.41)
(Protein-UDP acetylgalactosaminyltransferase 5) (UDP-
GalNAc:polypeptide N-acetylgalactosaminyltransferase 5)
(Polypeptide GalNAc transferase 5) (GalNAc-T5)
(pp-GaNTase 5).; n=1; Xenopus tropicalis|Rep:
Polypeptide N-acetylgalactosaminyltransferase 5 (EC
2.4.1.41) (Protein-UDP acetylgalactosaminyltransferase
5) (UDP- GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5) (Polypeptide GalNAc
transferase 5) (GalNAc-T5) (pp-GaNTase 5). - Xenopus
tropicalis
Length = 307
Score = 293 bits (720), Expect = 6e-78
Identities = 153/316 (48%), Positives = 201/316 (63%), Gaps = 23/316 (7%)
Query: 97 CKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGK 156
C ++ + LP+ S+++ F +E WSTL+R+ YSVL RSPEHLIKEI LVDD ST+D+L +
Sbjct: 1 CSNQLIHDDLPTTSIIICFIDEVWSTLMRSVYSVLNRSPEHLIKEIILVDDFSTRDYLKE 60
Query: 157 ALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEP 216
LD Y++ +PKV+V+ LP+R GLI AR+AGA AT +VL FLDSH E NV WL PLLE
Sbjct: 61 KLDTYVK-KLPKVRVLHLPERHGLIRARIAGANIATGEVLTFLDSHVECNVGWLEPLLEQ 119
Query: 217 IALDYRTVMCPFIDVI-AFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEPFES 275
+ ++ R V CP I+VI A D R + +F+ N+ F
Sbjct: 120 VRINRRKVACPVIEVISALDLSVSRFISQAT-------YFH-----------NVCFCFRC 161
Query: 276 PVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRK 335
PVMAGGLF+I + +F+ELG YDPGLD+WGGE E+SFKIW CGG + PCSRVGHI+R
Sbjct: 162 PVMAGGLFSIEKNYFYELGTYDPGLDVWGGENMEISFKIWMCGGEIEIIPCSRVGHIFRN 221
Query: 336 FAPFPNP-GHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDP--GDLAPQKRLREKL 392
P+ P V +N RVAEVW+D+Y E Y H + P GDL QK+LRE+L
Sbjct: 222 DNPYSFPKDRIKTVERNLVRVAEVWLDDYKEIFYGHGQHLLKYLPNIGDLTEQKQLRERL 281
Query: 393 NCKPFKWFMTQIAFDL 408
CK F W++ + D+
Sbjct: 282 QCKNFNWYIKNVFPDM 297
>UniRef50_UPI0000E4710F Cluster: PREDICTED: similar to
pp-GalNAc-transferase 17; n=2; Strongylocentrotus
purpuratus|Rep: PREDICTED: similar to
pp-GalNAc-transferase 17 - Strongylocentrotus purpuratus
Length = 315
Score = 291 bits (713), Expect = 4e-77
Identities = 141/271 (52%), Positives = 178/271 (65%), Gaps = 7/271 (2%)
Query: 154 LGKALDDYLEGNM--PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLP 211
LG L Y++ KVK+ R KR GLI +R+ GAE++ VL FLD+H E + NWL
Sbjct: 21 LGARLQQYMDVPQLQGKVKLYRNSKREGLIRSRIFGAEQSRGQVLTFLDAHCECSPNWLV 80
Query: 212 PLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEG-ARGAFDWEFFYKRLPV-LPKDE--- 266
PLL IAL+ TV+CP +D I+ D FEYR+Q +G RGA DW+F+YKR+PV L +
Sbjct: 81 PLLTEIALNRTTVVCPTVDSISADNFEYRSQGDGLCRGAMDWDFWYKRIPVDLSRQRLGL 140
Query: 267 KNMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPC 326
K EP++SP+MAGGLFA+ R FF+ELGGYDPGL IWGGE +E+SFK W CGG + PC
Sbjct: 141 KYQSEPYDSPMMAGGLFALDREFFFELGGYDPGLQIWGGENFEISFKAWMCGGSLKFVPC 200
Query: 327 SRVGHIYRKFAPFPNPGHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQK 386
SRVGH+YRK P+ P G NY RVAEVW+DE+ E Y RP R GD+ Q
Sbjct: 201 SRVGHVYRKGVPYTYPDSGVPGVSNYMRVAEVWLDEFKEFFYTSRPDLRGKPYGDIGEQI 260
Query: 387 RLREKLNCKPFKWFMTQIAFDLTAKYPPVEP 417
R R+ K FKWFM ++AFD K+PP +P
Sbjct: 261 RFRKHHCPKSFKWFMEEVAFDSLEKFPPPQP 291
>UniRef50_Q16SH9 Cluster: N-acetylgalactosaminyltransferase; n=2;
Culicidae|Rep: N-acetylgalactosaminyltransferase - Aedes
aegypti (Yellowfever mosquito)
Length = 569
Score = 288 bits (707), Expect = 2e-76
Identities = 146/345 (42%), Positives = 207/345 (60%), Gaps = 10/345 (2%)
Query: 70 YSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLP-SVSVVVPFHNEHWSTLLRTAY 128
Y + FN LSD+I + R+LPD RH+ CK + ++ S + SV++ FHNE STLLRT
Sbjct: 79 YQRHYFNLELSDRIGVDRTLPDTRHANCKTREFLTSPGRTTSVIITFHNEATSTLLRTIG 138
Query: 129 SVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLAGA 188
SVL ++P L++EI ++DD ST L LD +L G +P V+ R R GLI +R G
Sbjct: 139 SVLKQTPPELLQEIIVIDDCSTS--LEHNLD-FL-GRIPLVRFHRNFVREGLIRSRNIGV 194
Query: 189 ERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGARG 248
A+ D ++FLDSH E N WL PL++ + +D V+ P ID+I D+FEYR RG
Sbjct: 195 AYASGDFVLFLDSHCEVNRGWLEPLVDRLTVDSTAVLSPIIDIIDADSFEYRPNSARLRG 254
Query: 249 AFDWEFFYKRLPVLPKD--EKNMPE--PFESPVMAGGLFAISRAFFWELGGYDPGLDIWG 304
FDW ++ LPV ++ +N E PF SP ++GG+F +S+ F +LGG+D GL+IWG
Sbjct: 255 GFDWSLRFRWLPVAEEELEHRNHDESQPFYSPAISGGVFIVSKTLFQQLGGFDGGLEIWG 314
Query: 305 GEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNP-GHGDFVGKNYRRVAEVWMDEY 363
GE E S K W CG + PCSR+GH++R+ P+ P G +N +R+A VWMDE+
Sbjct: 315 GESLEFSLKAWLCGAHVEVVPCSRIGHVFRRKHPYGFPQGSAATYLRNTKRIASVWMDEF 374
Query: 364 AEHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQIAFDL 408
Y+ RP + G L K L+ +LNC+ F W+M + DL
Sbjct: 375 QNFFYKTRPEASALSVGSLQQMKDLKRRLNCRKFSWYMQNVFLDL 419
>UniRef50_Q14435 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 3; n=53;
Euteleostomi|Rep: Polypeptide
N-acetylgalactosaminyltransferase 3 - Homo sapiens
(Human)
Length = 633
Score = 288 bits (706), Expect = 3e-76
Identities = 158/362 (43%), Positives = 217/362 (59%), Gaps = 15/362 (4%)
Query: 57 HLPKEESDLEESLYSVNGFNGALSDKIPLSRSL-PDIRHSGCKKKMY--IESLPSVSVVV 113
+L EE +E + + FN SD+I L R L PD R C ++ + LP+ SV++
Sbjct: 132 NLSVEEQKEKERGEAKHCFNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVII 191
Query: 114 PFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKVKVIR 173
FHNE WSTLLRT +SVLY SP L+KEI LVDDAS ++L LD+Y++ VK++R
Sbjct: 192 VFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDEYLHDKLDEYVK-QFSIVKIVR 250
Query: 174 LPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIA 233
+R GLITARL GA ATA+ L FLD+H E WL PLL IA +Y V+ P I I
Sbjct: 251 QRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVVSPDIASID 310
Query: 234 FDTFEY-RAQDEGA---RGAFDWE--FFYKRLPVLPKDE-KNMPEPFESPVMAGGLFAIS 286
+TFE+ + G+ RG FDW F ++ LP K K+ P ++P AGGLF+IS
Sbjct: 311 LNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQRRKDETYPIKTPTFAGGLFSIS 370
Query: 287 RAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGD 346
+ +F +G YD ++IWGGE E+SF++WQCGG++ PCS VGH++R +P P
Sbjct: 371 KEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQ 430
Query: 347 FVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDP----GDLAPQKRLREKLNCKPFKWFMT 402
+ +N R+AEVWMDEY E YRR +I GDL+ + ++ +L CK F W++
Sbjct: 431 VIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLRCKNFTWYLN 490
Query: 403 QI 404
I
Sbjct: 491 NI 492
>UniRef50_Q8I136 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 4; n=3;
Caenorhabditis|Rep: Polypeptide
N-acetylgalactosaminyltransferase 4 - Caenorhabditis
elegans
Length = 589
Score = 287 bits (704), Expect = 5e-76
Identities = 164/438 (37%), Positives = 228/438 (52%), Gaps = 22/438 (5%)
Query: 67 ESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIE-SLPSVSVVVPFHNEHWSTLLR 125
E Y N FN SD + +R +PD R C+ Y + + +V++ +HNE S+LLR
Sbjct: 110 EDKYKANSFNQEASDALNPTRKIPDSREPQCRDVDYSKVGMQPTTVIITYHNEARSSLLR 169
Query: 126 TAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARL 185
T +SV +SPE L+ EI LVDD S +GK L + ++ V+R +R GLI +R+
Sbjct: 170 TVFSVFNQSPEELLLEIVLVDDNSQDVEIGKEL-----AQIQRITVLRNNQREGLIRSRV 224
Query: 186 AGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEG 245
GA+ A A VL FLDSH E N WL PLL IA + + V+ P IDVI D F Y
Sbjct: 225 KGAQVARAPVLTFLDSHIECNQKWLEPLLARIAENPKAVVAPIIDVINVDNFNYVGASAD 284
Query: 246 ARGAFDWEFFYK---RLPVLPKDEKNMPE-PFESPVMAGGLFAISRAFFWELGGYDPGLD 301
RG FDW ++ L K+ P P SP MAGGLFAIS+ +F ELG YD ++
Sbjct: 285 LRGGFDWTLVFRWEFMNEQLRKERHAHPTAPIRSPTMAGGLFAISKEWFNELGTYDLDME 344
Query: 302 IWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNP-GHGDFVGKNYRRVAEVWM 360
+WGGE E+SF++WQCGG + PCSRVGH++RK P+ P G G+ KN RR AEVWM
Sbjct: 345 VWGGENLEMSFRVWQCGGSLEIMPCSRVGHVFRKKHPYTFPGGSGNVFQKNTRRAAEVWM 404
Query: 361 DEYAEHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQIAFDLTAKYPPVEPKPF 420
DEY + P R ++ GD+ + +R++L CK FKW++ + YP +E
Sbjct: 405 DEYKAIYLKNVPSARFVNFGDITDRLAIRDRLQCKSFKWYLENV-------YPQLEIPRK 457
Query: 421 AEGKIXXXXXXXXXXXLCADAAHGPQLAALRLARCDVTTNAEQHFVLTWHKDIRCKTRNM 480
GK LC D+ + A L C T ++ K + +
Sbjct: 458 TPGK----SFQMKIGNLCLDSMARKESEAPGLFGCHGTGGNQEWVFDQLTKTFKNAISQL 513
Query: 481 CWDLPDSSPGSPIVLYAC 498
C D ++ + + C
Sbjct: 514 CLDFSSNTENKTVTMVKC 531
Score = 36.3 bits (80), Expect = 2.2
Identities = 19/61 (31%), Positives = 25/61 (40%)
Query: 471 KDIRCKTRNMCWDLPDSSPGSPIVLYACHLGGGNQLWRYYPDTKQIKHGGHSNCVDHEPT 530
K + K N+C D L+ CH GGNQ W + TK K+ C+D
Sbjct: 461 KSFQMKIGNLCLDSMARKESEAPGLFGCHGTGGNQEWVFDQLTKTFKNAISQLCLDFSSN 520
Query: 531 T 531
T
Sbjct: 521 T 521
>UniRef50_Q6V2D0 Cluster: UDP-N-acetyl-D-galactosamine:polypeptide
N- acetylgalactosaminyltransferase; n=1; Echinococcus
granulosus|Rep: UDP-N-acetyl-D-galactosamine:polypeptide
N- acetylgalactosaminyltransferase - Echinococcus
granulosus
Length = 659
Score = 286 bits (702), Expect = 1e-75
Identities = 157/379 (41%), Positives = 224/379 (59%), Gaps = 17/379 (4%)
Query: 42 SERKRVGIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKM 101
SER R G GE G + + +++ + +G+N + L RSL C +
Sbjct: 89 SERFRSGPGELGRGVDVRISDEEMKR-VNDADGYNSHACKLVALDRSLGHRPAKECLAVI 147
Query: 102 YIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDY 161
Y + LP+ SV++ F NE + ++RT +SV+ RSP L+KE+ L+DD ST+ L LD +
Sbjct: 148 YPDKLPTASVILIFFNEPFRLIIRTVFSVVNRSPPALLKEVILLDDGSTQSDLLDNLDKF 207
Query: 162 LEGNMPK--VKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIAL 219
+ P V+++RLP+R+GLI ARL GA+ ATADV+IFLD+H EA WL PLL I
Sbjct: 208 VANTWPDGIVRIVRLPQRTGLIRARLEGAKAATADVIIFLDAHCEATYRWLEPLLYRIWQ 267
Query: 220 DYRTVMCPFI---DVIAFDTF--EYRAQDEG----ARGAFDWE--FFYKRLP--VLPKDE 266
V+CP I D F + R ++G G+F W+ F ++ P + K +
Sbjct: 268 KPDAVVCPAIANIDRFTLKIFRTDVRYTEDGWLSLRVGSFAWDGMFVFEHPPRSSVVKRQ 327
Query: 267 KNMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPC 326
N+ + ES M GGLFAI R +F++LGGYD G++IWGGE ELSF+IWQCGG + +PC
Sbjct: 328 SNV-DTIESLTMPGGLFAIHRDYFFKLGGYDDGMEIWGGENLELSFRIWQCGGSLEFSPC 386
Query: 327 SRVGHIYRKFAPFPNPGHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQK 386
S VGH+YR P+ P D+ G N R+AEVWMD Y E+ Y R + +D GD++ +K
Sbjct: 387 STVGHVYRAIHPYSFPSKKDYNGYNTARMAEVWMDMYKENFYLARGDIKNMDYGDVSKRK 446
Query: 387 RLREKLNCKPFKWFMTQIA 405
+LR L C+ F+WF+ IA
Sbjct: 447 KLRNDLGCRNFQWFLDNIA 465
>UniRef50_Q8MVS5 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 35A (EC 2.4.1.41)
(Protein-UDP acetylgalactosaminyltransferase 35A) (UDP-
GalNAc:polypeptide N-acetylgalactosaminyltransferase
35A) (pp-GaNTase 35A) (Protein l(2)35Aa); n=2;
Sophophora|Rep: Polypeptide
N-acetylgalactosaminyltransferase 35A (EC 2.4.1.41)
(Protein-UDP acetylgalactosaminyltransferase 35A) (UDP-
GalNAc:polypeptide N-acetylgalactosaminyltransferase
35A) (pp-GaNTase 35A) (Protein l(2)35Aa) - Drosophila
melanogaster (Fruit fly)
Length = 632
Score = 286 bits (702), Expect = 1e-75
Identities = 146/353 (41%), Positives = 210/353 (59%), Gaps = 10/353 (2%)
Query: 60 KEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKK--MYIESLPSVSVVVPFHN 117
K++ + + Y + FN +S+ I L R++PD RH C ++ E+LP S+V+ F+N
Sbjct: 99 KQDKYIRDIGYKHHAFNALVSNNIGLFRAIPDTRHKVCDRQETTEAENLPQASIVMCFYN 158
Query: 118 EHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP--KVKVIRLP 175
EH TL+R+ +VL R+P +L++EI LVDD S L L L + ++ I+
Sbjct: 159 EHKMTLMRSIKTVLERTPSYLLREIILVDDHSDLPELEFHLHGDLRARLKYDNLRYIKNE 218
Query: 176 KRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFD 235
+R GLI +R+ GA A DVL+FLDSH E N WL PLL I + T+ P ID+I D
Sbjct: 219 QREGLIRSRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSENATLAVPVIDLINAD 278
Query: 236 TFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPE----PFESPVMAGGLFAISRAFFW 291
TFEY RG F+W + R LP+ +PE PF SP MAGGLFA++R +F
Sbjct: 279 TFEYTPSPL-VRGGFNWGLHF-RWENLPEGTLKVPEDFRGPFRSPTMAGGLFAVNRKYFQ 336
Query: 292 ELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKN 351
LG YD +DIWGGE E+SF+ WQCGG + PCSRVGHI+RK P+ +P + + KN
Sbjct: 337 HLGEYDMAMDIWGGENIEISFRAWQCGGAIKIVPCSRVGHIFRKRRPYTSPDGANTMLKN 396
Query: 352 YRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQI 404
R+A VWMD+Y ++ + + D GD++ + +LRE+L C+ F W++ +
Sbjct: 397 SLRLAHVWMDQYKDYYLKHEKVPKTYDYGDISDRLKLRERLQCRDFAWYLKNV 449
>UniRef50_Q8IA41 Cluster: Putative polypeptide
N-acetylgalactosaminyltransferase 11; n=2; Drosophila
melanogaster|Rep: Putative polypeptide
N-acetylgalactosaminyltransferase 11 - Drosophila
melanogaster (Fruit fly)
Length = 557
Score = 284 bits (696), Expect = 5e-75
Identities = 181/517 (35%), Positives = 263/517 (50%), Gaps = 37/517 (7%)
Query: 49 IGEQGLPAHLPKEESDLEESLYSVNGF--NGALSDKIPLSRSLPDIRHSGCKKKMYI-ES 105
+GE G AHL E+DL ++ + N LS++IPL R+L D R C K Y E
Sbjct: 49 LGELGKEAHLQMTETDLVDAQLQNEKYQYNAWLSERIPLKRTLEDYRDPQCLKINYSSEK 108
Query: 106 LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGN 165
+VS+V+ EH TLLR YSV+ ++ +L+KEI LV D L + ++
Sbjct: 109 TVTVSIVIAIQQEHPHTLLRGIYSVITQTSPYLLKEIVLVHDGHPDLDLIR----HIHHK 164
Query: 166 MPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVM 225
+P V + + G+I ARL GA AT D+L+FL+ H E WLPPLLEPI L+ +TV
Sbjct: 165 LPIVIQLEMESSKGIIHARLTGAGVATGDILVFLNGHMEVTRGWLPPLLEPILLNNQTVT 224
Query: 226 CPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEPFESPVMAGGLFAI 285
P +D I+ ++F YR E + AFDW+ + LP+ ++P+P+ S + G +FAI
Sbjct: 225 EPIVDAISRESFAYRKLVEPEQLAFDWQLDHIFLPLDQHSWNSLPKPYPSSQLEGRVFAI 284
Query: 286 SRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRK------FAPF 339
R +FW LGG+D GL +GG+ ELS K+WQCGG +L PCSRVG IY++ AP
Sbjct: 285 DRKWFWHLGGWDEGLRDYGGDALELSLKVWQCGGLILAVPCSRVGIIYKRDELEAQMAPN 344
Query: 340 PNPGHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFKW 399
NP V KN++RV +VW+DEY H YR P R + L + LR +LNCK F+W
Sbjct: 345 RNPSLQ--VQKNFKRVVDVWLDEYKLHFYRYNPKLRNLTAESLDKPRDLRRRLNCKSFEW 402
Query: 400 FMTQIAFDLTAKYPPVEPKPFAEGKIXXXXXXXXXXXLCADAAHGPQLAALRLARCDVTT 459
+ +Q+A + + + GKI C G + + +C +T
Sbjct: 403 YRSQVAPQIRNHFLHAGLTNYPIGKI----MPFVAPHFCLSIKGGFPV----IRKCH-ST 453
Query: 460 NAEQHFVLTWHKDIRCKTR--NMCWDLPDSSPGSPIVLYACHLGGGNQLWRY-YPDTKQI 516
N E W RC+ + NMC D+ + + C W Y Y + +
Sbjct: 454 NFED-----WTLTSRCQLKHGNMCLDVDYK---NNVRATKCTKKLSKNPWHYNYQHSSFV 505
Query: 517 KHGGHSNCVDHEPTTLRLHISRCDAAADTQRWTLDTV 553
+G + C+ + + L +S CD+ QRW V
Sbjct: 506 SNG--NKCLQIDVNKVGLILSACDSDVTEQRWMFTKV 540
>UniRef50_Q9D4M9 Cluster: Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5; n=9;
Eutheria|Rep: Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5 - Mus
musculus (Mouse)
Length = 431
Score = 282 bits (692), Expect = 2e-74
Identities = 146/335 (43%), Positives = 205/335 (61%), Gaps = 7/335 (2%)
Query: 74 GFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYR 133
G N +S ++ + R +PD R C++K Y +LP+ S+++ F+NE ++TLLR SV+
Sbjct: 82 GLNAIMSRRLGIEREVPDSRDKICQQKHYPFNLPTASIIICFYNEEFNTLLRAVSSVVNL 141
Query: 134 SPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATA 193
SP+HL++EI LVDD S D L LD YLE KVK+IR KR GLI +++ GA RA+
Sbjct: 142 SPQHLLEEIILVDDMSEFDDLKDKLDYYLEIFRGKVKLIRNKKREGLIRSKMIGASRASG 201
Query: 194 DVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGARGAFDWE 253
D+L+FLDSH E N WL PLL IA D++ V+CP IDVI T +Y A RGAFDW
Sbjct: 202 DILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPIIDVINELTLDYMAAPI-VRGAFDWN 260
Query: 254 FFYKRLPVLPKDEKNMPE----PFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYE 309
+ V E + PE P SP M GG+FAI+R +F ELG YD G+DI GGE E
Sbjct: 261 LNLRWDNVFAY-ELDGPEGPSTPIRSPAMTGGIFAINRHYFNELGQYDNGMDICGGENVE 319
Query: 310 LSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKNYRRVAEVWMDEYAEHLYR 369
LS +IW CGG++ PCSRVG+ + + + + +N RV VW+DEY + +
Sbjct: 320 LSLRIWMCGGQLFILPCSRVGYNSKALSQH-RRANQSALSRNLLRVVHVWLDEYKGNFFL 378
Query: 370 RRPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQI 404
+RP + G+++ + LR++L CK F+W++ I
Sbjct: 379 QRPSLTYVSCGNISERVELRKRLGCKSFQWYLDNI 413
>UniRef50_UPI000069E1C8 Cluster: Polypeptide
N-acetylgalactosaminyltransferase-like protein 2 (EC
2.4.1.41) (Protein-UDP
acetylgalactosaminyltransferase-like protein 2)
(UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase- like protein 2)
(Polypeptide GalNAc transferase-like protein; n=1;
Xenopus tropicalis|Rep: Polypeptide
N-acetylgalactosaminyltransferase-like protein 2 (EC
2.4.1.41) (Protein-UDP
acetylgalactosaminyltransferase-like protein 2)
(UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase- like protein 2)
(Polypeptide GalNAc transferase-like protein - Xenopus
tropicalis
Length = 611
Score = 277 bits (680), Expect = 4e-73
Identities = 144/340 (42%), Positives = 200/340 (58%), Gaps = 9/340 (2%)
Query: 71 SVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSV 130
+ NGF+ +S IPL R +PD RH C ++ Y E LP SV++ FHNE WSTLLRT +SV
Sbjct: 133 NTNGFDEEVSKNIPLHRIIPDGRHPECLQQNYGEKLPIASVIICFHNEGWSTLLRTVHSV 192
Query: 131 LYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLAGAER 190
L SP +KEI LVDD S ++ L AL +Y+ + VK+IR KR G+I R+ GA R
Sbjct: 193 LDNSPRTFLKEIILVDDLSHQEHLKSALSEYI-SRIGGVKLIRSNKRLGVIGGRMLGAAR 251
Query: 191 ATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGARGAF 250
AT +VLIF+DSH E + WL PLL I + ++ P ID I + TFEY +G F
Sbjct: 252 ATGEVLIFMDSHCECHPGWLEPLLSRIMHNRNRIVSPVIDFIDWKTFEYSHSSLLQQGVF 311
Query: 251 DWEFFYKRLPVLPKDEKNMPE----PFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGE 306
DW+ + +P LP+ E+ + + PF SPV+ G + A R +F +GG+D G++ WG E
Sbjct: 312 DWKLDFHWVP-LPEHEEKVRQSPIIPFRSPVIPGYVLASDRHYFQNIGGFDTGINSWGVE 370
Query: 307 QYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKNYRRVAEVWMDEYAEH 366
ELS ++W CGG + PCSRVGH Y+ N + V ++ R AE+WMD Y
Sbjct: 371 TTELSIRVWLCGGSVEIVPCSRVGHAYQNHT-MHNSVQNEAVLRSKVRTAELWMDSYKAI 429
Query: 367 LYRR--RPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQI 404
YR RI D+ ++LR++L CK F+WF+ +
Sbjct: 430 FYRNVGNSLLNRIQESDINEHEQLRQRLGCKRFQWFLANV 469
>UniRef50_Q8N3T1 Cluster: Polypeptide
N-acetylgalactosaminyltransferase-like protein 2; n=21;
Amniota|Rep: Polypeptide
N-acetylgalactosaminyltransferase-like protein 2 - Homo
sapiens (Human)
Length = 639
Score = 277 bits (680), Expect = 4e-73
Identities = 179/509 (35%), Positives = 262/509 (51%), Gaps = 32/509 (6%)
Query: 61 EESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHW 120
EE +L G ALS +IPL R+LP++RH C ++ +SLP+ SV++ FH+E W
Sbjct: 145 EEEELTPFSLDPRGLQEALSARIPLQRALPEVRHPLCLQQHPQDSLPTASVILCFHDEAW 204
Query: 121 STLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGL 180
STLLRT +S+L P +KEI LVDD S + L AL +Y+ + VK++R KR G
Sbjct: 205 STLLRTVHSILDTVPRAFLKEIILVDDLSQQGQLKSALSEYV-ARLEGVKLLRSNKRLGA 263
Query: 181 ITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYR 240
I AR+ GA RAT DVL+F+D+H E + WL PLL IA D V+ P IDVI + TF+Y
Sbjct: 264 IRARMLGATRATGDVLVFMDAHCECHPGWLEPLLSRIAGDRSRVVSPVIDVIDWKTFQYY 323
Query: 241 AQDEGARGAFDW--EFFYKRLPVLPKDEKNMP-EPFESPVMAGGLFAISRAFFWELGGYD 297
+ RG DW +F ++ LP + P P SPV+ G + A+ R +F G YD
Sbjct: 324 PSKDLQRGVLDWKLDFHWEPLPEHVRKALQSPISPIRSPVVPGEVVAMDRHYFQNTGAYD 383
Query: 298 PGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKNYRRVAE 357
+ + GGE ELSFK W CGG + PCSRVGHIY+ +P + +N R+AE
Sbjct: 384 SLMSLRGGENLELSFKAWLCGGSVEILPCSRVGHIYQN-QDSHSPLDQEATLRNRVRIAE 442
Query: 358 VWMDEYAEHLYRRRPH---YRRIDPGDLAPQKRLREKLNCKPFKWFMTQIAFDLTAKYPP 414
W+ + E Y+ P + + D + +L+ +L C+ F WF+ + +L YP
Sbjct: 443 TWLGSFKETFYKHSPEAFSLSKAEKPDCMERLQLQRRLGCRTFHWFLANVYPEL---YPS 499
Query: 415 VEPKPFAEGKIXXXXXXXXXXXLCADA-AHGPQLAA-LRLARCDVTTNAEQHFVLTWHKD 472
EP+P GK+ LCAD A G L + LA C + +Q+ T K+
Sbjct: 500 -EPRPSFSGKL-----HNTGLGLCADCQAEGDILGCPMVLAPCS-DSRQQQYLQHTSRKE 552
Query: 473 IRC-KTRNMCWDLPDSSPGSPIVLYACHLGG---GNQLWRYYPDTKQIKHGGHSNCVDH- 527
I +++C+ + ++L C G Q W + + I H C++
Sbjct: 553 IHFGSPQHLCFAVRQEQ----VILQNCTEEGLAIHQQHWD-FQENGMIVHILSGKCMEAV 607
Query: 528 -EPTTLRLHISRCDAAADTQRWTLDTVDA 555
+ L++ CD A Q+W D ++A
Sbjct: 608 VQENNKDLYLRPCDGKA-RQQWRFDQINA 635
>UniRef50_Q7TT15-2 Cluster: Isoform 2 of Q7TT15 ; n=9; Mammalia|Rep:
Isoform 2 of Q7TT15 - Mus musculus (Mouse)
Length = 596
Score = 269 bits (660), Expect = 1e-70
Identities = 147/364 (40%), Positives = 206/364 (56%), Gaps = 12/364 (3%)
Query: 48 GIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLP 107
G G+ GLPA L E + + + G+N LS+KI L RS+PD R + CK+ Y + LP
Sbjct: 93 GRGKGGLPATLSPSEEEKAKGPHEKYGYNSYLSEKISLDRSIPDYRPTKCKELKYSKELP 152
Query: 108 SVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP 167
+S++ F NE S +LR+ +S + +P HL+KEI LVDD ++ L L++Y+ P
Sbjct: 153 QISIIFIFVNEALSVILRSVHSAVNHTPTHLLKEIILVDD--NREELKAPLEEYVHKRYP 210
Query: 168 -KVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMC 226
VKV+R KR GLI AR+ G + AT V F D+H E W P+L I + + V+
Sbjct: 211 GLVKVVRNQKREGLIRARIEGWKAATGQVTGFFDAHVEFTAGWAEPVLSRIQENRKRVIL 270
Query: 227 PFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKD--EKNMPE-PFESPVMAGGLF 283
P ID I D FE + + A G + WE + + PKD + P P +P M G F
Sbjct: 271 PSIDNIKQDNFEVQRYENSAHG-YSWELWCMYISP-PKDWWDAGDPSLPIRTPAMIGCSF 328
Query: 284 AISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPG 343
++R FF E+G DPG+D++GGE EL K+W CGG M PCSRV HI RK P+ N
Sbjct: 329 VVNRKFFGEIGLLDPGMDVYGGENIELGIKVWLCGGSMEVLPCSRVAHIERKKKPY-NSN 387
Query: 344 HGDFVGKNYRRVAEVWMDEYAEHLY---RRRPHYRRIDPGDLAPQKRLREKLNCKPFKWF 400
G + +N RVAEVWMD+Y H+Y ID GD++ +K LR+ L CK F+W+
Sbjct: 388 IGFYTKRNALRVAEVWMDDYKSHVYIAWNLPLENPGIDIGDVSERKALRKSLKCKNFQWY 447
Query: 401 MTQI 404
+ +
Sbjct: 448 LDHV 451
>UniRef50_Q4RKI0 Cluster: Chromosome 21 SCAF15029, whole genome
shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome 21
SCAF15029, whole genome shotgun sequence - Tetraodon
nigroviridis (Green puffer)
Length = 531
Score = 261 bits (639), Expect = 4e-68
Identities = 143/338 (42%), Positives = 195/338 (57%), Gaps = 19/338 (5%)
Query: 50 GEQGLPAHLPKEES--DLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLP 107
GE G L E++ D E S GFN A+S+ I + R LP+ RH C ++ Y ESLP
Sbjct: 120 GEAGNAMRLYLEDTGEDAEPSSLRKYGFNEAVSEGISVHRRLPEARHPRCLQQQYSESLP 179
Query: 108 SVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP 167
S SVV+ FHNE WSTLLRT +SVL +P ++E+ LVDD S L L +YL ++
Sbjct: 180 SASVVICFHNEAWSTLLRTVHSVLSTAPRRHLRELLLVDDLSQHGHLKGVLSEYL-SHLS 238
Query: 168 KVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCP 227
+V+++R +R G+ R GA +A ++L+F+DSH E WL PLLE +A D V+ P
Sbjct: 239 RVRLLRSARRLGVAGCRALGASKAEGELLVFMDSHCECQKGWLEPLLERVAQDRTRVVSP 298
Query: 228 FIDVIAFDTFEYRAQDEGARGAFDW--EFFYKRLPVLP-KDE----------KNMPEP-- 272
ID I + TF Y A RG F+W +F ++ +LP KD + + E
Sbjct: 299 IIDAIDWRTFRYNATQWPVRGVFNWRLDFRWESHTLLPDKDPGSAVRALRLCRRLTETAR 358
Query: 273 FESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHI 332
F SPV+ G +FAI R FF +GG+DPG+ +WG EQ ELS ++W CGG M APCSRV H+
Sbjct: 359 FRSPVLGGEVFAIDRHFFQHVGGFDPGMLLWGEEQIELSIRVWSCGGSMEVAPCSRVAHL 418
Query: 333 YRKFAPFPNPGHGDFVGKNYRRVAEVWMDEYAEHLYRR 370
P+ P D + N R+AE+WM Y + YRR
Sbjct: 419 DHHSLPYTFPDQ-DLLENNKIRIAEIWMGAYRKIFYRR 455
>UniRef50_Q9HCQ5 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 9; n=51;
Euteleostomi|Rep: Polypeptide
N-acetylgalactosaminyltransferase 9 - Homo sapiens
(Human)
Length = 603
Score = 260 bits (636), Expect = 9e-68
Identities = 142/363 (39%), Positives = 201/363 (55%), Gaps = 9/363 (2%)
Query: 48 GIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLP 107
G+G+ GL A L +++ E Y G+N LSD+I L RS+PD R C++ Y + LP
Sbjct: 93 GLGQGGLAATL-RDDGQEAEGKYEEYGYNAQLSDRISLDRSIPDYRPRKCRQMSYAQDLP 151
Query: 108 SVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP 167
VSVV F NE S +LR+ +SV+ +P L+KE+ LVDD S L LD Y+ P
Sbjct: 152 QVSVVFIFVNEALSVILRSVHSVVNHTPSQLLKEVILVDDNSDNVELKFNLDQYVNKRYP 211
Query: 168 -KVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMC 226
VK++R +R GLI ARL G + ATA V+ F D+H E N W P L I D R ++
Sbjct: 212 GLVKIVRNSRREGLIRARLQGWKAATAPVVGFFDAHVEFNTGWAEPALSRIREDRRRIVL 271
Query: 227 PFIDVIAFDTFEYRAQDEGARGAFDW--EFFYKRLPVLPKDEKNMPEPFESPVMAGGLFA 284
P ID I + TFE + A G ++W Y P D + P +P M G F
Sbjct: 272 PAIDNIKYSTFEVQQYANAAHG-YNWGLRCMYIIPPQDWLDRGDESAPIRTPAMIGCSFV 330
Query: 285 ISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGH 344
+ R +F ++G DPG++++GGE EL ++WQCGG M PCSRV HI R P+ N
Sbjct: 331 VDREYFGDIGLLDPGMEVYGGENVELGMRVWQCGGSMEVLPCSRVAHIERTRKPY-NNDI 389
Query: 345 GDFVGKNYRRVAEVWMDEYAEHLYR--RRPHYR-RIDPGDLAPQKRLREKLNCKPFKWFM 401
+ +N R AEVWMD++ H+Y P +D GD++ + LR++L C+ FKW++
Sbjct: 390 DYYAKRNALRAAEVWMDDFKSHVYMAWNIPMSNPGVDFGDVSERLALRQRLKCRSFKWYL 449
Query: 402 TQI 404
+
Sbjct: 450 ENV 452
>UniRef50_Q6P9A2 Cluster: Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 4; n=28;
Euteleostomi|Rep: Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 4 - Homo
sapiens (Human)
Length = 607
Score = 258 bits (633), Expect = 2e-67
Identities = 137/340 (40%), Positives = 197/340 (57%), Gaps = 12/340 (3%)
Query: 74 GFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYR 133
G+N LSD++PL R LPD+R SGC+ + +SLP VS+V F NE S LLR+ +S + R
Sbjct: 121 GYNAYLSDRLPLDRPLPDLRPSGCRNLSFPDSLPEVSIVFIFVNEALSVLLRSIHSAMER 180
Query: 134 SPEHLIKEIFLVDDASTKDFLGKALDDYLEG-NMPK---VKVIRLPKRSGLITARLAGAE 189
+P HL+KEI LVDD S+ + L + L +Y++ N K +KV+R K+ GLI +R++G
Sbjct: 181 TPPHLLKEIILVDDNSSNEELKEKLTEYVDKVNSQKPGFIKVVRHSKQEGLIRSRVSGWR 240
Query: 190 RATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGARGA 249
ATA V+ D+H E NV W P+L I + + ++ P D I +D FE A+G
Sbjct: 241 AATAPVVALFDAHVEFNVGWAEPVLTRIKENRKRIISPSFDNIKYDNFEIEEYPLAAQG- 299
Query: 250 FDWEFF--YKRLPVLPKDEKNMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQ 307
FDWE + Y P +N P SP + G F + R +F E+G D G++++GGE
Sbjct: 300 FDWELWCRYLNPPKAWWKLENSTAPIRSPALI-GCFIVDRQYFQEIGLLDEGMEVYGGEN 358
Query: 308 YELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKNYRRVAEVWMDEYAEHL 367
EL ++WQCGG + PCSR+ HI R P+ V +N RVAEVWMDE+ H+
Sbjct: 359 VELGIRVWQCGGSVEVLPCSRIAHIERAHKPYTEDLTA-HVRRNALRVAEVWMDEFKSHV 417
Query: 368 YRR---RPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQI 404
Y ID GD+ +K LR++L CK F+W++ +
Sbjct: 418 YMAWNIPQEDSGIDIGDITARKALRKQLQCKTFRWYLVSV 457
>UniRef50_Q8IA44 Cluster: Putative polypeptide
N-acetylgalactosaminyltransferase 12; n=2; Drosophila
melanogaster|Rep: Putative polypeptide
N-acetylgalactosaminyltransferase 12 - Drosophila
melanogaster (Fruit fly)
Length = 563
Score = 255 bits (624), Expect = 3e-66
Identities = 167/551 (30%), Positives = 272/551 (49%), Gaps = 32/551 (5%)
Query: 24 FVPKSLKKIDWHNYEQISSERKRVGIGEQGLPAHL--PKEESDLEESLYSVNGFNGALSD 81
F+ L D +++ + G+GE G ++L P E D + +N L++
Sbjct: 22 FIVLLLLHRDLSSWDGLMGPLSHPGLGENGSASYLSVPSWEIDEYTQGWRYYLYNSWLAE 81
Query: 82 KIPLSRSLPDIRHSGCKKKMYIES---LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHL 138
+IPL RSLPD+R C+K Y E + S+++ F NE LLRT +S++ R+P++L
Sbjct: 82 RIPLRRSLPDLRDHRCQKLEYDEDSDEMKPASIIMIFRNEQLVVLLRTLHSLVERTPKYL 141
Query: 139 IKEIFLVDDASTKDFLGKAL-----DDYLEGNM-PKVKVIRLPKRSGLITARLAGAERAT 192
E+ LV+D S DF L D+Y+ + PK +++ LP++ GLI AR A A
Sbjct: 142 YIELILVNDHSDTDFWNDKLSLIFFDNYVHRYIHPKARILHLPEQVGLIKARNLAASEAK 201
Query: 193 ADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGARGAFDW 252
A+ L+F+D+ E WL PLL+ IA T+ P +D + T Y+ E RG +DW
Sbjct: 202 AENLVFVDAQVEFTNGWLSPLLDTIAEQSYTLATPILDNLDEQTLAYQRSIE-RRGMYDW 260
Query: 253 EFFYKRLPVLPKDEKNMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSF 312
+ +P+ ++P P+E + +FAI +F ++ +D L +G + ELSF
Sbjct: 261 SLTRREVPLSRARRSHLPWPYEVAAVRTSVFAIPAVWFQDISNFDNNLRGFGAAELELSF 320
Query: 313 KIWQCGGRMLDAPCSRVGHIYRKFAPF-PNPGH----GDFVGKNYRRVAEVWMDEYAEHL 367
K+W GGR++ PCSRVGH+ K + G G+ +N +R+ EVW + +
Sbjct: 321 KVWCTGGRIVQVPCSRVGHLQPKDEDYLKRYGDLHKMGEQKSRNLKRIIEVWTGDLKSAI 380
Query: 368 YRRRPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQIAFDLTAKYPPVEPKPFAEGKIXX 427
Y+ +PH I GDL ++L ++ C+ FK F+ I L + +A G +
Sbjct: 381 YKYQPHLLNISEGDLNEPRKLYKQNECQSFKEFINDITPGLN-HVAALNRTDYASGHVKT 439
Query: 428 XXXXXXXXXLCADAAHGPQLAALRLARCDVTTNAEQHFVLTWHKDIRCKTRNMCWDLPDS 487
+ A + + L L RC T N Q++ LT+ KD+R N+C ++
Sbjct: 440 LEFPKKCLTINAKSQN------LFLERCS-TNNTLQNWTLTYVKDLRV-AGNICAEV--- 488
Query: 488 SPGSPIVLYACHLGGGNQLWRYYPDTKQIKHGGHSNCVDHEPTTLRLHISRCDAAADTQR 547
P + CH GG Q W Y + Q+ ++ C++ L + ++ CDAA QR
Sbjct: 489 RPNLRLGYSFCHSLGGRQSWHYDSVSNQLM--SNTKCLEF-TDELNIFLAICDAANGKQR 545
Query: 548 WTLDTVDAELL 558
W LD ++ ++
Sbjct: 546 WILDNINLSVM 556
>UniRef50_Q6YBY0 Cluster: UDP-N-acetyl-D-galactosamine:polypeptide
N- acetylgalactosaminyltransferase T3; n=1; Toxoplasma
gondii|Rep: UDP-N-acetyl-D-galactosamine:polypeptide N-
acetylgalactosaminyltransferase T3 - Toxoplasma gondii
Length = 635
Score = 250 bits (613), Expect = 6e-65
Identities = 151/382 (39%), Positives = 221/382 (57%), Gaps = 31/382 (8%)
Query: 46 RVGIGEQGLPAHL--PKEESDLE--ESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKM 101
R+GI G P + P +DL ++L GFN LSD + L R+ PD RH+ C++
Sbjct: 113 RLGIKPDGQPGYTRAPSPPTDLSMPQALARGGGFNLYLSDHLELDRTAPDARHASCRQLH 172
Query: 102 Y-IESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFL----GK 156
Y + +LP SV++ F+NE +STL+R+ +SVL +P +++E+ LVDD ST ++ +
Sbjct: 173 YDLSTLPKASVIIVFYNEPFSTLMRSVHSVLNGTPPQILEELILVDDGSTLPYIREDGNQ 232
Query: 157 ALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEP 216
L +YL+ KV++IR R G++ AR+ G + A + LDSH E + WL PLL
Sbjct: 233 QLVEYLKLLPAKVRLIRNEVRKGIVGARMKGIRASRAPIFAILDSHIEVSPQWLEPLLLR 292
Query: 217 IALDYRTVMCPFIDVIAFDTFEYRAQDEGARGAFDW---EFFYK-----RLPVLPKDEKN 268
I D R V+ P ID I +TF++ A G + F W E Y+ RLP P++ +
Sbjct: 293 IKEDGRRVVMPQIDGIDAETFKHIAGGIGCKLGFLWKLMEHSYEGHQTARLP--PEERQP 350
Query: 269 MPEPFE-SPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCS 327
P F+ SP MAGGLFA ++AFF+++G YD WG E ELSF++WQCGG + APCS
Sbjct: 351 SPTDFQTSPAMAGGLFAANKAFFFDVGAYDEDFQFWGTENLELSFRLWQCGGVLECAPCS 410
Query: 328 RVGHIYRK-FAPFPNPGHGDFVGKNYRRVAEVWMDEYAEHLYR----RRPHYRRIDPGDL 382
RV HI+RK + + +P GD + N R +WMDEYA+ +R R +YR P L
Sbjct: 411 RVYHIFRKGGSGYSSP--GDSITINKMRTM-LWMDEYADLAWRVIGKPRVNYR---PESL 464
Query: 383 APQKRLREKLNCKPFKWFMTQI 404
++ R++ CK F+WFM +
Sbjct: 465 EKRREWRKRKGCKSFRWFMENV 486
>UniRef50_Q5CY08 Cluster: Extracellular protein with a signal
peptide followed by family 2 glycosyltransferase and
ricin domains; n=3; Cryptosporidium|Rep: Extracellular
protein with a signal peptide followed by family 2
glycosyltransferase and ricin domains - Cryptosporidium
parvum Iowa II
Length = 637
Score = 248 bits (608), Expect = 2e-64
Identities = 139/363 (38%), Positives = 205/363 (56%), Gaps = 19/363 (5%)
Query: 55 PAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMY-IESLPSVSVVV 113
P+ P ++ ++ L GFN LSD +PL R++ D R CK Y I + ++SV++
Sbjct: 121 PSPPPPKDLNIVAVLSQGGGFNLNLSDSLPLDRNVSDYRDLQCKLISYDISKMDTISVII 180
Query: 114 PFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFL----GKALDDYLEGNMPKV 169
F+NE +STL+R+ +SVL R+P L+ EI LVDD S + + L +Y+ +PKV
Sbjct: 181 VFYNEPFSTLMRSVHSVLNRTPPSLLDEIILVDDGSNSEHIKVGGNNLLVNYIS-TLPKV 239
Query: 170 KVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFI 229
++IR KRSG++ ARLAG + + + LDSH E W P+++ I D R ++ P I
Sbjct: 240 RLIRNAKRSGIVGARLAGINACKSPIFVILDSHIEVQPVWAEPIVKRIQEDPRRIVMPQI 299
Query: 230 DVIAFDTFEYRAQDEGARGAFDWEFFYKRLP--VLPKDEKNMPEPFE---SPVMAGGLFA 284
D I +TFE+ G F W+ P + P + + ++ SP MAGGL A
Sbjct: 300 DSIDSETFEFVNGGIGCTLGFLWKLIEHAFPQQISPDPRRRYAKNYDYVSSPTMAGGLLA 359
Query: 285 ISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRK-FAPFPNPG 343
+ AFF ++G YDP + WG E ELSF++W CGG + APCSRV H++RK + +P
Sbjct: 360 ANVAFFKQIGSYDPQFEYWGTENLELSFRVWMCGGFIECAPCSRVFHVFRKGGVGYSSPS 419
Query: 344 HGDFVGKNYRRVAEVWMDEYAEHLYR--RRPHYRRIDPGDLAPQKRLREKLNCKPFKWFM 401
H V KN R +WMDE+ + +R RP R+D G L + +LRE+L C FKWF+
Sbjct: 420 HA--VLKNKLRTLYLWMDEFGDLAWRVMGRP---RVDTGPLDERIKLRERLRCNSFKWFL 474
Query: 402 TQI 404
+
Sbjct: 475 ENV 477
>UniRef50_Q8MM26 Cluster: UDP-N-acetyl-D-galactosamine:polypeptide
N- acetylgalactosaminyltransferase T1; n=1; Toxoplasma
gondii|Rep: UDP-N-acetyl-D-galactosamine:polypeptide N-
acetylgalactosaminyltransferase T1 - Toxoplasma gondii
Length = 751
Score = 247 bits (605), Expect = 5e-64
Identities = 143/335 (42%), Positives = 195/335 (58%), Gaps = 27/335 (8%)
Query: 105 SLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDAS---TKDFLGKALDDY 161
SLP SVV+ F+NE+ S LLR+ +SVL R+P L+KEI +VDD S T +LGK L+DY
Sbjct: 281 SLPDTSVVIVFYNENLSVLLRSIHSVLNRTPPSLLKEIIVVDDFSDRQTHPWLGKQLEDY 340
Query: 162 LEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDY 221
+ G +PK +++RL +R GL+ AR AGA A+A+ + FLDSH E WL PLL + D+
Sbjct: 341 ISGTLPKTRLLRLLQRRGLMGARAAGAAAASAETVTFLDSHIECLPYWLQPLLFHVKQDW 400
Query: 222 RTVMCPFIDVIAFDTFEYRAQDEGARG-AFDWEFFY--------KRLPVLPKDE--KNMP 270
R + P I I D F R +D G + AF W + R+ L +DE KN
Sbjct: 401 RRIAMPLIPTIDADNF--RIKDGGLKTLAFTWGMSHYHIHDKIRHRIEELGQDEAAKNPD 458
Query: 271 EPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVG 330
P SP+MAGGLF I++A++ LGGYD + I+GGE++E+SFK W CGG + PCSRVG
Sbjct: 459 APTMSPIMAGGLFTITKAWWDTLGGYDKEMQIYGGEEFEISFKTWMCGGSLHLVPCSRVG 518
Query: 331 HIYRKFAPFPNPGH---GDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDP-GDLAPQK 386
H++R + + G + +N R A VWM EYA + P + P GDL K
Sbjct: 519 HVFRSNEFWQGQVYTVPGALIHRNKLRTAHVWMGEYARIVELVIPRLPQDKPLGDLTELK 578
Query: 387 RLREKLNCKPFKWFMTQIAFDLTAKYPPVEPKPFA 421
LR++L CK F W++ I YP +EP A
Sbjct: 579 ALRDRLKCKDFNWYLKNI-------YPELEPPNLA 606
>UniRef50_Q8K1B9 Cluster: Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 4; n=9;
Euteleostomi|Rep: Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 4 - Mus
musculus (Mouse)
Length = 622
Score = 244 bits (597), Expect = 5e-63
Identities = 137/355 (38%), Positives = 197/355 (55%), Gaps = 27/355 (7%)
Query: 74 GFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYR 133
G+N LSD++PL R LPD+R SGC+ + +SLP VS+V F NE S LLR+ +S + R
Sbjct: 121 GYNAYLSDRLPLDRPLPDLRPSGCRNLSFPDSLPEVSIVFIFVNEALSVLLRSIHSAMER 180
Query: 134 SPEHLIKEIFLVDDASTKDFLGKALDDYLE---GNMPK-VKVIRLPKRSGLITARLAGAE 189
+P HL+KEI LVDD S+ + L + L +Y++ G P +KV+R K+ GLI +R++G
Sbjct: 181 TPSHLLKEIILVDDNSSNEELKEKLTEYVDKVNGQKPGFIKVVRHSKQEGLIRSRVSGWR 240
Query: 190 RATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGARGA 249
ATA V+ D+H E NV W P+L I + + ++ P D I +D FE A+G
Sbjct: 241 AATAPVVALFDAHVEFNVGWAEPVLTRIKENRKRIISPSFDNIKYDNFEIEEYPLAAQG- 299
Query: 250 FDWEFF--YKRLPVLPKDEKNMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQ 307
FDWE + Y P +N P SP + G F + R +F E+G D G++++GGE
Sbjct: 300 FDWELWCRYLNPPKAWWKLENSTAPIRSPALI-GCFIVDRQYFEEIGLLDEGMEVYGGEN 358
Query: 308 YELSFK---------------IWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKNY 352
EL + +WQCGG + PCSR+ HI R P+ V +N
Sbjct: 359 VELGIRVSEISHTGLSSAPMMVWQCGGSVEVLPCSRIAHIERAHKPYTEDLTA-HVRRNA 417
Query: 353 RRVAEVWMDEYAEHLYRR---RPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQI 404
RVAEVWMDE+ H+Y ID GD+ +K LR++L CK F+W++ +
Sbjct: 418 LRVAEVWMDEFKSHVYMAWNIPQEDSGIDIGDITARKALRKQLQCKTFRWYLVSV 472
>UniRef50_O45293 Cluster: Probable N-acetylgalactosaminyltransferase
8; n=2; Caenorhabditis|Rep: Probable
N-acetylgalactosaminyltransferase 8 - Caenorhabditis
elegans
Length = 421
Score = 244 bits (597), Expect = 5e-63
Identities = 139/372 (37%), Positives = 210/372 (56%), Gaps = 17/372 (4%)
Query: 54 LPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVV 113
L +L + ES E F+ S+K+ +R++ H C+++ Y S S SVVV
Sbjct: 55 LKENLTEAESKKSEWGIKSFAFDALSSEKLGPNRNVGKQAHKLCEEEKYDASY-STSVVV 113
Query: 114 PFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDF-LGKALDDY--LEGNMPKVK 170
HNE ST+LR ++ +P+ L+KEI L +DAS +D L K L+ + ++G K+
Sbjct: 114 IHHNEALSTILRMINGIIEFTPKSLLKEIVLYEDASEEDHVLTKHLEKFAKIKGLEDKLI 173
Query: 171 VIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFID 230
+ R R GLI A++ + AT +V++F+DSH E WL PLL+PI D ++++ P +D
Sbjct: 174 IKRSEYRQGLIRAKVHASRLATGEVIVFMDSHCEVAERWLEPLLQPIKEDPKSIVLPVVD 233
Query: 231 VIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPE----PFESPVMAGGLFAIS 286
+I +F+Y + A+ FDW F +K + LP + PE PF SP M GGL A+
Sbjct: 234 LINPVSFDY-SPSMVAKSGFDWGFTFKWI-YLPWEYFETPENNVKPFNSPAMPGGLLAMR 291
Query: 287 RAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGD 346
+ +F ELG YD G++IWG E ELS K W CGGR++ APCSRVGH++R P+ + D
Sbjct: 292 KEYFVELGEYDMGMEIWGSENIELSLKAWLCGGRVVVAPCSRVGHVFRMRRPYTSKPGMD 351
Query: 347 FVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQIAF 406
N RVA+ W+ EY + +P ++ GDL +++++L CK KWF+ +
Sbjct: 352 TALYNAVRVAKTWLGEYESKFFAVKPRGAKMVFGDLTEPMQVKDRLKCKDMKWFIENV-- 409
Query: 407 DLTAKYPPVEPK 418
YP +EPK
Sbjct: 410 -----YPELEPK 416
>UniRef50_Q9NY28 Cluster: Probable polypeptide
N-acetylgalactosaminyltransferase 8; n=9; Theria|Rep:
Probable polypeptide N-acetylgalactosaminyltransferase 8
- Homo sapiens (Human)
Length = 637
Score = 241 bits (589), Expect = 5e-62
Identities = 137/356 (38%), Positives = 200/356 (56%), Gaps = 12/356 (3%)
Query: 58 LPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHN 117
L + + + L+ G+N LS+++PL+R++PD R C +K Y LPS+SV++ F N
Sbjct: 132 LSEAQQKAAQDLFRKFGYNAYLSNQLPLNRTIPDTRDYRCLRKTYPSQLPSLSVILIFVN 191
Query: 118 EHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK----VKVIR 173
E S + R S++ R+P L+KEI LVDD S+ L LD+ ++ K +K+IR
Sbjct: 192 EALSIIQRAITSIINRTPSRLLKEIILVDDFSSNGELKVHLDEKIKLYNQKYPGLLKIIR 251
Query: 174 LPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIA 233
P+R GL AR G E ATADV+ LD+H E NV W P+L I D ++ P D I
Sbjct: 252 HPERKGLAQARNTGWEAATADVVAILDAHIEVNVGWAEPILARIQEDRTVIVSPVFDNIR 311
Query: 234 FDTFEYRAQDEGARGAFDWEFF--YKRLPVLPKDEKNMPEPFESPVMAGGLFAISRAFFW 291
FDTF+ + G F+WE + Y LP D ++ P +SP + G+ A +R F
Sbjct: 312 FDTFKLDKYELAVDG-FNWELWCRYDALPQAWIDLHDVTAPVKSPSIM-GILAANRHFLG 369
Query: 292 ELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKN 351
E+G D G+ I+GGE ELS ++WQCGG++ PCSR+ H+ R P+ + +N
Sbjct: 370 EIGSLDGGMLIYGGENVELSLRVWQCGGKVEILPCSRIAHLERHHKPYALDLTA-ALKRN 428
Query: 352 YRRVAEVWMDEYAEHLYR--RRP-HYRRIDPGDLAPQKRLREKLNCKPFKWFMTQI 404
RVAE+WMDE+ +Y P ID GD++ + LREKL CK F W++ +
Sbjct: 429 ALRVAEIWMDEHKHMVYLAWNIPLQNSGIDFGDVSSRMALREKLKCKTFDWYLKNV 484
>UniRef50_Q8MYY6 Cluster: Putative polypeptide
N-acetylgalactosaminyltransferase 13; n=1; Drosophila
melanogaster|Rep: Putative polypeptide
N-acetylgalactosaminyltransferase 13 - Drosophila
melanogaster (Fruit fly)
Length = 558
Score = 233 bits (571), Expect = 7e-60
Identities = 133/352 (37%), Positives = 193/352 (54%), Gaps = 18/352 (5%)
Query: 75 FNGALSDKIPLSRSLPDIRHSGCKKKMYIESLP---SVSVVVPFHNEHWSTLLRTAYSVL 131
+N LS+ + L R LP RH C + I P +VSVV+ FHNE S LLRT S+L
Sbjct: 75 YNIHLSNALGLIRKLPVTRHHSCTTRNSILPAPLEANVSVVISFHNEARSMLLRTIVSLL 134
Query: 132 YRSPEHLIKEIFLVDDASTKDF-LGKALDDYLEGNMPK-----VKVIRLPKRSGLITARL 185
RSPE + E+ LVDD S +D L L ++ G + +R +R GLI +R
Sbjct: 135 SRSPEDYLHELILVDDGSQRDVTLLDDLKRWMGGVFGSRYRLGLTFLRNQERMGLIWSRN 194
Query: 186 AGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEG 245
GA A+ ++FLDSH E N WL PLLE +AL+ + P +D I T YR +E
Sbjct: 195 RGASLASGRYVLFLDSHCEVNEGWLEPLLERLALNTNLAVSPLLDPIDPTTLSYRKGNEL 254
Query: 246 ARGAFDWEFFYKRLPVLPKDEKNMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGG 305
+G FDW + L +++++ P++SP AGG+ +SR +F +LG ++P L IWGG
Sbjct: 255 LKGGFDWSLHFHWLKRQLTNQESLEMPYQSPAFAGGVLMMSREWFLKLGSFNPYLKIWGG 314
Query: 306 EQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGD--------FVGKNYRRVAE 357
E EL+ K+W CGG++ PCSR+GHI+R+ F P D N + +AE
Sbjct: 315 ESIELAIKLWLCGGQIEIVPCSRIGHIFRRRHAFDFPPQSDRQLSPAQETYLHNSKIIAE 374
Query: 358 VWMDEYAEHLYRRRPHYRRIDPGDLAPQ-KRLREKLNCKPFKWFMTQIAFDL 408
W+DEY Y RP RRI + +R+R++ C PF+W++ ++ +L
Sbjct: 375 SWLDEYKNMFYALRPAARRIPLDHTYDELQRMRKERRCHPFEWYLRHVSPEL 426
>UniRef50_Q5CKF0 Cluster: UDP-N-acetyl-D-galactosamine:polypeptide
N- acetylgalactosaminyltransferase T3; n=4;
Eimeriorina|Rep:
UDP-N-acetyl-D-galactosamine:polypeptide N-
acetylgalactosaminyltransferase T3 - Cryptosporidium
hominis
Length = 732
Score = 233 bits (569), Expect = 1e-59
Identities = 134/345 (38%), Positives = 193/345 (55%), Gaps = 12/345 (3%)
Query: 66 EESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMY-IESLPSVSVVVPFHNEHWSTLL 124
E+ L GFN LSD + L R ++R C+ +Y I+ L S+++ F+NE STLL
Sbjct: 240 EDFLAKGGGFNRQLSDFLSLDRVPLEVRDPICRNMIYPIKDLDDASIIITFYNEPLSTLL 299
Query: 125 RTAYSVLYRSPEHLIKEIFLVDDASTKDFL--GKALDDYLEGNMPKVKVIRLPKRSGLIT 182
R+ +SVL +P L++EI LV+D S L G LDDY+ +PKV+V+ L +R G++
Sbjct: 300 RSVHSVLNNTPPPLLREIILVNDGSDMIDLVPGGFLDDYIR-LLPKVRVVHLTERKGIVG 358
Query: 183 ARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEY-RA 241
ARL+G A+A V++ LDSH E + WL P L + ++V+ P ID I F +
Sbjct: 359 ARLSGVRAASAPVIVILDSHIETSRQWLEPQLLRLKESPKSVVMPQIDSIDPVNFAFSNF 418
Query: 242 QDEGARGAFDWEFFYKRLPVLPKDEKNMPEPFESPVMAGGLFAISRAFFWELGGYDPGLD 301
G R F + + P ++ P +SP+MAGGLFA+ R +FW LGGYD
Sbjct: 419 SGIGCRLGFKYSILEQATLTGPINDTT---PIKSPMMAGGLFAMKRDYFWHLGGYDEKFR 475
Query: 302 IWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYR-KFAPFPNPGHGDFVGKNYRRVAEVWM 360
WG E E+SF+IW CGG++ PCSRV HI+R K + +P + + N R A VWM
Sbjct: 476 HWGAENVEISFRIWMCGGQIECTPCSRVFHIFRKKGVGYSSP--PESLWHNRLRTARVWM 533
Query: 361 DEYAEHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQIA 405
DE+ + P+ I G L++KL CKPF+WF+ +A
Sbjct: 534 DEFYQITEMLAPN-PNIKLGSFDDMLHLKKKLKCKPFRWFLDNVA 577
>UniRef50_UPI000065D031 Cluster: Probable polypeptide
N-acetylgalactosaminyltransferase 8 (EC 2.4.1.41)
(Protein-UDP acetylgalactosaminyltransferase 8) (UDP-
GalNAc:polypeptide N-acetylgalactosaminyltransferase 8)
(Polypeptide GalNAc transferase 8) (GalNAc-T8)
(pp-GaNTase 8).; n=1; Takifugu rubripes|Rep: Probable
polypeptide N-acetylgalactosaminyltransferase 8 (EC
2.4.1.41) (Protein-UDP acetylgalactosaminyltransferase
8) (UDP- GalNAc:polypeptide
N-acetylgalactosaminyltransferase 8) (Polypeptide GalNAc
transferase 8) (GalNAc-T8) (pp-GaNTase 8). - Takifugu
rubripes
Length = 565
Score = 231 bits (564), Expect = 5e-59
Identities = 135/370 (36%), Positives = 200/370 (54%), Gaps = 26/370 (7%)
Query: 58 LPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHN 117
L +EE E L+ G+N LSD++PL+R +PD R + C +K Y E LP++SVV+ + +
Sbjct: 14 LSEEEQKEAERLFQQYGYNAFLSDRLPLNREIPDTRPTRCAEKKYPEELPNISVVLIYLD 73
Query: 118 EHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYL----EGNMPKVKVIR 173
E S + R S++ ++P L+ EI LVDD S+ + LGK LD+Y+ E V+ +R
Sbjct: 74 EALSVIKRAIRSLIDKTPARLLTEIILVDDHSSNEDLGKKLDEYIGSIHEERPGLVRKVR 133
Query: 174 LPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIA 233
++ GL ARL+G + A DV+ LD+H E +V W PLL I D ++ P D +
Sbjct: 134 HAEQLGLTQARLSGWKAAVGDVVAILDAHIEVHVQWAEPLLARIKEDRTVILTPVFDNVK 193
Query: 234 FDTFEYRAQDEGARGAFDWEFFYKRLPVLPK--DEKNMPEPFESPVMAGGLFAISRAFFW 291
+D + A AFDW + P+ D K+ P +SP + G+ R FF
Sbjct: 194 YDDLTV-LHYQPAADAFDWALWCMYESFRPEWYDLKDDSLPGKSPSIM-GIVVAERKFFG 251
Query: 292 ELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKN 351
E+G D G+ I+GGE EL + W CGG + PCS++ HI R P+ P + +N
Sbjct: 252 EIGSLDGGMKIYGGENVELGIRAWSCGGSIEVIPCSKIAHIERAMKPY-LPDLSVTMKRN 310
Query: 352 YRRVAEVWMDEYAEHL--------------YRRRPHYRR---IDPGDLAPQKRLREKLNC 394
RVAEVWMDEY ++ H+R ID GD++ +K+LR++LNC
Sbjct: 311 ALRVAEVWMDEYKSNVNVAWNLPLVASASKMWLSAHFRANHGIDIGDVSERKKLRKRLNC 370
Query: 395 KPFKWFMTQI 404
KPF W++ I
Sbjct: 371 KPFSWYLENI 380
>UniRef50_UPI0000E4974C Cluster: PREDICTED: hypothetical protein;
n=3; Strongylocentrotus purpuratus|Rep: PREDICTED:
hypothetical protein - Strongylocentrotus purpuratus
Length = 953
Score = 225 bits (550), Expect = 2e-57
Identities = 107/258 (41%), Positives = 152/258 (58%), Gaps = 5/258 (1%)
Query: 152 DFLGKALDDYLEG-NMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWL 210
+ L + ++DY+ + +++++R KR GLI AR+ G + + + FLDSH E + WL
Sbjct: 557 EHLQEPIEDYISQIHSVRIRMVRAEKRLGLIKARMMGVDASEGETFTFLDSHVEVMIGWL 616
Query: 211 PPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGA-RGAFDWEFFYKRLPVLPKDEK-N 268
PLL +A D V+ P +D I DTF Y E RG F+W F Y+ P+ D++ +
Sbjct: 617 EPLLARLASDRTIVVMPVVDEINKDTFNYNVVPEPLQRGGFNWRFEYRWKPIPNYDKRPS 676
Query: 269 MPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSR 328
P +SP M GGL + R+FF ELGG+D G+++WGGE E S KIW CGG + PCSR
Sbjct: 677 KVAPIKSPAMPGGLLTMDRSFFLELGGFDLGMEVWGGENLETSLKIWMCGGSIEIIPCSR 736
Query: 329 VGHIYRKFAPFPNPGHG--DFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQK 386
VGH+YR +P+ G D V N RV EVW DE+ H Y R P + D GD++ +K
Sbjct: 737 VGHVYRDTSPYSFLGQNPLDIVEHNAMRVVEVWTDEHKYHFYDRLPMLKNRDFGDVSKRK 796
Query: 387 RLREKLNCKPFKWFMTQI 404
+LRE LNC F W++ +
Sbjct: 797 KLRESLNCYDFNWYLANV 814
>UniRef50_UPI0000D9AA48 Cluster: PREDICTED: similar to
UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5; n=2; Macaca
mulatta|Rep: PREDICTED: similar to
UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 - Macaca
mulatta
Length = 442
Score = 218 bits (532), Expect = 4e-55
Identities = 130/337 (38%), Positives = 180/337 (53%), Gaps = 32/337 (9%)
Query: 74 GFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYR 133
GFN +S + + R +PD R+ C +K Y LP+ S+V+ FHNE + L RT SV+
Sbjct: 133 GFNVIISRSLGIEREVPDTRNKMCLQKHYPARLPTASIVICFHNEEFHALFRTVSSVMNL 192
Query: 134 SPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATA 193
+P + ++EI LVDD S D L + LD +LE K+K+IR KR GLI ARL GA A+
Sbjct: 193 TPHYFLEEIILVDDMSEVDDLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGASHASG 252
Query: 194 DVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGARGAFDWE 253
DVL+FLDSH E N WL PLL IA D + V+CP IDVI T EY+ RGAFDW
Sbjct: 253 DVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLEYKPSPV-VRGAFDWN 311
Query: 254 FFYKRLPVLPKDEKNMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFK 313
+K V E + PE P+ D G+ IW
Sbjct: 312 LQFKWDNVF-SYEMDGPEGPTKPIRV-----------------DCGMRIW---------- 343
Query: 314 IWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKNYRRVAEVWMDEYAEHLYRRRPH 373
CGG++ PCSRVGHI +K + + NY R+ VW+DEY E + R+P
Sbjct: 344 --MCGGQLFIIPCSRVGHISKKQTRKTSAIISATI-HNYLRLVHVWLDEYKEQFFLRKPG 400
Query: 374 YRRIDPGDLAPQKRLREKLNCKPFKWFMTQIAFDLTA 410
+ + G++ + +LR++L CK F+W++ + +L A
Sbjct: 401 LKYVTYGNIHERVQLRKRLGCKSFQWYLDNVFPELEA 437
>UniRef50_UPI0000E46FFD Cluster: PREDICTED: similar to
n-acetylgalactosaminyltransferase, partial; n=1;
Strongylocentrotus purpuratus|Rep: PREDICTED: similar to
n-acetylgalactosaminyltransferase, partial -
Strongylocentrotus purpuratus
Length = 405
Score = 200 bits (489), Expect = 6e-50
Identities = 107/279 (38%), Positives = 161/279 (57%), Gaps = 5/279 (1%)
Query: 39 QISSERKRVGIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCK 98
Q+ + R G GE G P L ++ + + + FN +SDKI L R++ D R S C+
Sbjct: 82 QLDKDIPRFGPGEGGQPLILSGKDMEKAKKSRDQHNFNLVVSDKISLERTVKDTRDSRCQ 141
Query: 99 KKMY-IESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKA 157
Y P+ SV++ FHNE WSTL+RT +SV+ R+P ++ E+ LVDDAST + L K
Sbjct: 142 DITYRFSKFPTASVIIAFHNEAWSTLMRTVHSVVNRTPRDILTEVVLVDDASTDEDLKKK 201
Query: 158 LDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPI 217
L + + KV+++ R G+ A++ GA A +VL+FLD+H E N +WL P+L+ +
Sbjct: 202 LLNIPKSVRGKVRLVHTTHREGVARAKMRGAREARGEVLVFLDAHCEVNTHWLEPMLDLV 261
Query: 218 ALDYRTVMCPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPV--LPKDEKNMP-EPFE 274
TV+ P ID I +TF + AR F W +R+P+ + K E+ P EP
Sbjct: 262 HQGPTTVVSPIIDKIDPETFGFE-DGSLARVTFRWSLETRRIPLSQIEKAERLNPLEPVR 320
Query: 275 SPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFK 313
SP+ GG+FA+S++FF ++GG D GLD WG + + S K
Sbjct: 321 SPLTNGGIFAVSKSFFEKIGGIDAGLDGWGADGLDFSMK 359
>UniRef50_Q6TBR4 Cluster: UDP-N-acetyl-D-galactosamine:polypeptide
N- acetylgalactosaminyltransferase T4; n=1; Toxoplasma
gondii|Rep: UDP-N-acetyl-D-galactosamine:polypeptide N-
acetylgalactosaminyltransferase T4 - Toxoplasma gondii
Length = 329
Score = 200 bits (489), Expect = 6e-50
Identities = 111/307 (36%), Positives = 173/307 (56%), Gaps = 21/307 (6%)
Query: 106 LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLG-----KALDD 160
LP+ S+++ F+NE+ + L RT +S+L+R+P L++EI LVDD ST F+ ++L
Sbjct: 31 LPAASIIIAFYNEYPTALHRTLHSILHRTPLQLLEEIILVDDGSTFPFITDDNAPQSLSQ 90
Query: 161 YLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALD 220
Y++ +PKV+++R R G+ AR G A + V + LDSH E WL PL+ +A +
Sbjct: 91 YVK-TLPKVRLLRHQTRKGVTVARSTGIRAAKSHVFVILDSHVEVGYQWLEPLVARVASN 149
Query: 221 YRTVMCPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEPFE-SPVMA 279
T++ P +D + + T E+++ G + F +P+ P+ P + SP M
Sbjct: 150 PETIVFPVVDAVDYRTLEFKSSGVGLIWSVMEHGF---VPLSPERLAYSPGAYRPSPTMM 206
Query: 280 GGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPF 339
G +FA + +F + GGYD G+ G E ELS + WQCGGR+ +PCSRV H++R A
Sbjct: 207 GSVFAADKNYFLQHGGYDEGMRFEGAENIELSLRQWQCGGRLECSPCSRVFHLFRSGAD- 265
Query: 340 PNPGHGDFVGKNYRRVAEVWMDEYAEHLYR--RRPHYRRIDPGDLAPQKRLREKLNCKPF 397
P V N R VWMDEY + +R PH + GD++ + +LRE+L CK F
Sbjct: 266 AQP-----VTWNRLRTMAVWMDEYGDLAWRVTGEPH---VSLGDISDRIKLRERLGCKSF 317
Query: 398 KWFMTQI 404
+WF+ +
Sbjct: 318 QWFLDNV 324
>UniRef50_Q4RNJ5 Cluster: Chromosome 21 SCAF15012, whole genome
shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome 21 SCAF15012, whole genome shotgun sequence -
Tetraodon nigroviridis (Green puffer)
Length = 364
Score = 198 bits (483), Expect = 3e-49
Identities = 119/295 (40%), Positives = 162/295 (54%), Gaps = 31/295 (10%)
Query: 50 GEQGLPAHLP--KEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMY-IESL 106
GE G L +EE EE + N +SD++ L R LP+ + C++ Y SL
Sbjct: 70 GEMGRAVRLTLSEEEKQKEEESLQKHQINIYVSDQVSLHRRLPEKWNPRCRELEYDYRSL 129
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
P+ SVV+ F+NE WSTLLRT +SVL SP+ L+KE+ LVDD S + L + L+ Y+ G +
Sbjct: 130 PTTSVVIAFYNEAWSTLLRTVHSVLETSPDILLKEVVLVDDYSDRAHLKEPLEKYVSG-L 188
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMC 226
KV++IR KR GL+ ARL GA T +VL FLD H E + WL PLL+ I + V+C
Sbjct: 189 KKVRLIRATKREGLVRARLLGASITTGEVLTFLDCHCECHEGWLEPLLQRIKEEPSAVVC 248
Query: 227 PFIDVIAFDTFEYRAQ-DEGARGAFDWE--FFYKRLPVLPKDEKNMPEPF---------- 273
P IDVI ++TFEY E G FDW F + +P + + P
Sbjct: 249 PVIDVIHWNTFEYLGNPGEPQIGGFDWRLVFTWHIIPEYEQKRRRSPTDVIRYGRLFRTL 308
Query: 274 --------------ESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKI 314
SP MAGGLFA+S+ +F LG YD G+++WGGE E SF++
Sbjct: 309 ALRAGSSDVPSARRRSPTMAGGLFAVSKNYFHYLGTYDTGMEVWGGENLEFSFRV 363
>UniRef50_Q4SKF7 Cluster: Chromosome 13 SCAF14566, whole genome
shotgun sequence; n=14; Clupeocephala|Rep: Chromosome 13
SCAF14566, whole genome shotgun sequence - Tetraodon
nigroviridis (Green puffer)
Length = 530
Score = 194 bits (473), Expect = 5e-48
Identities = 130/364 (35%), Positives = 189/364 (51%), Gaps = 42/364 (11%)
Query: 67 ESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRT 126
E+L+ G+N LSD++PL R L D R C KK Y + LP++SVV+ + NE S + R
Sbjct: 23 EALFKKYGYNVFLSDRLPLDRPLADTREPRCSKKSYPKDLPTLSVVLIYLNEALSVIKRA 82
Query: 127 AYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDY---LEGNMPKVKV--IRLPKRSGLI 181
S+L R+P+HL+KEI +VDD S+ + L LD Y LE P V+ +R ++ GL
Sbjct: 83 LRSILNRTPKHLLKEIIMVDDNSSNEDLKGDLDFYVKALEKENPSVRFTRVRHTEQKGLS 142
Query: 182 TARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRA 241
AR++G ATADV+ LD+H E + W PLL I D V+ P D + +D +
Sbjct: 143 HARVSGWSAATADVVAILDAHIEVHEMWAEPLLTQIRADRSVVVSPVFDRVNYDDLKVIK 202
Query: 242 QDEGARGAFDWEFFYKRLPVLPKDEK--NMPEPFESPVMAGGLFAISRAFFWELGGYDPG 299
A AFDW + P+ K + P +SP + G L A R F E+G D G
Sbjct: 203 YSPAAH-AFDWALWCMYEGFTPEYYKLADSSLPGKSPSVMGILVA-DRKFLGEIGVLDEG 260
Query: 300 LDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKNYRRVAEVW 359
+ K+W CGG + PCS++ H+ R + P + +N RVAEVW
Sbjct: 261 M------------KVWTCGGSIEVVPCSKIAHMERAHKRY-MPDLTLAMKRNALRVAEVW 307
Query: 360 MDEYAEHL-------------------YRRRPHYRRIDPGDLAPQKRLREKLNCKPFKWF 400
MDEY ++ +RRP++ ID G++ +K+LRE+L CKPFKW+
Sbjct: 308 MDEYKHNVNLAWNLPFQVFENEKRSSGNKRRPNH-GIDIGNVTERKQLRERLKCKPFKWY 366
Query: 401 MTQI 404
+ +
Sbjct: 367 LENV 370
>UniRef50_Q4RQK9 Cluster: Chromosome 2 SCAF15004, whole genome
shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome 2 SCAF15004, whole genome shotgun sequence -
Tetraodon nigroviridis (Green puffer)
Length = 449
Score = 192 bits (469), Expect = 2e-47
Identities = 105/237 (44%), Positives = 132/237 (55%), Gaps = 1/237 (0%)
Query: 44 RKRVGIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYI 103
R +G+ G + E + FN LSD+IP+ R++PD R C + +
Sbjct: 18 RDAKAVGQFGQAVLVSSSEDAQVRERWDEGFFNVYLSDRIPVDRAVPDTRPESCAQSLIH 77
Query: 104 ESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLE 163
+ LPS SV+ F +E WSTLLR+ +SVL RSP HL++EI LVDD STK++L LD Y+
Sbjct: 78 DDLPSTSVIFCFVDEVWSTLLRSVHSVLNRSPLHLLREIILVDDFSTKEYLKGPLDKYM- 136
Query: 164 GNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRT 223
PKV++IRL +R GLI ARLAGA AT +VL FLDSH E NV WL PLLE I LD R
Sbjct: 137 AQFPKVRIIRLQERQGLIRARLAGAAAATGEVLTFLDSHVECNVGWLEPLLERIYLDRRK 196
Query: 224 VMCPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEPFESPVMAG 280
V CP I+VI Y D RG F W + PV K P+ G
Sbjct: 197 VPCPVIEVINDKDMSYMLVDNFQRGIFKWPLVFGWSPVPEAHIKKHNLTISDPIRYG 253
>UniRef50_UPI0001554C17 Cluster: PREDICTED: similar to Polypeptide
N-acetylgalactosaminyltransferase 17; n=1;
Ornithorhynchus anatinus|Rep: PREDICTED: similar to
Polypeptide N-acetylgalactosaminyltransferase 17 -
Ornithorhynchus anatinus
Length = 328
Score = 192 bits (467), Expect = 3e-47
Identities = 96/210 (45%), Positives = 123/210 (58%), Gaps = 17/210 (8%)
Query: 309 ELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKNYRRVAEVWMDEYAEHLY 368
+++ K+W CGG M D PCSRVGHIYRK+ P+ P G + +N +RVAE WMDE+AE++Y
Sbjct: 91 DVAVKVWMCGGGMFDVPCSRVGHIYRKYVPYKVPS-GTSLARNLKRVAETWMDEFAEYIY 149
Query: 369 RRRPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQIAFDLTAKYPPVEPKPFAEGKIXXX 428
+RRP YR + GD++ QK LR+ L CK FKWFM +A+D+ YPPVEP P A G+I
Sbjct: 150 QRRPEYRHLSTGDISAQKELRKHLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEI--- 206
Query: 429 XXXXXXXXLCADAAHGPQLAALRLARC-----DVTTNAEQHFVLTWHKDIR----CKTRN 479
LC D+ HG +RL C + T + EQ F W +DIR TR
Sbjct: 207 --RNVVANLCVDSKHGATGTEVRLDVCVKDGSERTWSHEQLFTFGWREDIRPGEPLHTRK 264
Query: 480 MCWDLPDSSPGSPIVLYACHLGGGNQLWRY 509
C D S SP+ LY CH GNQ W Y
Sbjct: 265 FCLDA--ISHSSPVTLYDCHGMKGNQHWSY 292
>UniRef50_UPI0000E46EB4 Cluster: PREDICTED: similar to MGC81846
protein, partial; n=2; Strongylocentrotus
purpuratus|Rep: PREDICTED: similar to MGC81846 protein,
partial - Strongylocentrotus purpuratus
Length = 358
Score = 184 bits (447), Expect = 8e-45
Identities = 99/209 (47%), Positives = 123/209 (58%), Gaps = 2/209 (0%)
Query: 50 GEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSV 109
GE L EE + + Y + FN +S +I R++ D R+ CK ++Y E LP+V
Sbjct: 132 GEDELGMVRTDEERSIRDGGYRQHAFNELISQRIGFHRNVTDTRNPLCKYQVYSEELPTV 191
Query: 110 SVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM-PK 168
S+V+ F+NE WSTLLRT YSVL R+P LI E+ LVDD S L K LD Y+ N
Sbjct: 192 SIVICFYNEAWSTLLRTVYSVLDRTPRRLIHELILVDDFSELTHLKKELDQYMSKNFNGL 251
Query: 169 VKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPF 228
V VI +R GLI AR GA AT DVL+FLDSH E N WL PLLE I D TV+CP
Sbjct: 252 VHVIHNGQREGLIRARTIGARYATGDVLMFLDSHCEVNEQWLEPLLERIKADSHTVVCPI 311
Query: 229 IDVIAFDTFEYRAQDEGARGAFDWEFFYK 257
ID+I DTF Y A +G F+W +K
Sbjct: 312 IDIINHDTFAYTASPL-VKGGFNWGMHFK 339
>UniRef50_Q6YK77 Cluster: UDP-N-acetyl-D-galactosamine:polypeptide
N- acetylgalactosaminyltransferase T2; n=1; Toxoplasma
gondii|Rep: UDP-N-acetyl-D-galactosamine:polypeptide N-
acetylgalactosaminyltransferase T2 - Toxoplasma gondii
Length = 692
Score = 178 bits (433), Expect = 4e-43
Identities = 111/306 (36%), Positives = 162/306 (52%), Gaps = 23/306 (7%)
Query: 110 SVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDF---LGKALDDYLEGNM 166
SV++P NE + L +T + +P I+E+ +VDD S K L K L ++ +
Sbjct: 155 SVIIPVRNEE-AYLPKTLTFLFEVTPIERIREVIVVDDHSDKPIKAILEKTLPQHI---L 210
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMC 226
K +VIR GLI R+AGA AT+D FLD H V W PLL + +YR + C
Sbjct: 211 DKTRVIRFDSPQGLIRGRVAGAAIATSDNFFFLDGHCRPKVGWAEPLLAHLKTNYRRIAC 270
Query: 227 PFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEPFESPVMAGGLFAIS 286
P I I D++E G + F+W F + L E PV+AGG+ A++
Sbjct: 271 PKIYDIYLDSWE-DVGTHGTKMMFEWTFEFGWFEDLED---------EVPVLAGGILAMT 320
Query: 287 RAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGD 346
+ ++ E G YD G+ WGGE E S + W CGG ++ S++GHI+ + P PNPG+
Sbjct: 321 KKWWIESGLYDEGMLEWGGENLEQSIRSWLCGGEIVAVQESKIGHIFSR-PPKPNPGNRL 379
Query: 347 F--VGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQKRLR-EKLNCKPFKWFMTQ 403
V KN +R A+VW+DEY Y+ R GD+ +K+LR E+L C PF+W++ +
Sbjct: 380 VIQVQKNQKRGAKVWLDEYYFLFYKYHREVRGHQEGDITQRKKLRYEQLTCMPFQWYVEK 439
Query: 404 I--AFD 407
AFD
Sbjct: 440 FKTAFD 445
>UniRef50_UPI000065D57A Cluster: Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 3 (EC
2.4.1.41) (Protein-UDP
acetylgalactosaminyltransferase-like protein 3)
(UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase- like protein 3)
(Polypeptide GalNAc transferase-lik; n=1; Takifugu
rubripes|Rep: Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 3 (EC
2.4.1.41) (Protein-UDP
acetylgalactosaminyltransferase-like protein 3)
(UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase- like protein 3)
(Polypeptide GalNAc transferase-lik - Takifugu rubripes
Length = 605
Score = 174 bits (424), Expect = 5e-42
Identities = 99/267 (37%), Positives = 143/267 (53%), Gaps = 11/267 (4%)
Query: 147 DASTKDFLGKALDDYLEGNMPK-VKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEA 205
++ T + L L++Y+ P VK++R KR GLI AR+ G + A+A+V F D+H E
Sbjct: 169 NSDTTEQLKGPLEEYVNKRYPGLVKIVRNQKREGLIRARIEGWKVASAEVTGFFDAHVEF 228
Query: 206 NVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGARGAFDWEFF--YKRLPVLP 263
+W P+L I DY+ ++ P ID I DTFE + G ++WE + Y P
Sbjct: 229 TPSWAEPVLARIKEDYKRIILPSIDNIKHDTFEVERYENSGHG-YNWELWCMYINPPKQW 287
Query: 264 KDEKNMPEPFE---SPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGR 320
DE + P +P M G F +R +F ELG D G+D++GGE EL ++W CGG
Sbjct: 288 WDEGDASAPIRHDPTPAMIGCSFVANRDYFGELGLLDSGMDVYGGENIELGIRVWLCGGS 347
Query: 321 MLDAPCSRVGHIYRKFAPFPNPGHGDFVGKNYRRVAEVWMDEYAEHLY---RRRPHYRRI 377
M PCSRV HI R P+ + +N RVAEVWMDEY ++Y I
Sbjct: 348 MEVLPCSRVAHIARVKKPY-HSNIAYHTRRNALRVAEVWMDEYRSNVYLAWNIPMENHGI 406
Query: 378 DPGDLAPQKRLREKLNCKPFKWFMTQI 404
D GD++ + LR+ L CK F+W++ +
Sbjct: 407 DYGDISQRVALRKSLQCKSFEWYLENV 433
Score = 77.4 bits (182), Expect = 1e-12
Identities = 40/91 (43%), Positives = 56/91 (61%), Gaps = 1/91 (1%)
Query: 59 PKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNE 118
P EESD + L G+N LSD+I L R++PD R C+K Y LP +S++ F NE
Sbjct: 3 PAEESDAKY-LREKYGYNAYLSDRISLDRTIPDHRPGKCRKVGYPRDLPQISLIFIFVNE 61
Query: 119 HWSTLLRTAYSVLYRSPEHLIKEIFLVDDAS 149
S +LR+ +S + +P HL+KEI LV+D S
Sbjct: 62 ALSVILRSVHSAVNHTPAHLLKEIILVNDNS 92
>UniRef50_UPI0000E46551 Cluster: PREDICTED: hypothetical protein,
partial; n=1; Strongylocentrotus purpuratus|Rep:
PREDICTED: hypothetical protein, partial -
Strongylocentrotus purpuratus
Length = 325
Score = 173 bits (420), Expect = 1e-41
Identities = 72/130 (55%), Positives = 95/130 (73%)
Query: 275 SPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYR 334
SP MAGGLFA++R +F +LG YD G+DIWGGE E+SF+IWQCGG++ PCSRVGH++R
Sbjct: 1 SPTMAGGLFAMNREYFHKLGDYDEGMDIWGGENLEISFRIWQCGGKLEIVPCSRVGHVFR 60
Query: 335 KFAPFPNPGHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQKRLREKLNC 394
K P+ +P D KN RVAEVWMDEY EH Y+ +P + ID GD++ + LRE+L C
Sbjct: 61 KRRPYGSPNRQDTTTKNAVRVAEVWMDEYKEHFYQVQPKAKNIDYGDISSRVALREELKC 120
Query: 395 KPFKWFMTQI 404
K FKW++ +
Sbjct: 121 KSFKWYLDTV 130
>UniRef50_Q5CYR4 Cluster: Extracellular protein with a signal
peptide followed by a family 2 glycosyltransferase and
ricin domains; n=3; Cryptosporidium|Rep: Extracellular
protein with a signal peptide followed by a family 2
glycosyltransferase and ricin domains - Cryptosporidium
parvum Iowa II
Length = 545
Score = 169 bits (410), Expect = 2e-40
Identities = 100/303 (33%), Positives = 163/303 (53%), Gaps = 25/303 (8%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
+S+++P +E + +T L + ++L+KEI +VDD S ++ + +
Sbjct: 53 ISLIIPIRDED-EFISKTISFTLGLNNKNLLKEIIVVDDCSKREVSDLLNKELPSSYLKY 111
Query: 169 VKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPF 228
+KVIRL K GLI +R+ GA+ + + V++F+D H NW+ PL+ + + ++CP
Sbjct: 112 IKVIRLDKCEGLIRSRILGADASKSSVIVFMDGHCRPKENWIEPLINRLKEKPKAIVCPM 171
Query: 229 IDVIAFDTFEYRAQDE-GARGAFDWEFFYKRLPVLPKDEKNMPEPFES--PVMAGGLFAI 285
I+ I D + ++ G + FDW F E N E F P+ +GGL+AI
Sbjct: 172 IEDI--DRYTWKDLGTFGLKMMFDWNF-----------EFNWYEDFTDVIPIASGGLYAI 218
Query: 286 SRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHG 345
+R ++ E G YDPG+ WGGE E S +IW+CGG ++ SRVGHI+++ P PNP +
Sbjct: 219 TREWWEESGKYDPGMLEWGGENIEQSIRIWRCGGEIVAEKKSRVGHIFKR-DPKPNPENK 277
Query: 346 DF--VGKNYRRVAEVWMD----EYAEHLYRRRPHYRRIDPG-DLAPQKRLREKLNCKPFK 398
V +N +R A VW+D +Y E ++ G DL + ++E+L CKPF
Sbjct: 278 LVLQVQRNQKRAAMVWLDKKRYKYFETIHDVVKSLNETQSGVDLEQRHSIKERLKCKPFS 337
Query: 399 WFM 401
W++
Sbjct: 338 WYV 340
>UniRef50_Q4TCW9 Cluster: Chromosome undetermined SCAF6660, whole
genome shotgun sequence; n=2; Deuterostomia|Rep:
Chromosome undetermined SCAF6660, whole genome shotgun
sequence - Tetraodon nigroviridis (Green puffer)
Length = 157
Score = 163 bits (397), Expect = 9e-39
Identities = 78/145 (53%), Positives = 100/145 (68%)
Query: 97 CKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGK 156
CK K+Y + +P+ SVV+ FHNE WSTLLRT +SV+ RSP HL+ EI LVDDAS +DFL K
Sbjct: 1 CKTKVYPDDVPNTSVVIVFHNEAWSTLLRTVHSVINRSPRHLLVEIVLVDDASERDFLKK 60
Query: 157 ALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEP 216
L++Y+ V+++R+ +RSGLI ARL GA V+ FLD+H E V WL PLL
Sbjct: 61 KLENYVRTLEVPVRILRMEQRSGLIRARLRGAAATKGQVITFLDAHCECTVGWLEPLLAR 120
Query: 217 IALDYRTVMCPFIDVIAFDTFEYRA 241
I D V+CP IDVI+ +TFEY A
Sbjct: 121 IKEDRTAVVCPIIDVISDETFEYMA 145
>UniRef50_UPI0000D8AB1E Cluster:
UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 9; n=2;
Euarchontoglires|Rep:
UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 9 - Mus musculus
Length = 311
Score = 159 bits (385), Expect = 2e-37
Identities = 104/295 (35%), Positives = 149/295 (50%), Gaps = 30/295 (10%)
Query: 48 GIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLP 107
G+G+ G+ A L +++S E + G+N LSD+I L R++PD R C++ Y E LP
Sbjct: 18 GLGQGGVAATL-QDDSQETEGKFEDFGYNAQLSDRISLDRTIPDYRPKRCRQITYSEDLP 76
Query: 108 SVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP 167
+SVV F NE S +LR+ +SV+ +P L+KE+ LVDD S L LD Y+ P
Sbjct: 77 QISVVFIFVNEALSVILRSVHSVVNHTPSQLLKEVILVDDNSDNVELKFNLDQYVHKRYP 136
Query: 168 K-VKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMC 226
VKV+R +R GLI ARL G + ATA ++ F D+H E N W P L I D R ++
Sbjct: 137 GLVKVVRNSRREGLIRARLQGWKVATAPIVGFFDAHVEFNTGWAEPALARIQEDRRRIIL 196
Query: 227 PFIDVIAFDTFEYRAQ-----------------------DEGARGAFDWEFFYK-RLPVL 262
P ID I + TFE + D G G+ WE +
Sbjct: 197 PAIDNIKYSTFEVQQYASAAHGYNWGLWCMYIIPPQDWLDRGEPGSV-WELGSSTKAGST 255
Query: 263 PKDEKNMPEPFE---SPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKI 314
PK E P +P M G F + R +F ++G DPG++++G E EL ++
Sbjct: 256 PKGESFPSVPTHRGRTPAMIGCSFVVDREYFGDIGLLDPGMEVYGAENIELGMRV 310
>UniRef50_UPI0000E234D0 Cluster: PREDICTED: similar to UDP-GalNAc:
polypeptide N-acetylgalactosaminyltransferase; n=1; Pan
troglodytes|Rep: PREDICTED: similar to UDP-GalNAc:
polypeptide N-acetylgalactosaminyltransferase - Pan
troglodytes
Length = 459
Score = 158 bits (383), Expect = 4e-37
Identities = 97/277 (35%), Positives = 139/277 (50%), Gaps = 11/277 (3%)
Query: 48 GIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLP 107
G+G+ GL A L +++ E Y G+N LSD+I L RS+PD R C++ Y + LP
Sbjct: 93 GLGQGGLAATL-RDDGQEAEGKYEEYGYNAQLSDRISLDRSIPDYRPRKCRQMSYAQDLP 151
Query: 108 SVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP 167
VSVV F NE S +LR+ +SV+ +P L+KE+ LVDD S L LD Y+ P
Sbjct: 152 QVSVVFIFVNEALSVILRSVHSVVNHTPSQLLKEVILVDDNSDNVELKFNLDQYVNKRYP 211
Query: 168 -KVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMC 226
VK++R +R GLI ARL G + ATA V+ F D+H E N W P L I D R
Sbjct: 212 GLVKIVRNSRREGLIRARLQGWKAATAPVVGFFDAHVEFNTGWAEPALSRIREDSRPQFL 271
Query: 227 PFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKN---------MPEPFESPV 277
D + F R A+ + P + + + P +P
Sbjct: 272 LSHDELKLSCFSVRYSRRAHHYAWLIFCIFSFPGSWPSRQGSGLHSEVLTPLSLPPRTPA 331
Query: 278 MAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKI 314
M G F + R +F ++G DPG++++GGE EL ++
Sbjct: 332 MIGCSFVVDREYFGDIGLLDPGMEVYGGENVELGMRV 368
>UniRef50_Q5CHA1 Cluster: Glycosyl transferase; n=4;
Cryptosporidium|Rep: Glycosyl transferase -
Cryptosporidium hominis
Length = 809
Score = 149 bits (362), Expect = 2e-34
Identities = 98/307 (31%), Positives = 155/307 (50%), Gaps = 28/307 (9%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
+S+V+P HNE + +T + +P L++EI +VDD S K + L++ L N K
Sbjct: 274 ISIVIPAHNED-EFISKTIIFTIESTPTELLREIIIVDDFSEKPVF-EILEEELPENYKK 331
Query: 169 -VKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCP 227
VK+IRL K GLI +++ GA+ A + FLD H + W L++ I +Y+ V+CP
Sbjct: 332 YVKIIRLKKCEGLIRSKIIGADAALGPNIFFLDGHCKPKKGWSEALVKSIRENYKRVVCP 391
Query: 228 FIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEPFESPVMAGGLFAISR 287
+ I+ + GA+ +W F + + +P E P+ +GG+ I++
Sbjct: 392 IVQSISNIDWS-DIGTAGAKMMIEWNFAFHWY------DDGLP---EIPIASGGILMITK 441
Query: 288 AFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDF 347
++ E G YDPG+ WGGE E SF+IW CGG + S VGHI+ + N + DF
Sbjct: 442 RWWEESGKYDPGMLYWGGENIEQSFRIWLCGGEIHVVRNSLVGHIFER--NNSNKRNQDF 499
Query: 348 ---------VGKNYRRVAEVWMDEYAEHLYRRRPH---YRRID-PGDLAPQKRLREKLNC 394
+ N++R A VW+ E Y + H Y I L+ + L+ L C
Sbjct: 500 QYKKMLIDNMNSNHQRTAFVWLSEQFYETYFKNYHVLGYLPISYTKGLSERLSLKHILKC 559
Query: 395 KPFKWFM 401
KPF+W++
Sbjct: 560 KPFEWYI 566
>UniRef50_UPI0000F20FAB Cluster: PREDICTED: hypothetical protein;
n=1; Danio rerio|Rep: PREDICTED: hypothetical protein -
Danio rerio
Length = 204
Score = 145 bits (351), Expect = 3e-33
Identities = 71/122 (58%), Positives = 85/122 (69%), Gaps = 1/122 (0%)
Query: 97 CKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGK 156
CK K+Y LP+ SV++PFHNE WS+LLRT +SVL RSP LI EI LVDD S K L
Sbjct: 69 CKLKLYTADLPNTSVIIPFHNEGWSSLLRTVHSVLDRSPPLLIAEIILVDDFSDKGHLKA 128
Query: 157 ALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEP 216
L+ Y+ +PKV+++R KR GLI RL GA A V+ FLDSH EANVNWLPPLL+
Sbjct: 129 PLEQYMV-RLPKVRILRTQKREGLIRTRLLGAAAARGQVITFLDSHCEANVNWLPPLLDR 187
Query: 217 IA 218
IA
Sbjct: 188 IA 189
>UniRef50_UPI00005A4710 Cluster: PREDICTED: similar to GalNAc
transferase 10 isoform a; n=1; Canis lupus
familiaris|Rep: PREDICTED: similar to GalNAc transferase
10 isoform a - Canis familiaris
Length = 216
Score = 144 bits (349), Expect = 6e-33
Identities = 66/123 (53%), Positives = 86/123 (69%), Gaps = 2/123 (1%)
Query: 29 LKKIDWHNYEQISSERKRVGIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRS 88
L++ DWH+YE I E R G GE G P P E D ++S Y NGFN +S+ I L RS
Sbjct: 45 LRRKDWHDYESIQKEAMRSGKGEHGKP--YPLTEEDRDDSAYRENGFNIFVSNSIALERS 102
Query: 89 LPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDA 148
LPDIRH+ CK KMY+E LP+ S+++PFHNE W++LLRT +S++ R+PE LI EI LVDD
Sbjct: 103 LPDIRHANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDF 162
Query: 149 STK 151
S +
Sbjct: 163 SDR 165
>UniRef50_Q5BYW7 Cluster: SJCHGC07375 protein; n=1; Schistosoma
japonicum|Rep: SJCHGC07375 protein - Schistosoma
japonicum (Blood fluke)
Length = 202
Score = 116 bits (280), Expect = 1e-24
Identities = 61/136 (44%), Positives = 83/136 (61%), Gaps = 2/136 (1%)
Query: 50 GEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESL-P- 107
G+ G P LP + +S+N FN SD I L R+L D RH C +++ ++ L P
Sbjct: 67 GDMGRPVILPASLQAESKRTFSINEFNLVASDLIGLRRNLDDFRHPSCPRQIPLDKLIPF 126
Query: 108 SVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP 167
SV++ FHNE WS LLRT +SVL R+PE L+ EI LVDDAST+ LG LD+Y+
Sbjct: 127 KTSVIIVFHNEAWSALLRTVHSVLDRTPEQLLHEIILVDDASTQSHLGHQLDNYISSLNK 186
Query: 168 KVKVIRLPKRSGLITA 183
+++ R+ RSGLI A
Sbjct: 187 PIRLERMSNRSGLIRA 202
>UniRef50_Q4RNJ6 Cluster: Chromosome 21 SCAF15012, whole genome
shotgun sequence; n=2; Clupeocephala|Rep: Chromosome 21
SCAF15012, whole genome shotgun sequence - Tetraodon
nigroviridis (Green puffer)
Length = 534
Score = 113 bits (271), Expect = 2e-23
Identities = 66/185 (35%), Positives = 95/185 (51%), Gaps = 23/185 (12%)
Query: 223 TVMCPFIDVIAFDTFEYRAQ-DEGARGAFDWEFFYKRLPVLPKDEKNMPEPFE--SPVMA 279
T++CP ID I ++TFEY Q DE G FDW ++ V ++ K + P
Sbjct: 208 TIVCPVIDTIDWNTFEYYMQTDEPMIGGFDWRLTFQWHSVPERERKRRSSRIDPIRPRCR 267
Query: 280 GGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPF 339
G L A+S + + E+ +WQCGG + PCS VGH++ K AP+
Sbjct: 268 GALAAMSLSLAFR----------------EIRGNVWQCGGSLEIHPCSHVGHVFPKKAPY 311
Query: 340 PNPGHGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPGDLAPQKRLREKLNCKPFKW 399
P +F+ +N R AEVWMD Y +H Y R P R+ GD++ + LREKL C+ F W
Sbjct: 312 ARP---NFL-QNTVRAAEVWMDSYKQHFYNRNPAARKETYGDISGRLLLREKLKCQSFTW 367
Query: 400 FMTQI 404
++ I
Sbjct: 368 YLKNI 372
Score = 78.6 bits (185), Expect = 4e-13
Identities = 46/104 (44%), Positives = 59/104 (56%), Gaps = 4/104 (3%)
Query: 50 GEQGLPA--HLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLP 107
GE G PA HL +E EE N +SDKI L R + D R + C LP
Sbjct: 81 GEWGRPARLHLSPDEKKQEEDSVERYAINIFVSDKISLHRHIQDHRMNECA--FDYRRLP 138
Query: 108 SVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTK 151
+ SV++ F+NE WSTLLRT +SVL +P L+KEI L+DD S +
Sbjct: 139 TTSVIIAFYNEAWSTLLRTIHSVLETTPAILLKEIILIDDFSDR 182
>UniRef50_UPI0000E49DD9 Cluster: PREDICTED: hypothetical protein;
n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
hypothetical protein - Strongylocentrotus purpuratus
Length = 220
Score = 111 bits (266), Expect = 6e-23
Identities = 61/151 (40%), Positives = 84/151 (55%)
Query: 8 LHAPPDLNKAVNKENHFVPKSLKKIDWHNYEQISSERKRVGIGEQGLPAHLPKEESDLEE 67
L D+++ +E V + + N+E + +R G GE G P L + D +
Sbjct: 48 LRKDDDIHEIQAEETLLVEPPVLRGTLGNFEPHPEDERRQGAGEYGRPVMLNPNDQDKYD 107
Query: 68 SLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTA 127
GFN +SD+I L R++ DIRH CK Y ++LP+ +VVV FHNE WSTLLRT
Sbjct: 108 QSLKEYGFNMVISDRIALDRAVNDIRHDECKYWHYPKNLPNTTVVVVFHNEGWSTLLRTV 167
Query: 128 YSVLYRSPEHLIKEIFLVDDASTKDFLGKAL 158
+SV+ SP +L+ EI LVDD S K L K L
Sbjct: 168 HSVINTSPPYLLHEIILVDDFSDKISLVKRL 198
>UniRef50_Q4T9B6 Cluster: Chromosome undetermined SCAF7602, whole
genome shotgun sequence; n=1; Tetraodon
nigroviridis|Rep: Chromosome undetermined SCAF7602,
whole genome shotgun sequence - Tetraodon nigroviridis
(Green puffer)
Length = 451
Score = 103 bits (247), Expect = 1e-20
Identities = 81/244 (33%), Positives = 117/244 (47%), Gaps = 15/244 (6%)
Query: 48 GIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLP 107
G+GE P L E D + GFN SD I L R++ DIRH CK Y E+L
Sbjct: 138 GLGEGAKPFVLGPEYKDAIQDSIKEFGFNMVASDMISLDRTISDIRHDECKYWHYDENLL 197
Query: 108 SVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFL-GKALD-DYLEGN 165
+ SVV+ FHNE WSTL+RT +SV+ R+P + EI ++DD S K+ AL + +G+
Sbjct: 198 TSSVVIVFHNEGWSTLMRTVHSVIKRTPRRYLAEIVMIDDFSNKEKAHHTALPVHFFDGS 257
Query: 166 MPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTV- 224
M + I + S ++ A A +F + T + + PI RTV
Sbjct: 258 M--LTFIPVLFVSPSLSPNPTSALSLLAQ--LFCPACTPRARLLVLYVHGPIPRSVRTVC 313
Query: 225 MCPFIDVIAFDTFEYRAQDEG-----ARGAFDWEFFYKRLPVLPKDE---KNMPEPFESP 276
P ID I + Q G ARGA+DW +KR+P+ +++ K EP+
Sbjct: 314 TVPLIDSIDGQKYTIEPQGGGDQDGFARGAWDWSMLWKRVPLGDREKHLRKTQTEPYRCS 373
Query: 277 VMAG 280
+G
Sbjct: 374 TASG 377
>UniRef50_UPI0000E46BB8 Cluster: PREDICTED: similar to
UDP-GalNAc:polypeptide,
N-acetylgalactosaminyltransferase, partial; n=3;
Strongylocentrotus purpuratus|Rep: PREDICTED: similar to
UDP-GalNAc:polypeptide,
N-acetylgalactosaminyltransferase, partial -
Strongylocentrotus purpuratus
Length = 112
Score = 102 bits (245), Expect = 2e-20
Identities = 50/102 (49%), Positives = 66/102 (64%)
Query: 48 GIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLP 107
G GE G P + E E L+ +N FN SD+I L+RSLPD+R GC K+Y + LP
Sbjct: 9 GPGEMGKPVIIEPEGKAESERLWPINEFNLMASDRIALNRSLPDVRPRGCANKVYPKKLP 68
Query: 108 SVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDAS 149
+ SV++ +HNE STLLR +S++ RSP L+ EI LVDDAS
Sbjct: 69 TTSVILVYHNEARSTLLRNVHSIINRSPHDLLAEIILVDDAS 110
>UniRef50_Q2B871 Cluster: Glycosyl transferase, group 2 family
protein; n=1; Bacillus sp. NRRL B-14911|Rep: Glycosyl
transferase, group 2 family protein - Bacillus sp. NRRL
B-14911
Length = 297
Score = 95.5 bits (227), Expect = 3e-18
Identities = 67/230 (29%), Positives = 104/230 (45%), Gaps = 14/230 (6%)
Query: 110 SVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKV 169
S++ P NE + +T S + +P E+ +VDD ST DYL+ N+ +
Sbjct: 8 SIIFPVKNEG-ENVKKTLES-FFSAPIRQPYELIIVDDRSTDQCCDFLKADYLDKNISLI 65
Query: 170 KVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFI 229
+ G AR GA+ A VL+F D+H E WL L+EP+ + P I
Sbjct: 66 TTDGV----GAANARNEGAKLAKGQVLVFCDAHLEFEDYWLDLLIEPLLTGLTDAVTPAI 121
Query: 230 DVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEPFESPVMAGGLFAISRAF 289
I F G W + + + FE+ ++ GG FAI+R+
Sbjct: 122 GAIGNPHFT-------GYGQTLW-VNERSSKIRTHWNVKQDDLFETAILPGGCFAINRSV 173
Query: 290 FWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPF 339
F E GG++ G +WG E E+S K+W G + P ++V H++RK P+
Sbjct: 174 FEEAGGFETGFPVWGYEDVEISIKLWLFGYKCHVQPKAKVLHLFRKVQPY 223
>UniRef50_Q4T0W3 Cluster: Chromosome undetermined SCAF10824, whole
genome shotgun sequence; n=9; Euteleostomi|Rep:
Chromosome undetermined SCAF10824, whole genome shotgun
sequence - Tetraodon nigroviridis (Green puffer)
Length = 149
Score = 93.5 bits (222), Expect = 1e-17
Identities = 46/95 (48%), Positives = 56/95 (58%), Gaps = 4/95 (4%)
Query: 314 IWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKNYRRVAEVWMDEYAEHLYRRRPH 373
IWQCGG + PCS VGH++ K AP+ + N R AEVWMDEY E Y R PH
Sbjct: 1 IWQCGGSLEIHPCSHVGHVFPKKAPYSR----NKALANSVRAAEVWMDEYKEIYYHRNPH 56
Query: 374 YRRIDPGDLAPQKRLREKLNCKPFKWFMTQIAFDL 408
R GD+ +++LREKL CK F WF+ I DL
Sbjct: 57 ARLEAFGDVTERRKLREKLGCKSFGWFLENIYPDL 91
>UniRef50_UPI0000F1FCC6 Cluster: PREDICTED: hypothetical protein;
n=1; Danio rerio|Rep: PREDICTED: hypothetical protein -
Danio rerio
Length = 213
Score = 91.5 bits (217), Expect = 6e-17
Identities = 47/102 (46%), Positives = 60/102 (58%)
Query: 48 GIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLP 107
G G G+PA L E + L G+N LSDKI L RSLPD R S CKK + LP
Sbjct: 107 GRGRGGIPATLTPAEEKEAKFLREKYGYNAFLSDKISLDRSLPDYRPSKCKKAFFPRDLP 166
Query: 108 SVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDAS 149
+S++ F NE S +LR+ +S + +P HL+KEI LVDD S
Sbjct: 167 QISIIFIFVNEALSVILRSVHSAVNHTPAHLLKEIILVDDNS 208
>UniRef50_Q3A8Y2 Cluster: Glycosyl transferase, group 2 family; n=1;
Carboxydothermus hydrogenoformans Z-2901|Rep: Glycosyl
transferase, group 2 family - Carboxydothermus
hydrogenoformans (strain Z-2901 / DSM 6008)
Length = 288
Score = 91.5 bits (217), Expect = 6e-17
Identities = 71/224 (31%), Positives = 107/224 (47%), Gaps = 22/224 (9%)
Query: 111 VVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKVK 170
+++ ++ W L +T S L + + EI ++DD ST D L+ GN +
Sbjct: 6 IIISYNEGQW--LRKTVESFL-AAKTGIPFEIIVIDDGST-DQSTVFLNS---GNFKNID 58
Query: 171 VIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFID 230
+I+LP RSG+ A+ AGA +A +VLIF D+H WL +LE L +T++ P +
Sbjct: 59 LIKLP-RSGVTRAKNAGANKARGEVLIFSDAHILVEDFWLEKMLED--LQEKTILSPLVV 115
Query: 231 VIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEPFESPVMAGGLFAISRAFF 290
+ E R F + LP N E + PV+ GG I + F
Sbjct: 116 SLL----------EPQRVGMGLTFNNELLPCWRSYTTNSVE--DVPVLPGGFMLIKKNDF 163
Query: 291 WELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYR 334
LGGYD GL IWG + E S + W G +L P ++V H++R
Sbjct: 164 IALGGYDEGLKIWGYDDCEFSLRAWLMGFNLLVTPRTKVFHLFR 207
>UniRef50_Q4SIA0 Cluster: Chromosome 5 SCAF14581, whole genome
shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome 5 SCAF14581, whole genome shotgun sequence -
Tetraodon nigroviridis (Green puffer)
Length = 404
Score = 89.0 bits (211), Expect = 3e-16
Identities = 40/82 (48%), Positives = 56/82 (68%)
Query: 69 LYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAY 128
++ G+NG LSD++PL R +PD+R GC+ Y SLP VS+V F NE S +LR+ +
Sbjct: 58 MFQYYGYNGYLSDRLPLDRPIPDLRPDGCRNTTYPLSLPQVSIVFIFVNEALSVILRSIH 117
Query: 129 SVLYRSPEHLIKEIFLVDDAST 150
S + R+P HL+KEI LVDD S+
Sbjct: 118 SAINRTPSHLLKEIILVDDNSS 139
Score = 57.6 bits (133), Expect = 8e-07
Identities = 33/90 (36%), Positives = 48/90 (53%), Gaps = 6/90 (6%)
Query: 227 PFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKD---EKNMPEPFESPVMAGGLF 283
P D I +DTFE A+G FDWE + + L PK + N P +SP + G F
Sbjct: 159 PSFDNIKYDTFEIEEYPLSAQG-FDWELWCRYLNP-PKSWWFKGNKSAPIQSPALIG-CF 215
Query: 284 AISRAFFWELGGYDPGLDIWGGEQYELSFK 313
+ R +F E+G D G++++GGE EL +
Sbjct: 216 VVDRLYFEEIGLLDEGMEVYGGENVELGIR 245
Score = 35.9 bits (79), Expect = 2.9
Identities = 12/28 (42%), Positives = 21/28 (75%)
Query: 377 IDPGDLAPQKRLREKLNCKPFKWFMTQI 404
ID GD++ +K LR++L CK F+W++ +
Sbjct: 249 IDIGDVSDRKALRKRLQCKTFRWYLVNM 276
>UniRef50_Q7Q046 Cluster: ENSANGP00000016624; n=1; Anopheles gambiae
str. PEST|Rep: ENSANGP00000016624 - Anopheles gambiae
str. PEST
Length = 205
Score = 87.8 bits (208), Expect = 7e-16
Identities = 46/109 (42%), Positives = 62/109 (56%), Gaps = 1/109 (0%)
Query: 54 LPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKM-YIESLPSVSVV 112
LP ++ +E L Y G N +SD IP+ R LPD+R C + + LP S+V
Sbjct: 96 LPDNITEEVHQLVRQGYDQQGLNQYVSDLIPVRRRLPDLRDPWCTAETRLLPELPQASIV 155
Query: 113 VPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDY 161
+ F NE WS L+RT +S+L R+P LI+EI LVDD S L LD+Y
Sbjct: 156 IVFFNEAWSVLVRTVHSILDRTPHALIREIILVDDFSNLAHLRTQLDEY 204
>UniRef50_Q4STJ6 Cluster: Chromosome undetermined SCAF14183, whole
genome shotgun sequence; n=2; Tetraodon
nigroviridis|Rep: Chromosome undetermined SCAF14183,
whole genome shotgun sequence - Tetraodon nigroviridis
(Green puffer)
Length = 344
Score = 81.8 bits (193), Expect = 5e-14
Identities = 38/92 (41%), Positives = 52/92 (56%), Gaps = 3/92 (3%)
Query: 220 DYRTVMCPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEP---FESP 276
D V+ P ID+I DTF Y A RG FDW +K + P+ +P ++P
Sbjct: 2 DPTRVVSPVIDIINMDTFAYVAASADLRGGFDWSLHFKWEQLSPEQRARRTDPAQPIKTP 61
Query: 277 VMAGGLFAISRAFFWELGGYDPGLDIWGGEQY 308
++AGGLF I R++F LG YD +DIWGGE +
Sbjct: 62 IIAGGLFVIDRSWFNHLGKYDTAMDIWGGENF 93
>UniRef50_UPI000155C133 Cluster: PREDICTED: similar to
UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 4, partial; n=1;
Ornithorhynchus anatinus|Rep: PREDICTED: similar to
UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 4, partial -
Ornithorhynchus anatinus
Length = 305
Score = 80.2 bits (189), Expect = 1e-13
Identities = 38/94 (40%), Positives = 53/94 (56%), Gaps = 4/94 (4%)
Query: 314 IWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKNYRRVAEVWMDEYAEHLYRR--- 370
+WQCGG + PCSR+ HI R P+ V +N RVAEVWMDE+ H+Y
Sbjct: 105 VWQCGGSVEVLPCSRIAHIERAHKPYTEDLTA-HVRRNALRVAEVWMDEFKSHVYMAWNI 163
Query: 371 RPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQI 404
ID GD++ +K LR+ L CK F+W++ +
Sbjct: 164 PQEDSGIDIGDISERKALRKALQCKTFRWYLVNV 197
>UniRef50_UPI00005A4DE7 Cluster: PREDICTED: similar to Probable
polypeptide N-acetylgalactosaminyltransferase 8
(Protein-UDP acetylgalactosaminyltransferase 8)
(UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 8) (Polypeptide GalNAc
transferase 8) (GalNAc-T8) (pp-GaNTase 8)...; n=1; Canis
lupus familiaris|Rep: PREDICTED: similar to Probable
polypeptide N-acetylgalactosaminyltransferase 8
(Protein-UDP acetylgalactosaminyltransferase 8)
(UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 8) (Polypeptide GalNAc
transferase 8) (GalNAc-T8) (pp-GaNTase 8)... - Canis
familiaris
Length = 437
Score = 78.2 bits (184), Expect = 6e-13
Identities = 38/93 (40%), Positives = 58/93 (62%)
Query: 58 LPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHN 117
L + + E L+ G+N LS+++PL+R++PD R S C +K Y LPS+ V++ F N
Sbjct: 55 LSEAQQKKAEDLFQEFGYNVYLSNQLPLNRTIPDTRDSRCLQKTYSSQLPSLGVILIFMN 114
Query: 118 EHWSTLLRTAYSVLYRSPEHLIKEIFLVDDAST 150
E S + R S++ R+P L+KEI LVDD S+
Sbjct: 115 EALSIIQRAITSIINRTPTQLLKEIILVDDFSS 147
Score = 54.0 bits (124), Expect = 1e-05
Identities = 33/96 (34%), Positives = 49/96 (51%), Gaps = 4/96 (4%)
Query: 220 DYRTVMCPFIDVIAFDTFEYRAQDEGARGAFDWEFF--YKRLPVLPKDEKNMPEPFESPV 277
D ++ P D I FDTFE G F+W+ + Y LP D ++ P +SP
Sbjct: 160 DRTVIVSPVFDNINFDTFELDKYALAVDG-FNWKLWCRYDPLPEAWFDLHDVTAPIKSPS 218
Query: 278 MAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFK 313
+ G + A +R F E+G D G+ ++GGE ELS +
Sbjct: 219 IMG-ILAANRIFLGEIGSLDGGMLVYGGENVELSLR 253
>UniRef50_A5D6B4 Cluster: Predicted glycosyltransferases; n=1;
Pelotomaculum thermopropionicum SI|Rep: Predicted
glycosyltransferases - Pelotomaculum thermopropionicum
SI
Length = 274
Score = 76.2 bits (179), Expect = 2e-12
Identities = 62/200 (31%), Positives = 84/200 (42%), Gaps = 15/200 (7%)
Query: 141 EIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLD 200
EI +VDD S+ D + L N +VK+I G AR GA A LIF D
Sbjct: 16 EIIVVDDGSS-DGCCRFLQQ--SNNYERVKLIS-SSGLGAARARNLGAASARGKYLIFCD 71
Query: 201 SHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLP 260
+H NWL LL+ + + P I + Q +R W
Sbjct: 72 AHITVPQNWLEALLDTFSRPGVDAVSPAIGSLENPAAVGYGQTWNSRLETVW-------- 123
Query: 261 VLPKDEKNMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGR 320
LP P P++ GG A+ F +GG+D G +WG E ELS K+W G R
Sbjct: 124 -LPPPGGMPAGPV--PLLPGGCLAVRAGAFRRVGGFDEGFIVWGCEDAELSLKLWLFGCR 180
Query: 321 MLDAPCSRVGHIYRKFAPFP 340
+ P RV H++R P+P
Sbjct: 181 LYVVPSVRVLHLFRSRHPYP 200
>UniRef50_Q3A8Y3 Cluster: Glycosyl transferase, group 2 family; n=1;
Carboxydothermus hydrogenoformans Z-2901|Rep: Glycosyl
transferase, group 2 family - Carboxydothermus
hydrogenoformans (strain Z-2901 / DSM 6008)
Length = 288
Score = 74.1 bits (174), Expect = 9e-12
Identities = 63/200 (31%), Positives = 95/200 (47%), Gaps = 17/200 (8%)
Query: 141 EIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLD 200
EI +VDD ST D L+ + ++K+I+ + GL A+ GA+ A+ L+F D
Sbjct: 32 EIIVVDDGST-DHSTAFLEK--DSRYNQIKLIKT-EGIGLARAKNLGAKYASGKYLVFSD 87
Query: 201 SHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLP 260
+H WL L +A +CP I +A + G EF RL
Sbjct: 88 AHMSYQTFWLDHLEAFLAEKDVGGICPAIASLA------EPERIGYGQTISPEF---RLT 138
Query: 261 VLPKDEKNMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGR 320
L N + E PV+ GGL I F+E+GG++ ++ WG E ELS ++W G R
Sbjct: 139 WLA----NPGKVAEIPVVPGGLMVIKSKVFFEVGGFEGLMERWGWEDAELSLRLWLMGYR 194
Query: 321 MLDAPCSRVGHIYRKFAPFP 340
+L P V H++R+ P+P
Sbjct: 195 LLVVPEVVVYHLFRERQPYP 214
>UniRef50_A7T195 Cluster: Predicted protein; n=1; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 693
Score = 71.3 bits (167), Expect = 6e-11
Identities = 33/65 (50%), Positives = 43/65 (66%)
Query: 67 ESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRT 126
E Y N FN A+SDKI R +PD RHS C+ + Y +LP+ S+++ FHNE STLLRT
Sbjct: 28 EDAYGKNQFNQAISDKIGGDRDVPDTRHSHCRYEAYPSTLPATSIIITFHNEARSTLLRT 87
Query: 127 AYSVL 131
S+L
Sbjct: 88 VKSLL 92
>UniRef50_Q4RAK4 Cluster: Chromosome undetermined SCAF23488, whole
genome shotgun sequence; n=2; Tetraodontidae|Rep:
Chromosome undetermined SCAF23488, whole genome shotgun
sequence - Tetraodon nigroviridis (Green puffer)
Length = 174
Score = 69.3 bits (162), Expect = 3e-10
Identities = 47/185 (25%), Positives = 80/185 (43%), Gaps = 12/185 (6%)
Query: 371 RPHYRRIDPGDLAPQKRLREKLNCKPFKWFMTQIAFDLTAKYPPVEPKPFAEGKIXXXXX 430
RP + GD++ KR RE+ CK FKWFM +IA+D+ Y P+ PK G+I
Sbjct: 1 RPESLTLAYGDISELKRFREEHRCKSFKWFMEEIAYDIPLHY-PMPPKNVDWGEI----- 54
Query: 431 XXXXXXLCADAAHGPQLAALRLARCDVTTNAEQHFVLTWHKDIRCKTRNMCWDLPDSSPG 490
C D+ + + C Q F + ++ + + C L +
Sbjct: 55 RGLDTSYCIDSMGHTNGGNVEIGPCH-RMGGNQLFRI--NEANQLMQYDQC--LTRGTDN 109
Query: 491 SPIVLYACHLGGGNQLWRYYPDTKQIKHGGHSNCVDHEPTTLRLHISRCDAAADTQRWTL 550
S +++ C + W+Y+ D + H C+D ++ IS CD + TQ+W +
Sbjct: 110 SGVIITHCDQNQHTE-WKYFKDLHRFTHVTTGKCLDRSDLLHKVFISDCDTSKTTQKWEM 168
Query: 551 DTVDA 555
+ + A
Sbjct: 169 NNIVA 173
>UniRef50_Q01V98 Cluster: Glycosyl transferase, family 2; n=1;
Solibacter usitatus Ellin6076|Rep: Glycosyl transferase,
family 2 - Solibacter usitatus (strain Ellin6076)
Length = 282
Score = 69.3 bits (162), Expect = 3e-10
Identities = 68/232 (29%), Positives = 100/232 (43%), Gaps = 26/232 (11%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
+SVV+ NE + L RT S P+ EI +VDD ST +G+
Sbjct: 3 LSVVIISRNEG-AELRRTVESFRATLPQRA--EIVIVDDRST------------DGSTRG 47
Query: 169 VKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPF 228
++ + G+ AR G + T D+L F D+H NW PL E L+ R V
Sbjct: 48 LRRVIKTNGIGVACARNLGVSKTTGDMLFFADAHIRLEKNWWQPLAE--VLEDRKVAA-- 103
Query: 229 IDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEPFESPVMAGGLFAISRA 288
V T + G F R LP+ PF +P++ G + RA
Sbjct: 104 --VAPAVTHLPATRRRGFGLTFTGPDLDARW--LPRQGVT---PFSAPILPGCSLMMRRA 156
Query: 289 FFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFP 340
F +GG+D GL GG E+S ++W G ++ AP V H++R +P+P
Sbjct: 157 TFDAVGGWDGGLLHRGGVDNEMSVRLWLLGYELMVAPQVVVPHLFRSASPYP 208
>UniRef50_Q5BZK1 Cluster: SJCHGC04837 protein; n=1; Schistosoma
japonicum|Rep: SJCHGC04837 protein - Schistosoma
japonicum (Blood fluke)
Length = 143
Score = 68.1 bits (159), Expect = 6e-10
Identities = 36/130 (27%), Positives = 60/130 (46%), Gaps = 9/130 (6%)
Query: 437 LCADAAHGPQLAALRLARCDVTT---NAEQHFVLTWHKDIRCKTRNMCWDLPDSSPGSPI 493
LC DA + ++L CD ++ Q F + +DIR C D P +
Sbjct: 4 LCLDATDASEYTPVKLRPCDKDNPGVSSVQKFEYNYRQDIRVLNHGTCLDEPGLKNEKTV 63
Query: 494 VLYACHLGGGNQLWRYYP------DTKQIKHGGHSNCVDHEPTTLRLHISRCDAAADTQR 547
LY+CH GGNQ W+ P + ++ HG ++ C++ + + + CD + TQR
Sbjct: 64 FLYSCHGQGGNQRWKIQPVNNDKSNPIRLIHGANAGCLEADTKNRVVSVKPCDYGSFTQR 123
Query: 548 WTLDTVDAEL 557
WT + + +L
Sbjct: 124 WTWEKLQFDL 133
>UniRef50_A3Z180 Cluster: Glycosyltransferase; n=1; Synechococcus
sp. WH 5701|Rep: Glycosyltransferase - Synechococcus sp.
WH 5701
Length = 785
Score = 64.5 bits (150), Expect = 7e-09
Identities = 69/235 (29%), Positives = 103/235 (43%), Gaps = 18/235 (7%)
Query: 100 KMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALD 159
++ + + P VSVVVP HN++ T R + L +P + E+ +VDD S+ D AL
Sbjct: 295 QLNLSAEPLVSVVVPVHNKYAVT--RRCLAALAYAPTRIPYELIVVDDGSS-DGTASALA 351
Query: 160 DYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIAL 219
+ P V+VIR G A +G + D ++ L++ TE WL LLEP
Sbjct: 352 E----EAPGVQVIRHDYSRGFNQACHSGVAASRGDFIVLLNNDTEPCCRWLEELLEPFQR 407
Query: 220 DYRTVMCPFIDVIAFDTFEYRAQDEGA---RGAFDWEFFYKRLPVLPKDEKNMPEPFESP 276
T M +I D R Q+ G W + R P P + +
Sbjct: 408 WSDTGMTG-SQLIYPDG---RLQEAGGIVWGNGEPWNYGRGRNPHHP----GVGYTRQVD 459
Query: 277 VMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGH 331
++G AI R + +LGG+ P E +L+FK+ Q G + A SRV H
Sbjct: 460 YVSGAALAIRRQLWNQLGGFSPEFSPAYYEDTDLAFKVRQAGLCVRYAALSRVIH 514
>UniRef50_Q4RPK0 Cluster: Chromosome 12 SCAF15007, whole genome
shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome 12 SCAF15007, whole genome shotgun sequence -
Tetraodon nigroviridis (Green puffer)
Length = 554
Score = 62.9 bits (146), Expect = 2e-08
Identities = 50/178 (28%), Positives = 75/178 (42%), Gaps = 21/178 (11%)
Query: 202 HTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGARGAFDW--------- 252
H + P+L + D+ ++ P ID I ++TFE + A G ++W
Sbjct: 243 HIDFRTRLAEPILTRMKEDHTRIILPAIDNIKYNTFEVQQYANAAHG-YNWGLWCMYIIP 301
Query: 253 --EFFYKRLPVLPKDEKNMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYEL 310
E+ K V + K S + + + F + G+ P L
Sbjct: 302 PQEWLDKGRGVTCLEIKKDELQTSSKKRSCARWILDLCF--QFNGFTPV------SLCVL 353
Query: 311 SFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKNYRRVAEVWMDEYAEHLY 368
++WQCGG M PC+RV HI R P+ N + +N R AEVWMDEY H+Y
Sbjct: 354 PVQVWQCGGSMEVLPCARVAHIERTKKPY-NNDIDYYAKRNALRAAEVWMDEYKSHVY 410
Score = 44.0 bits (99), Expect = 0.011
Identities = 22/57 (38%), Positives = 31/57 (54%), Gaps = 1/57 (1%)
Query: 48 GIGEQGLPAHLPKEESDLEESLYSVNGFNGALSDKIPLSRSLPDIRHSGCKKKMYIE 104
G+G+ G PA L ++ D E Y G+N LSD+I L RS+PD R Y++
Sbjct: 94 GLGQGGAPATLGEDSRDAEGK-YEEYGYNAQLSDRISLDRSIPDYRPKNAVLLSYLD 149
>UniRef50_Q01PS5 Cluster: Glycosyl transferase, family 2 precursor;
n=1; Solibacter usitatus Ellin6076|Rep: Glycosyl
transferase, family 2 precursor - Solibacter usitatus
(strain Ellin6076)
Length = 783
Score = 60.9 bits (141), Expect = 9e-08
Identities = 63/230 (27%), Positives = 99/230 (43%), Gaps = 12/230 (5%)
Query: 105 SLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEG 164
S + +VV+P N L + S++ + EI +VD+ ST G A D+L
Sbjct: 49 SCDAATVVIPNWNGR-DLLEKYIPSIVTALSSNPANEILVVDNGSTD---GSA--DFLRA 102
Query: 165 NMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTV 224
P+V V+ LP+ G AG A D+++ L+S ++L PLLE V
Sbjct: 103 AFPQVNVLALPENLGFGGGSNAGFRAAENDIVVLLNSDMRVAEDFLAPLLE--GFQDPEV 160
Query: 225 MCPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEPFESPVMAGGLFA 284
+ D + R + +G W+ L V + + + + F GG A
Sbjct: 161 FAVSCQIFFSDPAKLREETGLTQGW--WQ--DGTLRVRHRIDDQVDDLFPCFYGGGGSCA 216
Query: 285 ISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYR 334
R F ELGG+D L + E +L + W+ G ++L P S V H +R
Sbjct: 217 FDRRKFLELGGFDELLAPFYLEDTDLGYMAWKRGWKVLYQPRSIVFHEHR 266
>UniRef50_Q68VJ7 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 1; n=3;
Euteleostomi|Rep: Polypeptide
N-acetylgalactosaminyltransferase 1 - Homo sapiens
(Human)
Length = 170
Score = 59.3 bits (137), Expect = 3e-07
Identities = 49/176 (27%), Positives = 76/176 (43%), Gaps = 18/176 (10%)
Query: 380 GDLAPQKRLREKLNCKPFKWFMTQIAFDLTAKYPPVEPKPFAEGKIXXXXXXXXXXXLCA 439
GD++ + LR KL CKPF W++ I D + F+ G+I C
Sbjct: 5 GDISSRVGLRHKLQCKPFSWYLENIYPD-----SQIPRHYFSLGEIRNVETNQ-----CL 54
Query: 440 DAAHGPQLAALRLARCDVTTNAEQHFVLTWHKDIRCKTRNMCWDLPDSSPGSPIVLYACH 499
D + + + C Q F T +K+IR T ++C D+ S P+ + CH
Sbjct: 55 DNMARKENEKVGIFNCH-GMGGNQVFSYTANKEIR--TDDLCLDV--SKLNGPVTMLKCH 109
Query: 500 LGGGNQLWRYYPDTKQIKHGGHSNCVDH--EPTTLRLHISRCDAAADTQRWTLDTV 553
GNQLW Y P ++H + C+D E + I C+ + +Q+W L V
Sbjct: 110 HLKGNQLWEYDPVKLTLQHVNSNQCLDKATEEDSQVPSIRDCNGSR-SQQWLLRNV 164
>UniRef50_Q4SU00 Cluster: Chromosome undetermined SCAF14054, whole
genome shotgun sequence; n=1; Tetraodon
nigroviridis|Rep: Chromosome undetermined SCAF14054,
whole genome shotgun sequence - Tetraodon nigroviridis
(Green puffer)
Length = 253
Score = 58.8 bits (136), Expect = 4e-07
Identities = 25/62 (40%), Positives = 37/62 (59%), Gaps = 3/62 (4%)
Query: 250 FDWEFFYKRLPVLPKDEKNMPEP---FESPVMAGGLFAISRAFFWELGGYDPGLDIWGGE 306
FDW +K + P+ +P ++P++AGGLF I R++F LG YD +DIWGGE
Sbjct: 2 FDWSLHFKWEQLSPEQRARRTDPAQPIKTPIIAGGLFVIDRSWFNHLGKYDTAMDIWGGE 61
Query: 307 QY 308
+
Sbjct: 62 NF 63
>UniRef50_A1BDD3 Cluster: Glycosyl transferase, family 2; n=1;
Chlorobium phaeobacteroides DSM 266|Rep: Glycosyl
transferase, family 2 - Chlorobium phaeobacteroides
(strain DSM 266)
Length = 994
Score = 58.8 bits (136), Expect = 4e-07
Identities = 66/241 (27%), Positives = 108/241 (44%), Gaps = 15/241 (6%)
Query: 92 IRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTK 151
I H + ES P+ S+++P + + S ++ S+L P+ EI ++DD S
Sbjct: 351 IDHLAENNSFFKESKPTASIIIPVYG-NISYTIKCIESLL-NLPDSTSFEIIIIDDHSP- 407
Query: 152 DFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLP 211
D + L+ + ++K+IR G I + +GA AT + LIFL++ TE WL
Sbjct: 408 DNSYETLN-----KISQIKLIRNDCNKGFIHSCNSGASLATGEYLIFLNNDTEVLNAWLD 462
Query: 212 PLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGARGAFDWE-FFYKRLPVLPKDEKNMP 270
L+ P + + V +I D R Q+ G D Y RL K E N
Sbjct: 463 SLIAPFII-HDNVGLVGSQIIYPDG---RLQEAGGVILSDGSGLNYGRLSDPNKPEYNFL 518
Query: 271 EPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVG 330
E +G AI ++ F +GG+D E +++F + + G ++L P S+V
Sbjct: 519 R--EVDYCSGCSIAIKKSLFDSIGGFDTLFIPAYYEDTDIAFTVRKMGYKVLYQPASKVI 576
Query: 331 H 331
H
Sbjct: 577 H 577
>UniRef50_A4J8G0 Cluster: Glycosyl transferase, family 2; n=1;
Desulfotomaculum reducens MI-1|Rep: Glycosyl
transferase, family 2 - Desulfotomaculum reducens MI-1
Length = 291
Score = 58.4 bits (135), Expect = 5e-07
Identities = 52/234 (22%), Positives = 98/234 (41%), Gaps = 19/234 (8%)
Query: 106 LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGN 165
+ VS+++P NE L++ + +P + +I +V+D ST + +
Sbjct: 1 MAQVSIIIPCKNE--GKLIKQTIESILETPCRIPYDITVVNDGSTDSCCTFLNRSKIYSS 58
Query: 166 MPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVM 225
+ + + G AR GA++ ++L+F D+H +WL L E + +
Sbjct: 59 VKLINTTGI----GAANARNLGAQQCAGEILVFCDAHITVEPDWLENLSEGLLERGSGAV 114
Query: 226 CPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEPFESPVMAGGLFAI 285
P I + + G ++ + + LP + + E P+ GG A+
Sbjct: 115 SPGIANMNMN------HAIGYGMTWNKQLEARWLP-------STGDVAEVPIAPGGCVAV 161
Query: 286 SRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPF 339
R F ++GG++ G +G E E S K+W G R+ P + H +R P+
Sbjct: 162 HRDVFNDVGGFETGFRTYGFEDAEFSLKLWLFGYRVEVDPSVVIQHHFRSKHPY 215
>UniRef50_A7DQM1 Cluster: Glycosyl transferase, family 2; n=1;
Candidatus Nitrosopumilus maritimus SCM1|Rep: Glycosyl
transferase, family 2 - Candidatus Nitrosopumilus
maritimus SCM1
Length = 386
Score = 56.0 bits (129), Expect = 3e-06
Identities = 36/114 (31%), Positives = 62/114 (54%), Gaps = 8/114 (7%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
+S+++P +NE +L S+ P+++ + +VDD S D GK +DDYL+ N+ K
Sbjct: 10 ISIIIPTYNES-QNILNILKSIKENLPKNISAQAIVVDDNSP-DGTGKIVDDYLK-NLKK 66
Query: 169 V-----KVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPI 217
+ +VI + GL +A L G ++AT D ++ +DS +P L+E I
Sbjct: 67 ITNYTIEVIHRKTKDGLGSAILKGIQQATGDTIVVMDSDFSHPPQIIPKLVESI 120
>UniRef50_Q8TNV8 Cluster: Glycosyltransferase; n=1; Methanosarcina
acetivorans|Rep: Glycosyltransferase - Methanosarcina
acetivorans
Length = 350
Score = 55.6 bits (128), Expect = 3e-06
Identities = 48/191 (25%), Positives = 86/191 (45%), Gaps = 11/191 (5%)
Query: 141 EIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLD 200
EI +VD+AST A ++++ N P++KV++ K G AG E A + ++ ++
Sbjct: 37 EIIVVDNASTD-----ASEEFVRINYPEIKVVQTGKNLGYAAGNNAGFEVAEGEYIVIIN 91
Query: 201 SHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLP 260
T A+ WL L++P+ D T+ ++ + Y+ A +
Sbjct: 92 PDTVADSKWLAELIKPLENDL-TIAATTSKILMY----YQNDKINACSTINHYTGLTFCR 146
Query: 261 VLPKDEKNMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGR 320
L K E +AG FAI R + G+D ++ E +LS+++ GG+
Sbjct: 147 GLNKPACEFDNYQEVGAVAGCSFAIRRDMLKNIDGFDSEFFLY-LEDTDLSWRVRFAGGK 205
Query: 321 MLDAPCSRVGH 331
++ AP S + H
Sbjct: 206 IMYAPGSIIFH 216
>UniRef50_Q2S1Y8 Cluster: Glycosyl transferase, group 2 family
protein; n=1; Salinibacter ruber DSM 13855|Rep: Glycosyl
transferase, group 2 family protein - Salinibacter ruber
(strain DSM 13855)
Length = 391
Score = 55.2 bits (127), Expect = 4e-06
Identities = 63/259 (24%), Positives = 108/259 (41%), Gaps = 12/259 (4%)
Query: 87 RSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVD 146
RS+ + H+ + + S P VS+++ N L++T + + ++ EI L D
Sbjct: 28 RSVRSLPHAVRPQAGSLMSAPPVSIIIVTWNA--KELVQTCLPSVVAT-DYPNFEIILAD 84
Query: 147 DASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEAN 206
+AST G A ++ P+VK++R P A A+ ++ L++ E
Sbjct: 85 NASTD---GTAA--WVAREHPEVKIVRHPGNWLFCRGNNAALPHASGRFVVLLNNDVEVP 139
Query: 207 VNWLPPLLEPIA--LDYRTVMCPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRL-PVLP 263
WL PL+E A D V + FEY G + F RL +
Sbjct: 140 PGWLHPLVEAAAGRPDVAAVQPKLLQYDDRGRFEYAGGAGGFLDRAGYPFTRGRLFETME 199
Query: 264 KDEKNMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLD 323
+D +P + G + R+ E+G D ++ E+ +L +++W+ G R+
Sbjct: 200 RDRGQYDDPRDVFWATGAALLLRRSALDEVGPLDERFEM-HMEEIDLCWRLWRHGYRVRV 258
Query: 324 APCSRVGHIYRKFAPFPNP 342
AP S V HI P +P
Sbjct: 259 APESTVYHIGGASLPQSSP 277
>UniRef50_Q0S4J4 Cluster: Possible glycosyltransferase; n=2;
Corynebacterineae|Rep: Possible glycosyltransferase -
Rhodococcus sp. (strain RHA1)
Length = 436
Score = 54.4 bits (125), Expect = 8e-06
Identities = 63/234 (26%), Positives = 102/234 (43%), Gaps = 15/234 (6%)
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
PSV+VVV + +L S+L S +H E+ +VD+AS D + + + +
Sbjct: 105 PSVTVVV-CTRDRVDSLKVALESIL--SLDHSDYEVIVVDNASRTDATRRYVAELGD--- 158
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMC 226
P+V+V+ P R GL AR AG A +++ F D + +WL L+ + V C
Sbjct: 159 PRVRVVTEP-RPGLSRARNAGVRAANTEIVAFTDDDVVVDPHWLTELVAGFGAG-KAVGC 216
Query: 227 PFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMP-EPFESPVM-AGGLFA 284
++ AQ + RL L ++P PF+ V G FA
Sbjct: 217 -VCGIVPSGEIRTPAQAYFDQRVGWASSCVPRLFDLAHPPADVPLFPFQVGVYGTGANFA 275
Query: 285 ISRAFFWELGGYDPGLDIW----GGEQYELSFKIWQCGGRMLDAPCSRVGHIYR 334
+ R + LGG+D L GGE ++ F++ G ++ P + V H +R
Sbjct: 276 VDRQAVFALGGFDEALGAGAPTDGGEDLDMFFRVLHSGRQLAYRPGAVVWHRHR 329
>UniRef50_Q4TDW9 Cluster: Chromosome undetermined SCAF5986, whole
genome shotgun sequence; n=1; Tetraodon
nigroviridis|Rep: Chromosome undetermined SCAF5986,
whole genome shotgun sequence - Tetraodon nigroviridis
(Green puffer)
Length = 123
Score = 54.0 bits (124), Expect = 1e-05
Identities = 26/55 (47%), Positives = 31/55 (56%), Gaps = 1/55 (1%)
Query: 327 SRVGHIYRKFAPFPNPG-HGDFVGKNYRRVAEVWMDEYAEHLYRRRPHYRRIDPG 380
SRVGH++RK P+ PG G +N RR AEVWMDEY Y P R + G
Sbjct: 1 SRVGHVFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSARNVPYG 55
>UniRef50_Q8FSL1 Cluster: Putative uncharacterized protein; n=1;
Corynebacterium efficiens|Rep: Putative uncharacterized
protein - Corynebacterium efficiens
Length = 336
Score = 54.0 bits (124), Expect = 1e-05
Identities = 40/108 (37%), Positives = 61/108 (56%), Gaps = 7/108 (6%)
Query: 108 SVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP 167
S SV++PF N+ +T+ R S+ ++ L E+ +VDDAST+ K + +E +
Sbjct: 58 SFSVLIPFFNDS-ATIRRALDSISNQTL--LPAEVIIVDDASTEVENAKLREILIEYSHL 114
Query: 168 KVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLE 215
+ +I LP+ G TAR AG + ATAD + FLD A+ +W P LE
Sbjct: 115 QPSLIVLPENQGPSTARNAGWKNATADFVAFLD----ADDSWHPSKLE 158
>UniRef50_A1AUA8 Cluster: Glycosyl transferase, family 2; n=1;
Pelobacter propionicus DSM 2379|Rep: Glycosyl
transferase, family 2 - Pelobacter propionicus (strain
DSM 2379)
Length = 1268
Score = 53.6 bits (123), Expect = 1e-05
Identities = 34/106 (32%), Positives = 52/106 (49%), Gaps = 4/106 (3%)
Query: 108 SVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP 167
S+S+V+P HN+ T + L+ SP + EI +VDD S+ D + L L
Sbjct: 8 SLSIVIPVHNQVVYT--QACLKALHASPPDIDHEIIIVDDGSS-DSTPQVLAQ-LAATSS 63
Query: 168 KVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPL 213
+++R G A GA+ A +L+FL++ TE W PPL
Sbjct: 64 HFRIVRNDYAHGFAAACNRGADVARGHLLLFLNNDTEVQPGWFPPL 109
>UniRef50_Q5LF31 Cluster: Putative glycosyltransferase O-antigen
related protein; n=1; Bacteroides fragilis NCTC
9343|Rep: Putative glycosyltransferase O-antigen related
protein - Bacteroides fragilis (strain ATCC 25285 / NCTC
9343)
Length = 259
Score = 52.8 bits (121), Expect = 2e-05
Identities = 39/117 (33%), Positives = 63/117 (53%), Gaps = 6/117 (5%)
Query: 110 SVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKV 169
SVV+P +N +T+ R S L ++ L EI +V+D S + L +++Y N+ V
Sbjct: 8 SVVIPVYNGS-NTVTRAVDSCLKQTT--LPWEIIIVNDCSKDNTLDVLIENYQGNNI--V 62
Query: 170 KVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMC 226
K+ LPK SG+ AR G ++AT D + FLD+ + + L +L + Y+ V C
Sbjct: 63 KIYSLPKNSGVSVARNLGWDKATGDFIAFLDADDVWHPSKL-EVLNELKHGYKDVSC 118
>UniRef50_Q02W60 Cluster: Glycosyltransferase; n=2; Lactococcus
lactis subsp. cremoris|Rep: Glycosyltransferase -
Lactococcus lactis subsp. cremoris (strain SK11)
Length = 301
Score = 52.4 bits (120), Expect = 3e-05
Identities = 39/125 (31%), Positives = 64/125 (51%), Gaps = 7/125 (5%)
Query: 106 LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGN 165
+P VSV+VP +N L R SVL +S + E+ L+DD S KD G+ D+Y + +
Sbjct: 1 MPKVSVIVPVYNVE-KLLERCLNSVLNQSFQDF--ELLLIDDGS-KDSSGQICDNYAKKD 56
Query: 166 MPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEP-IALDYRTV 224
+V+V +P G AR G + ++F+DS +++L L +P + + V
Sbjct: 57 Q-RVRVWHIPN-GGQSAARNLGIDNVYGTYIVFIDSDDFVELDYLEQLYQPMVEYEADVV 114
Query: 225 MCPFI 229
C +I
Sbjct: 115 SCRYI 119
>UniRef50_Q8U2R3 Cluster: Glycosyl transferase; n=2; Pyrococcus|Rep:
Glycosyl transferase - Pyrococcus furiosus
Length = 298
Score = 52.4 bits (120), Expect = 3e-05
Identities = 41/111 (36%), Positives = 57/111 (51%), Gaps = 11/111 (9%)
Query: 105 SLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEG 164
S P+VSV++P HN L R SVL ++ E EI +VDDAS+ D ++ +
Sbjct: 2 SRPTVSVIIPTHNRA-KLLKRAIISVLNQTFEDF--EIIVVDDASS-DNTPNIIESIKDS 57
Query: 165 NMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLE 215
+++ IRL K SG TAR G ++A + LD E WLP LE
Sbjct: 58 ---RIRYIRLEKNSGAPTARNIGIKKARGKFIALLDDDDE----WLPRKLE 101
>UniRef50_Q1ARC2 Cluster: Glycosyl transferase, family 2; n=1;
Rubrobacter xylanophilus DSM 9941|Rep: Glycosyl
transferase, family 2 - Rubrobacter xylanophilus (strain
DSM 9941 / NBRC 16129)
Length = 753
Score = 52.0 bits (119), Expect = 4e-05
Identities = 64/228 (28%), Positives = 94/228 (41%), Gaps = 22/228 (9%)
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
PS +VVVP N A + P + EI VDDAS L +AL +
Sbjct: 343 PSATVVVPSRNRPRRARACAASLLALDYPRDRL-EIIFVDDASEPP-LSRAL-----AGL 395
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMC 226
P V+V+RL +R+G AR G A +V+ F D E WL L P+
Sbjct: 396 P-VRVLRLERRAGQSAARNLGLRAARGEVVAFTDDDCEVLPGWLRALAAPLC-------T 447
Query: 227 PFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEPFESPVMAGGLFAIS 286
P +++ G GAF E L + P+ + P P + A
Sbjct: 448 PGVELAGGRVLS--PPPAGRLGAF--EAARSPLDMGPEGGEVGPRG-AVPYLPSCNLAGD 502
Query: 287 RAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYR 334
R LGG+D G+++ GE +L ++ + G + P +RV H +R
Sbjct: 503 RRALLRLGGFDEGMEL--GEDADLVWRAVRAGLGVRYEPSARVVHRHR 548
>UniRef50_Q5CY12 Cluster: UDP-N-acetylgalactosamine: polypeptide N-
acetylgalactosaminyltransferase, signal peptide; n=2;
Cryptosporidium|Rep: UDP-N-acetylgalactosamine:
polypeptide N- acetylgalactosaminyltransferase, signal
peptide - Cryptosporidium parvum Iowa II
Length = 414
Score = 52.0 bits (119), Expect = 4e-05
Identities = 45/204 (22%), Positives = 96/204 (47%), Gaps = 20/204 (9%)
Query: 126 TAYSVLYRSPEHLIKEIFLVDDA---STKDFLGKALDDYLEGNMPKVKVIRLPKRSGLIT 182
T S+L + +L+ EI ++ + S KD + +L+ P +K+I + L
Sbjct: 75 TINSILLTANRNLLHEIIIISNDCRDSGKDIKSHLGEKFLD--KPLIKIIETELQE-LGE 131
Query: 183 ARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQ 242
+ GA +T ++++F+ S T NW+ P++ ++ +Y++++ P + D + +
Sbjct: 132 LQNLGANNSTGEIILFVPSATLFPKNWMSPIMRSLSDNYKSIIVPRFKKLNKDKWTFSNN 191
Query: 243 DE--GARGAFDWEFFYKRLPVLPKDEKNMPEPFESPVMAGGLFAISRAFFWELGGY-DPG 299
D + F EF + L D K P+ +FAI+++++ + DP
Sbjct: 192 DPVYSPKMMFTKEFELTNIHTL--DNK-------VPMFYSKIFAITKSWWLNISKLSDPT 242
Query: 300 LDI--WGGEQYELSFKIWQCGGRM 321
+++ +++S + W CGGR+
Sbjct: 243 INLIFKTSINFDISLRSWNCGGRV 266
>UniRef50_Q1MNP5 Cluster: Predicted glycosyltransferases; n=1;
Lawsonia intracellularis PHE/MN1-00|Rep: Predicted
glycosyltransferases - Lawsonia intracellularis (strain
PHE/MN1-00)
Length = 353
Score = 51.6 bits (118), Expect = 6e-05
Identities = 34/117 (29%), Positives = 57/117 (48%), Gaps = 6/117 (5%)
Query: 101 MYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDD 160
M S P +S++V +N+H S + S+L +P + EI LVD+ ST ++
Sbjct: 1 MTYSSSPEISIIVLTYNQHASYTIPCIESLLQTTPNNC--EIILVDNGSTDG----TVEW 54
Query: 161 YLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPI 217
+ + P + VI K G G ++A + +I L++ T WL PL+EP+
Sbjct: 55 MQQLSYPHLTVIDNKKNLGYAAGNNVGLKKAKGEFIILLNNDTLLPPGWLEPLIEPL 111
>UniRef50_A0Z0H0 Cluster: Glycosyl transferase, family 2:Glycosyl
transferase, group 1; n=1; marine gamma proteobacterium
HTCC2080|Rep: Glycosyl transferase, family 2:Glycosyl
transferase, group 1 - marine gamma proteobacterium
HTCC2080
Length = 1185
Score = 51.6 bits (118), Expect = 6e-05
Identities = 56/228 (24%), Positives = 96/228 (42%), Gaps = 21/228 (9%)
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
P VS+V+P +NE W R +S+L + E+ + DD ST + + A Y++G
Sbjct: 548 PVVSIVIPVYNE-WVITHRCLWSILQHTEGEY--EVIVADDCSTDETVNIAT--YVQG-- 600
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMC 226
V V+R + + A +A + ++ L++ T WLPPLL + ++
Sbjct: 601 --VHVVRHAENQRFLRNCNIAAAQARGEFILLLNNDTAVTAGWLPPLLALFENESVGIVG 658
Query: 227 PFIDVIAFDTFEYRAQDEGA---RGAFDWEFFYKRLPVLPKDEKNMPEPFESPVMAGGLF 283
P + F + Q+ G A W Y R + + N E+ ++G
Sbjct: 659 P---KLLFP--NGKLQEAGGIIWNDASGWN--YGRADEPSRPQYNYVR--ETDYVSGAAL 709
Query: 284 AISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGH 331
I ++ + + G+D E +L F Q G ++L P S V H
Sbjct: 710 MIRQSLWQAVDGFDEAFVPAYYEDTDLCFTARQAGFKVLYQPASTVVH 757
>UniRef50_A1ALI9 Cluster: Glycosyl transferase, family 2; n=1;
Pelobacter propionicus DSM 2379|Rep: Glycosyl
transferase, family 2 - Pelobacter propionicus (strain
DSM 2379)
Length = 525
Score = 50.8 bits (116), Expect = 1e-04
Identities = 61/226 (26%), Positives = 90/226 (39%), Gaps = 16/226 (7%)
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
P S+++P +N+ + L R + S E+ L+D+ S D+ +YL
Sbjct: 251 PKASIIIPLYNQ--AHLTRMCVEAIKASAPITSYELVLIDNGS-HDWT----PEYLASLG 303
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMC 226
V VI + G A GA A D+L+FL++ T WL L+ A+D
Sbjct: 304 TSVTVITNRENRGFAVACNQGARAARGDILVFLNNDTVPEQGWLDELV--AAIDNGEAEI 361
Query: 227 PFIDVIAFDTFEYRAQDEGARGAFDWE-FFYKRLPVLPKDEKNMPEPFESPVMAGGLFAI 285
++ D R Q G AFD Y D + E + A+
Sbjct: 362 CGARLLYPDG---RCQHAGV--AFDERGLGYHIFAGFQGDSAPVRERRLMQAVTAACMAM 416
Query: 286 SRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGH 331
+ F ELGG+D G G E +L + Q G R+L P S V H
Sbjct: 417 RKGLFHELGGFDEGFRN-GFEDIDLCLRAGQRGHRILFVPESVVIH 461
>UniRef50_Q54Q41 Cluster: Putative glycosyltransferase; n=1;
Dictyostelium discoideum AX4|Rep: Putative
glycosyltransferase - Dictyostelium discoideum AX4
Length = 1065
Score = 50.8 bits (116), Expect = 1e-04
Identities = 49/187 (26%), Positives = 87/187 (46%), Gaps = 8/187 (4%)
Query: 15 NKAVNKENHFVPKSLKKIDWHNYEQISSERKRVGIGEQGLPAHLPKEESDLEESLYSVNG 74
NK +N +N + + I++ N ++ + G GE+ +E+ D E+ + +N
Sbjct: 159 NKIINNKNDIINQYC--INFLNNNNNNNNNNK-GEGEEKKEE---EEDDDDEKCSFDLNS 212
Query: 75 FNGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRS 134
+ + + I +R + S ++++ + P VSV+VPF N S+L R
Sbjct: 213 PDYSCTP-ISKNRVINFNYSSNYEERVVNWNKPIVSVIVPFLNTDPQQFNECMESILIRQ 271
Query: 135 PEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATAD 194
LI E+ L+DD ST D K+ ++L+ + +VI P GL +R G + D
Sbjct: 272 SFKLI-EVILIDDGSTNDKSIKSYREWLKREPIRFRVIVNPFNLGLPHSRNIGVYNSRGD 330
Query: 195 VLIFLDS 201
+ F DS
Sbjct: 331 YIFFFDS 337
>UniRef50_Q0VR09 Cluster: Glycosyl transferase, group 2 family
protein, putative; n=1; Alcanivorax borkumensis SK2|Rep:
Glycosyl transferase, group 2 family protein, putative -
Alcanivorax borkumensis (strain SK2 / ATCC 700651 / DSM
11573)
Length = 274
Score = 50.4 bits (115), Expect = 1e-04
Identities = 38/104 (36%), Positives = 51/104 (49%), Gaps = 5/104 (4%)
Query: 101 MYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDD 160
M + +P VS++ P +N TL RT SVL S L E+ LVDD ST D A
Sbjct: 1 MDVNMIPEVSIITPCYNAR-HTLARTIESVL--SQNFLSWELLLVDDKSTDDTDQHAQAF 57
Query: 161 YLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTE 204
N ++K+IRL + SG AR G A + F+D+ E
Sbjct: 58 CDRDN--RIKLIRLKENSGAAVARNRGIASARGRYIAFIDADDE 99
>UniRef50_A5D4N5 Cluster: Hypothetical glycosyltransferase; n=1;
Pelotomaculum thermopropionicum SI|Rep: Hypothetical
glycosyltransferase - Pelotomaculum thermopropionicum SI
Length = 425
Score = 50.4 bits (115), Expect = 1e-04
Identities = 66/232 (28%), Positives = 94/232 (40%), Gaps = 24/232 (10%)
Query: 104 ESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLE 163
E LP VSVVVP N L+ S+L EI +VDDAS LE
Sbjct: 11 ELLPFVSVVVPTRNRQ-DVLVSCVESLLDVDYPADKLEIIVVDDASDPPVA-------LE 62
Query: 164 GNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRT 223
G +VK IRL + G AR A+ A +++ F+D A WL L+ YR
Sbjct: 63 GYRRRVKSIRLDENRGPGAARNEAAKLARGEIIAFIDDDCLAGRGWLKDLVP--CFQYR- 119
Query: 224 VMCPFIDVIAFDTFEYRAQDEGARGAFD-WEFFYKRLPVLPKDEKNMPEPFESPVMAGGL 282
DV A + G + +E L + K S + A L
Sbjct: 120 ------DVAAAGGM---VEPAGLTRLLEKYERCQSPLSMGKVQRKAAKNTAVSYLPACNL 170
Query: 283 FAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYR 334
+ + F GG+DP L + GE +L +++ + GG++ P V H +R
Sbjct: 171 L-VRKDLFAAAGGFDPALQV--GEDVDLCWRLLEKGGQIYYIPGGAVYHRHR 219
>UniRef50_Q2JDU9 Cluster: Glycosyl transferase, family 2; n=3;
Frankia|Rep: Glycosyl transferase, family 2 - Frankia
sp. (strain CcI3)
Length = 320
Score = 50.0 bits (114), Expect = 2e-04
Identities = 39/108 (36%), Positives = 56/108 (51%), Gaps = 7/108 (6%)
Query: 109 VSVVVPFHNEHWST--LLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
VSVV+P +NE L SV++R P E+ +VDD ST A+ + L ++
Sbjct: 6 VSVVIPAYNEAMRLPGSLPPLISVMHRIPG---AEVIVVDDGSTDGTA--AIAEELLADL 60
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLL 214
P +V+RLP SG A G A + + FLD+ ++VN LP LL
Sbjct: 61 PGGRVLRLPWNSGKGAAVRMGVSAAHGESIAFLDADGASDVNDLPLLL 108
>UniRef50_Q4JYX6 Cluster: Putative glycosyl transferase; n=4;
Streptococcus pneumoniae|Rep: Putative glycosyl
transferase - Streptococcus pneumoniae
Length = 369
Score = 50.0 bits (114), Expect = 2e-04
Identities = 37/101 (36%), Positives = 49/101 (48%), Gaps = 5/101 (4%)
Query: 100 KMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALD 159
+M + P SV+VP + S L + S+LY S + EI LVDD D GK LD
Sbjct: 10 EMLEKESPLFSVIVPTYKISESNLRQCIDSLLYDSQD---VEIILVDDNKNLDICGKVLD 66
Query: 160 DYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLD 200
+Y E + VI + G+ AR AG A LIF+D
Sbjct: 67 EYAE-KYNNISVIH-QENQGVSVARNAGMSIAVGKYLIFVD 105
>UniRef50_Q1PVM1 Cluster: Similar to family 2 glycosyltransferase
SpsQ; n=1; Candidatus Kuenenia stuttgartiensis|Rep:
Similar to family 2 glycosyltransferase SpsQ -
Candidatus Kuenenia stuttgartiensis
Length = 324
Score = 50.0 bits (114), Expect = 2e-04
Identities = 55/226 (24%), Positives = 101/226 (44%), Gaps = 28/226 (12%)
Query: 108 SVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP 167
S+SV++P +N T+ + +++ ++ E+ LVDD S KD G+ Y
Sbjct: 4 SISVIIPTYNAE-KTIGQCLHALKQQNYPSASYEVILVDDGS-KDATGEIARTY------ 55
Query: 168 KVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCP 227
+K +R + SG TAR GA A ++++F DS +W+ + P + ++VM
Sbjct: 56 DIKYLR-QENSGPATARNKGAIVAKGEIILFTDSDCVPEPDWIREMAAP--FNDKSVMAV 112
Query: 228 FIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEK-NMPEPFESPVMAGGLFAIS 286
YR + + F F +R +L K +M + + +
Sbjct: 113 --------KGAYRNRQKSIVARFAQLEFEERFEMLKKAASIDMVDTYSA--------GFR 156
Query: 287 RAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHI 332
+ F +LGG+D + E ELS+++ + G +M+ P + V H+
Sbjct: 157 KDAFLQLGGFDTSFPVANNEDTELSYRMSKLGFKMVFNPNAIVFHL 202
>UniRef50_A5ZFA1 Cluster: Putative uncharacterized protein; n=1;
Bacteroides caccae ATCC 43185|Rep: Putative
uncharacterized protein - Bacteroides caccae ATCC 43185
Length = 322
Score = 50.0 bits (114), Expect = 2e-04
Identities = 33/96 (34%), Positives = 52/96 (54%), Gaps = 6/96 (6%)
Query: 106 LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGN 165
+P++S+++P +N L + S+L ++ + EI L+DD ST DF GK D+Y
Sbjct: 1 MPTLSIIIPVYNSE-KYLKQCLDSILAQASDDF--EILLIDDGST-DFSGKLCDEYAS-R 55
Query: 166 MPKVKVIRLPKRSGLITARLAGAERATADVLIFLDS 201
+ V K G+ AR G ERA + +IF+DS
Sbjct: 56 YNNIYVFH-EKNRGVSAARNKGIERAQGEYVIFVDS 90
>UniRef50_Q8U0I3 Cluster: Dolichol-phosphate mannose synthase; n=4;
Thermococcaceae|Rep: Dolichol-phosphate mannose synthase
- Pyrococcus furiosus
Length = 215
Score = 50.0 bits (114), Expect = 2e-04
Identities = 35/109 (32%), Positives = 56/109 (51%), Gaps = 13/109 (11%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
VSV++P +NE + +VL R P+ + E+ ++DD S+ D + Y +
Sbjct: 7 VSVIIPAYNEE-----KRIGNVLARIPD-FVDEVIVIDDGSS-DATYEVAKRYTD----- 54
Query: 169 VKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPI 217
K IRL K G A G A+ D+++F+D+ + N +P LLEPI
Sbjct: 55 -KAIRLNKNMGKGAALREGLRHASGDIIVFMDADGQHNPKEIPKLLEPI 102
>UniRef50_Q4K1A7 Cluster: Putative glycosyl transferase; n=1;
Streptococcus pneumoniae|Rep: Putative glycosyl
transferase - Streptococcus pneumoniae
Length = 648
Score = 49.2 bits (112), Expect = 3e-04
Identities = 34/122 (27%), Positives = 64/122 (52%), Gaps = 10/122 (8%)
Query: 109 VSVVVPFHN--EHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
VS++VP +N E+ + L T Y++ E + LVDD ST D + +++YL +
Sbjct: 7 VSIIVPIYNQEEYLNISLPTLKQQTYKNLEFI-----LVDDGST-DSSKQIINEYLNADS 60
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMC 226
+ K + + GL++A +AG + A+ D L FLD + ++++ ++ + DY +
Sbjct: 61 -RFKYV-YKENGGLVSATIAGVKEASGDYLAFLDPDDKLGLDYIENFIKELDCDYDFIAA 118
Query: 227 PF 228
F
Sbjct: 119 GF 120
>UniRef50_A5FFS1 Cluster: Glycosyl transferase, family 2; n=15;
Bacteroidetes|Rep: Glycosyl transferase, family 2 -
Flavobacterium johnsoniae UW101
Length = 339
Score = 49.2 bits (112), Expect = 3e-04
Identities = 39/159 (24%), Positives = 68/159 (42%), Gaps = 8/159 (5%)
Query: 142 IFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDS 201
I++ D+AST A ++++ N P VK+I+ G + A++ ++S
Sbjct: 33 IYVADNASTD-----ASVNFVKENYPSVKIIQNTGNYGFAKGYNDALQNVEAEIYALVNS 87
Query: 202 HTEANVNWLPPLLEPIALDYRT--VMCPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRL 259
E NWL P++E + +T + +D + FEY G + + F R+
Sbjct: 88 DIEVTENWLKPIIETFDKEKQTAIIQPKILDFKNKEYFEYAGAAGGFIDKYGFPFCRGRI 147
Query: 260 -PVLPKDEKNMPEPFESPVMAGGLFAISRAFFWELGGYD 297
L KD + E +G F I + + EL G+D
Sbjct: 148 FETLEKDNGQYNDDCELFWASGACFFIRKEVYHELKGFD 186
>UniRef50_A7BZS1 Cluster: Glycosyl transferase, family 2; n=1;
Beggiatoa sp. PS|Rep: Glycosyl transferase, family 2 -
Beggiatoa sp. PS
Length = 311
Score = 48.8 bits (111), Expect = 4e-04
Identities = 61/212 (28%), Positives = 94/212 (44%), Gaps = 31/212 (14%)
Query: 110 SVVVPFHNEHWSTL--LRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP 167
S+V+P +N L L T Y Y H E+ + DD ST D + +++ N P
Sbjct: 15 SIVIPTYNRKVDVLNLLATIYQQTY---SHY--EVIVCDDCSTDDSV-----PHIKLNYP 64
Query: 168 KVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCP 227
VK+I+L SG AR G + A +V+I LDS L P + +A + P
Sbjct: 65 DVKIIQLDNNSGPAVARNKGIKSAMGEVIIGLDSDV------LLPDNDFLAQVNQLFHIP 118
Query: 228 FIDVIAFDTFEYRAQ-DEGARGAFDWEFFYKRLPVLPKDEKNMPEPFESPVMAGGLFAIS 286
+ AF Y ++ D+ R W + LP+ EK +PF + +G +A
Sbjct: 119 QLFGAAFRIQNYYSERDDSGR----W---WHPLPI----EKFRDKPFWTDYFSGTGYAFR 167
Query: 287 RAFFWELGGYDPGLDIWGGEQYELSFKIWQCG 318
R F E GY P E+ +L+F++ + G
Sbjct: 168 REVF-EKAGYYPEDLFMDMEENDLAFRLLEAG 198
>UniRef50_Q04RA2 Cluster: Glycosyltransferase; n=4; Leptospira|Rep:
Glycosyltransferase - Leptospira borgpetersenii serovar
Hardjo-bovis (strain JB197)
Length = 298
Score = 48.4 bits (110), Expect = 5e-04
Identities = 36/108 (33%), Positives = 57/108 (52%), Gaps = 9/108 (8%)
Query: 108 SVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP 167
+VSVV+P N +++ SVL ++ + KEI +VDD ST + K + + +
Sbjct: 3 TVSVVIPTFNRE-DKVIKAISSVLAQTFQP--KEIIVVDDGSTDSTIFK-IQETFPNSFD 58
Query: 168 KVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLE 215
++++ L + G+ AR G E+A D + FLDS E WLP LE
Sbjct: 59 GIRILSL-EHKGVSHARNRGVEKAAGDWIAFLDSDDE----WLPEKLE 101
>UniRef50_A4F5Y3 Cluster: Glycosyl transferase; n=1;
Saccharopolyspora erythraea NRRL 2338|Rep: Glycosyl
transferase - Saccharopolyspora erythraea (strain NRRL
23338)
Length = 1083
Score = 48.4 bits (110), Expect = 5e-04
Identities = 39/116 (33%), Positives = 54/116 (46%), Gaps = 8/116 (6%)
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
P VSV+VP H + W R L + E+ +VDDAS D K EG
Sbjct: 440 PVVSVIVPVHGK-WP-YTRQCLRFLAGHLVSVPFEVIVVDDASPDDSAAKLAA--CEG-- 493
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYR 222
V+++R + G I A GAE A + L FL++ E +WL L+E + D R
Sbjct: 494 --VRLVRAERNLGFIGACNLGAEHARGEHLFFLNNDAEVTESWLDILVETMESDER 547
>UniRef50_Q82D87 Cluster: Putative glycosyltransferase; n=5;
Streptomyces|Rep: Putative glycosyltransferase -
Streptomyces avermitilis
Length = 778
Score = 48.0 bits (109), Expect = 7e-04
Identities = 44/130 (33%), Positives = 65/130 (50%), Gaps = 11/130 (8%)
Query: 106 LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGN 165
LP SV+VP + + L SVL +S L E+ VDD S D G +D++ +
Sbjct: 10 LPRFSVIVPAYKVQ-AYLHECLESVLEQSFPDL--ELIAVDDCSP-DACGAIIDEFAARD 65
Query: 166 MPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVM 225
+V+ + LP+ +GL AR A AT D LIFLD A+ + P L+ +A +
Sbjct: 66 T-RVRPVHLPQNTGLGRARNAALAEATGDYLIFLD----ADDSLTPGALQAVADRIKETS 120
Query: 226 CPFIDVIAFD 235
P DV+ +D
Sbjct: 121 EP--DVLVYD 128
>UniRef50_A6PMY9 Cluster: Glycosyl transferase, family 2; n=1;
Victivallis vadensis ATCC BAA-548|Rep: Glycosyl
transferase, family 2 - Victivallis vadensis ATCC
BAA-548
Length = 757
Score = 48.0 bits (109), Expect = 7e-04
Identities = 32/98 (32%), Positives = 55/98 (56%), Gaps = 5/98 (5%)
Query: 104 ESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLE 163
++ P VS+++P +N L R SV ++ + EI +V+DAS D + L +Y
Sbjct: 6 QTKPRVSILIPVYNTA-QYLPRCLASVQNQTLHEI--EIIVVNDASP-DNAAEVLAEYAA 61
Query: 164 GNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDS 201
+ +++++ K G++ ARL+G AT D LIFLD+
Sbjct: 62 KDS-RIRIVTHEKNGGILAARLSGIAAATGDYLIFLDA 98
>UniRef50_A1SDW3 Cluster: Glycosyl transferase, family 2; n=1;
Nocardioides sp. JS614|Rep: Glycosyl transferase, family
2 - Nocardioides sp. (strain BAA-499 / JS614)
Length = 370
Score = 48.0 bits (109), Expect = 7e-04
Identities = 38/95 (40%), Positives = 51/95 (53%), Gaps = 6/95 (6%)
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
P VSVVVP ++ + L SVL +S +L E+ +VDD S D G D+Y G
Sbjct: 8 PLVSVVVPVYDVA-AYLPEALDSVLAQSHRNL--EVVVVDDGSP-DASGVIADEYA-GRD 62
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDS 201
P+++V+ R GL AR G AT D+L F DS
Sbjct: 63 PRIRVVHTANR-GLGAARNEGIRHATGDLLAFADS 96
>UniRef50_Q8TRJ1 Cluster: Glycosyltransferase group 2 family
protein; n=2; Methanomicrobia|Rep: Glycosyltransferase
group 2 family protein - Methanosarcina acetivorans
Length = 314
Score = 48.0 bits (109), Expect = 7e-04
Identities = 39/120 (32%), Positives = 59/120 (49%), Gaps = 6/120 (5%)
Query: 109 VSVVVPFHNEHWST--LLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
+SVV+P +NE + R S L P + EI VDD S KD K L + L N
Sbjct: 5 LSVVIPAYNEEENIEPCYREITSAL--EPLGINYEIIFVDDGS-KDSTFKELQN-LSKND 60
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMC 226
K+KVI+ K G A AG + A +++ +D+ + + +P LLE + + V+C
Sbjct: 61 NKLKVIKFRKNFGQSAALRAGLDHAAGRIIVTMDADLQNDPKDIPKLLEKLEKEDLDVVC 120
>UniRef50_Q027Q8 Cluster: Glycosyl transferase, family 2; n=1;
Solibacter usitatus Ellin6076|Rep: Glycosyl transferase,
family 2 - Solibacter usitatus (strain Ellin6076)
Length = 325
Score = 47.6 bits (108), Expect = 9e-04
Identities = 38/113 (33%), Positives = 52/113 (46%), Gaps = 2/113 (1%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
+S+V+P HNE S L P EI VDDAS D + L + +E +
Sbjct: 4 LSIVIPIHNEEPSILPLYDRLTAVLEPLQRPYEILFVDDASA-DRSFELLANLVETD-GH 61
Query: 169 VKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDY 221
+KVIRL + G A AG A D++I +D + +P LLE I Y
Sbjct: 62 LKVIRLRRNFGQTAALSAGFHEAKGDIVIAMDGDLQHAPEDIPALLEKIDQGY 114
>UniRef50_Q4K1D1 Cluster: Putative glycosyl transferase; n=1;
Streptococcus pneumoniae|Rep: Putative glycosyl
transferase - Streptococcus pneumoniae
Length = 651
Score = 47.2 bits (107), Expect = 0.001
Identities = 35/120 (29%), Positives = 60/120 (50%), Gaps = 6/120 (5%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
VSVV+P +N+ L ++ +SVL +S ++L EI V+D S D + ++++ N
Sbjct: 7 VSVVIPIYNQE-IYLRKSIHSVLTQSYQNL--EIIAVNDGSI-DSSQQIMEEF--ANQDS 60
Query: 169 VKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPF 228
+ GL+ A + G E+AT D ++FLD N++ + + DY V F
Sbjct: 61 RIFLYEKSNGGLVDATIYGVEKATGDYIVFLDPDDYIGENYIQKFVSYLDHDYDFVAMGF 120
>UniRef50_Q1N7I9 Cluster: Predicted glycosyltransferase; n=1;
Sphingomonas sp. SKA58|Rep: Predicted glycosyltransferase
- Sphingomonas sp. SKA58
Length = 1454
Score = 47.2 bits (107), Expect = 0.001
Identities = 35/121 (28%), Positives = 66/121 (54%), Gaps = 10/121 (8%)
Query: 109 VSVVVPFHNEHWSTLLRTAY-SVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP 167
VS+++P N + ++ A SVL +S H E+ ++DD ST D L AL+ Y + P
Sbjct: 1003 VSIIMPVLNR--AAIIGDAIASVLAQSHAHF--ELIIIDDGSTDD-LPSALERYPD---P 1054
Query: 168 KVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCP 227
++ +I + + AR G +RA+ +++ +LDS + ++L ++ +A D+R C
Sbjct: 1055 RILLISNARSERVSAARNRGLDRASGEIIAYLDSDNSWDPDYLAVMVNALA-DHRGHACA 1113
Query: 228 F 228
+
Sbjct: 1114 Y 1114
>UniRef50_A6NTT8 Cluster: Putative uncharacterized protein; n=1;
Bacteroides capillosus ATCC 29799|Rep: Putative
uncharacterized protein - Bacteroides capillosus ATCC
29799
Length = 331
Score = 47.2 bits (107), Expect = 0.001
Identities = 36/93 (38%), Positives = 51/93 (54%), Gaps = 6/93 (6%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
+SV++P + + L ++ SVL S HL E+ LVDD S D G+ D Y + +
Sbjct: 5 ISVIIPIYKVE-AFLDKSIQSVL--SQTHLNLELILVDDGSP-DRCGEICDSYARQDK-R 59
Query: 169 VKVIRLPKRSGLITARLAGAERATADVLIFLDS 201
V VI R GL AR AG + AT D ++FLD+
Sbjct: 60 VTVIHQTNR-GLSGARNAGLKAATGDYIVFLDA 91
>UniRef50_UPI0000ECF595 Cluster: UPI0000ECF595 related cluster; n=1;
unknown|Rep: UPI0000ECF595 UniRef100 entry - unknown
Length = 338
Score = 46.8 bits (106), Expect = 0.002
Identities = 38/92 (41%), Positives = 46/92 (50%), Gaps = 6/92 (6%)
Query: 110 SVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKV 169
S++VP +N L SVL + E+ LVDD S D GK D+Y G P +
Sbjct: 4 SIIVPIYNVE-KYLRCCVDSVL--AENFADYEMILVDDGSP-DGCGKICDEYA-GKYPHI 58
Query: 170 KVIRLPKRSGLITARLAGAERATADVLIFLDS 201
KVI + GL AR AG A D LIFLDS
Sbjct: 59 KVIH-QENGGLSDARNAGIRAAKGDYLIFLDS 89
>UniRef50_Q8A2D5 Cluster: Putative glycosyltransferase yibD; n=1;
Bacteroides thetaiotaomicron|Rep: Putative
glycosyltransferase yibD - Bacteroides thetaiotaomicron
Length = 329
Score = 46.8 bits (106), Expect = 0.002
Identities = 39/110 (35%), Positives = 60/110 (54%), Gaps = 6/110 (5%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
+SV+VP HN + L + S+ +S E++ EI LVD+ ST D + D+Y + +
Sbjct: 7 ISVIVPVHNTV-NYLRKCIESIRNQSLENI--EIILVDNLST-DGSSEVCDEYASMDT-R 61
Query: 169 VKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIA 218
VKVI L + + AR AG + A+A + F+DS +VN LL I+
Sbjct: 62 VKVIHLSVANASV-ARNAGIDMASAPYIGFIDSDDYIDVNMYEELLAAIS 110
>UniRef50_Q2CER4 Cluster: Glycosyltransferase; n=1; Oceanicola
granulosus HTCC2516|Rep: Glycosyltransferase -
Oceanicola granulosus HTCC2516
Length = 276
Score = 46.8 bits (106), Expect = 0.002
Identities = 39/105 (37%), Positives = 54/105 (51%), Gaps = 12/105 (11%)
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
P VSVV+P N L S L +S + E+ +VDDAST+D +G AL +
Sbjct: 4 PRVSVVIPTCNRS-GFLAEAVASALGQSVA--VDEVIVVDDASTED-IGPALARF----G 55
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLP 211
+V++ RL +RSG AR G E A +++ FLD WLP
Sbjct: 56 ARVRLERLAERSGANVARNRGVELARGELVAFLDDDDV----WLP 96
>UniRef50_A7LVT4 Cluster: Putative uncharacterized protein; n=1;
Bacteroides ovatus ATCC 8483|Rep: Putative
uncharacterized protein - Bacteroides ovatus ATCC 8483
Length = 338
Score = 46.8 bits (106), Expect = 0.002
Identities = 33/104 (31%), Positives = 58/104 (55%), Gaps = 6/104 (5%)
Query: 98 KKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKA 157
K + + P VSV++P +N H +++ +T SV+ ++ +L EI +V+D ST D L
Sbjct: 11 KNDIMEKEYPLVSVIIPVYNAH-NSITKTLQSVINQTYTNL--EIVVVNDGSTDDSLD-I 66
Query: 158 LDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDS 201
+ Y + P++ V + GL+ AR +G + AT + +LDS
Sbjct: 67 IKTYAAED-PRIVVFN-KQNEGLVQARKSGIDIATGKYIQYLDS 108
>UniRef50_A6XVK9 Cluster: Glycosyl transferase, group 2 family
protein; n=1; Vibrio cholerae AM-19226|Rep: Glycosyl
transferase, group 2 family protein - Vibrio cholerae
AM-19226
Length = 634
Score = 46.8 bits (106), Expect = 0.002
Identities = 33/103 (32%), Positives = 54/103 (52%), Gaps = 7/103 (6%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
VSV++P ++ W L S+L +S E E+ L+ D S ++ + +D+Y + K
Sbjct: 74 VSVIIPIYDRTWE-LKEAIQSILDQSYEKT--ELILICDGSPQETIN-IVDEYQSND--K 127
Query: 169 VKVIRLPKRSG-LITARLAGAERATADVLIFLDSHTEANVNWL 210
VK+ + P SG + R G + A + + FLDS A+VN L
Sbjct: 128 VKIFKYPTSSGNAVRGRNKGIKEANGEFIAFLDSDDIADVNRL 170
>UniRef50_A5UT62 Cluster: Glycosyl transferase, family 2; n=5;
Chloroflexi (class)|Rep: Glycosyl transferase, family 2
- Roseiflexus sp. RS-1
Length = 312
Score = 46.8 bits (106), Expect = 0.002
Identities = 33/106 (31%), Positives = 53/106 (50%), Gaps = 8/106 (7%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
+ +++P +N + LL T L R+ + +VDD ST D + Y P+
Sbjct: 2 IDIIIPNYNG--AALLPTCLDSL-RAQTRRDFCVVVVDDGSTDDSVALVRRRY-----PE 53
Query: 169 VKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLL 214
V+VI LP+ GL A AG E + ++ L++ TEA+ WL L+
Sbjct: 54 VQVIALPRNRGLAAAVNAGIEATGGEYVVLLNNDTEAHPRWLEHLI 99
>UniRef50_A4AYH1 Cluster: Dolichyl-phosphate mannose synthase
related protein; n=1; Alteromonas macleodii 'Deep
ecotype'|Rep: Dolichyl-phosphate mannose synthase
related protein - Alteromonas macleodii 'Deep ecotype'
Length = 309
Score = 46.8 bits (106), Expect = 0.002
Identities = 33/111 (29%), Positives = 53/111 (47%), Gaps = 6/111 (5%)
Query: 107 PSVSVVVPFHNE--HWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEG 164
P +SV+VP +N+ + + ++ Y S + EI +VD+ ST L+ +
Sbjct: 12 PLISVIVPVYNDLIRIGQCIDSLVALSYPSSNY---EIIIVDNCSTDGTYDYLLEKLNQA 68
Query: 165 NMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLE 215
P V++I+ G AR AG + A D + F DS + NWL L+E
Sbjct: 69 YQPNVRLIKC-STPGSYAARNAGLKIAKGDFVAFTDSDCLVSENWLTNLIE 118
>UniRef50_A3Y6T7 Cluster: Putative uncharacterized protein; n=1;
Marinomonas sp. MED121|Rep: Putative uncharacterized
protein - Marinomonas sp. MED121
Length = 704
Score = 46.8 bits (106), Expect = 0.002
Identities = 34/102 (33%), Positives = 52/102 (50%), Gaps = 12/102 (11%)
Query: 455 CDVTTNAEQHFVLTWHKDIRCKTRNMCWDLPDSS--PGSPIVLYACHLGGGNQLWRYYPD 512
CD+T NA Q FVL + DIR +C D+P S G +++Y+CH G NQ W PD
Sbjct: 596 CDIT-NANQTFVLGDNNDIR--VNGICLDVPRSEAFAGQSVIVYSCHY-GDNQSWDLNPD 651
Query: 513 -TKQIKHG---GHSNCVDHEPTT--LRLHISRCDAAADTQRW 548
T Q G + C++ T + + + CD++ Q++
Sbjct: 652 GTIQSALNAVDGTNLCLEANDATSDIAITLQACDSSNPAQKF 693
>UniRef50_Q9AK48 Cluster: Putative glycosyltransferase; n=2;
Streptomyces|Rep: Putative glycosyltransferase -
Streptomyces coelicolor
Length = 326
Score = 46.4 bits (105), Expect = 0.002
Identities = 40/128 (31%), Positives = 58/128 (45%), Gaps = 17/128 (13%)
Query: 106 LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDAS---TKDFLGKALDDYL 162
+P +SV+VPF+N T S+ + + E LVDD S T D L +A
Sbjct: 1 MPKLSVIVPFYNVR-QYAPDTLRSLRANARDDF--EFILVDDCSSDGTADLLARA----- 52
Query: 163 EGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPP-LLEPIALDY 221
E +P ++R + GL TAR G +RA + L FLD +WL P E +
Sbjct: 53 ESELPGAVLVRHERNGGLATARNTGIDRARGEYLTFLDGD-----DWLAPGYYERLLAAI 107
Query: 222 RTVMCPFI 229
+ C F+
Sbjct: 108 EELGCDFV 115
>UniRef50_Q8XN57 Cluster: Beta-1,4-galactosyltransferase; n=1;
Clostridium perfringens|Rep:
Beta-1,4-galactosyltransferase - Clostridium perfringens
Length = 342
Score = 46.4 bits (105), Expect = 0.002
Identities = 37/96 (38%), Positives = 52/96 (54%), Gaps = 6/96 (6%)
Query: 106 LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGN 165
LP VS++VP + L + S++ +S ++L EI LVDD ST D G+ +D Y E
Sbjct: 2 LPKVSLIVPIYKVQ-DYLNKCIDSIINQSYKNL--EIILVDDGST-DRCGEIIDLY-EKQ 56
Query: 166 MPKVKVIRLPKRSGLITARLAGAERATADVLIFLDS 201
++KV K GL AR G T D +IF+DS
Sbjct: 57 DNRIKVFH-KKNGGLSDARNYGMNFITGDYVIFVDS 91
>UniRef50_Q1BYJ1 Cluster: Methyltransferase FkbM; n=2; Burkholderia
cenocepacia|Rep: Methyltransferase FkbM - Burkholderia
cenocepacia (strain AU 1054)
Length = 1644
Score = 46.4 bits (105), Expect = 0.002
Identities = 60/234 (25%), Positives = 94/234 (40%), Gaps = 17/234 (7%)
Query: 103 IESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYL 162
IE P VS+V+P N+ LLR L + ++ KE+ +VD S D A + L
Sbjct: 701 IERAPLVSIVIPTKNK--PELLRMCLEGLLHATDYPNKEVVIVDTGSD-DPETLAYYEQL 757
Query: 163 EGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTE-ANVNWLPPLLEPIALDY 221
+ P+V+++ + A GA RA + L+FL++ E +WL L+
Sbjct: 758 KSE-PQVRIVHFRNKFNYSAACNFGAVRARGEFLLFLNNDIEIIKSDWLQELV------- 809
Query: 222 RTVMCPFIDVIAFDTF--EYRAQDEGAR-GAFDWEFFYKRLPVLPKDEKNMPE-PFESPV 277
R M P + V+ Q G G Y+ + D P+ P
Sbjct: 810 RFAMRPGVGVVGTKLIYPSLELQHAGVSIGIHLAALMYRSGGGIEWDVFGSPDHPRNWLA 869
Query: 278 MAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGH 331
+ G + R F E+GG+D I L +IW+ G R P + + H
Sbjct: 870 IMGACQMVRRDAFEEVGGFDESYLI-AMSDVALCMRIWRAGYRTAYTPYACLVH 922
>UniRef50_A6ELC1 Cluster: TuaG; n=1; unidentified eubacterium
SCB49|Rep: TuaG - unidentified eubacterium SCB49
Length = 269
Score = 46.4 bits (105), Expect = 0.002
Identities = 35/106 (33%), Positives = 52/106 (49%), Gaps = 5/106 (4%)
Query: 96 GCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLG 155
G KK + + S P VS++ P +N + +T SVL ++ + E+ +VDDAST + L
Sbjct: 5 GKKKHINMVSKPLVSIITPLYNAA-PFIEKTIQSVLDQTYSNW--ELLIVDDASTDNGLQ 61
Query: 156 KALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDS 201
D + GN P++K G R AT D + FLDS
Sbjct: 62 IVRDFF--GNDPRIKFYPNTINKGAAHCRNQATSEATGDYIAFLDS 105
>UniRef50_A1GD35 Cluster: Ricin B lectin precursor; n=2;
Salinispora|Rep: Ricin B lectin precursor - Salinispora
arenicola CNS205
Length = 522
Score = 46.4 bits (105), Expect = 0.002
Identities = 30/76 (39%), Positives = 40/76 (52%), Gaps = 7/76 (9%)
Query: 437 LCADAAHGPQL--AALRLARCDVTTNAEQHFVLTWHKDIRCKTRNMCWDLPD--SSPGSP 492
LC D A G + A ++LA C + N Q +VLT D+ N C D+ D S+ G+
Sbjct: 448 LCMDVAWGSRENGAVVQLAHC--SGNPAQQWVLTGANDLVNPQANKCLDVKDWNSADGAR 505
Query: 493 IVLYACHLGGGNQLWR 508
+ Y CH GG NQ WR
Sbjct: 506 LQTYECH-GGANQKWR 520
>UniRef50_Q4JZ78 Cluster: Putative glycosyl transferase; n=1;
Streptococcus pneumoniae|Rep: Putative glycosyl
transferase - Streptococcus pneumoniae
Length = 321
Score = 46.0 bits (104), Expect = 0.003
Identities = 38/115 (33%), Positives = 63/115 (54%), Gaps = 9/115 (7%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
VS+VVP +N L S++ +S + + EI LVDD ST D G+ D Y + + +
Sbjct: 4 VSIVVPVYNVE-DYLKYCIDSLINQSYKDI--EIILVDDGSTDDS-GRICDKYAQED-DR 58
Query: 169 VKVIRLPKRSGLITARLAGAERATADVLIFLDS---HTEANVNWLPPLLEPIALD 220
++V+ + + GL AR G + A+A+ +IF+DS + V +L L + A+D
Sbjct: 59 IRVLHI-ENGGLSNARNTGVKFASAEWIIFIDSDDYYDRRTVEYLVQLQKKYAVD 112
>UniRef50_A7B4B8 Cluster: Putative uncharacterized protein; n=1;
Ruminococcus gnavus ATCC 29149|Rep: Putative
uncharacterized protein - Ruminococcus gnavus ATCC 29149
Length = 336
Score = 46.0 bits (104), Expect = 0.003
Identities = 34/93 (36%), Positives = 49/93 (52%), Gaps = 6/93 (6%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
VS++VP +N + T L R I E+ LVDD ST D G+ D Y E + +
Sbjct: 6 VSIIVPVYNVE--PYIETCIQSLIRQTMGNI-EVILVDDGST-DRSGELCDQYAEAD-ER 60
Query: 169 VKVIRLPKRSGLITARLAGAERATADVLIFLDS 201
++VI + GL +AR AG A + L+F+DS
Sbjct: 61 IRVIH-KQNGGLSSARNAGISAAKGEYLLFVDS 92
>UniRef50_A6M2B0 Cluster: Glycosyl transferase, family 2; n=1;
Clostridium beijerinckii NCIMB 8052|Rep: Glycosyl
transferase, family 2 - Clostridium beijerinckii NCIMB
8052
Length = 322
Score = 46.0 bits (104), Expect = 0.003
Identities = 35/93 (37%), Positives = 55/93 (59%), Gaps = 6/93 (6%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
+SV+VP +N L + YS+L ++ ++L EI LVDD S D GK D++ + + +
Sbjct: 8 ISVIVPVYNVE-KYLPQCIYSILNQTEKNL--EIILVDDGSL-DNSGKICDEFSKKD-DR 62
Query: 169 VKVIRLPKRSGLITARLAGAERATADVLIFLDS 201
+ VI K +GL +AR AG E A + + F+DS
Sbjct: 63 IVVIH-KKNNGLSSARNAGLEIAKGNYIGFVDS 94
>UniRef50_A3W6W1 Cluster: Glycosyltransferase; n=2; Roseovarius|Rep:
Glycosyltransferase - Roseovarius sp. 217
Length = 438
Score = 46.0 bits (104), Expect = 0.003
Identities = 36/116 (31%), Positives = 60/116 (51%), Gaps = 6/116 (5%)
Query: 87 RSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVD 146
R +PD+ S K + E P+V+V++P N T+L S L +S + E+ ++D
Sbjct: 191 RDMPDLLRSEALKLLG-EKKPNVTVILPTWNRA-HTVLSAVASALLQS--YTPTEVIVID 246
Query: 147 DASTKDFLGKALDDYLEGNMPKVKVIRLPKRS-GLITARLAGAERATADVLIFLDS 201
D ST DF + L + + +++ + + S G+ AR AG RA D + +LDS
Sbjct: 247 DGST-DFTVELLRQRFPEPIREGRLVLMTQESSGVSAARNAGLARAQGDFIAYLDS 301
>UniRef50_A3S9H6 Cluster: Putative uncharacterized protein; n=2;
Sulfitobacter|Rep: Putative uncharacterized protein -
Sulfitobacter sp. EE-36
Length = 305
Score = 46.0 bits (104), Expect = 0.003
Identities = 37/119 (31%), Positives = 58/119 (48%), Gaps = 10/119 (8%)
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
P ++VV+P N+H L + L + + E+ +VD+ S + G ++DY
Sbjct: 8 PGITVVIPHLNQH--NALARCLAALDAGTQKPV-EVIVVDNGSARSPQG-VVNDY----- 58
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVM 225
P V +I P G AR GA RA DVL FLD+ + +WL + +A D T++
Sbjct: 59 PWVSLIHQPI-PGPGPARNLGAARARGDVLAFLDADCLPDAHWLAHIAAHMARDPHTIL 116
>UniRef50_A1ASN8 Cluster: Glycosyl transferase, family 2; n=1;
Pelobacter propionicus DSM 2379|Rep: Glycosyl
transferase, family 2 - Pelobacter propionicus (strain
DSM 2379)
Length = 340
Score = 46.0 bits (104), Expect = 0.003
Identities = 35/104 (33%), Positives = 57/104 (54%), Gaps = 6/104 (5%)
Query: 113 VPFHNEHWSTLLRTAY-SVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKVKV 171
V +N ++S+ L A SVL +S L EI +VDD ST ++Y GN P++K+
Sbjct: 5 VLINNYNYSSFLSKAVNSVLEQSV--LPDEIIVVDDGSTDGSCELLQNNY--GNNPRIKI 60
Query: 172 IRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLE 215
I K G ++A G ++ D+L FLDS E + +++ +L+
Sbjct: 61 IS-SKNRGQLSAMNLGFIHSSGDILFFLDSDDEYDPHYIESVLD 103
>UniRef50_P0C0H0 Cluster: Hyaluronan synthase; n=17; Streptococcus
pyogenes|Rep: Hyaluronan synthase - Streptococcus
pyogenes
Length = 419
Score = 46.0 bits (104), Expect = 0.003
Identities = 35/110 (31%), Positives = 57/110 (51%), Gaps = 5/110 (4%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP- 167
V+ V+P +NE +LL T SVL ++ + + EI++VDD S+ + +++Y+ +
Sbjct: 66 VAAVIPSYNEDAESLLETLKSVLAQT--YPLSEIYIVDDGSSNTDAIQLIEEYVNREVDI 123
Query: 168 --KVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLE 215
V V R G A+ ER+ ADV + +DS T N L LL+
Sbjct: 124 CRNVIVHRSLVNKGKRHAQAWAFERSDADVFLTVDSDTYIYPNALEELLK 173
>UniRef50_Q8YNR8 Cluster: Alr4493 protein; n=3; Nostocaceae|Rep:
Alr4493 protein - Anabaena sp. (strain PCC 7120)
Length = 295
Score = 45.6 bits (103), Expect = 0.004
Identities = 52/200 (26%), Positives = 88/200 (44%), Gaps = 18/200 (9%)
Query: 141 EIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLD 200
EI LVD+AS + D+ P VK++RLPK G AGA+ A L+ L+
Sbjct: 39 EILLVDNASQDNSYELVAQDF-----PDVKIVRLPKNYGFGAGNNAGAKIAKGKFLLLLN 93
Query: 201 SHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYR-AQDEGARGAFDWEFFYKRL 259
+ T N LP L++ + + + V +TF+ A G +G E+ ++L
Sbjct: 94 TDTILTTNILPHLIDLMRENPDVGIIGTKLVFPDETFQISFAYTIGIKG----EYKSRKL 149
Query: 260 PVLPKDE---KNMPEPF----ESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSF 312
+D+ ++ + F E ++ G I F +GG+D I+ E +L
Sbjct: 150 HEYAEDKSKLNSLEQEFNTIKEVDIVVGAALFIRADLFHSVGGFDEKFFIY-FEDADLCR 208
Query: 313 KIWQCGGRMLDAPCSRVGHI 332
++ G ++L P + HI
Sbjct: 209 RVQNQGYKILYTPQVSLIHI 228
>UniRef50_Q8A2D9 Cluster: Putative glycosyltransferase yibD; n=1;
Bacteroides thetaiotaomicron|Rep: Putative
glycosyltransferase yibD - Bacteroides thetaiotaomicron
Length = 336
Score = 45.6 bits (103), Expect = 0.004
Identities = 36/97 (37%), Positives = 56/97 (57%), Gaps = 10/97 (10%)
Query: 107 PSVSVVVPFHN--EHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEG 164
P +S++VP +N ++ T L+ S+L ++ ++L EI LVDD S G D+Y
Sbjct: 8 PLISLIVPVYNVKDYLKTCLQ---SILEQTYKNL--EIILVDDGSDDGSSG-ICDEYARM 61
Query: 165 NMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDS 201
+ ++K I LP SG+ AR AG AT ++L F+DS
Sbjct: 62 DQ-RIKTIHLP-HSGVSAARNAGLAAATGELLGFVDS 96
>UniRef50_Q748H9 Cluster: Glycosyl transferase, group 1/2 family
protein; n=1; Geobacter sulfurreducens|Rep: Glycosyl
transferase, group 1/2 family protein - Geobacter
sulfurreducens
Length = 2401
Score = 45.6 bits (103), Expect = 0.004
Identities = 60/254 (23%), Positives = 99/254 (38%), Gaps = 20/254 (7%)
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
P VS++VP +N+ T + ++ + + L E+ LVD+AS+ G A +YL
Sbjct: 1671 PKVSIIVPLYNKAEYTK-QCLEALALNTEQALNYEVILVDNASSD---GTA--EYLRTLS 1724
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMC 226
V ++ K G A G A L+FL++ T + WL L++ D
Sbjct: 1725 GDVTIVTNLKNLGFAKACNQGGRIARGRYLVFLNNDTIPHPGWLDGLIKGAEQD------ 1778
Query: 227 PFIDVIAFDTFEYRAQDEGARGAFDWEFF-YKRLPVLPKDEKNMPEPFESPVMAGGLFAI 285
D++ + + A AF+ + Y P D + + +
Sbjct: 1779 -GADIVGARLLYPNGRVQHAGVAFNEQSIGYHIFNGFPADSPAVNRKRFMQCVTAACMLV 1837
Query: 286 SRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHG 345
+ F ELGG+D G + G E + + + G R+L P S + H + G
Sbjct: 1838 KQELFAELGGFDEGY-VNGFEDVDFCLRAGERGRRILYTPESVLIHFEE-----TSEGRK 1891
Query: 346 DFVGKNYRRVAEVW 359
D N RR W
Sbjct: 1892 DHDTPNIRRFLARW 1905
>UniRef50_Q2IYC9 Cluster: Glycosyl transferase, family 2; n=1;
Rhodopseudomonas palustris HaA2|Rep: Glycosyl
transferase, family 2 - Rhodopseudomonas palustris
(strain HaA2)
Length = 303
Score = 45.6 bits (103), Expect = 0.004
Identities = 19/47 (40%), Positives = 29/47 (61%), Gaps = 1/47 (2%)
Query: 279 AGGLFAISRAFFWELGGYDPGLDI-WGGEQYELSFKIWQCGGRMLDA 324
+G A+SRA F + GGYD + WG E EL +++W+ G R ++A
Sbjct: 202 SGSGVALSRALFMQSGGYDEAFGLRWGAEAIELGYRLWRGGARFVEA 248
>UniRef50_A0L7J6 Cluster: Glycosyl transferase, family 2; n=4;
Bacteria|Rep: Glycosyl transferase, family 2 -
Magnetococcus sp. (strain MC-1)
Length = 244
Score = 45.6 bits (103), Expect = 0.004
Identities = 31/114 (27%), Positives = 54/114 (47%), Gaps = 3/114 (2%)
Query: 103 IESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYL 162
+ L S++V++P +NE +TL VL + L E+ +V+D S + + D L
Sbjct: 5 MRELTSLTVIMPCYNEE-NTLEEVVSQVLQANILGLQLELIIVNDGSVDG--SRQIMDQL 61
Query: 163 EGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEP 216
P ++ + G A G + AT D+++ D+ E + N P LL+P
Sbjct: 62 AAQHPAIRALHHDVNQGKGAALSTGFKAATGDLVLIQDADLEYSPNEYPSLLQP 115
>UniRef50_Q97Q65 Cluster: Glycosyl transferase, family 2; n=13;
Streptococcus pneumoniae|Rep: Glycosyl transferase,
family 2 - Streptococcus pneumoniae
Length = 328
Score = 45.2 bits (102), Expect = 0.005
Identities = 41/128 (32%), Positives = 65/128 (50%), Gaps = 7/128 (5%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
+SV+VP +N L ++ S+ ++ ++L EI LVDD +T D G+ D E + +
Sbjct: 6 ISVIVPVYNVA-QYLEKSIASIQKQTYQNL--EIILVDDGAT-DESGRLCDSIAEQD-DR 60
Query: 169 VKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTV-MCP 227
V V+ K GL AR G ++A D LIF+DS + + L E + + V C
Sbjct: 61 VSVLH-KKNEGLSQARNDGMKQAHGDYLIFIDSDDYIHPEMIQSLYEQLVQEDADVSSCG 119
Query: 228 FIDVIAFD 235
++V A D
Sbjct: 120 VMNVYAND 127
>UniRef50_Q8PGL8 Cluster: Truncated O-antigen biosynthesis protein;
n=2; Xanthomonas|Rep: Truncated O-antigen biosynthesis
protein - Xanthomonas axonopodis pv. citri
Length = 614
Score = 45.2 bits (102), Expect = 0.005
Identities = 57/226 (25%), Positives = 93/226 (41%), Gaps = 9/226 (3%)
Query: 102 YIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDY 161
Y + P S+++P + LLR + ++ EI +VD+ S + LD
Sbjct: 336 YAQHAPKASIIIPTRDG--LDLLRPCIESIQGRTDYTNFEIIVVDNGSKDPATLEWLDAM 393
Query: 162 LEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTE-ANVNWLPPLLEPIALD 220
N P +V++ A A +VL+FL++ TE A+ +WL L+E AL
Sbjct: 394 --SNEPWFRVLQADVPFNWSALNNLAAASAEGEVLVFLNNDTEVASADWLQRLVEN-ALR 450
Query: 221 YRTVMCPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPE-PFESPVMA 279
C + + A + ++ G G D F + P+ + P E +
Sbjct: 451 PDVGACGALLLYADGSIQHAGVVVGMGGWAD-HVFKGQQPIHHQHLFVSPILRREVLAVT 509
Query: 280 GGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAP 325
G AISRA F +GG+D + G + EL + G R + P
Sbjct: 510 GACVAISRANFDAIGGFDESFVVCGSD-VELCLRAMANGKRTVYVP 554
Score = 36.7 bits (81), Expect = 1.7
Identities = 34/140 (24%), Positives = 60/140 (42%), Gaps = 3/140 (2%)
Query: 76 NGALSDKIPLSRSLPDIRHSGCKKKMYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSP 135
+G + P+S + HS + +SV++P N L SV R+
Sbjct: 54 SGGSASPPPVSLDAYESWHSALPQLEVPADSGMISVIIPVCNTPELFLREAIESV--RTQ 111
Query: 136 EHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADV 195
+ EI + DDAS + +G+ L ++ + +P ++V R RSG+ A A
Sbjct: 112 TYQNWEICIHDDASNEPHVGRILAEFKK-RIPNLRVTRSETRSGIAATTNAALASARGRW 170
Query: 196 LIFLDSHTEANVNWLPPLLE 215
+ FLD + N L +++
Sbjct: 171 VTFLDHDDVLDRNALAAVVD 190
>UniRef50_Q311W2 Cluster: Glycosyl transferase, group 2 family
protein; n=3; Desulfovibrio|Rep: Glycosyl transferase,
group 2 family protein - Desulfovibrio desulfuricans
(strain G20)
Length = 427
Score = 45.2 bits (102), Expect = 0.005
Identities = 60/243 (24%), Positives = 94/243 (38%), Gaps = 22/243 (9%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKA--LDDYLEGNM 166
+S+++P N+ W S+ +P E+ + D+ S + + L + L G M
Sbjct: 14 LSIIIPVWNQ-WHLTRACLESLRQHTPGDFF-EVIVADNGSGDETAVQLAPLGEKLFGRM 71
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRT-VM 225
+ IRL G A GA+ A + L+FL++ T WLPPL++ D R
Sbjct: 72 --FRRIRLDTNQGFGPACNLGAKSARGEKLLFLNNDTLLTSGWLPPLMKAFDEDARLGAA 129
Query: 226 CPFIDVIAFDTFEYRAQDEGA--RGAFDWEFFYKRLPVLPKDEKNMPEPFESPVMAGGLF 283
P + D R Q G A + Y P D + + G
Sbjct: 130 GPLLLYPESD----RVQHAGIVFTPALRTQHLYANFPA---DHPVLRTRRTLQAITGACL 182
Query: 284 AISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPG 343
+ F + GG+ G G E EL +I + G ++ SRV H+ + PG
Sbjct: 183 LVPSGLFRQCGGFYEGYKN-GSEDLELCCRIREAGKKLRCVTESRVYHLESQ-----TPG 236
Query: 344 HGD 346
GD
Sbjct: 237 RGD 239
>UniRef50_Q4IU61 Cluster: Glycosyl transferase, family 2; n=1;
Azotobacter vinelandii AvOP|Rep: Glycosyl transferase,
family 2 - Azotobacter vinelandii AvOP
Length = 1182
Score = 45.2 bits (102), Expect = 0.005
Identities = 34/108 (31%), Positives = 54/108 (50%), Gaps = 5/108 (4%)
Query: 103 IESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYL 162
++ P S+VVP +N S LL S + R+ + E+ LVDDAS + +AL
Sbjct: 651 LDITPLFSIVVPAYNTS-SELLDAVLSSV-RAQWYPHWELILVDDASPSEETRRAL---A 705
Query: 163 EGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWL 210
E + PK++V+ L G+ A G A + ++F+D E V+ L
Sbjct: 706 EIDDPKIRVLHLESNKGISGATNVGLAAAQGEFIVFMDHDDELTVDCL 753
>UniRef50_A6L5V6 Cluster: Glycosyltransferase family 2; n=1;
Bacteroides vulgatus ATCC 8482|Rep: Glycosyltransferase
family 2 - Bacteroides vulgatus (strain ATCC 8482 / DSM
1447 / NCTC 11154)
Length = 344
Score = 45.2 bits (102), Expect = 0.005
Identities = 46/126 (36%), Positives = 61/126 (48%), Gaps = 12/126 (9%)
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEG-- 164
P VS++VP +N L R S+L ++ + L EI LVDD S + AL D E
Sbjct: 4 PKVSIIVPVYNVE-KYLDRCMQSLLGQTLKEL--EIILVDDGSPDN--SPALCDNYESAS 58
Query: 165 ---NMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPL-LEPIALD 220
+ P +KVI GL AR +G E AT + + FLDS +VN L L+ A D
Sbjct: 59 DNLHWPIIKVIH-KINEGLGFARNSGLEIATGEYIAFLDSDDFVDVNMYEQLYLKAKAKD 117
Query: 221 YRTVMC 226
V C
Sbjct: 118 IDVVYC 123
>UniRef50_P75905 Cluster: Biofilm PGA synthesis
N-glycosyltransferase pgaC; n=35; Proteobacteria|Rep:
Biofilm PGA synthesis N-glycosyltransferase pgaC -
Escherichia coli (strain K12)
Length = 441
Score = 45.2 bits (102), Expect = 0.005
Identities = 30/116 (25%), Positives = 59/116 (50%), Gaps = 5/116 (4%)
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
PS+S+++P NE + T ++ L + E++ E+ V+D ST +A+ D + +
Sbjct: 75 PSISIIIPCFNEE-KNVEETIHAALAQRYENI--EVIAVNDGSTDK--TRAILDRMAAQI 129
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYR 222
P ++VI L + G A GA A ++ L+ +D + + ++EP+ + R
Sbjct: 130 PHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPR 185
>UniRef50_Q8FPM5 Cluster: Putative uncharacterized protein; n=1;
Corynebacterium efficiens|Rep: Putative uncharacterized
protein - Corynebacterium efficiens
Length = 679
Score = 44.8 bits (101), Expect = 0.006
Identities = 55/248 (22%), Positives = 96/248 (38%), Gaps = 27/248 (10%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
VSVV+P++N+ S LR + + + EI + DD ST + ++P
Sbjct: 319 VSVVIPYYNDLDS--LRRVVAGVENNGHGSDVEIIIADDGSTTA-------PEVTTSLPV 369
Query: 169 VKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPF 228
V + AR G A +V++FLD T +L + + D R C
Sbjct: 370 TVVRQDDLGFRAAAARNLGVRSARHEVVVFLDGDTVPCPGYLTAMSRWVTADPR---CVV 426
Query: 229 IDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEPFESPVMAGGLFAISRA 288
+ D E + ++ W + D + + + + A S+
Sbjct: 427 VGTRLQDGVEPQWLNDA------WGY---------TDHLRLADETSFRFIISSVLATSKT 471
Query: 289 FFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDFV 348
F ++GG+D + +GGE +EL +++W G L P + H+ +A P
Sbjct: 472 MFDKVGGFDETMVGYGGEDWELGWRLWNAGAIFLHDPEAIADHLEPDWAARGKPEEMKLA 531
Query: 349 GKNYRRVA 356
KN +A
Sbjct: 532 EKNAETIA 539
>UniRef50_A7AD36 Cluster: Putative uncharacterized protein; n=2;
Bacteroidales|Rep: Putative uncharacterized protein -
Parabacteroides merdae ATCC 43184
Length = 354
Score = 44.8 bits (101), Expect = 0.006
Identities = 47/196 (23%), Positives = 79/196 (40%), Gaps = 11/196 (5%)
Query: 141 EIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLD 200
++ + D+ ST D + + L+ P V +IRL K G + + + L+
Sbjct: 50 DVIVADNGSTDDSI-----EMLKTKFPTVGIIRLDKNYGFAEGYNQALKHIGHEYTVLLN 104
Query: 201 SHTEANVNWLPPLLEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGARGAFD-WEFFYKR- 258
S E WL + + D +T+ + A EY A G D + + Y R
Sbjct: 105 SDVEVTPGWLDAPIAALDAD-KTIAGVQPKIRAQRNKEYFEYAGAAGGYMDRYGYPYCRG 163
Query: 259 --LPVLPKDEKNMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQ 316
L V+ KD P + G + A + E+GG D G E+ +L +++
Sbjct: 164 RVLHVVEKDAGQYDTPADLLWATGACLFVRTATYKEVGGLDAGF-FAHQEEIDLCWRLRS 222
Query: 317 CGGRMLDAPCSRVGHI 332
G R++ P S V H+
Sbjct: 223 RGYRLVCTPSSVVYHV 238
>UniRef50_A6PMZ0 Cluster: Glycosyl transferase, family 2; n=1;
Victivallis vadensis ATCC BAA-548|Rep: Glycosyl
transferase, family 2 - Victivallis vadensis ATCC
BAA-548
Length = 646
Score = 44.8 bits (101), Expect = 0.006
Identities = 33/109 (30%), Positives = 59/109 (54%), Gaps = 5/109 (4%)
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
P V++++P +N L R SVL ++ ++L E+ V+DAS D + L DY +
Sbjct: 8 PDVTILIPVYNTA-RFLPRALDSVLAQTWKNL--EVVAVNDASP-DNAAEVLADYAARD- 62
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLE 215
P+V+++ P+ G + ARL+G A L+ LD+ + + + L+E
Sbjct: 63 PRVRIVTCPENGGTLKARLSGLATARGRYLMNLDADDTFDPDCVKTLVE 111
>UniRef50_A3W9Y6 Cluster: Glucosyltransferase; n=1; Erythrobacter
sp. NAP1|Rep: Glucosyltransferase - Erythrobacter sp.
NAP1
Length = 310
Score = 44.8 bits (101), Expect = 0.006
Identities = 33/97 (34%), Positives = 51/97 (52%), Gaps = 6/97 (6%)
Query: 105 SLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEG 164
S P SVV+P +N +TL T SVL ++ + EI +VDD S+ L LD L
Sbjct: 3 SAPKFSVVIPAYNAG-ATLRSTVVSVLAQTDQDF--EIVVVDDGSSDGTLNAMLD--LAV 57
Query: 165 NMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDS 201
+++++ P SG+ R GA A +++ FLD+
Sbjct: 58 KDERIRIVSQPN-SGVSATRNYGASLAKGELIAFLDA 93
>UniRef50_Q8UCW1 Cluster: UDP-hexose transferase; n=1; Agrobacterium
tumefaciens str. C58|Rep: UDP-hexose transferase -
Agrobacterium tumefaciens (strain C58 / ATCC 33970)
Length = 316
Score = 44.4 bits (100), Expect = 0.008
Identities = 32/102 (31%), Positives = 56/102 (54%), Gaps = 8/102 (7%)
Query: 101 MYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDD 160
M ++SL SVS+V+P N ++ SVL + I+E+ +VDD ST D + ++
Sbjct: 9 MELKSLVSVSIVIPVRNGE-RYIVEAIESVLLQG--ETIREVLVVDDGST-DATAQKVEG 64
Query: 161 YLEGNMPKVKVIRLPK-RSGLITARLAGAERATADVLIFLDS 201
+ + P+VK++ P+ R G+ R G +A + +FLD+
Sbjct: 65 FSD---PRVKLLARPQGRQGVSAVRNFGLSQARGEWTMFLDA 103
>UniRef50_Q64Q34 Cluster: Putative glycosyltransferase; n=1;
Bacteroides fragilis|Rep: Putative glycosyltransferase -
Bacteroides fragilis
Length = 350
Score = 44.4 bits (100), Expect = 0.008
Identities = 30/96 (31%), Positives = 50/96 (52%), Gaps = 6/96 (6%)
Query: 106 LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGN 165
+P +SV++P + L + SVL ++ + + EI L+DD S D G+ D Y
Sbjct: 1 MPKISVIIPIYKSE-KFLRKCVDSVLSQTFKDI--EILLIDDGSP-DLSGEICDQYAS-K 55
Query: 166 MPKVKVIRLPKRSGLITARLAGAERATADVLIFLDS 201
+VK +P SG+ AR G +++ + L F+DS
Sbjct: 56 YSRVKAFHIPN-SGVSAARNFGISKSSGEYLCFIDS 90
>UniRef50_Q1IPB3 Cluster: Glycosyl transferase, family 2; n=2;
Bacteria|Rep: Glycosyl transferase, family 2 -
Acidobacteria bacterium (strain Ellin345)
Length = 325
Score = 44.4 bits (100), Expect = 0.008
Identities = 34/113 (30%), Positives = 55/113 (48%), Gaps = 4/113 (3%)
Query: 106 LPSVSVVVPFHNEHWS-TLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEG 164
+P S+VVPFHNE + T L V+ + + +F VDD S +D K L +
Sbjct: 1 MPKYSIVVPFHNEEENVTELYDRLKVVMETVGDTFELVF-VDDGS-RDCTFKLLQQ-IAA 57
Query: 165 NMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPI 217
+V V++L + G +A AG A D +I +D + + N +P +E +
Sbjct: 58 VDSRVVVVKLRRNFGQTSALAAGFHNAQGDYVIAMDGDLQHDPNDIPLFVEKV 110
>UniRef50_A0W5F3 Cluster: Glycosyl transferase, family 2; n=1;
Geobacter lovleyi SZ|Rep: Glycosyl transferase, family 2
- Geobacter lovleyi SZ
Length = 1435
Score = 44.4 bits (100), Expect = 0.008
Identities = 32/109 (29%), Positives = 53/109 (48%), Gaps = 5/109 (4%)
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
P++S+++P HN+ W L R S L P L EI +VDD S+ + + AL +
Sbjct: 18 PALSIIIPVHNQ-WH-LTRACISSLLCFPPALPFEIIVVDDGSSDETV-VALSQLYPAEL 74
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEAN-VNWLPPLL 214
++++I A GA A +L+F+++ EA W PL+
Sbjct: 75 -RIRLISNQAPHSFARACNCGAREARGTLLLFMNNDIEAQAAGWCEPLV 122
>UniRef50_Q1MS29 Cluster: Predicted glycosyltransferases; n=1;
Lawsonia intracellularis PHE/MN1-00|Rep: Predicted
glycosyltransferases - Lawsonia intracellularis (strain
PHE/MN1-00)
Length = 418
Score = 44.0 bits (99), Expect = 0.011
Identities = 51/225 (22%), Positives = 95/225 (42%), Gaps = 13/225 (5%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKD--FLGKALDDYLEGNM 166
VS+++P +NE W L +T L + ++ ++++++ S+ + L + L G+
Sbjct: 4 VSIIIPVYNE-WQ-LTKTCLISLAKYTNNV--HVYVINNGSSDETNLACPTLGNILFGS- 58
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMC 226
I A GA A D ++FL++ TE + WLPPL+E ALD +
Sbjct: 59 -NFHYISFDTNHNFGPACNTGAYAANTDYILFLNNDTEVSYGWLPPLVE--ALDSDPYLA 115
Query: 227 PFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEPFESPVMAGGLFAIS 286
++ + + Q G D + +P + + + ++ +
Sbjct: 116 GVGSLLIYP--DSTVQHLGVTITPDGNYVNHLYNSIPATIPLVHKKRKFKIITAACLLLH 173
Query: 287 RAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGH 331
R F ++GG+ G G E EL ++ + G M+ P S V H
Sbjct: 174 RKNFIDIGGFFNGYKN-GLEDVELCLRLTEQGKYMMCIPESVVIH 217
>UniRef50_A6W918 Cluster: Glycosyl transferase family 2; n=1;
Kineococcus radiotolerans SRS30216|Rep: Glycosyl
transferase family 2 - Kineococcus radiotolerans
SRS30216
Length = 345
Score = 44.0 bits (99), Expect = 0.011
Identities = 35/100 (35%), Positives = 51/100 (51%), Gaps = 9/100 (9%)
Query: 107 PSVSVVVPFHN--EHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEG 164
P VSVV+ H+ H + R ++ P+ ++ LVDDAST D G LDD+
Sbjct: 4 PRVSVVLAVHDGAPHLAAARRQIEALT--DPD---LQLLLVDDAST-DGSGATLDDWARE 57
Query: 165 NMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTE 204
+ P+ ++R P R+G+ AR E ATA + F D E
Sbjct: 58 D-PRATLLRNPVRTGVAAARNRALEHATAPYVWFTDCDDE 96
>UniRef50_A5ZFA7 Cluster: Putative uncharacterized protein; n=1;
Bacteroides caccae ATCC 43185|Rep: Putative
uncharacterized protein - Bacteroides caccae ATCC 43185
Length = 328
Score = 44.0 bits (99), Expect = 0.011
Identities = 39/113 (34%), Positives = 60/113 (53%), Gaps = 6/113 (5%)
Query: 103 IESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYL 162
+ES VSV++P HN L + SV +S + + EI LV++ S D + D+Y
Sbjct: 1 MESNYLVSVIIPVHNTA-PYLHKCVESVRNQSLKEI--EIILVENMSI-DNSAEICDEYA 56
Query: 163 EGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLE 215
+ + ++KVI LP +GL AR AG + ATA + F+DS + L+E
Sbjct: 57 KIDS-RIKVIHLPI-AGLCIARNAGIDAATAPYIGFVDSDDYIGTDMFKDLVE 107
>UniRef50_A5D3A4 Cluster: Predicted glycosyltransferases; n=1;
Pelotomaculum thermopropionicum SI|Rep: Predicted
glycosyltransferases - Pelotomaculum thermopropionicum
SI
Length = 343
Score = 44.0 bits (99), Expect = 0.011
Identities = 36/122 (29%), Positives = 58/122 (47%), Gaps = 8/122 (6%)
Query: 105 SLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEG 164
S P SVV+P N + LL+T + L R EI +VD+ S+ G A + ++
Sbjct: 2 SAPLASVVIP--NWNGRMLLQTCLASLRRQTCRDF-EIIVVDNGSSD---GSA--EMIQA 53
Query: 165 NMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTV 224
P+V++IRL + G A G A + L++ TEA+ WL L++ + +
Sbjct: 54 CYPEVRLIRLSRNEGFSRAINRGIRAAAGKYIALLNNDTEADPKWLGELIKALEQNPEAG 113
Query: 225 MC 226
C
Sbjct: 114 FC 115
>UniRef50_A0V2D1 Cluster: Glycosyl transferase, family 2; n=1;
Clostridium cellulolyticum H10|Rep: Glycosyl
transferase, family 2 - Clostridium cellulolyticum H10
Length = 333
Score = 44.0 bits (99), Expect = 0.011
Identities = 33/99 (33%), Positives = 52/99 (52%), Gaps = 10/99 (10%)
Query: 103 IESLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYL 162
+ S P VSV++P +N + TL + +++ ++ + EI VDD ST + + A
Sbjct: 1 MNSKPLVSVIIPNYN-YEKTLPKCFDTLMNQTYKDF--EIIFVDDGSTDNSIEIAK---- 53
Query: 163 EGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDS 201
P K+ + PK G+ AR G E A+ D+L FLDS
Sbjct: 54 --KYP-CKIFKTPKNGGVAAARNLGVEYASGDILFFLDS 89
>UniRef50_O32268 Cluster: Putative teichuronic acid biosynthesis
glycosyltransferase tuaG; n=9; Firmicutes|Rep: Putative
teichuronic acid biosynthesis glycosyltransferase tuaG -
Bacillus subtilis
Length = 252
Score = 44.0 bits (99), Expect = 0.011
Identities = 38/109 (34%), Positives = 52/109 (47%), Gaps = 5/109 (4%)
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
P VSV+ P +N + T +SVL +S H E+ +VDD ST D L Y E
Sbjct: 6 PLVSVITPSYNAR-DYIEDTVHSVLDQSHPHW--EMIIVDDCST-DGTRDILQQY-EKID 60
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLE 215
++ V+ L + SG AR ERA + FLDS + + L LE
Sbjct: 61 ERIHVVYLEENSGAAVARNKALERAQGRYVAFLDSDDKWKKDKLEKQLE 109
>UniRef50_Q2KA13 Cluster: Putative beta-D-1,6 glucosyltransferase
protein; n=1; Rhizobium etli CFN 42|Rep: Putative
beta-D-1,6 glucosyltransferase protein - Rhizobium etli
(strain CFN 42 / ATCC 51251)
Length = 790
Score = 43.6 bits (98), Expect = 0.015
Identities = 34/106 (32%), Positives = 53/106 (50%), Gaps = 4/106 (3%)
Query: 108 SVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP 167
S SV+VP +N + + R S+L + + EI ++DD S D + KA ++ G+
Sbjct: 22 SFSVIVPAYNAS-NVIGRCLQSLLNQQGQAAF-EIIVIDDCSADDTIAKA-EEIAHGHN- 77
Query: 168 KVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPL 213
V V+RL G AR AG RA + + F+D+ N+L L
Sbjct: 78 NVSVVRLDSNGGPGIARNAGVLRAKGEWICFVDADDVVENNFLEML 123
>UniRef50_Q83VE8 Cluster: EpsN; n=1; Lactococcus lactis subsp.
cremoris|Rep: EpsN - Lactococcus lactis subsp. cremoris
(Streptococcus cremoris)
Length = 351
Score = 43.6 bits (98), Expect = 0.015
Identities = 40/109 (36%), Positives = 57/109 (52%), Gaps = 6/109 (5%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
+SVVVP +N LL+ S+ ++ +L EI LVDD S KD GK DD L+ +
Sbjct: 5 ISVVVPVYNVE-KYLLKCFESISKQTYTNL--EIILVDDGS-KDTSGKMCDD-LKLLDDR 59
Query: 169 VKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPI 217
V V+ K +GL AR G E A+ ++F+DS N + L + I
Sbjct: 60 VIVVH-KKNAGLGYARNTGLEMASGKYVLFVDSDDYIEENMVSRLYQRI 107
>UniRef50_Q1PUL2 Cluster: Putative uncharacterized protein; n=1;
Candidatus Kuenenia stuttgartiensis|Rep: Putative
uncharacterized protein - Candidatus Kuenenia
stuttgartiensis
Length = 342
Score = 43.6 bits (98), Expect = 0.015
Identities = 32/103 (31%), Positives = 55/103 (53%), Gaps = 11/103 (10%)
Query: 110 SVVVPFHNEHWS--TLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP 167
S+++P +NE S T L + SV Y P+ KEI ++DDAS+ D K ++ +
Sbjct: 4 SILIPAYNEEKSISTCLDSLLSVTY--PD---KEIIVIDDASS-DHTVKEVETFASRG-- 55
Query: 168 KVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWL 210
V +++ K G A +G RAT ++++ D+ T + +WL
Sbjct: 56 -VILVKREKNGGRAAALNSGLSRATGEIVVTTDADTGVSPHWL 97
>UniRef50_Q1FJU1 Cluster: Glycosyl transferase, family
2:CDP-glycerol:poly(Glycerophosphate)
glycerophosphotransferase; n=1; Clostridium
phytofermentans ISDg|Rep: Glycosyl transferase, family
2:CDP-glycerol:poly(Glycerophosphate)
glycerophosphotransferase - Clostridium phytofermentans
ISDg
Length = 979
Score = 43.6 bits (98), Expect = 0.015
Identities = 39/125 (31%), Positives = 62/125 (49%), Gaps = 8/125 (6%)
Query: 109 VSVVVPFHNEHWSTLLRTAY-SVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP 167
VS+++P +N S LR S++ ++ E + E+ +V+D ST D L L +Y E P
Sbjct: 6 VSIIMPVYNV--SRYLRNCLDSIVAQTMEDI--ELIVVNDGSTDDSLS-ILKEY-EVQYP 59
Query: 168 KVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYR-TVMC 226
+ + + G+ AR G ERA+ + L+F+DS L E A D V+C
Sbjct: 60 EWMHVYTTENRGVSHARNYGIERASGEYLLFVDSDDFVEPEICEKLYEKAAQDNNDVVIC 119
Query: 227 PFIDV 231
+ DV
Sbjct: 120 RYNDV 124
>UniRef50_A7BMC2 Cluster: Glycosyl transferase, group 2 family; n=1;
Beggiatoa sp. SS|Rep: Glycosyl transferase, group 2
family - Beggiatoa sp. SS
Length = 306
Score = 43.6 bits (98), Expect = 0.015
Identities = 45/191 (23%), Positives = 85/191 (44%), Gaps = 12/191 (6%)
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
P VS+V+P N+ T L + + KE+ ++++AST D + L L+G
Sbjct: 117 PRVSIVIPVFNQALYTY--NCLLTLQACDQTISKEVIIINNAST-DETAELLAQ-LQGAF 172
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMC 226
++ + G + A GAE A + ++FL++ T+ WL L+ + D +T+
Sbjct: 173 T---ILNNEENQGFVHACRQGAEVARGEFILFLNNDTQVMPGWLSNSLKAMDSD-QTIGI 228
Query: 227 PFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEPFESPVMAGGLFAIS 286
+I D A A + + + P+LP+ +N + +G I
Sbjct: 229 TGSKLIYPDGHLQEAGGIIFNDASGYHYGRSQNPLLPQFNQNRIVDY----CSGASLMIR 284
Query: 287 RAFFWELGGYD 297
++ + +LGG+D
Sbjct: 285 KSLWAQLGGFD 295
>UniRef50_A5FSS4 Cluster: Histidinol-phosphate phosphatase family
protein; n=2; Dehalococcoides|Rep: Histidinol-phosphate
phosphatase family protein - Dehalococcoides sp. BAV1
Length = 410
Score = 43.6 bits (98), Expect = 0.015
Identities = 41/117 (35%), Positives = 56/117 (47%), Gaps = 10/117 (8%)
Query: 108 SVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP 167
SVSVV+P NE + + L P I E+ LVD ST + KAL + P
Sbjct: 188 SVSVVIPTLNEEKN--IGCVLPRLRCVPG--ITEVILVDGKSTDRTVEKALQLW-----P 238
Query: 168 KVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTV 224
++KVI P R G A AG E AT D+++ LD+ LP + P+ Y+ V
Sbjct: 239 EIKVINQPGR-GKGNAMRAGFEAATGDIVVTLDADGSLAPEELPRYIGPLFEGYQMV 294
>UniRef50_A0VWC2 Cluster: Glycosyl transferase, family 2; n=3;
Rhodobacteraceae|Rep: Glycosyl transferase, family 2 -
Dinoroseobacter shibae DFL 12
Length = 291
Score = 43.6 bits (98), Expect = 0.015
Identities = 31/106 (29%), Positives = 53/106 (50%), Gaps = 6/106 (5%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
+++++P +N+ + L T L + E+ +VD+ ST+D LG ++ L+G P
Sbjct: 4 IALIIPHYND--TARLATCLGALAPQMTEAV-ELIVVDNGSTED-LGP-IEALLQGRFPG 58
Query: 169 VKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLL 214
++I P + G AR G TA L+FLD+ +WL L
Sbjct: 59 ARLIHEPGK-GAAFARNRGVAETTAPDLLFLDADCVPGPDWLTTAL 103
>UniRef50_Q8PUV7 Cluster: Glycosyltransferase; n=3;
Euryarchaeota|Rep: Glycosyltransferase - Methanosarcina
mazei (Methanosarcina frisia)
Length = 313
Score = 43.6 bits (98), Expect = 0.015
Identities = 32/111 (28%), Positives = 53/111 (47%), Gaps = 8/111 (7%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIK--EIFLVDDASTKDFLGKALDDYLEGNM 166
+S+++P HNE + L Y LY + K EI VDD ST D K +E
Sbjct: 7 LSIIIPVHNEEENIL--ELYKSLYNILSLVGKTYEIIYVDDGSTDDSFEKIKS--IED-- 60
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPI 217
+VKV+R + G A G +++ D++I +D + + +P +E +
Sbjct: 61 ARVKVVRFQRNYGKAAALSCGFKKSKGDIVITMDGDLQDDPKEIPRFIEEL 111
>UniRef50_A0RYV6 Cluster: Dolichol-phosphate mannosyltransferase;
n=1; Cenarchaeum symbiosum|Rep: Dolichol-phosphate
mannosyltransferase - Cenarchaeum symbiosum
Length = 385
Score = 43.6 bits (98), Expect = 0.015
Identities = 29/113 (25%), Positives = 58/113 (51%), Gaps = 6/113 (5%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
VS++VP +NE ++ SV P+++ E ++DD S D G+ ++DY+ K
Sbjct: 9 VSILVPTYNES-QNIIGLLKSVAESLPKNIAAETIVIDDNSP-DGTGRLVEDYIRSVGKK 66
Query: 169 ----VKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPI 217
+ +I + GL +A + G ++A + ++ +DS + +P +LE +
Sbjct: 67 AGQTIGIIHRRTKRGLGSAIIHGIQQARGETIVVMDSDFSHPPSAIPRMLESL 119
>UniRef50_Q97GN9 Cluster: Glycosyltransferase domain containing
protein; n=1; Clostridium acetobutylicum|Rep:
Glycosyltransferase domain containing protein -
Clostridium acetobutylicum
Length = 1044
Score = 43.2 bits (97), Expect = 0.019
Identities = 47/219 (21%), Positives = 92/219 (42%), Gaps = 13/219 (5%)
Query: 103 IESLPSVSVVVPFHN--EHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDD 160
+ S P VS ++ +N +H L+ +S L K ++V D +KD G +
Sbjct: 1 MNSYPKVSFIIVNYNGLQH----LKNCFSELKNLSYPSDKIEYIVVDNGSKD--GSV--E 52
Query: 161 YLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIA-L 219
+L+ N P VK+I+ G A+ A + L +++ + + NWL + E +
Sbjct: 53 FLKKNYPAVKIIKNDSNEGFAKPNDDAAKIAEGEYLALINNDMKLDKNWLNDMFETLENC 112
Query: 220 DYRTVMCPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEPFESPVMA 279
+ + +C ++ +D + G+ + + Y + + K E +
Sbjct: 113 NDNSYVCAGSKIVNWDGSKLDFAG-GSVSFAGYGYQYDYGMDIKEANKKYNEDRDILFAC 171
Query: 280 GGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCG 318
GG I + F E+GG+D + E +L +++W G
Sbjct: 172 GGSMLIRKDVFIEIGGFDKDYFAY-YEDVDLGWRLWVLG 209
>UniRef50_Q4UUG0 Cluster: O-antigen biosynthesis protein; n=2;
Xanthomonas campestris pv. campestris|Rep: O-antigen
biosynthesis protein - Xanthomonas campestris pv.
campestris (strain 8004)
Length = 1203
Score = 43.2 bits (97), Expect = 0.019
Identities = 50/225 (22%), Positives = 87/225 (38%), Gaps = 6/225 (2%)
Query: 96 GCKKKMYIES-LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFL 154
GC + Y + P VS++VP N+ +L+ L + E+ +VD+ ST
Sbjct: 573 GCYRIEYGHAHTPGVSIIVPTKNQ--LGMLQRCVETLLEKTAYSNYELLIVDNGSTDADA 630
Query: 155 GKALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTE-ANVNWLPPL 213
+ LD + P+++V+R P A+ A + L+ L++ T NWL L
Sbjct: 631 CQWLDGIEAMDSPQLRVLRYPHPFNYAAMNNLAAQHARGEYLVLLNNDTAILQENWLDAL 690
Query: 214 LEPIALDYRTVMCPFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEPF 273
L V+ + + T ++ G RG + F + L P +
Sbjct: 691 LNHAQRPEVGVVGAKL-LYPNGTVQHAGVVLGLRGPAEHPFNGQALDA-PGYMYRLQVDQ 748
Query: 274 ESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIWQCG 318
+ G + + + E+GG D + +L KI Q G
Sbjct: 749 NYSAVTGACMMVRASLYAEVGGLDEDVFKVSYNDVDLCLKIRQAG 793
>UniRef50_Q6TDC7 Cluster: WaaV; n=42; Campylobacter|Rep: WaaV -
Campylobacter jejuni
Length = 275
Score = 43.2 bits (97), Expect = 0.019
Identities = 34/137 (24%), Positives = 69/137 (50%), Gaps = 13/137 (9%)
Query: 106 LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGN 165
+P +S+++P N +LR S + ++ + + EI ++DD S + L L+
Sbjct: 1 MPQLSIIIPLFNS-CDFILRALQSCINQTLKDI--EILIIDDKSKDNSLNMVLE--FAKK 55
Query: 166 MPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVM 225
P++K+ + + G +R G +++D ++FLDS ++L P IA ++ +
Sbjct: 56 DPRIKIFQNEENLGTFASRNLGVLHSSSDFIMFLDSD-----DFLTPGACEIA--FKEMK 108
Query: 226 CPFIDVIAFDTFEYRAQ 242
F D++ FD F +R +
Sbjct: 109 KGF-DLLCFDAFVHRVK 124
>UniRef50_Q0YLA6 Cluster: Glycosyl transferase, family 2; n=2;
Geobacter|Rep: Glycosyl transferase, family 2 -
Geobacter sp. FRC-32
Length = 346
Score = 43.2 bits (97), Expect = 0.019
Identities = 29/109 (26%), Positives = 51/109 (46%), Gaps = 5/109 (4%)
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
P++ ++VP N T ++L SP + +VD+ S ++ K L ++ EG
Sbjct: 4 PAIDIIVPVWNRPIETR-NCLVNLLNHSPN---ARLIMVDNGSDRE-TEKLLQEFSEGLD 58
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLE 215
+ ++R GL+ A G E+ A+ L + + + WL PLLE
Sbjct: 59 DRAFLLRNDVNQGLVKALNRGLEKGEAEYLFIVKNTSLVGPGWLEPLLE 107
>UniRef50_Q0S4J5 Cluster: Possible glycosyltransferase; n=2;
Corynebacterineae|Rep: Possible glycosyltransferase -
Rhodococcus sp. (strain RHA1)
Length = 316
Score = 43.2 bits (97), Expect = 0.019
Identities = 35/110 (31%), Positives = 51/110 (46%), Gaps = 6/110 (5%)
Query: 101 MYIESLPSVSVVV-PFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALD 159
M + +L S SVV+ + E W+ L SVL ++ L E+ +V D S + F +
Sbjct: 1 MTVSTLVSCSVVICAYTTERWTDLCAAVDSVLTQNVPVL--EVLVVIDHSEELF--SRAE 56
Query: 160 DYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNW 209
+ G +V V+R GL AR G A DV+ FLD A +W
Sbjct: 57 RHFAGQT-RVTVLRNSGSKGLSGARNTGVRAARGDVIAFLDDDARAEPDW 105
>UniRef50_P74948 Cluster: ORF35x9 protein; n=4; Vibrio cholerae|Rep:
ORF35x9 protein - Vibrio cholerae
Length = 310
Score = 43.2 bits (97), Expect = 0.019
Identities = 36/98 (36%), Positives = 56/98 (57%), Gaps = 6/98 (6%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEG-NMP 167
+SVV+P +N+++S ++R SVLY++ L EI +V+D ST D L LD YL+ +M
Sbjct: 5 ISVVIPLYNKNYS-IVRCLNSVLYQT--ILPFEIIIVNDGSTDDSL-VVLDYYLKTISMR 60
Query: 168 KVKVI-RLPKRSGLITARLAGAERATADVLIFLDSHTE 204
V VI + G+ AR G A ++ + LD+ E
Sbjct: 61 DVNVIVHDQQNQGVSRARNNGIALAKSNYIALLDADDE 98
>UniRef50_A7NHD1 Cluster: Glycosyl transferase family 2; n=1;
Roseiflexus castenholzii DSM 13941|Rep: Glycosyl
transferase family 2 - Roseiflexus castenholzii DSM
13941
Length = 358
Score = 43.2 bits (97), Expect = 0.019
Identities = 54/225 (24%), Positives = 91/225 (40%), Gaps = 12/225 (5%)
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
P V++V+ +N + + SV +PE + +VD+AST + + + + +
Sbjct: 6 PGVTIVLITYNSA-AYVRECLQSVRCAAPE---VHVLIVDNASTDNTV-----EIVRRDF 56
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMC 226
P+ ++ LP+ G A +RA ++FLD T V WL PLL A + V
Sbjct: 57 PECTLVPLPQNIGHSAACNLALQRAKTAWVLFLDHDTTTPVGWLEPLLAIAAATWPDV-- 114
Query: 227 PFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEPFESPVMAGGLFAIS 286
+ A + R +G + + D P E A +
Sbjct: 115 GMVGSRAVLVEQGRIHHDGGYAHYVGHMTLRNGFAPLADVACDTTPVEVGAQASTSLLVH 174
Query: 287 RAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGH 331
R +GG+DP I+ + ++LS ++ G R AP S V H
Sbjct: 175 RERALAVGGFDPRFFIYLND-FDLSLRMRLRGWRCYVAPESVVYH 218
>UniRef50_A7CSW3 Cluster: Glycosyl transferase family 2; n=1;
Opitutaceae bacterium TAV2|Rep: Glycosyl transferase
family 2 - Opitutaceae bacterium TAV2
Length = 337
Score = 43.2 bits (97), Expect = 0.019
Identities = 35/99 (35%), Positives = 52/99 (52%), Gaps = 10/99 (10%)
Query: 107 PSVSVVVPFHNEHWSTLLR----TAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYL 162
P +SV++P N LLR + S+ S + L E+ +VDD S +D G L Y
Sbjct: 3 PRLSVILPVFNR--VPLLRHPLDSLRSMQEASADRLSWEVIVVDDGSVEDVSG-ILTAY- 58
Query: 163 EGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDS 201
+P + + RL K SGL+ ARL G A + ++FLD+
Sbjct: 59 -SGLP-IHLHRLQKNSGLLRARLEGLALAGGEAVMFLDA 95
>UniRef50_A6BIG1 Cluster: Putative uncharacterized protein; n=1;
Dorea longicatena DSM 13814|Rep: Putative
uncharacterized protein - Dorea longicatena DSM 13814
Length = 291
Score = 43.2 bits (97), Expect = 0.019
Identities = 33/103 (32%), Positives = 54/103 (52%), Gaps = 6/103 (5%)
Query: 129 SVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLAGA 188
S+L+++ +++ EI +VDD S D GK D Y P++KVI K +GL AR +G
Sbjct: 3 SLLHQTYKNI--EIIMVDDGSPDD-CGKKCDRYA-AEEPRIKVIH-KKNAGLGRARNSGL 57
Query: 189 ERATADVLIFLDSHTEANVNWLPPLLEPIALD-YRTVMCPFID 230
E A + ++F+DS +V + L + + T C + D
Sbjct: 58 EIAEGEYVMFIDSDDYTDVRMIERLYHRLTEEGADTCFCRYYD 100
>UniRef50_Q5LBM4 Cluster: Putative glycosyltransferase protein; n=1;
Bacteroides fragilis NCTC 9343|Rep: Putative
glycosyltransferase protein - Bacteroides fragilis
(strain ATCC 25285 / NCTC 9343)
Length = 342
Score = 42.7 bits (96), Expect = 0.026
Identities = 34/96 (35%), Positives = 50/96 (52%), Gaps = 6/96 (6%)
Query: 106 LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGN 165
+P VSV+VP +N L + ++L ++ + EI L+DD S D K DDY
Sbjct: 2 IPKVSVIVPIYNVE-KYLDQCVQALLAQTLSDI--EIILIDDES-PDNCPKICDDY-AAQ 56
Query: 166 MPKVKVIRLPKRSGLITARLAGAERATADVLIFLDS 201
P +KVI K +GL A +G + AT + + F DS
Sbjct: 57 YPNIKVIH-KKNAGLGMACNSGLDVATGEYVAFCDS 91
>UniRef50_Q9X363 Cluster: PXO1-93; n=7; Bacilli|Rep: PXO1-93 -
Bacillus anthracis
Length = 366
Score = 42.7 bits (96), Expect = 0.026
Identities = 41/128 (32%), Positives = 61/128 (47%), Gaps = 22/128 (17%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTK----DFLGKALDDYLEG 164
V+VVVP +NE S ++ T SVL + ++ I EIF VDD S + K ++ L
Sbjct: 3 VAVVVPSYNESASAIVNTINSVL--AQDYPIHEIFFVDDGSKDKSAYEVALKMREELLRT 60
Query: 165 N----------------MPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVN 208
+P + V RLPK G A+L +R TAD ++ +DS + N
Sbjct: 61 QREIAATTKNICSEILGIPDLIVHRLPKNCGKRHAQLWAFKRTTADAIVTIDSDGDLFPN 120
Query: 209 WLPPLLEP 216
+ LL+P
Sbjct: 121 AVRELLKP 128
>UniRef50_Q04TX6 Cluster: UndP-glycosyltransferase; n=2; Leptospira
borgpetersenii serovar Hardjo-bovis|Rep:
UndP-glycosyltransferase - Leptospira borgpetersenii
serovar Hardjo-bovis (strain JB197)
Length = 380
Score = 42.7 bits (96), Expect = 0.026
Identities = 29/109 (26%), Positives = 52/109 (47%), Gaps = 2/109 (1%)
Query: 106 LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGN 165
+P S+++P +NE + +L + + + EI +VDD S D K + E
Sbjct: 1 MPQFSLILPTYNEKENLILLLPKLIALFKSKKIDYEIIIVDDDS-PDLTWKWFQN-KEKE 58
Query: 166 MPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLL 214
P V++IR GL +A L G + + L +D+ + + N LP ++
Sbjct: 59 FPSVRLIRRIHEKGLSSAVLTGMASSQGEYLCVMDADLQHDENILPEMI 107
>UniRef50_A7NQ51 Cluster: Glycosyl transferase family 2; n=1;
Roseiflexus castenholzii DSM 13941|Rep: Glycosyl
transferase family 2 - Roseiflexus castenholzii DSM
13941
Length = 738
Score = 42.7 bits (96), Expect = 0.026
Identities = 32/93 (34%), Positives = 50/93 (53%), Gaps = 4/93 (4%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
+SVVVP+HN + L T S++ S L +I +VDD S AL+ + +
Sbjct: 445 LSVVVPYHNLG-AYLAETIASIVASSYRPL--DIVIVDDGSDDPASVAALEQVGQMHPDL 501
Query: 169 VKVIRLPKRSGLITARLAGAERATADVLIFLDS 201
V+++R +R GL AR GA+ A + L F+D+
Sbjct: 502 VRIVRC-ERGGLARARNRGAQAARGEFLAFVDA 533
>UniRef50_A6C910 Cluster: Truncated O-antigen biosynthesis protein;
n=1; Planctomyces maris DSM 8797|Rep: Truncated
O-antigen biosynthesis protein - Planctomyces maris DSM
8797
Length = 903
Score = 42.7 bits (96), Expect = 0.026
Identities = 29/111 (26%), Positives = 56/111 (50%), Gaps = 3/111 (2%)
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
P +S+++P +N L SVL ++ + E+ + DDASTK + +++Y + +
Sbjct: 375 PLISIILPTYNTKEKILRACIESVLAQTYSNW--ELCIADDASTKSRVRDVINEYSKQD- 431
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPI 217
++K + + + A ++ AE D + FLD E N N L +++ I
Sbjct: 432 SRIKSVFRTENGHISEAMISAAELMEGDYISFLDHDDELNKNALLFIVDAI 482
Score = 41.1 bits (92), Expect = 0.078
Identities = 49/206 (23%), Positives = 87/206 (42%), Gaps = 20/206 (9%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
VS+++P ++ LL + H+ EI +VD+ S + + ++ + +
Sbjct: 635 VSIIIPTKDK--IDLLDDCIESIRNRSSHINWEIIIVDNRSEETASKEYFSTVVQDS--R 690
Query: 169 VKVIRLPKRSGLITARLAGAERATADVLIFLDSHT-EANVNWLPPLLEPIALDYRTVMCP 227
+KV+ GA+ AT DV +FL++ T +W+ L +L ++ P
Sbjct: 691 IKVVEADVEFNWSMINNIGAKAATGDVFVFLNNDTLVITPDWIEKLASMASLPEVGLVGP 750
Query: 228 FIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEPFESPVM-------AG 280
+ + +T ++ G G D F +LPV + PF SP++ G
Sbjct: 751 QL-LYEDNTIQHAGVVVGMGGWAD-HVFKNQLPV------HRSGPFVSPMLNRNVLAITG 802
Query: 281 GLFAISRAFFWELGGYDPGLDIWGGE 306
I RA F +LGG+D I G +
Sbjct: 803 ACQVIERAKFEQLGGFDEQFIICGSD 828
>UniRef50_A3U4F3 Cluster: Putative fucosyl transferase; n=1;
Croceibacter atlanticus HTCC2559|Rep: Putative fucosyl
transferase - Croceibacter atlanticus HTCC2559
Length = 311
Score = 42.7 bits (96), Expect = 0.026
Identities = 59/206 (28%), Positives = 93/206 (45%), Gaps = 22/206 (10%)
Query: 106 LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGN 165
+P SV++P N+ + RT SVL +S + E+ +V+D ST + L K + + +
Sbjct: 1 MPFFSVIIPLFNKA-EYVERTLNSVLLQSFDDF--EVIIVNDGSTDNSL-KIAESFTDDR 56
Query: 166 MPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVM 225
+ VI G+ TAR G +AT++ + FLD A+ W P L AL V
Sbjct: 57 I----VIYSKDNKGVSTARNYGVTKATSNYIAFLD----ADDIWKPNHLN--ALHNLIVD 106
Query: 226 CPFIDVIAFDTFEYRAQDEGARGAFDW-EFFYKRLPVLPKDEKNMPEPFESPVMAGGLFA 284
P + + +++ R F YK V+ KN PF + G A
Sbjct: 107 MPSCALYCMGYIKQKSEKSKIRAQFGTIADGYK--GVVSGYFKN-SLPF--TIAGMGAVA 161
Query: 285 ISRAFFWELGGYDPGLDIWGGEQYEL 310
+++A F E+GG+ G + GE EL
Sbjct: 162 VNKAKFIEVGGFTEG--VTHGEDIEL 185
>UniRef50_A7RJ47 Cluster: Predicted protein; n=1; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 237
Score = 42.7 bits (96), Expect = 0.026
Identities = 21/70 (30%), Positives = 35/70 (50%), Gaps = 3/70 (4%)
Query: 481 CWDLPDSSPGSPIVLYACHLGGGNQLWRYYPDTKQIKHGGHSNCVDHEPTTLR--LHISR 538
C D + + G L+ACH GGNQ W Y D Q+K+ +C+ P + + + +
Sbjct: 127 CVDTMERTEGGFPELFACHQKGGNQEWEYTSD-NQLKNPLRGDCLTAPPNKEKTIIELRQ 185
Query: 539 CDAAADTQRW 548
C + + Q+W
Sbjct: 186 CSSDSPLQKW 195
>UniRef50_Q57022 Cluster: Uncharacterized glycosyltransferase
HI0868; n=12; Haemophilus influenzae|Rep:
Uncharacterized glycosyltransferase HI0868 - Haemophilus
influenzae
Length = 250
Score = 42.7 bits (96), Expect = 0.026
Identities = 28/97 (28%), Positives = 54/97 (55%), Gaps = 5/97 (5%)
Query: 105 SLPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEG 164
++P +S+++P +N L + S L +S +++ E+ L+DD ST + + +++ ++
Sbjct: 2 NMPLISIIMPVYNAE-CYLNQGILSCLNQSYQNI--ELILIDDGSTDKSI-EIINNIIDK 57
Query: 165 NMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDS 201
+ +VK+ P G AR G E+A D + FLDS
Sbjct: 58 DK-RVKLFFTPTNQGPAAARNIGLEKAQGDYITFLDS 93
>UniRef50_Q3ADF7 Cluster: Glycosyl transferase, group 2 family; n=1;
Carboxydothermus hydrogenoformans Z-2901|Rep: Glycosyl
transferase, group 2 family - Carboxydothermus
hydrogenoformans (strain Z-2901 / DSM 6008)
Length = 833
Score = 42.3 bits (95), Expect = 0.034
Identities = 48/224 (21%), Positives = 91/224 (40%), Gaps = 18/224 (8%)
Query: 141 EIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLD 200
E+ +VD+ ST+ L + L + L+ VKV LP+ G G + ++FL+
Sbjct: 477 ELIIVDNGSTETQLLEYLKN-LQERRDDVKVYFLPENVGYGLGNNFGLLHSVGKYVVFLN 535
Query: 201 SHTEANVNWLPPLLEPIALDYRT-VMCPFIDVIA----FDTFEYRAQDEGARGAFDWEFF 255
+ T +WL L+ + +D +T ++ P + ++ + Y+ +E A WE
Sbjct: 536 NDTIVTPDWLEKLIIDLNMDEKTGLIGPRTNYVSGEQIVNKVNYKNMNELLEFANKWE-- 593
Query: 256 YKRLPVLPKDEKNMPEPFESPVMAGGLFAISRAFFWELGGYDPGLDIWGGEQYELSFKIW 315
N E E+ + G R ++GG+DP +I E +L + +
Sbjct: 594 ----------AVNYREVTETKKLVGFCLLAKREVIEKIGGFDPLFEIGNFEDDDLCRRAY 643
Query: 316 QCGGRMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKNYRRVAEVW 359
G ++ A + H K + + + KN + + W
Sbjct: 644 LAGFKLKIADDVFIHHYGSKTFISEKINYDEIMKKNREKYLKKW 687
>UniRef50_Q1Q4P0 Cluster: Similar to glycosyltransferase family 2;
n=1; Candidatus Kuenenia stuttgartiensis|Rep: Similar to
glycosyltransferase family 2 - Candidatus Kuenenia
stuttgartiensis
Length = 295
Score = 42.3 bits (95), Expect = 0.034
Identities = 38/109 (34%), Positives = 56/109 (51%), Gaps = 12/109 (11%)
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
P VSV++P +N L + SVL ++ + E+ ++DD ST D L Y+E
Sbjct: 6 PLVSVIMPTYNCA-VYLQESIESVLAQTYDAY--EVVVIDDGST-DNTKLVLKPYME--- 58
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLE 215
K+K I L + GL TAR G + A + + FLD A+ W+P LE
Sbjct: 59 -KIKYIDLGRNEGLPTARNLGIQSAKGEYVAFLD----ADDIWMPEKLE 102
>UniRef50_Q15RB7 Cluster: Glycosyl transferase, family 2; n=1;
Pseudoalteromonas atlantica T6c|Rep: Glycosyl
transferase, family 2 - Pseudoalteromonas atlantica
(strain T6c / BAA-1087)
Length = 289
Score = 42.3 bits (95), Expect = 0.034
Identities = 34/93 (36%), Positives = 51/93 (54%), Gaps = 7/93 (7%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
VSVV+P +N S + SVL ++ + EI ++DD ST L L Y + K
Sbjct: 5 VSVVIPVYNSQ-SYIADCIDSVLAQTYQDF--EIIVIDDGSTDSSLD-ILKSYQKKQ--K 58
Query: 169 VKVIRLPKRSGLITARLAGAERATADVLIFLDS 201
+K++++ + SG AR G E+AT LIF+DS
Sbjct: 59 IKLLQI-ENSGQSVARNLGIEQATGKYLIFIDS 90
>UniRef50_Q027L5 Cluster: Glycosyl transferase, family 2; n=1;
Solibacter usitatus Ellin6076|Rep: Glycosyl transferase,
family 2 - Solibacter usitatus (strain Ellin6076)
Length = 241
Score = 42.3 bits (95), Expect = 0.034
Identities = 35/117 (29%), Positives = 56/117 (47%), Gaps = 7/117 (5%)
Query: 107 PSVSVVVPFHNEHWST--LLRTAYSVLYRSPEHLIK-EIFLVDDASTKDFLGKALDDYLE 163
PS+SV P +N+ S LL + L EH+ E+ +V+D S D GK L++ +
Sbjct: 6 PSLSVFFPAYNDAPSLPGLLAKTFHAL---EEHVADYEVIVVNDGSYDD-TGKVLEELRQ 61
Query: 164 GNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALD 220
P ++V+ + G A G E AT + + + D + + LP LLE + D
Sbjct: 62 KYHPYLRVVTHEQNRGYGGALRTGFESATREFVFYTDGDGQYDAGELPKLLELMTPD 118
>UniRef50_A6L4H5 Cluster: Glycosyltransferase family 2; n=1;
Bacteroides vulgatus ATCC 8482|Rep: Glycosyltransferase
family 2 - Bacteroides vulgatus (strain ATCC 8482 / DSM
1447 / NCTC 11154)
Length = 321
Score = 42.3 bits (95), Expect = 0.034
Identities = 44/151 (29%), Positives = 67/151 (44%), Gaps = 7/151 (4%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
+S++VP +N L R S++ ++ ++ EI LVDD S D G+ D+Y + + +
Sbjct: 8 ISIIVPVYNVE-QYLSRCVDSLVNQTYHNI--EIILVDDGSP-DRSGEICDEYAKKDK-R 62
Query: 169 VKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMCPF 228
VKVI GL AR + A D L+F+DS +L +A DY+ F
Sbjct: 63 VKVIH-QSNGGLSDARNTALDIAKGDYLMFVDSDDWIEKKTC-EILNRLASDYKADTVIF 120
Query: 229 IDVIAFDTFEYRAQDEGARGAFDWEFFYKRL 259
FD+ + G G D K L
Sbjct: 121 GLNFVFDSGKVIRSKRGMAGLTDKNMCMKYL 151
>UniRef50_A4AE68 Cluster: Glycosyltransferase family protein; n=1;
Congregibacter litoralis KT71|Rep: Glycosyltransferase
family protein - Congregibacter litoralis KT71
Length = 1270
Score = 42.3 bits (95), Expect = 0.034
Identities = 51/225 (22%), Positives = 87/225 (38%), Gaps = 14/225 (6%)
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
P VS++VP +N++ T+ ++R + L E+ + DD ST L + + + G
Sbjct: 631 PRVSIIVPVYNDYRVTI--NCLQSVHRFSQDLDYEVIIADDCSTD--LTTTITERMRG-- 684
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMC 226
+ V R + + +A + L+FL++ T WL L+EP V
Sbjct: 685 --ITVSRTTENLRFLKNCNQAVAQARGEFLVFLNNDTAVTEGWLATLIEPFNDQSVGVTG 742
Query: 227 PFIDVIAFDTFEYRAQDEGARGAFDWEFFYKRLPVLPKDEKNMPEPFESPVMAGGLFAIS 286
P ++ D A A W F P P + ++G AI
Sbjct: 743 P--KLLFADGVLQEAGGIIWNDASGWNFGRADDPDKPAFNYRRDVDY----VSGACLAIR 796
Query: 287 RAFFWELGGYDPGLDIWGGEQYELSFKIWQCGGRMLDAPCSRVGH 331
+ +LGG+D E ++ F G R++ P S + H
Sbjct: 797 GDLWQQLGGFDERFAPAYYEDADICFAARAAGFRVVYQPESVIYH 841
>UniRef50_A0YST2 Cluster: Putative uncharacterized protein; n=1;
Lyngbya sp. PCC 8106|Rep: Putative uncharacterized
protein - Lyngbya sp. PCC 8106
Length = 328
Score = 42.3 bits (95), Expect = 0.034
Identities = 34/110 (30%), Positives = 58/110 (52%), Gaps = 12/110 (10%)
Query: 106 LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGN 165
+P VSV++P +N T+ T SVL +S E+ +++D ST D + L++ +
Sbjct: 1 MPQVSVIIPVYNGE-KTIQETIKSVLDQSFTDF--ELIIINDGST-DKTPEILENISDS- 55
Query: 166 MPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLE 215
++K+ P +GL T+R G +RA+ + F+D A+ W P LE
Sbjct: 56 --RIKIFSFPN-AGLSTSRNRGIDRASGKYIAFID----ADDLWKPGKLE 98
>UniRef50_Q2NHK4 Cluster: Predicted glycosyltransferase; n=2;
cellular organisms|Rep: Predicted glycosyltransferase -
Methanosphaera stadtmanae (strain DSM 3091)
Length = 888
Score = 42.3 bits (95), Expect = 0.034
Identities = 37/133 (27%), Positives = 65/133 (48%), Gaps = 13/133 (9%)
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
P+VS+++P +N+ L S+ +S + + E+ VDD ST D L L Y +
Sbjct: 4 PTVSIIIPIYNKE-KYLNNCISSIKNQSLKDI--EVICVDDGSTDDSL-STLKSYTSDD- 58
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMC 226
+ K+I+ R G AR G E AT + ++FLD+ +W+ E + Y+
Sbjct: 59 SRFKIIKQENR-GPGLARNVGLENATGEYVLFLDAD-----DWIES--ESLEQLYKQATS 110
Query: 227 PFIDVIAFDTFEY 239
D++ F++ E+
Sbjct: 111 NNSDLVLFNSIEH 123
>UniRef50_Q8ABR6 Cluster: Putative glycosyltransferase; n=1;
Bacteroides thetaiotaomicron|Rep: Putative
glycosyltransferase - Bacteroides thetaiotaomicron
Length = 314
Score = 41.9 bits (94), Expect = 0.045
Identities = 29/105 (27%), Positives = 53/105 (50%), Gaps = 6/105 (5%)
Query: 106 LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGN 165
+P ++++VP +N TL R S+ ++ + E+ L+DD S +D+ K D+Y +
Sbjct: 1 MPQITIIVPVYNAE-RTLERCVRSIQAQTVVNW--ELLLIDDGS-EDYSSKLCDEYAIQD 56
Query: 166 MPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWL 210
+++V G +AR G E + ++F DS + NWL
Sbjct: 57 S-RIRVFH-QVNGGASSARNVGLEYTKGEWVVFCDSDDWVDSNWL 99
>UniRef50_Q3ASS6 Cluster: Teichuronic acid biosynthesis; n=1;
Chlorobium chlorochromatii CaD3|Rep: Teichuronic acid
biosynthesis - Chlorobium chlorochromatii (strain CaD3)
Length = 285
Score = 41.9 bits (94), Expect = 0.045
Identities = 39/117 (33%), Positives = 60/117 (51%), Gaps = 16/117 (13%)
Query: 101 MYIESLPSVSVVVPFHNEHWSTLLRTAYSVLYR--SPEHLIKEIFLVDDASTKDFLGKAL 158
M+ ++L +S++ P +N + L R SV + + EH+I +DDAST D + L
Sbjct: 1 MFTDNL--ISIITPVYNTY-PFLFRLVQSVQMQKVNVEHII-----IDDAST-DHSYETL 51
Query: 159 DDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLE 215
+Y + +++IRLP G + AR G + A L FLD A+ WLP LE
Sbjct: 52 IEYAK-KYSNIRLIRLPLNRGPVVARNEGIKIAQGRFLAFLD----ADDLWLPNKLE 103
>UniRef50_Q2RQ74 Cluster: Glycosyl transferase, family 2; n=1;
Rhodospirillum rubrum ATCC 11170|Rep: Glycosyl
transferase, family 2 - Rhodospirillum rubrum (strain
ATCC 11170 / NCIB 8255)
Length = 262
Score = 41.9 bits (94), Expect = 0.045
Identities = 30/72 (41%), Positives = 38/72 (52%), Gaps = 6/72 (8%)
Query: 141 EIFLVDDASTKDFLGKALDDYLEGNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLD 200
EI LVDDAST L AL L P++ VI + G AR AGA A ++L +LD
Sbjct: 49 EIILVDDASTDATL--ALARRLAAEDPRITVIARRRNGGAGPARNAGARAAKGEILFYLD 106
Query: 201 SHTEANVNWLPP 212
A+ +LPP
Sbjct: 107 ----ADDGFLPP 114
>UniRef50_Q9S5F4 Cluster: Glycosyl transferase; n=7;
Enterobacteriaceae|Rep: Glycosyl transferase -
Escherichia coli
Length = 260
Score = 41.9 bits (94), Expect = 0.045
Identities = 24/94 (25%), Positives = 50/94 (53%), Gaps = 5/94 (5%)
Query: 108 SVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP 167
+VS+++P +N T++ + S++++S + + ++++DD ST D Y N
Sbjct: 5 TVSIIMPVYNGA-KTIISSVESIIHQSYQDFV--LYIIDDCSTDDTFSLINSRYK--NNQ 59
Query: 168 KVKVIRLPKRSGLITARLAGAERATADVLIFLDS 201
K++++R G+ +R G E AT + F D+
Sbjct: 60 KIRILRNKTNLGVAESRNYGIEMATGKYISFCDA 93
>UniRef50_Q8GPA4 Cluster: Eps7G; n=1; Streptococcus
thermophilus|Rep: Eps7G - Streptococcus thermophilus
Length = 328
Score = 41.9 bits (94), Expect = 0.045
Identities = 32/108 (29%), Positives = 54/108 (50%), Gaps = 5/108 (4%)
Query: 110 SVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKV 169
S+++P +N L R YSVL + EI L++D S + L ++++ K+
Sbjct: 5 SIIIPVYNLE-DYLYRCLYSVLNQDYNDF--EIILINDGSDDNSLN-IIEEFKNQYCSKI 60
Query: 170 KVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPI 217
KVI + G+ +AR G + A + +IF+D + N L +LE I
Sbjct: 61 KVISQVNQ-GVSSARNKGLQEAEGEYIIFIDGDDYIDSNHLSNILEYI 107
>UniRef50_Q4MJH5 Cluster: TagF domain protein; n=2; Bacillaceae|Rep:
TagF domain protein - Bacillus cereus G9241
Length = 710
Score = 41.9 bits (94), Expect = 0.045
Identities = 34/122 (27%), Positives = 61/122 (50%), Gaps = 6/122 (4%)
Query: 106 LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGN 165
+ +VSV+VP +N+ L R SVL ++ +L EI L++D ST + G L +++
Sbjct: 1 MKTVSVIVPVYNKE-DYLKRCINSVLNQTYTNL--EIILINDGSTDNSKG-ILTEFVR-E 55
Query: 166 MPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVM 225
+++ + G+ AR G +AT + + FLD+ + L++ I DY +
Sbjct: 56 YEQIRYYEFQRNYGVAKARNYGLSKATGEYVYFLDADDYLMAQAVEVLVDNIG-DYSAIA 114
Query: 226 CP 227
P
Sbjct: 115 GP 116
>UniRef50_Q1PZ07 Cluster: Putative uncharacterized protein; n=1;
Candidatus Kuenenia stuttgartiensis|Rep: Putative
uncharacterized protein - Candidatus Kuenenia
stuttgartiensis
Length = 1028
Score = 41.9 bits (94), Expect = 0.045
Identities = 28/106 (26%), Positives = 56/106 (52%), Gaps = 7/106 (6%)
Query: 110 SVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPKV 169
S+++P N+ T + L+++ I EI +VD+AS+ + ++L+ + K+
Sbjct: 42 SIIIPVFNKVEYT--KKCLEALFKNTPGDIYEIIVVDNASSDE-----TPEFLKRMINKL 94
Query: 170 KVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLE 215
+VI + G +A GA+ A + L+FL++ TE WL +++
Sbjct: 95 EVITNQENMGFTSACNQGAKIAKGNYLVFLNNDTEPQPGWLESMVK 140
>UniRef50_Q052L4 Cluster: Glycosyltransferase; n=10; Leptospira|Rep:
Glycosyltransferase - Leptospira borgpetersenii serovar
Hardjo-bovis (strain L550)
Length = 281
Score = 41.9 bits (94), Expect = 0.045
Identities = 32/104 (30%), Positives = 54/104 (51%), Gaps = 6/104 (5%)
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
P VS+V+P +N + + T S+L + ++ E+ +VDD S F L++Y N
Sbjct: 3 PLVSIVIPCYN-YGRYIHETVESILRQRYKYW--EVIIVDDGSNDPFTISILEEYK--NK 57
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWL 210
V+ +P RSG TAR G + A + ++ LDS + ++L
Sbjct: 58 SGFTVLSIP-RSGPSTARNIGIDTAQGEFILPLDSDDMIHEDYL 100
>UniRef50_A7AFY8 Cluster: Putative uncharacterized protein; n=1;
Parabacteroides merdae ATCC 43184|Rep: Putative
uncharacterized protein - Parabacteroides merdae ATCC
43184
Length = 312
Score = 41.9 bits (94), Expect = 0.045
Identities = 31/112 (27%), Positives = 55/112 (49%), Gaps = 6/112 (5%)
Query: 106 LPSVSVVVPFHNEHW--STLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLE 163
+ ++V+VP +NE + R L++ P ++ EI ++D S +D LD+ +
Sbjct: 3 MKKLAVIVPCYNEELVIAESYRRTREALHKLP--VLTEIIYINDGS-RDRTRLMLDE-IA 58
Query: 164 GNMPKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLE 215
+ P+VKVI + G A AG AD+ + LD+ + +P +LE
Sbjct: 59 ASDPQVKVIHFSRNFGHQPAVTAGINNCDADLAVILDADMQDPPELIPSILE 110
>UniRef50_A4EQH0 Cluster: Probable glycosyltransferase protein; n=3;
Rhodobacteraceae|Rep: Probable glycosyltransferase
protein - Roseobacter sp. SK209-2-6
Length = 332
Score = 41.9 bits (94), Expect = 0.045
Identities = 33/107 (30%), Positives = 50/107 (46%), Gaps = 2/107 (1%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
+SVV+P +NE + L P EI LVDD S+ D +A++D N P
Sbjct: 3 LSVVIPCYNEEEAIPLMVERLTAAVEPWKNSAEIILVDDGSS-DSTWEAIEDAHTLN-PM 60
Query: 169 VKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLE 215
V+ +RL G A AG E A + + LD+ + LP +++
Sbjct: 61 VRGLRLSANRGHQVALTAGLEAAKGERIFMLDADLQDPPELLPDMMK 107
>UniRef50_A3HRI5 Cluster: Glycosyl transferase, family 2; n=1;
Algoriphagus sp. PR1|Rep: Glycosyl transferase, family 2
- Algoriphagus sp. PR1
Length = 335
Score = 41.9 bits (94), Expect = 0.045
Identities = 28/106 (26%), Positives = 53/106 (50%), Gaps = 4/106 (3%)
Query: 106 LPSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGN 165
+ VS++V HN L S + E+ L+D+ S+ D +G + ++ E N
Sbjct: 2 IEGVSIIVCTHNG--KNRLEPTLSSFTKLKSRFPIELILIDNGSS-DGVGNWVKEWWEIN 58
Query: 166 MPKVKV-IRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWL 210
+ KV + + + GL+ AR+ G + ++ D L+F D E + ++L
Sbjct: 59 LVKVPIKVFQEQNPGLMNARIKGIQESSFDFLLFCDDDNELSFDYL 104
>UniRef50_A1G8G0 Cluster: Glycosyl transferase, family 2; n=4;
Actinomycetales|Rep: Glycosyl transferase, family 2 -
Salinispora arenicola CNS205
Length = 333
Score = 41.9 bits (94), Expect = 0.045
Identities = 35/107 (32%), Positives = 49/107 (45%), Gaps = 3/107 (2%)
Query: 109 VSVVVPFHNEHWST-LLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMP 167
+SVVVP NE S L TA + + E+ VDD S L AL L P
Sbjct: 3 LSVVVPCFNEEASVEQLHTAVTAAVAELSDVEIEVVYVDDGSVDGTLA-ALRR-LAAIDP 60
Query: 168 KVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLL 214
V+ L + G A LAG +RAT D ++ +D+ + LP ++
Sbjct: 61 AVRYTSLSRNFGKEAAMLAGLKRATGDAVVIMDADLQHPPRLLPDMV 107
>UniRef50_A4G0E0 Cluster: Glycosyl transferase, family 2; n=2;
cellular organisms|Rep: Glycosyl transferase, family 2 -
Methanococcus maripaludis
Length = 234
Score = 41.9 bits (94), Expect = 0.045
Identities = 31/110 (28%), Positives = 62/110 (56%), Gaps = 5/110 (4%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
+S+++P +NE T+L+T V+ + + KEI +V+D ST D + +++ ++ P+
Sbjct: 3 LSIIIPAYNEE-KTILKTLEEVVAVTLP-VDKEIIIVNDGST-DGTEQIIENSIK-KFPE 58
Query: 169 VKVIRLPKRSGLITARLA-GAERATADVLIFLDSHTEANVNWLPPLLEPI 217
+ L K++G + L G ++T D++I D+ E + N L++PI
Sbjct: 59 SNIKLLSKKNGGKGSALKEGMRKSTGDIIIIQDADLEYDPNDYSKLIKPI 108
>UniRef50_Q64W33 Cluster: Glycosyltransferase; n=1; Bacteroides
fragilis|Rep: Glycosyltransferase - Bacteroides fragilis
Length = 333
Score = 41.5 bits (93), Expect = 0.059
Identities = 40/137 (29%), Positives = 68/137 (49%), Gaps = 13/137 (9%)
Query: 109 VSVVVPFHN--EHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
VS+VVP +N EH + R SV+ +S ++ E+ LV+D ST D L + L+
Sbjct: 9 VSIVVPVYNAAEH---IERCVESVIDQSYDNW--ELILVNDGSTDDSL--VVCYALQQRD 61
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIALDYRTVMC 226
++ V+ G +AR G +RAT L F+DS ++L ++E D +++
Sbjct: 62 KRITVVD-KTNGGASSARNVGIDRATGHFLCFIDSDDYVGSHYLSDMVEAYIPDSFSLVI 120
Query: 227 PFIDVIA---FDTFEYR 240
+ ++ F TF Y+
Sbjct: 121 TGMKLLCNGQFQTFNYQ 137
>UniRef50_Q47P42 Cluster: Probable glycosyltransferase; n=1;
Thermobifida fusca YX|Rep: Probable glycosyltransferase
- Thermobifida fusca (strain YX)
Length = 281
Score = 41.5 bits (93), Expect = 0.059
Identities = 37/110 (33%), Positives = 54/110 (49%), Gaps = 8/110 (7%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
+SVVVP +NE ++ + L + EI +VD+ ST D + + Y E
Sbjct: 3 ISVVVPAYNEE--AVIERCLNALVNQIVP-VDEILVVDNNST-DRTAEIAEKYREYG--- 55
Query: 169 VKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIA 218
V+VIR + GLI AR G + AT D+L +D+ T +W L E A
Sbjct: 56 VRVIRETVQ-GLIPARNRGFDEATGDILGRIDADTVVEPDWSQRLAEAFA 104
>UniRef50_A6Q1D6 Cluster: Glucosaminyltransferase; n=1;
Nitratiruptor sp. SB155-2|Rep: Glucosaminyltransferase -
Nitratiruptor sp. (strain SB155-2)
Length = 438
Score = 41.5 bits (93), Expect = 0.059
Identities = 31/102 (30%), Positives = 54/102 (52%), Gaps = 4/102 (3%)
Query: 107 PSVSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNM 166
P VSV+VP +NE T+ + S+L ++ +L EI +VDD S+ + KA
Sbjct: 84 PKVSVIVPAYNEA-KTIATSISSLLTQNYPNL--EIIVVDDGSSDETYFKAKQFEHNEFC 140
Query: 167 PKVKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVN 208
+++V R K G A G ER+T +++ +D+ ++ + N
Sbjct: 141 KEIRVFR-KKNEGKSKAINYGIERSTGELIFVMDADSKISQN 181
>UniRef50_A3QCT5 Cluster: Glycosyl transferase, family 2; n=1;
Shewanella loihica PV-4|Rep: Glycosyl transferase,
family 2 - Shewanella loihica (strain BAA-1088 / PV-4)
Length = 269
Score = 41.5 bits (93), Expect = 0.059
Identities = 33/110 (30%), Positives = 55/110 (50%), Gaps = 6/110 (5%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
++VV+P +N T+ + SV+ P + EI +VDD + F ++ +
Sbjct: 5 ITVVIPTYNRRRKTI-QAIESVISSKPG--LVEIIVVDDCGSIPFEYPSVRNNCG---VA 58
Query: 169 VKVIRLPKRSGLITARLAGAERATADVLIFLDSHTEANVNWLPPLLEPIA 218
V+V+ L SG AR+ G + AT++ + FLDS + W+ LL IA
Sbjct: 59 VRVLSLDVNSGPGIARMLGVKHATSNYIAFLDSDDVYSPAWVDYLLTKIA 108
>UniRef50_A3D6K1 Cluster: Glycosyl transferase, family 2; n=1;
Shewanella baltica OS155|Rep: Glycosyl transferase,
family 2 - Shewanella baltica OS155
Length = 271
Score = 41.5 bits (93), Expect = 0.059
Identities = 31/94 (32%), Positives = 50/94 (53%), Gaps = 5/94 (5%)
Query: 109 VSVVVPFHNEHWSTLLRTAYSVLYRSPEHLIKEIFLVDDASTKDFLGKALDDYLEGNMPK 168
+SV++P N T+ + SV + LI EI +++D ST D ++ +E N P
Sbjct: 3 ISVIIPAFNAE-KTIEESINSVFLQDMNELI-EIIVINDGST-DLTVNVVNKIIEQN-PL 58
Query: 169 VKVIRLPKRS-GLITARLAGAERATADVLIFLDS 201
K+I L + + G+ TAR G +A + + FLDS
Sbjct: 59 SKIIILNQNNQGVATARNNGVNKAIGEYIAFLDS 92
Database: uniref50
Posted date: Oct 5, 2007 11:19 AM
Number of letters in database: 575,637,011
Number of sequences in database: 1,657,284
Lambda K H
0.321 0.138 0.448
Gapped
Lambda K H
0.279 0.0580 0.190
Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 721,741,991
Number of Sequences: 1657284
Number of extensions: 32488061
Number of successful extensions: 59918
Number of sequences better than 10.0: 500
Number of HSP's better than 10.0 without gapping: 160
Number of HSP's successfully gapped in prelim test: 368
Number of HSP's that attempted gapping in prelim test: 59224
Number of HSP's gapped (non-prelim): 664
length of query: 575
length of database: 575,637,011
effective HSP length: 105
effective length of query: 470
effective length of database: 401,622,191
effective search space: 188762429770
effective search space used: 188762429770
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.8 bits)
S2: 75 (34.3 bits)
- SilkBase 1999-2023 -